The present invention relates to the technical field of single cell sequencing, particularly to an ultra-high-throughput single cell sequencing method.
Since the introduction of single cell sequencing technology by Fuchou Tang in 2009, single cell high-throughput sequencing platforms have sprung up, such as microfluidics-based Drop-seq and inDrop-seq platforms, microwell plate-based Microwell-seq and Seq-well platforms as well as the common commercial platform 10×Genomics. However, taking microwell plate as an example, when cell input is up to a certain level, it is easy to have two or even more cells captured in one microwell, resulting in contamination of transcripts between cells. To avoid this phenomenon, it was found by analysis that each bead capturing only one cell can be achieved when the capture rate of cells in the microwell plate is approximately 1/10 of the total number of wells in the microwell plate. However, this will result in a large number of droplets or microwells having empty microbeads only but without cells in them, which greatly reduces experimental efficiency while increasing the cost, leading to cell throughput per run on these platforms in the order of magnitude of tens of thousands only. However, the total number of cells in a single individual of a species, taking mice and humans as examples, exceeds trillions. Therefore, the current platform throughput is far from meeting sequencing requirements. Insufficient sequencing throughput tends to result in loss of a large amount of biological information, so it is especially important to improve the throughput per sequencing run.
Recent years have seen a continuous development of ultra-high-throughput technologies using individual cells as independent reaction systems, such as sci-RNA-seq and SPLiT-seq, these technologies including: firstly fixing and permeabilizing fresh cells and incorporating a round of unique molecular label into each cell by reverse transcription, followed by incorporating different molecular labels to the cells through split-pool, such that the transcript in each cell is finally incorporated with a unique molecular label through multiple combinations of molecular labels, thus greatly improving the throughput per run. However, since these methods are based on intracellular ligation reactions, there are problems such as low reaction efficiency and high contamination rate because of easy leakage of transcripts between cells, which reduce the practicality of the methods.
Paul et al. (Datlinger, P., et al., Ultra-high throughput single-cell RNA sequencing by combinatorial fluidic indexing. BioRxiv, 2019.) from CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences increased the throughput of single-cell sequencing up to 15-fold in December 2019 by a method of introducing a round of 96/384 types of molecular labels to fixed cells with reverse transcription followed by combinatorial microfluidics. However, cells are not sequenced in a parallel way on microfluidics-based sequencing platforms, leading to more significant batch effect with abnormal UMI/Gene ratios, and moreover, there are disadvantages including expensive equipment, difficulty to carry, and high sequencing cost.
In a genome, most of the chromatin is tightly wound in the nucleus and is transcriptionally inactive. Chromatin state is dynamically regulated in a cell type-specific manner, with some of the dense chromatin becoming loose in a specific cell state, these loose chromatins being referred to as open chromatin or accessible chromatin. Assay for chromatin accessibility in cells can give information on transcription regulation in cells, such as where transcription factors can bind to gene promoters and which genes of the cells may be efficiently transcribed. The commonly used assays are ATAC-seq, DNase-seq, MNase-seq, FAIRE-seq, ChIP-seq, etc. These methods include fragmenting and tagmenting open chromatin regions based on different principles, among which ATAC-seq (assay for transposase-accessible chromatin with high-throughput sequencing) employs a modified Tn5 transposase that randomly inserts a specific DNA sequence as a transposon into an open chromatin region, and can capture the entire open-region sequence directly in an intact manner, so ATAC-seq is currently widely used in open chromosome sequencing.
The present invention provides an ultra-high-throughput single cell sequencing method, which can obtain specific transcriptome information of millions of single cells per run.
First, the present invention provides an ultra-high-throughput single cell sequencing method, comprising the following steps:
The sequencing method is a transcriptome sequencing method. By dividing the cell barcode sequence into a first cell barcode sequence and a second cell barcode sequence, and by introducing the second cell barcode sequence in the form of a reverse transcription sequence int o the reverse transcription sequence of each mRNA during intracellular reverse transcription, a plurality of cells can be distinguished based on the second cell barcode sequence, in the event that a molecular labeled microbead binds to a plurality of cells; otherwise, it would be impossible to distinguish sequences derived from a plurality of cells that bind to the same molecular labeled microbead by relying solely on the first cell barcode sequence on the molecular labeled microbead. In the prior art, in order to avoid one molecular labeled microbead binding to a plurality of cells, strict condition control is required, for example, the microwells on the microwell plate for the experiment should be prepared as well as possible, such that the size of the microwells is for accommodating only one molecular labeled microbead and one cell (In this case, the relative size of the molecular labeled microbeads and the cells should not be too large; otherwise, it will be difficult to achieve one molecular labeled microbead binding to one cell); and meanwhile, it is also necessary to control the cell capture rate at a relatively low level so that the cells are well dispersed. However, it is still impossible to avoid one molecular labeled microbead binding to a plurality of cells, in which case the sequences will be wrongly determined as originating from a same cell according to the final sequencing result.
Meanwhile, the combination of the second cell barcode sequence and the first cell barcode sequence to form a cell barcode sequence increases the number of combinations of cell barcode sequences, thus enabling assay for cells of larger quantities in one run.
The present invention further provides an ultra-high-throughput single cell sequencing method, comprising the following steps:
The sequencing method is an assay for transposase-accessible chromatin sequencing method. By dividing the cell barcode sequence into a first cell barcode sequence and a second cell barcode sequence, and by incorporating the second cell barcode sequence into the corresponding sequence of a genome during transposition reaction with Tn5 transposase, a plurality of cells can be distinguished based on the second cell barcode sequence in the event that a molecular labeled microbead binds to a plurality of cells; otherwise, it would be impossible to distinguish sequences derived from a plurality of cells that bind to the same molecular labeled microbead by relying solely on the first cell barcode sequence on the molecular labeled microbead.
Preferably, the microbead is coupled to the molecular label sequence in such a way comprising: replacing the hydroxyl group with an amine group at the C6 position of the nucleotide at the 5′ end of the molecular label sequence, having a surface of the microbead modified with a carboxyl group, and coupling through condensation of the amino group and the carboxyl group. Since the molecular label sequence is a single-stranded oligonucleotide, the hydroxyl group on the first nucleotide at its 5′ end being replaced with an amine group, and the surface of the microbead is modified with a carboxyl group, the molecular label sequence is coupled to the microbead through the reaction of the amino group and the carboxyl group.
Preferably, at least a part of the molecular label sequence is a random sequence that is randomly synthesized.
Preferably, the first cell barcode sequence comprises a plurality of specific fragments, and the second cell barcode sequence comprises at least one specific fragment, the specific fragments at different locations being selected from the same or different libraries of specific fragments, and the first cell barcode sequence and the second cell barcode sequence identifying cells by using different combinations and arrangements of the specific fragments.
Further preferably, the preparation method of the molecular labeled microbead comprises the following step:
Preferably, the molecular label sequence is:
′-TTTAGGGATAACAGGGTAATAAGCAGTGGTATCAACGCAGAGTACGTNNNNNNCG ACTCACTACAGGGNNNNNNTCGGTGACACGATCGNNNNNNTCGTCGGCAGCGTC-3′ (SEQ ID No. 4), wherein, N represents any one of A/T/C/G and is randomly synthesized. The reverse transcription sequence is:
5′-[phos]ACACTCTTTCCCTACACGACGNNNNNNnnnnnnnnnnTTTTTTTTTTTTTTTTTTT TTTTTTVN-3′(SEQ ID No. 6), wherein, the phosphorylation modification at the 5′ end provides a phosphate group for the ligation reaction; N represents any one of A/T/C/G and is randomly synthesized; n represents any one of A/T/C/G and is randomly synthesized; and V at the 3′ end represents any one of A/C/G with V being randomly synthesized. The specific molecular barcode sequence is:
5′-ACACTCTTTCCCTACACGACGNNNNNNNNNNAGATGTGTATAAGAGACAG-3′(SEQ ID No. 11), wherein, N represents any one of A/T/C/G and is randomly synthesized; the complementary Mosaic Ends sequence to form a double strand is:
5′-CTGTCTCTTATACACATCT-3′ (SEQ ID No. 12). The bridge primer is:
5′-CGTCGTGTAGGGAAAGAGTGTGACGCTGCCGACGA[ddC]-3′(SEQ ID No. 5), wherein ddC is dideoxycytidine modification. Alternatively, the ddC at the 3′ end can be removed. In the reverse transcription sequence, the 6×N random sequence is used as the second cell barcode sequence, and the 10×n random sequence is used as the molecular barcode sequence.
For ATAC-seq (assay for transposase-accessible chromatin with high-throughput sequencing), one transposase complex of the specific molecular barcode transposase-embedded complex carries two gene fragments, both gene fragments being the specific molecular barcode sequence; alternatively, one gene fragment being the specific molecular barcode sequence and the other gene fragment being a universal sequence, the universal sequence comprising:
Preferably, the cell sample to be sequenced comprises 2 or more types of cells. The ultra-high-throughput single cell sequencing method of the present application can realize parallel sequencing of multiple types of cells.
Preferably, the microbead body is a magnetic bead; intracellularly reverse transcribed cells (reverse transcription sequencing) or transposed nuclei (ATAC-seq) are added to a microwell plate, followed by the addition of the molecular labeled microbead, the microwell in the microwell plate having a diameter big enough to just accommodate one molecular labeled microbead and one or more cells or nuclei; the capture rate of cells or nuclei in the microwell plate is maintained at over 80%; and the capture rate of the molecular labeled microbead in the microwell plate is over 99%.
The ultra-high-throughput single cell sequencing method of the present application can achieve one molecular labeled microbead binding a plurality of cells, therefore, in the case of the microbead body being a magnetic bead and in the case of using the microwell plate method, the capture rate of cells in microwell plates can be greatly improved.
Besides of being used for microwell plate-based single cell sequencing with the microbead body being a magnetic bead, the present invention may also be used for microfluidics-based single cell sequencing platforms.
Further preferably, the microwell depth in the microwell plate is 30-160 m and the microwell diameter is 20-150 m; and the diameter of the microbead body is 20-145 m.
Further preferably, the preparation method of the microwell plate is: (1) etching microwells on a silicon wafer as an initial mold; (2) pouring polydimethylsiloxane over the initial mold and removing the polydimethylsiloxane after molding to produce a second mold with microwells; and (3) pouring molten agarose with a mass to volume ratio of 4% to 6% on the second mold, cooling and molding followed by removing the agarose to obtain the microwell plate.
The ultra-high-throughput single cell sequencing method of the present invention can be used for ultra-high-throughput sequencing of millions of single cells, with the number of cells per run up to millions of single cells, thereby greatly improving the throughput of single cell sequencing.
The size of a microwell plate was designed according to the experimental scale (e.g., in the case of 500,000 each of human 293T cells and mouse 3T3 cells, the well plate size was 1.8 cm×1.8 cm), and microwells were etched on a silicon wafer as the initial mold, the microwells being cylindrical, wherein the microwell depth was 60 m, the microwell diameter was 50 m, and the well spacing was 70 m. Next, polydimethylsiloxane (PDMS) was poured on the silicon wafer, and the PDMS was removed after molding to produce a second mold with micropillars on the plate. The finished microwell plate used in the experiment was prepared by pouring molten agarose (prepared with enzyme-free water) with a concentration of 5% (mass ratio) into the PDMS micropillar plate, cooling and solidifying, the agarose-plate casting being peeled off to produce a microwell plate of a certain thickness (
The magnetic beads were purchased from Suzhou Knowledge & Benefit Sphere Tech. Co., Ltd (cat #MagCOOH-20190725), which are coated with carboxyl groups on the surface and have a diameter of 45 m. The preparation process of the molecular labeled magnetic beads is as shown in
Wherein, 6×N was the core sequence of the cell barcode sequence, this core sequence corresponding to each magnetic bead being different, and the 6×N sequences in the three sequence segments corresponding to a same magnetic bead being different as well. Since there are four options from A/T/C/G for each site, there were 46 options for the 6×N sequence. N represents any one of A/T/C/G and was randomly synthesized.
Once completed, the molecular label sequence was as follows: 5′-TTTAGGGATAAC AGGGTAATAAGCAGTGGTATCAACGCAGAGTACGTNNNNNNCGACTCACTACAGGGN NNNNNTCGGTGACACGATCGNNNNNNTCGTCGGCAGCGTC-3′(SEQ ID No. 4), and could be used for ultra-high-throughput single cell sequencing.
Specific transcriptome ultra-high-throughput single cell sequencing.
Mouse embryonic stem cells (ESC) 3T3 and human embryonic kidney cells (293T), 5 million each, were fixed at −20° C. for 30 min separately by slowly dripping 5-10 ml of methanol (pre-chilled at −20° C.) into the cells. Meanwhile, the bridge primers were dispensed into an 8-tube stripe, 6.5 μl per tube, and then dispensed into a 96-well plate containing 0.5 μl of the reverse transcription primer to mix well and stand still such that there were a total of 1 μl of mixed primers for reverse transcription per well. The sequence of the bridge primer was 5′-CGTCGTGTAGGGAAAGAGTGTGACGCTGCCGACGA[ddC]-3′(SEQ ID No. 5), the ddC modification being added to the 3′ end to prevent the production of byproducts resulted from extension of the bridge primer during reverse transcription.
There were 96 types of reverse transcription primers (reverse transcription sequences), the same as the above-mentioned cell barcode sequence, with the core sequence of 6×N, and each type of primer was arranged independently in each well. The 6×N random sequence could be used as a part of the cell barcode sequence, and subsequently used in conjunction with the cell barcode sequence on the molecular labeled magnetic beads in Embodiment 1 to identify the cells from which the mRNA corresponding to each sequence in the sequencing library constructed subsequently is derived, thus there being a total of 96×96×96×96 types of single-stranded oligonucleotides for identifying cells, sufficient to be used for millions of cells per run. Phosphorylation modification was added to the 5′ end of the molecular barcode sequence of the last segment of the reverse transcription primer to provide phosphate groups for the ligation reaction, wherein n of 10×n represents any one of A/T/C/G and was randomly synthesized, V at the 3′ end denotes any one of A/C/G, and N denotes any one of A/T/C/G and was randomly synthesized, with the main purpose of having the primer bind to the end of the polyA tail while avoiding the binding of the primer to the middle part of the polyA tail. The specific sequence of the reverse transcription primer was as follows: 5′-[phos]ACACTCTTTCCCTACACACGACGNNNNNNNnnnnnnnnnnnnnnnnnnTTTTTTTTTTTTTTT TTTTTTTTTTTTVN-3′(SEQ ID No. 6). 310 μl of a reverse transcription system (50 μl of dNTP, 200 μl of buffer, 50 μl of reverse transcriptase and 10 μl of RNAase inhibitor) was prepared, mixed well, and dispensed into a 96-well plate containing mixed primers for reverse transcription, 3.1 μl per well. Next the two types of fixed cells were washed once by centrifugation at 500 g, and 2.5 million cells each were mixed. The mixed cells (approximately 50,000 cells per well) were dispensed evenly into a 96-well plate containing a pre-mixed reverse transcription system (6 μl of cell suspension, 0.5 μl of dNTP, 1 μl of mixed primers for reverse transcription, 2 μl of buffer, 0.5 μl of reverse transcriptase and 0.1 μl of RNAase inhibitor) to react at 42° C. for 1.5 hours. After reverse transcription, the cells in the 96-well plate were first collected into an 8-tube stripe using a multichannel pipette, then collectively transferred to a clean 1.5 ml EP tube, washed once with DPBS solution (Gibco, Cat #14190-144), and centrifuged at 500 g for 5 min. After aspiration of the supernatant, the cells were resuspended in 500 μl of DPBS solution, and then the cell suspension was dripped into a microwell plate such that greater than 80% of the microwells were occupied by cells (
After completion of the ligation reaction, the molecular labeled magnetic beads were washed three times on a magnetic rack, the supernatant being aspirated and discarded, and 200 μl of exonuclease EXON I mix (EXON I buffer 1× and EXON I 1×) was added to react at 37° C. for 0.5 hour to remove from the magnetic beads the oligonucleotides which did not capture mRNA. After the exonucleolytic cleavage, the molecular labeled magnetic beads were washed three times on a magnetic rack, the supernatant being aspirated and discarded, and 500 μl of 0.1% NaOH solution was added for 5 min treatment to obtain single-stranded cDNA for the subsequent second strand synthesis reaction. The NaOH-treated molecular labeled magnetic beads were washed three times on a magnetic rack to remove residual NaOH, and then 100 μl of a second strand synthesis mix (20 μl of reverse transcription buffer, 40 μl of 30% PEG8000, 10 μl of 10 mM dNTP, 10 μl of 100 μM random primer, 2.5 μl of Klenow polymerase, and 17.5 μl of ddH2O) was added to react at 37° C. for 1 hour, wherein the random primer sequence was 5′-AAGCAGTGGTATCAACGCAGAGTGANNNGGNNNB-3′(SEQ ID No. 7), where B represents one of G/T/C, and N represents any one of A/T/C/G and was randomly synthesized.
The random primer sequence would bind randomly to the single-stranded sequence of NaOH-treated magnetic beads, and the polymerase continued to synthesize the complementary strand in the 5′ to 3′ direction of this primer, wherein NNNGGNNNB was a randomly combined sequence.
After the second strand synthesis reaction, the molecular labeled magnetic beads were washed three times on a magnetic rack, the supernatant being aspirated and discarded, and a PCR reagent mix (KAPA HiFi Hot Start Ready Mix 1× and TSO-PCR primer 0.1 μM) was added.
Where, the TSO-PCR primer sequence was: 5′-AAGCAGTGGTATCAACGCAGAGT-3′(SEQ ID No. 8). The PCR program was as follows: 98° C. pre-denaturation for 3 min; 98° C. denaturation for 20 sec, 67° C. annealing for 15 sec and 72° C. extension for 6 min, which were repeated for 12 cycles; 72° C. extension for 5 min and 4° C. hold, such that a large amount of labeled cDNA was obtained. Vazyme DNA clean beads were used to purify the PCR products. The DNA clean beads were shaken to mix well and placed at room temperature for at least 30 min prior to use. The purification procedure was as follows:
The gene sequencing library was constructed using the following cDNA sequencing library construction method, and the constructed gene sequencing library was sent to Hangzhou Repugene Technology Co., Ltd. for sequencing. A gene expression profile was obtained by demultiplexing, screening and comparison of the returned sequencing data. The matrix file was imported for R language analysis such that the matrix data could be converted for visualization. As can be seen from
2. Construction of cDNA Sequencing Library.
The kit TruePrep™ Index Kit V2 for Illumina® (Vazyme #TD202) was used, and the kit manual was referred to for the detailed procedure.
Where, the “×” was calculated from the volume of PCR products, for example, “0.60×” indicates 0.60×50 μl=30.0 μl.
The sequencing library can be stored at −20° C.
As in Embodiment 2, after loading cells carrying molecular labels on the microwell plate, excess molecular labeled magnetic beads were washed away with DPBS solution, and lysis buffer of three different formulations were added into three microwell plates, respectively, the 3 lysis buffers in turn including: control lysis buffer (0.1 M Tris-HCl pH 7.5, 0.5 M LiCl, 1% SDS, 10 mM EDTA and 5 mM dithiothreitol), lysis buffer 20 (ddH2O, 1% SDS, 20% formamide and 3×SSC) and lysis buffer 50 (ddH2O, 1% SDS, 50% formamide and 3×SSC), followed by incubating for 30 min for sufficient lysis. The steps thereafter were the same as in Embodiments 2. Among the components, SDS played a major role in lysis, formamide facilitated hybridization of nucleic acid molecules, and SSC assisted in enhancing hybridization efficiency. The constructed gene sequencing library was sent to Hangzhou Repugene Technology Co., Ltd. for sequencing with the HiSeq2500PE125 sequencing strategy. As shown by the analysis of the sequencing results, lysis buffer 50 significantly improved the number of cells with UMI of greater than 500 as well as the average number of gens (
Five million detached mouse cells were employed for library construction and obtaining a sequencing library according to the procedure in Embodiment 2. The constructed gene sequencing library was sent to Hangzhou Repugene Technology Co., Ltd. for sequencing with the HiSeq2500PE125 sequencing strategy. A gene expression profile was obtained by demultiplexing, screening and comparison of the returned sequencing data. The matrix file was imported into R for analysis such that the matrix data could be converted for visualization.
The present invention can also be used for microfluidics-based single cell sequencing platforms. After obtaining fixed cells carrying one round of cell barcode by using the method of Embodiment 2, Chromium Chip E by 10× Genomics (10×Genomics #2000121) was used for reactions. Inlet 1 was injected with 75 μl of fixed cells mixed with the ligation system (10 μl of cells, 50.5 μl of enzyme-free water, 7.5 μl of T4 Ligation Buffer, 3 μl of T4 ligase, 1.5 μl of 10× Reducing Agent B and 2.5 μl of 100 mM Bridge Primer); inlet 2 was injected with 40 μl of single cell ATAC gel microbeads containing the Lysis Buffer (10×Genomics #2000132, the microbead sequence as described in Embodiment 1); and inlet 3 was injected with 240 μl of Partitioning Oil (10×Genomics #220088). Then the emulsion-coated microbeads were incubated at 37° C. for 1.5 hours to allow for a sufficient ligation reaction. After the reaction, the microbeads were collected and subjected to exonucleolytic cleavage, single-stranding and second strand synthesis reactions according to the protocol described in Embodiment 2, and finally the cDNA library was obtained by PCR amplification and purification.
Annealing of the labeled magnetic beads in Embodiment 1 and the bridge primer was executed. The magnetic beads and the bridge primer (bridge primer sequence:
5′-CGTCGTGTAGGGAAAGTGTGACGCTGCCGACGA-3′) (SEQ ID No. 9) were mixed thoroughly and subjected to gradient annealing to form sticky ends. Sequences which did not bind to the bridge primer were resected using the exonuclease EXO I. After the resection, the beads were washed once with 150 μl of TE-SDS and TE-TW, and finally resuspended with TE-TW and stored at 4° C. for use later.
The naked Tn5 transposase was purchased from Nanjing Vazyme Biotech Co. Ltd. The transposase and the embedment buffer were provided by the (Vazyme) Tn5 Transposome (S111) kit produced by Nanjing Vazyme Biotech Co. Ltd.
Wherein, the specific molecular barcode 10×N included in the specific barcode segment was the core sequence of the cell barcode sequence, this core sequence corresponding to each Tn5 complex being different, and since there were four options of A/T/C/G for each site, there were 410 options for the 10×N sequence. N represents any one of A/T/C/G and was randomly synthesized.
Two million each of mouse fibroblast cells (3T3) and human embryonic kidney cells (293T) were mixed thoroughly and washed once with PBS, then transferred to an 1.5 ml EP tube and subsequently resuspended with 1 ml of the lysis buffer (ddH2O, 10 mM Tris-HCl pH 7.4, 10 mM NaCl, 3 mM MgCl2 1 % BSA, 0.1 % Tween-20, 0.1 % IGEPAL CA-630, 0.01 % digitonin) and lysed on ice for 3 min. The lysed sample was wash twice with the RSBT buffer (ddH2O, 10 mM Tris-HCl pH 7.4, 10 mM NaCl, 3 mM MgCl2, 1 % BSA and 0.1 % Tween-20). The nuclei were fixed with 1% formaldehyde at room temperature for 10 min. The fixation and crosslinking was terminated with glycine. The crytopreservation buffer (ddH2O, 50 mM Tris-HCl pH 8.0, 25% glycerol, 5 mM Mg(OAc)2 and 0.1 mM EDTA) was prepared. The nuclei were resuspended with 975 μl of the crytopreservation buffer, 5 μl of 5 mM DTT and 20 μl of 50×protease inhibitor cocktail, following which the nuclei can be crytopreserved at −80° C. Alternatively, the nuclei could be suspended in 1 ml of the RSBT buffer, filtered and counted for use later. For resuscitation of the crytopreserved sample, the sample was taken out and put in an oven to thaw at 37° C. for 2 min, centrifuged at 500 g/5 min with the supernatant being discarded, resuspended in 200 μl of the lysis buffer, and put on ice for 3 min. And the sample was washed with RSBT, filtered and counted for use later.
The prepared nuclei were resuspended in 1 ml of RSBT, filtered and counted. The 2×TD buffer (ddH2O, 20 mM Tris-HCl pH7.6, 10 mM MgCl2, 20% dimethylformamide) was prepared. The tagmentation buffer (22.5 μl per well, comprising 12.5 μl of 2×TD buffer, 9.5 μl of 1×DPBS, 0.25 μl of 1% digitonin, and 0.25 μl of 10% Tween-20) was prepared. The nuclei were resuspended in the tagmentation buffer, and dispensed into a 96-well plate with approximately 10,000 nuclei per well. The transposase-embedded complex of Embodiment 3 was added into the 96-well plate containing the nuclei at 2.5 μl per well, and placed at 55° C. to react for 30 min. After the reaction, 25 μl of the 2×Terminate Solution (25 ml of 40 mM EDTA and 3.9 μl of 6.4 M Spermidine) was added to each well and left to stand still at 37° C. for 15 min. The liquid was collected and mixed in a 15 ml centrifuge tube, spun down at 500g/5 min with the supernatant being discarded, and washed once with 1 ml of RSBT. The nuclei were counted and dispensed into a 1.5 ml EP tube at 500,000 nuclei per tube, and resuspended in 20 μl of RSBT, followed by addition of 30 μl of PNK reaction solution (5 μl of 10×PNK buffer, 5 μl of 10 mM ATP, 10 μl of ddH2O and 10 μl of PNK enzyme) and incubation at 37° C. for 30 min, and then centrifuged when the reaction was completed, the supernatant being discarded. The sample was washed twice with RSBT and then put on ice for use later.
The Tn5-labeled nuclei were employed. The nuclei suspension was dripped into a microwell plate, and briefly centrifuged such that 80-90% of the nuclei were loaded in the microwells, with each well capturing 0-10 nuclei. Cellular DNA fragments within the same well were distinguished by the Cell Barcode 4 on Tn5. The bridge primer and the labeled magnetic beads were annealed such that the sticky ends were formed on fragments on the labeled magnetic beads. 200,000 molecular labeled magnetic beads were added to the nuclei-captured microwell plate, and the plate was put on a magnet and mixed well gently so that the magnetic beads covered more than 99% of the microwells. The excess molecular labeled magnetic beads were washed away with RSBT solution. 200 μl of the lysis buffer (100 μl of 10% SDS, 40 μl of proteinase K, 100 μl of 10×T4 buffer, 200 μl of 50% PEG 8000 (mass volume ratio) and 560 μl of 10 mM Tris-HCl pH 8.0) was dripped slowly into the microwell plate covered with magnetic beads, followed by lysis incubation at room temperature for 30 min to allow sufficient complementary hybridization of the bridge sequence on the magnetic beads and the bridge primers on the nuclei. After incubation, the microwell plate was inverted onto a magnet, and magnetic beads carrying molecular label-DNA complex were collected, transferred to a 1.5 ml EP tube and washed twice with 6×SSC, and then washed once again with 50 mM Tris pH8.0.; 50 μl of ligation mix (2 μl of T4 ligase, 5 μl of T4 buffer, 2 μl of dNTP, 10 μl of 50% PEG8000 (mass volume ratio) and 31 μl of ddH2O) was added to the EP tube containing the molecular labeled magnetic beads, and reacted at 25° C. for 1.5 h. After the ligation reaction, the molecular labeled magnetic beads were washed once with TE-SDS, TE-TW and 10 mM Tris pH8.0, respectively, on a magnetic rack, and then the magnetic beads were suspended in 100 μl of an extension system (20 μl of 5×RT buffer, 10 μl of dNTP, 2.5 μl of Klenow polymerase, 20 μl of 50% PEG (mass volume ratio) and 47.5 μl of ddH2O), and reacted at 37° C. for 1 hour. After the reaction, the molecular labeled magnetic beads were washed once with TE-SDS, TE-TW and 10 mM Tris pH8.0, respectively on a magnetic rack. The magnetic beads were suspended in 500 μl of 0.1 M NaOH and incubated at room temperature for 5 min for single stranding. Then the beads were washed twice with TE-TW and washed once with 10 mM Tris pH8.0. The target segment was enriched from the magnetic beads with the primers index P5 and index P7, and the PCR products were purified with Vazyme DNA clean beads to obtain products of 300-500 bp.
The detailed workflow of the above reactions is shown in
The gene sequencing library was constructed using the method above, and the constructed gene sequencing library was sequenced with an MGI/Illumina Next Gen Sequencing instrument.
A cell by peak matrix was obtained by demultiplexing, screening and comparison of the returned sequencing data. The matrix file was imported into R for analysis such that the matrix data could be converted for visualization. Figures. 12-17 indicated a very small amount of doublet contamination (
The sequences of primers for library construction are shown in Table 3 hereinafter. Where, N represents any one of A/T/C/G and was randomly synthesized. S denotes a thio modification of the terminal base to improve the terminal base stability.
The assay for transposase-accessible chromatin ultra-high-throughput single cell sequencing of the present invention can also be used for microfluidics-based single cell sequencing platforms. After obtaining fixed cells carrying one round of cell barcode by using the method of Embodiment 6, Chromium Chip E by 10×Genomics (10×Genomics #2000121) was used for reactions. Inlet 1 was injected with 75 μl of fixed cells mixed with the ligation system (10 μl of cells, 50.5 μl of enzyme-free water, 7.5 μl of T4 ligation buffer, 3 μl of T4 ligase, 1.5 μl of 10×Reducing Agent B and 2.5 μl of 100 mM bridge primer); inlet 2 was injected with 40 μl of single cell ATAC gel microbeads containing the lysis buffer (10×Genomics #2000132, microbead sequence as described in Embodiment 1); and inlet 3 was injected with 240 μl of Partitioning Oil (10×Genomics #220088). Next, the emulsion-coated microbeads were incubated at 37° C. for 1.5 hours to allow for a sufficient ligation reaction. After the reaction, the microbeads were collected and subjected to exonucleolytic cleavage, single stranding and second strand synthesis reactions according to the protocol described in Embodiment 7, and finally the gene sequencing library for sequencing and bioinformatic analysis was obtained by PCR amplification and purification. Sequencing and bioinformatic analysis of the gene sequencing library was carried out according to the workflow in Embodiment 7.
The assay for transposase-accessible chromatin ultra-high-throughput single cell sequencing of the present invention can also be used for Tn5 embedment method with both ends labeled.
Embedment was executed according to method of Embodiment 6 except that each well of the 96 wells comprised 1 embedment fragment, i.e. the specific barcode segment+Mosaic Ends segment specific between wells, and annealing of the specific barcode segment+Mosaic Ends segment produced a specific molecular barcode sequence, i.e., the Bridge Sequence 2-Cell Barcode 4-Mosaic Ends/Mosaic Ends in
Next, the nuclei were labeled according to the protocol in Embodiment 6, and the labeled nuclei were captured on a microwell plate, ligated and extended. 500 μl of 0.1M NaOH solution was added to treat the nuclei for 5 minutes for single stranding, then the P7 adapter-Mosaic Ends primer (i.e. the primer comprising the sequences of P7 adapter and Mosaic Ends fragments) was added for extension After completion of extension, single-stranded products in the supernatant were collected through high-temperature denaturation and the magnetic beads were removed. The gene sequencing library was obtained by PCR amplification and purification with primers index P5 and index P7 (
Number | Date | Country | Kind |
---|---|---|---|
202110517750.6 | May 2021 | CN | national |
202110911428.1 | Aug 2021 | CN | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CN2021/119166 | 9/17/2021 | WO |