The Sequence Listing associated with this application is provided in XML format in lieu of a paper copy and is hereby incorporated by reference into the specification. The name of the XML file containing the Sequence Listing is P01221628PUS_Substitute Sequence Listing.xml. The text file is 9,568 bytes, was created on Apr. 14, 2023, and is being submitted electronically via Patent Center.
The present disclosure relates to the technical filed of molecular biology high-throughput sequencing, and specifically, to a high-compatibility PCR-free library construction method and a high-compatibility PCR-free sequencing method, which are applicable to samples whose DNA is subjected to library construction without PCR amplification.
Next-generation sequencing (NGS) is the most common technique used in modern molecular biology high-throughput sequencing research. NGS workflows for DNA mainly include two steps: library construction and on-machine sequencing. In the library construction step, randomly interrupted genomic fragments are amplified by standard PCR. However, for some special templates, the presence of factors such as complex secondary structures or poor thermostability may cause PCR amplification bias of the templates. Therefore, not all sequences are equally represented in the PCR-amplified library. Especially for some templates with high GC content or high AT content, it is sometimes difficult to construct a library by the PCR method. There is no obvious difference between a PCR-free library and a conventional PCR library when they are sequenced, except that PCR is not required during construction of a PCR-free library. A PCR-free library can theoretically improve data read distribution and has more uniform sequence coverage. In the on-machine sequencing step of the NGS workflows, a signal to be detected is required to be amplified by PCR in most cases. Therefore, even if a PCR-free library is constructed, problems introduced by PCR during sequencing still cannot be avoided.
At present, most library construction kits of the self-developed platforms are based on PCR construction. However, PCR library construction has the disadvantages of low coverage, GC bias, and low InDel detection accuracy and sensitivity. In addition, the PCR library construction requires a long period of time, has high requirements for fully automated instruments and laboratory, and costs a lot in library construction, labor, and depreciation. Illumina's Truseq DNA PCR-free sample preparation kit can theoretically construct a library within one day and is compatible with ever-increasing reads in the Illumina sequencing platform. However, this kit is only compatible with a library construction based on physical interruption and an input of 1 to 2 μg. Thus, this kit is mainly used for human resequencing, but it is incompatible with digestion interruption and FFPE and is also incompatible with samples of a small input such as cfDNA, thereby having a narrow application range and low flexibility. In addition, the Illumina platform adopts bridge PCR for library amplification. That is, a PCR-free constructed library is still amplified by PCR before sequencing. Therefore, the Illumina platform is not of a pure PCR-free library construction and sequencing. The available PCR-free kits have the following advantages and disadvantages as listed in Table 1.
Conventional PCR library construction requires a long period of time and high costs, and cannot avoid base pairing bias introduced by PCR, thereby leading to base pairing mistake, data bias, and repetitive sequences. Furthermore, a PCR library has poor performance in Indel calling compared to a PCR-free library. The existing PCR-free kits on the market are mainly manufactured by overseas companies. High requirements for inputs are an important factor to limit application of the PCR-free kits, and it is important to reduce inputs for library construction by improving library construction efficiency. The existing PCR-free kits require the lowest input of 50 ng for gDNA and 5 ng for cfDNA. Most of the library construction kits have poor compatibility, which is reflected in the following aspects: 1) some kits are only compatible with physical interruption; 2) some kits are only compatible with normal genomic DNA and incompatible with FFPE, cfDNA, and severely degraded DNA; and 3) some kits are not equipped with a matching digestion interruption kit.
In view of the deficiencies in the prior art, the present disclosure is to provide a new quick and efficient method for constructing a PCR-free sequencing library, and the method is applicable to the sequencing platform self-developed by MGI. In the method, a library suitable for sequencing can be constructed by ligating an adapter and directly performing single-strand cyclization without PCR amplification, thereby reducing base paring mistake, data bias, and repetitive sequences that may be introduced by PCR.
In a first aspect, the present disclosure provides a PCR-free high-throughput sequencing method.
The PCR-free high-throughput sequencing method claimed in the present disclosure is the following method A, method B or method C.
The method A may include the following steps:
The method B may include the following steps:
The method C may include the following steps:
In step (B1), or (C1), the fragmentation is performed by digesting the nucleic acid sample with fragmentmase.
Further, the fragmentmase may be, for example, NEBNext® Ultra™ II FS DNA Module, Qiagen5X WGS Fragmentation Mix, or a self-developed enzyme for interruption.
In the method A, the method B, and the method C, the adapter contains a barcode. Preferably, the adapter includes two barcodes. The adapter including two barcodes is ligated to both ends of the nucleic acid sample to form a library having dual barcodes.
Further, the adapter is formed by annealing two partially complementary single-stranded nucleic acids, and the two barcodes are located in a non-complementary region of the two single-stranded nucleic acids.
In step (A1), (B1), or (C1), the nucleic acid sample may be DNA or RNA.
Further, the DNA is genomic DNA, a naturally occurring small-molecule DNA, or an amplified DNA product.
The naturally occurring small-molecule DNA may be, for example, cfDNA. The amplified DNA product may be, for example, an MDA product, a cDNA product, or an amplicon.
A starting sample directly used in the PCR-free library construction method of the present disclosure may be a non-DNA sample such as a blood or saliva sample.
In step (A1), the fragmentation may be performed through physical interruption or digestion interruption.
The physical interruption may be, for example, ultrasonic interruption.
If the nucleic acid sample is larger than the DNA fragment of target size (e.g., genomic DNA), the nucleic acid sample is required to be interrupted, and the DNA fragment of target size is selected by a magnetic bead method. If the nucleic acid sample is not larger than the DNA fragment of target size (e.g., cfDNA, which is a small DNA fragment and has a concentrated main band), the nucleic acid sample is not required to be subjected to fragmentation.
If the nucleic acid sample is RNA, the RNA is subjected to reverse transcription to obtain DNA; and the fragmentation is performed on the RNA or the DNA obtained by the reverse transcription of the RNA.
In step (A1), (B1), or (C1), a size of the DNA fragment of target size may ranges from 150 bp to 800 bp, for example, 300 bp to 500 bp.
In step (A2), the end repair and the A-tailing reaction are performed in one step by mixing and reacting an end repair-A-tailing reaction solution with the product of step (A1), to obtain the product of step (A2).
The end repair-A-tailing reaction solution contains a T4 polynucleotide kinase buffer (T4 PNK buffer), adenylate deoxyribonucleic acids (dATP), a mixed deoxyribonucleic acid solution (dNTPs), T4 DNA polymerases, T4 polynucleotide kinases (T4 PNK), and Taq DNA polymerases (rTaq).
Further, in the end repair-A-tailing reaction solution, the adenylate deoxyribonucleic acids (dATP), the mixed deoxyribonucleic acid solution (dNTPs), the T4 DNA polymerases, the T4 PNK, the Taq DNA polymerases (rTaq) satisfy a ratio of 50 nmol:12.5 nmol (for each dNTP): 6 U: 10 U:(2 to 5) U.
In a specific example of the present disclosure, 10×T4 PNK buffer, an adenylate deoxyribonucleic acid solution (dATP) at a concentration of 100 mM, a mixed solution of 4 kinds of deoxyribonucleic acid (dNTPs) in which the concentration of each deoxyribonucleic acid is 25 mM, T4 DNA polymerases at a concentration of 3 U/μL, T4 PNK at a concentration of 10 U/μL, and Taq DNA polymerases (rTaq) at a concentration of 5 U/μL are mixed in a volume ratio of 5:0.5:0.5:2:1:(0.4-1), to prepare the end repair-A-tailing reaction solution.
In step (A2), the end repair-A-tailing reaction solution is mixed with the product of step (A1) in volume ratio of 1:4.
In step (A2), the end repair-A-tailing reaction solution and the product of step (A1), after being mixed, react under the following conditions: 1) at 14° C. for 15 min, at 37° C. for 25 min, and at 65° C. for 15 min; and kept at 4° C., the heated lid of the PCR instrument is set to 70° C.; or, 2) incubated at 37° C. for 10 min and at 72° C. for 15 min, and cooled to 4° C. at a rate of 0.1 s.
In step (B1), the fragmentation, the end repair, and the A-tailing reaction are performed in one step by mixing and reacting a fragmentation-end repair-A-tailing reaction solution with the nucleic acid sample, to obtain the product of step (B1).
The fragmentation-end repair-A-tailing reaction solution contains fragmentmase, a fragmentmase reaction buffer, adenylate deoxyribonucleic acids, a mixed deoxyribonucleic acid solution, T4 DNA polymerases, Taq DNA polymerases, and a TE buffer.
Further, in the fragmentation-end repair-A-tailing reaction solution, the adenylate deoxyribonucleic acids, the mixed deoxyribonucleic acid solution, the T4 DNA polymerases, and the Taq DNA polymerases may satisfy a ratio of 170 nmol:57.5 nmol:3 U:5 U. The amount of enzyme for interruption is determined according to the instruction or results of multiple experiments.
Further, in step (B1), the fragmentation-end repair-A-tailing reaction solution and the nucleic acid sample, after being mixed, may be incubated at 37° C. for 10 to 20 min and at 65° C. for 30 min; and the temperature of the mixture is cooled to 4° C. The heated lid of the PCR instrument is set to 70° C.
In step (C1), the fragmentation and the end repair are performed in one step by mixing and reacting a fragmentation-end repair reaction solution with the nucleic acid sample, to obtain the product of step (C1).
The f fragmentation-end repair reaction solution contains fragmentmase, a fragmentmase reaction buffer, a mixed deoxyribonucleic acid solution, DNA polymerase I, and MgCl2, and enzyme-free water.
Further, in the fragmentation-end repair reaction solution, the mixed deoxyribonucleic acid solution, the DNA polymerases I, and the MgCl2 may be satisfy a ratio of 75 nmol (for each dNTP):20 U:0.3 μmol. The amount of enzyme for interruption is determined according to the instruction or results of multiple experiments.
Further, in step (C1), the fragmentation-end repair reaction solution and the nucleic acid sample, after being mixed, may react at 37° C. for 30 min, and the temperature of the mixture is kept at 4° C. The heated lid of the PCR instrument is set to 70° C. After the reaction is completed, the sample is collected and placed on ice immediately, and TE is added to make up the total volume of the sample to 30 μL.
In step (C2), an A-tailing reaction solution for the A-tailing reaction performed on the product of (C1) contains a T4 PNK buffer, adenylate deoxyribonucleic acids (dATP), a mixed deoxyribonucleic acid solution (dNTPs), and Taq DNA polymerases (rTaq).
Further, in the A-tailing reaction solution, the adenylate deoxyribonucleic acids (dATP), the mixed deoxyribonucleic acid solution (dNTPs), and the Taq DNA polymerases (rTaq) satisfy a ratio of 50 nmol:8.75 nmol (for each dNTP): 1 U.
In a specific example of the present disclosure, 10×T4 PNK buffer, an adenylate deoxyribonucleic acid solution (dATP) at a concentration of 100 mM, a mixed solution of 4 kinds of deoxyribonucleic acid in which the concentration of each deoxyribonucleic acid is 25 mM, Taq DNA polymerases (rTaq) at a concentration of 5 U/μL, and enzyme-free water are mixed in volume ratio of 5:0.5:0.35:0.2:1, to prepare the A-tailing reaction solution.
In step (A3), (B2), or (C3), the adapter is formed by annealing a B strand and a T strand. A 3′-end of the B strand is complementary with a 5′-end of the T strand, and the remaining region of the B strand is non-complementary with the remaining region of the T stand. The 3′-end of the B strand has a protruding dT. The non-complementary region of the B strand and/or the non-complementary region of the T strand contain a barcode for identifying different samples. A 5′-end of the B strand and the 5′-end of the T strand are each modified with a phosphate group or ligated with a single-stranded oligonucleotide fragment having a U-base at 3′-end.
If the adapter contains a single-stranded oligonucleotide fragment having a U-base at 3′-end, the adapter is required to be subjected to USER enzyme treatment. The USER enzyme treatment and the ligation may be performed simultaneously, or the USER enzyme treatment may be performed after the ligation of the adapter and purification.
Further, the adapter may be any one of the adapter 1, adapter 2, adapter 3, and adapter 4.
The adapter 1 is formed by annealing a single-stranded DNA set forth in SEQ ID NO: 1 with a phosphate group-modified 5′-end and a single-stranded DNA set forth in SEQ ID NO: 2 with a phosphate group-modified 5′-end.
The adapter 1 is a Y adapter, and after the adapter is ligated, the product can be directly subjected to cyclization or simultaneously subjected to cyclization and rolling circle replication.
The adapter 2 is formed by annealing a single-stranded DNA set forth in SEQ ID NO: 3 or SEQ ID NO: 4 and a single-stranded DNA set forth in SEQ ID NO: 2.
The adapter 2 is a Bubble U-shaped adapter, and can be simultaneously subjected to ligation and USER enzyme treatment. The product can be directly and simultaneously subjected to cyclization or subjected to cyclization and rolling circle replication.
The adapter 3 is formed by annealing a single-stranded DNA set forth in SEQ ID NO: 5 and a single-stranded DNA set forth in SEQ ID NO: 6.
The adapter 3 is a dual-barcode U-shaped adapter, and can be simultaneously subjected to ligation and USER enzyme treatment. The product can be directly and simultaneously subjected to cyclization or subjected to cyclization and rolling circle replication.
The adapter 4 is formed by annealing a single-stranded DNA set forth in SEQ ID NO: 7 and a single-stranded DNA set forth in SEQ ID NO: 6.
The adapter 4 is a dual-barcode Y adapter sequence, and after the adapter is ligated, the product can be directly and simultaneously subjected to cyclization or subjected to cyclization and rolling circle replication.
AY portion of the two strands of the adapter have each a barcode, which is added to a library by ligation (see
After barcodes are ligated to both ends of a library, types of barcodes are greatly increased through the combinations of the barcodes at both ends, thereby achieving sequencing for many mixed libraries. As shown in
In addition, in the case that each barcode is used uniquely at each of the both ends, barcode skipping errors during library construction or sequencing can be greatly eliminated by use of the unique correspondence between the barcodes at both ends.
In each adapter sequence, the 5′-end is on the left, the 3′-end is on the left, “II” represents a modifying group, and “phos” represents phosphorylation. In the T strand of adapter, a sequence of 10 bases, i.e., NNNNNNNNNN, represents a barcode, and N may be A, T, C or G. The barcode is used to identify different samples. Samples having different barcodes can be used to construct a mixed library.
The adapter sequence is not limited to the above sequences. Even if the sequence is modified, a similar effect can be achieved, as long as the structure can be sequenced on BGI/MGI platform.
In addition, by modifying the adapter sequence, for example, by adding a corresponding barcode NNN (N may be A, T, C or G, and in specific examples, the length of N can be set according to experimental objectives) to the adapter sequence, unique molecular identifiers (UMIs) can also be added while ligating the adapter to the library. The UMI can be used as an identifier for each sample strand and mostly used for detection of low-frequency variants. The UMI is usually adjacent to a target nucleic acid or a barcode, and shares one sequencing primer with the target nucleic acid or the barcode during sequencing to achieve continuous sequencing. The UMI may be separated from the target nucleic acid and the barcode, and uses a self-developed sequencing primer during sequencing.
In step (A3), (B2), or (C3), the adapter is ligated to the product of step (A2), (B1), or (C2) by mixing and reacting the adapter and the product of step (A2), (B1), or (C2) with a ligation reaction solution, to obtain the product of step (A3), (B2), or (C3).
The ligation reaction solution contains a T4 PNK buffer, adenylate ribonucleic acids (ATP), PEG8000, T4 DNA ligases, and enzyme-free water.
Further, in the ligation reaction solution, the adenylate ribonucleic acids (ATP), the PEG8000, and the T4 DNA ligase satisfy a ratio of 0.8 μmol of adenylate ribonucleic acids: (10 to 16) μL of 50% PEG8000 (e.g., Rigaku's products): (1,200 to 3,000) U of T4 DNA ligases, for example, 0.8 μmol of adenylate ribonucleic acids: 16 μL of 50% PEG8000:3,000 U of T4 DNA ligase.
In a specific example of the present disclosure, 10×T4 PNK buffer, adenylate ribonucleic acids (ATP) at a concentration of 0.1 M, PEG8000 at a concentration of 50% (e.g., Rigaku's products), T4 DNA ligases at a concentration of 600 U/4, and enzyme-free water are mixed in volume ratio of 3:0.8:16:5:0.2, to prepare the ligation reaction solution.
In step (A3), (B2), or (C3), the adapter, the product of step (A2), (B1), or (C2), and the ligation reaction solution are mixed by mixing an adapter solution containing the adapter and the product of step (A2), (B1), or (C2) with the ligation reaction solution in a volume ratio of (1 to 5):50:(25 to 29); specifically, e.g. 5:50:25 or 1:50:29. A concentration of the adapter in the adapter solution is 6 μM or 1 μM.
In step (A3), (B2), or (C3), the adapter, the product of step (A2), (B1), or (C2), and the ligation reaction solution, after being mixed, may react at 25° C. for 10 to 30 min (e.g., 30 min); and the temperature of the mixture is kept at 4° C. The heated lid of the PCR instrument is set to 30° C.
In the present disclosure, the adapters compatible with DNBSEQ™ series sequencing platform and multiple high-throughput sequencing platforms self-developed by MGI, such as single-barcode adapters and dual-barcode adapters, are designed. The adapter sequence contains the barcode sequences, which can be used to identify multiple samples at the same time, and the samples with different barcode sequences can be used to construct a mixed library. The optimized adapter ligation system used in the present disclosure can still ensure high ligation efficiency despite a small input of adapters (e.g., a molar ratio of adapters to DNAs is 5:1 to 50:1), which advantageously avoiding adapter contamination caused by too many adapters and improving the efficiency of converting DNA templates into sequencing libraries with adapters at both ends.
In step (A4), (B3), or (C4), the product of step (A3), (B2), or (C3) is purified before the single-strand cyclization and the rolling circle replication.
Further, the purification is a magnetic bead purification (e.g., using XP magnetic beads or various indigenous magnetic beads for purification).
ATE buffer is added to the product of step (A3), (B2), or (C3), XP magnetic beads (Beckman Coulter) or various indigenous magnetic beads for purification are added to purify the product, and a collected product is dissolved in the TE buffer.
In step (A4), (B3), or (C4), specifically, said obtaining DNA nanoballs by performing the single-strand cyclization on the product of step (A3), (B2), or (C3) and the rolling circle replication includes any one of:
In step (a1), the single-strand cyclization may be performed by a manner I or manner II.
The manner I includes the following steps:
The single-strand cyclization reaction solution 1 contains a TA buffer, adenylate ribonucleic acids (ATP), mediation fragments, T4 DNA ligases, and enzyme-free water. The mediation fragments are each a single-stranded DNA, which has a 5′-end reversely complementary to a′-end of the B strand constituting the adapter, and a 3′-end reversely complementary to a 3′-end of the T strand constituting the adapter.
In the present disclosure, if the adapter is the adapter 1 or the adapter 2 described above, the mediation fragments have a nucleotide sequence set forth in SEQ ID NO: 8.
Further, in the single-strand cyclization reaction solution 1, the adenylate ribonucleic acids (ATP), the mediation fragments, and the T4 DNA ligase satisfy a ratio of 60 nmol:62.5 pmol:600 U.
In a specific example of the present disclosure, 10×TA buffer, adenylate ribonucleic acids (ATP) at a concentration of 100 mM, mediation fragments at a concentration of 25 μM, T4 DNA ligases at a concentration of 600 U/4, and enzyme-free water are mixed in volume ratio of 6:0.6:2.5:1:1.9, to prepare the single-strand cyclization reaction solution 1.
The single-strand cyclization reaction solution 1 is mixed with the incubation product in volume ratio of 48:12.
The single-strand cyclization reaction solution 1 and the incubation product, after being mixed, are incubated at 37° C. for 60 min; and the temperature of the mixture is kept at 4° C. The heated lid of the PCR instrument is set to 42° C.
The manner II includes the following steps: mixing the product of step (A3), (B2), or (C3) with the mediation fragments and a NaOH solution; incubating the mixture at the room temperature for 5 min; mixing the mixture with a Tris-HCL solution; and adding a single-strand cyclization reaction solution 2 for reaction to obtain a cyclization product.
The concentration of the NaOH solution is 2 M, and the concentration of the Tris-HCL solution is 1 M, and the pH is 6.8.
Further, the NaOH, the mediation fragments, and the Tris-HCL satisfy 5 μma 100 pmol: 5 μmol.
The single-strand cyclization reaction solution 2 contains a TA buffer, adenylate ribonucleic acids (ATP), and T4 DNA ligases.
In the present disclosure, in the single-strand cyclization reaction solution 2, a ratio of the adenylate ribonucleic acids (ATP) to the T4 DNA ligases is 60 nmol: 240 U.
In a specific example of the present disclosure, 10×TA buffer, adenylate ribonucleic acid (ATP) at a concentration of 100 mM, and T4 DNA ligases at a concentration of 600 U/4 are mixed in volume ratio of 6:0.6:0.4, to prepare the single-strand cyclization reaction solution 2.
In a specific example of the present disclosure, the product of step (A3), (B2), or (C3), the mediation fragments at a concentration of 20 μM, the NaOH solution at a concentration of 2 M, and the 1 M Tris-HCL to the single-strand cyclization reaction solution 2 satisfy a volume ratio of 48:5:2.5:5:7.
In the manner II, the reaction may occur at 37° C. for 30 min; and the temperature of the mixture is kept at 4° C. The heated lid of the PCR instrument is set to 42° C.
In step (a1), the linear single strand digestion includes the following steps:
step 3: mixing and reacting a digestion reaction solution with the cyclization product to obtain a digestion product.
The digestion reaction solution contains a TA buffer, ExoI enzymes, ExoIII enzymes, and enzyme-free water.
Further, in the digestion reaction solution, a ratio of the ExoI enzymes to the ExoIII enzymes may be 4 U: 1 U.
In a specific example of the present disclosure, 10×TA buffer, ExoI enzymes at a concentration of 20 U/μL, ExoIII enzymes at a concentration of 10 U/μL, and enzyme-free water are mixed in a volume ratio of 0.4:2:1:0.6, to prepare the digestion reaction solution.
The digestion reaction solution is mixed with the cyclization product in volume ratio of 4:60 or 4:67.5.
The digestion reaction solution and the cyclization product, after being mixed, may be incubated at 37° C. for 30 min.
At this step, EDTA may be added to and uniformly mixed with the reaction product.
In step (a1), the purification is a magnetic bead purification. Specifically, XP magnetic beads are used to purify and collect the product of the previous step, and the product is dissolved in a TE buffer.
In step (A5), (B4), or (C5), the sequencing may be specifically performed on BGISEQ, MGISEQ or DNBSEQ™ series sequencing platform.
In a second aspect, the present disclosure provides a method for constructing a DNA library applicable to PCR-free high-throughput sequencing.
The method for constructing a DNA library applicable to PCR-free high-throughput sequencing, as provided in the present disclosure, may include steps (A1) to (A4), (B1) to (B3), or (C1) to (C4) as described in the first aspect.
In a third aspect, the present disclosure provides a DNA library constructed by the method described in the second aspect.
In a fourth aspect, the present disclosure provides an adapter.
The adapter claimed in the present disclosure is the adapter described in the first aspect.
In a fifth aspect, the present disclosure provides a kit.
The kit provided in the present disclosure contains the adapter described in the fourth aspect and all or some of:
In a sixth aspect, the present disclosure provides a system.
The system provided in the present disclosure contains the kit described in the fifth aspect, and a DNBSEQ™ sequencing reagent and/or device.
In a seventh aspect, the present disclosure provides use of the DNA library described in the third aspect, the adapter described in the fourth aspect, the kit described in the fifth aspect or the system described in the sixth aspect in PCR-free high-throughput sequencing.
In an eighth aspect, the present disclosure provides use of the adapter described in the fourth aspect or the kit described in the fifth aspect in construction of the DNA library described in the third aspect.
The following examples are only for a better understanding of, rather than limiting, the present disclosure. Unless otherwise indicated, experimental methods in the following examples are conventional methods. Unless otherwise indicated, test materials used in the following examples are purchased from conventional biochemical reagent stores. Quantitative tests in the following examples are all repeated three times and test results are averaged.
Experimental objective: constructing a whole genome library from a human gDNA sample by using an MGI PCR-free kit in combination with an NEB digestion interruption kit.
Sources of experimental samples: NA12878 standard DNA (catalog number: NA12878, manufacturer: CORIELL INSTITUTE).
1. Interruption of DNA Sample with the NEB Digestion Interruption Kit
1 μg of standard DNA (dissolved in TE) was placed into each tube and subjected to digestion interruption with NEBNext® Ultra™ II FS DNA Module (NEB), and the volume of an interruption system was 35 μL. NEBNext Ultra II FS Reaction Buffer was thawed in advance and vortex-mixed, NEBNext Ultra II FS Enzyme Mix was uniformly mixed in an upside-down manner and placed on ice. A reaction system shown in Table 1 was prepared on ice.
After NEBNext Ultra II FS Enzyme Mix was uniformly pipetted, 2 μL of NEBNext Ultra II FS Enzyme Mix was added to the sample, the mixture was gently and uniformly pipetted 6-8 times (vortex-mixing was prohibited), subjected to transient centrifugation, and immediately placed in a thermocycler for reaction, the reaction conditions were as follows: 4° C. forever; 37° C. for 10 min; 65° C. for 30 min; and 4° C. forever; the heated lid of the PCR instrument was set to 70° C. After the reaction was completed, the sample was collected and placed on ice, and TE was added to make up the volume of the sample to 65 μL.
2. Selection of DNA Fragment
(1) 100 μL of interrupted sample was taken and transferred into a new 1.5 mL non-stick tube, 60 μL of XP magnetic beads was added to and uniformly mixed with the sample by shaking, and allowed to bind to DNA at the room temperature for 10 min. The tube was placed onto a magnetic rack, the magnetic beads were allowed to bind to DNA for 2 min (until the liquid became clear), and the supernate was carefully taken by suction and transferred into a new 1.5 mL EP tube (the supernate was reserved at this step). 15 μL of XP magnetic beads was added to and uniformly mixed with the supernate by shaking, and allowed to bind to DNA at the room temperature for 10 min, the tube was placed onto the magnetic rack, the magnetic beads were allowed to bind to DNA for 2 min (until the liquid became clear), and the supernate was removed by suction.
(2) 500 μL of 75% ethanol was placed into the non-stick tube on the magnetic rack, the tube cap was closed, the mixture in the tube was uniformly mixed, and the supernate was removed. After washing with 500 μL of 75% ethanol again, residual ethanol was removed as much as possible by using a pipette with a small measurement range, and the magnetic beads were air-dried at the room temperature.
(3) The magnetic beads were resuspended in and uniformly mixed with 42 μL of TE by shaking, and allowed to bind to DNA at the room temperature for 10 min; the tube was placed onto the magnetic rack; the magnetic beads were allowed to bind to DNA for 2 min (until the liquid became clear); and 40 μL of supernate was carefully taken by suction and transferred into a new 1.5 mL EP tube for next reaction, or it stored in a refrigerator at −20° C.
3. Quantitation and Normalization of the Sample
2 μL of purified DNA was taken and subjected to Qubit dsDNA HS quantitation. The selected DNA fragment was normalized according to the concentration determined by Qubit quantitation. The mass of the DNA fragment was adjusted to 150 ng, and 1×TE was added to make up the total volume of 40 μL. If necessary, the normalized samples can be stored in a refrigerator at −20° C.
The size of the obtained DNA fragment was 300 bp to 500 bp.
4. End Repair and A-Tailing
First, an end repair-A-tailing reaction solution was prepared according to Table 2.
10 μL of prepared end repair-A-tailing reaction solution was added to and uniformly vortex-mixed with 40 μL of product of step 3, the mixture was subjected to transient centrifugation, and the total volume of the mixture was made up to 50 μL. The reaction sample was placed into the PCR instrument for reaction, the reaction conditions were as follows: 14° C. for 15 min; 37° C. for 25 min; 65° C. for 15 min; and 4° C. forever, and the heated lid of the PCR instrument was set to 70° C.
5. Ligation of an Adapter
An adapter sequence used in the present protocol is as follows (in the present example, the 5′-end of the sequence is on the left side, the 3′-end of the sequence is on the right side, “//” represents a modifying group, “phos” represents phosphorylation, and the underlined bases represent a barcode of 10 bases).
where N may be A, T, C or G.
Preparation of adapter: 20 μL of adapter B strands (100 μM), 20 μL of adapter T strands (100 μM), and 40 μL of 2×adapter buffer (components: 50 mM Tris-HCl (pH=8.0), 0.1 mM EDTA, and 50 mM NaCl) were mixed to prepare adapters A (25 μM). The adapters A were placed at the room temperature for more than half an hour and then diluted to a use concentration or stored at −20° C. Before use, the adapters A (25 μM) were diluted with TE prepare adapters B (6 μM).
5 μL of prepared adapters B (6 μM) was added to and thoroughly mixed with the product of step 4.
A ligation reaction solution was prepared according to Table 3.
The prepared ligation reaction solution was uniformly vortex-mixed with the mixture of the adapters B and the product of step 4, and the mixture was subjected to transient centrifugation. The reaction sample was placed into the PCR instrument for reaction; the reaction conditions were as follows: 25° C. for 30 min; and 4° C. hold; and the heated lid of the PCR instrument was set to 30° C. After the reaction was completed, 20 μL of TE buffer was added, 50 μL of XP magnetic beads was added for purification, and the collected product was dissolved in 50 μL of TE buffer.
6. Single-Strand Cyclization
48 μL of purified product was incubated at 95° C. for 3 min and at 4° C. for 10 min.
A single-strand cyclization reaction solution was prepared according to Table 4.
12 μL of the prepared single-strand cyclization reaction solution was uniformly vortex-mixed with 48 μL of thermal denaturation product, and the mixture was subjected to transient centrifugation. The reaction sample was placed into the PCR instrument for reaction, the reaction conditions were as follows: 37° C. for 60 min; and 4° C. hold, and the heated lid of the PCR instrument was set to 42° C.
20 μM fragments for mediation have a corresponding complementary sequence to be ligated to both ends of the single strand. The corresponding complementary sequence is (in the present example, the 5′-end of the sequence is on the left side, and the 3′-end of the sequence is on the right side): GCCATGTCGTTCTGTGAGCCAAGG (SEQ ID NO: 8).
7. Digestion of a Linear Single Strand
A digestion reaction solution was prepared according to Table 5.
4 μL of prepared digestion reaction solution was added to and uniformly mixed with 60 μL of reaction product of the previous step. The mixture was incubated at 37° C. for 30 min, and added and uniformly mixed with 3 μL of EDTA (500 mM, Ambion). A product was purified and collected with 120 μL of XP magnetic beads, and dissolved in 30 μL of TE buffer.
8. Quantitation of a Single-Stranded Circle
The single-stranded cyclization product obtained through the digestion of the linear single strand in the previous step was quantitated by using a Qubit ssDNA Assay Kit.
9. Sequencing
DNA nanoballs were prepared with the constructed single-stranded circular DNA library and sequenced on MGISEQ-2000 PE150. The sequencing and data analysis followed the standard operating process of MGISEQ-2000 PE150.
10. Library Construction and Sequencing Results of the Present Example
Table 6 illustrates the sequencing quality of the human sample WGS PCR-free library (based on NEB digestion interruption) obtained by the library construction and sequencing method of the present example. Table 6 indicates that the human sample WGS PCR-free library (based on NEB digestion interruption) has relatively high sequencing quality on the high-throughput sequencing platform MGISEQ-2000RS PE150, which is self-developed by MGI.
Table 7 shows SNP and Indel variation detection and analysis results of the NA12878 WGS PCR-free library obtained by the library construction and sequencing method of the present example. Table 7 indicates that the PCR-free library (based on NEB digestion interruption) of the present disclosure is significantly superior to the PCR library in terms of Indel calling, and its overall performance is similar to NovaSeq PCR-free PE150 on the Illumina platform.
Experimental objective: a plasma sample library was constructed by using an MGI PCR-free kit.
Sources of experimental samples: 20 plasma samples, including 2 abnormal chromosome samples.
1. Sample Collection and Treatment
2. End Repair and Addition of Adenylate Deoxyribonucleic Acid
An end repair-A-tailing reaction solution was prepared according to Table 8.
10 μL of prepared end repair-A-tailing reaction solution was added to and uniformly mixed with 40 μL of DNA, and the mixture was incubated at 37° C. for 10 min and at 72° C. for 15 min, and cooled to 4° C. at a rate of 0.1 s.
3. Ligation of an Adapter
An adapter sequence used in the present protocol is as follows (in the present example, the 5′-end of the sequence is on the left side, the 3′-end of the sequence is on the right side, “//” represents a modifying group, “phos” represents phosphorylation, and the underlined bases represent a barcode of 10 bases).
Preparation of adapter: 20 μL of adapter B strands (100 μM), 20 μL of adapter strands (100 μM), and 40 μL of 2×adapter buffer (components: 50 mM Tris-HCl (pH=8.0), 0.1 mM EDTA, and 50 mM NaCl) were mixed to prepare adapters A (25 μM). The adapters A were placed at the room temperature for more than half an hour and diluted to a use concentration or stored at −20° C. Before use, the adapters A (25 μM) were diluted with TE to prepare adapters B (1 μM).
1 μL of prepared adapters B (1 μM) was added to and thoroughly mixed with the product of step 3.
A ligation reaction solution was prepared according to Table 9.
The prepared ligation reaction solution was uniformly vortex-mixed with the mixture of the adapters B and the product of step 4, and the mixture was subjected to transient centrifugation. The reaction sample was placed into the PCR instrument for reaction, the reaction conditions were as follows: 25° C. for 30 min and 4° C. hold, and the heated lid of the PCR instrument was set to 30° C. After the reaction was completed, 20 μL of TE buffer was added, 50 μL of XP magnetic beads (Beckman Coulter) was added for purification, and the collected product was dissolved in 22 μL of TE buffer. 15 μL of the respective samples was taken, multiple samples of the same volume were mixed and purified by adding XP magnetic beads (Beckman Coulter) in twice the volume of the sample mixture, and the collected product was dissolved in 22 μL of TE buffer.
4. Sequencing
A DNA nanoball can be prepared by multiple methods.
Method 1: referring to steps 6 to 9 of the whole genome library construction and sequencing of Example 1, single-strand cyclization of a sample, digestion of a linear single strand, preparation of DNA nanoballs, and sequencing on BGISEQ-500SE50+10 are performed. The sequencing and data analysis follow the standard operating process of BGISEQ-500 SE50+10.
Method 2: the constructed ligation product is taken and subjected to one-step preparation of DNA nanoballs and sequencing on BGISEQ-500SE50+10. The sequencing and data analysis follow the standard operating process of BGISEQ-500 SE50+10. In the present example, the method 2 was adopted.
5. Library Construction and Sequencing Results of the Present Example
1 μL of ligation product was taken and subjected to PCR, and the size of an adapter-ligated fragment was verified. Results are shown in
Concentration detection results of the products in the respective steps are shown in Table 10. It can be seen that this method can be used to prepare a library from this type of sample.
Experimental objective: a whole genome library was prepared from a human gDNA sample by using an MGI PCR-free kit in combination with an NEB digestion interruption kit.
Sources of experimental samples: NA12878 standard DNA (catalog number: NA12878, manufacturer: CORIELL INSTITUTE).
1. Digestion Interruption, End Repair, and A-Tailing of the DNA Sample
1 μg of standard DNA (dissolved in TE) was placed into each tube, interrupted with dsDNA Fragmentase, and subjected to end repair and A-tailing, and the volume of an interruption system was 50 μL. A corresponding reagent was thawed in advance and uniformly mixed, an enzyme reagent was uniformly mixed in an upside-down manner and placed on ice. A reaction system was prepared according to Table 11 on ice.
The DNA sample was added to and uniformly mixed with the prepared reaction system by pipetting or vortex-mixing; the mixture was subjected to transient centrifugation and immediately placed into the thermocycler for reaction, the reaction conditions were as followings: 4° C. forever; 37° C. for 20 min; 65° C. for 30 min; and 4° C. forever, and the heated lid of the PCR instrument was set to 70° C. After the reaction was completed, the sample was collected and placed on ice immediately, and TE was added to make up the volume of the sample to 50 μL.
2. Selection of a DNA Fragment
(1) 100 μL of interrupted sample was taken and transferred into a new 1.5 mL non-stick tube; 60 μL of XP magnetic beads was added to and uniformly mixed with the sample by shaking, and allowed to bind to DNA at the room temperature for 10 min; the tube was placed onto the magnetic rack; the magnetic beads were allowed to bind to DNA for 2 min (until the liquid became clear); and the supernate was carefully taken by suction and transferred into a new 1.5 mL EP tube (the supernate was reserved at this step). 15 μL of XP magnetic beads was added to and uniformly mixed with the supernate by shaking, and allowed to bind to DNA at the room temperature for 10 min; the tube was placed onto the magnetic rack, the magnetic beads were allowed to bind to DNA for 2 min (until the liquid became clear); and the supernate was removed by suction.
(2) 500 μL of 75% ethanol was placed into the non-stick tube on the magnetic rack, the tube cap was closed, the mixture in the tube was uniformly mixed, and the supernate was removed. After washing with 500 μL of 75% ethanol again, residual ethanol was removed as much as possible by using a pipette with a small measurement range, and the magnetic beads were air-dried at the room temperature.
(3) The magnetic beads were resuspended and uniformly mixed with 42 μL of TE by shaking, and allowed to bind to DNAs at the room temperature for 10 min; the tube was placed onto the magnetic rack, the magnetic beads were allowed to bind to DNAs for 2 min (until the liquid became clear); and 40 μL of supernate was carefully taken by suction and transferred into a new 1.5 mL EP tube for next reaction, or it stored in a refrigerator at −20° C.
3. Quantitation and Normalization of the Sample
2 μL of purified DNA was taken and subjected to Qubit dsDNA HS quantitation. The selected DNA fragments were normalized according to the concentration determined by Qubit quantitation; the mass of the DNA fragment was adjusted to 150 ng; and 1×TE was added to make up the total volume of 40 μL. If necessary, the normalized samples can be stored in a refrigerator at −20° C.
The size of the obtained DNA fragment was 300 bp to 500 bp.
5. Ligation of Adapter
An adapter sequence used in the present protocol is as follows (in the present example, the 5′-end of the sequence is on the left side, the 3′-end of the sequence is on the right side, “//” represents a modifying group, “phos” represents phosphorylation, and the underlined bases represent a barcode of 10 bases).
Preparation of the adapter: 20 μL of adapter B strands (100 μM), 20 μL of adapter T strands (100 μM), and 40 μL of 2×adapter buffer (components: 50 mM Tris-HCl (pH=8.0), 0.1 mM EDTA, and 50 mM NaCl) were mixed to prepare adapters A (25 μM); and the adapters A were placed at the room temperature for more than half an hour and then diluted to a use concentration or stored at −20° C. Before use, the adapters A (25 μM) were diluted with TE to prepare adapters B (6 μM).
5 μL of the prepared adapters B (6 μM) was added to and thoroughly mixed with the product of step 4.
A ligation reaction solution was prepared according to Table 12.
The prepared ligation reaction solution was uniformly vortex-mixed with the mixture of the adapters B and the product of step 4, and the mixture was subjected to transient centrifugation. The reaction sample was placed into the PCR instrument for reaction, the reaction conditions were as follows: 25° C. for 30 min; and 4° C. hold, and the heated lid of the PCR instrument was set to 30° C. After the reaction was completed, 20 μL of TE buffer was added, 50 μL of XP magnetic beads was added for purification, and the collected product was dissolved in 50 μL of TE buffer.
6. Single-Strand Cyclization
48 μL of purified product was incubated at 95° C. for 3 min and at 4° C. for 10 min.
A single-strand cyclization reaction solution was prepared according to Table 13.
12 μL of the prepared single-strand cyclization reaction solution was uniformly vortex-mixed with 48 μL of thermal denaturation product, and the mixture was subjected to transient centrifugation. The reaction sample was placed into the PCR instrument for reaction, the reaction conditions were as follows: 37° C. for 60 min; and 4° C. hold, and the heated lid of the PCR instrument was set to 42° C.
20 μM fragments for mediation have a corresponding complementary sequence to be ligated to both ends of the single strand. The corresponding complementary sequence is (in the present example, the 5′-end of the sequence is on the left side, and the 3′-end of the sequence is on the right side): GCCATGTCGTTCTGTGAGCCAAGG (SEQ ID NO: 8).
7. Digestion of a Linear Single Strand
A digestion reaction solution was prepared according to Table 14.
4 μL of prepared digestion reaction solution was added to and uniformly mixed with 60 μL of reaction product of the previous step. The mixture was incubated at 37° C. for 30 min, and added and uniformly mixed with 3 μL of EDTA (500 mM, Ambion). A product was purified and collected with 120 μL of XP magnetic beads, and dissolved in 30 μL of TE buffer.
8. Quantitation of a Single-Stranded Circle
The single-stranded cyclization product obtained through the digestion of the linear single strand in the previous step was quantitated by using a Qubit ssDNA Assay Kit.
Concentration detection results of the products in the respective steps are shown in Table 15. It can be seen that, the integration of the interruption, end repair, and A-tailing in one step is also suitable for the PCR-free library construction.
Experimental objective: a whole genome library was prepared from a human gDNA sample by using an MGI PCR-free kit in combination with an NEB digestion interruption kit.
Sources of experimental samples: NA12878 standard DNA (catalog number: NA12878, manufacturer: CORIELL INSTITUTE).
1. Digestion Interruption and End Repair of the DNA Sample
1 μg of standard DNA (dissolved in TE) was placed into each tube, interrupted with dsDNA Fragmentase (catalog number: M0348, NEB), and subjected to end repair, and the volume of the system was 50 μL. 10× Fragmentase Reaction Buffer v2 was thawed in advance and uniformly vortex-mixed, dsDNA Fragmentase was uniformly vortex-mixed and placed on ice. A reaction system was prepared according to Table 16 on ice.
The above reaction system was uniformly pipetted, 20 μL of gDNA sample (total mass was 1 ug) was added to and uniformly mixed with the reaction system by gently pipetting 6-8 times or vortex-mixing; the mixture was subjected to transient centrifugation and immediately placed into the thermocycler for reaction; the reaction conditions were as follows: 37° C. for 30 min; and 4° C. forever, and the heated lid of the PCR instrument was set to 70° C. After the reaction was completed, the sample was collected and placed on ice immediately, and TE was added to make up the volume of the sample to 30 μL.
2. Selection of a DNA Fragment
(1) 100 μL of interrupted sample was taken and transferred into a new 1.5 mL non-stick tube; 52 μL of XP magnetic beads was added to and uniformly mixed with the sample by shaking, and allowed to bind to DNA at the room temperature for 10 min; after transient centrifugation, the tube was placed onto the magnetic rack, the magnetic beads were allowed to bind to DNA for 2 min (until the liquid became clear); and the supernate was carefully taken by suction and transferred into a new 1.5 mL EP tube (the supernate was reserved at this step). 15 μL of XP magnetic beads was added to and uniformly mixed with the supernate by shaking, and allowed to bind to DNA at the room temperature for 10 min; the tube was placed onto the magnetic rack, the magnetic beads were allowed to bind to DNA for 2 min (until the liquid became clear); and the supernate was removed by suction.
(2) 500 μL of 75% ethanol was placed into the non-stick tube on the magnetic rack, the tube cap was closed, the mixture in the tube was uniformly mixed, and the supernate was removed. After washing with 500 μL of 75% ethanol again, residual ethanol was removed as much as possible by using a pipette with a small measurement range, and the magnetic beads were air-dried at the room temperature.
(3) The magnetic beads were resuspended in and uniformly mixed with 42 μL of TE by shaking, and allowed to bind to DNA at the room temperature for 10 min; after transient centrifugation, the tube was placed onto the magnetic rack, the magnetic beads were allowed to bind to DNA for 2 min (until the liquid became clear); and 40 μL of supernate was carefully taken by suction and transferred into a new 1.5 mL EP tube for the next reaction, or it was stored in a refrigerator at −20° C.
3. Quantitation and Normalization of the Sample
2 μL of purified DNA was taken and subjected to Qubit dsDNA HS quantitation. The selected DNA fragment was normalized according to the concentration determined by Qubit quantitation; the mass of the DNA fragment was adjusted to 150 ng; and 1×TE was added to make up the total volume of 40 μL. If necessary, the normalized samples can be stored in a refrigerator at −20° C.
The size of the obtained DNA fragment was 300 bp to 500 bp.
4. A-tailing
First, an A-tailing reaction solution was prepared according to Table 17.
10 μL of prepared A-tailing reaction solution was added to and vortex-mixed with 40 μL of product of step 3; the mixture was subjected to transient centrifugation; and the total volume of the mixture was made up to 50 μL. The reaction sample was placed into the PCR instrument for reaction, the reaction conditions were as follows: 65° C. for 30 min; and 4° C. forever, and the heated lid of the PCR instrument was set to 70° C.
5. Ligation of Adapter
An adapter sequence used in the present protocol is as follows (in the present example, the 5′-end of the sequence is on the left side, the 3′-end of the sequence is on the right side, “//” represents a modifying group, “phos” represents phosphorylation, and the underlined bases represent a barcode of 10 bases).
Preparation of the adapter: 20 μL of adapter B strands (100 μM), 20 μL of adapter T strands (100 μM), and 40 μL of 2× adapter buffer (components: 50 mM Tris-HCl (pH=8.0), 0.1 mM EDTA, and 50 mM NaCl) were mixed to prepare adapters A (25 μM); and the adapters A were placed at the room temperature for more than half an hour and then diluted to a use concentration or stored at −20° C. Before use, the adapters A (25 μM) were diluted with TE to prepare adapters B (6 μM).
5 μL of prepared adapters B (6 μM) was added to and thoroughly mixed with the product of step 4.
A ligation reaction solution was prepared according to Table 18.
The prepared ligation reaction solution was uniformly vortex-mixed with the mixture of the adapters B and the product of step 4, and the mixture was subjected to transient centrifugation. The reaction sample was placed into the PCR instrument for reaction, the reaction conditions were as follows: 25° C. for 30 min; and 4° C. hold, and the heated lid of the PCR instrument was set to 30° C. After the reaction was completed, 20 μL of TE buffer was added, 50 μL of XP magnetic beads was added for purification, and the collected product was dissolved in 50 μL of TE buffer.
6. Single-Strand Cyclization
5 μL of fragments (20 μM) for mediation and 2.5 μL of NaOH (2 M, Sigma) were added to and uniformly vortex-mixed with 48 μL of purified product, and the mixture was placed at the room temperature for 5 min. 5 μL of Tris-HCl (1 M, pH=6.8) was added to and uniformly vortex-mixed with the mixture, and a single-strand cyclization reaction solution shown in Table 19 was added.
The reaction sample was placed into the PCR instrument for reaction, the reaction conditions were as follows: 37° C. for 30 min; and 4° C. hold, and the heated lid of the PCR instrument was set to 42° C.
20 μM fragments for mediation have a corresponding complementary sequence to be ligated to both ends of the single strand. The corresponding complementary sequence is (in the present example, the 5′-end of the sequence is on the left side, and the 3′-end of the sequence is on the right side): GCCATGTCGTTCTGTGAGCCAAGG (SEQ ID NO: 8).
7. Digestion of a Linear Single Strand
A digestion reaction solution was prepared according to Table 20.
4 μL of prepared digestion reaction solution was added to and uniformly mixed with 67.5 μL of reaction product of the previous step. The mixture was incubated at 37° C. for 30 min, and added and uniformly mixed with 3 μL of EDTA (500 mM, Ambion). A product was purified and collected with 120 μL of XP magnetic beads, and dissolved in 30 μL of TE buffer.
8. Quantitation of a Single-Stranded Circle
The single-stranded cyclization product obtained through the digestion of the linear single strand in the previous step was quantitated by using a Qubit ssDNA Assay Kit.
Concentration detection results of the products in the respective steps are shown in Table 21. It can be seen that, the integration of the interruption, end repair, and A-tailing in one step is also suitable for PCR-free library construction.
Experimental objective: a whole genome library was prepared from a human gDNA sample by using an MGI PCR-free kit in combination with a dual-barcode adapter and an NEB digestion interruption kit.
Sources of experimental samples: NA12878 standard DNA (catalog number: NA12878, manufacturer: CORIELL INSTITUTE).
1. Digestion Interruption of the DNA Sample
1 μg of standard DNA (dissolved in TE) was placed into each tube and interrupted with dsDNA Fragmentase (catalog number: M0348, NEB), the volume of an interruption system was 50 μL. 10×Fragmentase Reaction Buffer v2 was thawed in advance and uniformly vortex-mixed, and dsDNA Fragmentase was uniformly vortex-mixed and placed on ice. A reaction system was prepared according to Table 22 on ice.
The sample, after being pipetted, was added with and gently uniformly mixed with 3 μL of dsDNA Fragmentase by pipetting 6-8 times or vortex-mixing. After transient centrifugation, the mixture was immediately placed into the thermocycler for reaction, the reaction conditions were as follows: 37° C. for 25 min; 65° C. for 15 min; and 4° C. forever, and the heated lid of the PCR instrument was set to 70° C. After the reaction was completed, the sample was collected and placed on ice immediately, and TE was added to make up the total volume of the sample to 70 μL.
2. Selection of a DNA Fragment
(1) 100 μL of interrupted sample was taken and transferred into a new 1.5 mL non-stick tube; 60 μL of XP magnetic beads was added to and uniformly mixed with the sample by shaking, and allowed to bind to DNA at the room temperature for 10 min; after transient centrifugation, the tube was placed onto the magnetic rack, the magnetic beads were allowed to bind to DNA for 2 min (until the liquid became clear); and the supernate was carefully taken by suction and transferred into a new 1.5 mL EP tube (the supernate was reserved at this step). 15 μL of XP magnetic beads was added to and uniformly mixed with the supernate by shaking, and allowed to bind to DNA at the room temperature for 10 min. The tube was placed onto the magnetic rack, the magnetic beads were allowed to bind to DNA for 2 min (until the liquid became clear), and the supernate was removed by suction.
(2) 500 μL of 75% ethanol was placed into the non-stick tube on the magnetic rack, the tube cap was closed, the mixture in the tube was uniformly mixed. The supernate was removed. After washing with 500 μL of 75% ethanol again, residual ethanol was removed as much as possible by using a pipette with a small measurement range, and the magnetic beads were air-dried at the room temperature.
(3) The magnetic beads were resuspended in and uniformly mixed with 42 μL of TE by shaking, and allowed to bind to DNA at the room temperature for 10 min; after transient centrifugation, the tube was placed onto the magnetic rack, the magnetic beads were allowed to bind to DNA for 2 min (until the liquid became clear); and 40 μL of supernate was carefully taken by suction and transferred into a new 1.5 mL EP tube for next reaction, or it was stored in a refrigerator at −20° C.
3. Quantitation and Normalization of the Sample
2 μL of purified DNA was taken and subjected to Qubit dsDNA HS quantitation. The selected DNA fragment was normalized according to the concentration determined by Qubit quantitation, the mass of the DNA fragment was adjusted to 150 ng, and 1×TE was added to make up the total volume of 40 μL. If necessary, the normalized samples can be stored in a refrigerator at −20° C.
The size of the obtained DNA fragment was 300 bp to 500 bp.
4. End Repair and A-Tailing
First, an end repair-A-tailing reaction solution was prepared according to Table 23.
10 μL of prepared end repair-A-tailing reaction solution was added to and uniformly vortex-mixed with 40 μL of product of step 3; the mixture was subjected to transient centrifugation; and the total volume of the mixture was made up to 50 μL. The reaction sample was placed into the PCR instrument for reaction, the reaction conditions were as follows: 14° C. for 15 min; 37° C. for 25 min; 65° C. for 15 min; and 4° C. forever, and the heated lid of the PCR instrument was set to 70° C.
5. Ligation of an Adapter
An adapter sequence used in the present protocol is as follows (in the present example, the 5′-end of the sequence is on the left side, the 3′-end of the sequence is on the right side, “//” represents a modifying group, “phos” represents phosphorylation, and the underlined bases represent a barcode of 10 bases).
NNNCAACTCCTTGGCTCACAGAACGACATGGC
Preparation of the adapter: 20 μL of adapter B strands (100 μM), 20 μL of adapter T strands (100 μM), and 40 μL of 2×adapter buffer (components: 50 mM Tris-HCl (pH=8.0), 0.1 mM EDTA, and 50 mM NaCl) were mixed to prepare adapters A (25 μM); and the adapters A were placed at the room temperature for more than half an hour and then diluted to a use concentration or stored at −20° C. Before use, the adapters A (25 μM) were diluted with TE to prepare adapters B (6 μM).
5 μL of the prepared adapters B (6 μM) was added to and thoroughly mixed with the product of step 4.
A ligation reaction solution was prepared according to Table 24.
The prepared ligation reaction solution was uniformly vortex-mixed with the mixture of the adapters B and the product of step 4, and the mixture was subjected to transient centrifugation. The reaction sample was placed into the PCR instrument for reaction, the reaction conditions were as follows: 25° C. for 30 min; and 4° C. hold, and the heated lid of the PCR instrument was set to 30° C. After the reaction was completed, 20 μL of TE buffer was added, 50 μL XP magnetic beads was added for purification, and the collected product was dissolved in 50 μL of TE buffer.
6. Single-Strand Cyclization
48 μL of purified product was incubated at 95° C. for 3 min and at 4° C. for 10 min.
A single-strand cyclization reaction solution was prepared according to Table 25.
12 μL of the prepared single-strand cyclization reaction solution was uniformly vortex-mixed with 48 μL of thermal denaturation product, and the mixture was subjected to transient centrifugation. The reaction sample was placed into the PCR instrument for reaction, the reaction conditions were as follows: 37° C. for 60 min; and 4° C. hold, and the heated lid of the PCR instrument was set to 42° C.
20 μM fragments for mediation have a corresponding complementary sequence to be ligated to both ends of the single strand. The corresponding complementary sequence is (in the present example, the 5′-end of the sequence is on the left side, and the 3′-end of the sequence is on the right side): TGCTGACGTACTGAGAGGCATGGCGACCT (SEQ ID NO: 8).
7. Digestion of a Linear Single Strand
A digestion reaction solution was prepared according to Table 26.
4 μL of prepared digestion reaction solution was added to and uniformly mixed with 60 μL of reaction product of the previous step. The mixture was incubated at 37° C. for 30 min, and added and uniformly mixed with 3 μL of EDTA (500 mM, Ambion). A product was purified and collected with 120 μL of XP magnetic beads, and dissolved in 30 μL of TE buffer.
8. Quantitation of a Single-Stranded Circle
The single-stranded cyclization product obtained through the digestion of the linear single strand in the previous step was quantitated by using a Qubit ssDNA Assay Kit.
9. Sequencing
DNA nanoballs were prepared with the constructed single-stranded circular DNA library and sequenced on MGISEQ-2000 PE15. The sequencing and data analysis followed the standard operating process of MGISEQ-2000 PE150.
10. Library Construction and Sequencing Results Ofthe Present Example
Concentration detection results ofthe products in the respective steps are shown in Table 27. Table 28 shows the sequencing quality of the human sample WGS PCR-free library (based on digestion interruption) obtained by the library construction and sequencing method of the present example. Table 28 indicates that the human sample WGS PCR-free library (based on digestion interruption) has relatively high sequencing quality on the high-throughput sequencing platform MGISEQ-2000RS PE150, self-developed by MGI.
Table 29 shows SNP and Indel variation detection and analysis results of the NA12878 PCR-free library (based on NEB digestion interruption) obtained by the library construction and sequencing method of the present example. Table 29 indicates that the PCR-free library (based on digestion interruption) of the present disclosure is significantly superior to the PCR library in terms of Indel calling, and its overall performance is similar to that of NovaSeq PCR-free PE150 on the Illumina platform.
The PCR-free construction solutions provided by the present disclosure overcomes the problems such as base pairing mistake, data bias, and repetitive sequences, which may be introduced by PCR during library construction. Furthermore, these solutions are compatible with library construction based on different interruption methods and small inputs. In combination with the nanoball preparation method based on rolling circle replication, the PCR-free library construction method of the present disclosure achieves true PCR-free library construction and sequencing for samples, thereby achieving whole PCR-free process. In the present disclosure, by adopting an optimized system for end repair and adapter ligation and the one-step reaction of single-strand cyclization and rolling circle replication, the library construction efficiency of the self-developed platform PCR-free library construction kit is improved, and required DNA inputs are reduced. The library construction system of the present disclosure is compatible with different types of starting samples, which include, but are not limited to, genomic DNA, interrupted DNA, DNA or an interrupted DNA product obtained by whole genome amplification, amplicon DNA, cfDNA, and DNA obtained by reverse transcription of RNA. Specifically, compared to the prior art, the present disclosure has the following advantages. 1) Wide applicability: the present disclosure is applicable to all species having known or unknown reference sequences; it can be adopted by general molecular biology laboratories; and it is compatible with library construction based on physical interruption and digestion interruption, compatible with different types of samples, which include, but are not limited to, genomic DNA, interrupted DNA, DNA or an interrupted DNA product obtained by whole genome amplification, amplicon DNA, cfDNA, and DNA obtained by reverse transcription of RNA. 2) Simple operation and shorter time for library construction: according to the present disclosure, the end repair and the A-tailing reaction are performed in the same tube; the magnetic bead purification is omitted to directly perform the adapter ligation; and the conventional PCR amplification and purification are omitted, thereby greatly shortening the time for library construction. Furthermore, according to the library construction method, the cyclization and the rolling circle replication are performed simultaneously, thereby further shortening the time for library construction. 3) High library construction efficiency: according to the library construction method, by adopting the optimized system for end repair and A-tailing, the optimized system for adapter ligation, and the one-step of cyclization and rolling circle replication reaction, a pooling library construction and sequencing with small inputs of starting samples as well as the PCR-free library construction using 200 μL of plasma DNA can be achieved. 4) Enhanced accuracy of sequencing data on the self-developed platform: the methods of the present disclosure achieve the true PCR-free library construction and sequencing, which can improve the accuracy and sensitivity of SNP and InDel detection, and the methods of the present disclosure especially have excellent performance in term of InDel detection over other platforms from business competitors, for example, Illumina.
This application is a continuation of International Application No. PCT/CN2020/096987, filed on Jun. 19, 2020, the disclosure of which is hereby incorporated by reference in entirety.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/CN2020/096987 | Jun 2020 | US |
Child | 18067540 | US |