The present application relates to the field of next-generation sequencing, and specifically, to the field of target region hybridization capture and universal blocking oligonucleotide sequence design and synthesis.
Target region hybridization capture is a diagnostic technology based on next-generation sequencing and belongs to the field of personalized medicine and companion diagnostics. The human genome encodes a large amount of genetic information, having 23 pairs of chromosomes and 20,000 to 25,000 genes. A large number of studies have shown that base mutations at specific sites in some genes can be used as biomarkers for the diagnosis of some diseases. Although the cost of whole exome sequencing and even genome sequencing has been significantly reduced with the development of next-generation sequencing technology, if the whole exome sequencing or genome sequencing is used only for detection of a certain type of disease, it will cause serious waste of sequencing throughput and increase of detection costs. To improve sequencing efficiency and reduce detection costs, genes related to a certain type of disease are usually screened in combination, and only these genes are sequenced.
In this case, a key step is to extract and isolate target genes from the genome. The commonly method used in the market is to design primer probes that can target bind to the target gene sequence, so as to isolate the target gene sequence from many heterogeneous gene sequences. The primer probe usually contains biotin modification. After it binds to the target gene sequence, specific purification can be carried out by using magnetic beads coated with streptavidin.
Illumina's next-generation sequencer is widely used in this field. After the genomic DNA is extracted, capture sample preparation will be carried out according to Illumina's library preparation kits. The genomic DNA is usually fragmented, fragments ranging from 300 to 500 bp in length are screened out, and reverse complementary adapter sequences are added at both ends of the fragment for sequencing. In this case, in addition to that the biotin-modified primer probes can bind to the target gene sequence to achieve isolation by magnetic beads, the gene sequence in the non-target region can also be hybridized with complementary adapter sequences at both ends and indirectly isolated by magnetic beads, thereby resulting in non-specific capture. To solve this problem, a large number of oligonucleotide sequences that contain blockers and are reverse complementary to the adapter sequences can be added to the hybridization system. However, when multiple samples are captured simultaneously for target sequences, a universal blocking oligonucleotide sequence cannot be found due to the inconsistent index sequence within the adapter sequence in different samples.
To solve the foregoing technical problem, according to an aspect, the present application provides an oligonucleotide containing a blocker introduced into a basic sequence, where the backbone of the blocker contains a carbon-carbon bond, a carbon-oxygen bond, a carbon-nitrogen bond, a nitrogen-oxygen bond, a phosphorus-oxygen bond, or a combination thereof, and the quantity of the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, the phosphorus-oxygen bond, or the combination thereof is 10-900.
In some embodiments, the quantity of the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, the phosphorus-oxygen bond, or the combination thereof is 20-200.
In some embodiments, the quantity of the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, the phosphorus-oxygen bond, or the combination thereof is 20.
In some embodiments, the quantity of the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, the phosphorus-oxygen bond, or the combination thereof is 30-54.
In some embodiments, the quantity of the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen—oxygen bond, the phosphorus-oxygen bond, or the combination thereof is 31.
In some embodiments, the quantity of the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, the phosphorus-oxygen bond, or the combination thereof is 32.
In some embodiments, the quantity of the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, the phosphorus-oxygen bond, or the combination thereof is 33.
In some embodiments, the quantity of the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, the phosphorus-oxygen bond, or the combination thereof is 34.
In some embodiments, the quantity of the carbon-carbon bond, the carbon—oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, the phosphorus-oxygen bond, or the combination thereof is 35.
In some embodiments, the quantity of the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, the phosphorus-oxygen bond, or the combination thereof is 36.
In some embodiments, the quantity of the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, the phosphorus-oxygen bond, or the combination thereof is 37.
In some embodiments, the quantity of the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, the phosphorus-oxygen bond, or the combination thereof is 38.
In some embodiments, the quantity of the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, the phosphorus-oxygen bond, or the combination thereof is 39.
In some embodiments, the quantity of the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, the phosphorus-oxygen bond, or the combination thereof is 40.
In some embodiments, the quantity of the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, the phosphorus-oxygen bond, or the combination thereof is 41.
In some embodiments, the quantity of the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, the phosphorus-oxygen bond, or the combination thereof is 42.
In some embodiments, the quantity of the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, the phosphorus-oxygen bond, or the combination thereof is 43.
In some embodiments, the quantity of the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, the phosphorus-oxygen bond, or the combination thereof is 44.
In some embodiments, the quantity of the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, the phosphorus-oxygen bond, or the combination thereof is 45.
In some embodiments, the quantity of the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, the phosphorus-oxygen bond, or the combination thereof is 46.
In some embodiments, the quantity of the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, the phosphorus-oxygen bond, or the combination thereof is 47.
In some embodiments, the quantity of the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, the phosphorus-oxygen bond, or the combination thereof is 48.
In some embodiments, the quantity of the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, the phosphorus-oxygen bond, or the combination thereof is 49.
In some embodiments, the quantity of the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, the phosphorus-oxygen bond, or the combination thereof is 50.
In some embodiments, the quantity of the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, the phosphorus-oxygen bond, or the combination thereof is 51.
In some embodiments, the quantity of the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, the phosphorus-oxygen bond, or the combination thereof is 52.
In some embodiments, the quantity of the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, the phosphorus-oxygen bond, or the combination thereof is 53.
In some embodiments, the quantity of the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, the phosphorus-oxygen bond, or the combination thereof is 54.
In some embodiments, the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, the phosphorus-oxygen bond, or the combination thereof exists in a glycol molecule, nitroindole, deoxyinosine, furan, or a phosphine oxide thereof.
In some embodiments, the glycol molecule is selected from ethylene glycol, propylene glycol, butylene glycol, pentylene glycol, hexylene glycol, nonamethylene glycol, dodecamethylene glycol, or polyethylene glycol.
In some embodiments, the polyethylene glycol is selected from triethylene glycol, tetraethylene glycol, hexaethylene glycol, polyethylene glycol 2000, or polyethylene glycol 4500.
In some embodiments, the blocker further contains a phosphodiester bond.
In some embodiments, the blocker includes the following structural formula, m being an integer from 3-150:
In some embodiments, m is 6.
In some embodiments, m is 7.
In some embodiments, m is 8.
In some embodiments, m is 9.
In some embodiments, m is 10.
In some embodiments, the blocker includes the following structural formula, n being an integer from 4-150:
In some embodiments, n is 5.
In some embodiments, n is 6.
In some embodiments, n is 7.
In some embodiments, n is 8.
In some embodiments, n is 9.
In some embodiments, the blocker includes the following structural formula, a being an integer from 3-150:
In some embodiments, a is 3.
In some embodiments, a is 4.
In some embodiments, a is 5.
In some embodiments, a is 6.
In some embodiments, a is 7.
In some embodiments, a is 8.
In some embodiments, a is 9.
In some embodiments, a is 10.
In some embodiments, a is 45.
In some embodiments, a is 46.
In some embodiments, a is 102.
In some embodiments, a is 103.
In some embodiments, the basic sequence is selected from SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, or SEQ ID NO: 4.
In some embodiments, the basic sequence is selected from SEQ ID NO: 1.
In some embodiments, the basic sequence is selected from SEQ ID NO: 2.
In some embodiments, the basic sequence is selected from SEQ ID NO: 3.
In some embodiments, the basic sequence is selected from SEQ ID NO: 4.
In some embodiments, the basic sequence further includes a modified base, and the modified base is selected from a bicyclic nucleotide modified base, a phosphorothioate modified base, a 2′-methoxy modified base, or a combination thereof.
In some embodiments, the quantity of the modified base is 1-20.
In some embodiments, the quantity of the modified base is 2-18.
In some embodiments, the quantity of the modified base is 3-16.
In some embodiments, the bicyclic nucleotide modified base is selected from a locked nucleic acid modified base.
In some embodiments, the locked nucleic acid modified base is selected from LNA-A, LNA-G, LNA-C, LNA-T, or a combination thereof.
In some embodiments, the locked nucleic acid modified base is LNA-A, LNA-C, or LNA-T.
In some embodiments, the phosphorothioate modified base is selected from phosphorothioate modified adenine, phosphorothioate modified guanine, phosphorothioate modified cytosine, phosphorothioate modified thymine, phosphorothioate modified uracil, or a combination thereof.
In some embodiments, preferably, the phosphorothioate modified base is selected from phosphorothioate modified adenine, phosphorothioate modified cytosine, phosphorothioate modified thymine, phosphorothioate modified uracil, or a combination thereof.
In some embodiments, the 2′-methoxy modified base is selected from 2′-methoxy modified adenine, 2′-methoxy modified guanine, 2′-methoxy modified cytosine, 2′-methoxy modified thymine, 2′-methoxy modified uracil, or a combination thereof.
In some embodiments, preferably, the 2′-methoxy modified base is selected from 2′-methoxy modified guanine, 2′-methoxy modified cytosine, 2′-methoxy modified uracil, or a combination thereof.
In some embodiments, the oligonucleotide further includes a 3′-end modification. In some embodiments, in the oligonucleotide, the 3′-end modification is selected from 3′-Spacer C3 modification or 3′-ddC modification.
According to another aspect, the present application provides a combination of oligonucleotide sequences, including at least two oligonucleotides.
In some embodiments, the combination of oligonucleotide sequences includes two oligonucleotides.
In some embodiments, the combination of oligonucleotide sequences includes an oligonucleotide as set forth in SEQ ID NO: 5 and an oligonucleotide as set forth in SEQ ID NO: 6.
In some embodiments, the combination of oligonucleotide sequences includes an oligonucleotide as set forth in SEQ ID NO: 7 and an oligonucleotide as set forth in SEQ ID NO: 8.
In some embodiments, the combination of oligonucleotide sequences includes an oligonucleotide as set forth in SEQ ID NO: 9 and an oligonucleotide as set forth in SEQ ID NO: 10.
In some embodiments, the combination of oligonucleotide sequences includes an oligonucleotide as set forth in SEQ ID NO: 11 and an oligonucleotide as set forth in SEQ ID NO: 12.
In some embodiments, the combination of oligonucleotide sequences includes an oligonucleotide as set forth in SEQ ID NO: 13 and an oligonucleotide as set forth in SEQ ID NO: 14.
In some embodiments, the combination of oligonucleotide sequences includes an oligonucleotide as set forth in SEQ ID NO: 15 and an oligonucleotide as set forth in SEQ ID NO: 16.
According to still another aspect, the present application provides a kit, including the oligonucleotide or the combination of oligonucleotide sequences.
Further, the present application provides use of the oligonucleotide, the combination of oligonucleotide sequences, or the kit in sequencing.
In some specific embodiments, the use in sequencing is high-throughput sequencing, polynucleotide sequencing, forensic science, disease detection, medical diagnosis, precision medicine, companion diagnostics, non-invasive prenatal testing, or early tumor screening.
Oligonucleotide: the oligonucleotide of the present application refers to a polynucleotide chain with a relatively small length, preferably a single strand. The length of the oligonucleotide is usually less than 200 residues (for example, between 15 and 100), and preferably the length of the basic sequence in the oligonucleotide is 58-64 residues. The oligonucleotide further includes a blocker. The backbone of the blocker contains a carbon-carbon bond, a carbon-oxygen bond, a carbon-nitrogen bond, a nitrogen-oxygen bond, or a combination thereof, and the quantity of the carbon-carbon bond, the carbon-oxygen bond, the carbon-nitrogen bond, the nitrogen-oxygen bond, or the combination thereof is 4-150.
Backbone: a natural or non-natural linear modification structure inserted between two natural nucleotides in an oligonucleotide sequence containing a blocker. For example, the backbone of an ethylene glycol phosphodiester bond structure contains 2 carbon-oxygen bonds, 1 carbon-carbon bond, and 2 phosphorus-oxygen bonds; the backbone of a propylene glycol phosphodiester bond structure contains 2 carbon-oxygen bonds, 2 carbon-carbon bonds, and 2 phosphorus-oxygen bonds; and the backbone of a hexaethylene glycol phosphodiester bond structure contains 12 carbon-oxygen bonds, 6 carbon-carbon bonds, and 2 phosphorus-oxygen bonds.
The ethylene glycol phosphodiester bond structure: a structural unit of a chain finally formed after solid-phase synthesis of a raw material of ethylene glycol phosphoramidite monomer, with a structural formula of:
The propylene glycol phosphodiester bond structure: a structural unit of a chain finally formed after solid-phase synthesis of a raw material of propylene glycol phosphoramidite monomer, with a structural formula of:
The hexaethylene glycol phosphodiester bond structure: a structural unit of a chain finally formed after solid-phase synthesis of a raw material of hexaethylene glycol phosphoramidite monomer, with a structural formula of:
Bicyclic nucleotide modified base: a nucleotide-like derivative, the oxygen atom at position 2′ and the carbon atom at position 4′ in the ribonucleotide are covalently linked through a chemical bond.
Locked nucleic acid modified base: one of bicyclic nucleotide modified bases, it is a base derivative with a rigid structure formed by dehydration at positions 2′-O and 4′-C of β-D-ribofuranose. For example, the locked nucleic acid modified base includes locked nucleic acid modified adenine (LNA-A, LA for short), locked nucleic acid modified cytosine (LNA-C, LC for short), locked nucleic acid modified guanine (LNA-G, LG for short), and locked nucleic acid modified thymine (LNA-T, LT for short).
Phosphorothioate modification: the phosphorothioate modification is carried out by substituting one oxygen atom in the phosphodiester bond between single nucleotides with sulfur. Binding modification is carried out after the base is added. The common double bond “O” in the phosphate between two bases can be substituted with the double bond “S”.
2′-methoxy modification: a nucleotide derivative in which the hydrogen atom connected to the carbon atom at position 2′ of pentose in the deoxynucleotide is substituted with the methoxy.
3′-Spacer C3 structure: 3′-Spacer C3 is used to introduce a 3′ spacer arm to prevent 3′-end exonuclease and 3′-end polymerase from functioning.
3′-ddC structure: 3′ddC Dideoxy-C is linked at the end to prevent the DNA polymerase from extending at the end.
The “sample” in the present application is used according to its broadest meaning. In one meaning, it includes a samples or cultures obtained from any source, preferably a biological source (including eukaryotes or prokaryotes). Biological samples may be obtained from animals (including humans) and include fluids, solids, and tissues. Biological samples include blood products, such as plasma and serum, from non-human primates, sheep, cattle, ruminants, rabbits, pigs, goats, horses, dogs, cats, poultry, etc. In addition, the samples used herein include biological samples from plants, for example, samples from any organism in the plantae (such as monocotyledon or dicotyledon). The samples may also be from fungi, algae, bacteria, etc. It is expected that the present application does not limit the source of the samples. The samples used herein are usually “samples of nucleic acid”, “nucleic acid samples”, “target nucleic acid samples”, or “target samples” containing nucleic acid from any source (such as DNA, RNA, cDNA, mRNA, tRNA, miRNA, and rRNA).
Capture efficiency: the number of bases in the target region included in the sequencing reads/the number of all bases in the sequencing reads.
“Library” or “target sequence” refers to specific target nucleic acid (deoxyribonucleic acid or ribonucleic acid) sequences used in research, isolation, amplification, or other processes, including single-stranded sequences, double-stranded sequences, or their complementary sequences, and also including one type of in vitro synthesis products converted from another type of nucleic acid molecules (such as DNA, RNA, and cDNA) and synthetic molecules containing nucleotide analogs.
The present application provides oligonucleotide containing a blocker. The use of the oligonucleotide or a combination thereof can not only have a good blocking effect on the adapter sequence when the target sequence in a single sample is captured, reducing non-specific capture and improving capture efficiency, but also particularly effectively block the adapter sequences at both ends of the target sequences in a plurality of samples, improving the capture efficiency of the target sequence. In addition, the universal blocking oligonucleotide provided in the present application includes at least two short nucleic acid sequences, which can significantly reduce the production cost and cycle compared with similar products on the market.
The Sequence Listing is submitted as an ASCII text file Sequence_Listing.txt, generated on Jun. 3, 2022, 22,663 bytes, which is herein incorporated by reference in its entirety.
SEQ ID NOS: 1-4 are exemplary basic oligonucleotide sequences.
SEQ ID NOS: 5-16 are an exemplary oligonucleotide sequences containing a blocker.
Preparation of reagents, raw materials, and apparatus:
A deprotection reagent was prepared by dissolving dichloroacetic acid in a dichloromethane solution (3%, v/v).
An activator was prepared by dissolving 5-ethylthio-1H-tetrazole in an acetonitrile solution (0.25 M).
A capping reagent A was prepared by dissolving N-methylimidazole in an acetonitrile solution (20%, v/v).
A capping reagent B was prepared by dissolving acetic anhydride in an acetonitrile solution (30%, v/v).
An oxidizing agent (0.06 M) was prepared by dissolving iodine in a pyridine solution/(a mixed solution of pyridine and acetonitrile (3/2, v/v) of 3-((dimethylamino-methylene)amino)-3H-1,2,4-dithiazole-3-thione (0.06 M)).
Phosphoramidite monomer (A/T/G/C/mA/mG/mC/mU) solutions (0.067 M) were separately prepared by separately dissolving phosphoramidite monomers (A/T/G/C/mA/mG/mC/mU) in acetonitrile solutions.
Locked nucleic acid phosphoramidite monomer (LNA-A/LNA-G/LNA-C/LNA-T) solutions were separately prepared by dissolving LNA-C (0.067 M) in a mixed solution of acetonitrile and dichloromethane (1/1, v/v) and separately dissolving LNA-A/LNA-G/LNA-T (0.067 M) in acetonitrile solutions.
Raw materials (0.067 M) for synthesizing a blocker were prepared by separately dissolving an ethylene glycol phosphoramidite monomer, a propylene glycol phosphoramidite monomer, and a hexaethylene glycol phosphoramidite monomer in acetonitrile solutions.
Solid-phase synthesis supports: 3′-Spacer C3-CPG support: 200 nmol/piece; and 3′-ddC-CPG support: 200 nmol/piece.
Apparatus: Dr. Oligo 96/192 synthesizer.
The LC modification positions (starting from the 5′-end) are 8, 11, 14, 17, 24, 32, 34, 36, 40, 42, 45, 47, 50, 52, 54, and 58.
Nis a hexaethylene glycol phosphodiester bond structure.
The LC modification positions (starting from the 5′-end) are 5, 13, 15, 17, 20, 25, 27, 28, 32, 38, 40, 46, 47, 50, 53, and 56.
N is a hexaethylene glycol phosphodiester bond structure.
Solid-phase synthesis steps:
The LC modification positions (starting from the 5-end) are 8, 11, 14, 17, 24, 39, 41, 43, 47, 49, 52, 54, 57, 59, 61, and 65.
N is an ethylene glycol phosphodiester bond structure.
The LC modification positions (starting from the 5-end) are 5, 13, 15, 17, 20, 25, 27, 28, 32, 45, 47, 53, 54, 57, 60, and 63.
Nis an ethylene glycol phosphodiester bond structure.
Referring to the method in Example 1, a blocking oligonucleotide combination 2 was obtained by solid-phase synthesis and purification. The two blocking oligonucleotides therein were mixed in equal volumes and centrifuged to mix well to form a universal blocking oligonucleotide mixture 2. The purities of the two synthetic oligonucleotides are shown in
The LC modification positions (starting from the 5-end) are 8, 11, 14, 17, 24, 38, 40, 42, 46, 48, 51, 53, 56, 58, 60, and 64.
N is a propylene glycol phosphodiester bond structure.
The LC modification positions (starting from the 5-end) are 5, 13, 15, 17, 20, 25, 27, 28, 32, 44, 46, 52, 53, 56, 59, and 62.
N is a propylene glycol phosphodiester bond structure.
Referring to the method in Example 1, a blocking oligonucleotide combination 3 was obtained by solid-phase synthesis and purification. The two blocking oligonucleotides therein were mixed in equal volumes and centrifuged to mix well to form a universal blocking oligonucleotide mixture 3. The purities of the two synthetic oligonucleotides are shown in
The LC modification positions (starting from the 5-end) are 5, 7, 13, 15, 18, 21, 25, 27, 29, 34, 45, 47, 54, 57, 60, and 63.
N is an ethylene glycol phosphodiester bond structure.
The LC modification positions (starting from the 5-end) are 5, 7, 13, 15, 18, 22, 24, 28, 31, 50, 52, 59, 61, and 67. The LT modification positions (starting from the 5-end) are 45 and 55.
N is an ethylene glycol phosphodiester bond structure.
Referring to the method in Example 1, a blocking oligonucleotide combination 4 was obtained by solid-phase synthesis and purification. The two blocking oligonucleotides therein were mixed in equal volumes and centrifuged to mix well to form a universal blocking oligonucleotide mixture 4. The purities of the two synthetic oligonucleotides are shown in
The LC modification positions (starting from the 5-end) are 5, 7, 13, 15, 18, 21, 25, 27, 29, 34, 45, 47, 54, 57, 60, and 63.
N is an ethylene glycol phosphodiester bond structure, and * is phosphorothioate modification.
The LC modification positions (starting from the 5-end) are 5, 7, 13, 15, 18, 22, 24, 28, 31, 50, 52, 59, 61, and 67. The LT modification positions (starting from the 5-end) are 45 and 55.
N is an ethylene glycol phosphodiester bond structure, and * is phosphorothioate modification.
Referring to the method in Example 1, a blocking oligonucleotide combination 5 was obtained by solid-phase synthesis and purification. The two blocking oligonucleotides therein were mixed in equal volumes and centrifuged to mix well to form a universal blocking oligonucleotide mixture 5. The purities of the two synthetic oligonucleotides are shown in
The LC modification positions (starting from the 5-end) are 5, 7, 13, 15, 18, 21, 25, 27, 29, 34, 44, 46, 53, 56, 59, and 62.
N is a propylene glycol phosphodiester bond structure.
The LC modification positions (starting from the 5-end) are 5, 7, 13, 15, 18, 22, 24, 28, 31, 49, 51, 58, 60, and 66. The LT modification positions (starting from the 5-end) are 44 and 54.
N is a propylene glycol phosphodiester bond structure.
Referring to the method in Example 1, a blocking oligonucleotide combination 6 was obtained by solid-phase synthesis and purification. The two blocking oligonucleotides therein were mixed in equal volumes and centrifuged to mix well to form a universal blocking oligonucleotide mixture 6. The purities of the two synthetic oligonucleotides are shown in
1. Hybridization reaction
1.1. Preparation of the following blocking mixture:
The corresponding “universal blocking oligonucleotide mixture” was not added to the blocking mixture in a negative control group.
1.2. 7 μL of the blocking mixture was transferred into a PCR tube to capture a library, where each 7 μL of the blocking mixture corresponds to a 500 ng library.
1.3. The mixture in the PCR tube was concentrated in a vacuum to no liquid.
1.4. The hybridization elution kit produced by IDT (Integrated DNA Technologies, Inc.) was taken out in advance and equilibrated at room temperature.
1.5. Preparation of hybridization mixture:
1.6. After shaking, mixing, and centrifugation, 17 μL of the hybridization mixture was taken into the concentrated PCR tube.
1.7. After a short centrifugation and mixing, incubation was carried out at room temperature for at least 5 min.
1.8. The incubated PCR tube was placed on a PCR machine to run with the following hybridization procedures:
2. Capture
2.1. 2×magnetic bead wash buffer, 10×wash buffer 1, 10×wash buffer 2, 10×wash buffer 3, 10×wash enhancement buffer, 2×hybridization buffer, and hybridization enhancer buffer were mixed well at room temperature and diluted to 1×working liquid for use (amount of each sample):
2.2. Preparation of magnetic bead resuspension mixture:
2.3. Magnetic beads were taken out, and shaking and mixing were carried out for 15s.
2.4. 50 μL of magnetic beads were taken into a 1.5 mL centrifuge tube, where each 50 μL of magnetic beads corresponds to a 500 ng library.
2.5. 100 μL of 1×magnetic bead wash buffer was added into the centrifuge tube to draw up and dispense 10 times for mixing.
2.6. The centrifuge tube was placed on a magnetic stand to stand for 1 min until the solution was clear, and then the supernatant was removed.
2.7. 100 μL of magnetic bead wash buffer was added into the centrifuge tube to draw up and dispense 10 times for mixing, the centrifuge tube was then placed on the magnetic stand to stand for 1 min until the solution was clear, and then the supernatant was removed.
2.8. The foregoing step was repeated for a total of 2 times of washing.
2.9. 17 μL of magnetic bead resuspension mixture was added into the centrifuge tube to draw up and dispense for mixing and centrifuged at 400 rpm for 10s, to ensure that there are no dried magnetic beads left on the tube wall.
2.10. After the hybridization reaction, 17 μL of the magnetic beads were transferred into the PCR tube to draw up and dispense for mixing, and in addition, the procedure of the PCR machine was set to 65° C. for constant (the thermal cover 70° C.).
2.11. The mixture was placed in the PCR machine to undergo the washing procedure at 65° C. for 45 min (need timing) and shaken quickly and slightly for mixing well every 10 min, and the wash buffer 1 and wash enhancement buffer were put into a water bath at 65° C. for at least 15 min.
2.12. After the 45 min timing was over, the sample was taken out from the PCR machine.
2.13. 100 μL of pre-heated wash buffer 1 was taken into the PCR tube to draw up and dispense 10 times for mixing to avoid bubbles. After the wash buffer 1 was used, the PCR tube was equilibrated at room temperature.
2.14. The sample was placed on the magnetic stand to stand for 1 min, and then the supernatant was removed.
2.15. 150 μL of pre-heated wash enhancement buffer was taken into the PCR tube to draw up and dispense 10 times for mixing to avoid bubbles.
2.16. The PCR tube was placed on the PCR machine to react for 5 min.
2.17. The sample was placed on the magnetic stand to stand for 1 min, and then the supernatant was removed.
2.18. 150 μL of pre-heated wash enhancement buffer was added to draw up and dispense
10 times for mixing and then react on the PCR machine for 5 min.
2.19. The sample was placed on the magnetic stand to stand for 1 min, and then the supernatant was removed.
3. Washing
3.1. 150 μL of wash buffer 1 at room temperature was added into a reaction tube to shake and mix thoroughly.
3.2. The incubation was carried out at room temperature for 2 min with shaking and mixing every 30s.
3.3. The sample after short centrifugation was placed on a magnetic stand to stand for 1 min until the solution was clear, and then the supernatant was removed.
3.4. 150 μL of wash buffer 2 at room temperature was added to shake and mix thoroughly.
3.5. The incubation was carried out at room temperature for 2 min with shaking and mixing every 30s.
3.6. The sample after short centrifugation was placed on a magnetic stand to stand for 1 min until the solution was clear, and then the supernatant was removed.
3.7. 150 μL of wash buffer 3 at room temperature was added to shake and mix thoroughly.
3.8. The incubation was carried out at room temperature for 2 min with shaking and mixing every 30s.
3.9. The sample after short centrifugation was placed on a magnetic stand to stand for 1 min until the solution was clear, and then the supernatant was removed.
3.10. The magnetic beads were dried (until there was no obvious residue of the wash buffer 3).
4. PCR after capture
4.1. Preparation of the following PCR mixture:
4.2. The PCR mixture was added into a PCR tube with dried magnetic beads to draw up and dispense for mixing.
4.3. Run with the following PCR procedures:
5. Magnetic bead purification
5.1. 1×(50 μL) Yeasen Beads were directly added into 50 μL of PCR reaction tube for magnetic bead purification, and finally the product was eluted with 21 μL of nuclease-free water.
One human genome NA12878 was used as a sample to prepare a library using the ABclonal Rapid DNA Lib Prep Kit, and then 500 ng of the library was used as a starting amount, the universal blocking oligonucleotide mixture obtained in Examples 1 to 6 was applied to obtain a hybridized capture library, respectively, with reference to the foregoing procedures of the hybridization reaction example. No blocking oligonucleotide was added in the negative control group during the hybridization capture. On-machine sequencing was performed by using the Nextseq next-generation sequencer, and the percentage of bases in the target region to total bases in the sequencing result was analyzed, to obtain the hybridization capture efficiency with the sample size of 1 as shown in Table 1. The result shows that, compared with the negative control group, the addition of the universal blocking oligonucleotide of the present application significantly improves the capture efficiency of the target sequence with the improvement rate of 144.8%-180.5%.
8 human genomes NA12878 were used as samples. The ABclonal Rapid DNA Lib Prep Kit was used to separately prepare libraries. The 8 libraries were mixed at 62.5 ng each to ensure that the total amount of the library was 500 ng. The universal blocking oligonucleotide mixture obtained in Examples 2 and 3 was used to obtain a hybridized capture library with reference to the foregoing procedures of the hybridization reaction example. No blocking oligonucleotide was added in the negative control group during the hybridization capture. On-machine sequencing was performed by using the Nextseq next-generation sequencer, and the percentage of bases in the target region to total bases in the sequencing result was analyzed, to obtain the hybridization capture efficiency with the sample size of 8 as shown in Table 2. The result shows that, compared with the negative control group, the addition of the universal blocking oligonucleotide of the present application significantly improves the capture efficiency of the target sequence with the improvement rate of 160.5%-160.9%. In addition, comparing Table 1 and Table 2, it is found that the capture efficiency of the universal primer of the present application in multi-sample target sequence capture is not significantly different from the capture efficiency of the universal primer of the present application in single-sample target sequence capture. It can be learned from this that the universal blocking oligonucleotide provided by the present application has a significant specific capture effect in multi-sample target sequence collective capture, which can greatly reduce the operation time and cost of the target sequence library construction.
Number | Date | Country | Kind |
---|---|---|---|
201910560841.0 | Jun 2019 | CN | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CN2020/098473 | 6/28/2020 | WO |