The contents of the electronic sequence listing (Untitled ST25.txt; Size: 18,000 bytes; and Date of Creation: Aug. 18, 2020) is herein incorporated by reference in its entirety.
This application claims the benefit of priority from Chinese Patent Application No. 201811406204.X, filed on Nov. 23, 2018. The content of the aforementioned application, including any intervening amendments thereto, is incorporated herein by reference.
This application relates to molecular labeling, particularly to the labeling of DNA library, and more particularly to an inline index based on Illumina sequencing, a DNA library labeled thereby and a method for constructing the same.
Currently, the construction method of a universal DNA library involving Illumina sequencing generally includes the steps of: breaking the DNA sequence of a biological sample; repairing the flat end; ligating adapters of IS7 and IS8 primer recognition sites; filling the gap between respective adapters and the sequence; introducing a barcode to the sequence during the indexing PCR to differentiate the samples; and sorting the sequencing data according to the barcode. However, in the above method, the barcode is introduced at the end of the entire experiment, so that if the samples are mutually contaminated before, it is impossible to determine which sample the data is derived from.
In the prior art, Chinese Patent Application No. 107502607A discloses a method involving the molecular barcode labeling, library construction and sequencing for mRNA from a large number of tissues and cells, which can be applied to the labeling during the reverse transcription of mRNA and the synthesis of cDNA. However, this method is not suitable for the construction of DNA library. Chinese Patent No. 104573407B discloses a method of searching a species-specific endogenous barcode and an application thereof in the pooled sequencing for multiple samples. In this method, the PCR products are labeled using gene splicing by overlap extension PCR (SOE PCR) according to the differences in the endogenous DNA from individual samples, which does not involve the DNA library construction and the barcode labeling. At present, the cross contamination is frequently observed among DNA samples, greatly affecting the subsequent assembly and analysis of data.
An object of this application is to provide an inline index based on Illumina sequencing, and a DNA library labeled thereby and a method for constructing the same to overcome the defects in the prior art. In the library construction method provided herein, an inline index adapter is introduced in the initial stage, and the diverse DNA samples can be accurately distinguished by labeling the constructed DNA library with the inline index. Compared to the conventional molecular index technique, this application can not only correct the contaminated data, but also greatly increase the number of index combinations, facilitating the differentiation of more samples and lowering the cost.
The technical solutions of this application are described as follows. In a first aspect, this application provides an inline index based on Illumina sequencing, wherein the inline index is a 6-bp DNA sequence, and the inline index has the following characteristics:
i) it has a minimum editing length of 3 bp or less;
ii) it comprises fragments of different bases;
iii) it comprises bases of different laser colors, wherein adjacent bases are different in laser color; and
iv) it is free of ‘AAA’, ‘ACA’, ‘CCC’, ‘CAC’, ‘GGG’, ‘GTG’, ‘TTT’ and ‘TGT’.
In an embodiment, the inline index comprises an IS1 inline index, an IS2 inline index and an IS3 inline index corresponding to the IS1 inline index and IS2 inline index, where the IS3 inline index is partially complementary to the IS1 inline index and the IS2 inline index.
In an embodiment, the IS1 inline index comprises IS1 sequence and IS3X′ sequence partially complementary to the IS1 sequence; and the IS1 sequence and the IS3X′ sequence are ligated by cooling from 95° C. to 12° C. at a rate of 0.1° C/s.
In an embodiment, the IS2 inline index comprises IS2 sequence and IS3Y′ sequence partially complementary to the IS2 sequence; and the IS2 sequence and the IS3Y′ sequence are ligated by cooling from 95° C. to 12° C. at a rate of 0.1° C/s.
In a second aspect, this application further provides a method of constructing a DNA library labeled by the above inline index, which specifically comprises:
(1) breaking a DNA sequence of a biological sample to obtain a DNA fragment;
(2) repairing a blunt endof the DNA fragment;
(3) ligating an adapter of the IS1 inline index to a 5′ end of the DNA fragment; and ligating an adapter of the IS2 inline index to a 3′ end of the DNA fragment; wherein the adapter of the IS1 inline index is inside a binding site of a primer IS7, and the adapter of the IS2 inline index is inside a binding site of a primer IS8;
(4) repairing a gap of the DNA fragment and extending the repaired DNA fragment; and
(5) subjecting the extended DNA fragment to PCR amplification in the use of the primers IS7 and IS8 to produce the DNA library.
This application has the following beneficial effects.
In the conventional methods for constructing a DNA library, the cleaning process is required to be repeated, which will easily cause cross contamination among the samples or contamination derived from exogenous DNA. In addition, due to the high sensitivity of the biological probe, a low concentration of exogenous DNA may also be captured and sequenced. This application introduces an inline index to label the DNA in advance, which greatly reduce the occurrence that the contamination data is recognized, significantly improving the problem that cross contamination frequently occurs among the samples during the construction of DNA library and the enrichment of genes. Moreover, after labeled with the inline index provided herein, the DNA samples may be mixed for the gene enrichment and sequencing, greatly reducing the complexity in the operation of gene enrichment.
This application will be described in detail below with reference to the drawings and embodiments, but these embodiments are not intended to limit the application.
Inline indexes were respectively synthesized according to the DNA sequences listed in Table 1, and the process was specifically described as follows.
An Oligo Hybridization Buffer consisting of 1 mL of 5 M NaCl, 100 μL of 1 M Tris-HCl (pH 8.0), 20 μL of 0.5 M EDTA (pH 8.0) and 8.88 mL of H2O was prepared in advance.
10 μL of 500 μM IS1_adapter_P5.F, 10 μL of 500 μM IS3 adapter P5+P7.R, 10 μL of 10x Oligo Hybridization Buffer and 70 μL of H20 were mixed for the preparation of a double-strand adapter.
10 μL of 500 μM IS2 adapter P7.F, 10 μL of 500 μM IS3 adapter P5+P7.R, 10 μL of 10× Oligo Hybridization Buffer and 70 μL of H20 were mixed for the preparation of a double-strand adapter.
The above two reaction mixtures were respectively reacted at 95° C. in a PCR instrument for 10 s and then cooled to 12° C. at a rate of 0.1° C/s. Then the two reaction mixtures were respectively dispensed at 20 μL, per tube, and the tubes were numbered and stored at −20° C. for use. The resulting double-strand adapters were shown in
The DNA library was constructed as follows.
A DNA sequence of a biological sample was fractured into multiple DNA fragments with a length of about 250-500 bp using a Covaris M220 Ultrasonic Processor (Covaris, Inc. Massachusetts USA), where the fracturing process was programmed as follows: running for 90 s (50 peak power, 25 duty factor, 200 cycles/burst) followed by a pause for 60 s; and running for 90 s (50 peak power, 25 duty factor, 200 cycles/burst). 35 μL of MagNA beads were added to a 270 μL centrifuge tube, and then the centrifuge tube was allowed to stand on a magnetic plate for 1 min. The supernatant was removed, and the MagNA beads were dried, added with 60 μL of the DNA sample and 54 μL of MagNA beads Buffer and mixed uniformly. Positive and negative controls were prepared at the same time. The centrifuge tube was placed at room temperature for 10 min and then transferred to the magnetic plate. The supernatant was removed, and the beads were added with 186 μL 70% ethanol and placed for 1 min. Then the ethanol was removed, and the process of adding and removing ethanol was repeated once. The centrifuge tube was placed for 5 min with the cover opened to allow the residual ethanol to volatize.
20 μL of a first mixture, consisting of 2.2 μL of 10× Buffer Tango, 0.22 μL of 1 × dNTPs (10 mM each), 0.22 μL of 100 mM ATP, 1.1 μL of T4 polynucleotide kinase (10 U/μL), 0.44 μL of T4 DNA polymerase (5 U/μL) and 17.82 μL of H2O, was added to the centrifuge containing the dried MagNA beads under an ice bath. The reaction mixture was mixed uniformly, and then the centrifuge tube was transferred to a PCR instrument to repair a sticky end of the DNA sample, where the reparation was programmed as follows: 25° C. for 15 min and 12° C. for 5 min. Then the centrifuge tube was taken out and added with 18 μL of MagNA beads Buffer. The reaction mixture was mixed uniformly and allowed to stand on the magnetic plate for 5 min. The supernatant was removed, and the beads were added with 186 μL of 70% ethanol and allowed to stand at room temperature for 1 min. Then the ethanol was removed, and the process of adding and removing ethanol was repeated once. The centrifuge tube was allowed to stand for 5 min with the cover opened to allow the residual ethanol to volatize. 38 μL of a second mixture, consisting of 4.4 μL of 10×T4DNA ligase buffer, 4.4 μL of 50% PEG-4000, 1.1 μL of T4 DNA ligase (5U/μL) and 31.9 μL of H2O, was prepared and added to the centrifuge tube containing the dried MagNA beads under an ice bath. Inline indexes of the sample, positive control and negative control were respectively added. Then the centrifuge tube was placed in the PCR instrument and the operation was performed at 22° C. for 30 min. After that, the centrifuge tube was taken out and added with 36 μL of MagNA beads Buffer. The reaction mixture was fully mixed and allowed to stand on the magnetic board for 5 min. The supernatant was removed, and the beads were added with 186 μL of 70% ethanol and allowed to stand at room temperature for 1 min. Then the ethanol was removed, and the process of adding and removing ethanol was repeated once. The centrifuge tube was allowed to stand for 5 min with the cover opened to allow the residual ethanol to volatize.
Bsm polymerase was introduced to extend the sequence to fill the gap between the adapter and the sequence after the addition of the inline index. 40 μL of the Bsm polymerase mixture, consisting of 4.4 μL of 10× Bsm buffer, 1.1 μL of dNTPs (10 mM each), Bsm polymerase, 1.65 μL of large fragment (8U/μL) and 36.85 μL of H2O, was prepared and added to the centrifuge tube containing the dried MAGNA beads under an ice bath. Then the centrifuge tube was transferred to the PCR instrument and the operation was performed at 37° C. for 20 min. After that, the centrifuge tube was taken out and added with 36 μL of MagNA beads Buffer. The reaction mixture was fully mixed and allowed to stand on the magnetic board for 5 min. The supernatant was removed, and the beads were added with 186 μL of 70% ethanol and allowed to stand at room temperature for 1 min. Then the ethanol was removed, and the process of adding and removing ethanol was repeated once. The centrifuge tube was allowed to stand for 5 min with the cover opened to allow the residual ethanol to volatize. The beads were added with 35 μL of TE Buffer, and the mixture was transferred to another centrifuge tube (named as lib) and stored at −20° C. for use.
Referring to
Number | Date | Country | Kind |
---|---|---|---|
201811406204.X | Nov 2018 | CN | national |