The present invention belongs to the field of molecular biology, specifically relates to a method and a kit for non-specific amplification of natural short-fragment nucleic acid.
Natural short-fragment nucleic acids are extracellular free DNAs (cell-free DNAs, cDNAs) existing in animal, plant, and human body fluid, typically is less than 500 bp in length. During apoptosis, DNAs in human cells are cleaved and secreted extracellularly to form cfDNAs. Currently, the use of cfDNAs as a marker has been demonstrated in studies of tumor and prenatal diagnostic. However, in one aspect, the amount of cfDNAs in body fluid is very low, for example, in plasma, even lower than 10 ng/mL, and loss of cfDNAs tends to occur during extraction, resulting in a difficult process of extraction. cfDNAs, in another aspect, should be used in a huge amount in fluid biopsy and non-invasive prenatal screening. These make it difficult for the directly extracted natural short-fragment nucleic acids to fulfill the requirements of the test, standard, quality control and reference substances in terms of amount and quality.
Currently, in some methods, long-fragment DNAs are ultrasonicated to obtain DNA fragments similar to natural short-fragment nucleic acids in size. However, the DNA fragment thus obtained is inherently different from the natural short-fragment nucleic acid, e.g. in size distribution. In addition, in some methods the short-fragment DNA is obtained through enzymatic digestion. However, the length distribution of enzymatic digestion products is also significantly different from that of the natural short-fragment nucleic acid, and limited by enzymatic digestion site. Generally, it is difficult for enzymatic digestion product to have the size of the natural short-fragment nucleic acid (for example, the average length of plasma cfDNA is only about 170 bp), and the ends thereof are less variable than the natural short-fragment nucleic acid. Therefore, neither ultrasonicated nor enzymatically digested DNA fragment is able to replace the natural short-fragment nucleic acid as the test, standard, quality control and reference substances.
Therefore, a method capable of obtaining large amounts of natural short-fragment nucleic acids is in need to address the issue of research and testing for such DNAs.
In response to the above-mentioned demand, the present invention provides a method for non-specific amplification of natural short-fragment nucleic acids. The method efficiently amplifies the natural short-fragment nucleic acid as starting material using a specifically designed linker and primer, to obtain large quantities of DNA fragments substantially identical to the natural short-fragment nucleic acid in both DNA sequence and fragment length distribution. The present invention also provides a kit for non-specific amplification of the natural short-fragment nucleic acids.
The basic principle of the present invention is: a double-stranded linker containing only three types of bases in each strand is attached to both ends of the natural short-fragment nucleic acids, then the linker-added DNA fragments are amplified with a deoxyuridine-labeled primer by using PCR technique, and then the PCR amplification product is enzymatically digested using an enzyme having a deoxyuridine cleavage function and an enzyme with both 5′→3′ polymerase activity and 3′→5′ exonuclease activity successively to completely remove the introduced linker sequence, thereby obtaining large quantities of DNA fragments substantially identical to the natural short-fragment nucleic acids in terms of DNA sequence and fragment length distribution.
Accordingly, in a first aspect, the present invention provides a method for non-specific amplification of a natural short-fragment nucleic acid comprising the following steps:
(1) performing end repair on the natural short-fragment nucleic acid to obtain an end-repaired nucleic acid;
(2) connecting the end-repaired nucleic acid to a double-stranded linker to obtain a ligation product, in which each strand of the double-stranded linker contains only three types of bases;
(3) performing PCR amplification on the ligation product using a PCR primer labeled with deoxyuridine to obtain a PCR product, in which the PCR primer is completely or partially complementary to a strand of the double-stranded linker and contains only three types of bases; and
(4) digesting the PCR product by using an enzyme having a deoxyuridine cleavage function, followed by digesting the PCR product by using an enzyme with both 5′→3′ polymerase activity and 3′→5′ exonuclease activity in the presence of a deoxynucleotide solution to obtain a non-specific amplification product of the natural short-fragment nucleic acid; the deoxynucleotide solution only contains the complementary base of base lacking in the PCR primer.
In one embodiment, the natural short-fragment nucleic acid is a double-stranded DNA of less than 500 bp.
In one embodiment, the origins of the natural short-fragment nucleic acid include, but not limited to, blood, serum, plasma, joint fluid, semen, urine, sweat, saliva, stool, cerebrospinal fluid, ascites, pleural effusion, bile, pancreatic fluid, and the like. Preferably, the natural short-fragment nucleic acid is derived from plasma, blood, or urine.
In one embodiment, the end repair employs one or more enzymes selected from the group consisting of: T4 DNA polymerase, klenow enzyme, T4 polynucleotide kinase and Klenow Fragment enzyme. Other enzymes known in the art to be useful for end-repair are also suitable for use in the invention.
In one embodiment, said method further comprises the step of addition of poly-adenine at 3′ end of the end-repaired nucleic acid prior to being connected to the linker. In this embodiment, the steps of end repair and addition of poly-adenine at 3′ end could be performed in one reaction system, i.e., after the end repair, the poly-adenine should be added at 3′ end after purification. Alternatively, and preferably, the steps of end repair and addition of poly-adenine at 3′ end are performed in one reaction system, i.e., end-repair and addition of poly-adenine at 3′ end are completed simultaneously followed by the purification of the nucleic acid. Alternatively, and more preferably, end repair, addition of poly-adenine at 3′ end, and linker ligation are performed in one reaction system, and the ligation reaction is followed by purification of the nucleic acid for PCR amplification. In this embodiment, the step of addition of poly-adenine at 3′ end may employ klenow ex-enzyme, Taq enzyme, or a combination of klenow ex-enzyme and Taq enzyme. Other enzymes known in the art to be useful for adding poly-adenine at 3′ end are also suitable for use in the present invention.
In one embodiment, each strand forming the double-stranded linker comprises only any three types of bases, e.g., only A/C/T, A/C/G, C/T/G, or A/T/G. Each strand may contain combinations of bases different from each other, for example one strand may contain A/C/T and the other one A/C/G. Preferably, two strands of the double-stranded linker are completely or partially reverse complementary. In one embodiment, 5′ end of one of the strands forming the double-stranded linker carries phosphate groups. More preferably, in both strands forming the double-stranded linker, 3′ end of one strand is deoxyuridine U and 5′ end of the other strand carries phosphate groups. In that case, the presence of deoxyuridine in the double-stranded linker facilitates cleavage of the linker linked to the natural short-fragment nucleic acid as template, thereby increasing the yield of non-specific amplification; in other words, the final product obtained in that case still contains a copy of the natural short-fragment nucleic acid as template.
For instance, the linker is formed by annealing of the single-stranded DNAs shown in the following SEQ ID NO: 1 and SEQ ID NO: 2:
In one embodiment, the linker ligation is performed by T4 DNA ligase and/or T7 DNA ligase.
In one embodiment, a PCR amplification primer that is completely or partially complementary to one strand of the double-stranded linker has a deoxyuridine label and comprises only three types of bases. The deoxyuridine label contained in the primer allows a single base notch formed between the primer and the target nucleic acid upon subsequent enzymatic digestion with an enzyme having a deoxyuridine cleavage function, thereby completely removing the primer sequence. For example, the primers have the following sequences:
In one embodiment, the PCR product is digested with an enzyme having a deoxyuridine cleavage function followed by digestion with an enzyme having both 5′→3′ polymerase activity and 3′→5′ exonuclease activity in the presence of deoxynucleotide solution to remove primer sequence and linker in PCR product; wherein the enzymes having deoxyuridine cleavage function include, but not limited to: USER™ enzymes, and a mixture comprising Endonuclease VIII and UDG; and the enzymes having both 5′→3′ polymerase activity and 3′→5′ exonuclease activity include, but not limited to, T4 DNA polymerase.
In one embodiment, the deoxynucleotide solution contains merely complementary base of the base lacking in the PCR primer. For example, when the double PCR primers contain only A/C/T (i.e., it lacks G), the added deoxynucleotide solution only contains base C complementary to G.
For instance, T4 DNA polymerase possesses both 5′→3′ polymerase activity and 3′→5′ exonuclease activity, and substantially exerts 5′→3′ polymerase activity in the presence of adequate dNTPs contained in reaction substrate, while mainly 3′→5′ exonuclease activity in case of insufficient dNTPs in reaction substrate. Since the PCR primer contains only three types of bases, no base complementary to the primer is present in the system at the beginning of the reaction, allowing T4 DNA polymerase to perform 3′→5′ exonuclease function. When the linker/primer is completely cleaved on both ends and lacking base of the above primer appears in sequence, the base contained in the deoxynucleotide solution added in the system is capable of being complementary to said lacking base, resulting in T4 DNA polymerase exerts 5′→3′ polymerase activity, thereby achieving an equilibrium of the reaction, stopping an enzymatic digestion reaction, and obtaining a product with substantially identical to the original cfDNA fragments in size.
In the second aspect, the invention provides a kit for non-specific amplification of a natural short-fragment nucleic acid comprising:
(1) a reagent for performing end repair;
(2) a reagent for connecting an linker comprising a double-stranded linker, wherein each strand of the double-stranded linker comprises only three types of bases;
(3) a reagent for PCR amplification comprising a PCR primer with a deoxyuridine label, wherein the PCR primer is completely or partially complementary to one strand of the double-stranded linker, and comprises only three types of bases;
(4) a reagent for enzymatic digestion, comprising an enzyme with deoxyuridine cleavage function, an enzyme with both 5′→3′ polymerase activity and 3′→5′ exonuclease activity, and a deoxynucleotide solution containing only the complementary base of the base lacking in the PCR primer.
In one embodiment, the agent for performing end repair comprises one or more enzymes selected from the group consisting of: T4 DNA polymerase, T4 polynucleotide kinase, Klenow Fragment enzyme and Klenow enzyme.
In one embodiment, the kit further comprises a reagent for adding poly-adensine to 3′ end. Specifically, the reagent for adding poly-adensine to 3′ end comprises klenow ex-enzyme, Taq enzyme, or a combination of klenow ex-enzyme and Taq enzyme.
In one embodiment, the reagent for connecting a linker further comprises T4 DNA ligase and/or T7 DNA ligase.
In one embodiment, each strand of the double-stranded linker comprises only any three types of bases, e.g., only comprises A/C/T, A/C/G, C/T/G, or A/T/G. Each strand may contain combinations of bases different from each other, for example one strand may contain A/C/T and the other one A/C/G. In one embodiment, preferably, two strands of the double-stranded linker are completely or partially reverse complementary. In one embodiment, 5′ end of one of the strands forming the double-stranded linker carries phosphate groups. More preferably, in the two single-stranded DNA forming linker, the 3′ end of one strand is deoxyuridine U and the 5′ end of the other strand carries phosphate groups.
In one embodiment, the kit of the invention may further comprise reagents for purification.
In one embodiment, the enzymes having deoxyuridine cleavage function include, but not limited to: USER™ enzyme, and a mixture comprising Endonuclease VIII and UDG. In another embodiment, the enzymes with both 5′→3′ polymerase activity and 3′→5′ exonuclease activity include, but not limited to, T4 DNA polymerase.
In one embodiment, the reagent for enzymatic digestion comprises deoxynucleotide solution containing merely the complementary base of the base lacking in the PCR primer. For example, when the PCR primer contains only A/C/T (i.e., it lacks G), the added deoxynucleotides solution contains only base C complementary to G.
In the kit of the invention, each reagent or device is preferably packaged separately, or may be packaged in combination without affecting the practice of the invention.
In a third aspect, the invention also relates to a non-specifically amplified natural short-fragment nucleic acid obtained using the method or kit of the invention, their use as the test, standard, control or reference substances, and composition comprising the same.
The excellent technical effects of the present invention are as follows: a non-destructive amplification of natural short-fragment nucleic acid was achieved by the non-specific amplification method, which effectively reduced the preference of enzyme digestion and damage to DNAs. DNA fragments mimicking the natural short-fragment nucleic acid can be prepared in large quantities, which are basically identical to natural short-fragment nucleic acid in terms of sequence and length distribution. In addition, the method of the present invention enables an efficient enrichment of fetal-derived DNAs in cfDNAs with a non-destructive amplification, allowing for increasing content of fetal DNAs in a non-invasive pre-natal testing, thereby improving detection sensitivity and accuracy.
The invention will now be described in more detail by way of examples with reference to the accompanying drawings. It should be understood by those skilled in the art that the drawings and examples thereof are for illustrative purposes only and not to be construed as limiting the invention in any way. The embodiments and features thereof in the present application can be combined with each other without contradiction.
1. Preparation of Plasma Free DNAs
Free DNAs were extracted from peripheral blood of pregnant women by using a plasma free DNA extraction kit (Hangzhou Berrygenomics Diagnostic Technology Co. Ltd. Cat No. R0011) to obtain 4 ng of plasma DNAs.
2. End Repair and Addition of ‘A’s
The NEBNext end repair/dAs tail-adding Module kit (NEB, E7442S) was used to prepare the reaction system shown in Table 1 for simultaneous end repair and treatment of adding poly-adenine at 3′ end. The reaction procedure was as follows: 37° C., 20 min; 72° C., 20 min; stored at 4° C. After completion of the reaction, the next step was conducted without purification.
3. Ligation of Linker
(1) Preparation of Linker
The sequence as shown in Table 2 was synthesized, with SEQ ID NO: 1 having deoxyuridine label at 3′ end and SEQ ID NO: 2 having phosphate group label at 5′ end:
The synthesized sequence dry powder was diluted with sterilized water to a concentration of 100 μM, to 25 μL each was added 20 μL of 5× annealing buffer and 30 μL of sterilized water, mixed evenly, and then annealed.
The annealing procedure was as follows: reaction at 95° C. for 2 min, cooling at 0.1° C./s to 90° C., reaction for 2 min, cooling at 0.1° C./s to 85° C., reaction for 2 min, and so on, until the temperature dropped to 25° C., reaction for 2 min, and stored at 4° C. to obtain 25 μM linker, and diluted to 2.5 μM with linker diluent for further use.
(2) Ligation of the Linker
The linker ligation reaction system shown in Table 3 was prepared using T4 DNA ligase (NEB, M0202L) and the reaction was carried out according to the following procedure: 20° C., 15 min; 65° C., 10 min; stored at 4° C. After completion of the reaction, 40 μL of L AMPure XP Beads was added for recovery and then eluted with 26 μL of EB to give the ligation product.
4. PCR Amplification
(1) Preparation of Primers
The primer sequences shown in Table 4 were synthesized, with both SEQ ID NO: 3 and SEQ ID NO: 4 having deoxyuridine label at 3′ end:
The synthesized primers dry powder was diluted with sterilized water to the concentration of 10 μM for use.
(2) PCR Amplification
The amplification reaction system shown in Table 5 was prepared using KAPA2G Robust HotStart PCR Kit (KAPA, KK5517) and the reaction procedure was as follows: 98° C., 30 s; 98° C. 10 s, 62° C. 30 s, 72° C. 30 s, 20 cycles; 72° C. 5 minutes; stored at 4° C.
After the reaction was complete, 60 μL AMPure XP Beads was added for purification followed by elution with 50 μL of EB to yield PCR amplification product.
5. Enzymatic Digestion Reaction
The reaction system set forth in Table 6 was prepared and the amplification product was digested into notched DNA fragment using USER™ enzyme (NEB, M5508).
The reaction procedure was as follows: 37° C., 30 min; 65° C., 10 min; stored at 4° C. After completion of the reaction, the next step was carried out without purification.
The reaction set forth in Table 7 was then prepared and the notched DNA fragment was treated with T4 DNA polymerase (NEB, M0203L) to completely cleave the linker sequence.
The above system was incubated for 5 minutes at 70° C. and stopped, and T4 DNA polymerase was added and incubated for an additional 5 minutes at 37° C. for digestion. The enzyme was inactivated by vigorous vibration at end of the reaction. Further purification by addition of 90 μL AMPure XP Beads followed by elution with 50 μL EB yielded a non-specific amplification product with a yield of 80 ng, a 20-fold increase compared to the concentration of original plasma DNA.
With fetal chromosomal aneuploidy (T13/T18/T21) detection kit (Hangzhou Berrygenomics Diagnostic Technology Co. Ltd. Cat No. R0000), library construction and high-throughput sequencing were performed using directly extracted original plasma DNAs (i.e. natural cfDNAs), commercially available free DNAs standard (Accuragen, cat No. AG-STD-S-KA-8) and the non-specific amplification product obtained according to method of the present invention, respectively, and the data generated by double-end sequencing was compared and analyzed. The obtained fragment distribution statistics are shown in
As can be seen from
The steps of Example 2 were as those of Example 1, except for the original DNA sample. Specifically, tumor plasma free DNAs were extracted with agMAX Cell-Free DNA Isolation Kit (Thermo Fisher, Cat No. A29319) to obtain free DNAs in a total amount of 6 ng of DNAs. The yield was 132 ng after the non-specific amplification, a 22-fold increase compared to the concentration of original plasma DNA.
4 ng of natural plasma free DNAs and the non-specifically amplified product prepared according to method of Example 1 were taken, respectively, and according to the manufacturer's instructions, a sequencing library was constructed using the fetal chromosomal aneuploidy (T13/T18/T21) detection kit (Beijing Berrygenomics, Cat No. R1000). Sequencing was then performed using Nextseq CN500 sequencer with the sequencing type of 36SE and 5M data obtained for each sample. The quality of the library sequencing data is shown in Table 8, wherein Total Rds is the total number of sequences detected, Uniq % is the percentage of DNA sequences uniquely aligned to human genome (hg19) in Total Rds, Redundancy % is the percentage of redundant reads in Total Rds, and Unimap_GC % is the GC percentage of DNA sequences uniquely aligned to human genome (hg19)
As can be seen from Table 8, Total Rds, Uniq %, Redundancy %, and UniMap_GC % of the non-specific amplification product prepared according to Example 1 of the present invention all have achieved the required amount of data and close to natural plasma free DNAs.
The sequencing data was analyzed for chromosomal aneuploidy using analysis software and the result was shown in Table 9, wherein Chr13 Z value is the Z value of chromosome 13 of the sample, i.e., the deviation between the percentage of bases detected on chromosome 13 in all the detected bases of the sample and the percentage of bases on chromosome 13 of normal samples in all the bases in parameter database. Chr18 Z value and the Chr21 Z value are the Z values of chromosome 18 and chromosome 21, respectively. Samples with Z values between [−3.00, 3.00] are normal ones. The Cff % content is the fetal DNAs content in free DNAs.
As can be seen from Table 9, Chr13 Z value, Chr18 Z value and Chr21 Z value of the non-specific amplification product prepared according to Example 1 of the present invention and the natural plasma free DNA samples are all between [−3.00, 3.00], the detection results are consistent, and the sample is negative. Furthermore, the content of fetal DNAs in the non-specific amplification product obtained by the method of the present invention is up to 8.17%, an increase of 28.66% compared to original sample (6.35%).
Thus, the non-specific amplification product prepared according to Example 1 of the present invention was substantially identical in sequence to natural plasma free DNA samples. While nondestructively amplifying natural plasma free DNAs, the method of the present invention also significantly enriched the DNAs derived from fetus, which is advantageous for improving detection sensitivity and accuracy in prenatal diagnosis.
5 ng of natural tumor plasma free DNAs and 5 ng of non-specifically amplified product prepared according to method of Example 2 were taken, respectively, and according to the manufacturer's instructions, a sequencing library was constructed using cSMART tumor gene mutation detection kit 1 (10 genes) (Beijing Berrygenomics. Cat No. R0024). Sequencing was then performed using Nextseq CN 500 sequencer with the sequencing type of 150PE and 10M data obtained for each sample. The quality of the library sequencing data is shown in Table 10, wherein Clean Reads is the percentage of reads after removing redundant sequences, CleanQ30 is the Q30 of Clean Reads sequence, and MeanDepth is average sequencing depth.
As can be seen from Table 10, Clean Reads, CleanQ30 and MeanDepth of the non-specific amplification product prepared according to Example 2 of the present invention all have achieved the required amount of data and are close to that of natural tumor plasma free DNAs.
SNP and InDel analysis was performed on sequencing data using analysis software, and it is found that the non-specific amplification product prepared by method of the present invention is consistent with natural tumor plasma free DNAs in the proportion of 0.3% gene mutation, and no more than 0.3% of SNP and InDel was detected, indicating that the sequences of the non-specific amplification product prepared according to Example 2 of the present invention is substantially consistent with that of the natural tumor plasma free DNA sample.
PCR amplification product was obtained according to steps 1-4 of the method described in Example 1. 200 ng PCR amplification product was digested with different digestion systems as shown in Table 11 below. After completion of digestion reaction, the product was purified with 90 μL AMPure XP Beads and eluted with 50 μL of EB, and the digestion product contained in eluent was quantified.
The above results show that the specific digestion reaction used in the method of the present invention could obtain a significantly higher yield with an excellent technical effect, as compared to that where only one enzyme is used for digestion and two enzymes are used for digestion in a reverse order.
It should be noted that the above illustrated is the preferred embodiments of the invention rather than limiting the invention. Various modifications and changes can be made to the present invention for those skilled in the Art. It will be understood by the skilled in the art that any change, equivalent replacement, and improvement made shall be included in the protection scope of the present invention without departing from the concepts and principles of the invention.
Number | Date | Country | Kind |
---|---|---|---|
201910712493.4 | Aug 2019 | CN | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CN2020/106373 | 7/31/2020 | WO |