Method for Rapidly Constructing Amplicon Library Through One-Step Process

Information

  • Patent Application
  • 20190352711
  • Publication Number
    20190352711
  • Date Filed
    March 28, 2018
    6 years ago
  • Date Published
    November 21, 2019
    4 years ago
Abstract
The present invention discloses a method for rapidly constructing amplicon library including the following steps: 1. Synthesizing a primer combination for constructing an amplicon library of a DNA sample, the primer combination of the amplicon library used to construct the DNA sample includes: an upstream fusion primer designed according to the target amplicon, a downstream fusion primer designed according to the target amplicon, an upstream universal primer and a downstream universal primer; 2. Constructing a PCR reaction system for the DNA sample; 3. Performing PCR. The method according to the present invention can be used to construct an amplicon library in a simple and rapid manner, and since a barcode is introduced before the start of PCR, the possibility of cross-contamination between the sample and the library is greatly reduced.
Description

The present invention claims priority of a Chinese patent application filed with the China Patent Office by the Genetron Health(Beijing) Co., Ltd. on Apr. 5, 2017, with the application number of 201710218529.4, and entitled “method for rapidly constructing amplicon library through one-step process”. The entire content of this application are incorporated herein by reference.


TECHNICAL FIELD

The invention relates to a field of biotechnology, and in particular to a method for rapidly constructing amplicon library through one-step process.


BACKGROUND OF RELATED ART

Next-generation sequencing (NGS) has been widely used in disease research, diagnosis and treatment in recent years due to its high throughput, high sensitivity, and high automation. Compared with traditional detection method, NGS technology can achieve multi-gene parallel detection and save samples. Besides, it has higher sensitivity which can restore the panoramic view of tumor variation in a more realistic way. However, the traditional method for constructing an amplicon library in the Life NGS platform is cumbersome, requires PCR amplification, digestion, addition, and purification, and takes about 5 hours. Further, because of the need to open the lid in a multi-step operation, the library is easily contaminated and the library loss rate is high. In addition, in the traditional method of constructing the amplicon library, the cost of establishing a library for a single sample is relatively high, which is about 200-1000 RMB per case.


The information disclosed in background part is only intended to enhance an understanding of the general background of the invention, and should not be taken as an acknowledgement or any form of suggestion that this information forms the prior art already known to a person skilled in the art.


SUMMARY OF THE INVENTION

The object of the present invention is to provide a method for rapidly constructing an amplicon library through one-step process. The method can construct the amplicon library by one-step PCR in a simple and rapid manner, and since the barcode is introduced before the start of PCR, the possibility of cross-contamination between samples and libraries is greatly reduced, and the requirements of the experimental site partition can be simplified. The method also controls the cost of establishing a single sample library at 30 RMB per case.


To achieve the above object, the present invention provides a method for constructing an amplicon library of a DNA sample, comprising the following steps.


Step 1: synthesizing a primer combination for constructing an amplicon library of a DNA sample, the primer combination of the amplicon library that is used to construct the DNA sample includes:


a fusion primer that is designed according to a target amplicon. The upstream fusion primer comprises a first linker sequence (Bridge sequence) arranged in the order of 5′ to 3′ and a specific upstream primer sequence designed according to the target amplicon;


a downstream fusion primer that is designed according to the target amplicon. The downstream fusion primer comprises a second linker sequence (trP1 sequence) arranged in the order of 5′ to 3′ and a specific downstream primer sequence designed according to the target amplicon;


an upstream universal primer which comprises a third linker sequence (A sequence), a barcode sequence and a first linker sequence arranged in the order of 5′ to 3′; and a downstream universal primer which comprises a universal sequence (Uni sequence) and a second linker sequence arranged in the order of 5′ to 3′;


Step 2: constructing a PCR reaction system for the DNA sample, and mixing the upstream fusion primers designed according to the target amplicon, the downstream fusion primers designed according to the target amplicon, the upstream universal primers and the downstream universal primers together, to serve as a primer combination in the PCR reaction system;


Step 3: performing PCR.


In an embodiment of the present invention, the first linker sequence comprises a sequence of SEQ ID: 1, and the nucleotide sequence of the sequence of SEQ ID: 1 is GGCATACGTCCTCGTCTA.


In an embodiment of the present invention, the second linker sequence comprises a sequence of SEQ ID: 2, and the nucleotide sequence of the sequence of SEQ ID: 2 is TCTATGGGCAGTCGGTGAT.


In an embodiment of the present invention, the third linker sequence comprises a sequence of SEQ ID: 3, and a nucleotide sequence of the sequence of SEQ ID:3 is CCATCTCATCCCTGCGTGTCTCCGACTCAG.


In an embodiment of the present invention, the universal sequence comprises a sequence of SEQ ID: 4, and a nucleotide sequence of the sequence of SEQ ID: 4 is CCACTACGCCTCCGCTTTCCTC.


In an embodiment of the present invention, in the primer combination for constructing an amplicon library of the same DNA sample, the barcode sequence in the upstream universal primer is the same. In the primer combinations for constructing amplicon libraries of the different DNA samples, the barcode sequences in the upstream universal primers are different. The barcode sequence corresponds to the sample. The barcode sequence is different between different samples. As long as different samples can be distinguished, the barcode sequence is not specific and its sequence can be changed.


In an embodiment of the present invention, the concentration of the upstream fusion primer designed according to any one of the target amplicon, the concentration of downstream fusion primer designed according to any one of the target amplicon, the concentration of upstream universal primer, and the concentration of downstream universal primer are all 100 μM.


In an embodiment of the present invention, when the number of target amplicons in the same PCR reaction is greater than 1, the upstream fusion primer designed according to a target amplicon is a combination of upstream fusion primers designed according to each target amplicon, the downstream fusion primer designed according to the target amplicon is a combination of downstream fusion primers designed according to each target amplicon.


In an embodiment of the present invention, the molar ratio of the upstream fusion primer designed according to any one of the target amplicon to the downstream fusion primer designed according to the target amplicon is 1:1; the molar ratio of the upstream universal primer to the downstream universal primer is 1:1. The specific amount of upstream universal primers and downstream universal primers should be adjusted according to the number of target amplicons during PCR amplification. For example, when PCR amplification, 5 target amplicons need to be amplified and 22 target amplicons need to be amplified, the specific amount of the upstream universal primer and the downstream universal primer may be different, and a specific amount of the upstream universal primer and the downstream universal primer may be determined by those skilled in the art according to conventional techniques in the art.


In an embodiment of the present invention, the DNA sample is genomic DNA.


In an embodiment of the present invention, the genomic DNA is extracted from a tissue sample or a formalin-fixed paraffin-embedded sample.


In an embodiment of the present invention, the target amplicon comprises at least one selected from the group consisting of 22 target amplicons:










Chr2:29432588-29432707 (Hg19) amplicon of the ALK gene, the sequence of which is 



shown in SEQ ID: 5:


TAAGGGACAAGCAGCCACACCCCATTCTTGAGGGGCTGAGGTGGAAGAGACAG





GCCCGGAGGGGTGAGGCAGTCTTTACTCACCTGTAGATGTCTCGGGCCATCCCGAAG





TCTCCAATCTTGGCCACTCTTCCAGGGCCTGGACAGGTCAAGAGGCAGT;





Chr2:29443616-29443730 (Hg19) amplicon of the ALK gene, the sequence of which is


shown in SEQ ID: 6:


CGGAGGAAGGACTTGAGGTCTCCCCCCGCCATGAGCTCCAGCAGGATGAACCG





GGGCAGGGATTGCAGGCTCACCCCAATGCAGCGAACAATGTTCTGGTGGTTGAATTT





GCTGCAGAGCAGAGAGGGATGTAACCAAAATTAACTGAGCTGAGTCTGG;





Chr7:140453091-140453197 (Hg19) amplicon of the BRAF gene, the sequence of which is


shown in SEQ ID: 7:


CCTCAATTCTTACCATCCACAAAATGGATCCAGACAACTGTTCAAACTGATGGG





ACCCACTCCATCGAGATTTCACTGTAGCTAGACCAAAATCACCTATTTTTACTGTGA





GGTCTTCATGAAGAAATATATCTGAGGTGTAGTAAGTAAAGGAAAACAGTAG;





Chr7:55241604-55241726 (Hg19) amplicon of the EGFR gene, the sequence of which is


shown in SEQ ID: 8:


TGACCCTTGTCTCTGTGTTCTTGTCCCCCCCAGCTTGTGGAGCCTCTTACACCCAGTG





GAGAAGCTCCCAACCAAGCTCTCTTGAGGATCTTGAAGGAAACTGAATTCAAAAAG





ATCAAAGTGCTGGGCTCCGGTGCGTTCGGCACGGTGTATAAGGTAAGGTCCCTGG;





Chr7:55242398-55242513 (Hg19) amplicon of the EGFR gene, the sequence of which is


shown in SEQ ID: 9:


ACAATTGCCAGTTAACGTCTTCCTTCTCTCTCTGTCATAGGGACTCTGGATCCCAGAA





GGTGAGAAAGTTAAAATTCCCGTCGCTATCAAGGAATTAAGAGAAGCAACATCTCC





GAAAGCCAACAAGGAAATCCTCGATGTGAGTTTCTGCTTTGCTGTGT;





Chr7:55248970-55249096 (Hg19) amplicon of the EGFR gene, the sequence of which is


shown in SEQ ID:10:


GAAGCCACACTGACGTGCCTCTCCCTCCCTCCAGGAAGCCTACGTGATGGCCA





GCGTGGACAACCCCCACGTGTGCCGCCTGCTGGGCATCTGCCTCACCTCCACCGTGC





AGCTCATCACGCAGCTCATGCCCTTCGGCTGCCTCCTGGACTATGTCCGGGAACAC;





Chr7:55259505-55259621 (Hg19) amplicon of the EGFR gene, the sequence of which is


shown in SEQ ID: 11:


CCGCAGCATGTCAAGATCACAGATTTTGGGCTGGCCAAACTGCTGGGTGCGGA





AGAGAAAGAATACCATGCAGAAGGAGGCAAAGTAAGGAGGTGGCTTTAGGTCAGC





CAGCATTTTCCTGACACCAGGGACCAGGCTGCCTTCCCACTAGCTGTATTGTTTA;





Chr17:37880969-37881082 (Hg19) amplicon of the ERBB2 gene, the sequence of which is


shown in SEQ ID: 12:


CATACCCTCTCAGCGTACCCTTGTCCCCAGGAAGCATACGTGATGGCTGGTGT





GGGCTCCCCATATGTCTCCCGCCTTCTGGGCATCTGCCTGACATCCACGGTGCAGCT





GGTGACACAGCTTATGCCCTATGGCTGCCTCTTAGACCATGTCCG;





Chr12:25380261-25380363 (Hg19) amplicon of the KRAS gene, the sequence of which is


shown in SEQ ID: 13:


AGTCCTCATGTACTGGTCCCTCATTGCACTGTACTCCTCTTGACCTGCTGTGTC





GAGAATATCCAAGAGACAGGTTTCTCCATCAATTACTACTTGCTTCCTGTAGGAATC





CTGAGAAGGGAGAAACACAGTCTGGATTATTACAGTGCA;





Chr12: 25398183-25398310 (Hg19) amplicon of the KRAS gene, the sequence of which is


shown in SEQ ID: 14:


AAAGAATGGTCCTGCACCAGTAATATGCATATTAAAACAAGATTTACCTCTAT





TGTTGGATCATATTCGTCCACAAAATGATTCTGAATTAGCTGTATCGTCAAGGCACT





CTTGCCTACGCCACCAGCTCCAACTACCACAAGTTTATATTCAGTCATTTTCAGCAG





GCCTT;





Chr7:116340233-116340335 (Hg19) amplicon of the MET gene, the sequence of which is


shown in SEQ ID: 15:


TCGATCTGCCATGTGTGCATTCCCTATCAAATATGTCAACGACTTCTTCAACAA





GATCGTCAACAAAAACAATGTGAGATGTCTCCAGCATTTTTACGGACCCAATCATGA





GCACTGCTTTAATAGGGTAAGTCACATCAGTTCCC;





Chr7: 116411880-116412005 (Hg19) amplicon of the MET gene, the sequence of which is


shown in SEQ ID: 16:


CCATGATAGCCGTCTTTAACAAGCTCTTTCTTTCTCTCTGTTTTAAGATCTGGG





CAGTGAATTAGTTCGCTACGATGCAAGAGTACACACTCCTCATTTGGATAGGCTTGT





AAGTGCCCGAAGTGTAAGCCCAACTACAGAAATGGTTTCAAATGAATCTGTAGACT





ACCGAGCT;





Chr7:116417426-116417546 (Hg19) amplicon of the MET gene, the sequence of which is


shown in SEQ ID: 17:


ATGTTACGCAGTGCTAACCAAGTTCTTTCTTTTGCACAGGGCATTTTGGTTGTG





TATATCATGGGACTTTGTTGGACAATGATGGCAAGAAAATTCACTGTGCTGTGAAAT





CCTTGAACAGTAAGTGGCATTTTATTTAACCATGGAGTATACTTTTGTGGTTTGCAAC;





Chr7: 116423399-116423499 (Hg19) amplicon of the MET gene, the sequence of which is


shown in SEQ ID: 18:


CAGTCAAGGTTGCTGATTTTGGTCTTGCCAGAGACATGTATGATAAAGAATAC





TATAGTGTACACAACAAAACAGGTGCAAAGCTGCCAGTGAAGTGGATGGCTTTGGA





AAGTCTGCAAACTCAAAAGTTTACCACCAAGTCAGATGTG;





Chr1:115256507-115256586 (Hg19) amplicon of the NRAS gene, the sequence of which is


shown in SEQ ID: 19:


TTCGCCTGTCCTCATGTATTGGTCTCTCATGGCACTGTACTCTTCTTGTCCAGCT





GTATCCAGTATGTCCAACAAACAGGTTTCACCATCTATAACCACTTGTTTTCTGTAAG





AATCCTGGGGGTG;





Chr 1: 115258651-115258755 (Hg19) amplicon of the NRAS gene, the sequence of which is


shown in SEQ ID: 20:


TGAGAGACAGGATCAGGTCAGCGGGCTACCACTGGGCCTCACCTCTATGGTGG





GATCATATTCATCTACAAAGTGGTTCTGGATTAGCTGGATTGTCAGTGCGCTTTTCCC





AACACCACCTGCTCCAACCACCACCAGTTTGTACTCAG;





Chr3:178936056-178936179 (Hg19) amplicon of the PIK3CA gene, the sequence of which is


shown in SEQ ID: 21:


GGAAAATGACAAAGAACAGCTCAAAGCAATTTCTACACGAGATCCTCTCTCTG





AAATCACTGAGCAGGAGAAAGATTTTCTATGGAGTCACAGGTAAGTGCTAAAATGG





AGATTCTCTGTTTCTTTTTCTTTATTACAGAAAAAATAACTGAATTTGGCTGATCTCA





GCATGTT;





Chr3:178952000-178952092 (Hg19) amplicon of the PIK3CA gene, the sequence of which is


shown in SEQ ID: 22:


ATGCCAGAACTACAATCTTTTGATGACATTGCATACATTCGAAAGACCCTAGC





CTTAGATAAAACTGAGCAAGAGGCTTTGGAGTATTTCATGAAACAAATGAATGATG





CACATCATGGTGGCTGGACAACAAAAATGGATTG;





Chr17:7577027-7577154 (Hg19) amplicon of the TP53 gene, the sequence of which is shown


in SEQ ID: 23:


CTTCTTGTCCTGCTTGCTTACCTCGCTTAGTGCTCCCTGGGGGCAGCTCGTGGT





GAGGCTCCCCTTTCTTGCGGAGATTCTCTTCCTCTGTGCGCCGGTCTCTCCCAGGACA





GGCACAAACACGCACCTCAAAGCTGTTCCGTCCCAGTAGATTACCACTACTCAGGAT





AGGAAAAGAG;





Chr17:7577507-7577613 (Hg19) amplicon of the TP53 gene, the sequence of which is 


shown in SEQ ID: 24:


GCAAGTGGCTCCTGACCTGGAGTCTTCCAGTGTGATGATGGTGAGGATGGGCC





TCCGGTTCATGCCGCCCATGCAGGAACTGTTACACATGTAGTTGTAGTGGATGGTGG





TACAGTCAGAGCCAACCTAGGAGATAACACAGGCCCAAGA;





Chr17:7578182-7578298 (Hg19) amplicon of the TP53 gene, the sequence of which is shown 


in SEQ ID: 25:


CCCCAGTTGCAAACCAGACCTCAGGCGGCTCATAGGGCACCACCACACTATGT





CGAAAAGTGTTTCTGTCATCCAAATACTCCACACGCAAATTTCCTTCCACTCGGATA





AGATGCTGAGGAGGGGCCAGACCTAAGAGCAATCAGTGAGGAATCAGAGG;





Chr17:7578389-7578537 (Hg19) amplicon of the TP53 gene, the sequence of which is shown 


in SEQ ID: 26:


ACCATCGCTATCTGAGCAGCGCTCATGGTGGGGGCAGCGCCTCACAACCTCCG





TCATGTGCTGTGACTGCTTGTAGATGGCCATGGCGCGGACGCGGGTGCCGGGCGGG





GGTGTGGAATCAACCCACAGCTGCACAGGGCAGGTCTTGGCCAGTTGGCAAAACAT





CTTGTTGAGGGCAGGGGAGTACTG.






In an embodiment of the present invention, the specific upstream primer sequence designed according to the Chr2:29432588-29432707 (Hg19) amplicon of the ALK gene is shown as SEQ ID: 27: ACTGCCTCTTGACCTGTCC; the specific downstream primer sequence designed according to the Chr2:29432588-29432707 (Hg19) amplicon of the ALK gene is shown as SEQ ID: 28: TAAGGGACAAGCAGCCACAC.


In an embodiment of the present invention, the specific upstream primer sequence designed according to the Chr2: 29443616-29443730 (Hg19) amplicon of ALK gene is shown as SEQ ID: 29: CCAGACTCAGCTCAGTTAATTTTGG; the specific downstream primer sequence designed according to the Chr2: 29443616-29443730 (Hg19) amplicon of the ALK gene is shown as SEQ ID: 30: CGGAGGAAGGACTTGAGGT.


In an embodiment of the present invention, the specific upstream primer sequence designed according to the Chr7: 140453091-140453197(Hg19) amplicon of BRAF gene is shown as SEQ ID: 31: CTACTGTTTTCCTTTACTTACTACACCTC; the specific downstream primer sequence designed according to the Chr7: 140453091-140453197(Hg19) amplicon of the BRAF gene is shown as SEQ ID: 32: CCTCAATTCTTACCATCCACAAAATGG.


In an embodiment of the present invention, the specific upstream primer sequence designed according to the Chr7: 55241604-55241726(Hg19) amplicon of EGFR gene is shown as SEQ ID: 33: TGACCCTTGTCTCTGTGTTCTTG; the specific downstream primer sequence designed according to the Chr7: 55241604-55241726(Hg19) amplicon of the BRAF gene is shown as SEQ ID: 34: CCAGGGACCTTACCTTATACACC.


In an embodiment of the present invention, the specific upstream primer sequence designed according to the Chr7: 55242398-55242513 (Hg19) amplicon of EGFR gene is shown as SEQ ID:35: ACAATTGCCAGTTAACGTCTTCC; the specific downstream primer sequence designed according to the Chr7: 55242398-55242513 (Hg19) amplicon of the EGFR gene is shown as SEQ ID: 36: ACACAGCAAAGCAGAAACTCAC.


In an embodiment of the present invention, the specific upstream primer sequence designed according to the Chr7: 55248970-55249096 (Hg19) amplicon of EGFR gene is shown as SEQ ID: 37: GAAGCCACACTGACGTGC; the specific downstream primer sequence designed according to the Chr7: 55248970-55249096 (Hg19) amplicon of the EGFR gene is shown as SEQ ID: 38: GTGTTCCCGGACATAGTCCAG.


In an embodiment of the present invention, the specific upstream primer sequence designed according to the Chr7: 55259505-55259621 (Hg19) amplicon of EGFR gene is shown as SEQ ID: 39: CCGCAGCATGTCAAGATCACA; the specific downstream primer sequence designed according to the Chr7: 55259505-55259621 (Hg19) amplicon of the EGFR gene is shown as SEQ ID: 40: TAAACAATACAGCTAGTGGGAAGGC.


In an embodiment of the present invention, the specific upstream primer sequence designed according to the Chr17: 37880969-37881082 (Hg19) amplicon of ERBB2 gene is shown as SEQ ID: 41: CATACCCTCTCAGCGTACCC; the specific downstream primer sequence designed according to the Chr17: 37880969-37881082 (Hg19) amplicon of the ERBB2 gene is shown as SEQ ID: 42: CGGACATGGTCTAAGAGGCAG.


In an embodiment of the present invention, the specific upstream primer sequence designed according to the Chr12: 25380261-25380363 (Hg19) amplicon of KRAS gene is shown as SEQ ID: 43: TGCACTGTAATAATCCAGACTGTGT; the specific downstream primer sequence designed according to the Chr12: 25380261-25380363 (Hg19) amplicon of the KRAS gene is shown as SEQ ID: 44: AGTCCTCATGTACTGGTCCCTC.


In an embodiment of the present invention, the specific upstream primer sequence designed according to the Chr12: 25398183-25398310 (Hg19) amplicon of KRAS gene is shown as SEQ ID: 45: AAGGCCTGCTGAAAATGACTGA; the specific downstream primer sequence designed according to the Chr12: 25398183-25398310 (Hg19) amplicon of the KRAS gene is shown as SEQ ID: 46: AAAGAATGGTCCTGCACCAGTA.


In an embodiment of the present invention, the specific upstream primer sequence designed according to the Chr7: 116340233-116340335 (Hg19) amplicon of MET gene is shown as SEQ ID: 47: TCGATCTGCCATGTGTGCATT; the specific downstream primer sequence designed according to the Chr7: 116340233-116340335 (Hg19) amplicon of the MET gene is shown as SEQ ID: 48: GGGAACTGATGTGACTTACCCT.


In an embodiment of the present invention, the specific upstream primer sequence designed according to the Chr7: 116411880-116412005 (Hg19) amplicon of MET gene is shown as SEQ ID: 49: CCATGATAGCCGTCTTTAACAAGC; the specific downstream primer sequence designed according to the Chr7: 116411880-116412005 (Hg19) amplicon of the MET gene is shown as SEQ ID: 50: AGCTCGGTAGTCTACAGATTCATTT.


In an embodiment of the present invention, the specific upstream primer sequence designed according to the Chr7: 116417426-116417546 (Hg19) amplicon of MET gene is shown as SEQ ID: 51: ATGTTACGCAGTGCTAACCAAG; the specific downstream primer sequence designed according to the Chr7: 116417426-116417546 (Hg19) amplicon of the MET gene is shown as SEQ ID: 52: GTTGCAAACCACAAAAGTATACTCCA.


In an embodiment of the present invention, the specific upstream primer sequence designed according to the Chr7: 116423399-116423499 (Hg19) amplicon of MET gene is shown as SEQ ID: 53: CAGTCAAGGTTGCTGATTTTGGTC; the specific downstream primer sequence designed according to the Chr7: 116423399-116423499 (Hg19) amplicon of the MET gene is shown as SEQ ID: 54: CACATCTGACTTGGTGGTAAACTT.


In an embodiment of the present invention, the specific upstream primer sequence designed according to the Chr1: 115256507-115256586 (Hg19) amplicon of NRAS gene is shown as SEQ ID: 55: CACCCCCAGGATTCTTACAGAAAA; the specific downstream primer sequence designed according to the Chr1: 115256507-115256586 (Hg19) amplicon of the NRAS gene is shown as SEQ ID: 56: TTCGCCTGTCCTCATGTATTGG.


In an embodiment of the present invention, the specific upstream primer sequence designed according to the Chr1: 115258651-115258755 (Hg19) amplicon of NRAS gene is shown as SEQ ID: 57: CTGAGTACAAACTGGTGGTGGT; the specific downstream primer sequence designed according to the Chr1: 115258651-115258755 (Hg19) amplicon of the NRAS gene is shown as SEQ ID: 58: TGAGAGACAGGATCAGGTCAGC.


In an embodiment of the present invention, the specific upstream primer sequence designed according to the Chr3: 178936056-178936179 (Hg19) amplicon of PIK3CA gene is shown as SEQ ID: 59: GGAAAATGACAAAGAACAGCTCAAAG; the specific downstream primer sequence designed according to the Chr3: 178936056-178936179 (Hg19) amplicon of the PIK3CA gene is shown as SEQ ID: 60: AACATGCTGAGATCAGCCAAATTC.


In an embodiment of the present invention, the specific upstream primer sequence designed according to the Chr3: 178952000-178952092 (Hg19) amplicon of PIK3CA gene is shown as SEQ ID: 61: ATGCCAGAACTACAATCTTTTGATGAC; the specific downstream primer sequence designed according to the Chr3: 178952000-178952092 (Hg19) amplicon of the PIK3CA gene is shown as SEQ ID: 62: CAATCCATTTTTGTTGTCCAGCC.


In an embodiment of the present invention, the specific upstream primer sequence designed according to the Chr17: 7577027-7577154 (Hg19) amplicon of TP53 gene is shown as SEQ ID: 63: CTCTTTTCCTATCCTGAGTAGTGGTAATC; the specific downstream primer sequence designed according to the Chr17: 7577027-7577154 (Hg19) amplicon of the TP53 gene is shown as SEQ ID: 64: CTTCTTGTCCTGCTTGCTTACC.


In an embodiment of the present invention, the specific upstream primer sequence designed according to the Chr17: 7577507-7577613 (Hg19) amplicon of TP53 gene is shown as SEQ ID: 65: TCTTGGGCCTGTGTTATCTCCTAG; the specific downstream primer sequence designed according to the Chr17: 7577507-7577613 (Hg19) amplicon of the TP53 gene is shown as SEQ ID: 66: GCAAGTGGCTCCTGACCTG.


In an embodiment of the present invention, the specific upstream primer sequence designed according to the Chr17: 7578182-7578298 (Hg19) amplicon of TP53 gene is shown as SEQ ID: 67: CCTCTGATTCCTCACTGATTGCTC; the specific downstream primer sequence designed according to the Chr17: 7578182-7578298 (Hg19) amplicon of the TP53 gene is shown as SEQ ID: 68: CCCCAGTTGCAAACCAGAC.


In an embodiment of the present invention, the specific upstream primer sequence designed according to the Chr17: 7578389-7578537 (Hg19) amplicon of TP53 gene is shown as SEQ ID: 69: CAGTACTCCCCTGCCCTCAA; the specific downstream primer sequence designed according to the Chr17: 7578389-7578537 (Hg19) amplicon of the TP53 gene is shown as SEQ ID: 70: ACCATCGCTATCTGAGCAGC.


In an embodiment of the present invention, the target amplicons are the following 22 species: Chr2:29432588-29432707 (Hg19) amplicon of the ALK gene, the sequence of which is shown in SEQ ID:5;


Chr2: 29443616-29443730 (Hg19) amplicon of the ALK gene, the sequence of which is shown in SEQ ID:6;


Chr7: 140453091-140453197(Hg19) amplicon of the BRAF gene, the sequence of which is shown in SEQ 1D:7;


Chr7: 55241604-55241726(Hg19) amplicon of the EGFR gene, the sequence of which is shown in SEQ ID:8;


Chr7: 55242398-55242513 (Hg19) amplicon of the EGFR gene, the sequence of which is shown in SEQ ID:9;


Chr7: 55248970-55249096 (Hg19) amplicon of the EGFR gene, the sequence of which is shown in SEQ ID:10;


Chr7: 55259505-55259621 (Hg19) amplicon of the EGFR gene, the sequence of which is shown in SEQ ID:11;


Chr17: 37880969-37881082 (Hg19) amplicon of the ERBB2 gene, the sequence of which is shown in SEQ ID:12;


Chr12: 25380261-25380363 (Hg19) amplicon of the KRAS gene, the sequence of which is shown in SEQ ID:13;


Chr12: 25398183-25398310 (Hg19) amplicon of the KRAS gene, the sequence of which is shown in SEQ ID:14;


Chr7: 116340233-116340335 (Hg19) amplicon of the MET gene, the sequence of which is shown in SEQ ID:15;


Chr7: 116411880-116412005 (Hg19) amplicon of the MET gene, the sequence of which is shown in SEQ ID:16;


Chr7: 116417426-116417546 (Hg19) amplicon of the MET gene, the sequence of which is shown in SEQ ID:17;


Chr7: 116423399-116423499 (Hg19) amplicon of the MET gene, the sequence of which is shown in SEQ ID:18;


Chr1: 115256507-115256586 (Hg19) amplicon of the NRAS gene, the sequence of which is shown in SEQ ID:19;


Chr1: 115258651-115258755 (Hg19) amplicon of the NRAS gene, the sequence of which is shown in SEQ ID:20;


Chr3: 178936056-178936179 (Hg19) amplicon of the PIK3CA gene, the sequence of which is shown in SEQ ID:21;


Chr3: 178952000-178952092 (Hg19) amplicon of the PIK3CA gene, the sequence of which is shown in SEQ ID:22;


Chr17: 7577027-7577154 (Hg19) amplicon of the TP53 gene, the sequence of which is shown in SEQ ID:23;


Chr17: 7577507-7577613 (Hg19) amplicon of the TP53 gene, the sequence of which is shown in SEQ ID:24;


Chr17: 7578182-7578298 (Hg19) amplicon of the TP53 gene, the sequence of which is shown in SEQ ID:25; and


Chr17: 7578389-7578537 (Hg19) amplicon of the TP53 gene, the sequence of which is shown in SEQ ID:26.


In an embodiment of the present invention, the molar ratio of the combination of the upstream fusion primers designed according to the above 22 target amplicons, the combination of the downstream fusion primers designed according to the above 22 target amplicons, the upstream universal primer and the downstream universal primer is: 0.1-0.3: 0.1-0.3: 0.5-1: 0.5-1, for example, 0.1:0.1:0.5:0.5.


In an embodiment of the present invention, the molar ratio of the upstream fusion primer designed according to the Chr2:29432588-29432707 (Hg19) amplicon of ALK gene, the upstream fusion primer designed according to the Chr2: 29443616-29443730 (Hg19) amplicon of ALK gene, the upstream fusion primer designed according to the Chr7: 140453091-140453197(Hg19) amplicon of BRAF gene, the upstream fusion primer designed according to the Chr7: 55241604-55241726(Hg19) amplicon of EGFR gene, the upstream fusion primer designed according to the Chr7: 55242398-55242513 (Hg19) amplicon of EGFR gene, the upstream fusion primer designed according to the Chr7: 55248970-55249096 (Hg19) amplicon of EGFR gene, the upstream fusion primer designed according to the Chr7: 55259505-55259621 (Hg19) amplicon of EGFR gene, the upstream fusion primer designed according to the Chr17: 37880969-37881082 (Hg19) amplicon of ERBB2 gene; the upstream fusion primer designed according to the Chr12: 25380261-25380363 (Hg19) amplicon of KRAS gene, the upstream fusion primer designed according to the Chr12: 25398183-25398310 (Hg19) amplicon of KRAS gene ; the upstream fusion primer designed according to the Chr7: 116340233-116340335 (Hg19) amplicon of MET gene, the upstream fusion primer designed according to the Chr7: 116411880-116412005 (Hg19) amplicon of MET gene, the upstream fusion primer designed according to the Chr7: 116417426-116417546 (Hg19) amplicon of MET gene, the upstream fusion primer designed according to the Chr7: 116423399-116423499 (Hg19) amplicon of MET gene, the upstream fusion primer designed according to the Chr1: 115256507-115256586 (Hg19) amplicon of NRAS gene, the upstream fusion primer designed according to the Chr1: 115258651-115258755 (Hg19) amplicon of NRAS gene, the upstream fusion primer designed according to the Chr3: 178936056-178936179 (Hg19) amplicon of PIK3CA gene, the upstream fusion primer designed according to the Chr3: 178952000-178952092 (Hg19) amplicon of PIK3CA gene, the upstream fusion primer designed according to the Chr17: 7577027-7577154 (Hg19) amplicon of TP53 gene, the upstream fusion primer designed according to the Chr17: 7577507-7577613 (Hg19) amplicon of TP53 gene, the upstream fusion primer designed according to the Chr17: 7578182-7578298 (Hg19) amplicon of TP53 gene, and the upstream fusion primer designed according to the Chr17: 7578389-7578537 (Hg19) amplicon of TP53 gene is: 1:2:1:4:2:1:2:4:2:2:2:2:1:4:2:2:2:2:4:2:4:2.


In an embodiment of the present invention, the PCR reaction system includes the following components:















PCR master mix
 10 μl;


DNA sample
1-8 μl total 20 ng;


Primer combination for constructing an amplicon
  2 μl;


library of the same DNA sample


DNAase-free H2O
making up to 20 μl.









In an embodiment of the present invention, the PCR master mix is KAPA HiFi PCR Kits 2x.


In an embodiment of the present invention, the reaction procedure for performing PCR is:



















Number of



Temperature
Time
cycles









98° C.
30 s




98° C.
10 s
22 cycles



60° C.
90 s



72° C.
90 s



72° C.
10 min



 4° C.











In an embodiment of the present invention, after the PCR reaction, a step of purifying the PCR amplification product is also included.


Compared with the prior art, the present invention has the following advantages:


The method disclosed in the present invention is based on the design of the PGM platform, and can effectively amplify multiple target regions (amplicons) at the same time. In the process of using the library for construction, the present invention only involves one round of PCR reaction and one round of product purification steps, which greatly simplifies the experimental operation of the existing commercial kit (such as PCR process, purification step, digestion and joints, etc.) Step), and saves the construction time. The entire database construction process only takes 2.5 hours (including the same sample of DNA and RNA database).


Effectively elimination of sample and library contamination is achieved. The significant simplified operation process makes the library construction process more secure and reliable, and the reduction of operation process and steps effectively eliminates the library pollution that may be caused during the database construction process.


Streamlined bioinformatics analysis process is obtained. The amplicon library obtained by the method has a single structure and reliable data, and the DNA strand composition of the obtained library is simple and clear, and the subsequent bioinformatics analysis is more simplified.


After the library is constructed, the library is only needed to be quantified by the instrument “Qubit 2.0”, which eliminates quantification step by the instrument “qPCR”. Therefore, the database construction time is shortened and corresponding operation steps are reduced, and the experimental errors that may be caused by the cumbersome experimental process are avoided.





BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 is a distribution diagram of an amplification product detected after completion of construction of an amplicon library in Example 1 of the present invention.



FIG. 2 is a related parameter of 22 amplicons in the library obtained in Example 1 of the present invention.





DETAILED DESCRIPTION OF THE EMBODIMENTS

The specific embodiments of the present invention are described in detail below with reference to the accompanying drawings, but it is understood that the scope of the present invention is not limited by the specific embodiments.


EXAMPLE 1

The samples to be tested are 6 FFPE samples (i.e., formalin-fixed paraffin-embedded samples, FFPE stands for Formalin-Fixed and Parrffin-Embedded), 4 of which are FFPE samples from patients with non-small cell lung cancer, and 2 are non-FFPE samples from patients with cancer. Amplified primers are used to construct an amplicon DNA library from 6 FFPE samples using a specific designed fusion primer. The specific process is as follows:


1. Extraction of genomic DNA:


The genomic DNA in the FFPE sample is extracted using the “Qiagen FFPE DNA Kit”. The detailed steps for extraction can be referred to the kit instructions. The genomic DNA is dissolved in “Tris-HCl ” buffer, quality of the extracted DNA is detected using “Nano Drop”. After the concentration of the sample DNA is detected using the instrument “quiz 3.0”, each genomic DNA sample is diluted to a concentration of 20 ng/μl.


2. Design and synthesize primers:


An upstream fusion primer is designed according to the target amplicon. The upstream fusion primer includes a first linker sequence arranged in the order of 5′ to 3′ and a specific upstream primer sequence that is designed according to the target amplicon;


A downstream fusion primer is designed according to the target amplicon. The downstream fusion primer includes a second linker sequence arranged in the order of 5′ to 3′ and a specific downstream primer sequence designed according to the target amplicon;


An upstream universal primer comprises a third linker sequence, barcode sequence and a first linker sequence arranged in the order of 5′ to 3′; and


A downstream universal primer comprises a universal sequence and a second linker sequence arranged in the order of 5′ to 3′.


In the primer combination for constructing the amplicon library of the DNA sample, the information of the specific upstream primer sequence and the specific downstream primer sequence designed according to the target amplicon are as follows:


Information on the different target amplicons is given in the table below, and the specific upstream primer sequence “Special Primer Start” and the specific downstream primer sequence “Special Primer End” designed for these amplicons are also given. Sequences of upstream fusion primers designed according to the target amplicon, downstream fusion primers designed according to the target amplicon, upstream universal primers, and downstream universal primers are also given. Puf represents an alternative upstream universal primer and Pur represents a downstream universal primer.






















Gene/

Amp
Ins

Amp

Primer
Special
Primer
Special Primer


Amplicons
Chr
Start
Start
Ins End
End
length
name
Primer Start
name
End







ALK-001
 2
2943
294325
294327
294327
159
Pspf-1
ACTGCCTC
pspr-1
TAAGGGACA




2569
88
07
27


TTGACCTG

AGCAGCCAC










TCC

AC





ALK-002
 2
2944
294436
294437
294437
159
Pspf-2
CCAGACTC
pspr-2
CGGAGGAAG




3591
16
30
49


AGCTCAGT

GACTTGAGGT










TAATTTTG












G







BRAF-001
 7
1404
140453
140453
140453
163
Pspf-3
CTACTGTT
pspr-3
CCTCAATTCT




5306
091
197
224


TTCCTTTA

TACCATCCAC




2





CTTACTAC

AAAATGG










ACCTC







EGFR-001
 7
5524
552416
552417
552417
169
Pspf-4
TGACCCTT
pspr-4
CCAGGGACCT




1581
04
26
49


GTCTCTGT

TACCTTATAC










GTTCTTG

ACC





EGFR-002
 7
5524
552423
552425
552425
161
Pspf-5
ACAATTGC
pspr-5
ACACAGCAA




2375
98
13
35


CAGTTAAC

AGCAGAAAC










GTCTTCC

TCAC





EGFR-003
 7
5524
552489
552490
552491
166
Pspf-6
GAAGCCAC
pspr-6
GTGTTCCCGG




8952
70
96
17


ACTGACGT

ACATAGTCCA










GC

G





EGFR-004
 7
5525
552595
552596
552596
163
Pspf-7
CCGCAGCA
pspr-7
TAAACAATAC




9484
05
21
46


TGTCAAGA

AGCTAGTGG










TCACA

GAAGGC





ERBB2-001
17
3788
378809
378810
378811
155
Pspf-8
CATACCCT
pspr-8
CGGACATGGT




0949
69
82
03


CTCAGCGT

CTAAGAGGC










ACCC

AG





KRAS-001
12
2538
253802
253803
253803
150
Pspf-9
TGCACTGT
pspr-9
AGTCCTCATG




0236
61
63
85


AATAATCC

TACTGGTCCC










AGACTGTG

TC










T







KRAS-002
12
2539
253981
253983
253983
172
Pspf-10
AAGGCCTG
pspr-10
AAAGAATGG




8161
83
10
32


CTGAAAAT

TCCTGCACCA










GACTGA

GTA





MET-001
 7
1163
116340
116340
116340
146
Pspf-11
TCGATCTG
pspr-11
GGGAACTGA




4021
233
335
357


CCATGTGT

TGTGACTTAC




2





GCATT

CCT





MET-002
 7
1164
116411
116412
116412
175
Pspf-12
CCATGATA
pspr-12
AGCTCGGTAG




1185
880
005
030


GCCGTCTT

TCTACAGATT




6





TAACAAGC

CATTT





MET-003
 7
1164
116417
116417
116417
169
Pspf-13
ATGTTACG
pspr-13
GTTGCAAACC




1740
426
546
572


CAGTGCTA

ACAAAAGTA




4





ACCAAG

TACTCCA





MET-004
 7
1164
116423
116423
116423
149
Pspf-14
CAGTCAAG
pspr-14
CACATCTGAC




2337
399
499
523


GTTGCTGA

TTGGTGGTAA




5





TTTTGGTC

ACTT





NRAS-001
 1
1152
115256
115256
115256
126
Pspf-15
CACCCCCA
pspr-15
TTCGCCTGTC




5648
507
586
608


GGATTCTT

CTCATGTATT




3





ACAGAAA

GG










A







NRAS-002
 1
1152
115258
115258
115258
149
Pspf-16
CTGAGTAC
pspr-16
TGAGAGACA




5862
651
755
777


AAACTGGT

GGATCAGGTC




9





GGTGGT

AGC





PIK3CA-001
 3
1789
178936
178936
178936
174
Pspf-17
GGAAAATG
pspr-17
AACATGCTGA




3603
056
179
203


ACAAAGA

GATCAGCCA




0





ACAGCTCA

AATTC










AAG







PIK3CA-002
 3
1789
178952
178952
178952
143
Pspf-18
ATGCCAGA
pspr-18
CAATCCATTT




5197
000
092
115


ACTACAAT

TTGTTGTCCA




3





CTTTTGAT

GCC










GAC







TP53-001
17
7576
757702
757715
757717
179
Pspf-19
CTCTTTTC
pspr-19
CTTCTTGTCC




998
7
4
6


CTATCCTG

TGCTTGCTTA










AGTAGTGG

CC










TAATC







TP53-002
17
7577
757750
757761
757763
150
Pspf-20
TCTTGGGC
pspr-20
GCAAGTGGCT




483
7
3
2


CTGTGTTA

CCTGACCTG










TCTCCTAG







TP53-003
17
7578
757818
757829
757831
159
Pspf-21
CCTCTGAT
pspr-21
CCCCAGTTGC




158
2
8
7


TCCTCACT

AAACCAGAC










GATTGCTC







TP53-004
17
7578
757838
757853
757855
189
Pspf-22
CAGTACTC
pspr-22
ACCATCGCTA




369
9
7
7


CCCTGCCC

TCTGAGCAGC










TCAA




















Primer



name
Primer sequence







Pspf-1
GGCATACGTCCTCGTCTAACTGCCTCTTGACCTGTCC





Pspf-2
GGCATACGTCCTCGTCTACCAGACTCAGCTCAGTTAATTTTGG





Pspf-3
GGCATACGTCCTCGTCTACTACTGTTTTCCTTTACTTACTACACCTC





Pspf-4
GGCATACGTCCTCGTCTATGACCCTTGTCTCTGTGTTCTTG





Pspf-5
GGCATACGTCCTCGTCTAACAATTGCCAGTTAACGTCTTCC





Pspf-6
GGCATACGTCCTCGTCTAGAAGCCACACTGACGTGC





Pspf-7
GGCATACGTCCTCGTCTACCGCAGCATGTCAAGATCACA





Pspf-8
GGCATACGTCCTCGTCTACATACCCTCTCAGCGTACCC





Pspf-9
GGCATACGTCCTCGTCTATGCACTGTAATAATCCAGACTGTGT





Pspf-10
GGCATACGTCCTCGTCTAAAGGCCTGCTGAAAATGACTGA





Pspf-11
GGCATACGTCCTCGTCTATCGATCTGCCATGTGTGCATT





Pspf-12
GGCATACGTCCTCGTCTACCATGATAGCCGTCTTTAACAAGC





Pspf-13
GGCATACGTCCTCGTCTAATGTTACGCAGTGCTAACCAAG





Pspf-14
GGCATACGTCCTCGTCTACAGTCAAGGTTGCTGATTTTGGTC





Pspf-15
GGCATACGTCCTCGTCTACACCCCCAGGATTCTTACAGAAAA





Pspf-16
GGCATACGTCCTCGTCTACTGAGTACAAACTGGTGGTGGT





Pspf-17
GGCATACGTCCTCGTCTAGGAAAATGACAAAGAACAGCTCAAAG





Pspf-18
GGCATACGTCCTCGTCTAATGCCAGAACTACAATCTTTTGATGAC





Pspf-19
GGCATACGTCCTCGTCTACTCTTTTCCTATCCTGAGTAGTGGTAATC





Pspf-20
GGCATACGTCCTCGTCTATCTTGGGCCTGTGTTATCTCCTAG





Pspf-21
GGCATACGTCCTCGTCTACCTCTGATTCCTCACTGATTGCTC





Pspf-22
GGCATACGTCCTCGTCTACAGTACTCCCCTGCCCTCAA





pspr-1
TCTATGGGCAGTCGGTGATTAAGGGACAAGCAGCCACAC





pspr-2
TCTATGGGCAGTCGGTGATCGGAGGAAGGACTTGAGGT





pspr-3
TCTATGGGCAGTCGGTGATCCTCAATTCTTACCATCCACAAAATGG





pspr-4
TCTATGGGCAGTCGGTGATCCAGGGACCTTACCTTATACACC





pspr-5
TCTATGGGCAGTCGGTGATACACAGCAAAGCAGAAACTCAC





pspr-6
TCTATGGGCAGTCGGTGATGTGTTCCCGGACATAGTCCAG





pspr-7
TCTATGGGCAGTCGGTGATTAAACAATACAGCTAGTGGGAAGGC





pspr-8
TCTATGGGCAGTCGGTGATCGGACATGGTCTAAGAGGCAG





pspr-9
TCTATGGGCAGTCGGTGATAGTCCTCATGTACTGGTCCCTC





pspr-10
TCTATGGGCAGTCGGTGATAAAGAATGGTCCTGCACCAGTA





pspr-11
TCTATGGGCAGTCGGTGATGGGAACTGATGTGACTTACCCT





pspr-12
TCTATGGGCAGTCGGTGATAGCTCGGTAGTCTACAGATTCATTT





pspr-13
TCTATGGGCAGTCGGTGATGTTGCAAACCACAAAAGTATACTCCA





pspr-14
TCTATGGGCAGTCGGTGATCACATCTGACTTGGTGGTAAACTT





pspr-15
TCTATGGGCAGTCGGTGATTTCGCCTGTCCTCATGTATTGG





pspr-16
TCTATGGGCAGTCGGTGATTGAGAGACAGGATCAGGTCAGC





pspr-17
TCTATGGGCAGTCGGTGATAACATGCTGAGATCAGCCAAATTC





pspr-18
TCTATGGGCAGTCGGTGATCAATCCATTTTTGTTGTCCAGCC





pspr-19
TCTATGGGCAGTCGGTGATCTTCTTGTCCTGCTTGCTTACC





pspr-20
TCTATGGGCAGTCGGTGATGCAAGTGGCTCCTGACCTG





pspr-21
TCTATGGGCAGTCGGTGATCCCCAGTTGCAAACCAGAC





pspr-22
TCTATGGGCAGTCGGTGATACCATCGCTATCTGAGCAGC





puf-1
CCATCTCATCCCTGCGTGTCTCCGACTCAGCTTGACACCGCGGCATACGTCCTCGTCTA





puf-2
CCATCTCATCCCTGCGTGTCTCCGACTCAGTTGGAGGCCAGCGGCATACGTCCTCGTCTA





puf-3
CCATCTCATCCCTGCGTGTCTCCGACTCAGTGGAGCTTCCTCGGCATACGTCCTCGTCTA





puf-4
CCATCTCATCCCTGCGTGTCTCCGACTCAGTCAGTCCGAACGGCATACGTCCTCGTCTA





puf-5
CCATCTCATCCCTGCGTGTCTCCGACTCAGTAAGGCAACCACGGCATACGTCCTCGTCTA





puf-6
CCATCTCATCCCTGCGTGTCTCCGACTCAGTTCTAAGAGACGGCATACGTCCTCGTCTA





puf-7
CCATCTCATCCCTGCGTGTCTCCGACTCAGTCCTAACATAACGGCATACGTCCTCGTCTA





puf-8
CCATCTCATCCCTGCGTGTCTCCGACTCAGCGGACAATGGCGGCATACGTCCTCGTCTA





puf-9
CCATCTCATCCCTGCGTGTCTCCGACTCAGTTGAGCCTATTCGGCATACGTCCTCGTCTA





puf-10
CCATCTCATCCCTGCGTGTCTCCGACTCAGCCGCATGGAACGGCATACGTCCTCGTCTA





pur
CCACTACGCCTCCGCTTTCCTCTCTATGGGCAGTCGGTGAT









The first linker sequence is GGCATACGTCCTCGTCTA, the second linker sequence is TCTATGGGCAGTCGGTGAT, the third linker sequence is CCATCTCATCCCTGCGTGTCTCCGACTCAG, and the universal sequence is CCACTACGCCTCCGCTTTCCTC.


3. Form a PCR reaction system. The specific PCR reaction system is as follows:













PCR reaction system component
content







KAPA HiFi PCR Kits 2x
10 μl


Genomic DNA (10 ng/μl itself)
 2 μl


Primer combination for constructing an
 2 μl


amplicon library of the same DNA sample


DNAase free H2O
make up to the total of 20 μl


Total
20 μl









Primer combinations for constructing an amplicon library of the same DNA sample are prepared by the following methods: (1) the upstream universal primer, the downstream universal primer, and each upstream fusion primer designed according to the 22 target amplicons and each downstream fusion primer are dissolved in water to a concentration of 100 μM; (2) 22 upstream fusion primers with a serial number ranging from small to large are respectively mixed with a concentration of 100 μM, and the molar ratio is 1:2:1:4:2:1:2:4:2:2:2:2:2:4:2:2:2:2:4:2:4:2, so as to obtain the upstream fusion primer combination, and 22 downstream fusion primers with a concentration of 100 μM are respectively mixed with the corresponding upstream fusion primers in equal volume to obtain a downstream fusion primer combination, and then the upstream fusion primer combination and the downstream fusion primer combination are mixed in equal volume; (3) mixing in equal volume of upstream universal primers and downstream universal primers with concentrations of 100 μM; (4) the upstream fusion primer combination, the downstream fusion primer combination, the upstream universal primer and the downstream universal primer are mixed according to a molar ratio of 0.1:0.1:0.5:0.5, so that the amplicons for constructing the DNA sample are obtained. Six different sets of samples to be tested need to correspond to primer combinations containing six different barcode sequence tags.


4. Carry out the PCR program. The PCR instrument is the 2720 Thermal Cycler of Applied Bio-system. The PCR reaction procedure is as follows:

















Temperature
Time
Number of cycles









98° C.
30 s




98° C.
10 s
22 Cycles



60° C.
90 s



72° C.
90 s



72° C.
10 min



 4° C.











5. After the PCR reaction, purification is carried out using “Agencourt AMPure XP Kit” (Cat. No. A63880/A63881/A63882) from Beckman Coulter company. The steps are as follows:


1) take out the Agencourt AMPure XP Kit 30 minutes in advance, rotate the magnetic beads in the Kit thoroughly, and keep the EP tube at room temperature.


2) After the completion of the PCR reaction, the magnetic beads are rotated again sufficiently, and 20 ul of magnetic beads are added to the system, repeatedly blow 5 times or more, or rotate thoroughly, and allow the Kit to be placed at room temperature for 5 minutes.


3) Transfer the EP tube to the magnetic stand and keep for 5 minutes until the solution is clarified. Carefully remove the supernatant with a pipette, taking care not to touch the beads.


4) Add 100 ul of freshly prepared 80% ethanol solution to each tube, and place the EP tube on the magnetic stand and rotate two turns, keep it for 5 minutes, and discard the supernatant.


5) Repeat step 4) once.


6) Open the EP tube and keep it at room temperature to make the liquid volatilize completely. Make sure the surfaces of the magnetic beads are dull, and be careful not to over-dry the magnetic beads.


7) Remove the EP tube from the magnetic stand, add 30 ul of PCR-grade purified water, rotate and mix, and keep it for 10 minutes at room temperature.


8) Place the EP tube on the magnetic stand for 2 minutes or until the solution is clarified. Carefully suck the supernatant from the side away from the magnet with a pipette, taking care not to touch the beads.


At this point, the amplicon library is constructed. FIG. 1 shows the distribution of amplified products detected by Agilent 2200 TapeStation Systems after the completion of the library. The abscissa is the length of the fragment, the ordinate is the signal intensity


(FU), and the lower peak is the 25 bp position marker, the upper peak is a 1500 bp position marker. As shown in FIG. 1, the PCR products obtained by PCR amplification are concentrated in the range of 241-271 bp. FIG. 1 shows that the experimental results are consistent with the experimental design. From FIG. 1, the size of the constructed library and the library concentration can be judged.


6. On-machine sequencing and results analysis


The amplicon library is obtained by the fusion primer one-step method. The amplicon sequencing is performed using the chip 318 of the Ion PGM platform, and the data amount of each library is 50 M bps. The average sequencing depth of each sample is not less than 1600×, and the single amplicon sequencing depth reached 600×. The obtained sequencing results are shown in FIG. 2. From FIG. 2, it is possible to further analyze whether or not each amplicon of the 22 amplicons is amplified and the amplification uniformity of each amplicon.


The results of sequencing are analyzed by data processing and bioinformatics analysis to obtain mutations in the detected genes. The data processing process includes conversion, quality control and sequence alignment of the sequencing data (reference genome is NCBI GRCh37/Hg19), mutation site analysis and other processes, and the mutation information of the detected samples is obtained through data processing analysis.


The actual sample collection is as follows: Among the FFPE samples of 6 subjects, no tumor-related mutations are detected in 2 normal human samples, among the 4 FFPE samples of tumor patients, p.R248W mutation is detected in Samplel, p.T790M mutation is detected in sample2, p.G12A mutation is detected in sample3, and p.E545K mutation is detected in Sample4. This result is consistent with the results of the sanger test. The practical applicability and good specificity of the present invention are fully illustrated.


The foregoing descriptions of specific embodiments of the present invention are presented for purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise form disclosed, and obviously many modifications and variations are possible in light of the above teaching. The exemplary embodiments are chosen and described in order to explain certain principles of the invention and their practical application, to thereby enable others skilled in the art to make and utilize various exemplary embodiments of the present invention, as well as various alternatives and modifications thereof. The invention is to be limited only by the claims provided below and equivalents thereof.

Claims
  • 1. A method for constructing an amplicon library of a DNA sample, comprising the following steps: 1) synthesizing a primer combination which is used for constructing an amplicon library of a DNA sample, wherein the primer combination of the amplicon library used to construct the DNA sample includes:an upstream fusion primer designed according to a target amplicon, the upstream fusion primer comprising a first linker sequence (Bridge sequence) arranged in the order of 5′ to 3′ and a specific upstream primer sequence designed according to the target amplicon;a downstream fusion primer designed according to the target amplicon, the downstream fusion primer comprising a second linker sequence (trP1 sequence) arranged in the order of 5′ to 3′ and a specific downstream primer sequence designed according to the target amplicon;an upstream universal primer comprising a third linker sequence (A sequence), a barcode sequence and a first linker sequence arranged in the order of 5′ to 3′; anda downstream universal primer comprising a universal sequence (Uni sequence) and a second linker sequence arranged in the order of 5′ to 3′;2) constructing a PCR reaction system for the DNA sample, and mixing the upstream fusion primers designed according to the target amplicon, the downstream fusion primers designed according to the target amplicon, the upstream universal primers and the downstream universal primers together, to serve as a primer combination in the PCR reaction system; and3) performing PCR.
  • 2. The method for constructing an amplicon library of a DNA sample according to claim 1, wherein the first linker sequence comprises a sequence of SEQ ID: 1; the second linker sequence comprises a sequence of SEQ ID 2; the third linker sequence comprises a sequence of SEQ ID: 3; the universal sequence comprises a sequence of SEQ ID: 4.
  • 3. The method for constructing an amplicon library of a DNA sample according to claim 1, wherein in a primer combination for constructing a plurality of amplicon libraries of the same DNA sample, the barcode sequences in the upstream universal primers are the same; in a primer combination for constructing amplicon libraries of different DNA samples, the barcode sequences in the upstream universal primers are different.
  • 4. The method for constructing an amplicon library of a DNA sample according to claim 1, wherein when the number of target amplicon in a same PCR reaction is greater than one, the upstream fusion primer designed according to the target amplicon is a combination of upstream fusion primers designed according to each target amplicon, the downstream fusion primer designed according to the target amplicon is a combination of downstream fusion primers designed according to each target amplicon.
  • 5. The method for constructing an amplicon library of a DNA sample according to claim 1, wherein the DNA sample is genomic DNA, and the genomic DNA is extracted from a tissue sample or a formalin-fixed paraffin-embedded sample.
  • 6. The method for constructing an amplicon library of a DNA sample according to claim 1, wherein the target amplicon comprises at least one selected from the group consisting of twenty-two target amplicons: Chr2:29432588-29432707 (Hg19) amplicon of the ALK gene, the sequence of which is shown in SEQ ID:5;Chr2: 29443616-29443730 (Hg19) amplicon of the ALK gene, the sequence of which is shown in SEQ ID:6;Chr7: 140453091-140453197(Hg19) amplicon of the BRAF gene, the sequence of which is shown in SEQ ID:7;Chr7: 55241604-55241726(Hg19) amplicon of the EGFR gene, the sequence of which is shown in SEQ ID:8;Chr7: 55242398-55242513 (Hg19) amplicon of the EGFR gene, the sequence of which is shown in SEQ ID:9;Chr7: 55248970-55249096 (Hg19) amplicon of the EGFR gene, the sequence of which is shown in SEQ ID:10;Chr7: 55259505-55259621 (Hg19) amplicon of the EGFR gene, the sequence of which is shown in SEQ ID:11;Chr17: 37880969-37881082 (Hg19) amplicon of the ERBB2 gene, the sequence of which is shown in SEQ ID:12;Chr12: 25380261-25380363 (Hg19) amplicon of the KRAS gene, the sequence of which is shown in SEQ ID:13;Chr12: 25398183-25398310 (Hg19) amplicon of the KRAS gene, the sequence of which is shown in SEQ ID:14;Chr7: 116340233-116340335 (Hg19) amplicon of the MET gene, the sequence of which is shown in SEQ ID:15;Chr7: 116411880-116412005 (Hg19) amplicon of the MET gene, the sequence of which is shown in SEQ ID:16;Chr7: 116417426-116417546 (Hg19) amplicon of the MET gene, the sequence of which is shown in SEQ ID:17;Chr7: 116423399-116423499 (Hg19) amplicon of the MET gene, the sequence of which is shown in SEQ ID:18;Chr1: 115256507-115256586 (Hg19) amplicon of the NRAS gene, the sequence of which is shown in SEQ ID:19;Chr1: 115258651-115258755 (Hg19) amplicon of the NRAS gene, the sequence of which is shown in SEQ ID:20;Chr3: 178936056-178936179 (Hg19) amplicon of the PIK3CA gene, the sequence of which is shown in SEQ ID:21;Chr3: 178952000-178952092 (Hg19) amplicon of the PIK3CA gene, the sequence of which is shown in SEQ ID:22;Chr17: 7577027-7577154 (Hg19) amplicon of the TP53 gene, the sequence of which is shown in SEQ ID:23;Chr17: 7577507-7577613 (Hg19) amplicon of the TP53 gene, the sequence of which is shown in SEQ ID:24;Chr17: 7578182-7578298 (Hg19) amplicon of the TP53 gene, the sequence of which is shown in SEQ ID:25; andChr17: 7578389-7578537 (Hg19) amplicon of the TP53 gene, the sequence of which is shown in SEQ ID:26.
  • 7. The method for constructing an amplicon library of a DNA sample according to claim 6, wherein, the specific upstream primer sequence designed according to the Chr2:29432588-29432707 (Hg19) amplicon of the ALK gene is shown as SEQ ID: 27; the specific downstream primer sequence designed according to the Chr2:29432588-29432707 (Hg19) amplicon of the ALK gene is shown as SEQ ID: 28;the specific upstream primer sequence designed according to the Chr2: 29443616-29443730 (Hg19) amplicon of ALK gene is shown as SEQ ID: 29; the specific downstream primer sequence designed according to the Chr2: 29443616-29443730 (Hg19) amplicon of the ALK gene is shown as SEQ ID: 30;the specific upstream primer sequence designed according to the Chr7: 140453091-140453197(Hg19) amplicon of BRAF gene is shown as SEQ ID: 31; the specific downstream primer sequence designed according to the Chr7: 140453091-140453197(Hg19) amplicon of the BRAF gene is shown as SEQ ID: 32;the specific upstream primer sequence designed according to the Chr7: 55241604-55241726(Hg19) amplicon of EGFR gene is shown as SEQ ID: 33; the specific downstream primer sequence designed according to the Chr7: 55241604-55241726(Hg19) amplicon of the BRAF gene is shown as SEQ ID: 34;
  • 8. The method for constructing an amplicon library of a DNA sample according to claim 1, wherein the target amplicons are the following twenty-two species: Chr2:29432588-29432707 (Hg19) amplicon of the ALK gene, the sequence of which is shown in SEQ ID:5;Chr2: 29443616-29443730 (Hg19) amplicon of the ALK gene, the sequence of which is shown in SEQ ID:6;Chr7: 140453091-140453197(Hg19) amplicon of the BRAF gene, the sequence of which is shown in SEQ ID:7;Chr7: 55241604-55241726(Hg19) amplicon of the EGFR gene, the sequence of which is shown in SEQ ID:8;Chr7: 55242398-55242513 (Hg19) amplicon of the EGFR gene, the sequence of which is shown in SEQ ID:9;Chr7: 55248970-55249096 (Hg19) amplicon of the EGFR gene, the sequence of which is shown in SEQ ID:10;Chr7: 55259505-55259621 (Hg19) amplicon of the EGFR gene, the sequence of which is shown in SEQ ID:11;Chr17: 37880969-37881082 (Hg19) amplicon of the ERBB2 gene, the sequence of which is shown in SEQ ID:12;Chr12: 25380261-25380363 (Hg19) amplicon of the KRAS gene, the sequence of which is shown in SEQ ID:13;Chr12: 25398183-25398310 (Hg19) amplicon of the KRAS gene, the sequence of which is shown in SEQ ID:14;Chr7: 116340233-116340335 (Hg19) amplicon of the MET gene, the sequence of which is shown in SEQ ID:15;Chr7: 116411880-116412005 (Hg19) amplicon of the MET gene, the sequence of which is shown in SEQ ID:16;Chr7: 116417426-116417546 (Hg19) amplicon of the MET gene, the sequence of which is shown in SEQ ID:17;Chr7: 116423399-116423499 (Hg19) amplicon of the MET gene, the sequence of which is shown in SEQ ID:18;Chr1: 115256507-115256586 (Hg19) amplicon of the NRAS gene, the sequence of which is shown in SEQ ID:19;Chr1: 115258651-115258755 (Hg19) amplicon of the NRAS gene, the sequence of which is shown in SEQ ID:20;Chr3: 178936056-178936179 (Hg19) amplicon of the PIK3CA gene, the sequence of which is shown in SEQ ID:21;Chr3: 178952000-178952092 (Hg19) amplicon of the PIK3CA gene, the sequence of which is shown in SEQ ID:22;Chr17: 7577027-7577154 (Hg19) amplicon of the TP53 gene, the sequence of which is shown in SEQ ID:23;Chr17: 7577507-7577613 (Hg19) amplicon of the TP53 gene, the sequence of which is shown in SEQ ID:24;Chr17: 7578182-7578298 (Hg19) amplicon of the TP53 gene, the sequence of which is shown in SEQ ID:25; andChr17: 7578389-7578537 (Hg19) amplicon of the TP53 gene, the sequence of which is shown in SEQ ID:26.
  • 9. The method for constructing an amplicon library of a DNA sample according to claim 8, wherein the molar ratio of the combination of the upstream fusion primers designed according to the twenty-two target amplicons, the combination of the downstream fusion primers designed according to the above twenty-two target amplicons, the upstream universal primer and the downstream universal primer is: 0.1-0.3:0.1-0.3:0.5-1:0.5-1.
  • 10. The method for constructing an amplicon library of a DNA sample according to claim 9, wherein the PCR reaction system includes the following components: PCR master mix, 10 μl;DNA sample 1-8 μl total 20 ng;Primer combination for constructing an amplicon library of the same DNA sample 2 μl;DNAase-free H2O making up to 20 μl.
  • 11. The method for constructing an amplicon library of a DNA sample according to claim 6, wherein, the specific upstream primer sequence designed according to the Chr7: 55242398-55242513 (Hg19) amplicon of EGFR gene is shown as SEQ ID:35; the specific downstream primer sequence designed according to the Chr7: 55242398-55242513 (Hg19) amplicon of the EGFR gene is shown as SEQ ID: 36;the specific upstream primer sequence designed according to the Chr7: 55248970-55249096 (Hg19) amplicon of EGFR gene is shown as SEQ ID: 37; the specific downstream primer sequence designed according to the Chr7: 55248970-55249096 (Hg19) amplicon of the EGFR gene is shown as SEQ ID: 38;the specific upstream primer sequence designed according to the Chr7: 55259505-55259621 (Hg19) amplicon of EGFR gene is shown as SEQ ID: 39; the specific downstream primer sequence designed according to the Chr7: 55259505-55259621 (Hg19) amplicon of the EGFR gene is shown as SEQ ID: 40.
  • 12. The method for constructing an amplicon library of a DNA sample according to claim 6, wherein, the specific upstream primer sequence designed according to the Chr17: 37880969-37881082 (Hg19) amplicon of ERBB2 gene is shown as SEQ ID: 41; the specific downstream primer sequence designed according to the Chr17: 37880969-37881082 (Hg19) amplicon of the ERBB2 gene is shown as SEQ ID: 42;the specific upstream primer sequence designed according to the Chr12: 25380261-25380363 (Hg19) amplicon of KRAS gene is shown as SEQ ID: 43; the specific downstream primer sequence designed according to the Chr12: 25380261-25380363 (Hg19) amplicon of the KRAS gene is shown as SEQ ID: 44;the specific upstream primer sequence designed according to the Chr12: 25398183-25398310 (Hg19) amplicon of KRAS gene is shown as SEQ ID: 45; the specific downstream primer sequence designed according to the Chr12: 25398183-25398310 (Hg19) amplicon of the KRAS gene is shown as SEQ ID: 46.
  • 13. The method for constructing an amplicon library of a DNA sample according to claim 6, wherein, the specific upstream primer sequence designed according to the Chr7: 116340233-116340335 (Hg19) amplicon of MET gene is shown as SEQ ID: 47; the specific downstream primer sequence designed according to the Chr7: 116340233-116340335 (Hg19) amplicon of the MET gene is shown as SEQ ID: 48;the specific upstream primer sequence designed according to the Chr7: 116411880-116412005 (Hg19) amplicon of MET gene is shown as SEQ ID: 49; the specific downstream primer sequence designed according to the Chr7: 116411880-116412005 (Hg19) amplicon of the MET gene is shown as SEQ ID: 50;the specific upstream primer sequence designed according to the Chr7: 116417426-116417546 (Hg19) amplicon of MET gene is shown as SEQ ID: 51; the specific downstream primer sequence designed according to the Chr7: 116417426-116417546 (Hg19) amplicon of the MET gene is shown as SEQ ID: 52.
  • 14. The method for constructing an amplicon library of a DNA sample according to claim 6, wherein, the specific upstream primer sequence designed according to the Chr7: 116423399-116423499 (Hg19) amplicon of MET gene is shown as SEQ ID: 53; the specific downstream primer sequence designed according to the Chr7: 116423399-116423499 (Hg19) amplicon of the MET gene is shown as SEQ ID: 54;the specific upstream primer sequence designed according to the Chr1: 115256507-115256586 (Hg19) amplicon of NRAS gene is shown as SEQ ID: 55; the specific downstream primer sequence designed according to the Chr1: 115256507-115256586 (Hg19) amplicon of the NRAS gene is shown as SEQ ID: 56;the specific upstream primer sequence designed according to the Chr1: 115258651-115258755 (Hg19) amplicon of NRAS gene is shown as SEQ ID: 57; the specific downstream primer sequence designed according to the Chr1: 115258651-115258755 (Hg19) amplicon of the NRAS gene is shown as SEQ ID: 58.
  • 15. The method for constructing an amplicon library of a DNA sample according to claim 6, wherein, the specific upstream primer sequence designed according to the Chr3: 178936056-178936179 (Hg19) amplicon of PIK3CA gene is shown as SEQ ID: 59; the specific downstream primer sequence designed according to the Chr3: 178936056-178936179 (Hg19) amplicon of the PIK3CA gene is shown as SEQ ID: 60;the specific upstream primer sequence designed according to the Chr3: 178952000-178952092 (Hg19) amplicon of PIK3CA gene is shown as SEQ ID: 61; the specific downstream primer sequence designed according to the Chr3: 178952000-178952092 (Hg19) amplicon of the PIK3CA gene is shown as SEQ ID: 62;the specific upstream primer sequence designed according to the Chr17: 7577027-7577154 (Hg19) amplicon of TP53 gene is shown as SEQ ID: 63; the specific downstream primer sequence designed according to the Chr17: 7577027-7577154 (Hg19) amplicon of the TP53 gene is shown as SEQ ID: 64.
  • 16. The method for constructing an amplicon library of a DNA sample according to claim 6, wherein, the specific upstream primer sequence designed according to the Chr17: 7577507-7577613 (Hg19) amplicon of TP53 gene is shown as SEQ ID: 65; the specific downstream primer sequence designed according to the Chr17: 7577507-7577613 (Hg19) amplicon of the TP53 gene is shown as SEQ ID: 66;the specific upstream primer sequence designed according to the Chr17: 7578182-7578298 (Hg19) amplicon of TP53 gene is shown as SEQ ID: 67; the specific downstream primer sequence designed according to the Chr17: 7578182-7578298 (Hg19) amplicon of the TP53 gene is shown as SEQ ID: 68;the specific upstream primer sequence designed according to the Chr17: 7578389-7578537 (Hg19) amplicon of TP53 gene is shown as SEQ ID: 69; the specific downstream primer sequence designed according to the Chr17: 7578389-7578537 (Hg19) amplicon of the TP53 gene is shown as SEQ ID: 70.
Priority Claims (1)
Number Date Country Kind
201710218529.4 Apr 2017 CN national
PCT Information
Filing Document Filing Date Country Kind
PCT/CN2018/080864 3/28/2018 WO 00