The instant application contains a Sequence Listing that has been submitted in ASCII format via EFS-Web and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Apr. 2, 2015, is named LNK—164 US_SEQID_ST25.txt and is 14,856 bytes in size.
The present invention relates to a method for the simultaneous amplification of a plurality of different nucleic acid target sequences, to a kit for carrying out the method and to a library of nucleic acid polymers, in particular a DNA or a RNA library. The invention further relates to the use of the method for a gene probe assay as well as in molecular cloning.
The detection of specific nucleic acid polymers is an important tool for diagnostic medicine and molecular biology research. Gene probe assays currently play a role e.g. in identifying infectious organisms such as bacteria and viruses, in probing the expression of normal genes and in identifying mutant genes such as oncogenes, in tissue typing for compatibility preceding tissue transplantation, in matching tissue or blood samples for forensic medicine, and for exploring homology among genes from different species.
Ideally, a gene probe assay should be sensitive, specific and easily automatable. The requirement for sensitivity (i.e. low detection limits) has been greatly improved by the development of the polymerase chain reaction (PCR) and other amplification technologies which allow researchers to amplify exponentially a specific target sequence before analysis. The PCR technology is for example described in U.S. Pat. No. 4,683,202.
In the last years progress has been made by the development of new technologies which are promising in reducing costs and accelerating the development of new molecular diagnostics. DNA analysis instruments are becoming increasingly more powerful in the capacity of sequence analysis. DNA resequencing microarrays (Chee et al., 1996, Patil et al., 2001) and high throughput parallel sequencing instruments (Margulies et al., 2005, Shendure et al., 2005) are currently used for whole genome analyses of low complexity genomes down to single nucleotide resolution. However, the human genome remains too large to access without complexity reduction by directed amplification of specific sequences. To match the throughput of these instruments, the amplification bottleneck needs to be addressed with more efficient technologies. Enrichment of target sequences becomes therefore key for comprehensive resequencing of human exons at a fraction of the cost of whole-genome sequencing. Recently several sequence capture methods have been developed like molecular inversion probe technology (Dahl et al., 2007, Dahl et al., 2005, Porreca et al., 2007), approaches using microarray technologies (Okou et al., 2007, Hodges et al., 2007, Albert et al., 2007), hybridization in solution technologies using RNA oligo capture probes (Gnirke et al., 2009), or microfluidic technology using emulsion PCR in small droplets (Tewhey et al., 2009).
In theory, the currently most powerful and fastest amplification technology is PCR and is widely used in molecular diagnostics. To increase assay throughput and allow for more efficient use of precious DNA samples, simultaneous amplification of several targets can be carried out by combining many specific primer pairs in individual PCRs (Chamberlain et al., 1988, Shigemori et al., 2005). However, it is one of the crucial problems with PCR that when large numbers of specific primer pairs are added to the same reaction, both correct and incorrect amplicons are generated. In addition, even when primer dimers can be avoided and specific amplification is achieved the targets have different PCR efficiencies due to amplicon length and sequence properties (GC content). At a later stage, this skews the uniformity of the products to the point where many amplicons drop out in favor of highly efficient amplified amplicons and artifacts. In order to optimize multiplex PCR, the concentrations of primer, buffer dNTPs, enzymes, and MgCl2 need to be determined empirically for each set of primer combinations. This is a time-consuming process which needs to be conducted for each lot of the produced assay. A successful multiplex PCR is not guaranteed even after exhaustive optimization experiments.
Even with careful attention paid to the design of primers in case of multiplexing, PCR is usually limited to 10-20 simultaneous reactions before yield and evenness is compromised by the accumulation of irrelevant amplification products (Syvänen, 2005, Broude et al., 2001). Therefore, large numbers of separate PCRs are typically performed whenever many genomic sequences need to be analyzed. Thus the major challenge in multiplexing PCR is to overcome two major problems: the incompatibility of primers leading to unspecific amplifications (like primer dimers) and the differences in amplification efficiencies of different targets.
Considering the drawbacks of the methods according to the state of the art, the problem to be solved by the present invention is thus to provide a simple, rapid and inexpensive method for simultaneously amplifying a plurality of different nucleic acid target sequences, in particular DNA and/or RNA target sequences. In this regard, the method shall allow amplification of practically all target sequences at a more uniform abundance than with conventional methods, and in particular with standard PCR. This invention provides a novel multiplex technology solving both fundamental problems thereby allowing uniform amplification of multiple targets in one single reaction.
The principle of this method is based on the fact that a single mismatch at the 3 prime end of the primer/template hybrid strongly inhibits PCR amplification. Although a single 3 prime mismatch may allow primer annealing, the extension step performed by the polymerase is inhibited (
Sequence Capture: Preparation of Efficiency Tagged Template Library from Genomic DNA (Preferred Embodiment)
To select the regions of interest (i.e. certain exons of a gene) from a DNA source like genomic DNA or cDNA, we developed a novel sequence capture approach. The method results in single stranded copy of the regions of interest flanked by efficiency tags and the common priming sequence for further uniform amplification with etPCR as described above. Targeting of specific sequences is achieved through a hybridization step of oligonucleotides flanking the region of interest. The oligonucleotides consist of four different parts: a target specific sequence, an efficiency tag, the common priming sequence and a “exonuclease block” consisting of phosphothioates at their outer end (
The present invention relates to the following embodiments (1) to (15).
(1) A method for the simultaneous amplification of a plurality of different nucleic acid target sequences comprising the steps of
providing a set of forward primer oligonucleotides capable of annealing to the same nucleotide sequence, said set comprising a first forward primer oligonucleotide having the structure
5′-X—N1-3′,
and a second forward primer oligonucleotide having the structure
5′-X—N1—N2-3′,
wherein X is a nucleotide sequence which is capable of annealing to a first primer annealing sequence X′, N1 is nothing or consists of one or more nucleotides, and N2 consists of one or more nucleotides;
providing a plurality of different nucleic acid polymers as templates, each template comprising (i) a forward primer annealing sequence X′ which is complementary to the nucleotide sequence X, and (ii) a specific target sequence; and
amplifying the templates by a polymerase dependent amplification reaction using said set of forward primer oligonucleotides and one or more reverse primer oligonucleotide(s), characterized in that the 3′-terminal nucleotide of the first forward primer oligonucleotide, when annealed to the templates, has a perfect match with at least two different template sequences, and the 3′-terminal nucleotide of the second forward primer oligonucleotide, when annealed to the templates, has a mismatch with at least one of said at least two different template sequences and a perfect match with at least one of said at least two different template sequences.
(2) The method of (1), further comprising the steps of providing a set of reverse primer oligonucleotides capable of annealing to the same nucleotide sequence, comprising a first reverse primer oligonucleotide having the structure
5′-Y-M1-3′,
and a second reverse primer oligonucleotide having the structure
5′-Y-M1-M2-3′
wherein Y is a nucleotide sequence which is capable of annealing to a reverse primer annealing sequence, M1 is nothing or consists of one or more nucleotides, and M2 consists of one or more nucleotides;
wherein each template further comprises a reverse primer annealing sequence which is complementary to the nucleotide sequence Y, the target sequence is located between the forward primer annealing sequence and the reverse primer annealing sequence, and the polymerase dependent amplification reaction is carried out using said set of forward primer oligonucleotides and said set of reverse primer oligonucleotides,
characterized in that the 3′-terminal nucleotide of the first reverse primer oligonucleotide, when annealed to the templates, has a perfect match with at least two different template sequences, and the 3′-terminal nucleotide of the second reverse primer oligonucleotide, when annealed to the templates, has a mismatch with at least one of said at least two different template sequences and a perfect match with at least one of said at least two different template sequences
(3) The method of (2), wherein the number of templates is v, and each template comprises the structure
5′-X-etXw-Tw-etY′w-Y′-3′
wherein
v is an integer greater than 1,
w is an integer running from 1 to v, each specific template being assigned an individual value w,
X is as defined in claim 1,
etXw is a first efficiency tag sequence,
Tw is the target sequence or complement thereof,
etY′w is the complementary sequence of a second efficiency tag sequence,
Y′ is the reverse primer annealing sequence.
(4) The method of (3), characterized in that each efficiency tag sequence comprises from 1 to 10 nucleotides, preferably from 2 to 7 nucleotides, most preferably from 3 to 5 nucleotides.
(5) The method of (3) or (4), characterized in that each of the templates is provided by the subsequent steps of:
providing a single stranded primal nucleic acid polymer comprising a primal target sequence to be amplified;
hybridizing to the 5′-end of the primal target sequence an oligonucleotide probe, the sequence of the oligonucleotide probe comprising a portion of the target sequence complementary to the 5′-end of the primal target sequence, the primer annealing sequence and the efficiency tag sequence, and to the 3′-end of the primal target sequence a further oligonucleotide probe, the sequence of the further oligonucleotide probe comprising a portion of the target sequence complementary to the 3′-end of the primal target sequence, the primer annealing complementary sequence and the efficiency tag complementary sequence;
synthesizing a strand complementary to the primal target sequence by means of a polymerase and a ligase to produce the template; and
isolating the templates produced.
(6) The method of (5), characterized in that the ends of the template produced are protected against exonucleases.
(7) The method of (5) or (6), characterized in that the template produced comprises free ends, one or more nucleotides in the region of both ends being modified to form an exonuclease protection.
(8) The method of (7), characterized in that the one or more modified nucleotides are phosphorothioated.
(9) The method of any of (5) to (8), characterized in that the step of isolating the templates produced is performed by digesting the remaining nucleic acid components with an exonuclease.
(10) A library of nucleic acid polymers, in particular a DNA or a RNA library, comprising a plurality of templates as defined in any one of (2) to (9).
(11) A kit for carrying out the method according to any of (1) to (9) comprising a first set of oligonucleotide probes, the sequence of each oligonucleotide probe of the first set comprising
The method of the invention comprises the step of providing a set of forward primer oligonucleotides. The set comprises r different forward primer oligonucleotides, wherein r is an integer greater than 1. That is, r is at least 2, preferably at least 3, more preferably at least 4, most preferably at least 5. Typically, r ranges from 2 to 20, preferably from 2 to 10, more preferably from 3 to 7, most preferably r is 4 or 5.
A first forward primer oligonucleotide has the structure
5′-X—N1-3′,
and a second forward primer oligonucleotide having the structure
5′-X—N1—N2-3′,
wherein X is a nucleotide sequence which is capable of annealing to a first primer annealing sequence X′, N1 is nothing or consists of one or more nucleotides, and N2 consists of one or more nucleotides. Each forward primer within the set of forward primer oligonucleotides is capable of annealing to the same nucleotide sequence via its portion X. The sequence N1 may be nothing or consist of one or more nucleotides, e.g. of 1 to 20 nucleotides. The sequence N2 may independently consist of 1 to 20 nucleotides. Preferably, N2 consists of 1 to 10, more preferably of 1 to 5, most preferably of 1 to 3 nucleotides, e.g. of 1, 2 or 3 nucleotides. Preferably, N2 consists of one nucleotide.
The different forward primer oligonucleotides typically differ only in their 3′ ends, i.e. in the sequence which is located 3′ to the sequence X.
In one aspect of the invention, the set of forward primers comprises r different forward primer oligonucleotides, and the structure of primer No. q is 5′-X-(n)(q-1)-3′, wherein q ranges from 1 to r, r is as defined above, and each n independently is any nucleotide. In other words, the first forward primer oligonucleotide (i.e. q=1) has the structure: 5′-X-3′; the second forward primer oligonucleotide (i.e. q=2) has the structure: 5′-X-n-3′; the third forward primer oligonucleotide (i.e. q=3) has the structure: 5′-X-nn-3′; the fourth forward primer oligonucleotide (i.e. q=4) has the structure: 5′-X-nnn-3′; and the fifth forward primer oligonucleotide (i.e. q=5) has the structure: 5′-X-nnnn-3′. This list can be extended. Preferably, each n is independently selected from the group consisting of the nucleotides a, c, g and t.
According to a preferred embodiment of this aspect, the set of forward primer oligonucleotides comprises 4 different forward primer oligonucleotides (r=4), the first forward primer oligonucleotide has the structure: 5′-X-3′; the second forward primer oligonucleotide has the structure: 5′-X-n-3′; the third forward primer oligonucleotide has the structure: 5′-X-nn-3′; and the fourth forward primer oligonucleotide has the structure: 5′-X-nnn-3′.
According to a another preferred embodiment of this aspect, the set of forward primer oligonucleotides comprises 5 different forward primer oligonucleotides (r=5), the first forward primer oligonucleotide has the structure: 5′-X-3′; the second forward primer oligonucleotide has the structure: 5′-X-n-3′; the third forward primer oligonucleotide has the structure: 5′-X-nn-3′; the fourth forward primer oligonucleotide has the structure: 5′-X-nnn-3′; and the fifth forward primer oligonucleotide has the structure: 5′-X-nnnn-3′.
According to a yet another preferred embodiment of this aspect, the set of forward primer oligonucleotides comprises 6 different forward primer oligonucleotides (r=6), the first forward primer oligonucleotide has the structure: 5′-X-3′; the second forward primer oligonucleotide has the structure: 5′-X-n-3′; the third forward primer oligonucleotide has the structure: 5′-X-nn-3′; the fourth forward primer oligonucleotide has the structure: 5′-X-nnn-3′; the fifth forward primer oligonucleotide has the structure: 5′-X-nnnn-3′; and. the sixth forward primer oligonucleotide has the structure: 5′-X-nnnnn-3′.
X is a nucleotide sequence which is capable of annealing to a first primer annealing sequence. X has a length of at least 6 nucleotides, preferably of at least 8, more preferably of at least 10, most preferably of at least 12 nucleotides. Typically, the length of X ranges from 6 to 100, preferably from 8 to 75, more preferably from 10 to 50, more preferably from 12 to 30, most preferably from 15 to 25 nucleotides.
In order to prevent digestion of the primers of structure 5′-X-(n)(q-1)-3′ by polymerase exonuclease activity, its ends are preferably protected against exonucleases. Particularly, to prevent nucleotide removal by the exonuclease activity of certain polymerases, one or more nucleotides at the 3′ end being modified to form an exonuclease protection. More particularly, the one or more modified nucleotides are phosphorothioated.
The method of the invention further comprises providing a plurality of different nucleic acid polymers as templates, each template comprising a specific target sequence and a forward primer annealing sequence which is complementary to the nucleotide sequence X. The number of different nucleic acid templates is at least 2, preferably at least 3, more preferably at least 5. Typically, the number of different templates provided ranges from 2 to 100,000, preferably from 3 to 1000, more preferably from 4 to 500, more preferably from 5 to 200, most preferably from 10 to 50. Preferably, the forward primer annealing sequence is located upstream to the specific target sequence, i.e. 5′ to the target sequence. It is preferred that the forward primer annealing sequence and the target sequence are separated by a so-called ‘efficiency tag sequence’ as explained further below.
The length of the target sequence may range from about 10 to about 50,000 nucleotides; preferably it ranges from about 50 to about 10,000 nucleotides, more preferably from about 75 to about 5,000 nucleotides, most preferably from about 100 to about 1,500 nucleotides. The templates usually have identical primer annealing sequences and differ in their target sequences.
The method of the invention further comprises amplifying the templates by a polymerase dependent amplification reaction using said set of forward primer oligonucleotides and one or more reverse primer oligonucleotide(s). In one embodiment, the reverse primer is a single oligonucleotide capable of annealing to substantially all template molecules, preferably at a location downstream to the target sequence. In another embodiment, a set of reverse primer oligonucleotides is used. This latter embodiment will be explained further below.
According to this invention the 3′-terminal nucleotide of the first forward primer oligonucleotide has a perfect match with at least two different template sequences, whereas the second forward primer oligonucleotide has a mismatch with at least one of said at least two different template sequences. That is, in its simplest variant, the first forward primer oligonucleotide will amplify two different templates, and the second forward primer oligonucleotide will amplify only one of these two different templates.
The method of the invention may further comprise the steps of providing a set of reverse primer oligonucleotides capable of annealing to the same nucleotide sequence within the template sequence. The set comprises p different reverse primer oligonucleotides, wherein p is an integer greater than 1. That is, p is at least 2, preferably at least 3, more preferably at least 4, most preferably at least 5. Typically, p ranges from 2 to 20, preferably from 2 to 10, more preferably from 3 to 7, most preferably p is 4 or 5. Each reverse primer within the set of reverse primer oligonucleotides is capable of annealing to the same nucleotide sequence via its portion Y. The first reverse primer oligonucleotide has the structure
5′-Y-M1-3′,
and a second reverse primer oligonucleotide having the structure
5′-Y-M1-M2-3′,
wherein Y is a nucleotide sequence which is capable of annealing to a reverse primer annealing sequence, M1 is nothing or consists of one or more nucleotides, and M2 consists of one or more nucleotides. The sequence M1 may be nothing or consist of one or more nucleotides, e.g of 1 to 20 nucleotides. The sequence M2 may independently consist of 1 to 20 nucleotides. Preferably, M2 consists of 1 to 10, more preferably of 1 to 5, most preferably of 1 to 3 nucleotides, e.g. of 1, 2 or 3 nucleotides. Preferably, M2 consists of one nucleotide.
The different reverse primer oligonucleotides typically differ only in their 3′ ends, i.e. in the sequence which is located 3′ to the sequence Y.
In one aspect of the invention, the set of reverse primers comprises p different reverse primer oligonucleotides, and the structure of reverse primer No. s is 5′-Y-(n)(s-1)-3′, wherein s ranges from 1 to p, p is as defined above, and each n independently is any nucleotide. In other words, the first reverse primer oligonucleotide (i.e. s=1) has the structure: 5′-Y-3′; the second reverse primer oligonucleotide (i.e. s=2) has the structure: 5′-Y-n-3′; the third reverse primer oligonucleotide (i.e. s=3) has the structure: 5′-Y-nn-3′; the fourth reverse primer oligonucleotide (i.e. s=4) has the structure: 5′-Y-nnn-3′; and the fifth reverse primer oligonucleotide (i.e. s=5) has the structure: 5′-Y-nnnn-3′. This list can be extended. Preferably, each n is independently selected from the group consisting of the nucleotides a, c, g and t.
According to a preferred embodiment of this aspect, the set of reverse primer oligonucleotides comprises 4 different reverse primer oligonucleotides (p=4), the first reverse primer oligonucleotide has the structure: 5′-Y-3′; the second reverse primer oligonucleotide has the structure: 5′-Y-n-3′; the third reverse primer oligonucleotide has the structure: 5′-Y-nn-3′; and the fourth reverse primer oligonucleotide has the structure: 5′-Y-nnn-3′.
According to a another preferred embodiment of this aspect, the set of reverse primer oligonucleotides comprises 5 different reverse primer oligonucleotides (p=5), the first reverse primer oligonucleotide has the structure: 5′-Y-3′; the second reverse primer oligonucleotide has the structure: 5′-Y-n-3′; the third reverse primer oligonucleotide has the structure: 5′-Y-nn-3′; the fourth reverse primer oligonucleotide has the structure: 5′-Y-nnn-3′; and the fifth reverse primer oligonucleotide has the structure: 5′-Y-nnnn-3′.
According to a yet another preferred embodiment of this aspect, the set of reverse primer oligonucleotides comprises 6 different reverse primer oligonucleotides (p=6), the first reverse primer oligonucleotide has the structure: 5′-Y-3′; the second reverse primer oligonucleotide has the structure: 5′-Y-n-3′; the third reverse primer oligonucleotide has the structure: 5′-Y-nn-3′; the fourth reverse primer oligonucleotide has the structure: 5′-Y-nnn-3′; the fifth reverse primer oligonucleotide has the structure: 5′-Y-nnnn-3′; and the sixth reverse primer oligonucleotide has the structure: 5′-Y-nnnnn-3′.
Y is a nucleotide sequence which is capable of annealing to a reverse primer annealing sequence. Y has a length of at least 6 nucleotides, preferably of at least 8, more preferably of at least 10, most preferably of at least 12 nucleotides. Typically, the length of Y ranges from 6 to 100, preferably from 8 to 75, more preferably from 10 to 50, more preferably from 12 to 30, most preferably from 15 to 25 nucleotides.
According to this invention the 3′-terminal nucleotide of the first reverse primer oligonucleotide has a perfect match with at least two different template sequences, whereas the second reverse primer oligonucleotide has a mismatch with at least one of said at least two different template sequences. That is, in its simplest variant, the first reverse primer oligonucleotide will amplify two different templates, and the second reverse primer oligonucleotide will amplify only one of these two different templates.
In one embodiment, each template comprises a reverse primer annealing sequence which is complementary to the nucleotide sequence Y, the target sequence is located between the forward primer annealing sequence and the reverse primer annealing sequence, and the polymerase dependent amplification reaction is carried out using said set of forward primer oligonucleotides and said set of reverse primer oligonucleotides.
In one aspect of this invention, the number of templates is v, and each template comprises the structure
5′-X-etxw-Tw-etrw-Y′-3′
wherein
v is an integer greater than 1,
w is an integer running from 1 to v, each specific template being assigned an individual value w,
X is as defined in claim 1,
etXw is a first efficiency tag sequence,
Tw is the target sequence or complement thereof,
etY′w is the complementary sequence of a second efficiency tag sequence,
Y′ is the reverse primer annealing sequence.
As will be shown in detail below, the present invention allows a “graded” amplification reaction to be performed in the sense that the amplification efficiency can be adapted specifically for each target sequence. In particular in multiplex PCR, the amplification efficiency of the different targets can thus be levelled leading to a more or less uniform number of replicates for each target.
According to a very straightforward and thus particularly preferred embodiment, each template comprises between the primer annealing sequence and the target sequence a specific efficiency tag sequence (ETS). Depending on their specific ETS, the templates can thus be divided into different template groups, whereby the number of primer oligonucleotides having an extension fully matching the ETS or fully matching a portion of the ETS, is different from template group to template group. The ETS thus permits on the one hand a selective polymerase-mediated extension for primer oligonucleotides having an extension fully matching the ETS or fully matching a portion of the ETS. On the other hand, only inefficient polymerase mediated extension will occur for primer oligonucleotides having an extension that does not match or only partly matches the ETS or a portion thereof.
By appropriately attributing the different ETS to the different targets, a less efficient amplification can be achieved for high abundance amplicons.
More specifically, in a “graded” amplification reaction the lowest grade of efficiency is achieved for an ETS which shows no complementarity with any of the extensions of the primer oligonucleotides, since for this, the only primer oligonucleotide of the set that can be extended by polymerase is the one having no extension at all. A higher grade is achieved for an ETS showing complementarity with the first nucleotide of the extension of the oligonucleotides, since for this, the primer oligonucleotide having no extension at all and the primer oligonucleotide that is extended by one single nucleotide will anneal and can be extended by polymerase. An even higher efficiency is achieved for an ETS showing complementarity with the first two nucleotides of the extension and so on.
The ETS preferably comprises from 1 to 10 nucleotides, more preferably from 2 to 7 nucleotides, most preferably from 3 to 5 nucleotides. If, for example, an ETS having 4 nucleotides is used, five different efficiency grades can be established, one for an ETS fully corresponding to all nucleotides of the extension of the primer oligonucleotide, one for an ETS complementary only to the first three nucleotides of the extension, one for an ETS complementary only to the first two nucleotides of the extension, one for an ETS complementary only to the first nucleotide of the extension and one for an ETS which shows no complementarity with the extension at all.
According to a particularly preferred embodiment, the primer annealing sequence is identical for all templates. Thus, primer oligonucleotides comprising a universal primer sequence can be used in this embodiment, allowing both amplification of targets and their subsequent sequencing.
As for the set of primer oligonucleotides described above, the further primer annealing sequence is preferably identical for all templates. Thus, primer oligonucleotides comprising a universal primer sequence can also be used for the further set used in this embodiment.
It is in this regard also preferred that the only difference between the primer oligonucleotides of the further set is in the length of their extension, as it is the case for the set of primer oligonucleotides described above. This allows the template complementary strands to be divided into different “complementary strand groups”, whereby the number of primer oligonucleotides of the further set having an extension fully matching the further ETS or a portion thereof, is different from “complementary strand group” to “complementary strand group”. The ETS thus permits a selective polymerase mediated extension for extended primer oligonucleotides having an extension fully matching the further ETS or fully matching a portion of the ETS, which on the one hand allows for selective and thus highly efficient amplification of low efficient amplifiable targets, and an insufficient annealing of all other primer oligonucleotides, which on the other hand allows for less efficient amplification of high abundance targets, as mentioned above in connection with the ETS of the templates.
According to a further preferred embodiment, one or more regions of at least a portion of the templates and/or of the primer oligonucleotides encode a bar code, thus allowing attributing the replicated templates to their origins in an easy manner. In particular when DNA from different patients is assayed in parallel, such as in multiplex PCR, the bar code allows attributing the replicated DNA sequences to each individual patient (Binladen et al. (2007) PLoS ONE 2(2):e197, incorporated herein by reference).
For providing the templates comprising—in addition to the specific target sequence—also an ETS and a primer annealing sequence, a method is preferably used comprising the subsequent steps of:
providing a single stranded primal nucleic acid polymer comprising a primal target sequence to be amplified;
hybridizing to the 5′-end of the primal target sequence an oligonucleotide probe, the sequence of the oligonucleotide probe comprising a portion of the target sequence complementary to the 5′-end of the primal target sequence, the ETS and the primer annealing sequence, and to the 3′-end of the primal target sequence a further oligonucleotide probe, the sequence of the further oligonucleotide probe comprising a portion of the target sequence complementary to the 3′-end of the primal target sequence, the efficiency tag complementary sequence and the primer annealing complementary sequence;
synthesizing a strand complementary to the primal target sequence by means of a polymerase and a ligase to produce the template; and
isolating the templates produced.
In order to allow the ligase to close the nick between the strand produced and the oligonucleotide probe at the 3′-end (comprising the portion of the target complementary to the 5′-end of the primal target sequence), said oligonucleotide probe is generally 5′-end phosphorylated.
Thus, a newly synthesized single nucleic acid strand comprising the target sequence and the primer annealing sequences can be obtained which can then be used for amplification in a universal PCR.
Also, a “tailor-made” ETS can be introduced for each template by this method, ultimately allowing the amplification efficiency of each template to be modulated, as described in detail above.
Since the present invention is preferably used for gene probe assays, in particular for identifying infectious organisms or mutant genes, or for molecular cloning, the primal nucleic acid polymer is at least one selected from the group consisting of genomic DNA, mitochondrial DNA, mRNA, viral DNA, bacterial DNA, viral RNA and cDNA.
In order to allow easy isolation of the template produced, its ends are preferably protected against exonucleases. Particularly, the template produced comprises free ends, one or more nucleotides in the region of both ends being modified to form an exonuclease protection. More particularly, the one or more modified nucleotides are phosphorothioated.
Thus, the step of isolating the templates produced can be easily performed by digesting the remaining nucleic acid components using an exonuclease, leaving only the protected templates intact. The method for providing the templates is in the context of the present invention also referred to as “enrichment step”.
According to a further preferred embodiment, the oligonucleotide probe is synthesized on a microchip.
Alternatively to the method using an ETS, it is also thinkable that a set of primer oligonucleotides is used at least some of which are blocked and thus not able to be extended by polymerases. Depending on the desired grade of efficiency for each template to be amplified, the ratio of blocked species to unblocked species can be adapted for each primer oligonucleotide. In view of achieving a more uniform amplification, the ratio of blocked primer oligonucleotides is higher for more abundant target sequences, and lower for less abundant target sequences.
According to a further aspect, the present invention further relates to a DNA or a RNA library comprising templates as described above. For example a plurality of multiple DNA probes, which are used for hybridization procedures, can be synthesized on a microchip and released as DNA probe pool in solution. Such a DNA probe pool can be amplified using PCR. Using universal primer annealing sequences on the synthesized DNA probes, the DNA probe pool can be amplified using one primer pair. The efficiency and the final amount of the single DNA probes mainly depend on the target sequence like length and sequence composition. The “graded” PCR can be applied to obtain a more uniform amplification and therefore nearly equal amounts of each DNA probe. In some instances it is desired to produce higher amount of certain DNA probes and/or lower amount of certain DNA probes. Using graded PCR the amplification efficiency of each DNA probe can be adjusted according the desired final probe amount.
According to a still further aspect, the present invention further relates to a kit for carrying out the method described above.
Said kit comprises
a first set of oligonucleotide probes, the sequence of each oligonucleotide probe of the first set comprising
Preferably, the kit further comprises a polymerase and a ligase. As mentioned above, each oligonucleotide probe of the first set is typically 5′-end phosphorylated in order to allow the ligase to close the nick between said oligonucleotide probe and the strand produced. Further the first one to six nucleotides at the 3′-end of the first set of oligonucleotide probes are modified to be resistant against exonuclease cleavage. The last one to six nucleotides at the 5′-end of the second set of oligonucleotide probes are modified to be resistant against exonuclease cleavage. More particularly, the one or more modified nucleotides are phosphorothioated
Since the present invention is particularly suitable for gene probe assays, in particular for identifying infectious organisms or mutant genes, the present invention further relates to the use of the method described above for this purpose.
Alternatively, the present invention also relates to the use of the described method in molecular cloning.
The method present invention is illustrated further by way of the attached Figures.
In the embodiment shown in
The oligonucleotide probes (OP) 4 can be divided into two parts:
Each of the oligonucleotides of the first part 4a, 4b comprises a portion 3a, 3b, respectively, of a target sequence, complementary to the 5′-end of one of the primal target sequences 2a, 2b, respectively, a primer annealing sequence 6 and an ETS 8 located between the portion of a target sequence and the primer annealing sequence.
Each of the oligonucleotide probes of the second part 4a′, 4b′ comprises a portion 3a′, 3b′, respectively, of a target sequence complementary to the 3′-end of one of the primal primal target sequences 2a, 2b, respectively, a primer annealing complementary sequence 6′ and an efficiency tag complementary sequence 8′ located between the portion of the target sequence and the primer annealing complementary sequence.
Further, the oligonucleotide probes of the first part comprise an exonuclease-block 12 at their 3′-end, whereas the oligonucleotide probes of the second part comprise an exonuclease-block 12′ at their 5′-end. The exonuclease-block can be achieved in numerous ways. According to a preferred embodiment, phosphorothioated, nuclease resistant nucleotides are added to both ends of the flanked target sequence.
Then, the oligonucleotide probes 4a, 4b of the first part are hybridized with the 5′-end of the respective primal target sequence 2a, 2b and the oligonucleotide probes 4a′, 4b′ of the second part are hybridized with the 3′-end of the respective primal target sequence 2a, 2b. Hybridisation comprises both denaturation of the genomic DNA, typically carried out at 95° C. for 10 minutes, and annealing of the oligonucleotide probes, typically at about 60° C. for 14 hours.
After hybridization, the gap between the flanking oligonucleotide probes is filled by synthesizing the strand complementary to the target sequence by means of a polymerase 14, which fills the gap by adding nucleotides. By means of a ligase 16, the nick between the strand produced and the probe at the 3′-end is ultimately closed, as shown in
By the synthesizing steps, templates 18a, 18b are achieved which comprise at their 3′-end a primer annealing sequence 6 followed by an ETS 8 and at their 5′-end a primer annealing complementary sequence 6′ followed in direction to the 3′-end by an efficiency tag complementary sequence 8′. The target sequence 20a, 20b complementary to the primal target sequence 2a, 2b, respectively, is arranged between the ETS 8 and the efficiency tag complementary sequence 8′. Both the 3′- and the 5′-end of the template are protected by an exonuclease block 12, 12′, respectively.
In a further step, an exonuclease or a mixture of multiple exonucleases 22 is added which digests all nucleic acid polymers that are not exonuclease-blocked at both ends, i.e. all nucleic acid polymers apart from the templates 18a, 18b produced, as shown in
Based on the templates produced, PCR is then performed using a set of primer oligonucleotides 24. In
Abbreviations: E=Exonuclease; L=Ligase; Poly=Polymerase; PNAP=nucleic acid polymer; OP=oligonucleotide probes.
Abbreviations: CCS=common core sequence; PS=primer set; T1=target 1 with perfect matchas to all primers; T1=target 2 with mismatches to certain primers; ET=efficiency tag; Ef=PCR Efficiency.
In the specific example shown in
Depending on the specific ETS attributed to a given target sequence, the templates can be attributed to different template groups, the number of primer oligonucleotides having an extension fully matching the ETS or fully matching a portion of the ETS is different from template group to template group.
Abbreviations: PS=primer oligonucleotide set; T=targt; is =target sequence.
Abbreviations: Ex17=Exon 17; Ex18—19=Exon 18 and exon 19; Ex22=Exon 22.
Abbreviations: P1=standard PCR; P2=efficiency tag PCR; Ex17=Exon 17; Ex18—19=Exon 18 and exon 19; Ex22=Exon 22.
Abbreviations: AMP=amplicon; GT=genomic target; ETS_A=efficiency tag sequence A; ETS_B=efficiency tag sequence B; UPS_A=universal priming site A; UPS_B=universal priming site B.
Abbreviations: nFU=normalized Fluorescence Units; Cy=Cylces; S=amplicon size in base pairs; GC=GC content in percentage.
Abbreviations: nFU=normalized Fluorescence Units; Cy=Cylces; P1=standard PCR; P2=efficiency tag PCR; CF=correction factor; ET=efficiency tag.
Abbreviations: M=Size Marker; P1=standard PCR; P2=efficiency tag PCR; C=amplicon concentration after amplification in pmol/l; R=Ratio to lowest abundant target.
The invention is further illustrated by the following working examples:
Oligonucleotide probes are designed to target three genomic locations of the Calpain-3 gene, namely Exon 17, Exon 18&19 and Exon 22, as shown in
The reverse oligonucleotide probes (CAPN3_Exon17_rev_ET1, CAPN3_Exon18-19_rev_ET5 and CAPN3_Exon22_rev_ET1 for the respective exon) are phosphorylated at the 5′ end and comprise a portion of the target sequence complementary to the primal target sequence, the efficiency tag sequence (underlined), the universal reverse primer annealing sequence and six phosphorothioate analogues of nucleotides at their 3′ end (indicated by an asterisk).
The forward oligonucleotide probe (CAPN3_Exon17_for_ET1, CAPN3_Exon18-19_for_ET5 and CAPN3_Exon22_for_ET1) comprises six phosphorothioate analogues of nucleotides at their 5′ end, a universal forward primer annealing complementary sequence, an efficiency tag complementary sequence (underlined) and a portion of the target sequence complementary to the primal target sequence.
GATC
TTAACAGAGGAGCTTGCCTCACA
GATC
GCTCTTTGTTTTGCAAAGTGTCCG
GATC
TTAAAGGGAAAATAGAGGCAGGC
ATGCCGTCTTCTGCTTG*G*T*A*C*T*A
ATGCCGTCTTCTGCTTG*G*T*A*C*T*A
ATGCCGTCTTCTGCTTG*G*T*A*C*T*A
To hybridize the oligonucleotide probes to genomic DNA, a 10 μl reaction containing 200 pM oligonucleotide probes and 1 μg genomic DNA in 1× amplication buffer (Epicentre) is incubated at 95° C. for 5 min, cooled down to 60° C. in a PCR cycler using a ramp rate of 1° C. per minute. After 14 hours hybridization at 60° C. two units Stoffel Polymerase (Applied Biosystems), 10 units Ampligase (Epicentre) and dNTPS with a final concentration of 12 pM are added and incubated at 60° C. for 2 more hours. After the gap filling reaction the samples are digested using a exonuclease mix (Exonuclease I, Exonuclease III, Exonuclease Lambda) for 2 hours at 37° C. After heat inactivation of the exonuclease at 80° C. for 20 min, 1 μl of the resulting sample is used for uniform amplification using etPCR.
For uniform amplification using etPCR, a set of primer oligonucleotides comprising a universal primer sequence is used, as given in Table 2.
As primer oligonucleotides a 7:1:1:1:1 mixture of the forward primer oligonucleotides UFP1 (7 parts), UFP2 (1 part), UFP3 (1 part), UFP4 (1 part), UFP5 (1 part) is used at a concentration of 200 nM total forward primer oligonucleotides and 200 nM of universal reverse primer oligonucleotide 1 (URP1). PCR amplification is done using Power SYBR Green Master Mix (Applied Biosystems) and a StepOnePlus Thermocycler with the following PCR program: initial denaturation for 15 minutes at 95° C. followed by 40 amplification cycles (10 sec at 95° C., 15 sec at 60° C., 30 sec at 72° C.). Amplified targets are analyzed on a 1% agarose gel.
As depicted in
Although the specific working example refers to a method in which only on one end an ETS is provided for which five different primer oligonucleotides are used for the polymerase mediated extension, it is understood that an ETS and a set of different oligonucleotides may additionally be used for the opposite end. If also at the opposite end an ETS of four nucleotides and correspondingly a set comprising five different primer oligonucleotides are used, 25 different efficiency grades of amplification may be obtained.
As a model we selected the human dystrophin gene, which is the largest (not exon wise but coverage wise) known human gene consisting of 79 exons. Since the first report of multiplex PCR by Chamberlain the dystrophin gene has been used as a model for multiplex PCR also by other investigators. To establish our new technology we designed 78 different targets covering all 79 exons by using ExonPrimer. To allow fast analyis by gel electrophoresis we selected 12 targets which differ in size to be easily discriminated when resolved on a gel (
To prove the ability of etPCT to control PCR efficiency we first generated single templates with efficiency tags and the common priming sequence by PCR for each of the 12 targets (
We than investigated whether etPCR was able to influence the PCR efficiency of the same templates by using the set of five universal forward primers, just differing in length. The targets had different efficiency tags, and as expected, a tag matching the entire set of universal primers (TAG 5) had no influence on efficiency when performing etPCR (
We wondered whether these efficiency tags could be used to adjust the amplification efficiencies in multiplex reaction to obtain uniform amplification. Targeting oligonucleotides were designed and the capture reaction was performed as described in material and methods. The selection of the efficiency tags were made according the amplicon size to correct for size dependent amplification bias. Using our novel capture technology we were able to capture the targets from small amounts of genomic DNA (200 ng) and successfully amplify them by PCR (
We have developed efficiency tag PCR (etPCR), a novel method for multiplex PCR that is capable of uniformly amplifying multiple targets from genomic DNA simultaneously. The target selection protocol is an addition-only reaction and can be performed in a single tube per sample, making it amenable to automation. The application of etPCR is manifold: molecular diagnostics for genetic testing, prenatal testing, cancer profiling as well as for diagnosis of infectious disease organisms and their resistances. In addition it can be applied in forensic applications, detection of genetically modified organisms (GMO) in food and feed, environmental and water testing or synthetic biology.
In this study we focused on the application of etPCR in molecular diagnosis for inherented disorders like Duchene Muscular Dystrophy. The etPCR can be performed on multiple samples in parallel, which can then be labeled with sample-specific DNA barcodes and sequenced as a pool. The choice of targets and target boundaries is flexible, and a wide range of target sequences can be amplified simultaneously (here, 154 bp to 724 bp). Based on the obtained results further adjustment of the efficiency tag can be made, thereby improving uniformity. The number of cycles of adjustment that have to be performed to obtain best uniformity has to be evaluated.
Recently, several new methods have been developed for the multiplex selection, amplification, and sequencing of genomics subsets (Fredriksson et al. 2007, Bashiardes et al. 2005, Dahl et al. 2005, Dahl et al. 2007, Albert et al. 2007, Hodges et al. 2007, Meuzelaar et al. 2007, Okou et al. 2007, Porreca et al. 2007). Several of these methods have several performance disadvantages in different areas, such as the precise definition of target boundaries (Albert et al. 2007, Dahl et al. 2007, Dahl et al. 2005, Okou et al. 2007), the reproducible capture of target regions (Porreca et al. 2007), or the fraction of reads matching target sequences (Albert et al. 2007, Hodges et al. 2007, Okou et al. 2007). In this proof-of-principle study, we did not determine the upper limit of the number of target sequences that can be amplified by etPCR, making it difficult to directly compare our method to these technologies, particularly for applications where a high degree of multiplexing is required. However, etPCR should prove useful for the amplification of an intermediate number (10-1000) of candidate regions in a large number of samples. It is particularly well suited for these applications because it can incorporate sample-specific DNA barcodes, allowing for the precise definition of the boundaries of targeted sequences, is reproducible, is highly specific, and uniformly amplifies the targeted sequences.
We anticipate that etPCR will be useful for a variety of applications. Because the method is based on PCR, it will likely have the same sensitivity as PCR to detect pathogen DNA in a high background of host DNA (Elnifro et al. 2000; Akhras et al. 2007a, b) or to detect rare DNA biomarkers in samples (Fackler et al. 2006). Also, it is likely to have the sensitivity to amplify targets from degraded samples, an area for which there are no robust methods to allow for multiplexed or genome-wide amplification. Other applications that rely heavily on PCR may benefit from higher levels of multiplexing, such as the engineered assembly of many DNA fragments simultaneously in synthetic biology experiments (Reisinger et al. 2006; Forster and Church 2007). By barcoding different samples this method will be useful for selectively sequencing candidate regions in large cohorts of patients to identify variants associated with disease. EtPCR promises to improve many other methods that rely on the sensitivity of PCR and could benefit from higher multiplexing and uniformity such as pathogen detection, biomarker detection in body fluids, and for synthetic DNA assembly.
To design primer for targeting the 79 exons of the dystrophin gene we extracted the genomic sequence information from the GRCh37/hg19 build using the UCSC Genome Browser. Templates including target specific sequences for the target oligonucleotides were selected using the ExonPrimer software, which is based on the Primer3 algorithm. To facilitate fast analysis by gel electrophoresis we selected the 12 templates with a highest diversity concerning target size. To target specific genomic regions a left target oligonucleotide (LTO) and a right target oligonucleotide (RTO) are designed for each selected target. The LTO is capped at the 5 prime end by phosphotioate nucleotides functioning as an exonuclease block. The block is followed by a universal sequence common to all targets to allow PCR amplification and by an efficiency tag necessary to control uniform amplification. Finally, the 3 prime end is composed of a target specific sequence. The RTO is composed contrariwise, starting with the target specific sequence at the 5 prime end and ending with an exonuclease block at the 3 prime end. Additionally the RTO are 5 prime phosphorylated. Oligonucleotides were synthesized by Microsynth (Switzerland), pooled in groups with similar length and gel purified.
To analyze the effect of efficiency tags on single targets, templates were produced by standard PCR with the LTOs described above and “right” PCR Primers, which were complementary to the right target oligonucleotides without phosphorylation. Amplification was done with 100 ng genomic DNA, 200 nM of each primers and a commercially available Mastermix (SolisBiodyne) containing a hot start Taq Polymerase and 2.5 mM MgCl. PCR was performed according the following cycling protocol: 95° C. for 12 min, 35 cycles with 20 seconds for 95° C., 20 seconds 60° C., 1 minute 72° C., and a final extension step of 5 minutes at 72° C. PCR products were gel purified and quantified.
Quantitative PCR (qPCR) was performed using the StepOnePlus Cylcer (Applied Biosystems) and Power SYBR Green PCR Master Mix (Invitrogen). Primer concentration in all experiments was 200 nM and template concentrations were 10 attomole and 3.3 attomole. Fourty Cylces were performed with following steps: denaturation for 20″ at 95° C., annealing for 20″ at 60° C. and elongation at 72° C. for 60″. Individual PCR efficiencies were calculated by a linear regression analysis using the software package LinReg.
To enrich the selected targets a 10 μl capture reaction was established using following components: 1 fmol of each target oligonucleotide probes, 200 ng genomic DNA, 0.5 U Phusion Hot Start Polymerase, 5 U Ampligase, 0.1 mM dNTPs in 1× ampligase buffer (Epicentre). The reaction was performed in a PCR cycler with following steps: 1) 95° C. for 5 min, 56° C. for 2 h, and finally hold at 4° C. After the initial gap filling reaction 5 μl of an exonuclease cocktail (Exonuclease I, Exonuclease III, Exonuclease lambda) was added. After digestion of the not incorporated oligonucleotide probes as well as the genomic DNA for 1 hour at 37° C. the exonucleases were heat inactivated for 10 min at 80° C. and the samples were stored at 4° C. Before amplification by etPCR 2 μl of 50 mM EDTA was added. For etPCR amplification 5 μl of this capture reaction was used.
For the etPCR following set of universal primer oligonucleotides were use: a set of forward primer oligonucleotides consisting of F1 (1 part), F2 (2 parts), F3 (3 parts), F4 (4 parts), F5 (5 parts) and R1 as reverse primer oligonucleotide. The 3′ ends of the primer were blocked to prevent digestion by the 3′ exonuclease activity of proof reading polymerase like the Phusion polymerase. PCR amplification was done in 30 μl using 0.2 mM dNTPs, 200 nM of total forward primer oligonucleotides and 200 nM of reverse primer oligonucleotide, 5 ul of capture reaction, 1×GC Phusion buffer, 0.3 μl Phusion Hot Start polymerase, 2.5 mM MgCl2. The amplification reaction was performed in a Thermocylcer using following cycling program: initial denaturation for 15 minutes at 95° C. followed by 40 amplification cycles (10 sec at 95° C., 20 sec at 60° C., 45 sec at 72° C.). Amplified targets were analyzed on a 1.8% agarose gel and using the bioanalyszer 2100 system from agilent.
Number | Date | Country | Kind |
---|---|---|---|
12190754.7 | Oct 2012 | EP | regional |
This application corresponds to the U.S. national phase of International Application No. PCT/EP2013/072749, filed Oct. 30, 2013, which, in turn, claims priority to European Patent Application No. 12.190754.7 filed Oct. 31, 2012, the contents of which are incorporated by reference herein in their entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2013/072749 | 10/30/2013 | WO | 00 |