The present invention relates to a nucleic acid probe for detecting target nucleic acid sequences, and more particularly to a probe for detecting variant nucleic acid sequences from genomic or amplified DNA. Further aspects of the invention relate to methods for detecting variant nucleic acid sequences.
Nucleic acid probes are often used to identify the presence of specific target sequences in genomic or amplified DNA. The annealing or melting temperature of the probe to the target is affected by the length of the complementary region shared by the probe and the target, and by the existence of any mismatches between the otherwise complementary base pairs. This can be used to detect the presence of variants, for example SNPs or multiple repeats. A probe can be designed to have a first melting temperature (Tm) to a wild type sequence, and the annealing of the probe to the target monitored, for example through development of fluorescence on annealing. If the Tm is different to the expected value, then the target sequence includes a variant.
However, this process may be difficult to implement with longer probe regions. In particular, a long probe may not provide adequate variation in Tm between a wild type and a variant sequence.
The present invention is intended to address these and other disadvantages with conventional probes.
US2007/0065847 describes a collection of nucleotide probes labelled with a common sequence to act as a tag. The probes are used for probing libraries of compounds.
WO2011/027966 describes probes having first and second regions, the first region including a reporter molecule and the second region including a quencher molecule. When the probe hybridises to a target, exonucleases can cleave the reporter from the quencher, leading to signal detection. When there is a mismatch such that the reporter portion is not hybridised, the reporter is not cleaved, and there is no signal detection.
WO2007/058499 describes probes for respiratory viruses having first and second hybridisation portions separated by a separation portion.
US2007/0259347 describes probes for screening arrays, the probes having two portions separated by a linker where the linker segment is selected to minimise homology noise associated with hybridisation.
US2003/0170711 describes oligonucleotide probes including universal bases, including 2-deoxyinosine.
WO2011/087928 describes oligonucleotide probes for detecting KRAS and PIK3CA SNPs in a background of wild type sequences, including polydeoxyinosine linkers.
According to a first aspect of the present invention, there is provided a nucleic acid probe comprising a first nucleic acid sequence being complementary to a first target nucleic acid sequence; a second nucleic acid sequence being complementary to a second target nucleic acid sequence; and a linker nucleic acid sequence joining the first and second nucleic acid sequences; wherein the linker separates the two first and second sequences such that the melting temperature of the first sequence annealed to the first target nucleic acid sequence and of the second sequence annealed to the second target nucleic acid sequence are discrete.
The presence of the linker region allows the probes to be split into functional elements that have different hybridisation characteristics. Inclusion of these linkers creates ‘bubble’ structures, isolating the elements of the probe from a thermodynamic perspective, to provide regions with different binding characteristics. Further, the presence of the linker nucleic acid sequence allows the whole probe to have the characteristics of a single polynucleotide molecule, but to behave as if composed of separate shorter nucleic acid probes. The linker region may fold to form a loop out when the first and second sequences hybridise to their respective target sequences.
In certain embodiments, the probe may be further extended to have further multiple hybridisation regions separated by multiple linkers.
This can be used in a number of different ways. For example, the probe structure allows probing of contiguous regions, where longer probes (for example, a single probe spanning both first and second target regions) would not provide adequate reporting through Tm analysis to differentiate variants.
Preferably the linker is a nucleoside linker; more preferably the linker comprises polydeoxyribonucleotides; most preferably the linker comprises or consists of polydeoxyinosine. Deoxyinosine has a low melting temperature relative to natural bases due to weaker hydrogen bonding. Other nucleosides may be used.
Preferably the linker is up to 5, 10, 15, 20, 30, 40, 50 nucleotides in length.
At least one of the first and second nucleic acid sequences is a reporter region. A reporter region includes a labelled moiety; preferably a fluorescent label. This allows detection of the probe in the event of binding to a target sequence, and monitoring of annealing over a temperature range in order to determine the presence of any variant target sequences. The probe preferably does not comprise a quencher moiety, nor is the label intended to be used with a quencher. Suitable labels include FAM, TET, HEX, ROX, TAMRA, Cy3, and Cy5. Other suitable labels will be known to the skilled person. Preferably the label is incorporated on to a T nucleotide, although any suitable nucleotide may be used.
The reporter region is preferably 15-200 nt in length, more preferably 15-150, more preferably still 15-100, or 20-100, 30-80, 40-60, or around 50 nt in length.
The reporter region is designed to have a first Tm in respect of an entirely complementary target sequence, and an altered, preferably lower, Tm, in respect of a variant target sequence. For example, the variant may represent a SNP (single nucleotide polymorphism), an insertion, a deletion, an inversion, or a repeated sequence.
In this embodiment, the melting temperature of the first sequence annealed to the first target nucleic acid sequence may be different to that of the second sequence annealed to the second target nucleic acid sequence. Preferably, however, both first and second nucleic acid sequences are selected to have similar Tm to their respective target sequences.
The reporter region may further comprise a blocking region; that is, a portion which serves to block extension of the nucleic acid strand by DNA polymerase, so preventing strand extension during, for example, PCR. A polymerase enzyme blocking group is one which should have the functional properties of blocking further elongation of the polymer. A blocking group may be any chemical group which can be attached to a nucleotide which will allow the 5′ end of the modified nucleotide to attach to a 3′ end of another nucleotide in a DNA chain but will not allow attachment of a nucleotide to the 3′hydroxyl group of the modified nucleotide. Suitably, the absence of an OH group in the 3′ position will prevent further elongation by polymerase activity. In a particularly preferred embodiment, the blocking group is selected from acetyl, CH3, glycyl, leucyl and alanyl groups. In another embodiment, the blocking group may be in the form of a di or tri peptide.
In an embodiment of the invention, both the first and second nucleic acid sequences are reporter regions. They may include different labels. Such a probe may be used as a multiplex reporter, allowing detection of target sequences over an extended range with a single probe.
In this embodiment, the probe is split into multiple discrete reporter regions. Each reporter region has different annealing temperatures and has 1 or more fluorescent nucleotides, preferably FAM-T, or different/multiple colours. The reporter may be used to report the presence of specific sequences or sequence variants, SNPs, insertions, deletions, etc. This allows multiple sequences over an extended range to be detected with a single probe. Each region is tuned to have a similar Tm in the case of the wild type target sequence, but a shifted Tm in the case of a mutation; this means that a user only has to detect the shifted Tm to know the variant is present.
As an alternative to detecting multiple target sequences, the probe may be structured for example to provide a control binding region and a test binding region, to improve the reliability of an assay.
In an alternative embodiment of the invention, the second nucleic acid sequence may be a reporter region, and the first nucleic acid sequence is an anchor region. In this embodiment, the melting temperature of the first sequence annealed to the first target nucleic acid sequence is higher than that of the second sequence annealed to the second target nucleic acid sequence. The linker sequence may form a loop-out region or may hybridise to any inter-probe nucleic acid sequence. The Tm of the anchor region to the target sequence is significantly higher (at least 0.5, 1, 1.5° C. but more generally up to 5° C. to 20° C. higher than the Tm of the reporter region with the target sequence.
The anchor region is preferably at least 50 nt in length, more preferably at least 60, 70, 80, 90, 100, 125, 150 nt. The anchor region is preferably no more than 200 nt in length and no less than 20 nt in length.
In use, as the anchor region has a substantially higher annealing temperature it serves to anchor the probe to the correct region. The reporter region has a lower annealing temperature to the anchor region and has 1 or more fluorescent nucleotides, preferably FAM-T. The reporter is used to report the presence of a specific sequence or sequence variants, SNPs, insertions, deletions, etc.
In preferred embodiments, the probe is used to detect size/length polymorphisms e.g. STR profiling, Fragile X, etc. The anchor region is used to anneal to the downstream of the repeat region, while the reporter region is homologous to a short stretch of repeats. Since the reporter is isolated from the main anchor region, and since the fluorophores are only on the reporter region, it makes no difference if the reporter binds at different sites to cause loop outs in the amplicon—normally this would alter the Tm of the product, making measurement difficult. In certain embodiments, the anchor region may be designed to be homologous to a short stretch of the repeat region as well as a flanking region; for example, the anchor region may include up to 50 nt of repeat sequence and a longer stretch, eg 150 nt, of flanking sequence.
A further aspect of the invention provides a nucleic acid probe comprising a first anchor nucleic acid sequence being complementary to a first target nucleic acid sequence; a second reporter nucleic acid sequence being complementary to a second target nucleic acid sequence; and a linker nucleic acid sequence joining the first and second nucleic acid sequences; wherein the linker separates the first and second sequences such that the melting temperatures of the first and second sequences hybridised to their respective target sequences is discrete, and where the melting temperature of the anchor sequence annealed to the first target nucleic acid sequence is higher than that of the reporter sequence annealed to the second target nucleic acid sequence; and wherein each reporter sequence comprises at least one detectable label.
The reporter preferably comprises a plurality of repeated sequence units. Preferably at least 3, 4, or 5 repeated units.
The linker preferably comprises polydeoxyinosine.
The anchor may comprise one or more repeated sequence units, but preferably comprises unique sequence. The anchor is preferably from 50-200 nt in length.
The probe preferably further comprises a blocking sequence to prevent strand extension during PCR.
The probe may further comprise one or more additional linker and reporter sequences.
A yet further aspect of the invention provides a nucleic acid probe comprising a first reporter nucleic acid sequence being complementary to a first target nucleic acid sequence; a second reporter nucleic acid sequence being complementary to a second target nucleic acid sequence; and a linker nucleic acid sequence joining the first and second nucleic acid sequences; wherein the melting temperature of the first reporter sequence annealed to the first target nucleic acid sequence is similar or equal to that of the second reporter sequence annealed to the second target nucleic acid sequence; and wherein each reporter sequence comprises at least one detectable label.
The probe preferably further comprises a blocking sequence to prevent strand extension during PCR.
A further aspect of the invention provides a method for detecting one or more polymorphisms in a target nucleic acid sequence, the method comprising:
This method may also be used to improve the utility of melting temperature whereby the Tm of a first reporting region acts as a reference to measure the Tm of the variant sequence region. Generally the position of the recorded melting temperature which is useful in determining the underlying sequence, may be affected by the concentration of salts in the reaction mixture. By comparing the melting position of the first reporter region as a known sequence, it is possible to more accurately determine the sequence under the second reporter sequence since both regions will be equally affected by interfering substances such as external salts.
Thus, a further aspect of the invention provides a method for detecting one or more polymorphisms in a target nucleic acid sequence, the method comprising:
A still further aspect of the invention provides a method for detecting tandem repeats in a target nucleic acid sequence, the method comprising:
Binding may be detected by determining the Tm of the reporter sequence to the test nucleic acid sequence, and/or by detecting label (eg, fluorescence) from the reporter sequence. Binding indicates that the number of repeats in the target sequence is greater than or equal to the number of repeats in the reporter sequence; if there are more repeats in the reporter sequence, then there is either no binding or reduced binding. In certain embodiments, the Tm may be considered to be generally proportional to the number of repeats in the target sequence, and the Tm may be determined in order to determine the number of repeats. Alternatively, or in addition, the reporter sequence of the probe may include one or more labelled moieties in each repeat unit; in this way, the detected label may be taken as being proportional to the number of repeats in the target sequence.
These and other aspects of the present invention will now be described by way of example only and with reference to the accompanying drawings, in which:
The present invention relates to a nucleic acid probe which is split into two or more functional elements, separated by a nucleic acid linker. The linker is preferably deoxyinosine, which has a low Tm relative to the naturally occurring DNA bases. Other suitable linkers may be used.
The probe sequences are designed such that the Tm of the anchor region to the target is significantly higher than the Tm of the reporter to the target, which in turn is significantly higher than the Tm of the linker to any of the sequences in the target nucleic acid.
In these scenarios, the probe is being used to determine size/length polymorphisms e.g. STR profiling, Fragile X, etc. The anchor region is used to anneal the probe to the downstream of the repeat region, while the reporter region is homologous to a short stretch of repeats. Since the reporter is isolated from the main anchor region, and since the fluorophores are only on the reporter region, it makes no difference if the reporter binds at different sites to cause loop outs in the amplicon—normally this would alter the Tm of the product, making measurement difficult.
Where the length of the target repeated sequence region is less than that of the reporter region (
Where the length of the target repeated sequence region is equal to or longer than the length of the reporter region, then any of the scenarios in
If the reporter length is increased to accommodate multiple repeats (
An alternative probe architecture is shown in
The probe could also be structured for example to provide a control binding region and test binding region.
An example of use of a multiplexed reporter probe to detect variants in the Mycobacterium tuberculosis rpoB gene is now given. Multi drug resistance in M. tuberculosis is complex. Rifampin is a first line M. tuberculosis medication and is the main target to identify in the field prior to treatment. Rifampin resistant M. tuberculosis have mutations in the 81-bp core region of the rpoB gene, which encodes the β-subunit of RNA polymerase. 96% of Rifampin resistant clinical isolates of M. tuberculosis have mutations in this gene. Mutations in codons 516, 526, or 531 result in high level Rifampin resistance. However, detecting mutations across an 81-bp gene region would typically require multiple conventional probes, several of which would need to overlap, so requiring multiple detection steps.
The probe binding to WT sequence in both positions gives the same Tm of 60 C for each reporter sequence. The presence of mutant sequences shifts this to 52 C-56 C.
Each of these Tms is distinguishable from one another, and so the single probe may be used to determine the presence of three distinct mutations in the rpoB gene.
A further illustrative use of such a probe is shown in
The probe sequence is:
The two reporter regions span the 30 nucleotide sequence shown in
The forward and reverse primers are:
The TH gene includes a short tandem repeat sequence of (AATG)n, which is known to be polymorphic. To confirm that the present invention is capable of distinguishing between different lengths of STR, two probes were compared for senstivity and accuracy. Both probes were of the same sequence, probe 1 was prepared without a linker, while probe 2 includes a five inosine base sequence acting as a linker between the two portions of the probe. Probe sequences were:
After amplifying the target sequence with the primers, melt curve analysis was performed with the amplicon and either probe 1 or probe 2. The curves are shown in
Number | Date | Country | Kind |
---|---|---|---|
1116131.2 | Sep 2011 | GB | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/GB2012/052305 | 9/19/2012 | WO | 00 | 3/19/2014 |