The present invention relates to methods for analyzing hybridisation, more particularly to determine the presence or absence of specific polynucleotide variants or mutants in a sample by analysing the hybridisation of the polynucleotide variants or mutants with specific probes.
In diagnostic or prognostic tests, one is often confronted with samples of mixed genomic sequence variants, some possibly present in minority. Nevertheless, an accurate identification of reliable mutation markers is crucial for allowing accurate molecular subtyping of disease and the rational use of molecularly guided therapies e.g. in cancer. The heterogeneity of the patient sample often complicates the detection and/or identification of such mutations. Indeed, biopsies such as from cancer tissue usually contain a mixture of cancerous and non-cancerous cells, with varying ratio of tumor vs non-tumor cells and wherein the mutations may be heterozygous or homozygous and possibly change over time.
In practice, the detection of DNA mutations is typically performed using polymerase chain reaction (PCR) and sequencing. The use of hybridisation techniques like microarrays for the detection of DNA mutations is rather uncommon, even though this technology is mature, affordable, widely used and very flexible towards type and number of DNA sequences to be analyzed. A remaining challenge with hybridisation techniques however is obtaining a clear quantitative interpretation of the microarray data. WO2011/035801 describes a method for analyzing hybridisation, involving the analysis of hybridisation intensities for different probes as a function of hybridisation free energy. Hooyberghs et al. (Biosensors and Bioelectronics, 2010, 26: 1692-1694) relates to a microarray and hybridisation-based method of detecting small concentrations in a mixture of mutant and wild type polynucleotides, based on the observation of a shift of cluster of probes with respect to a thermodynamic baseline (by plotting the hybridisation intensity against the ΔΔG), wherein hybridisation intensities obtained for the mixture are compared to this thermodynamic baseline. However, this and other hybridisation-based methods are prone to errors due to concentration variations in the sample.
Accordingly, there is still a need for improved methods for analyzing hybridisation, in particular for the reliable identification and/or quantification of mutant polynucleotides in a sample.
The present invention relates to methods for analyzing hybridisation. More particularly, the methods described herein allow for the detection of the presence of specific polynucleotide variants such as mutant genes in a sample, particularly in a sample comprising both a target, wild type polynucleotide Twt and a variant TM thereof, and allow a reliable identification and/or quantification of such variant polynucleotides in a sample with a minimal number of probes and more particularly without the need for analysing a reference sample.
A first aspect of the present invention provides a method for determining the presence of a mutant polynucleotide TM in a sample solution, particularly a sample solution comprising a target polynucleotide Twt and said mutant polynucleotide TM, said mutant polynucleotide TM comprising a mutant sequence and differing from a target polynucleotide Twt comprising a target sequence in one or more nucleotides of said target sequence, said method comprising:
In particular embodiments, said first probe PA and second probe PB are selected so that the ratio of the hybridisation intensity [I(PA)] for the hybridisation between the target polynucleotide and the first probe PA to the hybridisation intensity [I(PB)] for the hybridisation between the target polynucleotide Twt and the second probe PB is known and ranges between 0.02 and 50, preferably between 0.1 and 10. In particular embodiments the ratio I(PA)/I(PB) is about 1.
In particular embodiments, said first probe PA is fully complementary to said mutant sequence of TM.
In particular embodiments, the hybridisation sequence of said second probe PB is selected using a theoretical model for ΔG calculation, preferably is designed and chosen based on a Nearest-Neighbor model, particularly comprising the steps of calculating or estimating the hybridisation free energy for the hybridisation between the target polynucleotide Twt and probe PA; followed by estimating or calculating the hybridisation free energy for the hybridisation between the target polynucleotide Twt and a plurality of candidate probes PB based on a Nearest-Neighbor model. Alternatively or additionally, probe PB is selected via a hybridisation experiment between a plurality of candidate probes PB and the target polynucleotide Twt.
In particular embodiments, the methods as envisaged herein further comprise determining the relative amount of said target polynucleotide Twt and said mutant polynucleotide TM in said sample solution. Preferably, determining the relative amount of said target polynucleotide Twt and said mutant polynucleotide TM in said sample solution is performed using a calibration range for mutant polynucleotide TM/target polynucleotide Twt mixtures.
In particular embodiments said methods as envisaged herein further comprises determining which of a plurality of candidate mutant polynucleotides is present in said sample solution. Preferably, this comprises providing a plurality of probe pairs, wherein probe PA of each probe pair is specific for a candidate mutant polynucleotide; obtaining a first and second measured hybridisation intensity I(PA)m and I(PB)m for each probe pair of said plurality of probe pairs; and comparing I(PA)m and I(PB)m for each probe pair of said plurality of probe pairs.
In particular embodiments, said sample solution as envisaged herein is prepared by:
In particular embodiments, said hybridisation intensities of step (ii) are induced by emission of a label associated with a hybrid formed by binding of said target polynucleotide or mutants thereof and said probes. In particular embodiments, said label comprises a hybridisation sequence complementary to a sequence on said mutant polynucleotide TM and said target polynucleotide outside said target sequence. In certain embodiments, said probe PA and probe PB as envisaged herein are differently labelled.
A second aspect of the present invention provides a computer program product for performing, when executed on a computing device, a method for determining the presence of a mutant TM of a target polynucleotide in a sample solution according to the methods as envisaged herein, said computer program product being configured for
Another aspect of the present invention relates to a device configured for performing the method for determining the presence of a mutant TM of a target polynucleotide in a sample solution as envisaged herein, comprising one or more sets of reaction vessels, feeds for reagents connected thereto and a detection unit and a processing unit comprising the computer program product as envisaged herein.
The above and other characteristics, features and advantages of the concepts described herein will become apparent from the following detailed description, which illustrates, by way of example, the principles of the invention.
The following description of the figures of specific embodiments of the methods and instruments described herein is merely exemplary in nature and is not intended to limit the present teachings, their application or uses. Throughout the drawings, corresponding reference numerals indicate like or corresponding parts and features.
While potentially serving as a guide for understanding, any reference signs used herein and in the claims shall not be construed as limiting the scope thereof.
As used herein, the singular forms “a”, “an”, and “the” include both singular and plural referents unless the context clearly dictates otherwise.
The terms “comprising”, “comprises” and “comprised of” as used herein are synonymous with “including”, “includes” or “containing”, “contains”, and are inclusive or open-ended and do not exclude additional, non-recited members, elements or method steps. The terms “comprising”, “comprises” and “comprised of” when referring to recited components, elements or method steps also include embodiments which “consist of” said recited components, elements or method steps.
Furthermore, the terms first, second, third and the like in the description and in the claims, are used for distinguishing between similar elements and not necessarily for describing a sequential or chronological order, unless specified. It is to be understood that the terms so used are interchangeable under appropriate circumstances and that the embodiments described herein are capable of operation in other sequences than described or illustrated herein.
The values as used herein when referring to a measurable value such as a parameter, an amount, a temporal duration, and the like, is meant to encompass variations of +/−10% or less, preferably +/−5% or less, more preferably +/−1% or less, and still more preferably +/−0.1% or less of and from the specified value, insofar such variations are appropriate to ensure one or more of the technical effects envisaged herein. It is to be understood that each value as used herein is itself also specifically, and preferably, disclosed. Typically, the term “about” should be read in this context, in particular in the context of the value of the ratio I(PA)/I(PB).
The recitation of numerical ranges by endpoints includes all numbers and fractions subsumed within the respective ranges, as well as the recited endpoints.
All documents cited in the present specification are hereby incorporated by reference in their entirety.
Unless otherwise defined, all terms used in disclosing the concepts described herein, including technical and scientific terms, have the meaning as commonly understood by one of ordinary skill in the art. By means of further guidance, definitions for the terms used in the description are included to better appreciate the teaching of the present disclosure. The terms or definitions used herein are provided solely to aid in the understanding of the teachings provided herein.
The term “polynucleotide” as used herein may include oligonucleotides and refers to a polymer composed of nucleotide monomers, typically having a length of at least 10 nucleotides. Typically, the polynucleotides such as the target polynucleotides and probes referred to herein are single-stranded polynucleotides. As used herein, the term “polynucleotide” may include deoxyribonucleic acid (DNA), ribonucleic acid (RNA) or peptide nucleic acid (PNA).
The term “equilibrium” as used herein refers to thermodynamic equilibrium and indicates a situation wherein a steady state is obtained such that the number of conventional target-probe bindings does not substantially change over time. The term “non-equilibrium” or “non-equilibrium effects” refers to occurrence of a target-probe binding state that may change over time. The term “free energy” as used herein refers the Gibbs free energy (ΔG) or chemical potential.
The term “hybridisation” as used refers to nucleic acid hybridisation. This refers to the process of establishing a non-covalent sequence-specific interaction between two or more complementary strands of nucleic acids into a single hybrid. The strands of nucleic acids that may bind to their complement can for example be oligonucleotides, DNA, RNA or PNA. Nucleotides form the basic components of the strands of nucleic acids. Hybridisation comprises binding of two perfectly complementary strands (in the Watson-Crick base-pairing senses), but also binding of non-perfect complementary strands. With a non-perfect complementary strand reference may be made to strands having a small number of non-complementary elements such as one, two or more non-complementary elements, preferably one or two non-complementary elements. In principle there is no limit to the number of non-complementary elements but the more non-complementary elements, the easier these are detectable.
Reference throughout this specification to “one embodiment” or “an embodiment” means that a particular feature, structure or characteristic described in connection with the embodiment is included in at least one embodiment envisaged herein. Thus, appearances of the phrases “in one embodiment” or “in an embodiment” in various places throughout this specification are not necessarily all referring to the same embodiment, but may. Furthermore, the particular features, structures or characteristics may be combined in any suitable manner, as would be apparent to a person skilled in the art from this disclosure, in one or more embodiments. Furthermore, while some embodiments described herein include some but not other features included in other embodiments, combinations of features of different embodiments are also envisaged herein, and form different embodiments, as would be understood by those in the art. For example, in the appended claims, any of the features of the claimed embodiments can be used in any combination.
Provided herein are methods and systems for analyzing hybridisation for determining the presence of a mutant polynucleotide comprising a mutant sequence, also referred to herein as “mutant”, in a sample solution, particularly in a sample solution comprising both the mutant polynucleotide and the target, wild type polynucleotide. Accordingly, in a first aspect, the present application provides methods for determining the presence of a mutant polynucleotide in a sample solution preferably (also) comprising the wild type polynucleotide.
The term “mutant polynucleotide” or “mutant” (TM) as used herein refers to a polynucleotide having a sequence (the mutant sequence) which differs from the sequence of a certain target ‘wild type’ polynucleotide (Twt) in one or more nucleotides. It will be understood by the skilled person that the term “mutant” is not limited to sequences which are the result of a change in the target polynucleotide in a specific organism, tissue or cell but also include naturally occurring (i.e. evolutionary) sequence variants. More particularly in the context of the present application, these differences or mutations are located within a certain subsequence of the target polynucleotide, referred to herein as the “target sequence”. Again, it will be understood that the “target sequence” is the nucleotide sequence as present in the target wild type polynucleotide and is used as the reference sequence. In particular embodiments, the mutant polynucleotide TM only differs from the target (wild type) polynucleotide Twt in one or more nucleotides within the target sequence. Preferably, the mutant polynucleotide TM differs from the target polynucleotide Twt in a limited number of nucleotides within the target sequence, preferably in at most two nucleotides, such as in one or two nucleotides. In particular embodiments, the mutant polynucleotide differs from the target wildtype polynucleotide in only one nucleotide. Although the present method focuses on the interaction between a strand that initially is in a sample solution, and a strand that is bound to a surface, it is noted that hybridisation may occur between nucleic acid strands that both are in solution. The strands initially present in the sample solution and of interest for analysis are typically referred to in the art as the “target”, whereas the strand which is to hybridize to the target is referred to as “probe”. In the context of the methods provided herein, the mutant polynucleotide(s) TM and target wild type polynucleotide Twt referred to herein are both potentially present in the sample and may both be considered as “targets”. In preferred embodiments both the mutant polynucleotide(s) TM and target wild type polynucleotide Twt referred to herein are both present in the sample. The probe may for example be an oligonucleotide, such a DNA, RNA or PNA sequence (partially) complementary to a target. In the methods envisaged herein, the probes may be contacted with the target in the sample by introducing the probes into the sample solution. Preferably, the probe is preferably bound to a surface, such as a carrier, particularly on a microarray.
However, in particular embodiments of the present methods the probes are present in solution. Typically, the probes as envisaged for use herein are different (single-stranded) probes having different binding affinities for the (target sequence of the) target polynucleotide Twt and the mutant polynucleotide Tm. It will be understood by the skilled person however that the reference to the “target” polynucleotide and the “mutant” polynucleotide is arbitrary in that they can be interchanged.
The probes envisaged for use in the methods provided herein may contain a hybridisation sequence. The hybridisation sequence is the sequence intended for hybridisation with the target sequence and thus its sequence will be determined by the target sequence. Optionally, the probe may further comprise a tail sequence, which may be used to hybridize to other sequences, for tagging of the probe, etc. . . . . The length of the hybridisation sequence typically corresponds to the target sequence, i.e. contains the same number of nucleotides. A so-called “perfect match probe” as used herein refers to a probe having a hybridisation sequence which is completely complementary to a target sequence of a target polynucleotide. The term “mismatch probe” as used herein refers to a probe having a hybridisation sequence which is non-complementary to the target sequence. The probe is considered non-complementary as soon as one nucleotide differs from the target sequence of the target polynucleotide. In particular embodiments, the hybridisation sequence of the probe comprises one or more, such as at most two, non-complementary nucleotides with respect to the target sequence in the target polynucleotide.
The methods for determining the presence of a mutant polynucleotide TM in a sample solution as envisaged herein comprise the steps of contacting the sample with two different probes which differ in their complementarity to the target and mutant sequence. The present inventors have found that by comparing the hybridisation intensities of hybridisation experiments on a sample solution using a specifically designed or selected probe pair, it is possible to identify mutant polynucleotides in a mixture of mutant TM and wild type polynucleotide Twt at low concentrations of the mutant relative to the wild type, with a minimal set of probes without the need of a parallel reference measurement or a reference sample. The methods are based on the observation that the detection of a mutant sequence in a sample can be detected easily and without the need for a reference sequence provided that two probes are used for which the ratio of the hybridisation intensity for the hybridisation with the target is known. Indeed, where the ratio of hybridisation intensities of the different probes is known, it is possible to easily identify the presence or absence of a mutant in the sample.
Accordingly, the methods of the present invention are based on the provision of a suitable pair of probes. More particularly the methods encompass the step of designing and/or selecting probes, named probes PA and PB herein. The probe pair PA and PB can be designed such that each probe (and more particularly the hybridisation sequence of the probe) contains a (different) level of mismatch with the wild type target sequence. Accordingly, more particularly, the methods provided herein comprise the steps of:
(i) providing a probe pair consisting of a first probe PA and a second (different) probe PB;
(ii) contacting a sample solution with said first probe PA and said second probe PB, and obtaining or detecting hybridisation intensities I(PA)m and I(PB)m for said first probe PA and said second probe PB of said probe pair, respectively; and
(iii) analysing and comparing I(PA)m and I(PB)m and determining the presence of said mutant polynucleotide TM in said sample solution based thereon.
More particularly, in these methods a first probe PA and a second (different) probe PB are designed or selected such that the hybridisation sequences of said first probe PA and second probe PB are characterized in that
(i) they each comprise at least one non-complementary nucleotide with respect to the target sequence. In particular, the at least one non-complementary nucleotide of probe PA (vs the target sequence of Twt) is different from the non-complementary nucleotide of probe PB (vs the target sequence of Twt);
In particular embodiments, said first probe PA and second probe PB are selected such that only said first probe PA has a hybridisation sequence specific for the mutant sequence, i.e. comprising at least one nucleotide complementary to the at least one nucleotide of the mutant sequence differing from the target sequence of the target wild type polynucleotide Twt; and that the ratio of the hybridisation intensity [I(PA)] for the hybridisation between the target polynucleotide Twt and the first probe PA to the hybridisation intensity [I(PB)] for the hybridisation between the target polynucleotide Twt and the second probe PB is a known value known and ranges between 0.02 and 50, more preferably between 0.05 and 20 or between 0.1 and 10. Preferably, the first probe PA and second probe PB have a hybridisation sequence with each at least one different non-complementary nucleotide with respect to the target, wild type sequence Twt.
In particular embodiments, probe PA comprises a nucleotide complementary to the one or more nucleotides differing between the sequence of the wild type and the mutant polynucleotides, or stated differently, probe PA is specific for the mutant polynucleotide, whereas this is not the case for probe PB, which comprises a non-matching nucleotide to the mutant nucleotide. In preferred embodiments, said first probe PA is fully complementary to the mutant sequence of the mutant polynucleotide. As used herein, probe PA is also referred to as the “mutant probe”.
In addition, the hybridisation sequence of probe PB comprises a suitable sequence variation of the target (wild type) sequence Twt such that the ratio I(Twt,PA)/I(Twt,PB) is a known value, preferably ranging between 0.02 and 50, more preferably between 0.05 and or most particularly between 0.1 and 10, even more preferably ranging between 0.5 and 2. In particularly preferred embodiments, Probe PB is selected or designed such that I(Twt,PA) is about equal I(Twt,PB), or stated differently wherein the ratio I(Twt,PA)/I(Twt,PB) is about 1 (or 1+/−50%, such as 1+/−20% or 1+/−10%).
In certain embodiments, probe PB, and optionally also probe PA, is designed using a theoretical model for ΔG calculation, preferably using a Nearest Neighbor model ((Bloomfield Va. et al., “Nucleic Acids Structures, Properties and Functions, University Science Books, Mill Valley, 2000). This is possible since a stringent relation (e.g. the Langmuir isotherm under certain experimental conditions) exists between measured intensities I and the corresponding free energy ΔG. Nearest neighbor models provide a good estimation of the hybridisation free energy as a sum of dinucleotide parameters. In these embodiments, the method may comprise calculating or estimating the hybridisation free energy for the hybridisation between the wild type target polynucleotide Twt and probe PA; followed by estimating or calculating the hybridisation free energy for the hybridisation between the wild type target polynucleotide Twt and a plurality of candidate probes PB based on a Nearest-Neighbor model. In particular embodiments, probe PB is selected following this theoretical calculation as the probe for which ΔG(Twt,PB) minus ΔG(Twt,PA) is minimal, more preferably as the probe for which ΔG(Twt,PB) is about equal to ΔG(Twt,PA) (or wherein ΔG(Twt,PB) minus ΔG(Twt,PA) is about 0). It is an advantage of embodiments according to the present invention that hybridisation free energy can be determined accurately for hybridisation between a target initially in solution and a probe bound to a surface, such as for example may occur in microarrays. Advantageously, using a theoretical model is a quick and straightforward way to design probe PB. It will be understood that in what follows when referring to the nature of the probes, more particularly to their ability to hybridize with a target sequence, this in fact refers to the hybridisation sequence of the probes.
In certain embodiments, probe PB is selected via an experiment which allows the evaluation of ΔG(Twt,PA) and ΔG(Twt,PB), and hence allows the evaluation of the ratio ΔG(Twt,PA)/AG(Twt,PB), and accordingly the ratio I(Twt,PA)/I(Twt,PB). For instance this can be done by a hybridisation experiment between different (individual) probes and the target wildtype polynucleotide Twt. Advantageously, said experiment is a microarray experiment, allowing many sequences to be evaluated in parallel. For instance the array can be carried out with a test sample comprising only target wildtype polynucleotide (Twt) and whereby one spot of the array contains probe PA and wherein other spots of the array contain each individually a different candidate probe PB. For instance, the different candidate probes PB can be complementary to the wild type except for at least one nucleotide variation. Preferably, probe PB is selected based on the fact that it corresponds to the candidate probe PB which gives a hybridisation intensity with the target nucleotide Twt comparable to or about equal to the hybridisation intensity of probe PA with the target polynucleotide Twt. In certain embodiments, probe B is selected via theoretical modelling and calculation combined with experimental data.
In certain embodiments, in case additional degrees of freedom in the selection of suitable probes PA and PB are required, probe PA can be designed to be specific for the mutant polynucleotide Twt, i.e. comprising a nucleotide complementary to the one or more nucleotides differing between the sequence of the wild type and the mutant polynucleotides, but comprises in addition one or more nucleotides non-complementary to the target wild type polynucleotide Twt. Probe PB is designed or selected as described above.
The methods as envisaged herein are of particular interest for the analysis of a sample solution which comprises a mixture of the target polynucleotide [Twt or wild type (wt)] and a mutant polynucleotide [TM or (mut)]. More particular they are of interest in samples which are characterized in that the concentration of the mutant polynucleotide [c(mut)] is significantly smaller than the concentration of the target or wild type polynucleotide [c(wt)]. The ratio of these concentrations [c(mut)/c(wt)] is also referred to herein as the “relative concentration” of mutant polynucleotide in the sample solution. In preferred embodiments, the relative concentration of mutant polynucleotide in the sample solution is between 0.01 and 0.5, more preferably between 0.01 and 0.1. The relative concentration may also be expressed as a percentage, which refers to 100*[c(mut)/c(wt)].
The sample solution may be prepared for use in the methods envisaged herein using standard methods known in the art. This may include extracting DNA or other polynucleotides from a sample of interest, followed by amplification of certain fragments within the extracted DNA. Typically, amplification is performed using PCR (polymerase chain reaction). However, this results in double stranded DNA, whereas single-stranded DNA is preferred for the present methods. Indeed, hybridisation of double-stranded DNA with nucleic acid probes is hampered by competition between the complementary non-target strand and the probe. Such competition can be avoided by degradation of the complementary strands, for example using lambda exonuclease. Lambda exonuclease is a processive enzyme that acts in the 5′ to 3′ direction, catalyzing the removal of 5′ mononucleotides from duplex DNA. The preferred substrate is 5′-phosphorylated double stranded DNA. Accordingly, in certain embodiments, the preparation of the sample solution may comprise the steps of:
The contacting of the sample with the probes (referred to as step (ii) above) is typically performed under conditions suitable for hybridisation of the target and mutant polynucleotides to said probes. The skilled person understands that relevant parameters for optimizing hybridisation include hybridisation time, temperature, and probe length. In preferred embodiments, the probes have a length ranging from about 20 to about 30 nucleotides. Furthermore, it is preferred that the hybridisation experiments are performed under such conditions that hybridisation has reached equilibrium, e.g. by selecting suitable probe lengths, temperatures, and hybridisation time. A method for determining for which probes or spots the hybridisation has reached equilibrium is described in international patent application WO2011/035801, which is hereby incorporated by reference in its entirety.
Although the methods envisaged herein may be carried out by using probes which are added to the sample in solution, it is preferred that the probes are provided on a surface. The probes may be provided on any type of carrier, such as magnetic beads or fibers. However, it is preferred that the probes are provided on a microarray. Thus, in particular embodiments of the methods described herein, the first and second probe of the probe pair are provided on separate spots of a microarray. A microarray as a hybridisation platform contains a large number of probes which are immobilized on a solid surface. Preferably, the probes are provided in spatially separated spots, wherein each spot comprises one (and only one) type of probe. Typically, each spot comprises only a few picomoles of each probe. Typical microarrays comprise hundreds or even thousands of spots. A plurality of microarray platforms suitable for use in the present methods are commercially available, and include but are not limited to the platform provided by Agilent, the GeneChips platform from Affymetrix or CodeLink Bioarray platform from Amersham Biosciences.
In preferred embodiments, the first and second probes may be provided on the same microarray. This can facilitate comparing the hybridisation intensities for the probe pair.
The hybridisation intensity as envisaged herein is a value representing the fraction of a certain probe which is hybridized. In the methods described herein, detection of hybridisation intensity may be performed using a marker associated with the formed hybrid, such as for example a fluorescence marker or a radio-active marker, or other markers known in the art. In preferred embodiments, the marker used for probe PA is the same as the marker used for probe PB. However, this is not critical for the present methods. Accordingly, different markers may be used for probe PA and probe PB of the probe pair. In certain embodiments, the detection of the hybridisation intensity may be performed using a label-free method as known by the skilled person, such as surface-enhanced Raman spectroscopy.
Typically in hybridisation experiments, intensity of the radiation or fluorescence provided by the markers is detected and representative for the number of hybrids formed. Thus, in certain embodiments, the hybridisation intensities may be induced by emission of a label associated with a hybrid formed by binding of the target polynucleotide or mutant thereof and said probes. Suitable fluorescence markers for the present methods include, but are not limited to, Cy3 and Cy5, which are dyes of the cyanine dye family.
The markers or labels may be associated to the target or mutant polynucleotide prior to or after hybridisation, such as during PCR amplification of the sample DNA. In some embodiments, a fluorescent dye or other marker compound may be associated directly to the target or mutant thereof. In other embodiments, the marker compounds may be associated to the target or mutant thereof in an indirect manner, for example via a “barcode”, which is a strand having a hybridisation sequence which is complementary to a tail sequence which is present on the mutant polynucleotide of interest and on the target polynucleotide, thereby allowing hybridisation between the barcode and target (or mutant thereof), and therefore indirect coupling of the fluorescence marker or other marker to the target. More particularly, the strand hybridizes to a tail sequence outside the target sequence of the target polynucleotide, such that it does not significantly interfere with the hybridisation between the targets and the probes.
After the detection of the signal resulting from the hybridisation of the sample DNA with the probes, the analysis is performed. In particular embodiments, the analysis (referred to as step (iii) above) of the present methods, comprises the comparison of the measured hybridisation intensities of the sample solution with each probe of the probe pair. Thus, the measured intensity of the hybridisation of the sample solution with probe PA [I(PA)m] is compared to the measured intensity of the hybridisation of the sample solution with probe PB [I(PB)m]. Based on the comparison of said hybridisation intensities, particularly by determining the ratio of the measured intensity of the hybridisation with probe PA vs the measured intensity of the hybridisation with probe PB [i.e. I(PA)m/I(PB)m], the presence of one or more mutant polynucleotides can be determined.
In particular embodiments, the hybridisation intensities may be analyzed using statistical methods, wherein I(PA)m/I(PB)m is compared to the known ratio I(PA)/I(PB) of step (i).
Indeed, in case I(PA)m/I(PB)m deviates from, particularly is higher than, the ratio I(PA)/I(PB) of step (i), this corresponds to the presence of a mutant polynucleotide in the sample solution. A deviation from the ratio I(PA)/I(PB) of step (i) may be a deviation, preferably increase, of at least 5%, more preferably of at least 10%, still more preferably of at least 25%, even more preferably of at least 33%.
In this context, specific ways to detect and/or identify mutant polynucleotides in a sample solution will be explained below, using theoretical concepts based on the thermodynamics of DNA hybridisation, as illustrated for a particular embodiment in
In microarray data analysis, each probe intensity (hybridisation intensity) is associated to a signal from a spot. A spot is a local space on the microarray slide that contains a large number of identical sequences corresponding to a certain type of probe in the probe set. Therefore, typically, each spot represents a single type of probe. Each of these identical sequences within a spot is supposed to be hybridized to a floating target sequence depending on the affinity between the two sequences. This affinity is sequence dependent and determines the fraction of hybridized probes in a spot.
In a hybridisation model with Langmuir isotherm (Hooyberghs et al., Nucleic Acids Res. 2009, 37, e53), the relationship between the detected intensity and the hybridisation affinity for the target-probe hybrid can approximately be written as equation (1) (assuming the hybridisation between the target and probe sequences are in thermodynamic equilibrium, and that the fraction of hybridized probes in a microarray spot is such that the detected intensities are significantly above the background yet far from saturation):
I=A.e
−ΔG/RT (1)
wherein I is the detected hybridisation intensity, A is a proportionality factor for the intensity and is e.g. system dependent, ΔG is hybridisation free energy as a sequence dependent measure for the affinity, i.e. the free energy difference between two ssDNA sequences and their DNA duplex formed by the hybridisation, R is the ideal gas constant, and T is the experimental temperature.
This equation can be calculated for each spot of the microarray, i.e. for each probe type in a probe set.
In case of the hybridisation between a sample containing either a target (wild type) sequence (Twt) or either a mixture of a target sequence Twt and a mutant sequence Tm and a probe pair, PA and PB, wherein probe PA comprises a sequence which is fully complementary to the mutant sequence Tm and wherein probe PB comprises a sequence which is fully complementary to the wild type target sequence Twt except for at least one (different) variation, the following relations can be deduced.
When a wild type target sequence Twt hybridizes to probe PA, the Langmuir isotherm (equation 1) representing the intensity of the spot corresponding to this probe can be described as follows:
I=A.e
−ΔG(T
P
)/RT (2)
With −ΔG(Twt,PA) a measure of the affinity between probe PA and the wild type target Twt. As the PA−Twt duplex contains at least one mismatch (PA is fully complementary to a mutant sequence Tm, differing in at least one nucleotide from the wild type target Twt), which gives an affinity penalty and a corresponding reduction of the hybridisation intensity (cf. Langmuir isotherm; equation 1 and 2)
A similar situation occurs for the hybridisation between the wild type target sequence Twt and probe PB, and the corresponding Langmuir isotherm (equation 1) can be described as follows:
I=A.e
−ΔG(T
,P
)/RT (3)
With −ΔG(Twt,PB) a measure of the affinity between probe PA and the wild type target Twt.
Equations (2) and (3) allow to select and/or design a probe PB (e.g. by including at least one well-chosen mismatch different from a mutant nucleotide) so that in a particular embodiment ΔG(Twt,PA)=ΔG(Twt,PB), via theoretical modelling (e.g. via the nearest neighbor model) or by performing hybridisation experiments, or stated differently, wherein the ratio ΔG(Twt,PA)/AG(Twt,PB)=1.
(1) Thus, for a sample containing only wild type target sequence Twt, since ΔG(Twt,PA)=ΔG(Twt,PB), the affinity of probes PA and PB with respect to the target sequence Twt are equal (
(2) In the case of a hybridisation where the sample contains mixture of a target and mutant sequences i.e. a wild type sequence and a mutant such as in a clinical situation, there will be a competition between the two target sequences to hybridize to a single probe. To describe this competitive hybridisation, equation 1 can be extended to:
I=I(wt)+I(mut)=A.e−ΔG(wt)/RT+A.e−ΔG(mut)/RT (4)
wherein I(wt) is the wild type contribution to the total signal and I(mut) is the mutant contribution, ΔG(wt) is free energy of the wild type and ΔG(mut) is free energy of the mutant.
Accordinlgy, if a sample contains the wild type target sequence Twt and a mutant sequence Tm, both sequences will hybridize to each probe, and the resulting hybridisation intensity for probe PA and probe PB can be described as in equation 5 and 6, respectively:
I(PA)=A.[e−ΔG(T
I(PB)=A.[e−ΔG(T
As before, the contribution of the first term (relating to the hybridisation between the wild type target Twt to each probe) is equal for both probe PA and PB, but the contribution of the second term to the hybridisation intensity is highly different for the different probes:
With the sequence of probe PA fully matching the mutant sequence Twt, they will have a much higher affinity to bind: i.e. PA-Tm affinity is high. In contrast, the sequence of probe PB contains at least two mismatches with respect to TM (i.e. the at least one nucleotide differing between Twt and Tm, and an additional mismatch nucleotide), and consequently the PB-Tm affinity is low (
Accordingly, ΔG(Tm,PA)>ΔG(Tm,PB), and consequently, I(PA)>I(PB), or, stated differently, the ratio I(PA)/I(PB)>1.
This reasoning clearly shows that even when limited amounts of the mutant sequence are present, the hybridisation intensities of the two probes with the sample solution will no longer be equal. Advantageously, by calculating the ratio of the intensities with probe PA vs probe PB, the analysis is independent from the proportionality factor A, and thus there is no need to analyse a reference sample.
While in this setup the theoretical concept is explained in terms of I(PA)/I(PB)=1, the skilled person will understand this also applies for other setups where the ratio I(PA)/I(PB) is known, but not equal to 1: in this case, even at low levels of the mutant sequence in the sample solution, the ratio I(PA)/I(PB) will increase as well and the measured ratio I(PA)m/I(PB)m will deviate from the known ratio I(PA)/I(PB).
In particular embodiments of the present invention, the method further comprises determining the relative amount of said target polynucleotide and said mutant polynucleotide in said sample solution, for instance by using a calibration range for mutant polynucleotide/target wild type polynucleotide mixtures.
In certain embodiments, the present methods may comprise determining which of a plurality of candidate mutant polynucleotides is present in the sample solution. In this context, the method comprises contacting the sample solution with a plurality of probe pairs as envisaged herein, wherein each probe pair comprises a probe PA specific for one of the candidate mutant polynucleotides and a probe PB designed or selected as above. It is understood that the plurality of probe pairs may comprise a same or different probe PB. Next, the mutant polynucleotide(s) present in said sample can be identified and determined based on the probe pair(s) out of said plurality of probe pair(s) for which the ratio I(PA)m/I(PB)m deviates from the ratio I(PA)/I(PB) of step (i)
Further provided herein is a computer program product for performing, when executed on a computing device, at least a part of a method for determining the presence of a mutant of a target polynucleotide as described herein. For example, the computer programs may be configured for receiving and analyzing hybridisation intensities according to the methods described herein. For example, the computer program may be configured to analyse the intensity of the hybridisation of the sample solution with a first and second probe of a probe pair, and to statistically analyse and compare the ratio of the hybridisation intensities with the first and second probe as described herein. In particular embodiments, the computer program product is configured for receiving a hybridisation intensity for a sample solution with a first probe PA of a probe pair, for receiving a hybridisation intensity for a sample solution with a second probe PB of said probe pair and analyzing the ratio of the measured intensities and comparing it to the (theoretical) ratio of the hybridisation intensities of the wild type polynucleotide Twt with probes PA and PB, calculated using e.g. a Nearest Neighbor model.
The software may further be configured to perform a statistical analysis of the hybridisation intensity data using a first and second probe of one or more probe pairs in order to determine which of a plurality of candidate mutants is present in a sample solution. In certain embodiments, the computer programs may further be configured for designing suitable probe sets based on information of the target sequence and/or mutations thereof, particularly based on a Nearest Neighbor model for determining the difference in free energy.
In case of implementation or partly implementation as software, such software may be adapted to run on suitable computer or computer platform, based on one or more processors. The software may be adapted for use with any suitable operating system. The computing means may comprise a processing means or processor for processing data.
A further tool provided herein is a device configured for carrying out the methods provided herein. More particularly the device comprises the combination of the necessary hardware and software for carrying out the different steps of these methods. The device may comprise hardware, in the form of reaction vessels and feeds for reagents connected thereto and a detection unit, which can ensure the contacting a sample solution with a first probe PA and a second probe PB, and hybridisation of the sample solution with said probe PA and said probe PB. Moreover, the device comprises a processing unit provided with the necessary software for performing the analysis step involving the comparison of the intensity measurements with the first and second probe of a probe pair and optionally a display unit to present the results of said analysis to a user. In particular embodiments, the results are displayed as information on the presence of a mutant polynucleotide in the sample solution.
This concept is further exemplified in the examples below.
The following examples are provided for the purpose of illustrating the claimed methods and applications and by no means are meant and in no way should be interpreted to limit the scope of the present invention.
The inventors have applied the present method for the detection of point mutations in the K-RAS oncogene, which is an important genetic marker for colorectal and lung cancer diagnostics and treatment stratification.
The nucleotide sequence of the target polynucleotide (wild type K-RAS SEQ ID No 1) and a mutant polynucleotide (SEQ ID No 2) are as follows, with the target and mutant sequence being underlined and the nucleotide in the mutant sequence differing from the wild type target sequence marked by (*) (“mutant polynucleotide”):
GTCCTGCACCAGTAATATGCATATTAAAACAAGATTTACCTCTATTGTTG
GTCCTGCACCAGTAATATGCATATTAAAACAAGATTTACCTCTATTGTTG
In a first exemplary setup, a probe pair was designed with probe PA being a perfect match probe for the mutant sequence and probe PB being fully complementary to the target sequence, except for one nucleotide, different from the mutant nucleotide (nucleotides not complementary to the target sequence are underlined):
A hybridisation experiment with a sample comprising only the wild type polynucleotide and 0% mutant polynucleotide was performed and the ratio of the hybridisation intensity of the hybridisation of probe PA with said sample (IA) to the ratio of the hybridisation intensity of the hybridisation of probe PB with said sample (IB) was measured: the ratio IA/IB equaled 1.0229. This shows that with a one-point mutation a probe PB can be obtained having about the same hybridisation intensity for hybridisation with the target (wild type) sequence as probe PA.
When performing hybridisation experiments with samples with increasing levels of mutant polynucleotide, IA increases, due to the perfect match hybridisation between probe PA and the mutant polynucleotide, and IB decreases due to the mutant sequence presenting two non-complementary nucleotides with probe PB. Accordingly, the ratio IA/IB increases: IA/IB>1. This is represented in
In a second exemplary setup, a probe pair was designed with probe PA being fully complementary to the mutant target sequence except for one nucleotide and probe PB being fully complementary to the target sequence, except for two nucleotides, different from the non-matching nucleotides of probe PA: (nucleotides not complementary to the target sequence are underlined):
A hybridisation experiment with a sample comprising only the wild type polynucleotide and 0% mutant polynucleotide was performed and the ratio of the hybridisation intensity of the hybridisation of probe PA with said sample to the ratio of the hybridisation intensity of the hybridisation of probe PB with said sample was measured and yielded a ratio IA/IB=1.0126. This shows that a probe pair can be obtained having a very similar hybridisation intensity for hybridisation with the target (wild type) sequence.
When performing hybridisation experiments with samples with increasing levels of mutant polynucleotide, IA increases, and IB decreases, as above. Accordingly, the ratio IA/IB increases with increasing mutant polynucleotide concentration. This is represented in
Number | Date | Country | Kind |
---|---|---|---|
16160862.5 | Mar 2016 | EP | regional |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2017/056344 | 3/17/2017 | WO | 00 |