SINGLE NUCLEOTIDE DETECTION METHOD

Abstract
A method of sequencing a nucleic acid comprising the steps of (1) generating a stream of single nucleotides by progressive pyrophosphorolysis; (2) producing at least one substantially double-stranded oligonucleotide used probe comprising (a) a first single-stranded oligonucleotide labelled with first and second regions of detectable element types and (b) second and third single-stranded oligonucleotides capable of hybridising to complementary regions on the first oligonucleotide; (2a) either (i) treating the used probe with a restriction endonuclease to cut the first oligonucleotide strand at the recognition site or (ii) treating the used probe with restriction endonuclease to cut the first oligonucleotide strand at the recognition site; (3) digesting the first oligonucleotide strand of the used probe with an enzyme ide; (4) reacting the fourth oligonucleotide with another first oligonucleotide to produce a substantially double-stranded oligonucleotide product corresponding to the used probe; (5) repeating the above steps; and (6) detecting the detectable elements.
Description

This invention relates to a method for detecting and characterising single nucleotides. It is especially suitable for use in the sequencing of DNA or RNA and detecting the location of methylated and un-methylated nucleotide bases in such sequences.


Next generation sequencing of genetic material is already making a significant impact on the biological sciences in general and medicine in particular as the unit cost of sequencing falls in line with the coming to market of faster and faster sequencing machines.


In our previous applications WO 2014/053853, WO 2014/053854, W02014/167323, WO2014/167324 and WO2014/111723 we have described a new sequencing method which involves progressive digestion of a polynucleotide analyte to generate an ordered stream of single nucleotides, preferably a stream of single deoxyribonucleoside triphosphates, each of which can be captured one-by-one into corresponding droplets in a microdroplet stream. Thereafter, each droplet can be chemically and/or enzymatically manipulated to reveal the particular single nucleotide it originally contained. In one embodiment, these chemical and/or enzymatic manipulations comprise a method involving the use of one or more two-component oligonucleotide probe types each of which is adapted to be able to selectively capture one of the single nucleotide types from which the analyte is constituted. Typically, in each of such probe types, one of the two oligonucleotide components comprises characteristic fluorophores and in the probe's unused state the ability of these fluorophores to fluoresce remains extinguished by virtue of the presence of quenchers located close-by or by self-quenching. In use, when the probe has captured its corresponding single nucleotide, it is rendered susceptible to subsequent exonucleolysis thereby liberating the fluorophores from the quenchers and/or each other enabling them to fluoresce freely. By this means, the original single nucleotide present in each droplet can be identified indirectly by spectroscopic means.


Fan et al in Nature Reviews Genetics 7(8) 632-644 (2006) provide a general review of the development of methods and platforms that have enabled highly parallel genomic assays for genotyping, copy-number measurements, sequencing and detecting loss of heterozygosity, allele-specific expression and methylation. FIG. 2a of this review schematically shows the use of a circularizable probe with 3′ and 5′ ends that anneal upstream and downstream of a site of single nucleotide polymorphism (SNP) on an analyte thereby leaving a gap which is subsequently filled with a nucleotide which is the complement of the SNP to form a complete circular probe which may then be amplified after release. However unlike our method, the nucleotide which is captured during the filling process is not obtained directly from the analyte itself.


WO03080861 discloses a process wherein a nucleic acid analyte is subjected to progressive pyrophosphorolysis in the presence of a nucleotide-specific reactive label which attaches directly to the nucleotide as it is released. Not only is this quite different from the method we employ but in practice the fluorescence signal measured when the labelled nucleotides are subsequently interrogated would likely be too weak to enable reliable identification above the associated background noise.


Finally, WO9418218 teaches a DNA sequencing method in which the analyte is subjected to progressive exonucleolysis to generate a stream of single nucleotide diphosphates or monophosphates which are then incorporated into a fluorescence-enhancing matrix before being detected. Not only is this a completely different approach to the one we describe but we again observe that any signal generated would likely be too weak to be reliably detected and identified.


In our application WO 2016/012789 we disclose a new version of the sequencing method described in our above-mentioned patent applications involving an improved probe system. In this method, a first single-stranded oligonucleotide bearing e.g. fluorophores in an undetectable state is caused to undergo reaction in the presence of a polymerase and ligase with the nucleotide to be captured and second and third single-stranded oligonucleotides to produce a double-stranded used probe in which only the strand comprising the first oligonucleotide is susceptible to exonucleolysis. This enables the other strand produced during the capture to participate in a cycle of first oligonucleotide annealing and exonucleolysis steps so that the signal from the liberated fluorophores (now in a detectable state) grows rapidly with each cycle and becomes much easier to detect. In one embodiment, the 3′ end(s) of the second and/or third oligonucleotides are rendered resistant to exonucleolysis by chemical modification or preferably linking the relevant ends of each so that when the nucleotide is captured the strand of the used probe in which these oligonucleotides are located is rendered closed-loop.


We have now developed an improved version of the probe system and method described in this patent application which makes it possible to distinguish between nucleotides in which the associated nucleobases are substituted or unsubstituted; for example methylated or un-methylated. This makes the method especially suitable for example in detecting the location of methylated cytosine or adenine nucleobases in sequences. This is extremely beneficial as in some instances the locations of such methylation are known to correlate with genetic conditions and diseases such as cancer.


Thus according to a first aspect of the present invention there is provided a method of sequencing a nucleic acid characterised by the steps of (1) generating a stream of single nucleotides by progressive pyrophosphorolysis of the nucleic acid; (2) producing at least one substantially double-stranded oligonucleotide used probe by reacting, in the presence of a polymerase and a ligase, one of the single nucleotides with a corresponding probe system comprising (a) a first single-stranded oligonucleotide labelled with first and second regions of characteristic detectable element types in an undetectable state located respectively on the X′ and Y′ end sides of a third region comprising a restriction enzyme recognition site element including the capture site and an exonuclease-blocking site on the X′ side thereof (wherein either X′ is 3′ and Y′ is 5′ or X′ is 5′ and Y′ is 3′) and (b) second and third single-stranded oligonucleotides capable of hybridising to complementary regions on the first oligonucleotide flanking the capture site; (2a) either (i) treating the used probe with a conventional or nicking substitution-dependent restriction endonuclease to cut the first oligonucleotide strand at the recognition site if and only if the single nucleotide captured comprises a nucleobase which is substituted or (ii) treating the used probe with a conventional or nicking substitution-sensitive restriction endonuclease to cut the first oligonucleotide strand at the recognition site if and only if the single nucleotide captured comprises a nucleobase which is unsubstituted; (3) digesting the first oligonucleotide strand of the used probe with an enzyme having double-stranded exonucleolytic activity in the X′-Y′ direction corresponding to the first oligonucleotide to yield detectable elements derived from either the first region, the second region, or the first and second regions in a detectable state and a single-stranded fourth oligonucleotide which is at least in part the sequence complement of the first oligonucleotide; (4) reacting the fourth oligonucleotide with another first oligonucleotide to produce a substantially double-stranded oligonucleotide product corresponding to the used probe; (5) repeating steps (2a), (3) and (4) in a cycle and (6) detecting the detectable elements released in each iteration of step (3) wherein if the endonuclease employed is of the conventional type the second or third oligonucleotide includes an endonucleolysis-directing linkage at or close to its X′ or Y′ end respectively.


Step (1) of the method of the present invention comprises generating a stream of single nucleotides (here single nucleoside triphosphates) from a nucleic acid analyte by pyrophosphorolysis. The analyte employed in this step is suitably a double-stranded polynucleotide the length of which can in principle be unlimited including up to the many millions of nucleotide bases found in a human genome fragment. Typically however the polynucleotide will be at least 50, preferably at least 150 nucleotide pairs long; suitably it will be great than 500, greater than 1000 and in many cases thousands of nucleotide pairs long. The analyte itself is suitably RNA or DNA of natural origin (e.g. derived from a plant, animal, bacterium or a virus) although the method can also be used to sequence synthetically produced RNA or DNA or other nucleic acids made up wholly or in part of nucleobases that are not commonly encountered in nature; i.e. nucleobases other than adenine, thymine, guanine, cytosine and uracil. Examples of such nucleobases include 4-acetylcytidine, 5-(carboxyhydroxylmethyl)uridine, 2-O-methylcytidine, 5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylamino-methyluridine, dihydrouridine, 2-O-methylpseudouridine, 2-O-methylguanosine, inosine, N6-isopentyladenosine, 1-methyladenosine, 1-methylpseudouridine, 1-methylguanosine, 1-methylinosine, 2,2-dimethylguanosine, 2-methyladenosine, 2-methylguanosine, 3-methylcytidine, 5-methylcytidine, N6-methyladenosine, 7-methylguanosine, 5-methylaminomethyluridine, 5-methoxyaminomethyl-2-thiouridine, 5-methoxyuridine, 5-methoxycarbonylmethyl-2-thiouridine, 5-methoxycarbonylmethyluridine, 2-methylthio-N6-isopentenyladenosine, uridine-5-oxyacetic acid-methylester, uridine-5-oxyacetic acid, wybutoxosine, wybutosine, pseudouridine, queuosine, 2-thiocytidine, 5-methyl-2-thiouridine, 2-thiouridine, 4-thiouridine, 5-methyluridine, 2-O-methyl-5-methyluridine and 2-O-methyluridine. In the case of DNA the single nucleotides generated are deoxyribonucleoside triphosphates whilst in the case of RNA they are ribonucleoside triphosphates.


In one embodiment of the invention, step (1) comprises a first sub-step of attaching the analyte to a substrate. Typically, the substrate comprises a microfluidic surface, a micro-bead or a permeable membrane; for example one made out of glass or a non-degradable polymer. Preferably, the substrate further comprises a surface adapted to receive the analyte. There are many ways in which the analyte can be attached to such surfaces any of which can in principle be used in this sub-step. For example, one method involves priming a glass surface with a functionalised silane such as an epoxysilane, an aminohydrocarbylsilane or a mercaptosilane. The reactive sites so generated can then be treated with a derivative of the analyte which has been modified to include a terminal amine, succinyl or thiol group.


In one embodiment of step (1), the analyte is pyrophosphorolysed progressively to generate a stream of single nucleotides the order of which corresponds to that of the sequence of the analyte. Pyrophosphorolysis may be carried out at a temperature in the range 20 to 90° C. in the presence of a reaction medium comprising a polymerase. Preferably it is carried out under conditions of continuous flow so that the single nucleotides are continually removed from the reaction zone as they are liberated. Most preferably, the pyrophosphorolysis is carried out by causing an aqueous buffered medium containing the enzyme and the other typical additives to continuously flow over the surface to which the analyte is bound.


In one embodiment, the enzyme used is one which can cause progressive 3′-5′ pyrophosphorolytic degradation of the analyte to yield a stream of nucleotides with high fidelity and at a reasonable reaction rate. Preferably this degradation rate is as fast as possible and in one embodiment is in the range 1 to 50 nucleotides per second. Further information about the pyrophosphorolysis reaction as applied to the degradation of polynucleotides can be found for example in J. Biol. Chem. 244 (1969) pp. 3019-3028. Suitably, the pyrophosphorolysis is carried out in the presence of a medium which further comprises pyrophosphate anion and magnesium cations; preferably in millimolar concentrations.


In step (2) of the method of the present invention at least one single nucleotide in the stream, preferably each single nucleotide in an ordered stream, is reacted, in the presence of a polymerase and a ligase, with a probe system to generate a substantially double-stranded used probe. In one embodiment, before this step is carried out the product of step (1) is treated with a pyrophosphatase to hydrolyse any residual pyrophosphate to phosphate anion.


The polymerase used in this step is in one embodiment selected from the group consisting of those which show essentially neither exo- nor endonuclease activity under the reaction conditions. Examples of polymerases which can be advantageously used include, but are not limited to, the prokaryotic pol 1 enzymes or enzyme derivatives obtained from bacteria such as Escherichia coli (e.g. Klenow fragment polymerase), Thermus aquaticus (e.g. Taq Pol), Bacillus stearothermophilus, Bacillus caldovelox and Bacillus caldotenax. Any suitable ligase can be used in this step.


The probe system employed in step (2) is comprised of three components; (a) a first single-stranded oligonucleotide labelled with first and second regions of different detectable element types in an undetectable state and (b) second and third unlabelled single-stranded oligonucleotides capable of hybridising to complementary regions on the first oligonucleotide located either side of a capture site. In one embodiment, where the restriction endonuclease is not a nicking endonuclease, the second and/or third oligonucleotide includes an endonucleolysis-directing linkage located at or close to their X′ or Y′ end respectively so that ultimately this linkage comprises part of a restriction enzyme recognition site created in the used probe. In one embodiment this endonucleolysis-directing linkage is a phosphorothioate linkage. In another, it is one selected from the group consisting of phosphoramidite linkages such as C3 spacer, Spacer 9, Spacer 12, Spacer 18, d-Spacer (abasic furan type) or other like spacers or modifications used between nucleotides in oligonucleotide synthesis.


In one embodiment, the second and third oligonucleotides are discrete entities whilst in another they are linked to each other by means of a linker region. In another, the linker region links the Y′ end of the second oligonucleotide and the X′ end of the third oligonucleotide together where here and hereinafter the terms X′ and Y′ are used with respect to the direction in which the exonucleolysis in step (3) occurs. This linker region can in principle be any divalent group but is conveniently another single- or double-stranded oligonucleotide fragment. In one embodiment, the linker region is unable to hybridise substantially to the first oligonucleotide.


The first oligonucleotide further includes first and second regions bearing different detectable elements in an undetectable state on the X′ and Y′ end sides of a third region located therebetween which includes the capture site mentioned above. This capture site comprises a single nucleotide whose nucleobase is complementary to the one borne by the nucleotide to be detected, making the probe system highly selective for that particular nucleotide. Thus, for example, if the analyte is derived from DNA and the first, second and third oligonucleotides are deoxyribonucleotides, the capture site will be highly selective for deoxyadenosine triphosphate if the capture site nucleotide bears a thymine base. In one useful embodiment of the invention therefore step (2) may be carried out in the presence of a plurality of probe system types; for example one, two, three or four probe system types each of which comprises a first oligonucleotide having (a) a different capture site nucleotide characteristic of the various different nucleobases sought and (b) different associated detectable elements. In one embodiment all the first oligonucleotides are of the type described herein. In another a mixture of at least one of this type with one or more of the first oligonucleotides described in WO 2016/012789 is employed.


The capture site itself forms part of a restriction site element which enables a restriction enzyme recognition site to be created when the used probe is formed. Typically the restriction site element is between 4 and 8 nucleotides long, but in principle it may be of any length.


The third region also includes an exonuclease-blocking site designed to halt X′-Y′ exonucleolytic digestion of the first oligonucleotide strand of the used probe in step (3) before the recognition site is reached. By this means, the detectable elements associated with both first and second regions can only be released in detectable form if the first oligonucleotide strand of the used probe is cleaved at the recognition site. This exonuclease-blocking element is therefore suitably located in the backbone of the third region on the X′ end side of the restriction site element. It is suitably an element selected from a phosphorothioate linkage, a G-Quadruplex, a boronated nucleotide, an inverted dT or ddT, a C3 spacer, a phosphate group, or any of the various exonuclease-inhibiting linkers or spacers commonly available as internal oligonucleotide modifications.


Typically, the first oligonucleotide is up to 150 nucleotides long, preferably between 20 and 100 nucleotides. In one embodiment, the second oligonucleotide is longer than the complementary X′ side flanking region of the first oligonucleotide by up to 10 preferably from 1 to 5 nucleotides. In another, there is a single nucleotide mismatch between the 3′ end of the first oligonucleotide and the nucleotide opposite it on the second or third oligonucleotide to prevent the nucleoside triphosphate being captured by the polymerase at this point. In yet another embodiment, the X′ end of the third oligonucleotide includes an element resistant to exonucleolytic degradation to ensure that the fourth oligonucleotide produced in step (3) is not itself subsequently digested. This can be achieved for example by way of incorporating one or more phosphorothioate linkages, a G-Quadruplex, a boronated nucleotide, an inverted dT or ddT, a C3 spacer, a phosphate group or any of the various exonuclease-inhibiting linkers and spacers commonly available, at or near that particular end.


It is a feature of the first oligonucleotide that it is labelled with first and second detectable element regions each of which is labelled, preferably multiply labelled, with its own unique and characteristic type of detectable element(s) and that these detectable elements are substantially undetectable when the probe system is in an unused state. Suitably these detectable elements are ones adapted to be detected after an optical event has taken place. In one preferred embodiment, the detectable elements comprise fluorophores and each unused first oligonucleotide is essentially non-fluorescing at those wavelengths where the fluorophores are designed to be detected. Thus, although a fluorophore may exhibit general, low-level background fluorescence across a wide part of the electromagnetic spectrum, there will typically be one or a small number of specific wavelengths or wavelength envelopes where the intensity of the fluorescence is at a maximum. It is at one or more of these maxima where the fluorophore is characteristically detected that essentially no fluorescence should occur. In the context of this patent, by the term ‘essentially non-fluorescing’ or equivalent wording is meant that the intensity of fluorescence of the total number of fluorophores attached to the first oligonucleotide at the relevant characteristic wavelength or wavelength envelope is less than 25%; preferably less than 10%; more preferably less than 1% and most preferably less than 0.1% of the corresponding intensity of fluorescence of an equivalent number of free fluorophores.


If fluorophores are employed in a given first and/or second region, then in principle, any method can be used to ensure that in the first oligonucleotide's unused state they are essentially non-fluorescing. One approach is to additionally attach quenchers in close proximity to them. Another is based on the observation that when multiple fluorophores are located in close proximity to each other they tend to quench each other sufficiently well that the criterion described in the previous paragraph can be achieved without the need for quenchers. In this context of this patent, what constitutes ‘close proximity’ between fluorophores or between fluorophores and quenchers will depend on the particular fluorophores and quenchers used and possibly the structural characteristics of the first oligonucleotide. Consequently, it is intended that this term should be construed with reference to the required outcome rather than any particular structural arrangement of the various elements. However, and for the purposes of providing exemplification only, it is pointed out that when adjacent fluorophores or adjacent fluorophores and quenchers are separated by a distance corresponding to the characteristic Förster distance (typically less than 5 nm) sufficient quenching will generally be achieved.


Suitably each first and second region is labelled with at least 1, preferably up to 20 fluorophores. To obtain maximum advantage, it is preferred that each region is labelled with at least 2 preferably at least 3 fluorophores. Consequently, ranges constructed from any permutation of these maximum and minimum numbers are specifically envisaged herein. If quenchers are employed, it is likewise preferred that each region is labelled with up to 20, preferably up to 10 and most preferably up to 5 of the same.


As regards the fluorophores themselves, they can in principle be chosen from any of those conventionally used in the art including but not limited to xanthene moieties e.g. fluorescein, rhodamine and their derivatives such as fluorescein isothiocyanate, rhodamine B and the like; coumarin moieties (e.g. hydroxyl-, methyl- and aminocoumarin) and cyanine moieties such as Cy2, Cy3, Cy5 and Cy7. Specific examples include fluorophores derived from the following commonly used dyes: Alexa dyes, cyanine dyes, Atto Tec dyes, and rhodamine dyes. Examples also include: Atto 633 (ATTO-TEC GmbH), Texas Red™, Atto 740 (ATTO-TEC GmbH), Rose Bengal, Alexa Fluor™ 750 C5-maleimide (Invitrogen), Alexa Fluor™ 532 C2-maleimide (Invitrogen) and Rhodamine Red C2-maleimide and Rhodamine Green as well as phosphoramadite dyes such as Quasar 570. Alternatively, a quantum dot or a near infra-red dye such as those supplied by LI-COR Biosciences can be employed. The fluorophore is typically attached to the first oligonucleotide via a nucleotide base using chemical methods known in the art.


Suitable quenchers include those which work by a Förster resonance energy transfer (FRET) mechanism. Examples of commercially available quenchers which can be used in association with the above mentioned-fluorophores include but are not limited to DDQ-1, Dabcyl, Eclipse, Iowa Black FQ and RQ, IR Dye-QC1, BHQ-0, BHQ-1, -2 and -3 and QSY-7 and -21.


In one embodiment the second and third oligonucleotides are not labelled with detectable elements.


Step (2) is suitably carried out by contacting each single nucleotide in the stream with one or more probe systems as described above at a temperature in the range 20 to 80° C.


The product of step (2) of the method of the invention is, as mentioned above, a substantially double-stranded used probe whose constituent strands are respectively the first oligonucleotide and a complementary fourth oligonucleotide which when read in its Y′-X′ direction is comprised of the second oligonucleotide, the captured nucleotide and finally the third oligonucleotide. Thus, if the second and third oligonucleotides have previously been joined together by a linker region it will be readily apparent that the fourth oligonucleotide will comprise a closed loop strand that is highly resistant to exonucleolysis.


In step (2a) the used probe is treated with a substitution-dependent or substitution sensitive restriction endonuclease to enable the method to detect the presence or absence of a nucleobase substitution (e.g. an alkyl, amino, nitro, halide, sulphur or like substitution) in the single nucleotide. By this means the method can be used, for example, to detect the presence or absence of methylated nucleobases although this should not be construed as limiting. In this particular application, either a methylation-dependent or a methylation-sensitive restriction endonuclease, preferably at a temperature in the range up to 70° C., is used to cleave the first oligonucleotide strand at the recognition site if and only if the single nucleotide captured comprises a methylated or non-methylation nucleobase (as the case may be). For example, the methylation-sensitive endonucleases MspI and HpaII cleave double-stranded oligonucleotides only at a recognition site having the sequence 5′-CC*GG-3′ where C* is non-methylated cytosine and the DnpI endonuclease cleaves only at a recognition site having the sequence 5′-GA*TC-3′ where A* is methylated adenine. It is an important feature of this step that only the first oligonucleotide strand of the used probe is capable of being cleaved by the endonuclease either by virtue of the restriction enzyme being a nicking endonuclease or by virtue of the presence of the endonucleolysis-directing linkage derived from the second or third oligonucleotide in the other strand. As a consequence, if the endonuclease employed is methylation-dependent and the single nucleotide captured by the capture site is methylated the restriction endonuclease will cleave the first oligonucleotide strand of the used probe in two, making both first and second regions susceptible to digestion in step (3) whilst if the single nucleotide captured is un-methylated the restriction enzyme will have no effect and digestion of the first oligonucleotide strand will continue in a X′-Y′ direction until stopped by the exonuclease-blocking site. Thus, only the detectable elements associated with the first region will be liberated in a detectable form. Conversely, if the endonuclease employed is methylation-sensitive and the single nucleotide captured is non-methylated the restriction endonuclease will cleave the first oligonucleotide strand of the used probe in two, making both first and second regions susceptible to digestion in step (3) whilst if the single nucleotide captured is methylated the restriction enzyme will have no effect and digestion of the first nucleotide strand will continue in a X′-Y′ direction until stopped by the exonuclease-blocking site. Thus, only the detectable elements associated with the first region will be liberated in a detectable form. This means that in step (6) the observer will detect both types of detectable element if the single nucleotide captured was methylated and only the second type if it was un-methylated when using a methylation-dependent restriction enzyme, with the opposite being true when using a methylation-sensitive restriction enzyme. In another embodiment, cleavage at the restriction site allows the oligonucleotide fragment comprising the first region to melt, resulting in only the second region being digested, whilst lack of cleavage again results in the digestion of only the first region.


In step (3) the product of step (2a) is treated with an enzyme at a temperature in the range 30 to 100° C. In this step the strand or parts of the strand of the used probe derived from the first oligonucleotide are digested in the X′-Y′ direction into their constituent nucleotides (deoxyribonucleoside monophosphates or ribonucleoside monophosphates as the case may be) in the process separating the detectable elements from each other and causing them to become unquenched and therefore detectable. Thus if the detectable elements are fluorophores which have been quenched into an undetectable state in the first oligonucleotide, step (3) will liberate the fluorophores from each other and any quenchers thereby causing them to fluoresce. As the digestion process occurs the observer therefore sees rapid growth in the fluorescence signal or signals as a cascade of fluorophores is generated. The characteristics of this fluorescence then indirectly reflects the nature of the single nucleotide originally captured by the relevant probe system.


Enzymes which can be used in step (3) comprise exonucleases or polymerases which exhibit 3′-5′ or 5′-3′ exonucleolytic activity. The 3′-5′ class of enzyme includes Q5, Q5 Hot Start, Phusion, Phusion HS, Phusion II, Phusion II HS, Dnase I (RNase-free), Exonuclease I or III (ex E. coli), Exonuclease T, Exonuclease V (RecBCD), Lambda Exonuclease, Micrococcal Nuclease, Mung Bean Nuclease, Nuclease BAL-31, RecJf, T5 Exonuclease and T7 Exonuclease. Step (3) is preferably carried out at a temperature in the range 60-90° C.


In one embodiment, at the end of step (3) and before step (4) the reaction mixture is preferably heated to a temperature in the range 80-100° C. to remove from the fourth oligonucleotide any attached residual oligonucleotide fragments left over from the digestion.


In step (4) the fourth oligonucleotide, now present in single-stranded form, is caused to hybridise to another first oligonucleotide molecule thereby producing a new substantially double-stranded oligonucleotide product corresponding to, i.e. one having the same chemical and physical structure as the used probe. This product is then digested in a repeat of steps (2a) and (3) thereby releasing further detectable elements in a detectable state and again regenerating the fourth oligonucleotide. Thereafter, according to step (5), steps (2a), (3) and (4) are allowed to continue in a cyclic way causing further enhancement in the signal from the free detectable elements, e.g. the fluorescence signal, in principle until substantially all of the first oligonucleotide has been consumed. As a consequence the observer sees a much greater enhancement of the fluorescence signal than has been previously obtained.


Thereafter, and in step (6), the detectable elements liberated in the various iterations of step (3) are detected and the nature of the nucleobase attached to the original single nucleotide determined. By carrying out the method of the invention systematically for all the single nucleotides in the stream generated in step (1), data characteristic of the sequence of original nucleic acid analyte can therefore be generated and methylation sites located. Methods of doing this detection are well-known in the art; for example fluorescence may be detected using a photodetector or an equivalent device tuned to the characteristic fluorescence wavelength(s) or wavelength envelope(s) of the various fluorophore types. This in turn causes the photodetector to generate a characteristic electrical signal which can be processed and analysed in a computer using known algorithms. In one embodiment, a period of time is allowed to elapse between steps (5) and (6) to ensure that the number of detectable elements in a detectable state has grown to a maximum.


In one particularly preferred embodiment, the method of the present invention is carried out wholly or partially in a stream of microdroplets, at least some of which contain a single nucleotide; suitably an ordered stream. Such a method may begin, for example, by inserting the nucleotides generated in step (1) one-by-one into a corresponding stream of aqueous microdroplets maintained in an immiscible carrier solvent such as a hydrocarbon or silicone oil to help preserve the ordering. Advantageously, this can be achieved by directly creating the microdroplets downstream of the pyrophosphorolysis reaction zone; for example by causing the reaction medium to emerge from a microdroplet head of suitable dimensions into a flowing stream of the solvent. Alternatively, small aliquots of the reaction medium from step (1) can be regularly and sequentially injected into a stream of pre-existing aqueous microdroplets suspended in the solvent. If this latter approach is adopted, each microdroplet may already contain the various components of the probe system(s) together with the enzymes and any other reagents (e.g. buffer) required to effect steps (2) to (5). In yet another approach, the microdroplets created in the former embodiment can be caused to coalesce subsequently with a stream of such pre-existing microdroplets to achieve a similar outcome. In these microdroplet methods, step (6) then preferably involves interrogating each microdroplet to identify the detectable elements liberated and hence the nature of the nucleoside triphosphate it originally contained. However in a most preferred embodiment of all the reaction medium from step (1) is printed as a series of microdroplets onto a moveable surface covered with the carrier solvent at droplet-receiving locations on the surface optionally comprised of wells and the like. Thereafter, the other components mentioned above may be introduced sequentially into each location in the form of further aqueous microdroplets which undergo coalescence with the original reaction medium. If so desired, these further aqueous microdroplets may be introduced only after an incubation period has elapsed. If this approach is adopted then step (6) involves the interrogation of each droplet-receiving location in turn.


In any of these approaches, to avoid the risk that a given microdroplet contains more than one nucleotide it is preferred to release each nucleotide in step (1) at a rate such that each filled microdroplet is separated on average by from 1 to 20 preferably 2 to 10 empty ones. Suitably the microdroplets employed have a finite diameter of less than 100 microns, preferably less than 50 microns, more preferably less than 20 microns and even more preferably less than 15 microns. Most preferably of all their diameters are in the range 2 to 20 microns. In one embodiment, the microdroplet generation rate through the whole system is in the range 50 to 3000 microdroplets per second preferably 100 to 2000.


According to a second aspect of the invention there is provided a multi-component biological probe system characterised by comprising (a) a first single-stranded oligonucleotide labelled with first and second regions of different detectable element types in an undetectable state located respectively on the X′ and Y′ end sides of a third region comprising a single nucleotide capture site complementary to one of the nucleobases of DNA or RNA having (1) an associated restriction enzyme recognition site element and (2) an exonuclease-blocking site on its X′ side and (b) second and third unlabelled single-stranded oligonucleotides capable of hybridising respectively to the complementary X′ and Y′ side regions on the first oligonucleotide flanking the capture wherein either X′ is 3′ and Y′ is 5′ or X is 5′ and Y is 3′.


In a first embodiment the third oligonucleotide includes an endonucleolysis-directing linkage at or close to its Y′ end. In another the second oligonucleotide has a similar linkage at or close to its X′ end. In one embodiment this linkage is a phosphorothioate linkage. In another, it is one selected from the group consisting of phosphoramidite linkages such as C3 spacer, Spacer 9, Spacer 12, Spacer 18, d-Spacer (abasic furan type) or other like spacers or modifications used between nucleotides in oligonucleotide synthesis. Preferably, the Y′ end of the second oligonucleotide and X′ end of the third oligonucleotide are connected by a linker region suitably a single- or double-stranded oligonucleotide region. In another embodiment the detectable elements in the first and second regions are fluorophores of the type mentioned above optionally in the presence of quenchers. In yet another embodiment the second oligonucleotide is longer than the X′ flanking region of the first oligonucleotide. Preferably, the nucleotide at the 3′ end of the first oligonucleotide is a mismatch with the corresponding nucleotide in the second or third oligonucleotide.


When the second or third oligonucleotides are not connected by a linker region the second and/or third oligonucleotide suitably also includes an element resistant to exonucleolytic degradation at its X′ end.


The method and probe systems described above can be used to advantage in a sequencing device and such devices and uses are envisaged as being within the scope of the invention.


The present invention in its various aspects will now be illustrated with reference to the following example.







EXAMPLE 1
Preparation and Use of a Probe System

A single-stranded first oligonucleotide 1 was prepared, having the following nucleotide sequence:









5′-ACATCACGACTATCTAYYZCAGGACGG/mC/CCGTTAT/C3/CAXXQ





AACCGCACGA*-3′







wherein A, C, G, and T represent nucleotides bearing the relevant characteristic nucleotide base of DNA; X represents a deoxythymidine nucleotide (T) labelled with Atto 655 dye, Q represents a deoxythymidine nucleotide labelled with a BHQ-2 quencher, Y represents a deoxythymidine nucleotide labelled with Atto 532 dye, Z represents a deoxythymidine nucleotide labelled with a BHQ-1 quencher, /C3/ represents a C3-spacer modification, /mC/ represents a 5-methylcytosine nucleotide and * represents a hexanediol modification. It further comprises a capture region (G nucleotide) selective for capturing deoxycytosine triphosphate nucleotides (dCTPs) in a mixture of deoxynucleotide triphosphates (dNTPs).


Two further single-stranded oligonucleotides 2 and 3 were also prepared, each having regions complementary to regions of oligonucleotide 1, with a single base gap formed at the capture region of oligonucleotide 1 when both oligonucleotides 2 and 3 are annealed thereto. Oligonucleotide 2 had the following nucleotide sequence:











5′-GTGATGTTAACGTGCGGTTAAATGATAACG**GG-3′







wherein ** represents a phosphorothioate linkage. Oligonucleotide 3 had the following sequence:











5′-PCGTCCTGAAATAGATAGTCGTGATGTCTAGCATGACAT







A/IdT/-3′







wherein P represents a 5′ phosphate group and /IdT/ represents a 3′ inverted dT base.


A reaction mixture comprising the probe system was then prepared. It had a composition corresponding to that derived from the following formulation:


20 nM oligonucleotide 1


10 nM oligonucleotide 2


50 nM oligonucleotide 3


10 nM dCTP, 5 mdCTP or equivalent volume of water


1× buffer pH 7.6


1.4 reactions/mL Pfu Ultra Fusion II Hot Start polymerase


57 U/mL Bst Large Fragment polymerase


71 U/mL E. coli ligase


3.6 U/mL AOXI


14 U/mL Thermostable Inorganic Pyrophosphatase

  • wherein 5× buffer comprised the following mixture:


60 uL Trizma hydrochloride, 1M, pH 7.6


33.3 uL KCl, 3M


25 uL aqueous MgCl2, 1M


20 uL Nicotinamide adenine dinucleotide, 100 uM


2.5 uL Dithiothreitol, 1M


50 uL Triton X-100 surfactant (10%)


Water to 1 mL


Capture of the dCTPs or 5 mdCTPs and ligation of oligonucleotide 2 to oligonucleotide 3 was then carried out by incubating the mixture at 37° C. for 10 minutes after which the temperature was increased to 60° C. for a further 20 minutes. The temperature was then increased to 95° C. for 30 seconds before finally being decreased to 70° C. for 90 minutes. The reaction mixture was then illuminated using the 633 nm line of a Helium-Neon laser and the 532 nm line of a diode laser and the resulting characteristic fluorescence of the Atto 655 and Atto 532 dyes detected using a camera.


The growth over time in intensity of fluorescence over background in the presence of the dNTP component of the reaction was monitored and the results shown graphically in FIGS. 1 and 2. From these it can be seen that the probe system efficiently captures both dCTPs and 5 mdCTPs. In the presence of dCTP a fluorescence signal is observed only in the Atto 655 channel while in the presence of 5 mdCTP a signal is observed in both the Atto 655 and Atto 532 channels. In a comparative experiment where no dNTPs were present in the reaction mixture neither of the Atto 655 and Atto 532 dyes on oligonucleotide 1 exhibited fluorescence to any significant extent.

Claims
  • 1. A method of sequencing a nucleic acid comprising the steps of: (a) generating a stream of single nucleotides by progressive pyrophosphorolysis of the nucleic acid;(b) producing at least one substantially double-stranded oligonucleotide used probe by reacting, in the presence of a polymerase and a ligase, one of the single nucleotides with a corresponding probe system comprising: (i) a first single-stranded oligonucleotide labelled with first and second regions of a characteristic detectable element in an undetectable state located respectively on the X′ and Y′ sides of a third region comprising a restriction enzyme recognition site element including a capture site and an exonuclease-blocking site on the X′ side thereof wherein either X′ is 3′ and Y′ is 5′, or X′ is 5′ and Y′ is 3′; and(ii) second and third single-stranded oligonucleotides that hybridize to complementary regions on the first oligonucleotide flanking the capture site;(c) either (iii) treating the used probe with a conventional or nicking substitution-dependent restriction endonuclease to cut the first oligonucleotide strand at the recognition site if and only if the single nucleotide captured comprises a nucleobase which is substituted or(iv) treating the used probe with a conventional or nicking substitution-sensitive restriction endonuclease to cut the first oligonucleotide strand at the recognition site if and only if the single nucleotide captured comprises a nucleobase which is unsubstituted;(d) digesting the first oligonucleotide strand of the used probe with an enzyme having double-stranded exonucleolytic activity in the X′-Y′ direction corresponding to the first oligonucleotide to yield detectable elements derived from either the first region, the second region, or the first and second regions in a detectable state and a single-stranded fourth oligonucleotide which is at least in part the sequence complement of the first oligonucleotide;(e) reacting the fourth oligonucleotide with another first oligonucleotide to produce a substantially double-stranded oligonucleotide product corresponding to the used probe;(f) repeating steps (c), (d) and (e) in a cycle; and(g) detecting the detectable elements released in each iteration of step (d) wherein if the endonuclease employed is of the conventional type the second or third oligonucleotide includes an endonucleolysis-directing linkage at or close to its X′ or Y′ end respectively.
  • 2. The method of claim 1 wherein the Y′ end of the second oligonucleotide and the X′ end of the third oligonucleotide are connected by a linker region.
  • 3. The method of claim 2 wherein the linker region comprises an oligonucleotide region.
  • 4. The method of claim 2 wherein the fourth oligonucleotide generated comprises a closed loop.
  • 5. The method of claim 1, wherein the second oligonucleotide (a) hybridises to a flanking region on the X′ side of the capture region and (b) is longer than the region on that side.
  • 6. The method of claim 1, wherein there is at least one nucleotide base mismatch between the 3′ end of the first oligonucleotide and the corresponding region of the second or third oligonucleotide.
  • 7. The method of claim 1, wherein the detectable elements are fluorophores rendered undetectable in the first oligonucleotide by at least one quencher.
  • 8. The method of claim 1, wherein a substituted single nucleotide triphosphate detected includes either a methylated cytosine or methylated adenine nucleotide base.
  • 9. The method of claim 1, wherein steps (c) and (d) are carried out at different temperatures.
  • 10. The method of claim 1, wherein the probe system further comprises a plurality of first oligonucleotide types each provided with a different capture region and characteristic first or first and second detectable elements.
  • 11. The method of claim 10 wherein up to four different sets of oligonucleotide probe systems are employed, the first oligonucleotide of each set having a capture region selective for one of the characteristic nucleotide bases of naturally-occurring DNA or RNA and different first or pairs of first and second detectable elements.
  • 12. The method of claim 1, wherein step (a) further comprises containing each single nucleoside triphosphate in a corresponding microdroplet and that steps (b) to (g) and (c) are carried out in each microdroplet.
  • 13. The method of claim 12 wherein the results obtained by applying step (6) to each microdroplet are assembled into a stream of data characteristic of the sequence of the nucleic acid.
  • 14. A multi-component biological probe system comprising (a) a first single-stranded oligonucleotide labelled with first and second regions of different detectable element types in an undetectable state located respectively on the X′ and Y′ end sides of a third region comprising a single nucleotide capture site complementary to one of the nucleobases of DNA or RNA having (i) an associated restriction enzyme recognition site and (ii) an exonuclease-blocking site on its X′ side and (b) second and third unlabelled single-stranded oligonucleotides capable of hybridising respectively to the complementary X′ and Y′ side regions on the first oligonucleotide flanking the capture site wherein either X′ is 3′ and Y′ is 5′ or X′ is 5′ and Y′ is 3′.
  • 15. The biological probe system of claim 14 wherein the second or third oligonucleotide includes an endonucleolysis-directing linkage at or close to its X′ or Y′ end respectively.
  • 16. The biological probe system of claim 14, wherein the Y′ end of the second oligonucleotide and the X′ end of the third oligonucleotide are connected by a linker region.
  • 17. The biological probe system of claim 16 wherein the linker region comprises an oligonucleotide region.
  • 18. The biological probe system of claim 14, wherein the detectable elements in the first and second regions are fluorophores optionally associated with quenchers.
  • 19. The biological probe system of claim 14, wherein the second oligonucleotide is longer than the X′ side flanking region of the first oligonucleotide.
  • 20. The biological probe system of claim 14, wherein a nucleotide at the 3′ end of the first oligonucleotide is a mismatch with the corresponding nucleotide in the second or third oligonucleotide.
  • 21. The biological probe system of claim 14, wherein the X′ end of the third oligonucleotide comprises a region resistant to exonucleolysis.
  • 22. The biological probe system of claim 15, wherein the endonuclease-directing linkage is a phosphorothioate or phosphoramidite linkage.
  • 23. The biological probe system of claim 14, comprising from one to four different first oligonucleotides types differing in sequence, the nucleotide base characteristic of the capture region, and the detectable elements used.
Priority Claims (1)
Number Date Country Kind
16157229.2 Feb 2016 EP regional
PCT Information
Filing Document Filing Date Country Kind
PCT/EP2017/054306 2/24/2017 WO 00