The application contains a Sequence Listing which has been submitted electronically in .XML format and is hereby incorporated by reference in its entirety. Said .XML copy, created on Jan. 13, 2025, is named “10669-402US1.xml” and is 68,218 bytes in size. The sequence listing contained in this .XML file is part of the specification and is hereby incorporated by reference herein in its entirety.
The present disclosure is related to a system for the detection of single nucleotide variations in analytes using a universal molecular beacon probe and additional oligonucleotides.
Single nucleotide variations (SNVs) are the most common cause of genetic alterations in the human genome.1-3 The identification of specific SNVs aids in the management of human genetic disorders, and early SNV detection in clinically relevant microbes is crucial in treating infections caused by drug resistant pathogens.4-8
Traditional methods for SNV detection include DNA sequencing, polymerase chain reaction (PCR) with melting curve analysis,9 and hybridization assays. DNA sequencing, such as next-generation sequencing (NGS), requires expensive instrumentation and a significant amount of time for data processing.10 PCR has an astounding range of applications from probe-based real-time PCR to post-amplification product analysis but relies on expensive instrumentation with precise temperature control for SNV differentiation.11-14 Hybridization assays utilizing peptide nucleic acid and locked nucleic acid probes,15,16 cycling probe technology,17 TaqMan,18 and Molecular Beacon (MB)19 probes all suffer from the affinity/selectivity dilemma, which declares that tight binding of a probe to an analyte is associated with low selectivity.20,21 Recent advances in SNV detection include ratio sensing via depletion of wild-type (WT) target,22 programmable DNAzymes,23 the use of CRISPR/Cas systems in conjunction with hybridization chain reactions,24 and detection via lateral flow dipsticks after recombinase polymerase amplification with altered primers.25 The best studied hybridization probes, however, all share the challenges of inefficient hybridization with RNA and DNA analytes folded in stable secondary structures, difficulty differentiating between wild-type (WT) and SNV-containing DNA at ambient temperatures, and their high synthetic cost.20,21,26,27
Of the hybridization assays, the MB probe, a fluorophore- and quencher-labeled DNA harpin, has one of the most elegant designs.19,28 The GC rich stem enables the quencher and fluorophore to remain in proximity for more efficient quenching in the absence of the complementary analyte sequence. Upon hybridization to the complementary analyte, the MB probe opens into an elongated conformation, and fluorescence is observed.19,28 A typical MB probe achieves a limit of detection (LOD) of ˜1 nM,28 establishing it as a significant diagnostic tool capable of detecting specific nucleic acids after amplification.29-31
Although the MB probe is able to differentiate SNVs in a broader temperature range than linear (hairpin-free) probes,26 they fail in differentiating SNV at ambient (0-40° C.) temperatures and, in practice, require costly instrumentation capable of measuring DNA-melting profiles.29-31 Moreover, the MB probe is unable to hybridize with analytes folded in stable secondary structures because it first needs to overcome unfolding of its own stem-loop before hybridizing with another nucleic acid sequence.19,32
To enable SNV differentiation at ambient temperatures, in a previous study, DNA nanotechnology was utilized, and an MB probe-based sensor was designed, which forms a four-stranded complex in the presence of an analyte, dubbed OWL sensor.33 In the OWL structure, P and R strands hybridize to the analyte adjacent to each other and cooperatively open the MB probe hairpin, which is now called OWL1. Indeed, OWL1 sensor differentiated SNVs in the entire range of 5-32° C. with single-base mismatched analytes producing only background fluorescence.33
Importantly, at least in part, this unprecedented SNV selectivity was attributed to the unique rigid OWL nanoscale structure: both strands P and R must fold in ‘circular’ forms with 3′- and 5′-terminal base pairs being in stacking interactions with each other, thus creating a structural lock. That is, if the probe binds a 10-nucleotide segment (which is close to the shortest possible in practice), a single base mismatch will destabilize the complex by only ˜10% on average. While increasing the length of the recognized fragment provides greater affinity, a mismatch then contributes to a proportionally lower destabilization effect, leading to even poorer SNV selectivity and differentiation. Balancing probe affinity and selectivity is a fundamental limitation of the conventional hybridization probes. That is, a rigid object fails more easily than a flexible one. Based on this idea, OWL sensor, now called OWL1, that forms a rigid and structurally imperfect nucleotide complex when it binds to a complementary SNV on the analyte was designed, OWL1 sensor. In OWL1 sensor, a single base mismatch serves the role of ‘stress’ and causes the collapse of the entire fluorescent structure, allowing the sensor to effectively differentiate between fully complementary and mismatched SNV on the analytes. This feature of the nanoscale structure makes the OWL sensor structurally constrained and less tolerant to mismatches in comparison with other hybridization probes that possess ‘unlocked’ ends (e.g. MB probe).33
OWL1 has another advantage over previously known SNV probe. Adjusting the OWL1 sensor to each new analyte requires changing only analyte binding portion of R and P strands, while the same MB probe can be used for the analysis of any nucleic acid sequence. This allows for an opportunity to optimize only one universal MB (UMB) probe, which reduces the optimization efforts and the assay cost in comparison with the MB probe approach if multiple sequences are to be detected.
However, the OWL1 sensor structure is too ‘fragile’ to form a complex with RNA or ssDNA analytes folded in stable secondary structures.
Further aspects of the present disclosure will be more readily appreciated upon review of the detailed description of its various embodiments, described below, when taken in conjunction with the accompanying drawings.
The drawings are described in greater detail in the description and examples below.
The details of some exemplary embodiments of the methods and systems of the present disclosure are set forth in the description below. Other features, objects, and advantages of the disclosure will be apparent to one of skill in the art upon examination of the following description, drawings, examples and claims. It is intended that all such additional systems, methods, features, and advantages be included within this description, be within the scope of the present disclosure, and be protected by the accompanying claims.
Before the present disclosure is described in greater detail, it is to be understood that this disclosure is not limited to particular embodiments described, and as such may, of course, vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting, since the scope of the present disclosure will be limited only by the appended claims.
Where a range of values is provided, it is understood that each intervening value, to the tenth of the unit of the lower limit unless the context clearly dictates otherwise, between the upper and lower limit of that range and any other stated or intervening value in that stated range, is encompassed within the disclosure. The upper and lower limits of these smaller ranges may independently be included in the smaller ranges and are also encompassed within the disclosure, subject to any specifically excluded limit in the stated range. Where the stated range includes one or both of the limits, ranges excluding either or both of those included limits are also included in the disclosure.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs. Although any methods and materials similar or equivalent to those described herein can also be used in the practice or testing of the present disclosure, the preferred methods and materials are now described.
All publications and patents cited in this specification are herein incorporated by reference as if each individual publication or patent were specifically and individually indicated to be incorporated by reference and are incorporated herein by reference to disclose and describe the methods and/or materials in connection with which the publications are cited. The citation of any publication is for its disclosure prior to the filing date and should not be construed as an admission that the present disclosure is not entitled to antedate such publication by virtue of prior disclosure. Further, the dates of publication provided could be different from the actual publication dates that may need to be independently confirmed.
As will be apparent to those of skill in the art upon reading this disclosure, each of the individual embodiments described and illustrated herein has discrete components and features which may be readily separated from or combined with the features of any of the other several embodiments without departing from the scope or spirit of the present disclosure. Any recited method can be carried out in the order of events recited or in any other order that is logically possible.
Embodiments of the present disclosure will employ, unless otherwise indicated, techniques of medicine, organic chemistry, biochemistry, molecular biology, pharmacology, and the like, which are within the skill of the art. Such techniques are explained fully in the literature.
It must be noted that, as used in the specification and the appended claims, the singular forms “a,” “an,” and “the” include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to “a support” includes a plurality of supports. In this specification and in the claims that follow, reference will be made to a number of terms that shall be defined to have the following meanings unless a contrary intention is apparent.
As used herein, the following terms have the meanings ascribed to them unless specified otherwise. In this disclosure, “comprises,” “comprising,” “containing” and “having” and the like can have the meaning ascribed to them in U.S. patent law and can mean “includes,” “including,” and the like; “consisting essentially of” or “consists essentially” or the like, when applied to methods and compositions encompassed by the present disclosure refers to compositions like those disclosed herein, but which may contain additional structural groups, composition components or method steps (or analogs or derivatives thereof as discussed above). Such additional structural groups, composition components or method steps, etc., however, do not materially affect the basic and novel characteristic(s) of the compositions or methods, compared to those of the corresponding compositions or methods disclosed herein. “Consisting essentially of” or “consists essentially” or the like, when applied to methods and compositions encompassed by the present disclosure have the meaning ascribed in U.S. patent law and the term is open-ended, allowing for the presence of more than that which is recited so long as basic or novel characteristics of that which is recited is not changed by the presence of more than that which is recited, but excludes prior art embodiments.
Prior to describing the various embodiments, the following definitions are provided and should be used unless otherwise indicated.
In describing and claiming the disclosed subject matter, the following terminology will be used in accordance with the definitions set forth below.
The terms “oligonucleotide” and “polynucleotide” as used herein refer to any polyribonucleotide or polydeoxyribonucleotide that may be unmodified RNA or DNA or modified RNA or DNA. Thus, for instance, oligonucleotides as used herein refers to, among others, single- and double-stranded DNA, DNA that is a mixture of single- and double-stranded regions, single- and double-stranded RNA, and RNA that is mixture of single- and double-stranded regions, hybrid molecules comprising DNA and RNA that may be single-stranded or, more typically, double-stranded or a mixture of single- and double-stranded regions. The terms “nucleic acid,” “nucleic acid sequence,” or “oligonucleotide” also encompass a polynucleotide as defined above.
The term “nucleotide” as used herein refers to a sub-unit of a nucleic acid (whether DNA or RNA or an analogue thereof) which may include, but is not limited to, a phosphate group, a sugar group and a nitrogen containing base, as well as analogs of such sub-units. Other groups (e.g., protecting groups) can be attached to the sugar group and nitrogen containing base group. It will be appreciated that, as used herein, the terms “nucleotide” and “nucleoside” will include those moieties which contain not only the naturally occurring purine and pyrimidine bases, e.g., adenine (A), thymine (T), cytosine (C), guanine (G), or uracil (U), but also modified purine and pyrimidine bases and other heterocyclic bases which have been modified (these moieties are sometimes referred to herein, collectively, as “purine and pyrimidine bases and analogs thereof”). Such modifications include, e.g., diaminopurine and its derivatives, inosine and its derivatives, alkylated purines or pyrimidines, acylated purines or pyrimidines thiolated purines or pyrimidines, and the like, or the addition of a protecting group such as acetyl, difluoroacetyl, trifluoroacetyl, isobutyryl, benzoyl, 9-fluorenylmethoxycarbonyl, phenoxyacetyl, dimethylformamidine, N,N-diphenyl carbamate, or the like. The purine or pyrimidine base may also be an analog of the foregoing; suitable analogs will be known to those skilled in the art and are described in pertinent texts and literature. Common analogs include, but are not limited to, 1-methyladenine, 2-methyladenine, N6-methyladenine, N6-isopentyladenine, 2-methylthio-N6-isopentyladenine, N,N-dimethyladenine, 8-bromoadenine, 2-thiocytosine, 3-methylcytosine, 5-methylcytosine, 5-ethylcytosine, 4-acetylcytosine, 1-methylguanine, 2-methylguanine, 7-methylguanine, 2,2-dimethylguanine, 8-bromoguanine, 8-chloroguanine, 8-aminoguanine, 8-methylguanine, 8-thioguanine, 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, 5-ethyluracil, 5-propyluracil, 5-methoxyuracil, 5-hydroxymethyluracil, 5-(carboxyhydroxymethyl) uracil, 5-(methylaminomethyl) uracil, 5-(carboxymethylaminomethyl)-uracil, 2-thiouracil, 5-methyl-2-thiouracil, 5-(2-bromovinyl) uracil, uracil-5-oxyacetic acid, uracil-5-oxyacetic acid methyl ester, pseudouracil, 1-methylpseudouracil, queuosine, inosine, 1-methylinosine, hypoxanthine, xanthine, 2-aminopurine, 6-hydroxyaminopurine, 6-thiopurine, and 2,6-diaminopurine.
The term “hybridization” as used herein refers to the process of association of two nucleic acid strands to form an anti-parallel duplex stabilized by means of hydrogen bonding between residues of the opposite nucleic acid strands. The terms “hybridizing” and “binding”, with respect to polynucleotides, are used interchangeably and is meant the formation of A-T and C-G base pairs between the nucleotide sequence of a fragment of a segment of a polynucleotide and a complementary nucleotide sequence of an oligonucleotide. By complementary is meant that at the locus of each A, C, G or T (or U in a ribonucleotide) in the fragment sequence, the oligonucleotide sequence has a T, G, C or A, respectively. The hybridized fragment/oligonucleotide is called a “duplex.”
The terms “hybridizing specifically to” and “specific hybridization” and “selectively hybridize to,” as used herein refer to the binding, duplexing, or hybridizing of a nucleic acid molecule preferentially to a particular nucleotide sequence under stringent conditions.
The term “complementary” as used herein refers to a sufficient number in the oligonucleotide of complementary base pairs in its sequence to interact specifically (hybridize) with the target nucleic acid sequence to be amplified or detected. As known to those skilled in the art, a very high degree of complementarity is needed for specificity and sensitivity involving hybridization, although it need not be 100%. Thus, for example, an oligonucleotide that is identical in nucleotide sequence to an oligonucleotide disclosed herein, except for one base change or substitution, may function equivalently to the disclosed oligonucleotides. A “complementary DNA” or “cDNA” gene includes recombinant genes synthesized by reverse transcription of messenger RNA (“mRNA”).
The terms “analyte” and “target analyte” as used herein refer to an oligonucleotide or polynucleotide for which it is desired to detect. The analyte for use in the methods herein disclosed may be an oligonucleotide or a polynucleotide, immobilized on a solid support or in free solution, which is isolated from a plant or animal host, a cultured cell or a cell or population of cells in a tissue of a plant or animal.
The terms “probe” as used herein refers to an oligonucleotide of nucleic acid sequences of variable length for the detection of identical, similar, or complementary nucleic acid sequences by hybridization.
The term “molecular beacon” and “oligonucleotide molecular beacon” as used herein refer to a form of an oligonucleotide having a fluorophore- and quencher-moiety attached stem-loop structure of oligo-deoxyribonucleotide that is widely used for real-time detection of specific RNA/DNA sequences. In the stem-loop conformation, a fluorophore is brought close to the quencher enabling efficient fluorescent quenching. Hybridization to a complementary DNA/RNA target analyte switches the molecular beacon conformation to an elongated form, which is accompanied by the fluorescence increase.
The term “fluorophore” as used herein refers to any fluorescent dye group whose presence can be detected by its light absorbing or light emitting properties. For example, Cy5 is a reactive water-soluble fluorescent dye of the cyanine dye family. Cy5 is fluorescent in the red region (about 650 to about 670 nm). It may be synthesized with reactive groups on either one or both of the nitrogen side chains so that they can be chemically linked to either nucleic acids or protein molecules. Labeling is done for visualization and quantification purposes. Cy5 is excited maximally at about 649 nm and emits maximally at about 670 nm, in the far-red part of the spectrum; quantum yield is 0.28. FW=792. Suitable fluorophores (chromes) for the probes of the disclosure may be selected from, but not intended to be limited to fluorescein amidite, fluorescein isothiocyanate (FITC, green), 4,5,6,7-tetrachlorofluorescein, 6-carboxy-2′,4,4′,5′,7,7-hexachlorofluorescein, cyanine dyes Cy2, Cy3, Cy3.5, Cy5, Cy5.5 Cy7, Cy7.5 (ranging from green to near-infrared), Texas Red, rhodamine 123 (hydrochloride), sulforhodamine 101 acid chloride succinimidyl ester, 2-3-(dimethylamino)-6-dimethyliminio-xanthen-9-ylbenzoate, (2E)-2-(2E,4E)-5-(2-tert-butyl-9-ethyl-6,8,8-trimethyl-pyrano 3,2-gquinolin-1-ium-4-yl) penta-2,4-dienylidene-1-(6-hydroxy-6-oxo-hexyl)-3,3-dimethylindoline-5-sulfonate, and the like. Derivatives of these dyes for use in the embodiments of the disclosure may be, but are not limited to, Cy dyes (Amersham Bioscience), Alexa Fluors (Molecular Probes Inc.), HILYTE® Fluors (AnaSpec), and DYLITE® Fluors (Pierce, Inc).
The terms “quench” or “quenches” or “quenching” or “quenched” as used herein refer to reducing the fluorescence signal produced by a molecule. It includes, but is not limited to, reducing the signal produced to zero or to below a detectable limit. Hence, a given molecule can be “quenched” by, for example, another molecule and still produce a detectable signal albeit the intensity of the signal produced by the quenched molecule is smaller when the molecule is quenched than when the molecule is not quenched.
The terms “fluorescence quencher” or “quencher” as used herein refer to a molecule that interferes with the fluorescence emitted by a fluorophore. Quenchers prevent the emission of fluorescence when they are physically near a fluorophore. This quencher can be selected from non-fluorescent dark quenchers, which are substances that absorb excitation energy from fluorophores and dissipate the energy as heat. Exemplary dark quenchers include, but are not limited to, such as Dabsyl (4-((4′-dimethylamino)phenylazo)benzoic acid), BLACK HOLE QUENCHER®, IOWA BLACK®. The quencher can also be, but is not limited to, a fluorescent molecule, for example TAMRA (carboxytetramethylrhodamine). When the quencher is a fluorescent dye, its fluorescence wavelength is typically substantially different from that of the reporter dye and the quencher fluorescence is usually not monitored during an assay.
The term “solid support” is used herein refers to any insoluble and inert inorganic or organic material, preferably having a large surface area to which surface organic molecules can be attached through bond formation or absorbed through electronic or static interactions. Representative examples of a “solid support” in context with the present invention are silicates, such as SiO2 resin, including, but not limited to, ion-exchange resins, glass, dextranes, celluloses or hydrophilic or hydrophobic polymers.
By “immobilized on a solid support” is meant that an oligonucleotide is attached to a substance at a particular location in such a manner that the system containing the immobilized fragment, primer or oligonucleotide may be subjected to washing or other physical or chemical manipulation without being dislodged from that location. A number of solid supports and means of immobilizing nucleotide-containing molecules to them are known in the art; any of these supports and means may be used in the methods of this disclosure.
Further definitions are provided in context below. Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art of molecular biology. Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present disclosure, suitable methods and materials are described herein.
The developments of DNA nanotechnology have been applied to designing smart hybridization sensors that can simplify sequence specific analysis of nucleic acids. An ideal hybridization sensor should be able to detect a targeted sequence within the context of folded RNA or dsDNA with selectivity close to absolute (measured as Df near 1 in this study) and under broad experimental conditions including all practically significant temperatures. The probe should be affordable and easily adjustable to each new analyte sequence. Toward satisfying these requirements, OWL2 sensor was developed. The OWL2 sensor shines where most hybridization probes fall short. The remarkable characteristics of the sensor include a S/B of 18 and LOD in sub-nanomolar range both for DNA and RNA analytes. It has an extraordinary ability to differentiate mismatched analytes from the fully matched ones including the most challenging G-T mismatches in the temperature range of 5-38° C. Despite looking complex, the sensor is cost efficient when applied for new analytes. The UMB reporter, the most expensive and hard to design component, is analyte independent, so that can be optimized once and then used for the analysis of any DNA or RNA sequences. These features make the OWL2 sensor a highly specific, selective, and versatile tool, which seeks to improve the field of hybridization assays.
The OWL2 sensor was able to tightly bind analytes folded in secondary structure as stable as −8.76 kcal/mol. The achieved signal turn-on ratio of ˜18 were comparable with that of a typical MB probe detecting a linear analytes and exceeded that of the earlier reported OWL1 sensor by ˜6 times. Measured LOD of 0.25 nM exceeded that of MB probe (˜1 nM) and OWL 1 sensor (˜6 nM).33 The improvements in S/B and LOD are the consequences of the tight analyte binding by T2 and T4 arms, which increased the local concentration of the analyte near R strand, which also contributes to stabilization of the entire OWL structure.
The outstanding quality of OWL2 sensor is its selectivity to SNV in the broad temperature range of 5-38° C. Such a broad range cannot be explained neither by the thermodynamic considerations or by the kinetic inversion phenomenon.33 It has been hypothesized that such performance is the consequence of the rigid structure of the OWL complex (
The OWL2 design seems more complex than ‘straightforward’ design of MB probe. In practice, the design of MB probe is associated with loop and stem interferences problems to the extent that it is impossible to design classical MB probes for a broad range of unfolded sequences let alone for folded nucleic acids. OWL2 design can accommodate any MB sequence, which opens a possibility to optimize one universal MB probe and use it for all analytes. This makes OWL2 technology more practical than MB probe technology.
In OWL2 technology, to adjust OWL sensor to each new analyte, only changing the sequences of 4 portions (T2-arm of T2 strand T4-arm of T4 strand, R strand of T3 strand), and P strand to each new analyte is required. In addition, according to Integrated DNA technologies Inc. prices, the total cost of the four DNA strands is $34.06 (minimal synthetic scale). At the same time, the cost of a new MB probe at the minimal synthetic scale is $250.00, or 7.33 times more. Overall, this makes OWL2 more affordable for research and teaching communities tank MB probe, which can save resources on MB optimization stage and if multiple analytes are to be detected.
To enable SNV differentiation at ambient temperatures, DNA nanotechnology was used to design a molecular beacon (MB) probe-based sensor, which forms a four-stranded complex in the presence of an analyte, dubbed OWL sensor33 (OWL1 sensor in
As shown in
R and P strands hybridize to two regions close or adjacent to each other on the analyte, respectively, and on the oligonucleotide of MB probe, respectively, at the same time to cooperatively open the MB probe hairpin structure. P strand has only 9-10 nucleotides complementary to the analyte. Therefore, a single base mismatch would destabilize the hybridization between the P strand and the analyte. That is, without P strand, which has a complementary nucleotide to the SNV of the analyte, the fluorophore and quencher are still in proximity even in the presence of R strand and the analyte, and as an MB probe is folded, only basal level of fluorescence is generated. However, when P strand fully binds to the SNV of the analyte, the MB probe is unfolded and gives enough distance between the fluorophore and the quencher to allow the fluorescence to be generated, which allows for the detection of the desired sequence. Indeed, OWL1 sensor differentiated SNVs in analytes in the entire range of 5-32° C., producing only the background florescence.
Importantly, at least in part, this unprecedented SNV selectivity was attributed to the unique rigid structure of the OWL1 complex: for the generation of fluorescence signals, both P and R strands must fold in circular forms with 3′ and 5′ terminal base pairs being in stacking interactions with each other, i.e., ‘locked’ ends in which the 5′-terminus and 3′ terminus of a strand are parallelly forming base pairs with opposing nucleotides on the oligonucleotide of an MB probe. This feature makes OWL1 sensor structurally constrained and less tolerant to mismatches in comparison with other hybridization probes that possess ‘unlocked’ ends (e.g., MB probe in
In addition, OWL1 sensor has another advantage that for the adjustment of OWL1 sensor to a new analyte, only analyte binding regions of P and R strands are to be changed, while the same MB probe can be used for analysis of any nucleotide sequences. This opens an opportunity to optimize only one universal MB (UMB) probe for use in the analysis of SNVs in multiple analytes. This reduces the optimization efforts and the assay cost in comparison with MB probe approach if multiple analytes are to be detected.
However, OWL1 sensor structure is too ‘fragile’ to form a complex with RNA or ssDNA analytes which are folded in stable secondary structures. Therefore, a new application approach of OWL sensor system is needed, especially for folded nucleic acids.
To this end, a new OWL sensor system, OWL2 sensor (
The oligonucleotide R strand comprises: (i) at each of a 5′-terminus and 3′-terminus thereof, a region having a nucleotide sequence of 4-6 nucleotides, respectively, preferably a total of 10 nucleotides, which is complementary to a nucleotide sequence of a first region of an oligonucleotide of a UMB probe; and (ii) between the 5′-terminus and 3′-terminus thereof, a region having a nucleotide sequence of 8-12 nucleotides, preferably 10 nucleotides, which is complementary to a nucleotide sequence of a first region of an analyte. However, in OWL2 sensor, the R strand is covalently linked to the T3 strand via at least one linker or is extended from the T3 strand without any linker, i.e., the T3 strand can be connected at its 3′-terminus to the 5′ terminus of the R strand, or at its 5′-terminus to the 3′ terminus of the R strand, which is described below.
The oligonucleotide P strand comprises: (i) at each of 5′-terminus and 3′-terminus thereof, a region having a nucleotide sequence of 4-6 nucleotides, respectively, preferably a total of 9 nucleotides, which is complementary to a nucleotide sequence of a second region of the oligonucleotide of the UMB probe; and (ii) between the 5′-terminus and 3′-terminus thereof, a region having a nucleotide sequence of 8-12 nucleotides, preferably 8-9 nucleotides, which is complementary to the nucleotide sequence of a second region of an analyte, which has at least one SNV.
The UMB probe comprises (i) an oligonucleotide comprising: a first region having a nucleotide sequence complementary to the 5′-terminus and 3′-terminus of the R strand; and a second region having a nucleotide sequence complementary to the 5′-terminus and 3′-terminus of the P strand; and comprises (ii) a fluorophore at one end of the oligonucleotide and a quencher at the other end. In this disclosure, an oligonucleotide of SEQ ID NO:1 is presented as an example of an oligonucleotide of UMB probe for OWL2 sensor, however any proper length of oligonucleotides can be used for the same purpose.
The fluorophore can be a single fluorophore or combination of at least 2 fluorophores to make individual tags for different kinds of SNVs, and the quencher is selected according to the selected fluorophore.
As shown in
T1 strand is an oligonucleotide having about 50-60 nucleotides, preferably 52 nucleotides, for example, an oligonucleotide of SEQ ID NO:2 (5′-GTA TCA GTC ATT ACC AGT AGT (21 nts)-CGGAC CTAGG (10 nts) CTCT CGGT CTAG CCAC TT AAC-3′ (21 nts)), which is a nucleotide sequence complementary to T2, T3 and T4 strands. T2, T3, and T4 strands hybridize to T1 strand; for example, T2 strand hybridizes to the 21 nucleotides of the 5′-terminus of the T1 strand, T3 strand to the 10 nucleotides in the middle, and T4 strand to the 21 nucleotides of the 3′-terminus. This 5′-T1-3′/(3′-T2. T3, T4-5′) hybridization DNA fragment functions as a DNA scaffold platform in OWL2 sensor. Although in the embodiments of the disclosure, T1 strand of SEQ ID NO:2, T2 strand of 5′-ACT ACT GGT AAT GAC TGA TAC-3′ (SEQ ID NO: 49), T3 strand of 5′-CCTAG GTCCG-3′ (SEQ ID NO: 50), and T4 strand of 5′-GTT AA GTGG CTAG ACCG AGAG-3′ (SEQ ID NO: 51) are provided in Table 2 as examples, these are just exemplary nucleotide sequences, and any other nucleotide sequences with proper nucleotide length may be used for the same purpose.
In addition, any of T1, T2, T3 or T4 strands, in particular T1 strand may be biotinylated and used as an anchor to attach the T1/(T2, T3, T4) hybridization fragment and OWL2 sensor to a streptavidin-coated substrate or beads for microarray analysis or individual SNV analysis.
As illustrated in
T2 strand comprises at its 3′-end an oligonucleotide arm (T2 arm), wherein the T2 arm is optionally connected via a linker such as at least one tri-thymidine (ttt) or hexaethylene glycol linker, and the T2 arm hybridizes to a region on the analyte, which locates 3′ direction out of the R and P strand binding regions on the analyte.
T4 strand comprises at its 5′-terminus an oligonucleotide arm (T4 arm), wherein the T4 arm is optionally connected via a linker such as at least one tri-thymidine (ttt) or hexaethylene glycol linker, and the T4 arm hybridizes to another region on the analyte, which locates 5′ direction out of the R and P strand binding regions on the analyte.
In this structure, the R strand binding region on the analyte can be either 3′ location or 5′ location relative to the P strand binding region on the analyte.
The nucleotide sequences of T2 and T4 arms, which are to be determined according to the nucleotide sequences on the analyte, are chosen to have melting temperatures above the assay/room temperature (24° C.) and to have little or no secondary structures to ensure tight analyte association with the tile.
As a model analyte to demonstrate the function of T2 and T4 arms, synthetic analytes of tau gene, Tau60-WT and its SNVs named ‘0C’ and ‘1A’ were used.
The OWL2 sensor produced fluorescence (signal to background (S/B)) of ˜18 with Tau60-WT, but fluorescence (S/B) of basal level with Tau60 SNVs (
OWL2 sensor without T2 and T4 arms as well as OWL1 failed to detect Tau60-WT (
Interestingly, OWL2 sensor and OWL2 sensor without T2 and T4 arms produced ˜8 times lower fluorescence (S/B) with Tau18-WT that has no T2 and T4 arm binding region than with Tau60-WT, and this indicates low level of partial unfolding of UMB and incomplete binding of P and R strands to the analyte, supporting the hypothesis that the binding of T2 and T4 arms to the analyte is important in removing the structural constrain. In addition, it may be explained that T2 and T4 arms may help localizing the analyte at the right position, which overall stabilizes the bulky OWL structure.
On the other hand, OWL1 sensor produced greater fluorescence (S/B) than OWL2 sensor with Tau18-WT (
As mentioned above, T3 strand is connected to R strand, and there can be optionally a linker such as tri-thymidine or hexaethylene glycol at the connection point so as to give flexibility between those two strands since the accommodation of the fragile OWL structure of bulky DNA scaffold platform in OWL2 sensor system might be challenging due to the steric hindrance.
Since it was hypothesized that the capability of OWL structure for SNV differentiation is at least in part the consequence of the conformational strain of the small P and/or R strands folded in a circular form, the effect of the locked or open ends of the P strand is tested for achieving SNV differentiation with P strands P98 and P99, both having “locked” ends, and P strand C98 having open ends (5′-terminal nucleotides form base pairs with nucleotides on the oligonucleotide of UMB probe and 3′-terminal nucleotides with nucleotides on the analyte) (
However, the increased flexibility of R-strand by introducing a flexible iSp18 spacer between the UMB-binding and analyte-binding regions of R1010 showed little effects on the selectivity and performance of OWL structure (
Next, the effect of a gap between the P-strand and T4-arm on the function of OWL2 sensor was tested by introducing a gap.
Although OWL2 sensor having P98 strand showed that the presence of a gap caused just slightly reduced fluorescence (S/B) without influence on SNV differentiation (
To ensure that OWL2 sensor can be used for detection of various SNVs on various analytes, OWL2 sensor was redesigned for a new sequence of another analyte, Covid-19 virus (
In this embodiment, CP99-equipped OWL-2 sensor provided greater fluorescence (S/B) than CP98-equipped OWL-2 sensor (
In addition, it was shown that OWL2 sensor can differentiate an analyte which forms a single G-T mismatch at room temperature (
Based on the results described above, this disclosure provides an OWL2 sensor system as well as its use-kit and method for detecting at least one single nucleotide variation (SNV) in an analyte.
In one embodiment, a universal molecule beacon (UMB)-based hybridization sensor system is provided. The system includes
The T2 strand comprises an oligonucleotide arm (T2 arm) on the 3′-end of the T2 strand, and the T2 arm comprises 10-30 nucleotides that hybridize to a third region on the analyte at a 3′ location relative to the R and P strand binding regions on the analyte, and the T4 strand comprises an oligonucleotide arm (T4 arm) on the 5′-end of the T4 strand, wherein the T4 arm comprises 10-30 nucleotides that hybridize to a fourth region on the analyte at a 5′ location relative to the R and P strand binding regions on the analyte.
If the R strand binding region on the analyte is at a 3′ location relative to the P strand binding region on the analyte, the T2 arm binds to the third region on the analyte at a 3′ location relative to the R strand, and the T4 arm binds to the fourth region on the analyte at a 5′ location relative to the P strand; or if the R strand binding region on the analyte is at a 5′ location relative to the P strand binding region on the analyte, the T2 arm binds to the third region on the analyte at a 3′ location relative to the P strand, and the T4 arm binds to the fourth region on the analyte at a 5′ location relative to the R strand. T3 strand is connected at its 3′-terminus to the 5′ terminus of the R strand, or at its 5′-terminus to the 3′ terminus of the R strand. The R and P strands, UMB probe oligonucleotide, and an analyte form a four-stranded complex, and wherein the four-stranded complex is associated with the DNA scaffold of the T1 strand hybridized to T2, T3, T4 strands, via the T2 and T4 arms binding to the analyte and T3 strand linked to the R strand. Further, in the UMB-based hybridization sensor described above, there is no nucleotide gap on the analyte between the P strand binding region and T4 arm binding region if the R strand binding region on the analyte is 3′ location relative to the P strand binding region on the analyte; or between the P strand binding region and T2 arm binding region if the R strand binding region on the analyte is 5′ location relative to the P strand binding region on the analyte.
In another embodiment, a kit comprising the UMB-based hybridization sensor system described above and a plurality of oligonucleotide sets of T2-T2 arm, T3-R strand, T4-T4 arm, and P strand is provided, wherein the analyte binding regions of the R strand, P strand, T2 arm, and T4 arm are independently selected for specific bindings to various SNVs on various analytes.
In another embodiment, a method of identifying a single nucleotide variation in an analyte using the UMB-based hybridization sensor system describe above is provided, comprising the steps of:
Reagents. All solutions were made using DNAse/protease-free water purchased from Fisher Scientific. Synthesized oligonucleotides were obtained from Integrated DNA Technologies, Inc (Coralville, IA) and concentrations of oligonucleotide stock solutions were quantified via absorbance at 260 nm on a Thermo Scientific NanoDrop One (Waltham, MA).
Annealing of DNA scaffolds. All tiles were annealed overnight (˜8-10 h) in 2 L of water after being heated to 95° C. and boiled for 5 min. Oligonucleotides were combined in a total volume of 1 mL with a concentration of 100 nM. Unless otherwise specified, tiles were annealed with variations of T1, T2, T3, and T4, but not the P-strand in hybridization buffer 1:50 mM Tris-HCl, 50 mM MgCl2, 0.1% Tween-20, and pH 7.4.
Melt Curve Fluorescence Assays. Using DNA scaffolds previously annealed, UMB and ROX were added to the solutions to a final concentration of 25 nM, and P-strand was added to a final concentration of 200 nM. Bringing the total volume to 30 μL each using either water or analyte, the samples were then added to the 96-well plate. A control consisting of only UMB and ROX were used, in addition to a sample containing the DNA scaffold with no P-strand added, and a sample with no analyte added. After adding the samples to the plate, an optical adhesive cover was securely fitted to the top of the plate, and wells were sealed using a tool provided with the QuantStudio™ 6 Flex System. The plate was lightly flicked to eliminate bubbles and was vortexed and centrifuged for 20 s on a Fisher Scientific Mini Plate Spinner Centrifuge (Hamptom, NH). After allowing 30 min for annealing the plate was then placed into the QuantStudio™ Flex 6 system and cooled to 5° C. where it was held for 5 min. The fluorescence of the samples was then read continuously as the samples were heated from 5° C. to 70° C. (0.1° C./s). ROX was selected as a passive reference and FAM™ was read as the ‘Target’. Although the system was calibrated to account for well factors, background, and dye fluorescence, there were small variations between the background fluorescence of UMB and controls without analyte; therefore, there may be small fluorescence value variations observed depending on the date of the experiment. Data was exported to excel and subsequently to OriginLab 2021 (Northampton, MA) for data normalization and processing. The processed readings from at least three wells were averaged and plotted as a function of FFAM/FROX. The derivative of fluorescence vs time was calculated by the QuantStudio™ Real-Time PCR Software to determine the melting temperature (Tm).
Limit of Detection. The limit of detection was determined for each study by conducting fluorescence experiments using a 60 μL quartz cuvette in a Perkin-Elmer (San Jose, CA) LS-55 Fluorescence Spectrophotometer with a xenon lamp. (λlex=485 nm, λem=517 nm). The samples were used directly from annealed tiles and, to this solution, P-strand was added such that the final concentration was 200 nM and UMB was added to a final concentration of 25 nM. After the addition of analyte at varying concentrations, the samples were incubated in a 24° C. water bath for 30 min before being taken out of the bath and analyzed. Fluorescent values at 517 nm were recorded for three independent trials for each sample. The averages and standard deviations were plotted in Excel and OriginLab 2021 (Northampton, MA), and the linear region was found and fitted with an equation. The LOD was determined by using the equation with the fluorescent signal of the blank+3×Standard deviation of the blank.
Differentiation Fluorescent Assays. The differentiation of each tested sensor was determined by conducting fluorescence experiments in a similar manner to the limit of detection. For the OWL1 design, the samples were made such that Rx and Py were added to final concentrations of 150 nM and 200 nM, respectively, unless stated otherwise. The samples were incubated in hybridization buffer 1 at room temperature (24° C.) for 20 min followed by measuring fluorescence intensity at 517 nm. The differentiation factor (Fm−F0)/(Fmm−F0) was calculated by taking the Fluorescence of the Wild-Type analyte (Fm), subtracting the blank (F0), and dividing it by the fluorescence of the respective SNV-containing analyte corrected by the blank. Differentiation was calculated using the fluorescent average of at least three trials.
Kinetics Assays. Using the same experimental conditions as for the other fluorescent assays, the fluorescence was measured over 45 minutes on a Cary Agilent Fluorimeter for 45 min. The OWL2 sensor and P-strand were mixed, analyte was added, and fluorescence was read immediately afterwards.
To overcome the limitations of the OWL1 sensor, OWL2 sensor was designed (
As a model analyte for initial optimization of the OWL2 sensor, SNV ‘0C’ and ‘1A’ (
The secondary structures of synthetic fully matched analytes Tau60-WT and Tau18-WT are shown in
The binding site of the P strand was chosen such that the two SNV sites corresponded to the middle positions of the strand for best SNV differentiation.33 The analyte-binding site for the R strand, adjacent to the P strand binding site, formed a 10 base-pair (bp) duplex with the analyte and enabled both UMB-binding arms of the R strand to be positioned on the same side of the B DNA helix as needed for the formation of OWL structure.33 The T2- and T4-arms were chosen to have melting temperatures above the assay temperature (24° C.) and to have little or no secondary structures to ensure tight analyte association with the OWL nanostructure.
We optimized the concentration and sequences of the P strand to produce the highest signal-to-background ratio (S/B) and the greatest SNV differentiation (see details below). The optimal P strand had 9 nts and 8 nts complementary to the UMB probe and the analyte, respectively, and was, therefore named P98. It was used at the concentration of 200 nM, which provided the highest S/B (
Next, it was demonstrated that each feature of the OWL2 sensor contributes to at least one of the following functions: (1) enabling detection of folded analytes and (2) accurate discrimination of SNVs, (3) maintaining detection efficiency and selectivity over a range of ambient and low temperatures, and (4) ensuring low reagent cost due to “universality” of the UMB probe.
To overcome the limitations of OWL1 sensor, OWL2 sensor was designed (
As a model analyte, SNVs named ‘0C’ and ‘1A’ (
The binding site of P strand was chosen such that the two SNV sites were in the middle positions of the strand for best SNV differentiation.17,18 Binding site for R strand located next to P strand to form 10 bp duplex with the analyte. The 10 bp binding site satisfies the requirement for both UMB binding arm to face the same side of the analyte/strand R double stranded hybrid to allow UMB binding.13 T2 and T4 arms were chosen to have melting temperatures above the assay temperature (24° C.) and to have little or no secondary structures to ensure tight analyte association with the OWL tile.
The concentrations of UMB probe and P strand (data not shown) were optimized, followed by optimization of the sequence of P strand to produce the highest signal-to-background ratio (S/B) and the greatest SNV differentiation (see details below). The optimal P strand had 8 nucleotides complementary to the analyte and two arms totaling 9 nucleotides complementary to UMB probe thus named P98. The optimized OWL2 sensor (
Interesting to note that Tau18-WT lacking the fragments for binding T2 and T4 arms was detected with S/B ˜8 times lower than Tau60-WT. This suggests that the important function of T2 and T4 is not only in removing the structural constrain created by the Tau60-WT secondary structure, but also in localizing the analyte next to R strand, which overall stabilizes the OWL structure. On the other hand, OWL1 produced greater S/B than OWL2 sensor in complex with Tau18-WT (
The removal of the T2 and T4 arms resulted in a loss of the OWL2 ability to detect folded Tau60-WT (
On the other hand, OWL1 in complex with Tau18-WT produced a greater S/B than OWL2 lacking sensor T2 and T4 (“OWL2 no arms”) (
Positioning of the fragile OWL structure near a bulky DNA scaffold formed by T1, T2, T3 and T4 in the OWL2 sensor might be challenging due to steric hindrance, which is hard to predict without knowing the crystal structure of the OWL2 sensor. The nature of the linker between the R strand and the scaffold-forming fragment of the T3 strand was varied ranging from the least flexible regular phosphodiester (PDE) to more flexible tri-thymidine (ttt), and the most flexible hexaethylene glycol (iSp18) linker (
Following from our previous results,33 it was hypothesized that the unprecedented SNV differentiation, at least in part, is the consequence of the conformational strain by the OWL structure and the locked ends (
First, the optimal P98 strand was redesigned to have opened ends, named C98 strand (
Next, the effect of the flexibility of R-strand on the selectivity and S/B was tested. For this purpose, an iSp18 spacer was introduced between the UMB-hybridizing and analyte-hybridizing regions of R1010 near its 5′-end (
Indeed, the constrained and rigid nature of the SNV-selective P-strand contributes the most to differentiation of WT from the mutants. By designing the P-strand with locked 5′- and 3′-ends in complex with UMB, a conformational strain was created that is unable to remain stable unless all 8 base pairs are complementary to the analyte. In the presence of a mis match, the strain experienced by P98 is great enough to inhibit P-strand hybridization to the analyte, which decomposes the OWL complex. If there are no mismatches, the P-strand is stabilized by the 8 base pairs complementary to the analyte, the stress of the conformational strain is insufficient to cause dissociation of the P-strand, and the scaffolding for UMB hybridization is complete.
Previous studies have shown that the stability of multistranded DNA complexes are affected by the distance between adjacent DNA strands hybridized to a complementary nucleic acid.38-40 Therefore, a single nucleotide gap was introduced between the P-strand and T4-arm. The introduced gap did not significantly affect the S/B or DF of the optimal sensor containing P98 (
It was interesting to investigate if excellent selectivity of the OWL2 sensor allows detecting the matched analyte in the presence of excess amounts of a mismatched analyte. This capability of the sensor would be useful for detecting small fractions of cancerous DNA in an excess amount of healthy DNA for early-stage cancer diagnosis.22 The LOD of the fully matched Tau60-WT analyte was measured with the optimal OWL2 sensor in the presence of 50 nM Tau60-0C as a buffer component (
G-T mismatches are known to be the least destabilizing of all base-mispairing scenarios and, therefore, the most challenging to discriminate.42,43 Here, it was investigated if the OWL2 sensor is capable of differentiating an analyte that forms a single G-T mismatch with the sensor. It was found that P98 has Df of 0.45 when tested against the Tau60-2G analyte, which has an A >G substitution (
Discrimination using the original P98 was expectedly poor since the G-T was situated between the two stable G-C base pars and shifted from the middle of the stand P-analyte hybrid. Mismatches on the ends of hybridization sites are known to be less destabilizing than those in the center. 32.33 Expectedly, the mismatches closer to the center (P98 A >G and P98 C >T) were better discriminated. However, P98 C >T had a greater A/T content, which possibly led to the best discrimination of the three. It was shown that, through modification of the P-strand, it was possible to differentiate even G-T mismatches, with the best discriminating ability of the sensors containing G-T mismatch in the middle position of stand P/analyte complex and when flanked by A-T base pairs (
Since the characteristics of RNA/DNA helical structure are somewhat different due to the difference in ribose and deoxyribose conformation,44 it was investigated if the same OWL-2 sensor that performs well with DNA analytes is suitable for detecting an RNA analyte. It was found that the OWL2 sensor equipped with P98 strand was able to detect Tau60-WT RNA at a LOD of 0.8 nM, which is comparable to the 0.4 nM LOD of Tau-60 DNA. The ability of the OWL sensor to detect RNA may have practical significance since Tau-60 DNA is associated with the development of Alzheimer's disease.35,36
To ensure that OWL2 sensor can be easily redesigned for other analytes, it was applied to a sequence from the Covid-19-causing SARS-CoV2 virus). By only changing the analyte-binding portions of T2, T3, T4, i.e., T2-arm, R strand, and T4-arm sequences, and P-strand (named CP-strand for Covid-19), it was possible to show that both the CP98 and CP99 allowed for differentiation of the fully matched CVD60-WT from the mismatched CVD60-1C and CVD60-0G analytes (
Overall, these results show that the OWL2 design can be easily adapted to detect another analyte without the need for costly changes. The cost of one nucleotide addition in IDT Inc. is $0.42 (minimum synthetic scale), which comes to 56.7 USD for adaptation of T2, T3, T4, and P-strand to each new analyte. At the same time, the cost of a new MB probe is ˜350 USD (minimum synthetic scale) due to the need for conjugation of the oligonucleotide with two dyes and double HPLC purification. Additionally, the design of an MB probe for each new analyte is known to be associated with many problems, such as stem invasion and loop interference, to the degree that it is impossible to design an efficient MB probe for some analytes.28,32 By designing the UMB-hybridizing regions of R- and P strands to be independent of the analyte sequence, it is allowed for the UMB technology to be applied to analytes of potentially any sequence. Furthermore, it was shown that OWL2 design is applicable to both DNA and RNA analytes which contain an SNV in both the stem and the loop regions (
GTTG CAC ACT GCC GATTG
GTTG CAC ACT GCC ATTG
GTTG CAC ACT GC GATTG
GGTTG CAC ACT GC CGATT
GGTTG CAC ACT GCC CGATT
GTTG CAC ACT GC ATTG
GTTG CAC ACT G GATTG
GTTG AGTA AACGA GATTG
GTTG AGTA AACG GATTG
This invention was made with government support under cooperative agreement SHF-1907824 and CCF: SHF-2226021 awarded by National Science Foundation through the CCF: Software and Hardware Foundations. The government has certain rights in the invention.
| Number | Date | Country | |
|---|---|---|---|
| 63523420 | Jun 2023 | US |