The Sequence Listing associated with this application is provided in text format in lieu of a paper copy, and is hereby incorporated by reference into the specification. The name of the text file containing the Sequence Listing is P36231-US-3_SEQ_LIST_ST25. The text file is 3 KB, was created on Aug. 4, 2022, and is being submitted electronically via EFS-Web.
This patent application is a continuation of International Patent Application No. PCT/US2020/032950, filed May 14, 2020, which claims priority to and the benefit of United States Provisional Application No. U.S. 62/852,262, filed May 23, 2019, United States Provisional Application No. U.S. 62/877,183, filed Jul. 22, 2019 and U.S. Provisional Application No. 62/885,746 filed Aug. 12, 2019. Each of the above patent applications is incorporated herein by reference as if set forth in its entirety.
The present invention relates generally to new synthetic reporter constructs, more specifically to new nucleotide-free, phosphoramidite-based translocation control elements, reporter codes and other features that generate unique signals when passed through a nanopore, and methods for the manufacture and utilization thereof, particularly in nanopore-based polymer sequencing methods.
Measurement of biomolecules is a foundation of modern medicine and is broadly used in medical research, and more specifically in diagnostics and therapy, as well in drug development. Nucleic acids encode the necessary information for living things to function and reproduce, and are essentially a blueprint for life. Determining such blueprints is useful in pure research as well as in applied sciences. In medicine, sequencing can be used for diagnosis and to develop treatments for a variety of pathologies, including cancer, heart disease, autoimmune disorders, multiple sclerosis, and obesity. In industry, sequencing can be used to design improved enzymatic processes or synthetic organisms. In biology, this tool can be used to study the health of ecosystems, for example, and thus have a broad range of utility. Similarly, measurement of proteins and other biomolecules has provided markers and understanding of disease and pathogenic propagation.
An individual's unique DNA sequence provides valuable information concerning their susceptibility to certain diseases. It also provides patients with the opportunity to screen for early detection and/or to receive preventative treatment. Furthermore, given a patient's individual blueprint, clinicians will be able to administer personalized therapy to maximize drug efficacy and/or to minimize the risk of an adverse drug response. Similarly, determining the blueprint of pathogenic organisms can lead to new treatments for infectious diseases and more robust pathogen surveillance. Low cost, whole genome DNA sequencing will provide the foundation for modern medicine. To achieve this goal, sequencing technologies must continue to advance with respect to throughput, accuracy, and read length.
Over the last decade, a multitude of next generation DNA sequencing technologies have become commercially available and have dramatically reduced the cost of sequencing whole genomes. These include sequencing by synthesis (“SBS”) platforms (Illumina, Inc., 454 Life Sciences, Ion Torrent, Pacific Biosciences) and analogous ligation based platforms (Complete Genomics, Life Technologies Corporation). A number of other technologies are being developed that utilize a wide variety of sample processing and detection methods. For example, GnuBio, Inc. (Cambridge, Mass.) uses picoliter reaction vessels to control millions of discreet probe sequencing reactions, whereas Halcyon Molecular (Redwood City, Calif.) was attempting to develop technology for direct DNA measurement using a transmission electron microscope.
Nanopore based nucleic acid sequencing is a compelling approach that has been widely studied. Kasianowicz et al. (Proc. Natl. Acad. Sci. USA 93: 13770-13773, 1996) characterized single-stranded polynucleotides as they were electrically translocated through an alpha hemolysin nanopore embedded in a lipid bilayer. It was demonstrated that during polynucleotide translocation partial blockage of the nanopore aperture could be measured as a decrease in ionic current. Polynucleotide sequencing in nanopores, however, is burdened by having to resolve tightly spaced bases (0.34 nm) with small signal differences immersed in significant background noise. The measurement challenge of single base resolution in a nanopore is made more demanding due to the rapid translocation rates observed for polynucleotides, which are typically on the order of 1 base per microsecond. Translocation speed can be reduced by adjusting run parameters such as voltage, salt composition, pH, temperature, and viscosity, to name a few. However, such adjustments have been unable to reduce translocation speed to a level that allows for single base resolution.
Stratos Genomics has developed a method called Sequencing by Expansion (“SBX”) that uses a biochemical process to transcribe the sequence of DNA onto a measurable polymer called an “Xpandomer” (Kokoris et al., U.S. Pat. No. 7,939,259, “High Throughput Nucleic Acid Sequencing by Expansion”). The transcribed sequence is encoded along the Xpandomer backbone in high signal-to-noise reporters that are separated by ˜10 nm and are designed for high-signal-to-noise, well-differentiated responses. These differences provide significant performance enhancements in sequence read efficiency and accuracy of Xpandomers relative to native DNA. Xpandomers can enable several next generation DNA sequencing detection technologies and are well suited to nanopore sequencing.
Nanopores have proven to be powerful amplifiers, much like their highly-exploited predecessors, Coulter Counters. However, the current generation of organic nanopores (such as Hemolysin and MspA), that have been tasked with base recognition of DNA, are transmembrane proteins that do not interact with DNA in nature. They do not have natural functions for controlling DNA translocation. This is a recognized shortcoming that some have attempted to correct by adding functionality with protein motors adjacent to the nanopores. For example, Akeson's group added phi 29 polymerase adjacent to the alpha-hemolysin nanopore so that ss-DNA could be fed into the pore at a controlled rate (see G. M. Cherf et al. “Automated forward and reverse ratcheting of DNA in a nanopore at 5-A precision,” Nat Biotech, vol. advance online publication, February 2012). This approach complicates the assay and imposes a separation of the measurement region in the alpha hemolysin from the position control in the polymerase that can introduce additional noise and sequence dependent variation to the measurement.
In another approach, referred to as translocation control by hybridization (TCH), a nanopore translocation event is paused by using a structure created by hybridization, which disassociates for translocation to proceed (see, e.g. U.S. Pat. No. 10,457,979 to McRuer and Kokoris). Akeson et al. (U.S. Pat. No. 6,465,193) first demonstrated this by pausing DNA translocation with sequential hairpin duplexed regions. Translocation stopped at the duplex because it was larger than the alpha-hemolysin nanopore aperture. When the duplex released due to stochastic thermal fluctuation, translocation proceeded to the next duplex. During each pause, the region of DNA located in the nanopore (adjacent to the duplex) could be measured and identified. When applied to nanopore sequencing, this duplexing approach to translocation control suffers from limitations, including incomplete duplex formation, or hybridization fill rate, and the stochastics of duplex dissociation, which can lead to deletions or insertions events. Insertion and deletions that cannot be localized can seriously degrade the data quality.
While significant advances have been made in this field, commercially viable implementation of translocation control with, for example, Xpandomers, would benefit from improvements that overcome limitations caused by duplexing. The present invention fulfills these needs and provides further related advantages as discussed below.
All of the subject matter discussed in the Background section is not necessarily prior art and should not be assumed to be prior art merely as a result of its discussion in the Background section. Along these lines, any recognition of problems in the prior art discussed in the Background section or associated with such subject matter should not be treated as prior art unless expressly stated to be prior art. Instead, the discussion of any subject matter in the Background section should be treated as part of the inventor's approach to the particular problem, which in and of itself may also be inventive.
In brief, compounds (e.g., XNTPs) including polymeric reporter and linker constructs synthesized from a collection of novel phosphoramidite monomeric units and methods are disclosed for improved nanopore sequencing (for example, generating sequences of higher read length, accuracy, and/or throughput) of polymeric analytes (e.g., Xpandomers).
In some embodiments, the polymeric constructs may be designed to completely lack nucleotides.
In one aspect, the present disclosure provides a compound (i.e., an XNTP) having the following structure:
wherein R is OH or H; nucleobase is adenine, cytosine, guanine, thymine, uracil or a nucleobase analog; reporter construct is a polymer having a first end and a second end, and includes, in series from the first end to the second end, a first reporter code, a symmetrical chemical brancher bearing a translocation control element, and a second reporter code; linker A joins the oxygen atom of the alpha phosphoramidate to the first end of the reporter construct; linker B joins the nucleobase to the second end of the reporter construct; and in which the translocation control element is a polymer as described below.
In one embodiment, the translocation control element is a polymer comprising two or more repeat units selected from: 1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b), 1,3-O-bis(phosphodiester)-2-(4-Me-O-PEG3)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35b), 1,3-O-bis(phosphodiester)-2s-O-mPEG6-propane (compound 12c), 1,2-O-bis(phosphodiester)-3-O-mPEG2-propane (compound 16), 2,3-O-bis(phosphodiester)-1-(5-benzofuran)-propane (compound 20i), 1,2-O-bis(phosphodiester)-3-(4-methylpiperazine-1-yl)-propane (compound 20j), 2,3-O-bis(phosphodiester)-1-(N1-(2-Me-5-nitroindole)-propane (compound 20g), 1,8-O-bis(phosphodiester)-N,N-Diethylpiperazine (compound 26h), 1,2-O-bis(phosphodiester)-3-(4-(Me-O-PEG3-O-Bz)-1-(1,2,3-triazole))-propane (compound 31d), 1,3-O-bis(phosphodiester)-2s-O-(4-(Me-O-PEG2)-1-(Et-OBz)-1,2,3-triazole)-propane (compound 35a), 1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35c), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG7)-1-(Et-OBz)-1,2,3-triazole)-propane (compound 35d), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG3)-1-(Me-acetate)-1,2,3-triazole)-propane (compound 35e), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG3)-1-(Et-2,2,2-Tris-(Me-O-Bz))-1,2,3-triazole)-propane (compound 37a), 1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-2,2,2-Tris-(Me-O—Ac))-1,2,3-triazole)-propane (compound 37b), 1,3-O-bis(phosphodiester-2S—O-(PEG4-O-Bz)-propane (compound 38b), 1,3-O-bis(phosphodiester-2,2-bis(Me-O-mPEG2)-propane (compound 45b), 1,3-O-bis(phosphodiester-2,2-bis(4-(Me-O-PEG2-O-Me)-1-(Et-O-Bz)-1,2,3-triazole)-propane (compound 47f), 1,3-O-bis(phosphodiester-2,2-bis(4-(Me-O-PEG3-O-Me)-1-(Et-O-Bz)-1,2,3-triazole)-propane (compound 47Gg, 1,3-O-bis(phosphodiester-2,2-bis(4-(Me-O-PEG3-O-Me)-1-(Et-2,2,2-Tris-(Me-O-Bz))-1,2,3-triazole)-propane (compound 47i), or 1,3-O-bis(phosphodiester-2,2-bis(1-Me-4-(Me-O-PEG2-O-Bz)-1,2,3-triazole)-propane (compound 52).
In some embodiments, R is OH.
In some embodiments, R is H.
In other embodiments, nucleobase is adenine, cytosine, guanine, thymine, or uracil.
In other embodiments, nucleobase is a nucleobase analog.
In other embodiments, the symmetrical chemical brancher is 1,2,3-O-tris-(phosphosphodiester)-propane, 1,3-bis-(5-O-phosphodiester-pentylamido)-2-O-phosphodiester-propane, or 1,4,7-O-tris-(phosphodiester)-heptane.
In other embodiments, the symmetrical chemical brancher is 1,2,3-O-tris-(phosphosphodiester)-propane.
In other embodiments, the translocation control element is a polymer comprising two or more repeat units selected from Table 1A.
In other embodiments, the translocation control element is a polymer comprising two or more repeat units selected from 1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b) and 1,3-O-bis(phosphodiester)-2-(4-Me-O-PEG3)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35b).
In yet other embodiments, the translocation control element is a polymer comprising the following sequence: [(1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b))]n1[(1,3-O-bis(phosphodiester)-2-(4-Me-O-PEG3)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35b))]n2, wherein n1 is from 0 to 6 and n2 is from 6 to 10.
In other embodiments, the first and second reporter codes are identical.
In further embodiments, the first and second reporter codes are polymers comprising two or more repeat units selected from: hexaethylene glycol (D), ethane (L), triaethylene glycol (X), 1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b), 1,3-O-bis(phosphodiester)-2-(4-Me-O-PEG3)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35b), 1,3-O-bis(phosphodiester-2,2-bis(Me-O-mPEG2)-propane (compound 45b), 1,3-O-bis(phosphodiester-2S—O-(PEG4-O-Bz)-propane (compound 38b), 1,3-O-bis(phosphodiester)-2s-O-mPEG6-propane (compound 12c), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG3)-1-(Et-2,2,2-Tris-(Me-O-Bz))-1,2,3-triazole)-propane (compound 37a), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG3)-1-(Me-acetate)-1,2,3-triazole)-propane (compound 35e), 1,3-O-bis(phosphodiester)-2s-O-(4-(Me-O-PEG2)-1-(Et-OBz)-1,2,3-triazole)-propane (compound 35a), 1,3-O-bis(phosphodiester)-2-(4-Et-1-(Et-O-mPEG1)-1,2,3-triazole)-propane (compound 31a), 2,3-O-bis(phosphodiester)-1-(1 dimethoxyquinazolinedione)-propane (compound 20c), 2,3-O-bis(phosphodiester)-1-(N9-(3,6-dimethoxycarbazole)-propane (compound 20e), 1,1′-O-bis(phosphodiester)-2,2′-(sulfonylbis(benz-4-yl))-diethanol (compound 26d), 1,1′-O-bis(phosphodiester)-2,2′-bipyridin-4,4′-yl)-dimethanol (compound 26a), 2,3-O-bis(phosphodiester)-1-(N1-(4,6-dimethoxy-3-Me-indole)-propane (compound 20b), 3-(1,2-O-bis(phosphodiester)-propyl)-8,8-dimethylhexahydro-3H-3a,6-methanobenzo[c]isothiazole 2,2-dioxide (compound 20d), 2,3-O-bis(phosphodiester)-1-(N1-(6-Azathymine))-propane (compound 20f), 1,5-O-bis(phosphodiester)-hexahydrofuro[2,6]furan (compound 23), 1,1′-O-bis(phosphodiester)-octahydro-2,6-dimethyl-3,8:4,7-dimethano-2,6-naphthyridin-4,8-diyl)-dimethanol (compound 26e), 2,3-O-bis(phosphodiester)-1-(N1-(2-Me-5-nitroindole)-propane (compound 20h), 2,3-O-bis(phosphodiester)-1-(N1-(2-Me-5-nitroindole)-propane (compound 20g), 2,3-O-bis(phosphodiester)-1-(5-benzofuran)-propane (compound 20i), 1,2-O-bis(phosphodiester)-3-O-mPEG2-propane (compound 5b), 1,3-O-bis(phosphodiester)-2-(4-Et-1-(Et-O-mPEG3)-1,2,3-triazole)-propane (compound 31b), and 1,3-O-bis(phosphodiester)-3-O-mPEG4-propane (compound 5a).
In other embodiments, the first and second reporter codes are polymers comprising two or more repeat units selected from hexaethylene glycol, ethane, triaethylene glycol, and any of the compounds set forth in Table 1A.
In further embodiments, the first and second reporter codes are polymers comprising two or more repeat units selected from hexaethylene glycol, ethane, triaethylene glycol, and 1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b).
In more specific embodiments, the first and second reporter codes are polymers comprising a sequence selected from: (i) [(hexaethylene glycol)2(ethane)3(hexaethylene glycol)(triaethylene glycol)], (ii) [(hexaethyleneglycol)2(1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b))2(ethane)(triaethylene glycol)3], (iii) [(hexaethylene glycol)2(1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b))3(ethane)2(hexaethylene glycol)(triaethylene glycol)], and (iv) [(triaethylene glycol)2(ethane)(1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b))6(ethane)7].
In other embodiments, Linker A and Linker B are polymers comprising two or more repeat units selected from: spermine (Q), hexaethylene glycol (D), 2-((4-((3-(benzoyloxy)-2-(((1-(3-(benzoyloxy)-2-((benzoyloxy)methyl)-2-((phosphodiester-oxy)methyl)propyl)-1H-1,2,3-triazol-4-yl)methoxy)methyl)-2-((benzoyloxy)methyl)propoxy)methyl)-1H-1,2,3-triazol-1-yl)methyl)-2-O-phosphodiester-propane-1,3-diyl dibenzoate (compound 62), 1,3-O-bis(phosphodiester-2,2-bis(1-Me-4-(Me-O-PEG2-O-Bz)-1,2,3-triazole)-propane (compound 52), 1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35c), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG7)-1-(Et-OBz)-1,2,3-triazole)-propane (compound 35d), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG3)-1-(Et-2,2,2-Tris-(Me-O-Bz))-1,2,3-triazole)-propane (compound 37a), 1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-2,2,2-Tris-(Me-O—Ac))-1,2,3-triazole)-propane (compound 37b), 1,2-O-bis(phosphodiester)-3-(4-(Me-O-PEG3-O-Bz)-1-(1,2,3-triazole))-propane (compound 31d), 1,3-O-bis(phosphodiester-2,2-bis(4-(Me-O-PEG2-O-Me)-1-(Et-O-Bz)-1,2,3-triazole)-propane (compound 47f), 1,3-O-bis(phosphodiester-2,2-bis(4-(Me-O-PEG3-O-Me)-1-(Et-2,2,2-Tris-(Me-O-Bz))-1,2,3-triazole)-propane (compound 47i), 1,2-O-bis(phosphodiester)-3-(4-methylpiperazine-1-yl)-propane (compound 20j), 1,3-O-bis(phosphodiester-2,2-bis(4-(Me-O-PEG3-O-Me)-1-(Et-O-Bz)-1,2,3-triazole)-propane (compound 47g), and 1,1′-O-bis(phosphodiester)-N(p-tolyl)-diethanolamine (compound 26b).
In other embodiments, Linker A and Linker B are polymers comprising two or more repeat units selected from spermine and any of the compounds set forth in Table 1A.
In yet other embodiments, Linker A and Linker B comprise a polymerase enhancement region comprising two repeat units of spermine.
In further embodiments, Linker A and Linker B comprise a translocation deceleration region comprising two or more repeat units selected from: 1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35c), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG7)-1-(Et-OBz)-1,2,3-triazole)-propane (compound 35d), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG3)-1-(Et-2,2,2-Tris-(Me-O-Bz))-1,2,3-triazole)-propane (compound 37a), and 1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-2,2,2-Tris-(Me-O—Ac))-1,2,3-triazole)-propane (compound 37b).
In more specific embodiments, Linker A and Linker B comprise a translocation deceleration region comprising a polymer selected from: (i) [((hexaethylene glycol) (1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35c))3(hexaethylene glycol)2], (ii) [((hexathylene glycol)(1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35c))4(hexaethylene glycol)2], (iii) [((hexathylene glycol)(1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG7)-1-(Et-OBz)-1,2,3-triazole)-propane (compound 35d))4(hexaethylene glycol)2], and (iv) [((hexathylene glycol)(1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-2,2,2-Tris-(Me-O—Ac))-1,2,3-triazole)-propane (compound 37b))4(hexaethylene glycol)2].
In other embodiments, Liker A is joined to the oxygen atom of the alpha phosphoramidate by a linkage comprising a triazole and Liker B is joined to the nucleobase by a linkage comprising a triazole.
In another aspect, the present invention provides a reporter construct comprising a polymer having a first end and a second end, and including in series from the first end to the second end a first reporter code, a symmetrical chemical brancher bearing a translocation control element, and a second reporter code; and in which the translocation control element is a polymer comprising two or more repeat units selected from: 1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b), 1,3-O-bis(phosphodiester)-2-(4-Me-O-PEG3)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35b), 1,3-O-bis(phosphodiester)-2s-O-mPEG6-propane (compound 12c), 1,2-O-bis(phosphodiester)-3-O-mPEG2-propane (compound 16), 2,3-O-bis(phosphodiester)-1-(5-benzofuran)-propane (compound 20i), 1,2-O-bis(phosphodiester)-3-(4-methylpiperazine-1-yl)-propane (compound 20j), 2,3-O-bis(phosphodiester)-1-(N1-(2-Me-5-nitroindole)-propane (compound 20g), 1,8-O-bis(phosphodiester)-N,N-Diethylpiperazine (compound 26h), 1,2-O-bis(phosphodiester)-3-(4-(Me-O-PEG3-O-Bz)-1-(1,2,3-triazole))-propane (compound 31d), 1,3-O-bis(phosphodiester)-2s-O-(4-(Me-O-PEG2)-1-(Et-OBz)-1,2,3-triazole)-propane (compound 35a), 1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35c), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG7)-1-(Et-OBz)-1,2,3-triazole)-propane (compound 35d), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG3)-1-(Me-acetate)-1,2,3-triazole)-propane (compound 35e), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG3)-1-(Et-2,2,2-Tris-(Me-O-Bz))-1,2,3-triazole)-propane (compound 37a), 1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-2,2,2-Tris-(Me-O—Ac))-1,2,3-triazole)-propane (compound 37b), 1,3-O-bis(phosphodiester-2S—O-(PEG4-O-Bz)-propane (compound 38b), 1,3-O-bis(phosphodiester-2,2-bis(Me-O-mPEG2)-propane (compound 45b), 1,3-O-bis(phosphodiester-2,2-bis(4-(Me-O-PEG2-O-Me)-1-(Et-O-Bz)-1,2,3-triazole)-propane (compound 47f), 1,3-O-bis(phosphodiester-2,2-bis(4-(Me-O-PEG3-O-Me)-1-(Et-O-Bz)-1,2,3-triazole)-propane (compound 47Gg, 1,3-O-bis(phosphodiester-2,2-bis(4-(Me-O-PEG3-O-Me)-1-(Et-2,2,2-Tris-(Me-O-Bz))-1,2,3-triazole)-propane (compound 47i), or 1,3-O-bis(phosphodiester-2,2-bis(1-Me-4-(Me-O-PEG2-O-Bz)-1,2,3-triazole)-propane (compound 52).
In some embodiments, the symmetrical chemical brancher is 1,2,3-O-tris-(phosphosphodiester)-propane, 1,3-bis-(5-O-phosphodiester-pentylamido)-2-O-phosphodiester-propane, or 1,4,7-O-tris-(phosphodiester)-heptane.
In another embodiment, the symmetrical chemical brancher is 1,2,3-O-tris-(phosphosphodiester)-propane.
In other embodiments, the translocation control element is a polymer comprising two or more repeat units selected from Table 1A.
In yet other embodiments, the translocation control element is a polymer comprising two or more repeat units selected from 1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b) and 1,3-O-bis(phosphodiester)-2-(4-Me-O-PEG3)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35b).
In further embodiments, the translocation control element is a polymer comprising the following sequence: [(1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b))]n1[(1,3-O-bis(phosphodiester)-2-(4-Me-O-PEG3)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35b))]n2, wherein n1 is from 0 to 6 and n2 is from 6 to 10.
In other embodiments, the first and second reporter codes are identical.
In some embodiments, the first and second reporter codes are polymers comprising two or more repeat units selected from: hexaethylene glycol (D), ethane (L), triaethylene glycol (X), 1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b), 1,3-O-bis(phosphodiester)-2-(4-Me-O-PEG3)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35b), 1,3-O-bis(phosphodiester-2,2-bis(Me-O-mPEG2)-propane (compound 45b), 1,3-O-bis(phosphodiester-2S—O-(PEG4-O-Bz)-propane (compound 38b), 1,3-O-bis(phosphodiester)-2s-O-mPEG6-propane (compound 12c), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG3)-1-(Et-2,2,2-Tris-(Me-O-Bz))-1,2,3-triazole)-propane (compound 37a), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG3)-1-(Me-acetate)-1,2,3-triazole)-propane (compound 35e), 1,3-O-bis(phosphodiester)-2s-O-(4-(Me-O-PEG2)-1-(Et-OBz)-1,2,3-triazole)-propane (compound 35a), 1,3-O-bis(phosphodiester)-2-(4-Et-1-(Et-O-mPEG1)-1,2,3-triazole)-propane (compound 31a), 2,3-O-bis(phosphodiester)-1-(1 dimethoxyquinazolinedione)-propane (compound 20c), 2,3-O-bis(phosphodiester)-1-(N9-(3,6-dimethoxycarbazole)-propane (compound 20e), 1,1′-O-bis(phosphodiester)-2,2′-(sulfonylbis(benz-4-yl))-diethanol (compound 26d), 1,1′-O-bis(phosphodiester)-2,2′-bipyridin-4,4′-yl)-dimethanol (compound 26a), 2,3-O-bis(phosphodiester)-1-(N1-(4,6-dimethoxy-3-Me-indole)-propane (compound 20b), 3-(1,2-O-bis(phosphodiester)-propyl)-8,8-dimethylhexahydro-3H-3a,6-methanobenzo[c]isothiazole 2,2-dioxide (compound 20d), 2,3-O-bis(phosphodiester)-1-(N1-(6-Azathymine))-propane (compound 20f), 1,5-O-bis(phosphodiester)-hexahydrofuro[2,6]furan (compound 23), 1,1′-O-bis(phosphodiester)-octahydro-2,6-dimethyl-3,8:4,7-dimethano-2,6-naphthyridin-4,8-diyl)-dimethanol (compound 26e), 2,3-O-bis(phosphodiester)-1-(N1-(2-Me-5-nitroindole)-propane (compound 20h), 2,3-O-bis(phosphodiester)-1-(N1-(2-Me-5-nitroindole)-propane (compound 20g), 2,3-O-bis(phosphodiester)-1-(5-benzofuran)-propane (compound 20i), 1,2-O-bis(phosphodiester)-3-0-mPEG2-propane (compound 5b), 1,3-O-bis(phosphodiester)-2-(4-Et-1-(Et-O-mPEG3)-1,2,3-triazole)-propane (compound 31b), and 1,3-O-bis(phosphodiester)-3-O-mPEG4-propane (compound 5a).
In other embodiments, the first and second reporter codes are polymers comprising two or more repeat units selected from hexaethylene glycol, ethane, triaethylene glycol, and any of the compounds set forth in Table 1A.
In further embodiments, the first and second reporter codes are polymers comprising two or more repeat units selected from hexaethylene glycol, ethane, triaethylene glycol, and 1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b).
In further embodiments, the first and second reporter codes are polymers comprising a sequence selected from: (i) [(hexaethylene glycol)2(ethane)3(hexaethylene glycol)(triaethylene glycol)], (ii) [(hexaethyleneglycol)2(1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b))2(ethane)(triaethylene glycol)3], (iii) [(hexaethylene glycol)2(1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b))3(ethane)2(hexaethylene glycol)(triaethylene glycol)], and (iv) [(triaethylene glycol)2(ethane)(1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b))6(ethane)7].
In another aspect, the present invention provides a symmetrically synthesized report tether (SSRT), in which the symmetrically synthesized reporter tether is a polymer having a first end and a second end, and includes in series from the first end to the second end a first linker, a reporter construct according to any one of the above reporter constructs, and a second linker, in which the first and second linkers are identical and are polymers comprising two or more repeat units selected from: spermine (Q), hexaethylene glycol (D), 2-((4-((3-(benzoyloxy)-2-(((1-(3-(benzoyloxy)-2-((benzoyloxy)methyl)-2-((phosphodiester-oxy)methyl)propyl)-1H-1,2,3-triazol-4-yl)methoxy)methyl)-2-((benzoyloxy)methyl)propoxy)methyl)-1H-1,2,3-triazol-1-yl)methyl)-2-O-phosphodiester-propane-1,3-diyl dibenzoate (compound 62), 1,3-O-bis(phosphodiester-2,2-bis(1-Me-4-(Me-O-PEG2-O-Bz)-1,2,3-triazole)-propane (compound 52), 1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35c), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG7)-1-(Et-OBz)-1,2,3-triazole)-propane (compound 35d), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG3)-1-(Et-2,2,2-Tris-(Me-O-Bz))-1,2,3-triazole)-propane (compound 37a), 1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-2,2,2-Tris-(Me-O—Ac))-1,2,3-triazole)-propane (compound 37b), 1,2-O-bis(phosphodiester)-3-(4-(Me-O-PEG3-O-Bz)-1-(1,2,3-triazole))-propane (compound 31d), 1,3-O-bis(phosphodiester-2,2-bis(4-(Me-O-PEG2-O-Me)-1-(Et-O-Bz)-1,2,3-triazole)-propane (compound 47f), 1,3-O-bis(phosphodiester-2,2-bis(4-(Me-O-PEG3-O-Me)-1-(Et-2,2,2-Tris-(Me-O-Bz))-1,2,3-triazole)-propane (compound 47i), 1,2-O-bis(phosphodiester)-3-(4-methylpiperazine-1-yl)-propane (compound 20j), 1,3-O-bis(phosphodiester-2,2-bis(4-(Me-O-PEG3-O-Me)-1-(Et-O-Bz)-1,2,3-triazole)-propane (compound 47g), and 1,1′-O-bis(phosphodiester)-N(p-tolyl)-diethanolamine (compound 26b).
In some embodiments, the symmetrically synthesized reporter tether (SSRT) includes a polymerase enhancement region comprising two repeat units of spermine.
In other embodiments, the symmetrically synthesized reporter tether (SSRT) includes a translocation deceleration region comprising two or more repeat units selected from: 1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35c), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG7)-1-(Et-OBz)-1,2,3-triazole)-propane (compound 35d), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG3)-1-(Et-2,2,2-Tris-(Me-O-Bz))-1,2,3-triazole)-propane (compound 37a), and 1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-2,2,2-Tris-(Me-O—Ac))-1,2,3-triazole)-propane (compound 37b).
In other embodiments, the symmetrically synthesized reporter tether (SSRT) includes a translocation deceleration region comprising a polymer selected from: (i) [((hexaethylene glycol) (1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35c))3(hexaethylene glycol)2], (ii) [((hexathylene glycol)(1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35c))4(hexaethylene glycol)2], (iii) [((hexathylene glycol)(1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG7)-1-(Et-OBz)-1,2,3-triazole)-propane (compound 35d))4(hexaethylene glycol)2], and (iv) [((hexathylene glycol)(1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-2,2,2-Tris-(Me-O—Ac))-1,2,3-triazole)-propane (compound 37b))4(hexaethylene glycol)2].
In yet other embodiments, the first end and the second end of the symmetrically synthesized reporter tether (SSRT) include a linkage moiety and, in certain embodiments, the linkage moiety is an azido (—N3) group.
In another aspect, the present invention provides a method for sequencing a target nucleic acid, comprising the steps of: a) providing a daughter strand produced by a template-directed synthesis, the daughter strand comprising a plurality of XNTP subunits coupled in a sequence corresponding to a contiguous nucleotide sequence of all or a portion of the target nucleic acid, wherein the individual XNTP subunits of the daughter strand comprise a reporter construct, a nucleobase residue, and a selectively cleavable bond, and wherein the reporter construct, upon cleavage of the selectively cleavable bond, permits lengthening of the subunits of the daughter strand; b) cleaving the selectively cleavable bonds to yield an Xpandomer of a length longer than the plurality of the subunits of daughter strand, the Xpandomer comprising the reporter constructs for parsing genetic information in a sequence corresponding to the contiguous nucleotide sequence of all or a portion of the target nucleic acid; and c) detecting the reporter constructs of the Xpandomer.
In some aspects, the reporter constructs for parsing the genetic information comprise a reporter code and a translocation control element, wherein the translocation control element provides translocation control by steric hindrance and pauses translocation of the Xpandomer when passed through a nanopore subjected to a baseline voltage, wherein the translocation control element engages the reporter code within the aperture of the nanopore, wherein the reporter code is sensed by the nanopore.
In some embodiments, the Xpandomer resumes translocation through the nanopore by application of a pulse voltage, in which the pulse voltage is sufficient to allow translocation of the translocation control element, while leaving the next reporter construct of the Xpandomer free to engage with the nanopore.
In other embodiments, the translocation control element of the reporter construct engaged with the nanopore by steric hindrance translocates upon each pulse of the pulsed voltage.
In some embodiments, the target construct is sensed by the nanopore during the time period between pulses of the pulsed voltage.
In certain embodiments, the baseline voltage is from about 55 mV to about 75 mV and the pulse voltage is from about 550 mV to about 700 mV.
In some embodiments, the pulse voltage has a duration from about 5 μs to about 10 μs and a periodicity from about 0.5 ms to 1.5 ms.
In other embodiments, the nanopore is subjected to an alternating current (AC).
In further embodiments, one or more of the XNTP subunits includes a 2′ Fluoroarabinosyl epimer.
In another aspect, the present disclosure provides a buffer for controlling the rate of translocation of a polymer through a nanopore comprising at least one salt selected from the group consisting of NH4Cl, MgCl2, LiCl, KCl, CsCl, NaCl, and CaCl2.
In some embodiments, the buffer further comprises at least one solvent selected from the group consisting of 3-methyl-2-oxazolidinone (MOA), DMF, ACN, DMSO, and NMP, wherein the solvent is present in the range from about 1% vol/vol to about 35% vol/vol.
In other embodiments, the buffer further comprises at least one additive selected from the group consisting of sodium hexanoate (NaHex), EDTA, redox reagents, PEG, glycerol, ficoll, and the like.
In another aspect, the present disclosure provides a buffer system for controlling the rate of translocation of a polymer through a nanopore detector comprising a cis buffer and a trans buffer, wherein the cis buffer has a first salt concentration and the trans buffer has a second salt concentration, wherein the first salt concentration is lower than the second salt concentration.
The present invention may be understood more readily by reference to the following detailed description of preferred embodiments of the invention and the Examples included herein. Unless otherwise explained, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs.
Reference throughout this specification to “one embodiment” or “an embodiment” and variations thereof means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment. Thus, the appearances of the phrases “in one embodiment” or “in an embodiment” in various places throughout this specification are not necessarily all referring to the same embodiment. Furthermore, the particular features, structures, or characteristics may be combined in any suitable manner in one or more embodiments.
As used in this specification and the appended claims, the singular forms “a,” “an,” and “the” include plural referents, i.e., one or more, unless the content and context clearly dictates otherwise. It should also be noted that the conjunctive terms, “and” and “or” are generally employed in the broadest sense to include “and/or” unless the content and context clearly dictates inclusivity or exclusivity as the case may be. Thus, the use of the alternative (e.g., “or”) should be understood to mean either one, both, or any combination thereof of the alternatives. In addition, the composition of “and” and “or” when recited herein as “and/or” is intended to encompass an embodiment that includes all of the associated items or ideas and one or more other alternative embodiments that include fewer than all of the associated items or ideas.
Unless the context requires otherwise, throughout the specification and claims that follow, the word “comprise” and synonyms and variants thereof such as “have” and “include”, as well as variations thereof such as “comprises” and “comprising” are to be construed in an open, inclusive sense, e.g., “including, but not limited to.” The term “consisting essentially of” limits the scope of a claim to the specified materials or steps, or to those that do not materially affect the basic and novel characteristics of the claimed invention.
The abbreviation, “e.g.” is derived from the Latin exempli gratia, and is used herein to indicate a non-limiting example. Thus, the abbreviation “e.g.” is synonymous with the term “for example.” It is also to be understood that as used herein and in the appended claims, the singular forms “a,” “an,” and “the” include plural reference unless the context clearly dictates otherwise, the term “X and/or Y” means “X” or “Y” or both “X” and “Y”, and the letter “s” following a noun designates both the plural and singular forms of that noun. In addition, where features or aspects of the invention are described in terms of Markush groups, it is intended, and those skilled in the art will recognize, that the invention embraces and is also thereby described in terms of any individual member and any subgroup of members of the Markush group, and Applicants reserve the right to revise the application or claims to refer specifically to any individual member or any subgroup of members of the Markush group.
Any headings used within this document are only being utilized to expedite its review by the reader, and should not be construed as limiting the invention or claims in any manner. Thus, the headings and Abstract of the Disclosure provided herein are for convenience only and do not interpret the scope or meaning of the embodiments.
Where a range of values is provided herein, it is understood that each intervening value, to the tenth of the unit of the lower limit unless the context clearly dictates otherwise, between the upper and lower limit of that range and any other stated or intervening value in that stated range is encompassed within the invention. The upper and lower limits of these smaller ranges may independently be included in the smaller ranges is also encompassed within the invention, subject to any specifically excluded limit in the stated range. Where the stated range includes one or both of the limits, ranges excluding either or both of those included limits are also included in the invention.
For example, any concentration range, percentage range, ratio range, or integer range provided herein is to be understood to include the value of any integer within the recited range and, when appropriate, fractions thereof (such as one tenth and one hundredth of an integer), unless otherwise indicated. Also, any number range recited herein relating to any physical feature, such as polymer subunits, size or thickness, are to be understood to include any integer within the recited range, unless otherwise indicated. As used herein, the term “about” means±20% of the indicated range, value, or structure, unless otherwise indicated.
The “Sequencing by Expansion” (SBX) protocol, developed by Stratos Genomics (see, e.g., Kokoris et al., U.S. Pat. No. 7,939,259, “High Throughput Nucleic Acid Sequencing by Expansion”) is based on the polymerization of highly modified, non-natural nucleotide analogs referred to as “XNTPs”. In general terms, SBX uses biochemical polymerization to transcribe the sequence of a DNA template onto a measurable polymer called an “Xpandomer”. The transcribed sequence is encoded along the Xpandomer backbone in high signal-to-noise reporters that are separated by ˜10 nm and are designed for high-signal-to-noise, well-differentiated responses. These differences provide significant performance enhancements in sequence read efficiency and accuracy of Xpandomers relative to natural DNA. A generalized overview of the SBX process is depicted in
XNTPs are expandable, 5′ triphosphate modified non-natural nucleotide analogs compatible with template dependent enzymatic polymerization. A highly simplified XNTP is illustrated in
Synthesis of an Xpandomer polymer is summarized in
As shown in
In this embodiment, SSRT 275 includes several functional elements, or “features” such as polymerase enhancement regions 280A and 280B, reporter codes 285A and 285B, and translation control element (TCEs) 290A and 290B. In other embodiments, the SSRT includes a single TCE. Each of these features performs a unique function during translocation of the Xpandomer through a nanopore to produce a series of unique and reproducible electronic signal. SSRT 275 is designed for controlling the rate of Xpandomer translocation by the TCE through a combination of sterics and/or electrorepulsion, as discussed further herein. Different reporter codes are sized to block ion flow through a nanopore at different measureable levels. Specific SSRT polymeric sequences can be efficiently synthesized using phosphoramidite chemistry typically used for oligonucleotide synthesis. Reporter codes and other features can be designed by selecting a sequence of specific phosphoramidites from commercially available and/or proprietary libraries. Such libraries include, but are not limited to, polyethylene glycol with lengths of 1 to 12 or more ethylene glycol units and aliphatic polymers with lengths of 1 to 12 or more carbon units. In certain embodiments, the SSRTs include features referred to as “polymerase enhancement regions” at the ends of the SSRTs proximal to the nucleotide triphosphoramidate diester. Polymerase enhancement regions may include positively charged polyamine spacers (e.g., primary, secondary, tertiary, or quarternary amines) or triamine spacers (three secondary amines each separated by three carbons) that facilitate incorporation of XNTP structures by a nucleic acid polymerase. In certain embodiments, the polymerase enhancement region includes two repeat units of spermine, in which the spermine moiety is provided by a phosphoramidite monomer having the following structure (as one of skill in the art will recognize, the trifluoroacetamide protecting groups are removed at the end of SSRT synthesis to expose the amine groups on spermine):
As used throughout the present disclosure, the term “reporter construct” refers to the element of the SSRT that includes the reporter codes, a symmetrical chemical brancher, and a translocation control element. In certain embodiments, the reporter construct is a polymer that includes, in series, from a first end to a second end, a first reporter code, a symmetrical chemical brancher bearing a translocation control element, and a second reporter code. The term “bearing” refers to a covalent linkage between the symmetrical brancher and the translocation control element, which produces an advantageous orientation of the translocation control element with respect to the two reporter codes. As discussed further herein and with reference to
As used throughout the present disclosure, the terms “linker A” and “linker B” refer to the regions of the SSRT that each include a polymerase enhancing region and one or more translocation deceleration features or regions, and, in certain embodiments, a spacer region that includes a polymer of, e.g., PEG6, which can be customized to modulate the length of the SSRT traversed in a nanopore.
In certain embodiments, an XNTP may be a compound having the following generalized structure:
In one embodiment, R may be H, for example, when the compounds are used to sequence a DNA template. In another embodiment, R may be OH, for example, when the compounds are used to sequence an RNA template.
In certain embodiments, nucleobase is adenine, cytosine, guanine, thymine, uracil or a nucleobase analog. As one of skill in the art will appreciate, adenine, cytosine, guanine, thymine, and uracil are naturally occurring nucleobases. As used herein, the term “nucleobase analog” refers to non-naturally occurring nucleobases that are capable of forming Watson and Crick base pair with a complementary nucleobase on an adjacent single-stranded nucleic acid template. Exemplary nucleobase analogs include, but are not limited to, 5-fluorouracil; 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine, 5′-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, 2,6-diaminopurine, 3-nitropyrrole, 8-aza-7-deazaguanine, 8-aza-7-deazainosine, and 8-aza-7-deazaadenine.
As discussed herein, the reporter construct is a polymer having a first end and a second end, and includes, in series from the first end to the second end, the first reporter code, the symmetrical chemical brancher bearing the translocation control element, and the second reporter code. This series of features reflects the symmetrical structure of the reporter construct (and the entire SSRT, which includes the symmetrical linkers, linker A and linker B), in which the sequences of the two reporter codes are identical and joined, in-line in reverse orientation by the symmetrical chemical brancher. Synthesis of the entire SSRT, including the reporter construct, is discussed further herein with reference to
The α-hemolysin nanopore is typically oriented so translocation occurs by entering the vestibule side and exiting the stem side. As shown in
Phosphoramidite chemistry, typically used for automated oligonucleotide synthesis, provides an efficient and convenient means to synthesize polymeric SSRTs. However, the ultimate potential of SSRT feature design is significantly limited by the repertoire of phosphoramidite monomers (PPAs) available in commercial libraries. Commercial PPAs are largely based on nucleosidic core structures and therefore do not offer the range of physicochemical properties necessary for the design of a broader array of features that improve the efficiency and accuracy of nanopore reads. To address this shortcoming in the art, the inventors have designed and synthesized a large collection of new PPA monomeric compounds. Significantly, these compounds are not based on nucleosidic core structures, which are well known in the art and, as mentioned, constrain feature design.
As used herein, the abbreviation “PPA” refers to phosphoramidites that are O-(2-cyanoethyl)-(N,N-diisopropyl)-phosphoramidites. It is readily understood by one of skill in the art, that the term “phosphoramidite” refers to the structure of the monomeric precursor; following in-line polymerization of PPAs into an SSRT, the monomers are converted into phosphodiester linked oligomeric products.
Other methods used to make phosphodiester backbones polymers can be used to synthesize SSRTs. Accordingly, the monomers used with these chemistries can also produce SSRTs with non-nucleosidic elements. Additional methods of assembly may involve use of automated or manual assembly strategies done in solution phase or on a solid support. H-phosphonate synthesis and phosphotriester synthesis are examples known in the art. In addition, methods using enzymatic synthesis may be adapted to synthesize SSRTs (e.g., those employed in enzymatic oligonucleotide synthesis). In some embodiments, synthesis of an SSRT may be based on a combination of any of the above synthesis methods.
What follows is a brief, non-limiting summary of certain principles used to guide PPA monomer design. 1) Phosphate spacing. Compounds were designed that maintained a C3 (3 atom) spacing, which mimics the spacing of a natural nucleotide backbone. Other suitable spacings include, in certain embodiments, from 2 to 20 atom spacing. Unexpectedly, atom spacing was found to influence the rate of nanopore translocation, allowing for fine-tuning of translocation control. 2) Hydrophilicity. Compounds were designed to optimize the hydrophilic properties of SSRT features, as desired for particular functionalities. Several monomer designs were based on PEG, due to its ability to increase water solubility, which is an important property of, e.g., reporter codes. The inventors were able to fine-tune the hydrophilicity of PPA monomers by adjusting the length of the PEG polymer, as well as by terminating the PEG polymer with methyl ether or introducing 1,2,3-triazoles into the polymer, which had the unanticipated effect of further improving water solubility. 3) Steric volume. Several alternative configurations of linear, branched, cyclic, and dendrimeric PPA structures were designed and tested to evaluate the effect of steric volume on current flow through the nanopore. 4) Chirality in the backbone. The nanopore is a chiral environment. Enantiomeric compounds were designed to determine whether key nanopore signal properties were affected in any way. 5) Charge. In addition to back phosphate charge, certain compounds carry either a positive charge, e.g., teritiary amines or a negative charge, e.g., carboxylic acid. 6) Aromaticity. Compounds composed of a wide variety of aromatic hydrocarbons and heteroaromatic structures were incorporated into the backbone to determine if interactions with the nanopore produced desirable signal properties.
PPA monomeric compounds, in addition to attenuating nanopore signal properties, such as translocation rate control or current level control, also influence physicochemical properties of the Xpandomer. The Xpandomer, as a semi-synthetic polymer, exhibits properties associated with both natural polymers, e.g., DNA, and synthetic polymers. In some embodiments, certain PPA monomers may enable attenuation of undesirable inter-Xpandomer interactions or interaction between the Xpanodmer and certain process elements of the SBX work-flow. For example, it may be possible to reduce Xpandomer self-aggregation, formation of higher order glasses or gelatin, isolation, passive adsorption to, or interaction with, surfaces of containers, walls of nanochannels or fabrication devices containing the membrane and nanopore.
One class of compounds that has proven to provide outstanding functionality when incorporated into SSRT features is referred to herein as “pendant PEG”. These structures are based on a molecular core that enables linkage of one or more PEG-containing polymers in a pendant configuration relative to the core. A structural analogy can be drawn between polymers of pendant PEG compounds and a comb, in which the phosphodiester bonds between individual compounds form the base of the comb and the PEG-based polymers form the teeth. Advantageously, several properties of the pendant PEG “teeth” can be customized for particular SSRT features, e.g., one or more of the spacing, length, and composition of the polymeric teeth. Structures 1a, 2a, 3a, and 4a below illustrate four exemplary embodiments of pendant PEG core structures.
In certain non-limiting embodiments, X or X′ may represent —CH2O—[CH2CH2O—]mO— in which m is 1-10 and Y or Y′ may represent —H, —CH3,
Tables 1A-C set forth non-limiting collections of novel phosphoramidite monomeric compounds for use in, e.g., SSRT feature design. Synthetic schemes for each compound are referred to with reference to the relevant Example and specific precursors are included for each. Analytic data characterizing the purified synthesized compounds are also set forth in Table 1A. These compounds may be used to synthesize any suitable polymeric feature, e.g., SSRT reporter codes, translocation control elements and translocation deceleration features, as described in further detail herein. Table 1A also provides the names of the compounds with reference to the in-line structures they assume following incorporation into synthetic polymers.
31P
As discussed herein, the TCE feature of an SSRT is designed to stall Xpandomer translocation so as to position the reporter code within the nanopore aperture for measurement. The availability of the new phosphoramidite monomeric compounds of the present invention has enabled design of next-generation TCE structures, which control translocation rate through one or more of steric hindrance, electro-repulsion, and preferential interaction with the nanopore. The resistance of the TCE to the driving force of the ion current when positioned at the pore aperture and the consequent increase in applied voltage (i.e., the voltage pulse) necessary to overcome the arrest and resume translocation, can be customized by modulating various properties of the TCE, (and in some embodiments, the reporter codes and other elements of the SSRT) e.g., the bulk, length, and/or charge density. Importantly, because translocation rate is controlled by properties intrinsic to the TCE, translocation control is relieved of the burden of relying on prior art strategies, which employ, e.g., nucleotide hybridization strategies based on reversible interaction with soluble oligonucleotides.
In certain embodiments, TCEs are polymers produced by solid-phase synthesis using the phosphoramidite method with suitable monomeric building blocks that terminate with a branched structure (i.e., the “brancher”). Branched phosphoramidites are known in the art and include both symmetrical and asymmetrical branchers, commercially available from, e.g., Glen Research and ChemGenes. In one embodiment, the TCE brancher is a symmetrical branching CED phosphoramidite, wherein each arm of the brancher is linked to a reporter code. Exemplary symmetrical chemical branchers include 1,2,3-O-tris-(phosphosphodiester)-propane, 1,3-bis-(5-O-phosphodiester-pentylamido)-2-O-phosphodiester-propane, and 1,4,7-O-tris-(phosphodiester)-heptane.
To customize translocation control, several structural properties of the TCE (and in certain embodiments, other features of the SSRT) can be adapted. For example, one or more of the length, bulk, and charge density of the TCE and the spatial positioning of charged elements within the barrel of the nanopore can be modified. In some embodiments, the bulk of the TCE is increased by incorporating one or more pendant PEG phosphoramidites into the polymeric structure. For example, the TCE may incorporate from 2 to 30, from 2 to 20, from 3 to 15, or from 4 to 14 pendant PEG phosphoramidite compounds. In other embodiments, TCEs may include any suitable number and combination of phosphoramidite compounds set forth in Tables 1A-C. For example, a TCE may include from 1 to 10, 2 to 8, or 2 or 3 different phosphoramidite compounds, in any order; in certain embodiments, at least one of the phosphoramidite compounds is a pendant PEG phosphoramidite. In certain embodiments, the length of the entire TCE may include from 2 to 30, from 2 to 20, from 3 to 15, or from 4 to 14 phosphoramidite compounds. In some embodiments, the formula of the TCE may be represented by (PPA1)n1(PPA2)n2 wherein PPA1 and/or PPA2 represent a pendant PEG phosphoramidite compound and n1=1 to 12 and n2=0 to 10. The inventors have discovered that TCE based on this formula significantly reduce sequencing errors (e.g., insertion or deletion events) and enable single-pulse transitions between sequential SSRTs. In certain embodiments, the TCE includes a polymer synthesized from phosphoramidite compounds with the following sequence: [(1-O-DMT-3-O-PPA-2S—O-mPEG4-propane (compound 12b))]n1[(1-O-DMT-3-O-PPA-2-(4-Me-O-PEG3)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35b)]n2, in which n1 is from 0 to 6 and n2 is from 6 to 10. In other embodiments, the TCE includes one or more phosphoramidite chromophores that can be detected by UV radiation, e.g., benzofuran or triazole-containing PPAs.
In other embodiments, TCEs may include a brancher structure with more than two arms. In one embodiment, the phosphoramidite brancher may have a terminally branched structure. In this embodiment, the brancher has four arms, two of which are linked to the reporter codes, and two of which contribute to translocation control. In other embodiments. The brancher may be customized to optimize features such as size, polarity, and stability. In one embodiment, the brancher includes an isocyanuate trimer.
In further embodiments, the TCEs of the present invention may include two branchers (i.e., “double brancher” TCEs), in which the branchers are separated by a plurality of unbranched phosphoramidites. The branchers may be symmetrical or asymmetrical structures. The asymmetric structure may be a single enantiomer or racemic. Moreover, a combination of the racemate and/or both enantiomers can be used at different positions in the TCE. In this embodiment, each brancher contributes to a distinct translocation pause event, the first of which may be referred to as a “code pause” that maintains a reporter code in the nanopore, and the second of which may be referred to as a “clock pause” that produces a unique signal indicating that the preceding reporter codes has been “read” by the detection system.
Table 2 sets forth several exemplary TCE sequences. It is to be emphasized that the present invention is not intended to be limited to these particular embodiments, as the skilled artisan will appreciate that, based on the present disclosure, an extensive library of diverse TCEs can be designed to suit a wide range of experimental requirements. In certain embodiments, any of the TCE sequences set forth below could terminate at the end distal to the brancher with one or more spacer compounds, including C3, benzofuran, or PEG3. The key in Table 2 identifies the compounds in their form as phosphoramidite monomers. It will be readily apparent to one of ordinary skill in the art that the descriptor “phosphoramidite” only applies to the compounds in monomeric form; descriptors that apply to the compounds in multimeric, “in-line”, form are set forth in Table 1A.
In other aspects, the present invention provides means of translocation control through Xpandomer modification in combination with discrete translocation deceleration features (referred to herein also as “D-cells”) designed into the SSRT. As disclosed herein, Xpandomers are subjected to several processing steps following synthesis, including an amine modification step. During amine modification, Xpandomers are treated with succinic anhydride, which reacts with the secondary amine groups (and, in certain circumstances, the primary amine group introduced by the acid cleavage step) on the spermine constituents of the polymerase enhancement regions of the SSRT. Succinylation of an amine group results in the introduction of a negatively charged hemi-succinate group. By increasing the degree of succinylation, the degree of negative charge on the spermine-based enhancers is likewise increased. Each spermine phosphoramidite constituent has a net charge of (+3); in a standard modification reaction, the charge of each individual amine moiety is changed from (+1) to (−1). The inventors have discovered conditions that give varying degrees of spermine succinylation such that the net charge of a spermine constituent can be changed from between (+3) to (−5). Under certain conditions, increasing the negative charge of the enhancer regions may be desirable so as to increase the rate of Xpandomer translocation upon application of a voltage pulse (referred to herein as an enhancer electromobillity. Notably, this electromobility has been found to reduce the percentage of insertion errors during the sequence read as well as to increase overall sequencing throughput.
Thus, in certain embodiments, Xpandomer processing includes an amine modification step. Amine (e.g., spermine) modification may be achieved through altering one or more of the succinylation reaction conditions, e.g., the reaction time, temperature, pH, and/or the concentration of succinic anhydride used in the reaction. In other embodiments, Xpandomer processing further includes one or more of a HEPES wash step following the amine modification step in order to achieve more complete amine succinlyation.
In another aspect, the present disclosure provides one or more translocation deceleration features or regions (the terms “features” and “regions” are used interchangeably in this context), which are permanently charged, e.g., tertiary or quarternary amines and/or bulky compounds. Translocation deceleration features may be introduced into the SSRT at a position within or adjacent to the polymerase enhancers. The deceleration features are selected so as not to be altered during the Xpandomer modification (e.g., succinylation) reaction. Inclusion of one or more deceleration features into a suitable location in the SSRT has been found to reduce the percentage of deletion errors, which arise due to the increased translocation rate resulting from over-modification of the enhancer. Without being bound by theory, it is speculated that the bulk of the deceleration feature creates a “friction”-type of force that reduces the rate of Xpandomer translocation upon encountering the nanopore aperture. Typically, the deceleration features are introduced into the SSRT at a position between the polymerase enhancer and the reporter code (i.e., adjacent to the enhancer).
The translocation deceleration features of the present invention may incorporate any suitable number and combination of the phosphoramidite compounds set forth in Tables 1A-1C or commercially available phosphoramidites. In some embodiments, the deceleration features include a combination of from 1 to 4 different monomeric units. In other embodiments, the deceleration features may include 1 or 2 different monomeric units. The entire length of a deceleration feature may be from 1 to 15 monomeric units or, in other embodiments, from 4 to 12 or from 6 to 10 monomeric units. Table 3 sets forth non-limiting examples of alternative translocation deceleration features. The key in Table 3 identifies the compounds in their form as phosphoramidite monomers. It will be readily apparent to one of ordinary skill in the art that the descriptor “phosphoramidite” only applies to the compounds in monomeric form; descriptors that apply to the compounds in multimeric, “in-line”, form are set forth in Table 1A.
Each SSRT uses the TCE to position the reporter code within a zone of the nanopore that has high ion current resistance. In alpha hemolysin, this zone is the stem. In this zone, different reporters are sized to block ion flow at different measurable levels. Reporters can be designed by selecting a sequence of specific phosphoramidites from the collection of phosphoramidite monomeric compounds set for in Tables 1A-1C and/or commercially available libraries. Suitable monomeric compounds are also disclosed in Applicants' U.S. Pat. No. 10,457,979, which is herein incorporated by reference in its entirety, including PEG3, PEG6, and C2.
Each constituent monomeric compound contributes to the net current resistance according to its position in the nanopore, its displacement, its charge, its interaction with the nanopore, its chemical and thermal environment and other factors.
Reporter code design is guided by balancing measurement characteristics including: (i) normalized ion current (I/Io): where I is ion current and Io is the open channel current; (ii) ion current noise: includes multi-state responses, blockages, random spiking, and the like; and/or (iii) release time of the control moiety or the time during which the TCE is otherwise is stalled at the stem entrance.
Reporter ion current blockage and its duplex release time is also modulated by measurement conditions such as: (i) voltage; (ii) electrolyte; (iii) temperature; (iv) pressure; and/or (v) pH, as described further herein.
In some embodiments, the TCE associated with the reporter also contributes to the ion current blockage.
For a given set of measurement conditions reporters can be designed for a minimum and maximum I/Io levels that define the measurement dynamic range. Other reporters can be designed with different I/Io levels within the dynamic range. As each reporter is paused in the nanopore, the measured I/Io level must remain stationary long enough and have low enough noise that the reporter type can be uniquely distinguished. Dynamic range is maximized by selecting a backbone of low impedance molecules (reporter code polymers), typically those with small physical cross-sections and low linear mass densities.
Table 4 sets forth exemplary reporter codes, though It is to be understood that the present invention contemplates reporter codes incorporating any suitable combination and number of phosphoramidite compounds disclosed herein. The key in Table 4 identifies the compounds in their form as phosphoramidite monomers. It will be readily apparent to one of ordinary skill in the art that the descriptor “phosphoramidite” only applies to the compounds in monomeric form; descriptors that apply to the compounds in multimeric, “in-line”, form are set forth in Table 1A.
As disclosed herein, symmetrically synthesized reporter tethers (SSRTs) are synthesized using standard automated oligonucleotide synthesis protocols.
In one embodiment, SSRT synthesis utilizes a four-step iterative process that includes 1) synthesis of SSRT polymers on solid support controlled pore glass beads (reflected in the cartoons of steps A-D). In this step, SSRTs are synthesized one reporter construct at a time at a 1 μM scale using a MerMade™ 12 Synthesizer (commercially available from BioAutomation). The MerMade™ sequence manager is first prepared followed by preparation of the phosphoramidites (e.g., preparation of 0.067M solution of each phosphoramidite). Suitable coupling times for each phosphoramidite are programmed into the synthesizer. The SSRT synthesis cycle is based on a conventional four step process: detritylation (using a solvent of, e.g., 3% DCA in dichloromethane), monomer coupling (using a solvent of, e.g., 0.25M ETT in acetonitrile), capping (using solvents of, e.g., THF/lutidine/Ac2O (CAP A) and 16% methylimidazole in THF (CAP B)), and oxidation (using a solvent of, e.g., 0.02M 12 in THF/pyridine/H2O). Step 2) functionalization of the 5′ end with a manual conversion that displaces the Br with azide (reflected in the cartoon of step E), i.e. “azido modification”. In this step, the synthesis column is washed with 1 mL DCM and transferred to a 2 mL tube; an azide conversion solution is prepared (100 mM sodium iodide and 100 mM sodium azide in DMF) and 1.6 mL is added to the tube and incubated for 2 hrs. at RT; the support is then rinsed with 1 mL DMF and transferred to the column; the column is rinsed with 2 mL DMF followed by 3 mL ACN and 1 mL DCM. Step 3) removal of cyanoethyl protection groups. In this step, a 10% DEA solution is prepared in ACN that may include 0.1M nitromethanse; with vacuum, a steady stream of this solution is passed through the column for at least 10′; the column is then rinsed with 2 mL ACN followed by 1 mL DCM. Step 4) cleavage complete deprotection of the SSRT from the solid support (reflected in the cartoon of step F). In this step, the support is transferred to a 2 ml tube and 500 μL of 30% NH4OH that may include 100 mM nitromethane is added to the tube and incubated for 30′ at 55° C.; the tube is then chilled for 5′ in a freezer; 500 μL of 40% methylamine is added to the tube and incubated for 1 hr at 65° C. The sample is then chilled for 5′ in a freezer; the sample is then desalted by draining the column and rinsing with 15 mL H2O; the SSRT is then eluted from the column with 100 mM TEAA and quantitated.
In certain embodiments, the nucleobase of the XNTP may be a non-natural analog, e.g., 7-deazaadenine, 7-deazaguanine, and the like.
This embodiment of translocation control is illustrated in simplified form in
Any suitable set of reversible binding partners may be used for translocation control according to the present invention. In one embodiment, the TCEs include a derivative of biotin, while the translocation control moiety is provided by streptavidin. In this embodiment, the biotin derivative may be engineered to bind streptavidin with lower affinity than natural biotin. One example of a suitable biotin derivative is desthiobiotin (DTB), which is depicted in
It has been found that fine tuning of various conditions used in the nanopore-based detection system improves Xpandomer translocation control and the accuracy of code reads. Thus, in other aspects, the present disclosure provides means to improve the rate of polymer translocation through a nanopore by modification of one or more of the following run conditions:
A. Voltage Parameters
The flow of ions from the cis chamber to the trans chamber of the nanopore-based detection systems described herein results from the application of a voltage potential across the membrane that is referred to interchangeably as the “read voltage” or “baseline voltage”. In one embodiment of the present invention, Xpandomer translocation rate is modulated by altering the baseline voltage. In some embodiments, the baseline voltage may be in the range of from about 40 mV to about 150 mV. In other embodiments, the baseline voltage may be in the range of from about 90 mV to about 110 mV. In yet other embodiments, the baseline voltage may be in the range of from about 55 mV to about 75 mV. In some embodiments, a higher baseline voltage may be desired to capture reporter code reads at a higher rate.
As discussed herein, Xpandomer translocation is arrested when the TCE proximal to a reporter code encounters the aperture of the pore. The reporter code is maintained in the pore until a voltage pulse is applied that is sufficiently strong to overcome the resistance provided by TCE structure held at the pore. Thus, in another embodiment, Xpandomer translocation rate is modulated by altering the strength of the pulse voltage. In some embodiments, the pulse voltage is in the range of from about 250 mV to about 2000 mV. In other embodiments, the pulse voltage is in the range of from about 550 mV to about 700 mV. Likewise, the duration of the voltage pulse can influence the rate of Xpandomer translocation. In some embodiments, the duration of the voltage pulse is in the range of from about 1 μs to about 50 μs. In other embodiments, the duration of the voltage pulse is in the range of from about 5 μs to about 10 μs. In another embodiment, the periodicity of the pulse voltages may be optimized. In some embodiments, the periodicity is in the range of from about 0.5 ms to about 20 ms. In yet other embodiments, the periodicity of the pulse voltages is from about 0.5 ms to about 1.5 ms. The skilled artisan will appreciate that the strength, duration, and periodicity of the optimal voltage pulses will depend upon many factors, e.g., the force of TCE.
B. Salts
The rate of current flow through the nanopore-based detection systems described herein can be influenced by the salt composition of the buffers that fill the cis and trans chambers of the system. Thus, in certain embodiments, the rate of Xpandomer translocation through the pore can be modulated by salt composition. In these embodiments, salts comprising any suitable mono- or di-valent cation may be utilized. In some embodiments, suitable salts include, but are not limited, to NH4Cl, MgCl2, LiCl, KCl, CsCl, NaCl, and CaCl2. In other embodiments, suitable salts include those in which the anion is acetate. In conditions where a slower current is desired, salts with a lower ionic mobility, e.g., LiCl may be advantageous. In some embodiments, the trans chamber comprises 2M NH4Cl and a second optional salt with a suitable molarity around 0.2M and the cis chamber comprises NH4Cl with a suitable molarity in the range from about 0.4M to about 1M and a second optional salt with a suitable molarity in the range from about 0.2M to about 0.8M. In other embodiments, other molarities lying outside of these ranges and/or other combinations of salts may be desirable.
C. Chaotropic Agents
In certain other aspects, the cis chamber of the nanopore-based detection systems of the present invention may include one or more chaotropic agents to improve translocation of individual polymeric analytes, e.g., linearized Xpandomers. Any suitable chaotropic agent may be employed, e.g., urea and/or guanidine hydrochloride (GuCl). In some embodiments, the buffer compositions of the cis chamber include GuCl and/or urea in the range from about 200 mM to about 2M.
D. Osmotic Gradients
In other aspects, present invention provides nanopore-based detection systems in which an osmotic gradient is established across the membrane to influence that rate of Xpandomer translocation through the pore. Without being bound by theory, it is hypothesized that a gradient, wherein the concentration of salts and/or other additives is higher in the trans chamber relative to the cis chamber, generates a flow of water towards the nanopore, thereby drawing Xpandomers towards the pore. Under these conditions, an increase in the rate of event frequencies (e.g., code reads) may be observed at a lower run voltage. Thus, in some embodiments, the run conditions include establishment of an osmotic gradient of around 50% across the membrane; e.g., a salt (and/or other additive) concentration of around 1M in the cis chamber and a salt (and/or other additive) concentration of around 2M in the trans chamber. In further embodiments, any other suitable osmotic gradient may be employed.
E. Solvents
It has been found that certain solvents can enhance Xpandomer solubility and improve the rate translocation through a nanopore. Thus, in certain embodiments, the sample buffers of the present invention include one or more organic solvents. Suitable solvents include, but are not limited to, 3-methyl-2-oxazolidinone (MOA), DMF, ACN, DMSO, and NMP used in the range of from about 1% to about 25%.
F. Buffers, Additives, and Other Run Conditions
Suitable buffers for use in the present invention include, but are not limited to, 20 mM-100 mM HEPES with a pH of about 7.4 and bis-tris-propane buffers with a molarity in the range of from about 25 mM to about 250 mM and with a pH in the range of from about 6 to about 10. In other aspects, the buffers of the present invention may include certain detergent additives, such as sodium hexanoate (NaHex), to enhance the rate of Xpandomer translocation. In certain embodiments, the sample buffers of the present invention include around 20 mM NaHex. Other suitable additives include, but are not limited to, stabilizers such as EDTA and redox reagents. The viscosity of any of the buffers may also be altered by additives such as PEG, glycerol, ficoll and the like.
In other aspects, translocation rate of Xpandomers may be modulated by temperature. In some embodiments, the run temperature may be in the range from about 4° C. to about 40° C. In other embodiments, the run temperature may be in the range from about 16° C. to about 22° C.
In another aspect, the present disclosure provides cleavable extension oligonucleotides (EO) for Xpandomer synthesis. The cleavable design feature enables the EO to be removed, i.e., cleaved from, the Xpandomer following synthesis and prior to nanopore analysis. This functionality provides advantages when it is undesirable to translocate a polynucleotide sequence through a nanopore. Xpandomer synthesis, processing, and nanopore sequence analysis are carried out as has been described, e.g., in Applicants' PCT patent application no. PCT/US18/67763, which is herein incorporated by reference in its entirety.
One embodiment of a cleavable extension oligonucleotide is illustrated in simplified form in
One drawback of nanopore-based detection systems practiced in the art is the depletion of current over time resulting from electrolyte exhaustion that occurs during continuous application of DC voltage. For example, where an electrolyte circuit is based on a ferrocyanide-ferricyanide redox couple, each well in a nanopore array has a limited volume and thus contains a limited number of these redox ion species. Under DC voltage, one species converts to the other and will cause a drop in current. To overcome this problem of current depletion and maintain a more balanced current over time, the present disclosure provides means for detecting polymeric analytes with a nanopore-based detection system that relies, instead, on an alternating current (AC) pattern of voltage application. This pattern is referred to herein as “ratcheting”. A generalized overview of ratcheting is presented in
The bottom panel of
Although the ratcheting pattern depicted in
In other embodiments, ratcheting provides means for compensating, or correcting, for one or both of current depletion due to pulsing and asymmetry in the resistance of the different reporter codes. For example, in some embodiments, the reverse read voltage can be increased to compensate for the current loss due to the pulses applied during the forward read voltage. The percent increase in the reverse read voltage can also be adjusted to balance the current when the different reporter codes have different intrinsic resistances.
In other variations of the ratcheting scheme, the sequence of forward, pulse, and reverse voltages can be altered. For example, in one embodiment, a ratcheting cycle could be run as follows: (forward read voltage), (forward read voltage), (reverse read voltage). In another embodiment, a ratcheting cycle could be run as follows: (forward read)n(reverse read)n in which “n” represents the total number of monomeric units in the polymer being measured by the nanopore.
In a related idea, “flossing” was proposed whereby DNA may be read in a nanopore along its full length, stopped and then upon reversing the voltage polarity could be read in the other direction (see, e.g., Kasianowicz, John J. “Nanopores: Flossing with DNA.” Nature Materials 3, no. 6 (2004): 355-56. https://doi.org/10.1038/nmat1143). This is a less efficient method in an array because the DNA polymers are not captured or stopped synchronously whereas ratcheting is a continuous forward process on these time scales.
In another aspect, the present invention discloses methods and kits for the detection and diagnostics of genetic alterations/mutations in a target sample, which may be a solid tissue or a bodily fluid. The genetic alterations may be either germline or somatic mutations. The invention may be used for detection and diagnostics related to cancer, auto-immune disease, organ transplant rejection, genetic fetal abnormalities, pathogens, and other suitable conditions.
The following materials, having the abbreviations as indicated, were obtained from the mentioned sources in the United States, unless otherwise indicated. 2-Phenyl-1,3-dioxan-5-ol, TBDPS-Cl (t-butyldiphenylchlorosilane), DMAP (4-dimethylaminopyridine), (R)-(+)-glycidol, (+)-2,3-O-Isopropylidene-L-threitol, isosorbide, 4,4′-Bis(hydroxymethyl)-2,2′-bipyridine, TBTA (Tris[(1-benzyl-1H-1,2,3-triazol-4-yl)methyl]amine) from TCI America (Portland, Oreg.). NaH (sodium hydride), MeOH (methanol), toluene, THF (tetrahydrofuran), TBAF (tetrabutylammonium fluoride), DCM (dichloromethane), HCl (concentrated hydrochloric acid), DMSO (dimethylsulfoxide), Na ascorbate (sodium ascorbate), sodium bicarbonate, copper sulfate, dimethyl propargylmalonate, lithium borohydride, and acetic acid were obtained from Sigma-Aldrich (St. Louis, Mo.). DMT-Cl (4,4′-dimethoxytrityl chloride) and PPA-Cl (N,N-diisopropylamino cyanoethyl phosphonamidic chloride) from ChemGenes Corporation (Wilmington, Mass.). TEA (triethylamine), hexanes, ethyl acetate, EDTA (ethylenediaminetetraacetic acid), diethyl ether from EMD Millipore (Billerica, Mass.). m-PEG4-Tos was made from m-PEG4-OH (Cat. No. BP-23742). Furo[3,2-c]pyridin-4(5h)-one (Combi-Blocks, San Diego, Calif.).
High performance liquid chromatography (HPLC) was performed on a ProStar Helix™ HPLC system from Agilent Technologies, Inc. (Santa Clara, Calif.) consisting of two pumps (ProStar 210 Solvent Delivery Modules) with 10 ml titanium pump heads, a column oven (ProStar 510 Air Oven), a UV detector (ProStar 320 UV/Vis Detector) set at 292 nm. The system is controlled by Star Chromatography Workstation Software (version 6.41). The column used was a Cadenza Guard Column System CD-C18 (2.0 mm×5 mm) both from Imtakt USA (Portland, Oreg.). The buffers used are Buffer A (100 mM triethylammonium acetate, pH 7.0) and Buffer B (100 mM triethylammonium acetate, pH 7.0 with 95% by volume acetonitrile). Automated solid phase phosphoramidite synthesis was done on a MerMade™ 12 synthesizer (Bioautomation Corp, Plano, Tex.). Synthesis solutions for the MerMade™ were purchased from Glen Research (Sterling, Va.).
2-Phenyl-1,3-dioxan-5-ol (1, 2.7 g, 15 mmol) was dissolved in 30 mL anhydrous THF. Sodium hydride (1.08 g, 27 mmol) was added to generate alkoxide. When the bubbling ceased, mPEG4-Tos (4.94 g, 18 mmol) was dissolved in 10 mL THF and added portion-wise. The reaction was brought to 40° C. and incubated with stirring for 3 h, then allowed to come down to room temperature overnight. Excess NaH was quenched with 1 mL MeOH, then diluted with water and extracted with DCM. The combined organic layers were concentrated under reduced pressure. The residue was resuspended in toluene, separated from remaining salts, and purified by flash chromatography to afford 2 in 73% yield.
Benzylidine protected 2b (3.05 g, 10.8 mmol) was dissolved in 10 mL MeOH and HCl (0.2 mL, 2.3 mmol) was added. The solution was incubated for 20 minutes, then neutralized with sodium bicarbonate (200 mg) and dried under reduced pressure. The residue was resuspended in DCM and purified by flash chromatography to afford diol 3 in 72% yield.
Diol 3 (1.52 g, 7.8 mmol) was dissolved in 20 mL DCM and TEA (2.17 mL, 15.6 mmol). A solution of DMT-Cl (1.85 g, 5.46 mmol) in 10 mL DCM was added portion-wise over 90 minutes to maximize monotritylation. MeOH (1 mL) was added and the reaction was dried under reduced pressure. The residue was resuspended in toluene and separated from the salts, then purified by flash chromatography to afford the mono-trityl product 4 in 48% yield as well as recovered starting diol.
Monotrityl 4 (1.84 g, 3.7 mmol) was dissolved in 10 mL DCM and TEA (1.03 mL 7.4 mmol). PPA-Cl (1.05 g, 4.4 mmol) was added and the reaction was incubated 15 minutes. The reaction was dried down under reduced pressure and resuspended in toluene with 1% TEA, then purified by flash chromatography. Phosphoramidite 5 (2.03 g, 2.9 mmol) was afforded in 79% yield and confirmed by 1H and 31P NMR.
2,3-Isopropylidene-sn-glycerol 6 was dissolved in anhydrous DCM and TEA. DMAP and TBDPS-Cl were added. The reaction was extracted from water with DCM and purified by flash chromatography to afford product 7.
Silyl ether 7 was dissolved in MeOH and HCl was added. The solution was incubated 20 minutes, then neutralized with sodium bicarbonate and dried under reduced pressure. The residue was resuspended in DCM and purified by flash chromatography to afford diol 8.
Diol 8 was dissolved in DCM and TEA. A solution of DMT-Cl in DCM was added portion-wise. MeOH was added and the reaction was dried under reduced pressure. The residue was resuspended in toluene and separated from the salts, then purified by flash chromatography to afford the mono-trityl product 9.
Secondary alcohol 9 was dissolved in anhydrous THF. Sodium hydride was added to generate alkoxide. When the bubbling ceased, mPEG4-Tos was dissolved in THF and added portion-wise. The reaction was brought to 40° C. and incubated with stirring for 3 h, then allowed to come down to room temperature overnight. Excess NaH was quenched with 1 mL MeOH, then diluted with water and extracted with DCM. The combined organic layers were dried under reduced pressure. The residue was resuspended in toluene, separated from remaining salts, and purified by flash chromatography to afford 10.
mPEG4 ether 10b was resuspended in THF and TBAF was added. The reaction was concentrated under reduced pressure and purified by flash chromatography to afford 11.
DMT PEG4 alcohol 11b was dissolved in DCM and TEA. PPA-Cl was added and the reaction and incubated 15 minutes. The reaction was dried down under reduced pressure and resuspended in toluene with 1% TEA, then purified by flash chromatography. Phosphoramidite 12 was isolated and confirmed by 1H and 31P NMR.
2,3-Isopropylidene-sn-glycerol 6 was dissolved in anhydrous THF. Sodium hydride was added to generate alkoxide. When the bubbling ceased, mPEG2-Tos (Broadpharm Cat. No. BP-2-982) was dissolved in THF and added portion-wise. The reaction was incubated for 24 hours. Excess NaH was quenched with MeOH, then diluted with water and extracted with DCM. The combined organic layers were concentrated under reduced pressure. The residue was resuspended in toluene, separated from remaining salts, and purified by flash chromatography to afford 13.
The PEG2 product 13 was dissolved in MeOH and HCl was added. The solution was incubated 20 minutes, then neutralized with sodium bicarbonate and dried under reduced pressure. The residue was resuspended in DCM and purified by flash chromatography to afford diol 14.
Diol 14 was dissolved in DCM and TEA. A solution of DMT-Cl in DCM was added portion-wise. MeOH was added and the reaction was dried under reduced pressure. The residue was resuspended in toluene and separated from the salts, then purified by flash chromatography to afford the mono-trityl product 15.
Mono DMT 15 was dissolved in DCM and TEA. PPA-Cl was added and the reaction was incubated 15 minutes. The reaction was dried down under reduced pressure and resuspended in toluene with 1% TEA, then purified by flash chromatography. Phosphoramidite 16 was isolated and confirmed by 1H and 31P NMR.
(R)-(+)-glycidol 17 was dissolved in DCM and TEA. A solution of DMT-Cl in DCM was added portion-wise. MeOH was added and the reaction was dried under reduced pressure. The residue was resuspended in toluene and separated from the salts, then purified by flash chromatography to afford DMT ether 18.
DMT ether 18 was dissolved in anhydrous DMF. Sodium hydride was added to generate alkoxide. When the bubbling ceased, Furo [3,2-c]pyridin-4(5h)-one was dissolved in THF and added portion-wise. The reaction was brought to 100° C. and incubated with stirring for 12 h. Excess NaH was quenched with MeOH, then diluted with water and extracted with DCM. The combined organic layers were concentrated under reduced pressure. The residue was resuspended in toluene, separated from remaining salts, and purified by flash chromatography to afford 19.
Secondary alcohol 19 was dissolved in DCM and TEA. PPA-Cl was added and the reaction and incubated 15 minutes. The reaction was dried down under reduced pressure and resuspended in toluene with 1% TEA, then purified by flash chromatography. Phosphoramidite 20 was isolated and confirmed by 1H and 31P NMR.
Isosorbide 21 was dissolved in DCM and TEA. A solution of DMT-Cl in DCM was added portion-wise. MeOH was added and the reaction was dried under reduced pressure. The residue was resuspended in toluene and separated from the salts, then purified by flash chromatography to afford monotrityl 22.
Monotrityl 22 was dissolved in DCM and TEA. PPA-Cl was added and the reaction was incubated 15 minutes. The reaction was dried down under reduced pressure and resuspended in toluene with 1% TEA, then purified by flash chromatography. Phosphoramidite 23 was isolated and confirmed by 1H and 31P NMR.
4,4′-Bis(hydroxymethyl)-2,2′-bipyridine 24 was dissolved in DCM and TEA. A solution of DMT-Cl in DCM was added portion-wise. MeOH was added and the reaction was dried under reduced pressure. The residue was resuspended in toluene and separated from the salts, then purified by flash chromatography to afford monotrityl 25.
Monotrityl 25 was dissolved in DCM and TEA. PPA-Cl was added and the reaction was incubated 15 minutes. The reaction was dried down under reduced pressure and resuspended in toluene with 1% TEA, then purified by flash chromatography. Phosphoramidite 26 was isolated and confirmed by 1H and 31P NMR.
Dimethyl propargylmalonate 27 was added dropwise to a cold suspension of lithium borohydride in diethyl ether. The reaction was warmed to room temperature and incubated overnight. The reaction was quenched with methanol, then water, then acetic acid. The solution was extracted with ether and the combined organic layers were concentrated under reduced pressure. The crude material was purified by flash chromatography to afford diol 28.
Diol 28 was dissolved in DCM and TEA. A solution of DMT-Cl in DCM was added portion-wise. MeOH was added and the reaction was dried under reduced pressure. The residue was resuspended in toluene and separated from the salts, then purified by flash chromatography to afford DMT alcohol 29.
DMT alcohol 29 was dissolved in DMSO and azide (Cat. No. BP-20988 Broadpharm) was added. Separately, TBTA was dissolved in DMSO and sodium ascorbate and copper sulfate were combined. The TBTA solution was added to the alkyne/azide solution in portions with stirring. After 45 minutes of incubation, the reaction was quenched with EDTA. The solution was diluted with water and extracted with ethyl acetate, then the organic layers were concentrated under reduced pressure and purified by flash chromatography to afford 30.
1,2,3-Triazole 30a was dissolved in DCM and TEA. PPA-Cl was added and the reaction was incubated 15 minutes. The reaction was dried down under reduced pressure and resuspended in toluene with 1% TEA, then purified by flash chromatography. Phosphoramidite 31 was isolated and confirmed by 1H and 31P NMR.
1-O-TBDPS-3-O-DMTr-propane-1,2,3-triol 9 (from Example 2) was dissolved in anhydrous THF. Sodium hydride was added to generate alkoxide. When the bubbling ceased, tosylate (prepared via tosylation of Cat. No. BP-21657, Broadpharm) was added portion-wise. The reaction was incubated with stirring for 48 h. Excess NaH was quenched with water, then the solution was transferred to a separatory funnel and extracted with ethyl acetate. The combined organic layers were washed with brine, dried with sodium sulfate, filtered and concentrated under reduced pressure. The residue was resuspended in toluene, separated from remaining salts, and purified by flash chromatography to afford 32.
Alkyne 32a was resuspended in THF and TBAF was added. The reaction was concentrated under reduced pressure and purified by flash chromatography to afford 33.
To a solution of alkyne 33a in DMSO was added 2-azidoethyl acetate. In a separate vial, dissolve sodium ascorbate in water and add DMSO followed by 1M CuSO4, to prepare catalyst mixture. Add catalyst mixture to solution of alkyne/azide dropwise over 10 minutes. Upon completion, quench with 0.5M EDTA and stir for 15 minutes. Dilute with water and extract with ethyl acetate three times. Wash combined organic extractions with brine and dry over sodium sulfate. The residue was resuspended in toluene and purified by flash chromatography to afford 34a.
Triazole 34a was dissolved in DCM and TEA. PPA-Cl was added and the reaction was incubated 15 minutes. The reaction was dried down under reduced pressure and resuspended in toluene with 1% TEA, then purified by flash chromatography. Phosphoramidite 35 was isolated and confirmed by 1H and 31P NMR.
To a solution of product 33 in DMSO was added 2-(acetoxymethyl)-2-(azidomethyl)propane-1,3-diyl diacetate (prepared by dissolving 2-(Bromomethyl)-2-(hydroxymethyl)-1,3-propanediol in DMF and subsequent addition of NaN3. The reaction was incubated at 110° C., concentrated, and purified by flash chromatography. Upon isolation of the product, the residue was reacted with acetic anhydride and purified). In a separate vial, dissolve sodium ascorbate in water and add DMSO followed by 1M CuSO4, to prepare catalyst mixture. Add catalyst mixture to solution of alkyne/azide dropwise over 10 minutes. Upon completion, quench with 0.5M EDTA and stir for 15 minutes. Dilute with water and extract with ethyl acetate three times. Wash combined organic extractions with brine and dry over sodium sulfate. The residue was resuspended in toluene and purified by flash chromatography to afford 36.
Primary alcohol 36b was dissolved in DCM and TEA. PPA-Cl was added and the reaction was incubated 15 minutes. The reaction was dried down under reduced pressure and resuspended in toluene with 1% TEA, then purified by flash chromatography. Phosphoramidite 37 was isolated and confirmed by 1H and 31P NMR.
1-O-TBDPS-3-O-DMTr-propane-1,2,3-triol 9 (from Example 2) was dissolved in anhydrous THF. Sodium hydride was added to generate alkoxide. When the bubbling ceased, tosylate (prepared by sequential tosylation and silyl-protection of Cat. No. BP-21036, Broadpharm) was dissolved in THF and added portion-wise. The reaction was incubated with stirring overnight. Excess NaH was quenched with water and extracted with DCM. The combined organic layers were dried under reduced pressure. The residue was resuspended in toluene, separated from remaining salts, and purified by flash chromatography to afford 38.
Bis-silyl ether 38 was resuspended in THF and TBAF was added. The reaction was concentrated under reduced pressure and purified by flash chromatography. The purified material was resuspended in DCM and TEA was added. BzCl was added dropwise. The reaction was stirred at room temperature until complete. The reaction was concentrated under reduced pressure and purified by flash chromatography to afford alcohol 39.
Alcohol 39 was dissolved in DCM and TEA. PPA-Cl was added and the reaction and incubated 15 minutes. The reaction was dried down under reduced pressure and resuspended in toluene with 1% TEA, then purified by flash chromatography. Phosphoramidite 40 was isolated and confirmed by 1H and 31P NMR.
O,O′-Benzylidenepentaerythritol (41, Cat. No. B2682, TCI) was dissolved in anhydrous THF. Sodium hydride was added to generate alkoxide. When the bubbling ceased, tosylate (prepared via tosylation of Cat. No. BP-21397, Broadpharm or Cat. No. BP-21657, Broadpharm) was dissolved in THF and added portion-wise. The reaction was incubated with stirring overnight. Excess NaH was quenched with water and extracted with DCM. The combined organic layers were dried under reduced pressure. The residue was resuspended in toluene, separated from remaining salts, and purified by flash chromatography to afford 42a-h.
Products 42a-h were dissolved in MeOH and HCl was added. The reaction was incubated at room temperature overnight, then neutralized with sodium bicarbonate. It was concentrated under reduced pressure and purified by flash chromatography to afford 43a-h.
Products 43a-h were dissolved in DCM and TEA. A solution of DMT-Cl in DCM was added portion-wise. The reaction was dried under reduced pressure. The residue was resuspended in toluene and separated from the salts, then purified by flash chromatography to afford 44a-h.
Products 44a-d were dissolved in DCM and TEA. PPA-Cl was added and the reaction was incubated 15 minutes. The reaction was dried down under reduced pressure and resuspended in toluene with 1% TEA, then purified by flash chromatography to afford phosphoramidites 45a-d.
To a solution of product 44 (e-h) in DMSO was added 2-azidoethyl acetate. In a separate vial, dissolve sodium ascorbate in water and add DMSO followed by 1M CuSO4, to prepare catalyst mixture. Add catalyst mixture to solution of alkyne/azide dropwise over 10 minutes. Upon completion, quench with 0.5M EDTA and stir for 15 minutes. Dilute with water and extract with ethyl acetate three times. Wash combined organic extractions with brine and dry over sodium sulfate. The residue was resuspended in toluene and purified by flash chromatography to afford 46e-h.
Products 46e-h were dissolved in DCM and TEA. PPA-Cl was added and the reaction and incubated 15 minutes. The reaction was dried down under reduced pressure and resuspended in toluene with 1% TEA, then purified by flash chromatography to afford phosphoramidites 47e-h.
2,2-Bis(bromomethyl)-1,3-propanediol (48, Cat. No. D1808, TCI) was dissolved in DMF and NaN3 was added. The reaction was incubated at 110° C., concentrated, and purified by flash chromatography to afford product 49.
To a solution of 49 in DMSO was added 2-(2-(prop-2-yn-1-yloxy)ethoxy)ethyl benzoate (prepared by benzoylation of the commercial alcohol precursor). In a separate vial, dissolve sodium ascorbate in water and add DMSO followed by 1M CuSO4, to prepare catalyst mixture. Add catalyst mixture to solution of alkyne/azide dropwise over 10 minutes. Upon completion, quench with 0.5M EDTA and stir for 15 minutes. Dilute with water and extract with ethyl acetate three times. Wash combined organic extractions with brine and dry over sodium sulfate. The residue was resuspended in toluene and purified by flash chromatography to afford 50.
Product 50 was dissolved in DCM and TEA. A solution of DMT-Cl in DCM was added portion-wise. The reaction was dried under reduced pressure. The residue was resuspended in toluene and separated from the salts, then purified by flash chromatography to afford 51.
Product 51 dissolved in DCM and TEA. PPA-Cl was added and the reaction was incubated 15 minutes. The reaction was dried down under reduced pressure and resuspended in toluene with 1% TEA, then purified by flash chromatography to afford phosphoramidite 52.
2-(Bromomethyl)-2-(hydroxymethyl)-1,3-propanediol (53, Cat. No. B4057, TCI) was dissolved in DMF and NaN3 was added. The reaction was incubated at 110° C., concentrated, and purified by flash chromatography to afford product 54.
Product 54 was dissolved in DCM and TEA. Benzoyl chloride was added. The solution was incubated overnight at room temperature. The reaction was extracted from water with DCM and purified by flash chromatography to afford product 55, the bis-Bz which was separated from any mono- or tri-protected species and divided.
A portion of product 55 was dissolved in DCM and TEA. A solution of DMT-Cl in DCM was added portion-wise. MeOH was added and the reaction was dried under reduced pressure. The residue was resuspended in toluene and separated from the salts, then purified by flash chromatography to afford the tritylated 56.
41 was dissolved in anhydrous THF. Sodium hydride was added to generate alkoxide. When the bubbling ceased, propargyl bromide was dissolved in THF and added portion-wise. Excess NaH was quenched with 1 mL MeOH, then diluted with water and extracted with DCM. The combined organic layers were dried under reduced pressure. The residue was resuspended in toluene, separated from remaining salts, and purified by flash chromatography to afford 57.
Product 57 was dissolved in MeOH and HCl was added. The reaction was incubated at room temperature overnight, then neutralized with sodium bicarbonate. It was concentrated under reduced pressure and purified by flash chromatography to afford 58.
Product 58 was dissolved in DCM and TEA. Benzoyl chloride was added and the reaction and incubated 60 minutes. The reaction was extracted from water with DCM and purified by flash chromatography to afford product 59.
Products 56 and 59 were dissolved in 9:1 DMSO:H2O. A solution of TBTA, sodium ascorbate and copper sulfate was added and the reaction was incubated 60 minutes. The reaction was extracted from water with DCM and purified by flash chromatography to afford product 60.
Products 60 and 55 were dissolved in 9:1 DMSO:H2O. A solution of TBTA, sodium ascorbate and copper sulfate was added and the reaction was incubated 60 minutes. The reaction was extracted from water with DCM and purified by flash chromatography to afford product 61.
Product 61 was dissolved in DCM and TEA. PPA-Cl was added and the reaction was incubated 15 minutes. The reaction was dried down under reduced pressure and resuspended in toluene with 1% TEA, then purified by flash chromatography. Phosphoramidite 62 was isolated and confirmed by 1H and 31P NMR.
A solution of (+)-2,3-O-Isopropylidene-L-threitol 63 in anhydrous DMF was slowly added to a mixture of NaH in anhydrous DMF (Note: vigorous evolution of H2 gas). When the bubbling ceased, mPEG2-Tos (Broadpharm Cat. No. BP-20983) was dissolved in DMF and added portion-wise to stir at ambient temperature overnight. The reaction mixture was poured over water, extracted with ethyl acetate and purified by flash chromatography to afford 64.
Product 64 was dissolved in MeOH and HCl was added. The solution was incubated 20 minutes, then neutralized with sodium bicarbonate and dried under reduced pressure. The residue was resuspended in ethyl acetate and purified by flash chromatography to afford 65.
Product 65 was dissolved in DCM and TEA. A solution of DMT-Cl in DCM was added portion-wise. The reaction was dried under reduced pressure. The residue was resuspended in toluene and separated from the salts, then purified by flash chromatography to afford 66.
Product 66 was dissolved in DCM and TEA. PPA-Cl was added and the reaction was incubated 15 minutes. The reaction was dried down under reduced pressure and resuspended in toluene with 1% TEA, then purified by flash chromatography. Phosphoramidite 67 was confirmed by 1H and 31P NMR.
To a solution of pentaerythritol (68, Cat. No. P0039, TCI) in DMF was added p-toluenesulfonic acid. The reaction was neutralized with triethylamine, concentrated and purified by flash chromatography to afford 69.
Product 69 was added to a stirring solution of EDC-HCl, DMAP and levulinic acid in THF and stirred overnight at ambient temperature. The solution was concentrated and purified by flash chromatography to afford 70.
Product 70 was dissolved in DCM and TEA. A solution of DMT-Cl in DCM was added portion-wise. The reaction was dried under reduced pressure. The residue was resuspended in toluene and separated from the salts, then purified by flash chromatography to afford 71.
Product 71 was dissolved in DCM and TEA. PPA-Cl was added and the reaction was incubated 15 minutes. The reaction was dried down under reduced pressure and resuspended in toluene with 1% TEA, then purified by flash chromatography. Phosphoramidite 72 was confirmed by 1H and 31P NMR.
In this Example, reporter codes were synthesized with PEG-based phosphoramidites; notably, these codes do not contain nucleotides. Four exemplary reporter codes are set forth in Table 5.
The level discrimination (i.e. distinguishable electronic signal) and translocation time of each code was assessed by synthesizing an 100mer Xpandomer copy of a sequence derived from the HIV-2 genome that incorporates XNTPs in which each of the four XNTPs includes a unique code from the group set forth in Table 4. Xpandomer synthesis, processing, and nanopore sequence analysis were carried out as described in Applicants' PCT patent application no. PCT/US18/67763, which is herein incorporated by reference in its entirety. As a control, an Xpandomer copy of the same HIV-2 sequence incorporating different, known codes was sequenced in parallel. A representative trace illustrating the level discrimination and translocation time of each code is shown in
To assess the accuracy of the Xpandomer sequence information, sequence data was analyzed by histogram display of the population of sequence reads from the SBX reactions. The analysis software aligns each sequence read to the sequence of the template and trims the extent of the sequence at the end of the reads that does not align with the correct template sequence. Representative histograms of SBX sequencing of the 100mer template are presented in
In this Example, translocation control with a TCE incorporating a pendant PEG phosphoramidite was assessed by using the SBX protocol to sequence a simple 60mer template consisting of TG dinucleotide repeats. Both XATP and XCTP substrates were designed to incorporate the following TCE: Y22222222222255, in which “Y” represents the symmetric phosphoramidite brancher; “2” represents pendant PEG2; and “5” represents benzofuran. The XATP substrate was designed to incorporate the following reporter code: DDDDDDLLLL, in which “D” represents PEG6 and “L” represents C2. The XCTP substrate was designed to incorporate the following reporter code: DDDDXX44XXDL, in which “X” represents PEG3 and “4” represents pendant PEG4.
To produce Xpandomer copies of the 60mer template, primer extension reactions were conducted using 4 pm of an extension oligonucleotide and 250 pm of each XNTP. The 10 μL extension reaction included the following reagents: 50 mM TrisCl, pH 8.84, 200 mM NH4OAc, 20% PEG8K, 5% NMS, 0.75nmol polyphosphate PP-60.20, 2 μg SSB, 0.5M urea, 5 mM PEM additive (suitable Polymerase Enhancing Molecules are disclosed in Applicants' pending PCT patent application no. PCT/US18/67763, which is herein incorporated by reference in its entirety), and 1.2 μg purified recombinant DNA polymerase (suitable engineered variants of DPO4 polymerase are disclosed in Applicants' PCT patent application no.s WO2017/087281, PCT/US2018/030972, and PCT/U.S. Pat. No. 1,864,794, which are herein incorporated by reference in their entireties) The extension reaction was run for 30 minutes at 42° C.
Xpandomer products of the extension reactions were next sequenced using the SBX protocol. Briefly, the constrained Xpandomer products were cleaved to generate linearized Xpandomers. This was accomplished by first quenching the extension reaction and subjecting the Xpandomers to amine modification with 2M succinic anhydride. The phosphoramidate bonds of the Xpandomers were then cleaved by treating the sample with 11.7M DCl for 30 minutes at 23° C. Linearized Xpandomers were purified by ethanol precipitation and resuspended in a buffer supplemented with 34% ACN and 15% DMF.
For sequencing, Xpandomers were added to a sample buffer of 2.8M NH4Cl, 1.2M GuanCl, 20 mM NaHex, 10% DMF, 2 mM EDTA, and 20 mM HEPES pH 7.4. Protein nanopores were prepared by inserting α-hemolysin into a DPhPE/hexadecane bilayer member in a buffer containing 2 M NH4Cl and 100 mM HEPES, pH 7.4. This experiment used buffers of 0.4M NH4Cl, 600 mM GuanCl, and 100 mM HEPES, pH 7.4 in the cis well and 2M NH4Cl and 100 mM HEPES, pH 7.4 in the trans well of the detection system. The Xpandomer sample was heated to 70° C. for 2 minutes, cooled completely, followed by addition of 2 μL of the sample to the cis well. The voltage parameters run were as follows: 60 mV/300 mV/10 μs/2 ms (read voltage/pulse voltage/pulse voltage duration/pulse frequency). Data were acquired via Labview acquisition software. A representative trace from this run is shown in
As shown by the level numbers superimposed above the trace in
In this Example, translocation control with a TCE incorporating a pendant PEG phosphoramidite was assessed by using the SBX protocol to sequence a 60mer template consisting of repeats of the sequence, CATG. All XNTP substrates were designed to incorporate the following TCE: Y444444444444455, in which “Y” represents the symmetric phosphoramidite brancher; “4” represents pendant PEG4; and “5” represents benzofuran. The XATP substrate was designed to incorporate the following reporter code: DDDDDDLLDX; the XCTP substrate was designed to incorporate the following reporter code: DDDDDDLLLL; the XTTP substrate was designed to incorporate the following reporter code: DDDDDD44LXXX; and the XGTP substrate was designed to incorporate the following reporter code: DDDDXXL444444XLLLL, in which “D” represents PEG6, “L” represents C2, “X” represents PEG3 and “4” represents pendant PEG4.
To produce Xpandomer copies of the 60mer template, primer extension reactions were conducted using 4 pm of an extension oligonucleotide and 1000 pm of each XNTP and. The 10 μL extension reaction included the following reagents: 50 mM TrisCl, pH 8.84, 200 mM NH4OAc, 20% PEG8K, 10% NMP, 3 nmol polyphosphate PP-60.20, 2 μg SSB, 1M urea, 10 mM PEM additive, and 1.8 μg purified recombinant DNA polymerase. The extension reaction was run for 30 minutes at 37° C.
Xpandomer products of the extension reactions were next sequenced using the SBX protocol. Briefly, the constrained Xpandomer products were cleaved to generate linearized Xpandomers. This was accomplished by first quenching the extension reaction and subjecting the Xpandomers to amine modification with 2M succinic anhydride. The phosphoramidate bonds of the Xpandomers were then cleaved by treating the sample with 11.7M DCl for 30 minutes at 23° C. Linearized Xpandomers were purified by ethanol precipitation and resuspended in a buffer supplemented with 34% ACN and 15% DMF.
For sequencing, Xpandomers were added to a sample buffer of 0.8M NH4Cl, 1.2M GuanCl, and 200 mM HEPES, pH 7.4. Protein nanopores were prepared by inserting α-hemolysin into a DPhPE/hexadecane bilayer member in a buffer containing 2 M NH4Cl and 100 mM HEPES, pH 7.4. This experiment used buffers of 0.4M NH4Cl, 600 mM GuanCl, and 100 mM HEPES, pH 7.4 in the cis well and 2M NH4Cl and 100 mM HEPES, pH 7.4 in the trans well. The Xpandomer sample was heated to 70° C. for 2 minutes, cooled completely, followed by addition of 2 μL of the sample to the cis well. The voltage parameters run were as follows: 70 mV/650 mV/6 μs/1.5 ms (read voltage/pulse voltage/pulse voltage duration/pulse frequency). Data were acquired via Labview acquisition software. A representative trace from this run is shown in
The level numbers superimposed above the trace in
In this Example, translocation control with a TCE incorporating a pendant PEG phosphoramidite was assessed by using the SBX protocol to sequence a complex 100mer template. Each XNTP substrate was synthesized with the following TCE: Y22222222222255, in which “Y” represents the symmetric phosphoramidite brancher; “2” represents pendant PEG2; and “5” represents benzofuran. The XNTP substrates were synthesized with the following reporter codes: (XC)DDDDDDLLLL, in which “D” represents PEG6 and “L” represents C2; (XT)DDDDDD44LDX, in which “X” represents PEG3 and “4” represents pendant PEG4; (XA)DDDDXX44XXDL; and (XG)DDDDXXL.
To produce Xpandomer copies of the 100mer template solid-state primer extension reactions were conducted using lpmol of XATP and XCTP and 1.5 pmol of XGTP and XTTP (solid-state Xpandomer synthesis in which the extension oligo is covalently bound to a chip substrate is described in Applicants' provisional patent application no. 62/826,805, which is herein incorporated by reference in its entirety). The 50 μL extension reaction included the following reagents: 50 mM TrisCl, pH 8.84, 200 mM NH4OAc, 20% PEG8K, 10% NMP, 15 pmol polyphosphate PP-60.20, 10 μg SSB, 1M urea, 10 mM PEM additive and 9 μg purified recombinant DNA polymerase. The extension reaction was run for 30 minutes at 37° C.
Xpandomer products were next sequenced using the SBX protocol. Briefly, the constrained Xpandomer products were cleaved to generate linearized Xpandomers. This was accomplished by first quenching the extension reaction and subjecting the Xpandomers to amine modification with succinic anhydride. The phosphoramidate bonds of the Xpandomers were then cleaved by treating the sample with 7.5M DCl for 30 minutes at 23° C. Linearized Xpandomers were released from the chip substrate by photocleavage of the extension oligonucleotide and recovered in elution buffer supplemented with 15% ACN and 5% DMSO (20% final solvent).
For sequencing, Xpandomers were added to a sample buffer of 0.8M NH4Cl, 1.2M GuCl, 200 mM HEPES; pH 7.4. Protein nanopores were prepared by inserting α-hemolysin into a DPhPE/hexadecane bilayer member in a buffer of 2 M NH4Cl and 100 mM HEPES, pH 7.4. The cis well was perfused with buffer containing 0.4M NH4Cl, 600 mM GuanCl, 100 mM HEPES; pH 7.4 and the trans well was perfused with a buffer containing 2M NH4Cl, 100 mM HEPES; pH 7.4. The Xpandomer sample was heated to 70° C. for 2 minutes, cooled completely and vortexed, then a 2 μL aliquot was added to the cis well. The voltage parameters were run as follows: 60 mV/600 mV/6 μs/1.5 ms (read voltage/pulse voltage/pulse voltage duration/pulse frequency). Data were acquired via Labview acquisition software. A representative trace from this run is shown in
As shown in
In this Example, translocation control with pendant PEG-based TCEs was assessed by using the SBX protocol to sequence a complex 222mer template. Each XNTP substrate was synthesized to include the following TCE: Y44444444444455, in which “Y” represents the symmetric phosphoramidite brancher; “4” represents pendant PEG-4; and “5” represents benzofuran. The XNTP substrates were synthesized with the following reporter codes: (XC)DDDDDDLLLDX; (XT)DDDDDD44LXXX; (XA)DDDDDD444LLDX; and (XG)DDDDXXL444444XLLLL, in which “D” represents PEG-6, “L” represents C2, “X” represents PEG-3, and “4” represents pendant PEG-4.
To produce Xpandomer copies of the 222mer template, solid-state primer extension reactions were conducted using 1.25 pmol of each XNTP, 10 pmol template and 20 pmol E-oligo primer (solid-state Xpandomer synthesis in which the extension oligo is covalently bound to a chip substrate is described in Applicants' provisional patent application no. 62/826,805, which is herein incorporated by reference in its entirety). The 50 μL extension reaction included the following reagents: 50 mM TrisCl, pH 8.84, 200 mM NH4OAc, 20% PEG8K, 8% NMP, 15 nmol polyphosphate PP-60.20, 10 μg SSB, 1M urea, 5 mM PEM additive and 9 μg purified recombinant DNA polymerase. The extension reaction was run for 30 minutes at 37° C.
Xpandomer products were next sequenced using the SBX protocol. Briefly, the constrained Xpandomer products were washed in buffer B.001 (1% Tween-20/3% SDS/5 mM HEPES, pH 8.0/100 mM NaPO4/15% DMF) and cleaved to generate linearized Xpandomer by adding 200 μl buffer C.001 (7.5M DCl) and incubating for 30 minutes at 23° C. The sample was then neutralized by adding 1000 μl buffer B.001. The Xpandomer sample was then subjected to amine modification by adding 666 μmol succinic anhydride and incubating for 5 minutes at 23° C. The sample was then washed in buffer D.094 (50% ACN) and the Xpandomers were released from the substrate by photocleavage and stored in buffer AG497 (0.8M NH4Cl/1.2M GuanCl/200 mM HEPES, pH 7.4)
Protein nanopores were prepared by inserting α-hemolysin into a DPhPE/hexadecane bilayer member in a buffer of 2 M NH4Cl and 100 mM HEPES, pH 7.4. The cis well was perfused with buffer containing 0.4M NH4Cl, 600 mM GuanCl, 100 mM HEPES; pH 7.4 and the trans well was perfused with a buffer containing 2M NH4Cl, 100 mM HEPES; pH 7.4. The Xpandomer sample was heated to 70° C. for 2 minutes, cooled completely and vortexed, then a 2 μL aliquot was added to the cis well. The voltage parameters were run as follows: 60 mV/650 mV/6 μs/1.0 ms (read voltage/pulse voltage/pulse voltage duration/pulse frequency). Data were acquired via Labview acquisition software. A representative trace from this run is shown in
As shown in
In this Example, translocation control with pendant PEG-based TCEs and D-cell features was assessed by using the SBX protocol to sequence a complex 222mer template. Each XNTP substrate was synthesized to include the following TCE: Y(32)(32)(32)(32)(32)(32)(61)(61)(61)(61)(61)(61), in which “Y” represents the symmetric phosphoramidite brancher; “32” represents pendant mPEG4 (PPA032); and 61” represents pendant PEG (PPA061). Each XNTP also included the following D-cell feature: D(63)D(63)D(63)DD in which “D” represents PEG6 and “63” represent pendant PEG (PPA063). The XNTP substrates were synthesized with the following reporter codes: (XC)DDLLLX; (XT)LXXX; (XA)DD(32)(32)(32)LLLLLLL; and (XG)XXL(32)(32)(32)(32)(32)(32)LLLLLLL, in which “D” represents PEG-6, “L” represents C2, “X” represents PEG-3, and “32” represents pendant mPEG-4 (PPA032).
To produce Xpandomer copies of the 222mer template, solid-state primer extension reactions were conducted using 5000 pmol of each XNTP, 4 pmol template and 20 pmol E-oligo primer (solid-state Xpandomer synthesis in which the extension oligo is covalently bound to a chip substrate is described in Applicants' provisional patent application no. 62/826,805, which is herein incorporated by reference in its entirety). The 50 μL extension reaction included the following reagents: 50 mM TrisCl, pH 8.84, 200 mM NH4OAc, 50 mM GuCl20% PEG8K, 10% NMP, 15 nmol polyphosphate PP-60.23, 2.5 μg Kod SSB, 0.1M urea, 15 mM PEM additive and 13 μg purified recombinant DNA polymerase (a variant of DPO4 polymerase). The extension reaction was run for 60 minutes at 37° C.
Xpandomer products were next sequenced using the SBX protocol. Briefly, the constrained Xpandomer products were washed in buffer B.064 (1% Tween-20/3% SDS/5 mM HEPES, pH 8.0/100 mM NaPO4/15% DMF) and cleaved to generate linearized Xpandomer by adding 200 μl buffer C.001 (7.5M DCl) and incubating for 30 minutes at 23° C. The sample was then neutralized by adding 2000 μl buffer B.064 and incubating for 2′ at RT. The Xpandomer sample was then subjected to amine modification by adding 500 μmol succinate anhydride in buffer B.065 and incubating for 5 minutes at 23° C. The sample was then washed in buffer D.102 (50% ACN) and the Xpandomers were released from the substrate by photocleavage and eluted in 60 μl elution buffer.
Protein nanopores were prepared by inserting α-hemolysin into a DPhPE/hexadecane bilayer member in a buffer of 2 M NH4Cl and 100 mM HEPES, pH 7.4. The cis well was perfused with buffer AG242 containing 0.4M NH4Cl, 600 mM GuanCl, 100 mM HEPES; pH 7.4, and 5% glycerol and the trans well was perfused with buffer AB080 containing 0.4M NH4Cl, 600 mM GuanCl, 5% ethyl acetate, 10 mM HEPES; pH 7.4. The Xpandomer sample was heated to 70° C. for 2 minutes, cooled completely and vortexed, then a 2 μL aliquot was added to the cis well. The voltage parameters were run as follows: 70 mV/625 mV/6 μs/1.0 ms (read voltage/pulse voltage/pulse voltage duration/pulse frequency). Data were acquired via Labview acquisition software.
To assess the accuracy of the Xpandomer sequence information, sequence data was analyzed by histogram display of the population of sequence reads from the SBX reaction. The analysis software aligns each sequence read to the sequence of the template and trims the extent of the sequence at the end of the reads that does not align with the correct template sequence. A representative histogram of SBX sequencing of the 222mer template is presented in
In this example of ratcheting a single hemolysin nanopore is prepared in a lipid bilayer with vestibule on the trans-side and having reagent mix composed of 0.4M NH4Cl, 600 mM GuanCl, 100 mM HEPES; pH 7.4 in the cis reservoir and 2M NH4Cl, 100 mM HEPES; pH 7.4 in the trans reservoir. Current passing between Ag/AgCl electrodes located in each reservoir is measured by an Axopatch 200B amplifier and digitized at 100 k samples/s. To drive the current through the nanopore, a square wave with 50% duty cycle alternating between +70 mV and −50 mV is applied to the trans reservoir along with a 6 μs pulse of +600 mV applied between the transition from positive to negative voltage (all voltages referenced to the cis reservoir potential). With this applied pulse train, assuming ideal translocation with no deletions or insertions, both reporters for each XNTP in incorporated into an Xpandomer are measured, one with +70 mV and the other with −50 mV. Having two measurements for each base provides redundancy that can provide higher confidence in matched results and also help identify deletions and insertions in non-homopolymer sequence.
Using an SBX synthesis and purification protocol, an Xpandomer sample generated from a synthetic DNA template of known sequence was introduced to the cis reservoir and measurement proceeded. An example of the current measurement for a translocating Xpandomer is shown in
This Example describes the synthesis of 2′ fluoro (F) epimers of each XNTP (2′ FANA XNTPs). These epimers are based on fluorinated nucleosides, referred to as “fluoroarabinosyl nucleic acids” (FANA). It is predicted that the 2′ F epimers will demonstrate increased stability during acid treatment, which is a critical step in the synthetic pathway that produces the linearized Xpandomer product. Below are synthetic schemes for generating each 2′ FANA XNTP.
A.
Process for Making 2′FANA XTTP
In the first step, fialuridine (compound 1, available from TCI America) is coupled to 1-8 octadiyne via a Sonogashira reaction (see, e.g., Bag, S., Jana, S., and Kasula, M. (2018). Sonogashira Cross-Coupling: Alkyne-Modified Nucleosides and Their Applications. In Palladium-Catalyzed Modification of Nucleosides, Nucleosides, and Oligonucleotides (pp. 75-146). Elsevier). In the second step, compound 2 is treated with approximately one equivalent of DMTrCL in pyridine to produce compound 3. In the third step, compound 3 is converted to the amidate triphosphate, following the protocol described in U.S. Pat. No. 10,301,345 to Kokoris et al. entitled, “Phosphoramidate esters and use and synthesis thereof”, which is herein incorporated by reference in its entirety.
B.
Process for Making 2′FANA XCTP
In the first step, fialcitabine (compound 5, available from TRC Canada) is coupled to to 1-8 octadiyne via a Sonogashira reaction (as described above) to produce compound 6. In the second step, compound 6 is treated with approximately one equivalent of DMTrCL in pyridine to produce compound 7. In the third step, the exocyclic amine of compound 7 is protected by an acetyl group (see, e.g., Fan, Y., Gaffney, B., and Jones, R. (2004). Transient Silylation of the Guanosine O6 and the Amino Groups Faciltates N-Acylation. Organic Letters, 6, 15, 2555-2557.) and subsequently converted to the amidate triphosphate 8, as described in U.S. Pat. No. 10,301,345 to Kokoris et al.
C.
Process for Making 2′FANA XGTP
In the first step, 7-Deaza-7-iodoguanosine (Compound 9 available from Granlen; CAS: 444020-71-7) is treated with 1 equivalent of 1,3-Dichloro-1,1,3,3-tetramethyldisiloxane to provide compound 10 (see, e.g., Markiewicz, W. T. and Wiewiorowski, M. (1978) A new type of silyl protecting groups in nucleoside chemistry. Nuc. Acids Res. 5, s185-ss190). In the second step, compound 10 is converted to compound 11 by using fluorinating agent DAST (see, e.g., Pankiewicz, K., Kreminski, J., Ciszewski, L., Ren, W., and Watanabe, K. (1992). A synthesis of 9-(2-deoxy-2-fluoro-B-D-arabinofuranosyl)adenine and -hypoxanthine. An effect of C3′-endo to C2′-endo conformational shift on the reaction course of 2′-hydroxyl group with DAST. J. of Organic Chem. 57, 2, 553-559.) In the third step, the exocyclic amine in compound 11 is protected with a phenoxyacetyl group as described above. In the fourth step, the resulting compound 12 is coupled to 1-8 octadiyne by the Sonogashira reaction described above to afford compound 13. In the fifth step, deprotection of the siloxane group as described above will give compound 14. In the sixth step, treatment of compound 14 with 1 equivalent of DMTrCl in pyridine produces compound 15. In the seventh step, compound 15 is converted to guanosine amidate triphosphate 16 as described in U.S. Pat. No. 10,301,345 to Kokoris et al.
This same scheme can be used to synthesize the following adenosine triphosphoramidate analog:
from starting compound 7-Deaza-7-iodoadenosine (available from Granlen, CAS:24386-93-4).
All of the U.S. patents, U.S. patent application publications, U.S. patent applications, foreign patents, foreign patent applications and non-patent publications referred to in this specification and/or listed in the Application Data Sheet, including but not limited to, U.S. Provisional Patent Application No. 62/852,262 filed on May 23, 2019, U.S. Provisional Patent Application No. 62/877,183 filed on Jul. 22, 2019, and U.S. Provisional Patent Application No. 62/885,746 filed on Aug. 12, 2019, are incorporated herein by reference, in their entirety. Such documents may be incorporated by reference for the purpose of describing and disclosing, for example, materials and methodologies described in the publications, which might be used in connection with the presently described invention. The publications discussed above and throughout the text are provided solely for their disclosure prior to the filing date of the present application. Nothing herein is to be construed as an admission that the inventors are not entitled to antedate any referenced publication by virtue of prior invention.
The following embodiments are specifically contemplated as part of the disclosure. This is not intended to be an exhaustive listing of potentially claimed embodiments included within the scope of the disclosure.
Embodiment 1. A compound having the following structure:
wherein
R is OH or H;
nucleobase is adenine, cytosine, guanine, thymine, uracil or a nucleobase analog;
reporter construct is a polymer having a first end and a second end, and comprising, in series from the first end to the second end, a first reporter code, a symmetrical chemical brancher bearing a translocation control element, and a second reporter code;
linker A joins the oxygen atom of the alpha phosphoramidate to the first end of the reporter construct;
linker B joins the nucleobase to the second end of the reporter construct; and wherein
the translocation control element is a polymer comprising two or more repeat units selected from: 1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b), 1,3-O-bis(phosphodiester)-2-(4-Me-O-PEG3)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35b), 1,3-O-bis(phosphodiester)-2s-O-mPEG6-propane (compound 12c), 1,2-O-bis(phosphodiester)-3-O-mPEG2-propane (compound 16), 2,3-O-bis(phosphodiester)-1-(5-benzofuran)-propane (compound 20i), 1,2-O-bis(phosphodiester)-3-(4-methylpiperazine-1-yl)-propane (compound 20j), 2,3-O-bis(phosphodiester)-1-(N1-(2-Me-5-nitroindole)-propane (compound 20g), 1,8-O-bis(phosphodiester)-N,N-Diethylpiperazine (compound 26h), 1,2-O-bis(phosphodiester)-3-(4-(Me-O-PEG3-O-Bz)-1-(1,2,3-triazole))-propane (compound 31d), 1,3-O-bis(phosphodiester)-2s-O-(4-(Me-O-PEG2)-1-(Et-OBz)-1,2,3-triazole)-propane (compound 35a), 1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35c), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG7)-1-(Et-OBz)-1,2,3-triazole)-propane (compound 35d), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG3)-1-(Me-acetate)-1,2,3-triazole)-propane (compound 35e), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG3)-1-(Et-2,2,2-Tris-(Me-O-Bz))-1,2,3-triazole)-propane (compound 37a), 1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-2,2,2-Tris-(Me-O—Ac))-1,2,3-triazole)-propane (compound 37b), 1,3-O-bis(phosphodiester-2S—O-(PEG4-O-Bz)-propane (compound 38b), 1,3-O-bis(phosphodiester-2,2-bis(Me-O-mPEG2)-propane (compound 45b), 1,3-O-bis(phosphodiester-2,2-bis(4-(Me-O-PEG2-O-Me)-1-(Et-O-Bz)-1,2,3-triazole)-propane (compound 47f), 1,3-O-bis(phosphodiester-2,2-bis(4-(Me-O-PEG3-O-Me)-1-(Et-O-Bz)-1,2,3-triazole)-propane (compound 47g), 1,3-O-bis(phosphodiester-2,2-bis(4-(Me-O-PEG3-O-Me)-1-(Et-2,2,2-Tris-(Me-O-Bz))-1,2,3-triazole)-propane (compound 47i), or 1,3-O-bis(phosphodiester-2,2-bis(1-Me-4-(Me-O-PEG2-O-Bz)-1,2,3-triazole)-propane (compound 52).
Embodiment 2. The compound of Embodiment 1, wherein R is OH.
Embodiment 3. The compound of Embodiment 1, wherein R is H.
Embodiment 4. The compound of any one of Embodiments 1-3 wherein nucleobase is adenine.
Embodiment 5. The compound of any one of Embodiments 1-3 wherein nucleobase is cytosine.
Embodiment 6. The compound of any one of Embodiments 1-3 wherein nucleobase is guanine.
Embodiment 7. The compound of any one of Embodiments 1-3 wherein nucleobase is thymine.
Embodiment 8. The compound of any one of Embodiments 1-3 wherein nucleobase is uracil.
Embodiment 9. The compound of any one of Embodiments 1-3 wherein nucleobase is a nucleobase analog.
Embodiment 10. The compound of any one of Embodiments 1-9 wherein the symmetrical chemical brancher is selected from 1,2,3-O-tris-(phosphosphodiester)-propane, 1,3-bis-(5-O-phosphodiester-pentylamido)-2-O-phosphodiester-propane, and 1,4,7-O-tris-(phosphodiester)-heptane.
Embodiment 11. The compound of any one of Embodiments 1-9 wherein the symmetrical chemical brancher is 1,2,3-O-tris-(phosphosphodiester)-propane.
Embodiment 12. The compound of any one of Embodiments 1-11, wherein the translocation control element is a polymer comprising two or more repeat units selected from Table 1A.
Embodiment 13. The compound of any one of Embodiments 1-11, wherein the translocation control element is a polymer comprising two or more repeat units selected from 1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b) and 1,3-O-bis(phosphodiester)-2-(4-Me-O-PEG3)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35b).
Embodiment 14. The compound of any one of claims 1-11, wherein the translocation control element is a polymer comprising the following sequence: [(1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b))]n1[(1,3-O-bis(phosphodiester)-2-(4-Me-O-PEG3)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35b))]n2, wherein n1 is from 0 to 6 and n2 is from 6 to 10.
Embodiment 15. The compound of any one Embodiments 1-14, wherein the first and second reporter codes are identical.
Embodiment 16. The compound of any one Embodiments 1-14, wherein the first and second reporter codes are polymers comprising two or more repeat units selected from: hexaethylene glycol (D), ethane (L), triaethylene glycol (X), 1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b), 1,3-O-bis(phosphodiester)-2-(4-Me-O-PEG3)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35b), 1,3-O-bis(phosphodiester-2,2-bis(Me-O-mPEG2)-propane (compound 45b), 1,3-O-bis(phosphodiester-2S—O-(PEG4-O-Bz)-propane (compound 38b), 1,3-O-bis(phosphodiester)-2s-O-mPEG6-propane (compound 12c), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG3)-1-(Et-2,2,2-Tris-(Me-O-Bz))-1,2,3-triazole)-propane (compound 37a), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG3)-1-(Me-acetate)-1,2,3-triazole)-propane (compound 35e), 1,3-O-bis(phosphodiester)-2s-O-(4-(Me-O-PEG2)-1-(Et-OBz)-1,2,3-triazole)-propane (compound 35a), 1,3-O-bis(phosphodiester)-2-(4-Et-1-(Et-0-mPEG1)-1,2,3-triazole)-propane (compound 31a), 2,3-O-bis(phosphodiester)-1-(1 dimethoxyquinazolinedione)-propane (compound 20c), 2,3-O-bis(phosphodiester)-1-(N9-(3,6-dimethoxycarbazole)-propane (compound 20e), 1,1′-O-bis(phosphodiester)-2,2′-(sulfonylbis(benz-4-yl))-diethanol (compound 26d), 1,1′-O-bis(phosphodiester)-2,2′-bipyridin-4,4′-yl)-dimethanol (compound 26a), 2,3-O-bis(phosphodiester)-1-(N1-(4,6-dimethoxy-3-Me-indole)-propane (compound 20b), 3-(1,2-O-bis(phosphodiester)-propyl)-8,8-dimethylhexahydro-3H-3a,6-methanobenzo[c]isothiazole 2,2-dioxide (compound 20d), 2,3-O-bis(phosphodiester)-1-(N1-(6-Azathymine))-propane (compound 20f), 1,5-O-bis(phosphodiester)-hexahydrofuro[2,6]furan (compound 23), 1,1′-O-bis(phosphodiester)-octahydro-2,6-dimethyl-3,8:4,7-dimethano-2,6-naphthyridin-4,8-diyl)-dimethanol (compound 26e), 2,3-O-bis(phosphodiester)-1-(N1-(2-Me-5-nitroindole)-propane (compound 20h), 2,3-O-bis(phosphodiester)-1-(N1-(2-Me-5-nitroindole)-propane (compound 20g), 2,3-O-bis(phosphodiester)-1-(5-benzofuran)-propane (compound 20i), 1,2-O-bis(phosphodiester)-3-O-mPEG2-propane (compound 5b), 1,3-O-bis(phosphodiester)-2-(4-Et-1-(Et-O-mPEG3)-1,2,3-triazole)-propane (compound 31b), and 1,3-O-bis(phosphodiester)-3-O-mPEG4-propane (compound 5a).
Embodiment 17. The compound of any one of Embodiments 1-14, wherein the first and second reporter codes are polymers comprising two or more repeat units selected from hexaethylene glycol, ethane, triaethylene glycol, and any of the compounds set forth in Table 1A.
Embodiment 18. The reporter code of any one of Embodiments 1-14, wherein the first and second reporter codes are polymers comprising two or more repeat units selected from hexaethylene glycol, ethane, triaethylene glycol, and 1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b).
Embodiment 19. The compound of any one of Embodiments 1-14, wherein the first and second reporter codes are polymers comprising a sequence selected from: (i) [(hexaethylene glycol)2(ethane)3(hexaethylene glycol)(triaethylene glycol)], (ii) [(hexaethyleneglycol)2(1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b))2(ethane)(triaethylene glycol)3], (iii) [(hexaethylene glycol)2(1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b))3(ethane)2(hexaethylene glycol)(triaethylene glycol)], and (iv) [(triaethylene glycol)2(ethane)(1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b))6(ethane)7].
Embodiment 20. The compound of any one of Embodiments 1-19, wherein Linker A and Linker B are polymers comprising two or more repeat units selected from: spermine (Q), hexaethylene glycol (D), 2-((4-((3-(benzoyloxy)-2-(((1-(3-(benzoyloxy)-2-((benzoyloxy)methyl)-2-((phosphodiester-oxy)methyl)propyl)-1H-1,2,3-triazol-4-yl)methoxy)methyl)-2-((benzoyloxy)methyl)propoxy)methyl)-1H-1,2,3-triazol-1-yl)methyl)-2-O-phosphodiester-propane-1,3-diyl dibenzoate (compound 62), 1,3-O-bis(phosphodiester-2,2-bis(1-Me-4-(Me-O-PEG2-O-Bz)-1,2,3-triazole)-propane (compound 52), 1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35c), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG7)-1-(Et-OBz)-1,2,3-triazole)-propane (compound 35d), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG3)-1-(Et-2,2,2-Tris-(Me-O-Bz))-1,2,3-triazole)-propane (compound 37a), 1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-2,2,2-Tris-(Me-O—Ac))-1,2,3-triazole)-propane (compound 37b), 1,2-O-bis(phosphodiester)-3-(4-(Me-O-PEG3-O-Bz)-1-(1,2,3-triazole))-propane (compound 31d), 1,3-O-bis(phosphodiester-2,2-bis(4-(Me-O-PEG2-O-Me)-1-(Et-O-Bz)-1,2,3-triazole)-propane (compound 47f), 1,3-O-bis(phosphodiester-2,2-bis(4-(Me-O-PEG3-O-Me)-1-(Et-2,2,2-Tris-(Me-O-Bz))-1,2,3-triazole)-propane (compound 47i), 1,2-O-bis(phosphodiester)-3-(4-methylpiperazine-1-yl)-propane (compound 20j), 1,3-O-bis(phosphodiester-2,2-bis(4-(Me-O-PEG3-O-Me)-1-(Et-O-Bz)-1,2,3-triazole)-propane (compound 47g), and 1,1′-O-bis(phosphodiester)-N(p-tolyl)-diethanolamine (compound 26b).
Embodiment 21. The compound of any one of Embodiments 1-19, wherein Linker A and Linker B are polymers comprising two or more repeat units selected from spermine and any of the compounds set forth in Table 1A.
Embodiment 22. The compound of any one of Embodiments 1-19 wherein Linker A and Linker B comprise a polymerase enhancement region comprising two repeat units of spermine.
Embodiment 23. The compound of any one of Embodiments 1-22 wherein Linker A and Linker B comprise a translocation deceleration region comprising two or more repeat units selected from: 1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35c), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG7)-1-(Et-OBz)-1,2,3-triazole)-propane (compound 35d), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG3)-1-(Et-2,2,2-Tris-(Me-O-Bz))-1,2,3-triazole)-propane (compound 37a), and 1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-2,2,2-Tris-(Me-O—Ac))-1,2,3-triazole)-propane (compound 37b).
Embodiment 24. The compound of any one of Embodiments 1-22 wherein Linker A and Linker B comprise a translocation deceleration region comprising a polymer selected from: (i) [((hexaethylene glycol) (1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35c))3(hexaethylene glycol)2], (ii) [((hexathylene glycol)(1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35c))4(hexaethylene glycol)2], (iii) [((hexathylene glycol)(1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG7)-1-(Et-OBz)-1,2,3-triazole)-propane (compound 35d))4(hexaethylene glycol)2], and (iv) [((hexathylene glycol)(1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-2,2,2-Tris-(Me-O—Ac))-1,2,3-triazole)-propane (compound 37b))4(hexaethylene glycol)2].
Embodiment 25. The compound of any one of Embodiments 1-24 wherein Liker A is joined to the oxygen atom of the alpha phosphoramidate by a linkage comprising a triazole.
Embodiment 26. The compound of any one of Embodiments 1-24 wherein Liker B is joined to the nucleobase by a linkage comprising a triazole.
Embodiment 27. A reporter construct comprising a polymer having a first end and a second end, and comprising, in series from the first end to the second end, a first reporter code, a symmetrical chemical brancher bearing a translocation control element, and a second reporter code; and wherein the translocation control element is a polymer comprising two or more repeat units selected from: 1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b), 1,3-O-bis(phosphodiester)-2-(4-Me-O-PEG3)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35b), 1,3-O-bis(phosphodiester)-2s-O-mPEG6-propane (compound 12c), 1,2-O-bis(phosphodiester)-3-O-mPEG2-propane (compound 16), 2,3-O-bis(phosphodiester)-1-(5-benzofuran)-propane (compound 20i), 1,2-O-bis(phosphodiester)-3-(4-methylpiperazine-1-yl)-propane (compound 20j), 2,3-O-bis(phosphodiester)-1-(N1-(2-Me-5-nitroindole)-propane (compound 20g), 1,8-O-bis(phosphodiester)-N,N-Diethylpiperazine (compound 26h), 1,2-O-bis(phosphodiester)-3-(4-(Me-O-PEG3-O-Bz)-1-(1,2,3-triazole))-propane (compound 31d), 1,3-O-bis(phosphodiester)-2s-O-(4-(Me-O-PEG2)-1-(Et-OBz)-1,2,3-triazole)-propane (compound 35a), 1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35c), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG7)-1-(Et-OBz)-1,2,3-triazole)-propane (compound 35d), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG3)-1-(Me-acetate)-1,2,3-triazole)-propane (compound 35e), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG3)-1-(Et-2,2,2-Tris-(Me-O-Bz))-1,2,3-triazole)-propane (compound 37a), 1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-2,2,2-Tris-(Me-O—Ac))-1,2,3-triazole)-propane (compound 37b), 1,3-O-bis(phosphodiester-2S—O-(PEG4-O-Bz)-propane (compound 38b), 1,3-O-bis(phosphodiester-2,2-bis(Me-O-mPEG2)-propane (compound 45b), 1,3-O-bis(phosphodiester-2,2-bis(4-(Me-O-PEG2-O-Me)-1-(Et-O-Bz)-1,2,3-triazole)-propane (compound 47f), 1,3-O-bis(phosphodiester-2,2-bis(4-(Me-O-PEG3-O-Me)-1-(Et-O-Bz)-1,2,3-triazole)-propane (compound 47Gg, 1,3-O-bis(phosphodiester-2,2-bis(4-(Me-O-PEG3-O-Me)-1-(Et-2,2,2-Tris-(Me-O-Bz))-1,2,3-triazole)-propane (compound 47i), or 1,3-O-bis(phosphodiester-2,2-bis(1-Me-4-(Me-O-PEG2-O-Bz)-1,2,3-triazole)-propane (compound 52).
Embodiment 28. The reporter construct of Embodiment 27, wherein the symmetrical chemical brancher is selected from 1,2,3-O-tris-(phosphosphodiester)-propane, 1,3-bis-(5-O-phosphodiester-pentylamido)-2-O-phosphodiester-propane, and 1,4,7-O-tris-(phosphodiester)-heptane.
Embodiment 29. The reporter construct of Embodiment 27, wherein the symmetrical chemical brancher is 1,2,3-O-tris-(phosphosphodiester)-propane.
Embodiment 30. The reporter construct of any one of Embodiment 27-29, wherein the translocation control element is a polymer comprising two or more repeat units selected from Table 1A.
Embodiment 31. The reporter construct of any one of Embodiments 27-29, wherein the translocation control element is a polymer comprising two or more repeat units selected from 1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b) and 1,3-O-bis(phosphodiester)-2-(4-Me-O-PEG3)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35b).
Embodiment 32. The reporter construct of any one of Embodiments 27-29, wherein the translocation control element is a polymer comprising the following sequence: [(1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b))]n1[(1,3-O-bis(phosphodiester)-2-(4-Me-O-PEG3)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35b))]n2, wherein n1 is from 0 to 6 and n2 is from 6 to 10.
Embodiment 33. The reporter construct of any one of Embodiments 27-32, wherein the wherein the first and second reporter codes are identical.
Embodiment 34. The reporter construct of any one of Embodiments 27-32, wherein the wherein the first and second reporter codes are polymers comprising two or more repeat units selected from: hexaethylene glycol (D), ethane (L), triaethylene glycol (X), 1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b), 1,3-O-bis(phosphodiester)-2-(4-Me-O-PEG3)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35b), 1,3-O-bis(phosphodiester-2,2-bis(Me-O-mPEG2)-propane (compound 45b), 1,3-O-bis(phosphodiester-2S—O-(PEG4-O-Bz)-propane (compound 38b), 1,3-O-bis(phosphodiester)-2s-O-mPEG6-propane (compound 12c), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG3)-1-(Et-2,2,2-Tris-(Me-O-Bz))-1,2,3-triazole)-propane (compound 37a), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG3)-1-(Me-acetate)-1,2,3-triazole)-propane (compound 35e), 1,3-O-bis(phosphodiester)-2s-O-(4-(Me-O-PEG2)-1-(Et-OBz)-1,2,3-triazole)-propane (compound 35a), 1,3-O-bis(phosphodiester)-2-(4-Et-1-(Et-O-mPEG1)-1,2,3-triazole)-propane (compound 31a), 2,3-O-bis(phosphodiester)-1-(1 dimethoxyquinazolinedione)-propane (compound 20c), 2,3-O-bis(phosphodiester)-1-(N9-(3,6-dimethoxycarbazole)-propane (compound 20e), 1,1′-O-bis(phosphodiester)-2,2′-(sulfonylbis(benz-4-yl))-diethanol (compound 26d), 1,1′-O-bis(phosphodiester)-2,2′-bipyridin-4,4′-yl)-dimethanol (compound 26a), 2,3-O-bis(phosphodiester)-1-(N1-(4,6-dimethoxy-3-Me-indole)-propane (compound 20b), 3-(1,2-O-bis(phosphodiester)-propyl)-8,8-dimethylhexahydro-3H-3a,6-methanobenzo[c]isothiazole 2,2-dioxide (compound 20d), 2,3-O-bis(phosphodiester)-1-(N1-(6-Azathymine))-propane (compound 20f), 1,5-O-bis(phosphodiester)-hexahydrofuro[2,6]furan (compound 23), 1,1′-O-bis(phosphodiester)-octahydro-2,6-dimethyl-3,8:4,7-dimethano-2,6-naphthyridin-4,8-diyl)-dimethanol (compound 26e), 2,3-O-bis(phosphodiester)-1-(N1-(2-Me-5-nitroindole)-propane (compound 20h), 2,3-O-bis(phosphodiester)-1-(N1-(2-Me-5-nitroindole)-propane (compound 20g), 2,3-O-bis(phosphodiester)-1-(5-benzofuran)-propane (compound 20i), 1,2-O-bis(phosphodiester)-3-0-mPEG2-propane (compound 5b), 1,3-O-bis(phosphodiester)-2-(4-Et-1-(Et-O-mPEG3)-1,2,3-triazole)-propane (compound 31b), and 1,3-O-bis(phosphodiester)-3-O-mPEG4-propane (compound 5a).
Embodiment 35. The reporter construct of any one of Embodiments 27-32, wherein the wherein the first and second reporter codes are polymers comprising two or more repeat units selected from hexaethylene glycol, ethane, triaethylene glycol, and any of the compounds set forth in Table 1A.
Embodiment 36. The reporter construct of any one of Embodiments 27-32, wherein the wherein the first and second reporter codes are polymers comprising two or more repeat units selected from hexaethylene glycol, ethane, triaethylene glycol, and 1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b).
Embodiment 37. The reporter construct of any one of Embodiments 27-32, wherein the wherein the first and second reporter codes are polymers comprising a sequence selected from: (i) [(hexaethylene glycol)2(ethane)3(hexaethylene glycol)(triaethylene glycol)], (ii) [(hexaethyleneglycol)2(1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b))2(ethane)(triaethylene glycol)3], (iii) [(hexaethylene glycol)2(1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b))3(ethane)2(hexaethylene glycol)(triaethylene glycol)], and (iv) [(triaethylene glycol)2(ethane)(1,3-O-bis(phosphodiester)-2S—O-mPEG4-propane (compound 12b))6(ethane)7].
Embodiment 38. A symmetrically synthesized report tether (SSRT), wherein the symmetrically synthesized reporter tether is a polymer having a first end and a second end, and comprising in series from the first end to the second end a first linker, a reporter construct according to any one of claims 27-37, and a second linker, wherein the first and second linkers are identical and are polymers comprising two or more repeat units selected from: spermine (Q), hexaethylene glycol (D), 2-((4-((3-(benzoyloxy)-2-(((1-(3-(benzoyloxy)-2-((benzoyloxy)methyl)-2-((phosphodiester-oxy)methyl)propyl)-1H-1,2,3-triazol-4-yl)methoxy)methyl)-2-((benzoyloxy)methyl)propoxy)methyl)-1H-1,2,3-triazol-1-yl)methyl)-2-O-phosphodiester-propane-1,3-diyl dibenzoate (compound 62), 1,3-O-bis(phosphodiester-2,2-bis(1-Me-4-(Me-O-PEG2-O-Bz)-1,2,3-triazole)-propane (compound 52), 1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35c), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG7)-1-(Et-OBz)-1,2,3-triazole)-propane (compound 35d), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG3)-1-(Et-2,2,2-Tris-(Me-O-Bz))-1,2,3-triazole)-propane (compound 37a), 1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-2,2,2-Tris-(Me-O—Ac))-1,2,3-triazole)-propane (compound 37b), 1,2-O-bis(phosphodiester)-3-(4-(Me-O-PEG3-O-Bz)-1-(1,2,3-triazole))-propane (compound 31d), 1,3-O-bis(phosphodiester-2,2-bis(4-(Me-O-PEG2-O-Me)-1-(Et-O-Bz)-1,2,3-triazole)-propane (compound 47f), 1,3-O-bis(phosphodiester-2,2-bis(4-(Me-O-PEG3-O-Me)-1-(Et-2,2,2-Tris-(Me-O-Bz))-1,2,3-triazole)-propane (compound 47i), 1,2-O-bis(phosphodiester)-3-(4-methylpiperazine-1-yl)-propane (compound 20j), 1,3-O-bis(phosphodiester-2,2-bis(4-(Me-O-PEG3-O-Me)-1-(Et-O-Bz)-1,2,3-triazole)-propane (compound 47g), and 1,1′-O-bis(phosphodiester)-N(p-tolyl)-diethanolamine (compound 26b).
Embodiment 39. The symmetrically synthesized reporter tether (SSRT) of Embodiment 38, wherein the first and second linker comprise a polymerase enhancement regions comprising two repeat units of spermine.
Embodiment 40. The symmetrically synthesized reporter tether (SSRT) of Embodiments 38 or 39 comprising a translocation deceleration region comprising two or more repeat units selected from: 1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35c), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG7)-1-(Et-OBz)-1,2,3-triazole)-propane (compound 35d), 1,3-O-bis(phosphodiester-2s-O-(4-(Me-O-PEG3)-1-(Et-2,2,2-Tris-(Me-O-Bz))-1,2,3-triazole)-propane (compound 37a), and 1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-2,2,2-Tris-(Me-O—Ac))-1,2,3-triazole)-propane (compound 37b).
Embodiment 41. The symmetrically synthesized reporter tether (SSRT) of Embodiments 38 or 39 comprising a translocation deceleration region comprising a polymer selected from: (i) [((hexaethylene glycol) (1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35c))3(hexaethylene glycol)2], (ii) [((hexathylene glycol)(1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-O—Ac)-1,2,3-triazole)-propane (compound 35c))4(hexaethylene glycol)2], (iii) [((hexathylene glycol)(1,3-0-bis(phosphodiester-2s-O-(4-(Me-O-PEG7)-1-(Et-OBz)-1,2,3-triazole)-propane (compound 35d))4(hexaethylene glycol)2], and (iv) [((hexathylene glycol)(1,3-O-bis(phosphodiester-2-(4-(Me-O-PEG5)-1-(Et-2,2,2-Tris-(Me-O—Ac))-1,2,3-triazole)-propane (compound 37b))4(hexaethylene glycol)2].
Embodiment 42. The symmetrically synthesized reporter tether (SSRT) of any one of Embodiments 38-41 wherein the first end and the second end comprise a linkage moiety.
Embodiment 43. The symmetrically synthesized reporter tether (SSRT) of Embodiment 42 wherein the linkage moieties comprise an azido (—N3) group.
Embodiment 44. A method for sequencing a target nucleic acid, comprising: a) providing a daughter strand produced by a template-directed synthesis, the daughter strand comprising a plurality of XNTP subunits of claim 1 coupled in a sequence corresponding to a contiguous nucleotide sequence of all or a portion of the target nucleic acid, wherein the individual XNTP subunits of the daughter strand comprise a reporter construct, a nucleobase residue, and a selectively cleavable bond, and wherein the reporter construct, upon cleavage of the selectively cleavable bond, permits lengthening of the subunits of the daughter strand; b) cleaving the selectively cleavable bonds to yield an Xpandomer of a length longer than the plurality of the subunits of daughter strand, the Xpandomer comprising the reporter constructs for parsing genetic information in a sequence corresponding to the contiguous nucleotide sequence of all or a portion of the target nucleic acid; and c) detecting the reporter constructs of the Xpandomer.
Embodiment 45. The method of Embodiment 44, wherein the reporter constructs for parsing the genetic information comprise a reporter code and a translocation control element, wherein the translocation control element provides translocation control by steric hindrance and pauses translocation of the Xpandomer when passed through a nanopore subjected to a baseline voltage, wherein the translocation control element engages the reporter code within the aperture of the nanopore, wherein the reporter code is sensed by the nanopore.
Embodiment 46. The method of Embodiment 44, wherein the Xpandomer resumes translocation through the nanopore by application of a pulse voltage, wherein the pulse voltage is sufficient to allow translocation of the translocation control element, while leaving the next reporter construct of the Xpandomer free to engage with the nanopore.
Embodiment 47. The method of Embodiment 45, wherein the translocation control element of the reporter construct engaged with the nanopore by steric hindrance translocates upon each pulse of the pulsed voltage.
Embodiment 48. The method of Embodiment 45, wherein the target construct is sensed by the nanopore during the time period between pulses of the pulsed voltage.
Embodiment 49. The method of Embodiment 44, wherein the baseline voltage is from about 55 mV to about 75 mV.
Embodiment 50. The method of Embodiment 45, wherein the pulse voltage is from about 550 mV to about 700 mV.
Embodiment 51. The method of Embodiment 45, wherein the pulse voltage has a duration from about 5 μs to about 10 μs.
Embodiment 52. The method of Embodiment 45, wherein periodicity of the pulse voltage is from about 0.5 ms to 1.5 ms.
Embodiment 53. The method of Embodiment 44, wherein the nanopore is subjected to an alternating current (AC).
Embodiment 54. The method of any one of Embodiments 44-53, wherein one or more of the plurality of XNTP subunits comprises a 2′ fluoroarabinosyl epimer.
Embodiment 55. A buffer for controlling the rate of translocation of a polymer through a nanopore comprising at least one salt selected from the group consisting of NH4Cl, MgCl2, LiCl, KCl, CsCl, NaCl, and CaCl2).
Embodiment 56. The buffer of Embodiment 55, further comprising at least one solvent selected from the group consisting of 3-methyl-2-oxazolidinone (MOA), DMF, ACN, DMSO, and NMP, wherein the solvent is present in the range from about 1% vol/vol to about 35% vol/vol.
Embodiment 57. The buffer of Embodiment 55, further comprising at least one additive selected from the group consisting of sodium hexanoate (NaHex), EDTA, redox reagents, PEG, glycerol, ficoll and the like.
Embodiment 58. A buffer system for controlling the rate of translocation of a polymer through a nanopore detector comprising a cis buffer and a trans buffer, wherein the cis buffer comprises a first salt concentration and the trans buffer comprises a second salt concentration, wherein the first salt concentration is lower than the second salt concentration.
Number | Date | Country | |
---|---|---|---|
62852262 | May 2019 | US | |
62877183 | Jul 2019 | US | |
62885746 | Aug 2019 | US |
Number | Date | Country | |
---|---|---|---|
Parent | PCT/US2020/032950 | May 2020 | US |
Child | 17456342 | US |