The present disclosure generally relates to electronic data storage and retrieval, and more particularly to an amplification-free DNA information storage and retrieval system for storing and retrieving digital data using DNA molecules.
The advent of digital computing in the 20th Century created the need for archival storage of large amounts of digital or binary data. Archival storage is intended to house data for long periods of time, e.g., years, decades or longer, in a way that is very low cost, and that supports the rare need to re-access the data. Although an archival storage system may feature the ability to hold unlimited amounts of data at very low cost, such as through a physical storage medium able to remain dormant for long periods of time, the data writing and recovery in such a system can be the relatively slow or otherwise costly processes. The dominant forms of archival digital data storage that have been developed to date include magnetic tape, and, more recently, compact optical disc (CD). However, as data production grows, there is a need for even higher density, lower cost, and longer lasting archival digital data storage systems.
It has been observed that in biology, the genomic DNA of a living organism functions as a form of digital information archival storage. On the timescale of the existence of a species, which may extend for thousands to millions of years, the genomic DNA in effect stores the genetic biological information that defines the species. The complex enzymatic, biochemical processes embodied in the biology, reproduction and survival of the species provide the means of writing, reading and maintaining this information archive. This observation has motivated the idea that perhaps the fundamental information storage capacity of DNA could be harnessed as the basis for high density, long duration archival storage of more general forms of digital information.
What makes DNA attractive for information storage is the extremely high information density resulting from molecular scale storage of information. In theory for example, all human-produced digital information recorded to date, estimated to be approximately 1 ZB (ZettaByte) (˜1021 Bytes), could be recorded in less than 1022 DNA bases, or 1/60th of a mole of DNA bases, which would have a mass of just 10 grams. In addition to high data density, DNA is also a very stable molecule, which can readily last for thousands of years without substantial damage, and which could potentially last far longer, for tens of thousands of years, or even millions of years, such as observed naturally with DNA frozen in permafrost or encased in amber.
In spite of these attractions, using a single molecule of DNA for digital information storage and retrieval could be inefficient or even impossible due to the many sources of molecular structure errors in synthesizing a DNA molecule, loss/degradation of the molecule, and limits of signal detection from DNA sequencers used to sequence the molecule. Thus, it is frequently proposed that amplification be incorporated to provide many more molecules to engage in all these processes. However, amplification will add cost, time and operational complexity to the DNA information system. Therefore what are needed are specific processes that individually or collectively remove the need for amplification steps in the various processes that comprise a DNA data storage system.
In various embodiments, an amplification-free DNA information storage and retrieval system is disclosed. In various aspects, the system comprises a DNA reading device, a digital data encoding/decoding algorithm, and a DNA writing device, wherein the properties of these three elements are co-optimized to minimize or reduce various cost metrics and increase overall system performance. In various aspects, the co-optimization may comprise reducing the error rate of the system, through balancing, avoiding, or correcting the errors in DNA reading and writing. In other instances, the co-optimization may comprise reducing the DNA reading or writing time in the system, e.g., by avoiding the use of slower speed DNA sequence motifs, and/or by using error correction/avoidance to compensate for errors incurred from rapid operation of the system.
In various embodiments of the present disclosure, a method of archiving information is described. The method comprises: converting the information into one or more nucleotides using an encoding scheme, the nucleotides predetermined to generate distinguishable signals relating to the information in a measurable electrical parameter of a molecular electronics sensor; assembling the one or more nucleotides into a nucleotide sequence; and synthesizing a pool of replicate DNA molecules without amplification of the DNA molecules, wherein each replicate DNA molecule incorporates the nucleotide sequence.
In various embodiments, the information comprises a string of binary data.
In various embodiments, the encoding scheme converts one or more 0/1 bits of binary data within the string of binary data into a sequence motif comprising more than one nucleotide.
In various embodiments, the step of converting the information comprises dividing the string of binary data into segments, wherein each segment encodes one sequence motif.
In various embodiments, the binary data bit 0 encodes a homopolymer of A, and the binary data bit 1 encodes a homopolymer of C.
In various embodiments, one or more of the nucleotides comprises a modified nucleotide.
In various embodiments, the one or more nucleotides comprise nucleotides that are resistant to secondary structure formation in the replicate DNA molecules compared to a variant of the same nucleotides.
In various embodiments, the encoding scheme comprises any one or combination of BES1, BES2, BES3, BES4, BES5 and BES6 illustrated in
In various embodiments, the method of archiving information further comprises: exposing at least one of the replicate DNA molecules to the molecular electronics sensor without prior amplification of the DNA molecules; generating the distinguishable signals; and converting the distinguishable signals into the information, wherein the molecular electronics sensor comprises a pair of spaced-apart electrodes and a molecular sensor complex attached to each electrode to form a sensor circuit, wherein the molecular sensor complex comprises a bridge molecule electrically wired to each electrode in the pair of spaced-apart electrodes and a probe molecule conjugated to the bridge molecule.
In various embodiments, the step of exposing at least one of the replicate DNA molecules to the molecular electronics sensor comprises suspending the pool of DNA molecules in a buffer, taking an aliquot of the buffer, and providing the aliquot to the sensor.
In various embodiments, the buffer solution comprises modified dNTPs.
In various embodiments, the measurable electrical parameter of the sensor comprises a source-drain current between the spaced-apart electrodes and through the molecular sensor complex.
In various embodiments, the probe molecule for the sensor comprises a polymerase and the measurable electrical parameter of the sensor is modulated by enzymatic activity of the polymerase while processing any one of the replicate DNA molecules.
In various embodiments, the polymerase comprises the Klenow Fragment of E. coli Polymerase I, and the bridge molecule comprises a double-stranded DNA molecule.
In various embodiments of the present disclosure, a method of archiving and retrieving a string of binary data in an amplification-free DNA information storage and retrieval system is described. The method comprises: dividing the string of binary data into segments of at least one binary bit; assigning each segment to a sequence motif, each sequence motif comprising at least two nucleotides, the sequence motifs predetermined to generate distinguishable signals in a measurable electrical parameter of a molecular electronics sensor; assembling the sequence motifs into a nucleotide sequence; synthesizing a pool of replicate DNA molecules using an amplification-free DNA writing method on a solid support, each replicate DNA molecule incorporating the nucleotide sequence; suspending the pool of DNA molecules in a buffer; taking an aliquot of the buffer; providing the aliquot to the sensor without prior amplification of the DNA molecules; generating the distinguishable signals; and converting the distinguishable signals into the string of binary data, wherein the sensor comprises a pair of spaced apart electrodes and a molecular sensor complex attached to each electrode to form a molecular electronics circuit, wherein the molecular sensor complex comprises a bridge molecule electrically wired to each electrode in the pair of spaced-apart electrodes and a probe molecule conjugated to the bridge molecule.
In various embodiments, methods, apparatus and systems are disclosed that utilize DNA molecules as a general purpose means of digital information storage without amplification of the DNA. In various aspects, the physical DNA does not require amplification in any aspect of the entire information storage system or in any specific subsystems of the information storage system herein. Amplification processes can impose burdens of cost, time, complexity, performance variability, and other limitations to a DNA information storage system. Further, amplification is incompatible with DNA comprising modified bases. Therefore, methods, apparatus, and systems for DNA information storage in accordance with the present disclosure are configured to avoid DNA amplification.
In various embodiments, a DNA data storage system utilizing DNA molecules as a general purpose means of digital information storage is disclosed. In certain aspects, a system for digital information storage comprises a DNA reading device, an information encoder/decoder algorithm, and a DNA writing device. In various aspects, the system further comprises a subsystem for managing physical DNA molecules to support data archival operations. The interrelation of these elements and their co-optimization are disclosed.
In various embodiments, a data reader for a DNA data storage system is disclosed. In various aspects, a DNA reading device comprises a sensor that extracts information from a single DNA molecule, thus not requiring DNA amplification. The sensor may be deployed in a chip-based format. In various examples, data reading systems that support such a chip-based sensor device are disclosed.
As used herein, the term “DNA” may refer not only to a biological DNA molecule, but also to fully synthetic versions, made by various methods of synthetic chemistry, such as nucleotide phosphoramidite chemistry, or by serial ligation of DNA oligomers, and also to forms made with chemical modifications present on the bases, sugar, or backbone, of which many are known to those skilled in nucleic acid biochemistry, including methylated bases, adenylated bases, other epigenetically marked bases, or also including non-standard or universal bases, such as inosine or 3-nitropyrrole, or other nucleotide analogues, or ribobases, or abasic sites, or damaged sites, and also including such DNA analogues as Peptide Nucleic Acids (PNA), Locked Nucleic Acids (LNA), Xeno Nucleic Acids (XNA) (a family of sugar-modified forms of DNA, including Hexitol Nucleic Acid (HNA)), Glycol Nucleic Add (GNA), etc., and also including the biochemically similar RNA molecule along with synthetic RNA and modified forms of RNA. All these biochemically closely related forms are implied by the use of the term DNA, in the context of referring to the data storage molecule used in a DNA storage system, including a template single strand, a single strand with oligomers bound thereon, double-stranded DNA, and double strands with bound groups such as groups to modify various bases. In addition, as used herein, the term DNA may refer to the single-stranded forms of such molecules, as well as double helix or double-stranded forms, including hybrid duplex forms, including forms that containing mismatched or non-standard base pairings, or non-standard helical forms such as triplex forms, as well as molecules that are partially double-stranded, such as a single-stranded DNA bound to an oligonucleotide primer, or a molecule with a hairpin secondary structure. In various embodiments, DNA refers to a molecule comprising a single-stranded DNA component having bound oligonucleotide segments and/or perturbing groups that can act as the substrate for a probe molecule, such as a polymerase, to process, and in doing so, generate distinguishable signals in a monitored electrical parameter of a molecular sensor.
DNA sequences as written herein, such as GATTACA, refer to DNA in the 5′ to 3′ orientation, unless specified otherwise. For example, GATTACA as written herein represents the single-stranded DNA molecule 5′-G-A-T-T-A-C-A-3′. In general, the convention used herein follows the standard convention for written DNA sequences used in the field of molecular biology.
As used herein, the term “oligonucleotide” or “binding oligonucleotide” refers to a short segment of DNA, or analog forms described above, having a length in the range of 3 to 100 bases, or 5 to 40 bases, or 10 to 30 bases, which can hybridize with a complementary sequence contained in a template strand. Such hybridization may be through perfect Watson-Crick base-paring matches, or may involve mismatches or nonstandard base pairings.
As used herein, the term “probe molecule” refers to a molecule electrically wired between two electrodes in a pair of spaced apart electrodes in a molecular sensor, capable of interacting with molecules in the environment around the sensor to provide perturbations in a monitored electrical parameter of the molecular sensor relating to the molecular interactions. A probe molecule herein may comprise a polymerase molecule, or any other processive enzyme such as a helicase or exonuclease. In a molecular sensor herein used as a DNA reading device, a probe molecule may be conjugated to a bridge molecule that is directly wired across two spaced apart electrodes in a pair of electrodes by direct bonds between the bridge molecule and the electrodes.
As used herein, the term “polymerase” refers to an enzyme that catalyzes the formation of a nucleotide chain by incorporating DNA or DNA analogues, or RNA or RNA analogues, against a template DNA or RNA strand. The term polymerase includes, but is not limited to, wild-type and mutant forms of DNA polymerases, such as Klenow, E. coli Pol I, Bst, Taq, Phi29, and T7, wild-type and mutant forms of RNA polymerases, such as T7 and RNA Pol I, and wild-type and mutant reverse transcriptases that operate on an RNA template to produce DNA, such as AMV and MMLV. A polymerase is a choice for a probe molecule in a molecular sensor herein usable as a DNA reader.
As used herein, a “bridge molecule” refers to a molecule bound between two spaced-apart electrodes in a pair of electrodes, to span the electrode gap between the two and complete an electrical circuit of a molecular sensor. In various embodiments, a bridge molecule has roughly the same length as the electrode gap, such as 1 nm to 100 nm, or in some cases, about 10 nm. Bridge molecules for use herein may comprise double-stranded DNA, other analog DNA duplex structures, such as DNA-RNA, DNA-PNA or DNA-LNA or DNA-XNA duplex hybrids, peptides, protein alpha-helix structures, antibodies or antibody Fab domains, graphene nanoribbons or carbon nanotubes, silicon nanowires, or any other of a wide array of molecular wires or conducting molecules known to those skilled in the art of molecular electronics. A bridge molecule herein may be described as having a “first” and “second” end, such as a base at or near the 3′ end and a base at or near the 5′ end of a DNA molecule acting as a bridge molecule. For example, each end may be chemically modified such that the first end of a bridge molecule bonds to a first electrode and the second end of a bridge molecule bonds to a second electrode in a pair of spaced-apart electrodes. This nomenclature aids in visualizing a bridge molecule spanning an electrode gap and bonding to each electrode in a pair of spaced-apart electrodes. In various embodiments, the first and second ends of a bridge molecule may be chemically modified so as to provide for self-assembly between the bridge molecule and a probe molecule such as a polymerase, and/or between the bridge molecule and one or both electrodes in a pair of electrodes. In a non-limiting example, the ends of a bridge molecule are bonded to each of two electrodes in a pair of spaced apart electrodes by thiol (—SH)-gold bonds.
As used herein, the term “sensor molecular complex” or “sensor probe complex” refers to the combination of a probe molecule and a bridge molecule, with the two molecules conjugated together, and the assemblage wired into the sensor circuit, or any combination of more than two molecules that together are wired into the sensor circuit.
As used herein, the term “dNTP” refers to both the standard, naturally occurring nucleoside triphosphates used in biosynthesis of DNA (i.e., dATP, dCTP, dGTP, and dTTP), and natural or synthetic analogues or modified forms of these, including those that carry base modifications, sugar modifications, or phosphate group modifications, such as an alpha-thiol modification or gamma phosphate modifications, or the tetra-, penta-, hexa- or longer phosphate chain forms, or any of the aforementioned with additional groups conjugated to any of the phosphates, such as the beta, gamma or higher order phosphates in the chain. In general, as used herein, “dNTP” refers to any nucleoside triphosphate analogue or modified form that can be incorporated by a polymerase enzyme as it extends a primer, or that would enter the active pocket of such an enzyme and engage transiently as a trial candidate for incorporation.
As used herein, “buffer,” “buffer solution” and “reagent solution” refers to a solution which provides an environment in which a molecular sensor can operate and produce signals from supplied DNA templates. In various embodiments, the solution is an aqueous solution, which may comprise dissolved, suspended or emulsified components such as salts, pH buffers, divalent cations, surfactants, blocking agents, solvents, template primer oligonucleotides, other proteins that complex with the polymerase of the sensor, and also possibly including the polymerase substrates, i.e., dNTPs, analogues or modified forms of dNTPs, and DNA molecule substrates or templates. In non-limiting examples, a buffer is used to hydrate and suspend DNA molecules that have been left in a lyophilized state in a DNA information library, in order to provide the DNA to a DNA reader for decoding of stored information.
As used herein, “binary data” or “digital data” refers to data encoded using the standard binary code, or a base 2 {0,1} alphabet, data encoded using a hexadecimal base 16 alphabet, data encoded using the base 10 {0-9} alphabet, data encoded using ASCII characters, or data encoded using any other discrete alphabet of symbols or characters in a linear encoding fashion.
As used herein, “digital data encoded format” refers to a series of binary digits, or other symbolic digits or characters that come from the primary translation of DNA sequence features used to encode information in DNA, or the equivalent logical string of such classified DNA features. In some embodiments, information to be archived as DNA may be translated into binary data, or may exist initially as binary data, and then this data may be further encoded with error correction and assembly information, into the format that is directly translated into the code provided by the distinguishable DNA sequence features. This latter association is the primary encoding format of the information. Application of the assembly and error correction procedures is a further, secondary level of decoding, back towards recovering the source information.
As used herein, “distinguishable DNA sequence features” means those features of a data-encoding DNA molecule that, when processed by a molecular sensor, such as one comprising a polymerase, produces distinct signals corresponding to the encoded information. Such features may be, for example, different bases, different modified bases or base analogues, different sequences or sequence motifs, or combinations of such to achieve features that produce distinguishable signals when processed by a sensor polymerase.
As used herein, a “DNA sequence motif” refers to either a specific letter (base) sequence, or a pattern, representing any member of a specific set of such letter sequences. For example, the following are sequence motifs that are specific letter sequences: GATTACA, TAC, or C. In contrast, the following are sequence motifs that are patterns: G[A/T]A is a pattern representing the explicit set of sequences {GAA, GTA}, and G[2-5] is a pattern referring to the set of sequences {GG, GGG, GGGG, GGGGG}. The explicit set of sequences is the unambiguous description of the motif, while pattern shorthand notations such as these are common compact ways of describing such sets. Motif sequences such as these may be describing native DNA bases, or may be describing modified bases, in various contexts. In various contexts, the motif sequences may be describing the sequence of a template DNA molecule, and/or may be describing the sequence on the molecule that complements the template.
As used herein, “sequence motifs with distinguishable signals,” in the cases of patterns, means that there is a first motif pattern representing a first set of explicit sequences, and any of said sequences produces the first signal, and there is a second motif pattern representing a second set of explicit sequences, and any of said sequences produces the second signal, and the first signal is distinguishable from the second signal. For example, if motif G[A/T]A and motif G[3-5] produce distinguishable signals, it means that any of the set {GAA, GTA} produces a first signal, and any of the set {GGG, GGGG, GGGGG} produces a second signal that is distinguishable from the first signal.
As used herein, “distinguishable signals” refers to one electrical signal from a sensor being discernably different than another electrical signal from the sensor, either quantitatively (e.g., peak amplitude, signal duration, and the like) or qualitatively (e.g., peak shape, and the like), such that the difference can be leveraged for a particular use. In a non-limiting example, two current peaks versus time from an operating molecular sensor are distinguishable if there is more than about a 1×10−10 Amp difference in their amplitudes. This difference is sufficient to use the two peaks as two distinct binary bit readouts, e.g., a 0 and a 1. In some instances, a first peak may have a positive amplitude, e.g., from about 1×10−10 Amp to about 20×10−10 Amp amplitude, whereas a second peak may have a negative amplitude, e.g., from about 0 Amp to about −5×10−10 Amp amplitude, making these peaks discernably different and usable to encode different binary bits, i.e., 0 or 1.
As used herein, a “data-encoding DNA molecule,” or “DNA data encoding molecule,” refers to a DNA molecule synthesized to encode data within the DNA's molecular structure, which can be retrieved at a later time.
As used herein, “reading data from DNA” refers to any method of measuring distinguishable events, such as electrical signals or other perturbations in a monitored electrical parameter of a circuit, which correspond to molecular features in a synthetic DNA molecule that were built into the synthetic DNA to encode information into the DNA molecule.
As used herein, “electrodes” refer to nano-scale electrical conductors (more simply, “nano-electrodes”), disposed in pairs and spaced apart by a nanoscale-sized electrode gap between the two electrodes in any pair of electrodes. In various embodiments, the term “electrode” may refer to a source, drain or gate. A gate electrode may be capacitively coupled to the gap region between source and drain electrodes, and comprise a “buried gate,” “back gate,” or “side gate.” The electrodes in a pair of spaced-apart electrodes may be referred to specifically (and labeled as such in various drawing figures) as the “source” and “drain” electrodes, “positive” and “negative” electrodes, or “first” and “second” electrodes. Whenever electrodes in any of the drawing figures herein are labeled “positive electrode” and “negative electrode,” it should be understood the polarity indicated may be reversed, (i.e., the labels of these two elements in the drawings can be reversed), unless indicated otherwise, (such as an embodiment where electrons may be flowing to a negative electrode). Nano-scale electrodes in a pair of electrodes are spaced apart by an electrode gap measuring about 1 nm to 100 nm, and each electrode may have other critical dimensions, such as width, height, and length, also in this same nanoscale range. Such nano-electrodes may be composed of a variety of materials that provide conductivity and mechanical stability. They may be comprised of metals, or semiconductors, for example, or of a combination of such materials. Metal electrodes may comprise, for example, titanium, chromium, platinum, or palladium. Pairs of spaced-apart electrodes may be disposed on a substrate by nano-scale lithographic techniques.
As used herein, the term “conjugation” refers to a chemical linkage, (i.e., bond), of any type known in the chemical arts, e.g., covalent, ionic, Van der Waals, etc. The conjugations of a probe molecule, such as a polymerase, to a bridge molecule, such as a double-stranded DNA molecule, or conjugations between a bridge molecule to an electrode or a metal deposit on an electrode, may be accomplished by a diverse array of conjugation methods known to those skilled in the art of conjugation chemistry, such as biotin-avidin couplings, thiol-gold couplings, cysteine-maleimide couplings, gold binding peptides or material binding peptides, click chemistry coupling, Spy-SpyCatcher protein interaction coupling, or antibody-antigen binding (such as the FLAG peptide tag/anti-FLAG antibody system), and the like. Conjugation of a probe molecule to each electrode in a pair of spaced-apart electrodes comprises an “electrical connection” or the “electrical wiring” of the probe molecule into a circuit that includes the probe molecule and the pair of electrodes. In other words, the probe molecule is conjugated to each electrode in a pair of electrodes to provide a conductive pathway between the electrodes that would be otherwise be insulated from one another by the electrode gap separating them. A conductive pathway is provided by electron delocalization/movement through the chemical bonds of the probe molecule, such as through C—C bonds. Conjugation sites engineered into a probe molecule, such as a polymerase, by recombinant methods or methods of synthetic biology, may in various embodiments comprise any one of a cysteine, an aldehyde tag site (e.g., the peptide motif CxPxR), a tetracysteine motif (e.g., the peptide motif CCPGCC), and an unnatural or non-standard amino acid (NSAA) site, such as through the use of an expanded genetic code to introduce a p-acetylphenylalanine, or an unnatural crosslinkable amino acid, such as through the use of RNA- or DNA-protein cross-link using 5-bromouridine, (see Gott, J. M., et al., Biochemistry, 30 (25), 6290-6295 (1991)).
As used herein, the term “amplification” refers to molecular biology methods that make one or more copies of a DNA molecule, and that, when performed on a pool of suitable DNA molecules collectively achieve copying of the pool. Such copying methods include converting a DNA molecule to RNA, or vice-versa. Such methods include all forms of exponential copying, such as PCR, in which the number of copies produced from an initial set of templates grows exponentially with cycle number or time. This includes the many variants or extensions of PCR known to those skilled in molecular biology. These include, for example, isothermal methods and rolling circle methods and methods that rely on nicking or recombinase to create priming sites, or amplification methods such as LAMP, DMA or RPA. Such methods also include linear amplification methods, in which the number of copies produced grows linearly with cycle number or time, such as T7 amplification or using a single primer with thermocycling or with isothermal means of reinitiating polymerase extension of a primer. This includes use of degenerate primers or random primers. Amplification as used herein also explicitly includes the special case of creation of the complementary strand of a single-stranded template, when such complementary strand is also used to represent the stored information in the processes of data storage—for example, in the context of readers that read both strands in the process of recovering the stored information. Such a complementary strand may remain in the double-stranded physical conformation with its complement in the storage DNA molecule, or may exist separated from its complementary strand in the storage system, in either case this constitutes amplification, i.e. copying, of the primary template, for information storage purposes. This is distinguished from the case where a DNA reader creates a complementary strand in the course reading data from a single-stranded template, which is not amplification as the term is used herein—for in this case, such a strand is merely a byproduct of the reading process, and not itself used as an information encoding molecule from which information is potentially extracted. DNA readers that create such byproduct strands include the polymerase molecular electronics readers described herein and illustrated in
Amplification-Free DNA Digital Data Storage:
General aspects of amplification-free DNA data storage methods, apparatus and systems, in accordance with the present disclosure and usable for archiving and later accessing stored data, are disclosed in reference to the various drawing figures:
As indicated in
Each major element of a DNA data storage system in accordance with the present disclosure is detailed herein below, including how each element of the system relates to, or involves, DNA amplification, and how the relevant amplification-free elements can be configured for a DNA data storage system.
In various aspects of the present disclosure, a DNA information storage system comprises: an encoder/decoder; a DNA writing device; and a DNA reading device.
Encoder/Decoder:
In various aspects, the encoder/decoder provides two functions: the encoder portion translates given digital/binary information or data into a specific set of DNA sequence data that are inputs to the DNA writer. Second, the decoder portion translates a given set of DNA sequences of the type provided by the DNA reader back into digital information.
Primary translation from binary to DNA sequence is what, in various embodiments, is performed by binary encoding schemes (BES), such as those exemplified in
Encoding schemes for use herein must have a cognate sensor, such as a polymerase-based molecular sensor, capable of distinguishing the signals of the encoding features, so that the choices of BES are directly related to the properties of the sensor in distinguishing features. Digital data formats or alphabets other than binary, such as hexadecimal, decimal, ASCII, etc., can equally well be encoded into DNA signaling features by similar schemes as the BES of
In an exemplary embodiment, a polymerase-based molecular electronics sensor produced distinguishable signals in a monitored electrical parameter of the sensor when the sensor encountered the distinguishable signaling features of oligonucleotides 5′-CCCC-3′, 5′-GGGG-3′, and 5′-AAAA-3′, when bound to the respective reverse complement template segments F1=5′-GGGG-3′, F2=5′-CCCC-3′, and F3=5′-TTTT-3′, presented in a synthetic DNA molecule provided in a suitable buffer to the sensor. In this embodiment, a binary encoding scheme was used wherein the bit 0 was encoded as GGGG (i.e., F1), the bit 1 was encoded as CCCC, i.e., F2, and the binary string 00 was encoded as TTTT i.e., F3. Note this encoding scheme included the encoding of 00 as TTTT, i.e., encoding as the data string 00 rather than as two consecutive data bits of 0, which would have encoded as GGGGGGGG. This encoding scheme was then used to encode an input binary data payload of “01001” into a nucleotide sequence for incorporation in the synthetic DNA molecule. The conversion to a feature sequence of F1-F2-F3-F1 began by dividing the input data string of binary data 01001 into the segments 0, 1, 00, and 1, and converting these data segments into a DNA data payload segment of the encoded DNA molecule as 5′-GGGGCCCCTTTTGGGG-3′ (SEQ ID NO:3). In other embodiments, there may be “punctuation” sequence segments inserted between the distinguishable signal features, which do not alter the distinguishable features, e.g., bound oligonucleotides, which provide benefits such as accommodating special properties or constraints of the DNA synthesis chemistry, or to provide spacers for added time separation between signal features, or reduced steric hindrance, or to improve the structure of the DNA molecule. For example, if A were such a punctuation sequence, the DNA encoding sequence would become 5′-AGGGGACCCCATTTTAGGGGA-3′ (SEQ ID NO:4). In general, such insertion of punctuation sequences or filler sequences may be part of the process of translating from a digital data payload to the encoding DNA sequence to be synthesized.
In various embodiments, information as binary data such as 010011100010 may be encoded using three states A, B, C, wherein 0 is encoded as A, 1 is encoded as B, and 00 is encoded as C whenever 00 occurs, (i.e., such as not to encode 00 as AA). In accordance with this scheme, the binary word 010011100010 is equivalent to the encoded form ABCBBBCABA.
In general, for converting a binary or other digital data payload string or collection of strings into a DNA sequence string or collection of such strings, many of the methods of lossless and lossy encoding or compression, e.g., those well known in computer science, can be used to devise schemes for the primary conversion of input digital data payloads to DNA sequence data payloads, as strings of distinguishable feature DNA segments, generalizing the examples of
With continued reference to
DNA Writing Device:
In various embodiments, a DNA writing device for use herein takes a given set of input DNA sequence data and produces the DNA molecules having these sequences. For each desired sequence, multiple DNA molecules representing that sequence are produced. The multiplicity of molecules produced can be in the ranges of 10's, 100's, 1000's, millions, billions or trillions of copies of DNA molecules for each desired sequence. All of these copies representing all the desired sequences may be pooled into one master pool of molecules. It is typical of such DNA writing systems that the writing is not perfect, and if N molecules are synthesized to represent a given input sequence, not all of these will actually realize the desired sequence. For example, they may contain erroneous deletions, insertions, or incorrect or physically damaged bases. Such a system will typically rely on some primary means of synthesizing DNA molecules, such as comprising chemical reactions and a fluidic system for executing the processes on a large scale in terms of the number of distinct sequences being synthesized, (see, for example, Kosuri and Church, “Large Scale de novo DNA Synthesis: Technologies and Applications,” Nature Methods, 11: 499-509, 2014). Non-limiting examples of methods and devices for synthesizing DNA molecules include commercial technology offered by Agilent Technologies and Twist BioScience.
In various embodiments, nucleotides can be preferentially selected for incorporation in nucleotide sequences based on their ease of synthesis in the writing process that forms molecules, reduced propensity to form secondary structure in the synthesized molecules, and/or ease in reading during the data decoding process. In various aspects, bad writing motifs and bad reading motifs are avoided in the selection of nucleotides for incorporation into nucleotide sequences, with a focus on incorporating segments in the nucleotide sequence that will produce mutually distinguishable signals when that nucleotide sequence is read to decode the encoded information. For example, in reading a nucleotide sequence, A and T are mutually distinguishable, C and G are mutually distinguishable, A, C and G are mutually distinguishable, AAA and TT are mutually distinguishable, A, GG and ATA are mutually distinguishable, and C, G, AAA, TTTT, and GTGTG are mutually distinguishable. These and many other sets of nucleotide and nucleotide segments provide mutually distinguishable signals in a reader, and thus can be considered for incorporation in a nucleotide sequence when encoding a set of information into a nucleotide sequence.
Additionally, there are nucleotide segments that are difficult to write, and thus should be avoided when encoding a set of information into a nucleotide sequence. In various embodiments, encoding of a set of information into a nucleotide sequence comprises the use of one of the remaining distinguishable feature sets as the encoding symbols, such as may correspond to binary 0/1, trinary 0/1/2 or quaternary 0/1/2/3 code, etc., along with an error correcting encoding to define the set of information in a way that avoids the hard to read and hard to write features. In this way, overall performance of an information storage system is improved.
DNA Reading Device:
In various aspects, the DNA reading device used herein is a device that takes a pool of DNA molecules and produces a set of measured signals for each of the molecules sampled or selected from this pool. Such signals are then translated into a DNA sequence, or otherwise used to characterize the base patterns or motifs present in the DNA molecules. Current methods for reading data stored in DNA may rely on commercial next-generation DNA sequencers for the primary recovery of sequences from DNA samples. Such readers actually survey only a small portion of the DNA molecules introduced into the system, so that only a small fraction will undergo an actual read attempt. Thus, amplification prior to DNA reading is common, given that most input DNA molecules are never analyzed, and are simply wasted. Furthermore, many methods of DNA sequencing have an amplification step as a fundamental part of the process that amplifies DNA onto a surface or a bead in preparation for sequencing, such as exemplified by the commercial Illumina HiSeq System (Illumina, Inc.), the 454 System (Roche, Inc.), or the Ion Torrent System (Thermo Fisher, Inc.), or as is done in classical Sanger sequencing, such as using a thermocycling terminator sequencing reaction to produce sufficient input material required to meet the limits of the detection process. Further, there may be amplification steps performed in nanopore sequencing. Other methods may use amplification to add tags for sequencing.
DNA sequencing methods may also separately rely on one or more rounds of amplification procedures during the sample preparation phase. Such methods have been used for the addition of adapter DNA segments to support subsequent processing. Also, some sequencing methods at least require that a single-stranded template have its complementary strand present for sequencing, such as the “Circular Consensus Sequencing” of Pacific BioSciences, Inc., or the “2D” hairpin sequencing of Oxford Nanopore Technologic, Inc. Such a method, if presented with a single-stranded template as input, requires a process with at least one round of extension, a form of amplification, to create the complementary strand before actual sequencing can begin. In addition, methods that require a relatively large amount of input template for the primary sequencing process, such as nanopore sequencing with inefficient pore loading, also may require amplification of input DNA to achieve the require input amount lower limits.
DNA Library Management:
In various embodiments, DNA library management comprises a collection of operational procedures and related methods and apparatus carried out on physical DNA. Some such procedures relate to the mechanics of physical storage and retrieval of the DNA, such as drying down DNA from solution, re-suspending dried DNA into solution, and the transfer and storage of the physical quantities of DNA, such as into and out from freezers. Other procedures relate to the information storage management, such as making copies of data, deleting data, and selecting subsets of the data, all of which entail physical operations on the DNA material. Copying DNA or selecting DNA from a pool are generally performed using PCR amplification or linear amplification methods, thus common methods for library management may rely on amplification of DNA. For example, for a library prepared with PCR primer sites in place, an entire archive can be copied by taking a representative sample of the DNA and then PCR amplifying this up to the requisite amount for a copy. For further example, for a library prepared with volume-specific PCR primers, a volume from the library can be selected by using PCR primers to amplify up just the desired volume from a small DNA sample representing the entire library.
Motivations for Amplification in DNA Digital Data Storage:
DNA information storage systems envisioned from the above elements (DNA writer, DNA reader) may not work effectively at a specific single molecule level, and this motivates the use of amplification. That is, if there were a target DNA data storage sequence, such as GATTACA, it would not be feasible to make only a single physical molecule representing that sequence, then archive and handle the single molecule, and then read data from that single molecule. The infeasibility is due to the many sources of molecular structure error, loss of molecules, and limits of signal detection that exist in many such component processes. Thus, it is frequently proposed that there be amplification of the single molecule, at various stages in the process, to provide many more trial molecules to engage in all these processes. Thus, the goal achieved herein is to provide specific processes that individually or collectively remove the need for amplification steps in the various processes that comprise the DNA data storage system.
Benefits of Amplification-Free Methods and Apparatus in DNA Data Storage:
There are many benefits to amplification-free processes in a DNA information storage system. In general, amplification of DNA in the context of an information storage system will add cost, time, and operational complexity to the system, directly from the demands of the procedure. Amplification also typically amplifies some sequences more than others, and thus it may introduce representational bias into the data in storage system that could result in loss of information or inaccurate information, or an increase in the time or cost to recover information. Amplification can also produce errors in the DNA sequences, as the enzymes involved can make errors during the copying process, or can create chimeric molecules that contain sequence parts of different template DNAs, or partial molecules that are not complete copies. Thus amplification can produce errors in the data, or spurious “noise” molecules. DNA amplification can also lead to contamination, as the large quantities of DNA generated during amplification can contaminate other non-amplified samples and result in a substantial fraction of the total DNA content in such samples coming from the source of contamination. Thus, amplification could produce a “corruption” of stored data. Amplification methods also typically require one, two, or more flanking primer sequences at the ends of the DNA molecules to support the priming and enzymatic extension processes used to achieve amplification. Such primer sequences, which are typically in the range of 6 to 30 bases in length, must be synthesized into the DNA molecules, and thus this increases the cost, complexity and potential for errors in the DNA writing processes.
Amplification also generally cannot reproduce DNA modifications, of which there are a great many known in nucleic acid chemistry and which are used in the methods described herein. Thus, the use of amplification at any point in the DNA data storage system greatly limits the ability to use this great diversity of modified DNA, which could otherwise be used to improve the performance of a DNA data storage system. It is thus a benefit of amplification-free systems that such systems enable the use of such modified forms of DNA to be used as the information storage molecule. For example, modified DNA may comprise substituent groups on the DNA bases that increase signal to noise in a sensor when the DNA is read by the sensor, thereby greatly improving the power of the reading system. DNA modifications can be used to enhance the writing process, the stability of the resulting molecules, or to enhance the ability to manipulate and read data from the molecules. Use of modified DNA can provide data security or encryption, by having detectable modifications that are known only to trusted parties, or that only special reading systems could read. There are many types of modifications known to those skilled in such chemistry, which could potentially be used to enhance the capabilities of DNA data storage system, such as modified bases, modifications to the DNA backbone, such as in Peptide Nucleic Acids (PNAs), or thiol-phosphate or iodo-phosphate modifications of the backbone, or other DNA analogs such as Locked Nucleic Acids (LNAs), or diverse Xeno Nucleic Acids (XNAs), or modifications to the sugar ring, or methylated bases, or labeled bases, or the addition of other chemical groups at various sites of the DNA molecule, such as biotin or other conjugation or binding groups, or groups that create stronger signals for the reader.
A DNA digital data storage system in accordance with the present disclosure benefits from having lower operational costs by being entirely amplification-free or by comprising at least one amplification-free subsystem. It is a further benefit of the present system that the time it takes to store and/or recover information may be reduced. It is a further benefit that the system may have lower complexity, and consequently lower total ownership costs, lower risks of failure, or greater mean time between failures. It is a further benefit that the representation biases inherent in amplification processes are avoided, so that in the writing, extracting or reading DNA, the diverse sequences involved get more equal representation in these respective processes and in the overall system for storing and retrieving information. It is a further benefit to avoid the forms of introduced error or data corruption that are contributed by amplification. It is a further benefit to avoid the need to synthesize amplification primer sequences into the DNA molecules. It is a further benefit that potential contamination of other DNA data storage samples by amplification products is eliminated, thereby increasing system integrity, robustness, efficiency and security. It is a further benefit that amplification-free DNA information systems or subsystems therein remain compatible with the use of modified DNA, which may comprise modifications to the bases, sugars or backbone of the DNA, and which may provide for more effective reading systems (e.g., enhanced sensor signals) or more effective writing systems (e.g., more efficient synthesis chemistry), or which may provide more options for encoding information into DNA.
Methods of Avoiding Amplification in Writing:
In various embodiments, synthetic methods for writing DNA are provided that co-synthesize many instances of physical DNA molecules for each desired sequence.
Methods of Avoiding Amplification in Reading:
When reading DNA using present sequencing technologies, there is often a requirement to amplify the input DNA. In some methods, this occurs because the method requires larger input amounts, and therefore requires a grossly larger quantity of DNA than would be typically available from various sample sources. In other methods, the creation of the sequencing library includes amplification steps. In yet other methods, commonly known as clonal sequencing methods, many copies of the molecule to be sequenced must be produced directly and localized on a support as an integral part of the sequencing process, such as the DNA clusters used in the Solexa/HiSeq instruments (Illumina, Inc.), the DNA “SNAP or ISP” beads used in the Ion Torrent instruments (Thermo Fisher, Inc.), or the DNA beads required by the ABI SOLiD instruments (Life Technologies, Inc.), or the DNA beads used in the 454 instruments (Roche, Inc.). Such amplification requirements can be eliminated by using a suitable single-molecule sequencing method. In such methods, a single DNA molecule is analyzed by a sensor to produce the fundamental sequence read or data extraction from the molecule. Such methods in principle do not require amplifying the DNA molecules prior to analysis by the sensor. Thus, utilizing a suitable single molecule DNA reading sensor provides the means to achieve amplification-free reading. Such single molecule methods of DNA sequencing are illustrated in
Also of note are methods that use a carbon nanotube having a polymerase attached thereon to produce electrical signals, as in the sensor illustrated in
An embodiment for a single molecule DNA reader not requiring any amplification of the DNA to be read is a molecular electronics sensor deployed on a CMOS sensor pixel array chip, as illustrated in
Methods of Avoiding Amplification in DNA Archive Management:
The major operations a DNA data storage archive management system may comprise are considered below.
1. DNA Storage Archive Operations
For a given archive, it may be desirable to perform the following operations:
In various embodiments, a DNA archive in accordance with the present disclosure exists in its primary physical state as a pool (i.e., a mixture) of DNA molecules, with each desired DNA sequence represented by a number of molecular exemplars. This pool of DNA molecules could be stored in a dry state, or in solution phase. In any case, the archive can be temporarily brought up to working temperatures in a compatible buffer solution to perform these operations. These operations would be performed efficiently by the physical storage system, which may include automation for handling of tubes, liquid handling, performing biochemical reactions, and the other procedures related to maintaining and manipulating the physical archive material.
These storage-related operations can be achieved without amplification, in contrast to doing these operations as they would commonly be done with amplification, as follows:
2. Copying
Copying an archive may be performed without any amplification by simply taking an aliquot of the stock solution. This provides a functional copy as long as a sufficient amount is taken to support future retrieval of information, and also to perhaps support limited numbers of further archive operations, such as further copying. For contrast, copying would more commonly be done via amplification, such as by including in the encoding DNA molecules amplification primer sites, and thus a small sample from the stock can be taken, followed by priming and amplifying, in linear or exponential amplification reactions, to obtain a substantial amount of material representing a functional copy of the original archive. Thus, a beneficial way to avoid such amplification processes for copying an archive is provided.
Copying of an archive may also be performed without amplification by using an amplification-free DNA reading system to read all the information from the archive, and then using an amplification-free DNA writing system to write all of the information into a new DNA archive, thereby achieving a DNA data copy of the original DNA data archive.
3. Appending
Appending data to the archive or merging archives can be achieved simply by pooling in and mixing with the additional DNA or archive material. This does not require amplification.
4. Targeted Reading
Working with individual “volumes” within an archive can be performed in an amplification-free manner by encoding into the DNA molecules sequence-specific oligonucleotide binding sites, with a different identifier/binding sequence for each volume to be made so accessible. Then, to readout a specific volume, hybridization-based capture could be used to select out specific DNA fragments with desired binding sequences. This process can be amplification free. Volume identifiers could also be added by synthesizing DNA with nucleotide modifications, so the relevant binding targets are not via DNA-sequence specific hybridization per se, but in other modifications on the bases used in the synthesis. For example, use of biotinylated bases, or bases with various hapten modifications, PNAs, non-classical DNA bases, or segments that carry epitope targets for antibody binding, or the use of PNA primer sites for improved binding affinity, all similarly provide selective ability to bind or manipulate subsets of the DNA via the corresponding interaction partners for these modifications intentionally introduced in the synthesis. These amplification-free targeting methods are all in contrast to targeted reading that relies on PCR-like processes to amplify out the target volume of interest.
Another embodiment of amplification-free targeted reading is the process whereby the archive, or a representative sample of the archive, is first presented to an amplification-free reader, which obtains reads sampling from the information content of the entire archive. Presuming the reads have a volume identifier in them, the read data from the desired volume are selected informatically from all such read data, thereby achieving the targeted reading through informatics selection. Another such embodiment relies on a reader that can in real-time read the volume identifier on a fragment, and either halts, or rejects and acquires another DNA fragment, if the identifier is not in the target volume, but otherwise completes the read if it detects the target volume identifier, achieving targeted reading. This is a dynamic informatics selection, which has the benefit of reading less unneeded information in the course of reading the targeted volume. Readers that can provide this capability include the molecular electronics sensor described in detail below, as well as certain embodiments of nanopore sequencing sensors.
5. Searching
Search of an archive for a literal input string can be achieved by encoding the search string or strings of interest into DNA form, synthesizing a complementary form or related primers for the desired DNA sequences and using hybridization extract from the archive of these desired sequence fragments. The hits can be identified by quantifying the amount of DNA recovered, or by using the DNA data reader to survey the recovered material. The search could report either presence or absence, or could recover the associated fragments containing the search string for complete reading. In contrast, searching methods that rely on PCR-like processes to amplify out the search target or to capture such targets and then amplify the results are to be avoided.
Embodiments of Amplification-Free Reading:
In various embodiments, amplification-free DNA reading herein comprises an all-electronic measurement of a single DNA molecule as it is processed by a polymerase or other probe molecule integrated into an electrical circuit that monitors an electrical parameter of the sensor circuit, such as the current. In an embodiment for DNA data storage reading, these sensors are deployed on a CMOS sensor array chip, with a large pixel array that provides the current measurement circuitry. Such a sensor chip may have millions of sensors, each processing successive DNA molecules, so that the required amount of input DNA may be as low as millions of total molecules. This provides for highly scalable, fast reading of DNA molecules without the need to pre-amplify the DNA, or amplify a specific target DNA molecule it as part of the reading process.
In various embodiments of the DNA information storage system herein, the DNA reading device comprises a massively parallel DNA sequencing device, which is capable of high speed reading of bases from each specific DNA molecule such that the overall rate of reading stored DNA information can be fast enough, and at high enough volume, for practical use in large scale archival information retrieval. The rate of reading bases sets a minimum time on data retrieval, related to the length of stored DNA molecules.
In various embodiments of a molecular electronics sensor for use herein, the polymerase may be a native or mutant form of Klenow, Taq, Bst, Phi29 or T7, or may be a reverse transcriptase. In various embodiments, the mutated polymerase forms will enable site specific conjugation of the polymerase to the bridge molecule, arm molecule or electrodes, through introduction of specific conjugation sites in the polymerase. Such conjugation sites engineered into the protein by recombinant methods or methods of synthetic biology may, in various embodiments, comprise a cysteine, an aldehyde tag site (e.g., the peptide motif CxPxR), a tetracysteine motif (e.g., the peptide motif CCPGCC), or an unnatural or non-standard amino acid (NSAA) site, such as through the use of an expanded genetic code to introduce a p-acetylphenylalanine, or an unnatural cross-linkable amino acid, such as through use of RNA- or DNA-protein cross-link using 5-bromouridine.
In various embodiments, the bridge molecule may comprise double-stranded DNA, other DNA duplex structures, such as DNA-PNA or DNA-LNA or DNA-RNA duplex hybrids, peptides, protein alpha-helix structures, antibodies or antibody Fab domains, graphene nanoribbons or carbon nanotubes, or any other of a wide array of molecular wires or conducting molecules known to those skilled in the art of molecular electronics. The conjugations of polymerase to such molecules, or of such molecules to the electrodes, may be by a diverse array of conjugation methods known to those skilled in the art of conjugation chemistry, such as biotin-avidin couplings, thiol-gold couplings, cysteine-maleimide couplings, gold or material binding peptides, click chemistry coupling, Spy-SpyCatcher protein interaction coupling, antibody-antigen binding (such as the FLAG peptide tag/anti-FLAG antibody system), and the like. Coupling to electrodes may be through material binding peptides, or through the use of a SAM (Self-Assembling-Monolayer) or other surface derivatization on the electrode surface to present suitable functional groups for conjugation, such as azide or amine groups. The electrodes comprise electrically conducting structures, which may comprise any metal, such as gold, silver, platinum, palladium, aluminum, chromium, or titanium, layers of such metals in any combination, such as gold on chromium, or semiconductors, such as doped silicon, or in other embodiments, a contact point of a first material on a support comprising a second material, such that the contact point is a site that directs chemical self-assembly of the molecular complex to the electrode.
In various embodiments, electrical parameters measured in a sensor, such as the sensor illustrated in
In various embodiments, the measured parameter in a molecular electronics sensor, such as the sensor of
The use of a sensor, such as the sensor illustrated in
The use of a sensor such as the sensor of
In various aspects of DNA reading herein, if a system reads a DNA molecule at a speed of 1 base per 10 minutes, as is representative of current next generation, optical dye-labeled terminator sequencers, then reading a 300 base DNA molecule takes at least 3,000 minutes (50 hours), aside from any time required to prepare the sample for reading. Such relatively slower systems therefore favor storing information in a larger number of shorter reads, such as 30 base reads that could be read in 5 hours. However, this requires a larger number of total reads, so the system must support billions or more such reads, as it the case on such sequencers. The current generation of optical massively parallel sequencers, read on the order of 3 billion letters of DNA per 6-minute cycle, or roughly the equivalent of 1 billion bits per minute, or 2 MB per second, although for data stored as 100 base DNA words, this would also require 600 minutes (5 hours). This can be seen to be a relatively low rate of data reading, although within a practical realm, as atypical book may contain 1 MB of textual data. The overall rate is practical, but the slow per base time makes this highly inefficient for reading a single book of data, and ideally matched to bulk reading of 36,000 books in parallel, over 5 hours. Thus, there is also a lack of scalability in this current capability, and also a high capital cost of the reading device (optical DNA sequencers cost in the $100,000 to $1,000,000 range presently). More critically, on such current systems, the cost of sequencing a human genome worth of DNA, 100 billion bases, is roughly $1,000, which means the cost of reading information is $1,000 per 200 Giga-bits, or $40 per GB. This is radically higher than the cost of reading information from magnetic tape storage or CDs, which is on the order of $1 per 10,000 GB, or $0.0001 per GB, 400,000 fold less costly. Thus the cost of reading DNA should be reduced by several orders of magnitude, even by 1,000,000 fold, to make this attractive for large scale, long term archival storage, not considering other advantages. Such improvements may indeed be possible, as evidenced by the million-fold reduction in costs of sequencing that has already occurred since the first commercial sequencers were produced.
In various embodiment, the DNA reader of the present system comprises substantially lower instrument capital costs, and higher per-base reading speed, and greater scalability in total number of reads per run, compared to currently available optical next generation sequencing instruments. In various aspects, the reading device for use herein is based on a CMOS chip sensor array device in order to increase the speed and scalability and decrease the capital costs. An embodiment of such a device comprises a CMOS sensor array device, wherein each sensor pixel contains a molecular electronic sensor capable of reading a single molecule of DNA without any molecular amplification or copying, such as PCR, required. In various embodiments, the CMOS chip comprises a scalable pixel array, with each pixel containing a molecular electronic sensor, and such a sensor comprising a bridge molecule and polymerase enzyme, configured so as to produce sequence-related modulations of the electrical current (or related electrical parameters such as voltage, conductance, etc.) as the enzyme processes the DNA template molecule.
An exemplary molecular sensor and chip combination usable as a DNA reader device in the present DNA data storage system is depicted in
As illustrated in
The use of the sensors of the present disclosure to measure distinguishable features of a DNA molecule requires the polymerase be provided with primed, single-stranded template DNA molecules as a substrate for polymerization of a complementary strand, in the course of generating the associated signals. In the context of encoding information in synthetic DNA molecules, these template molecules may be wholly chemically synthetic, and can therefore be provided with chemical or structural modifications or properties beyond those of native DNA, which may be used to enable or enhance the production of distinguishable signals for various embodiments. The polymerase, native or an engineered mutant, can accept as a substrate a great many such modified or analogue forms of DNA, many of which are well known to those skilled in the field of molecular biology. The use of such modifications to the template DNA can be used to create features with distinguishable signals.
In various embodiments, the DNA supplied to the polymerase as a template comprises some form of primed (double-stranded/single-stranded transition) site to act as an initiation site for the polymerase. For the purpose of storing digital data in DNA, in various embodiments, this priming will be pre-assembled into the encoding molecule, so that no further sample preparation is needed to prime the DNA template molecules.
Since the secondary structure in a DNA template can interfere with the processive action of a polymerase, it may be advantageous to reduce, avoid or eliminate secondary structure in the DNA data encoding template molecules used in DNA data reader sensors. Many methods to reduce secondary structure interference are known to those skilled in the field of molecular biology. Methods to reduce, avoid or eliminate secondary structure include, but are not limited to: using polymerases that possess strong secondary structure displacing capabilities, such as Phi29 or Bst or T7, either native or mutant forms of these; adding to the buffer solvents such as betaine, DMSO, ethylene glycol or 1,2-propanediol; decreasing the salt concentration of the buffer; increasing the temperature of the solution; and adding single strand binding protein or degenerate binding oligonucleotides to hybridize along the single strand. Methods such as these can have the beneficial effect of reducing secondary structure interference with the polymerase processing the encoding DNA and producing proper signals.
Additional methods available to reduce unwanted secondary structure for DNA data reading in accordance with the present disclosure comprise adding properties to DNA molecules produced by synthetic chemistry. For example, in some embodiments of the present disclosure, the data encoding the DNA molecule itself can be synthesized from base analogues that reduce secondary structure, such as using deaza-G (7-deaza-2′-deoxyguanosine) in place of G, which weakens G/C base pairing, or by using a locked nucleic acid (LNA) in the strand, which stiffens the backbone to reduce secondary structure. A variety of such analogues with such effects are known to those skilled in the field of nucleic acid chemistry.
Further methods are available in the present disclosure to reduce unwanted secondary structure for the DNA data reading sensor, because the DNA data encoding scheme determines the template sequence, and thus there is potential to choose the encoding scheme to avoid sequences prone to secondary structure. Such Secondary Structure Avoiding (“SSA”) encoding schemes are therefore a beneficial aspect of the present disclosure. In general, for encoding schemes as described herein, which use distinguishable signal sequence features as the encoding elements, to the extent there are options in the choice of encoding schemes, all such alternative schemes could be considered, and the schemes that produce less (or the least) secondary structure would be favored for use. The alternative schemes are assessed relative to a specific digital data payload, or statistically across a representative population of such data payloads to be encoded.
For example, the importance of SSA encoding is illustrated in the embodiment where the sensor provides three distinguishable signal sequence features: AAAAA, TTTTT, and CCCCC. If all three features are used in encoding in the same strand (or on other strands), there is a strong potential for the AAAAA and TTTTT encoding elements, being complementary, to hybridize and lead to secondary structure, either within the strand or between DNA strands. Thus, if the data were instead encoded entirely by the scheme where the bit 0 encodes to AAAAA and the bit 1 encodes to CCCCC, (i.e., ignoring the use of TTTTT completely), all potential secondary structure is avoided. Thus, this encoding (or the other SSA choice, the bit 0 encoding to TTTTT and the bit 1 encoding to CCCCC) is preferred over a scheme that uses self-complementary sequences, even though information density is reduced by giving up one of the three available encoding elements. Thus, in general, SSA codes can be used when there are encoding options and when there is a potential for DNA secondary structure to form. As shown in this example, desirable SSA codes to reduce DNA secondary structure may be less information dense than what is theoretically possible for the distinguishable signal states. However, this tradeoff can result in a net gain of information density, or related overall cost or speed improvements, by avoiding data loss related to DNA secondary structure.
In various embodiments, methods for reducing secondary structure comprises the use of binding oligonucleotides to protect the single strand, wherein the oligonucleotides are chosen with sequence or sequence composition that will preferentially bind to the encoding features. Such binding oligonucleotides may more effectively protect the single strand and general degenerate oligonucleotides. For example, in the case described above with three distinguishable signal sequence features AAAAA, TTTTT, and CCCCC, all three could be used as encoding features, and they could be protected in single-stranded form by binding the template to the oligonucleotides TTTTT, AAAAA, and GGGGG, or to enhanced binding analogues of these, such as RNA, LNA or PNA forms, instead of DNA. Thus, use of binding oligonucleotides that preferentially bind to the encoding features is another means to mitigate unwanted secondary structure effects, although such binding oligonucleotides must be used with strand-displacing polymerases, such as native or mutant forms of Klenow, Bst or Phi29, such that the oligonucleotides themselves do not interfere. A further method for avoiding secondary structure is to prepare the information encoding DNA in primarily double-stranded form, with a nick or gap at the primer site for polymerase initiation, and the rest of the molecule in duplex form (with or without a hairpin bend) so that the DNA molecule exists in solution in a substantially duplex form, free of secondary structure due to single-strand interactions, within or between molecules.
In various embodiments, DNA molecules used to encode information for reading by the cognate molecular sensor can be prepared with architecture facilitating the reading process as well as the encoding and decoding processes. Various embodiments of DNA architecture are illustrated in
With continued reference to
With further reference to
With continued reference to
In various aspects of the present disclosure, a DNA data payload of interest is processed by a polymerase sensor multiple times to provide a more robust recovery of digital data from DNA storage. In other aspects, a collection of such payloads on average are processed some expected number of multiple times. These examples benefit from a more accurate estimation of the encoding distinguishable features by aggregating the multiple observations. Multiple processing also has the benefit of overcoming fundamental Poisson sampling statistical variability to ensure that, with high confidence, a data payload of interest is sampled and observed at least once, or at least some desirable minimal number of times.
In various embodiments, the number of such repeat interrogations is in the range of 1 to about 1000 times, or in the range of about 10 to 100 times. Such multiple observations may comprise: (i) observations of the same physical DNA molecule by the polymerase sensor, and/or (ii) one or more polymerase sensors processing multiple, physically distinct DNA molecules that carry the same data payload. In the latter case, such multiple, physically distinct DNA molecules with the same data payload may be the DNA molecules produced by the same bulk synthesis reaction, the molecules obtained from distinct synthesis reactions targeting the same data payload, or replicate molecules produced by applying amplification or replication methods such as PCR, T7 amplification, rolling circle amplification, or other forms of replication known to those skilled in molecular biology. The aggregation of such multiple observations may be done through many methods, such as averaging or voting, maximum likelihood estimation, Bayesian estimation, hidden Markov methods, graph theoretic or optimization methods, or deep learning neural network methods.
In various embodiments of the present disclosure, digital data stored in DNA is read at a high rate, such as approaching 1 Gigabyte per second for the recovery of digital data, as is possible with large scale magnetic tape storage systems. Because the maximum processing speed of a polymerase enzyme is in the range of 100-1000 bases per second, depending on the type, the bit recovery rate of a polymerase-based sensor is limited to a comparable speed. Thus, in various embodiments millions of sensors are deployed in a cost effective format to achieve the desired data reading capacity.
In various embodiments, many individual molecular sensors are deployed in a large scale sensor array on a CMOS sensor pixel array chip, which is the most cost-effective, semiconductor chip manufacturing process.
In various embodiments of a DNA reader device, use of a CMOS chip device in conjunction with nano-scale manufacturing technologies, ultimately yield a much low cost, high throughput, fast, and scalable system. For example, sensors such as this can process DNA templates at the rate of 10 or more bases per second, 100 or more times faster than current optical sequencers. The use of CMOS chip technology ensures scalability and low system cost in a mass-producible format that leverages the enormous infrastructure of the semiconductor industry. As noted, whatever error modes or accuracy limitations may exist in a DNA sensor, or that may arise at faster reading speed (e.g., by modifying the enzyme or buffer or temperature or environmental factors, or sample data at lower time resolution), can be compensated for in the overall encoder/decoder-reader-writer framework described.
In various embodiments of the present disclosure, a DNA reader chip for use herein comprises at least 1 million sensors, at least 10 million sensors, at least 100 million sensors, or at least 1 billion sensors. Recognizing that at a typical sensor data sampling rate of 10 kHz, and recording 1 byte per measurement, a 100 million sensor chip produces raw signal data at a rate of 1 Terabyte (TB) per second. In considering how many sensors are desirable on a single chip, one critical consideration is the rate at which such a chip can decode digital data stored in DNA compared to the desirable digital data reading rates. It is, for example, desirable to have digital data read out at a rate of up to about 1 Gigabyte per second. Note that each bit of digital data encoded as DNA will require multiple signal measurements to recover, given that a feature of the signal use used to store this information, so this raw signal data production rate for the measured signal will be much higher that the recovery rate of encoded digital data. For example, if 10 signal measurements are required to recover 1 bit of stored digital data, and each measurement is an 8-bit byte, that is a factor of 80 bits of signal data to recover 1 bit of stored digital data. Thus, digital data reading rates are anticipated to be on the order of 100 times slower than the sensor raw signal data acquisition rate. For this reason, achieving desirable digital data reading rate of 1 Gigabyte/sec would require nearly 0.1 TB/sec of usable raw signal data. Further, given that not all the sensors in a single chip may be producing usable data, the need for chips that produce up to 1 TB/sec of raw data is desirable, based on the desired ultimate digital data recover rates from data stored as DNA. In various embodiments, such recovery rates correspond to a 100 million sensor pixel chip.
In various embodiments of the present disclosure, multiple chips are deployed within a reader system to achieve desired system-level digital data reading rates. The DNA data reader chip of
The features of an embodiment of a complete system are illustrated in
In some embodiments, chips within the reader system may be disposable, and replaced after a certain duty cycle, such as 24 hours to 48 hours. In other embodiments, the chips may be reconditioned in place after such a usage period, whereby the molecular complex, and possibly conjugating groups, are removed, and then replaced with new such components through a serious of chemical solution exposures. The removal process may comprise using voltages applied to the electrodes to drive removal, such as an elevated violated applied to the electrodes, or an alternating voltage applied to the electrodes, or a voltage sweep. The process may also comprise the use chemicals that denature, dissolve or dissociate or otherwise eliminate such groups, such as high molarity urea, or guanidine or other chaotropic salts, proteases such as Proteinase K, acids such as HCl, bases such as KOH or NaOH, or other agents well known in molecular biology and biochemistry for such purposes. This process may also include the use of applied temperature or light to drive the removal, such as elevated temperature or light in conjunction with photo-cleavable groups in the molecular complex or conjugation groups.
In various embodiments, a molecular electronics sensor comprises the configuration illustrated in
In various embodiments, a molecular electronic sensor for reading DNA comprises a carbon nanotube. As illustrated in
An alternative sensor that produces optical signals is a Zero Mode Waveguide sensor, such as the sensor illustrated in
In various embodiments, an amplification-free DNA archival storage system comprises: (i) an amplification-free subsystem for writing DNA data molecules; (ii) an amplification-free subsystem for managing the DNA data molecules; and (iii) an amplification-free subsystem for reading DNA data molecules.
In various embodiments, a reduced-amplification DNA archival storage system comprises an amplification-free subsystem for writing DNA data molecules.
In various embodiments, a reduced-amplification DNA archival storage system comprises an amplification-free subsystem for managing the DNA data molecules.
In various embodiments, a reduced-amplification DNA archival storage system comprises an amplification-free subsystem for reading the DNA data molecules.
In various embodiments, the amplification-free subsystem for writing DNA data molecules comprises the use of isolated, localized, phosphoramidite synthesis reactions.
In various embodiments, the amplification-free subsystem for managing DNA data molecules comprises the taking of aliquots for copying without amplification.
In various embodiments, the amplification-free subsystem for managing DNA data molecules comprises the use of hybridization for selecting volumes or searching for data, without amplification.
In various embodiments, the amplification-free subsystem for reading DNA data molecules comprises the use of a single molecule DNA sequencing system.
In various embodiments, the amplification-free subsystem for reading DNA data molecules comprises the use of a molecular electronic sensor that performs single molecule analysis of DNA.
In various embodiments, the amplification-free subsystem for reading DNA data molecules comprises the use of a molecular electronic sensor that comprises a polymerase, and performs single molecule analysis of DNA.
In various embodiments, the amplification-free subsystem for reading DNA data molecules comprises the use of a plurality of molecular electronic sensors deployed as a sensor array on a CMOS sensor pixel chip.
In various embodiments, a cloud based DNA data storage information system comprises any of the above amplification-free or reduced amplification subsystems.
In various embodiments, a method for retrieving data in an amplification-free DNA data storage and retrieval system comprises: a. obtaining a sample from the DNA molecular storage archive; and, b. reading the DNA data from the sample with an amplification free reader.
In various embodiments, a method of amplification-free DNA data storage comprises: a. writing DNA data with an amplification-free method; b. manipulating the archive with amplification free methods; and c. reading the DNA data with an amplification-free DNA reader.
In various embodiments, these methods above are performed using cloud-based systems.
In various embodiments, an apparatus for retrieving data in an amplification-free DNA data storage system comprises an amplification-free DNA reader device for reading the data encoded in a DNA molecule.
In various embodiments, an apparatus for amplification-free DNA data storage comprises: a. apparatus for writing DNA data with an amplification-free method; b. apparatus for manipulating the archive with amplification free methods; and c. apparatus for reading the DNA data with an amplification free reader.
Amplification-free DNA information storage methods, apparatus and systems are provided. References to “various embodiments”, “one embodiment”, “an embodiment”, “an example embodiment”, etc., indicate that the embodiment described may include a particular feature, structure, or characteristic, but every embodiment may not necessarily include the particular feature, structure, or characteristic. Moreover, such phrases are not necessarily referring to the same embodiment. Further, when a particular feature, structure, or characteristic is described in connection with an embodiment, it is submitted that it is within the knowledge of one skilled in the art to affect such feature, structure, or characteristic in connection with other embodiments whether or not explicitly described. After reading the description, it will be apparent to one skilled in the relevant art(s) how to implement the disclosure in alternative embodiments.
Benefits, other advantages, and solutions to problems have been described with regard to specific embodiments. However, the benefits, advantages, solutions to problems, and any elements that may cause any benefit, advantage, or solution to occur or become more pronounced are not to be construed as critical, required, or essential features or elements of the disclosure. The scope of the disclosure is accordingly to be limited by nothing other than the appended claims, in which reference to an element in the singular is not intended to mean “one and only one” unless explicitly so stated, but rather “one or more.” Moreover, where a phrase similar to ‘at least one of A, B, and C’ or ‘at least one of A, B, or C’ is used in the claims or specification, it is intended that the phrase be interpreted to mean that A alone may be present in an embodiment, B alone may be present in an embodiment, C alone may be present in an embodiment, or that any combination of the elements A, B and C may be present in a single embodiment; for example, A and B, A and C, B and C, or A and B and C.
All structural, chemical, and functional equivalents to the elements of the above-described various embodiments that are known to those of ordinary skill in the art are expressly incorporated herein by reference and are intended to be encompassed by the present claims. Moreover, it is not necessary for a device or method to address each and every problem sought to be solved by the present disclosure, for it to be encompassed by the present claims. Furthermore, no element, component, or method step in the present disclosure is intended to be dedicated to the public regardless of whether the element, component, or method step is explicitly recited in the claims. No claim element is intended to invoke 35 U.S.C. 112(f) unless the element is expressly recited using the phrase “means for.” As used herein, the terms “comprises”, “comprising”, or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a molecule, composition, process, method, or device that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such molecules, compositions, processes, methods, or devices.
This application claims priority to and the benefit of U.S. Provisional Patent Application Ser. No. 62/570,458, filed Oct. 10, 2017 and entitled “Methods, Apparatus and Systems for Amplification-Free DNA Data Storage,” the disclosure of which is incorporated herein by reference in its entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2018/055264 | 10/10/2018 | WO | 00 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2019/075100 | 4/18/2019 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
4923586 | Katayama et al. | May 1990 | A |
5082627 | Stanbro | Jan 1992 | A |
5194133 | Clark et al. | Mar 1993 | A |
5366140 | Koskenmaki et al. | Nov 1994 | A |
5414588 | Barbee, Jr. | May 1995 | A |
5486449 | Honso et al. | Jan 1996 | A |
5532128 | Eggers et al. | Jul 1996 | A |
5583359 | Ng et al. | Dec 1996 | A |
5639507 | Galvagni et al. | Jun 1997 | A |
5646420 | Yamashita | Jul 1997 | A |
5767687 | Geist | Jun 1998 | A |
5871918 | Thorp et al. | Feb 1999 | A |
5881184 | Guidash | Mar 1999 | A |
5965452 | Kovacs | Oct 1999 | A |
5982018 | Wark | Nov 1999 | A |
6051380 | Sosnowski et al. | Apr 2000 | A |
6060023 | Maracas | May 2000 | A |
6094335 | Early | Jul 2000 | A |
6110354 | Saban | Aug 2000 | A |
6123819 | Peeters | Sep 2000 | A |
6144023 | Clerc | Nov 2000 | A |
6238927 | Abrams et al. | May 2001 | B1 |
6440662 | Gerwen et al. | Aug 2002 | B1 |
6464889 | Lee et al. | Oct 2002 | B1 |
6506564 | Mirkin et al. | Jan 2003 | B1 |
6537747 | Mills, Jr. et al. | Mar 2003 | B1 |
6670131 | Hashimoto | Dec 2003 | B2 |
6673533 | Wohlstadter et al. | Jan 2004 | B1 |
6716620 | Bashir et al. | Apr 2004 | B2 |
6749731 | Kobori | Jun 2004 | B2 |
6762050 | Fukushima et al. | Jul 2004 | B2 |
6764745 | Karasawa et al. | Jul 2004 | B1 |
6790341 | Saban | Sep 2004 | B1 |
6824974 | Pisharody et al. | Nov 2004 | B2 |
6861224 | Fujita et al. | Mar 2005 | B2 |
6916614 | Takenaka et al. | Jul 2005 | B1 |
6958216 | Kelley | Oct 2005 | B2 |
7015046 | Wohlstadter et al. | Mar 2006 | B2 |
7075428 | Oleynik | Jul 2006 | B1 |
7169272 | Fritsch et al. | Jan 2007 | B2 |
7183055 | Van Der Weide | Feb 2007 | B2 |
7189435 | Tuominen et al. | Mar 2007 | B2 |
7202480 | Yokoi et al. | Apr 2007 | B2 |
7208077 | Albers et al. | Apr 2007 | B1 |
7276206 | Augustine et al. | Oct 2007 | B2 |
7399585 | Gau | Jul 2008 | B2 |
7432120 | Mascolo et al. | Oct 2008 | B2 |
7470533 | Xu et al. | Dec 2008 | B2 |
7507320 | Hwang et al. | Mar 2009 | B2 |
7531120 | Van Rijn et al. | May 2009 | B2 |
7579823 | Ayliffe | Aug 2009 | B1 |
7691433 | Kronholz et al. | Apr 2010 | B2 |
7785785 | Pourmand et al. | Aug 2010 | B2 |
7834344 | Mascolo et al. | Nov 2010 | B2 |
7851045 | Gandon et al. | Dec 2010 | B2 |
7886601 | Merassi et al. | Feb 2011 | B2 |
7901629 | Calatzis et al. | Mar 2011 | B2 |
7943394 | Flandre et al. | May 2011 | B2 |
8241508 | D'Urso | Aug 2012 | B2 |
8313633 | Li et al. | Nov 2012 | B2 |
8351181 | Ahn | Jan 2013 | B1 |
8591816 | Calatzis et al. | Nov 2013 | B2 |
8652768 | Huber et al. | Feb 2014 | B1 |
8753893 | Liu et al. | Jun 2014 | B2 |
8927464 | Aizenberg et al. | Jan 2015 | B2 |
8940663 | Iqbal et al. | Jan 2015 | B2 |
9070733 | Rajagopal et al. | Jun 2015 | B2 |
9108880 | Jin et al. | Aug 2015 | B2 |
9139614 | Medintz | Sep 2015 | B2 |
9306164 | Chang et al. | Apr 2016 | B1 |
9829456 | Merriman et al. | Nov 2017 | B1 |
9956743 | Jin et al. | May 2018 | B2 |
10036064 | Merriman et al. | Jul 2018 | B2 |
10125420 | Jin et al. | Nov 2018 | B2 |
10151722 | Jin et al. | Dec 2018 | B2 |
10508296 | Merriman et al. | Dec 2019 | B2 |
10526696 | Jin et al. | Jan 2020 | B2 |
10584410 | Jin et al. | Mar 2020 | B2 |
10597767 | Merriman et al. | Mar 2020 | B2 |
10712334 | Choi et al. | Jul 2020 | B2 |
20020022223 | Connolly | Feb 2002 | A1 |
20020090649 | Chan et al. | Jul 2002 | A1 |
20020137083 | Kobori et al. | Sep 2002 | A1 |
20020138049 | Allen et al. | Sep 2002 | A1 |
20020142150 | Baumann et al. | Oct 2002 | A1 |
20020142477 | Lewis et al. | Oct 2002 | A1 |
20020172963 | Kelley et al. | Nov 2002 | A1 |
20020184939 | Yadav | Dec 2002 | A1 |
20030025133 | Brousseau | Feb 2003 | A1 |
20030040000 | Connolly et al. | Feb 2003 | A1 |
20030040173 | Fonash | Feb 2003 | A1 |
20030064390 | Schülein et al. | Apr 2003 | A1 |
20030087296 | Fujita et al. | May 2003 | A1 |
20030109031 | Chafin et al. | Jun 2003 | A1 |
20030141189 | Lee et al. | Jul 2003 | A1 |
20030141276 | Lee et al. | Jul 2003 | A1 |
20030186263 | Frey et al. | Oct 2003 | A1 |
20030224387 | Kunwar et al. | Dec 2003 | A1 |
20040014106 | Patno et al. | Jan 2004 | A1 |
20040023253 | Kunwar et al. | Feb 2004 | A1 |
20040038090 | Faris | Feb 2004 | A1 |
20040048241 | Freeman et al. | Mar 2004 | A1 |
20040063100 | Wang | Apr 2004 | A1 |
20040086929 | Weide et al. | May 2004 | A1 |
20040096866 | Hoffman et al. | May 2004 | A1 |
20040012161 | Chiu | Jun 2004 | A1 |
20040146863 | Pisharody et al. | Jul 2004 | A1 |
20040209355 | Edman et al. | Oct 2004 | A1 |
20040209435 | Patridge et al. | Oct 2004 | A1 |
20040229247 | DeBoer et al. | Nov 2004 | A1 |
20040235016 | Hamers | Nov 2004 | A1 |
20040248282 | Sobha | Dec 2004 | A1 |
20050029227 | Chapman | Feb 2005 | A1 |
20050067086 | Ito et al. | Mar 2005 | A1 |
20050074911 | Kornilovich et al. | Apr 2005 | A1 |
20050151541 | Brinz et al. | Jul 2005 | A1 |
20050156157 | Parsons et al. | Jul 2005 | A1 |
20050164371 | Arinaga | Jul 2005 | A1 |
20050172199 | Miller et al. | Aug 2005 | A1 |
20050181195 | Dubrow | Aug 2005 | A1 |
20050221473 | Dubin et al. | Oct 2005 | A1 |
20050227373 | Flandre et al. | Oct 2005 | A1 |
20050247573 | Nakamura et al. | Nov 2005 | A1 |
20050285275 | Son | Dec 2005 | A1 |
20050287548 | Bao et al. | Dec 2005 | A1 |
20050287589 | Connolly | Dec 2005 | A1 |
20060003482 | Chinthakindi et al. | Jan 2006 | A1 |
20060019273 | Connolly et al. | Jan 2006 | A1 |
20060024504 | Nelson et al. | Feb 2006 | A1 |
20060024508 | D'Urso et al. | Feb 2006 | A1 |
20060029808 | Zhai et al. | Feb 2006 | A1 |
20060051919 | Mascolo et al. | Mar 2006 | A1 |
20060051946 | Mascolo et al. | Mar 2006 | A1 |
20060105449 | Larmer et al. | May 2006 | A1 |
20060105467 | Niksa et al. | May 2006 | A1 |
20060128239 | Nun et al. | May 2006 | A1 |
20060147983 | O'uchi | Jul 2006 | A1 |
20060154489 | Tornow | Jul 2006 | A1 |
20060275853 | Matthew et al. | Dec 2006 | A1 |
20070026193 | Luzinov et al. | Feb 2007 | A1 |
20070048748 | Williams et al. | Mar 2007 | A1 |
20070140902 | Calatzis et al. | Jun 2007 | A1 |
20070148815 | Chao et al. | Jun 2007 | A1 |
20070186628 | Curry et al. | Aug 2007 | A1 |
20070184247 | Simpson et al. | Sep 2007 | A1 |
20070207487 | Emig et al. | Sep 2007 | A1 |
20070231542 | Deng | Oct 2007 | A1 |
20080012007 | Li et al. | Jan 2008 | A1 |
20080098815 | Merassi et al. | May 2008 | A1 |
20080149479 | Olofsson et al. | Jun 2008 | A1 |
20080199657 | Capron et al. | Aug 2008 | A1 |
20080199659 | Zhao | Aug 2008 | A1 |
20090011222 | Xiu et al. | Jan 2009 | A1 |
20090017571 | Nuckolls | Jan 2009 | A1 |
20090020428 | Levitan | Jan 2009 | A1 |
20090027036 | Nuckolls et al. | Jan 2009 | A1 |
20090062684 | Gregersen et al. | Mar 2009 | A1 |
20090152109 | Whitehead et al. | Jun 2009 | A1 |
20090162927 | Naaman et al. | Jun 2009 | A1 |
20090170716 | Su et al. | Jul 2009 | A1 |
20090178935 | Reymond et al. | Jul 2009 | A1 |
20090295372 | Krstic et al. | Dec 2009 | A1 |
20090297913 | Zhang et al. | Dec 2009 | A1 |
20090306578 | Sivan et al. | Dec 2009 | A1 |
20090324308 | Law et al. | Dec 2009 | A1 |
20100038342 | Lim et al. | Feb 2010 | A1 |
20100044212 | Kim et al. | Feb 2010 | A1 |
20100055397 | Kurihara et al. | Mar 2010 | A1 |
20100132771 | Lu | Jun 2010 | A1 |
20100142259 | Drndic et al. | Jun 2010 | A1 |
20100149530 | Tomaru | Jun 2010 | A1 |
20100167938 | Su et al. | Jul 2010 | A1 |
20100184062 | Steinmueller-Nethl et al. | Jul 2010 | A1 |
20100188109 | Edel et al. | Jul 2010 | A1 |
20100194409 | Gao et al. | Aug 2010 | A1 |
20100201381 | Iqbal et al. | Aug 2010 | A1 |
20100206367 | Jeong et al. | Aug 2010 | A1 |
20100227416 | Koh et al. | Sep 2010 | A1 |
20100280397 | Feldman et al. | Nov 2010 | A1 |
20100285275 | Baca et al. | Nov 2010 | A1 |
20100285601 | Kong et al. | Nov 2010 | A1 |
20100288543 | Hung et al. | Nov 2010 | A1 |
20100300899 | Levine et al. | Dec 2010 | A1 |
20110056845 | Stellacci | Mar 2011 | A1 |
20110065588 | Su et al. | Mar 2011 | A1 |
20110076783 | Liu et al. | Mar 2011 | A1 |
20110091787 | McGrath et al. | Apr 2011 | A1 |
20110160077 | Chaisson et al. | Jun 2011 | A1 |
20110166034 | Kwong et al. | Jul 2011 | A1 |
20110217763 | Rasooly et al. | Sep 2011 | A1 |
20110227558 | Mannion et al. | Sep 2011 | A1 |
20110229667 | Jin et al. | Sep 2011 | A1 |
20110233075 | Soleymani et al. | Sep 2011 | A1 |
20110248315 | Nam et al. | Oct 2011 | A1 |
20110287956 | Iqbal et al. | Nov 2011 | A1 |
20110291673 | Shibata et al. | Dec 2011 | A1 |
20110311853 | Fratti | Dec 2011 | A1 |
20110312529 | He et al. | Dec 2011 | A1 |
20120060905 | Fogel et al. | Mar 2012 | A1 |
20120122715 | Gao et al. | May 2012 | A1 |
20120220046 | Chao | Aug 2012 | A1 |
20120258870 | Schwartz et al. | Oct 2012 | A1 |
20120286332 | Rothberg et al. | Nov 2012 | A1 |
20120309106 | Eichen et al. | Dec 2012 | A1 |
20130049158 | Hong et al. | Feb 2013 | A1 |
20130071289 | Knoll | Mar 2013 | A1 |
20130108956 | Lu et al. | May 2013 | A1 |
20130109577 | Korlach et al. | May 2013 | A1 |
20130162276 | Lee et al. | Jun 2013 | A1 |
20130183492 | Lee et al. | Jul 2013 | A1 |
20130214875 | Duncan et al. | Aug 2013 | A1 |
20130239349 | Knights et al. | Sep 2013 | A1 |
20130245416 | Yarmush et al. | Sep 2013 | A1 |
20130273340 | Neretina et al. | Oct 2013 | A1 |
20130281325 | Elibol et al. | Oct 2013 | A1 |
20130331299 | Reda et al. | Dec 2013 | A1 |
20140001055 | Elibol et al. | Jan 2014 | A1 |
20140011013 | Jin | Jan 2014 | A1 |
20140018262 | Reda et al. | Jan 2014 | A1 |
20140048776 | Huang et al. | Feb 2014 | A1 |
20140054788 | Majima et al. | Feb 2014 | A1 |
20140057283 | Wang et al. | Feb 2014 | A1 |
20140061049 | Lo et al. | Mar 2014 | A1 |
20140079592 | Chang et al. | Mar 2014 | A1 |
20140027775 | Quick et al. | Jun 2014 | A1 |
20140170567 | Sakamoto et al. | Jun 2014 | A1 |
20140174927 | Bashir et al. | Jun 2014 | A1 |
20140197459 | Kis et al. | Jul 2014 | A1 |
20140218637 | Gao et al. | Aug 2014 | A1 |
20140235493 | Zang et al. | Aug 2014 | A1 |
20140253827 | Gao et al. | Sep 2014 | A1 |
20140284667 | Basker et al. | Sep 2014 | A1 |
20140320849 | Chou et al. | Oct 2014 | A1 |
20140367749 | Bai et al. | Dec 2014 | A1 |
20140377900 | Yann et al. | Dec 2014 | A1 |
20150005188 | Levner et al. | Jan 2015 | A1 |
20150017655 | Huang et al. | Jan 2015 | A1 |
20150049332 | Sun et al. | Feb 2015 | A1 |
20150057182 | Merriman et al. | Feb 2015 | A1 |
20150065353 | Turner et al. | Mar 2015 | A1 |
20150068892 | Ueno et al. | Mar 2015 | A1 |
20150077183 | Ciubotaru | Mar 2015 | A1 |
20150148264 | Esfandyarpour et al. | May 2015 | A1 |
20150177150 | Rothberg et al. | Jun 2015 | A1 |
20150191709 | Heron et al. | Jul 2015 | A1 |
20150263203 | Lewis et al. | Sep 2015 | A1 |
20150293025 | Ninomiya et al. | Oct 2015 | A1 |
20150294875 | Khondaker et al. | Oct 2015 | A1 |
20150344945 | Mandell et al. | Dec 2015 | A1 |
20160017416 | Boyanov et al. | Jan 2016 | A1 |
20160045378 | Geloen | Feb 2016 | A1 |
20160155971 | Strachan et al. | Jun 2016 | A1 |
20160187282 | Gardner et al. | Jun 2016 | A1 |
20160265047 | van Rooyen et al. | Sep 2016 | A1 |
20160284811 | Yu et al. | Sep 2016 | A1 |
20160290957 | Ram | Oct 2016 | A1 |
20160319342 | Kawai et al. | Nov 2016 | A1 |
20160377564 | Carmignani et al. | Dec 2016 | A1 |
20170023512 | Cummins et al. | Jan 2017 | A1 |
20170037462 | Turner et al. | Feb 2017 | A1 |
20170038333 | Turner et al. | Feb 2017 | A1 |
20170043355 | Fischer | Feb 2017 | A1 |
20170044605 | Merriman | Feb 2017 | A1 |
20170131237 | Ikeda | May 2017 | A1 |
20170184542 | Chatelier et al. | Jun 2017 | A1 |
20170234825 | Elibol et al. | Aug 2017 | A1 |
20170240962 | Merriman | Aug 2017 | A1 |
20170288017 | Majima et al. | Oct 2017 | A1 |
20170332918 | Keane | Nov 2017 | A1 |
20180014786 | Keane | Jan 2018 | A1 |
20180031508 | Jin | Feb 2018 | A1 |
20180031509 | Jin | Feb 2018 | A1 |
20180045665 | Jin | Feb 2018 | A1 |
20180259474 | Jin | Sep 2018 | A1 |
20180297321 | Jin et al. | Oct 2018 | A1 |
20180305727 | Merriman | Oct 2018 | A1 |
20180340220 | Merriman | Nov 2018 | A1 |
20190004003 | Merriman | Jan 2019 | A1 |
20190033244 | Jin | Jan 2019 | A1 |
20190039065 | Choi | Feb 2019 | A1 |
20190041355 | Merriman | Feb 2019 | A1 |
20190041378 | Choi | Feb 2019 | A1 |
20190094175 | Merriman | Mar 2019 | A1 |
20190194801 | Jin et al. | Jun 2019 | A1 |
20190355442 | Merriman et al. | Nov 2019 | A1 |
20190376925 | Choi et al. | Dec 2019 | A1 |
20190383770 | Choi et al. | Dec 2019 | A1 |
20200157595 | Merriman et al. | May 2020 | A1 |
20200217813 | Merriman et al. | Jul 2020 | A1 |
20200277645 | Merriman et al. | Sep 2020 | A1 |
20200385850 | Merriman et al. | Dec 2020 | A1 |
20200385855 | Jin et al. | Dec 2020 | A1 |
20200393440 | Jin et al. | Dec 2020 | A1 |
Number | Date | Country |
---|---|---|
1795376 | Jun 2006 | CN |
101231287 | Jul 2008 | CN |
102706940 | Oct 2012 | CN |
104685066 | Jun 2015 | CN |
104703700 | Jun 2015 | CN |
108027335 | May 2018 | CN |
102012008375 | Oct 2012 | DE |
102013012145 | Jan 2015 | DE |
2053383 | Apr 2009 | EP |
3403079 | Nov 2018 | EP |
3408219 | Dec 2018 | EP |
3408220 | Dec 2018 | EP |
3414784 | Dec 2018 | EP |
3420580 | Jan 2019 | EP |
2485559 | May 2012 | GB |
0233981 | Jul 1990 | JP |
2008-258594 | Oct 2008 | JP |
2018-522236 | Aug 2018 | JP |
20070059880 | Jun 2007 | KR |
20110104245 | Sep 2011 | KR |
2001044501 | Jun 2001 | WO |
2002049980 | Jun 2002 | WO |
2002074985 | Sep 2002 | WO |
2003042396 | May 2003 | WO |
2004096986 | Nov 2004 | WO |
2004099307 | Nov 2004 | WO |
2005108612 | Nov 2005 | WO |
2007054649 | May 2007 | WO |
2007102960 | Sep 2007 | WO |
2007126432 | Nov 2007 | WO |
2007128965 | Nov 2007 | WO |
2009003208 | Jan 2009 | WO |
2009035647 | Mar 2009 | WO |
2010022107 | Feb 2010 | WO |
2012083249 | Jun 2012 | WO |
2012087352 | Jun 2012 | WO |
2012152056 | Nov 2012 | WO |
2013096851 | Jun 2013 | WO |
2014182630 | Jul 2014 | WO |
2015167019 | Nov 2015 | WO |
2015176990 | Nov 2015 | WO |
2015188197 | Dec 2015 | WO |
2016016635 | Feb 2016 | WO |
2016100635 | Jun 2016 | WO |
2016100637 | Jun 2016 | WO |
2016196755 | Dec 2016 | WO |
2016210386 | Dec 2016 | WO |
2017027518 | Feb 2017 | WO |
2017041056 | Mar 2017 | WO |
2017042038 | Mar 2017 | WO |
2017061129 | Apr 2017 | WO |
2017123416 | Jul 2017 | WO |
2017132567 | Aug 2017 | WO |
2017132586 | Aug 2017 | WO |
2017139493 | Aug 2017 | WO |
2017147187 | Aug 2017 | WO |
2017151680 | Sep 2017 | WO |
2017184677 | Oct 2017 | WO |
2018022799 | Feb 2018 | WO |
2018026855 | Feb 2018 | WO |
2018098286 | May 2018 | WO |
2018132457 | Jul 2018 | WO |
2018136148 | Jul 2018 | WO |
2018200687 | Nov 2018 | WO |
2018208505 | Nov 2018 | WO |
2003091458 | Jan 2019 | WO |
Entry |
---|
Church et al. Next-Generation Digital Information Storage in DNA Science vol. 337, p. 1628 and supplementary information (Year: 2012). |
USPTO; Notice of Allowance dated May 11, 2020 in U.S. Appl. No. 16/073,706. |
USPTO; Notice of Allowance dated Jun. 1, 2020 in U.S. Appl. No. 16/076,673. |
USPTO; Non-Final Office Action dated Jun. 2, 2020 in U.S. Appl. No. 16/684,338. |
USPTO; Non-Final Office Action dated Jun. 15, 2020 in U.S. Appl. No. 16/878,484. |
USPTO; Non-Final Office Action dated Jun. 30, 2020 in U.S. Appl. No. 16/479,257. |
USPTO; Non-Final Office Action dated Jun. 30, 2020 in U.S. Appl. No. 16/477,106. |
EP; European Search Report dated Jun. 18, 2020 in Application No. 16815467.2. |
CN; Office Action dated Jun. 5, 2020 in Chinese Patent Application No. 2017800204782. |
EP; European Search Report dated Jun. 26, 2020 in Application No. 17874229.2. |
Li et al., “Graphene Channel Liquid Container Field Effect Transistor as pH Sensor,” Hindawi Publishing Corp., Journal of Nanomaterials 2014. |
USPTO; Requirement for Restriction dated Nov. 2, 2011 in U.S. Appl. No. 12/667,583. |
USPTO; Non-Final Office Action dated Sep. 28, 2018 in U.S. Appl. No. 12/667,583. |
USPTO; Final Office Action dated Feb. 19, 2019 in U.S. Appl. No. 12/667,583. |
USPTO; Non-Final Office Action dated Aug. 19, 2019 in U.S. Appl. No. 12/667,583. |
USPTO; Requirement for Restriction dated Dec. 1, 2016 in U.S. Appl. No. 13/996,477. |
USPTO; Non-Final Office Action dated May 5, 2017 in U.S. Appl. No. 13/996,477. |
USPTO; Final Office Action dated Oct. 4, 2017 in U.S. Appl. No. 13/996,477. |
USPTO; Notice of Allowance dated Jan. 3, 2018 in U.S. Appl. No. 13/996,477. |
USPTO; Final Office Action date Dec. 30, 2016 in U.S. Appl. No. 15/050,270. |
USPTO; Advisory Action dated Mar. 14, 2017 in U.S. Appl. No. 15/050,270. |
USPTO; Non-Final Office Action dated Sep. 29, 2017 in U.S. Appl. No. 15/050,270. |
USPTO; Final Office Action dated Jul. 10, 2018 in U.S. Appl. No. 15/050,270. |
USPTO; Advisory Action dated Sep. 26, 2018 in U.S. Appl. No. 15/050,270. |
USPTO; Non-Final Office Action dated Feb. 26, 2019 in U.S. Appl. No. 15/050,270. |
USPTO; Final Office Action dated Jul. 10, 2019 in U.S. Appl. No. 15/050,270. |
USPTO; Notice of Allowance dated Jan. 6, 2020 in U.S. Appl. No. 15/050,270. |
USPTO; Non-Final Office Action dated Oct. 19, 2016 in U.S. Appl. No. 15/220,307. |
USPTO; Notice of Allowance dated Jul. 28, 2017 in U.S. Appl. No. 15/220,307. |
USPTO; Requirement for Restriction dated Jan. 17, 2017 in U.S. Appl. No. 15/336,557. |
USPTO; Non-Final Office Action dated May 16, 2017 in U.S. Appl. No. 15/336,557. |
USPTO; Final Office Action dated Mar. 8, 2018 in U.S. Appl. No. 15/336,557. |
USPTO; Notice of Allowance dated May 25, 2018 in U.S. Appl. No. 15/336,557. |
USPTO; Non-Final Office Action dated Feb. 9, 2018 in U.S. Appl. No. 15/728,400. |
USPTO; Final Office Action dated Jul. 10, 2018 in U.S. Appl. No. 15/728,400. |
USPTO; Advisory Action dated Oct. 12, 2018 in U.S. Appl. No. 15/728,400. |
USPTO; Advisory Action dated Nov. 14, 2018 in U.S. Appl. No. 15/728,400. |
USPTO; Notice of Allowance dated Dec. 6, 2018 in U.S. Appl. No. 15/728,400. |
USPTO; Non-Final Office Action dated Feb. 23, 2018 in U.S. Appl. No. 15/728,412. |
USPTO; Notice of Allowance dated Sep. 12, 2018 in U.S. Appl. No. 15/728,412. |
USPTO; Final Office Action dated Jun. 13, 2018 in U.S. Appl. No. 15/728,412. |
USPTO; Non-Final Office Action dated Feb. 23, 2018 in U.S. Appl. No. 15/796,080. |
USPTO; Final Office Action dated Jun. 14, 2018 in U.S. Appl. No. 15/796,080. |
USPTO; Advisory Action dated Sep. 4, 2018 in U.S. Appl. No. 15/796,080. |
USPTO; Notice of Allowance dated Oct. 11, 2018 in U.S. Appl. No. 15/796,080. |
USPTO; Non-Final Office Action dated Mar. 7, 2019 in U.S. Appl. No. 15/944,356. |
USPTO; Non-Final Office Action dated Sep. 4, 2018 in U.S. Appl. No. 15/979,135. |
USPTO; Non-Final Office Action dated Nov. 30, 2018 in U.S. Appl. No. 15/979,135. |
USPTO; Final Office Action dated Mar. 1, 2019 in U.S. Appl. No. 15/979,135. |
USPTO; Advisory Action dated May 22, 2019 in U.S. Appl. No. 15/979,135. |
USPTO; Non-Final Office Action dated Jun. 25, 2019 in U.S. Appl. No. 15/979,135. |
USPTO; Notice of Allowance dated Dec. 11, 2019 in U.S. Appl. No. 15/979,135. |
USPTO; Non-Final Office Action dated Aug. 22, 2019 in the U.S. Appl. No. 16/011,065. |
USPTO; Final Office Action dated Mar. 6, 2020 in U.S. Appl. No. 16/011,065. |
USPTO; Requirement for Restriction dated Oct. 15, 2018 in U.S. Appl. No. 16/015,028. |
USPTO; Non-Final Office Action dated Dec. 26, 2018 in U.S. Appl. No. 16/015,028. |
USPTO; Final Office Action dated Apr. 15, 2019 in U.S. Appl. No. 16/015,028. |
USPTO; Non-Final Office Action dated Jul. 30, 2019 in the U.S. Appl. No. 16/015,028. |
USPTO; Notice of Allowance dated Nov. 8, 2019 in U.S. Appl. No. 16/015,028. |
USPTO; Requirement for Restriction dated Dec. 17, 2018 in U.S. Appl. No. 16/015,049. |
USPTO; Non-Final Office Action dated Mar. 6, 2019 in U.S. Appl. No. 16/015,049. |
USPTO; Final Office Action dated Jun. 19, 2019 in U.S. Appl. No. 16/015,049. |
USPTO; Non-Final Office Action dated Nov. 5, 2019 in U.S. Appl. No. 16/015,049. |
USPTO; Notice of Allowance dated Feb. 20, 2020 in U.S. Appl. No. 16/015,049. |
USPTO; Non-Final Office Action dated Apr. 13, 2020 in U.S. Appl. No. 16/070,133. |
USPTO; Restriction Requirement dated Sep. 19, 2019 in U.S. Appl. No. 16/073,706. |
USPTO; Non-Final Office Action dated Oct. 24, 2019 in U.S. Appl. No. 16/073,706. |
USPTO; Non-Final Office Action dated Jan. 10, 2020 in U.S. Appl. No. 16/076,673. |
USPTO; Non-Final Office Action dated Feb. 1, 2019 in U.S. Appl. No. 16/152,190. |
USPTO; Notice of Allowance dated May 30, 2019, in U.S. Appl. No. 16/152,190. |
USPTO; Restriction Requirement dated May 29, 2019 in U.S. Appl. No. 16/250,929. |
USPTO; Notice of Allowance dated Oct. 23, 2019 in U.S. Appl. No. 16/250,929. |
USPTO; Restriction Requirement dated Apr. 8, 2020 in U.S. Appl. No. 16/479,257. |
PCT; International Search Report and Written Opinion dated Nov. 29, 2012 in Application No. PCT/US2011/001995. |
PCT; International Search Report and Written Opinion dated Apr. 13, 2018 in Application No. PCT/US2018/013140. |
PCT; International Search Report and Written Opinion dated Jan. 27, 2017 in Application No. PCT/US2017/015437. |
PCT; International Search Report and Written Opinion dated Jan. 27, 2017 in Application No. PCT/US2017/015465. |
PCT; International Search Report and Written Opinion dated Jul. 26, 2017 in Application No. PCT/US2017/017231. |
PCT; International Search Report and Written Opinion dated May 25, 2017 in Application No. PCT/US2017/018950. |
PCT; International Search Report and Written Opinion dated Jul. 20, 2018 in Application No. PCT/US2018/029382. |
PCT; International Search Report and Written Opinion dated Jul. 20 ,2018 in Application No. PCT/US2018/029393. |
PCT; International Search Report and Written Opinion dated Sep. 27, 2016 in Application No. PCT/US2016/039446. |
PCT; International Search Report and Written Opinion dated Nov. 22, 2017 in Application No. PCT/US2017/044023. |
PCT; International Search Report and Written Opinion dated Dec. 26, 2017 in Application No. PCT/US2017/044965. |
PCT; International Search Report and Written Opinion received Nov. 9, 2018 in Application No. PCT/US2018/048873. |
PCT; International Search Report and Written Opinion dated Apr. 8, 2010 in Application No. PCT/US2009/054235. |
PCT; International Search Report and Written Opinion dated Jan. 18, 2019 in Application No. PCT/US2018/055264. |
PCT; International Search Report and Written Opinion dated Mar. 12, 2018 in Application No. PCT/US2017/063025. |
PCT; International Search Report and Written Opinion dated Mar. 7, 2018 in Application No. PCT/US2017/063105. |
PCT; International Search Report and Written Opinion dated Apr. 18. 2017 in Application No. PCT/US2016/068922. |
CN; Notice of the First Office Action dated Sep. 2, 2019 in Chinese Application No. 201680049272.8. |
CN; Notice of the First Office Action dated Sep. 30, 2019 in Chinese Application No. 201780020478.2. |
EP; European Search Report dated Jan. 30, 2019 in Application No. 16815467.2. |
EP; European Search Report dated Aug. 2, 2019 in Application No. 16885434.7. |
EP; European Search Report dated Jan. 29, 2020 in Application No. 17745013.7. |
EP; European Search Report dated Aug. 2, 2019 in Application No. 17745026.9. |
EP; European Search Report dated Jan. 29, 2020 in Application No. 17750776.1. |
EP; European Search Report dated Oct. 24, 2019 in Application No. 17757146.0. |
EP; European Search Report dated Mar. 6, 2020 in Application No. 17835231.6. |
EP; European Search Report dated Feb. 7, 2020 in Application No. 17837566.3. |
Ahn et al., “Electrical Immunosensor Based on a Submicron-Gap Interdigitated Electrode and Gold Enhancement,” Biosensors and Bioelectronics, vol. 26, pp. 4690-4696, (2011). |
Alayo et al., “Gold Interdigitated Nanoelectrodes as a Sensitive Analytical Tool for Selective Detection of Electroactive Species via Redox Cycling,” Microchim Acta, vol. 183, pp. 1633-1639, (2016). |
Antibody Structure Downloaded from https://absoluteantibody.com/antibody-resources/antibody-overview/antibody-structure/ (Mar. 1, 2019). |
Bai et al., “Review: Gas Sensors Based on Conducting Polymers,” Sensors, vol. 7, pp. 267-307, (2007). |
Bailey et al., “DNA-Encoded Antibody Libraries: A Unified Platform for Multiplexed Cell Sorting and Detection of Genes and Proteins,” Journal of American Chemical Society, vol. 129, pp. 1959-1967, (2007). |
Bechelany et al. “Synthesis Mechanisms of Organized Nanoparticles: Influence of Annealing Temperature and Atmosphere,” Crystal Growth and Design, vol. 10, pp. 587-596 (Oct. 21, 2010). |
Berdat et al., “Label-Free Detection of DNA with Interdigitated Micro-Electrodes in a Fluidic Cell,” Lab on a Chip, vol. 8, pp. 302-308, (2008). |
Bhura, “3D Interdigitated Electrode Array (IDEA) Biosensor for Detection of Serum Biomarker,” Master Thesis, Portland State University, 68 Pages, (2011). |
Blossey, R., “Self-Cleaning Surfaces-Virtual Realities,” Nature Materials, vol. 2(5), pp. 301-306, (May 2006). |
Bonilla et al., “Electrical Readout of Protein Microarrays on Regular Glass Slides,” Analytical Chemistry, vol. 83, pp. 1726-1731, (2011). |
Botsialas et al., “A Miniaturized Chemocapacitor System for the Detection of Volatile Organic Compounds,” Sensors and Actuators B, Chemical, vol. 177, pp. 776-784, (2013). |
Branagan et al., “Enhanced Mass Transport of Electroactive Species to Annular Nanoband Electrodes Embedded in Nanocapillary Array Membranes,” Journal of the American Chemical Society, vol. 134, pp. 8617-8624, (2012). |
Braun et al., “DNA-Templated Assembly and Electrode Attachment of a Conducting Silver Wire,” Letters to Nature, vol. 391(6669), pp. 775-778, (Feb. 1998). |
Briglin et al., “Exploitation of Spatiotemporal Information and Geometric Optimization of Signal/Noise Performance Using Arrays of Carbon Black-Polymer Composite Vapor Detectors,” Sensors and Actuators B, vol. 82, pp. 54-74, (2002). |
Cassie, A.B.D. et al., “Wettability of Porous Surfaces,” Transitions of the Faraday Society, vol. 40, pp. 546-551, (Jan. 1944) (Abstract Only). |
Cerofolini et al., “A Hybrid Approach to Nanoelectronics: A Hybrid Approach to Nanoelectrics,” Nanotechnology, Institute of Physics Publishing, GB, vol. 16, No. 8, pp. 1040-1047 (2005). |
Chen, X. et al., “Electrical Nanogap Devices for Biosensing,” Materials Today, vol. 13, pp. 28-41, (Nov. 2010). |
Chen et al., “Electrochemical Approach for Fabricating Nanogap Electrodes with Well Controllable Separation,” Applied Physics Letters, vol. 86, pp. 123105.1-123105.3, (2005). |
Chen et al., “Fabrication of Submicron-Gap Electrodes by Silicon Volume Expansion for DNA-Detection,” Sensors and Actuators A, vol. 175, pp. 73-77, (2012). |
Choi, J. E. et al., “Fabrication of Microchannel with 60 Electrodes and Resistance Measurement,” Flow Measurement and Instrumentation, vol. 21, pp. 178-183, (Sep. 2010) (Abstract Only). |
Choi Y.S. et al., “Hybridization by an Electroatomical Genome on Detection on Using an Indicator-Free DNA on a Microelectrode-Array DNA Chip,” Bulletin of the Korean Chemistry Society, vol. 26, pp. 379-383, (2005). |
Choi, C. et al., “Strongly Superhydrophobic Silicon Nanowires by Supercritical CO2 Drying,” Electronic Materials Letters, vol. 6 (2), pp. 59-64, (Jun. 2010). |
Church et al., “Next-Generation Digital Information Storage in DNA,” Science, vol. 337(6102), p. 6102, (Sep. 28, 2012). |
Cosofret et al., “Microfabricated Sensor Arrays Sensitive to pH and K+ for Ionic Distribution Measurements in the Beating Heart,” Analytical Chemistry, vol. 67, pp. 1647-1653, (1995). |
Coulson S.R. et al., “Super-Repellent Composite Fluoropolymer Surfaces,” The Journal of Physical Chemistry B., vol. 104(37), pp. 8836-8840, (Aug. 2000). |
Dickey et al., “Electrically Addressable Parallel Nanowires with 30 NM Spacing from Micromolding and Nanoskiving,” Nano Letters, vol. 8(12), pp. 4568-4573, (2008). |
Fan et al., “Detection of MicroRNAs Using Target-Guided Formation of Conducting Polymer Nanowires in Nanogaps,” Journal of the American Chemical Society, vol. 129, pp. 5437-5443, (2007). |
Fink et al. “Electrical Conduction Through DNA Molecules,” Nature, vol. 398, pp. 407-410 (Jan. 20, 1999). |
Fuller et al., “Real-Time Single-Molecule Electronic DNA Sequencing by Synthesis Using Polymer-Tagged Nucleotides on a Nanopore Array,” Proceedings of the National Academy of Sciences, vol. 113(19), pp. 5233-5523, (May 10, 2016). |
Gapin, A.I. et al., “CoPt Patterned Media in Anodized Aluminum Oxide Templates,” Journal of Applied Physics, vol. 99(8), pp. 08G902 (1-3), (Apr. 2006). |
Ghindilis, A. et al., “Real Time Biosensor Platforms Fully Integrated Device for Impedimetric Assays,” ECS Transactions, vol. 33, pp. 59-68, (2010). |
Guo et al., “Conductivity of a single DNA duplex bridging a carbon nanotube gap,” Nat. Nanotechnol., vol. 3, No. 3, pp. 1-12 (2008). |
Han, “Energy Band Gap Engineering of Graphene Nanoribbons,” Physical Review Letters, vol. 98, pp. 1-7, (May 16, 2007). |
Han et al., “Redox Cycling in Nanopore-Confined Recessed Dual-Ring Electrode Arrays,” Journal of Physical Chemistry C, vol. 120, pp. 20634-20641, (2016). |
Hanief, Topic, Pineda-Vargas, “Solid State Dewetting of Continuous Thin Platinum Coatings,” Nuclear Instruments and Methods in Physics Research, vol. 363, pp. 173-176, (2015). |
Hashioka et al., “Deoxyribonucleic Acid Sensing Device with 40-NM-Gap-Electrodes Fabricated by Low-Cost Conventional Techniques,” Applied Physics Letters, vol. 85(4), p. 687-688, (Jul. 2004). |
He et al., “Electromechanical Fabrication of Atomically Thin Metallic Wires and Electrodes Separated with Molecular-Scale Gaps,” Journal of Electroanalytical Chemistry, vol. 522, pp. 167-172, (Jan. 2002). |
Heerema et al., “Graphene Nanodevices for DNA Sequencing,” Nature Nanotechnology, vol. 11, pp. 127-136, (Feb. 3, 2016). |
Henry et al., “Microcavities Containing Individually Addressable Recessed Microdisk and Tubular Nanoband Electrodes,” Journal of the Electrochemical Society, vol. 146(9), pp. 3367-3373, (1999). |
Hwang et al., “Electrical Transport Through 60 Base Pairs of Poly (dG)-Poly (dC) DNA Molecules,” Applied Physics Letters, vol. 81(6), p. 1134-1136, (Aug. 2002). |
Ino et al., “Addressable Electrode Array Device with IDA Electrodes for High-Throughput Detection,” Lab on a Chip, vol. 11, p. 385-388, (2011). |
Ino et al., “Local Redox-Cycling-Based Electrochemical Chip Device with Seep Microwells for Evaluation of Embryoid Bodies,” Angewandte Chemie International Edition, vol. 51, pp. 6648-6652, (2012). |
Iqbal et al., “Direct Current Electrical Characterization of ds-DNA in Nanogap Junctions,” Applied Physics Letter, vol. 86, p. 153901-1-153901-3, (Apr. 2005). |
Javey et al., “Layer-By-Layer Assembly of Nanowires for Three-Dimensional, Multifunctional Electronics,” Nano Letters, vol. 7, pp. 773-777, (2007). |
Khawli et al., “Charge Variants in IgG1-Isolation, Characterization, in Vitro Binding Properties and Pharmacokinetics in Rats,” Landes Bioscience, vol. 2(6), pp. 613-623, (2010). |
Kim, J. Y. et al., “Optically Transparent Glass with Vertically Aligned Surface AI203 Nanowires Having Superhydrophobic Characteristics,” NANO: Brief Reports and Reviews, vol. 5(2), pp. 89-95, (Apr. 2010) (Abstract Only). |
Kim et al., “Rapid Fabrication of Uniformly Sized Nanopores and Nanopore Arrays for Parallel DNA Analysis,” Advances Materials, vol. 18, pp. 3149-3153, (Dec. 4, 2006). |
Kitsara et al., “Single Chip Interdigitated Electrode Capacitive Chemical Sensor Arrays,” Sensors and Actuators B, vol. 127, pp. 186-192, (2007). |
Kitsara et al., “Small-Volume Multiparametric Electrochemical Detection at Low Cost Polymeric Devices Featuring Nanoelectrodes,” SPIE, vol. 9518, 9 Pages, (2015). |
Kraft, “Doped Diamond: A Compact Review on a New, Versatile Electrode Material,” International Journal of Electrochemistry, vol. 2, pp. 355-385, (May 2007). |
Kumar et al., “Terminal Phosphate Labeled Nucleotides: Synthesis, Applications and Linker Effect on Incorporation by DNA Polymerases,” Nucleosides, Nucleotides and Nucleic Acids, Taylor and Francis, vol. 24, No. 5-7, pp. 401-408 (2005). |
Lee, K. H. et al., “One-Chip Electronic Detection of DNA Hybridization using Precision Impedance-Based CMOS Array Sensor,” Biosensors and Bioelectronics, vol. 26, pp. 1373-1379, (Dec. 15, 2010). |
Lin et al., “An Addressable Microelectrode Array for Electrichemical Detection,” Analytical Chemistry, vol. 80, pp. 6830-6833, (2008). |
Liu et al., “Atomically Thin Molybdenum Disulfide Nanopores with High Sensitivity for DNA Translocation,” ACS Nano, vol. 8, pp. 2504-2511, (Feb. 18, 2014). |
Liu et al., “An Enzyme-Based E-DNA Sensor for Sequence-Specific Detection of Femtomolar DNA Targets,” J. Am. Chem. Soc., vol. 130(21), pp. 6820-6825, (2008). |
Liu et al., “Controllable Nanogap Fabrication on Microchip by Chronopotentiometry,” Electrochimica Acta, vol. 50, pp. 3041-3047, (2005). |
MacNaughton et al., “High-Throughput Heterogeneous Integration of Diverse Nanomaterials on a Single Chip for Sensing Applications,” PLOS One, vol. 9(10), e111377, 7 Pages, (2014). |
Mastrototaro et al., “Thin-Film Flexible Multielectrode Arrays for Voltage Measurements in the Heart,” IEEE Engineering in Medicine & Biology Society 11th Annual International Conference, 1 Page, (1989). |
Mastrototaro et al., “Rigid and Flexible Thin-Film Multielectrode Arrays for Transmural Cardiac Recording,” IEEE Transactions on Biomedical Engineering, vol. 39, pp. 217-279, (1992). |
Mirando-Castro et al., “Hairpin-DNA Probe for Enzyme-Amplified Electrochemical Detection of Legionella pnuemophila,” Anal. Chem., vol. 79, pp. 4050-4055, (Jun. 1, 2007). |
Nishida, et al. “Self-Oriented Immobilization of DNA Polymerase Tagged by Titanium-Binding Peptide Motif,” Langmuir, vol. 31, pp. 732-740 (Dec. 17, 2014). |
Niwa, O. et al., “Fabrication and Characteristics of Vertically Separated Interdigitated Array Electrodes,” Journal of Electroanalytical Chemistry and Interfacial Electrochemistry, vol. 267 pp. 291-297, (Aug. 10, 1989) (Abstract Only). |
Okinaka et al., ““Polymer” Inclusions in Cobalt-Hardened Electroplated Gold,” Journal the of Electrochemical Society, vol. 125, p. 1745, (1978). (Abstract Only). |
Park, S.J. et al., “Array-Based Electrical Detection of DNA with Nanoparticle Probes,” Science, vol. 295, pp. 1503-1506, (Feb. 22, 2002). |
Park, C.W. et al., “Fabrication of Poly-Si/ AU Nano-Gaps Using Atomic-Layer-Deposited AI2O3 as a Sacrificial Layer,” Nanotechnology, vol. 16, pp. 361-364, (Feb. 1, 2005) (Abstract Only). |
Parkin, I. P. et al., “Self-Cleaning Coatings,” Journal of Materials Chemistry, vol. 15(17), pp. 1689-1695, (Dec. 2004). |
Prins et al., “Room-Temperature Gating of Molecular Junctions Using Few-Layer Graphene Nanogap Electrodes,” Nano Letters, vol. 11, pp. 4607-4611, (Oct. 21, 2011). |
Pugliese et al., “Processive Inforporation of Deoxynucleoside Triphosphate Analogs by Single-Molecule DNA Polymerase I (Klenow Fragment) Nanocircuits,” Journal of the American Chemical Society, vol. 137, No. 30, pp. 9587-9594 (2015). |
Qing et al., “Finely Tuning Metallic Nanogap Size with Electrodeposition by Utilizing High-Frequency Impedance in Feedback,” Angewandte Chemie Int ed, vol. 44, pp. 7771-7775, (2005). |
Reed et al., “Conductance of a Molecular Junction Reports,” Science, vol. 278, pp. 252-254, (Oct. 1997). |
Reichert et al., “Driving Current Through Single Organic Molecules,” Physical Review Letters, vol. 88(17), pp. 176804-1-176804-4, (Apr. 2002). |
Roppert et al., “A New Approach for an Interdigitated Electrodes DNA-Sensor,” XVIIIth International Symposium on Bioelectrochemistry and Bioenergetics, Bioelectrochemistry, p. 143, (2005). |
Roy, S. et al., “Mass-Produced Nanogap Sensor Arrays for Ultra-Sensitive Detection of DNA,” Journal of the American Chemical Society, vol. 131, pp. 12211-12217, (Aug. 5, 2009) (Abstract Only). |
Ruttkowski, E. et al., “CMOS based Arrays of Nanogaps Devices for Molecular Devices,” Proceedings of 2005 5th IEEE Conference on Nanotechnology, vol. 1, pp. 438-441, (Jul. 2005) (Abstract Only). |
Sanguino et al., “Interdigitated Capacitive Immunosensors with PVDF Immobilization Layers,” IEEE Sensors Journal, vol. 14(4), pp. 1260-1265, (Apr. 2014). |
Santschi et al., “Interdigitated 50nm Ti Electrode Arrays Fabricated using XeF2 Enhanced Focused Ion Beam Etching,” Nanotechnology, vol. 17, pp. 2722-2729, (2006). |
Schaefer et al., “Stability and Dewetting Kinetics of Thin Gold Films on Ti, TiOx, and ZnO Adhesion Layers,” Acta Materialia, vol. 61, pp. 7841-7848, (2013). |
Schrott, W. et al., “Metal Electrodes in Plastic Microfluidic Systems,” Microelectronic Engineering, vol. 86, pp. 1340-1342, (Jun. 2009). |
Shimanovsky et al., “Hiding Data in DNA,” International Workshop on Information Hiding, Lecture Notes in Computer Science, pp. 373-386, (Dec. 18, 2012). |
Shimoda, T. et al., “Solution-Processed Silicon Films and Transistors,” Nature, vol. 440(7085), pp. 783-786, (Apr. 2006). |
Sholders et al., “Distinct Conformations of a Putative Translocation Element in Poliovirus Polymerase,” Journal of Molecular Biology, vol. 426(7), pp. 1407-1419, (Apr. 3, 2014). |
Singh et al., “3D Nanogap Interdigitated Electrode Array Biosensors,” Analytical and Bioanalytical Chemistry, vol. 397, pp. 1493-1502, (2010). |
Singh et al., “Evaluation of Nanomaterials-Biomolecule Hybrids for Signals Enhancement of Impedimetric Biosensors,” 11th IEEE International Conference on Nanotechnology, pp. 707-710, (2011). |
Singh et al., “Nanoparticle-Enhanced Sensitivity of a Nanogap-Interdigitated Electrode Array Impedimetric Biosensor,” Langmuir, vol. 27, pp. 13931-13939, (2011). |
Stagni, C. et al., “CMOS DNA Sensor Array with Integrated A/D Conversation Based on Label-Free Capacitance Measurement,” IEEE Journal of Solid-State Circuits, vol. 41, pp. 2956-2964, (Nov. 20, 2006). |
Stenning, “The Investigation of Grain Boundary Development and Crystal Synthesis of Thin Gold Films on Silicon Wafers,” http://www.ucl.ac.uk/˜ucapikr/projects, (March 31, 2009). |
Su, Y., “Modeling and Characteristic Study of Thin Film Based Biosensor Based on COMSOL,” Mathematical Problems in Engineering, Article 581063 (6 Pages), (Apr. 7, 2014). |
Thompson, “Solid-State Dewetting of Thin Films,” Department of Materials Science and Engineering, vol. 42, pp. 399-434, (2012). |
Urban, M. et al., “A Paralleled Readout System for an Electrical DNA-Hybridization Assay Based on a Microstructured Electrode Array,” Review of Scientific Instruments, vol. 74, pp. 1077-1081, (Jan. 2003) (Abstract Only). |
Van Gerwin et al., “Nanoscaled Interdigitated Electrode Arrays for Biochemical Sensors,” Sensors and Actuators B, vol. 49, pp. 73-80, (1998). |
Van Megan et al., “Submicron Electrode Gaps Fabricated by Gold Electrodeposition at Interdigitated Electrodes,” Key Engineering Materials, vol. 605, pp. 107-110, (2014). |
Wang et al., “Electronics and Optoelectronics of Two-Dimensional Transition Metal Dichalcogenides,” Nature Nanotechnology, vol. 7, pp. 699-712, (Nov. 6, 2012). |
Xu et al., “Fabrication of Complex Metallic Nanostructures by Nanoskiving,” American Chemical Society Nano, vol. 1(3), pp. 215-227, (2007). |
Zafarani et al., “Electrochemical Redox Cycling in a New Nanogap Sensor: Design and Simulation,” Journal of Electroanalytical Chemistry, vol. 760, pp. 42-47, (2015). |
USPTO; Non-Final Office Action dated Sep. 22, 2020 in U.S. Appl. No. 16/639,716. |
USPTO; Non-Final Office Action dated Oct. 2, 2020 in U.S. Appl. No. 16/073,693. |
USPTO; Non-Final Office Action dated Nov. 9, 2020 in U.S. Appl. No. 16/731,749. |
PCT; International Search Report and Written Opinion dated Jun. 9, 2020 in Application No. PCT/US2020/13218. |
PCT; International Search Report and Written Opinion dated Aug. 6, 2020 in Application No. PCT/US2020/25068. |
PCT; International Search Report and Written Opinion dated Sep. 4, 2020 in Application No. PCT/US2020/28004. |
EP; European Search Report dated Sep. 30, 2020 in Application No. 17893481.6. |
JP; Office Action dated Aug. 13, 2020 in Japanese Application No. 2017-566864. |
CN; Office Action dated Aug. 14, 2020 in Chinese Patent Application No. 201680083636.4. |
Yang et al., “Two-Dimensional Graphene Nanoribbons,” J. Am. Chem. Soc. vol. 130, Issue 13 (2008). |
USPTO; Notice of Allowance dated Nov. 24, 2020 in U.S. Appl. No. 16/477,106. |
USPTO; Notice of Allowance dated Dec. 7, 2020 in U.S. Appl. No. 16/878,484. |
USPTO; Final Office Action dated Dec. 14, 2020 in U.S. Appl. No. 16/684,338. |
USPTO; Final Office Action dated Jan. 6, 2021 in U.S. Appl. No. 16/070,133. |
USPTO; Final Office Action dated Jan. 11, 2021 in U.S. Appl. No. 16/479,257. |
USPTO; Non-Final Office Action dated Dec. 15, 2020 in U.S. Appl. No. 16/831,722. |
JP; Office Action dated Dec. 2, 2020 in Japanese Patent Application No. 2018-536737. |
EP; European Search Report dated Dec. 23, 2020 in Application No. 18790713.4. |
EP; European Search Report dated Nov. 19, 2020 in Application No. 18739158.6. |
EP; European Search Report dated Dec. 14, 2020 in Application No. 18799263.1. |
Ali et al., “DNA hybridization detection using less than 10-nm gap silicon nanogap structure,” Sensors and Actuators A. vol. 199, pp. 304-309 (2013). |
Bornholt et al., “A DNA-Based Archival Storage System”, Architectural Support for Programming Languages and Operating Systems, pp. 637-649 (2016). |
Chen et al., “Silicon nanowire field-effect transistor-based biosensors for biomedical diagnosis and cellular recording investigation”, Nano Today, Elsevier, Amsterdam, NL, vol. 6, No. 2, pp. 131-154 (2011). |
Grass et al., “Robust Chemical Preservation of Digital Information on DNA in Silica with Error-Correcting Codes”, Angewandte Chemie International Edition, vol. 54, No. 8, pp. 2552-2555 (2015). |
Hatcher et al., “PNA versus DNA: Effects of Structural Fluctuations on Electronic Structure and Hole-Transport Mechanisms,” J. Amer. Chem. Soc., 130, pp. 11752-11761 (2008). |
Korlach et al., “Real-time DNA sequencing from single polymerase molecules,”11, Methods in Enzymology, Academy Press, vol. 472, pp. 431-455 (2010). |
Paul et al., “Charge transfer through Single-Stranded Peptide Nucleic Acid Composed of Thymine Nucleotides,” J. Phy. Chem. C 2008, 112, pp. 7233-7240 (2008). |
Shin et al., “Distance Dependence of Electron Transfer Across Peptides with Different Secondary Structures: The Role of Peptide Energetics and Electronic Coupling,” J. Amer. Chem. Soc. 2003, 125, pp. 3722-3732 (2003). |
Venkatramani et al., “Nucleic Acid Charge Transfer: Black, White and Gray,” Coard Chem Rev., 255(7-8): pp. 635-648 (2011). |
Number | Date | Country | |
---|---|---|---|
20200242482 A1 | Jul 2020 | US |
Number | Date | Country | |
---|---|---|---|
62570458 | Oct 2017 | US |