This disclosure is in the field of synthetic biology. In particular, this disclosure describes nanochannel devices for the assembly of long DNA molecules, the use of semiconductor chips for such devices, and methods for using the same in synthetic biology.
The field of synthetic biology is broadly concerned with engineering biological processes to achieve desired functional endpoints. One major area of focus is in modifying the DNA of organisms as a means of engineering their functional properties. In particular, this includes the need to synthesize the DNA sequences that correspond to individual genes, sets of genes, and larger genomic DNA constructs such as plasmid or chromosomal elements, up to the extremes of entire genomes of artificial organisms.
The standard commercial method of primary DNA chemical synthesis, the phosphoramidite method, is in practice limited to making DNA oligomers, up to several hundred bases in length. In contrast, common coding gene lengths are in the range of 1000-3000 bases, and thus even for producing such genes, primary synthesis oligomers must be further joined to make these longer fragments, at the scale of joining ˜10-˜100 such oligomers. For the exonic format of genes, such as in animals, the genomic extent of a gene is often on the order of 10,000-100,000 bases, and thus fabricating genes in this format would require joining DNA on an order of magnitude larger scale. Similarly, the set of genes that comprise a biochemical synthesis pathway may typically be 10-100 genes, and packaging these into a single DNA segment “cassette” would require joining together fragments to achieve lengths of ˜10,000-˜100,000 bases. Making the genomes of the smallest single celled organisms, such as mycoplasma, also requires making ˜100,000 base fragments, and the larger chromosomes of bacteria, fungi, or other simple organisms would require making DNA in the 1 megabase-10 megabase regime. Thus, the field of synthetic biology requires methods to produce ultimate products in the length range of 10,000-10,000,000 bases.
In practice, the classical method of Gibson assembly, and related methods, are used to assemble oligomers towards these greater lengths. Such methods generally take the input fragments, and pool them in an assembly reaction in solution. Such approaches are limited in the length and sequence content of the resulting products. The result is that using such methods, there is substantial process complexity, high cost, long processing time, along with substantial sequence content restrictions, and a high rate of failed assembly reactions. Therefore, new methods of assembly not having these deficiencies are beneficial for the future of bioengineering.
Nanochannels are nano-fluidic structures that are used to isolate small numbers of molecules. Such constructs were introduced in the 1990's. They consist of a long fluid channel, where the width and depth are on the nanometer (nm) scale, in the range of 10-200 nm, and the channel length dimension is much larger, typically in the range of microns to millimeters. Such channels, when loaded with solution, and covered, can also be loaded with molecules, such as through the application of pressure (to drive solution in) and/or voltages, to drive charged molecules into the channels. Studying the behavior of molecules loaded into nanochannels is the basic aspect of the field of nanofluidics.
In an aspect, a DNA assembly device is disclosed. The device includes a chip having at least one nanochannel integrated therein, the at least one nanochannel having a reaction zone where ends of at least two distinct DNA molecules can be maintained proximate to one another; and circuits and electrodes for controlling movement of the at least two distinct DNA molecules.
In embodiments, movement of the at least two DNA molecules is controlled electrically. In embodiments, the DNA molecules have lengths of 50-1,000 bases, or 1,000-10,000 bases, or 10,000-100,000 bases, or 100,000-1,000,000 bases, or 1,000,000-10,000,000 bases, or more than 10,000,000 bases. In certain embodiments, the chip contains multiple nanochannels, each of which can operate independently. In certain embodiments, the chip is a CMOS chip. In certain embodiments, the top of the at least one nanochannel is sealed.
In embodiments, the surfaces of the at least one nanochannel are functionalized. In embodiments, at least one portion of the at least one nanochannel is constricted. In certain embodiments, the reaction zone is fluidically connected to an end of the at least one nanochannel. In certain embodiments, the reaction zone is configured along the length of the at least one nanochannel.
In another aspect, a system is disclosed that includes first and second devices as disclosed herein, wherein each of the respective nanochannels are fluidically connected.
In another aspect, a system is disclosed that includes a device as disclosed herein having multiple nanochannels. In this system, the device contains a network of nanochannels. In embodiments, the number of networked nanochannels is 1-10, or 10-100, or 100-1000, or 1000-10,000, or 10,000-100,000, or 100,000-1,000,000, or more than 1,000,000. In embodiments, the chip is a CMOS chip.
In another aspect, a method of joining two DNA molecules is disclosed. The method involves introducing a first DNA molecule to a first nanochannel, wherein the first nanochannel contains a reaction zone; introducing a second DNA molecule strand to a second nanochannel, wherein the second nanochannel is also connected to the reaction zone; and electrically controlling the movement of the first and second DNA molecules strands to the reaction zone where the first and second nucleotide strands undergo a series of joining reactions. In certain embodiments, the first and second nanochannels are the same. In certain embodiments, each of the DNA molecules contain a mechanical blocking group. In certain embodiments, the method further involves removing the mechanical blocking groups following the joining of the first and second DNA molecules. In certain embodiments, the DNA molecules contain unique barcode elements for tracking each of the DNA molecules. The DNA molecules may have lengths of 50-1,000 bases, or 1,000-10,000 bases, or 10,000-100,000 bases, or 100,000-1,000,000 bases, or 1,000,000-10,000,000 bases, or more than 10,000,000 bases.
Further, and without limiting the foregoing, disclosed herein are nanochannel device compositions that are useful for joining single molecules of DNA to form longer contiguous single DNA molecules. This has the advantage that nanochannels can hold and move long single molecules of DNA with much less risk of the DNA strand breaking, as happens commonly when fragments of DNA are manipulated by classical bulk solution phase methods such as pipetting, mixing, centrifuging, and gel electrophoresis.
Further, disclosed herein are nanochannel devices and methods that can be used to join given single molecule strands of DNA to form longer continuous single molecule DNA strands. This includes devices and methods for joining strands that may be in the length range of 100-1000 bases, or in the range of 1,000-10,000 bases, or 10,000-100,000 bases, or 100,000-1,000,000 bases, or 1,000,000-10,000,000 bases, or strands longer than 10,000,000 bases. This has the advantage that such devices and methods can support the joining of such long strands with much less risk of breaking them or the resulting strand than classical bulk solution methods. This also has the advantage of requiring much less DNA material than classical bulk solution methods, since the operations are done at the single-molecule level. This also has the advantage that such constructs can be tracked and collected at the single molecule level, thereby avoiding the need for subsequent purification or isolation processes.
Further, disclosed herein are nanochannel devices and methods that can be used to join strands through a series of pair-wise joining reactions, that can be used join from 2-10 strands in series, 10-100 strands in series, or 100-1000 strands in series, or more than 1000 strands in series. These pairwise assemblies can be further joined in a hierarchical assembly process. This has the advantage of producing long products while eliminating constraints on the DNA sequence content of the resulting strand, because each segment is added without restriction on how its sequence is related to that of the growing strand, as it is prevented by its extended form in the nanochannel from having physical interactions with the rest of the strand. In contrast, classical assembly methods, such as the well-known bulk phase Gibson assembly, allow the segments and growing strand to interact, which can result in incorrect joins, or interference in such joins. These classical unwanted interactions typically result in constraints whereby joining can only be efficiently performed if the sequences are not too repetitive, low-complexity, or self-complementary.
Further, disclosed herein are nanochannel devices and methods that can be used to join together a series of DNA strands, where each strand in the series is further selectable from a selection of N strand options provided for each segment, all under nanochannel control. Here N may be in the range of 2-10 options, 10-100 options, or 100 or more options, and the strand options may be different for each segment in the series. By these means, the resulting DNA fragment F can the combinatorial structure of F=S1−S2− . . . −Si−. . . −Sn; where each DNA segment Si is selected from a set of Ni≥1 total options Oi={Si,1, Si,2, . . . , Si, Ni}, for i=1, . . . , n. The total number of possible combinations of parts is C=N1×N2× . . . ×Nn, which can be a very large number of possible combinations. The advantage of this approach is that it allows greater efficiency, speed, and low reagent consumption, as well as allowing for much longer products without breakage, and allowing for better tracking of the resulting products, when performing such combinatorial assembly methods from input sets “parts”, as compared to classic bulk fluidic methods. The advantage is also that the nanochannel devices can be embodied in a highly scalable fashion, to enable production of a much larger number of such combinations, and at a greater rate than is possible with classical bulk assembly methods.
Further, disclosed herein are nanochannel devices and methods that can perform the above joining processes in parallel, and under electronic control, such that many such joined strands may be formed in parallel, and rapidly and efficiently through electronic control. This has the advantage of providing scalable devices, that can carry out many such reactions in parallel. The number of parallel strand assembles on one device may be up to 10, up to 100, up to 1000, up to 10,000, or up to 1 million or more.
Further, disclosed herein are complementary metal-oxide semiconductor (CMOS) chips devices that comprise the nanochannels for the above, and which can apply the electronic control of the operations for the nanochannels for the above. This has the advantage that the CMOS chips enjoy of the greatest existing manufacturing base among all types of semiconductor chips, and the greatest capacity for production and low-cost mass manufacturing, providing for both the fundamental cost reductions for deploying the nanochannel DNA assembly processes disclosed, as well as scalability of the number and complexity of such devices that can be implemented on a chip, or per square millimeter of chip area.
Further, disclosed herein are methods of monitoring and checking such nanochannel assembly processes, through various sensor modalities built into the devices. This has the advantage that it can be known whether the steps of assembly, or final assembly, of strands has been properly performed, thereby allowing a higher yield of properly formed target strands.
Further, disclosed herein are systems that make use of such nanochannel devices and methods for the production of many long strands of DNA formed by joining shorter strands. This provides the advantage of enabling end-to-end automated processing, to go from input fragments to output long DNA joined strands, and with tracking of the final output strands.
Further, and without limiting any of the foregoing, disclosed herein are compositions, devices, methods, and systems for the fabrication of long DNA molecules for applications in synthetic biology.
A DNA assembly device is disclosed. The device includes a chip having at least one nanochannel integrated therein, the at least one nanochannel having a reaction zone where ends of at least two distinct DNA molecules can be maintained proximate to one another; and circuits and electrodes for controlling movement of the at least two distinct DNA molecules.
Additionally, a system is disclosed that includes first and second devices as disclosed herein, wherein each of the respective nanochannels are fluidically connected. Another system is disclosed
that includes a device as disclosed herein having multiple nanochannels. In this system, the device contains a network of nanochannels. In embodiments, the number of networked nanochannels is 1-10, or 10-100, or 100-1000, or 1000-10,000, or 10,000-100,000, or 100,000-1,000,000, or more than 1,000,000.
Additionally, a method of joining two DNA molecules is disclosed. The method involves introducing a first DNA molecule to a first nanochannel, wherein the first nanochannel contains a reaction zone; introducing a second DNA molecule strand to a second nanochannel, wherein the second nanochannel is also connected to the reaction zone; and electrically controlling the movement of the first and second DNA molecules strands to the reaction zone where the first and second nucleotide strands undergo a series of joining reactions.
As used herein, the term “DNA” in various contexts may refer to single stranded or double stranded forms of the molecule. The term “DNA” may also in various contexts refers not only to strands composed of the four bases A, C, G, T, but also of ribonucleotides such as in RNA, other base analogues, such as U (uracil), I (Inosine), and other well-known universal bases or base analogues or modified or marked bases, including well-known epigenetics marks on bases, such as 5mC (5-methyl-C), as well as dye-labelled bases, or bases modified for future labelling or conjugation, such as biotinylated bases, or thiolate bases, and in general any other widely known modified forms of bases used in DNA oligos, including possible modifications in the sugar or backbone of DNA as well. In addition, where it makes sense in context, the term DNA encompasses other nucleic acid (NA) polymers such as RNA (Ribo-), PNA (Peptide-), LNA (Locked-), and diverse forms of XNA (Xeno-).
As used herein, the terms “DNA assembly” or “joining DNA” refers to any process for physically connecting together two or more existing DNA molecule strands, with the connection at or near the ends, to produce a single molecule strand. In various contexts, such strands may be entirely single stranded DNA, or entirely double stranded DNA, or DNA that is partially single stranded and double stranded. Such strands that are assemble or joined may be connected by covalent phosphate backbone bonds, or may be joined through the hydrogen bonding of complementary regions, or in some contexts may be joined through other chemical reactions and chemical groups, such as carbon chain linkers from the end of one backbone to the start of another.
As used herein, the term “nanochannel” refers to any channel structure that is nanometer (nm) scale in its width and depth dimensions, such as up to 10 nm, or up to 100 nm, or several hundred nm, and substantially longer in its third dimension of length, such as 1000 nm or longer, 10,000 nm or longer, or 100,000 nm or longer, or up to 1 millimeter (mm) or longer, or 10 mm or longer. Such channels may be straight, curved, or branched in various contexts. Such channels may reside in a single plane, or may extend into 3D within a material substrate.
As used here in, “phosphoramidite synthesis” or “the phosphoramidite method” or “chemical synthesis” refer to any of the family of standard or well-known chemical cycles employed for synthesis using phosphoramidite bases, such as those used for commercial DNA oligo synthesis, or those deriving from the original methods such as put forth by Marvin Caruthers.
As used herein, the term “chip” refers to a semiconductor integrated circuit chip. In certain contexts where this is clear, it may refer to a CMOS chip.
As used herein, the term “CMOS”, which is an acronym for “complementary metal-oxide semiconductor”, and refers to chips that are made by the CMOS process.
As used herein, “DNA synthesis” refers to fabrication of physical DNA through a series of chemical reactions, in accordance with producing a desired specific target sequence or sequences.
As used herein, “DNA sequencing” refers to processes for reading the identities of the series of bases in a DNA strand or strands.
As used herein, the term “PCR”, which is an acronym for Polymerase Chain Reaction, generally refers to any means of amplifying or copying DNA, including by thermo-cycling PCR, or isothermal PCR reactions, or generally any other processes that can be used to amplify or copy DNA.
As used herein, the term “error correction” or “error correcting code” or ECC, refer to means of transforming a primary data string into another data string or strings such that various types of errors or corruptions of the source string be detected and corrected to recover the source string. In contexts where this makes sense, these may also refer to the process of using such encodings to correct errors. Many such methods are well-known for error correction for the transmission of binary strings, i.e., strings composed of the two symbols “0” and “1”. Many of such well known methods have versions that extend to 4 symbol strings, such as DNA sequences composed of the 4 symbols “A”, “C”, “G”, and “T”, or conversely two DNA letters may be used as binary string symbols, such that all binary ECC methods may apply directly. Any such methods are encompassed by use of this term, as they may apply in context.
Disclosed herein are compositions, devices, methods, and systems related to using nanochannels for the joining, or “assembly” of multiple shorter DNA molecules to form a single longer continuous DNA molecule.
In an aspect, a DNA assembly device is disclosed. The device includes a chip having at least one nanochannel integrated therein, the at least one nanochannel having a reaction zone where ends of at least two distinct DNA molecules can be maintained proximate to one another; and circuits and electrodes for controlling movement of the at least two distinct DNA molecules.
In embodiments, movement of the at least two DNA molecules is controlled electrically. In embodiments, the DNA molecules have lengths of about 50-about 1,000 bases, or about 1,000-about 10,000 bases, or about 10,000-about 100,000 bases, or about 100,000-about 1,000,000 bases, or about 1,000,000-about 10,000,000 bases, or more than about 10,000,000 bases. In certain embodiments, the chip contains multiple nanochannels, each of which can operate independently. In certain embodiments, the chip is a CMOS chip. In certain embodiments, the top of the at least one nanochannel is sealed. Various methods and implements can be used to seal the nanochannel.
In embodiments, the surfaces of the at least one nanochannel are functionalized. In embodiments, at least one portion of the at least one nanochannel is constricted. In certain embodiments, the reaction zone is fluidically connected to an end of the at least one nanochannel. This permits a DNA molecule from one nanochannel to interact with a DNA molecule from a second nanochannel. In certain embodiments, the reaction zone is configured along the length of the at least one nanochannel.
In another aspect, a system is disclosed that includes first and second devices as disclosed herein, wherein each of the respective nanochannels are fluidically connected.
In another aspect, a system is disclosed that includes a device as disclosed herein having multiple nanochannels. In this system, the device contains a network of nanochannels. In embodiments, the number of networked nanochannels is about 1-about 10, or about 10-about 100, or about 100-about 1000, or about 1000-about 10,000, or about 10,000-about 100,000, or about 100,000-about 1,000,000, or more than about 1,000,000. In embodiments, the chip is a CMOS chip.
In another aspect, a method of joining two DNA molecules is disclosed. The method involves introducing a first DNA molecule to a first nanochannel, wherein the first nanochannel contains a reaction zone; introducing a second DNA molecule strand to a second nanochannel, wherein the second nanochannel is also connected to the reaction zone; and electrically controlling the movement of the first and second DNA molecules strands to the reaction zone where the first and second nucleotide strands undergo a series of joining reactions. In certain embodiments, the first and second nanochannels are the same. In certain embodiments, each of the DNA molecules contain a mechanical blocking group. In certain embodiments, the method further involves removing the mechanical blocking groups following the joining of the first and second DNA molecules. In certain embodiments, the DNA molecules contain unique barcode elements for tracking each of the DNA molecules. The DNA molecules may have lengths of about 50-about 1,000 bases, or about 1,000-about 10,000 bases, or about 10,000
For considering nanochannel dimensions that may be relevant for holding DNA molecules, it is noted that single stranded DNA molecules have a width of ˜1 nm, and double stranded DNA molecules have a width of ˜2 nm, and the length of such molecules, fully extended, is approximately 3.4 microns for every 1000 base pairs for double stranded DNA in a classical double helix (B conformation), or for single stranded DNA, 7.5 microns from every 1000 bases. Thus, for nanochannels to be of use for joining DNA, the channel width and depth must be >2 nm. The channel dimensions should also preferably be less than the persistence length of DNA, so as to discourage the DNA from balling-up in the channel, and thus in preferred embodiments would be <˜50 nm or <˜100 nm for dsDNA and <˜10 nm for ssDNA. In preferred embodiments, groups can be bound to the DNA, such as single-stranded-binding protein for ssDNA, and can increase the stiffness and persistence length of the DNA, enabling larger channel widths and depths, especially for ssDNA. The length of DNA dictates the channel lengths required for joining and manipulating DNA, and thus channels that in preferred embodiments may encompass the full length of molecules with 1000 bases to 1 million bases would imply channel lengths from several microns up to several millimeters, and similarly longer for even longer fragments.
The substrate material may be a semiconductor material, such as silicon or silicon oxide or quartz, or other semiconductors or semiconductor oxides, or metals or metal oxides, such as aluminum oxide, or a polymer or plastic material, or a glass or quartz, or a metal, or any material that would allow the fabrication of such a channel, many of which would be obvious to those skilled in nanofabrication. In preferred embodiments, the substrate material is an electrical insulator, so that voltages may be applied between the ends of the channel, without producing electrical currents in the substrate. In preferred embodiments, the material is silicon dioxide or aluminum oxide. Such channels may be formed, in preferred embodiment, by the standard methods of lithographic patterning, material deposition, and etching or liftoff processes.
Such channels may also have surfaces and sidewalls that are coated or functionalized to promote fluid flowing into the channels, such as treating the walls to make them hydrophilic, and also to prevent fouling or adhering of molecules to the wall. Such methods may include salinization, or other well-known coating methods that would be obvious based on this disclosure.
In certain embodiments, the assembly reaction zones may comprise zones located at locations where incoming fragments for assembly enter the chip, such that in such zones, a multiplicity of an incoming sequence fragment, in the 100b to 20 kb size range, and preferably in the 1 kb-10 kb size range, are delivered to a reaction zone at an inlet to the chip, and a long fragment resident in a nanochannel in contact with this zone is positioned so that an end to be joined to the incoming fragment is in the reaction zone, and is allowed to potentially react with the multiplicity of fragments introduced to the zone. This may have the benefit of accelerating the reaction, due to the multiplicity of targets, and may also facilitate providing other reagents needed for assembly, such as enzymes, which may otherwise be limited in delivery if they must traverse an extended nanochannel to reach the reaction zone. In preferred embodiments, the incoming fragments will have a single reactive end, relative to the long partner for the reaction, so that only a single extension reaction may occur in the reaction zone, extending the long fragment from the nanochannel.
For the basic joining as shown in
Of note, many functionally similar variations on the disclosed architecture are obvious based on the present disclosure, and all of these are intended to be covered by the present disclosure.
In various embodiments, it is desirable to be able to monitor and detect that DNA strands, or other particles, have entered various channels or are located at certain points within a channel. Ways to monitor this include dye labeling of the DNA fragment, and use of optical microscopy to detect the presence of the molecule in various channels. The ways to monitor also include incorporating electronic sensors in the nanochannel, such as impedance sensors, or impedance spectroscopy sensors that measure across the channel at a check point of interest. Such sensor may also be deployed at motion stop points, to detect the presence of the DNA, or of the stop molecule.
The nanochannel device architectures of
The basic nanochannel device or chips can be integrated into a system for the controlled production of long DNA fragments, built from various shorter building blocks in parallel.
The nanochannel devices and methods disclosed herein, in certain embodiments, may further make use of these features to enhance performance, as detailed below.
In certain embodiments, the nanochannel device is provided with methods for monitoring, tracking or detecting the location of DNA or other useful particles in the channels. Such methods in preferred embodiments provide the ability to determine the location of one or both ends of the DNA molecule in a channel, at a point in time, or continuously in time, or to determine the time at which an end of DNA molecule arrives at or is present at some location, or to determine at a given point in the channel whether a DNA molecule is proximate or not. Such location monitoring can provide benefits for the assembly process, such as the ability to position the DNA strands with greater accuracy to facilitate their efficient assembly, to check or verify that assembly has occurred, and to track the growing strand through the assembly process, so as to be able to maintain the identity of a strand and at all times to know the current sequence of the strand under assembly. Methods that may be used for such location monitoring may encompass:
Optical monitoring may be performed using an external optical or imaging sensor, to detect an optical reporter signal originating from the DNA, or integrated optical sensors embedded in, or proximate to, the channel at one or multiple locations. The optical signal may arise from intrinsic fluorescence of the DNA, or from a stain or dye label applied to the DNA. This may include sequence specific labels, such as dye-labeled oligos that hybridize to the DNA strand, or optical barcodes based on sequence or sequence motif-specific binding of optical reporters. The means for achieving this encompasses many such alternatives and combinations for optical monitoring that are obvious to those skilled in the art of optical mapping of DNA, and microscopy of DNA, and optical sensors.
Electrical monitoring may be performed using various types of electronic sensors that are embedded in or otherwise integrated into the channel at one or multiple points. Such sensors may encompass localized transverse impedance monitors, localized nanochannel constriction/nanopore ionic current sensors, or localize charge proximity sensors, or redox electrochemical sensors. The use of such sensors may be enabled or enhanced by the use of electronic labels or reports on the DNA strand, such as sequence specific or sequence motif-specific bound oligos providing electronic signature labels integrated into the oligo, such as charged groups, conductive groups, redox-active groups, polarizable groups, or size-exclusion groups, or groups that otherwise produce localized changes in conductance or capacitance. Such labelling may further encompass electronic barcoding for tracking of motion of fragments, through a spatial barcode pattern of signals along the DNA, or other electronic signatures that provide an identifying signature for tagging and tracing the specific identity of DNA strands. The means for achieving this may comprise many such alternatives and combinations for electronic monitoring that are obvious to those skilled in the art of electronic sensors.
When multiple DNA strands are being processed in one or multiple nanochannels, it may be beneficial to be able to assign distinct identifiers to the strands, to enable tracking, distinguishing and determining the identities of the distinct strands at various times in the processing. Methods for this may comprise location-based identifiers, starting with knowing the location of a strand in a channel at a point in time, such as the point where the strand is introduced to a channel. This location-time stamp can become the identifier and may be affiliated with later location-time identifiers, to track strand identity. In other preferred embodiments, a spatial pattern of a label along the strand can provide a spatial barcode (such as using optical or electronic labels) on fragments, and this spatial barcode may be used to provide the unique identifier. Other means of creating a unique signature may be used, such as single labels that create a unique, repeatable signal signature in an electrical or optical sensor. The means for achieving this may comprise many such alternatives and combinations that are obvious to those skilled in the art of optical and electrical labeling methods.
As disclosed, any combination of parameters may be used to uniquely identify a strand/associate it with its point of origin or history throughout the assembly process on the chip, including overall length and mix of optical and electronic barcodes associated with each molecule. In some cases, the unique labels can be imparted by incorporation of modified or unnatural triphosphates during a step of templated enzymatic extension. Modified triphosphates may be used to impart all manner of the disclosed labels, including fluorophores, reactive handles for fluorophores and other post-synthetic derivatization. Triphosphates including these reactive handles may be functionalized with aminoallyl, azide, alkyne, thiol, alkene, amino, biotin, or desthiobiotin. It is preferred that the incorporation events are of relatively low probability relative to the natural incorporation events so that there are few modifications throughout a long strand, increasing the probability that the distance between the modifications provides a unique ‘code’ for each molecule that may help uniquely differentiate it. Generally, there is a degree of kinetic discrimination against incorporation of modified triphosphates relative to native ones, and generally the ratio of modified dNTP:natural dNTP in any given extension will be chosen to correct for this. In some embodiments, the ratio may be between 1:10, 1:100, 1:1,000, 1:10,1000, or 1:100,0000.
In other embodiments, strand labelling modifications may be imparted by binding another agent to a strand in a sequence specific manner. This may include proteins which have preferential binding motifs or designed nucleic acid probes which can bind as a triplex or invade a dsDNA duplex in a programmable manner, such as sequences capable of binding certain motifs through Hoogstein paring or PNA probes. In some embodiments, the presence of the binding molecule may be sufficient for labelling, as the additional bulk may provide an increased level of current deflection through a channel or pore. As before, the binding molecule may itself contain another label or a functional handle for post-synthetic installation of such a label. An advantage of this mode of labelling is that it is non-covalent and potentially reversible with change of local conditions such as pH or temperature.
Either mode of labelling may be performed ‘on-chip’ throughout various points of the process or in bulk reactions prior to the loading material on the chip. Some embodiments, particularly those utilizing bulk labelling, may include an optional step of filtering strands which are not uniquely identifiable relative to other strands in the ensemble of an assembly reaction. Filtering may be accomplished, for example, by selective use of potential to eject unidentifiable constructs from channels so that they may be used as substrates for further labelling or end up as waste.
When performing a process to assemble strands in a nanochannel, it may be beneficial to have means to assess when a join has occurred, or to assess whether an attempt at joining was successful. Such monitoring to assess whether two strands have physically joined may encompass electrical or optical detection, in conjunction with a test method that provides a determination of the join having occurred or not. In one preferred embodiment, the methods comprise using forces applied so as to pull the strands apart, such as pulling on one end, or in opposing directions on the two strands, and using the location detection sensors to determine if the two strands move in a highly correlated way or not under such a test, thereby indicating they are joined, or not. In another preferred embodiment, the method encompasses making a strand length measurement, to assess whether the strand length corresponds to the joined length, or not. Such a method may comprise pulling one end to move the strand into a process that performs a length measurement of the strand, based on either detecting locations of the ends, or measuring a transit time past a sensor, and using the transit time or strand velocity/mobility to determine if a join has occurred or not. In another preferred embodiment, the method may comprise the use of detectable barcodes on each fragment, assessing whether the molecule has the combined barcode of the two fragments or not.
When performing a process to assemble strands in a nanochannel, it may be beneficial to have methods to detect assembly errors, and to correct such errors. In preferred embodiments, the error detection may comprise measurement of a DNA strand length or length-related feature such as mobility or velocity, or a barcode, or other feature of the strand that indicates an incorrect assembly via not matching to the expected properties of the intended assembled strand. In preferred embodiments, the corrective methods could comprise discarding failed assemblies, by moving the residual fragments to a disposal site, or may comprise retrying the assembly, such as in the case of a failed join.
To assemble DNA strands in a nanochannel, it is beneficial to have means to control the motion of DNA strands in the channels. Such controls may be beneficial for bringing the ends to distinct strands together with precision for efficient joining reactions, for the routing of fragments through networks of channels, to stage fragments as needed to carry out a specified series of joins, and for checking whether joins have occurred through coordinated strand motion and detection, and for the error detection and correction process on the assembled strands as indicated above. Motion control may comprise the use of passive controls on motion and position, as well as active controls based on a feedback loop coordinating input readings from the strand position monitoring sensors, and from the strand identity monitoring sensors, all in conjunction with application of the driving forces to actuate motions, which could be fluidic or electrical.
Passive motion control comprises applying motion driving forces to a strand, in combination with physical structures that result in the strand coming into an equilibrium position at a defined location, or with a strand end at a defined location. This does not require a sensor to provide any information on the strand location, that is determined by the equilibrium between the driving force, and the stop restricting further motion when engaged with the strand. Methods for such passive control may comprise physical motion stops or barriers that restrict further strand motion once the strand or strand end has contacted the stop or barrier. Methods for producing such a stop may comprise adding a large physical side group to the strand, which hits against a stop obstruction extending into the channel, thereby creating a localized restriction in channel width or cross section. Or such groups may be pinned within a trap, such as a localized indentation extending into the side of the channel, that is capable of catching the group. Such a group could in preferred embodiments comprise a magnetic or charged group that is held via magnetic or electrical forces. Such stops may be located at the end or ends of the strands, to thereby control the motion or portion of the strand end. In preferred embodiments such groups may comprise a DNA oligo hybridized to a single-stranded segment of the DNA strand, where the oligo may itself be the stop group (protruding from the primary single stranded DNA strand) or may carry on it a group that acts as the stop group, such as a bulky molecule or nanoparticle attached to the oligo. In preferred embodiments such stop groups may comprise a DNA hairpin, secondary structure, or knot at or near the end of the strand, acting as the stop. Such secondary structure stops provide the benefit that they may be displaced or dislodged by applying a sufficiently strong pulling force on the DNA. Stops at one end may serve the purpose of controlling the motion and location of that end, or they may be used to fix the location of that end, for the purpose of controlling the motion or position the other free end, under a force that extends the DNA strand in the channel.
The active motion control may comprise taking inputs from the location monitoring sensors, and use these sensor inputs in control logic that turns on or off the driving forces for strand motion or length extension (stretching), or maintaining position, so as to bring the strand or one or more ends of the strand to desired locations, at desired times.
Electrically controlled motion, whether passive or active, can be used to move DNA through the nanochannels (using DC or AC methods), or to precisely localize DNA at a point within a nanochannel (using DC or AC methods).
For purposes of motion and location control, it may be beneficial to prepare or condition the DNA strand to provide it with more favorable mechanical properties. In preferred embodiments, such conditioning may comprise means to make the strand more stiff, rigid and straight, or increase its mechanical persistence length, relative to the properties of the native (ssDNA or dsDNA) form of the strand. Means for stiffening the strand may comprise coating it with ssDNA binding proteins, dsDNA binding proteins, or other chemical agents that bind to or intercalate the DNA, or making more of the DNA strand into the dsDNA form rather than ssDNA form, by binding it to complementary DNA strands, or DNA analog strands that may even further enhance stiffness (such as PNA or LNA strands). In other preferred embodiments, the conditioning of the DNA strand mechanical properties may comprise altering the solution medium containing the DNA, to include gel matrix material, to increase the solution viscosity, or to include other matrix materials or reagents that prevents long DNA fragments from breaking or tangling. Many such means of stiffening DNA or reducing breakages or tangles are obvious to those skilled in the art of nucleic acid biochemistry.
In other embodiments, particles could be added to the DNA strand, which may have specific properties to assist in the assembly process. The particles may be at or near the terminus of one end, where the other end is undergoing the assembly process or otherwise available for enzymatic reactions. The particles may serve a number of purposes throughout the disclosed assembly procedures, including acting as additional means of controlling DNA mobility in inlets or through channels, wherein the size and nature of the particle may promote or discourage the entry of a DNA strand into a channel, or stop passage at particular points within the chip as detailed above. Particles may also act as solid-phase anchors during fluidic exchange as various buffers and reagents are passed through the channels, provide additional types of barcoding, where the particle size, fluorescence, or other property aids in strand identification, or otherwise alter the electrokinetic properties of a strand. The response of a strand-particle conjugate in response to applied AC or DC field may be greater or less than that of the strand alone, depending on the particulars of the strand size, particle size, and particle material. The particles may be comprised of any material typically associated with nanoparticles or microspheres to skilled artisans and can include polystyrene spheres, coated iron oxide particles, silica particles metal nanoparticles, quantum dots, diamond or carbon spheres, proteins or large complexes proteins, or resins commonly used for affinity chromatography. The nature of the conjugation between particle and DNA strand for assembly may be covalent or non-covalent, though in general non-covalent conjugates are easier to reverse. A preferred mode of reversible conjugation is hybridization, where a particle is coated with a capture oligonucleotide sequence at least partially complementary to the DNA strand undergoing assembly. In some embodiments, the duplex DNA strand can undergo hybridization if it is prepared with ‘sticky ends’, such as those generated by exonucleolytic digestion of one strand at its termini. Other embodiments may instead rely upon use of strands on the particles which are capable of binding to duplexed DNA in a sequence specific way, either by forming triplexes or by invasion of the duplex itself. There are diverse methods of reversing the conjugation once it has been formed including temperature, pH, applied electric potential, or by performing any chemistry which may cleave the particle bound complement from the particle itself. The precise chemistry depends upon the linkage between the capture oligonucleotide but there are diverse linkers and cleavage methodologies known to skilled artisans. Methods compatible with reuse of the particle are generally preferred.
It is generally preferred that a given particle is bound to only a single strand for DNA assembly at a time and aspects of the invention may include methods for associating particles and assembly strands in a 1:1 ratio. In some cases, assembly strands may be loaded into the channels prior to association with a particle, then repositioned so that a termini is protruding from an accessible region of the channel, where a particle may be positioned in close proximity to capture the assembly strand. In some embodiments, the particles may reside on one face of a nanochannel array like that shown in
There are diverse assembly strategies and workflows which are compatible with the disclosed chip designs and capabilities. In some embodiments, assembly strands are loaded into the chip, like that depicted in
It is beneficial for a nanochannel assembly device to have a means for the Input of the DNA strands for processing and Output of the post processing resulting strands. These constitute the primary I/O capabilities of the device. In preferred embodiments, the input allows for many component DNA pieces to be entered into the channels. The incoming fragments may be oligos, with lengths as short as 25 or 50 bases, or more preferably fragments that are at least 100 to 500 bases, or at least 1000 bases, or 5000 bases or 10,000 bases, or 20,000 to 50,000 bases. In preferred embodiments, multiple identical fragments representing the component sequences of interest are transported into the nanochannels at multiple channel inlets, and the number of such inlet channels may be 1, or up to 10, or 100, or 1000, or 10,000, or 100,000 or 1,000,000 inlet channels. Incoming solutions carrying the different sequences may interface to 1 or multiple channels, up to 10, up to 100, or up to 1000 channels being fed by the same incoming solution of DNA fragments, with single molecules from this pool entering various of the multiple channels. In preferred embodiments, incoming pools may be transported by microfluidic lines or lanes interfacing to the 1 or more channels, or as droplets moving on an electrowetting digital microfluidics array to provide the droplets of DNA fragments to interface to the nanochannels.
The output DNA fragments in preferred embodiments will pass from nanochannels into holding channels, or isolation chambers or wells of a well plate, such as a 96-well plate or 384-well plate, or into droplets that are moved in a droplet microfluidic system, such as an electrowetting digital microfluidics system, or into the lanes or lines of a nanofluidic or microfluidic system. In preferred embodiments, the outputs media and physical format may comprise solutions or material matrices, such as gels, or processes and geometries that that avoid breaking long DNA as they exit the device.
In certain embodiments, the output of DNA strands may interface, via microfluidics, to other microfluidics chips for further processing.
Without limiting any of the foregoing or anything else detailed herein, the present disclosure discloses nanochannel devices with measuring sensors for checking and tracking assembly of DNA molecules. In an aspect, CMOS-Chip-based versions of nanochannel assembly devices are discloses. In an aspect, methods for assembling DNA molecules using such devices are disclosed. Additionally, methods for serial addition of segments to growing DNA strands are disclosed. Additionally, methods for parallel assembly of DNA strands are disclosed. Additionally, massively parallel nanochannel systems for DNA assembly are disclosed. Additionally, use of passive and active stops for precision motion control during assembly are disclosed. Additionally, nanochannel assembly devices and methods that comprise specific elements of function are disclosed. Additionally, barcoding and sensor-based tracking of fragments passing through the disclosed systems is contemplated. Input DNA molecules may have lengths of about 50, about 100, about 500, about 1000, about 2000, about 5000, about 10,000, about 20,000, about 50,000, or well above 50,000 bases, as has been provided for herein.
In certain embodiments, the number of joins may be about 1, about 10, about 100, about 1000, about 10,000, to about 100,000 or more joins. Further, the number of parallel strands processed on a device may be about 1, about 10, about 100, about 1000, about 10,000, about 100,000, or about 1 million or more.
In an aspect, a system is disclosed that includes first and second devices as disclosed herein, wherein each of the respective nanochannels are fluidically connected. In another aspect, a system is disclosed that includes a device as disclosed herein having multiple nanochannels. In this system, the device contains a network of nanochannels. In embodiments, the number of networked nanochannels is 1-10, or 10-100, or 100-1000, or 1000-10,000, or 10,000-100,000, or 100,000-1,000,000, or more than 1,000,000. In embodiments, the chip is a CMOS chip. The system may include the nanochannel(s) being equipped with measuring sensors for tracking the DNA molecules and/or DNA assembly. In certain embodiments, the chip disclosed herein is integrated into a system for the synthesis and assembly of DNA.
This application claims priority to and the benefit of U.S. Provisional Patent Application Ser. No. 63/291,124, filed Dec. 17, 2021, and entitled “Electronic Assembly of Long DNA Molecules for Synthetic Biology”, the disclosure of which is incorporated herein by reference in its entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2022/053406 | 12/19/2022 | WO |
Number | Date | Country | |
---|---|---|---|
62291124 | Feb 2016 | US |