The present invention is in the field of molecular biology, more precisely in the field of oligonucleotides synthesis, more precisely in the field of preparation of oligonucleotide arrays.
2D oligonucleotide arrays are widely used in molecular biology.
Oligonucleotide arrays may be prepared by in situ synthesis on the surface from individual nucleotides (Agilent, Affymetrix). During such synthesis oligonucleotides are synthesized in 3′ to 5′ direction, so that their 3′ ends are attached to the surface. Synthesis in other direction (5′ to 3′) is also possible, but is much more difficult and expensive and currently is not available commercially.
For many applications free 3′ ends of the oligonucleotides on the array are required. The current solution is to immobilize full-length pre-synthesized oligonucleotides. However when large number of types of oligonucleotides have to be immobilized this is very expensive. One of the parameters contributing to the high costs is functional groups required for immobilization. Usually it would at least double the price of oligonucleotides. Besides, full-length oligonucleotides have often a considerable length, which also contributes to the price.
In the present invention we suggest an approach of preparation of oligonucleotide arrays using shorter conventionally synthesized oligonucleotides. From such shorter “blocks” the full-length oligonucleotides are built up. We describe several realizations of this approach using enzymatic reactions to combine those “blocks”.
The invention relates to a method for generating a two-dimensional oligonucleotide array comprising a method for generating a two-dimensional oligonucleotide array comprising
The idea of this invention is to prepare ordered oligonucleotide arrays from two or more pre-synthesized shorter parts—block-synthesis approach. The parts are linked together enzymatically to form a full-length oligonucleotide of a desired sequence. Such approach allows splitting the oligonucleotide sequences into common and unique parts. It gives the possibility to place the functional group on a common part and to minimize the length of the unique parts. Method of the invention allows combinatorial synthesis of position-specific regions. Using combinatorial approach, position-specific regions are generated by linking two or more unique oligonucleotides, so that just few said unique oligonucleotides give rise to a large variety of codes, for example, 10 unique oligonucleotides linked pairwise can produce 100 position-specific regions.
In comparison to preparation of oligonucleotide arrays by spotting of full-length sequences, suggested approach is more cost-efficient, allows flexibility in generating position-specific unique sequences and is less prone to oligonucleotide length restrictions.
In comparison to in situ synthesis of oligonucleotides from nucleotides, current invention allows cost-efficient solution for synthesis of oligonucleotides with free 3′ ends.
Important application of the current invention is preparation of two-dimensional oligonucleotide arrays for construction of sequencing libraries from 2D distributed nucleic acid molecules. Oligonucleotides on such arrays need to have position-specific sequences and free 3′ ends for further enzymatic reactions.
In 2005 we patented Ligation-based synthesis of oligonucleotides with block structure (EP 1 616 008 A2), which suggested synthesizing long oligonucleotides from shorter oligonucleotides in solution. Current application partly uses the same idea in application of oligonucleotide array preparation.
Oligonucleotides may be prepared using any suitable method, such as, for example, the phosphotriester and phosphodiester methods or automated embodiments thereof. In one such automated embodiment diethylophosphoramidites are used as starting materials and may be synthesized as described by Beaucage et al., Tetrahedron Letters, 22:1859-1862 (1981), which is hereby incorporated by reference. One method for synthesizing oligonucleotides on a modified solid support is described in U.S. Pat. No. 4,458,006, which is hereby incorporated by reference. It is also possible to use an oligonucleotide which has been isolated from a biological source (such as a restriction endonuclease digest).
Functional group: an oligonucleotide modification which allows specific binding of oligonucleotides to the surface.
Position labelling or position specific region in an oligonucleotide which should be specific for certain positions on the array.
Within the context of the invention the term junction sequence refers to a defined nucleic acid sequence on one oligonucleotide and to a homologous or complementing sequence on a second oligonucleotide, allowing hybridization of the two oligonucleotides.
Besides position-specific and junction region, oligonucleotides may contain sequences required for later applications of the array (for example, for hybridisation to nucleic acid molecules applied to the array), and other sequences which might have or might not have any particular purpose. For example, there may be regions just for increasing the length of the oligonucleotide, to provide certain melting temperature, to provide binding sites for certain proteins, etc.
The idea of this invention is to prepare oligonucleotides on the arrays from pre-synthesized shorter parts—block-synthesis approach. These parts are linked together enzymatically to form a full-length oligonucleotide of desired sequence. Such an approach makes the use of splitting the oligonucleotide sequences into common and unique parts. It allows to:
Oligonucleotides on array may be covalently or non-covalently bound to the array surface.
Examples for non-covalent binding methods for nucleotides are: Ni-NTA interaction, maltose-maltose-binding-protein interactions, biotin-streptavidin interaction.
Examples of covalent binding methods are: binding of thiol- or amino-modified oligonucleotides to the epoxy-, carboxy-, or aldehyde-modified glass surface, copolymerization of the acrydite-modified oligonucleotides with the acrylamide.
Shorter oligonucleotides from which full-length oligonucleotides are synthesized on the surface of the array may be synthesized using solid-phase synthesis or any other method the synthesis of synthetic oligonucleotides.
Oligonucleotides are spotted to a certain position on a microarray one after another. There may be washing of previous set before adding the next one, or there may be not. If more than two sets of oligonucleotides are used for preparation of full-length oligonucleotide on the surface, enzymatic addition of oligonucleotides may be performed for each set separately, or for all together.
In a preferred embodiment the oligonucleotides in each of the two sets have a length of less than 200 nucleotides, preferably less than 100 nucleotides, more preferably less than 75 nucleotides, even preferably less than 50 nucleotides, most preferably less than 40 nucleotides.
The length of the position specific sequence in either set of oligonucleotides is directly dependent on the amount of “coordinates” necessary on the 2D dimensional array. A label sequence with a length of 10 nucleotides can provide a possible differentiation of 1048576 positions, if it therefore two label sequences, each comprising 10 nucleotides, would be combined it would allow a possible differentiation of about 1.1×1012 spatial positions, which would be suitable to differentiate single oligonucleotides on said array. If only distinct regions on said array should be differentiated a shorter label sequence might be suitable.
The defined junction sequence on both sets of oligonucleotides may or may not be present. If present, the two sets of nucleotides should be able to hybridize to each other over said junction region. Hence the junction region on the second set of oligonucleotides should preferably represent a complement of the region of the first set of oligonucleotides. The length of said region should be sufficient to allow stable binding of the two regions.
In one embodiment of the invention the junction sequence of the oligonucleotides of the first and/or second set has a length of 20 or less nucleotides, in a preferred embodiment the junction sequence has length of 15 or less nucleotides and in a more preferred embodiment the junction sequence has a length of 10 or less nucleotides. In the most preferred embodiment the junction sequence of the first and/or second set of oligonucleotides has a length of 10, 9, 8, 7, 6, 5 or 4 nucleotides.
Oligonucleotides may additionally comprise a capture sequence or a reverse complement thereof for capturing of nucleic acid molecules applied to the array. The capture sequence might be used to limit the number of target molecules. Any sequence may be suitable as capture sequence, non-limiting examples are: sequences of short-tandem repeats, known single-nucleotide polymorphisms or simply a repetitive sequence, random sequence, locus-specific sequence. Capture sequences may be different within one set of oligonucleotides.
The oligonucleotides of one or both of the two subsequent sets of oligonucleotides might include other sequences, e.g. another label sequence or spacer sequences. Additionally, the oligonucleotides of either set might comprise a restriction site or form a restriction site, which is present when the sequences of the oligonucleotides of both sets are fused.
Preferably according to one embodiment of this invention the two sets of oligonucleotides are single-stranded oligonucleotides. In another embodiment of the invention at least one set of oligonucleotides comprises single-stranded oligonucleotides. In yet another embodiment at least one set of oligonucleotides comprises double-stranded oligonucleotides.
To form the solid support it is necessary to connect the sequences of the two or more sets of oligonucleotides. There are several possible ways to do, non-limiting examples include: extension reaction, ligation, recombination, or a combination thereof.
Preferably the connection of the sequences is performed in a way that the oligonucleotides on the array are uniquely identifiable and the combinations of position specific sequences allow an exact identification of the position of the oligonucleotide or a group of oligonucleotides on the two dimensional array.
The array is preferably created on a solid support. The solid support may be made of any suitable material. Preferred but non-limiting examples include the following materials for the solid support: glass, plastic, metal, paper, or a membrane. In a preferred embodiment of the invention the material of the solid support is glass.
The solid support might be coated to allow binding of the oligonucleotides. The coating may be of any material as suitable. In one embodiment the coating is a gel. In a preferred embodiment the solid support is coated with substances to allow the immobilization of the first set of oligonucleotides, preferably by covalent or non-covalent binding. In a more preferred embodiment the coating of the solid support allows non-covalent immobilization of the first set of oligonucleotides. In another embodiment the coating comprises multiple components.
In one embodiment of the invention the first set of oligonucleotides is immobilized on the solid support in an ordered manner. Preferably the immobilization of the different position specific sequences is done in an ordered manner in a way that the first label could act as a coordinate in a coordinate system and would already allow a broad distinction of the regions on the solid support (see
In another embodiment of the invention, the solid support is split in multiple parts, which could later be assembled together (see
The oligonucleotides may be immobilized by any suitable way. Preferably the immobilization is covalent. It is important that the immobilization is stable under conditions, which would cleave double stranded DNA. If double stranded oligonucleotides are used and immobilized, preferably only one strand of the double stranded oligonucleotide is immobilized.
Oligonucleotides of the first set, which had not been immobilized, are preferably removed. The person skilled in the art knows suitable methods to remove non-immobilized nucleotides. Preferably the unbound oligonucleotides are removed by multiple washing steps.
To create a 2D-array according to the present invention it is now necessary to add a second location information, present in the position-specific sequence of the second set of oligonucleotides. In one embodiment of the invention the second set of oligonucleotides is added and the sequence of the second set is added to the immobilized oligonucleotides, thus creating elongated oligonucleotides comprising at least two label sequences. In a preferred embodiment of the invention the sequences of the second set of oligonucleotides are added in a manner, that each single oligonucleotide has a unique combination of labels or only a group of oligonucleotides in close proximity has the same label and each group label is unique. In a preferred embodiment each oligonucleotide has a unique label.
The person skilled in the art readily knows suitable methods to transfer the sequence information onto the first set of oligonucleotides immobilized on the solid support. Non-limiting examples of potential methods include: elongation by a polymerase, ligation, recombination or a combination thereof.
In a preferred embodiment of the invention the subsequent sets of oligonucleotides consist of single-stranded oligonucleotides according to oligonucleotides #surf and #position (e.g.
In another preferred embodiment the subsequent sets of oligonucleotides are connected using ligation, wherein the ligation results in a single stranded immobilized polynucleotide (
Depending on the selected method the two-dimensional array comprises immobilized oligonucleotides, which are single or double-stranded and comprise two label sequences. In a preferred embodiment the 2-dimensional array comprises oligonucleotides comprising two label sequences and the oligonucleotides are ordered in a manner that the labels allow an exact identification of the position of the oligonucleotide or at least a group of oligonucleotides on the array.
Depending on the method, the array comprises single or double-stranded oligonucleotides. For the preferred use of the two-dimensional array it is required, that the array comprises single-stranded oligonucleotides. Therefore if the array comprises double stranded oligonucleotides it is necessary to cleave the double-stranded oligonucleotides to receive a two-dimensional array with immobilized single-stranded oligonucleotides, which comprise at least two position-specific sequences.
The provided two-dimensional array is then suitable for further applications, for example 2d sequencing library preparation.
Synthesis of 71nt and 94nt long oligonucleotides from shorter blocks was performed using nick ligation with T4 DNA ligase.
The 71nt oligonucleotide was obtained by ligating the two oligonucleotides #sc_001 and #sc_010, using the adapter oligonucleotide #sc_002 to bring the oligos together (
The 94nt oligonucleotide was obtained by ligating the three oligonucleotides #sc_001, #sc_015 and #sc_012 (
Both for two- and three-oligos ligation, oligonucleotides were mixed together and ligation was performed in 1×T4 DNA ligase buffer with 1 mM ATP, 15% PEG6000, 40 u/μl T4 DNA Ligase (NEB, #M0202) at room temperature for 15 minutes.
The molar ratios of oligonucleotides participating in the two oligonucleotides ligation are shown in
Similarly, the molar ratios of oligonucleotides participating in the three oligonucleotides ligation (
This example demonstrates that ligation based synthesis of long oligonucleotide is quantitative.
Preparation of oligonucleotide microarrays according to the current invention requires stepwise addition of components of enzymatic reactions to the surface of the chip. Creating a two-dimensional oligonucleotide array where oligonucleotides in each feature of the array have a position-specific code requires:
The sciFLEXARRAYER (Scienion, Berlin, Germany) is an automated non-contact dispensing system of ultra-low volumes. It is capable of distributing down to 100 pl droplets on e.g. a glass surface, and then distributing droplets of another solution to exactly the same positions as the first solution. Even when in between the reactions it is necessary to take the microarray slide out of the machine for washing or scanning, it is possible to return it back and still to preserve the coordinates of the spotting positions.
We used the sciFLEXARRAYER to print 5-biotinilated oligonucleotides on a streptavidin-coated glass slide (PolyAn, Berlin, Germany). We determined the optimal loading concentration of the oligonucleotide for the maximal binding—15-30 μM.
Using the sciFLEXARRAYER it is possible to increase the humidity in the microarray chamber up to 70% to decrease the drying time.
The Agilent 1M microarray has 974016 features with in situ synthesized oligonucleotides, attached to the surface by their 3′ ends. Using enzymatic reactions on the surface we were able to synthesize oligonucleotides, the 5′ parts of which are complementary to the oligonucleotides on the surface and the 3′ parts are single stranded.
An Agilent 1M microarray with 60 nt long oligonucleotides was used. The sequences of the oligonucleotides were the same for all features of the array except for 14 nucleotides in the central part (N14 sequence is the
The scheme of the chip modification is shown on
Primer extension was performed in 1×NEB2 Buffer, 1.5 μM primer #ext, 20 μM dNTPs, 0.5 u/μl Klenow exo (−) polymerase (NEB) at 37° C. for three hours. The chamber was then detached and the slide was washed to remove the reaction components: two times in 1×PBS, 0.1% Triton X-100 at 37° C. for 15 minutes, followed by a single wash in 1×PBS at room temperature for 5 minutes. The slide was dried out with nitrogen flow. A new SecureSeal chamber was attached to the microarray to cover the same surface area as in extension reaction.
The extension products have single nucleotide overhangs at their 3′ ends, added by Klenow exo(−). In the order of descending preference these nucleotides are: A>G,C>T (checked experimentally, results not shown). To provide the maximum efficiency chip modification, ligation was performed in two steps. In the first step duplexes of #ad/#y_064 were added where #ad had either a T or a G nucleotide at the 3′ end. Then the ligation buffer was removed, and ligation mix with duplex where #ad had C or T at 3′ end was added to the chamber.
The ligation reaction was performed in 1×T4DNA ligase buffer, 5 μM #y_064, 6 μM #ad (3 μM #ad_T and 3 μM #ad_G in the first ligation, and 3 μM #ad_C and 3 μM #ad_A in the second ligation), 40 u/μl T4 DNA ligase (NEB) at 37° C. for three hours (each ligation step—1.5 hours). The chamber was then detached and the slide was washed to remove the reaction components: two times in 1×PBS, 0.1% Triton X-100 at 37° C. for 15 minutes, followed by a single wash in 1×PBS at room temperature for 5 minutes. The slide was dried out with nitrogen flow.
To visualize the reaction results primer extension was performed with Cy3 labeled dCTP (2 μM in the extension reaction), and ligation was performed with the oligonucleotide #y_064 with a Cy5 label on the 3′ end.
To estimate the amount of the full-length extension-ligation products, they were washed off from the area of the microarray where the reactions were performed and their quantity was estimated by qPCR with primers #ext and #y_065 (complementary to #y_064). An artificial oligonucleotide corresponding to the extension-ligation product was used as reference. The amount of product is estimates as about 600000 molecules per feature of the microarray.
Number | Date | Country | Kind |
---|---|---|---|
13197470.1 | Dec 2013 | EP | regional |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2014/076888 | 12/8/2014 | WO | 00 |