The invention provides a combinatorial library, a method for preparation of that combinatorial library, a method for sequence identification, a method for sequencing the elements of combinatorial libraries of oligonucleotides and/or oligonucleotide analogues, the use of a linker to generate combinatorial libraries and a sequence identification set. More precisely, the objective of the invention is the ability to “read” sequences of selected elements of combinatorial libraries of freely modified synthetic oligonucleotides. The solution may be used both for researching for leading compounds in the pharmaceutical industry, and as a tool for studying the properties of oligonucleotides in the aspect of their potential use in experimental antisense or antigen therapy. The invention develops an appropriate strategy for determining the structure of isolated library elements, as usefulness of the combinatorial library depends on the ability to recognize the structure of its elements. Thanks to the developed and presented method for sequencing combinatorial oligonucleotide libraries it is possible to indisputably identify the sequences of biologically active elements selected by the combinatorial approach.
Combinatorial synthesis consists in deliberate construction of a set of molecules based on logical design of chemical reactions that lead to linking selected monomers in various combinations. Thus obtained library is then searched in order to identify elements having desired properties. As a result of the selection process, active molecules of unknown structure are isolated from the library.
Following approaches were proposed to solve the problem of “reading” the structure of combinatorial library elements:
A special case are oligonucleotide libraries with elements that do not include modification; in such case, direct amplification of the element, followed by sequencing of the obtained copy, may be applied [C. Tuerk, L. Gold, Science, 1990, 24:505; A. D. Ellington, J. W. Szostak, Nature, 1990, 346:818].
Another method is to infer the library element structure in recursive deconvolution algorithm [E. Erb; K. D. Janda; S. Brenner, Proc. Natl. Acad. Sci. USA, 1994, Vol. 91:11422; K. D. Janda; Proc. Natl. Acad. Sci. USA; 1994, Vol. 91:10779]. The basic assumption is to define sublibraries, or partial libraries, at every stage of oligomer synthesis.
An alternative to recurrent deconvolution is the method consisting in reading the structure of tags coding the sequence of the original library element.[S. Brenner, R. A. Lerner, Proc. Natl. Acad Sci. USA, 1992, Vol. 89:5381; M. C. Needels, D. G. Jones, E. H. Tate, G. L. Heinkel, L. M. Kochersperger, W. J. Dower, R. W. Barret, M. A Gallop, Proc. Natl. Acad. Sci. USA, 1993, Vol. 90:10700]. Reports included oligonucleotide-coded oligonucleotide libraries [P. A. Sacca, A. Fontana, J. M. Montserrat, A. M. Iribarren, Chemistry&Biodiversity, 2004, 1:595], as well as the use of different types of tags not bound in sequences, such as those based on trityl skeleton [M. S. Shchepinov, R. Chalk, E. M. Southern, Tetrahedron, 2000, 56:2713], pyrrole [R. H. C. Scott, C. Barnes, U. Gerhard, A. Balasubramanian, Chem. Commun., 1999, 1331], halogen benzene derivatives [M. H. J. Ohlmeyer, R. N. Swanson, L. W. Dillard, J. C. Reader, G. Asouline, R. Kobayashi, M. Wigler, W. C. Still, Proc. Natl. Acad. Sci. 1993, 90:10922; H. P. Nestler, P. A. Bartlett, W. C. Still, J. Org. Chem., 1994, 59:4723], and dialkylamine tags [Z.-J. Ni, D. Maclean, C. P. Holmes, M. M. Murphy, B. Ruhland, J. W. Jacobs, E. M. Gordon, M. A. Gallop, J. Med. Chem. 1996, 39:1601; W. L. Fitch, T. A. Baer, W. Chen, F. Holden, C. P. Holmes, D. Maclean, N. Shah, E. Sullivan, M. Tang, P. Waybourn, J. Comb. Chem. 1999, 1:188], as well as fluorous derivatives of carboxylic acids [J. E. Hochlowski, D. N. Whittern, T. J. Sowin, J. Comb. Chem., 1999, 1:291], fluorophores [R. H. Scott, S. Balasubramanian, Bioorg, Med. Chem. Lett., 1997, Vol. 7, No. 12:1567; B. J. Egner, S. Rana, H. Smith, N. Bouloc, J. G. Frey, W. S. Bocklesby, M. Bradley, Chem. Commun., 1997, 735] and compounds with characteristic IR absorption bands [S. S. Rahman, D. J. Busby, D. C. Lee, J. Org. Chem., 1998, 63:6196]. In addition, a method for coding the structure of combinatorial library elements by microchip-coded information was developed [E. J. Moran, S. Sarshar, J. F. Cargill, M. M. Shahbaz, A. Lio, A. M. M. Mjalli, R. W. Armstrong, J. Am. Chem. Soc., 1995, 117:10787]. Methods providing direct information on compound structure include positional coding within oligonucleotide arrays [S. P. A. Fodor; D. Solas, Science, 1991, 767; K. S. Lam, M. Renil, Curr. Opin. Chem. Biol. 2002, 6:353; R. Frank, Tetrahedron, 1992, 48:9217].
An alternative to tagging methods is the controlled preparation of shorter sequences in the oligonucleotide synthesis process followed by their analysis by means of mass spectrometry [R. S. Youngquist, G. R. Fuentes, M. P. Lacey, T. Keough, J. Am. Chem. Soc., 1995, 117:3900; C. Hoffman, D. Blechschmidt, R. Kruger, M. Karas, C. Griesinger, J. Comb. Chem., 2002, 4:79].
The method consists in that at each stage of library generation, elongation of ca. 10% of peptide chains is terminated. This leads to formation of oligomers of different chain lengths. When analyzed by MALDI mass spectrometry, products of such synthesis give spectra containing series of signals. Mass differences between adjacent peaks provide information on the peptide structure.
Patent application no. US20040265912 A1 (publication date Dec. 30, 2004) describes composition and methods for making and using a combinatorial library to identify modified thioaptamers that bind to, and affect the immune response of a host animal, transcription factors such as IL-6, NF-κB, AP-1 and the like. Composition and methods are also provided for the treatment of viral infections, as well as, vaccines and vaccine adjuvants are provided that modify host immune responses
Patent application no. WO2005003291 (publication date Jan. 13, 2005) describes composition and methods for making and using a combinatorial library having two or more beads, wherein attached to each bead is a unique nucleic acid aptamer that have disposed thereon a unique sequence. The library aptamers may be attached covalently to the one or more beads, which may be polystyrene beads. The aptamers may include phosphorothioate, phosphorodithioate and/or methylphosphonate linkages and may be single or double stranded DNA, RNA, or even PNAs.
Patent application no. WO2005037053 (publication date Apr. 28, 2005) composition and methods for making and using a combinatorial library to identify thioaptamers that bind to targets on or about pathogens. Compositions, sets and methods are also provided for the identification of pathogens, e.g., viral, bacterial or other proteins related infectious disease, as well as, vaccines and vaccine adjuvants are provided that modify host immune responses.
Patent application no. U.S. Pat. No. 6,287,765 (publication date Sep. 11, 2001) describes multimolecular devices and drug delivery systems prepared from synthetic heteropolymers, heteropolymeric discrete structures, multivalent heteropolymeric hybrid structures, aptameric multimolecular devices, multivalent imprints, tethered specific recognition devices, paired specific recognition devices, nonaptameric multimolecular devices and immobilized multimolecular structures are provided, including molecular adsorbents and multimolecular adherents, adhesives, transducers, switches, sensors and delivery systems. Methods for selecting single synthetic nucleotides, shape-specific probes and specifically attractive surfaces for use in these multimolecular devices are also provided. In addition, paired nucleotide-nonnucleotide mapping libraries for transposition of selected populations of selected nonoligonucleotide molecules into selected populations of replicable nucleotide sequences are described.
Patent applications no. US 20060073485 (publication date Apr. 6, 2006) and U.S. Pat. No. 7,316,931 (publication date Jan. 8, 2008) mass tagging methods are provided that, when incorporated to the analyzed substance, increase the mass spectrometer detection sensitivity and molecular discrimination. In particular the methods are useful for discriminating tagged molecules and fragments of molecules from chemical noise in the mass spectrum.
Despite the diversity of proposals and current state of art techniques presented above, they lack a simple, universal, direct and inexpensive technique allowing for determination of the structure of selected elements of oligonucleotide libraries while being universal enough to allow structure determinations of freely modified oligonucleotides.
The aim of the invention is providing the ability to “read” sequences of selected elements of combinatorial libraries of freely modified synthetic oligonucleotides. Combinatorial synthesis is used mostly for researching for leading compounds in the pharmaceutical industry. However, it is also used as a tool for studying the properties of oligonucleotides in the aspect of their potential use in experimental antisense or antigen therapy. The main problem associated with combinatorial synthesis is the development of an appropriate strategy for determining the structure of isolated library elements, as usefulness of the combinatorial library depends on the ability to recognize the structure of its elements. Thanks to the developed and presented method for sequencing combinatorial oligonucleotide and/or oligonucleotide analogue libraries it is possible to indisputably identify the sequences of biologically active elements selected by the combinatorial approach.
This goal, combined with the potential of using a universal, direct and inexpensive/accessible technique allowing for identification of structure of a freely modified oligonucleotide, is achieved in this invention. Thus, the solution according to the invention will contribute to the development of novel therapies of cancer and viral diseases.
The invention provides a combinatorial library of oligonucleotides and/or oligonucleotide analogues characterized in that it includes a linker of formula (I)
chemically linked to the support, where R1 and R2 are independently two substituents of any type terminated with functional groups, and R3 and R4 are independent or together form a cyclic system, and
Preferably, when in the linker of formula (I)
R3 and R4 together form a cyclic system, the linker preferably comprises a residue of formula (II)
where R1 and R2 are independent substituents terminated with any functional groups.
The invention also provides a method of preparation of a combinatorial library of oligonucleotides and/or oligonucleotide analogues, characterized in that it includes
Preferably, when the terminating agents are used preferably from the stage of linking the fifth monomer, particularly preferably the eight monomer, the monomer is used at the terminator/monomer ratio of at least 7%.
The invention also provides a method of sequence identification preceded or not preceded by a combinatorial oligonucleotide or oligonucleotide analogue library element selection stage, characterized in that it comprises the stage described above and that a single support bead is isolated, followed by cleavage of the vicinal diol system in the linker as a result of an oxidizing agent consisting in ammonium periodate NH4IO4 or ammonium periodate of formula [R1R2R3R4N]+[IO4]−, wherein R1, R2, R3 and R4 are independently alkyl groups or hydrogen atoms, thus releasing the oligonucleotide comprised of nucleotides and/or oligonucleotide analogues from the support, wherein in case when the linker structure is as in formula (II), final detachment of the oligonucleotide comprised of nucleotides and/or oligonucleotide analogues from the support is a result of treatment with a basic agent; next, the mixture of oligonucleotides of different length, detached from the support bead is submitted to spectroscopic analysis.
Preferably, the basic agent is methionine.
Another aspect of the invention is the method of sequencing the elements of combinatorial oligonucleotide and/or oligonucleotide analogue libraries characterized in that it involves the stages described above and that the type and order of nucleotides and/or their analogues in the sequence is determined from calculation of mass differences between two adjacent signals within the spectrum, corresponding to nucleotide or analogue masses, wherein calculation starts with the signal of the highest m/z value.
Another aspect of the invention is the use of the linker of formula (I),
chemically linked to the support, where R1 and R2 are independently two substituents of any type terminated with functional groups, and R3 and R4 are independent or together form a cyclic system, and
Preferably, when in the linker of formula (I)
R3 and R4 together form a cyclic system, the linker preferably comprises a residue of formula (II)
where R1 and R2 are independent substituents terminated with any functional groups.
Another aspect of the invention is a sequence identification set characterized in that it includes a linker of formula (I),
chemically linked to the support, where R1 and R2 are independently two substituents of any type terminated with functional groups, and R3 and R4 are independent or form a cyclic system, and
Preferably, the linker of formula (I)
R3 and R4 together form a cyclic system, the linker preferably comprises a residue of formula (II)
where R1 and R2 are independent substituents terminated with any functional groups.
At the same time, one must keep in mind that whenever the aforementioned terms are used in the description, they should be understood as follows:
The enclosed figures facilitate better explanation of the nature of the invention.
For better understanding of the invention, the following example solutions are presented.
Preparation of “oxylabile” support for combinatorial oligonucleotide libraries. Below is and outline of preparation of “oxylabile” support, described in detail further in the description of this invention.
Preparation of “oxylabile” support: i) TMSCl, Py; ii) p-toluenesulfonyl chloride, Py; iii) NH3 aq; iv) DMTCl, Py; v) 2,2′-(ethylenedioxy)-bis(ethylamine), Py; vi) DCC, DMAP, Et3N, CH2Cl2, vii) Ac2O, NMI, 2,6-lutidine, MeCN.
In order to obtain the “oxylabile” support (7), the following stages have to be performed:
Cytidine hydrochloride (2.5 g, 8.96 mmol, 1 eq.) was evaporated with pyridine (three times) and then dissolved in 20 mL pyridine; next, trimethylsilyl chloride (6.5 mL, 44.8 mmol, 5 eq.) was added. After about 1 hour stirring at room temperature, the substrate was quantitatively converted as shown by TLC (CH2Cl2/MeOH, 9:1). The silylation product was directly submitted to next reaction. p-Toluenesulfonyl chloride (4 g, 17.9 mmol, 2 eq.) was added to the reaction mixture. The resulting solution was refluxed for 1.5-2 hours. After this time, TLC analysis showed full substrate conversion. Excess silyl and tosyl chlorides were decomposed by adding saturated NaHCO3 solution. The aqueous phase was extracted with three portions of methylene chloride. Combined organic layers were dried over anhydrous Na2SO4. The solvents were evaporated under reduced pressure. Next, the crude product was dissolved in 10 mL of MeOH and 15 mL of aqueous NH3 solution were added. The reaction mixture was left on a magnetic stirrer for about 45 minutes at room temperature. After this time, TLC analysis showed complete deprotection of 4-N-(p-toluenesulfonyl)cytidine hydroxyl groups. The solvents were evaporated under reduced pressure. The crude product was dried by evaporation with pyridine (three times). Dry 4-N-(p-toluenesulfonyl)cytidine was dissolved in 20 mL of pyridine and 4,4′-dimethoxytrityl chloride (3.34 g, 9.85 mmol, 1.1 eq.) was added. The reaction mixture was left on a magnetic stirrer for about 1 hour at room temperature. The reaction was stopped by adding saturated solution of NaHCO3, which was then extracted with three portions of CH2Cl2. After organic layers were combined, dried on anhydrous Na2SO4 and evaporated under reduced pressure, the dry product was purified by chromatography (CH2Cl2/MeOH, 2%). Pure 4-N-p-toluenesulfonyl-5′-O-(4,4′-dimethoxytrityl)cytidine (3) was obtained as white foam. 5.44 g (85%) of white solid was obtained after lyophilization from benzene.
1H NMR (DMSO): δ (ppm) 12.17 (s, 1H, HN4); 7.63 (d, J=8.4 Hz, 1H, H6); 7.36-7.39 (m, 4H, H—Ar); 7.30-7.34 (m, 5H, H—Ar); 7.24-7.28 (m, 4H, H—Ar); 6.89-6.92 (m, 4H, H—Ar); 6.23 (d, J=8 Hz, 1H, H5); 5.68 (d, J=2.4 Hz, 1H, OH); 5.59 (d, J=4.8 Hz, 1H, OH); 5.16 (d, J=6.8 Hz, 1H, H1′); 4.14-4.17 (m, 1H, H4′); 4.06-4.09 (m, 1H, H2′); 3.96-3.98 (m, 1H, H3′); 3.76 (s, 6H, OCH3); 3.23-3.32 (m, 2H, H5′, H5″); 2,37 (s, 3H, CH3). 13C NMR (DMSO): δ (ppm) 159.51 (C4); 158.16 (C—OCH3, DMTr); 149.51 (C2); 144.35 (Ar); 142.34 (C—CH3, Tos); 142.08 (C6); 139.69; 136.10; 135.45; 135.08; 129.74; 129.63, 129.35; 128.30; 127.92; 127.73; 126.85; 126.04; 123.88; 113.25 (Ar); 95.67 (C5); 89.83 (C1′); 85.96 (4° C., DMTr); 82.06 (C4′); 73.08 (C2′); 68.78 (C3′); 62.04 (C5′); 55.01 (OCH3); 20.93 (CH3).
4-N-p-Toluenesulfonyl-5′-O-(4,4′-dimethoxytrityl)cytidine (3) (4.44 g, 6.35 mmol, 1 eq.) was evaporated three times with pyridine to remove the trace amounts of water. Next, the substrate was dissolved in pyridine (20 mL) and 2,2′-(ethylenedioxy)-bis(ethylamine) (5.5 mL, 44.45 mmol, 7 eq.) was added. The flask was tightly closed and placed overnight in a drying oven at 80° C. In the morning, TLC analysis (MeOH:H2O:CH3NH2, 7:2:1), was performed, showing quantitative conversion of the substrate. The reaction mixture was diluted with 30 mL of H2O. The aqueous layer was extracted with three portions of AcOEt (in case of emulsification, the solution was centrifuged). Organic layers were combined and dried/over anhydrous Na2SO4. After the solvents were evaporated under reduced pressure, the crude product was purified by column chromatography (CH2Cl2/MeOH, 6:4). 3.5 g (82%) of pure 4-N-(8-Amino-3,6-dioxaoctyl)-5′-O-(4,4′-dimethoxytrityl)cytidine (4) was obtained as white foam.
1H NMR (DMSO): δ (ppm) 7.86 (t, J=5.6 Hz, 1H, H—N4); 7.72 (d, J=7,6 Hz, 1H, H5); 7.22-7.39 (m, 9H, H—Ar); 6.91 (m, 4H, H—Ar); 5.78 (d, J=2.8 Hz, 1H, H1′); 5.60 (d, J=7.6 Hz, 1H, H6); 4.08 (t, J=6 Hz, H4′); 3.93-3.95 (m, 2H, H2′, H3”); 3.75 (s, 6H, OCH3); 3.50-3,55 (m, 8H, H5′, H5″, CH2 of hydroxydiethoxyethyl); 3.20-3.43 (m, 6H, CH2 of hydroxydiethoxyethyl); 2.66 (t, J=5.6 Hz, 2H, NH2).
13C NMR (DMSO): δ (ppm) 163.43 (C4); 158.12 (C—OCH3, DMTr); 155.06 (C2); 144.71 (C6); 139.81; 135.46; 135.29; 129.87; 127.72; 126.77; 113.23 (Ar); 94.44 (C5); 89.78 (C1′); 85.77 (4° C., DMTr); 81.65 (C4′); 74.08 (C2′); 71.77 (CH2 of hydroxydiethoxyethyl); 69.58; 69.55 (CH2 of hydroxydiethoxyethyl); 69.27 (C3′); 68.65 (CH2 of hydroxydiethoxyethyl); 62.73 (C5′); 55.03 (OCH3); 40.77 (CH2 [hydroxydiethoxyethyl]).
Methylamine-functionalized polystyrene support (50mg) was suspended in, l mL of anhydrous CH2Cl2 and succinic anhydride (50 mg), Et3N (50 μL) and DMAP (20 mg) were added. The flask was tightly closed and shook overnight at room temperature. In the morning, the support was filtered off and washed with MeOH (50 mL) and CH2Cl2 (50 mL). Next, the support was dried in the air. The resulting carboxyl group-carrying support (5) (1 g) was suspended in 10 mL of methylene chloride. 4-N-(8-Amino-3,6-dioxaoctyl)-5′-O-(4,4′-dimethoxytrityl)cytidine (5) (1 g, 1.5 mmol, 1 eq.), DCC (700 mg, 1.5 mmol, 1 eq.), Et3N (450 μL, 1.5 mmol, 1 eq.) and DMAP (20 mg) were added to the suspension. The mixture was shook overnight at room temperature. The support was filtered off and washed with methanol (50 mL) and methylene chloride (50 mL). 10 mL of NH3 aq. (32%) were added to decompose the by-products. The reaction was conducted for 4 hours at 60° C. Next, support beads (6) were suspended in MeCN (10 mL) and acetic anhydride (500 μL), N-methylimidazole (1 mL) and 2,6-lutidine(500 μL) were added. The mixture was shook for 2 hours; after this time, support 7 was filtered off, washed with MeOH, CH2Cl2 and dried in the air.
The synthesis termination procedure was developed for the purposes of peptide library sequencing. Therefore, the original form of this procedure may be used only in case of short oligonucleotides.
The idea of using a single terminating agent regardless of the type of unit attached at particular stage turned out to be useful in case of short peptide chains. Large diversity of monomers used was an argument for such solution. However, applying this approach resulted with chaos that makes difficult to interpret and unambiguously identify signals in MALDI spectra of oligonucleotides from combinatorial libraries.
A series of synthesis termination agents was designed so as to comply with the above assumptions. Nucleoside 3′-phosphoramidites with 5′-hydroxyl function protected with fluorenylmethyloxycarbonyl group (Fmoc), stable in the conditions of oligonucleotide synthesis, were used. This is a base-labile group, which is easily cleaved by β-elimination in ammonia solutions used for oligonucleotide deprotection, thus freeing the 5′-hydroxyl group of the nucleoside.
The synthesis of nucleoside phosphoramidites with base-labile Fmoc group in 5′-hydroxyl position was performed in three stages.
Preparation of oligonucleotide synthesis terminators: i) benzoyl chloride, Py; ii) iso-butyryl chloride, Py; iii) fluorenylmethyloxycarbonyl chloride, Py; iv) bis(diisopropylamine)-(2-cyanoethyl)-phosphine, thioethyltetrazole, CH2Cl2.
The synthesis of terminating agents (N-protected 5′-O-fluorenylmethyloxycarbonyl-2′-deoxynucleoside 3′-phosphoramidites) was conducted in the following stages:
N-protected 2′-deoxynucleosides were obtained from 2′-deoxynucleosides (8 mmol), which had been evaporated three times with pyridine and dissolved in pyridine. Next, trimethylsilyl chloride (4.5 mL, 40 mmol, 5 eq.) was added. The reaction mixture was left on a magnetic stirrer for about 1.5 hours at room temperature. After complete conversion of the substrate was confirmed (TLC—CH2Cl2/MeOH, 9:1), benzoyl chloride (1.8 mL, 16 mmol, 2 eq.) was added and the stirring was continued for 2 hours. After this time, no substrate was observed. The reaction mixture was cooled down in ice bath and 25 mL of ammonia solution were added portionwise. After 30 minutes, the solvents were evaporated under reduced pressure. The residue was dissolved in 35 mL of H2O and 25 mL of AcOEt were added. The mixture was shook and placed in a refrigerator. After the mixture was cooled down, white soft flaky solid precipitated. The solid was filtered off, washed with AcOEt and dried under reduced pressure over P2O5.
1H NMR (DMSO): δ (ppm) 11.18 (s, 1H, HNCO); 8.74 (s, 1H, H8); 8,73 (s, 1H, H2); 8.14 (m, 2H, H—Ar); 7.28-7.67 (m, 3H, H—Ar); 6.49 (t, J=4.8 Hz, 1H, H1′); 5.43 (d, 1H, 3′OH); 5.06 (t, 1H, 5′OH); 4.46 (m, 1H, H4′); 3.91 (m, 1H, H3′); 3.52-3.65 (m, 2H, H5′, H5″); 2.75-2.82 (m, 1H, H2′); 2.34-2.39 (m, 1H, H2″).
13C NMR (DMSO): δ (ppm) 165.70 (CONH); 151.89 (C6); 151.48 (C2); 150.29 (C4); 143.06 (C8); 133.37; 132.42; 128.55; 128.45 (Ar); 124.08 (C5); 87.99 (C4′); 83.72 (C1′); 70.68 (C3′); 61.58 (C4′); 38.97 (C2′).
1H NMR (DMSO): δ (ppm) 11.23 (s, 1H, H—N4); 8.41 (d, J=7.2 Hz, 1H, H6); 8.01 (m, 2H, H—Ar); 7.48-7.65 (m, 3H, H—Ar); 6.16 (t, J=6 Hz, 1H, H1′), 5.30 (d, J=4.2 Hz, 1H, 3′OH); 5.11 (t, J=5.1 Hz, 1H, 5′OH); 4.26 (m, 1H, H4′); 3.89 (m, 1H, H3′); 3.55-3.68 (m, 2H, H5′, H5″); 2.28-2.36 (m, 1H, H2′); 2.02-2.1 (m, 1H, H2″),
13C NMR (DMSO): δ (ppm) 167.41 (COOPh); 162.98 (C4); 154.36 (C2); 144.96 (C6); 133.18; 132.69; 130.02; 128.43 (Ar); 96.03 (C5); 87.94 (C4′); 86.19 (C1′); 69.91 (C3′); 60.92 (C5′); 40.89 (C2′).
1H NMR (DMSO): δ (ppm) 12.06 (s, 1H, HNCO); 11.71 (s, 1H, H—N1); 8.23 (s, 1H, H8); 6.22 (t, J=6.6 Hz, 1H, H1′); 5.37 (d, 1H, 3′OH); 4.95 (t, 1H, 5′OH); 4.37 (m, 1H, H4′); 3.84 (m, 1H, H3′); 3.47-3.59 (m, 2H, H5′, H5″); 2.73-2.82 (m, 1H, CH, i-Bu group); 2.49-2.59 (m, 1H, H2′); 2.24-2.31 (m, 1H, H2″); 1.12 and 1.09 (s, 6H, CH3 of i-Bu).
13C NMR (DMSO): δ (ppm) 179.99 (CONH); 154.86 (C6); 148.36 (C4); 148.05 (C2); 137.46 (C8); 120.15 (C5); 87,72 (C4′); 82.98 (C1′); 70.48 (C3′); 61.44 (C5′), 39.69 (C2′); 34.71 (CH of i-Bu); 18.84 (CH3 of i-Bu).
N-protected nucleosides or thymidine(1 eq.) were evaporated three times with pyridine. Then, the compounds were dissolved in pyridine, and fluorenylmethyloxycarbonyl chloride (1.1 eq.) was added. The reaction mixture was left on a magnetic stirrer for 1.5-2 hours. After this time, TLC analysis showed full substrate conversion. Saturated NaHCO3 solution was added. The aqueous layer was extracted with CH2Cl2. Extracts were combined, dried on anhydrous Na2SO4 and evaporated under reduced pressure. Crude products were purified by chromatography (CH2Cl2/MeOH; 3%), giving pure N-protected 5′-O-fluorenylmethyloxycarbonyl-2′-deoxynucleosides as white solids.
1H NMR (DMSO): δ (ppm) 11.28 (s, 1H, HNCO); 8.65 (s, 1H, H8); 8.57 (s, 1H, H2); 7.29-8.12 (m, 13H, H—Ar); 6.38 (m, 1H, H1′); 5.51 (d, 1H, 3′OH); 4.47-4.68 (m, 2H, CH2-COO); 4.43 (m, 1H, H4′); 4.26-4.32 (m, 2H, H5′, H5″); 3.98 (m, 1H, H3′); 2.11-2.51 (m, 3H, CH of Fmoc, H2′, H2″).
1H NMR (DMSO): δ (ppm) 11.32 (s, 1H, H—N4); 7.80-8.02 (m, 6H, H6, H—Ar); 7.27-7.69 (m, 8H, H—Ar); 6.19 (m, J=6Hz, 1H, H1′), 5,30 (d, J=4.2 Hz, 1H, 3′OH); 4.50-4.59 (m, 2H, CH2—COO); 4.38 (m, 1H, H4′); 4.28-4.33 (m, 2H, H5′, H5″); 4.20 (m, 1H, H3′); 2.01-2.45 (m, 3H, CH of Fmoc, H2′, H2″).
1H NMR (DMSO): δ (ppm) 12.11 (s, 1H, HNCO); 11.86 (s, 1H, H—N1); 7.31-8.15 (m, 9H, H8, H—Ar); 6.17 (t, J=5.8 Hz, 1H, H1′); 5.15 (d, 1H, 3′OH); 4.38-4.59 (m, 2H, CH2—COO); 4.28 (m, 1H, H4′); 4.19-4.27 (m, 2H, H5′, H5″); 4.06 (m, 1H, H3′); 2.69-2.78 (m, 1H, CH of i-Bu); 2.10-2.48 (m, 3H, CH of Fmoc, H2′, H2″); 1.10 and 1.07 (s, 6H, CH3 of i-Bu).
1H NMR (DMSO): δ (ppm) 11.27 (s, 1H, H—N3); 7.27-7.85 (m, 9H, H6, H—Ar); 6.31 (t, J=4.5 Hz, 1H, H1′); 5.01 (d, 1H, 3′OH); 4.41-4.58 (m, 2H, CH2—COO); 4.39 (m, 1H, H4′); 4.09-4.35 (m, 2H, H5′, H5″); 3.87 (m, 1H, H3′); 2.03-2.41 (in, 3H, H2′, H2″, CH of Fmoc); 1.98 (s, 3H, CH3).
N-protected 5′-O-fluorenylmethyloxycarbonyl-2′-deoxynucleoside 3′-phosphoramidites were obtained in the following procedure:
N-protected 5′-O-fluorenylmethyloxycarbonyl-2′-deoxynucleoside (1 eq.) and thioethylotetrazole (0.9 eq.) were dried overnight over P2O5 in a vacuum dessicator. Next, the protected 2′-deoxynucleoside was dissolved in methylene chloride (20 mL) and bis(N,N′-diisopropylamine)(2-cyanoethoxy)phosphine (1.1 eq.) and thioethyltetrazole (portionwise) were added. After 1 hour 31P NMR analysis showed full phosphine conversion. Saturated solution of NaHCO3 was added to the reaction mixture and then extracted with three portions of CH2Cl2. The organic layers were combined, dried on anhydrous Na2SO4 and the solvents were evaporated under reduced pressure. Next, the N-protected 5′-O-fluorenylmethyloxycarbonyl-2′-deoxynucleoside 3′-phosphoramidites were dissolved in 3 mL of methylene chloride and precipitated from n-hexane (800 mL).
31P NMR (CH2Cl2): δ (ppm) 148.97, 148.84.
1H NMR (DMSO): δ (ppm) 10.99 (s, 1H, H—N4); 8.76 (d, J=2 Hz, 1H, H8); 8.65 (d, J=5.6 Hz, 1H, H2); 7.59-8.13 (m, 13H, H—Ar); 6.54 (m, 1H, H1′); 4.84 (m, 1H, H4′); 4.27-4.52 (m, 5H, H5′, H5″, CH, CH2 of Fmoc); 3.84 (m, 2H, CH2 of Fmoc); 3.64 (m, 1H, H3′); 3.09 (m, 2H, [i-Pr]); 2.77 (m, 2H, CH2 of Fmoc); 2.34-2.61 (m, 2H, H2′, H2″); 1.18 (s, 12H, [i-Pr]).
13C NMR (DMSO): δ (ppm) 165.67 (CO—Ph); 154.22 (C4); 151.73 (C2); 151.50 (O—CO—O); 150.48 (C8); 143.31 (C4); 140.75; 139.39; 137.40; 133.43; 132.42; 129.15 (Ar); 128.88 (C5); 128.56; 128,47; 128.43; 127,69; 127.25; 127.12; 125.99; 124.87; 121.35; 120.15 (Ar); 118.98 (CN); 83.86 (CH2 of Fmoc); 83.04 (C1′); 82.75 (C4′); 68.88 (C5′); 58.54 (C3′); 58.36 (CH of Fmoc); 42.71; 42.60 (i-Pr); 37.30 (C2′); 24.39; 24.33; 24.24; 24.16 (i-Pr); 19.80 (CH2 of cyanoethylof cyanoethyl).
31P NMR (CH2Cl2): δ (ppm) 149.15, 148.98.
1H NMR (DMSO): δ (ppm) 11.26 (s, 1H, H—N4); 7.81-8.09 (m, 5H, H6, H—Ar); 7.28-7.64 (m, 9H, H—Ar); 6.21 (m, 1H, H1′); 4.26-4.56 (m, 7H, H5, H4′, H5′, H5″, CH, CH2 of Fmoc); 3.78 (m, 2H, CH2of cyanoethyl); 3.42-3.57 (m, 3H, H3′, i-Pr); 2.73-2.80 (m, 2H, CH2 of cyanoethyl); 2.19-2.27 (m, 2H, H2′, H2″); 1.15 (s, 12H, CH3 [i-Pr]).
13C NMR (DMSO): δ (ppm) 167.49 (CO—Ph); 165.46 (C4); 163.08 (C2); 154.21 (O—CO—O); 144.83 (C6); 143.42; 143.39; 143.22; 142.55; 140.77; 140.70; 139.39; 137.40; 133.53; 133.13; 132.71; 129.19; 128.89; 128.41; 128.15; 127.42; 127.25; 127.10; 124.82 (Ar); 119.99 (CN); 96.38 (C5); 86.60 (CH2 of Fmoc); 72.86 (C4′); 68.89 (C5′); 66.91 (C3′); 58.43 (CH2 of cyanoethylof cyanoethyl); 46.22 (Fmoc); 44.54 (C1′); 42.72 (i-Pr); 24.35; 24.28; 24.22; 24.14 (i-Pr); 19.75 (CH2 of cyanoethylof cyanoethyl).
31P NMR (CH2Cl2): δ (ppm) 149.15, 148.98.
1H NMR (DMSO): δ (ppm) 11.65 (s, 1H, H—NCO); 8.20 (d, J=4.8 Hz, 1H, H8); 7.83-7.89 (m, 2H, H—Ar); 7.61-7.65 (m, 2H, H—Ar); 7.30-7.43 (m, 4H, H—Ar); 6.26 (t, J=7.2 Hz, 1H, H1′); 4.60 (m, 1H, H4′); 4.46-4,55 (m, 2H, CH2 of Fmoc); 4.21-4.34 (m, 3H, H5′, H5″, CH of Fmoc); 4.15 (m, 1H, H3′); 3.68-3.80 (m, 2H, CH2 of cyanoethylof cyanoethyl); 3.52-3.62 (m, 2H, CH [i-Pr]); 2.91 (m, 1H, CH of i-Bu); 2.73-2.79 (m, 2H, CH2 of cyanoethylof cyanoethyl); 2.42-2.55 (m, 2H, H2′, H2″); 1.23 (s, 6H, CH3 of i-Bu); 1.32 (s, 12H, CH3 [i-Pr]).
13C NMR (DMSO): δ (ppm) 180.13 (CO of i-Bu); 180.06 (C6); 154.79 (C2); 154.20 (O—CO—O), 148.50 (C8); 143.23; 143.20 (Ar); 140.78 (C4); 140.75 (C5); 139.39; 137.40; 137.29; 128.88; 127.70; 127.75; 127.11; 124.84; 124.18; 121.35; 120.17 (Ar); 119.99 (CN); 82.99 (CH2 of Fmoc); 82.82 (C4′); 68.88 (C5′); 66.95 (C1′); 61.16 (C3′); 58.45 (CH2 of cyanoethyl); 46.17 (CH of Fmoc); 42,66 (CH of i-Bu and [i-Pr]); 34.73 (C2′); 24,29 (CH3 of i-Bu); 22.03 (CH2 of cyanoethyl); 18.82 (CH3 of i-Bu).
31P NMR (CH2Cl2): δ (ppm) 149.06, 148.93.
1H NMR (DMSO): δ (ppm) 11.32 (m, 1H, H—N4); 7.39-7.89 (m, 9H, H6, H—Ar); 6.19 (m, 1H, H1′); 4.53-4.62 (m, 2H, CH2 of Fmoc); 4,46 (m, 1H, H4′); 4,24-4.35 (m, 3H, H5′, H5″, CH of Fmoc); 4.13 (m, 1H, H3′); 3.64-3.72 (m, 2H, CH2 of cyanoethyl); 3.52-3.59 (m, 2H, CH [i-Pr]); 2.72-2.79 (m, 2H, CH2 of cyanoethyl); 2.21-2.36 (m, 2H, H2′, H2″); 1.14 (s, 12H, CH3 [i-Pr]); 1.10 (CH3 [T]).
13C NMR (DMSO): δ (ppm) 163.59 (C4); 154.28 (C2); 150.33 (O—CO—O); 143.22; 140.79 (Ar); 135.89 (C6); 127.73; 127.24; 124.79; 120.18 (Ar); 118.93 (CN); 109.87 (C5); 84.14 (CH2 Fmoc); 82.41 (C1′); 82.01 (C2′); 72.89 (C5′); 68.82 (C4′); 58.41 (CH2 of cyanoethyl); 46.25 (CH of Fmoc); 42.70 (CH [i-Pr]); 37.41 (C2′); 24.33; 24.27; 24.18; 24.12 (CH3 [i-Pr]); 22.55 (CH2 of cyanoethyl); 12.06 (CH3 of T).
Standard nucleoside 3′-phosphoramidites of nucleosides or their analogues were mixed with terminating agents in 9:1 molar ratio. 0.07 M solutions in acetonitrile were prepared and used for oligonucleotide synthesis.
Oligonucleotides were synthesized in 0.2 μM scale according to the standard program:
Protective groups were removed from oligonucleotides anchored to the “oxylabile support” by overnight treatment with aqueous 32% ammonia solution at 55° C.
The amount of oligonucleotide(s) on a single support bead does not exceed several picomoles. Analysis of complex mixtures at such minimum quantities is a difficult task. Oxidation of ribose residue cis-diol system is conducted using ammonium periodate.
A single support bead was isolated using a glass capillary. The operation was monitored under a Nikon Diaphot inverted fluorescence microscope. The bead suspended in 0.1-0.2 μL H2O was placed in an eppendorf tube. Next, the suspension was centrifuged and 0.1 M solution of NH4IO4 (0.25 μL) was added. The solution was centrifuged again and set aside for 30 minutes. Next, 0.3 M L-methionine solution (0.25 μL) was added, the mixture was centrifuged and set aside for 2 hours
After cleavage of the nucleotide from the support, the mixture was submitted to MALDI-TOFF analysis.
2,4,6-trihydroxyacetophenone (THAP) was used as the matrix. In order to prepare the matrix solution, 2 mg of THAP were dissolved in 200 μL of MeCN/H2O mixture (1:1) and 0.1 M diammonium citrate solution (70 μL) was added. The mixture was shaken and centrifuged.
Analyses were performed by the dried droplet method on prestructured MALDI AnchorChip plates with spot diameters of 600 and 400 μm. Oligonucleotide solution (0.5 μL) was applied onto the spot, followed by the matrix solution. The mixture was stirred using the pipette tip. In order to maintain uniform crystallization conditions (air humidity), plates were placed over a drying agent (CaO) in a dessicator.
An exemplary MALDI-TOF spectra were obtained for the oligonucleotide of the following sequence: 5′-d(CGG ATT TAT GCA)-3′and the oligonucleotide analogue: 5′-d(ATT CSpPGT GTG CAG A)-3′. The first three nucleotides must be known, since the signals representing these nucleotides are located in the region of matrix signals.
The
The type and order of nucleotides and/or their analogues in the sequence is determined from calculation of mass differences between two adjacent signals within the spectrum, corresponding to nucleotide or analogue masses. Calculation starts with the signal of the highest m/z value. The left-to-right spectrum sequence is interpreted in 3′→5′ direction. Units at the 3′ terminus of the oligomer, having molecular mass of up to 1000 Da, are predefined, since the signals that represent these units are located in the region of matrix signals; the sequence is a part of the library or a mass marker—a chemical compound structurally different from the components of the oligonucleotide or its analogue).
Preparation of “oxylabile” support 2: i) TMSCl, Py; ii) 1,2-bis[(dimethylamine)-methylene]hydrazine, Py; iii) DMTCl, Py; iv) 2,2′-(ethylenedioxy)-bis(ethylamine), Py; v) DCC, DMAP, Et3N, CH2Cl2, vi) NH3 aq; vii) Ac2O, NMI, 2,6-lutidine, MeCN.
The synthesis of “oxylabile” support 2 (13) was conducted in following stages:
Adenosine (1 eq.) was evaporated with pyridine (three times) and then dissolved in pyridine, and trimethylsilyl chloride (5 eq.) was added. After about 1 hour of stirring at room temperature, the substrate was quantitatively converted. The silylation product was directly submitted to next reaction. 1,2-bis[(dimethylamino)-methylene]hydrazine (4 eq.) was added to the reaction mixture. The resulting solution was refluxed at pyridine boiling point (96-100° C.) for 24 hours. After this time, TLC analysis showed full substrate conversion. Excess silyl chloride was decomposed by adding saturated NaHCO3 solution. The aqueous phase was extracted with three portions of methylene chloride. Combined organic layers were dried on anhydrous Na2SO4. The solvents were evaporated under reduced pressure. Next, the crude product was dissolved in 10 mL of MeOH and left for 10 hours at room temperature. The product 10 was crystallized from methanol. Crystals were filtered off and washed with hexane and diethyl ether. The product 10 was then dried by evaporation with pyridine (three times) and dissolved in pyridine. 4,4′-dimethoxytrityl chloride (1.1 eq.) was added. The reaction mixture was left on a magnetic stirrer for about 6 hours at room temperature. The reaction was stopped by adding saturated solution of NaHCO3, which was then extracted with three portions of CH2Cl2. After organic layers were combined, dried on anhydrous Na2SO4 and evaporated under reduced pressure, the dry product was purified by chromatography (CH2Cl2/MeOH, 2-4%). Pure 9-[5′-0-(4,4′-dimethoxytrityl)-(β-D-erythro-pentofuranosyl)]-6-(1,2,4-triazol-4-yl)purine (10) was obtained as white oil. White solid was obtained after lyophilization from benzene.
9- [5′-O-(4,4′-dimethoxytrityl)-(β-D-erythro-pentofuranosyl)]-6-(1,2,4-triazol-4-yl)purine (10) (1 eq.) was evaporated three times with pyridine to remove the trace amounts of water. Next, the substrate was dissolved in pyridine (20 mL) and 2,2′-(ethylenedioxy)-bis(ethylamine) (5.5 mL, 44.45 mmol, 7 eq.) was added. The flask was tightly closed and placed overnight in an oven at 80° C. In the morning, TLC analysis (MeOH:H2O:CH3NH2, 7:2:1) was performed, showing quantitative conversion of the substrate. The reaction mixture was diluted with 30 mL of H2O. The aqueous layer was extracted with three portions of AcOEt (in case of emulsification, the solution was centrifuged). Organic layers were combined and dried on anhydrous Na2SO4. After the solvents were evaporated under reduced pressure, the crude product was purified by chromatography (CH2Cl2/MeOH, 6:4), Pure 4-N-(8-Amino-3,6-dioxaoctyl)-5′-O-(4,4′-dimethoxytrityl)adenosine (11) was obtained as white foam.
Methylamine-functionalized polystyrene support (50 mg) was suspended in 1 mL of anhydrous CH2Cl2 and succinic anhydride (50 mg), Et3N (50 μL) and DMAP (20 mg) were added. The flask was tightly closed and shook overnight at room temperature. In the morning, the support was filtered off and washed with MeOH (50 mL) and CH2Cl2 (50 mL). Next, the support was dried in the air. The resulting carboxyl group-carrying support (5) (1 g) was suspended in 10 mL of methylene chloride. 4-N-(8-Amino-3,6-dioxaoctyl)-5′-O-(4,4′-dimethoxytrityl)adenosine (11) (1 g, 1.5 mmol, 1 eq.), DCC (700 mg, 1.5 mmol, 1 eq.), Et3N (450 μL, 1.5 mmol, 1 eq.) and DMAP (20 mg) were added to the suspension. The mixture was shook overnight at room temperature. The support was filtered off and washed with methanol (50 mL) and methylene chloride (50 mL). 10 mL of NH3 aq. (32%) were added to decompose the by-products. The reaction was conducted for 4 hours at 60° C. Next, support beads (12) were suspended in MeCN (10 mL) and acetic anhydride (500 μL), N-methyloimidazole (1.34 mL) and 2,6-lutidine (500 μL) were added. The mixture was shook for 2 hours; after this time, support 13 was filtered off, washed with MeOH, CH2Cl2 and dried in the air.
Number | Date | Country | Kind |
---|---|---|---|
P-390 797 | Mar 2010 | PL | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/PL2011/000033 | 3/21/2011 | WO | 00 | 9/24/2012 |