This specification relates to a modified fluorescent nucleobase triphosphate that can be used to biosynthetically produce labelled RNA, which is in turn amenable to in cell translation. This enables live cell imaging in which the labelled messenger RNA and its corresponding translation product (when a fluorescent fusion protein) can be visualised simultaneously.
RNA plays a fundamental role in human biology. It is the main player of the central dogma of biochemistry and a crucial regulator of gene expression via for instance micro and small interfering RNA, as well as through its intrinsic catalytic activity. It has, for these reasons, also emerged as a highly promising and versatile new drug modality: since RNA therapeutics have the potential to modify cellular function at the translational level, they may open up new opportunities to address previously undruggable targets.
An increased molecular and mechanistic knowledge of the biological processes involving RNA is therefore vital to understanding diseases and treat them. For example, there is a growing body of evidence suggesting that the key to unleashing the full potential of RNA-based drugs lies in understanding the processes of cell uptake and endosomal release (Dowdy, S. F., Nat. Biotechnol. 35, 222-229, [2017]). Regardless of the endocytosis mechanism, the delivery of a nucleic acid cargo to the cytoplasm always relies on endosomal escape, the understanding of which, despite extensive investigations, remains elusive (Crooke, S. T. et al., Nat. Biotechnol. 35, 230 [2017]; Pei, D. & Buyanova, M., Bioconjugate Chem. [2018]). In this context, tracking of endogenous and exogenous (therapeutic) RNAs inside cells, including their translocation, localization, splicing and degradation, is of great importance.
Recent advances have resulted in the development of a broad spectrum of tools and probes by which RNA can be analysed and quantified, but they generally involve heavily modified oligonucleotides with properties significantly different from natural ones, potentially resulting in loss of ability to be recognized and processed by the enzymatic machinery of cells. For example, a drawback of existing fluorescence-based technologies for studying cellular localization of RNA is that they primarily rely on highly amphiphilic and/or bulky external fluorescent constructs which could impair motility and perturb localization of the RNA and its molecular interactions with (for example) membrane constituents. In addition, a majority of these technologies are incompatible with live cell imaging (Li, Y., Ke, K. & Spitale, R. C., Biochemistry 58, 379-386 [2019]).
To overcome these issues and provide an improved method of investigating RNA mechanisms, this specification discloses a modified nucleobase triphosphate (compound (I), “tCOTP”) which can be used to incorporate minimally perturbing and internal labels (“tCO”) into functional messenger RNA (“mRNA”), giving it utility in live cell imaging and for drug delivery studies. Once incorporated into RNA, the structure of tCO enables it to retain base-pairing and stacking, so that it minimally perturbs natural biological processes (
To exemplify the possible applications of tCO, the specification describes successful in vitro transcription and also effective in cell translation of a full-length mRNA internally labelled with this fluorescent nucleobase analogue. In addition, by using a transcript encoding for the histone protein H2B fused to a Green Fluorescent Protein (“H2B:GFP”) which localizes to the nucleus, the specification shows that it is possible to visualize the labelled mRNA transcripts inside cells while concomitantly recording the fluorescence emanating from the expressed H2B:GFP protein. This approach should be generally applicable to any fluorescent protein.
Wilhelmsson, M. et al. Sci. Rep. 7, 2393 [2017] reports the preparation of RNA molecules labelled with tCO, but only discloses very short fluorescent RNA oligomers prepared by non-enzymatic solid-phase oligonucleotide synthesis, as opposed to full length RNAs accessible by biosynthesis. By virtue of their method of preparation the labelled RNA molecules are not amenable to in cell “live” analysis of transcription, translation or delivery of long therapeutic RNAs (which are mRNA-based) and therefore do not allow the same level of mechanistic insight.
WO2011/034895 concerns methods for labelling DNA and RNA. It mentions a structurally different fluorescent ribonucleotide analogue 1,3-diaza-2-oxophenothiazine-ribose-5-triphosphate (“tCTP”) which is used during in vitro transcription reactions to prepare labelled RNA. However, unlike the present specification only non-coding labelled RNA was prepared and there is no disclosure of any in cell biosynthesis or visualisation.
The differently labelled, full-length coding RNA polymers accessible using the technology disclosed in the present specification also have advantageous properties over the labelled RNAs in WO2011/034895, for example 1) improved fluorescence levels and label photostability; 2) improved in vitro transcription fidelity; and 3) native-like levels of in cell translation of tCO-labelled mRNA resulting in the correct protein product and localization.
In summary therefore, this specification discloses a labelling technique that not only allows localisation and tracking of the tagged RNA, but also facilitates analysis of the biological functionality and delivery efficacy of mRNA, an important future drug modality. Since the internal tCO label is compatible with biological processes that RNA participates in it holds a great potential to be used as a powerful imaging tool in live cell microscopy, for example in detailed investigations of cellular uptake, endosomal release, exosomal loading and trafficking. These have significant potential to elucidate how these vital delivery pathways work and can be controlled.
A primary objective of the present specification is to provide a modified nucleobase triphosphate that can be used to make labelled RNA especially suitable for in vitro and in vivo mechanistic investigations.
Accordingly, this specification describes, in part, a compound of formula (I) or a salt thereof as claimed in claim 1.
This specification also describes, in part, a process for preparing a compound of formula (I) or a salt thereof as claimed in claim 5.
This specification also describes, in part, a composition for preparing a tCO labelled RNA molecule comprising a compound of formula (I) as claimed in claim 16.
This specification also describes, in part, the use of a compound of formula (I) or a salt thereof to enzymatically prepare a tCO labelled RNA molecule as claimed in claim 17.
This specification also describes, in part, a process for preparing a tCO labelled RNA molecule as claimed in claim 19.
This specification also describes, in part, the use of a tCO labelled mRNA molecule to prepare a protein encoded by the mRNA by translation as claimed in claim 20.
The invention detailed in this specification should not be interpreted as being limited to any of the recited embodiments or examples. Other embodiments will be readily apparent to a reader skilled in the art.
“A” or “an” mean “at least one”. In any embodiment where “a” or “an” are used to denote a given material or element, “a” or “an” may mean one. In any embodiment where “a” or “an” are used to denote a given material or element, “a” or “an” may mean 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 100, 1000, 10000, 100000 or 1000000 (1 million).
When an embodiment includes “a” or “an” feature X, subsequent referrals to “the” feature X do not imply only one of the feature is present. Instead the above interpretation of “a” or “an” continues to apply so that “the” also means “at least one”. In other words, embodiments comprising “a feature X, where the feature X is . . . ” should be construed as “at least one feature X, where the at least one feature X is . . . ”.
“Comprising” means that a given embodiment may contain other features. For example, in any embodiment where a material “comprising” certain materials or elements is mentioned, the given material may be formed of at least 10% w/w, at least 20% w/w, at least 30% w/w, or at least 40% w/w of the materials or elements (or combination of materials or elements).
In any embodiment where “comprising” is mentioned, “comprising” may also mean “consisting of” (or “consists of”) or “consisting essentially of” (or “consists essentially of”).
With respect to embodiments of a material, “consisting of” or “consists of” means the material or element is formed entirely of the material or element (or combination of materials or elements). In any embodiment where “consisting of” or “consists of” is mentioned the given material or element may be formed of 100% w/w of the material or element.
With respect to embodiments of a material, “consisting essentially of” or “consists essentially of” means that a given material or element consists almost entirely of that material or element (or combination of materials or elements). In any embodiment where “consisting essentially of” or “consists essentially of” is mentioned the given material or element may be formed of at least 50% w/w, at least 60% w/w, at least 70% w/w, at least 80% w/w, at least 90% w/w, at least 95% w/w or at least 99% w/w of the material or element.
In any embodiment where “is” or “may be” is used to define a material or element, “is” or “may be” may mean the material or element “consists of” or “consists essentially of” the material or element.
When it is mentioned that “in some embodiments . . . ” a certain element may be present, the element may be present in a suitable embodiment in any part of the specification, not just a suitable embodiment in the same section or textual region of the specification.
When a feature is “selected from” a list, the feature is selected from a list consisting of the specified alternatives (i.e. a list of the alternatives specified and no others).
Claims are embodiments.
In one embodiment there is provided a compound of formula (I) or a salt thereof:
Compounds and salts described in this specification may exist as a mixture of tautomers (structural isomers resulting from the migration of a hydrogen atom that exist in equilibrium). Relevant embodiments include all tautomers of compounds of formula (I) or salts thereof.
Atoms of the compounds and salts described in this specification may exist as their isotopes. Embodiments include all compounds of formula (I) where an atom is replaced by one or more of its isotopes (for example a compound of formula (I) where one or more carbon atom is an 11C or 13C carbon isotope, or where one or more hydrogen atom is a 2H or 3H isotope).
A suitable salt of a compound of formula (I) is for example a base-addition salt. A base-addition salt is formed by bringing the compound of formula (I) into contact with a suitable organic or inorganic base. A base addition salt may be formed using a suitable organic base like a nitrogen base, for example ammonia or a trialkylamine like triethylamine. A base addition salt may also for example be formed using a suitable inorganic base like an alkali metal or rare earth hydroxide, for example potassium hydroxide, sodium hydroxide, magnesium hydroxide or manganese hydroxide.
In one embodiment there is provided a compound of formula (I) which is a free acid.
In one embodiment there is provided a compound of formula (I) which is a salt.
In one embodiment there is provided a compound of formula (I) which is a sodium, potassium, magnesium, or ammonium salt.
In one embodiment there is provided a compound of formula (I) which is a sodium, potassium, or ammonium salt.
In one embodiment there is provided a compound of formula (I) which is a sodium or ammonium salt.
In one embodiment there is provided a compound of formula (I) which is a monosodium, disodium, trisodium, tetrasodium, monoammonium, diammonium, triammonium or tetraammonium salt.
In one embodiment there is provided a compound of formula (I) which is a monosodium, disodium, trisodium or tetrasodium salt.
In one embodiment there is provided a compound of formula (I) which is a monosodium salt.
In one embodiment there is provided a compound of formula (I) which is a disodium salt.
In one embodiment there is provided a compound of formula (I) which is a trisodium salt.
In one embodiment there is provided a compound of formula (I) which is a monoammonium, diammonium, triammonium or tetraammonium salt.
In one embodiment there is provided a compound of formula (I) which is a monoammonium salt.
In one embodiment there is provided a compound of formula (I) which is a diammonium salt.
In one embodiment there is provided a compound of formula (I) which is a triammonium salt.
In one embodiment there is provided a process for preparing a compound of formula (I) or a salt thereof comprising:
A protecting group (“PG”, for example PG1 and PG2) is any group suitable for temporarily protecting a reactive centre, for example a hydroxyl group. Suitable protecting groups for the reactive centres disclosed herein may be found for example in “Greene's Protective Groups in Organic Synthesis, Fourth Edition”, Greene T. W., Wuts P. G. M.; John Wiley & Sons, Inc. 2007, doi: 10.1002/0470053488), the contents of all of which are herein incorporated by reference.
A “hydro” group is equivalent to a hydrogen atom. Atoms with a hydro group attached to them can be regarded as unsubstituted.
A “C1-3alkyl group” is a straight chain or branched saturated alkyl group with the indicated number of carbons. Example C1-3alkyl groups include methyl, ethyl, propyl and isopropyl.
In step iii) above, the secondary alcohols to be capped may be those on the ribose part of the molecule.
This overall process is an advantageous preparation of the compound of formula (I) for several reasons:
In some embodiments R1 may be a hydro group.
In some embodiments R1 may be a C1-3alkyl group. It has been observed that when R1 is a C1-3alkyl group, the phosphoramidite reagent preparation is easier and higher yielding, but performs at least as well in step v above as when R1 is a hydro group.
In some embodiments R1 may be methyl.
In one embodiment there is provided a compound of formula (III):
Where R1 is a C1-3 alkyl group.
In one embodiment there is provided a compound of formula (IIIa):
In some embodiments the support may be a solid polymer.
In some embodiments the support may be a solid polymer selected from controlled-porosity glass and polystyrene.
In some embodiments the support may be polystyrene.
In some embodiments the support may be controlled-porosity glass.
In some embodiments the support may be functionalised with a primary amino group. This may form the reactive point of attachment to the support.
In some embodiments the support may be controlled-porosity glass functionalised with a primary amino group (for example Amino-SynBase™).
In some embodiments PG1 may be selected from trityl, dimethoxytrityl and trimethoxytrityl.
In some embodiments PG2 may be selected from acetyl, benzoyl, 2,2,2-trichloroethylcarbonyl, paramethoxybenzyl, methyl, tetrahydropyranyl, triethylsilyl, triisopropylsilyl, trimethylsilyl, tert-butyldimethylsilyl and methoxyethyl.
In some embodiments PG2 may be acetyl. Where an immobilised molecule is base labile, this allows for an efficient synthesis in which removal of the PG2 group and cleavage from the resin may be accomplished in a single step.
In some embodiments PG1 may be dimethoxytrityl and PG2 may be acetyl.
In some embodiments immobilisation of the compound of formula (II) in step i) may occur mainly at the 2′-hydroxy position.
When immobilisation occurs mainly at the 2′-hydroxy position, this may be >50%, >60%, >70%, >80%, >90% or 100% of the total immobilisation (i.e. the total covalent binding of both secondary hydroxyl groups to the support).
In some embodiments the tetraalkylammonium pyrophosphate may be tetrabutylammonium pyrophosphate.
In one embodiment there is provided a process for preparing a compound of formula (I) or a salt thereof comprising:
In one embodiment there is provided a process for preparing a compound of formula (I) or a salt thereof comprising:
Suitable conditions and reagents to effect each of steps i) to ix) above are known to the skilled person or can be found in the Detailed Description.
In some embodiments immobilising the compound of formula (II) or salt thereof in step ii) may be accomplished by a coupling reagent (for example succinic anhydride catalysed by dimethylaminopyridine when the support is functionalised with a primary amino group).
In some embodiments reaction of the exposed primary alcohol group with a compound of formula (III) may be accomplished using an activator (for example BTT activator or Activator 42®).
In some embodiments the phosphorus (Ill) compound in step vi) may be oxidised to a phosphorus (V) compound using aqueous pyridine and iodine.
In some embodiments cleaving the triphosphate from the support may be accomplished using basic conditions (for example by treating with AMA). When there is a base-labile support and a base-labile protecting group is chosen for PG2, using these conditions allows simultaneous deprotection and cleavage.
In one embodiment there is provided a composition for preparing a tCO labelled RNA molecule comprising a compound of formula (I) and a natural ribonucleotide triphosphate.
A “natural ribonucleotide triphosphate” comprises the appropriate natural ribonucleoside with a triphosphate group bonded to the 5′ hydroxy position. It is equivalent to a natural ribonucleoside triphosphate. In some embodiments a natural ribonucleotide triphosphate may be selected from cytidine 5′-triphosphate, uridine 5′-triphosphate, adenosine 5-triphosphate and guanidine 5′-triphosphate. A composition of natural ribonucleotide triphosphates (i.e. one comprising a ribonucleotide triphosphate as defined herein) may comprise combinations of varying amounts of these building blocks, in amounts sufficient to construct the target RNA molecule (for example as provided in NTP mix).
In one embodiment there is provided the use of a compound of formula (I) or a salt thereof to enzymatically prepare a tCO labelled RNA molecule.
A tCO labelled RNA molecule comprises at least one tCO residue but is otherwise similar to the natural RNA molecule (i.e. one with an unmodified cytosine residue at the same location as the tCO residue).
In some embodiments a tCO labelled RNA molecule may comprise >10%, >20%, >30%, >40%, >50%, >60%, >70%, >80%, >90% or 100% of tCO residues in place of unmodified cytosine residues.
In some embodiments a tCO labelled RNA molecule may comprise 10%-20%, 10%-30%, 10%-40%, 20%-50%, 30%-60%, 40%-70%, 50%-80% or 50%-90% of tCO residues in place of unmodified cytosine residues.
In one embodiment there is provided a process for preparing a tCO labelled RNA molecule comprising providing a DNA template to composition comprising a compound of formula (I) and a natural ribonucleotide triphosphate (for example a combination of varying amounts of cytidine 5′-triphosphate, uridine 5′-triphosphate, adenosine 5-triphosphate and/or guanidine 5′-triphosphate in amounts sufficient to construct the target RNA molecule, for example as provided in NTP mix), then treating the resultant mixture with an RNA polymerase.
In some embodiments the tCO labelled RNA molecule may be a tCO labelled mRNA.
In some embodiments the tCO labelled RNA molecule may encode for a protein fused to a fluorescent protein. Example fusable fluorescent proteins include Green Fluorescent Protein (GFP) and mFruit family proteins. When the target protein is fluorescent (either inherently or due to a tag), it is possible to simultaneously visualise both the labelled RNA molecule and the protein it is being used to synthesise, giving a greater degree of mechanistic insight.
In some embodiments the tCO labelled RNA molecule may encode for a protein selected from H2B, calmodulin, H2B:GFP and calmodulin-3:GFP.
The terminology “:GFP” means that the protein target preceding the colon is fused to a Green Fluorescent Protein (GFP) family protein.
In some embodiments the tCO labelled RNA molecule may encode for H2B:GFP.
In some embodiments the tCO labelled RNA molecule may encode for calmodulin-3.
In some embodiments the RNA polymerase may be selected from T7 polymerase and SP6 polymerase.
In some embodiments a process for preparing a tCO labelled RNA molecule may be carried out in the presence of transcription buffer (e.g. 5×transcription buffer), magnesium salt (e.g. magnesium(II) chloride) and/or an RNase inhibitor (e.g. Ribolock).
In some embodiments a process for preparing a tCO labelled RNA molecule may be carried out in the presence of transcription buffer (e.g. 5×transcription buffer), magnesium salt (e.g. magnesium(II) chloride) and an RNase inhibitor (e.g. Ribolock). In some embodiments a process for preparing a process for preparing a tCO labelled RNA molecule may be carried out substantially as described in the experimental section (e.g. as detailed in the section headed “H2B:GFP RNA transcription and purification”).
In one embodiment there is provided a kit for preparing a tCO labelled RNA molecule comprising:
In one embodiment there is provided the use of a tCO labelled mRNA molecule to prepare a protein encoded by the mRNA by translation.
“Translation” refers to the central biological process whereby mRNA is decoded in a ribosome to produce a specific polypeptide, which may fold into an active protein before performing its functions in a cell.
In one embodiment there is provided the use of a tCO labelled mRNA molecule to prepare a protein encoded by the mRNA by in vitro translation (for example, substantially as described in the part of the experimental section (e.g. as detailed under the heading “cell-free translation”).
In some embodiments, in vitro translation may be performed using E. coli bacterial lysates and/or the Expressway® mini cell-free expression system.
In one embodiment there is provided the use of a tCO labelled mRNA molecule to prepare a protein encoded by the mRNA by in cell translation (for example, substantially as described in the parts of the experimental section (e.g. as detailed under the headings “cell culture” and “electroporation or chemical transfection”).
In some embodiments, in cell translation may be performed in human neuroblastoma cells (e.g. SH-SY5Y cells).
In one embodiment there is provided the translation of a tCO labelled RNA molecule into a protein.
In one embodiment there is provided the in vitro translation of a tCO labelled RNA molecule into a protein.
In one embodiment there is provided the in cell translation of a tCO labelled RNA molecule into a protein.
In some embodiments the encoded protein may be fused to a fluorescent protein (for example a GFP or mFruit family protein). When this is the case, it is possible to simultaneously visualise both the labelled RNA molecule and the protein it is being used to synthesise, giving a greater degree of mechanistic insight.
In some embodiments the tCO labelled mRNA and the encoded protein may be simultaneously analysed spatiotemporally using confocal microscopy (for example fluorescence confocal microscopy).
This is a convenient way to simultaneously monitor a labelled RNA and protein, and RNA containing a fluorescent base analogue has never before been used in such live cell visualisation.
Compound (I) may be prepared according to the scheme shown in
Compound 1 was prepared according to the literature (Füchtbauer, A. F. et al., Sci. Rep. 7, 2393 [2017])
MS (ESI−) [M−H]−=634.5. 1H NMR (500 MHz, DMSO-d6) δ 10.61 (bs, 1H), 7.42 (d, J=7.7 Hz, 2H), 7.27-7.35 (m, 7H), 7.22 (t, J=7.1 Hz, 1H), 6.90 (dd, J=8.6, 4.2 Hz, 4H), 6.75-6.87 (m, 3H), 6.46 (d, J=7.8 Hz, 1H), 5.71 (d, J=3.6 Hz, 1H), 5.49 (bs, 1H), 5.18 (bs, 1H), 4.08 (d, J=5.3 Hz, 1H), 4.04 (s, 1H), 3.94 (s, 1H), 3.71 (s, 3H), 3.70 (s, 3H), 3.29 (d, J=4.8 Hz, 1H), 3.16 (d, J=9.1 Hz, 1H).
Amino-SynBase™ CPG 500/110 (LCAA) from LinkTech (Nu. 1397-C025, 1 g, 0.08 mmol) was activated by shaking in trichloroacetic acid 3% in DCE (8 mL, 0.08 mmol) for 18 h. The activated support was then filtered off and washed with 9:1 triethylamine:diisopropylethylamine (20 mL), dichloromethane (20 mL) and diethyl ether (20 mL). The activated support was dried under vacuum for 2 days before use. Subsequently, the support (1 g, 0.08 mmol), succinic anhydride (0.345 g, 3.44 mmol) and N,N-dimethylpyridin-4-amine (0.070 g, 0.57 mmol) were suspended in dry Pyridine (3 mL) under N2. The reaction mixture was then gently shaken at RT for 4 h. After 4 h, solvent was filtered off and the support washed successively with pyridine (20 mL), dichloromethane (20 mL), diethyl ether (20 mL) and air-dried. Negative ninhydrin test on a small portion of support proved full succinylation. Succinylated CPG could thereafter be kept at room temperature for several months.
In a 10 mL syringe with PTFE filter, succinylated support 3 (1.420 g, 82 μmol/g, 0.12 mmol), DMAP (0.028 g, 0.23 mmol), DIC (719 μl, 4.64 mmol), 1 (0.076 g, 0.12 mmol) and triethylamine (49 μl, 0.35 mmol) were suspended pyridine (5 mL). The mixture was gently shaken for 18 h at RT. After 18 h, the syringe was purged and the support washed with pyridine (5 mL), dichloromethane (5 mL) and diethyl ether. Subsequently, in the same syringe, DMAP (0.028 g, 0.23 mmol), diisopopylcarbodiimide (719 μl, 4.64 mmol), triethylamine (49 μl, 0.35 mmol) and 2,3,4,5,6-pentachlorophenol (0.309 g, 1.16 mmol) were added to the support and suspended in pyridine (4 mL). The mixture was gently shaken for 4 h at RT before a solution of piperidine (2 mL, 20% in DMF—for capping of the unreacted carboxylic acids on the support) was added for 1 min (longer exposure time will reduce loading as piperidine cleaves the ester bonds with the nucleoside), then quickly washed away with DMF (3×5 mL), dichloromethane (5 mL) and diethyl ether (5 mL). Finally, the resin was shaken in a CAP A+CAP B mix (50/50 v/v) for 2 hours under argon atmosphere, then washed with DMF (5 mL), dichloromethane (5 mL), diethyl ether (5 mL) and argon-dried (final loading: 13 μmol/g—determined by reading optical density of a DMT solution cleaved from a weighed amount of support—ε=70000 M-1·cm-1 at 498 nm). Final loading can be increased by performing a second coupling with 1 in the same conditions before capping (typical loading after second coupling 20-25 μmol/g). Concentrating the reaction mixture and washing the residue multiple times with water and diethyl ether allows recovery of nearly 85% of unreacted nucleoside 1.
Compound 5 was prepared according to the literature (Ducho, C. et al., J. Med. Chem. 50, 1335-1346 [2007]). Briefly, 5-chlorosalicylic acid was reduced with LAH (0.5 equiv.) at −20° C. and the resulting 5-chlorosalicylic alcohol was cyclized into 2,6-dichloro-4H-benzo[d][1,3,2]dioxaphosphinine using PCI3 (1.2 equiv.) and triethylamine (2.3 equiv.) at −20° C. under argon. Low temperature and use of triethylamine as the base were decisive in avoiding rapid and quantitative Arbuzov rearrangement of the desired product into the more stable 2,5-dichloro-3H-benzo[d][1,2]oxaphosphole 2-oxide. The crude 2,6-dichloro-4H-benzo[d][1,3,2]dioxaphosphinine was subsequently treated with diisopropylamine (3 equiv.) for 2 h at room temperature. The mixture was then filtered under argon, concentrated to dryness and taken in 20% diisopropylamine in heptane. Quick filtration on a small silica gel plug allowed desired compound 5 as a colourless oil, crystallizing over time at −20° C. Any attempt of more thorough column chromatography on compound 5 would lead to quantitative Arbuzov rearrangement.
1H NMR (500 MHz, DMSO-d6) δ=7.23 (dd, J=8.6, 2.6 Hz, 1H), 7.20 (d, J=2.4 Hz, 1H), 6.92 (d, J=8.6 Hz, 1H), 5.06 (dd, J=14.7, 5.2 Hz, 1H), 4.89 (dd, J=19.6, 14.8 Hz, 1H), 3.53-3.63 (m, 2H), 1.15-1.19 (dd, J=8.0, 7.0 Hz, 12H). 31P NMR (202 MHz, DMSO-d6) δ=136.00 (s, 1P).
Compound 6 was prepared according to the literature (Warnecke, S. & Meier, C., J. Org. Chem. 74, 3024-3030 [2009]).
1H NMR (500 MHz, D2O) δ 3.04-3.13 (m, 16H), 1.53 (bs, 16H), 1.24 (h, J=7.3, 16H), 0.83 (t, J=7.4, 24H). 31P NMR (202 MHz, D2O) 6=−10.78 (s, 2P).
Reactions were performed in a 5 mL syringe with PTFE filter loaded with 4 (800 mg, 0.016 mmol) under an argon atmosphere and with shaking.
Steps were performed as following:
HRMS (ESI-TOF) m/z calc. for C15H18N3O15P3[M+H]+: 574.0029, found: 574.0013; m/z calc. for C15H18N3O15P3 [M−H]−: 571.9878, found: 571.9872. 1H NMR (500 MHz, D2O) δ 7.44 (s, 1H), 6.84-6.94 (m, 3H), 6.79 (dd, J=7.5, 1.7 Hz, 1H), 5.91 (d, J=4.9 Hz, 1H), 4.36 (t, J=4.8 Hz, 1H), 4.29 (t, J=5.1 Hz, 1H), 4.25 (d, J=4.1 Hz, 3H). 13C NMR (126 MHz, D2O) δ 155.8, 154.8, 142.4, 129.4, 124.9, 124.3, 122.3, 116.6, 88.8, 82.8, 73.4, 69.7, 64.5. 31P NMR (202 MHz, D2O) 5-10.89 (d, J=18.5 Hz, 1P), −11.46 (d, J=19.7 Hz, 1P), −23.21 (t, J=19 Hz, 1P).
Compound (I) can also be made by a slightly modified route wherein the coupling step (b above) is carried out with a modified phosphoramidite such as 6-chloro-N,N-diisopropyl-4-methyl-4H-benzo[d][1,3,2]dioxaphosphinin-2-amine 8 (compound (IIIa) above). This reagent has been found to be more easily prepared, and compound 8 is obtainable in a yield of 60% compared to around 3-10% for the preparation of compound 5 under the conditions in this specification.
5-chloro-2-hydroxybenzaldehyde was reacted with methylmagnesium bromide (2.5 equiv.) at −20° C. and the resulting 4-chloro-2-(1-hydroxyethyl)phenol was cyclized into 2,6-dichloro-4-methyl-4H-benzo[d][1,3,2]dioxaphosphinine using PCI3 (1.2 equiv.) and triethylamine (2.3 equiv.) at −20° C. under argon. The crude 2,6-dichloro-4-methyl-4H-benzo[d][1,3,2]dioxaphosphinine was subsequently treated with diisopropylamine (3 equiv.) for 2 h at room temperature. The mixture was then filtered under argon, concentrated to dryness and taken in 20% diisopropylamine in heptane. Quick filtration on a small silica gel plug furnished desired compound 8 as a colourless oil.
1H NMR (500 MHz, DMSO-d6) 5=6.96 (d, J=8.5 Hz, 1H), 6.87 (d, J=8.5 Hz, 1H), 6.74 (d, J=8.4 Hz, 1H), 5.19-5.26 (m, 1H), 5.16 (dq, J=10.4, 6.6 Hz, 1H), 3.57 (tdt, J=13.6, 10.6, 6.8 Hz, 2H), 1.63 (d, J=6.6 Hz, 3H), 1.55 (d, J=6.4 Hz, 2H), 1.16-1.19 (m, 24H). 31P NMR (202 MHz, DMSO-d6) 5=137.63 (s, 1P), 127.90 (s, 1P).
The utility of compound (1) in RNA labelling was demonstrated by its cell-free in vitro transcription to produce fluorescent full-length messenger RNA (mRNA), from a DNA template encoding for H2B histone protein fused to GFP (H2B:GFP).
The template was codon optimized to limit the number of C repeats, preventing self-quenching and improving brightness. Efficient transcription and tCO incorporation was observed using two different bacteriophage RNA polymerases, T7 and SP6 at tCO TP/canonical CTP ratios ranging from 0 to 100% (full replacement), as demonstrated by agarose bleach gel electrophoresis (
All RNA transcripts run as one single band on the gels, with a size corresponding to the expected 1247 nt mRNA product (H2B:GFP), demonstrating that the full-length mRNA is formed. The tCO-containing mRNA bands could be directly visualized upon 302 nm excitation (
Furthermore, no shorter transcripts were observed, suggesting that T7 processes tCO TP correctly and without premature abortion. Higher order bands are apparent in all lanes of the gel (
Therefore, these results demonstrate that tCO can be successfully incorporated into full-length RNA transcripts even under conditions where all canonical CTP is replaced with tCO TP (0% CTP; i.e. 100% C-labelling efficiency).
A spectroscopic approach was used to quantify the incorporation efficiency of tCO TP, compared to the canonical CTP. To enable this, all RNA transcripts were purified using a Monarch RNA Cleanup kit, ensuring complete removal of unreacted tCO TP. Absorption spectra (
By relating the absorption of the purified RNA transcripts at 260 nm, which reflects their total concentration, to the absorption at 370 nm (emanating exclusively from tCO), the relative rate constants for the incorporation of CTP and tCO TP (kC and ktCO, respectively, see later for details). The calculated quotients kC/ktCO (
The emissive behaviour of tCO was also investigated in the mRNA transcripts exploring the relation to the tCO TP fraction added to the initial reaction mixture. A substantial decrease in fluorescence quantum yield (from 0.18 to 0.09,
This may be ascribed to electronic interaction (coupling) of molecular states of the tCO fluorophore and self-quenching effect caused by the expected increasing concentration of vicinal tCO s. Importantly, this quenching effect at high tCO fractions is balanced by the large overall number of incorporations and does not prevent visualization of the mRNA, even for transcripts where all Cs are replaced by tCO s.
In order to verify the functionality of the tCO-labelled mRNA transcripts the translation of a tCO-labelled Calmodulin-3 mRNA in cell-free conditions using bacterial lysates was investigated. The labelled mRNAs encoding for the 17 kDa protein were transcribed from a commercial Calmodulin-3 DNA template plasmid using the same tCO TP/CTP ratios as for the H2B:GFP encoding mRNA (0 to 100% of tCO TP). After RNA purification and cell-free translation, the presence of Calmodulin-3 was confirmed by Coomassie staining (
Electroporation was used to introduce in vitro-transcribed tCO-labelled mRNA transcripts into human neuroblastoma SH-SY5Y cells. Taking advantage of them encoding for a fluorescent fusion protein with nuclear localization (H2B:GFP), the translation was detected by fluorescence (
Live-cell confocal microscopy and flow cytometry showed that GFP fluorescence in the cell nuclei could be detected in 32, 25, 18, and 12% of the cells 24 hours post-electroporation for mRNA's containing 25, 50, 75 and 100% of tCO, respectively (
Using flow cytometry, the levels of H2B:GFP fluorescence in the cells was quantified (
Importantly, no evidence was found of mRNA-induced cell toxicity at 24 hours post-electroporation (
It is evident from the images in
H2B:GFP fluorescence was found to increase gradually with time between 24 h and 72 h (
When delivered using lipofectamine, the tCO-labelled mRNAs were found to promote very similar H2B:GFP translation compared to the corresponding non-labelled mRNA, as indicated by the fluorescence levels in
Importantly, the complexation of the tCO-labelled mRNA with lipofectamine enabled its direct visualization inside cells using live cell confocal microscopy (
The original coding sequence for H2B:GFP was taken from pCS2-H2B:GFP plasmid (Addgene, Plasmid #53744, manually codon-optimized to minimize the occurrence of poly-Cn stretches (n<3), in silico-assembled with an additional T7 promoter and other desired features (Shine-Dalgarno/Kozak consensus sequences for enhancement of translation and a 3×Stop, respectively at the 5′ and 3′ of the coding sequence itself, plus the needed HindIII/SnaBI restriction sites, to generate the ligation-prone sticky ends) and ordered from Twist Bioscience as a synthetic gene block. The obtained sequence was then PCR-amplified, using a Phusion Hot Start High-Fidelity Taq (Thermo Scientific), and subcloned into a HindIII/SnaBI-digested (Fast Digest enzymes, Thermo Scientific) empty pCS2 backbone. After ligation with T4 ligase for 1 h at room temperature (Roche), DH5a E. coli competent cells (Invitrogen) were transformed following the recommended protocol, and obtained colonies were screened by colony-PCR. The selected colony was then inoculated into a midiprep-scale volume of liquid Luria-Bertani growth medium (VWR) and plasmid DNA isolated using a PureLink Fast Low-Endotoxin Midi Plasmid Purification Kit (Thermo Scientific). The purified plasmid was finally digested again with HindIII/SnaBI and gel-purified, to generate the transcription template with the desired size.
Primers (Eurofins Genomics):
In-vitro transcription reactions, for T7 and SP6 polymerases (Thermo Scientific), were assembled as recommended by the corresponding protocols, with a few modifications that resulted in a consistently increased yield in all conditions:
In-vitro transcriptions were always performed at 20° C. for 14 h, then RNAs were purified using a Monarch RNA Cleanup kit (NEB), or homemade equivalent buffers and regenerated columns following the same rationale. It was possible to partially recover unreacted tCO TP from the transcription mixtures by HPLC to re-use for further assays. For cellular studies, each batch of RNA was then enzymatically added with a polyA tail (with a Poly(A) Polymerase, NEB protocol #M0276 with incubation extended to 1 hour) and a Cap 0 analogue (using a Vaccinia capping system, NEB protocol #M2080), following the recommended procedures.
For a qualitative check of all in vitro synthesized RNAs, a denaturing agarose gel was run, in presence of 1.5% bleach (Sigma Aldrich), as recommended in Aranda, P. S., LaJoie, D. M. & Jorcyk, C. L., Electrophoresis 33, 366-369 [2012]. RNAs were first mixed with a 6×DNA loading dye (Invitrogen) and then heat-denatured at 70° C. for 10 min in a heating block, then immediately transferred and kept on ice. The RiboRuler High Range RNA Ladder (Thermo Scientific) underwent the same treatment; 2 μl of RNA ladder were loaded along the samples and the gel was run at constant voltage (70 V) for 1 h and then imaged, under UV transillumination (302 nm) using a ChemiDoc Touch (BioRad). To counterstain the whole gel, and especially the lanes without tCO TP-containing samples, a standard ethidium bromide staining was finally performed at room temperature for 10 min and gentle rocking, followed by two washes in TAE and then a final wash in distilled water (10 min each).
Human neuroblastoma SH-SY5Y cells (Sigma-Aldrich) were grown in a 1:1 mixture of minimal essential medium (HyClone) and nutrient mixture F-12 Ham (Sigma-Aldrich) supplemented with 10% fetal bovine serum (FBS), 1% non-essential amino acids (Lonza) and 2 mM L-glutamine. For the tracking experiments, an in-house generated model of human hepatic Huh-7 cells stably overexpressing mRFP-Rab5 were cultured in DMEM/GlutaMax/High glucose (Gibco) supplemented with 10% FBS. The cells are detached with trypsin-EDTA 0.05% (Gibco) and passaged twice a week.
Cells were electroporated either with 9.7 μg of tCO TP (for in vitro incorporation experiments) or 100 ng of tCO-labelled mRNA per 105 cells (for in vitro translation, cytotoxicity assessment, flow cytometry analysis and confocal microscopy), using a Neon Transfection System (Invitrogen, Carlsbad, Calif., US) and following the protocol for 10 μL Neon Tip provided by the manufacturer, with a triple pulse of 1200 V and a pulse width of 20 ms. For chemical transfection, SH-SY5Y cells were seeded one day prior transfection at a density of 0.8 106 cells/mL, in 48-well plate or glass-bottomed culture dishes for flow cytometry or confocal microscopy analysis, respectively. Lipofectamine MessengerMAX was used as chemical reagent for transfection according to the manufacturer's instructions. Briefly, the reagent was diluted and incubated for 10 min at room temperature in Opti-MEM medium. The tCO-mRNA constructs were added to the reagent to reach a 1:1 final ratio reagent-mRNA (v/w), followed by a 5 min incubation at room temperature allowing the complex mRNA-lipid to form. Cells were incubated with this complex up to 72 h. To address the impact of the dye incorporation on RNA translation, SH-SY5Y cells were electroporated or chemical transfected with commercially available non-labelled (NL) or Cyanine5-labelled (Cy5) eGFP encoding mRNAs (Trilink®) has described here.
Cell membrane integrity was determined using the Pierce™ LDH Cytotoxicity Assay Kit (Invitrogen) according to the manufacturer's instructions. Briefly, LDH released in the supernatants of cells 24 h post-electroporated or post-transfected with tCO-labelled mRNA, or Cy5-mRNA, was measured with a coupled enzymatic assay which results in the conversion of a tetrazolium salt into a red formazan product. The absorbance was recorded at 490 nm and 680 nm. The toxicity was expressed as the percentage of LDH release in supernatant compared to maximum LDH release (supernatant+cell lysate). Data are means±SD from three experiments performed in triplicate.
Following electroporation of tCO-labelled mRNA, cells were seeded in 48-well plate (2.105 cells/well) and the expression of H2B:GFP in cells was quantified by flow cytometry. Briefly, 24 h, 48 h or 72 h post-electroporation or post-transfection with tCO-mRNA, non-labelled mRNA or Cy5-mRNA, cells were harvested and analysed on a Guava EasyCyte 8HTflow cytometer (Millipore). Data are mean fluorescence intensities±SD of gated single living cells from three experiments performed in triplicate. The average fluorescence intensities were baseline corrected by subtracting the signal for RNase-free water electroporated or transfected cells. All flow cytometry data were analysed in Flowing software (version 2.5.1) and displayed using R (http://www.R-project.org/). H2B:GFP: Excitation 488 nm; Emission 525-530 nm.
After electroporation, cells were seeded in glass-bottomed culture dishes (MatTek glass-bottomed or in 4-sectors subdivided CELLview dishes; 2.105 cells/chamber). For tracking experiment, the Huh-7 cells stably overexpressing mRFP-Rab5 were incubated with lipofectamine/tCO-mRNA complex and time-lapse was recorded up to 20 h post-chemical transfection. Confocal images were acquired on a Nikon C2+ confocal microscope equipped with a C2-DUVB GaAsP Detector Unit and using an oil-immersion 60×1.4 Nikon APO objective (Nikon Instruments, Amsterdam, Netherlands). Data were processed with the Fiji software. H2B:GFP: Excitation 488 nm; Emission 495-558 nm. tCO-labelled mRNA: Exc. 405 nm; Em. 447-486 nm. Cy5-labelled mRNA: Exc. 640 nm; Em. 652-700 nm. mRFP-Rab5: Exc. 561 nm; Em. 565-720 nm.
Cell-free translation reactions were performed using E. coli bacterial lysates and an Expressway™ Mini Cell-Free Expression System (Thermo Scientific). Calmodulin-like 3 protein is provided as a positive control plasmid (pEXP5-NT/CALML3) in the kit itself; this DNA vector contains a T7 polymerase promoter and a 6×His tag, hence it was first in-vitro-transcribed in presence of the desired concentrations of tCO TP (vide supra). The obtained RNAs, once purified, were used as templates for the cell-free translation reaction according to the manufacturer's recommendations: E. coli slyD—Extract—20 μl; 2.5×IVPS E. coli Reaction Buffer (-A.A.)—20 μl; 50 mM Amino Acids (-Met)—1.25 μl; 75 mM Methionine*—1 μl; T7 Enzyme Mix—1 μl (omitted when using tCO-labelled RNAs); DNA Template—1 μg (when testing the tCO-labelled RNAs, added the same amount of RNA instead); DNase/RNase-free distilled water qsp 50 μl.
Protein samples from in vitro translation experiments were quantified with a Qubit Protein Assay kit (Thermo Scientific), mixed with 6×SDS Laemmli reducing buffer (Alfa Aesar), then heat-denatured at 85° C. for 10 min and kept at room temperature until needed. Samples were generally run in 1 mm polyacrylamide 4-20% Novex MES/SDS gels (Thermo Scientific) and using a Mini Gel Tank, with the PSU set at constant voltage (200 V). For Coomassie staining, the gel was then washed three times in boiling water, to remove excess of SDS, on a benchtop shaker; a 1×Coomassie non-toxic staining solution was added to the gel and microwaved until initial boiling.
Gel was finally washed after the appropriate incubation time, to remove excess of background noise, in distilled water and imaged using a ChemiDoc Touch. For Western Blot, the gels were blotted onto PVDF LF ethanol-activated membranes (BioRad) with a TransBlot semi-dry system (BioRad), according to manufacturer's recommendations (settings for 1 mm-thick gels and mixed weight proteins). PVDF membranes were then washed 5 min in TBS-T (TBS and 0.1% Tween-20, Sigma Aldrich), blocked in 5% milk in TBS for 1 h at room temperature and incubated with the appropriate primary antibody dilutions.
After 3×5 min washes in TBST and an incubation of 1 h with the corresponding HRP-conjugated secondary antibodies, the membrane was washed again three times in TBS-T, once in TBS and once more in distilled water. Finally, membranes were incubated with a minimal volume of SuperSignal West Pico PLUS (Thermo Scientific) and imaged with a ChemiDoc Touch. Primary antibodies: mouse monoclonal anti-6×Histidine tag (Invitrogen) and mouse monoclonal anti-GAPDH (ref. 437000, Invitrogen), both diluted 1:1000 in 3% BSA/TBS-T. Secondary antibodies: HRP-conjugated polyclonal goat anti-Ms and anti-Rb Cross-Adsorbed IgG (H+L) (ref. A16072 and A16104, Invitrogen), used at 1:10000 dilution in TBS-T.
The tCO-RNA products from the cell-free transcription reactions (prior to polyadenylation and capping, see Methods: Bio for details) were measured as received, i.e. in RNAse free Milli Q water. All measurements were carried out at room temperature (ca. 22° C.) in a 3.0 mm path length quartz cuvette, with a sample volume of ca. 60 μL.
Absorption spectra were recorded on a Cary 5000 (Varian Technologies) spectrophotometer with a wavelength interval of 1.0 nm, integration time of 0.1 s, and a spectral band width (SBW) of 1 nm. All spectra were baseline corrected by subtracting the corresponding absorption from the solvent only. A second-order polynomial Savitzky-Golay (five points) smoothing filter was applied to all spectra. For samples exhibiting significant scattering, as evidenced by characteristic absorption in the long wavelength region (here for λ>475 nm), an additional correction was applied. The scattering contribution (Ascatter) to the absorption was in such cases fitted (using absorption at 550-475 nm as input) to the Rayleigh scattering function (equation S1), where c is a proportionality constant and A0 a constant, and then subtracted for all wavelengths.
Emission spectra were recorded on a SPEX Fluorolog (Jobin Yvon Horiba) fluorimeter with excitation at 356 nm. Emission was collected at a right angle with an integration time of 0.1 s and wavelength interval of 1 nm. Monochromator slits were adjusted to achieve optimal signal output, leading to SBWs in the interval 1.5-2.5 nm on both the excitation and emission side. Emission spectra were corrected for Raman scattering by subtracting the corresponding emission from a sample containing only solvent. A second-order polynomial Savitzky-Golay (five points) smoothing filter was applied to all spectra.
Sample fluorescence quantum yields (ΦF) were determined relative to a solution of quinine sulphate (Sigma) in 0.5 M H2SO4 (ΦF,REF=0.546) and calculated according to equation S2.
Emission spectra for the sample, IS(λ) and reference, IREF(λ), were integrated between λi=365 nm and λf=700 nm. Absorption at the excitation wavelength (356 nm) for the sample (As) and reference (AREF) were in the interval 0.05-0.11 for all samples. Adopted solvent refractive indices for the samples (water) and reference (0.5 M H2SO4) were ηS=1.333 and ηREF=1.339, respectively. All quantum yields are presented as mean±standard deviation of two independent cell-free transcription reactions.
Fluorescence lifetimes were determined using time-correlated single photon counting (TCSPC). Samples were excited using an LDH-P-C-375 (PicoQuant) pulsed laser diode with emission centred at 377 nm (FWHM pulse width was 1 nm and 70 ps with respect to wavelength and time, respectively), operated with a PDL 800-B (PicoQuant) laser driver at a repetition frequency of 10 MHz. Sample emission (458 nm, SBW=10 nm) was collected at a right angle, through an emission polarizer set at 54.9° (magic angle detection). Photon counts were recorded on a R3809U 50 microchannel plate PMT (Hamamatsu) and fed into a LifeSpec multichannel analyser (Edinburgh Analytical Instruments) with 2048 active channels (24.4 ps/channel), until the stop condition of 104 counts in the top channel was met. The instrument response function (IRF) was determined using a frosted glass (scattering) modular insert while observing the emission at 377 nm (SBW=10 nm).
The intensity decays were fitted with IRF re-convolution to the multiexponential model shown in equation S3.
The least-square re-convolution fitting procedure was carried out using the DecayFit software (http://www.fluortools.com/software/decayfit). All decays were fitted to a tri-exponential (n=3) model. The presented lifetimes are amplitude-weighted average lifetimes (
The ratio of the rate constants for cytosine vs. tCO incorporation (kC/ktC
Solving S5 and S6 for the respective rate constants renders equation S7, in which [C] and [tCO] denote the concentration of incorporated C and tCO, respectively.
Using the Lambert-Beer law, absorption is related to nucleobase concentration according to equations S8 and S9. The following molar absorptivities (unit: M−1 cm−1) were adopted:
Assuming that the product RNA is uniform in size (1247 nucleotides), its base composition (A: 408, U: 272, G: 307, C: 260) allows for equations S10-S13.
[tC0]+[C]=[RNA]×260 (S10)
[A]=[RNA]×408 (S11)
[U]=[RNA]×272 (S12)
[G]=[RNA]×307 (S13)
Solving the equation system composed of S7 through S13 allows for quantification of kc/ktCO, [tC0], [RNA], [C], [A], [U], and [G]. The average-strand tCO incorporation degree (θtC
Using the volume of the cell-free reaction (50 μL) and resulting product solution (100 μL), equation S15 was applied to calculate the tC0 incorporation yield (ηtC
Consequently, the RNA yield (ηRNA) was calculated according to equation S16.
This specification demonstrates that an artificial, size-expanded analogue of cytosine takes the role of natural cytosine and is correctly recognized by several enzymatic machineries, including the ribosome. This fluorescent base analogue, tCO, is demonstrated to be a suitable intrinsic imaging label of different size RNAs which minimally perturbs native properties and is compatible with enzymatic labelling processes.
Modified transcripts are non-toxic and translationally active both in bacterial lysate and in eukaryotic systems, regardless of their degree of tCO incorporation. This conveniently allows for simultaneous monitoring of mRNA uptake and translation into H2B:GFP in live-cell confocal microscopy using selective excitation, an approach that should be applicable to the translation of any protein similarly tagged with a GFP family protein.
The intrinsic fluorescence RNA-labelling methodologies disclosed herein are therefore excellent non-invasive ways to, in real time, elucidate cellular trafficking mechanisms such as endosomal escape or exosomes formation, both of which are of fundamental importance for pharmaceutical applications. As such the technology for live cell imaging should enable new and improved delivery strategies for next-generation nucleic acid-based drugs.
Number | Date | Country | Kind |
---|---|---|---|
2009705.1 | Jun 2020 | GB | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2021/067342 | 6/24/2021 | WO |