The instant application contains an electronic sequence listing. The contents of the electronic sequence listing H2998864.xml; Size: 34,034 bytes; and Date of Creation: May 10, 2024 is herein incorporated by reference in its entirety.
Nucleic acids are the basic components of all life and are essential for cell activities such as division, migration, apoptosis, and proliferation. To modify RNAs' structure and function, more than 170 chemical alterations are commonly added to both coding and non-coding regions. RNA modifications such as methylation, acetylation, phosphorylation, glycosylation, palmitoylation, and prenylation not only diversify RNA structure but also play important functional roles in normal cell growth and development as well as response regulations to environmental stress. Furthermore, chemical groups that modify the canonical nucleobases have been shown to involve in regulating the interactions of RNAs with other biomacromolecules such as proteins, lipids, sugars, and other forms of nucleic acids. Dysfunctional RNA alterations are connected to a variety of human diseases, including cancer and viral infections as well as developmental abnormalities and cancer, extending the networks of interaction. Common chemical modifications include as m6A and m5C, on various RNA species, playing crucial functions in biological processes.
A significant hydrophobic, C10H15 lipid moiety modification has been identified on the wobble position of some specific bacterial tRNAs as a geranyl group attached to the sulfur atom at position 2 of uridine (ges2U). This geranyl group modifies about 0.4% of tRNAs, specifically for lysine, glutamine and glutamic acid in Escherichia coli, Enterobacter aerogenes, Pseudomonas aeruginosa, and Salmonella enterica serovar Typhimurium. The geranyl modification improves tRNA translation fidelity and reduces frameshifting errors, yielding tRNA that preferentially base pairs with G over A, resulting in a significant improvement in base pairing discrimination and codon recognition. Geranylated tRNA may also be a transient intermediate in the selenation process, another important tRNA modification pathway in bacteria.
In general, it has been discovered that lipid-like hydrophobic prenyl groups, such as geranyl, farnesyl, and geranyl-geranyl, change both proteins and RNAs with important activities. Many organic substances, including protein enzymes like KRAS and caryophyllene are prenylated. The farnesyl and geranylgeranyl groups on cysteine in the CAAX motif of RAS superfamily proteins are important for signaling transduction in anchoring the proteins to the cell membrane. 2-selenouridine synthase (SelU), a conserved enzyme essential for efficient bacterial growth, mediates geranylation in ges2U, but understanding of the fundamental regulations that govern geranylation process and its functions is still relatively restricted. Nothing is known about the regulatory relationships between geranylation and other reader, writer, and eraser enzymes. Biochemical tools that allow for the labeling of lipidated RNAs to enable the study of hydrophobic modifications in normal and diseased biological processes are lacking. Incorporating similar lipid analogs into tRNAs could also provide a useful platform for the development of new molecular tools for labeling, detecting, and isolating specific tRNAs for in vitro, in situ, and in vivo applications.
Hydrophobic prenyl groups (attached to i6A ribonucleotide analogs introduced by the Mia family enzyme) have also been discovered in tRNAs. These modifications are usually found in the position 37 of anticodon stem loops with great biological significance in enhancing base pairing specificity, codon recognition fidelity and translational efficiency. Prenyl modification of tRNA affects plant root growth. In addition, they have been actively involved in a variety of human diseases such as mitochondrial respiratory chain defects, Alzheimer's disease, breast cancer and diabetes. The prenyl groups including terpenes compounds have also been known as the fundamental feedstock in a wide range of biological processes that play key roles in metabolism and human diseases. The transcriptome-wide mapping of 2-methylthio-i6A (ms2i6A), a common i6A analog, has been achieved through chemoselective bioconjugation of the methylthio group. Elucidating the biological significance of these prenyl-modified RNAs remains very challenging due to the lack of genome wide detection and sequencing tools.
Antigen-antibody specificity is one of the main methodologies to identify and profile RNA modifications, and chemical pulldown methods have been used to study modified residues in RNAs such as hm5C, f5C, and pseudouridine, etc. In contrast to using antibodies to recognize specific epitope of antigen, chemical pulldown strategies target defined functionalities on RNAs and employ highly selective chemical probes to tag the RNAs containing these modifications within the complex cell environment. However, there remains a need for more general approaches targeting the prenyl-group to better profile these modifications
The present disclosure is directed to overcoming these and other deficiencies in the art.
In an aspect, provided is a compound of Formula I:
wherein X1 may be —C(═O)— or —CH2—, each may be, independently, a single bond or a double bond, X2 may be —NH— or —C(CH3)—, m may be 0 or 1, X3 may be —CH2— or —C(CH3)—, n may be an integer of from 0 to 8, and Y may be N3 or
wherein Z may be a fluorescent label. n may be an integer of from 1 to 8. m may be 1. m may be 1 and n may be 8. m may be 0 and n may be 1.
Each of X2 and X3 may be —C(CH3)— and each may be a double bond. X1 may be —CH2—, X2 may be —C(CH3)—, each may be a double bond, m may be 1, X3 may be —C(CH3)—, and n may be 1; or X1 may be —C(═O)—, X2 may be —NH—, each may be a single bond, m may be 0, X3 may be —CH2—, and n may be 1. Z may be a fluorescent label selected from Cy5, FITC, and BODIPY.
In another aspect, provided is a method of forming a tagged tRNA, including contacting a 2-thiouridine tRNA with a compound of Formula I:
wherein X1 may be —C(═O)— or —CH2—, each may be, independently, a single bond or a double bond, X2 may be —NH— or —C(CH3)—, m may be 0 or 1, X3 may be —CH2— or —C(CH3)—, n may be an integer of from 0 to 8, and Y may be an azide or
wherein Z may be a fluorescent label; and a step for covalently attaching the compound of Formula I to the 2-thiouridine tRNA. n may be an integer of from 1 to 8. m may be 1. m may be 1 and n may be 8. m may be 0 and n may be 1.
Each of X2 and X3 may be —C(CH3)— and each may be a double bond. X1 may be —CH2—, X2 may be —C(CH3)—, each may be a double bond, m may be 1, X3 may be —C(CH3)—, and n may be 1; or X1 may be —C(═O)—, X2 may be —NH—, each may be a single bond, m may be 0, X3 may be —CH2—, and n may be 1.
Y may be an azide, and forming the tagged tRNA may further include contacting the azide with a DBCO-activated fluorescent label to cause a click-chemistry reaction therebetween. Y may be
and Z may be a fluorescent label.
The step for covalently attaching the compound of Formula I to the 2-thiouridine tRNA may occur intracellularly. The step for covalently attaching the compound of Formula I to the 2-thiouridine tRNA may occur extracellularly. Z may be a fluorescent label selected from Cy5, FITC, and BODIPY.
In another aspect, provided is a method of forming a tagged tRNA, comprising contacting a prenylated tRNA with a 4-phenyl-1,2,4-triazoline-3,5-dione activated fluorescence label to cause an ene reaction therebetween.
In another aspect, provided is a method of adding a label to a ribonucleotide including a prenyl group, comprising contacting the ribonucleotide with an activated label, wherein the ribonucleotide includes an N6-isopentenyladenosine-prenylated RNA and the activated label includes the label coupled to 4-phenyl-1,2,4-triazoline-3,5-dione, and wherein the contacting comprises forming a covalent attachment between the prenyl group and the 4-phenyl-1,2,4-triazoline-3,5-dione. The N6-isopentenyladenosine-prenylated RNA may include a polynucleotide. The label may include a fluorine. The label may include a fluorescent label. The fluorescent label may be selected from Cy5, FITC, and BODIPY.
In another aspect, provided is a method of sequencing a polyribonucleotide comprising a prenyl group, including contacting the prenyl group of the ribonucleotide with I2, wherein the ribonucleotide includes an N6-isopentenyladenosine-prenylated RNA and contacting the prenyl group of the ribonucleotide with I2 includes causing a cyclisation reaction of the prenyl group, reverse transcribing the polyribonucleotide to form a cDNA of the polyribonucleotide, and sequencing the cDNA of the polyribonucleotide. The method may further include reverse transcribing a control RNA having the same sequence as the polyribonucleotide to form a control cDNA, wherein the control RNA was not contacted with I2, sequencing the control cDNA, and detecting an N6-isopentenyladenosine in the polyribonucleotide wherein detecting comprises identifying a C or G in a position of the cDNA of the polyribonucleotide corresponding to a position of an A in the control cDNA.
These and other features, aspects, and advantages of the present disclosure will become better understood when the following detailed description is read with reference to the accompanying drawings, wherein:
This disclosure relates to a method of labeling lipidated ribonucleosides, and also of identifying lipidated ribonucleosides by cyclization followed by reverse transcription and sequencing of cDNA. In one respect, a significant hydrophobic modification is present on the wobble position of some specific bacterial tRNAs (ges2U,
Also disclosed herein is an Ene-ligation strategy for probing chemical properties of a modified ribonucleotide's prenyl group. Hydrophobic prenyl groups (mainly in i6A analogs introduced by the Mia family enzyme,
Disclosed are analogues of, for example, geranyl-pyrophosphate (geranyl-OPPi), Ge1:
Compounds include a compound of Formula I:
In an aspect, provided is A compound of Formula I:
wherein X1 may be —C(═O)— or —CH2—, each may be, independently, a single bond or a double bond, X2 may be —NH— or —C(CH3)—, m may be 0 or 1, X3 may be —CH2— or —C(CH3)—, n may be an integer of from 0 to 8, and Y may be N3 or
wherein Z may be a fluorescent label. n may be an integer of from 1 to 8. m may be 1. m may be 1 and n may be 8. m may be 0 and n may be 1. Some examples are shown in
A compound of Formula I may be added to a tRNA ribonucleotide, including a tRNA polyribonucleotide, such as by an enzyme such as SelU. As disclosed herein, inclusion of a reactive moiety for Y in Formula I allows for subsequent addition of a label thereto, where the label includes a chemical group that reacts with the chemical moiety of Y. Chemical moieties and chemical groups that may for covalent attachments to each other via click chemistry reactions may be used, for example addition of an azide group at the Y position of a compound of Formula I, and addition of a cyclooctyne such as dibenzocyclooctyne (DBCO) added to a label. A click chemistry reaction may then be performed to covalently attach the label to the compound of Formula via click chemistry.
A compound of Formula I may be attached to a tRNA during a first reaction, such as in a reaction catalyzed by an enzyme such as SelU, before a label is attached to the compound of Formula I. A second reaction, such as a click chemistry reaction, may then be performed to attach a label activated with a click chemistry appropriate chemical group, resulting in attachment of the label to the tRNA. In another example, a label may be attached at the Y of Formula I (e.g., via a click chemistry reaction) before attachment of the compound of Formula I to an RNA. Subsequent attachment of such a compounds of Formula I to an RNA may result in covalent attachment of the label to the RNA.
A compound of Formula I as disclosed herein may be Ge5 or Ge6:
Or, the compound of Formula I may be Ge5 or Ge6 further modified by a click chemistry reaction between the azide of Ge5 or Ge6 with a cyclooctyne of a label, such as a DBCO-activated label. The label may be a fluorescent label. Also disclosed herein is a tRNA with a compound of Formula I, such as Ge5 or Ge6, covalently attached thereto, such as via its 2-thiouridine residue as disclosed herein. Or, in such an example, rather than such an example of a compound of Formula I where the Y position is a moiety for participation in a click chemistry reaction such as an azide as in Ge5 and Ge6, a label may be attached at the Y position of said compound of Formula I, whether by an azide-DBCO connection or other as disclosed above.
A label as disclosed herein may be any chemical structure for identifying, locating, isolating, tagging, or otherwise modifying an RNA. In an example, a label may be a fluorescent label, such as Cy5, FITC, BODIPY, or any other fluorescent composition. Some examples of fluorescent labels include for possible use in a method as disclosed herein include FAM, TAMRA, Cyanine Dye Cy3, Cy3.5, Cy5, Cy5.5, Cy7, Cy7.5, EDANS/Dabcyl, Alexa Fluor, BODIPY FL, and ATTO dyes. Any one of the foregoing fluorescent label may be modified by addition or inclusion of a chemical group and the Y position of a compound of Formula I modified with a complementary moiety whereby the chemical group and the complementary moiety form a bond-forming pair as disclosed above, suitable for attachment of the label to the compound of Formula I and, thus, to a tRNA, e.g. via enzyme (e.g. SelU)-mediated attachment to a 2-thiouridine residue of the tRNA.
Attachment of a label to a compound of Formula I to form a labeled compound of Formula I, or of a compound of Formula I to a tRNA, or a labeled compound of Formula I to a tRNA, or of a tRNA-attached compound of Formula I to a label, or any combination of the foregoing, may be accomplished extracellularly or intracellularly. For intracellular attachment, as disclosed herein, a cell may either intrinsically express an enzyme for enzyme-mediated attachment of a compound of Formula I or labeled compound of Formula I to a 2-thiouridine residue of a tRNA or a transgene may be introduced into the cell to cause the cell to express an enzyme that catalyzes addition of a compound of Formula I or a labeled compound of Formula I to a 2-thiouridine residue of a tRNA, such as SelU as disclosed herein. Or, as disclosed herein, an in vitro method for attaching a compound of Formula I, or labeled compound of Formula I, to a to a 2-thiouridine residue of a tRNA may involve contacting the tRNA and the compound with an enzyme that may catalyze their attachment to each other, such as SelU. Such examples of enzyme-mediated, including SelU-mediated, attachment of a compound of Formula I, or a labeled compound of Formula I, to a 2-thiouridine residue of a tRNA, are a step for covalently attaching a compound of Formula I, or labeled compound of Formula I, to a 2-thiouridine of a tRNA.
Also disclosed is an example of attaching a fluorescent label to a geranylated RNA via Ene ligation. As disclosed herein, direct labelling of prenyl-containing RNAs may be accomplished by such a method. Some tRNAs contains geranyl groups on the wobble position of 34U. As disclosed herein, a 4-phenyl-1,2,4-triazoline-3,5-dione (PTAD) activated fluorescence dye molecule can be directly attached to such terpene terminals through an Ene-reaction, where PTAD reacts with conjugated diene group. For example, tRNA possessing a geranyl group may be fluorescently labeled by contacting it with a PTAD-bound fluorescent label (as a non-limiting example, by way of illustration only, PTAD-DBCO-Cy5). A PTAD-mediated attachment of a label to a geranylated tRNA is a step for covalently attaching a compound of Formula I, or labeled compound of Formula I, to a 2-thiouridine of a tRNA.
Also disclosed herein is synthesis of i6A triphosphates (i6ATP) and incorporation thereof into RNA strands, and labeling of such compositions. For example, PTAD-activated label such as fluorescent label, as disclosed herein, ay directly bond to an i6A moiety of an RNA (e.g., tRNA, mRNA, rRNA, or other RNA). Such PTAD-mediated attachment of a label, including a fluorescent label, to an i6A of an RNA strand is a step for performing such a labeling method.
Also disclosed herein is a method of cyclizing i6A moieties of RNA by addition of I2, a reagent for the cyclisation reaction of a prenyl group. Incubating RNA into which i′A has been incorporated (e.g., by an RNA polymerase by adding i6A during an RNA polymerization, in living cells or in vitro) then contacting the RNA with I2 causes a cyclisation of the i6A which can subsequently be detected. Adenosine with prenyl modification displays normal base pairing with thymidine. By contrast, the addition of iodine induces the transformation of i6A to its cyclic form, thereby diminishing the normal base pairing specificity and resulting in a mixture of complementary G/C/U (
The following examples are intended to illustrate particular embodiments of the present disclosure, but are by no means intended to limit the scope thereof.
To a stirred solution containing (E)-1-bromo-3,7-dimethylocta-2,6-diene (50.0 mg, 0.23 mmol) dissolved in acetonitrile (1.5 ml), tris (tetra-N-butylammonium) hydrogen pyrophosphate (200 mg, 0.48 mmol) was carefully and gradually introduced. This reaction mixture was maintained under a nitrogen atmosphere and stirred at room temperature for a period of 5 hours. Upon completion of the reaction, the crude product was subjected to purification through high-performance liquid chromatography (HPLC). Subsequently, the purified product was freeze-dried to obtain a dry solid, yielding a total of 4.3 mg with a percent yield of 6%.
White solid. 1H NMR (400 MHz, D2O) δ 5.37 (t, J=6.8 Hz, 1H), 5.11 (t, J=6.7 Hz, 1H), 4.38 (t, J=6.4 Hz, 2H), 2.06 (d, J=7.4 Hz, 2H), 2.01 (d, J=6.7 Hz, 2H), 1.60 (d, J=4.4 Hz, 6H). 31P NMR (162 MHz, D2O) δ −6.85 (d, J=22.68 Hz), −10.57 (d, J=22.68 Hz). MS (ESI): Calculated [M+Na]+=337.2. Found [M+Na]+=337.0.
In a well-stirred solution, 3,7-dimethyloct-6-en-1-ol (150 mg, 0.96 mmol) was mixed with carbon tetrabromide (414 mg, 1.25 mmol) and triphenylphosphine (387 mg, equal to 1.47 mmol) in a volume of 3.0 ml of dichloromethane. This reaction mixture was then stirred at a controlled temperature of 0 degrees overnight under an inert nitrogen atmosphere. The crude product was purified using silica gel flash chromatography, which yielded 8-bromo-2,6-dimethyloct-2-ene as a yellow oil (150 mg, 0.69 mmol, 71% yield).
Subsequently, to a fresh solution containing 8-bromo-2,6-dimethyloct-2-ene (80.0 mg, 0.37 mmol), a solution of tris (tetra-N-butylammonium) hydrogen pyrophosphate (423 mg, 1.01 mmol) in 1.5 ml of acetonitrile (CH3CN) was gently added. This new mixture was again stirred for a duration of 5 hours at room temperature, maintaining an inert nitrogen atmosphere throughout. Upon completion, the resulting crude product was further purified using high-performance liquid chromatography (HPLC). Following purification, the product was lyophilized to obtain a dry solid, with a final yield of 8.2 mg and a percentage yield of 7%.
1H NMR (400 MHz, CDCl3) δ 5.13-5.02 (m, 1H), 4.10 (q, J=7.1 Hz, 1H), 3.52-3.30 (m, 2H), 2.01-1.91 (m, 2H), 1.95-1.83 (m, 2H), 1.67 (s, 3H), 1.59 (s, 3H).
Compound Ge2: 1H NMR (400 MHz, D2O) δ 5.21 (t, J=7.2 Hz, 2H), 4.02-3.85 (m, 4H), 2.01-1.96 (m, 3H), 1.65 (s, 6H), 1.47-1.38 (m, 3H), 1.15 (dd, J=14.7, 6.6 Hz, 2H), 1.06 (t, J=7.5 Hz, 1H), 0.85 (m, 3H). 31P NMR (162 MHz, D2O) δ −8.45 (d, J=17.3 Hz), −10.53 (d, J=21.06 Hz). MS (ESI): Calculated [M+H]+=336.2. Found [M+H]+=336.3.
A stirred solution containing 5-bromo-2-methylpent-2-ene (50.0 mg, 0.31 mmol) was gradually treated with a solution of tris (tetra-N-butylammonium) hydrogen pyrophosphate (331 mg, 0.79 mmol) dissolved in 1.5 ml of acetonitrile (CH3CN). The combined mixture was then stirred for a period of 5 hours at room temperature under an inert nitrogen atmosphere. Upon completion of the stirring period, the resultant crude product was purified using high-performance liquid chromatography (HPLC). Following purification, the product was subjected to lyophilization to achieve a dry state, yielding 4.8 mg with a 6% yield.
White solid. 1H NMR (400 MHz, D2O) δ 5.17 (s, 1H), 3.81 (s, 2H), 2.27 (s, 2H), 1.84 (d, J=101.2 Hz, 6H). 31P NMR (162 MHz, D2O) δ −6.52 (d, J=22.68 Hz), −10.59 (d, J=21.06). MS (ESI): Calculated [M+H]+=536.4. Found [M+H]+=536.8.
3-Benzoylbenzoic acid (450 mg, 1.99 mmol) was dissolved in a mixture of dichloromethane (4.4 ml) along with a small quantity of dimethylformamide. To this solution, thionyl chloride (SOCl2, 0.3 ml, 3.97 mmol) was added, and the resulting mixture was continuously stirred at a temperature maintained at reflux for an overnight duration. After the reaction had completed, it was sequentially washed with a 0.1 M sodium hydroxide solution and subsequently extracted using dichloromethane. The organic layer was then dried over anhydrous sodium sulfate. Next, the solvents were removed via evaporation, and the residue obtained was immediately utilized for the subsequent step without further purification or isolation.
Approximately 200 mg of compound 4 (0.82 mmol) was combined with (2E, 6E)-8-((tert-butyl dimethylsilyl)oxy)-3,7-dimethylocta-2,6-dien-1-ol (125 mg, 0.44 mmol) in a reaction mixture that also included triethylamine (Et3N, 0.2 ml, 1.44 mmol) and 4-dimethylaminopyridine (DMAP, 4.0 mg, 0.033 mmol) dissolved in 2.0 ml of pyridine. This mixture was then stirred consistently for a duration of 5 hours at a temperature of 65 degrees. The reaction was subsequently terminated by the addition of water, followed by a washing step with brine and extraction using dichloromethane. The organic phase was then dried over anhydrous sodium sulfate. Next, the crude product underwent purification through silica-gel flash chromatography, ultimately yielding compound 5 as a white solid (101.9 mg, 47% yield).
Compound 5 (230 mg, 0.47 mmol) and TBAF (600 μl, 0.6 mmol) were combined in 1.5 mL THF and stirred under a nitrogen atmosphere at 30° C. overnight. The reaction was halted using saturated NH4Cl. After removing THE, the residue was extracted with ethyl acetate and then dried over Na2SO4. Purification by silica gel flash chromatography resulted in compound 6 with a yield of 159.8 mg (90%).
At 0° C., to a solution of compound 6 (159.8 mg, 0.42 mmol) in 2.0 ml of dichloromethane, PBr3 (42.5 μl, 0.44 mmol) was cautiously added. The reaction mixture was stirred overnight at room temperature. Upon completion, it was quenched with water, washed with sodium bicarbonate and brine, then extracted with dichloromethane and dried over anhydrous Na2SO4. The crude product was purified by silica gel flash chromatography, yielding compound 7 (180.0 mg, 0.41 mmol, 97% yield).
Under N2 at 0° C., 0.41 mmol of compound 7 (180.0 mg) dissolved in 0.5 mL CH3CN was added dropwise to a solution of 0.86 mmol tris (tetra-N-butylammonium) hydrogen pyrophosphate (360.0 mg) in 0.5 mL CH3CN. The reaction was stirred at room temperature for 3 hours and then purified by HPLC, yielding a white solid (10.9 mg, 5% yield).
Compound 5: 1H NMR (400 MHz, CDCl3) δ 8.44 (d, J=7.2 Hz, 1H), 8.26 (t, J=5.2 Hz, 1H), 8.02-7.96 (m, 1H), 7.80 (d, J=8.9 Hz, 2H), 7.61 (t, J=8.7 Hz, 1H), 7.56 (d, J=7.7 Hz, 1H), 7.50 (t, J=7.6 Hz, 2H), 5.47 (t, J=7.0 Hz, 1H), 5.37 (t, J=6.7 Hz, 1H), 4.86 (d, J=7.1 Hz, 2H), 3.99 (s, 2H), 2.21-2.13 (m, 2H), 2.14-2.07 (m, 2H), 1.59 (s, 5H), 0.89 (s, 9H), 0.05 (s, 6H). 13C NMR {1H} (101 MHz, CDCl3) δ 165.85 (s), 142.57 (s), 137.93 (s), 137.08 (s), 134.84 (s), 133.97 (s), 133.22 (s), 132.80 (s), 131.00 (s), 128.47 (s), 123.58 (s), 118.23 (s), 68.50 (s), 39.26 (s), 25.96 (s), 18.43 (s), 16.59 (s), 13.47 (s), 0.01 (s).
Compound 6: 1H NMR (400 MHz, CDCl3) δ 8.24 (d, J=7.9 Hz, 2H), 7.96 (t, J=6.5 Hz, 2H), 7.48 (t, J=7.6 Hz, 5H), 5.50 (dd, J=15.2, 8.3 Hz, 1H), 5.44 (d, J=7.0 Hz, 1H), 4.83 (d, J=7.1 Hz, 2H), 4.69 (s, 2H), 2.24-2.16 (m, 2H), 2.15-2.06 (m, 2H), 1.75 (s, 2H), 1.70 (s, 3H). 13C NMR {1H} (101 MHz, CDCl3) δ 196.37 (s), 166.36 (s), 142.63 (s), 138.46 (s), 137.56 (s), 134.56 (d, J=4.0 Hz), 133.73 (s), 133.35 (s), 131.52 (d, J=5.1 Hz), 130.86 (s), 130.60 (s), 129.68 (s), 129.23-128.90 (m), 119.10 (s), 71.43 (s), 62.70 (s), 39.41 (s), 30.23 (s), 17.13 (s), 14.63 (s).
Compound 7: 1H NMR (400 MHz, CDCl3) δ 8.43 (d, J=1.5 Hz, 2H), 8.25 (d, J=7.7 Hz, 2H), 7.80 (d, J=7.0 Hz, 5H), 5.22 (dd, J=17.3, 1.0 Hz, 1H), 5.10-5.02 (m, 1H), 4.38 (d, J=6.0 Hz, 2H), 3.95 (s, 2H), 2.15-2.08 (m, 2H), 2.07-2.02 (m, 2H), 1.74 (d, J=5.5 Hz, 6H). 13C NMR {1H} (101 MHz, CDCl3) δ 195.98 (s), 166.03 (s), 138.14 (s), 137.23 (s), 134.22 (s), 133.34 (s), 133.02 (s), 132.27 (s), 131.52 (s), 131.15 (s), 130.93 (s), 130.34 (d, J=15.1 Hz), 129.69 (s), 128.72 (d, J=9.6 Hz), 121.33 (s), 68.73 (s), 63.97 (s), 36.31 (s), 35.60 (s), 29.81 (s), 25.90 (s), 19.60 (s), 14.84 (s).
White solid. Pyrophosphate Ge3: 1H NMR (400 MHz, D2O) δ 8.32 (d, J=6.5 Hz, 4H), 8.11 (d, J=8.1 Hz, 2H), 7.83 (d, J=7.6 Hz, 5H), 7.76 (t, J=8.1 Hz, 2H), 7.65 (t, J=7.7 Hz, 4H), 5.61 (t, J=5.8 Hz, 2H), 4.44 (s, 2H), 4.36 (d, J=5.8 Hz, 3H), 2.13-2.07 (m, 2H), 1.89-1.82 (m, 2H), 1.72 (s, 6H). 31P NMR (162 MHz, D2O) δ −9.71 (d, J=26.3 Hz), −10.86 (d, J=24.3 Hz). MS (ESI): Calculated [M+H]+=536.4. Found [M+H]+=536.8.
(E)-3,7-dimethylocta-2,6-dien-1-ylacetate (201.0 mg, 1.03 mmol) was reacted with t-BuOOH (122 μL, 2.06 mmol) and SeO2 (17.1 mg, 0.15 mmol) in 2.0 ml dichloromethane at room temperature for 3 days. The cooled mixture to 0° C. was then treated with NaBH4 (total 600 mg in six portions) until defoaming ceased after 5 hours stirring. The reaction was washed with water, extracted with dichloromethane, and dried over Na2SO4. Purification via silica gel flash chromatography gave compound 8 (120 mg, 0.57 mmol, 55% yield).
Subsequently, compound 8 (120 mg, 0.57 mmol) was then reacted with PBr3 (1.31 g, 4.83 mmol) in 0.8 mL diethyl ether for 4 hours at an ice bath under a nitrogen atmosphere. Further purification of the crude residue through silica gel flash chromatography led to the isolation of the desired product 9 (141.2 mg, 0.51 mmol, 90% yield).
Compound 9 (50.0 mg, 0.18 mmol) and NaN3 (23.0 mg, 0.35 mmol) were dissolved in 1.00 mL DMF and stirred under reflux at 75° C. overnight. Silica gel flash chromatography purified the reaction mixture, yielding compound 10 (40.0 mg, 0.17 mmol) with a 94% yield.
To a stirred solution of compound 10 (40.0 mg, 0.17 mmol) in methanol (2.0 ml), potassium carbonate (58.0 mg, 0.42 mmol) was added and stirred for 3 hours at room temperature. The mixture was then washed with deionized water, extracted with ethyl acetate, and dried over anhydrous sodium sulfate. This crude product was transferred into anhydrous methanol (5.0 ml), the reaction was stirred overnight at room temperature under a nitrogen atmosphere. After treating with saturated sodium bicarbonate and further drying over anhydrous Na2SO4, the crude product was purified through silica gel flash chromatography with a 10% ethyl acetate in petroleum ether (PE) eluent, affording compound 11 (33.0 mg, 0.17 mmol, 100% yield).
Compound 11 (33.0 mg, 0.17 mmol) was dissolved in 1.0 ml dichloromethane, followed by the addition of MsCl (20.0 μL, 0.26 mmol) and Et3N (40.0 μL, 0.29 mmol) at 0° C. The mixture was stirred overnight and then vacuum-evaporated. The residue was purified by silica gel flash chromatography to give 12, which was directly used for the subsequent step without further isolation.
Compound 12 (0.17 mmol) was dissolved in a stirred solution, then tris (tetra-N-butylammonium) hydrogen pyrophosphate (220 mg, 0.53 mmol) in 1.0 ml CH3CN was slowly added. The reaction was stirred for 5 hours at room temperature under N2. The crude product was purified by HPLC and lyophilized, yielding a white solid (3.0 mg, 5% yield).
Alcohol 8: 1H NMR (400 MHz, CDCl3) δ 5.35 (m, 2H), 4.57 (s, 2H), 3.97 (s, 2H), 2.16 (dd, J=14.6, 7.0 Hz, 2H), 2.07 (t, J=5.9 Hz, 2H), 2.04 (s, 3H), 2.03 (s, 2H), 1.65 (s, 4H). 13C NMR {1H} (101 MHz, CDCl3) δ 171.21 (s), 141.73 (s), 135.22 (s), 124.74 (s), 118.47 (s), 68.31 (s), 61.33 (s), 39.01 (s), 25.63 (s), 20.88 (s), 16.21 (s), 13.54 (s). HRMS (ESI): Calculated [M+Na]=235.1310. Found [M+Na]=235.1308.
Bromide 9: 1H NMR (400 MHz, CDCl3) δ 5.56 (dd, J=12.3, 6.2 Hz, 1H), 5.33 (t, J=7.6 Hz, 1H), 4.57 (d, J=7.1 Hz, 2H), 3.95 (s, 2H), 2.18-2.08 (m, 7H), 1.74 (s, 3H), 1.69 (s, 3H). 13C NMR {1H} (101 MHz, CDCl3) δ 171.28 (s), 141.50 (s), 132.66 (s), 130.65 (s), 119.08 (s), 61.48 (s), 41.74 (s), 38.75 (s), 26.60 (s), 21.24 (s), 16.63 (s), 14.87 (s).
Azide 10: 1H NMR (400 MHz, CDCl3) δ 5.46-5.40 (m, 1H), 5.38 (m, 1H), 4.96 (dd, J=2.6, 1.3 Hz, 2H), 4.56 (d, J=7.1 Hz, 2H), 3.77 (dd, J=7.5, 5.4 Hz, 2H), 2.32-2.20 (m, 7H), 1.70 (d, J=0.8 Hz, 6H). 13C NMR {1H} (101 MHz, CDCl3) δ 171.17 (s), 142.28 (s), 130.46 (s), 129.56 (s), 114.61 (d, J=14.6 Hz), 68.80 (d, J=6.7 Hz), 62.86 (s), 59.41 (s), 33.15 (s), 25.20 (s), 20.99 (s).
Compound 11: 1H NMR (400 MHz, CDCl3) δ 5.50-5.43 (m, 1H), 5.42 (t, J=6.6 Hz, 2H), 4.98 (dd, J=3.0, 1.5 Hz, 2H), 3.81 (d, J=5.8 Hz, 2H), 2.05 (td, J=14.8, 6.9 Hz, 4H), 1.72 (d, J=0.8 Hz, 6H). 13C NMR {1H} (101 MHz, CDCl3) δ 131.25 (s), 129.97 (s), 115.13 (d, J=16.3 Hz), 69.35 (s), 61.61 (s), 59.99 (s), 40.33 (s), 37.28 (s), 33.87 (s), 30.23 (s), 29.62 (s), 25.81 (s), 19.96 (s), 18.04 (s), 15.14 (s).
White solid. Pyrophosphate Ge5: 1H NMR (400 MHz, D2O) δ 5.52-5.48 (m, 2H), 3.60-3.56 (m, 2H), 2.17 (m, 5.3 Hz, 2H), 2.11-2.04 (m, 2H), 1.68 (s, 3H), 1.65 (s, 3H). 31P NMR (162 MHz, D2O) δ −6.64 (d, J=22.7 Hz), −10.44 (d, J=22.7 Hz). MS (ESI): Calculated [M+H]+=375.2. Found [M+H]+=375.5.
2-Bromoacetyl bromide (13, 0.696 mL, 8.0 mmol), 3-azidopropan-1-amine (14, 100 mg, 1.0 mmol), and NaOH (200 mg, 5.0 mmol) were combined in a CH2Cl2/H2O (2.5 mL: 1.2 mL) solvent system and stirred overnight at room temperature. The aqueous layer was extracted with CH2Cl2, washed with 50 mM Na2CO3, dried over Na2SO4, and purified via silica gel flash chromatography, yielding compound 15 (221.1 mg, 1.0 mmol, 100% yield).
Compound 15 (50.0 mg, 0.23 mmol) was added slowly to a stirred solution containing tris (tetra-N-butylammonium) hydrogen pyrophosphate (220.1 mg, 0.53 mmol) in 1.0 mL of CH3CN. The mixture was stirred for 5 hours at room temperature under a nitrogen atmosphere. The crude product was purified by HPLC, lyophilized, and obtained as a white solid (5.1 mg, 7% yield).
Compound 15: 1H NMR (400 MHz, CDCl3) δ 4.22 (t, J=6.0 Hz, 2H), 4.12 (t, J=14.3, 7.3 Hz, 3H), 2.46-2.36 (m, 2H), 2.33-2.25 (m, 2H).
White solid. Pyrophosphate Ge6: 1H NMR (400 MHz, D2O) δ 4.88 (d, J=6.4 Hz, 2H), 3.50-3.29 (m, 2H), 1.17 (dd, J=14.9, 7.4 Hz, 2H), 1.07 (t, J=7.4 Hz, 2H). 31P NMR (162 MHz, D2O) δ −5.87 (d, J=22.7 Hz), −11.19 (d, J=22.3 Hz). MS (ESI): Calculated [M+H]+=316.1. Found [M+H]+=316.5.
Compound 8 (500 mg, 2.36 mmol) in anhydrous pyridine (6.0 ml) was treated with tert-butyl chlorodimethylsilane (423 mg, 2.81 mmol) under N2 at 0° C. for 2 hours. Next, solvents were evaporated under vacuum, and the residue was dissolved in CH2Cl2. The crude product was purified via silica gel flash chromatography using a petroleum ether-ethyl acetate (14:1) eluent, yielding compound 16 (600 mg, 2.11 mmol) with an 89% yield.
To a stirred dichloromethane (1.5 ml) solution, compound 16 (60.0 mg, 0.21 mmol), 17 (50.0 mg, 0.16 mmol), dicyclohexylcarbodiimide (80.0 mg, 0.39 mmol), and N-(4-pyridyl)-dimethylamine (10.0 mg, 0.082 mmol) were added, then the mixture was stirred overnight at room temperature under N2. Upon reaction completion, pH was adjusted to 5-6 using 0.1 M HCl, followed by washing with saturated NaHCO3 and saturated aqueous NaCl, extracting with dichloromethane, and drying over Na2SO4. The crude product was concentrated in a rotary evaporator and purified by silica gel flash chromatography with a gradient eluent of petroleum ether-ethyl acetate (from 60:1 to 30:1), yielding compound 18 (58.6 mg, 0.10 mmol) with a 64% yield.
To a stirred tetrahydrofuran (1.0 ml) solution containing compound 18 (60.0 mg, 0.10 mmol), tetrabutylammonium fluoride (0.4 ml, 0.40 mmol) was added. The mixture was stirred overnight at room temperature under a nitrogen atmosphere. Upon completion, the reaction was quenched with saturated NH4Cl and concentrated in a rotary evaporator. The crude product was extracted with CH2Cl2, dried over Na2SO4, and further concentrated. It was then purified via silica gel flash chromatography using a gradient elution of petroleum ether-ethyl acetate (30:1 to 10:1) to afford alcohol 19 (38.1 mg, 0.083 mmol) with an 83% yield.
Alcohol 19 (38 mg, 0.083 mmol) was dissolved in 1.0 ml dichloromethane and cooled to 0° C. MsCl (20.0 μl, 0.26 mmol) and Et3N (40.0 μl, 0.29 mmol) were then added. The mixture was stirred overnight and vacuum-evaporated to yield compound 20, which was directly employed in the subsequent step without further purification.
To a stirred solution of compound 20, tris (tetra-N-butylammonium) hydrogen pyrophosphate (101.1 mg, 0.24 mmol) in 1.0 ml CH3CN was added slowly. The mixture was stirred for 5 hours at room temperature under a nitrogen atmosphere. The crude product was purified by HPLC, lyophilized, and obtained as a white solid with a yield of 5% (2.5 mg).
Compound 18: 1H NMR (400 MHz, CDCl3) δ 7.68 (d, J=7.6 Hz, 1H), 7.51 (dd, J=5.7, 3.2 Hz, 1H), 7.40 (dd, J=8.9, 5.1 Hz, 2H), 7.37 (d, J=1.4 Hz, 1H), 7.35 (d, J=1.6 Hz, 1H), 7.35-7.32 (m, 1H), 5.34 (t, J=6.8 Hz, 1H), 5.26 (t, J=7.6 Hz, 1H), 5.16 (d, J=13.9 Hz, 1H), 3.99 (s, 2H), 3.66 (d, J=8.3 Hz, 1H), 2.74 (ddd, J=16.2, 8.3, 6.4 Hz, 1H), 2.63 (ddd, J=17.1, 8.3, 6.3 Hz, 1H), 2.31 (dt, J=17.2, 6.1 Hz, 1H), 2.15-2.09 (m, 2H), 1.95 (dd, J=10.2, 6.2 Hz, 2H), 1.92 (d, J=6.0 Hz, 2H), 1.73-1.69 (m, 2H), 1.67 (d, J=4.8 Hz, 2H), 1.65 (s, 3H), 0.89 (s, 8H), 0.04 (s, 6H).
Alcohol 19: 1H NMR (400 MHz, CD3OD) δ 7.54 (d, J=8.1 Hz, 1H), 7.52-7.47 (m, 1H), 7.36 (d, J=2.2 Hz, 2H), 7.29-7.19 (m, 2H), 7.15 (d, J=7.2 Hz, 1H), 7.12 (d, J=2.4 Hz, 1H), 5.25 (dd, J=10.0, 7.1 Hz, 2H), 3.80 (s, 2H), 3.66-3.56 (m, 1H), 2.41-2.17 (m, 4H), 2.00-1.88 (m, 4H), 1.86 (s, 3H), 1.57 (s, 3H), 1.54 (s, 3H).
White solid. Pyrophosphate Ge7: 1H NMR (400 MHz, CDCl3) δ 7.69 (dd, J=17.0, 8.2 Hz, 4H), 7.61-7.56 (m, 4H), 7.37 (s, 2H), 4.30 (d, J=1.5 Hz, 6H), 4.10-4.08 (m, 2H), 3.75 (s, 2H), 3.62 (s, 2H), 2.20-2.08 (m, 4H), 2.04 (s, 3H), 1.97 (s, 1H), 1.79 (s, 3H), 1.76 (s, 3H). 31P NMR (162 MHz, D2O) δ −6.61 (d, J=22.0 Hz), −10.74 (d, J=19.5 Hz). MS (ESI): Calculated [M−H]+=613.5. Found [M−H]+=613.0.
The mixture solution of nucleoside 21 (s2U, 5.0 mg, 0.019 mmol), geranyl bromide (13.3 mg, 0.057 mmol) and DIPEA (7.0 mg, 0.057 mmol) in MeOH (0.5 mL) was stirred overnight at r.t. under N2 atmosphere. The crude residue was purified by preparative thin layer chromatography and confirmed by 1H NMR and MS (ESI).
1H NMR (400 MHz, CDCl3) δ 8.23 (d, J=8.0 Hz, 1H), 6.05 (s, 1H), 6.03 (s, 1H), 5.87 (d, J=3.3 Hz, 1H), 5.58-5.57 (m, 1H), 5.51 (s, 1H), 5.06 (s, 1H), 4.86 (s, 1H), 4.37 (s, 1H), 4.07 (s, 1H), 4.06 (s, 1H), 2.00-1.99 (m, 2H), 1.99-1.97 (m, 2H), 1.75 (s, 3H), 1.74 (s, 3H). MS (ESI): Calculated [M+H]+=397.5. Found [M+H]+=398.2.
The mixture solution of nucleoside 21 (s2U, 5.0 mg, 0.0190 mmol), 5-bromo-2-methylpent-2-ene (13.3 mg, 0.0570 mmol) and DIPEA (7.0 mg, 0.057 mmol) in MeOH (0.5 mL) was stirred overnight at r.t. under N2 atmosphere. The residue was purified by preparative thin layer chromatography and confirmed by 1H NMR and MS (ESI).
1H NMR (400 MHz, CDCl3) δ 8.18 (s, 1H), 6.19 (d, J=5.3 Hz, 1H), 4.47-4.42 (m, 2H), 4.40 (t, J=3.6 Hz, 1H), 4.36 (d, J=4.2 Hz, 1H), 4.32 (d, J=6.6 Hz, 1H), 4.13-4.01 (m, 2H), 3.43 (dd, J=15.2, 11.8 Hz, 2H), 2.94-2.83 (m, 2H), 1.68 (d, J=16.1 Hz, 6H). MS (ESI): Calculated [M+Na]+=365.4. Found [M+Na]+=365.3.
To a stirred solution of nucleoside 21 (s2U, 5.0 mg, 0.0190 mmol), 5-bromo-2-methylpent-2-ene (13.3 mg, 0.0570 mmol) and DIPEA (7.00 mg, 0.057 mmol) in MeOH (0.5 mL). The mixture was stirred overnight at r.t. under N2 atmosphere. The residue was purified by preparative thin layer chromatography and confirmed by 1H NMR and MS (ESI).
1H NMR (400 MHz, CDCl3) δ 8.07 (d, J=6.9 Hz, 1H), 5.78 (d, J=3.4 Hz, 1H), 5.33 (s, 1H), 4.87 (dd, J=12.9, 2.4 Hz, 1H), 4.77 (d, J=2.7 Hz, 1H), 4.67 (dd, J=12.5, 2.9 Hz, 1H), 3.69-3.55 (m, 1H), 2.20 (dd, J=11.9, 5.1 Hz, 1H), 2.02-1.97 (m, 1H), 1.40 (d, J=4.8 Hz, 2H), 1.30 (d, J=5.3 Hz, 2H). MS (ESI): Calculated [M+Na]+=423.4. Found [M+Na]+=423.0.
To a stirred solution of nucleoside 21 (s2U, 5.0 mg, 0.019 mmol), 5-bromo-2-methylpent-2-ene (13.3 mg, 0.057 mmol) and DIPEA (7.0 mg, 0.057 mmol) in MeOH (0.5 mL). The mixture was stirred overnight at r.t. under N2 atmosphere. The residue was purified by preparative thin layer chromatography and confirmed by 1H NMR and MS (ESI).
1H NMR (400 MHz, CDCl3) δ 8.34 (d, J=8.2 Hz, 1H), 6.74 (d, J=8.4 Hz, 1H), 6.04-5.87 (m, 1H), 5.33 (s, 1H), 4.94 (dd, J=26.2, 14.0 Hz, 1H), 4.66 (d, J=12.8 Hz, 1H), 3.80-3.74 (m, 2H), 2.16 (s, 2H), 1.86 (s, 2H), 1.61 (s, 4H), 1.59 (s, 4H), 0.99 (d, J=6.6 Hz, 3H). MS (ESI): Calculated [M+H2O+H]+=417.5. Found [M+H2O+H]+=417.1.
SelU-mediated geranylation of tRNA in the fluorescent labeling investigations.
General procedure: Partially purified SelU-His6 (15 μg, ca. 0.35 nmol), 20 μg (ca. 3.74 nmol) tRNA and geranyl pyrophosphate ammonium salt (5 eq. Sigma) dissolved in 100 μl of buffer containing 10 mM Tricine-KOH, pH=7.2, 0.2 mM dithiothreitol (DTT), and 100 mM MgCl2, the sample mixture was then incubated at 25° C. for 24 h. The Ge-tRNA product was monitored by RP-HPLC using a Kinetex C18 column (5μ, 100 Å) using the gradient parameters of buffer B: 0-10 min 0% B; 10-40 min 0-35% B; 40-45 min 35-100% B; 45-50 min, 100% B; 50-55 min, 100-0% B; 55-60 min, 0% B. Buffer A: 0.1 M CH3CO2NH4; pH=6.8. Buffer B: 0.1 M CH3CO2NH4 with 40% CH3CN; the collected fraction was desalted and the geranylated product was tested by fluorescent labeling or MALDI-TOF MS.
Direct labeling of prenylated tRNA using this Ene-ligation.
Scheme S8. Direct fluorescent labeling using fluorescent probes via Ene-ligation. Specifically, for prenylated substrates such as Ge1, Ge2 and Ge3.
RNase T1 is an endoribonuclease that specifically degrades single-stranded RNA at G residues. It cleaves the phosphodiester bond between 3′-guanylic residue and the 5′-OH residues of adjacent nucleotide with the formation of corresponding intermediate 2′, 3′-cyclic phosphate. The reaction products are 3′-GMP and oligonucleotides with a terminal 3′-GMP. RNase T1 does not require metal ions for activity.
tRNAGlu from E. coli. mass analysis: Calculated MS 24441 (or 24124)+77.952+15.023+181.129 (or 225.119)−108.042−108.042+1004=24715.105/24579.233/24759.094/24442.094 (+1004). Found 25417.4251, 25724.55. It is reported that geranylated RNA in Escherichia coli, Enterobacter aerogenes, Pseudomonas aeruginosa and Salmonella enterica var. Typhimurium. Ana these geranylated nucleotides occur in the first anticodon position of tRNAGluUUC, tRNALysUUU, tRNAGlnUUG at a frequency up to 6.7% (˜400 geranylated nucleotides per cell).
Cas9 Nuclease, Streptococcus pyogenes (product #M0646), T7 Endonuclease I (product #M0302) Ribonucleotide solution mix (NTPs) and deoxy-ribonucleoside triphosphates (dNTPs) were purchased from New England Biolabs (USA). Transcript Aid T7 High Yield Transcription kit (product #K0441) and Glycogen (product #R0561) were purchased from Thermo Fisher Scientific. Pyrobest™ DNA Polymerase and PrimeSTAR HS DNA Polymerase were purchased from TaKaRa Shuzo Co. Ltd. (Tokyo, Japan). DNA Clean & Concentrator™-5 kit (product #D4014) was purchased from Zymo Research Corp. The DNeasy Blood & Tissue Kit was purchased from QIAGEN. The oligonucleotides at HPLC purity were obtained from TaKaRa Company (Dalian, China). The nucleic acid stains Super GelRed (No.: S-2001) was bought from US Everbright Inc. (Suzhou, China). DPBA (Cas #17261-28-8), TCEP (Cas #51805-45-9), 4-(2-hydroxyethyl)-piperazine-1-ethanesulfonic acid (HEPES, Cas #7365-45-9) and Thiazolyl Blue Tetrazolium Bromide (MTT, Cas #298-93-1) were purchased from Sigma-Aldrich Inc. (Shanghai, China). DPBS (Cas #63995-75-5) was purchased from TCI Development Co., Ltd (Shanghai). The concentration of DNA or RNA was quantified by NanoDrop 2000c (Thermo Scientific, USA). Gel Imaging was performed using Pharos FX Molecular imager (Bio-Rad, USA).
Expression. Strain pET28b-SelU (BL21) was cultivated in Kan (kanamycin) LB overnight at 37° C., followed by inoculation into fresh 200 mL Kan LB at 220 r.p.m./min, 37° C. for about 3 h to OD=0.6. Subsequently, the protein expression was induced by adding 1 mmol/L IPTG at 220 r.p.m./min, 28° C. for 5 h.
Purifications. The His×6 Ni Gravity Column (1.0 ml) was equilibrated and all buffers were settled to the working temperature. The column was then washed with 5-10 column volumes of His×6 Ni Equilibration Buffer (50 mM sodium phosphate, 6 M guanidine-HCl, 300 mM NaCl, 20 mM imidazole; pH=7.4). The clarified sample was added to the column and sealed with column stopper. The target protein was allowed to bind by slowly inverting the column for 1 h. The bounded protein in the resin was then settled to the bottom of the column by placed the column in a standing position. The column was washed with 10 column volumes of His×6 Ni Equilibration Buffer followed by 10 column volumes of His×6 Ni Wash Buffer (50 mM sodium phosphate, 6.0 M guanidine-HCl, 300 mM NaCl, 40 mM imidazole; pH=7.4). The target protein was eluted with approximately 10 column volumes of Elution Buffer (50 mM sodium phosphate, 6 M guanidine-HCl, 300 mM NaCl, 300 mM imidazole; pH=7.4) and collect 1 mL per tube. The sample was analyzed and identified by 5% agarose gel electrophoresis.
Measurements were carried out using 1H, 13C NMR, and 31P NMR. Chemical shifts (δ) were reported in parts per million (ppm) and referenced to the solvent signals. Data are reported as follows: chemical shift, multiplicity (s=singlet, d=doublet, t=triplet, and m=multiplet) and coupling constants (in hertz). Bruker AM-400 spectrometer (400 MHz) and a Bruker AVANCE NEO 600 (600 MHz) was used for NMR data collection and spectral interpretation.
Typical procedure: To a stirred solution of (E)-1-bromo-3,7-dimethylocta-2,6-diene (50.0 mg, 0.220 mmol) in CH3CN (1.5 ml) was slowly added tris (tetra-N-butylammonium) hydrogen pyrophosphate (200 mg, 0.264 mmol). The mixture was stirred for 5 h at room temperature under nitrogen atmosphere. The crude product was purified by HPLC and lyophilized to dryness.
Molecular Docking Study of SelU with Pyrophosphates Ge
The molecular docking studies were performed by using AutoDock Vina 1.1.2. Ref. M1 The ligand sites were detected by using AutoLigand, Ref. M2 and the ligands (pyrophosphate Ge1/Ge5/Ge6) site containing Cys97 and Gly67 was used to define the binding pocket by establishing a grid box centered on X: −26.5 Y: 2.6 Z: 7.8 Å with the dimensions of X: 26.9 Y: 26.0 Z: 19.1 Å. The magnesium ion was first docked into the binding pocket. The docked position of the magnesium ion with the lowest docked energy was selected to prepare a protein-ion complex. Then each ligand was docked into the binding pocket with the presence of the magnesium ion. The docking poses of ligands were also selected for subsequent analyses according to the lowest docked energies. Top references:
General experimental procedures for SelU-mediated geranylation Enzymatic fluorescent labelling analysis with pyrophosphates Ge1-7.
Partially purified SelU-His×6 (15 μg, ca. 0.35 nmol), 20 μg (ca. 3.74 nmol) tRNA and geranyl pyrophosphate ammonium salt (Sigma, 5 equiv.) dissolved in 100 μl of buffer containing 10 mM Tricine-KOH, pH=7.2, 0.2 mM dithiothreitol (DTT), and 100 mM MgCl2, and the sample was incubated at 25° C. for 24 h. The Ge-tRNA product was monitored by RP-HPLC using a Kinetex C18 column (5μ, 100 A, 150×4.6 mm) using the gradient parameters of Buffer B: 0-10 min 0% B; 10-40 min 0-35% B; 40-45 min 35-100% B; 45-50 min, 100% B; 50-55 min, 100-0% B; 55-60 min, 0% B. Buffer A: 0.1 M CH3CO2NH4; pH=6.8. Buffer B: 0.1 M CH3CO2NH4 with 40% CH3CN; the collected fraction was desalted and the geranylated product was tested by fluorescent labelling or MALDI-TOF MS.
With pyrophosphates Ge2, 3, 7.
RNase T1 is an endoribonuclease that specifically degrades single-stranded RNA at G residues. It cleaves the phosphodiester bond between 3′-guanylic residue and the 5′-OH residues of adjacent nucleotide with the formation of corresponding intermediate 2′, 3′-cyclic phosphate. The reaction products are 3′-GMP and oligonucleotides with a terminal 3′-GMP. RNase T1 does not require metal ions for activity. The Ge-tRNA product was hydroxylated to give nucleosides with varied modifications, which were further analyzed by AB Sciex 4500, Wuhan biological sample 4500 Q-trap (Wuhan Biobank Co., Ltd).
With pyrophosphates Ge1, 5, 6.
SelU-catalyzed tRNA geranylation with pyrophosphates Ge5-7 as substrates, the in-gel fluorescent imaging was performed therefore. In the fluorescent labelling studies when using Ge5-6 and Ge7 as substrates, DBCO-Cy5 was used for labelling of Ge5 or Ge6, while BODIPY-R6G-N3 was used for labelling of Ge7. Confocal parameters are: Blue channel (E.X. 492 nm. E.M. 507 nm), red channel (E.X. 649 nm. E.M. 660 nm) and green channel (nm). See Scheme S7 for details.
Direct fluorescent labelling of prenylated tRNA by using PTAD-DBCO-Cy5 probe.
tRNA (1.0 mg) isolated from E. coli using Zymo purification kit strictly following the instruction protocol was dissolved in PBS buffer (0.1 M, pH=7.4) and the PTAD-DBCO-Cy5 probe (1.0 mM) generated by in-situ oxidation with N-bromosuccinimide (NBS, 1 M in DMF) was added in one portion at 4° C. The mixture was incubated at 4° C. for 15 min in a sterile tube. And excess fluorescent probe was removed by 3K filter (Millipore) and the resulting nucleic acid was analyzed by in-gel fluorescence (Scheme S8).
Mass Spectra identification of prenyl-containing nucleic acid labelled with PTAD-DBCO-Cy5.
tRNAGlu from aforementioned procedure was further analyzed by MALDI-TOF/TOF mass spectra (AXIMA-PerformanceMA, Qinghua University). The details are presented below:
3-Hydroxypicolinic Acid (3-HPA, Sigma Aldrich, USA) was chosen as MALDI matrix for DNA detection. The matrix solution was prepared by dissolving 20 mg 3-HPA and 45 mg dihydrogen ammonium citrate (DHAC) in 1 mL mixture solution of 50% acetonitrile/50% water. A home-built inkjet printing device was developed to print an array of matrix droplets on sample. This device consisted of a piezoelectric inkjet print head for matrix solution ejection (Fuji Electrics Systems Co., Ltd, Japan) and an XY-motorized stage where the ITO glass was placed (MMU-30X, Chuo Precision Industrial Co., Ltd, Japan). A laboratory-made software was used to control the inkjet waveform and the movement of x-y stage. ITO glass with cells was firstly washed with 100 mM ammonium acetate solution to remove non-volatile salts which might affect ionization efficiency. After air drying, a 6×6 matrix array of circular regions ˜300 μm in diameter was printed onto the ITO glass and imaged with fluorescent microscope (DMI 4000B, Leica, Germany). For quantitative analysis, DNA internal standard and matrix solution were loaded into different channels of the inkjet printing head. The internal standard was printed prior to the matrix onto the same position.
MALDI-MS analysis was performed in an AXIMA Performance MALDI-TOF/TOF mass spectrometer (Shimadzu Co. Ltd., Japan). This instrument was equipped with a 337 nm nitrogen laser. ITO glass with cells was attached to the stainless MALDI plate by conductive tapes. Data were acquired in a linear negative mode and signals between m/z 20000-30000 were collected. Raster scans on cell surfaces were performed automatically using the mass spectrometry software (Shimadzu Biotech., Japan). The scan area was 200 μm×200 μm with the sampling distance of 50 μm. For each coordinate, mass spectra resulting from 20 laser shots at 5 Hz were accumulated to obtain an average mass spectrum.
The HEK293T cells were washed twice with PBS, and trypsin was added for 1 min. Then 2.0 mL DMEM medium was added to the mixed well, followed by centrifuged at 1,000 r.p.m for 5 min, and the supernatant was discarded. Fresh medium was added to resuspend the cell pellet and cells were seeded in a confocal dish at the density of 0.5×1 05, the cells were gently mixed using pipette, and then placed in CO2 incubator.
150 mM sterile NaCl solution was prepared as a diluent for DNA and Vigofect (Vigorous Biotechnology Beijing Co., Ltd.). The culture medium in the confocal dish was then replaced with fresh complete culture medium before transfection, and the culture was incubated at 37° C. under 5% CO2 atmosphere. The pCMV-Myc-SelU-GFP group, pCMV-Myc-SelU-5s-GFP and Ge1 or Ge5 or Ge6 group were set as control groups.
Configure the transfection working solution. First, 2.5 μg pCMV-Myc-SelU-GFP or pCMV-Myc-SelU-5s-GFP was added into the diluent (total 100 μl) and stored at room temperature for 5 min. Next, 2.5 μl VigoFect was added to the diluent (total 100 μl), and the solution was mixed gently and placed at room temperature for 5 min. Next, the diluted VigoFect was added into the diluted DNA solution gently, and the resulting transfection working solution was incubated at room temperature for 15 min. Subsequently, the working solution was added to the cell culture, mixed gently and incubated for 24 h. 24 hours post transfection, the pyrophosphates Ge1, Ge5, Ge6 (1 mM) were added into the medium for 4 h. Cells were washed with PBS 3 times and then fixed with 4% paraformaldehyde for 15 min, and subsequently permeabilized with 0.5% Triton X-100 for 15 min. Next, TMSN3 (50 μM) and Selectfluor (50 μM) were added into Ge1 group for 30 min, cells were then incubated with DBCO-Cy5 (5 μM) for 30 min and then stained with 4′, 6-diamidino-2-phenylindole (DAPI) for 15 min in darkness at room temperature (RT). Samples were washed with PBS three times and examined under a confocal microscope (LSM780, Carl Zeiss).
Amplification, Expression, and Extraction of pCMV-Myc-SelU-GFP, pCMV-Myc-SelU-5s-GFP Plasmid
pCMV-Myc-SelU-GFP or pCMV-Myc-SelU-5s-GFP plasmid was transformed into competent DH5α cells at 4° C. Followed by incubation in an ice-water bath for 30 min, heat shocked at 42° C. for 90s, then placed on ice for another 5 min. 400 μl of LB solution was added and the cells were incubated at 37° C., 220 r.p.m. for 45 min. Finally, the bacterial solution was spread onto the Kan-containing LB solid culture plate and incubated at 37° C. for 12-16 h until a single colony appeared. Next, in sterile surface, a single colony was picked and inoculated into 5 mL of LB solution (kan), incubated overnight at 37° C., 220 r.p.m., until the culture solution appeared turbid, followed by plasmid extraction.
General Procedure: In an anhydrous DMF (0.1 ml) solution of s2U (13.0 mg, 0.05 mmol), K2CO3 (0.2 mmol) was introduced at room temperature. A single portion of bromide (0.2 mmol) was then added, and the mixture was heated to 85° C. for overnight stirring. Solvents were subsequently evaporated under vacuum, and the residue was further purified by HPLC to yield the Ge-s2U product.
Co-incubation experiment of pET28b-SelU lysate and Ge4 (benzophenone pyrophosphate).
(1) Add the E. coli bacterial cells to the lysis solution (Western and IP lysis buffer, Beyotime), sonicate for 5 min (80W, work 10 seconds, stop 10 seconds), lyse until the sterile body precipitates, centrifuge at 10,000×g for 15 min, collect the supernatant, and determine the protein concentration by BCA kit.
(2) Add appropriate amount of Ge4 (benzophenone pyrophosphate) to the protein lysate and incubate overnight at 4° C. with UV (365 nm) irradiation on ice for 15 min.
(3) Take 50 μl of probe from each tube of sample and add excess NBS to oxidize (the reaction solution turns pink when upon addition), incubate the PTAD-DBCO-biotin probe with the above-mentioned lysis solution for 4 h (4° C.). Add 50 μl beads to the EP tube and place the tube on the magnetic stand, discard the supernatant, add buffer 2 to wash the beads twice, then add the lysate to the washed beads. Incubate overnight with rotation at 4° C., when complete incubation, place the tube on a magnetic stand, wash the complex with eluent 3 times, add 20 μl protein loading buffer, centrifuge at 14,000×g for 30s, denaturized at 95° C. for 5 min, centrifuge at 14000×g 1 min, then analyze the supernatant by SDS-PAGE electrophoresis (10%).
After mixing the above reagents thoroughly, cast the gel, add isopropanol to even the gel, solidify at r. t. for 30 min, then discard the isopropanol, and blot the remaining reagents with filter paper.
Stain the gel with Coomassie Brilliant Blue staining solution on a shaker for 30 min and destain with the decolorizing solution until the protein bands appear. The result is shown in the Fig. S9.
Cell-specific labeling of SelU-mediated tRNA geranylation.
Amplification, expression, and extraction of pCMV-Myc-SelU-GFP, pCMV-Myc-SelU-5s-GFP plasmid.
(1) Add 250 μl of Buffer BL to the adsorption column AC and centrifuge at 12,000×g for 1 min to activate the silica gel membrane.
(2) Take 4 ml of the overnight bacterial cultured, centrifuge at 12,000×g for 1 min, and collect the bacterial cells.
(3) Add 200 μl Buffer S1 to resuspend the bacterial pellet, vortex and shake until a sterile block is reached.
(4) Add 200 μl Buffer S2, mix well by invert the column 7 times to fully lyse the bacteria.
(5) Add 200 μl Buffer S3, mix well by invert the column 7 times. The solution should turns into white flocculent precipitate. Centrifuge at 12,000×g for 15 min.
(6) Aspirate the supernatant carefully, transfer the supernatant to the adsorption column AC, centrifuge at 12000×g for 1 min, discard the waste liquid, and put the adsorption column AC back into the empty collection tube
(7) Add 700 μl Buffer W2 to the adsorption column AC, centrifuge at 12,000×g for 1 min, and discard the waste liquid. Repeat this step.
(8) Put the adsorption column AC back into the empty collection tube and centrifuge at 120,00×g for 2 min.
(9) Place the adsorption column AC in a clean 1.5 mL centrifuge tube, let it stand at 25° C. for 2 min, add 30 μl of Eluent to the middle of the adsorption membrane, let it stand at 25° C. for 2 min, then centrifuge at 12,000×g for 2 min. Obtain the plasmid, and determine the concentration.
To test the viability of geranyl tRNA tagging, we first synthesized a series of analogues including geranyl pyrophosphate variants Ge2 and Ge3, a diphenylketone analogue Ge4, azido analogues Ge5 and Ge6, and a DBCO derivative Ge7 (
Following that, we expressed and purified SelU, the enzyme that installs the geranyl modification on tRNAs. Immunoblotting analysis confirmed the successful expression of SelU-His6 with the His-tag used for the affinity column purification (
With these ‘clickable’ tags being recognized by SelU, we further conducted the fluorescent labelling of tRNAGln, Glu, Lys in the presence of the ligands Ge5, Ge6, Ge7 and the purified SelU (the detailed processes are shown in Scheme S7,
This two-step indirect ACT-Flu labelling has great application potential in targeting lipidated RNAs in both healthy and disease eukaryotic cells. Recently, it was shown that the levels of thiolated wobble uridines change in response to hydroxyurea in budding yeasts.4 Similarly, it was discovered that paromomycin therapy altered the amounts of thiolated-tRNA in human colorectal cancer cells. In addition, the human U34 writers related to mcm5s2U have also been shown to confer resistance to targeted therapy in melanomas.43 As a result, our labelling approach could be used to research and track the quantities of these thiolated tRNAs under various cellular conditions and environmental stresses. In summary, these experiments allowed us to identify two pyrophosphate analogues, Ge5 and Ge6, which are suitable for SelU-mediated tRNA geranylation and fluorescent labelling. This two-step process, azidation and click tagging of fluorescent dye (ACT-Flu), may be very effective for in-vitro tRNA labelling and detecting.
We also accessed the feasibility of a direct labelling of prenyl-containing RNAs. Since some bacterial tRNAs contains geranyl groups on the wobble position of 34U, we investigated if a 4-phenyl-1,2,4-triazoline-3,5-dione (PTAD) activated fluorescence dye molecule could be directly attached to these terpene terminals through a known Ene-reaction, where PTAD reacts with conjugated diene group35-37 (
Next, we constructed the SelU-pCMV-Myc-EGFP (5593 bp), which contains E. coli SelU tagged with EGFP, and the 5s-pCMV3 (6333 bp, which contains 5S rRNA) plasmids in order to further apply our methodologies to in-cell tRNA labelling. We then tested the in vivo fluorescent labeling of geranylated tRNAs with pyrophosphates Ge1 coupled with the direct labeling (
In conclusion, we present two chemical labelling strategies for lipid-modified RNAs by taking advantage of the natural SelU-mediated tRNA geranylation process and the unique chemical reactivity of prenyl-groups. We synthesized a series of ‘clickable’ geranyl pyrophosphate analogues and discovered two candidates Ge5 and Ge6 feasible for indirect in-cell RNA labelling via a two-step process, azidation-and-click tagging of fluorescent dye, namely the ACT-Flu technique. Fluorescent microscopy and molecular simulation studies both confirmed that the two ligands could serve as ideal SelU substrates in the tRNA geranylation process, thereby providing a new toolset for detecting and monitoring the 2-thiouridine residue, which is an important biomarker of bacterial infections and cancers. In addition, we developed a direct metabolic incorporation and biorthogonal tagging (MIBT-Tag) method using the PTAD-based Ene-ligation chemistry of the prenyl group. Both approaches have been applied successfully for in-cell RNA fluorescent labelling. The biological importance of prenylation process has been increasingly appreciated with recent studies of OAS1, the double stranded RNA sensor that activates RNase L, indicating that the OAS1 mediated prenylation44, 45 process protects patients from severe illness infected by SARS-CoV-2 virus. The biochemical toolsets developed in our work have significance in vivo application potential. They will not only provide insights into the roles of prenylation modifications in gene regulation and RNA biology, but will also have applications as biomarkers specifically for prenylation levels in both healthy and diseased cells.
All chemicals were purchased from Sigma-Aldrich, Acros, Innochem, Macklin Inc, Energy Chemical and were used without further purification. Extra dry solvents, such as 1,4-dioxane, DMF, and THF, were obtained from Innochem in sealed bottles over 3 or 4 Å molecular sieves and stored under dried nitrogen. Organic solvents were obtained from Sinopharm Chemical Reagent Co., Ltd (Shanghai, China) and used in reactions, column chromatography and recrystallizations. Milli-Q ultrapure water (resistivity, 18 m (2) purified through Millipore Milli-Q Advantage A1 purification system was used for all bioconjugation reactions. The reactions were monitored by thin-layer chromatography (TLC) analysis using silica gel (60-Å pore size, F254, Yantai Chemical Industry Research Institute) plates. Compounds were visualized by UV irradiation (λ=254 nm) and/or spraying TLC stain such as a KMnO4 solution followed by electronic heating. Flash chromatography columns were performed on silica gel (60-Å pore size, 230-400 mesh). HPLC purification were performed using EasyChrom-1000 system with NU3000 serials UV/Vis. detector (Hanbon Sci. & Tech., Jiangsu, China) using an Ultimate® XB-C18 column, 21.2× 250 mm 5 micron (Welch Materials Inc., Shanghai, China). Separation was achieved by gradient elution from 5% to 70% acetonitrile in water (constant 0.1% formic acid) over 20 min, isocratic elution with 70% acetonitrile from 20 to 25 min, and returned to initial conditions and equilibrated for 5 min. The LC chromatograms were recorded by monitoring absorption at 254 nm and 220 nm. 1H and 13C NMR spectra were recorded at room temperature on a Bruker spectrometer (AM-600 or AM-400) operating at 600/400 and 150/100 MHz, respectively. Chemical shifts are given in parts per million, and 1H and 13C {1H} NMR spectra were referenced using the solvent signal as an internal standard. The following abbreviations are used for the proton spectra multiplicities: s: singlet, d: doublet, t: triplet, m: multiplet, br: broad. Coupling constants (J) are reported in Hertz (Hz). HRMS (TOF) were obtained from the Bruker Micro TOF II Spectrometer using Electro Spray Ionization (ESI). MS (ESI) was obtained from the Expression L (Beijing Bohui Innovation Biotechnology Co., Ltd). LC-MS analysis was obtained from the Bruker Orbitrap LC/MS (Q Exactive™) at Huazhong University of Science and Technology Analytical and Testing Center. UV-visible absorbance measurements were performed UV-visible spectrometer (Lambda365, PerkinElmer, German). Confocal Imaging was performed using a LSM 780 confocal microscope (Zeiss) with a 20× objective at 16-bit depth under non-saturating conditions. EGFP were imaged with a 480 nm (excitation) and a 510 nm (emission) and false-colored green.
To a stirred solution of prenyl alcohol (8.6 mg, 0.1 mmol) was added 4-phenyl-3H-1, 2, 4-triazole-3, 5 (4H)-dione (PTAD, 17 mg, 0.1 mmol) in acetonitrile (0.6 mL). The mixture continued for 12 hours. Solvent was removed and the crude product was purified by flash chromatography (eluent: PE:ethyl acetate=10:1) to afford yellowish oil product A1 (20 mg, 0.077 mmol) in 82% yield.
1H NMR (400 MHz, CDCl3) δ 7.50-7.25 (m, 5H), 5.02 (s, 1H), 4.92 (s, 1H), 4.64-4.61 (m, 1H), 4.00-3.95 (m, 1H), 3.88-3.84 (m, 1H), 1.75 (s, 3H). MS (ESI) Calculated for [M+H]+=262.1. found 262.0.
A mixture of adenosine (2.0 g, 7.48 mmol), pyridine (15 mL), and Ac2O (7 mL, 74.2 mmol) was stirred at room temperature overnight. The resulting clear solution was refluxed at 60° C. overnight. The reaction was cooled down and quenched by EtOH. Then the reaction mixture was co-evaporated with addition of excess of EtOH to remove pyridine completely. The resultant foam was dissolved in MeOH (20 mL) and imidazole (0.4 g, 5.88 mmol) was added and the solution was stirred at room temperature. After 8 hours, the solution was diluted with ethyl acetate (50 mL) and washed by brine (5×50 mL). Then the organic solvent was dried over Na2SO4, filtered and concentrated. The residue was purified by silica gel (CH2Cl2:MeOH=30:1) to give B1 (1.48 g, 3.4 mmol, 46%) as yellowish solid. 1H NMR (400 MHz, CDCl3) δ 9.16 (s, 1H), 8.70 (s, 1H), 8.23 (s, 1H), 6.23 (d, J=4.0 Hz, 1H), 5.97 (t, J=8.0 Hz, 1H), 5.68 (t, J=12.0 Hz, 1H), 4.48-4.44 (m, 2H), 4.41-4.36 (m, 1H), 2.63 (s, 3H), 2.16 (s, 3H), 2.12 (s, 3H), 2.09 (s, 3H). 13C NMR (101 MHz, CDCl3) δ 170.80, 170.36, 169.60, 169.39, 152.66, 151.04, 149.52, 141.47, 122.23, 86.45, 80.40, 73.09, 70.61, 63.03, 25.75, 20.75, 20.54, 20.39. MS (ESI) Calculated for [M+Na]+=458.1. found 458.0.
A mixture of B1 (1.3 g, 3 mmol), triphenylphosphine (PPh3, 1.2 g, 4.5 mmol), and 3-methyl-2-buten-1-ol (387 mg, 4.5 mmol) in THF (5.0 mL) was stirred at r.t. until a homogeneous solution was formed. Di-isopropyl azodicarboxylate (DIAD, 0.9 g, 4.5 mmol) was added in one portion. The reaction was monitored by TLC. A second addition of triphenylphosphine (4.5 mmol), alcohol and DIAD was made to achieve complete conversion of starting material B1 after 20 hours. After 5 hours the mixture was evaporated and the residue was purified by column chromatography (CH2Cl2:MeOH=50:1) to give crude product B2 (2.33 g, 4.63 mmol) as white solid, which was taken to the next step directly
Compound B2 (2.33 g, 4.63 mmol) was dissolved in 7.0 M NH3 in MeOH solution (1.0 mL, 7.0 mmol) and the solution was stirred for 48 hours. The volatiles were evaporated under vacuo and the residue was purified by silica gel chromatography (PE:ethyl acetate=1:1) to give nucleoside i6A (4, 800 mg, 2.38 mmol, 80% over 2 steps) as white solid.
1H NMR (400 MHz, DMSO-d6) δ 8.35 (s, 1H), 8.22 (s, 1H), 7.91 (s, 1H), 5.89 (d, J=4.0 Hz, 1H), 5.45-5.42 (m, 2H), 5.31 (t, J=16.0 Hz, 1H), 5.19 (d, J=8.0 Hz, 1H), 4.62 (q, J=16.0 Hz, 1H), 4.16 (q, J=12.0 Hz, 1H), 3.98 (q, J=8.0 Hz, 1H), 3.71-3.66 (m, 1H), 3.59-3.53 (m, 1H), 1.69 (d, J=12.0 Hz, 6H). 13C NMR (101 MHz, DMSO-d6) δ 172.17, 154.80, 152.79, 140.14, 133.81, 122.43, 120.29, 88.45, 86.38, 73.97, 71.13, 62.14, 38.13, 25.84, 22.91, 18.29. HRMS (ESI/Q-TOF): m/z: [M+H]+ Calculated for C15H22N5O4 336.1672. Found 336.1690.
A solution of compound C1 (10 mg, 0.04 mmol) and FITC-DBCO (C2, 5 mg, 0.01 mmol) in dry DMF (1.0 mL) under nitrogen was stirred at room temperature overnight. The reaction mixture was purified by RP-HPLC (separation was achieved using an Ultimate XB-C18 column, 21.2× 250 mm 5 micron (Welch Materials Inc., Shanghai, China) by gradient elution from 5% to 70% acetonitrile in water (constant 0.1% formic acid) over 15 min, isocratic elution with 70% acetonitrile from 15 to 30 minutes, and returned to initial conditions and equilibrated for 5 minutes to give PTAD-DBCO-FITC (7, 1.0 mg, 0.001 mmol, 11%).
1H NMR (400 MHz, methanol-d4) δ 8.23 (s, 1H), 7.99 (d, J=8.0 Hz, 1H), 7.54 (d, J=8.0 Hz, 2H), 7.45 (s, 2H), 7.29 (d, J=4.0 Hz, 1H), 7.25-7.23 (m, 3H), 7.29 (t, J=8.0 Hz, 2H), 7.15 (d, J=8.0 Hz, 1H), 6.95 (d, J=8.0 Hz, 1H), 6.86 (d, J=8.0 Hz, 1H), 6.60 (s, 2H), 6.51 (q, J=12.0 Hz, 2H), 6.46-6.42 (m, 2H), 5.94 (d, J=20.0 Hz, 1H), 4.42 (t, J=16.0 Hz, 1H), 4.36 (t, J=8.0 Hz, 1H), 3.40-3.30 (m, 2H), 2.24-2.08 (m, 2H), 1.96-1.74 (m, 2H). MS (ESI) Calculated for [M+H]+=897.3. found 897.2. HRMS (ESI/Q-TOF): m/z: [M+H]+ Calculated for C49H37N8O10 897.2633. found 897.2616; HRMS (ESI/Q-TOF): m/z: [M+Na]+ Calculated for C49H36N8NaO10 919.2452. found 919.2436. PTAD-DBCO-FITC (8) was obtained by in situ oxidation using NBS (DMF solution) at room temperature for 5 minutes.
A solution of compound C1 (1.7 mg, 0.0064 mmol) and Cy5-DBCO (C5, 2.5 mg, 0.0032 mmol) in dry CH3CN (1.0 mL) under nitrogen was stirred at room temperature overnight. The reaction mixture was purified by RP-HPLC (Separation was achieved using a Ultimate XB-C18 column, 21.2×250 mm 5 micron (Welch Materials Inc., Shanghai, China) by gradient elution from 5% to 70% acetonitrile in water (constant 0.1% formic acid) over 15 minutes, isocratic elution with 70% acetonitrile from 15 to 30 minutes, and returned to initial conditions and equilibrated for 5 minutes) to give compound 9 (3.0 mg, 0.0028 mmol, 90%) as blue solid, which was dissolved in anhydrous DMF (0.14 mL) and was treated equal equivalent N-bromo succinimide (NBS) solution (20 mM in DMF) to prepare PTAD-DBCO-Cy5 stock solution (10, 10 mM in DMF, stored in −20° C.). MS (ESI) Calculated for [M−H]−=1035.5. found 1035.8. HRMS (ESI/Q-TOF): m/z: [M-Cl]+, Calculated for C60H63N10O5+ 1003.4977. found 1003.4964 {for PTAD-DBCO-Cy5 precursor (9)}.
Reaction of i6A (4) with PTAD (5).
To a solution of nucleoside i6A (4, 33 mg, 0.1 mmol) in acetonitrile (1.0 mL) was added 4-phenyl-1, 2, 4-triazoline-3, 5-dione (5, PTAD, 34 mg, 0.2 mmol) and stirred at room temperature until the red color of PTAD disappeared. Then the organic solvents were removed. The residue was purified by silica gel column chromatography (CH2Cl2:MeOH=10:1) to afford adduct 6 (20 mg, 0.039 mmol, 39%) as white solid.
1H NMR (400 MHz, methanol-d4) δ 8.26 (t, J=12.0 Hz, 2H), 7.51 (d, J=4.0 Hz, 1H), 7.42-7.33 (m, 4H), 7.24-7.19 (m, 2H), 5.95 (q, J=8.0 Hz, 1H), 5.18 (d, J=12.0 Hz, 2H), 5.07 (t, J=16.0 Hz, 1H), 4.76-4.70 (m, 1H), 4.32 (q, J=8.0 Hz, 1H), 4.18-4.11 (m, 3H), 3.88 (q, J=16.0 Hz, 1H), 3.76-3.72 (m, 1H), 1.90 (s, 3H). 13C NMR (151 MHz, methanol-d4) δ 154.92, 153.72, 152.02, 140.41, 131.48, 128.75, 128.59, 128.58, 127.86, 126.03, 124.44, 120.04, 113.87, 89.95 (d, J=21.1 Hz), 86.73, 74.06 (d, J=27.2 Hz), 71.29 (d, J=18.1 Hz), 62.10 (d, J=15.1 Hz), 59.06, 48.18, 20.15 (d, J=4.5 Hz). MS (ESI) Calculated for [M+Na]+=533.2. found=533.0. HRMS (ESI/Q-TOF): m/z: [M+H]+, Calculated for C23H27N8O6 511.2054. found 511.2041.
Dynamic Investigations of i6A (4) with PTAD (5).
Nucleoside i6A (4, 10 mg, 0.03 mmol) were dissolved in CD3CN/D2O (1.0 mL, v/v=1:1, final concentration=30 mM). PTAD (5, 26 mg, 0.15 mmol) was then added to the solution (final concentration=150 mM), and the reaction mixture was allowed to stand at room temperature. The 1H NMR spectra were recorded by proton NMR (400 MHz).
Nucleoside i6A (4, 35 mg, 0.1 mmol) and proton sponge (58 mg, 0.25 mmol) were dissolved in trimethyl phosphate (0.7 mL) and was placed under the ice-water bath. Phosphorus oxychloride (POCl3, 33.6 μL, 0.36 mmol) was slowly added and stirred at 0° C. for 12 hours. A solution of tributylamine (175 μL, 0.72 mmol) and tributyl ammonium pyrophosphate (642 mg) in DMF (2.0 mL) was slowly added and the reaction mixture was stirred at 0° C. for 30 minutes. The reaction was quenched by addition of 1.0 M aqueous triethylammonium bicarbonate (TEAB, pH=7.5, 15 mL). The mixture was diluted with H2O (5.0 mL) and subjected to HPLC purification. Separation was achieved using an Ultimate XB-C18 column, 21.2×250 mm 5 micron by gradient elution from 5% to 50% acetonitrile in water (constant 0.1% formic acid) over 25 minutes, isocratic elution with 50% acetonitrile from 25 to 30 minutes, and returned to initial conditions and equilibrated for 5 minutes to give i6ATP (10.0 mg, as tributyl ammonium salt).
1H NMR (400 MHz, methanol-d4) δ 8.40 (s, 2H), 8.30 (s, 1H), 6.13 (d, J=4.0 Hz, 1H), 5.40 (t, J=12.0 Hz, 2H), 4.73 (q, J=16.0 Hz, 1H), 4.57-4.43 (m, 1H), 4.27-4.26 (m, 2H), 4.20-4.13 (m, 2H). 31P NMR (162 MHz, methanol-d4) δ −10.47 (d, J=22.68 Hz, α-P), −11.46 (d, J=22.68 Hz, γ-P), −24.00 (t, J=22.68 Hz, β-P). MS (ESI) Calculated for [M−H]−=574.1. found 574.2.
7.1 Test the Ability of T7 RNA Polymerase to Recognize Modified Nucleotides (i6ATP) at Specific Sites.
In view of the high specificity of T7 RNA polymerase, T7 promoter as a strong promoter can efficiently guide the expression of downstream genes. Here, the T7 promoter sequence is used to guide the transcription of EGFP in vitro.
EGFP gene sequence was amplified from pEGFP-C1 plasmids. Then T7 promoter sequence was linked to the 5′-end by fusion PCR. The primers are summarized as follow:
Using the T7P-EGFP constructed above as a template and rNTP as substrates, the reaction was carried out at 37° C. for 2 hours which was catalyzed by T7 RNA polymerase. After the reaction, the template DNA T7P-EGFP was degraded with DNase I. Finally, the RNA was precipitated and purified, and the transcription effect of the RNA was detected by agarose gel electrophoresis. The T7P-EGFP template sequence and HeLa genomic RNA were used as the control group.
The results show that through the above T7 RNA polymerase transcription system in vitro, the transcription of the target gene has been successfully achieved.
Firstly, the EGFP gene sequence containing the T7 promoter was obtained by PCR amplification, and then the T7P-EGFP gene sequence was used as a template to perform in vitro reverse transcription by T7 RNA polymerase. The HeLa cell genome was used as a control template, and after the transcription was completed, it was detected by agarose gel electrophoresis. Based on the above assays, it has been demonstrated that using T7P-EGFP as a template the in vitro transcription of EGFP RNA sequence.
7.2 Optimization of the Labeling Conditions and Labeling Ability Detection of T7 RNA Polymerase-Promoted Specific RNA Synthesis Using i6ATP as the Substrate.
Preliminarily experiments have unveiled that the recognition efficiency of T7 RNA polymerase for i6ATP is significantly higher than that of K4. Next, i6ATP will be used as a substrate to optimize the labeling conditions when i6ATP is incorporated on the target RNA using the T7 RNA polymerase transcription system, and further test the labeling efficiency.
Step 1. Detection of Fidelity of Chimeric T7P-EGFP Towards i6ATP Substrate.
EGFP-RNA was transcript with T7 transcription kit (Takara) in vitro. The system was kept at 37° C. for 2 hours. 1-4 extended to 6 hours. Then the templets were digested with DNase 1, 37° C. for 30 minutes. After that, the RNA was precipitated with 75% alcohol at −20° C. overnight. Then the RNA was washed with 75% alcohol and re-dissolved with RNase free DEPC water (20 μl) and detected by 1% agarose gel electrophoresis. RNA concentration was determined with a Nanodrop One microvolume UV-Vis. spectrophotometer.
As depicted, it is demonstrated that T7 RNA polymerase could recognize i6ATP, although the recognition efficiency is low comparing with rATP.
The construction systems are shown below:
Then the templets were digested with DNase I, 37° C. for 30 minutes. Subsequently, the RNA was precipitated with 75% alcohol at −20° C. overnight. Then the RNA was washed with 75% alcohol and re-dissolved with DEPC water (20 μl). The RNA will be directly used for the next step.
Step 2. Optimization of the Transcription Efficiency with i6ATP.
In order to improve the transcription efficiency of T7P-EGFP under the condition of i6ATP addition, the above conditions were optimized.
The results showed that the transcription efficiency of T7P-EGFP is low in the presence of low concentration i6ATP in experimental group 1, which is consistent with the previous results. When the same concentration of adenosine triphosphate (ATP) is added, the transcription efficiency of T7P-EGFP is significantly improved, and the transcription efficiency of T7P-EGFP decreases with the increase of the concentration of i6ATP, but it is not obvious.
7.3 Detection of the Labeling Efficiency Using Optimized Condition with i6ATP Targeting mRNA of T7P-EGFP.
Through the above experiments, it has been demonstrated that i6ATP can be recognized by T7 RNA polymerase, and subsequently the in vitro transcription efficiency of EGFP mRNA in the presence of i6ATP has been optimized. Next, the efficiency of i6ATP transcription into EGFP needs to be further verified, and the prenyl group on i6A (4) is specifically labeled by the well-designed fluorescent probe PTAD-DBCO-FITC (8).
The experimental results showed that the control group could not detect the fluorescence signal of PTAD-DBCO-FITC (8), and with the increase of i6ATP concentration, the fluorescence intensity of RNA band gradually increased, which indicated that the content of i6A (4) in RNA gradually increased. The labeling efficiency of RNA gradually increases.
In order to realize the detection of RNA labeling efficiency by use of i6A (4) in eukaryotic cells, the eukaryotic expression system of T7 RNA polymerase was first constructed. By detecting the insertion level of i6A (4) in the downstream target gene mRNA guided by the T7 promoter, the target RNA can be labeled by the addition of i6A (4) in eukaryotic cells. If T7 RNA polymerase in eukaryotic cells can recognize i6A (4), the change in the transcription profile of the cell at a specific time point can be detected by controlling the detection time when i6A (4) is added.
At first stage, detection of the recognition of i6A (4) by T7 RNA polymerase in the eukaryotic cells and the RNA labeling efficiency.
In order to realize the simultaneous expression of T7 RNA polymerase and its target gene in the same cells, the T7 RNA polymerase and EGFP gene expression system were constructed into the same eukaryotic expression system. As shown below, T7 RNA polymerase is transcribed and expressed by the strong eukaryotic promoter CMV, and the EGFP gene sequence is guided by the T7 promoter. In addition, because T7 RNA polymerase transcription produces mRNA lacking the ribosome recognition sequence in eukaryotic cells, the translation process cannot be realized in eukaryotic cells. Therefore, the 5′-end of the EGFP gene is connected to an IRES2 sequence, and the IRES2 transcription sequence can guide the ribosome to downstream genes. Translation process to realize the expression process of EGFP in eukaryotic cells such as tumor cells. In addition, in order to detect the expression of EGFP and T7 RNA polymerase and the relationship between the two, respectively, an expression system for EGFP and T7 RNA polymerase was constructed.
Next, the T7 RNA polymerase constructed above and the expression level of the EGFP eukaryotic expression system mediated by T7 RNA polymerase in tumor cells were further tested. To this end, the above system is transfected into HeLa cells through a cation-mediated transfection reagent. After 36 hours of transfection, the expression of green fluorescent protein was first detected by fluorescence microscope. Then the cells were lysed by RIPA, the total cell protein was extracted, and the expression of T7 RNA polymerase and EGFP were detected by Western-blotting.
The results showed that T7 RNA polymerase successfully achieved expression in Hela cells. Moreover, the expression of EGFP is T7 RNA polymerase dependent (
In the above experiment, the protein expression of T7 RNA polymerase and EGFP has been detected by Western-blotting. Next, the expression of EGFP was further detected by fluorescence microscope. As in the previous step, the (MV-T7 RNA polymerase and T7 RNA polymerase-IRES2-EGFP transcription systems were transfected into HeLa cells, and the two systems were transfected together. In addition, the (MV-T7 RNA polymerase and T7 RNA polymerase-IRES2-EGFP integration systems were separately transfected into an experimental group. 36 hours after transfection, the expression of EGFP fluorescent protein was detected by fluorescence microscope. The experimental results are consistent with the western-blot detection results, that is, the expression of EGFP is T7 polymerase-dependent. No matter in the (MV-T7 RNA polymerase and T7 RNA polymerase-IRES2-EGFP single transfection group or the two integrations group, only two systems existed concurrently, the expression of green fluorescent protein can only be detected.
The above results demonstrated that the system constructed herein has successfully achieved the T7 RNA polymerase specifically directing the transcription of target genes in eukaryotic cells.
7.6 Detection of i6A-Incorporated RNA Labeling Ability in Eukaryotic Cells.
Through the above experiments, we have successfully constructed a T7 RNA polymerase recognition system for i6A (4), and realized that i6A (4) participates in the transcription of the target gene. In addition, a T7 RNA polymerase-guided eukaryotic transcription system was constructed to realize the eukaryotic transcription process of target genes guided by T7 RNA polymerase. Next, we will further test whether i6A (4) can be recognized and participate in the transcription process in eukaryotic cells.
In order to better detect the involvement of i6A (4) in the transcription process of cellular RNA, we also tested the effect of i6A (4) on cytotoxicity and optimized the concentration of i6A (4) to treat cells.
The experimental results show that the i6A (4) concentration within 400 μM has little effect on cell viability. When the concentration reaches 800 μM, cell viability will be significantly inhibited. When the concentration reaches 2.0 mM, i6A (4) has a significant inhibitory effect on cells.
Next, we will further test whether i6A (4) can participate in the RNA transcription process in eukaryotic cells. First culture the HeLa cells in a six-well plate to 70%-80%. Then add different concentrations of i6A (4) to DMEM medium (100 μM, 200 μM and 400 μM of i6A). After 12 hours of culture, collect all cells and extract total RNA.
RNA (i6A-incorporated) labelled by fluorescein PTAD-DBCO-FITC (8).
To the 1.5 mL Eppendorf tube containing PTAD-DBCO-FITC precursor (7, 10 μL, 10 mM solution in DMF) was added N-bromo succinimide (1.0 μL, 100 mM in DMF). The mixture was vortexed gently and formation of the light-red color was observed. The reagent was kept on ice and used for the i6A-incorporated RNA labelling immediately. Labeling protocol is the same stated above.
After labeling process, remove excess PTAD-DBCO-FITC (11) dye to determine whether i6A (4) can be transcribed into RNA efficiently.
The analysis of the results showed that under the conditions of GoldView™ nucleic acid dye staining, it was detected that the total RNA image in the cell with addition of i6A (4) was better than that of the wild type. In addition, its 5s, 18s RNA bands moved up significantly compared with the wild type, while the 28s band did not change significantly.
Through PTAD-DBCO-FITC (8) dye staining, it was found that no fluorescence signal was detected in the control group, but significant fluorescence signal was detected in the total RNA of the cells in the presence of i6A (4), and the addition of i6A (4, 800 μM) resulted in a weaker RNA fluorescence signal, which may be due to i6A (4, 800 μM) has certain cytotoxicity to cells. Through this experiment, we have demonstrated that i6A (4) can be recognized in eukaryotic cells and participate in the RNA transcription process.
7.7 Labeling of i6A-Incorporated RNA In Vivo.
HeLa cells was obtained from ATCC. HeLa cells were cultured in DMED medium with 10% fetal calf serum. Cells were maintained in a humidified incubator at 5% CO2 and 37° C. To label RNA in vivo, HeLa cells were cultured in 6-well plates to 70%-80%. Then different concentration i6A (4) was added to DMEM medium (i6A: 100 μM, 200 μM, 400 μM). After 12 hours, all of the cells were collected and total RNA was extracted with total RNA extraction kit (Takara). The RNA was further labeled with PTAD-DBCO-FITC (8) as described. Then it will be detected by agarose gel electrophoresis.
To the 1.5 mL Eppendorf tube containing PTAD-DBCO-Cy5 precursor (9, 10 μL, 10 mM solution in DMF) was added N-bromo succinimide (1.0 μL, 100 mM in DMF). The mixture was vortexed gently and formation of the light-red color was observed. The reagent was kept on ice and used for the i6A-incorporated RNA labelling immediately. The i6A-incorporated total RNA (10 μl, 20 ng/μl in DEPC water) was incubated with PTAD-DBCO-Cy5 (10, 10 μl, 0.1 mM), shaken for 30 minutes at 0° C. RNA was analyzed by gel electrophoresis on 1% agarose. The electrophoresis conditions were as follows: power supply (DYY-6C power supply, Liuyi Biotechnology) was set to 130 V. 1% agarose gels were run at room temperature (25° C.) for 20 minutes and stained with or without GoodView™ (SBS Genentech, Beijing, China). A DNA size marker (1 kb DNA Ladder, TsingKe Biotech, TSJ102, Beijing, China) was used. Gels were analyzed by in-gel fluorescence measurements on a FluorChem® FC3 imager (Alpha Innotech). Fluorescence was measured with a blue light excitation wavelength (475 nm) and a green filter emission (537 nm).
Model reactions of i6A nucleoside (4). The model reactions were carried out in DMSO-d6 in order to facilitate in situ NMR characterization without further purification. i6A (4, 0.15 mmol) was dissolved in DMSO-d6 (0.5 mL), and iodine (0.45 mmol) was added. The mixture was stirred at 37° C. for 5 minutes and was taken out for 1H NMR.
The nucleoside i6A (4, 0.15 mmol) was dissolved in DMSO (1.0 mL), and iodine (0.45 mmol) was added. The mixture was stirred at 37° C. for 1 hour. Afterwards, saturated Na2S2O3 in H2O was titrated into the solution in order to remove the excess iodine, and then Na2CO3 in H2O was added. The resultant mixture was further stirred at 37° C. for 1 hour and then subjected to HPLC. Separation was achieved using an Ultimate XB-C18 column, 4.6×150 mm 5 micron (Welch Materials Inc., Shanghai, China) by gradient elution at 0.5 mL/min from 5% to 90% acetonitrile in water (constant 0.1% formic acid) over 15 minutes, isocratic elution with 90% acetonitrile from 15 to 18 minutes, and returned to initial conditions and equilibrated for 5 minutes).
9.4 DFT Calculations of the Iodocyclisation Reaction of i6A (4) with Iodine.
Gene sequences used herein.
Templated EGPF sequence used here is shown in
Mutant results for mRNA incorporating i6A (templated with EGPF) is shown in
General information. All chemicals were purchased from Sigma-Aldrich, Acros, Inno-chem, Macklin Inc, Energy Chemical and were used without further purification. Extra dry solvents, such as 1,4-dioxane, DMF, and THF, were obtained from Inno-chem in sealed bottles over 3 or 4 Å molecular sieves and stored under dried nitrogen. Organic solvents were obtained from Sinopharm Chemical Reagent Co., Ltd (Shanghai, China) and used in reactions, column chromatography and recrystallizations. Milli-Q ultrapure water (resistivity, 18 m (2) purified through Millipore Milli-Q Advantage A1 purification system was used for all bioconjugation reactions. The reactions were monitored by thin-layer chromatography (TLC) analysis using silica gel (60-Å pore size, F254, Yantai Chemical Industry Research Institute) plates. Compounds were visualized by UV irradiation (λ=254 nm) and/or spraying TLC stain such as a KMnO4 solution followed by electronic heating. Flash chromatography columns were performed on silica gel (60-Å pore size, 230-400 mesh). HPLC purification were performed using EasyChrom-1000 system with NU3000 serials UV/Vis. detector (Hanbon Sci. & Tech., Jiangsu, China) using an Ultimate® XB-C18 column, 21.2×250 mm 5 micron (Welch Materials Inc., Shanghai, China). Separation was achieved by gradient elution from 5% to 70% acetonitrile in water (constant 0.1% formic acid) over 20 minutes, isocratic elution with 70% acetonitrile from 20 to 25 minutes, and returned to initial conditions and equilibrated for 5 minutes. The LC chromatograms were recorded by monitoring absorption at 254 nm and 220 nm. 1H and 13C NMR spectra were recorded at room temperature on a Bruker spectrometer (AM-600 or AM-400) operating at 600/400 and 150/100 MHz, respectively. Chemical shifts are given in parts per million, and 1H and 13C NMR spectra were referenced using the solvent signal as an internal standard. The following abbreviations are used for the proton spectra multiplicities: s: singlet, d: doublet, t: triplet, m: multiplet, br: broad. Coupling constants (J) are reported in Hertz (Hz). HRMS (TOF) were obtained from the Bruker Micro TOF II Spectrometer using Electro Spray Ionization (ESI). Advion MS (ESI) were obtained from the Expression L (Beijing Bohui Innovation Biotechnology Co., Ltd). LC-MS analysis was obtained from the Bruker Orbitrap LC/MS (Q Exactive™) at Huazhong University of Science and Technology Analytical and Testing Center. UV-visible absorbance measurements were performed with or using UV-visible spectrometer (Lambda365, PerkinElmer, German). Confocal Imaging was performed using a LSM 780 confocal microscope (Zeiss) with a 20× objective at 16-bit depth under non-saturating conditions. EGFP were imaged with a 480 nm (excitation) and a 510 nm (emission) and false-colored green.
Materials. Streptococcus pyogenes (product #M0646), T7 Endonuclease I (product #M0302), Ribonucleotide solution mix (NTPs) and deoxy-ribonucleoside triphosphates (dNTPs) were purchased from New England Biolabs (USA). Transcript Aid T7 High Yield Transcription kit (product #K0441) and Glycogen (product #R0561) were purchased from Thermo Fisher Scientific. Pyrobest™ DNA Polymerase and PrimeSTAR HS DNA Polymerase were purchased from TaKaRa Shuzo Co. Ltd. (Tokyo, Japan). DNA Clean & Concentrator™-5 kit (product #D4014) was purchased from Zymo Research Corp. The DNeasy Blood & Tissue Kit was purchased from QIAGEN. The oligonucleotides at HPLC purity were obtained from TaKaRa company (Dalian, China). The nucleic acid stains Super GelRed (NO.: S-2001) was bought from US Everbright Inc. (Suzhou, China). Thiazolyl Blue Tetrazolium Bromide (MTT, CAS #298-93-1) were purchased from Sigma-Aldrich Inc. (Shanghai, China). DPBS (CAS #63995-75-5) was purchased from TCI (Shanghai) Development Co., Ltd.
Polymerase chain reaction protocol. The concentration of DNA or RNA was quantified by NanoDrop 2000c (Thermo Scientific, USA). Polymerase Chain Reaction was conducted using a A300 Fast Gradient Thermal Cycler (Long Gene Scientific Instruments, Hangzhou, China). Briefly, each reaction was performed under conditions of initial denaturation at 94° C. for 3 minutes, followed by 30 cycles of denaturation (94° C. for 30 seconds), annealing (63° C. for 30 seconds) and extension (72° C. for 60 seconds) and a final extension at 72° C. for 10 minutes.
Electrophoresis protocol. Agarose were purchased from Biofroxx (German). 50× Tris-Acetate EDTA (TAE) buffer (40 mM Tris-acetate and 1 mM EDTA, pH 8.0) were purchased from Biosharp (China). 8 μL of each amplification product was separated by gel electrophoresis on 1% agarose. The electrophoresis conditions were as follows: power supply (DYY-6C power supply, Liuyi Biotechnology) was set to 130 V. 1% agarose gels were run at room temperature (25° C.) for 20 minutes and stained with or without GoodView™ (SBS Genentech, Beijing, China). A DNA size marker (1 kb DNA Ladder, TsingKe Biotech, TSJ102, Beijing, China) was used. Gels were analyzed by in-gel fluorescence measurements on a FluorChem® FC3 imager (Alpha Innotech). Fluorescence was measured with a blue light excitation wavelength (475 nm) and a green filter emission (537 nm) and/or a red-light excitation wavelength (632 nm) and a far-red filter emission (710 nm).
The green fluorescent probe PTAD-DBCO-FITC (8).
A solution of compound PTAD-N3 (10.1 mg, 0.04 mmol) and DBCO-FITC (5.0 mg, 0.01 mmol) in dry N,N-dimethylformamide (DMF, 1.0 mL) under nitrogen was stirred at room temperature overnight. The reaction mixture was purified by RP-HPLC (Separation was achieved using an Ultimate XB-C18 column, 21.2×250 mm 5 micron (Welch Materials Inc., Shanghai, China) by gradient elution from 5% to 70% MeCN in water (constant 0.1% formic acid) over 20 minutes, isocratic elution with 70% MeCN from 20 to 30 minutes, and returned to initial conditions and equilibrated for 5 minutes) to give compound PTAD-DBCO-FITC precursor (6.0 mg, 0.006 mmol, 66%, retention time 22 minutes) as yellow solid. 1H NMR (400 MHz, methanol-d4) δ 8.23 (s, 1H), 7.99 (d, J=8.0 Hz, 1H), 7.54 (d, J=8.0 Hz, 2H), 7.45 (s, 2H), 7.29 (d, J=4.0 Hz, 1H), 7.25-7.23 (m, 3H), 7.29 (t, J=8.0 Hz, 2H), 7.15 (d, J=8.0 Hz, 1H), 6.95 (d, J=8.0 Hz, 1H), 6.86 (d, J=8.0 Hz, 1H), 6.60 (s, 2H), 6.51 (q, J=12.0 Hz, 2H), 6.46-6.42 (m, 2H), 5.94 (d, J=20.0 Hz, 1H), 4.42 (t, J=16.0 Hz, 1H), 4.36 (t, J=8.0 Hz, 1H), 3.40-3.30 (m, 2H), 2.24-2.08 (m, 2H), 1.96-1.74 (m, 2H). HRMS (TOF) Calculated for [M+Na]+=919.2452. found=919.2436. The in situ oxidation of the PTAD-DBCO-FITC precursor in N,N-dimethylformamide (DMF) using an equal equivalent of N-bromosuccinimide (NBS) yielded PTAD-DBCO-FITC (8, 10 mM in DMF, stored at −20° C.). This product was utilized directly without further purification.
The red fluorescent probe PTAD-DBCO-Cy5 (10).
A solution of compound PTAD-N3 (3.4 mg, 0.0128 mmol) and DBCO-Cy5 (5.0 mg, 0.0064 mmol) in dry CH3CN (1.0 mL) under nitrogen was stirred at room temperature overnight. The reaction mixture was purified by RP-HPLC (Separation was achieved using a Ultimate XB-C18 column, 21.2×250 mm 5 micron (Welch Materials Inc., Shanghai, China) by gradient elution from 5% to 70% MeCN in water (constant 0.1% formic acid) over 15 min, isocratic elution with 70% MeCN from 15 to 30 minutes, and returned to initial conditions and equilibrated for 5 minutes) to give PTAD-DBCO-Cy5 precursor (6.0 mg, 0.0057 mmol, 90%, retention time 24 minutes) as blue solid, which was re-dissolved in 0.3 mL of anhydrous N,N-dimethylformamide (DMF) and subsequently treated with an equal equivalent of N-bromosuccinimide (NBS) solution (20 mM in DMF) to formulate a PTAD-DBCO-Cy5 stock solution (10, 10 mM in DMF, stored at −20° C.). MS (ESI) Calculated for [M−H]−=1035.5. found [M−H]−=1034.7. HRMS (TOF) Calculated for [M−Cl]+=1003.4977. found 1003.4964.
Nucleoside i6A (35 mg, 0.1 mmol) and proton sponge (58 mg, 0.25 mmol) were dissolved in trimethyl phosphate (0.7 mL) and was placed under the ice-water bath. phosphorus oxychloride (33.6 μL, 0.36 mmol) was slowly added and stirred at 0° C. for 12 hours. A solution of tributylamine (175 μL, 0.72 mmol) and tributyl ammonium pyrophosphate (642 mg) in DMF (2.0 mL) was slowly added and the reaction mixture was stirred at 0° C. for 30 minutes. The reaction was quenched by addition of 1.0 M aqueous TEAB (pH=7.5, 15 mL). The mixture was diluted with H2O (5.0 mL) and subjected to HPLC purification. Separation was achieved using an Ultimate XB-C18 column, 21.2×250 mm 5 micron by gradient elution from 5% to 50% MeCN in water (constant 0.1% formic acid) over 25 minutes, isocratic elution with 50% MeCN from 25 to 30 minutes, and returned to initial conditions and equilibrated for 5 minutes to give i6ATP (10.0 mg, as tributyl ammonium salt, retention time 20 minutes). 1H NMR (400 MHz, methanol-d4) δ 8.40 (s, 2H), 8.30 (s, 1H), 6.13 (d, J=4.0 Hz, 1H), 5.40 (t, J=12.0 Hz, 2H), 4.73 (q, J=16.0 Hz, 1H), 4.57-4.43 (m, 1H), 4.27-4.26 (m, 2H), 4.20-4.13 (m, 2H). The proton peak of methyl group in prenylated functionality was covered by tributyl ammonium salt in 1H NMR. 31P NMR (162 MHz, methanol-d4) δ −10.47 (d, J=22.68 Hz, α-P), −11.46 (d, J=22.68 Hz, γ-P), −24.00 (t, J=43.74 Hz, β-P). MS (ESI) Calculated for [M−H]−=574.1. found [M−H]−=574.2.
The nucleoside i6A (10 mg, 0.03 mmol) was dissolved in a mixture of CD3CN and D2O (0.5 mL, volume-to-volume ratio of 4:1, resulting in a final concentration of 60 mM). Subsequently, 4-phenyl-3H-1,2,4-triazole-3,5(4H)-dione (PTAD, 26 mg, 0.15 mmol, equivalent to 5 times the amount of i6A) was added to the solution (resulting in a final concentration of 300 mM). The reaction mixture was then allowed to stand at room temperature. The 1H NMR spectra were obtained using a 400 MHz 1H NMR instrument with CD3CN/D2O as the solvent.
The HeLa cells are first cultured in a six-well plate until they reach a confluence of 70%-80%. Different concentrations of i6A (4) are then added to the DMEM medium, consisting of 100 μM, 200 μM, and 400 μM of i6A. After a 12-hour incubation period, all cells are collected, and total RNA is extracted from them.
RNA incorporating i6A was labeled using fluorescein PTAD-DBCO-FITC (8).
To In a 1.5 mL Eppendorf tube containing the PTAD-DBCO-FITC precursor (7, 10 mL of a 10 mM solution in DMF), N-bromosuccinimide (1.0 mL of a 100 mM solution in DMF) was added. The mixture was gently vortexed, and the formation of a light-red color was observed. The reagent was kept on ice and immediately used for labeling the i6A-incorporated RNA.
The i6A-incorporated total RNA (10 mL, 20 ng/mL in DEPC water) was incubated with PTAD-DBCO-FITC (8, 10 mL of a 0.1 mM solution), shaken for 30 minutes at 0° C. The RNA was then analyzed through gel electrophoresis on a 1% agarose gel. The electrophoresis conditions were as follows: power supply (DYY-6C power supply, Liuyi Biotechnology) was set to 130 V. 1% agarose gels were run at room temperature (25° C.) for 20 minutes and stained with or without GoodView™ (SBS Genentech, Beijing, China). A DNA size marker (1 kb DNA Ladder, TsingKe Biotech, TSJ102, Beijing, China) was used. Gels were analyzed by in-gel fluorescence measurements on a FluorChem® FC3 imager (Alpha Innotech). Fluorescence was measured with a blue light excitation wavelength (475 nm) and a green filter emission (537 nm).
RNA containing i6A was labeled using fluorescein PTAD-DBCO-Cy5 (10).
In a 1.5 mL Eppendorf tube, the PTAD-DBCO-Cy5 precursor (9, 10 mL of a 10 mM solution in DMF) was mixed with N-bromosuccinimide (1.0 mL of a 100 mM solution in DMF). The mixture was gently vortexed, and the formation of a light-red color was observed, indicating successful reaction. The prepared reagent was then kept on ice and used immediately for labeling the i6A-incorporated RNA. The i6A-containing total RNA (10 mL, 20 ng/mL in DEPC water) was incubated with the activated PTAD-DBCO-Cy5 (10, 10 mL of a 0.1 mM solution), and the mixture was shaken for 30 minutes at 0° C. to facilitate labeling. Following this incubation, the labeled RNA was analyzed through gel electrophoresis on a 1% agarose gel. The electrophoresis conditions were as follows: power supply (DYY-6C power supply, Liuyi Biotechnology) was set to 130 V. 1% agarose gels were run at room temperature (25° C.) for 20 minutes and stained with or without GoodView™ (SBS Genentech, Beijing, China). A DNA size marker (1 kb DNA Ladder, TsingKe Biotech, TSJ102, Beijing, China) was used. Gels were analyzed by in-gel fluorescence measurements on a FluorChem® FC3 imager (Alpha Innotech). Fluorescence was measured with a blue light excitation wavelength (475 nm) and a green filter emission (537 nm).
The reactions were conducted in DMSO-d6 to enable in situ NMR characterization without the need for additional purification. i6A (4, 0.15 mmol) was dissolved in DMSO-d6 (0.5 mL), and iodine (0.45 mmol in DMSO-d6) was subsequently added to the solution. The mixture was then stirred at 37° C. for 5 minutes. After this incubation period, the reaction mixture was subjected to 1H NMR, 13C NMR, HPLC, UV and LC-MS analysis.
All calculations were conducted using the Gaussian 16, Revision B.01 software program. The geometry optimization of the model systems in the gas phase was performed utilizing the B3LYP/BSI Density Functional Theory (DFT) method, which was augmented with the D3 (BJ) version of Grimme's empirical dispersion correction. In this context, BS1 refers to a basis set that combines the SDD for iodine atoms and the 6-31G (d, p) for all other atoms. Frequency calculations were carried out at the same theoretical level to ascertain whether the optimized structures represent minima (absence of imaginary frequencies) or saddle points (presence of a single imaginary frequency) on the potential energy surface. These calculations also provided thermal corrections to the Gibbs free energies. The solvation energy corrections were computed at the B3LYP/BS1 level employing the SMD solvation model for DMSO, based on the gas-phase optimized geometries. Single point energies were calculated using the M062X/BS2 method, which was also augmented with the D3 version of Grimme's empirical dispersion correction. In this case, BS2 denotes a basis set combining the SDD for iodine atoms and the Def2TZVP for all other atoms.
Illustrations of the profiling methods for i6A residues in cells.
In Vitro Detection of the Efficiency of i6ATP in the T7 RNA Polymerase Transcription System.
The EGFP mRNA sequence was obtained by transcribing under the conditions of i6ATP: ATP ratio of 0:1, 3:7, 7:3, 1:0 through the in vitro transcription system, and then detected each group of EGFP RNA sequence through iodine (0.5 M) treatment. After processing, the cDNA sequence of EGFP in each group was obtained by reverse transcription. Next, the obtained cDNA sequence was used as a template, and the mutant EGFP gene sequence in each group was obtained by PCR amplification, and cloned into the T vector. Through the amplification of DH5α E. coli., 50 clones were picked from each group and sequenced to detect the mutation. After the mutation site of EGFP gene, and analyze the insertion efficiency of i6ATP.
Comparative Analysis of the Recognition Efficiency of T7 RNA Polymerase and RNA Polymerase II for i6ATP in Eukaryotic Cells.
T7 RNA polymerase and T7-IRES2-EGFP expression system were expressed in Hela cells, and RNA was extracted 48 hours after transfection, and then the mixture was treated with iodine to cause i6ATP insertion site mutation. Then, the mutant sequences of the two endogenously expressed genes of GAPDH and Notch1 were obtained by reverse transcription, and the mutant gene sequence of EGFP catalyzed by T7 RNA polymerase transcription was obtained. Same as the previous detection process, by detecting the insertion efficiency of i6ATP in the endogenous highly expressed gene GAPDH and the oncogene Notch1 and the insertion efficiency of i6ATP in T7-EGFP. Compare the recognition efficiency of i6ATP by eukaryotic RNA polymerase and the feasibility of using it as a real-time transcriptome marker.
Yeast Cell Culture and tRNA Isolation.
The wild-type and MOD5 deletion strains of Saccharomyces cerevisiae were kindly provided by Dr. Jia-Xing Yue from Sun Yat-sen University Cancer Center. The S. cerevisiae cells were cultured in yeast extract-peptone-dextrose (YPD) medium at 30° C. until reaching the logarithmic growth phase. For H2O2 treatment, the cells were exposed to 12 mM H2O2 for 1 hour before harvesting, as reported previously (38).
Cells were pelleted by centrifugation, and grinded into fine powders using a mortar and pestle, with the addition of liquid nitrogen. Total RNA was extracted using Trizol (Invitrogen, RN0401) following the manufacturer's instructions. RNAs shorter than 200 nucleotides, primarily consisting of tRNAs, were isolated using the Small RNA clean kit (ZYMO research).
To determine the levels of i6A in tRNAs, liquid chromatography-tandem mass spectrometry (LC-MS/MS) was performed by Wuhan BioBank in Wuhan, China. Synthesized i6A was used as a standard for this analysis.
Initially, 10 μg of tRNAs were treated with 0.1 M Tris-HCl (pH 9.0) at 37° C. for 45 minutes to deacylate the tRNAs. Half of the deacylated tRNAs were treated with 0.5 M I2 for 15 minutes on ice. Following this, saturated Na2S2O3 was added until the solution became colorless, and saturated Na2CO3 was added until it ceased to produce bubbles. Once the reaction was complete, the tRNAs were purified using the RNA Clean & Concentrator Kit (ZYMO Research). Subsequently, both iodine-treated and untreated tRNAs were used for sequencing library construction, following a previously established protocol with some modifications (39). Notably, Induro™ Reverse Transcriptase (NEB, #M0681L) was used instead of TGIRT. The process began with the incubation of deacylated tRNAs with T4 PNK to dephosphorylate their 3′ ends. tRNAs from both samples were then purified using PAGE, followed by the ligation of a 3′ adaptor to the purified tRNAs using T4 RNA Ligase 2, truncated KQ (NEB, M0373L), at 25° C. for 3 hours. After another round of PAGE purification, the ligated products were reverse transcribed using Induro® Reverse Transcriptase (NEB, M0681L) at 55° C. for 16 hours in 1×RT buffer (10 mM MgCl2, 10 mM DTT, and 75 mM KCl). The resulting cDNA was then purified and circularized using Circligase™-II (Lucigen, #CL9025K). Finally, the libraries were amplified using KAPA HiFi Polymerase (Roche, #KK2602) and sequenced on an Illumina Novaseq 6000 platform. The primer sequences are listed in Table S5.
tRNA-Seq Data Analysis.
Adaptors and random sequences were trimmed, and low-quality sequences were discarded using Cutadapt software (40-41). The resulting clean reads were then processed using a previously published pipeline to quantify and analyze tRNA expression and modifications (39). The mature tRNA sequences of S. cerevisiae were obtained from GtRNAdb. The analysis was performed using the following parameters:
To ensure the quality of our tRNA-seq data, we thoroughly examined several key parameters. These included mapping rate, correlation of samples, coverage of tRNAs, and detection of known tRNA modifications.
To identify the i6A site in tRNAs, the mutation rate with and without I2 treatment was calculated to determine the i6A score. The i6A score is defined as relative possibility of i6A modification at a specific site. A larger mutation rate difference corresponds to a higher score. An i6A score threshold of 1 was set, with scores greater than 1 indicating the presence of i6A modification at the respective site. The score is calculated as follows:
Mt represents the mutation rate with I2 treatment, and Mc is the mutation rate without I2 treatment. To ensure efficient calculation of the i6A score, Laplace smoothing is incorporated. All customized scripts used in this study are available from the corresponding author upon request.
Two-tailed t-tests or Wilcoxon's signed-rank tests were employed for group comparisons. Correlation analyses were conducted using Pearson tests. A p-value of less than 0.05 was considered statistically significant. Data visualization was accomplished using custom R scripts.
We synthesized the i6A nucleoside (4 in
To further expand this method and take advantage of the fluorescence labeling using this chemistry, we constructed two fluorescent probes derived from the structures of PTAD (8 and 10,
With the two fluorescent labeling reagents and i6ATP building block in hands, we asked whether i6A nucleoside can be incorporated by RNA polymerases in the process of transcription within eukaryotic cells, and if so can they be recognized and fluorescently labeled (
For the labeling experiment, HeLa cells were harvested after the verification of cytotoxic study (
We further examined the prenyl modification on RNAs (35-37) via the addition of I2, a reagent for the cyclisation reaction of a prenyl group (42-44). As expected, the reaction of i6A with optimized concentration of iodine reagent proceeded very smoothly to yield the full conversion of i6A in less than 5 minutes (
DFT calculations were performed to elucidate the possible reaction pathway and mechanism. The addition of I2 resulted in the spontaneous attachment of di-iodinyl atoms to the double bond of the prenyl group with four potential isomers formed because of the chirality of the products (1a-1b, 2a-2b) (
Profiling of artificially incorporated i6A residues in cell.
We subsequently applied this IMCRT method to detect i6A modification sites in artificial RNAs. The T7P-EGFP gene was utilized as a template for in vitro transcription, employing T7 RNA Polymerase. In the course of cellular metabolism, living HeLa cells were fed with 30% and 70% i6A nucleoside in two groups respectively following the optimized protocol. The resulting EGFP RNA from each experimental group was further treated with I2 (0.5 M). Each EGFP RNA was then reverse transcribed into cDNA followed by PCR amplification. Each amplified EGFP amplicon was next cloned into the T-vector and transformed into DH5α E. coli. A total of 50 clones were picked from each group, followed by high-throughput sequencing and comparing with the native control to determine the sites of mutation on EGFP gene and to analyze the insertion efficiency of i6A (
Characterization of i6A Modification in Yeast tRNAs.
In Saccharomyces cerevisiae, Mod5p is responsible for catalyzing the i6A modification at position 37 in six cytosolic (cy) and three mitochondrial (mt) tRNAs. These tRNAs include cy-tRNA-Ser-TGA-1, cy-tRNA-Ser-AGA-1, cy-tRNA-Ser-AGA-2, cy-tRNA-Ser-CGA-1, cy-tRNA-Cys-GCA-1, cy-tRNA-Tyr-GTA-1, mt-tRNA-Tyr-GTA-1, mt-tRNA-Cys-GCA-1, and mt-tRNA-Trp-TCA-1 (45-47). Our mass spectrometry results confirmed that the deletion of MOD5 leads to the complete loss of i6A modification in tRNAs (
To map i6A sites and quantify their modification levels, we conducted IMCRT tRNA-seq. Gel-purified tRNAs were treated with or without I2 before constructing sequencing libraries (
Importantly, our IMCRT tRNA-seq enables the accurate identification of all yeast tRNAs with i6A modification. For instance, in wild-type cells, the mutation rate (MR) at A37 in cy-tRNA-Ser-AGA-1 was approximately 0% before I2 treatment (
Next, we conducted IMCRT tRNA-seq experiments on tRNAs extracted from yeast cells cultured under different conditions. Consistent with a previous study, H2O2 treatment resulted in decreased tRNA i6A levels, as detected by mass spectrometry (
In summary, we employed Ene-ligation bioorthogonal chemistry and novel PTAD-based fluorescence probes for the direct labeling of i6A-containing RNAs in vitro. We synthesized the i6A triphosphate building block and demonstrated that it can be recognized and incorporated by eukaryotic RNA polymerase during intracellular transcription, allowing for adjustable i6A concentrations and the production of i6A-modified RNA that can be directly labeled by our PTAD-based fluorescent dye (8 and 10). This method enables us to study the functions of i6A and provides a general approach to label fluorescence onto RNAs for molecular tracking based on i6A addition.
We also discovered that iodine-mediated cyclization reactions of the prenyl group occur rapidly, transforming i6A from a hydrogen-bond acceptor to a donor. Leveraging this reactivity, we developed a novel iodine-mediated cyclization and reverse transcription sequencing (termed IMCRT tRNA-seq) method for profiling i6A residues in cellular RNAs. Following established protocols, we demonstrated that IMCRT tRNA-seq accurately identifies tRNAs containing i6A37 with single nucleotide precision, including all yeast tRNAs with i6A modification. Under stress response conditions (eg. H2O2), we observed decreased i6A levels in budding yeast, accompanied by substantial decreases in both the mutation rate at A37 and i6A37 scores. These findings suggest that our IMCRT tRNA-seq technique is not only capable of mapping i6A sites with single nucleotide resolution, but also permits semi-quantification of i6A levels in tRNAs.
In addition to i6A, over 100 RNA modifications have been identified. Accurate mapping and quantification methods are essential for studying their biological roles. While antibodies recognition of some specific modifications like m6A, m1A, m7G, and ac4C have been used for mapping, their resolution is typically limited to tens or hundreds of nucleotides due to fragmented RNA sizes. Alternatively, methods combining unique chemical reactivity and detection of mutation/stop caused during reverse transcription have successfully identified a few modifications such as m5C, pseudouridine (Ψ) and ac4C at single-base resolution (53). However, low abundance modifications require increased sequencing depth or pre-library construction enrichment using antibodies, as in m1A-MAP methods. In budding yeast, the high proportion of i6A-containing tRNAs negates the need for enrichment, but detecting i6A in other RNA species might necessitate enrichment using anti-i6A antibodies (54-55).
While there is currently no clear evidence for the presence and distribution of i6A residues across the transcriptome, likely due to the lack of appropriate molecular tools, the question of whether this prenyl modification exists in other RNA species such as mRNA remains open. Nonetheless, i6A residues have been actively implicated in various human diseases, including diabetes, cancer, and neurodegenerative disorders. Moreover, the biological functions of prenyl modifications in both RNA and proteins have gained increasing recognition. For instance, the expression levels of prenylated 2′-5′-oligoadenylate synthetase 1 (OAS1) have been closely associated with the severity of hospitalized SARS-COV-2 patients, highlighting the potential critical roles of prenylation in the human antiviral defense system (56). Our newly developed chemical biology methods will offer unique toolsets for detecting, profiling, and monitoring i6A residues in both normal and diseased cell environments, facilitating more comprehensive functional studies of this natural modification and shedding light on its implications in various biological processes and pathologies.
Although some non-limiting examples have been depicted and described in detail herein, it will be apparent to those skilled in the relevant art that various modifications, additions, substitutions, and the like can be made without departing from the spirit of the present disclosure and these are therefore considered to be within the scope of the present disclosure as defined in the claims that follow.
It should be appreciated that all combinations of the foregoing concepts and additional concepts discussed in greater detail herein (provided such concepts are not mutually inconsistent) are contemplated as being part of the inventive subject matter disclosed herein. In particular, all combinations of claimed subject matter appearing at the end of this disclosure are contemplated as being part of the inventive subject matter disclosed herein and may be used to achieve the benefits and advantages described herein.
This application claims benefit of priority from U.S. Provisional Patent Application No. 63/465,425, filed May 10, 2023, the entire content of which is incorporated herein by reference.
This invention was made with Government support under grant numbers CHE1845486 and MCB1715234 awarded by the National Science Foundation. The Government has certain rights in the invention.
Number | Date | Country | |
---|---|---|---|
63465425 | May 2023 | US |