The disclosure is generally directed to compositions, systems, and methods for performing atom transfer radical addition (ATRA) or atom transfer radical cyclization (ATRC) reaction, and more specifically the disclosure includes description of modified enzymes for catalyzing stereoselective atom transfer radical addition.
This application hereby incorporates by reference the material of the electronic Sequence Listing filed concurrently herewith. The material in the electronic Sequence Listing is submitted as an XML (.xml) file entitled “07431PCT_SeqList_ST26.xml” created on Aug. 2, 2022, which has a file size of 8 KB, and is herein incorporated by reference in its entirety.
As nature's privileged catalysts, enzymes are well-known for their ability to exert exquisite control over the stereochemical outcome of chemical reactions. Over the past three decades, the advent of directed evolution has enabled the rapid development of customized enzymes to furnish excellent catalytic activity and stereoselectivity which surpass those of traditional small-molecule catalysts, thus highlighting the potential of biocatalysis to revolutionize the practice of asymmetric synthesis. The catalytic repertoire of enzymes has been mostly limited to reactions found in nature, posing constraints on the types of products available from enzyme catalysis. Furthermore, to date, a variety of catalysis modes discovered and optimized by synthetic chemists has not been able to be performed by natural enzymes. To merge the excellent tunability and stereocontrol of biocatalysts with the synthetic versatility of abiotic systems, the discovery and development of new-to-nature enzymatic activity are widely recognized as preeminent objectives at the interface of modern biocatalysis and organic synthesis.
Compositions, systems, and methods for performing biocatalytic atom transfer radical addition chemistry are described. Modified metalloenzymes are utilized to perform enantioselective and/or diastereoselective atom transfer radical addition chemistry on a broad range of substrates. Various biological enzymes can be modified to perform stereoselective ATRA/ATRC chemistry, including (but not limited to) various metalloenzymes such as heme enzymes (e.g., cytochromes P450, cytochromes c, protoglobins, and myoglobins), non-heme Fe enzymes, and other Co-and Cu-dependent metalloenzymes. Further, several embodiments are directed to the ability of performing stereoselective ATRA reactions on substrates having an unsaturated carbon-carbon bond, in which an alkyl halide substrate is selectively transfer to the alkene/alkyne to form a further saturated alkyl halide product.
In an aspect, a method is for conducting a controlled atom transfer radical addition or atom transfer radical cyclization reaction catalyzed by an unnatural metalloenzyme. The method provides a first substrate comprising an alkyl halide moiety, an aryl halide moiety, or other alkyl or aryl radical precursor. The method provides a second substrate comprising an unsaturated carbon-carbon moiety. The method provides a metalloenzyme catalyst that exerts enantiocontrol or diastereocontrol over the first and the second substrates. The method combines the first substrate, the second substrate, and the metalloenzyme catalyst to react such that an atom transfer radical addition reaction or an atom transfer radical cyclization reaction occurs to afford a product wherein the alkyl group and the halogen atom of the alkyl halide moiety are installed across the unsaturated C—C bond of the unsaturated moiety with a desirable enantiomer or and diastereomer outcome.
In some implementations, the first and second substrates are within a single molecule.
In some implementations, the first and second substrates are in two different molecules.
In some implementation, the halide of the alkyl halide moiety or aryl halide moiety bromide or chloride.
In some implementations, the unsaturated carbon-carbon moiety is an alkene or an alkyne.
In some implementations, the metal cofactor is Fe(II)/Fe(III) redox couple.
In some implementations, the metalloenzyme catalyst is: P450ATRAse1, P450ATRAse2, P450ATRAse3, P450ATRAse4, or an earlier variant from the P450ATRAse1, P450ATRAse2, P450ATRAse3, or P450ATRAse4 evolutionary lineage.
In some implementations, a metalloenzyme catalyst comprises a modified Bacillus megaterium P450 (CYP102A1) having one or more of the following mutations: A82T, L181F, L181V, I263Q, I263L, E267A, A268G, H266T, T327I, P327C, T327V, A330T, S400A, L436T, L437I, L437F, T438Q or S438Y.
In some implementations, the modified P450 has one or more of the following mutations: A82T, L181F, I263Q, H266T, or T327I.
In some implementations, the modified P450 has the following mutations: A82T, L181F, I263Q, H266T, and T327I.
In some implementations, the modified P450 has a sequence comprising SEQ ID No. 1.
In some implementations, the modified P450 has one or more of the following mutations: L181V, P327C, S400A, L436T or T438Q.
In some implementations, the modified P450 has the following mutations: L181V, P327C, S400A, L436T and T438Q.
In some implementations, the modified P450 has a sequence comprising SEQ ID No. 2.
In some implementations, the modified P450 has one or more of the following mutations: A82T, L181F, I263Q, E267A, T327I, L437I or S438Y.
In some implementations, the modified P450 has the following mutations: A82T, L181F, I263Q, E267A, T327I, L437I and S438Y.
In some implementations, the modified P450 has a sequence comprising SEQ ID No. 3.
In some implementations, the modified P450 has one or more of the following mutations: A82T, L181V, 1263L, A268G, T327V, A330T, L437F or S438Q.
In some implementations, the modified P450 has the following mutations: A82T, L181V, 1263L, A268G, T327V, A330T, L437F and S438Q.
In some implementations, the modified P450 has a sequence comprising SEQ ID No. 4.
In some implementations, a nucleic acid comprises a sequence for encoding a modified Bacillus megaterium P450 (CYP102A1) having one or more of the following mutations: A82T, L181F, L181V, I263Q, 1263L, E267A, A268G, H266T, T327I, P327C, T327V, A330T, S400A, L436T, L437I, L437F, T438Q or S438Y.
In some implementations, the nucleic acid comprises a sequence for encoding the modified P450 having one or more of the following mutations: A82T, L181F, I263Q, H266T, or T327I.
In some implementations, the nucleic acid comprises a sequence for encoding the modified P450 having the following mutations: A82T, L181F, I263Q, H266T, and T327I.
In some implementations, the nucleic acid comprises a sequence for encoding the modified P450 having a sequence comprising SEQ ID No. 1.
In some implementations, the nucleic acid comprises a sequence for encoding the modified P450 having one or more of the following mutations: L181V, P327C, S400A, L436T or T438Q.
In some implementations, the nucleic acid comprises a sequence for encoding the modified P450 having the following mutations: L181V, P327C, S400A, L436T and T438Q.
In some implementations, the nucleic acid comprises a sequence for encoding the modified P450 having a sequence comprising SEQ ID No. 2.
In some implementations, the nucleic acid comprises a sequence for encoding the modified P450 having one or more of the following mutations: A82T, L181F, I263Q, E267A, T327I, L437I or S438Y.
In some implementations, the nucleic acid comprises a sequence for encoding the modified P450 having the following mutations: A82T, L181F, I263Q, E267A, T327I, L437I and S438Y.
In some implementations, the nucleic acid comprises a sequence for encoding the modified P450 having a sequence comprising SEQ ID No. 3.
In some implementations, the nucleic acid comprises a sequence for encoding the modified P450 having one or more of the following mutations: A82T, L181V, 1263L, A268G, T327V, A330T, L437F or S438Q.
In some implementations, the nucleic acid comprises a sequence for encoding the modified P450 having the following mutations: A82T, L181V, 1263L, A268G, T327V, A330T, L437F and S438Q.
In some implementations, the nucleic acid comprises a sequence for encoding the modified P450 having a sequence comprising SEQ ID No. 4.
The description and claims will be more fully understood with reference to the following figures and data graphs, which are presented as exemplary embodiments and should not be construed as a complete recitation of the scope of the disclosure.
Turning now to the drawings and data, compositions, systems and methods for performing stereoselective atom transfer radical addition (ATRA) reaction or atom transfer radical cyclization (ATRC) and applications thereof are described, in accordance with various embodiments. Several embodiments are directed to the use of modified biological enzymes to perform synthetic chemistry reactions with enantiocontrol and diastereocontrol. Particularly, many embodiments are directed to the use of a modified enzyme to perform an enantioselective ATRA or ATRC reaction. Various biological enzymes can be modified to perform stereoselective ATRA or ATRC chemistry, including (but not limited to) various metalloenzymes such as heme enzymes (e.g, cytochromes P450, cytochromes c, protoglobins, and myoglobins), non-heme Fe enzymes, and other Co- and Cu-dependent metalloenzymes. Further, several embodiments are directed to the ability of performing stereoselective ATRA or ATRC reactions on substrates having an alkene or alkyne, in which an alkyl halide substrate is selectively transfer to the alkene to form a more saturated alkyl halide product (
Due to the highly reactive nature of radical intermediates and the lack of synthetically realizable stereoinduction strategies, imposing enantiocontrol and diastereocontrol over free radical-mediated bond forming processes is a daunting challenge in asymmetric catalysis. As such, despite the widespread utility of atom transfer radical reactions in synthetic chemistry, the development of general methods for stereocontrolled ATRA or ATRC presents significant hurdles for conventional small-molecule catalysts. More specifically, due to the difficulty in maintaining tight association between a small molecule chiral catalyst and a radical intermediate, general catalytic strategies to exert enantiocontrol over, for example, a C—C bond forming radical addition step remains unavailable. Furthermore, imposing catalyst-controlled diastereoselectivity in, as another example, a subsequent halogen-rebound step is similarly challenging. Conventional catalytic strategies to accomplish excellent diastereocontrol over radical-mediated bond forming processes have long eluded synthetic chemists. As a result, radical-mediated reactions usually exhibit low diastereoselectivity under substrate control.
Natural enzymes such as radical SAM enzymes facilitate radical reactions with excellent chemoselectivity, regioselectivity and stereoselectivity. The vast majority of natural radical enzymes, however, display an extremely narrow substrate scope, and, thus, require extensive and often challenging engineering to afford generally useful catalysts for stereocontrolled radical chemistry. Nevertheless, it has been shown that various biological enzymes can be repurposed to catalyze unnatural radical reactions, leading to the development of novel biocatalytic methods, thereby allowing for a diverse array of easily available substrates to be converted with excellent operational simplicity.
Furthermore, metalloenzymes, such as heme-containing proteins can potentially provide opportunities for use as a general platform for asymmetric radical transformations. In general, heme chemistry has been proven useful, for example, in radical polymerizations, wherein polymer scientists have employed hemes to generate radical species for polymerization. In addition, metalloredox couples spanning a wide window of potentials, including Fe(II)/Fe(III), Co(I)/Co(II), and Cu(I)/Cu(II), are readily available in a variety of native metalloenzymes.
Through out the description, the terms ATRA and ATRC are used interchangeably. As would be understood in the art, the enzymes as described herein can be used in ATRA or ATRC reactions, depending the substrates utilized to perform the reaction.
Several embodiments are directed to systems, compositions and methods for controlled formation and transformation of organic radicals catalyzed by modified metalloradical enzymes. In many embodiments, controlled formation and transformation of organic radicals is performed by an ATRA or ATRC reaction. In many embodiments, metalloenzyme catalysts comprise first-row transition-metal cofactors. Various organic radical manipulating methods of various embodiments utilize innate redox properties of the first-row transition-metal cofactors within metalloenzymes, and proceed via ground-state single electron transfer between the enzyme's metallocofactor and the substrate. In many embodiments, metalloenzyme catalysts are modified from natural analogs resulting in an intimate interaction between a desired substrate, including the corresponding reactive radical intermediate, and a metalloprotein scaffold of the metalloenzyme catalyst. In many embodiments, methods utilizing metalloenzyme catalysts provide enantiocontrol and diastereocontrol over the desired radical reactions, especially as compared to methods utilizing conventional small-molecule catalysts.
In many embodiments, metalloenzyme catalysts comprise a metal ion such as (for example) Fe(II)/Fe(III), Co(I)/Co(II), or Cu(I)/Cu(II). In some embodiments, a metalloenzyme catalyst comprise an iron ion supported by a heme scaffold. In embodiments utilizing an iron ion metalloenzyme catalyst, the metalloprotein catalyst in its ferrous state undergoes a single electron transfer with a first substrate comprising an organic halide, and produces a transient radical intermediate in the catalyst's active site (
In many embodiments, the ATRA transformations between a radical precursor (including, but is not limited to, alkyl halides and aryl radical precursors) and an alkene/alkyne afforded by the instant methods is most generally represented by the following scheme (see also
wherein, R1 is an alkyl group; and R2, R3, R4 and R5 are each individually selected from the group consisting of: H, alkyl, aryl, acyl, CN, NO2, R′R″N—, or R′O—, wherein, further, R′ and R″ are each an alkyl group, and wherein X is a halide. In some cases, the ATRA product undergoes elimination to provide the corresponding alkene product after the loss of HX. Although an alkene substrate is presented in the reaction drawing, it is understood that an alkyne substrate can be utilized as well.
In some embodiments, an alkyl halide and unsaturated hydrocarbon moieties are combined in a single substrate to yield synthetic product with a cyclic entity, as illustrated, for example, in
In some embodiments, various particular metalloenzyme catalysts are used to produce compounds with differing stereochemistries from the same substrate. For example, as shown in
In various embodiments, easily accessible substrates and various particular metalloenzyme catalysts are utilized to obtain compounds with any desired diastereochemistry, based on the control of the relative stereochemistry in the C-halide bond forming halogen rebound event (
In several embodiments, useful, practical, large scale synthesis of various products are produced. For example,
In many embodiments, the presence of a halide functional group in the enantioenriched ATRA products of the instant methods allows for a range of diversification reactions to be conveniently carried out post production. For example, as illustrated by
Not to be bound by any theory,
In addition,
While specific examples of methods for performing atom transfer radical addition reactions with modified enzymes are described above, one of ordinary skill in the art can appreciate that various steps of the described methods can be performed in different orders and that certain steps may be optional according to some embodiments. Further, any appropriate reactant molecules and/or modified enzymes for performing an enantioselective and/or diastereoselective atom transfer radical addition reaction can be utilized. As such, it should be clear that the various steps of the method could be used as appropriate to the requirements of specific applications.
Several embodiments are directed to compositions, especially modified enzymes, for performing ATRA chemistry. Furthermore, several embodiments are directed to methods for generating stereoselective enzymes via site-saturation mutagenesis and screening. Generally, a naturally occurring enzyme, such as a metalloenzyme, is modified at an amino acid residue (e.g., residues in the active site) to optimize the desired ATRA chemistry on a variety of substrates. Based on methods described herein, a number of modified enzymes for performing ATRA have been developed.
In accordance with various embodiments, modified enzymes for performing ATRA chemistry can be based on any appropriate enzyme, especially metalloproteins, and especially Fe-, Co-, and Cu-dependent proteins. In embodiments of Fe-dependent enzymes, either heme or non-heme proteins can be utilized. Examples of heme proteins include (but are not limited to) cytochromes P450, protoglobins, myoglobins, and cytochromes c. In some embodiments, a modified P450 enzyme is utilized. In some embodiments, a modified P450 enzyme includes a serine-ligated residue or lacks a coordinating axial residue (e.g., alanine is introduced to replace the original coordinating axial residue).
Particular embodiments of modified enzymes are directed towards modified Bacillus megaterium P450 (CYP102A1), which were developed via site-saturation mutagenesis on active site residues. It was found that the following mutations in P450 greatly enhanced enantioselective properties for performing ATRA: A82T, L181F, I263Q, H266T, and T327I. Accordingly, various embodiments are directed to P450 comprising one or more or all of these mutations. Modified P450 with these beneficial mutations (referred to as P450ATRAse1 in the Example provided; SEQ ID No. 1), or earlier variants in the evolution series, had high enantioselectivity (97% product of major enantiomer) with a broad substrate scope.
It was further found that the following mutations in Bacillus megaterium P450 greatly enhanced enantioselective properties for performing ATRA: L181V, P327C, S400A, L436T and T438Q. Accordingly, various embodiments are directed to P450 comprising one or more or all of these mutations. Modified P450 with all five of these mutations (referred to as P450ATRAse2 in the Example provided; SEQ ID No. 2), had high enantioselectivity (91% product of major enantiomer) with broad substrate scope. Notably, S400A disables serine ligation with the heme group.
It was further found that the following sets of mutations in Bacillus megaterium P450 greatly enhanced diastereoselective properties for performing ATRA in opposite chirality. To yield one of the diastereomers, the following mutations were found to have greatly enhanced diastereoselective properties: A82T, L181F, I263Q, E267A, T327I, L437I and S438Y. Accordingly, various embodiments are directed to P450 comprising one or more or all of these mutations. Modified P450 with all seven of these mutations (referred to as P450ATRAse3in the Example provided; SEQ ID No. 3), had high diastereoselectivity (96% product of major diastereomer) and high enantioselectivity (99% product of major enantiomer) with broad substrate scope. To yield the other diastereomer, the following mutations were found to have greatly enhanced diastereoselective properties: A82T, L181V, 1263L, A268G, T327V, A330T, L437F and S438Q. Accordingly, various embodiments are directed to P450 comprising one or more or all of these mutations. Modified P450 with all seven of these mutations (referred to as P450ATRAse4 in the Example provided; SEQ ID No. 4), had high diastereoselectivity (87% product of major diastereomer) and high enantioselectivity (99% product of major enantiomer) with broad substrate scope.
Turning to
Many embodiments alter the amino acid sequence of the selected enzyme at 1104. In many embodiments, one or more amino acid residues in the active site of the selected enzyme are mutated to catalyze the enantioselective activity. Various methods are known in the art to alter amino acid sequences. Several embodiments utilize site-saturation mutagenesis (SSM) to alter amino acid residues. SSM is a method that systematically replaces wild type amino acids with all 19 non-wild type amino acids at selected positions (e.g., active site amino acid residues). Various embodiments alter the amino acid sequence by changing a coding sequence (e.g., DNA sequence) for the enzyme. Such amino acid changes are possible by making codon substitutions to the coding sequence for the enzyme. Certain embodiments utilize codon optimization of the coding sequence, or utilize codons that are efficient for translation for a particular organism (e.g., E. coli).
At 1106, further embodiments express the enzyme from an expression vector. In many embodiments, expression comprises transfecting or transforming a cell (e.g., bacterial cell, yeast cell, mammal cell, plant cell, etc.) or an organism with a sequence coding for the enzyme. In many embodiments, the coding sequence of the enzyme is assembled in an expression cassette, or construct, comprising one or more of a promoter, a terminator, a signaling peptide, an enhancer, a 5′ untranslated region, a 3′ untranslated region, a splice site, an origin of replication, and/or any other feature to aid in cloning, replication, transfection, or translation of the cassette or its contents.
At 1108, several embodiments screen the enzymes. Screening can include measuring one or more metrics that are preferable for an engineered enzyme. Various embodiments screen for one or more of enantioselective yield, enzymatic activity under specific conditions (e.g., temperature, pH, buffer, etc.), catalytic activity on specific reagents, or any other relevant metric for selecting an enzyme with preferable properties.
Many embodiments are directed to enzymes capable of enantioselectivity such as described herein. Some enzyme embodiments include SEQ ID NOs: 1-4. Further embodiments are directed to nucleic acid sequences (including, but not limited to, DNA and RNA) encoding for SEQ ID NOs: 1-4. Further embodiments are directed to cassettes or constructs used to replicate and/or express a coding sequence for an enantioselective enzyme. Many embodiments are directed to organisms transformed or transfected to express an enantioselective enzyme.
Turning to
Additional embodiments are directed to transcript expression modifiers that are operatively linked to a gene of interest, including, 5′ elements 1204 and/or 3′ elements 1206, such as promoters, enhancers, terminators, etc., that assist with gene transcription and/or translation. Many types of promoters, enhancers, and terminators are known in the art, including constitutive and inducible promoters. Various embodiments include a constitutive promoter, including (but not limited to) T7 promoter, cauliflower mosaic virus 35S (CaMV 35S), cauliflower mosaic virus 19S (CaMV 19S), and/or any other constitutive promoter. Inducible promoters include (but are not limited to) galactose inducible promoters, copper inducible promoters, ethanol inducible promoters, and/or any other inducible promoter. Some embodiments include multiple expression modifiers, 5′ elements 1204 and/or multiple 3′ elements 1206 to further enhance gene transcription.
Further embodiments include one or more tags 1208, labels, or other peptides that can be used to identify, isolate, or export a protein of interest. For example, some cassettes include a His tag, a Myc tag, or other peptide tag that allows for identification or isolation of a protein of interest, while some embodiments include a signal peptide to transport the encoded peptide to a specific part of a cell or to export the encoded peptide outside of the cell.
Further embodiments of expression cassette 1200 include one or more of a splice site, an origin of replication, and/or any other feature to aid in cloning, replication, transfection of the expression cassette 1200. Additionally, while expression cassette 1200 is illustrated as a linear construct, several embodiments utilize a circularized construct (e.g., plasmid, fosmid, BAC).
Various embodiments include a cell transfected with an expression cassette, such as expression cassette 1200. In various embodiments, the cell is a plant cell, mammalian cell, fungal cell, bacterial cell, archaeal cell, etc. In some embodiments, a bacterial cell is used to express and secrete the protein of interest. Various embodiments utilize E. coli as the bacteria for expression.
The embodiments of the description will be better understood with experimental data. Attached herein is a description of methods and systems of exemplary modified enzymes and reactants for performing atom transfer radical addition chemistry. Validation results are also provided.
Unless otherwise noted, all chemicals and reagents were obtained from commercial suppliers (Sigma-Aldrich, VWR, Alfa Aesar, Combi-Blocks and Enamine) and used without further purification. Silica gel chromatography was carried out using AMD Silica Gel 60, 230-400 mesh. 1H, and 13C NMR spectra were recorded on a Bruker 400 or 500 MHz instrument in CDCI3 or DMSO-d6 and are referenced to residual protio solvent signals. 19F NMR spectra (where applicable) were recorded on a Bruker 400 MHZ (1H decoupled) and are referenced to CFCl3 as the external standard. Sonication on a small scale was performed using a BioLogics ultrasonic homogenizer (model 150VT) equipped with a stepped microtip. Sonication on a large scale was performed using a Branson Digital Sonifier 450 Ultrasonic Processor. All IR spectra were taken on a Thermo Scientific Nicolet iS5 spectrometer (iD5 ATR, diamond). High-resolution mass spectrometry data were obtained at the University of California Santa Barbara Mass Spectral Facility. High-resolution accurate mass (HRAM) ESI data was acquired using a Waters Micromass LCT Premier time-of-flight (TOF) mass spectrometer. Masses of positively charged ions were calibrated using methanol solutions of polyethylene glycol or polyethylene glycol monomethyl ether as an internal standard. Masses of negatively charged ions were calibrated using aqueous sodium formate or sodium/cesium iodide as an internal standard as appropriate. All samples were dissolved in methanol and were directly infused unless otherwise noted. Synthetic reactions were monitored by thin layer chromatography (TLC, Merck 60 gel plates) using a UV-lamp or an appropriate TLC stain for visualization.
E. coli cells were grown using Luria-Bertani medium (LB) or Hyperbroth (AthenaES) (HB) with 0.1 mg/mL ampicillin (LBamp or HBamp). Primer sequences for site-saturation mutagenesis or site-directed mutagenesis are provided below. T5 exonuclease, Phusion DNA polymerase, and Taq DNA ligase were purchased from New England Biolabs (NEB, Ipswich, MA). M9-N minimal medium (abbreviated as M9-N buffer; pH 7.4) was used as a buffering system for whole cells and lysates, unless otherwise specified. M9-N buffer was used without a nitrogen source; it contains 47.7 mM Na2HPO4, 22.0 mM KH2PO4, 8.6 mM NaCl, 2.0 mM MgSO4, and 0.1 mM CaCl2.
Analytical reversed-phase high-performance liquid chromatography (HPLC) was carried out using an Agilent 1200 series instrument and a Kromasil 100-5-C18 column (4.6×50 mm, 5 μm) with water and acetonitrile as the mobile phase. Analytical chiral HPLC was conducted using a Shimadzu i-series (66 MPa) instrument with hexanes and isopropanol as the mobile phase. Enantiomers were separated using one of the following chiral columns: Chiralpak IA (4.6 mm×25 cm, 5 micron), Chiralpak IB-N (4.6 mm×25 cm, 5 micron), Chiralpak IC (4.6 mm×25 cm, 5 micron), and Chiralpak IG (4.6 mm×25 cm, 5 micron). Gas chromatography (GC) analysis was carried out using a Shimadzu GC-2030 GC system equipped with an FID detector and with a J&W HP-5 ms column (30 m× 0.25 mm, 0.25 μm film). Chiral GC was conducted using a Shimadzu GC-2030 GC system equipped with an FID detector and with an Agilent CycloSil-B column (30 m× 0.32 mm, 0.25 μm film). Gas chromatography-mass spectrometry (GC-MS) analyses were carried out using a Shimadzu GCMS-QP2020NX system with a GC-2030 front end and a J&W HP-5 ms column (30 m×0.25 mm, 0.25 μm film).
pET-22b(+) was used as the cloning and expression vector for metalloenzymes described in this study. Genes of metalloproteins were codon optimized using E. coli as the host organism and purchased as gBlocks from GeneralBio as plasmids in a desired vector. The genes of metalloenzymes used in this study were either cloned into pET-28a (+) between Nde I and Hind III (N-terminal 6× His tag) or cloned into pET-22b(+) between Nde I and Xho I (C-terminal 6× His tag). All the heme proteins described in this study cloned into pET-22b(+) between Nde I and Xho I with a C-terminal 6× His-tag. Site-saturation mutagenesis was performed using the “22c-trick” method. The PCR products were digested with Dpnl, gel purified, and ligated using a Gibson mix prepared from 5× isothermal (ISO) reaction buffer (25% PEG-8000, 500 mM Tris-HCl PH 7.5, 50 mM MgCl2, 50 mM DTT, 1 mM each of the dNTPs, and 5 mM NAD), T5 exonuclease, Phusion DNA polymerase, and Taq DNA ligase. The ligation mixture was used directly to transform electrocompetent E. coli strain E. cloni BL21 (DE3) cells (Lucigen).
Single colonies from LBamp agar plates were picked using sterile toothpicks and cultured in deep-well 96-well plates containing LBamp (400 μL/well) at 37° C., 230 rpm shaking overnight. HBamp (900 μL/well) in a deep-well 96-well plate was then inoculated with an aliquot (100 μL/well) of these overnight cultures and allowed to shake for 2.5 h at 37° C. and 250 rpm. The plates were cooled on ice for 20 min and the cultures were induced with 0.5 mM isopropyl β-D-1-thiogalactopyranoside (IPTG) and 1.0 mM 5-aminolevulinic acid (final concentrations). Expression was conducted at 22° C., 220 rpm for 22 h.
E. coli (E. cloni BL21 (DE3)) cells in deep-well 96-well plates were pelleted (3,000 g, 5 min, 4° C.) using an Eppendorf tabletop centrifuge 5910R and resuspended in M9-N buffer (345 μL/well) by gentle shaking using a Fisher Scientific microplate shaker (800 rpm, 3 min). 40 μL of D-glucose stock solution (500 mM in M9-N buffer) was added to each well using an Eppendorf Xplorer plus, 12-channel, 15-300 μL electronic pipette, and the 96-well plate was then transferred into a Coy anaerobic chamber. In the anaerobic chamber, 15 μL of the substrate stock solution (267 mM in EtOH) was added into each well using an Eppendorf Xplorer 12-channel pipette (15-300 μL). The plate was sealed with an aluminum foil and shaken in a Corning microplate shaker at room temperature and 680 rpm in the anaerobic chamber. After 7-18 h, the 96-well plate was taken out of the anaerobic chamber, the seal was removed and the analytical scale reactions were worked up following the appropriate method below.
Product formation screening and enantioselectivity screening using GC, GC-MS or normal phase HPLC. After 7-18 h, a solution of 1 mM 1,3,5-trimethoxybenzene (internal standard) in a mixed solvent system (hexanes/EtOAc=1:1, 600 μL) was added using an Eppendorf Xplorer plus, 12-channel, 50-1000 μL electronic pipette. The plate was tightly sealed with a reusable silicone mat, shaken vigorously for 30 times, and centrifuged (4500 g, 5 min) to completely separate the organic and the aqueous layers. The organic layers (350 μL/well) were transferred to 500 μL vial inserts using Eppendorf Xplorer, 12-channel, 15-300 μL electronic pipette. The inserts were then placed in 2 mL vials and analyzed by GC, GC-MS or normal phase HPLC. For normal phase HPLC analysis, chiral columns that are compatible with organic solvents, including IA, IB-N, IC and IG, were used.
Product formation screening using HPLC-MS. After 7-18 h, the reaction mixtures were quenched by the addition of EtOH or MeCN (800 μL/well). The plate containing the resulting mixture was tightly sealed with a reusable silicone mat, shaken vigorously and centrifuged (4500 g, 5 min) to pellet the cells. The supernatant (300 μL/well) was filtered through an AcroPrep 96-well filter plate (0.2 μm) into a shallow-well plate and analyzed by reverse-phase LCMS.
E. coli (E. cloni BL21 (DE3)) cells carrying plasmid encoding the appropriate P450 variant were grown overnight in 4 mL LBamp. Preculture (3 mL) was used to inoculate 30 mL of HBamp in a 125 mL Erlenmeyer flask. This culture was incubated at 37° C., 230 rpm for 2.5 h. The culture was then cooled on ice for 20 min and induced with 0.5 mM IPTG and 1.0 mM 5-aminolevulinic acid (final concentrations). Expression was conducted at 22° C., 130 rpm, for 18 h. E. coli cells were then pelleted by centrifugation (3000 g, 5 min, 4° C.) using an Eppendorf 5910R tabletop centrifuge. Supernatant was removed and the resulting cell pellet was resuspended in M9-N buffer to OD600=1-60 (usually 5-30). An aliquot of this cell suspension (2 mL) was taken to determine protein concentration using the hemochrome assay after lysis by sonication. When applicable, the remaining cell suspension was further diluted with M9-N buffer to the OD600 used for the biotransformation and the concentration of P450 protein in the biotransformation was calculated accordingly.
E. coli cells expressing heme protein resuspended in M9-N buffer were lysed by sonication using a BioLogics ultrasonic homogenizer (model 150VT) equipped with a stepped microtip (6 min in total, 1 sec on, 1 sec off, 40% amplitude); samples were submerged in wet ice for this process. The resulting lysed solution was centrifuged (15,000 rpm, 10 min, 4° C.) using an Eppendorf microcentrifuge 5425R to remove the cell debris. The supernatant (clarified lysate) was separated from the pellet and kept on ice until use.
In a conical tube, a solution of 0.2 M NaOH, 40% (v/v) pyridine, 0.5 mM K3Fe(CN)6 was prepared (pyridine-NaOH—K3Fe(CN)6 solution). In another 1.5 mL centrifuge tube, a solution of 0.5 M Na2S2O4 (sodium dithionite) was prepared in 0.1 M NaOH. 500 μL of clarified lysate in M9-N buffer and 500 μL of the pyridine-NaOH—K3Fe(CN)6 solution were transferred to a cuvette and carefully mixed. The UV-Vis spectrum of the oxidized FeIII state was recorded immediately. To the cuvette was then added 10 μL of the sodium dithionite solution. The cuvette was sealed with parafilm and the UV-Vis spectrum of the reduced FeII state was recorded immediately. A cuvette containing 500 μL of M9-N, 100 μL 1 M NaOH, 200 μL pyridine, and 200 μL water (complete mixture without protein and K3Fe(CN)6) was used as a reference for all absorbance measurements. Concentrations of cytochromes P450 were determined using a published extinction coefficient for heme b, Σ556(reduced)-540(oxidized)=23.98 mM−1cm−1.
Analytical Scale Biotransformations Using Whole E. coli Cells
Suspensions of E. coli (E. cloni BL21 (DE3)) cells expressing the appropriate heme protein variant in M9-N buffer (typically OD600=30) were kept on ice. In another conical tube, a solution of D-glucose (500 mM in M9-N) was prepared. To a 2 mL vial were added the suspension of E. coli cells expressing P450 (typically OD600=30, 345 μL) and D-glucose (40 μL of 500 mM stock solution in M9-N buffer). This 2 mL vial was then transferred into an anaerobic chamber, where the organic substrate (15 μL of 270 mM stock solution in EtOH) in succession. Final reaction volume was 400 μL; final concentrations were 10 mM substrate and 50 mM D-glucose. (Note: reaction performed with E. coli cells resuspended to OD600=30 indicates that 345 μL of OD600=30 cells were added, and likewise for other reaction OD600 descriptions.) The vials were sealed and shaken at room temperature and 720 rpm for 7-18 h.
Preparative Scale Biotransformation Using Whole E. coli Cells
HBamp (1 L) in a 4 L flask was inoculated with an overnight culture (10 mL, 1% v/v, LBamp) of E. coli (E. cloni BL21 (DE3)) cells containing a pET-22b(+) plasmid encoding the desired enzyme variant. The cell culture was shaken at 37° C. and 230 rpm for ca. 3.5 h (OD600=2-2.5). The culture was placed on ice for 30 min, and 5-aminolevulinic acid (1.0 mM final concentration) and IPTG (0.5 mM final concentration) were added. The culture was allowed to shake for 20 h at 22° C. and 150 rpm. Cells were pelleted by centrifugation (3,000 g, 5 min, 4° C.) using a Thermo Scientific Sorvall Lynx 6000 superspeed centrifuge, resuspended in M9-N buffer and adjusted to OD600=30-40. An aliquot of cells (2 mL) was taken for the hemochrome assay to determine the concentration of the P450 enzyme. Cell suspensions in M9-N buffer were kept on ice until use. (Note: leaving the cell suspension at room temperature for an extended period of time will lead to significantly reduced enzyme activity.)
To a 1 L Erlenmeyer flask equipped with a screw cap were added a suspension of E. coli cells in M9-N buffer expressing the desired enzyme variant and a D-glucose stock solution. The flask was transferred to an anaerobic chamber, where a stock solution of the organic substrate (430 mM in EtOH) were added. The flask was capped, sealed with parafilm, taken out of the anaerobic chamber and allowed to shake in an Eppendorf Innova 44R shaker at room temperature at 200 rpm for 7-18 h.
The reaction mixture was extracted with EtOAc. The mixture was transferred to a 500 mL centrifugation bucket and centrifuged (15000 g, 30 min) using a Lynx 6000 superspeed centrifuge to separate the organic layer from the aqueous layer. The aqueous layer was extracted with EtOAc for an additional 2 cycles. Combined organic layers were dried over MgSO4, concentrated in vacuo with the aid of a rotary evaporator and purified by column chromatography with the aid of a Biotage Isolera.
Stock solutions of authentic products (100 mM in EtOAc) were prepared (at least 20 mg of sample was weighed to make a 100 mM stock solution in EtOAc). 1 L of extraction solvent with internal standard was freshly prepared (1 mmol (168 mg) of 1,3,5-trimethoxybenzene was added to a 1:1 mixture of hexanes and EtOAc (total volume=1 L)). (Note: Use a freshly prepared extraction mixture containing the internal standard due to the volatility of organic solvents. Use of aged extraction mixture will lead to inaccurate results.) To a microcentrifuge tube were added 400 μL of M9-N buffer, 3, 6, 12, 24, 48 μL product stock solution, respectively, and 600 μL extraction solvent. The mixture was vortexed (20 s for 3 times) and centrifuged (20,000 g, 5 min) to separate the organic and aqueous layers. The organic layer was transferred to a vial for normal phase HPLC and/or GC analysis. The calibration curves detailed below plot product concentration in mM (y-axis) against the ratio of the peak area of product to the peak area of internal standard from HPLC/GC analysis (x-axis). In the development of calibration curves, care was taken such that the calibration curve samples were prepared in a way similar to enzymatic samples. Similar practices are common in literature. Furthermore, assessment of the extraction efficiency of internal standard (1,3,5-trimethoxybenzene) revealed that this internal standard could be extracted into the organic phase almost quantitatively. In one example (2p), overlay of calibration curves prepared using this method and a method without taking distribution efficiency into consideration were given below, showing that the difference is minimal.
E. coli (E. cloni BL21 (DE3)) cells carrying plasmid encoding a P450 variant were grown overnight in 10 mL LBamp (37° C., 250 rpm). HBamp (1 L) media in a 4 L flask was inoculated with 10 mL of the preculture and shaken for 3.5 h at 37° C. and 230 rpm. Cultures were cooled on ice (30 min) and induced with 0.5 mM IPTG and 1.0 mM 5-aminolevulinic acid (final concentrations). Expression was conducted at 22° C., 150 rpm, for 20 h. Cultures were then centrifuged (3,000 g, 10 min, 4° C.) using a Thermo Scientific Lynx 6000 superspeed centrifuge and the cell pellets were flash frozen with liquid nitrogen. For purification, frozen cells were resuspended in HisTrap buffer A (25 mM Tris, 100 mM NaCl, 20 mM imidazole, pH 7.5, 2-3 mL/g of cell wet weight) and lysed by sonication using a Branson Digital Sonifier 450 Ultrasonic Processor. To pellet cell debris, lysates were centrifuged using a Lynx 6000 superspeed centrifuge (15,000 g, 30 min, 4° C.). Alternatively, lysates were transferred to multiple 2 mL microcentrifuge tubes and centrifuged using an Eppendorf microcentrifuge (15,000 rpm, 30 min, 4° C.). The protein containing a C-terminal 6×His-tag was purified with a Ni-NTA column (5 mL HisTrap HP, GE Healthcare, Piscataway, NJ) using an AKTA Start protein purification system (GE healthcare). Proteins were eluted on a linear gradient from Histrap buffer A to His-trap buffer B (25 mM Tris, 500 mM imidazole, 100 mM NaCl, pH 7.5) over 10 column volumes (CV). Fractions containing the desired heme protein were combined, concentrated, and subjected to three exchanges of phosphate buffer (100 mM KPi, pH 8.0) using ultracentrifugal filters (10 kDa molecular weight cut-off, Amicon Ultra, Sigma Millipore) to remove excess salt and imidazole. Protein concentration was measured with Nanodrop and normalized to ca. 50 mg/mL. Typically, 1 L of HBamp expression culture provides 150 mg protein. Concentrated proteins were aliquoted, flash-frozen in liquid nitrogen and stored at −80° C. until further use. The concentration of the protein sample is determined by BCA assay using Thermo Scientific's protein BCA assay kit prior to use.
Purified metalloprotein (ca. 50 mg/mL, 100 μL) was thawed and kept on ice. Stock solutions of the substrate (400 mM in EtOH), reductant (NADPH, Na2S2O4, or ascorbic acid, 10-100 mM in an appropriate buffer) were prepared and transferred into an anaerobic chamber. Experiments with heme proteins were performed using 10 μM purified heme protein, 10 mM substrate, 1-10 mM reductant, in M9-N buffer (pH =7.4) or 0.1 M KPi buffer (pH=7.5) at room temperature under anaerobic conditions for 24 h. The total volume of analytical scale reactions using purified metalloenzymes is total volume=400 μL. The reaction mixture was then analyzed by chiral HPLC analysis for the determination of yield, TTN, and enantiomeric ratio (e.r.).
At the outset of this study, a diverse collection of metalloenzymes were evaluated for this stereocontrolled atom transfer radical cyclization. While several enzymes displayed measurable initial activities, only two CYP102A1 variants in the metalloenzyme collection furnished measurable enantioselectivities. A subset of our heme protein screening results is summarized below.
Bacillus
megaterium
Sulfolobus
acidocaldarius
Pseudomonas
putida
Physeter
catodon
Rhodothermus
marinus
Bacillus
megaterium
Bacillus
megaterium
Pseudomonas
savastanoi
Mycobacterium
thermoresistibile
Bacillus subtilis
aActivity screening was performed using whole-cell biocatalysts in a 96-well plate in a Coy anaerobic chamber. Heme proteins were induced with IPTG and ALA. Other metalloproteins were induced with IPTG. When induced with IPTG and ALA, control experiment using E. coli cells carrying an empty pET-22b(+) vector provided 3-4% yield. When induced with IPTG, control experiment using E. coli cells carrying an empty pET-22b(+) vector provided <1% yield.
b Activity screening was performed using purified protein catalysts (ca. 0.1 mol % catalyst loading) in a sealed 2.0 mL vial in a Coy anaerobic chamber. Control experiment under otherwise identical conditions in the absence of the metalloprotein catalyst provided no product.
aAll the whole-cell reactions were run in triplicates and averaged TTNs were described here.
aAll the in vitro reactions using purified P450 variants were run in duplicates and averaged yields and TTNs were described here.
As can be seen from Tables 3 and 4, for all the enzyme variants in the evolutionary lineage, the enantiomeric ratio (e.r.) for whole-cell reactions and purified enzyme reactions were identical, indicating minimal background reaction for E. coli cells overexpressing engineered P450ATRAse. Additionally, the yields of purified enzyme reactions were found to be lower than whole-cell biotransformations, and the total turnover numbers (TTN) of purified enzyme reactions were an order of magnitude lower than those of whole-cell biotransformations. NADPH, Na2S2O4 and ascorbic acid served as effective reducing reagent. In vitro reactions in the presence of NADPH provided slightly higher yields compared to those with Na2S2O4 and ascorbic acid.
aHemin + bovine serum albumin (BSA).
Reduced hemin can catalyze this ATRC reaction with low levels of activity in a racemic fashion. Evolved enzymes P450ATRAse displayed a substantially higher activity under the same conditions. The use of hemin also led to the formation of a higher yield of undesired reduction product.
aAll the whole-cell reactions were run in triplicates and averaged TTNs were described here.
aAll the whole-cell reactions were run in triplicates and averaged TTNs were described here.
aAll the whole-cell reactions were run in triplicates and averaged TTNs were described here.
aOD600 here refers to the OD600 of E. coli cell suspension used for catalysis and not the final OD600 of whole-cell biotransformation. See General Procedure above for details.
Importantly prior work showed that bipy, TMEDA, PMDETA, TPMA, Mestren and other polydentate nitrogen ligand-based copper catalysts could affect ATRC reactions, although in a non-stereoselective fashion.
a1H NMR analysis of the crude reaction mixture.
b HPLC analysis of the crude reaction mixture. As can be seen from these data, d.r. measured by HPLC is consistent with that by 1H NMR analysis.
Synthetic route to radical clock substrate 1p is given below.
Result on hemin-catalyzed ring opening of radical clock substrate 1p is given
The conversion of radical clock substrate 1p to the ring opening product 2p using hemin as the catalyst indicated that the reaction proceeds through a radical-medicated mechanism. Use of protein catalysts provided ca. 2-4% yield of this product.
Subjecting the ATRC bromide product to whole E. coli cells for 3 h resulted in the formation of 43% dehydrohalogenation product, suggesting that such elimination reactions occurs in the presence of E. coli cells in water. For comparison, subjecting this ATRC product to M9-N buffer (pH=7.4) for 3 h led to 33% elimination product, indicating that the presence of E. coli cells may accelerate this elimination reaction. In addition, a direct radical-polar crossover mechanism followed by deprotonation cannot be ruled out to account for the formation of this alkene product.
The colony forming units (cfu) of whole-cell reactions (+ substrate 1a) and controls without the substrate (− substrate 1a) were determined with biological replicates of triplicates using the procedure described below. Six 2 ml screw cap vials containing 380 μl suspension of E. coli cells harboring P450ATRAse1 (OD600=30) were transferred to an anaerobic chamber. To three of these vials were added substrate 1a (10 μL, 400 mM in EtOH). These vials were capped and shaken at 500 rpm in the anaerobic chamber (+ substrate 1a). The remaining three vials were capped and shaken in the absence of substrate (− substrate 1a). After 2.5 h, all six vials were removed from the anaerobic chamber. Aliquots of cell suspension were removed from the vials and subjected to serial dilution to obtain stock solutions of 106-fold dilution. 50 μL of each stock solution was plated on LB (amp) agar plates and incubated at 37°° C. overnight. The cfu of the cell suspensions were calculated based on the colony counts of the 106dilution plate (
General procedure for the synthesis of substrates:
At room temperature, the aldehyde (10 mmol, 1 equiv) and the amine (11 mmol, 1.1 equiv) were added to 40 mL MeOH (0.25 M). This mixture was stirred at room temperature for 3-12 h until GC analysis indicated full conversion of the aldehyde substrate. NaBH4 (16 mmol, 1.6 equiv) was slowly added, and the reaction mixture was allowed to stir at room temperature for 10 min. MeOH was removed in vacuo with the aid of a rotary evaporator. The solid residue was diluted with EtOAc and quenched with 1 M NaOH (aq.). The organic and aqueous layers were separated, and the aqueous layer was extracted with EtOAc for three times. Combined organic layers were washed with brine and dried over MgSO4. Solvent was removed in vacuo with the aid of a rotary evaporator. The crude amine product was directly used for the next step without further purification.
A round bottom flask containing the crude amine was evacuated and backfilled with N2 and this process is repeated for three times. The amine was then dissolved in CH2Cl2 (0.50 M). Et3N (2.0 equiv) was added. At 0° C., α-bromoisobutyryl bromide was slowly added. The reaction mixture was allowed to warm to room temperature, stirred for an additional 1 h, and then quenched by the addition of water. CH2Cl2 was added to dilute the reaction mixture. The organic layer was separated, washed with water and 1 M HCl (aq.), and dried over MgSO4. Solvent was removed in vacuo with the aid of a rotary evaporator. The crude amide product was purified by flash column chromatography with the aid of a Biotage Isolera.
At room temperature, Ethyl bromodifluoroacetate (6 mmol, 1.2 equiv) was added to the amine (5 mmol, 1 equiv) at 0° C. The mixture was allowed to warm to room temperature and stirred for 12 h. The reaction mixture was diluted with 1 M HCl and extracted with EtOAc (3×). The organic layer was separated, combined, and dried over MgSO4. Solvent was removed in vacuo with the aid of a rotary evaporator. The crude amide product was purified by flash column chromatography with the aid of a Biotage Isolera.
(E)-1o was synthesized from commercial crotonaldehyde (predominantly trans-, 1H NMR analysis indicated E/Z=33:1) using General Procedure A described above. The E/Z ratio of the reductive amination product was determined to be 12:1 by 1H NMR analysis.
(Z)-1o is synthesized as follows. At room temperature, propargyl bromide (1.0 equiv.) was added slowly to benzylamine, and the reaction mixture was allowed to stir at room temperature for 20 h. The crude reaction mixture was then purified by flash column chromatography with the aid of a Biotage Isolera (1:1 EtOAc/hexanes, with 1% Et3N) to afford N-benzylbut-2-yn-1-amine as a colorless oil.
To a round bottom flask were added nickel (II) acetate tetrahydrate (2.9 mmol, 0.25 equiv) and MeOH (4 mL). The round bottom was evacuated and backfilled with H2 and this sequence was repeated for a total of three times. At 0° C., a MeOH solution of sodium borohydride (2.9 mmol, 0.25 equiv, 4 mL) was slowly added, and the suspension was stirred at room temperature for 15 min. N-Benzylbut-2-yn-1-amine (11.6 mmol, 1 equiv) and ethylene diamine (5.8 mmol, 0.5 equiv) in MeOH (12 mL) were added and the mixture was stirred under H2 at room temperature. The reaction was allowed to proceed at room temperature for ca. 1 h and monitored by TLC analysis every 10 min after the first hour. (Note: Overreduction of the product will result in the formation of N-butylbenzylamine. Thus, carefully monitoring the progress of the semihydrogenation reaction by TLC analysis is the key to avoid overreduction.) Once the starting material is fully consumed, H2 balloon was immediately removed. Solvent was removed in vacuo with the aid of a rotary evaporator and the solid residue was taken up with EtOAc and washed with water. The aqueous layer was separated and extracted with EtOAc for an additional three times. The organic layers were combined, dried over MgSO4 and concentrated in vacuo with the aid of a rotary evaporator. The crude product was used directly in the next step without purification. The E/Z ratio of the crude amine product was determined to be 1:67. The final acylation step is performed by following General Procedure A.
Racemic products were synthesized on a 0.2-5.0 mmol scale using 30 mol % CuBr and 30 mol % L in 70-80% yield. Cu catalysts composed of L1-L3 provided the desired racemic compound with varying yields.
General Procedure: CuBr (30 mol %), L (30 mol %), and the substrate were charged to a reaction tube. Toluene (0.1 M) was added, and the resulting mixture was subjected to three freeze-pump-thaw cycles. The reaction tube was then placed in an oil bath at 110° C. for 12-24 h. After the reaction went to completion, the crude reaction mixture was directly subjected to column chromatography by elution with hexane to hexane/ethyl acetate (2:1) to give the corresponding product.
Enantioenriched enzymatic products were synthesized by preparative scale enzymatic reaction using the general procedure described above.
A screw-cap vial was charged with (R)-1-benzyl-4-(bromomethyl)-3,3-dimethylpyrrolidin-2-one (59 mg, 1.0 equiv, 0.2 mmol), NaN3 (65 mg, 5.0 equiv. 1.0 mmol), and NaI (150 mg, 5.0 equiv, 1.0 mmol). DMF (1 mL) and H2O (200 μL) were then added to the vial. The reaction mixture was stirred for 16 h at 60° C. The reaction mixture was diluted with 2.5 mL of 1.0 M NaOH (aq.), extracted with EtOAc and dried over MgSO4. Solvent was removed in vacuo and the crude product was purified by Biotage (10 g SNAP cartridge, 0-33% EtOAc/hexanes for 8 CV, then 33% EtOAc/hexanes for 5 CV) to afford the product as a colorless oil (0.2 mmol scale, 42 mg, 71% yield).
A screw-cap vial was charged with (R)-1-benzyl-4-(bromomethyl)-3,3-dimethylpyrrolidin-2-one (59 mg, 1.0 equiv, 0.2 mmol), NaCN (49 mg, 5.0 equiv, 1.0 mmol), and NaI (150 mg, 5.0 equiv, 1.0 mmol). DMF (1 mL) and H2O (200 μL) were then added to the vial. The reaction mixture was stirred for 16 h at 60° C. The reaction mixture was diluted with 2.5 mL of 1.0 M NaOH (aq.), extracted with EtOAc and dried over MgSO4. Solvent was removed in vacuo and the crude product was purified by Biotage (10 g SNAP cartridge, 0-50% EtOAc/hexanes for 8 CV, then 50% EtOAc/hexanes for 5 CV) to afford the product as a white solid (0.2 mmol scale, 47 mg, 79% yield).
A screw-cap vial was charged with (R)-1-benzyl-4-(bromomethyl)-3,3-dimethylpyrrolidin-2-one (30 mg, 1.0 equiv, 0.1 mmol) and potassium ethyl xanthate (1.5 equiv). Acetone (0.5 mL) was then added. The mixture was at room temperature stirred overnight and then concentrated in vacuo. EtOAc was added, and the organic layer was washed with water. The aqueous layer was then extracted with ethyl acetate (3×) and dried with MgSO4. Solvent was removed in vacuo and the crude product was purified by Biotage (10 g SNAP cartridge, 0-50% EtOAc/hexanes for 8 CV, then 50% EtOAc/hexanes for 5 CV) to afford the product as a colorless oil (0.1 mmol scale, 21 mg, 63% yield).
All density functional theory (DFT) calculations were carried out using the Gaussian 16 program (59). Geometries of intermediates and transition states were optimized using the dispersion-corrected (U)B3LYP-D3 functional (60-62) with a mixed basis set of LANL2DZ for Fe and 6-31G(d) for other atoms in the gas phase. Vibrational frequency calculations were performed for all stationary points to confirm if each optimized structure is a local minimum or a transition state structure. All optimized transition state structures have only one imaginary (negative) frequency, and all minima (reactants, products, and intermediates) have no imaginary frequencies. The (U)B3LYP-D3 functional with a mixed basis set of LANL2TZ(f) (63) for Fe and 6-311+G(d,p) for other atoms was used for single-point energy calculations in solution. Solvation energy corrections were calculated in chlorobenzene (ε=5.6968) solvent with the SMD continuum solvation model (64) based on the gas-phase optimized geometries. The solvent environment was chosen based on recommendations from previous computational studies of enzymatic reactions to mimic the relative permittivity in enzyme active sites (65, 66). The C—Br and Fe—Br bond dissociation enthalpies (BDEs) at 298 K were calculated at the same level of theory [(U)B3LYP-D3/LANL2TZ(f)-6-311+G(d,p)/SMD//(U)B3LYP-D3/LANL2DZ/6-31G(d)]. Gibbs free energies were calculated at the standard conditions (298 K, 1 M solution). Truhlar's quasi-harmonic corrections (67) were applied for vibrational entropy calculations using 100 cm−1 as the frequency cutoff using GoodVibes (68). It should be noted that entropic corrections were calculated using gas-phase formulas. Although this is a standard practice in computing organic reaction energy profiles, the entropic components in the bimolecular reactions are expected to be overestimated, because the translational and rotational entropies of substrates in the enzyme active site are much less than those in the gas phase. Accurate calculations of entropies in enzyme catalysis require extensive molecular dynamics simulations at different temperatures (69), which is beyond the scope of the present study. It should be noted that the key objectives of the present computational study, including the spin states of the Fe catalyst and intermediates, geometries of the halogen atom abstraction transition states, thermodynamics of the halogen atom abstraction, and the endo/exo selectivity, are not expected to be affected by entropic effects. 3D images of optimized structures were prepared using CYLView and POV-Ray.
The explicitly correlated local coupled cluster method, PNO-LCCSD(T)-F12, was used as references to benchmark DFT methods. The PNO-LCCSD(T)-F12 calculations were performed using Molpro 2020.2 (70, 71). In the PNO-LCCSD(T)-F12 calculations, a mixed basis set of aug-cc-pwCVTZ for Fe, ECP10MDF effective core potential along with aug-cc-pwCVTZ-PP for Br, and VDZ-F12 for other atoms. Def2-TZVPP/JKFIT (Fe), aug-cc-pVTZ/JKFIT (Br), and AVTZ/JKFIT (other atoms) were used as the fitting auxiliary basis to compute the Fock matrix and as the RI basis set. For the density fitting for other calculations of all other 2-electron integrals, the aug-cc-pwCVTZ/MP2FIT (Fe), aug-cc-pVTZ/MP2FIT (Br), and AVDZ/MP2FIT (other atoms) basis sets were used. These basis sets are similar to those used by Werner et al. in recent computational studies (72, 73). In the PNO-LCCSD(T)-F12 calculations of the Fe complex, four outer core orbitals (Fe, 3s3p) were correlated (73). T1/D1 diagnostic values were calculated in all PNO-LCCSD(T)-F12 calculations in the benchmark study to evaluate the multireference character of these complexes. According to the criteria recommended by Wilson et al. for 3d transition metals, T1 values smaller than 0.05 and D1 values smaller than 0.15 would suggest that the single reference PNO-LCCSD(T)-F12 method is adequate to describe the molecular ground state (74). The computed T1/D1 diagnostic values of most complexes in the present study are smaller than the recommended thresholds (see later for details), indicating that the multireference character of these complexes is not important, and thus the coupled cluster method should predict reliable energetics.
In the DFT and PNO-LCCSD(T)-F12 calculations, porphine was used as a model for the porphyrin ligand and methoxy (OMe) was used as the model for the axial serine ligand. Similar model systems have been used in previous computational studies of heme-containing enzymes (75-77). For each Fe complex and transition state, three different spin states were calculated (high-spin, quintet or sextet; intermediate-spin, triplet or quartet; and low-spin, doublet or singlet). Wavefunction stability tests were carried out with the “stable=opt” keyword to ensure all the open-shell calculations adopted stable wavefunctions. The stability test suggested that the closed-shell singlet wavefunctions of Fe complexes are unstable. All singlet Fe complexes in this study are open-shell singlet without α/β symmetries.
The performance of DFT methods for spin-state energies of iron complexes, including model iron-heme complexes, has been extensively studied (78, 79). Here, we performed benchmark calculations of the Fe(II) complex 4 in singlet (14), triplet (34), and quintet (54) spin states and the Fe(III)□Br complex 6 in doublet (26), quartet (46), and sextet (66) spin states to identify suitable DFT methods for energetics of the iron-porphyrin complexes in the present system (Table S12). To obtain accurate single-point energies as references, we performed the explicitly correlated local coupled-cluster calculations (PNO-LCCSD(T)-F12) using the B3LYP-D3-optimized geometries. All benchmark calculations were performed in the gas phase.
PNO-LCCSD(T)-F12 calculations predicted that the high-spin state (quintet) of 4 is highly favorable. The quintet Fe(II) catalyst 54 is 22.1 kcal/mol more stable than the triplet 34 and 23.5 kcal/mol more stable than its singlet 14. The Fe(III)□Br complex prefers the low-spin doublet state in the gas phase, although the sextet (66) is only 3.0 kcal/mol higher in energy than the doublet (26). The quartet (46) is 12.3 kcal/mol less stable than the sextet 66. All DFT methods tested correctly predicted the high-spin quintet state of 4 is the most stable. However, only (U)B3LYP-D3, (U)OLYP (80, 81), and (U)MN15 (82) correctly predicted the lowest-energy spin state for 6. (U)OPBE (83, 84) and (U)ωB97X-D (85) predicted the high-spin sextet state is slightly more stable than the doublet. (U)M06 (86) significantly overestimated the stability of the high-spin sextet state of 6. We chose (U)B3LYP-D3 in the following DFT calculations because it predicted the correct spin states for both 4 and 6 and provided the best agreement with the PNO-LCCSD(T)-F12 spin-state energies (i.e. with the smallest mean absolute error, MAE).
aAll energies are in kcal/mol. For 4, ΔELH = Equintet − Esinglet, ΔEIH = Equintet − Etriplet. For 6, ΔELH = Esextet − Edoublet, ΔEIH = Esextet − Equartet, ΔΔE = ΔEDFT − ΔEPNO-CCSD(T)-F12. Geometries were optimized at the (U)B3LYP/6-31G(d)-LANL2DZ level of theory in gas phase at the corresponding spin state.
The T1/D1 diagnostic values of all Fe complexes are below the threshold values (T1<0.05, D1<0.15), except for 26 where the D1 value (0.183) is slightly higher than the threshold (Table S2). The small T1/D1 diagnostic values of these iron complexes indicate their multireference character is not important, and thus the PNO-LCCSD(T)-F12 results are expected to be reliable as the energy references to benchmark DFT methods.
14
34
54
26
46
66
In addition to the inner-sphere electron transfer (i.e. bromine atom abstraction) pathway, transition metal-mediated carbon-halogen bond activation of alkyl halides may also occur via three-membered oxidative addition, SN2-type of oxidative addition, or outer-sphere single electron transfer. Because the Fe(II)-porphine complex has only one available coordination site to interact with the sterically crowded tertiary alkyl bromide substrate, the concerted three-membered oxidative addition and the SN2-type of oxidative addition are unlikely to take place. The modified Marcus theory was used to estimate the barriers for the outer-sphere single electron transfer pathway. Geometry optimization of the radical anion of the alkyl bromide substrate 1a leads to direct dissociation of the C—Br bond to form an alkyl radical and a bromide anion, indicating that the radical anion of 1a is unstable and the outer-sphere electron transfer would lead to separated alkyl radical and bromide anion in a concerted process, known as dissociative electron transfer (DET) or concerted outer-sphere electron transfer (OSET-C). Therefore, Saveant's modified Marcus theory was used to calculate the activation free energy of this dissociative electron transfer pathway.
In the DET pathway (
where ΔG0‡ is the the intrinsic barrier; Δ=−4.2 kcal/mol is the DFT-calculated reaction energy of the DET process; Dp is the interaction energy between 7 and Br− in the solvent cage, because there is no available experimental data of Dp (usually a small value close to 0), Dp=0 kcal/mol is used in our calculation (91); BDE1a=56.4 kcal/mol is the bond dissociation enthalpy of the C(sp3)-Br bond of 1a; l0 is the solvent reorganization energy (92, 93) that can be calculated as follows:
where nD=1.52 and εs=5.6968 are refractive index and dielectric constants of chlorobenzene solvent, respectively. ε0 is the vacuum permittivity. rD and rA are the hard-sphere radii of the electron donor (54, 5.7 Å) and acceptor (1a, 4.2 Å), respectively, calculated using the “volume” keyword in Gaussian 16.
Thus,
It should be noted that because the “solvent reorganization energy” λ0 was calculated using the model Fe-porphyrin catalyst in a non-polar solvent (εs=5.6968), this value may not represent the exact reorganization energy in an enzymatic environment. Calculating reorganization energies in enzymatic reactions requires the calculations of free energy curve on the diabatic product energy surface, which is often calculated using QM/MM. Although these calculations are beyond the scope of the present study, it should be noted that reorganization energies in engineered enzymes are often higher than those in wild-type enzymes and thus may lead to higher barriers to the electron transfer.
These results suggest that the DET pathway requires 14.4 kcal/mol activation free energy with respect to 54 and 1a. Previous computational studies indicated that the DET pathway in the reaction of Cul/TPMA catalyst and α-bromoisobutyrate requires a much higher barrier (>23 kcal/mol) and the ISET (inner-sphere single electron transfer, a.k.a. halogen atom abstraction) pathway with the Cu catalyst is comparable to the model Fe catalyst in the present study. Therefore, these results indicate that DET may be another viable pathway for the Fe catalyst, while with the Cul/TPMA catalyst, the reaction can only proceed via the ISET pathway due to the high DET barrier. Although it is challenging to directly compare the kinetics of the DET and ISET pathways in the present study because the barriers were calculated using different theories (Marcus theory vs transition state theory), the computational results so far suggest that the C—Br bond cleavage step is best described as an ISET process due to the following reasons. First, the computed barrier for ISET (5TS1, ΔG‡=17.7 kcal/mol) was based on a model Fe catalyst in the absence of the enzyme environment (75). This barrier includes the 4.1 kcal/mol free energy required to form the substrate complex (5). In the enzyme-catalyzed reactions, the substrate binding to form reactant complex is expected to be more favorable, despite the low concentration of the enzyme catalyst. The ISET TS (5TS1) is only 13.6 kcal/mol higher than the substrate complex 5, indicating a faster reaction rate under saturation kinetics when 5 is the resting state prior to the C—Br bond cleavage. Second, the DET barrier calculated in a non-polar solvent environment may be lower than the barrier in an enzymatic environment because the reorganization energies are often relatively high in engineered enzymes (see above). Lastly, the ISET and DET pathways actually share many features—they both involve concerted electron-transfer and C—Br bond cleavage and they form the same products (6 and 7). The only difference is whether there is an inner-sphere interaction between the Br atom and the Fe center during the C—Br bond cleavage. Because the Fe catalyst is relatively rigid and has an empty coordination site, the Br—Fe interaction is expected to be thermodynamically favorable. The binding of the alkyl bromide substrate 1a to the Fe(II) catalyst (4) to form 5 is exothermic (ΔH=−7.3 kcal/mol). Distortion/interaction model analysis revealed that the interaction energies between 1a and 4 are −9.8 and −14.5 kcal/mol in the reactant complex 5 and the ISET transition state (TS1), respectively. On the other hand, the distortion energy of the Fe-porphine complex is small (ΔEdis=0.2 and 6.3 kcal/mol in 5 and TS1, respectively). These results suggest that the inner-sphere Br—Fe coordination is energetically favorable due to favorable interaction energies and relatively small distortion of the Fe-porphine complex. It should be noted that one scenario that could promote the DET pathway is the stabilization of the bromide anion via electrostatic or hydrogen bonding interactions with active site residues. This possibility is being investigated in our laboratories using molecular dynamics simulations. Taken together, the computational studies in the present study demonstrated that the Fe-porphine complex is a competent catalyst to promote C—Br bond cleavage to generate alkyl radical intermediate. This process may occur via either an inner-sphere single electron transfer (ISET) or an outer-sphere dissociative electron transfer (DET) mechanism, depending on whether the Fe center or the active site residues are more effective in stabilizing the bromide anion. The calculations indicate that the interaction between Br and the Fe center is energetically favorable and requires small distortion of the Fe catalyst, supporting an ISET mechanism. However, the DET pathway cannot be completely ruled out and warrants further studies using molecular dynamics simulations. Because the ISET and DET pathways share many mechanistic features and lead to the same products after the C—Br bond cleavage, the uncertainty of the C—Br bond cleavage mechanism will not affect the key conclusions discussed in the present study, including the spin-states of the Fe(II) catalyst and the Fe(III) intermediate, the thermodynamics of the C—Br bond cleavage/formation steps, factors promoting the radical initiation, and the exolendo selectivity in the cyclization step.
Both the Z and E rotamers of the amide substrate 1a and the alkyl radical intermediate 7 were calculated (
The conformational space for transition states and intermediates were sampled using the conformer-rotamer ensemble sampling tool (CREST) program that uses the semiempirical tight-binding based quantum chemistry methods GFN2-xTB to perform metadynamic sampling (MTD) of conformers. iMTD-GC (iterative metadynamic sampling with genetic crossing approach) was used where an iterative root mean square deviation (RMSD) based metadynamic sampling is performed with an extra genetic z matrix crossing (GC) step at the end. The conformational sampling of stationary points was performed in the corresponding spin states. For ground state conformational sampling, no constrains were applied. For transition states sampling, the corresponding forming/breaking bonds were constrained at the transition state geometry obtained from DFT-optimized geometries with a force constant of 0.5 Hartree/Bohr2. The structural similarity was then assessed by the RMSD threshold (1 Å) and energy threshold (10 kcal/mol). The structures obtained from conformational sampling were further optimized at the DFT level.
The CREST/xTB conformational sampling revealed a large conformational space for the transition states and intermediates. The overlays of resulting low-energy conformers (within 3.5 kcal/mol relative to the lowest-energy conformer at the DFT level) of the bromine atom abstraction transition state and the Br rebound transition state are shown in
Following the same protocol, other spin states of transition states TS1 and TS3 were investigated (
To understand the origin of the effectiveness of the Fe-porphyrin catalyst in promoting the ATRC reaction, its electronic properties were directly compared with those of the Cu(TPMA)+ catalyst, which is highly active in the alkyl halide activation in ATRP (kact=4500 with ethyl 2-bromoisobutyrate in acetonitrile). The HOMO energies (EHOMO) of both catalysts and the M(metal)-Br bond dissociation enthalpies [BDE(M-Br)] of the metal bromide intermediates at the (U)B3LYP-D3/6-311+G(d,p)-LANL2TZ (f)/SMD(chlorobenzene)//(U)B3LYP-D3/6-31G(d)-LANL2DZ level of theory were calculated. Those electronic parameters were reported to be the most important descriptors for the prediction of kact in Cu-ATRP. Although the bromine atom abstraction with Fe-porphine is thermodynamically less favorable by 4.8 kcal/mol due to the weaker Fe(III)-Br bond than the Cu(II)-Br bond, the higher HOMO energy kinetically promotes the ISET with the Fe(II) porphyrin complex. Overall, these electronic parameters are consistent with the high reactivities of both the Fe-porphine and Cu(TPMA)+ catalysts in promoting bromine atom abstraction.
The Gibbs free energy of reaction was calculated for the DET pathway (ΔGDET) with the Fe-porphine and Cu(TPMA)+ catalysts, respectively. The DET pathway is highly endergonic (ΔGDET=44.7 kcal/mol) with the Cu(TPMA)+ catalyst in the non-polar solvent. The Cu(TPMA)+ catalyst is known to have a low reactivity in promoting DET in polar solvents. These results indicate Cu(TPMA)+ is even less reactive in DET in a less polar environment. On the other hand, the DET process with Fe-porphine is exergonic (ΔGDET=−4.2 kcal/mol), suggesting a thermodynamically more viable DET pathway in the reaction with the Fe-porphine catalysts.
54
This application claims priority to U.S. Provisional Application Ser. No. 63/228,562, entitled “Compositions, Systems and Methods for Atom Transfer Radical Addition Reaction,” filed Aug. 2, 2021, which is herein incorporated by reference in its entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2022/074439 | 8/2/2022 | WO |
Number | Date | Country | |
---|---|---|---|
63228562 | Aug 2021 | US |