This application contains a Sequence Listing which has been submitted electronically and is hereby incorporated by reference in its entirety. The XML document as filed herewith was originally created on 18 Sep. 2023. The XML document as filed herewith is named NREL 21-63.xml, is 86 kilobytes in size and is submitted with the instant application.
Deciphering the mechanisms of bacterial fatty acid biosynthesis is crucial for both the engineering of bacterial hosts to produce fatty acid-derived molecules and the development of new antibiotics. However, gaps in our understanding of the initiation of fatty acid biosynthesis remain. Here, we demonstrate that the industrially relevant microbe Pseudomonas putida KT2440 contains three distinct pathways to initiate fatty acid biosynthesis. The first two routes employ conventional β-ketoacyl-ACP synthase III enzymes, FabH1 and FabH2, that accept short- and medium-chain-length acyl-CoAs, respectively. The third route utilizes a malonyl-ACP decarboxylase enzyme, MadB. A combination of exhaustive in vivo alanine-scanning mutagenesis, in vitro biochemical characterization, X-ray crystallography, and computational modelling elucidate the presumptive mechanism of malonyl-ACP decarboxylation via MadB. Given that functional homologs of MadB are widespread throughout domain Bacteria, this ubiquitous alternative fatty acid initiation pathway provides new opportunities to target a range of biotechnology and biomedical applications.
Polyhydroxyalkanoates (PHAs) are chiral biopolymers that are naturally accumulated in some bacterial organisms as a mechanism of carbon storage. Due to their natural biodegradability and structural properties, significant investments have been made to develop commercial materials based on these biopolymers. PHAs have been successfully commercialized to replace some high-density polyethylene and other hard plastics. However, despite significant research, a PHA replacement for low density polyethylene (LDPE), the most highly consumed plastic, has been unsuccessful due to the inability to mimic LDPE physical properties. Branched-chain PHAs have the promise to more precisely mimic the properties of LDPE, but their production has never been demonstrated from glucose.
In an aspect, disclosed herein are non-naturally occurring P. putida sp. comprising a non-naturally occurring gene encoding for malonyl-ACP decarboxylase wherein the gene has greater than 70% sequence identity to the nucleotide sequence of PP_0262 from P. putida KT 2440.
In an aspect, disclosed herein are genetically engineered Pseudomonas useful for the degradation of polyhydroxyalkanoates.
In an aspect, disclosed herein is a non-naturally occurring Pseudomonas sp. comprising a non-naturally occurring gene encoding for malonyl-ACP decarboxylase (MadB) wherein the gene (madB) has greater than 70% sequence identity to the nucleotide sequence of PP_0262 from P. putida KT 2440. In an embodiment, the non-naturally occurring Pseudomonas sp. is Pseudomonas sp. is P. putida KT 2440.
In an aspect, disclosed herein is a method for initiating fatty acid biosynthesis in P. putida sp. comprising overexpressing a non-naturally occurring gene encoding for malonyl-ACP decarboxylase (MadB) wherein the gene (madB) has greater than 70% sequence identity to the nucleotide sequence of PP_0262 from P. putida KT 2440.
In an aspect, disclosed herein is a non-naturally occurring Pseudomonas sp. useful for the production of branched-chain polyhydroxyalkanoates, branched-chain 3-hydroxyacids (BCHA) (PHA monomers), and branched-chain fatty acids (BCFA) comprising an overexpressed gene selected from the group consisting of sfabH2, BKD (lpdV-bkdAA-bkd-AB-bkdB), lplA, fadR, phaC, phaG, alkK, acc, alsS-ilvCD, and leuAmodBCD. In an embodiment, the non-naturally occurring Pseudomonas sp. is P. putida KT 2440. In an embodiment, the gene is sfabH2. In an embodiment, the gene is BKD (lpdV-bkdAA-bkd-AB-bkdB). In an embodiment, the gene is lplA. In an embodiment, the gene is fadR. In an embodiment, the gene is phaC. In an embodiment, the gene is phaG. In an embodiment, the gene is alkK. In an embodiment, the gene is acc. In an embodiment, the gene is alsS-ilvCD. In an embodiment, the gene is leuAmodBCD.
Other objects, advantages, and novel features of the present invention will become apparent from the following detailed description of the invention when considered in conjunction with the accompanying drawings.
Pseudomonas putida KT2440 contains three distinct pathways to initiate fatty acid biosynthesis. The first two routes employ conventional β-ketoacyl-ACP synthase III enzymes, FabH1 and FabH2, that accept short- and medium-chain-length acyl-CoAs, respectively. The third route utilizes a malonyl-ACP decarboxylase enzyme, MadB. A combination of exhaustive in vivo alanine-scanning mutagenesis, and in vitro biochemical characterization, elucidate the presumptive mechanism of malonyl-ACP decarboxylation via MadB. Given that functional homologs of MadB are widespread throughout domain Bacteria, this ubiquitous alternative fatty acid initiation pathway provides new opportunities to target a range of biotechnology and biomedical applications.
Polyhydroxyalkanoates (PHAs) are biodegradable polymers many bacteria produce and accumulate as intracellular granules for the purpose of carbon storage. The composition of the PHA synthesized by a given bacterium depends on the monomers it produces and the specificity of its PHA synthase, the enzyme that polymerizes these monomers, both of which can be manipulated using synthetic biology. PHAs with different compositions have different physical properties that make them suitable for different applications. Owing to this and their inherent biodegradability, PHAs have been of commercial interest for decades.
Prior PHA properties have not met the criteria to replace low density polyethylene. By incorporating a branched-chain PHA, we may improve the physical properties of PHAs to serve as direct, biodegradable replacements for LDPE.
The ability to use BCPHA as a LDPE replacement was determined based on the prior characterization of BCPHA by others. However, others first synthesized the branched-chain fatty acid, then fed this fatty acid to the cell, at which point, the cell polymerized the supplemented BCFA into BCPHA. BCFA are incredibly expensive to purchase and difficult to chemically synthesize. Disclosed herein are at least one efficient biological route that can convert a biomass-derived feedstock into branched-chain PHAs, branched-chain 3-hydroxyacids, and BCFA. Prior to methods and compositions disclosed herein, the production of branched-chain 3-hydroxyacids and BCPHA has never been demonstrated from a sugar carbon source.
Pipelines to enable the production of branched-chain polyhydroxyalkanoates, branched-chain 3-hydroxyacids (BCHA) (PHA monomers), and branched-chain fatty acids (BCFA) in Pseudomonas putida KT2440 have been developed. All of the branched-chain molecules enabled by this platform enable novel chemistries that are not accessible via the existing paradigm which is limited to straight-chain molecules. These pathways may also be applied to other production hosts, as the pathways are modular. Pseudomonas putida KT2440 has been extensively validated as a host capable of producing high dry cell weight percentages of polyhydroxyalkanoates, with a variety of functional groups, including branched-chains. Disclosed herein is an organism engineered to accumulate branched-chain 3-hydroxy-acyl-CoAs, which can be incorporated into polymerized BCPHAs by the native PHA synthase PhaC. The 3-hydroxyacid monomers can also be produced as monomers without the presence of PhaC.
Fatty acid biosynthesis in P. putida KT2440 can be used for the production of BCPHA by incorporating a branched-chain production module. Branched-chain-acyl-ACPs undergoing elongation in the Fatty Acid Synthase II complex can be diverted to PHA biosynthesis by expression of the native PhaG, which transacylates the branched 3-hydroxyacyl-ACP to a branched-chain 3-hydroxyacl-CoA which is then the substrate for the PHA polymerase PhaC (
Genes required for expression are described in Table 1, organized by module pictured in
In the upstream Module 1, in addition to overexpression of the isoleucine and leucine biosynthetic genes under constitutive synthetic promoters, a feedback resistant mutant of LeuA (leuAmut, G562D) will be expressed to avoid negative autoregulation of the pathway. Previous work has additionally determined that expression of the branched-chain alpha-ketoacid dehydrogenase limits its own lipoylation and the lipoylation of other 2-oxoacid dehydrogenases, particularly pyruvate dehydrogenase. Expression of the lipoyl ligase A (lplA) can complement a lipoylation defect and concomitantly enhance branched-chain fatty acid production. In order to maximize BCPHA production, additional genes must be deleted to remove negative regulation of PHA biosynthesis, prevent beta-oxidation, and to direct fatty acid biosynthesis strictly towards branched-chain structures. Base strains AG2228 and AG2162 engineered for general PHA accumulation will be modified for BCPHA accumulation. These base strains are designed to limit depolymerization and negative regulation of PHA biosynthesis, along with overexpression of the PHA biosynthetic genes. The full genotypes for each strain are listed in Table 2.
The same modules can be applied to produce branched-chain fatty acids (BCFA), as well as branched-chain 3-hydroxyacid monomers (BCHA). BCPHAs and fatty acids are derived from fatty acid biosynthesis, so the same modules can be applied to produce both BCFA and BCHA.
To that end, to achieve FFA production in P. putida, a thioesterase can be expressed which releases the nascent fatty acyl-ACP from the fatty acid elongation machinery, forming a free fatty acid. We have chosen the highly efficient thioesterase from E. coli, tesA, ensuring that this protein will remain cytosolic where its activity is required. Further, we tested KT2440 expressing 'tesA (GB138), and two strains optimized for PHA production also transformed with the 'tesA (GB136: P. putida KT2440 ΔphaZ ΔfadB1-fadA1 ΔfadB2-fadA2-fadE2 pBTL2-'tesA and GB137: P. putida KT2440 ΔphaZ ΔfadB1-fadA1ΔhdhA::pTac-phaG-alkK-phaC1-phaC2 ΔfadB2-fadA2-fadE2 pBTL2-'tesA).
Expression of a thioesterase in the above strains of P. putida KT2440 enabled the production of FFA, with strain GB136 producing the highest titer of FFA, under a nitrogen replete condition (
BCHA production can be achieved by expression of a medium-chain-specific thioesterase, tesB, as well as overexpression of PhaZ, the PHA depolymerase, in addition to the deletion of PhaC and PhaG, the PHA polymerases. In combination with the branched-chain upstream modules, these modifications enable the production of BCHA, monomers with significant downstream platform chemical potential. Again, the branched-chain enables novel chemistries as the terminal methylation dramatically alters the physical properties of the monomer, as compared to a straight-chain monomer.
sfabH2
Bacillus subtilis
Bacillus subtilis
Escherichia coli
Escherichia coli
P. putida
P. putida
B. subtilis, E. coli
E. coli
In an embodiment, disclosed herein are methods and compositions of matter comprising non-naturally occurring and naturally occurring P. putida KT2440 that harbors three pathways for the initiation of fatty acid biosynthesis. Two of these pathways utilize β-ketoacyl-ACP synthase III enzymes, FabH1 and FabH2, that are specific to short and medium-chain length acyl-CoAs, respectively. The third pathway proceeds through the decarboxylation of malonyl-ACP to acetyl-ACP as catalyzed by a hotdog fold protein, MadB. Of the three pathways, FabH1 and MadB are the primary fatty acid initiation factors in P. putida KT2440. DFT calculation suggests that MadB catalyzes its reaction by stabilizing the C3-carbonyl moiety of the substrate.
Functional homologs of MadB are prevalent in the domain Bacteria and may serve as a novel target for biotechnology and biomedical applications.
The production of membrane lipids is an essential process for all known forms of cellular life. Except for Archaea, which synthesize membrane lipids from isoprenoids, bacteria, plants, and animals utilize fatty acids for membrane production. In animals, the suite of reactions required for fatty acid biosynthesis are performed by a type I fatty acid synthesis (FASI) pathway. Canonical FASI consists of a single polypeptide chain that folds to form several catalytic domains capable of iteratively elongating the covalently attached acyl chain. In contrast, most bacteria have a type II fatty acid synthesis (FASII) pathway, which is given by a distributed system of enzymes that catalyze each step of this biosynthetic process. The modular nature of FASII, in part, underlies the diversity of fatty-acid derived products bacteria can generate.
FASII can be broadly partitioned into three sets of reactions: initiation, elongation, and termination (
More recently an alternate KAS-independent FASII initiation pathway was discovered in E. coli in the form of malonyl-ACP decarboxylase (Mad, also referred to as YiiD). The Mad system was first identified in ΔfabH strains which exhibiting growth and colony morphology defects, yet remained viable, thereby suggesting the presence of an FabH-independent initiation pathway. The catalytic domain of Mad is housed by a hot-dog fold (HDF) domain that may exist as a fusion with a Gcn5-related N-acetyl transferase (GNAT) domain (MadA), such as that discovered in E. coli, or as a stand-alone protein (MadB). Ultimately, the Mad system produces acetyl-ACP that can enter the elongation step via condensing enzymes such as of FabB/F.
Despite discoveries in model bacterial species such as E. coli and P. aeruginosa, there are significant gaps in our understanding of fatty acid biosynthesis in non-traditional model organisms. For example, although numerous publications reported the engineering of the industrially relevant microbe Pseudomonas putida KT2440 to produce polyhydroxyalkanoates, a fatty acid-derived product, a comprehensive, mechanistic understanding of how fatty acids are produced in this species is lacking and thus preventing rational engineering of the system. To that end, here we elucidate the three pathways for the initiation of fatty acid biosynthesis in P. putida. We find that P. putida harbors two KASIII enzymes, FabH1 (PP_4379) and FabH2 (PP_4545), that accept short- and medium-chain-length acyl-CoAs, respectively. We also demonstrate that ΔfabH1 ΔfabH2 double mutants are not only viable but exhibit wild type-like growth rates due to an undescribed gene, PP_0262, that we identified independently in a forward-genetic screen. Biochemical and phylogenetic characterization of PP_0262 established that it is a MadB enzyme. Whole-protein alanine-scanning mutagenesis, isothermal titration calorimetry (ITC), X-ray crystallography, and structural and computational studies elucidate the putative mechanism of action of MadB. In addition, we demonstrate that functional homologs of MadB are present in a variety of bacteria, highlighting the widespread nature of this enzyme in fatty acid biosynthesis.
Material & Methods
Plasmid construction. Primers used in this study are described in Table 4. Plasmids used in this study are described in Table 5. We used Phusion® High-Fidelity PCR Master Mix (NEB) for all polymerase chain reactions. NEBuilder HiFi DNA Assembly Master Mix (NEB) was used for plasmid construction followed by transformation into chemically competent NEB 5-α F′Iq E. coli. Golden Gate assembly was utilized for the generation of CRISPR guide RNA-expressing plasmids. Transformants were selected on plates made with LB Broth with agar (Miller; Sigma-Aldrich) supplemented with kanamycin (5011 g/mL), carbenicillin (10011 g/mL), apramycin (5011 g/mL), or spectinomycin (5011 g/mL) where appropriate and grown overnight at 37° C. Resulting constructs were confirmed by Sanger sequencing (GENEWIZ, Inc.).
coli acpP into pETDuet-1
coli acpP into pETDuet-1
coli acpS into pETDuet-1
coli acpS into pETDuet-1
putida KT2440
putida KT2440
putida KT2440
E. coli fabH with CRISPR-
E. coli fabH
E. coli fabY
E. coli 6xHis-AcpP
E. coli 6xHis-AcpP and AcpS
E. coli fabH and template for
Acaryochloris marina
Arcobacter butzleri
Arthrobacter stackebrandtii
Corallococcus coralloides
Parachlamydia
acanthamoebae
Bdellovibrio bacteriovorus
Marispirochaeta aestuarii
Klebsiella pneumoniae
Chlorobaculum tepidum
Clostridium beijerinckii
Prosthecobacter fusiformis
Fimbriimonas ginsengisoli
Rickettsiales endosymbiont of
Stachyamoeba lipophora
Ktedonobacter racemifer
Salmonella enterica
Neisseria sicca
Mariprofundus ferrooxydans
Aeromonas hydrophila
Pseudomonas syringae
Sebaldella termitidis
parahaemolyticus
Salinibacter ruber
E. coli yiiD with CRISPR-Cas9
Strain construction. CRISPR-Cas9-mediated deletions and replacements in E. coli MG1655 were carried out as previously described. P. putida KT2440 genomic deletions were carried out using a selection (nptII, kanamycin), counterselection (sacB, sucrose) approach outlined previously, utilizing plasmid pK18sB. Genomic modifications were confirmed by colony PCR using MyTaq™ HS Red Mix (Bioline). Proper counterselection in P. putida was further verified by ensuring strains were kanamycin-sensitive. Strains used in this study are described in Table 6.
E. coli MG1655
E. coli MG1655 was transformed with pCas (Jiang, Y et al.
P. putida KT2440
P. putida KT2440
P. putida KT2440
P. putida KT2440
E. coli MG1655
E. coli MG1655 was transformed with pCas (Jiang, Y et al.
P. putida KT2440
P. putida KT2440
E. coli MG1655
P. putida KT2440
P. putida KT2440
Microplate reader experiments. Microplate reader experiments were performed by inoculating seed cultures from glycerol stocks into 10 mL of LB Broth (Miller; Sigma-Aldrich) with appropriate antibiotics in a 125 mL flask and incubating overnight at 30° C. (P. putida) or 37° C. (E. coli) and 225 rpm. Overnight cultures were then used to inoculate a second seed culture into 10 mL of LB Broth (Miller; Sigma-Aldrich) with appropriate antibiotics in a 125 mL flask starting at an optical density of 0.2 measured at 600 nm (0D600) using a GENESYS™ 140 Visible Light Spectrophotometer (Thermo Scientific). The second seed cultures were incubated at 30° C. (P. putida) or 37° C. (E. coli) at 225 rpm for 2-4 hrs until an OD600 of approximately 2 was reached. The second seed cultures were centrifuged at 8,000 rpm for 3 min, the pellets were washed with 1× M9, minimal salts (Sigma-Aldrich) three times, and resuspended to an OD600=3.0. Cells (10 μL) were transferred to five replicate wells in a 100-well Honeycomb 2 plate (Growth Curves USA) containing 290 μL of growth medium, corresponding to initial OD600=0.1. E. coli strains were grown in LB Broth (Miller; Sigma-Aldrich) at 37° C. and supplemented with 50 μg/mL kanamycin and 0.1 mM isopropylthio-β-galactoside (IPTG; GoldBio) when appropriate. P. putida strains were grown in modified M9 minimal media containing 6.78 g/L Na2HPO4, 3 g/L KH2PO4, 1 g/L NH4C1, 0.5 g/L NaCl, 2 mM MgSO4, 100 μM CaCl2, and 2 g/L glucose at 30° C. For fatty acid supplementation experiments, sodium salts of a given fatty acid (Sigma-Aldrich) were added to a final concentration of 100 μg/mL. OD600 measurements were acquired every 15 min with a Bioscreen C Pro (Growth Curves USA) and plate was shaken continuously. Maximum specific growth rates were calculated using a spline model (https://github.com/scott-saunders/growth_curve_fitting).
Construction and screening of P. putida genomic library. A derivative of pBTL-2 (Prior et al., 2010) was constructed that lacked a BamHI restriction site (pKM037), which was further modified to contain a new BamHI site downstream of the pLac promoter (pKM062). pKM062 was subjected to a complete digestion with BamHI-HF (NEB), treatment with Antarctic Phosphatase (NEB), and the resulting linear fragment was isolated with the GeneJET Gel Extraction Kit (ThermoFisher Scientific). Genomic DNA from wild-type P. putida KT2440 was extracted using GeneJET Genomic DNA Purification Kit (ThermoFisher Scientific) and 32 μg was digested for 1 hr at 37° C. with 10 units of Sau3AI (NEB) followed by 20 min inactivation at 65° C. in a 1.6 mL reaction volume. The partially Sau3AI-digested genomic DNA was subjected to gel electrophoresis and fragments 2-6 kb were isolated with the GeneJET Gel Extraction Kit (ThermoFisher Scientific). Genomic DNA fragments (3.2 μg) were ligated to linearized pKM062 (450 ng) with T4 DNA ligase (NEB) for 16 hrs at 4° C. in a 200 uL reaction volume. An aliquot of the ligation mixture (5 μL) was transformed into 50 μL chemically competent NEB 5-α F′Iq E. coli 23 times and the resulting cells split between two LB agar plates supplemented with 50 μg/mL kanamycin, yielding 46 plates, and incubated at 37° C. overnight. Based on a dilution series, we estimated that each plate contained 1,350 colonies or about 62,000 colonies in total. Colony PCR screening of a sampling of isolates demonstrated that 5/21 transformants contained pKM062 without any insertion. The colonies on all 46 plates were collected with 1 mL of LB each and subjected to plasmid extraction with the GeneJET Plasmid Miniprep Kit (ThermoFisher Scientific). Isolated plasmids were pooled and vacuum-concentrated to 1 mL with a final concentration of 145 ng/μL.
To screen for rescue of the E. coli ΔfabH colony size defect, strain KM128 (E. coli MG1655 ΔfabH) was grown overnight at 37° C. and electrocompetent cells were prepared as previously described. Electrocompetent cells were mixed with 15 ng of the genomic library, electroporated, and recovered in 1 mL SOC Outgrowth Medium (NEB) shaking at 37° C. for 1 hr. Each transformation culture was distributed among 10 LB agar+50 μg/mL kanamycin plates (100 μL/plate) and grown at 37° C. overnight. Plates supplemented with 1 mM IPTG (GoldBio) were also utilized. Colonies substantially larger than the rest of the population were identified, subjected to plasmid extraction with the GeneJET Plasmid Miniprep Kit (ThermoFisher Scientific), and the plasmids were sequenced with oKM_0172 and oKM_0173 to determine the genomic fragment present.
Cell size determination by light microscopy. E. coli cultures were inoculated from glycerol stocks into 10 mL of LB Broth (Miller; Sigma-Aldrich) with 50 μg/mL kanamycin in a 125 mL flask and incubating overnight at 37° C. and 225 rpm. Overnight cultures were then used to inoculate a second seed culture into 10 mL of LB Broth (Miller; Sigma-Aldrich) with 50 μg/mL kanamycin in a 125 mL flask starting at an OD600=0.2 using a GENESYS™ 140 Visible Light Spectrophotometer (Thermo Scientific). The second seed cultures were incubated at 37° C. at 225 rpm until an OD600 of approximately 1 was reached. A 10 μL aliquot of each culture was applied to slides treated with 0.1% (w/v) poly-L-lysine to stabilize cells for imaging (Electron Microscopy Sciences). Optical polarized light microscopy was conducted using a Nikon E800 confocal microscope (Nikon). A total of 16 images were captured using a 100×1.4NA Plan Apo objective (Nikon) and a SPOT RTKE CCD camera (Diagnostic Instruments) for each strain. Actively dividing cells were manually curated, and the Analyze Particles functionality of the image processing software Fiji was used to calculate their area.
Gas chromatography-mass spectrometry analysis of fatty acid methyl esters. P. putida seed cultures were inoculated from glycerol stocks into 10 mL of LB Broth (Miller; Sigma-Aldrich) in a 125 mL flask and incubated overnight at 30° C. and 225 rpm. Overnight cultures were then used to inoculate a second seed culture into 20 mL of LB Broth (Miller; Sigma-Aldrich) in a 125 mL flask measured at OD600=0.2 using a GENESYS™ 140 Visible Light Spectrophotometer (Thermo Scientific). The second seed cultures were incubated at 30° C. at 225 rpm until an OD600 of approximately 1 was reached. The second seed cultures were pelleted and washed three times with 1× phosphate-buffered saline (137 mM NaCl, 10 mM phosphate, 2.7 mM KCl, pH 7.5). Cell pellets were resuspended in 100 μL of Nanopure™ water on ice and transferred to 1.8 mL borosilicate glass vials with PTFE-lined caps (DWK Life Sciences, Millville, NJ). Samples were frozen on dry ice and lyophilized overnight in a FreeZone 6 Plus (Labconco Corp.). Prior to esterification, 400 pg of benzoic acid (RING-D5) was added as an internal standard (Cambridge Isotope Laboratories, Inc.). Subsequently, 500 μL of 1.25 M hydrogen chloride-methanol solution (Sigma-Aldrich) was added to the samples and caps were sealed tightly. Vials were vortexed for 20 sec then centrifuged at 3,000 g for 3 min. Samples were incubated at 95° C. for 4 hrs while shaking at 600 rpm in an Eppendorf ThermoMixer, then allowed to cool at room temperature for 15 min. Then, 500 μL of Nanopure water and 500 μL of hexane (Sigma-Aldrich) were added to each vial. Sample vials were vortexed for 30 sec then centrifuged at 5,000 g for 10 min, and the upper hexane phase was transferred to a 2 mL glass autosampler vial (Microsolv Technology Corp.). An additional 500 μL of hexane was added to each sample vial, the vials were vortexed and centrifuged as before, and the hexane phase was again transferred to the autosampler vial. One hundred microliters of each sample were transferred to new autosampler vials equipped with 300 μL glass inserts for GC-MS analysis.
Samples were analyzed by an Agilent 8890 GC using a HP-5MS column (30 m×mm×0.25 μm; Agilent Technologies) coupled with a 5977B single quadrupole MSD (Agilent Technologies). A sample volume of 1 μL was injected into a splitless port with a constant inlet temperature of 250° C. The GC temperature was held at 60° C. for 1 minute following injection, then increased to 325° C. at a rate of 10° C. min-1 and held at 325° C. for 10 min. Fatty acid methyl esters were identified by comparison with authentic standard mixes (C8-28, Sigma-Aldrich; C8-C24, Restek Corp.), by matching experimental spectra with a FAMEs library, containing spectra and validated retention indices, and through matches with the NIST20/Wiley 11th GC-MS library. Peak areas were normalized for the internal standard and dry sample weight.
Protein production. Expression plasmids for producing MadB (pKM139), EcMadA (pKM162), holo-EcAcpP (pKM181), apo-EcAcpP (pKM175), EcFabD (pEUK070), EcFabH (pEUK069), Sfp (pET-Sfp, #159617 Addgene), FabH1 (pEUK078), and FabH2 (pEUK079) were transformed into E. coli BL211(DE3). LB Broth (Miller; Sigma-Aldrich) supplemented with 100 mg/mL ampicillin was used for the protein production. One liter of the main culture was grown in a 2.5 mL baffled flask, seeded with 5 mL of an overnight grown starter culture, and grown at 37° C. and 200 rpm. The culture was induced with 1 mM IPTG upon reaching OD600 ˜0.7 and the temperature was lowered to 18° C. The culture producing holo-EcAcpP was also supplemented with 0.5 mM D-pantothenate at the point of induction. The resulting biomass was harvested by centrifugation following about 16 hours incubation post induction and stored at −80° C. until further use.
Protein purification. Poly-His-tagged proteins (apo-AcpP, holo-AcpP, EcFabD, EcFabH, FabH1, FabH2, EcMadA) were purified using the combination of an immobilized metal affinity chromatography (IMAC) and anion exchange chromatography. MadB was tagless and purified using the combination of anion and cation exchange chromatographies. Sfp was purified following an established protocol (Yin et al., 2006). The ion exchange chromatography was performed using an ÄKTA fast protein liquid chromatography (FPLC) system (Cytiva). All purifications were performed at room temperature using 20 mM HEPES, 100 mM NaCl pH 7.5 (Buffer A) unless stated otherwise. Protein concentration was determined using Pierce™ Rapid Gold BCA Protein Assay Kit (Thermo Scientific). The frozen biomass was thawed, resuspended in minimal volume of Buffer A spiked with a trace (˜2 mg) of DNAseI, and lysed by sonication at 4° C. The lysate was cleared by centrifugation and a passage through 0.45 μm filter prior to purification by chromatography. Cleared lysate of a poly-His-tagged protein was applied to a HisTrap cartridge or a gravity-fed Ni Sepharose resin (Cytiva), washed with up to 20 mM imidazole, and eluted with 400 mM imidazole. The eluate was concentrated and exchanged to buffer A using a spin column with appropriate size cutoff. The protein was further purified using anion exchange chromatography (Source 15Q, Cytiva) and eluted using a linear NaCl gradient in Buffer A. Fractions of interest was pooled, concentrated, exchanged to buffer A using spin columns, frozen as beads over a liquid N2 bath, and stored at −80° C.
Cleared lysate of MadB was applied to an anion exchange resin (Source 15Q, Cytiva) using Buffer A. The flowthrough was collected and concentrated using a spin column. The protein concentrate was serially diluted to 20 mM citrate pH 5. Precipitated protein was removed by centrifugation and the soluble fraction was applied to a cation exchange resin (Source 15S, Cytiva). MadB was eluted with a linear NaCl gradient in 20 mM citrate pH 5. Fractions of interest was pooled, concentrated, exchanged to buffer A using spin columns, frozen as beads over liquid N2 bath, and stored at −80° C.
Size exclusion chromatography (SEC) analysis of MadB. The biologically relevant oligomeric structure of MadB was determined using HiLoad 16/600 Superdex 75 pg operated with an ÄKTA FPLC system (Cytiva) running isocratically at 0.85 mL/min in 20 mM Tris-C1, 100 mM NaCl pH 8 at room temperature. The sample injection was 500 μL of 1 mg/mL MadB diluted in the SEC running buffer. The retention time of MadB was compared to a series of standards (Cytiva): blue dextran (200 kDa), aldolase (158 kDa), conalbumin (75 kDa), ovalbumin (43 kDa), carbonic anhydrase (29 kDa), ribonuclease A (13.7 kDa), and aprotinin (6.5 kDa).
Acyl-EcAcpP production for MS analysis. Various acyl-EcAcpP was produced in one-pot in vitro reactions for MS-analysis in HEPES (I=0.1 M) pH 7.5 supplemented with 10 mM MgCl2 and 1 mM TCEP. The reaction was left at room temperature for at least 5 min after the last component was added. The mixture was flash frozen and stored at −20° C. until LC-MS analysis. EcAcpP, in either apo- or holo-forms, was added at 100 μM, acyl-CoA at 100 μM, and acyl-EcAcpP modifying enzymes at 5 μM.
The activity of FabH1 and FabH2 was evaluated in a mixture containing holo-EcAcpP (100 μM), EcFabD (5 μM), and malonyl-CoA (500 μM) to produce malonyl-EcAcpP. Subsequently, acetyl-CoA (500 μM), octanoyl-CoA (500 μM), and 5 μM of either FabH1 or FabH2 to produce the 3-ketoacyl-EcAcpP species. An equivalent control reaction (without octanoyl-CoA) to produce acetoacetyl-EcAcpP was made using EcFabH.
The decarboxylation of malonyl-EcAcpP by MadB or EcMadA was evaluated in a mixture containing holo-EcAcpP (100 μM), EcFabD (5 μM), malonyl-CoA (500 μM), and MadB (5 μM). Alternatively, the decarboxylation activity was evaluated in a mixture containing apo-EcAcpP (100 μM), Sfp (5 μM), malonyl-CoA (500 μM), and MadB (5 μM). The latter route was used as EcFabD can partially decarboxylate malonyl-EcAcpP to acetyl-EcAcpP; by contrast, Sfp did not appear to have a decarboxylative activity. Control reactions producing malonyl-EcAcpP and acetyl-EcAcpP, with an appropriate acyl-CoA donor, were made similarly employing acyl- and acyl-phosphopantetheine transferase activity of FabD and Sfp, respectively. As MadB was shown to decarboxylate malonyl-CoA, an alternate experimental setup was devised to differentiate whether the acetyl-EcAcpP was produced from the decarboxylation of malonyl-EcAcpP, or from Sfp- or FabD-catalyzed reaction between acetyl-CoA and EcAcpP. In this setup, a mixture producing malonyl-EcAcpP, by means of Sfp, was preincubated for 5 minutes to produce malonyl-EcAcpP prior to being split into three separate aliquots: MadB was added to the first aliquot, EcMadA to the second, and the last aliquot was left as a ‘no enzyme’ control. The extent of the decarboxylation activity was monitored as a function of malonyl-EcAcpP depletion relative to the ‘no enzyme’ control.
Experiments evaluating whether MadB can catalyze the Claisen-condensation reaction (akin to EcFabH) were performed in a mixture containing holo-EcAcpP (100 μM), EcFabD (5 μM), malonyl-CoA (500 μM), acetyl-CoA (500 μM), and MadB (5 μM). A control reaction producing acetoacetyl-EcAcpP was produced by substituting PP_0262 with EcFabH. Similarly, experiments evaluating whether MadB can catalyze a direct acyl-transferase reaction (akin to EcFabD) were performed in a mixture containing holo-EcAcpP (100 μM), acetyl-CoA (500 μM), and MadB (5 μM). A negative control reaction without MadB was also ran for comparison.
Transformation of non-EcAcpP substrate by MadB. Malonyl-CoA, methylmalonyl-CoA, and malonate were tested as potential non-EcAcpP-tethered substrates for MadB. The reaction contained 1 mM of the alternate substrate and 5 μM of MadB in HEPES (I=0.1 M) pH 7.5 supplemented with 10 mM MgCl2 and 1 mM TCEP. The reaction was incubated at room temperature with an intermittent agitation for −1 hour prior to being passed through a 0.2 μm filter and frozen until further analysis.
High-performance liquid chromatography (HPLC) and liquid chromatography mass spectrometry (LC-MS). Intact protein LC-MS was performed by diluting protein samples 500 times in 2% acetonitrile and 0.1% formic acid in water. Five microliters of the diluted solution of each sample were injected into the LC system. A Waters NanoAquity LC system equipped with two binary nanoflow pumps was used. Solvent A was 0.2% formic acid in water, and solvent B was 0.2% formic acid in acetonitrile. One pump was used for online desalting at 3 μL/min for 6 min using 10% solvent B with a short trapping column (in-house packed, 5 cm, inner diameter 150 μm, outer diameter 360 μm, C2 reversed phase, MEB2-3-300, Separation Methods Technologies). The other gradient pump was operated at 0.3 μL/min with an analytical column with C2 stationary phase (in-house packed, 50 cm, inner diameter 100 μm, outer diameter 360 μm, same packing material as the trap column). The separation gradient was from 10-40% solvent B over 25 min, followed by 5 min ramp to 60% B. A Thermo Orbitrap Exploris 480 was used for MS data collection, with data-dependent acquisition with higher energy collision dissociation (HCD). The raw data were summed over 14-24 min in retention time and deconvoluted using Intact Mass v 4.1 (ProteinMetrics). Charge vector spacing was 0.1, baseline radius was 15 m/z, smoothing sigma was 0.01 m/z, spacing was 0.005 m/z, mass smoothing sigma was 0.1, mass spacing was 0.05. The LC-MS data were deposited into MassIVE.ucsd.edu and can be accessed with identifier MassIVE accession: MSV000090294. We also examined intact masses of the same diluted protein samples by matrix assisted laser desorption/ionization (MALDI) using dried spot method as outlined in more detail below. The results were similar to LC-MS but contained more adducts due to lack of desalting and additional LC separation. Thus, the latter results are not reported here.
Detection of various CoA analytes in the samples were performed on an Agilent 1200 HPLC system (Agilent Technologies) equipped with an Agilent 6120 mass spectrometer detector (MS). Each sample and standard were injected at a volume of 2 μL on a Phenomenex Luna C18(2)-HST column 100A′, 25 μm, 2.0×100 mm column (Phenomenex). The column temperature was maintained at 45° C. and the buffers used to separate the analytes of interest was 0.1% formic acid in water (A)/0.1% formic acid in acetonitrile (B). A gradient program was used to separate the analytes of interest: (A)=100% and (B)=0% at time t=0; (A)=100% and (B)=0% at t=1 min; (A)=50% and (B)=50% at t=7.65 min; (A)=30% and (B)=70% at t=9.33 min; (A)=30% and (B)=70% at t=10.67 min; (A)=100% and (B)=0% at t=10.68 min; (A)=100% and (B)=0% at t=13 min. The flow rate was held constant at 0.50 mL/min resulting in a run time of 13 minutes. The MS system was setup in positive electrospray ionization mode with a gas temperature of 350° C., drying gas at 12 L/min, nebulizer pressure set to 35 psig, and a Vcap voltage of 3000v. A total of four different masses in SIM mode from the MS detector were used to determine the presence and or absence of each analyte of interest. A mass of 810.5 m/z (M+H)+ was used for analyte acetyl-CoA, 854.5 m/z (M+H)+ for malonyl-CoA, 868.5 m/z (M+H)+ for methylmalonyl-CoA, and 824.5 m/z (M+H)+ for propionyl-CoA. One standard from each analyte was used to determine the elution order and retention times of the compounds.
Detection of malonic and acetic acid was performed on an Agilent 1290 HPLC system (Agilent Technologies) equipped with an Agilent G7117A diode array detector (DAD). NREL's laboratory analytical procedure “Determination of Structural Carbohydrates and Lignin in Biomass” (Sluiter et al., 2012) HPLC method was used to detect the analytes of interest as outlined below. Each sample was injected at a volume of 20 μL onto an BioRad Aminex HPX-87H column 9 μm, 7.8×300 mm column (BioRad) at a column temperature of 55° C. A DAD detector was used at a wavelength of 210 nm to detect the analytes of interest. Compounds were separated utilizing an isocratic flow of 0.01 N H2SO¬4 in water at 0.6 mL/min for a total run time of 27 min. Acid standards of malonic, and acetic acid were purchased from Sigma Aldrich (Sigma-Aldrich). A one-point curve was used to establish retention times for the analytes of interest.
Matrix assisted laser desorption ionization (MALDI) MS. Protein samples were diluted 50 times in 2% acetonitrile with 0.1% formic acid. MALDI matrix was prepared by dissolving 2,5-dihydroxyacetophenone (15 mg/mL) in 90% acetonitrile with 0.2% trifluoroacetic acid. Protein samples were mixed with the MALDI matrix solution in a 1:1 ratio (v/v) and the mixture (1 μL) was deposited on a conductive indium tin oxide (ITO) glass slide. MALDI mass measurements were performed on ThermoFisher ultra-high mass range (UHMR)-Orbitrap coupled to a MALDI source (Spectroglyph, LLC) with a 1 kHz Explorer One Nd:YAG (349 nm) laser (Zemaitis et al., 2022). The instrument was operated in a positive mode over a m/z range of 6,000 to 20,000 with ˜40K resolving power at 10,000 m/z.
Protein crystallization. For crystallization, MadB was further purified with size-exclusion chromatography and eluted as a dimer into 20 mM Tris pH 8.0 and 150 mM NaCl. The protein was concentrated to 10 mg/mL and sitting drop co-crystallization trials were set up with a Mosquito crystallization robot (sptlabtech) using SWISSCI 3-lens low profile crystallization plates. Crystals grew in condition A5 (25% PEG 1500 and 0.1 M SPG buffer at pH 8.0) of the PACT screen (Molecular Dimensions). Crystals were cryo-protected with 20% glycerol and flash-frozen into liquid nitrogen. Diffraction data was collected on beamline 103 at the Diamond Light Source and automatically processed with STARANISO on ispyb. The structure was solved within CCP4 Cloud by molecular replacement with MOLREP using a search model created by the phyre2 server. Model building was performed in Coot and the structures was refined with REFMAC. MolProbity was used to evaluate the final model and PyMOL (Schrödinger, LLC) for protein model visualization. The atomic coordinates have been deposited in the Protein Data Bank and are available under the accession code 8AYV. Comparison of the MadB structure with the protein architectures of structural homologs was performed with the Dali server.
Benchmark of computational method. To validate the computational method used in the manuscript, we benchmarked the single point energies of the transition states and key intermediates presented in the manuscript with different combinations of density functional, basis set, and solvation models. Because all tested computational methods produced consistent results, suggesting the C3 carbonyl-stabilized pathway is the lowest energy pathway, we conclude that the computational method presented in the manuscript is reliable. Not only are the trends consistent within a certain intermediate despite the functional, the trends in energy change between two different intermediates are also consistent, indicating that the computed reaction mechanism in this study is accurate.
Scan of decarboxylation reaction coordinate without MadB. Scan of the decarboxylation reaction coordinate profile along the C—C distance helped us locating the decarboxylation transition state. We observe that the decarboxylation reaction coordinate is relatively flat, and we expect a transition state with C—C distance close to 2.50 Å.
Effects of single amino acid residues on decarboxylation transition state. We studied the effects of single amino acid residues on the decarboxylation transition state. We observed that N45 and H46 are stronger hydrogen bond donors that stabilize the anionic charge built up on the carbonyl oxygen atom in TS1.
Melting temperature determination by DSC. The apparent melting temperature values for wild-type MadB and the N45A and H46A mutants were assessed by DSC. Immediately prior to DSC analysis, to ensure both mono-dispersity and an optimal buffer match, each enzyme was prepared by SEC through a HiLoad Superdex 75 pg column (Cytiva) pre-equilibrated with the reference buffer comprising 100 mM NaCl, 50 mM NaH2PO4 pH 7.5 (NP75). The SEC column was calibrated with a mixture of globular protein standards (Sigma-Aldrich)—ribonuclease A (13.7 kDa), albumin (67.0 kDa), γ-globulin (158 kDa) and thyroglobulin (670 kDa)— to allow for the calculation of an apparent molecular weight for native MadB from its elution volume. Subsequently, triplicate DSC analyses, using 0.023 to 0.375 mg/mL wild-type enzyme (1.39 to 22.2 μM monomer concentration), were performed on a MicroCal PEAQ-DSC-Automated instrument (Malvern Panalytical). The two mutant enzymes were analyzed at 0.2 mg/mL. The sample and reference cells were raised in temperature from 30° C. to 120° C. at a rate of 1.5° C./min in low feedback mode.
Kinetic determination using ITC. Turnover of coenzyme A derivatives by each MadB variant was investigated using ITC on a MicroCal PEAQ-ITC-Automated instrument (Malvern Panalytical) utilizing the same NP75 buffer as above. All analyses were performed at 30° C. using high feedback mode. Stock solutions of acetyl-CoA and malonyl-CoA were prepared by dissolving the solid substrates (Sigma-Aldrich) in NP75 buffer and adjusting the pH to 7.5 with small aliquots of 1M NaOH, and then quantitated by UV absorbance spectroscopy using a molar extinction coefficient of 15,400 M-1 cm-1. Substrate preference and the apparent reaction enthalpy (ΔHapp) for malonyl-CoA turnover were assessed using a single-injection enzyme kinetics method (Todd and Gomez, 2001) with a reference power of 20 μcal/s and, following the injection, initial concentrations of 9.43 μM wild-type MadB and 371 μM substrate.
Subsequently, the kinetics of malonyl-CoA turnover was studied, in triplicate, using a multi-injection kinetics assay with a reference power of using 20 μcal/s. In each replicate, twelve low-volume aliquots of 47.7 mM malonyl-CoA were injected into the ITC cell containing either 1.00 μM wild-type MadB, 20.0 μM MadB (N45A) or 2.50 μM MadB (H46A) (monomer concentration) at 3 min intervals to give total substrate concentrations of up to 7.2 mM; higher malonyl-CoA concentrations could not be reached because of the large exothermic heats of dilution upon substrate injection. Power measurements were averaged over a window 150-180 s after each injection and converted to reaction rate using the determined ΔHapp of −3.56 kcal/mol (−14.9 kJ/mol). Data analysis and Michaelis-Menten curve fitting was performed using the instrument's analysis software (v1.40).
Results
P. putida strains lacking KASIII homologs are viable. In an effort to identify candidate KASIII enzymes in P. putida, we performed a BLAST search against the P. putida proteome using E. coli FabH (EcFabH, hereafter) as the query sequence. We identified two predicted proteins that we deemed FabH1 (PP_4379) and FabH2 (PP_4545), which exhibit 29% and 26% identity to EcFabH, respectively. Unlike the gene encoding EcFabH, FabH1 and FabH2 are not embedded within an operon that contains other fatty acid biosynthesis genes (
FabH1 and FabH2 exhibit different acyl-CoA substrate specificities. Considering the lack of overt phenotype observed in the double deletion strain, we were curious if either FabH1 or FabH2 exhibited any fatty acid initiation activity at all. To examine the in vivo activity of both proteins, we expressed them heterologously in a ΔfabH E. coli strain we generated using CRISPR-Cas9. While the ΔfabH E. coli strain displayed a reduced maximal growth rate compared to wild type, plasmid-based expression of EcfabH or fabH1 partially restored wild type-like growth in this genetic background (
We next validated the activity and specificity of the P. putida EcFabH homologs in vitro by determining their production of 3-ketoacyl-ACP species using a high mass accuracy intact protein liquid chromatography-mass spectrometry (LC-MS). In the following assays, malonyl-ACP was produced by EcFabD-catalyzed acyl transfer reaction between holo-ACP and malonyl-CoA. In the presence of malonyl-ACP and acetyl-CoA, purified EcFabH produced β-acetoacetyl-ACP (
Expression of PP_0262 rescues E. coli ΔfabH growth and cell size defects. To elucidate other genes in P. putida with fatty acid initiation activity, we utilized a forward genetic screen (
In addition to a small colony phenotype, E. coli strains lacking FabH have been reported to have reduced cell size. To determine if expression of madB can restore wild type-like cell size in a ΔEcfabH strain, we used microscopy to examine actively dividing cells from wild type, ΔEcfabH, and ΔEcfabH+madB strains. As previously reported, we confirmed ΔEcfabH cells are significantly smaller than wild-type cells (P<0.0001) (
MadB supports growth of E. coli lacking both native FabH and MadA. Although MadB appeared to have fatty acid initiation capabilities comparable to EcFabH, it remained a formal possibility that MadB does not have a direct role in fatty acid initiation. For example, E. coli strains lacking FabH can still grow, albeit poorly, due to the presence of MadA (YiiD). Furthermore, overexpression of madA has been demonstrated to rescue both the growth and cell size defects of ΔfabH E. coli strains. To rule out any indirect effects MadB may exert on madA expression, we generated an E. coli strain that lacked madA and had fabH replaced with madB using CRISPR-Cas9 (
Synthetic growth defect of ΔmadB ΔfabH1 P. putida is rescued by medium-chain-length fatty acids. To examine the role of MadB in P. putida fatty acid initiation, we generated strains lacking madB singly and in combination with ΔfabH1 and ΔfabH2. We found that all single and double mutants, except ΔmadB ΔfabH1, displayed wild type-like growth (
Since the ΔmadB ΔfabH1 double mutant still retained FabH2, which we demonstrated can accept a medium-chain-length acyl-CoA (
Loss of FabH1, FabH2, and MadB is synthetically lethal. To determine if we had identified all the fatty acid initiation enzymes in P. putida, we attempted to generate a P. putida strain lacking fabH1, fabH2, and madB. These initial efforts were unsuccessful. To test the hypothesis of synthetic lethality more definitively, we generated a P. putida strain lacking fabH1, fabH2, and madB, but containing a temperature-sensitive plasmid that expressed madB. Whereas wild-type P. putida showed similar growth at 25° C. and 37° C., the triple mutant bearing the temperature-sensitive plasmid displayed a dramatic loss of viability at the restrictive temperature (
MadB is a malonyl-ACP decarboxylase. There are three potential routes to generate β-acetoacetyl-ACP for the initiation of fatty acid biosynthesis. The first route is the direct Claisen-condensation of acetyl-CoA with malonyl-ACP to yield β-acetoacetyl-ACP. We have shown that FabH1 is capable of this activity (
The nature of MadB-catalyzed decarboxylation of malonyl-CoA was also investigated using ITC. The mixing of malonyl-CoA and MadB emitted a sustained heat indicating an exothermic reaction (ΔHapp=−14.9 kJ/mol). In contrast, no heat was detected upon injection of acetyl-CoA, other than the transient heat of dilution due to the injection event. The kinetics of this reaction were investigated by a multi-injection ITC method, which revealed a Km of 12±1 mM and kcat of 1.4±0.1 s-1 for malonyl-CoA at 30° C., a catalytic efficiency that is likely too low to be physiologically relevant. Hence, MadB was appropriately deemed a malonyl-ACP decarboxylase in agreement with prior determination of MadA from E. coli and MadB from Shewanella oneidensis.
Alanine-scanning mutagenesis of MadB indicates residues important for function. To define the amino acids essential for MadB activity and gain insight into the reaction mechanism, we screened a complete library of variants harboring MadB alanine-substituted mutants for their inability to rescue the colony size defect of a ΔfabH E. coli strain, ultimately identifying eight such MadB variants. We subsequently verified the growth defect of these variants using a microplate reader (
MadB belongs to the hotdog fold protein family. To better understand the mechanism of MadB-catalyzed malonyl-ACP decarboxylation, we solved the structure of MadB via X-ray crystallography at 1.04 Å resolution (PDB 8AYV). MadB was structurally closest to a homolog from Chlorobaculum tepidum (PDB 3LMB; 38% sequence identity with a root-mean-square deviation (RMSD) of 1.6 Å). We found that MadB adopts a HDF configuration, which is common among enzymes involved in fatty acid metabolism. MadB exists as a dimer and this arrangement was observed in crystallography and corroborated by size-exclusion chromatography.
Differential scanning calorimetry analysis of MadB at 22.2 μM (monomer concentration) revealed that the enzyme thermally unfolds with a single transition at 66.4° C.; the lack of a second peak indicates that there is no significant monomer-dimer exchange at this concentration. This melting temperature remains constant upon two-fold dilution to 11.1 μM and 5.55 μM monomer.
Next, we sought to generate a model of MadB in complex with malonyl-ACP. We first performed a structural similarity search and found various structures of LnmK (4% sequence identity to MadB with RMSD of 3.0 Å) in substrate analog-bound complexes. LnmK is a previously characterized double-HDF enzyme that is a bifunctional decarboxylase/acyltransferase associated with the biosynthesis of leinamycin, a potential anti-cancer drug. The similar scaffold (HDF), substrate (methylmalonyl-CoA vs. malonyl-ACP), and activity (decarboxylation) of LnmK and MadB guided our efforts to first dock malonyl-CoA into the presumed active site of MadB. An initial model (Minitial) was built via structural alignment of MadB with LnmK in complex with 2-nitronate-propionyl-CoA (PDB 6X7L), which was easily modified in silico to generate the malonyl-CoA ligand. We refined Minitial with a molecular dynamics (MD) protocol that reproduces an induced-fit process, and generated two top models, MMD1_CoA and MMD2_CoA, with AutoDock Vina estimated binding affinities at −13.9 and −13.6 kcal/mol, respectively. Unlike MMD1_CoA, MMD2_CoA implicated residues AsnN45 and R124Arg124, identified in the alanine-scanning mutagenesis screen, as interacting with malonyl-CoA at its reaction center. In addition, MMD2_CoA established a hydrogen bond network with the terminal malonyl group analogous to that observed in LnmK (
We further performed MD-based refinement to generate a model of MadBmalonyl-ACP, using Minitial after the replacement of CoA with ACP from E. coli (PDB 4KEH; 86% identity to PP_1915 gene product, ACP from P. putida). Five conformations selected from the MD simulations suggested that a binding mode like MMD2_CoA is prevalent in the presence of ACP (
Among the functional residues identified via alanine-scanning, our model of MadBmalonyl-ACP seems to explain the importance of residues Asn45 and Arg124 as being directly involved in substrate binding. Specifically, Asn45 and Arg124 bind the terminal carboxyl group of the malonyl and the pantetheine moieties, respectively. In addition, the identification of aromatic residues Trp64, Tyr90, and Tyr147 is likely explained by their role in forming the hydrophobic core underlying the substrate binding site. The remaining identified residues, Ile16, Leu18, and Asn43, are located near the dimer interface where the proposed decarboxylation occurs. To determine if these residues play an outsized role in MadB dimerization, we used the machine learning-based predictor KFC2a, which predicts residues that account for the majority of the binding affinity in a complex. We found that Ile16, Leu18, and Asn43, were among the eight residues predicted by KFC2a to be important for MadB dimerization. The independent identification and correlation of crucial residues in MadB through in vivo and in silico analyses strengthens the validity of our structural binding model.
MadB catalyzes malonyl decarboxylation through carbonyl stabilization. Based on the precedence of the catalytic mechanism proposed for LnmK and the data presented in this study, we propose a catalytic mechanism for MadB (
From the terminal carboxylate of malonyl-ACP (IM0), we modeled the reaction coordinate profile to form the enolate intermediate IM2 through the decarboxylation transition state TS1 (
To further evaluate the relative contribution of amino acid side chains of Asn45 and His46 towards catalysis, we purified Asn45A and His46A variants of MadB and compared their catalytic efficiency for malonyl-CoA decarboxylation against the wild-type counterpart using the same ITC-based assay. This in vitro experiment provides a clearer phenotype when compared to the prior in vivo alanine scan experiment as described previously. Similar analyses, however, are not available for Gly52 and Ile83 as their hydrogen bonding interactions are contributed by the amide main chain. Both variants were purified similarly to the wild-type MadB. As described above, the MadB-catalyzed decarboxylation of malonyl-CoA is not physiologically relevant; however, this scheme is useful for comparing the performance of the different variants. The kinetic results are summarized in Table 1. While both variants have about half the Michaelis constant, their lowered turnover numbers dramatically affected the overall catalytic efficiency with Asn45Ala and His46Ala resulting in 13- and 1.5-fold lower catalytic efficiency, respectively when compared to the wild-type enzyme and are in line with our prior in vivo alanine scan result. The dramatic loss in turnover number in Asn45Ala variant is consistent with its purported role for transition state stabilization. Conversely, the less severe defect observed for His46Ala may stem from a less productive orientation of the bound substrate. Overall, along with the MD-derived model, our DFT calculations highlight the importance of multiple amino acids including Asn45, His46, Gly52, and Ile83 in promoting the decarboxylation reaction of the terminal carboxylate of malonyl-ACP at MadB.
Functional homologs of MadB present in a diversity of bacterial lineages. To determine the phylogenetic distribution of MadB homologs, we utilized an iterative profile-HMM search method, JackHMMER, using MadB as the search query. We identified a total of 479 potential homologs with the same protein domain structure as MadB (
Discussion & Conclusions
Genetically dissecting the components of bacterial fatty acid biosynthesis has been traditionally difficult, in part, due to the essential nature of this process. For example, supplementing E. coli growth medium with fatty acids does not guarantee the viability of a particular fab mutant. Aided by genetic redundancy, here we identified and characterized the three enzymes, and three respective pathways, that initiate fatty acid biosynthesis in P. putida KT2440 (
Our bioinformatics-driven approach identified FabH1 and FabH2 in P. putida KT2440 as potential KASIII enzymes. We found that FabH1, like EcFabH, catalyzes the Claisen condensation of acetyl-CoA with malonyl-ACP to produce β-acetoacetyl-ACP in vitro. In addition, P. putida ΔfabH1 strains display a similar fatty acid biosynthesis defect observed in an E. coli ΔfabH strain. For this reason, fabH1 appears to be the gene conspicuously absent from the P. putida “fabHDG” operon. Sequence analysis of FabH2 revealed high identity with a P. aeruginosa protein, PA3286 gene product. Like PA3286 gene product, FabH2 prefers octanoyl-CoA over acetyl-CoA, and produces β-keto-decanoyl-ACP. In wild-type P. putida, FabH2 likely helps assimilate exogenous fatty acids and recycle endogenous fatty acids. Interestingly, a strain relying on FabH2 alone for growth (ΔmadB ΔfabH1) is viable in minimal medium, suggesting sufficient side-activity with acetyl-CoA to support de novo fatty acid initiation.
Despite the apparent ability of FabH1 and FabH2 to initiate fatty acid biosynthesis, we found that they were unnecessary for P. putida viability due to the presence of madB, which we identified in a forward genetic screen. Loss of MadB alone does not cause an overt defect in fatty acid biosynthesis, unlike loss of FabH1, but the ΔmadB ΔfabH1 double mutant exhibits a severe growth defect. This suggests that FabH1 and MadB to a lesser degree, are the main initiation factors in P. putida. We interrogated three possible routes for MadB-catalyzed fatty acid initiation and discovered that MadB is capable of malonyl-ACP decarboxylation. The acetyl-ACP species produced by MadB is presumably condensed with malonyl-ACP by FabB/F, generating β-acetoacetyl-ACP. Whereas the FabH1 pathway only requires one malonyl-ACP and one acetyl-CoA to produce β-acetoacetyl-ACP, the MadB pathway requires two malonyl-ACP molecules; one to generate acetyl-ACP and another for the condensation reaction with FabB/F. This additional malonyl-ACP molecule, ultimately derived from acetyl-CoA, comes at the cost of ATP hydrolysis (
The decarboxylation of a malonyl group can occur through both abiotic and biotic processes. The non-enzymatic process requires a highly acidic condition and is initiated by the f3-carbonyl group abstracting a proton from the terminal carboxylate (pKa ˜0.3). Alternatively, an uncatalyzed reaction at neutral pH would encounter such a high activation energy barrier that the process would proceed at a glacial pace. By contrast, the MadB-catalyzed reaction enables FabH-independent FASII initiation such that no apparent growth defect is observed in P. putida ΔfabH1 ΔfabH2 strains. In addition to HDF, there are at least three additional enzyme scaffolds that have evolved means to catalyze the decarboxylation of a malonyl group at ambient conditions: crotonase, GNAT, and the biotin dependent Na+ translocating protein families; all but the last protein family are cofactorless systems. Specifically for the GNAT protein family, its counterpart in MadA is non-functional and its function remains unknown. Despite the distinct protein scaffolds, the active site architectures of the cofactorless systems are strikingly similar, employing elements for substrate polarization, often with the amide backbones highlighting a common catabolic logic: the polarization of the terminal carboxylate and a resonance stabilized f3-keto-enolate intermediate.
Unlike the first half of the malonyl decarboxylation, the final resolution of the enolate intermediate (IM2) remains elusive. It is unclear whether this process occurs spontaneously, through a proton exchange with a solvent species, or is assisted by an amino acid side chain. In the case of both methylmalonyl-CoA decarboxylase (MMCD, crotonase family) and LnmK (HDF family) a flexible acid catalyst was proposed facilitate this process; but this residue is not conserved, or its equivalent is missing in other enzyme systems indicating an alternate means for the keto-enol tautomerization to proceed. In MadB, this final acid catalyst role may be fulfilled by His80 as suggested by the predicted models of malonyl-derivatives in complex with MadB. Instead, His80 may possess additional role on an earlier stage of the reaction by attracting the electron density from the terminal carboxylate towards the β-keto carbonyl group. Nevertheless, other adjacent basic amino acids, such as His46 and Lys47, may at least partially compensate for the substitution His80Ala, explaining the non-essential role of His80 for catalysis.
The analyses provided by the structural modeling and the DFT calculation corroborates the function of the residues identified via the in vivo alanine scanning; namely, the direct interaction with the reaction core (Asn45) or with distal regions of the substrate (Arg124), the stabilization of the dimer assembly of the substrate-binding site (Ile16, Leu18, and Asn43), or the formation of its underlying hydrophobic core (Trp64, Tyr90, and Tyr147). Other residues that are key for catalysis according to the proposed reaction mechanism (Gly52 and Ile83) were not identified as functional residues via mutagenesis because, according to our models, their interactions with the substrate involve backbone atoms. Indeed, many of the initial substitutions in the alanine scan did not result in a complete growth defect in the ΔfabH E. coli background, likely due to the complex interplay between the mutant MadB and EcMadA. The result was only confounded as the amino acid residues involved in catalysis serve to polarize the substrate instead of acting as a direct proton shuttle. As such, the substitution effect would be dampened for example through the effect of ordered water or chemical rescue. Additionally, other structural features such as helix II of ACP (“recognition helix”) may play an important role in stabilizing the substrate in a reactive conformation, which may contribute to the preferential activity of MadB toward malonyl-ACP instead of malonyl-CoA through the complementary hydropathicity.
In summary, our study describes three routes for FASII initiation in P. putida KT2440. FabH1 and FabH2 are responsible for the conventional Claisen condensation type reactions utilizing short and medium chain acyl-CoA donors, respectively. The third route utilizes the malonyl-ACP decarboxylase system, MadB. Understanding these enzymes opens an avenue for tunability of fatty acid metabolism in P. putida and provides insight toward the development of a more efficient biocatalyst such as in the production of fatty-acid derived bioproducts. For example, carbon flux may be redirected to produce non-native fatty acid products by employing FabH homologs with different substrate specificities. Moreover, the prevalence of malonyl-ACP decarboxylase system in the domain Bacteria also opens an avenue for biomedical applications such as in the development of novel antibiotics target.
The foregoing disclosure has been set forth merely to illustrate the invention and is not intended to be limiting.
This application claims the benefit of U.S. Patent Application No. 63/321,207 filed on 18 Mar. 2022 and U.S. Patent Application No. 63/479,918 filed on 13 Jan. 2023, both of which are incorporated by reference herein in their entirety.
The United States Government has rights in this invention under Contract No. DE-AC36-08GO28308 between the United States Department of Energy and Alliance for Sustainable Energy, LLC, the Manager and Operator of the National Renewable Energy Laboratory.
Number | Date | Country | |
---|---|---|---|
63321207 | Mar 2022 | US | |
63479918 | Jan 2023 | US |