A Sequence Listing, incorporated herein by reference, is submitted in electronic form as an ASCII text file of size 144 KB, created Sep. 2, 2020, and named “8LX7664.TXT”.
This disclosure relates to Type II biotin synthases, and methods of producing biotin with such an enzyme.
Biotin, also known as vitamin B7, is an essential dietary vitamin for humans because humans are unable to produce biotin, which is a cofactor used in carboxylation, decarboxylation, and transcarboxylation reactions in many organisms, including humans. However, the human gut microbiome has been shown to contain Escherichia coli that contain biotin synthase that can provide a source of some biotin.
Biotin synthase (BioB) (EC 2.8.1.6) is the enzyme that catalyzes the final step in the biotin biosynthetic pathway, conversion of desthiobiotin (DTB) to biotin.
Biotin synthase is a member of the “radical SAM” superfamily, which is characterized by the presence of a conserved CxxxCxxC sequence motif (C, Cys; x, any amino acid) that coordinates an essential [4Fe—4S] cluster and the use of S-adenosyl-L-methionine (AdoMet or SAM) for radical generation in converting desthiobiotin to biotin.
The crystal structure of the E. coli biotin synthase in complex with SAM and desthiobiotin has been determined to 3.4 angstrom resolution (Berkovitch et al., Science, 2004, 303):76-70). The E. coli biotin synthase is a homodimer, with each monomer in the structure containing a triose phosphate isomerase (TIM) barrel with an [4Fe—4S] cluster, SAM, and an [2Fe—2S] cluster. Three of the four Fe atoms in the [4Fe—4S] cluster of the E. coli monomer chain coordinate with the 3 cysteines of the radical SAM sequence motif (Cys-53, Cys537, and Cys-60). The fourth ligand of the [4Fe—4S] cluster is an exchangeable S-adenosyl-L-methionine, which binds as an N/O chelate to the Fe through its amino-group nitrogen and carboxyl-group oxygen. The [2Fe—2S] cluster is coordinated with three cysteines (Cys-97, Cys-128, and Cys-188) and an arginine (arg-260) in the E. coli monomer. Arg-260 was also observed to interact with Ser-43, Ser-218, Ser-283, and Arg-95 in addition to the [2Fe—2S] cluster.
The [4Fe—4S] cluster (the radical SAM or RS cluster) of E. coli BioB is used as a catalytic cofactor, directly coordinating to SAM. The role of the [4Fe—4S] cofactor is to transfer an electron onto SAM, leading to formation of the 5′ deoxyadenosyl radical.
Isotopic labelling and spectroscopic studies show destruction of the auxiliary [2Fe—2S] cluster accompanies E. coli BioB turnover, indicating that it is likely a sulfur from [2Fe—2S] being incorporated into DTB to form biotin.
The E. coli biotin synthase is unable to be reactivated and is thus classified as a “suicide enzyme” since it destroys itself during turnover. The known biotin synthases have been shown to possess very poor kinetic properties.
The worldwide market for biotin as a nutritional supplement is greater than $180 million, and predicted to more than double by 2024. Currently, biotin is manufactured industrially using chemical synthesis because enzymatic synthesis or cell-based production of biotin has not been commercially viable with the previously characterized BioBs, such as the E. coli biotin synthase, due to the poor kinetic properties of the enzymes.
Thus, there is a need for a cost-competitive cell-based biotin manufacturing process to replace the current chemical syntheses.
A method of producing biotin includes contacting desthiobiotin with a Type II biotin synthase (Type II BioB) holo-protein in vitro under conditions effective to produce biotin; and recovering the biotin, wherein the Type II biotin synthase holo-protein comprises, per polypeptide chain, a first [4Fe—4S] cluster (radical SAM (RS) cluster) coordinated to a CxxxCxxC motif in the polypeptide chain and a second [4Fe—4S] cluster.
A recombinant microorganism includes a transgene encoding a polypeptide of a Type II biotin synthase, wherein a holo-protein of the Type II biotin synthase comprises per polypeptide chain a first [4Fe—4S] cluster (radical SAM (RS) cluster) coordinated to a CxxxCxxC motif in the polypeptide chain and a second [4Fe—4S] cluster.
A method for producing biotin includes cultivating the recombinant microorganism in a growth medium to produce a culture; and recovering biotin from the culture.
These and other features and characteristics are more particularly described below.
The following is a brief description of the drawings wherein like elements are numbered alike and which are presented for the purposes of illustrating the exemplary embodiments disclosed herein and not for the purposes of limiting the same.
The inventors have discovered a new class of the radical S-adenosyl methionine (SAM) enzyme biotin synthase (BioB), referred to as “Type II biotin synthases.” Unexpectedly, Type II biotin synthases are shown to contain two [4Fe—4S] clusters per polypeptide chain in contrast to the previously identified biotin synthases, or “Type I biotin synthases”, which contain only one [4Fe—4S] cluster and one [2Fe—2S] cluster. The inventors determined X-ray crystal structures of two different Type II biotin synthases, Blautia obeum and Veillonella parvula HSIVP1, in the presence of the reaction substrates desthiobiotin (DTB) and SAM and observed the presence of two bound [4Fe—4S] clusters. One of the [4Fe—4S] clusters is a “radical SAM” (RS) cluster involved in AdoMet cleavage and ligated to a radical SAM binding motif CxxxCxxC in the primary sequence of the Type II BioB in which each of the cysteines coordinates to 3 of the Fe in the RS cluster. In the B. obeum Type II BioB sequence, the residues of the radical SAM binding motif are C-53xxxC-57xxC-60, while in the V. parvula Type II BioB sequence, the residues are C-79xxxC-83xxC-86. Three Fe atoms of the auxiliary [4Fe—4S] cluster are coordinated to three cysteine residues (C-52, C-138, and C-198 in the B. obeum Type II BioB; C-69, C-156, and C-216 in the V. parvula Type II BioB sequence. In the crystal structure, the fourth Fe in the auxiliary [4Fe—4S] cluster is coordinated to a fifth sulfur atom in close proximity to the DTB. The fifth sulfur is believed to be donated to DTB to create the biotin ring system, leaving the auxiliary [4Fe—4S] cluster intact and poised to accept a new fifth sulfur for subsequent rounds of catalysis. In vitro comparison of the biotin synthesis reaction rate of Type II BioBs and Type I BioBs unexpectedly showed that the Type II biotin synthases had a greater than 10-fold increase in enzymatic reaction rate compared to Type I biotin synthases.
A biotin synthase can be identified as a Type I or Type II biotin synthase by structural characterization, such as by X-ray crystallography, to determine the presence per polypeptide chain of a radical SAM [4Fe—4S] cluster and of an auxiliary [4Fe—4S] cluster (Type II) or an auxiliary [2Fe—2S] cluster (Type I).
Type II biotin synthases can also be identified from known biotin synthase polypeptide sequences by aligning the candidate biotin synthase sequence against a Type I BioB reference sequence, such as the E. coli biotin synthase (SEQ ID No:13), to determine if a cys-ser swap is present in the candidate sequence at the positions corresponding to E. coli S-43 and C-97. When the corresponding positions in the candidate sequence are cysteine and serine, respectively (i.e., show a cys-ser swap), the biotin synthase is a Type II biotin synthase. When the corresponding positions in the candidate sequence are serine and cysteine, respectively (i.e., no cys-ser swap), the biotin synthase is a Type I biotin synthase. Pairwise sequence alignment of the two sequences can be performed, for example, by using the BLAST program e.g. the BLASTP program (freely available online at the website of National Center for Biotechnology Information, U.S. National Library of Medicine, 8600 Rockville Pike, Bethesda, Md., 20894 USA). Multiple sequence alignment can be performed using one of the tools available online from the European Bioinformatics Institute (EMBL-EBI).
Examples of Type I Biotin synthases include those from Escherichia coli (SEQ ID No: 13); Candidatus Chloracidobacterium thermophilum B (SEQ ID No: 15; Streptomyces lydicus (SEQ ID No: 16); Paracoccus denitrificans (SEQ ID No: 17); Paracoccus denitrificans PD1222 (SEQ ID No: 18); Agrobacterium vitis (SEQ ID No: 19); Ruegeria pomeroyi (SEQ ID No: 20); Agrobacterium fabrum (SEQ ID No: 21); Wolbachia endosymbiont of Cimex lectularius (SEQ ID No: 22); Sphingomonas paucimobilis (SEQ ID No: 23); Acidithiobacillus ferrivorans (SEQ ID No: 24); Gallionella capsiferriformans (SEQ ID No: 25); Ralstonia eutropha (SEQ ID No: 26); Bordetella parapertussis (SEQ ID No: 27); Pusillimonas sp. (SEQ ID No: 28); Cenarchaeum symbiosum sp. (SEQ ID No: 29); Alicyclobacillus acidocaldarius sp. (SEQ ID No: 30); Geobacillus thermoglucosidasius (SEQ ID No: 31); Bacillus subtilis (SEQ ID No: 32); Lysinibacillus sphaericus (SEQ ID No: 33); Methylococcus capsulatus (SEQ ID No: 34); Leclercia adecarboxylata (SEQ ID No: 35); Chromohalobacter salexigens (SEQ ID No: 36); Pseudomonas spp (for example SEQ ID Nos: 37, 38, 39, 40, 41, 42, 43, 44, or 45).
Alternatively, Type II biotin synthases can be identified from known biotin synthase polypeptide sequences by aligning the candidate biotin synthase sequence against a Type II BioB reference sequence, such as the B. obeum biotin synthase (SEQ ID No:1), to determine whether the residues at the positions corresponding to B. obeum C-52 and S 106 are also a cys and a ser, respectively, indicating the candidate sequence is a Type II biotin synthase or are a ser and a cys, respectively, indicating the candidate sequence is a Type I biotin synthase.
A novel sequence can be identified as a potential biotin synthase using, for example, the online publically accessible InterPro database, which classifies protein sequences into families and predicts the presence of functionally important domains and sites using predictive models, referred to as “signatures” (See A L Mitchell, et al. (2019). InterPro in 2019: improving coverage, classification and access to protein sequence annotations. Nucleic Acids Research, January 2019; doi: 10.1093/nar/gkyI100). InterPro v. 75.0, released Jul. 4, 2019, had classified 18722 proteins as belonging to the Biotin synthase Family (IPR024177). The publically available software package InterProScan allows new sequences (protein or nucleic acid) to be scanned against InterPro's signature database for functional analysis and classification (P Jones, et al. (2014). InterProScan 5: genome-scale protein function classification. Bioinformatics, January 2014; doi:10.1093/bioinformatics/btu031). InterPro and its associated software are widely utilized by the scientific community.
Table 1 below tabulates UniProt accession numbers (entry identifiers) of Type II BioB proteins identified in the UniProt databases (The UniProt Consortium, Nucleic Acids Res. 47: D506-515 (2019)) available Aug. 27, 2019.
indicates data missing or illegible when filed
A method of producing biotin is disclosed. The method can comprise contacting desthiobiotin with a Type II biotin synthase (Type II BioB) holo-protein in vitro under conditions effective to produce biotin; and recovering the biotin.
The Type II BioB can be selected to provide desired characteristics in production of biotin, such as kinetic efficiency, absolute production, and scalability of the process. For example, the Type II biotin synthase can be a Blautia sp. biotin synthase, a Clostridium sp. biotin synthase, a Bacteroides sp. biotin synthase, a Porphyromonas sp. biotin synthase, a Veillonella sp. biotin synthase, a Cyanothece sp. biotin synthase, an Akkermansia sp. biotin synthase, or a combination thereof. Preferably, the type II the biotin synthase can be a Blautia obeum biotin synthase (SEQ ID NO:1), a Clostridium sp. HMSC19B10 biotin synthase (SEQ ID NO:2), a Bacteroides caccae (ATCC 43185) biotin synthase (SEQ ID NO:3), a [Clostridium] spiroforme DSM 1552 biotin synthase (SEQ ID NO:4), a Porphyromonas gingivalis (strain ATCC BAA-308/W83) biotin synthase (SEQ ID NO:5), a Bacteroides cellulosilyticus biotin synthase (SEQ ID NO:6), a Clostridium perfringens biotin synthase (SEQ ID NO:7), a Clostridium thermocellum biotin synthase (SEQ ID NO:8), a Veillonella parvula HSIVP1 biotin synthase (SEQ ID NO:9), a Cyanothece sp. (strain ATCC 51142) biotin synthase (SEQ ID NO:10), a Porphyromonas gingivalis (strain ATCC 33277) biotin synthase (SEQ ID NO:11), an Akkermansia muciniphila (strain ATCC BAA-835/Muc) biotin synthase (SEQ ID NO:12), or a combination thereof.
Contacting the desthiobiotin with the Type II biotin synthase holo-protein can occur in a cell-free system.
The conditions effective to produce biotin can include S-adenosylmethionine (SAM), NADPH, and one or more polypeptides to mediate transfer of an electron from NADPH to the radical SAM [4Fe—4S] cluster. Polypeptides to mediate transfer of an electron from NADPH to the radical SAM [4Fe—4S] cluster include a flavodoxin/ferredoxin-NADP reductase; a pyruvate-flavodoxin/ferredoxin oxidoreductase; a flavodoxin; a ferredoxin; or a combination thereof. In a cell-free system, amounts of the Type II BioB, DTB, SAM, and NADPH and the electron transfer polypeptides can be selected to optimize production of biotin.
Recovering the biotin from the reaction can be performed by any suitable method. For example, the enzymes can be precipitated from the reaction, and the biotin in the supernatant can be absorbed on active carbon, then eluted and purified further with an ion exchange resin. Alternatively, the reaction can be applied directly to an ion exchange resin and, after the elution, the biotin can be recrystallized from a mixture of alcohol and water.
Biotin can be produced using recombinant microorganisms, e.g., bacterial cells such as recombinant E. coli cells, expressing a Type II BioB by culturing the recombinant microorganism in a culture medium suitable for supporting growth as well as comprising a carbon source suitable for the biosynthesis of biotin.
The method for producing biotin can comprise: culturing a recombinant microorganism comprising a transgene encoding a polypeptide of a biotin synthase comprising two (4Fe—4S) clusters per polypeptide chain (Type II BioB) in a growth medium to produce a culture; and recovering biotin produced by the culture.
The culture medium and the temperature and time for cultivation of the recombinant microorganism are selected to optimize production of biotin.
The growth medium used in the method for producing biotin can comprise a carbon source selected from desthiobiotin, glucose, maltose, galactose, fructose, sucrose, arabinose, xylose, raffinose, mannose, lactose, or any combination thereof.
The pH of the culture medium can be about 5.0 to 9.0, preferably 6.5 to 7.5. The cultivation temperature can be about 10 to 400° C., preferably 26 to 30° C. The cultivation time may be about 1 to 10 days, preferably 2 to 7 days, more preferably about 2 to 4 days (48 to 96 hours). During cultivation, aeration and agitation usually give favorable results.
Recovering the biotin from the culture medium can be performed by any suitable method. For example, the cells can be removed from the culture medium, the desired product in the filtrate can be absorbed on active carbon, then eluted and purified further with an ion exchange resin. Alternatively, the culture filtrate can be applied directly to an ion exchange resin and, after the elution, the desired product can be recrystallized from a mixture of alcohol and water.
The Type II biotin synthase can be expressed in a heterologous microorganism. The expressed Type II biotin synthase can be purified and used in a cell-free system to produce biotin. Alternatively, the microorganism expressing the Type II biotin synthase can be cultured under conditions permitting biotin production. The amount of recovered biotin can be increased compared to amount of recovered biotin from culturing the same microorganism that does not express the biotin synthase. The microorganism can also comprise transgenes for expression of other enzymes or regulatory moieties involved in the biotin synthetic pathway to further enhance biotin production or in assembling Type II BioB holo-protein. For example, the microorganism can also comprise the plasmid pDB1282 containing the isc operon from Azotobacter vinelandii, which encodes the proteins IscS, IscU, IscA, HscB, HscA, and Fdx. The plasmid is inducible with arabinose and confers antibiotic resistance to ampicillin. Additionally or alternatively, the microorganism can also comprise the plasmid pPH151 containing the suf operon from E. coli, encoding the proteins SufA, SufB, SufC, SufD, SufS, and SufE, which is inducible with IPTG and confers antibiotic resistance to chloramphenicol.
Exemplary microorganisms for expressing a Type II BioB or for producing biotin include bacteria, yeast, and filamentous fungi.
Exemplary bacteria for expressing a Type II BioB or for producing biotin include a species of Escherichia, Bacillus, Brevibacterium, Burkholderia, Campylobacter, Corynebacterium, Serratia, Lactobacillus, Lactococcus, Acinetobacter, Acetobacter, Pseudomonas, and Vibrio natriegens; preferably the bacterium is a species of Escherichia or Corynebacterium; for example Escherichia coli or Corynebacterium glutamicum.
Exemplary yeast for expressing a Type II BioB or for producing biotin include Saccharomyces cerevisiae, Pichia pastoris, Hansenula polymorpha, Komagataella sp., Kluyveromyces lactis, and Yarrowia lipolytica.
Examples of filamentous fungi for expressing a Type II BioB or for producing biotin include Aspergillus, Trichoderma, Penicillium and Rhizopus species. Preferably the fungi is Trichoderma reesei, Aspergillus niger, or Aspergillus oryzae.
A recombinant microorganism comprising a transgene encoding a polypeptide of a biotin synthase, wherein the biotin synthase comprises two (4Fe—4S) clusters per polypeptide chain (Type II BioB) is disclosed. The Type II BioB transgene can be operably linked to a constitutive promoter.
Optionally, the recombinant microorganism may further comprise one or more additional transgenes encoding polypeptides that catalyze additional steps in the biotin biosynthetic pathway. An increase in the levels of those polypeptides that catalyze steps in the biotin biosynthetic pathway enhances the synthesis of both intermediates in the biotin pathway, and the end product of the pathway (biotin) in the cell.
The polypeptides that are encoded by the additional transgenes in the recombinant microbial cell, and whose activity serves to enhance the synthesis of both intermediates and products of the biotin pathway, can include a polypeptide having SAM (S-adenosylmethionine)-dependent methyltransferase activity (BioC); a polypeptide having 7-keto-8-aminopelargonic acid (KAPA) synthase activity (BioF); a polypeptide having 7,8-Diaminopelargonic Acid (DAPA) Synthase activity (BioA); or L-lysine: 8-amino-7-oxononanoate aminotransferase (BioK); a polypeptide having Desthiobiotin (DTB) Synthetase activity (BioD); a polypeptide having Pimeloyl-[acyl-carrier protein] methyl ester esterase (BioH); a polypeptide having 6-carboxyhexanoate-CoA ligase activity (BioW) or a combination of the foregoing.
The transgene encoding BioB together with one or more additional transgenes encoding polypeptides that catalyze additional steps in the biotin pathway, are located in the genome of the recombinant microorganism, either integrated into the chromosome or on a self-replicating plasmid. The transgenes encoding BioB and one or more enzymes in the biotin pathway (BioABFCD and H or W) can be present in the genome within one or more operon.
The promoter driving expression of the transgene encoding BioB together with one or more additional transgenes is preferably a non-native promoter, which can be a heterologous constitutive-promoter or an inducible-promoter. Examples of a suitable heterologous constitutive promoter include members of the apFab family [SEQ ID Nos: 46-48] while a suitable inducible promoter includes: pBad (arabinose inducible [SEQ ID No: 49] and Lacl [SEQ ID No: 50]. Suitable terminators include members of the apFAB terminator family including [SEQ ID No:51-53]. The selected promoter and terminator can be operably linked to the coding sequence for BioB. The selected promoter and terminator can also be operably linked to the coding sequence for BioB and to the coding sequence of the polypeptides of BioC, BioD, BioA, BioF, BioW BioH, or a combination thereof.
The terms “polypeptide,” “peptide”, and “protein” are used interchangeably herein to refer to a molecule formed from the linking, in a defined order, of at least two amino acids. The link between one amino acid residue and the next is an amide bond and is sometimes referred to as a peptide bond. A polypeptide can be obtained by a suitable method known in the art, including isolation from natural sources, expression in a recombinant expression system, chemical synthesis, or enzymatic synthesis. The terms also apply to amino acid polymers in which one or more amino acid residue is an artificial chemical mimetic of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers and non-naturally occurring amino acid polymers.
An “apo-protein” refers to a BioB polypeptide chain without its complement of Fe—S clusters, while a “holo-protein” refers to a BioB polypeptide chain refers to a BioB polypeptide chain with its complement of Fe—S clusters.
The terms “isolated” or “purified”, used interchangeably herein, refers to a nucleic acid, a polypeptide, or other biological moiety that is removed from components with which it is naturally associated. The term “isolated” can refer to a polypeptide that is separate and discrete from the whole organism with which the molecule is found in nature or is present in the substantial absence of other biological macro-molecules of the same type. The term “isolated” with respect to a polynucleotide can refer to a nucleic acid molecule devoid, in whole or part, of sequences normally associated with it in nature; or a sequence, as it exists in nature, but having heterologous sequences in association therewith; or a molecule disassociated from the chromosome. Purity and homogeneity are typically determined using analytical chemistry techniques, for example polyacrylamide gel electrophoresis or high performance liquid chromatography. A protein that is the predominant species present in a preparation is substantially purified. In some embodiments, the term “purified” means that the nucleic acid or protein is at least 85% pure, specifically at least 90% pure, more specifically at least 95% pure, or yet more specifically at least 99% pure
The term “recombinant” can be used to describe a nucleic acid molecule and refers to a polynucleotide of genomic, RNA, DNA, cDNA, viral, semisynthetic, or synthetic origin which, by virtue of its origin or manipulation is not associated with all or a portion of the polynucleotide with which it is associated in nature. The term “recombinant” as used with respect to a protein or polypeptide can refer to a polypeptide produced by expression of a recombinant polynucleotide. In general, the gene of interest is cloned and then expressed in a transformed organism, by a suitable method. The host organism, or recombinant organism, expresses the foreign gene to produce the protein under expression conditions.
An expression vector comprising a polynucleotide encoding a Type II BioB polypeptide is also disclosed.
The term “vector” means a nucleic acid sequence to express a target gene in a host cell. Examples include a plasmid vector, a cosmid vector, a bacteriophage vector, and a viral vector. Examples of viral vectors include a bacteriophage vector, an adenovirus vector, a retrovirus vector, and an adeno-associated virus vector.
For example, the vector may be an expression vector including a membrane targeting or secretion signaling sequence or a leader sequence, in addition to an expression control element such as promoter, operator, initiation codon, termination codon, polyadenylation signal, and enhancer. The vector may be manufactured in various ways depending on the purpose. An expression vector may include a selection marker for selecting a host cell containing the vector. Further, a replicable expression vector may include an origin of replication
The term “recombinant vector” or “expression vector” means a vector operably linked to a heterologous nucleotide sequence for the purpose of expression, production, and isolation of the heterologous nucleotide sequence. The heterologous nucleotide sequence can be a nucleotide sequence encoding all or part of a Type II BioB.
The recombinant vector may be constructed for use in prokaryotic or eukaryotic host cells. For example, when a prokaryotic cell is used as a host cell, the expression vector used generally includes a strong promoter capable of initiating transcription (for example, pLλ promoter, trp promoter, lac promoter, tac promoter, T7 promoter), a ribosome binding site for initiating translation, and a transcription/translation termination sequence. When a eukaryotic cell is used as a host cell, the vector used generally includes the origin of replication acting in the eukaryotic cell, for example f1 origin of replication, SV40 origin of replication, pMB1 origin of replication, adeno origin of replication, AAV origin of replication, or BBV origin of replication, but is not limited thereto.
A nucleic acid is “operably linked” when it is placed into a functional relationship with another nucleic acid sequence. For instance, a promoter or enhancer is operably linked to a coding sequence if it affects the transcription of the sequence. With respect to transcription regulatory sequences, operably linked means that the nucleic acid sequences being linked are contiguous and, where necessary to join two protein coding regions, contiguous and in reading frame.
A “transgene” is an exogenous gene that has been introduced into the genome of a bacterium by means of genetic engineering. In the context of the present invention, said genome includes both chromosomal and episomal genetic elements.
The term “sequence identity” as used herein, indicates a quantitative measure of the degree of homology between two amino acid sequences of substantially equal length. The two sequences to be compared must be aligned to give a best possible fit, by means of the insertion of gaps or alternatively, truncation at the ends of the protein sequences. The sequence identity can be calculated as ((Nref−Ndif) 100)/(Nref), wherein Ndif is the total number of non-identical residues in the two sequences when aligned and wherein Nref is the number of residues in one of the sequences. Readily available computer programs can be used to aid in the analysis of sequence identity. Sequence identity calculations are preferably automated using for example the BLAST program e.g. the BLASTP program, available on the internet from the National Center for Biotechnology Information. Multiple sequence alignment (MSA) can be performed with one of the programs for MSA available on the internet from, for example, the European Bioinformatics Institute. Two nucleic acid or two polypeptide sequences are “substantially identical” to each other when the sequences exhibit at least about 50%, specifically at least about 75%, more specifically at least about 80%-85%, at least about 90%, and most specifically at least about 95%-98% sequence identity over a defined length of the molecules.
The numbers of substitutions, insertions, additions or deletions of one or more amino acid residues in the polypeptide as compared to its comparator polypeptide can be limited, i.e. no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 substitutions, no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 insertions, no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 additions, and no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 deletions. Preferably the substitutions are conservative amino acid substitutions: limited to exchanges within members of group 1: Glycine, Alanine, Valine, Leucine, Isoleucine; group 2: Serine, Cysteine, Selenocysteine, Threonine, Methionine; group 3 proline; group 4: Phenylalanine, Tyrosine, Tryptophan; Group 5: Aspartate, Glutamate, Asparagine, Glutamine.
An “endogenous gene” a gene in a bacterial cell genome that is homologous in origin to a host bacterium (i.e. a native gene of the host bacterium). The endogenous gene may be genetically modified using tools known in the art whereby the genetically modified endogenous gene encodes a mutant polypeptide whose amino acid sequence differs at one or more position from the polypeptide encoded by the parent endogenous gene from which it was derived.
The term “genome” is the genetic material present in a cell or organism comprising all of the information needed to build and maintain that cell or organism. Herein, genome includes the genetic material in both chromosome(s) and plasmid(s) present within the cell or organism.
A “native gene” is an endogenous gene in a bacterial cell genome, homologous to host bacterium.
A “non-native promoter”, in the context of a recombinant microorganism, is a promoter that is operably-linked to a gene or transgene in the microorganism, which would not be found operably-linked to the gene or transgene in the microorganism cell found in nature.
The following examples are merely illustrative of the methods and compositions disclosed herein and are not intended to limit the scope hereof.
UV-vis spectrophotometry was performed using Agilent Technologies 8453. In general, UV-visible spectra at 10-20 μM protein by diluted concentrated stocks in size exclusion buffer
Diffraction-quality crystals were obtained by sitting-drop vapor diffusion at 20° C. in an anaerobic chamber maintained at <0.1 ppm oxygen (MBraun, Stratham, N.H.). Drops of 0.4 μL TEV-cleaved protein solution at 20 mg/mL in 25 mM HEPES, pH 7.5, 0.7 mM SAM, and 0.7 mM DTB were mixed with 0.4 μL precipitant (0.1 M ammonium acetate, 0.1 M Bis-tris, pH 5.5, 17% polyethylene glycol 10,000) and equilibrated against a solution of 0.5 M LiCl. Diffraction data were collected at the Advanced Photon Source (Argonne National Laboratory, Argonne Ill.) and the structure was solved by SAD phasing using the intrinsic Fe absorption.
Blautia obeum (formerly Ruminococcus obeum) is a species of anaerobic, gram-positive bacteria found in the human gut that was identified as having a biotin synthase-like protein (UniProtKB A5ZUL4) in its genomic sequence.
Alignment of the B. obeum amino acid sequence with the E. coli BioB sequence showed that the B. obeum sequence was roughly 50% identical to that of the E. coli BioB sequence as shown in
The B. obeum sequence was expressed and purified from BL-21(DE3) cells containing the pPH151 plasmid. The transformants were selected on an LB/agar plate containing 50 μg/mL kanamycin and 34 μg/mL chloramphenicol. A single colony was used to inoculate 20 mL of LB overnight culture containing the above antibiotics. 20 mL of the overnight culture was used to inoculate 2 L of LB media housed in a 2 L PYREX® media bottle. Cultures were grown with constant aeration using a sparging stone attached to a pressurized, 0.22 μm filtered air source all in a water bath maintained at 37° C. After 5 hr, aeration was stopped and the culture was placed in an ice bath for 1 hr. The culture was returned to a 22° C. water bath and light aeration was resumed. After 5 min, cysteine and IPTG were added to a final concentration of 600 and 500 μM, respectively. The culture was grown at 22° C. for ˜20 hr before being harvest by centrifugation at 10,000×g. Cell pellets were flash frozen and stored in liquid N2 until purification. All subsequent steps were carried out in an MBraun anaerobic chamber maintained at <0.1 ppm oxygen (MBraun, Stratham, N.H.). In a typical purification, ˜30 grams of cell paste was resuspended in 30 mL of lysis buffer containing 50 mM HEPES, pH 7.5, 300 mM KCl, 4 mM imidazole, 10 mM 2-mercaptoethanol (BME), 10% glycerol, and 1% Triton-X305. The resuspension was subjected to 50 rounds of sonic disruption (80% output, 3 s pulse on, 12 s pulse of) at 4° C. The lysate was cleared by centrifugation at 4° C. for 1 hr at 15,000×g. The supernatant was loaded with an ÄKTA express FPLC system onto a 5 mL fast-flow HisTrap™ column (GE Healthcare Life Sciences) equilibrated in lysis buffer lacking Triton-X305. The column was washed with 10 column volumes of lysis buffer before elution with 5 mL of buffer containing 50 mM HEPES, pH 7.5, 300 mM KCl, 300 mM imidazole, 10 mM BME, and 10% glycerol. The fractions containing protein, based on absorbance at 280 nm, were pooled and reconstituted with Fe and sulfur as previously described. The reconstituted proteins were then passed over a HiPrep 16/60 Sephacryl S-200 HR column equilibrated in 20 mM HEPES, pH 7.5, 300 mM KCl, 5 mM DTT, and 10% glycerol. The proteins were concentrated to ˜1 mL with a vivaspin 20 concentrator (Sartorius Stedium Biotech). The protein concentration was estimated by A280 using the extinction coefficient calculated based on the targets corresponding amino acid sequence.
As shown in
Crystals of the B. obeum protein were grown and the structure was determined by X-ray crystallography. A ribbon representation of the structure determined for the B. obeum protein in the presence of DTB and SAM is shown in
The active sites of E. coli BioB and the B. obeum BioB are compared in
When the sequence of the V. parvula BioB (Uniprot Accession TOTAB9; SEQ ID NO:9)) is aligned with the E. coli Type I BioB sequence (SEQ ID NO:13), the residues corresponding to E. coli BioB residues S-43 and C-97 in V. parvula BioB (C-69 and S-123) showed the occurrence of a cys-ser swap, indicating that V. parvula BioB is a Type II BioB.
This identification was confirmed by growing crystals of the V. parvula BioB, and determining the structure by X-ray crystallography, by procedures in general accordance with those described above for the B. obeum BioB. The structure of the V. parvula BioB showed the presence of two [4Fe—4S] clusters, the RS cluster and the 2nd cluster, per polypeptide chain confirming the identification by sequence alignment that the V. parvula BioB is a Type II BioB.
The disclosure herein include(s) at least the following aspects:
Aspect 1. A method of producing biotin comprises contacting desthiobiotin with a Type II biotin synthase (Type II BioB) holo-protein in vitro under conditions effective to produce biotin; and recovering the biotin, wherein the Type II biotin synthase holo-protein comprises, per polypeptide chain, a first [4Fe—4S] cluster (radical SAM (RS) cluster) coordinated to a CxxxCxxC motif in the polypeptide chain and a second [4Fe—4S] cluster.
Aspect 2. The method of aspect 1, wherein the Type II biotin synthase is a Blautia sp. biotin synthase, a Clostridium sp. biotin synthase, a Bacteroides sp. biotin synthase, a Porphyromonas sp. biotin synthase, a Veillonella sp. biotin synthase, a Cyanothece sp. biotin synthase, an Akkermansia sp. biotin synthase, or a combination thereof
Aspect 3. The method of aspect 1 or 2, wherein the Type II biotin synthase is a Blautia obeum biotin synthase (SEQ ID NO:1), a Clostridium sp. HMSC19B10 biotin synthase (SEQ ID NO:2), a Bacteroides caccae (ATCC 43185) biotin synthase (SEQ ID NO:3), a [Clostridium] spiroforme DSM 1552 biotin synthase (SEQ ID NO:4), a Porphyromonas gingivalis (strain ATCC BAA-308/W83) biotin synthase (SEQ ID NO:5), a Bacteroides cellulosilyticus biotin synthase (SEQ ID NO:6), a Clostridium perfringens biotin synthase (SEQ ID NO:7), a Clostridium thermocellum biotin synthase (SEQ ID NO:8), a Veillonella parvula HSIVP1 biotin synthase (SEQ ID NO:9), a Cyanothece sp. (strain ATCC 51142) biotin synthase (SEQ ID NO:10), a Porphyromonas gingivalis (strain ATCC 33277) biotin synthase (SEQ ID NO:11), an Akkermansia muciniphila (strain ATCC BAA-835/Muc) biotin synthase (SEQ ID NO:12), or a combination thereof.
Aspect 4. The method of any one of aspects 1 to 3, wherein the conditions comprise S-adenosylmethionine (SAM), NADPH, and one or more polypeptides to mediate transfer of an electron from NADPH to the radical SAM [4Fe—4S] cluster
Aspect 5. The method of aspect 4, wherein the one or more polypeptides to mediate transfer of an electron from NADPH to the radical SAM [4Fe—4S] cluster comprises a flavodoxin/ferredoxin-NADP reductase; a pyruvate-flavodoxin/ferredoxin oxidoreductase; a flavodoxin; a ferredoxin; or a combination thereof.
Aspect 6. The method of any one of aspects 1 to 5, wherein the conditions comprise a temperature of 30 C to 45 C.
Aspect 7. The method of any one of aspects 1 to 6, wherein the conditions comprise added Na2S
Aspect 8. The method of any one of aspects 1 to 7, wherein the contacting occurs in a cell-free system
Aspect 9. The method of any one of aspects 1 to 8, wherein the Type II biotin synthase holo-protein is a B. obeum Type II biotin synthase holo-protein.
Aspect 10. A recombinant microorganism comprises a transgene encoding a polypeptide of a Type II biotin synthase, wherein a holo-protein of the Type II biotin synthase comprises per polypeptide chain a first [4Fe—4S] cluster (radical SAM (RS) cluster) coordinated to a CxxxCxxC motif in the polypeptide chain and a second [4Fe—4S] cluster.
Aspect 11. The microorganism of aspect 10 wherein the transgene is operably linked to a constitutive promoter.
Aspect 12. The microorganism of aspect 10 or 11, wherein the microorganism is a bacterium, a yeast, or a filamentous fungus.
Aspect 13. The microorganism of aspect 12, wherein the microorganism is a bacterium, wherein the bacterium species is Escherichia, Bacillus, Brevibacterium, Burkholderia, Campylobacter, Corynebacterium, Pseudomonas, Serratia, Lactobacillus, Lactococcus, Acinetobacter, Pseudomonas, or Acetobacter.
Aspect 14. The microorganism of aspect 10 or 11, wherein the microorganism is a yeast, wherein the yeast is Saccharomyces cerevisiae, Pichia pastoris, Hansenula polymorpha, Komagataella sp., Kluyveromyces lactis, and Yarrowia lipolytica.
Aspect 15. The microorganism of aspect 10 or 11, wherein the microorganism is a filamentous fungus, wherein the filamentous fungus is a species of Aspergillus, Trichoderma, Penicillium, or Rhizopus.
Aspect 16. A method for producing biotin comprises cultivating the recombinant microorganism of any one of aspects 10 to 15 in a growth medium to produce a culture; and recovering biotin from the culture.
Aspect 17. The method of aspect 16 further comprising purifying the recovered biotin; or introducing the recombinant microorganism to the growth medium.
Aspect 18. The method of aspect 16 or 17, wherein the growth medium comprises a carbon source selected from glucose, maltose, galactose, fructose, sucrose, arabinose, xylose, raffinose, mannose, lactose, and a combination thereof.
Aspect 19. The method of any one of aspects 16 to 18, wherein cultivating is performed under conditions to produce biotin.
Aspect 20. The method of any one of aspects 16 to 19, wherein the recovered biotin is increased compared to recovered biotin from cultivating the same microorganism not expressing the Type II biotin synthase holo-protein.
Aspect 21. A method of producing biotin comprising aligning a query sequence with a reference sequence, wherein the reference sequence is a known Type I biotin synthase (Type I BioB) holo-protein, preferably an E. coli K12 Type 1 biotin synthase holo-protein sequence (SEQ ID NO: 13), which reference sequence contains Serine at amino acid 43 (Ser-43) and Cysteine at amino acid 97 (Cys-97);
identifying the amino acid of the query sequence corresponding to Ser-43 of the reference sequence to provide query corresponding amino acid 43;
identifying the amino acid of the query sequence corresponding to Cys-97 of the reference sequence to provide query corresponding amino acid 97;
classifying the query sequence as a Type II biotin synthase holo-protein if the query corresponding amino acid 43 is Cysteine and the query corresponding amino acid 97 is Serine to provide an identified Type II biotin synthase holo-protein sequence;
contacting desthiobiotin with a protein having the amino acid sequence of the identified Type II biotin synthase holo-protein sequence in vitro under conditions effective to produce biotin; and recovering the biotin.
Aspect 22. The method of any one of aspects 1-8, wherein the Type II biotin synthase holo-protein is a protein identified as having a Cysteine at the amino acid position corresponding to Ser-43 in SEQ ID NO: 13 and having a Serine at the amino acid position corresponding to Cys-97 in SEQ ID NO:13.
In general, the invention may alternately comprise, consist of, or consist essentially of, any appropriate components herein disclosed. The invention may additionally, or alternatively, be formulated so as to be devoid, or substantially free, of any components, materials, ingredients, adjuvants or species used in the prior art compositions or that are otherwise not necessary to the achievement of the function and/or objectives of the present invention. The endpoints of all ranges directed to the same component or property are inclusive and independently combinable (e.g., ranges of “less than or equal to 25 wt %, or 5 wt % to 20 wt %,” is inclusive of the endpoints and all intermediate values of the ranges of “5 wt % to 25 wt %,” etc.). Disclosure of a narrower range or more specific group in addition to a broader range is not a disclaimer of the broader range or larger group. “Combination” is inclusive of blends, mixtures, alloys, reaction products, and the like. Furthermore, the terms “first,” “second,” and the like, herein do not denote any order, quantity, or importance, but rather are used to denote one element from another. The terms “a” and “an” and “the” herein do not denote a limitation of quantity, and are to be construed to cover both the singular and the plural, unless otherwise indicated herein or clearly contradicted by context. “Or” means “and/or.” The suffix “(s)” as used herein is intended to include both the singular and the plural of the term that it modifies, thereby including one or more of that term (e.g., the film(s) includes one or more films). Reference throughout the specification to “one embodiment”, “another embodiment”, “an embodiment”, and so forth, means that a particular element (e.g., feature, structure, and/or characteristic) described in connection with the embodiment is included in at least one embodiment described herein, and may or may not be present in other embodiments. In addition, it is to be understood that the described elements may be combined in any suitable manner in the various embodiments.
The modifier “about” used in connection with a quantity is inclusive of the stated value and has the meaning dictated by the context (e.g., includes the degree of error associated with measurement of the particular quantity). The notation “±10%” means that the indicated measurement can be from an amount that is minus 10% to an amount that is plus 10% of the stated value. The terms “front”, “back”, “bottom”, and/or “top” are used herein, unless otherwise noted, merely for convenience of description, and are not limited to any one position or spatial orientation. “Optional” or “optionally” means that the subsequently described event or circumstance can or cannot occur, and that the description includes instances where the event occurs and instances where it does not. Unless defined otherwise, technical and scientific terms used herein have the same meaning as is commonly understood by one of skill in the art to which this invention belongs. A “combination” is inclusive of blends, mixtures, alloys, reaction products, and the like.
Unless otherwise specified herein, any reference to standards, regulations, testing methods and the like, refer to the standard, regulation, guidance, or method that is in force at the time of filing of the present application.
All cited patents, patent applications, and other references are incorporated herein by reference in their entirety. However, if a term in the present application contradicts or conflicts with a term in the incorporated reference, the term from the present application takes precedence over the conflicting term from the incorporated reference.
While particular embodiments have been described, alternatives, modifications, variations, improvements, and substantial equivalents that are or may be presently unforeseen may arise to applicants or others skilled in the art. Accordingly, the appended claims as filed and as they may be amended are intended to embrace all such alternatives, modifications variations, improvements, and substantial equivalents.
This application claims priority to U.S. Provisional Application No. 62/897,486, filed Sep. 9 2019, incorporated by reference in its entirety herein.
This invention was made in part with support from National Institutes of Health grant number P01 GM118303-01, National Institutes of Health grant number R21 AI133329, National Institutes of Health grant number U54 GM093342, and National Institutes of Health grant number U54 GM094662. The U.S. government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2020/049339 | 9/4/2020 | WO |
Number | Date | Country | |
---|---|---|---|
62897486 | Sep 2019 | US |