The instant application contains a Sequence Listing which has been submitted via EFS-web and is hereby incorporated by reference in its entirety. The ASCII copy created on 7 May2021, is named NREL PCT 20-86_ST25.txt and is 61 kilobytes in size.
Poly (ethylene terephthalate) (PET) is one of the most abundant manmade synthetic polyesters. Crystalline PET is being widely used for production of single-use beverage bottles, clothing, packaging, and carpeting materials. PET resistance to biodegradation due to limited accessibility to ester linkage, and disposal of PET products into the environment pose a serious threat to biosphere, particularly to marine environment. PET can be chemically recycled. However, the extra costs in chemical recycling are not justified when converting PET back to PET. Thus, there remains a need for alternative strategies for recycling/recovering/reusing plastics, for example, polyesters such as PET.
An aspect of the present disclosure is a non-naturally occurring enzyme that includes a first polypeptide that catalyzes the hydrolysis of a polyester to produce mono-(2-hydroxyethyl) terephthalate (MHET), a second polypeptide that catalyzes the cleavage of MHET to produce at least one of terephthalic acid or ethylene glycol, and a third polypeptide that links the first polypeptide with the second polypeptide. In some embodiments of the present disclosure, the enzyme may have a sequence identity that is greater than 80% to SEQ ID NO: 36.
In some embodiments of the present disclosure, the enzyme may have a turnover rate of up to 69 s−1. In some embodiments of the present disclosure, the third polypeptide may have about 8 amino acids. In some embodiments of the present disclosure, the enzyme may have a sequence identity that is greater than 80% to SEQ ID NO: 38. In some embodiments of the present disclosure, the enzyme may have a turnover rate of up to 77 s−1. In some embodiments of the present disclosure, the third polypeptide may have about 12 amino acids. In some embodiments of the present disclosure, the enzyme may have a sequence identity that is greater than 80% to SEQ ID NO: 40. In some embodiments of the present disclosure, the enzyme may have a turnover rate of up to 56−1. In some embodiments of the present disclosure, the third polypeptide may have about 20 amino acids.
In some embodiments of the present disclosure, the polyester may include at least one of polyethylene terephthalate (PET), polyglycolic acid, polylactic acid, polycaprolactone, polyhydroxyalkanoate, polyhydroxybutyrate, polyethylene adipate, polybutylene succinate, poly(3-hydroxybutyrate-co-3-hydroxyvalerate), polybutylene terephthalate, polytrimethylene terephthalate, and/or polyethylene naphthlate. In some embodiments of the present disclosure, the third polypeptide may have between 1 and 100 amino acids. In some embodiments of the present disclosure, the third polypeptide may include at least one of glycine, serine, proline, and/or threonine. In some embodiments of the present disclosure, the third polypeptide may covalently link the C-terminus of the second polypeptide to the N-terminus of the first polypeptide.
In some embodiments of the present disclosure, the enzyme may further include a fourth polypeptide capable of catalyzing hydrolysis of a polyester to produce mono-(2-hydroxyethyl) terephthalate (MHET) and a fifth polypeptide, where the fifth polypeptide covalently links the fourth polypeptide with the second polypeptide. In some embodiments of the present disclosure, the enzyme may further include a mutation of at least one of a S to G, a T to L, F, or Y, a E to N, T, D, Q, or G, a R to F, E, T, A, Y, I, S, W, L, V, Q, G, M, or N, a F to P, D, L, A, S, T, E, N, G, or V, a S to A, G, Q, P, E, D, or V, a S to R, A, K, Q, or G, a T to V or L, and/or a F to I. In some embodiments of the present disclosure, the mutation may occur in the second polypeptide.
An aspect of the present disclosure is a genetically modified organism that expresses the enzyme as described herein. In some embodiments of the present disclosure, the organism may include at least one of Pseudomonas putida and/or Escherichia coli.
An aspect of the present disclosure is a method for degrading a polyester, where the method includes contacting an organism as described herein with the polyester.
Some embodiments are illustrated in referenced figures of the drawings. It is intended that the embodiments and figures disclosed herein are to be considered illustrative rather than limiting.
The present disclosure may address one or more of the problems and deficiencies of the prior art discussed above. However, it is contemplated that some embodiments as disclosed herein may prove useful in addressing other problems and deficiencies in a number of technical areas.
Therefore, the embodiments described herein should not necessarily be construed as limited to addressing any of the particular problems or deficiencies discussed herein.
References in the specification to “one embodiment”, “an embodiment”, “an example embodiment”, “some embodiments”, etc., indicate that the embodiment described may include a particular feature, structure, or characteristic, but every embodiment may not necessarily include the particular feature, structure, or characteristic. Moreover, such phrases are not necessarily referring to the same embodiment. Further, when a particular feature, structure, or characteristic is described in connection with an embodiment, it is submitted that it is within the knowledge of one skilled in the art to affect such feature, structure, or characteristic in connection with other embodiments whether or not explicitly described.
As used herein the term “substantially” is used to indicate that exact values are not necessarily attainable. By way of example, one of ordinary skill in the art will understand that in some chemical reactions 100% conversion of a reactant is possible, yet unlikely. Most of a reactant may be converted to a product and conversion of the reactant may asymptotically approach 100% conversion. So, although from a practical perspective 100% of the reactant is converted, from a technical perspective, a small and sometimes difficult to define amount remains. For this example of a chemical reactant, that amount may be relatively easily defined by the detection limits of the instrument used to test for it. However, in many cases, this amount may not be easily defined, hence the use of the term “substantially”. In some embodiments of the present invention, the term “substantially” is defined as approaching a specific numeric value or target to within 20%, 15%, 10%, 5%, or within 1% of the value or target. In further embodiments of the present invention, the term “substantially” is defined as approaching a specific numeric value or target to within 1%, 0.9%, 0.8%, 0.7%, 0.6%, 0.5%, 0.4%, 0.3%, 0.2%, or 0.10% of the value or target.
As used herein, the term “about” is used to indicate that exact values are not necessarily attainable. Therefore, the term “about” is used to indicate this uncertainty limit. In some embodiments of the present invention, the term “about” is used to indicate an uncertainty limit of less than or equal to ±20%, ±15%, +10%, ±5%, or ±1% of a specific numeric value or target. In some embodiments of the present invention, the term “about” is used to indicate an uncertainty limit of less than or equal to ±1%, ±0.9%, ±0.8%, ±0.7%, ±0.6%, ±0.5%, ±0.4%, ±0.3%, ±0.2%, or ±0.1% of a specific numeric value or target.
A “vector” or “recombinant vector” is a nucleic acid molecule that is used as a tool for manipulating a nucleic acid sequence of choice or for introducing such a nucleic acid sequence into a host cell. A vector may be suitable for use in cloning, sequencing, or otherwise manipulating one or more nucleic acid sequences of choice, such as by expressing or delivering the nucleic acid sequence(s) of choice into a host cell to form a recombinant cell. Such a vector typically contains heterologous nucleic acid sequences not naturally found adjacent to a nucleic acid sequence of choice, although the vector can also contain regulatory nucleic acid sequences (e.g., promoters, untranslated regions) that are naturally found adjacent to the nucleic acid sequences of choice or that are useful for expression of the nucleic acid molecules.
A vector can be either RNA or DNA, either prokaryotic or eukaryotic, and typically is a plasmid. The vector can be maintained as an extrachromosomal element (e.g., a plasmid) or it can be integrated into the chromosome of a recombinant host cell. The entire vector can remain in place within a host cell, or under certain conditions, the plasmid DNA can be deleted, leaving behind the nucleic acid molecule of choice. An integrated nucleic acid molecule can be under chromosomal promoter control, under native or plasmid promoter control, or under a combination of several promoter controls. Single or multiple copies of the nucleic acid molecule can be integrated into the chromosome. A recombinant vector can contain at least one selectable marker.
The term “expression vector” refers to a recombinant vector that is capable of directing the expression of a nucleic acid sequence that has been cloned into it after insertion into a host cell or other (e.g., cell-free) expression system. A nucleic acid sequence is “expressed” when it is transcribed to yield an mRNA sequence. In most cases, this transcript will be translated to yield an amino acid sequence. The cloned gene is usually placed under the control of (i.e., operably linked to) an expression control sequence. The phrase “operatively linked” refers to linking a nucleic acid molecule to an expression control sequence in a manner such that the molecule can be expressed when introduced (i.e., transformed, transduced, transfected, conjugated or conduced) into a host cell.
Vectors and expression vectors may contain one or more regulatory sequences or expression control sequences. Regulatory sequences broadly encompass expression control sequences (e.g., transcription control sequences or translation control sequences), as well as sequences that allow for vector replication in a host cell. Transcription control sequences are sequences that control the initiation, elongation, or termination of transcription. Suitable regulatory sequences include any sequence that can function in a host cell or organism into which the recombinant nucleic acid molecule is to be introduced, including those that control transcription initiation, such as promoter, enhancer, terminator, operator and repressor sequences. Additional regulatory sequences include translation regulatory sequences, origins of replication, and other regulatory sequences that are compatible with the recombinant cell. The expression vectors may contain elements that allow for constitutive expression or inducible expression of the protein or proteins of interest. Numerous inducible and constitutive expression systems are known in the art.
Typically, an expression vector includes at least one nucleic acid molecule of interest operatively linked to one or more expression control sequences (e.g., transcription control sequences or translation control sequences). In one aspect, an expression vector may comprise a nucleic acid encoding a recombinant polypeptide, as described herein, operably linked to at least one regulatory sequence. It should be understood that the design of the expression vector may depend on such factors as the choice of the host cell to be transformed and/or the type of polypeptide to be expressed. As used herein, a “non-natural” polypeptide is synonymous with a “recombinant” polypeptide.
Expression and recombinant vectors may contain a selectable marker, a gene encoding a protein necessary for survival or growth of a host cell transformed with the vector. The presence of this gene allows growth of only those host cells that express the vector when grown in the appropriate selective media. Typical selection genes encode proteins that confer resistance to antibiotics or other toxic substances, complement auxotrophic deficiencies, or supply critical nutrients not available from a particular media. Markers may be an inducible or non-inducible gene and will generally allow for positive selection. Non-limiting examples of selectable markers include the ampicillin resistance marker (i.e., beta-lactamase), tetracycline resistance marker, neomycin/kanamycin resistance marker (i.e., neomycin phosphotransferase), dihydrofolate reductase, glutamine synthetase, and the like. The choice of the proper selectable marker will depend on the host cell, and appropriate markers for different hosts as understood by those of skill in the art.
Suitable expression vectors may include (or may be derived from) plasmid vectors that are well known in the art, such as those commonly available from commercial sources. Vectors can contain one or more replication and inheritance systems for cloning or expression, one or more markers for selection in the host, and one or more expression cassettes. The inserted coding sequences can be synthesized by standard methods, isolated from natural sources, or prepared as hybrids. Ligation of the coding sequences to transcriptional regulatory elements or to other amino acid encoding sequences can be carried out using established methods. A large number of vectors, including bacterial, yeast, and mammalian vectors, have been described for replication and/or expression in various host cells or cell-free systems, and may be used with the sequences described herein for simple cloning or protein expression.
Nucleic acids referred to herein as “isolated” are nucleic acids that have been removed from their natural milieu or separated away from the nucleic acids of the genomic DNA or cellular RNA of their source of origin (e.g., as it exists in cells or in a mixture of nucleic acids such as a library), and may have undergone further processing. Isolated nucleic acids include nucleic acids obtained by methods described herein, similar methods or other suitable methods, including essentially pure nucleic acids, nucleic acids produced by chemical synthesis, by combinations of biological and chemical methods, and recombinant nucleic acids that are isolated.
Nucleic acids referred to herein as “recombinant” are nucleic acids which have been produced by recombinant DNA methodology, including those nucleic acids that are generated by procedures that rely upon a method of artificial replication, such as the polymerase chain reaction (PCR) and/or cloning or assembling into a vector using restriction enzymes.
Recombinant nucleic acids also include those that result from recombination events that occur through the natural mechanisms of cells but are selected for after the introduction to the cells of nucleic acids designed to allow or make probable a desired recombination event. Portions of isolated nucleic acids that code for polypeptides having a certain function can be identified and isolated by, for example, the method disclosed in U.S. Pat. No. 4,952,501.
A nucleic acid molecule or polynucleotide can include a naturally occurring nucleic acid molecule that has been isolated from its natural source or produced using recombinant DNA technology (e.g., polymerase chain reaction (PCR) amplification, cloning) or chemical synthesis. Isolated nucleic acid molecules can include, for example, genes, natural allelic variants of genes, coding regions or portions thereof, and coding and/or regulatory regions modified by nucleotide insertions, deletions, substitutions, and/or inversions in a manner such that the modifications do not substantially interfere with the nucleic acid molecule's ability to encode a polypeptide or to form stable hybrids under stringent conditions with natural gene isolates. An isolated nucleic acid molecule can include degeneracies. As used herein, nucleotide degeneracy refers to the phenomenon that one amino acid can be encoded by different nucleotide codons. Thus, the nucleic acid sequence of a nucleic acid molecule that encodes a protein or polypeptide can vary due to degeneracies.
Unless so specified, a nucleic acid molecule is not required to encode a protein having enzyme activity. A nucleic acid molecule can encode a truncated, mutated, or inactive protein, for example. In addition, nucleic acid molecules may also be useful as probes and primers for the identification, isolation and/or purification of other nucleic acid molecules, independent of a protein-encoding function.
Suitable nucleic acids include fragments or variants that encode a functional enzyme. For example, a fragment can comprise the minimum nucleotides required to encode a functional enzyme. Nucleic acid variants include nucleic acids with one or more nucleotide additions, deletions, substitutions, including transitions and transversions, insertion, or modifications (e.g., via RNA or DNA analogs). Alterations may occur at the 5′ or 3′ terminal positions of the reference nucleotide sequence or anywhere between those terminal positions, interspersed either individually among the nucleotides in the reference sequence or in one or more contiguous groups within the reference sequence.
Embodiments of the nucleic acids include those that encode the polypeptides that function as an O-dealkylase or a reductase or functional equivalents thereof. A functional equivalent includes fragments or variants of these that exhibit the ability to function as an O-dealkylase or a reductase. As a result of the degeneracy of the genetic code, many nucleic acid sequences can encode a given polypeptide with a particular enzymatic activity. Such functionally equivalent variants are contemplated herein.
Nucleic acids may be derived from a variety of sources including DNA, cDNA, synthetic DNA, synthetic RNA, or combinations thereof. Such sequences may comprise genomic DNA, which may or may not include naturally occurring introns. Moreover, such genomic DNA may be obtained in association with promoter regions or poly (A) sequences. The sequences, genomic DNA, or cDNA may be obtained in any of several ways. Genomic DNA can be extracted and purified from suitable cells by means well known in the art. Alternatively, mRNA can be isolated from a cell and used to produce cDNA by reverse transcription or other means.
Also disclosed herein are recombinant vectors, including expression vectors, containing nucleic acids encoding polypeptides and/or enzymes. A “recombinant vector” is a nucleic acid molecule that is used as a tool for manipulating a nucleic acid sequence of choice or for introducing such a nucleic acid sequence into a host cell. A recombinant vector may be suitable for use in cloning, assembling, sequencing, or otherwise manipulating the nucleic acid sequence of choice, such as by expressing or delivering the nucleic acid sequence of choice into a host cell to form a recombinant cell. Such a vector typically contains heterologous nucleic acid sequences not naturally found adjacent to a nucleic acid sequence of choice, although the vector can also contain regulatory nucleic acid sequences (e.g., promoters, untranslated regions) that are naturally found adjacent to the nucleic acid sequences of choice or that are useful for expression of the nucleic acid molecules.
The nucleic acids described herein may be used in methods for production of enzymes and enzyme cocktails through incorporation into cells, tissues, or organisms. In some embodiments, a nucleic acid may be incorporated into a vector for expression in suitable host cells. The vector may then be introduced into one or more host cells by any method known in the art. One method to produce an encoded protein includes transforming a host cell with one or more recombinant nucleic acids (such as expression vectors) to form a recombinant cell. The term “transformation” is generally used herein to refer to any method by which an exogenous nucleic acid molecule (i.e., a recombinant nucleic acid molecule) can be inserted into a cell but can be used interchangeably with the term “transfection.”
Non-limiting examples of suitable host cells include cells from microorganisms such as bacteria, yeast, fungi, and filamentous fungi. Exemplary microorganisms include, but are not limited to, bacteria such as E. coli; bacteria from the genera Pseudomonas (e.g., P. putida or P. fluorescens), Bacillus (e.g., B. subtilis, B. megaterium or B. brevis), Caulobacter (e.g., C. crescentus), Lactoccocus (e.g., L. lactis), Streptomyces (e.g., S. coelicolor), Streptococcus (e.g., S. lividans), and Corynybacterium (e.g., C. glutamicum); fungi from the genera Trichoderma (e.g., T. reesei, T. viride, T. koningii, or T. harzianum), Penicillium (e.g., P. funiculosum), Humicola (e.g., H. insolens), Chrysosporium (e.g., C. lucknowense), Gliocladium, Aspergillus (e.g., A. niger, A. nidulans, A. awamori, or A. aculeatus), Fusarium, Neurospora, Hypocrea (e.g., H. jecorina), and Emericella; yeasts from the genera Saccharomyces (e.g., S. cerevisiae), Pichia (e.g., P. pastoris), or Kluyveromyces (e.g., K. lactis). Cells from plants such as Arabidopsis, barley, citrus, cotton, maize, poplar, rice, soybean, sugarcane, wheat, switch grass, alfalfa, miscanthus, and trees such as hardwoods and softwoods are also contemplated herein as host cells.
Host cells can be transformed, transfected, or infected as appropriate by any suitable method including electroporation, calcium chloride-, lithium chloride-, lithium acetate/polyene glycol-calcium, phosphate-, DEAE-dextran-, liposome-mediated DNA uptake, spheroplasting, injection, microinjection, microprojectile bombardment, phage infection, viral infection, or other established methods. Alternatively, vectors containing the nucleic acids of interest can be transcribed in vitro, and the resulting RNA introduced into the host cell by well-known methods, for example, by injection. Exemplary embodiments include a host cell or population of cells expressing one or more nucleic acid molecules or expression vectors described herein (for example, a genetically modified microorganism). The cells into which nucleic acids have been introduced as described above also include the progeny of such cells.
Vectors may be introduced into host cells such as those from bacteria or fungi by direct transformation, in which DNA is mixed with the cells and taken up without any additional manipulation, by conjugation, electroporation, or other means known in the art. Expression vectors may be expressed by bacteria or fungi or other host cells episomally or the gene of interest may be inserted into the chromosome of the host cell to produce cells that stably express the gene with or without the need for selective pressure. For example, expression cassettes may be targeted to neutral chromosomal sites by recombination.
Host cells carrying an expression vector (i.e., transformants or clones) may be selected using markers depending on the mode of the vector construction. The marker may be on the same or a different DNA molecule. In prokaryotic hosts, the transformant may be selected, for example, by resistance to ampicillin, tetracycline or other antibiotics. Production of a particular product based on temperature sensitivity may also serve as an appropriate marker.
Host cells may be cultured in an appropriate fermentation medium. An appropriate, or effective, fermentation medium refers to any medium in which a host cell, including a genetically modified microorganism, when cultured, is capable of growing or expressing the polypeptides described herein. Such a medium is typically an aqueous medium comprising assimilable carbon, nitrogen, and phosphate sources, but can also include appropriate salts, minerals, metals and other nutrients. Microorganisms and other cells can be cultured in conventional fermentation bioreactors and by any fermentation process, including batch, fed-batch, cell recycle, and continuous fermentation. The pH of the fermentation medium is regulated to a pH suitable for growth of the particular organism. Culture media and conditions for various host cells are known in the art. A wide range of media for culturing bacteria or fungi, for example, are available from ATCC. Exemplary culture/fermentation conditions and reagents are provided in the Examples that follow. Media may be supplemented with aromatic substrates like guaiacol, guaethol or anisole for dealkylation reactions.
As used herein, the terms “protein” and “polypeptide” are synonymous. “Peptides” are defined as fragments or portions of polypeptides, preferably fragments or portions having at least one functional activity as the complete polypeptide sequence. “Isolated” proteins or polypeptides are proteins or polypeptides purified to a state beyond that in which they exist in cells. In certain embodiments, they may be at least 10% pure; in others, they may be substantially purified to 80% or 90% purity or greater. Isolated proteins or polypeptides include essentially pure proteins or polypeptides, proteins or polypeptides produced by chemical synthesis or by combinations of biological and chemical methods, and recombinant proteins or polypeptides that are isolated. Proteins or polypeptides referred to herein as “recombinant” are proteins or polypeptides produced by the expression of recombinant nucleic acids.
Polypeptides may be retrieved, obtained, or used in “substantially pure” form, a purity that allows for the effective use of the protein in any method described herein or known in the art. For a protein to be most useful in any of the methods described herein or in any method utilizing enzymes of the types described herein, it is most often substantially free of contaminants, other proteins and/or chemicals that might interfere or that would interfere with its use in the method (e.g., that might interfere with enzyme activity), or that at least would be undesirable for inclusion with a protein. In an embodiment, a non-naturally occurring enzyme may also be referred to as a recombinant protein Among other things, the present disclosure relates to fusion proteins, chimeric enzymes, for depolymerizing plastics, for example, polyethylene terephthalate (PET). As described herein, fusion proteins are disclosed having at least a two-enzyme system of a first enzyme (i.e., a first polypeptide) for deconstructing PET (i.e., a PETase) to its constituent monomers, including mono-(2-hydroxyethyl) terephthalate (MHET), and a second enzyme (i.e., a second polypeptide), a MHETase, which cleaves the MHET to yield terephthalic acid (TPA) and ethylene glycol (EG).
In some embodiments of the present disclosure, a fusion protein 100 may include a first polypeptide 110 capable of catalyzing hydrolysis of a polyester to produce a first intermediate covalently linked to a second polypeptide 120 capable of catalyzing cleavage of the first intermediate to produce smaller molecular weight compounds. The first polypeptide 110 may be covalently linked to the second polypeptide 120 by a third polypeptide, for example a flexible chain of amino acids. For the example where the polyester includes polyethylene terephthalate (PET), a first polypeptide 110 capable of catalyzing hydrolysis of the PET to produce at least mono-(2-hydroxyethyl) terephthalate (MHET) is referred to herein as a PETase and the second polypeptide 120 capable of further degrading the MHET to at least one of terephthalic acid and/or ethylene glycol is referred to herein as a MHETase.
In some embodiments of the present disclosure, a fusion protein 100 may be capable of degrading a plastic such as a polyester to smaller molecular weight compounds that may be reused to produce valuable materials. Examples of polyesters that may be degraded using the enzymes, organisms, and methods described herein include at least one of polyethylene terephthalate (PET), polyglycolic acid, polylactic acid, polycaprolactone, polyhydroxyalkanoate, polyhydroxybutyrate, polyethylene adipate, polybutylene succinate, poly(3-hydroxybutyrate-co-3-hydroxyvalerate), polybutylene terephthalate, polytrimethylene terephthalate, and/or polyethylene naphthlate.
In some embodiments of the present disclosure, at least one of the first polypeptide (e.g., PETase) and/or the second polypeptide (e.g., MHETase) may be derived from at least one of a bacterium and/or a fungus. In some embodiments of the present disclosure, the first polypeptide and/or the second polypeptide may be derived from a fungus such as Fusarium solani. In some embodiments of the present disclosure, the first polypeptide and/or the second polypeptide may be derived from a bacterium from a family that includes at least one of Comamonadaceae and/or Nocardiopsaceae. In some embodiments of the present disclosure, the first polypeptide and/or the second polypeptide may be derived from a bacterium from a genus that includes at least one of Ideonella, Comamonas, Hydrogenophaga, and/or Thermobifida. In some embodiments of the present disclosure, the first polypeptide and/or the second polypeptide may be derived from a bacterium that includes at least one of Ideonella sakaiensis, and/or Comamonas thiooxydans.
In some embodiments of the present disclosure, a third polypeptide 130 that covalently links a first polypeptide 110 to a second polypeptide may include between 1 amino acid and 100 amino acids, inclusively. In an embodiment, a third peptide is from about 10 to about 50 amino acids. In an embodiment, a third peptide is from about 20 to about 50 amino acids. In an embodiment, a third peptide is from about 10 to about 80 amino acids. In an embodiment, a third peptide is from about 20 to about 80 amino acids. In an embodiment, a third peptide is from about 10 to about 90 amino acids. In an embodiment, a third peptide is from about 20 to about 90 amino acids. In some embodiments of the present disclosure a third polypeptide 130, i.e., a linking protein chain, may include at least 2 amino acids, at least 5 amino acids, at least 8 amino acids, at least 11 amino acids, at least 14 amino acids, at least 17 amino acids, or at least 20 amino acids. In some embodiments of the present disclosure a third polypeptide 130, i.e., a linking protein chain, may include up to 25 amino acids, up to 50 amino acids, up to 75 amino acids, or up to 100 amino acids. A linking protein may be constructed of amino acids that include, among others, at least one of glycine, serine, proline, and/or threonine. In some embodiments of the present disclosure, a third polypeptide 130 (i.e., a linking protein chain) may covalently link the C-terminus of the second polypeptide 120 to the N-terminus of the first polypeptide 110. In some embodiments of the present disclosure, a first polypeptide 110 may be covalently linked to a third polypeptide 130 by a maleimide crosslinker, provided each polypeptide has a sulfhydryl group (—SH). Examples of a maleimide include bis-maleimidoethane and 1,4-di(maleimido)butane.
In some embodiments of the present disclosure, at least one of the first polypeptide 110 and/or the second polypeptide 120 may include a mutation to at least one amino acid, resulting in improved catalytic activity by the mutated polypeptide, as described herein. In some embodiments of the present disclosure at least one of the amino acids of a MHETase as described herein, may be mutated at least one of the following locations along the MHETase polypeptide: 131 (S to G), 133 (T to L, F, or Y), 226 (E to N, T, D, Q, or G), 411 (R to F, E, T, A, Y, I, S, W, L, V, Q, G, M, or n), 415 (F to P, D, L, A, S, T, E, N, G, or V), 416 (S to A, G, Q, P, E, D, or V), 419 (S to R, A, K, Q, or G), 494 (a TO V or L), or 495 (F to I). (See
In an embodiment, additional enzymes are contemplated herein that at least 80% sequence identity to the enzymes disclosed herein. In other embodiments, additional enzymes are contemplated herein that at least 85%, 90%, 95%, 98%, 99%, and up to 100% sequence identity to the enzymes disclosed herein.
As described herein, an organism may be genetically modified to manufacture the fusion proteins described herein. In some embodiments of the present disclosure, an organism for producing a fusion protein may include a bacterium such as at least one of a Pseudomonas putida and/or Escherichia coli. Further, as described herein, a plastic (e.g., PET) may be degraded to smaller molecular weight compounds by mixing and/or contacting at least one of the fusion proteins and/or organisms producing the fusion proteins with the plastic, where the mixing/contacting results in the degrading of the plastic to smaller molecular weight components.
As shown herein, fusion proteins (i.e., chimeric proteins) of MHETase and PETase can improve PET degradation and MHET hydrolysis rates. As described below in more detail, in view of the synergistic relationship between PETase and MHETase on amorphous PET, the relationship between the proximity of the two enzymes and hydrolytic activity was examined. Chimeric proteins covalently linking the C-terminus of MHETase to the N-terminus of PETase using flexible glycine-serine linkers of 8, 12, and 20 total glycine and serine residues were generated and tested for degradation of amorphous PET (see
In addition, as shown herein, PETase and MHETase act synergistically during PET depolymerization. While MHET is susceptible to hydrolysis by a number of PET-degrading cutinases, I. sakaiensis favors the action of two enzymes for PET degradation to liberate TPA and EG. To better understand this two-enzyme system, the extent of hydrolysis was measured of a commercial amorphous PET substrate over 96 hours at 30° C. using PETase and MHETase at varying concentrations (see
Further, using the multiple sequence alignment of 6,671 tannase family sequences, conservation analysis was performed with MHETase sequence positions as a reference (see
In an embodiment, a MHETase-8 amino acid linker-PETase chimeric enzyme was created having a DNA sequence of SEQ ID NO: 35 and an expressed polypeptide sequence of SEQ ID NO: 36. The expressed chimeric enzyme with an 8 aa linker (SEQ ID NO: 36) exhibited a turnover number of 68.91+/−8.66−1. In an embodiment, a MHETase-12 amino acid linker-PETase chimeric enzyme was created having a DNA sequence of SEQ ID NO: 37 and an expressed polypeptide sequence of SEQ ID NO: 38. The expressed chimeric enzyme with a 12 aa linker (SEQ ID NO: 38) exhibited a turnover number of 76.94+/−12.89 s−1. In an embodiment, a MHETase-20 amino acid linker-PETase chimeric enzyme was created having a DNA sequence of SEQ ID NO: 39 and an expressed polypeptide sequence of SEQ ID NO: 40. The expressed chimeric enzyme with a 20 amino acid linker (SEQ ID NO: 40) exhibited a turnover number of 56.25+/−4.27 s−1.
Plasmid construction (see Table 2 for plasmid construction, Table 3 for synthesized DNA fragments and (where applicable) translated polypeptide sequences, and Table 3 for primers): pET-21b(+) (EMD Millipore)-based plasmids for expression of the various Ideonella sakaiensis PETase and MHETase enzymes, as well as homologous, and mutant proteins were either synthesized by Twist Bioscience or constructed using NEBuilder® HiFi DNA Assembly Master Mix (New England Biolabs) and/or the Q5® Site-Directed Mutagenesis Kit (New England Biolabs) such that each protein has a C-terminal hexa-histidine epitope tag. For DNA assembly, DNA fragments were either amplified using Q5® High-Fidelity 2X Master Mix (New England Biolabs) or synthesized by Integrated DNA Technologies. Kits and master mixes were used according to the manufacturer's instructions. Plasmids were initially transformed into NEB® 5-alpha F′Iq Competent E. coli (New England Biolabs) and confirmed using Sanger sequencing by GENEWIZ, Inc.
Protein expression and purification: For initial screening for soluble protein expression of the proteins of interest, various cell lines and induction methods were used to purify protein for kinetic assays. For expression and purification, OverExpress™ E. coli C41 (DE3) (Lucigen) cells were transformed with pET21b(+) plasmid constructed with the gene of interest. Single colonies from transformation were then inoculated into a starter culture of Luria Broth (LB) media containing 100 μg/mL ampicillin and grown at 37° C. overnight. The starter culture was inoculated at a 100-fold dilution into a 2xYT medium containing 100 μg/mL ampicillin and grown at 37° C. until the optical density measured at 600 nM (OD600) reached between 0.6 and 0.8. Protein expression was then induced by addition of isopropyl β-D-1-thiogalactopyranoside (IPTG) to a final concentration of 1 mM. Cells were maintained at 20° C. for 18 to 24 hours following IPTG induction, harvested by centrifugation, and stored at −80° C. until purification. Harvested cells were resuspended in a lysis buffer (300 mM NaCl, 10 mM imidazole, 20 mM Tris HCl, pH 8.0) and lysed using a bead beater (BioSpec Products, Inc.). Lysate was clarified by centrifugation at 40,000×g for 45 minutes. Clarified lysate was then applied to a 5 mL HisTrap HP (GE Healthcare) Ni-NTA column using an ÄKTA Pure chromatography system (GE Healthcare) and eluted using 300 mM NaCl, 300 mM imidazole, 20 mM Tris HCl, pH 8.0. Resulting fractions containing proteins of interest were applied to a Sephacryl S-100 26/60 HR (GE Healthcare) size exclusion column equilibrated with 100 mM NaCl, 20 mM Tris HCl, pH 7.5 for biochemical assays, or the fractions were applied to a Superdex 75 pg 16/60 (GE Healthcare) size exclusion column equilibrated with 100 mM NaCl, 20 mM Tris HCl, pH 7.5 for crystallography. Protein in eluted fractions from Ni-NTA and size exclusion columns were assessed using SDS-PAGE with Coomassie staining and Western blot using primary antibody against the hexa-histidine epitope tag (Invitrogen). Total protein was assessed by BCA assay. For proteins that did not express, or expressed in inclusion bodies, using the above-described expression protocol, additional E. coli expression cell lines were tested, including Rosetta 2 (DE3) (Novagen), BL21 (DE3), and Lemo21 (DE3) (New England Biolabs), as was expression by autoinduction at 30° C. in ZYP-5052 media.
MHETase:PETase fusion proteins: Fusion proteins were expressed and purified as described above with the following noted exceptions: Single colonies from transformation into C41 (DE3) competent cells were used to inoculate a starter culture of 200 mL Terrific Broth (TB) media containing 100 μg/mL ampicillin for overnight outgrowth at 37° C. From the starter culture, 50 mL was used to inoculate 1 L of TB media containing 100 μg/mL ampicillin. For purification, cells were disrupted by sonication. In the final chromatography step a Superdex 200 pg 16/600 (GE Healthcare) size exclusion column equilibrated with 100 mM NaCl, 20 mM Tris HCl, pH 7.5 was used.
Crystallography. After purification, as described above, MHETase protein was concentrated to a range of concentrations (9-14 mg/mL) and dialyzed into 100 mM NaCl, 10 mM Tris, pH 7.0 for crystallography.
For seleno-methionine labeling of MHETase, K-MOPS minimal media was used. Cells were grown to an OD600 of 0.5 after which 100 mg/L of DL-seleno-methionine (Sigma), 100 mg/L lysine, threonine and phenylalanine, leucine, isoleucine and valine were added as solids. IPTG (1 mM final concentration) was then added after 20 min and cells were grown for a further 16 h at 20° C. Seleno-methionine labeled protein was purified as described above. MHETase was crystallized at a range of concentrations from 9-14 mg/mL by sitting-drop vapor diffusion. Several conditions yielded crystals, four of which were used to obtain datasets, one of which contained seleno-methionine labelled protein. The crystals were cryo-cooled in liquid nitrogen after the addition of glycerol to 20% (v/v) while leaving the other components of the mother liquor at the same concentration. Seleno-methionine MHETase crystals belonging to space group P22121 were used to obtain phase information using the 103 beamline at the Diamond Light Source (Oxford, UK). Data were obtained from 3600 images collected at 0.9795 A with 0.1° increments. All images were integrated using XDS (4) and scaled using SCALA. Phases were obtained using PHASERSAD in the CCP4i software in combination with PARROT and SHELXD. The initial output was subsequently built using BUCCANEER and further refined using iterative rounds of COOT and PHENIX. One molecule of MHETase was observed in the asymmetric unit of the P22121 seleno-methionine SAD dataset. Three additional native datasets, each containing 1800 images collected at 0.1° increments, were collected at beamline 103 of the Diamond Light Source. The structure of native MHETase were obtained using molecular replacement from a refined molecule of MHETase obtained initially from the seleno-methionine SAD data. All structures were refined using iterative rounds of COOT and PHENIX.
Determination of enzyme turnover rates. Comparative assays for each enzyme were performed at the same enzyme and substrate concentration. Reactions were performed in triplicate over a 15 min time course using 5 nM enzyme concentration and 250 μM MHET in 90 mM NaCl, 10% (v/v) DMSO, 45 mM sodium phosphate, pH 7.5, at 30° C. Reactions were terminated using an equal volume of 100% methanol followed by heat treatment at 85° C. for 10 min. Product and substrate were quantified by HPLC. Apparent turnover rate (kcat) was determined by terephthalic acid (TPA) produced.
Michaelis-Menten kinetics of MHETase and variants. Reactions were performed in triplicate over a 10 min time course using 5 nM enzyme and substrate concentrations ranging from 10 μM to 250 μM MHET in 90 mM NaCl, 10% (v/v) DMSO, 45 mM sodium phosphate, pH 7.5, at 30° C. Each reaction was terminated using an equal volume of 100% methanol and heat treatment at 85° C. for 10 min. Product and substrate were quantified by HPLC. Initial reaction velocities were calculated from TPA produced over time and kinetic parameters were determined by nonlinear regression of the initial velocities fit to the Michealis-Menten equation with substrate inhibition using GraphPad Prism version 8.4.1 for MacOS (GraphPad Software, San Diego, Calif. USA), as follows:
While both substrate inhibition and product inhibition are possible in these reactions, the relationship between initial reaction velocity and initial substrate concentration indicates substrate inhibition predominates in these reaction conditions. Low substrate concentrations were considered in these kinetic studies in order to minimize the effect of substrate inhibition.
Enzymatic degradation of PET film. Amorphous PET film (2-3% crystallinity, Goodfellow, UK) was incubated with enzyme of interest in polypropylene tubes containing 90 mM NaCl, 10% (v/v) DMSO, 45 mM sodium phosphate, pH 7.5, at 30° C. for 96 hours. The reaction was terminated by addition of equal volume 100% methanol and PET coupons were removed from the reaction solution. The reaction solution was heat treated at 85° C. for 10 minutes. PET coupons were washed with consecutive rinses of 1% SDS, 100% DMSO, ultrapure water, and 95% ethanol. Coupons were then vacuum dried for 24 h at 60° C. for scanning electron microscopy.
Activity assay of MHETase with non-MHET substrates. Evaluation of MHETase activity was performed in triplicate using 5 nM enzyme concentration and 25 μM, 50 μM, and 250 μM substrate concentration at 30° C. for 24 h in a 0.5 mL reaction volume. The reaction was carried out in 90 mM NaCl, 10% (v/v) DMSO, 45 mM sodium phosphate, pH 7.5, reaction buffer with three concentrations of each substrate (MHET, MHEI, or MHEF). Reactions commenced upon addition of enzyme or an equal volume of reaction buffer for the no enzyme controls. At the end of 24 h the reactions were terminated using an equal volume of 100% DMSO and heat treatment at 85° C. for 10 min. Product and substrate were analyzed by HPLC. Values reported as percentage of substrate hydrolyzed into product.
HPLC method. Standards of BHET, TPA, 2,5-furandicarboxylic acid, and isophthalate were obtained from Sigma Aldrich. MHET, MHEI, and MHEF were synthesized as described above. Analyte analysis of samples was performed on an Agilent 1260 LC system (Agilent Technologies, Santa Clara, Calif.) equipped with a G1315A diode array detector (DAD). Each sample and standard were injected using a volume of 10 μL onto a Phenomenex Luna C18(2) column, 5 μm, 4.6×150 mm (Phenomenex, Torrance, Calif.). The column temperature was maintained at 40° C. and the mobile phase used to separate the analytes of interest was composed of 20 mM phosphoric acid in water (A) and 100% methanol (B). The separation was carried out using a constant flow rate of 0.6 mL/min and a gradient program of: at t=0 min (A)=80% and (B)=20%; at t=15 min (A)=35% and (B)=65%; at t=15.01 min through 20 min (A) =80% and (B)=20% for a total run time of 20 min. The calibration curve for each analyte was evaluated between concentrations of 0.1-200 mg/L. DAD detection at a wavelength of 240 nm was performed for each analyte. Ten calibration standards were used with an r2 coefficient of 0.995 or better and a calibration verification standard (CVS) at 100 mg/L for each analyte was analyzed every 18 samples to ensure the integrity of the initial calibration. Samples were diluted with an equal volume of ultrapure water for analysis.
Comamonas
thiooxydans
Comamonas thiooxydans (Genbank
Comamonas
thiooxydans with
Comomonas thiooxydans (Genbank
Hydrogenophaga
Hydrogenophaga sp. PML113 (Genbank
Hydrogenophaga
Hydrogenophaga sp. PML113 (Genbank
Ideonella sakaiensis
sakaiensis
thiooxydans expression
Table 5 depicts the Michaelis-Menten kinetic parameters of fitting initial reaction velocities of enzymatic turnover for Is MHETase, Is MHETase S131G, Comamonas thiooxydans MHETase, and Hydrogenophaga sp. PML113 MHETase at MHET substrate concentrations between 10.M and 250.M using the Michaelis-Menten model with substrate inhibition. Non-linear regression was performed using GraphPad Prism (8.4.1) along with 95 confidence intervals for each parameter and R2 value given for fit of the model to the data.
Comamonas
thiooxydans
Hydrogenophaga
The foregoing discussion and examples have been presented for purposes of illustration and description. The foregoing is not intended to limit the aspects, embodiments, or configurations to the form or forms disclosed herein. In the foregoing Detailed Description for example, various features of the aspects, embodiments, or configurations are grouped together in one or more embodiments, configurations, or aspects for the purpose of streamlining the disclosure. The features of the aspects, embodiments, or configurations, may be combined in alternate aspects, embodiments, or configurations other than those discussed above. This method of disclosure is not to be interpreted as reflecting an intention that the aspects, embodiments, or configurations require more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive aspects lie in less than all features of a single foregoing disclosed embodiment, configuration, or aspect. While certain aspects of conventional technology have been discussed to facilitate disclosure of some embodiments of the present invention, the Applicants in no way disclaim these technical aspects, and it is contemplated that the claimed invention may encompass one or more of the conventional technical aspects discussed herein. Thus, the following claims are hereby incorporated into this Detailed Description, with each claim standing on its own as a separate aspect, embodiment, or configuration.
This application claims priority from U.S. Provisional Patent Application No. 63/022,784 filed on May 11, 2020, the contents of which is incorporated herein by reference in their entirety.
This invention was made with government support under Contract No. DE-AC36-08G028308 awarded by the Department of Energy. The government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2021/031610 | 5/10/2021 | WO |
Number | Date | Country | |
---|---|---|---|
63022784 | May 2020 | US |