The content of the electronically submitted sequence (Name 4431-064PC01_SL_ST25.txt; Size: 82,152 bytes; and Date of Creation: Feb. 10, 2021) is incorporated herein by reference in its entirety.
Described herein are monomeric prolyl 4-hydroxylase proteins and their use in fermentation, methods for production of said proteins, and methods for in vitro and in vivo hydroxylation of proteins.
There is an entire industry using microorganisms to make compounds for commercial applications. The microorganisms are typically engineered with DNA necessary to make the compounds. Examples of these microorganisms include yeast and bacteria. Compounds that are made include drugs, fragrances, flavors, proteins and the like.
Engineered proteins are created through protein engineering, mutagenesis and protein evolution. One purpose of creating engineered proteins in drug development is to improve their activity under various reaction conditions.
In some embodiments, this disclosure provides a yeast host cell comprising a recombinant monomeric prolyl 4-hydroxylase. In some embodiments, the monomeric prolyl 4-hydroxylase can be secreted. In certain embodiments, the recombinant monomeric prolyl 4-hydroxylase can be from a virus, algae, or a plant. In some embodiments, the recombinant monomeric prolyl 4-hydroxylase can be from mimivirus. In one embodiment, the recombinant monomeric prolyl 4-hydroxylase can be from Arabidopsis thaliana. In some embodiments, the recombinant monomeric prolyl 4-hydroxylase can be from C. reinhardtii. In some embodiments, the recombinant monomeric prolyl 4-hydroxylase can be from Paramecium bursaria Chlorella virus-1. In some embodiment, the recombinant monomeric prolyl 4-hydroxylase can have at least 80% identical to a prolyl 4-hydroxylase selected from the group consisting of: SEQ ID NOs: 2, 3, 6, 7 and 8. In certain embodiment, the yeast can be Pichia.
In some embodiments, the yeast host cell can further comprise a second protein to be hydroxylated. In certain embodiments, the second protein can be selected from the group consisting of: collagen, recombinant collagen, and collagen-like proteins.
In some embodiments, this disclosure provides a microorganism comprising a recombinant monomeric prolyl 4-hydroxylase, wherein the recombinant monomeric prolyl 4-hydroxylase can be from algae or a plant. In certain embodiments, the monomeric prolyl 4-hydroxylase can be secreted. In some embodiments, the recombinant monomeric prolyl 4-hydroxylase can be from Arabidopsis thaliana. In certain embodiments, the recombinant monomeric prolyl 4-hydroxylase can be from C. reinhardtii. In some embodiments, the recombinant monomeric prolyl 4-hydroxylase can be at least 80% identical to a prolyl 4-hydroxylase selected from the group consisting of: SEQ ID NOs: 7 and 8.
In some embodiments, the microorganism can be a yeast or a bacteria. In some embodiments, the microorganism can be E. coli. In other embodiments, the microorganism can be Pichia.
In some embodiments, the microorganism can further comprise a second protein to be hydroxylated. In some embodiments, the second protein can be selected from the group consisting of: collagen, recombinant collagen, and collagen-like proteins.
In some embodiments, this disclosure provides a method of producing a recombinant monomeric prolyl 4-hydroxylase, comprising purifying the recombinant monomeric prolyl 4-hydroxylase from a yeast host cell disclosed herein.
In some embodiments, this disclosure provides a method of producing a recombinant monomeric prolyl 4-hydroxylase, comprising purifying the recombinant monomeric prolyl 4-hydroxylase from a microorganism disclosed herein.
In some embodiments, this disclosure provides an in vitro method for hydroxylating a protein comprising: lysing a microorganism comprising a protein to be hydroxylated to create a lysate; adding a specific concentration of a monomeric prolyl 4-hydroxylase to the lysate; and incubating the lysate and the monomeric prolyl 4-hydroxylase in reaction conditions that promote the hydroxylation of the protein by the a monomeric prolyl 4-hydroxylase.
In certain embodiments, this disclosure provides an in vitro method for hydroxylating a protein comprising: lysing a first microorganism comprising a protein to be hydroxylated to create a lysate; adding a specific concentration of a monomeric prolyl 4-hydroxylase to the lysate; and incubating the lysate and the monomeric prolyl 4-hydroxylase in reaction conditions that promote the hydroxylation of the protein by the a monomeric prolyl 4-hydroxylase.
In some embodiments, this disclosure provides an in vitro method for hydroxylating a protein comprising: adding a specific concentration of a monomeric prolyl 4-hydroxylase purified from a yeast host cell disclosed herein to a reaction mixture; adding a specific concentration of a protein to be hydroxylated to the reaction mixture; and incubating the reaction micture under reaction conditions that promote hydroxylation of the protein by the a monomeric prolyl 4-hydroxylase.
In some embodiments, this disclosure provides an in vitro method for hydroxylating a protein comprising: adding a specific concentration of a monomeric prolyl 4-hydroxylase purified from a microorganism disclosed herein to a reaction mixture; adding a specific concentration of a protein to be hydroxylated to the reaction mixture; and incubating the reaction micture under reaction conditions that promote hydroxylation of the protein by the a monomeric prolyl 4-hydroxylase.
In certain embodiments, this disclosure provides an ex vivo method for hydroxylating a protein comprising: lysing a microorganism disclosed herein to create a lysate; incubating the lysate and a protein to be hydroxylated under reaction conditions that promote hydroxylation of the protein by the monomeric prolyl 4-hydroxylase.
In some embodiments, this disclosure provides an ex vivo method for hydroxylating a protein comprising: lysing a yeast host cell to create a lysate; incubating the lysate and a a protein to be hydroxylated under reaction conditions that promote hydroxylation of a protein in the lysate by the monomeric prolyl 4-hydroxylase.
In certain embodiments, this disclosure provides an ex vivo method for hydroxylating a protein comprising: lysing a microorganism comprising a monomeric prolyl 4-hydroxylase to create a first lysate; lysing a second microorganism comprising a protein to be hydroxylated to create a second lysate; and incubating the first lysate and the second lysate under reaction conditions that promote hydroxylation of the protein by the monomeric prolyl 4-hydroxylase.
In some embodiments, this disclosure provides an ex vivo method for hydroxylating a protein comprising: lysing a yeast host cell comprising a recombinant monomeric prolyl-4 hydroxylase to create a yeast host cell lysate; lysing a microorganism comprising a protein to be hydroxylated to create a protein containing lysate; and incubating yeast host cell lysate and the protein containing lysate under reaction conditions that promote hydroxylation of the protein by the monomeric prolyl 4-hydroxylase.
The indefinite articles “a” and “an” to describe an element or component means that one or at least one of these elements or components is present. Although these articles are conventionally employed to signify that the modified noun is a singular noun, as used herein the articles “a” and “an” also include the plural, unless otherwise stated in specific instances. Similarly, the definite article “the,” as used herein, also signifies that the modified noun can be singular or plural, again unless otherwise stated in specific instances.
As used in the claims, “comprising” or “comprises” is an open-ended transitional phrase. A list of elements following the transitional phrase “comprising” is a non-exclusive list, such that elements in addition to those specifically recited in the list can also be present. As used herein, the terms “includes,” “including,” “has,” “having” or any other variation thereof, are intended to cover a non-exclusive inclusion
Further, unless expressly stated to the contrary, “or” and “and/or” refers to an inclusive and not to an exclusive. For example, a condition A or B, or A and/or B, is satisfied by any one of the following: A is true (or present) and B is false (or not present), A is false (or not present) and B is true (or present), and both A and B are true (or present).
When the term “about” is used, it is used to mean a certain effect or result can be obtained within a certain tolerance, and the skilled person knows how to obtain the tolerance. When the term “about” is used in describing a value or an end-point of a range, the disclosure should be understood to include the specific value or end-point referred to. In certain embodiments, “about” can mean a range of up to 10% (i.e., ±10%).
Any numerical range recited herein is intended to include all sub-ranges subsumed therein. Where a range of numerical values is recited herein, comprising upper and lower values, unless otherwise stated in specific circumstances, the range is intended to include the endpoints thereof, and all integers and fractions within the range. It is not intended that the scope of the claims be limited to the specific values recited when defining a range. Further, when an amount, concentration, or other value or parameter is given as a range, one or more preferred ranges or a list of upper preferable values and lower preferable values, this is to be understood as specifically disclosing all ranges formed from any pair of any upper range limit or preferred value and any lower range limit or preferred value, regardless of whether such pairs are separately disclosed. Finally, when the term “about” is used in describing a value or an end-point of a range, the disclosure should be understood to include the specific value or end-point referred to. Whether or not a numerical value or end-point of a range recites “about,” the numerical value or end-point of a range is intended to include two embodiments: one modified by “about,” and one not modified by “about.”
As used herein “collagen” refers to the family of at least 28 distinct naturally occurring collagen types including, but not limited to collagen types I, II, III, IV, V, VI, VII, VIII, IX, X, XI, XII, XIII, XIV, XV, XVI, XVII, XVIII, XIX, and XX. The term collagen as used herein also refers to collagen prepared using recombinant techniques. The term collagen includes collagen, collagen fragments, collagen-like proteins, triple helical collagen, alpha chains, monomers, gelatin, trimers and combinations thereof. Recombinant expression of collagen and collagen-like proteins is known in the art (see, e.g., Bell, EP 1232182B1, Bovine collagen and method for producing recombinant gelatin; Olsen, et al., U.S. Pat. No. 6,428,978 and VanHeerde, et al., U.S. Pat. No. 8,188,230, incorporated by reference herein in their entireties) Unless otherwise specified, collagen of any type, whether naturally occurring or prepared using recombinant techniques, can be used in any of the embodiments described herein. That said, in some embodiments, the composite materials described herein can be prepared using Bovine Type I collagen.
Collagens are characterized by a repeating triplet of amino acids, -(Gly-X-Y)n-, so that approximately one-third of the amino acid residues in collagen are glycine. X is often proline and Y is often hydroxyproline. Thus, the structure of collagen may consist of three intertwined peptide chains of differing lengths. Different animals may produce different amino acid compositions of the collagen, which may result in different properties (and differences in the resulting leather). Collagen triple helices (also called monomers or tropocollagen) can be produced from alpha-chains of about 1050 amino acids long, so that the triple helix takes the form of a rod of about approximately 300 nm long, with a diameter of approximately 1.5 nm. In the production of extracellular matrix by fibroblast skin cells, triple helix monomers can be synthesized and the monomers may self-assemble into a fibrous form. These triple helices can be held together by electrostatic interactions (including salt bridging), hydrogen bonding, Van der Waals interactions, dipole-dipole forces, polarization forces, hydrophobic interactions, and covalent bonding. Triple helices can be bound together in bundles called fibrils, and fibrils can further assemble to create fibers and fiber bundles. In some embodiments, fibrils can have a characteristic banded appearance due to the staggered overlap of collagen monomers. This banding can be called “D-banding.” The bands are created by the clustering of basic and acidic amino acids, and the pattern is repeated four times in the triple helix (D-period). (See, e.g., Covington, A., Tanning Chemistry: The Science of Leather (2009)) The distance between bands can be approximately 67 nm for Type 1 collagen. These bands can be detected using diffraction Transmission Electron Microscope (TEM), which can be used to access the degree of fibrillation in collagen. Fibrils and fibers typically branch and interact with each other throughout a layer of skin. Variations of the organization or crosslinking of fibrils and fibers can provide strength to a material disclosed herein. In some embodiments, protein is formed, but the entire collagen structure is not triple helical. In certain embodiments, the collagen structure can be about 10%, about 15%, about 20%, about 25%, about 30%, about 35%, about 40%, about 45%, about 50%, about 55%, about 60%, about 65%, about 70%, about 75%, about 80%, about 85%, about 90%, about 95%, about 96%, about 97%, about 98%, about 99% or 100% triple helical.
Regardless of the type of collagen, all are formed and stabilized through a combination of physical and chemical interactions including electrostatic interactions (including salt bridging), hydrogen bonding, Van der Waals interactions, dipole-dipole forces, polarization forces, hydrophobic interactions, and covalent bonding often catalyzed by enzymatic reactions. For Type I collagen fibrils, fibers, and fiber bundles, its complex assembly is achieved in vivo during development and is critical in providing mechanical support to the tissue while allowing for cellular motility and nutrient transport.
Various distinct collagen types have been identified in vertebrates, including bovine, ovine, porcine, chicken, and human collagens. Generally, the collagen types are numbered by Roman numerals, and the chains found in each collagen type are identified by Arabic numerals. Detailed descriptions of structure and biological functions of the various different types of naturally occurring collagens are generally available in the art; see, e.g., Ayad et al. (1998) The Extracellular Matrix Facts Book, Academic Press, San Diego, CA; Burgeson, R E., and Nimmi (1992) “Collagen types: Molecular Structure and Tissue Distribution” in Clin. Orthop. 282:250-272; Kielty, C. M. et al. (1993) “The Collagen Family: Structure, Assembly And Organization In The Extracellular Matrix,” Connective Tissue And Its Heritable Disorders, Molecular Genetics, And Medical Aspects, Royce, P. M. and B. Steinmann eds., Wiley-Liss, NY, pp. 103-147; and Prockop, D.J- and K.I. Kivirikko (1995) “Collagens: Molecular Biology, Diseases, and Potentials for Therapy,” Annu. Rev. Biochem., 64:403-434.) In some embodiments, the sequence can be a sequence that is about 80%, about 85%, about 90%, about 95%, about 96%, about 97%, about 98%, about 99% or 100% identical to the collagen sequence of SEQ ID NO: 24.
Type I collagen is the major fibrillar collagen of bone and skin, comprising approximately 80-90% of an organism’s total collagen. Type I collagen is the major structural macromolecule present in the extracellular matrix of multicellular organisms and comprises approximately 20% of total protein mass. Type I collagen is a heterotrimeric molecule comprising two α1(I) chains and one α2(I) chain, encoded by the COL1A1 and COL1A2 genes, respectively. Other collagen types are less abundant than type I collagen, and exhibit different distribution patterns. For example, type II collagen is the predominant collagen in cartilage and vitreous humor, while type III collagen is found at high levels in blood vessels and to a lesser extent in skin.
Type II collagen is a homotrimeric collagen comprising three identical al(II) chains encoded by the COL2A1 gene. Purified type II collagen can be prepared from tissues by, methods known in the art, for example, by procedures described in Miller and Rhodes (1982) Methods In Enzymology 82:33-64.
Type III collagen is a major fibrillar collagen found in skin and vascular tissues. Type III collagen is a homotrimeric collagen comprising three identical α1(III) chains encoded by the COL3A1 gene. Methods for purifying type III collagen from tissues can be found in, for example, Byers et al. (1974) Biochemistry 13:5243-5248; and Miller and Rhodes, supra.
Type IV collagen is found in basement membranes in the form of sheets rather than fibrils. Most commonly, type IV collagen contains two α1(IV) chains and one α2(IV) chain. The particular chains comprising type IV collagen are tissue-specific. Type IV collagen can be purified using, for example, the procedures described in Furuto and Miller (1987) Methods in Enzymology, 144:41-61, Academic Press.
Type V collagen is a fibrillar collagen found in, primarily, bones, tendon, cornea, skin, and blood vessels. Type V collagen exists in both homotrimeric and heterotrimeric forms. One form of type V collagen is a heterotrimer of two α1(V) chains and one α2(V) chain. Another form of type V collagen is a heterotrimer of α1(V), α2(V), and α3(V) chains. A further form of type V collagen is a homotrimer of α1(V). Methods for isolating type V collagen from natural sources can be found, for example, in Elstow and Weiss (1983) Collagen Rel. Res. 3:181-193, and Abedin et al. (1982) Biosci. Rep. 2:493-502.
Type VI collagen has a small triple helical region and two large non-collagenous remainder portions. Type VI collagen is a heterotrimer comprising α1(VI), α2(VI), and α3(VI) chains. Type VI collagen is found in many connective tissues. Descriptions of how to purify type VI collagen from natural sources can be found, for example, in Wu et al. (1987) Biochem. J. 248:373-381, and Kielty et al. (1991) J. Cell Sci. 99:797-807.
Type VII collagen is a fibrillar collagen found in particular epithelial tissues. Type VII collagen is a homotrimeric molecule of three α1(VII) chains. Descriptions of how to purify type VII collagen from tissue can be found in, for example, Lunstrum et al. (1986) J. Biol. Chem. 261:9042-9048, and Bentz et al. (1983) Proc. Natl. Acad. Sci. USA 80:3168-3172. Type VIII collagen can be found in Descemet’s membrane in the cornea. Type VIII collagen is a heterotrimer comprising two α1(VIII) chains and one α2(VIII) chain, although other chain compositions have been reported. Methods for the purification of type VIII collagen from nature can be found, for example, in Benya and Padilla (1986) J. Biol. Chem. 261:4160-4169, and Kapoor et al. (1986) Biochemistry 25:3930-3937.
Type IX collagen is a fibril-associated collagen found in cartilage and vitreous humor. Type IX collagen is a heterotrimeric molecule comprising α1(IX), α2(IX), and α3 (IX) chains. Type IX collagen has been classified as a FACIT (Fibril Associated Collagens with Interrupted Triple Helices) collagen, possessing several triple helical domains separated by non-triple helical domains. Procedures for purifying type IX collagen can be found, for example, in Duance, et al. (1984) Biochem. J. 221:885-889; Ayad et al. (1989) Biochem. J. 262:753-761; and Grant et al. (1988) The Control of Tissue Damage, Glauert, A. M., ed., Elsevier Science Publishers, Amsterdam, pp. 3-28.
Type X collagen is a homotrimeric compound of α1(X) chains. Type X collagen has been isolated from, for example, hypertrophic cartilage found in growth plates. (See, e.g., Apte et al. (1992) Eur J Biochem 206 (1):217-24.)
Type XI collagen can be found in cartilaginous tissues associated with type II and type IX collagens, and in other locations in the body. Type XI collagen is a heterotrimeric molecule comprising α1(XI), α2(XI), and α3(XI) chains. Methods for purifying type XI collagen can be found, for example, in Grant et al., supra.
Type XII collagen is a FACIT collagen found primarily in association with type I collagen. Type XII collagen is a homotrimeric molecule comprising three α1(XII) chains. Methods for purifying type XII collagen and variants thereof can be found, for example, in Dublet et al. (1989) J. Biol. Chem. 264:13150-13156; Lunstrum et al. (1992) J. Biol. Chem. 267:20087-20092; and Watt et al. (1992) J. Biol. Chem. 267:20093-20099.
Type XIII is a non-fibrillar collagen found, for example, in skin, intestine, bone, cartilage, and striated muscle. A detailed description of type XIII collagen can be found, for example, in Juvonen et al. (1992) J. Biol. Chem. 267: 24700-24707.
Type XIV is a FACIT collagen characterized as a homotrimeric molecule comprising α1(XIV) chains. Methods for isolating type XIV collagen can be found, for example, in Aubert-Foucher et al. (1992) J. Biol. Chem. 267:15759-15764, and Watt et al., supra.
Type XV collagen is homologous in structure to type XVIII collagen. Information about the structure and isolation of natural type XV collagen can be found, for example, in Myers et al. (1992) Proc. Natl. Acad. Sci. USA 89:10144-10148; Huebner et al. (1992) Genomics 14:220-224; Kivirikko et al. (1994) J. Biol. Chem. 269:4773-4779; and Muragaki, J. (1994) Biol. Chem. 264:4042-4046.
Type XVI collagen is a fibril-associated collagen, found, for example, in skin, lung fibroblast, and keratinocytes. Information on the structure of type XVI collagen and the gene encoding type XVI collagen can be found, for example, in Pan et al. (1992) Proc. Natl. Acad. Sci. USA 89:6565-6569; and Yamaguchi et al. (1992) J. Biochem. 112:856-863.
Type XVII collagen is a hemidesmosal transmembrane collagen, also known at the bullous pemphigoid antigen. Information on the structure of type XVII collagen and the gene encoding type XVII collagen can be found, for example, in Li et al. (1993) J. Biol. Chem. 268(12):8825-8834; and McGrath et al. (1995) Nat. Genet. 11(1):83-86.
Type XVIII collagen is similar in structure to type XV collagen and can be isolated from the liver. Descriptions of the structures and isolation of type XVIII collagen from natural sources can be found, for example, in Rehn and Pihlajaniemi (1994) Proc. Natl. Acad. Sci USA 91:4234-4238; Oh et al. (1994) Proc. Natl. Acad. Sci USA 91:4229-4233; Rehn et al. (1994) J. Biol. Chem. 269:13924-13935; and Oh et al. (1994) Genomics 19:494-499.
Type XIX collagen is believed to be another member of the FACIT collagen family, and has been found in mRNA isolated from rhabdomyosarcoma cells. Descriptions of the structures and isolation of type XIX collagen can be found, for example, in Inoguchi et al. (1995) J. Biochem. 117:137-146; Yoshioka et al. (1992) Genomics 13:884-886; and Myers et al., J. Biol. Chem. 289:18549-18557 (1994).
Type XX collagen is a newly found member of the FACIT collagenous family, and has been identified in chick cornea. (See, e.g., Gordon et al. (1999) FASEB Journal 13:A1119; and Gordon et al. (1998), IOVS 39:S1128.)
In the context of the present application a “variant” includes an amino acid sequence having at least 70%, 75%, 80%, 85%, 87.5%, 90%, 92.5%, 95%, 97.5%, 98%, or 99% sequence identity, or similarity to a reference amino acid, such as a monomeric P4H amino acid sequence or an amino acid of selected from any one of SEQ ID NOs: 2, 3, 6, 7 and 8, using a similarity matrix such as BLOSUM45, BLOSUM62 or BLOSUM80 where BLOSUM45 can be used for closely related sequences, BLOSUM62 for midrange sequences, and BLOSUM80 for more distantly related sequences. Unless otherwise indicated a similarity score will be based on use of BLOSUM62. When BLASTP is used, the percent similarity is based on the BLASTP positives score and the percent sequence identity is based on the BLASTP identities score. BLASTP “Identities” shows the number and fraction of total residues in the high scoring sequence pairs which are identical; and BLASTP “Positives” shows the number and fraction of residues for which the alignment scores have positive values and which are similar to each other. Amino acid sequences having these degrees of identity or similarity or any intermediate degree of identity or similarity to the amino acid sequences disclosed herein are contemplated and encompassed by this disclosure. A representative BLASTP setting uses an Expect Threshold of 10, a Word Size of 3, BLOSUM 62 as a matrix, and Gap Penalty of 11 (Existence) and 1 (Extension) and a conditional compositional score matrix adjustment. In typical embodiments, the “variant” retains prolyl-4-hydroxylase activity.
The principal post-translational modifications to protein polypeptides that contain proline and lysine residues, such as collagen, are 1) hydroxylation of proline and lysine residues to yield 4-hydroxyproline, 3-hydroxyproline (Hyp), and hydroxylysine (Hyl); and 2) glycosylation of hydroxylysyl residues. These modifications are catalyzed by three hydroxylases: prolyl 4-hydroxylase, prolyl 3-hydroxylase, and lysyl hydroxylase; and two glycosyl transferases, respectively. In vivo these reactions occur until the polypeptides form the triple-helical collagen structure.
The “prolyl-4-hydroxylase” or “P4H” enzyme catalyzes hydroxylation of proline residues to (2S,4R)-4-hydroxyproline (Hyp). See, Gorres, et al., Critical Reviews in Biochemistry and Molecular Biology 45 (2): (2010), which is incorporated by reference in its entierty. In collagen and related proteins, prolyl 4-hydroxylase catalyzes the formation of 4-hydroxyproline, whichis necessary for the proper three-dimensional folding of newly synthesized procollagen chains.
Monomeric prolyl-4-hydroxylase enzymes are a group of enzymes that function as a single unit (as opposed to animal P4H enzymes that functions as a heterotetramer). The monomeric P4H enzymes are typically much smaller in size (20-50 kD) than the P4H tetramer (120 kD). Monomeric P4H enzymes can be found in, and isolated from, bacteria, algae, plants, and viruses,
In some embodiments, the present disclosure provides a recombinant host cell comprising a recombinant monomeric P4H enzyme. In certain embodiments, the recombinant monomeric P4H enzyme in the host cell is from a virus, an algae, or a plant. In some embodiments, the recombinant monomeric P4H enzyme in the host cell is from mimivirus. In certain embodiments, the recombinant monomeric P4H enzyme in the host cell is from Arabidopsis thaliana. In another embodiment, the recombinant monomeric P4H enzyme in the host cell is from C. reinhardtii. In some embodiments, the recombinant monomeric P4H enzyme in the host cell is from Paramecium bursaria Chlorella virus-1. Isoforms, orthologs, variants, fragments and prolyl-4-hydroxylases from other sources can also be used in the host cell as long as they retain hydroxylase activity in a host cell. In certain embodiments, the recombinant monomeric P4H enzyme in the host cell can have an amino acid sequence selected from the group consisting of SEQ ID NOs: 2, 3, 6, 7 and 8. In some embodiments, the recombinant monomeric P4H enzyme in the host cell can have a sequence that is about 80%, about 85%, about 90%, about 95%, or about 99% identical to a sequence selected from SEQ ID NOs: 2, 3, 6, 7 and 8. In some embodiments, the recombinant monomeric P4H enzyme in the host cell has an amino acid sequence that is a variant of any sequence disclosed herein.
In some embodiments, host cells are engineered to overproduce prolyl-4-hydroxylase. For example, a polynucleotide encoding the prolyl-4-hydroxylase, an isoform thereof, an ortholog thereof, a variant thereof, or a fragment thereof that expresses prolyl-4-hydroxylase activity, can be incorporated into an expression vector. In some embodiments, the expression vector containing the polynucleotide encoding the prolyl-4-hydroxylase, the isoform thereof, the ortholog thereof, the variant thereof, or the fragment thereof, can be under the control of an inducible promoter. Suitable host cells, expression vectors, and promoters are described below.
DNA encoding the monomeric P4H enzyme can be transformed or transfected into an organism. Suitable organisms include, but are not limited to, yeast, bacteria, fungi and the like. In some embodiments, the bacteria can be Bacillus or Escherichia coli. In some embodiments, the microorganism can be a filamentous fungi. In some embodiments, the organism can be yeast. In certain embodiments, the yeast can be Pichia pastoris. In some embodiments, the monomeric P4H enzyme can be used in a method for in vitro hydroxylation of proteins. In some embodiments, monomeric P4H enzyme can be used in a method for in vivo hydroxylation of proteins. In some embodiments, the monomeric P4H enzyme can be used in a method for ex vivo hydroxylation of proteins.
In certain embodiments, monomeric P4H enzyme expressed by a host cell can be secreted.
In some embodiments, monomeric P4H enzyme can be used to hydroxylate proteins in vitro. Microorganisms that contain protein such as collagen can be lysed creating a lysate. The lysate can be processed to create purified proteins. Monomeric P4H enzyme can be added to purified samples of protein or added to the lysate. In some embodiments, co-factors for the hydroxylation reaction can include one or more of ascorbic acid/sodium ascorbate, or an iron (II) containing species, for example FeSO4. In other embodiments, co-factors for hydroxylation reaction can include alpha-Ketoglutarate (AKG or 2-oxoglutarate) and/or molecular oxygen. In some embodiments, the substrate for the hydroxylation reaction can be collagen. In some embodiments, bovine serum albumin and/or catalase can be added to the reaction to promote hydroxylation efficiency. In some embodiments, the catalase can be bovine catalase (Available from SigmaAldrich: Catalog Number C40).
In some embodiments, the hydroxylation reaction can be performed at a temperature ranging from about 16° C. to about 40° C., for example about 32° C. In some embodiments, the hydroxylation reaction can be performed at about 16° C., about 17° C., about 18° C., about 19° C., about 20° C., about 21° C., about 22° C., about 23° C., about 24° C., about 25° C., about 26° C., about 27° C., about 28° C., about 29° C., about 30° C., about 31° C., about 32° C., about 33° C., about 34° C., about 35° C., about 36° C., about 37° C., about 38° C., about 39° C., or at about 40° C.
The amount of monomeric P4H enzyme added to the hydroxylation reaction can range from about 0.05 uM to about 20 uM, for example about 5 uM. In some embodiments, the amount of monomeric P4H enzyme added can be about 0.05 uM, about 0.1 uM, about 0.15 uM, about 0.2 uM, about 0.25 uM, about 0.3 uM, about 0.35 uM, about 0.4 uM, about 0.5 uM, about 0.6 uM, about 0.7 uM, about 0.8 uM, about 0.9 uM, about 1.0 uM, about 1.1 uM, about 1.2 uM, about 1.3 uM, about 1.4 uM, about 1.5 uM, about 1.6 uM, about 1.7 uM, about 1.8 uM, about 1.9 uM, about 2.0 uM, about 2.5 uM, about 3.0 uM, about 3.5 uM, about 4.0 uM, about 4.5 uM, about 5 uM, about 7 uM, about 10 uM, about 15 uM, or about 20 uM.
In some embodiments, the hydroxylation reaction can take place at a pH ranging from about 5 to about 12, for example about 7.5. In some embodiments, the pH can be about 5.0, about 5.5, about 6, about 6.5, about 7, about 7.5, about 8, about 8.5, about 9.0, about 9.5, about 10.0, about 10.5, about 11, about 11.5, or about 12.
In some embodiments, the hydroxylation reaction can take place over about 30 mins to about 5 hours, for example about 1 hour. In some embodiments, the hydroxylation can take place over about 30 minutes, about 45 minutes, about 1 hour, about 1.5 hours, about 2 hours, about 2.5 hours, about 3 hours, about 3.5 hours, about 4 hours, about 4.5 hours, or about 5 hours. In certain embodiments, and after the reaction is complete or has proceeded for a sufficient amount of time, the monomeric P4H enzyme can be inactivated by adding an acid to lower the pH of the solution to about 4. Alternatively, 50% - 80% methanol (by volume) can be added to inactive the enzyme. In some embodiments, the in vitro hydroxylation can be performed using any method disclosed in U.S. Pat. No. 7,932,053, which is incorporated herein by reference in its entirety.
In some embodiments, the monomeric P4H enzyme can be used to hydroxylate proteins ex vivo. Microorganisms that contain protein such as collagen and also monomeric P4H enzyme can be lysed at a pH of about 12 to create a lysate. In some embodiments, the cells can be lysed at a pH of about 7, about 8, about 9, about 10, about 11, about 12, about 13 or higher. In some embodiments, the pH of the lysate can then be lowered to about 7.5. In certain embodiments, the pH can lowered to about 10, about 9, about 8, about 7.5, about 7, about 6, or about 5. In particular embodiments, reaction components, including one or more of ascorbic acid, sodium ascorbate, DTT, or an iron (II) species (such as FeSO4) can be added to the lysate following pH reduction. In certain embodiments, alpha-Ketoglutarate (AKG or 2-oxoglutarate) can also be added to the reaction.
In certain embodiments, the ex vivo hydroxylation reaction can be performed at a temperature ranging from about 16° C. to about 40° C., for example about 32° C. In some embodiments, the hydroxylation reaction can be performed at about 16° C., about 17° C., about 18° C., about 19° C., about 20° C., about 21° C., about 22° C., about 23° C., about 24° C., about 25° C., about 26° C., about 27° C., about 28° C., about 29° C., about 30° C., about 31° C., about 32° C., about 33° C., about 34° C., about 35° C., about 36° C., about 37° C., about 38° C., about 39° C. or about 40° C. In some embodiments, the ex vivo hydroxylation reaction can take place over about 30 mins to about 5 hours, for example about 3 hours. In some embodiments, the ex vivo hydroxylation can take place over about 30 minutes, about 45 minutes, about 1 hour, about 1.5 hours, about 2 hours, about 2.5 hours, about 3 hours, about 3.5 hours, about 4 hours, about 4.5 hours, or about 5 hours.
Once the ex vivo hydroxylation reaction is complete, the monomeric P4H can be inactivated by adding an acid to lower the pH of the solution to 4 or adding 50% - 80% methanol by volume.
In an alternative embodiment, the DNA sequence of the monomeric P4H enzyme can be transfected into a microorganism and utilized to hydroxylate proteins intracellularly/in vivo. In some embodiments, the microorganism can also express a protein to be hydroxylated. In some embodiments, the microorganism can express collagen as the protein to be hydroxylated.
In typical embodiments, the transfected microorganism can be grown in media appropriate for the particular microorganism under conditions well known to one of ordinary skill in the art. In some embodiments, suitable media for the reaction can be, for example, LB (Lysogeny broth) for E.coli, BMGY (Buffered Glycerol-complex Medium) for Pichia, YPD (yeast extract peptone dextrose) for Pichia, or HMP (Sodium hexametaphosphate) for Pichia. The temperature of the media can range from about 16° C. to about 42° C. In some embodiments, the temperature of the media can be about 16° C., about 18° C., about 20° C., about 22° C., about 24° C., about 26° C., about 28° C., about 29° C., about 30° C., about 31° C., about 32° C., about 33° C., about 34° C., about 35° C., about 36° C., about 37° C., about 38° C., about 39° C., about 40° C., about 41° C., or about 42° C.
In some embodiments, the transfected microorganism can be Pichia, and the temperature of the media can range from about 28° C. to about 36° C., for example about 32° C. In some embodiments, the temperature of the media can be about 28° C., about 29° C., about 30° C., about 31° C., about 32° C., about 33° C., about 34° C., about 35° C. or about 36° C.
In some embodiments, the transfected microorganism can be grown for a time ranging from about 50 hours to about 72 hours, for example about 68 hours. In some embodiments, the microorganism can be grown for about 50 hours, about 51 hours, about 52 hours, about 53 hours, about 54 hours, about 55 hours, about 56 hours, about 57 hours, about 58 hours, about 59 hours, about 60 hours, about 61 hours, about 62 hours, about 63 hours, about 64 hours, about 65 hours, about 66 hours, about 67 hours, about 68 hours, about 69 hours, about 70 hours, about 71 hours, or about 72 hours. In certain embodiments, co-factors for hydroxylation reaction can include: alpha-Ketoglutarate (AKG or 2-oxoglutarate) and /or molecular oxygen. In embodiments, the substrate for the hydroxylation reaction is molecular collagen.
In some embodiments, the DNA sequence for the monomeric P4H enzyme can be placed in a vector along with: a DNA sequence for a promotor; a DNA sequence for a terminator; a DNA sequence for a selection marker, a DNA sequence for a promoter for the selection marker; a DNA sequence for a terminator for the selection marker; a DNA sequence for a replication origin selected from one for bacteria and one for yeast; and/or a DNA sequence containing homology to the yeast genome (optional to improve efficiency when transformed into a yeast). In some embodiments, the vector can be inserted into (or episomal to) an organism. In some embodiments, the vector then can be transformed into the organism by methods known in the art such as electroporation. In certain embodiments, the organism can be a microorganism. In some embodiments, the vector can also possess a DNA sequence for a secretion signal.
In some embodiments, the DNA of the recombinant P4H enzyme can be transformed into a microorganism along with DNA encoding a protein to be hydroxylated. In some embodiments, the DNA sequence for the monomeric P4H enzyme can be placed in a first vector along with: a DNA sequence for a promoter for the monomeric P4H sequence; a DNA terminator sequence for the monomeric P4H sequence; a DNA sequence for a selection marker; a DNA sequence for a promoter for the selection marker; a DNA sequence for a terminator for the selection marker; a DNA sequence for a replication origin selected from one for bacteria and one for yeast; and/or a DNA sequence containing homology to the host microorganism’s genome. In some embodiments, the DNA sequence for the protein to be hydroxylated can be placed on a second vector along with: a DNA sequence for a promoter for the protein to be hydroxylated; a DNA sequence for a terminator for the protein to be hydroxylated; a DNA sequence for a selection marker; a DNA sequence for a promoter for the selection marker; a DNA sequence for a terminator for the selection marker; a DNA sequence for a replication origin selected from one for bacteria and one for yeast; and/or a DNA sequence containing homology to the host organism’s genome. In some embodiments, the two vectors can then be transformed into the microorganism by methods known in the art such as electroporation. In some embodiments, any vector disclosed herein can also include a DNA sequence for a secretion signal.
Alternatively, in some embodiments, an all-in-one vector can be used, wherein the DNA for the monomeric P4H enzyme, including a promoter and a terminator for the monomeric P4H enzyme sequence; the DNA for the protein to be hydroxylated, including a promoter and a terminator for the sequence of the protein to be hydroxylated; a DNA for a selection marker, including a promoter and a terminator for the selection marker; and/or DNAs with homology to the organism’s genome for integration into the genome are included in the all-in-one vector. The all-in-one vector then can be transformed into the microorganism by methods known in the art such as electroporation.
Suitable promoters for use in the present disclosure include, but are not limited to, AOX1 methanol induced promoter, pDF de-repressed promoter, pCAT de-repressed promoter, Das1-Das2 methanol induced bi-directional promoter, pHTXl constitutive Bi-directional promoter, pGCW14-pGAP1 constitutive Bi-directional promoter and combinations thereof.
The monomeric P4H enzyme described herein can be useful for personal care compositions suitable for application to the skin. The monomeric P4H enzyme can be included in the personal care compostion at a particular purity level. For example, and in some embodiments, the monomeric P4H enzyme can be added as isolated or purified monomeric P4H enzyme (i.e. without any impurities). Alternatively, the monomeric P4H enzyme can be added in lower purity, (e.g., about 25% purified, about 50% purified, about 65% purified, about 75% purified, about 85% purified, about 90% purified, about 95% purified, about 96% purified, about 97% purified, about 98% purified, or about 99% purified by weight). In some embodiments, the amount of monomeric P4H is quanitified by qSDS. In other words, the monomeric P4H enzyme can be added to a personal care product as a purified protein or it can be added as part of the fraction from which the protein is found. In certain embodiments, the monomeric P4H enzyme can be formulated into a cream, a lotion, an ointment, a gel, a serum, or other type of formulation suitable for topical application to the skin of a subject in need thereof.
In some embodiments, the composition can further include a cosmetically-acceptable carrier. The cosmetically-acceptable carrier can comprise from about 50% to about 99%, by weight, of the composition (e.g., from about 80% to about 95%, by weight, of the composition). In some embodiments, the carrier can be about 50%, about 55%, about 60%, about 65%, about 70%, about 75%, about 80%, about 85%, about 90%, about 95%, about 96%, about 97%, about 98%, or about 99%, by weight, of the composition.
The compositions can be use in a wide variety of product types that include but are not limited to liquid compositions such as lotions, creams, gels, sticks, sprays, shaving creams, ointments, cleansing liquid washes and solid bars, pastes, powders, mousses, masks, peels, make-ups, and wipes. These product types can comprise several types of cosmetically acceptable carriers including, but not limited to solutions, emulsions (e.g., microemulsions and nanoemulsions), gels, solids and liposomes).
In some embodiments, the topical compositions described herein can be formulated as solutions. Solutions typically include an aqueous solvent (e.g., from about 50% by weight to about 99% by weight or from about 90% by weight to about 95% by weight of a cosmetically acceptable aqueous solvent). In some embodiments, the solution can be about 50%, about 55%, about 60%, about 65%, about 70%, about 75%, about 80%, about 85%, about 90%, about 95%, about 96%, about 97%, about 98%, or about 99 % by weight of a cosmetically acceptable aqueous solvent. In certain embodiments, the aqueous solvent can be water. In other embodiments, the aqueous solvent can be a mixture of water and one more water-soluble solvents, such as ethanol, isopropanol, glycerol, and the like.
In some embodiments, the topical compositions can be formulated as a solution comprising one or more emollients. Such compositions can contain from about 2% to about 50% by weight of the one or more emollients. In some embodiments, the composition comprises about 2%, about 3%, about 4%, about 5%, about 6%, about 7%, about 8%, about 9%, about 10%, about 12%, about 15%, about 20%, about 25%, about 30%, about 35%, about 40%, about 45%, or about 50% by weight of the one or more emollients. As used herein, “emollients” refer to materials used for the prevention or relief of dryness, as well as for the protection of the skin. A wide variety of suitable emollients are known and can be useful in the personal care compositions. See International Cosmetic Ingredient Dictionary and Handbook, eds. Wenninger and McEwen, (The Cosmetic, Toiletry, and Fragrance Assoc., Washington, D.C., 7.sup.th Edition, 1997) (hereinafter “CTFA Handbook”) which contains numerous examples of suitable materials.
In some embodiments, the composition can be a lotion. In some embodiments, the lotion comprises from about 1% to about 20% by weight (e.g., from about 5% n to about 10% by weight) of one or more emollients and from about 50% n to about 90% by weight (e.g., from about 60% by weight to about 80% by weight) water. In some embodiments, the lotion can comprise about 1%, about 2%, about 3%, about 4%, about 5%, about 6%, about 7%, about 8%, about 9%, about 10%, about 11%, about 12%, about 13%, about 14%, about 15%, about 16%, about 17%, about 18%, about 19%, or about 20% by weight of one or more emollients. In some embodiments, the lotion can comprise about 50%, about 55%, about 60%, about 65%, about 70%, about 75%, or about 80% by weight water.
In yet another embodiment, the composition can be a cream. In certain embodiments, a cream typically comprises from about 5% to about 50% by weight (e.g., from about 10% by weight to about 20% by weight) of one or more emollients and from about 45% by weight to about 85% by weight (e.g., from about 50% by weight to about 75% by weight) water. In some embodiments, the cream can comprise about 5%, about 6%, about 7%, about 8%, about 9%, about 10% about 15%, about 20%, about 25%, about 30%, about 35%, about 40%, about 45%, or about 50% by weight of one or more emollients. In some embodiments, the cream can comprise about 45%, about 50%, about 55%, about 60%, about 65%, about 70%, about 75%, about 80%, or about 85% by weight water.
In still another embodiment, the composition can be an ointment. In certain embodiments, the ointment can comprise a base of comprising one or more animal or vegetable oils or one or more semi-solid hydrocarbons. In certain embodiments, the ointment can comprise from about 2% by weight to about 10% by weight of an emollieiit(s) plus from about 0.1% by weight to about 2% by weight of one or more thickening agents. In some embodiments, the ointment can comprise about 2%, about 3%, about 4%, about 5%, about 6%, about 7%, about 8%, about 9% or about 10% by weight of one or more emollients. In some embodiments, the ointment can comprsie about 0.1%, about 0.2%, about 0.3%), about 0.4%, about 0.6%, about 0.8%, about 1.0%, about 1.2%, about 1.4%, about 1.6%, about 1.8% or about 2.0% by weight of one or more thickening agents. Suitable thickening agents are known to those of ordinary skill in the art as set forth in the CTFA Handbook.
In some embodiments, the composition can be an emulsion. If the carrier is an emulsion, from about 1% to about 10% by weight (e.g., from about 2% to about 5% by weight) of the carrier can comprise an emulsifier(s). In some embodiments, about 1%, about 2%, about 3%, about 4%, about 5%, about 6%, about 7%, about 8%, about 9%, or about 10% by weight of the carrier can comprise an emulsifier(s). Emulsifiers can be nonionic, anionic or cationic.
In some embodiments, the lotions pr creams can be formulated as emulsions. Typically, such lotions can comprise from 0.5% to about 5% by weight of an emulsifier(s). Such creams would typically comprise from about 1% to about 20% by weight (e.g., from about 5% to about 10% by weight) of an emollient(s); from about 20% to about 80% by weight (e.g., from 30% to about 70% by weight) of water; and from about 1 % to about 10% by weight (e.g., from about 2% to about 5% by weight) of an emulsifier(s).
Single emulsion skin care compositions, such as lotions and creams, of the oil-in-water type and water-in-oil type are well-known in the cosmetic art and are useful for the personal care compositions. Multiphase emulsion compositions, such as the water-in-oil-in-water type are also useful. In general, such single or multiphase emulsions contain water, emollients, and emulsifiers as essential ingredients.
The personal care compositions of this disclosure can also be formulated as a gel (e.g., an aqueous gel using a suitable gelling agent(s)). Suitable gelling agents for aqueous gels include, but are not limited to, natural gums, acrylic acid and acrylate polymers and copolymers, and cellulose derivatives (e.g., hydroxymethyl cellulose and hydroxypropyl cellulose). Suitable gelling agents for oils (such as mineral oil) include, but are not limited to, hydrogenated butylene/ethylene/styrene copolymer and hydrogenated ethylene/propylene/styrene copolymer. Such gels typically comprise between about 0.1% and 5%, by weight, of such gelling agents. In some embodiments, the gel comprises about 0.1%, about 0.2%, about 0.3%, about 0.4%, about 0.5%, about 1.0%, about 1.5%, about 2.0%, about 2.5%, about 3.0%, about 3.5%, about 4.0%, about 4.5%, or about 5.0% by weight, of such gelling agents.
The personal care compositions useful in the subject disclosure can contain, in addition to the aforementioned components, a wide variety of additional oil-soluble materials and/or water-soluble materials conventionally used in compositions for use on the skin at their art-established levels.
The personal care compositions can be applied to or on skin as needed and/or as part of a regular regimen ranging from application once a week up to one or more times a day (e.g., twice a day). The amount used will vary with the age and physical condition of the end user, the duration of the treatment, the specific compound, product, or composition employed, the particular cosmetically-acceptable earner utilized, and like factors.
The monomeric P4H enzyme described herein can be useful for skin care benefits in personal care applications such as anti-wrinkle, improved skin pigmentation, hydration, reduction of acne, prevention of acne, reduction of black heads, prevention of blackheads, reduction of stretch marks, prevention of stretch marks, prevention of cellulite, reduction of cellulite and the like. By improved skin pigmentation is meant either evening out skin pigmentation or reducing skin pigmentation to provide fair skin.
The monomeric P4H enzyme described herein can also be combined with other skin care benefit ingredients such as, but not limited to salicylic acid, retinol, benzoyl peroxide, vitamin C, glycerin, alpha-hydroxy acids, hydroquinone, kojic acid, hyaluronic acid and the like.
In the context of the present description, all publications, patent applications, patents and other references mentioned herein, if not otherwise indicated, are explicitly incorporated by reference herein in their entirety for all purposes as if fully set forth, and shall be considered part of the present disclosure in their entirety.
Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs. In case of conflict, the present specification, including definitions, will control.
When an amount, concentration, or other value or parameter is given as a range, or a list of upper and lower values, this is to be understood as specifically disclosing all ranges formed from any pair of any upper and lower range limits, regardless of whether ranges are separately disclosed. Where a range of numerical values is recited herein, unless otherwise stated, the range is intended to include the endpoints thereof, and all integers and fractions within the range. It is not intended that the scope of the present disclosure be limited to the specific values recited when defining a range.
Further, unless otherwise explicitly stated to the contrary, when one or multiple ranges or lists of items are provided, this is to be understood as explicitly disclosing any single stated value or item in such range or list, and any combination thereof with any other individual value or item in the same or any other list.
The examples are illustrative, but not limiting, of the present disclosure. Other suitable modifications and adaptations of the variety of conditions and parameters normally encountered in the field, and which would be apparent to those skilled in the art, are within the spirit and scope of the disclosure.
It is to be understood that the phraseology or terminology used herein is for the purpose of description and not of limitation. The breadth and scope of the present disclosure should not be limited by any of the above-described exemplary embodiments, but should be defined in accordance with the following claims and their equivalents,
For N terminal His tag:
Forward (SEQ ID NO: 15)
Reverse (SEQ ID NO: 16):
For C terminal His Tag: Forward (SEQ ID NO: 17):
Reverse (SEQ ID NO: 18):
gBlock was ordered from IDT and gene was amplified using standard PCR conditions.
The reaction mix components are as follows: pfu polymerase buffer 1x, 0.2 mM dNTPs each, 0.5 µM forward primer, 0.5 µM reverse primer, 0.02 U/µL pfu polymerase and 10 ng/mL gBlock. The thermal cycler was programmed as follows:
The amplified gene was cut with restriction enzymes EcoR I and Kpn I. The digested DNA was cleaned by agarose gel extraction using commercial kit before ligation into pCOLDIII vector. Ligation was set-up with a molar ratio of 1:3 (plasmid: insert) in 10 µL reaction mix. Typically, a ligase reaction mix had 3 ng/L digested plasmid vector, 9 ng/mL of the insert, 1 µL 10 X ligase buffer and 1 U/mL ligase. Ligation reaction mix was transformed into E. coli DH5a cells. Cells were spread on LB Ampicillin plates (6.25 g LB powder mix, 4 g agar, 250 mL DDI water, 0.1 mg/mL Ampicillin) before recovering in SOC medium for 1 hour at 37° C. Plates were incubated at 37° C. overnight; individual colonies that appeared next day were tested for gene fragments by colony PCR. Clones that showed amplification for desired fragments were inoculated on LB broth having 0.10 mg/mL ampicillin and grown overnight at 37° C., 250 rpm. Recombinant plasmid from these overnight grown cultures were isolated using kit from Zymergen and given for sequencing. Plasmid sequencing was done at Eueofin Inc. sequencing facility and gene specific primers were used for sequencing reactions.
Confirmed plasmids (
5 ml Ni-NTA (10 ml of 50% solution) beads were washed with 2 X volume of water and then with 5 X volume of lysis buffer (25 mM Tris pH 7.5, 50 mM NaCl and 20 mM Imidazole). Clarified lysate and Ni-NTA beads (equilibrated with lysis buffer as above) were mixed for 1 hour. This mix was poured into centrifuge columns and centrifuged at 1000 X g for 1-2 minutes at 4° C. About 2.5 ml beads should be there in 2 purification columns to get original volume of total 5 ml. The flow through was stored to check for any protein loss during the binding step. Beads that were collected in the centrifuge columns were washed with 50 ml of wash buffer (25 mM Tris pH 7.5, 50 mM NaCl and 50 mM Imidazole) sequentially, adding 10 ml at a time, centrifuging for 1000 X g for 1-2 minutes 4° C. Washings were also collected to check for the loss of mVP4H (Mimivirus P4H) during the washing step. 6 elution fractions were collected from each of the purification columns by passing 2.5 ml of elution buffer (25 mM Tris pH 7.5, 50 mM NaCl and 300 mM Imidazole) each time and centrifuge at 1000 rpm for 1-2 minutes at 4° C. Centrifuge elution fractions at 14000 X g for 5 minutes to remove any insoluble debris. Flow through, washings and all the fractions were checked on SDS PAGE (
The DNA sequence of monomeric prolyl 4-hydroxylase was acquired from IDT. Polymerase chain reactions were done using the DNA sequences as templates with primers MM-0579 (SEQ ID NO: 10); MM-0580 (SEQ ID NO: 20); MM-1569 (SEQ ID NO: 21), MM-1570 (SEQ ID NO: 22); MM-0784 (SEQ ID NO: 23) and Gibson assembled into vector MMV-644 (SEQ ID NO: 12). The final vector MMV-644 (
Reaction mix: pfu polymerase buffer 1 x, 0.2 mM dNTPs each, 0.5 µM forward primer, 0.5 µM reverse primer, 0.02 U/µL pfu polymerase and 10 ng/mL gBlock.
Thermal cycler was programmed as:
PP421 was generated by digesting MMV-398 (
PP654 was generated by digesting MMV-580 (
PP657 was generated by digesting MMV-580 (
1. Ni-NTA purification: 5 ml Ni-NTA (10 ml of 50% solution) beads were washed with 2 X volume of water and then with 5 X volume of lysis buffer (25 mM Tris pH 7.5, 50 mM NaCl and 20 mM Imidazole). pH of the 20 ml media was adjusted to 7.5 using 2N NaOH for the secreted mimi P4H. pH adjusted media and Ni-NTA beads (equilibrated with lysis buffer as above) were mixed for 3 hours at 4° C.
For the intracellular mimi P4H, pellets were resuspended in lysis buffer, mixed with beads and lysed using tissulyser. Lysed culture was clarified at 12000 x g, 4° C. for 30 minutes and supernatant thus obtained was mixed with beads overnight at 4° C. The steps are common for both secreted and intracellular mimiP4H purification.
The mix was poured into centrifuge columns and centrifuged at 1000 X g for 1-2 minutes at 4° C. About 2.5 ml beads should be there in 2 purification columns to get original volume of total 5 ml. The flow through was stored to check for any P4H loss during the binding step. Beads that were collected in the centrifuge columns were washed with 50 ml of wash buffer (25 mM Tris pH 7.5, 50 mM NaCl and 50 mM Imidazole) sequentially, adding 10 ml at a time, centrifuging for 1000 X g for 1-2 minutes 4° C. Washings were also collected to check for the loss of mVP4H (mimivirus P4H) during the washing step. Elution fractions were collected from each of the purification columns by passing 2.5 ml of elution buffer (25 mM Tris pH 7.5, 50 mM NaCl and 300 mM Imidazole) each time and centrifuge at 1000 rpm for 1-2 minutes at 4 For the intracellular. Centrifuge elution fractions at 14000 X g for 5 minutes to remove any insoluble debris. Flow through, washings and all the fractions were checked on SDSPAGE. Elution fractions were pooled and concentrated down to ~ 10 ml using 10 MW cut off protein concentrator. Concentrated purified mVP4H put for dialysis overnight in ~ 1 liters of 50 mM Tris-HCl pH 7.5, 100 mM NaCl buffer using 10 kDa cut off dialysis tubing in the cold room. One buffer change done next day for at least 3 hours under the cold condition (4° C.) and then dialyzed protein was taken out from dialysis tubes, centrifuge at 14000 X g for 10 minutes to remove any insoluble/aggregated protein. Q-bit protein estimation done on purified protein (at least 50 times diluted). Purified protein stored in several 500 ul aliquots at -80° C.
2. Direct Media Dialysis: For the secreted mimi P4H, fermentation media was directly transferred into dialysis tubing (10 ml, 10 kDa cut off) and put for dialysis overnight in 1 liters of 50 mM Tris-HCl pH 7.5, 100 mM NaCl buffer at 4° C. in the cold room. Two buffer changes were done next day for at least 3 hours each. Dialyzed protein taken out from dialysis tubes, centrifuge at 14000 X g for 10 minutes to remove any insoluble/aggregated protein. Q-bit protein estimation done on purified protein (at least 50 times diluted). Purified protein stored in several 500 ul aliquots at -80° C.
Fermentation grown samples were run on SDS PAGE gel, specific collagen band was cut and sent out for Mass spec analysis.
Ex Vivo (Method:1): Step wise method is described in
Reaction buffer has following components:
Fermenter grown samples were collected in micro centrifuge tubes, 300 mg of pellets were resuspended in reaction buffer and lysis was performed in 96 well plate. 300 mg cell pellet was resuspended in 2 ml buffer and distributed into 3 different 96 well plate. Cells were lysed in tissue lyser for 15 minutes. The pH of the lysate was checked and adjusted to 7.5 and incubated at 32° C. for 1.5 and 2.5 hours. Later the collagen was purified using our standard high low pH protocol, quantified on qSDS gels (
Ex Vivo (Method:2, lysate: lysate mixing):
Two different lysates were used in this method ()
These strains were grown separately in a shake flask with BMGY media.
The cell pellets (mixed pellets) were combined in 1:10 ratio (0.1 g Col3 strain: 1 g P4H strain) in 10 ml reaction buffer (same steps as in
The ‘mixed’ pellets were lysed in 10 ml reaction buffer, pH adjusted and incubated for 2 hours at 32° C.
The ‘reaction mix’ was purified for Col3 using high-low pH method.
qSDS followed by Hyp% assay was performed.
PP681 was generated by digesting MMV-589 (
Small P4Hs (including mimiP4H) were transformed into strains that have non FoldON collagen (PP681). Therefore, PP681 background was used. A Western blot was performed to confirm the clones (
All 4 transformants were tested using the ex vivo steps described in
PP765 was generated by digesting MMV-644 (
A secretory N terminal signal sequence was introduced in the mimiP4H plasmids (MMV-644) and the plasmids were transformed into His- strains. Different transformants for PP765 (without collagen), PP766 (with native signal sequence collagen) and PP767 (with Pho1 signal sequence collagen) were tested by western blot and on coomassie stained SDS PAGE gels. The transformants were first grown in 24 well plate in BMGY media, later confirmed transformants were also grown in shake flask and fermenters and supernatant was checked in all the cases (
One transformant each for PP765, PP766 and PP767 was grown in 50 ml BMGY media in shake flask and tested in western blot and coomassie stained gels). Most of the mimiP4H was secreted in the media, providing an advantage over intracellular mimiP4H. PP765, PP766 and PP767 were also grown in bioreactors in HMP+peptone media. Different time points of the cultures were collected and analyzed on gel (
Activity tests: Secreted Mimi P4H from the fermentation supernatant was purified using dialysis and also by Ni-NTA column. Purified P4H was used for the in-vitro hydroxylation reaction was set as described in
We also demonstrated that mimi virus P4H from fermenter supernatant is active without purification. Fermentation supernatant from three separate Mimi virus P4H secretion strains were collected and 0.05, 0.1 and 0.5 mg of purified collagen were added along with the reaction components (
The monomeric prolyl 4-hydroxylase enzymatic activity from PP765 was measured by a Hydroxylation assay. Acid hydrolysis of in-vitro hydroxylation reactions containing collagen were mixed with concentrated hydrochloric acid (1:1) and were performed at 125° C. for a minimum of 18 hours. The hydrolysis products were then dried completely and then resuspended with Milli-Q water. The resuspended samples were then centrifuged at 15,0000 rpm for 5 minutes to remove precipitates and debris. A reaction solution, with component final concentrations upon addition to the centrifuged supernatant were the following - 2.67% citric acid (w/v), 3.86% sodium acetate (w/v), 1.87% sodium hydroxide (w/v), 0.64% glacial acetic acid (v/v), 6.7% isopropanol (v/v) and 34 mM Chloramine T. This mixture was incubated at 30° C. for 25 minutes with shaking at 400 rpm. A separate reaction solution, with final concentrations added to the above mixture consisted of 536 mM p-dimethylaminobenzaldehyde (4-DMAB), 12% HC1 (v/v) and 28% isopropanol (v/v) and was incubated for 25 minutes at 65° C. with shaking at 250 rpm. The absorbance was measured immediately at 560 nm using a spectrophotometer. The molecular weight of collagen used and the number of hydroxyproline sites and prolines in the helical region are needed to calculate percent hydroxyproline.
In-vitro hydroxylation in lysate was performed on cells lysed at pH 12 using NaPO4 buffer followed by mixing with 0.1 mM FeSO4, 2 mM ascorbic acid, 25 mM DTT and 25 mM alpha-ketoglutaric acid. The mixture was adjusted to pH 7.5 and incubated for 3 hours at 32° C. by shaking in an incubator for the reaction to proceed. Following completion of the reaction, the pH was dropped to 4 and the reaction was mixed overnight (~ 18 hours) at 25° C. and centrifuged at ~ 7,000 xg to harvest the supernatant. The supernatant was dialyzed against water or buffer and used in the hydroxyproline assay.
Freshly harvested fermentation broth, consisting of media and cells, is spun at 17,000 xg for 5 minutes to create a cell pellet and supernatant. This supernatant is poured off and called ferm-sup, it can now be frozen.
The ferm-sup is thawed if frozen, and 750 uL aliquoted into 1.5 mL microcentrifuge tubes. Reaction components are added to the tubes to a final concentration of 25 mM Alpha-ketoglutarate, 25 mM DTT, 2 mM Ascorbate, 0.1 mM Iron Sulfate. Purified collagen is then added to the tubes, in the experiment 500 ug, 100 ug, and 50ug were added from the same stock. The tubes are then placed into the heat block of a thermomixer at 32C and left shaking at 3000 rpm for 3 hours. After the reaction the samples are run on SDS-PAGE gel and the bands corresponding the collagen cut out and sent for Liquid Chromatography Mass-Spec to determine their hydroxylation state. Since pp766 and pp767 excrete their own collagen and it has not been cleaved during the purification process, it runs slightly higher than the spiked in collagen. These are represented as “endogenous”, meaning to the ferm-sup and strain, and “PP685” the strain which we derived the purified collagen from. The reported hydroxylation state of PP685 collagen before the reaction is 4%.
A sample solution which contains at least 50 µg of protein to 200 µl with 100 mM Tris-HCl, pH 8.5 is used 55 µg of Abcam recombinant Human Collagen (Abcam, catalog # ab73160,) is used as the positive control. 800 µL of methanol is added to the sample, mixed and stored at -80° C. overnight. The samples are spun at 21,000 xg for 30 min at 4° C. The supernatant is aspirated and, 5-10 µl is left in the tube so as not to disturb the pellet. The pellet is washed twice with 500 µl of cold acetone (100% (v/v) each time. After each wash, it is spun at 21,000 xg for 10 min. The pellet is air dried under hood for 20 to 25 min. If the samples are not dry after 25 min, they are left in the hood until they are dry. To the air-dried pellet, 30 µL of 100 mM Tris-HCl, pH 8.5, 8 M Urea (Sigma, catalog # U5128) is added, and gently mixed to resuspend. If the sample is not totally dissolved, it is spun at 21,000 xg for 15 min. 1.5 µL of 100 mM TCEP (Sigma, catalog # 68957) solution is added to the sample. The sample is incubated at room temperature for 30 min in the dark. 0.6 µL of 500 mM chloroacetamide is added (Sigma catalog # C0267) to the sample. The sample is incubated in the dark at room temperature for another 30 min. 90 µL of 100 mM Tris-HCl, pH 8.5 is added to the sample. 0.6 µL of 500 mM CaC12 is added to the sample. To each sample, 10 µL Trypsin (Promega catalog # V5111) at 0.1 ug/uL is added. The samples are incubated at 37° C. for 18 hours in thethermomixer at 900 RPM. 8 µL of formic acid is added to quench the digestion reaction. 100 µL of sample is tranferred to a mass-spectrometry vial.The samples are tested by Agilent LC-QTOF system (LC: Agilent 1290 Infinity II, MS: Agilent 6545XT). The samples are first separated by an Agilent Peptide Mapping Column held at 50° C. Pure water with 0.1% formic acid is used as mobile phase A while acetonitrile/water (95%/5%, v/v) is used as mobile phase B. The sample is measured in positive mode with Auto MS/MS function. 8 max precursors per cycle. The acquired data is processed by BioConfirm software (Agilent) where the data is searched against predefine collagen sequence. The result in Bioconfirm is exported as a .csv file and then processed by an in-house python script to calculate the Proline Hydroxylation%. For every proline detected in the experiment, the script sums up the peak area of its hydroxylated version (SUMHyP)and non-hydroxylated version (SUMnonHyP), respectively. For each proline, its own Hydroxylation% = SUMHyP/ (SUMHyP + SUMnonHyP). At last, the average Hydroxylation% of all the detected Proline is reported.
SEQ ID NO 1: Mimivirus P4H codon optimized nucleotide sequence for E. coli:
SEQ ID NO 2: Mimivirus P4H amino acid sequence in E. coli:
SEQ ID NO 3: Mimi virus Protein sequence in Pichia.
SEQ ID NO 4: Codon optimized gene sequence (for Pichia).
SEQ ID NO 5: Codon optimized Mimivirus P4H gene sequence with secretion signal (for Pichia).
SEQ ID NO 6: PBCV-1 protein sequence.
SEQ ID NO 7: Cr-1 protein sequence.
SEQ ID NO 8: Arabidopsis thaliana protein sequence.
SEQ ID NO 9: MMV-398
SEQ ID NO 10: MMV-589
SEQ ID NO 11: MMV-619
SEQ ID NO 12: MMV-644
SEQ ID NO 13: MMV-580
SEQ ID NO 14: MMV-630
SEQ ID NO 15: primer
SEQ ID NO 16: primer
SEQ ID NO 17: primer
SEQ ID NO 18: primer
SEQ ID NO 19: MM-0579
SEQ ID NO 20: MM-0580
SEQ ID NO 21: MM-1569
SEQ ID NO 22: MM-1570
SEQ ID NO 23: MM-0784
SEQ ID NO 24: Protein sequence in PP681
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2021/017861 | 2/12/2021 | WO |
Number | Date | Country | |
---|---|---|---|
62976632 | Feb 2020 | US |