The field of the invention relates to promoters from fungal cells such as Pichia pastoris and methods of use thereof.
The methylotrophic yeast Pichia pastoris is one of the most widely used expression hosts for genetic engineering. This ascomycetous single-celled budding yeast has been used for the heterologous expression of hundreds of proteins (Lin-Cereghino, Curr Opin Biotech, 2002; Macauley-Patrick, Yeast, 2005). As a protein expression system, P. pastoris provides the advantages of a microbial system with facile genetics, shorter cycle times and the capability of achieving high cell densities. Secreted protein productivities have routinely been reported in the multi-gram per liter ranges. Several promoter systems are available for expression of proteins, for example, the methanol-inducible AOX1 promoter. The AOX1 promoter is a desirable aspects of the P. pastoris system because it is tightly regulated and highly induced on methanol (Cregg, Biotechnology, 1993, 11:905-910). The native Aox1p can be expressed up to 30% of total cellular protein when cells are grown on methanol. One drawback to this system is that cultivation on methanol during large scale fermentation can be complicated.
Constitutive promoter systems have been developed using the GAPDH promoter and more recently the TEF promoter (Waterham, Gene 1997, 186: 37-44; Ahn, Appl Microb Biotech, 2007, 74:601-608). These promoters are not as strong as AOX1, but, in some instances have proven to yield higher levels of secreted product than expression by AOX1, probably due to cultivation on a more energetically rich carbon source such as glycerol or glucose.
Importantly, P. pastoris is a eukaryote which provides the further advantage of having basic machinery for protein folding and post-translational modifications. Recent progress in the field, including humanization of the P. pastoris N-glycosylation pathway and a better understanding of the yeast secretory pathway, has resulted in the need to express multiple heterologous genes in the same strain, in some cases up to a dozen or more (Hamilton, Science, 2006, 313: 1441-1443; Wildt, Nat Rev Microbiol, 2005, 3: 119-128). Consequently, bottlenecks in strain engineering can arise with the availability of expression tools such as gene regulatory elements (i.e., promoters) and selection markers to introduce them. Several selectable markers have been developed for gene expression in P. pastoris including the recyclable URA5 system and multiple gene cassettes can be linked to the same marker to alleviate the problem of introduction of large numbers of genes. Moreover, a number of useful promoters have been identified aside from those named above including several methanol-inducible promoters such as AOX2 (the isogene of AOX1), Dihydroxyacetone Synthase (DAS), Formaldehyde Synthase (FLD1), and PEX8, other genes in core metabolism such as Isocitrate Lyase (ICL1), phosphate inducible PHO89, as well as the copper inducible heterologous S. cerevisiae CUP1 (Kobayashi, J. Biosci. & Bioeng., 2000, 89:479-484; Tschopp, Nuc. Acids. Res., 1987, 15; 3859-3876; Resina, J. Biotech., 2004, 109: 103-113; Menendez, Yeast, 2003, 20: 1097-1108; Ahn, AEM, 2009; Koller, Yeast, 2000, 96:651-656; U.S. Pat. No. 4,855,231). However, many of these promoter systems are not compatible with each other or would require starving for multiple nutrients, which can complicate bioprocess development. Moreover, many promoters that are considered to be constitutive, such as core metabolism and glycolysis genes, are significantly up- or down-regulated when carbon source conditions vary. For example, GAPDH is significantly down-regulated during cultivation on methanol, which can impact the expression of the desired gene of interest (Zhang, J. Ind. Micro. & Biotech., 2007, 34: 117-122). Therefore, additional useful promoters would be of value and interest to the field. Here, we present the identification of several novel P. pastoris promoters under relevant bioprocess conditions, and provide examples to demonstrate the utility of these promoters for heterologous gene expression.
The present invention provides an isolated hybrid polynucleotide comprising a promoter selected from the group consisting of: Pichia pastoris GAPDH promoter; Pichia pastoris Pp02g05010 (PpPIR1) promoter; Pichia pastoris Pp05g08520 (ScCCW12) promoter; Pichia pastoris Pp01g10900 (ScCHT2) promoter; Pichia pastoris Pp05g07900 (ScAAC2/PET9) promoter; Pichia pastoris Pp02g01530 (ScPST1) promoter; Pichia pastoris Pp05g00700 (unknown) promoter; Pichia pastoris Pp02g04110 (ScPOR1) promoter; Pichia pastoris Pp01g03600 (ScBGL2) promoter; Pichia pastoris Pp01g14410 (ScACO1) promoter; Pichia pastoris Pp01g09650 (ScYHR021C) promoter; Pichia pastoris Pp01g02780 (ScYLR388W) promoter; Pichia pastoris Pp03g09940 (ScPIL1) promoter; Pichia pastoris Pp02g10710 (ScMDH1) promoter; Pichia pastoris 01g09290 (ScFBA1) promoter; Pichia pastoris Pp03g03520 (PpDAS2) promoter; Pichia pastoris Pp03g08760 (ScCWP1) promoter; Pichia pastoris Pp03g00990 (ScYGR201c) promoter; Pichia pastoris Pp02g05270 (AN2948.2) promoter; Pichia pastoris Pp02g12310 (ScDUR3) promoter; Pichia pastoris Pp03g05430 (ScTHI4) promoter; Pichia pastoris Pp03g03490 (AN2957.2) promoter; Pichia pastoris Pp05g09410 (ScTHI13) promoter; Pichia pastoris Pp02g07970 (ScPEX11/PMP27) promoter; Pichia pastoris Pp01g12200 (AN7917.2) promoter; Pichia pastoris Pp03g11380 (ScPMP47) promoter; Pichia pastoris Pp03g08340 (unknown) promoter; Pichia pastoris Pp05g04390 (ScTIR3) promoter; Pichia pastoris Pp01g08380 (ScYIL057c) promoter; Pichia pastoris Pp01g05090 (ScSAY1) promoter; Pichia pastoris Pp01g13950 (ScTPN1) promoter; Pichia pastoris Pp03g11420 (ScARO10) promoter; Pichia pastoris Pp02g11560 (ScMET6) promoter; Pichia pastoris Pp01g08650 (ScYNL067W) promoter; Pichia pastoris Pp01g01850 (PpPDHbeta1) promoter; Pichia pastoris Pp03g03020 (ScSAM2) promoter; and Pichia pastoris Pp03g02860 (PpSAHH) promoter (e.g., any of: nucleotides 1-1000 of SEQ ID NO: 14; nucleotides 1-1000 of SEQ ID NO: 15; nucleotides 1-1000 of SEQ ID NO: 16; nucleotides 1-1000 of SEQ ID NO: 17; nucleotides 1-1000 of SEQ ID NO: 18; nucleotides 1-1001 of SEQ ID NO: 19; nucleotides 1-1000 of SEQ ID NO: 20; nucleotides 1-1000 of SEQ ID NO: 21; nucleotides 1-1000 of SEQ ID NO: 22; nucleotides 1-1000 of SEQ ID NO: 23; nucleotides 1-1000 of SEQ ID NO: 24; nucleotides 1-1000 of SEQ ID NO: 25; nucleotides 1-1000 of SEQ ID NO: 26; nucleotides 1-1000 of SEQ ID NO: 27; nucleotides 1-1000 of SEQ ID NO: 28; nucleotides 1-1000 of SEQ ID NO: 29; and SEQ ID NOs: 47-63 and 70-76); operably linked to a heterologous polynucleotide (e.g., encoding an interferon or an immunoglobulin, for example, an immunoglobulin chain of an antibody or antigen-binding fragment thereof that binds specifically to VEGF, HER1, HER2, HER3, glycoprotein IIb/IIIa, CD52, IL-2R alpha receptor (CD25), epidermal growth factor receptor (EGFR), Complement system protein C5, CD11a, TNF alpha, CD33, IGF1R, CD20, T cell CD3 Receptor, alpha-4 (alpha 4) integrin, PCSK9, immunoglobulin E (IgE), RSV F protein or ErbB2; or, VEGF, HER1, HER2, HER3, glycoprotein IIb/IIIa, CD52, IL-2R alpha receptor (CD25), epidermal growth factor receptor (EGFR), Complement system protein C5, CD11a, TNF alpha, CD33, IGF1R, CD20, T cell CD3 Receptor, alpha-4 (alpha 4) integrin, PCSK9, immunoglobulin E (IgE), RSV F protein or ErbB2 polypeptide; or an immunogenic polypeptide fragment thereof; or a detectable reporter such as green fluorescent protein, Aequorea victoria GFP mutant 3, luciferase, Renilla luciferase, Photinus pyralis luciferase, Photinus pyralis luciferase slk mutant, Vibrio fischeri luxA, Vibrio fischeri luxB, Vibrio fischeri luxC, Vibrio fischeri luxD, Vibrio fischeri luxE, Vibrio fischeri luxAB, Vibrio fischeri luxCDABE, Vibrio harveyi luxA, Vibrio harveyi luxB, Vibrio harveyi luxC, Vibrio harveyi luxD, Vibrio harveyi luxE, Vibrio harveyi luxAB, Vibrio harveyi luxCDABE, Photorhabdus luminscens LuxA, Photorhabdus luminscens LuxB, Photorhabdus luminscens LuxC, Photorhabdus luminscens LuxD, Photorhabdus luminscens LuxE, Photorhabdus luminscens LuxCDABE, E. coli lacZ, the Aequorea victoria Aequorin, KanMX, pat1, nat1, hph, CAT, Sh Ble, GUS, CYH2 or CAN1. In an embodiment of the invention, a hybrid polynucleotide of the present invention is in an isolated vector and/or an isolated host cell (e.g., wherein the host comprises a vector that comprise the hybrid polynucleotide). Examples of host cells include fungal cells such as a Pichia cell, Pichia pastoris, Pichia flnlandica, Pichia trehalophila, Pichia koclamae, Pichia membranaefaciens, Pichia minuta (Ogataea minuta, Pichia lindneri), Pichia opuntiae, Pichia thermotolerans, Pichia salictaria, Pichia guercuum, Pichia pijperi, Pichia stiptis, Pichia methanolica, Pichia, Saccharomyces cerevisiae, Saccharomyces, Hansenula polymorpha, Kluyveromyces, Kluyveromyces lactis, Candida albicans, Aspergillus nidulans, Aspergillus niger, Aspergillus oryzae, Trichoderma reesei, Chrysosporium lucknowense, Fusarium, Fusańum gramineum, Fusarium venenatum and Neuraspora crassa. The present invention further comprises a composition comprising the host cell and growth culture medium (e.g., wherein the medium also includes methanol and/or the polypeptide encoded by the heterologous polynucleotide, for example, wherein the polypeptide is secreted from the host cell).
The present invention also provides a method for making a polypeptide comprising introducing, into an isolated fungal host cell (e.g., a Pichia cell, Pichia pastoris, Pichia flnlandica, Pichia trehalophila, Pichia koclamae, Pichia membranaefaciens, Pichia minuta (Ogataea minuta, Pichia lindneri), Pichia opuntiae, Pichia thermotolerans, Pichia salictaria, Pichia guercuum, Pichia pijperi, Pichia stiptis, Pichia methanolica, Pichia, Saccharomyces cerevisiae, Saccharomyces, Hansenula polymorpha, Kluyveromyces, Kluyveromyces lactic, Candida albicans, Aspergillus nidulans, Aspergillus niger, Aspergillus oryzae, Trichoderma reesei, Chrysosporium lucknowense, Fusarium, Fusańum gramineum, Fusarium venenatum and Neuraspora crassa), an isolated hybrid polynucleotide comprising a promoter selected from the group consisting of Pichia pastoris GAPDH promoter; Pichia pastoris Pp02g05010 (PpPIR1) promoter; Pichia pastoris Pp05g08520 (ScCCW12) promoter; Pichia pastoris Pp01g10900 (ScCHT2) promoter; Pichia pastoris Pp05g07900 (ScAAC2/PET9) promoter; Pichia pastoris Pp02g01530 (ScPST1) promoter; Pichia pastoris 01g09290 (ScFBA1) promoter; Pichia pastoris Pp03g03520 (PpDAS2) promoter; Pichia pastoris Pp03g08760 (ScCWP1) promoter; Pichia pastoris Pp03g00990 (ScYGR201c) promoter; Pichia pastoris Pp02g05270 (AN2948.2) promoter; Pichia pastoris Pp02g12310 (ScDUR3) promoter; Pichia pastoris Pp03g05430 (ScTHI4) promoter; Pichia pastoris Pp03g03490 (AN2957.2) promoter; Pichia pastoris Pp05g09410 (ScTHI13) promoter; Pichia pastoris Pp02g07970 (ScPEX11/PMP27) promoter; Pichia pastoris Pp01g12200 (AN7917.2) promoter; Pichia pastoris Pp03g11380 (ScPMP47) promoter; Pichia pastoris Pp03g08340 (unknown) promoter; Pichia pastoris Pp05g04390 (ScTIR3) promoter; Pichia pastoris Pp01g08380 (ScYIL057c) promoter; Pichia pastoris Pp01g05090 (ScSAY1) promoter; Pichia pastoris Pp01g13950 (ScTPN1) promoter; Pichia pastoris Pp03g11420 (ScARO10) promoter; Pichia pastoris Pp02g11560 (ScMET6) promoter; Pichia pastoris Pp01g08650 (ScYNL067W) promoter; Pichia pastoris Pp01g01850 (PpPDHbeta1) promoter; Pichia pastoris Pp03g03020 (ScSAM2) promoter; and Pichia pastoris Pp03g02860 (PpSAHH) promoter; operably linked to a heterologous polynucleotide; and culturing the host cell under conditions wherein said polynucleotide is expressed; optionally wherein said host cell is cultured in the presence of methanol.
The present invention further comprises a method for inducing expression of a heterologous polynucleotide in a fungal host cell, wherein said host cell comprises a promoter selected from the group consisting of: Pichia pastoris 01g09290 (ScFBA1) promoter; Pichia pastoris Pp03g03520 (PpDAS2) promoter; Pichia pastoris Pp03g08760 (ScCWP1) promoter; Pichia pastoris Pp03g00990 (ScYGR201c) promoter; Pichia pastoris Pp02g05270 (AN2948.2) promoter; Pichia pastoris Pp02g12310 (ScDUR3) promoter; Pichia pastoris Pp03g05430 (ScTHI4) promoter; Pichia pastoris Pp03g03490 (AN2957.2) promoter; Pichia pastoris Pp05g09410 (ScTHI13) promoter; Pichia pastoris Pp02g07970 (ScPEX11/PMP27) promoter; Pichia pastoris Pp01g12200 (AN7917.2) promoter; Pichia pastoris Pp03g11380 (ScPMP47) promoter; Pichia pastoris Pp03g08340 (unknown) promoter; Pichia pastoris Pp05g04390 (ScTIR3) promoter; Pichia pastoris Pp01g08380 (ScYIL057c) promoter; Pichia pastoris Pp01g05090 (ScSAY1) promoter; Pichia pastoris Pp01g13950 (ScTPN1) promoter; operably linked to the heterologous polynucleotide, comprising culturing the fungal host cell in a growth medium comprising methanol.
The present invention further comprises a method for repressing expression of a heterologous polynucleotide in a fungal host cell, wherein said host cell comprises a promoter selected from the group consisting of: Pichia pastoris Pp03g11420 (ScARO10) promoter; Pichia pastoris Pp02g11560 (ScMET6) promoter; Pichia pastoris Pp01g08650 (ScYNL067W) promoter; Pichia pastoris Pp01g01850 (PpPDHbeta1) promoter; Pichia pastoris Pp03g03020 (ScSAM2) promoter; and Pichia pastoris Pp03g02860 (PpSAHH) promoter; operably linked to the heterologous polynucleotide, comprising culturing the fungal host cell in a growth medium comprising methanol.
A hybrid polynucleotide of the present invention refers to a polynucleotide comprising a promoter of the present invention operably linked a heterologous polynucleotide.
A heterologous polynucleotide e.g., that is operably linked to a promoter of the present invention, refers to a polynucleotide encoding a polypeptide that is not naturally contiguous with or operably linked to the nucleotide sequence of the promoter of the present invention. Heterologous polynucleotides encoding a heterologous polypeptide (e.g., an immunogenic polypeptide or oligopeptide) include for example, polynucleotides encoding a detectable reporter, interferon (interferon alpha 2a or interferon alpha 2b) or an immunoglobulin (e.g., a heavy chain and/or light chain, e.g., linked to an immunoglobulin light chain constant domain such as kappa or lambda; or heavy chain constant domain such as gamma, e.g., gamma, gamma-1, gamma-2, gamma-3 or gamma-4) which can form part of an antibody or antigen-binding fragment thereof such as, anti-VEGF, anti-HER1, anti-HER2, anti-HER3, anti-glycoprotein IIb/IIIa, anti-CD52, anti-IL-2R alpha receptor (CD25), anti-epidermal growth factor receptor (EGFR), anti-Complement system protein C5, anti-CD20, anti-CD11a, anti-TNF alpha, anti-CD33, anti-IGF1R, anti-CD20, anti-T cell CD3 Receptor, anti-alpha-4 (alpha 4) integrin, anti-PCSK9, anti-immunoglobulin E (IgE), anti-RSV F protein or anti-ErbB2.
In an embodiment of the invention, a detectable reporter is green fluorescent protein, such as Aequorea victoria GFP mutant 3, luciferase, Renilla luciferase, Photinus pyralis luciferase, Photinus pyralis luciferase slk mutant, Vibrio fischeri luxA, Vibrio fischeri luxB, Vibrio fischeri luxC, Vibrio fischeri luxD, Vibrio fischeri luxE, Vibrio fischeri luxAB, Vibrio fischeri luxCDABE, Vibrio harveyi luxA, Vibrio harveyi luxB, Vibrio harveyi luxC, Vibrio harveyi luxD, Vibrio harveyi luxE, Vibrio harveyi luxAB, Vibrio harveyi luxCDABE, Photorhabdus luminscens LuxA, Photorhabdus luminscens LuxB, Photorhabdus luminscens LuxC, Photorhabdus luminscens LuxD, Photorhabdus luminscens LuxE, Photorhabdus luminscens LuxCDABE, E. coli lacZ, the Aequorea victoria Aequorin gene, KanMX, pat1, nat1, hph, CAT, Sh Ble, GUS, CYH2 or CAN1.
“MeOH” is methanol.
In accordance with the present invention there may be employed conventional molecular biology, microbiology, and recombinant DNA techniques within the skill of the art. Such techniques are explained in the literature. See, e.g., Sambrook, Fritsch & Maniatis, Molecular Cloning: A Laboratory Manual, Second Edition (1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (herein “Sambrook, at al., 1989”); DNA Cloning: A Practical Approach, Volumes I and II (D. N. Glover ed. 1985); Oligonucleotide Synthesis (M. J. Gait ed. 1984); Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. (1985)); Transcription And Translation (B. D. Hames & S. J. Higgins, eds. (1984)); Animal Cell Culture (R. I. Freshney, ed. (1986)); Immobilized Cells And Enzymes (IRL Press, (1986)); B. Perbal, A Practical Guide To Molecular Cloning (1984); F. M. Ausubel, et al. (eds.), Current Protocols in Molecular Biology, John Wiley & Sons, Inc. (1994).
A “polynucleotide”, “nucleic acid” includes DNA and RNA in single stranded form, double-stranded form or otherwise.
A “polynucleotide sequence” or “nucleotide sequence” is a series of nucleotide bases (also called “nucleotides”) in a nucleic acid, such as DNA or RNA, and means a series of two or more nucleotides. Any polynucleotide comprising a nucleotide sequence set forth herein (e.g., promoters of the present invention) forms part of the present invention.
A “coding sequence” or a sequence “encoding” an expression product, such as an RNA or polypeptide is a nucleotide sequence (e.g., heterologous polynucleotide) that, when expressed, results in production of the product (e.g., a heterologous polypeptide such as an immunoglobulin heavy chain and/or light chain).
As used herein, the term “oligonucleotide” refers to a nucleic acid, generally of no more than about 100 nucleotides (e.g., 30, 40, 50, 60, 70, 80, or 90), that may be hybridizable to a polynucleotide molecule. Oligonucleotides can be labeled, e.g., by incorporation of 32P-nucleotides, 3H-nucleotides, 14C-nucleotides, 35S-nucleotides or nucleotides to which a label, such as biotin, has been covalently conjugated.
A “protein”, “peptide” or “polypeptide” (e.g., a heterologous polypeptide such as an immunoglobulin heavy chain and/or light chain) includes a contiguous string of two or more amino acids.
A “protein sequence”, “peptide sequence” or “polypeptide sequence” or “amino acid sequence” refers to a series of two or more amino acids in a protein, peptide or polypeptide.
The term “isolated polynucleotide” or “isolated polypeptide” includes a polynucleotide or polypeptide, respectively, which is partially or fully separated from other components that are normally found in cells or in recombinant DNA expression systems or any other contaminant. These components include, but are not limited to, cell membranes, cell walls, ribosomes, polymerases, serum components and extraneous genomic sequences. The scope of the present invention includes the isolated polynucleotides set forth herein, e.g., the promoters set forth herein; and methods related thereto, e.g., as discussed herein.
An isolated polynucleotide or polypeptide will, preferably, be an essentially homogeneous composition of molecules but may contain some heterogeneity.
“Amplification” of DNA as used includes the use of polymerase chain reaction (PCR) to increase the concentration of a particular DNA sequence within a mixture of DNA sequences. For a description of PCR see Saiki, et al., Science (1988) 239:487.
In general, a “promoter” or “promoter sequence” is a DNA regulatory region capable of binding an RNA polymerase in a cell (e.g., directly or through other promoter-bound proteins or substances) and initiating transcription of a coding sequence to which it operably links. A “promoter of the present invention” includes any of the following promoters:
Pichia pastoris GAPDH promoter (e.g., wherein any sequence operably linked to the promoter is also operably linked to a downstream CYC1 terminator);
Pichia pastoris Pp02g05010 (PpPIR1) promoter;
Pichia pastoris Pp05g08520 (ScCCW12) promoter;
Pichia pastoris Pp01g10900 (ScCHT2) promoter;
Pichia pastoris Pp05g07900 (ScAAC2/PET9) promoter;
Pichia pastoris Pp02g01530 (ScPST1) promoter;
Pichia pastoris Pp05g00700 (unknown) promoter;
Pichia pastoris Pp02g04110 (ScPOR1) promoter;
Pichia pastoris Pp01g03600 (ScBGL2) promoter;
Pichia pastoris Pp01g14410 (ScACO1) promoter;
Pichia pastoris Pp01g09650 (ScYHR021C) promoter;
Pichia pastoris Pp01g02780 (ScYLR388W) promoter;
Pichia pastoris Pp03g09940 (ScPIL1) promoter;
Pichia pastoris Pp02g10710 (ScMDH1) promoter;
Pichia pastoris Pp01g09290 (ScFBA1) promoter;
Pichia pastoris Pp03g03520 (PpDAS2) promoter;
Pichia pastoris Pp03g08760 (ScCWP1) promoter;
Pichia pastoris Pp03g00990 (ScYGR201c) promoter;
Pichia pastoris Pp02g05270 (AN2948.2) promoter;
Pichia pastoris Pp02g12310 (ScDUR3) promoter;
Pichia pastoris Pp03g05430 (ScTHI4) promoter;
Pichia pastoris Pp03g03490 (AN2957.2) promoter;
Pichia pastoris Pp05g09410 (ScTHI13) promoter;
Pichia pastoris Pp02g07970 (ScPEX11/PMP27) promoter;
Pichia pastoris Pp01g12200 (AN7917.2) promoter;
Pichia pastoris Pp03g11380 (ScPMP47) promoter;
Pichia pastoris Pp03g08340 (unknown) promoter;
Pichia pastoris Pp05g04390 (ScTIR3) promoter;
Pichia pastoris Pp01g08380 (ScYIL057c) promoter;
Pichia pastoris Pp01g05090 (ScSAY1) promoter;
Pichia pastoris Pp01g13950 (ScTPN1) promoter;
Pichia pastoris Pp03g11420 (ScARO10) promoter;
Pichia pastoris Pp02g11560 (ScMET6) promoter;
Pichia pastoris Pp01g08650 (ScYNL067W) promoter;
Pichia pastoris Pp01g01850 (PpPDHbeta1) promoter;
Pichia pastoris Pp03g03020 (ScSAM2) promoter; or
Pichia pastoris Pp03g02860 (PpSAHH) promoter;
(e.g., nucleotides 1-1000 of SEQ ID NO: 14; nucleotides 1-1000 of SEQ ID NO: 15; nucleotides 1-1000 of SEQ ID NO: 16; nucleotides 1-1000 of SEQ ID NO: 17; nucleotides 1-1000 of SEQ ID NO: 18; nucleotides 1-1001 of SEQ ID NO: 19; nucleotides 1-1000 of SEQ ID NO: 20; nucleotides 1-1000 of SEQ ID NO: 21; nucleotides 1-1000 of SEQ ID NO: 22; nucleotides 1-1000 of SEQ ID NO: 23; nucleotides 1-1000 of SEQ ID NO: 24; nucleotides 1-1000 of SEQ ID NO: 25; nucleotides 1-1000 of SEQ ID NO: 26; nucleotides 1-1000 of SEQ ID NO: 27; nucleotides 1-1000 of SEQ ID NO: 28; nucleotides 1-1000 of SEQ ID NO: 29; and SEQ ID NOs: 47-63 and 70-75) and/or functional variants thereof. Promoter functional variants are discussed in greater detail below.
A coding sequence (e.g., of a heterologous polynucleotide, e.g., reporter gene or immunoglobulin heavy and/or light chain) is “operably linked to”, “under the control of”, “functionally associated with” or “operably associated with” a transcriptional and translational control sequence (e.g., a promoter of the present invention) when the sequence directs RNA polymerase mediated transcription of the coding sequence into RNA, preferably mRNA, which then may be RNA spliced (if it contains introns) and, optionally, translated into a protein encoded by the coding sequence. A promoter of the present invention operably linked to a coding sequence forms part of the present invention. In an embodiment of the invention, a polynucleotide is operably linked to a transcriptional terminator sequence, e.g., any of those that are included in SEQ ID NOs: 14-29.
The scope of the present invention includes cassettes comprising any of the promoters of the present invention upstream of a polylinker sequence into which a polynucleotide (e.g., a heterologous polynucleotide) can be inserted if desired, optionally, operably linked to a transcriptional terminator sequence (e.g., any of SEQ ID NOs: 14-29). Methods for recombining a cassette (e.g., any of SEQ ID NOs: 14-29) with a polynucleotide (e.g., a heterologous polynucleotide) comprising cleaving the polylinker (e.g., with a restriction endonuclease) and inserting the polynucleotide into the cassette at the cleaved polylinker and religating the recombined polynucleotides together, form part of the present invention as does any such recombined cassette, e.g., formed by such a method. Host cells and uses of such recombined cassettes for expressing a polypeptide (e.g., heterologous polypeptide) discussed herein form part of the present invention as well.
The present invention includes vectors which comprise promoters of the invention optionally operably linked to a heterologous polynucleotide. The term “vector” includes a vehicle (e.g., a plasmid) by which a DNA or RNA sequence can be introduced into a host cell, so as to transform the host and, optionally, promote expression and/or replication of the introduced sequence. In general, a plasmid is circular, includes an origin (e.g., 2 μm origin) and, preferably includes a selectable marker. In plasmids which can be maintained in yeast, commonly used yeast markers include URA3, HIS3, LEU2, TRP1 and LYS2, which complement specific auxotrophic mutations in a yeast host cell, such as ura3-52, his3-D1, leu2-D1, trp1-D1 and lys2-201, respectively. If the plasmid can be maintained in E. coli, it may include a bacterial origin (ori) and/or a selectable market such as the β-lactamase gene (bla or AMPr). Commonly used yeast/E. coli shuttle vectors are the Yip (see Myers et al., Gene 45: 299-310, (1986)), YEp (see Myers et al., Gene 45: 299-310, (1986)), YCp and YRp plasmids. The YIp integrative vectors do not replicate autonomously, but integrate into the genome at low frequencies by homologous recombination. The YEp yeast episomal plasmid vectors replicate autonomously because of the presence of a segment of the yeast 2 μm plasmid that serves as an origin of replication (2 μm ori). The 2 μm ori is responsible for the high copy-number and high frequency of transformation of YEp vectors. The YCp yeast centromere plasmid vectors are autonomously replicating vectors containing centromere sequences, CEN, and autonomously replicating sequences, ARS. The YCp vectors are typically present at very low copy numbers, from 1 to 3 per cell. Autonomously replicating plasmids (YRp) which carry a yeast origin of replication (ARS sequence; but not centromere) that allows the transformed plasmids to be propagated several hundred-fold. YIp, YEp, YCp and YRp are commonly known in the art and widely available. Another acceptable yeast vector is a yeast artificial chromosome (MAC). A yeast artificial chromosome is a biological vector. It is an artificially constructed chromosome and contains the telomeric, centromeric, and replication origin sequences needed for replication in yeast cells (see Marchuk et al., Nucleic Acids Res. 16(15):7743 (1988); Rech et al., Nucleic Acids Res. 18(5):1313 (1990)).
Vectors that could be used in this invention include plasmids, viruses, bacteriophage, integratable DNA fragments, and other vehicles that may facilitate introduction of the nucleic acids into the genome of a host cell (e.g., Pichia pastoris). Plasmids are the most commonly used form of vector but all other forms of vectors which serve a similar function and which are, or become, known in the art are suitable for use herein. See, e.g., Pouwels, et al., Cloning Vectors: A Laboratory Manual, 1985 and Supplements, Elsevier, N.Y., and Rodriguez et al. (eds.), Vectors: A Survey of Molecular Cloning Vectors and Their Uses, 1988, Buttersworth, Boston, Mass.
A polynucleotide (e.g., a heterologous polynucleotide, e.g., encoding an immunoglobulin heavy chain and/or light chain), operably linked to a promoter of the present invention, may be expressed in an expression system. The term “expression system” means a host cell and compatible vector which, under suitable conditions, can express a protein or nucleic acid which is carried by the vector and introduced to the host cell. Common expression systems include fungal host cells (e.g., Pichia pastoris) and plasmid vectors, insect host cells and Baculovirus vectors, and mammalian host cells and vectors.
The term methanol-induction refers to increasing expression of a polynucleotide (e.g., a heterologous polynucleotide) operably linked to a methanol-inducible promoter of the present invention in a host cell by exposing the host cells to methanol.
The term methanol-repression refers to decreasing expression of a polynucleotide (e.g., a heterologous polynucleotide) operably linked to a methanol-repressible promoter of the present invention in a host cell by exposing the host cells to methanol.
The present invention also contemplates any superficial or slight modification to a promoter of the present invention. For example, the present invention includes any “functional variant” of any of: Pichia pastoris Pp02g05010 (PpPIR1) promoter; Pichia pastoris Pp05g08520 (ScCCW12) promoter; Pichia pastoris Pp01g10900 (ScCHT2) promoter; Pichia pastoris Pp05g07900 (ScAAC2/PET9) promoter; Pichia pastoris Pp02g01530 (ScPST1) promoter; Pichia pastoris Pp05g00700 (unknown) promoter; Pichia pastoris Pp02g04110 (ScPOR1) promoter; Pichia pastoris Pp01g03600 (ScBGL2) promoter; Pichia pastoris Pp01g14410 (ScACO1) promoter; Pichia pastoris Pp01g09650 (ScYHR021C) promoter; Pichia pastoris Pp01g02780 (ScYLR388W) promoter; Pichia pastoris Pp03g09940 (ScPIL1) promoter; Pichia pastoris Pp02g10710 (ScMDH1) promoter; Pichia pastoris 01g09290 (ScFBA1) promoter; Pichia pastoris Pp03g03520 (PpDAS2) promoter; Pichia pastoris Pp03g08760 (ScCWP1) promoter; Pichia pastoris Pp03g00990 (ScYGR201c) promoter; Pichia pastoris Pp02g05270 (AN2948.2) promoter; Pichia pastoris Pp02g12310 (ScDUR3) promoter; Pichia pastoris Pp03g05430 (ScTHI4) promoter; Pichia pastoris Pp03g03490 (AN2957.2) promoter; Pichia pastoris Pp05g09410 (ScTHI13) promoter; Pichia pastoris Pp02g07970 (ScPEX11/PMP27) promoter; Pichia pastoris Pp01g12200 (AN7917.2) promoter; Pichia pastoris Pp03g11380 (ScPMP47) promoter; Pichia pastoris Pp03g08340 (unknown) promoter; Pichia pastoris Pp05g04390 (ScTIR3) promoter; Pichia pastoris Pp01g08380 (ScYIL057c) promoter; Pichia pastoris Pp01g05090 (ScSAY1) promoter; Pichia pastoris Pp01g13950 (ScTPN1) promoter; Pichia pastoris Pp03g11420 (ScARO10) promoter; Pichia pastoris Pp02g11560 (ScMET6) promoter; Pichia pastoris Pp01g08650 (ScYNL067W) promoter; Pichia pastoris Pp01g01850 (PpPDHbeta1) promoter; Pichia pastoris Pp03g03020 (ScSAM2) promoter; or Pichia pastoris Pp03g02860 (PpSAHH) promoter (e.g., any of nucleotides 1-1000 of SEQ ID NO: 14; nucleotides 1-1000 of SEQ ID NO: 15; nucleotides 1-1000 of SEQ ID NO: 16; nucleotides 1-1000 of SEQ ID NO: 17; nucleotides 1-1000 of SEQ ID NO: 18; nucleotides 1-1001 of SEQ ID NO: 19; nucleotides 1-1000 of SEQ ID NO: 20; nucleotides 1-1000 of SEQ ID NO: 21; nucleotides 1-1000 of SEQ ID NO: 22; nucleotides 1-1000 of SEQ ID NO: 23; nucleotides 1-1000 of SEQ ID NO: 24; nucleotides 1-1000 of SEQ ID NO: 25; nucleotides 1-1000 of SEQ ID NO: 26; nucleotides 1-1000 of SEQ ID NO: 27; nucleotides 1-1000 of SEQ ID NO: 28; nucleotides 1-1000 of SEQ ID NO: 29; and SEQ ID NOs: 47-63 and 70-75). A functional variant of a promoter includes any sequence variant (e.g., comprising one or more point mutations and/or deletions) that retains the ability to cause the expression of an operably linked polynucleotide (e.g., of a coding sequence) at any detectable level or at a level at least equal to that of the corresponding non-variant promoter. Methods for determining whether a particular promoter (e.g., comprising one or more point mutations and/or deletions) promotes expression (e.g., transcription) of a sequence to which it is functionally linked are conventional and well known in the art. For example, expression can be determined by Northern blot detection of RNA; or, ELISA or Western blot detection of protein encoded by the operably linked coding sequence.
The present invention includes polynucleotides which hybridize to a promoter of the present invention or a complement thereof (e.g., any of nucleotides 1-1000 of SEQ ID NO: 14; nucleotides 1-1000 of SEQ ID NO: 15; nucleotides 1-1000 of SEQ ID NO: 16; nucleotides 1-1000 of SEQ ID NO: 17; nucleotides 1-1000 of SEQ ID NO: 18; nucleotides 1-1001 of SEQ ID NO: 19; nucleotides 1-1000 of SEQ ID NO: 20; nucleotides 1-1000 of SEQ ID NO: 21; nucleotides 1-1000 of SEQ ID NO: 22; nucleotides 1-1000 of SEQ ID NO: 23; nucleotides 1-1000 of SEQ ID NO: 24; nucleotides 1-1000 of SEQ ID NO: 25; nucleotides 1-1000 of SEQ ID NO: 26; nucleotides 1-1000 of SEQ ID NO: 27; nucleotides 1-1000 of SEQ ID NO: 28; nucleotides 1-1000 of SEQ ID NO: 29; and SEQ ID NOs: 47-63 and 70-75) but which retain the ability to drive expression, e.g., at a detectable level or at a level at least equal to that of the corresponding non-variant promoter. Preferably, the polynucleotides hybridize under low stringency conditions, more preferably under moderate stringency conditions and most preferably under high stringency conditions. A polynucleotide is “hybridizable” to another polynucleotide when a single stranded form of the nucleic acid molecule (e.g., either strand) can anneal to the other nucleic acid molecule under the appropriate conditions of temperature and solution ionic strength (see Sambrook, at al., supra). The conditions of temperature and ionic strength determine the “stringency” of the hybridization. Low stringency hybridization conditions may be 55° C., 5×SSC, 0.1% SDS, 0.25% milk, and no formamide; or 30% formamide, 5×SSC, 0.5% SDS. Moderate stringency hybridization conditions are similar to the low stringency conditions except the hybridization is carried out in 40% formamide, with 5× or 6×SSC. High stringency hybridization conditions are similar to low stringency conditions except the hybridization conditions are carried out in 50% formamide, 5× or 6×SSC and, optionally, at a higher temperature (e.g., 57° C., 59° C., 60° C., 62° C., 63° C., 65° C. or 68° C.). In general, SSC is 0.15M NaCl and 0.015M sodium citrate. Hybridization requires that the two nucleic acids contain complementary sequences, although, depending on the stringency of the hybridization, mismatches between bases are possible. The appropriate stringency for hybridizing nucleic acids depends on the length of the nucleic acids and the degree of complementation, variables well known in the art. The greater the degree of similarity or homology between two nucleotide sequences, the higher the stringency under which the nucleic acids may hybridize. For hybrids of greater than 100 nucleotides in length, equations for calculating the melting temperature have been derived (see Sambrook, et al., supra, 9.50-9.51). For hybridization with shorter nucleic acids, i.e., oligonucleotides, the position of mismatches becomes more important, and the length of the oligonucleotide determines its specificity (see Sambrook, et al., supra, 11.7-11.8).
Also included in the present invention are polynucleotides comprising nucleotide sequences which are at least about 70% identical, preferably at least about 80% identical, more preferably at least about 90% identical and most preferably at least about 95% identical (e.g., 95%, 96%, 97%, 98%, 99%, 100%) to a promoter of the present invention (reference polynucleotide; e.g., any of nucleotides 1-1000 of SEQ ID NO: 14; nucleotides 1-1000 of SEQ ID NO 15; nucleotides 1-1000 of SEQ ID NO: 16; nucleotides 1-1000 of SEQ ID NO: 17; nucleotides 1-1000 of SEQ ID NO: 18; nucleotides 1-1001 of SEQ ID NO: 19; nucleotides 1-1000 of SEQ ID NO: 20; nucleotides 1-1000 of SEQ ID NO: 21; nucleotides 1-1000 of SEQ ID NO: 22; nucleotides 1-1000 of SEQ ID NO: 23; nucleotides 1-1000 of SEQ ID NO: 24; nucleotides 1-1000 of SEQ ID NO: 25; nucleotides 1-1000 of SEQ ID NO: 26; nucleotides 1-1000 of SEQ ID NO: 27; nucleotides 1-1000 of SEQ ID NO: 28; nucleotides 1-1000 of SEQ ID NO: 29; and SEQ ID NOs: 47-63 and 70-75) when the comparison is performed by a BLAST algorithm wherein the parameters of the algorithm are selected to give the largest match between the respective sequences over the entire length of the respective reference sequences; but which retain the ability to drive expression, e.g., at a detectable level or at a level at least equal to that of the corresponding non-variant promoter.
Functional variants of the promoters disclosed herein include truncations of the nucleotide sequences set forth herein (e.g., any of nucleotides 1-1000 of SEQ ID NO: 14; nucleotides 1-1000 of SEQ ID NO: 15; nucleotides 1-1000 of SEQ ID NO: 16; nucleotides 1-1000 of SEQ ID NO: 17; nucleotides 1-1000 of SEQ ID NO: 18; nucleotides 1-1001 of SEQ ID NO: 19; nucleotides 1-1000 of SEQ ID NO: 20; nucleotides 1-1000 of SEQ ID NO: 21; nucleotides 1-1000 of SEQ ID NO: 22; nucleotides 1-1000 of SEQ ID NO: 23; nucleotides 1-1000 of SEQ ID NO: 24; nucleotides 1-1000 of SEQ ID NO: 25; nucleotides 1-1000 of SEQ ID NO: 26; nucleotides 1-1000 of SEQ ID NO: 27; nucleotides 1-1000 of SEQ ID NO: 28; nucleotides 1-1000 of SEQ ID NO: 29; and SEQ ID NOs: 47-63 and 70-75) e.g., wherein the 5′ or 3′ end of the sequence is truncated by 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 50, 100, 200 or 500 nucleotides; but which retain the ability to drive expression, e.g., at a detectable level or at a level at least equal to that of the corresponding non-variant promoter.
The following references regarding the BLAST algorithm are herein incorporated by reference: BLAST ALGORITHMS: Altschul, S. F., et al., J. Mol. Biol. (1990) 215:403-410; Gish, W., et al., Nature Genet. (1993) 3:266-272; Madden, T. L., et al., Meth. Enzymol. (1996) 266:131-141; Altschul, S. F., et al., Nucleic Acids Res. (1997) 25:3389-3402; Zhang, J., et al., Genome Res. (1997) 7:649-656; Wootton, J. C., et al., Comput. Chem. (1993) 17:149-163; Hancock, J. M., et al., Comput. Appl. Biosci. (1994) 10:67-70; ALIGNMENT SCORING SYSTEMS: Dayhoff, M. O., et al., “A model of evolutionary change in proteins.” in Atlas of Protein Sequence and Structure, (1978) vol. 5, suppl. 3. M. O. Dayhoff (ed.), pp. 345-352, Natl. Biomed. Res. Found., Washington, D.C.; Schwartz, R. M., et al., “Matrices for detecting distant relationships.” in Atlas of Protein Sequence and Structure, (1978) vol. 5, suppl. 3.” M. O. Dayhoff (ed.), pp. 353-358, Natl. Biomed. Res. Found., Washington, D.C.; Altschul, S. F., J. Mol. Biol. (1991) 219:555-565; States, D. J., at al., Methods (1991) 3:66-70; Henikoff, S., et al., Proc. Natl. Acad. Sci. USA (1992)89:10915-10919; Altschul, S. F., at al., J. Mol. Evol. (1993) 36:290-300; ALIGNMENT STATISTICS: Karlin, S., at al., Proc. Natl. Acad. Sci. USA (1990) 87:2264-2268; Karlin, S., et al., Proc. Natl. Acad. Sci. USA (1993) 90:5873-5877; Dembo, A., et al., Ann. Prob. (1994) 22:2022-2039; and Altschul, S. F. “Evaluating the statistical significance of multiple distinct local alignments.” in Theoretical and Computational Methods in Genome Research (S. Suhai, ed.), (1997) pp. 1-14, Plenum, New York.
The present invention encompasses any isolated host cell (e.g., fungal, such as Pichia pastoris, bacterial, mammalian) including a promoter of the present invention, e.g., operably linked to a polynucleotide encoding a heterologous polypeptide (e.g., a reporter or immunoglobulin heavy and/or light chain) as well as methods of use thereof, e.g., methods for expressing the heterologous polypeptide in the host cell. Host cells of the present invention, comprising a promoter of the present invention, may be genetically engineered so as to express particular glycosylation patterns on polypeptides that are expressed in such cells. Host cells of the present invention are discussed in detail herein. Any host cell comprising a promoter of the present invention disclosed herein forms part of the present invention.
A “host cell” that may be used in a composition or method of the present invention, as is discussed herein, includes cells comprising a promoter of the present invention in which such a promoter can cause expression of a polynucleotide encoding a heterologous polypeptide to which it is operably linked. Higher eukaryote cells which are host cells include mammalian (e.g., Chinese hamster ovary (CHO) cells), insect, and plant cells. In an embodiment of the invention, the host cell is a lower eukaryote such as a yeast or filamentous fungi cell, which, for example, is selected from the group consisting of any Pichia cell, Pichia pastoris, Pichia flnlandica, Pichia trehalophila, Pichia koclamae, Pichia membranaefaciens, Pichia minuta (Ogataea minuta, Pichia lindneri), Pichia opuntiae, Pichia thermotolerans, Pichia salictaria, Pichia guercuum, Pichia pijperi, Pichia stiptis, Pichia methanolica, Pichia, Saccharomyces cerevisiae, Saccharomyces, Hansenula polymorpha, Kluyveromyces, Kluyveromyces lactis, Candida albicans, Aspergillus nidulans, Aspergillus niger, Aspergillus oryzae, Trichoderma reesei, Chrysosporium lucknowense, Fusarium, Fusańum gramineum, Fusarium venenatum and Neuraspora crassa.
As used herein, the terms “N-glycan” and “glycoform” are used interchangeably and refer to an N-linked oligosaccharide, e.g., one that is attached by an asparagine-N-acetylglucosamine linkage to an asparagine residue of a polypeptide. N-linked glycoproteins contain an N-acetylglucosamine residue linked to the amide nitrogen of an asparagine residue in the protein. Predominant sugars found on glycoproteins are glucose, galactose, mannose, fucose, N-acetylgalactosamine (GalNAc), N-acetylglucosamine (GlcNAc) and sialic acid (e.g., N-acetyl-neuraminic acid (NANA)).
N-glycans have a common pentasaccharide core of Man3GlcNAc2 (“Man” refers to mannose; “Glc” refers to glucose; and “NAc” refers to N-acetyl; GlcNAc refers to N-acetylglucosamine). N-glycans differ with respect to the number of branches (antennae) comprising peripheral sugars (e.g., GlcNAc, galactose, fucose and sialic acid) that are added to the Man3GlcNAc2 (“Man3”) core structure which is also referred to as the “trimannose core”, the “pentasaccharide core” or the “paucimannose core”. N-glycans are classified according to their branched constituents (e.g., high mannose, complex or hybrid). A “high mannose” type N-glycan has five or more mannose residues. A “complex” type N-glycan typically has at least one GlcNAc attached to the 1,3 mannose arm and at least one GlcNAc attached to the 1,6 mannose arm of a “trimannose” core. Complex N-glycans may also have galactose (“Gal”) or N-acetylgalactosamine (“GalNAc”) residues that are optionally modified with sialic acid or derivatives (e.g., “NANA” or “NeuAc”, where “Neu” refers to neuraminic acid and “Ac” refers to acetyl). Complex N-glycans may also have intrachain substitutions comprising “bisecting” GlcNAc and core fucose (“Fuc”). Complex N-glycans may also have multiple antennae on the “trimannose core,” often referred to as “multiple antennary glycans.” A “hybrid” N-glycan has at least one GlcNAc on the terminal of the 1,3 mannose arm of the trimannose core and zero or more mannoses on the 1,6 mannose arm of the trimannose core. The various N-glycans are also referred to as “glycoforms.” “FNGase”, or “glycanase” or “glucosidase” refer to peptide N-glycosidase F (EC 3.2.2.18).
In an embodiment of the invention, O-glycosylation of glycoproteins in a host cell is controlled. The scope of the present invention includes isolated host cells (e.g., fungal cells such as Pichia pastoris) comprising a promoter of the present invention (e.g., operably linked to a heterologous polynucleotide encoding a heterologous polypeptide) wherein O-glycosylation is controlled (as discussed herein) and methods of use thereof. For example, host cells are part of the present invention wherein O-glycan occupancy and mannose chain length are reduced. In lower eukaryote host cells such as yeast, O-glycosylation can be controlled by deleting the genes encoding one or more protein O-mannosyltransferases (Dol-PMan: Protein (Ser/Thr) Mannosyl Transferase genes) (PMTs) or by growing the host in a medium containing one or more Pmtp inhibitors. Thus, the present invention includes isolated host cells comprising a promoter of the present invention (e.g., operably linked to a heterologous polynucleotide encoding a heterologous polypeptide) e.g., comprising a deletion of one or more of the genes encoding PMTs, and/or, e.g., wherein the host cell can be cultivated in a medium that includes one or more Pmtp inhibitors. Pmtp inhibitors include but are not limited to a benzylidene thiazolidinedione. Examples of benzylidene thiazolidinediones are 5-[[3,4bis(phenylmethoxy)phenyl]methylene]-4-oxo-2-thioxo-3-thiazolidineacetic Acid; 5-[[(3-(1-25 Phenylethoxy)-4-(2-phenylethoxy)]phenyl]methylene]-4-oxo-2-thioxo-3-thiazolidineacetic Acid; and 5-[[3-(1-Phenyl-2-hydroxy)ethoxy)-4-(2-phenylethoxy)]phenyl]methylene]-4-oxo-2-thioxo3-thiazolidineacetic acid.
In an embodiment of the invention, a host cell (e.g., a fungal cell such as Pichia pastoris) includes a nucleic acid that encodes an alpha-1,2-mannosidase that has a signal peptide that directs it for secretion. For example, in an embodiment of the invention, the host cell is engineered to express an exogenous alpha-1,2-mannosidase enzyme having an optimal pH between 5.1 and 8.0, preferably between 5.9 and 7.5. In an embodiment of the invention, the exogenous enzyme is targeted to the endoplasmic reticulum or Golgi apparatus of the host cell, where it trims N-glycans such as Man8GlcNAc2 to yield Man8GlcNAc2. See U.S. Pat. No. 7,029,872.
Host cells (e.g., a fungal cell such as Pichia pastoris) comprising a promoter of the present invention (e.g., operably linked to a heterologous polynucleotide encoding a heterologous polypeptide) are, in an embodiment of the invention, genetically engineered to eliminate glycoproteins having alpha-mannosidase-resistant N-glycans by deleting or disrupting one or more of the beta-mannosyltransferase genes (e.g., BMT1, BMT2, BMT3, and BMT4)(See, U.S. Published Patent Application No. 2006/0211085) or abrogating translation of RNAs encoding one or more of the beta-mannosyltransferases using interfering RNA, antisense RNA, or the like. The scope of the present invention includes such an isolated fungal host cell (e.g., Pichia pastoris) comprising a promoter of the present invention (e.g., operably linked to a heterologous polynucleotide encoding a heterologous polypeptide).
Host cells (e.g., a fungal cell such as Pichia pastoris) comprising a promoter of the present invention (e.g., operably linked to a heterologous polynucleotide encoding a heterologous polypeptide) also include those that are genetically engineered to eliminate glycoproteins having phosphomannose residues, e.g., by deleting or disrupting one or both of the phosphomannosyl transferase genes PNO1 and MNN4B (See for example, U.S. Pat. Nos. 7,198,921 and 7,259,007), which can include deleting or disrupting the MNN4A gene or abrogating translation of RNAs encoding one or more of the phosphomannosyltransferases using interfering RNA, antisense RNA, or the like. In an embodiment of the invention, a “eukaryotic host cell” has been genetically modified to produce glycoproteins that have predominantly an N-glycan selected from the group consisting of complex N-glycans, hybrid N-glycans, and high mannose N-glycans wherein complex N-glycans are, in an embodiment of the invention, selected from the group consisting of Man3GlcNAc2, GlcNAC(1-4)Man3GlcNAc2, NANA(1-4)GlcNAc(1-4)Man3GlcNAc2, and NANA(1-4)Gal(1-4)Man3GlcNAc2; hybrid N-glycans are, in an embodiment of the invention, selected from the group consisting of Man9GlcNAc2, GlcNAcMan5GlcNAc2, GalGlcNAcMan5GlcNAc2, and NANAGalGlcNAcMan5GlcNAc2; and high mannose N-glycans are, in an embodiment of the invention, selected from the group consisting of Man6GlcNAc2, Man7GlcNAc2, Man9GlcNAc2, and Man9GlcNAc2. The scope of the present invention includes such an isolated fungal host cell (e.g., Pichia pastoris) comprising a promoter of the present invention (e.g., operably linked to a heterologous polynucleotide encoding a heterologous polypeptide).
As used herein, the term “essentially free of” as it relates to lack of a particular sugar residue, such as fucose, or galactose or the like, on a glycoprotein, is used to indicate that the glycoprotein composition is substantially devoid of N-glycans which contain such residues. Expressed in terms of purity, essentially free means that the amount of N-glycan structures containing such sugar residues does not exceed 10%, and preferably is below 5%, more preferably below 1%, most preferably below 0.5%, wherein the percentages are by weight or by mole percent.
As used herein, a glycoprotein composition “lacks” or “is lacking” a particular sugar residue, such as fucose or galactose, when no detectable amount of such sugar residue is present on the N-glycan structures. For example, in an embodiment of the present invention, glycoprotein compositions are expressed using a promoter of the present invention (e.g., operably linked to a heterologous polynucleotide encoding a heterologous polypeptide), as discussed herein, and will “lack fucose,” because the cells do not have the enzymes needed to produce fucosylated N-glycan structures. Thus, the term “essentially free of fucose” encompasses the term “lacking fucose.” However, a composition may be “essentially free of fucose” even if the composition at one time contained fucosylated N-glycan structures or contains limited, but detectable amounts of fucosylated N-glycan structures as described above.
The present invention encompasses any isolated polynucleotide comprising any of the promoters set forth herein and functional variants thereof, (e.g., operably linked to a heterologous polynucleotide encoding a heterologous polypeptide and/or a terminator from the same or from a different gene). Vectors comprising such polynucleotides as well host cells comprising such vectors and expression methods using such vectors and/or host cells fall within the scope of the present invention.
For example, a promoter of the present invention includes any of the following promoters:
Pichia pastoris Pp02g05010 (PpPIR1) promoter;
Pichia pastoris Pp05g08520 (ScCCW12) promoter;
Pichia pastoris Pp01g10900 (ScCHT2) promoter;
Pichia pastoris Pp05g07900 (ScAAC2/PET9) promoter;
Pichia pastoris Pp02g01530 (ScPST1) promoter;
Pichia pastoris Pp05g00700 (unknown) promoter;
Pichia pastoris Pp02g04110 (ScPOR1) promoter;
Pichia pastoris Pp01g03600 (ScBGL2) promoter;
Pichia pastoris Pp01g14410 (ScACO1) promoter;
Pichia pastoris Pp01g09650 (ScYHR021C) promoter;
Pichia pastoris Pp01g02780 (ScYLR388W) promoter;
Pichia pastoris Pp03g09940 (ScPIL1) promoter;
Pichia pastoris Pp02g10710 (ScMDH1) promoter;
Pichia pastoris Pp01g09290 (ScFBA1) promoter;
Pichia pastoris Pp03g03520 (PpDAS2) promoter;
Pichia pastoris Pp03g08760 (ScCWP1) promoter;
Pichia pastoris Pp03g00990 (ScYGR201c) promoter;
Pichia pastoris Pp02g05270 (AN2948.2) promoter;
Pichia pastoris Pp02g12310 (ScDUR3) promoter;
Pichia pastoris Pp03g05430 (ScTHI4) promoter;
Pichia pastoris Pp03g03490 (AN2957.2) promoter;
Pichia pastoris Pp05g09410 (ScTHI13) promoter;
Pichia pastoris Pp02g07970 (ScPEX11/PMP27) promoter;
Pichia pastoris Pp01g12200 (AN7917.2) promoter;
Pichia pastoris Pp03g11380 (ScPMP47) promoter;
Pichia pastoris Pp03g08340 (unknown) promoter;
Pichia pastoris Pp05g04390 (ScTIR3) promoter;
Pichia pastoris Pp01g08380 (ScYIL057c) promoter;
Pichia pastoris Pp01g05090 (ScSAY1) promoter;
Pichia pastoris Pp01g13950 (ScTPN1) promoter;
Pichia pastoris Pp03g11420 (ScARO10) promoter;
Pichia pastoris Pp02g11560 (ScMET6) promoter;
Pichia pastoris Pp01g08650 (ScYNL067W) promoter;
Pichia pastoris Pp01g01850 (PpPDHbeta1) promoter;
Pichia pastoris Pp03g03020 (ScSAM2) promoter;
Pichia pastoris Pp03g02860 (PpSAHH) promoter; or
Pichia pastoris GAPDH promoter (e.g., operably linked to a terminator, such as the CYC1 terminator; e.g., wherein any sequence operably linked to the promoter is also operably linked to a downstream CYC1 terminator);
or a functional variant of such a promoter; optionally, operably linked to a heterologous polynucleotide, e.g., a reporter or immunoglobulin heavy and/or light chain. For example, specific, non-limiting examples of promoters of the present invention comprising a nucleotide sequence set forth below:
Pp is Pichia pastoris
Sc is Saccharomyces cerevisiae
An is Aspergillus niger
[Unless otherwise described, the polylinker is in bold, and the promoter precedes the polylinker and the terminator follows the polylinker. In some sequences, upstream and/or downstream restriction cloning sites are also bolded at the 5′ and/or 3′ ends of the displayed sequences. The scope of the present invention encompasses embodiments wherein the bolded cloning sites are absent completely or are other than that specifically disclosed herein; as well as wherein sequences not having bolded cloning sites do encompass a cloning site of any sort].
As mentioned, the scope of the present invention encompasses compositions and methods comprising the following whole cassettes or the promoters in said cassettes.
AGATCTTTTTTGTAGAAATGTCTTGGTGTCCTCGTCCAATCAGGTAGCCATCTCTGAAATATCTGGC
CGCGGCGCGCCTTAATTAAATGTTGGGTTTATGGTCATTTCATGATATTGGCTGTTCTTGTGTAAAA
CGCGGCGCGCCTTAATTAAATTGCTTATTATTAACACTGTATTCTATTGTTTCTCATTGTAGCCACA
CGCGGCGCGCCTTAATTAAGTTTGATCACTGAATTACCTCACACGGTTGTATTTTAGCAAAATTTCT
CGCGGCGCGCCTTAATTAAGCCAATTAGTTTGAAGTGAGATTTTATTTCATTCCTGTTAATATTATA
CGCGGCGCGCCTTAATTAAACATAGAGAAATAAAAAGAAGAAGCAGAAAAGTTGAATGGAATATTTG
CCGCGGCGCGCCTTAATTAAGCTGTTTGTCCTTATACATCATTCCAAGGTTAGAAAAGGCCGGAAAA
CGCGGCGCGCCTTAATTAATAAGCGTTCTAGGTAGCAAGTTTTTTAAAGATGAAAAATTAGTAATAT
CGCGGCGCGCCTTAATTAATGAGCTCTATCAAGTCATTTTATTATTACCCTCAAATAGGTCATATAG
CGCGGCGCGCCTTAATTAATAAGTTTTAGCCACCTACAAATTCCAATAATCGGCTGTTTGTTCTGAT
CGCGGCGCGCCTTAATTAATAAAAGAATTATTGAAGATGGTTGAAAGAAGTGGAACGATCAAGAGGA
CGCGGCGCGCCTTAATTAATAAACAAATTTTAGGTCCTTTTCAAAAATTCACATGTATAATTTAATC
CGCGGCGCGCCTTAATTAATAAGATGTCCCATTTAGTATTAGCATTGAACGATGTTGATGTTGTCTG
CGCGGCGCGCCTTAATTAATAAGTTTGATTATATGTACTTAGATTATTTTTCAATGAAATGAATGAG
CGCGGCGCGCCTTAATTAAATTGCTTGAAGCTTTAATTTATTTTATTAACATAATAATAATACAAGC
CGCGGCGCGCCTTAATTAAATCGATTTGTATGTGAAATAGCTGAAATTCGAAAATTTCATTATGGCT
CGCGGCGCGCCTTAATTAAGCTTCACGATTTGTGTTCCAGTTTATCCCCCCTTTATATACCGTTAAC
The present invention encompasses methods for making a polypeptide (e.g., an immunoglobulin chain or an antibody or antigen-binding fragment thereof) comprising introducing, into an isolated fungal host cell (e.g., Pichia, e.g., Pichia pastoris) or an in vitro expression system, an isolated hybrid polynucleotide comprising a promoter of the present invention, e.g., selected from the group consisting of: Pichia pastoris GAPDH promoter (e.g., wherein any sequence operably linked to the promoter is also operably linked to a downstream CYC1 terminator); Pichia pastoris Pp02g05010 (PpPIR1) promoter; Pichia pastoris Pp05g08520 (ScCCW12) promoter; Pichia pastoris Pp01g10900 (ScCHT2) promoter; Pichia pastoris Pp05g07900 (ScAAC2/PET9) promoter; Pichia pastoris Pp02g01530 (ScPST1) promoter; Pichia pastoris Pp05g00700 (unknown) promoter; Pichia pastoris Pp02g04110 (ScPOR1) promoter; Pichia pastoris Pp01g03600 (ScBGL2) promoter; Pichia pastoris Pp01g14410 (ScACO1) promoter; Pichia pastoris Pp01g09650 (ScYHR021C) promoter; Pichia pastoris Pp01g02780 (ScYLR388W) promoter; Pichia pastoris Pp03g09940 (ScPIL1) promoter; Pichia pastoris Pp02g10710 (ScMDH1) promoter; Pichia pastoris 01g09290 (ScFBA1) promoter; Pichia pastoris Pp03g03520 (PpDAS2) promoter; Pichia pastoris Pp03g08760 (ScCWP1) promoter; Pichia pastoris Pp03g00990 (ScYGR201c) promoter; Pichia pastoris Pp02g05270 (AN2948.2) promoter; Pichia pastoris Pp02g12310 (ScDUR3) promoter; Pichia pastoris Pp03g05430 (ScTHI4) promoter; Pichia pastoris Pp03g03490 (AN2957.2) promoter; Pichia pastoris Pp05g09410 (ScTHI13) promoter; Pichia pastoris Pp02g07970 (ScPEX11/PMP27) promoter; Pichia pastoris Pp01g12200 (AN7917.2) promoter; Pichia pastoris Pp03g11380 (ScPMP47) promoter; Pichia pastoris Pp03g08340 (unknown) promoter; Pichia pastoris Pp05g04390 (ScTIR3) promoter; Pichia pastoris Pp01g08380 (ScYIL057c) promoter; Pichia pastoris Pp01g05090 (ScSAY1) promoter; Pichia pastoris Pp01g13950 (ScTPN1) promoter; Pichia pastoris Pp03g11420 (ScARO10) promoter; Pichia pastoris Pp02g11560 (ScMET6) promoter; Pichia pastoris Pp01g08650 (ScYNL067W) promoter; Pichia pastoris Pp01g01850 (PpPDHbeta1) promoter; Pichia pastoris Pp03g03020 (ScSAM2) promoter; and Pichia pastoris Pp03g02860 (PpSAHH) promoter; or a functional variant thereof, operably linked to a heterologous polynucleotide encoding a heterologous polypeptide and culturing the host cell (e.g., in a liquid culture medium, e.g., YPD medium (e.g., comprising 1% yeast extract, 2% peptone, 2% glucose)), optionally in the presence of methanol, under conditions whereby the polynucleotide encoding the polypeptide is expressed, thereby producing the polypeptide. Expression of the polynucleotide may be induced when the promoter of the present invention is methanol-inducible and the host cells are grown in the presence of methanol.
An expression system, comprising the fungal host cell comprising the promoter of the present invention operably linked to the heterologous polynucleotide, e.g., in an ectopic vector or integrated into the genomic DNA of the host cell, forms part of the present invention. A composition comprising the fungal host cell which includes the promoter of the present invention operably linked to the heterologous polynucleotide in liquid culture medium also forms part of the present invention.
In one embodiment of the invention, a method for expressing a heterologous polypeptide, e.g., as discussed herein, does not comprising starving the fungal host cells of a nutrient such as a carbon source such as glycerol or glucose. Other embodiments include methods wherein the cells are starved. For example, the present invention comprises methods for expressing a polypeptide in a fungal glycosylation mutant strain, e.g., as discussed herein, wherein the host cell comprises a promoter of the present invention (e.g., methanol-inducible) operably linked to a heterologous polynucleotide encoding the polypeptide wherein the host cell is not starved and is cultured in the presence of methanol.
Method for expressing any polypeptide using a promoter of the present invention can be done at any volume including, for example, low volumes and high, industrial volumes. For example, expression is performed, in an embodiment of the invention, in 5 liter or 40 liter volumes. Genes operably linked to CHT2 or PIR1 promoters have been done in 40 liter volumes; and genes operably linked to the DAS promoters have been done in 5 liter volumes.
In an embodiment of the invention, the polynucleotide that is operably linked to the promoter of the present invention is in a vector that comprises a selectable marker. In an embodiment of the invention, the fungal host cells, e.g., Pichia cells, are grown in a liquid culture medium and cells including the vector with the selectable marker are selected for growth; e.g., wherein the selectable marker is a drug resistance gene, such as the zeocin resistance gene, and the cells are grown in the presence of the drug, such as zeocin.
The present invention also encompasses methods for growing cells wherein expression of a polynucleotide is inhibited. For example, such a method comprises, in an embodiment of the invention, introducing, into an isolated host cell (e.g., a fungal cell such as Pichia pastoris) a polynucleotide encoding said polypeptide that is operably linked to a methanol-repressible promoter of the present invention (e.g., SEQ ID NO: 70-75) and culturing the host cell (e.g., in a liquid culture medium, e.g., YPD medium (e.g., comprising 1% yeast extract, 2% peptone, 2% glucose)), in the presence of methanol at a sufficient concentration to inhibit expression, at least partially.
In an embodiment of the invention, polypeptide expression using a methanol-inducible promoter of the present invention includes three phases, the glycerol batch phase, the glycerol fed-batch phase and the methanol fed-batch phase. First, in the glycerol batch phase (GBP), host cells are initially grown on glycerol in a batch mode. In the second phase, the glycerol fed-batch phase (GFP), a limited glycerol feed is initiated following exhaustion of the glycerol in the previous phase, and cell mass is increased to a desired level prior to methanol-induction. Furthermore, the methanol-inducible promoters are de-repressed during this phase due to the absence of excess glycerol. The third phase is the methanol fed-batch phase (MFP), in which methanol is fed at a limited feed rate or maintained at some level to induce the methanol-inducible promoters for protein expression. A limited glycerol feed can be simultaneously performed for promoting production when necessary.
Accordingly, the present invention encompasses methods for making a heterologous polypeptide (e.g., an immunoglobulin chain or an antibody or antigen-binding fragment thereof) comprising introducing, into an isolated host cell (e.g., Pichia, such as Pichia pastoris) a heterologous polynucleotide encoding said polypeptide that is operably linked to a methanol-inducible promoter of the present invention (e.g., SEQ ID NO: 47-63) and culturing the host cells,
(i) in a batch phase (e.g., a glycerol batch phase) wherein the cells are grown with a non-fermentable carbon source, such as glycerol, e.g., until the non-fermentable carbon source is exhausted;
(ii) in a batch-fed phase (e.g., a glycerol batch-fed phase) wherein additional non-fermentable carbon source (e.g., glycerol) is fed, e.g., at a growth limiting rate; and
(iii) in a methanol fed-batch phase wherein the cells are grown in the presence of methanol and, optionally, additional glycerol.
In an embodiment of the invention, prior to the batch phase, an initial seed culture is grown to a high density (e.g., OD600 of about 2 or higher) and the cells grown in the seed culture are used to inoculate the initial batch phase culture medium.
In an embodiment of the invention, after the batch-fed phase and before the methanol fed-batch phase, the host cells are grown in a transitional phase wherein cells are grown in the presence of about 2 ml methanol per liter of culture. For example, the cells can be grown in the transitional phase until the methanol concentration reaches about zero.
In an embodiment of the invention, the host cells (e.g., Pichia cells such as Pichia pastoris) are grown under any 1, 2, 3, 4, 5 or 6 of the following conditions:
The present invention provides methods for making polypeptides, such as immunoglobulin chains, antibodies or antigen-binding fragments thereof having modified glycosylation patterns, for example, by expressing a polypeptide in a host cell that introduces a given glycosylation pattern and/or by growing the host cell under conditions wherein the glycosylation is introduced. Some of such host cells are discussed herein. For example, the invention provides methods for making a heterologous protein that is a glycoprotein comprising an N-glycan structure that comprises a Man5GlcNAc2 glycoform; comprising introducing a polynucleotide encoding the polypeptide wherein the polynucleotide is operably linked to a promoter of the present invention into a host cell and culturing the host cell under conditions wherein the polypeptide is expressed with the Man5GlcNAc2 glycoform and/or lacking fucose.
The present invention is intended to exemplify the present invention and not to be a limitation thereof. The methods and compositions disclosed below (including, without limitation, any promoter, terminator, promoter/terminator combination or expression construct, e.g., promoter-gene-terminator) fall within the scope of the present invention.
The complete wild P. pastoris strain NRRL-y11430 genome sequence was determined yielding 9,411,042 bases on 4 large contigs and one smaller contig of 34,728 bp (nucleotide base pairs) that could not be resolved, consistent with the previously published finding that the P. pastoris genome consists of 4 chromosomes. The genome sequence was then annotated using the automated genefinder software FGNESH (Salamov and Solovyev, Genome Res., 2000, 10: 516-522). A total of 5069 protein coding ORFs and 278 non-coding transcripts, were identified. Identified genes were named systematically using the convention Pp (for P. pastoris), the contig number, the letters g (gene) or e (element), and a systematic number. For example, the first gene on Contig 1 is Pp01g00010. Each identified gene was compared to 8 databases using BlastP (Altschul, et al., J. Mol. Biol., 1990, 215: 403-410). The databases were: Aspergillus niger proteins (Pel at al., Nat. Biotechnol., 2007, 25: 221-231), Saccharomyces cerevisiae strain S288C proteins (www.yeastgenome.org), Schizosaccharomyces pombe proteins, Candida albicans proteins, Candida glabrata proteins, Homo sapiens proteins, Pichia stipitis proteins, and the complete UniProtkB protein database (www.uniprot.org). A gene microarray was designed on the Agilent platform in 8×15 format using Agilent earray software using these genes as well as an additional 77 genes that were identified from Genbank as being involved in Glycosylation processes. The 77 non-P. pastoris genes are derived from various species from fungi to human and code for proteins that include glycan transferases, sugar-nucleotide transporters, and enzymes involved in sugar metabolism. Probes were designed for all 5424 genes for 3′ biased hybridization protocol to a density of 2-3 probes per gene (4207 genes with 3 probes/transcript and 1217 genes with 2 probes/transcript). This custom-designed Agilent P. pastoris 15 k 3.0 array (8×15K) gene microarray was used for all whole genome gene-chip RNA expression analyses.
P. pastoris wild type strain NRRL-Y11430 and two N-glycan modified or glycoengineered strains, YGLY8316 and YGLY8323, were chosen for comparative analysis of gene expression. Both N-glycan modified strains have been specifically engineered to produce the galactose terminated human N-glycan intermediate as has been previously reported (Hamilton, Science, 2006; Davidson U.S. Pat. No. 7,795,002). The three strains were each cultivated in quadruplicate in 0.5 L Bioreactors (Sixfors multifermentation system; ATR Biotech, Laurel, Md.) using a standard glycerol-to-methanol fed-batch protocol as described in Barnard at al., 2010 (J. Ind. Microbiol. Biotechnol. 37:961-971). Samples were taken from each bioreactor at the following timepoints:
1) during the middle of glycerol batch at 50 mg/ml of wet cell weight (batch),
2) during the starvation period after glycerol exhaustion (End of Batch) as measured by an increase in dissolved oxygen (DO),
3) 4+/−1 hours into methanol-induction, and
4) 24+/−1 hours into methanol-induction (
At each timepoint, wet cell weight was measured to determine the amount of cells to harvest and then 1×107 (+/−2×) cells were harvested into 2 ml screwcap microcentrifuge tubes, centrifuged briefly at 5000×g, supernatant was discarded, and the cell pellets were flash frozen using dry ice ethanol. The cell pellets were then used for RNA extraction and microarray hybridization (discussed below). This study is referred to herein as “the wild type/glycoengineered strain comparison study.”
A P. pastoris glycoengineered strain, YGLY8316, and four highly related glycoengineered strains expressing the monoclonal antibodies MK-HER2 strain A (YGLY12501), MK-HER2 Strain B (YGLY13992), MK-RSV (YGLY14401), and MK-VEGF (YGLY10360) were cultivated in triplicate in Sartorius Q12 1 L bioreactors (Sartorius, Goettingen, Germany) using a standard fed-batch fermentation protocol as described in Barnard at al., 2010 (J. Ind. Microbiol. Biotechnol. 37:961-971). Samples were taken from each bioreactor at the following timepoints: 1) during the middle of glycerol batch at 50 mg/ml of wet cell weight (batch), 2) during the middle of glycerol fed-batch (4+/−1 hours into fed-batch), 3) 4+/−1 hours into methanol induction, 4) 24+/−1 hours into methanol induction, 5) 48+/−1 hours into methanol induction, 6) 72+/−1 hours into methanol induction, 7) 96+/−1 hours into methanol induction (
Following sample collection, samples were processed. Briefly, total RNA was extracted and scrutinized for quality and yield; mRNA was amplified using Ambion MessageAmp II reagents and protocols and then hybridized to a custom-designed Agilent Pichia pastoris 15 k 3.0 array (8×15K) based upon an internal Pichia pastoris genome sequence for strain NRRL Y-11430; subsequent scanning was performed using Agilent Microarray scanners (version B), and output raw image files in .tif format were processed by Agilent Feature Extractor (FE) software. Microarray quality control data were generated from the FE output data and were reviewed for data quality.
Standard Resolver pipelines for the Agilent Single Color Error Model (Weng et al., Bioinformatics 22, 2006, 1111-1121) were used for data summarization and calling using the following parameters: FRACTION=0.12, POISSON=3, and RANDOM=0.05. Briefly, the data was median normalized, and then a background gradient was calculated and subtracted from the normalized data. Next, intensity and ratio error models were constructed which combined replicate measurements and modeled associated error. These models determined whether a particular gene exhibited differential expression for the ratio comparison specified, although such differential expression calls were typically made via ANOVA and t-test statistical tests that were also performed. In addition to these statistical tests, clustering, PCA, and other operations were also performed upon the data using Resolver software, typically utilizing data ratioed to the pool of all other samples within a specific study unless otherwise indicated. In order to determine promoters with desired characteristics (e.g., little gene expression upon glycerol growth but up-regulation upon methanol growth), the Trend tool was utilized to match the 100 closest matching gene expression profiles by distance as described in the Resolver User's Manual and online help sections (Rosetta Resolver User Guide, 2002, Kirkland, Wash.).
To identify methanol-inducible promoters, gene expression data intensity profiles from the wild type/glycoengineered strains study were analyzed by first ratioing strain-specific, individual sample data to the Batch (50 mg/ml of wet cell weight; glycerol) timepoint. Three individual ANOVA analyses were then performed using 3 factors (Batch, 4 hour MeOH, and 24 hour MeOH), one for each of the strains with individual replicates with a cutoff of P<=0.005. These genes were then clustered by K-means with 6 clusters using a 2 fold change cutoff in at least 4 samples, resulting in a total of 2,882 sequences (
Samples were organized as combined replicates and again referenced strain-specific to the Batch (50 mg/ml of wet cell weight; glycerol) timepoint. Each replicate combined sample for the wild type (y11430) and the glycoengineered strains (YGLY8316 and YGLY8323) was then analyzed individually as an intensity plot comparing the glycerol (Batch) with methanol (24 hrs MeOH) timepoints (
Pp01g09290 (ScFBA1 (one of two identified in P. pastoris, the other FBA1 homolog is not induced by methanol), SEQ ID NO: 30),
Pp03g03520 (DAS2, a second homolog of PpDAS1, SEQ ID NO: 31),
Pp03g08760 (ScCWP1, SEQ ID NO: 32),
Pp03g00990 (Homologous to ScYGR201c, SEQ ID NO: 33),
Pp02g05270 (Homologous to Aspergillus niger AN2948.2, SEQ ID NO: 34),
Pp02g12310 (ScDUR3, SEQ ID NO: 35),
Pp03g05430 (ScTHI4, SEQ ID NO: 36),
Pp03g03490 (homologous to A. niger AN2957.2, SEQ ID NO: 37),
Pp05g09410 (THI13, SEQ ID NO: 38),
Pp02g07970 (ScPEX11, SEQ ID NO: 39),
Pp01g12200 (Homologous to A. niger AN7917.2, SEQ ID NO: 40),
Pp03g11380 (ScPMP47, SEQ ID NO: 41),
Pp03g08340 (unknown, SEQ ID NO: 42),
Pp05g04390 (ScTIR3, SEQ ID NO: 43),
Pp01g08380 (ScYIL057C, SEQ ID NO: 44),
Pp03g11380 (ScPMP47, SEQ ID NO: 45), and
Pp01g13950 (ScTPN1, SEQ ID NO: 46).
The intensity data for these genes was plotted in comparison to AOX1 (Pp05g01320) and GAPDH (Pp02g08660) as controls (
The extracted promoters of these 17 genes are contained herein as SEQ ID NOs: 47 through 63, respectively. The promoters and transcriptional terminators for several exemplary genes of this group were then in vitro synthesized (GeneArt, AG, Regensberg, Germany) as the 5′-proximal 1000 bp of genomic sequence to the ATG of each respective gene and the 500 bp of genomic sequence 3′ proximal to the stop codon of each respective gene. The promoters/terminators for these genes, Pp03g08760 (ScCWP1), Pp03g03520 (DAS2), Pp01g09290 (ScFBA1) and Pp03g00990 (ScYGR201C), as well as Pp03g03500 (DAS1) as a control, were subcloned into the AOX1 containing P. pastoris integration vector pGLY580 at the BglII/RsrII sites to generate plasmids pGLY8529-8533, respectively (These plasmids as well as pGLY580 are depicted in
Gene expression data intensity profiles from the wild type/glycoengineered strains study with data ratioed to the Batch timepoint were analyzed to identify constitutive promoters. In particular, those genes were identified which maintain high intensity in the 4 hour of methanol induction vs. Batch and 24 hour of methanol induction vs. Batch samples. The intersection of the highest intensity genes was analyzed by individually comparing intensity profiles at each timepoint as plotted in dotplots for the glycerol (batch) versus methanol (24 hours MeOH) timepoints (
Pp02g05010 (ScPIR1, SEQ ID NO: 1),
Pp01g10900 (ScCHT2, SEQ ID NO: 2),
Pp05g07900 (ScAAC2/PET9, SEQ ID NO: 3),
Pp05g08520 (ScCCW12, SEQ ID NO: 4),
Pp02g01530 (ScPST1, SEQ ID NO: 5),
Pp05g00700 (unknown, SEQ ID NO: 6),
Pp02g04110 (ScPOR1, SEQ ID NO: 7),
Pp01g03600 (ScBGL2, SEQ ID NO: 8),
Pp01g14410 (ScACO1, SEQ ID NO: 9),
Pp01g09650 (ScYHR021C, SEQ ID NO: 10),
Pp01g02780 (ScYLR388W, SEQ ID NO: 11),
Pp03g09940 (ScPIL1, SEQ ID NO: 12),
Pp02g10710 (ScMDH1, SEQ ID NO: 13), and
Pp03g12300 (unknown).
Surprisingly, despite the fact that the canonically constitutive housekeeping GPD gene shows significant regulation, along with nearly ⅔ of the genome, a number of these genes could be identified as having truly constitutive expression under these diverse carbon source conditions. The intensity data for the identified genes is plotted in comparison to AOX1 and GAPDH (
The gene regulatory regions (promoter/5′ untranslated region or UTR and transcriptional terminator/3′ UTR) for each of these genes was further identified by extracting the 1000 bp upstream of the start (ATG) codon and 500 bp downstream of the stop codon. These sequences were extracted and paired together as regulatory cassettes flanked around the sequences for recognition by restriction endonucleases NotI (GCGGCCGC), AscI (GGCGCGCC), and PacI (TTAATTAA) indicated in bold in the sequences and these regulatory cassettes are identified as SEQ ID NOs: 14-26. The sequence cassettes were then physically synthesized and cloned (GeneArt, AG, Regensberg, Germany) to be used as expression cassettes. As controls, the Pp TEF (Pp01g00550), PpGPD or GAPDH (Pp02g08660), PpPMA1 (Pp01g12610) gene regulatory elements were similarly generated and these cassette sequences are herein identified as SEQ ID NOs: 27-29, respectively. The cassettes for CCW12, CHT2, PET9, PST1, TEF, GPD, and PMA1 were then subcloned into a plasmid containing the P. pastoris URA5 gene and TRP1 integration sequences using the flanking BglII/RsrII restriction sites to generate the P. pastoris expression plasmids pGLY8620-8627, respectively (
Interestingly, among both the induced and constitutive genes, we found that some genes that differed significantly in their expression from wild type to glycoengineered strains were among the strongest expressed genes by intensity profile. For example CWP1 (Pp03g08760) was strongly induced upon switch to methanol in the glycoengineered strains analyzed in both the wildtype/glycoengineered strain comparison study (YGLY8316 and YGLY8323) as well as in all of the strains in the mAb comparison study (YGLY8316, YGLY13992, YGLY12501, YGLY14401, and YGLY10360) but while methanol-induced, is only modestly expressed under either condition in the wild type strain. Similarly among the constitutive genes, Pp05g08520 (CCW12), Pp02g05010 (PIR1) and Pp05g00700 (unknown) are among the stronger constitutive genes in the engineered strains (YGLY8316, YGLY8323, YGLY13992, YGLY12501, YGLY14401, and YGLY10360), but are only expressed either moderately (PIR1, Pp05g00700) or very weakly (CCW12) in the wild type strains. All of these genes display unexpected high expression levels in the glycoengineered strains and this property allows their promoters to be exploited in the engineered strains as useful regulatory sequences.
To identify methanol repressible promoters, gene expression data intensity profiles from the wild type/glycoengineered strains study were analyzed by first ratioing data to the Batch (50 mg/ml of wet cell weight; glycerol) timepoint. Similar to the inducible gene clusters, the number genes repressed by methanol (Clusters 3 and 6 in
Pp03g11420 (ScARO10; SEQ ID NO: 64),
Pp02g11560 (ScMET6; SEQ ID NO: 65),
Pp01g08650 (ScYNL067W; SEQ ID NO: 66),
Pp01g01850 (PDHbeta1; SEQ ID NO: 67),
Pp03g03020 (ScSAM2; SEQ ID NO: 68),
Pp03g02860 (SAHH; SEQ ID NO: 69).
The intensity data for these genes is plotted in comparison to AOX1 and GAPDH (
The promoters for these genes were extracted as the 5′-proximal 1000 bp of genomic sequence to the ATG of each respective gene. These sequences are contained herein as SEQ ID NOs.: 70-75, respectively.
Selected constitutive promoters were fused to the E. coli lacZ (β-galactosidase) gene by cloning a PCR amplified version of the lacZ gene into the NotI/PacI sites in the expression cassettes for promoters PIR1 (Pp02g05010, pGLY8620), CCW12 (Pp05g08520, pGLY8621), CHT2 (Pp01g10900, pGLY8622), PETS (Pp05g07900, pGLY8623), PST1 (Pp02g01530, pGLY8624), TEF (Pp01g00550, pGLY8625), GPD (Pp02g08660, pGLY8626), PMA1 (Pp02g12610, pGLY8627), to generate plasmids pGLY8640-pGLY8647, respectively.
The lacZ containing expression plasmids pGLY8640-8647 were transformed into P. pastoris GFI5.0 strain (Bobrowicz et al., Glycobiol 2004; Davidson U.S. Pat. No. 7,795,002) YGLY8458 and clones were selected on media lacking uracil. Positive transformants were then cultivated in liquid culture in 96 deep well plates on media with glycerol as the sole carbon source for 72 hours and samples of the cells were harvested by centrifugation. The remainder of the culture was then cultivated for an additional 24 hours on media with methanol as the sole carbon source after which samples of the cells were again harvested. The harvested cell pellets were then subjected to a beta-galactosidase assay as previously described (Guarente Methods Emzymol 1983, 101: 181-191). The results of the assay are shown in
Selected methanol-inducible promoters were fused to the E. coli LacZ (β-galactosidase) gene by cloning a PCR amplified version of the lacZ gene into the NotI/PacI sites in the expression cassettes for promoters Pp03g08760 (ScCWP1, pGLY8529), Pp03g03520 (DAS2, pGLY8530), Pp03g00990 (ScYGR201C, pGLY8532), Pp03g03500 (DAS1, pGLY8533), Pp01g09290 (ScFBA1, pGLY8531), to generate plasmids pGLY8549, pGLY8550, pGLY8552, pGLY8553, and pGLY8551, respectively.
Selected inducible promoters were also fused to the Human Fc gene by cloning a PCR amplified version of the Human Fc gene into the NotI/PacI sites in the expression cassettes for promoters CWP1 (Pp03g08760, pGLY8529), PpDAS2 (Pp03g03520, pGLY8530), FBA1 (Pp01g09290, pGLY8531), YGR201C (Pp03g00990, pGLY8532), as well as PpDAS1 (Pp03g03500, (pGLY8533), as a control to generate plasmids pGLY8539, pGLY8540, pGLY8548, pGLY8545, and pGLY8546, respectively. Also as a control, the AOX1 promoter (Pp05g01320) was inserted as a BglII/NotI fragment from plasmid pGLY4464 along with the hFc NotI/PacI PCR fragment, into pGLY580 digested with BglII/PacI to generate pGLY8547.
The hFc containing expression plasmids pGLY8539, pGLY8540, pGLY8545, pGLY8546, pGLY8547, and pGLY8548 were transformed into P. pastoris GFI5.0 strain (Bobrowicz et al., Glycobiol. 2004, 14(9):757-66; Davidson U.S. Pat. No. 7,795,002) YGLY8458 and clones were selected on media lacking uracil. Positive transformants were identified by PCR for the plasmid integration using standard methods.
Positive transformants were then cultivated in liquid culture in an Applikon “micro24” 24 well 5 ml mini fermenter system in media with glycerol as the sole carbon source for 72 hours and sample supernatants were harvested by centrifugation. The remainder of the culture was then cultivated for an additional 72 hours on media with methanol as the sole carbon source after which sample supernatants of the cells were again harvested. The harvested supernatant was then subjected to a HPLC to determine Fc titer. The results of the assay for the 72 hour methanol samples are shown in
Whole antibodies can be displayed on the surface of P. pastoris cells by anchoring a protein-A/S. cerevisiae SED1 protein fusion and capturing secreted mAb provided that the protein-A anchor and the antibody are not co-expressed simultaneously (Prinz US2010/0009866). In that case, the GUT1 promoter (glucose-repressible) was used to drive the protein A-based anchor and GPD (GAPDH) promoter was used to drive the secreted/anchored monoclonal antibody. Increased expression of the GPD-mAb upon switch from glycerol to glucose, in combination with repression of the GUT1-SED1/ProteinA anchor, resulted in successful anchoring of monoclonal antibody on the cell surface. Here, several exemplary methanol repressible promoters were utilized for cell surface display of the protein A-based anchor to provide disparate expression of the anchor from the secreted AOX1-driven monoclonal antibody as depicted in the cartoon in
Four methanol repressible promoters were chosen for the protein-A/SED1 anchor whole antibody cell surface display. The promoters of Pp03g11420 (Homolog to S. cerevisiae ARO10), Pp02g11560 (Homolog to S. cerevisiae METE), Pp01g08650 (Homolog to S. cerevisiae YNL067W, protein component of the large 60S ribosomal subunit), Pp03g03020 (Homolog to S. cerevisiae SAM2) showed strong transcription in the glycerol phase and strong repression in the methanol phase and were therefore chosen to express the protein-A/SED1 anchor.
The sequences of the four promoters were in vitro synthesized (GeneArt, AG, Regensberg, Germany) and subcloned as BglII-EcoRI fragments into pGLY4136, in front of a gene encoding 5 IgG-binding domains of protein-A anchored to the S. cerevisiae SED1 protein, which anchored the protein-A onto the P. pastoris cell surface. The plasmid pGLY4136 also contained the Arsenite (Ars) resistance gene as a selection marker and the P. pastoris URA6 gene as integration site (
Plasmids pGLY9545-9548 were transformed into the empty glycoengineered GS5.0 strain YGLY17108 that does not have a secreted monoclonal antibody construct, as well as glycoengineered GS5.0 strains YGLY13979 containing a secreted AOX1-driven anti-HER2 monoclonal antibody construct, along with YGLY18281 (AX132) and YGLY18483 (AX189), each expressing a distinct secreted AOX1-driven anti-PCSK9 monoclonal antibody construct. Clones were selected on plates containing 1 mM arsenite.
Transformants of the empty glycoengineered GS5.0 strain containing the protein-A/S. cerevisiae SED1 anchor under the four different repressible promoters were grown in glycerol media and then induced in methanol. Samples were taken in glycerol and after 24, 48 and 72 hours of induction in methanol and labeled with fluorescent rabbit IgG1-Alexa Fluor 488. The rabbit IgG1 bound to the protein-A on the yeast cell surface and can be monitored by FACS analysis (Lin et al, J. Immunol. Methods. 2010, 358(1-2):66-74). In glycerol phase, the protein-A was displayed on the cell surface under all four promoters (PpARO10; PpMET6, PpYNL067W, and PpSAM2) while the parental strain, without the protein A display construct, does not show any labeling (
Transformants from the four antibody expressing glycoengineered GS5.0 strains containing the protein-A-S. cerevisiae SED1 anchor, under the four different methanol-repressible promoters, were grown in glycerol media and then induced in methanol for two days. Samples were taken after 24 and 48 hours of induction in methanol. YGLY13979 transformed anti-HER2 monoclonal antibody expressing strains were labeled with fluorescent Fab anti-Fc DyLight-488 and anti-human Kappa-APC conjugated to detect the light chain and the heavy chain of the displayed antibody. The displayed anti-Her2 antibody was efficiently captured on the cell surface at both timepoints as judged by the observed fluorescence shift of these cell populations, while the YGLY17108 strain without expressing an antibody or the strain with neither antibody nor protein A display do not show a fluorescence shift (
Two transformed anti-PCSK9 expressing strains, YGLY22299 and YGLY22301, were labeled with fluorescent Fab anti-Fc DyLight-488 to detect the antibody heavy chain and with biotinylated PCSK9 antigen and further labeled with streptavidin-Alexa Fluor 635 conjugate to detect the biotinylated PCSK9 antigen.
One unexpected aspect to the constitutive promoter analysis was the high level of expression obtained from the control strong GAP (Pp02g08660) promoter. The lacZ construct used to test this promoter included about 1 kb of the GAP promoter as well as 500 bb of the native GAP transcriptional terminator sequence (SEQ ID NO: 28). Previous reports have focused on fusing only 500 bp of the GAP promoter with either the S. cerevisiae CYC1 transcriptional terminator or the P. pastoris AOX1 transcriptional terminator. Here, the increase in expression levels for the GAP promoter (
To this end, an additional control promoter-terminator combination was generated by fusing the traditional 500 bp of the P. pastoris GAP gene, Pp01g08620, promoter (nucleotides 7-492 of SEQ ID NO: 76) and ˜300 bp 3′ terminator region of the S. cerevisiae CYC1 gene (nucleotides 515-807 of SEQ ID NO: 76) from pGLY580 (
Strains containing the GAP-CYC1 fusion (YGLY23848), the PIR1 promoter/terminator fusion (YGLY23728), the CHT2 promoter/terminator fusion (YGLY23734), the TEF promoter/terminator fusion (YGLY23743), the PMA1 promoter/terminator fusion (YGLY23749), and the newly described GAP promoter/terminator fusion from this work (YGLY23747), were cultivated at 40 liter fermentation scale to confirm constitutive promoter activity during the course of a glycerol-to-methanol fermentation process at large scale. First, a Research Cell Bank (RCB) was generated for each strain by cultivating a loopful of cells from a YPD plate for 48 hours in 200 ml of BMGY media (Invitrogen, Carlsbad, Calif.) to a measured optical density of 20-80. Cells are then mixed with 80% glycerol (v/v) to generate a final concentration of 20% glycerol to cell suspension (v/v) and cells are frozen at −80° C. in 1 ml aliquots.
For fermentation, performed in a stainless steel 40 liter Applikon (Foster City, Calif.) bioreactor, a vial (1 mL) of a RCB was inoculated into 500 mL of BSGY medium (4% glycerol, 1% yeast extract, 2% Soytone, 1.34% YNB without amino acids, 0.23% K2HPO4, 1.19% KH2PO4, 8 μg/L biotin) in 2.8 liter-baffled flask. The culture incubated at 24° C., while shaking on an orbital shaker at 180 rpm for 48±4 hours. The bioreactor was inoculated with a 10% volumetric ratio of seed to initial modified BSGY medium containing 50 g/L of maltitol and no sorbitol. Cultivation conditions were as follows: temperature set at 24±0.5° C., pH controlled at 6.5±0.1 with 30% ammonium hydroxide, dissolved oxygen was maintained at 20% of saturation by cascading agitation rate on the addition of pure oxygen to the fixed airflow rate of 0.7 vvm. After depletion of the initial glycerol (4%) charge, a 50% glycerol solution containing 12.5 mL/L of PTM1 salts (6.5 g FeSO4.7H2O, 2.0 g ZnCl2, 0.6 g CuSO4.5H2O, 3.0 g MnSO4.7H2O, 0.5 g CoCl2.6H2O, 0.2 g NaMoO4.2H2O, 0.2 g biotin, 80 mg NaI, 20 mg H3BO4 per L) was fed exponentially at a rate of 0.08 h−1 for 8 hours. After a 30 minute starvation phase, induction was initiated where methanol was fed exponentially starting at 1.5 g/L/h increasing at a rate of 0.008 h−1 and the entire induction phase was conducted under methanol-limited conditions.
Samples were harvested by removing 1 ml of broth and centrifuging for 30 seconds at top speed in a microcentrifuge, then flash freezing at −80° C. Samples were harvested during glycerol batch (˜50 mg/ml of wet cell weight), at the middle of glycerol fedbatch, and at 15+/−2 h, 37+/−2 h, and 60+/−2 hour of methanol induction. For lacZ assays, frozen cell pellets (100-200 ml) were washed twice in 1 ml PBS and resuspended in 200 ul complete protein inhibitor cocktail (Roche, cat #11 873 580 001) containing PBS. The cells were disrupted by vigorously vortexing cell suspension (100 ml) twice with 10 mg of 425-600 mesh glass beads (acid washed and air dried) for 2 minutes following addition of zymolyase (1 U/ml; AMS Biotechnology; Zymolyase®-20T). The mixture was placed at room temperature for 60 minutes with occasional brief vortexing. The protein content of the cell lysate was determined by BCA assay (Pierce, cat#23225). The unit of galactosidase activity was determined by the rate of 4-Methylumbelliferyl β-D-galactopyranoside hydrolysis in PBS per min per mg protein. β-Galactosidase from Kluyveromyces lactis (Sigma, Cat# G3665) was used as standard. The release of 4-Methylumbelliferone was measured by fluorescence detection (ex=355, em=460) for the duration of 5-60 minutes.
The 40 liter lacZ expression data demonstrated the scalability of each of the promoter cassettes tested. Similar to previous results, all promoters drove expression of lacZ under all conditions tested including the new PIR1 and CHT2 promoters and all promoters showed some level of expression reduction at later timepoints on methanol induction. Also, consistent with previous results, the PMA1 promoter, commonly used as a strong constitutive promoter, was quite weak compared to the other promoters tested and was especially reduced in expression on methanol compared to the other promoters. Again, the 1 kb GAP promoter paired with its native terminator was stronger than most of the other promoters, and here was significantly stronger than even the PIR1 or TEF promoters. However, the control 500 bp GAP promoter paired with the CYC1 terminator was significantly weaker than the 1 kb GAP promoter and in fact weaker than the TEF and PIR1 promoters as previously expected. These data demonstrated that the 1 kb GAP promoter paired with its native terminator established a new version of this promoter with a similar near constitutive nature (weaker on methanol than glycerol but still highly active) but much more active than the canonical 500 bp version previously reported. And the surprising identification of this new version of the GAP promoter will be a useful option as a highly active promoter useful for driving strong transcription of transgenes in P. pastoris.
The present invention is not to be limited in scope by the specific embodiments described herein. Indeed, the scope of the present invention includes embodiments specifically set forth herein and other embodiments not specifically set forth herein; the embodiments specifically set forth herein are not necessarily intended to be exhaustive. Various modifications of the invention in addition to those described herein will become apparent to those skilled in the art from the foregoing description. Such modifications are intended to fall within the scope of the claims.
Patents, patent applications, publications, product descriptions, and protocols are cited throughout this application, the disclosures of which are incorporated herein by reference in their entireties for all purposes.
The present application claims the benefit of U.S. provisional patent application No. 61/466,220, filed Mar. 22, 2011; and U.S. provisional patent application No. 61/473,426, filed Apr. 8, 2011; each of which is herein incorporated by reference in its entirety.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US2012/029146 | 3/15/2012 | WO | 00 | 9/16/2013 |
Number | Date | Country | |
---|---|---|---|
61466220 | Mar 2011 | US | |
61473426 | Apr 2011 | US |