The present invention is in the field of biotechnology, in particular in the field of gene expression and relates to a method for increasing the secretion of a protein of interest (POI) from a eukaryotic cell, comprising co-expression of a recombinant nucleotide sequence encoding a protein of interest and at least one recombinant nucleotide sequence encoding a protein that increases protein secretion. The invention further relates to a yeast promoter sequence, in particular to a promoter sequence of the PET9 gene of Pichia pastoris (P. pastoris), which is particularly useful for expression of a protein of interest in yeast, preferably in a strain of the genus Komagataella (Komagataella pastoris, Komagataella pseudopastoris or Komagataella phaffii), and which has an increased promoter activity relative to the promoter sequence of the glycerol aldehyde phosphate dehydrogenase (GAP) gene of Pichia pastoris under comparable conditions. The invention further relates to an expression vector based on the pPuzzle backbone comprising a PET9 promoter sequence from P. pastoris, as well as to the use of such an expression vector for expression of a protein of interest in a host cell, in particular in a strain of the genus Komagataella (K. pastoris, K. pseudopastoris or K. phaffii).
The invention also relates to new yeast promoter sequences of genes from P. pastoris, which are useful for expression of a protein of interest in yeast, preferably in a strain of the genus Komagataella (K. pastoris, K. pseudopastoris or K. phaffii).
Successful secretion of proteins has been accomplished both with prokaryotic and eukaryotic hosts. The most prominent examples are bacteria like Escherichia coli, yeasts like Saccharomyces cerevisiae, Pichia pastoris or Hansenula polymorpha, filamentous fungi like Aspergillus awamori or Trichoderma reesei, or mammalian cells like e.g. CHO cells. While the secretion of some proteins is readily achieved at high rates, many other proteins are only secreted at comparatively low levels (Punt et al., 2002; Macauley-Patrick et al., 2005; Porro et al., 2005).
The heterologous expression of a gene in a host organism requires a vector allowing stable transformation of the host organism. This vector has to provide the gene with a functional promoter adjacent to the 5′ end of the coding sequence. The transcription is thereby regulated and initiated by this promoter sequence. Most promoters used up to date have been derived from genes that code for metabolic enzymes that are usually present at high concentrations in the cell.
EP 0103409 discloses the use of yeast promoters associated with expression of specific enzymes in the glycolytic pathway, i.e. promoters involved in expression of pyruvate kinase, triosephosphate isomerase, phosphoglucose isomerase, phosphoglycerate mutase, hexokinase 1 and 2, glucokinase, phosphofructose kinase, aldolase and glycolytic regulation gene.
WO 97/44470 describes yeast promoters from Yarrowia lipolytica for the translation elongation factor 1 (TEF1) protein and for the ribosomal protein S7 that are suitable for heterologous expression of proteins in yeast.
WO 2005/003310 provides methods for the expression of a coding sequence of interest in yeast using a promoter of the glyceraldehyde-3-phosphate dehydrogenase or phosphoglycerate mutase from oleaginous yeast Yarrowia lipolytica.
One approach for the improvement of the secretion of a recombinant protein was done by random mutagenesis (Archer et al., 1994; Lang and Looman, 1995). The major disadvantage of this method is that positive results usually cannot be transferred to other strains.
The secretory pathway—the folding and processing of proteins—of eukaryotic organisms, e.g. of yeast, is very complex with many interacting participants. Some of these proteins have catalytic activity on the proteins like protein disulfide isomerase (PDI), others act by binding to the proteins and preventing them from aggregation (chaperones, e.g. BiP), or by stimulating release of the protein to the cell exterior at a later step in the secretory pathway (SSO proteins). Due to this interdependence, increasing the rate of one reaction step in the secretory pathway may not automatically augment secretion of a protein of interest, but instead may cause a rate-limitation at one or more of the subsequent reaction steps and thus may not remove but only shift bottle-neck(s) of the expression system.
The secretory pathway typically starts by translocation of transmembrane polypeptides and polypeptides intended for secretion into the lumen of the endoplasmatic reticulum (ER). For that purpose, these proteins possess an amino-terminal signal sequence. This signal sequence—also called leader sequence—typically consists of 13 to 36 rather hydrophobic amino acids; no special consensus sequence has been identified yet. On the ER luminal side the signal sequence is removed by a signal peptidase, while the nascent polypeptide is bound to chaperones to prevent miscoiling until translation has finished. ER resident proteins are responsible for correct folding mechanisms. They include, for example, calnexin, calreticulin, Erp72, GRP94, and PDI which latter catalyses the formation of disulfide bonds, and the prolyl-isomerase. Besides, some of the post-translational modifications such as N-glycosylation are initiated in the ER lumen. Proteins are exported to the Golgi apparatus by vesicular transport only after the correct conformation of the proteins has been assured by the ER quality control mechanism. Unless there is a differing signal, proteins intended for secretion are directed from the Golgi apparatus to the outside of the plasma membrane by specific transport vesicles (Stryer and Lubert, 1995; Gething and Sambrook, 1992).
In most cases the rate limiting step in the eukaryotic secretion pathway has been identified to be the move of proteins from the ER to the Golgi apparatus (Shuster, 1991). A mechanism called ER-associated protein degradation (ERAD) is responsible for the retention of misfolded or unmodified non-functional proteins in the ER and their subsequent removal.
It has been shown in several cases that the secretion process of heterologous proteins can be enhanced by co-overexpression of certain proteins that are involved in the secretory pathway and which support the folding and/or processing of other proteins (Mattanovich et al., 2004).
Co-expression of the gene encoding PDI and a gene encoding a heterologous disulphide-bonded protein was first suggested in WO 93/25676 as a means of increasing the production of the heterologous protein. WO 93/25676 reports that the recombinant expression of antistasin and tick anticoagulant protein can be increased by co-expression with PDI.
WO 94/08012 provides methods for increasing protein secretion in yeast by increasing expression of a Hsp70 chaperone protein, i.e. KAR2 and BiP or a PDI chaperone protein.
The yeast syntaxin homologs SSO1 and SSO2 are necessary for the fusion of secretory vesicles to the plasma membrane by acting as t-SNAREs.
WO 94/08024 discloses a process for producing increased amounts of secreted foreign or endogenous proteins by co-expression of the genes SSO1 and SOS2.
WO 03/057897 provides methods for the recombinant expression of a protein of interest by co-expressing at least two genes encoding proteins selected from the group consisting of the chaperone proteins GroEL, GRoES, Dnak, DnaJ, GRpe, CIpB and homologs thereof.
WO 2005/0617818 and WO 2006/067511 provide methods for producing a desired heterologous protein in yeast by using a 2 μm-based expression plasmid. It was demonstrated that the production of a heterologous protein is substantially increased when the genes for one or more chaperone protein(s) and a heterologous protein are co-expressed on the same plasmid.
Another approach to stimulate the secretory pathway is to overexpress the unfolded protein response (UPR) activating transcription factor HAC1. Transcriptional analyses revealed that up to 330 genes are regulated by HAC1, most of them belonging to the functional groups of secretion or the biogenesis of secretory organelles (e.g. ER-resident chaperones, foldases, components of the Translocon).
WO 01/72783 describes methods for increasing the amount of a heterologous protein secreted from a eukaryotic cell by inducing an elevated unfolded protein response (UPR), wherein the UPR is modulated by co-expression of a protein selected from the group consisting of HAC1, PTC2 and IRE1.
The flavoenzyme ERO1 is required for oxidation of protein dithiols in the ER. It is oxidized by molecular oxygen and acts as a specific oxidant of PDI. Disulfides generated de novo within ERO1 are transferred to PDI and then to substrate proteins by dithiol-disulfide exchange reactions.
WO 99/07727 discloses the use of ERO1 to enhance disulfide bond formation and thereby to increase the yield of properly folded recombinant proteins.
While these approaches, once established, can be transferred to other strains and used for other proteins as well, they are limited by the actual knowledge about the function of such proteins supporting the secretion of other proteins.
It can be anticipated that the successful high level secretion of a recombinant protein may be limited at a number of different steps, like folding, disulfide bridge formation, glycosylation, transport within the cell, or release from the cell. As many of these processes are still not fully understood, it can also be anticipated that there are many more proteins involved which support the secretion of a protein, than is currently known. However, such helper functions cannot be predicted with the current knowledge of the state-of-the-art, even when the DNA sequence of the entire genome of a host organism is available.
Proteins known to be involved in the yeast secretory pathway frequently influence the process of protein folding and subsequent secretion at different steps of the secretion process.
Accordingly, it is desirable to provide new methods to increase production of secreted proteins in eukaryotic cells which are simple and efficient. It is also desirable to provide new genes to be used in methods for the increased production of secreted proteins. It is also desirable to provide new yeast promoters, especially for use in the expression of heterologous or homologous genes in yeast, in particular in a yeast of the genus Komagataella, but also for expression of a desired gene in any other eukaryotic expression system.
It is an objective of the present invention to provide a method of increasing the secretion of a protein of interest (POI) from a eukaryotic cell, comprising co-expression of a recombinant nucleotide sequence encoding a POI and at least one recombinant nucleotide sequence encoding a protein that increases protein secretion from a host cell. An increase in secretion of the POI is determined on the basis of a comparison of its secretion yield in the presence or absence of co-expression of a said protein that increases protein secretion.
In one aspect the invention relates to such a method including the co-expression of a recombinant nucleotide sequence encoding a POI and of at least one other recombinant nucleotide sequence encoding a protein that increases protein secretion, wherein said protein that increases protein secretion is selected from the group consisting of BMH2, BFR2, C0G6, C0Y1, CUP5, IMH1, KIN2, SEC31, SSA4, SSE1, and a biologically active fragment of any of the foregoing proteins.
In another aspect the invention relates to such a method wherein at least one other recombinant nucleotide sequence is obtained from a yeast, preferably from Saccharomyces cerevisiae or from Pichia pastoris.
In another aspect the invention relates to such a method wherein at least one recombinant nucleotide sequence encoding a protein that increases protein secretion is obtained from Saccharomyces cerevisiae and is identical with or corresponds to and has the functional characteristics of a sequence selected from the group consisting of SEQ ID NO 32, SEQ ID NO 33, SEQ ID NO 34, SEQ ID NO 35, SEQ ID NO 36, SEQ ID NO 37, SEQ ID NO 38, SEQ ID NO 39, SEQ ID NO 40 and SEQ ID NO 41.
In another aspect the invention relates to such a method wherein at least one recombinant nucleotide sequence encoding a protein that increases protein secretion is obtained from Pichia pastoris and is identical with or corresponds to and has the functional characteristics of a sequence selected from the group consisting of SEQ ID NO 42, SEQ ID NO 43, SEQ ID NO 44, SEQ ID NO 45, SEQ ID NO 46, SEQ ID NO 47, SEQ ID NO 48, SEQ ID NO 49, SEQ ID NO 50 and SEQ ID NO 51.
In yet another aspect the invention relates to the use of such a nucleotide sequence encoding a protein that increases protein secretion as a protein secretion enhancer, particularly as an enhancer of the secretion of a POI from a eukaryotic cell.
It is another object of the invention to provide a nucleotide sequence encoding a protein that increases protein secretion from a host cell, wherein the nucleotide sequence is isolated from Pichia pastoris and is identical with or corresponds to and has the functional characteristics of a sequence selected from the group consisting of a nucleotide sequence encoding the protein BMH2 (SEQ ID NO 42), a nucleotide sequence encoding the protein BFR2 (SEQ ID NO 43), a nucleotide sequence encoding the protein C0G6 (SEQ ID NO 44), a nucleotide sequence encoding the protein C0Y1 (SEQ ID NO 45), a nucleotide sequence encoding the protein CUP5 (SEQ ID NO 46), a nucleotide sequence encoding the protein IMH1 (SEQ ID NO 47), a nucleotide sequence encoding the protein KIN2 (SEQ ID NO 48), a nucleotide sequence encoding the protein SEC31 (SEQ ID NO 49), a nucleotide sequence encoding the protein SSA4 (SEQ ID NO 50) and a nucleotide sequence encoding the protein SSE1 (SEQ ID NO 51).
It is another object of the invention to provide a yeast promoter sequence of the PET9 gene from Pichia pastoris, which is useful for expression of a POI in yeast, preferably in a strain of the genus Komagataella, in particular in a strain of K. pastoris, K. pseudopastoris or K. phaffii, and which has, under comparable conditions, an increased promoter activity relative to the promoter sequence of the GAP protein of Pichia pastoris.
It is another object of the invention to provide such a yeast promoter sequence, particularly a yeast promoter sequence identical with or corresponding to and having the functional characteristics of SEQ ID NO 125, or a functionally equivalent variant thereof.
In another aspect the invention relates to an expression vector based on the pPuzzle backbone further comprising such a yeast promoter sequence of the PET9 gene from Pichia pastoris which is identical with or corresponding to and having the functional characteristics of SEQ ID NO 125, or a functionally equivalent variant thereof.
In yet another aspect the invention relates to the use of such a plasmid for the expression of a POI in a host cell, the host cell preferably being a cell of a strain of the genus Komagataella, in particular a cell of a strain of K. pastoris, K. pseudopastoris or K. phaffii.
It is another object of the invention to provide a yeast promoter sequence from Pichia pastoris which is useful for the expression of a POI in yeast, preferably in a strain of the genus Komagataella, wherein the yeast promoter sequence is identical with or corresponds to and has the functional characteristics of a sequence selected from the group consisting of a 1000 bp fragment from the 5′-non coding region of the GND1 gene (SEQ ID NO 126), a 1000 bp fragment from the 5′-non coding region of the GPM1 gene (SEQ ID NO 127), a 1000 bp fragment from the 5′-non coding region of the HSP90 gene (SEQ ID NO 128), a 1000 bp fragment from the 5′-non coding region of the KAR2 gene (SEQ ID NO 129), a 1000 bp fragment from the 5′-non coding region of the MCM1 gene (SEQ ID NO 130), a 1000 bp fragment from the 5′-non coding region of the RAD2 gene (SEQ ID NO 131), a 1000 bp fragment from the 5′-non coding region of the RPS2 gene (SEQ ID NO 132), a 1000 bp fragment from the 5′-non coding region of the RPS31 gene (SEQ ID NO 133), a 1000 bp fragment from the 5′-non coding region of the SSA1 gene (SEQ ID NO 134), a 1000 bp fragment from the 5′-non coding region of the THI3 gene (SEQ ID NO 135), a 1000 bp fragment from the 5′-non coding region of the TPI1 gene (SEQ ID NO 136), a 1000 bp fragment from the 5′-non coding region of the UBI4 gene (SEQ ID NO 137), a 1000 bp fragment from the 5′-non coding region of the ENO1 gene (SEQ ID NO 138), a 1000 bp fragment from the 5′-non coding region of the RPS7A gene (SEQ ID NO 139), a 1000 bp fragment from the 5′-non coding region of the RPL1 gene (SEQ ID NO 140), a 1000 bp fragment from the 5′-non coding region of the TKL1 gene (SEQ ID NO 141), a 1000 bp fragment from the 5′-non coding region of the PIS1 gene (SEQ ID NO 142), a 1000 bp fragment from the 5′-non coding region of the FET3 gene (SEQ ID NO 143), a 1000 bp fragment from the 5′-non coding region of the FTR1 gene (SEQ ID NO 144), a 1000 bp fragment from the 5′-non coding region of the NMT1 gene (SEQ ID NO 145), a 1000 bp fragment from the 5′-non coding region of the PHO8 gene (SEQ ID NO 146), and a 1000 bp fragment from the 5′-non coding region of the FET3 precursor (FET3pre) gene (SEQ ID NO 147), or a functionally equivalent variant of any of the foregoing sequences.
In another aspect the invention relates to an expression vector based on the pPuzzle backbone further comprising such a yeast promoter sequence identical with or corresponding to and having the functional characteristics of a sequence selected from the group consisting of SEQ ID NO 126, SEQ ID NO 127, SEQ ID NO 128, SEQ ID NO 129, SEQ ID NO 130, SEQ ID NO 131, SEQ ID NO 132, SEQ ID NO 133, SEQ ID NO 134, SEQ ID NO 135, SEQ ID NO 136, SEQ ID NO 137, SEQ ID NO 138, SEQ ID NO 139, SEQ ID NO 140, SEQ ID NO 141, SEQ ID NO 142, SEQ ID NO 143, SEQ ID NO 144, SEQ ID NO 145, SEQ ID NO 146 and SEQ ID NO 147, or a functionally equivalent variant of any of the foregoing sequences.
In another aspect the invention relates to the use of such an expression vector for the expression of a POI in a host cell, the host cell being a cell of a strain of the genus Komagataella, in particular a cell of a strain of K. pastoris, K. pseudopastoris or K. phaffii.
The principle of the invention is further described in the independent claims, while the various embodiments of the invention are the subject matter of dependent claims.
To understand more about the gene regulation of a host organism during protein production, DNA microarray hybridization experiments with P. pastoris clones expressing recombinant human (rh) trypsinogen in comparison to a non-producing strain (according to Sauer et al., 2004) were performed. A detailed description of the experimental procedure is found in Example 1. These experiments allow for a determination of the transcription levels of approximately ⅓ of all genes in P. pastoris, but they do not provide direct information on the potential of any hitherto unidentified protein to enhance secretion.
Additional analysis of the data derived from DNA microarray hybridization has allowed the identification of potential secretion supporting proteins, or their genes respectively. To achieve this, the relative expression levels of all measured genes of a P. pastoris strain being transformed with a plasmid carrying a gene for rh trypsinogen were compared to a wild type strain cultivated under the same conditions. Then the genes were ordered by the relative difference of their expression levels, and some 524 genes with the highest difference were considered for further analysis. As the DNA microarrays used for these experiments were derived from Saccharomyces cerevisiae gene sequences, only putative gene functions for P. pastoris can be assigned by the homology to S. cerevisiae. After ranking the 524 differentially regulated genes based on their putative intracellular localisation and function, and focusing on those being involved in secretion and/or general stress response, out of a number of 64 potentially interesting genes 15 were selected for further analysis. These genes were cloned from S. cerevisiae by PCR and subcloned into a P. pastoris expression vector, and subsequently transformed into a P. pastoris strain expressing the Fab fragment of a monoclonal antibody (2F5mAb) against HIV1. By cultivating the clones producing both the Fab fragment and the different putative secretion helper proteins, compared to clones producing only the Fab fragment, a beneficial effect of the overexpression of the following genes encoding putative helper proteins on the secretion of the Fab fragment could be identified: PDI1, CUP5, SSA4, BMH2, KIN2, KAR2, HAC1, ERO1, SSE1, BFR2, C0G6, SSO2, C0Y1, IMH1 and SEC31.
The proteins PDI1, KAR2, HAC1, ERO1 and SSO2 are already known in the art as being successfully applicable folding/secretion helper factors when co-expressed during recombinant expression of heterologous proteins.
The other proteins identified in the DNA microarray assay, i.e. CUP5, SSA4, BMH2, KIN2, SSE1, BFR2, C0G6, C0Y1, IMH1 and SEC31 have not yet been described as having a beneficial effect on the secretion of recombinantly produced POI.
Accordingly, the present invention in its first aspect relates to a method of increasing the secretion of a POI from a eukaryotic cell comprising:
wherein said protein that increases protein secretion is selected from the group consisting of BMH2, BFR2, C0G6, C0Y1, CUP5, IMH1, KIN2, SEC31, SSA4, SSE1, and a biologically active fragment of any of the foregoing proteins.
The term “protein of interest (POI)” as used herein refers to a protein that is produced by means of recombinant technology in a host cell. More specifically, the protein may either be a polypeptide not naturally occurring in the host cell, i.e. a heterologous protein, or else may be native to the host cell, i.e. a homologous protein to the host cell, but is produced, for example, by transformation with a self replicating vector containing the nucleic acid sequence encoding the POI, or upon integration by recombinant techniques of one or more copies of the nucleic acid sequence encoding the POI into the genome of the host cell, or by recombinant modification of one or more regulatory sequences controlling the expression of the gene encoding the POI, e.g. of the promoter sequence.
The POI can be any eukaryotic or prokaryotic protein. The protein can be a naturally secreted protein or an intracellular protein, i.e. a protein which is not naturally secreted. The present invention also includes biologically active fragments of naturally secreted or not naturally secreted proteins.
A secreted POI referred to herein may be but is not limited to a protein suitable as a biopharmaceutical substance like an antibody or antibody fragment, growth factor, hormone, enzyme, vaccine, or a protein which can be used for industrial application like e.g. an enzyme.
A intracellular POI referred to herein may be but is not limited to a helper factor for protein secretion, or an enzyme used for metabolic engineering purposes.
In another embodiment, the POI is a eukaryotic protein or a biologically active fragment thereof, preferably an immunoglobulin or an immunoglobulin fragment such as a Fc fragment or a Fab fragment. Most preferably, the POI is a Fab fragment of the monoclonal anti-HIV1 antibody 2F5.
In general, the proteins of interest referred to herein may be produced by methods of recombinant expression well known to a person skilled in the art.
It is understood that the methods disclosed herein may further include cultivating said recombinant host cells under conditions permitting the expression of the POI. A secreted, recombinantly produced POI can then be isolated from the cell culture medium and further purified by techniques well known to a person skilled in the art.
As used herein, a “biologically active fragment” of a protein shall mean a fragment of a protein that exerts a biological effect similar or comparable to the full length protein. Such fragments can be produced e.g. by amino- and carboxy-terminal deletions as well as by internal deletions.
In general, the host cell from which the proteins are secreted can be any eukaryotic cell suitable for recombinant expression of a POI.
In a preferred embodiment, the invention relates to such a method, wherein the host cell is a fungal cell, e.g. a yeast cell, or a higher eukaryotic cell, e.g. a mammalian cell or a plant cell.
Examples of yeast cells include but are not limited to the Saccharomyces genus (e.g. Saccharomyces cerevisiae), the Komagataella genus (Komagataella pastoris, Komagataella pseudopastoris or Komagataella phaffii), Pichia methanolica, Hansenula polymorpha or Kluyveromyces lactis.
In a preferred embodiment the invention relates to a method, wherein the yeast cell is a cell of the Komagataella genus, in particular a cell of a strain of Komagataella pastoris, Komagataella pseudopastoris or Komagataella phaffii.
The former species Pichia pastoris has been divided and renamed to Komagataella pastoris and Komagataella phaffii (Kurtzman, 2005). Therefore Pichia pastoris is synonymous for both Komagataella pastoris and Komagataella phaffii.
The nucleotide sequences encoding the proteins that increase protein secretion can be obtained from a variety of sources. Said proteins may be involved in the eukaryotic protein secretory pathway.
In one aspect the invention relates to such a method, wherein at least one recombinant nucleotide sequence encoding a protein that increases protein secretion is a yeast nucleotide sequence, preferably but not limited to a nucleotide sequence of the yeast species Saccharomyces cerevisiae or Pichia pastoris. Also, homologous nucleotide sequences from other suitable yeasts or other fungi or from other organisms such as vertebrates can be used.
The term “homologous nucleotide sequences” as used herein refers to nucleotide sequences which are related but not identical in their nucleotide sequence with the contemplated nucleotide sequence, and perform essentially the same function.
In a further aspect the invention relates to such a method, wherein at least one recombinant nucleotide sequence encoding a protein that increases protein secretion is obtained from Saccharomyces cerevisiae and is identical with or corresponds to and has the functional characteristics of a sequence selected from the group consisting of SEQ ID NO 32, SEQ ID NO 33, SEQ ID NO 34, SEQ ID NO 35, SEQ ID NO 36, SEQ ID NO 37, SEQ ID NO 38, SEQ ID NO 39, SEQ ID NO 40 and SEQ ID NO 41.
As used herein, the term “nucleotide sequence that corresponds to and has the functional characteristics of” is meant to encompass variations in its nucleotide composition including variations due to the degeneracy of the genetic code, whereby the nucleotide sequence performs essentially the same function.
By screening a P. pastoris genome database (ERGO™, IG-66, Integrated Genomics) with the nucleotide sequences of the secretion helper factors isolated from Saccharomyces cerevisiae homologous nucleotide sequences in Pichia pastoris have been identified. Preliminary experimental results indicate that these homologous nucleotide sequences isolated from Pichia pastoris show similar effects on protein secretion from a host cell when compared to the corresponding nucleotide sequences isolated from Saccharomyces cerevisiae.
In a further aspect the invention relates to such a method, wherein at least one recombinant nucleotide sequence encoding a protein that increases protein secretion is obtained from Pichia pastoris and is identical with or corresponds to and has the functional characteristics of a sequence selected from the group consisting of SEQ ID NO 42, SEQ ID NO 43, SEQ ID NO 44, SEQ ID NO 45, SEQ ID NO 46, SEQ ID NO 47, SEQ ID NO 48, SEQ ID NO 49, SEQ ID NO 50 and SEQ ID NO 51.
In a further aspect the invention relates to such a method, wherein the recombinant nucleotide sequence encoding the POI is provided on a plasmid suitable for integration into the genome of the host cell, in a single copy or in multiple copies per cell. The recombinant nucleotide sequence encoding the POI may also be provided on an autonomously replicating plasmid in a single copy or in multiple copies per cell.
Alternatively, the recombinant nucleotide sequence encoding the POI and the recombinant nucleotide sequence encoding a protein that increases protein secretion are present on the same plasmid in single copy or multiple copies per cell.
The terms “plasmid” and “vector” as used herein include autonomously replicating nucleotide sequences as well as genome integrating nucleotide sequences.
In a further aspect, the invention relates to such a method, wherein the plasmid is a eukaryotic expression vector, preferably a yeast expression vector.
“Expression vectors” as used herein are defined as DNA sequences that are required for the transcription of cloned recombinant nucleotide sequences, i.e. of recombinant genes and the translation of their mRNA in a suitable host organism. Such expression vectors usually comprise an origin for autonomous replication in the host cells, selectable markers (e.g. an amino acid synthesis gene or a gene conferring resistance to antibiotics such as zeocin, kanamycin, G418 or hygromycin), a number of restriction enzyme cleavage sites, a suitable promoter sequence and a transcription terminator, which components are operably linked together.
The term “operably linked” as used herein refers to the association of nucleotide sequences on a single nucleic acid molecule, e.g. a vector, in a way such that the function of one or more nucleotide sequences is affected by at least one other nucleotide sequence present on said nucleic acid molecule. For example, a promoter is operably linked with a coding sequence of a recombinant gene when it is capable of effecting the expression of that coding sequence.
Expression vectors may include but are not limited to cloning vectors, modified cloning vectors and specifically designed plasmids. The expression vector of the invention may be any expression vector suitable for expression of a recombinant gene in a host cell and is selected depending on the host organism.
In another aspect the invention relates to such a method, wherein the expression vector comprises a secretion leader sequence effective to cause secretion of the POI from the host cell.
The presence of such a secretion leader sequence in the expression vector is required when the POI intended for recombinant expression and secretion is a protein which is not naturally secreted and therefore lacks a natural secretion leader sequence, or its nucleotide sequence has been cloned without its natural secretion leader sequence. In general, any secretion leader sequence effective to cause secretion of the POI from the host cell may be used in the present invention. The secretion leader sequence may originate from yeast source, e.g. from yeast α-factor such as MFα of Saccharomyces cerevisiae, or yeast phosphatase, from mammalian or plant source, or others. The selection of the appropriate secretion leader sequence is apparent to a skilled person.
Alternatively, the secretion leader sequence can be fused to the nucleotide sequence encoding a POI intended for recombinant expression by conventional cloning techniques known to a skilled person prior to cloning of the nucleotide sequence in the expression vector or the nucleotide sequence encoding a POI comprising a natural secretion leader sequence is cloned in the expression vector. In these cases the presence of a secretion leader sequence in the expression vector is not required.
To allow expression of a recombinant nucleotide sequence in a host cell the expression vector has to provide the recombinant nucleotide sequence with a functional promoter adjacent to the 5′ end of the coding sequence. The transcription is thereby regulated and initiated by this promoter sequence.
In a further aspect the invention relates to such a method, wherein the expression vector comprises a promoter sequence effective to control expression of the POI in the host cell.
“Promoter sequence” as used herein refers to a DNA sequence capable of controlling the expression of a coding sequence or functional RNA.
Suitable promoter sequences for use with yeast host cells may include but are not limited to promoters obtained from genes that code for metabolic enzymes which are known to be present at high concentration in the cell, e.g. glycolytic enzymes like triosephosphate isomerase (TPI), phosphoglycerate kinase (PGK), glyceraldehyde-3-phosphate dehydrogenase (GAPDH), alcohol oxidase (AOX), lactase (LAC) and galactosidase (GAL).
Suitable promoter sequences for use with mammalian host cells may include but are not limited to promoters obtained from the genomes of viruses, heterologous mammalian promoters, e.g. the actin promoter or an immunoglobulin promoter, and heat shock protein promoters.
In order to identify novel promoter sequences for use in yeast host cells, preferably for use in a strain of the Komagataella genus, in particular for use in a strain of K. pastoris, K. pseudopastoris or K. phaffii for recombinant expression of a POI, the data derived from the DNA microarray hybridisation described in Example 1 were evaluated in a specific manner.
The promoter sequences of the 23 most interesting genes identified by this analysis (up to 1000 bp of the 5′-region of the respective genes) were amplified from P. pastoris by PCR and cloned into a P. pastoris expression vector, which additionally carries an enhanced green fluorescent protein (eGFP) as a reporter gene. To test the properties of the different promoters, i.e. the promoter activity, the 25 vectors (including two control vectors) were subsequently transformed into a P. pastoris strain. The clones were cultivated under different culturing conditions and the amount of recombinant eGFP was quantified using flow cytometer analysis. A comparative analysis of the well established yeast promoter of GAP and the 23 promoter sequences is provided in Example 5.
The term “promoter activity” as used herein refers to an assessment of the transcriptional efficiency of a promoter. This may be determined directly by measurement of the amount of mRNA transcription from the promoter, e.g. by Northern Blotting or indirectly by measurement of the amount of gene product expressed from the promoter.
It was surprisingly found that a 1000 bp fragment from the 5′-non coding region of the PET9 gene of P. pastoris results in real unexpected high expression levels of recombinant eGFP, ranging from about 700% to about 1600% of the promoter activity of the GAP promoter, depending on the carbon source during cultivation, under the experimental conditions as described in Example 5.
PET9 is known from S. cerevisiae as a major ADP/ATP carrier of the mitochondrial inner membrane, which exchanges cytosolic ADP for mitochondrial synthesized ATP.
In another aspect the invention relates to a method of increasing the secretion of a POI from a eukaryotic cell, wherein the nucleotide sequence encoding the POI is controlled by a promoter sequence which is a 1000 bp fragment from the 5′-non coding region of the PET9 gene of Pichia pastoris corresponding to SEQ ID NO 125, or a functionally equivalent variant thereof and the host cell is a cell of the genus Komagataella, in particular a cell of a strain of K. pastoris, K. pseudopastoris or K. phaffii.
In another aspect the invention relates to the use of a nucleotide sequence isolated from Saccharomyces cerevisiae and encoding a protein that increases protein secretion and being selected from the group consisting of BMH2, BFR2, C0G6, C0Y1, CUP5, IMH1, KIN2, SEC31, SSA4, SSE1, and a biologically active fragment of any of the foregoing proteins, as a secretion enhancer, particularly as an enhancer of the secretion of a POI from a eukaryotic cell, preferably in a yeast cell and most preferred in a cell of a strain of K. pastoris, K. pseudopastoris or K. phaffii.
In a further aspect the invention relates to such a use wherein the nucleotide sequence encoding a protein that increases protein secretion is identical with or corresponds to and has the functional characteristics of a sequence selected from the group consisting of SEQ ID NO 32, SEQ ID NO 33, SEQ ID NO 34, SEQ ID NO 35, SEQ ID NO 36, SEQ ID NO 37, SEQ ID NO 38, SEQ ID NO 39, SEQ ID NO 40 and SEQ ID NO 41.
In another aspect the invention relates to the use of a nucleotide sequence isolated from Pichia pastoris and encoding a protein that increases protein secretion and being selected from the group consisting of BMH2, BFR2, C0G6, C0Y1, CUP5, IMH1, KIN2, SEC31, SSA4, SSE1, and a biologically active fragment of any of the foregoing proteins, as a secretion enhancer, particularly as an enhancer of the secretion of a POI from a eukaryotic cell, preferably in a yeast cell and most preferred in a cell of a strain of K. pastoris, K. pseudopastoris or K. phaffii.
In a further aspect the invention relates to such a use, wherein the nucleotide sequence encoding a protein that increases protein secretion is identical with or corresponds to and has the functional characteristics of a sequence selected from the group consisting of SEQ ID NO 42, SEQ ID NO 43, SEQ ID NO 44, SEQ ID NO 45, SEQ ID NO 46, SEQ ID NO 47, SEQ ID NO 48, SEQ ID NO 49, SEQ ID NO 50 and SEQ ID NO 51.
SSA4 is a member of the HSP70 family of molecular chaperones. SSA4 is participating in the SRP-dependent targeting of protein to the ER membrane prior to the cotranslational translocation of the protein into the ER-lumen, and is induced upon stress response.
The chaperonines of the SSE/HSP110 subclass of the HSP70 family, that are encoded by SSE1 and SSE2, assist in folding by binding to nascent peptides and holding them in a folding-competent state, however, they can not actively promote folding reactions. On the basis of their “holdase” activity, interactions to chaperones such as Ssa1p and Ssb1p of the HSP70 family as well as to the HSP90 complex seem plausible.
Sec31p is an essential phosphoprotein component of the coat protein complex II (COPII) of secretory pathway vesicles, in complex with Sec13p.
Growth defects due to mutations in either Sec13 or Sec23 (as well as Sec16 and Ypt1) can be overcome by overexpression of the essential S. cerevisiae gene BFR2. It has been isolated as a multi-copy suppressor of the drug Brefeldin A, a fungal metabolite that perturbs the protein flux into the Golgi and the structure of the Golgi apparatus itself.
14-3-3 proteins, encoded by BMH1 and BMH2, were identified to participate in multiple steps of vesicular trafficking, especially in protein exit from the ER, forward trafficking of multimeric cell surface membrane proteins as well as in retrograde transportation within the Golgi apparatus.
C0G6 belongs to one of eight genes coding for the Conserved Oligomeric Golgi (COG) complex, an eight-subunit peripheral Golgi protein, that is engaged in membrane trafficking and synthesis of glycoconjugates. Moreover, the COG complex is not only necessary for maintaining normal Golgi structure and function, but is also directly involved in retrograde vesicular transport within the Golgi apparatus.
The molecular function of Coy1, a protein identified by similarity to mammalian CASP, is not established yet, but is seems to be playing a role in Golgi vesicle transport through interaction with Gos1. Gos1 is a SNARE (soluble N-ethylmaleimide-sensitive factor attachment protein receptor) protein commonly used as marker of later compartments of the Golgi in S. cerevisiae.
The product of the IMH1/SYS3 gene is a member of the peripheral membrane Golgins involved in vesicular transport between the late Golgi and a prevacuolar, endosome-like compartment. Imh1 is recruited by to the Golgi by the two ARF-like (ARL) GTPases, Arl1p and Arl3p.
Kin2, and the closely related Kin1, are two serine/threonine protein kinases localized at the cytoplasmic side of the plasma membrane. The catalytic activity of Kin2 is essential for its function in regulation of exocytosis by phosphorylation of the plasma membrane t-SNARE Sec9, a protein acting at the final step of exocytosis. Genetic analysis indicates that the KIN kinases act downstream of the Exocyst, the vesicle tethering factor at the site of exocytosis, and its regulator Sec4 (GTP binding protein of the Ras family).
CUP5 encodes the c subunit of the yeast vacuolar (H)-ATPase (V-ATPase) Vo domain, belonging to a family of ATP-dependent proton pumps that acidify the yeast central vacuole. The Vo domain is an integral membrane structure of five subunits responsible for transporting protons across the membrane.
Assembling of the Vo domain is not possible in the absence of Cup5. V-ATPase function is important for many processes including endocytosis, protein degradation and coupled transport across the vacuolar membrane.
Additionally, a role for V-ATPase in detoxification of copper, iron metabolism and mitochondrial function was reported.
In another aspect the invention relates to a nucleotide sequence encoding a protein that increases protein secretion from a host cell, wherein the nucleotide sequence is isolated from Pichia pastoris and is identical with or corresponds to and has the functional characteristics of a sequence selected from the group consisting of a nucleotide sequence encoding the protein BMH2 (SEQ ID NO 42), a nucleotide sequence encoding the protein BFR2 (SEQ ID NO 43), a nucleotide sequence encoding the protein C0G6 (SEQ ID NO 44), a nucleotide sequence encoding the protein C0Y1 (SEQ ID NO 45), a nucleotide sequence encoding the protein CUP5 (SEQ ID NO 46), a nucleotide sequence encoding the protein IMH1 (SEQ ID NO 47), a nucleotide sequence encoding the protein KIN2 (SEQ ID NO 48), a nucleotide sequence encoding the protein SEC31 (SEQ ID NO 49), a nucleotide sequence encoding the protein SSA4 (SEQ ID NO 50) and a nucleotide sequence encoding the protein SSE1 (SEQ ID NO 51).
In a further aspect the invention relates to a yeast promoter sequence being a 1000 bp fragment from the 5′-non coding region of the PET9 gene corresponding to SEQ ID NO 125, or a functionally equivalent variant thereof and being isolated from Pichia pastoris.
It should be recognized that promoter sequences of various diminishing length may have identical promoter activity and should be therefore also included in the present invention, since the exact boundaries of the regulatory sequence of the 5′-non coding region of the PET9 gene have not been defined.
Therefore the term “functionally equivalent variant” of a promoter sequence as used herein means a nucleotide sequence resulting from modification of this nucleotide sequence by insertion, deletion or substitution of one or more nucleotides within the sequence or at either or both of the distal ends of the sequence, and which modification does not affect (in particular impair) the promoter activity of this nucleotide sequence.
In a further aspect the invention relates to such a yeast promoter sequence which has, under comparable conditions, improved properties for expression of a POI in yeast, preferably in a strain of the genus Komagataella, in particular in a strain of Komagataella pastoris, Komagataella pseudopastoris or Komagataella phaffii, relative to a yeast promoter known in the art, in particular relative to a GAP promoter isolated from Pichia pastoris.
In a further aspect the invention relates to such a yeast promoter sequence, having, under comparable conditions, at least the same, or at least about a 1.5-fold, or at least about 2-fold, or at least about a 4-fold, 7-fold, 10-fold, or at least up to about a 15-fold promoter activity relative to a GAP promoter isolated from Pichia pastoris.
It is desirable to have an expression system for recombinant expression of a nucleotide sequence in a host organism, in particular in a yeast host, more particular in a strain of the genus Komagataella, which offers the opportunity to easily change the different parts of the vector, like the selection marker, e.g. a resistance for zeocin, kanamycin/geneticin, hygromycin and others, the promoter or the transcription terminator. It would be also advantageous if the vector could either be integrated into the genome of the host (using homologous integration sequences) or located episomally by exchanging a part of the vector which is not important for heterologous gene expression.
For construction of a novel vector system pPuzzle which provides the above mentioned advantages, in a first step a vector backbone of pPuzzle was generated carrying an origin of replication and a selection marker for Escherichia coli (E. coli), which enables amplification of the vector backbone in E.coli. In addition, the vector backbone of pPuzzle comprises a multiple cloning site (see
In a second step the pPuzzle expression vector carrying a eukaryotic selection marker, a promoter for recombinant expression of a heterologous or homologous nucleotide sequence, a transcription terminator and optionally sequences for homologous integration of the vector in the host genome was constructed (see Example 4). The selection of the promoter sequence and the selection marker depends on the host organism which is used for recombinant expression of a nucleotide sequence. The transcription terminator can be, in principle, each functional transcription terminator and is in particular the transcription terminator of the cytochrome c gene from S. cerevisiae. Further, the presence of homologous integration sequences depends on whether the nucleotide sequence is intended to be integrated in the genome of the host organism or not. Since the selection marker, the promoter sequence and the homologous integration sequences are flanked by unique restriction enzyme cleavage sites they can easily be exchanged, i.e. cut out and substituted, whereby the vector can be altered or adapted to a selected host organism in a simple and efficient way.
In detail, the selection marker is cloned in a unique KpnI restriction site, the homologous integration sequences are cloned in a unique NotI restriction site, the promoter is cloned by using the ApaI and the SbfI/AarI restriction site and the nucleotide sequence encoding a POI is cloned in the MCS (multiple cloning site) using the restriction sites SbfI and SfII.
In a further aspect the invention relates to an eukaryotic expression vector based on the pPuzzle backbone further comprising the following components operably linked to each other:
wherein the promoter is a 1000 bp fragment from the 5′-non coding region of the PET9 gene of Pichia pastoris (SEQ ID NO 125), or a functionally equivalent variant thereof, the transcription terminator is the transcription terminator of the cytochrome c gene from S. cerevisiae, the selection marker is a zeocin resistance gene and the host cell is a yeast cell, preferably a cell of a strain of the genus Komagataella, in particular a cell of a strain of Komagataella pastoris, Komagataella pseudopastoris or Komagataella phaffii.
A detailed description of the procedure for the construction of such a vector, which additionally contains an enhanced green fluorescent protein eGFP as a reporter gene (pPuzzle_zeoR_Ppet9_eGFP_AOXTT) is found in Examples 3 to 5 and in
It is understood that any heterologous or homologous nucleotide sequence intended for recombinant expression in a host cell can be used in the position of eGFP.
In another aspect the invention relates to the use of such a eukaryotic expression vector for recombinant expression of a POI in a host cell.
Depending on the problem to be solved it can be desirable to either have a strong expression of a protein of interest in a host cell (e.g. for recombinant production of a POI in a host cell) or to have a weak or reduced expression of a protein of interest in a host cell (e.g. when analysing the molecular function of a POI in a host cell).
Particularly, in case of the analysis of the molecular function of a cellular POI or in case of a POI intended for metabolic engineering applications, which protein shall not be secreted, but develop its activity within a desired compartment of the cell, it would be attractive being able to regulate the expression level of this protein of interest via the promoter activity. It can be desirable to either have a strong expression of the POI (comparable to or stronger as from the GAP promoter) or to have a weak or reduced expression of the POI (less than from the GAP promoter). It is therefore useful to have a selection of different promoter sequences suitable for recombinant expression of a heterologous or homologous nucleotide sequence in a host organism, in particular in a yeast host, more particular in a strain of the genus Komagataella, in particular in a cell of a strain of Komagataella pastoris, Komagataella pseudopastoris or Komagataella phaffii, having different promoter activities under comparable cell culture conditions, varying from strong promoter activity to weak or reduced promoter activity as compared to the GAP promoter. This allows to regulate the expression level of a protein of interest by selection of a suitable promoter sequence according to the experimental situation.
From the comparative analysis of promoter sequences as described in Example 5, i.e. from the analysis of the promoter activity, several promoter sequences with different promoter activities, ranging from 0% to about 135% of the promoter activity of a GAP promoter isolated from Pichia pastoris, under the experimental conditions as described in Example 5, have been found.
A summary of the promoter activities of the yeast promoter sequences tested in Example 5 (determined by measurement of the relative expression level in % of the reporter gene product eGFP and standardisation on eGFP expression under the GAP promoter) is found in Table 8.
In detail, a 1000 bp fragment from the 5′-non coding region of the GND1 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 67% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5′-non coding region of the GPM1 gene had, under the experimental conditions of Example 5, a promoter activity ranging from about 19% to about 41% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5′-non coding region of the HSP90 gene had, under the experimental conditions of Example 5, a promoter activity ranging from about 6% to about 81% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5′-non coding region of the KAR2 gene had, under the experimental conditions of Example 5, a promoter activity ranging from about 11% to about 135% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5′-non coding region of the MCM1 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 6% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5′-non coding region of the RAD2 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 5% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5′-non coding region of the RPS2 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 12% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5′-non coding region of the RPS31 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 8% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5′-non coding region of the SSA1 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 30% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5′-non coding region of the THI3 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 42% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5′-non coding region of the TPI1 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 92% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5′-non coding region of the UBI4 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 4% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5′-non coding region of the ENO1 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 17% to about 47% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5′-non coding region of the RPS7A gene had, under the experimental conditions of Example 5, a promoter activity ranging from 1% to about 18% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5′-non coding region of the RPL1 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 11% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5′-non coding region of the TKL1 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 9% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5′-non coding region of the PIS1 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 7% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5′-non coding region of the FET3 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 7% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5′-non coding region of the FTR1 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 6% of the promoter activity of the GAP promoter. A 1000 by fragment from the 5′-non coding region of the NMT1 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 5% of the promoter activity of the GAP promoter. A 1000 by fragment from the 5′-non coding region of the PHO8 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 6% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5′-non coding region of the FET3 precursor (FET3pre) gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 7% of the promoter activity of the GAP promoter.
In another aspect the invention relates to a yeast promoter sequence being isolated from Pichia pastoris and being identical with or corresponding to and having the functional characteristics of a sequence selected from the group consisting of a 1000 bp fragment from the 5′-non coding region of the GND1 gene (SEQ ID NO 126), a 1000 bp fragment from the 5′-non coding region of the GPM1 gene (SEQ ID NO 127), a 1000 bp fragment from the 5′-non coding region of the HSP90 gene (SEQ ID NO 128), a 1000 bp fragment from the 5′-non coding region of the KAR2 gene (SEQ ID NO 129), a 1000 bp fragment from the 5′-non coding region of the MCM1 gene (SEQ ID NO 130), a 1000 bp fragment from the 5′-non coding region of the RAD2 gene (SEQ ID NO 131), a 1000 by fragment from the 5′-non coding region of the RPS2 gene (SEQ ID NO 132), a 1000 bp fragment from the 5′-non coding region of the RPS31 gene (SEQ ID NO 133), a 1000 bp fragment from the 5′-non coding region of the SSA1 gene (SEQ ID NO 134), a 1000 bp fragment from the 5′-non coding region of the THIS gene (SEQ ID NO 135), a 1000 bp fragment from the 5′-non coding region of the TPI1 gene (SEQ ID NO 136), a 1000 bp fragment from the 5′-non coding region of the UBI4 gene (SEQ ID NO 137), a 1000 bp fragment from the 5′-non coding region of the ENO1 gene (SEQ ID NO 138), a 1000 bp fragment from the 5′-non coding region of the RPS7A gene (SEQ ID NO 139), a 1000 bp fragment from the 5′-non coding region of the RPL1 gene (SEQ ID NO 140), a 1000 bp fragment from the 5′-non coding region of the TKL1 gene (SEQ ID NO 141), a 1000 bp fragment from the 5′-non coding region of the PIS1 gene (SEQ ID NO 142), a 1000 bp fragment from the 5′-non coding region of the FET3 gene (SEQ ID NO 143), a 1000 bp fragment from the 5′-non coding region of the FTR1 gene (SEQ ID NO 144), a 1000 bp fragment from the 5′-non coding region of the NMT1 gene (SEQ ID NO 145), a 1000 bp fragment from the 5′-non coding region of the PHO8 gene (SEQ ID NO 146), and a 1000 bp fragment from the 5′-non coding region of the FET3 precursor (FET3pre) gene (SEQ ID NO 147), or a functionally equivalent variant of any of the foregoing sequences.
Enolase 1 (ENO1) is a phosphopyruvate hydratase that catalyzes the conversion of 2-phosphoglycerate to phosphoenolpyruvate during glycolysis and the reverse reaction during gluconeogenesis.
Triose phosphate isomerase (TPI1) is an abundant glycolytic enzyme. It catalyzes the interconversion of glyceraldehyde-3-phosphate and dihydroxyacetone phosphate during glycolysis.
THI3 is a probable decarboxylase, required for expression of enzymes involved in thiamine biosynthesis and may have a role in catabolism of amino acids to long-chain and complex alcohols.
SSA1 is an ATPase involved in protein folding and nuclear localization signal (NLS)-directed nuclear transport. SSA1 is member of heat shock protein 70 (HSP70) family.
RPS7A is a protein component of the small (40S) ribosomal subunit.
6-Phosphogluconate dehydrogenase (GND1) catalyzes an NADPH regenerating reaction in the pentose phosphate pathway and is required for growth on D-glucono-delta-lactone and adaptation to oxidative stress.
GPM1 encodes the phosphoglycerate mutase, which is a tetrameric enzyme responsible for the conversion of 3-phospholycerate to 2-phosphoglycerate during glycolysis, and the reverse reaction during gluconeogenesis.
Transketolase (TKL1) catalyzes conversion of xylulose-5-phosphate and ribose-5-phosphate to sedoheptulose-7-phosphate and glyceraldehyde-3-phosphate in the pentose phosphate pathway and is needed for synthesis of aromatic amino acids.
Heat Shock Protein 90 (HSP90) is a cytoplasmic chaperone (Hsp90 family).
RPS2 is a protein component of small ribosomal(40S) subunit.
RPS31 is a fusion protein that is cleaved to yield a ribosomal protein of the small (40S) subunit and ubiquitin.
RPL1A is a protein component of the large ribosomal (60S) subunit.
The phosphatidylinositol synthase PIS1 is required for biosynthesis of phosphatidylinositol, which is a precursor for polyphosphoinositides, sphingolipids, and glycolipid anchors for some of the plasma membrane proteins.
Ferro-O2-oxidoreductase (FET3) belongs to class of integral membrane multicopper oxidases and is required for high-affinity iron uptake and involved in mediating resistance to copper ion toxicity, FET3pre its precursor.
The high affinity iron permease (FTR1) is involved in the transport of iron across the plasma membrane and forms complex with Fet3p.
PHO8 is a repressible alkaline phosphatase.
N-myristoyl transferase NMT1 catalyzes the cotranslational, covalent attachment of myristic acid to the N-terminal glycine residue of several proteins involved in cellular growth and signal transduction.
The transcription factor MCM1 is involved in cell-type-specific transcription and pheromone response.
Ubiquitin (UBI4) becomes conjugated to proteins, marking them for selective degradation via the ubiquitin-26S proteasome system.
RAD2, a single-stranded DNA endonuclease, cleaves single-stranded DNA during nucleotide excision repair to excise damaged DNA.
In a further aspect the invention relates to a eukaryotic expression vector based on the pPuzzle backbone further comprising the following components operably linked to each other:
wherein the promoter is a yeast promoter sequence isolated from Pichia pastoris and is identical with or corresponds to and has the functional characteristics of a sequence selected from the group consisting of SEQ ID NO 125, SEQ ID NO 126, SEQ ID NO 127, SEQ ID NO 128, SEQ ID NO 129, SEQ ID NO 130, SEQ ID NO 131, SEQ ID NO 132, SEQ ID NO 133, SEQ ID NO 134, SEQ ID NO 135, SEQ ID NO 136, SEQ ID NO 137, SEQ ID NO 138, SEQ ID NO 139, SEQ ID NO 140, SEQ ID NO 141, SEQ ID NO 142, SEQ ID NO 143, SEQ ID NO 144, SEQ ID NO 145, SEQ ID NO 146 and SEQ ID NO 147, or a functionally equivalent variant of any of the foregoing sequences, and the host cell is a yeast cell, preferably a cell of a strain of the genus Komagataella, in particular a cell of a strain of Komagataella pastoris, Komagataella pseudopastoris or Komagataella phaffii.
In another aspect the invention relates to the use of such a eukaryotic expression vector for recombinant expression of a POI in a host cell.
In case, that the POI is a cellular protein intended for metabolic engineering applications, i.e. for expression and developing its activity within a desired compartment of a host cell the POI may be expressed from a eukaryotic expression vector based on the pPuzzle backbone without a leader sequence effective to cause secretion of the POI from the host cell.
If the cellular POI is a homologous protein to the host cell, i.e. a protein which is naturally occurring in the host cell, the expression of the POI in the host cell may be modulated by the exchange of its native promoter sequence with a yeast promoter sequence isolated from Pichia pastoris and being identical with or corresponding to and having the functional characteristics of a sequence selected from the group consisting of SEQ ID NO 125, SEQ ID NO 126, SEQ ID NO 127, SEQ ID NO 128, SEQ ID NO 129, SEQ ID NO 130, SEQ ID NO 131, SEQ ID NO 132, SEQ ID NO 133, SEQ ID NO 134, SEQ ID NO 135, SEQ ID NO 136, SEQ ID NO 137, SEQ ID NO 138, SEQ ID NO 139, SEQ ID NO 140, SEQ ID NO 141, SEQ ID NO 142, SEQ ID NO 143, SEQ ID NO 144, SEQ ID NO 145, SEQ ID NO 146 and SEQ ID NO 147 or a functionally equivalent variant of any of the foregoing sequences.
This purpose may be achieved e.g. by transformation of a host cell with a recombinant DNA molecule comprising homologous sequences of the target gene to allow site specific recombination, the desired yeast promoter sequence and a selective marker suitable for the host cell. The site specific recombination shall take place in order to operably link the yeast promoter sequence with the nucleotide sequence encoding the POI. This results in the expression of the POI from the yeast promoter sequence instead of from the native promoter sequence.
Depending on the problem to be solved the selected yeast promoter may have either an increased promoter activity relative to the native promoter sequence leading to an increased expression of a POI in the host cell or may have a decreased promoter activity relative to the native promoter sequence leading to a reduced expression of a POI in the host cell.
In another aspect the invention relates to the use of a yeast promoter sequence being isolated from Pichia pastoris and being identical with or corresponding to and having the functional characteristics of a sequence selected from the group consisting of SEQ ID NO 125, SEQ ID NO 126, SEQ ID NO 127, SEQ ID NO 128, SEQ ID NO 129, SEQ ID NO 130, SEQ ID NO 131, SEQ ID NO 132, SEQ ID NO 133, SEQ ID NO 134, SEQ ID NO 135, SEQ ID NO 136, SEQ ID NO 137, SEQ ID NO 138, SEQ ID NO 139, SEQ ID NO 140, SEQ ID NO 141, SEQ ID NO 142, SEQ ID NO 143, SEQ ID NO 144, SEQ ID NO 145, SEQ ID NO 146 and SEQ ID NO 147 or a functionally equivalent variant of any of the foregoing sequences for modulation of the expression of a homologous POI in a host cell.
In another aspect the invention relates to such a use, wherein the yeast promoter sequence has an increased promoter activity relative to the native promoter sequence of the POI.
In another aspect the invention relates to such a use, wherein the yeast promoter sequence has a decreased promoter activity relative to the native promoter sequence of the POI.
In order that the invention described herein may be more fully understood, the following examples are set forth. The examples are for illustrative purposes only and are not to be construed as limiting this invention in any respect. It is further understood that the present invention shall also comprise variations of the expressly disclosed embodiments to an extent as would be contemplated by a person of ordinary skill in the art.
Examples 1 and 2 below illustrate the materials and methods used to investigate the effect of co-expression of different proteins involved in the eukaryotic secretion pathway (secretion helper factors) on the yield of a secreted heterologous protein of interest, i.e. on the secretion of the Fab fragment of the monoclonal anti-HIV1 antibody 2F5 in P. pastoris.
In order to identify genes and their respective proteins which play a potential role during protein production, e.g. in the protein secretory pathway of P. pastoris the gene expression pattern of a P. pastoris strain containing the gene for human trypsinogen 1 was compared before and after induction of heterologous protein production (induction was done by a switch from glycerol to methanol as the sole carbon source), i.e. of trypsinogen production by microarray analysis.
As the genome sequence of P. pastoris has not been published and not many genes are characterized for P. pastoris DNA microarrays of S. cerevisiae were used for heterologous hybridization with P. pastoris cDNA.
The experimental procedure of the microarray hybridisation and the evaluation of the obtained data was carried out as described in Sauer et al. (2004). Further details are found below.
a) Strain:
The expression strain was P. pastoris strain X33 (Invitrogen), a wild type strain which can grow on minimal media without supplements. The selection mechanism was based on the Zeocin™ resistance of the transformation vector. Transformation of the strain was carried out with a plasmid derived from pPICZαB (Invitrogen), containing the gene for human trypsinogen 1 (Hohenblum et al., 2003). pPICZαB utilises the AOX1 promoter of P. pastoris, which promoter is repressed by many carbon sources such as glucose, glycerol or ethanol but induced by the carbon source methanol, and the α-factor leader sequence of S. cerevisiae for product secretion. The selected strain was of the methanol utilisation positive (mut+) phenotype, which means that it is fully capable to metabolise methanol as the sole carbon source.
b) Cell Culture:
Fermentation of P. pastoris
Fed batch fermentations were performed with a MBR mini bioreactor with a final working volume of 2 l, essentially as described by Hohenblum et al. (2003).
The media were as follows:
PTM, Trace Salts Stock Solution Contained Per Litre
6.0 g CuSO4.5H2O, 0.08 g NaI, 3.0 g MnSO4.H2O, 0.2 g Na2MoO4.2H2O, 0.02 g H3BO3, 0.5 g CoCl2, 20.0 g ZnCl2, 65.0 g FeSO4.7H2O, 0.2 g biotin and 5.0 ml H2SO4 (95%-98%). All chemicals for PTM, trace salts stock solution were from Riedel-de Haën, except for biotin (Sigma), and H2SO4 (Merck Eurolab).
Batch Medium Contained Per Litre
23.7 ml H3PO4 (85%), 0.6 g CaSO4.2H2O, 9.5 g K2SO4, 7.8 g MgSO4.7H2O, 2.6 g KOH, 40 g glycerol, 4.4 ml PTM, trace salts stock solution.
Glycerol Fed-Batch Solution Contained Per Litre
632 g glycerol (100%) and 12 ml PTM, trace salts stock solution.
Methanol Fed-Batch Solution Contained Per Litre
988 ml methanol (100%) and 12 ml PTM, trace salts stock solution.
The dissolved oxygen was controlled at DO=30% with the stirrer speed (600-1200 rpm). Aeration rate was 100 l h−1 air, which was supplemented with oxygen (up to 25%) after the begin of the fed batch. The temperature was 25° C., and the pH was controlled with NH3 (25%).
Before starting the fermentation, the pH of 1.2 l batch medium was set to 5.0 with NH3 (25%). The batch phase of approximately 32 h was followed by a 4 h fed batch with glycerol medium (feed rate 15.6 ml h−1), leading to a dry biomass concentration of approximately 40 g l−1. Then, the feed with methanol medium was started with a feed rate of 6.4 ml h−1. Methanol induces the production of the heterologous protein trypsinogen and serves as a carbon source at the same time. The fermentation was terminated 14 h after the methanol feed start. The pH was 5.0 during batch, and kept at 5.0 throughout the fermentation.
Samples were taken at the end of the glycerol fed batch phase (trypsinogen non-expressing cells) and at the end of the methanol fed batch phase (trypsinogen expressing cells), respectively. Cells were centrifuged to separate the cell culture supernatant, then the cell pellets were resuspended in 10× the volume of TRI-reagent (Sigma) and frozen.
c) RNA Isolation:
The samples were thawed on ice and after addition of acid washed glass beads the cells were homogenised in a Ribolyser (Hybaid Ltd.) for 2×20 sec, in between cooling on ice. After addition of chloroform, the samples were centrifuged and the total RNA was precipitated from the aqueous phase adding isopropanol. The pellet was washed 2× with 70% ethanol, dried and re-suspended in RNAse free water. mRNA was isolated using the MicroPoly(A)Purist mRNA purification Kit (Ambion) according to the manufacturers protocol.
d) Synthesis and Labelling of cDNA:
5 μg of mRNA and 0.5 μg of oligodT primer were mixed in 7 μl of water, incubated for 5 min at 70° C. and subsequently at 42° C. for about 3 min. The following components were added to 5 μl of said reaction mixture: 4 μl reaction buffer (5×) for SuperScript II reverse transcriptase (Invitrogen), 2 μl dTTP (2 mM), 2 μl dATP, dGTP, dCTP (5 mM), 2 μl DTT (100 mM), 2.5 μl RNasin (40 U, Promega) and either 2 μl FluoriLink Cy3-dUTP (1 mM) or 2 μl FluoriLink Cy5-dUTP (1 mM, Amersham Biosciences) respectively, and 1 μl SuperScript II reverse transcriptase (200 U, Invitrogen) to result in a total of 19.5 μl. The mixture was incubated for 1 h at 42° C. After addition of further 200 U SuperScript II reverse transcriptase the mixture was incubated for another 1 h at 42° C. 7 μl of 0.5 M NaOH/50 mM EDTA were added and the mixture was incubated at 70° C. for 15 min. The reaction mixture was neutralised by addition of 10 μl Tris-HCl pH 7.5 (1 M). The labelled cDNA was purified with Qiaquick purification columns (Qiagen) according to the manufacturer's protocol.
e) Chip Hybridisation and Set-Up of Microarrays:
The S. cerevisiae cDNA microarrays used for this study were Hyper Gene Yeast Chips from Hitachi Software Engineering Europe AG. According to the manufacturer, about 0.1 to 0.3 ng of PCR amplified cDNA (approximately 200 bp to 8000 bp) were spotted onto a poly-L-lysine coated glass slide and fixed by baking, succinic anhydride blocking and heat denaturation.
Labelled cDNA was resuspended in about 70 μl of 5×SSC/0.05% SDS, heat denatured at 95° C. for 3 min and cooled on ice. SDS crystals appearing were dissolved by short and slight warming and the mixture was gently applied to a Yeast Chip. The spotted area was covered with a cover glass and the chips were placed in an airtight container with a humidified atmosphere at 60° C. for 16 h.
The cover glasses were removed in 2×SSC/0.1% SDS and the chips were washed consecutively for 5-10 min each in 2×SSC/0.1% SDS, 0.5×SSC/0.1% SDS, and 0.2×SSC/0.1% SDS at RT. The chips were centrifuged at 600 rpm for 3 min in order to dry them. The washing conditions were chosen according to the manufacturer's manual.
Each sample (labelled cDNA from trypsinogen non-expressing cells and from trypsinogen expressing cells) was used for hybridisation of two parallel cDNA mircoarrays to test the reproducibility of the signals.
f) Data Acquisition and Statistical Evaluation of Microarray Data:
Images were scanned at a resolution of 50 μm with a G2565AA Microarray scanner (Agilent) and were imported into the GenePix Pro 4.1 (Axon Instruments) microarray analysis software. GenePix Pro 4.1 was used for the quantification of the spot intensities. Each appearing gene spot was averaged. The data set was then imported into GeneSpring 6.1 (Silicon Genetics) for further normalisation and data analysis.
All of the values of each channel on each chip were divided by their respective median for normalisation. Subsequently, the median intensity of all TE spots (spotted with buffer, no DNA) deduced from each value, and all spot values less than the standard deviation of said threshold values were considered to be not significant and were set to the value of the standard deviation. To determine induction or repression of gene activity, the normalised signals on each spot were compared, and all genes showing a signal difference exceeding the threshold (1.5 fold) on both parallel independent microarrays were judged as significantly regulated.
After determination of the the relative expression levels of all measured genes, the genes were ordered by the relative difference of their expression levels, and the 524 with the highest difference were considered for further analysis. As the DNA microarrays used for these experiments were derived from Saccharomyces cerevisiae gene sequences, only putative gene functions for P. pastoris can be assigned by the homology to S. cerevisiae. After ranking the 524 differentially regulated genes based on their putative intracellular localisation and function, and focusing on those being involved in secretion and/or general stress response, out of the 64 potentially interesting genes 15 were selected for further analysis: PDI1, CUP5, SSA4, BMH2, KIN2, KAR2, HAC1, ERO1, SSE1, BFR2, C0G6, SSO2, C0Y1, IMH1 and SEC31.
To generate a vector containing the GAP promoter and the his4 gene as selection marker, the AOX1 promoter of the vector pPIC9 (Invitrogen) was exchanged to the GAP promoter of pGAPZ B (Invitrogen) by restriction digest of both vectors with NotI and Mph1103I and subsequent ligation following a standard protocol. The newly constructed vector is referred to as pGAPHis.
h) Isolation of the Helper Factor Genes from Saccharomyces cerevisiae and Cloning into pGAPHis:
All the genes apart from Had were amplified directly from Saccharomyces cerevisiae genomic DNA by PCR with specific oligonucleotide primers depicted in Table 1. The P. pastoris Kozac sequence (ACG) was inserted directly before the start codon ATG. The non-template coded restrictions sites SacII (XhoI for the gene PDI1) and either PmII or Sfil (EcoR1 for the gene PDI1) were added by using the respective forward and backward primer (see Table 1). After restriction digest of the PCR fragments of correct length (checked by agarose gel separation) with SacII (XhoI for the gene PDI1) and either PmII or Sfil (EcoR1 for the gene PDI1) as shown in Table 1, these fragments were cloned into the pGAPHis vector (also digested with the respective restriction enzymes and treated with alkaline phosphatase). To construct the induced variant of the HAC1 gene of S. cerevisiae, the DNA fragment coding for the first 220 amino acids was combined with the fragment coding for the 18 amino acid exon of the induced Hac1p (Mori et al., 2000) in a two step PCR reaction, and the resulting fragment was ligated into pGAPHis.
All ligated plasmids were transformed into E. coli Top 10F′ (Invitrogen) and plated on Ampicillin containing LB-agar. Restriction enzyme analysis was performed to verify the correct identity of the respective plasmids.
Saccharomyces cerevisiae
The plasmid DNA from E. coli from Example 1 was used to transform P. pastoris strain SMD1168 already containing the expression cassettes for 2F5 Fab under control of the GAP promoter, which strain was pre-selected for a high Fab secretion level. The strain SMD1168 is a P. pastoris his4-defective strain (a pep4 mutant). Selection was based on zeocin resistance for the antibody genes, and histidin auxotrophy for the other genes.
a) Construction of the P. pastoris Strain SMD1168 Secreting the Fab Fragment of the Monoclonal Anti-HIV1 Antibody 2F5:
2F5 antibody fragment sequences for the Fab light and heavy chain were amplified by PCR from pRC/RSV containing the humanized IgG1 mAb as disclosed in Gasser et al., 2006. The restriction sites EcoRI and SacII were used for cloning.
In detail, for the generation of Fab, the entire light chain genes (vL and cL) and the vH and cH1 region of the heavy chain genes were amplified by PCR. The light chain fragment was ligated into a modified version of pGAPZαA, where the AvrII restriction site was changed into NdeI by site directed mutagenesis to allow subsequent linearization of the plasmids containing two cassettes. The heavy chain fragment was inserted into the original version of pGAPZαA, which contains the constitutive P. pastoris glycerol aldehyde phosphate dehydrogenase (GAP) promoter followed by the MFα leader sequence of S. cerevisiae (Invitrogen, Carlsbad, Calif., USA).
Plasmids combining the expression cassettes for both Fab chains on one vector were produced by double digestion of the light chain vector with Bgl II and BamHI, and subsequent insertion into the unique BamHI site of the vector pGAPZαA already containing a single copy of the expression cassette of the heavy chain fragment. Plasmids were then linearized with AvrII prior to electrotransformation into P. pastoris.
All constructed expression cassettes were checked by DNA sequencing with the GAP forw/AOX3′ back primers (Invitrogen).
b) Construction of P. pastoris Strains Co-Expressing 2F5 Fab and a Secretion Helper Factor:
Transformation of P. pastoris strains obtained in step a) was carried out with the plasmids of Example 1, which are linearized in the HIS4 locus. The plasmids were introduced into the cells by electrotransformation. The transformed cells were cultivated on RDB-agar (lacking histidine) for selection of His-prototrophic clones, which contain the expression cassettes for the secretion helper factors.
c) Culturing Transformed P. pastoris Strains in Shake Flask Cultures:
5 ml YP-medium (10 g/l yeast extract, 20 g/l peptone) containing 20 g/l glycerol were inoculated with a single colony of P. pastoris selected from the RDB plates and grown overnight at 28° C. Aliquots of these cultures corresponding to a final OD600 of 0.1 were transferred to 10 ml of main culture medium (per liter: 10 g yeast extract, 10 g peptone, 100 mM potassium phosphate buffer pH 6.0, 13.4 g yeast nitrogen base with ammonium sulfate, 0.4 mg biotin) and incubated for 48 h at 28° C. at vigorous shaking in 100 ml Erlenmeyer flasks. To induce recombinant protein expression, cultures with the GAP promoter were supplemented with 10 g/l glucose. The same amounts of substrate were added repeatedly 4 times every 12 h, before cells were harvested by centrifugation at 2500×g for 5 min at room temperature and prepared for analysis (biomass determination by measuring optical density at 600 nm, ELISA for Fab quantification in the culture supernatant).
d) Evaluation of the Effect of Co-Overexpression of Single Folding Helper Factors by Quantification of 2F5 Fab:
To determine the amount of secreted recombinantly expressed 2F5 Fab, 96 well microtiter plates (MaxiSorb, Nunc, Denmark) were coated with anti-hIgG (Fab specific) overnight at RT (Sigma I-5260;1:1000 in PBS, pH 7.4), before serially diluted supernatants of P. pastoris cultures secreting 2F5 Fab from step c) (starting with a 1:200 dilution in PBS/Tween20 (0.1%)+1% BSA) were applied and incubated for 2 h at RT. A human Fab of normal IgG (Rockland) was used as a standard protein at a starting concentration of 200 ng/ml. After each incubation step the plates were washed four times with PBS containing 0.1% Tween 20 adjusted to pH 7.4. 100 μl of anti-kappa light chain—AP conjugate as secondary antibody (1:1000 in PBS/Tween+1% BSA) were added to each well, and incubated for 1 h at RT. After washing, the plates were stained with pNPP (1 mg/ml p-nitrophenyl phosphate in coating buffer, 0.1 N Na2CO3/NaHCO3; pH 9.6) and read at 405 nm (reference wavelength 620 nm).
Of each of the 15 different secretion helper factor constructs, 16 individual clones were cultivated in shake flask cultures as described in step c) and compared to 16 individual clones of the control strain, that was transformed with the pGAPHis vector lacking a gene. The 2F5 Fab productivity (μg Fab/biomass) was determined for all the analyzed cultures (first screening round). The 6 best clones of each of the constructs were then re-analyzed using the same system in a second screening round (for results see Table 2).
Table 2 shows the mean relative productivity of the 6 best clones of each tested secretion helper factor construct including the control construct (empty pGAPHis vector). The table shows the mean improvement factor of 2F5 Fab secretion of two screening rounds obtained by co-overexpression of the secretion helper factors relative to the control cultures. The secretion helper factors which are known in the art improving the secretion of heterologous proteins when co-overexpressed (PDI1, KAR2, HAC1, ERO1 and SSO2) are included in Table 2 for comparative reasons.
As can be seen from Table 2, that the secretion of the heterologous protein, i.e. the secretion of 2F5 Fab was increased for most of the analyzed secretion helper factors, in a range between 1.2 and 1.7-fold. Apart from the secretion helper factors already known in the art having a positive effect on the secretion of a heterologous protein co-overexpression of the secretion helper factors CUP5, SSA4, BMH2, KIN2, SSE1 and BFR2 showed a highly significant increase in the amount of secreted heterologous protein and co-overexpression of C0G6, C0Y1 and IMH1 showed a significant increase in the amount of secreted heterologous protein.
Sequence information for the secretion helper factors PDI1, KAR2, HAC1, ERO1 and SSO2 is disclosed in the prior art.
The nucleotide sequences of the secretion helper factors which are not yet known in the art improving the secretion of heterologous proteins when co-overexpressed are shown in Table 3 below.
S. cerevisiae BMH2 (SEQ ID NO 32)
S. cerevisiae BFR2 (SEQ ID NO 33)
S. cerevisiae COG6 (SEQ ID NO 34)
S. cerevisiae COY1 (SEQ ID NO 35)
S. cerevisiae CUP5 (SEQ ID NO 36)
S. cerevisiae IMH1 (SEQ ID NO 37)
S. cerevisiae KIN2 (SEQ ID NO 38)
S. cerevisiae SEC31 (SEQ ID NO 39)
S. cerevisiae SSA4 (SEQ ID NO 40)
S. cerevisiae SSE1 (SEQ ID NO 41)
By screening a P. pastoris genome database (ERGO™, IG-66, Integrated Genomics) with the nucleotide sequences of the secretion helper factors isolated from Saccharomyces cerevisiae (SEQ ID NO 32 to SEQ ID NO 41) homologous nucleotide sequences in Pichia pastoris have been identified and are shown in Table 4 below.
Pichia pastoris (IG-66)
Pichia pastoris (IG-66)
Pichia pastoris (IG-66)
Pichia pastoris (IG-66)
Pichia pastoris (IG-66)
Pichia pastoris(IG-66)
Pichia pastoris (IG-66)
For construction of the novel vector system pPuzzle a 2884 bp fragment carrying an origin of replication and a selection marker for E. coli (AmpR cassette) was amplified from a common used cloning vector pBR322 (Fermentas Life Science, Germany, #SD0041 pBR322 DNA) by PCR. Two non-template coded NotI restrictions sites were added by using the forward primer pBR322_FOR_NotI and the backward primer pBR322_BACK_NotI. This PCR fragment was used as a shuttle supplying a temporary origin of replication and a selection marker for amplifying an artificial multiple cloning site in E. coli. A 244 bp synthetic DNA fragment (synthesised and subcloned in the EcoRV site of the pUC57 plasmid by GeneScript Corp. Piscataway, N.J. 08854 USA) was cut with NotI and ligated with the NotI and alkaline phosphatase treated shuttle fragment and amplified in E. coli. The resulting product was called pBR322½artMCS. To generate pBR322½artMCS_ORI, a 670 bp fragment carrying the origin of replication from a commercial available cloning vector pUC19 (Fermentas Life Science Germany; #SD0061 pUC19 DNA; bases 812-1481) was amplified by PCR using the forward primer pUC19ORI #1-SacI and the backward primer pUC19ORI #2-SacI and cloned in the unique SacI site of pBR322½artMCS.
To generate the vector backbone of pPuzzle (see
In a further cloning step the transcription terminator of the cytochrome c gene from S. cerevisiae (a 276 bp fragment of the 3′ region of the Cytochrome c, isoform 1 CYC1 gene from S. cerevisiae chromosome X bases 526663-526937) was amplified by PCR (forward primer cyc1TT_new_FOR_BamH1 and reverse primer cyc1TT #2-AgeI) for genomic DNA and inserted into the BamHI and AgeI (alkaline phosphatase treated) site of pBR322½artMCS_ORI resulting in a vector called pBR322½artMCS_ORI_cyc1TT.
The zeocin selection marker for E. coli and P. pastoris consists of the ORF of the Sh ble gene from Streptoalloteichus hindustanus under the control of the TEF1 (translational elongation factor 1) promoter from S. cerevisiae and an artificial E. coli promoter sequence EM7. The Sh ble gene is flanked by a transcription terminator of the cytochrome c (CYC1) gene from S. cerevisiae. The TEF1 promoter (5′ promoter region of TEF1 alpha of S. cerevisiae chromosome XVI bases 700170-700578) was amplified by PCR from S. cerevisiae genomic DNA using the forward primer zeoR_neu—#1_kpn1 (adding a non-template coded Kpn I site) and the reverse primer TEF1_back:—#1. An artificial E. coli EM7 promoter sequence and an NcoI restriction site were added to the 3′ end of the TEF1 promoter by primer extension using the forward primer zeoR_neu—#1_kpn1 and the reverse primer TEF1_back:—#2NocI. The resulting PCR fragment was treated with NcoI and fused to the NcoI site of the 5′ end of the Sh ble ORF. The Sh ble ORF was amplified by PCR using the forward primer Sh ble_FOR—#1_NcoI (adding a non-template coded NcoI site) and the reverse primer Sh ble_back—#2_AatI (adding a non-template coded AatI site) from a pUT737 plasmid (Cayla Toulouse, France pUT737 catalog # VECT 7371). The product of this fusion was used as a template for PCR (forward primer zeoR_neu—#1_kpn1 and reverse primer Sh ble_back—#2_AatI) resulting in a 893 bp fragment.
The transcription terminator of the cytochrome c (CYC1) gene from S. cerevisiae (Cytochrome c, isoform 1 gene from S. cerevisiae chromosome X bases 526663-526937) was amplified by PCR from genomic DNA using the forward primer cyc1TT_FOR—#1_aat1 (adding a non-template coded AatI site) and the reverse primer cyc1TT_neu_back_Kpn1 (adding a non-template coded Kpn I site), treated by AatI and fused to the AatI treated 893 by hybrid of TEF1 promoter and Sh ble ORF. The zeocin cassette of the final size of 1170 by was amplified by PCR using the forward primer zeoR_neu—#1_kpn1 and the reverse primer cyc1TT_neu_back_KpnI. The PCR product was purified by agarose gel electrophoresis and the fragment of the correct size was used as a template for a second PCR. The second PCR fragment was treaded by KpnI cloned in the KpnI sites of pBR322½artMCS_ORI_cyc1TT vector resulting in a vector called pBR322½artMCS_ORI_cyc1TT_zeoR.
For integration of the pPuzzle vector system in the genome of P. pastoris it was decided to use a target sequence in the 3′ area of the AOX1 gene of P. pastoris. Two 400 bp fragment called AOXTTpart1 and AOXTTpart2 (sequences from Integrated-Genomics, Chicago USA, ERGO database, P. pastoris IG66 Contig 1471 bases 52189-52588 and 52589-52979) were amplified by PCR from genomic DNA of P. pastoris. By using the forward primer 5_AOX TT #1 HindIII/NotI and the reverse primer 5_AOX TT #2 AscI/BamHI non-template coded HindIII and NotI restriction sites were added to the 5′ side and AscI and BamHI restriction sites to the 3′ side of the fragment AOXTTpart1. For adding a 5′ BamHI site, a 3′ NotI site and a 5′ EcoRI site to the fragment AOXTTpart2 the forward primer 3_AOX TT #3 BamHI and the reverse primer 3_AOX TT #4aNotI/EcoRI were used. For assembling AOXTTpart1 and AOXTTpart2 according to their orientation in the genome the fragment AOXTTpart1 was subcloned in the EcoRV site of pSTBlue-1 using the Novogen Perfectly Blunt® Cloning Kits, pSTBlue-1 (Merck Biosciences, Germany). A 500 bp fragment was amplified by PCR using the forward primer T7 and the reverse primer 5_AOX TT #2 AscI/BamHI. This fragment was cut by BamHI and ligated with the BamHI treated AOXTTpart2 fragment. The ligation mixture was used directly as a template for PCR with T7 as forward primer and 3_AOX TT #4NotI/EcoRI as reverse primer. The fragment of the correct size (˜900 bp) was purified by agarose gel electrophoresis and used as a template for a second PCR with 5_AOX TT #1 HindIII/NotI and 3_AOX TT #4NotI/EcoRI. The presents of the AscI restriction site in the middle of the PCR fragment was checked by AscI endonuclease digest of the resulting 800 bp fragment called AOXTTpart1+2 To get ride of the pBR322 shuttle in the pBR322½artMCS_ORI_cyc1TT_zeoR vector it was cut by NotI and the 2270 bp vector backbone of the pPuzzle_zeoR was separated from the 2884 by pBR322 shuttle fragment by agarose gel electrophoresis treated with alkaline phospatase and ligated with the NotI treated PCR fragment AOXTTpart1+2. The resulting vector was called pPuzzle_zeoR_AOXTT.
Starting from the pPuzzle_zeoR_AOXTT vector backbone an enhanced green fluorescent protein (eGFP) gene was inserted into the MCS using the restriction sites SbfI and SfII. The eGFP gene (718 bp) was amplified by PCR e.g. from the vector pcDNA™ 6.2n-EmGFP-DEST (Invitrogen Austria). Two non-template coded restriction sites Sbfl and Sfil were attached by primer extension using the forward primer eGFP#1AarI/SbfI and the reverse primer eGFP#2Sfil. The Sbfl and Sfil treated PCR product of eGFP was inserted into the alkaline phosphatase treated Sbfl and Sfil sites of pPuzzle_zeoR_AOXTT. The resulting vector was called pPuzzle_zeoR_eGFP.
In Table 5 the PCR primer sequences used in the cloning procedures of Example 3 and 4 are summarized.
a) Amplification and Cloning Strategy of Promoter Sequences from P. pastoris:
To identify novel promoter sequences for use in a strain of the genus Komagataella for recombinant expression of a heterologous protein the normalized signals of all measured genes of trypsinogen producing and non-producing cells, respectively, obtained from the DNA microarray hybridisation described in Example 1 were ordered by their relative expression levels. Further the relative expression level of each measured gene was compared between trypsinogen producing and non-producing cells. From these data the 23 genes with the highest expression level in trypsinogen producing and non-producing cells were considered for further analysis. A listing of the genes selected for further analysis is found in Table 6. Further, only such genes for which genomic sequence data were available have been included in the selection. The promoter sequences of these 23 potential interesting genes (up to 1000 bp of the 5′-non coding region of the respective genes) were identified using a P. pastoris genome database (ERGO™, IG-66, Integrated Genomics) and amplified from P. pastoris by PCR. Additionally, the well known promoter sequences of AOX and of GAP were amplified via PCR from P. pastoris for comparative reasons (primer and primer sequences see Tables 6 and 7). In 25 final cloning steps the 25 promoters (including the two control promoter sequences) obtained from P. pastoris were inserted upstream of the start codon of the eGFP gene using the ApaI and the SbfI restriction site of the multiple cloning site of the vector pPuzzle_ZeoR_eGFP or in case of the promoter of FET3pre using the ApaI and the AarI restriction site (see Table 6).
b) Analysis of Promoter Activity in P. pastoris:
To test the properties and the activities of the different promoters, the 25 vectors prepared in step a) were digested with AscI and used for transforming P. pastoris via electroporation (using a standard transformation protocol for P. pastoris). Transformed P. pastoris cells were grown on YPD-medium (1% yeast extract, 2% peptone, 2% glucose) containing 100 mg/l zeocin. From each transformation 10 single colonies were picked on a YPD-zeocin agar plate and used to inoculate a 10 ml liquid culture. The eGFP expression was measured either when the cells were cultured on glucose as the single carbon source or on glycerol/methanol as the single carbon source. The amount of recombinant eGFP was quantified using flow cytometer analysis and the relative eGFP expression levels were calculated as shown below.
A untransformed P. pastoris wild type strain and P. pastoris transformed with a pPuzzle_zeoR_PAOX—IacZ_AOXTT vector were used as negative controls for eGFP expression.
Calculation of Relative eGFP Expression Levels:
FL1 (fluorescence channel 1): GeoMean of 10000 events
FSC (forward scatter): GeoMean of 10000 events
rfu: relative fluorescent units
rel.Exp[%]: relative eGFP expression normalized on GAP promoter
eGFP Expression on Glucose as Single Carbon Source:
Shake flask cultures in 100 ml Erlenmeyer flasks on 10 ml medium (containing 1% yeast extract, 2% peptone, 100 mM potassium phosphate buffer pH 6.0, 1.34% yeast nitrogen base with ammonium sulfate, 4×10−5% biotin, 2% glucose) were inoculated with a single colony from a YPD-zeocin agar master-plate and cultivated at 28° C. and 180 rpm. Glucose was added to a final concentration of 0.5% every 12 h. Samples were taken 16 h, 40 h and 67 h after inoculation, diluted with sterile PBS to OD600 of approximately 0.1-0.2 and analysed on GFP expression by flow cytometer analysis (BD Facs Calibur). The results are shown in Table 8.
eGFP Expression on Glycerol/Methanol as Single Carbon Source:
Shake flask cultures in 100 ml Erlenmeyer flasks on 10 ml YPG-medium (containing 1% yeast extract, 2% peptone, 1% glycerol) were inoculated with a single colony from a YPD-zeocin agar master-plate and cultivated at 28° C. and 180 rpm. After 22 h cells were harvested by centrifugation (1500×g 5 min.) and resuspended in 10 ml MM-medium (100 mM potassium phosphate buffer pH 6.0, 1.34% yeast nitrogen base with ammonium sulfate, 4×10−5% biotin, 0.5% methanol). Every 12 h methanol was added to a final concentration of 0.5%. Samples were taken 22 h, 42 h, 64 h and 90 h after inoculation, diluted with sterile PBS to OD600of approximately 0.1-0.2 and analysed on GFP expression by flow cytometer analysis. The results are shown in Table 8.
From Table 8 can be seen that there are promoters with different transcription levels on different carbon sources in a range from 0% to 1600% available (relative to the eGFP expression under the well known GAP promoter, which was set as 100%). Real unexpected were the high eGFP expression levels obtained from the vector pPuzzle_zeoR_PPET9—eGFP_AOXTT (see
Additional interesting promoter sequences with different transcription levels on different carbon sources in a range from 0% to 135% (relative to the eGFP expression under the GAP promoter) are shown in Table 10.
pastoris (IG-66), Contig1891_9813_11291
pastoris (IG-66), Contig1212_2407_3064
pastoris (IG-66), Contig1951_9775_11895
pastoris (IG-66), Contig1900_3677_1641
pastoris (IG-66), Contig 1815_6838_7665
pastoris (IG-66), Contig1185_5060_8191
pastoris (IG-66), Contig1847_1328_537
pastoris (IG-66), Contig1605_1365_1817
pastoris (IG-66), Contig2026_5140_3168
pastoris (IG-66), Contig2116_14564_15586
pastoris (IG-66), Contig1564_2883_2026
pastoris (IG-66), Contig1945_6169_6092,
pastoris (IG-66), Contig1903_1777_3087
pastoris (IG-66), Contig1695_3185_3751
pastoris (IG-66), Contig1965_1563_2216
pastoris (IG-66), Contig1351_2056_46
pastoris (IG-66), Contig2109_25041_25727
pastoris (IG-66), Contig1928_10057_11658
pastoris (IG-66), Contig1928_8701_7604
pastoris (IG-66), Contig2096_16323_14941
pastoris (IG-66), Contig 1417_1294_2871
pastoris (IG-66), Contig2108_1359_14
Number | Date | Country | Kind |
---|---|---|---|
07008051.0 | Apr 2007 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/EP2008/003076 | 4/17/2008 | WO | 00 | 10/8/2009 |