The present invention relates to the isolation of polynucleotides comprising regulatory nucleotide sequences, vectors, and expression cassettes containing such regulatory sequences, and host cells comprising the vectors and/or expression cassettes. In particular, the invention relates to 5′ upstream regulatory nucleotide sequences within the promoter and 3′ downstream regulatory nucleotide sequences within the terminator regions of a malate synthase gene, vectors and expression cassettes containing such regulatory sequences, and host cells comprising the vectors and/or expression cassettes.
The methylotrophic yeast Pichia pastoris has been used extensively for re-combinant protein expression. Most Pichia expression vectors are quite similar and are designed to use the promoter from the alcohol oxidase gene (AOX1). This methanol-induced promoter is efficient and tightly regulated, and has been used successfully to induce the expression of heterologous proteins for several years (Cereghino and Cregg, 2000).
Despite its advantages, the AOXI promoter is not ideal for scaled-up expression of proteins, as the amount of methanol required poses engineering and safety challenges—e.g. the volatility and explosion risks associated with large quantities of methanol are not trivial for any laboratory. Furthermore, the addition of methanol not only induces the expression of the desired recombinant protein, but also the protein encoded by the AOX1 gene. This results in high levels of alcohol oxidase which creates a metabolic burden, and can interfere with cell function and analysis (Zupan et al., 2004). Thus, it is desirable to have efficient and tightly regulated promoters for the expression of re-combinant proteins in P. pastoris.
Isocitrate lyase and malate synthase are the key enzymes of the glyoxylate cycle, and are required for the proliferation of yeast cells on C2 substrates such as ethanol or acetate. Specifically, one of the malate synthase genes, MLS1, is highly transcribed on nonfermentable carbon sources and is essential for cell growth on these media. The MLS1 promoter when fused to a reporter protein in Saccaromyces cerevisiae, displays basal level expression in the presence of 2% glucose, and is derepressed more than 100-fold under conditions of sugar limitations (Caspary et al., 1997). Furthermore, it is possible to decouple the carbon source metabolism from induction of the recombinant protein, as a media of both glucose and acetate will provide the preferred glucose for the host cell and the required acetate for heterologous protein induction.
What is desired, therefore, is a promoter for regulated expression of recombinant heterologous proteins in the yeast species, Pichia.
The present invention provides an isolated polynucleotide comprising a regulatory region containing a nucleotide sequence of the non-coding region of the MLS1 gene in Pichia pastoris. The isolated polynucleotide of the present invention comprises or consists of a nucleic acid sequence selected from the group consisting of: (a) SEQ ID NO: 1; (b) the complement of SEQ ID NO: 1;
(c) a nucleic acid sequence at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99% or at least 99.9% (similarity) (idenity) to SEQ ID NO: 1; and (d) a nucleic acid sequence that hybridizes under stringent conditions to SEQ ID NO: 1.
The present invention provides an isolated polynucleotide comprising a regulatory region containing a nucleotide sequence of the non-coding region of the MLS1 gene in Pichia pastoris. The isolated polynucleotide of the present invention comprises or consists of a nucleic acid sequence selected from the group consisting of: (a) SEQ ID NO:4; (b) the complement of SEQ ID NO: 4;
(c) a nucleic acid sequence at least 60%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99% or at least 99.9% (similarity) (identity) to SEQ ID NO: 4; and (d) a nucleic acid sequence that hybridizes under stringent conditions to SEQ ID NO: 4.
The present invention further provides a method for producing heterologous proteins in a host from the genus of Pichia comprising the step of
(a) introducing into the host the polynucleotide sequence of claim 1 or 2; and
(b) culturing the host to produce the heterologous protein.
SEQ ID NO: 1 P. pastoris MLS1 promoter
SEQ ID NO: 2 PpMLS1-P/UP primer
SEQ ID NO: 3 PpMLS1-P/LP primer:
SEQ ID NO: 4 P. pastoris MLS1 transcriptional terminator
SEQ ID NO: 5 PpMLS1-TT/UP primer
SEQ ID NO: 6 PpMLS1-TT/LP primer.
SEQ ID NO: 7 multiple cloning site (MCS)
Terms and abbreviations as used herein are defined below:
As used herein, ‘isolated polynucleotide’ means a polynucleotide that is free of one or both of the nucleotide sequences which flank the polynucleotide in the naturally-occurring genome of the organism from which the polynucleotide is derived. The term includes, for example, a polynucleotide or variant thereof that is incorporated in a vector or expression cassette; into an autonomously replicating plasmid or virus; into the genomic DNA of a prokaryote or eukaryote; or that exists as a separate molecule independent of other polynucleotides. It also includes a recombinant chimeric polynucleotide that is part of a hybrid polynucleotide, for example one encoding a polypeptide sequence. ‘Isolated polynucleotide’ does not necessarily mean a polynucleotide that has been physical removed from any flanking DNA sequences.
As used herein, ‘polynucleotide’ and ‘oligonucleotide’ are used interchangeably and refer to a polymeric (2 or more monomers) form of nucleotides of any length, either ribonucleotides or deoxyribonucleotides. Although nucleotides are usually joined by phosphodiester linkages, the term also includes polymeric nucleotides containing neutral amide backbone linkages composed of aminoethyl glycine units. This term refers only to the primary structure of the molecule. Thus, this term includes double- and single-stranded DNA and RNA. It also includes known types of modifications, for example, labels, methylation, ‘caps’, substitution of one or more of the naturally occurring nucleotides with an analog, internucleotide modifications such as, for example, those with uncharged linkages (e.g., methyl phosphonates, phosphotriesters, phosphoamidates, carbamates, etc.), those containing pendant moieties, such as, for example, proteins (including for e.g., nucleases, toxins, antibodies, signal peptides, poly-L-lysine, etc.), those with intercalators (e.g., acridine, psoralen, etc.), those containing chelators (e.g., metals, radioactive metals, boron, oxidative metals, etc.), those containing alkylators, those with modified linkages (e.g. alpha anomeric nucleic acids, etc.), as well as unmodified forms of the polynucleotide. Polynucleotides include both sense and antisense strands.
As used herein, ‘sequence’ means the linear order in which monomers occur in a polymer, for example, the order of amino acids in a polypeptide or the order of nucleotides in a polynucleotide.
As used herein, ‘variant’ of a reference nucleic acid sequence encompasses nucleic acid sequences that are not necessarily identical in sequence but that are homologous to one another within one or more particular segments. The term ‘degenerate oligonucleotide’ or ‘degenerate primer’ is used to signify an oligonucleotide capable of hybridizing with target nucleic acid sequences that are not necessarily identical in sequence but that are homologous to one another within one or more particular segments.
As used herein, the term ‘complementary’ or ‘complement’ refer to the pairing of bases, purines and pyrimidines that associate through hydrogen bonding in double stranded nucleic acid. The following base pairs are complementary: guanine and cytosine; adenine and thymidine; and adenine and uracil. The terms as used herein include complete and partial complementarity.
As used herein, the term ‘hybridization’ refers to a process in which a strand of nucleic acid joins with a complementary strand through base pairing. The conditions employed in the hybridization of two non-identical, but very similar, complementary nucleic acids vary with the degree of complementarity of the two strands and the length of the strands. Thus the term contemplates partial as well as complete hybridization. Such techniques and conditions are well known to practitioners in this field.
As used herein, ‘expression cassette’ means a genetic module comprising a gene and the regulatory regions necessary for its expression, which may be incorporated into a vector.
As used herein, ‘operably linked’ means any linkage, irrespective of orientation or distance, between a regulatory sequence and coding sequence, where the linkage permits the regulatory sequence to control expression of the coding sequence.
As used herein, ‘heterologous DNA coding sequence’ means any coding sequence other than the one that naturally encodes the malate synthase protein, or any homolog of the malate synthase protein.
As used herein, ‘regulatory sequence’ or ‘regulatory region’ as used in reference to a specific gene, refers to the coding or non-coding nucleotide sequences within that gene that are necessary or sufficient to provide for the regulated expression of the coding region of a gene. Thus, the term encompasses promoter sequences, regulatory protein binding sites, upstream activator sequences and the like. Specific nucleotides within a regulatory region may serve multiple functions. For example, a specific nucleotide may be part of a promoter and participate in the binding of a transcriptional activator protein.
As used herein, ‘coding region’ refers to that portion of a gene which codes for a protein. The term ‘non-coding region’ refers to that portion of a gene that is not a coding region.
Isolation and Induction of the Pichia pastoris MLS1 Promoter
The Pichia expression vectors presently available are designed to use the promoter from the alcohol oxidase gene (AOX1). In addition to the large methanol requirement for AOX1 induction, which poses engineering and safety challenges, the shift from glucose to methanol not only creates a metabolic burden from the induction of large quantities of the Aox1 protein, but this shift in carbon source often results in cell lysis.
The MLS1 promoter region has not been previously cloned from the P. pastoris genome and has not previously been shown to promote protein expression of heterologous proteins in P. pastoris. The isolation of this promoter and terminator sequence as described herein, allows for regulation of heterologous proteins in Pichia with the possibility of decoupling the carbon source metabolism from the induction of the heterologous protein, as a media of both glucose and acetate will provide the preferred glucose for the host cell and the required acetate for heterologous induction.
The present invention provides regulatory nucleotide sequences that are part of the non-coding region of the MLS1 gene in Pichia pastoris. In one embodiment, the 5′ regulatory region of the MLS1 gene, is an inducible promoter, and preferably is induced by ethanol or acetate, and can also be derepressed under glucose starvation conditions.
In one aspect of the present invention, the MLS1 promoter is cloned from a wild type P. pastoris strain, NRRLY-11430 (American Type Culture Collection, ATCC) as a 925 nucleotide fragment corresponding to the 5′ regulatory non-coding region region (SEQ ID NO: 1) directly upstream of the MLS1 gene using a 5′ primer (PpMLS1-P/UP) (SEQ ID NO: 2) and a 3′ primer (PpMLS1-P/LP) (SEQ ID NO: 3) (Example 1). The isolated fragment is subsequently sequenced using the same cloning primers confirming the MLS1 promoter sequence.
The present invention encompasses an isolated polynucleotide comprising a regulatory region containing SEQ ID NO: 1, a variant of SEQ ID NO: 1, the complement of SEQ ID NO: 1, or a variant of the complement of SEQ ID NO: 1. In one embodiment, the polynucleotide (or a variant, or the complement of either the polynucleotide or the variant) is less than about 1000 nucleotides long. In another embodiment, the polynucleotide is between about 4 nucleotides to about 925 nucleotides. More preferably, the polynucleotide is 925 nucleotides as shown in SEQ ID NO: 1.
In one embodiment of the present invention a nucleic acid molecule comprising or consisting of a sequence which is a variant of the P. pastoris MLS1 promoter has at least 70% identity to SEQ ID NO: 1. The nucleic acid sequence can preferably have at least 75% or 80% identity to the wild-type promoter. Even more preferably, the nucleic acid sequence can have 85%, 90%, 95%, 98%, 99%, 99.9% or even higher identity to the SEQ ID NO: 1.
In a preferred embodiment, the MLS1 5′ regulatory region disclosed herein, is approximately a 925 nucleotide fragment that can be operably linked to heterologous DNA coding sequences encoding at least one polypeptide. Suitable heterologous DNA coding sequences encoding at least one polypeptide which could be operably linked with the MLS1 5′ regulatory region for example, include but are not limited to, the kringle 3 (K3) domain of human plasminogen (Example 2). Any heterologous DNA coding sequence used with the present invention preferably contains a 5′ ATG start codon and a 3′ stop codon.
Additionally, a vector that contains a DNA fragment derived from the 5′ regulatory region of the MLS1 gene of P. pastoris, operably linked to the human K3 domain, is provided. The DNA fragment, corresponding to the promoter region of the MLS1 gene can regulate expression of the K3 domain in the host strain. An example of the K3 domain regulated by the MLS1 promoter in P. pastoris is shown in
In another aspect of the present invention, the combination of the MLS1 promoter operably linked to a heterologous DNA coding sequence may be inserted into a suitable vector. Numerous useful vectors have been described for yeast transformation and are known to those skilled in the art. The vectors should contain necessary elements which render the vector capable of growth amplification and rapid propagation in bacteria or yeast (Higgins and Cregg, 1998, Pichia Protocols, Humana Press).
The heterologous DNA sequences that can be operably linked to the MLS1 5′ regulatory region include, but are not limited to, one of the following: erythropoietin, cytokines such as interferon-a, interferon-b, interferon-g, interferon-w, and granulocyte-CSF, GM-CSF, coagulation factors such as factor VIII, factor IX, and human protein C, antithrombin III, thrombin, soluble IgE receptor a-chain, IgG (e.g. a-CD20, a-CD33, a-TNF a, a-EGPR), IgG fragments, IgG fusions, IgM, interleukins, urokinase, chymase, and urea trypsin inhibitor, IGF-binding protein, epidermal growth factor, growth hormone-releasing factor, annexin V fusion protein, angiostatin, vascular endothelial growth factor-2, myeloid progenitor inhibitory factor-1, osteo-protegerin, a-1-antitrypsin, a-feto proteins, DNase II, kringle 3 of human plasminogen, glucocerebrosidase, TNF binding protein 1, follicle stimulating hormone, cytotoxic T lymphocyte associated antigen 4-Ig, transmembrane activator and calcium modulator and cyclophilin ligand, soluble TNF receptor Fc fusion, glucagon like protein 1, IL-2 receptor agonist, and the yeast alpha mating secretion domain alone, or fused to any heterologous sequence (Larouche, et al. 1994, Bio/Technology, 12: 1119-1124).
In another embodiment, suitable host cells for the induction of the MLS1 5′ regulatory region include yeast preferably from the genus Pichia. Yeasts belonging to the genus Pichia according to the present invention include for example, Pichia pastoris, Pichia finlandica, Pichia trehalophila, Pichia koclamae, Pichia membranaefaciens, Pichia methanolica, Pichia minuta (Ogatae minuta, Pichia lindneri), Pichia opuntiae, Pichia thermotolerans, Pichi salictaria, Pichia guercum, Pichia pijperi, Pichia stiptis, Pichia sp., and other yeasts, but not limited thereto. In yet another embodiment, suitable host cells for the induction of the Pichia MLS1 5′regulatory region include yeast such as Saccharomyces, Hansenula, Candida and Pichia. The introduction of the promoter region into the host strains by a compatible vector can be accomplished by suitable transformation techniques known to those skilled in the art. For the transformation of the pBK396 vector carrying the MLS1 promoter and the K3 domain, see Example 2.
Additionally, the P. pastoris MLS1 3′ regulatory region encoding a transcriptional terminator can be used in the place of the AOX1 transcriptional terminator in pBK396. This MLS1 transcriptional terminator (SEQ ID NO: 4) was cloned from a wild type P. pastoris strain, NRRLY-11430 (American Type Culture Collection, ATCC) using a 5′ primer (PpMLS1-TT/UP) (SEQ ID NO: 5) and a 3′ primer (PpMLS1-TT/LP) (SEQ ID NO: 6) (Example 2). This terminator fragment was amplified by PCR and sublconed into a pCR2.1 vector. A BamHI/NotI terminator fragment was then subcloned from this vector into the BamHI/NotI sites in pBK396, resulting in pBK398 (
The following are examples which illustrate the compositions and methods of this invention. These examples should not be construed as limiting—the examples are included for the purposes of illustration only.
The MLS1 5′ regulatory promoter region was cloned from the wild type Pichia pastoris strain NRRLY-11430 (ATCC) using a 5′ PpMLS1-P/UP primer 5′ AGATCTTCCTCACAGGTAAGGTAGAATTACC 3′ (SEQ ID NO: 2) and a 3′ PpMLS1-P/LP primer 5′ GAATTCTTTTGTGATAAAGCGGTAA-ATCTGG 3′ (SEQ ID NO: 3). This 925 bp DNA fragment was subsequently sequenced using the same cloning primers confirming the correct MLS1 promoter sequence (SEQ ID NO: 1). The MLS1 promoter was then subcloned into an expression vector as detailed below, resulting in pBK396.
An inventor-designed multiple cloning site (MCS) (SEQ ID NO: 7) was subcloned into the AatII/AflIII sites of the pUC19 vector (
The human plasminogen kringle 3 DNA sequence with 5′ BstBI and 3′ AccI sites filled in to provide blunt ends and was subcloned into the blunt-ended AfeI site of pBK381 producing pBK396 (
Culturing Conditions for P. pastoris BKY285 and BKY285.1
After incubation on YPD medium containing 100 μg/ml zeocin, colonies were restreaked. Zeocin-resistant clones were selected and grown in buffered dextrose-complex medium (BMDY) media (1% yeast extract, 2% peptone, 100 mM potassium phosphate buffer (pH 6.0), 1.34% yeast nitrogen base, 4×10−5% biotin, and 2% glucose) for two days and then harvested and resuspended in buffered acetate induction media—BMAcY with 0.5% NaOAc for 1 day.
Induced and uninduced cultures of Kringle 3 protein in strain BKY285 were analyzed by SDS-PAGE (
P. pastoris MLS1 promoter
P. pastoris MLS1 transcriptional terminator
This application claims priority to U.S. provisional application Ser. No. 60/590,756 filed on Jul. 22, 2004, which is incorporated by reference herein in its entirety.
| Filing Document | Filing Date | Country | Kind | 371c Date |
|---|---|---|---|---|
| PCT/IB05/52479 | 7/22/2005 | WO | 00 | 6/4/2008 |
| Number | Date | Country | |
|---|---|---|---|
| 60590756 | Jul 2004 | US |