The present invention relates to a process for the production of a protease of interest in a plant or in plant cells and protease obtained thereby. Further, the invention relates to a nucleotide sequence and a vector used for the process of the invention and to a transgenic plant or plant cells containing the nucleotide sequence or vector. Further, the invention relates to the use of a SUMO protein or a derivative thereof for expressing a protease of interest in a plant or in plant cells. vectors for this process and to plants or plant cells transformed therewith.
Recombinant protein production in plant systems has been very successful for many different products, covering proteins with industrial applications, food and feed additives, animal health products and human pharmaceuticals, such as antigens and immune response proteins.
There are many comprehensive reviews describing the field (Hood & Jilka, 1999, Curr. Opin. Biotech., 10:382-386; Doran, 2000, Curr. Opin. Biotech., 11:199-204; Daniell, et al., 2001 Trends Plant Sci., 6:219-226; Larrick & Thomas, 2001, Curr. Opin. Biotech., 12:411-418; Klimyuk et al., 2005, in: Modern Biopharmaceuticals, ed. J. Knaeblein, WILEY-VCH, Weinheim, 893-917; Gleba et al., 2007, Curr Opin. Biotechnol., 18, 134-141). Plants have been considered as a low-cost production system for proteins, that is significantly cheaper in comparison with bacterial, yeast, insect and mammalian cell-based production systems. The available data in this regard confirm the above said.
However, a challenge still exists when a commercially applicable expression level of cytotoxic proteins such as proteases has to be achieved. Proteases are used in many different commercial applications, including pharmaceutical and laboratory uses. The sources of proteases are usually either animal tissues (e.g. bovine and porcine pancreas for trypsin and chymotrypsin production) or bacteria (e.g. strains of Bacillus for subtilisin and thermolysin production). In the case of animal tissue as the source of proteases, potential cross-contamination by undesirable components (e.g. prions or other infectious agents) produced by the animal cells must be taken care of. Bacterial production of many proteases is rather restricted due to their low yield that in many cases is well below commercially viable levels. Kilogram quantities of proteases are often required for industrial scale applications.
Production of chymotrypsin (US2006015971) and trypsin (US6087558) in plants (corn grain) was previously described. This method provides a plant source of recombinant proteases with several of the advantages of a plant production host. In a prior art plant expression system for trypsin (Woodard et al., 2003, Biotechnol. Appl. Biochem., 38 123-130), trypsin was targeted in the zymogen form to the cell wall in corn seeds using an embryo-preferred promoter in transgenic maize plants. Several other promoter or subcellular target sites gave inferior expression yields. Seeds were considered ideal for trypsin expression due to their content of trypsin inhibitor that may minimize detrimental effects of trypsin on the host plant.
However, it follows from the published data (Woodard et al., 2003, Biotechnol. Appl. Biochem., 38 123-130) that the highest expression level of trypsin obtained in corn seeds is rather low (ca. 58 mg/kg of corn grain), which inevitably affects the market cost of recombinant trypsin. Indeed, the cost of plant-derived recombinant trypsin is significantly higher in comparison with that from traditional sources. This is not surprising as the cost of recombinant protein production (including downstream processing) is reversely dependent on the expression level of the protein. For example, extraction of beta-glucuronidase (GUS) from transgenic corn seeds accounts for 94% of the production cost (Evangelista et al., 1998, Biotechnol. Prog., 14:607-614). The calculations were done for transgenic corn containing 0.015% of recombinant GUS. It was stated that increase in recombinant GUS expression level up to 0.08% (4-5 folds) significantly improves the process economics. Also, the production of recombinant protease in agriculturally important plant that is commonly used in feed/food chains creates additional biosafety risks of cross-contaminating non-transgenic seed stock.
Departing from the prior art, it is the problem of the invention to provide a plant expression system and production process for proteases that gives high yield and allows industrial scale quantities of the protease to be produced. It is a further object of the invention to provide a plant expression system for proteases that avoids contamination of food, feed or seed stock intended for human or animal consumption by transgenic plant material. It is a further object of the invention to provide a process of producing a protease of interest in plants or plant cells, which addresses the problems associated with toxicity of said protease to plant cells expressing the protease and offers an economic way of the protease production in plants.
The present invention provides a process of producing a protease in a plant or in plant cells, comprising
The present invention further provides a process of producing a protease in a plant or in plant cells, comprising providing a plant with a replicon comprising a coding sequence encoding a fusion protein, said fusion protein comprising an apoplast or plastid signal peptide, a SUMO protein or a C-terminal domain of a SUMO protein, and a zymogen of said protease.
The invention also provides a, preferably isolated, vector or nucleotide sequence comprising a coding sequence encoding a fusion protein, said fusion protein comprising an apoplast or plastid signal peptide, a SUMO protein or derivative of a SUMO protein, and a zymogen of said protease.
The present invention also provides a transgenic plant or plant cells for expressing a protease or a fusion protein comprising said protease, said plant comprising a nucleotide sequence containing a promoter active in vegetative tissue and, downstream of said promoter and operably linked thereto, a coding sequence encoding a fusion protein comprising a plastid or apoplast signal peptide, a SUMO protein or a derivative thereof, and a zymogen of said protease.
The present invention also provides a use of a SUMO protein or a derivative of a SUMO protein for expressing a protease or a zymogen of a protease in a plant or in plant cells.
The inventors have surprisingly found that proteases that are potentially toxic to plant tissue can be efficiently expressed in plants when targeted to the apoplast or to plastids as a fusion with a SUMO protein (or a derivative thereof). Since the SUMO protein (or a derivative thereof) can be cleaved off from the fusion protein when the zymogen of the protease is converted to the protease, no additional working step for removing the SUMO protein from the protease of interest is necessary, whereby the process of the invention is advantageous and convenient for the downstream isolation and purification of the protease of interest.
Transgenic corns seeds used in the prior art for protease production in plants are a problematic production system, since it is almost impossible to avoid contamination of non-GM corn seeds with traces of transgenic corn seeds in industrial scale agriculture. Further, the expression yield obtained for trypsin in corn seeds is not satisfactory, resulting in high costs of trypsin produced by this method. The inventors of the present invention have surprisingly found that it is possible to obtain a high expression yield of proteases in plant leaves although protease inhibitors (such as trypsin inhibitor) reducing a proteolytic degradation of plant host tissue expressing the protease were not expected to a significant extent in leaf tissue. Contrary to what could be expected from Woodard et al. (2003, Biotechnol. Appl. Biochem., 38 123-130), vegetative (or green) tissue such as leaf material has been found to be an ideal tissue for expressing proteases in plants in a high yield.
The process of the invention is not limited to maize but can be performed in plants such as Nicotiana species that are not part of the human food chain. Further, it is not necessary to harvest seeds or transgenic plants for isolating the protease. Instead, the invention can be performed by harvesting plants having expressed protease before seeds have reached a viable growth state. Thereby, contamination of non-GM corn seeds by transgenic seeds can be effectively avoided. In one embodiment, the invention allows it to avoid the use of transgenic plants by using transient expression of the protease of interest. In contrast, transient expression is difficult to practice if a protease is expressed in corn seeds as described by Woodard et al.
The process of the invention can be performed by transient expression or using transgenic plants. Transgenic plants for the invention contain in a nuclear genome the heterologous nucleotide sequence of the invention. In a transient expression system, said nucleotide sequence is provided to a plant, whereby production of said protease may be triggered by the provision of said nucleotide sequence to a plant. The nucleotide sequence is typically not incorporated into the genome of the plant host when the process of the invention is performed by transient expression. In any event, in a transient expression process, plants or plant cells having incorporated the nucleotide sequence into their genome are not selected, e.g. by using an antibiotic resistance marker, from plants or cells not having the nucleotide sequence incorporated into the genome. As a consequence, plants or plant cells having incorporated the nucleotide sequence of the invention into their genome are not produced to a significant extent, whereby transmission of the nucleotide sequence of the invention to progeny plants and seeds is very unlikely. If the process of the invention is carried out by transient expression, the nucleotide sequence of the invention may be provided to a plant having reached a desired growth state.
Said nucleotide sequence of the invention is heterologous, since it does not naturally occur in genome of the plant, or cells thereof, used in the process of the invention.
The nucleotide sequence of the invention comprises the coding sequence of the invention. The coding sequence encodes the fusion protein of the invention. In addition to said coding sequence, the nucleotide sequence of the invention may further have genetic elements for expressing said coding sequence into said fusion protein. Examples of such genetic elements are a promoter active in said plant and a transcription termination region. Said promoter is operably linked to said coding sequence such that expression of the coding sequence is under the control of the promoter. The promoter may be a constitutive promoter active in said plant such as the CaMV 35S promoter. Alternatively, said promoter may be an inducible promoter so that production of said protease can be induced at will, such as at a desired growth state of the plant. In a further alternative, said nucleotide sequence may be incorporated into a chromosome of said plant such that expression of said coding sequence is possible under the transcriptional control of a native host promoter as described in WO 02/46440.
In one embodiment, said nucleotide sequence is or encodes a replicon comprising said coding sequence encoding said fusion protein. Herein, a replicon is a nucleic acid capable of replicating independently from the plant nuclear replication machinery. For this purpose, said replicon has an origin of replication that can be recognized by a nucleic acid polymerase that is present in or that is provided to cells of said plant. The replicon may be a DNA replicon or an RNA replicon. In one embodiment, the replicon is an RNA replicon. In another embodiment, said replicon is a viral replicon. “Viral” means that the replicon contains, e.g. for replicating the viral replicon, one or more sequence portions of a length of at least 5, preferably at least 10 more preferably of at least 20 contiguous nucleotides, or one or more genetic element derived from a virus. Examples of such genetic elements are an origin of replication recognised by the viral nucleic acid polymerase. The viral replicon may, but does not have to, encode the nucleic acid polymerase of the virus. “Derived” means that the sequence portion or genetic element is taken from a virus or is a DNA copy of a sequence portion taken from an RNA virus. The viral replicon may be a DNA viral replicon or an RNA viral replicon. In an advantageous case, the replicon is an RNA viral replicon. Viral RNA replicons generally use viral polymerases for replicating the replicon; polymerases native to the plant host generally cannot replicate viral RNA replicons. RNA viral replicons further use an origin of replication that is not recognized by native plant polymerases. The viral polymerase may be encoded on the RNA replicon or may be provided in trans from a separate vector or from a transgene encoding such viral polymerase whereby the transgene is incorporated into a nuclear or organellar genome of the plant.
Said nucleotide sequence may be or may encode an RNA replicon such as a viral RNA replicon. Said viral RNA replicon may use the replication and expression machinery of a natural plant RNA virus. Suitable RNA viruses from which RNA replicons of the invention may be built on are, for example, positive-sense single-stranded plant RNA viruses. Examples of such plant RNA viruses are tobamoviruses such tobacco mosaic virus, crucifer-infecting tobamovirus, or turnip vein clearing virus. These viruses and their use for expressing a protein of interest in plants are known (see below). Thus, said RNA replicon may encode an RNA-dependent RNA polymerase (“replicase”) capable of replicating said RNA replicon. Further, the RNA replicon will contain an origin of replication that is recognized by the replicase.
There are various ways how a plant or plant cells can be provided with the replicon of the invention. In one embodiment, the nucleotide sequence of the invention is said replicon, and the plant or plant cells are infected directly with the replicon.
In another embodiment, the plant or plant cells are provided with a nucleotide sequence encoding said replicon. If said replicon is an RNA replicon, the nucleotide sequence may be a DNA nucleotide sequence encoding said RNA replicon. The DNA nucleotide sequence may have a promoter for producing said RNA replicon by transcription of a portion of said nucleotide sequence encoding said replicon, e.g. by a native RNA polymerase of the plant or plant cell.
A DNA nucleotide sequence encoding an RNA replicon may be incorporated into the nuclear genome of a plant or plant cells, whereby a transgenic plant or plant cells, respectively, are produced. The promoter present in the DNA nucleotide sequence may be an inducible promoter. In this way, formation of the replicon from which said fusion protein is expressed can be triggered at will by inducing the inducible promoter. Suitable inducible promoters are known in the art. Said inducible promoter may be part of an alcohol inducible system as described in example 4. Measures to suppress the consequences of any leaky expression by an inducible promoter system may be taken, e.g. as described in WO 2007/137788 that is incorporated herein in its entirety. This embodiment may be used together with transgenic plants containing said nucleotide sequence integrated into a nuclear or organellar chromosome.
In a transient expression method of the invention, a plant may be provided with a replicon. Similarly as described above, said replicon may be a DNA or an RNA replicon such as a viral DNA replicon or a viral RNA replicon. Cells of said plant may be provided with said replicons directly, such as by infecting said plant with said replicon or by particle bombardment using particles coated with said replicon. Alternatively, a plant may be provided with a replicon indirectly, such as by Agrobacterium-mediated transfection using Agrobacteria containing Ti plasmids containing T-DNA comprising a nucleotide sequence encoding said replicon (“agroinfection”). After having entered cells of said plant, the DNA or RNA replicon can be activated from said T-DNA by transcription by a DNA-dependent polymerase such as a DNA-dependent polymerase that is native to said plant. For this purpose, the T-DNA typically contains a promoter upstream of the nucleic acid encoding the replicon. After formation of the replicon in cells of said plant, the replicon replicates and expresses said coding sequence for producing the fusion protein of the invention. Agroinfection may be performed using highly diluted suspensions of Agrobacterium as described in WO 2006/3018, notably page 12 bottom to page 13, middle and page 39.
In an advantageous transient expression method, the replicon is an RNA replicon and said plant is provided with said RNA replicon by transforming said plant or said plant cells with a DNA vector as a DNA nucleotide sequence encoding said RNA replicon. Alternatively, said plant or said plant cells are provided with two or more DNA vectors encoding together said RNA replicon, whereby a DNA nucleotide sequence encoding an RNA replicon may be generated inside cells of said plant e.g. by site-specific DNA recombination as described in WO 02/88369. After recombination, the RNA replicon may be formed by transcription involving a native plant host RNA polmyerase and a promoter present on the DNA nucleotide sequence.
The coding sequence of the invention encodes the fusion protein of the invention, said fusion protein comprising at least the following three fusion protein segments: (i) an apoplast or plastid signal peptide, (ii) a SUMO protein or a derivative of a SUMO protein, and (iii) a zymogen of said protease. These elements of said fusion protein may be arranged in different orders. However, the signal peptide is preferably placed at the N-terminal end of the fusion protein in order to be functional for targeting the fusion protein to the apoplast or into the plastids. The zymogen of said protease may be placed at the C-terminus of the fusion protein, which allows its easy separation from the remainder of the fusion protein by a proteolytic cleavage at a single peptide bond. Thus, in one embodiment, the order of the essential fusion protein segments (i) to (iii) may be, in N-terminal to C-terminal direction, as listed in claim 1.
The fusion protein of the invention may further contain a polypeptide segment usable as a purification tag for facilitating the purification of the fusion protein or the protease of interest. The purification tag may be located at an internal position of said fusion protein such as on the C-terminal side of the signal peptide and on the N-terminal side of the zymogen of the protease. However, other orders are also possible e.g. placing the purification tag at the C-terminus of the SUMO protein. Purification tags that can be used for practicing this invention include, but are not limited to: FLAG tag, polyhistidine tags, polyarginine tags, influenza virus HA tag, GST-tag, protein A tag, maltose binding protein (MBP), S-tag, the AviD tag, etc.
The protease to be produced according to the invention may be any protease that is naturally expressed as a zymogen that is activated by proteolytic cleavage to produce the active protease. Such proteases are known in the art. Chymotrypsin and trypsin are examples of proteins to be produced by the invention. The zymogens of chymotrypsin and trypsin are chymotrypsinogen and trypsinogen, respectively. Other examples may include but are not limited to precursors (zymogens) of barley protease EPB2 indicated for celiac sprue treatment (Mikkonen et al., 1996, Plant Mol Biol., 31, 239-254; Vora et al., 2007, Biotechnol Bioeng., 98, 177-185.), elastase, caspases, carboxypeptidase A, thrombin and other proteases.
Plastid signal peptides (also referred to as “plastid transit peptides” in the art) for targeting proteins into plastids are known in the art. Examples of plastig signal peptides are found in WO 2004/101797. Signal peptides for targeting proteins into the secretory pathway and into the apoplast are also known from general knowledge. The signal peptides described in WO 02/101006 may be used for targeting the fusion protein to the apoplast.
Said fusion protein further comprises a SUMO protein or a derivative of a SUMO protein. SUMO proteins usable in the present invention are SUMO proteins that were used in the prior art in bacterial protein expression systems, cf. Butt et al., 2005, Protein Expr Purif., 43, 1-9; Marblestone et al., 2006, Protein Sci., 15, 182-189; Su et al., 2006, Protein Pept. Lett., 13, 785-792; Weeks et al., 2007, Protein Expr Purif., 53, 40-50; US2004018591; EP1654379).
SUMO (small ubiquitin-like modifier) is a member of the superfamily of ubiquitin-like polypeptides (Melchior, F., 2000, Annu. Rev. Cell. Dev. Biol., 16, 591-696; Schwartz & Hochstrasser, 2003, Trends Biochem. Sci., 28, 321-328; Dohmen, R J., 2004, Biochim. Biophys. Acta, 1695, 113-131; Hay, R T., 2007, Trends Cell Biol., 17, 370-376). All SUMO proteins contain a ubiquitin domain (outlined in
In higher plants several genes were identified that encode different SUMO proteins. The genes are designated SUMO1, SUMO2, SUMO3, SUMO4, SUMO5, SUMO6, SUMO7, SUMO8 and SUMO9 in Arabidopsis, but only four of them (SUMO1, SUMO2, SUMO3 and SUMO5) were found to be transcriptionally active (Kurepa et al., 2003, J. Biol. Chem., 278, 6862-6872). Among these SUMO proteins, the fusion with SUMO1 generally gives the highest expression levels for protein of interest in the present invention. In the present invention, a SUMO protein known from an organism such as a plant, an animal or yeast may be used, whereby plant SUMO proteins are generally preferred.
Alternatively, derivatives of a natural SUMO protein may be used. A derivative of a SUMO protein herein comprises, in one embodiment, at least 50 contiguous amino acid residues, in another embodiment at least 60 contiguous amino acid residues, in a further embodiment at least 70 contiguous amino acid residues, and in a still further embodiment at least 80 contiguous amino acid residues.
A SUMO protein-derivative according to the invention is characterized by comprising the typical consensus sequence of a SUMO protein. Such consensus sequence can be determined by making sequence alignments of known SUMO proteins. The consensus sequence defines the a specific amino acid residue or a selection of specific amino acid residues for certain amino acid residue positions, whereas any desired amino acid residue can be chosen at other positions with little influence on the expression properties of the protease to be expressed according to the invention. Suitable consensus sequences can be defined at varying degrees of specificity. In the broadest sense, the derivative of a SUMO protein has the consensus sequence: -L/F/M-X19-F/I-X3-G/D-X7-T/S-P/A-.
In other embodiments of the invention, a derivative of a SUMO protein has any one of the following amino acid sequences:
In the above sequences of a SUMO protein derivative of the invention, the amino acid consensus sequences are given in N-terminal to C-terminal direction;
a is an integer of 17 or 18;
b is an integer of 16 or 17;
c is an integer of 14 or 15;
each letter stands for an amino acid residue;
X stands for any amino acid residue;
letters other than X stand for amino acid residues in the standard one-letter code;
a numerical subscript to a letter indicates that the amino acid residue defined by said letter is present contiguously and connected by peptide bonds as many times as indicated by the numerical value of the subscript;
“-” stands for a peptide bond connecting adjacent amino acid residues; and
“/” indicates that the amino acid position defined by two consecutive “-” can be occupied by any of the amino acid residues defined by letters separated by “/”.
After having expressed the fusion protein of the invention from said coding sequence, said protease or said zymogen or a fusion protein comprising said zymogen can be isolated from vegetative tissue such as from leaf tissue of said plant and purified according to standard methods of protein purification. If the process of the invention is performed in transgenic plants, the protease or the zymogen or a fusion protein comprising the zymogen are preferably isolated before viable seeds have developed from said plant in order to prevent contamination of non-transgenic seeds with transgenic seeds and in order to avoid spread of transgenic seeds in the environment.
Isolation typically includes the following steps: homogenising the tissue containing expressed protease or fusion protein comprising the protease, extracting the protease or fusion protein comprising the protease into a solvent (usually an aqueous, buffered solvent), and separating cell debris and other material insoluble in the solvent e.g. by centrifugation or filtration. The protease or fusion protein comprising the protease may then be purified from other components derived from the tissue present in the solvent. Purification methods established for the protease or fusion protein to be purified may be used. If the fusion protein contains an affinity tag, purification may include affinity chromatography.
If the isolated protein is not the activated protease but the zymogen of the protease or a fusion protein comprising said zymogen, the active protease may be generated from said zymogen or from said fusion protein by proteolytic cleavage. Said proteolytic cleavage may be achieved by a protease recognizing the cleavage site of said zymogen. If the isolated protein is or comprises trypsinogen, the protease enterokinase may be used for activating trypsin from trypsinogen. Chymotrypsin may be activated from chymotrypsinogen by trypsin.
The present invention can be performed with any plant or cells thereof. It is preferred that the invention is performed with plants. Among plants, higher plants are preferred. Among higher plants, the invention may be performed with monocot or with dicot plants. Plants that are not part of the human food chain are preferred. Examples of plants that may be used in the invention are Nicotiana species such as Nicotiana benthamiana and Nicotiana tabacum.
A process of producing a protease in a plant or in plant cells, comprising
A process of producing a protease in a plant or in plant cells, comprising
A process of producing a protease in a plant or in plant cells, comprising providing a plant with an RNA replicon comprising a coding sequence encoding a fusion protein, said fusion protein comprising preferably in the following order (i) to (iii) in N-terminal to C-terminal direction:
A process of producing a protease in a plant or in plant cells, comprising providing, by Agrobacterium-mediated transformation, a plant with an RNA replicon comprising a coding sequence encoding a fusion protein, said fusion protein comprising in the following order (i) to (iii) in N-terminal to C-terminal direction:
A process of producing a protease of interest in a plant, comprising:
Optionally, said plant may comprise a second heterologous nucleotide sequence comprising a nucleotide sequence encoding a protein enabling cell-to-cell movement of said RNA replicon, wherein said second heterologous nucleotide sequence comprises a second inducible promoter operably linked to said nucleotide sequence encoding said protein enabling cell-to-cell movement of said RNA replicon; said inducible promoter and said second indubible promoter may be the same types of inducible promoters such as promoters of an alcohol inducible system.
Other embodiments described herein may be combined with the above advantageous embodiments. One such embodiment is the combination of trypsin as the protease to be produced and a Nicotiana plant such as a tobacco plant.
Ubiquitin fusion were previously suggested for augmenting protein expression in transgenic plants (Hondred et al., Plant Physiology 119 (1999) 713-723). However, when the inventors of the present invention tried to use ubiquitin fusions intended for large scale applications, it was found that the expression yields were small, which explains why very sensitive immoblot analysis had to be made by Hondred et al. for detecting expressed protein. The inventors have further found (example 6) that ubiquitin has toxic effects on plants, cf.
It was previously shown that the expression level of recombinant proteins of interest can be improved via fusion with other proteins including SUMO (Butt et al., 2005, Protein Expr Purif., 43, 1-9; Marblestone et al., 2006, Protein Sci., 15, 182-189; Su et al., 2006, Protein Pept. Lett., 13, 785-792). Weeks et al., 2007, Protein Expr Purif., 53, 40-50; US2004018591; EP1654379). The above publications mostly relate to expression in E. coli. To the best of our knowledge, SUMO has so far not been used for improving expression levels of proteins of interest in plant expression systems, notably for cytotoxic proteins such as proteases.
The process of producing a protease of interest according to the invention involves expression of said protease of interest as a fusion protein which is compartmentalised within plant or plant cell by means of signal peptide or transit peptide-mediated targeting of the fusion protein. In the apoplast, the fusion protein may be processed to active protease, while in the chloroplast the fusion protein may be accumulated as protein inclusion bodies.
In the first step of the process of the invention, a plant or plant cells are transformed or transfected with a nucleotide sequence having a coding sequence encoding said fusion protein having a signal or transit peptide. Transformation may produce stably transformed plants or plant cells, e.g. transgenic plants. Alternatively, said plant or plant cells may be transfected for transient expression of said fusion protein. Several transformation or transfection methods for plants or plant cells are known in the art and include Agrobacterium-mediated transformation, particle bombardment, PEG-mediated protoplast transformation, viral infection etc.
Said nucleotide sequence may be DNA or RNA depending on the transformation or transfection method. In most cases, it will be DNA. In an important embodiment, however, transformation or transfection is performed using RNA virus-based vectors, in which case said nucleotide sequence is RNA.
Said nucleotide sequence comprises a coding region encoding a fusion protein. Said fusion protein comprises the SUMO protein. Said fusion protein further comprises a precursor of the protease of interest or zymogen that upon processing yields active protease (referred to as “protease of interest” in the following). Said protease of interest may be any protease that can be produced and isolated according to the process of the invention. It may be produced in an unfolded, misfolded or in a natural, functional folding state. The latter possibility is preferred.
Said fusion protein further comprises a signal peptide functional for targeting said fusion protein to the apoplast or for targeting said fusion protein to plastid. The apoplast targeting may be achieved with a signal peptide that targets the fusion protein into the endoplasmatic reticulum (ER) and the secretory pathway. All signal peptides of proteins known to be secreted or targeted to the apoplast may be used for the purposes of the invention. Preferred examples are the signal peptides of tobacco calreticulin, barley or rice amylase. Signal peptides that target a protein to plastids are also referred to as “transit peptides”. Any transit peptides can be utilized for practicing said invention. Preferred examples are artificial transit peptides or transit peptides of small subunit rubisco from tobacco. Other signal peptides for targeting the fusion protein to the apoplast are given in EP 1 423 524. For functional targeting the transit peptide or the signal peptide is positioned at the N-terminus of the fusion protein.
After the fusion protein has been expressed by a plant, the protease of interest or a fusion protein comprising the protease of interest can be isolated from the plant or plant cell by standard protein purification methods. The isolation of a protease of interest or a fusion protein thereof can be facilitated by incorporating a purification tag into the protease or fusion protein. Such systems are commercially available e.g. from Amersham Pharmacia Biotech, Uppsala, Sweden. A specific example frequently used for removing a His-tag is the factor Xa system.
In another embodiment of said invention, an isolated fusion protein can be processed for releasing the protease of interest (e.g. trypsin) from said fusion by using enterokinase that specifically cleaves the zymogen (trypsinogen) at the N-terminus, whereby the active protease (e.g. trypsin) is released.
Construction of the nucleotide sequence of the invention may be done according to standard procedures of molecular biology. The nucleotide sequence may contain a plant-specific promoter operably linked to said coding sequence and a transcription terminator after said coding sequence. In the case of stable transgenic plants, inducible expression of the fusion protein may be achieved if desired by an appropriately selected inducible expression system. In a preferred embodiment, virus-based vectors under control of alcohol-inducible system are used for performing this invention. Construction of such viral vectors are described in the reference examples and in the numerous publications.
The nucleotide sequence comprising the coding sequence encoding the fusion protein of the compartmentalized with the help of signal or transit peptide may be delivered into the plant cell preferably using a DNA or an RNA vector. The recombinant protein fusion is expressed and then targeted to the intercellular space (apoplast) in case of fusion with signal peptide or to plastid. The plants with said fusion protein may then be subjected to processing. Dependent from the form in which the protease is accumulated in the plant, the downstream processing might incorporate protein fusion refolding and cleavage in order to produce active protease (in case of plastids compartmentalization), or lead directly to the isolation of active protease (in case of apolplast targeting).
Various methods can be used to deliver the nucleotide sequence of the invention using a vector into the plant cell, including direct introduction of said vector into a plant cell by means of microprojectile bombardment, electroporation or PEG-mediated treatment of protoplasts (for review see: Gelvin, S. B., 1998, Curr. Opin. Biotechnol., 9 227-232; Hansen & Wright, 1999, Trends Plant Sci., 4, 226-231). Plant RNA and DNA viruses also present efficient delivery systems (Hayes et al., 1988, Nature, 334, 179-182; Palmer et al., 1999, Arch. Virol., 144 1345-1360; Lindbo et al., 2001, Curr. Opin. Plant. Biol., 4, 181-185). Vectors can deliver a transgene either for stable integration into the genome of the plant (direct or Agrobacterium-mediated DNA integration) or for transient expression of the transgene (“agroinfiltration”).
Different vectors may be used to express fusion protein in plant or plant cell. Suitable vectors for practicing said invention are the plant viral vectors. In one embodiment, RNA viral vectors are used. The use of such vectors for optimization of proteins expression and for large-scale production are described in detail in numerous publications (Marillonnet et al., 2004, Proc Natl Acad Sci USA, 101:6852-6857; Marillonnet et al., 2005, Nat Biotechnol., 23:718-723; Giritch et al.,2006, Proc Natl Acad Sci USA., 103, 14701-14706; Santi et al., 2006, Proc Natl Acad Sci USA., 103, 861-866). In one embodiment, cloning of a bovine trypsinogen gene (
We have tested trypsinogen expression of many different fusions including translational fusions with five different A. thaliana SUMO proteins. The best results were obtained for SUMO1-trypsinogen fusion. Based on the results of studies with provectors, assembled plant viral vectors were designed for production of proteases in plant tissues. In
In yet another embodiment of this invention (example 2), the results of apoplast- and chloroplast-targeted trypsinogen expression using transient expression from provectors as well as from assembled viral vectors are described. It is evident from the results of protein (predominantly rubisco) degradation on coomassie-stained gel and protease activity measurement using milk assay (
Targeting of trypsinogen into chloroplasts did not produce visible degradation of plant proteins, but resulted in a major coomassie-stained band on polyacryamide gel corresponding in size to a SUMO-trypsinogen fusion (
For large scale production of proteases in plants, a transgenic version of production host may be advantageous compared to transient expression system. Among transgenic vectors, those providing for controllable expression of the coding sequence of the invention are preferred. Controllable expression can help to further minimize cytotoxic effects of a protease. In case of stable integration of a vector expressing a SUMO-trypsinogen fusion into a plant genome, controllable vectors based on inducible expression of said vector is preferred. In the present invention, inducible promoters can be used to trigger production of a protease of interest in plants or plant cells. Inducible promoters can be divided into two categories according to their induction conditions: those inducible by abiotic factors (temperature, light, chemical substances) and those that can be induced by biotic factors, for example, pathogen or pest attack. Examples of the first category include, but are not limited, heat-inducible (US 05187287) and cold-inducible (US05847102) promoters, a copper-inducible system (Mett et al., 1993, Proc. Natl. Acad. Sci., 90 4567-4571), steroid-inducible systems (Aoyama & Chua, 1997, Plant J., 11, 605-612; McNellis et al., 1998, Plant J., 14, 247-257; US06063985), an ethanol-inducible system (Caddick et al., 1997, Nature Biotech., 16, 177-180; WO09321334; WO0109357; WO02064802), isopropyl beta-D-thiogalacto-pyranoside (IPTG)-inducible system (Wilde et al., 1992, EMBO J., 11:1251-1259) and a tetracycline-inducible system (Weinmann et al., 1994, Plant J., 5 559-569). One of the latest developments in the area of chemically inducible systems for plants is a chimaeric promoter that can be switched on by glucocorticoid dexamethasone and switched off by tetracycline (Bohner et al., 1999, Plant J., 19, 87-95). Chemically inducible systems are the most suitable for practicing the present invention. For a review on chemically inducible systems see: Zuo & Chua, (2000, Current Opin. Biotechnol., 11 146-151) and Moore et al., (2006, Plant J., 45: 651-683). It will be clear for the skilled person that any proteins required for the functionality of the chosen inducible system such as repressors or activators have to be expressed in said plant or said plant cells for rendering the inducible system functional. In one embodiment of the invention, ethanol inducible system for controlled release of viral replicon in plant cell is used. In example 4, an alcohol-inducible system described in detail in WO2007137788 was used for inducible expression of apoplast-targeted SUMO1-trypsinogen fusion. The results obtained demonstrate that tightly controlled inducible expression of protease from plant viral vector is obtained.
In example 5 of this invention, we present data of SUMO-EPB2 protease precursor expression in plant cells. It is evident (
Considering that protease protein production in this invention includes the fusion of the protease with a signaling or transit peptide and SUMO, the separation of the protein of interest from the fusion protein shall be considered. In the invention, the use of SUMO and a protease precursor in such fusions introduces at least two cleavage sites. One cleavage site is located between the C-terminus of SUMO protein and the N-terminus of the protease precursor. This cleavage site is recognized by SUMO-specific proteases. Therefore, separation of the protease from SUMO is not an issue. Plant cells like all other eukaryotes, contain potent SUMO proteases that cleave proteins at the end of SUMO, thus precisely removing from SUMO any C-terminal extensions and fusions (Kurepa et al., 2003, J. Biol. Chem., 278, 6862-6872; Colby et al., 2006, Plant Physiol., 142, 318-332; Novatchkova et al., 2004, Planta, 220, 1-8; Hay, R T., 2004, Trends Cell Biol., 17, 370-376; Johnson et al., 2004, Annu Rev Biochem., 73, 355-382). In the invention, the protease precursor or zymogen is used for fusion with SUMO protein. The protease precursor contains yet another cleavage site not far from the N-terminus of said precursor. Cleavage at this cleavage of said site is important for the maturation of the precursor into the active protease. In case of trypsinogen, precursor of trypsin, enterokinase-mediated removal of a hexapeptide from the N-terminal end of trypsinogen produces trypsin. From our results with apoplast targeted SUMO-trypsinogen fusion, it is evident that a protease with enterokinase-like activity capable of a cleaving hexapeptide from trypsinogen must be present in plants. If desired, a protease capable or producing the protease produced according to the invention from its zymogen may be applied. Notably, such treatment step will be used during isolation and/or purification of the protease of interest that was targeted to plastids.
The presence of a cleavage site within the zymogen allows removal of any fusion protein linked to the N-terminus of said zymogen. Therefore, the use of intact C-terminus of SUMO protein in order to provide cleavage with a SUMO-specific proteases is not necessary in the invention. This allows to use a truncated version of SUMO protein in the fusion protein of the invention and other derivatives of SUMO protein.
Commercially available calf thymus genomic DNA (Sigma-Aldrich., cat. No. D4764) was used as the template for cloning coding regions for trypsinogen and trypsin proteins. The partial coding sequence for activation peptide and pancreas cationic trypsinogen (Core Nucleotide database Acc. No D38507, also see
The Arabidopsis thaliana SUMO1 gene (gene ID 828791) was cloned using genomic DNA as template for PCR amplification. The PCR product containing two original SUMO1 introns, was cloned into intermediate vectors using standard molecular biology techniques ((Sambrook, Fritsch & Maniatis, 1989, Molecular Cloning: a Laboratory Manual, CSH, NY), and used in constructs design.
The viral vector modules with signal peptide, transit peptide-SUMO1 and signal peptide-SUMO1 fusions encoding for 5′-viral pro-vectors and assembled viral vectors were designed using cloning approaches, as it described by Marillonnet et al., 2005, Nat. Biotechnol., 23, 718-723. The restriction maps of whole plasmids and complete coding sequences for the T-DNA regions of assembled viral vectors pICH29090 and pICH29373 (
The complete sequence of pICH29090 is shown in SEQ ID NO: 1. The sequence encoding the apoplast targeting signal peptide of pICH29090 is shown in SEQ ID NO: 2. The sequence encoding the apoplast targeting signal peptide of pICH29373 is shown in SEQ ID NO: 3. The sequence of pICH29373 is identical to that of pICH29090 except that the sequence encoding the apoplast targeting peptide of pICH29090 is replaced by the sequence encoding the plastid transit peptide shown in SEQ ID NO: 3.
Transient Expression of Bovine Trypsinogen in N. benthamiana Using Plant Viral Vectors
All constructs described in example 1 were electroporated into Agrobacterium tumefaciens GV3101. Agroinfiltrations of N. benthamiana plants were done essentially as described in Marillonnet et al., 2004, Proc Natl Acad Sci USA, 101:6852-6857. In case of provectors, three agrobacterial strains containing 5′ provector encoding for targeting signal peptide and SUMO1 fusion or any of the fusion/targeting sequence (not shown), 3′ provector encoding the trypsinogen or trypsin genes and a source of a site-specific recombinase (pICH14011,
Analysis of SUMO1-Trypsinogen Fusion Expressed in N. benthamiana Leaves
All recombinant protein fusions were extracted from infiltrated N. benthamiana leaves 7-12 days after infiltration and analysed by electrophoretic separation in polyacrylamide gels as previously described (Marillonnet et al., 2004, Proc Natl Acad Sci USA, 101:6852-6857; Marillonnet et al., 2005, Nat Biotechnol., 23:718-723). Plant leaf tissue was harvested from different leaves of young (y) or old plants. Tissue was extracted with 3 volumes of 0.15 M Tris-HCl pH8.1; 2mM EDTA, incubated for 10 minutes on ice and centrifuged for 12 minutes at 13 Krpm, 4° C. Supernatant was mixed with equal volume of 2× Laemmli buffer (125 mM Tris/HCl pH 6.8, 10% mercaptoethanol, 20% glycerol, 0.01% bromphenole blue, 4% SDS) and 0.004 ml of mixture corresponding to 1.3 mg of starting leaf tissue was loaded on gels.
The results of electrophoretic analysis are shown in
The relative activity of plant-expressed trypsin in comparison with commercially available trypsin samples (ICN Biomedicals Inc., cat. 101789, 10 mg/ml in 1 mM HCl, working solution 1:10 to 1:100 diluted in 1 mM HCL) was measured by using milk assay. The assay was performed as follows:
0.005-0.02 ml of plant extract in 0.15 M Tris-HCl pH 8.1; 2 mM EDTA 8,0 buffer is mixed with 0.02 ml of 3% solution of dry milk powder (Roth, Cat. No. T145.2) in TBS (25 mM tris-HCl, 142 mM NaCl) and incubated at room temperature until positive control sample with commercially available trypsin does not start to clarify the milk solution due to the digestion of milk protein. Clear solution in test samples means the presence of active trypsin. If the solution remains milky, no trypsin activity is present. The results of the test are shown in
Agroinfiltration of plants with assembled viral vector and provectors providing for expression of chloroplast-targeted SUMO1-trypsinogen fusion was performed as described above for apoplast-targeted SUMO1-trypsinogen fusion. The results of electrophoretic analysis of the expression level are shown in
Extraction of SUMO1-Trypsinogen Fusion from Chloroplasts
The protein fusion accumulated in chloroplasts can be extracted with SDS-PAGE sample (Laemmli) buffer at 95° C. (see example 2) and at least partially with buffers containing chaotropes. The current extraction method was established based on methods suitable for solubilisation of inclusion bodies (IBs) occurring in many recombinant protein expressions in E. coli including trypsinogen (Buswell et al., 2002, Biotechnol. Bioeng., 77, 435-444; Hohenblum et al., 2004, J. Biotechnol., 109, 3-11; Ahsan et al., 2005, Mol. Biotechnol., 30, 193-205; Király et al., 2006, Protein Expr. Purif., 48, 104-111). 5.5 g of plant leaf material containing chloroplast targeted SUMO1-trypsinogen fusion was sequentially treated with the set of following extraction buffers:
Extraction 1: 30 ml buffer E1 was added to the leaf material. The suspension was thoroughly mixed using an ultraturrax (6-7 times 30 s), and kept on ice between the 3 mixing steps. Overall mixing time 10 min. The suspension was centrifuged 15 min at 40000g and 4° C.
Extraction 2: 25 ml buffer E2 were added to the pellet of extraction step 1. The suspension was thoroughly mixed using an ultraturrax (3 times 1 min), and kept on ice between the mixing steps. Overall mixing time 10 min. The suspension was centrifuged 15 min at 40000 g and 4° C.
Extraction 3: 25 ml buffer E3 were added to the pellet of extraction step 2. The suspension was thoroughly mixed using an ultraturrax (3 times 1 min) and kept on ice between the mixing steps. Overall mixing time 10 min. The suspension was centrifuged 15 min at 40000 g and 4° C. This washing step was repeated 3 times.
Extraction 4: 10 ml buffer E4 was added to the pellet of extraction step 3. The suspension was thoroughly mixed using an ultraturrax (5 min), and incubated further for 1 h on a rolling mixer at room temperature. The suspension was centrifuged 15 min at 40000 g and 4° C. The supernatant, assumed to contain solublised SUMO-trypsinogen, was applied to a HiPrep Desalting column to exchange the buffer to 8 M urea, acetic acid, pH 4.0. The fractions containing protein were pooled. All fractions were analysed by SDS-PAGE and Westernblot (not shown).
The final extraction of unsoluble proteins is achieved with the GuaHCl containing buffer. In preliminary experiments, this buffer was found to be suitable to extract more SUMO1-trypsinogen compared to a similar buffer containing 9 M urea as chaotrope.
Additional purification of extracted SUMO1-trypsinogen was carried out using Q Hyper D 20 anion exchange chromatography (Pall, Biosepra, code no. 200683, column 4.6×100 mm). The sample was prepared by adding 0.5 volume of 200 mM Tris-HCl, 1 mM EDTA, pH 8.5, to one volume of solubilised SUMO1-trypsinogen from extraction 4. The pH was adjusted to >8.0 by addition of 1 M NaOH and final solution was loaded onto the column. The elution was carried out by linear NaCl gradient (starting buffer: 50 mM Tris-HCl, 8 M urea, pH 8.5; final buffer: 50 mM Tris-HCl, 8 M urea, 1 M NaCl, pH 8.5; flow rate: 1 ml/min; 20 column volumes, elution fraction volume: 0.5 ml. 8 M urea was used to keep SUMO-trypsinogen fusion solubilised. The resulting purity of SUMO-trypsinogen was comparable to typical proteins solubilised from bacterial inclusion bodies. The final pool contained an overall amount of 0.6 mg in 3 ml.
The folding of purified SUMO1-trypsinogen fusion was carried out as described by Hohenblum and colleagues (J. Biotechnol., 109 (2004) 3-11) with some modifications. Concentration of the purified extract of SUMO1-trypsinogen to 150 μl was carried out by using Vivaspin 500 (Sartorius, MWCO 3,000; prod no. VS0191; Lot no. 07VS50030). Recovery from the Vivaspin concentrator included rinsing with 8 M urea, 50 mM Tris, pH 8.6. Final volume of the concentrated samples 700 μl. To the concentrated sample, 1 M DTE was added to a final concentration of 10 mM and the solution was incubated for 2.5 hours at 37° C. Then, 1 volume of 200 mM GSSG (oxidised glutathion), 8 M urea, pH 8.6 was added and the mixture was incubated for 3 hours at 37° C. After incubation, the buffer was replaced to 50 mM Tris-HCl, 8 M urea, pH 8.6 by using HiTrap desalting columns (GE Healthcare, cat. no. 17-1408-01). Then dilution of the solubilisates into folding buffer (two different buffers, 1:20 dilution; folding buffer 1: 50 mM Tris-HCl, 50 mM CaCl2, 3 mM GSH, 0.3 mM GSSG, pH 8.6; folding buffer 2: 50 mM Tris-HCl, 50 mM CaCl2, 700 mM Arg, 3 mM GSH) and incubation of the folding reaction was carried out at 4° C. for at least 16 hours. After incubation, the folding reaction was concentrated 10-fold with Vivaspin 20 concentrators (Sartorius, MWCO 5,000; prod. no. VS2011, Lot 06VS2050). Then, the folding buffer was replaced by cleavage buffer (20 mM Tris-HCl, 50 mM NaCl, 2 mM CaCl2, pH 8.0) using HiTrap desalting column. The resulting samples were analysed by SDS-PAGE and activity assays were made (results are not shown).
In order to test the quality of SUMO1-trypsinogen folding, enterokinase (stock solution: 1 mg/ml of enterokinase (Sigma, cat. no. E0885) in cleavage buffer) was added to the folding samples. In case of a successful folding, enterokinase cleavage should lead to the formation of active trypsin that can be detected due to the cleavage of the chromogenic substrate BAPNA (Sigma, cat. no. B4875). The enterokinase cleavage and the analytical methods were established with commercially available trypsinogen. In accordance with the SUMO1-trypsinogen concentration in the folding reaction, the analytical method was established with trypsinogen concentrations between 0.1 and 50 μg/ml. The BAPNA assay system was optimised to detect very small amounts of trypsin in enterokinase treated folding reactions. The BAPNA assay was performed as follows:
Ten microliters of enterokinase were added to 990 pl folding sample in cleavage buffer and the mixture was incubated at 37° C. for >16 hours. The sample was pipetted into a cuvette and 50 μl of 2 mM BAPNA solution in cleavage buffer were added. The BAPNA cleavage was detected at 37° C. by absorption spectroscopy at 405 nm over a time period of 60 min. The results (kinetic of absorption) were evaluated in comparison with results obtained from control samples obtained from trypsinogen standard. The results of cleavage experiments are shown in
Use of Alcohol-Inducible System for the Expression of Apoplast Targeted SUMO-Trypsinogen Fusion in Transgenic N. benthamiana Plants
The constructs for inducible expression of trypsin in transgenic plants are shown in
N. benthamiana plants were transformed with pICH28287 according to standard protocols (Horsh et al., 1985, Science, 227 1229-1231). Regenerated plants were analysed for the presence of the transgene by agroinfiltration with the constructs providing for alcR transcriptional activator and functional MP (pICH18693 and pICH26505, respectively, see
Cloning of Barley (Hordeum vulgare) Coding Sequences Encoding for Cysteine Endoprotease B (EPB2) Precursor and its Integration into Plant Viral Vectors
The codon-optimised sequence of barley cysteine endoprotease B isoform 2 (EPB2) gene (GeneBank Acc. No. U19384) was custom-synthesized (GENEART AG, Regensburg, Germany). The sequence of the gene and its translation product are shown in
Analysis of SUMO1-EPB2 Zymogen Fusion Expressed in N. benthamiana Leaves
All recombinant protein fusions were extracted from infiltrated N. benthamiana leaves 7-12 days after infiltration and analysed by electrophoretic separation in polyacrylamide gels as previously described (Marillonnet et al., 2004, Proc Natl Acad Sci USA, 101:6852-6857; Marillonnet et al., 2005, Nat Biotechnol., 23:718-723). It was found that the expression level reached a maximum 8 days after infiltration.
Plant leaf tissue was extracted with 5 volumes of tris extraction buffer (0.1 M Tris-HCl pH 8.0; 5 mM EDTA, 2 mM mercaptoethanol, 0.1% SDS, 15% glycerol), incubated for 10 minutes on ice and centrifuged for 12 minutes at 13 Krpm, 4° C. The supernatant was mixed with equal volume of 2× Laemmli buffer (125 mM Tris/HCl pH 6.8, 10% mercaptoethanol, 20% glycerol, 0.01% Bromphenole blue, 4% SDS) and incubated in a boiling water bath before loading on gels. Alternatively, plant leaf tissue was extracted with 10 volumes (w/v) of 1× Laemmli buffer (62.5 mM Tris/HCl pH 6.8, 5% mercaptoethanol, 10% glycerol, 0.005% Bromphenole blue, 2% SDS). Extracts were incubated in a boiling water bath before loading on gels.
The results of electrophoretic analysis are shown in
GFP fusions with full-length A. thaliana ubiquitin as well as its N-terminally truncated versions were tested using plant virus-derived expression system. The constructs used in the experiment are shown in
The pictures of infiltrated N. benthamiana leaves under day light and UV light (to monitor for GFP expression) are shown in
SUMO-GFP fusions did not show cytotoxic effect (
The entire disclosure of European patent application No. 08 004 005.8 filed on Mar. 4, 2008 including description, claims and figures is incorporated herein by reference.
Number | Date | Country | Kind |
---|---|---|---|
08004005.8 | Mar 2008 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/EP2009/001502 | 3/3/2009 | WO | 00 | 10/27/2010 |