The present invention relates to methods of intein-mediated ligation of proteins. More specifically, the present invention relates to intein-mediated ligation of expressed proteins containing a predetermined N-terminal residue and/or a C-terminal thioester generated via use of one or more naturally occurring or modified inteins. Preferably, the predetermined residue is cysteine.
Inteins are the protein equivalent of the self-splicing RNA introns (see Perler et al., Nucleic Acids Res. 22:1125-1127 (1994)), which catalyze their own excision from a precursor protein with the concomitant fusion of the flanking protein sequences, known as exteins (reviewed in Perler et al., Curr. Opin. Chem. Biol. 1:292-299 (1997); Perler, F. B. Cell 92(1):1-4 (1998); Xu et al.,EMBO J. 15(19):5146-5153 (1996)).
Studies into the mechanism of intein splicing led to the development of a protein purification system that utilized thiol-induced cleavage of the peptide bond at the N-terminus of the Sce VMA intein (Chong et al., Gene 192(2):271-281 (1997)). Purification with this intein-mediated system generates a bacterially-expressed protein with a C-terminal thioester (Chong et al., (1997)). In one application, where it is described to isolate a cytotoxic protein, the bacterially expressed protein with the C-terminal thioester is then fused to a chemically-synthesized peptide with an N-terminal cysteine using the chemistry described for “native chemical ligation” (Evans et al., Protein Sci. 7:2256-2264 (1998); Muir et al., Proc. Natl. Acad. Sci. USA 95:6705-6710 (1998)).
This technique, referred to as “intein-mediated protein ligation” (IPL), represents an important advance in protein semi-synthetic techniques. However, because chemically-synthesized peptides of larger than about 100 residues are difficult to obtain, the general application of IPL is limited by the requirement of a chemically-synthesized peptide as a ligation partner.
IPL technology would be significantly expanded if an expressed protein with a predetermined N-terminus, such as cysteine, could be generated. This would allow the fusion of one or more expressed proteins from a host cell, such as bacterial, yeast or mammalian cells.
One method of generating an N-terminal cysteine is with the use of proteases. However, proteases have many disadvantages, such as the possibility of multiple protease sites within a protein, as well as the chance of non-specific degradation. Furthermore, following proteolysis, the proteases must be inactivated or purified away from the protein of interest before proceeding with IPL. (Xu, et al., Proc. Natl. Acad. Sci. USA 96(2):388-393 (1999) and Erlandson, et al., Chem. Biol., 3:981-991 (1996))
There is, therefore, a need for an improved intein-mediated protein ligation method which overcomes the noted limitations of current IPL methods and which eliminates the need for use of proteases to generate an N-terminal cysteine residue. Such an improved IPL method would have widespread applicability for the ligation of expressed proteins, for example, labeling of extensive portions of a protein for, among other things, NMR analysis.
In accordance with the present invention, there is provided a method for the ligation of expressed proteins utilizing one or more inteins which display cleavage at their N— and/or C-termini. In accordance with the present invention, such inteins may occur either naturally or may be modified to cleave at their N— and/or C-termini. Inteins displaying N— and/or C-terminal cleavage enable the facile isolation of a protein having a C-terminal thioester and a protein having an N-terminal amino acid residue such as cysteine, respectively, for use in the fusion of one or more expressed proteins. Alternatively, the method may be used to generate a single protein having both a C-terminal thioester and a specified N-terminal amino acid residue, such as cysteine, for the creation of cyclic or polymerized proteins. These methods involve the steps of generating at least one C-terminal thioester-tagged first target protein, generating at least one second target protein having a specified N-terminal amino acid residue, for example cysteine, and ligating these proteins. This method may be used where a single protein is expressed, where, for example, the C-terminal thioester end of the protein is fused to the N-terminal end of the same protein. The method may further include chitin-resin purification steps.
In one preferred embodiment the intein from the RIR1 Methanobacterium thermoautotrophicum is modified to cleave at either the C-terminus or N-terminus. The modified intein allows for the release of a bacterially expressed protein during a one-column purification, thus eliminating the need proteases entirely. DNA encoding these modified inteins and plasmids containing these modified inteins are also provided by the instant invention.
The present invention provides a solution to the limitations of current intein-mediated ligation methods by eliminating the need for a synthetic peptide as a ligation partner, and providing a method which is suitable for the fusion one or more expressed proteins.
In general, any intein displaying N— and/or C-terminal cleavage at its splice junctions can be used to generate a defined N-terminus, such as cysteine as well as a C-terminal thioester for use in the fusion of expressed proteins. Inteins which may be used in practicing the present invention include those described in Perler, et al., Nucleic Acids Res., 27(1):346-347 (1999).
In accordance with one preferred embodiment, an intein found in the ribonucleoside diphosphate reductase gene of Methanobacterium thermoautotrophicum (the Mth RIR1 intein) was modified for the facile isolation of a protein with an N-terminal cysteine for use in the in vitro fusion of two bacterially-expressed proteins. The 134-amino acid Mth RIR1 intein is the smallest of the known mini-inteins, and may be close to the minimum amino acid sequence needed to promote splicing (Smith et.al., J. Bacteriol. 179:7135-7155 (1997)).
The Mth RIR1 intein has a proline residue on the N-terminal side of the first amino acid of the intein. This residue was previously shown to inhibit splicing in the Sce VMA intein (Chong et al., J. Biol. Chem. 273:10567-10577 (1998)). The intein was found to splice poorly in E. coli when this naturally occurring proline is present. Splicing proficiency increases when this proline is replaced with an alanine residue. Constructs that display efficient N— and C-terminal cleavage are created by replacing either the C-terminal asparagine or N-terminal cysteine of the intein, respectively, with alanine.
These constructs allow for the formation of an intein-generated generated C-terminal thioester on a first target protein and an intein-generated N-terminal cysteine on a second target protein. These complementary reactive groups may then be ligated via native chemical ligation to produce a peptide bond (Evans et al supra (1998), Muir et al supra (1998)). Alternatively, a single protein containing both reactive groups may be generated for the creation of cyclic or polymerized proteins. Likewise, more than one first or second target proteins may be generated via use of multiple mutant inteins.
As used herein, the terms fusion and ligation are used interchangeably. Also as used herein, protein shall mean any protein, fragment of any protein, or peptide capable of ligation according to the methods of the instant invention. Further, as used herein, target protein shall mean any protein the ligation of which, according to the methods of the instant invention, is desired.
The general method of intein-mediated protein ligation in accordance with the present invention is as follows:
The methodology described by the instant invention significantly expands the utility of current IPL methods to enable the labeling of extensive portions of a protein for NMR analysis and the isolation of a greater variety of cytotoxic proteins. In addition, this advance opens the possibility of labeling the central portion of a protein by ligating three or more fragments.
The use of an intein or inteins with N-terminal and C-terminal cleavage activity provides the potential to create a defined N-terminus, such as a cysteine, and a C-terminal thioester on a single protein. The intramolecular ligation of the resulting protein generates a circular protein, whereas the intermolecular ligation of several of these proteins generates a protein polymer.
Cleavage at the N— and/or the C-terminus of an intein can be brought about by introducing changes to the intein and/or its extein sequences. Also, naturally occuring inteins may display these properties and require no manipulation. Cleavage at the N— and/or C-terminus of an intein can occur uncontrollably or induced using nucleophilc compounds, such as thiol reagents, temperature, pH, salt, chaotropic agents, or any combination of the aforementioned conditions and/or reagents.
The Examples presented below are only intended as specific preferred embodiments of the present invention and are not intended to limit the scope of the invention except as provided in the claims herein. The present invention encompasses modifications and variations of the methods taught herein which would be obvious to one of ordinary skill in the art.
The references cited above and below are herein incorporated by reference.
The gene encoding the Mth RIR1 intein along with 5 native N— and C-extein residues (Smith et al. supra (1997)) was constructed using 10 oligonucleotides (New England Biolabs, Beverly, Mass.) comprising both strands of the gene, as follows:
To ensure maximal E. coli expression, the coding region of the synthetic Mth RIR1 intein incorporates 61 silent base mutations in 49 of the 134 codons (see
The unique Xhol and Spel sites flanking the N-terminal splice junction and the unique BsrGl and Agel sites flanking the C-terminal splice junction allowed substitution of amino acid residues by linker replacement. The proline residue, Pro−1, preceding the intein in pMRB8P was substituted with alanine or glycine to yield pMRB8A and pMRB8G1, respectively. Substitution of Pro−1-Cys1 with Gly-Ser or Gly-Ala yielded pMRB9GS and pMRB9GA, respectively. Replacing Asn134 with Ala in pMRB8G1 resulted in pMRB1 OG. The following linkers were used for substitution of the native amino acids at the splice junctions (each linker was formed by annealing two synthetic oligonucleotides as described above):
pBRL-A was constructed by substituting the Escherichia coli maltose binding protein (MBP) and the Bacillus circulans chitin binding domain (CBD) coding regions in pMRB9GA with the CBD and the T4 DNA ligase coding regions, respectively, subcloned from the pBYT4 plasmid.
The pMRB10G construct from Example I contains the Mth RIR1 intein engineered to undergo thiol reagent induced cleavage at the N-terminal splice junction (
The cells were pelleted by centrifugation at 3,000×g for 30 minutes followed by resuspension in buffer A (20 mM Tris-HCl, pH 7.5 containing 500 mM NaCI). The cell contents were released by sonication. Cell debris was removed by centrifugation at 23,000×g for 30 minutes and the supernatant was applied to a column packed with chitin resin (10 mL bed volume) equilibrated in buffer A. Unbound protein was washed from the column with 10 column volumes of buffer A.
Thiol reagent-induced cleavage was initiated by rapidly equilibrating the chitin resin in buffer B (20 mM Tris-HCI, pH 8 containing 500 mM NaCI and 100 mM 2-mercaptoethane-sulfonic acid (MESNA)). The cleavage reaction, which simultaneously generates a C-terminal thioester on the target protein, proceeded overnight at 4° C. after which the protein was eluted from the column. The use of the PMRB10G construct resulted in the isolation of MBP with a C-terminal thioester (
The pBRL-A construct from Example I contains an Mth RIR1 intein engineered to undergo controllable cleavage at its C-terminus, and was used to purify proteins with an N-terminal cysteine (
The expression of plasmid pBRL-A resulted in the purification of 4-6 mg/L cell culture of T4 DNA ligase possessing an N-terminal cysteine (
Intein-mediated protein ligation (IPL) was used to fuse two proteins (
Ligation reactions proceeded overnight at 40° C. and were visualized using SDS-PAGE with 12% Tris-glycine gels (Novex Experimental Technology, San Diego, Calif.) stained with Coomassie Brilliant Blue. Typical ligation efficiencies ranged from 20-60%.
A Factor Xa site in MBP that exists 5 amino acids N-terminal terminal from the site of fusion (Maina et al, supra (1988)) allowed amino acid sequencing through the ligation junction. The sequence obtained was NH2-TLEGCGEQPTGXLK—COOH (SEQ ID NO:21 ) which matched the last 4 residues of MBP (TLEG) followed by a linker sequence (CGEQPTG (SEQ ID NO:22)) and the start of T4 DNA ligase (ILK). During amino acid sequencing, the cycle expected to yield an isoleucine did not have a strong enough signal to assign it to a specific residue, so it was represented as an X. The cysteine was identified as the acrylamide alkylation product.
The Factor Xa proteolysis was performed on 2 mg of ligation reaction involving MBP and T4 DNA ligase. This reaction mixture was bound to 3 mL of amylose resin (New England Biolabs, Inc., Beverly, Mass.) equilibrated in buffer A (see Example II). Unreacted T4 DNA ligase was rinsed from the column with 10 column volumes of buffer A. Unligated MBP and the MBP-T4 DNA ligase fusion protein were eluted from the amylose resin using buffer E (20 mM Tris-HCI, pH 07.5 containing 500 mM NaCI and 10 mM maltose). Overnight incubation of the eluted protein with a 200:1 protein:bovine Factor Xa (NEB) ratio (w/w) at 40° C. resulted in the proteolysis of the fusion protein and regeneration of a band on SDS-PAGE gels that ran at a molecular weight similar to T4 DNA ligase. N-terminal amino acid sequencing of the proteolyzed fusion protein was performed on a Procise 494 protein sequencer (PE Applied Biosystems, Foster City, Calif.).
The cleavage and/or splicing activity of the Mth RIR1 intein was more proficient when protein synthesis was induced at 15° C. than when the induction temperature was raised to 37° C. (
This Application is a Continuation application of U.S. application Ser. No. 09/249,543, filed Feb. 12, 1999, which claims priority from U.S. provisional application No. 60/102,413, filed Sep. 30, 1998.
Number | Date | Country | |
---|---|---|---|
60102413 | Sep 1998 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 09249543 | Feb 1999 | US |
Child | 10960905 | Oct 2004 | US |