The present invention relates to an expression cassette including a promoter and a gene encoding a fusion protein of a chaperone and an RNA-binding domain; an expression vector including the expression cassette; a composition for solubilizing a target protein, including the expression cassette or the expression vector; a transformant including the expression cassette or the expression vector; a kit including the composition or the transformant; and a method for producing the target protein by using the same.
When recombinant proteins are produced in heterologous cells, there is a problem that recombinant proteins form insoluble aggregates, and thus there is a need for a method for increasing solubilization of recombinant proteins (Current opinion in biotechnology, 2001, 12, 202-207). However, a method for solubilizing recombinant proteins, which may be applied to various recombinant proteins, has not been developed yet.
Under this background, the present inventors have made many efforts to develop a method for increasing solubilization of a target protein, and as a result, they found that a chaperone-recruiting mRNA scaffold (CRAS) system and a chaperone-substrate co-localized expression (CLEX) system increase solubilization of various target proteins, thereby completing the present invention.
An object of the present invention is to provide a composition for solubilizing a target protein, the composition including an expression cassette including a promoter and a gene encoding a fusion protein of a chaperone and an RNA-binding domain or an expression vector including the expression cassette.
Another object of the present invention is to provide a transformant comprising the expression cassette including the promoter and the gene encoding the fusion protein of the chaperone and the RNA-binding domain or the expression vector including the expression cassette.
Still another object of the present invention is to provide a kit for producing the target protein, the kit comprising the composition or the transformant.
Still another object of the present invention is to provide the expression cassette comprising the promoter and the gene encoding the fusion protein of the chaperone and the RNA-binding domain or the expression vector including the expression cassette.
Still another object of the present invention is to provide an expression cassette comprising a promoter, a gene encoding a target protein, and a gene of a hairpin structure at the 3′-terminus thereof, or an expression vector including the expression cassette.
Still another object of the present invention is to provide a method for producing a target protein, the method comprising the steps of:
(i) introducing into a microorganism (a) the expression cassette including the promoter and the gene encoding the fusion protein of the chaperone and the RNA-binding domain or the expression vector including the expression cassette and (b) the expression cassette including the promoter, the gene encoding the target protein, and the gene of the hairpin structure at the 3′-terminus thereof, or the expression vector including the expression cassette; and
(ii) culturing the microorganism.
Still another object of the present invention is to provide an expression cassette comprising (a) a cistron encoding a chaperone and (b) a cistron encoding a target protein, or an expression vector including the expression cassette.
Still another object of the present invention is to provide a composition for solubilizing a target protein, the composition comprising the expression cassette including (a) the cistron encoding the chaperone and (b) the cistron encoding the target protein, or the expression vector including the expression cassette.
Still another object of the present invention is to provide a transformant comprising the expression cassette including (a) the cistron encoding the chaperone and (b) the cistron encoding the target protein, or the expression vector including the expression cassette.
Still another object of the present invention is to provide a kit for producing the target protein, the kit comprising the expression cassette including (a) the cistron encoding the chaperone and (b) the cistron encoding the target protein; the expression vector including the expression cassette; the composition for solubilizing the target protein including the expression cassette or the expression vector; or the transformant including the expression cassette or the expression vector.
Still another object of the present invention is to provide a method for producing the target protein, the method comprising the steps of:
(i) preparing the transformant including the expression cassette including (a) the cistron encoding the chaperone and (b) the cistron encoding the target protein, or the expression vector including the expression cassette; and
(ii) culturing the transformant.
(A) is a schematic illustration the CRAS system. A fusion protein of a chaperone molecule (DnaJ, DnaK, or a complex of DnaJ and DnaK (DnaJ-DnaK)) and an RNA-binding domain KH of Nova-1 was expressed together with a target recombinant protein to which a KH cognate stem-loop sequence was inserted as shown in (C). The chaperon complex binds with an initial protein to prevent protein aggregation or transfers the protein to other chaperone molecules for correct folding by introducing the KH-binding structure into the 3′UTR of mRNA;
(B) shows the result of solubilization of eight selected aggregatable recombinant proteins (ScFv, HIV-1 protease, BR2-ScFv, UGD, Adhlp, UbiC, Leptin, BMP2) when they were expressed with DnaJ-KH alone; or DnaJ-KH and 3′UTR hairpin loop;
(D) shows the result of solubilization according to one (+) or three (+++) 3′UTR sequence(s), when four recombinant proteins which were not solubilized by the CRAS system using DnaJ-KH were applied to a CRAS system using a chimeric DnaJ-KH;
(E) illustrates a split GFP assay to examine in vivo protein solubilization activity of the CRAS system; and
(F) shows the result of in vivo solubilization of sfGFP N-terminus by the CRAS system. Fluorescence intensities of E. coli BL21(DE3) cells including (1) pET16b and pAMT7 (a negative control group); (2) pET16b-sfGFP and pAMT7-DnaJK-KH (a positive control group); (3) plasmid of sfGFP N-terminus and C-terminus in the absence of KH-binding hairpin and DnaJ-KH; (4) plasmid of sfGFP N-terminus and C-terminus in the presence of KH-binding hairpin and in the absence of DnaJ-KH; (5) plasmid of sfGFP N-terminus and C-terminus in the absence of KH-binding hairpin and in the presence of DnaJ-KH; or (6) plasmid of sfGFP N-terminus and C-terminus in presence of KH-binding hairpin and DnaJ-KH (GRAS system). Quantification of soluble ScFv was performed on SDS-PAGE images obtained by using an ImageJ v1.48 software (NHI, USA) through Coomassie blue staining. The error bars in (B), (D) and (F) represent ±standard deviations from three independent experiments.
lane M, marker (ELPIS, Daejeon, South Korea); lane W, whole cell lysate; lane S, soluble fraction; lane I, insoluble fraction.
lane 1, in the absence of spacer between stop codon and 3′UTR-binding loop; lane 2, in the presence of 5-nt spacer between stop codon and 3′UTR-binding loop; lane 3, in the presence of 30-nt spacer between stop codon and 3′UTR-binding loop; lane M, marker; lane W, whole cell lysate; lane S, soluble fraction; lane I, insoluble fraction.
(A) is a schematic illustration of the CLEX system. A translationally coupled two-cistron expression system was used to translate DnaJ and target protein in proximity, from a single mRNA transcript;
(B) shows overlapping of the stop codon (TAA) of the first cistron with the start codon (ATG) of the second cistron, or a tandem arrangement thereof, or a space of n nt placed therebetween. 12 nucleotides in the 3′-sequence of the first cistron function as a ribosome-binding site (RBS) for the second cistron;
(C) shows the result of solubilization of five aggregatable recombinant proteins when chaperone DnaJ was used as the first or second cistron in the CLEX system;
(D) shows the result of measuring OD450 of samples obtained from E. coli BL2J (DE3) having the following plasmids in order to examine a level of correctly folded leptin: pET16b (a negative control group), pET16b-Leptin (leptin overexpression) and pET16b-DnaJ/Leptin (DnaJ and leptin overexpression in CLEX system, CLEX system);
(E) shows the result of solubilization of BMP2 according to the order of DnaJ and target protein in the translationally coupled two-cistron expression system; and
(F) shows the result of solubilization of BMP2 according to the distance between the cistrons in the translationally coupled two-cistron expression system;
the slash (/) between the proteins represents the order of the genes encoding the respective proteins in mRNA. Relative expression were quantified by using an ImageJ v1.48 software (NJI, USA), based on the result of SDS-PAGE. The error bars in (C), (D), (E), and (F) represent ±standard deviations from three independent experiments.
(A) shows distribution of a DnaJ/DnaK-binding sequence in lipase TliA, indicated by red rectangular blocks. The binding site was predicted by using a Limbo chaperone binding site predictor; and
(B) shows the result of solubilization of TliA1 fragment by the CLEX system. TliA1+DnaJ represents simple coexpression of TliA and DnaJ, DnaJ/TliA1 represents CLEX system using DnaJ as the first cistron. Relative expression levels were quantified by using an ImageJ v1.48 software (NJI, USA), based on the result of SDS-PAGE. The error bars in (B) represent ±standard deviations from three independent experiments.
To achieve the above objects, an aspect of the present invention provides a composition for solubilizing a target protein, the composition including an expression cassette including a promoter and a gene encoding a fusion protein of a chaperone and a RNA-binding domain, or an expression vector including the expression cassette.
The fusion protein expressed by the expression cassette or the expression vector binds to mRNA including a sequence of the target protein via the RNA-binding domain, and as a result, the chaperone of the fusion protein exists near the translated target protein, leading to correct folding of the target protein. That is, the fusion protein may increase solubility of the target protein in a general manner, irrespective of the kind of the target protein. In this case, mRNA including the sequence of the target protein may have a hairpin structure (including an RNA site to which the RNA-binding domain of the fusion protein binds) at 3′-UTR in order to bind with the fusion protein.
As used herein, the term “chaperone” refers to a protein involved in folding and unfolding or assembly and disassembly of other proteins. Specifically, the chaperone of the present invention functions to induce or help folding of the target protein, thereby increasing solubility of the target protein. In the present invention, the chaperone, in the form of the fusion protein with the RNA-binding domain, binds to mRNA including the sequence of the target protein to help folding of the target protein near the translated target protein.
In the present invention, the chaperone is not limited to an origin (e.g., derived from a microorganism) and a specific sequence thereof, as long as it is involved in folding of the target protein. Specifically, the chaperone may be a protein derived from prokaryotes (e.g., bacteria, etc.), eukaryotes, or archaea, and a sequence thereof may be obtained from a known database, GenBank or NCBI, etc. Specifically, the chaperone may be DnaJ, Dnak, GroEL, GroES, HtpG, ClpA, ClpX, or ClpP, and more specifically, DnaJ or Dnak. The DnaJ or Dnak may have an amino acid sequence encoded by a nucleotide sequence of SEQ ID NO: 74 or 75, but is not limited thereto. Further, the chaperone may be Hsp40, Hsp60, Hsp70, Hsp90, Hsp100, or Hsp104.
In the present invention, the chaperone may be in a combination of two or more of the same or different kind of chaperone units. Specifically, the chaperone may be DnaJK in which DnaJ and Dnak fuse to each other, but is not limited thereto.
As used herein, the term “RNA-binding domain (RBD)” refers to a domain binding to RNA. Specifically, the RNA-binding domain of the present invention forms a complex or fusion protein, together with the chaperone, such that the complex or fusion protein including the chaperone binds to mRNA including the sequence of the target protein. That is, the RNA-binding domain locates the chaperone of the fusion protein near the target protein to be translated by ribosomes, leading to correct folding of the target protein through the adjacent chaperone.
In the present invention, the RNA-binding domain is not limited to an origin (e.g., derived from a microorganism) and a specific sequence thereof, as long as it binds to mRNA including the target protein. Specifically, the RNA-binding domain may be K Homology (KH) domain, bacteriophage MS2 coat protein, bacteriophage PP7 coat protein, sterile alpha motif (SAM), or RNA recognition motif (RRM), and a sequence thereof may be obtained from a known database, GenBank of NCBI, etc. Further, the KH domain may have an amino acid sequence encoded by a nucleotide sequence of SEQ ID NO: 76, but is not limited thereto.
In the present invention, the fusion protein may include one or more chaperones and the RNA-binding domain. Further, in the fusion protein, the chaperone may be linked to the N-terminus; the C-terminus; or the N-terminus and C-terminus of the RNA-binding domain, directly or via a linker. In the fusion protein, the RNA-binding domain may be linked to the N-terminus; the C-terminus; or the N-terminus and C-terminus of the chaperone, directly or via a linker.
The linker is not particularly limited, as long as it allows the chaperone and the RNA-binding domain of the fusion protein to exhibit activity. Specifically, the chaperone and the RNA-binding domain may be linked by using an amino acid such as glycine, alanine, leucine, isoleucine, proline, serine, threonine, asparagine, aspartic acid, cysteine, glutamine, glutamic acid, lysine, argininic acid, etc., and more specifically, by using several amino acids such as valine, leucine, aspartic acid, glycine, alanine, proline, etc. Much more specifically, for easy genetic manipulation, 1 to 10 amino acids of glycine, valine, leucine, aspartic acid, etc. may be linked and used. For example, in the present invention, the fusion protein was prepared by linking the C-terminus of the chaperone to the N-terminus of the RNA-binding domain via a G4S linker.
The fusion protein may incline a polypeptide having a sequence in which one or more amino acid residues are different from those of an amino acid sequence of the wild-type of each protein or domain included in the fusion protein. Amino acid exchanges in proteins and polypeptides, which do not generally alter the activity of molecules, are known in the art. The most commonly occurring exchanges are Ala/Ser, Val/Ile, Asp/Glu, Thr/Ser, Ala/Gly, Ala/Thr, Ser/Asn, Ala/Val, Ser/Gly, Thy/Phe, Ala/Pro, Lys/Arg, Asp/Asn, Leu/Ile, Leu/Val, Ala/Gln, Asp/Gly, in both directions. The fusion protein may also include proteins with increased structural stability against heat, pH, etc. or increased activities by mutation or modification of the amino acid sequences.
The fusion protein or the polypeptide constituting the fusion protein may be prepared by a chemical peptide synthetic method known in the art, or prepared by amplifying the gene encoding the fusion protein by polymerase chain reaction (PCR) or synthesizing the gene by a known method and then cloning the gene into an expression vector to express the gene.
In a specific embodiment of the present invention, the fusion protein may have a form (DnaJK-KH), in which DnaJ and Dnak (DnaJK) as the chaperones and the KH domain as the RNA-binding domain bind to each other, but is not limited thereto.
The protein of the present invention may include not only the amino acid sequence encoded by the nucleotide sequence of each SEQ ID NO., but also an amino acid sequence having 80% or more, specifically 90% or more, more specifically 95% or more, much more specifically 97% or more sequence homology with the above sequence. Any amino acid sequence having the homology may be included without limitation, as long as it is an amino acid sequence of a protein having an efficacy that is substantially identical or corresponding to that of the above protein. It is apparent that an amino acid sequence having a deletion, modification, substitution, or addition of some sequence may be also within the scope of the present invention, as long as the amino acid sequence has the homology.
Furthermore, the gene encoding the protein of the present invention may include not only the nucleotide sequence of each SEQ ID NO., but also a nucleotide sequence having 80% or more, specifically 90% or more, more specifically 95% or more, much more specifically 98% or more, and most specifically 99% or more sequence homology with the above sequence, as long as it is a nucleotide sequence encoding a protein having an efficacy that is substantially identical or corresponding to that of the above protein. It is apparent that a nucleotide sequence having a deletion, modification, substitution, or addition of some sequence may be also within the scope of the present invention, as long as the nucleotide sequence has the homology.
As used herein, the term “homology” is intended to indicate the degree of similarity of a nucleotide sequence of a gene encoding a protein or an amino acid sequence, and if homology is sufficiently high, an expression product of the corresponding gene has the same or similar activity.
As used herein, the term “promoter” refers to a nucleotide sequence that regulates expression of another nucleotide sequence operably linked thereto in appropriate host cells, and as used herein, “operably linked” refers to a functional linkage between a nucleotide expression control sequence and a nucleotide sequence encoding a target protein or RNA in such a manner as to allow general functions. For example, a promoter and a nucleotide sequence encoding a protein or RNA are operably linked to influence expression of a coding sequence. A method for operably linking the promoter and the nucleotide sequence encoding the protein or RNA may be performed by a genetic recombinant technique well known in the art, and site-specific DNA cleavage and ligation may be achieved by using enzymes generally known in the art.
As used herein, the term “expression vector” refers to a DNA construct including the expression cassette, which is operably linked to an appropriate regulatory sequence to express the target protein in an appropriate host. The regulatory sequence may include a promoter capable of initiating transcription, an arbitrary operator sequence for regulating transcription, a sequence encoding an appropriate mRNA ribosome binding site, and sequences for regulating the termination of transcription and translation. Once a vector is transformed into an appropriate host, the vector may replicate and function independently of the host genome, or it may be integrated into the genome itself.
As long as the expression vector used in the present invention is replicable in host cells, any vector known in the art may be used without particular limitations. Example of the vector commonly used may include a natural or recombinant plasmid, cosmic, virus, and bacteriophage. Example of the phage vector or the cosmid vector may include pWE15, M13, MBL3, MBL4, IXII, ASHII, APII, t10, t11, Charon4A, Charon21A, etc. The plasmid vector may include pBR type, pUC type, pBluescriptII type, pGEM type, pTZ type, pCL type, pET type, pCP type etc. Specifically, pDZ, pACYC177, pACYC184, pCL, pECCG117, pUC19, pBR322, pMW118, pCClBAC, pCP22, pAMT7, pET16b vector, etc. may be used, but is not limited thereto.
The vector usable in the present application is not particularly limited, and any known expression vector may be used. Also, the polynucleotide encoding the target protein may be inserted into the chromosome by a vector for chromosomal gene insertion. The insertion of the polynucleotide into the chromosome may be achieved by any method known in the art, for example, by homologous recombination, but is not limited thereto. A selection marker for confirming the chromosomal insertion may be further included. The selection marker is used to select a cell transformed with the vector, i.e., to confirm the insertion of the target nucleotide molecule. Markers that provide selectable phenotypes, such as drug resistance, auxotrophy, resistance to cytotoxic agents, or surface protein expression, may be used. Only cells expressing the selection marker are able to survive or to show different phenotypes under the environment treated with the selective agent, and thus the transformed cells may be selected.
The composition of the present invention may further include an expression cassette including a promoter, a gene encoding a target protein, and a gene of a hairpin structure at the 3′-terminus thereof, or an expression vector including the expression cassette. The promoter and the expression vector are the same as described above.
mRNA transcribed by the expression cassette or the expression vector has RNA of the target protein and the hairpin structure at 3′-UTR thereof, and the hairpin and the RNA-binding domain of the fusion protein may bind to each other. By this binding, a scaffold, in which the chaperone region of the fusion protein is recruited to 3′-UTR of mRNA including the sequence of the target protein, is formed. In the present invention, this scaffold is called chaperone-recruiting mRNA scaffold (CRAS).
As used herein, the term “hairpin” means a structure consisting of a double-stranded stem and a loop, which is formed by internal hydrogen bonding between bases of single-stranded DNA or RNA. The hairpin is used interchangeably with stem-loop. A sequence of the hairpin may be any sequence known in the art.
Further, the hairpin structure may have an RNA sequence (site), to which the RNA-binding domain of the fusion protein, i.e., the RNA-binding domain linked to the chaperone binds. Therefore, the gene of the hairpin structure included in the expression cassette or expression vector may include a sequence transcribing the RNA sequence to which the RNA-binding domain binds.
In the present invention, the expression cassette or expression vector may further include a spacer gene between the gene encoding the target protein and the gene of the hairpin structure. The spacer gene refers to a non-coding gene between the genes, and any sequence known in the art may be used.
In the present invention, the gene of the hairpin structure may have one or more of the gene of the hairpin structure. In this case, one or more of the hairpin, to which the fusion protein of the present invention binds, may exist in mRNA including the sequence of the target protein, and therefore, a larger number of chaperones may be recruited to the mRNA.
One more of the of the hairpin structure may include a spacer gene between hairpin structures, but is not limited thereto.
In the present invention, the expression cassette or expression vector may further include a ribosome-binding site. The “ribosome-binding site (RBS)” refers to a ribosome-binding site that allows mRNA transcribed from DNA to bind with ribosomes in host cells during initiation of protein biosynthesis. The ribosome-binding site may be any sequence known in the art.
Another aspect of the present invention provides a transformant including the expression cassette including the promoter and the gene encoding the fusion protein of the chaperone and the RNA-binding domain or the expression vector including the expression cassette. Specifically, the transformant may further include the expression cassette including the promoter, the gene encoding the target protein, and the gene of the hairpin structure at the 3′-terminus thereof, or the expression vector including the expression cassette.
The promoter, the chaperone, the RNA-binding domain, the fusion protein, the gene of the hairpin structure, the expression cassette, and the expression vector are the same as described above.
The transformant may produce the target protein with improved solubility by CRAS system.
As used herein, the term “transformation” means that the expression cassette of the present invention or the expression vector including the expression cassette is introduced into a host cell in such a way that the protein encoded the polynucleotide of the expression cassette or the expression vector is expressed in the host cell. As long as the transformed polynucleotide may be expressed in the host cell, it may be either integrated into and placed in the chromosome of the host cell, or exist extrachromosomally. Further, the polynucleotide may include DNA and RNA encoding the target protein. The polynucleotide may be introduced in any form, as long as it may be introduced into the host cell and expressed therein. For example, the polynucleotide may be introduced into the host cell in the form of an expression cassette, which is a gene construct including all elements required for its autonomous expression. Generally, the expression cassette may include a promoter operably linked to the polynucleotide, transcriptional termination signals, ribosome binding sites, or translation termination signals. The expression cassette may be in the form of a self-replicable expression vector. Also, the polynucleotide as it is may be introduced into the host cell and operably linked to sequences required for expression in the host cell, but is not limited thereto. Any transformation method may be employed, as long as it is used to introduce a nucleotide into a host cell. Depending on the host cell, a suitable standard technique may be selected as known in the art. For example, electroporation, calcium phosphate (CaPO4) precipitation, calcium chloride (CaCl2)) precipitation, microinjection, a polyethyleneglycol (PEG) technique, a DEAE-dextran technique, a cationic liposome technique, and a lithium acetate DMSO technique may be used, but is not limited thereto.
The transformant is not limited, as long as the protein encoded by the polynucleotide of the expression cassette or expression vector of the present invention may be expressed therein. The transformant may be specifically a plant cell, a bacterial cell, a fungal cell, a yeast cell, or an animal cell, and more specifically, a microorganism of the genus Escherichia.
Still another aspect provides a kit for producing the target protein, the kit including the composition for solubilizing the target protein or the transformant.
Still another aspect provides the expression cassette including the promoter and the gene encoding the fusion protein of the chaperone and the RNA-binding domain, or the expression vector including the expression cassette.
The promoter, the chaperone, the RNA-binding domain, the fusion protein, the expression cassette, and the expression vector are the same as described above.
Still another aspect provides the expression cassette including the promoter, the gene encoding the target protein, and the gene of the hairpin structure at the 3′-terminus thereof, or the expression vector including the expression cassette.
The promoter, the gene of the hairpin structure, the expression cassette, and the expression vector are the same as described above.
Still another aspect provides a kit for producing the target protein, the kit including the composition or the transformant.
Still another aspect provides a method for producing the target protein, the method including the steps of:
(i) preparing a transformant including (a) the expression cassette including the promoter and the gene encoding the fusion protein of the chaperone and the RNA-binding domain or the expression vector including the expression cassette and (b) the expression cassette including the promoter, the gene encoding the target protein, and the gene of the hairpin structure at the 3′-terminus thereof, or the expression vector including the expression cassette; and
(ii) culturing the transformant.
The promoter, the chaperone, the RNA-binding domain, the fusion protein, the gene of the hairpin structure, the expression cassette, the expression vector, and the transformant are the same as described above.
The step (i) of preparing the transformant may be a step of additionally introducing (a) the expression cassette including the promoter and the gene encoding the fusion protein or the expression vector including the expression cassette into a host cell previously having (b) the expression cassette including the promoter, the gene encoding the target protein, and the gene of the hairpin structure at the 3′-terminus thereof, or the expression vector including the expression cassette, but is not limited thereto.
Further, the step (i) of preparing the transformant may be a step of additionally introducing (b) the expression cassette including the promoter, the gene encoding the target; protein, and the gene of the hairpin structure at the 3′-terminus thereof, or the expression vector including the expression cassette into a host cell previously having (a) the expression cassette including the promoter and the gene encoding the fusion protein or the expression vector including the expression cassette, but is not limited thereto.
Furthermore, the step (i) of preparing the transformant may be a step of introducing into the host cell (a) the expression cassette including the promoter and the gene encoding the fusion protein or the expression vector including the expression cassette and (b) the expression cassette including the promoter, the gene encoding the target protein, and the gene of the hairpin structure at the 3′-terminus thereof, or the expression vector including the expression cassette sequentially, simultaneously, or reversely, but is not limited thereto.
In the production method, the step of (ii) culturing the transformant may be performed by, but is not particularly limited to, a known batch, continuous, or fed-batch culture method. In this regard, culture conditions may be, but are not particularly limited to, maintained at optimal pH (e.g., pH 5 to pH 9, specifically pH 6 to pH 8, and most specifically pH 6.8) by using basic compounds (e.g., sodium hydroxide, potassium hydroxide, or ammonia) or acidic compounds (e.g., phosphoric acid or sulfuric acid) and at an aerobic condition by adding oxygen or oxygen-containing gas mixture to a cell culture. The culture temperature may be maintained at 20° C. to 45° C., and specifically at 25° C. to 40° C., and cultured for about 10 hours to about 160 hours, but is not limited thereto. The target protein produced by the culturing may be secreted to a culture medium or may remain within cells.
Further, the culture medium to be used may include sugars and carbohydrates (e.g., glucose, sucrose, lactose, fructose, maltose, molasses, starch, and cellulose), oils and fats (e.g., soybean oil, sunflower seed oil, peanut oil, and coconut oil), fatty acids (e.g., palmitic acid, stearic acid, and linoleic acid), alcohols (e.g., glycerol and ethanol), and organic acids (e.g., acetic acid) as a carbon source individually or in combination, but is not limited thereto. As a nitrogen source, nitrogen-containing organic compounds (e.g., peptone, yeast extract, beef stock, malt extract, corn steep liquor, soybean meal powder, and urea), or inorganic compounds (e.g., ammonium sulfate, ammonium chloride, ammonium phosphate, ammonium carbonate, and ammonium nitrate) may be used individually or in combination, but is not limited thereto. As a phosphorus source, potassium dihydrogen phosphate, dipotassium phosphate, or sodium-containing salt corresponding thereto may be used individually or in combination, but is not limited thereto. The culture medium may include other essential growth-stimulating substances such as metal salts (e.g., magnesium sulfate or iron sulfate), amino acids, and vitamins.
The production method may further include the step of collecting the target protein from the transformant or the culture thereof. The method for collecting the target protein produced in the culturing step may be performed by an appropriate method known in the art according to the culturing method, thereby collecting the target protein from the transformant or the culture thereof. For example, cell lysis, centrifugation, filtration, anion exchange chromatography, crystallization, HPLC, etc. may be used. The target protein may be collected from the transformant or the culture thereof by an appropriate method known in the art.
Still another aspect provides a method for solubilizing a target protein, the method including the steps of:
(i) preparing a transformant including (a) the expression cassette including the promoter and the gene encoding the fusion protein of the chaperone and the RNA-binding domain or the expression vector including the expression cassette and (b) the expression cassette including the promoter, the gene encoding the target protein, and the gene of the hairpin structure at the 3′-terminus thereof, or the expression vector including the expression cassette; and
(ii) culturing the transformant.
The promoter, the chaperone, the RNA-binding domain, the fusion protein, the gene of the hairpin structure, the expression cassette, the expression vector, the transformant, step (i), and step (ii) are the same as described above.
Still another aspect of the present invention provides an expression cassette including (a) a cistron encoding a chaperone and (b) a cistron encoding a target protein, or an expression vector including the expression cassette.
The chaperone, the expression cassette, and the expression vector are the same as described above.
Both the chaperone and the target protein are translated by the expression cassette or the expression vector, and as a result, the chaperone may increase correct folding of the target protein. In the present invention, the system is called a CLEX (chaperone-substrate co-localized expression) system.
(a) and (b) may be connected in either sequential or reverse order from 5′- to 3′-direction, and the stop codon of (a) or (b) may be overlapped with the start codon of (b) or (a), but is not limited thereto.
Still another aspect of the present invention provides a composition for solubilizing the target protein, the composition including the expression cassette including (a) the cistron encoding the chaperone and (b) the cistron encoding the target protein, or the expression vector including the expression cassette.
The chaperone, the expression cassette, and the expression vector are the same as described above.
Still another aspect of the present invention provides a transformant including the expression cassette including (a) the cistron encoding the chaperone and (b) the cistron encoding the target protein, or the expression vector including the expression cassette.
The chaperone, the expression cassette, the expression vector, and the transformant are the same as described above.
Still another aspect of the present invention provides a kit for producing the target protein, the kit including the expression cassette including (a) the cistron encoding the chaperone and (b) the cistron encoding the target protein; the expression vector including the expression cassette; the composition for solubilizing the target protein including the expression cassette or the expression vector; or the transformant including the expression cassette or the expression vector.
The chaperone, the expression cassette, the expression vector, and the transformant are the same as described above.
Still another aspect of the present invention provides a method for producing the target protein, the method including the steps of:
(i) preparing the transformant including the expression cassette including (a) the cistron encoding the chaperone and (b) the cistron encoding the target protein, or the expression vector including the expression cassette; and
(ii) culturing the transformant.
The chaperone, the expression cassette, the expression vector, the transformant, the culturing, and the production method are the same as described above.
The production method may further include the step of collecting the target protein from the transformant or the culture thereof.
Hereinafter, the construction and effect of the present invention will be described in more detail with reference to Examples. However, these Examples are for illustrative purposes only, and the scope of the present invention is not intended to be limited by these Examples.
E. coli XL1-Blue (Stratagene, La Jolla, Calif., USA) and BL21(DE3) (Novagen, Madison, Wis., USA) strains were used. XL-1 Blue strain was used for general cloning and BL21(DE3) strain was used for gene expression. A strain free of dnaJ and dnaK genes was prepared by known PI transduction (Current protocols in molecular biology, 2007, Chapter 1, Unit1 17). BW25113 strains having single knock-outs (ΔdnaJ and ΔdnaK) were obtained from Keio collection, and used as a donor strain for BL21(DE3). Colony PCR was performed to screen deletion strains by using primer pairs around a target gene (Table 1). Then, a kanamycin resistance cassette was removed from the integration region in the host genome by using a pCP22 plasmid-derived FLP recombinant enzyme. Oligonucleotides (Genotech, Daejeon, Korea) and genes (Bioneer, Daejeon, Korea) used in the present invention are described in Table 1. Chemicals were purchased from Sigma-Aldrich (St. Louis, Mo., USA).
For a chaperone-recruiting mRNA scaffold (CRAS) system, an expression vector encoding a fusion protein of a chaperone and an RNA-binding domain or a target recombinant protein was constructed.
First, in order to prepare a medium-copy number vector, based on a pACYCDuet-1 vector having two strong T7 promoters (Novagen, Madison, Wis., USA), p15A which is a low-copy number origin of replication was replaced by pBR322 which is a medium-copy number origin of replication to prepare a pAMT7 vector.
Chaperone genes, dnaJ (SEQ ID NO: 74) and dnaK (SEQ ID NO: 75), were amplified by PCR, based on genomic DNA of E. coli BL21 (DE3). For high expression of dnaJ and dnaJ-KH, a synthetic ribosome-binding site was designed by using a ribosome-binding site (RBS) calculator, and incorporated into the 5′-terminus of DnaJNcoF primer.
To prepare the fusion protein of the chaperone and the RNA-binding domain, a gene (SEQ ID NO: 76) of Nova-1. KH3 RNA-binding domain (KH) having a sequence optimized for expression in E. coli was synthesized and bound downstream of dnaJ or dnaJK by recombinant PCR.
A gene encoding the target recombinant protein was amplified by PCR, based on various origins: scfv and br2-scfv were amplified based on pBR2ScFv (PloS one, 2013, 6, e66084), ugd and ubiC were amplified based on the genomic DNA of E. coli BL21(DE3), and adh1p was amplified based on the genomic DNA of S. cerevisae. HIV1-Pr gene was prepared by recombinant PCR (Protein expression and purification, 2015, 116, 59-65). tliA was amplified based on pTliA (Acs Catal, 2015, 5, 5016-5025). For green fluorescent protein (GFP) complementation assay, superfolder green fluorescent protein (sfGFP) (Bioneer, Daejeon, Korea) was used as a template to amplify the N-terminus and the C-terminus of sfGFP. For high expression of sfGFP, synthetic RBS and the N-terminus of sfGFP were designed by using an PBS calculator and incorporated into the 5′-terminus of sfGFPXbaP primer. The used primers are shown in Table 1.
For purification, 6×His-tag was added at the N-terminus of a forward primer for HIV1-Pr and at the C-terminus of a reverse primer for the target recombinant protein. For high expression of HIV1-Pr, synthetic RBS was designed by using the RBS calculator, and included at the 5′-terminus of HIVPrXbaF primer. PCR products and expression vectors corresponding thereto were digested with restriction enzymes in Table 1. Then, the chaperone was ligated to pAMT7 and the recombinant protein was ligated to pET16b.
For the chaperone-recruiting mRNA scaffold (CRAS) system, an RNA scaffold was prepared. In detail, a sequence of a binding loop for KH domain used in the RNA scaffold system was designed by RNA Designer and mFold (Journal of molecular biology, 2004, 336, 607-624; Science 1989, 244, 48-52). A sequence constraint is as follows;
5′-NNNNNNNNACCTAGATCACC(SEQ ID NO: 77)NNNNNNNN-3′, wherein N represents an arbitrary nucleotide of 8 bp for a stem structure, and a sequence underlined represents a loop structure including a binding sequence for the KH RNA-binding domain.
In the case of an RNA scaffold system having a plurality of binding loops, a stem-loop structure was designed by a similar approach to add a 5-nt spacer between respective stem-loop structures. A stem-loop structure for the target protein gene was prepared by PCR using a reverse primer having a sequence of the stem-loop structure in 3′-UTR.
CLEX system of DnaJ chaperone and recombinant protein (ScFv, UbiC, Leptin, HIV1-Pr, BMP2, or TliA) was prepared in two different arrangements in which DnaJ was placed in the first or second cistron. A 12-nt region (5′-GAGGAGGTGGAA-3′, SEQ ID NO: 79) encoding an amino acid sequence of EEVE (EEVE, SEQ ID NO: 78) and including a Shine-Dalgarno (SD) sequence (underlined) was introduced at the C-terminus of the gene in the first cistron in order to improve translation initiation of the gene in the second cistron. To secure translational coupling and to promote interaction between DnaJ and the recombinant protein, the stop codon of the first cistron and the start codon of the second cistron were overlapped through 1 nt (5′-TAATG-3′). To prepare the CLEX system, the DnaJ chaperone and the recombinant protein were assembled by recombinant PCR. Primer sequences are shown in Table 2. Each PCR product and pET16b were digested by restriction enzyme described in Table 2, and ligated to the expression vector.
E. coli BL2(DE3) was co-transformed with the plasmid of the chaperone and the plasmid of the target recombinant protein. Thereafter, the E. coli was inoculated in 3 ml of LB (Lysogenic Broth) medium (10 g/l tryptone, 5 g/l yeast; extract, and 10 g/l sodium chloride) supplemented with ampicillin and chloramphenicol at a final concentration of 50 μg/ml and 25 μg/ml, respectively and cultured in a rotary shaker (200 rpm) at 37° C. overnight. Then, 1 ml of the culture was inoculated in 100 ml of LB medium supplemented with ampicillin and chloramphenicol at a final concentration of 50 μg/ml and 25 μg/ml, respectively and then cultured at 200 rpm and 37° C. When OD600 reached 0.5-0.6, 0.5 mM isopropyl β-d-1-thiogalactopyranoside (IPTG) was added to induce expression of the chaperone and the target protein, and then, further cultured for 4 hours (37° C., 200 rpm). Thereafter, when OD600 value reached 2, 3 ml of cell culture (including about 1.6×109 cells) was centrifuged and resuspended in 500 μl of 10 mM Tris-EDTA (TE) buffer (pH 7.6), followed by sonication. Soluble and insoluble fractions were separated by centrifugation (21, 600×g, 15 min, 4° C.). An insoluble pellet was washed with 1%. Triton X-100 twice, and resuspended in 10 mM TE buffer (pH 7.6) and used as an insoluble fraction. Solubility of the target recombinant protein was measured by dodecyl sodium sulfate polyacrylamide gel electrophoresis (SDS-PAGE). Band intensity of the whole cell lysates and the soluble fractions of the target recombinant proteins stained with Coomassie Blue in the SDS-PAGE gel were compared by using an ImageJ software to calculate relative percentages of the soluble target recombinant proteins.
E. coli BL21(DE3) having pAMT7-DnaJ-KH was cultured in an LB medium supplemented with chloramphenicol at a final concentration of 25 μg/ml. The E. coli was cultured in a rotary shaker (200 rpm) at 37° C. overnight. Then, 1 ml of the culture was inoculated in 100 ml of LB medium supplemented with chloramphenicol at a final concentration of 25 μg/ml, and cultured at 200 rpm at 37° C. When OD600 reached 0.6, 0.5 mM IPTG was added to induce expression of the DnaJ-KH protein, and then, further cultured for 4 hours (37° C., 200 rpm). Thereafter, 20 ml of the cells were harvested by centrifugation, and resuspended in 1× Native IMAC lysis buffer (Biorad, Hercules, Calif., USA), followed by sonication. Transparent cell lysate was further centrifuged (21,600×g, 15 min, 4° C.). DnaJ-KH was purified from a soluble fraction by a Native IMAC method of an automated Profinia™ protein purification system (Biorad, Hercules, Calif., USA). Thereafter, the purified DnaJ-KH was dialyzed against phosphate-buffered saline (PBS) at pH 7.4 and 25 mM Tris-HCl buffer (pH 8.8) containing 5 mM DTT, 100 mM NaCl, and 10% glycerol.
Correct folding of the recombinant protein in the CRAS system was evaluated by a known GFP complementation assay (Nature methods, 2012, 9, 671-675) with slight modification. E. coli BL21(DE3) having pAMT7-DnaJK-KH and pET16b-sfGFP/pET16b-CsfGFP-NsfGFP/pET16b-CsfGFP-NsfGFP3L was inoculated in an LB medium supplemented with ampicillin and chloramphenicol at a final concentration of 50 μg/ml and 25 μg/ml, respectively and cultured in a rotary shaker (200 rpm) at 37° C. overnight. Then, 1 ml of the culture was inoculated in 100 ml of LB medium supplemented with ampicillin and chloramphenicol at a final concentration of 50 μg/ml and 25 μg/ml, respectively, and cultured at 200 rpm at 37° C. When OD600 reached 0.6, 0.5 mM IPTG was added to induce expression of DnaJ-KH and the target protein, and then, further cultured for 4 hours (37° C., 200 rpm). Thereafter, 1 ml of the cells were harvested by centrifugation, and resuspended in 500 μl of PBS (pH 7.4) and diluted until OD600 reached 1. Thereafter, the cells were loaded in a 96-well black plate (SPL Life Sciences, Pocheon, Korea). Tecan Infinite F200 PRO instrument (Tecan Group Ltd., Mannedorf, Switzerland) was used to measure fluorescence intensity of each well (λexc=488 nm/λexc=530 nm).
mRNA of anti p21ras-ScFvd having no loop, one loop, or three loops was prepared by using a HiScribe™ T7 High Yield RNA Synthesis Kit (New England Biolab, Beverly, Mass.). PBS (pH 7.4) having 0.1 nM mRNA and 200 μM of purified DnaJ-KH were mixed. Thereafter, the mixture was incubated at 25° C. for 30 minutes, and analyzed by electrophoresis on 2% agarose gel in Tris-boric acid-EDTA buffer. Electrophoresis was performed at constant 50 V for 20 minutes. Thereafter, the gel was visualized by using a Gel-Doc gel documentation system (Bio-Rad, Hercules, Calif.).
E. coli BL21(DE3) having pET16b-DnaJLep or pET16b-Lep was cultured in an LB medium supplemented with ampicillin at a final concentration of 50 μg/ml in a rotary shaker (200 rpm) at 37° C. overnight. Then, 1 ml of the culture was inoculated in 100 ml of LB medium supplemented with ampicillin at a final concentration of 50 μg/ml, and cultured at 200 rpm at 37° C. When OD600 reached 0.6, 0.5 mM IPTG was added and then, further cultured for 4 hours (37° C., 200 rpm) to induce expression of DnaJ and leptin. Thereafter, 20 ml of the cells were harvested by centrifugation, and resuspended in 1×PBS (pH 7.4), followed by sonication. Transparent cell lysate was further centrifuged (21, 600×g, 15 min, 4° C.). Activity of the purified leptin was analyzed by ELISA. 40 μg of a soluble fraction of the transparent cell lysate (8 mg/ml) was loaded in a 96-well plate (Thermo Fisher Scientific, Waltham, Mass., USA) with carbonate-bicarbonate buffer at 37° C. for 1 hour. Thereafter, the plate was washed with 1×PBS-T (1×PBS containing 0.5% Tween-20) three times, and then each well was blocked with bovine serum albumin (Invitrogen, Carlsbad, Calif., USA) for 1 hour. Thereafter, an anti-leptin antibody (Abeam, Cambridge, UK) was added to each well, and incubated at room temperature for 1 hour. After washing the plate, the plate was incubated with a horseradish peroxide-conjugated anti-rabbit monoclonal antibody (Cell Signaling Technology, Beverly, Mass., USA) at room temperature for 1 hour. Thereafter, the plate was washed with 1×PBS-T four times, and to initiate peroxidase reaction, 100 μl of TMB (3,3′,5,5′-Tetramethylbenzidine) peroxidase substrate (BD, Franklin Lakes, N.J., USA) was added. To stop the reaction, 50 μl of 2 M H2SO4 was added to each well. An ELISA reader (Infinite M200 PRO; Tecan Group Ltd., Mannedorf, Switzerland) was used to measure absorbance of each well at 450 nm wavelength.
DnaJ, which is an essential element for a molecular chaperone system in bacteria, interacts with a translated or misfolded protein and promotes interaction between foldase chaperone such as DnaK or GroESL and a protein in order to produce a correctly folded protein. For rapid folding immediately after translation, DnaJ was placed at the translation termination site of the target protein to increase spatial proximity between DnaJ and the target protein. For anchoring, human Nova-1 protein-derived sequence-specific RNA-binding domain (KH) was bound with DnaJ (
As a result, when the CRAS system was applied, solubility of ScFv, BR2-ScFv, and UGD was greatly increased, as compared with co-expression of the protein and DnaJ-KH without spatial restriction (without binding loops) or single expression of the protein (
The CRAS system did not increase solubility of Adh1p, HIV1-Pr, UbiC, Leptin, and BMP2 (
Up to three of 3′UTR hairpin loops showed no significant effect on solubility increase of the protein. However, in this case, a percentage of the soluble fraction of ScFv was maintained constant for 18 hours of expression, whereas the solubility of the protein having one 3′UTR hairpin loop was gradually reduced (
Further, a space of 30 nt or less between the stop codon and the first loop structure had no influence on solubility increase of ScFv (
DnaK chaperone system is naturally present in E. coli, and the system of the present invention anchors the DnaJ chaperone to a translation site, and therefore, dnaJ or dnaK was deleted in the genome of E. coli, and then whether ScFv solubility was increased or not was analyzed by using CRAS system. Since only DnaJ targeting mRNA is able to efficiently interact with ScFv, and the function of the CRAS system may be reduced in ΔdnaK strain due to lack, of foldase chaperone, it was predicted that the same solubility enhancement efficiency would be observed in the ΔdnaJ strain. The ΔdnaJ strain was as predicted, but the ΔdnaK strain showed slightly increased solubility of ScFV (
It was intended to increase efficiency of the CRAS system by a method of restricting the spatial extent of the chaperone proteins and by a method of increasing the turnover number of each chaperone. To this end, a chimeric chaperone DnaJ-DnaK-KH (DnaJK-KH), which is a complex of DnaJ, DnaK, and KH, was prepared and applied to the CRAS system.
As a result, it was found that solubility of the insoluble proteins was very greatly increased, and up to 90% of the expressed proteins had soluble forms (
To examine functional folding of the target protein in vivo, a GFP complementation assay was designed by splitting superfolder green fluorescent protein (sfGFP) into N-terminus (N-sfGFP) and C-terminus (C-sfGFP) (
To demonstrate importance of spatial restriction to the chaperone and the substrate without limiting the number of chaperone adjacent to the translational machinery, a CLEX (chaperone-substrate co-localized expression) system which is a translationaily coupled two-cistron expression system capable of solubilizing very insoluble proteins was prepared (
As a result, the CLEX system having the overlapping stop-start codons was very effective in solubilizing very insoluble proteins including BMP2, which are not solubilized by the CRAS system. On average, 60% to 90% of the recombinant proteins were obtained as soluble forms (
Further, the role of DnaJ was determined by the order of the two cistrons, which generates a difference in the expression level and reaction time (
Further, it was confirmed that the protein solubility may be increased by increasing a sequence between the genes to 10 nt (
In order to identify the role of DnaK in solubilization of the recombinant protein by the CLEX system, ΔdnaK strain was used (
As a result, it was confirmed that DnaJ was able to sufficiently solubilize BMP2 protein even in the absence of DnaK in the CLEX system, indicating that DnaJ has foldase activity independently of DnaK or other chaperone system is involved in the protein folding in E. coli. Accordingly, it can be seen that the CLEX system using the single chaperone DnaJ has very high efficiency.
Although various recombinant proteins were successfully solubilized, solubility of a high molecular weight protein such as TliA (52 kDa lipase) was not highly increased. This is because high molecular weight proteins tend to have many domains that are likely to be misfolded, and the system of the present invention is insufficient to promote the interaction between the chaperone and a number of substrates before irreversible misfolding. To examine this, tliA gene was cut into two fragments (tliA1, tliA2), and then applied to the system of the present invention.
As a result, TliA1 was very insoluble whereas TliA2 was soluble (
The CRAS system places the chaperone at the 3′UTR of mRNA whereas the CLEX system produces a new chaperone molecule together with the target protein to promote a rapid interaction therebetween. Considering solubilization activity, the CLEX system is superior to the CRAS system. However, the CLEX system is limited in solubilizing a number of proteins at the same time, because dnaJ must be cloned into two cistron systems for each recombinant protein. In contrast, the CRAS system having three loops and DnaJK-KH may improve the solubility of many proteins expressed in the cells by simply introducing the 3′UTR KH-binding hairpin sequence into each mRNA. Thus, the CRAS system may be used to increase soluble fractions of functional enzymes involved in the metabolic pathway, and the CLEX system is more suitable for the production of one kind of recombinant protein, such as therapeutic peptides, enzymes, or peptides.
Based on the above description, it will be understood by those skilled in the art that the present invention may be implemented in a different specific form without changing the technical spirit or essential characteristics thereof. Therefore, it should be understood that the above embodiment is not limitative, but illustrative in ail aspects. The scope of the invention is defined by the appended claims rather than by the description preceding them, and therefore all changes and modifications that fall within metes and bounds of the claims, or equivalents of such metes and bounds are therefore intended to be embraced by the claims.
A CRAS system of the present invention including 3′UTR hairpin sequence-introduced mRNA of a target protein and a fusion protein of a chaperone and an RNA-binding domain may improve solubilization of various target proteins. Further, a CLEX system of the present invention including two cistrons may improve solubilization of various target proteins.
Number | Date | Country | Kind |
---|---|---|---|
10-2017-0139497 | Oct 2017 | KR | national |