Induction promoter gene and secretory signal gene usable in Schizosaccharomyces pombe, expression vectors having the same, and use thereof

Information

  • Patent Grant
  • 6130070
  • Patent Number
    6,130,070
  • Date Filed
    Wednesday, June 30, 1999
    25 years ago
  • Date Issued
    Tuesday, October 10, 2000
    24 years ago
Abstract
An isolated DNA in an invertase gene from Schizosaccharomyces pombe, which is located in a region involved in catabolite repression. The DNA may be incorporated into cloning vector, particularly a vector containing a heterologous protein structural gene. The vector can be used to transform Schizosaccharomyces pombe. A heterologous protein may be produced by incubating the transformant and isolating the protein.
Description

TECHNICAL FIELD
The present invention relates to an inducible promoter gene and secretion signal gene for use in the fission yeast Schizosaccharomyces pombe (hereinafter referred to as S. pombe), an expression vector containing them and a process for producing a protein using them. In particular, it relates to a process for producing a desired protein wherein the S. pombe invertase promoter is used to make it possible to control the timing of the protein production by the presence or absence of a specific nutrient through regulated gene expression, and a process for secretory production of a desired protein by using the secretion signal gene for the S. pombe invertase precursor.
BACKGROUND ART
S. pombe, despite being a eukaryote, has been studied extensively for its high versatility in genetics, molecular biology and cellular biology as a unicellular organism (Nasim A. et al. eds., Molecular biology of the fission yeast, Academic Press, 1989). In its cultures, monosaccharides such as glucose and fructose are used as the main carbon sources. It is known that in a culture medium lacking these monosaccharides, expression of invertase, the enzyme that degrades sucrose into glucose and fructose, is induced to secure the carbon source necessary for its growth (Moreno S. et al., Arch Microbial. 142, 370, 1985).
S. pombe invertase is and is a high-molecular weight glycoprotein located on the cell surface with a molecular weight of about 205000, 67% of which is attributed to sugar chains composed of equimolar amounts of mannose and galactose residues. Molecular weight and amino acid studies of the protein moiety of the pure enzyme and experiments using antibodies have shown high similarlity between S. pombe invertase and the invertase from the budding yeast Saccharomyces cerevisiae from the viewpoint of protein chemistry (Moreno S. et al., Biochem. J. 267, 697, 1990). It is also known that a drop in glucose concentration de-represses synthesis of invertase (Mitchinson J. et al., Cell Sci 5, 373, 1969).
Induced invertase synthesis (de-repression) is also observed in Saccharomyces cerevisiae. Previous detailed studies on genetic regulation of invertase expression, the biosynthetic pathway and the structure of the sugar chain moiety have shown that Saccharomyces cerevisiae invertase is encoded by six overlapping genes, SUC1 to SUC5 and SUC7, on one chromosome and that activation of at least one of these SUC genes leads to utilization of sucrose and raffinose (Hohmann S. et al., Curr Genet 11, 217, 1986).
In contrast, with respect to S. pombe, although purification of the invertase protein has been reported (Moreno S. et al., 1985), no invertase genes had been identified until the present inventors and coworkers recently reported two overlapping invertase genes inv0.sup.+ and inv1.sup.+ in S. pombe. Because inv0.sup.+ is likely a pseudogene having an incomplete open reading frame, inv1.sup.+ is the only one gene encoding S. pombe invertase, which is supposed to confer the ability to grow on sucrose even in the absence of other carbon sources ("Kobogaku" edited by Yositaka Hashitani, Iwanami Shoten, 1967).
Analysis of the promoter region of the isolated gene suggested that a specific sequence between the 1st and 62nd base pairs is involved catabolite repression.
In Saccharomyces cerevisiae, the SUC2 gene is transcribed into two messenger RNAs (mRNAs) from different transcription initiation sites. The shorter one is a constitutive mRNA encoding the intracellular invertase, while the longer one is a mRNA encoding the catabolite-repressible secretory invertase with a de-repression ratio of not less than 200 (Carlson M. et al., Mol. Cell. Biol. 3, 439, 1983). Analysis of the promoter region for the longer mRNA suggested that the transcription initiation factor binds to a specific repeated sequence between positions -650 and -418 (Salokin L et al., Mol. Cell. Biol. 6, 2314, 1986). The region between positions -418 and -140 has been shown to be necessary for glucose repression.
These regions in the SUC2 gene showed no significant homology with the inv1.sup.+ upstream region between positions 1 and 2809. However, multiple copies of a so-called 7-bp motif with the sequence (A/C)(A/G)GAAAT, which is repeated at five sites in the region indispensable for glucose derepression, have been found in the inv1.sup.+ upstream region. Further, while palindrome stem-loops have been identified at almost the same positions in the upstream regions of glucose-repressible genes (SUC, MAL and GAL), palindrome sequences have also been found in the upstream region of the inv1.sup.+ gene from S. pombe. These sequences are anticipated to play an important role in glucose repression in S. pombe.
The yeast S. pombe is phylogenetically different from Saccharomyces cerevisiae. It is quite different from other yeasts in the chromosome structure and various mechanisms for genome replication, RNA splicing, transcription and posttranslational modification, and rather resembles animal cells in some of these aspects. For this reason, S. pombe is widely used as a eukaryotic model (Giga-Hama and Kumagai, eds., Foreign gene expression in fission yeast Schizosaccharomyces pombe, Springer-Verlag, 1997).
S. pombe is also widely used as a host for expression of heterologous protein genes and known to be suited especially for expression of genes from animals including human (JP-A-5-15380 and JP-A-7-163373). For its advanced membrane structures including the Goldi body and the endoplasmic reticulum, S. pombe is also used for expression of membrane proteins and shows high level expression. For S. pombe, constitutive expression vectors (pEVP11, pART1 and pTL2M) and an inducible expression vector using the promoter region of the nmt1.sup.+ gene (pREP1) are usually used as expression vectors. No S. pombe expression vectors of the GAL type or the SUC type have been known though these types of vectors are commonly used for Saccharomyces cerevisiae.
The expression of the SUC2 gene from Saccharomyces cerevisiae in S. pombe has been shown to be constitutive, not catabolite repressible, though the expression product contains galactose residues conferred by the host (Zarate, V. et al., J Applied Bacteriology, 80, 45, 1996), suggesting differences between S. pombe and Saccharomyces cerevisiae in the mechanism for catabolite repression of invertase. The differences are of great significance because the promoter from Saccharomyces cerevisiae usually used by those skilled in the art for construction of inducible expression vectors of the invertase type (the SUC2 type) is not applicable to S. pombe vectors. Therefore, development of S. pombe vectors of this type has been long delayed.
On the other hand, the present inventors constructed an expression vector using the secretion signal gene encoding the secretion signal in the precursor of a S. pombe mating pheromone (WO96/23890). However, this secretion signal gene is not an all-purpose secretion signal gene, and other secretion signal genes that function in S. pombe are desired for production of some types of protein.
DISCLOSURE OF THE INVENTION
As a result of their extensive research with a view to solving the above problems, the present inventors have accomplished the present invention by preparing a new clone of the S. pombe invertase gene and constructing an inducible expression vector. They have also found that the N-terminal 22 amino acid sequence in the amino acid of the invertase precursor functions as a secretion signal. On the basis of these findings, they have constructed an expression vector using the secretion signal gene and established secretory production of desired proteins.
The present invention relates to a region in the invertase gene from S. pombe, which is involved in catabolite repression, an inducible expression vector using the region and a system using it for heterologous gene expression and provides:
a DNA in an invertase gene from Schizosaccharomyces pombe, which is located in a region involved in catabolite repression,
a DNA having the base sequence of bases 1 to 2809 in SEQ ID NO: 1 in the Sequence Listing,
a recombinant vector containing the sequence of the DNA,
a multicloning vector containing the sequence of the DNA and a multicloning site,
a multicloning vector having the structure shown in FIG. 9,
an expression vector for transformation of Schizosaccharomyces pombe containing the sequence of the DNA and a heterologous protein structural gene,
a transformant from Schizosaccharomyces pombe containing the expression vector, and
a process for producing a protein which comprises incubating the transformant and recovering an expressed heterologous protein.
Firstly, the present inventors cloned and sequenced a S. pombe invertase gene, which had not been genetically identified. Then, they demonstrated by gene disruption analysis that the invertase gene is responsible for the overall invertase activity. Further, they identified the region involved in catabolite repression and constructed an inducible expression vector using the region. They actually constructed a recombinant vector carrying the gene of a green fluorescent protein, transformed S. pombe with the vector and confirmed the expression of the protein by assay of invertase activity and immunological analysis. They also demonstrated repression of the heterologous gene expression in the presence of glucose in the culture medium and derepression by exhaustion of glucose.
The present inventors used the following procedure to identify and characterize the gene of the S. pombe invertase precursor:
(1) PCR using a cDNA library from S. pombe as a template and primers based on conserved amino acid sequences in invertase genes from many other organisms;
(2) screening of a genomic library from S. pombe by plaque hybridization using the PCR product as a probe for positive clones;
(3) confirmation of the positive clones by restriction digestion followed by electrophoresis;
(4) Southern hybridization analysis and total sequencing of a fragment with a specific length in the positive clones;
(5) gene disruption analysis of invertase activity;
(6) investigation of the optimum pH for expression of the invertase gene from S. pombe and the effects of the glucose concentration on glucose repression and derepression; and
(7) identification of a region indispensable for glucose repression through subcloning of the related upstream region.
Also, the present inventors constructed a S. pombe invertase inducible expression vector by the following procedure and actually demonstrated inducible expression of a green fluorescent protein:
(1) construction of an inducible multicloning expression vector pRI0M containing an invertase promoter by modifying a S. pombe multicloning vector, pTL2M (JP-A-7-163373);
(2) construction of an inducible expression vector pRI0EGFP for expression of a green fluorescent protein variant from the inducible multicloning vector, pRI0M;
(3) transformation of a wild-type S. pombe strain with the inducible expression vector, pRI0EGFP, for expression of the green fluorescent protein variant;
(4) demonstration of the expression of the green fluorescent protein variant by activity (fluorescence) analysis and SDS-PAGE-western blotting; and
(5) establishment of the conditions for the inducible expression on the basis of the dependence of the expression level on the glucose concentration in the culture medium.
SEQ ID NO: 1 in the Sequence Listing is the base sequence of the gene of the invertase precursor, which contains a region involved in catabolite repression. The region involved in catabolite repression is the DNA sequence between positions 1 to 2809 of SEQ ID NO: 1 or within the DNA sequence. In the DNA sequence between positions 1 and 2809 of SEQ ID NO: 1, the region extending from position 1 to position 620 of SEQ ID NO: 1 and the region extending from position 1610 to position 2610 of SEQ ID NO: 1 are especially important, as is evident from the results of the analysis in Example 6 shown in FIG. 8 (position 2810 in SEQ ID NO: 1 corresponds to position 1 in FIG. 8). This means that the inducible promoter in the present invention is not restricted to a DNA having the base sequence from position 1 to position 2809 of SEQ ID NO: 1 so long as it contains these genes involved in catabolite repression and functions as an inducible promoter. Still, a DNA having a base sequence from position 1 to position 2809 of SEQ ID NO: 1 is preferable as an inducible promoter because it actually functions in S. pombe.
The above-mentioned DNA which contains genes involved in catabolite repression and function as an inducible promoter, preferably having the base sequence from position 1 to position 2809 of SEQ ID NO: 1, is hereinafter referred to as the inducible promoter gene. The inducible promoter gene can be integrated with a vector for construction of recombinant vectors such as multicloning vectors and expression vectors. A multicloning vector is a vector having a multicloning site and provides an expression vector through introduction of a desired structural gene into the multicloning site. An expression vector is a vector containing a structural gene and used for expression of a structural gene encoding a heterologous protein. A "heterologous" protein is a protein which is not inherent in the host. For example, when the host is S. pombe, a heterologous protein is a protein which is not inherent in S. pombe (such as a human protein).
In the expression vector, the inducible promoter gene is located upstream from the heterologous protein structural gene and regulates expression of the structural gene. The inducible promoter gene in the expression vector regulates the expression of the heterologous protein structural gene downstream, like the inducible promoter gene, located upstream in the base sequence represented by SEQ ID NO: 1 regulates the expression of the structural gene of the invertase precursor. In the multicloning vector, the inducible promoter gene is located upstream from the multicloning site into which a heterologous protein structural gene is to be introduced.
One example of the multicloning vector of the present invention is the multiclonig vector pRI0M constructed in Example 9 and has the structure shown in FIG. 9. The entire base sequence of pRI0M is SEQ ID NO: 3. Inv1-P is the above-mentioned inducible promoter gene, and MCS is the multicloning site. One example of the expression vector of the present invention is the inducible expression vector pRI0EGFP for expression of a green fluorescent protein variant constructed in Example by introducing the structural gene (EGFP-ORF) of a green fluorescent protein variant and has the structure shown in FIG. 10. The entire base sequence of the expression vector pRI0EGFP is SEQ ID NO: 14.
The most suitable cell (host) to transform with the expression vector of the present invention is S. pombe because the inducible promoter gene in the present invention is an inducible promoter gene from S. pombe.
Under catabolite repressing conditions (for example, in a culture medium containing a high level of glucose), S. pombe transformed with the expression vector of the present invention grows with no (or low) expression of the heterologous protein. Growth at this stage without the burden of heterologous protein expression is more efficient than growth under the burden. Subsequent incubation under catabolite derepressing conditions (for example, in a culture medium containing no or a low level of glucose) invites the increased number of S. pombe cells to high level expression of the heterologous protein, though growth of S. pombe is less efficient than under catabolite repressing conditions. Thus, controlled transition between growth of S. pombe and heterologous protein expression through catabolite repression allows more efficient production of a heterologous protein.
Catabolite repression can be controlled not only in an active way as described above but also in a passive way. For example, when a S. pombe transformant is incubated in a culture medium containing a given amount of glucose, the S. pombe grows under catabolite repressing conditions containing a high level of glucose in the initial stage, but later on production of a heterologous protein predominates due to catabolite derepression as glucose is exhausted. This way, more efficient heterologous protein production of than ever is possible without active control of the glucose level.
In addition to the above-mentioned total sequencing of the invertase precursor gene from S. pombe, the present inventors determined the complete amino acid sequence of the invertase precursor (the amino acid sequence in SEQ ID NO: 2). Then, they have found that the first 22 amino acid peptide in the amino acid sequence of the invertase precursor (Met Phe Leu Lys Tyr Ile Leu Ala Ser Gly Ile Cys Leu Val Ser Leu Leu Ser Ser Thr Asn Ala) (amino acids 1-22 of SEQ ID NO: 2) acts as a secretion signal. Hereinafter, the peptide is referred to as the secretion signal.
It is expected that a desired heterologous protein produced in transformed S. pombe cells as a protein fusion having the secretion signal at the N-terminal is secreted from the cells after intracellular processing which splits the protein fusion into the secretion signal and the heterologous protein. The present inventors constructed an expression vector carrying a heterologous protein structural gene (specifically, human interleukin 6-a'c1 variant) fused with a DNA encoding the secretion signal (namely, a structural gene of a protein fusion as mentioned above) and demonstrated secretion of the heterologous protein from S. pombe cells transformed with the expression vector.
The present invention provides the secretion signal, a DNA encoding the secretion signal (hereinafter referred to as a secretion signal gene), a recombinant vector carrying the secretion signal gene, a multicloning vector carrying the secretion signal gene, an expression vector carrying the secretion signal gene and a heterologous protein structural gene for transformation of S. pombe, a S. pombe transformant carrying the expression vector and a process for producing a protein which comprises incubating the transformant and recovering the expressed heterologous protein.
The secretion signal gene is not restricted to the 66-bp sequence extending from position 2810 to position 2875 in SEQ ID NO: 1 and may be a DNA having a different base sequence encoding the amino acid sequence of the secretion signal. In the expression vector, the secretion signal and the heterologous protein structural gene is preferably linked directly. But they may be linked via another DNA sequence, for example, extending from position 2876 in SEQ ID NO: 1. In this case, the protein product has extra amino acid residues at the N-terminal of the heterologous protein but can be converted into the desired heterologous protein by trimming off the N-terminal extra amino acid residues. However, the disadvantage from the presence of these extra amino acid residues usually becomes more serious for the desired protein as the number of extra amino acid residues increases. Therefore, as the intervening DNA between the secretion signal gene and the heterologous protein structural gene, a short DNA encoding at most 10 amino acid residues is preferable. Particular preferably, the secretion signal gene and the heterologous protein structural gene are linked directly.
Construction of an expression vector carrying the secretion signal gene using a multicloning vector can be attained by inserting a heterologous protein structural gene fused with the secretion signal gene into the multicloning site of a multicloning vector or by inserting a heterologous protein structural gene into the multicloning site of a multicloning vector carrying the secretion signal gene. The latter method tends to restrict the structure of the multicloning site because the secretion signal gene is preferred to be located immediately in front of the multicloning site as described above. Therefore, the former method is preferred for construction of an expression vector. As a S. pombe multicloning vector, for example, pTL2M, which is disclosed in JP-A-163373, is preferable.
According to the present invention, an expression vector can be constructed by using both the inducible promoter gene and the secretion signal gene. For example, an expression vector which contains the DNA sequence of from position 1 to position 2875 in SEQ ID No: 1 and a heterologous protein structural gene introduced downstream of the DNA sequence can be constructed. Such an expression vector enables catabolite repressible secretory production of a heterologous protein by the host cell. A similar expression vector can be constructed by using a known secretion signal gene (such as the secretion signal gene disclosed in WO96/23890) instead of the above-mentioned secretion signal gene.





BRIEF DESCRIPTION OF DRAWINGS
The following drawings are presented in connection with the section of Best Mode for Carrying Out the Invention.
FIG. 1 shows a comparison of (partial) amino acid sequences deduced from inv1* (amino acids 58-393 of SEQ ID NO: 2), the Schwanniomyces occidentalis invertase gene (SEQ ID NO: 23) and the fission yeast SUC2 gene (SEQ ID NO: 24).
FIG. 2 is the restriction map of the inv1* gene.
FIG. 3 electrophoretically shows disruption of the inv1* gene.
FIGS. 4(a)-4(b). FIG. 4(a) is a photograph of colony gel overlay assay of invertase activity (for phenotype characterization) as a substitute for a drawing. FIG. 4(b) is a schematic explanation of experimental design of the invertase activity assay shown in FIG. 4(a).
FIGS. 5(a)-5(b). FIG. 5(a) is a photograph of colony gel overlay assay of invertase activity (for phenotype characterization) as a substitute for a drawing. FIG. 5(b) is a schematic explanation of experimental design of the invertase activity assay shown in FIG. 5(a).
FIG. 6 graphically shows the relation between the invertase activity and the glucose concentration.
FIG. 7 graphically shows the relation between the invertase activity and the glucose concentration.
FIG. 8 shows the results of analysis of invertase promoters.
FIG. 9 shows the structure of an inducible expression vector pRI0M.
FIG. 10 shows the structure of an inducible expression vector pRI0EGFP for expression of a green fluorescent protein.
FIGS. 11(a) and 11(b) demonstrate the expression of a green fluorescent protein.
FIGS. 12(a) and 12(b) graphically show the relation between the incubation time and the expression level of the green fluorescent protein.
FIGS. 13(a) and 13(b) graphically show the relation between the incubation time and the expression level of the green fluorescent protein.
FIGS. 14(a) and 14(b) show the relation between the incubation time and the expression level of the green fluorescent protein.
FIG. 15 is a SDS-PAGE pattern obtained in analysis of the expression of interleukin 6a'c1 variant.
FIG. 16 is the western blot pattern of the expressed interleukin variant.





BEST MODE FOR CARRYING OUT THE INVENTION
Now, the present invention will be described in further detail with reference to specific Examples.
EXAMPLE 1
Isolation of S. pombe Invertase Gene
PCR using a S. pombe cDNA library as the template and primers designed on the basis of conserved sequences in invertase genes from other organisms shown in SEQ ID NOS: 4 to 6, gave amplification products of about 300 bp and about 400 bp. Each PCR product was purified by using EASY TRAP (Takara Shuzo Co., Ltd.) and sequenced after ligation into a vector by using pMOS Blue T vector kit (Amersham Pharmacia Biotech K.K.). The deduced amino acid sequence indicated that part of the 400-bp PCR product has significant homology with the SUC2 gene from Saccharomyces cerevisiae.
Screening of a genomic library of S. pombe for the entire invertase gene by plaque hybridization using the 400-bp PCR product as a probe picked up 15 positive clones from about 8,000 plaques. Secondary screening of the positive clones by plaque hybridization left four positive clones. Treatment of small amounts of phage DNA extracts from the positive clones with various restriction enzymes gave identical cleavage patterns, and thus revealed that all the clones were identical.
The entire invertase gene was isolated by the following two-step procedure. A 3.0-kb HindIII fragment was isolated from the hybrid-forming clones and ligated into a plasmid pBluescript II SK- (Toyobo Co., Ltd.). Restriction mapping identified BamHI and SalI sites. Subcloning of the fragment using these restriction enzymes and subsequent sequencing using a deletion technique revealed that the HindIII fragment contains the complete ORF of the gene but contains only about 200 bp within the upstream region, which is supposedly involved in the gene expression. Therefore, separately, a 3.5-kb BamHI fragment from the hybrid-forming clones was further digested with HindIII to give a 2.6-kb fragment. Subcloning of the 2.6-kb fragment in plasmid pBluescript II SK-, and subsequent sequencing using a deletion technique revealed that the BamHi-HindIII fragment contained a sequence within the upstream region supposedly involved in the gene expression. The resulting complete 5.6-kb gene was designated as inv1.sup.+. The base sequence of inv1.sup.+ and the amino acid sequence encoded by its ORF are shown in SEQ ID NO: 1 and 2. A plasmid carrying the complete gene was designated as pINV3000.
These results suggest that the inv1.sup.+ product has 16 asparagine-linked glycosylation sites. FIG. 1 shows a comparison of the amino acid sequence deduced from the base sequence of the inv1.sup.+ gene from S. pombe (SEQ ID NO: 2), the amino acid sequence of Schwanniomyces occidentalis (SEQ ID NO: 23) invertase and the amino acid sequence deduced from the base sequence of the SUC2 gene from Saccharomyces cerevisiae (SEQ ID NO: 24). Amino acids that are common to the three are marked with *. FIG. 1 clearly shows the amino acid sequence deduced from the inv1.sup.+ base sequence has significant homology with invertases from other origins such as Schwanniomyces occidentalis and Saccharomyces cerevisiae, which suggests the inv1.sup.+ encodes invertase.
EXAMPLE 2
Disruption of the inv1.sup.+ Gene
The HindIII site in plasmid pBluescript II SK- having a S. pombe ura4.sup.+ gene insert at the ClaI site was disrupted by HindIII digestion followed by blunting and self-ligation (self-cyclizaion). Double restriction digestion of the plasmid with XbaI and HincII gave a ura4.sup.+ fragment. The plasmid pBluescript was integrated with the ura4.sup.+ fragment after restriction digestion with SpeI and subsequent blunting and XbaI digestion, to provide a plasmid having BamHI sites on both sides of ura4.sup.+. The BamHI site in plasmid pBluescript II SK- was disrupted similarly by restriction digestion followed by blunting and self-ligation, and the 3.0-kb fragment containing the inv1.sup.+ ORF was inserted at the HindIII site. BamHI digestion of the plasmide eliminated a 1.4-kb fragment (containing part of the inv1.sup.+ ORF encoding the C-terminal of invertase) from the 3.0-kb insert, and a ura4.sup.+ cassette having BamHI sites at both ends was inserted. HindIII digestion of the resulting plasmid gave a DNA fragment having inv1.sup.+ neighboring regions at both ends (FIG. 2). The restriction map of the inv1.sup.+ gene is shown in FIG. 2, wherein the open reading frame (ORF) is indicated by the arrow (inv1.sup.+ ORF) and the ura4.sup.+ replacement from Schizosaccharomyces pombe is boxed (ura4.sup.+). The disruption mutant strain had an inv1.sup.+ fragment carrying the S. pombe ura4.sup.+ gene instead of the 1.4-kb inv1.sup.+ BamHI-BamHI fragment partly containing the ORF. The inv1.sup.+ fragment was used to transform a wild-type S. pombe strain, TP4-1D [h.sup.-, leu1, ura4, ade6-M216, his2, obtained from Dr. Takashi Toda (Imperial Cancer Research Foundation)], and viable colonies on a uracil-free culture medium were collected. Overlay assay of invertase activity revealed that 7 out of 28 strains, namely 25% of the ura4.sup.+ colonies, lacked invertase activity.
Further, to verify the chromosomal inv1.sup.+ gene disruption, genomic DNA from a strain lacking invertase activity was analyzed after double restriction digestion with HindIII and SalI by Southern hybridization using the inv1.sup.+ HindIII-SalI fragment (2 kb) as the probe. The 3-kb hybridized fragment, which was not digested with SalI, shown in FIG. 3 demonstrates that part of the inv1.sup.+ gene in the chromosomal DNA had been replaced with the ura4.sup.+ gene in the strain which lacked invertase activity.
Thus, the inv1.sup.+ gene proved to be the only one invertase gene expressed in S. pombe.
EXAMPLE 3
Restoration of Invertase Activity by the inv1.sup.+ Gene
The 3.0-kb HindIII fragment containing the entire inv1.sup.+ ORF from the invertase gene was inserted into S. pombe vector pAU-SK (obtained from Dr. Chikashi Shimoda, Department of Science, Osaka City University), and the resulting recombinant vector was used to transform the invertase-defective strains (Example 2). The resulting transformants were streaked on YP sucrose plates (supplemented with 10 .mu.g/ml antimycin A and 20 .mu.g/ml bromocresol purple) for overlay assay of invertase activity. Further, the 2.6-kb inv1.sup.+ BamHI-HindIII fragment containing the upstream promoter region and the 2.0-kb HindIII-SalI fragment from the invertase gene containing the ORF were legated, and the resulting 4.6-kb BamHI-SalI fragment was inserted into pAU-SK. Transformation of the resulting recombinant vector into the invertase-defective strain was followed by similar overlay assay of invertase activity.
Both transformants (inv1.DELTA.[pAU-SK::inv1.sup.+ ]) and the wild-type strain TP4-1D (WT) developed blue stains, which indicate invertase production, unlike the inv1.sup.+ disruption mutant strain (inv1.DELTA.). The addition of the upstream promoter region resulted in stronger stains, which suggest high-level invertase expression (FIGS. 4(a) and 4(b)). FIG. 4(a) is a photograph showing the results of the gel overlay assays, and FIG. 4(b) is a schematic explanation of the stained sections shown in FIG. 4(a).
The invertase-defective strain were hardly viable on YP sucrose plates (supplemented with 10 .mu.g/ml antimycin) whereas the wild-type strain and the transformants were recognizably viable after 5 days incubation at 30.degree. C. (FIGS. 5(a) and (b)). FIG. 5(a) is a photograph showing the results of characterization by colony formation, and FIG. 5(b) schematically explains the characterization shown in FIG. 5(a).
These results demonstrate that the inv1.sup.+ gene expression product is the invertase located on the cell surface.
EXAMPLE 4
Determination of Glucose Concentration for Gene Repression
For determination of the critical glucose concentration for catabolite repression of the invertase gene, the wild-type strain TP4-1D was incubated at 30.degree. C. in 5 ml of MM medium containing 2%, 4%, 8% and 16% glucose with shaking to the mid-logarithmic growth phase. Invertase assays were done by the method of Goldstein et al., and the post-incubational glucose concentrations in the medium were determined by the phenol-sulfate method (FIG. 6). The hatched bars indicate invertase activity per cell (U/OD), and the empty bars indicate the residual glucose concentration. Judging from the graph, a glucose concentration of 8% is the optimum for glucose repressing incubation, because when the glucose concentration was 8%, the invertase activity was sufficiently repressed with little decrease of glucose.
EXAMPLE 5
Determination of Glucose Concentration for Induced Invertase Production
For determination of the most effective glucose concentration for induced invertase production, the wild-type strain TP4-1D and a transformant [obtained by transforming the invertase-defective strain (Example 3) with a pAU-SK vector carrying the inv1.sup.+ BamHI-SalI fragment] were preincubated in a medium containing 2% glucose to the mid-logarithmic growth phase and incubated in an MM medium containing 0%, 0.01%, 0.05%, 0.1%, 0.25%, 0.5%, 1.0% and 2% glucose with shaking at 27.degree. C. for 3 hours, and the invertase activity was assayed (FIG. 7). Each run of assay was carried out at 30.degree. C. over 180 minutes. 1 U of invertase converts 1 nmol of sucrose into glucose per 1 minute at 30.degree. C., pH 4.0.
The optimum glucose concentration for induction was found to be 0.1% for the wild-type strain and 0.05% for the transformant. The invertase activity in the wild-type strain was 40 times higher under derepressing conditions than under repressing conditions. These results demonstrate catabolite repression in S. pombe.
EXAMPLE 6
Analysis of the inv1.sup.+ Promoter Region
Fragments obtained ligating S. pombe inv1.sup.+ upstream sequences extending from positions 1, 620, 1100, 1610 and 2610 of SEQ ID NO: 1, respectively, and the inv1.sup.+ ORF were inserted into expression vector pAU-SK to obtain 5 plasmids for deletion studies. The plasmids containing the upstream sequences extending from positions 1, 1610 and 2610 carry the inv1.sup.+ BamHI-SalI, SacI-SalI and HindIII-SalI fragments, respectively. The plasmids containing the inv1.sup.+ upstream sequences extending from positions 620 and 1100 were constructed by site specific introduction of a SpeI site into pAU-SK::inv1.sup.+ (BamHI-SalI) using primers shown in SEQ ID NOS: 7 and 8, respectively, followed by partial removal of the upstream region by SpeI treatment.
The plasmids thus obtained were used to transform the invertase-defective strain (Example 3). Invertase assays were done to determine the enzyme activity in each transformant (FIG. 8). The results suggest that the sequence between position 1 and position 620 of SEQ ID NO: 1 is essential for glucose repression. The region between position 1620 and position 2610 of SEQ ID NO: 1 was identified as essential for high-level glucose derepression of invertase.
EXAMPLE 7
Construction of Inducible Multicloning Expression Vector pRI0M Carrying the Invertase Promoter
PCR amplification using the plasmid pINV3000 (Example 1) carrying the invertase gene from S. pombe as the template and oligo DNAs shown in SEQ ID NOS: 9 and 10 as the primers was performed to give a sequence which contains the promoter region for the invertase gene and has restriction enzyme recognition sequences at both ends. After terminal double restriction digestion with SpeI (Takara Shuzo Co., Ltd.) and EcoRI (Takara Shuzo Co., Ltd.), the sequence was subjected to agarose gel electrophoresis. Purification of the band of about 3000 bp by the glass beads method using EASY-TRAP gave an insert fragment.
The S. pombe multicloning vector pTL2M (JP-A-7-163373) was subjected to agarose gel electrophoresis after terminal double restriction digestion with SpeI and EcoI. Purification of the band of about 4500 bp by the glass beads method using EASY-TRAP gave a vector fragment.
The two fragments were ligated with a DNA ligation kit (Takara Shuzo Co., Ltd.) and transformed into E. coli strain DH (Toyobo Co., Ltd.). E. coli colonies were screened for the inducible expression vector pRI01M shown in FIG. 9 and SEQ ID NO: 3 in the Sequence Listing through base sequencing and restriction mapping, and the inducible expression vector was recovered by the alkali-SDS method on a preparatory scale.
EXAMPLE 8
Construction of Inducible Expression Vector pRI0EGFP for Expression of Green Fluorescent Protein
PCR amplification using the plasmid pINV3000 (Example 1) carrying the invertase gene from S. pombe as the template and oligo DNAs shown in SEQ ID NOS: 9 and 11 as the primers was performed to give a sequence which contains the promoter region for the invertase gene and has restriction enzyme recognition sequences at both ends. After terminal double restriction digestion with SpeI and NheI (Takara Shuzo Co., Ltd.), the sequence was subjected to agarose gel electrophoresis. Purification of the band of about 3000 bp by the glass beads method using EASY-TRAP gave a fragment for use as the promoter insert.
PCR using the plasmid pEGFP carrying the jellyfish (Aequorea victria) green fluorescent protein variant gene (Clontech) as the template and oligo DNAs shown in SEQ ID NOS: 12 and 13 in the Sequence Listing as the primers was performed to amplify the ORF in the green fluorescent protein variant gene. The PCR product was subjected to agarose gel electrophoresis after terminal double restriction digestion with NheI and HindIII. Purification of the band of about 700 bp by the glass beads method using EASY-TRAP gave a fragment for use as the ORF insert.
The S. pombe multicloning vector pTL2M was cleaved by double restriction digestion with SpeI and HindIII and then subjected to agarose gel electrophoresis. Purification of the band of about 4500 bp by the glass beads method using EASY-TRAP gave a vector fragment.
The three fragments were ligated with a DNA ligation kit and transformed into E. coli strain DH5. E. coli colonies were screened for the inducible expression vector pRI0EGFP for expression of the green fluorescent protein variant shown in FIG. 10 and SEQ ID NO: 14 in the Sequence Listing through base sequencing and restriction mapping, and the inducible expression vector was recovered by the alkali-SDS method on a preparatory scale.
EXAMPLE 9
Preparation of S. pombe Transformant ASP138
S. pombe wild-type strain ARC001 [leu1-32h.sup.- (isogenic to ATCC38399)] was transformed with the inducible expression vector pRI0EGFP for expression of the green fluorescent protein variant and a transducing vector pAL7 as described by Okazaki et al. (Okazaki et al., "Nucleic Acids Res.", 18, 6485-6489, 1990).
1 ml of a preincubated ARC001 culture in YPD medium was incubated in minimum medium MB+Leu with shaking at 30.degree. C. for 14 hours to a cell density of 3.times.10.sup.7 per 1 ml, and the cells were collected, washed with water, suspended in 1 ml of 0.1M lithium acetate (pH 5.0) and incubated at 30.degree. C. for 60 minutes. A 100 .mu.l portion of the suspension was mixed with 4 .mu.g of the inducible expression vector pRI0EGFP and 0.5 .mu.g of PstI-digested pAL7 in 15 .mu.l TE and further with 290 .mu.l of 50% PEG 4000 thoroughly and incubated at 30.degree. C. for 60 minutes, at 43.degree. C. for 15 minutes and at room temperature for 10 minutes, successively. After centrifugal removal of PEG, the cells were suspended in 1 ml of 1/2 YEL+Leu medium. After 10-fold dilution, 1 ml of the suspension was incubated at 32.degree. C. for 2 hours, and a 300 .mu.l portion was spread on minimum medium agar MMA. After 3 days of incubation at 32.degree. C., about 300 independent colonies had developed on the plate.
10 colonies of the transformant were inoculated in 2 ml of YEL medium containing 10 .mu.g/ml antibiotic G418 (YEL10 medium) and incubated with shaking at 32.degree. C. 2 days later, 6 clones were viable. In their subcultures, 4 clones were viable 3 days later. The putative desired transformant (ASP138 strain) was frozen in glycerol and stored for use in subsequent experiments.
EXAMPLE 10
Analysis of Expression of Green Fluorescent Protein Variant
S. pombe transformant ASP138 (Example 9) was inoculated in YPD medium containing 100 .mu.g/ml G418 (YPD100) and incubated at 32.degree. C. for 2 days. Green fluorescence was observed from each cell under a fluorescence microscope (excitation wavelength 490 nm/emission wavelength 530 nm). Green fluorescence emission was also observed upon ultraviolet irradiation from the centrifugally collected cells. Thus, expression of the desired green fluorescent protein in the active form was confirmed.
Strain ASP138 (Example 9) was incubated in 5 ml YPD100 at 32.degree. C. for 3 days, collected, washed and suspended in 50 mM tris-HCl (pH 7.5), disrupted with glass beads in a mini bead beater (Biospec). After removal of the glass beads, the cell extract was heated in the presence of SDS (1%) at 80.degree. C. for 15 minutes. Separately, a negative control was extracted from the transformant carrying pR10M by the same procedure.
50 .mu.g protein from the extract was analyzed by SDS-polyacrylamide gel electrophoresis (FIGS. 11(a) and (b)). After Coomassie Brilliant Blue (CCB) staining, the extract and a recombinant green fluorescent protein (Clonetech) as the positive control showed major bands with a molecular weight of 25000, but the negative control did not. Further, 50 .mu.g protein from the extract was analyzed by SDS-polyacrylamide gel electrophoresis followed by western blotting on a PVDF membrane using an anti-green fluorescent protein antibody (Clonetech). The cell extract and the positive control showed major bands with a molecular weight of 25000, but the negative control did not. These results provide biological evidence of expression of the desired green fluorescent protein.
EXMAPLE 11
Optimization of the Incubation Method (1)
The transformed S. pombe strain ASP (Example 9) was incubated in YPD medium containing 100 .mu.g/ml G418 (YPD100) at 32.degree. C. The expression level of the green fluorescent protein in the cell culture was determined fluorometrically by means of a microplate reader (Corona Electric Co., Ltd.) equipped with a fluorescent attachment (excitation wavelength 490 nm/emission wavelength 530 nm) (FIGS. 12(a) and (b)). OD, FLU/OD and time denote the cell density, the fluorescence intensity per cell and the incubation time, respectively. The results show that strain ASP138 did not express the green fluorescent protein variant until the late-growth phase after glucose exhaustion in the mid-growth phase, unlike strain ASP122 having a non-inducible cytomegalovirus promoter [a transformant carrying a recombination product of phGFPS65T (Clonetech) for expression of a green fluorescent protein variant of S65T prepared as disclosed in JP-A-7-163373], clearly due to repression of the inducible invertase promoter in the presence of glucose and subsequent derepression by glucose exhaustion, demonstrating the applicability of this mechanism to expression of the green fluorescent protein as a heterologous protein.
EXAMPLE 12
Incubation of Strain ASP138 (2)
The transformed S. pombe strain ASP138 (Example 9) was incubated in YPD medium containing 100 .mu.g/ml G418 (glucose concentration 8%) at 32.degree. C. to the mid-growth phase, and after medium change, incubated in YPDG medium containing 100 .mu.g/ml G418 (glucose concentration 0.1%, glycerol concentration 3%) (FIGS. 13(a) and (b)). OD, FLU/OD and time denote the cell density, the fluorescence intensity per cell and the incubation time, respectively. The results show that while the green fluorescent protein was not expressed in the cells incubated without medium change (Untreated), expression of the green fluorescent protein in the cells incubated with the medium change was activated by the medium change (Medium-changed) probably because the depletion of glucose in the medium provoked derepression of the invertase promoter and thereby induced the protein expression. The high level expression induced by the medium change to a low-glucose expression medium suggests that use of a growth medium (with a high glucose concentration) and an expression medium (with a low glucose concentration) can differentiate between cell growth and protein expression. It was demonstrated that the repression of the inducible invertase promoter in the presence of glucose and derepression by exhaustion of glucose could be utilized in expression of the green fluorescent protein as a heterologous protein.
EXAMPLE 13
Incubation of Strain ASP138 (3)
The transformed S. pombe strain ASP138 (Example 9) was incubated in YPD medium containing 100 .mu.g/ml G418 (glucose concentration 8%) at 32.degree. C. to the late-growth phase, and after medium change, incubated in YPDG medium containing 100 .mu.g/ml G418 (glucose concentration 0.1%, glycerol concentration 3%) (FIGS. 14(a) and (b)). OD, FLU/OD and time denote the cell density, the fluorescence intensity per cell and the incubation time, respectively. The results show that while the green fluorescent protein was not expressed in the cells incubated without medium change (Untreated), expression of the green fluorescent protein in the cells incubated with the medium change was activated by the medium change (Medium-changed) probably because the depletion of glucose in the medium provoked derepression of the invertase promoter and thereby induced the protein expression. The high level expression induced by the medium change to a low-glucose expression medium from a high-glucose medium before glucose exhaustion suggests that use of a growth medium (with a high glucose concentration) and an expression medium (with a low glucose concentration) can differentiate between cell growth and protein expression. It was demonstrated that the repression of the inducible invertase promoter in the presence of glucose and derepression by exhaustion of glucose could be utilized in expression of the green fluorescent protein as a heterologous protein.
EXAMPLE 14
Construction of Inducible Lipocortin I Expression Vector pRI0LPI
PCR using plasmid pINV3000 (Example 1) carrying the S. pombe invertase gene as the template and oligo DNAs shown in SEQ ID NOS: 15 and 16 in the Sequence Listing as primers gave an amplification product containing the promoter region in the invertase gene and having restriction enzyme recognition sequences at both ends. After terminal double restriction digestion with SpeI and EcoRI, the amplification product was subjected to agarose gel electrophoresis. Purification of the band of about 3000 bp by the glass beads method using EASY-TRAP gave a fragment for use as a promoter insert.
The expression vector pTL2L (JP-A-7-163373) carrying a human lipocortin I gene was subjected to agarose gel electrophoresis after terminal double restriction digestion with EcoRI and HindIII. Purification of the band of about 1000 bp by the glass beads method using EASY-TRAP gave a fragment for use as the OPF insert.
The S. pombe multicloning vector pTL2M (JP-A-7-163373) was subjected to agarose gel electrophoresis after terminal double restriction digestion with SpeI and HindIII. Purification of the band of about 4500 bp by the glass beads method using EASY-TRAP gave a vector fragment.
The three fragments were ligated with a DNA ligation kit and transformed into E. coli strain DH5. E. coli colonies were screened for the inducible lipocortin I expression vector pRI0LPI through base sequencing and restriction mapping, and the vector was recovered by the alkali-SDS method on a preparatory scale.
EXAMPLE 15
Preparation of Fission Yeast Schizosaccharomyces pombe Transformant ASP139
A S. pombe wild-type strain ARC001 was transformed with the inducible lipocortin I expression vector pRI0LPI and a transducing vector pAL7 as described by Okazaki et al.
1 ml of a preincubated ARC001 culture in YPD medium was incubated in minimum medium MB+Leu with shaking at 30.degree. C. for 16 hours to a cell density of 1.times.10.sup.7 per 1 ml, and the cells were collected, washed with water, suspended in 1 ml of 0.1M lithium acetate (pH 5.0) and incubated at 30.degree. C. for 60 minutes. A 100 .mu.l portion of the suspension was mixed with 2 .mu.g of the recombinant vector pRI0LPI and 0.5 .mu.g of PstI-digested pAL7 in 15 .mu.l TE and further with 290 .mu.l of 50% PEG 4000 thoroughly and incubated at 30.degree. C. for 60 minutes, at 43.degree. C. for 15 minutes and at room temperature for 10 minutes, successively. After centrifugal removal of PEG, the cells were suspended in 1 ml of 1/2 YEL+Leu medium. After 10-fold dilution, 1 ml of the suspension was incubated at 32.degree. C. for 2 hours, and a 300 .mu.l portion was spread on minimum medium agar MMA. After 3 days of incubation at 32.degree. C., about 300 independent colonies had developed on the plate.
10 colonies of the transformant were inoculated in 2 ml of YEL medium containing 10 .mu.g/ml antibiotic G418 (YEL10 medium) and incubated with shaking at 32.degree. C. 2 days later, 2 clones were viable. All the subcultures of them were viable 3 days later. The putative desired transformant (ASP138 strain) was frozen in glycerol and stored for use in subsequent experiments.
EXAMPLE 16
Analysis of Lipocortin I Expression
S. pombe transformant ASP139 (Example 15) was incubated in YPD medium containing 100 .mu.g/ml G418 (glucose concentration 8%) at 32.degree. C. to the stationary phase and collected as a non-inducible cell culture. Separately, ASP139 was incubated in the same medium at first to the mid-growth phase, then after medium change, incubated in YPDG medium containing 100 .mu.g/ml G418 (glucose concentration 0.1%, glycerol concentration 3%) to the stationary phase and collected as an inducible cell culture. Both cell cultures were washed, suspended in 50 mM tris-HCl (pH 7.5) and disrupted with glass beads in a mini bead beater. After removal of the glass beads, the cell extracts were heated in the presence of SDS (1%) at 80.degree. C. for 15 minutes.
50 .mu.g protein from each extract was separated by SDS-polyacrylamide gel electrophoresis and stained with Coomassie Brilliant Blue. The extract from the inducible cell culture showed a major band of a molecular weight of about 45000, which is the same as the deduced molecular weigh of the recombinant lipocortin I protein, but the extract from the non-inducible cell culture did not. More sensitive western analysis of band density showed that the band from the inducible cell culture extract was 10 times denser than the band from the non-inducible cell culture extract. The results show that while lipocortin I was not expressed in the cells incubated without medium change, the expression of lipocortin I in the cells incubated with the medium-change was activated by the medium change probably because the depletion of glucose in the medium provoked derepression of the invertase promoter and thereby induced the protein expression. The high level expression induced by the medium change to a low-glucose expression medium from a high-glucose medium suggests that use of a growth medium (with a high glucose concentration) and an expression medium (with a low glucose concentration) can differentiate between cell growth and protein expression. It was demonstrated that the repression of the inducible invertase promoter in the presence of glucose and derepression by exhaustion of glucose could be utilized in expression of lipocortin I as a heterologous protein.
EXAMPLE 17
Construction of Expression Vector pTL2INV1 Carrying Invertase Gene
PCR using plasmid pINV3000 (Example 1) carrying the S. pombe invertase gene as the template and oligo DNAs shown in SEQ ID NOS: 17 and 18 in the Sequence Listing as primers gave an amplification product containing the ORF in the invertase gene and having restriction enzyme recognition sequences at both ends. After terminal double restriction digestion with AflIII (New England Biolab) and HindIII (Takara Shuzo Co., Ltd), the amplification product was subjected to agarose gel electrophoresis. Purification of the band of about 3000 bp by the glass beads method using EASY-TRAP gave a fragment for use as an insert.
The S. pombe multicloning vector pTL2M (JP-A-7-163373) was subjected to agarose gel electrophoresis after terminal double restriction digestion with SpeI and EcoI. Purification of the band of about 4500 bp by the glass beads method using EASY-TRAP gave a vector fragment.
The two fragments were ligated with a DNA ligation kit (Takara Shuzo Co., Ltd) and transformed into E. coli strain DH5 (Toyobo Co., Ltd.). E. coli colonies were screened for the invertase gene expression vector pRI0LPI through base sequencing and restriction mapping, and the vector was recovered multiplied by the alkali-SDS method on a preparatory scale.
EXAMPLE 18
Construction of Secretory Expression Vector pSL2I06a'c1 Using the Signal Sequence from the Invertase Gene
PCR using plasmid pINV3000 (Example 1) carrying the S. pombe invertase gene as the template and oligo DNAs shown in SEQ ID NOS: 19 and 20 in the Sequence Listing as primers gave an amplification product containing the ORF in the invertase gene and having restriction enzyme recognition sequences at both ends. After terminal double restriction digestion with SpeI (Takara Shuzo Co., Ltd.) and EcoRI (Takara Shuzo Co., Ltd.), the amplification product was subjected to agarose gel electrophoresis. Purification of the band of about 700 bp by the glass beads method using EASY-TRAP (Takara Shuzo Co., Ltd.) gave a fragment for use as a signal insert.
PCR using plasmid pSL2P06a'c1 (WO96/23890) containing human iterleukin 6a'c1 variant cDNA as the template and oligo DNAs shown in SEQ ID NOS: 21 and 22 in the Sequence Listing as primers gave an amplification product containing the iterleukin 6a'c1 variant ORF. After terminal double restriction digestion with EcoRI and HindIII (Takara Shuzo Co., Ltd), the amplification product was subjected to agarose gel electrophoresis. Purification of the band of about 600 bp by the glass beads method using EASY-TRAP (Takara Shuzo Co., Ltd.) gave a fragment for use as a gene insert.
The S. pombe multicloning vector pTL2M (JP-A-7-163373) was subjected to agarose gel electrophoresis after terminal double restriction digestion with SpeI and HindIII. Purification of the band of about 4500 bp by the glass beads method using EASY-TRAP gave a vector fragment.
The three fragments were ligated with a DNA ligation kit and transformed intothe E. coli strain DH5 (Toyobo Co., Ltd.). E. coli colonies were screened for the IL-6a'c1 secretory expression vector pSL2I06a'c1 through base sequencing and restriction mapping, and the vector was recovered by the alkali-SDS method on a preparatory scale.
EXAMPLE 19
Preparation of S. pombe Transformant ASP168
A leucine-requiring S. pombe strain ARC001 was transformed with the interleukin-6a'c1 variant secretory expression vector pSL2I06a'c1 (Example 18) and a transducing vector pAL7 as described by Okazaki et al.
1 ml of a preincubated ARC001 culture in YPD medium was incubated in 100 minimum medium MB+Leu with shaking at 30.degree. C. for 16 hours. The cells were collected, washed with water, suspended in 0.1M lithium acetate (pH 5.0) at a cell density of 10.sup.9 cells/ml and incubated at 30.degree. C. for 60 minutes. A 100 .mu.l portion of the suspension was mixed with 2 .mu.g of the recombinant vector pSL2I06a'c1 and 1.0 .mu.g of PstI-digested pAL7 in 15 .mu.l TE and further with 290 .mu.l of 50% PEG 4000 thoroughly and incubated at 30.degree. C. for 60 minutes, at 43.degree. C. for 15 minutes and at room temperature for 10 minutes, successively. After centrifugal removal of PEG, the cells were suspended in 1 ml of 1/2 YEL+Leu medium. After 10-fold dilution, 1 ml of the suspension was incubated at 32.degree. C. for 2 hours, and a 300 .mu.l portion was spread on minimum medium agar MMA. After 3 days of incubation at 32.degree. C., about 1000 independent colonies had developed on the plate.
The transformants (colonies) were inoculated in 2 ml of YEL medium containing 10 .mu.g/ml antibiotic G418 (YEL10 medium) and incubated with shaking at 32.degree. C. for 5 days. The viable clones of the putative desired transformant, designated as strain ASP168, were frozen in glycerol and stored for use in subsequent experiments.
EXAMPLE 20
Analysis of Expressed Secretory Interleukin-6a'c1 Variant in Culture Medium
A fission yeast Schizosaccharomyces pombe transformant ASP168 (Example 19) was incubated in MA-Casamino acid medium (MA medium containing 2% Casamino acid and 3% glucose; for the composition of MA medium, refer to "Alfa et al., Experiments with Fission Yeast, Cold Spring Harbor Laboratory Press, 1993") containing 500 .mu.g/ml G418 at 32.degree. C. for 2 days.
The cell culture was centrifuged, and the supernatant was concentrated 100-fold through a membrane filter (Amicon Co., Ltd.). Analysis of the concentrated sample by SDS-polyacrylamide electrophoresis followed by Coomassie Brilliant Blue staining gave the SDS-PAGE pattern shown in FIG. 15. Lane 1 is the purified interleukin-6a'c1 variant, lane 2 is the supernatant from the ASP168 cell culture, and lane 3 is the supernatant from a cell culture of the control strain ASP021 [transformant carrying a recombinant vector with no ORF prepared by recombination of pTL2M (JP-A-7-163373) by the method disclosed in JP-A-7-163373 without introduction of any gene to be expressed]. The band with a molecular weight of about 20000 in lane 3 seemed attributable to the interleukin-6a'c1 variant from the comparison of lanes 1 and 3.
Further analysis by western blotting using an anti-IL-6a'c1 gave the pattern shown in FIG. 16. Lane 1 is the purified interleukin-6a'c1 variant, lane 2 is the supernatant from the ASP168 cell culture, and lane 3 is the supernatant from a cell culture of the control strain ASP021. The band with a molecular weight of about 20000 in lane 3 was identified as the interleukin-6a'c1 variant from the comparison of lanes 1 and 3.
__________________________________________________________________________# SEQUENCE LISTING- <160> NUMBER OF SEQ ID NOS: 24- <210> SEQ ID NO 1<211> LENGTH: 4748<212> TYPE: DNA<213> ORGANISM: Schizosaccharomyces pombe<220> FEATURE:<221> NAME/KEY: CDS<222> LOCATION: (2810)..(4552)- <400> SEQUENCE: 1- ggatcctagt ccgcgaaatc gagatgcttt gaagattaaa attaaattta at - #tttatgcg 60- agactggttt ccttattttt tgtatagtcg catgcaagcg aggttcgcat aa - #tttggaaa 120- ataaaggtag tcaagaagac gttgaattaa ggctgcagtt tcaaagtact ct - #acaaacga 180- ttccttttaa aaaaaaagat tcaaaaaaaa ggcaaagggt ttaagtaatg ct - #tgttattt 240- caatttacct ccaaacagtt actaatgcaa ttgcaaaaaa aaaacctacc ta - #ttgaatca 300- aaatttctag cccatccatc gctcctcaag ataaaggaat cgatattttg ag - #tttaaggg 360- agttgctgat agatttcaga attaaaaatt tttggaaaag gatgtcgaga ac - #aagaagat 420- acgtctagat tgctgatgat gcattctagc agacggaaat acaacgatat gt - #ggacagca 480- cgacttttga tccgttcgga tcaaaaggaa gagaaatatc catctttcaa ga - #agaatgca 540- ggaaaagcaa taaatgccca tttgattcct aaattatccc caaaaatgaa ca - #ttatgaga 600- tcttcttgtg ggagacagga aatttcgcaa ttccaaacga aaattcggct ct - #ttttttta 660- ccccacagtt gcggggtaaa tgatgtaacg gaccttgggg gaaaggatga tg - #agttagtt 720- gggaagcgga aaaaatggaa aacggaagta agaatagaaa ccagtatggc tg - #agtgcaat 780- ggcggaaaag attttacaga gatgacaaga atctatttat ctataaggaa aa - #actttttc 840- caaatttgtc taaaaacgca ttctcctcaa ttgcctctag gtagatgata ta - #acgaattg 900- gaacgagaca tcgctaaccg gttttctttg taaatgacat tttgtagtgg ga - #gtaagttt 960- gaatggaggg atagacagat gaatagtatg agatagaaga atagtatata ta - #atgattaa1020- gatgaacaaa taaaaattga aagaaaaaag aaattgttgg ctcatttggt tc - #atacacat1080- gttggttcat acaactttta cccatcgtaa gtattataag taaaaaatag ag - #tacgaaaa1140- gctataagta gtgaagcaaa aaaatagaaa aatagaaaaa aaaatatata ta - #aaaaaata1200- taataaaaat aaaactcata agagacgtaa aacacaagaa ttgtctatca tt - #tgttcttt1260- aagaagcacc accattctgt aaaactcttc atttctcatt agcaaggacc ct - #tttcattc1320- cttcctcttt agaatccttt tcattataac gaattggata atacgcaaat aa - #gaacacat1380- cccctaaata cgatatatcg atccattttt tactttgcct agcttattgc tg - #tacaattc1440- catttaaata gtttctcctc aagaaagatc gtcaatggag gcgacaatat ac - #cggaattt1500- aagttgcgga cacagagctt gaaaagactg cattttgtat tgttttcaag ta - #aatgaaac1560- tgagttttga agtctcaaaa tacatcttat gtattgaaca ttagaagaac at - #ataagata1620- gatcttgaga gctcaattca tcgacattct agccatcata ctgcgatctt ag - #acattgtc1680- agcacaacct tagatcgaaa atgaacacgt taccaaacgt tgtctaaaac tt - #gccgaatc1740- ttatctccgc attacttccg taatccttag tacatacgct gcaatttcgg aa - #ggtcatga1800- tcgacttttt gtgtagctat aagtgacgca aatgagaaac atgacaaggt gc - #gatattta1860- gcaagatatt atgcatttga tggagaaagg aaatttcgga tgtatatata gt - #accgttag1920- ctgcgctttt tttggtcatc cataattttc aaactcactg ctttcgatca ga - #tttaccgt1980- ttttaaggtc tttattgctt tgtgatctgt aggttggaac atctatagtt ca - #ttttctaa2040- aagatccttt catcgtttca tcggatagta atcgttcaag aaaaaaaaag aa - #aaaaagaa2100- aaagaaaaag aaaagaaaaa taaaccgcta taattcatta cctatttgac tg - #aaggttct2160- tcatcttgaa ttgttttgaa tcaaaataaa gaaattatta ttattatttt tt - #ttcttcgc2220- tttttcttta tccattcgtc gaaactattt ttctgctgat aaaagcaatc at - #tccttttt2280- cctgcttctc ttgttattcg aattttaaac gacttttttt cctcgtccat tc - #cctaattc2340- tttgcgacct tttctgattc tatccttggt ttgtactttc gttgtgtaat tg - #ttgagaaa2400- gtgaactgat tatttaattg ttgtgaaaaa aattctaaaa ctattttgtt tt - #tcttgatc2460- attcatcctt tgctcgcttg cttgaatatt acagaaattc gtctcccttt ca - #acggaata2520- tgataatttg ttgaatactc taaatcaatt aacacctatc aaaagctgaa ac - #attaaatc2580- tattctcacc aaaaaaaaag actcaagctt cttcgttgtt ggccggtctc tt - #ttttgttt2640- tacgattgtt aaattttata ctcacaactg ccaattctcc acttttgact at - #ttattgat2700- agtccctatt taattttctg ttcaccgatt atcgtctttt ttgtaaataa tc - #tttcttgg2760- aaccaaccaa ttaatacgtt ataatcgcta actttgaaga tttgctaca atg - # ttt ttg2818#Met Phe Leu# 1- aaa tat att tta gct agt ggc att tgc ctc gt - #c tct ctc tta tca tct2866Lys Tyr Ile Leu Ala Ser Gly Ile Cys Leu Va - #l Ser Leu Leu Ser Ser# 15- aca aac gcg gct ccc cgt cac tta tat gta aa - #a cgt tat cct gtc att2914Thr Asn Ala Ala Pro Arg His Leu Tyr Val Ly - #s Arg Tyr Pro Val Ile# 35- tat aat gct tcc aac atc act gaa gtc agc aa - #t tct acc acc gtt cct2962Tyr Asn Ala Ser Asn Ile Thr Glu Val Ser As - #n Ser Thr Thr Val Pro# 50- cct cct cca ttc gta aat aca acg gcc cct aa - #t ggg act tgt ttg ggt3010Pro Pro Pro Phe Val Asn Thr Thr Ala Pro As - #n Gly Thr Cys Leu Gly# 65- aac tat aac gag tat ctt cct tca gga tat ta - #c aat gct acc gat cgt3058Asn Tyr Asn Glu Tyr Leu Pro Ser Gly Tyr Ty - #r Asn Ala Thr Asp Arg# 80- ccc aaa att cat ttt act cct tct tcc ggt tt - #c atg aat gat cca aac3106Pro Lys Ile His Phe Thr Pro Ser Ser Gly Ph - #e Met Asn Asp Pro Asn# 95- gga ttg gta tat act ggc ggc gtc tat cac at - #g ttc ttc caa tat tca3154Gly Leu Val Tyr Thr Gly Gly Val Tyr His Me - #t Phe Phe Gln Tyr Ser100 1 - #05 1 - #10 1 -#15- cca aaa act cta aca gcc ggc gaa gtt cat tg - #g ggt cac aca gtt tcc3202Pro Lys Thr Leu Thr Ala Gly Glu Val His Tr - #p Gly His Thr Val Ser# 130- aag gat tta atc cat tgg gag aat tat cct at - #t gcc atc tac ccc gat3250Lys Asp Leu Ile His Trp Glu Asn Tyr Pro Il - #e Ala Ile Tyr Pro Asp# 145- gaa cat gaa aac gga gtt ttg tcc ctc cca tt - #t agt ggc agt gca gtc3298Glu His Glu Asn Gly Val Leu Ser Leu Pro Ph - #e Ser Gly Ser Ala Val# 160- gtc gat gtt cat aat tct tcc ggt ctc ttt tc - #c aac gac acc att ccg3346Val Asp Val His Asn Ser Ser Gly Leu Phe Se - #r Asn Asp Thr Ile Pro# 175- gaa gag cgc att gtt tta att tat acc gat ca - #t tgg act ggt gtt gct3394Glu Glu Arg Ile Val Leu Ile Tyr Thr Asp Hi - #s Trp Thr Gly Val Ala180 1 - #85 1 - #90 1 -#95- gag cgt cag gct att gcg tat acc act gat gg - #t gga tat act ttc aaa3442Glu Arg Gln Ala Ile Ala Tyr Thr Thr Asp Gl - #y Gly Tyr Thr Phe Lys# 210- aaa tat tca gga aat ccc gtt ctt gac att aa - #t tca ctt caa ttc cgc3490Lys Tyr Ser Gly Asn Pro Val Leu Asp Ile As - #n Ser Leu Gln Phe Arg# 225- gac ccc aag gta ata tgg gat ttc gat gct aa - #t cgt tgg gtg atg att3538Asp Pro Lys Val Ile Trp Asp Phe Asp Ala As - #n Arg Trp Val Met Ile# 240- gta gct atg tct caa aat tat gga att gcc tt - #t tat tcc tcc tat gac3586Val Ala Met Ser Gln Asn Tyr Gly Ile Ala Ph - #e Tyr Ser Ser Tyr Asp# 255- ttg att cac tgg acc gag tta tct gtt ttc tc - #c act tct ggt tat ttg3634Leu Ile His Trp Thr Glu Leu Ser Val Phe Se - #r Thr Ser Gly Tyr Leu260 2 - #65 2 - #70 2 -#75- ggg ttg caa tat gaa tgc cct gga atg gct cg - #t gtg ccc gtt gaa ggc3682Gly Leu Gln Tyr Glu Cys Pro Gly Met Ala Ar - #g Val Pro Val Glu Gly# 290- acc gat gaa tac aaa tgg gta ctc ttc atc tc - #c atc aat cct ggc gct3730Thr Asp Glu Tyr Lys Trp Val Leu Phe Ile Se - #r Ile Asn Pro Gly Ala# 305- cca ttg gga gga tcc gtt gtc caa tac ttt gt - #t ggc gat tgg aat ggt3778Pro Leu Gly Gly Ser Val Val Gln Tyr Phe Va - #l Gly Asp Trp Asn Gly# 320- aca aac ttc gtc ccc gat gat ggc caa act ag - #a ttc gta gac ttg ggt3826Thr Asn Phe Val Pro Asp Asp Gly Gln Thr Ar - #g Phe Val Asp Leu Gly# 335- aag gac ttt tac gcc agc gct ttg tat cac tc - #g tct tcc gcc aat gcc3874Lys Asp Phe Tyr Ala Ser Ala Leu Tyr His Se - #r Ser Ser Ala Asn Ala340 3 - #45 3 - #50 3 -#55- gat gtt att gga gtt gga tgg gct agc aac tg - #g caa tac acc aac caa3922Asp Val Ile Gly Val Gly Trp Ala Ser Asn Tr - #p Gln Tyr Thr Asn Gln# 370- gct cct act caa gtt ttc cgc agt gct atg ac - #a gtt gca cga aaa ttc3970Ala Pro Thr Gln Val Phe Arg Ser Ala Met Th - #r Val Ala Arg Lys Phe# 385- act ctt cgc gac gtt cct cag aac ccc atg ac - #c aac ctt act tct ctc4018Thr Leu Arg Asp Val Pro Gln Asn Pro Met Th - #r Asn Leu Thr Ser Leu# 400- att caa acc cca ttg aat gtt tct ctc tta cg - #a gat gaa aca cta ttt4066Ile Gln Thr Pro Leu Asn Val Ser Leu Leu Ar - #g Asp Glu Thr Leu Phe# 415- acc gca ccc gtt atc aat agt tca agt agt ct - #t tcg ggc tct ccg att4114Thr Ala Pro Val Ile Asn Ser Ser Ser Ser Le - #u Ser Gly Ser Pro Ile420 4 - #25 4 - #30 4 -#35- act ctt cca agc aat acc gca ttc gag ttc aa - #t gtc aca ctc agt atc4162Thr Leu Pro Ser Asn Thr Ala Phe Glu Phe As - #n Val Thr Leu Ser Ile# 450- aat tac aca gaa ggc tgc aca aca gga tat tg - #t ctg ggg cgt att atc4210Asn Tyr Thr Glu Gly Cys Thr Thr Gly Tyr Cy - #s Leu Gly Arg Ile Ile# 465- att gat tct gat gat cca tac aga tta caa tc - #c atc tcc gtg gac gtt4258Ile Asp Ser Asp Asp Pro Tyr Arg Leu Gln Se - #r Ile Ser Val Asp Val# 480- gat ttt gca gct agc act tta gtc att aat cg - #t gcc aaa gct cag atg4306Asp Phe Ala Ala Ser Thr Leu Val Ile Asn Ar - #g Ala Lys Ala Gln Met# 495- gga tgg ttt aat tca ctt ttc acg cct tct tt - #t gcc aac gat att tac4354Gly Trp Phe Asn Ser Leu Phe Thr Pro Ser Ph - #e Ala Asn Asp Ile Tyr500 5 - #05 5 - #10 5 -#15- att tat gga aac gta act ttg tat ggt att gt - #t gac aat gga ttg ctt4402Ile Tyr Gly Asn Val Thr Leu Tyr Gly Ile Va - #l Asp Asn Gly Leu Leu# 530- gaa ctg tat gtc aat aat ggc gaa aaa act ta - #c act aat gac ttt ttc4450Glu Leu Tyr Val Asn Asn Gly Glu Lys Thr Ty - #r Thr Asn Asp Phe Phe# 545- ttc ctt caa gga gca aca cct gga cag atc ag - #c ttc gct gct ttc caa4498Phe Leu Gln Gly Ala Thr Pro Gly Gln Ile Se - #r Phe Ala Ala Phe Gln# 560- ggc gtt tct ttc aat aat gtt acc gtt acg cc - #a tta aag act atc tgg4546Gly Val Ser Phe Asn Asn Val Thr Val Thr Pr - #o Leu Lys Thr Ile Trp# 575- aat tgc taaatatttt gtttcaagtt aggaaagtat aataactttt gt - #ccctgcat4602Asn Cys580- attcaattgt aaagtttagt ttatcctttc atcgtaacca caattgtcac ct - #aaatctct4662- aaaaatctct tcacttatct agttaatgtc gtaacaaaaa agtccagtag ct - #tcgggaaa4722# 4748 acaa gtcgac- <210> SEQ ID NO 2<211> LENGTH: 581<212> TYPE: PRT<213> ORGANISM: Schizosaccharomyces pombe- <400> SEQUENCE: 2- Met Phe Leu Lys Tyr Ile Leu Ala Ser Gly Il - #e Cys Leu Val Ser Leu# 15- Leu Ser Ser Thr Asn Ala Ala Pro Arg His Le - #u Tyr Val Lys Arg Tyr# 30- Pro Val Ile Tyr Asn Ala Ser Asn Ile Thr Gl - #u Val Ser Asn Ser Thr# 45- Thr Val Pro Pro Pro Pro Phe Val Asn Thr Th - #r Ala Pro Asn Gly Thr# 60- Cys Leu Gly Asn Tyr Asn Glu Tyr Leu Pro Se - #r Gly Tyr Tyr Asn Ala# 80- Thr Asp Arg Pro Lys Ile His Phe Thr Pro Se - #r Ser Gly Phe Met Asn# 95- Asp Pro Asn Gly Leu Val Tyr Thr Gly Gly Va - #l Tyr His Met Phe Phe# 110- Gln Tyr Ser Pro Lys Thr Leu Thr Ala Gly Gl - #u Val His Trp Gly His# 125- Thr Val Ser Lys Asp Leu Ile His Trp Glu As - #n Tyr Pro Ile Ala Ile# 140- Tyr Pro Asp Glu His Glu Asn Gly Val Leu Se - #r Leu Pro Phe Ser Gly145 1 - #50 1 - #55 1 -#60- Ser Ala Val Val Asp Val His Asn Ser Ser Gl - #y Leu Phe Ser Asn Asp# 175- Thr Ile Pro Glu Glu Arg Ile Val Leu Ile Ty - #r Thr Asp His Trp Thr# 190- Gly Val Ala Glu Arg Gln Ala Ile Ala Tyr Th - #r Thr Asp Gly Gly Tyr# 205- Thr Phe Lys Lys Tyr Ser Gly Asn Pro Val Le - #u Asp Ile Asn Ser Leu# 220- Gln Phe Arg Asp Pro Lys Val Ile Trp Asp Ph - #e Asp Ala Asn Arg Trp225 2 - #30 2 - #35 2 -#40- Val Met Ile Val Ala Met Ser Gln Asn Tyr Gl - #y Ile Ala Phe Tyr Ser# 255- Ser Tyr Asp Leu Ile His Trp Thr Glu Leu Se - #r Val Phe Ser Thr Ser# 270- Gly Tyr Leu Gly Leu Gln Tyr Glu Cys Pro Gl - #y Met Ala Arg Val Pro# 285- Val Glu Gly Thr Asp Glu Tyr Lys Trp Val Le - #u Phe Ile Ser Ile Asn# 300- Pro Gly Ala Pro Leu Gly Gly Ser Val Val Gl - #n Tyr Phe Val Gly Asp305 3 - #10 3 - #15 3 -#20- Trp Asn Gly Thr Asn Phe Val Pro Asp Asp Gl - #y Gln Thr Arg Phe Val# 335- Asp Leu Gly Lys Asp Phe Tyr Ala Ser Ala Le - #u Tyr His Ser Ser Ser# 350- Ala Asn Ala Asp Val Ile Gly Val Gly Trp Al - #a Ser Asn Trp Gln Tyr# 365- Thr Asn Gln Ala Pro Thr Gln Val Phe Arg Se - #r Ala Met Thr Val Ala# 380- Arg Lys Phe Thr Leu Arg Asp Val Pro Gln As - #n Pro Met Thr Asn Leu385 3 - #90 3 - #95 4 -#00- Thr Ser Leu Ile Gln Thr Pro Leu Asn Val Se - #r Leu Leu Arg Asp Glu# 415- Thr Leu Phe Thr Ala Pro Val Ile Asn Ser Se - #r Ser Ser Leu Ser Gly# 430- Ser Pro Ile Thr Leu Pro Ser Asn Thr Ala Ph - #e Glu Phe Asn Val Thr# 445- Leu Ser Ile Asn Tyr Thr Glu Gly Cys Thr Th - #r Gly Tyr Cys Leu Gly# 460- Arg Ile Ile Ile Asp Ser Asp Asp Pro Tyr Ar - #g Leu Gln Ser Ile Ser465 4 - #70 4 - #75 4 -#80- Val Asp Val Asp Phe Ala Ala Ser Thr Leu Va - #l Ile Asn Arg Ala Lys# 495- Ala Gln Met Gly Trp Phe Asn Ser Leu Phe Th - #r Pro Ser Phe Ala Asn# 510- Asp Ile Tyr Ile Tyr Gly Asn Val Thr Leu Ty - #r Gly Ile Val Asp Asn# 525- Gly Leu Leu Glu Leu Tyr Val Asn Asn Gly Gl - #u Lys Thr Tyr Thr Asn# 540- Asp Phe Phe Phe Leu Gln Gly Ala Thr Pro Gl - #y Gln Ile Ser Phe Ala545 5 - #50 5 - #55 5 -#60- Ala Phe Gln Gly Val Ser Phe Asn Asn Val Th - #r Val Thr Pro Leu Lys# 575- Thr Ile Trp Asn Cys 580- <210> SEQ ID NO 3<211> LENGTH: 7286<212> TYPE: DNA<213> ORGANISM: Artificial Sequence<220> FEATURE:#Sequence:DNANFORMATION: Description of Artificial- <400> SEQUENCE: 3- agcttgaaaa aacctcccac acctccccct gaacctgaaa cataaaatga at - #gcaattgt 60- tgttgttaac ttgtttattg cagcttataa tggttacaaa taaagcaata gc - #atcacaaa 120- tttcacaaat aaagcatttt tttcactgca ttctagttgt ggtttgtcca aa - #ctcatcaa 180- tgtatcttat catgtctgga tcgatcccgg caggttgggc gtcgcttggt cg - #gtcatttc 240- gaaccccaga gtcccgctca gaagaactcg tcaagaaggc gatagaaggc ga - #tgcgctgc 300- gaatcgggag cggcgatacc gtaaagcacg aggaagcggt cagcccattc gc - #cgccaagc 360- tcttcagcaa tatcacgggt agccaacgct atgtcctgat agcggtccgc ca - #cacccagc 420- cggccacagt cgatgaatcc agaaaagcgg ccattttcca ccatgatatt cg - #gcaagcag 480- gcatcgccat gggtcacgac gagatcctcg ccgtcgggca tgcgcgcctt ga - #gcctggcg 540- aacagttcgg ctggcgcgag cccctgatgc tcttcgtcca gatcatcctg at - #cgacaaga 600- ccggcttcca tccgagtacg tgctcgctcg atgcgatgtt tcgcttggtg gt - #cgaatggg 660- caggtagccg gatcaagcgt atgcagccgc cgcattgcat cagccatgat gg - #atactttc 720- tcggcaggag caaggtgaga tgacaggaga tcctgccccg gcacttcgcc ca - #atagcagc 780- cagtcccttc ccgcttcagt gacaacgtcg agcacagctg cgcaaggaac gc - #ccgtcgtg 840- gccagccacg atagccgcgc tgcctcgtcc tgcagttcat tcagggcacc gg - #acaggtcg 900- gtcttgacaa aaagaaccgg gcgcccctgc gctgacagcc ggaacacggc gg - #catcagag 960- cagccgattg tctgttgtgc ccagtcatag ccgaatagcc tctccaccca ag - #cggccgga1020- gaacctgcgt gcaatccatc ttgttcaatc atgcgaaacg atcctcatcc tg - #tctcttga1080- tcagatccgg gacctgaaat aaaagacaaa aagactaaac ttaccagtta ac - #tttctggt1140- ttttcagttc ctcgaggagc tttttgcaaa agcctaggcc tccaaaaaag cc - #tcctcact1200- acttctggaa tagctcagag gccgaggcgg cctcggcctc tgcataaata aa - #aaaaatta1260- gtcagccatg gggcggagaa tgggcggaac tgggcggagt taggggcggg at - #gggcggag1320- ttaggggcgg gactatggtt gctgactaat tgagatgcat gctttgcata ct - #tctgcctg1380- ctggggagcc tggggacttt ccacacctgg ttgctgacta attgagatgc at - #gctttgca1440- tacttctgcc tgctggggag cctggggact ttccacaccc taactgacac ac - #attccaca1500- ggacattgat tattgactag ttagtccgcg aaatcgagat gctttgaaga tt - #aaaattaa1560- atttaatttt atgcgagact ggtttcctta ttttttgtat agtcgcatgc aa - #gcgaggtt1620- cgcataattt ggaaaataaa ggtagtcaag aagacgttga attaaggctg ca - #gtttcaaa1680- gtactctaca aacgattcct tttaaaaaaa aagattcaaa aaaaaggcaa ag - #ggtttaag1740- taatgcttgt tatttcaatt tacctccaaa cagttactaa tgcaattgca aa - #aaaaaaac1800- ctacctattg aatcaaaatt tctagcccat ccatcgctcc tcaagataaa gg - #aatcgata1860- ttttgagttt aagggagttg ctgatagatt tcagaattaa aaatttttgg aa - #aaggatgt1920- cgagaacaag aagatacgtc tagattgctg atgatgcatt ctagcagacg ga - #aatacaac1980- gatatgtgga cagcacgact tttgatccgt tcggatcaaa aggaagagaa at - #atccatct2040- ttcaagaaga atgcaggaaa agcaataaat gcccatttga ttcctaaatt at - #ccccaaaa2100- atgaacatta tgagatcttc ttgtgggaga caggaaattt cgcaattcca aa - #cgaaaatt2160- cggctctttt ttttacccca cagttgcggg gtaaatgatg taacggacct tg - #ggggaaag2220- gatgatgagt tagttgggaa gcggaaaaaa tggaaaacgg aagtaagaat ag - #aaaccagt2280- atggctgagt gcaatggcgg aaaagatttt acagagatga caagaatcta tt - #tatctata2340- aggaaaaact ttttccaaat ttgtctaaaa acgcattctc ctcaattgcc tc - #taggtaga2400- tgatataacg aattggaacg agacatcgct aaccggtttt ctttgtaaat ga - #cattttgt2460- agtgggagta agtttgaatg gagggataga cagatgaata gtatgagata ga - #agaatagt2520- atatataatg attaagatga acaaataaaa attgaaagaa aaaagaaatt gt - #tggctcat2580- ttggttcata cacatgttgg ttcatacaac ttttacccat cgtaagtatt at - #aagtaaaa2640- aatagagtac gaaaagctat aagtagtgaa gcaaaaaaat agaaaaatag aa - #aaaaaaat2700- atatataaaa aaatataata aaaataaaac tcataagaga cgtaaaacac aa - #gaattgtc2760- tatcatttgt tctttaagaa gcaccaccat tctgtaaaac tcttcatttc tc - #attagcaa2820- ggaccctttt cattccttcc tctttagaat ccttttcatt ataacgaatt gg - #ataatacg2880- caaataagaa cacatcccct aaatacgata tatcgatcca ttttttactt tg - #cctagctt2940- attgctgtac aattccattt aaatagtttc tcctcaagaa agatcgtcaa tg - #gaggcgac3000- aatataccgg aatttaagtt gcggacacag agcttgaaaa gactgcattt tg - #tattgttt3060- tcaagtaaat gaaactgagt tttgaagtct caaaatacat cttatgtatt ga - #acattaga3120- agaacatata agatagatct tgagagctca attcatcgac attctagcca tc - #atactgcg3180- atcttagaca ttgtcagcac aaccttagat cgaaaatgaa cacgttacca aa - #cgttgtct3240- aaaacttgcc gaatcttatc tccgcattac ttccgtaatc cttagtacat ac - #gctgcaat3300- ttcggaaggt catgatcgac tttttgtgta gctataagtg acgcaaatga ga - #aacatgac3360- aaggtgcgat atttagcaag atattatgca tttgatggag aaaggaaatt tc - #ggatgtat3420- atatagtacc gttagctgcg ctttttttgg tcatccataa ttttcaaact ca - #ctgctttc3480- gatcagattt accgttttta aggtctttat tgctttgtga tctgtaggtt gg - #aacatcta3540- tagttcattt tctaaaagat cctttcatcg tttcatcgga tagtaatcgt tc - #aagaaaaa3600- aaaagaaaaa aagaaaaaga aaaagaaaag aaaaataaac cgctataatt ca - #ttacctat3660- ttgactgaag gttcttcatc ttgaattgtt ttgaatcaaa ataaagaaat ta - #ttattatt3720- attttttttc ttcgcttttt ctttatccat tcgtcgaaac tatttttctg ct - #gataaaag3780- caatcattcc tttttcctgc ttctcttgtt attcgaattt taaacgactt tt - #tttcctcg3840- tccattccct aattctttgc gaccttttct gattctatcc ttggtttgta ct - #ttcgttgt3900- gtaattgttg agaaagtgaa ctgattattt aattgttgtg aaaaaaattc ta - #aaactatt3960- ttgtttttct tgatcattca tcctttgctc gcttgcttga atattacaga aa - #ttcgtctc4020- cctttcaacg gaatatgata atttgttgaa tactctaaat caattaacac ct - #atcaaaag4080- ctgaaacatt aaatctattc tcaccaaaaa aaaagactca agcttcttcg tt - #gttggccg4140- gtctcttttt tgttttacga ttgttaaatt ttatactcac aactgccaat tc - #tccacttt4200- tgactattta ttgatagtcc ctatttaatt ttctgttcac cgattatcgt ct - #tttttgta4260- aataatcttt cttggaacca accaattaat acgttataat cgctaacttt ga - #agatttgc4320- tacaatggca atggtatcag aattcgagct cggtacccgg ggatcctcta ga - #gtcgacct4380- gcaggcatgc aagcttaaat aggaaagttt cttcaacagg attacagtgt ag - #ctacctac4440- atgctgaaaa atatagcctt taaatcattt ttatattata actctgtata at - #agagataa4500- gtccattttt taaaaatgtt ttccccaaac cataaaaccc tatacaagtt gt - #tctagtaa4560- caatacatga gaaagatgtc tatgtagctg aaaataaaat gacgtcacaa ga - #caaaaaaa4620- aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa - #aaaaagta4680- ccttctgagg cggaaagaac cagccggatc cagacatgat aagatacatt ga - #tgagtttg4740- gacaaaccac aactagaatg cagtgaaaaa aatgctttat ttgtgaaatt tg - #tgatgcta4800- ttgctttatt tgtaaccatt ataagctgca ataaacaagt taacaacaac aa - #ttgcattc4860- attttatgtt tcaggttcag ggggaggtgt gggaggtttt ttaaagcaag ta - #aaacctct4920- acaaatgtgg tatggctgat tatgatccgg ctgcctcgcg cgtttcggtg at - #gacggtga4980- aaacctctga cacatgcagc tcccggagac ggtcacagct tgtctgtaag cg - #gatgccgg5040- gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg gc - #gcagccat5100- gacccagtca cgtagcgata gcggagtgta tactggctta actatgcggc at - #cagagcag5160- attgtactga gagtgcacca tatgcggtgt gaaataccgc acagatgcgt aa - #ggagaaaa5220- taccgcatca ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc gg - #tcgttcgg5280- ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac ag - #aatcaggg5340- gataacgcag gaaagaatga gcaaaaggcc agcaaaaggc caggaaccgt aa - #aaaggccg5400- cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa aa - #tcgacgct5460- caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt cc - #ccctggaa5520- gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg tc - #cgcctttc5580- tcccttcggg aagcgtggcg ctttctcata gctcacgctg taggtatctc ag - #ttcggtgt5640- aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc ga - #ccgctgcg5700- ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta tc - #gccactgg5760- cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct ac - #agagttct5820- tgaagtggtg gcctaactac ggctacacta gaaggacagt atttggtatc tg - #cgctctgc5880- tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa ca - #aaccaccg5940- ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa aa - #aggatctc6000- aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa aa - #ctcacgtt6060- aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt tt - #aaattaaa6120- aatgaagttt taaatcaatc taaagtatat atgagtaaac ttggtctgac ag - #ttaccaat6180- gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc at - #agttgcct6240- gactccccgt cgtgtagata actacgatac gggagggctt accatctggc cc - #cagtgctg6300- caatgatacc gcgagaccca cgctcaccgg ctccagattt atcagcaata aa - #ccagccag6360- ccggaagggc cgagcgcaga agtggtcctg caactttatc cgcctccatc ca - #gtctatta6420- attgttgccg ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc aa - #cgttgttg6480- ccattgctgc aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca tt - #cagctccg6540- gttcccaacg atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa gc - #ggttagct6600- ccttcggtcc tccgatcgtt gtcagaagta agttggccgc agtgttatca ct - #catggtta6660- tggcagcact gcataattct cttactgtca tgccatccgt aagatgcttt tc - #tgtgactg6720- gtgagtactc aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt tg - #ctcttgcc6780- cggcgtcaac acgggataat accgcgccac atagcagaac tttaaaagtg ct - #catcattg6840- gaaaacgttc ttcggggcga aaactctcaa ggatcttacc gctgttgaga tc - #cagttcga6900- tgtaacccac tcgtgcaccc aactgatctt cagcatcttt tactttcacc ag - #cgtttctg6960- ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg ac - #acggaaat7020- gttgaatact catactcttc ctttttcaat attattgaag catttatcag gg - #ttattgtc7080- tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggg gt - #tccgcgca7140- catttccccg aaaagtgcca cctgacgtct aagaaaccat tattatcatg ac - #attaacct7200- ataaaaatag gcgtatcacg aggccctttc gtcttcaaga attggtcgac ca - #attctcat7260# 7286 tcga taagct- <210> SEQ ID NO 4<211> LENGTH: 27<212> TYPE: DNA<213> ORGANISM: Artificial Sequence<220> FEATURE:#Sequence:DNANFORMATION: Description of Artificial- <400> SEQUENCE: 4# 27 gayg gnaaygg- <210> SEQ ID NO 5<211> LENGTH: 24<212> TYPE: DNA<213> ORGANISM: Artificial Sequence<220> FEATURE:#Sequence:DNANFORMATION: Description of Artificial<220> FEATURE:#A23> OTHER INFORMATION: N is G, C, T, or- <400> SEQUENCE: 5# 24ggnc aygc- <210> SEQ ID NO 6<211> LENGTH: 27<212> TYPE: DNA<213> ORGANISM: Artificial Sequence<220> FEATURE:#Sequence:DNANFORMATION: Description of Artificial<220> FEATURE:#A23> OTHER INFORMATION: N is G, C, T, or- <400> SEQUENCE: 6# 27 ggrt cncgraa- <210> SEQ ID NO 7<211> LENGTH: 24<212> TYPE: DNA<213> ORGANISM: Artificial Sequence<220> FEATURE:#Sequence:DNANFORMATION: Description of Artificial- <400> SEQUENCE: 7# 24atga acca- <210> SEQ ID NO 8<211> LENGTH: 23<212> TYPE: DNA<213> ORGANISM: Artificial Sequence<220> FEATURE:#Sequence:DNANFORMATION: Description of Artificial- <400> SEQUENCE: 8# 23ccc aca- <210> SEQ ID NO 9<211> LENGTH: 29<212> TYPE: DNA<213> ORGANISM: Artificial Sequence<220> FEATURE:#Sequence:DNANFORMATION: Description of Artificial- <400> SEQUENCE: 9# 29 cgaa atcgagatg- <210> SEQ ID NO 10<211> LENGTH: 42<212> TYPE: DNA<213> ORGANISM: Artificial Sequence<220> FEATURE:#Sequence:DNANFORMATION: Description of Artificial- <400> SEQUENCE: 10# 42 attg ccattgtagc aaatcttcaa ag- <210> SEQ ID NO 11<211> LENGTH: 30<212> TYPE: DNA<213> ORGANISM: Artificial Sequence<220> FEATURE:#Sequence:DNANFORMATION: Description of Artificial- <400> SEQUENCE: 11# 30 gcaa atcttcaaag- <210> SEQ ID NO 12<211> LENGTH: 27<212> TYPE: DNA<213> ORGANISM: Artificial Sequence<220> FEATURE:#Sequence:DNANFORMATION: Description of Artificial- <400> SEQUENCE: 12# 27 agga gctgttc- <210> SEQ ID NO 13<211> LENGTH: 27<212> TYPE: DNA<213> ORGANISM: Artificial Sequence<220> FEATURE:#Sequence:DNANFORMATION: Description of Artificial- <400> SEQUENCE: 13# 27 cagc tcgtcca- <210> SEQ ID NO 14<211> LENGTH: 7938<212> TYPE: DNA<213> ORGANISM: Artificial Sequence<220> FEATURE:#Sequence:DNANFORMATION: Description of Artificial- <400> SEQUENCE: 14- agcttgaaaa aacctcccac acctccccct gaacctgaaa cataaaatga at - #gcaattgt 60- tgttgttaac ttgtttattg cagcttataa tggttacaaa taaagcaata gc - #atcacaaa 120- tttcacaaat aaagcatttt tttcactgca ttctagttgt ggtttgtcca aa - #ctcatcaa 180- tgtatcttat catgtctgga tcgatcccgg caggttgggc gtcgcttggt cg - #gtcatttc 240- gaaccccaga gtcccgctca gaagaactcg tcaagaaggc gatagaaggc ga - #tgcgctgc 300- gaatcgggag cggcgatacc gtaaagcacg aggaagcggt cagcccattc gc - #cgccaagc 360- tcttcagcaa tatcacgggt agccaacgct atgtcctgat agcggtccgc ca - #cacccagc 420- cggccacagt cgatgaatcc agaaaagcgg ccattttcca ccatgatatt cg - #gcaagcag 480- gcatcgccat gggtcacgac gagatcctcg ccgtcgggca tgcgcgcctt ga - #gcctggcg 540- aacagttcgg ctggcgcgag cccctgatgc tcttcgtcca gatcatcctg at - #cgacaaga 600- ccggcttcca tccgagtacg tgctcgctcg atgcgatgtt tcgcttggtg gt - #cgaatggg 660- caggtagccg gatcaagcgt atgcagccgc cgcattgcat cagccatgat gg - #atactttc 720- tcggcaggag caaggtgaga tgacaggaga tcctgccccg gcacttcgcc ca - #atagcagc 780- cagtcccttc ccgcttcagt gacaacgtcg agcacagctg cgcaaggaac gc - #ccgtcgtg 840- gccagccacg atagccgcgc tgcctcgtcc tgcagttcat tcagggcacc gg - #acaggtcg 900- gtcttgacaa aaagaaccgg gcgcccctgc gctgacagcc ggaacacggc gg - #catcagag 960- cagccgattg tctgttgtgc ccagtcatag ccgaatagcc tctccaccca ag - #cggccgga1020- gaacctgcgt gcaatccatc ttgttcaatc atgcgaaacg atcctcatcc tg - #tctcttga1080- tcagatccgg gacctgaaat aaaagacaaa aagactaaac ttaccagtta ac - #tttctggt1140- ttttcagttc ctcgaggagc tttttgcaaa agcctaggcc tccaaaaaag cc - #tcctcact1200- acttctggaa tagctcagag gccgaggcgg cctcggcctc tgcataaata aa - #aaaaatta1260- gtcagccatg gggcggagaa tgggcggaac tgggcggagt taggggcggg at - #gggcggag1320- ttaggggcgg gactatggtt gctgactaat tgagatgcat gctttgcata ct - #tctgcctg1380- ctggggagcc tggggacttt ccacacctgg ttgctgacta attgagatgc at - #gctttgca1440- tacttctgcc tgctggggag cctggggact ttccacaccc taactgacac ac - #attccaca1500- ggacattgat tattgactag ttagtccgcg aaatcgagat gctttgaaga tt - #aaaattaa1560- atttaatttt atgcgagact ggtttcctta ttttttgtat agtcgcatgc aa - #gcgaggtt1620- cgcataattt ggaaaataaa ggtagtcaag aagacgttga attaaggctg ca - #gtttcaaa1680- gtactctaca aacgattcct tttaaaaaaa aagattcaaa aaaaaggcaa ag - #ggtttaag1740- taatgcttgt tatttcaatt tacctccaaa cagttactaa tgcaattgca aa - #aaaaaaac1800- ctacctattg aatcaaaatt tctagcccat ccatcgctcc tcaagataaa gg - #aatcgata1860- ttttgagttt aagggagttg ctgatagatt tcagaattaa aaatttttgg aa - #aaggatgt1920- cgagaacaag aagatacgtc tagattgctg atgatgcatt ctagcagacg ga - #aatacaac1980- gatatgtgga cagcacgact tttgatccgt tcggatcaaa aggaagagaa at - #atccatct2040- ttcaagaaga atgcaggaaa agcaataaat gcccatttga ttcctaaatt at - #ccccaaaa2100- atgaacatta tgagatcttc ttgtgggaga caggaaattt cgcaattcca aa - #cgaaaatt2160- cggctctttt ttttacccca cagttgcggg gtaaatgatg taacggacct tg - #ggggaaag2220- gatgatgagt tagttgggaa gcggaaaaaa tggaaaacgg aagtaagaat ag - #aaaccagt2280- atggctgagt gcaatggcgg aaaagatttt acagagatga caagaatcta tt - #tatctata2340- aggaaaaact ttttccaaat ttgtctaaaa acgcattctc ctcaattgcc tc - #taggtaga2400- tgatataacg aattggaacg agacatcgct aaccggtttt ctttgtaaat ga - #cattttgt2460- agtgggagta agtttgaatg gagggataga cagatgaata gtatgagata ga - #agaatagt2520- atatataatg attaagatga acaaataaaa attgaaagaa aaaagaaatt gt - #tggctcat2580- ttggttcata cacatgttgg ttcatacaac ttttacccat cgtaagtatt at - #aagtaaaa2640- aatagagtac gaaaagctat aagtagtgaa gcaaaaaaat agaaaaatag aa - #aaaaaaat2700- atatataaaa aaatataata aaaataaaac tcataagaga cgtaaaacac aa - #gaattgtc2760- tatcatttgt tctttaagaa gcaccaccat tctgtaaaac tcttcatttc tc - #attagcaa2820- ggaccctttt cattccttcc tctttagaat ccttttcatt ataacgaatt gg - #ataatacg2880- caaataagaa cacatcccct aaatacgata tatcgatcca ttttttactt tg - #cctagctt2940- attgctgtac aattccattt aaatagtttc tcctcaagaa agatcgtcaa tg - #gaggcgac3000- aatataccgg aatttaagtt gcggacacag agcttgaaaa gactgcattt tg - #tattgttt3060- tcaagtaaat gaaactgagt tttgaagtct caaaatacat cttatgtatt ga - #acattaga3120- agaacatata agatagatct tgagagctca attcatcgac attctagcca tc - #atactgcg3180- atcttagaca ttgtcagcac aaccttagat cgaaaatgaa cacgttacca aa - #cgttgtct3240- aaaacttgcc gaatcttatc tccgcattac ttccgtaatc cttagtacat ac - #gctgcaat3300- ttcggaaggt catgatcgac tttttgtgta gctataagtg acgcaaatga ga - #aacatgac3360- aaggtgcgat atttagcaag atattatgca tttgatggag aaaggaaatt tc - #ggatgtat3420- atatagtacc gttagctgcg ctttttttgg tcatccataa ttttcaaact ca - #ctgctttc3480- gatcagattt accgttttta aggtctttat tgctttgtga tctgtaggtt gg - #aacatcta3540- tagttcattt tctaaaagat cctttcatcg tttcatcgga tagtaatcgt tc - #aagaaaaa3600- aaaagaaaaa aagaaaaaga aaaagaaaag aaaaataaac cgctataatt ca - #ttacctat3660- ttgactgaag gttcttcatc ttgaattgtt ttgaatcaaa ataaagaaat ta - #ttattatt3720- attttttttc ttcgcttttt ctttatccat tcgtcgaaac tatttttctg ct - #gataaaag3780- caatcattcc tttttcctgc ttctcttgtt attcgaattt taaacgactt tt - #tttcctcg3840- tccattccct aattctttgc gaccttttct gattctatcc ttggtttgta ct - #ttcgttgt3900- gtaattgttg agaaagtgaa ctgattattt aattgttgtg aaaaaaattc ta - #aaactatt3960- ttgtttttct tgatcattca tcctttgctc gcttgcttga atattacaga aa - #ttcgtctc4020- cctttcaacg gaatatgata atttgttgaa tactctaaat caattaacac ct - #atcaaaag4080- ctgaaacatt aaatctattc tcaccaaaaa aaaagactca agcttcttcg tt - #gttggccg4140- gtctcttttt tgttttacga ttgttaaatt ttatactcac aactgccaat tc - #tccacttt4200- tgactattta ttgatagtcc ctatttaatt ttctgttcac cgattatcgt ct - #tttttgta4260- aataatcttt cttggaacca accaattaat acgttataat cgctaacttt ga - #agatttgc4320- tacaatggct agcaagggcg aggagctgtt caccggggtg gtgcccatcc tg - #gtcgagct4380- ggacggcgac gtaaacggcc acaagttcag cgtgtccggc gagggcgagg gc - #gatgccac4440- ctacggcaag ctgaccctga agttcatctg caccaccggc aagctgcccg tg - #ccctggcc4500- caccctcgtg accaccctga cctacggcgt gcagtgcttc agccgctacc cc - #gaccacat4560- gaagcagcac gacttcttca agtccgccat gcccgaaggc tacgtccagg ag - #cgcaccat4620- cttcttcaag gacgacggca actacaagac ccgcgccgag gtgaagttcg ag - #ggcgacac4680- cctggtgaac cgcatcgagc tgaagggcat cgacttcaag gaggacggca ac - #atcctggg4740- gcacaagctg gagtacaact acaacagcca caacgtctat atcatggccg ac - #aagcagaa4800- gaacggcatc aaggtgaact tcaagatccg ccacaacatc gaggacggca gc - #gtgcagct4860- cgccgaccac taccagcaga acacccccat cggcgacggc cccgtgctgc tg - #cccgacaa4920- ccactacctg agcacccagt ccgccctgag caaagacccc aacgagaagc gc - #gatcacat4980- ggtcctgctg gagttcgtga ccgccgccgg gatcactctc ggcatggacg ag - #ctgtacaa5040- gtaagcttaa ataggaaagt ttcttcaaca ggattacagt gtagctacct ac - #atgctgaa5100- aaatatagcc tttaaatcat ttttatatta taactctgta taatagagat aa - #gtccattt5160- tttaaaaatg ttttccccaa accataaaac cctatacaag ttgttctagt aa - #caatacat5220- gagaaagatg tctatgtagc tgaaaataaa atgacgtcac aagacaaaaa aa - #aaaaaaaa5280- aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaag ta - #ccttctga5340- ggcggaaaga accagccgga tccagacatg ataagataca ttgatgagtt tg - #gacaaacc5400- acaactagaa tgcagtgaaa aaaatgcttt atttgtgaaa tttgtgatgc ta - #ttgcttta5460- tttgtaacca ttataagctg caataaacaa gttaacaaca acaattgcat tc - #attttatg5520- tttcaggttc agggggaggt gtgggaggtt ttttaaagca agtaaaacct ct - #acaaatgt5580- ggtatggctg attatgatcc ggctgcctcg cgcgtttcgg tgatgacggt ga - #aaacctct5640- gacacatgca gctcccggag acggtcacag cttgtctgta agcggatgcc gg - #gagcagac5700- aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg gggcgcagcc at - #gacccagt5760- cacgtagcga tagcggagtg tatactggct taactatgcg gcatcagagc ag - #attgtact5820- gagagtgcac catatgcggt gtgaaatacc gcacagatgc gtaaggagaa aa - #taccgcat5880- caggcgctct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc gg - #ctgcggcg5940- agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag gg - #gataacgc6000- aggaaagaat gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc cg - #cgttgctg6060- gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg ct - #caagtcag6120- aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg aa - #gctccctc6180- gtgcgctctc ctgttccgac cctgccgctt accggatacc tgtccgcctt tc - #tcccttcg6240- ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc tcagttcggt gt - #aggtcgtt6300- cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg cg - #ccttatcc6360- ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact gg - #cagcagcc6420- actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt ct - #tgaagtgg6480- tggcctaact acggctacac tagaaggaca gtatttggta tctgcgctct gc - #tgaagcca6540- gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac cg - #ctggtagc6600- ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tc - #aagaagat6660- cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg tt - #aagggatt6720- ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta aa - #aatgaagt6780- tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca at - #gcttaatc6840- agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc ct - #gactcccc6900- gtcgtgtaga taactacgat acgggagggc ttaccatctg gccccagtgc tg - #caatgata6960- ccgcgagacc cacgctcacc ggctccagat ttatcagcaa taaaccagcc ag - #ccggaagg7020- gccgagcgca gaagtggtcc tgcaacttta tccgcctcca tccagtctat ta - #attgttgc7080- cgggaagcta gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt tg - #ccattgct7140- gcaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc cg - #gttcccaa7200- cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa aagcggttag ct - #ccttcggt7260- cctccgatcg ttgtcagaag taagttggcc gcagtgttat cactcatggt ta - #tggcagca7320- ctgcataatt ctcttactgt catgccatcc gtaagatgct tttctgtgac tg - #gtgagtac7380- tcaaccaagt cattctgaga atagtgtatg cggcgaccga gttgctcttg cc - #cggcgtca7440- acacgggata ataccgcgcc acatagcaga actttaaaag tgctcatcat tg - #gaaaacgt7500- tcttcggggc gaaaactctc aaggatctta ccgctgttga gatccagttc ga - #tgtaaccc7560- actcgtgcac ccaactgatc ttcagcatct tttactttca ccagcgtttc tg - #ggtgagca7620- aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa at - #gttgaata7680- ctcatactct tcctttttca atattattga agcatttatc agggttattg tc - #tcatgagc7740- ggatacatat ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg ca - #catttccc7800- cgaaaagtgc cacctgacgt ctaagaaacc attattatca tgacattaac ct - #ataaaaat7860- aggcgtatca cgaggccctt tcgtcttcaa gaattggtcg accaattctc at - #gtttgaca7920#7938 ct- <210> SEQ ID NO 15<211> LENGTH: 29<212> TYPE: DNA<213> ORGANISM: Artificial Sequence<220> FEATURE:#Sequence:DNANFORMATION: Description of Artificial- <400> SEQUENCE: 15# 29 cgaa atcgagatg- <210> SEQ ID NO 16<211> LENGTH: 42<212> TYPE: DNA<213> ORGANISM: Artificial Sequence<220> FEATURE:#Sequence:DNANFORMATION: Description of Artificial- <400> SEQUENCE: 16# 42 attg ccattgtagc aaatcttcaa ag- <210> SEQ ID NO 17<211> LENGTH: 28<212> TYPE: DNA<213> ORGANISM: Artificial Sequence<220> FEATURE:#Sequence:DNANFORMATION: Description of Artificial- <400> SEQUENCE: 17# 28 atat attttagc- <210> SEQ ID NO 18<211> LENGTH: 27<212> TYPE: DNA<213> ORGANISM: Artificial Sequence<220> FEATURE:#Sequence:DNANFORMATION: Description of Artificial- <400> SEQUENCE: 18# 27 ccag atagtct- <210> SEQ ID NO 19<211> LENGTH: 20<212> TYPE: DNA<213> ORGANISM: Artificial Sequence<220> FEATURE:#Sequence:DNANFORMATION: Description of Artificial- <400> SEQUENCE: 19# 20 agta- <210> SEQ ID NO 20<211> LENGTH: 29<212> TYPE: DNA<213> ORGANISM: Artificial Sequence<220> FEATURE:#Sequence:DNANFORMATION: Description of Artificial- <400> SEQUENCE: 20# 29 ttta catataagt- <210> SEQ ID NO 21<211> LENGTH: 30<212> TYPE: DNA<213> ORGANISM: Artificial Sequence<220> FEATURE:#Sequence:DNANFORMATION: Description of Artificial- <400> SEQUENCE: 21# 30 cccc aacctcttca- <210> SEQ ID NO 22<211> LENGTH: 20<212> TYPE: DNA<213> ORGANISM: Artificial Sequence<220> FEATURE:#Sequence:DNANFORMATION: Description of Artificial- <400> SEQUENCE: 22# 20 tata- <210> SEQ ID NO 23<211> LENGTH: 332<212> TYPE: PRT<213> ORGANISM: Schwanniomyces occidentalis- <400> SEQUENCE: 23- Pro Leu Thr Thr Thr Phe Phe Gly Tyr Val Al - #a Ser Ser Ser Ile Asp# 15- Leu Ser Val Asp Thr Ser Glu Tyr Asn Arg Pr - #o Leu Ile His Phe Thr# 30- Pro Glu Lys Gly Trp Met Asn Asp Pro Asn Gl - #y Thr Phe Tyr Asp Lys# 45- Thr Ala Lys Thr Trp His Leu Tyr Phe Gln Ty - #r Asn Pro Asn Ala Thr# 60- Ala Trp Gly Gln Pro Leu Tyr Trp Gly His Al - #a Thr Ser Asn Asp Leu# 80- Val His Trp Asp Glu His Glu Met Ala Ile Gl - #y Pro Glu His Asp Asn# 95- Glu Gly Ile Phe Ser Gly Ser Ile Val Val As - #p His Asn Asn Thr Ser# 110- Gly Phe Phe Asn Ser Ser Ile Asp Pro Asn Gl - #n Arg Ile Val Ala Ile# 125- Tyr Thr Asn Asn Met Pro Asp Leu Gln Thr Gl - #n Asp Ile Ala Phe Ser# 140- Leu Asp Gly Gly Tyr Thr Phe Thr Lys Tyr Gl - #u Asn Asn Pro Val Ile145 1 - #50 1 - #55 1 -#60- Asp Val Ser Ser Asn Gln Phe Arg Asp Pro Ly - #s Val Phe Trp His Glu# 175- Arg Phe Lys Ser Met Asp His Gly Cys Ser Gl - #u Ile Ala Arg Val Lys# 190- Ile Gln Ile Phe Gly Ser Ala Asn Leu Lys As - #n Trp Val Leu Asn Ser# 205- Asn Phe Ser Ser Gly Tyr Tyr Gly Asn Gln Ty - #r Gly Met Ser Arg Leu# 220- Ile Glu Val Pro Ile Glu Asn Ser Asp Lys Se - #r Lys Trp Val Met Phe225 2 - #30 2 - #35 2 -#40- Leu Ala Ile Asn Pro Gly Ser Pro Leu Gly Gl - #y Ser Ile Asn Gln Tyr# 255- Phe Val Gly Asp Phe Asp Gly Phe Gln Phe Va - #l Pro Asp Asp Ser Gln# 270- Thr Arg Phe Val Asp Ile Gly Lys Asp Phe Ty - #r Ala Phe Gln Thr Phe# 285- Ser Glu Val Glu His Gly Val Leu Gly Leu Al - #a Trp Ala Ser Asn Trp# 300- Gln Tyr Ala Asp Gln Val Pro Thr Asn Pro Tr - #p Arg Ser Ser Thr Ser305 3 - #10 3 - #15 3 -#20- Leu Ala Arg Asn Tyr Thr Leu Arg Tyr Val Me - #t Gln# 330- <210> SEQ ID NO 24<211> LENGTH: 337<212> TYPE: PRT<213> ORGANISM: Saccharomyces cerevisiae- <400> SEQUENCE: 24- Leu Gln Ala Phe Thr Phe Thr Leu Ala Gly Ph - #e Ala Ala Lys Met Ser# 15- Ala Ser Met Thr Asn Glu Thr Ser Asp Arg Pr - #o Leu Val His Phe Thr# 30- Pro Asn Lys Gly Trp Met Asn Asp Pro Asn Gl - #y Leu Trp Tyr Asp Glu# 45- Lys Asp Ala Lys Trp His Thr Tyr Phe Gln Ty - #r Asn Pro Asn Asp Thr# 60- Val Trp Gly Thr Pro Leu Phe Trp Gly His Al - #a Thr Ser Asp Asp Leu# 80- Thr Asn Trp Glu Asp Gln Pro Ile Ala Ile Al - #a Pro Lys Arg Asn Asp# 95- Ser Gly Ala Phe Ser Gly Ser Met Val Val As - #p Tyr Asn Asn Thr Ser# 110- Gly Phe Phe Asn Asp Thr Ile Asp Pro Arg Gl - #n Arg Cys Val Ala Ile# 125- Trp Thr Tyr Asn Thr Pro Glu Ser Glu Glu Gl - #n Tyr Ile Ser Tyr Ser# 140- Thr Asp Gly Gly Tyr Thr Phe Thr Glu Tyr Gl - #n Lys Asn Pro Val Leu145 1 - #50 1 - #55 1 -#60- Ala Ala Asn Ser Thr Gln Phe Arg Asp Pro Ly - #s Val Phe Trp Tyr Glu# 175- Pro Ser Gln Lys Trp Ile Met Thr Ala Ala Ly - #s Ser Gln Asp Tyr Lys# 190- Ile Glu Ile Tyr Ser Ser Asp Asp Leu Lys Se - #r Trp Lys Thr Glu Ser# 205- Ala Phe Ala Asn Glu Gly Phe Leu Gly Tyr Gl - #n Tyr Glu Cys Pro Gly# 220- Leu Ile Glu Val Pro Thr Glu Gln Asp Pro Se - #r Lys Ser Tyr Trp Val225 2 - #30 2 - #35 2 -#40- Met Phe Ile Ser Ile Asn Pro Gly Ala Pro Al - #a Gly Gly Ser Phe Asn# 255- Gln Tyr Phe Val Gly Ser Phe Asn Gly Thr Hi - #s Phe Glu Ala Phe Asp# 270- Asn Gln Ser Arg Val Val Asp Phe Gly Lys As - #p Tyr Tyr Ala Leu Gln# 285- Thr Phe Phe Asn Thr Asp Pro Thr Tyr Gly Se - #r Ala Leu Gly Ile Ala# 300- Trp Ala Ser Asn Trp Glu Tyr Ser Ala Phe Va - #l Pro Thr Asn Pro Trp305 3 - #10 3 - #15 3 -#20- Arg Ser Ser Met Ser Leu Val Arg Lys Phe Se - #r Leu Asn Thr Glu Tyr# 335- Gln__________________________________________________________________________
Claims
  • 1. An isolated DNA having the base sequence of bases 1 to 2809 in SEQ ID NO: 1 in the Sequence Listing.
  • 2. An isolated DNA encoding a polypeptide consisting essentially of amino acids 1-22 of SEQ ID NO: 2.
  • 3. A recombinant vector containing the sequence of the DNA according to claim 1 or 2.
  • 4. A cloning vector containing the sequence of the DNA according to claim 1 or 2 and a multicloning site.
  • 5. A cloning vector having the structure shown in FIG. 9.
  • 6. An expression vector containing the sequence of the DNA according to claim 1 or 2 and a heterologous protein structural gene.
  • 7. A Schizosaccharomyces pombe transformant carrying the expression vector according to claim 6.
  • 8. A process for producing a protein which comprises incubating the transformant according to claim 7 and recovering an expressed heterologous protein.
  • 9. The DNA of claim 2, wherein the polypeptide consists of amino acids 1-22 of SEQ ID NO: 2.
  • 10. An expression vector containing the DNA of claim 9 and a heterologous protein structural gene.
  • 11. A Schizosaccharomyces pombe transformant carrying the expression vector of claim 10.
  • 12. A method of producing a protein, comprising incubating the transformant of claim 11 and recovering an expressed heterologous protein.
Priority Claims (1)
Number Date Country Kind
9-314608 Oct 1997 JPX
Parent Case Info

This application is a National Stage of International Application PCT/JP98/04929, filed Oct. 30, 1998.

PCT Information
Filing Document Filing Date Country Kind 102e Date 371c Date
PCT/JP98/04929 10/30/1998 6/30/1999 6/30/1999
Publishing Document Publishing Date Country Kind
WO99/23223 5/14/1999
US Referenced Citations (2)
Number Name Date Kind
5817478 Tohda et al. Oct 1998
5919654 Hama et al. Jul 1999
Non-Patent Literature Citations (5)
Entry
N. Tanaka et al., "Isolation and Characterization of an Invertase and its Repressor Genes from Schizosaccharomyces Pombe", Biochem. Biophys. Res. Commun. vol. 245, pp. 246-253, Apr. 1998.
S. Moreno et al., "Purification and Characterization of the Invertase from Schizosaccharomyces", Biochem. J., vol. 267, pp. 697-702, 1990.
S. Yoshioka et al., "Identification of Open Reading Frames in Schizosaccharomyces Pombe cDNAS", DNA Res., vol. 4, pp. 363-369, Dec. 1997.
J.A. Perez et al., "Cloning and Sequence Analysis of the Invertase Gene INV1 from the Yeast Pichia Anomala", Curr. Genet., vol. 29, pp. 234-240, 1996.
L. Salokin et al., "Short Repeated Elements in the Upstream Regulatory Region of the SUC2 Gene of Saccharomyces Cerevisiae", Mol. Cell. Biol., vol. 6, pp. 2324-2333, 1986.