.alpha.-1,4-glucan lyase from a fungus, its purification gene cloning and expression in microorganisms

Information

  • Patent Grant
  • 5908760
  • Patent Number
    5,908,760
  • Date Filed
    Monday, July 8, 1996
    28 years ago
  • Date Issued
    Tuesday, June 1, 1999
    25 years ago
Abstract
A method of preparing .varies.-glucan lyase enzymes is described. The method comprises isolating the enzymes from a culture of fungus wherein the culture is substantially free of any other organisms. Also described are the amino acid sequences for the enzymes and their coding sequences.
Description

The present invention relates to an enzyme, in particular .alpha.-1,4-glucan lyase ("GL"). The present invention also relates to a method of extracting same.
FR-A-2617502 and Baute et al in Phytochemistry �1988! vol. 27 No. 11 pp 3401-3403 report on the production of 1,5-D-anhydrofructose ("AF") in Morchella vulganis by an apparent enzymatic reaction. The yield of production of AF is quite low. Despite a reference to a possible enzymatic reaction, neither of these two documents presents any amino acid sequence data for any enzyme let alone any nucleotide sequence information. These documents say that AF can be a precursor for the preparation of the antibiotic pyrone microthecin.
Yu et al in Biochimica et Biophysica Acta �1993! vol 1156 pp 313-320 report on the preparation of GL from red seaweed and its use to degrade .alpha.-1,4-glucan to produce AF. The yield of production of AF is quite low. Despite a reference to the enzyme GL this document does not present any amino acid sequence data for that enzyme let alone any nucleotide sequence information coding for the same. This document also suggests that the source of GL is just algal.
According to the present invention there is provided a method of preparing the enzyme .alpha.-1,4-glucan lyase comprising isolating the enzyme from a culture of a fungus wherein the culture is substantially free of any other organism.
Preferably the enzyme is isolated and/or further purified using a gel that is not degraded by the enzyme.
Preferably the gel is based on dextrin or derivatives thereof, preferably a cyclodextrin, more preferably beta-cyclodextrin.
According to the present invention there is also provided a GL enzyme prepared by the method of the present invention.
Preferably the fungus is Morchella costata or Morchella vulgaris.
Preferably the enzyme comprises the amino acid sequence SEQ. ID. No. 1 or SEQ. I.D. No. 2, or any variant thereof.
The term "any variant thereof" means any substitution of, variation of, modification of, replacement of, deletion of or addition of an amino acid from or to the sequence providing the resultant enzyme has lyase activity.
According to the present invention there is also provided a nucleotide sequence coding for the enzyme .alpha.-1,4-glucan lyase, preferably wherein the sequence is not in its natural enviroment (i.e. it does not form part of the natural genome of a cellular organism expressing the enzyme).
Preferably the nucleotide sequence is a DNA sequence.
Preferably the DNA sequence comprises a sequence that is the same as, or is complementary to, or has substantial homology with, or contains any suitable codon substitution(s) for any of those of, SEQ. ID. No. 3 or SEQ. ID. No. 4.
The expression "substantial homology" covers homology with respect to structure and/or nucleotide components and/or biological activity.
The expression "contains any suitable codon substitutions" covers any codon replacement or substitution with another codon coding for the same amino acid or any addition or removal thereof providing the resultant enzyme has lyase activity.
In other words, the present invention also covers a modified DNA sequence in which at least one nucleotide has been deleted, substituted or modified or in which at least one additional nucleotide has been inserted so as to encode a polypeptide having the activity of a glucan lyase, preferably an enzyme having an increased lyase activity.
According to the present invention there is also provided a method of preparing the enzyme .alpha.-1,4-glucan lyase comprising expressing the nucleotide sequence of the present invention.
According to the present invention there is also provided the use of beta-cyclodextrin to purify an enzyme, preferably GL.
According to the present invention there is also provided a nucleotide sequence wherein the DNA sequence is made up of at least a sequence that is the same as, or is complementary to, or has substantial homology with, or contains any suitable codon substitutions for any of those of, SEQ. ID. No. 3 or SEQ. ID. No. 4, preferably wherein the sequence is in isolated form.
The present invention therefore relates to the isolation of the enzyme .alpha.-1,4-glucan lyase from a fungus. For example, the fungus can be any one of Discina perlata, Discina parma, Gyromitra gigas, Gyromitra infula, Mitrophora hybrida, Morchella conica, Morchella costata, Morchella elata, Morchella hortensis, Morchella rotunda, Morchella vulgaris, Peziza badia, Sarcosphaera eximia, Disciotis venosa, Gyromitra esculenta, Helvella crispa, Helvella lacunosa, Leptopodia elastica, Verpa digitaliformis, and other forms of Morchella. Preferably the fungus is Morchella costata or Morchella vulgaris.
The initial enzyme purification can be performed by the method as described by Yu et al (ibid).
However, preferably, the initial enzyme purification includes an optimized procedure in which a solid support is used that does not decompose under the purification step. This gel support further has the advantage that it is compatible with standard laboratory protein purification equipment.
The details of this optimized purification strategy are given later on. The purification is terminated by known standard techniques for protein purification.
The purity of the enzyme can be readily established using complementary electrophoretic techniques.
The purified lyase GL has been characterized according to pI, temperature- and pH-optima.
In this regard the fungal lyase shows a pI around 5.4 as determined by isoelectric focusing on gels with pH gradient of 3 to 9. The molecular weight determined by SDS-PAGE on 8-25% gradient gels was 110 kDa. The enzyme exhibits a pH optimum in the range pH 5-7. The temperature optimum was found to lay between 30-45.degree. C.
______________________________________GL sources Optimal pH Optimal pH range Optimal temperature______________________________________M. costata 6.5 5.5-7.5 37 C; 40 C.sup.aM. vulgaris 6.4 5.9-7.6 43 C; 48 C.sup.a______________________________________
Parameters determined using glycogen as substrate; other parameters determined using amylopectin as substrate.
In a preferred embodiment the .alpha.-1,4-glucan lyase is purified from the fungus Morchella costata by affinity chromatography on .beta.-cyclodextrin Sepharose, ion exchange on Mono Q HR 5/5 and gel filtration on Superose 12 columns.
PAS staining indicates that the fungal lyase was not glycosylated. In the cell-free fungus extract, only one form of .alpha.-1,4-glucan lyase was detected by activity gel staining on electrophoresis gels.
The enzyme should preferably be secreted to ease its purification. To do so the DNA encoding the mature enzyme is fused to a signal sequence, a promoter and a terminator from the chosen host.
For expression in Aspergillus niger the gpdA (from the Glyceraldehyde-3-phosphate dehydrogenase gene of Aspergillus nidulans) promoter and signal sequence is fused to the 5' end of the DNA encoding the mature lyase--such as SEQ I.D. No. 3 or SEQ. I.D. No. 4. The terminator sequence from the A. niger trpC gene is placed 3' to the gene (Punt, P. J. et al (1991): J. Biotech. 17, 19-34). This construction is inserted into a vector containing a replication origin and selection origin for E. coli and a selection marker for A. niger. Examples of selection markers for A. niger are the amdS gene, the argB gene, the pyrG gene, the hygB gene, the BmlR gene which all have been used for selection of transformants. This plasmid can be transformed into A. niger and the mature lyase can be recovered from the culture medium of the transformants.
The construction can be transformed into a protease deficient strain to reduce the proteolytic degradation of the lyase in the culture medium (Archer D. B. et al (1992): Biotechnol. Lett. 14, 357-362).
The amino acid composition can be established according to the method of Barholt and Jensen (Anal Biochem �1989! vol 177 pp 318-322). The sample for the amino acid analysis of the purified enzyme can contain 69 ug/ml protein.
The amino acid sequence of the GL enzymes according to the present invention are shown in SEQ. I.D. No. 1 and SEQ. I.D. No. 2.
The following samples were deposited in accordance with the Budapest Treaty at the recognised depositary The National Collections of Industrial and Marine Bacteria Limited (NCIMB) at 23 St. Machar Drive, Aberdeen, Scotland, United Kingdom, AB2 1RY on Oct. 3, 1994:
E. Coli containing plasmid pMC (NCIMB 40687)--�ref. DH5alpha-pMC!;
E. Coli containing plasmid pMV1 (NCIMB 40688)--�ref. DH5alpha-pMV1!; and
E. Coli containing plasmid pMV2 (NCIMB 40689)--�ref. DH5alpha-pMV2!.
Plasmid pMC is a pBluescript II KS containing a 4.1 kb fragment isolated from a genomic library constructed from Morchella costata. The fragment contains a gene coding for .alpha.-1,4-glucan lyase.
Plasmid pMV1 is a pBluescript II KS containing a 2.45 kb fragment isolated from a genomic library constructed from Morchella vulgaris. The fragment contains the 5' end of a gene coding for .alpha.-1,4-glucan lyase.
Plasmid MV2 is a pPUC19 containing a 3.1 kb fragment isolated from a genomic library constructed from Morchella vulgaris. The fragment contains the 3' end of a gene coding for .alpha.-1,4-glucan lyase.
In the following discussion, MC represents Morchella costata and MV represents Morchella vulgaris.
As mentioned, the GL coding sequence from Morchella vulgaris was contained in two plasmids. With reference to FIG. 5 (discussed later) pMV1 contains the nucleotides from position 454 to position 2902; and pMV2 contains the nucleotides downstream from (and including) position 2897. With reference to FIGS. 2 and 3 (discussed later), to ligate the coding sequences one can digest pMV2 with restriction enzymes EcoRI and BamHI and then insert the relevant fragment into pMV1 digested with restriction enzymes EcoRI and BamHI.
Thus highly preferred embodiments of the present invention include a GL enzyme obtainable from the expression of the GL coding sequences present in plasmids that are the subject of either deposit NCIMB 40687 or deposit NCIMB 40688 and deposit NCIMB 40689.
The present invention will now be described only by way of example.





BRIEF DESCRIPTION OF THE DRAWINGS
In the following Examples reference is made to the accompanying figures in which:
FIG. 1 shows a plasmid map of pMC;
FIG. 2 shows a plasmid map of pMV1;
FIG. 3 shows a plasmid map of pMV2;
FIG. 4 shows the GL coding sequence and part of the 5' and 3' non-translated regions for genomic DNA obtained from Morchella costata;
FIG. 5 shows the GL coding sequence and part of the 5' and 3' non-translated regions for genomic DNA obtained from Morchella vulganis;
FIG. 6 shows a comparison of the GL coding sequences and non-translated regions from Morchella costata and Morchella vulgaris;
FIG. 7 shows the amino acid sequence represented as SEQ. I.D. No. 1 showing positions of the peptide fragments that were sequenced; and
FIG. 8 shows the amino acid sequence represented as SEQ. I.D. No. 2 showing positions of the peptide fragments that were sequenced.





In more detail, in FIG. 4, the total number of bases is 4726--and the DNA sequence composition is: 1336 A; 1070 C; 1051 G; 1269 T. The ATG start codon is shown in bold. The introns are underlined. The stop codon is shown in italics.
In FIG. 5, the total number of bases is 4670--and the DNA sequence composition is: 1253 A; 1072 C; 1080 G; 1265 T. The ATG start codon is shown in bold. The introns are underlined. The stop codon is shown in italics.
In FIG. 6, the two aligned sequences are those obtained from MC (total number of residues: 1066) and MV (total number of residues: 1070). The comparison matrix used was a structure-genetic matrix (Open gap cost: 10; Unit gap cost: 2). In this, Figure, the character to show that two aligned residues are identical is `:`. The character to show that two aligned residues are similar is `.`. The amino acids said to be `similar` are: A,S,T; D,E; N,Q; R,K; I,L,M,V; F,Y,W. Overall there is: Identity: 920 (86.30%); Similarity: 51 (4.78%). The number of gaps inserted in MC is 1 and the number of gaps inserted in MV is 1.
In the attached sequence listings: SEQ. I.D.No. 1 is the amino-acid sequence for GL obtained from Morchella costata; SEQ. I.D.No. 2 is the amino-acid sequence for GL obtained from Morchella vulgaris; SEQ. I.D.No. 3 is the nucleotide coding sequence for GL obtained from Morchella costata; and SEQ. I.D.No. 4 is the nucleotide coding sequence for GL obtained from Morchella vulganis.
In SEQ. I.D. No. 1 the total number of residues is 1066. The GL enzyme has an amino acid composition of:
______________________________________46 Ala 13 Cys 25 His 18 Met 73 Thr50 Arg 37 Gln 54 Ile 43 Phe 23 Trp56 Asn 55 Glu 70 Leu 56 Pro 71 Tyr75 Asp 89 Gly 71 Lys 63 Ser 78 Val______________________________________
In SEQ.I.D. No. 2 the total number of residues is 1070. The GL enzyme has an amino acid composition of:
______________________________________51 Ala 13 Cys 22 His 17 Met 71 Thr50 Arg 40 Gln 57 Ile 45 Phe 24 Trp62 Asn 58 Glu 74 Leu 62 Pro 69 Tyr74 Asp 87 Gly 61 Lys 55 Ser 78 Val______________________________________
1. ENZYME PURIFICATION AND CHARACTERIZATION OF THE .alpha.-1,4-GLUCAN LYASE FROM THE FUNGUS MORCHELLA COSTATA
1.1 Materials and Methods
The fungus Morchella costata was obtained from American Type Culture Collection (ATCC). The fungus was grown at 25.degree. C. on a shaker using the culture medium recommended by ATCC. The mycelia were harvested by filtration and washed with 0.9% NaCl.
The fungal cells were broken by homogenization followed by sonication on ice for 6.times.3 min in 50 mM citrate-NaOH pH 6.2 (Buffer A). Cell debris were removed by centrifugation at 25,000.times.g for 40 min. The supernatant obtained at this procedure was regarded as cell-free extract and was used for activity staining and Western blotting after separation on 8-25% gradient gels.
1.2 Separation by .beta.-cyclodextrin Sepharose gel
The cell-free extract was applied directly to a .beta.-cyclodextrin Sepharose gel 4B clolumn (2.6.times.18 cm) pre equilibrated with Buffer A. The column was washed with 3 volumes of Buffer A and 2 volumes of Buffer A containing 1M NaCl. .alpha.-1,4-glucan lyase was eluted with 2% dextrins in Buffer A. Active fractions were pooled and the buffer changed to 20 mM Bis-tris propane-HCl (pH 7.0, Buffer B).
Active fractions were applied onto a Mono Q HR 5/5 column pre-equilibrated with Buffer B. The fungal lyase was eluted with Buffer B in a linear gradient of 0.3M NaCl.
The lyase preparation obtained after .beta.-cyclodextrin Sepharose chromatography was alternatively concentrated to 150 .mu.l and applied on a Superose 12 column operated under FPLC conditions.
1.3 Assay for .alpha.-1,4-glucan lyase activity and conditions for determination of substrate specificity, pH and temperature optimum
The reaction mixture for the assay of the .alpha.-1,4-glucan lyase activity contained 10 mg ml.sup.-1 amylopectin and 25 mM Mes-NaOH (pH 6.0).
The reaction was carried out at 30.degree. C. for 30 min and stopped by the addition of 3,5-dinitrosalicylic acid reagent. Optical density at 550 nm was measured after standing at room temperature for 10 min. 10 mM EDTA was added to the assay mixture when cell-free extracts were used.
The substrate amylopectin in the assay mixture may be replaced with other substrates and the reaction temperature may vary as specified in the text.
In the pH optimum investigations, the reaction mixture contained amylopection or maltotetraose 10 mg ml.sup.-1 in a 40 mM buffer. The buffers used were glycine-NaOH (pH 2.0-3.5), HoAc-NaoAc (pH 3.5-5.5), Mes-NaOH (pH 5.5-6.7), Mops-NaOH (6.0-8.0) and bicine-NaOH (7.6-9.0). The reactions were carried out at 30.degree. C. for 30 min. The reaction conditions in the temperature optimum investigations was the same as above except that the buffer Mops-NaOH (pH 6.0) was used in all experiments. The reaction temperature was varied as indicated in the text.
SDS-PAGE, Native-PAGE and isoelectrofocusing were performed on PhastSystem (Pharmacia, Sweden) using 8-25% gradient gels and gels with a pH gradient of 3-9, respectively. Following electrophoresis, the gels were stained by silver staining according to the procedures recommended by the manufacturer (Pharmacia). The glycoproteins were stained by PAS adapted to the PhastSystem. For activity staining, the electrophoresis was performed under native conditions at 6.degree. C.
Following the electrophoresis, the gel was incubated in the presence of 1% soluble starch at 30.degree. C. overnight. Activity band of the fungal lyase was revealed by staining with I.sub.2 /KI solution.
1.4 Results
1.4.1 Purification, molecular mass and isoelectric point of the .alpha.-1,4-glucan lyase
The fungal lyase was found to adsorb on columns packed with .beta.-cyclodextrin Sepharose, starches and Red Sepharose. Columns packed with .beta.-cyclodextrin Sepharose 4B gel and starches were used for purification purposes.
The lyase preparation obtained by this step contained only minor contaminating proteins having a molecular mass higher than the fungal lyase. The impurity was either removed by ion exchange chromatography on Mono Q HR 5/5 or more efficiently by gel filtration on Superose 12.
The purified enzyme appeared colourless and showed no absorbance in the visible light region. The molecular mass was determined to 110 kDa as estimated on SDS-PAGE.
The purified fungal lyase showed a isoelectric point of pI 5.4 determined by isoelectric focusing on gels with a pH gradient of 3 to 9. In the native electrophoresis gels, the enzyme appeared as one single band. This band showed starch-degrading activity as detected by activity staining. Depending the age of the culture from which the enzyme is extracted, the enzyme on the native and isoelectric focusing gels showed either as a sharp band or a more diffused band with the same migration rate and pI.
1.4.2 The pH and temperature optimum of the fungal lyase catalayzed reaction
The pH optimum pH range for the fungal lyase catalyzed reaction was found to be between pH 5 and pH 7.
1.4.3 Substrate specificity
The purified fungal lyase degraded maltosaccharides from maltose to maltoheptaose. However, the degradation rates varied. The highest activity achieved was with maltotetraose (activity as 100%), followed by maltohexaose (97%), maltoheptaose (76%), maltotriose (56%) and the lowest activity was observed with maltose (2%).
Amylopectin, amylose and glycogen were also degraded by the fungal lyase (% will be determined). The fungal lyase was an exo-lyase, not a endolyase as it degraded p-nitrophenyl .alpha.-D-maltoheptaose but failed to degrade reducing end blocked p-nitrophenyl (.alpha.-D-maltoheptaose.
1.5 Morchella Vulgaris
The protocols for the enzyme purification and charaterisation of alpha 1,4-glucal lyase obtained from Morchella Vulgaris were the same as those above for Morchella Costata (with similar results--see results mentioned above).
2. AMINO ACID SEQUENCING OF THE .alpha.-1,4-GLUCAN LYASE FROM FUNGUS
2.1 Amino acid sequencing of the lyases
The lyases were digested with either endoproteinase Arg-C from Clostridium histolyticum or endoproteinase Lys-C from Lysobacter enzymogenes, both sequencing grade purchased from Boehringer Mannheim, Germany. For digestion with endoproteinase Arg-C, freezedried lyase (0.1 mg) was dissolved in 50 .mu.l 10M urea, 50 mM methylamine, 0.1M Tris-HCl, pH 7.6. After overlay with N.sub.2 and addition of 10 .mu.l of 50 mM DTT and 5 mM EDTA the protein was denatured and reduced for 10 min at 50.degree. C. under N.sub.2. Subsequently, 1 .mu.g of endoproteinase Arg-C in 10 .mu.l of 50 mM Tris-HCl, pH 8.0 was added, N.sub.2 was overlayed and the digestion was carried out for 6 h at 37.degree. C.
For subsequent cysteine derivatization, 12.5 .mu.l 100 mM iodoacetamide was added and the solution was incubated for 15 min at RT in the dark under N.sub.2.
For digestion with endoproteinase Lys-C, freeze dried lyase (0.1 mg) was dissolved in 50 .mu.l of 8M urea, 0.4M NH.sub.4 HCO.sub.3, pH 8.4. After overlay with N.sub.2 and addition of 5 .mu.l of 45 mM DTT, the protein was denatured and reduced for 15 min at 50.degree. C. under N.sub.2. After cooling to RT, 5 .mu.l of 100 mM iodoacetamide was added for the cysteines to be derivatized for 15 min at RT in the dark under N.sub.2. Subsequently, 90 .mu.l of water and 5 .mu.g of endoproteinase Lys-C in 50 .mu.l of 50 mM tricine and 10 mM EDTA, pH 8.0, was added and the digestion was carried out for 24 h at 37.degree. C. under N.sub.2.
The resulting peptides were separated by reversed phase HPLC on a VYDAC C18 column (0.46.times.15 cm; 10 .mu.m; The Separations Group; California) using solvent A: 0.1% TFA in water and solvent B: 0.1% TFA in acetonifile. Selected peptides were rechromatographed on a Develosil C18 column (0.46.times.10 cm; 3 .mu.m; Dr. Ole Schou, Novo Nordisk, Denmark) using the same solvent system prior to sequencing on an
Applied Biosystems 476A sequencer using pulsed-liquid fast cycles.
The amino acid sequence information from the enzyme derived from the fungus Morchella costata is shown FIG. 7.
The amino acid sequence information from the enzyme derived from the fungus Morchella vulgaris is shown FIG. 8.
3. DNA SEQUENCING OF GENES CODING FOR THE .alpha.-1,4-GLUCAN LYASE FROM FUNGUS
3.1 METHODS FOR MOLECULAR BIOLOGY
DNA was isolated as described by Dellaporte et al (1983--Plant Mol Biol Rep vol 1 pp 19-21).
3.2 PCR
The preparation of the relevant DNA molecule was done by use of the Gene Amp DNA Amplification Kit (Perkin Elmer Cetus, USA) and in accordance with the manufactures instructions except that the Taq polymerase was added later (see PCR cycles) and the temperature cycling was changed to the following:
______________________________________PCR cycles:no of cycles C time (min.)______________________________________1 98 5 60 5addition of Taq polymerase and oil35 94 1 47 2 72 31 72 20______________________________________
3.3 CLONING OF PCR FRAGMENTS
PCR fragments were cloned into pT7Blue (from Novagen) following the instructions of the supplier.
3.4 DNA SEQUENCING
Double stranded DNA was sequenced essentially according to the dideoxy method of Sanger et al. (1979) using the Auto Read Sequencing Kit (Pharmacia) and the Pharmacia LKB A.L.F.DNA sequencer. (Ref: Sanger, F., Nicklen, S. and Coulson, A. R. (1979). DNA sequencing with chain-determinating inhibitors. Proc. Natl. Acad. Sci. USA 74: 5463-5467.)
3.5 SCREENING OF THE LIBRARIES
Screening of the Lambda Zap libraries obtained from Stratagene, was performed in accordance with the manufacturer's instructions except that the prehybridization and hybridization was performed in 2.times.SSC, 0.1% SDS, 10.times.Denhardt's and 100 .mu.g/ml denatured salmon sperm DNA.
To the hybridization solution a 32P-labeled denatured probe was added. Hybridization was performed over night at 55.degree. C. The filters were washed twice in 2.times.SSC, 0.1% SDS and twice in 1.times.SSC, 0.1% SDS.
3.6 PROBE
The cloned PCR fragments were isolated from the pT7blue vector by digestion with appropriate restriction enzymes. The fragments were seperated from the vector by agarose gel electrophoresis and the fragments were purified from the agarose by Agarase (Boehringer Mannheim). As the fragments were only 90-240 bp long the isolated fragments were exposed to a ligation reaction before labelling with 32P-dCTP using either Prime-It random primer kit (Stratagene) or Ready to Go DNA labelling kit (Pharmacia).
3.7 RESULTS
3.7.1 Generation of PCR DNA fragments coding for .alpha.-1,4-glucan lyase.
The amino acid sequences (shown below) of three overlapping tryptic peptides from .alpha.-1,4-glucan lyase were used to generate mixed oligonucleotides, which could be used as PCR primers for amplification of DNA isolated from both MC and MV.
Lys Asn Leu His Pro Gln His Lys Met Leu Lys Asp Thr Val Leu Asp Ile Val Lys Pro Gly His Gly Glu Tyr Val Gly Trp Gly Glu Met Gly Gly Ile Gln Phe Met Lys Glu Pro Thr Phe Met Asn Tyr Phe Asn Phe Asp Asn Met Gln Tyr Gln Gln Val Tyr Ala Gln Gly Ala Leu Asp Ser Arg Glu Pro Leu Tyr His Ser Asp Pro Phe Tyr SEQ ID NO: 5.
In the first PCR amplification primers A1/A2 (see below) were used as upstream primers and primers B1/B2 (see below) were used as downstream primer.
Primer A1: CA(GA)CA(CT)AA(GA)ATGCT(GATC)AA(GA)GA(CT)AC SEQ ID NO: 6
Primer A2: CA(GA)CA(CT)AA(GA)ATGTT(GA)AA(GA)GA(CT)AC SEQ ID NO: 7
Primer B1: TA(GA)AA(GATC)GG(GA)TC(GA)CT(GA)TG(GA)TA SEQ ID NO: 8
Primer B2: TA(GA)AA(GATC)GG(GA)TC(GATC)GA(GA)TG(GA)TA SEQ ID NO: 9
The PCR products were analysed on a 2% LMT agarose gel and fragments of the expected sizes were cut out from the gel and treated with Agarase (Boehringer Manheim) and cloned into the pT7blue Vector (Novagen) and sequenced.
The cloned fragments from the PCR amplification coded for amino acids corresponding to the sequenced peptides (see above) and in each case in addition to two intron sequences. For MC the PCR amplified DNA sequence corresponds to the sequence shown as from position 1202 to position 1522 with reference to FIG. 4. For MV the PCR amplified DNA sequence corresponds to the sequence shown as from position 1218 to position 1535 with reference to FIG. 5.
3.7.2 Screening of the genomic libraries with the cloned PCR fragments.
Screening of the libraries with the above-mentioned clone gave two clones for each source. For MC the two clones were combined to form the sequence shown in FIG. 4 (see below). For MV the two clones could be combined to form the sequence shown in FIG. 5 in the manner described above.
An additional PCR was performed to supplement the MC clone with Pstl, PvuII, AscI and NcoI restriction sites immediately in front of the ATG start codon using the following oligonucleotide as an upstream primer:
AAACTGCAGCTGGCGCGCCATGGCAGGATTTCTGAT SEQ ID NO: 10
and a primer containing the complement sequence of bp 1297-1318 in FIG. 4 was used as a downstream primer.
The complete sequence for MC was generated by cloning the 5' end of the gene as a BglII-EcoRI fragment from one of the genomic clone (first clone) into the BamHI-EcoRI sites of pBluescript II KS+vector from Stratagene. The 3' end of the gene was then cloned into the modified pBluescript II KS+vector by ligating an NspV (blunt ended, using the DNA blunting kit from Amersham International)-EcoRI fragment from the other genomic clone (second clone) after the modified pBluescript II KS+vector had been digested with EcoRI and EcoRV. Then the intermediate part of the gene was cloned in to the further modified pBluescript II KS+vector as an EcoRI fragment from the first clone by ligating that fragment into the further modified pBluescript II KS+vector digested with EcoRI.
4. EXPRESSION OF THE GL GENE IN MICRO-ORGANISMS
The DNA sequence encoding the GL can be introduced into microorganisms to produce the enzyme with high specific activity and in large quantities.
In this regard, the MC gene (FIG. 4) was cloned as a XbaI-XhoI blunt ended (using the DNA blunting kit from Amersham International) fragment into the Pichia expression vector pHIL-D2 (containing the AOX1 promoter) digested with EcoRI and blunt ended (using the DNA blunting kit from Amersham International) for expression in Pichia pastoris (according to the protocol stated in the Pichia Expression Kit supplied by Invitrogen).
In another embodiment, the MC gene 1 (same as FIG. 4 except that it was modified by PCR to introduce restriction sites as described above) was cloned as a PvuII-XhoI blunt ended fragment (using the DNA blunting kit from Amersham International) into the Aspergillus expression vector pBARMTE1 (containing the methyl tryptophan resistance promoter from Neuropera crassa) digested with SmaI for expression in Aspergillus niger (Pall et al (1993) Fungal Genet Newslett. vol 40 pages 59-62). The protoplasts were prepared according to Daboussi et al (Curr Genet (1989) vol 15 pp (453-456) using lysing enzymes Sigma L-2773 and the lyticase Sigma L-8012. The transformation of the protoplasts was followed according to the protocol stated by Buxton et al (Gene (1985) vol 37 pp 207-214) except that for plating the transformed protoplasts the protocol laid out in Punt et al (Methods in Enzymology (1992) vol 216 pp 447-457) was followed but with the use of 0.6% osmotic stabilised top agarose.
The results showed that lyase activity was observed in the transformed Pichia pastoris and Aspergillus niger. These experiments are now described.
ANALYSES OF PICHIA LYASE TRANSFORMANTS AND ASPERGILLUS LYASE TRANSFORMANTS
GENERAL METHODS
Preparation of cell-free extracts.
The cells were harvested by centrifugation at 9000 rpm for 5 min and washed with 0.9% NaCl and resuspended in the breaking buffer (50 mM K-phosphate, pH 7.5 containing 1 mM of EDTA, and 5% glycerol). Cells were broken using glass beads and vortex treatment. The breaking buffer contained 1 mM PMSF (protease inhibitor). The lyase extract (supernatant) was obtained after centrifugation at 9000 rpm for 5 min followed by centrifugation at 20,000.times.g for 5 min.
Assay of lyase activity by alkaline 3,5-dinitrosalicylic acid reagent (DNS)
One volume of lyase extract was mixed with an equal volume of 4% amylopectin solution. The reaction mixture was then incubated at a controlled temperature and samples vere removed at specified intervals and analyzed for AF.
The lyase activity was also analyzed using a radioactive method.
The reaction mixture contained 10 .mu.l .sup.14 C-starch solution (1 .mu.Ci; Sigma Chemicals Co.) and 10 .mu.l of the lyase extract. The reaction mixture was left at 25.degree. C. overnight and was then analyzed in the usual TLC system. The radioactive AF produced was detected using an Instant Imager (Pachard Instrument Co., Inc., Meriden, Conn.).
Electrophoresis and Western blotting
SDS-PAGE was performed using 8-25% gradient gels and the PhastSystem (Pharmacia). Western blottings was also run on a Semidry transfer unit of the PhastSystem. Primary antibodies raised against the lyase purified from the red seaweed collected at Qingdao (China) were used in a dilution of 1:100. Pig antirabbit IgG conjugated to alkaline phosphatase (Dako A/S, Glostrup, Denmark) were used as secondary antibodies and used in a dilution of 1:1000.
Part I, Analysis of the Pichia transformantscontaining the above mentioned construct
______________________________________MC-Lyase expressed intracellularly in Pichia pastorisNames of culture Specific activity*______________________________________A18 10A20 32A21 8A22 8A24 6______________________________________ *The specific activity was defined as nmol of AF produced per min per mg protein at 25.degree. C.
Part II, The Aspergilus transformants
Results
I. Lyase activity was determined after 5 days incubation(minimal medium containing 0.2% casein enzymatic hydrolysate analysis by the alkaline 3,5-dinitrosalicylic acid reagent
______________________________________Lyase activity analysis in cell-free extractsName of the culture Specific activity*______________________________________8.13 118.16 5388.19 37______________________________________ *The specific activity was defined as nmol of AF produced per min per mg protein at 25.degree. C.
The results show that the MC-lyase was expressed intracellular in A. niger.
Instead of Aspergillus niger as host, other industrial important nicroorganisms for which good expression systems are known could be used such as: Aspergillus oryzae, Aspergillus sp., Trichoderma sp., Saccharomyces cerevisiae, Kluyveromyces sp., Hansenula sp., Pichia sp., Bacillus subtilis, B. amyloliquefaciens, Bacillus sp., Streptomyces sp. or E. coli.
Other preferred embodiments of the present invention include any one of the following: A transformed host organism having the capability of producing AF as a consequence of the introduction of a DNA sequence as herein described; such a transformed host organism which is a microorganism--preferably wherein the host organism is selected from the group consisting of bacteria, moulds, fungi and yeast; preferably the host organism is selected from the group consisting of Saccharomyces, Kluyveromyces, Aspergillus, Trichoderma Hansenula, Pichia, Bacillus Streptomyces, Eschericia such as Aspergillus oryzae, Saccharomyces cerevisiae, Bacillus sublilis, Bacillus amyloliquefascien, Eschericia coli.; A method for preparing the sugar 1,5-D-anhydrofructose comprising contacting an alpha 1,4-glucan (e.g. starch) with the enzyme .alpha.-1,4-glucan lyase expressed by a transformed host organism comprising a nucleotide sequence encoding the same, preferably wherein the nucleotide sequence is a DNA sequence, preferably wherein the DNA sequence is one of the sequences hereinbefore described; A vector incorporating a nucleotide sequence as hereinbefore described, preferably wherein the vector is a replication vector, preferably wherein the vector is an expression vector containing the nucleotide sequence downstream from a promoter sequence, preferably the vector contains a marker (such as a resistance marker); Cellular organisms, or cell line, transformed with such a vector; A method of producing the product .alpha.-1,4-glucan lyase or any nucleotide sequence or part thereof coding for same, which comprises culturing such an organism (or cells from a cell line) transfected with such a vector and recovering the product.
Other modifications of the present invention will be apparent to those skilled in the art without departing from the scope of the invention.
__________________________________________________________________________# SEQUENCE LISTING- (1) GENERAL INFORMATION:- (iii) NUMBER OF SEQUENCES: 12- (2) INFORMATION FOR SEQ ID NO: 1:- (i) SEQUENCE CHARACTERISTICS:#acids (A) LENGTH: 1066 amino (B) TYPE: amino acid (D) TOPOLOGY: linear- (ii) MOLECULE TYPE: protein#1: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:- Met Ala Gly Phe Ser Asp Pro Leu - # Asn Phe Cys Lys Ala Glu AspTyr# 15- Tyr Ser Val Ala Leu Asp Trp Lys - # Gly Pro Gln Lys Ile Ile GlyVal# 30- Asp Thr Thr Pro Pro Lys Ser Thr - # Lys Phe Pro Lys Asn Trp HisGly# 45- Val Asn Leu Arg Phe Asp Asp Gly - # Thr Leu Gly Val Val Gln PheIle# 60- Arg Pro Cys Val Trp Arg Val Arg - # Tyr Asp Pro Gly Phe Lys ThrSer# 80- Asp Glu Tyr Gly Asp Glu Asn Thr - # Arg Thr Ile Val Gln Asp TyrMet# 95- Ser Thr Leu Ser Asn Lys Leu Asp - # Thr Tyr Arg Gly Leu Thr TrpGlu# 110- Thr Lys Cys Glu Asp Ser Gly Asp - # Phe Phe Thr Phe Ser Ser LysVal# 125- Thr Ala Val Glu Lys Ser Glu Arg - # Thr Arg Asn Lys Val Gly AspGly# 140- Leu Arg Ile His Leu Trp Lys Ser - # Pro Phe Arg Ile Gln Val ValArg# 160- Thr Leu Thr Pro Leu Lys Asp Pro - # Tyr Pro Ile Pro Asn Val AlaAla# 175- Ala Glu Ala Arg Val Ser Asp Lys - # Val Val Trp Gln Thr Ser ProLys# 190- Thr Phe Arg Lys Asn Leu His Pro - # Gln His Lys Met Leu Lys AspThr# 205- Val Leu Asp Ile Val Lys Pro Gly - # His Gly Glu Tyr Val Gly TrpGly# 220- Glu Met Gly Gly Ile Gln Phe Met - # Lys Glu Pro Thr Phe Met AsnTyr# 240- Phe Asn Phe Asp Asn Met Gln Tyr - # Gln Gln Val Tyr Ala Gln GlyAla# 255- Leu Asp Ser Arg Glu Pro Leu Tyr - # His Ser Asp Pro Phe Tyr LeuAsp# 270- Val Asn Ser Asn Pro Glu His Lys - # Asn Ile Thr Ala Thr Phe IleAsp# 285- Asn Tyr Ser Gln Ile Ala Ile Asp - # Phe Gly Lys Thr Asn Ser GlyTyr# 300- Ile Lys Leu Gly Thr Arg Tyr Gly - # Gly Ile Asp Cys Tyr Gly IleSer# 320- Ala Asp Thr Val Pro Glu Ile Val - # Arg Leu Tyr Thr Gly Leu ValGly# 335- Arg Ser Lys Leu Lys Pro Arg Tyr - # Ile Leu Gly Ala His Gln AlaCys# 350- Tyr Gly Tyr Gln Gln Glu Ser Asp - # Leu Tyr Ser Val Val Gln GlnTyr# 365- Arg Asp Cys Lys Phe Pro Leu Asp - # Gly Ile His Val Asp Val AspVal# 380- Gln Asp Gly Phe Arg Thr Phe Thr - # Thr Asn Pro His Thr Phe ProAsn# 400- Pro Lys Glu Met Phe Thr Asn Leu - # Arg Asn Asn Gly Ile Lys CysSer# 415- Thr Asn Ile Thr Pro Val Ile Ser - # Ile Asn Asn Arg Glu Gly GlyTyr# 430- Ser Thr Leu Leu Glu Gly Val Asp - # Lys Lys Tyr Phe Ile Met AspAsp# 445- Arg Tyr Thr Glu Gly Thr Ser Gly - # Asn Ala Lys Asp Val Arg TyrMet# 460- Tyr Tyr Gly Gly Gly Asn Lys Val - # Glu Val Asp Pro Asn Asp ValAsn# 480- Gly Arg Pro Asp Phe Lys Asp Asn - # Tyr Asp Phe Pro Ala Asn PheAsn# 495- Ser Lys Gln Tyr Pro Tyr His Gly - # Gly Val Ser Tyr Gly Tyr GlyAsn# 510- Gly Ser Ala Gly Phe Tyr Pro Asp - # Leu Asn Arg Lys Glu Val ArgIle# 525- Trp Trp Gly Met Gln Tyr Lys Tyr - # Leu Phe Asp Met Gly Leu GluPhe# 540- Val Trp Gln Asp Met Thr Thr Pro - # Ala Ile His Thr Ser Tyr GlyAsp# 560- Met Lys Gly Leu Pro Thr Arg Leu - # Leu Val Thr Ser Asp Ser ValThr# 575- Asn Ala Ser Glu Lys Lys Leu Ala - # Ile Glu Thr Trp Ala Leu TyrSer# 590- Tyr Asn Leu His Lys Ala Thr Trp - # His Gly Leu Ser Arg Leu GluSer# 605- Arg Lys Asn Lys Arg Asn Phe Ile - # Leu Gly Arg Gly Ser Tyr AlaGly# 620- Ala Tyr Arg Phe Ala Gly Leu Trp - # Thr Gly Asp Asn Ala Ser AsnTrp# 640- Glu Phe Trp Lys Ile Ser Val Ser - # Gln Val Leu Ser Leu Gly LeuAsn# 655- Gly Val Cys Ile Ala Gly Ser Asp - # Thr Gly Gly Phe Glu Pro TyrArg# 670- Asp Ala Asn Gly Val Glu Glu Lys - # Tyr Cys Ser Pro Glu Leu LeuIle# 685- Arg Trp Tyr Thr Gly Ser Phe Leu - # Leu Pro Trp Leu Arg Asn HisTyr# 700- Val Lys Lys Asp Arg Lys Trp Phe - # Gln Glu Pro Tyr Ser Tyr ProLys# 720- His Leu Glu Thr His Pro Glu Leu - # Ala Asp Gln Ala Trp Leu TyrLys# 735- Ser Val Leu Glu Ile Cys Arg Tyr - # Tyr Val Glu Leu Arg Tyr SerLeu# 750- Ile Gln Leu Leu Tyr Asp Cys Met - # Phe Gln Asn Val Val Asp GlyMet# 765- Pro Ile Thr Arg Ser Met Leu Leu - # Thr Asp Thr Glu Asp Thr ThrPhe# 780- Phe Asn Glu Ser Gln Lys Phe Leu - # Asp Asn Gln Tyr Met Ala GlyAsp# 800- Asp Ile Leu Val Ala Pro Ile Leu - # His Ser Arg Lys Glu Ile ProGly# 815- Glu Asn Arg Asp Val Tyr Leu Pro - # Leu Tyr His Thr Trp Tyr ProSer# 830- Asn Leu Arg Pro Trp Asp Asp Gln - # Gly Val Ala Leu Gly Asn ProVal# 845- Glu Gly Gly Ser Val Ile Asn Tyr - # Thr Ala Arg Ile Val Ala ProGlu# 860- Asp Tyr Asn Leu Phe His Ser Val - # Val Pro Val Tyr Val Arg GluGly# 880- Ala Ile Ile Pro Gln Ile Glu Val - # Arg Gln Trp Thr Gly Gln GlyGly# 895- Ala Asn Arg Ile Lys Phe Asn Ile - # Tyr Pro Gly Lys Asp Lys GluTyr# 910- Cys Thr Tyr Leu Asp Asp Gly Val - # Ser Arg Asp Ser Ala Pro GluAsp# 925- Leu Pro Gln Tyr Lys Glu Thr His - # Glu Gln Ser Lys Val Glu GlyAla# 940- Glu Ile Ala Lys Gln Ile Gly Lys - # Lys Thr Gly Tyr Asn Ile SerGly# 960- Thr Asp Pro Glu Ala Lys Gly Tyr - # His Arg Lys Val Ala Val ThrGln# 975- Thr Ser Lys Asp Lys Thr Arg Thr - # Val Thr Ile Glu Pro Lys HisAsn# 990- Gly Tyr Asp Pro Ser Lys Glu Val - # Gly Asp Tyr Tyr Thr Ile IleLeu# 10050- Trp Tyr Ala Pro Gly Phe Asp Gly - # Ser Ile Val Asp Val Ser LysThr# 10205- Thr Val Asn Val Glu Gly Gly Val - # Glu His Gln Val Tyr Lys AsnSer# 10405- Asp Leu His Thr Val Val Ile Asp - # Val Lys Glu Val Ile Gly ThrThr# 10550- Lys Ser Val Lys Ile Thr Cys Thr - # Ala Ala 1060 - # 1065- (2) INFORMATION FOR SEQ ID NO: 2:- (i) SEQUENCE CHARACTERISTICS:#acids (A) LENGTH: 1070 amino (B) TYPE: amino acid (D) TOPOLOGY: linear- (ii) MOLECULE TYPE: protein#2: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:- Met Ala Gly Leu Ser Asp Pro Leu - # Asn Phe Cys Lys Ala Glu AspTyr# 15- Tyr Ala Ala Ala Lys Gly Trp Ser - # Gly Pro Gln Lys Ile Ile ArgTyr# 30- Asp Gln Thr Pro Pro Gln Gly Thr - # Lys Asp Pro Lys Ser Trp HisAla# 45- Val Asn Leu Pro Phe Asp Asp Gly - # Thr Met Cys Val Val Gln PheVal# 60- Arg Pro Cys Val Trp Arg Val Arg - # Tyr Asp Pro Ser Val Lys ThrSer# 80- Asp Glu Tyr Gly Asp Glu Asn Thr - # Arg Thr Ile Val Gln Asp TyrMet# 95- Thr Thr Leu Val Gly Asn Leu Asp - # Ile Phe Arg Gly Leu Thr TrpVal# 110- Ser Thr Leu Glu Asp Ser Gly Glu - # Tyr Tyr Thr Phe Lys Ser GluVal# 125- Thr Ala Val Asp Glu Thr Glu Arg - # Thr Arg Asn Lys Val Gly AspGly# 140- Leu Lys Ile Tyr Leu Trp Lys Asn - # Pro Phe Arg Ile Gln Val ValArg# 160- Leu Leu Thr Pro Leu Val Asp Pro - # Phe Pro Ile Pro Asn Val AlaAsn# 175- Ala Thr Ala Arg Val Ala Asp Lys - # Val Val Trp Gln Thr Ser ProLys# 190- Thr Phe Arg Lys Asn Leu His Pro - # Gln His Lys Met Leu Lys AspThr# 205- Val Leu Asp Ile Ile Lys Pro Gly - # His Gly Glu Tyr Val Gly TrpGly# 220- Glu Met Gly Gly Ile Glu Phe Met - # Lys Glu Pro Thr Phe Met AsnTyr# 240- Phe Asn Phe Asp Asn Met Gln Tyr - # Gln Gln Val Tyr Ala Gln GlyAla# 255- Leu Asp Ser Arg Glu Pro Leu Tyr - # His Ser Asp Pro Phe Tyr LeuAsp# 270- Val Asn Ser Asn Pro Glu His Lys - # Asn Ile Thr Ala Thr Phe IleAsp# 285- Asn Tyr Ser Gln Ile Ala Ile Asp - # Phe Gly Lys Thr Asn Ser GlyTyr# 300- Ile Lys Leu Gly Thr Arg Tyr Gly - # Gly Ile Asp Cys Tyr Gly IleSer# 320- Ala Asp Thr Val Pro Glu Ile Val - # Arg Leu Tyr Thr Gly Leu ValGly# 335- Arg Ser Lys Leu Lys Pro Arg Tyr - # Ile Leu Gly Ala His Gln AlaCys# 350- Tyr Gly Tyr Gln Gln Glu Ser Asp - # Leu His Ala Val Val Gln GlnTyr# 365- Arg Asp Thr Lys Phe Pro Leu Asp - # Gly Leu His Val Asp Val AspPhe# 380- Gln Asp Asn Phe Arg Thr Phe Thr - # Thr Asn Pro Ile Thr Phe ProAsn# 400- Pro Lys Glu Met Phe Thr Asn Leu - # Arg Asn Asn Gly Ile Lys CysSer# 415- Thr Asn Ile Thr Pro Val Ile Ser - # Ile Arg Asp Arg Pro Asn GlyTyr# 430- Ser Thr Leu Asn Glu Gly Tyr Asp - # Lys Lys Tyr Phe Ile Met AspAsp# 445- Arg Tyr Thr Glu Gly Thr Ser Gly - # Asp Pro Gln Asn Val Arg TyrSer# 460- Phe Tyr Gly Gly Gly Asn Pro Val - # Glu Val Asn Pro Asn Asp ValTrp# 480- Ala Arg Pro Asp Phe Gly Asp Asn - # Tyr Asp Phe Pro Thr Asn PheAsn# 495- Cys Lys Asp Tyr Pro Tyr His Gly - # Gly Val Ser Tyr Gly Tyr GlyAsn# 510- Gly Thr Pro Gly Tyr Tyr Pro Asp - # Leu Asn Arg Glu Glu Val ArgIle# 525- Trp Trp Gly Leu Gln Tyr Glu Tyr - # Leu Phe Asn Met Gly Leu GluPhe# 540- Val Trp Gln Asp Met Thr Thr Pro - # Ala Ile His Ser Ser Tyr GlyAsp# 560- Met Lys Gly Leu Pro Thr Arg Leu - # Leu Val Thr Ala Asp Ser ValThr# 575- Asn Ala Ser Glu Lys Lys Leu Ala - # Ile Glu Ser Trp Ala Leu TyrSer# 590- Tyr Asn Leu His Lys Ala Thr Phe - # His Gly Leu Gly Arg Leu GluSer# 605- Arg Lys Asn Lys Arg Asn Phe Ile - # Leu Gly Arg Gly Ser Tyr AlaGly# 620- Ala Tyr Arg Phe Ala Gly Leu Trp - # Thr Gly Asp Asn Ala Ser ThrTrp# 640- Glu Phe Trp Lys Ile Ser Val Ser - # Gln Val Leu Ser Leu Gly LeuAsn# 655- Gly Val Cys Ile Ala Gly Ser Asp - # Thr Gly Gly Phe Glu Pro AlaArg# 670- Thr Glu Ile Gly Glu Glu Lys Tyr - # Cys Ser Pro Glu Leu Leu IleArg# 685- Trp Tyr Thr Gly Ser Phe Leu Leu - # Pro Trp Leu Arg Asn His TyrVal# 700- Lys Lys Asp Arg Lys Trp Phe Gln - # Glu Pro Tyr Ala Tyr Pro LysHis# 720- Leu Glu Thr His Pro Glu Leu Ala - # Asp Gln Ala Trp Leu Tyr LysSer# 735- Val Leu Glu Ile Cys Arg Tyr Trp - # Val Glu Leu Arg Tyr Ser LeuIle# 750- Gln Leu Leu Tyr Asp Cys Met Phe - # Gln Asn Val Val Asp Gly MetPro# 765- Leu Ala Arg Ser Met Leu Leu Thr - # Asp Thr Glu Asp Thr Thr PhePhe# 780- Asn Glu Ser Gln Lys Phe Leu Asp - # Asn Gln Tyr Met Ala Gly AspAsp# 800- Ile Leu Val Ala Pro Ile Leu His - # Ser Arg Asn Glu Val Pro GlyGlu# 815- Asn Arg Asp Val Tyr Leu Pro Leu - # Phe His Thr Trp Tyr Pro SerAsn# 830- Leu Arg Pro Trp Asp Asp Gln Gly - # Val Ala Leu Gly Asn Pro ValGlu# 845- Gly Gly Ser Val Ile Asn Tyr Thr - # Ala Arg Ile Val Ala Pro GluAsp# 860- Tyr Asn Leu Phe His Asn Val Val - # Pro Val Tyr Ile Arg Glu GlyAla# 880- Ile Ile Pro Gln Ile Gln Val Arg - # Gln Trp Ile Gly Glu Gly GlyPro# 895- Asn Pro Ile Lys Phe Asn Ile Tyr - # Pro Gly Lys Asp Lys Glu TyrVal# 910- Thr Tyr Leu Asp Asp Gly Val Ser - # Arg Asp Ser Ala Pro Asp AspLeu# 925- Pro Gln Tyr Arg Glu Ala Tyr Glu - # Gln Ala Lys Val Glu Gly LysAsp# 940- Val Gln Lys Gln Leu Ala Val Ile - # Gln Gly Asn Lys Thr Asn AspPhe# 960- Ser Ala Ser Gly Ile Asp Lys Glu - # Ala Lys Gly Tyr His Arg LysVal# 975- Ser Ile Lys Gln Glu Ser Lys Asp - # Lys Thr Arg Thr Val Thr IleGlu# 990- Pro Lys His Asn Gly Tyr Asp Pro - # Ser Lys Glu Val Gly Asn TyrTyr# 10050- Thr Ile Ile Leu Trp Tyr Ala Pro - # Gly Phe Asp Gly Ser Ile ValAsp# 10205- Val Ser Gln Ala Thr Val Asn Ile - # Glu Gly Gly Val Glu Cys GluIle# 10405- Phe Lys Asn Thr Gly Leu His Thr - # Val Val Val Asn Val Lys GluVal# 10550- Ile Gly Thr Thr Lys Ser Val Lys - # Ile Thr Cys Thr Thr Ala# 10700 - # 1065- (2) INFORMATION FOR SEQ ID NO: 3:- (i) SEQUENCE CHARACTERISTICS:#pairs (A) LENGTH: 3201 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear- (ii) MOLECULE TYPE: DNA (genomic)#3: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:- ATGGCAGGAT TTTCTGATCC TCTCAACTTT TGCAAAGCAG AAGACTACTA CA - #GTGTTGCG 60- CTAGACTGGA AGGGCCCTCA AAAAATCATT GGAGTAGACA CTACTCCTCC AA - #AGAGCACC 120- AAGTTCCCCA AAAACTGGCA TGGAGTGAAC TTGAGATTCG ATGATGGGAC TT - #TAGGTGTG 180- GTTCAGTTCA TTAGGCCGTG CGTTTGGAGG GTTAGATACG ACCCTGGTTT CA - #AGACCTCT 240- GACGAGTATG GTGATGAGAA TACGAGGACA ATTGTGCAAG ATTATATGAG TA - #CTCTGAGT 300- AATAAATTGG ATACTTATAG AGGTCTTACG TGGGAAACCA AGTGTGAGGA TT - #CGGGAGAT 360- TTCTTTACCT TCTCATCCAA GGTCACCGCC GTTGAAAAAT CCGAGCGGAC CC - #GCAACAAG 420- GTCGGCGATG GCCTCAGAAT TCACCTATGG AAAAGCCCTT TCCGCATCCA AG - #TAGTGCGC 480- ACCTTGACCC CTTTGAAGGA TCCTTACCCC ATTCCAAATG TAGCCGCAGC CG - #AAGCCCGT 540- GTGTCCGACA AGGTCGTTTG GCAAACGTCT CCCAAGACAT TCAGAAAGAA CC - #TGCATCCG 600- CAACACAAGA TGCTAAAGGA TACAGTTCTT GACATTGTCA AACCTGGACA TG - #GCGAGTAT 660- GTGGGGTGGG GAGAGATGGG AGGTATCCAG TTTATGAAGG AGCCAACATT CA - #TGAACTAT 720- TTTAACTTCG ACAATATGCA ATACCAGCAA GTCTATGCCC AAGGTGCTCT CG - #ATTCTCGC 780- GAGCCACTGT ACCACTCGGA TCCCTTCTAT CTTGATGTGA ACTCCAACCC GG - #AGCACAAG 840- AATATCACGG CAACCTTTAT CGATAACTAC TCTCAAATTG CCATCGACTT TG - #GAAAGACC 900- AACTCAGGCT ACATCAAGCT GGGAACCAGG TATGGTGGTA TCGATTGTTA CG - #GTATCAGT 960- GCGGATACGG TCCCGGAAAT TGTACGACTT TATACAGGTC TTGTTGGACG TT - #CAAAGTTG1020- AAGCCCAGAT ATATTCTCGG GGCCCATCAA GCCTGTTATG GATACCAACA GG - #AAAGTGAC1080- TTGTATTCTG TGGTCCAGCA GTACCGTGAC TGTAAATTTC CACTTGACGG GA - #TTCACGTC1140- GATGTCGATG TTCAGGACGG CTTCAGAACT TTCACCACCA ACCCACACAC TT - #TCCCTAAC1200- CCCAAAGAGA TGTTTACTAA CTTGAGGAAT AATGGAATCA AGTGCTCCAC CA - #ATATCACT1260- CCTGTTATCA GCATTAACAA CAGAGAGGGT GGATACAGTA CCCTCCTTGA GG - #GAGTTGAC1320- AAAAAATACT TTATCATGGA CGACAGATAT ACCGAGGGAA CAAGTGGGAA TG - #CGAAGGAT1380- GTTCGGTACA TGTACTACGG TGGTGGTAAT AAGGTTGAGG TCGATCCTAA TG - #ATGTTAAT1440- GGTCGGCCAG ACTTTAAAGA CAACTATGAC TTCCCCGCGA ACTTCAACAG CA - #AACAATAC1500- CCCTATCATG GTGGTGTGAG CTACGGTTAT GGGAACGGTA GTGCAGGTTT TT - #ACCCGGAC1560- CTCAACAGAA AGGAGGTTCG TATCTGGTGG GGAATGCAGT ACAAGTATCT CT - #TCGATATG1620- GGACTGGAAT TTGTGTGGCA AGACATGACT ACCCCAGCAA TCCACACATC AT - #ATGGAGAC1680- ATGAAAGGGT TGCCCACCCG TCTACTCGTC ACCTCAGACT CCGTCACCAA TG - #CCTCTGAG1740- AAAAAGCTCG CAATTGAAAC TTGGGCTCTC TACTCCTACA ATCTCCACAA AG - #CAACTTGG1800- CATGGTCTTA GTCGTCTCGA ATCTCGTAAG AACAAACGAA ACTTCATCCT CG - #GGCGTGGA1860- AGTTATGCCG GAGCCTATCG TTTTGCTGGT CTCTGGACTG GGGATAATGC AA - #GTAACTGG1920- GAATTCTGGA AGATATCGGT CTCTCAAGTT CTTTCTCTGG GCCTCAATGG TG - #TGTGCATC1980- GCGGGGTCTG ATACGGGTGG TTTTGAACCC TACCGTGATG CAAATGGGGT CG - #AGGAGAAA2040- TACTGTAGCC CAGAGCTACT CATCAGGTGG TATACTGGTT CATTCCTCTT GC - #CGTGGCTC2100- AGGAACCATT ATGTCAAAAA GGACAGGAAA TGGTTCCAGG AACCATACTC GT - #ACCCCAAG2160- CATCTTGAAA CCCATCCAGA ACTCGCAGAC CAAGCATGGC TCTATAAATC CG - #TTTTGGAG2220- ATCTGTAGGT ACTATGTGGA GCTTAGATAC TCCCTCATCC AACTACTTTA CG - #ACTGCATG2280- TTTCAAAACG TAGTCGACGG TATGCCAATC ACCAGATCTA TGCTCTTGAC CG - #ATACTGAG2340- GATACCACCT TCTTCAACGA GAGCCAAAAG TTCCTCGACA ACCAATATAT GG - #CTGGTGAC2400- GACATTCTTG TTGCACCCAT CCTCCACAGT CGCAAAGAAA TTCCAGGCGA AA - #ACAGAGAT2460- GTCTATCTCC CTCTTTACCA CACCTGGTAC CCCTCAAATT TGAGACCATG GG - #ACGATCAA2520- GGAGTCGCTT TGGGGAATCC TGTCGAAGGT GGTAGTGTCA TCAATTATAC TG - #CTAGGATT2580- GTTGCACCCG AGGATTATAA TCTCTTCCAC AGCGTGGTAC CAGTCTACGT TA - #GAGAGGGT2640- GCCATCATCC CGCAAATCGA AGTACGCCAA TGGACTGGCC AGGGGGGAGC CA - #ACCGCATC2700- AAGTTCAACA TCTACCCTGG AAAGGATAAG GAGTACTGTA CCTATCTTGA TG - #ATGGTGTT2760- AGCCGTGATA GTGCGCCGGA AGACCTCCCA CAGTACAAAG AGACCCACGA AC - #AGTCGAAG2820- GTTGAAGGCG CGGAAATCGC AAAGCAGATT GGAAAGAAGA CGGGTTACAA CA - #TCTCAGGA2880- ACCGACCCAG AAGCAAAGGG TTATCACCGC AAAGTTGCTG TCACACAAAC GT - #CAAAAGAC2940- AAGACGCGTA CTGTCACTAT TGAGCCAAAA CACAATGGAT ACGACCCTTC CA - #AAGAGGTG3000- GGTGATTATT ATACCATCAT TCTTTGGTAC GCACCAGGTT TCGATGGCAG CA - #TCGTCGAT3060- GTGAGCAAGA CGACTGTGAA TGTTGAGGGT GGGGTGGAGC ACCAAGTTTA TA - #AGAACTCC3120- GATTTACATA CGGTTGTTAT CGACGTGAAG GAGGTGATCG GTACCACAAA GA - #GCGTCAAG3180# 3201TA A- (2) INFORMATION FOR SEQ ID NO: 4:- (i) SEQUENCE CHARACTERISTICS:#pairs (A) LENGTH: 3213 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear- (ii) MOLECULE TYPE: DNA (genomic)#4: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:- ATGGCAGGAT TATCCGACCC TCTCAATTTC TGCAAAGCAG AGGACTACTA CG - #CTGCTGCC 60- AAAGGCTGGA GTGGCCCTCA GAAGATCATT CGCTATGACC AGACCCCTCC TC - #AGGGTACA 120- AAAGATCCGA AAAGCTGGCA TGCGGTAAAC CTTCCTTTCG ATGACGGGAC TA - #TGTGTGTA 180- GTGCAATTCG TCAGACCCTG TGTTTGGAGG GTTAGATATG ACCCCAGTGT CA - #AGACTTCT 240- GATGAGTACG GCGATGAGAA TACGAGGACT ATTGTACAAG ACTACATGAC TA - #CTCTGGTT 300- GGAAACTTGG ACATTTTCAG AGGTCTTACG TGGGTTTCTA CGTTGGAGGA TT - #CGGGCGAG 360- TACTACACCT TCAAGTCCGA AGTCACTGCC GTGGACGAAA CCGAACGGAC TC - #GAAACAAG 420- GTCGGCGACG GCCTCAAGAT TTACCTATGG AAAAATCCCT TTCGCATCCA GG - #TAGTGCGT 480- CTCTTGACCC CCCTGGTGGA CCCTTTCCCC ATTCCCAACG TAGCCAATGC CA - #CAGCCCGT 540- GTGGCCGACA AGGTTGTTTG GCAGACGTCC CCGAAGACGT TCAGGAAAAA CT - #TGCATCCG 600- CAGCATAAGA TGTTGAAGGA TACAGTTCTT GATATTATCA AGCCGGGGCA CG - #GAGAGTAT 660- GTGGGTTGGG GAGAGATGGG AGGCATCGAG TTTATGAAGG AGCCAACATT CA - #TGAATTAT 720- TTCAACTTTG ACAATATGCA ATATCAGCAG GTCTATGCAC AAGGCGCTCT TG - #ATAGTCGT 780- GAGCCGTTGT ATCACTCTGA TCCCTTCTAT CTCGACGTGA ACTCCAACCC AG - #AGCACAAG 840- AACATTACGG CAACCTTTAT CGATAACTAC TCTCAGATTG CCATCGACTT TG - #GGAAGACC 900- AACTCAGGCT ACATCAAGCT GGGTACCAGG TATGGCGGTA TCGATTGTTA CG - #GTATCAGC 960- GCGGATACGG TCCCGGAGAT TGTGCGACTT TATACTGGAC TTGTTGGGCG TT - #CGAAGTTG1020- AAGCCCAGGT ATATTCTCGG AGCCCACCAA GCTTGTTATG GATACCAGCA GG - #AAAGTGAC1080- TTGCATGCTG TTGTTCAGCA GTACCGTGAC ACCAAGTTTC CGCTTGATGG GT - #TGCATGTC1140- GATGTCGACT TTCAGGACAA TTTCAGAACG TTTACCACTA ACCCGATTAC GT - #TCCCTAAT1200- CCCAAAGAAA TGTTTACCAA TCTAAGGAAC AATGGAATCA AGTGTTCCAC CA - #ACATCACC1260- CCTGTTATCA GTATCAGAGA TCGCCCGAAT GGGTACAGTA CCCTCAATGA GG - #GATATGAT1320- AAAAAGTACT TCATCATGGA TGACAGATAT ACCGAGGGGA CAAGTGGGGA CC - #CGCAAAAT1380- GTTCGATACT CTTTTTACGG CGGTGGGAAC CCGGTTGAGG TTAACCCTAA TG - #ATGTTTGG1440- GCTCGGCCAG ACTTTGGAGA CAATTATGAC TTCCCTACGA ACTTCAACTG CA - #AAGACTAC1500- CCCTATCATG GTGGTGTGAG TTACGGATAT GGGAATGGCA CTCCAGGTTA CT - #ACCCTGAC1560- CTTAACAGAG AGGAGGTTCG TATCTGGTGG GGATTGCAGT ACGAGTATCT CT - #TCAATATG1620- GGACTAGAGT TTGTATGGCA AGATATGACA ACCCCAGCGA TCCATTCATC AT - #ATGGAGAC1680- ATGAAAGGGT TGCCCACCCG TCTGCTCGTC ACCGCCGACT CAGTTACCAA TG - #CCTCTGAG1740- AAAAAGCTCG CAATTGAAAG TTGGGCTCTT TACTCCTACA ACCTCCATAA AG - #CAACCTTC1800- CACGGTCTTG GTCGTCTTGA GTCTCGTAAG AACAAACGTA ACTTCATCCT CG - #GACGTGGT1860- AGTTACGCCG GTGCCTATCG TTTTGCTGGT CTCTGGACTG GAGATAACGC AA - #GTACGTGG1920- GAATTCTGGA AGATTTCGGT CTCCCAAGTT CTTTCTCTAG GTCTCAATGG TG - #TGTGTATA1980- GCGGGGTCTG ATACGGGTGG TTTTGAGCCC GCACGTACTG AGATTGGGGA GG - #AGAAATAT2040- TGCAGTCCGG AGCTACTCAT CAGGTGGTAT ACTGGATCAT TCCTTTTGCC AT - #GGCTTAGA2100- AACCACTACG TCAAGAAGGA CAGGAAATGG TTCCAGGAAC CATACGCGTA CC - #CCAAGCAT2160- CTTGAAACCC ATCCAGAGCT CGCAGATCAA GCATGGCTTT ACAAATCTGT TC - #TAGAAATT2220- TGCAGATACT GGGTAGAGCT AAGATATTCC CTCATCCAGC TCCTTTACGA CT - #GCATGTTC2280- CAAAACGTGG TCGATGGTAT GCCACTTGCC AGATCTATGC TCTTGACCGA TA - #CTGAGGAT2340- ACGACCTTCT TCAATGAGAG CCAAAAGTTC CTCGATAACC AATATATGGC TG - #GTGACGAC2400- ATCCTTGTAG CACCCATCCT CCACAGCCGT AACGAGGTTC CGGGAGAGAA CA - #GAGATGTC2460- TATCTCCCTC TATTCCACAC CTGGTACCCC TCAAACTTGA GACCGTGGGA CG - #ATCAGGGA2520- GTCGCTTTAG GGAATCCTGT CGAAGGTGGC AGCGTTATCA ACTACACTGC CA - #GGATTGTT2580- GCCCCAGAGG ATTATAATCT CTTCCACAAC GTGGTGCCGG TCTACATCAG AG - #AGGGTGCC2640- ATCATTCCGC AAATTCAGGT ACGCCAGTGG ATTGGCGAAG GAGGGCCTAA TC - #CCATCAAG2700- TTCAATATCT ACCCTGGAAA GGACAAGGAG TATGTGACGT ACCTTGATGA TG - #GTGTTAGC2760- CGCGATAGTG CACCAGATGA CCTCCCGCAG TACCGCGAGG CCTATGAGCA AG - #CGAAGGTC2820- GAAGGCAAAG ACGTCCAGAA GCAACTTGCG GTCATTCAAG GGAATAAGAC TA - #ATGACTTC2880- TCCGCCTCCG GGATTGATAA GGAGGCAAAG GGTTATCACC GCAAAGTTTC TA - #TCAAACAG2940- GAGTCAAAAG ACAAGACCCG TACTGTCACC ATTGAGCCAA AACACAACGG AT - #ACGACCCC3000- TCTAAGGAAG TTGGTAATTA TTATACCATC ATTCTTTGGT ACGCACCGGG CT - #TTGACGGC3060- AGCATCGTCG ATGTGAGCCA GGCGACCGTG AACATCGAGG GCGGGGTGGA AT - #GCGAAATT3120- TTCAAGAACA CCGGCTTGCA TACGGTTGTA GTCAACGTGA AAGAGGTGAT CG - #GTACCACA3180# 3213 CTTG CACTACCGCT TAG- (2) INFORMATION FOR SEQ ID NO: 5:- (i) SEQUENCE CHARACTERISTICS:#acids (A) LENGTH: 75 amino (B) TYPE: amino acid (D) TOPOLOGY: linear- (ii) MOLECULE TYPE: peptide#5: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:- Lys Asn Leu His Pro Gln His Lys - # Met Leu Lys Asp Thr Val LeuAsp# 15- Ile Val Lys Pro Gly His Gly Glu - # Tyr Val Gly Trp Gly Glu MetGly# 30- Gly Ile Gln Phe Met Lys Glu Pro - # Thr Phe Met Asn Tyr Phe AsnPhe# 45- Asp Asn Met Gln Tyr Gln Gln Val - # Tyr Ala Gln Gly Ala Leu AspSer# 60- Arg Glu Pro Leu Tyr His Ser Asp - # Pro Phe Tyr# 75- (2) INFORMATION FOR SEQ ID NO: 6:- (i) SEQUENCE CHARACTERISTICS:#pairs (A) LENGTH: 23 base (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear- (ii) MOLECULE TYPE: DNA (genomic)- (ix) FEATURE: (A) NAME/KEY: misc.sub.-- - #difference#"") (B) LOCATION: replace(3,#/standard.sub.-- name= "N is G or A"- (ix) FEATURE: (A) NAME/KEY: misc.sub.-- - #difference#"") (B) LOCATION: replace(6,#/note= "N is C or T"NFORMATION:- (ix) FEATURE: (A) NAME/KEY: misc.sub.-- - #difference#"") (B) LOCATION: replace(3,#/note= "N is G or A"NFORMATION:- (ix) FEATURE: (A) NAME/KEY: misc.sub.-- - #difference#"") (B) LOCATION: replace(9,#/note= "N is G or A"NFORMATION:- (ix) FEATURE: (A) NAME/KEY: misc.sub.-- - #difference#"") (B) LOCATION: replace(15,#/note= "N is G or A or T or C":- (ix) FEATURE: (A) NAME/KEY: misc.sub.-- - #difference#"") (B) LOCATION: replace(18,#/note= "N is G or A"NFORMATION:- (ix) FEATURE: (A) NAME/KEY: misc.sub.-- - #difference#"") (B) LOCATION: replace(21,#/note= "N is C or T"NFORMATION:#6: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:# 23ANGA NAC- (2) INFORMATION FOR SEQ ID NO: 7:- (i) SEQUENCE CHARACTERISTICS:#pairs (A) LENGTH: 23 base (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear- (ii) MOLECULE TYPE: DNA (genomic)- (ix) FEATURE: (A) NAME/KEY: misc.sub.-- - #difference#"") (B) LOCATION: replace(3,#/note= "N is G or A"NFORMATION:- (ix) FEATURE: (A) NAME/KEY: misc.sub.-- - #difference#"") (B) LOCATION: replace(6,#/note= "N is C or T"NFORMATION:- (ix) FEATURE: (A) NAME/KEY: misc.sub.-- - #difference#"") (B) LOCATION: replace(9,#/note= "N is G or A"NFORMATION:- (ix) FEATURE: (A) NAME/KEY: misc.sub.-- - #difference#"") (B) LOCATION: replace(15,#/note= "N is G or A"NFORMATION:- (ix) FEATURE: (A) NAME/KEY: misc.sub.-- - #difference#"") (B) LOCATION: replace(18,#/note= "N is G or A"NFORMATION:- (ix) FEATURE: (A) NAME/KEY: misc.sub.-- - #difference#"") (B) LOCATION: replace(21,#/note= "N is C or T"NFORMATION:#7: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:# 23ANGA NAC- (2) INFORMATION FOR SEQ ID NO: 8:- (i) SEQUENCE CHARACTERISTICS:#pairs (A) LENGTH: 20 base (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear- (ii) MOLECULE TYPE: DNA (genomic)- (ix) FEATURE: (A) NAME/KEY: misc.sub.-- - #difference#"") (B) LOCATION: replace(3,#/note= "N is G or A"NFORMATION:- (ix) FEATURE: (A) NAME/KEY: misc.sub.-- - #difference#"") (B) LOCATION: replace(6,#/note= "N is G or A or T or C":- (ix) FEATURE: (A) NAME/KEY: misc.sub.-- - #difference#"") (B) LOCATION: replace(9,#/note= "N is G or A"NFORMATION:- (ix) FEATURE: (A) NAME/KEY: misc.sub.-- - #difference#"") (B) LOCATION: replace(12,#/note= "N is G or A"NFORMATION:- (ix) FEATURE: (A) NAME/KEY: misc.sub.-- - #difference#"") (B) LOCATION: replace(15,#/note= "N is G or A"NFORMATION:- (ix) FEATURE: (A) NAME/KEY: misc.sub.-- - #difference#"") (B) LOCATION: replace(18,#/note= "N is G or A"NFORMATION:#8: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:# 20 GNTA- (2) INFORMATION FOR SEQ ID NO: 9:- (i) SEQUENCE CHARACTERISTICS:#pairs (A) LENGTH: 20 base (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear- (ii) MOLECULE TYPE: DNA (genomic)- (ix) FEATURE: (A) NAME/KEY: misc.sub.-- - #difference#"") (B) LOCATION: replace(3,#/note= "N is G or A"NFORMATION:- (ix) FEATURE: (A) NAME/KEY: misc.sub.-- - #difference#"") (B) LOCATION: replace(6,#/note= "N is G or A or T or C":- (ix) FEATURE: (A) NAME/KEY: misc.sub.-- - #difference#"") (B) LOCATION: replace(9,#/note= "N is G or A"NFORMATION:- (ix) FEATURE: (A) NAME/KEY: misc.sub.-- - #difference#"") (B) LOCATION: replace(12,#/note= "N is G or A or T or C":- (ix) FEATURE: (A) NAME/KEY: misc.sub.-- - #difference#"") (B) LOCATION: replace(15,#/note= "N is G or A"NFORMATION:- (ix) FEATURE: (A) NAME/KEY: misc.sub.-- - #difference#"") (B) LOCATION: replace(18,#/note= "N is G or A"NFORMATION:#9: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:# 20 GNTA- (2) INFORMATION FOR SEQ ID NO: 10:- (i) SEQUENCE CHARACTERISTICS:#pairs (A) LENGTH: 37 base (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear- (ii) MOLECULE TYPE: DNA (genomic)#10: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:# 37 GCCA TGGCAGGATT TTCTGAT- (2) INFORMATION FOR SEQ ID NO:11:- (i) SEQUENCE CHARACTERISTICS:#pairs (A) LENGTH: 4726 base (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear- (ii) MOLECULE TYPE: cDNA- (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:#CAAGAAACAA 60TTTGTTT ATTCTATTCT GTGCGGCAGA TATGCACTCA#CGCACAAAGA 120TTCTAAT TACAGTTGTA GGTGCAGTTG AAAATCCGGT#GGGTATGTGT 180AAGATGA TAACGCCTGA TTAGTACTCA AGGTTTAATT#GCGGCAGCAA 240GGCTAGC ATTACCTGAT TGGTTACAAC TGCAAATACT#ATTTTATATT 300CAGCATC GATAGCTCGG CCTCATAAAA ATTGATTTCA#TCTTCATTCT 360TCGAATC CTATATAATG GCCATCGTTC CCTCCTCGCC#TTCCGAACAA 420AGCTCAG TCATCCCTCA ACTTGGCCTC CTCTGATATC#TGGCAGGATT 480ATCTTTT TTTGAGCTAG ATCTCATTAT ACCTCCGTCA#TAGACTGGAA 540AACTTTT GCAAAGCAGA AGACTACTAC AGTGTTGCGC#AGTTCCCCAA 600ATCATTG GAGTAGACAC TACTCCTCCA AAGAGCACCA#TTCAGTTCAT 660GTGAACT TGAGATTCGA TGATGGGACT TTAGGTGTGG#ACGAGTATGG 720TGGAGGG TTAGATACGA CCCTGGTTTC AAGACCTCTG#ATATGCTAAT 780TGAGTTA CCCCATATGT CATTATTGGT AGCGAAAAAC#CTGAGTAATA 840ATATAGG AGGACAATTG TGCAAGATTA TATGAGTACT#GGAGATTTCT 900TAGAGGT CTTACGTGGG AAACCAAGTG TGAGGATTCG#CACCACTAAC 960AAGTGCC AGTACTGCTA TAGCTCCGCT ATATATATAA#CCCGCAACAA 1020TAGTCCA AGGTCACCGC CGTTGAAAAA TCCGAGCGGA#AAGTAGTGCG 1080CTCAGAA TTCACCTATG GAAAAGCCCT TTCCGCATCC#CCGAAGCCCG 1140TTGAAGG ATCCTTACCC CATTCCAAAT GTAGCCGCAG#ACCTGCATCC 1200GTCGTTT GGCAAACGTC TCCCAAGACA TTCAGAAAGA#ATGGCGAGTA 1260CTAAAGG ATACAGTTCT TGACATTGTC AAACCTGGAC#TCATGAACTA 1320GAGATGG GAGGTATCCA GTTTATGAAG GAGCCAACAT#ACTAACCCAG 1380CGAAGAG GTTCCTTATA AATTCTTGGT GGTCATTTTT#TCGATTCTCG 1440AATATGC AATACCAGCA AGTCTATGCC CAAGGTGCTC#CTTTCAACAA 1500GTACCGT CCTGTGGCAC GACTTAACCC AATAACTAAT#AAGAATATCA 1560TCCCTTC TATCTTGATG TGAACTCCAA CCCGGAGCAC#ACCAACTCAG 1620CGATAAC TACTCTCAAA TTGCCATCGA CTTTGGAAAG#AGTGCGGATA 1680GGGAACC AGGTATGGTG GTATCGATTG TTACGGTATC#TTGAAGCCCA 1740TGTACGA CTTTATACAG GTCTTGTTGG ACGTTCAAAG#ATTTATCAGA 1800GGCCCAT CAAGCCTGTA AGTCCTTCCC CTCATGAGTG#AAGTGACTTG 1860CTAACCT CGTTTTCAAA GGTTATGGAT ACCAACAGGA#TCACGTCGAT 1920AGCAGTA CCGTGACTGT AAATTTCCAC TTGACGGGAT#TCTAACTGTG 1980TAAATGG CCATGGTATC ATTGAAGCTT TGAGAAATGT#CTTTCCCTAA 2040TAGGACG GCTTCAGAAC TTTCACCACC AACCCACACA#CCAATATCAC 2100TTTACTA ACTTGAGGAA TAATGGAATC AAGTGCTCCA#AGGGAGTTGA 2160ATTAACA ACAGAGAGGG TGGATACAGT ACCCTCCTTG#ATGCGAAGGA 2220ATCATGG ACGACAGATA TACCGAGGGA ACAAGTGGGA#ATGATGTTAA 2280TACTACG GTGGTGGTAA TAAGGTTGAG GTCGATCCTA#TAGGTAACCC 2340TTTAAAG ACAACTAGTA AGTTGTTTAT TTGACTACGA#AAACAATACC 2400ACATATT TGTAGTGACT TCCCCGCGAA CTTCAACAGC#GATATCTCAC 2460TGTGAGC TACGGTTATG GGAACGGTAG TGTAAGTGAC#GGTTTTTACC 2520TTTATAA GGACTAACTA GACACAAAAA TTTGTAGGCA#TATCTCTTCG 2580AAAGGAG GTTCGTATCT GGTGGGGAAT GCAGTACAAG#ACATCATATG 2640ATTTGTG TGGCAAGACA TGACTACCCC AGCAATCCAC#ACCAATGCCT 2700GTTGCCC ACCCGTCTAC TCGTCACCTC AGACTCCGTC#CACAAAGCAA 2760CGCAATT GAAACTTGGG CTCTCTACTC CTACAATCTC#ATCCTCGGGC 2820TAGTCGT CTCGAATCTC GTAAGAACAA ACGAAACTTC#AATGCAAGTA 2880CGGAGCC TATCGTTTTG CTGGTCTCTG GACTGGGGAT#AATGGTGTGT 2940GAAGATA TCGGTCTCTC AAGTTCTTTC TCTGGGCCTC#GGGGTCGAGG 3000TGATACG GGTGGTTTTG AACCCTACCG TGATGCAAAT#CTCTTGCCGT 3060CCCAGAG CTACTCATCA GGTGGTATAC TGGTTCATTC#TATCCTTTCT 3120TTATGTC AAAAAGGACA GGAAATGGTT CCAGGTAATC#TACCCCAAGC 3180TTGAAGA TACTAAGATA TAATCTAGGA ACCATACTCG#GTTTTGGAGA 3240TCCAGAA CTCGCAGACC AAGCATGGCT CTATAAATCC#GACTGCATGT 3300TGTGGAG CTTAGATACT CCCTCATCCA ACTACTTTAC#CTACCCTAGG 3360CGACGGT ATGCCAATCA CCAGATCTAT GGTATGTATT#ACCGATACTG 3420ATATGCT AACCAATTGA ACCTGGGTTT CTAGCTCTTG#ATGGCTGGTG 3480CTTCAAC GAGAGCCAAA AGTTCCTCGA CAACCAATAT#GAAAACAGAG 3540TGCACCC ATCCTCCACA GTCGCAAAGA AATTCCAGGC#TGGGACGATC 3600TCTTTAC CACACCTGGT ACCCCTCAAA TTTGAGACCA#ACTGCTAGGA 3660GGGGAAT CCTGTCGAAG GTGGTAGTGT CATCAATTAT#GTTAGAGAGG 3720GGATTAT AATCTCTTCC ACAGCGTGGT ACCAGTCTAC#CTAACGCTAT 3780TAATCTC TTCCCAGTTT CAAATACATT TAGCTAGTAG#CCAGGGGGGA 3840CCATCAT CCCGCAAATC GAAGTACGCC AATGGACTGG#CAATGATCAC 3900AGTTCAA CATCTACCCT GGAAAGGATA AGGTAAAATT#TCAAAATACA 3960CATCGCT GGTTTTCTTT ACCCTTACTG ACTTCATTCC#AAGACCTCCC 4020TATCTTG ATGATGGTGT TAGCCGTGAT AGTGCGCCGG#CAAAGCAGAT 4080ACCCACG AACAGTCGAA GGTTGAAGGC GCGGAAATCG#GTTATCACCG 4140GGTTACA ACATCTCAGG AACCGACCCA GAAGCAAAGG#GACATCATGC 4200ACACAAG TAATACCGCC CTTGACTTGT ATCACTTCCT#TCACTATTGA 4260TTTACCT CAAAGACGTC AAAAGACAAG ACGCGTACTG#CCATCATTCT 4320GGATACG ACCCTTCCAA AGAGGTGGGT GATTATTATA#CTGTGAATGT 4380GGTTTCG ATGGCAGCAT CGTCGATGTG AGCAAGACGA#TTGTTATCGA 4440GAGCACC AAGTTTATAA GAACTCCGAT TTACATACGG#CCGCTTAAGG 4500ATCGGTA CCACAAAGAG CGTCAAGATC ACATGTACTG#TAACTCCGGG 4560GCGGGAG GCGAGACCTT CGAAATGTAT ACGGGAGTGG#ATTTCTTTAT 4620GGGGGAT CAAGTTGGAG GGGAATCTGT TTATTTCTTT#GTTGGCCTCT 4680AAATAGG GAGCACAGTT CTGACTGGAT TGGTTTGATT# 4726TTGT CTGGAAATCC AATTTATTGT TATGCG- (2) INFORMATION FOR SEQ ID NO:12:- (i) SEQUENCE CHARACTERISTICS:#pairs (A) LENGTH: 4670 base (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear- (ii) MOLECULE TYPE: cDNA- (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:#AGGAAACAAA 60CAGGCGT TTTTTGTTTT ATCCGCAGAG GTGCAGCAGC#CACCAAGAAC 120CCTTGAC GCGGTTTTAG GTGCAGTTAA GGCCCGGGCG#GTTTGATTGG 180GTCTAAA AAAGATCATA ATACCCGATT AGTGTTCATG#CCAATATCAC 240TTTTACA GAGTTCAGCT TAGTTCATTG TTCGAAACTA#CTTACATTTC 300GGCATTG ATAGCTCGGC TTGTGAAAGC TGATTACAAT#CCTTTTGGTT 360GACTGAT CTATATATAA GGGTCATCAT TTCCTCTCCG#TCCGTTGATA 420CCAGCCC AATCATCACC GTTGGCCTTT ACTTCTCTCT#CCCTCCAACA 480ACATCTT GTCCACTGTT AGGCTAGCTC CCAGAATTAT#GCTGCTGCCA 540CGACCCT CTCAATTTCT GCAAAGCAGA GGACTACTAC#CAGGGTACAA 600CCCTCAG AAGATCATTC GCTATGACCA GACCCCTCCT#ATGTGTGTAG 660CTGGCAT GCGGTAAACC TTCCTTTCGA TGACGGGACT#AAGACTTCTG 720ACCCTGT GTTTGGAGGG TTAGATATGA CCCCAGTGTC#GCTAGTGATT 780TGAGAAT ACGTGGGTCG CCCAGTCAAT TAACTATGCC#ACAAGACTAC 840GCTAACC GATCAATGAG GCATGTAGGA GGACTATTGT#TTCTACGTTG 900TTGGAAA CTTGGACATT TTCAGAGGTC TTACGTGGGT#TCGAATATAT 960AGTACTA CACCTTCAAG GCAAGCCTCA GTGTTATATC#GACGAAACCG 1020AAACTAA CTAGTCATAC AGTCCGAAGT CACTGCCGTG#AATCCCTTTC 1080CAAGGTC GGCGACGGCC TCAAGATTTA CCTATGGAAA#CCCAACGTAG 1140GCGTCTC TTGACCCCCC TGGTGGACCC TTTCCCCATT#AAGACGTTCA 1200CCGTGTG GCCGACAAGG TTGTTTGGCA GACGTCCCCG#ATTATCAAGC 1260TCCGCAG CATAAGATGT TGAAGGATAC AGTTCTTGAT#ATGAAGGAGC 1320GTATGTG GGTTGGGGAG AGATGGGAGG CATCGAGTTT#GACGGTCGTT 1380TTATTTC AGTAAGCTCT TGAAAGATTT CCTATCTCTT#TGCACAAGGC 1440CTGTAGA CTTTGACAAT ATGCAATATC AGCAGGTCTA#ATTACAGTAA 1500GTGAGCC GTTGTAAGTA ACGTCCTGTG ACATGTCATG#TCCAACCCAG 1560AAGGTAT CACTCTGATC CCTTCTATCT CGACGTGAAC#ATCGACTTTG 1620TACGGCA ACCTTTATCG ATAACTACTC TCAGATTGCC#GATTGTTACG 1680AGGCTAC ATCAAGCTGG GTACCAGGTA TGGCGGTATC#GTTGGGCGTT 1740TACGGTC CCGGAGATTG TGCGACTTTA TACTGGACTT#GCCCCCTTTA 1800CAGGTAT ATTCTCGGAG CCCACCAAGC TTGTAAGCCC#ATACCAGCAG 1860AGGGGTC CACAGACTAA ACTTGTTCCA AAGGTTATGG#GCTTGATGGG 1920ATGCTGT TGTTCAGCAG TACCGTGACA CCAAGTTTCC#TTTGGAGAAT 1980TCGACTT TCAGGTAAAT GGCCCAGGTA TCGTTGAAGC#AACCCGATTA 2040GTAAAAC TTTAAGGACA ATTTCAGAAC GTTTACCACT#AAGTGTTCCA 2100CAAAGAA ATGTTTACCA ATCTAAGGAA CAATGGAATC#ACCCTCAATG 2160TGTTATC AGTATCAGAG ATCGCCCGAA TGGGTACAGT#ACAAGTGGGG 2220AAAGTAC TTCATCATGG ATGACAGATA TACCGAGGGG#GTTAACCCTA 2280TCGATAC TCTTTTTACG GCGGTGGGAA CCCGGTTGAG#TAGGCTACTT 2340TCGGCCA GACTTTGGAG ACAATTAGTA AGTTACTCAA#CTTCAACTGC 2400GGTGGCA TTAACACGAC TATAGTGACT TCCCTACGAA#TGTAAGTGAT 2460ATCATGG TGGTGTGAGT TACGGATATG GGAATGGCAC#AATTTTAGCC 2520TACAACG TAATTCATGG AGACTAATCA GTGGTAAATG#TGCAGTACGA 2580GACCTTA ACAGAGAGGA GGTTCGTATC TGGTGGGGAT#CAGCGATCCA 2640ATGGGAC TAGAGTTTGT ATGGCAAGAT ATGACAACCC#CCGACTCAGT 2700GACATGA AAGGGTTGCC CACCCGTCTG CTCGTCACCG#CCTACAACCT 2760GAGAAAA AGCTCGCAAT TGAAAGTTGG GCTCTTTACT#AACGTAACTT 2820TTCCACG GTCTTGGTCG TCTTGAGTCT CGTAAGAACA#GGACTGGAGA 2880GGTAGTT ACGCCGGTGC CTATCGTTTT GCTGGTCTCT#CTCTAGGTCT 2940TGGGAAT TCTGGAAGAT TTCGGTCTCC CAAGTTCTTT#GTACTGAGAT 3000ATAGCGG GGTCTGATAC GGGTGGTTTT GAGCCCGCAC#GATCATTCCT 3060TATTGCA GTCCGGAGCT ACTCATCAGG TGGTATACTG#AGGTAATATA 3120AGAAACC ACTACGTCAA GAAGGACAGG AAATGGTTCC#ACGCGTACCC 3180CTGAGTA TCGAAGACGC TAAGACAATA TAGGAACCAT#AATCTGTTCT 3240ACCCATC CAGAGCTCGC AGATCAAGCA TGGCTTTACA#TTTACGACTG 3300TACTGGG TAGAGCTAAG ATATTCCCTC ATCCAGCTCC#GCATTTTATC 3360GTGGTCG ATGGTATGCC ACTTGCCAGA TCTATGGTAT#TTGACCGATA 3420GATAATG CACCAGTCTA ACCGAATTTT CTTTTAGCTC#TATATGGCTG 3480CTTCTTC AATGAGAGCC AAAAGTTCCT CGATAACCAA#GGAGAGAACA 3540TGTAGCA CCCATCCTCC ACAGCCGTAA CGAGGTTCCG#CCGTGGGACG 3600CCCTCTA TTCCACACCT GGTACCCCTC AAACTTGAGA#TACACTGCCA 3660TTTAGGG AATCCTGTCG AAGGTGGCAG CGTTATCAAC#TACATCAGAG 3720AGAGGAT TATAATCTCT TCCACAACGT GGTGCCGGTC#TGACACCTTA 3780GAATAAT TTCTTGCAAG TTCCAGATAC AAGTGGTTAC#GGAGGGCCTA 3840CATTCCG CAAATTCAGG TACGCCAGTG GATTGGCGAA#TGACTATCGC 3900CAATATC TACCCTGGAA AGGACAAGGT ATATTCTCCA#GTGACGTACC 3960CTCTACT CGCACTAACT TCATCTGAAT ATAGGAGTAT#CGCGAGGCCT 4020TAGCCGC GATAGTGCAC CAGATGACCT CCCGCAGTAC#ATTCAAGGGA 4080GGTCGAA GGCAAAGACG TCCAGAAGCA ACTTGCGGTC#TATCACCGCA 4140CTTCTCC GCCTCCGGGA TTGATAAGGA GGCAAAGGGT#CTATTATATC 4200ACAGGTA CATGATTTCA TCTTCCTTTT TTCGCAGTCA#ACTGTCACCA 4260TTCTCTT ATTTAAAAGG AGTCAAAAGA CAAGACCCGT#TATACCATCA 4320CAACGGA TACGACCCCT CTAAGGAAGT TGGTAATTAT#GCGACCGTGA 4380ACCGGGC TTTGACGGCA GCATCGTCGA TGTGAGCCAG#ACGGTTGTAG 4440GGTGGAA TGCGAAATTT TCAAGAACAC CGGCTTGCAT#ACTACCGCTT 4500GGTGATC GGTACCACAA AGTCCGTCAA GATCACTTGC#AGCTTCTGGG 4560GAGGGGT ATATGGGAGT GGCAGCTCAG AAATTTGGGA#ATTCTCTCTG 4620TATTTAC TTATTTATTG AATCGACCAA TACGGGTGGG# 4670TATGTTT TACTTGGTCT GAAAATCAAA TTCGTTCTCA__________________________________________________________________________
Claims
  • 1. A method of preparing fungal .alpha.-1,4-glucan lyase comprising isolating .alpha.-1,4-glucan lyase from a culture of a fungus.
  • 2. A method according to claim 1 wherein the .alpha.-1,4-glucan lyase is isolated and/or further purified using a gel that is not degraded by .alpha.-1,4-lyase.
  • 3. A method according to claim 2 wherein the gel comprises dextrin.
  • 4. A method according to claim 1, wherein the fungus is selected from the group consisting of Morchella costata or Morchella vulgaris.
  • 5. An .alpha.-1,4-glucan lyase enzyme prepared by the method of claim 1.
  • 6. An isolated fungal .alpha.-1,4-glucan lyase enzyme comprising an amino acid sequence selected from the group consisting of SEQ ID NO:1, SEQ ID NO:2, and any variant thereof, said enzyme being obtainable from a culture of a fungus.
  • 7. An isolated polynucleotide fragment coding for fungal .alpha.-1,4-glucan lyase.
  • 8. The isolated polynucleotide according to claim 7 wherein the polynucleotide comprises DNA.
  • 9. An isolated polynucleotide according to claim 8 wherein the DNA comprises a sequence that is the same as, or is complementary to, or specifically hybridizes under stringent hybridization conditions with, or contains any suitable codon substitutions for any of those of, SEQ ID NO:3 or SEQ ID NO:4.
  • 10. A method of preparing .alpha.-1,4-glucan lyase enzyme comprising the step of expressing in a biological system a polynucleotide having a sequence selected from the group consisting of SEQ ID NO:3 and SEQ ID NO:4.
  • 11. A method according to claim 2, wherein the gel comprises a cyclodextrin.
  • 12. A method according to claim 11, wherein the gel comprises beta-cyclodextrin.
Priority Claims (1)
Number Date Country Kind
9321302 Oct 1993 GBX
PCT Information
Filing Document Filing Date Country Kind 102e Date 371c Date
PCT/EP94/03398 10/15/1994 7/8/1996 7/8/1996
Publishing Document Publishing Date Country Kind
WO95/10617 4/20/1995
Foreign Referenced Citations (1)
Number Date Country
2617502 Jun 1987 FRX
Non-Patent Literature Citations (25)
Entry
Yu, Shukun, et al.; Biochimica et Biophysuca Acta, 1156(1993) 313-320; a-1, 4-Glucan lyase, a new class of starch/glycogen degrading enzyme. I. Efficient purification and characterization from red seaweeds.
Pall, Martin L., et al.; Fungal Genetics Newsletter; pp. 58-63; The use of Ignite (Basta;glufosinate;phosphinothricin) to select transformants of bar-containing plasmids in Neurospora crassa.
Punt, Peter J., et al.; Fungal Transformation Using Dominant Selection; Methods in Enzymology, vol. 216; 1992; pp. 447-449; Transformation of Filamentous Fungi Based on Mygromycin B and Phleomycin Resistance Markers.
Archer, David B., et al.; Biotechnology Letters, vol. 14, No. 5 (May 1992) pp. 357-362; Proleolytic Degradation of Heterologous Proteins Expressed in Aspergillus Niger.
Punt, Peter J., et al.; Biotechnology, 17 (1991) 19-33; Intracellular and extracellular production of proteins in Aspergillus under the control of expression signals of the highly expressed Aspergillus nidulans gpdA gene.
Daboussi, M.J., et al.; Current Genetics, 1989; 15: 453-456; Short communications; Transformation of seven species of filamentous fungi using the nitrate reductase gene of Aspergillus nidulans.
Dellaporta, Stephen L., et al.; Plant Molecular Biology Reporter; vol. 1, No. 4, Fall 1983; pp. 19-21; A Plant DNA Minipreparation: Version II.
Barkholt, Vibeke, et al.; Analytical Biochemistry 177, 318-211, pp. 318-322 (1989); Amino Acid Analysis: Determination of cysteine plus Half-Cystine in Proteins after Hydrochloric Acid Hydrolysis with a Disulfide Compound as Additive.
Sanger et al., "DNA sequencing with Chain-Terminating inhibitors," Proc National Academy Science, USA vol. 74, pp. 5463-5467 (1979).
Dellaporta et al., "A plant DNA minipreparation: Version II," Plant Molecular Biology Reporter, vol. 1, pp. 19-21 (1983).
Buxton et al., "Transformation of Aspergillus niger using the argB Gene of Aspergillus nidulans," Gene, vol. 37, pp. 207-214 (1985).
Baute et al., "Fungal enzymic activity degrading 1,4-x-D-Glucans to 1,5-D-Anhydrofructose" Phytochemistry, vol. 27, No. 11, pp. 3401-3403 (1988).
Punt et al., "Intracellular and extracellular production of proteins in Aspergillus under . . . " Journal of Biotechnology, vol. 17, pp. 19-34 (1991).
Archer et al., "Proteolytic degradation of Heterologous proteins expressed in . . . ," Biotechnology Letters, vol. 14, No. 5, pp. 357-362 (1992).
Punt et al., "Transformation of Filamentous fungi based on Hygromysin B and Phleomycin . . . ," Methods of Enzymology, vol. 216, pp. 447-457 (1992).
Pall et al., "The use of ignite to select transformants of bar containing plasmids in . . . ," Fungal Genetics Newsletter, vol. 40, pp. 59-62 (1993).
Plant Physiology, Biochemistry and Biophysics, Bio. Abstr. 57: AB-926, No. 52735, referencing Baute, et al., Phytochemistry, 27:3401-3404 (1988).
Baute, et al., Phytochemistry, "Fungal Enzymic Activity Degrading 1,4-a-D-Glucans to 1,5,-D-Anhydrofructose," 27:3401-3403 (1988).
Baute, et al., Bull. Soc. Pharm. Bordeaux, "Bioconversions Fongiques Produisant, A Partr De Sucres, Des Composes Pyrontiques Inhabituels A Activite Antibiotique," 128:9-18 (1989).
Shukun Yu and Marianne Pedersen, Planta, "a-1,4, Glucan lyase, a new class of starch/glycogen-degrading enzyme," 191:137-143, (1993).
Yu et al, Planta (1993) 191:137-142.
Baute et al, Phytochemistry (1988) vol. 27:3401-3403.
Kriegler, Gene Transfer & Expression, Stockton Press, NY, NY, 1990 pp. 114-132.
Yu et al, Biochimica of Biophys Acta (1993)1156: 313-320.
Lee et al, Science (1988)239:1288-1291.