Genes of carotenoid biosynthesis and metabolism and a system for screening for such genes

BACKGROUND OF THE INVENTION
1. Field of the Invention
The present invention describes the DNA sequence for eukaryotic genes encoding .epsilon., isopentenyl pyrophosphate isomerase (IPP) and .beta.-carotene hydroxylase as well as vectors containing the same and hosts transformed with said vectors. The present invention also provides a method for augmenting the accumulation of carotenoids and production of novel and rare carotenoids. The present invention provides methods for controlling the ratio of various carotenoids in a host. Additionally, the present invention provides a method for screening for eukaryotic genes encoding enzymes of carotenoid biosynthesis and metabolism.
2. Discussion of the Background
Carotenoid pigments with cyclic endgroups are essential components of the photosynthetic apparatus in oxygenic photosynthetic organisms (e.g., cyanobacteria, algae and plants; Goodwin, 1980). The symmetrical bicyclic yellow carotenoid pigment .beta.-carotene (or, in rare cases, the asymmetrical bicyclic .alpha.-carotene) is intimately associated with the photosynthetic reaction centers and plays a vital role in protecting against potentially lethal photooxidative damage (Koyama, 1991). .beta.-carotene and other carotenoids derived from it or from .alpha.-carotene also serve as light-harvesting pigments (Siefermann-Harms, 1987), are involved in the thermal dissipation of excess light energy captured by the light-harvesting antenna (Demmig-Adams & Adams, 1992), provide substrate for the biosynthesis of the plant growth regulator abscisic acid (Rock & Zeevaart, 1991), and are precursors of vitamin A in human and animal diets (Krinsky, 1987). Plants also exploit carotenoids as coloring agents in flowers and fruits to attract pollinators and agents of seed dispersal (Goodwin, 1980). The color provided by carotenoids is also of agronomic value in a number of important crops. Carotenoids are currently harvested from plants for use as pigments in food and feed.
The probable pathway for formation of cyclic carotenoids in plants, algae and cyanobacteria is illustrated in FIG. 1. Two types of cyclic endgroups are commonly found in higher plant carotenoids, these are referred to as the .beta. and .alpha. cyclic endgroups (FIG. 3.; the acyclic endgroup is referred to as the .PSI. or psi endgroup). These cyclic endgroups differ only in the position of the double bond in the ring. Carotenoids with two .beta. rings are ubiquitous, and those with one .beta. and one .epsilon. ring are common, but carotenoids with two .epsilon. rings are rarely detected. .beta.-Carotene (FIG. 1) has two .beta. endgroups and is a symmetrical compound that is the precursor of a number of other important plant carotenoids such as zeaxanthin and violaxanthin (FIG. 2).
Carotenoid enzymes have previously been isolated from a variety of sources including bacteria (Armstrong et al., 1989, Mol. Gen. Genet. 216, 254-268; Misawa et al., 1990, J. Bacteriol., 172, 6704-12), fungi (Schmidhauser et al., 1990, Mol. Cell. Biol. 10, 5064-70), cyanobacteria (Chamovitz et al., 1990, Z. Naturforsch, 45c, 482-86) and higher plants (Bartley et al., Proc. Natl. Acad. Sci USA 88, 6532-36; Martinez-Ferez & Vioque, 1992, Plant Mol. Biol. 18, 981-83). Many of the isolated enzymes show a great diversity in function and inhibitory properties between sources. For example, phytoene desaturases from Synechococcus and higher plants carry out a two-step desaturation to yield .zeta.-carotene as a reaction product; whereas the same enzyme from Erwinia introduces four double bonds forming lycopene. Similarity of the amino acid sequences are very low for bacterial versus plant enzymes. Therefore, even with a gene in hand from one source, it is difficult to screen for a gene with similar function in another source. In particular, the sequence similarity between prokaryotic and eukaryotic genes is quite low.
Further, the mechanism of gene expression in prokaryotes and eukaryotes appears to differ sufficiently such that one can not expect that an isolated eukaryotic gene will be properly expressed in a prokaryotic host.
The difficulties in isolating related genes is exemplified by recent efforts to isolated the enzyme which catalyzes the formation of .beta.-carotene from the acyclic precursor lycopene. Although this enzyme had been isolated in a prokaryote, it had not been isolated from any photosynthetic organism nor had the corresponding genes been identified and sequenced or the cofactor requirements established. The isolation and characterization of the enzyme catalyzing formation of .beta.-carotene in the cyanobacterium Synechococcus PCC7942 was described by the present inventors and others (Cunningham et al., 1993 and 1994).
The need remains for the isolation of eukaryotic genes involved in the carotenoid biosynthetic pathway, including a gene encoding an .epsilon. cyclase, IPP isomerase and .beta.-carotene hydroxylase. There remains a need for methods to enhance the production of carotenoids. There also remains a need in the art for methods for screening for eukaryotic genes encoding enzymes of carotenoid biosynthesis and metabolism.
SUMMARY OF THE INVENTION
Accordingly, a first object of this invention is to provide isolated eukaryotic genes which encode enzymes involved in carotenoid biosynthesis; in particular, .epsilon. cyclase, IPP isomerase and .beta.-carotene hydroxylase.
A second object of this invention is to provide eukaryotic genes which encode enzymes which produce novel carotenoids.
A third object of the present invention is to provide vectors containing said genes.
A fourth object of the present invention is to provide hosts transformed with said vectors.
Another object of the present invention is to provide hosts which accumulates novel or rare carotenoids or which overexpress known carotenoids.
Another object of the present invention is to provide hosts with inhibited carotenoid production.
Another object of this invention is to secure the expression of eukaryotic carotenoid-related genes in a recombinant prokaryotic host.
A final object of the present invention is to provide a method for screening for eukaryotic genes which encode enzymes involved in carotenoid biosynthesis and metabolism.
These and other objects of the present invention have been realized by the present inventors as described below.

BRIEF DESCRIPTION OF THE DRAWINGS
A more complete appreciation of the invention and many of the attendant advantages thereof will be readily obtained as the same becomes better understood by reference to the following detailed description when considered in connection with the accompanying drawings, wherein:
FIG. 1 is a schematic representation of the pathway of .beta.-carotene biosynthesis in cyanobacteria, algae and plants. The enzymes catalyzing various steps are indicated at the left. Target sites of the bleaching herbicides NFZ and MPTA are also indicated at the left. Abbreviations: DMAPP, dimethylallyl pyrophosphate; FPP, farnesyl pyrophosphate; GGPP, geranylgeranyl pyrophosphate; GPP, geranyl pyrophosphate; IPP, isopentenyl pyrophosphate; LCY, lycopene cyclase; MVA, mevalonic acid; MPTA, 2-(4-methylphenoxy)triethylamine hydrochloride; NFZ, norflurazon; PDS, phytoene desaturase; PSY, phytoene synthase; ZDS, .zeta.-carotene desaturase; PPPP, prephytoene pyrophosphate.
FIG. 2 depicts possible routes of synthesis of cyclic carotenoids and common plant and algal xanthophylls (oxycarotenoids) from neurosporene. Demonstrated activities of the .beta.- and .epsilon.-cyclase enzymes of A. thaliana are indicated by bold arrows labelled with .beta. or .epsilon. respectively. A bar below the arrow leading to .epsilon.-carotene indicates that the enzymatic activity was examined but no product was detected. The steps marked by an arrow with a dotted line have not been specifically examined. Conventional numbering of the carbon atoms is given for neurosporene and .alpha.-carotene. Inverted triangles (.tangle-soliddn.) mark positions of the double bonds introduced as a consequence of the desaturation reactions.
FIG. 3 depicts the carotene endgroups which are found in plants.
FIG. 4 is a DNA sequence and the predicted amino acid sequence of .epsilon. cyclase isolated from A. thaliana (SEQ ID NOS: 1 and 2). These sequences were deposited under Genbank accession number U50738. This cDNA is incorporated into the plasmid pATeps.
FIG. 5 is a DNA sequence encoding the .beta.-carotene hydroxylase isolated from A. thaliana (SEQ ID NO: 3). This cDNA is incorporated into the plasmid pATOHB.
FIG. 6 is an alignment of the predicted amino acid sequences of A. thaliana .beta.-carotene hydroxylase (SEQ ID NO: 4) with the bacterial enzymes from Alicalgenes sp. (SEQ ID NO: 5) (Genbank D58422), Erwinia herbicola Eho10 (SEQ ID NO.: 6) (GenBank M872280), Erwinia uredovora (SEQ ID NO.: 7) (GenBank D90087) and Agrobacterium aurianticum (SEQ ID NO.: 8) (GenBank D58420). A consensus sequence is also shown. Consensus is identical for all five genes where a capital letter appears. A lowercase letter indicates that three of five, including A. thaliana, have the identical residue. TM; transmembrane
FIG. 7 is a DNA sequence of a cDNA encoding an IPP isomerase isolated from A. thaliana (SEQ ID NO: 9). This cDNA is incorporated into the plasmid pATDP5.
FIG. 8 is a DNA sequence of a second cDNA encoding another IPP isomerase isolated from A. thaliana (SEQ ID NO: 10). This cDNA is incorporated into the plasmid pATDP7.
FIG. 9 is a DNA sequence of a cDNA encoding an IPP isomerase isolated from Haematococcus pluvialis (SEQ ID NO: 11). This cDNA is incorporated into the plasmid pHP04.
FIG. 10 is a DNA sequence of a second cDNA encoding another IPP isomerase isolated from Haematococcus pluvialis (SEQ ID NO: 12). This cDNA is incorporated into the plasmid pHP05.
FIG. 11 is an alignment of the predicted amino acid sequences of the IPP isomerase isolated from A. thaliana (SEQ ID NO.: 16 and 18), H. pluvialis (SEQ ID NOS.: 14 and 15), Clarkia breweri (SEQ ID NO.: 17) (See, Blanc & Pichersky, Plant Physiol. (1995) 108:855; Genbank accession no. X82627) and Saccharomyces cerevisiae (SEQ ID NO.: 19) (Genbank accession no. J05090).
FIG. 12 is a DNA sequence of the cDNA encoding an IPP isomerase isolated from marigold (SEQ ID NO: 13). This cDNA is incorporated into the plasmid pPMDP1. xxx's denote a region not yet sequenced at the time when this application was prepared.
FIG. 13 is an alignment of the consensus sequence of 4 plant .beta.-cyclases (SEQ ID NO.: 20) with the A. thaliana .epsilon.-cyclase (SEQ ID No.: 21). A capital letter in the plant .beta. consensus is used where all 4 .beta. cyclase genes predict the same amino acid residue in this position. A small letter indicates that an identical residue was found in 3 of the 4. Dashes indicate that the amino acid residue was not conserved and dots in the sequence denote a gap. A consensus for the aligned sequences is given, in capital letters below the alignment, where the .beta. and .epsilon. cyclase have the same amino acid residue. Arrows indicate some of the conserved amino acids that will be used as junction sites for construction of chimeric cyclases with novel enzymatic activities. Several regions of interest including a sequence signature indicative of a dinucleotide-binding motif and 2 predicted transmembrane (TM) helical regions are indicated below the alignment and are underlined.

DESCRIPTION OF THE PREFERRED EMBODIMENTS
Isolated eukaryotic genes which encode enzymes involved in carotenoid biosynthesis
The present inventors have now isolated eukaryotic genes encoding .epsilon.and .beta.-carotene hydroxylase from A. thaliana and IPP isomerases from several sources.
The present inventors have now isolated the eukaryotic gene encoding the enzyme IPP isomerase which catalyzes the conversion of isopentenyl pyrophosphate (IPP) to dimethylallyl pyrophosphate (DMAPP). IPP isomeraseswere isolated from A. thaliana, H. pluvialis and marigold.
Alignments of these are shown in FIG. 12 (excluding the marigold sequence).Plasmids containing these genes were deposited with the American Type Culture Collection, 12301 Parklawn Drive, Rockville Md. 20852 on Mar. 4, 1996 under ATCC accession numbers 98000 (pHP05-H. pluvialis); 98001 (pMDP1-marigold); 98002 (pATDP7-H. pluvialis) and 98004 (pHP04-H. pluvialis).
The present inventors have also isolated the gene encoding the enzyme, .epsilon. cyclase, which is responsible for the formation of .epsilon. endgroups in carotenoids. A gene encoding an .epsilon. cyclase from any organism has not heretofore been described. The A. thaliane .epsilon.-cyclase adds an .epsilon.-ring to only one end of the symmetrical lycopene while the related .beta.-cyclase adds a ring at both ends. The DNA of the present invention is shown in FIG. 4 and SEQ ID NO: 1. A plasmid containing this gene was deposited with the American Type Culture Collection, 12301 Parklawn Drive, Rockville Md. 20852 on Mar. 4, 1996 under ATCC accession number 98005 (pATeps -A. thaliana).
The present inventors have also isolated the gene encoding the enzyme, .beta.-carotene hydroxylase, which is responsible for hydroxylating the .beta. endgroup in carotenoids. The DNA of the present invention is shown in SEQ ID NO: 3 and FIG. 5. The full length gene product hydroxylates bothend groups of .beta.-carotene as do products of genes which encode proteinstruncated by up to 50 amino acids from the N-terminus. Products of genes which encode proteins truncated between about 60-110 amino acids from the N-terminus preferentially hydroxylates only one ring. A plasmid containingthis gene was deposited with the American Type Culture Collection, 12301 Parklawn Drive, Rockville Md. 20852 on Mar. 4, 1996 under ATCC accession number 98003 (pATOHB-A. thaliana).
Eukaryotic genes which encode enzymes which produce novel or rare carotenoids
The present invention also relates to novel enzymes which can transform known carotenoids into novel or rare products. That is, currently .epsilon.-carotene (see FIG. 2) and .gamma.-carotene can only be isolated in minor amounts. As described below, an enzyme can be produced which would transform lycopene to .gamma.-carotene and lycopene to .epsilon.-carotene. With these products in hand, bulk synthesis of other carotenoids derived from them are possible. For example, .epsilon.-carotene can be hydroxylated to form an isomer of lutein (1 .epsilon.- and 1 .beta.-ring) and zeaxanthin (2 .beta.-rings) where both endgroups are, instead, .epsilon.-rings.
The eukaryotic genes in the carotenoid biosynthetic pathway differ from their prokaryotic counterparts in their 5' region. As used herein, the 5' region is the region of eukaryotic DNA which precedes the initiation codonof the counterpart gene in prokaryotic DNA. That is, when the consensus areas of eukaryotic and prokaryotic genes are aligned, the eukaryotic genes contain additional coding sequences upstream of the prokaryotic initiation codon.
The present inventors have found that the amount of the 5' region present can alter the activity of the eukaryotic enzyme. Instead of diminishing activity, truncating the 5' region of the eukaryotic gene results in an enzyme with a different specificity. Thus, the present invention relates to enzymes which are truncated to within 0-50, preferably 0-25, codons of the 5' initiation codon of their prokaryotic counterparts as determined byalignment maps.
For example, as discussed above, when the gene encoding A. thaliana .beta.-carotene hydroxylase was truncated, the resulting enzyme catalyzed the formation of .beta.-cryptoxanthin as major product and zeaxanthin as minor product; in contrast to its normal production of zeaxanthin.
In addition to novel enzymes produced by truncating the 5' region of known enzymes, novel enzymes which can participate in the formation of novel carotenoids can be formed by replacing portions of one gene with an analogous sequence from a structurally related gene. For example, .beta.-cyclase and .epsilon.-cyclase are structurally related (see FIG. 13). By replacing a portion of .beta.-lycopene cyclase with the analogous portion of .epsilon.-cyclase, an enzyme which produces .gamma.-carotene will be produced (1 endgroup). Further, by replacing a portion of the .epsilon.-lycopene cyclase with the analogous portion of .beta.-cyclase, an enzyme which produces .epsilon.-carotene will be produced (.epsilon.-cyclase normally produces a compound with 1 .epsilon.-endgroup (.delta.-carotene) not 2). Similarly, .beta.-hydroxylase could be modifiedto produce enzymes of novel function by creation of hybrids with .epsilon.-hydroxylase.
Vectors
The genes encoding the carotenoid enzymes as described above, when cloned into a suitable expression vector, can be used to overexpress these enzymes in a plant expression system or to inhibit the expression of theseenzymes. For example, a vector containing the gene encoding .epsilon.-cyclase can be used to increase the amount of .alpha.-carotene in an organism and thereby alter the nutritional value, pharmacology and visual appearance value of the organism.
In a preferred embodiment, the vectors of the present invention contain a DNA encoding an eukaryotic IPP isomerase upstream of a DNA encoding a second eukaryotic carotenoid enzyme. The inventors have discovered that inclusion of an IPP isomerase gene increases the supply of substrate for the carotenoid pathway; thereby enhancing the production of carotenoid endproducts. This is apparent from the much deeper pigmentation in carotenoid-accumulating colonies of E. coli which also contain one of the aforementioned IPP isomerase genes when compared to colonies that lack this additional IPP isomerase gene. Similarly, a vector comprising an IPP isomerase gene can be used to enhance production of any secondary metabolite of dimethylallyl pyrophosphate (such as isoprenoids, steroids, carotenoids, etc.).
Alternatively, an anti-sense strand of one of the above genes can be inserted into a vector. For example, the .epsilon.-cyclase gene can be inserted into a vector and incorporated into the genomic DNA of a host, thereby inhibiting the synthesis of .epsilon.,.beta. carotenoids (lutein and .alpha.-carotene) and enhancing the synthesis of .beta.,.beta. carotenoids (zeaxanthin and .beta.-carotene).
Suitable vectors according to the present invention comprise a eukaryotic gene encoding an enzyme involved in carotenoid biosynthesis or metabolism and a suitable promoter for the host can be constructed using techniques well known in the art (for example Sambrook et al., Molecular Cloning A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., 1989).
Suitable vectors for eukaryotic expression in plants are described in Frey et al., Plant J. (1995) 8(5):693 and Misawa et al, 1994a; incorporated herein by reference.
Suitable vectors for prokaryotic expression include pACYC184, pUC119, and pBR322 (available from New England BioLabs, Beverly, Mass.) and pTreHis (Invitrogen) and pET28 (Novagene) and derivatives thereof.
The vectors of the present invention can additionally contain regulatory elements such as promoters, repressors selectable markers such as antibiotic resistance genes, etc.
Hosts
Host systems according to the present invention can comprise any organism that already produces carotenoids or which has been genetically modified to produce carotenoids. The IPP isomerase genes are more broadly applicable for enhancing production of any product dependent on DMAPP as aprecursor.
Organisms which already produce carotenoids include plants, algae, some yeasts, fungi and cyanobacteria and other photosynthetic bacteria. Transformation of these hosts with vectors according to the present invention can be done using standard techniques such as those described inMisawa et al., (1990) supra; Hundle et al., (1991) Photchem. Photobiol. 54,89-93; both incorporated herein by reference.
Alternatively, transgenic organisms can be constructed which include the DNA sequences of the present invention (Bird et al, 1991; Bramley et al, 1992; Misawa et al, 1994a; Misawa et al, 1994b; Cunningham et al, 1993). The incorporation of these sequences can allow the controlling of carotenoid biosynthesis, content, or composition in the host cell. These transgenic systems can be constructed to incorporate sequences which allowover-expression of the carotenoid genes of the present invention. Transgenic systems can also be constructed containing antisense expressionof the DNA sequences of the present invention. Such antisense expression would result in the accumulation of the substrates of the substrates of the enzyme encoded by the sense strand.
A method for screening for eukaryotic genes which encode enzymes involved in carotenoid biosynthesis
The method of the present invention comprises transforming a prokaryotic host with a DNA which may contain a eukaryotic or prokaryotic carotenoid biosynthetic gene; culturing said transformed host to obtain colonies; andscreening for colonies exhibiting a different color than colonies of the untransformed host.
Suitable hosts include E. coli, cyanobacteria such as Synechococcus and Synechocystis, alga and plant cells. E. coli are preferred.
In a preferred embodiment, the above "color complementation test" can be enhanced by using mutants which are either (1) deficient in at least one carotenoid biosynthetic gene or (2) overexpress at least one carotenoid biosynthetic gene. In either case, such mutants will accumulate carotenoidprecursors.
Prokaryotic and eukaryotic DNA libraries can be screened in total for the presence of genes of carotenoid biosynthesis, metabolism and degradation. Preferred organisms to be screened include photosynthetic organisms.
E. coli can be transformed with these eukaryotic cDNA libraries using conventional methods such as those described in Sambrook et al, 1989 and according to protocols described by the venders of the cloning vectors.
For example, the cDNA libraries in bacteriophage vectors such as lambdaZAP (Stratagene) or lambdaZIPOLOX (Gibco BRL) can be excised en masse and usedto transform E. coli can be inserted into suitable vectors and these vectors can the be used to transform E. coli. Suitable vectors include pACYC184, pUC119, pBR322 (available from New England BioLabs, Beverly, Mass.). pACYC is preferred.
Transformed E. coli can be cultured using conventional techniques. The culture broth preferably contains antibiotics to select and maintain plasmids. Suitable antibiotics include penicillin, ampicillin, chloramphenicol, etc. Culturing is typically conducted at 20.degree.-40.degree. C., preferably at room temperature (20.degree.-25.degree. C.), for 12 hours to 7 days.
Cultures are plated and the plates are screened visually for colonies with a different color than the colonies of the untransformed host E. coli. Forexample, E. coli transformed with the plasmid, pAC-BETA (described below), produce yellow colonies that accumulate .beta.-carotene. After transformation with a cDNA library, colonies which contain a different huethan those formed by E. coli/pAC-BETA would be expected to contain enzymes which modify the structure or degree of expression of .beta.-carotene. Similar standards can be engineered which overexpress earlier products in carotenoid biosynthesis, such as lycopene, .gamma.-carotene, etc.
Having generally described this invention, a further understanding can be obtained by reference to certain specific examples which are provided herein for purposes of illustration only and are not intended to be limiting unless otherwise specified.
EXAMPLE
I. Isolation of .beta.-carotene hydroxylase
Plasmid Construction
An 8.6 kb BglII fragment containing the carotenoid biosynthetic genes of Erwinia herbicola was first cloned in the BamHI site of plasmid vector pACYC184 (chloramphenicol resistant), and then a 1.1 kb BamHI fragment containing the .beta.-carotene hydroxylase (CrtZ) was deleted. The resulting plasmid, pAC-BETA, contains all the genes for the formation of .beta.-carotene. E. coli strains containing this plasmid accumulate .beta.-carotene and form yellow colonies (Cunningham et al., 1994).
A full length gene encoding IPP isomerase of Haematococcus pluvialis (HP04)was first cut out with BamHI-KpnI from pBluescript SK+, and then cloned into a pTrcHisA vector with high-level expression from the trc promoter (Invitrogen Inc.). A fragment containing the IPP isomerase and trc promoter was excised with EcoRV-KpnI and cloned in HindIII site of PAC-BETA. E. coli cells transformed with this new plasmid pAC-BETA-04 formorange (deep yellow) colonies on LB plates and accumulate more .beta.-carotene than cells that contain pAC-BETA.
Screening of the Arabidopsis cDNA Library
Several .lambda. cDNA expression libraries of Arabidopsis were obtained from the Arabidopsis Biological Resource Center (Ohio State University, Columbus, Ohio) (Kieber et al., 1993). The .lambda. cDNA libraries were excised in vivo using Stratagene's ExAssist SOLR system to produce a phagemid cDNA library wherein each clone also contained an amphicillin.
E. coli strain DH10BZIP was chosen as the host cells for the screening and pigment production. DH10B cells were transformed with plasmid pAC-BETA-04 and were plated on LB agar plates containing chloramphenicol at 50 .mu.g/ml (from United States Biochemical Corporation). The phagemid Arabidopsis cDNA library was then introduced into DH10B cells already containing pAC-BETA-04. Transformed cells containing both pAC-BETA-04 and Arabidopsis cDNA were selected on chloramphenicol plus ampicillin (150 .mu.g/ml) agar plates. Maximum color development occurred after 5 days incubation at room temperature, and lighter yellow colonies were selected.Selected colonies were inoculated into 3 ml liquid LB medium containing ampicillin and chloramphenicol, and cultures were incubated. Cells were then pelleted and extracted in 80 .mu.l 100% acetone in microfuge tubes. After centrifugation, pigmented supernatant was spotted on silica gel thin-layer chromatography (TLC) plates, and developed with a hexane; ether(1:1) solvent system. .beta.-carotene hydroxylase clones were identified based on the appearance of zeaxanthin on TLC plate.
Subcloning and Sequencing
The .beta.-carotene hydroxylase cDNA was isolated by standard procedures (Sambrook et al., 1989). Restriction maps showed that three independent inserts (1.9 kb, 0.9 kb and 0.8 kb) existed in the cDNA. To determine which cDNA insert confers the .beta.-carotene hydroxylase activity, plasmid DNA was digested with NotI (a site in the adaptor of the cDNA library) and three inserts were subcloned into NotI site of SK vectors. These subclones were used to transform E. coli cells containing pAC-BETA-04 again to test the hydroxylase activity. A fragment of 0.95 kb,later shown to contain the hydroxylase gene, was also blunt-ended and cloned into pTrcHis A,B,C vectors. To remove the N terminal sequence, a restriction site (BglII) was used that lies just before the conserved sequence with bacterial genes. A BglII-XhoI fragment was directionally cloned in BamHI-XhoI digested trc vectors. Functional clones were identified by the color complementation test. A .beta.-carotene hydroxylase enzyme produces a colony with a lighter yellow color than is found in cells containing pAC-BETA-04 alone.
Arabidopsis .beta.-carotene hydroxylase was sequenced completely on both strands on an automatic sequencer (Applied Biosystems, Model 373A, Version2.0.1S).
Pigment Analysis
A single colony was used to inoculate 50 ml of LB containing ampicillin andchloramphenicol in a 250-ml flask. Cultures were incubated at 28.degree. C.for 36 hours with gentle shaking, and then harvested at 5000 rpm in an SS-34 rotor. The cells were washed once with distilled H.sub.2 O and resuspended with 0.5 ml of water. The extraction procedures and HPLC were essentially as described previously (Cunningham et al, 1994).
II. Isolation of .epsilon. cyclase
Plasmid Construction
Construction of plasmids PAC-LYC, PAC-NEUR, and pAC-ZETA is described in Cunningham et al., (1994). In brief, the appropriate carotenoid biosynthetic genes from Erwinia herbicola, Rhodobacter capsulatus, and Synechococcus sp. strain PCC7942 were cloned in the plasmid vector pACYC184 (New England BioLabs, Beverly, Mass.). Cultures of E. coli containing the plasmids PAC-ZETA, pAC-NEUR, and pAC-LYC, accumulate .zeta.-carotene, neurosporene, and lycopene, respectively. The plasmid PAC-ZETA was constructed as follows: an 8.6-kb BglII fragment containing the carotenoid biosynthetic genes of E. herbicola (GenBank M87280; Hundle et al., 1991) was obtained after partial digestion of plasmid pPL376 (Perry et al., 1986; Tuveson et al., 1986) and cloned in the BamHI site ofpACYC184 to give the plasmid pAC-EHER. Deletion of adjacent 0.8- and 1.1-kbBamHI-BamHI fragments (deletion Z in Cunningham et al., 1994), and of a 1.1kB SalI-SalI fragment (deletion X) served to remove most of the coding regions for the E. herbicola .beta.-carotene hydroxylase (crt gene) and zeaxanthin glucosyltransferase (crtX gene), respectively. The resulting plasmid, pAC-BETA, retains functional genes for geranylgeranyl pyrophosphate synthase (crtE), phytoene synthase (crtB), phytoene desaturase (crtI), and lycopene cyclase (crtY). Cells of E. coli containing this plasmid form yellow colonies and accumulate .beta.-carotene. A plasmid containing both the .epsilon.- and .beta.-cyclase cDNAs of A. thaliana was constructed by excising the .epsilon. in clone y2 as a PvuI-PvuII fragment and ligating this piece in the SnaBI site of a plasmid (pSPORT 1 from GIBCO-BRL) that already contained the .beta. cyclase.
Organisms and Growth Conditions
E. coli strains TOP10 and TOP10 F' (obtained from Invitrogen Corporation, San Diego, Calif.) and XL1-Blue (Stratagene) were grown in Luria-Bertani (LB) medium (Sambrook et al., 1989) at 37.degree. C. in darkness on a platform shaker at 225 cycles per min. Media components were from Difco (yeast extract and tryptone) or Sigma (NaCl). Ampicillin at 150 .mu.g/mL and/or chloramphenicol at 50 .mu./mL (both from United States Biochemical Corporation) were used, as appropriate, for selection and maintenance of plasmids.
Mass Excision and Color Complementation Screening of an A. thaliana cDNA Library
A size-fractionated 1-2 kB cDNA library of A. thaliana in lambda ZAPII (Kieber et al., 1993) was obtained from the Arabidopsis Biological Resource Center at The Ohio State University (stock number CD4-14). Other size fractionated libraries were also obtained (stock numbers CD4-13, CD4-15, and CD4-16). An aliquot of each library was treated to cause a mass excision of the cDNAs and thereby produce a phagemid library according to the instructions provided by the supplier of the cloning vector (Stratagene; E. coli strain XL1-Blue and the helper phage R408 wereused). The titre of the excised phagemid was determined and the library wasintroduced into a lycopene-accumulating strain of E. coli TOP10 F' (this strain contained the plasmid pAC-LYC) by incubation of the phagemid with the E. coli cells for 15 min at 37.degree. C. Cells had been grown overnight at 30.degree. C. in LB medium supplemented with 2% (w/v) maltoseand 10 mM MgSO.sub.4 (final concentration), and harvested in 1.5 ml microfuge tubes at a setting of 3 on an Eppendorf microfuge (5415C) for 10min. The pellets were resuspended in 10 mM MgSO.sub.4 to a volume equal to one-half that of the initial culture volume. Transformants were spread on large (150 mm diameter) LB agar petri plates containing antibiotics to provide for selection of cDNA clones (ampicillin) and maintenance of pAC-LYC (chloramphenicol). Approximately 10,000 colony forming units were spread on each plate. Petri plates were incubated at 37.degree. C. for 16 hr and then at room temperature for 2 to 7 days to allow maximum color development. Plates were screened visually with the aid of an illuminated 3.times. magnifier and a low power stage-dissecting microscope for the rare, pale pinkish-yellow to deep-yellow colonies that could be observed in the background of pink colonies. A colony color of yellow or pinkish-yellow was taken as presumptive evidence of a cyclization activity. These yellow colonies were collected with sterile toothpicks andused to inoculate 3 ml of LB medium in culture tubes with overnight growth at 37.degree. C. and shaking at 225 cycles/min. Cultures were split into two aliquots in microfuge tubes and harvested by centrifugation at a setting of 5 in an Eppendorf 5415C microfuge. After discarding the liquid,one pellet was frozen for later purification of plasmid DNA. To the second pellet was added 1.5 ml EtOH, and the pellet was resuspended by vortex mixing, and extraction was allowed to proceed in the dark for 15-30 min with occasional remixing. Insoluble materials were pelleted by centrifugation at maximum speed for 10 min in a microfuge. Absorption spectra of the supernatant fluids were recorded from 350-550 nm with a Perkin Elmer lambda six spectrophotometer.
Analysis of isolated clones
Eight of the yellow colonies contained .beta.-carotene indicating that a single gene product catalyzes both cyclizations required to form the two .beta. endgroups of the symmetrical .beta.-carotene from the symmetrical precursor lycopene. One of the yellow colonies contained a pigment with the spectrum characteristic of .delta.-carotene, a monocyclic carotenoid with a single .epsilon. endgroup. Unlike the .beta. cyclase, this .epsilon. cyclase appears unable to carry out a second cyclization at the other end of the molecule.
The observation that .epsilon. is unable to form two cyclic .epsilon. endgroups (e.g. the bicyclic .epsilon.-carotene) illuminates the mechanismby which plants can coordinate and control the flow of substrate into carotenoids derived from .beta.-carotene versus those derived from .alpha.-carotene and also can prevent the formation of carotenoids with two .epsilon. endgroups.
The availability of the A. thaliana gene encoding the .epsilon. cyclase enables the directed manipulation of plant and algal species for modification of carotenoid content and composition. Through inactivation of the .epsilon. cyclase, whether at the gene level by deletion of the gene or by insertional inactivation or by reduction of the amount of enzyme formed (by such as antisense technology), one may increase the formation of .beta.-carotene and other pigments derived from it. Since vitamin A is derived only from carotenoids with .beta. endgroups, an enhancement of the production of .beta.-carotene versus .alpha.-carotene may enhance nutritional value of crop plants. Reduction of carotenoids with e endgroups may also be of value in modifying the color properties ofcrop plants and specific tissues of these plants. Alternatively, where production of .alpha.-carotene, or pigments such as lutein that are derived from .alpha.-carotene, is desirable, whether for the color properties, nutritional value or other reason, one may overexpress the .epsilon. or express it in specific tissues. Wherever agronomic value of acrop is related to pigmentation provided by carotenoid pigments the directed manipulation of expression of the .epsilon. gene and/or production of the enzyme may be of commercial value.
The predicted amino acid sequence of the A. thaliana .epsilon. cyclase enzyme was determined. A comparison of the amino acid sequences of the .beta. and .epsilon. enzymes of Arabidopsis thaliana (FIG. 13) as predicted by the DNA sequence of the respective genes (FIG. 4 for the .epsilon. cDNA sequence), indicates that these two enzymes have many regions of sequence similarity, but they are only about 37% identical overall at the amino acid level. The degree of sequence identity at the DNA base level, only about 50%, is sufficiently low such that we and others have been unable to detect this gene by hybridization using the .beta. cyclase as a probe in DNA gel blot experiments.
REFERENCES
Bird et al, (1991) Biotechnology 9, 635-639.
Bishop et al., (1995) FEBS Lett. 367, 158-162.
Bramley, P. M. (1985) Adv. Lipid Res. 21, 243-279.
Bramley, P. M. (1992) Plant J. 2, 343-349.
Britton, G. (1988). Biosynthesis of carotenoids. In Plant Pigments, T. W. Goodwin, ed. (London: Academic Press), pp. 133-182.
Britton, G. (1979) Z. Naturforsch. Section C Biosci. 34, 979-985.
Britton, G. (1995) UV/Visible spectroscopy. In Carotenoids, Vol. IB: Spectroscopy, G. Britton, S. Liaaen-Jensen, H. P. Pfander, eds. (Basel: Birkhauser Verlag), pp. 13-62.
Bouvier et al., (1994) Plant J. 6, 45-54.
Cunningham et al., (1985) Photochem. Photobiol. 42: 295-307
Cunningham et al., (1993) FEBS Lett. 328, 130-138.
Cunningham et al., (1994) Plant Cell 6, 1107-1121.
Davies, B. H. (1976). Carotenoids. In Chemistry and Biochemistry of Plant Pigments, Vol. 2, T. W. Goodwin, ed (New York: Academic Press), pp. 38-165.
Del Sal et al., (1988). Nucl. Acids Res. 16, 9878.
Demmig-Adams & Adams, (1992) Ann. Rev. Plant Physiol. Mol. Biol. 43, 599-626.
Enzell & Back, (1995) Mass spectrometry. In Carotenoids, Vol. IB: Spectroscopy, G. Britton, S. Liaaen-Jensen, H. P. Pfander, eds. (Basel: Birkhauser Verlag), pp. 261-320.
Frank & Cogdell (1993) Photochemistry and function of carotenoids in photosynthesis. In Carotenoids in Photosynthesis. A. Young and G. Britton,eds. (London: Chapman and Hall). pp. 253-326.
Goodwin, T. W. (1980). The Biochemistry of the Carotenoids. 2nd ed, Vol. 1 (London: Chapman and Hall.
Horvath et al., (1972) Phytochem. 11, 183-187.
Hugueney et al., (1995) Plant J. 8, 417-424.
Hundle et al., (1991) Photochem. Photobiol. 54, 89-93.
Jensen & Jensen, (1971) Methods Enzymol. 23, 586-602.
Kargl & Quackenbush, (1960) Archives Biochem. Biophys. 88, 59-63.
Kargl et al., (1960) Proc. Am. Hort. Soc. 75, 574-578.
Kieber et al., (1993) Cell 72, 427-441.
Koyama, Y. (1991) J. Photochem. Photobiol., B, 9, 265-80.
Krinsky, N. I. (1987) Medical uses of carotenoids. In Carotenoids, N. I. Krinsky, M. M. Mathews-Roth, and R. F. Taylor, eds. (New York: Plenum), pp. 195-206.
Kyte & Doolittle, (1982) J. Mol. Biol. 157, 105-132.
LaRossa & Schloss, (1984) J. Biol. Chem. 259, 8753-8757.
Misawa et al., (1994a) Plant J. 6, 481-489.
Misawa et al., (1994b) J. Biochem, Tokyo, 116, 980-985.
Norris et al., (1995) Plant Cell 7, 2139-2149.
Pecker et al., (1996) Submitted to Plant Mol. Biol.
Perry et al., (1986) J. Bacteriol. 168, 607-612.
Persson & Argos, (1994) J. Mol. Biol. 237, 182-192.
Plumley & Schmidt, (1987) Proc. Nat. Acad. Sci. USA 83, 146-150.
Plumley & Schmidt, (1995) Plant Cell 7, 689-704.
Rossmann et al., (1974) Nature 250, 194-199.
Rock & Zeevaart (1991) Proc. Nat. Acad. Sci. USA 88, 7496-7499.
Rost et al., (1995) Protein Science 4, 521-533.
Sambrook et al., (1989) Molecular Cloning: A Laboratory Manual, 2nd edition(Cold Spring Harbor, N.Y.: Cold Spring Harbor Laboratory Press).
Sancar, A. (1994) Biochemistry 33, 2-9.
Sander & Schneider, (1991) Proteins 9, 56-68.
Sandmann, G. (1994) Eur. J. Biochem. 223, 7-24.
Scolnik & Bartley, (1995) Plant Physiol. 108, 1342.
Siefermann-Harms, D. (1987) Physiol. Plant. 69, 561-568.
Spurgeon & Porter, (1980). Biosynthesis of carotenoids. In Biochemistry of Isoprenoid Compounds, J. W. Porter, and S. L. Spurgeon, eds. (New York: Wiley), pp. 1-122.
Tomes, M. L. (1963) Bot. Gaz. 124, 180-185.
Tomes, M. L. (1967) Genetics 56, 227-232.
Tuveson et al., (1986) J. Bacteriol. 170, 4675-4680.
Van Beeumen et al., (1991) J. Biol. Chem. 266, 12921-12931.
Weedon & Moss, (1995) Structure and Nomenclature. In Carotenoids, Vol. IB: Spectroscopy, G. Britton, S. Liaaen-Jensen, H. P. Pfander, eds. (Basel: Birkhauser Verlag), pp. 27-70.
Wierenga et al., (1986) J. Mol. Biol. 187, 101-107.
Zechmeister, L. (1962) Cis-Trans Isomeric Carotenoids, Vitamins A and Arylpolyenes. Springer-Verlag, Vienna.
Having now fully described the invention, it will be apparent to one of ordinary skill in the art that many changes and modifications can be made thereto without departing from the spirit or scope of the invention as setforth herein.
__________________________________________________________________________SEQUENCE LISTING(1) GENERAL INFORMATION:(iii) NUMBER OF SEQUENCES: 21(2) INFORMATION FOR SEQ ID NO:1:(i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 1860 base pairs(B) TYPE: nucleic acid(C) STRANDEDNESS: single(D) TOPOLOGY: linear(ii) MOLECULE TYPE: cDNA(ix) FEATURE:(A) NAME/KEY: CDS(B) LOCATION: 109..1680(D) OTHER INFORMATION: /product="E-CYCLASE FROM A.THALIANA"(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:ACAAAAGGAAATAATTAGATTCCTCTTTCTGCTTGCTATACCTTGATAGAACAATATAAC60AATGGTGTAAGTCTTCTCGCTGTATTCGAAATTATTTGGAGGAGGAAAATGGAGTGT117MetGluCysGTTGGGGCTAGGAATTTCGCAGCAATGGCGGTTTCAACATTTCCGTCA165ValGlyAlaArgAsnPheAlaAlaMetAlaValSerThrPheProSer51015TGGAGTTGTCGAAGGAAATTTCCAGTGGTTAAGAGATACAGCTATAGG213TrpSerCysArgArgLysPheProValValLysArgTyrSerTyrArg20253035AATATTCGTTTCGGTTTGTGTAGTGTCAGAGCTAGCGGCGGCGGAAGT261AsnIleArgPheGlyLeuCysSerValArgAlaSerGlyGlyGlySer404550TCCGGTAGTGAGAGTTGTGTAGCGGTGAGAGAAGATTTCGCTGACGAA309SerGlySerGluSerCysValAlaValArgGluAspPheAlaAspGlu556065GAAGATTTTGTGAAAGCTGGTGGTTCTGAGATTCTATTTGTTCAAATG357GluAspPheValLysAlaGlyGlySerGluIleLeuPheValGlnMet707580CAGCAGAACAAAGATATGGATGAACAGTCTAAGCTTGTTGATAAGTTG405GlnGlnAsnLysAspMetAspGluGlnSerLysLeuValAspLysLeu859095CCTCCTATATCAATTGGTGATGGTGCTTTGGATCATGTGGTTATTGGT453ProProIleSerIleGlyAspGlyAlaLeuAspHisValValIleGly100105110115TGTGGTCCTGCTGGTTTAGCCTTGGCTGCAGAATCAGCTAAGCTTGGA501CysGlyProAlaGlyLeuAlaLeuAlaAlaGluSerAlaLysLeuGly120125130TTAAAAGTTGGACTCATTGGTCCAGATCTTCCTTTTACTAACAATTAC549LeuLysValGlyLeuIleGlyProAspLeuProPheThrAsnAsnTyr135140145GGTGTTTGGGAAGATGAATTCAATGATCTTGGGCTGCAAAAATGTATT597GlyValTrpGluAspGluPheAsnAspLeuGlyLeuGlnLysCysIle150155160GAGCATGTTTGGAGAGAGACTATTGTGTATCTGGATGATGACAAGCCT645GluHisValTrpArgGluThrIleValTyrLeuAspAspAspLysPro165170175ATTACCATTGGCCGTGCTTATGGAAGAGTTAGTCGACGTTTGCTCCAT693IleThrIleGlyArgAlaTyrGlyArgValSerArgArgLeuLeuHis180185190195GAGGAGCTTTTGAGGAGGTGTGTCGAGTCAGGTGTCTCGTACCTTAGC741GluGluLeuLeuArgArgCysValGluSerGlyValSerTyrLeuSer200205210TCGAAAGTTGACAGCATAACAGAAGCTTCTGATGGCCTTAGACTTGTT789SerLysValAspSerIleThrGluAlaSerAspGlyLeuArgLeuVal215220225GCTTGTGACGACAATAACGTCATTCCCTGCAGGCTTGCCACTGTTGCT837AlaCysAspAspAsnAsnValIleProCysArgLeuAlaThrValAla230235240TCTGGAGCAGCTTCGGGAAAGCTCTTGCAATACGAAGTTGGTGGACCT885SerGlyAlaAlaSerGlyLysLeuLeuGlnTyrGluValGlyGlyPro245250255AGAGTCTGTGTGCAAACTGCATACGGCGTGGAGGTTGAGGTGGAAAAT933ArgValCysValGlnThrAlaTyrGlyValGluValGluValGluAsn260265270275AGTCCATATGATCCAGATCAAATGGTTTTCATGGATTACAGAGATTAT981SerProTyrAspProAspGlnMetValPheMetAspTyrArgAspTyr280285290ACTAACGAGAAAGTTCGGAGCTTAGAAGCTGAGTATCCAACGTTTCTG1029ThrAsnGluLysValArgSerLeuGluAlaGluTyrProThrPheLeu295300305TACGCCATGCCTATGACAAAGTCAAGACTCTTCTTCGAGGAGACATGT1077TyrAlaMetProMetThrLysSerArgLeuPhePheGluGluThrCys310315320TTGGCCTCAAAAGATGTCATGCCCTTTGATTTGCTAAAAACGAAGCTC1125LeuAlaSerLysAspValMetProPheAspLeuLeuLysThrLysLeu325330335ATGTTAAGATTAGATACACTCGGAATTCGAATTCTAAAGACTTACGAA1173MetLeuArgLeuAspThrLeuGlyIleArgIleLeuLysThrTyrGlu340345350355GAGGAGTGGTCCTATATCCCAGTTGGTGGTTCCTTGCCAAACACCGAA1221GluGluTrpSerTyrIleProValGlyGlySerLeuProAsnThrGlu360365370CAAAAGAATCTCGCCTTTGGTGCTGCCGCTAGCATGGTACATCCCGCA1269GlnLysAsnLeuAlaPheGlyAlaAlaAlaSerMetValHisProAla375380385ACAGGCTATTCAGTTGTGAGATCTTTGTCTGAAGCTCCAAAATATGCA1317ThrGlyTyrSerValValArgSerLeuSerGluAlaProLysTyrAla390395400TCAGTCATCGCAGAGATACTAAGAGAAGAGACTACCAAACAGATCAAC1365SerValIleAlaGluIleLeuArgGluGluThrThrLysGlnIleAsn405410415AGTAATATTTCAAGACAAGCTTGGGATACTTTATGGCCACCAGAAAGG1413SerAsnIleSerArgGlnAlaTrpAspThrLeuTrpProProGluArg420425430435AAAAGACAGAGAGCATTCTTTCTCTTTGGTCTTGCACTCATAGTTCAA1461LysArgGlnArgAlaPhePheLeuPheGlyLeuAlaLeuIleValGln440445450TTCGATACCGAAGGCATTAGAAGCTTCTTCCGTACTTTCTTCCGCCTT1509PheAspThrGluGlyIleArgSerPhePheArgThrPhePheArgLeu455460465CCAAAATGGATGTGGCAAGGGTTTCTAGGATCAACATTAACATCAGGA1557ProLysTrpMetTrpGlnGlyPheLeuGlySerThrLeuThrSerGly470475480GATCTCGTTCTCTTTGCTTTATACATGTTCGTCATTTCACCAAACAAT1605AspLeuValLeuPheAlaLeuTyrMetPheValIleSerProAsnAsn485490495TTGAGAAAAGGTCTCATCAATCATCTCATCTCTGATCCAACCGGAGCA1653LeuArgLysGlyLeuIleAsnHisLeuIleSerAspProThrGlyAla500505510515ACCATGATAAAAACCTATCTCAAAGTATGATTTACTTATCAACTCTT1700ThrMetIleLysThrTyrLeuLysVal520AGGTTTGTGTATATATATGTTGATTTATCTGAATAATCGATCAAAGAATGGTATGTGGGT1760TACTAGGAAGTTGGAAACAAACATGTATAGAATCTAAGGAGTGATCGAAATGGAGATGGA1820AACGAAAAGAAAAAAATCAGTCTTTGTTTTGTGGTTAGTG1860(2) INFORMATION FOR SEQ ID NO:2:(i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 524 amino acids(B) TYPE: amino acid(D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:MetGluCysValGlyAlaArgAsnPheAlaAlaMetAlaValSerThr151015PheProSerTrpSerCysArgArgLysPheProValValLysArgTyr202530SerTyrArgAsnIleArgPheGlyLeuCysSerValArgAlaSerGly354045GlyGlySerSerGlySerGluSerCysValAlaValArgGluAspPhe505560AlaAspGluGluAspPheValLysAlaGlyGlySerGluIleLeuPhe65707580ValGlnMetGlnGlnAsnLysAspMetAspGluGlnSerLysLeuVal859095AspLysLeuProProIleSerIleGlyAspGlyAlaLeuAspHisVal100105110ValIleGlyCysGlyProAlaGlyLeuAlaLeuAlaAlaGluSerAla115120125LysLeuGlyLeuLysValGlyLeuIleGlyProAspLeuProPheThr130135140AsnAsnTyrGlyValTrpGluAspGluPheAsnAspLeuGlyLeuGln145150155160LysCysIleGluHisValTrpArgGluThrIleValTyrLeuAspAsp165170175AspLysProIleThrIleGlyArgAlaTyrGlyArgValSerArgArg180185190LeuLeuHisGluGluLeuLeuArgArgCysValGluSerGlyValSer195200205TyrLeuSerSerLysValAspSerIleThrGluAlaSerAspGlyLeu210215220ArgLeuValAlaCysAspAspAsnAsnValIleProCysArgLeuAla225230235240ThrValAlaSerGlyAlaAlaSerGlyLysLeuLeuGlnTyrGluVal245250255GlyGlyProArgValCysValGlnThrAlaTyrGlyValGluValGlu260265270ValGluAsnSerProTyrAspProAspGlnMetValPheMetAspTyr275280285ArgAspTyrThrAsnGluLysValArgSerLeuGluAlaGluTyrPro290295300ThrPheLeuTyrAlaMetProMetThrLysSerArgLeuPhePheGlu305310315320GluThrCysLeuAlaSerLysAspValMetProPheAspLeuLeuLys325330335ThrLysLeuMetLeuArgLeuAspThrLeuGlyIleArgIleLeuLys340345350ThrTyrGluGluGluTrpSerTyrIleProValGlyGlySerLeuPro355360365AsnThrGluGlnLysAsnLeuAlaPheGlyAlaAlaAlaSerMetVal370375380HisProAlaThrGlyTyrSerValValArgSerLeuSerGluAlaPro385390395400LysTyrAlaSerValIleAlaGluIleLeuArgGluGluThrThrLys405410415GlnIleAsnSerAsnIleSerArgGlnAlaTrpAspThrLeuTrpPro420425430ProGluArgLysArgGlnArgAlaPhePheLeuPheGlyLeuAlaLeu435440445IleValGlnPheAspThrGluGlyIleArgSerPhePheArgThrPhe450455460PheArgLeuProLysTrpMetTrpGlnGlyPheLeuGlySerThrLeu465470475480ThrSerGlyAspLeuValLeuPheAlaLeuTyrMetPheValIleSer485490495ProAsnAsnLeuArgLysGlyLeuIleAsnHisLeuIleSerAspPro500505510ThrGlyAlaThrMetIleLysThrTyrLeuLysVal515520(2) INFORMATION FOR SEQ ID NO:3:(i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 956 base pairs(B) TYPE: nucleic acid(C) STRANDEDNESS: single(D) TOPOLOGY: linear(ii) MOLECULE TYPE: cDNA(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:GCTCTTTCTCCTCCTCCTCTACCGATTTCCGACTCCGCCTCCCGAAATCCTTATCCGGAT60TCTCTCCGTCTCTTCGATTTAAACGCTTTTCTGTCTGTTACGTCGTCGAAGAACGGAGAC120AGAATTCTCCGATTGAGAACGATGAGAGACCGGAGAGCACGAGCTCCACAAACGCTATAG180ACGCTGAGTATCTGGCGTTGCGTTTGGCGGAGAAATTGGAGAGGAAGAAATCGGAGAGGT240CCACTTATCTAATCGCTGCTATGTTGTCGAGCTTTGGTATCACTTCTATGGCTGTTATGG300CTGTTTACTACAGATTCTCTTGGCAAATGGAGGGAGGTGAGATCTCAATGTTGGAAATGT360TTGGTACATTTGCTCTCTCTGTTGGTGCTGCTGTTGGTATGGAATTCTGGGCAAGATGGG420CTCATAGAGCTCTGTGGCACGCTTCTCTATGGAATATGCATGAGTCACATCACAAACCAA480GAGAAGGACCGTTTGAGCTAAACGATGTTTTTGCTATAGTGAACGCTGGTCCAGCGATTG540GTCTCCTCTCTTATGGATTCTTCAATAAAGGACTCGTTCCTGGTCTCTGCTTTGGCGCCG600GGTTAGGCATAACGGTGTTTGGAATCGCCTACATGTTTGTCCACGATGGTCTCGTGCACA660AGCGTTTCCCTGTAGGTCCCATCGCCGACGTCCCTTACCTCCGAAAGGTCGCCGCCGCTC720ACCAGCTACATCACACAGACAAGTTCAATGGTGTACCATATGGACTGTTTCTTGGACCCA780AGGAATTGGAAGAAGTTGGAGGAAATGAAGAGTTAGATAAGGAGATTAGTCGGAGAATCA840AATCATACAAAAAGGCCTCGGGCTCCGGGTCGAGTTCGAGTTCTTGACTTTAAACAAGTT900TTAAATCCCAAATTCTTTTTTTGTCTTCTGTCATTATGATCATCTTAAGACGGTCT956(2) INFORMATION FOR SEQ ID NO:4:(i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 294 amino acids(B) TYPE: amino acid(C) STRANDEDNESS: single(D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:SerPheSerSerSerSerThrAspPheArgLeuArgLeuProLysSer151015LeuSerGlyPheSerProSerLeuArgPheLysArgPheSerValCys202530TyrValValGluGluArgArgGlnAsnSerProIleGluAsnAspGlu354045ArgProGluSerThrSerSerThrAsnAlaIleAspAlaGluTyrLeu505560AlaLeuArgLeuAlaGluLysLeuGluArgLysLysSerGluArgSer65707580ThrTyrLeuIleAlaAlaMetLeuSerSerPheGlyIleThrSerMet859095AlaValMetAlaValTyrTyrArgPheSerTrpGlnMetGluGlyGly100105110GluIleSerMetLeuGluMetPheGlyThrPheAlaLeuSerValGly115120125AlaAlaValGlyMetGluPheTrpAlaArgTrpAlaHisArgAlaLeu130135140TrpHisAlaSerLeuTrpMetAsnHisGluSerHisHisLysProArg145150155160GluGlyProPheGluLeuAsnAspValPheAlaIleValAsnAlaGly165170175ProAlaIleGlyLeuLeuSerTyrGlyPhePheAsnLysGlyLeuVal180185190ProGlyLeuCysPheGlyAlaGlyLeuGlyIleThrValPheGlyIle195200205AlaTyrMetPheValHisAspGlyLeuValHisLysArgPheProVal210215220GlyProIleAlaAspValProTyrLeuArgLysValAlaAlaAlaHis225230235240GlnLeuHisHisThrAspLysPheAsnGlyValProTyrGlyLeuPhe245250255LeuGlyProLysGluLeuGluGluValGlyGlyAsnGluGluLeuAsp260265270LysGluIleSerArgArgIleLysSerTyrLysLysAlaSerGlySer275280285GlySerSerSerSerSer290(2) INFORMATION FOR SEQ ID NO:5:(i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 162 amino acids(B) TYPE: amino acid(C) STRANDEDNESS: single(D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:MetThrGlnPheLeuIleValValAlaThrValLeuValMetGluLeu151015ThrAlaTyrSerValHisArgTrpIleMetHisGlyProLeuGlyTrp202530GlyTrpHisLysSerHisHisGluGluHisAspHisAlaLeuGluLys354045AsnAspLeuTyrGlyValValPheAlaValLeuAlaThrIleLeuPhe505560ThrValGlyAlaTyrTrpTrpProValLeuTrpTrpIleAlaLeuGly65707580MetThrValTyrGlyLeuIleTyrPheIleLeuHisAspGlyLeuVal859095HisGlnArgTrpProPheArgTyrIleProArgArgGlyTyrPheArg100105110ArgLeuTyrGlnAlaHisArgLeuHisHisAlaValGluGlyArgAsp115120125HisCysValSerPheGlyPheIleTyrAlaProProValAspLysLeu130135140LysGlnAspLeuLysArgSerGlyValLeuArgProGlnAspGluArg145150155160ProSer(2) INFORMATION FOR SEQ ID NO:6:(i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 175 amino acids(B) TYPE: amino acid(C) STRANDEDNESS: single(D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:MetLeuAsnSerLeuIleValIleLeuSerValIleAlaMetGluGly151015IleAlaAlaPheThrHisArgTyrIleMetHisGlyTrpGlyTrpArg202530TrpHisGluSerHisHisThrProArgLysGlyValPheGluLeuAsn354045AspLeuPheAlaValValPheAlaGlyValAlaIleAlaLeuIleAla505560ValGlyThrAlaGlyValTrpProLeuGlnTrpIleGlyCysGlyMet65707580ThrValTyrGlyLeuLeuTyrPheLeuValHisAspGlyLeuValHis859095GlnArgTrpProPheHisTrpIleProArgArgGlyTyrLeuLysArg100105110LeuTyrValAlaHisArgLeuHisHisAlaValArgGlyArgGluGly115120125CysValSerPheGlyPheIleTyrAlaArgLysProAlaAspLeuGln130135140AlaIleLeuArgGluArgHisGlyArgProProLysArgAspAlaAla145150155160LysAspArgProAspAlaAlaSerProSerSerSerSerProGlu165170175(2) INFORMATION FOR SEQ ID NO:7:(i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 175 amino acids(B) TYPE: amino acid(C) STRANDEDNESS: single(D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:MetLeuTrpIleTrpAsnAlaLeuIleValPheValThrValIleGly151015MetGluValIleAlaAlaLeuAlaHisLysTyrIleMetHisGlyTrp202530GlyTrpGlyTrpHisLeuSerHisHisGluProArgLysGlyAlaPhe354045GluValAsnAspLeuTyrAlaValValPheAlaAlaLeuSerIleLeu505560LeuIleTyrLeuGlySerThrGlyMetTrpProLeuGlnTrpIleGly65707580AlaGlyMetThrAlaTyrGlyLeuLeuTyrPheMetValHisAspGly859095LeuValHisGlnArgTrpProPheArgTyrIleProArgLysGlyTyr100105110LeuLysArgLeuTyrMetAlaHisArgMetHisHisAlaValArgGly115120125LysGluGlyCysValSerPheGlyPheLeuTyrAlaProProLeuSer130135140LysLeuGlnAlaThrLeuArgGluArgHisGlyAlaArgAlaGlyAla145150155160AlaArgAspAlaGlnGlyGlyGluAspGluProAlaSerGlyLys165170175(2) INFORMATION FOR SEQ ID NO:8:(i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 162 amino acids(B) TYPE: amino acid(C) STRANDEDNESS: single(D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:MetThrAsnPheLeuIleValValAlaThrValLeuValMetGluLeu151015ThrAlaTyrSerValHisArgTrpIleMetHisGlyProLeuGlyTrp202530GlyTrpHisLysSerHisHisGluGluHisAspHisAlaLeuGluLys354045AsnAspLeuTyrGlyLeuValPheAlaValIleAlaThrValLeuPhe505560ThrValGlyTrpIleTrpAlaProValLeuTrpTrpIleAlaLeuGly65707580MetThrValTyrGlyLeuIleTyrPheValLeuHisAspGlyLeuVal859095HisTrpArgTrpProPheArgTyrIleProArgLysGlyTyrAlaArg100105110ArgLeuTyrGlnAlaHisArgLeuHisHisAlaValGluGlyArgAsp115120125HisCysValSerPheGlyPheIleTyrAlaProProValAspLysLeu130135140LysGlnAspLeuLysMetSerGlyValLeuArgAlaGluAlaGlnGlu145150155160ArgThr(2) INFORMATION FOR SEQ ID NO:9:(i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 954 base pairs(B) TYPE: nucleic acid(C) STRANDEDNESS: single(D) TOPOLOGY: linear(ii) MOLECULE TYPE: cDNA(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:CCACGGGTCCGCCTCCCCGTTTTTTTCCGATCCGATCTCCGGTGCCGAGGACTCAGCTGT60TTGTTCGCGCTTTCTCAGCCGTCACCATGACCGATTCTAACGATGCTGGAATGGATGCTG120TTCAGAGACGACTCATGTTTGAAGACGAATGCATTCTCGTTGATGAAAATAATCGTGTGG180TGGGACATGACACTAAGTATAACTGTCATCTGATGGAAAAGATTGAAGCTGAGAATTTAC240TTCACAGAGCTTTCAGTGTGTTTTTATTCAACTCCAAGTATGAGTTGCTTCTCCAGCAAC300GGTCAAAAACAAAGGTTACTTTCCCACTTGTGTGGACAAACACTTGTTGCAGCCATCCTC360TTTACCGTGAATCCGAGCTTATTGAAGAGAATGTGCTTGGTGTAAGAAATGCCGCACAAA420GGAAGCTTTTCGATGAGCTCGGTATTGTAGCAGAAGATGTACCAGTCGATGAGTTCACTC480CCTTGGGACGCATGCTTTACAAGGCACCTTCTGATGGGAAATGGGGAGAGCACGAAGTTG540ACTATCTACTCTTCATCGTGCGGGATGTGAAGCTTCAACCAAACCCAGATGAAGTGGCTG600AGATCAAGTACGTGAGCAGGGAAGAGCTTAAGGAGCTGGTGAAGAAAGCAGATGCTGGCG660ATGAAGCTGTGAAACTATCTCCATGGTTCAGATTGGTGGTGGATAATTTCTTGATGAAGT720GGTGGGATCATGTTGAGAAAGGAACTATCACTGAAGCTGCAGACATGAAAACCATTCACA780AGCTCTGAACTTTCCATAAGTTTTGGATCTTCCCCTTCCCATAATAAAATTAAGAGATGA840GACTTTTATTGATTACAGACAAAACTGGCAACAAAATCTATTCCTAGGATTTTTTTTTGC900TTTTTATTTACTTTTGATTCATCTCTAGTTTAGTTTTCATCTTAAAAAAAAAAA954(2) INFORMATION FOR SEQ ID NO:10:(i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 996 base pairs(B) TYPE: nucleic acid(C) STRANDEDNESS: single(D) TOPOLOGY: linear(ii) MOLECULE TYPE: cDNA(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:CACCAATGTCTGTTTCTTCTTTATTTAATCTCCCATTGATTCGCCTCAGATCTCTCGCTC60TTTCGTCTTCTTTTTCTTCTTTCCGATTTGCCCATCGTCCTCTGTCATCGATTTCACCGA120GAAAGTTACCGAATTTTCGTGCTTTCTCTGGTACCGCTATGACAGATACTAAAGATGCTG180GTATGGATGCTGTTCAGAGACGTCTCATGTTTGAGGATGAATGCATTCTTGTTGATGAAA240CTGATCGTGTTGTGGGGCATGTCAGCAAGTATAATTGTCATCTGATGGAAAATATTGAAG300CCAAGAATTTGCTGCACAGGGCTTTTAGTGTATTTTTATTCAACTCGAAGTATGAGTTGC360TTCTCCAGCAAAGGTCAAACACAAAGGTTACGTTCCCTCTAGTGTGGACTAACACTTGTT420GCAGCCATCCTCTTTACCGTGAATCAGAGCTTATCCAGGACAATGCACTAGGTGTGAGGA480ATGCTGCACAAAGAAAGCTTCTCGATGAGCTTGGTATTGTAGCTGAAGATGTACCAGTCG540ATGAGTTCACTCCCTTGGGACGTATGCTGTACAAGGCTCCTTCTGATGGCAAATGGGGAG600AGCATGAACTTGATTACTTGCTCTTCATCGTGCGAGACGTGAAGGTTCAACCAAACCCAG660ATGAAGTAGCTGAGATCAAGTATGTGAGCCGGGAAGAGCTGAAGGAGCTGGTGAAGAAAG720CAGATGCAGGTGAGGAAGGTTTGAAACTGTCACCATGGTTCAGATTGGTGGTGGACAATT780TCTTGATGAAGTGGTGGGATCATGTTGAGAAAGGAACTTTGGTTGAAGCTATAGACATGA840AAACCATCCACAAACTCTGAACATCTTTTTTTAAAGTTTTTAAATCAATCAACTTTCTCT900TCATCATTTTTATCTTTTCGATGATAATAATTTGGGATATGTGAGACACTTACAAAACTT960CCAAGCACCTCAGGCAATAATAAAGTTTGCGGCCGC996(2) INFORMATION FOR SEQ ID NO:11:(i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 1165 base pairs(B) TYPE: nucleic acid(C) STRANDEDNESS: single(D) TOPOLOGY: linear(ii) MOLECULE TYPE: cDNA(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:CTCGGTAGCTGGCCACAATCGCTATTTGGAACCTGGCCCGGCGGCAGTCCGATGCCGCGA60TGCTTCGTTCGTTGCTCAGAGGCCTCACGCATATCCCCCGCGTGAACTCCGCCCAGCAGC120CCAGCTGTGCACACGCGCGACTCCAGTTTAAGCTCAGGAGCATGCAGATGACGCTCATGC180AGCCCAGCATCTCAGCCAATCTGTCGCGCGCCGAGGACCGCACAGACCACATGAGGGGTG240CAAGCACCTGGGCAGGCGGGCAGTCGCAGGATGAGCTGATGCTGAAGGACGAGTGCATCT300TGGTGGATGTTGAGGACAACATCACAGGCCATGCCAGCAAGCTGGAGTGTCACAAGTTCC360TACCACATCAGCCTGCAGGCCTGCTGCACCGGGCCTTCTCTGTGTTCCTGTTTGACGATC420AGGGGCGACTGCTGCTGCAACAGCGTGCACGCTCAAAAATCACCTTCCCAAGTGTGTGGA480CGAACACCTGCTGCAGCCACCCTTTACATGGGCAGACCCCAGATGAGGTGGACCAACTAA540GCCAGGTGGCCGACGGAACAGTACCTGGCGCAAAGGCTGCTGCCATCCGCAAGTTGGAGC600ACGAGCTGGGGATACCAGCGCACCAGCTGCCGGCAAGCGCGTTTCGCTTCCTCACGCGTT660TGCACTACTGTGCCGCGGACGTGCAGCCAGCTGCGACACAATCAGCGCTCTGGGGCGAGC720ACGAAATGGACTACATCTTGTTCATCCGGGCCAACGTCACCTTGGCGCCCAACCCTGACG780AGGTGGACGAAGTCAGGTACGTGACGCAAGAGGAGCTGCGGCAGATGATGCAGCCGGACA840ACGGGCTGCAATGGTCGCCGTGGTTTCGCATCATCGCCGCGCGCTTCCTTGAGCGTTGGT900GGGCTGACCTGGACGCGGCCCTAAACACTGACAAACACGAGGATTGGGGAACGGTGCATC960ACATCAACGAAGCGTGAAAGCAGAAGCTGCAGGATGTGAAGACACGTCATGGGGTGGAAT1020TGCGTACTTGGCAGCTTCGTATCTCCTTTTTCTGAGACTGAACCTGCAGTCAGGTCCCAC1080AAGGTCAGGTAAAATGGCTCGATAAAATGTACCGTCACTTTTTGTCGCGTATACTGAACT1140CCAAGAGGTCAAAAAAAAAAAAAAA1165(2) INFORMATION FOR SEQ ID NO:12:(i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 1135 base pairs(B) TYPE: nucleic acid(C) STRANDEDNESS: single(D) TOPOLOGY: linear(ii) MOLECULE TYPE: cDNA(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:CTCGGTAGCTGGCCACAATCGCTATTTGGAACCTGGCCCGGCGGCAGTCCGATGCCGCGA60TGCTTCGTTCGTTGCTCAGAGGCCTCACGCATATCCCGCGCGTGAACTCCGCCCAGCAGC120CCAGCTGTGCACACGCGCGACTCCAGTTTAAGCTCAGGAGCATGCAGCTGCTTTCCGAGG180ACCGCACAGACCACATGAGGGGTGCAAGCACCTGGGCAGGCGGGCAGTCGCAGGATGAGC240TGATGCTGAAGGACGAGTGCATCTTGGTAGATGTTGAGGACAACATCACAGGCCATGCCA300GCAAGCTGGAGTGTCACAAGTTCCTACCACATCAGCCTGCAGGCCTGCTGCACCGGGCCT360TCTCTGTGTTCCTGTTTGACGATCAGGGGCGACTGCTGCTGCAACAGCGTGCACGCTCAA420AAATCACCTTCCCAAGTGTGTGGACGAACACCTGCTGCAGCCACCCTTTACATGGGCAGA480CCCCAGATGAGGTGGACCAACTAAGCCAGGTGGCCGACGGAACAGTACCTGGCGCAAAGG540CTGCTGCCATCCGCAAGTTGGAGCACGAGCTGGGGATACCAGCGCACCAGCTGCCGGCAA600GCGCGTTTCGCTTCCTCACGCGTTTGCACTACTGTGCCGCGGACGTGCAGCCAGCTGCGA660CACAATCAGCGCTCTGGGGCGAGCACGAAATGGACTACATCTTGTTCATCCGGGCCAACG720TCACCTTGGCGCCCAACCCTGACGAGGTGGACGAAGTCAGGTACGTGACGCAAGAGGAGC780TGCGGCAGATGATGCAGCCGGACAACGGGCTTCAATGGTCGCCGTGGTTTCGCATCATCG840CCGCGCGCTTCCTTGAGCGTTGGTGGGCTGACCTGGACGCGGCCCTAAACACTGACAAAC900ACGAGGATTGGGGAACGGTGCATCACATCAACGAAGCGTGAAGGCAGAAGCTGCAGGATG960TGAAGACACGTCATGGGGTGGAATTGCGTACTTGGCAGCTTCGTATCTCCTTTTTCTGAG1020ACTGAACCTGCAGAGCTAGAGTCAATGGTGCATCATATTCATCGTCTCTCTTTTGTTTTA1080GACTAATCTGTAGCTAGAGTCACTGATGAATCCTTTACAACTTTCAAAAAAAAAA1135(2) INFORMATION FOR SEQ ID NO:13:(i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 960 base pairs(B) TYPE: nucleic acid(C) STRANDEDNESS: single(D) TOPOLOGY: linear(ii) MOLECULE TYPE: cDNA(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:CCAAAAACAACTCAAATCTCCTCCGTCGCTCTTACTCCGCCATGGGTGACGACTCCGGCA60TGGATGCTGTTCAGCGACGTCTCATGTTTGACGATGAATGCATTTTGGTGGATGAGTGTG120ACAATGTGGTGGGACATGATACCAAATACAATTGTCACTTGATGGAGAAGATTGAAACAG180GTAAAATGCTGCACAGAGCATTCAGCGTTTTTCTATTCAATTCAAAATACGAGTTACTTC240TTCAGCAACGGTCTGCAACCAAGGTGACATTTCCTTTAGTATGGACCAACACCTGTTGCA300GCCATCCACTCTACAGAGAATCCGAGCTTGTTCCCGAAACGCCTGAGAGAATGCTGCACA360GAGGANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN420NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN480NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN540NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN600NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN660NNNNNNNNNNNNNNNNNNNNTCATGTGCAAAAGGGTACACTCACTGAATGCAATTTGATA720TGAAAACCATACACAAGCTGATATAGAAACACACCCTCAACCGAAAAGCAAGCCTAATAA780TTCGGGTTGGGTCGGGTCTACCATCAATTGTTTTTTTCTTTTAACAACTTTTAATCTCTA840TTTGAGCATGTTGATTCTTGTCTTTTGTGTGTAAGATTTTGGGTTTCGTTTCAGTTGTAA900TAATGAACCATTGATGGTTTGCAATTTCAAGTTCCTATCGACATGTAGTGATCTAAAAAA960(2) INFORMATION FOR SEQ ID NO:14:(i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 305 amino acids(B) TYPE: amino acid(C) STRANDEDNESS: single(D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:MetLeuArgSerLeuLeuArgGlyLeuThrHisIleProArgValAsn151015SerAlaGlnGlnProSerCysAlaHisAlaArgLeuGlnPheLysLeu202530ArgSerMetGlnMetThrLeuMetGlnProSerIleSerAlaAsnLeu354045SerArgAlaGluAspArgThrAspHisMetArgGlyAlaSerThrTrp505560AlaGlyGlyGlnSerGlnAspGluLeuMetLeuLysAspGluCysIle65707580LeuValAspValGluAspAsnIleThrGlyHisAlaSerLysLeuGlu859095CysHisLysPheLeuProHisGlnProAlaGlyLeuLeuHisArgAla100105110PheSerValPheLeuPheAspAspGlnGlyArgLeuLeuLeuGlnGln115120125ArgAlaArgSerLysIleThrPheProSerValTrpThrAsnThrCys130135140CysSerHisProLeuHisGlyGlnThrProAspGluValAspGlnLeu145150155160SerGlnValAlaAspGlyThrValProGlyAlaLysAlaAlaAlaIle165170175ArgLysLeuGluHisGluLeuGlyIleProAlaHisGlnLeuProAla180185190SerAlaPheArgPheLeuThrArgLeuHisTyrCysAlaAlaAspVal195200205GlnProAlaAlaThrGlnSerAlaLeuTrpGlyGluHisGluMetAsp210215220TyrIleLeuPheIleArgAlaAsnValThrLeuAlaProAsnProAsp225230235240GluValAspGluValArgTyrValThrGlnGluGluLeuArgGlnMet245250255MetGlnProAspAsnGlyLeuGlnTrpSerProTrpPheArgIleIle260265270AlaAlaArgPheLeuGluArgTrpTrpAlaAspLeuAspAlaAlaLeu275280285AsnThrAspLysHisGluAspTrpGlyThrValHisHisIleAsnGlu290295300Ala305(2) INFORMATION FOR SEQ ID NO:15:(i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 293 amino acids(B) TYPE: amino acid(C) STRANDEDNESS: single(D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:MetLeuArgSerLeuLeuArgGlyLeuThrHisIleProArgValAsn151015SerAlaGlnGlnProSerCysAlaHisAlaArgLeuGlnPheLysLeu202530ArgSerMetGlnLeuLeuSerGluAspArgThrAspHisMetArgGly354045AlaSerThrTrpAlaGlyGlyGlnSerGlnAspGluLeuMetLeuLys505560AspGluCysIleLeuValAspValGluAspAsnIleThrGlyHisAla65707580SerLysLeuGluCysHisLysPheLeuProHisGlnProAlaGlyLeu859095LeuHisArgAlaPheSerValPheLeuPheAspAspGlnGlyArgLeu100105110LeuLeuGlnGlnArgAlaArgSerLysIleThrPheProSerValTrp115120125ThrAsnThrCysCysSerHisProLeuHisGlyGlnThrProAspGlu130135140ValAspGlnLeuSerGlnValAlaAspGlyThrValProGlyAlaLys145150155160AlaAlaAlaIleArgLysLeuGluHisGluLeuGlyIleProAlaHis165170175GlnLeuProAlaSerAlaPheArgPheLeuThrArgLeuHisTyrCys180185190AlaAlaAspValGlnProAlaAlaThrGlnSerAlaLeuTrpGlyGlu195200205HisGluMetAspTyrIleLeuPheIleArgAlaAsnValThrLeuAla210215220ProAsnProAspGluValAspGluValArgTyrValThrGlnGluGlu225230235240LeuArgGlnMetMetGlnProAspAsnGlyLeuGlnTrpSerProTrp245250255PheArgIleIleAlaAlaArgPheLeuGluArgTrpTrpAlaAspLeu260265270AspAlaAlaLeuAsnThrAspLysHisGluAspTrpGlyThrValHis275280285HisIleAsnGluAla290(2) INFORMATION FOR SEQ ID NO:16:(i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 284 amino acids(B) TYPE: amino acid(C) STRANDEDNESS: single(D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:MetSerValSerSerLeuPheAsnLeuProLeuIleArgLeuArgSer151015LeuAlaLeuSerSerSerPheSerSerPheArgPheAlaHisArgPro202530LeuSerSerIleSerProArgLysLeuProAsnPheArgAlaPheSer354045GlyThrAlaMetThrAspThrLysAspAlaGlyMetAspAlaValGln505560ArgArgLeuMetPheGluAspGluCysIleLeuValAspGluThrAsp65707580ArgValValGlyHisValSerLysTyrAsnCysHisLeuMetGluAsn859095IleGluAlaLysAsnLeuLeuHisArgAlaPheSerValPheLeuPhe100105110AsnSerLysTyrGluLeuLeuLeuGlnGlnArgSerAsnThrLysVal115120125ThrPheProLeuValTrpThrAsnThrCysCysSerHisProLeuTyr130135140ArgGluSerGluLeuIleGlnAspAsnAlaLeuGlyValArgAsnAla145150155160AlaGlnArgLysLeuLeuAspGluLeuGlyIleValAlaGluAspVal165170175ProValAspGluPheThrProLeuGlyArgMetLeuTyrLysAlaPro180185190SerAspGlyLysTrpGlyGluHisGluLeuAspTyrLeuLeuPheIle195200205ValArgAspValLysValGlnProAsnProAspGluValAlaGluIle210215220LysTyrValSerArgGluGluLeuLysGluLeuValLysLysAlaAsp225230235240AlaGlyGluGluGlyLeuLysLeuSerProTrpPheArgLeuValVal245250255AspAsnPheLeuMetLysTrpTrpAspHisValGluLysGlyThrLeu260265270ValGluAlaIleAspMetLysThrIleHisLysLeu275280(2) INFORMATION FOR SEQ ID NO:17:(i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 287 amino acids(B) TYPE: amino acid(C) STRANDEDNESS: single(D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:MetSerSerSerMetLeuAsnPheThrAlaSerArgIleValSerLeu151015ProLeuLeuSerSerProProSerArgValHisLeuProLeuCysPhe202530PheSerProIleSerLeuThrGlnArgPheSerAlaLysLeuThrPhe354045SerSerGlnAlaThrThrMetGlyGluValValAspAlaGlyMetAsp505560AlaValGlnArgArgLeuMetPheGluAspGluCysIleLeuValAsp65707580GluAsnAspLysValValGlyHisGluSerLysTyrAsnCysHisLeu859095MetGluLysIleGluSerGluAsnLeuLeuHisArgAlaPheSerVal100105110PheLeuPheAsnSerLysTyrGluLeuLeuLeuGlnGlnArgSerAla115120125ThrLysValThrPheProLeuValTrpThrAsnThrCysCysSerHis130135140ProLeuTyrArgGluSerGluLeuIleAspGluAsnCysLeuGlyVal145150155160ArgAsnAlaAlaGlnArgLysLeuLeuAspGluLeuGlyIleProAla165170175GluAspLeuProValAspGlnPheIleProLeuSerArgIleLeuTyr180185190LysAlaProSerAspGlyLysTrpGlyGluHisGluLeuAspTyrLeu195200205LeuPheIleIleArgAspValAsnLeuAspProAsnProAspGluVal210215220AlaGluValLysTyrMetAsnArgAspAspLeuLysGluLeuLeuArg225230235240LysAlaAspAlaGluGluGluGlyValLysLeuSerProTrpPheArg245250255LeuValValAspAsnPheLeuPheLysTrpTrpAspHisValGluLys260265270GlySerLeuLysAspAlaAlaAspMetLysThrIleHisLysLeu275280285(2) INFORMATION FOR SEQ ID NO:18:(i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 261 amino acids(B) TYPE: amino acid(C) STRANDEDNESS: single(D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:ThrGlyProProProArgPhePheProIleArgSerProValProArg151015ThrGlnLeuPheValArgAlaPheSerAlaValThrMetThrAspSer202530AsnAspAlaGlyMetAspAlaValGlnArgArgLeuMetPheGluAsp354045GluCysIleLeuValAspGluAsnAsnArgValValGlyHisAspThr505560LysTyrAsnCysHisLeuMetGluLysIleGluAlaGluAsnLeuLeu65707580HisArgAlaPheSerValPheLeuPheAsnSerLysTyrGluLeuLeu859095LeuGlnGlnArgSerLysThrLysValThrPheProLeuValTrpThr100105110AsnThrCysCysSerHisProLeuTyrArgGluSerGluLeuIleGlu115120125GluAsnValLeuGlyValArgAsnAlaAlaGlnArgLysLeuPheAsp130135140GluLeuGlyIleValAlaGluAspValProValAspGluPheThrPro145150155160LeuGlyArgMetLeuTyrLysAlaProSerAspGlyLysTrpGlyGlu165170175HisGluValAspTyrLeuLeuPheIleValArgAspValLysLeuGln180185190ProAsnProAspGluValAlaGluIleLysTyrValSerArgGluGlu195200205LeuLysGluLeuValLysLysAlaAspAlaGlyAspGluAlaValLys210215220LeuSerProTrpPheArgLeuValValAspAsnPheLeuMetLysTrp225230235240TrpAspHisValGluLysGlyThrIleThrGluAlaAlaAspMetLys245250255ThrIleHisLysLeu260(2) INFORMATION FOR SEQ ID NO:19:(i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 288 amino acids(B) TYPE: amino acid(C) STRANDEDNESS: single(D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:MetThrAlaAspAsnAsnSerMetProHisGlyAlaValSerSerTyr151015AlaLysLeuValGlnAsnGlnThrProGluAspIleLeuGluGluPhe202530ProGluIleIleProLeuGlnGlnArgProAsnThrArgSerSerGlu354045ThrSerAsnAspGluSerGlyGluThrCysPheSerGlyHisAspGlu505560GluGlnIleLysLeuMetAsnGluAsnCysIleValLeuAspTrpAsp65707580AspAsnAlaIleGlyAlaGlyThrLysLysValCysHisLeuMetGlu859095AsnIleGluLysGlyLeuLeuHisArgAlaPheSerValPheIlePhe100105110AsnGluGlnGlyGluLeuLeuLeuGlnGlnArgAlaThrGluLysIle115120125ThrPheProAspLeuTrpThrAsnThrCysCysSerHisProLeuCys130135140IleAspAspGluLeuGlyLeuLysGlyLysLeuAspAspLysIleLys145150155160GlyAlaIleThrAlaAlaValArgLysLeuAspHisGluLeuGlyIle165170175ProGluAspGluThrLysThrArgGlyLysPheHisPheLeuAsnArg180185190IleHisTyrMetAlaProSerAsnGluProTrpGlyGluHisGluIle195200205AspTyrIleLeuPheTyrLysIleAsnAlaLysGluAsnLeuThrVal210215220AsnProAsnValAsnGluValArgAspPheLysTrpValSerProAsn225230235240AspLeuLysThrMetPheAlaAspProSerTyrLysPheThrProTrp245250255PheLysIleIleCysGluAsnTyrLeuPheAsnTrpTrpGluGlnLeu260265270AspAspLeuSerGluValGluAsnAspArgGlnIleHisArgMetLeu275280285(2) INFORMATION FOR SEQ ID NO:20:(i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 456 amino acids(B) TYPE: amino acid(C) STRANDEDNESS: single(D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:MetAspThrLeuLeuLysThrProAsnLeuGluPheLeuProHisGly151015PheValLysSerPheSerLysPheGlyLysCysGluGlyValCysVal202530LysSerSerAlaLeuLeuGluLeuValProGluThrLysLysGluAsn354045LeuAspPheGluLeuProMetTyrAspProSerLysGlyValValAsp505560LeuAlaValValGlyGlyGlyProAlaGlyLeuAlaValAlaGlnGln65707580ValSerGluAlaGlyLeuSerValCysSerIleAspProProLysLeu859095IleTrpProAsnAsnTyrGlyValTrpValAspGluPheGluAlaMet100105110AspLeuLeuAspCysLeuAspAlaThrTrpSerGlyAlaValTyrIle115120125AspAspThrLysAspLeuArgProTyrGlyArgValAsnArgLysGln130135140LeuLysSerLysMetMetGlnLysCysIleAsnGlyValLysPheHis145150155160GlnAlaLysValIleLysValIleHisGluGluLysSerMetLeuIle165170175CysAsnAspGlyThrIleGlnAlaThrValValLeuAspAlaThrGly180185190PheSerArgLeuValGlnTyrAspLysProTyrAsnProGlyTyrGln195200205ValAlaTyrGlyIleLeuAlaGluValGluGluHisProPheAspLys210215220MetValPheMetAspTrpArgAspSerHisLeuAsnAsnGluLeuLys225230235240GluArgAsnSerIleProThrPheLeuTyrAlaMetProPheSerSer245250255AsnArgIlePheLeuGluGluThrSerLeuValAlaArgProGlyLeu260265270ArgMetAspAspIleGlnGluArgMetValAlaArgLeuHisLeuGly275280285IleLysValLysSerIleGluGluAspGluHisCysValIleProMet290295300GlyGlyProLeuProValLeuProGlnArgValValGlyIleGlyGly305310315320ThrAlaGlyMetValHisProSerThrGlyTyrMetValAlaArgThr325330335LeuAlaAlaAlaProValValAlaAsnAlaIleIleTyrLeuGlySer340345350GluSerSerGlyGluLeuSerAlaGluValTrpLysAspLeuTrpPro355360365IleGluArgArgArgGlnArgGluPhePheCysPheGlyMetAspIle370375380LeuLeuLysLeuAspLeuProAlaThrArgArgPhePheAspAlaPhe385390395400PheAspLeuGluProArgTyrTrpHisGlyPheLeuSerSerArgLeu405410415PheLeuProGluLeuIleValPheGlyLeuSerLeuPheSerHisAla420425430SerAsnThrSerArgGluIleMetThrLysGlyThrProLeuValMet435440445IleAsnAsnLeuLeuGlnAspGlu450455(2) INFORMATION FOR SEQ ID NO:21:(i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 524 amino acids(B) TYPE: amino acid(C) STRANDEDNESS: single(D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:MetGluCysValGlyAlaArgAsnPheAlaAlaMetAlaValSerThr151015PheProSerTrpSerCysArgArgLysPheProValValLysArgTyr202530SerTyrArgAsnIleArgPheGlyLeuCysSerValArgAlaSerGly354045GlyGlySerSerGlySerGluSerCysValAlaValArgGluAspPhe505560AlaAspGluGluAspPheValLysAlaGlyGlySerGluIleLeuPhe65707580ValGlnMetGlnGlnAsnLysAspMetAspGluGlnSerLysLeuVal859095AspLysLeuProProIleSerIleGlyAspGlyAlaLeuAspHisVal100105110ValIleGlyCysGlyProAlaGlyLeuAlaLeuAlaAlaGluSerAla115120125LysLeuGlyLeuLysValGlyLeuIleGlyProAspLeuProPheThr130135140AsnAsnTyrGlyValTrpGluAspGluPheAsnAspLeuGlyLeuGln145150155160LysCysIleGluHisValTrpArgGluThrIleValTyrLeuAspAsp165170175AspLysProIleThrIleGlyArgAlaTyrGlyArgValSerArgArg180185190LeuLeuHisGluGluLeuLeuArgArgCysValGluSerGlyValSer195200205TyrLeuSerSerLysValAspSerIleThrGluAlaSerAspGlyLeu210215220ArgLeuValAlaCysAspAspAsnAsnValIleProCysArgLeuAla225230235240ThrValAlaSerGlyAlaAlaSerGlyLysLeuLeuGlnTyrGluVal245250255GlyGlyProArgValCysValGlnThrAlaTyrGlyValGluValGlu260265270ValGluAsnSerProTyrAspProAspGlnMetValPheMetAspTyr275280285ArgAspTyrThrAsnGluLysValArgSerLeuGluAlaGluTyrPro290295300ThrPheLeuTyrAlaMetProMetThrLysSerArgLeuPhePheGlu305310315320GluThrCysLeuAlaSerLysAspValMetProPheAspLeuLeuLys325330335ThrLysLeuMetLeuArgLeuAspThrLeuGlyIleArgIleLeuLys340345350ThrTyrGluGluGluTrpSerTyrIleProValGlyGlySerLeuPro355360365AsnThrGluGlnLysAsnLeuAlaPheGlyAlaAlaAlaSerMetVal370375380HisProAlaThrGlyTyrSerValValArgSerLeuSerGluAlaPro385390395400LysTyrAlaSerValIleAlaGluIleLeuArgGluGluThrThrLys405410415GlnIleAsnSerAsnIleSerArgGlnAlaTrpAspThrLeuTrpPro420425430ProGluArgLysArgGlnArgAlaPhePheLeuPheGlyLeuAlaLeu435440445IleValGlnPheAspThrGluGlyIleArgSerPhePheArgThrPhe450455460PheArgLeuProLysTrpMetTrpGlnGlyPheLeuGlySerThrLeu465470475480ThrSerGlyAspLeuValLeuPheAlaLeuTyrMetPheValIleSer485490495ProAsnAsnLeuArgLysGlyLeuIleAsnHisLeuIleSerAspPro500505510ThrGlyAlaThrMetIleLysThrTyrLeuLysVal515520__________________________________________________________________________

Number	Name	Date	Kind
5539093	Fitzmaurice et al.	Jul 1996
5589581	Misawa et al.	Dec 1996

Genes of carotenoid biosynthesis and metabolism and a system for screening for such genes

Information

Patent Number

Date Filed

Date Issued

Inventors

Original Assignees

Examiners

Agents

CPC

US Classifications

Field of Search

US

International Classifications

Abstract

Description

Claims

US Referenced Citations (2)

Non-Patent Literature Citations (7)

Entry
Buckner et al., Meth. Enzymol. 214:311-323, 1993.
Goodwin, Meth. Enzymol. 214:331-340, 1993.
Archives of Biochemistry and Biophysics, vol. 230, No. 2, pp. 446-454, 1984, Sandra L. Spurgeon, et al., "Isopentenyl Pyrophosphate Isomerase and Prenyltransferase From Tomato Fruit Plastids".
FEBS Letters, vol. 328, No. 1-2, pp. 130-138, Aug. 1993, Francis X. Cunningham, Jr., et al., "Cloning and Functional Expression in Escherichia coli of a Cyanobacterial Gene for Lycopene Cyclase, The Enzyme That Catalyzes the Biosynthesis of .beta.-Carotene".
The Journal of Biological Chemistry, vol. 271, No. 13, Mar. 29, 1996, pp. 7774-7780, Nuria Cunillera, et al., "Arabidopsis thaliana Contains Two Differentially Expressed Farnesyl-Diphosphate Synthase Genes".
The Journal of Biological Chemistry, vol. 271, No. 40, pp. 24349-24352, Oct. 4, 1996, Zairen Sun, et al., "Cloning and Functional Analysis of the .beta.-Carotene Hydroxylase of Arabidopsis thaliana".
Annual Review of Plant Physiology and Molecular Biology, vol. 45, pp. 287-301, Glenn E. Bartley, et al., "Molecular Biology of Carotenoid Biosynthesis in Plants" (1994).