Platenolide synthase gene

Information

  • Patent Grant
  • 5945320
  • Patent Number
    5,945,320
  • Date Filed
    Friday, February 21, 1997
    27 years ago
  • Date Issued
    Tuesday, August 31, 1999
    25 years ago
Abstract
A DNA molecule isolated from Streptomyces ambofaciens encodes the multi-functional proteins which direct the synthesis of the polyketide platenolide.
Description

BACKGROUND OF THE INVENTION
1. Field of the Invention
The present invention is directed to DNA molecules responsible for encoding the multi-functional proteins which direct the synthesis of the polyketide platenolide. The present invention also is directed to use of that DNA to produce compounds exhibiting antibiotic activity based on the platenolide structure, including specifically spiramycin and spiramycin analogues and derivatives.
2. Description of Related Art
Spiramycin is a macrolide antibiotic useful in both veterinary and human medicine produced by streptomycetes such as Streptomyces ambofaciens (ATCC 15154). Spiramycin is a 16-membered cyclic lactone, platenolide, with three attached sugar residues. Spiramycin's antibiotic activity is believed to be due to its inhibition of protein synthesis by a mechanism that involves binding of the antibiotic to a ribosome. Spiramycin is structurally similar to another antibiotic, tylosin, and the biosynthetic pathways of both are known to be similar.
The biosynthesis of tylosin has been thoroughly investigated (Baltz et al., Antimicrobial Agents and Chemotherapy, 20(2):214-225(1981); Beckmann et al., Genetics and Molecular Biology of Industrial Microorganisms, (1989):176-186). Polyketides are synthesized via a common mechanistic scheme thought to be related to fatty acid synthesis. The cyclic lactone framework is prepared by a series of condensations involving small carboxylic acid residues. Modifications of the structure, such as ketoreduction, dehydration and enolylreduction, also occur during the processing. The synthesis is driven by a set of large multi-functional polypeptides, referred to as polyketide syntheses.
PCT Publication WO 93/13663 describes the organization of the gene encoding the polyketide synthase of Saccharapolyspora erythraea. The gene is organized in modules, with each module effecting one condensation step. The precise sequence of chain growth and the processing of the growing chain is determined by the genetic information in each module. This PCT application describes an approach for synthesizing novel polyketide structures by manipulating in several ways the DNA governing the biosynthesis of the cyclic lactone framework. In order to adapt this methodology to other polyketides, however, the DNA molecules directing the biosynthetic processing must first be isolated.
The present invention is directed to the DNA sequence for the gene cluster responsible for encoding platenolide synthase, the building machinery of platenolide which is the basic building block of spiramycin. As a result, the present invention provides the information needed to synthesize novel spiramycin-related polyketides based on platenolide, arising from modifications of this DNA sequence designed to change the number and type of carboxylic acids incorporated into the growing polyketide chain and to change the kind of post-condensation processing that is conducted.
SUMMARY OF THE INVENTION
The present invention provides a DNA molecule comprising an isolated DNA sequence that encodes a platenolide synthase domain. Thus, the present invention provides the DNA molecule of SEQ ID NO:1 and DNA molecules that contain submodules thereof. The present invention also provides the products encoded by said DNA molecules, recombinant DNA expression vectors, and transformed microbial host cells. The present invention is further directed to a method of screening for new antibiotics based on the platenolide structure.





BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 shows the map of the srmg region of the S. ambofaciens DNA. Distances in kb are shown relative to the beginning of srmG. Open reading frames (ORF) are indicated by block arrows. The srmG DNA (0-42 kb) is the platenolide PKS region. The indicia Ap, G, E, K, P, and X denote restriction sites Apal, BglII, EcoRI, KpnI, PstI and XhoI, respectively. Predicted domains for the srmG DNA are labeled as shown. ACP stands for acyl carrier protein; AT stands for acyltransferase; DH stands for dehydratase; ER stands for enoylreductase; KR stands for ketoreductase; KS stands for ketosynthase; and KS' stands for a ketosynthase-like domain in which a glutamine residue is present in the position occupied by an active site cysteine in a normal ketosynthase. KR' is a domain that resembles a ketoreductase but which is predicted to be inactive.
FIG. 2 demonstrates the biosynthetic pathway for platenolide synthesis. A denotes malonyl-CoA; B denotes ethylmalcnyl-CoA; P denotes methylmalonyl-CoA; C2 denotes a CoA derivative related to malonyl-CoA but of unknown structure.
FIG. 3 shows the map of two clones that span the whole region of the srmG DNA.





DETAILED DESCRIPTION OF THE INVENTION
The term polyketide defines a class of molecules produced through the successive condensation of small carboxylic acids. This diverse group includes plant flavonoids, fungal aflatoxins, and hundreds of compounds of different structures that exhibit antibacterial, antifungal, antitumor, and anthelmintic properties. Some polyketides produced by fungi and bacteria are associated with sporulation or other developmental pathways; others do not yet have an ascribed function. Some polyketides have more than one pharmacological effect. The diversity of polyketide structures reflects the wide variety of their biological properties. Many cyclized polyketides undergo glycosidation at one or more sites, and virtually all are modified during their synthesis through hydroxylation, reduction, epoxidation, etc.
A common feature of compounds in this class is that their synthesis is directed by a complex of multi-functional peptides, termed a "polyketide synthase". Molecular genetic analysis of polyketide synthase genes has revealed two distinct classes of enzymes operating for different polyketides: (a) the aromatics, which are made through an essentially iterative process; (b) the complex polyketides, which comprise several repeats of the same activities arranged in few, very large polypeptides. A common feature among complex polyketide synthase genes is that they are generally arranged in several open reading frames (ORFs), each of which contains one or more repeated units, designated modules. Each module processes one condensation step and typically requires several activities accomplished by several enzymes including acyl carrier protein (ACP), .beta.-ketosynthase (KS), and acyltransferase (AT).
Therefore a "module" is defined as the genetic element encoding a multi-functional protein segment that is responsible for all of the distinct activities required in a single round of synthesis, i.e., one condensation step and all the .beta.-carbonyl processing steps associated therewith. Each module encodes an ACP, a KS, and an AT activity to accomplish the condensation portion of the synthesis, and selected post-condensation activities to effect .beta.-carbonyl processing. Each module is therefore, further characterized by the inclusion of submodules that are responsible for encoding the distinct activities of a complex polyketide synthase. A "submodule" thus is defined as the portion of the polyketide synthase DNA sequence that encodes a distinct activity, or "domain". A distinct activity or domain is commonly understood to mean that part of the polyketide synthase polyprotein necessary for a given distinct activity.
The protein segments corresponding to each module are called synthase units (SUs). Each SU is responsible for one of the fatty acid-like cycles required for completing the polyketide; it carries the elements required for the condensation process, for selecting the particular extender unit (a coenzyme A thioester of a dicarboxylate) to be incorporated, and for the extent of processing that the .beta.-carbon will undergo. After completion of the cycle, the nascent polyketide is transferred from the ACP it occupies to the KS of the next SU utilized, where the appropriate extender unit and processing level are introduced. This process is repeated, employing a new SU for each elongation cycle, until the programmed length has been reached. As in synthesis, of long chain fatty acids, the number of elongation cycles determines the length of the molecule. However, whereas fatty acid synthesis involves a single SU used iteratively, formation of complex polyketides requires participation of a different SU for each cycle, thereby ensuring that the correct molecular structure is produced.
The composition of the polyketide synthase gene modules are variable. Some carry the full complement of .beta.-ketoreductase(KR), dehydratase(DH), and enoylreductase(ER) domains, and some encode a particular domain only or lack a functional domain, although much of the sequence is preserved.
This variable composition of the modules, which correlate with the asymmetry in the synthesis of the polyketide precursor, enable a specific step to be assigned to each module. Since each enzymatic activity is involved in a single biochemical step in the pathway, loss of any one activity should affect only a single step in the synthesis. Knowledge of the correlation between the structure of the polyketide and the organization of the polyketide synthase genes enables one to produce altered genes selectively which produce a polyketide derivative with predicted structure.
Because the degree of processing appears to depend on the presence of functional domains in a particular SU, inactivation of a KR, DH, or ER will result in a polyketide less processed at a single site, but only if the altered chain thus produced can be utilized as a substrate for the subsequent synthesis steps. Thus, the inactivation of one of these domains should result in the formation of a polyketide retaining a ketone, hydroxyl, or site of unsaturation at the corresponding position. This rationale has led to the successful production of altered erythromycin derivatives from strains in which a KR or an ER domain had been inactivated.
Thus, one can engineer polyketide pathways by genetic intervention of the polyketide synthase and by adding or eliminating modification steps. Many of the enzymes involved in postpolyketide modifications do not seem to have absolute specificity for a particular structure. In addition one can also select the desired components from a library of polyketide and postpolyketide biosynthesis genes and combine them to produce novel structures.
The present invention provides, in particular, the DNA sequence encoding the polyketide synthase responsible for biosynthesis of platenolide, i.e., platenolide synthase. Platenolicle itself is the foundation for spiramycin-related polyketides. The platenolide synthase DNA sequence, which defines the platenolide synthase gene cluster, directs biosynthesis of the platenolide polyketide by encoding the various distinct activities of platenolide synthase.
The gene cluster for platenolide synthase, like other polyketide biosynthetic genes whose organization has been elucidated, is characterized by the presence of several ORFs, each of which contains one or more repeated units termed modules as defined above. Each module also further includes submodules as defined above. Organization of the platenolide synthase gene cluster derived from Streptomyces ambofaciens is shown in FIG. 2. The accompanying synthetic pathway and the specific carboxylic acid substrates that are used for each condensation reaction and the post-condensation activities of platenolide synthesis are indicated in FIG. 1.
A preferred DNA molecule comprising the platenolide synthase gene cluster isolated from Streptomyces ambofaciens is represented by SEQ ID NO: 1. Other preferred DNA molecules of the present invention include the various ORFS of SEQ ID NO: 1 that encode individual multi-functional polypeptides. These are represented by ORF1, 350 to 14002, ORF2, 14046 to 20036, ORF3, 20110 to 31284, ORF4, 31329 to 36071, and ORF5, 36155 to 41830 all in SEQ ID NO: 1. The predicted amino acid sequences of the various peptides encoded by these sequences are shown in SEQ ID NO: 2, 3, 4, 5, and 6.
Yet other preferred DNA molecules of the present invention include the modules that encode all the activities necessary for a single round of synthesis. These are represented by starter module 392 to 3424, module 1, 3527 to 8197, module 2, 8270 to 13720, module 3, 14148 to 19730, module 4, 20215 to 24678, module 5, 24742 to 31002, module 6, 31428 to 35837, and module 7, 36257 to 41395 all in SEQ ID NO: 1. The predicted amino acid sequences of the various synthase units encoded by these modules are represented by starter SU 15 to 1025, SU1, 1060 to 2616, and SU2, 2641 to 4457 in SEQ ID NO: 2; SU3, 35 to 1895 in SEQ ID NO: 3; SU4, 36 to 1523, and SU5, 1545 to 3631 in SEQ ID NO: 4; SU6, 34 to 1503 in SEQ ID NO: 5; SU7, 35 to 1747 all in SEQ ID NO: 6.
Still other preferred DNA molecules include the various submodules that encode the various domains of platenolide synthase. These submodules are represented by KS'(s), 392 to 1603, AT(s), 1922 to 2995, and ACP(s), 3173 to 3424 of starter module in SEQ ID NO:1; KS1, 3527 to 4798, AT1, 5135 to 6208, KR1, 7043 to 7597, and ACP1, 7946 to 8197 of module 1 in SEQ ID NO: 1; KS2, 8270 to 9541, AT2, 9899 to 10909, DH2, 10985 to 11530, KR2, 12596 to 13153, and ACP2, 13469 to 13720 of module 2 in SEQ ID NO: 1; KS3, 14148 to 15422, AT3, 15789 to 16844, DH3, 16914 to 17510, KR3, 18612 to 19166, and ACP3, 19479 to 19730 of module 3 in SEQ ID NO: 1; KS4, 20215 to 21486, AT4, 21889 to 22872, KR'4, 23638 to 24159, and ACP4, 24484 to 24678 of module 4 in SEQ ID NO: 1; KS5, 24742 to 26016, AT5, 26371 to 27381, DH5, 27442 to 27966, ER5, 28843 to 29892, KR5, 29905 to 30462, and ACP5, 30760 to 31002 of module 5 in SEQ ID NO: 1; KS6, 31428 to 32696, AT6, 33024 to 34022, KR6, 34770 to 35327, and ACP6, 35586 to 35837 of module 6 in SEQ ID NO: 1; KS7, 36257 to 37528, AT7, 37898 to 38905, KR7, 39851 to 40408, ACP7, 40658 to 40909, and TE, 41297 to 41395 of module 7 in SEQ ID NO: 1. The predicted amino acid sequences of the various domains encoded by these submodules are represented by KS'(s), 15 to 418, AT(s), 525 to 882, and ACP(s), 942 to 1025 of starter SU in SEQ ID NO:2; KS1, 1060 to 1483, AT1, 1596 to 1953, KR1, 2232 to 2416, and ACP1, 2533 to 2616 of SU1 in SEQ ID NO: 2; KS2, 2641 to 3064, AT2, 3184 to 3520, DH2, 3546 to 3727, KR2, 4083 to 4268, and ACP2, 4374 to 4457 of SU2 in SEQ ID NO: 2; KS3, 35 to 459, AT3, 582 to 933, DH3, 957 to 1155, KR3, 1523 to 1707, and ACP3, 1812 to 1895 of SU3 in SEQ ID NO: 3; KS4, 36 to 459, AT4, 594 to 921, KS.sup.0 4, 1177 to 1350, and ACP4, 1459 to 1523 of SU4 in SEQ ID NO: 4; KS5, 1545 to 1969, AT5, 2088 to 2424, DH5, 2445 to 2619, ER5, 2912 to 3261, KR5, 3266 to 3451, and ACP5, 3551 to 3631 of SU5 in SEQ ID NO: 4; KS6, 34 to 456, AT6, 566 to 898, KR6, 1148 to 1333, and ACP6, 1420 to 1503 of SU6 in SEQ ID NO: 5; KS7, 35 to 458, AT7, 582 to 917, KR7, 1233 to 1418, ACP7, 1502 to 1585, and TE, 1715 to 1747 of SU7 in SEQ ID NO: 6.
Although not wishing to be bound to any particular technical explanation, a sequence similarity exists among domain boundaries in various polyketide synthase genes. Thus, one skilled in the art is able to predict the domain boundaries of newly discovered polyketide synthase genes based on the sequence information of known polyketide synthase genes. In particular, the boundaries of submodules, domains, and open reading frames in the instant application are predicted based on sequence information disclosed in this application and the locations of the domain boundaries of the erythromycin polyketide synthase (Donadio et al., GENE, 111 51-60 (1992)). Furthermore, the genetic organization of the platenolide synthase gene cluster appears to correspond to the order of the reactions required to complete synthesis of platenolide. This means that the polyketide synthase DNA sequence can be manipulated to generate predictable alterations in the final platenolide product.
The DNA sequence of the platenolide synthase gene can be determined from recombinant DNA clones prepared from the DNA of Streptomyces ambofaciens, in particular strain ATCC 15154. The platenolide synthase gene endogenous to Streptomyces ambofaciens (ATCC 15154) is contained in recombinant DNA vectors pKC1080 and pKC1306 (FIG. 2), which are freely available for the duration of the patent term from the National Center for Agricultural Utilization Research, 1815 North University Street, Peoria, Ill. 61604-3999, in E. coli DH10B under accession numbers B-21500 for pKC1080 (deposited Sep. 21, 1995) and B-21499 for pKC1306 (deposited Sep. 21, 1995) respectively.
Techniques of isolating bacterial DNA are readily available and well known in the art. Any such techniques can be employed in this invention. In particular DNA from these deposited cultures can be isolated as follows. Lyophils of E. coli DH10B/pKC1080 or E. coli DH10B/pKC1306 are plated onto L-agar (10 g tryptone, 10 g NaCl, 5 g yeast extract, and 15 g agar per liter) plates containing 100 .mu.g/ml apramycin to obtain a single colony isolate of the strain. This colony is used to inoculate about 500 ml of L-broth (10 g tryptone, 10 g NaCl, 5 g yeast extract per liter) containing 100 .mu.g/ml apramycin, and the resulting culture is incubated at 37.degree. C. with aeration until the cells reach stationary phase. Cosmid DNA can be obtained from the cells in accordance with procedures known in the art (see e.g., Rao et al., 1987 in Methods in Enzymology, 153:166).
DNA of the current invention can be sequenced using any known techniques in the art such as the dideoxynucleotide chain-termination method (Sanger, et al., Proc. Natl. Acad. Sci. 74:5463 (1977)) with either radioisotopic or fluorescent labels. Double-stranded, supercoiled DNA can be used directly for templates in sequence reactions with sequence-specific oligonucleotide primers. Alternatively, fragments can be used to prepare libraries of either random, overlapping sequences in the bacteriophage M13 or nested, overlapping deletions in a plasmid vector. Individual recombinant DNA subclones are then sequenced with vector-specific oligonucleotide primers. Radioactive reaction products are electrophoresed on denaturing polyacrylamide gels and analyzed by autoradiography. Fluorescently labeled reaction products are electrophoresed and analyzed on Applied Biosystems (ABI Division, Perkin Elmer, Foster City, Calif. 94404) model 370A and 373A or Dupont (Wilmington, Del.) Genesis DNA sequencers. Sequence data are assembled and edited using Genetic Center Group (GCG, Madison, Wis.) programs GelAssemble and Seqed or the ABI model 670 Inherit Sequence Analysis system and the AutoAssembler and SeqEd programs.
Polypeptides corresponding to a domain, a submodule, a module, a synthesis unit (SU), or an open reading frame can be produced by transforming a host cell such as bacteria, yeast, or eukaryotic cell-expression system with the cDNA sequence in a recombinant DNA vector. It is well within one skilled in the art to choose among host cells and numerous recombinant DNA expression vectors to practice the instant invention. Multifunctional polypeptides of polyketide platenolide synthase can be extracted from platenolide-producing bacteria such as Streptomyces ambofaciens or translated in a cell-free in vitro translation system. In addition, the techniques of synthetic chemistry can be employed to synthesize some of the polypeptides mentioned above.
Procedures and techniques for isolation and purification of proteins produced in recombinant host cells are known in the art. See, for example, Roberts et al., Eur. J. Biochem. 214, 305-311, (1993) and Caffrey et al., FEBS 304, 225-228 (1992) for detailed description of polyketide synthase purification in bacteria. To achieve a homogeneous preparation of a polypeptide, proteins in the crude cell extract can be separated by size and/or charge through different columns well known in the art once or several times. In particular the crude cell extract can be applied to various cellulose columns commercially available such as DEAE-cellulose columns. Subsequently the bound proteins can be eluted and the fractions can be tested for the presence of the polyketide platenolide synthase or engineered derivative protein. Techniques for detecting the target protein are readily available in the art. Any such techniques can be employed for this invention. In particular the fractions can be analysized on Western blot using antibodies raised against a portion or portions of such polyketide platenolide synthase proteins. The fractions containing the polyketide platenolide synthase protein can be pooled and further purified by passing through more columns well known in the art such as applying the pooled fractions to a gel filtration column. When visualized on SDS-PAGE gels homogeneous preparations contain a single band and are substantially free of other proteins.
Knowledge of the platenolide synthase DNA sequence, its genetic organization, and the activities associated with particular open reading frames, modules, and submodules of the gene enables production of novel polyketides having a predicted structure that are not otherwise available. Modifications may be made to the DNA sequence that either alter the initial carboxylic acid building block used or alter the building block added at any of the condensation steps. The platenolide synthase gene may also be modified to alter the actual number of condensation steps done, thereby changing the size of the carbon backbone. Submodules that are part of the present invention may be selectively inactivated thereby giving rise to predictable, novel polyketide structures. Modifications to portions of the DNA sequence that encode the post-condensation processing activities will alter the functional groups appearing at the various condensation sites on the carbon chain backbone.
One skilled in the art is fully familiar with the degeneracy of the genetic code. Consequently, the skilled artisan can modify the specific DNA sequences provided by this disclosure to provide proteins having the same or improved characteristics compared to those polypeptides specifically provided herein. Also, one skilled in the art can modify the DNA sequences to express an identical protein to those provided, albeit expressed at higher levels. Furthermore, one skilled in the art is familiar with means to prepare synthetically, either partially, or in whole, DNA sequences which would be useful in preparing recombinant DNA vectors or coding sequences which are encompassed by the current invention. Additionally, recombinant means for modifying the DNA sequences provided may include for example site-directed deletion or site-directed mutagenesis. These techniques are well known to those skilled in the art and require no further elaboration here. Consequently, as used herein, DNA which is isolated from natural sources, prepared synthetically or semi-synthetically, or which are modified by recombinant DNA methods, are within the scope of the present invention.
Likewise, those skilled in the art will recognize that the polypeptides of the invention may be expressed recombinantly. Alternatively, these polypeptides may be synthesized as well, either in whole or in part, by conventional known non-recombinant techniques; for example, solid-phase synthesis. Thus, the present invention should not be construed as necessarily limited to any specific vector constructions or means for production of the specific polyketide synthase molecules exemplified. These alternate means for preparing the present polypeptides are meant to be encompassed by the present invention.
Many cyclized polyketides undergo glycosidation at one or more sites. Spiramycin is a 16-membered cyclic lactone, platenolide, with three attached sugar residues. The process of converting platenolide to spiramycin is well known in the art. The present invention also provides the information needed to synthesize novel spiramycin-related polyketides based on platenolide. The principles have already been described above. In addition, any product resulting from post-transcriptional or post-translational modification in vivo or in vitro based on the DNA sequence information disclosed here are meant to be encompassed by the present invention.
The following example is provided for exemplification purposes only and is not intended to limit the scope of the invention which has been described in broad terms above.
EXAMPLE 1
Specific Experimental Details and Results from the Sequencing of Platenolide Synthase
The DNA sequence of the S. ambofaciens platenolide synthase (srmG) gene can be obtained by sequencing inserts of recombinant DNA subclones containing contiguous or overlapping DNA segments of the region indicated in FIG. 3. All sequences representing srmG are fully contained in the overlapping cosmid clones pKC1080 and pKC1306 (FIG. 3). The sequence can be obtained by subcloning and sequencing the fragments bounded by NruI sites at position 1, 0.3 kb, 8.2 kb, 14.1 kb, 20.2 kb, 29.5 kb, 31.4 kb, 41.1 kb and 42.0 kb. In order to obtain the srmG region on a single fragment, the 25.0 kb fragment bounded by the NruI site at position 1 and the SfuI site at 25.0 kb should be isolated from a partial digestion of pKC1080 with restriction enzymes NruI and SfuI. The 17.8 kb DNA fragment bounded by the SfuI sites at 25.0 kb and 42.8 kb should be isolated from a digestion of pKC1306 with the restriction enzyme SfuI. The resulting fragments should be ligated and cloned in an appropriate recombinant DNA vector. Clones containing the correct orientation of the two ligated fragments can be identified by restriction enzyme site mapping.
The principles, preferred embodiments and modes of operation of the present invention have been described in the foregoing specification. The invention which is intended to be protected herein, however, is not to be construed as limited to the particular forms disclosed, since they are to be regarded as illustrative rather than restrictive. Variations and changes may be made by those skilled in the art without departing from the spirit of the invention.
__________________________________________________________________________# SEQUENCE LISTING- (1) GENERAL INFORMATION:- (iii) NUMBER OF SEQUENCES: 6- (2) INFORMATION FOR SEQ ID NO:1:- (i) SEQUENCE CHARACTERISTICS:#pairs (A) LENGTH: 44377 base (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear- (ii) MOLECULE TYPE: DNA (genomic)- (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 350..14002- (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 14046..20036- (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 20110..31284- (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 31329..36071- (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 36155..41830- (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:- GACCGCTCGG GGAGACCTGA CATATTCGTC GCGAAGTGGT TGTCCGCGCC GC - #GAGGTACT 60- GAAATCTTCT CCGCTCGCCC AGGACTCCGC GTGCAGGTCA CCGGAGTGCG CG - #ACCGGCCG 120- GGACGTCGGA GCGCCGACCC TGCGGACCTG GTGCGATGCC GTGTGGTCCC GC - #ATGATCCC 180- GCGCCGTCTC CGGTGACGAG AATCGGTGGA CAATCTCCGA ACTTGACACA AT - #TGATTGTC 240- GTTCACCGGC CGTTCCTGTC GCCCGGCAGT TCGCCCGCTG TACGCTCGGG AA - #GATCAAGA 300- AAAGGCAGAA AAGCCACGGC GTGGTACGGC GAACATATGA GGGATGCAGG TG - #TCTGGAGA 360- ACTCGCGATT TCCCGCAGTG ACGACCGGTC CGACGCCGTT GCCGTGGTCG GA - #ATGGCGTG 420- CCGGTTTCCC GGCGCCCCGG GAATTGCCGA ATTCTGGAAA CTGCTGACCG AC - #GGAAGGGA 480- CGCGATCGGC CGGGACGCCG ACGGCCGCCG GCGCGGCATG ATCGAGGCGC CC - #GGCGACTT 540- CGACGCCGCC TTCTTCGGCA TGTCACCCCG CGAGGCCGCC GAGACCGACC CC - #CAGCAGCG 600- CCTGATGCTC GAACTCGGCT GGGAGGCTCT GGAGGACGCC GGCATCGTCC CC - #GGCTCCCT 660- GCGCGGCGAG GCGGTCGGCG TCTTCGTCGG GGCCATGCAC GACGACTACG CC - #ACCCTGCT 720- CCACCGCGCC GGCGCGCCGG TCGGCCCCCA CACCGCCACC GGCCTCCAGC GC - #GCCATGCT 780- CGCCAACCGG CTCTCCTACG TCCTGGGGAC GCGCGGCCCC AGCCTCGCGG TC - #GACACCGC 840- CCAGTCGTCC TCCCTGGTCG CCGTGGCCCT CGCCGTCGAG AGCCTGCGGG CC - #GGCACCTC 900- CCGCGTCGCC GTCGCCGGGG GCGTCAACCT GGTCCTCGCC GACGAGGGAA CG - #GCCGCCAT 960- GGAACGCCTC GGCGCGCTGT CACCCGACGG CCGCTGCCAC ACCTTCGACG CC - #CGTGCCAA1020- CGGCTATGTC CGCGGTGAGG GCGGCGCCGC CGTCGTCCTG AAGCCCCTCG CC - #GACGCCCT1080- GGCCGACGGG GACCCCGTGT ACTGCGTGGT GCGTGGCGTC GCCGTCGGCA AC - #GACGGCGG1140- CGGCCCCGGG CTGACCGCTC CCGACCGCGA GGGACAGGAG GCGGTGCTCC GG - #GCCGCCTG1200- CGCCCAGGCC CGGGTCGACC CCGCCGAGGT GCGTTTCGTC GAACTGCACG GC - #ACGGGAAC1260- CCCGGTGGGC GACCCGGTCG AGGCACACGC CCTCGGCGCG GTGCACGGCT CC - #GGTCGGCC1320- GGCCGACGAC CCCCTGCTGG TGGGGTCGGT GAAGACCAAC ATCGGCCACC TG - #GAGGGCGC1380- CGCCGGCATC GCGGGCCTGG TCAAGGCCGC ACTGTGCCTG CGGGAACGCA CC - #CTTCCCGG1440- CTCGCTGAAC TTCGCCACCC CCTCTCCGGC CATCCCGCTG GACCAGCTCC GG - #CTGAAGGT1500- GCAGACCGCT GCCGCCGAGC TGCCGCTCGC CCCGGGCGGC GCACCCCTGC TG - #GCGGGTGT1560- CAGTTCGTTC GGCATCGGTG GCACCAACTG CCATGTGGTC CTGGAACACC TG - #CCCTCCCG1620- GCCCACCCCG GCCGTCTCCG TCGCCGCCTC GCTTCCGGAC GTCCCGCCGC TG - #TTGTTGTC1680- CGCGCGGTCG GAGGGGGCGT TGCGGGCGCA GGCGGTGCGG TTGGGTGAGT AC - #GTGGAGCG1740- GGTGGGCGCG GATCCGCGGG ATGTGGCTTA TTCGCTGGCT TCGACGCGGA CT - #CTTTTCGA1800- GCACCGTGCG GTGGTGCCGT GTGGTGGGCG TGGGGAGCTC GTCGCTGCTC TT - #GGTGGGTT1860- TGCTGCCGGG AGGGTGTCTG GGGGTGTGCG GTCCGGGCGG GCTGTGCCGG GT - #GGGGTGGG1920- GGTGTTGTTC ACGGGTCAGG GTGCGCAGTG GGTTGGTATG GGGCGTGGGT TG - #TATGCGGG1980- GGGTGGGGTG TTTGCGGAGG TGCTGGATGA GGTGTTGTCG ATGGTGGGGG AG - #GTGGATGG2040- TCGGTCGTTG CGGGATGTGA TGTTCGGCGA CGTCGACGTG GACGCGGGTG CC - #GGGGCTGA2100- TGCGGGTGCC GGTGCGGGTG CTGGGGTCGG TTCTGGTTCC GGTTCTGTGG GT - #GGGTTGTT2160- GGGTCGGACG GAGTTTGCTC AGCCTGCGTT GTTTGCGTTG GAGGTGGCGT TG - #TTCCGGGC2220- GTTGGAGGCT CGGGGTGTGG AGGTGTCGGT GGTGTTGGGT CATTCGGTGG GG - #GAGGTGGC2280- TGCTGCGTAT GTGGCGGGGG TGTTGTCGTT GGGTGATGCG GTGCGGTTGG TG - #GTGGCGCG2340- GGGTGGGTTG ATGGGTGGGT TGCCGGTGGG TGGGGGGATG TGGTCGGTGG GG - #GCGTCGGA2400- GTCGGTGGTG CGGGGGGTTG TTGAGGGGTT GGGGGAGTGG GTGTCGGTTG CG - #GCGGTGAA2460- TGGGCCGCGG TCGGTGGTGT TGTCGGGTGA TGTGGGTGTG CTGGAGTCGG TG - #GTTGCCTC2520- GCTGATGGGG GATGGGGTGG AGTGCCGGCG GTTGGATGTG TCGCATGGGT TT - #CATTCGGT2580- GTTGATGGAG CCGGTGTTGG GGGAGTTCCG GGGGGTTGTG GAGTCGTTGG AG - #TTCGGTCG2640- GGTGCGGCCG GGTGTGGTGG TGGTGTCGGG TGTGTCGGGT GGGGTGGTGG GT - #TCGGGGGA2700- GTTGGGGGAT CCGGGGTATT GGGTGCGTCA TGCGCGGGAG GCGGTGCGTT TC - #GCGGATGG2760- GGTGGGGGTG GTGCGTGGTC TGGGTGTGGG GACGTTGGTG GAGGTGGGTC CG - #CATGGGGT2820- GCTGACGGGG ATGGCGGGTG AGTGCCTGGG GGCCGGTGAT GATGTGGTGG TG - #GTGCCGGC2880- GATGCGGCGG GGCCGTGCGG AGCGGGAGGT GTTCGAGGCG GCGCTGGCGA CG - #GTGTTCAC2940- CCGGGACGCC GGCCTGGACG CCACGGCACT CCACACCGGG AGCACCGGCC GG - #CGCATCGA3000- CCTCCCCACC TACCCCTTCC AACGCCGTAC CCACTGGTCG CCCGCGCTGA GC - #CGGCCGGT3060- CACGGCCGAC GCCGGGGCGG GTGTGACCGC CACCGATGCC GTGGGGCACA GC - #GTCTCCCC3120- GGACCCGGAG AGCACCGAGG GGACGTCCCA CAGGGACACG GACGACGAGG CG - #GACTCGGC3180- GTCACCGGAG CCGATGTCCC CCGAGGATGC CGTCCGCCTG GTCCGCGAGA GC - #ACCGCGGC3240- CGTCCTGGGC CACGACGATC CCGGCGAGGT CGCGCTCGAC CGCACCTTCA CC - #TCCCAGGG3300- CATGGACTCG GTGACCGCGG TCGAGCTGTG CGACCTGCTG AAGGGCGCCT CG - #GGGCTCCC3360- CCTCGCCGCC ACGCTGGTCT ACGACCTGCC CACCCCGCGT GCCGTCGCCG AG - #CACATCGT3420- GGAAGCCGCG GGCGGGCCGA AGGACTCGGT TGCCGGTGGG CCCGGAGTGC TC - #TCGTCGGC3480- CGCGGTAGGG GTGTCGGACG CCCGGGGCGG CAGCCGGGAC GACGACGACC CG - #ATCGCCAT3540- CGTGGGTGTC GGCTGCCGGC TCCCCGGCGG CGTCGACTCG CGCGCCGCTC TC - #TGGGAGCT3600- GCTGGAGTCC GGCGCCGACG CCATCTCGTC CTTCCCCACC GACCGCGGCT GG - #GACCTCGA3660- CGGGCTGTAC GACCCCGAGC CCGGGACGCC CGGCAAGACC TATGTGCGGG AG - #GGCGGGTT3720- CCTGCACTCG GCGGCCGAGT TCGACGCGGA GTTCTTCGGG ATATCGCCGC GC - #GAGGCCAC3780- GGCCATGGAC CCGCAGCAGC GCTTGCTGCT GGAAGCGTCG TGGGAGGCCC TC - #GAGGACGC3840- CGGAGTGCTC CCCGAGTCAC TGCGCGGCGG CGACGCCGGA GTGTTCGTCG GC - #GCCACCGC3900- ACCGGAGTAC GGGCCGAGGC TTCACGAGGG AGCGGACGGA TACGAGGGGT AC - #CTGCTCAC3960- CGGCACCACC GCGAGCGTGG CCTCCGGCCG GATCGCCTAC ACCCTCGGCA CC - #GGCGGACC4020- GGCGCTCACC GTCGACACCG CGTGCTCCTC GTCCCTGGTG GCGCTGCACC TG - #GCCGTGCA4080- GGCGCTGCGC CGGGGCGAGT GCGGGCTGGC TCTGGCGGGC GGCGCCACGG TG - #ATGTCGGG4140- GCCCGGCATG TTCGTGGAGT TCTCGCGGCA GCGCGGGCTC GCCCCCGACG GC - #CGCTGCAT4200- GCCGTTCTCC GCCGATGCCG ACGGTACGGC CTGGTCCGAG GGTGTCGCCG TA - #CTGGCACT4260- GGAGCGGCTC TCCGACGCCC GGCGTGCGGG ACACCGGGTG CTGGGCGTGG TG - #CGGGGCAG4320- TGCGGTCAAC CAGGACGGTG CCAGCAACGG CCTGACCGCT CCCAACCGCT CC - #GCGCAGGA4380- GGGCGTCATC CGAGCTGCCC TGGCCGACGC CGGCCTCGCG CCGGGTGACG TG - #GACGCGGT4440- GGAGGCGCAC GGTACGGGGA CGGCGCTGGG CGATCCGATC GAGGCGAGCG CG - #CTGCTGGC4500- CACGTACGGG CGTGAGCGGG TGGGCGACCC CTTGTGGCTC GGGTCGCTGA AG - #TCCAACGT4560- CGGTCACACC CAGGCCGCCG CGGGGGCCGC GGGTGTGGTC AAGATGCTGC TT - #GCCCTGGA4620- GCACGGCACG CTGCCGCGGA CACTTCACGC GGACCGGCCC AGCACGCACG TC - #GACTGGTC4680- GTCGGGCACC GTCGCCCTGC TGGCAGAGGC GCGCCGGTGG CCCCGGCGGT CG - #GACCGCCC4740- GCGCCGGGCG GCTGTGTCGT CGTTCGGGAT CAGTGGGACG AACGCGCATC TG - #ATCATCGA4800- GGAGGCGCCG GAGTGGGTCG AGGACATCGA CGGCGTCGCT GCTCCTGACC GC - #GGTACCGC4860- GGACGCGGCT GCTCCGTCGC CGCTGTTGTT GTCCGCGCGG TCGGAGGGGG CG - #TTGCGGGC4920- GCAGGCGGTG CGGTTGGGTG AGTACGTGGA GCGGGTGGGT GCGGATCCGC GG - #GATGTGGC4980- TTATTCGCTG GCTTCGACGC GGACTCTTTT CGAGCACCGT GCGGTGGTGC CG - #TGTGGTGG5040- GCGTGGGGAG CTCGTCGCTG CTCTTGGTGG GTTTGCTGCC GGGAGGGTGT CT - #GGGGGTGT5100- GCGGTCCGGG CGGGCTGTGC CGGGTGGGGT GGGGGTGTTG TTCACGGGTC AG - #GGTGCGCA5160- GTGGGTTGGT ATGGGGCGTG GGTTGTATGC GGGGGGTGGG GTGTTTGCGG AG - #GTGCTGGA5220- TGAGGTGTTG TCGATGGTGG GGGAGGTGGA TGGTCGGTCG TTGCGGGATG TG - #ATGTTCGG5280- CGACGTCGAC GTGGACGCGG GTGCCGGGGC TGATGCGGGT GCCGGTGCGG GT - #GCTGGGGT5340- CGGTTCTGGT TCCGGTTCTG TGGGTGGGTT GTTGGGTCGG ACGGAGTTTG CT - #CAGCCTGC5400- GCTGTTTGCG TTGGAGGTGG CGTTGTTCCG GGCGTTGGAG GCTCGGGGTG TG - #GAGGTGTC5460- GGTGGTGTTG GGTCATTCGG TGGGGGAGGT GGCTGCTGCG TATGTGGCGG GG - #GTGTTGTC5520- GTTGGGTGAT GCGGTGCGGT TGGTGGTGGC GCGGGGTGGG TTGATGGGTG GG - #TTGCCGGT5580- GGGTGGGGGG ATGTGGTCGG TGGGGGCGTC GGAGTCGGTG GTGCGGGGGG TT - #GTTGAGGG5640- GTTGGGGGAG TGGGTGTCGG TTGCGGCGGT GAATGGGCCG CGGTCGGTGG TG - #TTGTCGGG5700- TGATGTGGGT GTGCTGGAGT CGGTGGTTGC CTCGCTGATG GGGGATGGGG TG - #GAGTGCCG5760- GCGGTTGGAT GTGTCGCATG GGTTTCATTC GGTGTTGATG GAGCCGGTGT TG - #GGGGAGTT5820- CCGGGGGGTT GTGGAGTCGT TGGAGTTCGG TCGGGTGCGG CCGGGTGTGG TG - #GTGGTGTC5880- GGGTGTGTCG GGTGGGGTGG TGGGTTCGGG GGAGTTGGGG GATCCGGGGT AT - #TGGGTGCG5940- TCATGCGCGG GAGGCGGTGC GTTTCGCGGA TGGGGTGGGG GTGGTGCGTG GT - #CTGGGTGT6000- GGGGACGTTG GTGGAGGTGG GTCCGCATGG GGTGCTGACG GGGATGGCGG GT - #GAGTGCCT6060- GGGGGCCGGT GATGATGTGG TGGTGGTGCC GGCGATGCGG CGGGGCCGTG CG - #GAGCGGGA6120- GGTGTTCGAG GCGGCGCTGG CGACGGTGTT CACCCGGGAC GCCGGCCTGG AC - #GCCACGGC6180- ACTCCACACC GGGAGCACCG GCCGGCGCAT CGACCTCCCC ACCTACCCCT TC - #CAACGCGA6240- CCGCTACTGG CTGGACCCCG TTCGCACCGC CGTGACCGGC GTCGAGCCCG CC - #GGCTCGCC6300- GGCGGACGCT CGGGCCACTG AGCGGGGACG GTCGACGACG GCCGGGATCC GC - #TACCGCGT6360- CGCTTGGCAG CCGGCCGTCG TCGACCGCGG CAACCCCGGG CCTGCCGGTC AT - #GTGCTGCT6420- TCTGGCCCCG GACGAGGACA CGGCCGACTC CGGACTCGCC CCCGCGATCG CA - #CGTGAACT6480- CGCCGTGCGC GGGGCCGAGG TCCACACCGT CGCCGTGCCG GTCGGTACAG GC - #CGGGAGGC6540- AGCCGGGGAC CTGTTGCGGG CCGCCGGTGA CGGTGCCGCC CGCAGCACCC GA - #GTTCTGTG6600- GCTCGCCCCG GCCGAGCCGG ACGCGGCCGA CGCCGTCGCC CTCGTCCAGG CG - #CTGGGCGA6660- GGCGGTACCC GAAGCCCCGC TCTGGATCAC CACCCGTGAG GCGGCGGCCG TG - #CGGCCGGA6720- CGAGACCCCT TCCGTCGGGG GCGCTCAGCT GTGGGGACTC GGACAGGTCG CC - #GCGCTCGA6780- ACTGGGGCGG CGCTGGGGCG GCTTGGCGGA CCTGCCCGGG AGTGCGTCGC CC - #GCGGTGCT6840- CCGTACGTTC GTCGGGGCGC TGCTCGCCGG GGGAGAGAAC CAGTTCGCGG TA - #CGGCCCTC6900- CGGCGTCCAT GTCCGCCGTG TGGTTCCCGC GCCCGTCCCC GTCCCGGCCT CC - #GCTCGCAC6960- CGTCACCACG GCCCCCGCCA CCGCCGTCGG CGAGGACGCA CGGAACGACA CC - #TCGGACGT7020- GGTCGTGCCG GACGACCGGT GGTCCTCCGG CACCGTACTG ATCACCGGGG GC - #ACCGGTGC7080- CCTGGGTGCG CAGGTCGCCC GCAGGCTCGC CCGGTCGGGC GCCGCGCGTC TG - #CTCCTGGT7140- GGGCCGGCGC GGCGCGGCCG GCCCCGGAGT GGGCGAACTC GTCGAGGAGC TG - #ACGGCGCT7200- CGGTTCCGAA GTGGCCGTCG AGGCCTGCGA CGTCGCCGAC CGGGACGCAC TG - #GCCGCGCT7260- CCTCGCGGGC CTCCCCGAGG AGCGGCCCCT CGTCGCCGTA CTGCACGCGG CA - #GGTGTGCT7320- CGACGACGGT GTGCTCGACT CGCTCACCTC CGACCGGGTG GACGCCGTAC TG - #CGGGACAA7380- GGTCACCGCC GCCCGTCACC TGGACGAGCT GACCGCGGAC CTTCCGCTCG AC - #GCCTTCGT7440- GCTCTTCTCC TCCATCGTCG GCGTGTGGGG CAACGGAGGG CAGGCCGTCT AC - #GCGGCCGC7500- CAACGCCGCG CTCGACGCCC TGGCGCAGCG GCGCCGGGCC AGGGGAGCCC GT - #GCCGCCTC7560- GATCGCCTGG GGGCCGTGGG CCGGTGCCGG AATGGCCTCC GGAACGGCGG CG - #AAGTCCTT7620- CGAACGGGAC GGCGTCACGG CCCTGGACCC CGAGCGCGCG CTCGACGTCC TC - #GACGACGT7680- GGTGGGCGCC GGCGGGACCT CTGCCGCAGG GACGCACGCG GCCGGCGAGA GC - #TCCCTGCT7740- CGTCGCCGAC GTGGACTGGG AGACCTTCGT CGGGCGTTCG GTCACCCGCC GT - #ACCTGGTC7800- GCTCTTCGAC GGCGTCTCCG CCGCCCGTTC GGCGCGTGCC GGCCATGCCG CG - #GACGACCG7860- TGCCGCTCTC ACCCCAGGGA CGCGGCCGGG CGACGGCGCA CCGGGCGGGA GC - #GGACAGGA7920- CGGGGGCGAG GGCCGGCCGT GGCTCTCCGT CGGCCCCTCG CCGGCGGAAC GC - #CGTCGTGC7980- TCTGCTCACG CTTGTGCGCT CGGAGGCCGC CGGGATCCTG CGCCACGCCT CG - #GCCGACGC8040- GGTCGACCCG GAGCTGGCCT TCCGGTCCGC CGGGTTCGAC TCCCTCACCG TT - #CTCGAACT8100- GCGTAACCGC CTGACCGCTG CCACCGGCCT GAACCTGCCG AACACGCTGC TC - #TTCGACCA8160- CCCGACCCCC CTCTCGCTCG CCTCCCACCT GCACGACGAA CTGTTCGGTC CC - #GACAGCGA8220- GGCGGAGCCG GCAGCGGCCG CCCCCACGCC GGTCATGGCC GACGAGCGTG AG - #CCGATCGC8280- GATCGTGGGC ATGGCGTGCC GTTACCCGGG CGGTGTGGCG TCGCCGGACG AC - #CTGTGGGA8340- CCTGGTGGCC GGTGACGGGC ACACGCTCTC CCCGTTCCCG GCCGACCGTG GC - #TGGGACGT8400- CGAGGGGCTG TACGACCCGG AGCCGGGGGT GCCGGGCAAG AGCTATGTAC GG - #GAAGGCGG8460- GTTCCTGCGT TCCGCGGCCG AGTTCGACGC GGAGTTCTTC GGGATATCGC CG - #CGCGAGGC8520- CACGGCCATG GACCCGCAGC AGCGGTTGCT GCTGGAGACG TCGTGGGAGG CG - #CTGGAGCG8580- GGCCGGCATC GTTCCGGACT CGCTGCGCGG CACCCGGACC GGTGTCTTCA GC - #GGCATCTC8640- CCAGCAGGAC TACGCGACCC AGCTGGGGGA CGCCGCCGAC ACCTACGGCG GG - #CATGTGCT8700- CACGGGGACC CTCGGCAGTG TGATCTCCGG TCGGGTTGCC TATGCGTTGG GG - #TTGGAGGG8760- GCCGGCGCTG ACGGTGGACA CGGCGTGTTC GTCGTCGTTG GTGGCGTTGC AT - #CTGGCGGT8820- GCAGTCGTTG CGGCGGGGTG AGTGTGATCT GGCGTTGGCC GGTGGGGTGA CG - #GTGATGGC8880- GACGCCGACG GTGTTCGTGG AGTTCTCGCG GCAGCGGGGG CTGGCGGCGG AC - #GGGCGGTG8940- CAAGGCGTTC GCGGAGGGTG CGGACGGGAC GGCGTGGGCG GAGGGTGTGG GT - #GTGCTGCT9000- GGTGGAGCGG CTTTCCGACG CGCGCCGCAA CGGTCATCGG GTGCTGGCGG TG - #GTGCGGGG9060- CAGTGCGGTC AATCAGGACG GTGCGAGCAA TGGGCTGACG GCGCCGAGTG GT - #CCGGCGCA9120- GCAGCGGGTG ATCCGTGAGG CGCTGGCTGA TGCGGGGCTG GTGCCCGCCG AC - #GTGGATGT9180- GGTGGAGGCG CACGGTACGG GGACGGCGCT GGGTGATCCG ATCGAGGCGG GT - #GCGCTGCT9240- GGCCACGTAC GGGCGGGAGC GGGTCGGCGA TCCGTTGTGG CTCGGGTCGT TG - #AAGTCGAA9300- CATCGGGCAT GCGCAGGCGG CTGCGGGTGT GGGTGGTGTG ATCAAGGTGG TG - #CAGGGGAT9360- GCGGCATGGG TCGTTGCCGC GGACGCTGCA TGTGGATGCG CCGTCGTCGA AG - #GTGGAGTG9420- GGCTTCGGGT GCGGTGGAGC TGCTGACCGA GACCCGGTCG TGGCCGCGGC GG - #GTGGAGCG9480- GGTGCGGCGG GCCGCGGTGT CGGCGTTCGG GGTGAGCGGG ACCAACGCCC AT - #GTGGTCCT9540- GGAGGAAGCG CCGGCGGAGG CCGGGAGCGA GCACGGGGAC GGCCCTGAAC CT - #GAGCGGCC9600- CGACGCGGTG ACGGGTCCGT TGTCGTGGGT GCTTTCTGCG CGGTCGGAGG GG - #GCGTTGCG9660- GGCGCAGGCG GTGCGGTTGC GTGAGTGTGT GGAGCGGGTG GGTGCGGATC CG - #CGGGATGT9720- GGCGGGGTCG TTGGTGGTGT CGCGTGCGTC GTTCGGTGAG CGTGCGGTGG TG - #GTGGGCCG9780- GGGGCGTGAG GAGTTGCTGG CGGGTCTGGA TGTGGTGGCT GCCGGGGCTC CT - #GTGGGTGT9840- GTCTTCGGGG GCCGGTGCTG TGGTGCGGGG GAGTGCGGTG CGGGGTCGTG GG - #GTGGGGGT9900- GTTGTTCACG GGTCAGGGTG CGCAGTGGGT TGGTATGGGG CGTGGGTTGT AT - #GCGGGGGG9960- TGGGGTGTTT GCGGAGGTGC TGGATGAGGT GTTGTCGGTG GTGGGGGAGG TG - #GATGGTCG10020- GTCGTTGCGG GATGTGATGT TCGCGGATGC TGACTCGGTT TTGGGTGGGT TG - #TTGGGTCG10080- GACGGAGTTT GCTCAGCCTG CGTTGTTTGC GTTGGAGGTG GCGTTGTTCC GG - #GCGTTGGA10140- GGCTCGGGGT GTGGAGGTGT CGGTGGTGTT GGGTCATTCG GTGGGGGAGG TG - #GCTGCTGC10200- GTATGTGGCG GGGGTGTTGT CGTTGGGTGA TGCGGTGCGG TTGGTGGTGG CG - #CGGGGTGG10260- GTTGATGGGT GGGTTGCCGG TGGGTGGGGG GATGTGGTCG GTGGGGGCGT CG - #GAGTCGGT10320- GGTGCGGGGG GTTGTTGAGG GGTTGGGGGA GTGGGTGTCG GTTGCGGCGG TG - #AATGGGCC10380- GCGGTCGGTG GTGTTGTCGG GTGATGTGGG TGTGCTGGAG TCGGTGGTTG TC - #ACGCTGAT10440- GGGGGATGGG GTGGAGTGCC GGCGGTTGGA TGTGTCGCAT GGGTTTCATT CG - #GTGTTGAT10500- GGAGCCGGTG TTGGGGGAGT TCCGGGGGGT TGTGGAGTCG TTGGAGTTCG GT - #CGGGTGCG10560- GCCGGGTGTG GTGGTGGTGT CGGGTGTGTC GGGTGGGGTG GTGGGTTCGG GG - #GAGTTGGG10620- GGATCCGGGG TATTGGGTGC GTCATGCGCG GGAGGCGGTG CGTTTCGCGG AT - #GGGGTGGG10680- GGTGGTGCGT GGTCTGGGTG TGGGGACGTT GGTGGAGGTG GGTCCGCATG GG - #GTGCTGAC10740- GGGGATGGCG GGTCAGTGCC TGGAGGCCGG TGATGATGTG GTGGTGGTGC CG - #GCGATGCG10800- GCGGGGCCGT CCGGAGCGGG AGGTGTTCGA GGCGGCGCTG GCGACGGTGT TC - #ACCCGGGA10860- CGCCGGCCTC GACGCCACGA CACTCCACAC CGGGAGCACC GGCCGACGCA TC - #GACCTCCC10920- CACCTACCCC TTCCAACACA ACCGCTACTG GGCAACCGGC TCAGTGACCG GT - #GCGACCGG10980- CACCTCGGCA GCCGCGCGCT TCGGCCTGGA GTGGAAGGAC CACCCCTTCC TC - #AGCGGCGC11040- CACGCCGATA GCCGGCTCCG GCGCGCTGCT CCTCACCGGC AGGGTGGGGC TC - #GCTGCCCA11100- CCCGTGGCTG GCCGACCACG CCATCTCCGG CACGGTGCTG CTCCCCGGAA CG - #GCGATCGC11160- CGACCTGCTG CTGCGGGCGG TCGAGGAGGT CGGCGCCGGA GGGGTCGAGG AA - #CTGACGCT11220- CCATGAGCCC CTGCTCCTCC CCGAGCGAGG CGGCCTGCAC GTCCAGGTGC TG - #GTCGAGGC11280- GGCCGACGAG CAGGGACGGC GTGCCGTGGC AGTCGCCGCA CGCCCGGAGG GC - #CCTGGGCG11340- GGACGGTGAG GAACAGGAGT GGACCCGGCA CGCGGAAGGC GTGCTCACCT CC - #ACCGAGAC11400- GGCCGTTCCG GACATGGGCT GGGCCGCCGG GGCCTGGCCG CCGCCCGGTG CC - #GAGCCGAT11460- CGACGTCGAG GAGCTGTACG ACGCGTTCGC CGCGGACGGC TACGGCTACG GC - #CCGGCCTT11520- CACCGCACTG TCCGGCGTGT GGCGTCTCGG CGACGAACTC TTCGCCGAGG TG - #CGGCGGCC11580- CGCGGGGGGC GCGGGCACGA CCGGTGACGG TTTCGGCGTC CACCCCGCAC TC - #TTCGATGC11640- GGCCCTCCAC CCGTGGCGCG CCGGCGGGCT GCTGCCCGAC ACGGGCGGCA CC - #ACCTGGGC11700- GCCGTTCTCC TGGCAGGGCA TCGCGCTCCA CACCACCGGA GCCGAGACGC TC - #CGCGTCAG11760- ACTGGCCCCT GCGGCCGGCG GCACCGAGTC GGCCTTCTCC GTACAGGCCG CC - #GACCCGGC11820- GGGCACCCCG GTCCTCACCC TCGACGCACT GCTGCTCCGC CCGGTGACCC TG - #GGGAGGGC11880- CGACGCGCCG CAACCGCTGT ACCGCGTCGA CTGGCAGCCG GTCGGCCAGG GG - #ACCGAGGC11940- CTCCGGCGCC CAGGGCTGGA CGGTGCTCGG GCAGGCCGCG GCCGAGACGG TC - #GCGCAGCC12000- CGCCGCCCAT GCGGACCTCA CCGCCCTGCG TACGGCTGTG GCCGCGGCGG GA - #ACACCCGT12060- GCCCCGGCTG GTGGTCGTGT CGCCGGTGGA CACCCGGCTG GACGAGGGGC CG - #GTGCTGGC12120- GGACGCCGAG GCTCGGGCCC GTGCGGGTGA CGGCTGGGAC GACGATCCCC TA - #CGTGTCGC12180- CCTCGGGCGC GGCCTGACCC TGGTCCGGGA GTGGGTCGAG GACGAACGGT TG - #GCGGACTC12240- CCGGCTCGTC GTCCTCACCC GTGGCGCGGT GGCGGCCGGT CCCGGCGATG TG - #CCGGACCT12300- GACAGGTGCG GCCCTGTGGG GGCTGCTCCG CTCCGCGCAG TCGGAGTATC CG - #GACCGCTT12360- CACCCTCATC GACGTGGACG ATTCCCCCGA GTCCCGTGCG GCTCTGCCCC GG - #GCTCTGGG12420- ATCGGCCGAG CGACAACTCG CCCTGCGGAC GGGCGACGTG CTGGCGCCGG CC - #CTGGTCCC12480- GATGGCCACC CGGCCGGCGG AGACCACTCC AGCGACGGCG GTCGCCTCGG CG - #ACAACACA12540- GACACAGGTC ACCGCGCCCG CTCCCGACGA CCCGGCTGCG GATGCCGTGT TC - #GACCCGGC12600- GGGCACCGTA CTGATCACCG GCGGCACCGG CGCCCTGGGA CGGCGTGTCG CC - #TCGCACCT12660- CGCGCGCCGG TACGGCGTAC GCCACATGCT TCTGGTCAGC AGGCGTGGAC CG - #GACGCCCC12720- CGAGGCCGGT CCCCTGGAAC GGGAACTCGC CGGTCTCGGA GTCACCGCCA CC - #TTCCTGGC12780- ATGCGACCTC ACCGACATCG AGGCCGTACG GAAGGCCGTC GCCGCGGTGC CG - #TCGGACCA12840- CCCGCTGACC GGTGTGGTGC ACACCGCCGG CGTGCTGGAC GACGGCGCCC TG - #ACCGGCCT12900- GACCCGGCAA CGCCTCGACA CCGTGCTGCG GCCCAAGGCC GACGCCGTGC GG - #AACCTCCA12960- CGAGGCGACC CTCGACCGGC CGCTGCGCGC GTTCGTCCTG TTCTCCGCCG CC - #GCCGGACT13020- CCTGGGCCGC CCCGGGCAGG CCTCCTACGC CGCCGCCAAC GCGGTCCTCG AC - #GCGCTCGC13080- GGGAGCCCGC CGCGCGGCCG GACTGCCCGC AGTGTCCCTG GCGTGGGGCC TG - #TGGGACGA13140- GCAGACGGGC ATGGCAGGAG GCCTCGACGA GATGGCCCTG CGCGTGCTGC GC - #CGGGACGG13200- CATCGCCGCG ATGCCTCCGG AGCAGGGGCT CGAACTGCTC GACCTGGCCC TG - #ACCGGACA13260- CCGGGACGGA CCCGCCGTCC TCGTCCCCCT CCTCCTCGAC GGCGCGGCCC TG - #CGCCGCAC13320- GGCGAAGGAG CGCGGCGCGG CCACGATGTC CCCCTTGCTG CGCGCCCTGC TG - #CCCGCCGC13380- CCTGCGCCGC AGCGGTGGAG CCGGCGCCCC CGCGGCGGCC GACCGGCACG GC - #AAGGAGGC13440- GGACCCCGGT GCGGGACGCC TCGCAGGGAT GGTGGCACTC GAAGCGGCGG AG - #CGTTCCGC13500- GGCCGTCCTT GAGCTGGTCA CCGAACAGGT CGCCGAGGTC CTCGGCTACG CG - #TCGGCCGC13560- GGAGATCGAG CCCGAACGAC CCTTCCGGGA GATCGGCGTC GACTCCCTGG CG - #GCGGTGGA13620- GCTGCGCAAC CGGCTCAGCC GTCTGGTCGG CCTGCGGTTG CCGACCACGC TG - #TCCTTCGA13680- CCACCCCACG CCGAAGGACA TGGCGCAGCA CATCGACGGG CAGCTCCCCC GC - #CCGGCCGG13740- AGCCTCGCCC GCGGACGCAG CGCTGGAAGG GATCGGCGAC CTCGCGCGGG CG - #GTCGCCCT13800- GCTGGGCACG GGCGACGCCC GCCGGGCCGA GGTACGAGAG CAGCTCGTCG GA - #CTGCTGGC13860- CGCGCTCGAC CCACCTGGGC GGACGGGCAC CGCCGCACCC GGCGTCCCCT CC - #GGTGCCGA13920- TGGCGCGGAA CCGACCGTGA CGGACCGGCT CGACGAGGCG ACCGACGACG AG - #ATCTTCGC13980- CTTCCTGGAC GAGCAGCTGT GACCACACCG TGGACCGACC GCATGCCGAG GA - #GTTGGTGG14040- CAGCAATGAC CGCCGAGAAC GACAAGATCC GCAGCTACCT GAAGCGTGCC AC - #CGCCGAAC14100- TGCACCGGAC CAAGTCCCGC CTGGCCGAGG TCGAGTCGGC GAGCCGCGAG CC - #GATCGCGA14160- TCGTGGGCAT GGCGTGCCGT TACCCGGGCG GTGTGGCGTC GCCGGACGAC CT - #GTGGGACC14220- TGGTGGCAGC CGGTACGGAC GCGGTCTCCG CGTTCCCCGT CGACCGTGGC TG - #GGACGTCG14280- AGGGGCTGTA CGACCCCGAT CCGGAGGCGG TGGGGCGTAG TTACGTGCGG GA - #GGGCGGGT14340- TCCTGCACTC GGCGGCCGAG TTCGACGCGG AGTTCTTCGG GATCTCGCCC CG - #TGAGGCGG14400- CGGCGATGGA TCCGCAGCAG CGGTTGCTGC TGGAGACGTC GTGGGAGGCG CT - #GGAGCGGG14460- CGGGGATCGT CCCCGCGTCG CTGCGCGGCA CCCGTACCGG CGTCTTCACC GG - #CGTCATGT14520- ACGACGACTA CGGGTCGCGG TTCGACTCGG CTCCGCCGGA GTACGAGGGC TA - #CCTCGTGA14580- ACGGCAGCGC CGGCAGCATC GCGTCCGGTC GGGTTGCCTA TGCGTTGGGG TT - #GGAGGGGC14640- CGGCGCTGAC GGTGGACACG GCGTGTTCGT CGTCGTTGGT GGCGTTGCAT CT - #GGCGGTGC14700- AGTCGTTGCG GCGGGGTGAG TGTGATCTGG CGTTGGCCGG TGGGGTGACG GT - #GATGGCGA14760- CGCCGACGGT GCTCGTGGAG TTCTCGCGGC AGCGGGGGCT GGCGGCGGAC GG - #GCGGTGCA14820- AGGCGTTCGC GGAGGGTGCG GACGGGACGG CGTGGGCCGA GGGTGTGGGC GT - #GCTGCTGG14880- TGGAGCGGCT CTCCGACGCC CGCCGCAATG GCCATCGGGT GCTGGCGGTG GT - #GCGGGGCA14940- GTGCGGTCAA TCAGGACGGT GCGAGCAACG GGCTGACGGC GCCGAGTGGT CC - #TGCGCAGC15000- AGCGGGTGAT CCGTGAGGCG CTGGCCGACG CGGGGCTGAC GCCCGCCGAC GT - #CGACGCGG15060- TCGAGGCGCA CGGCACCGGC ACACCCCTGG GCGACCCCAT CGAGGCGGGT GC - #GTTGCTGG15120- CCACCTATGG CAGTGAGCGC CAGGGCCAAG GTCCGTTGTG GTTGGGGTCG TT - #GAAGTCGA15180- ACATCGGGCA TGCGCAGGCG GCTGCGGGTG TGGGTGGCGT GATCAAGGTG GT - #GCAGGCGA15240- TGCGGCATGG GTCGTTGCCG CGGACGCTGC ATGTGGATGC GCCGTCGTCG AA - #GGTGGAGT15300- GGGCTTCGGG TGCGGTGGAG CTGCTGACCG AGACCCGGTC GTGGCCGCGG CG - #GGTGGAGC15360- GGGTGCGGCG GGCCGCGGTG TCGGCGTTCG GGGTGAGCGG GACCAACGCC CA - #TGTGGTCC15420- TGGAGGAAGC GCCGGCGGAG GCCGGGAGCG AGCACGGGGA CGGCCCTGAA CC - #CGAGCGGC15480- CCGACGCGGT GACGGGTCCG TTGTCGTGGG TGCTTTCTGC GCGGTCGGAG GG - #GGCGTTGC15540- GGGCGCAGGC GGTGCGGTTG CGTGAGTGTG TGGAGCGGGT GGGTGCGGAT CC - #GCGGGATG15600- TGGCGGGGTC GTTGGTGGTG TCGCGTGCGT CGTTCGGTGA GCGTGCGGTG GT - #GGTGGGCC15660- GGGGGCGTGA GGAGTTGCTG GCGGGTCTGG ATGTGGTGGC TGCCGGGGCT CC - #TGTGGGTG15720- TGTCCGGGGG CGTGTCTTCG GGGGCCGGTG CTGTGGTGCG GGGGAGTGCG GT - #GCGGGGTC15780- GTGGGGTGGG GGTGTTGTTC ACGGGTCAGG GTGCGCAGTG GGTTGGTATG GG - #GCGTGGGT15840- TGTATGCGGG GGGTGGGGTG TTTGCGGAGG TGCTGGATGA GGTGTTGTCG GT - #GGTGGGGG15900- AGGTGGGGGG TTGGTCGTTG CGGGATGTGA TGTTCGGCGA CGTCGACGTG GA - #CGCGGGTG15960- CCGGGGCTGA TGCGGGTGTC GGTTCGGGTG TTGGTGTGGG TGGGTTGTTG GG - #TCGGACGG16020- AGTTTGCTCA GCCTGCGTTG TTTGCGTTGG AGGTGGCGTT GTTCCGGGCG TT - #GGAGGCTC16080- GGGGTGTGGA GGTGTCGGTG GTGTTGGGTC ATTCGGTGGG GGAGGTGGCT GC - #TGCGTATG16140- TGGCGGGGGT GTTGTCGTTG GGTGATGCGG TGCGGTTGGT GGTGGCGCGG GG - #TGGGTTGA16200- TGGGTGGGTT GCCGGTGGGT GGGGGGATGT GGTCGGTGGG GGCGTCGGAG TC - #GGTGGTGC16260- GGGGGGTTGT TGAGGGGTTG GGGGAGTGGG TGTCGGTTGC GGCGGTGAAT GG - #GCCGCGGT16320- CGGTGGTGTT GTCGGGTGAT GTGGGTGTGC TGGAGTCGGT GGTTGCCTCG CT - #GATGGGGG16380- ATGGGGTGGA GTGCCGGCGG TTGGATGTGT CGCATGGGTT TCATTCGGTG TT - #GATGGAGC16440- CGGTGTTGGG GGAGTTCCGG GGGGTTGTGG AGTCGTTGGA GTTCGGTCGG GT - #GCGGCCGG16500- GTGTGGTGGT GGTGTCGAGT GTGTCGGGTG GGGTGGTGGG TTCGGGGGAG TT - #GGGGGATC16560- CGGGGTATTG GGTGCGTCAT GCGCGGGAGG CGGTGCGTTT CGCGGATGGG GT - #GGGGGTGG16620- TGCGTGGTCT GGGTGTGGGG ACGTTGGTGG AGGTGGGTCC GCATGGGGTG CT - #GACGGGGA16680- TGGCGGGTGA GTGCCTGGGG GCCGGTGATG ATGTGGTGGT GGTGCCGGCG AT - #GCGGCGGG16740- GCCGTGCGGA GCGGGAGGTG TTCGAGGCGG CGCTGGCGAC GGTGTTCACC CG - #GGACGCCG16800- GCCTGGACGC CACGACACTC CACACCGGGA GCACCGGCCG ACGCATCGAC CT - #CCCCACCT16860- ACCCCTTCCA ACACGACCGC TACTGGCTGG CCGCCCCGTC CCGGCCCAGG AC - #GGACGGGC16920- TGTCGGCGGC GGGTCTGCGC GAGGTGGAGC ACCCCCTGCT CACCGCCGCC GT - #GGAACTGC16980- CCGGCACCGA CACCGAGGTG TGGACCGGCC GCATATCCGC TGCCGACCTG CC - #CTGGCTCG17040- CCGACCACCT GGTGTGGGAC CGAGGCGTGG TGCCGGGGAC CGCGCTGCTG GA - #GACGGTGC17100- TCCAGGTGGG AAGCCGGATC GGTCTGCCGC GCGTCGCCGA ACTGGTCCTG GA - #GACGCCGC17160- TGACCTGGAC GTCGGACCGC CCGCTCCAGG TCCGGATCGT CGTGACCGCT GC - #CGCCACCG17220- CCCCCGGGGG CGCGCGTGAG CTGACCCTCC ACTCGCGGCC CGAGCCCGTG GC - #CGCCTCCT17280- CGTCCTCCCC GAGTCCCGCC TCTCCCCGGC ACCTCACGGC GCAGGAGAGC GA - #CGACGACT17340- GGACCCGGCA TGCCTCAGGG CTGCTCGCCC CGGCTGCCGG CCTCGCCGAC GA - #CTTCGCCG17400- AGCTCACCGG CGCCTGGCCC CCCGTCGGCG CCGAGCCCCT CGACCTCGCC GG - #TCAGTACC17460- CGCTCTTCGC AGCCGCCGGA GTGCGCTACG AAGGCGCCTT CCGAGGGCTG CG - #CGCGGCAT17520- GGCGTCGAGG CGACGAGGTC TTCGCCGACG TACGGCTGCC CGACGCGCAC GC - #GGTCGACG17580- CTGATCGTTA CGGGGTGCAC CCCGCCCTGC TCGACGCGGT GCTCCACCCG AT - #CGCGTCGC17640- TGGACCCGCT GGGCGACGGC GGGCACGGTC TGCTGCCGTT CTCCTGGACC GA - #CGTACAGG17700- GACACGGCGC CGGCGGACAC GCCCTCCGGG TACGGGTGGC GGCCGTCGAC GG - #CGGCGCGG17760- TGTCGGTCAC CGCGGCCGAC CACGCGGGCA ACCCGGTGTT ATCCGCCCGG TC - #CCTGGCAC17820- TGCGTCGTAT CACCGCGGAC CGGCTTCCCG CCGCGCCCGT CGCCCCTCTC TA - #CCGCGTGG17880- ACTGGCTGCC GTTCCCGGGT CCGGTGCCCG TATCCGCGGG CGGCCGCTGG GC - #GGTCGTCG17940- GACCCGAGGC CGAAGCCACG GCTGCCGGAC TGCGTGCGGT GGGCCTCGAC GT - #GCGTACCC18000- ATGCGCTCCC CCTCGGAGAG CCCCTGCCTC CGCAGGCCGG TACCGACGCG GA - #GGTGATCA18060- TCCTCGACCT GACCACCACC GCAGCCGGCC GTACGGCGTC GGACGGGGGG CG - #GCTCAGTC18120- TCCTCGACGA GGTGCGTGCG ACGGTGCGCC GGACCCTCGA AGCCGTACAG GC - #CCGCCTCG18180- CCGACACCGA AACGGCCCCC GACGTCGACG TCCGTACGGC CGCGCGCCCC CG - #CACAGCCG18240- CCCGTACAAG CCCCCGCGTG GACACCCGCA CGGGAGCCCG CACCGCTGAC GG - #CCCCCGGC18300- TCGTCGTCCT GACCCGGGGC GCGGCCGGAC CCGAGGGAGG CGCGGCCGAT CC - #CGCGGGTG18360- CCGCTGTCTG GGGGCTCGTC CGGGTCGCCC AGGCCGAACA GCCCGGCCGC TT - #CACCCTGG18420- TGGACGTCGA CGGCACCCAG GCGTCGCTGC GGGCCCTGCC CGGTCTGCTG GC - #CACGGATG18480- CCGGCCAGTC GGCCGTGCGC GACGGACGTG TCACCGTCCC GCGCCTCGTC CC - #GGTGGCCG18540- ACCCCGTCCC CCACGGCGGC GGCACGGCGG CCGACGGGAC GGGTGCCGGC GA - #GCCGTCCG18600- CGACCCTGGA CCCCGAAGGC ACCGTGCTGA TCACCGGCGG CACCGGAGCA CT - #GGCCGCGG18660- AAACCGCCCG GCACCTGGTC GACCGGCACA AGGTGCGCCA TCTCCTGCTG GT - #GGGCAGGC18720- GCGGTCCCGA CGCACCCGGC GTCGATCGAC TGGTCGCCGA GTTGACCGAG TC - #GGGTGCCG18780- AGGTCGCCGT ACGGGCCTGT GACGTCACGG ACCGCGACGC CCTGCGCCGC CT - #GCTCGACG18840- CACTCCCCGA CGAACACCCG CTGACCTGCG TGGTGCACAC CGCCGGGGTG CT - #CGACGACG18900- GCGTGCTCTC CGCCCAGACG GCCGAGCGGA TCGACACGGT GCTCCGGCCC AA - #GGCCGACG18960- CCGCCGTCCA CCTGGACGAG CTGACCCGGG AGATCGGACG GGTGCCCCTG GT - #GCTGTACT19020- CCTCGGTCTC GGCCACCCTG GGCAGCGCGG GGCAGGCCGG GTACGCGGCG GC - #CAACGCCT19080- TCATGGACGC GCTGGCCGCC CGGCGGTGCG CCGCCGGGCA CCCCGCGCTG TC - #GCTCGGCT19140- GGGGCTGGTG GTCCGGGGTG GGTCTCGCCA CCGGACTGGA CGGAGCGGAC GC - #GGCGCGGG19200- TCAGGCGCTC GGGTCTCGCC CCGCTCGACG CCGGCGCCGC ACTGGACCTG CT - #CGACCGGG19260- CGCTGACCCG GCCCGAGCCG GCCCTGCTGC CCGTGCGGCT CGACCTGCGC GC - #CGCGGCCG19320- GTGCCACCGC TCTCCCGGAG GTCCTGCGTG ACCTGGCCGG CGTACCGGCG GA - #CGCCCGCA19380- GCACGCCCGG GGCCGCGGCG GGCACCGGGG ACGAGGACGG TGCCGTGCGC CC - #TGCCCCCG19440- CCCCGGCCGA CGCCGCCGGG ACGCTGGCCG CGCGGCTCGC GGGACGTTCC GC - #ACCCGAGC19500- GTACGGCTCT CCTGCTCGAC CTGGTGCGGA CCGAGGTCGC GGCGGTGCTC GG - #ACACGGCG19560- ACCCCGCCGC GATCGGCGCC GCCCGCACCT TCAAGGACGC CGGATTCGAC TC - #CCTCACCG19620- CTGTCGACCT CCGCAACCGG CTGAACACAC GCACCGGACT GCGGCTGCCC GC - #GACCCTCG19680- TCTTCGACCA CCCCACACCG CTCGCCCTCG CCGAACTCCT GCTCGACGGG CT - #GGAGGCGG19740- CCGGTCCAGC GGAACCGGCC GCTGAGGTCC CGGACGAAGC GGCCGGTGCC GA - #GACCCTGT19800- CCGGCGTGAT CGACCGGCTG GAACGCAGCC TCGCCGCGAC CGACGACGGC GA - #CGCCCGGG19860- TCCGCGCGGC ACGGCGGCTG CGCGGCCTGC TGGACGCGCT CCCCGCCGGT CC - #CGGTGCCG19920- CGTCCGGTCC GGATGCCGGA GAGCACGCCC CCGGTCGCGG CGACGTGGTG AT - #CGACCGGC19980- TCAGGTCGGC CTCCGACGAC GACTTGTTCG ACCTGCTCGA CAGCGACTTC CA - #GTGAGCCG20040- GACCGCGCCG CGCGCCGACC GCTGAACCGC TCTTCACCCA GACCCACGAG AC - #CACGCCTG20100- AGGAGAACCG TGTCTGCGAC CAACGAGGAG AAGTTGCGGG AGTACCTGCG GC - #GCGCGATG20160- GCCGACCTGC ACAGCGCACG AGAGCGGTTG CGCGAGGTCG AGTCGGCGAG CC - #GTGAGCCG20220- ATCGCGATCG TGGGCATGGC GTGCCGTTAC CCGGGCGGTG TGGCGTCGCC GG - #AGGAGCTG20280- TGGGACCTGG TGGCCGCCGG TACGGACGCG ATCTCCCCGT TCCCCGTCGA CC - #GCGGCTGG20340- GACGCCGAGG GTCTGTACGA CCCGGAGCCG GGGGTGCCGG GCAAGAGCTA CG - #TGCGCGAG20400- GGCGGGTTCC TGCACTCGGC GGCCGAGTTC GACGCGGAGT TCTTCGGGAT CT - #CGCCGCGT20460- GAGGCGGCGG CGATGGATCC GCAGCAGCGG TTGCTGCTGG AGACGTCGTG GG - #AGGCGCTG20520- GAGCGGGCCG GGATCGTCCC CGCGTCGCTG CGCGGCACCC GTACCGGCGT CT - #TCACCGGC20580- GTCATGTACC ACGACTACGG CAGCCACCAG GTCGGCACCG CCGCCGATCC CA - #GTGGACAG20640- CTCGGCCTCG GCACCGCGGG GAGCGTCGCC TCGGGCCGGG TGGCGTACAC CC - #TCGGTCTA20700- CAGGGGCCGG CCGTGACCAT GGACACGGCA TGCTCGTCCT CGCTGGTGGC GT - #TGCACCTG20760- GCGGTGCAGT CGTTGCGGCG GGGCGAGTGC GATCTCGCGT TGGCCGGCGG GG - #CGACGGTC20820- TTGGCGACGC CCACGGTGTT CGTGGAGTTC TCGCGGCAAC GGGGGCTGGC GG - #CGGACGGA20880- CGGTGCAAGG CGTTCGCGGA GGGCGCCGAC GGCACGGCGT GGGCCGAGGG CG - #CCGGTGTG20940- CTGCTGGTGG AGCGGCTCTC CGACGCCCGC CGCAACGGCC ATCGGGTGCT CG - #CGGTGGTG21000- CGGGGCAGCG CGGTCAACCA GGACGGTGCC AGCAACGGCC TCACCGCACC CA - #GCGGGCCC21060- GCCCAGCAGC GGGTGATCCG TGACGCGCTG GCCGACGCGG GGCTGACGCC CG - #CCGACGTG21120- GACGCGGTCG AGGCGCACGG CACCGGCACA CCGCTCGGCG ACCCGATCGA GG - #CCGGCGCG21180- CTGATGGCCA CCTACGGCAG TGAACGGGTG GGCGACCCGC TGTGGCTGGG TT - #CGCTGAAG21240- TCGAACATCG GACACACCCA GGCCGCCGCC GGAGCCGCCG GCGTCATCAA GA - #TGGTGCAG21300- GCGTTACGGC AGTCCGAGCT GCCGCGCACC CTGCACGTCG ACGCGCCCTC GG - #CCAAGGTC21360- GAATGGGACG CGGGCGCCGT GCAACTGCTC ACCGGCGTCC GGCCATGGCC CC - #GGCGCGAG21420- CACAGGCCCC GGCGGGCCGC GGTCTCCGCC TTCGGCGTCA GCGGCACCAA CG - #CCCACGTC21480- ATCATCGAGG AACCGCCCGC GGCCGGTGAC ACCTCGCCCG CCGGCGACAC CC - #CTGAGCCG21540- GGCGAGGCGA CCGCGTCCCC CTCCACCGCG GCCGGGCCGT CGTCCCCCTC CG - #CGGTGGCC21600- GGGCCGCTGT CCCCCTCCTC CCCGGCCGTG GTCTGGCCCC TGTCCGCCGA GA - #CCGCCCCC21660- GCCCTGCGCG CCCAGGCCGC CCGCCTGCGG GCGCACCTCG AACGCCTCCC CG - #GCACCTCG21720- CCGACCGACA TCGGCCACGC CCTGGCCGCC GAACGCGCCG CCCTCACCCG AC - #GCGTCGTG21780- CTGCTCGGCG ACGACGGAGC CCCGGTCGAC GCACTCGCCG CCCTCGCCGC CG - #GCGAGACC21840- ACCCCCGACG CCGTCCACGG CACCGCGGCG GACATCCGCC GGGTCGCCTT CG - #TGTTCCCC21900- GGCCAGGGTT CCCAGTGGGC CGGGATGGGC GCCGAACTGC TGGACACGGC CC - #CGGCCTTC21960- GCCGCCGAAC TGGACCGCTG CCAGGGCGCG CTCTCCCCGT ACGTGGACTG GA - #ACCTCGCG22020- GACGTGCTGC GCGGCGCGCC CGCGGCGCCC GGCCTCGACC GGGTCGACGT CG - #TCCAGCCG22080- GCCACCTTCG CCGTCATGGT GGGACTCGCC GCGCTGTGGC GCTCCCTCGG GG - #TCGAACCC22140- GCCGCCGTCA TCGGCCACTC CCAGGGCGAG ATCGCCGCGG CCTGCGTGGC GG - #GCGCGCTC22200- TCCCTGGAGG ACGCCGCCCG GATCGTGGCC CTGCGCTCCC AGGTCATCGC CC - #GCGAACTG22260- GCCGGGCGGG GCGGCATGGC CTCGGTGGCC CTGCCCGCGG CGGAGGTCGA GG - #CCCGCCTG22320- GCCGGCGGCG TCGAGATCGC CGCCGTCAAC GGCCCCGGCT CGACCGTCGT CT - #GCGGAGAG22380- CCCGGCGCCC TGGAGGCGTT GCTCGTCACG CTGGAGAGCG AAGGCACCCG GG - #TCCGCCGC22440- ATCGACGTCG ACTACGCGTC CCACTCCCAC TACGTCGAGA GCATCCGGGC GG - #AACTCGCC22500- ACCGTCCTCG GCCCCGTCCG GCCGCGGAGG GGCGACGTGC CCTTCTACTC CA - #CCGTCGAG22560- GCGGCGCTCC TCGACACCGC CACCCTGGAC GCCGACTACT GGTACCGCAA CC - #TGCGCCTC22620- CCGGTGCGCT TCGAGCCGAC CGTACGCGCC ATGCTCGACG ACGGCGTCGA CG - #CGTTCGTG22680- GAGTGCTCCG CGCATCCCGT CCTGACCGTC GGCGTGCGCC AGACCGTGGA GA - #GCGCCGGC22740- GGCGCGGTCC CGGCCCTCGC TTCGCTGCGC CGCGACGAGG GCGGGCTGCG GC - #GCTTCCTC22800- ACCTCCGCCG CCGAGGCCCA GGTCGTCGGC GTCCCCGTGG ACTGGGCGAC GC - #TCCGCCCA22860- GGCGCCGGCC GGGTGGACCT GCCGACCTAC GCCTTCCAGC GCGAACGCCA CT - #GGGTCGGC22920- CCCGCCCGGC CCGACTCCGC GGCGACGGCC GCCACGACCG GTGACGACGC CC - #CGGAGCCC22980- GGAGACCGGC TCGGCTACCA CGTCGCGTGG AAGGGACTGC GCTCCACCAC CG - #GCGGCTGG23040- CGCCCCGGCC TGCGCCTGCT GATCGTGCCC ACCGGGGACC AGTACACCGC CC - #TCGCCGAC23100- ACCCTGGAAC AGGCGGTCGC CTCCTTCGGC GGAACGGTCC GCCGCGTCGC CT - #TCGACCCG23160- GCACGCACCG GACGCGCCGA GCTGTTCGGC CTGCTCGAGA CGGAGATCAA CG - #GCGACACC23220- GCCGTCACCG GCGTCGTCTC GCTGCTCGGA CTGTGCACCG ACGGCAGGCC GG - #ACCACCCC23280- GCCGTGCCCG TCGCCGTCAC CGCCACCCTC GCCCTCGTCC AGGCCCTGGC CG - #ACCTCGGC23340- AGCACCGCAC CGCTGTGGAC CGTCACCTGC GGCGCGGTCG CCACCGCCCC CG - #ACGAACTG23400- CCGTGCACCG CCGGTGCCCA GCTGTGGGGC CTGGGCCGGG TGGCCGCGCT GG - #AGCTGCCC23460- GAGGTGTGGG GCGGCCTCAT CGACCTTCCC GCGCGGCCCG ACGCCCGGGT CC - #TGGACCGT23520- CTCGCCGGCG TCCTCGCCGA ACCCGGCGGC GAGGACCAGA TCGCCGTACG GA - #TGGCGGGC23580- GTCTTCGGCC GCCGGGTCCT GCGGAACCCG GCCGACTCCC GGCCCCCGGC CT - #GGCGCGCC23640- CGGGGCACCG TCCTCATCGC CGGCGACCTC ACGACGGTGC CCGGCCGACT GG - #TCCGGTCC23700- CTCCTCGAGG ACGGCGCGGA CCGCGTGGTG CTGGCCGGAC CCGACGCCCC CG - #CACAGGCC23760- GCCGCCGCCG GACTGACCGG CGTCTCCCTC GTCCCCGTGC GCTGCGACGT CA - #CCGACCGC23820- GCCGCACTGG CCGCGCTGCT CGACGAGCAC GCGCCCACCG TCGCCGTGCA CG - #CCCCGCCC23880- CTGGTGCCCC TGGCGCCGCT GCGGGAGACG GCACCCGGCG ACATCGCCGC CG - #CCCTCGCC23940- GCCAAGACCA CGGCCGCCGG CCACCTGGTC GACCTGGCGC CGGCCGCGGG CC - #TCGACGCG24000- CTGGTGCTGT TCTCCTCGGT CTCCGGAGTG TGGGGCGGCG CGGCCCAGGG CG - #GCTACGCG24060- GCCGCCAGCG CGCACCTCGA CGCGCTGGCC GAACGCGCCC GCGCCGCGGG GG - #TGCCCGCG24120- TTCTCCGTGG CCTGGAGCCC CTGGGCCGGA GGCACGCCCG CCGACGGTGC CG - #AGGCGGAG24180- TTCCTCAGCC GGCGCGGGCT GGCTCCCCTC GACCCCGACC AGGCGGTGCG GA - #CCCTGCGC24240- CGCATGCTGG AGCGCGGCAG CGCCTGCGGT GCGGTCGCCG ACGTCGAGTG GA - #GCCGGTTC24300- GCCGCCTCCT ACACCTGGGT GCGTCCCGCC GTACTCTTCG ACGACATCCC GG - #ACGTGCAG24360- CGGCTGCGCG CGGCCGAACT CGCCCCGAGC ACCGGAGACT CGACCACCTC CG - #AACTCGTC24420- CGCGAGCTGA CCGCGCAGTC CGGCCACAAG CGGCACGCCA CCCTGCTGCG GC - #TGGTGCGC24480- GCACACGCCG CCGCCGTCCT CGGACAGTCC TCCGGCGACG CGGTGAGCAG CG - #CCCGCGCC24540- TTCCGCGACC TCGGCTTCGA CTCGCTGACC GCCCTCGAAC TGCGCGACCG GC - #TCAGCACC24600- AGCACCGGGC TCAAACTGCC CACCTCCCTG GTCTTCGACC ACTCCAGCCC GG - #CCGCGCTC24660- GCCCGGCACC TCGGTGAGGA ACTCCTCGGC CGGAACGACA CCGCCGACCG GG - #CCGGCCCC24720- GACACCCCGG TACGGACGGA CGAGCCCATC GCCATCATCG GCATGGCCTG CC - #GGCTGCCC24780- GGCGGGGTGC AGTCCCCCGA GGACCTGTGG GACCTGCTGA CCGGTGGGAC CG - #ACGCCATC24840- ACCCCCTTCC CGACCAACCG GGGATGGGAC AACGAGACCC TCTACGACCC CG - #ACCCCGAC24900- TCGCCCGGGC ACCACACCTA CGTGCGCGAG GGCGGGTTCC TGCACGACGC GG - #CCGAGTTC24960- GACCCCGGCT TCTTCGGCAT CAGCCCCCGC GAGGCCCTGG CCATGGACCC GC - #AGCAGCGG25020- CTGATCCTGG AGACGTCCTG GGAGTCCTTC GAACGGGCCG GCATCGACCC GG - #TCGAACTG25080- CGCGGCAGCC GCACCGGGGT CTTCGTCGGC ACCAACGGAC AGCACTACGT GC - #CGCTCCTC25140- CAGGACGGCG ACGAGAACTT CGACGGCTAC ATCGCCACCG GCAACTCCGC CA - #GCGTGATG25200- TCCGGCCGGC TCTCCTACGT CTTCGGACTG GAGGGCCCCG CCGTCACCGT CG - #ACACCGCC25260- TGCTCGGCCT CCCTGGCCGC ACTGCACCTG GCGGTGCAGT CACTGCGCCG CG - #GCGAATGC25320- GACTACGCCC TCGCCGGCGG GGCCACGGTG ATGTCCACCC CCGAGATGCT GG - #TGGAGTTC25380- GCCCGTCAGC GAGCGGTGTC GCCGGACGGC CGCAGCAAGG CGTTCGCGGA GG - #CGGCCGAC25440- GGGGTCGGTC TCGCCGAGGG AGCCGGGATG CTGCTCGTGG AGCGGCTGTC GG - #AGGCGCAG25500- AAGAAGGGCC ATCCGGTACT GGCGGTGGTG CGGGGCAGTG CCGTCAACCA GG - #ACGGTGCC25560- AGCAACGGCC TCACCGCACC CAGCGGGCCC GCCCAGCAGC GGGTGATACG GG - #AGGCGCTG25620- GCCGACGCGG GGCTGACGCC CGCCGACGTG GACGCGGTCG AGGCGCACGG CA - #CCGGCACG25680- CCGCTCGGCG ACCCCATCGA GGCCGGCGCG CTGCTCGCCA CGTACGGCCG GG - #ACCGGCGC25740- GACGGCCCGC TGTGGCTGGG TTCGCTGAAG TCGAACATCG GGCACACCCA GG - #CCGCCGCC25800- GGCGTGGCCG GGGTGATCAA GATGGTGCTG GCGCTGCGCC ACGGCGAGCT GC - #CGCGCACC25860- CTGCACGCGT CGACGGCGTC GTCCAGGATC GATTGGGACG CGGGCGCCGT GG - #AGTTGCTG25920- GACGAGGCCA GGCCCTGGCT CCAGCGGGCC GAGGGGCCGC GCCGGGCGGG CA - #TCTCCTCG25980- TTCGGCATCA GCGGCACCAA CGCGCACCTC GTCATCGAGG AGCCGCCGGA GC - #CCACCGCG26040- CCCGAACTGC TCGCGCCCGA ACCGGCCGCC GACGGCGACG TCTGGTCCGA GG - #AGTGGTGG26100- CACGAGGTGA CCGTGCCCCT GATGATGTCC GCGCACAACG AAGCCGCCCT GC - #GCGACCAG26160- GCGCGGCGCC TGCGCGCCGA CCTGCTCGCC CACCCCGAGC TGCACCCGGC CG - #ACGTCGGC26220- TACACCCTCA TCACCACCCG CACCCGGTTC GAGCAGCGGG CCGCCGTCGT CG - #GCGAGAAC26280- TTCACGGAGC TGATCGCGGC CCTCGACGAC CTCGTCGAAG GCCGACCGCA CC - #CGCTCGTG26340- CTGCGGGGCA CCGCCGGCAC CTCCGACCAG GTCGTGTTCG TCTTCCCCGG CC - #AGGGCTCG26400- CAGTGGCCCG AGATGGCCGA CGGGCTGCTG GCCCGCTCCA GCGGCTCCGG CT - #CCTTCCTG26460- GAGACCGCCC GCGCCTGCGA CCTCGCGCTC CGGCCCCACC TCGGCTGGTC CG - #TCCTGGAC26520- GTACTGCGCC GGGAACCCGG CGCGCCCTCG CTCGACCGGG TCGACGTGGT GC - #AGCCCGTG26580- CTGTTCACCA TGATGGTCTC GCTCGCCGAG ACGTGGCGTT CGCTGGGCGT CG - #AACCGGCC26640- GCGGTCGTCG GTCACTCCCA GGGCGAGATC GCCGCCGCCT ACGTCGCCGG CG - #CCCTGACG26700- CTGGACGACG CGGCGCGCAT CGTCGCCCTG CGCAGCCAGG CGTGGCTGCG GC - #TGGCCGGC26760- AAGGGCGGCA TGGTCGCCGT GACCCTGTCC GAACGCGACC TGCGTCCCCG CC - #TGGAGCCC26820- TGGAGCGACC GGCTCGCCGT CGCCGCCGTC AACGGCCCCG AGACCTGCGC CG - #TCTCCGGG26880- GACCCGGACG CCCTGGCGGA GCTGGTCGCC GAACTCGGTG CGGAGGGCGT GC - #ACGCCCGC26940- CCCATCCCCG GCGTCGACAC CGCCGGGCAC TCGCCGCAGG TCGACACGCT GG - #AGGCCCAC27000- CTGCGGAAGG TGCTCGCGCC CGTCGCGCCC CGCACCTCCG ACATCCCGTT CT - #ACTCGACG27060- GTCACCGGAG GACTGATCGA CACCGCCGAG CTGGACGCCG ACTACTGGTA CC - #GCAACATG27120- CGCGAGCCGG TGGAGTTCGA GCAGGCCACC CGCGCCCTGA TCGCCGACGG CC - #ACGACGTG27180- TTCCTGGAGT CGAGCCCGCA CCCCATGCTG GCCGTCTCCC TCCAGGAGAC GA - #TCAGCGAC27240- GCCGGTTCCC CGGCGGCCGT CCTCGGCACC CTGCGGCGCG GCCAGGGCGG CC - #CCCGCTGG27300- CTGGGCGTCG CCCTCTGCCG CGCCTACACC CACGGCCTGG AGATCGACGC CG - #AGGCCATC27360- TTCGGCCCCG ACTCACGCCA GGTGGAACTG CCCACGTACC CCTTCCAGCG CG - #AGCGCTAC27420- TGGTACAGCC CCGGCCACCG CGGTGACGAC CCCGCCTCCC TCGGTCTGGA CG - #CCGTCGAC27480- CACCCGCTGC TGGGCAGCGG CGTCGAACTG CCGGAGTCCG GTGACCGGAT GT - #ACACCGCA27540- CGGCTGGGCG CCGACACCAC CCCGTGGCTG GCCGACCACG CGCTGCTGGG GT - #CGCCGCTG27600- CTGCCCGGCG CCGCCTTCGC CGACCTGGCG CTCTGGGCCG GCCGCCAGGC CG - #GCACCGGC27660- CGCGTCGAGG AGCTCACCCT GGCCGCGCCC CTGGTGCTGC CCGGCTCCGG GG - #GTGTCCGG27720- CTGCGGCTGA ACGTCGGCGC CCCGGGCACC GACGACGCCC GCCGCTTCGC CG - #TGCACGCC27780- CGCGCCGAGG GCGCCACGGA CTGGACCCTG CACGCCGAGG GGCTGCTCAC CG - #CGCAGGAC27840- ACGGCCGACG CGCCGGACGC CTCGGCGGCC ACCCCGCCCC CCGGCGCCGA AC - #AACTGGAC27900- ATCGGCGACT TCTACCAGCG CTTCTCCGAA CTCGGTTACG GCTACGGCCC GT - #TCTTCCGG27960- GGACTGGTGA GCGCCCACCG CTGCGGCCCC GACATCCACG CGGAGGTCGC GC - #TGCCCGTC28020- CAGGCGCAGG GCGACGCGGC CCGCTTCGGC ATCCATCCCG CGCTGCTGGA CG - #CGGCGCTG28080- CAGACCATGA GCCTCGGGGG CTTCTTCCCC GAGGACGGCC GCGTCCGCAT GC - #CGTTCGCC28140- CTGCGCGGCG TTCGGCTGTA CCGCGCCGGA GCCGACCGGC TGCACGTGCG CG - #TCTCGCCC28200- GTCTCCGAGG ACGCGGTCCG CATCAGGTGC GCCGACGGCG AGGGACGGCC GG - #TCGCCGAG28260- ATCGAGTCCT TCATCATGCG GCCGGTCGAC CCGGGACAGC TCCTGGGCGG CC - #GCCCGGTC28320- GGCGCCGACG CGCTCTTCCG CATCGCCTGG CGGGAACTCG CCGCCGGCCC GG - #GCACCCGT28380- ACCGGCGACG GCACCCCTCC CCCGGTGCGC TGGGTGCTGG CGGGACCCGA CG - #CGCTGGGC28440- CTGGCCGAGG CGGCCGACGC CCACCTGCCC GCCGTTCCCG GCCCGGACGG CG - #CACTGCCG28500- TCCCCGACGG GACGCCCGGC GCCGGACGCC GTCGTGTTCG CGGTCCGTGC CG - #GGACCGGC28560- GACGTCGCCG CCGACGCGCA CACCGTGGCC TGCCGGGTGC TGGACCTCGT CC - #AGCGCCGG28620- CTCGCGGCCC CGGAGGGCCC GGACGGCGCC CGCCTGGTGG TGGCCACCCG CG - #GCGCGGTC28680- GCCGTACGCG ACGACGCCGA GGTGGACGAC CCGGCCGCGG CCGCCGCGTG GG - #GCCTGCTG28740- CGCTCCGCGC AGGCCGAGGA GCCCGGCCGG TTCCTGCTCG TGGACCTGGA CG - #ACGACCCG28800- GCGTCCGCCC GGGCGCTGAC CGACGCCCTC GCCTCCGGCG AACCGCAGAC CG - #CGGTCCGG28860- GCCGGGACGG TGTACGTGCC CCGGCTGGAG CGGGCCGCCG ACCGCACGGA CG - #GGCCGCTC28920- ACCCCGCCCG ACGACGGTGC CTGGCGGCTG GGCCGGGGCA CCGACCTCAC CC - #TCGACGGC28980- CTCGCCCTGG TGCCCGCCCC GGACGCCGAG GCGCCGCTGG AGCCCGGCCA GG - #TGCGCGTC29040- GCCGTACGCG CCGCGGGCGT CAACTTCCGC GACGCCCTCA TCGCCCTCGG CA - #TGTACCCG29100- GGCGAGGCGG AGATGGGAAC GGAGGGCGCC GGCACCGTCG TCGAGGTCGG CC - #CCGGCGTC29160- ACCGGTGTCG CCGTCGGCGA CCGCGTGCTC GGCCTGTGGG ACGGCGGCCT GG - #GCCCGCTG29220- TGCGTGGCCG ACCACCGGCT GCTCGCCCCC GTCCCGGACG GCTGGTCCTA CG - #CCCAGGCC29280- GCCTCGGTCC CCGCGGTGTT CCTCAGCGCC TACTACGGTC TGGTCACCCT GG - #CCGGCCTC29340- AGGCCGGGGG AGCGGGTGCT CGTGCACGCC GCCGCCGGGG GCGTCGGCAT GG - #CCGCGGTG29400- CAGATCGCCC GCCACCTCGG CGCGGAGGTG CTGGCCACCG CGAGCCCCGG CA - #AGTGGGAC29460- GCCCTGCGCG CCATGGGCAT CACCGACGAC CACCTCGCCT CCTCCCGCAC CC - #TCGACTTC29520- GCGACCGCCT TCACCGGAGC GGACGGCACG TCCCGCGCGG ACGTCGTCCT GA - #ACTCGCTC29580- ACCAAGGAGT TCGTGGACGC CTCCCTCGGG CTGCTCCGTC CGGGCGGCCG GT - #TCCTGGAG29640- CTGGGCAAGA CCGACGTCCG GGACCCCGAG CGGATCGCCG CCGAACACCC CG - #GGGTGCGC29700- TACCGGGCGT TCGACCTCAA CGAGGCCGGA CCCGACGCAC TCGGCCGGCT GC - #TGCGGGAA29760- CTGATGGACC TGTTCGCCGC CGGCGTGCTG CACCCGCTGC CCGTCGTCAC CC - #ACGACGTG29820- CGCCGGGCCG CGGACGCCCT GCGCACCATC AGCCAGGCCC GGCACACCGG AA - #AGCTCGTC29880- CTGACCATGC CGCCCGCCTG GCACCCGTAC GGCACGGTCC TGGTCACCGG TG - #GCACCGGC29940- GCCCTCGGCA GCCGCATCGC CCGCCACCTG GCGAGCCGGC ACGGCGTCCG CC - #GGCTGCTG30000- ATCGCCGCCC GCCGGGGCCC GGACGGCGAG GGCGCCGCGG AGCTGGTCGC CG - #ACCTCGCC30060- GCCCTGGGCG CGTCGGCCAC CGTGGTCGCC TGCGACGTCT CCGACGCGGA CG - #CCGTCCGC30120- GGACTGCTCG CCGGCATACC GGCCGATCAC CCGCTGACGG CGGTGGTGCA CA - #GCACCGGC30180- GTCCTCGACG ACGGCGTGCT GCCCGGGCTC ACCCCCGAGC GGATGCGGCG CG - #TGCTGCGG30240- CCCAAGGTGG AGGCCGCCGT CCACCTGGAC GAACTCACCC GCGACCTCGA CC - #TGTCGGCG30300- TTCGTCCTCT TCTCCTCCAG CGCCGGTCTG CTGGGCAGCC CGGCCCAGGG CA - #ACTACGCG30360- GCGGCCAACG CCACCCTCGA CGCCCTCGCC GCCCGGCGCC GGTCCCTCGG CC - #TCCCGTCG30420- GTGTCACTCG CCTGGGGTCT GTGGTCCGAC ACCAGCCGGA TGGCACACGC AC - #TGGACCAG30480- GAGAGCCTCC AGCGGCGCTT CGCCCGCAGC GGCTTCCCGC CCCTGTCCGC CA - #CGCTGGGC30540- GCCGCGCTGT TCGACGCCGC CCTGCGGGTC GACGAGGCCG TGCAGGTCCC CA - #TGCGGTTC30600- GACCCGGCCG CGCTGCGCGC CACCGGAAGC GTCCCCGCCC TGCTGTCGGA CC - #TCGTCGGG30660- TCCGCCCCGG CGACCGGGTC CGCGGCCCCG GCGTCCGGCC CCCTTCCGGC TC - #CGGACGCC30720- GGGACCGTCG GCGAGCCGCT CGCCGAGCGG TTGGCCGGAC TCTCCGCCGA GG - #AACGCCAC30780- GACCGGCTGC TCGGCCTGGT CGGCGAACAC GTGGCCGCGG TACTGGGCCA CG - #GCTCCGCC30840- GCCGAGGTCC GGCCCGACCG GCCGTTCCGC GAGGTCGGGT TCGACTCGCT CA - #CGGCCGTG30900- GAACTGCGCA ACCGGATGGC GGCGGTCACC GGGGTCAGGC TCCCCGCCAC CC - #TGGTCTTC30960- GACCACCCCA CCCCCGCCGC GCTGTCCTCG CACCTCGACG GCCTGCTGGC CC - #CGGCACAG31020- CCGGTCACCA CCACACCGCT GCTGTCCGAA CTGGACCGCA TCGAGGAGGC CC - #TGGCCGCC31080- CTCACCCCCG AGCACCTCGC GGAGCTCGCC CCCGCCCCCG ACGACCGGGC CG - #AGGTCGCC31140- CTGCGCCTGG ACGCCCTGGC CGACCGCTGG CGCGCCCTGC ACGACGGCGC GC - #CCGGCGCC31200- GACGACGACA TCACCGACGT GCTGAGCAGC GCCGACGACG ACGAGATCTT CG - #CGTTCATC31260- GACGAGCGGT ACGGCACGTC GTGACCGCCG GCCCGGAGCC CCGCCCGTCA TC - #GAAAGGAA31320- GCACCACCAT GGCGAACGAA GAGAAGCTGC GCGCCTACCT CAAGCGCGTG AC - #GGGTGAGC31380- TGCACCGGGC CACCGAGCAG CTGCGTGCCC TGGACCGGCG GGCCCACGAG CC - #GATCGCGA31440- TCGTCGGGGC GGCCTGCCGA CTCCCCGGCG GCGTCGAGAG TCCGGACGAC CT - #GTGGGAGC31500- TGCTGCACGC CGGTGCCGAC GCGGTCGGCC CGGCCCCCGC CGACCGCGGC TG - #GGACGTGG31560- AGGGAAGGTA CTCGCCCGAC CCCGACACGC CCGGCACCTC GTACTGCCGC GA - #GGGCGGCT31620- TCGTGCAGGG GGCCGACCGG TTCGACCCCG CCCTCTTCGG CATCTCGCCC AA - #CGAGGCGC31680- TCACCATGGA CCCCCAGCAG CGGCTGCTGC TGGAGACCTC CTGGGAGGCG CT - #GGAGCGAG31740- CCGGTCTGGA CCCCCAGTCC CTGGCGGGCA GCCGGACCGG CGTGTTCGCC GG - #GGCGTGGG31800- AGAGCGGCTA CCAGAAGGGC GTCGAAGGGC TCGAAGCCGA TCTGGAGGCC CA - #ACTCCTGG31860- CCGGCATCGT CAGCTTCACC GCCGGCCGCG TCGCCTACGC CCTGGGCCTG GA - #GGGCCCGG31920- CGCTGACGAT CGACACGGCC TGCTCCTCGT CGCTGGTGGC ACTGCACCTG GC - #GGTGCAGT31980- CACTGCGCCG GGGCGAGTGC GACCTCGCAC TGGCGGGCGG CGCCACGGTC AT - #CGCCGACT32040- TCGCGCTCTT CACCCAGTTC TCCCGGCAGC GCGGGCTCGC CCCCGACGGG CG - #GTGCAAGG32100- CCTTCGGTGA GACGGCCGAC GGCTTCGGCC CCGCCGAGGG CGCGGGGATG CT - #GCTGGTCG32160- AGCGGCTGTC GGACGCCCGC CGCAACGGGC ACCCGGTGCT GGCGGTGGTG CG - #GGGCAGTG32220- CCGTCAACCA GGACGGTGCG AGCAATGGGC TGACGGCGCC GAGTGGTCCT GC - #GCAGCAGC32280- GGGTGATCCG TGAGGCGCTG GCCGACGCGG GGCTGACGCC CGCCGACGTG GA - #CGCGGTCG32340- AGGCGCACGG CACCGGCACG CCGCTCGGCG ACCCCATCGA GGCCGGCGCG CT - #CATGGCGA32400- CGTACGGGCA CGAACGGACG GGCGACCCGC TGTGGCTGGG TTCGCTGAAG TC - #GAACATCG32460- GGCACACCCA GGCCGCCGCC GGCGTGGCCG GGGTGATCAA GATGGTGCTG GC - #GCTGCGCC32520- ACGGTGAGCT GCCGCGCACC CTGCACGCGT CGACGGCGTC CTCCAGGATC GA - #ATGGGACG32580- CGGGCGCCGT GGAGTTGCTG GACGAGGCCA GGCCCTGGCC CCGGCGTGCC GA - #GGGGCCGC32640- GCCGGGCGGG CATCTCCTCG TTCGGCATCA GCGGCACCAA CGCGCACCTC GT - #CATCGAGG32700- AGGAGCCGCC CGCCCGGCCG GAGCCCGAGG AGGCCGCGCA GCCGCCCGCC CC - #GGCCACCA32760- CCGTCCTCCC GCTGTCGGCC GCCGGCGCGC GATCCCTGCG CGAGCAGGCC CG - #CAGGCTCG32820- CCGCGCACCT GGCCGGCCAC GAGGAGATCA CCGCCGCCGA CGCCGCCCGC TC - #CGCCGCCA32880- CCACCCGTGC CGCGCTCTCG CACCGGGCCT CGGTCCTGGC CGACGACCGG CG - #GGCGCTGA32940- TCGACAGGCT GACCGCGCTG GCGGAGGACA GGAAGGACCC CGGCGTCACC GT - #CGGCGAGG33000- CGGGCAGCGG CCGGCCCCCC GTCTTCGTCT TCCCGGGACA GGGCTCCCAG TG - #GACGGGCA33060- TGGGCGCCGA ACTCCTGGAC AGGGCACCGG TCTTCCGCGC CAAGGCCGAG GA - #GTGCGCGC33120- GGGCCCTCGC GGCCCACCTC GACTGGTCGG TGCTCGACGT CCTGCGCGAC GC - #GCCCGGCG33180- CCCCGCCGAT CGACCGCGCG GACGTCGTCC AGCCGACCCT GTTCACCATG AT - #GGTCTCCC33240- TCGCGGCGCT GTGGGAGTCC CACGGTGTAC GGCCCGCCGC CGTGGTCGGC CA - #CTCCCAAG33300- GCGAGATCGC CGCCGCCCAC GCGGCCGGTG CCCTGTCCCT CGACGACGCG GC - #CCGCGTGA33360- TCGCCGAGCG CAGCAGGCTC TGGAAGCGGC TGGCCGGAAA CGGCGGCATG CT - #CTCCGTGA33420- TGGCCCCGGC CGACCGGGTC CGCGAACTGA TGGAGCCCTG GGCGGAGCGG AT - #GTCCGTGG33480- CCGCCGTCAA CGGCCCCGCC TCGGTCACCG TGGCCGGTGA CGCGCGGGCG CT - #GGAGGAGT33540- TCGGCGGCCG GCTCTCCGCC GCCGGGGTGC TGCGCTGGCC CCTCGCCGGC GT - #CGACTTCG33600- CCGGACACTC ACCCCAGGTG GAGCAGTTCC GCGCCGAGCT CCTCGACACG CT - #GGGCACCG33660- TCCGCCCGAC CGCCGCCCGG CTGCCCTTCT TCTCCACCGT GACCGCCGCG GC - #GCACGAGC33720- CCGAAGGCCT GGACGCCGCG TACTGGTACC GGAACATGCG CGAACCCGTG GA - #GTTCGCGT33780- CCACCCTGCG GACGCTGCTG CGCGAGGGCC ACCGCACCTT CGTCGAGATG GG - #CCCGCACC33840- CCCTGCTGGG CGCCGCGATC GACGAGGTCG CCGAGGCCGA GGGCGTGCAC GC - #CACCGCCC33900- TCGCCACCCT CCACCGCGGC TCCGGCGGCC TGGACCGGTT CCGCTCCTCG GT - #GGGCGCCG33960- CGTTCGCCCA CGGAGTACGG GTCGACTGGG ACGCCCTCTT CGAGGGCTCC GG - #CGCCCGCC34020- GGGTCCCGCT GCCCACCTAC GCCTTCAGCC GGGACCGGTA CTGGCTGCCC AC - #CGCCATCG34080- GCCGGCGCGC CGTCGAGGCG GCCCCCGTCG ACGCGTCCGC CCCCGGGCGC TA - #CCGCGTCA34140- CCTGGACACC CGTGGCATCC GACGACTCCG GCCGGCCCTC CGGGCGCTGG CT - #GCTGGTGC34200- AGACCCCCGG CACCGCGCCG GACGAGGCGG ACACCGCGGC GTCGGCCCTC GG - #TGCGGCCG34260- GGGTGGTCGT GGAGCGCTGC CTGCTGGATC CCACCGAGGC CGCGCGCGTC AC - #GCTCACCG34320- AGCGACTGGC CGAACTGGAC GCGCAGCCGG AGGGCCTGGC CGGCGTGCTG GT - #GCTGCCCG34380- GCCGTCCGCA GAGCACCGCA CCGGCCGACG CCTCCCCGCT CGACCCGGGG AC - #GGCCGCCG34440- TCCTGCTCGT GGTCCAGGCC GTGCCGGACG CCGCTCCGAA GGCCCGGATC TG - #GGTGGTGA34500- CGCGGGGTGC GGTGGCGGTG GGGTCGGGTG AGGTGCCGTG TGCGGTGGGT GC - #GCGGGTGT34560- GGGGTCTGGG GCGGGTGGCT GCGTTGGAGG TGCCGGTGCA GTGGGGTGGG TT - #GGTGGATG34620- TGGCGGTGGG GGCGGGTGTG CGTGAGTGGC GTCGTGTGGT GGGTGTGGTT GC - #GGGGGGTG34680- GTGAGGATCA GGTGGCGGTG CGTGGTGGGG GTGTGTTCGG TCGTCGTCTG GT - #GGGTGTGG34740- GGGTGCGGGG TGGTTCGGGG GTGTGGCGTG CGCGGGGGTG TGTGGTGGTG AC - #GGGTGGGT34800- TGGGTGGTGT GGGGGGTCAT GTGGCGCGGT GGTTGGCGCG TTCGGGTGCG GA - #GCATGTGG34860- TGTTGGCGGG GCGTCGGGGT GGTGGGGTTG TGGGGGCGGT GGAGTTGGAG CG - #GGAGTTGG34920- TGGGGTTGGG GGCGAAGGTG ACGTTCGTTT CGTGTGATGT GGGGGATCGG GC - #GTCGATGG34980- TGGGGTTGTT GGGTGTGGTG GAGGGGTTGG GGGTGCCGTT GCGTGGTGTG TT - #TCATGCGG35040- CGGGGGTGGC TCAGGTGTCG GGGTTGGGTG AGGTGTCGTT GGCGGAGGCG GG - #TGGTGTGT35100- TGGGGGGTAA GGCGGTGGGG GCTGAGTTGT TGGACGAGTT GACGGCGGGT GT - #GGAGCTGG35160- ATGCGTTCGT GTTGTTCTCG TCGGGTGCTG GGGTGTGGGG GAGTGGGGGG CA - #GTCGGTGT35220- ATGCGGCGGC CAATGCGCAT CTGGATGCGT TGGCGGAGCG TCGTCGTGCG CA - #GGGGCGTC35280- CCGCGACCTC CGTCGCCTGG GGCCTGTGGG GCGGCGAGGG CATGGGAGCG GA - #CGAAGGCG35340- TCACGGAGTT CTACGCCGAG CGCGGCCTCG CCCCCATGCG GCCCGAGTCG GG - #CATCGAGG35400- CACTGCACAC GGCACTGAAC GAGGGCGACA CCTGCGTCAC GGTCGCCGAC AT - #CGACTGGG35460- AACACTTCGT CACCGGGTTC ACCGCCTACC GGCCCAGCCC GCTGATCTCC GA - #CATCCCCC35520- AGGTCCGCGC GTTGCGCACG CCCGAACCCA CCGTGGACGC CTCGGACGGA CT - #GCGCCGGC35580- GCGTCGACGC CGCCCTCACC CCGCGCGAGC GCACCAAGGT CCTGGTCGAC CT - #GGTCCGCA35640- CGGTGGCGGC GGAGGTCCTC GGTCACGACG GGATCGGCGG CATCGGCCAC GA - #CGTGGCCT35700- TCCGGGACCT CGGCTTCGAC TCGCTGGCCG CGGTGCGGAT GCGCGGCCGG CT - #GGCCGAGG35760- CGACCGGACT CGTACTGCCC GCGACGGTCA TCTTCGACCA CCCCACCGTG GA - #CCGGCTCG35820- GCGGCGCGCT GCTGGAGCGG CTGTCCGCGG ACGAACCCGC GCCCGGCGGG GC - #GCCGGAGC35880- CCGCCGGGGG GAGGCCCGCG ACCCCACCGC CCGCACCGGA GCCGGCCGTC CA - #CGACGCCG35940- ACATCGACGA ACTCGACGCG GACGCCCTGA TCCGGCTGGC CACGGGAACC GC - #CGGACCGG36000- CCGACGGCAC GCCGGCCGAC GGCGGGCCCG ACGCGGCGGC GACCGCCCCC GA - #CGGAGCAC36060- CGGAGCAGTA GCGCGCCCTC ACCGGCGCGC CGACCGGCGG AGCGCCGTAC CG - #CCGACGCC36120- CCCCACAGCC AGCGAGCAGA CGAGGAAGCC GAAGATGTCA CCGTCCATGG AC - #GAAGTGCT36180- GGGTGCGCTG CGCACCTCCG TCAAGGAGAC CGAGCGGCTG CGCCGGCACA AC - #CGGGAGCT36240- CCTGGCCGGC GCGCACGAGC CGGTCGCCAT CGTGGGCATG GCCTGCCGCT AC - #CCCGGTGG36300- CGTGAGCACC CCGGACGACC TGTGGGAGCT CGCCGCGGAC GGCGTCGACG CG - #ATCACCCC36360- CTTCCCGGCC GACCGGGGCT GGGACGAGGA CGCCGTCTAC TCGCCCGACC CC - #GACACCCC36420- CGGCACCACC TACTGCCGTG AGGGCGGCTT CCTCACCGGC GCCGGGGACT TC - #GACGCGGC36480- CTTCTTCGGC ATCTCGCCGA ACGAGGCGCT GGTGATGGAC CCGCAGCAGC GG - #CTGTTGCT36540- GGAGACGTCG TGGGAGACGT TGGAGCGGGC CGGCATCGTC CCCGCGTCGC TG - #CGCGGCAG36600- CCGTACCGGT GTCTTCGTCG GAGCCGCGCA CACGGGATAC GTCACCGACA CC - #GCGCGAGC36660- GCCCGAGGGC ACCGAGGGCT ATCTGCTGAC GGGCAACGCC GATGCCGTCA TG - #TCCGGCCG36720- GATCGCCTAC TCCCTGGGTC TGGAGGGGCC GGCGCTGACG ATCGGGACGG CC - #TGCTCGTC36780- GTCGTTGGTG GCGTTGCATC TGGCGGTGCA GTCGTTGCGG CGGGGCGAGT GC - #GACCTGGC36840- GTTGGCCGGC GGCGTCGCGG TCATGCCCGA CCCGACGGTG TTCGTGGAGT TC - #TCGCGGCA36900- GCGGGGGCTG GCGGTGGACG GGCGGTGCAA GGCGTTCGCG GAGGGTGCGG AC - #GGGACGGC36960- GTGGGCGGAG GGAGTGGGTG TGCTGCTGGT GGAGCGGCTT TCCGACGCGC GC - #CGCAATGG37020- CCATCGGGTG CTGGCGGTGG TGCGGGGCAG TGCGGTCAAT CAGGACGGGG CG - #AGCAATGG37080- GCTGACGGCG CCGAGTGGTC CTGCGCAGCA GCGGGTGATC CGTGAGGCGC TG - #GCTGATGC37140- GGGGCTGACG CCCGCCGACG TGGATGTGGT GGAGGCGCAC GGTACGGGGA CG - #GCGTTGGG37200- TGATCCGATC GAGGCGGGTG CGTTGCTGGC CACGTACGGG CGGGAGCGGG TC - #GGTGATCC37260- TTTGTGGTTG GGGTCGTTGA AGTCGAACAT CGGGCATGCG CAGGCGGCTG CG - #GGTGTGGG37320- TGGTGTGATC AAGGTGGTGC AGGCGATGCG GCATGGGTCG TTGCCGCGGA CG - #CTGCATGT37380- GGATGCGCCG TCGTCGAAGG TGGAGTGGGC TTCGGGTGCG GTGGAGCTGC TG - #ACCGAGGG37440- CCGGTCGTGG CCGCGGCGGG TGGAGCGGGT GCGGCGGGCC GCGGTGTCGG CG - #TTCGGGGT37500- GAGCGGGACC AACGCCCATG TGGTCCTGGA GGAAGCACCG GTCGAGGCCG GG - #AGCGAGCA37560- CGGGGACGGC CCCGGACCCG ACCGGCCCGA CGCCGTGACG GGTCCGCTCC CC - #TGGGTGCT37620- CTCGGCACGC TCGCGGGAGG CGCTGCGCGG CCAGGCCGGA CGACTCGCCG CT - #CTCGCCCG37680- CCAGGGGCGC ACGGAGGGCA CCGGCGGCGG CAGCGGACTC GTCGTCCCCG CG - #GCCGACAT37740- CGGATACTCC CTGGCCACCA CCAGGGAGAC CCTGGAGCAC CGGGCGGTGG CG - #CTGGTGCA37800- GGAGAACCGG ACGGCCGGGG AGGACCTCGC CGCGCTGGCC GCCGGCCGCA CA - #CCGGAGAG37860- CGTGGTCACG GGTGTCGCGC GACGTGGCCG CGGGATCGCC TTCCTCTGCT CG - #GGGCAGGG37920- CGCCCAGCGG CTCGGCGCCG GTCGGGAGCT CCGCGGCAGG TTCCCCGTCT TC - #GCCGACGC37980- CCTCGACGAG ATCGCGGCGG AGTTCGACGC CCACCTCGAA CGCCCTCTCC TG - #TCGGTGAT38040- GTTCGCCGAG CCCGCCACGC CGGACGCCGC ACTCCTCGAC CGCACCGACT AC - #ACCCAGCC38100- GGCCCTCTTC GCGGTGGAGA CCGCGCTCTT CCGGCTCCTG GAGAGCTGGG GC - #CTGGTCCC38160- GGACGTCCTC GTGGGCCACT CGATCGGCGG TCTGGTGGCG GCTCACGTGG CG - #GGCGTCTT38220- CTCTGCGGCC GACGCGGCCC GGCTGGTCTC CGCACGCGGC CGGCTCATGC GG - #GCCCTGCC38280- CGAGGGCGGC GCGATGGCGG CCGTGCAGGC CACCGAGCGG GAGGCCGCCG CG - #CTGGAGCC38340- CGTCGCCGCC GGCGGCGCGG TGGTCGCCGC GGTCAACGGC CCGCAGGCCC TC - #GTGCTCTC38400- CGGGGACGAG GCGGCCGTAC TGGCGGCGGC CGGTGAACTG GCCGCCCGCG GA - #CGCCGCAC38460- CAAGCGCCTG AGGGTGAGCC ACGCCTTCCA CTCACCCCGT ATGGACGCCA TG - #CTCGCCGA38520- CTTCCGCGCG GTGGCGGACA CGGTCGACTA CCACGCCCCC CGGCTGCCGG TC - #GTCTCCGA38580- AGTGACCGGC GACCTCGCCG ACGCCGCCCA GCTGACCGAC CCCGGCTACT GG - #ACCCGCCA38640- GGTGCGGCAG CCGGTGCGCT TCGCCGACGC CGTGCGCACC GCGAGCGCCC GG - #GACGCCGC38700- GACCTTCATC GAGCTCGGGC CCGACGCCGT CCTGTGCGGC ATGGCGGAGG AG - #TCCCTGGC38760- CGCGGAGGCC GACGTCGTGT TCGCCCCGGC ACTGCGCCGC GGGCGCCCGG AG - #GGCGACAC38820- CGTGCTCCGG GCCGCCGCGA GCGCGTACGT CCGCGGCGCG GGCCTCGACT GG - #GCCGCGCT38880- CTACGGCGGC ACGGGAGCCC GCCGCACCGA CCTGCCCACC TACGCCTTCC AG - #CACAGCCG38940- CTACTGGCTC GCCCCCGCCT CGGCCGCGGT CGCCCCCGCG ACGGCCGCCC CC - #TCCGTCCG39000- ATCCGTGCCG GAAGCCGAGC AGGACGGGGC GCTGTGGGCC GCCGTGCACG CC - #GGTGACGT39060- CGCCTCGGCC GCGGCGCGAC TGGGCGCCGA CGACGCCGGT ATCGAACACG AA - #CTGCGCGC39120- GGTCCTGCCG CACCTGGCCG CCTGGCACGA CCGCGACCGC GCGACCGCGC GG - #ACCGCGGG39180- CCTGCACTAC CGCGTCACCT GGCAGGCGAT CGAGGCAGAC GCTGTCAGGT TC - #AGCCCCTC39240- GGATCGCTGG CTGATGGTCG AGCATGGGCA GCACACGGAA TGCGCGGACG CC - #GCGGAACG39300- GGCGCTGCGC GCGGCCGGCG CGGAGGTCAC CCGCCTGGTG TGGCCGCTGG AG - #CAGCACAC39360- CGGATCACCG CGGACGGAGA CCCCGGACCG CGGCACCCTG GCGGCCCGGC TG - #GCCGAGCT39420- CGCACGGAGC CCGGAGGGCC TGGCCGGCGT GCTGCTGCTC CCCGACTCGG GC - #GGTGCCGC39480- GGTCGCCGGG CACCCCGGGC TGGACCAGGG AACGGCGGCG GTGCTGCTGA CG - #ATCCAGGC39540- ACTGACCGAC GCCGCGGTGC GGGCACCGCT GTGGGTGGTG ACGCGGGGTG CG - #GTGGCGGT39600- GGGGTCGGGT GAGGTGCCGT GTGCGGTGGG TGCGCGGGTG TGGGGTCTGG GG - #CGGGTGGC39660- TGCGTTGGAG GTGCCGGTGC AGTGGGGTGG GTTGGTGGAT GTGGCGGTGG GG - #GCGGGTGT39720- GCGTGAGTGG CGTCGTGTGG TGGGTGTGGT TGCGGGGGGT GGTGAGGATC AG - #GTGGCGGT39780- GCGTGGTGGG GGTGTGTTCG GTCGTCGTCT GGTGGGTGTG GGGGTGCGGG GT - #GGTTCGGG39840- GGTGTGGCGT GCGCGGGGGT GTGTGGTGGT GACGGGTGGG TTGGGTGGTG TG - #GGGGGTCA39900- TGTGGCGCGG TGGTTGGCGC GTTCGGGTGC GGAGCATGTG GTGTTGGCGG GG - #CGTCGGGG39960- TGGTGGGGTT GTGGGGGCGG TGGAGTTGGA GCGGGAGTTG GTGGGGTTGG GG - #GCGAAGGT40020- GACGTTCGTT TCGTGTGATG TGGGGGATCG GGCGTCGGTG GTGGGGTTGT TG - #GGTGTGGT40080- GGAGGGGTTG GGGGTGCCGT TGCGTGGTGT GTTTCATGCG GCGGGGGTGG CT - #CAGGTGTC40140- GGGGTTGGGT GAGGTGTCGT TGGCGGAGGC GGGTGGTGTG TTGGGGGGTA AG - #GCGGTGGG40200- GGCTGAGTTG TTGGACGAGT TGACGGCGGG TGTGGAGCTG GATGCGTTCG TG - #TTGTTCTC40260- GTCGGGTGCT GGGGTGTGGG GGAGTGGGGG GCAGTCGGTG TATGCGGCGG CC - #AATGCGCA40320- TCTGGATGCG TTGGCGGAGC GTCGTCGTGC GCAGGGGCGT CCCGCGACCT CC - #GTCGCCTG40380- GGGCCCGTGG GACGGCGACG GCATGGGCGA GATGGCGCCC GAGGGCTACT TC - #GCCCGCCA40440- CGGCGTGGCC CCGCTCCACC CCGAGACGGC GCTCACCGCC CTGCACCAGG CC - #ATCGACGG40500- CGGCGAAGCC ACGGTCACCG TGGCGGACAT CGACTGGGAA CGGTTCGCCC CC - #GGCTTCAC40560- CGCCTTCCGT CCCAGCCCCC TGATCGCCGG CATCCCCGCG GCCCGTACGG CG - #CCCGCCGC40620- CGGCCGGCCC GCCGAGGACA CCCCCACCGC CCCCGGCCTC CTGCGGGCGC GG - #CCCGAGGA40680- CCGGCCGCGG CTCGCCCTGG ACCTGGTGCT CCGCCACGTC GCGGCGGTCC TC - #GGCCACTC40740- CGAGGACGCC CGGGTCGACG CCCGGGCCCC CTTCCGGGAC CTCGGCTTCG AC - #TCGCTCGC40800- CGCGGTGCGG CTGCGCCGCC GGCTGGCCGA GGACACCGGG CTCGACCTGC CC - #GGCACCCT40860- CGTCTTCGAC CACGAGGACC CCACCGCGCT GGCCCACCAC CTGGCCGGCC TC - #GCCGACGC40920- GGGGACCCCC GGCCCCCAGG AGGGCACGGC TCGGGCCGAG AGCGGGCTGT TC - #GCCTCCTT40980- CCGCGCCGCC GTCGAACAGC GCAGGTCGAG CGAGGTCGTG GAGCTGATGG CC - #GACCTGGC41040- GGCGTTCCGG CCCGCCTACT CCCGGCAGCA CCCCGGCTCC GGCCGCCCCG CG - #CCCGTACC41100- CCTCGCGACC GGACCGGCGA CGCGTCCCAC GCTGTACTGC TGCGCCGGCA CC - #GCGGTCGG41160- CTCCGGGCCC GCCGAGTACG TCCCGTTCGC CGAAGGACTG CGCGGCGTCC GG - #GAGACGGT41220- CGCCCTTCCC CTGTCCGGCT TCGGCGACCC CGCGGAACCG ATGCCCGCAT CG - #CTCGACGC41280- GCTGATCGAG GTCCAGGCCG ACGTCCTCCT GGAGCACACC GCGGGCAAGC CC - #TTCGCCCT41340- CGCCGGCCAC TCCGCCGGCG CGAACATCGC CCACGCCCTG GCCGCCCGGC TG - #GAGGAACG41400- CGGCTCGGGC CCCGCAGCCG TCGTACTGAT GGACGTCTAC CGTCCCGAGG AC - #CCCGGTGC41460- GATGGGCGAG TGGCGCGACG ACCTGCTCAG CTGGGCGCTC GAACGCAGCA CG - #GTGCCCCT41520- GGAGGACCAC CGGCTCACCG CCATGGCCGG CTATCAGCGG CTGGTGCTCG GA - #ACCCGGCT41580- CACCGCCCTC GAAGCCCCCG TCCTGCTGGC CCGGGCGTCC GAACCCCTGT GC - #GCGTGGCC41640- GCCCGCGGGC GGGGCGCGGG GCGACTGGCG GTCCCAGGTC CCGTTCGCAC GG - #ACCGTCGC41700- CGACGTGCCC GGCAACCACT TCACCATGCT CACCGAACAC GCCCGGCACA CC - #GCGTCCCT41760- GGTGCACGAA TGGCTGGACA GCCTCCCGCA CCAGCCCGGT CCCGCCCCGC TC - #ACCGGAGG41820- GAAACACTGA TGTACGCCGA CGACATCGCG GCCGTCTACG ACCTGGTCCA CG - #AGGGGAAG41880- GGGAAGGACT ACCGGCAGGA GGCCGAGGAG ATCGCCGCAC TCGTGCGCGT CC - #ACCGGCCG41940- GGCGCCCGGA CCCTGCTCGA CGTGGCCTGC GGCACCGGCC AGCACCTGCA CC - #ACCTGGAC42000- GGCCTCTTCG ACCACGTCGA GGGCCTGGAA CTCTCCGCCG ACATGCTGGC CC - #TCGCGACC42060- GGCCGGAACC CCGGTGTCAC CTTCCACCAA GGGGACATGC GCTCGTTCTC CC - #TGGGACGC42120- CGGTTCGACG CGGTGACCTG CATGTTCAGC TCCATAGGCC ACCTGCGGAC CA - #CCGACGAA42180- CTCGACAGCA CGCTGCGGGC CTTCACCGAC CACCTCGAAC CGTCCGGCGT CA - #TCGTCGTC42240- GAACCCTGGT GGTTCCCCGA GTCCTTCACC CCCGGTTACG TCGGCGCCAG CA - #TCACGGAG42300- GCGGGCGAGC GCACCGTCTG CCGGGTCTCG CACTCCGTAC GGGAGGGGAA CG - #CCACCCGC42360- ATCGAGGTGC ACTACCTCCT CGCCGGACCC GGCGGCGTCC GTCACCTGAC CG - #AGGACCAC42420- ACCATCACCC TGTTCCCGCG CGCCGACTAC GAGGCGGCCT TCGAGCGCGC CG - #GCTGCGAC42480- GTGGTCTACC AGGAAGGCGG CCCGTCCGGT CGCGGGCTGT TCATCGGCAC CC - #GCCGCTGA42540- CCCGGTGCCG ACGCGGACCG CCGCGGCCCG GAGGCGGGTT GCCCCGACCC AC - #CCGGCACA42600- CCCGGGTCCC CCGATCGTGC GAGCGCCCCC ATCGACCCGA GAAGAAAGGC AG - #GGCAGCCA42660- TGCCCACCCT TGCCACGGAA ACGGCCCCCG CGAGCACGAG CACGAGCGCG GG - #CACGAGCA42720- CGGGCGTCCG TGCGCTCGGC CGTCGGCTCC AGCTGACCCG GGCCGCACAC TG - #GTGCGCCG42780- GCAACCAGGG CGACCCGTAC GCGCTGATCC TGCGCGCCGT CGCCGACCCC GA - #GCCGTTCG42840- AACGGGAGAT CCGGGCCCGC GGACCGTGGT TCCGCAGCGA ACAGCTGGAC GC - #CTGGGTGA42900- CCGCGGACCC CGAGGTGGCG GCGGCCGTCC TGGCCGACCC GCGCTTCGGC AC - #GCTGGACC42960- GGGCCGGACG CCGCCCGGAC GAGGAACTGC TGCCCCTCGC CGAGGCGTTC CC - #CCACCACG43020- AACGCGCGGA GCTCGTACGC CTGCGGGCGC TGGCCGCCCC GGTGCTCAGC CG - #GTACGCCC43080- CGGCCCAGGC GCCCTGCGCG GCGCGCACCA CCGCCCGCAG AGTGCTCGGC CG - #CCTGCTGC43140- CCACCGGTGA CGCCGGGTTC GACCTTGTCG GCGAGGTCGC CCGGCCCTAC GC - #CGTCGAGC43200- TGATGCTCAG GCTCCTCGGA GTGCCGGGCC GCGACCGCGC CACCGCCGCG CG - #GGCACTCG43260- CCGCCTGCGG CCCCCAGCTC GACGCCCGGA TGGCCCCGCA ACTGCTGACC GT - #GGCCCGGG43320- AGTCCGCCGA CGCCGTCCGC ACACTGGCCG ACCTGGTCCC CGAGCTCGTC GC - #GGAGAAGT43380- CCCGGGGCCT CGGGAACGCC GAGCCCCGGC CCGACGACGT GCTCGCCCTC CT - #CCTGCACG43440- ACGGCGTCGC CCCCGGCGAC GTCGAGCGCA TCGCGCTGCT CCTCGCGGTC GG - #CGCACCCG43500- AACCCGTCGT CACCGCCGTC GCGCACACGG TCCACCGGCT GCTCGGCCGG CC - #GGGGGAGT43560- GGGAGAGGGC CCGCCGGACG CCGGCCGCGG CGAACGCCGT CGACCAGGTG CT - #GCGCGAGC43620- GCCCCCCGGC CCGGCTGGAG AACCGGGTCG CGCACACCGG CCTCGAACTC GG - #CGGCCGCC43680- GGATCACCGC CGACGAGCAC GTCGTGGTGC TGGCCGCCGC CGGACGGGAG AT - #CCCCGGGC43740- CGGAGCCGCT CGGGGGCGCC GACGGACCGC ACCTGGCGCT CGCCCTCCCG CT - #GATCCGCC43800- TGGCCGCCAC CACCGCGGTC CAGGTCACGG CCGGCCGCCT GCCCGGCCTG CG - #GGCCGAGG43860- GACCGCCCCT GACCCGGCCG CGGTCACCGG TCCTGGGCGC CTGCGCCCGC CT - #CCGGGTCC43920- ACCCGGGATG ACCCCGCCGT CCGTACGCCC CCTCCCAGAC CGGAGCCGCT GT - #GCGCGTCC43980- TGCTGACATC CCTCGCCCAC AACACCCACT ACTACAGTCT GGTGCCCCTC GC - #CTGGGCGC44040- TGCGCGCCGC CGGGCACGAG GTACGGGTGG CGAGCCCGCC CTCCCTCACC GA - #CGTCATCA44100- CCTCCACCGG TCTGACCGCC GTACCGGTGG GCGACGACCG ACCGGCCGCG GA - #GCTGCTCG44160- CCGAGATGGG CAGAGACCTC GTCCCCTACC AGAGGGGCTT CGAGTTCGGT GA - #GGTGGAGA44220- GCGAGGAGGA GACCACCTGG GAGTACCTGC TCGGCCAGCA GAGCATGATG GC - #CGCCCTGT44280- GCTTCGCCCC GTTCAACGGC GCCGCCACGA TGGACGAGAT CGTCGACTTC GC - #CCGTGGCT44340# 44377 CGTG TGGGAACCCT GGACCTA- (2) INFORMATION FOR SEQ ID NO:2:- (i) SEQUENCE CHARACTERISTICS:#acids (A) LENGTH: 4550 amino (B) TYPE: amino acid (D) TOPOLOGY: unknown- (ii) MOLECULE TYPE: peptide- (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:- Met Ser Gly Glu Leu Ala Ile Ser Arg Ser As - #p Asp Arg Ser Asp Ala# 15- Val Ala Val Val Gly Met Ala Cys Arg Phe Pr - #o Gly Ala Pro Gly Ile# 30- Ala Glu Phe Trp Lys Leu Leu Thr Asp Gly Ar - #g Asp Ala Ile Gly Arg# 45- Asp Ala Asp Gly Arg Arg Arg Gly Met Ile Gl - #u Ala Pro Gly Asp Phe# 60- Asp Ala Ala Phe Phe Gly Met Ser Pro Arg Gl - #u Ala Ala Glu Thr Asp#80- Pro Gln Gln Arg Leu Met Leu Glu Leu Gly Tr - #p Glu Ala Leu Glu Asp# 95- Ala Gly Ile Val Pro Gly Ser Leu Arg Gly Gl - #u Ala Val Gly Val Phe# 110- Val Gly Ala Met His Asp Asp Tyr Ala Thr Le - #u Leu His Arg Ala Gly# 125- Ala Pro Val Gly Pro His Thr Ala Thr Gly Le - #u Gln Arg Ala Met Leu# 140- Ala Asn Arg Leu Ser Tyr Val Leu Gly Thr Ar - #g Gly Pro Ser Leu Ala145 1 - #50 1 - #55 1 -#60- Val Asp Thr Ala Gln Ser Ser Ser Leu Val Al - #a Val Ala Leu Ala Val# 175- Glu Ser Leu Arg Ala Gly Thr Ser Arg Val Al - #a Val Ala Gly Gly Val# 190- Asn Leu Val Leu Ala Asp Glu Gly Thr Ala Al - #a Met Glu Arg Leu Gly# 205- Ala Leu Ser Pro Asp Gly Arg Cys His Thr Ph - #e Asp Ala Arg Ala Asn# 220- Gly Tyr Val Arg Gly Glu Gly Gly Ala Ala Va - #l Val Leu Lys Pro Leu225 2 - #30 2 - #35 2 -#40- Ala Asp Ala Leu Ala Asp Gly Asp Pro Val Ty - #r Cys Val Val Arg Gly# 255- Val Ala Val Gly Asn Asp Gly Gly Gly Pro Gl - #y Leu Thr Ala Pro Asp# 270- Arg Glu Gly Gln Glu Ala Val Leu Arg Ala Al - #a Cys Ala Gln Ala Arg# 285- Val Asp Pro Ala Glu Val Arg Phe Val Glu Le - #u His Gly Thr Gly Thr# 300- Pro Val Gly Asp Pro Val Glu Ala His Ala Le - #u Gly Ala Val His Gly305 3 - #10 3 - #15 3 -#20- Ser Gly Arg Pro Ala Asp Asp Pro Leu Leu Va - #l Gly Ser Val Lys Thr# 335- Asn Ile Gly His Leu Glu Gly Ala Ala Gly Il - #e Ala Gly Leu Val Lys# 350- Ala Ala Leu Cys Leu Arg Glu Arg Thr Leu Pr - #o Gly Ser Leu Asn Phe# 365- Ala Thr Pro Ser Pro Ala Ile Pro Leu Asp Gl - #n Leu Arg Leu Lys Val# 380- Gln Thr Ala Ala Ala Glu Leu Pro Leu Ala Pr - #o Gly Gly Ala Pro Leu385 3 - #90 3 - #95 4 -#00- Leu Ala Gly Val Ser Ser Phe Gly Ile Gly Gl - #y Thr Asn Cys His Val# 415- Val Leu Glu His Leu Pro Ser Arg Pro Thr Pr - #o Ala Val Ser Val Ala# 430- Ala Ser Leu Pro Asp Val Pro Pro Leu Leu Le - #u Ser Ala Arg Ser Glu# 445- Gly Ala Leu Arg Ala Gln Ala Val Arg Leu Gl - #y Glu Tyr Val Glu Arg# 460- Val Gly Ala Asp Pro Arg Asp Val Ala Tyr Se - #r Leu Ala Ser Thr Arg465 4 - #70 4 - #75 4 -#80- Thr Leu Phe Glu His Arg Ala Val Val Pro Cy - #s Gly Gly Arg Gly Glu# 495- Leu Val Ala Ala Leu Gly Gly Phe Ala Ala Gl - #y Arg Val Ser Gly Gly# 510- Val Arg Ser Gly Arg Ala Val Pro Gly Gly Va - #l Gly Val Leu Phe Thr# 525- Gly Gln Gly Ala Gln Trp Val Gly Met Gly Ar - #g Gly Leu Tyr Ala Gly# 540- Gly Gly Val Phe Ala Glu Val Leu Asp Glu Va - #l Leu Ser Met Val Gly545 5 - #50 5 - #55 5 -#60- Glu Val Asp Gly Arg Ser Leu Arg Asp Val Me - #t Phe Gly Asp Val Asp# 575- Val Asp Ala Gly Ala Gly Ala Asp Ala Gly Al - #a Gly Ala Gly Ala Gly# 590- Val Gly Ser Gly Ser Gly Ser Val Gly Gly Le - #u Leu Gly Arg Thr Glu# 605- Phe Ala Gln Pro Ala Leu Phe Ala Leu Glu Va - #l Ala Leu Phe Arg Ala# 620- Leu Glu Ala Arg Gly Val Glu Val Ser Val Va - #l Leu Gly His Ser Val625 6 - #30 6 - #35 6 -#40- Gly Glu Val Ala Ala Ala Tyr Val Ala Gly Va - #l Leu Ser Leu Gly Asp# 655- Ala Val Arg Leu Val Val Ala Arg Gly Gly Le - #u Met Gly Gly Leu Pro# 670- Val Gly Gly Gly Met Trp Ser Val Gly Ala Se - #r Glu Ser Val Val Arg# 685- Gly Val Val Glu Gly Leu Gly Glu Trp Val Se - #r Val Ala Ala Val Asn# 700- Gly Pro Arg Ser Val Val Leu Ser Gly Asp Va - #l Gly Val Leu Glu Ser705 7 - #10 7 - #15 7 -#20- Val Val Ala Ser Leu Met Gly Asp Gly Val Gl - #u Cys Arg Arg Leu Asp# 735- Val Ser His Gly Phe His Ser Val Leu Met Gl - #u Pro Val Leu Gly Glu# 750- Phe Arg Gly Val Val Glu Ser Leu Glu Phe Gl - #y Arg Val Arg Pro Gly# 765- Val Val Val Val Ser Gly Val Ser Gly Gly Va - #l Val Gly Ser Gly Glu# 780- Leu Gly Asp Pro Gly Tyr Trp Val Arg His Al - #a Arg Glu Ala Val Arg785 7 - #90 7 - #95 8 -#00- Phe Ala Asp Gly Val Gly Val Val Arg Gly Le - #u Gly Val Gly Thr Leu# 815- Val Glu Val Gly Pro His Gly Val Leu Thr Gl - #y Met Ala Gly Glu Cys# 830- Leu Gly Ala Gly Asp Asp Val Val Val Val Pr - #o Ala Met Arg Arg Gly# 845- Arg Ala Glu Arg Glu Val Phe Glu Ala Ala Le - #u Ala Thr Val Phe Thr# 860- Arg Asp Ala Gly Leu Asp Ala Thr Ala Leu Hi - #s Thr Gly Ser Thr Gly865 8 - #70 8 - #75 8 -#80- Arg Arg Ile Asp Leu Pro Thr Tyr Pro Phe Gl - #n Arg Arg Thr His Trp# 895- Ser Pro Ala Leu Ser Arg Pro Val Thr Ala As - #p Ala Gly Ala Gly Val# 910- Thr Ala Thr Asp Ala Val Gly His Ser Val Se - #r Pro Asp Pro Glu Ser# 925- Thr Glu Gly Thr Ser His Arg Asp Thr Asp As - #p Glu Ala Asp Ser Ala# 940- Ser Pro Glu Pro Met Ser Pro Glu Asp Ala Va - #l Arg Leu Val Arg Glu945 9 - #50 9 - #55 9 -#60- Ser Thr Ala Ala Val Leu Gly His Asp Asp Pr - #o Gly Glu Val Ala Leu# 975- Asp Arg Thr Phe Thr Ser Gln Gly Met Asp Se - #r Val Thr Ala Val Glu# 990- Leu Cys Asp Leu Leu Lys Gly Ala Ser Gly Le - #u Pro Leu Ala Ala Thr# 10050- Leu Val Tyr Asp Leu Pro Thr Pro Arg Ala Va - #l Ala Glu His Ile Val# 10205- Glu Ala Ala Gly Gly Pro Lys Asp Ser Val Al - #a Gly Gly Pro Gly Val# 10401030 - # 1035- Leu Ser Ser Ala Ala Val Gly Val Ser Asp Al - #a Arg Gly Gly Ser Arg# 10550- Asp Asp Asp Asp Pro Ile Ala Ile Val Gly Va - #l Gly Cys Arg Leu Pro# 10705- Gly Gly Val Asp Ser Arg Ala Ala Leu Trp Gl - #u Leu Leu Glu Ser Gly# 10850- Ala Asp Ala Ile Ser Ser Phe Pro Thr Asp Ar - #g Gly Trp Asp Leu Asp# 11005- Gly Leu Tyr Asp Pro Glu Pro Gly Thr Pro Gl - #y Lys Thr Tyr Val Arg# 11201110 - # 1115- Glu Gly Gly Phe Leu His Ser Ala Ala Glu Ph - #e Asp Ala Glu Phe Phe# 11350- Gly Ile Ser Pro Arg Glu Ala Thr Ala Met As - #p Pro Gln Gln Arg Leu# 11505- Leu Leu Glu Ala Ser Trp Glu Ala Leu Glu As - #p Ala Gly Val Leu Pro# 11650- Glu Ser Leu Arg Gly Gly Asp Ala Gly Val Ph - #e Val Gly Ala Thr Ala# 11805- Pro Glu Tyr Gly Pro Arg Leu His Glu Gly Al - #a Asp Gly Tyr Glu Gly# 12001190 - # 1195- Tyr Leu Leu Thr Gly Thr Thr Ala Ser Val Al - #a Ser Gly Arg Ile Ala# 12150- Tyr Thr Leu Gly Thr Gly Gly Pro Ala Leu Th - #r Val Asp Thr Ala Cys# 12305- Ser Ser Ser Leu Val Ala Leu His Leu Ala Va - #l Gln Ala Leu Arg Arg# 12450- Gly Glu Cys Gly Leu Ala Leu Ala Gly Gly Al - #a Thr Val Met Ser Gly# 12605- Pro Gly Met Phe Val Glu Phe Ser Arg Gln Ar - #g Gly Leu Ala Pro Asp# 12801270 - # 1275- Gly Arg Cys Met Pro Phe Ser Ala Asp Ala As - #p Gly Thr Ala Trp Ser# 12950- Glu Gly Val Ala Val Leu Ala Leu Glu Arg Le - #u Ser Asp Ala Arg Arg# 13105- Ala Gly His Arg Val Leu Gly Val Val Arg Gl - #y Ser Ala Val Asn Gln# 13250- Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro As - #n Arg Ser Ala Gln Glu# 13405- Gly Val Ile Arg Ala Ala Leu Ala Asp Ala Gl - #y Leu Ala Pro Gly Asp# 13601350 - # 1355- Val Asp Ala Val Glu Ala His Gly Thr Gly Th - #r Ala Leu Gly Asp Pro# 13750- Ile Glu Ala Ser Ala Leu Leu Ala Thr Tyr Gl - #y Arg Glu Arg Val Gly# 13905- Asp Pro Leu Trp Leu Gly Ser Leu Lys Ser As - #n Val Gly His Thr Gln# 14050- Ala Ala Ala Gly Ala Ala Gly Val Val Lys Me - #t Leu Leu Ala Leu Glu# 14205- His Gly Thr Leu Pro Arg Thr Leu His Ala As - #p Arg Pro Ser Thr His# 14401430 - # 1435- Val Asp Trp Ser Ser Gly Thr Val Ala Leu Le - #u Ala Glu Ala Arg Arg# 14550- Trp Pro Arg Arg Ser Asp Arg Pro Arg Arg Al - #a Ala Val Ser Ser Phe# 14705- Gly Ile Ser Gly Thr Asn Ala His Leu Ile Il - #e Glu Glu Ala Pro Glu# 14850- Trp Val Glu Asp Ile Asp Gly Val Ala Ala Pr - #o Asp Arg Gly Thr Ala# 15005- Asp Ala Ala Ala Pro Ser Pro Leu Leu Leu Se - #r Ala Arg Ser Glu Gly# 15201510 - # 1515- Ala Leu Arg Ala Gln Ala Val Arg Leu Gly Gl - #u Tyr Val Glu Arg Val# 15350- Gly Ala Asp Pro Arg Asp Val Ala Tyr Ser Le - #u Ala Ser Thr Arg Thr# 15505- Leu Phe Glu His Arg Ala Val Val Pro Cys Gl - #y Gly Arg Gly Glu Leu# 15650- Val Ala Ala Leu Gly Gly Phe Ala Ala Gly Ar - #g Val Ser Gly Gly Val# 15805- Arg Ser Gly Arg Ala Val Pro Gly Gly Val Gl - #y Val Leu Phe Thr Gly# 16001590 - # 1595- Gln Gly Ala Gln Trp Val Gly Met Gly Arg Gl - #y Leu Tyr Ala Gly Gly# 16150- Gly Val Phe Ala Glu Val Leu Asp Glu Val Le - #u Ser Met Val Gly Glu# 16305- Val Asp Gly Arg Ser Leu Arg Asp Val Met Ph - #e Gly Asp Val Asp Val# 16450- Asp Ala Gly Ala Gly Ala Asp Ala Gly Ala Gl - #y Ala Gly Ala Gly Val# 16605- Gly Ser Gly Ser Gly Ser Val Gly Gly Leu Le - #u Gly Arg Thr Glu Phe# 16801670 - # 1675- Ala Gln Pro Ala Leu Phe Ala Leu Glu Val Al - #a Leu Phe Arg Ala Leu# 16950- Glu Ala Arg Gly Val Glu Val Ser Val Val Le - #u Gly His Ser Val Gly# 17105- Glu Val Ala Ala Ala Tyr Val Ala Gly Val Le - #u Ser Leu Gly Asp Ala# 17250- Val Arg Leu Val Val Ala Arg Gly Gly Leu Me - #t Gly Gly Leu Pro Val# 17405- Gly Gly Gly Met Trp Ser Val Gly Ala Ser Gl - #u Ser Val Val Arg Gly# 17601750 - # 1755- Val Val Glu Gly Leu Gly Glu Trp Val Ser Va - #l Ala Ala Val Asn Gly# 17750- Pro Arg Ser Val Val Leu Ser Gly Asp Val Gl - #y Val Leu Glu Ser Val# 17905- Val Ala Ser Leu Met Gly Asp Gly Val Glu Cy - #s Arg Arg Leu Asp Val# 18050- Ser His Gly Phe His Ser Val Leu Met Glu Pr - #o Val Leu Gly Glu Phe# 18205- Arg Gly Val Val Glu Ser Leu Glu Phe Gly Ar - #g Val Arg Pro Gly Val# 18401830 - # 1835- Val Val Val Ser Gly Val Ser Gly Gly Val Va - #l Gly Ser Gly Glu Leu# 18550- Gly Asp Pro Gly Tyr Trp Val Arg His Ala Ar - #g Glu Ala Val Arg Phe# 18705- Ala Asp Gly Val Gly Val Val Arg Gly Leu Gl - #y Val Gly Thr Leu Val# 18850- Glu Val Gly Pro His Gly Val Leu Thr Gly Me - #t Ala Gly Glu Cys Leu# 19005- Gly Ala Gly Asp Asp Val Val Val Val Pro Al - #a Met Arg Arg Gly Arg# 19201910 - # 1915- Ala Glu Arg Glu Val Phe Glu Ala Ala Leu Al - #a Thr Val Phe Thr Arg# 19350- Asp Ala Gly Leu Asp Ala Thr Ala Leu His Th - #r Gly Ser Thr Gly Arg# 19505- Arg Ile Asp Leu Pro Thr Tyr Pro Phe Gln Ar - #g Asp Arg Tyr Trp Leu# 19650- Asp Pro Val Arg Thr Ala Val Thr Gly Val Gl - #u Pro Ala Gly Ser Pro# 19805- Ala Asp Ala Arg Ala Thr Glu Arg Gly Arg Se - #r Thr Thr Ala Gly Ile# 20001990 - # 1995- Arg Tyr Arg Val Ala Trp Gln Pro Ala Val Va - #l Asp Arg Gly Asn Pro# 20150- Gly Pro Ala Gly His Val Leu Leu Leu Ala Pr - #o Asp Glu Asp Thr Ala# 20305- Asp Ser Gly Leu Ala Pro Ala Ile Ala Arg Gl - #u Leu Ala Val Arg Gly# 20450- Ala Glu Val His Thr Val Ala Val Pro Val Gl - #y Thr Gly Arg Glu Ala# 20605- Ala Gly Asp Leu Leu Arg Ala Ala Gly Asp Gl - #y Ala Ala Arg Ser Thr# 20802070 - # 2075- Arg Val Leu Trp Leu Ala Pro Ala Glu Pro As - #p Ala Ala Asp Ala Val# 20950- Ala Leu Val Gln Ala Leu Gly Glu Ala Val Pr - #o Glu Ala Pro Leu Trp# 21105- Ile Thr Thr Arg Glu Ala Ala Ala Val Arg Pr - #o Asp Glu Thr Pro Ser# 21250- Val Gly Gly Ala Gln Leu Trp Gly Leu Gly Gl - #n Val Ala Ala Leu Glu# 21405- Leu Gly Arg Arg Trp Gly Gly Leu Ala Asp Le - #u Pro Gly Ser Ala Ser# 21602150 - # 2155- Pro Ala Val Leu Arg Thr Phe Val Gly Ala Le - #u Leu Ala Gly Gly Glu# 21750- Asn Gln Phe Ala Val Arg Pro Ser Gly Val Hi - #s Val Arg Arg Val Val# 21905- Pro Ala Pro Val Pro Val Pro Ala Ser Ala Ar - #g Thr Val Thr Thr Ala# 22050- Pro Ala Thr Ala Val Gly Glu Asp Ala Arg As - #n Asp Thr Ser Asp Val# 22205- Val Val Pro Asp Asp Arg Trp Ser Ser Gly Th - #r Val Leu Ile Thr Gly# 22402230 - # 2235- Gly Thr Gly Ala Leu Gly Ala Gln Val Ala Ar - #g Arg Leu Ala Arg Ser# 22550- Gly Ala Ala Arg Leu Leu Leu Val Gly Arg Ar - #g Gly Ala Ala Gly Pro# 22705- Gly Val Gly Glu Leu Val Glu Glu Leu Thr Al - #a Leu Gly Ser Glu Val# 22850- Ala Val Glu Ala Cys Asp Val Ala Asp Arg As - #p Ala Leu Ala Ala Leu# 23005- Leu Ala Gly Leu Pro Glu Glu Arg Pro Leu Va - #l Ala Val Leu His Ala# 23202310 - # 2315- Ala Gly Val Leu Asp Asp Gly Val Leu Asp Se - #r Leu Thr Ser Asp Arg# 23350- Val Asp Ala Val Leu Arg Asp Lys Val Thr Al - #a Ala Arg His Leu Asp# 23505- Glu Leu Thr Ala Asp Leu Pro Leu Asp Ala Ph - #e Val Leu Phe Ser Ser# 23650- Ile Val Gly Val Trp Gly Asn Gly Gly Gln Al - #a Val Tyr Ala Ala Ala# 23805- Asn Ala Ala Leu Asp Ala Leu Ala Gln Arg Ar - #g Arg Ala Arg Gly Ala# 24002390 - # 2395- Arg Ala Ala Ser Ile Ala Trp Gly Pro Trp Al - #a Gly Ala Gly Met Ala# 24150- Ser Gly Thr Ala Ala Lys Ser Phe Glu Arg As - #p Gly Val Thr Ala Leu# 24305- Asp Pro Glu Arg Ala Leu Asp Val Leu Asp As - #p Val Val Gly Ala Gly# 24450- Gly Thr Ser Ala Ala Gly Thr His Ala Ala Gl - #y Glu Ser Ser Leu Leu# 24605- Val Ala Asp Val Asp Trp Glu Thr Phe Val Gl - #y Arg Ser Val Thr Arg# 24802470 - # 2475- Arg Thr Trp Ser Leu Phe Asp Gly Val Ser Al - #a Ala Arg Ser Ala Arg# 24950- Ala Gly His Ala Ala Asp Asp Arg Ala Ala Le - #u Thr Pro Gly Thr Arg# 25105- Pro Gly Asp Gly Ala Pro Gly Gly Ser Gly Gl - #n Asp Gly Gly Glu Gly# 25250- Arg Pro Trp Leu Ser Val Gly Pro Ser Pro Al - #a Glu Arg Arg Arg Ala# 25405- Leu Leu Thr Leu Val Arg Ser Glu Ala Ala Gl - #y Ile Leu Arg His Ala# 25602550 - # 2555- Ser Ala Asp Ala Val Asp Pro Glu Leu Ala Ph - #e Arg Ser Ala Gly Phe# 25750- Asp Ser Leu Thr Val Leu Glu Leu Arg Asn Ar - #g Leu Thr Ala Ala Thr# 25905- Gly Leu Asn Leu Pro Asn Thr Leu Leu Phe As - #p His Pro Thr Pro Leu# 26050- Ser Leu Ala Ser His Leu His Asp Glu Leu Ph - #e Gly Pro Asp Ser Glu# 26205- Ala Glu Pro Ala Ala Ala Ala Pro Thr Pro Va - #l Met Ala Asp Glu Arg# 26402630 - # 2635- Glu Pro Ile Ala Ile Val Gly Met Ala Cys Ar - #g Tyr Pro Gly Gly Val# 26550- Ala Ser Pro Asp Asp Leu Trp Asp Leu Val Al - #a Gly Asp Gly His Thr# 26705- Leu Ser Pro Phe Pro Ala Asp Arg Gly Trp As - #p Val Glu Gly Leu Tyr# 26850- Asp Pro Glu Pro Gly Val Pro Gly Lys Ser Ty - #r Val Arg Glu Gly Gly# 27005- Phe Leu Arg Ser Ala Ala Glu Phe Asp Ala Gl - #u Phe Phe Gly Ile Ser# 27202710 - # 2715- Pro Arg Glu Ala Thr Ala Met Asp Pro Gln Gl - #n Arg Leu Leu Leu Glu# 27350- Thr Ser Trp Glu Ala Leu Glu Arg Ala Gly Il - #e Val Pro Asp Ser Leu# 27505- Arg Gly Thr Arg Thr Gly Val Phe Ser Gly Il - #e Ser Gln Gln Asp Tyr# 27650- Ala Thr Gln Leu Gly Asp Ala Ala Asp Thr Ty - #r Gly Gly His Val Leu# 27805- Thr Gly Thr Leu Gly Ser Val Ile Ser Gly Ar - #g Val Ala Tyr Ala Leu# 28002790 - # 2795- Gly Leu Glu Gly Pro Ala Leu Thr Val Asp Th - #r Ala Cys Ser Ser Ser# 28150- Leu Val Ala Leu His Leu Ala Val Gln Ser Le - #u Arg Arg Gly Glu Cys# 28305- Asp Leu Ala Leu Ala Gly Gly Val Thr Val Me - #t Ala Thr Pro Thr Val# 28450- Phe Val Glu Phe Ser Arg Gln Arg Gly Leu Al - #a Ala Asp Gly Arg Cys# 28605- Lys Ala Phe Ala Glu Gly Ala Asp Gly Thr Al - #a Trp Ala Glu Gly Val# 28802870 - # 2875- Gly Val Leu Leu Val Glu Arg Leu Ser Asp Al - #a Arg Arg Asn Gly His# 28950- Arg Val Leu Ala Val Val Arg Gly Ser Ala Va - #l Asn Gln Asp Gly Ala# 29105- Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Al - #a Gln Gln Arg Val Ile# 29250- Arg Glu Ala Leu Ala Asp Ala Gly Leu Val Pr - #o Ala Asp Val Asp Val# 29405- Val Glu Ala His Gly Thr Gly Thr Ala Leu Gl - #y Asp Pro Ile Glu Ala# 29602950 - # 2955- Gly Ala Leu Leu Ala Thr Tyr Gly Arg Glu Ar - #g Val Gly Asp Pro Leu# 29750- Trp Leu Gly Ser Leu Lys Ser Asn Ile Gly Hi - #s Ala Gln Ala Ala Ala# 29905- Gly Val Gly Gly Val Ile Lys Val Val Gln Gl - #y Met Arg His Gly Ser# 30050- Leu Pro Arg Thr Leu His Val Asp Ala Pro Se - #r Ser Lys Val Glu Trp# 30205- Ala Ser Gly Ala Val Glu Leu Leu Thr Glu Th - #r Arg Ser Trp Pro Arg# 30403030 - # 3035- Arg Val Glu Arg Val Arg Arg Ala Ala Val Se - #r Ala Phe Gly Val Ser# 30550- Gly Thr Asn Ala His Val Val Leu Glu Glu Al - #a Pro Ala Glu Ala Gly# 30705- Ser Glu His Gly Asp Gly Pro Glu Pro Glu Ar - #g Pro Asp Ala Val Thr# 30850- Gly Pro Leu Ser Trp Val Leu Ser Ala Arg Se - #r Glu Gly Ala Leu Arg# 31005- Ala Gln Ala Val Arg Leu Arg Glu Cys Val Gl - #u Arg Val Gly Ala Asp# 31203110 - # 3115- Pro Arg Asp Val Ala Gly Ser Leu Val Val Se - #r Arg Ala Ser Phe Gly# 31350- Glu Arg Ala Val Val Val Gly Arg Gly Arg Gl - #u Glu Leu Leu Ala Gly# 31505- Leu Asp Val Val Ala Ala Gly Ala Pro Val Gl - #y Val Ser Ser Gly Ala# 31650- Gly Ala Val Val Arg Gly Ser Ala Val Arg Gl - #y Arg Gly Val Gly Val# 31805- Leu Phe Thr Gly Gln Gly Ala Gln Trp Val Gl - #y Met Gly Arg Gly Leu# 32003190 - # 3195- Tyr Ala Gly Gly Gly Val Phe Ala Glu Val Le - #u Asp Glu Val Leu Ser# 32150- Val Val Gly Glu Val Asp Gly Arg Ser Leu Ar - #g Asp Val Met Phe Ala# 32305- Asp Ala Asp Ser Val Leu Gly Gly Leu Leu Gl - #y Arg Thr Glu Phe Ala# 32450- Gln Pro Ala Leu Phe Ala Leu Glu Val Ala Le - #u Phe Arg Ala Leu Glu# 32605- Ala Arg Gly Val Glu Val Ser Val Val Leu Gl - #y His Ser Val Gly Glu# 32803270 - # 3275- Val Ala Ala Ala Tyr Val Ala Gly Val Leu Se - #r Leu Gly Asp Ala Val# 32950- Arg Leu Val Val Ala Arg Gly Gly Leu Met Gl - #y Gly Leu Pro Val Gly# 33105- Gly Gly Met Trp Ser Val Gly Ala Ser Glu Se - #r Val Val Arg Gly Val# 33250- Val Glu Gly Leu Gly Glu Trp Val Ser Val Al - #a Ala Val Asn Gly Pro# 33405- Arg Ser Val Val Leu Ser Gly Asp Val Gly Va - #l Leu Glu Ser Val Val# 33603350 - # 3355- Val Thr Leu Met Gly Asp Gly Val Glu Cys Ar - #g Arg Leu Asp Val Ser# 33750- His Gly Phe His Ser Val Leu Met Glu Pro Va - #l Leu Gly Glu Phe Arg# 33905- Gly Val Val Glu Ser Leu Glu Phe Gly Arg Va - #l Arg Pro Gly Val Val# 34050- Val Val Ser Gly Val Ser Gly Gly Val Val Gl - #y Ser Gly Glu Leu Gly# 34205- Asp Pro Gly Tyr Trp Val Arg His Ala Arg Gl - #u Ala Val Arg Phe Ala# 34403430 - # 3435- Asp Gly Val Gly Val Val Arg Gly Leu Gly Va - #l Gly Thr Leu Val Glu# 34550- Val Gly Pro His Gly Val Leu Thr Gly Met Al - #a Gly Gln Cys Leu Glu# 34705- Ala Gly Asp Asp Val Val Val Val Pro Ala Me - #t Arg Arg Gly Arg Pro# 34850- Glu Arg Glu Val Phe Glu Ala Ala Leu Ala Th - #r Val Phe Thr Arg Asp# 35005- Ala Gly Leu Asp Ala Thr Thr Leu His Thr Gl - #y Ser Thr Gly Arg Arg# 35203510 - # 3515- Ile Asp Leu Pro Thr Tyr Pro Phe Gln His As - #n Arg Tyr Trp Ala Thr# 35350- Gly Ser Val Thr Gly Ala Thr Gly Thr Ser Al - #a Ala Ala Arg Phe Gly# 35505- Leu Glu Trp Lys Asp His Pro Phe Leu Ser Gl - #y Ala Thr Pro Ile Ala# 35650- Gly Ser Gly Ala Leu Leu Leu Thr Gly Arg Va - #l Gly Leu Ala Ala His# 35805- Pro Trp Leu Ala Asp His Ala Ile Ser Gly Th - #r Val Leu Leu Pro Gly# 36003590 - # 3595- Thr Ala Ile Ala Asp Leu Leu Leu Arg Ala Va - #l Glu Glu Val Gly Ala# 36150- Gly Gly Val Glu Glu Leu Thr Leu His Glu Pr - #o Leu Leu Leu Pro Glu# 36305- Arg Gly Gly Leu His Val Gln Val Leu Val Gl - #u Ala Ala Asp Glu Gln# 36450- Gly Arg Arg Ala Val Ala Val Ala Ala Arg Pr - #o Glu Gly Pro Gly Arg# 36605- Asp Gly Glu Glu Gln Glu Trp Thr Arg His Al - #a Glu Gly Val Leu Thr# 36803670 - # 3675- Ser Thr Glu Thr Ala Val Pro Asp Met Gly Tr - #p Ala Ala Gly Ala Trp# 36950- Pro Pro Pro Gly Ala Glu Pro Ile Asp Val Gl - #u Glu Leu Tyr Asp Ala# 37105- Phe Ala Ala Asp Gly Tyr Gly Tyr Gly Pro Al - #a Phe Thr Ala Leu Ser# 37250- Gly Val Trp Arg Leu Gly Asp Glu Leu Phe Al - #a Glu Val Arg Arg Pro# 37405- Ala Gly Gly Ala Gly Thr Thr Gly Asp Gly Ph - #e Gly Val His Pro Ala# 37603750 - # 3755- Leu Phe Asp Ala Ala Leu His Pro Trp Arg Al - #a Gly Gly Leu Leu Pro# 37750- Asp Thr Gly Gly Thr Thr Trp Ala Pro Phe Se - #r Trp Gln Gly Ile Ala# 37905- Leu His Thr Thr Gly Ala Glu Thr Leu Arg Va - #l Arg Leu Ala Pro Ala# 38050- Ala Gly Gly Thr Glu Ser Ala Phe Ser Val Gl - #n Ala Ala Asp Pro Ala# 38205- Gly Thr Pro Val Leu Thr Leu Asp Ala Leu Le - #u Leu Arg Pro Val Thr# 38403830 - # 3835- Leu Gly Arg Ala Asp Ala Pro Gln Pro Leu Ty - #r Arg Val Asp Trp Gln# 38550- Pro Val Gly Gln Gly Thr Glu Ala Ser Gly Al - #a Gln Gly Trp Thr Val# 38705- Leu Gly Gln Ala Ala Ala Glu Thr Val Ala Gl - #n Pro Ala Ala His Ala# 38850- Asp Leu Thr Ala Leu Arg Thr Ala Val Ala Al - #a Ala Gly Thr Pro Val# 39005- Pro Arg Leu Val Val Val Ser Pro Val Asp Th - #r Arg Leu Asp Glu Gly# 39203910 - # 3915- Pro Val Leu Ala Asp Ala Glu Ala Arg Ala Ar - #g Ala Gly Asp Gly Trp# 39350- Asp Asp Asp Pro Leu Arg Val Ala Leu Gly Ar - #g Gly Leu Thr Leu Val# 39505- Arg Glu Trp Val Glu Asp Glu Arg Leu Ala As - #p Ser Arg Leu Val Val# 39650- Leu Thr Arg Gly Ala Val Ala Ala Gly Pro Gl - #y Asp Val Pro Asp Leu# 39805- Thr Gly Ala Ala Leu Trp Gly Leu Leu Arg Se - #r Ala Gln Ser Glu Tyr# 40003990 - # 3995- Pro Asp Arg Phe Thr Leu Ile Asp Val Asp As - #p Ser Pro Glu Ser Arg# 40150- Ala Ala Leu Pro Arg Ala Leu Gly Ser Ala Gl - #u Arg Gln Leu Ala Leu# 40305- Arg Thr Gly Asp Val Leu Ala Pro Ala Leu Va - #l Pro Met Ala Thr Arg# 40450- Pro Ala Glu Thr Thr Pro Ala Thr Ala Val Al - #a Ser Ala Thr Thr Gln# 40605- Thr Gln Val Thr Ala Pro Ala Pro Asp Asp Pr - #o Ala Ala Asp Ala Val# 40804070 - # 4075- Phe Asp Pro Ala Gly Thr Val Leu Ile Thr Gl - #y Gly Thr Gly Ala Leu# 40950- Gly Arg Arg Val Ala Ser His Leu Ala Arg Ar - #g Tyr Gly Val Arg His# 41105- Met Leu Leu Val Ser Arg Arg Gly Pro Asp Al - #a Pro Glu Ala Gly Pro# 41250- Leu Glu Arg Glu Leu Ala Gly Leu Gly Val Th - #r Ala Thr Phe Leu Ala# 41405- Cys Asp Leu Thr Asp Ile Glu Ala Val Arg Ly - #s Ala Val Ala Ala Val# 41604150 - # 4155- Pro Ser Asp His Pro Leu Thr Gly Val Val Hi - #s Thr Ala Gly Val Leu# 41750- Asp Asp Gly Ala Leu Thr Gly Leu Thr Arg Gl - #n Arg Leu Asp Thr Val# 41905- Leu Arg Pro Lys Ala Asp Ala Val Arg Asn Le - #u His Glu Ala Thr Leu# 42050- Asp Arg Pro Leu Arg Ala Phe Val Leu Phe Se - #r Ala Ala Ala Gly Leu# 42205- Leu Gly Arg Pro Gly Gln Ala Ser Tyr Ala Al - #a Ala Asn Ala Val Leu# 42404230 - # 4235- Asp Ala Leu Ala Gly Ala Arg Arg Ala Ala Gl - #y Leu Pro Ala Val Ser# 42550- Leu Ala Trp Gly Leu Trp Asp Glu Gln Thr Gl - #y Met Ala Gly Gly Leu# 42705- Asp Glu Met Ala Leu Arg Val Leu Arg Arg As - #p Gly Ile Ala Ala Met# 42850- Pro Pro Glu Gln Gly Leu Glu Leu Leu Asp Le - #u Ala Leu Thr Gly His# 43005- Arg Asp Gly Pro Ala Val Leu Val Pro Leu Le - #u Leu Asp Gly Ala Ala# 43204310 - # 4315- Leu Arg Arg Thr Ala Lys Glu Arg Gly Ala Al - #a Thr Met Ser Pro Leu# 43350- Leu Arg Ala Leu Leu Pro Ala Ala Leu Arg Ar - #g Ser Gly Gly Ala Gly# 43505- Ala Pro Ala Ala Ala Asp Arg His Gly Lys Gl - #u Ala Asp Pro Gly Ala# 43650- Gly Arg Leu Ala Gly Met Val Ala Leu Glu Al - #a Ala Glu Arg Ser Ala# 43805- Ala Val Leu Glu Leu Val Thr Glu Gln Val Al - #a Glu Val Leu Gly Tyr# 44004390 - # 4395- Ala Ser Ala Ala Glu Ile Glu Pro Glu Arg Pr - #o Phe Arg Glu Ile Gly# 44150- Val Asp Ser Leu Ala Ala Val Glu Leu Arg As - #n Arg Leu Ser Arg Leu# 44305- Val Gly Leu Arg Leu Pro Thr Thr Leu Ser Ph - #e Asp His Pro Thr Pro# 44450- Lys Asp Met Ala Gln His Ile Asp Gly Gln Le - #u Pro Arg Pro Ala Gly# 44605- Ala Ser Pro Ala Asp Ala Ala Leu Glu Gly Il - #e Gly Asp Leu Ala Arg# 44804470 - # 4475- Ala Val Ala Leu Leu Gly Thr Gly Asp Ala Ar - #g Arg Ala Glu Val Arg# 44950- Glu Gln Leu Val Gly Leu Leu Ala Ala Leu As - #p Pro Pro Gly Arg Thr# 45105- Gly Thr Ala Ala Pro Gly Val Pro Ser Gly Al - #a Asp Gly Ala Glu Pro# 45250- Thr Val Thr Asp Arg Leu Asp Glu Ala Thr As - #p Asp Glu Ile Phe Ala# 45405- Phe Leu Asp Glu Gln Leu4545 4550- (2) INFORMATION FOR SEQ ID NO:3:- (i) SEQUENCE CHARACTERISTICS:#acids (A) LENGTH: 1996 amino (B) TYPE: amino acid (D) TOPOLOGY: unknown- (ii) MOLECULE TYPE: peptide- (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:- Met Thr Ala Glu Asn Asp Lys Ile Arg Ser Ty - #r Leu Lys Arg Ala Thr# 15- Ala Glu Leu His Arg Thr Lys Ser Arg Leu Al - #a Glu Val Glu Ser Ala# 30- Ser Arg Glu Pro Ile Ala Ile Val Gly Met Al - #a Cys Arg Tyr Pro Gly# 45- Gly Val Ala Ser Pro Asp Asp Leu Trp Asp Le - #u Val Ala Ala Gly Thr# 60- Asp Ala Val Ser Ala Phe Pro Val Asp Arg Gl - #y Trp Asp Val Glu Gly#80- Leu Tyr Asp Pro Asp Pro Glu Ala Val Gly Ar - #g Ser Tyr Val Arg Glu# 95- Gly Gly Phe Leu His Ser Ala Ala Glu Phe As - #p Ala Glu Phe Phe Gly# 110- Ile Ser Pro Arg Glu Ala Ala Ala Met Asp Pr - #o Gln Gln Arg Leu Leu# 125- Leu Glu Thr Ser Trp Glu Ala Leu Glu Arg Al - #a Gly Ile Val Pro Ala# 140- Ser Leu Arg Gly Thr Arg Thr Gly Val Phe Th - #r Gly Val Met Tyr Asp145 1 - #50 1 - #55 1 -#60- Asp Tyr Gly Ser Arg Phe Asp Ser Ala Pro Pr - #o Glu Tyr Glu Gly Tyr# 175- Leu Val Asn Gly Ser Ala Gly Ser Ile Ala Se - #r Gly Arg Val Ala Tyr# 190- Ala Leu Gly Leu Glu Gly Pro Ala Leu Thr Va - #l Asp Thr Ala Cys Ser# 205- Ser Ser Leu Val Ala Leu His Leu Ala Val Gl - #n Ser Leu Arg Arg Gly# 220- Glu Cys Asp Leu Ala Leu Ala Gly Gly Val Th - #r Val Met Ala Thr Pro225 2 - #30 2 - #35 2 -#40- Thr Val Leu Val Glu Phe Ser Arg Gln Arg Gl - #y Leu Ala Ala Asp Gly# 255- Arg Cys Lys Ala Phe Ala Glu Gly Ala Asp Gl - #y Thr Ala Trp Ala Glu# 270- Gly Val Gly Val Leu Leu Val Glu Arg Leu Se - #r Asp Ala Arg Arg Asn# 285- Gly His Arg Val Leu Ala Val Val Arg Gly Se - #r Ala Val Asn Gln Asp# 300- Gly Ala Ser Asn Gly Leu Thr Ala Pro Ser Gl - #y Pro Ala Gln Gln Arg305 3 - #10 3 - #15 3 -#20- Val Ile Arg Glu Ala Leu Ala Asp Ala Gly Le - #u Thr Pro Ala Asp Val# 335- Asp Ala Val Glu Ala His Gly Thr Gly Thr Pr - #o Leu Gly Asp Pro Ile# 350- Glu Ala Gly Ala Leu Leu Ala Thr Tyr Gly Se - #r Glu Arg Gln Gly Gln# 365- Gly Pro Leu Trp Leu Gly Ser Leu Lys Ser As - #n Ile Gly His Ala Gln# 380- Ala Ala Ala Gly Val Gly Gly Val Ile Lys Va - #l Val Gln Ala Met Arg385 3 - #90 3 - #95 4 -#00- His Gly Ser Leu Pro Arg Thr Leu His Val As - #p Ala Pro Ser Ser Lys# 415- Val Glu Trp Ala Ser Gly Ala Val Glu Leu Le - #u Thr Glu Thr Arg Ser# 430- Trp Pro Arg Arg Val Glu Arg Val Arg Arg Al - #a Ala Val Ser Ala Phe# 445- Gly Val Ser Gly Thr Asn Ala His Val Val Le - #u Glu Glu Ala Pro Ala# 460- Glu Ala Gly Ser Glu His Gly Asp Gly Pro Gl - #u Pro Glu Arg Pro Asp465 4 - #70 4 - #75 4 -#80- Ala Val Thr Gly Pro Leu Ser Trp Val Leu Se - #r Ala Arg Ser Glu Gly# 495- Ala Leu Arg Ala Gln Ala Val Arg Leu Arg Gl - #u Cys Val Glu Arg Val# 510- Gly Ala Asp Pro Arg Asp Val Ala Gly Ser Le - #u Val Val Ser Arg Ala# 525- Ser Phe Gly Glu Arg Ala Val Val Val Gly Ar - #g Gly Arg Glu Glu Leu# 540- Leu Ala Gly Leu Asp Val Val Ala Ala Gly Al - #a Pro Val Gly Val Ser545 5 - #50 5 - #55 5 -#60- Gly Gly Val Ser Ser Gly Ala Gly Ala Val Va - #l Arg Gly Ser Ala Val# 575- Arg Gly Arg Gly Val Gly Val Leu Phe Thr Gl - #y Gln Gly Ala Gln Trp# 590- Val Gly Met Gly Arg Gly Leu Tyr Ala Gly Gl - #y Gly Val Phe Ala Glu# 605- Val Leu Asp Glu Val Leu Ser Val Val Gly Gl - #u Val Gly Gly Trp Ser# 620- Leu Arg Asp Val Met Phe Gly Asp Val Asp Va - #l Asp Ala Gly Ala Gly625 6 - #30 6 - #35 6 -#40- Ala Asp Ala Gly Val Gly Ser Gly Val Gly Va - #l Gly Gly Leu Leu Gly# 655- Arg Thr Glu Phe Ala Gln Pro Ala Leu Phe Al - #a Leu Glu Val Ala Leu# 670- Phe Arg Ala Leu Glu Ala Arg Gly Val Glu Va - #l Ser Val Val Leu Gly# 685- His Ser Val Gly Glu Val Ala Ala Ala Tyr Va - #l Ala Gly Val Leu Ser# 700- Leu Gly Asp Ala Val Arg Leu Val Val Ala Ar - #g Gly Gly Leu Met Gly705 7 - #10 7 - #15 7 -#20- Gly Leu Pro Val Gly Gly Gly Met Trp Ser Va - #l Gly Ala Ser Glu Ser# 735- Val Val Arg Gly Val Val Glu Gly Leu Gly Gl - #u Trp Val Ser Val Ala# 750- Ala Val Asn Gly Pro Arg Ser Val Val Leu Se - #r Gly Asp Val Gly Val# 765- Leu Glu Ser Val Val Ala Ser Leu Met Gly As - #p Gly Val Glu Cys Arg# 780- Arg Leu Asp Val Ser His Gly Phe His Ser Va - #l Leu Met Glu Pro Val785 7 - #90 7 - #95 8 -#00- Leu Gly Glu Phe Arg Gly Val Val Glu Ser Le - #u Glu Phe Gly Arg Val# 815- Arg Pro Gly Val Val Val Val Ser Ser Val Se - #r Gly Gly Val Val Gly# 830- Ser Gly Glu Leu Gly Asp Pro Gly Tyr Trp Va - #l Arg His Ala Arg Glu# 845- Ala Val Arg Phe Ala Asp Gly Val Gly Val Va - #l Arg Gly Leu Gly Val# 860- Gly Thr Leu Val Glu Val Gly Pro His Gly Va - #l Leu Thr Gly Met Ala865 8 - #70 8 - #75 8 -#80- Gly Glu Cys Leu Gly Ala Gly Asp Asp Val Va - #l Val Val Pro Ala Met# 895- Arg Arg Gly Arg Ala Glu Arg Glu Val Phe Gl - #u Ala Ala Leu Ala Thr# 910- Val Phe Thr Arg Asp Ala Gly Leu Asp Ala Th - #r Thr Leu His Thr Gly# 925- Ser Thr Gly Arg Arg Ile Asp Leu Pro Thr Ty - #r Pro Phe Gln His Asp# 940- Arg Tyr Trp Leu Ala Ala Pro Ser Arg Pro Ar - #g Thr Asp Gly Leu Ser945 9 - #50 9 - #55 9 -#60- Ala Ala Gly Leu Arg Glu Val Glu His Pro Le - #u Leu Thr Ala Ala Val# 975- Glu Leu Pro Gly Thr Asp Thr Glu Val Trp Th - #r Gly Arg Ile Ser Ala# 990- Ala Asp Leu Pro Trp Leu Ala Asp His Leu Va - #l Trp Asp Arg Gly Val# 10050- Val Pro Gly Thr Ala Leu Leu Glu Thr Val Le - #u Gln Val Gly Ser Arg# 10205- Ile Gly Leu Pro Arg Val Ala Glu Leu Val Le - #u Glu Thr Pro Leu Thr# 10401030 - # 1035- Trp Thr Ser Asp Arg Pro Leu Gln Val Arg Il - #e Val Val Thr Ala Ala# 10550- Ala Thr Ala Pro Gly Gly Ala Arg Glu Leu Th - #r Leu His Ser Arg Pro# 10705- Glu Pro Val Ala Ala Ser Ser Ser Ser Pro Se - #r Pro Ala Ser Pro Arg# 10850- His Leu Thr Ala Gln Glu Ser Asp Asp Asp Tr - #p Thr Arg His Ala Ser# 11005- Gly Leu Leu Ala Pro Ala Ala Gly Leu Ala As - #p Asp Phe Ala Glu Leu# 11201110 - # 1115- Thr Gly Ala Trp Pro Pro Val Gly Ala Glu Pr - #o Leu Asp Leu Ala Gly# 11350- Gln Tyr Pro Leu Phe Ala Ala Ala Gly Val Ar - #g Tyr Glu Gly Ala Phe# 11505- Arg Gly Leu Arg Ala Ala Trp Arg Arg Gly As - #p Glu Val Phe Ala Asp# 11650- Val Arg Leu Pro Asp Ala His Ala Val Asp Al - #a Asp Arg Tyr Gly Val# 11805- His Pro Ala Leu Leu Asp Ala Val Leu His Pr - #o Ile Ala Ser Leu Asp# 12001190 - # 1195- Pro Leu Gly Asp Gly Gly His Gly Leu Leu Pr - #o Phe Ser Trp Thr Asp# 12150- Val Gln Gly His Gly Ala Gly Gly His Ala Le - #u Arg Val Arg Val Ala# 12305- Ala Val Asp Gly Gly Ala Val Ser Val Thr Al - #a Ala Asp His Ala Gly# 12450- Asn Pro Val Leu Ser Ala Arg Ser Leu Ala Le - #u Arg Arg Ile Thr Ala# 12605- Asp Arg Leu Pro Ala Ala Pro Val Ala Pro Le - #u Tyr Arg Val Asp Trp# 12801270 - # 1275- Leu Pro Phe Pro Gly Pro Val Pro Val Ser Al - #a Gly Gly Arg Trp Ala# 12950- Val Val Gly Pro Glu Ala Glu Ala Thr Ala Al - #a Gly Leu Arg Ala Val# 13105- Gly Leu Asp Val Arg Thr His Ala Leu Pro Le - #u Gly Glu Pro Leu Pro# 13250- Pro Gln Ala Gly Thr Asp Ala Glu Val Ile Il - #e Leu Asp Leu Thr Thr# 13405- Thr Ala Ala Gly Arg Thr Ala Ser Asp Gly Gl - #y Arg Leu Ser Leu Leu# 13601350 - # 1355- Asp Glu Val Arg Ala Thr Val Arg Arg Thr Le - #u Glu Ala Val Gln Ala# 13750- Arg Leu Ala Asp Thr Glu Thr Ala Pro Asp Va - #l Asp Val Arg Thr Ala# 13905- Ala Arg Pro Arg Thr Ala Ala Arg Thr Ser Pr - #o Arg Val Asp Thr Arg# 14050- Thr Gly Ala Arg Thr Ala Asp Gly Pro Arg Le - #u Val Val Leu Thr Arg# 14205- Gly Ala Ala Gly Pro Glu Gly Gly Ala Ala As - #p Pro Ala Gly Ala Ala# 14401430 - # 1435- Val Trp Gly Leu Val Arg Val Ala Gln Ala Gl - #u Gln Pro Gly Arg Phe# 14550- Thr Leu Val Asp Val Asp Gly Thr Gln Ala Se - #r Leu Arg Ala Leu Pro# 14705- Gly Leu Leu Ala Thr Asp Ala Gly Gln Ser Al - #a Val Arg Asp Gly Arg# 14850- Val Thr Val Pro Arg Leu Val Pro Val Ala As - #p Pro Val Pro His Gly# 15005- Gly Gly Thr Ala Ala Asp Gly Thr Gly Ala Gl - #y Glu Pro Ser Ala Thr# 15201510 - # 1515- Leu Asp Pro Glu Gly Thr Val Leu Ile Thr Gl - #y Gly Thr Gly Ala Leu# 15350- Ala Ala Glu Thr Ala Arg His Leu Val Asp Ar - #g His Lys Val Arg His# 15505- Leu Leu Leu Val Gly Arg Arg Gly Pro Asp Al - #a Pro Gly Val Asp Arg# 15650- Leu Val Ala Glu Leu Thr Glu Ser Gly Ala Gl - #u Val Ala Val Arg Ala# 15805- Cys Asp Val Thr Asp Arg Asp Ala Leu Arg Ar - #g Leu Leu Asp Ala Leu# 16001590 - # 1595- Pro Asp Glu His Pro Leu Thr Cys Val Val Hi - #s Thr Ala Gly Val Leu# 16150- Asp Asp Gly Val Leu Ser Ala Gln Thr Ala Gl - #u Arg Ile Asp Thr Val# 16305- Leu Arg Pro Lys Ala Asp Ala Ala Val His Le - #u Asp Glu Leu Thr Arg# 16450- Glu Ile Gly Arg Val Pro Leu Val Leu Tyr Se - #r Ser Val Ser Ala Thr# 16605- Leu Gly Ser Ala Gly Gln Ala Gly Tyr Ala Al - #a Ala Asn Ala Phe Met# 16801670 - # 1675- Asp Ala Leu Ala Ala Arg Arg Cys Ala Ala Gl - #y His Pro Ala Leu Ser# 16950- Leu Gly Trp Gly Trp Trp Ser Gly Val Gly Le - #u Ala Thr Gly Leu Asp# 17105- Gly Ala Asp Ala Ala Arg Val Arg Arg Ser Gl - #y Leu Ala Pro Leu Asp# 17250- Ala Gly Ala Ala Leu Asp Leu Leu Asp Arg Al - #a Leu Thr Arg Pro Glu# 17405- Pro Ala Leu Leu Pro Val Arg Leu Asp Leu Ar - #g Ala Ala Ala Gly Ala# 17601750 - # 1755- Thr Ala Leu Pro Glu Val Leu Arg Asp Leu Al - #a Gly Val Pro Ala Asp# 17750- Ala Arg Ser Thr Pro Gly Ala Ala Ala Gly Th - #r Gly Asp Glu Asp Gly# 17905- Ala Val Arg Pro Ala Pro Ala Pro Ala Asp Al - #a Ala Gly Thr Leu Ala# 18050- Ala Arg Leu Ala Gly Arg Ser Ala Pro Glu Ar - #g Thr Ala Leu Leu Leu# 18205- Asp Leu Val Arg Thr Glu Val Ala Ala Val Le - #u Gly His Gly Asp Pro# 18401830 - # 1835- Ala Ala Ile Gly Ala Ala Arg Thr Phe Lys As - #p Ala Gly Phe Asp Ser# 18550- Leu Thr Ala Val Asp Leu Arg Asn Arg Leu As - #n Thr Arg Thr Gly Leu# 18705- Arg Leu Pro Ala Thr Leu Val Phe Asp His Pr - #o Thr Pro Leu Ala Leu# 18850- Ala Glu Leu Leu Leu Asp Gly Leu Glu Ala Al - #a Gly Pro Ala Glu Pro# 19005- Ala Ala Glu Val Pro Asp Glu Ala Ala Gly Al - #a Glu Thr Leu Ser Gly# 19201910 - # 1915- Val Ile Asp Arg Leu Glu Arg Ser Leu Ala Al - #a Thr Asp Asp Gly Asp# 19350- Ala Arg Val Arg Ala Ala Arg Arg Leu Arg Gl - #y Leu Leu Asp Ala Leu# 19505- Pro Ala Gly Pro Gly Ala Ala Ser Gly Pro As - #p Ala Gly Glu His Ala# 19650- Pro Gly Arg Gly Asp Val Val Ile Asp Arg Le - #u Arg Ser Ala Ser Asp# 19805- Asp Asp Leu Phe Asp Leu Leu Asp Ser Asp Ph - #e Gln1985 1990 - # 1995- (2) INFORMATION FOR SEQ ID NO:4:- (i) SEQUENCE CHARACTERISTICS:#acids (A) LENGTH: 3724 amino (B) TYPE: amino acid (D) TOPOLOGY: unknown- (ii) MOLECULE TYPE: peptide- (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:- Met Ser Ala Thr Asn Glu Glu Lys Leu Arg Gl - #u Tyr Leu Arg Arg Ala# 15- Met Ala Asp Leu His Ser Ala Arg Glu Arg Le - #u Arg Glu Val Glu Ser# 30- Ala Ser Arg Glu Pro Ile Ala Ile Val Gly Me - #t Ala Cys Arg Tyr Pro# 45- Gly Gly Val Ala Ser Pro Glu Glu Leu Trp As - #p Leu Val Ala Ala Gly# 60- Thr Asp Ala Ile Ser Pro Phe Pro Val Asp Ar - #g Gly Trp Asp Ala Glu#80- Gly Leu Tyr Asp Pro Glu Pro Gly Val Pro Gl - #y Lys Ser Tyr Val Arg# 95- Glu Gly Gly Phe Leu His Ser Ala Ala Glu Ph - #e Asp Ala Glu Phe Phe# 110- Gly Ile Ser Pro Arg Glu Ala Ala Ala Met As - #p Pro Gln Gln Arg Leu# 125- Leu Leu Glu Thr Ser Trp Glu Ala Leu Glu Ar - #g Ala Gly Ile Val Pro# 140- Ala Ser Leu Arg Gly Thr Arg Thr Gly Val Ph - #e Thr Gly Val Met Tyr145 1 - #50 1 - #55 1 -#60- His Asp Tyr Gly Ser His Gln Val Gly Thr Al - #a Ala Asp Pro Ser Gly# 175- Gln Leu Gly Leu Gly Thr Ala Gly Ser Val Al - #a Ser Gly Arg Val Ala# 190- Tyr Thr Leu Gly Leu Gln Gly Pro Ala Val Th - #r Met Asp Thr Ala Cys# 205- Ser Ser Ser Leu Val Ala Leu His Leu Ala Va - #l Gln Ser Leu Arg Arg# 220- Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly Al - #a Thr Val Leu Ala Thr225 2 - #30 2 - #35 2 -#40- Pro Thr Val Phe Val Glu Phe Ser Arg Gln Ar - #g Gly Leu Ala Ala Asp# 255- Gly Arg Cys Lys Ala Phe Ala Glu Gly Ala As - #p Gly Thr Ala Trp Ala# 270- Glu Gly Ala Gly Val Leu Leu Val Glu Arg Le - #u Ser Asp Ala Arg Arg# 285- Asn Gly His Arg Val Leu Ala Val Val Arg Gl - #y Ser Ala Val Asn Gln# 300- Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Se - #r Gly Pro Ala Gln Gln305 3 - #10 3 - #15 3 -#20- Arg Val Ile Arg Asp Ala Leu Ala Asp Ala Gl - #y Leu Thr Pro Ala Asp# 335- Val Asp Ala Val Glu Ala His Gly Thr Gly Th - #r Pro Leu Gly Asp Pro# 350- Ile Glu Ala Gly Ala Leu Met Ala Thr Tyr Gl - #y Ser Glu Arg Val Gly# 365- Asp Pro Leu Trp Leu Gly Ser Leu Lys Ser As - #n Ile Gly His Thr Gln# 380- Ala Ala Ala Gly Ala Ala Gly Val Ile Lys Me - #t Val Gln Ala Leu Arg385 3 - #90 3 - #95 4 -#00- Gln Ser Glu Leu Pro Arg Thr Leu His Val As - #p Ala Pro Ser Ala Lys# 415- Val Glu Trp Asp Ala Gly Ala Val Gln Leu Le - #u Thr Gly Val Arg Pro# 430- Trp Pro Arg Arg Glu His Arg Pro Arg Arg Al - #a Ala Val Ser Ala Phe# 445- Gly Val Ser Gly Thr Asn Ala His Val Ile Il - #e Glu Glu Pro Pro Ala# 460- Ala Gly Asp Thr Ser Pro Ala Gly Asp Thr Pr - #o Glu Pro Gly Glu Ala465 4 - #70 4 - #75 4 -#80- Thr Ala Ser Pro Ser Thr Ala Ala Gly Pro Se - #r Ser Pro Ser Ala Val# 495- Ala Gly Pro Leu Ser Pro Ser Ser Pro Ala Va - #l Val Trp Pro Leu Ser# 510- Ala Glu Thr Ala Pro Ala Leu Arg Ala Gln Al - #a Ala Arg Leu Arg Ala# 525- His Leu Glu Arg Leu Pro Gly Thr Ser Pro Th - #r Asp Ile Gly His Ala# 540- Leu Ala Ala Glu Arg Ala Ala Leu Thr Arg Ar - #g Val Val Leu Leu Gly545 5 - #50 5 - #55 5 -#60- Asp Asp Gly Ala Pro Val Asp Ala Leu Ala Al - #a Leu Ala Ala Gly Glu# 575- Thr Thr Pro Asp Ala Val His Gly Thr Ala Al - #a Asp Ile Arg Arg Val# 590- Ala Phe Val Phe Pro Gly Gln Gly Ser Gln Tr - #p Ala Gly Met Gly Ala# 605- Glu Leu Leu Asp Thr Ala Pro Ala Phe Ala Al - #a Glu Leu Asp Arg Cys# 620- Gln Gly Ala Leu Ser Pro Tyr Val Asp Trp As - #n Leu Ala Asp Val Leu625 6 - #30 6 - #35 6 -#40- Arg Gly Ala Pro Ala Ala Pro Gly Leu Asp Ar - #g Val Asp Val Val Gln# 655- Pro Ala Thr Phe Ala Val Met Val Gly Leu Al - #a Ala Leu Trp Arg Ser# 670- Leu Gly Val Glu Pro Ala Ala Val Ile Gly Hi - #s Ser Gln Gly Glu Ile# 685- Ala Ala Ala Cys Val Ala Gly Ala Leu Ser Le - #u Glu Asp Ala Ala Arg# 700- Ile Val Ala Leu Arg Ser Gln Val Ile Ala Ar - #g Glu Leu Ala Gly Arg705 7 - #10 7 - #15 7 -#20- Gly Gly Met Ala Ser Val Ala Leu Pro Ala Al - #a Glu Val Glu Ala Arg# 735- Leu Ala Gly Gly Val Glu Ile Ala Ala Val As - #n Gly Pro Gly Ser Thr# 750- Val Val Cys Gly Glu Pro Gly Ala Leu Glu Al - #a Leu Leu Val Thr Leu# 765- Glu Ser Glu Gly Thr Arg Val Arg Arg Ile As - #p Val Asp Tyr Ala Ser# 780- His Ser His Tyr Val Glu Ser Ile Arg Ala Gl - #u Leu Ala Thr Val Leu785 7 - #90 7 - #95 8 -#00- Gly Pro Val Arg Pro Arg Arg Gly Asp Val Pr - #o Phe Tyr Ser Thr Val# 815- Glu Ala Ala Leu Leu Asp Thr Ala Thr Leu As - #p Ala Asp Tyr Trp Tyr# 830- Arg Asn Leu Arg Leu Pro Val Arg Phe Glu Pr - #o Thr Val Arg Ala Met# 845- Leu Asp Asp Gly Val Asp Ala Phe Val Glu Cy - #s Ser Ala His Pro Val# 860- Leu Thr Val Gly Val Arg Gln Thr Val Glu Se - #r Ala Gly Gly Ala Val865 8 - #70 8 - #75 8 -#80- Pro Ala Leu Ala Ser Leu Arg Arg Asp Glu Gl - #y Gly Leu Arg Arg Phe# 895- Leu Thr Ser Ala Ala Glu Ala Gln Val Val Gl - #y Val Pro Val Asp Trp# 910- Ala Thr Leu Arg Pro Gly Ala Gly Arg Val As - #p Leu Pro Thr Tyr Ala# 925- Phe Gln Arg Glu Arg His Trp Val Gly Pro Al - #a Arg Pro Asp Ser Ala# 940- Ala Thr Ala Ala Thr Thr Gly Asp Asp Ala Pr - #o Glu Pro Gly Asp Arg945 9 - #50 9 - #55 9 -#60- Leu Gly Tyr His Val Ala Trp Lys Gly Leu Ar - #g Ser Thr Thr Gly Gly# 975- Trp Arg Pro Gly Leu Arg Leu Leu Ile Val Pr - #o Thr Gly Asp Gln Tyr# 990- Thr Ala Leu Ala Asp Thr Leu Glu Gln Ala Va - #l Ala Ser Phe Gly Gly# 10050- Thr Val Arg Arg Val Ala Phe Asp Pro Ala Ar - #g Thr Gly Arg Ala Glu# 10205- Leu Phe Gly Leu Leu Glu Thr Glu Ile Asn Gl - #y Asp Thr Ala Val Thr# 10401030 - # 1035- Gly Val Val Ser Leu Leu Gly Leu Cys Thr As - #p Gly Arg Pro Asp His# 10550- Pro Ala Val Pro Val Ala Val Thr Ala Thr Le - #u Ala Leu Val Gln Ala# 10705- Leu Ala Asp Leu Gly Ser Thr Ala Pro Leu Tr - #p Thr Val Thr Cys Gly# 10850- Ala Val Ala Thr Ala Pro Asp Glu Leu Pro Cy - #s Thr Ala Gly Ala Gln# 11005- Leu Trp Gly Leu Gly Arg Val Ala Ala Leu Gl - #u Leu Pro Glu Val Trp# 11201110 - # 1115- Gly Gly Leu Ile Asp Leu Pro Ala Arg Pro As - #p Ala Arg Val Leu Asp# 11350- Arg Leu Ala Gly Val Leu Ala Glu Pro Gly Gl - #y Glu Asp Gln Ile Ala# 11505- Val Arg Met Ala Gly Val Phe Gly Arg Arg Va - #l Leu Arg Asn Pro Ala# 11650- Asp Ser Arg Pro Pro Ala Trp Arg Ala Arg Gl - #y Thr Val Leu Ile Ala# 11805- Gly Asp Leu Thr Thr Val Pro Gly Arg Leu Va - #l Arg Ser Leu Leu Glu# 12001190 - # 1195- Asp Gly Ala Asp Arg Val Val Leu Ala Gly Pr - #o Asp Ala Pro Ala Gln# 12150- Ala Ala Ala Ala Gly Leu Thr Gly Val Ser Le - #u Val Pro Val Arg Cys# 12305- Asp Val Thr Asp Arg Ala Ala Leu Ala Ala Le - #u Leu Asp Glu His Ala# 12450- Pro Thr Val Ala Val His Ala Pro Pro Leu Va - #l Pro Leu Ala Pro Leu# 12605- Arg Glu Thr Ala Pro Gly Asp Ile Ala Ala Al - #a Leu Ala Ala Lys Thr# 12801270 - # 1275- Thr Ala Ala Gly His Leu Val Asp Leu Ala Pr - #o Ala Ala Gly Leu Asp# 12950- Ala Leu Val Leu Phe Ser Ser Val Ser Gly Va - #l Trp Gly Gly Ala Ala# 13105- Gln Gly Gly Tyr Ala Ala Ala Ser Ala His Le - #u Asp Ala Leu Ala Glu# 13250- Arg Ala Arg Ala Ala Gly Val Pro Ala Phe Se - #r Val Ala Trp Ser Pro# 13405- Trp Ala Gly Gly Thr Pro Ala Asp Gly Ala Gl - #u Ala Glu Phe Leu Ser# 13601350 - # 1355- Arg Arg Gly Leu Ala Pro Leu Asp Pro Asp Gl - #n Ala Val Arg Thr Leu# 13750- Arg Arg Met Leu Glu Arg Gly Ser Ala Cys Gl - #y Ala Val Ala Asp Val# 13905- Glu Trp Ser Arg Phe Ala Ala Ser Tyr Thr Tr - #p Val Arg Pro Ala Val# 14050- Leu Phe Asp Asp Ile Pro Asp Val Gln Arg Le - #u Arg Ala Ala Glu Leu# 14205- Ala Pro Ser Thr Gly Asp Ser Thr Thr Ser Gl - #u Leu Val Arg Glu Leu# 14401430 - # 1435- Thr Ala Gln Ser Gly His Lys Arg His Ala Th - #r Leu Leu Arg Leu Val# 14550- Arg Ala His Ala Ala Ala Val Leu Gly Gln Se - #r Ser Gly Asp Ala Val# 14705- Ser Ser Ala Arg Ala Phe Arg Asp Leu Gly Ph - #e Asp Ser Leu Thr Ala# 14850- Leu Glu Leu Arg Asp Arg Leu Ser Thr Ser Th - #r Gly Leu Lys Leu Pro# 15005- Thr Ser Leu Val Phe Asp His Ser Ser Pro Al - #a Ala Leu Ala Arg His# 15201510 - # 1515- Leu Gly Glu Glu Leu Leu Gly Arg Asn Asp Th - #r Ala Asp Arg Ala Gly# 15350- Pro Asp Thr Pro Val Arg Thr Asp Glu Pro Il - #e Ala Ile Ile Gly Met# 15505- Ala Cys Arg Leu Pro Gly Gly Val Gln Ser Pr - #o Glu Asp Leu Trp Asp# 15650- Leu Leu Thr Gly Gly Thr Asp Ala Ile Thr Pr - #o Phe Pro Thr Asn Arg# 15805- Gly Trp Asp Asn Glu Thr Leu Tyr Asp Pro As - #p Pro Asp Ser Pro Gly# 16001590 - # 1595- His His Thr Tyr Val Arg Glu Gly Gly Phe Le - #u His Asp Ala Ala Glu# 16150- Phe Asp Pro Gly Phe Phe Gly Ile Ser Pro Ar - #g Glu Ala Leu Ala Met# 16305- Asp Pro Gln Gln Arg Leu Ile Leu Glu Thr Se - #r Trp Glu Ser Phe Glu# 16450- Arg Ala Gly Ile Asp Pro Val Glu Leu Arg Gl - #y Ser Arg Thr Gly Val# 16605- Phe Val Gly Thr Asn Gly Gln His Tyr Val Pr - #o Leu Leu Gln Asp Gly# 16801670 - # 1675- Asp Glu Asn Phe Asp Gly Tyr Ile Ala Thr Gl - #y Asn Ser Ala Ser Val# 16950- Met Ser Gly Arg Leu Ser Tyr Val Phe Gly Le - #u Glu Gly Pro Ala Val# 17105- Thr Val Asp Thr Ala Cys Ser Ala Ser Leu Al - #a Ala Leu His Leu Ala# 17250- Val Gln Ser Leu Arg Arg Gly Glu Cys Asp Ty - #r Ala Leu Ala Gly Gly# 17405- Ala Thr Val Met Ser Thr Pro Glu Met Leu Va - #l Glu Phe Ala Arg Gln# 17601750 - # 1755- Arg Ala Val Ser Pro Asp Gly Arg Ser Lys Al - #a Phe Ala Glu Ala Ala# 17750- Asp Gly Val Gly Leu Ala Glu Gly Ala Gly Me - #t Leu Leu Val Glu Arg# 17905- Leu Ser Glu Ala Gln Lys Lys Gly His Pro Va - #l Leu Ala Val Val Arg# 18050- Gly Ser Ala Val Asn Gln Asp Gly Ala Ser As - #n Gly Leu Thr Ala Pro# 18205- Ser Gly Pro Ala Gln Gln Arg Val Ile Arg Gl - #u Ala Leu Ala Asp Ala# 18401830 - # 1835- Gly Leu Thr Pro Ala Asp Val Asp Ala Val Gl - #u Ala His Gly Thr Gly# 18550- Thr Pro Leu Gly Asp Pro Ile Glu Ala Gly Al - #a Leu Leu Ala Thr Tyr# 18705- Gly Arg Asp Arg Arg Asp Gly Pro Leu Trp Le - #u Gly Ser Leu Lys Ser# 18850- Asn Ile Gly His Thr Gln Ala Ala Ala Gly Va - #l Ala Gly Val Ile Lys# 19005- Met Val Leu Ala Leu Arg His Gly Glu Leu Pr - #o Arg Thr Leu His Ala# 19201910 - # 1915- Ser Thr Ala Ser Ser Arg Ile Asp Trp Asp Al - #a Gly Ala Val Glu Leu# 19350- Leu Asp Glu Ala Arg Pro Trp Leu Gln Arg Al - #a Glu Gly Pro Arg Arg# 19505- Ala Gly Ile Ser Ser Phe Gly Ile Ser Gly Th - #r Asn Ala His Leu Val# 19650- Ile Glu Glu Pro Pro Glu Pro Thr Ala Pro Gl - #u Leu Leu Ala Pro Glu# 19805- Pro Ala Ala Asp Gly Asp Val Trp Ser Glu Gl - #u Trp Trp His Glu Val# 20001990 - # 1995- Thr Val Pro Leu Met Met Ser Ala His Asn Gl - #u Ala Ala Leu Arg Asp# 20150- Gln Ala Arg Arg Leu Arg Ala Asp Leu Leu Al - #a His Pro Glu Leu His# 20305- Pro Ala Asp Val Gly Tyr Thr Leu Ile Thr Th - #r Arg Thr Arg Phe Glu# 20450- Gln Arg Ala Ala Val Val Gly Glu Asn Phe Th - #r Glu Leu Ile Ala Ala# 20605- Leu Asp Asp Leu Val Glu Gly Arg Pro His Pr - #o Leu Val Leu Arg Gly# 20802070 - # 2075- Thr Ala Gly Thr Ser Asp Gln Val Val Phe Va - #l Phe Pro Gly Gln Gly# 20950- Ser Gln Trp Pro Glu Met Ala Asp Gly Leu Le - #u Ala Arg Ser Ser Gly# 21105- Ser Gly Ser Phe Leu Glu Thr Ala Arg Ala Cy - #s Asp Leu Ala Leu Arg# 21250- Pro His Leu Gly Trp Ser Val Leu Asp Val Le - #u Arg Arg Glu Pro Gly# 21405- Ala Pro Ser Leu Asp Arg Val Asp Val Val Gl - #n Pro Val Leu Phe Thr# 21602150 - # 2155- Met Met Val Ser Leu Ala Glu Thr Trp Arg Se - #r Leu Gly Val Glu Pro# 21750- Ala Ala Val Val Gly His Ser Gln Gly Glu Il - #e Ala Ala Ala Tyr Val# 21905- Ala Gly Ala Leu Thr Leu Asp Asp Ala Ala Ar - #g Ile Val Ala Leu Arg# 22050- Ser Gln Ala Trp Leu Arg Leu Ala Gly Lys Gl - #y Gly Met Val Ala Val# 22205- Thr Leu Ser Glu Arg Asp Leu Arg Pro Arg Le - #u Glu Pro Trp Ser Asp# 22402230 - # 2235- Arg Leu Ala Val Ala Ala Val Asn Gly Pro Gl - #u Thr Cys Ala Val Ser# 22550- Gly Asp Pro Asp Ala Leu Ala Glu Leu Val Al - #a Glu Leu Gly Ala Glu# 22705- Gly Val His Ala Arg Pro Ile Pro Gly Val As - #p Thr Ala Gly His Ser# 22850- Pro Gln Val Asp Thr Leu Glu Ala His Leu Ar - #g Lys Val Leu Ala Pro# 23005- Val Ala Pro Arg Thr Ser Asp Ile Pro Phe Ty - #r Ser Thr Val Thr Gly# 23202310 - # 2315- Gly Leu Ile Asp Thr Ala Glu Leu Asp Ala As - #p Tyr Trp Tyr Arg Asn# 23350- Met Arg Glu Pro Val Glu Phe Glu Gln Ala Th - #r Arg Ala Leu Ile Ala# 23505- Asp Gly His Asp Val Phe Leu Glu Ser Ser Pr - #o His Pro Met Leu Ala# 23650- Val Ser Leu Gln Glu Thr Ile Ser Asp Ala Gl - #y Ser Pro Ala Ala Val# 23805- Leu Gly Thr Leu Arg Arg Gly Gln Gly Gly Pr - #o Arg Trp Leu Gly Val# 24002390 - # 2395- Ala Leu Cys Arg Ala Tyr Thr His Gly Leu Gl - #u Ile Asp Ala Glu Ala# 24150- Ile Phe Gly Pro Asp Ser Arg Gln Val Glu Le - #u Pro Thr Tyr Pro Phe# 24305- Gln Arg Glu Arg Tyr Trp Tyr Ser Pro Gly Hi - #s Arg Gly Asp Asp Pro# 24450- Ala Ser Leu Gly Leu Asp Ala Val Asp His Pr - #o Leu Leu Gly Ser Gly# 24605- Val Glu Leu Pro Glu Ser Gly Asp Arg Met Ty - #r Thr Ala Arg Leu Gly# 24802470 - # 2475- Ala Asp Thr Thr Pro Trp Leu Ala Asp His Al - #a Leu Leu Gly Ser Pro# 24950- Leu Leu Pro Gly Ala Ala Phe Ala Asp Leu Al - #a Leu Trp Ala Gly Arg# 25105- Gln Ala Gly Thr Gly Arg Val Glu Glu Leu Th - #r Leu Ala Ala Pro Leu# 25250- Val Leu Pro Gly Ser Gly Gly Val Arg Leu Ar - #g Leu Asn Val Gly Ala# 25405- Pro Gly Thr Asp Asp Ala Arg Arg Phe Ala Va - #l His Ala Arg Ala Glu# 25602550 - # 2555- Gly Ala Thr Asp Trp Thr Leu His Ala Glu Gl - #y Leu Leu Thr Ala Gln# 25750- Asp Thr Ala Asp Ala Pro Asp Ala Ser Ala Al - #a Thr Pro Pro Pro Gly# 25905- Ala Glu Gln Leu Asp Ile Gly Asp Phe Tyr Gl - #n Arg Phe Ser Glu Leu# 26050- Gly Tyr Gly Tyr Gly Pro Phe Phe Arg Gly Le - #u Val Ser Ala His Arg# 26205- Cys Gly Pro Asp Ile His Ala Glu Val Ala Le - #u Pro Val Gln Ala Gln# 26402630 - # 2635- Gly Asp Ala Ala Arg Phe Gly Ile His Pro Al - #a Leu Leu Asp Ala Ala# 26550- Leu Gln Thr Met Ser Leu Gly Gly Phe Phe Pr - #o Glu Asp Gly Arg Val# 26705- Arg Met Pro Phe Ala Leu Arg Gly Val Arg Le - #u Tyr Arg Ala Gly Ala# 26850- Asp Arg Leu His Val Arg Val Ser Pro Val Se - #r Glu Asp Ala Val Arg# 27005- Ile Arg Cys Ala Asp Gly Glu Gly Arg Pro Va - #l Ala Glu Ile Glu Ser# 27202710 - # 2715- Phe Ile Met Arg Pro Val Asp Pro Gly Gln Le - #u Leu Gly Gly Arg Pro# 27350- Val Gly Ala Asp Ala Leu Phe Arg Ile Ala Tr - #p Arg Glu Leu Ala Ala# 27505- Gly Pro Gly Thr Arg Thr Gly Asp Gly Thr Pr - #o Pro Pro Val Arg Trp# 27650- Val Leu Ala Gly Pro Asp Ala Leu Gly Leu Al - #a Glu Ala Ala Asp Ala# 27805- His Leu Pro Ala Val Pro Gly Pro Asp Gly Al - #a Leu Pro Ser Pro Thr# 28002790 - # 2795- Gly Arg Pro Ala Pro Asp Ala Val Val Phe Al - #a Val Arg Ala Gly Thr# 28150- Gly Asp Val Ala Ala Asp Ala His Thr Val Al - #a Cys Arg Val Leu Asp# 28305- Leu Val Gln Arg Arg Leu Ala Ala Pro Glu Gl - #y Pro Asp Gly Ala Arg# 28450- Leu Val Val Ala Thr Arg Gly Ala Val Ala Va - #l Arg Asp Asp Ala Glu# 28605- Val Asp Asp Pro Ala Ala Ala Ala Ala Trp Gl - #y Leu Leu Arg Ser Ala# 28802870 - # 2875- Gln Ala Glu Glu Pro Gly Arg Phe Leu Leu Va - #l Asp Leu Asp Asp Asp# 28950- Pro Ala Ser Ala Arg Ala Leu Thr Asp Ala Le - #u Ala Ser Gly Glu Pro# 29105- Gln Thr Ala Val Arg Ala Gly Thr Val Tyr Va - #l Pro Arg Leu Glu Arg# 29250- Ala Ala Asp Arg Thr Asp Gly Pro Leu Thr Pr - #o Pro Asp Asp Gly Ala# 29405- Trp Arg Leu Gly Arg Gly Thr Asp Leu Thr Le - #u Asp Gly Leu Ala Leu# 29602950 - # 2955- Val Pro Ala Pro Asp Ala Glu Ala Pro Leu Gl - #u Pro Gly Gln Val Arg# 29750- Val Ala Val Arg Ala Ala Gly Val Asn Phe Ar - #g Asp Ala Leu Ile Ala# 29905- Leu Gly Met Tyr Pro Gly Glu Ala Glu Met Gl - #y Thr Glu Gly Ala Gly# 30050- Thr Val Val Glu Val Gly Pro Gly Val Thr Gl - #y Val Ala Val Gly Asp# 30205- Arg Val Leu Gly Leu Trp Asp Gly Gly Leu Gl - #y Pro Leu Cys Val Ala# 30403030 - # 3035- Asp His Arg Leu Leu Ala Pro Val Pro Asp Gl - #y Trp Ser Tyr Ala Gln# 30550- Ala Ala Ser Val Pro Ala Val Phe Leu Ser Al - #a Tyr Tyr Gly Leu Val# 30705- Thr Leu Ala Gly Leu Arg Pro Gly Glu Arg Va - #l Leu Val His Ala Ala# 30850- Ala Gly Gly Val Gly Met Ala Ala Val Gln Il - #e Ala Arg His Leu Gly# 31005- Ala Glu Val Leu Ala Thr Ala Ser Pro Gly Ly - #s Trp Asp Ala Leu Arg# 31203110 - # 3115- Ala Met Gly Ile Thr Asp Asp His Leu Ala Se - #r Ser Arg Thr Leu Asp# 31350- Phe Ala Thr Ala Phe Thr Gly Ala Asp Gly Th - #r Ser Arg Ala Asp Val# 31505- Val Leu Asn Ser Leu Thr Lys Glu Phe Val As - #p Ala Ser Leu Gly Leu# 31650- Leu Arg Pro Gly Gly Arg Phe Leu Glu Leu Gl - #y Lys Thr Asp Val Arg# 31805- Asp Pro Glu Arg Ile Ala Ala Glu His Pro Gl - #y Val Arg Tyr Arg Ala# 32003190 - # 3195- Phe Asp Leu Asn Glu Ala Gly Pro Asp Ala Le - #u Gly Arg Leu Leu Arg# 32150- Glu Leu Met Asp Leu Phe Ala Ala Gly Val Le - #u His Pro Leu Pro Val# 32305- Val Thr His Asp Val Arg Arg Ala Ala Asp Al - #a Leu Arg Thr Ile Ser# 32450- Gln Ala Arg His Thr Gly Lys Leu Val Leu Th - #r Met Pro Pro Ala Trp# 32605- His Pro Tyr Gly Thr Val Leu Val Thr Gly Gl - #y Thr Gly Ala Leu Gly# 32803270 - # 3275- Ser Arg Ile Ala Arg His Leu Ala Ser Arg Hi - #s Gly Val Arg Arg Leu# 32950- Leu Ile Ala Ala Arg Arg Gly Pro Asp Gly Gl - #u Gly Ala Ala Glu Leu# 33105- Val Ala Asp Leu Ala Ala Leu Gly Ala Ser Al - #a Thr Val Val Ala Cys# 33250- Asp Val Ser Asp Ala Asp Ala Val Arg Gly Le - #u Leu Ala Gly Ile Pro# 33405- Ala Asp His Pro Leu Thr Ala Val Val His Se - #r Thr Gly Val Leu Asp# 33603350 - # 3355- Asp Gly Val Leu Pro Gly Leu Thr Pro Glu Ar - #g Met Arg Arg Val Leu# 33750- Arg Pro Lys Val Glu Ala Ala Val His Leu As - #p Glu Leu Thr Arg Asp# 33905- Leu Asp Leu Ser Ala Phe Val Leu Phe Ser Se - #r Ser Ala Gly Leu Leu# 34050- Gly Ser Pro Ala Gln Gly Asn Tyr Ala Ala Al - #a Asn Ala Thr Leu Asp# 34205- Ala Leu Ala Ala Arg Arg Arg Ser Leu Gly Le - #u Pro Ser Val Ser Leu# 34403430 - # 3435- Ala Trp Gly Leu Trp Ser Asp Thr Ser Arg Me - #t Ala His Ala Leu Asp# 34550- Gln Glu Ser Leu Gln Arg Arg Phe Ala Arg Se - #r Gly Phe Pro Pro Leu# 34705- Ser Ala Thr Leu Gly Ala Ala Leu Phe Asp Al - #a Ala Leu Arg Val Asp# 34850- Glu Ala Val Gln Val Pro Met Arg Phe Asp Pr - #o Ala Ala Leu Arg Ala# 35005- Thr Gly Ser Val Pro Ala Leu Leu Ser Asp Le - #u Val Gly Ser Ala Pro# 35203510 - # 3515- Ala Thr Gly Ser Ala Ala Pro Ala Ser Gly Pr - #o Leu Pro Ala Pro Asp# 35350- Ala Gly Thr Val Gly Glu Pro Leu Ala Glu Ar - #g Leu Ala Gly Leu Ser# 35505- Ala Glu Glu Arg His Asp Arg Leu Leu Gly Le - #u Val Gly Glu His Val# 35650- Ala Ala Val Leu Gly His Gly Ser Ala Ala Gl - #u Val Arg Pro Asp Arg# 35805- Pro Phe Arg Glu Val Gly Phe Asp Ser Leu Th - #r Ala Val Glu Leu Arg# 36003590 - # 3595- Asn Arg Met Ala Ala Val Thr Gly Val Arg Le - #u Pro Ala Thr Leu Val# 36150- Phe Asp His Pro Thr Pro Ala Ala Leu Ser Se - #r His Leu Asp Gly Leu# 36305- Leu Ala Pro Ala Gln Pro Val Thr Thr Thr Pr - #o Leu Leu Ser Glu Leu# 36450- Asp Arg Ile Glu Glu Ala Leu Ala Ala Leu Th - #r Pro Glu His Leu Ala# 36605- Glu Leu Ala Pro Ala Pro Asp Asp Arg Ala Gl - #u Val Ala Leu Arg Leu# 36803670 - # 3675- Asp Ala Leu Ala Asp Arg Trp Arg Ala Leu Hi - #s Asp Gly Ala Pro Gly# 36950- Ala Asp Asp Asp Ile Thr Asp Val Leu Ser Se - #r Ala Asp Asp Asp Glu# 37105- Ile Phe Ala Phe Ile Asp Glu Arg Tyr Gly Th - #r Ser# 3720- (2) INFORMATION FOR SEQ ID NO:5:- (i) SEQUENCE CHARACTERISTICS:#acids (A) LENGTH: 1580 amino (B) TYPE: amino acid (D) TOPOLOGY: unknown- (ii) MOLECULE TYPE: peptide- (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:- Met Ala Asn Glu Glu Lys Leu Arg Ala Tyr Le - #u Lys Arg Val Thr Gly# 15- Glu Leu His Arg Ala Thr Glu Gln Leu Arg Al - #a Leu Asp Arg Arg Ala# 30- His Glu Pro Ile Ala Ile Val Gly Ala Ala Cy - #s Arg Leu Pro Gly Gly# 45- Val Glu Ser Pro Asp Asp Leu Trp Glu Leu Le - #u His Ala Gly Ala Asp# 60- Ala Val Gly Pro Ala Pro Ala Asp Arg Gly Tr - #p Asp Val Glu Gly Arg#80- Tyr Ser Pro Asp Pro Asp Thr Pro Gly Thr Se - #r Tyr Cys Arg Glu Gly# 95- Gly Phe Val Gln Gly Ala Asp Arg Phe Asp Pr - #o Ala Leu Phe Gly Ile# 110- Ser Pro Asn Glu Ala Leu Thr Met Asp Pro Gl - #n Gln Arg Leu Leu Leu# 125- Glu Thr Ser Trp Glu Ala Leu Glu Arg Ala Gl - #y Leu Asp Pro Gln Ser# 140- Leu Ala Gly Ser Arg Thr Gly Val Phe Ala Gl - #y Ala Trp Glu Ser Gly145 1 - #50 1 - #55 1 -#60- Tyr Gln Lys Gly Val Glu Gly Leu Glu Ala As - #p Leu Glu Ala Gln Leu# 175- Leu Ala Gly Ile Val Ser Phe Thr Ala Gly Ar - #g Val Ala Tyr Ala Leu# 190- Gly Leu Glu Gly Pro Ala Leu Thr Ile Asp Th - #r Ala Cys Ser Ser Ser# 205- Leu Val Ala Leu His Leu Ala Val Gln Ser Le - #u Arg Arg Gly Glu Cys# 220- Asp Leu Ala Leu Ala Gly Gly Ala Thr Val Il - #e Ala Asp Phe Ala Leu225 2 - #30 2 - #35 2 -#40- Phe Thr Gln Phe Ser Arg Gln Arg Gly Leu Al - #a Pro Asp Gly Arg Cys# 255- Lys Ala Phe Gly Glu Thr Ala Asp Gly Phe Gl - #y Pro Ala Glu Gly Ala# 270- Gly Met Leu Leu Val Glu Arg Leu Ser Asp Al - #a Arg Arg Asn Gly His# 285- Pro Val Leu Ala Val Val Arg Gly Ser Ala Va - #l Asn Gln Asp Gly Ala# 300- Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Al - #a Gln Gln Arg Val Ile305 3 - #10 3 - #15 3 -#20- Arg Glu Ala Leu Ala Asp Ala Gly Leu Thr Pr - #o Ala Asp Val Asp Ala# 335- Val Glu Ala His Gly Thr Gly Thr Pro Leu Gl - #y Asp Pro Ile Glu Ala# 350- Gly Ala Leu Met Ala Thr Tyr Gly His Glu Ar - #g Thr Gly Asp Pro Leu# 365- Trp Leu Gly Ser Leu Lys Ser Asn Ile Gly Hi - #s Thr Gln Ala Ala Ala# 380- Gly Val Ala Gly Val Ile Lys Met Val Leu Al - #a Leu Arg His Gly Glu385 3 - #90 3 - #95 4 -#00- Leu Pro Arg Thr Leu His Ala Ser Thr Ala Se - #r Ser Arg Ile Glu Trp# 415- Asp Ala Gly Ala Val Glu Leu Leu Asp Glu Al - #a Arg Pro Trp Pro Arg# 430- Arg Ala Glu Gly Pro Arg Arg Ala Gly Ile Se - #r Ser Phe Gly Ile Ser# 445- Gly Thr Asn Ala His Leu Val Ile Glu Glu Gl - #u Pro Pro Ala Arg Pro# 460- Glu Pro Glu Glu Ala Ala Gln Pro Pro Ala Pr - #o Ala Thr Thr Val Leu465 4 - #70 4 - #75 4 -#80- Pro Leu Ser Ala Ala Gly Ala Arg Ser Leu Ar - #g Glu Gln Ala Arg Arg# 495- Leu Ala Ala His Leu Ala Gly His Glu Glu Il - #e Thr Ala Ala Asp Ala# 510- Ala Arg Ser Ala Ala Thr Thr Arg Ala Ala Le - #u Ser His Arg Ala Ser# 525- Val Leu Ala Asp Asp Arg Arg Ala Leu Ile As - #p Arg Leu Thr Ala Leu# 540- Ala Glu Asp Arg Lys Asp Pro Gly Val Thr Va - #l Gly Glu Ala Gly Ser545 5 - #50 5 - #55 5 -#60- Gly Arg Pro Pro Val Phe Val Phe Pro Gly Gl - #n Gly Ser Gln Trp Thr# 575- Gly Met Gly Ala Glu Leu Leu Asp Arg Ala Pr - #o Val Phe Arg Ala Lys# 590- Ala Glu Glu Cys Ala Arg Ala Leu Ala Ala Hi - #s Leu Asp Trp Ser Val# 605- Leu Asp Val Leu Arg Asp Ala Pro Gly Ala Pr - #o Pro Ile Asp Arg Ala# 620- Asp Val Val Gln Pro Thr Leu Phe Thr Met Me - #t Val Ser Leu Ala Ala625 6 - #30 6 - #35 6 -#40- Leu Trp Glu Ser His Gly Val Arg Pro Ala Al - #a Val Val Gly His Ser# 655- Gln Gly Glu Ile Ala Ala Ala His Ala Ala Gl - #y Ala Leu Ser Leu Asp# 670- Asp Ala Ala Arg Val Ile Ala Glu Arg Ser Ar - #g Leu Trp Lys Arg Leu# 685- Ala Gly Asn Gly Gly Met Leu Ser Val Met Al - #a Pro Ala Asp Arg Val# 700- Arg Glu Leu Met Glu Pro Trp Ala Glu Arg Me - #t Ser Val Ala Ala Val705 7 - #10 7 - #15 7 -#20- Asn Gly Pro Ala Ser Val Thr Val Ala Gly As - #p Ala Arg Ala Leu Glu# 735- Glu Phe Gly Gly Arg Leu Ser Ala Ala Gly Va - #l Leu Arg Trp Pro Leu# 750- Ala Gly Val Asp Phe Ala Gly His Ser Pro Gl - #n Val Glu Gln Phe Arg# 765- Ala Glu Leu Leu Asp Thr Leu Gly Thr Val Ar - #g Pro Thr Ala Ala Arg# 780- Leu Pro Phe Phe Ser Thr Val Thr Ala Ala Al - #a His Glu Pro Glu Gly785 7 - #90 7 - #95 8 -#00- Leu Asp Ala Ala Tyr Trp Tyr Arg Asn Met Ar - #g Glu Pro Val Glu Phe# 815- Ala Ser Thr Leu Arg Thr Leu Leu Arg Glu Gl - #y His Arg Thr Phe Val# 830- Glu Met Gly Pro His Pro Leu Leu Gly Ala Al - #a Ile Asp Glu Val Ala# 845- Glu Ala Glu Gly Val His Ala Thr Ala Leu Al - #a Thr Leu His Arg Gly# 860- Ser Gly Gly Leu Asp Arg Phe Arg Ser Ser Va - #l Gly Ala Ala Phe Ala865 8 - #70 8 - #75 8 -#80- His Gly Val Arg Val Asp Trp Asp Ala Leu Ph - #e Glu Gly Ser Gly Ala# 895- Arg Arg Val Pro Leu Pro Thr Tyr Ala Phe Se - #r Arg Asp Arg Tyr Trp# 910- Leu Pro Thr Ala Ile Gly Arg Arg Ala Val Gl - #u Ala Ala Pro Val Asp# 925- Ala Ser Ala Pro Gly Arg Tyr Arg Val Thr Tr - #p Thr Pro Val Ala Ser# 940- Asp Asp Ser Gly Arg Pro Ser Gly Arg Trp Le - #u Leu Val Gln Thr Pro945 9 - #50 9 - #55 9 -#60- Gly Thr Ala Pro Asp Glu Ala Asp Thr Ala Al - #a Ser Ala Leu Gly Ala# 975- Ala Gly Val Val Val Glu Arg Cys Leu Leu As - #p Pro Thr Glu Ala Ala# 990- Arg Val Thr Leu Thr Glu Arg Leu Ala Glu Le - #u Asp Ala Gln Pro Glu# 10050- Gly Leu Ala Gly Val Leu Val Leu Pro Gly Ar - #g Pro Gln Ser Thr Ala# 10205- Pro Ala Asp Ala Ser Pro Leu Asp Pro Gly Th - #r Ala Ala Val Leu Leu# 10401030 - # 1035- Val Val Gln Ala Val Pro Asp Ala Ala Pro Ly - #s Ala Arg Ile Trp Val# 10550- Val Thr Arg Gly Ala Val Ala Val Gly Ser Gl - #y Glu Val Pro Cys Ala# 10705- Val Gly Ala Arg Val Trp Gly Leu Gly Arg Va - #l Ala Ala Leu Glu Val# 10850- Pro Val Gln Trp Gly Gly Leu Val Asp Val Al - #a Val Gly Ala Gly Val# 11005- Arg Glu Trp Arg Arg Val Val Gly Val Val Al - #a Gly Gly Gly Glu Asp# 11201110 - # 1115- Gln Val Ala Val Arg Gly Gly Gly Val Phe Gl - #y Arg Arg Leu Val Gly# 11350- Val Gly Val Arg Gly Gly Ser Gly Val Trp Ar - #g Ala Arg Gly Cys Val# 11505- Val Val Thr Gly Gly Leu Gly Gly Val Gly Gl - #y His Val Ala Arg Trp# 11650- Leu Ala Arg Ser Gly Ala Glu His Val Val Le - #u Ala Gly Arg Arg Gly# 11805- Gly Gly Val Val Gly Ala Val Glu Leu Glu Ar - #g Glu Leu Val Gly Leu# 12001190 - # 1195- Gly Ala Lys Val Thr Phe Val Ser Cys Asp Va - #l Gly Asp Arg Ala Ser# 12150- Met Val Gly Leu Leu Gly Val Val Glu Gly Le - #u Gly Val Pro Leu Arg# 12305- Gly Val Phe His Ala Ala Gly Val Ala Gln Va - #l Ser Gly Leu Gly Glu# 12450- Val Ser Leu Ala Glu Ala Gly Gly Val Leu Gl - #y Gly Lys Ala Val Gly# 12605- Ala Glu Leu Leu Asp Glu Leu Thr Ala Gly Va - #l Glu Leu Asp Ala Phe# 12801270 - # 1275- Val Leu Phe Ser Ser Gly Ala Gly Val Trp Gl - #y Ser Gly Gly Gln Ser# 12950- Val Tyr Ala Ala Ala Asn Ala His Leu Asp Al - #a Leu Ala Glu Arg Arg# 13105- Arg Ala Gln Gly Arg Pro Ala Thr Ser Val Al - #a Trp Gly Leu Trp Gly# 13250- Gly Glu Gly Met Gly Ala Asp Glu Gly Val Th - #r Glu Phe Tyr Ala Glu# 13405- Arg Gly Leu Ala Pro Met Arg Pro Glu Ser Gl - #y Ile Glu Ala Leu His# 13601350 - # 1355- Thr Ala Leu Asn Glu Gly Asp Thr Cys Val Th - #r Val Ala Asp Ile Asp# 13750- Trp Glu His Phe Val Thr Gly Phe Thr Ala Ty - #r Arg Pro Ser Pro Leu# 13905- Ile Ser Asp Ile Pro Gln Val Arg Ala Leu Ar - #g Thr Pro Glu Pro Thr# 14050- Val Asp Ala Ser Asp Gly Leu Arg Arg Arg Va - #l Asp Ala Ala Leu Thr# 14205- Pro Arg Glu Arg Thr Lys Val Leu Val Asp Le - #u Val Arg Thr Val Ala# 14401430 - # 1435- Ala Glu Val Leu Gly His Asp Gly Ile Gly Gl - #y Ile Gly His Asp Val# 14550- Ala Phe Arg Asp Leu Gly Phe Asp Ser Leu Al - #a Ala Val Arg Met Arg# 14705- Gly Arg Leu Ala Glu Ala Thr Gly Leu Val Le - #u Pro Ala Thr Val Ile# 14850- Phe Asp His Pro Thr Val Asp Arg Leu Gly Gl - #y Ala Leu Leu Glu Arg# 15005- Leu Ser Ala Asp Glu Pro Ala Pro Gly Gly Al - #a Pro Glu Pro Ala Gly# 15201510 - # 1515- Gly Arg Pro Ala Thr Pro Pro Pro Ala Pro Gl - #u Pro Ala Val His Asp# 15350- Ala Asp Ile Asp Glu Leu Asp Ala Asp Ala Le - #u Ile Arg Leu Ala Thr# 15505- Gly Thr Ala Gly Pro Ala Asp Gly Thr Pro Al - #a Asp Gly Gly Pro Asp# 15650- Ala Ala Ala Thr Ala Pro Asp Gly Ala Pro Gl - #u Gln# 15805- (2) INFORMATION FOR SEQ ID NO:6:- (i) SEQUENCE CHARACTERISTICS:#acids (A) LENGTH: 1891 amino (B) TYPE: amino acid (D) TOPOLOGY: unknown- (ii) MOLECULE TYPE: peptide- (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:- Met Ser Pro Ser Met Asp Glu Val Leu Gly Al - #a Leu Arg Thr Ser Val# 15- Lys Glu Thr Glu Arg Leu Arg Arg His Asn Ar - #g Glu Leu Leu Ala Gly# 30- Ala His Glu Pro Val Ala Ile Val Gly Met Al - #a Cys Arg Tyr Pro Gly# 45- Gly Val Ser Thr Pro Asp Asp Leu Trp Glu Le - #u Ala Ala Asp Gly Val# 60- Asp Ala Ile Thr Pro Phe Pro Ala Asp Arg Gl - #y Trp Asp Glu Asp Ala#80- Val Tyr Ser Pro Asp Pro Asp Thr Pro Gly Th - #r Thr Tyr Cys Arg Glu# 95- Gly Gly Phe Leu Thr Gly Ala Gly Asp Phe As - #p Ala Ala Phe Phe Gly# 110- Ile Ser Pro Asn Glu Ala Leu Val Met Asp Pr - #o Gln Gln Arg Leu Leu# 125- Leu Glu Thr Ser Trp Glu Thr Leu Glu Arg Al - #a Gly Ile Val Pro Ala# 140- Ser Leu Arg Gly Ser Arg Thr Gly Val Phe Va - #l Gly Ala Ala His Thr145 1 - #50 1 - #55 1 -#60- Gly Tyr Val Thr Asp Thr Ala Arg Ala Pro Gl - #u Gly Thr Glu Gly Tyr# 175- Leu Leu Thr Gly Asn Ala Asp Ala Val Met Se - #r Gly Arg Ile Ala Tyr# 190- Ser Leu Gly Leu Glu Gly Pro Ala Leu Thr Il - #e Gly Thr Ala Cys Ser# 205- Ser Ser Leu Val Ala Leu His Leu Ala Val Gl - #n Ser Leu Arg Arg Gly# 220- Glu Cys Asp Leu Ala Leu Ala Gly Gly Val Al - #a Val Met Pro Asp Pro225 2 - #30 2 - #35 2 -#40- Thr Val Phe Val Glu Phe Ser Arg Gln Arg Gl - #y Leu Ala Val Asp Gly# 255- Arg Cys Lys Ala Phe Ala Glu Gly Ala Asp Gl - #y Thr Ala Trp Ala Glu# 270- Gly Val Gly Val Leu Leu Val Glu Arg Leu Se - #r Asp Ala Arg Arg Asn# 285- Gly His Arg Val Leu Ala Val Val Arg Gly Se - #r Ala Val Asn Gln Asp# 300- Gly Ala Ser Asn Gly Leu Thr Ala Pro Ser Gl - #y Pro Ala Gln Gln Arg305 3 - #10 3 - #15 3 -#20- Val Ile Arg Glu Ala Leu Ala Asp Ala Gly Le - #u Thr Pro Ala Asp Val# 335- Asp Val Val Glu Ala His Gly Thr Gly Thr Al - #a Leu Gly Asp Pro Ile# 350- Glu Ala Gly Ala Leu Leu Ala Thr Tyr Gly Ar - #g Glu Arg Val Gly Asp# 365- Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn Il - #e Gly His Ala Gln Ala# 380- Ala Ala Gly Val Gly Gly Val Ile Lys Val Va - #l Gln Ala Met Arg His385 3 - #90 3 - #95 4 -#00- Gly Ser Leu Pro Arg Thr Leu His Val Asp Al - #a Pro Ser Ser Lys Val# 415- Glu Trp Ala Ser Gly Ala Val Glu Leu Leu Th - #r Glu Gly Arg Ser Trp# 430- Pro Arg Arg Val Glu Arg Val Arg Arg Ala Al - #a Val Ser Ala Phe Gly# 445- Val Ser Gly Thr Asn Ala His Val Val Leu Gl - #u Glu Ala Pro Val Glu# 460- Ala Gly Ser Glu His Gly Asp Gly Pro Gly Pr - #o Asp Arg Pro Asp Ala465 4 - #70 4 - #75 4 -#80- Val Thr Gly Pro Leu Pro Trp Val Leu Ser Al - #a Arg Ser Arg Glu Ala# 495- Leu Arg Gly Gln Ala Gly Arg Leu Ala Ala Le - #u Ala Arg Gln Gly Arg# 510- Thr Glu Gly Thr Gly Gly Gly Ser Gly Leu Va - #l Val Pro Ala Ala Asp# 525- Ile Gly Tyr Ser Leu Ala Thr Thr Arg Glu Th - #r Leu Glu His Arg Ala# 540- Val Ala Leu Val Gln Glu Asn Arg Thr Ala Gl - #y Glu Asp Leu Ala Ala545 5 - #50 5 - #55 5 -#60- Leu Ala Ala Gly Arg Thr Pro Glu Ser Val Va - #l Thr Gly Val Ala Arg# 575- Arg Gly Arg Gly Ile Ala Phe Leu Cys Ser Gl - #y Gln Gly Ala Gln Arg# 590- Leu Gly Ala Gly Arg Glu Leu Arg Gly Arg Ph - #e Pro Val Phe Ala Asp# 605- Ala Leu Asp Glu Ile Ala Ala Glu Phe Asp Al - #a His Leu Glu Arg Pro# 620- Leu Leu Ser Val Met Phe Ala Glu Pro Ala Th - #r Pro Asp Ala Ala Leu625 6 - #30 6 - #35 6 -#40- Leu Asp Arg Thr Asp Tyr Thr Gln Pro Ala Le - #u Phe Ala Val Glu Thr# 655- Ala Leu Phe Arg Leu Leu Glu Ser Trp Gly Le - #u Val Pro Asp Val Leu# 670- Val Gly His Ser Ile Gly Gly Leu Val Ala Al - #a His Val Ala Gly Val# 685- Phe Ser Ala Ala Asp Ala Ala Arg Leu Val Se - #r Ala Arg Gly Arg Leu# 700- Met Arg Ala Leu Pro Glu Gly Gly Ala Met Al - #a Ala Val Gln Ala Thr705 7 - #10 7 - #15 7 -#20- Glu Arg Glu Ala Ala Ala Leu Glu Pro Val Al - #a Ala Gly Gly Ala Val# 735- Val Ala Ala Val Asn Gly Pro Gln Ala Leu Va - #l Leu Ser Gly Asp Glu# 750- Ala Ala Val Leu Ala Ala Ala Gly Glu Leu Al - #a Ala Arg Gly Arg Arg# 765- Thr Lys Arg Leu Arg Val Ser His Ala Phe Hi - #s Ser Pro Arg Met Asp# 780- Ala Met Leu Ala Asp Phe Arg Ala Val Ala As - #p Thr Val Asp Tyr His785 7 - #90 7 - #95 8 -#00- Ala Pro Arg Leu Pro Val Val Ser Glu Val Th - #r Gly Asp Leu Ala Asp# 815- Ala Ala Gln Leu Thr Asp Pro Gly Tyr Trp Th - #r Arg Gln Val Arg Gln# 830- Pro Val Arg Phe Ala Asp Ala Val Arg Thr Al - #a Ser Ala Arg Asp Ala# 845- Ala Thr Phe Ile Glu Leu Gly Pro Asp Ala Va - #l Leu Cys Gly Met Ala# 860- Glu Glu Ser Leu Ala Ala Glu Ala Asp Val Va - #l Phe Ala Pro Ala Leu865 8 - #70 8 - #75 8 -#80- Arg Arg Gly Arg Pro Glu Gly Asp Thr Val Le - #u Arg Ala Ala Ala Ser# 895- Ala Tyr Val Arg Gly Ala Gly Leu Asp Trp Al - #a Ala Leu Tyr Gly Gly# 910- Thr Gly Ala Arg Arg Thr Asp Leu Pro Thr Ty - #r Ala Phe Gln His Ser# 925- Arg Tyr Trp Leu Ala Pro Ala Ser Ala Ala Va - #l Ala Pro Ala Thr Ala# 940- Ala Pro Ser Val Arg Ser Val Pro Glu Ala Gl - #u Gln Asp Gly Ala Leu945 9 - #50 9 - #55 9 -#60- Trp Ala Ala Val His Ala Gly Asp Val Ala Se - #r Ala Ala Ala Arg Leu# 975- Gly Ala Asp Asp Ala Gly Ile Glu His Glu Le - #u Arg Ala Val Leu Pro# 990- His Leu Ala Ala Trp His Asp Arg Asp Arg Al - #a Thr Ala Arg Thr Ala# 10050- Gly Leu His Tyr Arg Val Thr Trp Gln Ala Il - #e Glu Ala Asp Ala Val# 10205- Arg Phe Ser Pro Ser Asp Arg Trp Leu Met Va - #l Glu His Gly Gln His# 10401030 - # 1035- Thr Glu Cys Ala Asp Ala Ala Glu Arg Ala Le - #u Arg Ala Ala Gly Ala# 10550- Glu Val Thr Arg Leu Val Trp Pro Leu Glu Gl - #n His Thr Gly Ser Pro# 10705- Arg Thr Glu Thr Pro Asp Arg Gly Thr Leu Al - #a Ala Arg Leu Ala Glu# 10850- Leu Ala Arg Ser Pro Glu Gly Leu Ala Gly Va - #l Leu Leu Leu Pro Asp# 11005- Ser Gly Gly Ala Ala Val Ala Gly His Pro Gl - #y Leu Asp Gln Gly Thr# 11201110 - # 1115- Ala Ala Val Leu Leu Thr Ile Gln Ala Leu Th - #r Asp Ala Ala Val Arg# 11350- Ala Pro Leu Trp Val Val Thr Arg Gly Ala Va - #l Ala Val Gly Ser Gly# 11505- Glu Val Pro Cys Ala Val Gly Ala Arg Val Tr - #p Gly Leu Gly Arg Val# 11650- Ala Ala Leu Glu Val Pro Val Gln Trp Gly Gl - #y Leu Val Asp Val Ala# 11805- Val Gly Ala Gly Val Arg Glu Trp Arg Arg Va - #l Val Gly Val Val Ala# 12001190 - # 1195- Gly Gly Gly Glu Asp Gln Val Ala Val Arg Gl - #y Gly Gly Val Phe Gly# 12150- Arg Arg Leu Val Gly Val Gly Val Arg Gly Gl - #y Ser Gly Val Trp Arg# 12305- Ala Arg Gly Cys Val Val Val Thr Gly Gly Le - #u Gly Gly Val Gly Gly# 12450- His Val Ala Arg Trp Leu Ala Arg Ser Gly Al - #a Glu His Val Val Leu# 12605- Ala Gly Arg Arg Gly Gly Gly Val Val Gly Al - #a Val Glu Leu Glu Arg# 12801270 - # 1275- Glu Leu Val Gly Leu Gly Ala Lys Val Thr Ph - #e Val Ser Cys Asp Val# 12950- Gly Asp Arg Ala Ser Val Val Gly Leu Leu Gl - #y Val Val Glu Gly Leu# 13105- Gly Val Pro Leu Arg Gly Val Phe His Ala Al - #a Gly Val Ala Gln Val# 13250- Ser Gly Leu Gly Glu Val Ser Leu Ala Glu Al - #a Gly Gly Val Leu Gly# 13405- Gly Lys Ala Val Gly Ala Glu Leu Leu Asp Gl - #u Leu Thr Ala Gly Val# 13601350 - # 1355- Glu Leu Asp Ala Phe Val Leu Phe Ser Ser Gl - #y Ala Gly Val Trp Gly# 13750- Ser Gly Gly Gln Ser Val Tyr Ala Ala Ala As - #n Ala His Leu Asp Ala# 13905- Leu Ala Glu Arg Arg Arg Ala Gln Gly Arg Pr - #o Ala Thr Ser Val Ala# 14050- Trp Gly Pro Trp Asp Gly Asp Gly Met Gly Gl - #u Met Ala Pro Glu Gly# 14205- Tyr Phe Ala Arg His Gly Val Ala Pro Leu Hi - #s Pro Glu Thr Ala Leu# 14401430 - # 1435- Thr Ala Leu His Gln Ala Ile Asp Gly Gly Gl - #u Ala Thr Val Thr Val# 14550- Ala Asp Ile Asp Trp Glu Arg Phe Ala Pro Gl - #y Phe Thr Ala Phe Arg# 14705- Pro Ser Pro Leu Ile Ala Gly Ile Pro Ala Al - #a Arg Thr Ala Pro Ala# 14850- Ala Gly Arg Pro Ala Glu Asp Thr Pro Thr Al - #a Pro Gly Leu Leu Arg# 15005- Ala Arg Pro Glu Asp Arg Pro Arg Leu Ala Le - #u Asp Leu Val Leu Arg# 15201510 - # 1515- His Val Ala Ala Val Leu Gly His Ser Glu As - #p Ala Arg Val Asp Ala# 15350- Arg Ala Pro Phe Arg Asp Leu Gly Phe Asp Se - #r Leu Ala Ala Val Arg# 15505- Leu Arg Arg Arg Leu Ala Glu Asp Thr Gly Le - #u Asp Leu Pro Gly Thr# 15650- Leu Val Phe Asp His Glu Asp Pro Thr Ala Le - #u Ala His His Leu Ala# 15805- Gly Leu Ala Asp Ala Gly Thr Pro Gly Pro Gl - #n Glu Gly Thr Ala Arg# 16001590 - # 1595- Ala Glu Ser Gly Leu Phe Ala Ser Phe Arg Al - #a Ala Val Glu Gln Arg# 16150- Arg Ser Ser Glu Val Val Glu Leu Met Ala As - #p Leu Ala Ala Phe Arg# 16305- Pro Ala Tyr Ser Arg Gln His Pro Gly Ser Gl - #y Arg Pro Ala Pro Val# 16450- Pro Leu Ala Thr Gly Pro Ala Thr Arg Pro Th - #r Leu Tyr Cys Cys Ala# 16605- Gly Thr Ala Val Gly Ser Gly Pro Ala Glu Ty - #r Val Pro Phe Ala Glu# 16801670 - # 1675- Gly Leu Arg Gly Val Arg Glu Thr Val Ala Le - #u Pro Leu Ser Gly Phe# 16950- Gly Asp Pro Ala Glu Pro Met Pro Ala Ser Le - #u Asp Ala Leu Ile Glu# 17105- Val Gln Ala Asp Val Leu Leu Glu His Thr Al - #a Gly Lys Pro Phe Ala# 17250- Leu Ala Gly His Ser Ala Gly Ala Asn Ile Al - #a His Ala Leu Ala Ala# 17405- Arg Leu Glu Glu Arg Gly Ser Gly Pro Ala Al - #a Val Val Leu Met Asp# 17601750 - # 1755- Val Tyr Arg Pro Glu Asp Pro Gly Ala Met Gl - #y Glu Trp Arg Asp Asp# 17750- Leu Leu Ser Trp Ala Leu Glu Arg Ser Thr Va - #l Pro Leu Glu Asp His# 17905- Arg Leu Thr Ala Met Ala Gly Tyr Gln Arg Le - #u Val Leu Gly Thr Arg# 18050- Leu Thr Ala Leu Glu Ala Pro Val Leu Leu Al - #a Arg Ala Ser Glu Pro# 18205- Leu Cys Ala Trp Pro Pro Ala Gly Gly Ala Ar - #g Gly Asp Trp Arg Ser# 18401830 - # 1835- Gln Val Pro Phe Ala Arg Thr Val Ala Asp Va - #l Pro Gly Asn His Phe# 18550- Thr Met Leu Thr Glu His Ala Arg His Thr Al - #a Ser Leu Val His Glu# 18705- Trp Leu Asp Ser Leu Pro His Gln Pro Gly Pr - #o Ala Pro Leu Thr Gly# 18850- Gly Lys His 1890__________________________________________________________________________
Claims
  • 1. An isolated DNA molecule comprising a nucleotide sequence that encodes a polypeptide comprising a platenolide synthase domain.
  • 2. An isolated DNA molecule comprising a nucleotide sequence that encodes a polypeptide comprising a platenolide synthase domain endogenous to a platenolide-producing organism.
  • 3. An isolated DNA molecule comprising a nucleotide sequence that encodes a polypeptide comprising a platenolide synthase domain endogenous to a spiramycin-producing organism.
  • 4. An isolated DNA molecule comprising a nucleotide sequence that encodes a polypeptide comprising a platenolide synthase domain endogenous to a Streptomycete.
  • 5. An isolated DNA molecule comprising a nucleotide sequence that encodes a polypeptide comprising a platenolide synthase domain endogenous to Streptomyces ambofaciens.
  • 6. The isolated DNA molecule of claim 2 wherein the nucleotide sequence is selected from the group consisting of:
  • nucleotides 392 to 1603, 1922 to 2995, 3173 to 3424, 3527 to 4798, 5135 to 6208, 7043 to 7597, 7946 to 8197, 8270 to 9541, 9899 to 10909, 10985 to 11530, 12596 to 13153, 13469 to 13720, 14148 to 15422, 15789 to 16844, 16914 to 17510, 18612 to 19166, 19479 to 19730, 20215 to 21486, 21889 to 22872, 23638 to 24159, 24484 to 24678, 24742 to 26016, 26371 to 27381, 27442 to 27966, 28843 to 29892, 29905 to 30462, 30760 to 31002, 31428 to 32696, 33024 to 34022, 34770 to 35327, 35586 to 35837, 36257 to 37528, 37898 to 38905, 39851 to 40408, 40658 to 40909, and 41297 to 41395 all in SEQ ID NO: 1.
  • 7. The isolated DNA molecule of claim 2 wherein the nucleotide sequence is selected from the group consisting of:
  • nucleotides 392 to 3424, 3527 to 8197, 8270 to 13720, 14148 to 19730, 20215 to 24678, 24742 to 31002, 31428 to 35837, and 36257 to 41395 all in SEQ ID NO: 1.
  • 8. The isolated DNA molecule of claim 2 wherein the nucleotide sequence is selected from the group consisting of:
  • nucleotides 350 to 14002, 14046 to 20036, 20110 to 31284, 31329 to 36071, and 36155 to 41830 all in SEQ ID NO: 1.
  • 9. An isolated DNA molecule consisting of nucleotide sequence of SEQ ID NO: 1.
  • 10. A recombinant DNA vector comprising the DNA molecule of claim 2.
  • 11. A recombinant DNA vector comprising the DNA molecule of claim 6.
  • 12. A recombinant DNA vector comprising the DNA molecule of claim 7.
  • 13. A recombinant DNA vector comprising the DNA molecule of claim 8.
  • 14. A recombinant DNA vector comprising the DNA molecule of claim 9.
  • 15. The recombinant DNA vector deposited under accession number NRRL B-21500.
  • 16. The recombinant DNA vector deposited under accession number NRRL B-21499.
  • 17. A host cell transformed with a recombinant DNA vector of claim 10.
  • 18. A host cell transformed with a recombinant DNA vector of claim 11.
  • 19. A host cell transformed with a recombinant DNA vector of claim 12.
  • 20. A host cell transformed with a recombinant DNA vector of claim 13.
  • 21. A host cell transformed with a recombinant DNA vector of claim 14.
  • 22. An isolated polypeptide comprising an amino acid sequence wherein said polypeptide comprises a platenolide synthase domain.
  • 23. An isolated polypeptide of claim 22 wherein the amino acid sequence is selected from the group consisting of:
  • (a) amino acids 15 to 418, 525 to 882, 942 to 1025, 1060 to 1483, 1596 to 1953, 2232 to 2416, 2533 to 2616, 2641 to 3064, 3184 to 3520, 3546 to 3727, 4083 to 4268, and 4374 to 4457 all in SEQ ID NO: 2;
  • (b) amino acids 35 to 459, 582 to 933, 957 to 1155, 1523 to 1707, and 1812 to 1895 all in SEQ ID NO: 3;
  • (c) amino acids 36 to 459, 594 to 921, 1177 to 1350, 1459 to 1523, 1545 to 1969, 2088 to 2424, 2445 to 2619, 2912 to 3261, 3266 to 3451, and 3551 to 3631 all in SEQ ID NO: 4;
  • (d) amino acids 34 to 456, 566 to 898, 1148 to 1333, and 1420 to 1503 all in SEQ ID NO: 5; and
  • (e) amino acids 35 to 458, 582 to 917, 1233 to 1418, 1502 to 1585, 1715 to 1747 all in SEQ ID NO: 6.
  • 24. An isolated polypeptide of claim 2 wherein the amino acid sequence is selected from the group consisting of:
  • (a) amino acids 15 to 1025, 1060 to 2616, and 2641 to 4457 all in SEQ ID NO: 2;
  • (b) amino acids 35 to 1895 in SEQ ID NO: 3;
  • (c) amino acids 36 to 1523, and 1545 to 3631 all in SEQ ID NO: 4;
  • (d) amino acids 34 to 1503 in SEQ ID NO: 5; and
  • (e) amino acids 35 to 1747 in SEQ ID NO: 6.
  • 25. A homogenous preparation of a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 2, 3, 4, 5, and 6.
Parent Case Info

This application claims the benefit of U.S. Provisional Application No. 60/012,050, filed Feb. 22, 1996.

US Referenced Citations (6)
Number Name Date Kind
5098837 Beckmann et al. Mar 1992
5252474 Gewain et al. Oct 1993
5614619 Piepersberg et al. Mar 1997
5639949 Ligon et al. Jun 1997
5643774 Ligon et al. Jul 1997
5662898 Ligon et al. Sep 1997
Foreign Referenced Citations (4)
Number Date Country
0463707 Jan 1992 EPX
WO 8703907 Jul 1987 WOX
WO 9313663 Jul 1993 WOX
WO9313663 Jul 1993 WOX
Non-Patent Literature Citations (31)
Entry
David A. Hopwood and David H. Sherman, "Molecular Genetics of Polyketides and its Comparison to Fatty Acid Biosynthesis," Annu. Rev. Genet., 24:37-66 (1990).
Stefano Donadio and Leonard Katz, "Organization of the enzymatic domains in the multifunctional polyketide synthase involved in erythromycin formation in Saccharopolyspora erythraea," Gene, 111:51-60 (1992).
Jesus Cortes, et al., "Repositioning of a Domain in Modular Polyketide Synthase to Promote Specific Chain Cleavage," Science, 268:1487-1489 (1995).
Stefano Donadio, et al., "Modular Organization of Genes Required for Complex Polyketide Biosynthesis," Science, 252:675-679 (1991).
M. A. Richardson, et al., "Cloning of Spiramycin Biosynthetic Genes and Their Use in Constructing Streptomyces ambofaciens Mutants Defective in Spiramycin Biosynthesis," Journal of Bacteriology, vol. 172, No. 7, 3790-3798 (1990).
Robert J. Beckmann, Karen Cox, and Eugene T. Seno, "A Cluster of Tylosin Biosynthetic Genes Is Interrupted by a Structurally Unstable Segment Containing Four Repeated Sequences," Genetics and Molecular Biology of Industrial Microorganisms, 176-186 (1989).
Leonard Katz and Stefano Donadio, "Polyketide Synthesis: Prospects for Hybrid Antibiotics," Annu. Rev. Microbiol., 47:875-912 (1993).
Stefano Donadio, et al., "Biosynthesis of the erythromycin macrolactone and a rational approach for producing hybrid macrolides," Gene, 115:97-103 (1992).
Dougals J. MacNeil, et al., "Complex organization of the Streptomyces avermitilis genes encoding the avermectin polyketide synthase," Gene, 115:119-125 (1992).
Douglas J. MacNeil, et al., "Correlation of the Avermectin Polyketide Synthase Genes to the Avermectin Structure," Annals New York Academy of Sciences, 721:123-132 (1994).
C. Richard Hutchinson, "Drug Synthesis by Genetically Engineered Microorganisms," Bio/Technology, 12:375-380 (1994).
Omura, S., et al., Journal of Biochemistry (Tokyo), vol. 86, Isolation and properties of spiramycin 1 3-hydroxyl acylase from Streptomyces ambofaciens:, pp. 1753-1758, (1979).
Richardson, M.A. et al., Journal of Facteriology, vol. 172, "cloning of spiramycin biosynthetic genes and their use in constructing Streptomyces ambofaciens mutants defective in spiramycin biosynthesis", pp. 3790-3798, (1990).
Geistlch, M., et al., Molecular Microbiology, vol. 6, "Characterization of a novel regulatory gene governing the expression of polyketide synthase gene in Streptomyces ambofaciens", pp. 2019-2029, (1992).
Kirst, H.A., Progress in Medicinal Chemistry, vol. 31, "Semisynthetic derivatives of 16-membered macrolide antibiotics".
Katz, L., et al., in Genetics and Biochemistry of Antibiotic Production, Vining, L.C., et al., Eds., Butterworth-Heinemann, Pub., "Macrolides", pp. 385-420, (1995).
Li, T., et al., Chinese Journal of Biotechnology, vol. 7, "Cloning and expression of spiramycin polyketide synthase genes from S. spiramyceticus U-941", pp. 33-42, (1996).
Kuhstoss, S., et al., Gene, vol. 183, "Production of a novel polyketide through the construction of hybrid polyketide synthase", pp. 231-236, (1996).
Aigle, B., et al., Microbiology, vol. 142, "An amplifiable and deletable locus of Streptomyces ambofaciens RP181110 contains a very large gene homologous to polyketide synthase genes", pp. 2815-2824. (1996).
Maezawa, I., et al., "Biological Gycosidation of Macrolide Aglycones. II Isolation and Characterization of Desosaminyl-Platenolide I," The Journal of Antibiotics, 31:309-318 (1978).
Furumai, T., et al., "Studies on the Biosynthesis of Basic 15-Membered Macrolide Antibiotics, Platenomycins, III Production, Isolation and Structures of Platenolides I and II," The Journal of Antibiotics, 28:783-788 (1975).
Furumai, T., et al., "Studies on the Biosynthesis of Basic 16-Membered Macrolide Antibiotics, Platenomycins, II Production, Isolation and Structures of 3-O-Propionyl-5-Omycaminosyl Platenolides I and II, 9-Dehydro Demycarosyl Platenomycin and Demycarosyl Platenomycin," The Journal of Antibiotics, 28:775-782 (1975).
Furumai T., et al., "Studies on the Biosynthesis of Basic 1-Membered Macrolide Antibiotic, Platenomycins. I* Selection of and Cosynthesis by Non-platenomycin-Producing Mutants," The Journal of Antibiotics, 28:770-774 (1975).
Grafe, U., et al., Isolation and Structures of Nitrogen-Free Platenolide Glycosides 11. The 5-O-(.alpha.-Mycarosyl)-and 5-O(3'-Demethyl-.beta.-Mycarosyl)-Platenolides I and II, The Journal of Antibiotics, 33:574-578 (1980).
Grafe, U. et al., Isolation and Structures of Nitrogen-Free Platenolide Glycosides I. The 5-O-(Deoxy-3'-C-acetyl-.beta.-D-Hexopyranosyl)-Platenolides I and II, The Journal of Antibiotics, 33:566-573 (1980).
Grafe, U., et al., "The Platenolides I and II as Precursors of Turimycin", The Journal of Antibiotics, 33:663-664 (1980).
Rakhit, S., et al., "Structure Activity Relationship in Sixteen Membered Macrolide Antibiotics," The Journal of Antibiotics, 27:221-224 (1974).
Omura, S., et al., "Biosynthetic Origin of Carbons 3 and 4 of Leucomycin Aglycone", The Journal of Antibiotics, 36:611-613 (1983).
Omura, S., et al., Journal of Biochemistry (Tokyo), vol. 86, "Isolation and properties of spiramycin 1 3-hydroxl acylase from Streptomyces ambofaciens", pp. 1753-1758, 1979.
Richardson, M. A., et al., Journal of Bacteriology, vol. 172, "Cloning of spiramycin biosynthetic genes and their use in constructing Streptomyces ambofaciens mutants defectve in spiramycin biosynthesis", pp. 3790-3798, 1990.
Kirst, H.A., Progress in Medicinal Chemistry, vol. 31, "Semi-synthetic derivatives of 16-membered macrolide antibiotics", pp. 266-299, 1994.