NUCLEIC ACID-GUIDED NUCLEASES

Abstract
Disclosed herein are nucleic acid-guided nucleases, guide nucleic acids, and targetable nuclease systems, and methods of use. Disclosed herein are engineered non-naturally occurring nucleic acid-guided nucleases, guide nucleic acids, and targetable nuclease systems, and methods of use. Targetable nuclease systems can be used to edit genetic targets, including recursive genetic engineering and trackable genetic engineering methods.
Description
INCORPORATION BY REFERENCE

Submitted with the present application is an electronically filed sequence listing via EFS-Web as an ASCII formatted sequence listing, entitled “INSC104US8_seqlist_20220309”, created Mar. 9, 2022, and 791,000 bytes in size. The sequence listing is part of the specification filed herewith and is incorporated by reference in its entirety.


BACKGROUND OF THE DISCLOSURE

Nucleic acid-guided nucleases have become important tools for research and genome engineering. The applicability of these tools can be limited by the sequence specificity requirements, expression, or delivery issues.


SEQUENCE LISTING

This application contains a sequence list in Table 6.


SUMMARY OF THE DISCLOSURE

Disclosed herein are methods of modifying a target region in the genome of a cell, the method comprising: (a) contacting a cell with: a non-naturally occurring nucleic acid-guided nuclease encoded by a nucleic acid having at least 80% identity to SEQ ID NO: 22; an engineered guide nucleic acid capable of complexing with the nucleic acid-guided nuclease; and an editing sequence encoding a nucleic acid complementary to said target region having a change in sequence relative to the target region; and (b) allowing the nuclease, guide nucleic acid, and editing sequence to create a genome edit in a target region of the genome of the cell. In some aspects, the engineered guide nucleic acid and the editing sequence are provided as a single nucleic acid. In some aspects, the single nucleic acid further comprises a mutation in a protospacer adjacent motif (PAM) site. In some aspects, the nucleic acid-guided nuclease is encoded by a nucleic acid with at least 85% identity to SEQ ID NO: 42. In some aspects, the nucleic acid-guided nuclease is encoded by a nucleic acid with at least 85% identity to SEQ ID NO: 128.


Disclosed herein are nucleic acid-guided nuclease systems comprising: (a) a non-naturally occurring nuclease encoded by a nucleic acid having at least 80% identity to SEQ ID NO: 22; (b) an engineered guide nucleic acid capable of complexing with the nucleic acid-guided nuclease; and (c) an editing sequence having a change in sequence relative to the sequence of a target region in a genome of a cell; wherein the system results in a genome edit in the target region in the genome of the cell facilitated by the nuclease, the engineered guide nucleic acid, and the editing sequence. In some aspects, the nucleic acid-guided nuclease is encoded by a nucleic acid with at least 85% identity to SEQ ID NO: 42. In some aspects, the nucleic acid-guided nuclease is encoded by a nucleic acid with at least 85% identity to SEQ ID NO: 128. In some aspects, the nucleic acid-guided nuclease is codon optimized for the cell to be edited. In some aspects, the engineered guide nucleic acid and the editing sequence are provided as a single nucleic acid. In some aspects, the single nucleic acid further comprises a mutation in a protospacer adjacent motif (PAM) site.


Disclosed herein are compositions for use in genome editing comprising a non-naturally occurring nuclease encoded by a nucleic acid having at least 75% identity to SEQ ID NO: 22. In some aspects, the nucleic acid has at least 80% identity to SEQ ID NO: 22. In some aspects, the nucleic acid has at least 90% identity to SEQ ID NO: 22. In some aspects, the nuclease is further codon optimized for use in cells from a particular organism. In some aspects, the nuclease is codon optimized for E. coli. In some aspects, the nuclease is codon optimized for S. cerevisiae. In some aspects, the nuclease is codon optimized for mammalian cells. In some aspects, the nucleic acid-guided nuclease has less than 40% protein identity to SEQ ID NO: 12. In some aspects, the nucleic acid-guided nuclease has less than 40% protein identity to SEQ ID NO: 108.


INCORPORATION BY REFERENCE

All publications and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication or patent application was specifically and individually indicated to be incorporated by reference.





BRIEF DESCRIPTION OF THE DRAWINGS

This patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.



FIG. 1A depicts a partial sequence alignment of MAD1-8 (SEQ ID NO: 1-8) and MAD10-12 (SEQ ID NO: 10-12).



FIG. 1B depicts a phylogenetic tree of nucleases including MAD1-8.



FIG. 2 depicts an example protein expression construct.



FIG. 3 depicts an example editing cassette.



FIG. 4 depicts an example screening or selection experiment workflow.



FIG. 5A depicts an example protein expression construct.



FIG. 5B depicts an example editing cassette.



FIG. 5C depicts an example screening or selection experiment workflow.



FIG. 6A depicts an example protein expression construct.



FIG. 6B depicts an example editing cassette.



FIG. 6C depicts an example screening or selection experiment workflow.



FIGS. 7A-7B depict example data from a functional nuclease complex screening or selection experiment.



FIG. 8 depicts example data from a targetable nuclease complex-based editing experiment.



FIG. 9 depicts example data from a targetable nuclease complex-based editing experiment.



FIGS. 10A-10C depict example data from a targetable nuclease complex-based editing experiment.



FIG. 11 depicts an example sequence alignment of select sequences from an editing experiment.



FIG. 12 depicts example data from a targetable nuclease complex-based editing experiment.



FIG. 13A depicts an example alignment of scaffold sequences.



FIG. 13B depicts an example model of a nucleic acid-guided nuclease complexed with a guide nucleic acid and a target sequence.



FIGS. 14A-14B depict example data from a primer validation experiment.



FIG. 15 depicts example data from a targetable nuclease complex-based editing experiment.



FIG. 16 depicts example validation data comparing results from two different assays.



FIGS. 17A-17C depict an example trackable genetic engineering workflow, including a plasmid comprising an editing cassette and a recording cassette, and downstream sequencing of barcodes in order to identify the incorporated edit or mutation.



FIG. 18 depicts an example trackable genetic engineering workflow, including iterative rounds of engineering with a different editing cassette and recorder cassette with unique barcode (BC) at each round, which can be followed by selection and tracking to confirm the successful engineering step at each round.



FIG. 19 depicts an example recursive engineering workflow.





DETAILED DESCRIPTION OF THE DISCLOSURE

The present disclosure provides nucleic acid-guided nucleases and methods of use. Often, the subject nucleic acid-guided nucleases are part of a targetable nuclease system comprising a nucleic acid-guided nuclease and a guide nucleic acid. A subject targetable nuclease system can be used to cleave, modify, and/or edit a target polynucleotide sequence, often referred to as a target sequence. A subject targetable nuclease system refers collectively to transcripts and other elements involved in the expression of or directing the activity of genes, which may include sequences encoding a subject nucleic acid-guided nuclease protein and a guide nucleic acid as disclosed herein.


Methods, systems, vectors, polynucleotides, and compositions described herein may be used in various applications including altering or modifying synthesis of a gene product, such as a protein, polynucleotide cleavage, polynucleotide editing, polynucleotide splicing; trafficking of target polynucleotide, tracing of target polynucleotide, isolation of target polynucleotide, visualization of target polynucleotide, etc. Aspects of the invention also encompass methods and uses of the compositions and systems described herein in genome engineering, e.g. for altering or manipulating the expression of one or more genes or the one or more gene products, in prokaryotic, archaeal, or eukaryotic cells, in vitro, in vivo or ex vivo.


Nucleic Acid-Guided Nucleases

Bacterial and archaeal targetable nuclease systems have emerged as powerful tools for precision genome editing. However, naturally occurring nucleases have some limitations, including expression and delivery challenges due to the nucleic acid sequence and protein size. Targetable nucleases that require PAM recognition are also limited in the sequences they can target throughout a genetic sequence. Other challenges include processivity, target recognition specificity and efficiency, and nuclease activity efficiency, which often affect genetic editing efficiency.


Non-naturally occurring targetable nucleases and non-naturally occurring targetable nuclease systems can address many of these challenges and limitations.


Disclosed herein are non-naturally occurring targetable nuclease systems. Such targetable nuclease systems are engineered to address one or more of the challenges described above and can be referred to as engineered nuclease systems. Engineered nuclease systems can comprise one or more of an engineered nuclease, such as an engineered nucleic acid-guided nuclease, an engineered guide nucleic acid, an engineered polynucleotide encoding said nuclease, or an engineered polynucleotide encoding said guide nucleic acid. Engineered nucleases, engineered guide nucleic acids, and engineered polynucleotides encoding the engineered nuclease or engineered guide nucleic acid are not naturally occurring and are not found in nature. It follows that engineered nuclease systems including one or more of these elements are non-naturally occurring.


Non-limiting examples of types of engineering that can be done to obtain a non-naturally occurring nuclease system are as follows. Engineering can include codon optimization to facilitate expression or improve expression in a host cell, such as a heterologous host cell. Engineering can reduce the size or molecular weight of the nuclease in order to facilitate expression or delivery. Engineering can alter PAM selection in order to change PAM specificity or to broaden the range of recognized PAMs. Engineering can alter, increase, or decrease stability, processivity, specificity, or efficiency of a targetable nuclease system. Engineering can alter, increase, or decrease protein stability. Engineering can alter, increase, or decrease processivity of nucleic acid scanning. Engineering can alter, increase, or decrease target sequence specificity. Engineering can alter, increase, or decrease nuclease activity. Engineering can alter, increase, or decrease editing efficiency. Engineering can alter, increase, or decrease transformation efficiency. Engineering can alter, increase, or decrease nuclease or guide nucleic acid expression.


Examples of non-naturally occurring nucleic acid sequences which are disclosed herein include sequences codon optimized for expression in bacteria, such as E. coli (e.g., SEQ ID NO: 41-60), sequences codon optimized for expression in single cell eukaryotes, such as yeast (e.g., SEQ ID NO: 127-146), sequences codon optimized for expression in multi cell eukaryotes, such as human cells (e.g., SEQ ID NO: 147-166), polynucleotides used for cloning or expression of any sequences disclosed herein (e.g., SEQ ID NO: 61-80), plasmids comprising nucleic acid sequences (e.g., SEQ ID NO: 21-40) operably linked to a heterologous promoter or nuclear localization signal or other heterologous element, proteins generated from engineered or codon optimized nucleic acid sequences (e.g., SEQ ID NO: 1-20), or engineered guide nucleic acids comprising any one of SEQ ID NO: 84-107. Such non-naturally occurring nucleic acid sequences can be amplified, cloned, assembled, synthesized, generated from synthesized oligonucleotides or dNTPs, or otherwise obtained using methods known by those skilled in the art.


Disclosed herein are nucleic acid-guided nucleases. Subject nucleases are functional in vitro, or in prokaryotic, archaeal, or eukaryotic cells for in vitro, in vivo, or ex vivo applications. Suitable nucleic acid-guided nucleases can be from an organism from a genus which includes but is not limited to Thiomicrospira, Succinivibrio, Candidatus, Porphyromonas, Acidaminococcus, Acidomonococcus, Prevotella, Smithella, Moraxella, Synergistes, Francisella, Leptospira, Catenibacterium, Kandleria, Clostridium, Dorea, Coprococcus, Enterococcus, Fructobacillus, Weissella, Pediococcus, Corynebacter, Sutterella, Legionella, Treponema, Roseburia, Filifactor, Eubacterium, Streptococcus, Lactobacillus, Mycoplasma, Bacteroides, Flaviivola, Flavobacterium, Sphaerochaeta, Azospirillum, Gluconacetobacter, Neisseria, Roseburia, Parvibaculum, Staphylococcus, Nitratifractor, Mycoplasma, Alicyclobacillus, Brevibacillus, Bacillus, Bacteroidetes, Brevibacillus, Carnobacterium, Clostridiaridium, Clostridium, Desulfonatronum, Desulfovibrio, Helcococcus, Leptotrichia, Listeria, Methanomethylophilus, Methylobacterium, Opitutaceae, Paludibacter, Rhodobacter, Sphaerochaeta, Tuberibacillus, Oleiphilus, Omnitrophica, Parcubacteria, and Campylobacter. Species of organism of such a genus can be as otherwise herein discussed. Suitable nucleic acid-guided nucleases can be from an organism from a genus or unclassified genus within a kingdom which includes but is not limited to Firmicutes, Actinobacteria, Bacteroidetes, Proteobacteria, Spirochaetes, and Tenericutes. Suitable nucleic acid-guided nucleases can be from an organism from a genus or unclassified genus within a phylum which includes but is not limited to Erysipelotrichia, Clostridia, Bacilli, Actinobacteria, Bacteroidetes, Flavobacteria, Alphaproteobacteria, Betaproteobacteria, Gammaproteobacteria, Deltaproteobacteria, Epsilonproteobacteria, Spirochaetes, and Mollicutes.
Suitable nucleic acid-guided nucleases can be from an organism from a genus or unclassified genus within an order which includes but is not limited to Clostridiales, Lactobacillales, Actinomycetales, Bacteroidales, Flavobacteriales, Rhizobiales, Rhodospirillales, Burkholderiales, Neisseriales, Legionellales, Nautiliales, Campylobacterales, Spirochaetales, Mycoplasmatales, and Thiotrichales. Suitable nucleic acid-guided nucleases can be from an organism from a genus or unclassified genus within a family which includes but is not limited to Lachnospiraceae, Enterococcaceae, Leuconostocaceae, Lactobacillaceae, Streptococcaceae, Peptostreptococcaceae, Staphylococcaceae, Eubacteriaceae, Corynebacterineae, Bacteroidaceae, Flavobacterium, Cryomorphaceae, Rhodobiaceae, Rhodospirillaceae, Acetobacteraceae, Sutterellaceae, Neisseriaceae, Legionellaceae, Nautiliaceae, Campylobacteraceae, Spirochaetaceae, Mycoplasmataceae, Piscirickettsiaceae, and Francisellaceae. Other nucleic acid-guided nucleases have been described in US Patent Application Publication No. US20160208243 filed Dec. 18, 2015, US Application Publication No. US20140068797 filed Mar. 15, 2013, U.S. Pat. No. 8,697,359 filed Oct. 15, 2013, and Zetsche et al., Cell 2015 Oct. 22; 163(3):759-71, each of which is incorporated herein by reference in its entirety.


Some nucleic acid-guided nucleases suitable for use in the methods, systems, and compositions of the present disclosure include those derived from an organism such as, but not limited to, Thiomicrospira sp. XS5, Eubacterium rectale, Succinivibrio dextrinosolvens, Candidatus Methanoplasma termitum, Candidatus Methanomethylophilus alvus, Porphyromonas crevioricanis, Flavobacterium branchiophilum, Acidaminococcus sp., Acidomonococcus sp., Lachnospiraceae bacterium COE1, Prevotella brevis ATCC 19188, Smithella sp. SCADC, Moraxella bovoculi, Synergistes jonesii, Bacteroidetes oral taxon 274, Francisella tularensis, Leptospira inadai serovar Lyme str. 10, Acidomonococcus sp. crystal structure (5B43), S. mutans, S. agalactiae, S. equisimilis, S. sanguinis, S. pneumoniae; C. jejuni, C. coli; N. salsuginis, N. tergarcus; S. auricularis, S. carnosus; N. meningitidis, N. gonorrhoeae; L. monocytogenes, L. ivanovii; C. botulinum, C. difficile, C. tetani, C. sordellii; Francisella tularensis 1, Prevotella albensis, Lachnospiraceae bacterium MC2017 1, Butyrivibrio proteoclasticus, Butyrivibrio proteoclasticus B316, Peregrinibacteria bacterium GW2011_GWA2_33_10, Parcubacteria bacterium GW2011_GWC2_44_17, Smithella sp. SCADC, Acidaminococcus sp. BV3L6, Lachnospiraceae bacterium MA2020, Candidatus Methanoplasma termitum, Eubacterium eligens, Moraxella bovoculi 237, Leptospira inadai, Lachnospiraceae bacterium ND2006, Porphyromonas crevioricanis 3, Prevotella disiens, Porphyromonas macacae, Catenibacterium sp. CAG:290, Kandleria vitulina, Clostridiales bacterium KA00274, Lachnospiraceae bacterium 3-2, Dorea longicatena, Coprococcus catus GD/7, Enterococcus columbae DSM 7374, Fructobacillus sp. EFB-N1, Weissella halotolerans, Pediococcus acidilactici, Lactobacillus curvatus, Streptococcus pyogenes, Lactobacillus versmoldensis, Filifactor alocis ATCC 35896, Alicyclobacillus acidoterrestris, Alicyclobacillus acidoterrestris ATCC 49025, Desulfovibrio inopinatus, Desulfovibrio inopinatus DSM 10711, Oleiphilus sp., Oleiphilus sp. HI0009, Candidatus kefeldibacteria, Parcubacteria CasY.4, Omnitrophica WOR 2 bacterium GWF2, Bacillus sp. NSP2.1, and Bacillus thermoamylovorans.


In some instances, a nucleic acid-guided nuclease disclosed herein comprises an amino acid sequence comprising at least 50% amino acid identity to any one of SEQ ID NO: 1-20. In some instances, a nuclease comprises an amino acid sequence comprising at least about 10%, 20%, 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, or 100% amino acid identity to any one of SEQ ID NO: 1-20. In some cases, the nucleic acid-guided nuclease comprises at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% amino acid identity to any one of SEQ ID NO: 1-20. In some cases, the nucleic acid-guided nuclease comprises at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% amino acid identity to any one of SEQ ID NO: 1-8 or 10-12. In some cases, the nucleic acid-guided nuclease comprises at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% amino acid identity to any one of SEQ ID NO: 1-8 or 10-11. In some cases, the nucleic acid-guided nuclease comprises at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% amino acid identity to SEQ ID NO: 2. In some cases, the nucleic acid-guided nuclease comprises at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% amino acid identity to SEQ ID NO: 7.


In some cases, the nucleic acid-guided nuclease comprises any one of SEQ ID NO: 1-20. In some cases, the nucleic acid-guided nuclease comprises any one of SEQ ID NO: 1-8 or 10-12. In some cases, the nucleic acid-guided nuclease comprises any one of SEQ ID NO: 1-8 or 10-11. In some cases, the nucleic acid-guided nuclease comprises SEQ ID NO: 2. In some cases, the nucleic acid-guided nuclease comprises SEQ ID NO: 7.


In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110. In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 50% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110. In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 45% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110. In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 40% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110. In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 35% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110. In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 30% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110.
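
The identity thresholds recited above ("at least" relative to SEQ ID NO: 1-20, "at most" relative to SEQ ID NO: 12 or 108-110) can be illustrated with a minimal sketch. It assumes the two sequences have already been aligned by some suitable pairwise alignment tool and that identity is counted over ungapped alignment columns; the disclosure does not fix a particular algorithm or denominator, so both choices are assumptions. The function name and the peptide fragments are hypothetical and are not taken from the sequence listing.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Return percent identity over aligned columns that contain no gap.

    Assumes both inputs come from the same pairwise alignment, so they have
    equal length. Other denominators (for example, full alignment length)
    are also in common use and give different values.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == gap or y == gap:
            continue  # skip columns where either sequence has a gap
        compared += 1
        matches += x == y
    return 100.0 * matches / compared if compared else 0.0


# Toy, made-up peptide fragments; not taken from any SEQ ID NO.
candidate = "MKT-AYIAKQR"
reference = "MKTLAYLAKQR"
identity = percent_identity(candidate, reference)

print(f"{identity:.1f}% identity")
print("meets an 'at least 50%' threshold:", identity >= 50.0)
print("meets an 'at most 40%' threshold:", identity <= 40.0)
```

The same function applies unchanged to aligned nucleic acid sequences, since it only compares characters column by column.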


In some instances, a nucleic acid-guided nuclease disclosed herein is encoded by a nucleic acid sequence comprising at least 50% sequence identity to any one of SEQ ID NO: 21-40. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 10%, 20%, 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, or 100% sequence identity to any one of SEQ ID NO: 21-40. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 21-40. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 21-40. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 21-28 or 30-32. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 21-28 or 30-31. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to SEQ ID NO: 22. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to SEQ ID NO: 27.


In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 21-40. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 21-28 or 30-32. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 21-28 or 30-31. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 22. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 27.


In some instances, a nucleic acid-guided nuclease disclosed herein is encoded on a nucleic acid sequence. Such a nucleic acid can be codon optimized for expression in a desired host cell. Suitable host cells can include, as non-limiting examples, prokaryotic cells such as E. coli, P. aeruginosa, B. subtilis, and V. natriegens, and eukaryotic cells such as S. cerevisiae, plant cells, insect cells, nematode cells, amphibian cells, fish cells, or mammalian cells, including human cells.
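
As a rough illustration of codon optimization for a host cell, the following sketch back-translates a protein sequence using one preferred codon per residue. This is only the simplest possible scheme and is not asserted to be how the disclosed codon-optimized sequences were produced; practical tools also weigh factors such as GC content, repeats, and mRNA structure. The table is a deliberately partial, hard-coded assumption (codons commonly reported as frequent in E. coli for seven symbols), and the function name and input peptide are hypothetical.

```python
# Illustrative subset only: one commonly preferred E. coli codon per symbol,
# hard-coded here as an assumption rather than taken from the disclosure.
PREFERRED_CODON = {
    "M": "ATG",
    "K": "AAA",
    "L": "CTG",
    "E": "GAA",
    "A": "GCG",
    "G": "GGC",
    "*": "TAA",  # stop
}


def back_translate(protein: str, table: dict = PREFERRED_CODON) -> str:
    """Replace each residue with its single preferred codon from `table`."""
    codons = []
    for residue in protein.upper():
        if residue not in table:
            raise ValueError(f"no preferred codon listed for {residue!r}")
        codons.append(table[residue])
    return "".join(codons)


# Made-up peptide, not a fragment of any disclosed nuclease.
coding_sequence = back_translate("MKLEAG*")
print(coding_sequence)
```

Swapping in a different table (for example, one built from S. cerevisiae or human codon usage) changes the output nucleic acid while leaving the encoded protein unchanged, which is why codon-optimized variants of one nuclease can differ substantially at the nucleotide level.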


A nucleic acid sequence encoding a nucleic acid-guided nuclease can be codon optimized for expression in gram positive bacteria, e.g., Bacillus subtilis, or gram negative bacteria, e.g., E. coli. In some instances, a nucleic acid-guided nuclease disclosed herein is encoded by a nucleic acid sequence comprising at least 50% sequence identity to any one of SEQ ID NO: 41-60. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 10%, 20%, 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, or 100% sequence identity to any one of SEQ ID NO: 41-60. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 41-60. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 41-48 or 50-52. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 41-48 or 50-51. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to SEQ ID NO: 42. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to SEQ ID NO: 47.


In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 41-60. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 41-48 or 50-52. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 41-48 or 50-51. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 42. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 47.


A nucleic acid sequence encoding a nucleic acid-guided nuclease can be codon optimized for expression in a species of yeast, e.g., S. cerevisiae. In some instances, a nucleic acid-guided nuclease disclosed herein is encoded by a nucleic acid sequence comprising at least 50% sequence identity to any one of SEQ ID NO: 127-146. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 10%, 20%, 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, or 100% sequence identity to any one of SEQ ID NO: 127-146. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 127-146. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 127-146. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 127-134 or 136-138. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 127-134 or 136-137. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to SEQ ID NO: 128. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to SEQ ID NO: 133.


In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 127-146. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 127-134 or 136-138. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 127-134 or 136-137. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 128. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 133.


A nucleic acid sequence encoding a nucleic acid-guided nuclease can be codon optimized for expression in mammalian cells. In some instances, a nucleic acid-guided nuclease disclosed herein is encoded by a nucleic acid sequence comprising at least 50% sequence identity to any one of SEQ ID NO: 147-166. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 10%, 20%, 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, or 100% sequence identity to any one of SEQ ID NO: 147-166. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 147-166. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 147-154 or 156-158. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 147-154 or 156-157. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to SEQ ID NO: 148. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to SEQ ID NO: 153.


In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 147-166. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 147-154 or 156-158. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 147-154 or 156-157. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 148. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 153.


A nucleic acid sequence encoding a nucleic acid-guided nuclease can be operably linked to a promoter. Such nucleic acid sequences can be linear or circular. The nucleic acid sequences can be comprised on a larger linear or circular nucleic acid sequence that comprises additional elements such as an origin of replication, selectable or screenable marker, terminator, other components of a targetable nuclease system, such as a guide nucleic acid, or an editing or recorder cassette as disclosed herein. These larger nucleic acid sequences can be recombinant expression vectors, as are described in more detail later.


Guide Nucleic Acid

In general, a guide nucleic acid can complex with a compatible nucleic acid-guided nuclease and can hybridize with a target sequence, thereby directing the nuclease to the target sequence. A subject nucleic acid-guided nuclease capable of complexing with a guide nucleic acid can be referred to as a nucleic acid-guided nuclease that is compatible with the guide nucleic acid. Likewise, a guide nucleic acid capable of complexing with a nucleic acid-guided nuclease can be referred to as a guide nucleic acid that is compatible with the nucleic acid-guided nuclease.


A guide nucleic acid can be DNA. A guide nucleic acid can be RNA. A guide nucleic acid can comprise both DNA and RNA. A guide nucleic acid can comprise modified or non-naturally occurring nucleotides. In cases where the guide nucleic acid comprises RNA, the RNA guide nucleic acid can be encoded by a DNA sequence on a polynucleotide molecule such as a plasmid, linear construct, or editing cassette as disclosed herein.


A guide nucleic acid can comprise a guide sequence. A guide sequence is a polynucleotide sequence having sufficient complementarity with a target polynucleotide sequence to hybridize with the target sequence and direct sequence-specific binding of a complexed nucleic acid-guided nuclease to the target sequence. The degree of complementarity between a guide sequence and its corresponding target sequence, when optimally aligned using a suitable alignment algorithm, is about or more than about 50%, 60%, 75%, 80%, 85%, 90%, 95%, 97.5%, 99%, or more. Optimal alignment may be determined with the use of any suitable algorithm for aligning sequences. In some embodiments, a guide sequence is about or more than about 5, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, 75, or more nucleotides in length. In some embodiments, a guide sequence is less than about 75, 50, 45, 40, 35, 30, 25, 20 nucleotides in length. Preferably the guide sequence is 10-30 nucleotides long. The guide sequence can be 15-20 nucleotides in length. The guide sequence can be 15 nucleotides in length. The guide sequence can be 16 nucleotides in length. The guide sequence can be 17 nucleotides in length. The guide sequence can be 18 nucleotides in length. The guide sequence can be 19 nucleotides in length. The guide sequence can be 20 nucleotides in length.
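
The degree of complementarity described above can be illustrated with a short sketch. It assumes an ungapped, equal-length comparison between a guide sequence and a target sequence, both written 5' to 3', and it counts only Watson-Crick pairs (G-U wobble pairs are not counted). The function names and the example target are hypothetical and are not drawn from the sequence listing; a full alignment algorithm would be needed to handle gaps or unequal lengths.

```python
# Watson-Crick pairs between a guide base (DNA or RNA) and a target base.
_PAIRS = {("A", "T"), ("T", "A"), ("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}
_RNA_COMPLEMENT = {"A": "U", "T": "A", "U": "A", "G": "C", "C": "G"}


def reverse_complement_rna(seq: str) -> str:
    """RNA reverse complement of a DNA or RNA sequence written 5'->3'."""
    return "".join(_RNA_COMPLEMENT[base] for base in reversed(seq.upper()))


def percent_complementarity(guide: str, target: str) -> float:
    """Percent of guide positions that base-pair with the target strand.

    Both sequences are written 5'->3', so the guide is compared against the
    target read in reverse (antiparallel pairing).
    """
    guide, target = guide.upper(), target.upper()
    if not guide or len(guide) != len(target):
        raise ValueError("guide and target must be non-empty and of equal length")
    paired = sum((g, t) in _PAIRS for g, t in zip(guide, reversed(target)))
    return 100.0 * paired / len(guide)


if __name__ == "__main__":
    target = "ATGCCGTTAGCTAGGCTAAC"          # hypothetical 20-nt target strand
    guide = reverse_complement_rna(target)   # perfectly complementary guide
    print(len(guide), percent_complementarity(guide, target))
```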


A guide nucleic acid can comprise a scaffold sequence. In general, a “scaffold sequence” includes any sequence that has sufficient sequence to promote formation of a targetable nuclease complex, wherein the targetable nuclease complex comprises a nucleic acid-guided nuclease and a guide nucleic acid comprising a scaffold sequence and a guide sequence. Sufficient sequence within the scaffold sequence to promote formation of a targetable nuclease complex may include a degree of complementarity along the length of two sequence regions within the scaffold sequence, such as one or two sequence regions involved in forming a secondary structure. In some cases, the one or two sequence regions are comprised or encoded on the same polynucleotide. In some cases, the one or two sequence regions are comprised or encoded on separate polynucleotides. Optimal alignment may be determined by any suitable alignment algorithm, and may further account for secondary structures, such as self-complementarity within either the one or two sequence regions. In some embodiments, the degree of complementarity between the one or two sequence regions along the length of the shorter of the two when optimally aligned is about or more than about 25%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 97.5%, 99%, or higher. In some embodiments, at least one of the two sequence regions is about or more than about 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 40, 50, or more nucleotides in length.


A scaffold sequence of a subject guide nucleic acid can comprise a secondary structure. A secondary structure can comprise a pseudoknot region. In some cases, binding kinetics of a guide nucleic acid to a nucleic acid-guided nuclease are determined in part by secondary structures within the scaffold sequence. In some cases, binding kinetics of a guide nucleic acid to a nucleic acid-guided nuclease are determined in part by the nucleic acid sequence within the scaffold sequence.


A scaffold sequence can comprise the sequence of any one of SEQ ID NO: 84-107. A scaffold sequence can comprise the sequence of any one of SEQ ID NO: 84-103. A scaffold sequence can comprise the sequence of any one of SEQ ID NO: 84-91 or 93-95. A scaffold sequence can comprise the sequence of any one of SEQ ID NO: 88, 93, 94, or 95. A scaffold sequence can comprise the sequence of SEQ ID NO: 88. A scaffold sequence can comprise the sequence of SEQ ID NO: 93. A scaffold sequence can comprise the sequence of SEQ ID NO: 94. A scaffold sequence can comprise the sequence of SEQ ID NO: 95.


In some aspects, the invention provides a nuclease that binds to a guide nucleic acid comprising a conserved scaffold sequence. For example, the nucleic acid-guided nucleases for use in the present disclosure can bind to a conserved pseudoknot region as shown in FIG. 13A. Specifically, the nucleic acid-guided nucleases for use in the present disclosure can bind to a guide nucleic acid comprising a conserved pseudoknot region as shown in FIG. 13A. Certain nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-1 (SEQ ID NO: 172). Other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-3 (SEQ ID NO: 173). Still other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-4 (SEQ ID NO: 174). Other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-5 (SEQ ID NO: 175). Other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-6 (SEQ ID NO: 176). Still other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-7 (SEQ ID NO: 177). 
Other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-8 (SEQ ID NO: 178). Other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-10 (SEQ ID NO: 179). Still other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-11 (SEQ ID NO: 180). Certain nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-12 (SEQ ID NO: 181). Additional sequences in FIG. 13A include those for the consensus sequence (SEQ ID NO: 190); frame 1 (SEQ ID NO: 191); scaffold-1 (SEQ ID NO: 192); scaffold-2 (SEQ ID NO: 193); scaffold-3 (SEQ ID NO: 194); scaffold-4 (SEQ ID NO: 195); scaffold-5 (SEQ ID NO: 196); scaffold-6 (SEQ ID NO: 197); scaffold-7 (SEQ ID NO: 198); scaffold-8 (SEQ ID NO: 199); scaffold-10 (SEQ ID NO: 200); scaffold-11 (SEQ ID NO: 201); and scaffold-12 (SEQ ID NO: 202).


A guide nucleic acid can comprise the sequence of any one of SEQ ID NO: 84-107. A guide nucleic acid can comprise the sequence of any one of SEQ ID NO: 84-103. A guide nucleic acid can comprise the sequence of any one of SEQ ID NO: 84-91 or 93-95. A guide nucleic acid can comprise the sequence of any one of SEQ ID NO: 88, 93, 94, or 95. A guide nucleic acid can comprise the sequence of SEQ ID NO: 88. A guide nucleic acid can comprise the sequence of SEQ ID NO: 93. A guide nucleic acid can comprise the sequence of SEQ ID NO: 94. A guide nucleic acid can comprise the sequence of SEQ ID NO: 95.


In aspects of the invention the term “guide nucleic acid” refers to one or more polynucleotides comprising 1) a guide sequence capable of hybridizing to a target sequence and 2) a scaffold sequence capable of interacting with or complexing with a nucleic acid-guided nuclease as described herein. A guide nucleic acid may be provided as one or more nucleic acids. In specific embodiments, the guide sequence and the scaffold sequence are provided as a single polynucleotide.


A guide nucleic acid can be compatible with a nucleic acid-guided nuclease when the two elements can form a functional targetable nuclease complex capable of cleaving a target sequence. Often, a compatible scaffold sequence for a compatible guide nucleic acid can be found by scanning sequences adjacent to a native nucleic acid-guided nuclease locus. In other words, native nucleic acid-guided nucleases can be encoded on a genome within proximity to a corresponding compatible guide nucleic acid or scaffold sequence.


Nucleic acid-guided nucleases can be compatible with guide nucleic acids that are not found within the nuclease's endogenous host. Such orthogonal guide nucleic acids can be determined by empirical testing. Orthogonal guide nucleic acids can come from different bacterial species or be synthetic or otherwise engineered to be non-naturally occurring.


Orthogonal guide nucleic acids that are compatible with a common nucleic acid-guided nuclease can comprise one or more common features. Common features can include sequence outside a pseudoknot region. Common features can include a pseudoknot region. Common features can include a primary sequence or secondary structure.


A guide nucleic acid can be engineered to target a desired target sequence by altering the guide sequence such that the guide sequence is complementary to the target sequence, thereby allowing hybridization between the guide sequence and the target sequence. A guide nucleic acid with an engineered guide sequence can be referred to as an engineered guide nucleic acid. Engineered guide nucleic acids are often non-naturally occurring and are not found in nature.


Targetable Nuclease System

Disclosed herein are targetable nuclease systems. A targetable nuclease system can comprise a nucleic acid-guided nuclease and a compatible guide nucleic acid. A targetable nuclease system can comprise a nucleic acid-guided nuclease or a polynucleotide sequence encoding the nucleic acid-guided nuclease. A targetable nuclease system can comprise a guide nucleic acid or a polynucleotide sequence encoding the guide nucleic acid.


In general, a targetable nuclease system as disclosed herein is characterized by elements that promote the formation of a targetable nuclease complex at the site of a target sequence, wherein the targetable nuclease complex comprises a nucleic acid-guided nuclease and a guide nucleic acid.


A guide nucleic acid together with a nucleic acid-guided nuclease forms a targetable nuclease complex which is capable of binding to a target sequence within a target polynucleotide, as determined by the guide sequence of the guide nucleic acid.


In general, to generate a double-stranded break, a targetable nuclease complex binds to a target sequence as determined by the guide nucleic acid, and in most cases the nuclease must also recognize a protospacer adjacent motif (PAM) sequence adjacent to the target sequence.


A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-20 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-8 or 10-12 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-8 or 10-11 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid. In any of these cases, the guide nucleic acid can comprise a scaffold sequence compatible with the nucleic acid-guided nuclease. In any of these cases, the guide nucleic acid can further comprise a guide sequence. The guide sequence can be engineered to target any desired target sequence. The guide sequence can be engineered to be complementary to any desired target sequence. The guide sequence can be engineered to hybridize to any desired target sequence.


A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-20 and a compatible guide nucleic acid comprising any one of SEQ ID NO: 84-107. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-8 or 10-12 and a compatible guide nucleic acid comprising any one of SEQ ID NO: 84-95. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-8 or 10-11 and a compatible guide nucleic acid comprising any one of SEQ ID NO: 84-91 or 93-95. In any of these cases, the guide nucleic acid can further comprise a guide sequence. The guide sequence can be engineered to target any desired target sequence. The guide sequence can be engineered to be complementary to any desired target sequence. The guide sequence can be engineered to hybridize to any desired target sequence.


A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid comprising any one of SEQ ID NO: 88, 93, 94, or 95. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid comprising SEQ ID NO: 88. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid comprising SEQ ID NO: 93. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid comprising SEQ ID NO: 94. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid comprising SEQ ID NO: 95. In any of these cases, the guide nucleic acid can further comprise a guide sequence. The guide sequence can be engineered to target any desired target sequence. The guide sequence can be engineered to be complementary to any desired target sequence. The guide sequence can be engineered to hybridize to any desired target sequence.


A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid comprising any one of SEQ ID NO: 88, 93, 94, or 95. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid comprising SEQ ID NO: 88. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid comprising SEQ ID NO: 93. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid comprising SEQ ID NO: 94. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid comprising SEQ ID NO: 95. In any of these cases, the guide nucleic acid can further comprise a guide sequence. The guide sequence can be engineered to target any desired target sequence. The guide sequence can be engineered to be complementary to any desired target sequence. The guide sequence can be engineered to hybridize to any desired target sequence.


A target sequence of a targetable nuclease complex can be any polynucleotide endogenous or exogenous to a prokaryotic or eukaryotic cell, or in vitro. For example, the target sequence can be a polynucleotide residing in the nucleus of the eukaryotic cell. A target sequence can be a sequence coding a gene product (e.g., a protein) or a non-coding sequence (e.g., a regulatory polynucleotide or junk DNA). Without wishing to be bound by theory, it is believed that the target sequence should be associated with a PAM; that is, a short sequence recognized by a targetable nuclease complex. The precise sequence and length requirements for a PAM differ depending on the nucleic acid-guided nuclease used, but PAMs are typically 2-5 base pair sequences adjacent to the target sequence. Examples of PAM sequences are given in the examples section below, and the skilled person will be able to identify further PAM sequences for use with a given nucleic acid-guided nuclease. Further, engineering of the PAM Interacting (PI) domain may allow programming of PAM specificity, improve target site recognition fidelity, and increase the versatility of a nucleic acid-guided nuclease genome engineering platform. Nucleic acid-guided nucleases may be engineered to alter their PAM specificity, for example as described in Kleinstiver B P et al. Engineered CRISPR-Cas9 nucleases with altered PAM specificities. Nature. 2015 Jul. 23; 523 (7561): 481-5. doi: 10.1038/nature14592.


A PAM site is a nucleotide sequence in proximity to a target sequence. In most cases, a nucleic acid-guided nuclease can only cleave a target sequence if an appropriate PAM is present. PAMs are nucleic acid-guided nuclease-specific and can be different between two different nucleic acid-guided nucleases. A PAM can be 5′ or 3′ of a target sequence. A PAM can be upstream or downstream of a target sequence. A PAM can be 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides in length. Often, a PAM is between 2-6 nucleotides in length.
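
The relationship between a PAM and an adjacent target sequence can be sketched as a simple scan. The example below is illustrative only: the PAM motifs used ("TTN" on the 5′ side, "NGG" on the 3′ side), the target length, and the function name are assumptions chosen for demonstration and are not asserted to be the PAM of any nuclease disclosed herein. Only the strand as given is scanned; a fuller search would also scan the reverse complement.

```python
import re

# IUPAC nucleotide codes expanded to regular-expression character classes.
_IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}


def find_pam_adjacent_targets(seq, pam, target_len=20, pam_side="5prime"):
    """Return (target_start, pam_found, target) for each PAM-adjacent site.

    pam_side="5prime" means the PAM lies immediately 5' of the target;
    pam_side="3prime" means it lies immediately 3' of the target.
    """
    seq = seq.upper()
    pam_re = "".join(_IUPAC[code] for code in pam.upper())
    target_re = f"[ACGT]{{{target_len}}}"
    if pam_side == "5prime":
        pattern, pam_group, target_group = f"(?=({pam_re})({target_re}))", 1, 2
    elif pam_side == "3prime":
        pattern, pam_group, target_group = f"(?=({target_re})({pam_re}))", 2, 1
    else:
        raise ValueError("pam_side must be '5prime' or '3prime'")
    # A lookahead is used so that overlapping candidate sites are all reported.
    return [
        (m.start(target_group), m.group(pam_group), m.group(target_group))
        for m in re.finditer(pattern, seq)
    ]


if __name__ == "__main__":
    demo = "GGTTACCCCGGGGAAAA"  # hypothetical sequence
    print(find_pam_adjacent_targets(demo, "TTN", target_len=5, pam_side="5prime"))
    print(find_pam_adjacent_targets(demo, "NGG", target_len=5, pam_side="3prime"))
```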


In some examples, a PAM can be provided on a separate oligonucleotide. In such cases, providing the PAM on an oligonucleotide allows cleavage of a target sequence that otherwise could not be cleaved because no adjacent PAM is present on the same polynucleotide as the target sequence.


Polynucleotide sequences encoding a component of a targetable nuclease system can comprise one or more vectors. In general, the term “vector” refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. Vectors include, but are not limited to, nucleic acid molecules that are single-stranded, double-stranded, or partially double-stranded; nucleic acid molecules that comprise one or more free ends, no free ends (e.g. circular); nucleic acid molecules that comprise DNA, RNA, or both; and other varieties of polynucleotides known in the art. One type of vector is a “plasmid,” which refers to a circular double stranded DNA loop into which additional DNA segments can be inserted, such as by standard molecular cloning techniques. Another type of vector is a viral vector, wherein virally-derived DNA or RNA sequences are present in the vector for packaging into a virus (e.g. retroviruses, replication defective retroviruses, adenoviruses, replication defective adenoviruses, and adeno-associated viruses). Viral vectors also include polynucleotides carried by a virus for transfection into a host cell. Certain vectors are capable of autonomous replication in a host cell into which they are introduced (e.g. bacterial vectors having a bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome. Moreover, certain vectors are capable of directing the expression of genes to which they are operatively-linked. Such vectors are referred to herein as “expression vectors.” Common expression vectors of utility in recombinant DNA techniques are often in the form of plasmids. Further discussion of vectors is provided herein.


Recombinant expression vectors can comprise a nucleic acid of the invention in a form suitable for expression of the nucleic acid in a host cell, which means that the recombinant expression vectors include one or more regulatory elements, which may be selected on the basis of the host cells to be used for expression, that are operatively linked to the nucleic acid sequence to be expressed. Within a recombinant expression vector, “operably linked” is intended to mean that the nucleotide sequence of interest is linked to the regulatory element(s) in a manner that allows for expression of the nucleotide sequence (e.g. in an in vitro transcription/translation system or in a host cell when the vector is introduced into the host cell). With regard to recombination and cloning methods, mention is made of U.S. patent application Ser. No. 10/815,730, published Sep. 2, 2004 as US 2004-0171156 A1, the contents of which are herein incorporated by reference in their entirety.


In some embodiments, a regulatory element is operably linked to one or more elements of a targetable nuclease system so as to drive expression of the one or more components of the targetable nuclease system.


In some embodiments, a vector comprises a regulatory element operably linked to a polynucleotide sequence encoding a nucleic acid-guided nuclease. The polynucleotide sequence encoding the nucleic acid-guided nuclease can be codon optimized for expression in particular cells, such as prokaryotic or eukaryotic cells. Eukaryotic cells can be yeast, fungi, algae, plant, animal, or human cells. Eukaryotic cells may be those of or derived from a particular organism, such as a mammal, including but not limited to human, mouse, rat, rabbit, dog, or non-human mammal including non-human primate.


In general, codon optimization refers to a process of modifying a nucleic acid sequence for enhanced expression in the host cells of interest by replacing at least one codon (e.g. about or more than about 1, 2, 3, 4, 5, 10, 15, 20, 25, 50, or more codons) of the native sequence with codons that are more frequently or most frequently used in the genes of that host cell while maintaining the native amino acid sequence. Various species exhibit particular bias for certain codons of a particular amino acid. Codon bias (differences in codon usage between organisms) often correlates with the efficiency of translation of messenger RNA (mRNA), which is in turn believed to be dependent on, among other things, the properties of the codons being translated and the availability of particular transfer RNA (tRNA) molecules. The predominance of selected tRNAs in a cell is generally a reflection of the codons used most frequently in peptide synthesis. Accordingly, genes can be tailored for optimal gene expression in a given organism based on codon optimization. Codon usage tables are readily available, for example, at the “Codon Usage Database” available at www.kazusa.or.jp/codon/ (visited Jul. 9, 2002), and these tables can be adapted in a number of ways. See Nakamura, Y., et al. “Codon usage tabulated from the international DNA sequence databases: status for the year 2000” Nucl. Acids Res. 28:292 (2000). Computer algorithms for codon optimizing a particular sequence for expression in a particular host cell, such as Gene Forge (Aptagen; Jacobus, Pa.), are also available. In some embodiments, one or more codons (e.g. 1, 2, 3, 4, 5, 10, 15, 20, 25, 50, or more, or all codons) in a sequence encoding an engineered nuclease correspond to the most frequently used codon for a particular amino acid.
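
The codon replacement described above can be sketched as follows. The example uses the standard genetic code and swaps each codon for a host-preferred synonymous codon while preserving the encoded amino acid sequence. The preferred-codon table shown is a hypothetical placeholder, not taken from any real codon usage table; in practice it would be derived from usage data for the host of interest. The function names are likewise illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_AA = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence (length divisible by 3) to one-letter amino acids."""
    cds = cds.upper()
    return "".join(CODON_TO_AA[cds[i:i + 3]] for i in range(0, len(cds), 3))


def codon_optimize(cds: str, preferred: dict) -> str:
    """Replace each codon with the preferred synonymous codon, where one is given.

    `preferred` maps a one-letter amino acid to the codon to use for it.
    Amino acids absent from the table keep their original codon.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    optimized = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        amino_acid = CODON_TO_AA[codon]
        new_codon = preferred.get(amino_acid, codon)
        if CODON_TO_AA[new_codon] != amino_acid:
            raise ValueError(f"{new_codon} does not encode {amino_acid}")
        optimized.append(new_codon)
    return "".join(optimized)


if __name__ == "__main__":
    # Hypothetical preference table and coding sequence, for illustration only.
    preferred = {"L": "CTG", "K": "AAG", "P": "CCC", "R": "CGC"}
    native = "ATGTTAAAACCAAGATAA"
    optimized = codon_optimize(native, preferred)
    print(optimized, translate(native) == translate(optimized))
```

The check that the translated protein is unchanged reflects the requirement that the native amino acid sequence be maintained.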


In some embodiments, a vector encodes a nucleic acid-guided nuclease comprising one or more nuclear localization sequences (NLSs), such as about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more NLSs. In some embodiments, the engineered nuclease comprises about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more NLSs at or near the amino-terminus, about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more NLSs at or near the carboxy-terminus, or a combination of these (e.g. one or more NLS at the amino-terminus and one or more NLS at the carboxy terminus). When more than one NLS is present, each may be selected independently of the others, such that a single NLS may be present in more than one copy and/or in combination with one or more other NLSs present in one or more copies. In a preferred embodiment of the invention, the engineered nuclease comprises at most 6 NLSs. In some embodiments, an NLS is considered near the N- or C-terminus when the nearest amino acid of the NLS is within about 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50, or more amino acids along the polypeptide chain from the N- or C-terminus. Non-limiting examples of NLSs include an NLS sequence derived from: the NLS of the SV40 virus large T-antigen, having the amino acid sequence PKKKRKV (SEQ ID NO: 111); the NLS from nucleoplasmin (e.g. the nucleoplasmin bipartite NLS with the sequence KRPAATKKAGQAKKKK (SEQ ID NO: 112)); the c-myc NLS having the amino acid sequence PAAKRVKLD (SEQ ID NO: 113) or RQRRNELKRSP (SEQ ID NO: 114); the hRNPA1 M9 NLS having the sequence NQSSNFGPMKGGNFGGRSSGPYGGGGQYFAKPRNQGGY (SEQ ID NO: 115); the sequence RMRIZFKNKGKDTAELRRRRVEVSVELRKAKKDEQILKRRNV (SEQ ID NO: 116) of the IBB domain from importin-alpha; the sequences VSRKRPRP (SEQ ID NO: 117) and PPKKARED (SEQ ID NO: 118) of the myoma T protein; the sequence PQPKKKPL (SEQ ID NO: 119) of human p53; the sequence SALIKKKKKMAP (SEQ ID NO: 120) of mouse c-abl IV; the sequences DRLRR (SEQ ID NO: 121) and PKQKKRK (SEQ ID NO: 122) of the influenza virus NS1; the sequence RKLKKKIKKL (SEQ ID NO: 123) of the Hepatitis virus delta antigen; the sequence REKKKFLKRR (SEQ ID NO: 124) of the mouse Mx1 protein; the sequence KRKGDEVDGVDEVAKKKSKK (SEQ ID NO: 125) of the human poly(ADP-ribose) polymerase; and the sequence RKCLQAGMNLEARKTKK (SEQ ID NO: 126) of the steroid hormone receptors (human) glucocorticoid.
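
The notion of an NLS being "near" a terminus can be illustrated with a small sketch that locates copies of an NLS in a polypeptide and reports their distance from each end. The SV40 NLS sequence PKKKRKV is taken from the text above; the example protein, the function name, and the exact way distance is counted (number of residues between the NLS and the terminus) are assumptions made for this illustration.

```python
SV40_NLS = "PKKKRKV"  # SV40 large T-antigen NLS, as recited in the text


def locate_nls(protein: str, nls: str = SV40_NLS, window: int = 10):
    """Find each copy of `nls` and flag whether it lies near either terminus.

    Distance is counted as the number of residues between the NLS and the
    terminus (an assumed convention); "near" means distance <= window.
    """
    protein = protein.upper()
    hits = []
    start = protein.find(nls)
    while start != -1:
        n_distance = start
        c_distance = len(protein) - (start + len(nls))
        hits.append({
            "start": start,
            "near_N": n_distance <= window,
            "near_C": c_distance <= window,
        })
        start = protein.find(nls, start + 1)
    return hits


if __name__ == "__main__":
    # Hypothetical polypeptide with one NLS near each terminus.
    protein = "M" + SV40_NLS + "A" * 30 + SV40_NLS + "GS"
    for hit in locate_nls(protein):
        print(hit)
```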


In general, the one or more NLSs are of sufficient strength to drive accumulation of the nucleic acid-guided nuclease in a detectable amount in the nucleus of a eukaryotic cell. In general, strength of nuclear localization activity may derive from the number of NLSs, the particular NLS(s) used, or a combination of these factors. Detection of accumulation in the nucleus may be performed by any suitable technique. For example, a detectable marker may be fused to the nucleic acid-guided nuclease, such that location within a cell may be visualized, such as in combination with a means for detecting the location of the nucleus (e.g. a stain specific for the nucleus such as DAPI). Cell nuclei may also be isolated from cells, the contents of which may then be analyzed by any suitable process for detecting protein, such as immunohistochemistry, Western blot, or enzyme activity assay. Accumulation in the nucleus may also be determined indirectly, such as by an assay for the effect of the nucleic acid-guided nuclease complex formation (e.g. assay for DNA cleavage or mutation at the target sequence, or assay for altered gene expression activity affected by targetable nuclease complex formation and/or nucleic acid-guided nuclease activity), as compared to a control not exposed to the nucleic acid-guided nuclease or targetable nuclease complex, or exposed to a nucleic acid-guided nuclease lacking the one or more NLSs.


A nucleic acid-guided nuclease and one or more guide nucleic acids can be delivered either as DNA or RNA. Delivery of a nucleic acid-guided nuclease and guide nucleic acid both as RNA (unmodified or containing base or backbone modifications) molecules can be used to reduce the amount of time that the nucleic acid-guided nuclease persists in the cell. This may reduce the level of off-target cleavage activity in the target cell. Since a nucleic acid-guided nuclease delivered as mRNA takes time to be translated into protein, it might be advantageous to deliver the guide nucleic acid several hours following the delivery of the nucleic acid-guided nuclease mRNA, to maximize the level of guide nucleic acid available for interaction with the nucleic acid-guided nuclease protein. In other cases, the nucleic acid-guided nuclease mRNA and guide nucleic acid are delivered concomitantly. In other examples, the guide nucleic acid is delivered sequentially, such as 0.5, 1, 2, 3, 4, or more hours after the nucleic acid-guided nuclease mRNA.


In situations where guide nucleic acid amount is limiting, it may be desirable to introduce a nucleic acid-guided nuclease as mRNA and guide nucleic acid in the form of a DNA expression cassette with a promoter driving the expression of the guide nucleic acid. This way the amount of guide nucleic acid available will be amplified via transcription.


Guide nucleic acid in the form of RNA or encoded on a DNA expression cassette can be introduced into a host cell comprising a nucleic acid-guided nuclease encoded on a vector or chromosome. The guide nucleic acid may be provided in the cassette as one or more polynucleotides, which may be contiguous or non-contiguous in the cassette. In specific embodiments, the guide nucleic acid is provided in the cassette as a single contiguous polynucleotide.


A variety of delivery systems can be used to introduce a nucleic acid-guided nuclease (DNA or RNA) and guide nucleic acid (DNA or RNA) into a host cell. These include the use of yeast systems, lipofection systems, microinjection systems, biolistic systems, virosomes, liposomes, immunoliposomes, polycations, lipid:nucleic acid conjugates, virions, artificial virions, viral vectors, electroporation, cell permeable peptides, nanoparticles, nanowires (Shalek et al., Nano Letters, 2012), and exosomes. Molecular Trojan horse liposomes (Pardridge et al., Cold Spring Harb Protoc; 2010; doi:10.1101/pdb.prot5407) may be used to deliver an engineered nuclease and guide nucleic acid across the blood-brain barrier.


In some embodiments, an editing template is also provided. An editing template may be a component of a vector as described herein, contained in a separate vector, or provided as a separate polynucleotide, such as an oligonucleotide, linear polynucleotide, or synthetic polynucleotide. In some cases, an editing template is on the same polynucleotide as a guide nucleic acid. In some embodiments, an editing template is designed to serve as a template in homologous recombination, such as within or near a target sequence nicked or cleaved by a nucleic acid-guided nuclease as a part of a complex as disclosed herein. An editing template polynucleotide may be of any suitable length, such as about or more than about 10, 15, 20, 25, 50, 75, 100, 150, 200, 500, 1000, or more nucleotides in length. In some embodiments, the editing template polynucleotide is complementary to a portion of a polynucleotide comprising the target sequence. When optimally aligned, an editing template polynucleotide might overlap with one or more nucleotides of a target sequence (e.g. about or more than about 1, 5, 10, 15, 20, 25, 30, 35, 40, or more nucleotides). In some embodiments, when an editing template sequence and a polynucleotide comprising a target sequence are optimally aligned, the nearest nucleotide of the template polynucleotide is within about 1, 5, 10, 15, 20, 25, 50, 75, 100, 200, 300, 400, 500, 1000, 5000, 10000, or more nucleotides from the target sequence.


In many examples, an editing template comprises at least one mutation compared to the target sequence. An editing template can comprise an insertion, deletion, modification, or any combination thereof compared to the target sequence. Examples of some editing templates are described in more detail in a later section.
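
One way to picture an editing template carrying an insertion, deletion, or substitution relative to the target is sketched below. The approach of flanking the edit with unchanged sequence on either side (referred to here as homology arms), the function name, and all parameters are assumptions for illustration; the text above does not prescribe this particular construction.

```python
def build_editing_template(reference: str, edit_pos: int, new_bases: str,
                           replaced_len: int, arm_len: int) -> str:
    """Assemble left arm + edited bases + right arm from a reference sequence.

    edit_pos      0-based position in `reference` where the edit begins
    new_bases     bases to place at the edit site ("" for a pure deletion)
    replaced_len  number of reference bases removed (0 for a pure insertion)
    arm_len       length of unchanged flanking sequence on each side
    """
    reference = reference.upper()
    right_start = edit_pos + replaced_len
    if edit_pos - arm_len < 0 or right_start + arm_len > len(reference):
        raise ValueError("flanking arms extend beyond the reference sequence")
    left_arm = reference[edit_pos - arm_len:edit_pos]
    right_arm = reference[right_start:right_start + arm_len]
    return left_arm + new_bases.upper() + right_arm


if __name__ == "__main__":
    ref = "AAAACCCCGGGGTTTT"  # hypothetical reference containing the target
    print(build_editing_template(ref, 8, "T", 1, 4))   # substitution
    print(build_editing_template(ref, 8, "", 2, 4))    # deletion
    print(build_editing_template(ref, 8, "AT", 0, 4))  # insertion
```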


In some aspects, the invention provides methods comprising delivering one or more polynucleotides, such as one or more vectors or linear polynucleotides as described herein, one or more transcripts thereof, and/or one or more proteins transcribed therefrom, to a host cell. In some aspects, the invention further provides cells produced by such methods, and organisms comprising or produced from such cells. In some embodiments, an engineered nuclease in combination with (and optionally complexed with) a guide nucleic acid is delivered to a cell.


Conventional viral and non-viral based gene transfer methods can be used to introduce nucleic acids in cells, such as prokaryotic cells, eukaryotic cells, mammalian cells, or target tissues. Such methods can be used to administer nucleic acids encoding components of an engineered nucleic acid-guided nuclease system to cells in culture, or in a host organism. Non-viral vector delivery systems include DNA plasmids, RNA (e.g. a transcript of a vector described herein), naked nucleic acid, and nucleic acid complexed with a delivery vehicle, such as a liposome. Viral vector delivery systems include DNA and RNA viruses, which have either episomal or integrated genomes after delivery to the cell. For a review of gene therapy procedures, see Anderson, Science 256:808-813 (1992); Nabel & Felgner, TIBTECH 11:211-217 (1993); Mitani & Caskey, TIBTECH 11:162-166 (1993); Dillon, TIBTECH 11:167-175 (1993); Miller, Nature 357:455-460 (1992); Van Brunt, Biotechnology 6(10):1149-1154 (1988); Vigne, Restorative Neurology and Neuroscience 8:35-36 (1995); Kremer & Perricaudet, British Medical Bulletin 51(1):31-44 (1995); Haddada et al., in Current Topics in Microbiology and Immunology Doerfler and Bohm (eds) (1995); and Yu et al., Gene Therapy 1:13-26 (1994).


Methods of non-viral delivery of nucleic acids include lipofection, microinjection, biolistics, virosomes, liposomes, immunoliposomes, polycation or lipid:nucleic acid conjugates, naked DNA, artificial virions, and agent-enhanced uptake of DNA. Lipofection is described in, e.g., U.S. Pat. Nos. 5,049,386, 4,946,787, and 4,897,355, and lipofection reagents are sold commercially (e.g., Transfectam™ and Lipofectin™). Cationic and neutral lipids that are suitable for efficient receptor-recognition lipofection of polynucleotides include those of Felgner, WO 91/17424; WO 91/16024. Delivery can be to cells (e.g. in vitro or ex vivo administration) or target tissues (e.g. in vivo administration).


The preparation of lipid:nucleic acid complexes, including targeted liposomes such as immunolipid complexes, is well known to one of skill in the art (see, e.g., Crystal, Science 270:404-410 (1995); Blaese et al., Cancer Gene Ther. 2:291-297 (1995); Behr et al., Bioconjugate Chem. 5:382-389 (1994); Remy et al., Bioconjugate Chem. 5:647-654 (1994); Gao et al., Gene Therapy 2:710-722 (1995); Ahmad et al., Cancer Res. 52:4817-4820 (1992); U.S. Pat. Nos. 4,186,183, 4,217,344, 4,235,871, 4,261,975, 4,485,054, 4,501,728, 4,774,085, 4,837,028, and 4,946,787).


The use of RNA or DNA viral based systems for the delivery of nucleic acids takes advantage of highly evolved processes for targeting a virus to specific cells in culture or in the host and trafficking the viral payload to the nucleus or host cell genome. Viral vectors can be administered directly to cells in culture, patients (in vivo), or they can be used to treat cells in vitro, and the modified cells may optionally be administered to patients (ex vivo). Conventional viral based systems could include retroviral, lentivirus, adenoviral, adeno-associated and herpes simplex virus vectors for gene transfer. Integration in the host genome is possible with the retrovirus, lentivirus, and adeno-associated virus gene transfer methods, often resulting in long term expression of the inserted transgene. Additionally, high transduction efficiencies have been observed in many different cell types and target tissues.


The tropism of a retrovirus can be altered by incorporating foreign envelope proteins, expanding the potential target population of target cells. Lentiviral vectors are retroviral vectors that are able to transduce or infect non-dividing cells and typically produce high viral titers. Selection of a retroviral gene transfer system would therefore depend on the target tissue. Retroviral vectors are comprised of cis-acting long terminal repeats (LTRs) with packaging capacity for up to 6-10 kb of foreign sequence. The minimum cis-acting LTRs are sufficient for replication and packaging of the vectors, which are then used to integrate the therapeutic gene into the target cell to provide permanent transgene expression. Widely used retroviral vectors include those based upon murine leukemia virus (MuLV), gibbon ape leukemia virus (GaLV), simian immunodeficiency virus (SIV), human immunodeficiency virus (HIV), and combinations thereof (see, e.g., Buchscher et al., J. Virol. 66:2731-2739 (1992); Johann et al., J. Virol. 66:1635-1640 (1992); Sommerfelt et al., Virol. 176:58-59 (1990); Wilson et al., J. Virol. 63:2374-2378 (1989); Miller et al., J. Virol. 65:2220-2224 (1991); PCT/US94/05700).


In applications where transient expression is preferred, adenoviral based systems may be used. Adenoviral based vectors are capable of very high transduction efficiency in many cell types and do not require cell division. With such vectors, high titer and levels of expression have been obtained. This vector can be produced in large quantities in a relatively simple system.


Adeno-associated virus (“AAV”) vectors may also be used to transduce cells with target nucleic acids, e.g., in the in vitro production of nucleic acids and peptides, and for in vivo and ex vivo gene therapy procedures (see, e.g., West et al., Virology 160:38-47 (1987); U.S. Pat. No. 4,797,368; WO 93/24641; Kotin, Human Gene Therapy 5:793-801 (1994); Muzyczka, J. Clin. Invest. 94:1351 (1994)). Construction of recombinant AAV vectors is described in a number of publications, including U.S. Pat. No. 5,173,414; Tratschin et al., Mol. Cell. Biol. 5:3251-3260 (1985); Tratschin, et al., Mol. Cell. Biol. 4:2072-2081 (1984); Hermonat & Muzyczka, PNAS 81:6466-6470 (1984); and Samulski et al., J. Virol. 63:3822-3828 (1989).


In some embodiments, a host cell is transiently or non-transiently transfected with one or more vectors, linear polynucleotides, polypeptides, nucleic acid-protein complexes, or any combination thereof as described herein. In some embodiments, a cell is transfected in vitro, in culture, or ex vivo. In some embodiments, a cell is transfected as it naturally occurs in a subject. In some embodiments, a cell that is transfected is taken from a subject. In some embodiments, the cell is derived from cells taken from a subject, such as a cell line.


In some embodiments, a cell transfected with one or more vectors, linear polynucleotides, polypeptides, nucleic acid-protein complexes, or any combination thereof as described herein is used to establish a new cell line comprising one or more transfection-derived sequences. In some embodiments, a cell transiently transfected with the components of an engineered nucleic acid-guided nuclease system as described herein (such as by transient transfection of one or more vectors, or transfection with RNA), and modified through the activity of an engineered nuclease complex, is used to establish a new cell line comprising cells containing the modification but lacking any other exogenous sequence.


In some embodiments, one or more vectors described herein are used to produce a non-human transgenic cell, organism, animal, or plant. In some embodiments, the transgenic animal is a mammal, such as a mouse, rat, or rabbit. Methods for producing transgenic cells, organisms, plants, and animals are known in the art, and generally begin with a method of cell transformation or transfection, such as described herein.


Methods of Use

In the context of formation of an engineered nuclease complex, “target sequence” refers to a sequence to which a guide sequence is designed to have complementarity, where hybridization between a target sequence and a guide sequence promotes the formation of an engineered nuclease complex. A target sequence may comprise any polynucleotide, such as DNA, RNA, or a DNA-RNA hybrid. A target sequence can be located in the nucleus or cytoplasm of a cell. A target sequence can be located in vitro or in a cell-free environment.


Typically, formation of an engineered nuclease complex comprising a guide nucleic acid hybridized to a target sequence and complexed with one or more engineered nucleases as disclosed herein results in cleavage of one or both strands in or near (e.g. within 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 50, or more base pairs from) the target sequence. Cleavage can occur within a target sequence, 5′ of the target sequence, upstream of a target sequence, 3′ of the target sequence, or downstream of a target sequence.
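
The proximity language above can be illustrated with a minimal Python sketch that reports how far a cleavage position lies from a target sequence. The coordinate convention (a 0-based, half-open target interval, with a cut at position p taken to lie between bases p-1 and p) and the function names are assumptions made for illustration only and are not part of the disclosure.

```python
def cut_distance_from_target(cut_pos: int, target_start: int, target_end: int) -> int:
    """Distance in bp from a cut to the target interval [target_start, target_end).

    Hypothetical convention: a cut at position p lies between bases p-1 and p.
    Returns 0 when the cut falls within, or at a boundary of, the target sequence.
    """
    if target_start > target_end:
        raise ValueError("target_start must not exceed target_end")
    if cut_pos < target_start:
        return target_start - cut_pos  # cut lies 5' (upstream) of the target
    if cut_pos > target_end:
        return cut_pos - target_end    # cut lies 3' (downstream) of the target
    return 0


def is_cut_near_target(cut_pos: int, target_start: int, target_end: int,
                       max_distance: int) -> bool:
    """True when the cut is within max_distance bp of the target sequence."""
    return cut_distance_from_target(cut_pos, target_start, target_end) <= max_distance
```

For example, under these assumptions a cut at position 95 relative to a target spanning positions 100 to 120 is 5 bp upstream of the target and would satisfy a "within 10 base pairs" criterion.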


In some embodiments, one or more vectors driving expression of one or more components of a targetable nuclease system are introduced into a host cell or in vitro such that a targetable nuclease complex forms at one or more target sites. For example, a nucleic acid-guided nuclease and a guide nucleic acid could each be operably linked to separate regulatory elements on separate vectors. Alternatively, two or more of the elements, expressed from the same or different regulatory elements, may be combined in a single vector, with one or more additional vectors providing any components of the targetable nuclease system not included in the first vector. Targetable nuclease system elements that are combined in a single vector may be arranged in any suitable orientation, such as one element located 5′ with respect to (“upstream” of) or 3′ with respect to (“downstream” of) a second element. The coding sequence of one element may be located on the same or opposite strand of the coding sequence of a second element, and oriented in the same or opposite direction. In some embodiments, a single promoter drives expression of a transcript encoding a nucleic acid-guided nuclease and one or more guide nucleic acids. In some embodiments, a nucleic acid-guided nuclease and one or more guide nucleic acids are operably linked to and expressed from the same promoter. In other embodiments, one or more guide nucleic acids or polynucleotides encoding the one or more guide nucleic acids are introduced into a cell or in vitro environment already comprising a nucleic acid-guided nuclease or polynucleotide sequence encoding the nucleic acid-guided nuclease.


When multiple different guide sequences are used, a single expression construct may be used to target nuclease activity to multiple different, corresponding target sequences within a cell or in vitro. For example, a single vector may comprise about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, or more guide sequences. In some embodiments, about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more such guide-sequence-containing vectors may be provided, and optionally delivered to a cell or in vitro.


Methods and compositions disclosed herein may comprise more than one guide nucleic acid, wherein each guide nucleic acid has a different guide sequence, thereby targeting a different target sequence. In such cases, multiple guide nucleic acids can be used in multiplexing, wherein multiple targets are targeted simultaneously. Additionally or alternatively, the multiple guide nucleic acids are introduced into a population of cells, such that each cell in a population receives a different or random guide nucleic acid, thereby targeting multiple different target sequences across a population of cells. In such cases, the collection of subsequently altered cells can be referred to as a library.


Methods and compositions disclosed herein may comprise multiple different nucleic acid-guided nucleases, each with one or more different corresponding guide nucleic acids, thereby allowing targeting of different target sequences by different nucleic acid-guided nucleases. In some such cases, each nucleic acid-guided nuclease can correspond to a distinct plurality of guide nucleic acids, allowing two or more non-overlapping, partially overlapping, or completely overlapping multiplexing events.


In some embodiments, the nucleic acid-guided nuclease has DNA cleavage activity or RNA cleavage activity. In some embodiments, the nucleic acid-guided nuclease directs cleavage of one or both strands at the location of a target sequence, such as within the target sequence and/or within the complement of the target sequence. In some embodiments, the nucleic acid-guided nuclease directs cleavage of one or both strands within about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 50, 100, 200, 500, or more base pairs from the first or last nucleotide of a target sequence.


In some embodiments, a nucleic acid-guided nuclease may form a component of an inducible system. The inducible nature of the system would allow for spatiotemporal control of gene editing or gene expression using a form of energy. The form of energy may include but is not limited to electromagnetic radiation, sound energy, chemical energy, light energy, temperature, and thermal energy. Examples of inducible systems include tetracycline inducible promoters (Tet-On or Tet-Off), small molecule two-hybrid transcription activation systems (FKBP, ABA, etc.), or light inducible systems (Phytochrome, LOV domains, or cryptochrome). In one embodiment, the nucleic acid-guided nuclease may be a part of a Light Inducible Transcriptional Effector (LITE) to direct changes in transcriptional activity in a sequence-specific manner. The components of a light inducible system may include a nucleic acid-guided nuclease, a light-responsive cytochrome heterodimer (e.g. from Arabidopsis thaliana), and a transcriptional activation/repression domain. Further examples of inducible DNA binding proteins and methods for their use are provided in U.S. 61/736,465 and U.S. 61/721,283, each of which is hereby incorporated by reference in its entirety. An inducible system can be temperature inducible such that the system is turned on or off by increasing or decreasing the temperature. In some temperature inducible systems, increasing the temperature turns the system on. In some temperature inducible systems, increasing the temperature turns the system off.


In some aspects, the invention provides for methods of modifying a target sequence in vitro, or in a prokaryotic or eukaryotic cell, which may be in vivo, ex vivo, or in vitro. In some embodiments, the method comprises sampling a cell or population of cells such as prokaryotic cells, or those from a human or non-human animal or plant (including micro-algae), and modifying the cell or cells. Culturing may occur at any stage in vitro or ex vivo. The cell or cells may even be re-introduced into the host, such as a non-human animal or plant (including micro-algae). For re-introduced cells it is particularly preferred that the cells are stem cells.


In some embodiments, the method comprises allowing a targetable nuclease complex to bind to the target sequence to effect cleavage of said target sequence, thereby modifying the target sequence, wherein the targetable nuclease complex comprises a nucleic acid-guided nuclease complexed with a guide nucleic acid wherein the guide sequence of the guide nucleic acid is hybridized to a target sequence within a target polynucleotide.


In some aspects, the invention provides a method of modifying expression of a target polynucleotide in vitro or in a prokaryotic or eukaryotic cell. In some embodiments, the method comprises allowing a targetable nuclease complex to bind to a target sequence within the target polynucleotide such that said binding results in increased or decreased expression of said target polynucleotide; wherein the targetable nuclease complex comprises a nucleic acid-guided nuclease complexed with a guide nucleic acid, and wherein the guide sequence of the guide nucleic acid is hybridized to a target sequence within said target polynucleotide. Similar considerations apply as above for methods of modifying a target polynucleotide. In fact, these sampling, culturing and re-introduction options apply across the aspects of the present invention.


In some aspects, the invention provides kits containing any one or more of the elements disclosed in the above methods and compositions. Elements may be provided individually or in combinations, and may be provided in any suitable container, such as a vial, a bottle, or a tube. In some embodiments, the kit includes instructions in one or more languages.


In some embodiments, a kit comprises one or more reagents for use in a process utilizing one or more of the elements described herein. Reagents may be provided in any suitable container. For example, a kit may provide one or more reaction or storage buffers. Reagents may be provided in a form that is usable in a particular assay, or in a form that requires addition of one or more other components before use (e.g. in concentrate or lyophilized form). A buffer can be any buffer, including but not limited to a sodium carbonate buffer, a sodium bicarbonate buffer, a borate buffer, a Tris buffer, a MOPS buffer, a HEPES buffer, and combinations thereof. In some embodiments, the buffer is alkaline. In some embodiments, the buffer has a pH from about 7 to about 10. In some embodiments, the kit comprises one or more oligonucleotides corresponding to a guide sequence for insertion into a vector so as to operably link the guide sequence and a regulatory element. In some embodiments, the kit comprises an editing template.


In some aspects, the invention provides methods for using one or more elements of an engineered targetable nuclease system. A targetable nuclease complex of the disclosure provides an effective means for modifying a target sequence within a target polynucleotide. A targetable nuclease complex of the disclosure has a wide variety of utility including modifying (e.g., deleting, inserting, translocating, inactivating, activating) a target sequence in a multiplicity of cell types. As such, a targetable nuclease complex of the invention has a broad spectrum of applications in, e.g., biochemical pathway optimization, genome-wide studies, genome engineering, gene therapy, drug screening, disease diagnosis, and prognosis. An exemplary targetable nuclease complex comprises a nucleic acid-guided nuclease as disclosed herein complexed with a guide nucleic acid, wherein the guide sequence of the guide nucleic acid can hybridize to a target sequence within the target polynucleotide. A guide nucleic acid can comprise a guide sequence linked to a scaffold sequence. A scaffold sequence can comprise one or more sequence regions with a degree of complementarity such that together they form a secondary structure. In some cases, the one or more sequence regions are comprised or encoded on the same polynucleotide. In some cases, the one or more sequence regions are comprised or encoded on separate polynucleotides.


Provided herein are methods of cleaving a target polynucleotide. The method comprises cleaving a target polynucleotide using a targetable nuclease complex that binds to a target sequence within a target polynucleotide and effects cleavage of said target polynucleotide. Typically, the targetable nuclease complex of the invention, when introduced into a cell, creates a break (e.g., a single or a double strand break) in the target sequence. For example, the method can be used to cleave a target gene in a cell, or to replace a wildtype sequence with a modified sequence.


The break created by the targetable nuclease complex can be repaired by a repair process such as the error prone non-homologous end joining (NHEJ) pathway, the high fidelity homology-directed repair (HDR), or by recombination pathways. During these repair processes, an editing template can be introduced into the genome sequence. In some methods, the HDR or recombination process is used to modify a target sequence. For example, an editing template comprising a sequence to be integrated flanked by an upstream sequence and a downstream sequence is introduced into a cell. The upstream and downstream sequences share sequence similarity with either side of the site of integration in the chromosome, target vector, or target polynucleotide.


An editing template can be DNA or RNA, e.g., a DNA plasmid, a bacterial artificial chromosome (BAC), a yeast artificial chromosome (YAC), a viral vector, a linear piece of DNA, a PCR fragment, oligonucleotide, synthetic polynucleotide, a naked nucleic acid, or a nucleic acid complexed with a delivery vehicle such as a liposome or poloxamer.


An editing template polynucleotide can comprise a sequence to be integrated (e.g., a mutated gene). A sequence for integration may be a sequence endogenous or exogenous to the cell. Examples of a sequence to be integrated include polynucleotides encoding a protein or a non-coding RNA (e.g., a microRNA). Thus, the sequence for integration may be operably linked to an appropriate control sequence or sequences. Alternatively, the sequence to be integrated may provide a regulatory function. The sequence to be integrated may be a mutated or variant form of an endogenous wildtype sequence. Alternatively, the sequence to be integrated may be a wildtype version of an endogenous mutated sequence. Additionally or alternatively, the sequence to be integrated may be a variant or mutated form of an endogenous mutated or variant sequence.


Upstream and downstream sequences in an editing template polynucleotide can be selected to promote recombination between the target polynucleotide of interest and the editing template polynucleotide. The upstream sequence can be a nucleic acid sequence having sequence similarity with the sequence upstream of the targeted site for integration. Similarly, the downstream sequence can be a nucleic acid sequence having similarity with the sequence downstream of the targeted site of integration. The upstream and downstream sequences in an editing template can have 75%, 80%, 85%, 90%, 95%, or 100% sequence identity with the targeted polynucleotide. Preferably, the upstream and downstream sequences in the editing template polynucleotide have about 95%, 96%, 97%, 98%, 99%, or 100% sequence identity with the targeted polynucleotide. In some methods, the upstream and downstream sequences in the editing template polynucleotide have about 99% or 100% sequence identity with the targeted polynucleotide.
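
As a simple illustration of the sequence identity percentages recited above, the following Python sketch compares a homology arm of an editing template with the corresponding flanking sequence of the targeted polynucleotide. It assumes the two sequences are already aligned, ungapped, and of equal length; a practical comparison would ordinarily use a sequence alignment tool. The function name is hypothetical.

```python
def percent_identity(arm: str, flank: str) -> float:
    """Percent identity between two pre-aligned, ungapped, equal-length sequences.

    Assumption for illustration: no alignment is performed here, so any
    insertion or deletion between the sequences must be resolved beforehand.
    """
    if not arm:
        raise ValueError("sequences must be non-empty")
    if len(arm) != len(flank):
        raise ValueError("sequences must have equal length (pre-aligned, ungapped)")
    # Count positions at which the two sequences carry the same base.
    matches = sum(a == b for a, b in zip(arm.upper(), flank.upper()))
    return 100.0 * matches / len(arm)
```

Under these assumptions, an arm that differs from its flank at one position in ten has 90% identity, which falls within the 75% to 100% range recited above but outside the preferred range of about 95% to 100%.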


An upstream or downstream sequence may comprise from about 20 bp to about 2500 bp, for example, about 50, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700, 1800, 1900, 2000, 2100, 2200, 2300, 2400, or 2500 bp. In some methods, the exemplary upstream or downstream sequence has about 15 bp to about 50 bp, about 30 bp to about 100 bp, about 200 bp to about 2000 bp, about 600 bp to about 1000 bp, or more particularly about 700 bp to about 1000 bp.
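
The arrangement described above, in which a sequence to be integrated is flanked by upstream and downstream sequences, can be sketched in Python as follows. This is a minimal illustration only: the function name, the 0-based integration site coordinate, and the choice of equal-length arms copied directly from a reference sequence (and therefore 100% identical to it) are assumptions, not requirements of the disclosure.

```python
def build_editing_template(reference: str, site: int, insert: str,
                           arm_length: int) -> str:
    """Assemble upstream arm + sequence to be integrated + downstream arm.

    `site` is a hypothetical 0-based coordinate: the insert is placed between
    reference[site - 1] and reference[site]. Both arms are copied from the
    reference and have the same length (an illustrative simplification).
    """
    if arm_length <= 0:
        raise ValueError("arm_length must be positive")
    if site - arm_length < 0 or site + arm_length > len(reference):
        raise ValueError("reference is too short for the requested arm length")
    upstream = reference[site - arm_length:site]      # shares identity 5' of the site
    downstream = reference[site:site + arm_length]    # shares identity 3' of the site
    return upstream + insert + downstream
```

The arm lengths recited above (for example, about 20 bp to about 2500 bp) would be passed as `arm_length`; far shorter arms are convenient only for checking the logic by hand.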


In some methods, the editing template polynucleotide may further comprise a marker. Such a marker may make it easy to screen for targeted integrations. Examples of suitable markers include restriction sites, fluorescent proteins, or selectable markers. The exogenous polynucleotide template of the invention can be constructed using recombinant techniques (see, for example, Green and Sambrook et al., 2014 and Ausubel et al., 2017).


In an exemplary method for modifying a target polynucleotide by integrating an editing template polynucleotide, a double stranded break is introduced into the genome sequence by an engineered nuclease complex, and the break can be repaired via homologous recombination using an editing template such that the template is integrated into the target polynucleotide. The presence of a double-stranded break can increase the efficiency of integration of the editing template.


Disclosed herein are methods for modifying expression of a polynucleotide in a cell. Some methods comprise increasing or decreasing expression of a target polynucleotide by using a targetable nuclease complex that binds to the target polynucleotide.


In some methods, a target polynucleotide can be inactivated to effect the modification of the expression in a cell. For example, upon the binding of a targetable nuclease complex to a target sequence in a cell, the target polynucleotide is inactivated such that the sequence is not transcribed, the coded protein is not produced, or the sequence does not function as the wild-type sequence does. For example, a protein or microRNA coding sequence may be inactivated such that the protein is not produced.


In some methods, a control sequence can be inactivated such that it no longer functions as a regulatory sequence. As used herein, “regulatory sequence” can refer to any nucleic acid sequence that affects the transcription, translation, or accessibility of a nucleic acid sequence. Examples of regulatory sequences include a promoter, a transcription terminator, and an enhancer.


An inactivated target sequence may include a deletion mutation (i.e., deletion of one or more nucleotides), an insertion mutation (i.e., insertion of one or more nucleotides), or a nonsense mutation (i.e., substitution of a single nucleotide for another nucleotide such that a stop codon is introduced). In some methods, the inactivation of a target sequence results in “knockout” of the target sequence.
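
The three mutation types named above can be distinguished with a minimal Python sketch that compares a wild-type coding sequence with an edited one. The sketch assumes DNA coding-strand sequences that begin in frame at the first base and the standard stop codons (TAA, TAG, TGA); it uses a length difference as a crude proxy for a deletion or insertion and does not detect combined events. The function name and return labels are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code, DNA coding strand


def classify_inactivating_mutation(wild_type: str, edited: str) -> str:
    """Label an edit as 'deletion', 'insertion', 'nonsense', or another outcome.

    Assumes both sequences start in frame at index 0 (illustrative simplification).
    """
    wt, ed = wild_type.upper(), edited.upper()
    if len(ed) < len(wt):
        return "deletion"
    if len(ed) > len(wt):
        return "insertion"
    # Equal length: locate substituted positions.
    diffs = [i for i, (a, b) in enumerate(zip(wt, ed)) if a != b]
    if not diffs:
        return "no change"
    if len(diffs) == 1:
        codon_start = diffs[0] - diffs[0] % 3
        old_codon = wt[codon_start:codon_start + 3]
        new_codon = ed[codon_start:codon_start + 3]
        # A nonsense mutation is a single substitution that creates a stop codon.
        if new_codon in STOP_CODONS and old_codon not in STOP_CODONS:
            return "nonsense"
    return "other substitution"
```

For instance, changing the codon TGG to TGA by a single G-to-A substitution introduces a stop codon and is labeled a nonsense mutation under these assumptions.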


An altered expression of one or more target polynucleotides associated with a signaling biochemical pathway can be determined by assaying for a difference in the mRNA levels of the corresponding genes between the test model cell and a control cell, when they are contacted with a candidate agent. Alternatively, the differential expression of the sequences associated with a signaling biochemical pathway is determined by detecting a difference in the level of the encoded polypeptide or gene product.


To assay for an agent-induced alteration in the level of mRNA transcripts or corresponding polynucleotides, nucleic acid contained in a sample is first extracted according to standard methods in the art. For instance, mRNA can be isolated using various lytic enzymes or chemical solutions according to the procedures set forth in Green and Sambrook (2014), or extracted by nucleic-acid-binding resins following the accompanying instructions provided by the manufacturers. The mRNA contained in the extracted nucleic acid sample is then detected by amplification procedures or conventional hybridization assays (e.g. Northern blot analysis) according to methods widely known in the art or based on the methods exemplified herein.


For purposes of this invention, amplification means any method employing a primer and a polymerase capable of replicating a target sequence with reasonable fidelity. Amplification may be carried out by natural or recombinant DNA polymerases such as TaqGold™, T7 DNA polymerase, Klenow fragment of E. coli DNA polymerase, and reverse transcriptase. A preferred amplification method is PCR. In particular, the isolated RNA can be subjected to a reverse transcription assay that is coupled with a quantitative polymerase chain reaction (RT-PCR) in order to quantify the expression level of a sequence associated with a signaling biochemical pathway.


Detection of the gene expression level can be conducted in real time in an amplification assay. In one aspect, the amplified products can be directly visualized with fluorescent DNA-binding agents including but not limited to DNA intercalators and DNA groove binders. Because the amount of the intercalators incorporated into the double-stranded DNA molecules is typically proportional to the amount of the amplified DNA products, one can conveniently determine the amount of the amplified products by quantifying the fluorescence of the intercalated dye using conventional optical systems in the art. DNA-binding dyes suitable for this application include SYBR green, SYBR blue, DAPI, propidium iodide, Hoechst, SYBR gold, ethidium bromide, acridines, proflavine, acridine orange, acriflavine, fluorcoumanin, ellipticine, daunomycin, chloroquine, distamycin D, chromomycin, homidium, mithramycin, ruthenium polypyridyls, anthramycin, and the like.
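
One commonly used way to turn real-time amplification data into a relative expression level between a test cell and a control cell is the comparative threshold-cycle calculation, sketched below in Python. This calculation is not described in the present disclosure and is offered only as an illustration; it assumes threshold-cycle (Ct) values are available for the target and for a reference transcript in both samples, and that amplification efficiency is close to 100% (a doubling per cycle). The function and parameter names are hypothetical.

```python
def relative_expression(ct_target_test: float, ct_ref_test: float,
                        ct_target_control: float, ct_ref_control: float) -> float:
    """Fold change of a target transcript in a test sample versus a control sample.

    Uses the comparative Ct calculation, 2 ** -(delta-delta Ct), which assumes
    roughly 100% amplification efficiency for both target and reference.
    """
    # Normalize the target to the reference transcript within each sample.
    delta_ct_test = ct_target_test - ct_ref_test
    delta_ct_control = ct_target_control - ct_ref_control
    # Compare the normalized values between test and control.
    delta_delta_ct = delta_ct_test - delta_ct_control
    return 2.0 ** (-delta_delta_ct)
```

Under these assumptions, a target that crosses threshold two cycles later in the test sample than in the control (with an unchanged reference) corresponds to a fold change of 0.25, i.e. decreased expression.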


In another aspect, other fluorescent labels such as sequence specific probes can be employed in the amplification reaction to facilitate the detection and quantification of the amplified products. Probe-based quantitative amplification relies on the sequence-specific detection of a desired amplified product. It utilizes fluorescent, target-specific probes (e.g., TaqMan™ probes) resulting in increased specificity and sensitivity. Methods for performing probe-based quantitative amplification are well established in the art and are taught in U.S. Pat. No. 5,210,015.


In yet another aspect, conventional hybridization assays using hybridization probes that share sequence homology with sequences associated with a signaling biochemical pathway can be performed. Typically, probes are allowed to form stable complexes with the sequences associated with a signaling biochemical pathway contained within the biological sample derived from the test subject in a hybridization reaction. It will be appreciated by one of skill in the art that where antisense is used as the probe nucleic acid, the target polynucleotides provided in the sample are chosen to be complementary to sequences of the antisense nucleic acids. Conversely, where the nucleotide probe is a sense nucleic acid, the target polynucleotide is selected to be complementary to sequences of the sense nucleic acid.


Hybridization can be performed under conditions of various stringency, for instance as described herein. Suitable hybridization conditions for the practice of the present invention are such that the recognition interaction between the probe and sequences associated with a signaling biochemical pathway is both sufficiently specific and sufficiently stable. Conditions that increase the stringency of a hybridization reaction are widely known and published in the art. See, for example, Green and Sambrook (2014); Nonradioactive in Situ Hybridization Application Manual, Boehringer Mannheim, second edition. The hybridization assay can be performed using probes immobilized on any solid support, including but not limited to nitrocellulose, glass, silicon, and a variety of gene arrays. A preferred hybridization assay is conducted on high-density gene chips as described in U.S. Pat. No. 5,445,934.


For a convenient detection of the probe-target complexes formed during the hybridization assay, the nucleotide probes are conjugated to a detectable label. Detectable labels suitable for use in the present invention include any composition detectable by photochemical, biochemical, spectroscopic, immunochemical, electrical, optical or chemical means. A wide variety of appropriate detectable labels are known in the art, which include fluorescent or chemiluminescent labels, radioactive isotope labels, enzymatic or other ligands. In preferred embodiments, one will likely desire to employ a fluorescent label or an enzyme tag, such as digoxigenin, β-galactosidase, urease, alkaline phosphatase or peroxidase, avidin/biotin complex.


Detection methods used to detect or quantify the hybridization intensity will typically depend upon the label selected above. For example, radiolabels may be detected using photographic film or a phosphorimager. Fluorescent markers may be detected and quantified using a photodetector to detect emitted light. Enzymatic labels are typically detected by providing the enzyme with a substrate and measuring the reaction product produced by the action of the enzyme on the substrate; and finally colorimetric labels are detected by simply visualizing the colored label.


An agent-induced change in expression of sequences associated with a signaling biochemical pathway can also be determined by examining the corresponding gene products. Determining the protein level typically involves (a) contacting the protein contained in a biological sample with an agent that specifically binds to a protein associated with a signaling biochemical pathway; and (b) identifying any agent:protein complex so formed. In one aspect of this embodiment, the agent that specifically binds a protein associated with a signaling biochemical pathway is an antibody, preferably a monoclonal antibody.


The reaction can be performed by contacting the agent with a sample of the proteins associated with a signaling biochemical pathway derived from the test samples under conditions that will allow a complex to form between the agent and the proteins associated with a signaling biochemical pathway. The formation of the complex can be detected directly or indirectly according to standard procedures in the art. In the direct detection method, the agents are supplied with a detectable label and unreacted agents may be removed from the complex; the amount of remaining label thereby indicating the amount of complex formed. For such method, it is preferable to select labels that remain attached to the agents even during stringent washing conditions. It is preferable that the label does not interfere with the binding reaction. In the alternative, an indirect detection procedure may use an agent that contains a label introduced either chemically or enzymatically. A desirable label generally does not interfere with binding or the stability of the resulting agent:polypeptide complex. However, the label is typically designed to be accessible to an antibody for an effective binding and hence generating a detectable signal.


A wide variety of labels suitable for detecting protein levels are known in the art. Non-limiting examples include radioisotopes, enzymes, colloidal metals, fluorescent compounds, bioluminescent compounds, and chemiluminescent compounds.


The amount of agent:polypeptide complexes formed during the binding reaction can be quantified by standard quantitative assays. As illustrated above, the formation of agent:polypeptide complex can be measured directly by the amount of label remaining at the site of binding. In an alternative, the protein associated with a signaling biochemical pathway is tested for its ability to compete with a labeled analog for binding sites on the specific agent. In this competitive assay, the amount of label captured is inversely proportional to the amount of protein sequences associated with a signaling biochemical pathway present in a test sample.


A number of techniques for protein analysis based on the general principles outlined above are available in the art. They include but are not limited to radioimmunoassays, ELISA (enzyme-linked immunosorbent assays), “sandwich” immunoassays, immunoradiometric assays, in situ immunoassays (using e.g., colloidal gold, enzyme or radioisotope labels), western blot analysis, immunoprecipitation assays, immunofluorescent assays, and SDS-PAGE.


Antibodies that specifically recognize or bind to proteins associated with a signaling biochemical pathway are preferable for conducting the aforementioned protein analyses. Where desired, antibodies that recognize a specific type of post-translational modification (e.g., signaling biochemical pathway inducible modifications) can be used. Post-translational modifications include but are not limited to glycosylation, lipidation, acetylation, and phosphorylation. These antibodies may be purchased from commercial vendors. For example, anti-phosphotyrosine antibodies that specifically recognize tyrosine-phosphorylated proteins are available from a number of vendors including Invitrogen and Perkin Elmer. Anti-phosphotyrosine antibodies are particularly useful in detecting proteins that are differentially phosphorylated on their tyrosine residues in response to an ER stress. Such proteins include but are not limited to eukaryotic translation initiation factor 2 alpha (eIF-2α). Alternatively, these antibodies can be generated using conventional polyclonal or monoclonal antibody technologies by immunizing a host animal or an antibody-producing cell with a target protein that exhibits the desired post-translational modification.


In practicing a subject method, it may be desirable to discern the expression pattern of a protein associated with a signaling biochemical pathway in different bodily tissues, in different cell types, and/or in different subcellular structures. These studies can be performed with the use of tissue-specific, cell-specific or subcellular structure specific antibodies capable of binding to protein markers that are preferentially expressed in certain tissues, cell types, or subcellular structures.


An altered expression of a gene associated with a signaling biochemical pathway can also be determined by examining a change in activity of the gene product relative to a control cell. The assay for an agent-induced change in the activity of a protein associated with a signaling biochemical pathway will depend on the biological activity and/or the signal transduction pathway that is under investigation. For example, where the protein is a kinase, a change in its ability to phosphorylate the downstream substrate(s) can be determined by a variety of assays known in the art. Representative assays include but are not limited to immunoblotting and immunoprecipitation with antibodies such as anti-phosphotyrosine antibodies that recognize phosphorylated proteins. In addition, kinase activity can be detected by high throughput chemiluminescent assays such as AlphaScreen™ (available from Perkin Elmer) and eTag™ assay (Chan-Hui, et al. (2003) Clinical Immunology 111: 162-174).


Where the protein associated with a signaling biochemical pathway is part of a signaling cascade leading to a fluctuation of intracellular pH condition, pH sensitive molecules such as fluorescent pH dyes can be used as the reporter molecules. In another example where the protein associated with a signaling biochemical pathway is an ion channel, fluctuations in membrane potential and/or intracellular ion concentration can be monitored. A number of commercial kits and high-throughput devices are particularly suited for a rapid and robust screening for modulators of ion channels. Representative instruments include FLIPR™ (Molecular Devices, Inc.) and VIPR (Aurora Biosciences). These instruments are capable of detecting reactions in over 1000 sample wells of a microplate simultaneously, and providing real-time measurement and functional data within a second or even a millisecond.


In practicing any of the methods disclosed herein, a suitable vector can be introduced to a cell, tissue, organism, or an embryo via one or more methods known in the art, including without limitation, microinjection, electroporation, sonoporation, biolistics, calcium phosphate-mediated transfection, cationic transfection, liposome transfection, dendrimer transfection, heat shock transfection, nucleofection transfection, magnetofection, lipofection, impalefection, optical transfection, proprietary agent-enhanced uptake of nucleic acids, and delivery via liposomes, immunoliposomes, virosomes, or artificial virions. In some methods, the vector is introduced into an embryo by microinjection. The vector or vectors may be microinjected into the nucleus or the cytoplasm of the embryo. In some methods, the vector or vectors may be introduced into a cell by nucleofection.


A target polynucleotide of a targetable nuclease complex can be any polynucleotide endogenous or exogenous to the host cell. For example, the target polynucleotide can be a polynucleotide residing in the nucleus of a eukaryotic cell, the genome of a prokaryotic cell, or an extrachromosomal vector of a host cell. The target polynucleotide can be a sequence coding a gene product (e.g., a protein) or a non-coding sequence (e.g., a regulatory polynucleotide or junk DNA).


Examples of target polynucleotides include a sequence associated with a signaling biochemical pathway, e.g., a signaling biochemical pathway-associated gene or polynucleotide. Examples of target polynucleotides include a disease-associated gene or polynucleotide. A “disease-associated” gene or polynucleotide refers to any gene or polynucleotide which is yielding transcription or translation products at an abnormal level or in an abnormal form in cells derived from disease-affected tissues compared with tissues or cells of a non-disease control. It may be a gene that becomes expressed at an abnormally high level; it may be a gene that becomes expressed at an abnormally low level, where the altered expression correlates with the occurrence and/or progression of the disease. A disease-associated gene also refers to a gene possessing mutation(s) or genetic variation that is directly responsible or is in linkage disequilibrium with a gene(s) that is responsible for the etiology of a disease. The transcribed or translated products may be known or unknown, and may be at a normal or abnormal level.


Embodiments of the invention also relate to methods and compositions related to knocking out genes, editing genes, altering genes, amplifying genes, and repairing particular mutations. Altering genes may also mean the epigenetic manipulation of a target sequence. This may be the chromatin state of a target sequence, such as by modification of the methylation state of the target sequence (i.e. addition or removal of methylation or methylation patterns or CpG islands), histone modification, increasing or reducing accessibility to the target sequence, or by promoting 3D folding. It will be appreciated that where reference is made to a method of modifying a cell, organism, or mammal including human or a non-human mammal or organism by manipulation of a target sequence in a genomic locus of interest, this may apply to the organism (or mammal) as a whole or just a single cell or population of cells from that organism (if the organism is multicellular). In the case of humans, for instance, Applicants envisage, inter alia, a single cell or a population of cells and these may preferably be modified ex vivo and then re-introduced. In this case, a biopsy or other tissue or biological fluid sample may be necessary. Stem cells are also particularly preferred in this regard. But, of course, in vivo embodiments are also envisaged. And the invention is especially advantageous as to HSCs.


The functionality of a targetable nuclease complex can be assessed by any suitable assay. For example, the components of a targetable nuclease system sufficient to form a targetable nuclease complex, including a guide nucleic acid and nucleic acid-guided nuclease, can be provided to a host cell having the corresponding target sequence, such as by transfection with vectors encoding the components of the engineered nuclease system, followed by an assessment of preferential cleavage within the target sequence. Similarly, cleavage of a target sequence may be evaluated in a test tube by providing the target sequence and components of a targetable nuclease complex. Other assays are possible, and will occur to those skilled in the art. A guide sequence can be selected to target any target sequence. In some embodiments, the target sequence is a sequence within a genome of a cell. Exemplary target sequences include those that are unique in the target genome.


Editing Cassette

Disclosed herein are compositions and methods for editing a target polynucleotide sequence. Such compositions include polynucleotides containing one or more components of a targetable nuclease system. Polynucleotide sequences for use in these methods can be referred to as editing cassettes.


An editing cassette can comprise one or more primer sites. Primer sites can be used to amplify an editing cassette by using oligonucleotide primers comprising reverse complementary sequences that can hybridize to the one or more primer sites. An editing cassette can comprise two or more primer sites. Sometimes, an editing cassette comprises a primer site on each end of the editing cassette, said primer sites flanking one or more of the other components of the editing cassette. Primer sites can be approximately 10, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26 or more nucleotides in length.
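By way of non-limiting illustration, the relationship between a primer site and a primer comprising its reverse complementary sequence can be sketched in Python as follows. The function name and the example sequence are hypothetical and are not taken from the present disclosure.

```python
# Hypothetical helper: derive the reverse complement of a DNA primer site.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence containing only A/C/G/T."""
    return seq.translate(_COMPLEMENT)[::-1]


# Made-up 20-nucleotide primer site, used only for illustration.
primer_site = "ATGCGTACCGTTAGCTAGGA"

# A primer able to hybridize to the primer site is its reverse complement.
primer = reverse_complement(primer_site)
```

Applying the function twice returns the original primer site, which is a convenient sanity check on any such helper.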


An editing cassette can comprise an editing template as disclosed herein. An editing cassette can comprise an editing sequence. An editing sequence can be homologous to a target sequence. An editing sequence can comprise at least one mutation relative to a target sequence. An editing sequence often comprises homology regions (or homology arms) flanking at least one mutation relative to a target sequence, such that the flanking homology regions facilitate homologous recombination of the editing sequence into a target sequence. An editing sequence can comprise an editing template as disclosed herein. For example, the editing sequence can comprise at least one mutation relative to a target sequence including one or more PAM mutations that mutate or delete a PAM site. An editing sequence can comprise one or more mutations in a codon or non-coding sequence relative to a non-editing target site.


A PAM mutation can be a silent mutation. A silent mutation can be a change to at least one nucleotide of a codon relative to the original codon that does not change the amino acid encoded by the original codon. A silent mutation can be a change to a nucleotide within a non-coding region, such as an intron, 5′ untranslated region, 3′ untranslated region, or other non-coding region.


A PAM mutation can be a non-silent mutation. Non-silent mutations can include a missense mutation. A missense mutation can be a change to at least one nucleotide of a codon relative to the original codon that changes the amino acid encoded by the original codon. Missense mutations can occur within an exon, open reading frame, or other coding region.
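As a minimal, non-limiting sketch of the distinction between silent and missense mutations described above, the following Python classifies a single codon change using the standard genetic code. The function name, the return labels, and the example codons are hypothetical; changes that create or remove a stop codon are reported separately because they are neither silent nor missense.

```python
from itertools import product

# Standard genetic code, with codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def classify_codon_change(original: str, mutated: str) -> str:
    """Classify a codon change as 'silent', 'missense', or 'other non-silent'."""
    aa_before = CODON_TABLE[original.upper()]
    aa_after = CODON_TABLE[mutated.upper()]
    if aa_before == aa_after:
        return "silent"            # encoded amino acid is unchanged
    if "*" in (aa_before, aa_after):
        return "other non-silent"  # a stop codon is gained or lost
    return "missense"              # encoded amino acid is changed
```

For instance, under the standard code both CTG and CTA encode leucine, so that change would be classified as silent, whereas GAA to GTA changes glutamate to valine and would be classified as missense.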


An editing sequence can comprise at least one mutation relative to a target sequence. A mutation can be a silent mutation or non-silent mutation, such as a missense mutation. A mutation can include an insertion of one or more nucleotides or base pairs. A mutation can include a deletion of one or more nucleotides or base pairs. A mutation can include a substitution of one or more nucleotides or base pairs for a different one or more nucleotides or base pairs. Inserted or substituted sequences can include exogenous or heterologous sequences.


An editing cassette can comprise a polynucleotide encoding a guide nucleic acid sequence. In some cases, the guide nucleic acid sequence is optionally operably linked to a promoter. A guide nucleic acid sequence can comprise a scaffold sequence and a guide sequence as described herein.


An editing cassette can comprise a barcode. A barcode can be a unique DNA sequence that corresponds to the editing sequence such that the barcode can identify the one or more mutations of the corresponding editing sequence. In some examples, the barcode is 15 nucleotides. The barcode can comprise less than 10, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 88, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, or more than 200 nucleotides. A barcode can be a non-naturally occurring sequence. An editing cassette comprising a barcode can be a non-naturally occurring sequence.
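For illustration only, one way to pair unique 15-nucleotide barcodes with editing sequences in silico is sketched below in Python. The function name, the random seed, and the edit identifiers are hypothetical; other barcode lengths or design constraints could equally be used.

```python
import random


def make_barcodes(n: int, length: int = 15, seed: int = 0) -> list[str]:
    """Return n distinct random DNA barcodes of the given length."""
    if n > 4 ** length:
        raise ValueError("more barcodes requested than distinct sequences exist")
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    barcodes: set[str] = set()
    while len(barcodes) < n:
        barcodes.add("".join(rng.choice("ACGT") for _ in range(length)))
    return sorted(barcodes)


# Hypothetical edit identifiers; each barcode identifies exactly one edit.
edits = ["edit_A", "edit_B", "edit_C"]
barcode_to_edit = dict(zip(make_barcodes(len(edits)), edits))
```

Because every barcode is distinct, looking up a sequenced barcode in the mapping identifies the corresponding editing sequence.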


An editing cassette can comprise one or more of an editing sequence and a polynucleotide encoding a guide nucleic acid optionally operably linked to a promoter, wherein the editing cassette and guide nucleic acid sequence are flanked by primer sites. An editing cassette can further comprise a barcode.
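The arrangement described above can be sketched, purely as a non-limiting example, with a small Python data structure. The class name, field names, element order, and sequences are all assumptions made for illustration; the disclosure does not prescribe this particular layout.

```python
from dataclasses import dataclass


@dataclass
class EditingCassette:
    """Hypothetical container for editing cassette components (DNA strings)."""
    primer_site_5: str
    editing_sequence: str
    guide_encoding: str   # optional promoter plus sequence encoding the guide
    primer_site_3: str
    barcode: str = ""     # optional barcode

    def assemble(self) -> str:
        # Assumed element order; the primer sites flank the other components.
        return (
            self.primer_site_5
            + self.editing_sequence
            + self.guide_encoding
            + self.barcode
            + self.primer_site_3
        )


# Made-up sequences used only to exercise the sketch.
cassette = EditingCassette(
    primer_site_5="AAAACCCC",
    editing_sequence="GATTACA",
    guide_encoding="GGGTTT",
    primer_site_3="CCCCGGGG",
    barcode="ACGTACGTACGTACG",
)
full_sequence = cassette.assemble()
```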


An example of an editing cassette is depicted in FIG. 3. Each editing cassette can be designed to edit a site in a target sequence. Sites to be targeted can be coding regions, non-coding regions, functionally neutral sites, or they can be a screenable or selectable marker gene. Homology regions within the editing sequence flank the one or more mutations of the editing cassette and can be inserted into the target sequence by recombination. Recombination can comprise DNA cleavage, such as by a nucleic acid-guided nuclease, and repair via homologous recombination.


Editing cassettes can be generated by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof.


Trackable sequences, such as barcodes or recorder sequences, can be designed in silico via standard code with a degenerate mutation at the target codon. The degenerate mutation can comprise 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, or more than 30 nucleic acid residues. In some examples, the degenerate mutations can comprise 15 nucleic acid residues (N15).
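As a hedged illustration of the sequence diversity available from a fully degenerate stretch, the Python sketch below counts and enumerates the sequences encoded by N residues, assuming each position can be any of A, C, G, or T. The function names are hypothetical.

```python
from itertools import product
from typing import Iterator


def degenerate_diversity(n: int) -> int:
    """Number of distinct sequences encoded by n fully degenerate (N) positions."""
    return 4 ** n


def expand_degenerate(n: int) -> Iterator[str]:
    """Lazily enumerate every sequence of n fully degenerate positions."""
    for bases in product("ACGT", repeat=n):
        yield "".join(bases)


# An N15 stretch can encode 4**15 = 1,073,741,824 distinct sequences.
n15_diversity = degenerate_diversity(15)
```

Enumeration is practical only for short stretches; for N15 the count alone is usually what matters when judging whether a library can be uniquely barcoded.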


Homology arms can be added to an editing sequence to allow incorporation of the editing sequence into the desired location via homologous recombination or homology-driven repair. Homology arms can be added by synthesis, in vitro assembly, PCR, or other known methods in the art. For example, homology arms can be added by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof. A homology arm can be added to both ends of a barcode, recorder sequence, and/or editing sequence, thereby flanking the sequence with two distinct homology arms, for example, a 5′ homology arm and a 3′ homology arm.


A homology arm can comprise sequence homologous to a target sequence. A homology arm can comprise sequence homologous to sequence adjacent to a target sequence. A homology arm can comprise sequence homologous to sequence upstream or downstream of a target sequence. A homology arm can comprise sequence homologous to sequence within the same gene or open reading frame as a target sequence. A homology arm can comprise sequence homologous to sequence upstream or downstream of a gene or open reading frame the target sequence is within. A homology arm can comprise sequence homologous to a 5′ UTR or 3′ UTR of a gene or open reading frame within which is a target sequence. A homology arm can comprise sequence homologous to a different gene, open reading frame, promoter, terminator, or nucleic acid sequence than that which the target sequence is within.


The same 5′ and 3′ homology arms can be added to a plurality of distinct editing sequences, thereby generating a library of unique editing sequences that each have the same targeted insertion site. The same 5′ and 3′ homology arms can be added to a plurality of distinct editing templates, thereby generating a library of unique editing templates that each have the same targeted insertion site. In alternative examples, different or a variety of 5′ or 3′ homology arms can be added to a plurality of editing sequences or editing templates.
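A minimal in silico sketch of this library construction, in which the same 5′ and 3′ homology arms are added to several distinct editing sequences, is given below in Python. The arm sequences, editing sequences, and function name are made up for illustration and do not correspond to any sequence of the present disclosure.

```python
# Hypothetical homology arms shared by every member of the library.
ARM_5 = "ATGGCTAGCTAGGATCC"
ARM_3 = "GAATTCCGATCGATTAA"


def add_homology_arms(editing_sequence: str, arm_5: str = ARM_5, arm_3: str = ARM_3) -> str:
    """Flank an editing sequence with a 5' and a 3' homology arm."""
    return arm_5 + editing_sequence + arm_3


# Distinct (made-up) editing sequences that share one targeted insertion site.
editing_sequences = ["GCTGAA", "GCTGCA", "GCTAAA"]
library = [add_homology_arms(seq) for seq in editing_sequences]
```

Each library member differs only in its central editing sequence, so all members would be directed to the same insertion site by the shared arms.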


A barcode library or recorder sequence library comprising flanking homology arms can be cloned into a vector backbone. In some examples, the barcode comprising flanking homology arms is cloned into an editing cassette. Cloning can occur by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof.


An editing sequence library comprising flanking homology arms can be cloned into a vector backbone. In some examples, the editing sequence and homology arms are cloned into an editing cassette. Editing cassettes can, in some cases, further comprise a nucleic acid sequence encoding a guide nucleic acid or gRNA engineered to target the desired site of editing sequence insertion, e.g. the target sequence. Editing cassettes can, in some cases, further comprise a barcode or recorder sequence. Cloning can occur by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof.


Gene-wide or genome-wide editing libraries can be cloned into a vector backbone. A barcode or recorder sequence library can be inserted or assembled into a second site to generate competent trackable plasmids that can embed the recording barcode at a fixed locus while integrating the editing libraries at a wide variety of user defined sites. Cloning can occur by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof.


A guide nucleic acid or sequence encoding the same can be assembled or inserted into a vector backbone first, followed by insertion of an editing sequence and/or cassette. In other cases, an editing sequence and/or cassette can be inserted or assembled into a vector backbone first, followed by insertion of a guide nucleic acid or sequence encoding the same. In other cases, a guide nucleic acid or sequence encoding the same and an editing sequence and/or cassette are simultaneously inserted or assembled into a vector. A recorder sequence or barcode can be inserted before or after any of these steps. In other words, it should be understood that there are many possible permutations to the order in which elements of the disclosure are assembled. The vector can be linear or circular and can be generated by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof.


A nucleic acid molecule can be synthesized which comprises one or more elements disclosed herein. A nucleic acid molecule can be synthesized that comprises an editing cassette. A nucleic acid molecule can be synthesized that comprises a guide nucleic acid. A nucleic acid molecule can be synthesized that comprises a recorder cassette. A nucleic acid molecule can be synthesized that comprises a barcode. A nucleic acid molecule can be synthesized that comprises a homology arm. A nucleic acid molecule can be synthesized that comprises an editing cassette and a guide nucleic acid. A nucleic acid molecule can be synthesized that comprises an editing cassette and a barcode. A nucleic acid molecule can be synthesized that comprises an editing cassette, a guide nucleic acid, and a recorder cassette. A nucleic acid molecule can be synthesized that comprises an editing cassette, a recorder cassette, and two guide nucleic acids. A nucleic acid molecule can be synthesized that comprises a recorder cassette and a guide nucleic acid. In any of these cases, the guide nucleic acid can optionally be operably linked to a promoter. In any of these cases, the nucleic acid molecule can further include one or more barcodes.


Synthesis can occur by any nucleic acid synthesis method known in the art. Synthesis can occur by enzymatic nucleic acid synthesis. Synthesis can occur by chemical synthesis. Synthesis can occur by array-based synthesis. Synthesis can occur by solid-phase synthesis or phosphoramidite methods. Synthesis can occur by column or multi-well methods. Synthesized nucleic acid molecules can be non-naturally occurring nucleic acid molecules.


Software and automation methods can be used for multiplex synthesis and generation. For example, software and automation can be used to create 10, 10^2, 10^3, 10^4, 10^5, 10^6, or more synthesized polynucleotides, cassettes, or plasmids. An automation method can generate desired sequences and libraries in rapid fashion that can be processed through a workflow with minimal steps to produce precisely defined libraries, such as gene-wide or genome-wide editing libraries.


Polynucleotides or libraries can be generated which comprise two or more nucleic acid molecules or plasmids comprising any combination disclosed herein of recorder sequence, editing sequence, guide nucleic acid, and optional barcode, including combinations of one or more of any of the previously mentioned elements. For example, such a library can comprise at least 2, 3, 4, 5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 200, 300, 400, 500, 600, 700, 800, 1000, 1500, 2000, 2500, 3000, 3500, 4000, 4500, 5000, 5500, 6000, 6500, 7000, 7500, 8000, 8500, 9000, 9500, 10^4, 10^5, 10^6, 10^7, 10^8, 10^9, 10^10, or more nucleic acid molecules or plasmids of the present disclosure. It should be understood that such a library can include any number of nucleic acid molecules or plasmids, even if the specific number is not explicitly listed above.


Trackable plasmid libraries or nucleic acid molecule libraries can be sequenced in order to determine the recorder sequence and editing sequence pair that is comprised on each trackable plasmid. In other cases, a known recorder sequence is paired with a known editing sequence during the library generation process. Other methods of determining the association between a recorder sequence and editing sequence comprised on a common nucleic acid molecule or plasmid are envisioned such that the editing sequence can be identified by identification or sequencing of the recorder sequence.
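By way of non-limiting example, the recorder-to-edit association described above can be represented as a lookup table built from sequenced plasmids and then used to infer edits from recorder reads alone. The Python below is a sketch under that assumption; the record format, sequences, and function names are hypothetical.

```python
from collections import Counter

# Hypothetical sequencing result: one (recorder sequence, editing sequence ID) pair per plasmid.
plasmid_records = [
    ("AAACCCGGGTTTAAA", "edit_1"),
    ("CCCGGGTTTAAACCC", "edit_2"),
    ("GGGTTTAAACCCGGG", "edit_3"),
]


def build_recorder_map(records):
    """Map each recorder sequence to its editing sequence, rejecting conflicts."""
    mapping = {}
    for recorder, edit in records:
        if mapping.setdefault(recorder, edit) != edit:
            raise ValueError(f"recorder {recorder} is paired with more than one edit")
    return mapping


def tally_edits(observed_recorders, mapping):
    """Count edits inferred from observed recorder reads; unknown reads are skipped."""
    return Counter(mapping[r] for r in observed_recorders if r in mapping)


recorder_map = build_recorder_map(plasmid_records)
# Made-up recorder reads, including one that is absent from the map.
reads = ["AAACCCGGGTTTAAA", "AAACCCGGGTTTAAA", "GGGTTTAAACCCGGG", "TTTTTTTTTTTTTTT"]
edit_counts = tally_edits(reads, recorder_map)
```

In this sketch the editing sequence is never sequenced directly after library construction; it is identified solely through the recorder sequence with which it was paired.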


Methods and compositions for tracking edited episomal libraries that are shuttled between E. coli and other organisms/cell lines are provided herein. The libraries can be comprised on plasmids, bacterial artificial chromosomes (BACs), yeast artificial chromosomes (YACs), synthetic chromosomes, or viral or phage genomes. These methods and compositions can be used to generate portable barcoded libraries in host organisms, such as E. coli. Library generation in such organisms can offer the advantage of established techniques for performing homologous recombination. Barcoded plasmid libraries can be deep-sequenced at one site to track mutational diversity targeted across the remaining portions of the plasmid, allowing dramatic improvements in the depth of library coverage.


Any nucleic acid molecule disclosed herein can be an isolated nucleic acid. Isolated nucleic acids may be made by any method known in the art, for example using standard recombinant methods, assembly methods, synthesis techniques, or combinations thereof. In some embodiments, the nucleic acids may be cloned, amplified, assembled, or otherwise constructed.


Isolated nucleic acids may be obtained from cellular, bacterial, or other sources using any number of cloning methodologies known in the art. In some embodiments, oligonucleotide probes which selectively hybridize, under stringent conditions, to other oligonucleotides or to the nucleic acids of an organism or cell can be used to isolate or identify an isolated nucleic acid.


Cellular genomic DNA, RNA, or cDNA may be screened for the presence of an identified genetic element of interest using a probe based upon one or more sequences. Various degrees of stringency of hybridization may be employed in the assay.


High stringency conditions for nucleic acid hybridization are well known in the art. For example, conditions may comprise low salt and/or high temperature conditions, such as provided by about 0.02 M to about 0.15 M NaCl at temperatures of about 50° C. to about 70° C. It is understood that the temperature and ionic strength of a desired stringency are determined in part by the length of the particular nucleic acid(s), the length and nucleotide content of the target sequence(s), the charge composition of the nucleic acid(s), and by the presence or concentration of formamide, tetramethylammonium chloride or other solvent(s) in a hybridization mixture. Nucleic acids may be completely complementary to a target sequence or may exhibit one or more mismatches.


Nucleic acids of interest may also be amplified using a variety of known amplification techniques. For instance, polymerase chain reaction (PCR) technology may be used to amplify target sequences directly from DNA, RNA, or cDNA. PCR and other in vitro amplification methods may also be useful, for example, to clone nucleic acid sequences, to make nucleic acids to use as probes for detecting the presence of a target nucleic acid in samples, for nucleic acid sequencing, or for other purposes.


Isolated nucleic acids may be prepared by direct chemical synthesis by methods such as the phosphotriester method, or using an automated synthesizer. Chemical synthesis generally produces a single stranded oligonucleotide. This may be converted into double stranded DNA by hybridization with a complementary sequence or by polymerization with a DNA polymerase using the single strand as a template.


Recorder

In some examples, two editing cassettes can be used together to track a genetic engineering step. For example, one editing cassette can comprise an editing template and an encoded guide nucleic acid, and a second editing cassette, referred to as a recorder cassette, can comprise an editing template comprising a recorder sequence and an encoded guide nucleic acid which has a distinct guide sequence compared to that of the first editing cassette. In such cases, the editing sequence and the recorder sequence can be inserted into separate target sequences, as determined by their corresponding guide nucleic acids. A recorder sequence can comprise a barcode, trackable or traceable sequence, and/or a regulatory element operable with a screenable or selectable marker.


Through a multiplexed cloning approach, the recorder cassette can be covalently coupled to at least one editing cassette in a plasmid (e.g., FIG. 17A, green cassette) to generate plasmid libraries that have a unique recorder and editing cassette combination. This library can be sequenced to generate the recorder/edit mapping and used to track editing libraries across large segments of the target DNA (e.g., FIG. 17C). Recorder and editing sequences can be comprised on the same cassette, in which case they are both incorporated into the target nucleic acid sequence, such as a genome or plasmid, by the same recombination event. In other examples, the recorder and editing sequences can be comprised on separate cassettes within the same plasmid, in which case the recorder and editing sequences are incorporated into the target nucleic acid sequence by separate recombination events, either simultaneously or sequentially.


Methods are provided herein for combining multiplex oligonucleotide synthesis with recombineering, to create libraries of specifically designed and trackable mutations. Screens and/or selections followed by high-throughput sequencing and/or barcode microarray methods can allow for rapid mapping of mutations leading to a phenotype of interest.


Methods and compositions disclosed herein can be used to simultaneously engineer and track engineering events in a target nucleic acid sequence.


Such plasmids can be generated using in vitro assembly or cloning techniques. For example, the plasmids can be generated using chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, other in vitro oligo assembly techniques, traditional ligation-based cloning, or any combination thereof.


Such plasmids can comprise at least one recording sequence, such as a barcode, and at least one editing sequence. In most cases, the recording sequence is used to record and track engineering events. Each editing sequence can be used to incorporate a desired edit into a target nucleic acid sequence. The desired edit can include insertion, deletion, substitution, or alteration of the target nucleic acid sequence. In some examples, the one or more recording sequence and editing sequences are comprised on a single cassette comprised within the plasmid such that they are incorporated into the target nucleic acid sequence by the same engineering event. In other examples, the recording and editing sequences are comprised on separate cassettes within the plasmid such that they are each incorporated into the target nucleic acid by distinct engineering events. In some examples, the plasmid comprises two or more editing sequences. For example, one editing sequence can be used to alter or silence a PAM sequence while a second editing sequence can be used to incorporate a mutation into a distinct sequence.


Recorder sequences can be inserted into a site separated from the editing sequence insertion site. The inserted recorder sequence can be separated from the editing sequence by 1 bp to 1 Mbp. For example, the separation distance can be about 1 bp, 10 bp, 50 bp, 100 bp, 500 bp, 1 kb, 2 kb, 5 kb, 10 kb, or greater. The separation distance can be any discrete integer between 1 bp and 10 Mbp. In some examples, the maximum distance of separation depends on the size of the target nucleic acid or genome.


Recorder sequences can be inserted adjacent to editing sequences, or within proximity to the editing sequence. For example, the recorder sequence can be inserted outside of the open reading frame within which the editing sequence is inserted. The recorder sequence can be inserted into an untranslated region adjacent to an open reading frame within which an editing sequence has been inserted. The recorder sequence can be inserted into a functionally neutral or non-functional site. The recorder sequence can be inserted into a screenable or selectable marker gene.


In some examples, the target nucleic acid sequence is comprised within a genome, artificial chromosome, synthetic chromosome, or episomal plasmid. In various examples, the target nucleic acid sequence can be in vitro or in vivo. When the target nucleic acid sequence is in vivo, the plasmid can be introduced into the host organisms by transformation, transfection, conjugation, biolistics, nanoparticles, cell-permeable technologies, or other known methods for DNA delivery, or any combination thereof. In such examples, the host organism can be a eukaryote, prokaryote, bacterium, archaea, yeast, or other fungi.


The engineering event can comprise recombineering, non-homologous end joining, homologous recombination, or homology-driven repair. In some examples, the engineering event is performed in vitro or in vivo.


The methods described herein can be carried out in any type of cell in which a targetable nuclease system can function (e.g., target and cleave DNA), including prokaryotic and eukaryotic cells. In some embodiments the cell is a bacterial cell, such as Escherichia spp. (e.g., E. coli). In other embodiments, the cell is a fungal cell, such as a yeast cell, e.g., Saccharomyces spp. In other embodiments, the cell is an algal cell, a plant cell, an insect cell, or a mammalian cell, including a human cell.


In some examples, the cell is a recombinant organism. For example, the cell can comprise a non-native targetable nuclease system. Additionally or alternatively, the cell can comprise recombination system machinery. Such recombination systems can include lambda red recombination system, Cre/Lox, attB/attP, or other integrase systems. Where appropriate, the plasmid can have the complementary components or machinery required for the selected recombination system to work correctly and efficiently.


A method for genome editing can comprise: (a) introducing a vector that encodes at least one editing cassette and at least one guide nucleic acid into a first population of cells, thereby producing a second population of cells comprising the vector; (b) maintaining the second population of cells under conditions in which a nucleic acid-guided nuclease is expressed or maintained, wherein the nucleic acid-guided nuclease is encoded on the vector, a second vector, on the genome of cells of the second population of cells, or otherwise introduced into the cell, resulting in DNA cleavage and incorporation of the editing cassette; (c) obtaining viable cells; and (d) sequencing the target DNA molecule in at least one cell of the second population of cells to identify the mutation of at least one codon.


A method for genome editing can comprise: (a) introducing a vector that encodes at least one editing cassette comprising a PAM mutation as disclosed herein and at least one guide nucleic acid into a first population of cells, thereby producing a second population of cells comprising the vector; (b) maintaining the second population of cells under conditions in which a nucleic acid-guided nuclease is expressed or maintained, wherein the nucleic acid-guided nuclease is encoded on the vector, a second vector, on the genome of cells of the second population of cells, or otherwise introduced into the cell, resulting in DNA cleavage, incorporation of the editing cassette, and death of cells of the second population of cells that do not comprise the PAM mutation, whereas cells of the second population of cells that comprise the PAM mutation are viable; (c) obtaining viable cells; and (d) sequencing the target DNA in at least one cell of the second population of cells to identify the mutation of at least one codon.


A method for trackable genome editing can comprise: (a) introducing a vector that encodes at least one editing cassette, at least one recorder cassette, and at least two guide nucleic acids into a first population of cells, thereby producing a second population of cells comprising the vector; (b) maintaining the second population of cells under conditions in which a nucleic acid-guided nuclease is expressed or maintained, wherein the nucleic acid-guided nuclease is encoded on the vector, a second vector, on the genome of cells of the second population of cells, or otherwise introduced into the cell, resulting in DNA cleavage and incorporation of the editing and recorder cassettes; (c) obtaining viable cells; and (d) sequencing the recorder sequence of the target DNA molecule in at least one cell of the second population of cells to identify the mutation of at least one codon.


In some examples where the plasmid comprises a second editing sequence designed to silence a PAM, a method for trackable genome editing can comprise: (a) introducing a vector that encodes at least one editing cassette, a recorder cassette, and at least two guide nucleic acids into a first population of cells, thereby producing a second population of cells comprising the vector; (b) maintaining the second population of cells under conditions in which a nucleic acid-guided nuclease is expressed or maintained, wherein the nucleic acid-guided nuclease is encoded on the vector, a second vector, on the genome of cells of the second population of cells, or otherwise introduced into the cell, resulting in DNA cleavage, incorporation of the editing and recorder cassettes, and death of cells of the second population of cells that do not comprise the PAM mutation, whereas cells of the second population of cells that comprise the PAM mutation are viable; (c) obtaining viable cells; and (d) sequencing the recorder sequence of the target DNA in at least one cell of the second population of cells to identify the mutation of at least one codon.


In some examples, transformation efficiency is determined by using a non-targeting control guide nucleic acid, which allows for validation of the recombineering procedure and CFU/ng calculations. In some cases, absolute efficiency is obtained by counting the total number of colonies on each transformation plate, for example, by counting both red and white colonies from a galK control. In some examples, relative efficiency is calculated as the number of successful transformants (for example, white colonies) out of all colonies from a control (for example, a galK control).
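The efficiency calculations described above can be sketched as follows. This is a minimal illustration, not a procedure taken from the disclosure: the function names, the colony counts, the DNA mass, and the plated fraction are all hypothetical, and the scaling of CFU/ng by the fraction of the transformation that was plated is an assumed convention.

```python
def cfu_per_ng(colonies, dna_ng, fraction_plated=1.0):
    """Colony-forming units per ng of transformed DNA.

    fraction_plated is the (assumed) share of the transformation mix
    that was actually spread on the counted plate.
    """
    if dna_ng <= 0:
        raise ValueError("dna_ng must be positive")
    if not 0 < fraction_plated <= 1:
        raise ValueError("fraction_plated must be in (0, 1]")
    return colonies / (dna_ng * fraction_plated)


def relative_efficiency(white, red):
    """Fraction of colonies that are successful transformants (white)."""
    total = white + red
    if total == 0:
        raise ValueError("no colonies counted")
    return white / total


# Hypothetical counts from a galK control plate.
white, red = 180, 20
absolute = white + red  # absolute efficiency: all colonies on the plate
print("total colonies:", absolute)
print("CFU/ng:", cfu_per_ng(absolute, dna_ng=100, fraction_plated=0.1))
print("relative efficiency:", relative_efficiency(white, red))
```

Keeping the absolute count and the relative fraction as separate functions mirrors the distinction drawn in the text between counting every colony and counting only the edited (white) colonies.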


The methods of the disclosure can provide, for example, greater than 1000× improvements in the efficiency, scale, cost of generating a combinatorial library, and/or precision of such library generation.


The methods of the disclosure can provide, for example, greater than: 10×, 50×, 100×, 200×, 300×, 400×, 500×, 600×, 700×, 800×, 900×, 1000×, 1100×, 1200×, 1300×, 1400×, 1500×, 1600×, 1700×, 1800×, 1900×, 2000×, or greater improvements in the efficiency of generating genomic or combinatorial libraries.


The methods of the disclosure can provide, for example, greater than: 10×, 50×, 100×, 200×, 300×, 400×, 500×, 600×, 700×, 800×, 900×, 1000×, 1100×, 1200×, 1300×, 1400×, 1500×, 1600×, 1700×, 1800×, 1900×, 2000×, or greater improvements in the scale of generating genomic or combinatorial libraries.


The methods of the disclosure can provide, for example, greater than: 10×, 50×, 100×, 200×, 300×, 400×, 500×, 600×, 700×, 800×, 900×, 1000×, 1100×, 1200×, 1300×, 1400×, 1500×, 1600×, 1700×, 1800×, 1900×, 2000×, or greater decrease in the cost of generating genomic or combinatorial libraries.


The methods of the disclosure can provide, for example, greater than: 10×, 50×, 100×, 200×, 300×, 400×, 500×, 600×, 700×, 800×, 900×, 1000×, 1100×, 1200×, 1300×, 1400×, 1500×, 1600×, 1700×, 1800×, 1900×, 2000×, or greater improvements in the precision of genomic or combinatorial library generation.


Recursive Tracking for Combinatorial Engineering

Disclosed herein are methods and compositions for iterative rounds of engineering. Disclosed herein are recursive engineering strategies that allow implementation of CREATE recording at the single cell level through several serial engineering cycles (e.g., FIG. 18 and FIG. 19). These disclosed methods and compositions can enable search-based technologies that can effectively construct and explore complex genotypic space. The terms recursive and iterative can be used interchangeably.


Combinatorial engineering methods can comprise multiple rounds of engineering. Methods disclosed herein can comprise 2 or more rounds of engineering. For example, a method can comprise 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, or more than 30 rounds of engineering.


In some examples, during each round of engineering a new recorder sequence, such as a barcode, is incorporated at the same locus in nearby sites (e.g., FIG. 18, green bars or FIG. 19, black bars) such that following multiple engineering cycles to construct combinatorial diversity throughout the genome (e.g., FIG. 18, green bars or FIG. 19, grey bars) a simple PCR of the recording locus can be used to reconstruct each combinatorial genotype or to confirm that the engineered edit from each round has been incorporated into the target site.
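As an illustration of how a read of the recording locus could be turned back into a combinatorial genotype, the sketch below looks up each round's barcode in a recorder/edit map. The barcodes, edit labels, and the assumption that the read has already been split into one barcode per round (in round order) are hypothetical and are not taken from the disclosure.

```python
# Hypothetical recorder/edit maps, one dictionary per engineering round.
# In practice such maps would come from sequencing the plasmid library.
ROUND_MAPS = [
    {"AAGT": "round1_edit_A", "CCTG": "round1_edit_B"},
    {"GGAC": "round2_edit_A", "TTCA": "round2_edit_B"},
    {"ACAC": "round3_edit_A", "TGTG": "round3_edit_B"},
]


def reconstruct_genotype(barcodes, round_maps):
    """Translate the ordered barcodes from one recording locus into edits."""
    if len(barcodes) > len(round_maps):
        raise ValueError("more barcodes than engineering rounds")
    genotype = []
    for round_no, (barcode, mapping) in enumerate(
        zip(barcodes, round_maps), start=1
    ):
        if barcode not in mapping:
            raise KeyError(f"round {round_no}: unknown barcode {barcode!r}")
        genotype.append(mapping[barcode])
    return genotype


# One cell's recording locus after three rounds (hypothetical read).
print(reconstruct_genotype(["CCTG", "GGAC", "TGTG"], ROUND_MAPS))
```

Counting how often each reconstructed genotype appears across many reads would give the kind of population-level quantitation mentioned elsewhere in this section.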


Disclosed herein are methods for selecting for successive rounds of engineering. Selection can occur by a PAM mutation incorporated by an editing cassette. Selection can occur by a PAM mutation incorporated by a recorder cassette. Selection can occur using a screenable, selectable, or counter-selectable marker. Selection can occur by targeting a site for editing or recording that was incorporated by a prior round of engineering, thereby selecting for variants that successfully incorporated edits and recorder sequences from both rounds or all prior rounds of engineering.


Quantitation of these genotypes can be used for understanding combinatorial mutational effects on large populations and investigation of important biological phenomena such as epistasis.


Serial editing and combinatorial tracking can be implemented using recursive vector systems as disclosed herein. These recursive vector systems can be used to move rapidly through the transformation procedure. In some examples, these systems consist of two or more plasmids containing orthogonal replication origins, antibiotic markers, and an encoded guide nucleic acid. The encoded guide nucleic acid in each vector can be designed to target one of the other resistance markers for destruction by nucleic acid-guided nuclease-mediated cleavage. These systems can be used, in some examples, to perform transformations in which the antibiotic selection pressure is switched to remove the previous plasmid and drive enrichment of the next round of engineered genomes. Two or more passages through the transformation loop can be performed, or in other words, multiple rounds of engineering can be performed. Introducing the requisite recording cassettes and editing cassettes into recursive vectors as disclosed herein can be used for simultaneous genome editing and plasmid curing in each transformation step with high efficiencies.


In some examples, the recursive vector system disclosed herein comprises 2, 3, 4, 5, 6, 7, 8, 9, 10, or more than 10 unique plasmids. In some examples, the recursive vector system can use a particular plasmid more than once as long as a distinct plasmid is used in the previous round and in the subsequent round.
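The plasmid-ordering constraint of a recursive vector system can be checked with a short sketch such as the one below. The plasmid names and marker labels are hypothetical, and the rule that each incoming plasmid's guide targets the marker of the immediately preceding plasmid is an assumption made for illustration; the text only requires that each guide target one of the other resistance markers.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Plasmid:
    name: str
    marker: str        # antibiotic resistance marker carried by this plasmid
    guide_target: str  # marker that this plasmid's guide directs cleavage of


def check_schedule(schedule):
    """Return a list of problems found in an ordered list of plasmids."""
    problems = []
    for round_no, plasmid in enumerate(schedule, start=1):
        if plasmid.guide_target == plasmid.marker:
            problems.append(f"round {round_no}: {plasmid.name} targets itself")
        if round_no == 1:
            continue
        previous = schedule[round_no - 2]
        if previous.name == plasmid.name:
            problems.append(f"round {round_no}: {plasmid.name} used twice in a row")
        elif plasmid.guide_target != previous.marker:
            problems.append(
                f"round {round_no}: {plasmid.name} does not cure {previous.name}"
            )
    return problems


# Hypothetical two-plasmid loop: each plasmid cures the other.
p_a = Plasmid("pA", marker="marker_1", guide_target="marker_2")
p_b = Plasmid("pB", marker="marker_2", guide_target="marker_1")

print(check_schedule([p_a, p_b, p_a]))  # alternating plasmids
print(check_schedule([p_a, p_a]))       # same plasmid in consecutive rounds
```

A two-plasmid loop is the smallest system that satisfies the reuse rule, since a plasmid may reappear only when a distinct plasmid sits between its uses.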


Recursive methods and compositions disclosed herein can be used to restore function to a selectable or screenable element in a targeted genome or plasmid. The selectable or screenable element can include an antibiotic resistance gene, a fluorescent gene, a unique DNA sequence or watermark, or other known reporter, screenable, or selectable gene. In some examples, each successive round of engineering can incorporate a fragment of the selectable or screenable element, such that at the end of the engineering rounds, the entire selectable or screenable element has been incorporated into the target genome or plasmid. In such examples, only those genomes or plasmids which have successfully incorporated all of the fragments, and therefore all of the desired corresponding mutations, can be selected or screened for. In this way, the selected or screened cells will be enriched for those that have incorporated the edits from each and every iterative round of engineering.


Recursive methods can be used to switch a selectable or screenable marker between an on and an off position, or between an off and an on position, with each successive round of engineering. Using such a method allows conservation of available selectable or screenable markers by requiring, for example, the use of only one screenable or selectable marker. Furthermore, short regulatory sequences, start codons, or non-start codons can be used to turn the screenable or selectable marker on and off. Such short sequences can easily fit within a synthesized cassette or polynucleotide.


One or more rounds of engineering can be performed using the methods and compositions disclosed herein. In some examples, each round of engineering is used to incorporate an edit unique from that of previous rounds. Each round of engineering can incorporate a unique recording sequence. Each round of engineering can result in removal or curing of the plasmid used in the previous round of engineering. In some examples, successful incorporation of the recording sequence of each round of engineering results in a complete and functional screenable or selectable marker or unique sequence combination.


Unique recorder cassettes comprising recording sequences such as barcodes or screenable or selectable markers can be inserted with each round of engineering, thereby generating a recorder sequence that is indicative of the combination of edits or engineering steps performed. Successive recording sequences can be inserted adjacent to one another. Successive recording sequences can be inserted within proximity to one another.


Successive sequences can be inserted at a distance from one another. For example, successive recorder sequences can be inserted and separated by 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, or greater than 100 bp. In some examples, successive recorder sequences are separated by about 10, 50, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1100, 1200, 1300, 1400, 1500, or greater than 1500 bp.


Successive recorder sequences can be separated by any desired number of base pairs, and the separation can be dependent on and limited by the number of successive recorder sequences to be inserted, the size of the target nucleic acid or target genomes, and/or the design of the desired final recorder sequence. For example, if the compiled recorder sequence is a functional screenable or selectable marker, then the successive recording sequences can be inserted within proximity and within the same reading frame from one another. If the compiled recorder sequence is a unique set of barcodes to be identified by sequencing and has no coding sequence element, then the successive recorder sequences can be inserted with any desired number of base pairs separating them. In these cases, the separation distance can be dependent on the sequencing technology to be used and the read length limit.
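The dependence of the separation distance on read length can be illustrated with simple arithmetic. The sketch below assumes, for illustration only, that all recorder sequences have the same length, that the spacing between neighbors is uniform, and that the whole recorder array should fit inside a single read; none of these assumptions or numbers come from the disclosure.

```python
def recorder_array_span(n_recorders, recorder_len_bp, spacing_bp):
    """Total span in bp of n equally sized, equally spaced recorder sequences."""
    if n_recorders < 1 or recorder_len_bp < 1 or spacing_bp < 0:
        raise ValueError("invalid recorder array parameters")
    return n_recorders * recorder_len_bp + (n_recorders - 1) * spacing_bp


def fits_in_read(n_recorders, recorder_len_bp, spacing_bp, read_length_bp):
    """True if the whole recorder array can be covered by one read."""
    span = recorder_array_span(n_recorders, recorder_len_bp, spacing_bp)
    return span <= read_length_bp


# Hypothetical design: 15 bp barcodes spaced 10 bp apart, 150 bp reads.
print(recorder_array_span(5, 15, 10))
print(fits_in_read(5, 15, 10, read_length_bp=150))
print(fits_in_read(10, 15, 10, read_length_bp=150))
```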


While preferred embodiments of the present invention have been shown and described herein, it will be obvious to those skilled in the art that such embodiments are provided by way of example only. Numerous variations, changes, and substitutions will now occur to those skilled in the art without departing from the invention. It should be understood that various alternatives to the embodiments of the invention described herein may be employed in practicing the invention. It is intended that the following claims define the scope of the invention and that methods and structures within the scope of these claims and their equivalents be covered thereby.


Some Definitions

As used herein the term “wild type” is a term of the art understood by skilled persons and means the typical form of an organism, strain, gene or characteristic as it occurs in nature as distinguished from mutant or variant forms.


As used herein the term “variant” should be taken to mean the exhibition of qualities that have a pattern that deviates from what occurs in nature.


The terms “orthologue” (also referred to as “ortholog” herein) and “homologue” (also referred to as “homolog” herein) are well known in the art. By means of further guidance, a “homologue” of a protein as used herein is a protein of the same species which performs the same or a similar function as the protein it is a homologue of. Homologous proteins may but need not be structurally related, or are only partially structurally related. An “orthologue” of a protein as used herein is a protein of a different species which performs the same or a similar function as the protein it is an orthologue of. Orthologous proteins may but need not be structurally related, or are only partially structurally related. Homologs and orthologs may be identified by homology modelling (see, e.g., Greer, Science vol. 228 (1985) 1055, and Blundell et al. Eur J Biochem vol 172 (1988), 513) or “structural BLAST” (Dey F, Cliff Zhang Q, Petrey D, Honig B. Toward a “structural BLAST”: using structural relationships to infer function. Protein Sci. 2013 April; 22(4):359-66. doi: 10.1002/pro.2225.).


The terms “polynucleotide”, “nucleotide”, “nucleotide sequence”, “nucleic acid” and “oligonucleotide” are used interchangeably. They refer to a polymeric form of nucleotides of any length, either deoxyribonucleotides or ribonucleotides, or analogs thereof. Polynucleotides may have any three dimensional structure, and may perform any function, known or unknown. The following are non-limiting examples of polynucleotides: coding or non-coding regions of a gene or gene fragment, loci (locus) defined from linkage analysis, exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, short interfering RNA (siRNA), short-hairpin RNA (shRNA), micro-RNA (miRNA), ribozymes, cDNA, recombinant polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, isolated RNA of any sequence, nucleic acid probes, and primers. The term also encompasses nucleic-acid-like structures with synthetic backbones, see, e.g., Eckstein, 1991; Baserga et al., 1992; Milligan, 1993; WO 97/03211; WO 96/39154; Mata, 1997; Strauss-Soukup, 1997; and Samstag, 1996. A polynucleotide may comprise one or more modified nucleotides, such as methylated nucleotides and nucleotide analogs. If present, modifications to the nucleotide structure may be imparted before or after assembly of the polymer. The sequence of nucleotides may be interrupted by non-nucleotide components. A polynucleotide may be further modified after polymerization, such as by conjugation with a labeling component.


“Complementarity” refers to the ability of a nucleic acid to form hydrogen bond(s) with another nucleic acid sequence by either traditional Watson-Crick base pairing or other non-traditional types. A percent complementarity indicates the percentage of residues in a nucleic acid molecule which can form hydrogen bonds (e.g., Watson-Crick base pairing) with a second nucleic acid sequence (e.g., 5, 6, 7, 8, 9, 10 out of 10 being 50%, 60%, 70%, 80%, 90%, and 100% complementary). “Perfectly complementary” means that all the contiguous residues of a nucleic acid sequence will hydrogen bond with the same number of contiguous residues in a second nucleic acid sequence. “Substantially complementary” as used herein refers to a degree of complementarity that is at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, 99%, or 100% over a region of 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50, or more nucleotides, or refers to two nucleic acids that hybridize under stringent conditions.
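The percent complementarity definition above can be illustrated with a short sketch. It assumes, for illustration, that both strands are written 5' to 3', are the same length, pair in antiparallel orientation, and that only Watson-Crick pairs are counted; the sequences are hypothetical.

```python
# Watson-Crick pairs for DNA and RNA bases.
WATSON_CRICK = {
    ("A", "T"), ("T", "A"), ("G", "C"), ("C", "G"), ("A", "U"), ("U", "A"),
}


def percent_complementarity(query, target):
    """Percent of query residues that pair with the antiparallel target."""
    query, target = query.upper(), target.upper()
    if not query or len(query) != len(target):
        raise ValueError("sequences must be non-empty and of equal length")
    # Reverse the target so that position i of the query faces its partner.
    paired = sum((q, t) in WATSON_CRICK for q, t in zip(query, reversed(target)))
    return 100.0 * paired / len(query)


query = "ACGTACGTAC"
print(percent_complementarity(query, "GTACGTACGT"))  # exact reverse complement
print(percent_complementarity(query, "GTACGTACAA"))  # two mismatched positions
```

With 10 residues, 10 paired positions give 100% and 8 paired positions give 80%, matching the "out of 10" examples in the definition.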


As used herein, “stringent conditions” for hybridization refer to conditions under which a nucleic acid having complementarity to a target sequence predominantly hybridizes with the target sequence, and substantially does not hybridize to non-target sequences. Stringent conditions are generally sequence-dependent, and vary depending on a number of factors. In general, the longer the sequence, the higher the temperature at which the sequence specifically hybridizes to its target sequence. Non-limiting examples of stringent conditions are described in detail in Tijssen (1993), Laboratory Techniques In Biochemistry And Molecular Biology-Hybridization With Nucleic Acid Probes Part I, Second Chapter “Overview of principles of hybridization and the strategy of nucleic acid probe assay”, Elsevier, N.Y. Where reference is made to a polynucleotide sequence, then complementary or partially complementary sequences are also envisaged. These are preferably capable of hybridizing to the reference sequence under highly stringent conditions. Generally, in order to maximize the hybridization rate, relatively low-stringency hybridization conditions are selected: about 20 to 25 degrees Celsius lower than the thermal melting point (Tm). The Tm is the temperature at which 50% of specific target sequence hybridizes to a perfectly complementary probe in solution at a defined ionic strength and pH. Generally, in order to require at least about 85% nucleotide complementarity of hybridized sequences, highly stringent washing conditions are selected to be about 5 to 15 degrees Celsius lower than the Tm. In order to require at least about 70% nucleotide complementarity of hybridized sequences, moderately-stringent washing conditions are selected to be about 15 to 30 degrees Celsius lower than the Tm. Highly permissive (very low stringency) washing conditions may be as low as 50 degrees Celsius below the Tm, allowing a high level of mis-matching between hybridized sequences.
Those skilled in the art will recognize that other physical and chemical parameters in the hybridization and wash stages can also be altered to affect the outcome of a detectable hybridization signal from a specific level of homology between target and probe sequences.
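The temperature offsets stated above can be turned into concrete windows once a Tm is known. The sketch below only applies the offsets given in the text; it does not estimate the Tm itself, and the function name, condition labels, and the example Tm of 70 degrees Celsius are hypothetical.

```python
# Offsets below Tm, in degrees Celsius, as (smallest, largest) per condition.
OFFSETS_BELOW_TM = {
    "hybridization": (20, 25),
    "high_stringency_wash": (5, 15),
    "moderate_stringency_wash": (15, 30),
}


def temperature_window(tm_celsius, condition):
    """Return (lowest, highest) temperature in degrees Celsius for a condition."""
    smallest, largest = OFFSETS_BELOW_TM[condition]
    return (tm_celsius - largest, tm_celsius - smallest)


# Hypothetical probe with a measured Tm of 70 degrees Celsius.
for name in OFFSETS_BELOW_TM:
    print(name, temperature_window(70, name))
```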


“Hybridization” refers to a reaction in which one or more polynucleotides react to form a complex that is stabilized via hydrogen bonding between the bases of the nucleotide residues. The hydrogen bonding may occur by Watson-Crick base pairing, Hoogsteen binding, or in any other sequence specific manner. The complex may comprise two strands forming a duplex structure, three or more strands forming a multi stranded complex, a single self-hybridizing strand, or any combination of these. A hybridization reaction may constitute a step in a more extensive process, such as the initiation of PCR, or the cleavage of a polynucleotide by an enzyme. A sequence capable of hybridizing with a given sequence is referred to as the “complement” of the given sequence.


As used herein, the term “genomic locus” or “locus” (plural loci) is the specific location of a gene or DNA sequence on a chromosome. A “gene” refers to stretches of DNA or RNA that encode a polypeptide or an RNA chain that has a functional role to play in an organism and hence is the molecular unit of heredity in living organisms. For the purpose of this invention it may be considered that genes include regions which regulate the production of the gene product, whether or not such regulatory sequences are adjacent to coding and/or transcribed sequences. Accordingly, a gene includes, but is not necessarily limited to, promoter sequences, terminators, translational regulatory sequences such as ribosome binding sites and internal ribosome entry sites, enhancers, silencers, insulators, boundary elements, replication origins, matrix attachment sites and locus control regions.


As used herein, “expression of a genomic locus” or “gene expression” is the process by which information from a gene is used in the synthesis of a functional gene product. The products of gene expression are often proteins, but in non-protein coding genes such as rRNA genes or tRNA genes, the product is functional RNA. The process of gene expression is used by all known life, including eukaryotes (including multicellular organisms), prokaryotes (bacteria and archaea), and viruses, to generate functional products to survive. As used herein “expression” of a gene or nucleic acid encompasses not only cellular gene expression, but also the transcription and translation of nucleic acid(s) in cloning systems and in any other context. As used herein, “expression” also refers to the process by which a polynucleotide is transcribed from a DNA template (such as into an mRNA or other RNA transcript) and/or the process by which a transcribed mRNA is subsequently translated into peptides, polypeptides, or proteins. Transcripts and encoded polypeptides may be collectively referred to as “gene product.” If the polynucleotide is derived from genomic DNA, expression may include splicing of the mRNA in a eukaryotic cell.


The terms “polypeptide”, “peptide” and “protein” are used interchangeably herein to refer to polymers of amino acids of any length. The polymer may be linear or branched, it may comprise modified amino acids, and it may be interrupted by non amino acids. The terms also encompass an amino acid polymer that has been modified; for example, disulfide bond formation, glycosylation, lipidation, acetylation, phosphorylation, or any other manipulation, such as conjugation with a labeling component. As used herein the term “amino acid” includes natural and/or unnatural or synthetic amino acids, including glycine and both the D or L optical isomers, and amino acid analogs and peptidomimetics.


As used herein, the term “domain” or “protein domain” refers to a part of a protein sequence that may exist and function independently of the rest of the protein chain.


As described in aspects of the invention, sequence identity is related to sequence homology. Homology comparisons may be conducted by eye, or more usually, with the aid of readily available sequence comparison programs. These commercially available computer programs may calculate percent (%) homology between two or more sequences and may also calculate the sequence identity shared by two or more amino acid or nucleic acid sequences. Sequence homologies may be generated by any of a number of computer programs known in the art, for example BLAST or FASTA, etc. A suitable computer program for carrying out such an alignment is the GCG Wisconsin Bestfit package (University of Wisconsin, U.S.A.; Devereux et al., 1984, Nucleic Acids Research 12:387). Examples of other software that may perform sequence comparisons include, but are not limited to, the BLAST package (see Ausubel et al., 1999 ibid, Chapter 18), FASTA (Altschul et al., 1990, J. Mol. Biol., 403-410) and the GENEWORKS suite of comparison tools. Both BLAST and FASTA are available for offline and online searching (see Ausubel et al., 1999 ibid, pages 7-58 to 7-60). However it is preferred to use the GCG Bestfit program.


Percent homology may be calculated over contiguous sequences, i.e., one sequence is aligned with the other sequence and each amino acid or nucleotide in one sequence is directly compared with the corresponding amino acid or nucleotide in the other sequence, one residue at a time. This is called an “ungapped” alignment. Typically, such ungapped alignments are performed only over a relatively short number of residues.


Although this is a very simple and consistent method, it fails to take into consideration that, for example, in an otherwise identical pair of sequences, one insertion or deletion may cause the following amino acid residues to be put out of alignment, thus potentially resulting in a large reduction in % homology when a global alignment is performed. Consequently, most sequence comparison methods are designed to produce optimal alignments that take into consideration possible insertions and deletions without unduly penalizing the overall homology or identity score. This is achieved by inserting “gaps” in the sequence alignment to try to maximize local homology or identity.
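The effect described above can be seen in a small sketch of an ungapped comparison. The peptide sequences are hypothetical, and dividing the match count by the number of compared positions is one assumed convention for the percentage.

```python
def ungapped_percent_identity(seq_a, seq_b):
    """Compare residues position by position, with no gaps allowed."""
    compared = min(len(seq_a), len(seq_b))
    if compared == 0:
        raise ValueError("both sequences must be non-empty")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / compared


reference = "MKTAYIAKQR"
identical = "MKTAYIAKQR"
one_insertion = "MKGTAYIAKQR"  # a single G inserted after the second residue

print(ungapped_percent_identity(reference, identical))
print(ungapped_percent_identity(reference, one_insertion))
```

A single inserted residue shifts every downstream position out of register, so only the two residues before the insertion still match. This is the situation that gapped alignment methods are designed to handle.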


However, these more complex methods assign “gap penalties” to each gap that occurs in the alignment so that, for the same number of identical amino acids, a sequence alignment with as few gaps as possible, reflecting higher relatedness between the two compared sequences, may achieve a higher score than one with many gaps. “Affine gap costs” are typically used that charge a relatively high cost for the existence of a gap and a smaller penalty for each subsequent residue in the gap. This is the most commonly used gap scoring system. High gap penalties may, of course, produce optimized alignments with fewer gaps. Most alignment programs allow the gap penalties to be modified. However, it is preferred to use the default values when using such software for sequence comparisons. For example, when using the GCG Wisconsin Bestfit package the default gap penalty for amino acid sequences is −12 for a gap and −4 for each extension.
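A gap scoring scheme of this kind can be sketched as follows, using the −12 and −4 values quoted above. The sketch assumes that the first residue of a gap is charged the gap cost and each subsequent residue is charged the extension cost; some programs instead charge the extension cost for the first residue as well, so the exact totals are illustrative only.

```python
GAP_OPEN = -12    # charged once for the existence of a gap
GAP_EXTEND = -4   # charged for each subsequent residue in the gap


def gap_score(gap_length, gap_open=GAP_OPEN, gap_extend=GAP_EXTEND):
    """Score contribution of a single gap of the given length."""
    if gap_length < 1:
        raise ValueError("gap_length must be at least 1")
    return gap_open + gap_extend * (gap_length - 1)


def total_gap_score(gap_lengths):
    """Sum of the score contributions for all gaps in an alignment."""
    return sum(gap_score(length) for length in gap_lengths)


# Three gapped residues placed as one long gap versus three separate gaps.
print(total_gap_score([3]))
print(total_gap_score([1, 1, 1]))
```

Under this convention, one gap of three residues is penalized less than three single-residue gaps, which is why alignments with fewer gaps tend to score higher for the same number of identical residues.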


Calculation of maximum % homology therefore first requires the production of an optimal alignment, taking into consideration gap penalties. A suitable computer program for carrying out such an alignment is the GCG Wisconsin Bestfit package (Devereux et al., 1984 Nuc. Acids Research 12 p387). Examples of other software that may perform sequence comparisons include, but are not limited to, the BLAST package (see Ausubel et al., 1999 Short Protocols in Molecular Biology, 4th Ed., Chapter 18), FASTA (Altschul et al., 1990 J. Mol. Biol. 403-410) and the GENEWORKS suite of comparison tools. Both BLAST and FASTA are available for offline and online searching (see Ausubel et al., 1999, Short Protocols in Molecular Biology, pages 7-58 to 7-60). However, for some applications, it is preferred to use the GCG Bestfit program. A new tool, called BLAST 2 Sequences, is also available for comparing protein and nucleotide sequences (see FEMS Microbiol Lett. 1999 174(2): 247-50; FEMS Microbiol Lett. 1999 177(1): 187-8 and the website of the National Center for Biotechnology Information at the website of the National Institutes of Health).


Although the final % homology may be measured in terms of identity, the alignment process itself is typically not based on an all-or-nothing pair comparison. Instead, a scaled similarity score matrix is generally used that assigns scores to each pair-wise comparison based on chemical similarity or evolutionary distance. An example of such a matrix commonly used is the BLOSUM62 matrix—the default matrix for the BLAST suite of programs. GCG Wisconsin programs generally use either the public default values or a custom symbol comparison table, if supplied (see user manual for further details). For some applications, it is preferred to use the public default values for the GCG package, or in the case of other software, the default matrix, such as BLOSUM62.


Alternatively, percentage homologies may be calculated using the multiple alignment feature in DNASIS™ (Hitachi Software), based on an algorithm, analogous to CLUSTAL (Higgins D G & Sharp P M (1988), Gene 73(1), 237-244). Once the software has produced an optimal alignment, it is possible to calculate % homology, preferably % sequence identity. The software typically does this as part of the sequence comparison and generates a numerical result.
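As a minimal sketch of how % sequence identity may be computed once an alignment has been produced, the following Python function counts identical positions over the aligned columns. The function name, the use of "-" as the gap symbol, and the choice of denominator (all alignment columns other than columns that are gaps in both sequences) are assumptions for illustration; alignment packages differ in how they define the denominator.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity between two rows of a pairwise alignment.

    Both inputs must already be aligned (same length, gaps marked with
    `gap`). Columns that are gaps in both rows are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    matches = 0
    compared = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap and b == gap:
            continue  # column carries no information for this pair
        compared += 1
        if a == b:
            matches += 1  # identical residues (a gap cannot match here)
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


# Hypothetical toy alignment: 6 identical positions out of 8 columns.
identity = percent_identity("ACDEFGHK", "ACDE-GHR")
```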


Sequences may also have deletions, insertions or substitutions of amino acid residues which produce a silent change and result in a functionally equivalent substance. Deliberate amino acid substitutions may be made on the basis of similarity in amino acid properties (such as polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues) and it is therefore useful to group amino acids together in functional groups. Amino acids may be grouped together based on the properties of their side chains alone. However, it is more useful to include mutation data as well. The sets of amino acids thus derived are likely to be conserved for structural reasons. These sets may be described in the form of a Venn diagram (Livingstone C. D. and Barton G. J. (1993) “Protein sequence alignments: a strategy for the hierarchical analysis of residue conservation” Comput. Appl. Biosci. 9: 745-756) (Taylor W. R. (1986) “The classification of amino acid conservation” J. Theor. Biol. 119; 205-218). Conservative substitutions may be made, for example according to the table below which describes a generally accepted Venn diagram grouping of amino acids.


Embodiments of the invention include sequences (both polynucleotide and polypeptide) which may comprise homologous substitution (substitution and replacement are both used herein to mean the interchange of an existing amino acid residue or nucleotide with an alternative residue or nucleotide), i.e., like-for-like substitution in the case of amino acids, such as basic for basic, acidic for acidic, polar for polar, etc. Non-homologous substitution may also occur, i.e., from one class of residue to another, or alternatively may involve the inclusion of unnatural amino acids such as ornithine (hereinafter referred to as Z), diaminobutyric acid ornithine (hereinafter referred to as B), norleucine ornithine (hereinafter referred to as O), pyridylalanine, thienylalanine, naphthylalanine and phenylglycine.


Variant amino acid sequences may include suitable spacer groups that may be inserted between any two amino acid residues of the sequence including alkyl groups such as methyl, ethyl or propyl groups in addition to amino acid spacers such as glycine or β-alanine residues. A further form of variation, which involves the presence of one or more amino acid residues in peptoid form, may be well understood by those skilled in the art. For the avoidance of doubt, “the peptoid form” is used to refer to variant amino acid residues wherein the α-carbon substituent group is on the residue's nitrogen atom rather than the α-carbon. Processes for preparing peptides in the peptoid form are known in the art, for example Simon R J et al., PNAS (1992) 89(20), 9367-9371 and Horwell D C, Trends Biotechnol. (1995) 13(4), 132-134.


The practice of the present invention employs, unless otherwise indicated, conventional techniques of immunology, biochemistry, chemistry, molecular biology, microbiology, cell biology, genomics and recombinant DNA, which are within the skill of the art. See Green and Sambrook (Molecular Cloning: A Laboratory Manual, 4th ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 2014); CURRENT PROTOCOLS IN MOLECULAR BIOLOGY (F. M. Ausubel, et al. eds. (2017)); Short Protocols in Molecular Biology (Ausubel et al., 1999); the series METHODS IN ENZYMOLOGY (Academic Press, Inc.); PCR 2: A PRACTICAL APPROACH (M. J. MacPherson, B. D. Hames and G. R. Taylor eds. (1995)); ANTIBODIES, A LABORATORY MANUAL, SECOND EDITION (Harlow and Lane, eds. (2014)); and CULTURE OF ANIMAL CELLS: A MANUAL OF BASIC TECHNIQUE, 7TH EDITION (R. I. Freshney, ed. (2016)).


EXAMPLES

The following examples are given for the purpose of illustrating various embodiments of the invention and are not meant to limit the present invention in any fashion. The present examples, along with the methods described herein are presently representative of preferred embodiments, are exemplary, and are not intended as limitations on the scope of the invention. Changes therein and other uses which are encompassed within the spirit of the invention as defined by the scope of the claims will occur to those skilled in the art.


Example 1. Nucleic Acid-Guided Nucleases

Sequences for twenty nucleic acid-guided nucleases, termed MAD1-MAD20 (SEQ ID NOs 1-20), were aligned and compared to other nucleic acid-guided nucleases. A partial alignment and phylogenetic tree are depicted in FIG. 1A and FIG. 1B, respectively. Key residues that may be involved in the recognition of a PAM site are shown in FIG. 1A. These include amino acids at positions 167, 539, 548, 599, 603, 604, 605, 606, and 607.


Sequence alignments were built using PSI-BLAST to search for MAD nuclease homologs in the NCBI non-redundant databases. Multiple sequence alignments were further refined using the MUSCLE alignment algorithm with default settings as implemented in Geneious 10. The percent identity of each homolog to the SpCas9 and AsCpf1 reference sequences was computed based on the pairwise alignment matching from these global alignments.


Genomic source sequences were identified using Uniprot linkage information or TBLASTN searches of NCBI using the default parameters and searching all possible frames for translational matches.


Percent identities of MAD1-8 and MAD10-12 to various other nucleases are summarized in Table 1. These percent identities represent the shared amino acid sequence identity between the indicated proteins.




















TABLE 1

Protein identifier or accession number  MAD1   MAD2   MAD3   MAD4   MAD5   MAD6   MAD7   MAD8   MAD10  MAD11  MAD12
gi|1025734861|pdb|5B43|A                6.4    32.8   33.2   29.7   29.4   31.1   30.3   31.7   26.7   27.9   98.8
gi|1052245173|pdb|5KK5|A                6.4    32.7   33.1   29.7   29.3   31     30.3   31.7   26.7   27.8   98.7
gi|1086216683|emb|SDC16215.1|           6.1    33     34.4   29.6   30.1   33.5   32.3   32.1   26.2   27.2   46.8
gi|1120175333|ref|WP_073043853.1|       5.9    30.9   37.2   32.8   33.6   34.4   35.7   35.1   26.3   28.3   34.9
Cpf1.Sj|WP_081839471                    6.6    33.6   41.7   37.2   33.4   37.6   40.1   37.7   29.1   30.3   34.1
Cpf1.Ss|KFO67989                        6.9    32.3   35.7   43     33.7   45.9   34.8   48     33.2   33.4   33.8
MAD3                                    5.8    31     100    32.9   35.9   35     35.6   34.3   28     27.6   33.1
gi|1082474576|gb|OFY19591.1|            7      31.4   35.9   43.2   31.4   45     33.6   48.6   30.8   33.5   33
MAD2                                    6.1    100    31     30.7   30.2   31     31.2   31.2   25.8   27.7   32.6
Cpf1.Lb5|WP_016301126                   7.8    32.8   36.5   38.2   34.2   45.5   35.8   43.6   30.7   35.7   32.5
gi|1088286736|gb|OHB41002.1|            6.7    30.6   35.3   42.4   33.2   44.7   32.1   46.8   30.7   32.6   32.4
gi|1094423310|emb|SER03894.1|           6.8    30.8   36.1   40.4   31.8   50.4   35.2   46.6   30.4   36.8   32.3
gi|493326531|ref|WP_006283774.1|        6.8    30.8   36.1   40.3   31.8   50.3   35.1   46.6   30.4   36.8   32.3
MAD8                                    7.6    31.2   34.3   40.4   32     41.6   32.8   100    30.1   32.1   31.7
Cpf1.Bot|WP_009217842                   6.9    30.1   36.6   41.5   32.5   50.2   35.4   45.5   29.8   34.1   31.6
Cpf1.Li|WP_020988726                    7.3    30.2   34.6   39.3   30.3   40.7   31.8   39.4   32.1   31.3   31.5
Cpf1.Pb|WP_044110123                    6.3    31.4   31.8   36.1   30.8   45.7   30.4   39.4   27.7   33.5   31.5
gi|817911372|gb|AKG08867.1|             7.3    29.8   35     40.7   32.1   40.3   32.6   41.7   29.1   31     31.4
gi|1052838533|emb|SCH45297.1|           6.6    30.8   35.5   32     31.5   34.4   51.9   33.4   26.1   29     31.3
gi|1053713332|ref|WP_066040075.1|       7.2    29.6   33.2   39.6   29.8   49.1   32.2   41.4   30.1   32.4   31.3
gi|817909002|gb|AKG06878.1|             7.3    29.8   35     40.7   32     40.3   32.5   41.6   29.1   30.9   31.3
gi|1042201477|ref|WP_065256572.1|       7.2    29.5   35.2   40.6   31.9   40.1   32.7   41.6   29     30.8   31.2
MAD6                                    7.5    31     35     38.9   33.1   100    34.3   41.6   30.5   33.6   31
gi|490468773|ref|WP_004339290.1|        6.8    31.8   31.7   36.2   28.6   36.5   31.4   38.4   28.5   31.4   31
gi|565853704|ref|WP_023936172.1|        7.5    30.8   34.9   38.9   33.1   99.7   34.1   41.6   30.4   33.6   31
gi|739005707|ref|WP_036887416.1|        7.5    30.9   35     38.9   33     99.9   34.2   41.5   30.4   33.5   31
gi|739008549|ref|WP_036890108.1|        7.5    31     35     38.8   33     99.8   34.2   41.5   30.4   33.5   31
Cpf1.Ft|WP_014550095                    7.1    31.9   33.8   40.3   29.7   39.4   34.1   41     29.8   32.5   30.8
gi|0504362993|ref|WP_014550095.1|       7.2    32.4   33.8   40.3   29.6   39.4   33.8   40.9   30.1   32.5   30.8
gi|0640557447|ref|WP_024988992.1|       6.6    31.4   34.8   40.7   31.2   48     34.1   45.1   28.8   35.2   30.8
gi|1098944113|ref|WP_071304624.1|       7.1    32.3   33.5   40.3   29.6   39.2   33.8   40.9   30.1   32.5   30.6
gi|0489124848|ref|WP_003034647.1|       7.1    32.3   33.9   40.9   29.9   39.2   33.9   40.9   29.9   32.2   30.6
gi|738967776|ref|WP_036851563.1|        6.8    29.4   33.1   35.5   28.9   40.3   30.7   35.9   28.7   31.3   30.5
MAD7                                    5.9    31.2   35.6   30.8   33.9   34.3   100    32.8   24.2   28.9   30.5
Cpf1.Lb6|WP_044910713                   6.7    29.8   33.7   36.6   30.9   43     34     39.8   29.1   32.1   30.4
gi|1052961977|emb|SCH47915.1|           5.5    30.5   35.8   32.3   34     35     53.8   33.4   26.2   27.4   30.4
gi|817918353|gb|AKG14689.1|             7      29.1   34.4   39.8   31.7   40     32.4   41.1   28.4   30.1   30.3
gi|917059416|ref|WP_051666128.1|        6.9    29.9   31.5   35.7   31.6   41.8   32.9   39.1   30.1   34     30.2
gi|1011649201|ref|WP_062499108.1|       6.8    29     34.7   40.3   31.4   40.1   33.1   41.6   28.5   30.4   30.1
Cpf1.Pm|WP_018359861                    6.3    29.2   32.3   34.2   27.4   38.7   29.4   35     27.2   30.1   30
gi|817922537|gb|AKG18099.1|             6.8    29.1   34.5   39.6   31.5   39.9   32.7   40.7   28.3   29.8   30
gi|769142322|ref|WP_044919442.1|        6.7    31     34.6   37.8   31.5   41.4   33.3   39.2   28     31.9   29.9
gi|1023176441|pdb|5ID6|A                6.7    29.7   31.3   35.5   31.3   41     32.6   38.5   29.7   33.3   29.8
gi|0491540987|ref|WP_005398606.1|       5.9    28.3   30.4   29.7   28.5   29     30.7   29.8   25.8   27.8   29.8
gi|652820612|ref|WP_027109509.1|        6.4    31.1   34     35.3   31.7   40.3   33.4   37.5   28.5   33.3   29.8
gi|502240446|ref|WP_012739647.1|        5.9    31.6   36.1   31.2   33     35.4   49.4   34     26.6   29.4   29.7
gi|524278046|emb|CDA41776.1|            5.8    31.6   36     31     33     35.4   50     34     26.6   29.5   29.7
gi|737831580|ref|WP_035798880.1|        6.2    31.3   34.8   38.1   31.5   42.1   33     39.6   28.4   32.4   29.7
gi|909652572|ref|WP_049895985.1|        6.9    30.7   34.2   37.2   30.8   41.5   34.2   38.7   28     32     29.7
MAD4                                    6.7    30.7   32.9   100    30.7   38.9   30.8   40.4   28.8   29.4   29.7
gi|942073049|ref|WP_055286279.1|        5.9    31.6   36.1   31.1   32.7   35     49.7   33.9   27.1   29.5   29.6
gi|654794505|ref|WP_028248456.1|        7.4    30.5   35.9   37.4   31.3   42.8   34.2   40.2   27.9   33.5   29.5
gi|933014786|emb|CUO47728.1|            5.6    31.3   34.9   31.2   31.5   32.4   46.7   30.6   25.4   27.7   29.4
gi|941887450|ref|WP_055224182.1|        5.6    31.4   35     31.3   31.6   32.5   46.6   30.7   25.3   27.8   29.4
gi|920071674|ref|WP_052943011.1|        6.3    31     31.8   38.8   31.8   41.3   33.8   42.6   29.8   34.7   29
MAD5                                    5.1    30.2   35.9   30.7   100    33.1   33.9   32     24.3   28.7   29
gi|1081462674|emb|SCZ76797.1|           6.9    30.4   33.5   34.7   29.7   40.1   30.5   37.4   27.3   32.5   28.9
gi|918722523|ref|WP_052585281.1|        7.4    27.5   30.5   35.7   28.3   35.2   28.5   36     26     27.1   28.8
gi|524816323|emb|CDF09621.1|            6.2    30     34.1   29.3   31.2   32.7   47.6   32.2   25.5   25.9   28.4
gi|941782328|ref|WP_055176369.1|        6.2    30.2   33.1   28.9   30.9   32     46.9   32.1   26     27.1   28.4
gi|942113296|ref|WP_055306762.1|        6.4    29.8   33.8   29.7   31.3   33.1   48     32.5   25.8   26.2   28.4
MAD11                                   6.4    27.7   27.6   29.4   28.7   33.6   28.9   32.1   26.2   100    27.8
gi|653158548|ref|WP_027407524.1|        5.9    26.4   28.1   33.5   27.4   32.5   27.8   32     27     26.8   27.6
gi|652963004|ref|WP_027216152.1|        6.6    30.3   32.5   33.2   30.4   38.2   29.6   34.6   25.9   30.5   27.2
gi|1083069650|gb|OGD68774.1|            6.2    25     24.3   26.6   23.1   28.1   23.2   26.4   45     24.9   27.1
gi|302483275|gb|EFL46285.1|             5.6    24.7   26.8   30.3   24.9   34.8   26     30.4   24.4   27.5   27.1
gi|915400855|ref|WP_050786240.1|        5.6    24.7   26.8   30.3   24.9   34.8   26     30.4   24.4   27.5   27.1
MAD10                                   5.6    25.8   28     28.8   24.3   30.5   24.2   30.1   100    26.2   26.6
gi|1101117967|gb|OIO75780.1|            6.1    26.8   26     27.3   24.3   28.1   24.4   28.2   44.1   25.4   26.1
g|11088204458|gb|OHA63117.1|            6.5    25.2   23.5   25.8   22.9   27     22     26.1   36.5   24.2   24.7
gi|809198071|ref|WP_046328599.1|        4.9    25.6   26.5   22.2   23.9   23.8   25.8   23.9   20.3   25.1   24
gi|1088079929|gb|OGZ45678.1|            5.6    21.9   23.8   26.9   23.4   27.8   23.3   26.7   28.8   24.7   23.5
gi|1101053499|gb|OIO15737.1|            5.9    23.1   26.2   25.2   23     26.4   25.1   26.5   29.2   23.2   23.4
gi|1101058058|gb|OIO19978.1|            5.4    21.2   22.8   23.6   20.6   25     20.7   25     25.9   22.2   23
gi|1088000848|gb|OGY73485.1|            5.7    23.5   25.2   25.5   23.9   27     25.1   25.6   31.6   23.6   22.9
gi|407014433|gb|EKE28449.1|             5.2    23.5   25.9   26.7   24.3   25.8   23     27.8   29.9   25.3   22.9
gi|818249855|gb|KKP36646.1|             6      21     20.7   23.5   20     24.2   21     24     24.6   21.8   22.6
gi|818703647|gb|KKT48220.1|             5.8    23.3   25     25.1   23.5   26.5   24.7   25.3   31.2   23.3   22.6
gi|818705786|gb|KKT50231.1|             5.8    23.1   24.6   24.7   22.9   26.2   24.2   24.8   30.8   22.9   22.2
gi|1083950632|gb|OGJ66851.1|            4.5    20     22.1   23.5   20.6   24.6   20     24     23.5   20.7   22.1
gi|1083932199|gb|OGJ49885.1|            6      20.4   20.2   22.6   19.3   23.3   20.6   23.2   23.9   21     21.8
gi|1083410735|gb|OGF20863.1|            5      21.7   23.3   25.5   23     25     22.7   25.9   27.2   22.4   21.5
gi|1011480927|ref|WP_062376669.1|       4.7    20.1   20.1   21.4   19.3   23.3   21.4   22     20.2   19.7   20.9
gi|818539593|gb|KKR91555.1|             5.1    19.8   21.6   22.1   20.5   22.9   21.2   22.8   24     20.5   19.9
gi|503048015|ref|WP_013282991.1|        5.1    18.8   20.7   15.3   19.7   18.9   19.3   17.7   15.9   19     19.2
gi|1096232746|ref|WP_071177645.1|       5      19.1   20.5   17.4   20.1   19.7   20.4   20.4   17.5   18.5   18.9
gi|769130404|ref|WP_044910712.1|        4.6    19.4   18.2   16.1   18.1   17.1   18.7   17.9   14.5   16.8   17.5
gi|1085569500|gb|OGX23684.1|            2.6    11.6   12.1   12.7   10.2   12.1   12.7   11.6   10.9   11.1   10.5
gi|818357062|gb|KKQ38176.1|             3.3    10     11.1   10.6   11.1   11.8   12.1   11.5   12.2   10.8   9.8
gi|745626763|gb|KIE18642.1|             3.7    9.4    11.7   11.1   11.1   12.5   11.9   11.9   10.2   10.6   8.8
MAD1                                    100    6.1    5.8    6.7    5.1    7.5    5.9    7.6    5.6    6.4    6.4
SpCas9                                  4      6.3    6.5    8.3    5.6    8.1    6.9    7.7    6.9    6.3    6.3
MAD12                                   6.4    32.6   33.1   29.7   29     31     30.5   31.7   26.6   27.8   100









Example 2. Expression of MAD Nucleases

Wild-type nucleic acid sequences for MAD1-MAD20 include SEQ ID NOs 21-40, respectively. These MAD nucleases were codon optimized for expression in E. coli and the codon optimized sequences are listed as SEQ ID NO: 41-60, respectively (summarized in Table 2).


Codon optimized MAD1-MAD20 were cloned into an expression construct comprising a constitutive or inducible promoter (e.g., proB promoter SEQ ID NO: 83, or pBAD promoter SEQ ID NO: 81 or SEQ ID NO: 82) and an optional 6×-His tag (e.g., FIG. 2). The generated MAD1-MAD20 expression constructs are provided as SEQ ID NOs: 61-80, respectively. The expression constructs as depicted in FIG. 2 were generated either by restriction/ligation-based cloning or homology-based cloning.


Example 3. Testing Guide Nucleic Acid Sequences Compatible with MAD Nucleases

In order to have a functioning targetable nuclease complex, a nucleic acid-guided nuclease and a compatible guide nucleic acid are needed. To determine the compatible guide nucleic acid sequence, specifically the scaffold sequence portion of the guide nucleic acid, multiple approaches were taken. First, scaffold sequences were sought near the endogenous loci of each MAD nuclease. In some cases, such as with MAD2, no endogenous scaffold sequence was found. Therefore, we tested the compatibility of MAD2 with scaffold sequences found near the endogenous loci of the other MAD nucleases. The MAD nucleases and corresponding endogenous scaffold sequences that were tested are listed in Table 2.













TABLE 2

MAD nuclease  WT nucleic acid sequence  Codon optimized nucleic acid sequence  Amino acid sequence  Endogenous scaffold sequence for guide nucleic acid
MAD1          SEQ ID NO: 21             SEQ ID NO: 41                          SEQ ID NO: 1         SEQ ID NO: 84
MAD2          SEQ ID NO: 22             SEQ ID NO: 42                          SEQ ID NO: 2         None identified
MAD3          SEQ ID NO: 23             SEQ ID NO: 43                          SEQ ID NO: 3         SEQ ID NO: 86
MAD4          SEQ ID NO: 24             SEQ ID NO: 44                          SEQ ID NO: 4         SEQ ID NO: 87
MAD5          SEQ ID NO: 25             SEQ ID NO: 45                          SEQ ID NO: 5         SEQ ID NO: 88
MAD6          SEQ ID NO: 26             SEQ ID NO: 46                          SEQ ID NO: 6         SEQ ID NO: 89
MAD7          SEQ ID NO: 27             SEQ ID NO: 47                          SEQ ID NO: 7         SEQ ID NO: 90
MAD8          SEQ ID NO: 28             SEQ ID NO: 48                          SEQ ID NO: 8         SEQ ID NO: 91
MAD9          SEQ ID NO: 29             SEQ ID NO: 49                          SEQ ID NO: 9         SEQ ID NO: 92; SEQ ID NO: 103; SEQ ID NO: 106
MAD10         SEQ ID NO: 30             SEQ ID NO: 50                          SEQ ID NO: 10        SEQ ID NO: 93
MAD11         SEQ ID NO: 31             SEQ ID NO: 51                          SEQ ID NO: 11        SEQ ID NO: 94
MAD12         SEQ ID NO: 32             SEQ ID NO: 52                          SEQ ID NO: 12        SEQ ID NO: 95
MAD13         SEQ ID NO: 33             SEQ ID NO: 53                          SEQ ID NO: 13        SEQ ID NO: 96; SEQ ID NO: 105; SEQ ID NO: 107
MAD14         SEQ ID NO: 34             SEQ ID NO: 54                          SEQ ID NO: 14        SEQ ID NO: 97
MAD15         SEQ ID NO: 35             SEQ ID NO: 55                          SEQ ID NO: 15        SEQ ID NO: 98
MAD16         SEQ ID NO: 36             SEQ ID NO: 56                          SEQ ID NO: 16        SEQ ID NO: 99
MAD17         SEQ ID NO: 37             SEQ ID NO: 57                          SEQ ID NO: 17        SEQ ID NO: 100
MAD18         SEQ ID NO: 38             SEQ ID NO: 58                          SEQ ID NO: 18        SEQ ID NO: 101
MAD19         SEQ ID NO: 39             SEQ ID NO: 59                          SEQ ID NO: 19        SEQ ID NO: 102
MAD20         SEQ ID NO: 40             SEQ ID NO: 60                          SEQ ID NO: 20        SEQ ID NO: 103









Editing cassettes as depicted in FIG. 3 were generated to assess the functionality of the MAD nucleases and corresponding guide nucleic acids. Each editing cassette comprises an editing sequence and a promoter operably linked to an encoded guide nucleic acid. The editing cassettes further comprise primer sites (P1 and P2) on flanking ends. The guide nucleic acids comprised various scaffold sequences to be tested, as well as a guide sequence to guide the MAD nuclease to the target sequence for editing. The editing sequences comprised a PAM mutation and/or codon mutation relative to the target sequence. The mutations were flanked by regions of homology (homology arms or HA) which would allow recombination into the cleaved target sequence (agcagctttatcatctgccg (SEQ ID No: 183); QQLYHLP (SEQ ID No: 184); agcagttataataactgccg (SEQ ID No: 186); and QQLLP (SEQ ID No: 206)).



FIG. 4 depicts an experimental design to test different MAD nuclease and guide nucleic acid combinations. An expression cassette encoding the MAD nuclease, or the MAD nuclease protein, was added to host cells along with various editing cassettes as described above. In this example, the guide nucleic acids were engineered to target the galK gene in the host cell, and the editing sequence was designed to mutate the targeted galK gene in order to turn the gene off, thereby allowing for screening of successfully edited cells. This design was used for identification of functional or compatible MAD nuclease and guide nucleic acid combinations. Editing efficiency was determined by qPCR to measure the editing plasmid in the recovered cells in a high-throughput manner. Validation of MAD11 and Cas9 primers is shown in FIGS. 14A and 14B. These results show that the selected primer pairs are orthogonal and allow quantitative measurement of input plasmid DNA.



FIGS. 5A-5B depict a similar experimental design. In this case, the editing cassette (FIG. 5B) further comprises a selectable marker, in this case kanamycin resistance (kan), and the MAD nuclease expression vector (FIG. 5A) further comprises a selectable marker, in this case chloramphenicol resistance (Cm), and the lambda RED recombination system to aid homologous recombination (HR) of the editing sequence into the target sequence. A compatible MAD nuclease and guide nucleic acid combination will cause a double strand break in the target sequence if a PAM sequence is present. Since the editing sequence (e.g., FIG. 3) contains a PAM mutation that is not recognized by the MAD nuclease, edited cells that contain the PAM mutation survive cleavage by the MAD nuclease, while wild-type non-edited cells die (FIG. 5C). The editing sequence further comprises a mutation in the galK gene that allows for screening of edited cells, while the MAD nuclease expression vector and editing cassette contain drug selection markers, allowing for selection of edited cells.


Using these methods, compatible guide nucleic acids for MAD1-MAD20 were tested. Twenty scaffold sequences were tested. The guide nucleic acids used in the experiments contained one of the twenty scaffold sequences, referred to as scaffold-1, scaffold-2, etc., and a guide sequence that targets the galK gene. Sequences for Scaffold-1 through Scaffold-20 are listed as SEQ ID NO: 84-103, respectively. It should be understood that the guide sequence of the guide nucleic acid is variable and can be engineered or designed to target any desired target sequence. Since MAD2 does not have an endogenous scaffold sequence to test, a scaffold sequence from a close homolog (scaffold-2, SEQ ID NO: 85) was tested and found to be a non-functional pair, meaning MAD2 and scaffold-2 were not compatible. Therefore, MAD2 was tested with the other nineteen scaffold sequences, despite the low sequence homology between MAD2 and the other MAD nucleases.
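For convenience, the "respectively" numbering used in this specification (amino acid sequences SEQ ID NOs 1-20, wild-type nucleic acid sequences SEQ ID NOs 21-40, codon optimized sequences SEQ ID NOs 41-60, expression constructs SEQ ID NOs 61-80, and Scaffold-1 through Scaffold-20 as SEQ ID NOs 84-103) may be expressed as a simple lookup. The following Python sketch is illustrative only; the function and dictionary key names are hypothetical and do not appear in the sequence listing.

```python
def mad_sequence_ids(n):
    """SEQ ID NOs associated with MADn, for n from 1 to 20."""
    if not 1 <= n <= 20:
        raise ValueError("MAD nuclease index must be between 1 and 20")
    return {
        "amino_acid": n,                          # SEQ ID NOs 1-20
        "wild_type_nucleic_acid": 20 + n,         # SEQ ID NOs 21-40
        "codon_optimized_nucleic_acid": 40 + n,   # SEQ ID NOs 41-60
        "expression_construct": 60 + n,           # SEQ ID NOs 61-80
    }


def scaffold_sequence_id(k):
    """SEQ ID NO of Scaffold-k, for k from 1 to 20 (SEQ ID NOs 84-103)."""
    if not 1 <= k <= 20:
        raise ValueError("scaffold index must be between 1 and 20")
    return 83 + k


mad7_ids = mad_sequence_ids(7)
scaffold_12 = scaffold_sequence_id(12)
```

For example, Scaffold-12 corresponds to SEQ ID NO: 95 and the codon optimized MAD7 sequence to SEQ ID NO: 47.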


This workflow could also be used to identify or test PAM sequences compatible with a given MAD nuclease. Another method for identifying a PAM site is described in the next example.


In general, for the assays described, transformations were carried out as follows. E. coli strains expressing the codon optimized MAD nucleases were grown overnight. Saturated cultures were diluted 1/100, grown to an OD600 of 0.6, and induced by adding arabinose at a final concentration of 0.4% and (if a temperature sensitive plasmid was used) shifting the culture to 42 degrees Celsius in a shaking water bath. Following induction, cells were chilled on ice for 15 min prior to washing thrice with ¼ the initial culture volume of 10% glycerol (for example, 50 mL washes for a 200 mL culture). Cells were resuspended in 1/100 the initial volume (for example, 2 mL for a 200 mL culture) and stored at −90 degrees Celsius until ready to use. To perform the compatibility and editing efficiency screens described here, 50 ng of editing cassette was transformed into cell aliquots by electroporation. Following electroporation, the cells were recovered in LB for 3 hours and 100 μL of cells were plated on MacConkey plates containing 1% galactose.
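The wash and resuspension volumes stated above scale with the initial culture volume. A minimal Python sketch of that arithmetic is given below; the function and key names are hypothetical, and only the ¼ and 1/100 ratios are taken from the protocol.

```python
def competent_cell_volumes(culture_ml):
    """Volumes derived from the initial culture volume (in mL).

    wash_ml: volume of 10% glycerol per wash (1/4 of the culture volume).
    resuspension_ml: final resuspension volume (1/100 of the culture volume).
    """
    if culture_ml <= 0:
        raise ValueError("culture volume must be positive")
    return {
        "wash_ml": culture_ml / 4,
        "resuspension_ml": culture_ml / 100,
    }


# Matches the worked example in the text: a 200 mL culture.
volumes = competent_cell_volumes(200)
```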


Editing efficiencies were determined by dividing the number of white colonies (edited cells) by the total number of white and red colonies (edited and non-edited cells).
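The colony-count calculation may be sketched as follows. This is an illustrative Python function with hypothetical names; the colony counts in the usage line are invented for the example and are not experimental data.

```python
def editing_efficiency(white_colonies, red_colonies):
    """Fraction of edited cells: white / (white + red).

    White colonies are scored as edited (galK disrupted) and red colonies
    as non-edited on MacConkey plates containing galactose.
    """
    total = white_colonies + red_colonies
    if total == 0:
        raise ValueError("no colonies were counted")
    return white_colonies / total


# Hypothetical plate with 98 white and 2 red colonies.
example_efficiency = editing_efficiency(98, 2)
```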


Example 4. PAM Selection Assay

In order to generate a double strand break in a target sequence, a guide nucleic acid must hybridize to a target sequence, and the MAD nuclease must recognize a PAM sequence adjacent to the target sequence. If the guide nucleic acid hybridizes to the target sequence, but the MAD nuclease does not recognize a PAM site, then cleavage does not occur.


A PAM is MAD nuclease-specific and not all MAD nucleases necessarily recognize the same PAM. In order to assess the PAM site requirements for the MAD nucleases, an assay as depicted in FIGS. 6A-6C was performed.



FIG. 6A depicts a MAD nuclease expression vector as described elsewhere, which also contains a chloramphenicol resistance gene and the lambda RED recombination system.



FIG. 6B depicts a self-targeting editing cassette. The guide nucleic acid is designed to target the target sequence, which is contained on the same nucleic acid molecule. The target sequence is flanked by random nucleotides, depicted by N4, meaning four random nucleotides on either end of the target sequence. It should be understood that any number of random nucleotides could also be used (for example, 3, 5, 6, 7, 8, etc.). The random nucleotides serve as a library of potential PAMs.



FIG. 6C depicts the experimental design. Briefly, the MAD nuclease expression vector and the editing cassette comprising the random PAM sites were transformed into a host cell. If a functional targetable nuclease complex was formed and the MAD nuclease recognized a PAM site, then the editing cassette vector was cleaved, which led to cell death. If a functional targetable complex was not formed or if the MAD nuclease did not recognize the PAM, then the target sequence was not cleaved and the cell survived. Next generation sequencing (NGS) was then used to sequence the starting and final cell populations in order to determine what PAM sites were recognized by a given MAD nuclease. These recognized PAM sites were then used to determine a consensus or non-consensus PAM for a given MAD nuclease.
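Because recognized PAMs lead to cleavage and cell death, they are expected to be under-represented in the final population relative to the starting population. One way such a comparison could be made from NGS read counts is sketched below in Python. The function names, the pseudocount, and the 10-fold threshold are assumptions chosen for illustration and are not parameters taken from the assay; the read counts in the usage lines are invented.

```python
def pam_depletion(start_counts, final_counts, pseudocount=1.0):
    """Fold depletion of each candidate PAM between two populations.

    Returns {pam: start_frequency / final_frequency}. A pseudocount is added
    to every count so that PAMs absent from one population do not cause a
    division by zero.
    """
    pams = set(start_counts) | set(final_counts)
    start_total = sum(start_counts.get(p, 0) + pseudocount for p in pams)
    final_total = sum(final_counts.get(p, 0) + pseudocount for p in pams)
    depletion = {}
    for p in pams:
        start_freq = (start_counts.get(p, 0) + pseudocount) / start_total
        final_freq = (final_counts.get(p, 0) + pseudocount) / final_total
        depletion[p] = start_freq / final_freq
    return depletion


def depleted_pams(depletion, min_fold=10.0):
    """Candidate PAMs depleted at least `min_fold` (threshold is arbitrary)."""
    return sorted(p for p, fold in depletion.items() if fold >= min_fold)


# Invented read counts for two candidate PAMs.
start = {"TTTA": 100, "GGGA": 100}
final = {"TTTA": 1, "GGGA": 199}
candidates = depleted_pams(pam_depletion(start, final))
```

In this toy input, only the strongly depleted sequence is reported as a candidate recognized PAM.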


The consensus PAM for MAD1-MAD8, and MAD10-MAD12 was determined to be TTTN. The consensus PAM for MAD9 was determined to be NNG. The consensus PAM for MAD13-MAD15 was determined to be TTN. The consensus PAM for MAD16-MAD18 was determined to be TA. The consensus PAM for MAD19-MAD20 was determined to be TTCN.
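In the consensus PAMs above, N denotes any nucleotide. A minimal Python sketch of checking whether a candidate sequence matches such a consensus is shown below; the function name and the restriction to the symbols A, C, G, T and N are assumptions of this sketch.

```python
# Nucleotides permitted by each consensus symbol used in this example.
PAM_SYMBOLS = {"A": "A", "C": "C", "G": "G", "T": "T", "N": "ACGT"}


def matches_pam(candidate, consensus):
    """True if `candidate` matches `consensus` position by position."""
    candidate = candidate.upper()
    consensus = consensus.upper()
    if len(candidate) != len(consensus):
        return False
    return all(
        base in PAM_SYMBOLS[symbol]
        for base, symbol in zip(candidate, consensus)
    )


ttta_matches = matches_pam("TTTA", "TTTN")   # matches the TTTN consensus
ttga_matches = matches_pam("TTGA", "TTTN")   # third position differs
agg_matches = matches_pam("AGG", "NNG")      # matches the NNG consensus
```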


Example 5. Testing Heterologous Guide Nucleic Acids

Editing efficiencies were tested for MAD1, MAD2, MAD4, and MAD7 and are depicted in FIG. 7A and FIG. 7B. Experiment details and editing efficiencies are summarized in Table 3. Editing efficiency was determined by dividing the number of edited cells by the total number of recovered cells. Various editing cassettes targeting the galK gene were used to allow screening of edited cells. The guide nucleic acids encoded on the editing cassette contained a guide sequence targeting the galK gene and one of various scaffold sequences in order to test the compatibility of the indicated MAD nuclease with the indicated scaffold sequence, as summarized in Table 3.


Compatible MAD nuclease and guide nucleic acid combinations (comprising the indicated scaffold sequences) were observed to have editing efficiencies of 75-100%. MAD2 had an editing efficiency of 75-100% and MAD7 had an editing efficiency of 97-100%.


MAD2 combined with scaffold-1, scaffold-2, scaffold-4, or scaffold-13 in these experiments resulted in 0% editing efficiency. These data imply that MAD2 did not form a functional complex with these tested guide nucleic acids and that MAD2 is not compatible with these scaffold sequences.


MAD7 combined with scaffold-1, scaffold-2, scaffold-4, or scaffold-13 in these experiments resulted in 0% editing efficiency. These data imply that MAD7 did not form a functional complex with these tested guide nucleic acids and that MAD7 is not compatible with these scaffold sequences.


For MAD1 and MAD4, all tested guide nucleic acid combinations resulted in 0% editing efficiency, implying that MAD1 and MAD4 did not form a functional complex with any of the tested guide nucleic acids. These data also imply that MAD1 and MAD4 are not compatible with the tested scaffold sequences.


Combined, these data highlight the unpredictability of finding a compatible MAD nuclease and scaffold sequence pair in order to form a functional targetable nuclease complex. Some tested MAD nucleases did not function with any tested scaffold sequence. Some tested MAD nucleases only functioned with some tested scaffold sequences and not with others.














TABLE 3

#   Nucleic acid-guided nuclease  Guide nucleic acid scaffold sequence  Editing sequence mutation  Target gene  Editing efficiency
1   MAD1                          Scaffold-1; SEQ ID NO: 84             L80**                      galK         0%
2   MAD1                          Scaffold-2; SEQ ID NO: 85             Y145**                     galK         0%
3   MAD1                          Scaffold-4; SEQ ID NO: 87             Y145**                     galK         0%
4   MAD1                          Scaffold-10; SEQ ID NO: 93            Y145**                     galK         0%
5   MAD1                          Scaffold-11; SEQ ID NO: 94            L80**                      galK         0%
6   MAD1                          Scaffold-12; SEQ ID NO: 95            L10KpnI                    galK         0%
7   MAD1                          Scaffold-13; SEQ ID NO: 96            Y145**                     galK         0%
8   MAD1                          Scaffold-12; SEQ ID NO: 95            L10KpnI                    galK         0%
9   MAD2                          Scaffold-10; SEQ ID NO: 93            L80**                      galK         0%
10  MAD2                          Scaffold-10; SEQ ID NO: 93            Y145**                     galK         100%
11  MAD2                          Scaffold-11; SEQ ID NO: 94            L80**                      galK         98%
12  MAD2                          Scaffold-11; SEQ ID NO: 94            Y145**                     galK         99%
13  MAD2                          Scaffold-12; SEQ ID NO: 95            Y145**                     galK         98%
14  MAD2                          Scaffold-12; SEQ ID NO: 95            Y145**                     galK         0%
15  MAD2                          Scaffold-13; SEQ ID NO: 96            Y145**                     galK         0%
16  MAD2                          Scaffold-1; SEQ ID NO: 84             L80**                      galK         0%
17  MAD2                          Scaffold-2; SEQ ID NO: 85             Y145**                     galK         0%
18  MAD2                          Scaffold-2; SEQ ID NO: 85             Y145**                     galK         0%
19  MAD2                          Scaffold-4; SEQ ID NO: 87             Y145**                     galK         0%
20  MAD2                          Scaffold-5; SEQ ID NO: 88             L80**                      galK         99%
21  MAD2                          Scaffold-12; SEQ ID NO: 95            89**                       galK         0%
22  MAD2                          Scaffold-12; SEQ ID NO: 95            70**                       galK         75%
23  MAD2                          Scaffold-12; SEQ ID NO: 95            L10KpnI                    galK         79%
24  MAD4                          Scaffold-1; SEQ ID NO: 84             L80**                      galK         0%
25  MAD4                          Scaffold-2; SEQ ID NO: 85             Y145**                     galK         0%
26  MAD4                          Scaffold-4; SEQ ID NO: 87             Y145**                     galK         0%
27  MAD4                          Scaffold-10; SEQ ID NO: 93            Y145**                     galK         0%
28  MAD4                          Scaffold-11; SEQ ID NO: 94            L80**                      galK         0%
29  MAD4                          Scaffold-12; SEQ ID NO: 95            L10KpnI                    galK         0%
30  MAD4                          Scaffold-13; SEQ ID NO: 96            Y145**                     galK         0%
31  MAD4                          Scaffold-12; SEQ ID NO: 95            L10KpnI                    galK         0%
32  MAD7                          Scaffold-1; SEQ ID NO: 84             L80**                      galK         0%
33  MAD7                          Scaffold-2; SEQ ID NO: 85             Y145**                     galK         0%
34  MAD7                          Scaffold-4; SEQ ID NO: 87             Y145**                     galK         0%
35  MAD7                          Scaffold-10; SEQ ID NO: 93            Y145**                     galK         100%
36  MAD7                          Scaffold-11; SEQ ID NO: 94            L80**                      galK         97%
37  MAD7                          Scaffold-12; SEQ ID NO: 95            L10KpnI                    galK         0%
38  MAD7                          Scaffold-13; SEQ ID NO: 96            Y145**                     galK         0%
39  MAD7                          Scaffold-12; SEQ ID NO: 95            L10KpnI                    galK         0%









Example 6. Assessment of MAD2 and MAD7

The ability of MAD2 and MAD7 to function with heterologous guide nucleic acids was tested using a similar experimental design as described above.


The compatibility of MAD2 with other scaffold sequences was tested and the results of an experiment are depicted in FIG. 8. The MAD nucleases, guide nucleic acid scaffold sequences, and editing sequences used in this experiment are summarized in Table 4.


The compatibility of MAD7 with other scaffold sequences was tested and the results of an experiment are depicted in FIG. 9. The MAD nucleases, guide nucleic acid scaffold sequences, and editing sequences used in this experiment are summarized in Table 5.













TABLE 4

#   Nucleic acid-guided nuclease  Guide nucleic acid scaffold sequence  Editing sequence mutation  Target gene
1   MAD2                          Scaffold-12; SEQ ID NO: 95            N89KpnI                    galK
2   MAD2                          Scaffold-10; SEQ ID NO: 93            L80**                      galK
3   MAD2                          Scaffold-5; SEQ ID NO: 88             L80**                      galK
4   MAD2                          Scaffold-12; SEQ ID NO: 95            D70KpnI                    galK
5   MAD2                          Scaffold-12; SEQ ID NO: 95            Y145**                     galK
6   MAD2                          Scaffold-11; SEQ ID NO: 94            Y145**                     galK
7   MAD2                          Scaffold-10; SEQ ID NO: 93            Y145**                     galK
8   MAD2                          Scaffold-12; SEQ ID NO: 95            L10KpnI                    galK
9   MAD2                          Scaffold-11; SEQ ID NO: 94            L80**                      galK
10  SpCas9                        S. pyogenes gRNA                      Y145**                     galK
11  MAD2                          Scaffold-2; SEQ ID NO: 85             Y145**                     galK
12  MAD2                          Scaffold-4; SEQ ID NO: 87             Y145**                     galK
13  MAD2                          Scaffold-1; SEQ ID NO: 84             L80**                      galK
14  MAD2                          Scaffold-13; SEQ ID NO: 96            Y145**                     galK




















TABLE 5

#   Nucleic acid-guided nuclease  Guide nucleic acid scaffold sequence  Editing sequence mutation  Target gene
1   MAD7                          Scaffold-1; SEQ ID NO: 84             L80**                      galK
2   MAD7                          Scaffold-2; SEQ ID NO: 85             Y145**                     galK
3   MAD7                          Scaffold-4; SEQ ID NO: 87             Y145**                     galK
4   MAD7                          Scaffold-10; SEQ ID NO: 93            Y145**                     galK
5   MAD7                          Scaffold-11; SEQ ID NO: 95            L80**                      galK









In another experiment, transformation efficiencies (FIG. 10B) were determined by calculating the total number of recovered cells compared to the starting number of cells. An example plate image is depicted in FIG. 10C. Editing efficiencies (FIG. 10A) were determined by calculating the ratio of edited colonies (white colonies, edited galK gene) versus total colonies.


In this example (FIGS. 10A-10C), cells expressing galK were transformed with expression constructs expressing either MAD2 or MAD7 and a corresponding editing cassette comprising a guide nucleic acid targeting the galK gene. The guide nucleic acid comprised a guide sequence targeting the galK gene and the scaffold-12 sequence (SEQ ID NO: 95).


In the depicted example, MAD2 and MAD7 had a lower transformation efficiency compared to S. pyogenes Cas9, though the editing efficiency of MAD2 and MAD7 was slightly higher than that of S. pyogenes Cas9.



FIG. 11 depicts the sequencing results from select colonies recovered from the assay described above. The target sequence was in the galK coding sequence (CDS). The TTTN PAM is shown as the reverse complement (wild-type NAAA, mutated NGAA). The mutations targeted by the editing sequence are labeled as target codons. Changes compared to the wild-type sequence are highlighted. In these experiments, the scaffold-12 sequence (SEQ ID NO: 95) was used. The guide sequence of the guide nucleic acid targeted the galK gene.


Six of the seven depicted sequences from the MAD2 experiment contained the designed PAM mutation and the designed mutations in the target codons of galK, whereas one sequenced colony retained the wild-type PAM and wild-type target codons but also contained an unintended mutation upstream of the target site.


Two of the four depicted sequences from the MAD7 experiment contained the designed PAM mutation and mutated target codons. One colony comprised a wild-type sequence, while another contained a deletion of eight nucleotides upstream of the target sequence.



FIG. 12 depicts results from another experiment testing the ability to recover edited cells. In Experiment 0, the MAD2 nuclease was used with a guide nucleic acid comprising scaffold-11 sequence and a guide sequence targeting galK. The editing cassette comprised an editing sequence designed to incorporate an L80** mutation into galK, thereby allowing screening of the edited cells. In Experiment 1, the MAD2 nuclease was used with a guide nucleic acid comprising scaffold-12 sequence and a guide sequence targeting galK. The editing cassette comprised an editing sequence designed to incorporate an L10KpnI mutation into galK. In both experiments, a negative control plasmid containing a guide nucleic acid that is not compatible with MAD2 was included in the transformations. Following transformation, the ratio of the compatible editing cassette (those containing scaffold-11 or scaffold-12 guide nucleic acids) to the non-compatible editing cassette (negative control) was measured. The experiments were done in the presence or absence of selection. The results show that more cells containing the compatible editing cassette were recovered than cells containing the non-compatible editing cassette, and that this difference is magnified when selection is used.


Example 7. Guide Nucleic Acid Characterization

The sequences of scaffolds 1-8 and 10-12 (SEQ ID NO: 84-91 and 93-95) were aligned and are depicted in FIG. 13A. Nucleotides that match the consensus sequence are faded, while those diverging from the consensus sequence are visible. The predicted pseudoknot region is indicated. Without being bound by theory, the region 5′ of the pseudoknot may influence binding and/or kinetics of the nucleic acid-guided nuclease. As is shown in FIG. 13A, in general, there appears to be less variability in the pseudoknot region (e.g., SEQ ID NO: 172-181) as compared to the sequence outside of the pseudoknot region.
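The consensus-and-divergence view described above can be sketched for sequences that are already aligned to equal length. The toy sequences below are hypothetical and are not the scaffold sequences of SEQ ID NO: 84-95; a simple per-column majority rule is assumed for the consensus, which may differ from the method used to prepare the figure.

```python
from collections import Counter


def consensus(aligned: list[str]) -> str:
    """Per-column majority character of equal-length, pre-aligned sequences."""
    if len({len(s) for s in aligned}) != 1:
        raise ValueError("sequences must be aligned to the same length")
    return "".join(Counter(col).most_common(1)[0][0] for col in zip(*aligned))


def mask_matches(seq: str, cons: str) -> str:
    """Replace positions that match the consensus with '.', keep divergent ones."""
    return "".join("." if a == b else a for a, b in zip(seq, cons))


def column_variability(aligned: list[str]) -> list[int]:
    """Number of distinct characters observed in each alignment column."""
    return [len(set(col)) for col in zip(*aligned)]


# Hypothetical toy alignment, for illustration only.
toy = ["AAUUCG", "AAUACG", "GAUUCG"]
toy_consensus = consensus(toy)
toy_masked = [mask_matches(s, toy_consensus) for s in toy]
toy_variability = column_variability(toy)
```

Columns with a variability of 1 are fully conserved in the toy alignment; comparing such counts inside and outside a region of interest is one simple way to express "less variability".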



FIG. 13B shows a preliminary model of MAD2 and MAD12 complexed with a guide nucleic acid (in this example, a guide RNA) and target sequence (DNA).


Example 8. Editing Efficiency of the MAD Nucleases

A plate-based editing efficiency assay and a molecular editing efficiency assay were used to test editing efficiency of various MAD nuclease and guide nucleic acid combinations.



FIG. 15 depicts quantification of the data obtained using the molecular editing efficiency assay using MAD2 nuclease with a guide nucleic acid comprising scaffold-12 and a guide sequence targeting galK. The indicated mutations were incorporated into galK using corresponding editing cassettes containing the mutation. FIG. 16 shows the comparison of the editing efficiencies determined by the plate-based assay, using white and red colonies as described previously, and by the molecular editing efficiency assay. As shown in FIG. 16, the editing efficiencies as determined by the two separate assays are consistent.


Example 9. Trackable Editing

Genetic edits can be tracked by the use of a barcode. A barcode can be incorporated into or near the edit site as described in the present specification. When multiple rounds of engineering are being performed, with a different edit being made in each round, it may be beneficial to insert a barcode in a common region during each round of engineering. In this way, one could sequence a single site and obtain the sequences of all of the barcodes from each round without the need to sequence each edited site individually. FIGS. 17A-17C, 18, and 19 depict examples of such trackable engineering workflows.


As depicted in FIG. 17A, a cell expressing a MAD nuclease is transformed with a plasmid containing an editing cassette and a recording cassette. The editing cassette contains a PAM mutation and a gene edit. The recorder cassette comprises a barcode, in this case 15N. The editing cassette and the recording cassette each comprise a guide nucleic acid to a distinct target sequence. Within a library of such plasmids, the recorder cassette for each round can contain the same guide nucleic acid, such that the first round barcode is inserted into the same location across all variants, regardless of which editing cassette and corresponding gene edit is used. The correlation between the barcode and the editing cassette is determined beforehand, however, such that the edit can be identified by sequencing the barcode. FIG. 17B shows an example of a recording cassette designed to delete a PAM site while incorporating a 15N barcode (actatcaatgggctaactnnnnnnnnnnnnnnntgaaacatctgcaactgcg (SEQ ID NO: 203); actatcaatgggctaactacgttcgtggcgtggtgaaacatctgcaactgcg (SEQ ID NO: 204)). The deleted PAM is used to enrich for edited cells, since cells with a mutated PAM escape cell death while cells containing a wild-type PAM sequence are killed. FIG. 17C depicts how sequencing the barcode region can be used to identify which edit is comprised within each cell.


A similar approach is depicted in FIG. 18. In this case, the recorder cassette from each round is designed to target a sequence adjacent to the previous round, and each time, a new PAM site is deleted by the recorder cassette. The result is a barcode array with the barcodes from each round that can be sequenced to confirm that each round of engineering took place and to determine which combination of mutations is contained in the cell, and in which order the mutations were made. Each successive recorder cassette can be designed to be homologous on one end to the region comprising the mutated PAM from the previous round, which could increase the efficiency of getting fully edited cells at the end of the experiment. In other examples, the recorder cassette is designed to target a unique landing site that was incorporated by the previous recorder cassette. This increases the efficiency of recovering cells containing all of the desired mutations since the subsequent recorder cassette and barcode can only target a cell that has successfully completed the previous round of engineering.
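Decoding a barcode array into an ordered edit history can be sketched as one lookup per round. It is assumed here that the barcodes have already been extracted from the array in round order and that each round has its own predetermined barcode-to-edit table; all names and values are hypothetical.

```python
def decode_barcode_array(
    barcodes: list[str], round_tables: list[dict[str, str]]
) -> list[tuple[int, str]]:
    """Map barcodes (in round order) to (round number, edit) pairs."""
    if len(barcodes) > len(round_tables):
        raise ValueError("more barcodes than rounds with lookup tables")
    history = []
    for round_number, (barcode, table) in enumerate(
        zip(barcodes, round_tables), start=1
    ):
        if barcode not in table:
            raise KeyError(f"round {round_number}: unknown barcode {barcode}")
        history.append((round_number, table[barcode]))
    return history


# Hypothetical per-round tables and a two-round array, for illustration only.
tables = [{"aaa": "edit_1a", "ggg": "edit_1b"}, {"ccc": "edit_2a"}]
history = decode_barcode_array(["aaa", "ccc"], tables)
```

An array shorter than the number of rounds indicates that later rounds did not complete, which is the confirmation use described above.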



FIG. 19 depicts another approach that allows the recycling of selectable markers or otherwise cures the cell of the plasmid from the previous round of engineering. In this case, the transformed plasmid contains a guide nucleic acid designed to target a selectable marker or other unique sequence in the plasmid from the previous round of engineering.


SEQUENCE LISTING

TABLE 6. Sequence listing

SEQ ID NO:    Sequence

SEQ
MGKMYYLGLDIGTNSVGYAVTDPSYHLLKFKGEPMWGAHVFAAGNQSAERRSFRT


ID
SRRRLDRRQQRVKLVQEIFAPVISPIDPRFFIRLHESALWRDDVAETDKHIFFND


NO:
PTYTDKEYYSDYPTIHHLIVDLMESSEKHDPRLVYLAVAWLVAHRGHFLNEVDKD


1
NIGDVLSFDAFYPEFLAFLSDNGVSPWVCESKALQATLLSRNSVNDKYKALKSLI



FGSQKPEDNFDANISEDGLIQLLAGKKVKVNKLFPQESNDASFTLNDKEDAIEEI



LGTLTPDECEWIAHIRRLFDWAIMKHALKDGRTISESKVKLYEQHHHDLTQLKYF



VKTYLAKEYDDIFRNVDSETTKNYVAYSYHVKEVKGTLPKNKATQEEFCKYVLGK



VKNIECSEADKVDFDEMIQRLTDNSFMPKQVSGENRVIPYQLYYYELKTILNKAA



SYLPFLTQCGKDAISNQDKLLSIMTFRIPYFVGPLRKDNSEHAWLERKAGKIYPW



NFNDKVDLDKSEEAFIRRMTNTCTYYPGEDVLPLDSLIYEKFMILNEINNIRIDG



YPISVDVKQQVFGLFEKKRRVTVKDIQNLLLSLGALDKHGKLTGIDTTIHSNYNT



YHHFKSLMERGVLTRDDVERIVERMTYSDDTKRVRLWLNNNYGTLTADDVKHISR



LRKHDFGRLSKMFLTGLKGVHKETGERASILDFMWNTNDNLMQLLSECYTFSDEI



TKLQEAYYAKAQLSLNDFLDSMYISNAVKRPIYRTLAVVNDIRKACGTAPKRIFI



EMARDGESKKKRSVTRREQIKNLYRSIRKDFQQEVDFLEKILENKSDGQLQSDAL



YLYFAQLGRDMYTGDPIKLEHIKDQSFYNIDHIYPQSMVKDDSLDNKVLVQSEIN



GEKSSRYPLDAAIRNKMKPLWDAYYNHGLISLKKYQRLTRSTPFTDDEKWDFINR



QLVETRQSTKALAILLKRKFPDTEIVYSKAGLSSDFRHEFGLVKSRNINDLHHAK



DAFLAIVTGNVYHERFNRRWFMVNQPYSVKTKTLFTHSIKNGNFVAWNGEEDLGR



IVKMLKQNKNTIHFTRFSFDRKEGLFDIQPLKASTGLVPRKAGLDVVKYGGYDKS



TAAYYLLVRFTLEDKKTQHKLMMIPVEGLYKARIDHDKEFLTDYAQTTISEILQK



DKQKVINIMFPMGTRHIKLNSMISIDGFYLSIGGKSSKGKSVLCHAMVPLIVPHK



IECYIKAMESFARKFKENNKLRIVEKFDKITVEDNLNLYELFLQKLQHNPYNKFF



STQFDVLTNGRSTFTKLSPEEQVQTLLNILSIFKTCRSSGCDLKSINGSAQAARI



MISADLTGLSKKYSDIRLVEQSASGLFVSKSQNLLEYL*





SEQ
MSSLTKFTNKYSKQLTIKNELIPVGKTLENIKENGLIDGDEQLNENYQKAKIIVD


ID
DFLRDFINKALNNTQIGNWRELADALNKEDEDNIEKLQDKIRGIIVSKFETFDLF


NO:
SSYSIKKDEKIIDDDNDVEEEELDLGKKTSSFKYIFKKNLFKLVLPSYLKTTNQD


2
KLKIISSFDNFSTYFRGFFENRKNIFTKKPISTSIAYRIVHDNFPKFLDNIRCFN



VWQTECPQLIVKADNYLKSKNVIAKDKSLANYFTVGAYDYFLSQNGIDFYNNIIG



GLPAFAGHEKIQGLNEFINQECQKDSELKSKLKNRHAFKMAVLFKQILSDREKSF



VIDEFESDAQVIDAVKNFYAEQCKDNNVIFNLLNLIKNIAFLSDDELDGIFIEGK



YLSSVSQKLYSDWSKLRNDIEDSANSKQGNKELAKKIKTNKGDVEKAISKYEFSL



SELNSIVHDNTKFSDLLSCTLHKVASEKLVKVNEGDWPKHLKNNEEKQKIKEPLD



ALLEIYNTLLIFNCKSFNKNGNFYVDYDRCINELSSVVYLYNKTRNYCTKKPYNT



DKFKLNFNSPQLGEGFSKSKENDCLTLLFKKDDNYYVGIIRKGAKINFDDTQAIA



DNTDNCIFKMNYFLLKDAKKFIPKCSIQLKEVKAHFKKSEDDYILSDKEKFASPL



VIKKSTFLLATAHVKGKKGNIKKFQKEYSKENPTEYRNSLNEWIAFCKEFLKTYK



AATIFDITTLKKAEEYADIVEFYKDVDNLCYKLEFCPIKTSFIENLIDNGDLYLF



RINNKDFSSKSTGTKNLHTLYLQAIFDERNLNNPTIMLNGGAELFYRKESIEQKN



RITHKAGSILVNKVCKDGTSLDDKIRNEIYQYENKFIDTLSDEAKKVLPNVIKKE



ATHDITKDKRFTSDKFFFHCPLTINYKEGDTKQFNNEVLSFLRGNPDINIIGIDR



GERNLIYVTVINQKGEILDSVSFNTVTNKSSKIEQTVDYEEKLAVREKERIEAKR



SWDSISKIATLKEGYLSAIVHEICLLMIKHNAIVVLENLNAGFKRIRGGLSEKSV



YQKFEKMLINKLNYFVSKKESDWNKPSGLLNGLQLSDQFESFEKLGIQSGFIFYV



PAAYTSKIDPTTGFANVLNLSKVRNVDAIKSFFSNFNEISYSKKEALFKFSFDLD



SLSKKGFSSFVKFSKSKWNVYTFGERIIKPKNKQGYREDKRINLTFEMKKLLNEY



KVSFDLENNLIPNLTSANLKDTFWKELFFIFKTTLQLRNSVTNGKEDVLISPVKN



AKGEFFVSGTHNKTLPQDCDANGAYHIALKGLMILERNNLVREEKDTKKIMAISN



VDWFEYVQKRRGVL*





SEQ
MNNYDEFTKLYPIQKTIRFELKPQGRTMEHLETFNFFEEDRDRAEKYKILKEAID


ID
EYHKKFIDEHLTNMSLDWNSLKQISEKYYKSREEKDKKVFLSEQKRMRQEIVSEF


NO:
KKDDRFKDLFSKKLFSELLKEEIYKKGNHQEIDALKSFDKFSGYFIGLHENRKNM


3
YSDGDEITAISNRIVNENFPKFLDNLQKYQEARKKYPEWIIKAESALVAHNIKMD



EVFSLEYFNKVLNQEGIQRYNLALGGYVTKSGEKMMGLNDALNLAHQSEKSSKGR



IHMTPLFKQILSEKESFSYIPDVFTEDSQLLPSIGGFFAQIENDKDGNIFDRALE



LISSYAEYDTERIYIRQADINRVSNVIFGEWGTLGGLMREYKADSINDINLERTC



KKVDKWLDSKEFALSDVLEAIKRTGNNDAFNEYISKMRTAREKIDAARKEMKFIS



EKISGDEESIHIIKTLLDSVQQFLHFFNLFKARQDIPLDGAFYAEFDEVHSKLFA



IVPLYNKVRNYLTKNNLNTKKIKLNFKNPTLANGWDQNKVYDYASLIFLRDGNYY



LGIINPKRKKNIKFEQGSGNGPFYRKMVYKQIPGPNKNLPRVFLTSTKGKKEYKP



SKEIIEGYEADKHIRGDKFDLDFCHKLIDFFKESIEKHKDWSKFNFYFSPTESYG



DISEFYLDVEKQGYRMHFENISAETIDEYVEKGDLFLFQIYNKDFVKAATGKKDM



HTIYWNAAFSPENLQDVVVKLNGEAELFYRDKSDIKEIVHREGEILVNRTYNGRT



PVPDKIHKKLTDYHNGRTKDLGEAKEYLDKVRYFKAHYDITKDRRYLNDKIYFHV



PLTLNFKANGKKNLNKMVIEKFLSDEKAHIIGIDRGERNLLYYSIIDRSGKIIDQ



QSLNVIDGFDYREKLNQREIEMKDARQSWNAIGKIKDLKEGYLSKAVHEITKMAI



QYNAIVVMEELNYGFKRGRFKVEKQIYQKFENMLIDKMNYLVFKDAPDESPGGVL



NAYQLTNPLESFAKLGKQTGILFYVPAAYTSKIDPTTGFVNLFNTSSKTNAQERK



EFLQKFESISYSAKDGGIFAFAFDYRKFGTSKTDHKNVWTAYTNGERMRYIKEKK



RNELFDPSKEIKEALTSSGIKYDGGQNILPDILRSNNNGLIYTMYSSFIAAIQMR



VYDGKEDYIISPIKNSKGEFFRTDPKRRELPIDADANGAYNIALRGELTMRAIAE



KFDPDSEKMAKLELKHKDWFEFMQTRGD*





SEQ
MTKTFDSEFFNLYSLQKTVRFELKPVGETASFVEDFKNEGLKRVVSEDERRAVDY


ID
QKVKEIIDDYHRDFIEESLNYFPEQVSKDALEQAFHLYQKLKAAKVEEREKALKE


NO:
WEALQKKLREKVVKCFSDSNKARFSRIDKKELIKEDLINWLVAQNREDDIPTVET


4
FNNFTTYFTGFHENRKNIYSKDDHATAISFRLIHENLPKFFDNVISENKLKEGFP



ELKFDKVKEDLEVDYDLKHAFEIEYFVNFVTQAGIDQYNYLLGGKTLEDGTKKQG



MNEQINLFKQQQTRDKARQIPKLIPLFKQILSERTESQSFIPKQFESDQELFDSL



QKLHNNCQDKFTVLQQAILGLAEADLKKVFIKTSDLNALSNTIFGNYSVFSDALN



LYKESLKTKKAQEAFEKLPAHSIHDLIQYLEQFNSSLDAEKQQSTDTVLNYFIKT



DELYSRFIKSTSEAFTQVQPLFELEALSSKRRPPESEDEGAKGQEGFEQIKRIKA



YLDTLMEAVHFAKPLYLVKGRKMIEGLDKDQSFYEAFEMAYQELESLIIPIYNKA



RSYLSRKPFKADKFKINFDNNTLLSGWDANKETANASILFKKDGLYYLGIMPKGK



TFLFDYFVSSEDSEKLKQRRQKTAEEALAQDGESYFEKIRYKLLPGASKMLPKVF



FSNKNIGFYNPSDDILRIRNTASHTKNGTPQKGHSKVEFNLNDCHKMIDFFKSSI



QKHPEWGSFGFTFSDTSDFEDMSAFYREVENQGYVISFDKIKETYIQSQVEQGNL



YLFQIYNKDFSPYSKGKPNLHTLYWKALFEEANLNNVVAKLNGEAEIFFRRHSIK



ASDKVVHPANQAIDNKNPHTEKTQSTFEYDLVKDKRYTQDKFFFHVPISLNFKAQ



GVSKFNDKVNGFLKGNPDVNIIGIDRGERHLLYFTVVNQKGEILVQESLNTLMSD



KGHVNDYQQKLDKKEQERDAARKSWTTVENIKELKEGYLSHVVHKLAHLIIKYNA



IVCLEDLNFGFKRGRFKVEKQVYQKFEKALIDKLNYLVFKEKELGEVGHYLTAYQ



LTAPFESFKKLGKQSGILFYVPADYTSKIDPTTGFVNFLDLRYQSVEKAKQLLSD



FNAIRFNSVQNYFEFEIDYKKLTPKRKVGTQSKWVICTYGDVRYQNRRNQKGHWE



TEEVNVTEKLKALFASDSKTTTVIDYANDDNLIDVILEQDKASFFKELLWLLKLT



MTLRHSKIKSEDDFILSPVKNEQGEFYDSRKAGEVWPKDADANGAYHIALKGLWN



LQQINQWEKGKTLNLAIKNQDWFSFIQEKPYQE*





SEQ
MHTGGLLSMDAKEFTGQYPLSKTLRFELRPIGRTWDNLEASGYLAEDRHRAECYP


ID
RAKELLDDNHRAFLNRVLPQIDMDWHPIAEAFCKVHKNPGNKELAQDYNLQLSKR


NO:
RKEISAYLQDADGYKGLFAKPALDEAMKIAKENGNESDIEVLEAFNGFSVYFTGY


5
HESRENIYSDEDMVSVAYRITEDNFPRFVSNALIFDKLNESHPDIISEVSGNLGV



DDIGKYFDVSNYNNFLSQAGIDDYNHIIGGHTTEDGLIQAFNVVLNLRHQKDPGF



EKIQFKQLYKQILSVRTSKSYIPKQFDNSKEMVDCICDYVSKIEKSETVERALKL



VRNISSFDLRGIFVNKKNLRILSNKLIGDWDAIETALMHSSSSENDKKSVYDSAE



AFTLDDIFSSVKKFSDASAEDIGNRAEDICRVISETAPFINDLRAVDLDSLNDDG



YEAAVSKIRESLEPYMDLFHELEIFSVGDEFPKCAAFYSELEEVSEQLIEIIPLE



NKARSFCTRKRYSTDKIKVNLKFPTLADGWDLNKERDNKAAILRKDGKYYLAILD



MKKDLSSIRTSDEDESSFEKMEYKLLPSPVKMLPKIFVKSKAAKEKYGLTDRMLE



CYDKGMHKSGSAFDLGFCHELIDYYKRCIAEYPGWDVFDFKFRETSDYGSMKEFN



EDVAGAGYYMSLRKIPCSEVYRLLDEKSIYLFQIYNKDYSENAHGNKNMHTMYWE



GLFSPQNLESPVFKLSGGAELFFRKSSIPNDAKTVHPKGSVLVPRNDVNGRRIPD



SIYRELTRYFNRGDCRISDEAKSYLDKVKTKKADHDIVKDRRFTVDKMMFHVPIA



MNFKAISKPNLNKKVIDGIIDDQDLKIIGIDRGERNLIYVTMVDRKGNILYQDSL



NILNGYDYRKALDVREYDNKEARRNWTKVEGIRKMKEGYLSLAVSKLADMIIENN



AIIVMEDLNHGFKAGRSKIEKQVYQKFESMLINKLGYMVLKDKSIDQSGGALHGY



QLANHVTTLASVGKQCGVIFYIPAAFTSKIDPTTGFADLFALSNVKNVASMREFF



SKMKSVIYDKAEGKFAFTFDYLDYNVKSECGRTLWTVYTVGERFTYSRVNREYVR



KVPTDIIYDALQKAGISVEGDLRDRIAESDGDTLKSIFYAFKYALDMRVENREED



YIQSPVKNASGEFFCSKNAGKSLPQDSDANGAYNIALKGILQLRMLSEQYDPNAE



SIRLPLITNKAWLTFMQSGMKTWKN*





SEQ
MDSLKDFTNLYPVSKTLRFELKPVGKTLENIEKAGILKEDEHRAESYRRVKKIID


ID
TYHKVFIDSSLENMAKMGIENEIKAMLQSFCELYKKDHRTEGEDKALDKIRAVLR


NO:
GLIVGAFTGVCGRRENTVQNEKYESLFKEKLIKEILPDFVLSTEAESLPFSVEEA


6
TRSLKEFDSFTSYFAGFYENRKNIYSTKPQSTAIAYRLIHENLPKFIDNILVFQK



IKEPIAKELEHIRADFSAGGYIKKDERLEDIFSLNYYIHVLSQAGIEKYNALIGK



IVTEGDGEMKGLNEHINLYNQQRGREDRLPLFRPLYKQILSDREQLSYLPESFEK



DEELLRALKEFYDHIAEDILGRTQQLMTSISEYDLSRIYVRNDSQLTDISKKMLG



DWNAIYMARERAYDHEQAPKRITAKYERDRIKALKGEESISLANLNSCIAFLDNV



RDCRVDTYLSTLGQKEGPHGLSNLVENVFASYHEAEQLLSFPYPEENNLIQDKDN



VVLIKNLLDNISDLQRFLKPLWGMGDEPDKDERFYGEYNYIRGALDQVIPLYNKV



RNYLTRKPYSTRKVKLNFGNSQLLSGWDRNKEKDNSCVILRKGQNFYLAIMNNRH



KRSFENKVLPEYKEGEPYFEKMDYKFLPDPNKMLPKVFLSKKGIEIYKPSPKLLE



QYGHGTHKKGDTFSMDDLHELIDFFKHSIEAHEDWKQFGFKFSDTATYENVSSFY



REVEDQGYKLSFRKVSESYVYSLIDQGKLYLFQIYNKDFSPCSKGTPNLHTLYWR



MLFDERNLADVIYKLDGKAEIFFREKSLKNDHPTHPAGKPIKKKSRQKKGEESLF



EYDLVKDRHYTMDKFQFHVPITMNFKCSAGSKVNDMVNAHIREAKDMHVIGIDRG



ERNLLYICVIDSRGTILDQISLNTINDIDYHDLLESRDKDRQQERRNWQTIEGIK



ELKQGYLSQAVHRIAELMVAYKAVVALEDLNMGFKRGRQKVESSVYQQFEKQLID



KLNYLVDKKKRPEDIGGLLRAYQFTAPFKSFKEMGKQNGFLFYIPAWNTSNIDPT



TGFVNLFHAQYENVDKAKSFFQKFDSISYNPKKDWFEFAFDYKNFTKKAEGSRSM



WILCTHGSRIKNFRNSQKNGQWDSEEFALTEAFKSLFVRYEIDYTADLKTAIVDE



KQKDFFVDLLKLFKLTVQMRNSWKEKDLDYLISPVAGADGRFFDTREGNKSLPKD



ADANGAYNIALKGLWALRQIRQTSEGGKLKLAISNKEWLQFVQERSYEKD*





SEQ
MNNGTNNFQNFIGISSLQKTLRNALIPTETTQQFIVKNGIIKEDELRGENRQILK


ID
DIMDDYYRGFISETLSSIDDIDWTSLFEKMEIQLKNGDNKDTLIKEQTEYRKAIH


NO:
KKFANDDRFKNMFSAKLISDILPEFVIHNNNYSASEKEEKTQVIKLFSRFATSFK


7
DYFKNRANCFSADDISSSSCHRIVNDNAEIFFSNALVYRRIVKSLSNDDINKISG



DMKDSLKEMSLEEIYSYEKYGEFITQEGISFYNDICGKVNSFMNLYCQKNKENKN



LYKLQKLHKQILCIADTSYEVPYKFESDEEVYQSVNGFLDNISSKHIVERLRKIG



DNYNGYNLDKIYIVSKFYESVSQKTYRDWETINTALEIHYNNILPGNGKSKADKV



KKAVKNDLQKSITEINELVSNYKLCSDDNIKAETYIHEISHILNNFEAQELKYNP



EIHLVESELKASELKNVLDVIMNAFHWCSVFMTEELVDKDNNFYAELEEIYDEIY



PVISLYNLVRNYVTQKPYSTKKIKLNFGIPTLADGWSKSKEYSNNAIILMRDNLY



YLGIFNAKNKPDKKIIEGNTSENKGDYKKMIYNLLPGPNKMIPKVFLSSKTGVET



YKPSAYILEGYKQNKHIKSSKDFDITFCHDLIDYFKNCIAIHPEWKNFGFDFSDT



STYEDISGFYREVELQGYKIDWTYISEKDIDLLQEKGQLYLFQIYNKDFSKKSTG



NDNLHTMYLKNLFSEENLKDIVLKLNGEAEIFFRKSSIKNPIIHKKGSILVNRTY



EAEEKDQFGNIQIVRKNIPENIYQELYKYFNDKSDKELSDEAAKLKNVVGHHEAA



TNIVKDYRYTYDKYFLHMPITINFKANKTGFINDRILQYIAKEKDLHVIGIDRGE



RNLIYVSVIDTCGNIVEQKSFNIVNGYDYQIKLKQQEGARQIARKEWKEIGKIKE



IKEGYLSLVIHEISKMVIKYNAIIAMEDLSYGFKKGRFKVERQVYQKFETMLINK



LNYLVFKDISITENGGLLKGYQLTYIPDKLKNVGHQCGCIFYVPAAYTSKIDPTT



GFVNIFKFKDLTVDAKREFIKKFDSIRYDSEKNLFCFTFDYNNFITQNTVMSKSS



WSVYTYGVRIKRRFVNGRFSNESDTIDITKDMEKTLEMTDINWRDGHDLRQDIID



YEIVQHIFEIFRLTVQMRNSLSELEDRDYDRLISPVLNENNIFYDSAKAGDALPK



DADANGAYCIALKGLYEIKQITENWKEDGKFSRDKLKISNKDWFDFIQNKRYL*





SEQ
MTNKFTNQYSLSKTLRFELIPQGKTLEFIQEKGLLSQDKQRAESYQEMKKTIDKF


ID
HKYFIDLALSNAKLTHLETYLELYNKSAETKKEQKFKDDLKKVQDNLRKEIVKSF


NO:
SDGDAKSIFAILDKKELITVELEKWFENNEQKDIYFDEKFKTFTTYFTGFHQNRK


8
NMYSVEPNSTAIAYRLIHENLPKFLENAKAFEKIKQVESLQVNFRELMGEFGDEG



LIFVNELEEMFQINYYNDVLSQNGITIYNSIISGFTKNDIKYKGLNEYINNYNQT



KDKKDRLPKLKQLYKQILSDRISLSFLPDAFTDGKQVLKAIFDFYKINLLSYTIE



GQEESQNLLLLIRQTIENLSSFDTQKIYLKNDTHLTTISQQVFGDFSVFSTALNY



WYETKVNPKFETEYSKANEKKREILDKAKAVFTKQDYFSIAFLQEVLSEYILTLD



HTSDIVKKHSSNCIADYFKNHFVAKKENETDKTFDFIANITAKYQCIQGILENAD



QYEDELKQDQKLIDNLKFFLDAILELLHFIKPLHLKSESITEKDTAFYDVFENYY



EALSLLTPLYNMVRNYVTQKPYSTEKIKLNFENAQLLNGWDANKEGDYLTTILKK



DGNYFLAIMDKKHNKAFQKFPEGKENYEKMVYKLLPGVNKMLPKVFFSNKNIAYF



NPSKELLENYKKETHKKGDTFNLEHCHTLIDFFKDSLNKHEDWKYFDFQFSETKS



YQDLSGFYREVEHQGYKINFKNIDSEYIDGLVNEGKLFLFQIYSKDFSPFSKGKP



NMHTLYWKALFEEQNLQNVIYKLNGQAEIFFRKASIKPKNIILHKKKIKIAKKHF



IDKKTKTSEIVPVQTIKNLNMYYQGKISEKELTQDDLRYIDNFSIFNEKNKTIDI



IKDKRFTVDKFQFHVPITMNFKATGGSYINQTVLEYLQNNPEVKIIGLDRGERHL



VYLTLIDQQGNILKQESLNTITDSKISTPYHKLLDNKENERDLARKNWGTVENIK



ELKEGYISQVVHKIATLMLEENAIVVMEDLNFGFKRGRFKVEKQIYQKLEKMLID



KLNYLVLKDKQPQELGGLYNALQLTNKFESFQKMGKQSGFLFYVPAWNTSKIDPT



TGFVNYFYTKYENVDKAKAFFEKFEAIRFNAEKKYFEFEVKKYSDENPKAEGTQQ



AWTICTYGERIETKRQKDQNNKFVSTPINLTEKIEDFLGKNQIVYGDGNCIKSQI



ASKDDKAFFETLLYWFKMTLQMRNSETRTDIDYLISPVMNDNGTFYNSRDYEKLE



NPTLPKDADANGAYHIAKKGLMLLNKIDQADLTKKVDLSISNRDWLQFVQKNK*





SEQ
MEQEYYLGLDMGTGSVGWAVTDSEYHVLRKHGKALWGVRLFESASTAEERRMFRT


ID
SRRRLDRRNWRIEILQEIFAEEISKKDPGFFLRMKESKYYPEDKRDINGNCPELP


NO:
YALFVDDDFTDKDYHKKFPTIYHLRKMLMNTEETPDIRLVYLAIHHMMKHRGHFL


9
LSGDINEIKEFGTTFSKLLENIKNEELDWNLELGKEEYAVVESILKDNMLNRSTK



KTRLIKALKAKSICEKAVLNLLAGGTVKLSDIFGLEELNETERPKISFADNGYDD



YIGEVENELGEQFYIIETAKAVYDWAVLVEILGKYTSISEAKVATYEKHKSDLQF



LKKIVRKYLTKEEYKDIFVSTSDKLKNYSAYIGMTKINGKKVDLQSKRCSKEEFY



DFIKKNVLKKLEGQPEYEYLKEELERETFLPKQVNRDNGVIPYQIHLYELKKILG



NLRDKIDLIKENEDKLVQLFEFRIPYYVGPLNKIDDGKEGKFTWAVRKSNEKIYP



WNFENVVDIEASAEKFIRRMTNKCTYLMGEDVLPKDSLLYSKYMVLNELNNVKLD



GEKLSVELKQRLYTDVFCKYRKVTVKKIKNYLKCEGIISGNVEITGIDGDFKASL



TAYHDFKEILTGTELAKKDKENIITNIVLFGDDKKLLKKRLNRLYPQITPNQLKK



ICALSYTGWGRFSKKFLEEITAPDPETGEVWNIITALWESNNNLMQLLSNEYRFM



EEVETYNMGKQTKTLSYETVENMYVSPSVKRQIWQTLKIVKELEKVMKESPKRVF



IEMAREKQESKRTESRKKQLIDLYKACKNEEKDWVKELGDQEEQKLRSDKLYLYY



TQKGRCMYSGEVIELKDLWDNTKYDIDHIYPQSKTMDDSLNNRVLVKKKYNATKS



DKYPLNENIRHERKGFWKSLLDGGFISKEKYERLIRNTELSPEELAGFIERQIVE



TRQSTKAVAEILKQVFPESEIVYVKAGTVSRFRKDFELLKVREVNDLHHAKDAYL



NIVVGNSYYVKFTKNASWFIKENPGRTYNLKKMFTSGWNIERNGEVAWEVGKKGT



IVTVKQIMNKNNILVTRQVHEAKGGLFDQQIMKKGKGQIAIKETDERLASIEKYG



GYNKAAGAYFMLVESKDKKGKTIRTIEFIPLYLKNKIESDESIALNFLEKGRGLK



EPKILLKKIKIDTLFDVDGFKMWLSGRTGDRLLFKCANQLILDEKIIVTMKKIVK



FIQRRQENRELKLSDKDGIDNEVLMEIYNTFVDKLENTVYRIRLSEQAKTLIDKQ



KEFERLSLEDKSSTLFEILHIFQCQSSAANLKMIGGPGKAGILVMNNNISKCNKI



SIINQSPTGIFENEIDLLK





SEQ
MNKFENFTGLYPISKTLRFELIPQGKTLEYIEKSEILENDNYRAEKYEEVKDIID


ID
GYHKWFINETLHDLHINWSELKVALENNRIEKSDASKKELQRVQKIKREEIYNAF


NO:
IEHEAFQYLFKENLLSDLLPIQIEQSEDLDAEKKKQAVETFNRFSTYFTGFHENR


10
KNIYSKEGISTSVTYRIVHDNFPKFLENMKVFEILRNECPEVISDTANELAPFID



GVRIEDIFLIDFFNSTFSQNGIDYYNRILGGVTTETGEKYRGINEFTNLYRQQHP



EFGKSKKATKMVVLFKQILSDRDTLSFIPEMFGNDKQVQNSIQLFYNREISQFEN



EGVKTDVCTALATLTSKIAEFDTEKIYIQQPELPNVSQRLFGSWNELNACLFKYA



ELKFGTAEKVANRKKIDKWLKSDLFSFTELNKALEFSGKDERIENYFSETGIFAQ



LVKTGFDEAQSILETEYTSEVHLKDQQTDIEKIKTFLDALQNLMHLLKSLCVSEE



ADRDAAFYNEFDMLYNQLKLVVPLYNKVRNYITQKLFRSDKIKIYFENKGQFLGG



WVDSQTENSDNGTQAGGYIFRKENVINEYDYYLGICSDPKLFRRTTIVSENDRSS



FERLDYYQLKTASVYGNSYCGKHPYTEDKNELVNSIDRFVHLSGNNILIEKIAKD



KVKSNPTTNTPSGYLNFIHREAPNTYECLLQDENFVSLNQRVVSALKATLATLVR



VPKALVYAKKDYHLFSEIINDIDELSYEKAFSYFPVSQTEFENSSNRTIKPLLLF



KISNKDLSFAENFEKGNRQKIGKKNLHTLYFEALMKGNQDTIDIGTGMVFHRVKS



LNYNEKTLKYGHHSTQLNEKFSYPIIKDKRFASDKFLFHLSTEINYKEKRKPLNN



SIIEFLTNNPDINIIGLDRGERHLIYLTLINQKGEILRQKTFNIVGNTNYHEKLN



QREKERDNARKSWATIGKIKELKEGFLSLVIHEIAKIMVENNAIVVLEDLNFGFK



RGRFKVEKQIYQKFEKMLIDKLNYLVFKDKKANEAGGVLKGYQLAEKFESFQKMG



KQSGFLFYVPAAYTSKIDPTTGFVNMLNLNYTNMKDAQTLLSGMDKISFNADANY



FEFELDYEKFKTNQTDHTNKWTICTVGEKRFTYNSATKETTTVNVTEDLKKLLDK



FEVKYSNGDNIKDEICRQTDAKFFEIILWLLKLTMQMRNSNTKTEEDFILSPVKN



SNGEFFRSNDDANGIWPADADANGAYHIALKGLYLVKECFNKNEKSLKIEHKNWF



KFAQTRENGSLTKNG*





SEQ
MENFKNLYPINKTLRFELRPYGKTLENFKKSGLLEKDAFKANSRRSMQAIIDEKF


ID
KETIEERLKYTEFSECDLGNMTSKDKKITDKAATNLKKQVILSFDDEIFNNYLKP


NO:
DKNIDALFKNDPSNPVISTFKGFTTYFVNFFEIRKHIFKGESSGSMAYRIIDENL


11
TTYLNNIEKIKKLPEELKSQLEGIDQIDKLNNYNEFITQSGITHYNEIIGGISKS



ENVKIQGINEGINLYCQKNKVKLPRLTPLYKMILSDRVSNSFVLDTIENDTELIE



MISDLINKTEISQDVIMSDIQNIFIKYKQLGNLPGISYSSIVNAICSDYDNNFGD



GKRKKSYENDRKKHLETNVYSINYISELLTDTDVSSNIKMRYKELEQNYQVCKEN



FNATNWMNIKNIKQSEKTNLIKDLLDILKSIQRFYDLFDIVDEDKNPSAEFYTWL



SKNAEKLDFEFNSVYNKSRNYLTRKQYSDKKIKLNFDSPTLAKGWDANKEIDNST



IIMRKENNDRGDYDYFLGIWNKSTPANEKIIPLEDNGLFEKMQYKLYPDPSKMLP



KQFLSKIWKAKHPTTPEFDKKYKEGRHKKGPDFEKEFLHELIDCFKHGLVNHDEK



YQDVFGFNLRNTEDYNSYTEFLEDVERCNYNLSENKIADTSNLINDGKLYVFQIW



SKDFSIDSKGTKNLNTIYFESLFSEENMIEKMFKLSGEAEIFYRPASLNYCEDII



KKGHHHAELKDKFDYPIIKDKRYSQDKFFFHVPMVINYKSEKLNSKSLNNRTNEN



LGQFTHIIGIDRGERHLIYLTVVDVSTGEIVEQKHLDEIINTDTKGVEHKTHYLN



KLEEKSKTRDNERKSWEAIETIKELKEGYISHVINEIQKLQEKYNALIVMENLNY



GFKNSRIKVEKQVYQKFETALIKKFNYIIDKKDPETYIHGYQLTNPITTLDKIGN



QSGIVLYIPAWNTSKIDPVTGFVNLLYADDLKYKNQEQAKSFIQKIDNIYFENGE



FKFDIDFSKWNNRYSISKTKWTLTSYGTRIQTFRNPQKNNKWDSAEYDLTEEFKL



ILNIDGTLKSQDVETYKKFMSLFKLMLQLRNSVTGTDIDYMISPVTDKTGTHFDS



RENIKNLPADADANGAYNIARKGIMAIENIMNGISDPLKISNEDYLKYIQNQQE





SEQ
MTQFEGFTNLYQVSKTLRFELIPQGKTLKHIQEQGFIEEDKARNDHYKELKPIID


ID
RIYKTYADQCLQLVQLDWENLSAAIDSYRKEKTEETRNALIEEQATYRNAIHDYF


NO:
IGRTDNLTDAINKRHAEIYKGLFKAELFNGKVLKQLGTVTTTEHENALLRSFDKF


12
TTYFSGFYENRKNVFSAEDISTAIPHRIVQDNFPKFKENCHIFTRLITAVPSLRE



HFENVKKAIGIFVSTSIEEVFSFPFYNQLLTQTQIDLYNQLLGGISREAGTEKIK



GLNEVLNLAIQKNDETAHIIASLPHRFIPLFKQILSDRNTLSFILEEFKSDEEVI



QSFCKYKTLLRNENVLETAEALFNELNSIDLTHIFISHKKLETISSALCDHWDTL



RNALYERRISELTGKITKSAKEKVQRSLKHEDINLQEIISAAGKELSEAFKQKTS



EILSHAHAALDQPLPTTLKKQEEKEILKSQLDSLLGLYHLLDWFAVDESNEVDPE



FSARLTGIKLEMEPSLSFYNKARNYATKKPYSVEKFKLNFQMPTLASGWDVNKEK



NNGAILFVKNGLYYLGIMPKQKGRYKALSFEPTEKTSEGFDKMYYDYFPDAAKMI



PKCSTQLKAVTAHFQTHTTPILLSNNFIEPLEITKEIYDLNNPEKEPKKFQTAYA



KKTGDQKGYREALCKWIDFTRDFLSKYTKTTSIDLSSLRPSSQYKDLGEYYAELN



PLLYHISFQRIAEKEIMDAVETGKLYLFQIYNKDFAKGHHGKPNLHTLYWTGLFS



PENLAKTSIKLNGQAELFYRPKSRMKRMAHRLGEKMLNKKLKDQKTPIPDTLYQE



LYDYVNHRLSHDLSDEARALLPNVITKEVSHEIIKDRRFTSDKFFFHVPITLNYQ



AANSPSKFNQRVNAYLKEHPETPIIGIDRGERNLIYITVIDSTGKILEQRSLNTI



QQFDYQKKLDNREKERVAARQAWSVVGTIKDLKQGYLSQVIHEIVDLMIHYQAVV



VLENLNFGFKSKRTGIAEKAVYQQFEKMLIDKLNCLVLKDYPAEKVGGVLNPYQL



TDQFTSFAKMGTQSGFLFYVPAPYTSKIDPLTGFVDPFVWKTIKNHESRKHFLEG



FDFLHYDVKTGDFILHFKMNRNLSFQRGLPGFMPAWDIVFEKNETQFDAKGTPFI



AGKRIVPVIENHRFTGRYRDLYPANELIALLEEKGIVFRDGSNILPKLLENDDSH



AIDTMVALIRSVLQMRNSNAATGEDYINSPVRDLNGVCFDSRFQNPEWPMDADAN



GAYHIALKGQLLLNHLKESKDLKLQNGISNQDWLAYIQELRN*





SEQ
MAVKSIKVKLRLDDMPEIRAGLWKLHKEVNAGVRYYTEWLSLLRQENLYRRSPNG


ID
DGEQECDKTAEECKAELLERLRARQVENGHRGPAGSDDELLQLARQLYELLVPQA


NO:
IGAKGDAQQIARKFLSPLADKDAVGGLGIAKAGNKPRWVRMREAGEPGWEEEKEK


13
AETRKSADRTADVLRALADFGLKPLMRVYTDSEMSSVEWKPLRKGQAVRTWDRDM



FQQAIERMMSWESWNQRVGQEYAKLVEQKNRFEQKNFVGQEHLVHLVNQLQQDMK



EASPGLESKEQTAHYVTGRALRGSDKVFEKWGKLAPDAPFDLYDAEIKNVQRRNT



RRFGSHDLFAKLAEPEYQALWREDASFLTRYAVYNSILRKLNHAKMFATFTLPDA



TAHPIWTRFDKLGGNLHQYTFLFNEFGERRHAIRFHKLLKVENGVAREVDDVTVP



ISMSEQLDNLLPRDPNEPIALYFRDYGAEQHFTGEFGGAKIQCRRDQLAHMHRRR



GARDVYLNVSVRVQSQSEARGERRPPYAAVFRLVGDNHRAFVHFDKLSDYLAEHP



DDGKLGSEGLLSGLRVMSVDLGLRTSASISVFRVARKDELKPNSKGRVPFFFPIK



GNDNLVAVHERSQLLKLPGETESKDLRAIREERQRTLRQLRTQLAYLRLLVRCGS



EDVGRRERSWAKLIEQPVDAANHMTPDWREAFENELQKLKSLHGICSDKEWMDAV



YESVRRVWRHMGKQVRDWRKDVRSGERPKIRGYAKDVVGGNSIEQIEYLERQYKF



LKSWSFFGKVSGQVIRAEKGSRFAITLREHIDHAKEDRLKKLADRIIMEALGYVY



ALDERGKGKWVAKYPPCQLILLEELSEYQFNNDRPPSENNQLMQWSHRGVFQELI



NQAQVHDLLVGTMYAAFSSRFDARTGAPGIRCRRVPARCTQEHNPEPFPWWLNKF



VVEHTLDACPLRADDLIPTGEGEIFVSPFSAEEGDFHQIHADLNAAQNLQQRLWS



DFDISQIRLRCDWGEVDGELVLIPRLTGKRTADSYSNKVFYTNTGVTYYERERGK



KRRKVFAQEKLSEEEAELLVEADEAREKSVVLMRDPSGIINRGNWTRQKEFWSMV



NQRIEGYLVKQIRSRVPLQDSACENTGDI*





SEQ
MATRSFILKIEPNEEVKKGLWKTHEVLNHGIAYYMNILKLIRQEAIYEHHEQDPK


ID
NPKKVSKAEIQAELWDFVLKMQKCNSFTHEVDKDVVFNILRELYEELVPSSVEKK


NO:
GEANQLSNKFLYPLVDPNSQSGKGTASSGRKPRWYNLKIAGDPSWEEEKKKWEED


14
KKKDPLAKILGKLAEYGLIPLFIPFTDSNEPIVKEIKWMEKSRNQSVRRLDKDMF



IQALERFLSWESWNLKVKEEYEKVEKEHKTLEERIKEDIQAFKSLEQYEKERQEQ



LLRDTLNTNEYRLSKRGLRGWREIIQKWLKMDENEPSEKYLEVFKDYQRKHPREA



GDYSVYEFLSKKENHFIWRNHPEYPYLYATFCEIDKKKKDAKQQATFTLADPINH



PLWVRFEERSGSNLNKYRILTEQLHTEKLKKKLTVQLDRLIYPTESGGWEEKGKV



DIVLLPSRQFYNQIFLDIEEKGKHAFTYKDESIKFPLKGTLGGARVQFDRDHLRR



YPHKVESGNVGRIYFNMTVNIEPTESPVSKSLKIHRDDFPKFVNFKPKELTEWIK



DSKGKKLKSGIESLEIGLRVMSIDLGQRQAAAASIFEVVDQKPDIEGKLFFPIKG



TELYAVHRASFNIKLPGETLVKSREVLRKAREDNLKLMNQKLNFLRNVLHFQQFE



DITEREKRVTKWISRQENSDVPLVYQDELIQIRELMYKPYKDWVAFLKQLHKRLE



VEIGKEVKHWRKSLSDGRKGLYGISLKNIDEIDRTRKFLLRWSLRPTEPGEVRRL



EPGQRFAIDQLNHLNALKEDRLKKMANTIIMHALGYCYDVRKKKWQAKNPACQII



LFEDLSNYNPYEERSRFENSKLMKWSRREIPRQVALQGEIYGLQVGEVGAQFSSR



FHAKTGSPGIRCSVVTKEKLQDNRFFKNLQREGRLTLDKIAVLKEGDLYPDKGGE



KFISLSKDRKLVTTHADINAAQNLQKRFWTRTHGFYKVYCKAYQVDGQTVYIPES



KDQKQKIIEEFGEGYFILKDGVYEWGNAGKLKIKKGSSKQSSSELVDSDILKDSF



DLASELKGEKLMLYRDPSGNVFPSDKWMAAGVFFGKLERILISKLTNQYSISTIE



DDSSKQSM*





SEQ
MPTRTINLKLVLGKNPENATLRRALFSTHRLVNQATKRIEEFLLLCRGEAYRTVD


ID
NEGKEAEIPRHAVQEEALAFAKAAQRHNGCISTYEDQEILDVLRQLYERLVPSVN


NO:
ENNEAGDAQAANAWVSPLMSAESEGGLSVYDKVLDPPPVWMKLKEEKAPGWEAAS


15
QIWIQSDEGQSLLNKPGSPPRWIRKLRSGQPWQDDFVSDQKKKQDELTKGNAPLI



KQLKEMGLLPLVNPFFRHLLDPEGKGVSPWDRLAVRAAVAHFISWESWNHRTRAE



YNSLKLRRDEFEAASDEFKDDFTLLRQYEAKRHSTLKSIALADDSNPYRIGVRSL



RAWNRVREEWIDKGATEEQRVTILSKLQTQLRGKFGDPDLFNWLAQDRHVHLWSP



RDSVTPLVRINAVDKVLRRRKPYALMTFAHPRFHPRWILYEAPGGSNLRQYALDC



TENALHITLPLLVDDAHGTWIEKKIRVPLAPSGQIQDLTLEKLEKKKNRLYYRSG



FQQFAGLAGGAEVLFHRPYMEHDERSEESLLERPGAVWFKLTLDVATQAPPNWLD



GKGRVRTPPEVHHFKTALSNKSKHTRTLQPGLRVLSVDLGMRTFASCSVFELIEG



KPETGRAFPVADERSMDSPNKLWAKHERSFKLTLPGETPSRKEEEERSIARAEIY



ALKRDIQRLKSLLRLGEEDNDNRRDALLEQFFKGWGEEDVVPGQAFPRSLFQGLG



AAPFRSTPELWRQHCQTYYDKAEACLAKHISDWRKRTRPRPTSREMWYKTRSYHG



GKSIWMLEYLDAVRKLLLSWSLRGRTYGAINRQDTARFGSLASRLLHHINSLKED



RIKTGADSIVQAARGYIPLPHGKGWEQRYEPCQLILFEDLARYRFRVDRPRRENS



QLMQWNHRAIVAETTMQAELYGQIVENTAAGFSSRFHAATGAPGVRCRFLLERDF



DNDLPKPYLLRELSWMLGNTKVESEEEKLRLLSEKIRPGSLVPWDGGEQFATLHP



KRQTLCVIHADMNAAQNLQRRFFGRCGEAFRLVCQPHGDDVLRLASTPGARLLGA



LQQLENGQGAFELVRDMGSTSQMNRFVMKSLGKKKIKPLQDNNGDDELEDVLSVL



PEEDDTGRITVFRDSSGIFFPCNVWIPAKQFWPAVRAMIWKVMASHSLG*





SEQ
MTKLRHRQKKLTHDWAGSKKREVLGSNGKLQNPLLMPVKKGQVTEFRKAFSAYAR


ID
ATKGEMTDGRKNMFTHSFEPFKTKPSLHQCELADKAYQSLHSYLPGSLAHFLLSA


NO:
HALGFRIFSKSGEATAFQASSKIEAYESKLASELACVDLSIQNLTISTLFNALTT


16
SVRGKGEETSADPLIARFYTLLTGKPLSRDTQGPERDLAEVISRKIASSFGTWKE



MTANPLQSLQFFEEELHALDANVSLSPAFDVLIKMNDLQGDLKNRTIVFDPDAPV



FEYNAEDPADIIIKLTARYAKEAVIKNQNVGNYVKNAITTTNANGLGWLLNKGLS



LLPVSTDDELLEFIGVERSHPSCHALIELIAQLEAPELFEKNVFSDTRSEVQGMI



DSAVSNHIARLSSSRNSLSMDSEELERLIKSFQIHTPHCSLFIGAQSLSQQLESL



PEALQSGVNSADILLGSTQYMLTNSLVEESIATYQRTLNRINYLSGVAGQINGAI



KRKAIDGEKIHLPAAWSELISLPFIGQPVIDVESDLAHLKNQYQTLSNEFDTLIS



ALQKNFDLNFNKALLNRTQHFEAMCRSTKKNALSKPEIVSYRDLLARLTSCLYRG



SLVLRRAGIEVLKKHKIFESNSELREHVHERKHFVFVSPLDRKAKKLLRLTDSRP



DLLHVIDEILQHDNLENKDRESLWLVRSGYLLAGLPDQLSSSFINLPIITQKGDR



RLIDLIQYDQINRDAFVMLVTSAFKSNLSGLQYRANKQSFVVTRTLSPYLGSKLV



YVPKDKDWLVPSQMFEGRFADILQSDYMVWKDAGRLCVIDTAKHLSNIKKSVFSS



EEVLAFLRELPHRTFIQTEVRGLGVNVDGIAFNNGDIPSLKTFSNCVQVKVSRTN



TSLVQTLNRWFEGGKVSPPSIQFERAYYKKDDQIHEDAAKRKIRFQMPATELVHA



SDDAGWTPSYLLGIDPGEYGMGLSLVSINNGEVLDSGFIHINSLINFASKKSNHQ



TKVVPRQQYKSPYANYLEQSKDSAAGDIAHILDRLIYKLNALPVFEALSGNSQSA



ADQVWTKVLSFYTWGDNDAQNSIRKQHWFGASHWDIKGMLRQPPTEKKPKPYIAF



PGSQVSSYGNSQRCSCCGRNPIEQLREMAKDTSIKELKIRNSEIQLFDGTIKLFN



PDPSTVIERRRHNLGPSRIPVADRTFKNISPSSLEFKELITIVSRSIRHSPEFIA



KKRGIGSEYFCAYSDCNSSLNSEANAAANVAQKFQKQLFFEL*





SEQ
MKRILNSLKVAALRLLFRGKGSELVKTVKYPLVSPVQGAVEELAEAIRHDNLHLF


ID
GQKEIVDLMEKDEGTQVYSVVDFWLDTLRLGMFFSPSANALKITLGKFNSDQVSP


NO:
FRKVLEQSPFFLAGRLKVEPAERILSVEIRKIGKRENRVENYAADVETCFIGQLS


17
SDEKQSIQKLANDIWDSKDHEEQRMLKADFFAIPLIKDPKAVTEEDPENETAGKQ



KPLELCVCLVPELYTRGFGSIADFLVQRLTLLRDKMSTDTAEDCLEYVGIEEEKG



NGMNSLLGTFLKNLQGDGFEQIFQFMLGSYVGWQGKEDVLRERLDLLAEKVKRLP



KPKFAGEWSGHRMFLHGQLKSWSSNFFRLFNETRELLESIKSDIQHATMLISYVE



EKGGYHPQLLSQYRKLMEQLPALRTKVLDPEIEMTHMSEAVRSYIMIHKSVAGFL



PDLLESLDRDKDREFLLSIFPRIPKIDKKTKEIVAWELPGEPEEGYLFTANNLFR



NFLENPKHVPRFMAERIPEDWTRLRSAPVWFDGMVKQWQKVVNQLVESPGALYQF



NESFLRQRLQAMLTVYKRDLQTEKFLKLLADVCRPLVDFFGLGGNDIIFKSCQDP



RKQWQTVIPLSVPADVYTACEGLAIRLRETLGFEWKNLKGHEREDFLRLHQLLGN



LLFWIRDAKLVVKLEDWMNNPCVQEYVEARKAIDLPLEIFGFEVPIFLNGYLFSE



LRQLELLLRRKSVMTSYSVKTTGSPNRLFQLVYLPLNPSDPEKKNSNNFQERLDT



PTGLSRRFLDLTLDAFAGKLLTDPVTQELKTMAGFYDHLFGFKLPCKLAAMSNHP



GSSSKMVVLAKPKKGVASNIGFEPIPDPAHPVFRVRSSWPELKYLEGLLYLPEDT



PLTIELAETSVSCQSVSSVAFDLKNLTTILGRVGEFRVTADQPFKLTPIIPEKEE



SFIGKTYLGLDAGERSGVGFAIVTVDGDGYEVQRLGVHEDTQLMALQQVASKSLK



EPVFQPLRKGTFRQQERIRKSLRGCYWNFYHALMIKYRAKVVHEESVGSSGLVGQ



WLRAFQKDLKKADVLPKKGGKNGVDKKKRESSAQDTLWGGAFSKKEEQQIAFEVQ



AAGSSQFCLKCGWWFQLGMREVNRVQESGVVLDWNRSIVTFLIESSGEKVYGFSP



QQLEKGFRPDIETFKKMVRDFMRPPMFDRKGRPAAAYERFVLGRRHRRYRFDKVF



EERFGRSALFICPRVGCGNFDHSSEQSAVVLALIGYIADKEGMSGKKLVYVRLAE



LMAEWKLKKLERSRVEEQSSAQ*





SEQ
MAESKQMQCRKCGASMKYEVIGLGKKSCRYMCPDCGNHTSARKIQNKKKRDKKYG


ID
SASKAQSQRIAVAGALYPDKKVQTIKTYKYPADLNGEVHDSGVAEKIAQAIQEDE


NO:
IGLLGPSSEYACWIASQKQSEPYSVVDFWFDAVCAGGVFAYSGARLLSTVLQLSG


18
EESVLRAALASSPFVDDINLAQAEKFLAVSRRTGQDKLGKRIGECFAEGRLEALG



IKDRMREFVQAIDVAQTAGQRFAAKLKIFGISQMPEAKQWNNDSGLTVCILPDYY



VPEENRADQLVVLLRRLREIAYCMGIEDEAGFEHLGIDPGALSNFSNGNPKRGFL



GRLLNNDIIALANNMSAMTPYWEGRKGELIERLAWLKHRAEGLYLKEPHFGNSWA



DHRSRIFSRIAGWLSGCAGKLKIAKDQISGVRTDLFLLKRLLDAVPQSAPSPDFI



ASISALDRFLEAAESSQDPAEQVRALYAFHLNAPAVRSIANKAVQRSDSQEWLIK



ELDAVDHLEFNKAFPFFSDTGKKKKKGANSNGAPSEEEYTETESIQQPEDAEQEV



NGQEGNGASKNQKKFQRIPRFFGEGSRSEYRILTEAPQYFDMFCNNMRAIFMQLE



SQPRKAPRDFKCFLQNRLQKLYKQTFLNARSNKCRALLESVLISWGEFYTYGANE



KKFRLRHEASERSSDPDYVVQQALEIARRLFLFGFEWRDCSAGERVDLVEIHKKA



ISFLLAITQAEVSVGSYNWLGNSTVSRYLSVAGTDTLYGTQLEEFLNATVLSQMR



GLAIRLSSQELKDGFDVQLESSCQDNLQHLLVYRASRDLAACKRATCPAELDPKI



LVLPVGAFIASVMKMIERGDEPLAGAYLRHRPHSFGWQIRVRGVAEVGMDQGTAL



AFQKPTESEPFKIKPFSAQYGPVLWLNSSSYSQSQYLDGFLSQPKNWSMRVLPQA



GSVRVEQRVALIWNLQAGKMRLERSGARAFFMPVPFSFRPSGSGDEAVLAPNRYL



GLFPHSGGIEYAVVDVLDSAGFKILERGTIAVNGFSQKRGERQEEAHREKQRRGI



SDIGRKKPVQAEVDAANELHRKYTDVATRLGCRIVVQWAPQPKPGTAPTAQTVYA



RAVRTEAPRSGNQEDHARMKSSWGYTWGTYWEKRKPEDILGISTQVYWTGGIGES



CPAVAVALLGHIRATSTQTEWEKEEVVFGRLKKFFPS*





SEQ
MEKRINKIRKKLSADNATKPVSRSGPMKTLLVRVMTDDLKKRLEKRRKKPEVMPQ


ID
VISNNAANNLRMLLDDYTKMKEAILQVYWQEFKDDHVGLMCKFAQPASKKIDQNK


NO:
LKPEMDEKGNLTTAGFACSQCGQPLFVYKLEQVSEKGKAYTNYFGRCNVAEHEKL


19
ILLAQLKPEKDSDEAVTYSLGKFGQRALDFYSIHVTKESTHPVKPLAQIAGNRYA



SGPVGKALSDACMGTIASFLSKYQDIIIEHQKVVKGNQKRLESLRELAGKENLEY



PSVTLPPQPHTKEGVDAYNEVIARVRMWVNLNLWQKLKLSRDDAKPLLRLKGFPS



FPVVERRENEVDWWNTINEVKKLIDAKRDMGRVFWSGVTAEKRNTILEGYNYLPN



ENDHKKREGSLENPKKPAKRQFGDLLLYLEKKYAGDWGKVFDEAWERIDKKIAGL



TSHIEREEARNAEDAQSKAVLTDWLRAKASFVLERLKEMDEKEFYACEIQLQKWY



GDLRGNPFAVEAENRVVDISGFSIGSDGHSIQYRNLLAWKYLENGKREFYLLMNY



GKKGRIRFTDGTDIKKSGKWQGLLYGGGKAKVIDLTFDPDDEQLIILPLAFGTRQ



GREFIWNDLLSLETGLIKLANGRVIEKTIYNKKIGRDEPALFVALTFERREVVDP



SNIKPVNLIGVDRGENIPAVIALTDPEGCPLPEFKDSSGGPTDILRIGEGYKEKQ



RAIQAAKEVEQRRAGGYSRKFASKSRNLADDMVRNSARDLFYHAVTHDAVLVFEN



LSRGFGRQGKRTFMTERQYTKMEDWLTAKLAYEGLTSKTYLSKTLAQYTSKTCSN



CGFTITTADYDGMLVRLKKTSDGWATTLNNKELKAEGQITYYNRYKRQTVEKELS



AELDRLSEESGNNDISKWTKGRRDEALFLLKKRFSHRPVQEQFVCLDCGHEVHAD



EQAALNIARSWLFLNSNSTEFKSYKSGKQPFVGAWQAFYKRRLKEVWKPNA





SEQ ID NO: 20
MKRINKIRRRLVKDSNTKKAGKTGPMKTLLVRVMTPDLRERLENLRKKPENIPQP



ISNTSRANLNKLLTDYTEMKKAILHVYWEEFQKDPVGLMSRVAQPAPKNIDQRKL



IPVKDGNERLTSSGFACSQCCQPLYVYKLEQVNDKGKPHTNYFGRCNVSEHERLI



LLSPHKPEANDELVTYSLGKFGQRALDFYSIHVTRESNHPVKPLEQIGGNSCASG



PVGKALSDACMGAVASFLTKYQDIILEHQKVIKKNEKRLANLKDIASANGLAFPK



ITLPPQPHTKEGIEAYNNVVAQIVIWVNLNLWQKLKIGRDEAKPLQRLKGFPSFP



LVERQANEVDWWDMVCNVKKLINEKKEDGKVFWQNLAGYKRQEALLPYLSSEEDR



KKGKKFARYQFGDLLLHLEKKHGEDWGKVYDEAWERIDKKVEGLSKHIKLEEERR



SEDAQSKAALTDWLRAKASFVIEGLKEADKDEFCRCELKLQKWYGDLRGKPFAIE



AENSILDISGFSKQYNCAFIWQKDGVKKLNLYLIINYFKGGKLRFKKIKPEAFEA



NRFYTVINKKSGEIVPMEVNFNFDDPNLIILPLAFGKRQGREFIWNDLLSLETGS



LKLANGRVIEKTLYNRRTRQDEPALFVALTFERREVLDSSNIKPMNLIGIDRGEN



IPAVIALTDPEGCPLSRFKDSLGNPTHILRIGESYKEKQRTIQAAKEVEQRRAGG



YSRKYASKAKNLADDMVRNTARDLLYYAVTQDAMLIFENLSRGFGRQGKRTFMAE



RQYTRMEDWLTAKLAYEGLPSKTYLSKTLAQYTSKTCSNCGFTITSADYDRVLEK



LKKTATGWMTTINGKELKVEGQITYYNRYKRQNVVKDLSVELDRLSEESVNNDIS



SWTKGRSGEALSLLKKRFSHRPVQEKFVCLNCGFETHADEQAALNIARSWLFLRS



QEYKKYQTNKTTGNTDKRAFVETWQSFYRKKLKEVWKP





SEQ ID NO: 21
atgGGAAAAATGTATTATCTTGGTCTGGATATAGGAACAAATTCTGTTGGATATG



CCGTAACCGACCCATCGTACCATTTGCTCAAATTTAAAGGCGAACCGATGTGGGG



TGCCCACGTGTTTGCTGCGGGGAATCAATCAGCTGAACGGAGAAGCTTTCGTACG



AGCCGCAGACGCCTTGACCGCAGGCAACAGCGTGTCAAACTGGTTCAAGAAATCT



TTGCTCCCGTGATTAGTCCCATTGATCCACGTTTTTTTATCAGACTTCATGAGAG



CGCTTTATGGCGGGATGATGTGGCTGAAACGGATAAACATATTTTCTTTAATGAC



CCGACCTATACGGATAAGGAATATTATTCTGACTATCCAACCATCCATCATCTCA



TTGTGGACCTTATGGAAAGCAGTGAAAAGCATGACCCGCGGCTTGTTTATTTGGC



TGTTGCCTGGCTGGTTGCTCATCGTGGTCATTTCCTCAATGAAGTGGATAAGGAT



AATATTGGGGATGTCCTGAGTTTTGACGCCTTTTATCCTGAGTTTCTGGCATTTC



TTTCCGATAATGGGGTGTCACCTTGGGTATGTGAGTCAAAAGCACTCCAAGCGAC



CCTGCTTTCACGAAACTCCGTCAACGATAAGTATAAAGCCTTGAAGTCTCTGATC



TTTGGCAGCCAAAAGCCGGAGGATAATTTTGATGCCAATATCAGTGAAGATGGAC



TTATCCAACTTTTAGCAGGAAAAAAGGTCAAGGTCAATAAACTTTTTCCTCAAGA



AAGTAATGATGCTTCCTTTACACTCAATGATAAGGAAGATGCAATTGAGGAAATC



TTAGGAACGCTTACACCGGATGAGTGTGAATGGATTGCGCATATTAGGAGGCTGT



TTGATTGGGCCATCATGAAACATGCTCTCAAAGATGGCAGAACAATCTCCGAATC



GAAAGTAAAGCTCTATGAACAGCATCACCATGACTTGACACAGCTCAAGTATTTT



GTGAAGACCTATCTAGCAAAGGAATATGATGACATTTTTCGAAACGTAGATAGTG



AAACAACCAAAAACTATGTCGCATATTCCTATCATGTAAAAGAAGTCAAGGGTAC



ATTGCCCAAAAATAAGGCAACCCAAGAAGAATTTTGCAAGTATGTCCTTGGAAAG



GTAAAGAACATCGAATGCAGTGAAGCTGATAAGGTTGATTTTGATGAAATGATTC



AGCGTCTTACAGACAATTCCTTTATGCCGAAACAAGTATCAGGTGAAAACAGGGT



TATCCCTTACCAGCTTTACTATTATGAACTAAAGACTATTTTGAATAAAGCCGCT



TCTTATCTGCCTTTTTTGACCCAATGCGGAAAAGATGCCATCTCCAATCAAGATA



AGCTCCTTTCCATCATGACCTTTCGGATTCCGTATTTCGTTGGGCCCTTGCGCAA



GGACAATTCAGAGCATGCCTGGCTGGAACGAAAAGCAGGGAAAATCTATCCGTGG



AATTTTAACGACAAAGTTGACCTTGATAAAAGTGAAGAAGCGTTCATTCGGAGAA



TGACGAATACCTGCACTTATTATCCCGGTGAAGATGTTTTGCCACTTGACTCCCT



TATTTATGAAAAATTCATGATCCTCAATGAAATCAATAATATCCGAATTGATGGT



TATCCTATTTCTGTAGATGTAAAACAGCAGGTTTTTGGCCTCTTTGAAAAGAAGA



GAAGAGTGACCGTAAAGGATATCCAGAATCTCCTGCTTTCCTTGGGTGCCTTGGA



TAAGCATGGTAAATTGACGGGAATCGATACTACCATCCATAGCAATTACAATACA



TACCATCATTTTAAATCGCTCATGGAGCGTGGCGTTCTTACTCGTGATGATGTGG



AACGCATTGTGGAGCGTATGACCTATAGTGATGATACAAAACGCGTCCGTCTTTG



GCTGAACAATAATTATGGAACGCTCACTGCTGACGACGTAAAGCATATTTCAAGG



CTCCGAAAGCATGATTTTGGCCGGCTTTCCAAAATGTTCCTCACAGGCCTAAAGG



GAGTTCATAAGGAAACGGGGGAACGAGCTTCCATTTTGGATTTTATGTGGAATAC



CAATGATAACTTGATGCAGCTTTTATCTGAATGTTATACTTTTTCGGATGAAATT



ACCAAGCTGCAGGAAGCATACTATGCCAAGGCGCAGCTTTCCCTGAATGATTTTC



TGGACTCCATGTATATTTCAAATGCTGTCAAACGTCCTATCTATCGAACTCTTGC



CGTTGTAAATGACATACGCAAAGCCTGTGGGACGGCGCCAAAACGCATTTTTATC



GAAATGGCAAGAGATGGGGAAAGCAAAAAGAAAAGGAGCGTAACAAGAAGAGAAC



AAATCAAGAATCTTTATAGGTCCATCCGCAAGGATTTTCAGCAGGAGGTAGATTT



CCTTGAAAAAATCCTTGAAAACAAAAGCGATGGACAGCTGCAAAGCGATGCGCTC



TATCTATACTTTGCGCAGCTTGGAAGGGATATGTATACCGGGGACCCTATCAAGT



TGGAGCATATCAAGGACCAGTCCTTCTATAATATTGATCATATCTATCCCCAAAG



CATGGTCAAGGACGATAGTCTTGATAACAAGGTGTTGGTTCAATCGGAAATTAAT



GGAGAGAAGAGCAGTCGATATCCTCTTGATGCTGCTATCCGTAATAAAATGAAGC



CTCTTTGGGATGCTTATTATAACCATGGCCTGATTTCCCTCAAGAAGTATCAGCG



TTTGACGCGGAGCACTCCCTTTACAGATGATGAAAAGTGGGATTTCATCAATCGG



CAGCTTGTTGAGACAAGACAATCCACGAAGGCCTTGGCAATCTTACTAAAAAGGA



AGTTCCCTGATACGGAGATTGTCTACTCCAAGGCAGGGCTTTCTTCTGATTTTCG



GCATGAGTTTGGTCTCGTAAAATCGAGGAATATCAATGACCTGCACCATGCAAAG



GACGCATTTCTTGCGATTGTAACAGGAAATGTCTATCATGAACGCTTTAATCGCC



GGTGGTTTATGGTGAACCAGCCCTATTCCGTCAAGACCAAGACGTTGTTTACGCA



TTCTATTAAAAATGGTAATTTTGTAGCTTGGAATGGAGAAGAGGATCTTGGCCGC



ATTGTTAAAATGTTAAAGCAAAATAAGAACACTATTCATTTCACGCGGTTCTCTT



TTGATCGAAAGGAAGGCCTGTTTGATATTCAGCCACTAAAAGCGTCAACCGGTCT



TGTACCAAGAAAAGCCGGACTAGACGTGGTAAAATATGGTGGCTATGACAAATCG



ACAGCAGCTTATTATCTCCTTGTTCGATTTACACTAGAAGATAAAAAGACTCAAC



ATAAATTGATGATGATTCCTGTAGAAGGCTTGTATAAAGCTCGAATTGACCATGA



TAAGGAATTCTTAACGGACTATGCACAAACTACAATCAGTGAAATCCTACAAAAA



GATAAACAAAAGGTGATAAATATAATGTTTCCAATGGGAACAAGGCACATTAAAC



TGAATTCCATGATTTCAATCGATGGTTTTTATCTTTCCATTGGAGGAAAGTCTAG



TAAGGGAAAATCGGTGTTGTGTCATGCTATGGTACCTCTTATTGTACCTCATAAG



ATAGAATGTTATATTAAGGCGATGGAGTCTTTTGCACGTAAATTTAAAGAAAATA



ATAAATTAAGGATTGTGGAAAAGTTTGATAAGATTACGGTGGAAGATAACTTGAA



CCTATACGAACTATTTTTACAAAAACTTCAACATAACCCATATAATAAGTTCTTC



TCCACACAATTTGATGTGCTGACTAATGGAAGAAGTACATTTACTAAATTATCTC



CAGAGGAACAAGTTCAAACGTTATTGAATATCTTATCAATTTTTAAAACTTGTCG



GAGCTCTGGCTGCGATTTAAAATCCATTAACGGTTCTGCTCAAGCTGCCAGAATT



ATGATCAGCGCAGATTTAACTGGACTCTCAAAAAAATATTCCGATATTCGGCTTG



TTGAGCAATCAGCATCTGGACTTTTTGTTAGTAAATCACAAAATCTTTTGGAGTA



TTTAtga





SEQ ID NO: 22
atgtcttcattaacaaaatttacaaataaatacagtaagcagctaaccataaaaa



atgaactcatcccagtaggaaagactctcgagaacattaaggaaaacggtctcat



agatggagatgaacagctaaacgagaattatcaaaaagcaaagataatcgttgat



gattttctacgagatttcataaataaagctttaaataatacccaaataggaaatt



ggagagaattagcagatgctttaaataaagaagatgaagataacatagaaaagct



ccaagacaaaatcagaggaataattgtaagtaaattcgagacatttgatttgttt



tcttcttactcgataaagaaagacgaaaagataatagatgatgataatgatgttg



aagaagaggagctagatctaggaaaaaaaacttcctcatttaaatatatttttaa



gaaaaacctttttaaattagtacttccttcttatttaaagacaacaaatcaggat



aaactgaaaataatctcttcttttgataatttttctacctatttcagaggattct



ttgagaacagaaaaaatattttcactaagaagcctatatctacgtcaattgccta



cagaattgtccatgataactttccaaagtttctagataacatcagatgttttaat



gtgtggcaaacagaatgcccacagttaattgtaaaggctgataattatttaaaat



caaagaacgtcatagctaaagataaatctttagcaaactattttactgtaggagc



atatgattacttcttatcccagaatggcattgatttctacaacaacattatcggc



ggtctaccagcatttgctggtcatgagaaaatccaaggacttaatgaatttataa



atcaagaatgccaaaaggacagcgaactaaaatctaaactgaaaaacagacatgc



tttcaaaatggctgttctatttaagcaaattctttcagatagagaaaaaagtttt



gttatagacgagttcgaatctgatgctcaggtcatagatgcggttaagaacttct



atgcagaacaatgtaaggataataatgttatttttaaccttctaaatcttatcaa



gaatatagcgttcttatctgatgatgaattagatggaatttttatagaaggcaag



tatttaagctctgtttcccaaaagctatattcagattggtcgaagcttcgaaatg



atattgaagatagtgcaaacagtaaacaaggaaataaagagttagcaaagaaaat



taaaacaaataaaggcgatgttgaaaaggccataagtaaatatgagttttcttta



tcagaacttaactcaattgtacatgataatacaaaattcagtgaccttctttctt



gtacgttacataaagtggctagcgaaaaactagtgaaagttaatgaaggggactg



gccaaaacacctgaaaaataatgaagaaaaacaaaagataaaagagcctttagat



gcattgttagaaatttataatacattgctgatattcaactgcaagtcatttaata



agaacggtaatttctatgttgattatgacagatgcataaatgagctttctagtgt



tgtttatttatataacaaaacaagaaattactgtacaaagaaaccttataacaca



gacaaattcaaattaaactttaacagtcctcaattaggagagggctttagtaagt



cgaaagaaaatgactgtctgacattattatttaaaaaagacgacaattactatgt



tggaattatcagaaaaggggcaaaaattaactttgatgatacacaagccattgca



gacaatacagataactgtatatttaagatgaattatttcctattaaaagatgcta



aaaagtttattcctaaatgttcaattcagttaaaagaagtaaaagcacattttaa



aaaatcagaggatgattatatcctgagtgacaaagaaaaatttgcctctcccctt



gttattaagaaatcaacatttttattagcaacagcacatgtaaaaggaaagaaag



gaaacataaaaaaattccaaaaggaatattctaaggaaaatccaacagaatatag



aaattctctgaatgaatggattgcattttgtaaagaatttctaaaaacatataag



gcggcaacaatctttgacattacaacgttaaaaaaagctgaagaatatgctgata



ttgttgagttttataaggatgtagataatctttgttataaactagagttttgccc



tattaaaacatctttcattgagaatcttattgataatggggacttatatttattc



agaatcaataataaagatttcagttcaaaatctactggtacaaagaatcttcata



cgctctatcttcaggcaatctttgatgaaagaaacctcaataatcctactattat



gttaaatggcggagcagagttattttatcgaaaagaaagcattgaacagaaaaat



aggataactcataaggcaggatcaattcttgtaaacaaggtttgtaaggatggaa



caagtctagatgacaaaatcagaaacgaaatatatcaatatgaaaacaagtttat



tgatacattgtctgatgaagctaaaaaagttttacctaatgtaataaaaaaagaa



gcaactcacgacataacaaaagataagcgatttacatcagataagttctttttcc



attgcccattaacaattaactataaggaaggagatacaaaacaatttaacaatga



ggttttatctttccttagaggtaatccagacattaatatcatcggaattgacaga



ggagaaagaaaccttatatacgtaactgttattaatcagaaaggcgaaatacttg



acagcgtttcgtttaacacagtaacaaacaagtcgagcaaaattgaacaaactgt



tgattatgaggaaaagcttgctgttagggaaaaagaaagaatagaagcaaaaaga



tcctgggattcaatatcaaagatagcaaccttaaaagaaggttatctatcagcta



ttgttcatgagatatgcctactgatgatcaaacacaacgcaatcgttgtacttga



gaatctaaatgcaggatttaagagaattagaggaggattatcagaaaagtctgtt



tatcagaaattcgagaagatgcttattaacaaactaaattactttgtatctaaaa



aagaatcagactggaataaacctagtggacttttaaatggtttacaactttcaga



ccagttcgagtcatttgagaaattaggaattcaatctgggttcatcttctatgtt



cctgcagcatatacatctaagattgatcctacaacaggatttgcaaatgttctta



acttatccaaggtaagaaatgttgatgcaataaagagttttttcagtaatttcaa



tgaaatttcatatagcaaaaaagaagctctctttaaattctcttttgatttagat



tccttatcaaagaagggcttcagctcatttgtaaaattcagtaaatctaaatgga



atgtatatacatttggagagagaataataaaaccaaagaataagcaagggtatcg



tgaagataagagaattaatttaacatttgaaatgaaaaaacttctgaatgaatat



aaagtaagttttgatcttgaaaacaacttaattccaaatctaacctctgcaaatc



tgaaagataccttctggaaagaactattctttatttttaaaacaactctgcagct



tagaaacagtgtaacaaatggcaaagaagatgtactgatttctccagtaaagaac



gctaaaggagagttctttgtatcaggaactcataacaagacattacctcaagact



gtgatgcaaatggagcatatcatatcgccctaaaaggtctgatgattcttgaacg



taacaatcttgttagagaagaaaaagacacaaagaagataatggcaatttctaat



gttgactggtttgagtatgttcaaaaaaggagaggtgtcctgtaa





SEQ ID NO: 23
ATGAACAACTATGATGAGTTTACCAAACTGTACCCAATACAGAAAACGATAAGGT



TCGAATTGAAGCCGCAGGGAAGAACGATGGAACACCTCGAAACATTCAACTTTTT



CGAAGAGGACAGGGATAGAGCGGAGAAATATAAGATTTTAAAGGAAGCAATCGAC



GAGTATCATAAGAAGTTTATAGACGAACATCTAACAAATATGTCTCTTGACTGGA



ATTCTTTAAAACAGATTTCAGAGAAATACTATAAGAGTAGAGAGGAAAAAGACAA



GAAAGTTTTTCTGTCAGAACAGAAACGCATGAGGCAAGAGATAGTTTCTGAGTTC



AAAAAAGACGATCGGTTTAAAGATCTTTTTTCAAAAAAATTGTTTTCTGAACTTC



TCAAGGAAGAGATTTACAAAAAAGGAAACCATCAGGAAATTGACGCATTGAAAAG



TTTTGATAAATTCTCAGGCTATTTTATTGGGTTGCATGAGAACCGAAAAAATATG



TATTCTGACGGAGACGAGATCACGGCTATCTCTAACCGTATTGTAAATGAGAATT



TCCCGAAGTTCCTCGACAACCTTCAGAAATATCAGGAAGCTCGTAAAAAATATCC



AGAGTGGATCATTAAGGCAGAATCTGCTTTAGTTGCACATAATATCAAGATGGAT



GAAGTCTTTTCCTTAGAGTATTTCAACAAAGTCCTGAATCAAGAAGGAATACAGA



GATACAATCTCGCCCTAGGTGGCTATGTGACCAAAAGTGGTGAGAAAATGATGGG



GCTTAATGATGCACTTAATCTTGCCCATCAAAGTGAAAAAAGCAGCAAGGGAAGG



ATACACATGACTCCACTCTTCAAACAGATTCTGAGTGAAAAAGAGTCCTTTTCTT



ATATACCAGATGTTTTTACAGAAGACTCTCAACTTTTACCATCCATTGGTGGGTT



CTTTGCACAAATAGAAAATGATAAGGACGGGAATATTTTTGACAGAGCATTAGAA



TTGATATCTTCTTATGCAGAATACGATACAGAAAGGATATATATCAGGCAAGCGG



ACATAAACAGAGTTTCTAATGTTATTTTCGGGGAGTGGGGAACACTGGGGGGGTT



AATGAGGGAATACAAAGCAGACTCTATCAACGACATCAATTTGGAGAGAACATGC



AAGAAGGTAGACAAGTGGCTCGACTCAAAGGAGTTTGCGTTATCAGATGTATTAG



AGGCAATAAAAAGAACCGGCAATAATGATGCTTTTAATGAATATATCTCAAAGAT



GCGCACTGCCAGGGAAAAGATTGACGCTGCAAGAAAGGAAATGAAATTCATTTCG



GAAAAAATATCTGGAGACGAAGAATCGATCCATATTATCAAAACCTTATTGGACT



CGGTGCAACAGTTTTTACATTTTTTCAATTTATTCAAAGCGCGTCAGGACATTCC



TCTTGATGGAGCATTCTATGCGGAGTTCGATGAAGTCCATAGCAAACTGTTTGCT



ATTGTTCCGTTGTATAATAAGGTTAGGAACTATCTTACGAAAAATAACCTTAACA



CGAAAAAGATAAAGCTAAACTTCAAGAATCCAACTCTGGCAAACGGATGGGATCA



AAACAAGGTATATGACTACGCCTCCTTAATCTTTCTCCGCGATGGTAATTATTAT



CTCGGAATAATAAATCCAAAAAGGAAAAAGAATATTAAATTCGAACAAGGGTCTG



GAAATGGCCCATTCTACCGGAAGATGGTGTACAAACAAATTCCAGGGCCGAACAA



GAACTTACCAAGAGTCTTCCTCACATCTACGAAAGGCAAAAAAGAGTACAAGCCG



TCAAAGGAGATAATAGAAGGATATGAAGCGGACAAACACATAAGAGGAGATAAAT



TCGATCTGGATTTCTGTCATAAGCTGATAGACTTCTTCAAGGAATCCATCGAGAA



GCACAAGGACTGGAGTAAGTTCAACTTCTATTTCTCTCCAACTGAATCATATGGA



GACATCAGCGAATTCTATCTGGATGTAGAAAAACAGGGATACCGGATGCATTTTG



AGAATATTTCTGCCGAGACGATTGATGAGTATGTCGAAAAGGGGGACTTATTCCT



CTTCCAGATATACAACAAAGACTTTGTGAAAGCGGCAACCGGAAAAAAAGATATG



CACACCATTTATTGGAACGCGGCATTCTCGCCCGAGAACCTTCAGGATGTGGTAG



TGAAACTGAACGGTGAAGCAGAACTTTTCTACAGAGACAAGAGCGACATCAAGGA



GATAGTTCACAGGGAGGGAGAGATACTGGTCAATCGTACCTACAACGGCAGGACA



CCTGTGCCTGACAAGATCCACAAAAAATTAACAGATTATCATAATGGCCGTACCA



AAGATCTCGGAGAAGCAAAAGAATACCTCGATAAGGTCAGATATTTCAAAGCGCA



CTACGACATCACAAAGGATCGCAGATACCTGAATGATAAAATATACTTCCATGTG



CCTCTGACATTGAATTTCAAAGCAAACGGGAAGAAGAATCTCAATAAGATGGTAA



TTGAAAAGTTCCTCTCGGACGAAAAAGCGCATATTATTGGGATTGATCGCGGGGA



AAGGAATCTTCTTTACTATTCTATCATTGACAGGTCAGGTAAAATAATCGATCAA



CAGAGCCTCAACGTCATCGATGGATTCGATTACCGAGAGAAACTGAATCAGAGGG



AGATCGAGATGAAGGATGCCAGACAAAGCTGGAATGCTATCGGGAAGATAAAGGA



CCTCAAGGAAGGGTATCTTTCAAAAGCGGTCCACGAAATTACCAAGATGGCGATA



CAATACAATGCCATTGTTGTCATGGAGGAACTCAATTATGGGTTCAAACGCGGAC



GTTTCAAAGTTGAGAAGCAGATATATCAGAAATTCGAGAATATGCTGATTGACAA



GATGAATTATCTGGTATTCAAGGATGCTCCGGATGAAAGTCCGGGAGGAGTCCTC



AATGCATATCAGCTTACTAATCCGCTTGAAAGTTTCGCTAAACTTGGGAAACAGA



CAGGAATTCTTTTCTATGTTCCGGCAGCCTATACTTCGAAGATAGATCCGACGAC



CGGGTTTGTCAATCTTTTCAATACTTCAAGTAAAACGAACGCACAGGAAAGAAAA



GAATTCTTGCAAAAATTCGAGTCGATCTCCTATTCCGCTAAAGACGGAGGAATAT



TCGCATTCGCGTTCGATTATCGGAAGTTCGGAACGTCAAAAACAGACCACAAAAA



TGTATGGACCGCATACACGAACGGGGAAAGGATGAGGTACATAAAAGAGAAAAAA



CGCAACGAACTGTTCGACCCCTCGAAGGAGATCAAAGAGGCTCTCACTTCATCAG



GAATCAAATATGACGGCGGACAGAACATATTGCCAGATATCCTGAGGAGCAACAA



TAACGGTCTGATCTACACAATGTATTCCTCTTTCATAGCGGCCATTCAAATGAGG



GTCTATGACGGGAAAGAAGACTATATCATCTCGCCGATAAAGAACAGCAAGGGAG



AGTTCTTCAGGACCGATCCGAAAAGAAGGGAACTTCCGATAGACGCGGATGCGAA



CGGCGCGTATAACATTGCTCTCAGGGGCGAATTGACGATGCGTGCGATAGCGGAG



AAGTTCGATCCGGACTCGGAAAAGATGGCGAAGCTAGAACTGAAACATAAGGACT



GGTTCGAATTCATGCAGACAAGGGGGGATTGA





SEQ ID NO: 24
ATGACAAAAACATTTGATTCAGAATTTTTTAATTTATATTCTCTTCAAAAAACAG



TTCGTTTTGAACTCAAGCCGGTTGGTGAAACAGCCTCGTTTGTTGAAGATTTTAA



AAACGAAGGTTTGAAACGAGTTGTTTCAGAGGATGAACGGCGTGCGGTTGATTAC



CAAAAAGTGAAAGAAATTATTGATGACTACCACCGAGATTTTATTGAAGAATCGC



TGAACTATTTTCCTGAGCAGGTCTCAAAAGACGCTTTGGAACAAGCTTTTCACCT



TTATCAAAAACTAAAAGCCGCTAAGGTTGAAGAGCGTGAAAAAGCATTGAAAGAA



TGGGAAGCCCTTCAGAAAAAACTGCGCGAAAAAGTTGTTAAATGTTTTTCAGATT



CAAACAAAGCACGCTTTTCCCGCATTGATAAAAAAGAACTGATTAAAGAAGATTT



AATTAACTGGTTGGTTGCACAAAATCGCGAAGATGACATTCCAACCGTTGAAACC



TTTAACAACTTTACGACTTATTTTACGGGGTTTCATGAAAACCGAAAAAACATTT



ATTCAAAAGACGATCATGCCACAGCCATTTCATTTCGACTCATTCATGAAAACCT



GCCTAAGTTTTTTGATAATGTGATCAGCTTTAATAAATTGAAGGAAGGATTTCCA



GAGCTGAAATTTGATAAGGTTAAGGAAGATTTAGAAGTTGATTATGACTTGAAAC



ATGCCTTTGAAATCGAATACTTTGTCAATTTTGTTACCCAAGCCGGAATTGACCA



ATATAACTATCTTTTGGGGGGTAAAACCTTAGAAGACGGCACCAAAAAGCAAGGC



ATGAATGAACAAATCAATCTGTTCAAGCAACAGCAAACCCGAGACAAAGCCCGAC



AAATTCCCAAACTCATACCATTGTTTAAACAAATTCTAAGCGAACGAACGGAAAG



CCAATCGTTTATTCCAAAACAATTTGAATCAGACCAAGAGCTATTTGACTCACTG



CAAAAACTGCATAACAACTGCCAAGATAAATTTACCGTACTGCAACAAGCCATTT



TAGGCTTAGCCGAAGCAGATCTGAAAAAAGTATTCATTAAAACATCTGATCTTAA



TGCGCTATCAAATACCATTTTTGGAAATTACAGTGTGTTTTCGGATGCGTTGAAT



TTATACAAAGAATCGCTCAAAACAAAAAAGGCGCAAGAAGCGTTTGAAAAACTAC



CCGCTCACAGCATTCATGACTTGATTCAATATTTGGAGCAATTTAATAGCTCTTT



GGATGCAGAAAAACAGCAATCAACTGACACCGTACTGAATTACTTTATTAAAACA



GACGAGCTGTATTCTCGGTTCATAAAATCAACGAGCGAAGCCTTCACACAAGTAC



AACCACTCTTTGAATTGGAAGCATTAAGCTCAAAACGTCGTCCACCGGAAAGTGA



AGACGAAGGCGCAAAAGGTCAGGAAGGGTTTGAGCAAATTAAACGCATAAAAGCC



TATTTGGATACCTTGATGGAGGCGGTGCATTTTGCAAAACCACTTTATCTGGTGA



AGGGGCGCAAAATGATTGAAGGTCTGGACAAAGACCAAAGTTTCTATGAAGCCTT



TGAAATGGCTTACCAAGAACTAGAAAGTCTGATTATTCCAATCTACAACAAAGCT



CGTAGTTATTTAAGTCGTAAACCGTTTAAAGCGGACAAATTCAAAATTAATTTTG



ATAATAATACATTGCTTTCCGGTTGGGATGCTAATAAAGAAACGGCTAACGCTTC



AATTTTGTTTAAGAAGGATGGTTTGTATTATTTAGGAATCATGCCTAAAGGAAAA



ACGTTTTTGTTCGATTACTTCGTTTCATCGGAAGATTCTGAAAAGTTAAAACAAA



GAAGACAAAAAACCGCCGAAGAAGCGCTTGCGCAAGATGGCGAAAGCTACTTTGA



AAAAATTCGTTACAAGCTGTTACCTGGCGCCAGCAAAATGTTGCCGAAAGTATTT



TTTTCCAACAAAAACATAGGGTTTTACAACCCAAGTGATGACATACTTCGTATCA



GGAATACAGCCTCTCACACTAAAAACGGAACACCGCAAAAAGGGCACTCTAAAGT



AGAGTTTAATTTGAATGATTGTCATAAGATGATTGATTTCTTTAAATCAAGCATT



CAAAAGCATCCAGAGTGGGGAAGTTTTGGATTCACCTTTTCAGATACATCAGATT



TTGAAGATATGAGCGCCTTTTATCGAGAAGTCGAAAACCAAGGTTATGTCATTAG



TTTCGATAAAATAAAAGAAACTTACATTCAGAGTCAAGTTGAACAGGGGAACCTA



TATTTATTCCAAATCTACAATAAAGACTTCTCGCCCTACAGCAAAGGCAAACCAA



ATTTACACACGCTTTACTGGAAAGCGTTGTTTGAGGAAGCCAACCTAAATAATGT



GGTGGCAAAACTCAATGGTGAAGCTGAAATTTTCTTTAGGCGACACTCAATCAAA



GCATCTGATAAAGTGGTGCACCCAGCGAATCAAGCCATTGACAATAAAAACCCGC



ATACCGAAAAAACGCAAAGCACCTTTGAATATGATCTTGTAAAAGACAAGCGCTA



TACCCAAGACAAATTCTTCTTCCATGTACCGATTTCATTGAACTTTAAGGCACAA



GGTGTTTCAAAATTTAACGATAAAGTGAATGGATTTTTAAAGGGTAACCCAGATG



TCAATATTATTGGCATTGACCGAGGCGAACGACACCTTCTGTATTTCACTGTGGT



GAATCAGAAAGGTGAAATTTTGGTTCAAGAGTCGCTTAATACCCTAATGAGTGAT



AAAGGGCATGTGAATGACTACCAGCAAAAACTCGACAAAAAAGAACAAGAACGCG



ATGCCGCTCGCAAAAGCTGGACGACGGTTGAAAATATCAAAGAATTAAAAGAAGG



CTATTTATCTCATGTTGTTCATAAGTTGGCACACCTGATTATTAAATACAATGCC



ATTGTTTGCTTGGAAGACCTGAATTTTGGTTTCAAACGCGGGCGTTTTAAAGTGG



AAAAACAAGTTTATCAGAAATTTGAAAAAGCGCTTATTGATAAGCTTAACTACTT



GGTATTTAAAGAAAAAGAGTTAGGCGAGGTGGGCCATTATCTAACCGCCTATCAG



TTGACCGCACCGTTTGAAAGTTTCAAGAAGTTAGGCAAGCAAAGTGGCATATTGT



TTTATGTTCCGGCGGATTACACCTCCAAAATTGACCCAACCACCGGGTTTGTCAA



CTTTCTTGATCTGCGTTATCAGAGTGTCGAAAAAGCCAAACAGCTCTTAAGCGAC



TTTAATGCCATTCGTTTTAATTCAGTACAAAACTATTTTGAGTTCGAAATAGATT



ACAAAAAACTCACACCCAAACGTAAAGTTGGTACTCAGAGTAAATGGGTGATTTG



TACCTATGGAGATGTCCGCTATCAAAATCGGCGTAATCAAAAAGGTCACTGGGAA



ACGGAAGAAGTCAATGTGACTGAAAAACTAAAAGCCCTTTTCGCCAGTGATTCCA



AAACTACAACCGTAATCGATTACGCCAATGACGACAACCTAATTGACGTCATTCT



GGAACAGGACAAAGCCAGCTTCTTCAAAGAACTGTTATGGTTATTAAAACTCACC



ATGACGCTCCGCCACAGCAAAATCAAAAGTGAAGACGACTTTATTCTTTCACCCG



TTAAAAACGAACAAGGCGAGTTTTACGATAGTCGAAAAGCGGGCGAGGTGTGGCC



TAAAGATGCAGACGCCAATGGCGCTTATCACATAGCGTTGAAAGGCTTGTGGAAT



CTGCAACAGATCAATCAGTGGGAAAAGGGTAAAACACTTAATCTGGCGATTAAAA



ACCAGGATTGGTTCAGTTTTATTCAAGAAAAGCCCTATCAAGAATAA





SEQ ID NO: 25
ATGCACACAGGCGGATTACTTAGCATGGATGCCAAGGAGTTTACCGGACAGTACC



CCCTTTCGAAGACTCTGCGTTTTGAACTGAGACCGATAGGCAGAACGTGGGACAA



TCTCGAAGCATCGGGGTATCTTGCGGAGGACAGACACCGTGCAGAATGCTATCCC



AGGGCAAAAGAGCTCTTGGACGACAACCATCGTGCATTCCTCAACCGTGTCCTGC



CTCAGATCGATATGGATTGGCACCCGATCGCAGAGGCATTCTGCAAAGTCCACAA



GAATCCGGGAAACAAGGAATTGGCTCAGGATTACAATCTTCAGCTGTCCAAACGC



AGAAAGGAGATTTCGGCCTATCTGCAGGATGCGGACGGCTATAAAGGTCTGTTTG



CCAAACCTGCATTGGATGAAGCAATGAAGATCGCGAAAGAAAACGGAAATGAATC



GGACATAGAGGTTCTTGAGGCATTCAACGGTTTCTCCGTATACTTCACCGGATAT



CATGAGAGCAGGGAGAACATCTATTCGGACGAGGATATGGTGTCGGTAGCTTATC



GCATCACCGAAGACAATTTCCCGAGATTCGTTTCCAATGCGCTTATATTCGATAA



GCTGAATGAGTCGCACCCCGATATAATCTCGGAAGTATCCGGAAATCTGGGCGTA



GACGACATCGGAAAATATTTTGATGTGTCTAACTACAATAATTTCCTGTCGCAGG



CCGGTATAGATGACTACAATCACATCATCGGCGGCCATACGACGGAGGACGGTCT



GATCCAGGCATTCAATGTTGTTCTGAATCTCAGGCATCAGAAAGACCCCGGATTC



GAAAAAATCCAATTCAAACAGCTGTACAAACAGATACTCAGCGTCCGTACATCCA



AATCCTATATCCCGAAACAGTTCGATAATTCGAAGGAGATGGTGGACTGCATCTG



CGACTATGTGTCCAAGATCGAAAAATCCGAAACGGTCGAGAGAGCATTGAAGCTG



GTAAGGAACATATCTTCTTTTGATTTGCGCGGAATATTCGTAAACAAGAAGAATC



TCCGCATTCTTTCCAACAAACTGATTGGTGATTGGGACGCGATCGAAACCGCGCT



GATGCACTCCTCCTCTTCGGAAAATGATAAGAAATCCGTCTACGACAGCGCCGAG



GCATTTACGCTGGATGATATCTTTTCGTCCGTTAAAAAATTCTCAGATGCATCTG



CAGAGGATATCGGAAACCGGGCGGAGGACATATGCAGAGTCATATCTGAGACCGC



TCCGTTCATAAACGATCTGAGGGCTGTCGATTTGGACAGTTTGAATGACGACGGT



TACGAGGCGGCGGTTTCCAAGATAAGGGAATCTCTGGAACCATATATGGATCTGT



TTCATGAACTGGAGATATTCTCCGTAGGCGATGAATTCCCGAAATGTGCAGCTTT



CTACAGTGAACTTGAAGAAGTCTCCGAACAGCTAATCGAGATTATACCGTTATTC



AACAAGGCCCGTTCGTTCTGTACGCGCAAGAGATACAGTACGGACAAGATAAAGG



TCAATTTGAAATTCCCGACACTCGCCGACGGATGGGATCTCAACAAAGAACGCGA



CAACAAAGCCGCAATACTCAGGAAAGACGGAAAGTACTACCTGGCCATACTGGAT



ATGAAGAAAGATCTTTCTTCGATCAGAACTTCGGATGAAGACGAATCCAGTTTTG



AGAAAATGGAGTACAAGCTTCTTCCGAGTCCGGTAAAGATGCTGCCAAAGATCTT



CGTAAAATCGAAGGCGGCCAAGGAGAAGTACGGTCTGACCGACCGTATGCTGGAG



TGCTACGATAAAGGGATGCACAAGAGCGGCAGTGCATTCGATCTCGGATTTTGTC



ACGAATTGATCGATTACTACAAGAGGTGCATCGCAGAATATCCCGGCTGGGACGT



CTTCGATTTCAAGTTCAGGGAAACATCGGATTATGGCAGCATGAAGGAGTTCAAT



GAGGATGTTGCAGGGGCCGGATACTATATGTCCCTCAGAAAGATCCCTTGTTCGG



AGGTCTACAGGCTTCTTGATGAGAAATCGATATATCTTTTCCAGATCTACAACAA



AGATTATTCGGAAAACGCTCATGGGAATAAGAACATGCATACCATGTATTGGGAA



GGGCTCTTTTCCCCCCAGAATCTGGAATCCCCTGTGTTTAAACTCAGCGGCGGTG



CGGAGCTTTTCTTCCGTAAATCCTCCATACCCAATGACGCCAAAACGGTCCATCC



GAAGGGAAGCGTCCTGGTTCCGCGCAATGATGTAAACGGCCGCAGGATACCTGAC



AGCATATATCGGGAGCTCACCAGATATTTCAACCGCGGAGATTGCCGCATAAGCG



ACGAGGCAAAGAGTTATCTGGACAAGGTGAAAACCAAGAAAGCTGACCACGATAT



CGTGAAAGACAGGAGGTTCACGGTGGACAAGATGATGTTCCACGTCCCTATCGCC



ATGAATTTCAAAGCGATTTCGAAGCCGAATCTCAATAAAAAGGTGATTGACGGCA



TAATCGACGACCAAGATCTGAAGATCATCGGCATAGACCGCGGAGAGCGCAACCT



CATCTACGTAACCATGGTGGATCGCAAAGGGAACATCCTCTATCAGGATAGCCTC



AATATTCTGAACGGATACGATTACCGTAAGGCCCTCGACGTCCGCGAATATGACA



ATAAAGAGGCTCGGAGGAACTGGACGAAGGTCGAAGGCATCCGTAAGATGAAAGA



GGGGTATCTGTCGCTTGCAGTCAGCAAATTGGCAGATATGATCATAGAGAACAAT



GCGATTATCGTCATGGAGGATCTCAATCACGGATTCAAGGCAGGGCGTTCGAAGA



TAGAGAAACAGGTCTATCAGAAGTTCGAATCCATGCTCATAAACAAACTCGGTTA



CATGGTCCTCAAGGATAAGTCTATCGATCAGAGCGGCGGAGCTCTCCACGGATAC



CAGCTTGCCAACCATGTGACAACATTGGCATCTGTAGGTAAACAATGTGGAGTGA



TATTCTACATCCCTGCTGCATTTACATCCAAGATAGATCCGACAACAGGATTTGC



AGATCTGTTCGCCCTCAGCAATGTTAAAAACGTGGCATCTATGAGAGAATTTTTC



TCCAAGATGAAGTCTGTAATCTATGATAAGGCGGAGGGAAAATTCGCATTTACCT



TCGACTATCTTGATTATAATGTGAAATCCGAGTGCGGAAGGACCCTTTGGACCGT



GTATACGGTCGGAGAGAGATTCACATACAGCAGGGTCAATAGAGAATATGTCAGA



AAAGTTCCGACAGACATAATCTACGACGCATTGCAAAAGGCAGGAATATCTGTTG



AAGGGGATCTCAGGGACAGGATTGCTGAATCGGATGGCGACACTCTGAAGAGCAT



ATTCTATGCATTCAAGTATGCATTGGATATGAGAGTAGAGAACCGCGAAGAGGAT



TACATACAGTCTCCTGTCAAAAATGCCTCCGGAGAATTCTTCTGTTCCAAGAACG



CAGGCAAATCGCTCCCTCAGGATTCCGATGCGAACGGTGCATACAATATCGCACT



CAAGGGGATCCTGCAGCTACGTATGCTTTCCGAGCAGTATGATCCGAATGCAGAG



AGCATACGGTTGCCACTGATAACCAACAAGGCCTGGCTGACCTTTATGCAGTCCG



GTATGAAGACATGGAAGAACTGA





SEQ ID NO: 26
atgGATAGTTTGAAAGATTTCACCAATCTGTACCCTGTCAGTAAGACATTGAGAT



TTGAATTAAAGCCCGTTGGAAAGACTTTAGAAAATATCGAGAAAGCAGGTATTTT



GAAAGAGGATGAGCATCGTGCAGAAAGTTATCGGAGGGTGAAGAAAATAATTGAT



ACTTATCATAAGGTATTTATCGATTCTTCTCTTGAAAATATGGCTAAAATGGGTA



TTGAGAATGAAATAAAAGCAATGCTCCAAAGTTTCTGCGAATTGTATAAAAAAGA



TCATCGCACTGAGGGTGAAGACAAGGCATTAGATAAAATTCGAGCAGTACTTCGT



GGCCTGATTGTTGGGGCTTTCACTGGTGTTTGCGGAAGACGGGAAAATACAGTCC



AAAACGAGAAGTACGAGAGTTTGTTCAAAGAAAAGTTGATAAAAGAAATTTTACC



TGATTTTGTGCTCTCTACTGAGGCTGAAAGCTTGCCTTTCTCTGTTGAAGAAGCT



ACGAGGTCACTGAAGGAGTTTGATAGCTTTACATCCTACTTTGCTGGTTTTTACG



AGAATAGAAAGAATATATACTCGACGAAACCTCAATCCACTGCCATTGCTTATCG



TCTTATTCATGAGAACTTGCCGAAGTTCATTGATAATATTCTTGTTTTTCAGAAG



ATCAAAGAGCCTATAGCCAAAGAGCTGGAACATATTCGTGCGGACTTTTCTGCCG



GGGGGTACATAAAAAAGGATGAGAGATTGGAGGATATTTTTTCGTTGAACTATTA



TATCCACGTGTTATCTCAGGCTGGGATCGAAAAATATAACGCATTGATTGGGAAG



ATTGTGACAGAAGGAGATGGAGAGATGAAAGGGCTCAATGAACACATCAACCTTT



ACAACCAACAAAGAGGCAGAGAGGATCGGCTCCCTCTTTTTAGGCCTCTTTATAA



ACAGATATTGAGTGACAGAGAGCAATTATCATACTTGCCTGAGAGTTTTGAAAAA



GATGAGGAGCTCCTCAGGGCTCTAAAAGAGTTCTATGATCATATCGCAGAAGACA



TTCTCGGACGTACTCAACAGTTGATGACTTCTATTTCAGAATATGATTTATCTCG



GATATACGTAAGGAACGATAGCCAATTGACTGATATATCAAAAAAAATGTTGGGA



GATTGGAATGCTATCTACATGGCTAGAGAACGAGCATATGACCACGAGCAGGCTC



CCAAAAGAATCACGGCGAAATACGAGAGGGACAGGATTAAAGCTCTTAAAGGAGA



AGAGAGTATAAGTCTGGCAAATCTTAATAGTTGTATTGCCTTTCTGGACAATGTT



AGAGATTGCCGTGTAGATACTTATCTTTCCACACTGGGCCAGAAGGAAGGACCAC



ATGGTCTATCTAATCTCGTTGAGAACGTTTTTGCCTCATACCATGAAGCAGAGCA



ATTGTTGAGCTTTCCATACCCCGAAGAGAATAATCTGATTCAGGACAAGGACAAT



GTGGTGTTAATTAAGAATCTTCTCGACAATATCAGTGATCTGCAGAGGTTCTTGA



AACCTCTTTGGGGTATGGGAGACGAACCCGATAAAGATGAAAGATTTTATGGAGA



GTATAATTATATCCGAGGAGCTCTAGATCAGGTGATCCCTCTGTACAATAAGGTA



AGGAACTACCTCACTCGGAAGCCTTATTCGACCAGAAAAGTAAAACTCAATTTTG



GGAATTCTCAATTGCTTAGTGGTTGGGATAGAAATAAGGAAAAGGATAATAGCTG



TGTGATTTTGCGTAAGGGGCAGAACTTCTATTTGGCTATTATGAACAATAGGCAC



AAAAGAAGTTTCGAAAACAAGGTGTTGCCCGAGTATAAGGAGGGAGAACCTTACT



TCGAAAAGATGGATTATAAATTTTTGCCTGATCCTAATAAAATGCTTCCTAAGGT



TTTTCTTTCGAAAAAAGGAATAGAGATATACAAACCAAGTCCGAAGCTTTTAGAA



CAATATGGACATGGAACTCACAAAAAGGGAGATACCTTTAGTATGGATGATTTGC



ACGAACTGATCGATTTCTTCAAACACTCAATCGAGGCTCATGAAGATTGGAAGCA



ATTCGGATTCAAATTTTCTGATACGGCTACTTATGAGAATGTATCTAGTTTCTAT



AGAGAAGTTGAGGATCAGGGGTATAAGCTCTCTTTCCGAAAAGTTTCGGAATCTT



ATGTCTATTCATTAATAGATCAAGGCAAGTTGTATTTATTTCAGATATACAACAA



GGACTTTTCTCCCTGCAGCAAAGGGACACCTAATCTGCATACCTTGTATTGGAGA



ATGCTTTTTGACGAGCGCAATTTGGCAGATGTCATATACAAACTGGATGGGAAGG



CTGAAATCTTTTTCCGAGAGAAGAGTTTGAAAAATGATCATCCCACGCATCCGGC



TGGTAAGCCTATCAAAAAGAAAAGTCGACAAAAAAAAGGAGAGGAGAGTCTGTTT



GAGTATGATTTAGTCAAGGATAGGCACTATACGATGGATAAGTTCCAGTTTCATG



TGCCTATTACTATGAATTTTAAATGTTCTGCAGGAAGCAAAGTCAATGATATGGT



TAATGCTCATATTCGAGAGGCAAAGGATATGCATGTCATTGGAATTGATCGTGGA



GAACGCAATCTGCTGTATATATGCGTGATAGATAGTCGAGGGACGATTTTGGATC



AAATTTCTCTGAATACGATTAACGATATAGACTATCATGATTTATTGGAGAGTCG



AGACAAAGACCGTCAGCAGGAGCGCCGAAACTGGCAAACTATCGAAGGGATCAAG



GAGCTAAAACAAGGCTACCTTAGTCAGGCGGTTCATCGGATAGCCGAACTGATGG



TGGCTTATAAGGCTGTAGTTGCTTTGGAGGATTTGAATATGGGGTTCAAACGTGG



GCGGCAGAAAGTAGAAAGTTCTGTTTATCAGCAGTTTGAGAAACAGCTGATAGAT



AAGCTCAACTATCTTGTGGACAAGAAGAAAAGGCCTGAAGATATTGGAGGATTGT



TGAGAGCCTATCAATTTACGGCCCCATTTAAGAGTTTTAAGGAAATGGGAAAGCA



AAACGGCTTCTTGTTTTATATCCCGGCTTGGAACACGAGCAACATAGATCCGACT



ACTGGATTTGTTAATTTATTTCATGCCCAGTATGAAAATGTAGATAAAGCGAAGA



GCTTCTTTCAAAAGTTTGATTCAATTAGTTACAACCCGAAGAAAGACTGGTTTGA



GTTTGCATTCGATTATAAAAACTTTACTAAAAAGGCTGAAGGAAGTCGTTCTATG



TGGATATTATGCACACATGGTTCCCGAATAAAGAATTTTAGAAATTCCCAGAAGA



ATGGTCAATGGGATTCCGAAGAATTCGCCTTGACGGAGGCTTTTAAGTCTCTTTT



TGTGCGATATGAGATAGATTATACCGCTGATTTGAAAACAGCTATTGTGGACGAA



AAGCAAAAAGACTTCTTCGTGGATCTTCTGAAGCTATTCAAATTGACAGTACAGA



TGCGCAACAGCTGGAAAGAGAAGGATTTGGATTATCTAATCTCTCCTGTAGCAGG



GGCTGATGGCCGTTTCTTCGATACAAGAGAGGGAAATAAAAGTCTGCCTAAGGAT



GCAGATGCCAATGGAGCTTATAATATTGCCCTAAAAGGACTTTGGGCTCTACGCC



AGATTCGGCAAACTTCAGAAGGCGGTAAACTCAAATTGGCGATTTCCAATAAGGA



ATGGCTACAGTTTGTGCAAGAGAGATCTTACGAGAAAGACtga





SEQ ID NO: 27
atgaataatggaacaaataactttcagaattttatcggaatttcttctttgcaga



agactcttaggaatgctctcattccaaccgaaacaacacagcaatttattgttaa



aaacggaataattaaagaagatgagctaagaggagaaaatcgtcagatacttaaa



gatatcatggatgattattacagaggtttcatttcagaaactttatcgtcaattg



atgatattgactggacttctttatttgagaaaatggaaattcagttaaaaaatgg



agataacaaagacactcttataaaagaacagactgaataccgtaaggcaattcat



aaaaaatttgcaaatgatgatagatttaaaaatatgttcagtgcaaaattaatct



cagatattcttcctgaatttgtcattcataacaataattattctgcatcagaaaa



ggaagaaaaaacacaggtaattaaattattttccagatttgcaacgtcattcaag



gactattttaaaaacagggctaattgtttttcggctgatgatatatcttcatctt



cttgtcatagaatagttaatgataatgcagagatattttttagtaatgcattggt



gtataggagaattgtaaaaagtctttcaaatgatgatataaataaaatatccgga



gatatgaaggattcattaaaggaaatgtctctggaagaaatttattcttatgaaa



aatatggggaatttattacacaggaaggtatatctttttataatgatatatgtgg



taaagtaaattcatttatgaatttatattgccagaaaaataaagaaaacaaaaat



ctctataagctgcaaaagcttcataaacagatactgtgcatagcagatacttctt



atgaggtgccgtataaatttgaatcagatgaagaggtttatcaatcagtgaatgg



atttttggacaatattagttcgaaacatatcgttgaaagattgcgtaagattgga



gacaactataacggctacaatcttgataagatttatattgttagtaaattctatg



aatcagtttcacaaaagacatatagagattgggaaacaataaatactgcattaga



aattcattacaacaatatattacccggaaatggtaaatctaaagctgacaaggta



aaaaaagcggtaaagaatgatctgcaaaaaagcattactgaaatcaatgagcttg



ttagcaattataaattatgttcggatgataatattaaagctgagacatatataca



tgaaatatcacatattttgaataattttgaagcacaggagcttaagtataatcct



gaaattcatctggtggaaagtgaattgaaagcatctgaattaaaaaatgttctcg



atgtaataatgaatgcttttcattggtgttcggttttcatgacagaggagctggt



agataaagataataatttttatgccgagttagaagagatatatgacgaaatatat



ccggtaatttcattgtataatcttgtgcgtaattatgtaacgcagaagccatata



gtacaaaaaaaattaaattgaattttggtattcctacactagcggatggatggag



taaaagtaaagaatatagtaataatgcaattattctcatgcgtgataatttgtac



tatttaggaatatttaatgcaaaaaataagcctgacaaaaagataattgaaggta



atacatcagaaaataaaggggattataagaagatgatttataatcttctgccagg



accaaataaaatgatccccaaggtattcctctcttcaaaaaccggagtggaaaca



tataagccgtctgcctatatattggagggctataaacaaaacaagcatattaaat



cctctaaggattttgatataacattttgtcacgatttgattgattattttaagaa



ctgtatagcaatacatcctgaatggaagaattttggctttgatttttctgacacc



tccacatatgaagatatcagcggattttacagagaagtcgaattacaaggttata



aaatcgactggacatatatcagcgaaaaggatattgatttgttgcaggaaaaagg



acagttatatttattccaaatatataacaaagatttttccaagaaaagtaccgga



aatgataatcttcatactatgtatttgaagaatttgtttagtgaagagaatttaa



aggatattgtactgaaattaaacggtgaggcggaaatcttctttagaaaatcaag



cataaagaatccaataattcataaaaaaggctctattcttgttaatagaacatat



gaagcagaggaaaaagatcaatttggaaatatccagatagtcagaaaaaacatac



cggaaaatatatatcaggagctttataaatatttcaatgataaaagtgataaaga



actttcggatgaagcagctaagcttaagaatgtagtaggtcatcatgaggctgct



acaaacatagtaaaagattatagatatacatatgataaatattttcttcatatgc



ctattacaatcaattttaaagccaataagacaggctttattaatgacagaatatt



acaatatattgctaaagaaaaggatttgcatgtaataggcattgatcgtggtgaa



agaaacctgatatatgtttcagtaattgatacttgtggaaatattgttgaacaaa



aatcgtttaacattgttaatggatatgattatcagattaagctcaagcagcagga



gggggcgcgacaaatcgcacgaaaagaatggaaagaaatcggcaaaataaaagaa



attaaagaaggctatttatctcttgtaattcatgaaatttcaaagatggttatta



aatataatgccataattgcaatggaggatttaagctacggatttaaaaaaggtcg



tttcaaggttgagcgacaggtttaccagaagtttgagacaatgcttatcaacaaa



ctcaactatctggtatttaaagatatatccataacggaaaacggtggtcttctaa



agggataccagcttacatatattccagataaactgaaaaatgtgggtcatcaatg



tggctgtatattttatgtacctgctgcctatacatcaaaaatagatcctacaacc



ggatttgtaaatatattcaaatttaaagatttaacagttgatgcgaagagagaat



ttataaaaaaatttgacagtatcagatatgattcagaaaaaaatctgttttgttt



tacattcgattataataactttattacgcaaaatactgttatgtcaaagtcaagc



tggagtgtatatacgtacggagttaggataaaaagaagatttgtcaatggcaggt



tctcaaatgaatcggatacaattgatataacaaaagatatggaaaaaacactcga



aatgacagatataaattggagagatggtcatgatctgaggcaggatattattgat



tatgaaatcgtacaacacatatttgagatttttagattgactgtacaaatgagaa



acagtttaagtgaattagaagacagggattatgaccgtttgatttctccggtgct



caatgaaaataatatattttatgattcagctaaagcaggagatgcgttacctaaa



gacgcagatgctaatggtgcatattgtatagctctaaaaggcttgtatgaaatca



aacaaattacagagaattggaaagaagacggtaagttttcaagagataaacttaa



aatttccaataaggactggtttgactttattcaaaataaaaggtatttataa





SEQ ID NO: 28
atgacaaacaaatttacaaaccagtactcgctttccaaaacacttcgatttgagt



tgattccacaaggaaaaacattggaatttattcaagaaaaaggattgctctctca



agataaacaacgagcggagagttatcaagaaatgaaaaaaactattgataaattt



cataaatactttatcgatttagctttaagcaatgctaaactaactcatttagaaa



cttacttggaattatacaataaaagtgctgaaacaaaaaaagaacaaaaatttaa



agacgatttaaagaaagtacaagacaatttacgaaaagaaatcgttaaatctttt



tcagatggtgatgcaaaatcaatttttgcaattttggataaaaaagaactgatta



ccgtagaacttgaaaaatggtttgaaaacaacgaacaaaaagacatttattttga



cgaaaaattcaaaacgtttactacttattttactggttttcatcaaaacagaaaa



aacatgtattcggttgaacccaattctacagcaattgcttatcgattgattcatg



aaaatttacctaaatttttagaaaatgctaaagcatttgaaaaaataaaacaagt



agaaagtttgcaagttaattttagagaattaatgggggaatttggagatgaaggg



ctaattttcgtaaatgaattagaagaaatgtttcaaatcaattattataatgatg



tgctttcacaaaatggaattacaatttataatagtataatttcaggatttaccaa



aaatgatataaaatataaaggtctaaatgaatacataaataattacaatcaaacc



aaagacaaaaaagaccgtttgccaaaattaaaacaattgtataaacagattttga



gtgataggatttcactttcgtttttgcccgatgcttttacggatgggaaacaagt



tttgaaagccatatttgacttttataaaatcaacttactttcttataccattgaa



ggacaggaagaaagccaaaatcttttactattaattcgtcagacaattgaaaacc



tttctagttttgatacccaaaaaatttatctaaaaaatgatacccatttaaccac



tatttcacaacaagtatttggcgatttttcggtgttttcaactgctttaaattat



tggtatgaaactaaagtaaatccaaaatttgaaacggaatatagcaaagccaacg



aaaaaaaacgagaaattttagataaagccaaagcggtatttacaaaacaagatta



tttttcaattgcttttttacaagaagtactttcggaatacattcttaccttagat



cacacttctgatattgtaaaaaagcattcctccaactgtattgcggattatttta



aaaatcattttgtagccaaaaaagaaaatgaaaccgacaaaacctttgattttat



tgctaatattactgcaaaataccaatgtattcaaggtattttagaaaatgcagac



caatacgaagacgaactcaaacaagaccaaaaattaattgataatttgaaattct



ttttagatgctattttagaattgttgcattttattaaacctttgcatttaaaatc



agaaagcattaccgaaaaagacactgctttttatgatgtgtttgaaaattattac



gaagcattgagtttgttgaccccattatataatatggtgcgaaactatgtaacgc



aaaagccgtacagcaccgaaaaaataaaattaaattttgaaaatgcacaattatt



gaatggttgggatgccaataaagaaggtgattacctaactaccattttgaaaaaa



gacggtaattattttttagccataatggataaaaagcataacaaagcgtttcaaa



agtttccagaaggaaaagaaaattatgaaaaaatggtgtataaactattgcctgg



agtaaataagatgttgccaaaagtatttttttccaataaaaatattgcttacttc



aacccatcaaaagagttattagaaaactataaaaaagagacgcacaaaaaaggag



acacattcaatttagaacattgtcatacgttgatcgattttttcaaggactcttt



aaacaaacatgaagactggaaatactttgattttcaattttctgaaacaaaatcg



tatcaagatttgagtggtttttatagagaagtagaacatcaaggctacaaaatca



attttaaaaatatcgattcagaatatattgatggtttggtgaacgaaggtaaatt



gtttctatttcaaatttacagcaaagatttttcgcctttttccaaagggaaaccg



aacatgcacactttgtattggaaagccttatttgaagaacaaaatttgcaaaatg



taatctataaattgaatggacaagccgaaatattttttagaaaagcctctataaa



acctaaaaatataatattgcacaaaaagaaaattaaaattgccaaaaagcatttt



attgataaaaaaacaaaaacatctgaaattgttcctgttcaaacaataaaaaacc



tcaatatgtactaccaaggaaaaataagtgaaaaagaattaacacaagatgattt



aaggtatattgataattttagcattttcaatgaaaaaaataaaacaattgatatt



ataaaagacaaacgatttacggttgataaatttcagtttcatgtgccgattacca



tgaactttaaagcaacgggcggaagttatatcaatcaaaccgtattagaatattt



gcaaaacaatcccgaagttaagattattggattggatagaggcgaacgccatttg



gtatatctgacactgatagaccagcaaggaaacatcttgaaacaagaaagtttga



atacaatcaccgattctaaaatctcgacaccttatcataagttgttggataacaa



ggaaaacgagcgtgacttggctcgaaaaaattggggaacggtggaaaacatcaaa



gaactcaaagaaggctacatcagtcaagtggtgcataaaattgctacgttgatgc



tggaagaaaatgccattgtggtaatggaagatttgaattttggatttaaacgtgg



acgttttaaagtggaaaaacaaatttatcaaaagctggaaaaaatgttgattgac



aaattgaattatttggttttaaaagacaaacaacctcaggaattaggcggattgt



acaacgcattacaactcaccaataaatttgaaagtttccaaaaaatgggtaaaca



atcgggctttttttttatgtacccgcttggaacacctccaaaatagacccaacca



cagggtttgtcaattatttttataccaaatatgaaaatgttgacaaagccaaagc



cttttttgaaaaatttgaggcgattcgtttcaatgcagaaaagaagtattttgaa



tttgaagtaaaaaaatatagcgattttaacccaaaagccgaaggcactcaacaag



cctggaccatttgcacgtatggcgaacgaatagaaaccaaacgacaaaaagacca



aaacaacaaatttgtaagcactccaattaatctaaccgaaaagatagaagacttt



ttgggtaaaaaccaaattgtttatggtgatggtaattgcatcaaatctcaaattg



ctagcaaagacgacaaggctttttttgaaaccttattgtattggttcaaaatgac



tttacaaatgcgaaacagcgaaacaagaacagatatagattatctaatttcgccc



gtgatgaatgacaacggaacattttacaacagccgagattatgaaaaattagaaa



atccaactttgcccaaagatgccgatgccaacggagcgtatcatattgccaaaaa



aggattgatgcttttgaataaaatagaccaagccgacttgacaaaaaaagtggat



ttatctattagtaacagagattggttgcaatttgtacaaaaaaataaataa





SEQ ID NO: 29
atggaacaggagtactatttaggactggatatgggaaccggatctgtaggatggg



ctgttacagattcggaatatcatgtcttgcgtaaacatggaaaagcactatgggg



agtccgattatttgaaagtgcatcgacagcagaagaacgaagaatgttccgaaca



tcaagaagaagactagatcgaagaaactggagaattgaaattttacaggaaattt



ttgcagaggaaataagtaagaaagatccaggatttttcttgcgaatgaaagaaag



caaatattatccagaagataagcgagatatcaatggaaattgtccggaactgcca



tatgcattatttgttgatgacgattttacagataaagattatcataaaaaatttc



cgacaatttatcatctcaggaaaatgttgatgaatacagaggagacaccggatat



ccggttggtgtatctggcaattcatcatatgatgaagcataggggccatttcttg



ttatctggtgacattaatgagattaaggagttcggaacgacattttcaaaattgt



tggagaatatcaaaaatgaggaattggattggaatcttgaactgggaaaagaaga



atatgctgttgtagaaagtattttaaaagataacatgttaaaccgatccacaaag



aaaaccagattaataaaagcattaaaagcaaaatcaatatgtgaaaaggctgtac



tgaatttattggctggtggaacggtgaaattgagtgatatatttggtcttgaaga



attaaatgagacagaaagaccgaagatttcctttgctgataatggatacgatgat



tatatcggagaagttgaaaatgagctgggagaacaattctatattatagagacgg



caaaagcagtgtatgactgggcggtattagttgaaatattgggaaaatatacgtc



aatttcagaagcgaaagtagcaacgtatgaaaaacataaatcggatttacaattt



ttgaaaaagatagttcggaaatatctgacaaaggaggaatataaagatatttttg



taagtacgagtgacaaattgaaaaattactctgcttatataggaatgacgaaaat



aaatggaaaaaaggttgatttgcagagcaaacggtgcagtaaagaagaattctat



gattttattaagaaaaacgtacttaaaaagctagaaggacaacctgaatatgaat



atttgaaagaagagctagaaagagaaacatttctaccaaaacaggtgaacaggga



taatggtgtaataccgtatcagattcatttgtacgagttgaaaaagatattagga



aatttacgggataaaatagacctcattaaagagaacgaagataaactggttcaat



tatttgaattcagaattccgtattatgttggtccgctgaataagatagatgacgg



aaaagagggaaaatttacatgggctgtacggaaaagtaatgaaaagatatatcca



tggaattttgaaaatgtagttgatatagaagcaagtgcagaaaaatttatccgga



gaatgacaaataagtgtacatatctgatgggcgaagatgtattgccgaaggattc



attgctttacagtaaatatatggttttaaatgaattaaataatgtaaagttggat



ggcgaaaaattatctgtagaattgaaacaacggttgtatacagatgtattttgta



agtatcggaaagtaactgtaaagaagataaaaaattacttgaaatgtgaaggtat



catatccggcaatgtcgaaataactggaattgatggtgattttaaggcatcgtta



acggcatatcatgattttaaagaaatcttgacaggaacagaattggctaaaaagg



acaaagaaaatattattaccaatatagtattgtttggagatgataaaaagctgct



gaaaaagagactgaatcgattatatcctcagattacgccgaatcagttgaagaaa



atatgtgcgctatcctatacaggctggggaagattttctaaaaagttcttagaag



aaataacagctccagatccggaaacgggagaggtatggaatatcattacggcatt



gtgggaatcgaataataatctgatgcaattattaagtaatgaatatcggtttatg



gaagaagtcgaaacatacaatatgggaaaacagactaaaacattgtcgtacgaaa



cagtagagaatatgtatgtttctccatctgtgaaaagacagatatggcagacgct



gaaaatcgtgaaagaattagaaaaagtaatgaaagaatctccgaaacgtgtattt



attgagatggcgagagaaaagcaagaaagtaagagaaccgaatcgcgtaaaaaac



aactaatagatttgtataaggcttgtaaaaatgaagaaaaagattgggtaaaaga



actgggagatcaggaagaacagaaattacgaagcgataagttgtacctatattat



acgcaaaagggtcgttgtatgtattctggcgaggtaatagaactgaaagacttat



gggataatacaaaatatgatattgatcatatatatccacaatctaaaacgatgga



tgacagtcttaataatcgcgtattggtaaaaaagaaatataatgcaacaaaatca



gataagtatccattaaatgaaaatatacgacatgagagaaaaggcttttggaagt



cactgttagatggagggtttataagtaaagaaaaatatgaacgcttaataagaaa



tacagaattgagtccggaagaattagcaggatttattgaaaggcagattgttgaa



acgaggcagagtacaaaagctgtagcggaaatattaaagcaagtgtttccggaaa



gtgaaattgtatatgtcaaagcaggtacggtttcaagattcagaaaagattttga



attactgaaagttcgagaagtgaatgatttgcatcacgcaaaggatgcgtattta



aatattgtagttggtaatagttattatgtgaaatttactaagaatgcatcatggt



ttataaaagaaaatccgggacgtacttacaacttaaaaaagatgtttacatcagg



ttggaatattgaacgaaatggagaagttgcatgggaagtcgggaaaaaaggaaca



attgtaacggtaaaacaaataatgaataaaaataatatattggtgacaagacagg



ttcatgaagcgaaaggtgggctgtttgatcagcagattatgaaaaaaggaaaagg



tcagattgctataaaggaaactgatgaacgtcttgcatcaatagaaaagtatgga



ggctataataaagctgccggggcatattttatgctggtagaatctaaagataaaa



aaggaaaaacaattcgaacgatagaatttataccattatatttaaagaataaaat



cgagtcggatgaatcaatagcattgaactttttagaaaaaggcagaggtttgaaa



gaaccaaagatactattgaaaaaaattaagattgatacattatttgatgtggacg



gattcaaaatgtggttgtctggaagaacaggggacagactactatttaaatgtgc



aaatcaattgattttggatgagaaaataattgtaacaatgaaaaaaattgtaaag



tttattcaaaggagacaagaaaatagagaattaaaattatctgataaagatggaa



ttgataatgaagtacttatggaaatatataacacttttgtggataagttagaaaa



cacagtgtatagaatacgattatccgaacaggcaaaaacgcttatagataaacaa



aaagaatttgaaaggttatcactagaggataaaagtagtactttgtttgaaattt



tacatatttttcagtgtcaaagtagtgcggccaatttaaaaatgataggcggacc



tggaaaagcaggaatattagttatgaataataatataagtaagtgtaacaaaatt



tctattataaatcagtctccaacaggaattttcgaaaatgagattgatttgttaa



agat





SEQ ID NO: 30
ATGAAATCTTTCGATTCATTCACAAATCTTTATTCTCTTTCAAAAACCTTGAAAT



TTGAGATGAGACCTGTCGGAAATACCCAAAAAATGCTCGACAATGCAGGAGTATT



TGAAAAAGACAAACTAATTCAAAAAAAGTACGGAAAAACAAAGCCGTATTTCGAC



AGACTCCACAGAGAATTTATAGAAGAAGCGCTCACGGGGGTAGAGCTAATAGGAC



TAGATGAGAACTTTAGGACACTTGTTGACTGGCAAAAAGATAAGAAAAATAATGT



CGCAATGAAAGCGTATGAAAATAGTTTGCAGCGGCTGAGAACGGAAATAGGTAAA



ATATTTAACCTAAAGGCTGAGGATTGGGTAAAGAACAAATATCCAATATTAGGGC



TGAAAAATAAAAATACCGATATTTTATTCGAAGAGGCTGTATTCGGGATATTGAA



AGCCCGATATGGAGAAGAAAAAGATACTTTTATAGAAGTAGAGGAAATAGATAAA



ACCGGCAAATCAAAGATCAATCAAATATCAATTTTCGATAGTTGGAAAGGATTTA



CAGGATATTTCAAAAAATTTTTTGAAACCAGAAAGAATTTTTACAAAAACGACGG



AACTTCTACAGCAATTGCTACAAGGATCATTGATCAAAATCTGAAAAGATTCATA



GATAATCTGTCAATAGTTGAAAGTGTGAGACAAAAGGTTGATCTCGCCGAGACAG



AAAAATCTTTCAGCATATCTCTATCGCAATTCTTCTCAATAGACTTTTATAACAA



GTGTCTCCTTCAAGATGGTATTGATTACTACAACAAGATAATCGGTGGAGAAACT



CTCAAAAATGGCGAAAAACTAATAGGTCTCAATGAACTAATAAATCAATATAGGC



AGAATAATAAGGATCAGAAAATCCCATTTTTCAAACTTCTTGATAAACAAATTTT



GAGTGAAAAGATATTATTTTTGGATGAAATAAAAAATGACACAGAACTGATCGAG



GCGCTGAGTCAGTTCGCAAAAACAGCCGAAGAAAAAACAAAAATTGTCAAAAAGC



TTTTTGCCGATTTTGTAGAAAATAATTCCAAATACGATCTTGCACAGATTTATAT



TTCCCAAGAAGCATTCAATACTATATCAAACAAGTGGACAAGCGAAACTGAGACG



TTCGCTAAATATCTATTCGAAGCAATGAAGAGTGGAAAACTTGCAAAGTATGAGA



AAAAAGATAATAGCTATAAATTTCCTGATTTTATTGCCCTTTCACAGATGAAGAG



TGCTTTATTAAGTATCAGCCTTGAGGGACATTTTTGGAAAGAGAAATACTACAAA



ATTTCAAAATTCCAAGAGAAGACCAATTGGGAGCAGTTTCTTGCAATTTTTCTAT



ACGAGTTTAACTCTCTTTTCAGCGACAAAATAAATACAAAAGATGGAGAAACAAA



GCAAGTTGGATACTATCTATTTGCCAAAGACCTGCATAATCTTATCTTAAGTGAG



CAGATTGATATTCCAAAAGATTCAAAAGTCACAATAAAAGATTTTGCCGATTCTG



TACTCACAATCTACCAAATGGCAAAATATTTTGCGGTAGAAAAAAAACGAGCGTG



GCTTGCCGAGTATGAACTAGATTCATTTTATACCCAGCCAGACACAGGCTATTTA



CAGTTTTATGATAACGCCTACGAGGATATTGTGCAGGTATACAACAAGCTTCGAA



ACTATCTGACCAAAAAGCCATATAGCGAGGAGAAATGGAAGTTGAATTTTGAAAA



TTCTACGCTGGCAAATGGATGGGATAAGAACAAAGAATCTGATAATTCAGCAGTT



ATTCTACAAAAAGGTGGAAAATATTATTTGGGACTGATTACTAAAGGACACAACA



AAATTTTTGATGACCGTTTTCAAGAAAAATTTATTGTGGGAATTGAAGGTGGAAA



ATATGAAAAAATAGTCTATAAATTTTTCCCCGACCAGGCAAAAATGTTTCCCAAA



GTGTGCTTTTCTGCAAAAGGACTCGAATTTTTTAGACCGTCTGAAGAAATTTTAA



GAATTTATAACAATGCAGAGTTTAAAAAAGGAGAAACTTATTCAATAGATAGTAT



GCAGAAGTTGATTGATTTTTATAAAGATTGCTTGACTAAATATGAAGGCTGGGCA



TGTTATACCTTTCGGCATCTAAAACCCACAGAAGAATACCAAAACAATATTGGAG



AGTTTTTTCGAGATGTTGCAGAGGACGGATACAGGATTGATTTTCAAGGCATTTC



AGATCAATATATTCATGAAAAAAACGAGAAAGGCGAACTTCACCTTTTTGAAATC



CACAATAAAGATTGGAATTTGGATAAGGCACGAGACGGAAAGTCAAAAACAACAC



AAAAAAACCTTCATACACTCTATTTCGAATCGCTCTTTTCAAACGATAATGTTGT



TCAAAACTTTCCAATAAAACTCAATGGTCAAGCTGAAATTTTTTATAGACCGAAA



ACGGAAAAAGACAAATTAGAATCAAAAAAAGATAAGAAAGGGAATAAAGTGATTG



ACCATAAACGCTATAGTGAGAATAAGATTTTTTTTCATGTTCCTCTCACACTAAA



CCGCACTAAAAATGACTCATATCGCTTTAATGCTCAAATCAACAACTTTCTCGCA



AATAATAAAGATATCAACATCATCGGTGTAGATAGGGGAGAAAAGCATTTAGTCT



ATTATTCGGTGATTACACAAGCTAGTGACATCTTAGAAAGTGGCTCACTAAATGA



GCTAAATGGCGTGAATTATGCTGAAAAACTGGGAAAAAAGGCAGAAAATCGAGAA



CAAGCACGCAGAGACTGGCAAGACGTACAAGGGATCAAAGACCTCAAGAAAGGAT



ATATTTCACAGGTGGTGCGAAAGCTTGCTGATTTAGCAATTAAACACAATGCCAT



TATCATTCTTGAAGATTTGAATATGAGATTTAAACAAGTTCGGGGCGGTATCGAA



AAATCCATTTATCAACAGTTAGAAAAAGCACTGATAGATAAATTAAGCTTTCTTG



TAGACAAAGGTGAAAAAAATCCCGAGCAAGCAGGACATCTTCTGAAAGCATATCA



GCTTTCGGCCCCATTTGAGACATTTCAAAAAATGGGCAAACAGACGGGTATAATC



TTTTATACACAAGCTTCGTATACCTCAAAAAGTGACCCTGTAACAGGTTGGCGAC



CACACCTGTATCTCAAATATTTCAGTGCCAAAAAAGCAAAAGACGATATTGCAAA



GTTTACAAAAATAGAATTTGTAAACGATAGGTTTGAGCTTACCTATGATATAAAG



GACTTTCAGCAAGCAAAAGAATATCCAAATAAAACTGTTTGGAAAGTTTGCTCAA



ATGTAGAGAGATTCAGGTGGGACAAAAACCTCAATCAAAACAAAGGCGGATATAC



TCACTACACAAATATAACTGAGAATATCCAAGAGCTTTTTACAAAATATGGAATT



GATATCACAAAAGATTTGCTCACACAGATTTCTACAATTGATGAAAAACAAAATA



CCTCATTTTTTAGAGATTTTATTTTTTATTTCAACCTTATTTGCCAAATCAGAAA



TACCGATGATTCTGAGATTGCTAAAAAGAATGGGAAAGATGATTTTATACTGTCA



CCTGTTGAGCCGTTTTTCGATAGCCGAAAAGACAATGGAAATAAACTTCCTGAGA



ATGGAGATGATAACGGCGCGTATAACATAGCAAGAAAAGGGATTGTCATACTCAA



CAAAATCTCACAATATTCAGAGAAAAACGAAAATTGCGAGAAAATGAAATGGGGG



GATTTGTATGTATCAAACATTGACTGGGACAATTTTGTAACCCAAGCTAATGCAC



GGCATTAA





SEQ ID NO: 31
ATGATTATCTTATATATTAGTACCTCGAATATGAACATGGAAGGAGTATTTATGG



AAAATTTTAAAAACTTGTATCCAATAAACAAAACACTTCGATTTGAATTAAGACC



CTATGGAAAAACATTGGAAAATTTTAAAAAATCCGGACTTTTAGAAAAAGATGCC



TTTAAGGCAAATAGTAGACGAAGTATGCAAGCTATAATCGATGAAAAATTCAAAG



AGACTATCGAAGAACGCTTAAAGTACACTGAATTCAGTGAATGTGATCTTGGAAA



CATGACATCAAAAGATAAAAAAATAACTGATAAAGCAGCTACAAATTTAAAAAAG



CAAGTTATCTTATCTTTTGACGATGAAATATTTAATAATTACCTAAAACCTGATA



AAAATATTGACGCATTATTTAAAAATGATCCTTCAAATCCTGTAATCTCTACATT



TAAAGGTTTTACGACATATTTTGTGAATTTTTTTGAAATTCGAAAACATATTTTC



AAGGGAGAATCATCAGGCTCAATGGCATACCGAATTATAGATGAAAACCTGACAA



CATACTTGAATAATATTGAAAAAATAAAAAAACTGCCAGAAGAATTAAAATCACA



GCTAGAAGGCATTGATCAGATTGATAAACTTAATAATTATAATGAGTTCATTACA



CAGTCAGGTATAACACACTATAATGAAATCATCGGCGGTATATCAAAATCAGAGA



ATGTCAAAATACAGGGAATTAATGAAGGAATTAATCTATACTGTCAGAAGAACAA



AGTTAAACTTCCTCGACTGACTCCGCTATACAAAATGATATTATCAGACAGAGTT



TCCAACTCTTTTGTATTAGACACTATTGAAAATGACACAGAATTAATTGAAATGA



TAAGTGATTTGATTAATAAGACTGAGATTTCGCAAGATGTTATAATGTCAGATAT



TCAAAATATTTTCATAAAATACAAACAACTTGGTAATTTGCCGGGTATCTCATAT



TCTTCAATAGTTAATGCTATTTGCTCGGATTATGACAACAATTTCGGAGATGGGA



AGCGAAAAAAATCTTACGAAAATGATCGCAAAAAGCATTTGGAGACTAATGTATA



CTCCATAAATTATATTTCTGAATTGCTTACAGATACCGATGTTTCATCAAATATC



AAGATGAGATATAAAGAGCTTGAGCAAAATTATCAGGTTTGCAAAGAAAATTTTA



ATGCCACAAACTGGATGAATATTAAAAATATAAAACAATCTGAAAAAACAAACCT



TATTAAAGATTTGTTAGATATACTTAAATCGATTCAACGTTTCTATGATTTGTTT



GATATTGTTGACGAAGATAAAAATCCAAGTGCTGAATTTTATACCTGGTTATCAA



AAAATGCTGAAAAGCTTGACTTTGAATTCAATTCTGTATATAACAAGTCACGAAA



CTATCTCACCAGGAAACAATACTCTGATAAAAAAATCAAGCTGAATTTTGATTCT



CCAACATTGGCCAAAGGGTGGGATGCTAACAAAGAAATAGATAACTCCACGATTA



TAATGCGTAAATTTAATAATGACAGAGGCGATTATGATTACTTCCTTGGCATATG



GAATAAATCCACACCTGCAAATGAAAAAATAATCCCACTGGAGGATAATGGATTA



TTCGAAAAAATGCAATATAAGCTGTATCCAGATCCTAGTAAGATGTTACCGAAAC



AATTTCTATCAAAAATATGGAAGGCAAAGCATCCTACGACACCTGAATTTGATAA



AAAATATAAAGAGGGAAGACATAAAAAAGGTCCTGATTTCGAAAAAGAATTCCTG



CATGAATTGATTGATTGCTTCAAACATGGTCTTGTTAATCACGATGAAAAATATC



AGGATGTTTTTGGCTTCAATCTCCGTAACACTGAAGATTATAATTCATATACAGA



GTTTCTCGAAGATGTGGAAAGATGCAATTACAATCTTTCATTTAACAAAATTGCT



GATACTTCAAACCTTATTAATGATGGGAAATTGTATGTATTTCAGATATGGTCAA



AAGACTTTTCTATTGATTCAAAAGGTACTAAAAACTTGAATACAATCTATTTTGA



ATCACTATTTTCAGAAGAAAACATGATAGAAAAAATGTTCAAGCTTTCTGGAGAG



GCTGAGATATTCTATCGACCAGCATCGTTGAATTATTGTGAAGATATCATAAAAA



AAGGTCATCACCATGCAGAATTAAAAGATAAGTTTGACTATCCTATAATAAAAGA



TAAGCGATATTCACAAGATAAGTTTTTCTTTCATGTGCCAATGGTTATAAATTAT



AAATCTGAGAAACTGAATTCCAAAAGCCTTAACAACCGAACAAATGAAAACCTGG



GACAGTTTACACATATTATAGGTATAGACAGGGGCGAGCGGCACTTGATTTATTT



AACTGTTGTTGATGTTTCCACTGGTGAAATCGTTGAACAGAAACATCTGGACGAA



ATTATCAATACTGATACCAAGGGAGTTGAACACAAAACCCATTATTTGAATAAAT



TGGAAGAAAAATCTAAAACAAGAGATAACGAGCGTAAATCATGGGAAGCTATTGA



AACTATCAAAGAATTAAAAGAAGGCTATATTTCTCATGTAATTAATGAAATACAA



AAGCTGCAAGAAAAATATAATGCCTTAATCGTAATGGAAAATCTTAACTATGGGT



TCAAAAACTCACGAATCAAAGTTGAAAAACAGGTTTATCAAAAATTCGAGACAGC



ATTGATTAAAAAGTTCAATTATATTATTGATAAAAAAGATCCAGAAACCTATATA



CATGGTTACCAGCTTACAAATCCTATTACCACTCTGGATAAGATTGGAAATCAAT



CTGGAATAGTGCTGTATATTCCTGCGTGGAATACTTCTAAGATAGATCCCGTCAC



AGGATTTGTAAACCTTCTGTACGCAGATGATTTGAAGTATAAAAATCAGGAGCAG



GCCAAATCATTCATTCAGAAAATAGACAACATATATTTTGAAAATGGAGAGTTTA



AATTTGATATTGATTTTTCCAAATGGAATAATCGCTACTCAATAAGTAAAACTAA



ATGGACGTTAACAAGTTATGGGACTCGCATCCAGACATTTAGAAATCCCCAGAAA



AACAATAAGTGGGATTCTGCTGAATATGATTTGACAGAAGAGTTTAAATTAATTT



TAAATATAGACGGAACGTTAAAGTCACAGGACGTAGAAACATACAAAAAATTCAT



GTCTTTATTTAAACTAATGCTACAGCTTCGAAACTCTGTTACAGGAACCGACATT



GATTATATGATCTCTCCTGTCACTGATAAAACAGGAACACATTTCGATTCAAGAG



AAAATATTAAAAATCTTCCTGCCGATGCAGATGCCAATGGTGCCTACAACATTGC



GCGCAAAGGAATAATGGCTATTGAAAATATAATGAACGGTATAAGCGATCCACTA



AAAATAAGCAACGAAGACTATTTAAAGTATATTCAGAATCAACAGGAATAA





SEQ ID NO: 32
ATGACCCAATTTGAAGGTTTTACCAATTTATACCAAGTTTCGAAGACCCTTCGTT



TTGAACTGATTCCCCAAGGAAAAACACTCAAACATATCCAGGAGCAAGGGTTCAT



TGAGGAGGATAAAGCTCGCAATGACCATTACAAAGAGTTAAAACCAATCATTGAC



CGCATCTATAAGACTTATGCTGATCAATGTCTCCAACTGGTACAGCTTGACTGGG



AGAATCTATCTGCAGCCATAGACTCCTATCGTAAGGAAAAAACCGAAGAAACACG



AAATGCGCTGATTGAGGAGCAAGCAACATATAGAAATGCGATTCATGACTACTTT



ATAGGTCGGACGGATAATCTGACAGATGCCATAAATAAGCGCCATGCTGAAATCT



ATAAAGGACTTTTTAAAGCTGAACTTTTCAATGGAAAAGTTTTAAAGCAATTAGG



GACCGTAACCACGACAGAACATGAAAATGCTCTACTCCGTTCGTTTGACAAATTT



ACGACCTATTTTTCCGGCTTTTATGAAAACCGAAAAAATGTCTTTAGCGCTGAAG



ATATCAGCACGGCAATTCCCCATCGAATCGTCCAGGACAATTTCCCTAAATTTAA



GGAAAACTGCCATATTTTTACAAGATTGATAACCGCAGTTCCTTCTTTGCGGGAG



CATTTTGAAAATGTCAAAAAGGCCATTGGAATCTTTGTTAGTACGTCTATTGAAG



AAGTCTTTTCCTTTCCCTTTTATAATCAACTTCTAACCCAAACGCAAATTGATCT



TTATAATCAACTTCTCGGCGGCATATCTAGGGAAGCAGGCACAGAAAAAATCAAG



GGACTTAATGAAGTTCTCAATCTGGCTATCCAAAAAAATGATGAAACAGCCCATA



TAATCGCGTCCCTGCCGCATCGTTTTATTCCTCTTTTTAAACAAATTCTTTCCGA



TCGAAATACGTTATCCTTTATTTTGGAAGAATTCAAAAGCGATGAGGAAGTCATC



CAATCCTTCTGCAAATATAAAACCCTCTTGAGAAACGAAAATGTACTGGAGACTG



CAGAAGCCCTTTTCAATGAATTAAATTCCATTGATTTGACTCATATCTTTATTTC



CCATAAAAAGTTAGAAACCATCTCTTCAGCGCTTTGTGACCATTGGGATACCTTG



CGCAATGCACTTTACGAAAGACGGATTTCTGAACTCACTGGCAAAATAACAAAAA



GTGCCAAAGAAAAAGTTCAAAGGTCATTAAAACATGAGGATATAAATCTCCAAGA



AATTATTTCTGCTGCAGGAAAAGAACTATCAGAAGCATTCAAACAAAAAACAAGT



GAAATTCTTTCCCATGCCCATGCTGCACTTGACCAGCCTCTTCCCACAACATTAA



AAAAACAGGAAGAAAAAGAAATCCTCAAATCACAGCTCGATTCGCTTTTAGGCCT



TTATCATCTTCTTGATTGGTTTGCTGTCGATGAAAGCAATGAAGTCGACCCAGAA



TTCTCAGCACGGCTGACAGGCATTAAACTAGAAATGGAACCAAGCCTTTCGTTTT



ATAATAAAGCAAGAAATTATGCGACAAAAAAGCCCTATTCGGTGGAAAAATTTAA



ATTGAATTTTCAAATGCCAACCCTTGCCTCTGGTTGGGATGTCAATAAAGAAAAA



AATAATGGAGCTATTTTATTCGTAAAAAATGGTCTCTATTACCTTGGTATCATGC



CTAAACAGAAGGGGCGCTATAAAGCCCTGTCTTTTGAGCCGACAGAAAAAACATC



AGAAGGATTCGATAAGATGTACTATGACTACTTCCCAGATGCCGCAAAAATGATT



CCTAAGTGTTCCACTCAGCTAAAGGCTGTAACCGCTCATTTTCAAACTCATACCA



CCCCCATTCTTCTCTCAAATAATTTCATTGAACCTCTTGAAATCACAAAAGAAAT



TTATGACCTGAACAATCCTGAAAAGGAGCCTAAAAAGTTTCAAACGGCTTATGCA



AAGAAGACAGGCGATCAAAAAGGCTATAGAGAAGCGCTTTGCAAATGGATTGACT



TTACGCGGGATTTTCTCTCTAAATATACGAAAACAACTTCAATCGATTTATCTTC



ACTCCGCCCTTCTTCGCAATATAAAGATTTAGGGGAATATTACGCCGAACTGAAT



CCGCTTCTCTATCATATCTCCTTCCAACGAATTGCTGAAAAGGAAATCATGGATG



CTGTAGAAACGGGAAAATTGTATCTGTTCCAAATCTACAATAAGGATTTTGCGAA



GGGCCATCACGGGAAACCAAATCTCCACACCCTGTATTGGACAGGTCTCTTCAGT



CCTGAAAACCTTGCGAAAACCAGCATCAAACTTAATGGTCAAGCAGAATTGTTCT



ATCGACCTAAAAGCCGCATGAAGCGGATGGCCCATCGTCTTGGGGAAAAAATGCT



GAACAAAAAACTAAAGGACCAGAAGACACCGATTCCAGATACCCTCTACCAAGAA



CTGTACGATTATGTCAACCACCGGCTAAGCCATGATCTTTCCGATGAAGCAAGGG



CCCTGCTTCCAAATGTTATCACCAAAGAAGTCTCCCATGAAATTATAAAGGATCG



GCGGTTTACTTCCGATAAATTTTTCTTCCATGTTCCCATTACACTGAATTATCAA



GCAGCCAATAGTCCCAGTAAATTCAACCAGCGTGTCAATGCCTACCTTAAGGAGC



ATCCGGAAACGCCCATCATTGGTATCGATCGTGGAGAACGCAATCTAATCTATAT



TACCGTCATTGACAGTACTGGGAAAATTTTGGAGCAGCGTTCCCTGAATACCATC



CAGCAATTTGACTACCAAAAAAAATTGGACAACAGGGAAAAAGAGCGTGTTGCCG



CCCGTCAAGCCTGGTCCGTCGTCGGAACGATCAAAGACCTTAAACAAGGCTACTT



GTCACAGGTCATCCATGAAATTGTAGACCTGATGATTCATTACCAAGCTGTTGTC



GTCCTTGAAAACCTCAACTTCGGATTTAAATCAAAACGGACAGGCATTGCCGAAA



AAGCAGTCTACCAACAATTTGAAAAGATGCTAATAGATAAACTCAACTGTTTGGT



TCTCAAAGATTATCCTGCTGAGAAAGTGGGAGGCGTCTTAAACCCGTATCAACTT



ACAGATCAGTTCACGAGCTTTGCAAAAATGGGCACGCAAAGCGGCTTCCTTTTCT



ATGTACCGGCCCCTTATACCTCAAAGATTGATCCCCTGACTGGTTTTGTCGATCC



CTTTGTATGGAAGACCATTAAAAATCATGAAAGTCGGAAGCATTTCCTAGAAGGA



TTTGATTTCCTGCATTATGATGTCAAAACAGGTGATTTTATCCTCCATTTTAAAA



TGAATCGGAATCTCTCTTTCCAGAGAGGGCTTCCTGGCTTCATGCCAGCTTGGGA



TATTGTTTTCGAAAAGAATGAAACCCAATTTGATGCAAAAGGGACGCCCTTCATT



GCAGGAAAACGAATTGTTCCTGTAATCGAAAATCATCGTTTTACGGGTCGTTACA



GAGACCTCTATCCCGCTAATGAACTCATTGCCCTTCTGGAAGAAAAAGGCATTGT



CTTTAGAGACGGAAGTAATATATTACCCAAACTTTTAGAAAATGATGATTCTCAT



GCAATTGATACGATGGTCGCCTTGATTCGCAGTGTACTCCAAATGAGAAACAGCA



ATGCCGCAACGGGGGAAGACTACATCAACTCTCCCGTTAGGGATCTGAACGGGGT



GTGTTTCGACAGTCGATTCCAAAATCCAGAATGGCCAATGGATGCGGATGCCAAC



GGAGCTTATCATATTGCCTTAAAAGGGCAGCTTCTTCTGAACCACCTCAAAGAAA



GCAAAGATCTGAAATTACAAAACGGCATCAGCAACCAAGATTGGCTGGCCTACAT



TCAGGAACTGAGAAACTGA





SEQ ID NO: 33
ATGGCCGTCAAATCCATCAAAGTGAAACTTCGTCTCGACGATATGCCGGAGATTC



GGGCCGGTCTATGGAAACTTCATAAGGAAGTCAATGCGGGGGTTCGATATTACAC



GGAATGGCTCAGTCTTCTCCGTCAAGAGAACTTGTATCGAAGAAGTCCGAATGGG



GACGGAGAGCAAGAATGTGATAAGACTGCAGAAGAATGCAAAGCCGAATTGTTGG



AGCGGCTGCGCGCGCGTCAAGTGGAGAATGGACACCGTGGTCCGGCGGGATCGGA



CGATGAATTGCTGCAGTTGGCGCGTCAACTCTATGAGTTGTTGGTTCCGCAGGCG



ATAGGTGCGAAAGGCGACGCGCAGCAAATTGCCCGCAAATTTTTGAGCCCCTTGG



CCGACAAGGACGCAGTTGGTGGGCTTGGAATCGCGAAGGCGGGGAACAAACCGCG



GTGGGTTCGCATGCGCGAAGCGGGGGAACCAGGCTGGGAAGAGGAGAAGGAGAAG



GCTGAGACGAGGAAATCTGCGGATCGGACTGCGGATGTTTTGCGCGCGCTCGCGG



ATTTTGGGTTAAAGCCACTGATGCGCGTATACACCGATTCTGAGATGTCATCGGT



GGAGTGGAAACCGCTTCGGAAGGGACAAGCCGTTCGGACGTGGGATAGGGACATG



TTCCAACAAGCTATCGAACGGATGATGTCGTGGGAGTCGTGGAATCAGCGCGTTG



GGCAAGAGTACGCGAAACTCGTAGAACAAAAAAATCGATTTGAGCAGAAGAATTT



CGTCGGCCAGGAACATCTGGTCCATCTCGTCAATCAGTTGCAACAAGATATGAAA



GAAGCATCGCCCGGACTCGAATCGAAAGAGCAAACCGCGCACTATGTGACGGGAC



GGGCATTGCGCGGATCGGACAAGGTATTTGAGAAGTGGGGGAAACTCGCCCCCGA



TGCACCTTTCGATTTGTACGACGCCGAAATCAAGAATGTGCAGAGACGTAACACG



AGACGATTCGGATCACATGACTTGTTCGCAAAATTGGCAGAGCCAGAGTATCAGG



CCCTGTGGCGCGAAGATGCTTCGTTTCTCACGCGTTACGCGGTGTACAACAGCAT



CCTTCGCAAACTGAATCACGCCAAAATGTTCGCGACGTTTACTTTGCCGGATGCA



ACGGCGCACCCGATTTGGACTCGCTTCGATAAATTGGGTGGGAATTTGCACCAGT



ACACCTTTTTGTTCAACGAATTTGGAGAACGCAGGCACGCGATTCGTTTTCACAA



GCTATTGAAAGTCGAGAATGGTGTCGCAAGAGAAGTTGATGATGTCACCGTGCCC



ATTTCAATGTCAGAGCAATTGGATAATCTGCTTCCCAGAGATCCCAATGAACCGA



TTGCGCTATATTTTCGAGATTACGGAGCCGAACAGCATTTCACAGGTGAATTTGG



TGGCGCGAAGATCCAGTGCCGCCGGGATCAGCTGGCTCATATGCACCGACGCAGA



GGGGCGAGGGATGTTTATCTCAATGTCAGCGTACGTGTGCAGAGTCAGTCTGAGG



CGCGGGGAGAACGTCGCCCGCCGTATGCGGCAGTATTTCGTCTGGTCGGGGACAA



CCATCGCGCGTTTGTCCATTTCGATAAACTATCGGATTATCTTGCGGAACATCCG



GATGATGGGAAGCTCGGGTCGGAGGGGTTGCTTTCCGGGCTGCGGGTGATGAGTG



TCGATCTCGGCCTTCGCACATCTGCATCGATTTCCGTTTTTCGCGTTGCCCGGAA



GGACGAGTTGAAGCCGAACTCAAAAGGTCGTGTACCGTTTTTCTTTCCGATAAAA



GGGAATGACAATCTCGTCGCGGTTCATGAGCGATCACAACTCTTGAAGCTGCCTG



GCGAAACGGAGTCGAAGGACCTGCGTGCTATCCGAGAAGAACGCCAACGGACATT



GCGGCAGTTGCGGACGCAACTGGCGTATTTGCGGCTGCTCGTGCGGTGTGGGTCG



GAAGATGTGGGGCGGCGTGAACGGAGTTGGGCAAAGCTTATCGAGCAGCCGGTGG



ATGCGGCCAATCACATGACACCGGATTGGCGCGAGGCTTTTGAAAACGAACTTCA



GAAGCTTAAGTCACTCCATGGTATCTGTAGCGACAAGGAATGGATGGATGCTGTC



TACGAGAGCGTTCGCCGCGTGTGGCGTCACATGGGCAAACAGGTTCGCGATTGGC



GAAAGGACGTACGAAGCGGAGAGCGGCCCAAGATTCGCGGCTATGCGAAAGACGT



GGTCGGTGGAAACTCGATTGAGCAAATCGAGTATCTGGAACGTCAGTACAAGTTC



CTCAAGAGTTGGAGCTTCTTTGGTAAGGTGTCGGGACAAGTGATTCGTGCGGAGA



AGGGATCTCGTTTTGCGATCACGCTGCGCGAACACATTGATCACGCGAAGGAAGA



TCGGCTGAAGAAATTGGCGGATCGCATCATTATGGAGGCTCTCGGCTATGTGTAC



GCGTTGGATGAGCGCGGCAAAGGAAAGTGGGTTGCGAAGTATCCGCCGTGCCAGC



TCATCCTGCTGGAGGAATTGAGCGAGTACCAGTTCAATAACGACAGGCCTCCGAG



CGAAAACAACCAGTTGATGCAATGGAGTCATCGCGGCGTGTTCCAGGAGTTGATA



AATCAGGCCCAAGTCCATGATTTACTCGTTGGGACGATGTATGCAGCGTTCTCGT



CGCGATTCGACGCGCGAACTGGGGCACCGGGTATCCGCTGTCGCCGGGTTCCGGC



GCGTTGCACCCAGGAGCACAATCCAGAACCATTTCCTTGGTGGCTGAACAAGTTT



GTGGTGGAACATACGTTGGATGCTTGTCCCCTACGCGCAGACGACCTCATCCCAA



CGGGTGAAGGAGAGATTTTTGTCTCGCCGTTCAGCGCGGAGGAGGGGGACTTTCA



TCAGATTCACGCCGACCTGAATGCGGCGCAAAATCTGCAGCAGCGACTCTGGTCT



GATTTTGATATCAGTCAAATTCGGTTGCGGTGTGATTGGGGTGAAGTGGACGGTG



AACTCGTTCTGATCCCAAGGCTTACAGGAAAACGAACGGCGGATTCATATAGCAA



CAAGGTGTTTTATACCAATACAGGTGTCACCTATTATGAGCGAGAGCGGGGGAAG



AAGCGGAGAAAGGTTTTCGCGCAAGAGAAATTGTCGGAGGAAGAGGCGGAGTTGC



TCGTGGAAGCAGACGAGGCGAGGGAGAAATCGGTCGTTTTGATGCGTGATCCGTC



TGGCATCATCAATCGGGGAAATTGGACCAGGCAAAAGGAATTTTGGTCGATGGTG



AACCAGCGGATCGAAGGATACTTGGTCAAGCAGATTCGCTCGCGCGTTCCATTAC



AAGATAGTGCGTGTGAAAACACGGGGGATATTTAA





SEQ ID NO: 34
ATGGCGACACGCAGTTTTATTTTAAAAATTGAACCAAATGAAGAAGTTAAAAAGG



GATTATGGAAGACGCATGAGGTATTGAATCATGGAATTGCCTACTACATGAATAT



TCTGAAACTAATTAGACAGGAAGCTATTTATGAACATCATGAACAAGATCCTAAA



AATCCGAAAAAAGTTTCAAAAGCAGAAATACAAGCCGAGTTATGGGATTTTGTTT



TAAAAATGCAAAAATGTAATAGTTTTACACATGAAGTTGACAAAGATGTTGTTTT



TAACATCCTGCGTGAACTATATGAAGAGTTGGTCCCTAGTTCAGTCGAGAAAAAG



GGTGAAGCCAATCAATTATCGAATAAGTTTCTGTACCCGCTAGTTGATCCGAACA



GTCAAAGTGGGAAAGGGACGGCATCATCCGGACGTAAACCTCGGTGGTATAATTT



AAAAATAGCAGGCGACCCATCGTGGGAGGAAGAAAAGAAAAAATGGGAAGAGGAT



AAAAAGAAAGATCCCCTTGCTAAAATCTTAGGTAAGTTAGCAGAATATGGGCTTA



TTCCGCTATTTATTCCATTTACTGACAGCAACGAACCAATTGTAAAAGAAATTAA



ATGGATGGAAAAAAGTCGTAATCAAAGTGTCCGGCGACTTGATAAGGATATGTTT



ATCCAAGCATTAGAGCGTTTTCTTTCATGGGAAAGCTGGAACCTTAAAGTAAAGG



AAGAGTATGAAAAAGTTGAAAAGGAACACAAAACACTAGAGGAAAGGATAAAAGA



GGACATTCAAGCATTTAAATCCCTTGAACAATATGAAAAAGAACGGCAGGAGCAA



CTTCTTAGAGATACATTGAATACAAATGAATACCGATTAAGCAAAAGAGGATTAC



GTGGTTGGCGTGAAATTATCCAAAAATGGCTAAAGATGGATGAAAATGAACCATC



AGAAAAATATTTAGAAGTATTTAAAGATTATCAACGGAAACATCCACGAGAAGCC



GGGGACTATTCTGTCTATGAATTTTTAAGCAAGAAAGAAAATCATTTTATTTGGC



GAAATCATCCTGAATATCCTTATTTGTATGCTACATTTTGTGAAATTGACAAAAA



AAAGAAAGACGCTAAGCAACAGGCAACTTTTACTTTGGCTGACCCGATTAACCAT



CCGTTATGGGTACGATTTGAAGAAAGAAGCGGTTCGAACTTAAACAAATATCGAA



TTTTAACAGAGCAATTACACACTGAAAAGTTAAAAAAGAAATTAACAGTTCAACT



TGATCGTTTAATTTATCCAACTGAATCCGGCGGTTGGGAGGAAAAAGGTAAAGTA



GATATCGTTTTGTTGCCGTCAAGACAATTTTATAATCAAATCTTCCTTGATATAG



AAGAAAAGGGGAAACATGCTTTTACTTATAAGGATGAAAGTATTAAATTCCCCCT



TAAAGGTACACTTGGTGGTGCAAGAGTGCAGTTTGACCGTGACCATTTGCGGAGA



TATCCGCATAAAGTAGAATCAGGAAATGTTGGACGGATTTATTTTAACATGACAG



TAAATATTGAACCAACTGAGAGCCCTGTTAGTAAGTCTTTGAAAATACATAGGGA



CGATTTCCCCAAGTTCGTTAATTTTAAACCGAAAGAGCTCACCGAATGGATAAAA



GATAGTAAAGGGAAAAAATTAAAAAGTGGTATAGAATCCCTTGAAATTGGTCTAC



GGGTGATGAGTATCGACTTAGGTCAACGTCAAGCGGCTGCTGCATCGATTTTTGA



AGTAGTTGATCAGAAACCGGATATTGAAGGGAAGTTATTTTTTCCAATCAAAGGA



ACTGAGCTTTATGCTGTTCACCGGGCAAGTTTTAACATTAAATTACCGGGTGAAA



CATTAGTAAAATCACGGGAAGTATTGCGGAAAGCTCGGGAGGACAACTTAAAATT



AATGAATCAAAAGTTAAACTTTCTAAGAAATGTTCTACATTTCCAACAGTTTGAA



GATATCACAGAAAGAGAGAAGCGTGTAACTAAATGGATTTCTAGACAAGAAAATA



GTGATGTTCCTCTTGTATATCAAGATGAGCTAATTCAAATTCGTGAATTAATGTA



TAAACCCTATAAAGATTGGGTTGCCTTTTTAAAACAACTCCATAAACGGCTAGAA



GTCGAGATTGGCAAAGAGGTTAAGCATTGGCGAAAATCATTAAGTGACGGGAGAA



AAGGTCTTTACGGAATCTCCCTAAAAAATATTGATGAAATTGATCGAACAAGGAA



ATTCCTTTTAAGATGGAGCTTACGTCCAACAGAACCTGGGGAAGTAAGACGCTTG



GAACCAGGACAGCGTTTTGCGATTGATCAATTAAACCACCTAAATGCATTAAAAG



AAGATCGATTAAAAAAGATGGCAAATACGATTATCATGCATGCCTTAGGTTACTG



TTATGATGTAAGAAAGAAAAAGTGGCAGGCAAAAAATCCAGCATGTCAAATTATT



TTATTTGAAGATTTATCTAACTACAATCCTTACGAGGAAAGGTCCCGTTTTGAAA



ACTCAAAACTGATGAAGTGGTCACGGAGAGAAATTCCACGACAAGTCGCCTTACA



AGGTGAAATTTACGGATTACAAGTTGGGGAAGTAGGTGCCCAATTCAGTTCAAGA



TTCCATGCGAAAACCGGGTCGCCGGGAATTCGTTGCAGTGTTGTAACGAAAGAAA



AATTGCAGGATAATCGCTTTTTTAAAAATTTACAAAGAGAAGGACGACTTACTCT



TGATAAAATCGCAGTTTTAAAAGAAGGAGACTTATATCCAGATAAAGGTGGAGAA



AAGTTTATTTCTTTATCAAAGGATCGAAAGTTGGTAACTACGCATGCTGATATTA



ACGCGGCCCAAAATTTACAGAAGCGTTTTTGGACAAGAACACATGGATTTTATAA



AGTTTACTGCAAAGCCTATCAGGTTGATGGACAAACTGTTTATATTCCGGAGAGC



AAGGACCAAAAACAAAAAATAATTGAAGAATTTGGGGAAGGCTATTTTATTTTAA



AAGATGGTGTATATGAATGGGGTAATGCGGGGAAACTAAAAATTAAAAAAGGTTC



CTCTAAACAATCATCGAGTGAATTAGTAGATTCGGACATACTGAAAGATTCATTT



GATTTAGCAAGTGAACTTAAGGGAGAGAAACTCATGTTATATCGAGATCCGAGTG



GAAACGTATTTCCTTCCGACAAGTGGATGGCAGCAGGAGTATTTTTTGGCAAATT



AGAAAGAATATTGATTTCTAAGTTAACAAATCAATACTCAATATCAACAATAGAA



GATGATTCTTCAAAACAATCAATGTAA





SEQ ID NO: 35
ATGCCCACCCGCACCATCAATCTGAAACTTGTTCTTGGGAAAAATCCTGAAAACG



CAACATTGCGACGCGCCCTATTTTCGACACACCGTTTGGTTAACCAAGCGACGAA



ACGTATTGAGGAATTCTTGTTGCTGTGTCGTGGAGAAGCCTACAGAACAGTGGAT



AATGAGGGGAAGGAAGCCGAGATTCCACGTCATGCAGTCCAAGAAGAAGCTCTTG



CCTTTGCCAAAGCTGCTCAACGCCACAACGGCTGTATATCCACCTATGAAGACCA



AGAGATTCTTGATGTACTGCGGCAACTGTACGAACGTCTTGTTCCTTCGGTCAAC



GAAAACAACGAGGCAGGCGATGCTCAAGCTGCTAACGCCTGGGTCAGTCCGCTCA



TGTCGGCAGAAAGCGAAGGAGGCTTGTCGGTCTACGACAAGGTGCTTGATCCACC



GCCGGTTTGGATGAAGCTTAAAGAAGAAAAGGCTCCAGGATGGGAAGCCGCTTCT



CAAATTTGGATTCAGAGTGATGAGGGACAGTCGTTACTTAATAAGCCAGGTAGCC



CTCCCCGCTGGATTCGAAAACTGCGATCTGGGCAACCGTGGCAAGATGATTTCGT



CAGTGACCAAAAGAAAAAGCAAGATGAGCTGACCAAAGGGAACGCACCACTTATA



AAACAACTCAAAGAAATGGGGTTGTTGCCTCTTGTTAACCCATTTTTTAGACATC



TTCTTGACCCTGAAGGTAAAGGCGTGAGTCCATGGGACCGTCTTGCTGTACGCGC



TGCAGTGGCTCACTTTATCTCCTGGGAAAGTTGGAATCATAGAACACGTGCAGAA



TACAATTCCTTGAAACTACGGCGAGACGAGTTTGAGGCAGCATCCGACGAATTCA



AAGACGATTTTACTTTGCTCCGACAATATGAAGCCAAACGCCATAGTACATTGAA



AAGCATCGCGCTGGCCGACGATTCGAACCCTTACCGGATTGGAGTACGTTCTCTG



CGTGCCTGGAACCGCGTTCGTGAAGAATGGATAGACAAGGGTGCAACAGAAGAAC



AACGCGTGACCATATTGTCAAAGCTTCAAACACAACTTCGGGGAAAATTCGGCGA



TCCCGATCTGTTCAACTGGCTAGCTCAGGATAGGCATGTCCATTTGTGGTCTCCT



CGGGACAGCGTGACACCATTGGTTCGCATCAATGCGGTAGATAAAGTTCTGCGTC



GACGAAAACCGTATGCATTGATGACCTTTGCCCATCCCCGCTTCCACCCTCGATG



GATACTGTACGAGGCTCCAGGAGGAAGCAATCTCCGTCAATATGCATTGGATTGT



ACAGAAAACGCTCTACACATCACGTTGCCTTTGCTTGTCGACGATGCGCACGGAA



CCTGGATTGAAAAAAAGATCAGGGTGCCGCTGGCACCATCCGGACAAATTCAAGA



TTTAACTCTGGAAAAACTTGAGAAGAAAAAAAATCGTTTATACTACCGTTCCGGT



TTTCAGCAGTTTGCCGGCTTGGCTGGCGGAGCTGAGGTTCTTTTCCACAGACCCT



ATATGGAACACGACGAACGCAGCGAGGAGTCTCTTTTGGAACGTCCGGGAGCCGT



TTGGTTCAAATTGACCCTGGATGTGGCAACACAGGCTCCCCCGAACTGGCTTGAT



GGTAAGGGCCGTGTCCGTACACCGCCGGAGGTACATCATTTTAAAACCGCATTGT



CGAATAAAAGCAAACATACACGTACGCTGCAGCCGGGTCTCCGTGTCTTGTCAGT



AGACTTGGGCATGCGAACATTCGCCTCCTGCTCAGTATTTGAACTCATCGAGGGA



AAGCCTGAGACAGGCCGTGCCTTCCCTGTTGCCGATGAGAGATCAATGGACAGCC



CGAATAAACTGTGGGCCAAGCATGAACGTAGTTTTAAACTGACGCTCCCCGGCGA



AACCCCTTCTCGAAAGGAAGAGGAAGAGCGTAGCATAGCAAGAGCGGAAATTTAT



GCACTGAAACGCGACATACAACGCCTCAAAAGCCTACTCCGCTTAGGTGAAGAAG



ATAACGATAACCGTCGTGATGCATTGCTTGAACAGTTCTTTAAAGGATGGGGAGA



AGAAGACGTTGTGCCTGGACAAGCGTTTCCACGCTCTCTTTTCCAAGGGTTGGGA



GCTGCCCCGTTTCGCTCAACTCCAGAGTTATGGCGTCAGCATTGCCAAACATATT



ATGACAAAGCGGAAGCCTGTCTGGCTAAACATATCAGTGATTGGCGCAAGCGAAC



TCGTCCCCGTCCGACATCGCGGGAGATGTGGTACAAAACACGTTCCTATCATGGC



GGCAAGTCCATTTGGATGTTGGAATATCTTGATGCCGTTCGAAAACTGCTTCTCA



GTTGGAGCTTACGTGGTCGTACTTACGGTGCCATTAATCGCCAGGATACAGCCCG



GTTTGGTTCTTTGGCATCACGGCTGCTCCACCATATCAATTCCCTAAAGGAAGAC



CGCATCAAAACAGGAGCCGACTCTATCGTTCAGGCTGCTCGCGGGTATATTCCTC



TCCCTCATGGCAAGGGTTGGGAACAAAGATATGAGCCTTGTCAGCTCATATTATT



TGAAGACCTCGCACGATATCGCTTTCGCGTGGATCGACCTCGTCGAGAGAACAGC



CAACTCATGCAGTGGAACCATCGAGCCATCGTGGCAGAAACAACGATGCAAGCCG



AACTCTACGGACAAATTGTCGAAAATACTGCAGCGGGGTTCAGCAGTCGTTTTCA



CGCGGCGACAGGTGCCCCCGGTGTACGTTGTCGTTTTCTTCTAGAAAGAGACTTT



GATAACGATTTGCCCAAACCGTACCTTCTCAGGGAACTTTCTTGGATGCTCGGCA



ATACAAAAGTCGAGTCTGAAGAAGAAAAGCTTCGATTGCTGTCTGAAAAAATCAG



GCCAGGCAGTCTTGTTCCTTGGGATGGAGGCGAACAGTTCGCTACCCTGCATCCC



AAAAGACAAACACTTTGCGTCATTCATGCCGATATGAATGCTGCCCAAAATTTAC



AACGCCGGTTTTTCGGTCGATGCGGCGAGGCCTTTCGGCTTGTTTGTCAACCCCA



CGGTGACGACGTGTTACGACTCGCATCCACCCCAGGAGCTCGTCTTCTTGGAGCC



CTGCAGCAGCTTGAAAATGGACAAGGAGCTTTCGAGTTGGTTCGAGACATGGGGT



CAACAAGTCAAATGAACCGGTTCGTCATGAAGTCTTTGGGAAAAAAGAAAATAAA



ACCCCTTCAGGACAACAATGGAGACGACGAGCTTGAAGACGTGTTGTCCGTACTC



CCGGAGGAAGACGACACAGGACGTATCACAGTCTTCCGCGATTCATCAGGAATCT



TTTTTCCTTGCAACGTCTGGATACCGGCCAAACAGTTTTGGCCAGCAGTACGCGC



CATGATTTGGAAGGTCATGGCTTCCCATTCTTTGGGGTGA





SEQ ID NO: 36
ATGACAAAGTTAAGACACCGACAGAAAAAATTAACACACGACTGGGCTGGCTCCA



AAAAGAGGGAAGTATTAGGCTCAAATGGCAAGCTTCAGAATCCGTTGTTAATGCC



GGTTAAAAAAGGTCAGGTTACTGAGTTCCGGAAAGCGTTTTCTGCGTATGCTCGC



GCAACGAAAGGAGAAATGACTGACGGCCGAAAGAATATGTTTACGCATAGTTTCG



AGCCATTTAAGACAAAGCCCTCGCTTCATCAGTGTGAATTGGCAGATAAAGCATA



TCAATCTTTACATTCGTATCTGCCTGGTTCTCTTGCTCATTTTCTATTATCTGCT



CACGCATTAGGTTTTCGTATTTTTTCAAAATCTGGTGAAGCAACTGCATTCCAGG



CATCCTCTAAAATTGAAGCTTACGAATCAAAATTGGCAAGCGAATTAGCTTGTGT



AGATTTATCTATTCAAAACTTGACTATTTCAACGCTTTTTAATGCGCTTACAACG



TCTGTAAGAGGGAAGGGCGAAGAAACTAGCGCTGACCCCTTAATTGCACGATTTT



ACACCTTACTTACTGGCAAGCCTCTGTCTCGAGACACTCAAGGGCCTGAACGTGA



TTTAGCAGAAGTTATCTCGCGTAAGATAGCTAGTTCTTTTGGCACATGGAAAGAA



ATGACGGCAAACCCTCTTCAGTCATTACAATTTTTTGAAGAGGAACTCCATGCGC



TGGATGCCAATGTCTCGCTCTCACCCGCCTTCGACGTTTTAATTAAAATGAATGA



TTTGCAGGGCGATTTAAAAAATCGAACCATTGTTTTTGATCCTGACGCCCCTGTT



TTTGAATATAACGCAGAAGACCCTGCCGACATAATTATTAAACTTACAGCTCGTT



ACGCTAAAGAAGCTGTCATCAAAAATCAAAACGTAGGAAATTACGTTAAAAACGC



TATTACTACCACAAATGCCAATGGTCTTGGTTGGCTTTTGAACAAAGGTTTGTCG



TTACTCCCTGTCTCGACCGATGACGAATTGCTAGAGTTTATTGGCGTTGAACGAT



CTCATCCCTCATGCCATGCCTTAATTGAATTGATTGCACAATTAGAAGCCCCCGA



GCTCTTTGAGAAGAACGTATTTTCAGATACTCGTTCTGAAGTTCAAGGTATGATT



GATTCAGCTGTTTCTAATCATATTGCTCGTCTTTCCAGCTCTAGAAATAGCTTGT



CAATGGATAGTGAAGAATTAGAACGTTTAATCAAAAGCTTTCAGATACACACACC



TCATTGCTCACTTTTTATTGGCGCCCAATCACTTTCACAGCAGTTAGAATCTTTG



CCTGAAGCCCTTCAATCGGGCGTTAATTCAGCCGATATTTTACTAGGCTCTACTC



AATATATGCTCACCAATTCTTTGGTTGAAGAGTCAATTGCAACTTATCAAAGAAC



ACTTAATCGCATCAATTACTTGTCAGGTGTTGCAGGTCAGATTAACGGCGCAATA



AAGCGAAAAGCGATAGATGGAGAAAAAATTCACTTGCCTGCAGCTTGGTCAGAGT



TGATATCTTTACCATTTATAGGCCAGCCTGTTATAGATGTTGAAAGCGATTTAGC



TCATCTAAAAAATCAATACCAAACACTTTCAAATGAGTTTGATACTCTTATATCT



GCTTTGCAAAAGAATTTTGATTTGAACTTTAATAAAGCGCTCCTTAATCGTACTC



AGCATTTTGAAGCCATGTGTAGAAGCACTAAGAAAAACGCTTTATCCAAACCAGA



GATCGTTTCCTATCGCGACCTGCTTGCTCGATTAACTTCTTGTTTGTATCGAGGC



TCTTTAGTTTTGCGTCGTGCCGGCATTGAAGTGTTAAAAAAACATAAAATATTTG



AGTCAAACAGCGAACTTCGTGAACATGTTCATGAAAGAAAGCATTTCGTGTTTGT



TAGTCCTCTAGATCGCAAAGCCAAGAAACTCCTTCGATTAACTGATTCGCGTCCA



GACTTGTTACATGTTATTGATGAAATATTGCAGCACGATAATCTTGAAAACAAAG



ACCGCGAGTCACTTTGGCTAGTTCGCTCTGGTTATTTGCTTGCAGGACTTCCAGA



TCAACTTTCTTCATCTTTTATTAACTTGCCTATCATTACTCAAAAAGGAGATAGA



CGCCTTATAGACCTGATTCAGTATGATCAAATTAATCGTGATGCTTTTGTTATGT



TAGTGACCTCTGCATTCAAGTCTAATTTGTCTGGTCTGCAGTATCGTGCCAATAA



GCAATCGTTCGTTGTTACTCGCACGCTAAGCCCTTATCTCGGCTCAAAACTTGTC



TACGTACCCAAGGATAAAGATTGGTTAGTTCCTTCTCAAATGTTTGAAGGACGAT



TTGCTGACATTCTTCAATCAGATTATATGGTCTGGAAAGATGCCGGTCGTCTTTG



TGTTATTGATACTGCAAAACACCTTTCTAATATAAAGAAGTCTGTATTTTCATCC



GAAGAAGTTCTCGCTTTTTTAAGAGAACTCCCTCACCGCACATTTATCCAGACCG



AAGTTCGCGGCCTTGGCGTTAATGTCGATGGAATTGCATTTAATAATGGTGATAT



TCCGTCATTAAAAACCTTTTCAAATTGCGTTCAGGTAAAAGTTTCTCGGACTAAT



ACATCCCTAGTTCAAACACTTAATCGTTGGTTTGAAGGAGGAAAAGTTTCTCCTC



CGAGCATTCAATTTGAACGGGCGTATTATAAAAAAGACGATCAAATTCATGAAGA



CGCAGCGAAAAGAAAGATACGATTCCAGATGCCCGCAACTGAGTTGGTTCATGCT



TCTGACGATGCGGGGTGGACACCAAGTTATTTGCTCGGCATTGATCCTGGCGAGT



ATGGAATGGGTCTTTCATTGGTTTCGATTAATAACGGAGAAGTCTTAGATTCAGG



CTTTATTCATATTAATTCTCTGATCAATTTTGCCTCTAAAAAGAGCAACCATCAA



ACTAAGGTTGTTCCGCGTCAGCAGTACAAATCTCCTTATGCAAATTATTTAGAAC



AATCTAAAGATTCTGCTGCTGGTGATATTGCGCATATACTCGATCGACTTATATA



CAAATTAAATGCGTTGCCTGTTTTTGAGGCTCTTTCAGGTAATTCTCAGAGTGCT



GCTGATCAAGTTTGGACGAAAGTCTTATCGTTTTACACTTGGGGTGATAATGACG



CTCAGAATTCTATTAGAAAGCAGCATTGGTTTGGAGCCAGTCATTGGGATATCAA



AGGTATGTTAAGGCAACCCCCTACGGAGAAGAAGCCTAAACCGTATATTGCTTTT



CCTGGCTCTCAGGTTTCTTCGTATGGTAATTCCCAACGTTGCTCTTGCTGCGGTC



GCAATCCTATTGAACAACTTCGAGAAATGGCAAAGGATACCTCTATTAAAGAGCT



AAAAATTCGCAATTCTGAGATACAGCTTTTTGACGGAACCATTAAATTATTTAAT



CCAGACCCATCCACTGTGATAGAGAGAAGGCGACATAATCTTGGTCCATCAAGAA



TTCCTGTTGCTGACCGTACTTTCAAAAACATCAGTCCATCAAGTCTAGAATTTAA



AGAATTGATTACTATCGTGTCTCGATCTATCCGTCATTCACCTGAGTTTATCGCT



AAAAAACGCGGCATAGGGTCTGAGTATTTTTGCGCTTATTCCGATTGCAACTCAT



CCTTAAATTCTGAAGCTAACGCAGCTGCTAACGTAGCGCAAAAATTTCAAAAACA



GTTATTTTTTGAGTTATAA





SEQ ID NO: 37
ATGAAGAGAATTCTGAACAGTCTGAAAGTTGCTGCCTTGAGACTTCTGTTTCGAG



GCAAAGGTTCTGAATTAGTGAAGACAGTCAAATATCCATTGGTTTCCCCGGTTCA



AGGCGCGGTTGAAGAACTTGCTGAAGCAATTCGGCACGACAACCTGCACCTTTTT



GGGCAGAAGGAAATAGTGGATCTTATGGAGAAAGACGAAGGAACCCAGGTGTATT



CGGTTGTGGATTTTTGGTTGGATACCCTGCGTTTAGGGATGTTTTTCTCACCATC



AGCGAATGCGTTGAAAATCACGCTGGGAAAATTCAATTCTGATCAGGTTTCACCT



TTTCGTAAGGTTTTGGAGCAGTCACCTTTTTTTCTTGCGGGTCGCTTGAAGGTTG



AACCTGCGGAAAGGATACTTTCTGTTGAAATCAGAAAGATTGGTAAAAGAGAAAA



CAGAGTTGAGAACTATGCCGCCGATGTGGAGACATGCTTCATTGGTCAGCTTTCT



TCAGATGAGAAACAGAGTATCCAGAAGCTGGCAAATGATATCTGGGATAGCAAGG



ATCATGAGGAACAGAGAATGTTGAAGGCGGATTTTTTTGCTATACCTCTTATAAA



AGACCCCAAAGCTGTCACAGAAGAAGATCCTGAAAATGAAACGGCGGGAAAACAG



AAACCGCTTGAATTATGTGTTTGTCTTGTTCCTGAGTTGTATACCCGAGGTTTCG



GCTCCATTGCTGATTTTCTGGTTCAGCGACTTACCTTGCTGCGTGACAAAATGAG



TACCGACACGGCGGAAGATTGCCTCGAGTATGTTGGCATTGAGGAAGAAAAAGGC



AATGGAATGAATTCCTTGCTCGGCACTTTTTTGAAGAACCTGCAGGGTGATGGTT



TTGAACAGATTTTTCAGTTTATGCTTGGGTCTTATGTTGGCTGGCAGGGGAAGGA



AGATGTACTGCGCGAACGATTGGATTTGCTGGCCGAAAAAGTCAAAAGATTACCA



AAGCCAAAATTTGCCGGAGAATGGAGTGGTCATCGTATGTTTCTCCATGGTCAGC



TGAAAAGCTGGTCGTCGAATTTCTTCCGTCTTTTTAATGAGACGCGGGAACTTCT



GGAAAGTATCAAGAGTGATATTCAACATGCCACCATGCTCATTAGCTATGTGGAA



GAGAAAGGAGGCTATCATCCACAGCTGTTGAGTCAGTATCGGAAGTTAATGGAAC



AATTACCGGCGTTGCGGACTAAGGTTTTGGATCCTGAGATTGAGATGACGCATAT



GTCCGAGGCTGTTCGAAGTTACATTATGATACACAAGTCTGTAGCGGGATTTCTG



CCGGATTTACTCGAGTCTTTGGATCGAGATAAGGATAGGGAATTTTTGCTTTCCA



TCTTTCCTCGTATTCCAAAGATAGATAAGAAGACGAAAGAGATCGTTGCATGGGA



GCTACCGGGCGAGCCAGAGGAAGGCTATTTGTTCACAGCAAACAACCTTTTCCGG



AATTTTCTTGAGAATCCGAAACATGTGCCACGATTTATGGCAGAGAGGATTCCCG



AGGATTGGACGCGTTTGCGCTCGGCCCCTGTGTGGTTTGATGGGATGGTGAAGCA



ATGGCAGAAGGTGGTGAATCAGTTGGTTGAATCTCCAGGCGCCCTTTATCAGTTC



AATGAAAGTTTTTTGCGTCAAAGACTGCAAGCAATGCTTACGGTCTATAAGCGGG



ATCTCCAGACTGAGAAGTTTCTGAAGCTGCTGGCTGATGTCTGTCGTCCACTCGT



TGATTTTTTCGGACTTGGAGGAAATGATATTATCTTCAAGTCATGTCAGGATCCA



AGAAAGCAATGGCAGACTGTTATTCCACTCAGTGTCCCAGCGGATGTTTATACAG



CATGTGAAGGCTTGGCTATTCGTCTCCGCGAAACTCTTGGATTCGAATGGAAAAA



TCTGAAAGGACACGAGCGGGAAGATTTTTTACGGCTGCATCAGTTGCTGGGAAAT



CTGCTGTTCTGGATCAGGGATGCGAAACTTGTCGTGAAGCTGGAAGACTGGATGA



ACAATCCTTGTGTTCAGGAGTATGTGGAAGCACGAAAAGCCATTGATCTTCCCTT



GGAGATTTTCGGATTTGAGGTGCCGATTTTTCTCAATGGCTATCTCTTTTCGGAA



CTGCGCCAGCTGGAATTGTTGCTGAGGCGTAAGTCGGTGATGACGTCTTACAGCG



TCAAAACGACAGGCTCGCCAAATAGGCTCTTCCAGTTGGTTTACCTACCTCTAAA



CCCTTCAGATCCGGAAAAGAAAAATTCCAACAACTTTCAGGAGCGCCTCGATACA



CCTACCGGTTTGTCGCGTCGTTTTCTGGATCTTACGCTGGATGCATTTGCTGGCA



AACTCTTGACGGATCCGGTAACTCAGGAACTGAAGACGATGGCCGGTTTTTACGA



TCATCTCTTTGGCTTCAAGTTGCCGTGTAAACTGGCGGCGATGAGTAACCATCCA



GGATCCTCTTCCAAAATGGTGGTTCTGGCAAAACCAAAGAAGGGTGTTGCTAGTA



ACATCGGCTTTGAACCTATTCCCGATCCTGCTCATCCTGTGTTCCGGGTGAGAAG



TTCCTGGCCGGAGTTGAAGTACCTGGAGGGGTTGTTGTATCTTCCCGAAGATACA



CCACTGACCATTGAACTGGCGGAAACGTCGGTCAGTTGTCAGTCTGTGAGTTCAG



TCGCTTTCGATTTGAAGAATCTGACGACTATCTTGGGTCGTGTTGGTGAATTCAG



GGTGACGGCAGATCAACCTTTCAAGCTGACGCCCATTATTCCTGAGAAAGAGGAA



TCCTTCATCGGGAAGACCTACCTCGGTCTTGATGCTGGAGAGCGATCTGGCGTTG



GTTTCGCGATTGTGACGGTTGACGGCGATGGGTATGAGGTGCAGAGGTTGGGTGT



GCATGAAGATACTCAGCTTATGGCGCTTCAGCAAGTCGCCAGCAAGTCTCTTAAG



GAGCCGGTTTTCCAGCCACTCCGTAAGGGCACATTTCGTCAGCAGGAGCGCATTC



GCAAAAGCCTCCGCGGTTGCTACTGGAATTTCTATCATGCATTGATGATCAAGTA



CCGAGCTAAAGTTGTGCATGAGGAATCGGTGGGTTCATCCGGTCTGGTGGGGCAG



TGGCTGCGTGCATTTCAGAAGGATCTCAAAAAGGCTGATGTTCTGCCCAAGAAGG



GTGGAAAAAATGGTGTAGACAAAAAAAAGAGAGAAAGCAGCGCTCAGGATACCTT



ATGGGGAGGAGCTTTCTCGAAGAAGGAAGAGCAGCAGATAGCCTTTGAGGTTCAG



GCAGCTGGATCAAGCCAGTTTTGTCTGAAGTGTGGTTGGTGGTTTCAGTTGGGGA



TGCGGGAAGTAAATCGTGTGCAGGAGAGTGGCGTGGTGCTGGACTGGAACCGGTC



CATTGTAACCTTCCTCATCGAATCCTCAGGAGAAAAGGTATATGGTTTCAGTCCT



CAGCAACTGGAAAAAGGCTTTCGTCCTGACATCGAAACGTTCAAAAAAATGGTAA



GGGATTTTATGAGACCCCCCATGTTTGATCGCAAAGGTCGGCCGGCCGCGGCGTA



TGAAAGATTCGTACTGGGACGTCGTCACCGTCGTTATCGCTTTGATAAAGTTTTT



GAAGAGAGATTTGGTCGCAGTGCTCTTTTCATCTGCCCGCGGGTCGGGTGTGGGA



ATTTCGATCACTCCAGTGAGCAGTCAGCCGTTGTCCTTGCCCTTATTGGTTACAT



TGCTGATAAGGAAGGGATGAGTGGTAAGAAGCTTGTTTATGTGAGGCTGGCTGAA



CTTATGGCTGAGTGGAAGCTGAAGAAACTGGAGAGATCAAGGGTGGAAGAACAGA



GCTCGGCACAATAA





SEQ ID NO: 38
ATGGCAGAAAGCAAGCAGATGCAATGCCGCAAGTGCGGCGCAAGCATGAAGTATG



AAGTAATTGGATTGGGCAAGAAGTCATGCAGATATATGTGCCCAGATTGCGGCAA



TCACACCAGCGCGCGCAAGATTCAGAACAAGAAAAAGCGCGACAAAAAGTATGGA



TCCGCAAGCAAAGCGCAGAGCCAGAGGATAGCTGTGGCTGGCGCGCTTTATCCAG



ACAAAAAAGTGCAGACCATAAAGACCTACAAATACCCAGCGGATCTTAATGGCGA



AGTTCATGACAGCGGCGTCGCAGAGAAGATTGCGCAGGCGATTCAGGAAGATGAG



ATCGGCCTGCTTGGCCCGTCCAGCGAATACGCTTGCTGGATTGCTTCACAAAAAC



AGAGCGAGCCGTATTCAGTTGTAGATTTTTGGTTTGACGCGGTGTGCGCAGGCGG



AGTATTCGCGTATTCTGGCGCGCGCCTGCTTTCCACAGTCCTCCAGTTGAGTGGC



GAGGAAAGCGTTTTGCGCGCTGCTTTAGCATCTAGCCCGTTTGTAGATGACATTA



ATTTGGCGCAAGCGGAAAAGTTCCTAGCCGTTAGCCGGCGCACAGGCCAAGATAA



GCTAGGCAAGCGCATTGGAGAATGTTTTGCGGAAGGCCGGCTTGAAGCGCTTGGC



ATCAAAGATCGCATGCGCGAATTCGTGCAAGCGATTGATGTGGCCCAAACCGCGG



GCCAGCGGTTCGCGGCCAAGCTAAAGATATTCGGCATCAGTCAGATGCCTGAAGC



CAAGCAATGGAACAATGATTCCGGGCTCACTGTATGTATTTTGCCGGATTATTAT



GTCCCGGAAGAAAACCGCGCGGACCAGCTGGTTGTTTTGCTTCGGCGCTTACGCG



AGATCGCGTATTGCATGGGAATTGAGGATGAAGCAGGATTTGAGCATCTAGGCAT



TGACCCTGGTGCTCTTTCCAATTTTTCCAATGGCAATCCAAAGCGAGGATTTCTC



GGCCGCCTGCTCAATAATGACATTATAGCGCTGGCAAACAACATGTCAGCCATGA



CGCCGTATTGGGAAGGCAGAAAAGGCGAGTTGATTGAGCGCCTTGCATGGCTTAA



ACATCGCGCTGAAGGATTGTATTTGAAAGAGCCACATTTCGGCAACTCCTGGGCA



GACCACCGCAGCAGGATTTTCAGTCGCATTGCGGGCTGGCTTTCCGGATGCGCGG



GCAAGCTCAAGATTGCCAAGGATCAGATTTCAGGCGTGCGTACGGATTTGTTTCT



GCTCAAGCGCCTTCTGGATGCGGTACCGCAAAGCGCGCCGTCGCCGGACTTTATT



GCTTCCATCAGCGCGCTGGATCGGTTTTTGGAAGCGGCAGAAAGCAGCCAGGATC



CGGCAGAACAGGTACGCGCTTTGTACGCGTTTCATCTGAACGCGCCTGCGGTCCG



ATCCATCGCCAACAAGGCGGTACAGAGGTCTGATTCCCAGGAGTGGCTTATCAAG



GAACTGGATGCTGTAGATCACCTTGAATTCAACAAAGCATTTCCGTTTTTTTCGG



ATACAGGAAAGAAAAAGAAGAAAGGAGCGAATAGCAACGGAGCGCCTTCTGAAGA



AGAATACACGGAAACAGAATCCATTCAACAACCAGAAGATGCAGAGCAGGAAGTG



AATGGTCAAGAAGGAAATGGCGCTTCAAAGAACCAGAAAAAGTTTCAGCGCATTC



CTCGATTTTTCGGGGAAGGGTCAAGGAGTGAGTATCGAATTTTAACAGAAGCGCC



GCAATATTTTGACATGTTCTGCAATAATATGCGCGCGATCTTTATGCAGCTAGAG



AGTCAGCCGCGCAAGGCGCCTCGTGATTTCAAATGCTTTCTGCAGAATCGTTTGC



AGAAGCTTTACAAGCAAACCTTTCTCAATGCTCGCAGTAATAAATGCCGCGCGCT



TCTGGAATCCGTCCTTATTTCATGGGGAGAATTTTATACTTATGGCGCGAATGAA



AAGAAGTTTCGTCTGCGCCATGAAGCGAGCGAGCGCAGCTCGGATCCGGACTATG



TGGTTCAGCAGGCATTGGAAATCGCGCGCCGGCTTTTCTTGTTCGGATTTGAGTG



GCGCGATTGCTCTGCTGGAGAGCGCGTGGATTTGGTTGAAATCCACAAAAAAGCA



ATCTCATTTTTGCTTGCAATCACTCAGGCCGAGGTTTCAGTTGGTTCCTATAACT



GGCTTGGGAATAGCACCGTGAGCCGGTATCTTTCGGTTGCTGGCACAGACACATT



GTACGGCACTCAACTGGAGGAGTTTTTGAACGCCACAGTGCTTTCACAGATGCGT



GGGCTGGCGATTCGGCTTTCATCTCAGGAGTTAAAAGACGGATTTGATGTTCAGT



TGGAGAGTTCGTGCCAGGACAATCTCCAGCATCTGCTGGTGTATCGCGCTTCGCG



CGACTTGGCTGCGTGCAAACGCGCTACATGCCCGGCTGAATTGGATCCGAAAATT



CTTGTTCTGCCGGTTGGTGCGTTTATCGCGAGCGTAATGAAAATGATTGAGCGTG



GCGATGAACCATTAGCAGGCGCGTATTTGCGTCATCGGCCGCATTCATTCGGCTG



GCAGATACGGGTTCGTGGAGTGGCGGAAGTAGGCATGGATCAGGGCACAGCGCTA



GCATTCCAGAAGCCGACTGAATCAGAGCCGTTTAAAATAAAGCCGTTTTCCGCTC



AATACGGCCCAGTACTTTGGCTTAATTCTTCATCCTATAGCCAGAGCCAGTATCT



GGATGGATTTTTAAGCCAGCCAAAGAATTGGTCTATGCGGGTGCTACCTCAAGCC



GGATCAGTGCGCGTGGAACAGCGCGTTGCTCTGATATGGAATTTGCAGGCAGGCA



AGATGCGGCTGGAGCGCTCTGGAGCGCGCGCGTTTTTCATGCCAGTGCCATTCAG



CTTCAGGCCGTCTGGTTCAGGAGATGAAGCAGTATTGGCGCCGAATCGGTACTTG



GGACTTTTTCCGCATTCCGGAGGAATAGAATACGCGGTGGTGGATGTATTAGATT



CCGCGGGTTTCAAAATTCTTGAGCGCGGTACGATTGCGGTAAATGGCTTTTCCCA



GAAGCGCGGCGAACGCCAAGAGGAGGCACACAGAGAAAAACAGAGACGCGGAATT



TCTGATATAGGCCGCAAGAAGCCGGTGCAAGCTGAAGTTGACGCAGCCAATGAAT



TGCACCGCAAATACACCGATGTTGCCACTCGTTTAGGGTGCAGAATTGTGGTTCA



GTGGGCGCCCCAGCCAAAGCCGGGCACAGCGCCGACCGCGCAAACAGTATACGCG



CGCGCAGTGCGGACCGAAGCGCCGCGATCTGGAAATCAAGAGGATCATGCTCGTA



TGAAATCCTCTTGGGGATATACCTGGGGCACCTATTGGGAGAAGCGCAAACCAGA



GGATATTTTGGGCATCTCAACCCAAGTATACTGGACCGGCGGTATAGGCGAGTCA



TGTCCCGCAGTCGCGGTTGCGCTTTTGGGGCACATTAGGGCAACATCCACTCAAA



CTGAATGGGAAAAAGAGGAGGTTGTATTCGGTCGACTGAAGAAGTTCTTTCCAAG



CTAG





SEQ ID NO: 39
ATGGAAAAGAGAATAAACAAGATACGAAAGAAACTATCGGCCGATAATGCCACAA



AGCCTGTGAGCAGGAGCGGCCCCATGAAAACACTCCTTGTCCGGGTCATGACGGA



CGACTTGAAAAAAAGACTGGAGAAGCGTCGGAAAAAGCCGGAAGTTATGCCGCAG



GTTATTTCAAATAACGCAGCAAACAATCTTAGAATGCTCCTTGATGACTATACAA



AGATGAAGGAGGCGATACTACAAGTTTACTGGCAGGAATTTAAGGACGACCATGT



GGGCTTGATGTGCAAATTTGCCCAGCCTGCTTCCAAAAAAATTGACCAGAACAAA



CTAAAACCGGAAATGGATGAAAAAGGAAATCTAACAACTGCCGGTTTTGCATGTT



CTCAATGCGGTCAGCCGCTATTTGTTTATAAGCTTGAACAGGTGAGTGAAAAAGG



CAAGGCTTATACAAATTACTTCGGCCGGTGTAATGTGGCCGAGCATGAGAAATTG



ATTCTTCTTGCTCAATTAAAACCTGAAAAAGACAGTGACGAAGCAGTGACATACT



CCCTTGGCAAATTCGGCCAGAGGGCATTGGACTTTTATTCAATCCACGTAACAAA



AGAATCCACCCATCCAGTAAAGCCCCTGGCACAGATTGCGGGCAACCGCTATGCA



AGCGGACCTGTTGGCAAGGCCCTTTCCGATGCCTGTATGGGCACTATAGCCAGTT



TTCTTTCGAAATATCAAGACATCATCATAGAACATCAAAAGGTTGTGAAGGGTAA



TCAAAAGAGGTTAGAGAGTCTCAGGGAATTGGCAGGGAAAGAAAATCTTGAGTAC



CCATCGGTTACACTGCCGCCGCAGCCGCATACGAAAGAAGGGGTTGACGCTTATA



ACGAAGTTATTGCAAGGGTACGTATGTGGGTTAATCTTAATCTGTGGCAAAAGCT



GAAGCTCAGCCGTGATGACGCAAAACCGCTACTGCGGCTAAAAGGATTCCCATCT



TTCCCTGTTGTGGAGCGGCGTGAAAACGAAGTTGACTGGTGGAATACGATTAATG



AAGTAAAAAAACTGATTGACGCTAAACGAGATATGGGACGGGTATTCTGGAGCGG



CGTTACCGCAGAAAAGAGAAATACCATCCTTGAAGGATACAACTATCTGCCAAAT



GAGAATGACCATAAAAAGAGAGAGGGCAGTTTGGAAAACCCTAAGAAGCCTGCCA



AACGCCAGTTTGGAGACCTCTTGCTGTATCTTGAAAAGAAATATGCCGGAGACTG



GGGAAAGGTCTTCGATGAGGCATGGGAGAGGATAGATAAGAAAATAGCCGGACTC



ACAAGCCATATAGAGCGCGAAGAAGCAAGAAACGCGGAAGACGCTCAATCCAAAG



CCGTACTTACAGACTGGCTAAGGGCAAAGGCATCATTTGTTCTTGAAAGACTGAA



GGAAATGGATGAAAAGGAATTCTATGCGTGTGAAATCCAACTTCAAAAATGGTAT



GGCGATCTTCGAGGCAACCCGTTTGCCGTTGAAGCTGAGAATAGAGTTGTTGATA



TAAGCGGGTTTTCTATCGGAAGCGATGGCCATTCAATCCAATACAGAAATCTCCT



TGCCTGGAAATATCTGGAGAACGGCAAGCGTGAATTCTATCTGTTAATGAATTAT



GGCAAGAAAGGGCGCATCAGATTTACAGATGGAACAGATATTAAAAAGAGCGGCA



AATGGCAGGGACTATTATATGGCGGTGGCAAGGCAAAGGTTATTGATCTGACTTT



CGACCCCGATGATGAACAGTTGATAATCCTGCCGCTGGCCTTTGGCACAAGGCAA



GGCCGCGAGTTTATCTGGAACGATTTGCTGAGTCTTGAAACAGGCCTGATAAAGC



TCGCAAACGGAAGAGTTATCGAAAAAACAATCTATAACAAAAAAATAGGGCGGGA



TGAACCGGCTCTATTCGTTGCCTTAACATTTGAGCGCCGGGAAGTTGTTGATCCA



TCAAATATAAAGCCTGTAAACCTTATAGGCGTTGACCGCGGCGAAAACATCCCGG



CGGTTATTGCATTGACAGACCCTGAAGGTTGTCCTTTACCGGAATTCAAGGATTC



ATCAGGGGGCCCAACAGACATCCTGCGAATAGGAGAAGGATATAAGGAAAAGCAG



AGGGCTATTCAGGCAGCAAAGGAGGTAGAGCAAAGGCGGGCTGGCGGTTATTCAC



GGAAGTTTGCATCCAAGTCGAGGAACCTGGCGGACGACATGGTGAGAAATTCAGC



GCGAGACCTTTTTTACCATGCCGTTACCCACGATGCCGTCCTTGTCTTTGAAAAC



CTGAGCAGGGGTTTTGGAAGGCAGGGCAAAAGGACCTTCATGACGGAAAGACAAT



ATACAAAGATGGAAGACTGGCTGACAGCGAAGCTCGCATACGAAGGTCTTACGTC



AAAAACCTACCTTTCAAAGACGCTGGCGCAATATACGTCAAAAACATGCTCCAAC



TGCGGGTTTACTATAACGACTGCCGATTATGACGGGATGTTGGTAAGGCTTAAAA



AGACTTCTGATGGATGGGCAACTACCCTCAACAACAAAGAATTAAAAGCCGAAGG



CCAGATAACGTATTATAACCGGTATAAAAGGCAAACCGTGGAAAAAGAACTCTCC



GCAGAGCTTGACAGGCTTTCAGAAGAGTCGGGCAATAATGATATTTCTAAGTGGA



CCAAGGGTCGCCGGGACGAGGCATTATTTTTGTTAAAGAAAAGATTCAGCCATCG



GCCTGTTCAGGAACAGTTTGTTTGCCTCGATTGCGGCCATGAAGTCCACGCCGAT



GAACAGGCAGCCTTGAATATTGCAAGGTCATGGCTTTTTCTAAACTCAAATTCAA



CAGAATTCAAAAGTTATAAATCGGGTAAACAGCCCTTCGTTGGTGCTTGGCAGGC



CTTTTACAAAAGGAGGCTTAAAGAGGTATGGAAGCCCAACGCC





SEQ ID NO: 40
ATGAAAAGGATAAATAAAATACGAAGGAGATTGGTAAAGGATAGCAACACGAAAA



AAGCCGGCAAAACCGGCCCTATGAAAACCTTGCTCGTTCGGGTTATGACACCTGA



CCTGAGAGAAAGGTTAGAGAATCTTCGCAAAAAGCCGGAAAACATTCCTCAGCCC



ATTTCAAATACTTCACGTGCAAATTTAAATAAACTCCTCACTGACTATACGGAAA



TGAAGAAAGCAATCCTGCATGTTTATTGGGAAGAGTTCCAAAAAGACCCTGTCGG



ATTGATGAGCAGGGTTGCACAACCAGCGCCCAAGAATATTGATCAGAGAAAATTG



ATTCCGGTGAAGGACGGAAATGAGAGACTAACAAGTTCTGGATTTGCCTGTTCTC



AGTGCTGTCAACCCCTCTATGTTTATAAGCTTGAACAAGTGAATGACAAGGGTAA



GCCCCATACAAATTACTTTGGCCGTTGTAATGTCTCCGAGCATGAACGTTTGATA



TTGCTCTCGCCGCATAAACCGGAGGCAAATGACGAGCTAGTAACGTATTCGTTGG



GGAAGTTCGGTCAAAGGGCATTGGACTTTTATTCAATCCACGTAACAAGAGAATC



GAACCATCCTGTAAAGCCGCTAGAACAGATCGGTGGCAATAGCTGCGCAAGTGGT



CCCGTTGGTAAGGCTTTATCTGATGCCTGTATGGGAGCAGTAGCCAGTTTCCTTA



CAAAGTACCAGGACATCATCCTCGAACACCAAAAGGTTATAAAAAAAAACGAAAA



GAGATTGGCAAATCTAAAGGATATAGCAAGTGCAAACGGGCTTGCATTTCCTAAA



ATCACTCTTCCACCGCAACCGCATACAAAAGAAGGGATTGAAGCTTATAACAATG



TTGTTGCTCAGATAGTGATCTGGGTAAACCTGAATCTTTGGCAGAAACTCAAAAT



TGGCAGGGATGAGGCAAAGCCCTTACAGCGGCTTAAGGGTTTTCCGTCCTTCCCT



CTTGTTGAACGCCAGGCGAATGAGGTTGATTGGTGGGATATGGTCTGTAATGTCA



AAAAGTTGATTAACGAAAAGAAAGAGGACGGGAAGGTCTTCTGGCAAAATCTTGC



TGGATATAAAAGGCAGGAAGCCTTGCTTCCATATCTTTCGTCTGAAGAAGACCGT



AAAAAAGGAAAAAAGTTTGCGCGTTATCAGTTTGGTGACCTTTTGCTTCACCTTG



AAAAGAAACACGGTGAAGATTGGGGCAAAGTTTATGATGAGGCATGGGAAAGAAT



AGATAAAAAAGTTGAAGGTCTGAGTAAGCACATAAAGTTGGAGGAAGAAAGAAGG



TCTGAAGATGCTCAATCAAAGGCTGCCCTCACTGATTGGCTCAGGGCAAAGGCCT



CTTTTGTTATTGAAGGGCTCAAAGAAGCTGATAAGGATGAGTTTTGCAGGTGTGA



GTTAAAGCTTCAAAAGTGGTATGGAGATTTGAGAGGAAAACCATTTGCTATAGAA



GCAGAGAACAGCATTTTAGATATAAGCGGATTTTCTAAACAGTATAATTGTGCAT



TTATATGGCAGAAAGACGGCGTAAAGAAGTTAAATCTTTATTTAATAATAAATTA



CTTCAAAGGTGGTAAGCTACGCTTCAAAAAAATCAAGCCAGAAGCTTTTGAAGCA



AATAGGTTTTATACAGTAATTAATAAAAAAAGCGGTGAGATTGTGCCTATGGAGG



TCAACTTCAATTTTGATGACCCGAATTTGATAATTCTGCCTTTGGCCTTTGGAAA



AAGGCAGGGGAGGGAGTTTATCTGGAACGACCTATTGAGCCTTGAGACGGGTTCA



TTGAAACTCGCCAATGGCAGGGTTATTGAAAAAACGCTCTATAACAGAAGGACGA



GACAGGATGAACCAGCACTTTTTGTTGCCCTGACATTTGAAAGAAGAGAGGTGCT



TGACTCATCGAATATAAAACCGATGAATCTGATAGGAATAGACCGGGGAGAAAAT



ATCCCGGCAGTCATAGCATTAACAGACCCGGAAGGATGCCCCTTGTCAAGATTCA



AAGATTCATTGGGCAATCCAACGCATATTTTGCGAATAGGAGAAAGTTATAAGGA



AAAACAACGGACTATTCAGGCTGCTAAAGAAGTTGAACAAAGGCGGGCAGGCGGA



TATTCGAGAAAATATGCATCAAAGGCGAAGAATCTGGCGGACGATATGGTAAGAA



ATACAGCTCGTGACCTCTTATATTATGCTGTTACTCAAGATGCAATGCTCATTTT



TGAAAATCTTTCCCGCGGTTTTGGTAGACAAGGCAAGAGGACTTTTATGGCGGAA



AGGCAGTACACGAGGATGGAAGACTGGCTGACTGCAAAGCTTGCCTATGAAGGTC



TGCCATCAAAAACCTATCTTTCAAAGACTCTGGCACAGTATACCTCAAAGACATG



TTCTAATTGTGGTTTTACAATCACAAGTGCAGATTATGACAGGGTGCTCGAAAAG



CTCAAGAAGACGGCTACTGGATGGATGACTACAATCAATGGAAAAGAGTTAAAAG



TTGAAGGACAGATAACATACTATAACCGGTATAAAAGGCAGAATGTGGTAAAAGA



CCTCTCTGTAGAGCTGGATAGACTTTCGGAAGAGTCGGTAAATAATGATATTTCT



AGTTGGACAAAAGGCCGCAGTGGTGAAGCTTTATCTCTGCTAAAAAAGAGATTTA



GTCACAGGCCGGTGCAGGAAAAGTTTGTTTGCCTGAACTGTGGTTTTGAAACCCA



TGCAGACGAACAAGCAGCACTGAATATTGCAAGGTCGTGGCTCTTTCTCCGTTCT



CAAGAATATAAGAAGTATCAAACCAATAAAACGACCGGAAATACTGACAAAAGGG



CATTTGTTGAAACATGGCAATCCTTTTACAGAAAGAAGCTCAAAGAAGTATGGAA



ACCA





SEQ ID NO: 41
ATGGGTAAAATGTATTACCTTGGTTTAGACATTGGCACGAATTCCGTGGGCTACG



CGGTGACCGACCCCTCATACCACCTGCTGAAGTTTAAGGGGGAACCAATGTGGGG



TGCGCACGTATTTGCCGCCGGTAATCAGAGCGCGGAACGACGCTCGTTCCGCACA



TCGCGTCGTCGTTTGGACCGACGCCAACAGCGCGTTAAACTGGTACAGGAGATTT



TTGCCCCGGTGATTAGTCCGATCGACCCACGCTTCTTCATTCGTCTGCATGAATC



CGCCCTGTGGCGCGATGACGTCGCGGAGACGGATAAACATATCTTTTTCAATGAT



CCTACCTATACCGATAAGGAATATTATAGCGATTACCCGACTATCCATCACCTGA



TCGTTGATCTGATGGAAAGCTCTGAGAAACACGATCCGCGGCTGGTGTACCTTGC



AGTGGCGTGGTTAGTGGCACACCGTGGTCATTTTCTGAACGAGGTGGACAAGGAT



AATATTGGAGATGTGTTGTCGTTCGACGCATTTTATCCGGAGTTTCTCGCGTTCC



TGTCGGACAACGGTGTATCACCGTGGGTGTGCGAAAGCAAAGCGCTGCAGGCGAC



CTTGCTGAGCCGTAACTCAGTGAACGACAAATATAAAGCCCTTAAGTCTCTGATC



TTCGGATCCCAGAAACCTGAAGATAACTTCGATGCCAATATTTCGGAAGATGGAC



TCATTCAACTGCTGGCCGGCAAAAAGGTAAAAGTTAACAAACTGTTCCCTCAGGA



ATCGAACGATGCATCCTTCACATTGAATGATAAAGAAGACGCGATAGAAGAAATC



CTGGGTACGCTTACACCAGATGAATGTGAATGGATTGCGCATATACGCCGCCTTT



TTGACTGGGCTATCATGAAACATGCTCTGAAAGATGGCAGGACTATTAGCGAGTC



AAAAGTCAAACTGTATGAGCAGCACCATCACGATCTGACCCAACTTAAATACTTC



GTGAAAACCTACCTTGCAAAAGAATACGACGATATTTTCCGCAACGTGGATAGCG



AAACAACGAAAAACTATGTAGCGTATTCCTATCATGTGAAAGAGGTGAAAGGCAC



TCTGCCTAAAAATAAGGCAACGCAAGAAGAGTTTTGTAAGTATGTCCTGGGCAAG



GTTAAAAACATTGAATGCTCTGAAGCAGACAAGGTTGACTTTGATGAGATGATTC



AGCGTCTTACCGACAACTCTTTTATGCCTAAGCAGGTTTCGGGCGAAAACCGCGT



TATTCCTTATCAGTTATATTATTATGAACTGAAGACAATTCTGAATAAAGCAGCC



TCGTACCTGCCTTTCCTGACGCAGTGTGGAAAAGATGCAATTTCGAACCAGGACA



AACTACTGTCGATCATGACGTTCCGTATTCCTTACTTCGTCGGACCCTTGCGAAA



AGATAATTCGGAACATGCATGGCTCGAACGAAAGGCCGGTAAGATTTATCCGTGG



AACTTTAACGACAAAGTGGACTTGGATAAATCAGAAGAAGCGTTCATTCGCCGAA



TGACCAATACCTGTACCTATTATCCCGGCGAAGATGTTTTACCGTTGGATTCGCT



GATCTATGAGAAATTTATGATTTTAAATGAAATCAATAATATTCGTATTGACGGC



TACCCGATTAGTGTTGACGTTAAACAGCAGGTTTTTGGCTTGTTCGAAAAAAAAC



GACGCGTAACCGTGAAAGATATTCAGAACCTGCTGCTGTCTCTCGGAGCTCTGGA



CAAACACGGGAAGCTGACAGGCATCGATACCACTATCCACTCAAACTATAATACG



TATCACCATTTTAAATCTCTCATGGAACGCGGCGTCCTGACCCGGGATGACGTGG



AACGCATCGTTGAAAGGATGACCTACAGCGACGATACTAAGCGTGTGCGTCTGTG



GCTGAATAACAACTATGGTACTTTAACCGCCGACGATGTGAAACACATTTCGCGT



CTGCGCAAACACGATTTTGGCCGTTTATCCAAAATGTTCTTAACAGGTCTGAAGG



GTGTCCATAAGGAGACCGGTGAACGTGCCTCCATACTGGATTTCATGTGGAACAC



GAACGATAACCTGATGCAGCTCCTTTCCGAATGCTACACGTTCAGTGATGAAATC



ACAAAGCTGCAAGAGGCGTATTATGCAAAAGCCCAGTTGTCTTTAAACGATTTTT



TAGACTCGATGTACATCTCTAACGCGGTGAAACGTCCGATTTACAGAACTCTGGC



AGTGGTGAACGATATTCGAAAAGCATGTGGGACGGCCCCTAAACGCATTTTCATC



GAAATGGCTCGTGATGGTGAATCAAAAAAAAAGAGAAGTGTTACACGTCGCGAGC



AGATCAAAAACCTGTACCGCTCGATTCGTAAAGATTTCCAGCAGGAAGTTGATTT



TCTGGAAAAGATCCTGGAAAATAAATCTGATGGTCAACTTCAGTCAGATGCTTTG



TATCTTTACTTTGCACAATTAGGGCGCGATATGTACACGGGCGATCCAATAAAGC



TGGAGCACATCAAAGATCAGAGTTTCTATAACATAGACCATATTTACCCGCAGTC



TATGGTGAAAGACGATTCCCTAGATAACAAAGTGCTGGTGCAAAGCGAAATTAAC



GGCGAGAAAAGCTCGCGATACCCTTTGGACGCCGCGATCCGCAATAAAATGAAGC



CCCTTTGGGACGCTTACTATAATCATGGCCTGATCTCCTTAAAGAAATACCAGCG



TCTAACGCGCTCGACCCCGTTTACCGATGATGAAAAATGGGACTTTATTAATCGC



CAGTTAGTGGAAACCCGTCAATCTACCAAAGCGCTGGCCATTTTGTTGAAGCGTA



AGTTTCCAGACACCGAAATTGTGTATTCGAAGGCGGGGTTATCGTCCGACTTCAG



ACATGAATTCGGCCTTGTAAAAAGTCGCAATATTAATGATTTGCACCACGCTAAA



GACGCATTCTTGGCTATCGTTACCGGCAATGTGTACCATGAAAGATTCAATCGCA



GATGGTTTATGGTGAACCAGCCGTACTCAGTTAAAACTAAAACTCTTTTTACCCA



CAGCATAAAGAATGGCAACTTCGTTGCCTGGAACGGCGAAGAAGATCTCGGTCGT



ATTGTAAAAATGCTGAAGCAAAACAAAAATACCATTCACTTCACGCGCTTCTCCT



TCGATCGCAAAGAAGGATTATTTGATATCCAACCTCTGAAAGCCAGCACCGGCTT



AGTCCCACGAAAAGCCGGTCTGGATGTCGTTAAATACGGCGGATATGACAAATCT



ACCGCGGCCTATTACCTGCTGGTGAGGTTCACGCTCGAGGACAAGAAAACCCAGC



ACAAGCTGATGATGATTCCTGTAGAAGGCCTGTACAAGGCTCGCATTGATCATGA



CAAGGAATTTCTTACCGATTATGCGCAAACGACTATAAGCGAAATCCTACAGAAA



GATAAACAGAAAGTGATCAATATTATGTTTCCAATGGGTACGAGGCATATAAAAC



TCAATTCAATGATTAGTATCGATGGCTTCTATCTTAGTATCGGCGGAAAGTCCTC



TAAAGGTAAGTCAGTTCTATGTCACGCAATGGTTCCACTGATCGTCCCTCACAAA



ATCGAATGTTACATTAAAGCAATGGAAAGCTTCGCCCGGAAGTTTAAAGAAAACA



ACAAGCTGCGCATCGTAGAAAAATTCGATAAAATCACCGTTGAAGACAACCTGAA



TCTCTACGAGCTCTTTCTCCAAAAACTGCAGCATAATCCCTATAATAAGTTTTTT



TCGACACAGTTTGACGTACTGACGAACGGCCGTTCTACTTTCACAAAACTGTCGC



CGGAGGAACAGGTACAGACGCTCTTGAACATTTTAAGTATCTTTAAAACATGCCG



CAGTTCGGGTTGCGACCTGAAATCCATCAACGGCAGTGCCCAGGCAGCGCGCATC



ATGATTAGCGCTGACTTAACTGGACTGTCGAAAAAATATTCAGATATTAGGTTGG



TTGAACAGTCAGCTTCTGGTTTGTTCGTATCCAAAAGTCAGAACTTACTGGAGTA



TCTCTAA





SEQ ID NO: 42
ATGTCATCGCTCACGAAATTCACTAACAAATACTCTAAACAGCTCACCATTAAGA



ATGAACTCATCCCAGTTGGCAAAACACTGGAGAACATCAAAGAGAATGGTCTGAT



AGATGGCGACGAACAGCTGAATGAGAATTATCAGAAGGCGAAAATTATTGTGGAT



GATTTTCTGCGGGACTTCATTAATAAAGCACTGAATAATACGCAGATCGGGAACT



GGCGCGAACTGGCGGATGCCCTTAATAAAGAGGATGAAGATAACATCGAGAAATT



GCAGGATAAAATTCGGGGAATCATTGTATCCAAATTTGAAACGTTTGATCTGTTT



AGCAGCTATTCTATTAAGAAAGATGAAAAGATTATTGACGACGACAATGATGTTG



AAGAAGAGGAACTGGATCTGGGCAAGAAGACCAGCTCATTTAAATACATATTTAA



AAAAAACCTGTTTAAGTTAGTGTTGCCATCCTACCTGAAAACCACAAACCAGGAC



AAGCTGAAGATTATTAGCTCGTTTGATAATTTTTCAACGTACTTCCGCGGGTTCT



TTGAAAACCGGAAAAACATTTTTACCAAGAAACCGATCTCCACAAGTATTGCGTA



TCGCATTGTTCATGATAACTTCCCGAAATTCCTTGATAACATTCGTTGTTTTAAT



GTGTGGCAGACGGAATGCCCGCAACTAATCGTGAAAGCAGATAACTATCTGAAAA



GCAAAAATGTTATAGCGAAAGATAAAAGTTTGGCAAACTATTTTACCGTGGGCGC



GTATGACTATTTCCTGTCTCAGAATGGTATAGATTTTTACAACAATATTATAGGT



GGACTGCCAGCGTTCGCCGGCCATGAGAAAATCCAAGGTCTCAATGAATTCATCA



ATCAAGAGTGCCAAAAAGACAGCGAGCTGAAAAGTAAGCTGAAAAACCGTCACGC



GTTCAAAATGGCGGTACTGTTCAAACAGATACTCAGCGATCGTGAAAAAAGTTTT



GTAATTGATGAGTTCGAGTCGGATGCTCAAGTTATTGACGCCGTTAAAAACTTTT



ACGCCGAACAGTGCAAAGATAACAATGTTATTTTTAACTTATTAAATCTTATCAA



GAATATCGCTTTCTTAAGTGATGACGAACTGGACGGCATATTCATTGAAGGGAAA



TACCTGTCGAGCGTTAGTCAAAAACTCTATAGCGATTGGTCAAAATTACGTAACG



ACATTGAGGATTCGGCTAACTCTAAACAAGGCAATAAAGAGCTGGCCAAGAAGAT



CAAAACCAACAAAGGGGATGTAGAAAAAGCGATCTCGAAATATGAGTTCTCGCTG



TCGGAACTGAACTCGATTGTACATGATAACACCAAGTTTTCTGACCTCCTTAGTT



GTACACTGCATAAGGTGGCTTCTGAGAAACTGGTGAAGGTCAATGAAGGCGACTG



GCCGAAACATCTCAAGAATAATGAAGAGAAACAAAAAATCAAAGAGCCGCTTGAT



GCTCTGCTGGAGATCTATAATACACTTCTGATTTTTAACTGCAAAAGCTTCAATA



AAAACGGCAACTTCTATGTCGACTATGATCGTTGCATCAATGAACTGAGTTCGGT



CGTGTATCTGTATAATAAAACACGTAACTATTGCACTAAAAAACCCTATAACACG



GACAAGTTCAAACTCAATTTTAACAGTCCGCAGCTCGGTGAAGGCTTTTCCAAGT



CGAAAGAAAATGACTGTCTGACTCTTTTGTTTAAAAAAGACGACAACTATTATGT



AGGCATTATCCGCAAAGGTGCAAAAATCAATTTTGATGATACACAAGCAATCGCC



GATAACACCGACAATTGCATCTTTAAAATGAATTATTTCCTACTTAAAGACGCAA



AAAAATTTATCCCGAAATGTAGCATTCAGCTGAAAGAAGTCAAGGCCCATTTTAA



GAAATCTGAAGATGATTACATTTTGTCTGATAAAGAGAAATTTGCTAGCCCGCTG



GTCATTAAAAAGAGCACATTTTTGCTGGCAACTGCACATGTGAAAGGGAAAAAAG



GCAATATCAAGAAATTTCAGAAAGAATATTCGAAAGAAAACCCCACTGAGTATCG



CAATTCTTTAAACGAATGGATTGCTTTTTGTAAAGAGTTCTTAAAAACTTATAAA



GCGGCTACCATTTTTGATATAACCACATTGAAAAAGGCAGAGGAATATGCTGATA



TTGTAGAATTCTACAAGGATGTCGATAATCTGTGCTACAAACTGGAGTTCTGCCC



GATTAAAACCTCGTTTATAGAAAACCTGATAGATAACGGCGACCTGTATCTGTTT



CGCATCAATAACAAAGACTTCAGCAGTAAATCGACCGGCACCAAGAACCTTCATA



CGTTATATTTACAAGCTATATTCGATGAACGTAATCTGAACAATCCGACAATTAT



GCTGAATGGGGGAGCAGAACTGTTCTATCGTAAAGAAAGTATTGAGCAGAAAAAC



CGTATCACACACAAAGCCGGTTCAATTCTCGTGAATAAGGTGTGTAAAGACGGTA



CAAGCCTGGATGATAAGATACGTAATGAAATTTATCAATATGAGAATAAATTTAT



TGATACCCTGTCTGATGAAGCTAAAAAGGTGTTACCGAATGTCATTAAAAAGGAA



GCTACCCATGACATTACAAAAGATAAACGTTTCACTAGTGACAAATTCTTCTTTC



ACTGCCCCCTGACAATTAATTATAAGGAAGGCGATACCAAGCAGTTCAATAACGA



AGTGCTGAGTTTTCTGCGTGGAAATCCTGACATCAACATTATCGGCATTGACCGC



GGAGAGCGTAATTTAATCTATGTAACGGTTATAAACCAGAAAGGCGAGATTCTGG



ATTCGGTTTCATTCAATACCGTGACCAACAAGAGTTCAAAAATCGAGCAGACAGT



CGATTATGAAGAGAAATTGGCAGTCCGCGAGAAAGAGAGGATTGAAGCAAAACGT



TCCTGGGACTCTATCTCAAAAATTGCGACACTAAAGGAAGGTTATCTGAGCGCAA



TAGTTCACGAGATCTGTCTGTTAATGATTAAACACAACGCGATCGTTGTCTTAGA



GAATCTTAATGCAGGCTTTAAGCGTATTCGTGGCGGTTTATCAGAAAAAAGTGTT



TATCAAAAATTCGAAAAAATGTTGATTAACAAACTGAACTATTTTGTCAGCAAGA



AGGAATCCGACTGGAATAAACCGTCTGGTCTGCTGAATGGACTGCAGCTTTCGGA



TCAGTTTGAAAGCTTCGAAAAACTGGGTATTCAGTCTGGTTTTATTTTTTACGTG



CCGGCTGCATATACCTCAAAGATTGATCCGACCACGGGCTTCGCCAATGTTCTGA



ATCTGTCGAAGGTACGCAATGTTGATGCGATCAAAAGCTTTTTTTCTAACTTCAA



CGAAATTAGTTATAGCAAGAAAGAAGCCCTTTTCAAATTCTCATTCGATCTGGAT



TCACTGAGTAAGAAAGGCTTTAGTAGCTTTGTGAAATTTAGTAAGAGTAAATGGA



ACGTCTACACCTTTGGAGAACGTATCATAAAGCCAAAGAATAAGCAAGGTTATCG



GGAGGACAAAAGAATCAACTTGACCTTCGAGATGAAGAAGTTACTTAACGAGTAT



AAGGTTTCTTTTGATCTTGAAAATAACTTGATTCCGAATCTCACGAGTGCCAACC



TGAAGGATACTTTTTGGAAAGAGCTATTCTTTATCTTCAAGACTACGCTGCAGCT



CCGTAACAGCGTTACTAACGGTAAAGAAGATGTGCTCATCTCTCCGGTCAAAAAT



GCGAAGGGTGAATTCTTCGTTTCGGGAACGCATAACAAGACTCTTCCGCAAGATT



GCGATGCGAACGGTGCATACCATATTGCGTTGAAAGGTCTGATGATACTCGAACG



TAACAACCTTGTACGTGAGGAGAAAGATACGAAAAAGATTATGGCGATTTCAAAC



GTGGATTGGTTCGAGTACGTGCAGAAACGTAGAGGCGTTCTGTAA





SEQ ID NO: 43
ATGAACAACTACGACGAATTCACCAAACTGTACCCGATCCAGAAAACCATCCGTT



TCGAACTGAAACCGCAGGGTCGTACCATGGAACACCTGGAAACCTTCAACTTCTT



CGAAGAAGACCGTGACCGTGCGGAAAAATACAAAATCCTGAAAGAAGCGATCGAC



GAATACCACAAAAAATTCATCGACGAACACCTGACCAACATGTCTCTGGACTGGA



ACTCTCTGAAACAGATCTCTGAAAAATACTACAAATCTCGTGAAGAAAAAGACAA



AAAAGTTTTCCTGTCTGAACAGAAACGTATGCGTCAGGAAATCGTTTCTGAATTC



AAAAAAGACGACCGTTTCAAAGACCTGTTCTCTAAAAAACTGTTCTCTGAACTGC



TGAAAGAAGAAATCTACAAAAAAGGTAACCACCAGGAAATCGACGCGCTGAAATC



TTTCGACAAATTCTCTGGTTACTTCATCGGTCTGCACGAAAACCGTAAAAACATG



TACTCTGACGGTGACGAAATCACCGCGATCTCTAACCGTATCGTTAACGAAAACT



TCCCGAAATTCCTGGACAACCTGCAGAAATACCAGGAAGCGCGTAAAAAATACCC



GGAATGGATCATCAAAGCGGAATCTGCGCTGGTTGCGCACAACATCAAAATGGAC



GAAGTTTTCTCTCTGGAATACTTCAACAAAGTTCTGAACCAGGAAGGTATCCAGC



GTTACAACCTGGCGCTGGGTGGTTACGTTACCAAATCTGGTGAAAAAATGATGGG



TCTGAACGACGCGCTGAACCTGGCGCACCAGTCTGAAAAATCTTCTAAAGGTCGT



ATCCACATGACCCCGCTGTTCAAACAGATCCTGTCTGAAAAAGAATCTTTCTCTT



ACATCCCGGACGTTTTCACCGAAGACTCTCAGCTGCTGCCGTCTATCGGTGGTTT



CTTCGCGCAGATCGAAAACGACAAAGACGGTAACATCTTCGACCGTGCGCTGGAA



CTGATCTCTTCTTACGCGGAATACGACACCGAACGTATCTACATCCGTCAGGCGG



ACATCAACCGTGTTTCTAACGTTATCTTCGGTGAATGGGGTACCCTGGGTGGTCT



GATGCGTGAATACAAAGCGGACTCTATCAACGACATCAACCTGGAACGTACCTGC



AAAAAAGTTGACAAATGGCTGGACTCTAAAGAATTCGCGCTGTCTGACGTTCTGG



AAGCGATCAAACGTACCGGTAACAACGACGCGTTCAACGAATACATCTCTAAAAT



GCGTACCGCGCGTGAAAAAATCGACGCGGCGCGTAAAGAAATGAAATTCATCTCT



GAAAAAATCTCTGGTGACGAAGAATCTATCCACATCATCAAAACCCTGCTGGACT



CTGTTCAGCAGTTCCTGCACTTCTTCAACCTGTTCAAAGCGCGTCAGGACATCCC



GCTGGACGGTGCGTTCTACGCGGAATTCGACGAAGTTCACTCTAAACTGTTCGCG



ATCGTTCCGCTGTACAACAAAGTTCGTAACTACCTGACCAAAAACAACCTGAACA



CCAAAAAAATCAAACTGAACTTCAAAAACCCGACCCTGGCGAACGGTTGGGACCA



GAACAAAGTTTACGACTACGCGTCTCTGATCTTCCTGCGTGACGGTAACTACTAC



CTGGGTATCATCAACCCGAAACGTAAAAAAAACATCAAATTCGAACAGGGTTCTG



GTAACGGTCCGTTCTACCGTAAAATGGTTTACAAACAGATCCCGGGTCCGAACAA



AAACCTGCCGCGTGTTTTCCTGACCTCTACCAAAGGTAAAAAAGAATACAAACCG



TCTAAAGAAATCATCGAAGGTTACGAAGCGGACAAACACATCCGTGGTGACAAAT



TCGACCTGGACTTCTGCCACAAACTGATCGACTTCTTCAAAGAATCTATCGAAAA



ACACAAAGACTGGTCTAAATTCAACTTCTACTTCTCTCCGACCGAATCTTACGGT



GACATCTCTGAATTCTACCTGGACGTTGAAAAACAGGGTTACCGTATGCACTTCG



AAAACATCTCTGCGGAAACCATCGACGAATACGTTGAAAAAGGTGACCTGTTCCT



GTTCCAGATCTACAACAAAGACTTCGTTAAAGCGGCGACCGGTAAAAAAGACATG



CACACCATCTACTGGAACGCGGCGTTCTCTCCGGAAAACCTGCAGGACGTTGTTG



TTAAACTGAACGGTGAAGCGGAACTGTTCTACCGTGACAAATCTGACATCAAAGA



AATCGTTCACCGTGAAGGTGAAATCCTGGTTAACCGTACCTACAACGGTCGTACC



CCGGTTCCGGACAAAATCCACAAAAAACTGACCGACTACCACAACGGTCGTACCA



AAGACCTGGGTGAAGCGAAAGAATACCTGGACAAAGTTCGTTACTTCAAAGCGCA



CTACGACATCACCAAAGACCGTCGTTACCTGAACGACAAAATCTACTTCCACGTT



CCGCTGACCCTGAACTTCAAAGCGAACGGTAAAAAAAACCTGAACAAAATGGTTA



TCGAAAAATTCCTGTCTGACGAAAAAGCGCACATCATCGGTATCGACCGTGGTGA



ACGTAACCTGCTGTACTACTCTATCATCGACCGTTCTGGTAAAATCATCGACCAG



CAGTCTCTGAACGTTATCGACGGTTTCGACTACCGTGAAAAACTGAACCAGCGTG



AAATCGAAATGAAAGACGCGCGTCAGTCTTGGAACGCGATCGGTAAAATCAAAGA



CCTGAAAGAAGGTTACCTGTCTAAAGCGGTTCACGAAATCACCAAAATGGCGATC



CAGTACAACGCGATCGTTGTTATGGAAGAACTGAACTACGGTTTCAAACGTGGTC



GTTTCAAAGTTGAAAAACAGATCTACCAGAAATTCGAAAACATGCTGATCGACAA



AATGAACTACCTGGTTTTCAAAGACGCGCCGGACGAATCTCCGGGTGGTGTTCTG



AACGCGTACCAGCTGACCAACCCGCTGGAATCTTTCGCGAAACTGGGTAAACAGA



CCGGTATCCTGTTCTACGTTCCGGCGGCGTACACCTCTAAAATCGACCCGACCAC



CGGTTTCGTTAACCTGTTCAACACCTCTTCTAAAACCAACGCGCAGGAACGTAAA



GAATTCCTGCAGAAATTCGAATCTATCTCTTACTCTGCGAAAGACGGTGGTATCT



TCGCGTTCGCGTTCGACTACCGTAAATTCGGTACCTCTAAAACCGACCACAAAAA



CGTTTGGACCGCGTACACCAACGGTGAACGTATGCGTTACATCAAAGAAAAAAAA



CGTAACGAACTGTTCGACCCGTCTAAAGAAATCAAAGAAGCGCTGACCTCTTCTG



GTATCAAATACGACGGTGGTCAGAACATCCTGCCGGACATCCTGCGTTCTAACAA



CAACGGTCTGATCTACACCATGTACTCTTCTTTCATCGCGGCGATCCAGATGCGT



GTTTACGACGGTAAAGAAGACTACATCATCTCTCCGATCAAAAACTCTAAAGGTG



AATTCTTCCGTACCGACCCGAAACGTCGTGAACTGCCGATCGACGCGGACGCGAA



CGGTGCGTACAACATCGCGCTGCGTGGTGAACTGACCATGCGTGCGATCGCGGAA



AAATTCGACCCGGACTCTGAAAAAATGGCGAAACTGGAACTGAAACACAAAGACT



GGTTCGAATTCATGCAGACCCGTGGTGACTAA





SEQ ID NO: 44
ATGACTAAAACATTTGATTCAGAGTTTTTTAATTTGTACTCGCTGCAAAAAACGG



TACGCTTTGAGTTAAAACCCGTGGGAGAAACCGCGTCATTTGTGGAAGACTTTAA



AAACGAGGGCTTGAAACGTGTTGTGAGCGAAGATGAAAGGCGAGCCGTCGATTAC



CAGAAAGTTAAGGAAATAATTGACGATTACCATCGGGATTTCATTGAAGAAAGTT



TAAATTATTTTCCGGAACAGGTGAGTAAAGATGCTCTTGAGCAGGCGTTTCATCT



TTATCAGAAACTGAAGGCAGCAAAAGTTGAGGAAAGGGAAAAAGCGCTGAAAGAA



TGGGAAGCGCTGCAGAAAAAGCTACGTGAAAAAGTGGTGAAATGCTTCTCGGACT



CGAATAAAGCCCGCTTCTCAAGGATTGATAAAAAGGAACTGATTAAGGAAGACCT



GATAAATTGGTTGGTCGCCCAGAATCGCGAGGATGATATCCCTACGGTCGAAACG



TTTAACAACTTCACCACATATTTTACCGGCTTCCATGAGAATCGTAAAAATATTT



ACTCCAAAGATGATCACGCCACCGCTATTAGCTTTCGCCTTATTCATGAAAATCT



TCCAAAGTTTTTTGACAACGTGATTAGCTTCAATAAGTTGAAAGAGGGTTTCCCT



GAATTAAAATTTGATAAAGTGAAAGAGGATTTAGAAGTAGATTATGATCTGAAGC



ATGCGTTTGAAATAGAATATTTCGTTAACTTCGTGACCCAAGCGGGCATAGATCA



GTATAATTATCTGTTAGGAGGGAAAACCCTGGAGGACGGGACGAAAAAACAAGGG



ATGAATGAGCAAATTAATCTGTTCAAACAACAGCAAACGCGAGATAAAGCGCGTC



AGATTCCCAAACTGATCCCCCTGTTCAAACAGATTCTTAGCGAAAGGACTGAAAG



CCAGTCCTTTATTCCTAAACAATTTGAAAGTGATCAGGAGTTGTTCGATTCACTG



CAGAAGTTACATAATAACTGCCAGGATAAATTCACCGTGCTGCAACAAGCCATTC



TCGGTCTGGCAGAGGCGGATCTTAAGAAGGTCTTCATCAAAACCTCTGATTTAAA



TGCCTTATCTAACACCATTTTCGGGAATTACAGCGTCTTTTCCGATGCACTGAAC



CTGTATAAAGAAAGCCTGAAAACGAAAAAAGCGCAGGAGGCTTTTGAGAAACTAC



CGGCCCATTCTATTCACGACCTCATTCAATACTTGGAACAGTTCAATTCCAGCCT



GGACGCGGAAAAACAACAGAGCACCGACACCGTCCTGAACTACTTCATCAAGACC



GATGAATTATATTCTCGCTTCATTAAATCCACTAGCGAGGCTTTCACTCAGGTGC



AGCCTTTGTTCGAACTGGAAGCCCTGTCATCTAAGCGCCGCCCACCGGAATCGGA



AGATGAAGGGGCAAAAGGGCAGGAAGGCTTCGAGCAGATCAAGCGTATTAAAGCT



TACCTGGATACGCTTATGGAAGCGGTACACTTTGCAAAGCCGTTGTATCTTGTTA



AGGGTCGTAAAATGATCGAAGGGCTCGATAAAGACCAGTCCTTTTATGAAGCGTT



TGAAATGGCGTACCAAGAACTTGAATCGTTAATCATTCCTATCTATAACAAAGCG



CGGAGCTATCTGTCGCGGAAACCTTTCAAGGCCGATAAATTCAAGATTAATTTTG



ACAACAACACGCTACTGAGCGGATGGGATGCGAACAAGGAAACTGCTAACGCGTC



CATTCTGTTTAAGAAAGACGGGTTATATTACCTTGGAATTATGCCGAAAGGTAAG



ACCTTTCTCTTTGACTACTTTGTATCGAGCGAGGATTCAGAGAAACTGAAACAGC



GTCGCCAGAAGACCGCCGAAGAAGCTCTGGCGCAGGATGGTGAAAGTTACTTCGA



AAAAATTCGTTATAAACTGTTACCAGGGGCTTCAAAGATGTTACCGAAAGTCTTT



TTTAGCAACAAAAATATTGGCTTTTACAACCCGTCGGATGACATTTTACGCATTC



GCAACACAGCCTCTCACACCAAAAACGGGACCCCTCAGAAAGGCCACTCAAAAGT



TGAGTTTAACCTGAATGATTGTCATAAGATGATTGATTTCTTCAAATCATCAATT



CAGAAACACCCGGAATGGGGGTCTTTTGGCTTTACGTTTTCTGATACCAGTGATT



TTGAAGACATGAGTGCCTTCTACCGGGAAGTAGAAAACCAGGGTTACGTAATTAG



CTTTGACAAAATCAAAGAGACCTATATACAGAGCCAGGTGGAACAGGGTAATCTC



TACTTATTCCAGATTTATAACAAGGATTTCTCGCCCTACAGCAAAGGCAAACCAA



ACCTGCATACTCTGTACTGGAAAGCCCTGTTTGAAGAAGCGAACCTGAATAACGT



AGTGGCGAAGTTGAACGGTGAAGCGGAAATCTTCTTCCGTCGTCACTCCATTAAG



GCCTCTGATAAAGTTGTCCATCCGGCAAATCAGGCCATTGATAATAAGAATCCAC



ACACGGAAAAAACGCAGTCAACCTTTGAATATGACCTCGTTAAAGACAAACGCTA



CACGCAAGATAAGTTCTTTTTCCACGTCCCAATCAGCCTCAACTTTAAAGCACAA



GGGGTTTCAAAGTTTAATGATAAAGTCAATGGGTTCCTCAAGGGCAACCCGGATG



TCAACATTATAGGTATAGACAGGGGCGAACGCCATCTGCTTTACTTTACCGTAGT



GAATCAGAAAGGTGAAATACTGGTTCAGGAATCATTAAATACCTTGATGTCGGAC



AAAGGGCACGTTAATGATTACCAGCAGAAACTGGATAAAAAAGAACAGGAACGTG



ATGCTGCGCGTAAATCGTGGACCACGGTTGAGAACATTAAAGAGCTGAAAGAGGG



GTATCTAAGCCATGTGGTACACAAACTGGCGCACCTCATCATTAAATATAACGCA



ATAGTCTGCCTAGAAGACTTGAATTTTGGCTTTAAACGCGGCCGCTTCAAAGTGG



AAAAACAAGTTTATCAAAAATTTGAAAAGGCGCTTATAGATAAACTGAATTATCT



GGTTTTTAAAGAAAAGGAACTTGGTGAGGTAGGGCACTACTTGACAGCTTATCAA



CTGACGGCCCCGTTCGAATCATTCAAAAAACTGGGCAAACAGTCTGGCATTCTGT



TTTACGTGCCGGCAGATTATACTTCAAAAATCGATCCAACAACTGGCTTTGTGAA



CTTCCTGGACCTGAGATATCAGTCTGTAGAAAAAGCTAAACAACTTCTTAGCGAT



TTTAATGCCATTCGTTTTAACAGCGTTCAGAATTACTTTGAATTCGAAATTGACT



ATAAAAAACTTACTCCGAAACGTAAAGTCGGAACCCAAAGTAAATGGGTAATTTG



TACGTATGGCGATGTCAGGTATCAGAACCGTCGGAATCAAAAAGGTCATTGGGAG



ACCGAAGAAGTGAACGTGACCGAAAAGCTGAAGGCTCTGTTCGCCAGCGATTCAA



AAACTACAACTGTGATCGATTACGCAAATGATGATAACCTGATAGATGTGATTTT



AGAGCAGGATAAAGCCAGCTTTTTTAAAGAACTGTTGTGGCTCCTGAAACTTACG



ATGACCTTACGACATTCCAAGATCAAATCGGAAGATGATTTTATTCTGTCACCGG



TCAAGAATGAGCAGGGTGAATTCTATGATAGTAGGAAAGCCGGCGAAGTGTGGCC



GAAAGACGCCGACGCCAATGGCGCCTATCATATCGCGCTCAAAGGGCTTTGGAAT



TTGCAGCAGATTAACCAGTGGGAAAAAGGTAAAACCCTGAATCTGGCTATCAAAA



ACCAGGATTGGTTTAGCTTTATCCAAGAGAAACCGTATCAGGAATGA





SEQ ID NO: 45
ATGCATACAGGCGGTCTTCTTAGTATGGACGCGAAAGAGTTCACAGGTCAGTATC



CGTTGTCGAAAACATTACGATTCGAACTTCGGCCCATCGGCCGCACGTGGGATAA



CCTGGAGGCCTCAGGCTACTTAGCGGAAGACCGCCATCGTGCCGAATGTTATCCT



CGTGCGAAAGAGTTATTGGATGACAACCATCGTGCCTTCCTGAATCGTGTGTTGC



CACAAATCGATATGGATTGGCACCCGATTGCGGAGGCCTTTTGTAAGGTACATAA



AAACCCTGGTAATAAAGAACTTGCCCAGGATTACAACCTTCAGTTGTCAAAGCGC



CGTAAGGAGATCAGCGCATATCTTCAGGATGCAGATGGCTATAAAGGCCTGTTCG



CGAAGCCCGCCTTAGACGAAGCTATGAAAATTGCGAAAGAAAACGGGAACGAAAG



TGATATTGAGGTTCTCGAAGCGTTTAACGGTTTTAGCGTATACTTCACCGGTTAT



CATGAGTCACGCGAGAACATTTATAGCGATGAGGATATGGTGAGCGTAGCCTACC



GAATTACTGAGGATAATTTCCCGCGCTTTGTCTCAAACGCTTTGATCTTTGATAA



ATTAAACGAAAGCCATCCGGATATTATCTCTGAAGTATCGGGCAATCTTGGAGTT



GATGACATTGGTAAGTACTTTGACGTGTCGAACTATAACAATTTTCTTTCCCAGG



CCGGTATAGATGACTACAATCACATTATTGGCGGCCATACAACCGAAGACGGACT



GATACAAGCGTTTAATGTCGTATTGAACTTACGTCACCAAAAAGACCCTGGCTTT



GAAAAAATTCAGTTCAAACAGCTCTACAAACAAATCCTGAGCGTGCGTACCAGCA



AAAGCTACATCCCGAAACAGTTTGACAACTCTAAGGAGATGGTTGACTGCATTTG



CGATTATGTCAGCAAAATAGAGAAATCCGAAACAGTAGAACGGGCCCTGAAACTA



GTCCGTAATATCAGTTCTTTCGACTTGCGCGGGATCTTTGTCAATAAAAAGAACT



TGCGCATACTGAGCAACAAACTGATAGGAGATTGGGACGCGATCGAAACCGCATT



GATGCATAGTTCTTCATCAGAAAACGATAAGAAAAGCGTATATGATAGCGCGGAG



GCTTTTACGTTGGATGACATCTTTTCAAGCGTGAAAAAATTTTCTGATGCCTCTG



CCGAAGATATTGGCAACAGGGCGGAAGACATCTGTAGAGTGATAAGTGAGACGGC



CCCTTTTATCAACGATCTGCGAGCGGTGGACCTGGATAGCCTGAACGACGATGGT



TATGAAGCGGCCGTCTCAAAAATTCGGGAGTCGCTGGAGCCTTATATGGATCTTT



TCCATGAACTGGAAATTTTCTCGGTTGGCGATGAGTTCCCAAAATGCGCAGCATT



TTACAGCGAACTGGAGGAAGTCAGCGAACAGCTGATCGAAATTATTCCGTTATTC



AACAAGGCGCGTTCGTTCTGCACCCGGAAACGCTATAGCACCGATAAGATTAAAG



TGAACTTAAAATTCCCGACCTTGGCGGACGGGTGGGACCTGAACAAAGAGAGAGA



CAACAAAGCCGCGATTCTGCGGAAAGACGGTAAGTATTATCTGGCAATTCTGGAT



ATGAAGAAAGATCTGTCAAGCATTAGGACCAGCGACGAAGATGAATCCAGCTTCG



AAAAGATGGAGTATAAACTGTTACCGAGTCCAGTAAAAATGCTGCCAAAGATATT



CGTAAAATCGAAAGCCGCTAAGGAAAAATATGGCCTGACAGATCGTATGCTTGAA



TGCTACGATAAAGGTATGCATAAGTCGGGTAGTGCGTTTGATCTTGGCTTTTGCC



ATGAACTCATTGATTATTACAAGCGTTGTATCGCGGAGTACCCAGGCTGGGATGT



GTTCGATTTCAAGTTTCGCGAAACTTCCGATTATGGGTCCATGAAAGAGTTCAAT



GAAGATGTGGCCGGAGCCGGTTACTATATGAGTCTGAGAAAAATTCCGTGCAGCG



AAGTGTACCGTCTGTTAGACGAGAAATCGATTTATCTATTTCAAATTTATAACAA



AGATTACTCTGAAAATGCACATGGTAATAAGAACATGCATACCATGTACTGGGAG



GGTCTCTTTTCCCCGCAAAACCTGGAGTCGCCCGTTTTCAAGTTGTCGGGTGGGG



CAGAACTTTTCTTTCGAAAATCCTCAATCCCTAACGATGCCAAAACAGTACACCC



GAAAGGCTCAGTGCTGGTTCCACGTAATGATGTTAACGGTCGGCGTATTCCAGAT



TCAATCTACCGCGAACTGACACGCTATTTTAACCGTGGCGATTGCCGAATCAGTG



ACGAAGCCAAAAGTTATCTTGACAAGGTTAAGACTAAAAAAGCGGACCATGACAT



TGTGAAAGATCGCCGCTTTACCGTGGATAAAATGATGTTCCACGTCCCGATTGCG



ATGAACTTTAAGGCGATCAGTAAACCGAACTTAAACAAAAAAGTCATTGATGGCA



TCATTGATGATCAGGATCTGAAAATCATTGGTATTGATCGTGGCGAGCGGAACTT



AATTTACGTCACGATGGTTGACAGAAAAGGGAATATCTTATATCAGGATTCTCTT



AACATCCTCAATGGCTACGACTATCGTAAAGCTCTGGATGTGCGCGAATATGACA



ACAAGGAAGCGCGTCGTAACTGGACTAAAGTGGAGGGCATTCGCAAAATGAAGGA



AGGCTATCTGTCATTAGCGGTCTCGAAATTAGCGGATATGATTATCGAAAATAAC



GCCATCATCGTTATGGAGGACCTGAACCACGGATTCAAAGCGGGCCGCTCAAAGA



TTGAAAAACAAGTTTATCAGAAATTTGAGAGTATGCTGATTAACAAACTGGGCTA



TATGGTGTTAAAAGACAAGTCAATTGACCAATCAGGTGGCGCGCTGCATGGATAC



CAGCTGGCGAACCATGTTACCACCTTAGCATCAGTTGGAAAGCAGTGTGGGGTTA



TCTTTTATATACCGGCAGCGTTCACTAGTAAAATAGATCCGACCACTGGTTTCGC



CGATCTCTTTGCCCTGAGTAACGTTAAAAACGTAGCGAGCATGCGTGAATTCTTT



TCCAAAATGAAATCTGTCATTTATGATAAAGCTGAAGGCAAATTCGCATTCACCT



TTGATTACTTGGATTACAACGTGAAGAGCGAATGTGGTCGTACGCTGTGGACCGT



TTACACCGTTGGTGAGCGCTTCACCTATTCCCGTGTGAACCGCGAATATGTACGT



AAAGTCCCCACCGATATTATCTATGATGCCCTCCAGAAAGCAGGCATTAGCGTCG



AAGGAGACTTAAGGGACAGAATTGCCGAAAGCGATGGCGATACGCTGAAGTCTAT



TTTTTACGCATTCAAATACGCGCTAGATATGCGCGTTGAGAATCGCGAGGAAGAC



TACATTCAATCACCTGTGAAAAATGCCTCTGGGGAATTTTTTTGTTCAAAAAATG



CTGGTAAAAGCCTCCCACAAGATAGCGATGCAAACGGTGCATATAACATTGCCCT



GAAAGGTATTCTTCAATTACGCATGCTGTCTGAGCAGTACGACCCCAACGCGGAA



TCTATTAGACTTCCGCTGATAACCAATAAAGCCTGGCTGACATTCATGCAGTCTG



GCATGAAGACCTGGAAAAATTAG





SEQ ID NO: 46
ATGGATAGTTTAAAAGATTTTACGAATCTATATCCCGTAAGCAAAACTCTTCGTT



TTGAACTGAAACCTGTTGGAAAAACGTTGGAGAATATCGAGAAAGCGGGCATCCT



GAAAGAAGACGAGCACCGTGCCGAAAGCTACAGGCGTGTCAAAAAGATTATCGAT



ACTTATCACAAAGTGTTCATTGATAGCAGTCTGGAGAACATGGCAAAAATGGGCA



TAGAAAATGAAATCAAAGCAATGCTGCAGAGCTTTTGCGAGCTCTACAAGAAAGA



TCACCGAACGGAAGGTGAAGATAAAGCACTGGACAAAATTCGCGCCGTTCTTCGC



GGTCTGATTGTTGGCGCGTTCACCGGCGTGTGCGGCCGCCGTGAAAACACCGTGC



AGAACGAAAAGTACGAGTCGCTGTTCAAAGAAAAACTGATAAAAGAAATTTTGCC



TGACTTTGTGCTTTCGACCGAAGCGGAATCCCTGCCATTTTCTGTCGAAGAAGCG



ACCCGCAGCCTGAAAGAATTTGACTCATTCACAAGTTACTTTGCAGGCTTCTACG



AAAACCGTAAAAACATCTACAGCACGAAGCCACAGAGCACGGCTATTGCTTATCG



CCTGATTCATGAGAACCTGCCGAAGTTCATCGATAACATCCTTGTTTTTCAAAAA



ATTAAAGAGCCGATTGCGAAAGAGTTAGAACATATTCGAGCTGACTTTTCTGCGG



GTGGGTACATTAAAAAAGATGAGCGGCTGGAAGACATCTTCAGTCTAAACTATTA



TATCCACGTTCTGTCGCAGGCAGGCATTGAGAAATATAATGCGCTGATTGGTAAG



ATTGTCACAGAAGGCGATGGTGAGATGAAAGGTCTTAATGAACATATCAATCTGT



ATAACCAGCAGCGTGGTCGCGAAGACCGTCTTCCACTGTTCCGCCCACTGTATAA



ACAGATCCTGTCTGACCGGGAACAGCTGTCCTACCTGCCGGAAAGCTTTGAAAAG



GATGAAGAGCTACTTCGCGCATTAAAGGAGTTTTACGACCATATTGCGGAAGACA



TTTTGGGTAGAACGCAGCAACTGATGACGTCAATTTCTGAATACGATCTGAGTAG



AATCTACGTTAGGAATGATAGCCAGCTGACCGATATTAGCAAAAAAATGCTGGGC



GACTGGAACGCTATCTATATGGCACGTGAACGTGCATATGATCATGAACAAGCAC



CGAAACGTATAACCGCGAAATATGAGCGTGATCGCATTAAGGCGCTAAAGGGAGA



AGAAAGCATCTCACTCGCAAACCTGAACTCCTGTATCGCTTTCTTAGATAACGTG



CGCGATTGTCGCGTCGACACGTATCTGTCAACCCTTGGGCAGAAAGAGGGTCCAC



ATGGTCTGTCTAACCTGGTGGAAAATGTCTTTGCGAGTTACCATGAAGCGGAACA



ACTGCTGTCTTTTCCATACCCCGAAGAAAACAATCTAATACAGGATAAAGATAAC



GTGGTGTTAATCAAAAACCTGCTGGACAACATCAGCGATCTGCAACGTTTCCTGA



AACCTTTGTGGGGTATGGGTGACGAGCCAGACAAAGACGAACGTTTTTATGGTGA



GTATAATTATATACGTGGCGCCCTTGACCAAGTTATTCCGCTGTATAACAAAGTA



CGGAACTATCTGACCCGTAAGCCATATTCTACCCGTAAAGTGAAACTGAACTTTG



GCAACTCGCAACTGCTGTCGGGTTGGGATCGTAACAAAGAAAAAGATAATAGTTG



TGTTATCCTGCGTAAGGGACAAAATTTTTACCTCGCGATTATGAACAACAGACAC



AAGCGTTCATTTGAAAATAAGGTTCTGCCGGAGTATAAAGAGGGCGAACCGTACT



TCGAGAAAATGGATTATAAGTTCTTACCAGACCCTAATAAGATGTTACCGAAAGT



CTTTCTTTCGAAAAAAGGCATAGAAATCTATAAGCCGTCCCCGAAATTACTCGAA



CAGTATGGGCACGGGACCCACAAGAAAGGGGATACTTTTAGCATGGACGATCTGC



ACGAACTGATCGATTTTTTTAAACACTCCATCGAAGCCCATGAAGACTGGAAACA



GTTTGGGTTCAAGTTCTCTGATACAGCCACATACGAGAATGTGTCTAGTTTTTAT



CGGGAAGTGGAGGATCAGGGCTACAAACTTAGTTTTCGTAAAGTTTCAGAGAGTT



ATGTTTATAGTTTAATTGATCAGGGAAAACTTTACCTGTTCCAGATCTACAACAA



AGATTTCTCGCCATGTAGTAAGGGTACCCCGAATCTGCATACACTCTATTGGAGA



ATGTTATTCGATGAGCGTAACTTAGCGGATGTCATTTATAAATTGGACGGGAAAG



CAGAGATCTTTTTTCGTGAAAAATCACTGAAGAATGACCACCCGACTCATCCGGC



CGGGAAACCGATCAAAAAAAAATCCCGCCAGAAAAAAGGAGAAGAGTCTCTGTTT



GAATATGATCTGGTGAAAGACCGTCATTACACTATGGATAAATTTCAATTTCATG



TTCCAATTACAATGAACTTCAAATGTTCGGCGGGTTCCAAAGTAAATGATATGGT



AAACGCCCATATTCGCGAAGCGAAAGATATGCATGTTATTGGCATCGATAGAGGC



GAAAGAAACCTGCTTTATATTTGCGTAATTGACAGCCGTGGTACCATTCTGGACC



AGATCTCTTTAAACACCATCAATGACATCGATTATCACGACCTGTTGGAGTCTCG



GGACAAGGACCGCCAGCAGGAGCGCCGTAATTGGCAGACAATTGAAGGCATAAAA



GAATTAAAACAGGGTTACCTTTCCCAGGCCGTACACCGCATAGCGGAACTGATGG



TGGCCTACAAAGCCGTAGTTGCCCTGGAAGACTTGAATATGGGGTTTAAACGTGG



CCGTCAAAAAGTCGAGAGCAGCGTGTATCAGCAATTTGAAAAACAGTTGATTGAC



AAGTTGAATTATTTGGTTGATAAAAAGAAACGTCCAGAAGATATTGGTGGCTTAC



TGCGTGCATACCAGTTTACGGCACCTTTTAAGTCCTTCAAAGAAATGGGTAAACA



GAACGGGTTTCTGTTTTACATCCCGGCCTGGAATACATCCAACATCGATCCTACC



ACCGGGTTTGTCAACCTGTTTCATGCACAATATGAAAACGTGGATAAAGCGAAGA



GTTTTTTCCAAAAATTCGATAGTATTTCGTATAACCCAAAAAAAGATTGGTTTGA



GTTTGCGTTCGATTATAAAAATTTTACTAAAAAGGCTGAGGGATCCCGCAGTATG



TGGATCCTCTGCACCCATGGCAGTCGTATTAAAAATTTTCGTAATTCGCAAAAGA



ATGGCCAGTGGGACTCGGAAGAGTTTGCCCTGACCGAAGCGTTCAAATCGCTGTT



TGTACGCTACGAAATTGACTACACAGCAGATCTGAAAACAGCCATCGTCGATGAA



AAACAGAAAGATTTTTTTGTAGATCTCCTAAAACTGTTCAAACTGACTGTTCAGA



TGCGCAATTCCTGGAAAGAGAAAGACCTGGATTATCTGATTAGCCCGGTAGCCGG



TGCTGATGGACGATTTTTCGATACTCGTGAAGGTAACAAAAGTCTCCCGAAAGAT



GCTGATGCCAATGGTGCATACAATATTGCATTAAAGGGGCTATGGGCCTTGCGAC



AGATCCGCCAGACCAGCGAAGGCGGCAAGCTGAAATTGGCCATATCGAATAAGGA



ATGGTTACAATTTGTTCAGGAACGTAGCTATGAAAAAGATTGA





SEQ ID NO: 47
ATGAACAACGGCACAAATAATTTTCAGAACTTCATCGGGATCTCAAGTTTGCAGA



AAACGCTGCGCAATGCTCTGATCCCCACGGAAACCACGCAACAGTTCATCGTCAA



GAACGGAATAATTAAAGAAGATGAGTTACGTGGCGAGAACCGCCAGATTCTGAAA



GATATCATGGATGACTACTACCGCGGATTCATCTCTGAGACTCTGAGTTCTATTG



ATGACATAGATTGGACTAGCCTGTTCGAAAAAATGGAAATTCAGCTGAAAAATGG



TGATAATAAAGATACCTTAATTAAGGAACAGACAGAGTATCGGAAAGCAATCCAT



AAAAAATTTGCGAACGACGATCGGTTTAAGAACATGTTTAGCGCCAAACTGATTA



GTGACATATTACCTGAATTTGTCATCCACAACAATAATTATTCGGCATCAGAGAA



AGAGGAAAAAACCCAGGTGATAAAATTGTTTTCGCGCTTTGCGACTAGCTTTAAA



GATTACTTCAAGAACCGTGCAAATTGCTTTTCAGCGGACGATATTTCATCAAGCA



GCTGCCATCGCATCGTCAACGACAATGCAGAGATATTCTTTTCAAATGCGCTGGT



CTACCGCCGGATCGTAAAATCGCTGAGCAATGACGATATCAACAAAATTTCGGGC



GATATGAAAGATTCATTAAAAGAAATGAGTCTGGAAGAAATATATTCTTACGAGA



AGTATGGGGAATTTATTACCCAGGAAGGCATTAGCTTCTATAATGATATCTGTGG



GAAAGTGAATTCTTTTATGAACCTGTATTGTCAGAAAAATAAAGAAAACAAAAAT



TTATACAAACTTCAGAAACTTCACAAACAGATTCTATGCATTGCGGACACTAGCT



ATGAGGTCCCGTATAAATTTGAAAGTGACGAGGAAGTGTACCAATCAGTTAACGG



CTTCCTTGATAACATTAGCAGCAAACATATAGTCGAAAGATTACGCAAAATCGGC



GATAACTATAACGGCTACAACCTGGATAAAATTTATATCGTGTCCAAATTTTACG



AGAGCGTTAGCCAAAAAACCTACCGCGACTGGGAAACAATTAATACCGCCCTCGA



AATTCATTACAATAATATCTTGCCGGGTAACGGTAAAAGTAAAGCCGACAAAGTA



AAAAAAGCGGTTAAGAATGATTTACAGAAATCCATCACCGAAATAAATGAACTAG



TGTCAAACTATAAGCTGTGCAGTGACGACAACATCAAAGCGGAGACTTATATACA



TGAGATTAGCCATATCTTGAATAACTTTGAAGCACAGGAATTGAAATACAATCCG



GAAATTCACCTAGTTGAATCCGAGCTCAAAGCGAGTGAGCTTAAAAACGTGCTGG



ACGTGATCATGAATGCGTTTCATTGGTGTTCGGTTTTTATGACTGAGGAACTTGT



TGATAAAGACAACAATTTTTATGCGGAACTGGAGGAGATTTACGATGAAATTTAT



CCAGTAATTAGTCTGTACAACCTGGTTCGTAACTACGTTACCCAGAAACCGTACA



GCACGAAAAAGATTAAATTGAACTTTGGAATACCGACGTTAGCAGACGGTTGGTC



AAAGTCCAAAGAGTATTCTAATAACGCTATCATACTGATGCGCGACAATCTGTAT



TATCTGGGCATCTTTAATGCGAAGAATAAACCGGACAAGAAGATTATCGAGGGTA



ATACGTCAGAAAATAAGGGTGACTACAAAAAGATGATTTATAATTTGCTCCCGGG



TCCCAACAAAATGATCCCGAAAGTTTTCTTGAGCAGCAAGACGGGGGTGGAAACG



TATAAACCGAGCGCCTATATCCTAGAGGGGTATAAACAGAATAAACATATCAAGT



CTTCAAAAGACTTTGATATCACTTTCTGTCATGATCTGATCGACTACTTCAAAAA



CTGTATTGCAATTCATCCCGAGTGGAAAAACTTCGGTTTTGATTTTAGCGACACC



AGTACTTATGAAGACATTTCCGGGTTTTATCGTGAGGTAGAGTTACAAGGTTACA



AGATTGATTGGACATACATTAGCGAAAAAGACATTGATCTGCTGCAGGAAAAAGG



TCAACTGTATCTGTTCCAGATATATAACAAAGATTTTTCGAAAAAATCAACCGGG



AATGACAACCTTCACACCATGTACCTGAAAAATCTTTTCTCAGAAGAAAATCTTA



AGGATATCGTCCTGAAACTTAACGGCGAAGCGGAAATCTTCTTCAGGAAGAGCAG



CATAAAGAACCCAATCATTCATAAAAAAGGCTCGATTTTAGTCAACCGTACCTAC



GAAGCAGAAGAAAAAGACCAGTTTGGCAACATTCAAATTGTGCGTAAAAATATTC



CGGAAAACATTTATCAGGAGCTGTACAAATACTTCAACGATAAAAGCGACAAAGA



GCTGTCTGATGAAGCAGCCAAACTGAAGAATGTAGTGGGACACCACGAGGCAGCG



ACGAATATAGTCAAGGACTATCGCTACACGTATGATAAATACTTCCTTCATATGC



CTATTACGATCAATTTCAAAGCCAATAAAACGGGTTTTATTAATGATAGGATCTT



ACAGTATATCGCTAAAGAAAAAGACTTACATGTGATCGGCATTGATCGGGGCGAG



CGTAACCTGATCTACGTGTCCGTGATTGATACTTGTGGTAATATAGTTGAACAGA



AAAGCTTTAACATTGTAAACGGCTACGACTATCAGATAAAACTGAAACAACAGGA



GGGCGCTAGACAGATTGCGCGGAAAGAATGGAAAGAAATTGGTAAAATTAAAGAG



ATCAAAGAGGGCTACCTGAGCTTAGTAATCCACGAGATCTCTAAAATGGTAATCA



AATACAATGCAATTATAGCGATGGAGGATTTGTCTTATGGTTTTAAAAAAGGGCG



CTTTAAGGTCGAACGGCAAGTTTACCAGAAATTTGAAACCATGCTCATCAATAAA



CTCAACTATCTGGTATTTAAAGATATTTCGATTACCGAGAATGGCGGTCTCCTGA



AAGGTTATCAGCTGACATACATTCCTGATAAACTTAAAAACGTGGGTCATCAGTG



CGGCTGCATTTTTTATGTGCCTGCTGCATACACGAGCAAAATTGATCCGACCACC



GGCTTTGTGAATATCTTTAAATTTAAAGACCTGACAGTGGACGCAAAACGTGAAT



TCATTAAAAAATTTGACTCAATTCGTTATGACAGTGAAAAAAATCTGTTCTGCTT



TACATTTGACTACAATAACTTTATTACGCAAAACACGGTCATGAGCAAATCATCG



TGGAGTGTGTATACATACGGCGTGCGCATCAAACGTCGCTTTGTGAACGGCCGCT



TCTCAAACGAAAGTGATACCATTGACATAACCAAAGATATGGAGAAAACGTTGGA



AATGACGGACATTAACTGGCGCGATGGCCACGATCTTCGTCAAGACATTATAGAT



TATGAAATTGTTCAGCACATATTCGAAATTTTCCGTTTAACAGTGCAAATGCGTA



ACTCCTTGTCTGAACTGGAGGACCGTGATTACGATCGTCTCATTTCACCTGTACT



GAACGAAAATAACATTTTTTATGACAGCGCGAAAGCGGGGGATGCACTTCCTAAG



GATGCCGATGCAAATGGTGCGTATTGTATTGCATTAAAAGGGTTATATGAAATTA



AACAAATTACCGAAAATTGGAAAGAAGATGGTAAATTTTCGCGCGATAAACTCAA



AATCAGCAATAAAGATTGGTTCGACTTTATCCAGAATAAGCGCTATCTCTAA





SEQ ID NO: 48
ATGACCAATAAATTCACTAACCAGTATTCTCTCTCTAAGACCCTGCGCTTTGAAC



TGATTCCGCAGGGGAAAACCTTGGAGTTCATTCAAGAAAAAGGCCTCTTGTCTCA



GGATAAACAGAGGGCTGAATCTTACCAAGAAATGAAGAAAACTATTGATAAGTTT



CATAAATATTTCATTGATTTAGCCTTGTCTAACGCCAAATTAACTCACTTGGAAA



CGTATCTGGAGTTATACAACAAATCTGCCGAAACTAAGAAAGAACAGAAATTTAA



AGACGATTTGAAAAAAGTACAGGACAATCTGCGTAAAGAAATTGTCAAATCCTTC



AGTGACGGCGATGCTAAAAGCATTTTTGCCATTCTGGACAAAAAAGAGTTGATTA



CTGTGGAATTAGAAAAGTGGTTTGAAAACAATGAGCAGAAAGACATCTACTTCGA



TGAGAAATTCAAAACTTTCACCACCTATTTTACAGGATTTCATCAAAACCGGAAG



AACATGTACTCAGTAGAACCGAACTCCACGGCCATTGCGTATCGTTTGATCCATG



AGAATCTGCCTAAATTTCTGGAGAATGCGAAAGCCTTTGAAAAGATTAAGCAGGT



CGAATCGCTGCAAGTGAATTTTCGTGAACTCATGGGCGAATTTGGTGACGAAGGT



CTAATCTTCGTTAACGAACTGGAAGAAATGTTTCAGATTAATTACTACAATGACG



TGCTATCGCAGAACGGTATCACAATCTACAATAGTATTATCTCAGGGTTCACAAA



AAACGATATAAAATACAAAGGCCTGAACGAGTATATCAATAACTACAACCAAACA



AAGGACAAAAAGGATAGGCTTCCGAAACTGAAGCAGTTATACAAACAGATTTTAT



CTGACAGAATCTCCCTGAGCTTTCTGCCGGATGCTTTCACTGATGGGAAGCAGGT



TCTGAAAGCGATTTTCGATTTTTATAAGATTAACTTACTGAGCTACACGATTGAA



GGTCAAGAAGAATCTCAAAACTTACTGCTCTTGATCCGTCAAACCATTGAAAATC



TATCATCGTTCGATACGCAGAAAATCTACCTCAAAAACGATACTCACCTGACTAC



GATCTCTCAGCAGGTTTTCGGGGATTTTAGTGTATTTTCAACAGCTCTGAACTAC



TGGTATGAAACCAAAGTCAATCCGAAATTCGAGACGGAATATTCTAAGGCCAACG



AAAAAAAACGTGAGATTCTTGATAAAGCTAAAGCCGTATTTACTAAACAGGATTA



CTTTTCTATTGCTTTCCTGCAGGAAGTTTTATCGGAGTATATCCTGACCCTGGAT



CATACATCTGATATCGTTAAAAAACACAGCAGCAATTGCATCGCTGACTATTTCA



AAAACCACTTTGTCGCCAAAAAAGAAAACGAAACAGACAAGACTTTCGATTTCAT



TGCTAACATCACCGCAAAATACCAGTGTATTCAGGGTATCTTGGAAAACGCCGAC



CAATACGAAGACGAACTGAAACAAGATCAGAAGCTGATCGATAATTTAAAATTCT



TCTTAGATGCAATCCTGGAGCTGCTGCACTTCATCAAACCGCTTCATTTAAAGAG



CGAGTCCATTACCGAAAAGGACACCGCCTTCTATGACGTTTTTGAAAATTATTAT



GAAGCCCTCTCCTTGCTGACTCCGCTGTATAATATGGTACGCAATTACGTAACCC



AGAAACCATATTCTACCGAAAAAATTAAACTGAACTTTGAAAACGCACAGCTGCT



CAACGGTTGGGACGCGAATAAAGAAGGTGACTACCTCACCACCATCCTGAAAAAA



GATGGTAACTATTTTCTGGCAATTATGGATAAGAAACATAATAAAGCATTCCAGA



AATTTCCTGAAGGGAAAGAAAATTACGAAAAGATGGTGTACAAACTCTTACCTGG



AGTTAACAAAATGTTGCCGAAAGTATTTTTTAGTAATAAGAACATCGCGTACTTT



AACCCGTCCAAAGAACTGCTGGAAAATTATAAAAAGGAGACGCATAAGAAAGGGG



ATACCTTTAACCTGGAACATTGCCATACCTTAATAGACTTCTTCAAGGATTCCCT



GAATAAACACGAGGATTGGAAATATTTCGATTTTCAGTTTAGTGAGACCAAGTCA



TACCAGGATCTTAGCGGCTTTTATCGCGAAGTAGAACACCAAGGCTATAAAATTA



ACTTCAAAAACATCGACAGCGAATACATCGACGGTTTAGTTAACGAGGGCAAACT



GTTTCTGTTCCAGATCTATTCAAAGGATTTTAGCCCGTTCTCTAAAGGCAAACCA



AATATGCATACGTTGTACTGGAAAGCACTGTTTGAAGAGCAAAACCTGCAGAATG



TGATTTATAAACTGAACGGCCAAGCTGAGATTTTTTTCCGTAAAGCCTCGATTAA



ACCGAAAAATATCATCCTTCATAAGAAGAAAATAAAGATCGCTAAAAAACACTTC



ATAGATAAAAAAACCAAAACCTCCGAAATAGTGCCTGTTCAAACAATTAAGAACT



TGAATATGTACTACCAGGGCAAGATATCGGAAAAGGAGTTGACTCAAGACGATCT



TCGCTATATCGATAACTTTTCGATTTTTAACGAAAAAAACAAGACGATCGACATC



ATCAAAGATAAACGCTTCACTGTAGATAAGTTCCAGTTTCATGTGCCGATTACTA



TGAACTTCAAAGCTACCGGGGGTAGCTATATCAACCAAACGGTGTTGGAATACCT



GCAGAATAACCCGGAAGTCAAAATCATTGGGCTGGACCGCGGAGAACGTCACCTT



GTGTACTTGACCTTAATCGATCAGCAAGGCAACATCTTAAAACAAGAATCGCTGA



ATACCATTACGGATTCAAAGATTAGCACCCCGTATCATAAGCTGCTCGATAACAA



GGAGAATGAGCGCGACCTGGCCCGTAAAAACTGGGGCACGGTGGAAAACATTAAG



GAGTTAAAGGAGGGTTATATTTCCCAGGTAGTGCATAAGATCGCCACTCTCATGC



TCGAGGAAAATGCGATCGTTGTCATGGAAGACTTAAACTTCGGATTTAAACGTGG



GCGATTTAAAGTAGAGAAACAAATCTACCAGAAGTTAGAAAAAATGCTGATTGAC



AAATTAAATTACTTGGTCCTAAAAGACAAACAGCCGCAAGAATTGGGTGGATTAT



ACAACGCCCTCCAACTTACCAATAAATTCGAAAGTTTTCAGAAAATGGGTAAACA



GTCAGGCTTTCTTTTTTATGTTCCTGCGTGGAACACATCCAAAATCGACCCTACA



ACCGGCTTCGTCAATTACTTCTATACTAAATATGAAAACGTCGACAAAGCAAAAG



CATTCTTTGAAAAGTTCGAAGCAATACGTTTTAACGCTGAGAAAAAATATTTCGA



GTTCGAAGTCAAGAAATACTCAGACTTTAACCCCAAAGCTGAGGGCACACAGCAA



GCGTGGACAATCTGCACCTACGGCGAGCGCATCGAAACGAAGCGTCAAAAAGATC



AGAATAACAAATTTGTTTCAACACCTATCAACCTGACCGAGAAGATTGAAGACTT



CTTAGGTAAAAATCAGATTGTTTATGGCGACGGTAACTGTATAAAATCTCAAATA



GCCTCAAAGGATGATAAAGCATTTTTCGAAACATTATTATATTGGTTCAAAATGA



CACTGCAGATGCGCAATAGTGAGACGCGTACAGATATTGATTATCTTATCAGCCC



GGTCATGAACGACAACGGTACTTTTTACAACTCCAGAGACTATGAAAAACTTGAG



AATCCAACTCTCCCCAAAGATGCTGATGCGAACGGTGCTTATCACATCGCGAAAA



AAGGTCTGATGCTGCTGAACAAAATCGACCAAGCCGATCTGACTAAGAAAGTTGA



CCTAAGCATTTCAAATCGGGACTGGTTACAGTTTGTTCAAAAGAACAAATGA





SEQ ID NO: 49
ATGGAACAGGAATATTATCTGGGCTTGGACATGGGCACCGGTTCCGTCGGCTGGG



CTGTTACTGACAGTGAATATCACGTTCTAAGAAAGCATGGTAAGGCATTGTGGGG



TGTAAGACTTTTCGAATCTGCTTCCACTGCTGAAGAGCGTAGAATGTTTAGAACG



AGTCGACGTAGGCTAGACAGGCGCAATTGGAGAATCGAAATTTTACAAGAAATTT



TTGCGGAAGAGATATCTAAGAAAGACCCAGGCTTTTTCCTGAGAATGAAGGAATC



TAAGTATTACCCTGAGGATAAAAGAGATATAAATGGTAACTGTCCCGAATTGCCT



TACGCATTATTTGTGGACGATGATTTTACCGATAAGGATTACCATAAAAAGTTCC



CAACTATCTACCATTTACGCAAAATGTTAATGAATACAGAGGAAACCCCAGACAT



AAGACTAGTTTATCTGGCAATACACCATATGATGAAACATAGAGGCCATTTCTTA



CTTTCCGGGGATATCAACGAAATCAAAGAGTTTGGTACCACATTTAGTAAGTTAC



TGGAAAACATAAAGAATGAAGAATTGGATTGGAACTTAGAACTCGGAAAAGAAGA



ATACGCGGTTGTCGAATCTATCCTGAAGGATAATATGCTGAATAGGTCGACCAAA



AAAACTAGGCTGATCAAAGCACTGAAAGCCAAATCTATCTGCGAAAAAGCTGTTT



TAAATTTACTTGCTGGTGGCACTGTTAAGTTATCAGACATTTTTGGTTTGGAAGA



ATTGAACGAAACCGAGCGTCCAAAAATTAGTTTCGCTGATAATGGCTACGATGAT



TACATTGGTGAGGTGGAAAACGAGTTGGGCGAACAATTTTATATTATAGAGACAG



CTAAGGCAGTCTATGACTGGGCTGTTTTAGTAGAAATCCTTGGTAAATACACATC



TATCTCCGAAGCGAAAGTTGCTACTTACGAAAAGCACAAGTCCGATCTCCAGTTT



TTGAAGAAAATTGTCAGGAAATATCTGACTAAGGAAGAATATAAAGATATTTTCG



TTAGTACCTCTGACAAACTGAAAAATTACTCCGCTTACATCGGGATGACCAAGAT



TAATGGCAAAAAAGTTGATCTGCAAAGCAAAAGGTGTTCGAAGGAAGAATTTTAT



GATTTCATTAAAAAGAATGTCTTAAAAAAATTAGAAGGTCAGCCAGAATACGAAT



ATTTGAAAGAAGAACTGGAAAGAGAGACATTCTTACCAAAACAAGTCAACAGAGA



TAATGGGGTAATTCCATATCAAATTCACCTCTACGAATTAAAAAAAATTTTAGGC



AATTTACGCGATAAAATTGACCTTATCAAAGAAAATGAGGATAAGCTGGTTCAAC



TCTTTGAATTCAGAATACCCTATTATGTGGGCCCACTGAACAAGATTGATGACGG



CAAAGAAGGTAAATTCACATGGGCCGTCCGCAAATCCAATGAAAAAATTTACCCA



TGGAACTTTGAAAATGTAGTAGATATTGAAGCGTCTGCGGAGAAATTTATTCGAA



GAATGACTAATAAATGCACTTACTTGATGGGAGAGGATGTTCTGCCTAAAGACAG



CTTATTATACAGCAAGTACATGGTTCTAAACGAACTTAACAACGTTAAGTTGGAC



GGTGAGAAATTAAGTGTAGAATTGAAACAAAGATTGTATACTGACGTCTTCTGCA



AGTACAGAAAAGTGACAGTTAAAAAAATTAAGAATTACTTGAAGTGCGAAGGTAT



AATTTCTGGAAACGTAGAGATTACTGGTATTGATGGTGATTTCAAAGCATCCCTA



ACAGCTTACCACGATTTCAAGGAAATCCTGACAGGAACTGAACTCGCAAAAAAAG



ATAAAGAAAACATTATTACTAATATTGTTCTTTTCGGTGATGACAAGAAATTGTT



GAAGAAAAGACTGAATAGACTTTACCCCCAGATTACTCCCAATCAACTTAAGAAA



ATTTGTGCTTTGTCTTACACAGGATGGGGTCGTTTTTCAAAAAAGTTCTTAGAAG



AGATTACCGCACCTGATCCAGAAACAGGCGAAGTATGGAATATAATTACCGCCTT



ATGGGAATCGAACAATAATCTTATGCAACTTCTGAGCAATGAATATCGTTTCATG



GAAGAAGTTGAGACTTACAACATGGGCAAACAGACGAAGACTTTATCCTATGAAA



CTGTGGAAAATATGTATGTATCACCTTCTGTCAAGAGACAAATTTGGCAAACCTT



AAAAATTGTCAAAGAATTAGAAAAGGTAATGAAGGAGTCTCCTAAACGTGTGTTT



ATTGAAATGGCTAGAGAAAAACAAGAGTCAAAAAGAACCGAGTCAAGAAAGAAGC



AGTTAATCGATTTATATAAGGCTTGTAAAAACGAAGAGAAAGATTGGGTTAAAGA



ATTGGGGGACCAAGAGGAACAAAAACTACGGTCGGATAAGTTGTATTTATACTAT



ACGCAAAAGGGACGATGTATGTATTCCGGCGAGGTAATAGAATTGAAGGATTTAT



GGGACAATACAAAATATGACATAGACCATATATATCCCCAATCAAAAACGATGGA



CGATAGCTTGAACAATAGAGTACTCGTGAAAAAAAAATATAATGCGACCAAATCT



GATAAGTATCCTCTGAATGAAAATATCAGACATGAAAGAAAGGGGTTCTGGAAGT



CCTTGTTAGATGGTGGGTTTATAAGCAAAGAAAAGTACGAGCGTCTAATAAGAAA



CACGGAGTTATCGCCAGAAGAACTCGCTGGTTTTATTGAGAGGCAAATCGTGGAA



ACGAGACAATCTACCAAAGCCGTTGCTGAGATCCTAAAGCAAGTTTTCCCAGAGT



CGGAGATTGTCTATGTCAAAGCTGGCACAGTGAGCAGGTTTAGGAAAGACTTCGA



ACTATTAAAGGTAAGAGAAGTGAACGATTTACATCACGCAAAGGACGCTTACCTA



AATATCGTTGTAGGTAACTCATATTATGTTAAATTTACCAAGAACGCCTCTTGGT



TTATAAAGGAGAACCCAGGTAGAACATATAACCTGAAAAAGATGTTCACCTCTGG



TTGGAATATTGAGAGAAACGGAGAAGTCGCATGGGAAGTTGGTAAGAAAGGGACT



ATAGTGACAGTAAAGCAAATTATGAACAAAAATAATATCCTCGTTACAAGGCAGG



TTCATGAAGCAAAGGGCGGCCTTTTTGACCAACAAATTATGAAGAAAGGGAAAGG



TCAAATTGCAATAAAAGAAACCGATGAGAGACTAGCGTCAATAGAAAAGTATGGT



GGCTATAATAAAGCTGCGGGTGCATACTTTATGCTTGTTGAATCAAAAGACAAGA



AAGGTAAGACTATTAGAACTATAGAATTTATACCCCTGTACCTTAAAAACAAAAT



TGAATCGGATGAGTCAATCGCGTTAAATTTTCTAGAGAAAGGAAGGGGTTTAAAA



GAACCAAAGATCCTGTTAAAAAAGATTAAGATTGACACCTTGTTCGATGTAGATG



GATTTAAAATGTGGTTATCTGGCAGAACAGGCGATAGACTTTTGTTTAAGTGCGC



TAATCAATTAATTTTGGATGAGAAAATCATTGTCACAATGAAAAAAATAGTTAAG



TTTATTCAGAGAAGACAAGAAAACAGGGAGTTGAAATTATCTGATAAAGATGGTA



TCGACAATGAAGTTTTAATGGAAATCTACAATACATTCGTTGATAAACTTGAAAA



TACCGTATATCGAATCAGGTTAAGTGAACAAGCCAAAACATTAATTGATAAACAA



AAAGAATTTGAAAGGCTATCACTGGAAGACAAATCCTCCACCCTATTTGAAATTT



TGCATATATTCCAGTGCCAATCTTCAGCAGCTAATTTAAAAATGATTGGCGGACC



TGGGAAAGCCGGCATCCTAGTGATGAACAATAATATCTCCAAGTGTAACAAAATA



TCAATTATTAACCAATCTCCGACAGGTATTTTTGAAAATGAAATAGACTTGCTTA



AGATATAA





SEQ ID NO: 50
ATGTCTTTCGACTCTTTCACCAACCTGTACTCTCTGTCTAAAACCCTGAAATTCG



AAATGCGTCCGGTTGGTAACACCCAGAAAATGCTGGACAACGCGGGTGTTTTCGA



AAAAGACAAACTGATCCAGAAAAAATACGGTAAAACCAAACCGTACTTCGACCGT



CTGCACCGTGAATTCATCGAAGAAGCGCTGACCGGTGTTGAACTGATCGGTCTGG



ACGAAAACTTCCGTACCCTGGTTGACTGGCAGAAAGACAAAAAAAACAACGTTGC



GATGAAAGCGTACGAAAACTCTCTGCAGCGTCTGCGTACCGAAATCGGTAAAATC



TTCAACCTGAAAGCGGAAGACTGGGTTAAAAACAAATACCCGATCCTGGGTCTGA



AAAACAAAAACACCGACATCCTGTTCGAAGAAGCGGTTTTCGGTATCCTGAAAGC



GCGTTACGGTGAAGAAAAAGACACCTTCATCGAAGTTGAAGAAATCGACAAAACC



GGTAAATCTAAAATCAACCAGATCTCTATCTTCGACTCTTGGAAAGGTTTCACCG



GTTACTTCAAAAAATTCTTCGAAACCCGTAAAAACTTCTACAAAAACGACGGTAC



CTCTACCGCGATCGCGACCCGTATCATCGACCAGAACCTGAAACGTTTCATCGAC



AACCTGTCTATCGTTGAATCTGTTCGTCAGAAAGTTGACCTGGCGGAAACCGAAA



AATCTTTCTCTATCTCTCTGTCTCAGTTCTTCTCTATCGACTTCTACAACAAATG



CCTGCTGCAGGACGGTATCGACTACTACAACAAAATCATCGGTGGTGAAACCCTG



AAAAACGGTGAAAAACTGATCGGTCTGAACGAACTGATCAACCAGTACCGTCAGA



ACAACAAAGACCAGAAAATCCCGTTCTTCAAACTGCTGGACAAACAGATCCTGTC



TGAAAAAATCCTGTTCCTGGACGAAATCAAAAACGACACCGAACTGATCGAAGCG



CTGTCTCAGTTCGCGAAAACCGCGGAAGAAAAAACCAAAATCGTTAAAAAACTGT



TCGCGGACTTCGTTGAAAACAACTCTAAATACGACCTGGCGCAGATCTACATCTC



TCAGGAAGCGTTCAACACCATCTCTAACAAATGGACCTCTGAAACCGAAACCTTC



GCGAAATACCTGTTCGAAGCGATGAAATCTGGTAAACTGGCGAAATACGAAAAAA



AAGACAACTCTTACAAATTCCCGGACTTCATCGCGCTGTCTCAGATGAAATCTGC



GCTGCTGTCTATCTCTCTGGAAGGTCACTTCTGGAAAGAAAAATACTACAAAATC



TCTAAATTCCAGGAAAAAACCAACTGGGAACAGTTCCTGGCGATCTTCCTGTACG



AATTCAACTCTCTGTTCTCTGACAAAATCAACACCAAAGACGGTGAAACCAAACA



GGTTGGTTACTACCTGTTCGCGAAAGACCTGCACAACCTGATCCTGTCTGAACAG



ATCGACATCCCGAAAGACTCTAAAGTTACCATCAAAGACTTCGCGGACTCTGTTC



TGACCATCTACCAGATGGCGAAATACTTCGCGGTTGAAAAAAAACGTGCGTGGCT



GGCGGAATACGAACTGGACTCTTTCTACACCCAGCCGGACACCGGTTACCTGCAG



TTCTACGACAACGCGTACGAAGACATCGTTCAGGTTTACAACAAACTGCGTAACT



ACCTGACCAAAAAACCGTACTCTGAAGAAAAATGGAAACTGAACTTCGAAAACTC



TACCCTGGCGAACGGTTGGGACAAAAACAAAGAATCTGACAACTCTGCGGTTATC



CTGCAGAAAGGTGGTAAATACTACCTGGGTCTGATCACCAAAGGTCACAACAAAA



TCTTCGACGACCGTTTCCAGGAAAAATTCATCGTTGGTATCGAAGGTGGTAAATA



CGAAAAAATCGTTTACAAATTCTTCCCGGACCAGGCGAAAATGTTCCCGAAAGTT



TGCTTCTCTGCGAAAGGTCTGGAATTCTTCCGTCCGTCTGAAGAAATCCTGCGTA



TCTACAACAACGCGGAATTCAAAAAAGGTGAAACCTACTCTATCGACTCTATGCA



GAAACTGATCGACTTCTACAAAGACTGCCTGACCAAATACGAAGGTTGGGCGTGC



TACACCTTCCGTCACCTGAAACCGACCGAAGAATACCAGAACAACATCGGTGAAT



TCTTCCGTGACGTTGCGGAAGACGGTTACCGTATCGACTTCCAGGGTATCTCTGA



CCAGTACATCCACGAAAAAAACGAAAAAGGTGAACTGCACCTGTTCGAAATCCAC



AACAAAGACTGGAACCTGGACAAAGCGCGTGACGGTAAATCTAAAACCACCCAGA



AAAACCTGCACACCCTGTACTTCGAATCTCTGTTCTCTAACGACAACGTTGTTCA



GAACTTCCCGATCAAACTGAACGGTCAGGCGGAAATCTTCTACCGTCCGAAAACC



GAAAAAGACAAACTGGAATCTAAAAAAGACAAAAAAGGTAACAAAGTTATCGACC



ACAAACGTTACTCTGAAAACAAAATCTTCTTCCACGTTCCGCTGACCCTGAACCG



TACCAAAAACGACTCTTACCGTTTCAACGCGCAGATCAACAACTTCCTGGCGAAC



AACAAAGACATCAACATCATCGGTGTTGACCGTGGTGAAAAACACCTGGTTTACT



ACTCTGTTATCACCCAGGCGTCTGACATCCTGGAATCTGGTTCTCTGAACGAACT



GAACGGTGTTAACTACGCGGAAAAACTGGGTAAAAAAGCGGAAAACCGTGAACAG



GCGCGTCGTGACTGGCAGGACGTTCAGGGTATCAAAGACCTGAAAAAAGGTTACA



TCTCTCAGGTTGTTCGTAAACTGGCGGACCTGGCGATCAAACACAACGCGATCAT



CATCCTGGAAGACCTGAACATGCGTTTCAAACAGGTTCGTGGTGGTATCGAAAAA



TCTATCTACCAGCAGCTGGAAAAAGCGCTGATCGACAAACTGTCTTTCCTGGTTG



ACAAAGGTGAAAAAAACCCGGAACAGGCGGGTCACCTGCTGAAAGCGTACCAGCT



GTCTGCGCCGTTCGAAACCTTCCAGAAAATGGGTAAACAGACCGGTATCATCTTC



TACACCCAGGCGTCTTACACCTCTAAATCTGACCCGGTTACCGGTTGGCGTCCGC



ACCTGTACCTGAAATACTTCTCTGCGAAAAAAGCGAAAGACGACATCGCGAAATT



CACCAAAATCGAATTCGTTAACGACCGTTTCGAACTGACCTACGACATCAAAGAC



TTCCAGCAGGCGAAAGAATACCCGAACAAAACCGTTTGGAAAGTTTGCTCTAACG



TTGAACGTTTCCGTTGGGACAAAAACCTGAACCAGAACAAAGGTGGTTACACCCA



CTACACCAACATCACCGAAAACATCCAGGAACTGTTCACCAAATACGGTATCGAC



ATCACCAAAGACCTGCTGACCCAGATCTCTACCATCGACGAAAAACAGAACACCT



CTTTCTTCCGTGACTTCATCTTCTACTTCAACCTGATCTGCCAGATCCGTAACAC



CGACGACTCTGAAATCGCGAAAAAAAACGGTAAAGACGACTTCATCCTGTCTCCG



GTTGAACCGTTCTTCGACTCTCGTAAAGACAACGGTAACAAACTGCCGGAAAACG



GTGACGACAACGGTGCGTACAACATCGCGCGTAAAGGTATCGTTATCCTGAACAA



AATCTCTCAGTACTCTGAAAAAAACGAAAACTGCGAAAAAATGAAATGGGGTGAC



CTGTACGTTTCTAACATCGACTGGGACAACTTCGTT





SEQ ID NO: 51
ATGGAAAACTTTAAAAACTTATACCCAATAAACAAAACGTTACGTTTTGAACTGC



GTCCATATGGTAAAACACTGGAAAACTTTAAAAAAAGCGGTTTGTTGGAGAAGGA



TGCATTTAAAGCGAACTCTCGCAGATCCATGCAGGCCATCATTGATGAAAAATTT



AAAGAGACGATCGAAGAACGTCTGAAATACACGGAATTTAGTGAGTGTGACTTAG



GTAATATGACTTCTAAAGATAAGAAAATCACCGATAAGGCGGCGACCAACCTGAA



GAAGCAAGTCATTTTATCTTTTGATGATGAAATCTTTAACAACTATTTGAAACCG



GACAAAAACATCGATGCCTTATTTAAAAATGACCCTTCGAACCCGGTGATTAGCA



CATTTAAGGGCTTCACAACGTATTTTGTCAATTTTTTTGAAATTCGTAAACATAT



CTTCAAAGGAGAATCAAGCGGCTCTATGGCTTATCGCATTATTGATGAAAACCTG



ACGACCTATTTGAATAACATTGAAAAAATCAAAAAACTGCCAGAGGAATTAAAGT



CTCAGTTAGAAGGCATCGACCAGATCGACAAACTCAACAACTATAACGAATTTAT



TACGCAGTCTGGTATCACCCACTATAATGAAATTATTGGAGGTATCAGTAAATCA



GAAAATGTGAAAATCCAAGGGATTAATGAAGGCATTAACCTCTATTGCCAGAAAA



ATAAAGTGAAACTGCCGAGGCTGACTCCACTCTACAAAATGATCCTGTCTGACCG



CGTCTCGAATAGCTTTGTCCTGGACACAATTGAAAACGATACGGAATTGATTGAG



ATGATAAGCGATCTGATTAACAAAACCGAAATTTCACAGGATGTAATCATGAGTG



ATATACAAAACATCTTTATTAAATATAAACAGCTTGGTAATCTGCCTGGAATTAG



CTATTCGTCAATAGTGAACGCAATCTGTTCTGATTATGATAACAATTTTGGCGAC



GGTAAGCGTAAAAAGAGTTATGAAAACGATAGGAAAAAACACCTGGAAACTAACG



TGTATTCTATCAACTATATCAGCGAACTGCTTACGGACACCGATGTGAGTTCAAA



CATTAAGATGCGGTATAAGGAGCTTGAACAGAACTACCAGGTCTGTAAGGAAAAC



TTCAACGCAACCAACTGGATGAACATTAAAAATATCAAACAATCCGAGAAGACCA



ACTTAATCAAAGATCTGCTGGATATTTTGAAGAGCATTCAACGTTTTTATGATCT



GTTCGATATCGTTGATGAAGACAAGAATCCTAGTGCGGAATTTTATACATGGCTG



TCTAAAAATGCGGAGAAATTGGATTTCGAATTCAATTCTGTTTATAATAAATCAC



GCAACTATTTGACCCGCAAACAATACAGCGACAAAAAGATAAAACTAAACTTCGA



CAGTCCGACATTGGCAAAGGGCTGGGACGCAAATAAGGAAATCGATAACTCTACG



ATAATTATGCGTAAGTTCAATAATGATCGAGGTGATTATGATTATTTCTTAGGCA



TTTGGAACAAAAGCACCCCGGCCAACGAAAAGATAATTCCACTGGAGGATAACGG



TCTGTTCGAAAAAATGCAGTACAAATTATATCCGGATCCAAGCAAGATGCTTCCA



AAGCAGTTTCTGTCTAAAATTTGGAAAGCTAAGCATCCGACCACCCCAGAATTTG



ACAAGAAATATAAGGAAGGCCGCCATAAGAAAGGTCCCGATTTTGAAAAAGAATT



CTTGCACGAACTGATTGATTGCTTTAAACATGGCTTAGTCAATCACGATGAAAAG



TATCAAGATGTTTTTGGATTCAATTTGAGAAACACAGAAGACTACAATTCCTACA



CTGAGTTTCTCGAAGATGTGGAACGATGTAATTATAATCTGAGCTTTAACAAAAT



CGCGGACACCTCGAATCTGATTAACGATGGTAAACTTTATGTTTTCCAGATCTGG



AGCAAGGATTTCTCTATTGACAGCAAAGGCACCAAAAACCTGAACACCATTTACT



TTGAAAGTCTCTTCAGCGAAGAAAATATGATTGAGAAAATGTTTAAACTTAGCGG



TGAAGCTGAAATATTCTATCGCCCGGCAAGCCTGAACTATTGCGAAGACATTATC



AAAAAGGGTCATCACCACGCTGAACTGAAAGATAAATTTGATTATCCTATCATAA



AAGATAAACGCTATAGCCAGGATAAATTTTTTTTTCATGTTCCTATGGTCATTAA



CTACAAATCAGAAAAACTGAACTCTAAAAGCCTCAATAATCGAACCAATGAAAAC



CTTGGGCAGTTTACCCATATAATTGGAATTGATCGCGGAGAGCGTCATTTAATCT



ACCTGACCGTAGTCGATGTATCGACCGGCGAGATCGTCGAGCAGAAGCACTTAGA



CGAGATTATCAACACTGATACCAAAGGTGTTGAGCATAAGACGCACTATCTAAAC



AAGCTGGAGGAAAAATCGAAAACCCGTGATAATGAACGTAAGAGTTGGGAGGCAA



TTGAAACGATTAAAGAACTGAAGGAGGGTTATATCAGCCACGTAATCAATGAAAT



TCAAAAACTGCAGGAAAAATACAACGCCCTGATCGTTATGGAAAATCTGAATTAC



GGTTTCAAAAATTCTCGCATCAAAGTGGAAAAACAGGTATATCAGAAGTTCGAGA



CGGCATTAATTAAAAAGTTTAATTACATCATTGACAAAAAAGATCCGGAAACTTA



TATTCATGGCTATCAGCTGACGAACCCGATCACCACACTGGATAAAATTGGTAAC



CAGTCTGGTATCGTGCTTTACATCCCTGCCTGGAATACCAGTAAAATCGATCCGG



TAACGGGATTCGTCAACCTTCTATATGCAGATGACCTCAAATATAAGAATCAGGA



ACAGGCCAAGTCTTTTATTCAGAAAATCGATAACATTTACTTTGAGAATGGGGAA



TTCAAATTTGATATTGATTTTTCTAAATGGAACAATCGTTATAGTATATCTAAGA



CGAAATGGACGCTCACCTCGTACGGAACCCGAATCCAGACATTCCGCAATCCGCA



GAAGAACAATAAATGGGACAGCGCCGAGTATGATCTCACTGAAGAATTCAAATTG



ATTCTGAACATTGACGGTACCCTGAAAAGCCAGGATGTCGAAACCTATAAAAAAT



TTATGTCTCTGTTCAAGCTGATGCTGCAACTTAGGAACTCTGTTACCGGCACTGA



TATCGATTATATGATCTCCCCTGTCACTGATAAAACAGGTACGCATTTCGATTCG



CGCGAAAATATCAAAAATCTGCCCGCAGATGCCGACGCCAATGGGGCGTACAATA



TTGCACGCAAGGGTATCATGGCGATCGAAAACATTATGAATGGTATCAGCGACCC



GCTGAAAATCTCAAACGAAGATTATTTGAAATATATCCAAAACCAGCAGGAATAA





SEQ ID NO: 52
ATGACCCAGTTCGAAGGTTTCACCAACCTGTACCAGGTTTCTAAAACCCTGCGTT



TCGAACTGATCCCGCAGGGTAAAACCCTGAAACACATCCAGGAACAGGGTTTCAT



CGAAGAAGACAAAGCGCGTAACGACCACTACAAAGAACTGAAACCGATCATCGAC



CGTATCTACAAAACCTACGCGGACCAGTGCCTGCAGCTGGTTCAGCTGGACTGGG



AAAACCTGTCTGCGGCGATCGACTCTTACCGTAAAGAAAAAACCGAAGAAACCCG



TAACGCGCTGATCGAAGAACAGGCGACCTACCGTAACGCGATCCACGACTACTTC



ATCGGTCGTACCGACAACCTGACCGACGCGATCAACAAACGTCACGCGGAAATCT



ACAAAGGTCTGTTCAAAGCGGAACTGTTCAACGGTAAAGTTCTGAAACAGCTGGG



TACCGTTACCACCACCGAACACGAAAACGCGCTGCTGCGTTCTTTCGACAAATTC



ACCACCTACTTCTCTGGTTTCTACGAAAACCGTAAAAACGTTTTCTCTGCGGAAG



ACATCTCTACCGCGATCCCGCACCGTATCGTTCAGGACAACTTCCCGAAATTCAA



AGAAAACTGCCACATCTTCACCCGTCTGATCACCGCGGTTCCGTCTCTGCGTGAA



CACTTCGAAAACGTTAAAAAAGCGATCGGTATCTTCGTTTCTACCTCTATCGAAG



AAGTTTTCTCTTTCCCGTTCTACAACCAGCTGCTGACCCAGACCCAGATCGACCT



GTACAACCAGCTGCTGGGTGGTATCTCTCGTGAAGCGGGTACCGAAAAAATCAAA



GGTCTGAACGAAGTTCTGAACCTGGCGATCCAGAAAAACGACGAAACCGCGCACA



TCATCGCGTCTCTGCCGCACCGTTTCATCCCGCTGTTCAAACAGATCCTGTCTGA



CCGTAACACCCTGTCTTTCATCCTGGAAGAATTCAAATCTGACGAAGAAGTTATC



CAGTCTTTCTGCAAATACAAAACCCTGCTGCGTAACGAAAACGTTCTGGAAACCG



CGGAAGCGCTGTTCAACGAACTGAACTCTATCGACCTGACCCACATCTTCATCTC



TCACAAAAAACTGGAAACCATCTCTTCTGCGCTGTGCGACCACTGGGACACCCTG



CGTAACGCGCTGTACGAACGTCGTATCTCTGAACTGACCGGTAAAATCACCAAAT



CTGCGAAAGAAAAAGTTCAGCGTTCTCTGAAACACGAAGACATCAACCTGCAGGA



AATCATCTCTGCGGCGGGTAAAGAACTGTCTGAAGCGTTCAAACAGAAAACCTCT



GAAATCCTGTCTCACGCGCACGCGGCGCTGGACCAGCCGCTGCCGACCACCCTGA



AAAAACAGGAAGAAAAAGAAATCCTGAAATCTCAGCTGGACTCTCTGCTGGGTCT



GTACCACCTGCTGGACTGGTTCGCGGTTGACGAATCTAACGAAGTTGACCCGGAA



TTCTCTGCGCGTCTGACCGGTATCAAACTGGAAATGGAACCGTCTCTGTCTTTCT



ACAACAAAGCGCGTAACTACGCGACCAAAAAACCGTACTCTGTTGAAAAATTCAA



ACTGAACTTCCAGATGCCGACCCTGGCGTCTGGTTGGGACGTTAACAAAGAAAAA



AACAACGGTGCGATCCTGTTCGTTAAAAACGGTCTGTACTACCTGGGTATCATGC



CGAAACAGAAAGGTCGTTACAAAGCGCTGTCTTTCGAACCGACCGAAAAAACCTC



TGAAGGTTTCGACAAAATGTACTACGACTACTTCCCGGACGCGGCGAAAATGATC



CCGAAATGCTCTACCCAGCTGAAAGCGGTTACCGCGCACTTCCAGACCCACACCA



CCCCGATCCTGCTGTCTAACAACTTCATCGAACCGCTGGAAATCACCAAAGAAAT



CTACGACCTGAACAACCCGGAAAAAGAACCGAAAAAATTCCAGACCGCGTACGCG



AAAAAAACCGGTGACCAGAAAGGTTACCGTGAAGCGCTGTGCAAATGGATCGACT



TCACCCGTGACTTCCTGTCTAAATACACCAAAACCACCTCTATCGACCTGTCTTC



TCTGCGTCCGTCTTCTCAGTACAAAGACCTGGGTGAATACTACGCGGAACTGAAC



CCGCTGCTGTACCACATCTCTTTCCAGCGTATCGCGGAAAAAGAAATCATGGACG



CGGTTGAAACCGGTAAACTGTACCTGTTCCAGATCTACAACAAAGACTTCGCGAA



AGGTCACCACGGTAAACCGAACCTGCACACCCTGTACTGGACCGGTCTGTTCTCT



CCGGAAAACCTGGCGAAAACCTCTATCAAACTGAACGGTCAGGCGGAACTGTTCT



ACCGTCCGAAATCTCGTATGAAACGTATGGCGCACCGTCTGGGTGAAAAAATGCT



GAACAAAAAACTGAAAGACCAGAAAACCCCGATCCCGGACACCCTGTACCAGGAA



CTGTACGACTACGTTAACCACCGTCTGTCTCACGACCTGTCTGACGAAGCGCGTG



CGCTGCTGCCGAACGTTATCACCAAAGAAGTTTCTCACGAAATCATCAAAGACCG



TCGTTTCACCTCTGACAAATTCTTCTTCCACGTTCCGATCACCCTGAACTACCAG



GCGGCGAACTCTCCGTCTAAATTCAACCAGCGTGTTAACGCGTACCTGAAAGAAC



ACCCGGAAACCCCGATCATCGGTATCGACCGTGGTGAACGTAACCTGATCTACAT



CACCGTTATCGACTCTACCGGTAAAATCCTGGAACAGCGTTCTCTGAACACCATC



CAGCAGTTCGACTACCAGAAAAAACTGGACAACCGTGAAAAAGAACGTGTTGCGG



CGCGTCAGGCGTGGTCTGTTGTTGGTACCATCAAAGACCTGAAACAGGGTTACCT



GTCTCAGGTTATCCACGAAATCGTTGACCTGATGATCCACTACCAGGCGGTTGTT



GTTCTGGAAAACCTGAACTTCGGTTTCAAATCTAAACGTACCGGTATCGCGGAAA



AAGCGGTTTACCAGCAGTTCGAAAAAATGCTGATCGACAAACTGAACTGCCTGGT



TCTGAAAGACTACCCGGCGGAAAAAGTTGGTGGTGTTCTGAACCCGTACCAGCTG



ACCGACCAGTTCACCTCTTTCGCGAAAATGGGTACCCAGTCTGGTTTCCTGTTCT



ACGTTCCGGCGCCGTACACCTCTAAAATCGACCCGCTGACCGGTTTCGTTGACCC



GTTCGTTTGGAAAACCATCAAAAACCACGAATCTCGTAAACACTTCCTGGAAGGT



TTCGACTTCCTGCACTACGACGTTAAAACCGGTGACTTCATCCTGCACTTCAAAA



TGAACCGTAACCTGTCTTTCCAGCGTGGTCTGCCGGGTTTCATGCCGGCGTGGGA



CATCGTTTTCGAAAAAAACGAAACCCAGTTCGACGCGAAAGGTACCCCGTTCATC



GCGGGTAAACGTATCGTTCCGGTTATCGAAAACCACCGTTTCACCGGTCGTTACC



GTGACCTGTACCCGGCGAACGAACTGATCGCGCTGCTGGAAGAAAAAGGTATCGT



TTTCCGTGACGGTTCTAACATCCTGCCGAAACTGCTGGAAAACGACGACTCTCAC



GCGATCGACACCATGGTTGCGCTGATCCGTTCTGTTCTGCAGATGCGTAACTCTA



ACGCGGCGACCGGTGAAGACTACATCAACTCTCCGGTTCGTGACCTGAACGGTGT



TTGCTTCGACTCTCGTTTCCAGAACCCGGAATGGCCGATGGACGCGGACGCGAAC



GGTGCGTACCACATCGCGCTGAAAGGTCAGCTGCTGCTGAACCACCTGAAAGAAT



CTAAAGACCTGAAACTGCAGAACGGTATCTCTAACCAGGACTGGCTGGCGTACAT



CCAGGAACTGCGTAACTA





SEQ ID NO: 53
ATGGCGGTTAAATCTATCAAAGTTAAACTGCGTCTGGACGACATGCCGGAAATCC



GTGCGGGTCTGTGGAAACTGCACAAAGAAGTTAACGCGGGTGTTCGTTACTACAC



CGAATGGCTGTCTCTGCTGCGTCAGGAAAACCTGTACCGTCGTTCTCCGAACGGT



GACGGTGAACAGGAATGCGACAAAACCGCGGAAGAATGCAAAGCGGAACTGCTGG



AACGTCTGCGTGCGCGTCAGGTTGAAAACGGTCACCGTGGTCCGGCGGGTTCTGA



CGACGAACTGCTGCAGCTGGCGCGTCAGCTGTACGAACTGCTGGTTCCGCAGGCG



ATCGGTGCGAAAGGTGACGCGCAGCAGATCGCGCGTAAATTCCTGTCTCCGCTGG



CGGACAAAGACGCGGTTGGTGGTCTGGGTATCGCGAAAGCGGGTAACAAACCGCG



TTGGGTTCGTATGCGTGAAGCGGGTGAACCGGGTTGGGAAGAAGAAAAAGAAAAA



GCGGAAACCCGTAAATCTGCGGACCGTACCGCGGACGTTCTGCGTGCGCTGGCGG



ACTTCGGTCTGAAACCGCTGATGCGTGTTTACACCGACTCTGAAATGTCTTCTGT



TGAATGGAAACCGCTGCGTAAAGGTCAGGCGGTTCGTACCTGGGACCGTGACATG



TTCCAGCAGGCGATCGAACGTATGATGTCTTGGGAATCTTGGAACCAGCGTGTTG



GTCAGGAATACGCGAAACTGGTTGAACAGAAAAACCGTTTCGAACAGAAAAACTT



CGTTGGTCAGGAACACCTGGTTCACCTGGTTAACCAGCTGCAGCAGGACATGAAA



GAAGCGTCTCCGGGTCTGGAATCTAAAGAACAGACCGCGCACTACGTTACCGGTC



GTGCGCTGCGTGGTTCTGACAAAGTTTTCGAAAAATGGGGTAAACTGGCGCCGGA



CGCGCCGTTCGACCTGTACGACGCGGAAATCAAAAACGTTCAGCGTCGTAACACC



CGTCGTTTCGGTTCTCACGACCTGTTCGCGAAACTGGCGGAACCGGAATACCAGG



CGCTGTGGCGTGAAGACGCGTCTTTCCTGACCCGTTACGCGGTTTACAACTCTAT



CCTGCGTAAACTGAACCACGCGAAAATGTTCGCGACCTTCACCCTGCCGGACGCG



ACCGCGCACCCGATCTGGACCCGTTTCGACAAACTGGGTGGTAACCTGCACCAGT



ACACCTTCCTGTTCAACGAATTCGGTGAACGTCGTCACGCGATCCGTTTCCACAA



ACTGCTGAAAGTTGAAAACGGTGTTGCGCGTGAAGTTGACGACGTTACCGTTCCG



ATCTCTATGTCTGAACAGCTGGACAACCTGCTGCCGCGTGACCCGAACGAACCGA



TCGCGCTGTACTTCCGTGACTACGGTGCGGAACAGCACTTCACCGGTGAATTCGG



TGGTGCGAAAATCCAGTGCCGTCGTGACCAGCTGGCGCACATGCACCGTCGTCGT



GGTGCGCGTGACGTTTACCTGAACGTTTCTGTTCGTGTTCAGTCTCAGTCTGAAG



CGCGTGGTGAACGTCGTCCGCCGTACGCGGCGGTTTTCCGTCTGGTTGGTGACAA



CCACCGTGCGTTCGTTCACTTCGACAAACTGTCTGACTACCTGGCGGAACACCCG



GACGACGGTAAACTGGGTTCTGAAGGTCTGCTGTCTGGTCTGCGTGTTATGTCTG



TTGACCTGGGTCTGCGTACCTCTGCGTCTATCTCTGTTTTCCGTGTTGCGCGTAA



AGACGAACTGAAACCGAACTCTAAAGGTCGTGTTCCGTTCTTCTTCCCGATCAAA



GGTAACGACAACCTGGTTGCGGTTCACGAACGTTCTCAGCTGCTGAAACTGCCGG



GTGAAACCGAATCTAAAGACCTGCGTGCGATCCGTGAAGAACGTCAGCGTACCCT



GCGTCAGCTGCGTACCCAGCTGGCGTACCTGCGTCTGCTGGTTCGTTGCGGTTCT



GAAGACGTTGGTCGTCGTGAACGTTCTTGGGCGAAACTGATCGAACAGCCGGTTG



ACGCGGCGAACCACATGACCCCGGACTGGCGTGAAGCGTTCGAAAACGAACTGCA



GAAACTGAAATCTCTGCACGGTATCTGCTCTGACAAAGAATGGATGGACGCGGTT



TACGAATCTGTTCGTCGTGTTTGGCGTCACATGGGTAAACAGGTTCGTGACTGGC



GTAAAGACGTTCGTTCTGGTGAACGTCCGAAAATCCGTGGTTACGCGAAAGACGT



TGTTGGTGGTAACTCTATCGAACAGATCGAATACCTGGAACGTCAGTACAAATTC



CTGAAATCTTGGTCTTTCTTCGGTAAAGTTTCTGGTCAGGTTATCCGTGCGGAAA



AAGGTTCTCGTTTCGCGATCACCCTGCGTGAACACATCGACCACGCGAAAGAAGA



CCGTCTGAAAAAACTGGCGGACCGTATCATCATGGAAGCGCTGGGTTACGTTTAC



GCGCTGGACGAACGTGGTAAAGGTAAATGGGTTGCGAAATACCCGCCGTGCCAGC



TGATCCTGCTGGAAGAACTGTCTGAATACCAGTTCAACAACGACCGTCCGCCGTC



TGAAAACAACCAGCTGATGCAGTGGTCTCACCGTGGTGTTTTCCAGGAACTGATC



AACCAGGCGCAGGTTCACGACCTGCTGGTTGGTACCATGTACGCGGCGTTCTCTT



CTCGTTTCGACGCGCGTACCGGTGCGCCGGGTATCCGTTGCCGTCGTGTTCCGGC



GCGTTGCACCCAGGAACACAACCCGGAACCGTTCCCGTGGTGGCTGAACAAATTC



GTTGTTGAACACACCCTGGACGCGTGCCCGCTGCGTGCGGACGACCTGATCCCGA



CCGGTGAAGGTGAAATCTTCGTTTCTCCGTTCTCTGCGGAAGAAGGTGACTTCCA



CCAGATCCACGCGGACCTGAACGCGGCGCAGAACCTGCAGCAGCGTCTGTGGTCT



GACTTCGACATCTCTCAGATCCGTCTGCGTTGCGACTGGGGTGAAGTTGACGGTG



AACTGGTTCTGATCCCGCGTCTGACCGGTAAACGTACCGCGGACTCTTACTCTAA



CAAAGTTTTCTACACCAACACCGGTGTTACCTACTACGAACGTGAACGTGGTAAA



AAACGTCGTAAAGTTTTCGCGCAGGAAAAACTGTCTGAAGAAGAAGCGGAACTGC



TGGTTGAAGCGGACGAAGCGCGTGAAAAATCTGTTGTTCTGATGCGTGACCCGTC



TGGTATCATCAACCGTGGTAACTGGACCCGTCAGAAAGAATTCTGGTCTATGGTT



AACCAGCGTATCGAAGGTTACCTGGTTAAACAGATCCGTTCTCGTGTTCCGCTGC



AGGACTCTGCGTGCGAAAACACCGGTGACATCTAA





SEQ ID NO: 54
ATGGCGACCCGTTCTTTCATCCTGAAAATCGAACCGAACGAAGAAGTTAAAAAAG



GTCTGTGGAAAACCCACGAAGTTCTGAACCACGGTATCGCGTACTACATGAACAT



CCTGAAACTGATCCGTCAGGAAGCGATCTACGAACACCACGAACAGGACCCGAAA



AACCCGAAAAAAGTTTCTAAAGCGGAAATCCAGGCGGAACTGTGGGACTTCGTTC



TGAAAATGCAGAAATGCAACTCTTTCACCCACGAAGTTGACAAAGACGTTGTTTT



CAACATCCTGCGTGAACTGTACGAAGAACTGGTTCCGTCTTCTGTTGAAAAAAAA



GGTGAAGCGAACCAGCTGTCTAACAAATTCCTGTACCCGCTGGTTGACCCGAACT



CTCAGTCTGGTAAAGGTACCGCGTCTTCTGGTCGTAAACCGCGTTGGTACAACCT



GAAAATCGCGGGTGACCCGTCTTGGGAAGAAGAAAAAAAAAAATGGGAAGAAGAC



AAAAAAAAAGACCCGCTGGCGAAAATCCTGGGTAAACTGGCGGAATACGGTCTGA



TCCCGCTGTTCATCCCGTTCACCGACTCTAACGAACCGATCGTTAAAGAAATCAA



ATGGATGGAAAAATCTCGTAACCAGTCTGTTCGTCGTCTGGACAAAGACATGTTC



ATCCAGGCGCTGGAACGTTTCCTGTCTTGGGAATCTTGGAACCTGAAAGTTAAAG



AAGAATACGAAAAAGTTGAAAAAGAACACAAAACCCTGGAAGAACGTATCAAAGA



AGACATCCAGGCGTTCAAATCTCTGGAACAGTACGAAAAAGAACGTCAGGAACAG



CTGCTGCGTGACACCCTGAACACCAACGAATACCGTCTGTCTAAACGTGGTCTGC



GTGGTTGGCGTGAAATCATCCAGAAATGGCTGAAAATGGACGAAAACGAACCGTC



TGAAAAATACCTGGAAGTTTTCAAAGACTACCAGCGTAAACACCCGCGTGAAGCG



GGTGACTACTCTGTTTACGAATTCCTGTCTAAAAAAGAAAACCACTTCATCTGGC



GTAACCACCCGGAATACCCGTACCTGTACGCGACCTTCTGCGAAATCGACAAAAA



AAAAAAAGACGCGAAACAGCAGGCGACCTTCACCCTGGCGGACCCGATCAACCAC



CCGCTGTGGGTTCGTTTCGAAGAACGTTCTGGTTCTAACCTGAACAAATACCGTA



TCCTGACCGAACAGCTGCACACCGAAAAACTGAAAAAAAAACTGACCGTTCAGCT



GGACCGTCTGATCTACCCGACCGAATCTGGTGGTTGGGAAGAAAAAGGTAAAGTT



GACATCGTTCTGCTGCCGTCTCGTCAGTTCTACAACCAGATCTTCCTGGACATCG



AAGAAAAAGGTAAACACGCGTTCACCTACAAAGACGAATCTATCAAATTCCCGCT



GAAAGGTACCCTGGGTGGTGCGCGTGTTCAGTTCGACCGTGACCACCTGCGTCGT



TACCCGCACAAAGTTGAATCTGGTAACGTTGGTCGTATCTACTTCAACATGACCG



TTAACATCGAACCGACCGAATCTCCGGTTTCTAAATCTCTGAAAATCCACCGTGA



CGACTTCCCGAAATTCGTTAACTTCAAACCGAAAGAACTGACCGAATGGATCAAA



GACTCTAAAGGTAAAAAACTGAAATCTGGTATCGAATCTCTGGAAATCGGTCTGC



GTGTTATGTCTATCGACCTGGGTCAGCGTCAGGCGGCGGCGGCGTCTATCTTCGA



AGTTGTTGACCAGAAACCGGACATCGAAGGTAAACTGTTCTTCCCGATCAAAGGT



ACCGAACTGTACGCGGTTCACCGTGCGTCTTTCAACATCAAACTGCCGGGTGAAA



CCCTGGTTAAATCTCGTGAAGTTCTGCGTAAAGCGCGTGAAGACAACCTGAAACT



GATGAACCAGAAACTGAACTTCCTGCGTAACGTTCTGCACTTCCAGCAGTTCGAA



GACATCACCGAACGTGAAAAACGTGTTACCAAATGGATCTCTCGTCAGGAAAACT



CTGACGTTCCGCTGGTTTACCAGGACGAACTGATCCAGATCCGTGAACTGATGTA



CAAACCGTACAAAGACTGGGTTGCGTTCCTGAAACAGCTGCACAAACGTCTGGAA



GTTGAAATCGGTAAAGAAGTTAAACACTGGCGTAAATCTCTGTCTGACGGTCGTA



AAGGTCTGTACGGTATCTCTCTGAAAAACATCGACGAAATCGACCGTACCCGTAA



ATTCCTGCTGCGTTGGTCTCTGCGTCCGACCGAACCGGGTGAAGTTCGTCGTCTG



GAACCGGGTCAGCGTTTCGCGATCGACCAGCTGAACCACCTGAACGCGCTGAAAG



AAGACCGTCTGAAAAAAATGGCGAACACCATCATCATGCACGCGCTGGGTTACTG



CTACGACGTTCGTAAAAAAAAATGGCAGGCGAAAAACCCGGCGTGCCAGATCATC



CTGTTCGAAGACCTGTCTAACTACAACCCGTACGAAGAACGTTCTCGTTTCGAAA



ACTCTAAACTGATGAAATGGTCTCGTCGTGAAATCCCGCGTCAGGTTGCGCTGCA



GGGTGAAATCTACGGTCTGCAGGTTGGTGAAGTTGGTGCGCAGTTCTCTTCTCGT



TTCCACGCGAAAACCGGTTCTCCGGGTATCCGTTGCTCTGTTGTTACCAAAGAAA



AACTGCAGGACAACCGTTTCTTCAAAAACCTGCAGCGTGAAGGTCGTCTGACCCT



GGACAAAATCGCGGTTCTGAAAGAAGGTGACCTGTACCCGGACAAAGGTGGTGAA



AAATTCATCTCTCTGTCTAAAGACCGTAAACTGGTTACCACCCACGCGGACATCA



ACGCGGCGCAGAACCTGCAGAAACGTTTCTGGACCCGTACCCACGGTTTCTACAA



AGTTTACTGCAAAGCGTACCAGGTTGACGGTCAGACCGTTTACATCCCGGAATCT



AAAGACCAGAAACAGAAAATCATCGAAGAATTCGGTGAAGGTTACTTCATCCTGA



AAGACGGTGTTTACGAATGGGGTAACGCGGGTAAACTGAAAATCAAAAAAGGTTC



TTCTAAACAGTCTTCTTCTGAACTGGTTGACTCTGACATCCTGAAAGACTCTTTC



GACCTGGCGTCTGAACTGAAAGGTGAAAAACTGATGCTGTACCGTGACCCGTCTG



GTAACGTTTTCCCGTCTGACAAATGGATGGCGGCGGGTGTTTTCTTCGGTAAACT



GGAACGTATCCTGATCTCTAAACTGACCAACCAGTACTCTATCTCTACCATCGAA



GACGACTCTTCTAAACAGTCTATGTAA





SEQ ID NO: 55
ATGCCGACCCGTACCATCAACCTGAAACTGGTTCTGGGTAAAAACCCGGAAAACG



CGACCCTGCGTCGTGCGCTGTTCTCTACCCACCGTCTGGTTAACCAGGCGACCAA



ACGTATCGAAGAATTCCTGCTGCTGTGCCGTGGTGAAGCGTACCGTACCGTTGAC



AACGAAGGTAAAGAAGCGGAAATCCCGCGTCACGCGGTTCAGGAAGAAGCGCTGG



CGTTCGCGAAAGCGGCGCAGCGTCACAACGGTTGCATCTCTACCTACGAAGACCA



GGAAATCCTGGACGTTCTGCGTCAGCTGTACGAACGTCTGGTTCCGTCTGTTAAC



GAAAACAACGAAGCGGGTGACGCGCAGGCGGCGAACGCGTGGGTTTCTCCGCTGA



TGTCTGCGGAATCTGAAGGTGGTCTGTCTGTTTACGACAAAGTTCTGGACCCGCC



GCCGGTTTGGATGAAACTGAAAGAAGAAAAAGCGCCGGGTTGGGAAGCGGCGTCT



CAGATCTGGATCCAGTCTGACGAAGGTCAGTCTCTGCTGAACAAACCGGGTTCTC



CGCCGCGTTGGATCCGTAAACTGCGTTCTGGTCAGCCGTGGCAGGACGACTTCGT



TTCTGACCAGAAAAAAAAACAGGACGAACTGACCAAAGGTAACGCGCCGCTGATC



AAACAGCTGAAAGAAATGGGTCTGCTGCCGCTGGTTAACCCGTTCTTCCGTCACC



TGCTGGACCCGGAAGGTAAAGGTGTTTCTCCGTGGGACCGTCTGGCGGTTCGTGC



GGCGGTTGCGCACTTCATCTCTTGGGAATCTTGGAACCACCGTACCCGTGCGGAA



TACAACTCTCTGAAACTGCGTCGTGACGAATTCGAAGCGGCGTCTGACGAATTCA



AAGACGACTTCACCCTGCTGCGTCAGTACGAAGCGAAACGTCACTCTACCCTGAA



ATCTATCGCGCTGGCGGACGACTCTAACCCGTACCGTATCGGTGTTCGTTCTCTG



CGTGCGTGGAACCGTGTTCGTGAAGAATGGATCGACAAAGGTGCGACCGAAGAAC



AGCGTGTTACCATCCTGTCTAAACTGCAGACCCAGCTGCGTGGTAAATTCGGTGA



CCCGGACCTGTTCAACTGGCTGGCGCAGGACCGTCACGTTCACCTGTGGTCTCCG



CGTGACTCTGTTACCCCGCTGGTTCGTATCAACGCGGTTGACAAAGTTCTGCGTC



GTCGTAAACCGTACGCGCTGATGACCTTCGCGCACCCGCGTTTCCACCCGCGTTG



GATCCTGTACGAAGCGCCGGGTGGTTCTAACCTGCGTCAGTACGCGCTGGACTGC



ACCGAAAACGCGCTGCACATCACCCTGCCGCTGCTGGTTGACGACGCGCACGGTA



CCTGGATCGAAAAAAAAATCCGTGTTCCGCTGGCGCCGTCTGGTCAGATCCAGGA



CCTGACCCTGGAAAAACTGGAAAAAAAAAAAAACCGTCTGTACTACCGTTCTGGT



TTCCAGCAGTTCGCGGGTCTGGCGGGTGGTGCGGAAGTTCTGTTCCACCGTCCGT



ACATGGAACACGACGAACGTTCTGAAGAATCTCTGCTGGAACGTCCGGGTGCGGT



TTGGTTCAAACTGACCCTGGACGTTGCGACCCAGGCGCCGCCGAACTGGCTGGAC



GGTAAAGGTCGTGTTCGTACCCCGCCGGAAGTTCACCACTTCAAAACCGCGCTGT



CTAACAAATCTAAACACACCCGTACCCTGCAGCCGGGTCTGCGTGTTCTGTCTGT



TGACCTGGGTATGCGTACCTTCGCGTCTTGCTCTGTTTTCGAACTGATCGAAGGT



AAACCGGAAACCGGTCGTGCGTTCCCGGTTGCGGACGAACGTTCTATGGACTCTC



CGAACAAACTGTGGGCGAAACACGAACGTTCTTTCAAACTGACCCTGCCGGGTGA



AACCCCGTCTCGTAAAGAAGAAGAAGAACGTTCTATCGCGCGTGCGGAAATCTAC



GCGCTGAAACGTGACATCCAGCGTCTGAAATCTCTGCTGCGTCTGGGTGAAGAAG



ACAACGACAACCGTCGTGACGCGCTGCTGGAACAGTTCTTCAAAGGTTGGGGTGA



AGAAGACGTTGTTCCGGGTCAGGCGTTCCCGCGTTCTCTGTTCCAGGGTCTGGGT



GCGGCGCCGTTCCGTTCTACCCCGGAACTGTGGCGTCAGCACTGCCAGACCTACT



ACGACAAAGCGGAAGCGTGCCTGGCGAAACACATCTCTGACTGGCGTAAACGTAC



CCGTCCGCGTCCGACCTCTCGTGAAATGTGGTACAAAACCCGTTCTTACCACGGT



GGTAAATCTATCTGGATGCTGGAATACCTGGACGCGGTTCGTAAACTGCTGCTGT



CTTGGTCTCTGCGTGGTCGTACCTACGGTGCGATCAACCGTCAGGACACCGCGCG



TTTCGGTTCTCTGGCGTCTCGTCTGCTGCACCACATCAACTCTCTGAAAGAAGAC



CGTATCAAAACCGGTGCGGACTCTATCGTTCAGGCGGCGCGTGGTTACATCCCGC



TGCCGCACGGTAAAGGTTGGGAACAGCGTTACGAACCGTGCCAGCTGATCCTGTT



CGAAGACCTGGCGCGTTACCGTTTCCGTGTTGACCGTCCGCGTCGTGAAAACTCT



CAGCTGATGCAGTGGAACCACCGTGCGATCGTTGCGGAAACCACCATGCAGGCGG



AACTGTACGGTCAGATCGTTGAAAACACCGCGGCGGGTTTCTCTTCTCGTTTCCA



CGCGGCGACCGGTGCGCCGGGTGTTCGTTGCCGTTTCCTGCTGGAACGTGACTTC



GACAACGACCTGCCGAAACCGTACCTGCTGCGTGAACTGTCTTGGATGCTGGGTA



ACACCAAAGTTGAATCTGAAGAAGAAAAACTGCGTCTGCTGTCTGAAAAAATCCG



TCCGGGTTCTCTGGTTCCGTGGGACGGTGGTGAACAGTTCGCGACCCTGCACCCG



AAACGTCAGACCCTGTGCGTTATCCACGCGGACATGAACGCGGCGCAGAACCTGC



AGCGTCGTTTCTTCGGTCGTTGCGGTGAAGCGTTCCGTCTGGTTTGCCAGCCGCA



CGGTGACGACGTTCTGCGTCTGGCGTCTACCCCGGGTGCGCGTCTGCTGGGTGCG



CTGCAGCAGCTGGAAAACGGTCAGGGTGCGTTCGAACTGGTTCGTGACATGGGTT



CTACCTCTCAGATGAACCGTTTCGTTATGAAATCTCTGGGTAAAAAAAAAATCAA



ACCGCTGCAGGACAACAACGGTGACGACGAACTGGAAGACGTTCTGTCTGTTCTG



CCGGAAGAAGACGACACCGGTCGTATCACCGTTTTCCGTGACTCTTCTGGTATCT



TCTTCCCGTGCAACGTTTGGATCCCGGCGAAACAGTTCTGGCCGGCGGTTCGTGC



GATGATCTGGAAAGTTATGGCGTCTCACTCTCTGGGTTAA





SEQ ID NO: 56
ATGACCAAACTGCGTCACCGTCAGAAAAAACTGACCCACGACTGGGCGGGTTCTA



AAAAACGTGAAGTTCTGGGTTCTAACGGTAAACTGCAGAACCCGCTGCTGATGCC



GGTTAAAAAAGGTCAGGTTACCGAATTCCGTAAAGCGTTCTCTGCGTACGCGCGT



GCGACCAAAGGTGAAATGACCGACGGTCGTAAAAACATGTTCACCCACTCTTTCG



AACCGTTCAAAACCAAACCGTCTCTGCACCAGTGCGAACTGGCGGACAAAGCGTA



CCAGTCTCTGCACTCTTACCTGCCGGGTTCTCTGGCGCACTTCCTGCTGTCTGCG



CACGCGCTGGGTTTCCGTATCTTCTCTAAATCTGGTGAAGCGACCGCGTTCCAGG



CGTCTTCTAAAATCGAAGCGTACGAATCTAAACTGGCGTCTGAACTGGCGTGCGT



TGACCTGTCTATCCAGAACCTGACCATCTCTACCCTGTTCAACGCGCTGACCACC



TCTGTTCGTGGTAAAGGTGAAGAAACCTCTGCGGACCCGCTGATCGCGCGTTTCT



ACACCCTGCTGACCGGTAAACCGCTGTCTCGTGACACCCAGGGTCCGGAACGTGA



CCTGGCGGAAGTTATCTCTCGTAAAATCGCGTCTTCTTTCGGTACCTGGAAAGAA



ATGACCGCGAACCCGCTGCAGTCTCTGCAGTTCTTCGAAGAAGAACTGCACGCGC



TGGACGCGAACGTTTCTCTGTCTCCGGCGTTCGACGTTCTGATCAAAATGAACGA



CCTGCAGGGTGACCTGAAAAACCGTACCATCGTTTTCGACCCGGACGCGCCGGTT



TTCGAATACAACGCGGAAGACCCGGCGGACATCATCATCAAACTGACCGCGCGTT



ACGCGAAAGAAGCGGTTATCAAAAACCAGAACGTTGGTAACTACGTTAAAAACGC



GATCACCACCACCAACGCGAACGGTCTGGGTTGGCTGCTGAACAAAGGTCTGTCT



CTGCTGCCGGTTTCTACCGACGACGAACTGCTGGAATTCATCGGTGTTGAACGTT



CTCACCCGTCTTGCCACGCGCTGATCGAACTGATCGCGCAGCTGGAAGCGCCGGA



ACTGTTCGAAAAAAACGTTTTCTCTGACACCCGTTCTGAAGTTCAGGGTATGATC



GACTCTGCGGTTTCTAACCACATCGCGCGTCTGTCTTCTTCTCGTAACTCTCTGT



CTATGGACTCTGAAGAACTGGAACGTCTGATCAAATCTTTCCAGATCCACACCCC



GCACTGCTCTCTGTTCATCGGTGCGCAGTCTCTGTCTCAGCAGCTGGAATCTCTG



CCGGAAGCGCTGCAGTCTGGTGTTAACTCTGCGGACATCCTGCTGGGTTCTACCC



AGTACATGCTGACCAACTCTCTGGTTGAAGAATCTATCGCGACCTACCAGCGTAC



CCTGAACCGTATCAACTACCTGTCTGGTGTTGCGGGTCAGATCAACGGTGCGATC



AAACGTAAAGCGATCGACGGTGAAAAAATCCACCTGCCGGCGGCGTGGTCTGAAC



TGATCTCTCTGCCGTTCATCGGTCAGCCGGTTATCGACGTTGAATCTGACCTGGC



GCACCTGAAAAACCAGTACCAGACCCTGTCTAACGAATTCGACACCCTGATCTCT



GCGCTGCAGAAAAACTTCGACCTGAACTTCAACAAAGCGCTGCTGAACCGTACCC



AGCACTTCGAAGCGATGTGCCGTTCTACCAAAAAAAACGCGCTGTCTAAACCGGA



AATCGTTTCTTACCGTGACCTGCTGGCGCGTCTGACCTCTTGCCTGTACCGTGGT



TCTCTGGTTCTGCGTCGTGCGGGTATCGAAGTTCTGAAAAAACACAAAATCTTCG



AATCTAACTCTGAACTGCGTGAACACGTTCACGAACGTAAACACTTCGTTTTCGT



TTCTCCGCTGGACCGTAAAGCGAAAAAACTGCTGCGTCTGACCGACTCTCGTCCG



GACCTGCTGCACGTTATCGACGAAATCCTGCAGCACGACAACCTGGAAAACAAAG



ACCGTGAATCTCTGTGGCTGGTTCGTTCTGGTTACCTGCTGGCGGGTCTGCCGGA



CCAGCTGTCTTCTTCTTTCATCAACCTGCCGATCATCACCCAGAAAGGTGACCGT



CGTCTGATCGACCTGATCCAGTACGACCAGATCAACCGTGACGCGTTCGTTATGC



TGGTTACCTCTGCGTTCAAATCTAACCTGTCTGGTCTGCAGTACCGTGCGAACAA



ACAGTCTTTCGTTGTTACCCGTACCCTGTCTCCGTACCTGGGTTCTAAACTGGTT



TACGTTCCGAAAGACAAAGACTGGCTGGTTCCGTCTCAGATGTTCGAAGGTCGTT



TCGCGGACATCCTGCAGTCTGACTACATGGTTTGGAAAGACGCGGGTCGTCTGTG



CGTTATCGACACCGCGAAACACCTGTCTAACATCAAAAAATCTGTTTTCTCTTCT



GAAGAAGTTCTGGCGTTCCTGCGTGAACTGCCGCACCGTACCTTCATCCAGACCG



AAGTTCGTGGTCTGGGTGTTAACGTTGACGGTATCGCGTTCAACAACGGTGACAT



CCCGTCTCTGAAAACCTTCTCTAACTGCGTTCAGGTTAAAGTTTCTCGTACCAAC



ACCTCTCTGGTTCAGACCCTGAACCGTTGGTTCGAAGGTGGTAAAGTTTCTCCGC



CGTCTATCCAGTTCGAACGTGCGTACTACAAAAAAGACGACCAGATCCACGAAGA



CGCGGCGAAACGTAAAATCCGTTTCCAGATGCCGGCGACCGAACTGGTTCACGCG



TCTGACGACGCGGGTTGGACCCCGTCTTACCTGCTGGGTATCGACCCGGGTGAAT



ACGGTATGGGTCTGTCTCTGGTTTCTATCAACAACGGTGAAGTTCTGGACTCTGG



TTTCATCCACATCAACTCTCTGATCAACTTCGCGTCTAAAAAATCTAACCACCAG



ACCAAAGTTGTTCCGCGTCAGCAGTACAAATCTCCGTACGCGAACTACCTGGAAC



AGTCTAAAGACTCTGCGGCGGGTGACATCGCGCACATCCTGGACCGTCTGATCTA



CAAACTGAACGCGCTGCCGGTTTTCGAAGCGCTGTCTGGTAACTCTCAGTCTGCG



GCGGACCAGGTTTGGACCAAAGTTCTGTCTTTCTACACCTGGGGTGACAACGACG



CGCAGAACTCTATCCGTAAACAGCACTGGTTCGGTGCGTCTCACTGGGACATCAA



AGGTATGCTGCGTCAGCCGCCGACCGAAAAAAAACCGAAACCGTACATCGCGTTC



CCGGGTTCTCAGGTTTCTTCTTACGGTAACTCTCAGCGTTGCTCTTGCTGCGGTC



GTAACCCGATCGAACAGCTGCGTGAAATGGCGAAAGACACCTCTATCAAAGAACT



GAAAATCCGTAACTCTGAAATCCAGCTGTTCGACGGTACCATCAAACTGTTCAAC



CCGGACCCGTCTACCGTTATCGAACGTCGTCGTCACAACCTGGGTCCGTCTCGTA



TCCCGGTTGCGGACCGTACCTTCAAAAACATCTCTCCGTCTTCTCTGGAATTCAA



AGAACTGATCACCATCGTTTCTCGTTCTATCCGTCACTCTCCGGAATTCATCGCG



AAAAAACGTGGTATCGGTTCTGAATACTTCTGCGCGTACTCTGACTGCAACTCTT



CTCTGAACTCTGAAGCGAACGCGGCGGCGAACGTTGCGCAGAAATTCCAGAAACA



GCTGTTCTTCGAACTGTAA





SEQ ID NO: 57
ATGAAACGTATCCTGAACTCTCTGAAAGTTGCGGCGCTGCGTCTGCTGTTCCGTG



GTAAAGGTTCTGAACTGGTTAAAACCGTTAAATACCCGCTGGTTTCTCCGGTTCA



GGGTGCGGTTGAAGAACTGGCGGAAGCGATCCGTCACGACAACCTGCACCTGTTC



GGTCAGAAAGAAATCGTTGACCTGATGGAAAAAGACGAAGGTACCCAGGTTTACT



CTGTTGTTGACTTCTGGCTGGACACCCTGCGTCTGGGTATGTTCTTCTCTCCGTC



TGCGAACGCGCTGAAAATCACCCTGGGTAAATTCAACTCTGACCAGGTTTCTCCG



TTCCGTAAAGTTCTGGAACAGTCTCCGTTCTTCCTGGCGGGTCGTCTGAAAGTTG



AACCGGCGGAACGTATCCTGTCTGTTGAAATCCGTAAAATCGGTAAACGTGAAAA



CCGTGTTGAAAACTACGCGGCGGACGTTGAAACCTGCTTCATCGGTCAGCTGTCT



TCTGACGAAAAACAGTCTATCCAGAAACTGGCGAACGACATCTGGGACTCTAAAG



ACCACGAAGAACAGCGTATGCTGAAAGCGGACTTCTTCGCGATCCCGCTGATCAA



AGACCCGAAAGCGGTTACCGAAGAAGACCCGGAAAACGAAACCGCGGGTAAACAG



AAACCGCTGGAACTGTGCGTTTGCCTGGTTCCGGAACTGTACACCCGTGGTTTCG



GTTCTATCGCGGACTTCCTGGTTCAGCGTCTGACCCTGCTGCGTGACAAAATGTC



TACCGACACCGCGGAAGACTGCCTGGAATACGTTGGTATCGAAGAAGAAAAAGGT



AACGGTATGAACTCTCTGCTGGGTACCTTCCTGAAAAACCTGCAGGGTGACGGTT



TCGAACAGATCTTCCAGTTCATGCTGGGTTCTTACGTTGGTTGGCAGGGTAAAGA



AGACGTTCTGCGTGAACGTCTGGACCTGCTGGCGGAAAAAGTTAAACGTCTGCCG



AAACCGAAATTCGCGGGTGAATGGTCTGGTCACCGTATGTTCCTGCACGGTCAGC



TGAAATCTTGGTCTTCTAACTTCTTCCGTCTGTTCAACGAAACCCGTGAACTGCT



GGAATCTATCAAATCTGACATCCAGCACGCGACCATGCTGATCTCTTACGTTGAA



GAAAAAGGTGGTTACCACCCGCAGCTGCTGTCTCAGTACCGTAAACTGATGGAAC



AGCTGCCGGCGCTGCGTACCAAAGTTCTGGACCCGGAAATCGAAATGACCCACAT



GTCTGAAGCGGTTCGTTCTTACATCATGATCCACAAATCTGTTGCGGGTTTCCTG



CCGGACCTGCTGGAATCTCTGGACCGTGACAAAGACCGTGAATTCCTGCTGTCTA



TCTTCCCGCGTATCCCGAAAATCGACAAAAAAACCAAAGAAATCGTTGCGTGGGA



ACTGCCGGGTGAACCGGAAGAAGGTTACCTGTTCACCGCGAACAACCTGTTCCGT



AACTTCCTGGAAAACCCGAAACACGTTCCGCGTTTCATGGCGGAACGTATCCCGG



AAGACTGGACCCGTCTGCGTTCTGCGCCGGTTTGGTTCGACGGTATGGTTAAACA



GTGGCAGAAAGTTGTTAACCAGCTGGTTGAATCTCCGGGTGCGCTGTACCAGTTC



AACGAATCTTTCCTGCGTCAGCGTCTGCAGGCGATGCTGACCGTTTACAAACGTG



ACCTGCAGACCGAAAAATTCCTGAAACTGCTGGCGGACGTTTGCCGTCCGCTGGT



TGACTTCTTCGGTCTGGGTGGTAACGACATCATCTTCAAATCTTGCCAGGACCCG



CGTAAACAGTGGCAGACCGTTATCCCGCTGTCTGTTCCGGCGGACGTTTACACCG



CGTGCGAAGGTCTGGCGATCCGTCTGCGTGAAACCCTGGGTTTCGAATGGAAAAA



CCTGAAAGGTCACGAACGTGAAGACTTCCTGCGTCTGCACCAGCTGCTGGGTAAC



CTGCTGTTCTGGATCCGTGACGCGAAACTGGTTGTTAAACTGGAAGACTGGATGA



ACAACCCGTGCGTTCAGGAATACGTTGAAGCGCGTAAAGCGATCGACCTGCCGCT



GGAAATCTTCGGTTTCGAAGTTCCGATCTTCCTGAACGGTTACCTGTTCTCTGAA



CTGCGTCAGCTGGAACTGCTGCTGCGTCGTAAATCTGTTATGACCTCTTACTCTG



TTAAAACCACCGGTTCTCCGAACCGTCTGTTCCAGCTGGTTTACCTGCCGCTGAA



CCCGTCTGACCCGGAAAAAAAAAACTCTAACAACTTCCAGGAACGTCTGGACACC



CCGACCGGTCTGTCTCGTCGTTTCCTGGACCTGACCCTGGACGCGTTCGCGGGTA



AACTGCTGACCGACCCGGTTACCCAGGAACTGAAAACCATGGCGGGTTTCTACGA



CCACCTGTTCGGTTTCAAACTGCCGTGCAAACTGGCGGCGATGTCTAACCACCCG



GGTTCTTCTTCTAAAATGGTTGTTCTGGCGAAACCGAAAAAAGGTGTTGCGTCTA



ACATCGGTTTCGAACCGATCCCGGACCCGGCGCACCCGGTTTTCCGTGTTCGTTC



TTCTTGGCCGGAACTGAAATACCTGGAAGGTCTGCTGTACCTGCCGGAAGACACC



CCGCTGACCATCGAACTGGCGGAAACCTCTGTTTCTTGCCAGTCTGTTTCTTCTG



TTGCGTTCGACCTGAAAAACCTGACCACCATCCTGGGTCGTGTTGGTGAATTCCG



TGTTACCGCGGACCAGCCGTTCAAACTGACCCCGATCATCCCGGAAAAAGAAGAA



TCTTTCATCGGTAAAACCTACCTGGGTCTGGACGCGGGTGAACGTTCTGGTGTTG



GTTTCGCGATCGTTACCGTTGACGGTGACGGTTACGAAGTTCAGCGTCTGGGTGT



TCACGAAGACACCCAGCTGATGGCGCTGCAGCAGGTTGCGTCTAAATCTCTGAAA



GAACCGGTTTTCCAGCCGCTGCGTAAAGGTACCTTCCGTCAGCAGGAACGTATCC



GTAAATCTCTGCGTGGTTGCTACTGGAACTTCTACCACGCGCTGATGATCAAATA



CCGTGCGAAAGTTGTTCACGAAGAATCTGTTGGTTCTTCTGGTCTGGTTGGTCAG



TGGCTGCGTGCGTTCCAGAAAGACCTGAAAAAAGCGGACGTTCTGCCGAAAAAAG



GTGGTAAAAACGGTGTTGACAAAAAAAAACGTGAATCTTCTGCGCAGGACACCCT



GTGGGGTGGTGCGTTCTCTAAAAAAGAAGAACAGCAGATCGCGTTCGAAGTTCAG



GCGGCGGGTTCTTCTCAGTTCTGCCTGAAATGCGGTTGGTGGTTCCAGCTGGGTA



TGCGTGAAGTTAACCGTGTTCAGGAATCTGGTGTTGTTCTGGACTGGAACCGTTC



TATCGTTACCTTCCTGATCGAATCTTCTGGTGAAAAAGTTTACGGTTTCTCTCCG



CAGCAGCTGGAAAAAGGTTTCCGTCCGGACATCGAAACCTTCAAAAAAATGGTTC



GTGACTTCATGCGTCCGCCGATGTTCGACCGTAAAGGTCGTCCGGCGGCGGCGTA



CGAACGTTTCGTTCTGGGTCGTCGTCACCGTCGTTACCGTTTCGACAAAGTTTTC



GAAGAACGTTTCGGTCGTTCTGCGCTGTTCATCTGCCCGCGTGTTGGTTGCGGTA



ACTTCGACCACTCTTCTGAACAGTCTGCGGTTGTTCTGGCGCTGATCGGTTACAT



CGCGGACAAAGAAGGTATGTCTGGTAAAAAACTGGTTTACGTTCGTCTGGCGGAA



CTGATGGCGGAATGGAAACTGAAAAAACTGGAACGTTCTCGTGTTGAAGAACAGT



CTTCTGCGCAGTAA





SEQ ID NO: 58
ATGGCGGAATCTAAACAGATGCAGTGCCGTAAATGCGGTGCGTCTATGAAATACG



AAGTTATCGGTCTGGGTAAAAAATCTTGCCGTTACATGTGCCCGGACTGCGGTAA



CCACACCTCTGCGCGTAAAATCCAGAACAAAAAAAAACGTGACAAAAAATACGGT



TCTGCGTCTAAAGCGCAGTCTCAGCGTATCGCGGTTGCGGGTGCGCTGTACCCGG



ACAAAAAAGTTCAGACCATCAAAACCTACAAATACCCGGCGGACCTGAACGGTGA



AGTTCACGACTCTGGTGTTGCGGAAAAAATCGCGCAGGCGATCCAGGAAGACGAA



ATCGGTCTGCTGGGTCCGTCTTCTGAATACGCGTGCTGGATCGCGTCTCAGAAAC



AGTCTGAACCGTACTCTGTTGTTGACTTCTGGTTCGACGCGGTTTGCGCGGGTGG



TGTTTTCGCGTACTCTGGTGCGCGTCTGCTGTCTACCGTTCTGCAGCTGTCTGGT



GAAGAATCTGTTCTGCGTGCGGCGCTGGCGTCTTCTCCGTTCGTTGACGACATCA



ACCTGGCGCAGGCGGAAAAATTCCTGGCGGTTTCTCGTCGTACCGGTCAGGACAA



ACTGGGTAAACGTATCGGTGAATGCTTCGCGGAAGGTCGTCTGGAAGCGCTGGGT



ATCAAAGACCGTATGCGTGAATTCGTTCAGGCGATCGACGTTGCGCAGACCGCGG



GTCAGCGTTTCGCGGCGAAACTGAAAATCTTCGGTATCTCTCAGATGCCGGAAGC



GAAACAGTGGAACAACGACTCTGGTCTGACCGTTTGCATCCTGCCGGACTACTAC



GTTCCGGAAGAAAACCGTGCGGACCAGCTGGTTGTTCTGCTGCGTCGTCTGCGTG



AAATCGCGTACTGCATGGGTATCGAAGACGAAGCGGGTTTCGAACACCTGGGTAT



CGACCCGGGTGCGCTGTCTAACTTCTCTAACGGTAACCCGAAACGTGGTTTCCTG



GGTCGTCTGCTGAACAACGACATCATCGCGCTGGCGAACAACATGTCTGCGATGA



CCCCGTACTGGGAAGGTCGTAAAGGTGAACTGATCGAACGTCTGGCGTGGCTGAA



ACACCGTGCGGAAGGTCTGTACCTGAAAGAACCGCACTTCGGTAACTCTTGGGCG



GACCACCGTTCTCGTATCTTCTCTCGTATCGCGGGTTGGCTGTCTGGTTGCGCGG



GTAAACTGAAAATCGCGAAAGACCAGATCTCTGGTGTTCGTACCGACCTGTTCCT



GCTGAAACGTCTGCTGGACGCGGTTCCGCAGTCTGCGCCGTCTCCGGACTTCATC



GCGTCTATCTCTGCGCTGGACCGTTTCCTGGAAGCGGCGGAATCTTCTCAGGACC



CGGCGGAACAGGTTCGTGCGCTGTACGCGTTCCACCTGAACGCGCCGGCGGTTCG



TTCTATCGCGAACAAAGCGGTTCAGCGTTCTGACTCTCAGGAATGGCTGATCAAA



GAACTGGACGCGGTTGACCACCTGGAATTCAACAAAGCGTTCCCGTTCTTCTCTG



ACACCGGTAAAAAAAAAAAAAAAGGTGCGAACTCTAACGGTGCGCCGTCTGAAGA



AGAATACACCGAAACCGAATCTATCCAGCAGCCGGAAGACGCGGAACAGGAAGTT



AACGGTCAGGAAGGTAACGGTGCGTCTAAAAACCAGAAAAAATTCCAGCGTATCC



CGCGTTTCTTCGGTGAAGGTTCTCGTTCTGAATACCGTATCCTGACCGAAGCGCC



GCAGTACTTCGACATGTTCTGCAACAACATGCGTGCGATCTTCATGCAGCTGGAA



TCTCAGCCGCGTAAAGCGCCGCGTGACTTCAAATGCTTCCTGCAGAACCGTCTGC



AGAAACTGTACAAACAGACCTTCCTGAACGCGCGTTCTAACAAATGCCGTGCGCT



GCTGGAATCTGTTCTGATCTCTTGGGGTGAATTCTACACCTACGGTGCGAACGAA



AAAAAATTCCGTCTGCGTCACGAAGCGTCTGAACGTTCTTCTGACCCGGACTACG



TTGTTCAGCAGGCGCTGGAAATCGCGCGTCGTCTGTTCCTGTTCGGTTTCGAATG



GCGTGACTGCTCTGCGGGTGAACGTGTTGACCTGGTTGAAATCCACAAAAAAGCG



ATCTCTTTCCTGCTGGCGATCACCCAGGCGGAAGTTTCTGTTGGTTCTTACAACT



GGCTGGGTAACTCTACCGTTTCTCGTTACCTGTCTGTTGCGGGTACCGACACCCT



GTACGGTACCCAGCTGGAAGAATTCCTGAACGCGACCGTTCTGTCTCAGATGCGT



GGTCTGGCGATCCGTCTGTCTTCTCAGGAACTGAAAGACGGTTTCGACGTTCAGC



TGGAATCTTCTTGCCAGGACAACCTGCAGCACCTGCTGGTTTACCGTGCGTCTCG



TGACCTGGCGGCGTGCAAACGTGCGACCTGCCCGGCGGAACTGGACCCGAAAATC



CTGGTTCTGCCGGTTGGTGCGTTCATCGCGTCTGTTATGAAAATGATCGAACGTG



GTGACGAACCGCTGGCGGGTGCGTACCTGCGTCACCGTCCGCACTCTTTCGGTTG



GCAGATCCGTGTTCGTGGTGTTGCGGAAGTTGGTATGGACCAGGGTACCGCGCTG



GCGTTCCAGAAACCGACCGAATCTGAACCGTTCAAAATCAAACCGTTCTCTGCGC



AGTACGGTCCGGTTCTGTGGCTGAACTCTTCTTCTTACTCTCAGTCTCAGTACCT



GGACGGTTTCCTGTCTCAGCCGAAAAACTGGTCTATGCGTGTTCTGCCGCAGGCG



GGTTCTGTTCGTGTTGAACAGCGTGTTGCGCTGATCTGGAACCTGCAGGCGGGTA



AAATGCGTCTGGAACGTTCTGGTGCGCGTGCGTTCTTCATGCCGGTTCCGTTCTC



TTTCCGTCCGTCTGGTTCTGGTGACGAAGCGGTTCTGGCGCCGAACCGTTACCTG



GGTCTGTTCCCGCACTCTGGTGGTATCGAATACGCGGTTGTTGACGTTCTGGACT



CTGCGGGTTTCAAAATCCTGGAACGTGGTACCATCGCGGTTAACGGTTTCTCTCA



GAAACGTGGTGAACGTCAGGAAGAAGCGCACCGTGAAAAACAGCGTCGTGGTATC



TCTGACATCGGTCGTAAAAAACCGGTTCAGGCGGAAGTTGACGCGGCGAACGAAC



TGCACCGTAAATACACCGACGTTGCGACCCGTCTGGGTTGCCGTATCGTTGTTCA



GTGGGCGCCGCAGCCGAAACCGGGTACCGCGCCGACCGCGCAGACCGTTTACGCG



CGTGCGGTTCGTACCGAAGCGCCGCGTTCTGGTAACCAGGAAGACCACGCGCGTA



TGAAATCTTCTTGGGGTTACACCTGGGGTACCTACTGGGAAAAACGTAAACCGGA



AGACATCCTGGGTATCTCTACCCAGGTTTACTGGACCGGTGGTATCGGTGAATCT



TGCCCGGCGGTTGCGGTTGCGCTGCTGGGTCACATCCGTGCGACCTCTACCCAGA



CCGAATGGGAAAAAGAAGAAGTTGTTTTCGGTCGTCTGAAAAAATTCTTCCCGTC



TTAA





SEQ ID NO: 59
ATGGAAAAACGTATCAACAAAATCCGTAAAAAACTGTCTGCGGACAACGCGACCA



AACCGGTTTCTCGTTCTGGTCCGATGAAAACCCTGCTGGTTCGTGTTATGACCGA



CGACCTGAAAAAACGTCTGGAAAAACGTCGTAAAAAACCGGAAGTTATGCCGCAG



GTTATCTCTAACAACGCGGCGAACAACCTGCGTATGCTGCTGGACGACTACACCA



AAATGAAAGAAGCGATCCTGCAGGTTTACTGGCAGGAATTCAAAGACGACCACGT



TGGTCTGATGTGCAAATTCGCGCAGCCGGCGTCTAAAAAAATCGACCAGAACAAA



CTGAAACCGGAAATGGACGAAAAAGGTAACCTGACCACCGCGGGTTTCGCGTGCT



CTCAGTGCGGTCAGCCGCTGTTCGTTTACAAACTGGAACAGGTTTCTGAAAAAGG



TAAAGCGTACACCAACTACTTCGGTCGTTGCAACGTTGCGGAACACGAAAAACTG



ATCCTGCTGGCGCAGCTGAAACCGGAAAAAGACTCTGACGAAGCGGTTACCTACT



CTCTGGGTAAATTCGGTCAGCGTGCGCTGGACTTCTACTCTATCCACGTTACCAA



AGAATCTACCCACCCGGTTAAACCGCTGGCGCAGATCGCGGGTAACCGTTACGCG



TCTGGTCCGGTTGGTAAAGCGCTGTCTGACGCGTGCATGGGTACCATCGCGTCTT



TCCTGTCTAAATACCAGGACATCATCATCGAACACCAGAAAGTTGTTAAAGGTAA



CCAGAAACGTCTGGAATCTCTGCGTGAACTGGCGGGTAAAGAAAACCTGGAATAC



CCGTCTGTTACCCTGCCGCCGCAGCCGCACACCAAAGAAGGTGTTGACGCGTACA



ACGAAGTTATCGCGCGTGTTCGTATGTGGGTTAACCTGAACCTGTGGCAGAAACT



GAAACTGTCTCGTGACGACGCGAAACCGCTGCTGCGTCTGAAAGGTTTCCCGTCT



TTCCCGGTTGTTGAACGTCGTGAAAACGAAGTTGACTGGTGGAACACCATCAACG



AAGTTAAAAAACTGATCGACGCGAAACGTGACATGGGTCGTGTTTTCTGGTCTGG



TGTTACCGCGGAAAAACGTAACACCATCCTGGAAGGTTACAACTACCTGCCGAAC



GAAAACGACCACAAAAAACGTGAAGGTTCTCTGGAAAACCCGAAAAAACCGGCGA



AACGTCAGTTCGGTGACCTGCTGCTGTACCTGGAAAAAAAATACGCGGGTGACTG



GGGTAAAGTTTTCGACGAAGCGTGGGAACGTATCGACAAAAAAATCGCGGGTCTG



ACCTCTCACATCGAACGTGAAGAAGCGCGTAACGCGGAAGACGCGCAGTCTAAAG



CGGTTCTGACCGACTGGCTGCGTGCGAAAGCGTCTTTCGTTCTGGAACGTCTGAA



AGAAATGGACGAAAAAGAATTCTACGCGTGCGAAATCCAGCTGCAGAAATGGTAC



GGTGACCTGCGTGGTAACCCGTTCGCGGTTGAAGCGGAAAACCGTGTTGTTGACA



TCTCTGGTTTCTCTATCGGTTCTGACGGTCACTCTATCCAGTACCGTAACCTGCT



GGCGTGGAAATACCTGGAAAACGGTAAACGTGAATTCTACCTGCTGATGAACTAC



GGTAAAAAAGGTCGTATCCGTTTCACCGACGGTACCGACATCAAAAAATCTGGTA



AATGGCAGGGTCTGCTGTACGGTGGTGGTAAAGCGAAAGTTATCGACCTGACCTT



CGACCCGGACGACGAACAGCTGATCATCCTGCCGCTGGCGTTCGGTACCCGTCAG



GGTCGTGAATTCATCTGGAACGACCTGCTGTCTCTGGAAACCGGTCTGATCAAAC



TGGCGAACGGTCGTGTTATCGAAAAAACCATCTACAACAAAAAAATCGGTCGTGA



CGAACCGGCGCTGTTCGTTGCGCTGACCTTCGAACGTCGTGAAGTTGTTGACCCG



TCTAACATCAAACCGGTTAACCTGATCGGTGTTGACCGTGGTGAAAACATCCCGG



CGGTTATCGCGCTGACCGACCCGGAAGGTTGCCCGCTGCCGGAATTCAAAGACTC



TTCTGGTGGTCCGACCGACATCCTGCGTATCGGTGAAGGTTACAAAGAAAAACAG



CGTGCGATCCAGGCGGCGAAAGAAGTTGAACAGCGTCGTGCGGGTGGTTACTCTC



GTAAATTCGCGTCTAAATCTCGTAACCTGGCGGACGACATGGTTCGTAACTCTGC



GCGTGACCTGTTCTACCACGCGGTTACCCACGACGCGGTTCTGGTTTTCGAAAAC



CTGTCTCGTGGTTTCGGTCGTCAGGGTAAACGTACCTTCATGACCGAACGTCAGT



ACACCAAAATGGAAGACTGGCTGACCGCGAAACTGGCGTACGAAGGTCTGACCTC



TAAAACCTACCTGTCTAAAACCCTGGCGCAGTACACCTCTAAAACCTGCTCTAAC



TGCGGTTTCACCATCACCACCGCGGACTACGACGGTATGCTGGTTCGTCTGAAAA



AAACCTCTGACGGTTGGGCGACCACCCTGAACAACAAAGAACTGAAAGCGGAAGG



TCAGATCACCTACTACAACCGTTACAAACGTCAGACCGTTGAAAAAGAACTGTCT



GCGGAACTGGACCGTCTGTCTGAAGAATCTGGTAACAACGACATCTCTAAATGGA



CCAAAGGTCGTCGTGACGAAGCGCTGTTCCTGCTGAAAAAACGTTTCTCTCACCG



TCCGGTTCAGGAACAGTTCGTTTGCCTGGACTGCGGTCACGAAGTTCACGCGGAC



GAACAGGCGGCGCTGAACATCGCGCGTTCTTGGCTGTTCCTGAACTCTAACTCTA



CCGAATTCAAATCTTACAAATCTGGTAAACAGCCGTTCGTTGGTGCGTGGCAGGC



GTTCTACAAACGTCGTCTGAAAGAAGTTTGGAAACCGAACGCG





SEQ ID NO: 60
ATGAAACGTATCAACAAAATCCGTCGTCGTCTGGTTAAAGACTCTAACACCAAAA



AAGCGGGTAAAACCGGTCCGATGAAAACCCTGCTGGTTCGTGTTATGACCCCGGA



CCTGCGTGAACGTCTGGAAAACCTGCGTAAAAAACCGGAAAACATCCCGCAGCCG



ATCTCTAACACCTCTCGTGCGAACCTGAACAAACTGCTGACCGACTACACCGAAA



TGAAAAAAGCGATCCTGCACGTTTACTGGGAAGAATTCCAGAAAGACCCGGTTGG



TCTGATGTCTCGTGTTGCGCAGCCGGCGCCGAAAAACATCGACCAGCGTAAACTG



ATCCCGGTTAAAGACGGTAACGAACGTCTGACCTCTTCTGGTTTCGCGTGCTCTC



AGTGCTGCCAGCCGCTGTACGTTTACAAACTGGAACAGGTTAACGACAAAGGTAA



ACCGCACACCAACTACTTCGGTCGTTGCAACGTTTCTGAACACGAACGTCTGATC



CTGCTGTCTCCGCACAAACCGGAAGCGAACGACGAACTGGTTACCTACTCTCTGG



GTAAATTCGGTCAGCGTGCGCTGGACTTCTACTCTATCCACGTTACCCGTGAATC



TAACCACCCGGTTAAACCGCTGGAACAGATCGGTGGTAACTCTTGCGCGTCTGGT



CCGGTTGGTAAAGCGCTGTCTGACGCGTGCATGGGTGCGGTTGCGTCTTTCCTGA



CCAAATACCAGGACATCATCCTGGAACACCAGAAAGTTATCAAAAAAAACGAAAA



ACGTCTGGCGAACCTGAAAGACATCGCGTCTGCGAACGGTCTGGCGTTCCCGAAA



ATCACCCTGCCGCCGCAGCCGCACACCAAAGAAGGTATCGAAGCGTACAACAACG



TTGTTGCGCAGATCGTTATCTGGGTTAACCTGAACCTGTGGCAGAAACTGAAAAT



CGGTCGTGACGAAGCGAAACCGCTGCAGCGTCTGAAAGGTTTCCCGTCTTTCCCG



CTGGTTGAACGTCAGGCGAACGAAGTTGACTGGTGGGACATGGTTTGCAACGTTA



AAAAACTGATCAACGAAAAAAAAGAAGACGGTAAAGTTTTCTGGCAGAACCTGGC



GGGTTACAAACGTCAGGAAGCGCTGCTGCCGTACCTGTCTTCTGAAGAAGACCGT



AAAAAAGGTAAAAAATTCGCGCGTTACCAGTTCGGTGACCTGCTGCTGCACCTGG



AAAAAAAACACGGTGAAGACTGGGGTAAAGTTTACGACGAAGCGTGGGAACGTAT



CGACAAAAAAGTTGAAGGTCTGTCTAAACACATCAAACTGGAAGAAGAACGTCGT



TCTGAAGACGCGCAGTCTAAAGCGGCGCTGACCGACTGGCTGCGTGCGAAAGCGT



CTTTCGTTATCGAAGGTCTGAAAGAAGCGGACAAAGACGAATTCTGCCGTTGCGA



ACTGAAACTGCAGAAATGGTACGGTGACCTGCGTGGTAAACCGTTCGCGATCGAA



GCGGAAAACTCTATCCTGGACATCTCTGGTTTCTCTAAACAGTACAACTGCGCGT



TCATCTGGCAGAAAGACGGTGTTAAAAAACTGAACCTGTACCTGATCATCAACTA



CTTCAAAGGTGGTAAACTGCGTTTCAAAAAAATCAAACCGGAAGCGTTCGAAGCG



AACCGTTTCTACACCGTTATCAACAAAAAATCTGGTGAAATCGTTCCGATGGAAG



TTAACTTCAACTTCGACGACCCGAACCTGATCATCCTGCCGCTGGCGTTCGGTAA



ACGTCAGGGTCGTGAATTCATCTGGAACGACCTGCTGTCTCTGGAAACCGGTTCT



CTGAAACTGGCGAACGGTCGTGTTATCGAAAAAACCCTGTACAACCGTCGTACCC



GTCAGGACGAACCGGCGCTGTTCGTTGCGCTGACCTTCGAACGTCGTGAAGTTCT



GGACTCTTCTAACATCAAACCGATGAACCTGATCGGTATCGACCGTGGTGAAAAC



ATCCCGGCGGTTATCGCGCTGACCGACCCGGAAGGTTGCCCGCTGTCTCGTTTCA



AAGACTCTCTGGGTAACCCGACCCACATCCTGCGTATCGGTGAATCTTACAAAGA



AAAACAGCGTACCATCCAGGCGGCGAAAGAAGTTGAACAGCGTCGTGCGGGTGGT



TACTCTCGTAAATACGCGTCTAAAGCGAAAAACCTGGCGGACGACATGGTTCGTA



ACACCGCGCGTGACCTGCTGTACTACGCGGTTACCCAGGACGCGATGCTGATCTT



CGAAAACCTGTCTCGTGGTTTCGGTCGTCAGGGTAAACGTACCTTCATGGCGGAA



CGTCAGTACACCCGTATGGAAGACTGGCTGACCGCGAAACTGGCGTACGAAGGTC



TGCCGTCTAAAACCTACCTGTCTAAAACCCTGGCGCAGTACACCTCTAAAACCTG



CTCTAACTGCGGTTTCACCATCACCTCTGCGGACTACGACCGTGTTCTGGAAAAA



CTGAAAAAAACCGCGACCGGTTGGATGACCACCATCAACGGTAAAGAACTGAAAG



TTGAAGGTCAGATCACCTACTACAACCGTTACAAACGTCAGAACGTTGTTAAAGA



CCTGTCTGTTGAACTGGACCGTCTGTCTGAAGAATCTGTTAACAACGACATCTCT



TCTTGGACCAAAGGTCGTTCTGGTGAAGCGCTGTCTCTGCTGAAAAAACGTTTCT



CTCACCGTCCGGTTCAGGAAAAATTCGTTTGCCTGAACTGCGGTTTCGAAACCCA



CGCGGACGAACAGGCGGCGCTGAACATCGCGCGTTCTTGGCTGTTCCTGCGTTCT



CAGGAATACAAAAAATACCAGACCAACAAAACCACCGGTAACACCGACAAACGTG



CGTTCGTTGAAACCTGGCAGTCTTTCTACCGTAAAAAACTGAAAGAAGTTTGGAA



ACCG





SEQ ID NO: 61
AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAA



GCATTGATAATTGAGATCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGA



CAAAAATAAATTATTTATTTATCCAGAAAATGAATTGGAAAATCAGGAGAGCGTT



TTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAGGTtgccgtcactgcgtc



ttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattc



tgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataa



tcacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgcc



atagcatttttatccataagattagcggatcctacctgacgctttttatcgcaac



tctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtg



agataggcggagatacgaactttaagAAGGAGatataccATGGGTAAAATGTATT



ACCTTGGTTTAGACATTGGCACGAATTCCGTGGGCTACGCGGTGACCGACCCCTC



ATACCACCTGCTGAAGTTTAAGGGGGAACCAATGTGGGGTGCGCACGTATTTGCC



GCCGGTAATCAGAGCGCGGAACGACGCTCGTTCCGCACATCGCGTCGTCGTTTGG



ACCGACGCCAACAGCGCGTTAAACTGGTACAGGAGATTTTTGCCCCGGTGATTAG



TCCGATCGACCCACGCTTCTTCATTCGTCTGCATGAATCCGCCCTGTGGCGCGAT



GACGTCGCGGAGACGGATAAACATATCTTTTTCAATGATCCTACCTATACCGATA



AGGAATATTATAGCGATTACCCGACTATCCATCACCTGATCGTTGATCTGATGGA



AAGCTCTGAGAAACACGATCCGCGGCTGGTGTACCTTGCAGTGGCGTGGTTAGTG



GCACACCGTGGTCATTTTCTGAACGAGGTGGACAAGGATAATATTGGAGATGTGT



TGTCGTTCGACGCATTTTATCCGGAGTTTCTCGCGTTCCTGTCGGACAACGGTGT



ATCACCGTGGGTGTGCGAAAGCAAAGCGCTGCAGGCGACCTTGCTGAGCCGTAAC



TCAGTGAACGACAAATATAAAGCCCTTAAGTCTCTGATCTTCGGATCCCAGAAAC



CTGAAGATAACTTCGATGCCAATATTTCGGAAGATGGACTCATTCAACTGCTGGC



CGGCAAAAAGGTAAAAGTTAACAAACTGTTCCCTCAGGAATCGAACGATGCATCC



TTCACATTGAATGATAAAGAAGACGCGATAGAAGAAATCCTGGGTACGCTTACAC



CAGATGAATGTGAATGGATTGCGCATATACGCCGCCTTTTTGACTGGGCTATCAT



GAAACATGCTCTGAAAGATGGCAGGACTATTAGCGAGTCAAAAGTCAAACTGTAT



GAGCAGCACCATCACGATCTGACCCAACTTAAATACTTCGTGAAAACCTACCTTG



CAAAAGAATACGACGATATTTTCCGCAACGTGGATAGCGAAACAACGAAAAACTA



TGTAGCGTATTCCTATCATGTGAAAGAGGTGAAAGGCACTCTGCCTAAAAATAAG



GCAACGCAAGAAGAGTTTTGTAAGTATGTCCTGGGCAAGGTTAAAAACATTGAAT



GCTCTGAAGCAGACAAGGTTGACTTTGATGAGATGATTCAGCGTCTTACCGACAA



CTCTTTTATGCCTAAGCAGGTTTCGGGCGAAAACCGCGTTATTCCTTATCAGTTA



TATTATTATGAACTGAAGACAATTCTGAATAAAGCAGCCTCGTACCTGCCTTTCC



TGACGCAGTGTGGAAAAGATGCAATTTCGAACCAGGACAAACTACTGTCGATCAT



GACGTTCCGTATTCCTTACTTCGTCGGACCCTTGCGAAAAGATAATTCGGAACAT



GCATGGCTCGAACGAAAGGCCGGTAAGATTTATCCGTGGAACTTTAACGACAAAG



TGGACTTGGATAAATCAGAAGAAGCGTTCATTCGCCGAATGACCAATACCTGTAC



CTATTATCCCGGCGAAGATGTTTTACCGTTGGATTCGCTGATCTATGAGAAATTT



ATGATTTTAAATGAAATCAATAATATTCGTATTGACGGCTACCCGATTAGTGTTG



ACGTTAAACAGCAGGTTTTTGGCTTGTTCGAAAAAAAACGACGCGTAACCGTGAA



AGATATTCAGAACCTGCTGCTGTCTCTCGGAGCTCTGGACAAACACGGGAAGCTG



ACAGGCATCGATACCACTATCCACTCAAACTATAATACGTATCACCATTTTAAAT



CTCTCATGGAACGCGGCGTCCTGACCCGGGATGACGTGGAACGCATCGTTGAAAG



GATGACCTACAGCGACGATACTAAGCGTGTGCGTCTGTGGCTGAATAACAACTAT



GGTACTTTAACCGCCGACGATGTGAAACACATTTCGCGTCTGCGCAAACACGATT



TTGGCCGTTTATCCAAAATGTTCTTAACAGGTCTGAAGGGTGTCCATAAGGAGAC



CGGTGAACGTGCCTCCATACTGGATTTCATGTGGAACACGAACGATAACCTGATG



CAGCTCCTTTCCGAATGCTACACGTTCAGTGATGAAATCACAAAGCTGCAAGAGG



CGTATTATGCAAAAGCCCAGTTGTCTTTAAACGATTTTTTAGACTCGATGTACAT



CTCTAACGCGGTGAAACGTCCGATTTACAGAACTCTGGCAGTGGTGAACGATATT



CGAAAAGCATGTGGGACGGCCCCTAAACGCATTTTCATCGAAATGGCTCGTGATG



GTGAATCAAAAAAAAAGAGAAGTGTTACACGTCGCGAGCAGATCAAAAACCTGTA



CCGCTCGATTCGTAAAGATTTCCAGCAGGAAGTTGATTTTCTGGAAAAGATCCTG



GAAAATAAATCTGATGGTCAACTTCAGTCAGATGCTTTGTATCTTTACTTTGCAC



AATTAGGGCGCGATATGTACACGGGCGATCCAATAAAGCTGGAGCACATCAAAGA



TCAGAGTTTCTATAACATAGACCATATTTACCCGCAGTCTATGGTGAAAGACGAT



TCCCTAGATAACAAAGTGCTGGTGCAAAGCGAAATTAACGGCGAGAAAAGCTCGC



GATACCCTTTGGACGCCGCGATCCGCAATAAAATGAAGCCCCTTTGGGACGCTTA



CTATAATCATGGCCTGATCTCCTTAAAGAAATACCAGCGTCTAACGCGCTCGACC



CCGTTTACCGATGATGAAAAATGGGACTTTATTAATCGCCAGTTAGTGGAAACCC



GTCAATCTACCAAAGCGCTGGCCATTTTGTTGAAGCGTAAGTTTCCAGACACCGA



AATTGTGTATTCGAAGGCGGGGTTATCGTCCGACTTCAGACATGAATTCGGCCTT



GTAAAAAGTCGCAATATTAATGATTTGCACCACGCTAAAGACGCATTCTTGGCTA



TCGTTACCGGCAATGTGTACCATGAAAGATTCAATCGCAGATGGTTTATGGTGAA



CCAGCCGTACTCAGTTAAAACTAAAACTCTTTTTACCCACAGCATAAAGAATGGC



AACTTCGTTGCCTGGAACGGCGAAGAAGATCTCGGTCGTATTGTAAAAATGCTGA



AGCAAAACAAAAATACCATTCACTTCACGCGCTTCTCCTTCGATCGCAAAGAAGG



ATTATTTGATATCCAACCTCTGAAAGCCAGCACCGGCTTAGTCCCACGAAAAGCC



GGTCTGGATGTCGTTAAATACGGCGGATATGACAAATCTACCGCGGCCTATTACC



TGCTGGTGAGGTTCACGCTCGAGGACAAGAAAACCCAGCACAAGCTGATGATGAT



TCCTGTAGAAGGCCTGTACAAGGCTCGCATTGATCATGACAAGGAATTTCTTACC



GATTATGCGCAAACGACTATAAGCGAAATCCTACAGAAAGATAAACAGAAAGTGA



TCAATATTATGTTTCCAATGGGTACGAGGCATATAAAACTCAATTCAATGATTAG



TATCGATGGCTTCTATCTTAGTATCGGCGGAAAGTCCTCTAAAGGTAAGTCAGTT



CTATGTCACGCAATGGTTCCACTGATCGTCCCTCACAAAATCGAATGTTACATTA



AAGCAATGGAAAGCTTCGCCCGGAAGTTTAAAGAAAACAACAAGCTGCGCATCGT



AGAAAAATTCGATAAAATCACCGTTGAAGACAACCTGAATCTCTACGAGCTCTTT



CTCCAAAAACTGCAGCATAATCCCTATAATAAGTTTTTTTCGACACAGTTTGACG



TACTGACGAACGGCCGTTCTACTTTCACAAAACTGTCGCCGGAGGAACAGGTACA



GACGCTCTTGAACATTTTAAGTATCTTTAAAACATGCCGCAGTTCGGGTTGCGAC



CTGAAATCCATCAACGGCAGTGCCCAGGCAGCGCGCATCATGATTAGCGCTGACT



TAACTGGACTGTCGAAAAAATATTCAGATATTAGGTTGGTTGAACAGTCAGCTTC



TGGTTTGTTCGTATCCAAAAGTCAGAACTTACTGGAGTATCTCTAAGAAATCATC



CTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATTTATTATATCGCGTTGATTA



TTGATGCTGTTTTTAGTTTTAACGGCAATTAATATATGTGTTATTAATTGAATGA



ATTTTATCATTCATAATAAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACT



CAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACAGAATTATCTCATAACAAGT



GTTAAGGGATGTTATTTCC





SEQ ID NO: 62
AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAA



GCATTGATAATTGAGATCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGA



CAAAAATAAATTATTTATTTATCCAGAAAATGAATTGGAAAATCAGGAGAGCGTT



TTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAGGTtgccgtcactgcgtc



ttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattc



tgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataa



tcacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgcc



atagcatttttatccataagattagcggatcctacctgacgctttttatcgcaac



tctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtg



agataggcggagatacgaactttaagAAGGAGatataccATGTCATCGCTCACGA



AATTCACTAACAAATACTCTAAACAGCTCACCATTAAGAATGAACTCATCCCAGT



TGGCAAAACACTGGAGAACATCAAAGAGAATGGTCTGATAGATGGCGACGAACAG



CTGAATGAGAATTATCAGAAGGCGAAAATTATTGTGGATGATTTTCTGCGGGACT



TCATTAATAAAGCACTGAATAATACGCAGATCGGGAACTGGCGCGAACTGGCGGA



TGCCCTTAATAAAGAGGATGAAGATAACATCGAGAAATTGCAGGATAAAATTCGG



GGAATCATTGTATCCAAATTTGAAACGTTTGATCTGTTTAGCAGCTATTCTATTA



AGAAAGATGAAAAGATTATTGACGACGACAATGATGTTGAAGAAGAGGAACTGGA



TCTGGGCAAGAAGACCAGCTCATTTAAATACATATTTAAAAAAAACCTGTTTAAG



TTAGTGTTGCCATCCTACCTGAAAACCACAAACCAGGACAAGCTGAAGATTATTA



GCTCGTTTGATAATTTTTCAACGTACTTCCGCGGGTTCTTTGAAAACCGGAAAAA



CATTTTTACCAAGAAACCGATCTCCACAAGTATTGCGTATCGCATTGTTCATGAT



AACTTCCCGAAATTCCTTGATAACATTCGTTGTTTTAATGTGTGGCAGACGGAAT



GCCCGCAACTAATCGTGAAAGCAGATAACTATCTGAAAAGCAAAAATGTTATAGC



GAAAGATAAAAGTTTGGCAAACTATTTTACCGTGGGCGCGTATGACTATTTCCTG



TCTCAGAATGGTATAGATTTTTACAACAATATTATAGGTGGACTGCCAGCGTTCG



CCGGCCATGAGAAAATCCAAGGTCTCAATGAATTCATCAATCAAGAGTGCCAAAA



AGACAGCGAGCTGAAAAGTAAGCTGAAAAACCGTCACGCGTTCAAAATGGCGGTA



CTGTTCAAACAGATACTCAGCGATCGTGAAAAAAGTTTTGTAATTGATGAGTTCG



AGTCGGATGCTCAAGTTATTGACGCCGTTAAAAACTTTTACGCCGAACAGTGCAA



AGATAACAATGTTATTTTTAACTTATTAAATCTTATCAAGAATATCGCTTTCTTA



AGTGATGACGAACTGGACGGCATATTCATTGAAGGGAAATACCTGTCGAGCGTTA



GTCAAAAACTCTATAGCGATTGGTCAAAATTACGTAACGACATTGAGGATTCGGC



TAACTCTAAACAAGGCAATAAAGAGCTGGCCAAGAAGATCAAAACCAACAAAGGG



GATGTAGAAAAAGCGATCTCGAAATATGAGTTCTCGCTGTCGGAACTGAACTCGA



TTGTACATGATAACACCAAGTTTTCTGACCTCCTTAGTTGTACACTGCATAAGGT



GGCTTCTGAGAAACTGGTGAAGGTCAATGAAGGCGACTGGCCGAAACATCTCAAG



AATAATGAAGAGAAACAAAAAATCAAAGAGCCGCTTGATGCTCTGCTGGAGATCT



ATAATACACTTCTGATTTTTAACTGCAAAAGCTTCAATAAAAACGGCAACTTCTA



TGTCGACTATGATCGTTGCATCAATGAACTGAGTTCGGTCGTGTATCTGTATAAT



AAAACACGTAACTATTGCACTAAAAAACCCTATAACACGGACAAGTTCAAACTCA



ATTTTAACAGTCCGCAGCTCGGTGAAGGCTTTTCCAAGTCGAAAGAAAATGACTG



TCTGACTCTTTTGTTTAAAAAAGACGACAACTATTATGTAGGCATTATCCGCAAA



GGTGCAAAAATCAATTTTGATGATACACAAGCAATCGCCGATAACACCGACAATT



GCATCTTTAAAATGAATTATTTCCTACTTAAAGACGCAAAAAAATTTATCCCGAA



ATGTAGCATTCAGCTGAAAGAAGTCAAGGCCCATTTTAAGAAATCTGAAGATGAT



TACATTTTGTCTGATAAAGAGAAATTTGCTAGCCCGCTGGTCATTAAAAAGAGCA



CATTTTTGCTGGCAACTGCACATGTGAAAGGGAAAAAAGGCAATATCAAGAAATT



TCAGAAAGAATATTCGAAAGAAAACCCCACTGAGTATCGCAATTCTTTAAACGAA



TGGATTGCTTTTTGTAAAGAGTTCTTAAAAACTTATAAAGCGGCTACCATTTTTG



ATATAACCACATTGAAAAAGGCAGAGGAATATGCTGATATTGTAGAATTCTACAA



GGATGTCGATAATCTGTGCTACAAACTGGAGTTCTGCCCGATTAAAACCTCGTTT



ATAGAAAACCTGATAGATAACGGCGACCTGTATCTGTTTCGCATCAATAACAAAG



ACTTCAGCAGTAAATCGACCGGCACCAAGAACCTTCATACGTTATATTTACAAGC



TATATTCGATGAACGTAATCTGAACAATCCGACAATTATGCTGAATGGGGGAGCA



GAACTGTTCTATCGTAAAGAAAGTATTGAGCAGAAAAACCGTATCACACACAAAG



CCGGTTCAATTCTCGTGAATAAGGTGTGTAAAGACGGTACAAGCCTGGATGATAA



GATACGTAATGAAATTTATCAATATGAGAATAAATTTATTGATACCCTGTCTGAT



GAAGCTAAAAAGGTGTTACCGAATGTCATTAAAAAGGAAGCTACCCATGACATTA



CAAAAGATAAACGTTTCACTAGTGACAAATTCTTCTTTCACTGCCCCCTGACAAT



TAATTATAAGGAAGGCGATACCAAGCAGTTCAATAACGAAGTGCTGAGTTTTCTG



CGTGGAAATCCTGACATCAACATTATCGGCATTGACCGCGGAGAGCGTAATTTAA



TCTATGTAACGGTTATAAACCAGAAAGGCGAGATTCTGGATTCGGTTTCATTCAA



TACCGTGACCAACAAGAGTTCAAAAATCGAGCAGACAGTCGATTATGAAGAGAAA



TTGGCAGTCCGCGAGAAAGAGAGGATTGAAGCAAAACGTTCCTGGGACTCTATCT



CAAAAATTGCGACACTAAAGGAAGGTTATCTGAGCGCAATAGTTCACGAGATCTG



TCTGTTAATGATTAAACACAACGCGATCGTTGTCTTAGAGAATCTTAATGCAGGC



TTTAAGCGTATTCGTGGCGGTTTATCAGAAAAAAGTGTTTATCAAAAATTCGAAA



AAATGTTGATTAACAAACTGAACTATTTTGTCAGCAAGAAGGAATCCGACTGGAA



TAAACCGTCTGGTCTGCTGAATGGACTGCAGCTTTCGGATCAGTTTGAAAGCTTC



GAAAAACTGGGTATTCAGTCTGGTTTTATTTTTTACGTGCCGGCTGCATATACCT



CAAAGATTGATCCGACCACGGGCTTCGCCAATGTTCTGAATCTGTCGAAGGTACG



CAATGTTGATGCGATCAAAAGCTTTTTTTCTAACTTCAACGAAATTAGTTATAGC



AAGAAAGAAGCCCTTTTCAAATTCTCATTCGATCTGGATTCACTGAGTAAGAAAG



GCTTTAGTAGCTTTGTGAAATTTAGTAAGAGTAAATGGAACGTCTACACCTTTGG



AGAACGTATCATAAAGCCAAAGAATAAGCAAGGTTATCGGGAGGACAAAAGAATC



AACTTGACCTTCGAGATGAAGAAGTTACTTAACGAGTATAAGGTTTCTTTTGATC



TTGAAAATAACTTGATTCCGAATCTCACGAGTGCCAACCTGAAGGATACTTTTTG



GAAAGAGCTATTCTTTATCTTCAAGACTACGCTGCAGCTCCGTAACAGCGTTACT



AACGGTAAAGAAGATGTGCTCATCTCTCCGGTCAAAAATGCGAAGGGTGAATTCT



TCGTTTCGGGAACGCATAACAAGACTCTTCCGCAAGATTGCGATGCGAACGGTGC



ATACCATATTGCGTTGAAAGGTCTGATGATACTCGAACGTAACAACCTTGTACGT



GAGGAGAAAGATACGAAAAAGATTATGGCGATTTCAAACGTGGATTGGTTCGAGT



ACGTGCAGAAACGTAGAGGCGTTCTGTAAGAAATCATCCTTAGCGAAAGCTAAGG



ATTTTTTTTATCTGAAATTTATTATATCGCGTTGATTATTGATGCTGTTTTTAGT



TTTAACGGCAATTAATATATGTGTTATTAATTGAATGAATTTTATCATTCATAAT



AAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCA



GGAAGCAAAGAGGATTACAGAATTATCTCATAACAAGTGTTAAGGGATGTTATTT



CC





SEQ ID NO: 63
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT



TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC



TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA



CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC



ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC



GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAG



TAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAAC



TTTAAGAGGAGGATATACCATGCACCATCATCATCACCATAACAACTACGACGAA



TTCACCAAACTGTACCCGATCCAGAAAACCATCCGTTTCGAACTGAAACCGCAGG



GTCGTACCATGGAACACCTGGAAACCTTCAACTTCTTCGAAGAAGACCGTGACCG



TGCGGAAAAATACAAAATCCTGAAAGAAGCGATCGACGAATACCACAAAAAATTC



ATCGACGAACACCTGACCAACATGTCTCTGGACTGGAACTCTCTGAAACAGATCT



CTGAAAAATACTACAAATCTCGTGAAGAAAAAGACAAAAAAGTTTTCCTGTCTGA



ACAGAAACGTATGCGTCAGGAAATCGTTTCTGAATTCAAAAAAGACGACCGTTTC



AAAGACCTGTTCTCTAAAAAACTGTTCTCTGAACTGCTGAAAGAAGAAATCTACA



AAAAAGGTAACCACCAGGAAATCGACGCGCTGAAATCTTTCGACAAATTCTCTGG



TTACTTCATCGGTCTGCACGAAAACCGTAAAAACATGTACTCTGACGGTGACGAA



ATCACCGCGATCTCTAACCGTATCGTTAACGAAAACTTCCCGAAATTCCTGGACA



ACCTGCAGAAATACCAGGAAGCGCGTAAAAAATACCCGGAATGGATCATCAAAGC



GGAATCTGCGCTGGTTGCGCACAACATCAAAATGGACGAAGTTTTCTCTCTGGAA



TACTTCAACAAAGTTCTGAACCAGGAAGGTATCCAGCGTTACAACCTGGCGCTGG



GTGGTTACGTTACCAAATCTGGTGAAAAAATGATGGGTCTGAACGACGCGCTGAA



CCTGGCGCACCAGTCTGAAAAATCTTCTAAAGGTCGTATCCACATGACCCCGCTG



TTCAAACAGATCCTGTCTGAAAAAGAATCTTTCTCTTACATCCCGGACGTTTTCA



CCGAAGACTCTCAGCTGCTGCCGTCTATCGGTGGTTTCTTCGCGCAGATCGAAAA



CGACAAAGACGGTAACATCTTCGACCGTGCGCTGGAACTGATCTCTTCTTACGCG



GAATACGACACCGAACGTATCTACATCCGTCAGGCGGACATCAACCGTGTTTCTA



ACGTTATCTTCGGTGAATGGGGTACCCTGGGTGGTCTGATGCGTGAATACAAAGC



GGACTCTATCAACGACATCAACCTGGAACGTACCTGCAAAAAAGTTGACAAATGG



CTGGACTCTAAAGAATTCGCGCTGTCTGACGTTCTGGAAGCGATCAAACGTACCG



GTAACAACGACGCGTTCAACGAATACATCTCTAAAATGCGTACCGCGCGTGAAAA



AATCGACGCGGCGCGTAAAGAAATGAAATTCATCTCTGAAAAAATCTCTGGTGAC



GAAGAATCTATCCACATCATCAAAACCCTGCTGGACTCTGTTCAGCAGTTCCTGC



ACTTCTTCAACCTGTTCAAAGCGCGTCAGGACATCCCGCTGGACGGTGCGTTCTA



CGCGGAATTCGACGAAGTTCACTCTAAACTGTTCGCGATCGTTCCGCTGTACAAC



AAAGTTCGTAACTACCTGACCAAAAACAACCTGAACACCAAAAAAATCAAACTGA



ACTTCAAAAACCCGACCCTGGCGAACGGTTGGGACCAGAACAAAGTTTACGACTA



CGCGTCTCTGATCTTCCTGCGTGACGGTAACTACTACCTGGGTATCATCAACCCG



AAACGTAAAAAAAACATCAAATTCGAACAGGGTTCTGGTAACGGTCCGTTCTACC



GTAAAATGGTTTACAAACAGATCCCGGGTCCGAACAAAAACCTGCCGCGTGTTTT



CCTGACCTCTACCAAAGGTAAAAAAGAATACAAACCGTCTAAAGAAATCATCGAA



GGTTACGAAGCGGACAAACACATCCGTGGTGACAAATTCGACCTGGACTTCTGCC



ACAAACTGATCGACTTCTTCAAAGAATCTATCGAAAAACACAAAGACTGGTCTAA



ATTCAACTTCTACTTCTCTCCGACCGAATCTTACGGTGACATCTCTGAATTCTAC



CTGGACGTTGAAAAACAGGGTTACCGTATGCACTTCGAAAACATCTCTGCGGAAA



CCATCGACGAATACGTTGAAAAAGGTGACCTGTTCCTGTTCCAGATCTACAACAA



AGACTTCGTTAAAGCGGCGACCGGTAAAAAAGACATGCACACCATCTACTGGAAC



GCGGCGTTCTCTCCGGAAAACCTGCAGGACGTTGTTGTTAAACTGAACGGTGAAG



CGGAACTGTTCTACCGTGACAAATCTGACATCAAAGAAATCGTTCACCGTGAAGG



TGAAATCCTGGTTAACCGTACCTACAACGGTCGTACCCCGGTTCCGGACAAAATC



CACAAAAAACTGACCGACTACCACAACGGTCGTACCAAAGACCTGGGTGAAGCGA



AAGAATACCTGGACAAAGTTCGTTACTTCAAAGCGCACTACGACATCACCAAAGA



CCGTCGTTACCTGAACGACAAAATCTACTTCCACGTTCCGCTGACCCTGAACTTC



AAAGCGAACGGTAAAAAAAACCTGAACAAAATGGTTATCGAAAAATTCCTGTCTG



ACGAAAAAGCGCACATCATCGGTATCGACCGTGGTGAACGTAACCTGCTGTACTA



CTCTATCATCGACCGTTCTGGTAAAATCATCGACCAGCAGTCTCTGAACGTTATC



GACGGTTTCGACTACCGTGAAAAACTGAACCAGCGTGAAATCGAAATGAAAGACG



CGCGTCAGTCTTGGAACGCGATCGGTAAAATCAAAGACCTGAAAGAAGGTTACCT



GTCTAAAGCGGTTCACGAAATCACCAAAATGGCGATCCAGTACAACGCGATCGTT



GTTATGGAAGAACTGAACTACGGTTTCAAACGTGGTCGTTTCAAAGTTGAAAAAC



AGATCTACCAGAAATTCGAAAACATGCTGATCGACAAAATGAACTACCTGGTTTT



CAAAGACGCGCCGGACGAATCTCCGGGTGGTGTTCTGAACGCGTACCAGCTGACC



AACCCGCTGGAATCTTTCGCGAAACTGGGTAAACAGACCGGTATCCTGTTCTACG



TTCCGGCGGCGTACACCTCTAAAATCGACCCGACCACCGGTTTCGTTAACCTGTT



CAACACCTCTTCTAAAACCAACGCGCAGGAACGTAAAGAATTCCTGCAGAAATTC



GAATCTATCTCTTACTCTGCGAAAGACGGTGGTATCTTCGCGTTCGCGTTCGACT



ACCGTAAATTCGGTACCTCTAAAACCGACCACAAAAACGTTTGGACCGCGTACAC



CAACGGTGAACGTATGCGTTACATCAAAGAAAAAAAACGTAACGAACTGTTCGAC



CCGTCTAAAGAAATCAAAGAAGCGCTGACCTCTTCTGGTATCAAATACGACGGTG



GTCAGAACATCCTGCCGGACATCCTGCGTTCTAACAACAACGGTCTGATCTACAC



CATGTACTCTTCTTTCATCGCGGCGATCCAGATGCGTGTTTACGACGGTAAAGAA



GACTACATCATCTCTCCGATCAAAAACTCTAAAGGTGAATTCTTCCGTACCGACC



CGAAACGTCGTGAACTGCCGATCGACGCGGACGCGAACGGTGCGTACAACATCGC



GCTGCGTGGTGAACTGACCATGCGTGCGATCGCGGAAAAATTCGACCCGGACTCT



GAAAAAATGGCGAAACTGGAACTGAAACACAAAGACTGGTTCGAATTCATGCAGA



CCCGTGGTGACTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGA



AATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAG



CAAAGAGGATTACA





SEQ ID NO: 64
AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAA



GCATTGATAATTGAGATCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGA



CAAAAATAAATTATTTATTTATCCAGAAAATGAATTGGAAAATCAGGAGAGCGTT



TTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAGGTtgccgtcactgcgtc



ttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattc



tgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataa



tcacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgcc



atagcatttttatccataagattagcggatcctacctgacgctttttatcgcaac



tctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtg



agataggcggagatacgaactttaagAAGGAGatataccATGACTAAAACATTTG



ATTCAGAGTTTTTTAATTTGTACTCGCTGCAAAAAACGGTACGCTTTGAGTTAAA



ACCCGTGGGAGAAACCGCGTCATTTGTGGAAGACTTTAAAAACGAGGGCTTGAAA



CGTGTTGTGAGCGAAGATGAAAGGCGAGCCGTCGATTACCAGAAAGTTAAGGAAA



TAATTGACGATTACCATCGGGATTTCATTGAAGAAAGTTTAAATTATTTTCCGGA



ACAGGTGAGTAAAGATGCTCTTGAGCAGGCGTTTCATCTTTATCAGAAACTGAAG



GCAGCAAAAGTTGAGGAAAGGGAAAAAGCGCTGAAAGAATGGGAAGCGCTGCAGA



AAAAGCTACGTGAAAAAGTGGTGAAATGCTTCTCGGACTCGAATAAAGCCCGCTT



CTCAAGGATTGATAAAAAGGAACTGATTAAGGAAGACCTGATAAATTGGTTGGTC



GCCCAGAATCGCGAGGATGATATCCCTACGGTCGAAACGTTTAACAACTTCACCA



CATATTTTACCGGCTTCCATGAGAATCGTAAAAATATTTACTCCAAAGATGATCA



CGCCACCGCTATTAGCTTTCGCCTTATTCATGAAAATCTTCCAAAGTTTTTTGAC



AACGTGATTAGCTTCAATAAGTTGAAAGAGGGTTTCCCTGAATTAAAATTTGATA



AAGTGAAAGAGGATTTAGAAGTAGATTATGATCTGAAGCATGCGTTTGAAATAGA



ATATTTCGTTAACTTCGTGACCCAAGCGGGCATAGATCAGTATAATTATCTGTTA



GGAGGGAAAACCCTGGAGGACGGGACGAAAAAACAAGGGATGAATGAGCAAATTA



ATCTGTTCAAACAACAGCAAACGCGAGATAAAGCGCGTCAGATTCCCAAACTGAT



CCCCCTGTTCAAACAGATTCTTAGCGAAAGGACTGAAAGCCAGTCCTTTATTCCT



AAACAATTTGAAAGTGATCAGGAGTTGTTCGATTCACTGCAGAAGTTACATAATA



ACTGCCAGGATAAATTCACCGTGCTGCAACAAGCCATTCTCGGTCTGGCAGAGGC



GGATCTTAAGAAGGTCTTCATCAAAACCTCTGATTTAAATGCCTTATCTAACACC



ATTTTCGGGAATTACAGCGTCTTTTCCGATGCACTGAACCTGTATAAAGAAAGCC



TGAAAACGAAAAAAGCGCAGGAGGCTTTTGAGAAACTACCGGCCCATTCTATTCA



CGACCTCATTCAATACTTGGAACAGTTCAATTCCAGCCTGGACGCGGAAAAACAA



CAGAGCACCGACACCGTCCTGAACTACTTCATCAAGACCGATGAATTATATTCTC



GCTTCATTAAATCCACTAGCGAGGCTTTCACTCAGGTGCAGCCTTTGTTCGAACT



GGAAGCCCTGTCATCTAAGCGCCGCCCACCGGAATCGGAAGATGAAGGGGCAAAA



GGGCAGGAAGGCTTCGAGCAGATCAAGCGTATTAAAGCTTACCTGGATACGCTTA



TGGAAGCGGTACACTTTGCAAAGCCGTTGTATCTTGTTAAGGGTCGTAAAATGAT



CGAAGGGCTCGATAAAGACCAGTCCTTTTATGAAGCGTTTGAAATGGCGTACCAA



GAACTTGAATCGTTAATCATTCCTATCTATAACAAAGCGCGGAGCTATCTGTCGC



GGAAACCTTTCAAGGCCGATAAATTCAAGATTAATTTTGACAACAACACGCTACT



GAGCGGATGGGATGCGAACAAGGAAACTGCTAACGCGTCCATTCTGTTTAAGAAA



GACGGGTTATATTACCTTGGAATTATGCCGAAAGGTAAGACCTTTCTCTTTGACT



ACTTTGTATCGAGCGAGGATTCAGAGAAACTGAAACAGCGTCGCCAGAAGACCGC



CGAAGAAGCTCTGGCGCAGGATGGTGAAAGTTACTTCGAAAAAATTCGTTATAAA



CTGTTACCAGGGGCTTCAAAGATGTTACCGAAAGTCTTTTTTAGCAACAAAAATA



TTGGCTTTTACAACCCGTCGGATGACATTTTACGCATTCGCAACACAGCCTCTCA



CACCAAAAACGGGACCCCTCAGAAAGGCCACTCAAAAGTTGAGTTTAACCTGAAT



GATTGTCATAAGATGATTGATTTCTTCAAATCATCAATTCAGAAACACCCGGAAT



GGGGGTCTTTTGGCTTTACGTTTTCTGATACCAGTGATTTTGAAGACATGAGTGC



CTTCTACCGGGAAGTAGAAAACCAGGGTTACGTAATTAGCTTTGACAAAATCAAA



GAGACCTATATACAGAGCCAGGTGGAACAGGGTAATCTCTACTTATTCCAGATTT



ATAACAAGGATTTCTCGCCCTACAGCAAAGGCAAACCAAACCTGCATACTCTGTA



CTGGAAAGCCCTGTTTGAAGAAGCGAACCTGAATAACGTAGTGGCGAAGTTGAAC



GGTGAAGCGGAAATCTTCTTCCGTCGTCACTCCATTAAGGCCTCTGATAAAGTTG



TCCATCCGGCAAATCAGGCCATTGATAATAAGAATCCACACACGGAAAAAACGCA



GTCAACCTTTGAATATGACCTCGTTAAAGACAAACGCTACACGCAAGATAAGTTC



TTTTTCCACGTCCCAATCAGCCTCAACTTTAAAGCACAAGGGGTTTCAAAGTTTA



ATGATAAAGTCAATGGGTTCCTCAAGGGCAACCCGGATGTCAACATTATAGGTAT



AGACAGGGGCGAACGCCATCTGCTTTACTTTACCGTAGTGAATCAGAAAGGTGAA



ATACTGGTTCAGGAATCATTAAATACCTTGATGTCGGACAAAGGGCACGTTAATG



ATTACCAGCAGAAACTGGATAAAAAAGAACAGGAACGTGATGCTGCGCGTAAATC



GTGGACCACGGTTGAGAACATTAAAGAGCTGAAAGAGGGGTATCTAAGCCATGTG



GTACACAAACTGGCGCACCTCATCATTAAATATAACGCAATAGTCTGCCTAGAAG



ACTTGAATTTTGGCTTTAAACGCGGCCGCTTCAAAGTGGAAAAACAAGTTTATCA



AAAATTTGAAAAGGCGCTTATAGATAAACTGAATTATCTGGTTTTTAAAGAAAAG



GAACTTGGTGAGGTAGGGCACTACTTGACAGCTTATCAACTGACGGCCCCGTTCG



AATCATTCAAAAAACTGGGCAAACAGTCTGGCATTCTGTTTTACGTGCCGGCAGA



TTATACTTCAAAAATCGATCCAACAACTGGCTTTGTGAACTTCCTGGACCTGAGA



TATCAGTCTGTAGAAAAAGCTAAACAACTTCTTAGCGATTTTAATGCCATTCGTT



TTAACAGCGTTCAGAATTACTTTGAATTCGAAATTGACTATAAAAAACTTACTCC



GAAACGTAAAGTCGGAACCCAAAGTAAATGGGTAATTTGTACGTATGGCGATGTC



AGGTATCAGAACCGTCGGAATCAAAAAGGTCATTGGGAGACCGAAGAAGTGAACG



TGACCGAAAAGCTGAAGGCTCTGTTCGCCAGCGATTCAAAAACTACAACTGTGAT



CGATTACGCAAATGATGATAACCTGATAGATGTGATTTTAGAGCAGGATAAAGCC



AGCTTTTTTAAAGAACTGTTGTGGCTCCTGAAACTTACGATGACCTTACGACATT



CCAAGATCAAATCGGAAGATGATTTTATTCTGTCACCGGTCAAGAATGAGCAGGG



TGAATTCTATGATAGTAGGAAAGCCGGCGAAGTGTGGCCGAAAGACGCCGACGCC



AATGGCGCCTATCATATCGCGCTCAAAGGGCTTTGGAATTTGCAGCAGATTAACC



AGTGGGAAAAAGGTAAAACCCTGAATCTGGCTATCAAAAACCAGGATTGGTTTAG



CTTTATCCAAGAGAAACCGTATCAGGAATGAGAAATCATCCTTAGCGAAAGCTAA



GGATTTTTTTTATCTGAAATTTATTATATCGCGTTGATTATTGATGCTGTTTTTA



GTTTTAACGGCAATTAATATATGTGTTATTAATTGAATGAATTTTATCATTCATA



ATAAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTATTACT



CAGGAAGCAAAGAGGATTACAGAATTATCTCATAACAAGTGTTAAGGGATGTTAT



TTCC





SEQ ID NO: 65
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT



TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC



TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA



CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC



ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC



GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAG



TAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAAC



TTTAAGAGGAGGATATACCATGCACCATCATCATCACCATCATACAGGCGGTCTT



CTTAGTATGGACGCGAAAGAGTTCACAGGTCAGTATCCGTTGTCGAAAACATTAC



GATTCGAACTTCGGCCCATCGGCCGCACGTGGGATAACCTGGAGGCCTCAGGCTA



CTTAGCGGAAGACCGCCATCGTGCCGAATGTTATCCTCGTGCGAAAGAGTTATTG



GATGACAACCATCGTGCCTTCCTGAATCGTGTGTTGCCACAAATCGATATGGATT



GGCACCCGATTGCGGAGGCCTTTTGTAAGGTACATAAAAACCCTGGTAATAAAGA



ACTTGCCCAGGATTACAACCTTCAGTTGTCAAAGCGCCGTAAGGAGATCAGCGCA



TATCTTCAGGATGCAGATGGCTATAAAGGCCTGTTCGCGAAGCCCGCCTTAGACG



AAGCTATGAAAATTGCGAAAGAAAACGGGAACGAAAGTGATATTGAGGTTCTCGA



AGCGTTTAACGGTTTTAGCGTATACTTCACCGGTTATCATGAGTCACGCGAGAAC



ATTTATAGCGATGAGGATATGGTGAGCGTAGCCTACCGAATTACTGAGGATAATT



TCCCGCGCTTTGTCTCAAACGCTTTGATCTTTGATAAATTAAACGAAAGCCATCC



GGATATTATCTCTGAAGTATCGGGCAATCTTGGAGTTGATGACATTGGTAAGTAC



TTTGACGTGTCGAACTATAACAATTTTCTTTCCCAGGCCGGTATAGATGACTACA



ATCACATTATTGGCGGCCATACAACCGAAGACGGACTGATACAAGCGTTTAATGT



CGTATTGAACTTACGTCACCAAAAAGACCCTGGCTTTGAAAAAATTCAGTTCAAA



CAGCTCTACAAACAAATCCTGAGCGTGCGTACCAGCAAAAGCTACATCCCGAAAC



AGTTTGACAACTCTAAGGAGATGGTTGACTGCATTTGCGATTATGTCAGCAAAAT



AGAGAAATCCGAAACAGTAGAACGGGCCCTGAAACTAGTCCGTAATATCAGTTCT



TTCGACTTGCGCGGGATCTTTGTCAATAAAAAGAACTTGCGCATACTGAGCAACA



AACTGATAGGAGATTGGGACGCGATCGAAACCGCATTGATGCATAGTTCTTCATC



AGAAAACGATAAGAAAAGCGTATATGATAGCGCGGAGGCTTTTACGTTGGATGAC



ATCTTTTCAAGCGTGAAAAAATTTTCTGATGCCTCTGCCGAAGATATTGGCAACA



GGGCGGAAGACATCTGTAGAGTGATAAGTGAGACGGCCCCTTTTATCAACGATCT



GCGAGCGGTGGACCTGGATAGCCTGAACGACGATGGTTATGAAGCGGCCGTCTCA



AAAATTCGGGAGTCGCTGGAGCCTTATATGGATCTTTTCCATGAACTGGAAATTT



TCTCGGTTGGCGATGAGTTCCCAAAATGCGCAGCATTTTACAGCGAACTGGAGGA



AGTCAGCGAACAGCTGATCGAAATTATTCCGTTATTCAACAAGGCGCGTTCGTTC



TGCACCCGGAAACGCTATAGCACCGATAAGATTAAAGTGAACTTAAAATTCCCGA



CCTTGGCGGACGGGTGGGACCTGAACAAAGAGAGAGACAACAAAGCCGCGATTCT



GCGGAAAGACGGTAAGTATTATCTGGCAATTCTGGATATGAAGAAAGATCTGTCA



AGCATTAGGACCAGCGACGAAGATGAATCCAGCTTCGAAAAGATGGAGTATAAAC



TGTTACCGAGTCCAGTAAAAATGCTGCCAAAGATATTCGTAAAATCGAAAGCCGC



TAAGGAAAAATATGGCCTGACAGATCGTATGCTTGAATGCTACGATAAAGGTATG



CATAAGTCGGGTAGTGCGTTTGATCTTGGCTTTTGCCATGAACTCATTGATTATT



ACAAGCGTTGTATCGCGGAGTACCCAGGCTGGGATGTGTTCGATTTCAAGTTTCG



CGAAACTTCCGATTATGGGTCCATGAAAGAGTTCAATGAAGATGTGGCCGGAGCC



GGTTACTATATGAGTCTGAGAAAAATTCCGTGCAGCGAAGTGTACCGTCTGTTAG



ACGAGAAATCGATTTATCTATTTCAAATTTATAACAAAGATTACTCTGAAAATGC



ACATGGTAATAAGAACATGCATACCATGTACTGGGAGGGTCTCTTTTCCCCGCAA



AACCTGGAGTCGCCCGTTTTCAAGTTGTCGGGTGGGGCAGAACTTTTCTTTCGAA



AATCCTCAATCCCTAACGATGCCAAAACAGTACACCCGAAAGGCTCAGTGCTGGT



TCCACGTAATGATGTTAACGGTCGGCGTATTCCAGATTCAATCTACCGCGAACTG



ACACGCTATTTTAACCGTGGCGATTGCCGAATCAGTGACGAAGCCAAAAGTTATC



TTGACAAGGTTAAGACTAAAAAAGCGGACCATGACATTGTGAAAGATCGCCGCTT



TACCGTGGATAAAATGATGTTCCACGTCCCGATTGCGATGAACTTTAAGGCGATC



AGTAAACCGAACTTAAACAAAAAAGTCATTGATGGCATCATTGATGATCAGGATC



TGAAAATCATTGGTATTGATCGTGGCGAGCGGAACTTAATTTACGTCACGATGGT



TGACAGAAAAGGGAATATCTTATATCAGGATTCTCTTAACATCCTCAATGGCTAC



GACTATCGTAAAGCTCTGGATGTGCGCGAATATGACAACAAGGAAGCGCGTCGTA



ACTGGACTAAAGTGGAGGGCATTCGCAAAATGAAGGAAGGCTATCTGTCATTAGC



GGTCTCGAAATTAGCGGATATGATTATCGAAAATAACGCCATCATCGTTATGGAG



GACCTGAACCACGGATTCAAAGCGGGCCGCTCAAAGATTGAAAAACAAGTTTATC



AGAAATTTGAGAGTATGCTGATTAACAAACTGGGCTATATGGTGTTAAAAGACAA



GTCAATTGACCAATCAGGTGGCGCGCTGCATGGATACCAGCTGGCGAACCATGTT



ACCACCTTAGCATCAGTTGGAAAGCAGTGTGGGGTTATCTTTTATATACCGGCAG



CGTTCACTAGTAAAATAGATCCGACCACTGGTTTCGCCGATCTCTTTGCCCTGAG



TAACGTTAAAAACGTAGCGAGCATGCGTGAATTCTTTTCCAAAATGAAATCTGTC



ATTTATGATAAAGCTGAAGGCAAATTCGCATTCACCTTTGATTACTTGGATTACA



ACGTGAAGAGCGAATGTGGTCGTACGCTGTGGACCGTTTACACCGTTGGTGAGCG



CTTCACCTATTCCCGTGTGAACCGCGAATATGTACGTAAAGTCCCCACCGATATT



ATCTATGATGCCCTCCAGAAAGCAGGCATTAGCGTCGAAGGAGACTTAAGGGACA



GAATTGCCGAAAGCGATGGCGATACGCTGAAGTCTATTTTTTACGCATTCAAATA



CGCGCTAGATATGCGCGTTGAGAATCGCGAGGAAGACTACATTCAATCACCTGTG



AAAAATGCCTCTGGGGAATTTTTTTGTTCAAAAAATGCTGGTAAAAGCCTCCCAC



AAGATAGCGATGCAAACGGTGCATATAACATTGCCCTGAAAGGTATTCTTCAATT



ACGCATGCTGTCTGAGCAGTACGACCCCAACGCGGAATCTATTAGACTTCCGCTG



ATAACCAATAAAGCCTGGCTGACATTCATGCAGTCTGGCATGAAGACCTGGAAAA



ATTAGGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGG



AGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGA



TTACA





SEQ ID NO: 66
AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAA



GCATTGATAATTGAGATCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGA



CAAAAATAAATTATTTATTTATCCAGAAAATGAATTGGAAAATCAGGAGAGCGTT



TTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAGGTtgccgtcactgcgtc



ttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattc



tgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataa



tcacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgcc



atagcatttttatccataagattagcggatcctacctgacgctttttatcgcaac



tctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtg



agataggcggagatacgaactttaagAAGGAGatataccatgGATAGTTTGAAAG



ATTTCACCAATCTGTACCCTGTCAGTAAGACATTGAGATTTGAATTAAAGCCCGT



TGGAAAGACTTTAGAAAATATCGAGAAAGCAGGTATTTTGAAAGAGGATGAGCAT



CGTGCAGAAAGTTATCGGAGGGTGAAGAAAATAATTGATACTTATCATAAGGTAT



TTATCGATTCTTCTCTTGAAAATATGGCTAAAATGGGTATTGAGAATGAAATAAA



AGCAATGCTCCAAAGTTTCTGCGAATTGTATAAAAAAGATCATCGCACTGAGGGT



GAAGACAAGGCATTAGATAAAATTCGAGCAGTACTTCGTGGCCTGATTGTTGGGG



CTTTCACTGGTGTTTGCGGAAGACGGGAAAATACAGTCCAAAACGAGAAGTACGA



GAGTTTGTTCAAAGAAAAGTTGATAAAAGAAATTTTACCTGATTTTGTGCTCTCT



ACTGAGGCTGAAAGCTTGCCTTTCTCTGTTGAAGAAGCTACGAGGTCACTGAAGG



AGTTTGATAGCTTTACATCCTACTTTGCTGGTTTTTACGAGAATAGAAAGAATAT



ATACTCGACGAAACCTCAATCCACTGCCATTGCTTATCGTCTTATTCATGAGAAC



TTGCCGAAGTTCATTGATAATATTCTTGTTTTTCAGAAGATCAAAGAGCCTATAG



CCAAAGAGCTGGAACATATTCGTGCGGACTTTTCTGCCGGGGGGTACATAAAAAA



GGATGAGAGATTGGAGGATATTTTTTCGTTGAACTATTATATCCACGTGTTATCT



CAGGCTGGGATCGAAAAATATAACGCATTGATTGGGAAGATTGTGACAGAAGGAG



ATGGAGAGATGAAAGGGCTCAATGAACACATCAACCTTTACAACCAACAAAGAGG



CAGAGAGGATCGGCTCCCTCTTTTTAGGCCTCTTTATAAACAGATATTGAGTGAC



AGAGAGCAATTATCATACTTGCCTGAGAGTTTTGAAAAAGATGAGGAGCTCCTCA



GGGCTCTAAAAGAGTTCTATGATCATATCGCAGAAGACATTCTCGGACGTACTCA



ACAGTTGATGACTTCTATTTCAGAATATGATTTATCTCGGATATACGTAAGGAAC



GATAGCCAATTGACTGATATATCAAAAAAAATGTTGGGAGATTGGAATGCTATCT



ACATGGCTAGAGAACGAGCATATGACCACGAGCAGGCTCCCAAAAGAATCACGGC



GAAATACGAGAGGGACAGGATTAAAGCTCTTAAAGGAGAAGAGAGTATAAGTCTG



GCAAATCTTAATAGTTGTATTGCCTTTCTGGACAATGTTAGAGATTGCCGTGTAG



ATACTTATCTTTCCACACTGGGCCAGAAGGAAGGACCACATGGTCTATCTAATCT



CGTTGAGAACGTTTTTGCCTCATACCATGAAGCAGAGCAATTGTTGAGCTTTCCA



TACCCCGAAGAGAATAATCTGATTCAGGACAAGGACAATGTGGTGTTAATTAAGA



ATCTTCTCGACAATATCAGTGATCTGCAGAGGTTCTTGAAACCTCTTTGGGGTAT



GGGAGACGAACCCGATAAAGATGAAAGATTTTATGGAGAGTATAATTATATCCGA



GGAGCTCTAGATCAGGTGATCCCTCTGTACAATAAGGTAAGGAACTACCTCACTC



GGAAGCCTTATTCGACCAGAAAAGTAAAACTCAATTTTGGGAATTCTCAATTGCT



TAGTGGTTGGGATAGAAATAAGGAAAAGGATAATAGCTGTGTGATTTTGCGTAAG



GGGCAGAACTTCTATTTGGCTATTATGAACAATAGGCACAAAAGAAGTTTCGAAA



ACAAGGTGTTGCCCGAGTATAAGGAGGGAGAACCTTACTTCGAAAAGATGGATTA



TAAATTTTTGCCTGATCCTAATAAAATGCTTCCTAAGGTTTTTCTTTCGAAAAAA



GGAATAGAGATATACAAACCAAGTCCGAAGCTTTTAGAACAATATGGACATGGAA



CTCACAAAAAGGGAGATACCTTTAGTATGGATGATTTGCACGAACTGATCGATTT



CTTCAAACACTCAATCGAGGCTCATGAAGATTGGAAGCAATTCGGATTCAAATTT



TCTGATACGGCTACTTATGAGAATGTATCTAGTTTCTATAGAGAAGTTGAGGATC



AGGGGTATAAGCTCTCTTTCCGAAAAGTTTCGGAATCTTATGTCTATTCATTAAT



AGATCAAGGCAAGTTGTATTTATTTCAGATATACAACAAGGACTTTTCTCCCTGC



AGCAAAGGGACACCTAATCTGCATACCTTGTATTGGAGAATGCTTTTTGACGAGC



GCAATTTGGCAGATGTCATATACAAACTGGATGGGAAGGCTGAAATCTTTTTCCG



AGAGAAGAGTTTGAAAAATGATCATCCCACGCATCCGGCTGGTAAGCCTATCAAA



AAGAAAAGTCGACAAAAAAAAGGAGAGGAGAGTCTGTTTGAGTATGATTTAGTCA



AGGATAGGCACTATACGATGGATAAGTTCCAGTTTCATGTGCCTATTACTATGAA



TTTTAAATGTTCTGCAGGAAGCAAAGTCAATGATATGGTTAATGCTCATATTCGA



GAGGCAAAGGATATGCATGTCATTGGAATTGATCGTGGAGAACGCAATCTGCTGT



ATATATGCGTGATAGATAGTCGAGGGACGATTTTGGATCAAATTTCTCTGAATAC



GATTAACGATATAGACTATCATGATTTATTGGAGAGTCGAGACAAAGACCGTCAG



CAGGAGCGCCGAAACTGGCAAACTATCGAAGGGATCAAGGAGCTAAAACAAGGCT



ACCTTAGTCAGGCGGTTCATCGGATAGCCGAACTGATGGTGGCTTATAAGGCTGT



AGTTGCTTTGGAGGATTTGAATATGGGGTTCAAACGTGGGCGGCAGAAAGTAGAA



AGTTCTGTTTATCAGCAGTTTGAGAAACAGCTGATAGATAAGCTCAACTATCTTG



TGGACAAGAAGAAAAGGCCTGAAGATATTGGAGGATTGTTGAGAGCCTATCAATT



TACGGCCCCATTTAAGAGTTTTAAGGAAATGGGAAAGCAAAACGGCTTCTTGTTT



TATATCCCGGCTTGGAACACGAGCAACATAGATCCGACTACTGGATTTGTTAATT



TATTTCATGCCCAGTATGAAAATGTAGATAAAGCGAAGAGCTTCTTTCAAAAGTT



TGATTCAATTAGTTACAACCCGAAGAAAGACTGGTTTGAGTTTGCATTCGATTAT



AAAAACTTTACTAAAAAGGCTGAAGGAAGTCGTTCTATGTGGATATTATGCACAC



ATGGTTCCCGAATAAAGAATTTTAGAAATTCCCAGAAGAATGGTCAATGGGATTC



CGAAGAATTCGCCTTGACGGAGGCTTTTAAGTCTCTTTTTGTGCGATATGAGATA



GATTATACCGCTGATTTGAAAACAGCTATTGTGGACGAAAAGCAAAAAGACTTCT



TCGTGGATCTTCTGAAGCTATTCAAATTGACAGTACAGATGCGCAACAGCTGGAA



AGAGAAGGATTTGGATTATCTAATCTCTCCTGTAGCAGGGGCTGATGGCCGTTTC



TTCGATACAAGAGAGGGAAATAAAAGTCTGCCTAAGGATGCAGATGCCAATGGAG



CTTATAATATTGCCCTAAAAGGACTTTGGGCTCTACGCCAGATTCGGCAAACTTC



AGAAGGCGGTAAACTCAAATTGGCGATTTCCAATAAGGAATGGCTACAGTTTGTG



CAAGAGAGATCTTACGAGAAAGACtgaGAAATCATCCTTAGCGAAAGCTAAGGAT



TTTTTTTATCTGAAATTTATTATATCGCGTTGATTATTGATGCTGTTTTTAGTTT



TAACGGCAATTAATATATGTGTTATTAATTGAATGAATTTTATCATTCATAATAA



GTATGTGTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGG



AAGCAAAGAGGATTACAGAATTATCTCATAACAAGTGTTAAGGGATGTTATTTCC





SEQ ID NO: 67
AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAA



GCATTGATAATTGAGATCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGA



CAAAAATAAATTATTTATTTATCCAGAAAATGAATTGGAAAATCAGGAGAGCGTT



TTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAGGTtgccgtcactgcgtc



ttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattc



tgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataa



tcacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgcc



atagcatttttatccataagattagcggatcctacctgacgctttttatcgcaac



tctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtg



agataggcggagatacgaactttaagAAGGAGatataccATGAACAACGGCACAA



ATAATTTTCAGAACTTCATCGGGATCTCAAGTTTGCAGAAAACGCTGCGCAATGC



TCTGATCCCCACGGAAACCACGCAACAGTTCATCGTCAAGAACGGAATAATTAAA



GAAGATGAGTTACGTGGCGAGAACCGCCAGATTCTGAAAGATATCATGGATGACT



ACTACCGCGGATTCATCTCTGAGACTCTGAGTTCTATTGATGACATAGATTGGAC



TAGCCTGTTCGAAAAAATGGAAATTCAGCTGAAAAATGGTGATAATAAAGATACC



TTAATTAAGGAACAGACAGAGTATCGGAAAGCAATCCATAAAAAATTTGCGAACG



ACGATCGGTTTAAGAACATGTTTAGCGCCAAACTGATTAGTGACATATTACCTGA



ATTTGTCATCCACAACAATAATTATTCGGCATCAGAGAAAGAGGAAAAAACCCAG



GTGATAAAATTGTTTTCGCGCTTTGCGACTAGCTTTAAAGATTACTTCAAGAACC



GTGCAAATTGCTTTTCAGCGGACGATATTTCATCAAGCAGCTGCCATCGCATCGT



CAACGACAATGCAGAGATATTCTTTTCAAATGCGCTGGTCTACCGCCGGATCGTA



AAATCGCTGAGCAATGACGATATCAACAAAATTTCGGGCGATATGAAAGATTCAT



TAAAAGAAATGAGTCTGGAAGAAATATATTCTTACGAGAAGTATGGGGAATTTAT



TACCCAGGAAGGCATTAGCTTCTATAATGATATCTGTGGGAAAGTGAATTCTTTT



ATGAACCTGTATTGTCAGAAAAATAAAGAAAACAAAAATTTATACAAACTTCAGA



AACTTCACAAACAGATTCTATGCATTGCGGACACTAGCTATGAGGTCCCGTATAA



ATTTGAAAGTGACGAGGAAGTGTACCAATCAGTTAACGGCTTCCTTGATAACATT



AGCAGCAAACATATAGTCGAAAGATTACGCAAAATCGGCGATAACTATAACGGCT



ACAACCTGGATAAAATTTATATCGTGTCCAAATTTTACGAGAGCGTTAGCCAAAA



AACCTACCGCGACTGGGAAACAATTAATACCGCCCTCGAAATTCATTACAATAAT



ATCTTGCCGGGTAACGGTAAAAGTAAAGCCGACAAAGTAAAAAAAGCGGTTAAGA



ATGATTTACAGAAATCCATCACCGAAATAAATGAACTAGTGTCAAACTATAAGCT



GTGCAGTGACGACAACATCAAAGCGGAGACTTATATACATGAGATTAGCCATATC



TTGAATAACTTTGAAGCACAGGAATTGAAATACAATCCGGAAATTCACCTAGTTG



AATCCGAGCTCAAAGCGAGTGAGCTTAAAAACGTGCTGGACGTGATCATGAATGC



GTTTCATTGGTGTTCGGTTTTTATGACTGAGGAACTTGTTGATAAAGACAACAAT



TTTTATGCGGAACTGGAGGAGATTTACGATGAAATTTATCCAGTAATTAGTCTGT



ACAACCTGGTTCGTAACTACGTTACCCAGAAACCGTACAGCACGAAAAAGATTAA



ATTGAACTTTGGAATACCGACGTTAGCAGACGGTTGGTCAAAGTCCAAAGAGTAT



TCTAATAACGCTATCATACTGATGCGCGACAATCTGTATTATCTGGGCATCTTTA



ATGCGAAGAATAAACCGGACAAGAAGATTATCGAGGGTAATACGTCAGAAAATAA



GGGTGACTACAAAAAGATGATTTATAATTTGCTCCCGGGTCCCAACAAAATGATC



CCGAAAGTTTTCTTGAGCAGCAAGACGGGGGTGGAAACGTATAAACCGAGCGCCT



ATATCCTAGAGGGGTATAAACAGAATAAACATATCAAGTCTTCAAAAGACTTTGA



TATCACTTTCTGTCATGATCTGATCGACTACTTCAAAAACTGTATTGCAATTCAT



CCCGAGTGGAAAAACTTCGGTTTTGATTTTAGCGACACCAGTACTTATGAAGACA



TTTCCGGGTTTTATCGTGAGGTAGAGTTACAAGGTTACAAGATTGATTGGACATA



CATTAGCGAAAAAGACATTGATCTGCTGCAGGAAAAAGGTCAACTGTATCTGTTC



CAGATATATAACAAAGATTTTTCGAAAAAATCAACCGGGAATGACAACCTTCACA



CCATGTACCTGAAAAATCTTTTCTCAGAAGAAAATCTTAAGGATATCGTCCTGAA



ACTTAACGGCGAAGCGGAAATCTTCTTCAGGAAGAGCAGCATAAAGAACCCAATC



ATTCATAAAAAAGGCTCGATTTTAGTCAACCGTACCTACGAAGCAGAAGAAAAAG



ACCAGTTTGGCAACATTCAAATTGTGCGTAAAAATATTCCGGAAAACATTTATCA



GGAGCTGTACAAATACTTCAACGATAAAAGCGACAAAGAGCTGTCTGATGAAGCA



GCCAAACTGAAGAATGTAGTGGGACACCACGAGGCAGCGACGAATATAGTCAAGG



ACTATCGCTACACGTATGATAAATACTTCCTTCATATGCCTATTACGATCAATTT



CAAAGCCAATAAAACGGGTTTTATTAATGATAGGATCTTACAGTATATCGCTAAA



GAAAAAGACTTACATGTGATCGGCATTGATCGGGGCGAGCGTAACCTGATCTACG



TGTCCGTGATTGATACTTGTGGTAATATAGTTGAACAGAAAAGCTTTAACATTGT



AAACGGCTACGACTATCAGATAAAACTGAAACAACAGGAGGGCGCTAGACAGATT



GCGCGGAAAGAATGGAAAGAAATTGGTAAAATTAAAGAGATCAAAGAGGGCTACC



TGAGCTTAGTAATCCACGAGATCTCTAAAATGGTAATCAAATACAATGCAATTAT



AGCGATGGAGGATTTGTCTTATGGTTTTAAAAAAGGGCGCTTTAAGGTCGAACGG



CAAGTTTACCAGAAATTTGAAACCATGCTCATCAATAAACTCAACTATCTGGTAT



TTAAAGATATTTCGATTACCGAGAATGGCGGTCTCCTGAAAGGTTATCAGCTGAC



ATACATTCCTGATAAACTTAAAAACGTGGGTCATCAGTGCGGCTGCATTTTTTAT



GTGCCTGCTGCATACACGAGCAAAATTGATCCGACCACCGGCTTTGTGAATATCT



TTAAATTTAAAGACCTGACAGTGGACGCAAAACGTGAATTCATTAAAAAATTTGA



CTCAATTCGTTATGACAGTGAAAAAAATCTGTTCTGCTTTACATTTGACTACAAT



AACTTTATTACGCAAAACACGGTCATGAGCAAATCATCGTGGAGTGTGTATACAT



ACGGCGTGCGCATCAAACGTCGCTTTGTGAACGGCCGCTTCTCAAACGAAAGTGA



TACCATTGACATAACCAAAGATATGGAGAAAACGTTGGAAATGACGGACATTAAC



TGGCGCGATGGCCACGATCTTCGTCAAGACATTATAGATTATGAAATTGTTCAGC



ACATATTCGAAATTTTCCGTTTAACAGTGCAAATGCGTAACTCCTTGTCTGAACT



GGAGGACCGTGATTACGATCGTCTCATTTCACCTGTACTGAACGAAAATAACATT



TTTTATGACAGCGCGAAAGCGGGGGATGCACTTCCTAAGGATGCCGATGCAAATG



GTGCGTATTGTATTGCATTAAAAGGGTTATATGAAATTAAACAAATTACCGAAAA



TTGGAAAGAAGATGGTAAATTTTCGCGCGATAAACTCAAAATCAGCAATAAAGAT



TGGTTCGACTTTATCCAGAATAAGCGCTATCTCTAAGAAATCATCCTTAGCGAAA



GCTAAGGATTTTTTTTATCTGAAATTTATTATATCGCGTTGATTATTGATGCTGT



TTTTAGTTTTAACGGCAATTAATATATGTGTTATTAATTGAATGAATTTTATCAT



TCATAATAAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTA



TTACTCAGGAAGCAAAGAGGATTACAGAATTATCTCATAACAAGTGTTAAGGGAT



GTTATTTCC





SEQ ID NO: 68
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT



TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC



TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA



CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC



ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC



GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAG



TAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAAC



TTTAAGAGGAGGATATACCATGCACCATCATCATCACCATACCAATAAATTCACT



AACCAGTATTCTCTCTCTAAGACCCTGCGCTTTGAACTGATTCCGCAGGGGAAAA



CCTTGGAGTTCATTCAAGAAAAAGGCCTCTTGTCTCAGGATAAACAGAGGGCTGA



ATCTTACCAAGAAATGAAGAAAACTATTGATAAGTTTCATAAATATTTCATTGAT



TTAGCCTTGTCTAACGCCAAATTAACTCACTTGGAAACGTATCTGGAGTTATACA



ACAAATCTGCCGAAACTAAGAAAGAACAGAAATTTAAAGACGATTTGAAAAAAGT



ACAGGACAATCTGCGTAAAGAAATTGTCAAATCCTTCAGTGACGGCGATGCTAAA



AGCATTTTTGCCATTCTGGACAAAAAAGAGTTGATTACTGTGGAATTAGAAAAGT



GGTTTGAAAACAATGAGCAGAAAGACATCTACTTCGATGAGAAATTCAAAACTTT



CACCACCTATTTTACAGGATTTCATCAAAACCGGAAGAACATGTACTCAGTAGAA



CCGAACTCCACGGCCATTGCGTATCGTTTGATCCATGAGAATCTGCCTAAATTTC



TGGAGAATGCGAAAGCCTTTGAAAAGATTAAGCAGGTCGAATCGCTGCAAGTGAA



TTTTCGTGAACTCATGGGCGAATTTGGTGACGAAGGTCTAATCTTCGTTAACGAA



CTGGAAGAAATGTTTCAGATTAATTACTACAATGACGTGCTATCGCAGAACGGTA



TCACAATCTACAATAGTATTATCTCAGGGTTCACAAAAAACGATATAAAATACAA



AGGCCTGAACGAGTATATCAATAACTACAACCAAACAAAGGACAAAAAGGATAGG



CTTCCGAAACTGAAGCAGTTATACAAACAGATTTTATCTGACAGAATCTCCCTGA



GCTTTCTGCCGGATGCTTTCACTGATGGGAAGCAGGTTCTGAAAGCGATTTTCGA



TTTTTATAAGATTAACTTACTGAGCTACACGATTGAAGGTCAAGAAGAATCTCAA



AACTTACTGCTCTTGATCCGTCAAACCATTGAAAATCTATCATCGTTCGATACGC



AGAAAATCTACCTCAAAAACGATACTCACCTGACTACGATCTCTCAGCAGGTTTT



CGGGGATTTTAGTGTATTTTCAACAGCTCTGAACTACTGGTATGAAACCAAAGTC



AATCCGAAATTCGAGACGGAATATTCTAAGGCCAACGAAAAAAAACGTGAGATTC



TTGATAAAGCTAAAGCCGTATTTACTAAACAGGATTACTTTTCTATTGCTTTCCT



GCAGGAAGTTTTATCGGAGTATATCCTGACCCTGGATCATACATCTGATATCGTT



AAAAAACACAGCAGCAATTGCATCGCTGACTATTTCAAAAACCACTTTGTCGCCA



AAAAAGAAAACGAAACAGACAAGACTTTCGATTTCATTGCTAACATCACCGCAAA



ATACCAGTGTATTCAGGGTATCTTGGAAAACGCCGACCAATACGAAGACGAACTG



AAACAAGATCAGAAGCTGATCGATAATTTAAAATTCTTCTTAGATGCAATCCTGG



AGCTGCTGCACTTCATCAAACCGCTTCATTTAAAGAGCGAGTCCATTACCGAAAA



GGACACCGCCTTCTATGACGTTTTTGAAAATTATTATGAAGCCCTCTCCTTGCTG



ACTCCGCTGTATAATATGGTACGCAATTACGTAACCCAGAAACCATATTCTACCG



AAAAAATTAAACTGAACTTTGAAAACGCACAGCTGCTCAACGGTTGGGACGCGAA



TAAAGAAGGTGACTACCTCACCACCATCCTGAAAAAAGATGGTAACTATTTTCTG



GCAATTATGGATAAGAAACATAATAAAGCATTCCAGAAATTTCCTGAAGGGAAAG



AAAATTACGAAAAGATGGTGTACAAACTCTTACCTGGAGTTAACAAAATGTTGCC



GAAAGTATTTTTTAGTAATAAGAACATCGCGTACTTTAACCCGTCCAAAGAACTG



CTGGAAAATTATAAAAAGGAGACGCATAAGAAAGGGGATACCTTTAACCTGGAAC



ATTGCCATACCTTAATAGACTTCTTCAAGGATTCCCTGAATAAACACGAGGATTG



GAAATATTTCGATTTTCAGTTTAGTGAGACCAAGTCATACCAGGATCTTAGCGGC



TTTTATCGCGAAGTAGAACACCAAGGCTATAAAATTAACTTCAAAAACATCGACA



GCGAATACATCGACGGTTTAGTTAACGAGGGCAAACTGTTTCTGTTCCAGATCTA



TTCAAAGGATTTTAGCCCGTTCTCTAAAGGCAAACCAAATATGCATACGTTGTAC



TGGAAAGCACTGTTTGAAGAGCAAAACCTGCAGAATGTGATTTATAAACTGAACG



GCCAAGCTGAGATTTTTTTCCGTAAAGCCTCGATTAAACCGAAAAATATCATCCT



TCATAAGAAGAAAATAAAGATCGCTAAAAAACACTTCATAGATAAAAAAACCAAA



ACCTCCGAAATAGTGCCTGTTCAAACAATTAAGAACTTGAATATGTACTACCAGG



GCAAGATATCGGAAAAGGAGTTGACTCAAGACGATCTTCGCTATATCGATAACTT



TTCGATTTTTAACGAAAAAAACAAGACGATCGACATCATCAAAGATAAACGCTTC



ACTGTAGATAAGTTCCAGTTTCATGTGCCGATTACTATGAACTTCAAAGCTACCG



GGGGTAGCTATATCAACCAAACGGTGTTGGAATACCTGCAGAATAACCCGGAAGT



CAAAATCATTGGGCTGGACCGCGGAGAACGTCACCTTGTGTACTTGACCTTAATC



GATCAGCAAGGCAACATCTTAAAACAAGAATCGCTGAATACCATTACGGATTCAA



AGATTAGCACCCCGTATCATAAGCTGCTCGATAACAAGGAGAATGAGCGCGACCT



GGCCCGTAAAAACTGGGGCACGGTGGAAAACATTAAGGAGTTAAAGGAGGGTTAT



ATTTCCCAGGTAGTGCATAAGATCGCCACTCTCATGCTCGAGGAAAATGCGATCG



TTGTCATGGAAGACTTAAACTTCGGATTTAAACGTGGGCGATTTAAAGTAGAGAA



ACAAATCTACCAGAAGTTAGAAAAAATGCTGATTGACAAATTAAATTACTTGGTC



CTAAAAGACAAACAGCCGCAAGAATTGGGTGGATTATACAACGCCCTCCAACTTA



CCAATAAATTCGAAAGTTTTCAGAAAATGGGTAAACAGTCAGGCTTTCTTTTTTA



TGTTCCTGCGTGGAACACATCCAAAATCGACCCTACAACCGGCTTCGTCAATTAC



TTCTATACTAAATATGAAAACGTCGACAAAGCAAAAGCATTCTTTGAAAAGTTCG



AAGCAATACGTTTTAACGCTGAGAAAAAATATTTCGAGTTCGAAGTCAAGAAATA



CTCAGACTTTAACCCCAAAGCTGAGGGCACACAGCAAGCGTGGACAATCTGCACC



TACGGCGAGCGCATCGAAACGAAGCGTCAAAAAGATCAGAATAACAAATTTGTTT



CAACACCTATCAACCTGACCGAGAAGATTGAAGACTTCTTAGGTAAAAATCAGAT



TGTTTATGGCGACGGTAACTGTATAAAATCTCAAATAGCCTCAAAGGATGATAAA



GCATTTTTCGAAACATTATTATATTGGTTCAAAATGACACTGCAGATGCGCAATA



GTGAGACGCGTACAGATATTGATTATCTTATCAGCCCGGTCATGAACGACAACGG



TACTTTTTACAACTCCAGAGACTATGAAAAACTTGAGAATCCAACTCTCCCCAAA



GATGCTGATGCGAACGGTGCTTATCACATCGCGAAAAAAGGTCTGATGCTGCTGA



ACAAAATCGACCAAGCCGATCTGACTAAGAAAGTTGACCTAAGCATTTCAAATCG



GGACTGGTTACAGTTTGTTCAAAAGAACAAATGAGAAATCATCCTTAGCGAAAGC



TAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCA



GGAAGTTATTACTCAGGAAGCAAAGAGGATTACA





SEQ ID NO: 69
AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAA



GCATTGATAATTGAGATCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGA



CAAAAATAAATTATTTATTTATCCAGAAAATGAATTGGAAAATCAGGAGAGCGTT



TTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAGGTtgccgtcactgcgtc



ttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattc



tgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataa



tcacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgcc



atagcatttttatccataagattagcggatcctacctgacgctttttatcgcaac



tctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtg



agataggcggagatacgaactttaagAAGGAGatataccATGGAACAGGAATATT



ATCTGGGCTTGGACATGGGCACCGGTTCCGTCGGCTGGGCTGTTACTGACAGTGA



ATATCACGTTCTAAGAAAGCATGGTAAGGCATTGTGGGGTGTAAGACTTTTCGAA



TCTGCTTCCACTGCTGAAGAGCGTAGAATGTTTAGAACGAGTCGACGTAGGCTAG



ACAGGCGCAATTGGAGAATCGAAATTTTACAAGAAATTTTTGCGGAAGAGATATC



TAAGAAAGACCCAGGCTTTTTCCTGAGAATGAAGGAATCTAAGTATTACCCTGAG



GATAAAAGAGATATAAATGGTAACTGTCCCGAATTGCCTTACGCATTATTTGTGG



ACGATGATTTTACCGATAAGGATTACCATAAAAAGTTCCCAACTATCTACCATTT



ACGCAAAATGTTAATGAATACAGAGGAAACCCCAGACATAAGACTAGTTTATCTG



GCAATACACCATATGATGAAACATAGAGGCCATTTCTTACTTTCCGGGGATATCA



ACGAAATCAAAGAGTTTGGTACCACATTTAGTAAGTTACTGGAAAACATAAAGAA



TGAAGAATTGGATTGGAACTTAGAACTCGGAAAAGAAGAATACGCGGTTGTCGAA



TCTATCCTGAAGGATAATATGCTGAATAGGTCGACCAAAAAAACTAGGCTGATCA



AAGCACTGAAAGCCAAATCTATCTGCGAAAAAGCTGTTTTAAATTTACTTGCTGG



TGGCACTGTTAAGTTATCAGACATTTTTGGTTTGGAAGAATTGAACGAAACCGAG



CGTCCAAAAATTAGTTTCGCTGATAATGGCTACGATGATTACATTGGTGAGGTGG



AAAACGAGTTGGGCGAACAATTTTATATTATAGAGACAGCTAAGGCAGTCTATGA



CTGGGCTGTTTTAGTAGAAATCCTTGGTAAATACACATCTATCTCCGAAGCGAAA



GTTGCTACTTACGAAAAGCACAAGTCCGATCTCCAGTTTTTGAAGAAAATTGTCA



GGAAATATCTGACTAAGGAAGAATATAAAGATATTTTCGTTAGTACCTCTGACAA



ACTGAAAAATTACTCCGCTTACATCGGGATGACCAAGATTAATGGCAAAAAAGTT



GATCTGCAAAGCAAAAGGTGTTCGAAGGAAGAATTTTATGATTTCATTAAAAAGA



ATGTCTTAAAAAAATTAGAAGGTCAGCCAGAATACGAATATTTGAAAGAAGAACT



GGAAAGAGAGACATTCTTACCAAAACAAGTCAACAGAGATAATGGGGTAATTCCA



TATCAAATTCACCTCTACGAATTAAAAAAAATTTTAGGCAATTTACGCGATAAAA



TTGACCTTATCAAAGAAAATGAGGATAAGCTGGTTCAACTCTTTGAATTCAGAAT



ACCCTATTATGTGGGCCCACTGAACAAGATTGATGACGGCAAAGAAGGTAAATTC



ACATGGGCCGTCCGCAAATCCAATGAAAAAATTTACCCATGGAACTTTGAAAATG



TAGTAGATATTGAAGCGTCTGCGGAGAAATTTATTCGAAGAATGACTAATAAATG



CACTTACTTGATGGGAGAGGATGTTCTGCCTAAAGACAGCTTATTATACAGCAAG



TACATGGTTCTAAACGAACTTAACAACGTTAAGTTGGACGGTGAGAAATTAAGTG



TAGAATTGAAACAAAGATTGTATACTGACGTCTTCTGCAAGTACAGAAAAGTGAC



AGTTAAAAAAATTAAGAATTACTTGAAGTGCGAAGGTATAATTTCTGGAAACGTA



GAGATTACTGGTATTGATGGTGATTTCAAAGCATCCCTAACAGCTTACCACGATT



TCAAGGAAATCCTGACAGGAACTGAACTCGCAAAAAAAGATAAAGAAAACATTAT



TACTAATATTGTTCTTTTCGGTGATGACAAGAAATTGTTGAAGAAAAGACTGAAT



AGACTTTACCCCCAGATTACTCCCAATCAACTTAAGAAAATTTGTGCTTTGTCTT



ACACAGGATGGGGTCGTTTTTCAAAAAAGTTCTTAGAAGAGATTACCGCACCTGA



TCCAGAAACAGGCGAAGTATGGAATATAATTACCGCCTTATGGGAATCGAACAAT



AATCTTATGCAACTTCTGAGCAATGAATATCGTTTCATGGAAGAAGTTGAGACTT



ACAACATGGGCAAACAGACGAAGACTTTATCCTATGAAACTGTGGAAAATATGTA



TGTATCACCTTCTGTCAAGAGACAAATTTGGCAAACCTTAAAAATTGTCAAAGAA



TTAGAAAAGGTAATGAAGGAGTCTCCTAAACGTGTGTTTATTGAAATGGCTAGAG



AAAAACAAGAGTCAAAAAGAACCGAGTCAAGAAAGAAGCAGTTAATCGATTTATA



TAAGGCTTGTAAAAACGAAGAGAAAGATTGGGTTAAAGAATTGGGGGACCAAGAG



GAACAAAAACTACGGTCGGATAAGTTGTATTTATACTATACGCAAAAGGGACGAT



GTATGTATTCCGGCGAGGTAATAGAATTGAAGGATTTATGGGACAATACAAAATA



TGACATAGACCATATATATCCCCAATCAAAAACGATGGACGATAGCTTGAACAAT



AGAGTACTCGTGAAAAAAAAATATAATGCGACCAAATCTGATAAGTATCCTCTGA



ATGAAAATATCAGACATGAAAGAAAGGGGTTCTGGAAGTCCTTGTTAGATGGTGG



GTTTATAAGCAAAGAAAAGTACGAGCGTCTAATAAGAAACACGGAGTTATCGCCA



GAAGAACTCGCTGGTTTTATTGAGAGGCAAATCGTGGAAACGAGACAATCTACCA



AAGCCGTTGCTGAGATCCTAAAGCAAGTTTTCCCAGAGTCGGAGATTGTCTATGT



CAAAGCTGGCACAGTGAGCAGGTTTAGGAAAGACTTCGAACTATTAAAGGTAAGA



GAAGTGAACGATTTACATCACGCAAAGGACGCTTACCTAAATATCGTTGTAGGTA



ACTCATATTATGTTAAATTTACCAAGAACGCCTCTTGGTTTATAAAGGAGAACCC



AGGTAGAACATATAACCTGAAAAAGATGTTCACCTCTGGTTGGAATATTGAGAGA



AACGGAGAAGTCGCATGGGAAGTTGGTAAGAAAGGGACTATAGTGACAGTAAAGC



AAATTATGAACAAAAATAATATCCTCGTTACAAGGCAGGTTCATGAAGCAAAGGG



CGGCCTTTTTGACCAACAAATTATGAAGAAAGGGAAAGGTCAAATTGCAATAAAA



GAAACCGATGAGAGACTAGCGTCAATAGAAAAGTATGGTGGCTATAATAAAGCTG



CGGGTGCATACTTTATGCTTGTTGAATCAAAAGACAAGAAAGGTAAGACTATTAG



AACTATAGAATTTATACCCCTGTACCTTAAAAACAAAATTGAATCGGATGAGTCA



ATCGCGTTAAATTTTCTAGAGAAAGGAAGGGGTTTAAAAGAACCAAAGATCCTGT



TAAAAAAGATTAAGATTGACACCTTGTTCGATGTAGATGGATTTAAAATGTGGTT



ATCTGGCAGAACAGGCGATAGACTTTTGTTTAAGTGCGCTAATCAATTAATTTTG



GATGAGAAAATCATTGTCACAATGAAAAAAATAGTTAAGTTTATTCAGAGAAGAC



AAGAAAACAGGGAGTTGAAATTATCTGATAAAGATGGTATCGACAATGAAGTTTT



AATGGAAATCTACAATACATTCGTTGATAAACTTGAAAATACCGTATATCGAATC



AGGTTAAGTGAACAAGCCAAAACATTAATTGATAAACAAAAAGAATTTGAAAGGC



TATCACTGGAAGACAAATCCTCCACCCTATTTGAAATTTTGCATATATTCCAGTG



CCAATCTTCAGCAGCTAATTTAAAAATGATTGGCGGACCTGGGAAAGCCGGCATC



CTAGTGATGAACAATAATATCTCCAAGTGTAACAAAATATCAATTATTAACCAAT



CTCCGACAGGTATTTTTGAAAATGAAATAGACTTGCTTAAGATATAAGAAATCAT



CCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATTTATTATATCGCGTTGATT



ATTGATGCTGTTTTTAGTTTTAACGGCAATTAATATATGTGTTATTAATTGAATG



AATTTTATCATTCATAATAAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCAC



TCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACAGAATTATCTCATAACAAG



TGTTAAGGGATGTTATTTCC





SEQ ID NO: 70
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT



TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC



TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA



CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC



ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC



GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAG



TAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAAC



TTTAAGAGGAGGATATACCATGCACCATCATCATCACCATTCTTTCGACTCTTTC



ACCAACCTGTACTCTCTGTCTAAAACCCTGAAATTCGAAATGCGTCCGGTTGGTA



ACACCCAGAAAATGCTGGACAACGCGGGTGTTTTCGAAAAAGACAAACTGATCCA



GAAAAAATACGGTAAAACCAAACCGTACTTCGACCGTCTGCACCGTGAATTCATC



GAAGAAGCGCTGACCGGTGTTGAACTGATCGGTCTGGACGAAAACTTCCGTACCC



TGGTTGACTGGCAGAAAGACAAAAAAAACAACGTTGCGATGAAAGCGTACGAAAA



CTCTCTGCAGCGTCTGCGTACCGAAATCGGTAAAATCTTCAACCTGAAAGCGGAA



GACTGGGTTAAAAACAAATACCCGATCCTGGGTCTGAAAAACAAAAACACCGACA



TCCTGTTCGAAGAAGCGGTTTTCGGTATCCTGAAAGCGCGTTACGGTGAAGAAAA



AGACACCTTCATCGAAGTTGAAGAAATCGACAAAACCGGTAAATCTAAAATCAAC



CAGATCTCTATCTTCGACTCTTGGAAAGGTTTCACCGGTTACTTCAAAAAATTCT



TCGAAACCCGTAAAAACTTCTACAAAAACGACGGTACCTCTACCGCGATCGCGAC



CCGTATCATCGACCAGAACCTGAAACGTTTCATCGACAACCTGTCTATCGTTGAA



TCTGTTCGTCAGAAAGTTGACCTGGCGGAAACCGAAAAATCTTTCTCTATCTCTC



TGTCTCAGTTCTTCTCTATCGACTTCTACAACAAATGCCTGCTGCAGGACGGTAT



CGACTACTACAACAAAATCATCGGTGGTGAAACCCTGAAAAACGGTGAAAAACTG



ATCGGTCTGAACGAACTGATCAACCAGTACCGTCAGAACAACAAAGACCAGAAAA



TCCCGTTCTTCAAACTGCTGGACAAACAGATCCTGTCTGAAAAAATCCTGTTCCT



GGACGAAATCAAAAACGACACCGAACTGATCGAAGCGCTGTCTCAGTTCGCGAAA



ACCGCGGAAGAAAAAACCAAAATCGTTAAAAAACTGTTCGCGGACTTCGTTGAAA



ACAACTCTAAATACGACCTGGCGCAGATCTACATCTCTCAGGAAGCGTTCAACAC



CATCTCTAACAAATGGACCTCTGAAACCGAAACCTTCGCGAAATACCTGTTCGAA



GCGATGAAATCTGGTAAACTGGCGAAATACGAAAAAAAAGACAACTCTTACAAAT



TCCCGGACTTCATCGCGCTGTCTCAGATGAAATCTGCGCTGCTGTCTATCTCTCT



GGAAGGTCACTTCTGGAAAGAAAAATACTACAAAATCTCTAAATTCCAGGAAAAA



ACCAACTGGGAACAGTTCCTGGCGATCTTCCTGTACGAATTCAACTCTCTGTTCT



CTGACAAAATCAACACCAAAGACGGTGAAACCAAACAGGTTGGTTACTACCTGTT



CGCGAAAGACCTGCACAACCTGATCCTGTCTGAACAGATCGACATCCCGAAAGAC



TCTAAAGTTACCATCAAAGACTTCGCGGACTCTGTTCTGACCATCTACCAGATGG



CGAAATACTTCGCGGTTGAAAAAAAACGTGCGTGGCTGGCGGAATACGAACTGGA



CTCTTTCTACACCCAGCCGGACACCGGTTACCTGCAGTTCTACGACAACGCGTAC



GAAGACATCGTTCAGGTTTACAACAAACTGCGTAACTACCTGACCAAAAAACCGT



ACTCTGAAGAAAAATGGAAACTGAACTTCGAAAACTCTACCCTGGCGAACGGTTG



GGACAAAAACAAAGAATCTGACAACTCTGCGGTTATCCTGCAGAAAGGTGGTAAA



TACTACCTGGGTCTGATCACCAAAGGTCACAACAAAATCTTCGACGACCGTTTCC



AGGAAAAATTCATCGTTGGTATCGAAGGTGGTAAATACGAAAAAATCGTTTACAA



ATTCTTCCCGGACCAGGCGAAAATGTTCCCGAAAGTTTGCTTCTCTGCGAAAGGT



CTGGAATTCTTCCGTCCGTCTGAAGAAATCCTGCGTATCTACAACAACGCGGAAT



TCAAAAAAGGTGAAACCTACTCTATCGACTCTATGCAGAAACTGATCGACTTCTA



CAAAGACTGCCTGACCAAATACGAAGGTTGGGCGTGCTACACCTTCCGTCACCTG



AAACCGACCGAAGAATACCAGAACAACATCGGTGAATTCTTCCGTGACGTTGCGG



AAGACGGTTACCGTATCGACTTCCAGGGTATCTCTGACCAGTACATCCACGAAAA



AAACGAAAAAGGTGAACTGCACCTGTTCGAAATCCACAACAAAGACTGGAACCTG



GACAAAGCGCGTGACGGTAAATCTAAAACCACCCAGAAAAACCTGCACACCCTGT



ACTTCGAATCTCTGTTCTCTAACGACAACGTTGTTCAGAACTTCCCGATCAAACT



GAACGGTCAGGCGGAAATCTTCTACCGTCCGAAAACCGAAAAAGACAAACTGGAA



TCTAAAAAAGACAAAAAAGGTAACAAAGTTATCGACCACAAACGTTACTCTGAAA



ACAAAATCTTCTTCCACGTTCCGCTGACCCTGAACCGTACCAAAAACGACTCTTA



CCGTTTCAACGCGCAGATCAACAACTTCCTGGCGAACAACAAAGACATCAACATC



ATCGGTGTTGACCGTGGTGAAAAACACCTGGTTTACTACTCTGTTATCACCCAGG



CGTCTGACATCCTGGAATCTGGTTCTCTGAACGAACTGAACGGTGTTAACTACGC



GGAAAAACTGGGTAAAAAAGCGGAAAACCGTGAACAGGCGCGTCGTGACTGGCAG



GACGTTCAGGGTATCAAAGACCTGAAAAAAGGTTACATCTCTCAGGTTGTTCGTA



AACTGGCGGACCTGGCGATCAAACACAACGCGATCATCATCCTGGAAGACCTGAA



CATGCGTTTCAAACAGGTTCGTGGTGGTATCGAAAAATCTATCTACCAGCAGCTG



GAAAAAGCGCTGATCGACAAACTGTCTTTCCTGGTTGACAAAGGTGAAAAAAACC



CGGAACAGGCGGGTCACCTGCTGAAAGCGTACCAGCTGTCTGCGCCGTTCGAAAC



CTTCCAGAAAATGGGTAAACAGACCGGTATCATCTTCTACACCCAGGCGTCTTAC



ACCTCTAAATCTGACCCGGTTACCGGTTGGCGTCCGCACCTGTACCTGAAATACT



TCTCTGCGAAAAAAGCGAAAGACGACATCGCGAAATTCACCAAAATCGAATTCGT



TAACGACCGTTTCGAACTGACCTACGACATCAAAGACTTCCAGCAGGCGAAAGAA



TACCCGAACAAAACCGTTTGGAAAGTTTGCTCTAACGTTGAACGTTTCCGTTGGG



ACAAAAACCTGAACCAGAACAAAGGTGGTTACACCCACTACACCAACATCACCGA



AAACATCCAGGAACTGTTCACCAAATACGGTATCGACATCACCAAAGACCTGCTG



ACCCAGATCTCTACCATCGACGAAAAACAGAACACCTCTTTCTTCCGTGACTTCA



TCTTCTACTTCAACCTGATCTGCCAGATCCGTAACACCGACGACTCTGAAATCGC



GAAAAAAAACGGTAAAGACGACTTCATCCTGTCTCCGGTTGAACCGTTCTTCGAC



TCTCGTAAAGACAACGGTAACAAACTGCCGGAAAACGGTGACGACAACGGTGCGT



ACAACATCGCGCGTAAAGGTATCGTTATCCTGAACAAAATCTCTCAGTACTCTGA



AAAAAACGAAAACTGCGAAAAAATGAAATGGGGTGACCTGTACGTTTCTAACATC



GACTGGGACAACTTCGTTGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTAT



CTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAG



GAAGCAAAGAGGATTACA





SEQ ID NO: 71
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT



TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC



TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA



CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC



ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC



GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAG



TAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAAC



TTTAAGAGGAGGATATACCATGCACCATCATCATCACCATAACAAATTCGAAAAC



TTCACCGGTCTGTACCCGATCTCTAAAACCCTGCGTTTCGAACTGATCCCGCAGG



GTAAAACCCTGGAATACATCGAAAAATCTGAAATCCTGGAAAACGACAACTACCG



TGCGGAAAAATACGAAGAAGTTAAAGACATCATCGACGGTTACCACAAATGGTTC



ATCAACGAAACCCTGCACGACCTGCACATCAACTGGTCTGAACTGAAAGTTGCGC



TGGAAAACAACCGTATCGAAAAATCTGACGCGTCTAAAAAAGAACTGCAGCGTGT



TCAGAAAATCAAACGTGAAGAAATCTACAACGCGTTCATCGAACACGAAGCGTTC



CAGTACCTGTTCAAAGAAAACCTGCTGTCTGACCTGCTGCCGATCCAGATCGAAC



AGTCTGAAGACCTGGACGCGGAAAAAAAAAAACAGGCGGTTGAAACCTTCAACCG



TTTCTCTACCTACTTCACCGGTTTCCACGAAAACCGTAAAAACATCTACTCTAAA



GAAGGTATCTCTACCTCTGTTACCTACCGTATCGTTCACGACAACTTCCCGAAAT



TCCTGGAAAACATGAAAGTTTTCGAAATCCTGCGTAACGAATGCCCGGAAGTTAT



CTCTGACACCGCGAACGAACTGGCGCCGTTCATCGACGGTGTTCGTATCGAAGAC



ATCTTCCTGATCGACTTCTTCAACTCTACCTTCTCTCAGAACGGTATCGACTACT



ACAACCGTATCCTGGGTGGTGTTACCACCGAAACCGGTGAAAAATACCGTGGTAT



CAACGAATTCACCAACCTGTACCGTCAGCAGCACCCGGAATTCGGTAAATCTAAA



AAAGCGACCAAAATGGTTGTTCTGTTCAAACAGATCCTGTCTGACCGTGACACCC



TGTCTTTCATCCCGGAAATGTTCGGTAACGACAAACAGGTTCAGAACTCTATCCA



GCTGTTCTACAACCGTGAAATCTCTCAGTTCGAAAACGAAGGTGTTAAAACCGAC



GTTTGCACCGCGCTGGCGACCCTGACCTCTAAAATCGCGGAATTCGACACCGAAA



AAATCTACATCCAGCAGCCGGAACTGCCGAACGTTTCTCAGCGTCTGTTCGGTTC



TTGGAACGAACTGAACGCGTGCCTGTTCAAATACGCGGAACTGAAATTCGGTACC



GCGGAAAAAGTTGCGAACCGTAAAAAAATCGACAAATGGCTGAAATCTGACCTGT



TCTCTTTCACCGAACTGAACAAAGCGCTGGAATTCTCTGGTAAAGACGAACGTAT



CGAAAACTACTTCTCTGAAACCGGTATCTTCGCGCAGCTGGTTAAAACCGGTTTC



GACGAAGCGCAGTCTATCCTGGAAACCGAATACACCTCTGAAGTTCACCTGAAAG



ACCAGCAGACCGACATCGAAAAAATCAAAACCTTCCTGGACGCGCTGCAGAACCT



GATGCACCTGCTGAAATCTCTGTGCGTTTCTGAAGAAGCGGACCGTGACGCGGCG



TTCTACAACGAATTCGACATGCTGTACAACCAGCTGAAACTGGTTGTTCCGCTGT



ACAACAAAGTTCGTAACTACATCACCCAGAAACTGTTCCGTTCTGACAAAATCAA



AATCTACTTCGAAAACAAAGGTCAGTTCCTGGGTGGTTGGGTTGACTCTCAGACC



GAAAACTCTGACAACGGTACCCAGGCGGGTGGTTACATCTTCCGTAAAGAAAACG



TTATCAACGAATACGACTACTACCTGGGTATCTGCTCTGACCCGAAACTGTTCCG



TCGTACCACCATCGTTTCTGAAAACGACCGTTCTTCTTTCGAACGTCTGGACTAC



TACCAGCTGAAAACCGCGTCTGTTTACGGTAACTCTTACTGCGGTAAACACCCGT



ACACCGAAGACAAAAACGAACTGGTTAACTCTATCGACCGTTTCGTTCACCTGTC



TGGTAACAACATCCTGATCGAAAAAATCGCGAAAGACAAAGTTAAATCTAACCCG



ACCACCAACACCCCGTCTGGTTACCTGAACTTCATCCACCGTGAAGCGCCGAACA



CCTACGAATGCCTGCTGCAGGACGAAAACTTCGTTTCTCTGAACCAGCGTGTTGT



TTCTGCGCTGAAAGCGACCCTGGCGACCCTGGTTCGTGTTCCGAAAGCGCTGGTT



TACGCGAAAAAAGACTACCACCTGTTCTCTGAAATCATCAACGACATCGACGAAC



TGTCTTACGAAAAAGCGTTCTCTTACTTCCCGGTTTCTCAGACCGAATTCGAAAA



CTCTTCTAACCGTACCATCAAACCGCTGCTGCTGTTCAAAATCTCTAACAAAGAC



CTGTCTTTCGCGGAAAACTTCGAAAAAGGTAACCGTCAGAAAATCGGTAAAAAAA



ACCTGCACACCCTGTACTTCGAAGCGCTGATGAAAGGTAACCAGGACACCATCGA



CATCGGTACCGGTATGGTTTTCCACCGTGTTAAATCTCTGAACTACAACGAAAAA



ACCCTGAAATACGGTCACCACTCTACCCAGCTGAACGAAAAATTCTCTTACCCGA



TCATCAAAGACAAACGTTTCGCGTCTGACAAATTCCTGTTCCACCTGTCTACCGA



AATCAACTACAAAGAAAAACGTAAACCGCTGAACAACTCTATCATCGAATTCCTG



ACCAACAACCCGGACATCAACATCATCGGTCTGGACCGTGGTGAACGTCACCTGA



TCTACCTGACCCTGATCAACCAGAAAGGTGAAATCCTGCGTCAGAAAACCTTCAA



CATCGTTGGTAACACCAACTACCACGAAAAACTGAACCAGCGTGAAAAAGAACGT



GACAACGCGCGTAAATCTTGGGCGACCATCGGTAAAATCAAAGAACTGAAAGAAG



GTTTCCTGTCTCTGGTTATCCACGAAATCGCGAAAATCATGGTTGAAAACAACGC



GATCGTTGTTCTGGAAGACCTGAACTTCGGTTTCAAACGTGGTCGTTTCAAAGTT



GAAAAACAGATCTACCAGAAATTCGAAAAAATGCTGATCGACAAACTGAACTACC



TGGTTTTCAAAGACAAAAAAGCGAACGAAGCGGGTGGTGTTCTGAAAGGTTACCA



GCTGGCGGAAAAATTCGAATCTTTCCAGAAAATGGGTAAACAGTCTGGTTTCCTG



TTCTACGTTCCGGCGGCGTACACCTCTAAAATCGACCCGACCACCGGTTTCGTTA



ACATGCTGAACCTGAACTACACCAACATGAAAGACGCGCAGACCCTGCTGTCTGG



TATGGACAAAATCTCTTTCAACGCGGACGCGAACTACTTCGAATTCGAACTGGAC



TACGAAAAATTCAAAACCAACCAGACCGACCACACCAACAAATGGACCATCTGCA



CCGTTGGTGAAAAACGTTTCACCTACAACTCTGCGACCAAAGAAACCACCACCGT



TAACGTTACCGAAGACCTGAAAAAACTGCTGGACAAATTCGAAGTTAAATACTCT



AACGGTGACAACATCAAAGACGAAATCTGCCGTCAGACCGACGCGAAATTCTTCG



AAATCATCCTGTGGCTGCTGAAACTGACCATGCAGATGCGTAACTCTAACACCAA



AACCGAAGAAGACTTCATCCTGTCTCCGGTTAAAAACTCTAACGGTGAATTCTTC



CGTTCTAACGACGACGCGAACGGTATCTGGCCGGCGGACGCGGACGCGAACGGTG



CGTACCACATCGCGCTGAAAGGTCTGTACCTGGTTAAAGAATGCTTCAACAAAAA



CGAAAAATCTCTGAAAATCGAACACAAAAACTGGTTCAAATTCGCGCAGACCCGT



TTCAACGGTTCTCTGACCAAAAACGGTTAAGAAATCATCCTTAGCGAAAGCTAAG



GATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAA



GTTATTACTCAGGAAGCAAAGAGGATTACA





SEQ ID NO: 72
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT



TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC



TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA



CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC



ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC



GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAG



TAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAAC



TTTAAGAGGAGGATATACCATGCACCATCATCATCACCATACCCAGTTCGAAGGT



TTCACCAACCTGTACCAGGTTTCTAAAACCCTGCGTTTCGAACTGATCCCGCAGG



GTAAAACCCTGAAACACATCCAGGAACAGGGTTTCATCGAAGAAGACAAAGCGCG



TAACGACCACTACAAAGAACTGAAACCGATCATCGACCGTATCTACAAAACCTAC



GCGGACCAGTGCCTGCAGCTGGTTCAGCTGGACTGGGAAAACCTGTCTGCGGCGA



TCGACTCTTACCGTAAAGAAAAAACCGAAGAAACCCGTAACGCGCTGATCGAAGA



ACAGGCGACCTACCGTAACGCGATCCACGACTACTTCATCGGTCGTACCGACAAC



CTGACCGACGCGATCAACAAACGTCACGCGGAAATCTACAAAGGTCTGTTCAAAG



CGGAACTGTTCAACGGTAAAGTTCTGAAACAGCTGGGTACCGTTACCACCACCGA



ACACGAAAACGCGCTGCTGCGTTCTTTCGACAAATTCACCACCTACTTCTCTGGT



TTCTACGAAAACCGTAAAAACGTTTTCTCTGCGGAAGACATCTCTACCGCGATCC



CGCACCGTATCGTTCAGGACAACTTCCCGAAATTCAAAGAAAACTGCCACATCTT



CACCCGTCTGATCACCGCGGTTCCGTCTCTGCGTGAACACTTCGAAAACGTTAAA



AAAGCGATCGGTATCTTCGTTTCTACCTCTATCGAAGAAGTTTTCTCTTTCCCGT



TCTACAACCAGCTGCTGACCCAGACCCAGATCGACCTGTACAACCAGCTGCTGGG



TGGTATCTCTCGTGAAGCGGGTACCGAAAAAATCAAAGGTCTGAACGAAGTTCTG



AACCTGGCGATCCAGAAAAACGACGAAACCGCGCACATCATCGCGTCTCTGCCGC



ACCGTTTCATCCCGCTGTTCAAACAGATCCTGTCTGACCGTAACACCCTGTCTTT



CATCCTGGAAGAATTCAAATCTGACGAAGAAGTTATCCAGTCTTTCTGCAAATAC



AAAACCCTGCTGCGTAACGAAAACGTTCTGGAAACCGCGGAAGCGCTGTTCAACG



AACTGAACTCTATCGACCTGACCCACATCTTCATCTCTCACAAAAAACTGGAAAC



CATCTCTTCTGCGCTGTGCGACCACTGGGACACCCTGCGTAACGCGCTGTACGAA



CGTCGTATCTCTGAACTGACCGGTAAAATCACCAAATCTGCGAAAGAAAAAGTTC



AGCGTTCTCTGAAACACGAAGACATCAACCTGCAGGAAATCATCTCTGCGGCGGG



TAAAGAACTGTCTGAAGCGTTCAAACAGAAAACCTCTGAAATCCTGTCTCACGCG



CACGCGGCGCTGGACCAGCCGCTGCCGACCACCCTGAAAAAACAGGAAGAAAAAG



AAATCCTGAAATCTCAGCTGGACTCTCTGCTGGGTCTGTACCACCTGCTGGACTG



GTTCGCGGTTGACGAATCTAACGAAGTTGACCCGGAATTCTCTGCGCGTCTGACC



GGTATCAAACTGGAAATGGAACCGTCTCTGTCTTTCTACAACAAAGCGCGTAACT



ACGCGACCAAAAAACCGTACTCTGTTGAAAAATTCAAACTGAACTTCCAGATGCC



GACCCTGGCGTCTGGTTGGGACGTTAACAAAGAAAAAAACAACGGTGCGATCCTG



TTCGTTAAAAACGGTCTGTACTACCTGGGTATCATGCCGAAACAGAAAGGTCGTT



ACAAAGCGCTGTCTTTCGAACCGACCGAAAAAACCTCTGAAGGTTTCGACAAAAT



GTACTACGACTACTTCCCGGACGCGGCGAAAATGATCCCGAAATGCTCTACCCAG



CTGAAAGCGGTTACCGCGCACTTCCAGACCCACACCACCCCGATCCTGCTGTCTA



ACAACTTCATCGAACCGCTGGAAATCACCAAAGAAATCTACGACCTGAACAACCC



GGAAAAAGAACCGAAAAAATTCCAGACCGCGTACGCGAAAAAAACCGGTGACCAG



AAAGGTTACCGTGAAGCGCTGTGCAAATGGATCGACTTCACCCGTGACTTCCTGT



CTAAATACACCAAAACCACCTCTATCGACCTGTCTTCTCTGCGTCCGTCTTCTCA



GTACAAAGACCTGGGTGAATACTACGCGGAACTGAACCCGCTGCTGTACCACATC



TCTTTCCAGCGTATCGCGGAAAAAGAAATCATGGACGCGGTTGAAACCGGTAAAC



TGTACCTGTTCCAGATCTACAACAAAGACTTCGCGAAAGGTCACCACGGTAAACC



GAACCTGCACACCCTGTACTGGACCGGTCTGTTCTCTCCGGAAAACCTGGCGAAA



ACCTCTATCAAACTGAACGGTCAGGCGGAACTGTTCTACCGTCCGAAATCTCGTA



TGAAACGTATGGCGCACCGTCTGGGTGAAAAAATGCTGAACAAAAAACTGAAAGA



CCAGAAAACCCCGATCCCGGACACCCTGTACCAGGAACTGTACGACTACGTTAAC



CACCGTCTGTCTCACGACCTGTCTGACGAAGCGCGTGCGCTGCTGCCGAACGTTA



TCACCAAAGAAGTTTCTCACGAAATCATCAAAGACCGTCGTTTCACCTCTGACAA



ATTCTTCTTCCACGTTCCGATCACCCTGAACTACCAGGCGGCGAACTCTCCGTCT



AAATTCAACCAGCGTGTTAACGCGTACCTGAAAGAACACCCGGAAACCCCGATCA



TCGGTATCGACCGTGGTGAACGTAACCTGATCTACATCACCGTTATCGACTCTAC



CGGTAAAATCCTGGAACAGCGTTCTCTGAACACCATCCAGCAGTTCGACTACCAG



AAAAAACTGGACAACCGTGAAAAAGAACGTGTTGCGGCGCGTCAGGCGTGGTCTG



TTGTTGGTACCATCAAAGACCTGAAACAGGGTTACCTGTCTCAGGTTATCCACGA



AATCGTTGACCTGATGATCCACTACCAGGCGGTTGTTGTTCTGGAAAACCTGAAC



TTCGGTTTCAAATCTAAACGTACCGGTATCGCGGAAAAAGCGGTTTACCAGCAGT



TCGAAAAAATGCTGATCGACAAACTGAACTGCCTGGTTCTGAAAGACTACCCGGC



GGAAAAAGTTGGTGGTGTTCTGAACCCGTACCAGCTGACCGACCAGTTCACCTCT



TTCGCGAAAATGGGTACCCAGTCTGGTTTCCTGTTCTACGTTCCGGCGCCGTACA



CCTCTAAAATCGACCCGCTGACCGGTTTCGTTGACCCGTTCGTTTGGAAAACCAT



CAAAAACCACGAATCTCGTAAACACTTCCTGGAAGGTTTCGACTTCCTGCACTAC



GACGTTAAAACCGGTGACTTCATCCTGCACTTCAAAATGAACCGTAACCTGTCTT



TCCAGCGTGGTCTGCCGGGTTTCATGCCGGCGTGGGACATCGTTTTCGAAAAAAA



CGAAACCCAGTTCGACGCGAAAGGTACCCCGTTCATCGCGGGTAAACGTATCGTT



CCGGTTATCGAAAACCACCGTTTCACCGGTCGTTACCGTGACCTGTACCCGGCGA



ACGAACTGATCGCGCTGCTGGAAGAAAAAGGTATCGTTTTCCGTGACGGTTCTAA



CATCCTGCCGAAACTGCTGGAAAACGACGACTCTCACGCGATCGACACCATGGTT



GCGCTGATCCGTTCTGTTCTGCAGATGCGTAACTCTAACGCGGCGACCGGTGAAG



ACTACATCAACTCTCCGGTTCGTGACCTGAACGGTGTTTGCTTCGACTCTCGTTT



CCAGAACCCGGAATGGCCGATGGACGCGGACGCGAACGGTGCGTACCACATCGCG



CTGAAAGGTCAGCTGCTGCTGAACCACCTGAAAGAATCTAAAGACCTGAAACTGC



AGAACGGTATCTCTAACCAGGACTGGCTGGCGTACATCCAGGAACTGCGTAACTA



GAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACC



CTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA





SEQ ID NO: 73
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT



TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC



TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA



CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC



ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC



GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAG



TAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAAC



TTTAAGAGGAGGATATACCATGCACCATCATCATCACCATGCGGTTAAATCTATC



AAAGTTAAACTGCGTCTGGACGACATGCCGGAAATCCGTGCGGGTCTGTGGAAAC



TGCACAAAGAAGTTAACGCGGGTGTTCGTTACTACACCGAATGGCTGTCTCTGCT



GCGTCAGGAAAACCTGTACCGTCGTTCTCCGAACGGTGACGGTGAACAGGAATGC



GACAAAACCGCGGAAGAATGCAAAGCGGAACTGCTGGAACGTCTGCGTGCGCGTC



AGGTTGAAAACGGTCACCGTGGTCCGGCGGGTTCTGACGACGAACTGCTGCAGCT



GGCGCGTCAGCTGTACGAACTGCTGGTTCCGCAGGCGATCGGTGCGAAAGGTGAC



GCGCAGCAGATCGCGCGTAAATTCCTGTCTCCGCTGGCGGACAAAGACGCGGTTG



GTGGTCTGGGTATCGCGAAAGCGGGTAACAAACCGCGTTGGGTTCGTATGCGTGA



AGCGGGTGAACCGGGTTGGGAAGAAGAAAAAGAAAAAGCGGAAACCCGTAAATCT



GCGGACCGTACCGCGGACGTTCTGCGTGCGCTGGCGGACTTCGGTCTGAAACCGC



TGATGCGTGTTTACACCGACTCTGAAATGTCTTCTGTTGAATGGAAACCGCTGCG



TAAAGGTCAGGCGGTTCGTACCTGGGACCGTGACATGTTCCAGCAGGCGATCGAA



CGTATGATGTCTTGGGAATCTTGGAACCAGCGTGTTGGTCAGGAATACGCGAAAC



TGGTTGAACAGAAAAACCGTTTCGAACAGAAAAACTTCGTTGGTCAGGAACACCT



GGTTCACCTGGTTAACCAGCTGCAGCAGGACATGAAAGAAGCGTCTCCGGGTCTG



GAATCTAAAGAACAGACCGCGCACTACGTTACCGGTCGTGCGCTGCGTGGTTCTG



ACAAAGTTTTCGAAAAATGGGGTAAACTGGCGCCGGACGCGCCGTTCGACCTGTA



CGACGCGGAAATCAAAAACGTTCAGCGTCGTAACACCCGTCGTTTCGGTTCTCAC



GACCTGTTCGCGAAACTGGCGGAACCGGAATACCAGGCGCTGTGGCGTGAAGACG



CGTCTTTCCTGACCCGTTACGCGGTTTACAACTCTATCCTGCGTAAACTGAACCA



CGCGAAAATGTTCGCGACCTTCACCCTGCCGGACGCGACCGCGCACCCGATCTGG



ACCCGTTTCGACAAACTGGGTGGTAACCTGCACCAGTACACCTTCCTGTTCAACG



AATTCGGTGAACGTCGTCACGCGATCCGTTTCCACAAACTGCTGAAAGTTGAAAA



CGGTGTTGCGCGTGAAGTTGACGACGTTACCGTTCCGATCTCTATGTCTGAACAG



CTGGACAACCTGCTGCCGCGTGACCCGAACGAACCGATCGCGCTGTACTTCCGTG



ACTACGGTGCGGAACAGCACTTCACCGGTGAATTCGGTGGTGCGAAAATCCAGTG



CCGTCGTGACCAGCTGGCGCACATGCACCGTCGTCGTGGTGCGCGTGACGTTTAC



CTGAACGTTTCTGTTCGTGTTCAGTCTCAGTCTGAAGCGCGTGGTGAACGTCGTC



CGCCGTACGCGGCGGTTTTCCGTCTGGTTGGTGACAACCACCGTGCGTTCGTTCA



CTTCGACAAACTGTCTGACTACCTGGCGGAACACCCGGACGACGGTAAACTGGGT



TCTGAAGGTCTGCTGTCTGGTCTGCGTGTTATGTCTGTTGACCTGGGTCTGCGTA



CCTCTGCGTCTATCTCTGTTTTCCGTGTTGCGCGTAAAGACGAACTGAAACCGAA



CTCTAAAGGTCGTGTTCCGTTCTTCTTCCCGATCAAAGGTAACGACAACCTGGTT



GCGGTTCACGAACGTTCTCAGCTGCTGAAACTGCCGGGTGAAACCGAATCTAAAG



ACCTGCGTGCGATCCGTGAAGAACGTCAGCGTACCCTGCGTCAGCTGCGTACCCA



GCTGGCGTACCTGCGTCTGCTGGTTCGTTGCGGTTCTGAAGACGTTGGTCGTCGT



GAACGTTCTTGGGCGAAACTGATCGAACAGCCGGTTGACGCGGCGAACCACATGA



CCCCGGACTGGCGTGAAGCGTTCGAAAACGAACTGCAGAAACTGAAATCTCTGCA



CGGTATCTGCTCTGACAAAGAATGGATGGACGCGGTTTACGAATCTGTTCGTCGT



GTTTGGCGTCACATGGGTAAACAGGTTCGTGACTGGCGTAAAGACGTTCGTTCTG



GTGAACGTCCGAAAATCCGTGGTTACGCGAAAGACGTTGTTGGTGGTAACTCTAT



CGAACAGATCGAATACCTGGAACGTCAGTACAAATTCCTGAAATCTTGGTCTTTC



TTCGGTAAAGTTTCTGGTCAGGTTATCCGTGCGGAAAAAGGTTCTCGTTTCGCGA



TCACCCTGCGTGAACACATCGACCACGCGAAAGAAGACCGTCTGAAAAAACTGGC



GGACCGTATCATCATGGAAGCGCTGGGTTACGTTTACGCGCTGGACGAACGTGGT



AAAGGTAAATGGGTTGCGAAATACCCGCCGTGCCAGCTGATCCTGCTGGAAGAAC



TGTCTGAATACCAGTTCAACAACGACCGTCCGCCGTCTGAAAACAACCAGCTGAT



GCAGTGGTCTCACCGTGGTGTTTTCCAGGAACTGATCAACCAGGCGCAGGTTCAC



GACCTGCTGGTTGGTACCATGTACGCGGCGTTCTCTTCTCGTTTCGACGCGCGTA



CCGGTGCGCCGGGTATCCGTTGCCGTCGTGTTCCGGCGCGTTGCACCCAGGAACA



CAACCCGGAACCGTTCCCGTGGTGGCTGAACAAATTCGTTGTTGAACACACCCTG



GACGCGTGCCCGCTGCGTGCGGACGACCTGATCCCGACCGGTGAAGGTGAAATCT



TCGTTTCTCCGTTCTCTGCGGAAGAAGGTGACTTCCACCAGATCCACGCGGACCT



GAACGCGGCGCAGAACCTGCAGCAGCGTCTGTGGTCTGACTTCGACATCTCTCAG



ATCCGTCTGCGTTGCGACTGGGGTGAAGTTGACGGTGAACTGGTTCTGATCCCGC



GTCTGACCGGTAAACGTACCGCGGACTCTTACTCTAACAAAGTTTTCTACACCAA



CACCGGTGTTACCTACTACGAACGTGAACGTGGTAAAAAACGTCGTAAAGTTTTC



GCGCAGGAAAAACTGTCTGAAGAAGAAGCGGAACTGCTGGTTGAAGCGGACGAAG



CGCGTGAAAAATCTGTTGTTCTGATGCGTGACCCGTCTGGTATCATCAACCGTGG



TAACTGGACCCGTCAGAAAGAATTCTGGTCTATGGTTAACCAGCGTATCGAAGGT



TACCTGGTTAAACAGATCCGTTCTCGTGTTCCGCTGCAGGACTCTGCGTGCGAAA



ACACCGGTGACATCTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATC



TGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGG



AAGCAAAGAGGATTACA





SEQ ID NO: 74
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT



TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC



TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA



CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC



ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC



GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAG



TAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAAC



TTTAAGAGGAGGATATACCATGCACCATCATCATCACCATGCGACCCGTTCTTTC



ATCCTGAAAATCGAACCGAACGAAGAAGTTAAAAAAGGTCTGTGGAAAACCCACG



AAGTTCTGAACCACGGTATCGCGTACTACATGAACATCCTGAAACTGATCCGTCA



GGAAGCGATCTACGAACACCACGAACAGGACCCGAAAAACCCGAAAAAAGTTTCT



AAAGCGGAAATCCAGGCGGAACTGTGGGACTTCGTTCTGAAAATGCAGAAATGCA



ACTCTTTCACCCACGAAGTTGACAAAGACGTTGTTTTCAACATCCTGCGTGAACT



GTACGAAGAACTGGTTCCGTCTTCTGTTGAAAAAAAAGGTGAAGCGAACCAGCTG



TCTAACAAATTCCTGTACCCGCTGGTTGACCCGAACTCTCAGTCTGGTAAAGGTA



CCGCGTCTTCTGGTCGTAAACCGCGTTGGTACAACCTGAAAATCGCGGGTGACCC



GTCTTGGGAAGAAGAAAAAAAAAAATGGGAAGAAGACAAAAAAAAAGACCCGCTG



GCGAAAATCCTGGGTAAACTGGCGGAATACGGTCTGATCCCGCTGTTCATCCCGT



TCACCGACTCTAACGAACCGATCGTTAAAGAAATCAAATGGATGGAAAAATCTCG



TAACCAGTCTGTTCGTCGTCTGGACAAAGACATGTTCATCCAGGCGCTGGAACGT



TTCCTGTCTTGGGAATCTTGGAACCTGAAAGTTAAAGAAGAATACGAAAAAGTTG



AAAAAGAACACAAAACCCTGGAAGAACGTATCAAAGAAGACATCCAGGCGTTCAA



ATCTCTGGAACAGTACGAAAAAGAACGTCAGGAACAGCTGCTGCGTGACACCCTG



AACACCAACGAATACCGTCTGTCTAAACGTGGTCTGCGTGGTTGGCGTGAAATCA



TCCAGAAATGGCTGAAAATGGACGAAAACGAACCGTCTGAAAAATACCTGGAAGT



TTTCAAAGACTACCAGCGTAAACACCCGCGTGAAGCGGGTGACTACTCTGTTTAC



GAATTCCTGTCTAAAAAAGAAAACCACTTCATCTGGCGTAACCACCCGGAATACC



CGTACCTGTACGCGACCTTCTGCGAAATCGACAAAAAAAAAAAAGACGCGAAACA



GCAGGCGACCTTCACCCTGGCGGACCCGATCAACCACCCGCTGTGGGTTCGTTTC



GAAGAACGTTCTGGTTCTAACCTGAACAAATACCGTATCCTGACCGAACAGCTGC



ACACCGAAAAACTGAAAAAAAAACTGACCGTTCAGCTGGACCGTCTGATCTACCC



GACCGAATCTGGTGGTTGGGAAGAAAAAGGTAAAGTTGACATCGTTCTGCTGCCG



TCTCGTCAGTTCTACAACCAGATCTTCCTGGACATCGAAGAAAAAGGTAAACACG



CGTTCACCTACAAAGACGAATCTATCAAATTCCCGCTGAAAGGTACCCTGGGTGG



TGCGCGTGTTCAGTTCGACCGTGACCACCTGCGTCGTTACCCGCACAAAGTTGAA



TCTGGTAACGTTGGTCGTATCTACTTCAACATGACCGTTAACATCGAACCGACCG



AATCTCCGGTTTCTAAATCTCTGAAAATCCACCGTGACGACTTCCCGAAATTCGT



TAACTTCAAACCGAAAGAACTGACCGAATGGATCAAAGACTCTAAAGGTAAAAAA



CTGAAATCTGGTATCGAATCTCTGGAAATCGGTCTGCGTGTTATGTCTATCGACC



TGGGTCAGCGTCAGGCGGCGGCGGCGTCTATCTTCGAAGTTGTTGACCAGAAACC



GGACATCGAAGGTAAACTGTTCTTCCCGATCAAAGGTACCGAACTGTACGCGGTT



CACCGTGCGTCTTTCAACATCAAACTGCCGGGTGAAACCCTGGTTAAATCTCGTG



AAGTTCTGCGTAAAGCGCGTGAAGACAACCTGAAACTGATGAACCAGAAACTGAA



CTTCCTGCGTAACGTTCTGCACTTCCAGCAGTTCGAAGACATCACCGAACGTGAA



AAACGTGTTACCAAATGGATCTCTCGTCAGGAAAACTCTGACGTTCCGCTGGTTT



ACCAGGACGAACTGATCCAGATCCGTGAACTGATGTACAAACCGTACAAAGACTG



GGTTGCGTTCCTGAAACAGCTGCACAAACGTCTGGAAGTTGAAATCGGTAAAGAA



GTTAAACACTGGCGTAAATCTCTGTCTGACGGTCGTAAAGGTCTGTACGGTATCT



CTCTGAAAAACATCGACGAAATCGACCGTACCCGTAAATTCCTGCTGCGTTGGTC



TCTGCGTCCGACCGAACCGGGTGAAGTTCGTCGTCTGGAACCGGGTCAGCGTTTC



GCGATCGACCAGCTGAACCACCTGAACGCGCTGAAAGAAGACCGTCTGAAAAAAA



TGGCGAACACCATCATCATGCACGCGCTGGGTTACTGCTACGACGTTCGTAAAAA



AAAATGGCAGGCGAAAAACCCGGCGTGCCAGATCATCCTGTTCGAAGACCTGTCT



AACTACAACCCGTACGAAGAACGTTCTCGTTTCGAAAACTCTAAACTGATGAAAT



GGTCTCGTCGTGAAATCCCGCGTCAGGTTGCGCTGCAGGGTGAAATCTACGGTCT



GCAGGTTGGTGAAGTTGGTGCGCAGTTCTCTTCTCGTTTCCACGCGAAAACCGGT



TCTCCGGGTATCCGTTGCTCTGTTGTTACCAAAGAAAAACTGCAGGACAACCGTT



TCTTCAAAAACCTGCAGCGTGAAGGTCGTCTGACCCTGGACAAAATCGCGGTTCT



GAAAGAAGGTGACCTGTACCCGGACAAAGGTGGTGAAAAATTCATCTCTCTGTCT



AAAGACCGTAAACTGGTTACCACCCACGCGGACATCAACGCGGCGCAGAACCTGC



AGAAACGTTTCTGGACCCGTACCCACGGTTTCTACAAAGTTTACTGCAAAGCGTA



CCAGGTTGACGGTCAGACCGTTTACATCCCGGAATCTAAAGACCAGAAACAGAAA



ATCATCGAAGAATTCGGTGAAGGTTACTTCATCCTGAAAGACGGTGTTTACGAAT



GGGGTAACGCGGGTAAACTGAAAATCAAAAAAGGTTCTTCTAAACAGTCTTCTTC



TGAACTGGTTGACTCTGACATCCTGAAAGACTCTTTCGACCTGGCGTCTGAACTG



AAAGGTGAAAAACTGATGCTGTACCGTGACCCGTCTGGTAACGTTTTCCCGTCTG



ACAAATGGATGGCGGCGGGTGTTTTCTTCGGTAAACTGGAACGTATCCTGATCTC



TAAACTGACCAACCAGTACTCTATCTCTACCATCGAAGACGACTCTTCTAAACAG



TCTATGTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGT



AGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAG



AGGATTACA





SEQ ID NO: 75
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT



TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC



TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA



CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC



ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC



GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAG



TAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAAC



TTTAAGAGGAGGATATACCATGCACCATCATCATCACCATCCGACCCGTACCATC



AACCTGAAACTGGTTCTGGGTAAAAACCCGGAAAACGCGACCCTGCGTCGTGCGC



TGTTCTCTACCCACCGTCTGGTTAACCAGGCGACCAAACGTATCGAAGAATTCCT



GCTGCTGTGCCGTGGTGAAGCGTACCGTACCGTTGACAACGAAGGTAAAGAAGCG



GAAATCCCGCGTCACGCGGTTCAGGAAGAAGCGCTGGCGTTCGCGAAAGCGGCGC



AGCGTCACAACGGTTGCATCTCTACCTACGAAGACCAGGAAATCCTGGACGTTCT



GCGTCAGCTGTACGAACGTCTGGTTCCGTCTGTTAACGAAAACAACGAAGCGGGT



GACGCGCAGGCGGCGAACGCGTGGGTTTCTCCGCTGATGTCTGCGGAATCTGAAG



GTGGTCTGTCTGTTTACGACAAAGTTCTGGACCCGCCGCCGGTTTGGATGAAACT



GAAAGAAGAAAAAGCGCCGGGTTGGGAAGCGGCGTCTCAGATCTGGATCCAGTCT



GACGAAGGTCAGTCTCTGCTGAACAAACCGGGTTCTCCGCCGCGTTGGATCCGTA



AACTGCGTTCTGGTCAGCCGTGGCAGGACGACTTCGTTTCTGACCAGAAAAAAAA



ACAGGACGAACTGACCAAAGGTAACGCGCCGCTGATCAAACAGCTGAAAGAAATG



GGTCTGCTGCCGCTGGTTAACCCGTTCTTCCGTCACCTGCTGGACCCGGAAGGTA



AAGGTGTTTCTCCGTGGGACCGTCTGGCGGTTCGTGCGGCGGTTGCGCACTTCAT



CTCTTGGGAATCTTGGAACCACCGTACCCGTGCGGAATACAACTCTCTGAAACTG



CGTCGTGACGAATTCGAAGCGGCGTCTGACGAATTCAAAGACGACTTCACCCTGC



TGCGTCAGTACGAAGCGAAACGTCACTCTACCCTGAAATCTATCGCGCTGGCGGA



CGACTCTAACCCGTACCGTATCGGTGTTCGTTCTCTGCGTGCGTGGAACCGTGTT



CGTGAAGAATGGATCGACAAAGGTGCGACCGAAGAACAGCGTGTTACCATCCTGT



CTAAACTGCAGACCCAGCTGCGTGGTAAATTCGGTGACCCGGACCTGTTCAACTG



GCTGGCGCAGGACCGTCACGTTCACCTGTGGTCTCCGCGTGACTCTGTTACCCCG



CTGGTTCGTATCAACGCGGTTGACAAAGTTCTGCGTCGTCGTAAACCGTACGCGC



TGATGACCTTCGCGCACCCGCGTTTCCACCCGCGTTGGATCCTGTACGAAGCGCC



GGGTGGTTCTAACCTGCGTCAGTACGCGCTGGACTGCACCGAAAACGCGCTGCAC



ATCACCCTGCCGCTGCTGGTTGACGACGCGCACGGTACCTGGATCGAAAAAAAAA



TCCGTGTTCCGCTGGCGCCGTCTGGTCAGATCCAGGACCTGACCCTGGAAAAACT



GGAAAAAAAAAAAAACCGTCTGTACTACCGTTCTGGTTTCCAGCAGTTCGCGGGT



CTGGCGGGTGGTGCGGAAGTTCTGTTCCACCGTCCGTACATGGAACACGACGAAC



GTTCTGAAGAATCTCTGCTGGAACGTCCGGGTGCGGTTTGGTTCAAACTGACCCT



GGACGTTGCGACCCAGGCGCCGCCGAACTGGCTGGACGGTAAAGGTCGTGTTCGT



ACCCCGCCGGAAGTTCACCACTTCAAAACCGCGCTGTCTAACAAATCTAAACACA



CCCGTACCCTGCAGCCGGGTCTGCGTGTTCTGTCTGTTGACCTGGGTATGCGTAC



CTTCGCGTCTTGCTCTGTTTTCGAACTGATCGAAGGTAAACCGGAAACCGGTCGT



GCGTTCCCGGTTGCGGACGAACGTTCTATGGACTCTCCGAACAAACTGTGGGCGA



AACACGAACGTTCTTTCAAACTGACCCTGCCGGGTGAAACCCCGTCTCGTAAAGA



AGAAGAAGAACGTTCTATCGCGCGTGCGGAAATCTACGCGCTGAAACGTGACATC



CAGCGTCTGAAATCTCTGCTGCGTCTGGGTGAAGAAGACAACGACAACCGTCGTG



ACGCGCTGCTGGAACAGTTCTTCAAAGGTTGGGGTGAAGAAGACGTTGTTCCGGG



TCAGGCGTTCCCGCGTTCTCTGTTCCAGGGTCTGGGTGCGGCGCCGTTCCGTTCT



ACCCCGGAACTGTGGCGTCAGCACTGCCAGACCTACTACGACAAAGCGGAAGCGT



GCCTGGCGAAACACATCTCTGACTGGCGTAAACGTACCCGTCCGCGTCCGACCTC



TCGTGAAATGTGGTACAAAACCCGTTCTTACCACGGTGGTAAATCTATCTGGATG



CTGGAATACCTGGACGCGGTTCGTAAACTGCTGCTGTCTTGGTCTCTGCGTGGTC



GTACCTACGGTGCGATCAACCGTCAGGACACCGCGCGTTTCGGTTCTCTGGCGTC



TCGTCTGCTGCACCACATCAACTCTCTGAAAGAAGACCGTATCAAAACCGGTGCG



GACTCTATCGTTCAGGCGGCGCGTGGTTACATCCCGCTGCCGCACGGTAAAGGTT



GGGAACAGCGTTACGAACCGTGCCAGCTGATCCTGTTCGAAGACCTGGCGCGTTA



CCGTTTCCGTGTTGACCGTCCGCGTCGTGAAAACTCTCAGCTGATGCAGTGGAAC



CACCGTGCGATCGTTGCGGAAACCACCATGCAGGCGGAACTGTACGGTCAGATCG



TTGAAAACACCGCGGCGGGTTTCTCTTCTCGTTTCCACGCGGCGACCGGTGCGCC



GGGTGTTCGTTGCCGTTTCCTGCTGGAACGTGACTTCGACAACGACCTGCCGAAA



CCGTACCTGCTGCGTGAACTGTCTTGGATGCTGGGTAACACCAAAGTTGAATCTG



AAGAAGAAAAACTGCGTCTGCTGTCTGAAAAAATCCGTCCGGGTTCTCTGGTTCC



GTGGGACGGTGGTGAACAGTTCGCGACCCTGCACCCGAAACGTCAGACCCTGTGC



GTTATCCACGCGGACATGAACGCGGCGCAGAACCTGCAGCGTCGTTTCTTCGGTC



GTTGCGGTGAAGCGTTCCGTCTGGTTTGCCAGCCGCACGGTGACGACGTTCTGCG



TCTGGCGTCTACCCCGGGTGCGCGTCTGCTGGGTGCGCTGCAGCAGCTGGAAAAC



GGTCAGGGTGCGTTCGAACTGGTTCGTGACATGGGTTCTACCTCTCAGATGAACC



GTTTCGTTATGAAATCTCTGGGTAAAAAAAAAATCAAACCGCTGCAGGACAACAA



CGGTGACGACGAACTGGAAGACGTTCTGTCTGTTCTGCCGGAAGAAGACGACACC



GGTCGTATCACCGTTTTCCGTGACTCTTCTGGTATCTTCTTCCCGTGCAACGTTT



GGATCCCGGCGAAACAGTTCTGGCCGGCGGTTCGTGCGATGATCTGGAAAGTTAT



GGCGTCTCACTCTCTGGGTTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTT



TTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTAC



TCAGGAAGCAAAGAGGATTACA





SEQ ID NO: 76
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT



TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC



TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA



CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC



ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC



GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAG



TAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAAC



TTTAAGAGGAGGATATACCATGCACCATCATCATCACCATACCAAACTGCGTCAC



CGTCAGAAAAAACTGACCCACGACTGGGCGGGTTCTAAAAAACGTGAAGTTCTGG



GTTCTAACGGTAAACTGCAGAACCCGCTGCTGATGCCGGTTAAAAAAGGTCAGGT



TACCGAATTCCGTAAAGCGTTCTCTGCGTACGCGCGTGCGACCAAAGGTGAAATG



ACCGACGGTCGTAAAAACATGTTCACCCACTCTTTCGAACCGTTCAAAACCAAAC



CGTCTCTGCACCAGTGCGAACTGGCGGACAAAGCGTACCAGTCTCTGCACTCTTA



CCTGCCGGGTTCTCTGGCGCACTTCCTGCTGTCTGCGCACGCGCTGGGTTTCCGT



ATCTTCTCTAAATCTGGTGAAGCGACCGCGTTCCAGGCGTCTTCTAAAATCGAAG



CGTACGAATCTAAACTGGCGTCTGAACTGGCGTGCGTTGACCTGTCTATCCAGAA



CCTGACCATCTCTACCCTGTTCAACGCGCTGACCACCTCTGTTCGTGGTAAAGGT



GAAGAAACCTCTGCGGACCCGCTGATCGCGCGTTTCTACACCCTGCTGACCGGTA



AACCGCTGTCTCGTGACACCCAGGGTCCGGAACGTGACCTGGCGGAAGTTATCTC



TCGTAAAATCGCGTCTTCTTTCGGTACCTGGAAAGAAATGACCGCGAACCCGCTG



CAGTCTCTGCAGTTCTTCGAAGAAGAACTGCACGCGCTGGACGCGAACGTTTCTC



TGTCTCCGGCGTTCGACGTTCTGATCAAAATGAACGACCTGCAGGGTGACCTGAA



AAACCGTACCATCGTTTTCGACCCGGACGCGCCGGTTTTCGAATACAACGCGGAA



GACCCGGCGGACATCATCATCAAACTGACCGCGCGTTACGCGAAAGAAGCGGTTA



TCAAAAACCAGAACGTTGGTAACTACGTTAAAAACGCGATCACCACCACCAACGC



GAACGGTCTGGGTTGGCTGCTGAACAAAGGTCTGTCTCTGCTGCCGGTTTCTACC



GACGACGAACTGCTGGAATTCATCGGTGTTGAACGTTCTCACCCGTCTTGCCACG



CGCTGATCGAACTGATCGCGCAGCTGGAAGCGCCGGAACTGTTCGAAAAAAACGT



TTTCTCTGACACCCGTTCTGAAGTTCAGGGTATGATCGACTCTGCGGTTTCTAAC



CACATCGCGCGTCTGTCTTCTTCTCGTAACTCTCTGTCTATGGACTCTGAAGAAC



TGGAACGTCTGATCAAATCTTTCCAGATCCACACCCCGCACTGCTCTCTGTTCAT



CGGTGCGCAGTCTCTGTCTCAGCAGCTGGAATCTCTGCCGGAAGCGCTGCAGTCT



GGTGTTAACTCTGCGGACATCCTGCTGGGTTCTACCCAGTACATGCTGACCAACT



CTCTGGTTGAAGAATCTATCGCGACCTACCAGCGTACCCTGAACCGTATCAACTA



CCTGTCTGGTGTTGCGGGTCAGATCAACGGTGCGATCAAACGTAAAGCGATCGAC



GGTGAAAAAATCCACCTGCCGGCGGCGTGGTCTGAACTGATCTCTCTGCCGTTCA



TCGGTCAGCCGGTTATCGACGTTGAATCTGACCTGGCGCACCTGAAAAACCAGTA



CCAGACCCTGTCTAACGAATTCGACACCCTGATCTCTGCGCTGCAGAAAAACTTC



GACCTGAACTTCAACAAAGCGCTGCTGAACCGTACCCAGCACTTCGAAGCGATGT



GCCGTTCTACCAAAAAAAACGCGCTGTCTAAACCGGAAATCGTTTCTTACCGTGA



CCTGCTGGCGCGTCTGACCTCTTGCCTGTACCGTGGTTCTCTGGTTCTGCGTCGT



GCGGGTATCGAAGTTCTGAAAAAACACAAAATCTTCGAATCTAACTCTGAACTGC



GTGAACACGTTCACGAACGTAAACACTTCGTTTTCGTTTCTCCGCTGGACCGTAA



AGCGAAAAAACTGCTGCGTCTGACCGACTCTCGTCCGGACCTGCTGCACGTTATC



GACGAAATCCTGCAGCACGACAACCTGGAAAACAAAGACCGTGAATCTCTGTGGC



TGGTTCGTTCTGGTTACCTGCTGGCGGGTCTGCCGGACCAGCTGTCTTCTTCTTT



CATCAACCTGCCGATCATCACCCAGAAAGGTGACCGTCGTCTGATCGACCTGATC



CAGTACGACCAGATCAACCGTGACGCGTTCGTTATGCTGGTTACCTCTGCGTTCA



AATCTAACCTGTCTGGTCTGCAGTACCGTGCGAACAAACAGTCTTTCGTTGTTAC



CCGTACCCTGTCTCCGTACCTGGGTTCTAAACTGGTTTACGTTCCGAAAGACAAA



GACTGGCTGGTTCCGTCTCAGATGTTCGAAGGTCGTTTCGCGGACATCCTGCAGT



CTGACTACATGGTTTGGAAAGACGCGGGTCGTCTGTGCGTTATCGACACCGCGAA



ACACCTGTCTAACATCAAAAAATCTGTTTTCTCTTCTGAAGAAGTTCTGGCGTTC



CTGCGTGAACTGCCGCACCGTACCTTCATCCAGACCGAAGTTCGTGGTCTGGGTG



TTAACGTTGACGGTATCGCGTTCAACAACGGTGACATCCCGTCTCTGAAAACCTT



CTCTAACTGCGTTCAGGTTAAAGTTTCTCGTACCAACACCTCTCTGGTTCAGACC



CTGAACCGTTGGTTCGAAGGTGGTAAAGTTTCTCCGCCGTCTATCCAGTTCGAAC



GTGCGTACTACAAAAAAGACGACCAGATCCACGAAGACGCGGCGAAACGTAAAAT



CCGTTTCCAGATGCCGGCGACCGAACTGGTTCACGCGTCTGACGACGCGGGTTGG



ACCCCGTCTTACCTGCTGGGTATCGACCCGGGTGAATACGGTATGGGTCTGTCTC



TGGTTTCTATCAACAACGGTGAAGTTCTGGACTCTGGTTTCATCCACATCAACTC



TCTGATCAACTTCGCGTCTAAAAAATCTAACCACCAGACCAAAGTTGTTCCGCGT



CAGCAGTACAAATCTCCGTACGCGAACTACCTGGAACAGTCTAAAGACTCTGCGG



CGGGTGACATCGCGCACATCCTGGACCGTCTGATCTACAAACTGAACGCGCTGCC



GGTTTTCGAAGCGCTGTCTGGTAACTCTCAGTCTGCGGCGGACCAGGTTTGGACC



AAAGTTCTGTCTTTCTACACCTGGGGTGACAACGACGCGCAGAACTCTATCCGTA



AACAGCACTGGTTCGGTGCGTCTCACTGGGACATCAAAGGTATGCTGCGTCAGCC



GCCGACCGAAAAAAAACCGAAACCGTACATCGCGTTCCCGGGTTCTCAGGTTTCT



TCTTACGGTAACTCTCAGCGTTGCTCTTGCTGCGGTCGTAACCCGATCGAACAGC



TGCGTGAAATGGCGAAAGACACCTCTATCAAAGAACTGAAAATCCGTAACTCTGA



AATCCAGCTGTTCGACGGTACCATCAAACTGTTCAACCCGGACCCGTCTACCGTT



ATCGAACGTCGTCGTCACAACCTGGGTCCGTCTCGTATCCCGGTTGCGGACCGTA



CCTTCAAAAACATCTCTCCGTCTTCTCTGGAATTCAAAGAACTGATCACCATCGT



TTCTCGTTCTATCCGTCACTCTCCGGAATTCATCGCGAAAAAACGTGGTATCGGT



TCTGAATACTTCTGCGCGTACTCTGACTGCAACTCTTCTCTGAACTCTGAAGCGA



ACGCGGCGGCGAACGTTGCGCAGAAATTCCAGAAACAGCTGTTCTTCGAACTGTA



AGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGAC



CCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTAC



A





SEQ ID NO: 77
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT



TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC



TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA



CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC



ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC



GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAG



TAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAAC



TTTAAGAGGAGGATATACCATGCACCATCATCATCACCATAAACGTATCCTGAAC



TCTCTGAAAGTTGCGGCGCTGCGTCTGCTGTTCCGTGGTAAAGGTTCTGAACTGG



TTAAAACCGTTAAATACCCGCTGGTTTCTCCGGTTCAGGGTGCGGTTGAAGAACT



GGCGGAAGCGATCCGTCACGACAACCTGCACCTGTTCGGTCAGAAAGAAATCGTT



GACCTGATGGAAAAAGACGAAGGTACCCAGGTTTACTCTGTTGTTGACTTCTGGC



TGGACACCCTGCGTCTGGGTATGTTCTTCTCTCCGTCTGCGAACGCGCTGAAAAT



CACCCTGGGTAAATTCAACTCTGACCAGGTTTCTCCGTTCCGTAAAGTTCTGGAA



CAGTCTCCGTTCTTCCTGGCGGGTCGTCTGAAAGTTGAACCGGCGGAACGTATCC



TGTCTGTTGAAATCCGTAAAATCGGTAAACGTGAAAACCGTGTTGAAAACTACGC



GGCGGACGTTGAAACCTGCTTCATCGGTCAGCTGTCTTCTGACGAAAAACAGTCT



ATCCAGAAACTGGCGAACGACATCTGGGACTCTAAAGACCACGAAGAACAGCGTA



TGCTGAAAGCGGACTTCTTCGCGATCCCGCTGATCAAAGACCCGAAAGCGGTTAC



CGAAGAAGACCCGGAAAACGAAACCGCGGGTAAACAGAAACCGCTGGAACTGTGC



GTTTGCCTGGTTCCGGAACTGTACACCCGTGGTTTCGGTTCTATCGCGGACTTCC



TGGTTCAGCGTCTGACCCTGCTGCGTGACAAAATGTCTACCGACACCGCGGAAGA



CTGCCTGGAATACGTTGGTATCGAAGAAGAAAAAGGTAACGGTATGAACTCTCTG



CTGGGTACCTTCCTGAAAAACCTGCAGGGTGACGGTTTCGAACAGATCTTCCAGT



TCATGCTGGGTTCTTACGTTGGTTGGCAGGGTAAAGAAGACGTTCTGCGTGAACG



TCTGGACCTGCTGGCGGAAAAAGTTAAACGTCTGCCGAAACCGAAATTCGCGGGT



GAATGGTCTGGTCACCGTATGTTCCTGCACGGTCAGCTGAAATCTTGGTCTTCTA



ACTTCTTCCGTCTGTTCAACGAAACCCGTGAACTGCTGGAATCTATCAAATCTGA



CATCCAGCACGCGACCATGCTGATCTCTTACGTTGAAGAAAAAGGTGGTTACCAC



CCGCAGCTGCTGTCTCAGTACCGTAAACTGATGGAACAGCTGCCGGCGCTGCGTA



CCAAAGTTCTGGACCCGGAAATCGAAATGACCCACATGTCTGAAGCGGTTCGTTC



TTACATCATGATCCACAAATCTGTTGCGGGTTTCCTGCCGGACCTGCTGGAATCT



CTGGACCGTGACAAAGACCGTGAATTCCTGCTGTCTATCTTCCCGCGTATCCCGA



AAATCGACAAAAAAACCAAAGAAATCGTTGCGTGGGAACTGCCGGGTGAACCGGA



AGAAGGTTACCTGTTCACCGCGAACAACCTGTTCCGTAACTTCCTGGAAAACCCG



AAACACGTTCCGCGTTTCATGGCGGAACGTATCCCGGAAGACTGGACCCGTCTGC



GTTCTGCGCCGGTTTGGTTCGACGGTATGGTTAAACAGTGGCAGAAAGTTGTTAA



CCAGCTGGTTGAATCTCCGGGTGCGCTGTACCAGTTCAACGAATCTTTCCTGCGT



CAGCGTCTGCAGGCGATGCTGACCGTTTACAAACGTGACCTGCAGACCGAAAAAT



TCCTGAAACTGCTGGCGGACGTTTGCCGTCCGCTGGTTGACTTCTTCGGTCTGGG



TGGTAACGACATCATCTTCAAATCTTGCCAGGACCCGCGTAAACAGTGGCAGACC



GTTATCCCGCTGTCTGTTCCGGCGGACGTTTACACCGCGTGCGAAGGTCTGGCGA



TCCGTCTGCGTGAAACCCTGGGTTTCGAATGGAAAAACCTGAAAGGTCACGAACG



TGAAGACTTCCTGCGTCTGCACCAGCTGCTGGGTAACCTGCTGTTCTGGATCCGT



GACGCGAAACTGGTTGTTAAACTGGAAGACTGGATGAACAACCCGTGCGTTCAGG



AATACGTTGAAGCGCGTAAAGCGATCGACCTGCCGCTGGAAATCTTCGGTTTCGA



AGTTCCGATCTTCCTGAACGGTTACCTGTTCTCTGAACTGCGTCAGCTGGAACTG



CTGCTGCGTCGTAAATCTGTTATGACCTCTTACTCTGTTAAAACCACCGGTTCTC



CGAACCGTCTGTTCCAGCTGGTTTACCTGCCGCTGAACCCGTCTGACCCGGAAAA



AAAAAACTCTAACAACTTCCAGGAACGTCTGGACACCCCGACCGGTCTGTCTCGT



CGTTTCCTGGACCTGACCCTGGACGCGTTCGCGGGTAAACTGCTGACCGACCCGG



TTACCCAGGAACTGAAAACCATGGCGGGTTTCTACGACCACCTGTTCGGTTTCAA



ACTGCCGTGCAAACTGGCGGCGATGTCTAACCACCCGGGTTCTTCTTCTAAAATG



GTTGTTCTGGCGAAACCGAAAAAAGGTGTTGCGTCTAACATCGGTTTCGAACCGA



TCCCGGACCCGGCGCACCCGGTTTTCCGTGTTCGTTCTTCTTGGCCGGAACTGAA



ATACCTGGAAGGTCTGCTGTACCTGCCGGAAGACACCCCGCTGACCATCGAACTG



GCGGAAACCTCTGTTTCTTGCCAGTCTGTTTCTTCTGTTGCGTTCGACCTGAAAA



ACCTGACCACCATCCTGGGTCGTGTTGGTGAATTCCGTGTTACCGCGGACCAGCC



GTTCAAACTGACCCCGATCATCCCGGAAAAAGAAGAATCTTTCATCGGTAAAACC



TACCTGGGTCTGGACGCGGGTGAACGTTCTGGTGTTGGTTTCGCGATCGTTACCG



TTGACGGTGACGGTTACGAAGTTCAGCGTCTGGGTGTTCACGAAGACACCCAGCT



GATGGCGCTGCAGCAGGTTGCGTCTAAATCTCTGAAAGAACCGGTTTTCCAGCCG



CTGCGTAAAGGTACCTTCCGTCAGCAGGAACGTATCCGTAAATCTCTGCGTGGTT



GCTACTGGAACTTCTACCACGCGCTGATGATCAAATACCGTGCGAAAGTTGTTCA



CGAAGAATCTGTTGGTTCTTCTGGTCTGGTTGGTCAGTGGCTGCGTGCGTTCCAG



AAAGACCTGAAAAAAGCGGACGTTCTGCCGAAAAAAGGTGGTAAAAACGGTGTTG



ACAAAAAAAAACGTGAATCTTCTGCGCAGGACACCCTGTGGGGTGGTGCGTTCTC



TAAAAAAGAAGAACAGCAGATCGCGTTCGAAGTTCAGGCGGCGGGTTCTTCTCAG



TTCTGCCTGAAATGCGGTTGGTGGTTCCAGCTGGGTATGCGTGAAGTTAACCGTG



TTCAGGAATCTGGTGTTGTTCTGGACTGGAACCGTTCTATCGTTACCTTCCTGAT



CGAATCTTCTGGTGAAAAAGTTTACGGTTTCTCTCCGCAGCAGCTGGAAAAAGGT



TTCCGTCCGGACATCGAAACCTTCAAAAAAATGGTTCGTGACTTCATGCGTCCGC



CGATGTTCGACCGTAAAGGTCGTCCGGCGGCGGCGTACGAACGTTTCGTTCTGGG



TCGTCGTCACCGTCGTTACCGTTTCGACAAAGTTTTCGAAGAACGTTTCGGTCGT



TCTGCGCTGTTCATCTGCCCGCGTGTTGGTTGCGGTAACTTCGACCACTCTTCTG



AACAGTCTGCGGTTGTTCTGGCGCTGATCGGTTACATCGCGGACAAAGAAGGTAT



GTCTGGTAAAAAACTGGTTTACGTTCGTCTGGCGGAACTGATGGCGGAATGGAAA



CTGAAAAAACTGGAACGTTCTCGTGTTGAAGAACAGTCTTCTGCGCAGTAAGAAA



TCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCA



GGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA





SEQ ID NO: 78
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT



TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC



TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA



CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC



ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC



GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAG



TAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAAC



TTTAAGAGGAGGATATACCATGCACCATCATCATCACCATGCGGAATCTAAACAG



ATGCAGTGCCGTAAATGCGGTGCGTCTATGAAATACGAAGTTATCGGTCTGGGTA



AAAAATCTTGCCGTTACATGTGCCCGGACTGCGGTAACCACACCTCTGCGCGTAA



AATCCAGAACAAAAAAAAACGTGACAAAAAATACGGTTCTGCGTCTAAAGCGCAG



TCTCAGCGTATCGCGGTTGCGGGTGCGCTGTACCCGGACAAAAAAGTTCAGACCA



TCAAAACCTACAAATACCCGGCGGACCTGAACGGTGAAGTTCACGACTCTGGTGT



TGCGGAAAAAATCGCGCAGGCGATCCAGGAAGACGAAATCGGTCTGCTGGGTCCG



TCTTCTGAATACGCGTGCTGGATCGCGTCTCAGAAACAGTCTGAACCGTACTCTG



TTGTTGACTTCTGGTTCGACGCGGTTTGCGCGGGTGGTGTTTTCGCGTACTCTGG



TGCGCGTCTGCTGTCTACCGTTCTGCAGCTGTCTGGTGAAGAATCTGTTCTGCGT



GCGGCGCTGGCGTCTTCTCCGTTCGTTGACGACATCAACCTGGCGCAGGCGGAAA



AATTCCTGGCGGTTTCTCGTCGTACCGGTCAGGACAAACTGGGTAAACGTATCGG



TGAATGCTTCGCGGAAGGTCGTCTGGAAGCGCTGGGTATCAAAGACCGTATGCGT



GAATTCGTTCAGGCGATCGACGTTGCGCAGACCGCGGGTCAGCGTTTCGCGGCGA



AACTGAAAATCTTCGGTATCTCTCAGATGCCGGAAGCGAAACAGTGGAACAACGA



CTCTGGTCTGACCGTTTGCATCCTGCCGGACTACTACGTTCCGGAAGAAAACCGT



GCGGACCAGCTGGTTGTTCTGCTGCGTCGTCTGCGTGAAATCGCGTACTGCATGG



GTATCGAAGACGAAGCGGGTTTCGAACACCTGGGTATCGACCCGGGTGCGCTGTC



TAACTTCTCTAACGGTAACCCGAAACGTGGTTTCCTGGGTCGTCTGCTGAACAAC



GACATCATCGCGCTGGCGAACAACATGTCTGCGATGACCCCGTACTGGGAAGGTC



GTAAAGGTGAACTGATCGAACGTCTGGCGTGGCTGAAACACCGTGCGGAAGGTCT



GTACCTGAAAGAACCGCACTTCGGTAACTCTTGGGCGGACCACCGTTCTCGTATC



TTCTCTCGTATCGCGGGTTGGCTGTCTGGTTGCGCGGGTAAACTGAAAATCGCGA



AAGACCAGATCTCTGGTGTTCGTACCGACCTGTTCCTGCTGAAACGTCTGCTGGA



CGCGGTTCCGCAGTCTGCGCCGTCTCCGGACTTCATCGCGTCTATCTCTGCGCTG



GACCGTTTCCTGGAAGCGGCGGAATCTTCTCAGGACCCGGCGGAACAGGTTCGTG



CGCTGTACGCGTTCCACCTGAACGCGCCGGCGGTTCGTTCTATCGCGAACAAAGC



GGTTCAGCGTTCTGACTCTCAGGAATGGCTGATCAAAGAACTGGACGCGGTTGAC



CACCTGGAATTCAACAAAGCGTTCCCGTTCTTCTCTGACACCGGTAAAAAAAAAA



AAAAAGGTGCGAACTCTAACGGTGCGCCGTCTGAAGAAGAATACACCGAAACCGA



ATCTATCCAGCAGCCGGAAGACGCGGAACAGGAAGTTAACGGTCAGGAAGGTAAC



GGTGCGTCTAAAAACCAGAAAAAATTCCAGCGTATCCCGCGTTTCTTCGGTGAAG



GTTCTCGTTCTGAATACCGTATCCTGACCGAAGCGCCGCAGTACTTCGACATGTT



CTGCAACAACATGCGTGCGATCTTCATGCAGCTGGAATCTCAGCCGCGTAAAGCG



CCGCGTGACTTCAAATGCTTCCTGCAGAACCGTCTGCAGAAACTGTACAAACAGA



CCTTCCTGAACGCGCGTTCTAACAAATGCCGTGCGCTGCTGGAATCTGTTCTGAT



CTCTTGGGGTGAATTCTACACCTACGGTGCGAACGAAAAAAAATTCCGTCTGCGT



CACGAAGCGTCTGAACGTTCTTCTGACCCGGACTACGTTGTTCAGCAGGCGCTGG



AAATCGCGCGTCGTCTGTTCCTGTTCGGTTTCGAATGGCGTGACTGCTCTGCGGG



TGAACGTGTTGACCTGGTTGAAATCCACAAAAAAGCGATCTCTTTCCTGCTGGCG



ATCACCCAGGCGGAAGTTTCTGTTGGTTCTTACAACTGGCTGGGTAACTCTACCG



TTTCTCGTTACCTGTCTGTTGCGGGTACCGACACCCTGTACGGTACCCAGCTGGA



AGAATTCCTGAACGCGACCGTTCTGTCTCAGATGCGTGGTCTGGCGATCCGTCTG



TCTTCTCAGGAACTGAAAGACGGTTTCGACGTTCAGCTGGAATCTTCTTGCCAGG



ACAACCTGCAGCACCTGCTGGTTTACCGTGCGTCTCGTGACCTGGCGGCGTGCAA



ACGTGCGACCTGCCCGGCGGAACTGGACCCGAAAATCCTGGTTCTGCCGGTTGGT



GCGTTCATCGCGTCTGTTATGAAAATGATCGAACGTGGTGACGAACCGCTGGCGG



GTGCGTACCTGCGTCACCGTCCGCACTCTTTCGGTTGGCAGATCCGTGTTCGTGG



TGTTGCGGAAGTTGGTATGGACCAGGGTACCGCGCTGGCGTTCCAGAAACCGACC



GAATCTGAACCGTTCAAAATCAAACCGTTCTCTGCGCAGTACGGTCCGGTTCTGT



GGCTGAACTCTTCTTCTTACTCTCAGTCTCAGTACCTGGACGGTTTCCTGTCTCA



GCCGAAAAACTGGTCTATGCGTGTTCTGCCGCAGGCGGGTTCTGTTCGTGTTGAA



CAGCGTGTTGCGCTGATCTGGAACCTGCAGGCGGGTAAAATGCGTCTGGAACGTT



CTGGTGCGCGTGCGTTCTTCATGCCGGTTCCGTTCTCTTTCCGTCCGTCTGGTTC



TGGTGACGAAGCGGTTCTGGCGCCGAACCGTTACCTGGGTCTGTTCCCGCACTCT



GGTGGTATCGAATACGCGGTTGTTGACGTTCTGGACTCTGCGGGTTTCAAAATCC



TGGAACGTGGTACCATCGCGGTTAACGGTTTCTCTCAGAAACGTGGTGAACGTCA



GGAAGAAGCGCACCGTGAAAAACAGCGTCGTGGTATCTCTGACATCGGTCGTAAA



AAACCGGTTCAGGCGGAAGTTGACGCGGCGAACGAACTGCACCGTAAATACACCG



ACGTTGCGACCCGTCTGGGTTGCCGTATCGTTGTTCAGTGGGCGCCGCAGCCGAA



ACCGGGTACCGCGCCGACCGCGCAGACCGTTTACGCGCGTGCGGTTCGTACCGAA



GCGCCGCGTTCTGGTAACCAGGAAGACCACGCGCGTATGAAATCTTCTTGGGGTT



ACACCTGGGGTACCTACTGGGAAAAACGTAAACCGGAAGACATCCTGGGTATCTC



TACCCAGGTTTACTGGACCGGTGGTATCGGTGAATCTTGCCCGGCGGTTGCGGTT



GCGCTGCTGGGTCACATCCGTGCGACCTCTACCCAGACCGAATGGGAAAAAGAAG



AAGTTGTTTTCGGTCGTCTGAAAAAATTCTTCCCGTCTTAAGAAATCATCCTTAG



CGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATAT



TCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA





SEQ ID NO: 79
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT



TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC



TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA



CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC



ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC



GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAG



TAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAAC



TTTAAGAGGAGGATATACCATGCACCATCATCATCACCATGAAAAACGTATCAAC



AAAATCCGTAAAAAACTGTCTGCGGACAACGCGACCAAACCGGTTTCTCGTTCTG



GTCCGATGAAAACCCTGCTGGTTCGTGTTATGACCGACGACCTGAAAAAACGTCT



GGAAAAACGTCGTAAAAAACCGGAAGTTATGCCGCAGGTTATCTCTAACAACGCG



GCGAACAACCTGCGTATGCTGCTGGACGACTACACCAAAATGAAAGAAGCGATCC



TGCAGGTTTACTGGCAGGAATTCAAAGACGACCACGTTGGTCTGATGTGCAAATT



CGCGCAGCCGGCGTCTAAAAAAATCGACCAGAACAAACTGAAACCGGAAATGGAC



GAAAAAGGTAACCTGACCACCGCGGGTTTCGCGTGCTCTCAGTGCGGTCAGCCGC



TGTTCGTTTACAAACTGGAACAGGTTTCTGAAAAAGGTAAAGCGTACACCAACTA



CTTCGGTCGTTGCAACGTTGCGGAACACGAAAAACTGATCCTGCTGGCGCAGCTG



AAACCGGAAAAAGACTCTGACGAAGCGGTTACCTACTCTCTGGGTAAATTCGGTC



AGCGTGCGCTGGACTTCTACTCTATCCACGTTACCAAAGAATCTACCCACCCGGT



TAAACCGCTGGCGCAGATCGCGGGTAACCGTTACGCGTCTGGTCCGGTTGGTAAA



GCGCTGTCTGACGCGTGCATGGGTACCATCGCGTCTTTCCTGTCTAAATACCAGG



ACATCATCATCGAACACCAGAAAGTTGTTAAAGGTAACCAGAAACGTCTGGAATC



TCTGCGTGAACTGGCGGGTAAAGAAAACCTGGAATACCCGTCTGTTACCCTGCCG



CCGCAGCCGCACACCAAAGAAGGTGTTGACGCGTACAACGAAGTTATCGCGCGTG



TTCGTATGTGGGTTAACCTGAACCTGTGGCAGAAACTGAAACTGTCTCGTGACGA



CGCGAAACCGCTGCTGCGTCTGAAAGGTTTCCCGTCTTTCCCGGTTGTTGAACGT



CGTGAAAACGAAGTTGACTGGTGGAACACCATCAACGAAGTTAAAAAACTGATCG



ACGCGAAACGTGACATGGGTCGTGTTTTCTGGTCTGGTGTTACCGCGGAAAAACG



TAACACCATCCTGGAAGGTTACAACTACCTGCCGAACGAAAACGACCACAAAAAA



CGTGAAGGTTCTCTGGAAAACCCGAAAAAACCGGCGAAACGTCAGTTCGGTGACC



TGCTGCTGTACCTGGAAAAAAAATACGCGGGTGACTGGGGTAAAGTTTTCGACGA



AGCGTGGGAACGTATCGACAAAAAAATCGCGGGTCTGACCTCTCACATCGAACGT



GAAGAAGCGCGTAACGCGGAAGACGCGCAGTCTAAAGCGGTTCTGACCGACTGGC



TGCGTGCGAAAGCGTCTTTCGTTCTGGAACGTCTGAAAGAAATGGACGAAAAAGA



ATTCTACGCGTGCGAAATCCAGCTGCAGAAATGGTACGGTGACCTGCGTGGTAAC



CCGTTCGCGGTTGAAGCGGAAAACCGTGTTGTTGACATCTCTGGTTTCTCTATCG



GTTCTGACGGTCACTCTATCCAGTACCGTAACCTGCTGGCGTGGAAATACCTGGA



AAACGGTAAACGTGAATTCTACCTGCTGATGAACTACGGTAAAAAAGGTCGTATC



CGTTTCACCGACGGTACCGACATCAAAAAATCTGGTAAATGGCAGGGTCTGCTGT



ACGGTGGTGGTAAAGCGAAAGTTATCGACCTGACCTTCGACCCGGACGACGAACA



GCTGATCATCCTGCCGCTGGCGTTCGGTACCCGTCAGGGTCGTGAATTCATCTGG



AACGACCTGCTGTCTCTGGAAACCGGTCTGATCAAACTGGCGAACGGTCGTGTTA



TCGAAAAAACCATCTACAACAAAAAAATCGGTCGTGACGAACCGGCGCTGTTCGT



TGCGCTGACCTTCGAACGTCGTGAAGTTGTTGACCCGTCTAACATCAAACCGGTT



AACCTGATCGGTGTTGACCGTGGTGAAAACATCCCGGCGGTTATCGCGCTGACCG



ACCCGGAAGGTTGCCCGCTGCCGGAATTCAAAGACTCTTCTGGTGGTCCGACCGA



CATCCTGCGTATCGGTGAAGGTTACAAAGAAAAACAGCGTGCGATCCAGGCGGCG



AAAGAAGTTGAACAGCGTCGTGCGGGTGGTTACTCTCGTAAATTCGCGTCTAAAT



CTCGTAACCTGGCGGACGACATGGTTCGTAACTCTGCGCGTGACCTGTTCTACCA



CGCGGTTACCCACGACGCGGTTCTGGTTTTCGAAAACCTGTCTCGTGGTTTCGGT



CGTCAGGGTAAACGTACCTTCATGACCGAACGTCAGTACACCAAAATGGAAGACT



GGCTGACCGCGAAACTGGCGTACGAAGGTCTGACCTCTAAAACCTACCTGTCTAA



AACCCTGGCGCAGTACACCTCTAAAACCTGCTCTAACTGCGGTTTCACCATCACC



ACCGCGGACTACGACGGTATGCTGGTTCGTCTGAAAAAAACCTCTGACGGTTGGG



CGACCACCCTGAACAACAAAGAACTGAAAGCGGAAGGTCAGATCACCTACTACAA



CCGTTACAAACGTCAGACCGTTGAAAAAGAACTGTCTGCGGAACTGGACCGTCTG



TCTGAAGAATCTGGTAACAACGACATCTCTAAATGGACCAAAGGTCGTCGTGACG



AAGCGCTGTTCCTGCTGAAAAAACGTTTCTCTCACCGTCCGGTTCAGGAACAGTT



CGTTTGCCTGGACTGCGGTCACGAAGTTCACGCGGACGAACAGGCGGCGCTGAAC



ATCGCGCGTTCTTGGCTGTTCCTGAACTCTAACTCTACCGAATTCAAATCTTACA



AATCTGGTAAACAGCCGTTCGTTGGTGCGTGGCAGGCGTTCTACAAACGTCGTCT



GAAAGAAGTTTGGAAACCGAACGCGTAAGAAATCATCCTTAGCGAAAGCTAAGGA



TTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGT



TATTACTCAGGAAGCAAAGAGGATTACA





SEQ ID NO: 80
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT



TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC



TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA



CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC



ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC



GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAG



TAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAAC



TTTAAGAGGAGGATATACCATGCACCATCATCATCACCATAAACGTATCAACAAA



ATCCGTCGTCGTCTGGTTAAAGACTCTAACACCAAAAAAGCGGGTAAAACCGGTC



CGATGAAAACCCTGCTGGTTCGTGTTATGACCCCGGACCTGCGTGAACGTCTGGA



AAACCTGCGTAAAAAACCGGAAAACATCCCGCAGCCGATCTCTAACACCTCTCGT



GCGAACCTGAACAAACTGCTGACCGACTACACCGAAATGAAAAAAGCGATCCTGC



ACGTTTACTGGGAAGAATTCCAGAAAGACCCGGTTGGTCTGATGTCTCGTGTTGC



GCAGCCGGCGCCGAAAAACATCGACCAGCGTAAACTGATCCCGGTTAAAGACGGT



AACGAACGTCTGACCTCTTCTGGTTTCGCGTGCTCTCAGTGCTGCCAGCCGCTGT



ACGTTTACAAACTGGAACAGGTTAACGACAAAGGTAAACCGCACACCAACTACTT



CGGTCGTTGCAACGTTTCTGAACACGAACGTCTGATCCTGCTGTCTCCGCACAAA



CCGGAAGCGAACGACGAACTGGTTACCTACTCTCTGGGTAAATTCGGTCAGCGTG



CGCTGGACTTCTACTCTATCCACGTTACCCGTGAATCTAACCACCCGGTTAAACC



GCTGGAACAGATCGGTGGTAACTCTTGCGCGTCTGGTCCGGTTGGTAAAGCGCTG



TCTGACGCGTGCATGGGTGCGGTTGCGTCTTTCCTGACCAAATACCAGGACATCA



TCCTGGAACACCAGAAAGTTATCAAAAAAAACGAAAAACGTCTGGCGAACCTGAA



AGACATCGCGTCTGCGAACGGTCTGGCGTTCCCGAAAATCACCCTGCCGCCGCAG



CCGCACACCAAAGAAGGTATCGAAGCGTACAACAACGTTGTTGCGCAGATCGTTA



TCTGGGTTAACCTGAACCTGTGGCAGAAACTGAAAATCGGTCGTGACGAAGCGAA



ACCGCTGCAGCGTCTGAAAGGTTTCCCGTCTTTCCCGCTGGTTGAACGTCAGGCG



AACGAAGTTGACTGGTGGGACATGGTTTGCAACGTTAAAAAACTGATCAACGAAA



AAAAAGAAGACGGTAAAGTTTTCTGGCAGAACCTGGCGGGTTACAAACGTCAGGA



AGCGCTGCTGCCGTACCTGTCTTCTGAAGAAGACCGTAAAAAAGGTAAAAAATTC



GCGCGTTACCAGTTCGGTGACCTGCTGCTGCACCTGGAAAAAAAACACGGTGAAG



ACTGGGGTAAAGTTTACGACGAAGCGTGGGAACGTATCGACAAAAAAGTTGAAGG



TCTGTCTAAACACATCAAACTGGAAGAAGAACGTCGTTCTGAAGACGCGCAGTCT



AAAGCGGCGCTGACCGACTGGCTGCGTGCGAAAGCGTCTTTCGTTATCGAAGGTC



TGAAAGAAGCGGACAAAGACGAATTCTGCCGTTGCGAACTGAAACTGCAGAAATG



GTACGGTGACCTGCGTGGTAAACCGTTCGCGATCGAAGCGGAAAACTCTATCCTG



GACATCTCTGGTTTCTCTAAACAGTACAACTGCGCGTTCATCTGGCAGAAAGACG



GTGTTAAAAAACTGAACCTGTACCTGATCATCAACTACTTCAAAGGTGGTAAACT



GCGTTTCAAAAAAATCAAACCGGAAGCGTTCGAAGCGAACCGTTTCTACACCGTT



ATCAACAAAAAATCTGGTGAAATCGTTCCGATGGAAGTTAACTTCAACTTCGACG



ACCCGAACCTGATCATCCTGCCGCTGGCGTTCGGTAAACGTCAGGGTCGTGAATT



CATCTGGAACGACCTGCTGTCTCTGGAAACCGGTTCTCTGAAACTGGCGAACGGT



CGTGTTATCGAAAAAACCCTGTACAACCGTCGTACCCGTCAGGACGAACCGGCGC



TGTTCGTTGCGCTGACCTTCGAACGTCGTGAAGTTCTGGACTCTTCTAACATCAA



ACCGATGAACCTGATCGGTATCGACCGTGGTGAAAACATCCCGGCGGTTATCGCG



CTGACCGACCCGGAAGGTTGCCCGCTGTCTCGTTTCAAAGACTCTCTGGGTAACC



CGACCCACATCCTGCGTATCGGTGAATCTTACAAAGAAAAACAGCGTACCATCCA



GGCGGCGAAAGAAGTTGAACAGCGTCGTGCGGGTGGTTACTCTCGTAAATACGCG



TCTAAAGCGAAAAACCTGGCGGACGACATGGTTCGTAACACCGCGCGTGACCTGC



TGTACTACGCGGTTACCCAGGACGCGATGCTGATCTTCGAAAACCTGTCTCGTGG



TTTCGGTCGTCAGGGTAAACGTACCTTCATGGCGGAACGTCAGTACACCCGTATG



GAAGACTGGCTGACCGCGAAACTGGCGTACGAAGGTCTGCCGTCTAAAACCTACC



TGTCTAAAACCCTGGCGCAGTACACCTCTAAAACCTGCTCTAACTGCGGTTTCAC



CATCACCTCTGCGGACTACGACCGTGTTCTGGAAAAACTGAAAAAAACCGCGACC



GGTTGGATGACCACCATCAACGGTAAAGAACTGAAAGTTGAAGGTCAGATCACCT



ACTACAACCGTTACAAACGTCAGAACGTTGTTAAAGACCTGTCTGTTGAACTGGA



CCGTCTGTCTGAAGAATCTGTTAACAACGACATCTCTTCTTGGACCAAAGGTCGT



TCTGGTGAAGCGCTGTCTCTGCTGAAAAAACGTTTCTCTCACCGTCCGGTTCAGG



AAAAATTCGTTTGCCTGAACTGCGGTTTCGAAACCCACGCGGACGAACAGGCGGC



GCTGAACATCGCGCGTTCTTGGCTGTTCCTGCGTTCTCAGGAATACAAAAAATAC



CAGACCAACAAAACCACCGGTAACACCGACAAACGTGCGTTCGTTGAAACCTGGC



AGTCTTTCTACCGTAAAAAACTGAAAGAAGTTTGGAAACCGGAAATCATCCTTAG



CGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATAT



TCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA





SEQ ID NO: 81
tgccgtcactgcgtcttttactggctcttctcgctaaccaaaccggtaaccccgc



ttattaaaagcattctgtaacaaagcgggaccaaagccatgacaaaaacgcgtaa



caaaagtgtctataatcacggcagaaaagtccacattgattatttgcacggcgtc



acactttgctatgccatagcatttttatccataagattagcggatcctacctgac



gctttttatcgcaactctctactgtttctccatacccgtttttttgggctagcac



cgcctatctcgtgtgagataggcggagatacgaactttaagAAGGAGatatacc





SEQ ID NO: 82
TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGC



TTATTAAAAGCATTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAA



CAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTC



ACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGAC



GCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGTAGCGGA



TCCTACCTGAC





SEQ ID NO: 83
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGT



TTCTAGAGCACAGCTAACACCACGTCGTCCCTATCTGCTGCCCTAGGTCTATGAG



TGGTTGCTGGATAACTTTACGGGCATGCATAAGGCTCGTAATATATATTCAGGGA



GACCACAACGGTTTCCCTCTACAAATAATTTTGTTTAACTTTTACTAGAGCTAGC



AGTAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGA



ACTTTAAGAGGAGGATATACCA





SEQ ID NO: 84
GTTTGAGAGATATGTAAATTCAAAGGATAATCAAAC






SEQ ID NO: 85
actacattttttaagacctaattttgagt






SEQ ID NO: 86
ctcaaaactcattcgaatctctactctttgtagat






SEQ ID NO: 87
CTCTAGCAGGCCTGGCAAATTTCTACTGTTGTAGAT






SEQ ID NO: 88
CCGTCTAAAACTCATTCAGAATTTCTACTAGTGTAGAT






SEQ ID NO: 89
GTCTAGGTACTCTCTTTAATTTCTACTATTGT






SEQ ID NO: 90
gttaagttatatagaataatttctactgttgtaga






SEQ ID NO: 91
gtttaaaaccactttaaaatttctactattgta






SEQ ID NO: 92
GTTTGAGAATGATGTAAAAATGTATGGTACACAGAAATGTTTTAATACCATATTT



TTACATCACTCTCAAACATACATCTCTTGTTACTGTTTATCGTATCCAGATTAAA



TTTCACGTTTTT






SEQ ID NO: 93
CTCTACAACTGATAAAGAATTTCTACTTTTGTAGAT






SEQ ID NO: 94
GTCTGGCCCCAAATTTTAATTTCTACTGTTGTAGAT






SEQ ID NO: 95
GTCAAAAGACCTTTTTAATTTCTACTCTTGTAGAT






SEQ ID NO: 96
GTCTAGAGGACAGAATTTTTCAACGGGTGTGCCAATGGCCACTTTCCAGGTGGCA



AAGCCCGTTGAGCTTCTACGGAAGTGGCAC






SEQ ID NO: 97
CGAGGTTCTGTCTTTTGGTCAGGACAACCGTCTAGCTATAAGTGCTGCAGGGGTG



TGAGAAACTCCTATTGCTGGACGATGTCTCTTTTAACGAGGCATTAGCAC






SEQ ID NO: 98
GAACGAGGGACGTTTTGTCTCCAATGATTTTGCTATGACGACCTCGAACTGTGCC



TTCAAGTCTGAGGCGAAAAAGAAATGGAAAAAAGTGTCTCATCGCTCTACCTCGT



AGTTAGAGG






SEQ ID NO: 99
AATTACTGATGTTGTGATGAAGG






SEQ ID NO: 100
TATACCATAAGGATTTAAAGACT






SEQ ID NO: 101
GTCTTTACTCTCACCTTTCCACCTG






SEQ ID NO: 102
ATTTGAAGGTATCTCCGATAAGTAAAACGCATCAAAG






SEQ ID NO: 103
GTTTGAAGATATCTCCGATAAATAAGAAGCATCAAAG






SEQ ID NO: 104
TTGTTTTAATACCATATTTTTACATCACTCTCAAAC






SEQ ID NO: 105
AAAGAACGCTCGCTCAGTGTTCTGACCTTTCGAGCGCCTGTTCAGGGCGAAAACC



CTGGGAGGCGCTCGAATCATAGGTGGGACAAGGGATTCGCGGCGAAAA






SEQ ID NO: 106
GTTTGAGAATGATGTAAAAATGTATGGTACACAGAAATGTTTTAATACCATATTT



TTACATCACTCTCAAACATACATCTCTTGTTACTGTTTATCGTATCCAGATTAAA



TTTCACGTTTTT






SEQ ID NO: 107
GTCTAGAGGACAGAATTTTTCAACGGGTGTGCCAATGGCCACTTTCCAGGTGGCA



AAGCCCGTTGAGCTTCTACGGAAGTGGCAC






SEQ ID NO: 108
MSIYQEFVNKYSLSKTLRFELIPQGKTLENIKARGLILDDEKRAKDYKKAKQIID



KYHQFFIEEILSSVCISEDLLQNYSDVYFKLKKSDDDNLQKDFKSAKDTIKKQIS



EYIKDSEKFKNLFNQNLIDAKKGQESDLILWLKQSKDNGIELFKANSDITDIDEA



LEIIKSFKGWTTYFKGFHENRKNVYSSNDIPTSIIYRIVDDNLPKFLENKAKYES



LKDKAPEAINYEQIKKDLAEELTFDIDYKTSEVNQRVFSLDEVFEIANFNNYLNQ



SGITKFNTIIGGKFVNGENTKRKGINEYINLYSQQINDKTLKKYKMSVLFKQILS



DTESKSFVIDKLEDDSDVVTTMQSFYEQIAAFKTVEEKSIKETLSLLFDDLKAQK



LDLSKIYFKNDKSLTDLSQQVEDDYSVIGTAVLEYITQQIAPKNLDNPSKKEQEL



IAKKTEKAKYLSLETIKLALEEFNKHRDIDKQCRFEEILANFAAIPMIFDEIAQN



KDNLAQISIKYQNQGKKDLLQASAEDDVKAIKDLLDQTNNLLHKLKIFHISQSED



KANILDKDEHFYLVFEECYFELANIVPLYNKIRNYITQKPYSDEKFKLNFENSTL



ANGWDKNKEPDNTAILFIKDDKYYLGVMNKKNNKIFDDKAIKENKGEGYKKIVYK



LLPGANKMLPKVFFSAKSIKFYNPSEDILRIRNHSTHTKNGSPQKGYEKFEFNIE



DCRKFIDFYKQSISKHPEWKDFGFRFSDTQRYNSIDEFYREVENQGYKLTFENIS



ESYIDSVVNQGKLYLFQIYNKDFSAYSKGRPNLHTLYWKALFDERNLQDVVYKLN



GEAELFYRKQSIPKKITHPAKEAIANKNKDNPKKESVFEYDLIKDKRFTEDKFFF



HCPITINFKSSGANKENDEINLLLKEKANDVHILSIDRGERHLAYYTLVDGKGNI



IKQDTFNIIGNDRMKTNYHDKLAAIEKDRDSARKDWKKINNIKEMKEGYLSQVVH



EIAKLVIEYNAIVVFEDLNFGFKRGRFKVEKQVYQKLEKMLIEKLNYLVFKDNEF



DKTGGVLRAYQLTAPFETFKKMGKQTGIIYYVPAGFTSKICPVTGFVNQLYPKYE



SVSKSQEFFSKFDKICYNLDKGYFEFSFDYKNFGDKAAKGKWTIASFGSRLINFR



NSDKNHNWDTREVYPTKELEKLLKDYSIEYGHGECIKAAICGESDKKFFAKLTSV



LNTILQMRNSKTGTELDYLISPVADVNGNFFDSRQAPKNMPQDADANGAYHIGLK



GLMLLGRIKNNQEGKKLNLVIKNEEYFEFVQNRNN





SEQ ID NO: 109
MSIYQEFVNKYSLSKTLRFELIPQGKTLENIKARGLILDDEKRAKDYKKAKQIID
KYHQFFIEEILSSVCISEDLLQNYSDVYFKLKKSDDDNLQKDFKSAKDTIKKQIS
EYIKDSEKFKNLFNQNLIDAKKGQESDLILWLKQSKDNGIELFKANSDITDIDEA
LEIIKSFKGWTTYFKGFHENRKNVYSSDDIPTSIIYRIVDDNLPKFLENKAKYES



LKDKAPEAINYEQIKKDLAEELTFDIDYKTSEVNQRVFSLDEVFEIANFNNYLNQ



SGITKFNTIIGGKFVNGENTKRKGINEYINLYSQQINDKTLKKYKMSVLFKQILS



DTESKSFVIDKLEDDSDVVTTMQSFYEQIAAFKTVEEKSIKETLSLLFDDLKAQK



LDLSKIYFKNDKSLTDLSQQVEDDYSVIGTAVLEYITQQVAPKNLDNPSKKEQDL



IAKKTEKAKYLSLETIKLALEEFNKHRDIDKQCRFEEILANFAAIPMIFDEIAQN



KDNLAQISLKYQNQGKKDLLQASAEEDVKAIKDLLDQTNNLLHRLKIFHISQSED



KANILDKDEHFYLVFEECYFELANIVPLYNKIRNYITQKPYSDEKFKLNFENSTL



ANGWDKNKEPDNTAILFIKDDKYYLGVMNKKNNKIFDDKAIKENKGEGYKKIVYK



LLPGANKMLPKVFFSAKSIKFYNPSEDILRIRNHSTHTKNGNPQKGYEKFEFNIE



DCRKFIDFYKESISKHPEWKDFGFRFSDTQRYNSIDEFYREVENQGYKLTFENIS



ESYIDSVVNQGKLYLFQIYNKDFSAYSKGRPNLHTLYWKALFDERNLQDVVYKLN



GEAELFYRKQSIPKKITHPAKEAIANKNKDNPKKESVFEYDLIKDKRFTEDKFFF



HCPITINFKSSGANKENDEINLLLKEKANDVHILSIDRGERHLAYYTLVDGKGNI



IKQDTFNIIGNDRMKTNYHDKLAAIEKDRDSARKDWKKINNIKEMKEGYLSQVVH



EIAKLVIEHNAIVVFEDLNFGFKRGRFKVEKQVYQKLEKMLIEKLNYLVFKDNEF



DKTGGVLRAYQLTAPFETFKKMGKQTGIIYYVPAGFTSKICPVTGFVNQLYPKYE



SVSKSQEFFSKFDKICYNLDKGYFEFSFDYKNFGDKAAKGKWTIASFGSRLINFR



NSDKNHNWDTREVYPTKELEKLLKDYSIEYGHGECIKAAICGESDKKFFAKLTSV



LNTILQMRNSKTGTELDYLISPVADVNGNFFDSRQAPKNMPQDADANGAYHIGLK



GLMLLDRIKNNQEGKKLNLVIKNEEYFEFVQNRNN





SEQ ID NO: 110
MDKKYSIGLDIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDS
GETAEATRLKRTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEED
KKHERHPIFGNIVDEVAYHEKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFR
GHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINASGVDAKAILSARLSKSR



RLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDL



DNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEHHQ



DLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDG



TEELLVKLNREDLLRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREKI



EKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGASAQSFIERM



TNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVD



LLFKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKIIKDKD



FLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTGWG



RLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSG



QGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENIVIEMARENQTT



QKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVD



QELDINRLSDYDVDHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMK



NYWRQLLNAKLITQRKFDNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQIL



DSRMNTKYDENDKLIREVKVITLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLN



AVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKATAKYFFYSNIMNF



FKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPQVNIVKKTEV



QTGGFSKESILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKGKSK



KLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLIIKLPKYSLFELENGR



KRMLASAGELQKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHY



LDEIIEQISEFSKRVILADANLDKVLSAYNKHRDKPIREQAENIIHLFTLTNLGA



PAAFKYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRIDLSQLGGD





SEQ ID NO: 111
PKKKRKV






SEQ ID NO: 112
KRPAATKKAGQAKKKK






SEQ ID NO: 113
PAAKRVKLD






SEQ ID NO: 114
RQRRNELKRSP






SEQ ID NO: 115
NQSSNFGPMKGGNFGGRSSGPYGGGGQYFAKPRNQGGY






SEQ ID NO: 116
RMRIZFKNKGKDTAELRRRRVEVSVELRKAKKDEQILKRRNV






SEQ ID NO: 117
VSRKRPRP






SEQ ID NO: 118
PPKKARED






SEQ ID NO: 119
PQPKKKPL






SEQ ID NO: 120
SALIKKKKKMAP






SEQ ID NO: 121
DRLRR






SEQ ID NO: 122
PKQKKRK






SEQ ID NO: 123
RKLKKKIKKL






SEQ ID NO: 124
REKKKFLKRR






SEQ ID NO: 125
KRKGDEVDGVDEVAKKKSKK






SEQ ID NO: 126
RKCLQAGMNLEARKTKK






SEQ ID NO: 127
ATGGGTAAGATGTATTATCTGGGTTTGGATATAGGCACTAACTCTGTGGGATATG
CAGTAACTGATCCCTCGTATCACTTGTTAAAGTTCAAAGGCGAACCCATGTGGGG
AGCACATGTATTTGCTGCGGGTAATCAGAGTGCCGAAAGGCGATCTTTCAGAACA
TCCAGGAGGCGATTAGATAGGAGACAGCAAAGAGTAAAGCTTGTGCAAGAGATCT



TTGCTCCTGTCATTTCACCTATAGACCCTCGTTTTTTTATAAGATTGCACGAATC



GGCTCTATGGAGAGACGATGTTGCCGAAACAGATAAACATATCTTTTTCAATGAT



CCCACTTATACAGACAAGGAATACTACTCCGACTACCCGACAATTCATCATTTGA



TCGTCGATCTTATGGAGAGCTCTGAAAAGCATGACCCCCGACTTGTCTATTTGGC



TGTAGCTTGGTTAGTTGCTCATAGAGGTCATTTCTTGAATGAAGTAGATAAAGAC



AATATAGGTGATGTACTTTCTTTTGATGCTTTCTACCCGGAATTTTTGGCCTTTT



TGTCAGACAATGGCGTCAGTCCCTGGGTCTGTGAGTCGAAGGCCCTTCAAGCTAC



TCTGCTGTCTAGGAATAGCGTCAACGACAAATATAAAGCATTAAAATCGCTGATA



TTCGGATCGCAAAAACCGGAAGATAACTTTGACGCTAACATCTCTGAAGATGGTT



TAATCCAATTGCTGGCGGGTAAGAAAGTTAAAGTAAACAAACTATTCCCACAAGA



GTCCAACGATGCTAGCTTTACGTTGAATGATAAAGAAGACGCTATTGAAGAAATT



CTAGGTACTTTAACGCCTGACGAGTGCGAATGGATCGCTCATATTCGCAGATTGT



TCGATTGGGCCATCATGAAACACGCGCTAAAGGATGGCAGGACGATATCTGAATC



AAAAGTGAAGCTATACGAGCAGCATCATCATGACTTGACTCAGTTAAAGTACTTT



GTGAAGACCTACCTAGCTAAAGAGTATGATGATATCTTCAGAAACGTAGACTCCG



AGACAACTAAAAATTATGTAGCTTATTCTTACCATGTGAAGGAAGTGAAAGGCAC



ATTACCAAAAAATAAAGCAACGCAAGAAGAATTTTGTAAATACGTCCTTGGCAAA



GTCAAAAACATTGAATGTTCCGAAGCAGACAAGGTTGATTTTGATGAAATGATAC



AACGACTTACGGACAATTCTTTTATGCCAAAGCAAGTCTCAGGTGAAAATAGAGT



AATACCATACCAGTTGTACTACTATGAATTAAAGACAATTTTAAACAAAGCCGCC



TCATATCTACCTTTTTTGACACAATGCGGTAAAGATGCTATTTCTAACCAAGACA



AATTACTGTCTATAATGACATTTCGCATACCATATTTCGTCGGCCCTTTAAGGAA



AGATAATTCAGAACATGCCTGGTTGGAACGTAAAGCGGGTAAAATTTACCCGTGG



AACTTTAATGATAAAGTAGATCTTGATAAATCGGAGGAAGCCTTTATCCGTAGGA



TGACCAATACTTGCACGTATTACCCAGGAGAAGACGTGTTACCATTAGATTCACT



TATCTATGAAAAGTTTATGATCTTGAATGAGATAAACAATATTAGGATTGACGGA



TACCCCATTTCTGTTGATGTGAAACAACAAGTATTTGGTTTATTTGAGAAGAAAA



GGCGAGTAACAGTTAAGGATATTCAAAATCTACTATTATCTCTTGGAGCGTTGGA



TAAACACGGTAAGCTGACTGGTATTGACACGACAATACACTCTAATTATAACACT



TATCATCATTTTAAATCTCTTATGGAGCGGGGAGTATTGACCAGAGATGATGTGG



AAAGAATAGTGGAAAGAATGACATATTCTGACGATACTAAGAGGGTCAGACTGTG



GTTAAATAATAATTATGGAACTCTAACAGCTGACGATGTTAAGCATATCTCAAGA



CTCAGAAAACACGATTTCGGCCGTTTGTCTAAAATGTTTTTGACAGGATTGAAAG



GTGTTCATAAGGAGACAGGCGAGAGAGCAAGTATACTGGATTTTATGTGGAATAC



TAACGACAATTTAATGCAACTACTGTCCGAATGTTACACATTCTCGGATGAGATC



ACCAAATTACAAGAGGCCTACTACGCAAAAGCTCAATTATCGCTAAATGACTTCT



TGGACTCTATGTATATATCAAACGCCGTTAAGAGACCTATTTATCGGACCTTAGC



GGTAGTAAATGATATTAGAAAGGCATGCGGGACGGCACCTAAAAGAATTTTCATC



GAGATGGCGCGAGATGGAGAGTCTAAGAAGAAAAGATCTGTGACTCGTAGAGAGC



AAATTAAAAATCTCTATAGATCAATTCGTAAAGACTTTCAACAAGAAGTTGATTT



TCTGGAAAAGATATTGGAAAATAAGAGTGACGGGCAGCTTCAGTCTGACGCTTTA



TATTTGTATTTTGCTCAATTAGGCAGAGACATGTACACAGGTGATCCAATCAAAT



TAGAACATATTAAAGACCAATCTTTTTACAACATTGATCATATTTATCCTCAATC



GATGGTGAAAGATGACAGTTTGGATAACAAGGTACTAGTCCAAAGCGAAATCAAT



GGCGAAAAGAGTTCGCGCTATCCATTAGACGCAGCCATTAGAAACAAAATGAAGC



CGTTGTGGGATGCCTACTATAATCATGGATTAATTTCTCTTAAGAAATACCAGCG



TTTGACGAGATCTACTCCATTTACGGACGACGAGAAGTGGGATTTTATCAATCGT



CAGCTAGTTGAAACTAGGCAATCTACTAAAGCTTTAGCAATATTGTTAAAGCGTA



AGTTTCCAGATACTGAAATAGTTTACTCAAAGGCTGGACTATCCAGCGATTTTAG



ACATGAATTCGGCCTGGTTAAGAGTAGGAATATTAATGATCTACACCATGCTAAA



GATGCCTTTCTCGCAATAGTTACTGGGAACGTTTATCATGAAAGATTTAATAGAA



GATGGTTTATGGTTAACCAGCCATACTCTGTGAAAACTAAGACATTGTTTACCCA



TTCAATTAAGAATGGCAACTTTGTCGCTTGGAATGGAGAAGAAGATCTTGGACGT



ATCGTAAAGATGTTGAAACAAAACAAGAACACAATCCACTTCACCAGGTTTTCCT



TTGATAGGAAGGAGGGATTGTTCGATATTCAACCTCTCAAAGCTTCTACCGGATT



GGTTCCACGAAAAGCAGGGTTGGATGTTGTTAAATATGGAGGATACGATAAAAGC



ACTGCCGCGTATTATTTATTAGTACGTTTTACACTCGAGGATAAGAAGACTCAAC



ACAAATTGATGATGATTCCTGTTGAAGGTCTCTACAAAGCACGTATTGACCATGA



TAAAGAGTTTTTAACAGATTATGCTCAGACCACGATCAGCGAAATTCTTCAAAAG



GACAAGCAGAAAGTGATCAACATCATGTTCCCTATGGGCACGAGACATATCAAAC



TGAATTCGATGATTTCTATTGATGGATTCTATCTTTCTATTGGTGGGAAGAGTAG



CAAAGGTAAGTCAGTACTATGTCATGCTATGGTGCCATTAATCGTCCCACACAAG



ATAGAATGTTATATCAAGGCTATGGAATCGTTTGCAAGAAAATTCAAAGAAAATA



ATAAATTGAGGATCGTTGAAAAGTTTGATAAAATAACTGTTGAAGATAACTTGAA



CTTATACGAGCTTTTTCTACAAAAGTTGCAACATAACCCATATAATAAATTTTTC



TCTACACAATTTGATGTGTTGACGAACGGTAGAAGTACATTCACCAAATTGTCTC



CAGAGGAGCAAGTCCAGACTTTACTTAATATACTGAGTATATTTAAAACTTGTCG



TTCTTCTGGGTGTGATTTAAAATCAATAAATGGTTCCGCTCAAGCGGCTAGAATT



ATGATATCCGCTGATTTAACTGGCTTATCAAAAAAGTATTCAGATATTAGATTAG



TTGAGCAAAGCGCATCAGGTCTATTTGTTTCAAAATCTCAAAATCTCTTGGAATA



CTTGCCAAAAAAGAAAAGGAAAGTTTAG





SEQ ID NO: 128
ATGAGTAGTTTAACAAAGTTTACCAATAAATATAGTAAGCAACTAACTATAAAGA
ACGAATTGATACCGGTCGGTAAGACTTTGGAAAACATAAAAGAAAATGGGTTGAT
TGATGGAGACGAGCAATTGAATGAGAATTATCAAAAAGCAAAGATAATAGTAGAT
GATTTTTTGAGAGACTTTATTAATAAAGCTCTAAATAACACTCAAATTGGTAACT



GGAGAGAGCTAGCCGACGCCTTGAACAAGGAAGATGAGGATAATATTGAGAAATT



ACAAGATAAGATTAGAGGGATTATCGTGTCTAAGTTTGAGACTTTTGATCTGTTC



AGTTCGTATTCGATTAAAAAGGACGAGAAAATCATCGATGATGATAACGATGTGG



AAGAAGAGGAGCTAGACCTTGGGAAGAAGACATCTAGCTTCAAATACATATTCAA



GAAAAATTTGTTCAAACTTGTCCTTCCTTCATATTTAAAAACAACAAATCAAGAT



AAGTTAAAAATCATTTCTTCCTTCGATAATTTTAGTACTTATTTTCGTGGTTTTT



TCGAAAACAGGAAAAATATATTCACTAAAAAGCCTATATCTACCTCTATAGCTTA



TAGAATTGTTCACGATAATTTCCCAAAATTTCTAGATAATATCAGGTGTTTTAAT



GTTTGGCAAACCGAGTGTCCTCAGTTAATAGTCAAGGCCGACAACTACCTTAAAA



GCAAGAATGTGATTGCAAAAGATAAGTCTTTGGCTAACTATTTTACAGTCGGTGC



CTATGATTATTTTCTGAGTCAAAATGGTATCGATTTCTATAACAACATTATTGGC



GGCTTACCAGCTTTTGCCGGGCATGAGAAGATTCAGGGTTTGAACGAATTTATCA



ATCAAGAATGTCAAAAGGATTCTGAATTAAAGTCTAAGCTCAAGAATAGGCACGC



TTTCAAAATGGCAGTCTTATTCAAACAAATCCTTTCAGACAGAGAAAAGTCATTT



GTGATTGACGAGTTCGAATCAGACGCTCAGGTAATTGATGCTGTTAAAAATTTTT



ACGCGGAACAATGCAAAGATAATAACGTCATATTTAATTTATTGAATCTGATCAA



GAATATTGCTTTTTTGTCGGATGATGAGTTAGACGGCATTTTCATAGAGGGTAAA



TACCTGTCCTCTGTGTCTCAAAAATTGTATAGTGATTGGTCAAAGTTGAGAAATG



ATATTGAAGATTCGGCTAATTCTAAACAGGGTAACAAAGAATTAGCGAAGAAAAT



CAAAACTAACAAGGGTGATGTTGAAAAGGCTATAAGTAAGTACGAGTTCAGTTTA



TCTGAACTAAATTCAATTGTTCATGATAACACAAAATTTTCCGATCTTTTATCAT



GCACATTACATAAAGTTGCAAGTGAAAAATTAGTCAAAGTAAACGAAGGTGATTG



GCCAAAACATCTAAAAAACAACGAGGAAAAACAGAAGATAAAAGAACCTCTTGAC



GCTTTATTGGAAATATACAATACTCTATTAATATTTAACTGTAAAAGTTTTAACA



AAAATGGTAATTTCTATGTCGACTACGATCGCTGCATTAATGAGTTGTCCAGTGT



TGTGTACTTGTATAATAAAACTCGTAATTATTGTACGAAAAAGCCGTACAACACT



GACAAATTTAAGTTGAATTTCAACTCCCCACAACTGGGTGAGGGCTTCTCTAAAA



GTAAAGAGAATGATTGCCTTACATTATTATTTAAAAAAGATGATAATTATTATGT



CGGAATCATAAGAAAGGGGGCAAAGATCAACTTCGATGACACTCAGGCCATAGCA



GACAACACAGATAACTGTATATTCAAAATGAATTATTTTTTGCTGAAGGATGCTA



AAAAATTTATCCCCAAATGTTCAATACAATTAAAAGAGGTTAAGGCCCATTTCAA



AAAGTCGGAAGATGACTATATTTTGTCCGATAAGGAAAAATTCGCTAGTCCGCTT



GTTATTAAAAAATCCACATTTCTTCTCGCTACGGCTCATGTGAAAGGAAAGAAGG



GCAATATTAAGAAATTTCAGAAAGAATACTCCAAAGAAAATCCTACGGAGTATAG



AAATAGTCTGAACGAATGGATAGCATTCTGCAAAGAGTTCTTGAAGACCTATAAA



GCTGCCACCATCTTTGATATTACAACTTTGAAAAAGGCCGAGGAATACGCTGACA



TTGTGGAATTCTATAAGGATGTAGATAATCTTTGTTACAAGTTAGAATTTTGCCC



TATCAAAACTTCTTTTATCGAAAATCTTATAGATAATGGCGATTTATACCTGTTT



AGAATTAATAACAAGGACTTTTCTTCAAAAAGTACAGGCACGAAAAACTTACACA



CATTATACTTGCAGGCTATATTTGACGAGCGAAACTTAAACAACCCCACGATAAT



GTTGAATGGAGGTGCAGAGTTATTCTACAGAAAAGAATCTATAGAACAGAAAAAT



CGGATCACGCACAAAGCCGGTAGTATCTTAGTGAATAAAGTGTGCAAAGATGGTA



CAAGTCTAGATGACAAAATCCGTAACGAAATTTACCAGTATGAAAACAAATTCAT



TGATACTCTTTCGGACGAAGCTAAAAAGGTTCTGCCAAACGTTATTAAGAAAGAG



GCTACGCATGATATAACAAAAGATAAACGTTTCACTAGCGACAAATTCTTCTTTC



ATTGTCCTTTAACAATCAACTACAAGGAAGGTGACACCAAACAATTTAATAATGA



AGTGCTCTCATTCCTTAGAGGTAACCCCGATATCAATATTATCGGCATTGATAGA



GGAGAAAGAAACCTAATCTATGTAACAGTCATTAACCAAAAAGGCGAAATATTGG



ATAGCGTCTCCTTCAATACTGTCACCAATAAGTCATCGAAGATAGAACAAACTGT



TGATTACGAAGAAAAATTGGCCGTTAGAGAAAAGGAACGTATCGAAGCGAAGAGA



TCTTGGGATAGCATATCCAAGATTGCCACCTTGAAGGAGGGTTATCTAAGCGCGA



TCGTACATGAAATCTGCTTATTAATGATTAAGCATAATGCTATTGTCGTGTTAGA



AAACCTGAATGCCGGTTTTAAAAGGATTAGAGGTGGTTTGTCAGAAAAGTCAGTA



TATCAAAAGTTTGAAAAGATGCTTATTAATAAACTCAACTACTTCGTTAGCAAGA



AAGAAAGTGATTGGAATAAACCGTCAGGTTTGCTCAATGGTCTTCAGTTAAGTGA



TCAATTTGAGTCTTTCGAAAAATTAGGAATTCAAAGTGGATTCATTTTTTATGTA



CCAGCCGCGTACACTTCAAAAATTGACCCTACGACCGGATTTGCCAACGTCTTGA



ATTTGTCCAAGGTCAGAAATGTTGACGCCATCAAAAGTTTTTTTAGCAACTTCAA



TGAAATCTCTTATTCCAAAAAGGAAGCCCTTTTCAAGTTTTCTTTTGACCTAGAC



TCGTTATCGAAGAAAGGATTTTCATCTTTCGTAAAGTTTAGCAAGTCCAAGTGGA



ATGTATACACATTCGGCGAGAGAATTATCAAGCCCAAGAACAAACAGGGCTATAG



AGAAGACAAGAGAATCAACTTGACTTTTGAGATGAAAAAATTACTCAACGAATAC



AAGGTTTCATTTGATTTGGAGAACAACTTGATTCCCAATTTGACATCAGCTAACT



TGAAGGATACGTTCTGGAAGGAGTTATTCTTTATATTCAAAACGACATTACAACT



GCGTAATAGTGTTACAAACGGTAAAGAAGATGTATTAATCTCACCTGTAAAGAAT



GCCAAAGGAGAATTTTTCGTATCCGGTACTCACAATAAGACACTACCACAGGATT



GCGACGCTAACGGTGCGTATCATATTGCGTTGAAAGGATTAATGATACTTGAAAG



AAATAACCTTGTTCGCGAAGAAAAAGACACCAAGAAGATCATGGCTATTAGCAAT



GTTGATTGGTTTGAATACGTGCAAAAGAGGAGAGGTGTTTTGTAA





SEQ ID NO: 129
ATGAACAATTATGACGAGTTCACAAAGCTATACCCTATCCAAAAAACTATCAGGT
TCGAATTGAAACCACAAGGGAGAACAATGGAACATCTGGAGACATTCAACTTTTT
TGAAGAGGACAGAGACAGAGCGGAGAAATACAAAATTTTAAAAGAGGCCATCGAT
GAATATCACAAAAAGTTTATCGACGAGCATTTAACAAACATGTCTTTGGACTGGA



ATTCACTTAAACAAATTTCTGAGAAATATTATAAGTCTCGGGAGGAAAAAGACAA



AAAGGTCTTTTTGTCCGAGCAAAAGAGAATGAGACAAGAAATTGTCTCGGAGTTT



AAAAAAGATGATCGGTTCAAAGATTTGTTTAGCAAGAAATTGTTTTCTGAATTGT



TGAAGGAGGAGATATACAAGAAAGGCAACCATCAAGAAATAGATGCTTTGAAATC



GTTTGACAAGTTCAGCGGTTACTTCATTGGTTTACATGAAAATAGGAAGAACATG



TATAGCGACGGCGATGAGATCACCGCTATATCGAATAGAATCGTTAACGAAAATT



TTCCGAAATTTTTGGATAATTTGCAAAAATACCAGGAAGCTAGGAAAAAGTACCC



TGAATGGATAATAAAGGCGGAATCAGCTTTGGTGGCTCACAACATAAAGATGGAT



GAAGTCTTCTCGCTGGAATATTTTAACAAAGTATTAAATCAGGAAGGAATCCAAA



GATACAACTTAGCCTTGGGTGGATACGTAACCAAATCAGGTGAGAAAATGATGGG



CTTAAATGATGCACTTAATCTAGCTCACCAATCCGAAAAGTCCTCTAAAGGGAGG



ATACACATGACACCATTGTTTAAGCAAATCCTTTCGGAGAAAGAATCTTTTTCAT



ATATCCCCGATGTTTTCACTGAGGATAGTCAATTGTTGCCCAGCATTGGTGGATT



TTTTGCACAAATAGAAAATGATAAAGATGGTAACATCTTCGATAGAGCCTTGGAA



TTGATAAGCTCCTATGCAGAATACGATACGGAACGAATATACATTAGACAAGCTG



ACATCAACAGAGTAAGCAATGTTATTTTTGGTGAGTGGGGAACTTTAGGTGGATT



AATGCGGGAGTACAAAGCTGACTCAATCAATGATATTAATTTGGAACGTACGTGC



AAAAAAGTCGATAAGTGGCTTGATAGTAAGGAGTTTGCTCTGTCGGATGTACTAG



AAGCAATTAAGAGAACAGGAAACAATGATGCATTTAATGAATATATTAGTAAAAT



GAGGACGGCTAGAGAAAAGATAGACGCCGCACGTAAGGAAATGAAGTTTATTTCC



GAGAAAATATCTGGCGATGAAGAGTCGATTCACATCATCAAGACCCTACTCGATT



CTGTTCAGCAATTTCTCCATTTTTTTAACCTCTTCAAAGCAAGACAAGACATTCC



CTTAGATGGGGCTTTTTATGCCGAATTTGATGAAGTTCATTCAAAGTTGTTTGCT



ATTGTTCCTCTTTACAATAAGGTCCGTAATTACCTTACTAAAAATAACTTGAACA



CCAAGAAAATAAAGTTAAACTTCAAGAATCCGACTCTTGCCAACGGGTGGGATCA



GAATAAAGTTTATGATTATGCTAGCTTAATATTTCTAAGAGATGGGAATTATTAC



TTAGGAATCATCAATCCAAAGCGTAAGAAAAACATTAAATTTGAACAAGGGTCAG



GCAATGGCCCATTCTATAGAAAAATGGTGTATAAGCAAATACCAGGACCTAACAA



GAACTTGCCTCGCGTATTTTTAACTTCAACAAAGGGTAAAAAAGAATATAAACCA



AGCAAAGAAATTATTGAAGGTTACGAAGCAGATAAACACATCAGAGGTGATAAGT



TCGATCTGGATTTCTGCCATAAATTGATTGACTTTTTTAAGGAATCTATAGAAAA



ACATAAGGACTGGTCCAAATTTAATTTCTACTTCTCACCTACAGAAAGTTATGGT



GACATTTCAGAATTTTATTTAGACGTTGAGAAACAAGGATATAGGATGCATTTTG



AAAATATTTCAGCGGAAACCATCGACGAATACGTTGAGAAGGGTGATTTATTCTT



GTTCCAAATTTACAATAAAGACTTCGTTAAAGCTGCAACCGGAAAGAAGGATATG



CATACCATATATTGGAACGCTGCATTCTCGCCAGAAAACTTACAAGATGTCGTTG



TAAAGCTTAATGGAGAAGCTGAGCTGTTCTATAGAGACAAGAGTGATATAAAAGA



GATTGTGCATCGGGAAGGTGAAATTCTGGTGAACAGAACTTACAATGGTCGTACA



CCCGTTCCAGACAAAATACATAAAAAACTGACCGATTATCATAATGGTAGGACAA



AGGACTTGGGCGAGGCCAAGGAGTACCTCGATAAAGTTAGATATTTCAAGGCACA



CTATGATATTACGAAAGACAGGAGATATTTAAACGATAAAATTTACTTTCATGTC



CCTTTGACCCTTAACTTTAAAGCTAATGGTAAAAAGAATTTGAACAAAATGGTAA



TTGAGAAGTTTTTATCGGACGAAAAAGCTCACATAATCGGAATCGACCGCGGAGA



GAGAAATTTACTGTATTATAGTATCATCGACAGAAGTGGAAAGATTATTGATCAG



CAATCTTTGAACGTCATTGATGGGTTTGACTATCGGGAAAAGTTAAATCAAAGGG



AAATTGAAATGAAGGATGCGAGACAATCATGGAATGCCATTGGTAAAATTAAAGA



TCTCAAGGAGGGGTACTTATCAAAAGCTGTACACGAGATAACTAAAATGGCTATC



CAATATAATGCAATTGTTGTAATGGAAGAATTGAATTATGGTTTTAAACGCGGCA



GGTTTAAAGTCGAAAAACAAATATACCAAAAGTTTGAAAACATGTTAATTGATAA



GATGAACTATCTTGTTTTCAAAGATGCACCTGATGAGAGTCCTGGCGGTGTGCTG



AACGCCTATCAATTAACAAACCCATTAGAGTCCTTTGCTAAACTGGGTAAACAAA



CTGGCATTCTATTTTATGTTCCAGCCGCTTACACCTCAAAGATCGATCCAACGAC



CGGTTTTGTAAACTTATTTAATACTTCTTCCAAAACAAACGCGCAAGAACGCAAA



GAATTCCTACAAAAATTTGAATCAATATCCTATAGCGCAAAAGATGGAGGTATAT



TCGCTTTCGCTTTTGACTACAGAAAGTTTGGCACTTCCAAGACAGATCATAAAAA



TGTGTGGACCGCTTATACCAACGGAGAAAGGATGCGTTATATTAAAGAAAAAAAG



AGGAACGAACTATTTGATCCATCGAAAGAAATTAAAGAAGCTTTGACAAGCAGCG



GAATCAAATATGATGGAGGTCAAAACATACTTCCAGATATTCTCAGATCTAATAA



TAACGGTCTTATTTACACGATGTATTCATCTTTTATCGCTGCCATCCAAATGCGT



GTGTATGATGGCAAGGAAGATTATATTATATCTCCTATTAAAAATTCAAAGGGTG



AATTTTTTCGCACGGATCCAAAAAGAAGAGAGCTTCCAATTGACGCCGATGCTAA



CGGTGCTTACAATATTGCATTGCGTGGTGAACTTACTATGAGAGCCATCGCCGAA



AAGTTTGATCCGGACAGTGAAAAAATGGCGAAATTGGAGCTAAAGCACAAGGATT



GGTTTGAATTCATGCAGACCCGTGGCGATTGA





SEQ ID NO: 130
ATGACTAAAACGTTCGACTCCGAGTTTTTTAATCTCTATTCCTTGCAAAAGACCG
TTAGGTTTGAATTGAAACCAGTTGGTGAAACTGCCTCATTTGTCGAAGACTTTAA
AAACGAGGGATTGAAAAGAGTGGTTAGTGAAGATGAAAGAAGGGCAGTAGACTAT
CAAAAGGTTAAAGAAATCATTGACGATTACCACAGAGATTTTATAGAAGAATCTC



TGAACTATTTTCCAGAGCAGGTTTCAAAAGATGCTCTAGAGCAAGCGTTTCATTT



GTATCAAAAGTTGAAAGCAGCGAAGGTGGAAGAAAGGGAAAAAGCTTTAAAAGAA



TGGGAAGCATTACAGAAAAAATTGCGAGAAAAAGTCGTCAAATGTTTCAGCGACT



CTAATAAAGCTCGCTTTTCTAGAATCGATAAAAAAGAATTGATTAAGGAAGATTT



AATAAATTGGCTGGTAGCACAAAACAGAGAGGATGATATTCCTACTGTTGAAACG



TTCAATAATTTTACTACTTACTTCACTGGTTTCCATGAGAACAGGAAGAATATTT



ACTCTAAAGATGATCACGCTACTGCTATAAGTTTTAGGTTGATTCACGAAAACTT



GCCTAAATTTTTTGACAATGTCATCAGTTTTAACAAGTTGAAAGAAGGTTTCCCG



GAATTAAAATTCGACAAAGTTAAAGAAGATTTAGAAGTAGATTACGACTTGAAGC



ATGCGTTTGAAATTGAATATTTCGTTAATTTCGTCACACAAGCTGGTATCGACCA



ATATAATTACCTGCTTGGAGGCAAAACTCTAGAAGACGGTACGAAGAAACAAGGA



ATGAATGAACAGATTAATTTATTTAAGCAACAACAAACTCGCGATAAAGCTAGAC



AGATTCCAAAACTGATTCCACTTTTCAAACAGATTCTATCTGAGAGAACTGAATC



TCAGAGTTTTATCCCTAAGCAGTTCGAGTCTGATCAGGAACTATTCGATTCCCTG



CAGAAATTGCATAACAACTGTCAAGATAAGTTTACCGTTTTGCAACAGGCGATCT



TGGGATTGGCTGAGGCAGATCTTAAAAAGGTCTTTATTAAAACTAGTGATCTAAA



CGCATTGTCTAACACTATTTTTGGAAATTATTCTGTGTTCTCAGACGCGCTCAAT



TTATATAAAGAGTCGCTAAAAACTAAAAAGGCTCAAGAAGCTTTTGAAAAGTTGC



CTGCACATAGTATTCATGATTTAATCCAATACTTAGAACAATTTAATTCGTCTCT



CGATGCTGAAAAGCAACAGTCTACCGATACTGTATTAAACTACTTTATTAAAACC



GACGAATTATATAGTCGTTTCATTAAATCCACCTCTGAGGCATTCACCCAAGTAC



AACCTCTCTTTGAACTGGAAGCTTTGAGCTCCAAAAGAAGACCCCCAGAAAGTGA



AGATGAGGGGGCTAAAGGCCAAGAAGGTTTCGAACAAATTAAGAGAATCAAAGCT



TATCTAGACACTCTAATGGAGGCTGTCCACTTTGCTAAGCCTTTGTATCTTGTCA



AGGGTAGAAAGATGATAGAGGGTCTAGACAAGGATCAAAGCTTCTACGAAGCGTT



TGAAATGGCCTACCAGGAGTTGGAGTCTTTAATCATCCCCATTTACAATAAGGCC



AGATCTTACCTGTCTAGGAAGCCATTTAAAGCGGATAAATTCAAAATTAATTTTG



ACAATAATACACTTCTATCTGGGTGGGATGCTAACAAGGAGACGGCTAACGCCAG



CATATTGTTTAAGAAGGATGGTTTATACTACCTGGGAATCATGCCAAAAGGCAAA



ACTTTCTTGTTCGATTATTTCGTTAGTTCAGAAGATTCTGAAAAGTTGAAACAAC



GGAGACAGAAAACCGCAGAGGAAGCGCTCGCACAGGATGGAGAATCCTATTTTGA



AAAAATACGGTATAAACTCCTACCAGGTGCTAGTAAGATGTTGCCAAAGGTATTT



TTTAGCAATAAAAATATTGGGTTTTACAATCCCTCAGATGATATTCTACGAATTC



GGAATACGGCCTCTCATACTAAGAATGGTACTCCCCAGAAGGGTCATTCCAAGGT



AGAATTTAACTTGAATGACTGTCACAAAATGATTGATTTTTTTAAATCTTCCATA



CAGAAACATCCCGAGTGGGGATCCTTTGGTTTCACTTTTTCTGATACGTCGGACT



TTGAAGATATGAGTGCTTTCTACCGAGAAGTTGAAAATCAAGGTTACGTTATAAG



TTTTGATAAAATAAAAGAAACTTACATTCAGTCTCAAGTTGAGCAAGGTAACTTA



TATTTATTTCAAATTTACAACAAAGATTTTAGTCCGTATTCAAAGGGAAAGCCAA



ACCTGCACACTTTATACTGGAAAGCTCTGTTTGAAGAGGCTAATTTGAATAACGT



AGTGGCTAAGCTAAACGGCGAAGCAGAAATCTTTTTCAGAAGACACAGTATCAAA



GCATCTGATAAAGTGGTACATCCTGCTAATCAAGCTATAGATAATAAGAATCCCC



ATACTGAGAAGACGCAGTCCACATTTGAATATGACTTGGTCAAAGACAAAAGATA



TACCCAAGACAAATTTTTTTTTCATGTACCGATATCTTTAAACTTTAAGGCTCAG



GGCGTTTCAAAGTTTAATGATAAGGTAAATGGATTCTTAAAGGGCAATCCCGACG



TTAATATAATCGGTATAGATCGAGGTGAGAGACATCTTTTATACTTTACCGTGGT



GAATCAAAAAGGAGAAATATTAGTGCAAGAGTCCTTGAATACATTAATGTCTGAC



AAGGGTCATGTCAACGATTATCAACAGAAATTGGACAAGAAGGAACAGGAAAGGG



ACGCTGCCAGGAAGTCCTGGACGACAGTAGAAAATATTAAAGAATTAAAAGAAGG



TTATTTATCACATGTGGTTCATAAACTTGCACATTTAATCATCAAATATAACGCA



ATAGTGTGCTTGGAAGATCTTAATTTTGGCTTCAAGAGGGGTAGGTTCAAGGTCG



AAAAACAGGTCTACCAGAAGTTCGAGAAAGCTCTGATCGATAAATTGAATTATCT



TGTTTTCAAAGAAAAAGAATTAGGAGAAGTTGGTCATTATCTTACAGCATACCAA



CTCACTGCACCATTTGAAAGCTTCAAAAAGCTAGGCAAGCAATCTGGGATTTTGT



TCTATGTTCCGGCTGATTATACATCAAAGATAGATCCTACCACAGGCTTTGTAAA



TTTTTTAGATCTTAGGTACCAATCCGTTGAAAAAGCTAAACAGTTGCTGTCCGAT



TTTAATGCGATAAGATTTAATAGTGTTCAGAATTATTTTGAGTTCGAAATTGATT



ATAAAAAATTGACACCAAAACGTAAAGTAGGAACACAATCTAAATGGGTTATTTG



TACCTATGGAGATGTTAGATACCAAAACAGAAGAAATCAGAAAGGTCACTGGGAA



ACTGAAGAAGTTAACGTTACTGAAAAACTTAAAGCTCTATTTGCGAGCGATTCAA



AAACGACGACGGTGATCGATTATGCAAATGATGATAACCTTATTGATGTAATTCT



GGAACAAGATAAGGCATCATTTTTTAAAGAACTACTATGGTTGTTAAAGCTAACC



ATGACCCTAAGGCACTCCAAGATAAAGTCAGAGGATGATTTTATCCTCTCTCCAG



TGAAAAACGAACAAGGTGAGTTTTACGACTCAAGAAAGGCGGGTGAAGTCTGGCC



TAAGGATGCTGATGCCAATGGAGCTTATCACATCGCTCTGAAGGGGCTATGGAAC



TTACAGCAAATTAACCAATGGGAAAAAGGTAAAACTTTAAACCTCGCCATAAAGA



ACCAGGATTGGTTCAGCTTTATCCAAGAAAAACCATATCAAGAATAA





SEQ ID NO: 131
ATGCACACAGGAGGTCTACTCTCGATGGATGCTAAGGAATTTACCGGTCAATATC
CGCTGTCCAAAACTTTGCGTTTTGAGCTTAGACCTATTGGCCGAACGTGGGATAA
CCTAGAGGCTTCTGGTTATTTGGCGGAAGATAGACATAGAGCTGAGTGTTATCCC
CGAGCTAAAGAATTGCTGGATGATAACCACAGGGCGTTCCTGAATAGAGTTCTAC



CGCAAATCGATATGGATTGGCATCCAATTGCTGAAGCTTTCTGCAAGGTGCACAA



AAATCCAGGTAATAAAGAATTGGCTCAGGATTATAATTTGCAGCTTAGTAAGAGA



AGAAAAGAAATTTCCGCTTATTTGCAGGATGCTGATGGATACAAGGGGTTGTTCG



CGAAACCTGCCCTGGACGAAGCTATGAAAATAGCTAAGGAAAACGGCAATGAATC



TGATATTGAAGTTTTGGAAGCCTTCAATGGATTTTCCGTTTATTTCACTGGTTAT



CATGAGAGTAGGGAGAATATATACTCAGACGAAGATATGGTATCCGTCGCCTATC



GCATAACTGAAGATAATTTTCCAAGGTTCGTGTCGAACGCGTTAATTTTTGATAA



ACTAAATGAATCGCACCCGGATATTATTTCGGAAGTGTCCGGTAATCTGGGGGTA



GACGATATTGGTAAATATTTTGATGTGTCCAACTACAATAATTTCCTTAGTCAAG



CAGGAATTGATGACTACAACCATATTATAGGAGGGCATACAACTGAAGACGGTCT



CATTCAAGCTTTTAACGTAGTGTTAAACCTAAGGCACCAAAAAGACCCAGGTTTT



GAGAAAATTCAATTTAAGCAACTCTACAAGCAGATACTGAGCGTTAGGACTAGTA



AGTCATATATCCCAAAGCAATTCGATAACTCAAAGGAAATGGTCGACTGTATATG



CGACTACGTCTCAAAAATAGAAAAATCTGAAACAGTAGAAAGAGCTCTGAAATTG



GTAAGAAATATATCTTCTTTTGATTTAAGAGGTATTTTCGTAAATAAAAAAAACC



TTCGAATTTTGTCTAATAAGTTAATTGGAGACTGGGACGCAATAGAGACAGCTTT



GATGCACAGTTCCAGCAGTGAAAACGATAAGAAATCAGTGTATGACTCTGCAGAG



GCATTCACCCTTGATGATATCTTCAGTTCTGTGAAAAAGTTCAGCGACGCCTCCG



CTGAGGATATAGGAAACCGCGCTGAAGACATATGTCGTGTTATCTCAGAAACAGC



TCCTTTCATTAACGACTTAAGGGCTGTAGATTTGGATTCTTTAAATGATGACGGC



TATGAAGCGGCCGTGTCTAAAATACGGGAATCTCTTGAACCCTACATGGATCTAT



TTCACGAATTGGAGATCTTTAGCGTGGGTGATGAGTTTCCTAAATGTGCTGCCTT



TTATAGCGAGTTGGAAGAGGTCTCAGAACAACTGATTGAAATCATTCCTTTATTT



AACAAAGCAAGAAGTTTTTGCACAAGGAAAAGGTATTCAACCGACAAAATCAAAG



TCAATTTAAAATTCCCTACTCTGGCAGATGGATGGGATCTAAATAAAGAAAGGGA



TAACAAAGCCGCAATTCTAAGAAAAGACGGTAAATACTACCTGGCAATTTTAGAC



ATGAAGAAAGATCTCAGTAGTATTCGTACGAGCGATGAGGACGAGTCTTCTTTTG



AAAAGATGGAATATAAATTGCTCCCTTCTCCTGTGAAAATGCTTCCAAAAATTTT



TGTTAAATCGAAAGCCGCCAAAGAAAAGTACGGGTTGACCGATAGAATGTTAGAA



TGCTACGATAAAGGTATGCATAAGTCGGGTAGTGCTTTTGATTTGGGTTTTTGTC



ATGAATTGATCGATTACTATAAGCGCTGCATTGCCGAGTACCCAGGCTGGGATGT



TTTCGACTTTAAATTTCGTGAGACAAGCGATTACGGATCCATGAAAGAATTTAAT



GAAGACGTCGCTGGCGCAGGTTACTATATGTCACTTAGAAAGATTCCATGTTCCG



AAGTTTATCGTTTACTGGACGAGAAGTCAATTTACTTGTTTCAAATATATAATAA



GGATTATAGCGAAAACGCACATGGGAATAAGAATATGCATACGATGTATTGGGAG



GGCTTGTTCTCACCACAAAATTTGGAATCACCAGTCTTCAAATTGTCCGGAGGCG



CAGAACTTTTTTTCAGAAAGTCATCTATTCCTAATGACGCTAAAACGGTACATCC



GAAAGGTTCAGTTCTTGTTCCCAGAAACGACGTCAATGGTAGAAGAATACCAGAC



TCGATCTACAGAGAGTTGACAAGGTATTTTAACCGTGGGGATTGCAGGATCAGTG



ATGAAGCTAAGTCTTACCTGGACAAGGTCAAGACAAAAAAAGCGGACCATGACAT



TGTTAAGGATAGAAGATTTACTGTAGATAAGATGATGTTCCATGTTCCGATTGCC



ATGAATTTTAAAGCTATAAGTAAACCAAATCTTAATAAGAAAGTTATTGATGGCA



TAATAGATGATCAAGATTTGAAAATCATCGGTATCGATCGTGGTGAGAGAAATCT



TATTTATGTGACCATGGTCGATAGGAAGGGGAATATATTGTATCAAGACAGTCTT



AATATTTTAAATGGATACGATTACCGCAAAGCTTTAGACGTGAGGGAATATGATA



ACAAAGAAGCTAGAAGGAATTGGACTAAAGTAGAAGGTATTAGAAAAATGAAAGA



AGGTTATTTATCTTTAGCTGTTAGTAAATTGGCCGATATGATCATCGAAAATAAT



GCTATAATCGTAATGGAAGATTTGAATCACGGGTTTAAGGCAGGTCGTTCCAAAA



TTGAAAAGCAGGTGTATCAAAAATTCGAATCAATGTTAATCAACAAGTTAGGATA



CATGGTGCTAAAAGACAAGTCCATTGACCAGTCTGGTGGAGCCCTTCATGGTTAC



CAATTAGCCAATCATGTTACGACCTTAGCTAGCGTGGGTAAACAATGTGGAGTAA



TTTTTTACATACCTGCAGCTTTTACTTCGAAGATTGATCCCACCACGGGCTTTGC



TGATTTATTCGCTCTCTCTAATGTGAAGAATGTCGCTTCTATGAGAGAGTTCTTC



TCCAAAATGAAGTCAGTAATATATGACAAGGCGGAAGGCAAATTCGCCTTTACAT



TTGATTATTTGGATTATAACGTTAAAAGCGAATGTGGACGTACCTTATGGACTGT



GTATACAGTTGGTGAACGCTTCACCTACTCTAGAGTAAACCGAGAGTATGTTCGG



AAAGTCCCAACAGATATCATCTATGATGCATTACAAAAAGCTGGTATTAGCGTCG



AAGGTGACCTTAGAGATAGAATCGCGGAAAGCGACGGTGACACATTAAAGTCTAT



ATTCTACGCTTTTAAATACGCGTTGGATATGAGAGTCGAAAACAGAGAGGAAGAC



TATATACAGTCACCTGTGAAGAATGCTTCTGGTGAGTTCTTTTGTTCAAAAAACG



CCGGAAAGTCTTTGCCGCAGGATTCAGATGCAAATGGTGCCTATAATATAGCTCT



GAAAGGGATCCTACAACTCAGAATGTTGAGCGAACAATACGATCCAAATGCAGAA



TCGATTAGATTGCCACTTATAACTAACAAGGCATGGTTAACTTTTATGCAATCCG



GTATGAAAACTTGGAAGAATTAA





SEQ ID NO: 132
ATGGATTCTCTTAAGGATTTCACTAATTTATATCCAGTCTCGAAAACATTGCGGT
TCGAATTGAAACCAGTTGGGAAAACTCTAGAAAACATTGAAAAAGCCGGTATATT
GAAAGAAGATGAACACAGAGCGGAATCCTACCGCCGGGTAAAAAAGATAATTGAC
ACATACCATAAAGTGTTTATTGACAGCTCCTTAGAGAACATGGCTAAAATGGGGA



TAGAAAATGAAATCAAGGCTATGCTGCAGTCTTTTTGTGAACTCTATAAGAAAGA



CCACAGGACAGAAGGAGAAGATAAAGCTCTTGATAAAATTAGAGCTGTTCTTAGA



GGTTTAATCGTTGGGGCTTTCACTGGTGTATGTGGAAGACGAGAAAACACAGTAC



AAAATGAAAAGTACGAGAGTTTGTTCAAAGAAAAATTGATAAAGGAAATTTTGCC



AGATTTCGTGTTGTCCACCGAGGCTGAGTCTCTTCCATTCAGCGTTGAAGAAGCA



ACAAGGAGCTTAAAAGAGTTTGACTCATTCACTTCTTATTTTGCTGGTTTTTACG



AAAATAGAAAGAATATTTATTCCACGAAACCGCAAAGTACTGCGATAGCCTACAG



ATTAATTCATGAAAACTTGCCTAAATTTATAGATAATATTTTGGTCTTCCAGAAG



ATTAAAGAACCAATCGCTAAAGAACTTGAGCACATAAGAGCAGATTTTAGCGCAG



GCGGATATATCAAAAAAGATGAACGGCTAGAAGACATATTCTCATTAAATTACTA



CATTCATGTCCTTTCTCAAGCTGGTATAGAAAAATATAATGCTTTAATCGGGAAG



ATAGTGACGGAAGGTGATGGTGAAATGAAAGGTCTTAATGAACATATTAACTTAT



ATAACCAACAGAGGGGTCGAGAGGATAGGTTGCCCTTGTTTAGGCCTCTATACAA



GCAAATCCTGTCCGATAGAGAGCAATTGTCTTATTTACCTGAATCATTTGAAAAA



GATGAAGAGCTGCTTAGAGCACTTAAGGAATTTTACGATCACATCGCCGAAGACA



TCTTGGGTAGAACACAGCAATTGATGACTTCAATTTCTGAATACGACTTGTCCCG



TATTTATGTCAGAAATGATTCTCAACTTACAGACATCTCGAAGAAAATGCTAGGA



GATTGGAACGCCATTTATATGGCTAGAGAACGAGCCTACGACCACGAACAGGCTC



CTAAACGTATTACTGCTAAATACGAACGTGATAGAATCAAGGCCTTAAAAGGTGA



AGAGTCAATTTCATTGGCGAATCTGAACAGCTGTATAGCTTTCTTGGACAATGTA



AGGGATTGTCGAGTTGACACATACCTATCAACTTTGGGGCAGAAAGAGGGTCCTC



ATGGCTTAAGTAACTTGGTGGAAAACGTCTTCGCCTCATATCATGAAGCAGAACA



GTTATTGTCGTTTCCTTACCCCGAAGAGAACAACCTTATTCAGGACAAAGACAAT



GTAGTTTTGATCAAAAACCTATTGGATAATATAAGTGATTTACAACGTTTCCTTA



AACCTTTGTGGGGAATGGGCGATGAACCTGACAAAGACGAAAGGTTTTACGGTGA



ATACAACTATATTAGAGGAGCGCTTGACCAGGTAATACCTTTGTACAATAAAGTA



AGGAACTACTTGACTCGTAAACCATATTCTACTAGAAAAGTTAAATTGAACTTTG



GTAATTCACAGCTGCTGAGTGGTTGGGATCGTAATAAAGAAAAAGATAACTCCTG



TGTTATCTTGCGAAAAGGACAAAACTTTTACTTGGCAATTATGAACAACCGTCAC



AAAAGGTCCTTCGAGAACAAAGTTCTGCCTGAATACAAAGAAGGTGAACCATATT



TTGAAAAAATGGACTATAAATTCCTGCCAGATCCTAATAAAATGTTGCCTAAGGT



CTTCTTGTCTAAAAAAGGTATAGAAATATATAAACCATCCCCGAAGTTGCTGGAG



CAATATGGTCATGGAACGCACAAAAAAGGTGACACTTTTAGTATGGATGACTTGC



ACGAGTTGATTGATTTTTTTAAACATTCCATTGAAGCGCACGAAGATTGGAAACA



ATTTGGTTTCAAGTTCTCTGACACAGCCACTTACGAAAATGTATCGTCCTTTTAT



AGAGAAGTGGAAGATCAGGGTTATAAACTGTCATTCCGTAAGGTTAGTGAAAGCT



ATGTGTACTCGTTGATCGATCAAGGGAAGCTTTATCTTTTTCAAATCTATAATAA



AGATTTCTCTCCTTGTTCAAAGGGCACACCTAATCTTCATACACTATACTGGAGA



ATGCTTTTCGATGAAAGAAATTTGGCTGATGTGATCTATAAATTAGACGGTAAAG



CTGAGATTTTTTTCAGAGAGAAATCCCTGAAAAACGACCATCCAACTCATCCGGC



AGGTAAACCGATTAAAAAGAAATCCCGGCAAAAAAAGGGCGAAGAGAGTTTATTC



GAGTATGATTTAGTTAAGGACAGACATTATACAATGGACAAATTTCAATTTCATG



TGCCCATTACTATGAACTTTAAGTGTAGTGCAGGGTCTAAGGTTAATGATATGGT



AAACGCACATATTAGAGAAGCTAAAGATATGCACGTCATCGGTATTGATCGCGGA



GAAAGAAATTTACTTTACATTTGCGTTATCGATTCTAGGGGCACCATCTTGGATC



AAATCTCTTTGAACACTATAAATGATATTGACTATCATGATCTACTAGAGAGTCG



GGATAAAGACAGGCAACAAGAAAGAAGAAATTGGCAAACAATTGAAGGTATTAAA



GAATTAAAGCAAGGCTATCTAAGCCAGGCTGTACACAGAATTGCCGAATTAATGG



TAGCATATAAAGCTGTCGTAGCTCTAGAAGACTTGAACATGGGTTTCAAAAGAGG



GCGCCAGAAGGTCGAAAGTAGTGTTTATCAACAATTTGAAAAACAGTTAATAGAT



AAGTTGAATTATCTAGTGGATAAAAAAAAGCGTCCTGAGGACATTGGCGGTTTAT



TAAGAGCCTACCAATTCACTGCGCCATTTAAATCGTTCAAAGAAATGGGTAAACA



AAACGGTTTTCTATTCTACATCCCCGCATGGAATACCTCAAATATAGATCCAACT



ACCGGTTTCGTCAACTTATTTCATGCTCAATATGAGAATGTGGACAAAGCAAAAT



CATTCTTTCAAAAATTTGATAGCATTAGCTACAATCCTAAAAAAGATTGGTTTGA



ATTTGCGTTCGATTATAAAAATTTCACCAAGAAGGCTGAAGGTTCCAGATCTATG



TGGATATTGTGCACCCACGGAAGTAGAATTAAGAACTTCCGTAATTCACAGAAAA



ACGGCCAGTGGGACAGCGAAGAATTCGCCCTAACCGAAGCTTTCAAAAGTCTTTT



CGTAAGATACGAGATAGACTATACAGCTGATCTAAAGACAGCTATTGTGGATGAG



AAGCAAAAAGACTTCTTTGTCGACCTTCTTAAGTTGTTCAAGTTAACTGTGCAGA



TGAGAAATAGTTGGAAGGAAAAAGACCTAGATTACTTGATTAGCCCAGTCGCTGG



TGCAGATGGCAGATTTTTTGATACACGTGAAGGCAATAAATCACTACCAAAAGAC



GCGGACGCTAATGGCGCATACAACATCGCATTGAAGGGTTTGTGGGCTCTCAGGC



AGATTAGGCAGACAAGTGAGGGTGGTAAGCTTAAGCTGGCGATTTCTAATAAGGA



ATGGTTACAGTTTGTTCAAGAAAGATCCTACGAAAAAGATTAA





SEQ ID NO: 133
ATGAACAATGGTACTAATAATTTTCAAAACTTCATAGGGATTTCTAGCCTTCAAA
AGACATTGAGAAATGCTTTAATTCCAACAGAAACGACTCAACAATTCATAGTGAA
AAATGGTATTATAAAAGAAGACGAGTTGCGTGGCGAGAATAGACAAATTTTGAAA
GATATCATGGATGACTACTACAGAGGGTTCATCTCCGAAACATTGTCTTCTATTG



ACGACATTGACTGGACCAGCTTATTCGAAAAAATGGAAATACAGCTGAAGAACGG



AGATAACAAGGACACTCTTATAAAGGAGCAAACGGAATATAGAAAGGCTATACAC



AAAAAGTTTGCTAATGACGATAGATTTAAAAACATGTTTAGTGCGAAGTTAATTT



CTGATATTCTACCCGAGTTTGTCATTCATAATAATAACTACTCTGCATCTGAAAA



AGAGGAGAAGACCCAGGTTATAAAGTTGTTTTCAAGATTTGCCACATCATTTAAA



GACTACTTCAAGAACAGGGCGAATTGCTTCTCTGCTGATGATATTAGCTCTTCCA



GCTGTCATAGAATTGTTAACGATAATGCCGAAATTTTTTTTAGTAATGCCTTGGT



ATATAGACGCATAGTCAAGTCACTAAGCAATGATGATATAAACAAGATTAGTGGT



GATATGAAAGATAGCCTTAAAGAAATGAGCCTTGAAGAGATATATTCATATGAGA



AGTACGGTGAATTTATAACTCAAGAAGGAATTTCTTTTTATAACGATATTTGTGG



TAAGGTTAATTCTTTTATGAATTTGTATTGCCAGAAGAACAAGGAAAATAAGAAT



CTATATAAACTACAAAAGTTGCATAAACAGATTTTGTGTATAGCTGATACATCCT



ACGAAGTTCCGTATAAATTTGAATCTGATGAGGAAGTTTATCAATCGGTAAACGG



TTTTCTTGACAACATTTCCAGCAAACATATCGTTGAGAGACTACGTAAAATTGGA



GACAACTATAATGGTTACAATCTAGATAAAATATACATAGTGTCCAAGTTTTATG



AGTCTGTCTCTCAAAAGACATATCGTGATTGGGAGACCATTAATACTGCACTTGA



AATTCATTATAACAACATATTGCCTGGTAACGGGAAGAGTAAAGCTGATAAGGTT



AAAAAGGCCGTCAAAAACGACTTGCAAAAGTCTATTACCGAGATAAATGAATTAG



TGTCAAACTACAAACTATGCTCAGATGATAATATTAAAGCGGAAACATACATCCA



CGAAATTTCCCACATACTGAATAACTTTGAAGCTCAGGAGCTTAAATATAACCCG



GAAATACACTTGGTTGAGAGCGAGTTAAAAGCATCTGAGTTGAAAAATGTATTAG



ACGTCATCATGAATGCGTTTCATTGGTGTTCAGTTTTCATGACTGAAGAATTAGT



CGACAAAGATAACAATTTTTATGCCGAATTAGAGGAAATATATGATGAAATTTAT



CCCGTAATTAGTTTATACAATCTAGTTAGAAATTATGTTACACAAAAGCCGTATA



GTACCAAGAAAATAAAGCTTAATTTCGGAATACCTACGCTTGCTGATGGTTGGTC



AAAAAGTAAAGAATATAGCAATAATGCAATAATTTTAATGAGAGATAACCTATAT



TATTTGGGTATTTTTAACGCTAAGAACAAACCAGACAAGAAAATAATTGAAGGTA



ATACATCTGAAAACAAGGGCGACTATAAAAAGATGATATACAATTTGCTCCCAGG



TCCTAATAAAATGATTCCTAAGGTTTTCCTGAGTAGCAAGACTGGCGTTGAAACT



TACAAGCCTAGTGCGTATATCCTGGAGGGTTATAAACAGAACAAGCATATCAAAT



CCTCTAAGGACTTCGATATCACCTTTTGCCATGACTTAATCGATTATTTTAAAAA



TTGTATCGCAATTCATCCAGAATGGAAAAATTTCGGATTTGATTTTAGTGATACC



AGCACTTACGAGGATATCTCTGGGTTCTACAGAGAAGTGGAGTTGCAGGGCTACA



AAATCGATTGGACTTACATATCTGAAAAGGACATAGATTTGCTGCAGGAGAAAGG



TCAGCTATATTTGTTTCAAATCTACAACAAAGACTTTTCTAAAAAGTCTACCGGT



AATGACAATCTGCACACAATGTACTTGAAGAACTTATTCTCCGAGGAGAACTTAA



AGGACATTGTACTCAAGTTGAATGGAGAAGCCGAGATTTTTTTTAGAAAGAGCAG



TATAAAGAATCCTATAATCCACAAGAAGGGCTCAATTCTCGTGAATAGGACGTAT



GAGGCAGAAGAAAAGGACCAATTTGGGAATATACAAATTGTAAGAAAAAACATCC



CAGAAAATATCTACCAGGAATTATATAAGTATTTTAATGACAAATCTGATAAGGA



ACTGTCTGACGAAGCCGCTAAGCTCAAGAATGTTGTGGGCCACCATGAAGCTGCT



ACTAATATAGTGAAGGACTACAGATATACCTACGATAAATATTTCCTGCATATGC



CAATTACTATAAACTTCAAAGCAAATAAAACAGGTTTTATAAATGATAGAATCCT



GCAGTATATTGCTAAAGAAAAGGATTTACATGTAATTGGGATTGATAGAGGTGAA



CGCAATCTGATCTATGTCAGCGTAATAGATACTTGTGGTAATATTGTGGAACAAA



AGTCCTTTAATATTGTGAACGGATATGATTACCAAATCAAGTTGAAACAACAAGA



GGGAGCACGCCAAATTGCCCGTAAGGAATGGAAAGAGATAGGTAAGATCAAGGAA



ATTAAGGAAGGTTATCTTTCATTAGTTATTCACGAAATTTCGAAGATGGTAATCA



AATACAACGCAATAATTGCTATGGAGGACCTGTCATATGGATTTAAGAAAGGTAG



ATTCAAGGTTGAGAGACAGGTATACCAGAAATTTGAAACTATGTTGATCAACAAA



TTAAATTACTTAGTCTTTAAGGACATATCAATAACGGAAAACGGCGGGCTTTTAA



AAGGGTATCAACTTACATACATACCTGATAAGTTGAAAAATGTGGGTCATCAGTG



TGGGTGCATCTTTTATGTTCCAGCCGCTTACACATCAAAAATCGATCCTACTACT



GGGTTCGTAAACATATTTAAATTTAAAGATCTAACCGTTGATGCAAAAAGAGAGT



TTATCAAGAAATTTGATAGCATTAGGTACGATTCAGAAAAAAATCTATTCTGTTT



TACTTTTGACTACAACAACTTTATAACGCAGAATACAGTGATGTCAAAATCGTCC



TGGTCAGTGTATACTTATGGTGTTAGAATTAAGAGACGTTTCGTAAACGGTCGTT



TTTCTAACGAGTCCGATACAATCGACATCACTAAAGATATGGAAAAAACTTTGGA



AATGACAGATATAAACTGGAGAGATGGTCACGACCTTAGACAAGATATAATCGAT



TATGAAATCGTACAGCATATTTTTGAAATTTTTCGCTTAACAGTTCAGATGCGTA



ACTCTCTTAGTGAGCTAGAAGATAGAGATTATGATAGACTTATCTCGCCTGTTCT



TAACGAAAATAATATCTTCTATGACTCGGCAAAAGCCGGTGATGCACTTCCAAAA



GATGCTGATGCAAATGGCGCGTACTGCATCGCATTGAAGGGGCTCTACGAGATTA



AACAAATCACCGAAAACTGGAAAGAAGATGGTAAATTTTCTAGGGATAAGTTGAA



AATCAGTAATAAAGATTGGTTCGATTTTATACAAAATAAGCGATACTTATAG





SEQ ID NO: 134
ATGACCAATAAGTTTACTAATCAATACTCATTGTCTAAAACGTTAAGATTCGAGT



TAATTCCCCAGGGAAAGACACTAGAATTTATTCAAGAAAAAGGTCTTCTCTCTCA



GGATAAACAAAGAGCAGAATCATACCAGGAGATGAAAAAAACCATAGATAAATTT



CATAAGTACTTCATCGACTTGGCACTATCGAACGCCAAGCTAACACATTTGGAAA



CCTACCTGGAGTTGTATAATAAATCGGCAGAGACGAAAAAGGAACAAAAATTCAA



GGATGACCTGAAGAAGGTTCAAGATAATCTGCGAAAGGAAATAGTGAAGTCGTTT



AGTGATGGTGATGCAAAGTCAATCTTTGCTATTTTAGACAAGAAGGAATTAATAA



CCGTGGAACTTGAAAAGTGGTTTGAAAATAACGAACAGAAAGATATTTACTTCGA



CGAAAAATTTAAAACGTTTACTACGTACTTTACAGGGTTCCATCAGAACCGCAAA



AACATGTACTCCGTTGAACCAAACTCTACTGCAATCGCCTACAGATTAATACACG



AAAATTTGCCTAAGTTTTTAGAAAATGCAAAGGCTTTTGAAAAGATAAAGCAAGT



CGAATCGTTACAGGTAAACTTTCGCGAATTAATGGGCGAATTTGGAGATGAAGGT



CTTATTTTTGTCAATGAATTAGAGGAAATGTTTCAAATTAATTATTATAACGATG



TCTTGAGTCAGAACGGCATTACTATCTACAACTCAATTATCAGTGGTTTCACTAA



GAATGATATAAAATATAAAGGTTTGAATGAATACATTAATAATTATAATCAAACT



AAAGATAAGAAGGACAGGCTTCCGAAATTGAAGCAATTGTACAAGCAGATTCTAA



GTGATAGGATTAGTTTGTCTTTCTTGCCAGACGCATTTACTGATGGCAAGCAAGT



CTTAAAGGCTATATTCGATTTCTACAAGATTAACCTACTTTCGTACACAATTGAA



GGTCAAGAAGAATCTCAAAATCTGCTGCTTTTGATTAGGCAAACTATAGAAAATT



TGTCGTCCTTTGACACTCAAAAAATTTACCTGAAGAATGATACACACCTGACTAC



AATATCACAGCAGGTCTTTGGGGATTTTTCTGTCTTCTCCACGGCCCTAAACTAT



TGGTATGAGACAAAAGTTAATCCAAAATTTGAAACAGAATATAGTAAGGCGAATG



AAAAAAAGAGAGAAATTTTGGATAAAGCGAAGGCAGTATTCACAAAACAAGACTA



TTTTTCTATCGCATTTCTCCAAGAAGTCTTATCCGAATATATTTTGACACTCGAT



CACACCTCTGATATAGTTAAGAAACATTCGTCCAACTGCATCGCAGATTACTTCA



AGAATCACTTCGTGGCTAAGAAAGAAAACGAAACGGATAAAACTTTTGACTTCAT



TGCTAACATAACCGCTAAATACCAATGTATTCAGGGCATATTAGAAAATGCAGAC



CAGTACGAAGACGAGTTAAAACAGGACCAAAAGTTAATAGATAATCTAAAGTTTT



TCTTAGATGCTATACTTGAGTTATTACATTTTATAAAGCCATTGCATCTAAAATC



GGAAAGTATTACTGAAAAAGACACTGCGTTCTATGATGTGTTCGAAAATTATTAT



GAGGCTTTATCTTTATTGACCCCCCTTTACAACATGGTCCGCAATTATGTTACTC



AGAAGCCTTACTCTACTGAAAAGATCAAATTAAACTTTGAAAATGCTCAGTTGCT



GAATGGTTGGGATGCCAATAAGGAAGGTGACTACCTGACGACTATTCTAAAAAAA



GACGGTAATTATTTCTTAGCAATCATGGATAAAAAACATAACAAGGCATTTCAAA



AATTTCCAGAAGGAAAAGAAAACTATGAAAAGATGGTTTATAAATTGTTGCCTGG



AGTTAATAAAATGTTGCCAAAAGTTTTTTTTAGCAATAAGAACATAGCTTACTTT



AATCCATCTAAGGAACTGCTCGAGAACTACAAGAAGGAAACACATAAAAAAGGTG



ATACATTTAATTTGGAACATTGCCATACTCTGATTGATTTTTTTAAGGACTCTCT



TAATAAACATGAAGACTGGAAATATTTTGATTTTCAATTTTCGGAAACTAAATCA



TACCAAGATCTAAGTGGATTTTACAGAGAAGTTGAACACCAAGGTTATAAGATTA



ACTTCAAGAATATAGATTCTGAATACATTGATGGTCTTGTAAACGAGGGTAAACT



ATTCCTGTTCCAAATCTACTCTAAGGACTTCTCACCTTTTTCCAAAGGAAAACCT



AATATGCATACGTTGTACTGGAAGGCTCTATTTGAAGAACAAAATTTGCAAAATG



TAATCTACAAACTGAACGGCCAAGCTGAAATATTCTTCAGAAAAGCCTCAATTAA



GCCAAAAAACATTATTCTTCATAAAAAGAAGATCAAGATTGCGAAGAAACATTTT



ATTGATAAGAAGACCAAGACTTCCGAAATTGTACCAGTACAAACAATCAAGAATC



TCAATATGTATTATCAAGGCAAGATAAGTGAGAAAGAGTTAACCCAGGATGATTT



ACGTTATATAGACAATTTCTCTATATTCAACGAGAAGAACAAAACAATAGACATT



ATCAAAGATAAAAGGTTTACTGTTGACAAATTTCAATTTCATGTGCCTATCACAA



TGAACTTTAAGGCCACAGGTGGTTCGTACATTAATCAAACTGTTTTAGAATATCT



GCAAAATAACCCAGAGGTCAAGATCATCGGTCTTGATAGGGGTGAGAGACATCTG



GTGTATCTAACACTCATTGATCAACAAGGCAACATCTTGAAGCAAGAATCATTGA



ACACTATCACAGACTCCAAGATCTCGACTCCATATCACAAACTCCTTGACAATAA



AGAAAACGAAAGGGATCTTGCCAGAAAAAATTGGGGTACAGTTGAAAATATTAAG



GAACTAAAAGAAGGTTACATTTCGCAAGTAGTTCACAAGATTGCAACACTCATGT



TGGAAGAAAACGCAATCGTTGTCATGGAAGATTTAAATTTCGGATTTAAGAGAGG



AAGATTTAAAGTAGAAAAGCAAATCTACCAGAAGTTGGAGAAGATGTTAATTGAC



AAATTGAACTACTTAGTGCTGAAAGACAAACAGCCTCAAGAATTGGGCGGTCTAT



ACAACGCTTTACAACTGACAAATAAATTTGAGTCATTCCAAAAGATGGGTAAGCA



GAGTGGTTTTTTGTTTTATGTTCCGGCATGGAACACATCCAAAATCGATCCAACT



ACAGGCTTCGTGAATTATTTCTACACTAAATATGAAAATGTGGATAAAGCAAAAG



CTTTCTTTGAGAAGTTCGAGGCGATCCGTTTTAACGCTGAAAAGAAGTACTTCGA



GTTCGAGGTCAAAAAGTATTCAGATTTTAACCCCAAGGCTGAAGGCACCCAGCAA



GCATGGACTATTTGCACGTACGGTGAGCGAATCGAAACTAAAAGGCAAAAGGATC



AAAATAATAAGTTTGTAAGCACACCCATTAACTTGACAGAAAAGATAGAAGATTT



TCTTGGAAAAAACCAAATTGTATATGGTGACGGTAACTGTATCAAGTCACAAATT



GCTTCTAAAGACGATAAGGCCTTCTTCGAAACTCTGCTATACTGGTTTAAAATGA



CGTTGCAAATGAGAAACAGTGAAACTAGAACTGATATCGACTATTTAATATCACC



CGTGATGAACGATAATGGTACCTTTTACAATTCAAGAGATTACGAGAAATTGGAG



AACCCCACACTACCAAAAGACGCAGACGCTAATGGTGCCTACCATATTGCTAAAA



AGGGACTGATGTTGTTGAACAAGATAGATCAAGCCGACTTAACTAAAAAAGTTGA



TTTGTCAATTTCGAATAGAGATTGGTTGCAATTCGTCCAGAAAAATAAGTAA





SEQ ID NO: 135
ATGGAACAGGAATACTACTTGGGTTTGGATATGGGAACTGGTTCAGTCGGTTGGG



CTGTTACGGACTCCGAGTACCACGTGTTGAGAAAACACGGAAAGGCTTTATGGGG



TGTCAGACTATTCGAATCAGCATCGACCGCGGAAGAGAGAAGAATGTTTAGAACT



TCAAGAAGAAGGCTGGATCGTAGGAATTGGCGGATAGAAATTTTACAAGAAATAT



TCGCCGAAGAAATCTCTAAAAAAGATCCAGGATTTTTTCTACGTATGAAGGAATC



CAAATACTATCCGGAAGATAAACGTGATATTAATGGCAATTGTCCAGAGTTACCC



TATGCTTTATTTGTGGACGACGATTTCACCGATAAAGATTACCATAAGAAGTTCC



CAACAATTTACCATCTGAGAAAGATGTTAATGAACACTGAAGAAACCCCGGATAT



AAGACTGGTCTATCTAGCCATTCATCATATGATGAAACACAGGGGACACTTCTTG



CTATCAGGGGATATAAATGAAATTAAAGAATTTGGTACAACATTTTCTAAATTAT



TGGAAAATATTAAAAACGAAGAATTAGATTGGAATTTAGAATTAGGCAAGGAGGA



ATACGCAGTTGTCGAATCGATTCTGAAAGATAACATGTTGAACAGATCAACGAAA



AAAACAAGGCTGATCAAGGCTTTAAAAGCGAAATCAATATGCGAAAAAGCAGTAT



TGAATTTGTTAGCTGGGGGGACTGTCAAGTTGTCTGATATTTTCGGATTGGAAGA



ATTGAATGAAACAGAGAGACCGAAGATATCCTTCGCCGATAATGGCTACGATGAT



TATATAGGCGAAGTCGAAAATGAGCTGGGCGAACAATTCTACATTATCGAGACTG



CCAAGGCTGTTTATGATTGGGCGGTGTTAGTCGAAATCCTTGGCAAATACACTTC



CATCTCCGAAGCTAAGGTGGCAACCTACGAAAAGCATAAAAGTGATTTGCAATTC



CTTAAGAAAATTGTCCGAAAGTACTTGACCAAAGAAGAGTACAAGGATATTTTCG



TATCAACATCGGACAAACTGAAGAATTATTCAGCTTATATTGGCATGACGAAAAT



TAATGGTAAGAAAGTTGATTTGCAATCCAAGAGATGTTCTAAAGAAGAATTTTAC



GATTTCATTAAAAAAAATGTCCTAAAAAAGTTGGAGGGACAACCTGAATATGAGT



ATTTAAAGGAAGAACTGGAAAGAGAAACTTTCCTACCAAAGCAAGTTAATCGTGA



TAATGGCGTTATTCCATACCAAATACACTTGTACGAATTAAAGAAGATCTTGGGT



AACTTGAGGGACAAAATTGATTTAATCAAGGAAAATGAAGACAAACTGGTACAAT



TATTTGAATTTAGAATACCTTACTACGTGGGCCCTTTAAACAAAATAGACGATGG



TAAGGAAGGGAAGTTCACATGGGCAGTCAGAAAGTCCAATGAAAAAATTTACCCA



TGGAATTTCGAAAACGTTGTAGATATTGAAGCTTCTGCTGAGAAATTTATTAGGA



GAATGACAAATAAATGCACTTATCTTATGGGGGAAGACGTGTTGCCTAAAGATAG



TTTATTATATTCAAAGTATATGGTCTTAAATGAATTAAACAATGTTAAATTAGAT



GGTGAAAAACTTTCCGTCGAATTGAAACAAAGATTGTATACAGATGTATTCTGCA



AATATAGAAAAGTAACTGTAAAGAAGATTAAAAACTACCTTAAATGTGAAGGCAT



TATCAGCGGAAATGTTGAGATCACTGGTATCGATGGTGATTTTAAGGCATCTTTA



ACCGCATATCACGACTTTAAGGAAATATTGACGGGTACTGAGCTTGCTAAAAAAG



ACAAAGAGAACATTATCACCAATATCGTGCTCTTCGGAGACGACAAGAAATTATT



GAAAAAGAGATTGAACCGCCTATACCCTCAGATTACCCCTAACCAATTGAAGAAA



ATCTGCGCTCTGTCTTATACTGGATGGGGTCGTTTTAGCAAGAAGTTTCTAGAAG



AAATTACTGCTCCGGATCCTGAAACTGGGGAAGTCTGGAATATAATTACCGCGCT



ATGGGAATCGAATAATAATTTAATGCAATTACTATCTAATGAATACAGATTTATG



GAAGAAGTCGAAACTTACAATATGGGAAAACAAACAAAAACTTTGAGCTACGAAA



CAGTAGAGAATATGTATGTCTCACCATCTGTAAAGCGGCAGATCTGGCAAACCTT



GAAGATAGTTAAAGAATTAGAAAAAGTGATGAAGGAAAGTCCAAAAAGGGTTTTT



ATTGAAATGGCCCGAGAAAAACAAGAATCTAAAAGGACGGAAAGTAGGAAAAAGC



AACTTATAGATCTATATAAAGCCTGCAAAAATGAAGAAAAAGATTGGGTAAAGGA



ATTAGGTGACCAGGAAGAGCAAAAATTGAGATCTGACAAGCTGTACTTGTATTAT



ACGCAAAAGGGCCGGTGTATGTATTCGGGTGAGGTAATAGAATTGAAAGATTTAT



GGGATAACACTAAGTATGACATTGACCATATTTACCCCCAGTCTAAGACAATGGA



CGATTCATTAAATAACCGAGTTCTTGTCAAAAAGAAGTACAATGCCACAAAGAGC



GATAAGTACCCATTGAACGAAAATATAAGACATGAACGAAAAGGTTTCTGGAAAT



CATTGTTGGACGGTGGATTTATTTCCAAAGAAAAATACGAGAGATTGATTAGAAA



CACTGAACTATCTCCAGAGGAGTTAGCTGGCTTTATCGAAAGACAAATTGTTGAA



ACTAGACAGTCTACAAAAGCAGTTGCAGAAATCTTAAAACAAGTATTTCCAGAAT



CCGAAATTGTGTACGTCAAAGCCGGAACAGTAAGTAGATTTAGAAAAGACTTTGA



ATTATTGAAAGTACGAGAGGTTAACGACCTACATCATGCTAAGGATGCTTATTTA



AATATAGTCGTTGGTAATTCGTATTACGTGAAATTCACAAAAAACGCATCTTGGT



TCATCAAGGAGAATCCTGGTAGGACATACAACTTGAAAAAGATGTTTACATCAGG



ATGGAATATCGAAAGAAATGGTGAGGTTGCGTGGGAGGTAGGCAAGAAGGGAACC



ATTGTTACTGTAAAGCAAATTATGAATAAAAACAATATACTTGTTACGAGACAGG



TGCACGAAGCCAAAGGAGGGTTGTTTGACCAGCAAATCATGAAGAAAGGTAAAGG



TCAGATAGCAATAAAAGAGACTGATGAGCGTTTAGCTAGTATAGAAAAATATGGG



GGCTACAATAAGGCAGCTGGTGCTTACTTCATGTTGGTCGAATCAAAGGATAAAA



AAGGGAAGACGATCCGGACCATAGAGTTTATCCCTCTGTACTTGAAGAATAAGAT



TGAGTCTGACGAAAGCATCGCATTGAATTTCTTGGAAAAGGGGCGCGGTCTAAAG



GAGCCAAAAATATTGTTAAAGAAAATTAAAATAGACACCCTATTCGACGTCGATG



GGTTTAAGATGTGGCTTAGTGGTCGTACTGGGGACAGATTATTATTCAAGTGTGC



CAATCAGTTAATCCTTGACGAGAAAATCATTGTTACAATGAAAAAAATTGTTAAG



TTTATTCAAAGGCGACAAGAAAATAGAGAACTAAAGTTGAGTGATAAGGATGGAA



TCGATAATGAAGTGTTAATGGAGATTTATAACACTTTTGTCGACAAATTGGAGAA



TACGGTGTACAGAATTAGGCTATCTGAACAGGCTAAAACCCTAATTGATAAACAG



AAGGAGTTTGAGCGACTTTCTCTTGAAGACAAATCTTCAACTCTTTTCGAGATCC



TACATATCTTTCAGTGTCAATCTTCTGCAGCTAATTTGAAAATGATTGGAGGTCC



TGGTAAGGCTGGTATATTAGTCATGAACAACAACATATCTAAGTGTAATAAGATT



AGTATAATTAACCAATCACCGACAGGTATCTTTGAAAATGAAATTGATTTACTTA



AA





SEQ ID NO: 136
ATGAAATCATTCGACTCGTTCACCAACTTGTACTCCCTGTCTAAAACATTGAAAT



TTGAAATGCGACCTGTTGGTAACACCCAAAAGATGTTAGATAATGCAGGAGTTTT



CGAAAAGGATAAACTGATCCAGAAAAAATACGGTAAAACGAAACCATATTTCGAT



AGGTTGCATCGGGAATTTATAGAAGAAGCTTTGACTGGTGTAGAATTAATTGGCT



TAGATGAGAATTTCCGTACTCTAGTCGATTGGCAAAAAGATAAAAAGAACAATGT



TGCCATGAAGGCATACGAAAATAGTCTACAAAGACTAAGAACAGAGATCGGGAAA



ATTTTCAATTTGAAGGCAGAAGACTGGGTGAAGAACAAATATCCAATATTGGGTC



TTAAGAATAAGAATACTGATATATTGTTCGAGGAGGCCGTTTTCGGTATTCTTAA



GGCAAGATATGGTGAAGAGAAAGACACGTTTATTGAAGTTGAGGAGATTGATAAA



ACCGGTAAGTCCAAAATCAACCAGATCTCTATCTTCGACAGTTGGAAGGGCTTCA



CTGGTTATTTTAAGAAGTTCTTCGAAACTAGGAAGAACTTCTATAAAAACGATGG



TACTTCCACGGCTATTGCTACAAGAATTATCGACCAAAACCTTAAGCGTTTTATT



GATAACCTATCAATTGTTGAAAGTGTTCGACAGAAAGTAGATTTGGCTGAAACTG



AAAAATCTTTTAGTATCTCCTTATCCCAGTTTTTCTCTATAGATTTTTATAATAA



ATGTTTGCTGCAAGATGGCATTGACTACTATAATAAAATAATTGGTGGAGAGACA



TTGAAAAACGGAGAGAAGCTGATTGGCCTTAATGAGTTGATAAATCAATATAGAC



AAAATAATAAGGACCAGAAAATCCCTTTCTTTAAATTGCTAGACAAACAGATTTT



GTCTGAAAAGATCCTATTCTTGGATGAAATAAAGAACGATACTGAATTGATTGAA



GCTTTGTCCCAGTTTGCTAAAACAGCTGAAGAAAAGACAAAGATTGTGAAAAAAT



TGTTTGCTGATTTCGTAGAAAACAATTCTAAATATGATCTAGCCCAGATTTATAT



AAGTCAAGAAGCTTTCAATACAATAAGTAATAAGTGGACAAGTGAAACAGAAACT



TTTGCTAAGTATTTATTCGAAGCCATGAAGTCTGGTAAACTTGCCAAATACGAAA



AAAAAGATAACAGTTATAAATTTCCAGACTTTATAGCCCTTTCACAGATGAAGTC



TGCCTTATTGTCGATATCCTTAGAAGGTCATTTTTGGAAGGAAAAATATTATAAG



ATAAGCAAGTTCCAAGAAAAGACTAATTGGGAACAATTTTTGGCTATATTTCTAT



ATGAGTTCAATTCATTATTTTCCGATAAAATCAACACTAAGGATGGAGAGACTAA



GCAAGTTGGCTACTATTTGTTCGCAAAAGATCTGCACAATTTGATTCTATCAGAA



CAAATAGATATACCAAAAGATTCAAAGGTAACTATAAAGGATTTCGCAGATTCCG



TCCTCACCATTTATCAAATGGCTAAATATTTTGCCGTTGAAAAAAAGAGAGCGTG



GTTAGCAGAATACGAGTTGGACTCGTTTTATACTCAGCCAGATACTGGATACTTG



CAATTCTACGATAATGCATACGAAGACATTGTACAGGTATACAATAAACTTAGAA



ATTACTTAACCAAGAAGCCCTACAGTGAAGAAAAATGGAAGCTGAACTTTGAAAA



TTCGACTTTGGCAAATGGTTGGGATAAAAATAAAGAAAGTGACAACTCCGCAGTG



ATTTTGCAAAAGGGTGGGAAATATTACTTGGGTTTAATCACAAAAGGCCACAATA



AGATTTTTGATGATAGATTTCAAGAAAAATTCATAGTTGGTATAGAAGGTGGCAA



ATACGAGAAAATTGTCTATAAATTCTTCCCTGATCAAGCCAAAATGTTCCCAAAA



GTTTGCTTTTCTGCTAAAGGATTGGAGTTTTTCCGGCCTAGCGAGGAGATCCTTC



GTATCTACAACAATGCTGAATTCAAAAAAGGAGAAACCTATAGCATAGATTCTAT



GCAAAAACTGATAGATTTTTATAAGGATTGTTTAACAAAGTACGAAGGCTGGGCC



TGCTATACATTTAGACATTTAAAGCCCACAGAAGAATACCAAAATAACATTGGTG



AATTCTTTCGGGACGTTGCCGAAGACGGCTATAGGATCGATTTTCAAGGTATCTC



AGATCAATATATCCACGAAAAGAACGAGAAGGGTGAGCTGCACCTTTTCGAAATT



CATAATAAGGACTGGAATTTGGATAAGGCGAGAGATGGTAAATCGAAGACCACTC



AAAAGAACTTGCATACTTTATATTTTGAGTCCTTGTTTTCTAATGATAACGTCGT



CCAAAATTTTCCAATAAAGTTGAATGGACAAGCGGAAATTTTCTATCGGCCTAAG



ACAGAGAAAGACAAATTAGAATCAAAGAAAGATAAAAAGGGAAATAAAGTCATTG



ATCACAAACGATACTCTGAGAATAAAATATTTTTCCACGTACCATTGACACTCAA



CAGGACTAAGAATGACTCTTATAGATTTAATGCTCAGATTAATAATTTTTTGGCA



AATAACAAGGATATTAACATAATTGGGGTGGATAGAGGTGAAAAGCACTTGGTAT



ATTACTCTGTCATCACTCAGGCTTCTGATATATTGGAAAGCGGGTCTCTAAATGA



ATTGAACGGTGTTAACTACGCCGAAAAGCTAGGTAAAAAAGCTGAAAACAGAGAG



CAGGCTCGGCGCGATTGGCAAGATGTTCAAGGAATTAAAGACCTTAAAAAAGGCT



ACATTAGTCAAGTAGTTAGAAAGTTAGCCGATCTTGCTATTAAACATAACGCAAT



CATTATTCTGGAGGACCTAAATATGCGTTTTAAGCAAGTTAGGGGTGGCATAGAA



AAAAGTATTTATCAGCAGCTTGAGAAGGCTTTGATAGATAAGTTATCGTTCCTAG



TTGACAAAGGTGAAAAAAATCCTGAACAAGCTGGTCATCTGTTGAAAGCTTATCA



GCTGAGCGCACCTTTTGAAACATTTCAAAAAATGGGAAAACAAACAGGTATTATT



TTCTATACTCAAGCGAGTTATACAAGTAAATCTGACCCAGTGACAGGATGGAGAC



CACACCTTTATCTAAAATATTTTTCTGCTAAAAAGGCCAAAGATGACATCGCTAA



GTTTACAAAAATAGAATTTGTCAACGATAGATTTGAATTGACTTACGATATTAAA



GATTTTCAGCAAGCAAAAGAATACCCAAATAAGACAGTGTGGAAAGTATGCTCCA



ATGTGGAGAGATTTAGATGGGATAAAAATCTCAATCAAAACAAGGGTGGTTACAC



ACATTATACTAATATAACTGAAAATATTCAAGAATTGTTTACTAAGTACGGAATT



GACATAACCAAAGACTTACTAACTCAGATTTCAACTATTGACGAAAAACAAAATA



CCTCATTTTTCCGCGACTTTATTTTTTATTTCAACTTGATCTGTCAAATTCGTAA



CACGGATGATTCCGAAATTGCCAAGAAGAACGGAAAAGATGATTTCATCCTATCT



CCAGTGGAACCATTTTTTGACTCAAGAAAAGATAATGGTAATAAGTTGCCTGAGA



ACGGAGATGATAACGGCGCTTATAATATCGCTCGGAAGGGTATTGTAATTCTTAA



TAAAATATCTCAGTACTCTGAAAAGAACGAAAACTGCGAGAAAATGAAGTGGGGC



GACTTGTATGTATCTAATATAGATTGGGATAATTTCGTTACTCAAGCCAACGCGA



GACATTGA





SEQ ID NO: 137
ATGGAAAATTTTAAAAACCTATATCCAATTAATAAGACACTTAGATTCGAGCTTA



GGCCATACGGCAAAACACTAGAAAATTTTAAGAAGTCAGGCCTATTAGAAAAAGA



CGCCTTTAAGGCAAATTCCAGAAGATCAATGCAGGCAATTATTGATGAGAAATTT



AAAGAGACTATCGAGGAAAGGTTGAAATACACTGAATTCTCTGAGTGCGATCTGG



GAAACATGACTTCCAAGGATAAAAAGATTACCGATAAGGCTGCTACCAACCTCAA



AAAGCAAGTCATCTTATCGTTTGATGATGAAATTTTTAATAACTACTTAAAGCCG



GACAAAAACATTGACGCCCTATTCAAAAATGATCCGTCCAACCCCGTAATTTCAA



CTTTTAAGGGTTTTACCACGTACTTTGTAAATTTTTTTGAGATTCGTAAACATAT



CTTCAAAGGAGAATCGTCGGGTTCCATGGCCTATAGGATAATTGATGAAAATCTT



ACGACTTACTTAAACAATATCGAAAAGATAAAAAAGTTACCAGAAGAATTAAAGT



CTCAATTGGAAGGTATTGACCAAATAGACAAATTAAATAACTATAATGAGTTCAT



AACTCAAAGCGGTATCACACATTACAATGAAATTATCGGTGGTATATCTAAAAGT



GAGAACGTAAAAATACAGGGAATAAACGAGGGGATCAATCTATACTGTCAGAAGA



ATAAAGTAAAATTACCAAGACTAACGCCATTATACAAAATGATTCTGTCTGATAG



AGTTTCCAACTCGTTCGTGCTTGATACTATAGAAAATGATACTGAATTAATTGAG



ATGATTAGCGACTTGATTAATAAAACAGAAATATCTCAAGACGTAATAATGTCAG



ACATTCAGAACATTTTCATAAAATATAAACAGCTTGGTAATTTACCGGGGATAAG



TTACTCTAGCATCGTGAATGCTATTTGCTCCGATTATGACAATAATTTTGGTGAC



GGAAAAAGAAAAAAATCATATGAGAACGATAGGAAGAAACACCTTGAAACAAACG



TATACTCAATTAACTATATATCGGAACTGTTAACAGACACCGATGTATCATCTAA



TATAAAAATGAGATATAAGGAACTTGAACAAAATTACCAGGTGTGTAAGGAGAAT



TTCAATGCTACCAACTGGATGAACATTAAGAATATTAAACAGAGTGAAAAGACAA



ACTTGATTAAAGATCTACTAGATATACTGAAATCAATACAGAGATTCTACGATCT



GTTTGATATAGTTGATGAAGACAAAAATCCTAGTGCTGAGTTTTACACGTGGCTA



AGTAAAAATGCGGAAAAGTTAGATTTCGAGTTCAACTCTGTTTATAATAAATCTA



GGAATTATTTAACTAGAAAGCAGTATTCTGATAAAAAGATAAAATTGAACTTCGA



CTCCCCTACGTTGGCAAAGGGTTGGGATGCAAACAAAGAAATCGATAACTCCACC



ATAATAATGCGTAAGTTTAACAATGATAGGGGGGATTACGATTATTTTTTGGGAA



TTTGGAACAAATCTACCCCAGCGAATGAAAAAATTATTCCCCTTGAAGACAATGG



TCTTTTTGAAAAAATGCAGTATAAATTATATCCAGACCCATCCAAGATGCTTCCA



AAGCAATTTCTGTCAAAAATTTGGAAGGCTAAACACCCTACTACTCCTGAATTTG



ATAAGAAGTATAAGGAGGGCCGACACAAAAAGGGTCCAGATTTTGAAAAAGAATT



CCTGCATGAATTGATAGATTGTTTTAAGCATGGTTTGGTAAATCATGATGAAAAA



TATCAGGATGTCTTTGGATTCAATTTGAGAAATACAGAGGATTACAACTCATATA



CAGAATTTCTCGAGGACGTCGAACGTTGCAATTATAATCTCAGTTTCAACAAGAT



CGCAGACACTTCAAACTTAATTAACGACGGAAAATTGTACGTTTTTCAAATCTGG



TCGAAAGACTTTAGTATTGATTCAAAGGGTACAAAAAACCTAAATACAATATATT



TCGAAAGTCTATTCTCGGAAGAAAACATGATCGAAAAAATGTTCAAACTGTCAGG



CGAAGCTGAAATATTCTACCGTCCCGCAAGCCTTAATTATTGTGAGGATATCATT



AAAAAAGGACATCACCATGCAGAGTTAAAAGATAAATTCGATTACCCAATAATTA



AAGATAAAAGATACTCCCAGGATAAGTTCTTTTTCCATGTACCTATGGTTATTAA



CTACAAGTCGGAAAAACTAAACTCGAAGTCATTAAATAATAGAACTAACGAGAAC



TTGGGACAATTCACACATATAATTGGTATTGATCGTGGCGAAAGACATTTAATAT



ATCTGACTGTTGTTGATGTTTCAACAGGAGAAATTGTTGAACAGAAACATCTTGA



TGAAATTATAAACACAGATACAAAAGGCGTTGAGCATAAAACTCATTATCTAAAT



AAATTGGAGGAAAAGTCGAAGACTCGCGATAACGAGAGAAAGAGTTGGGAAGCAA



TTGAAACCATAAAAGAGCTTAAAGAAGGTTACATTAGTCACGTCATCAATGAAAT



ACAAAAGTTACAAGAAAAGTATAACGCTTTGATTGTAATGGAAAATCTAAATTAT



GGTTTTAAGAATTCAAGAATCAAAGTCGAAAAGCAGGTCTATCAGAAATTTGAAA



CGGCACTTATTAAAAAGTTTAACTACATTATTGATAAAAAGGACCCAGAAACTTA



TATTCATGGTTACCAACTGACGAACCCAATCACAACATTGGACAAAATTGGAAAC



CAAAGTGGAATTGTTTTATACATTCCAGCTTGGAATACATCCAAAATAGACCCTG



TCACGGGGTTTGTCAACTTGTTATATGCCGACGATTTAAAGTATAAAAACCAAGA



ACAAGCAAAGTCTTTTATTCAAAAGATTGATAATATTTATTTCGAAAACGGTGAA



TTTAAATTCGACATAGATTTTTCTAAATGGAACAACCGTTATTCAATAAGTAAAA



CTAAATGGACACTCACCTCATACGGCACTCGTATCCAAACCTTTCGGAATCCCCA



AAAAAATAACAAATGGGATTCTGCAGAATACGACTTGACCGAGGAATTTAAATTA



ATTCTTAATATAGACGGTACACTCAAAAGTCAAGACGTGGAGACATACAAGAAGT



TTATGTCGTTATTCAAGCTTATGCTTCAGTTGAGGAACTCCGTTACAGGCACTGA



TATTGATTACATGATTTCACCAGTAACGGATAAGACTGGGACTCATTTCGATTCT



AGGGAAAATATTAAAAATTTACCTGCTGACGCAGACGCAAACGGCGCATACAATA



TAGCAAGAAAAGGGATTATGGCCATTGAGAATATTATGAATGGCATATCAGATCC



ATTAAAGATAAGCAATGAAGACTACTTAAAATACATTCAGAATCAGCAAGAATAA





SEQ ID NO: 138
ATGACCCAGTTTGAAGGTTTCACCAATTTGTACCAAGTAAGTAAAACCTTGAGGT



TCGAATTGATCCCACAGGGCAAGACATTGAAGCATATTCAAGAGCAAGGATTTAT



AGAAGAAGATAAAGCGAGAAACGATCACTATAAAGAGTTAAAACCCATTATTGAC



AGGATCTATAAAACATACGCCGATCAATGCCTTCAATTAGTGCAATTAGATTGGG



AAAACTTGAGCGCTGCCATCGATTCCTACAGGAAGGAAAAAACAGAAGAAACAAG



AAATGCCTTAATCGAGGAACAAGCAACCTATAGAAACGCTATACACGATTACTTC



ATCGGTAGAACTGATAATCTAACAGATGCAATAAATAAGAGACATGCTGAGATAT



ATAAAGGACTATTTAAAGCAGAATTATTCAACGGAAAGGTGTTGAAACAGTTAGG



TACCGTTACAACTACTGAGCATGAAAATGCCTTGCTGAGAAGCTTTGACAAGTTT



ACTACCTACTTTTCGGGTTTCTACGAAAATCGCAAAAATGTATTTTCTGCGGAAG



ATATTTCAACTGCAATCCCTCATAGGATTGTTCAAGATAATTTCCCTAAGTTTAA



AGAGAACTGTCACATTTTTACAAGGTTAATTACTGCGGTTCCAAGTCTAAGAGAA



CATTTTGAGAATGTAAAAAAAGCGATTGGTATATTTGTATCCACTAGCATTGAAG



AGGTTTTCAGCTTCCCTTTTTATAACCAATTACTTACCCAAACACAGATCGACCT



GTACAACCAATTGTTAGGTGGTATATCGAGGGAGGCTGGTACGGAAAAGATTAAA



GGATTAAATGAAGTTCTTAATTTGGCCATACAAAAAAATGATGAAACCGCGCACA



TTATCGCATCTTTACCACATAGGTTTATACCGTTATTCAAGCAAATATTATCTGA



TCGTAATACCTTATCGTTCATATTAGAGGAGTTTAAATCTGACGAAGAAGTTATA



CAATCTTTTTGCAAGTATAAGACGCTATTGAGAAACGAAAACGTTCTGGAAACAG



CCGAAGCACTGTTCAATGAATTAAACAGTATCGACTTGACTCATATTTTTATATC



GCATAAAAAGTTGGAGACAATTTCTTCAGCATTGTGCGATCACTGGGACACTTTA



AGGAACGCACTATATGAACGTAGGATCTCAGAATTGACAGGTAAGATAACGAAGT



CTGCTAAAGAGAAAGTGCAGAGATCCCTAAAACACGAGGATATAAATTTGCAGGA



GATAATTTCAGCTGCAGGTAAAGAGTTGTCTGAAGCGTTCAAGCAAAAGACTTCC



GAAATCTTGTCACACGCACACGCCGCATTAGATCAACCTTTACCCACTACTTTGA



AAAAACAAGAAGAGAAGGAGATATTAAAATCACAACTTGATTCTTTACTTGGCCT



TTATCATCTTTTAGATTGGTTCGCTGTTGACGAGAGCAATGAAGTGGATCCAGAG



TTTTCCGCAAGATTGACCGGTATAAAGTTGGAAATGGAACCTTCGTTATCATTTT



ACAACAAAGCTAGGAACTATGCTACAAAAAAACCTTATTCTGTCGAAAAATTTAA



ACTGAACTTCCAAATGCCTACTCTAGCAAGTGGCTGGGATGTTAATAAAGAAAAG



AACAATGGCGCTATTTTGTTTGTAAAAAATGGCCTATACTATCTTGGAATTATGC



CTAAACAAAAAGGTCGCTACAAGGCTTTGTCATTTGAACCTACTGAAAAGACTAG



CGAAGGTTTCGATAAGATGTATTACGATTATTTCCCGGATGCCGCTAAAATGATC



CCCAAGTGCTCTACTCAATTGAAGGCAGTAACTGCTCATTTCCAAACGCATACCA



CGCCAATACTGCTTTCTAACAACTTTATAGAACCACTAGAAATAACGAAAGAAAT



TTACGACCTAAATAACCCAGAGAAAGAACCAAAAAAGTTCCAGACGGCCTACGCC



AAAAAGACAGGGGACCAAAAAGGTTACCGCGAGGCGTTATGTAAATGGATTGATT



TTACTAGGGACTTTTTATCAAAATACACTAAAACGACGTCTATTGATCTTAGCTC



CTTACGCCCGTCCTCCCAATACAAGGATCTAGGTGAGTATTACGCAGAGTTGAAC



CCGCTATTATACCATATTTCCTTCCAAAGGATTGCTGAAAAGGAAATTATGGACG



CTGTTGAAACTGGGAAATTGTACCTGTTTCAGATTTATAATAAGGACTTCGCAAA



GGGTCACCATGGTAAGCCTAACCTTCACACTTTGTACTGGACCGGACTATTCTCG



CCTGAAAATTTGGCTAAAACAAGTATCAAGTTAAACGGTCAGGCCGAGTTATTTT



ATAGACCCAAATCTAGAATGAAAAGAATGGCCCATAGATTAGGCGAAAAGATGTT



AAACAAGAAATTAAAGGACCAAAAAACCCCGATACCAGACACTCTATACCAAGAA



CTGTACGACTATGTGAATCACAGGCTTAGTCACGATTTATCAGATGAAGCGAGGG



CTTTATTGCCAAATGTCATCACCAAGGAAGTATCACATGAAATAATTAAGGATAG



AAGGTTCACATCTGATAAATTCTTTTTTCATGTCCCAATTACATTGAATTATCAA



GCAGCGAACTCACCATCTAAATTTAATCAGCGCGTCAACGCCTATTTGAAAGAAC



ATCCCGAAACACCAATCATCGGCATAGATCGAGGTGAGAGAAACTTAATATATAT



AACTGTGATTGATTCTACAGGAAAAATCCTGGAGCAACGATCTTTAAATACCATA



CAACAGTTTGATTATCAAAAAAAGTTGGATAACAGAGAAAAAGAACGTGTTGCCG



CTAGGCAGGCTTGGTCTGTGGTAGGAACAATTAAGGACTTAAAGCAGGGCTATCT



GTCCCAAGTTATTCATGAAATAGTCGATCTGATGATACATTATCAGGCAGTTGTC



GTGTTGGAAAATTTGAATTTTGGCTTTAAATCAAAAAGAACTGGCATAGCAGAAA



AAGCTGTGTACCAGCAGTTTGAAAAGATGTTAATCGATAAGCTAAACTGCCTTGT



TCTTAAAGATTACCCCGCAGAAAAAGTAGGTGGTGTTCTTAATCCATATCAGTTG



ACAGACCAATTTACATCCTTTGCGAAAATGGGTACGCAAAGCGGGTTCTTATTCT



ACGTACCGGCCCCCTATACTTCTAAGATCGACCCACTAACAGGTTTTGTGGACCC



TTTTGTTTGGAAGACGATAAAGAACCACGAGTCACGCAAACATTTCTTAGAGGGC



TTTGATTTCTTGCACTACGACGTGAAAACTGGTGATTTTATCTTACACTTTAAAA



TGAACAGAAATCTCTCTTTCCAACGTGGACTGCCCGGATTCATGCCGGCTTGGGA



CATCGTTTTTGAAAAGAATGAAACGCAGTTTGACGCCAAAGGTACACCATTTATA



GCGGGTAAGAGAATTGTGCCGGTCATAGAAAACCATAGATTTACAGGTAGATATA



GGGATCTGTACCCTGCTAATGAATTGATTGCATTACTCGAAGAGAAAGGAATTGT



GTTTCGAGATGGATCGAATATTTTACCTAAGTTGTTGGAAAATGATGATTCACAC



GCAATTGATACTATGGTTGCCCTCATAAGATCGGTATTGCAAATGAGAAACTCAA



ATGCTGCTACGGGAGAGGATTATATAAACAGCCCCGTTCGCGATCTTAATGGTGT



TTGTTTTGATTCACGTTTTCAGAACCCCGAATGGCCAATGGATGCCGACGCAAAC



GGAGCATATCATATTGCTCTTAAAGGCCAACTACTATTAAATCACTTAAAGGAAT



CCAAAGACCTAAAATTGCAAAACGGGATATCTAATCAGGATTGGCTGGCTTACAT



ACAAGAACTACGTAACTAG





SEQ ID NO: 139
ATGGCCGTTAAGTCAATCAAAGTGAAACTTAGACTGGATGACATGCCAGAGATTC



GTGCGGGGTTATGGAAACTTCATAAGGAAGTTAACGCAGGGGTAAGATATTATAC



CGAATGGTTATCATTACTTCGACAAGAGAATTTGTACAGAAGGTCCCCGAACGGC



GACGGTGAGCAAGAATGCGATAAGACGGCTGAAGAATGTAAGGCAGAACTTTTGG



AGCGCCTGAGAGCCCGTCAGGTTGAAAATGGCCATAGAGGTCCTGCGGGATCTGA



TGATGAGCTTTTACAGCTAGCTAGACAATTGTATGAATTGTTGGTCCCTCAGGCT



ATTGGGGCTAAAGGAGACGCTCAACAAATCGCCAGAAAGTTCTTGTCACCTCTGG



CTGACAAAGATGCCGTGGGAGGATTAGGTATCGCTAAAGCAGGTAATAAACCAAG



ATGGGTTAGAATGAGAGAAGCAGGCGAACCTGGTTGGGAAGAAGAGAAAGAAAAG



GCCGAAACTAGAAAAAGCGCTGACAGAACCGCAGATGTTTTACGGGCCTTGGCTG



ATTTTGGACTGAAGCCTTTGATGAGAGTGTATACTGATTCAGAAATGTCTTCCGT



TGAATGGAAGCCCCTAAGGAAGGGACAAGCGGTCAGAACCTGGGATAGGGATATG



TTTCAACAGGCTATTGAAAGGATGATGTCATGGGAATCCTGGAATCAAAGAGTAG



GTCAAGAATACGCTAAACTGGTCGAACAAAAGAATAGATTTGAACAAAAAAATTT



TGTAGGTCAAGAACATTTAGTACATTTGGTTAATCAACTTCAACAAGATATGAAA



GAGGCATCTCCTGGTTTGGAATCAAAAGAACAAACAGCACACTATGTTACCGGCC



GAGCTTTGCGAGGTTCTGACAAAGTATTTGAAAAGTGGGGGAAATTAGCTCCCGA



TGCCCCCTTTGATCTATATGATGCTGAAATTAAAAACGTTCAAAGAAGGAACACT



AGACGTTTTGGATCCCATGATCTTTTTGCAAAGCTAGCTGAGCCAGAATACCAGG



CTCTATGGCGTGAAGACGCCTCGTTTTTGACTAGATACGCAGTATACAATTCAAT



ACTCAGAAAACTAAACCATGCCAAGATGTTTGCTACATTCACCCTGCCCGATGCT



ACCGCTCATCCTATTTGGACTAGATTTGACAAGTTGGGGGGGAATCTACATCAGT



ACACATTTTTATTTAATGAATTCGGTGAAAGAAGACACGCTATTAGATTCCACAA



GCTCCTAAAGGTTGAAAACGGCGTTGCGAGAGAAGTTGATGATGTAACAGTTCCC



ATTTCTATGTCGGAGCAATTGGATAATCTATTGCCTAGAGACCCTAATGAACCAA



TTGCTTTGTACTTTCGTGACTACGGTGCAGAACAACACTTTACAGGTGAATTCGG



CGGAGCCAAGATTCAATGTAGACGTGATCAACTCGCACACATGCATAGAAGAAGA



GGCGCTCGTGATGTTTATTTAAATGTGTCTGTTAGAGTTCAATCCCAATCGGAGG



CTAGAGGTGAAAGAAGGCCACCATACGCAGCAGTTTTTAGGTTAGTAGGTGATAA



TCATAGGGCATTTGTCCACTTCGACAAATTAAGTGATTATTTAGCAGAGCACCCT



GATGATGGAAAGTTGGGCAGTGAGGGATTATTAAGTGGGTTGAGGGTAATGTCTG



TAGATCTTGGTCTTCGTACTTCTGCGAGTATCTCTGTCTTTAGAGTAGCACGTAA



GGATGAGTTGAAACCTAATAGCAAAGGAAGAGTCCCGTTTTTTTTTCCTATTAAG



GGTAACGATAACCTGGTGGCCGTGCATGAAAGATCACAACTTTTGAAATTGCCAG



GAGAAACGGAGTCCAAGGACTTGAGGGCAATTAGAGAGGAACGTCAGCGTACATT



GCGACAGCTGAGAACTCAATTGGCTTATTTGAGGTTGTTGGTTAGGTGTGGTTCC



GAGGATGTTGGCAGAAGAGAAAGGTCTTGGGCCAAATTGATAGAACAACCAGTGG



ACGCCGCAAATCACATGACACCAGATTGGAGAGAAGCTTTCGAAAATGAACTCCA



GAAATTAAAGAGCCTACATGGCATATGCTCTGATAAAGAGTGGATGGATGCCGTA



TACGAATCCGTTCGTAGAGTCTGGCGCCACATGGGTAAGCAAGTACGGGACTGGA



GAAAGGATGTTCGTTCCGGCGAAAGACCGAAGATAAGGGGGTATGCAAAGGACGT



TGTAGGCGGTAATTCTATTGAACAGATTGAGTATTTGGAAAGGCAGTACAAATTT



CTTAAATCCTGGAGCTTCTTCGGCAAAGTGTCAGGACAAGTCATCAGGGCTGAAA



AAGGTTCCAGATTTGCTATTACGCTAAGGGAACATATTGATCATGCGAAAGAAGA



TAGACTGAAAAAACTAGCAGATAGAATAATTATGGAAGCACTTGGTTACGTCTAT



GCACTTGATGAAAGAGGCAAGGGGAAATGGGTAGCTAAATACCCGCCTTGTCAAC



TTATTTTATTAGAAGAATTAAGCGAGTACCAATTTAACAACGATAGACCTCCATC



CGAAAATAATCAGCTGATGCAATGGTCCCATAGGGGTGTTTTTCAAGAATTGATA



AATCAAGCTCAAGTACACGATTTGCTGGTAGGTACTATGTACGCAGCGTTTTCGA



GCCGTTTTGATGCAAGAACTGGTGCCCCAGGTATCAGATGTCGACGTGTTCCGGC



CAGATGTACACAGGAACATAACCCTGAGCCATTTCCGTGGTGGCTTAATAAGTTT



GTTGTCGAGCACACATTAGACGCATGCCCTCTGAGAGCAGATGACCTTATACCCA



CTGGAGAAGGCGAAATATTTGTTAGTCCATTCTCTGCAGAAGAAGGTGACTTTCA



CCAGATACATGCAGACTTAAATGCAGCACAGAATCTCCAACAAAGGTTGTGGTCG



GATTTTGATATTTCGCAAATAAGACTAAGATGCGATTGGGGAGAGGTTGATGGAG



AATTGGTGCTGATTCCAAGATTAACCGGAAAGCGAACTGCCGATTCCTATTCTAA



CAAGGTGTTTTACACAAATACTGGTGTTACCTATTACGAAAGAGAAAGGGGTAAG



AAGAGACGTAAAGTATTTGCTCAAGAAAAATTGTCAGAAGAGGAGGCAGAACTGT



TAGTAGAAGCAGACGAAGCCAGAGAAAAATCAGTTGTGCTTATGCGTGACCCTTC



CGGCATTATAAATCGTGGTAATTGGACACGACAAAAAGAATTTTGGTCTATGGTC



AATCAACGTATCGAAGGCTACCTAGTTAAGCAAATCAGGTCTAGGGTTCCACTAC



AAGATAGCGCATGTGAAAATACGGGTGATATATAA





SEQ ID NO: 140
ATGGCTACTAGATCTTTCATTTTAAAAATTGAACCTAATGAAGAAGTGAAGAAGG



GTCTCTGGAAAACTCACGAAGTACTTAATCATGGCATTGCCTATTATATGAATAT



CCTGAAGCTTATTCGTCAAGAAGCTATATACGAGCATCATGAGCAAGATCCTAAG



AACCCTAAGAAAGTAAGCAAAGCGGAAATTCAGGCTGAATTGTGGGACTTCGTCT



TGAAGATGCAGAAGTGTAACAGTTTTACGCACGAAGTTGATAAAGATGTGGTGTT



TAATATTTTGAGGGAGCTATATGAGGAGTTGGTGCCCTCGAGTGTCGAAAAAAAA



GGAGAAGCTAATCAGCTGTCAAATAAATTTTTATATCCTCTGGTGGATCCAAACT



CTCAATCAGGTAAAGGCACTGCCAGTAGTGGTCGAAAACCGAGATGGTATAATTT



GAAAATCGCAGGTGATCCATCGTGGGAAGAAGAAAAAAAAAAATGGGAAGAAGAT



AAAAAAAAAGATCCCCTTGCCAAAATACTAGGTAAGCTAGCCGAGTATGGACTTA



TACCATTATTCATTCCTTTCACGGACTCTAATGAACCAATTGTGAAGGAAATCAA



ATGGATGGAAAAATCACGTAATCAGTCTGTTAGGAGGTTGGACAAAGATATGTTT



ATACAGGCTCTTGAGAGGTTTTTGTCGTGGGAGTCCTGGAATTTGAAAGTGAAAG



AAGAATATGAAAAAGTGGAAAAGGAGCATAAGACGTTGGAAGAAAGGATTAAGGA



AGATATTCAGGCCTTTAAGAGTCTGGAACAGTACGAAAAAGAAAGACAGGAACAG



TTATTGAGAGATACTCTAAACACTAATGAATATAGGCTTTCCAAGAGGGGCTTGC



GAGGATGGAGAGAGATAATTCAGAAATGGTTGAAAATGGATGAGAACGAGCCATC



GGAGAAATATCTAGAGGTGTTTAAAGATTACCAAAGAAAGCACCCTCGCGAAGCT



GGTGATTACTCTGTTTATGAATTCCTTTCGAAGAAGGAAAATCACTTCATCTGGC



GAAATCATCCAGAGTACCCATATTTATATGCTACATTTTGCGAAATTGACAAGAA



AAAAAAAGATGCTAAACAGCAAGCGACATTCACCCTCGCTGATCCCATCAACCAC



CCATTATGGGTCAGGTTCGAAGAGAGATCAGGCTCGAACCTGAATAAGTACAGGA



TCTTGACTGAGCAATTGCATACTGAGAAGTTAAAAAAGAAATTGACGGTCCAACT



TGACAGATTGATTTATCCCACTGAATCTGGTGGATGGGAGGAGAAAGGTAAGGTT



GATATTGTCCTATTGCCTTCTCGTCAATTTTACAACCAAATATTTCTGGACATCG



AAGAGAAGGGTAAACATGCTTTTACCTATAAGGATGAGAGTATTAAATTTCCATT



GAAGGGAACGCTTGGCGGCGCTAGAGTTCAGTTCGATAGAGATCATTTGAGAAGA



TACCCGCATAAAGTGGAATCTGGTAATGTAGGTCGGATCTACTTTAACATGACGG



TAAATATTGAACCTACCGAGTCACCAGTCAGTAAGTCTTTAAAGATTCATAGGGA



TGATTTCCCTAAATTTGTCAACTTCAAGCCTAAGGAACTAACCGAGTGGATCAAA



GACAGTAAAGGCAAAAAGTTAAAGAGCGGTATTGAGTCCCTGGAGATAGGTCTTA



GAGTCATGTCTATCGATTTGGGTCAAAGACAAGCAGCCGCAGCATCTATTTTCGA



AGTTGTTGACCAAAAACCGGATATCGAGGGGAAATTATTTTTTCCAATAAAAGGA



ACTGAGCTATACGCTGTGCATCGCGCATCCTTCAATATAAAACTGCCAGGAGAAA



CACTAGTAAAATCTAGAGAGGTCTTGCGTAAAGCACGTGAGGACAATCTCAAATT



AATGAATCAGAAGTTAAATTTCCTTAGGAACGTGTTGCATTTCCAACAGTTCGAG



GACATAACTGAACGCGAGAAAAGAGTCACTAAGTGGATCTCAAGACAAGAAAATA



GTGATGTGCCATTAGTGTATCAAGACGAACTTATTCAAATAAGAGAGCTAATGTA



TAAACCATATAAAGACTGGGTGGCATTCTTAAAACAATTACACAAGCGGCTTGAA



GTAGAAATAGGAAAAGAAGTAAAGCATTGGAGGAAGAGTCTGTCCGATGGTCGCA



AAGGCCTGTACGGGATATCACTTAAAAATATTGATGAAATTGACAGAACACGAAA



ATTTTTGTTAAGATGGTCATTGAGACCAACCGAACCAGGTGAGGTTAGAAGGTTG



GAACCAGGCCAAAGGTTTGCCATCGATCAATTAAACCATCTTAACGCACTGAAAG



AAGATAGATTGAAGAAGATGGCGAACACTATTATTATGCACGCTCTAGGTTATTG



CTATGATGTGAGAAAGAAAAAATGGCAAGCCAAGAACCCTGCATGCCAAATTATT



TTGTTTGAAGATCTTTCTAATTACAATCCATACGAAGAGCGTTCACGTTTTGAAA



ACTCTAAATTGATGAAATGGTCTAGAAGAGAGATTCCGAGACAGGTCGCTCTACA



AGGGGAGATTTACGGTCTTCAAGTCGGTGAGGTTGGTGCTCAATTTTCTTCCAGA



TTTCATGCAAAAACTGGGTCTCCAGGCATTAGGTGTTCGGTCGTTACTAAGGAAA



AGTTACAGGACAACCGTTTCTTCAAAAATTTGCAACGTGAAGGCCGTTTAACACT



TGATAAGATAGCTGTCCTTAAGGAAGGCGATCTGTACCCAGATAAAGGTGGTGAG



AAATTCATATCTTTGAGTAAAGACAGGAAACTGGTTACAACACACGCCGACATTA



ACGCAGCTCAGAACTTGCAAAAGAGATTCTGGACAAGGACCCACGGCTTCTATAA



GGTGTACTGTAAAGCTTATCAAGTAGATGGACAAACGGTTTATATTCCTGAATCA



AAGGACCAGAAACAAAAAATTATAGAAGAATTTGGTGAAGGATACTTTATCTTGA



AGGATGGAGTTTATGAGTGGGGCAATGCAGGTAAGTTAAAGATAAAGAAAGGTTC



ATCAAAGCAATCAAGTAGCGAACTGGTCGATTCGGATATTTTAAAGGATAGCTTT



GATCTAGCTAGTGAATTGAAGGGAGAAAAGTTAATGTTATACAGAGATCCCAGTG



GGAATGTATTTCCATCTGATAAGTGGATGGCCGCCGGAGTGTTTTTTGGCAAATT



AGAGAGAATCTTGATTTCTAAACTGACCAATCAATACTCAATTTCGACCATCGAA



GACGACTCTTCAAAACAATCCATGTGA





SEQ ID NO: 141
ATGCCTACTCGCACCATCAATCTGAAGTTAGTTTTGGGGAAGAACCCAGAAAATG



CGACTCTAAGACGGGCACTATTCTCTACACATAGACTTGTCAACCAAGCGACTAA



GAGAATTGAAGAATTTTTACTGTTGTGTAGAGGAGAAGCTTATCGTACCGTAGAT



AATGAAGGTAAAGAAGCTGAGATCCCACGCCATGCTGTTCAAGAAGAGGCGCTTG



CTTTTGCAAAAGCTGCACAACGACATAACGGCTGTATCTCCACATATGAGGACCA



GGAAATCTTGGATGTGCTTAGACAATTGTATGAAAGATTAGTACCTAGCGTCAAT



GAAAACAACGAGGCTGGGGATGCCCAAGCCGCTAACGCTTGGGTGAGTCCATTAA



TGAGTGCAGAGTCCGAAGGTGGACTATCGGTCTATGATAAAGTGTTAGACCCGCC



GCCAGTATGGATGAAACTCAAAGAAGAGAAAGCGCCTGGTTGGGAAGCTGCTTCT



CAGATTTGGATACAGTCCGACGAAGGTCAATCGCTGCTAAATAAACCGGGTAGCC



CACCACGTTGGATTAGAAAACTTAGATCTGGTCAACCGTGGCAAGATGACTTCGT



TTCAGACCAAAAAAAAAAGCAAGATGAACTAACGAAAGGTAACGCACCACTCATA



AAACAATTGAAAGAGATGGGCCTCTTGCCTTTAGTTAATCCCTTTTTTAGACATT



TGTTGGATCCCGAGGGTAAGGGTGTATCCCCATGGGACAGATTGGCCGTAAGGGC



CGCGGTGGCGCACTTCATCTCTTGGGAAAGTTGGAACCACAGAACAAGAGCTGAG



TATAACAGTTTGAAACTGCGAAGAGATGAATTTGAGGCCGCATCTGATGAATTCA



AGGACGATTTTACATTGCTACGACAATATGAGGCTAAGCGACATAGTACGCTTAA



GTCAATTGCCTTAGCTGATGACTCTAACCCGTACCGAATTGGTGTAAGGTCCTTG



AGAGCCTGGAATAGGGTTAGAGAAGAATGGATTGACAAAGGCGCAACCGAGGAAC



AAAGGGTTACCATCCTTAGTAAGCTTCAAACACAATTACGGGGTAAATTCGGTGA



TCCAGACCTATTTAATTGGCTAGCCCAAGATAGACACGTACACCTGTGGTCCCCG



AGAGATTCCGTCACGCCCCTCGTAAGGATTAATGCCGTCGACAAAGTGCTTAGAA



GACGTAAGCCTTATGCACTGATGACTTTTGCACATCCGAGATTCCATCCAAGATG



GATTCTATACGAAGCGCCTGGTGGTTCTAACTTGCGACAATACGCTTTAGATTGT



ACTGAAAATGCTCTGCATATTACACTTCCATTACTCGTCGACGACGCCCATGGTA



CATGGATTGAGAAAAAAATCCGCGTACCACTCGCTCCTAGTGGACAAATACAAGA



TTTAACTTTAGAAAAACTTGAAAAGAAAAAAAACAGATTATACTATAGATCAGGA



TTCCAACAATTTGCTGGATTAGCCGGTGGTGCTGAGGTGTTGTTTCATAGGCCGT



ATATGGAACATGATGAGAGATCAGAAGAATCTCTGTTGGAAAGGCCAGGCGCTGT



GTGGTTCAAATTAACCTTAGATGTTGCTACCCAAGCACCACCTAACTGGTTAGAT



GGTAAAGGCAGAGTTAGGACACCTCCAGAAGTTCATCATTTCAAAACCGCTCTGT



CAAATAAATCTAAACATACGAGAACCTTGCAACCAGGATTGAGAGTCCTTTCTGT



TGATTTGGGTATGAGAACATTTGCTTCTTGTTCTGTTTTCGAATTGATCGAAGGT



AAACCTGAAACAGGTAGAGCATTCCCTGTTGCTGACGAAAGATCAATGGATAGTC



CAAATAAGTTATGGGCCAAGCACGAGAGAAGCTTTAAACTAACTCTGCCTGGAGA



AACACCGAGCAGAAAGGAGGAAGAAGAGAGAAGCATTGCTAGGGCAGAGATTTAC



GCGCTGAAAAGAGATATTCAAAGACTGAAATCACTCCTAAGATTAGGTGAGGAAG



ATAATGATAATAGAAGAGATGCTTTGTTAGAGCAATTCTTTAAAGGATGGGGTGA



AGAGGACGTAGTTCCTGGTCAAGCTTTCCCTAGAAGCCTCTTTCAGGGATTAGGC



GCTGCACCCTTTAGGTCAACACCCGAATTGTGGAGACAGCACTGTCAGACGTATT



ACGACAAAGCGGAAGCTTGCCTGGCAAAGCATATTTCCGACTGGAGGAAGAGAAC



TAGACCTCGTCCGACTTCGAGAGAGATGTGGTATAAGACAAGATCTTACCATGGT



GGCAAAAGTATTTGGATGCTAGAATACTTAGATGCTGTCCGCAAATTACTACTTT



CATGGTCGTTAAGAGGTCGTACTTACGGAGCTATTAATAGACAAGACACCGCTCG



TTTTGGTTCCTTAGCTTCTAGATTGTTGCATCATATCAACTCTTTAAAGGAAGAC



CGCATCAAAACCGGTGCAGATAGTATTGTGCAGGCCGCAAGGGGCTATATTCCTC



TCCCACATGGCAAGGGTTGGGAACAGCGTTATGAACCCTGTCAGTTGATATTATT



TGAAGATCTAGCTAGGTACAGATTTCGTGTAGACAGACCTCGGAGAGAGAATTCG



CAATTGATGCAGTGGAATCATCGAGCTATAGTAGCAGAAACGACGATGCAAGCTG



AACTATACGGTCAAATAGTCGAAAATACCGCTGCTGGTTTCTCCTCAAGATTTCA



TGCTGCAACTGGTGCTCCTGGTGTCAGATGTCGCTTTTTGTTAGAACGAGATTTC



GATAATGACCTACCAAAGCCGTACTTACTGAGAGAACTAAGTTGGATGTTAGGTA



ACACAAAGGTTGAATCAGAGGAAGAAAAATTGCGTCTTCTAAGCGAGAAAATTAG



ACCAGGTTCATTAGTCCCTTGGGATGGGGGTGAACAATTCGCGACATTACACCCG



AAAAGACAAACTCTTTGTGTCATTCACGCAGATATGAACGCTGCTCAAAACCTGC



AACGCAGATTTTTCGGAAGGTGTGGGGAAGCCTTTCGCCTTGTGTGTCAGCCACA



TGGTGATGATGTTTTGAGGCTAGCGTCTACACCAGGTGCAAGACTTTTGGGTGCA



TTACAACAACTGGAAAATGGTCAGGGAGCTTTCGAATTAGTTCGTGATATGGGTA



GCACATCACAAATGAATCGTTTCGTCATGAAGTCGTTGGGCAAAAAAAAGATCAA



GCCATTACAAGACAATAACGGGGATGATGAACTAGAAGACGTGCTATCTGTTTTA



CCTGAAGAAGATGATACCGGACGAATTACTGTATTTCGGGACTCTTCGGGTATAT



TCTTCCCTTGTAACGTTTGGATCCCGGCAAAACAGTTCTGGCCTGCGGTCCGTGC



TATGATTTGGAAGGTTATGGCATCACATTCATTGGGTTAG





SEQ ID NO: 142
ATGACAAAGTTAAGGCATAGACAGAAGAAGTTAACTCACGATTGGGCGGGGTCTA



AAAAGAGAGAAGTTCTAGGGAGCAATGGTAAATTACAGAATCCATTGCTAATGCC



CGTCAAAAAAGGTCAGGTGACAGAATTTCGAAAAGCATTTTCCGCATACGCCCGA



GCAACCAAAGGGGAAATGACGGATGGCAGAAAAAATATGTTTACTCACTCATTTG



AACCATTCAAGACCAAGCCTTCGTTACATCAGTGCGAACTGGCTGACAAAGCCTA



CCAGAGCTTGCATTCATATTTACCGGGTTCTTTGGCGCATTTTCTTTTATCTGCC



CATGCACTTGGTTTTAGGATTTTTAGCAAATCAGGGGAAGCCACTGCATTCCAAG



CGTCCTCAAAGATTGAAGCTTACGAAAGCAAGTTAGCTAGCGAGCTTGCTTGTGT



TGATTTGTCTATTCAGAACTTGACTATTTCAACTTTGTTCAACGCATTAACGACT



TCCGTAAGAGGTAAAGGTGAGGAGACATCGGCAGATCCACTGATAGCTAGATTTT



ACACCTTACTTACCGGTAAACCACTAAGCAGAGACACTCAGGGCCCAGAACGAGA



TTTAGCCGAGGTGATAAGCAGAAAAATTGCAAGTTCTTTTGGAACTTGGAAGGAG



ATGACTGCCAATCCACTTCAATCTCTTCAATTTTTTGAAGAGGAGTTGCATGCGC



TAGATGCAAATGTTAGTTTGTCACCTGCCTTCGATGTTCTGATTAAGATGAACGA



CCTGCAGGGTGACTTGAAGAACAGAACGATAGTTTTTGATCCAGATGCTCCTGTG



TTTGAATATAATGCTGAGGATCCTGCTGACATCATCATTAAACTGACAGCTAGAT



ATGCGAAAGAAGCAGTGATTAAAAATCAAAATGTCGGGAATTATGTTAAGAACGC



TATTACGACAACTAACGCAAACGGACTAGGTTGGTTGCTGAACAAAGGCCTTTCC



TTATTGCCTGTCTCCACTGATGACGAACTATTGGAGTTTATTGGGGTCGAGAGAT



CCCATCCTAGCTGTCATGCGTTGATAGAACTTATCGCTCAGTTAGAAGCACCTGA



ACTGTTCGAAAAAAATGTTTTTTCTGATACTCGTTCCGAGGTTCAAGGTATGATA



GATTCAGCTGTAAGCAATCATATCGCCAGGCTGTCAAGCTCTCGTAATTCATTGA



GCATGGACTCAGAGGAACTTGAGAGATTGATAAAATCTTTTCAAATTCATACACC



ACATTGTTCATTATTTATAGGGGCTCAATCCTTATCTCAACAATTGGAAAGCCTA



CCCGAAGCATTGCAGTCAGGAGTGAACAGTGCTGATATTCTGCTCGGCTCAACCC



AATACATGTTGACAAATTCTTTGGTCGAGGAGTCAATCGCTACGTATCAGAGAAC



CTTAAATAGAATTAACTACCTGTCCGGCGTTGCAGGACAGATTAACGGTGCTATT



AAGAGGAAAGCTATTGATGGTGAGAAGATACATTTACCCGCTGCTTGGTCAGAGT



TAATTTCTTTACCCTTTATTGGGCAACCAGTGATTGATGTTGAATCAGATTTAGC



CCACTTAAAGAACCAATACCAGACATTGTCTAACGAATTTGATACGCTGATTTCC



GCACTGCAAAAGAATTTCGACTTAAATTTTAATAAAGCCTTGCTTAATCGAACAC



AACATTTCGAGGCTATGTGTAGATCAACAAAAAAGAATGCCCTTTCTAAGCCTGA



GATCGTTAGTTATAGAGATTTGCTAGCCAGGTTGACTTCTTGTCTTTATAGGGGC



TCTCTAGTCTTGAGGAGGGCGGGTATAGAAGTACTGAAAAAGCACAAGATATTTG



AGTCCAACTCTGAATTAAGAGAGCACGTTCATGAAAGAAAACACTTCGTATTTGT



TTCTCCGCTCGATAGAAAAGCCAAGAAGCTCCTACGTTTGACTGACTCTAGGCCT



GATTTATTGCACGTAATTGATGAAATACTACAACATGATAATTTAGAGAACAAGG



ATAGAGAATCTTTGTGGTTAGTTCGATCTGGTTATTTACTGGCCGGCCTACCAGA



CCAACTCTCCTCTTCCTTTATAAATCTTCCAATCATTACTCAAAAAGGCGATCGT



CGCTTGATAGATCTCATTCAATACGACCAAATTAATAGAGATGCTTTTGTGATGT



TGGTAACTTCCGCTTTTAAGTCGAACTTAAGTGGGCTGCAGTACAGAGCAAACAA



ACAATCTTTTGTGGTTACGCGCACTTTGTCACCATATTTGGGATCTAAATTGGTT



TATGTGCCCAAAGATAAAGATTGGCTGGTCCCTTCCCAAATGTTCGAGGGGAGAT



TTGCGGACATTTTGCAATCCGATTATATGGTGTGGAAGGACGCTGGAAGATTGTG



TGTTATTGACACAGCTAAGCATTTGTCTAACATTAAAAAATCTGTATTCTCAAGT



GAAGAAGTCCTCGCGTTTTTAAGAGAATTGCCACACCGTACGTTTATCCAAACTG



AGGTCAGGGGTTTAGGGGTGAATGTGGACGGTATTGCATTTAATAACGGGGATAT



ACCCTCTCTGAAGACGTTTAGCAATTGCGTGCAAGTCAAAGTGAGTCGGACAAAC



ACTAGTCTGGTCCAAACATTAAATAGATGGTTTGAAGGCGGTAAGGTCTCGCCGC



CTAGCATCCAATTTGAGAGAGCATATTACAAAAAAGATGATCAAATCCACGAGGA



CGCTGCAAAAAGGAAGATAAGGTTTCAAATGCCAGCTACAGAGTTGGTACACGCG



TCAGACGACGCAGGATGGACCCCCTCCTATTTACTTGGTATCGATCCCGGTGAAT



ATGGTATGGGTTTGTCATTGGTCTCAATAAATAATGGCGAAGTTTTAGATAGCGG



ATTTATACACATAAATTCATTGATAAATTTCGCTTCTAAGAAATCAAATCATCAA



ACCAAAGTTGTTCCGAGGCAGCAATACAAGTCACCATACGCCAACTATCTAGAAC



AATCTAAAGATTCTGCAGCAGGAGACATAGCTCATATTTTGGATAGACTTATCTA



CAAGTTGAACGCCCTACCCGTTTTCGAAGCTCTATCTGGCAATAGTCAAAGCGCA



GCGGATCAGGTTTGGACAAAAGTCCTCAGCTTCTACACCTGGGGAGATAATGATG



CACAAAATTCAATTCGTAAGCAACATTGGTTCGGTGCTTCACACTGGGACATTAA



AGGCATGTTGAGGCAACCGCCAACAGAAAAAAAGCCCAAACCATACATTGCCTTT



CCCGGTTCACAAGTTTCTTCTTATGGTAATTCTCAAAGGTGTTCATGTTGTGGAC



GTAACCCAATTGAACAATTGCGCGAAATGGCGAAGGACACATCCATTAAGGAGTT



GAAGATTAGAAATTCAGAAATTCAATTGTTCGACGGTACTATAAAGTTATTTAAT



CCAGACCCGTCAACGGTCATAGAAAGAAGAAGACATAATTTAGGGCCATCAAGAA



TTCCTGTAGCTGATAGAACTTTCAAAAATATAAGTCCAAGCTCACTAGAATTCAA



AGAACTAATAACGATTGTGTCACGGTCTATACGTCATTCCCCAGAATTTATTGCT



AAAAAAAGAGGTATAGGTAGTGAGTACTTTTGTGCTTATAGTGATTGTAATTCCT



CCTTAAATTCAGAAGCAAATGCGGCTGCGAACGTTGCCCAAAAGTTCCAAAAGCA



ATTGTTTTTCGAATTATAG





SEQ ID NO: 143
ATGAAAAGAATCTTGAACTCTTTAAAGGTTGCCGCCCTGCGTTTGTTATTTAGAG



GTAAAGGATCTGAACTTGTCAAGACTGTTAAATACCCTTTGGTCTCGCCGGTTCA



GGGTGCAGTTGAGGAGTTAGCTGAGGCGATCCGCCATGATAACCTACATCTGTTT



GGTCAAAAAGAAATTGTTGACCTTATGGAAAAGGATGAAGGTACGCAAGTTTACT



CAGTGGTTGATTTCTGGTTAGATACCCTTCGTTTGGGGATGTTTTTCAGTCCATC



AGCAAACGCATTAAAAATCACGCTGGGTAAGTTTAATTCTGATCAGGTTAGCCCT



TTTAGGAAAGTGTTAGAGCAGTCTCCATTCTTCTTGGCTGGTAGGCTGAAGGTTG



AACCGGCAGAACGTATATTATCTGTCGAGATCCGTAAGATTGGGAAGAGGGAAAA



CAGAGTTGAGAACTATGCTGCTGACGTAGAAACGTGTTTTATAGGCCAATTAAGT



TCAGATGAGAAACAGTCAATACAAAAATTAGCTAATGATATCTGGGATAGTAAAG



ATCATGAAGAGCAAAGAATGTTAAAGGCAGATTTCTTCGCTATCCCTTTGATTAA



GGATCCAAAGGCTGTGACCGAAGAGGATCCTGAAAATGAAACTGCTGGTAAACAA



AAACCCTTGGAGTTGTGTGTCTGCCTTGTCCCAGAACTTTACACAAGAGGATTCG



GGTCAATAGCCGATTTTTTGGTTCAACGCTTAACTCTTTTAAGGGATAAAATGTC



TACAGATACTGCAGAAGATTGTTTAGAATATGTCGGGATTGAGGAGGAAAAAGGT



AACGGCATGAACTCATTGTTGGGAACGTTCTTAAAGAATTTGCAAGGCGATGGAT



TTGAGCAGATTTTCCAATTTATGTTAGGGAGCTATGTCGGTTGGCAAGGGAAGGA



AGATGTTTTAAGAGAGAGATTAGACTTATTGGCTGAAAAAGTGAAGAGGTTACCG



AAACCAAAATTTGCTGGCGAATGGTCTGGTCATAGGATGTTCTTGCATGGCCAAT



TGAAGTCTTGGTCTTCAAATTTTTTTAGACTATTTAACGAGACAAGGGAACTTCT



AGAGTCTATTAAGTCAGATATACAGCATGCCACAATGCTAATATCATATGTAGAA



GAAAAAGGTGGTTATCATCCTCAATTACTTAGTCAATATAGAAAACTTATGGAAC



AACTACCAGCTTTGCGTACCAAGGTATTGGACCCTGAGATTGAAATGACACATAT



GTCCGAAGCAGTTCGCTCTTATATAATGATACATAAATCTGTTGCGGGTTTTTTA



CCGGATTTATTAGAATCATTAGATAGAGACAAGGATCGTGAGTTTCTGCTTAGTA



TTTTTCCAAGAATCCCAAAAATTGATAAAAAAACCAAGGAAATTGTAGCTTGGGA



ACTGCCGGGAGAACCAGAAGAAGGTTATTTATTTACTGCTAATAACTTGTTCAGA



AACTTCTTAGAGAATCCGAAACATGTCCCGAGATTTATGGCCGAAAGGATCCCAG



AAGATTGGACTCGATTACGCTCTGCTCCTGTCTGGTTCGATGGAATGGTAAAACA



ATGGCAAAAAGTCGTTAACCAGTTAGTAGAATCACCAGGTGCTTTATATCAATTT



AACGAATCCTTCTTGAGACAAAGGTTACAGGCCATGTTAACTGTGTATAAGAGGG



ACTTACAAACTGAAAAATTTCTTAAACTTTTGGCGGATGTTTGTAGGCCTCTTGT



AGATTTTTTTGGTTTGGGTGGAAATGATATTATTTTTAAGAGCTGTCAAGACCCA



AGAAAACAATGGCAAACCGTTATTCCTCTCTCTGTTCCGGCAGATGTCTATACTG



CTTGCGAAGGTTTGGCGATTAGACTAAGGGAGACATTAGGATTCGAATGGAAGAA



TTTGAAAGGTCACGAGAGAGAAGATTTCTTAAGATTGCACCAGTTATTGGGCAAT



TTACTTTTCTGGATTCGTGATGCTAAATTGGTAGTAAAATTAGAGGATTGGATGA



ACAACCCATGTGTTCAGGAATATGTAGAAGCCCGGAAAGCTATCGATCTTCCACT



AGAAATATTCGGTTTTGAAGTGCCTATCTTCCTGAATGGCTATCTATTTTCGGAG



TTGAGACAATTAGAACTTTTGCTTAGGAGAAAAAGTGTGATGACTAGCTACAGTG



TAAAGACTACTGGATCTCCTAATAGGCTATTTCAGCTAGTTTATTTACCTCTAAA



CCCTAGTGACCCCGAAAAGAAGAACTCAAATAACTTTCAAGAACGTTTGGATACC



CCAACTGGTTTGTCCCGTCGTTTCCTAGACCTAACCCTTGATGCATTCGCAGGTA



AGTTACTTACCGATCCAGTTACACAAGAATTGAAGACAATGGCAGGTTTTTACGA



TCATCTTTTTGGATTCAAATTGCCATGTAAACTCGCCGCCATGTCGAATCATCCA



GGTTCTTCTTCAAAGATGGTTGTGTTAGCGAAACCCAAAAAAGGTGTTGCTTCTA



ATATAGGGTTTGAACCGATCCCAGATCCCGCTCATCCCGTATTTAGGGTTAGATC



CAGTTGGCCAGAGTTGAAGTACCTCGAGGGGCTATTGTATTTGCCAGAAGACACA



CCTTTGACCATCGAATTAGCAGAGACCTCCGTATCGTGCCAAAGTGTCTCGTCAG



TTGCATTCGATTTGAAAAACTTGACAACGATCTTAGGTCGTGTGGGAGAATTTAG



GGTCACAGCTGATCAACCCTTTAAACTAACGCCTATAATCCCGGAGAAAGAAGAA



TCTTTTATTGGTAAAACTTATTTGGGTCTCGACGCGGGTGAAAGGAGCGGCGTCG



GTTTCGCTATTGTTACAGTGGACGGAGATGGGTACGAAGTGCAAAGATTGGGGGT



CCACGAGGATACACAGCTTATGGCCTTGCAGCAAGTTGCTAGTAAATCCTTAAAA



GAGCCAGTATTTCAGCCTCTAAGAAAAGGCACCTTTAGACAACAAGAAAGAATAC



GGAAATCCTTACGTGGTTGCTACTGGAATTTTTATCATGCCTTGATGATAAAATA



TAGGGCCAAAGTAGTACATGAGGAATCTGTCGGAAGTAGTGGTCTTGTGGGTCAA



TGGTTGAGGGCTTTTCAGAAGGATTTGAAGAAAGCCGATGTTCTCCCCAAGAAGG



GCGGTAAAAACGGTGTAGATAAGAAGAAGAGAGAGTCCTCAGCTCAAGACACTCT



TTGGGGTGGTGCTTTCTCTAAAAAGGAGGAGCAACAGATTGCGTTTGAGGTGCAA



GCTGCAGGTTCTTCGCAATTTTGTTTGAAGTGCGGATGGTGGTTCCAACTAGGCA



TGCGTGAAGTAAACAGGGTACAAGAATCGGGCGTCGTGTTAGATTGGAATAGAAG



CATAGTTACCTTTTTAATAGAATCATCCGGCGAAAAAGTTTATGGTTTCTCCCCA



CAGCAATTAGAGAAGGGTTTCAGACCAGACATCGAAACTTTTAAAAAGATGGTAA



GAGACTTTATGAGACCTCCTATGTTTGATAGAAAAGGCAGACCGGCCGCAGCTTA



CGAGAGATTTGTTTTAGGAAGGAGACATCGAAGGTACAGGTTTGATAAAGTATTT



GAGGAAAGATTTGGGAGGTCTGCTCTTTTCATTTGTCCTAGAGTAGGTTGTGGAA



ATTTTGACCACAGCTCCGAACAGTCCGCGGTTGTTTTGGCCTTGATCGGATATAT



TGCCGATAAGGAGGGAATGTCAGGTAAGAAGTTGGTTTATGTACGGCTGGCCGAA



CTTATGGCCGAATGGAAACTAAAAAAATTAGAAAGATCCAGAGTTGAAGAACAAT



CATCCGCTCAATAA





SEQ ID NO: 144
ATGGCAGAAAGCAAACAAATGCAGTGTAGGAAATGTGGAGCTAGTATGAAGTACG



AAGTCATCGGTTTGGGTAAAAAGTCATGTAGATACATGTGTCCCGATTGTGGCAA



CCATACCTCGGCAAGAAAGATACAAAACAAAAAAAAAAGAGATAAAAAATATGGG



TCAGCCAGTAAAGCCCAATCTCAAAGAATTGCTGTAGCAGGTGCTCTTTACCCTG



ACAAAAAAGTACAAACTATCAAAACCTATAAATATCCAGCAGACTTGAATGGTGA



GGTGCATGATAGCGGTGTTGCCGAGAAAATCGCACAAGCAATACAAGAGGACGAG



ATTGGACTTTTGGGACCAAGCTCAGAATATGCATGCTGGATTGCATCTCAAAAAC



AGTCTGAGCCTTACAGTGTAGTCGATTTCTGGTTTGATGCAGTGTGCGCAGGGGG



AGTCTTCGCCTACTCTGGCGCTAGATTATTGAGTACAGTTTTACAGTTATCCGGT



GAGGAATCGGTGCTTAGAGCTGCCTTAGCCTCGTCTCCATTCGTTGACGATATAA



ACTTAGCGCAAGCCGAAAAGTTTTTGGCGGTTAGCAGGCGTACAGGTCAAGATAA



GTTAGGTAAGAGAATTGGGGAGTGCTTTGCAGAAGGAAGATTGGAAGCTTTAGGG



ATAAAAGATAGAATGAGGGAATTTGTTCAAGCTATCGATGTTGCACAGACCGCCG



GACAACGTTTCGCTGCCAAATTGAAGATATTCGGTATAAGTCAGATGCCAGAAGC



TAAGCAATGGAATAACGATTCCGGACTGACTGTCTGTATACTACCTGATTATTAT



GTTCCCGAAGAGAATCGCGCGGACCAACTTGTAGTGTTGTTAAGAAGACTTCGCG



AGATTGCATATTGCATGGGTATTGAAGATGAAGCGGGTTTCGAACATCTTGGAAT



AGATCCTGGTGCTCTTTCGAATTTTTCAAACGGTAACCCTAAGAGAGGATTTCTA



GGGAGGCTGTTAAATAACGATATTATTGCGTTGGCAAACAATATGAGTGCGATGA



CTCCATATTGGGAAGGGCGTAAGGGTGAACTCATAGAAAGGCTTGCGTGGTTAAA



GCACAGGGCAGAAGGGCTGTATCTTAAAGAACCTCATTTCGGTAACTCCTGGGCC



GATCATAGGTCACGAATTTTCTCAAGGATCGCAGGCTGGTTATCTGGTTGCGCTG



GCAAGTTGAAAATTGCGAAAGACCAAATTTCTGGAGTACGTACAGATCTATTTCT



GCTAAAAAGACTGCTGGACGCAGTTCCGCAATCGGCGCCATCCCCCGATTTTATT



GCGTCAATTTCGGCACTTGACAGGTTTTTAGAAGCTGCAGAATCGAGCCAGGACC



CTGCTGAACAAGTGAGGGCTCTCTACGCTTTTCACTTGAACGCACCTGCAGTCCG



AAGTATAGCCAATAAAGCAGTGCAAAGGTCCGACAGCCAAGAATGGCTGATAAAA



GAACTAGACGCTGTTGACCATTTAGAATTTAACAAAGCGTTCCCATTTTTCTCTG



ACACAGGAAAAAAAAAAAAAAAAGGTGCTAATAGCAACGGTGCTCCATCGGAAGA



AGAGTACACTGAAACGGAATCAATACAACAACCTGAGGACGCGGAACAGGAAGTA



AACGGACAAGAAGGGAACGGAGCGTCTAAAAATCAAAAGAAATTTCAAAGAATAC



CTAGATTCTTCGGTGAAGGCTCCAGATCTGAATACAGAATTTTAACGGAAGCTCC



ACAGTATTTCGATATGTTTTGTAATAACATGAGGGCTATATTTATGCAGTTAGAA



AGTCAACCCCGTAAAGCTCCCAGAGATTTTAAATGTTTCCTACAAAATCGATTAC



AAAAATTATACAAACAGACTTTCTTGAATGCACGAAGCAACAAGTGTCGCGCTCT



GCTTGAGTCAGTTTTAATCTCTTGGGGAGAATTTTATACATACGGTGCCAACGAA



AAGAAATTTAGATTAAGACATGAAGCTTCAGAACGCAGCAGTGACCCAGATTACG



TAGTTCAGCAAGCCTTGGAAATCGCGCGTCGTCTATTCCTTTTTGGCTTCGAATG



GAGAGATTGCTCCGCTGGTGAAAGAGTGGATTTGGTTGAAATTCACAAAAAGGCT



ATCAGTTTTTTGTTGGCTATTACTCAAGCTGAGGTCTCTGTTGGTTCATACAATT



GGCTTGGCAACTCAACAGTATCGAGATATTTATCCGTTGCGGGAACTGATACCTT



ATACGGTACCCAATTGGAAGAATTCCTGAACGCTACAGTGTTGAGTCAAATGCGT



GGTCTGGCCATTAGATTGAGTTCTCAAGAACTTAAGGACGGTTTTGATGTGCAGC



TCGAGTCTTCCTGCCAGGACAATCTGCAACACCTATTGGTGTATAGGGCTTCGAG



AGATTTGGCGGCTTGCAAGCGCGCTACTTGTCCAGCCGAACTCGATCCTAAGATT



TTAGTTTTACCGGTAGGTGCATTCATCGCTTCCGTAATGAAAATGATAGAAAGAG



GTGACGAACCTTTAGCTGGTGCTTATTTACGGCATAGGCCACACTCTTTCGGATG



GCAAATTAGGGTCCGCGGTGTTGCTGAGGTAGGGATGGATCAGGGTACAGCATTG



GCCTTTCAAAAGCCAACAGAGTCAGAACCTTTTAAAATTAAGCCCTTCTCTGCAC



AGTATGGACCAGTTCTGTGGTTGAACAGTAGTAGTTATTCTCAATCACAATATTT



GGACGGTTTTCTATCTCAACCAAAAAATTGGAGTATGAGGGTGTTGCCTCAGGCG



GGTTCAGTTCGCGTCGAACAACGAGTTGCTTTGATATGGAACTTACAAGCAGGCA



AGATGAGACTAGAACGCTCCGGTGCGAGGGCCTTTTTCATGCCTGTACCGTTTTC



ATTTAGGCCATCCGGCAGTGGGGACGAAGCAGTTTTGGCGCCCAACCGGTACTTG



GGTCTGTTCCCTCATTCCGGAGGTATAGAATACGCTGTAGTGGATGTCCTGGATT



CTGCTGGATTTAAAATTCTTGAAAGAGGCACTATTGCTGTCAATGGTTTCTCTCA



GAAAAGGGGAGAGCGCCAAGAAGAAGCCCATCGTGAAAAACAAAGAAGGGGGATA



AGTGATATAGGGCGAAAGAAGCCTGTGCAGGCAGAAGTCGATGCGGCGAACGAAT



TGCATAGAAAGTACACTGATGTTGCCACAAGATTAGGTTGTAGAATCGTCGTTCA



ATGGGCACCACAACCTAAACCAGGGACAGCACCGACAGCGCAAACTGTTTACGCG



AGGGCTGTTAGGACAGAAGCTCCGAGGAGCGGCAACCAAGAAGATCATGCAAGAA



TGAAAAGTTCTTGGGGTTACACCTGGGGTACGTATTGGGAGAAACGAAAACCAGA



AGATATTTTAGGGATTTCTACACAGGTGTATTGGACAGGAGGTATAGGCGAATCC



TGTCCTGCTGTAGCAGTCGCTTTATTAGGTCATATTAGAGCAACTTCAACACAAA



CGGAGTGGGAAAAGGAAGAAGTTGTCTTTGGAAGACTGAAGAAGTTCTTTCCGAG



TTAA





SEQ ID NO: 145
ATGGAGAAGAGAATTAATAAGATACGGAAAAAATTATCTGCGGATAATGCAACAA



AGCCAGTCTCTCGTTCAGGCCCCATGAAAACCCTGCTTGTAAGAGTAATGACGGA



TGATTTAAAAAAGAGGTTGGAAAAGCGTAGAAAAAAACCAGAAGTGATGCCGCAA



GTGATCTCAAATAACGCAGCTAATAATCTAAGGATGCTACTTGATGATTATACAA



AAATGAAAGAAGCAATCCTGCAAGTTTACTGGCAGGAATTCAAGGATGACCATGT



TGGACTAATGTGCAAATTCGCACAACCAGCGTCTAAGAAAATTGACCAAAATAAA



TTGAAACCCGAAATGGACGAAAAAGGGAATTTAACAACTGCCGGGTTTGCCTGCT



CGCAATGTGGGCAACCATTATTTGTTTATAAATTAGAGCAGGTTTCGGAAAAAGG



AAAGGCTTACACAAATTACTTCGGCAGATGTAATGTTGCCGAACACGAAAAACTC



ATATTGTTAGCTCAGTTGAAGCCTGAGAAAGACTCTGATGAGGCCGTTACTTACT



CGTTGGGGAAGTTTGGTCAAAGAGCTCTCGATTTTTATTCTATTCATGTGACAAA



GGAGTCCACACATCCCGTCAAGCCCTTGGCACAAATTGCGGGTAATAGATACGCT



TCGGGTCCAGTTGGGAAGGCCCTTTCTGATGCATGTATGGGCACAATTGCTAGCT



TTCTTAGTAAATACCAGGATATCATAATAGAGCATCAAAAAGTTGTAAAGGGTAA



CCAAAAGAGATTAGAATCGCTGCGTGAGTTGGCGGGTAAAGAAAACTTGGAATAT



CCATCTGTCACTCTGCCTCCTCAACCTCATACTAAGGAAGGTGTAGATGCGTACA



ATGAAGTTATCGCTAGAGTCCGTATGTGGGTGAATTTAAATTTGTGGCAAAAATT



GAAGTTATCGCGTGATGATGCAAAACCTCTTCTTAGACTAAAGGGCTTTCCTAGC



TTCCCTGTAGTGGAAAGACGCGAAAATGAAGTCGATTGGTGGAATACAATTAACG



AAGTCAAAAAACTGATCGATGCAAAGCGAGATATGGGTCGAGTTTTTTGGTCTGG



TGTTACAGCTGAAAAAAGGAATACGATCTTAGAAGGTTACAACTACTTGCCAAAT



GAGAACGATCATAAAAAAAGAGAAGGCAGTTTAGAAAATCCAAAAAAGCCAGCTA



AGAGACAATTTGGTGATTTGCTACTTTACCTAGAAAAAAAGTACGCCGGAGATTG



GGGGAAAGTCTTTGACGAAGCTTGGGAGAGAATAGATAAAAAAATAGCAGGATTG



ACGTCACACATTGAAAGAGAAGAGGCGAGAAATGCAGAAGATGCTCAGTCCAAAG



CTGTCCTCACCGACTGGTTGAGAGCCAAAGCGTCCTTTGTTCTCGAACGCCTAAA



AGAAATGGATGAGAAGGAATTTTATGCCTGCGAAATCCAGCTACAAAAATGGTAC



GGAGACTTGAGAGGTAACCCCTTTGCCGTGGAAGCAGAGAACCGTGTTGTAGATA



TCTCCGGTTTCTCAATCGGTAGCGATGGACACTCCATTCAGTATCGCAACTTGTT



GGCCTGGAAATATTTGGAAAACGGTAAGAGGGAATTCTATTTACTTATGAATTAT



GGCAAGAAAGGTAGAATCAGGTTTACTGACGGAACAGACATTAAAAAGAGTGGTA



AGTGGCAAGGCCTTTTGTACGGTGGTGGCAAGGCCAAAGTAATAGACTTAACATT



TGACCCCGACGACGAACAACTGATAATACTGCCTTTAGCTTTTGGTACTCGACAG



GGGCGAGAGTTCATTTGGAATGATCTTTTGTCACTCGAGACTGGTTTGATAAAAC



TTGCAAATGGAAGAGTCATCGAGAAGACAATTTACAACAAAAAGATAGGTCGCGA



TGAGCCTGCACTATTTGTGGCCTTGACCTTTGAGAGAAGGGAAGTTGTCGACCCA



TCCAATATTAAACCAGTCAACCTAATCGGTGTAGATAGAGGTGAAAACATCCCAG



CTGTTATCGCTCTGACAGACCCTGAAGGTTGCCCTTTGCCAGAATTTAAAGATTC



GTCTGGTGGACCAACAGATATATTACGTATTGGGGAAGGCTATAAAGAGAAACAA



CGTGCTATTCAGGCTGCAAAAGAAGTTGAACAGAGGAGAGCTGGAGGTTACAGTA



GAAAATTCGCCAGTAAAAGTAGAAACTTAGCAGATGACATGGTTAGAAACTCTGC



CCGGGATTTGTTCTATCATGCGGTTACTCACGATGCAGTCTTAGTCTTTGAAAAT



CTATCGCGCGGTTTTGGTAGGCAAGGCAAGAGGACTTTTATGACAGAGAGACAAT



ATACAAAAATGGAAGATTGGTTAACCGCGAAGCTCGCATATGAAGGTCTTACTTC



GAAAACGTACCTCAGCAAAACGCTGGCTCAATATACTTCTAAAACTTGTTCAAAT



TGTGGTTTTACTATTACCACGGCAGACTACGACGGGATGTTGGTGAGATTGAAGA



AGACGAGCGATGGTTGGGCAACAACATTGAATAATAAGGAATTAAAAGCAGAAGG



ACAGATTACGTATTACAATCGTTATAAACGCCAAACGGTTGAGAAAGAGTTGTCA



GCCGAGTTGGATAGACTAAGTGAAGAGAGCGGTAACAATGATATCTCAAAGTGGA



CTAAAGGGAGGCGGGATGAAGCCCTCTTTTTACTAAAGAAGAGATTCTCACATAG



ACCTGTGCAAGAACAATTCGTTTGTTTAGATTGTGGCCATGAGGTTCATGCAGAC



GAACAGGCTGCGTTAAATATTGCGAGAAGCTGGCTATTTCTAAATTCTAATTCAA



CAGAGTTCAAGAGCTATAAATCCGGAAAACAACCTTTCGTAGGCGCGTGGCAAGC



CTTCTATAAAAGGAGATTAAAAGAGGTTTGGAAACCAAATGCA





SEQ ID NO: 146
ATGAAAAGAATTAACAAAATTAGAAGGAGGCTGGTCAAAGATTCTAATACCAAGA



AAGCTGGTAAGACTGGTCCGATGAAAACCCTATTAGTCAGAGTTATGACCCCAGA



TTTGAGAGAAAGATTGGAGAACCTCAGGAAAAAGCCCGAAAACATCCCACAACCC



ATTAGTAACACATCAAGAGCTAATTTAAACAAGTTATTAACTGACTACACTGAAA



TGAAAAAAGCAATATTGCATGTTTACTGGGAAGAGTTCCAGAAAGATCCTGTTGG



GTTGATGTCTAGAGTTGCTCAACCGGCCCCAAAGAATATAGATCAAAGGAAACTT



ATTCCTGTGAAGGACGGCAATGAAAGATTAACCAGCTCCGGTTTCGCTTGCTCCC



AGTGCTGCCAACCCCTGTATGTATACAAACTGGAACAAGTAAATGATAAAGGTAA



GCCACATACTAACTACTTTGGTAGGTGTAATGTATCCGAGCATGAAAGATTGATC



TTGTTAAGTCCCCATAAACCAGAAGCTAATGATGAGTTAGTAACTTATAGTTTAG



GTAAGTTCGGACAACGAGCTTTAGATTTCTATAGCATCCATGTTACAAGAGAAAG



CAATCACCCCGTCAAACCACTGGAACAAATCGGTGGTAATAGTTGTGCGTCAGGT



CCAGTAGGCAAAGCTTTATCAGACGCTTGCATGGGTGCCGTGGCTAGTTTTTTGA



CGAAATACCAAGATATTATACTGGAACATCAAAAGGTAATTAAAAAGAATGAAAA



GAGACTCGCTAACTTAAAAGATATTGCAAGTGCCAATGGTTTAGCTTTTCCTAAA



ATTACCTTGCCACCTCAGCCACATACAAAGGAGGGAATTGAAGCTTACAATAATG



TAGTAGCCCAAATAGTTATTTGGGTGAACCTTAACCTATGGCAAAAGTTAAAAAT



TGGTAGAGACGAAGCCAAACCCCTGCAGAGGCTGAAGGGTTTTCCCTCCTTCCCC



TTAGTAGAGAGACAAGCTAATGAAGTGGACTGGTGGGATATGGTGTGCAATGTTA



AAAAATTGATTAATGAGAAGAAAGAGGATGGTAAAGTGTTTTGGCAGAATCTTGC



TGGCTACAAGAGACAGGAAGCTTTACTGCCTTATTTATCTTCTGAGGAAGATAGG



AAAAAAGGTAAAAAATTTGCTAGATATCAATTCGGAGACCTACTTCTGCATTTAG



AAAAAAAACATGGCGAAGATTGGGGTAAAGTTTATGATGAAGCCTGGGAAAGAAT



TGATAAGAAGGTAGAAGGTCTCTCCAAACATATTAAATTAGAGGAAGAACGTAGG



TCCGAAGACGCTCAATCAAAGGCAGCATTAACTGATTGGTTGAGAGCAAAAGCCT



CTTTCGTTATTGAAGGATTAAAAGAAGCCGACAAAGATGAATTTTGTAGATGTGA



GTTAAAGTTGCAAAAGTGGTATGGAGACCTCCGTGGTAAACCTTTTGCTATTGAG



GCTGAAAATTCTATACTCGATATCTCTGGATTTTCAAAACAATATAACTGCGCAT



TTATATGGCAGAAAGATGGTGTTAAAAAGCTAAATCTATACTTAATTATCAATTA



CTTTAAAGGTGGTAAATTGCGTTTTAAGAAGATAAAGCCTGAAGCCTTTGAGGCA



AACCGTTTTTACACTGTTATCAATAAAAAATCTGGGGAAATCGTACCAATGGAAG



TTAATTTCAATTTCGATGATCCTAATCTTATTATTTTACCTCTTGCTTTCGGCAA



AAGGCAAGGTAGGGAGTTTATTTGGAATGATTTATTGTCGCTGGAAACGGGGTCT



CTCAAACTCGCAAACGGTAGGGTGATAGAAAAAACATTATACAACAGGAGAACTC



GGCAGGATGAGCCAGCTCTTTTTGTGGCTCTGACATTCGAGAGAAGGGAAGTTTT



AGATTCATCTAACATCAAACCAATGAATTTAATAGGTATTGACCGGGGTGAAAAT



ATACCTGCAGTTATTGCTTTAACTGATCCTGAGGGATGTCCTCTTAGCAGATTCA



AGGACTCGTTGGGTAACCCTACTCACATCTTAAGGATTGGAGAAAGTTACAAGGA



GAAACAAAGGACAATACAAGCTGCTAAAGAAGTAGAACAAAGGAGGGGGGTGGAT



ATAGTCGGAAATATGCCAGCAAGGCCAAGAATTTAGCTGACGACATGGTTAGGAA



TACAGCTAGAGACCTTTTATACTATGCCGTCACCCAGGATGCCATGTTGATATTT



GAAAATTTAAGTAGAGGCTTCGGTAGACAAGGTAAGCGCACCTTCATGGCAGAGA



GACAATATACTAGAATGGAAGATTGGTTGACTGCCAAATTGGCATACGAAGGTCT



ACCTAGTAAGACGTACTTATCTAAAACACTAGCGCAGTATACTTCCAAGACATGC



AGTAATTGTGGTTTCACAATCACTTCTGCCGATTACGATCGCGTCTTGGAAAAAC



TAAAAAAAACAGCGACAGGTTGGATGACTACTATTAATGGGAAAGAATTGAAGGT



CGAAGGACAAATAACTTACTATAATAGATATAAACGGCAAAACGTTGTAAAAGAC



CTGTCAGTCGAACTCGATCGACTTAGTGAAGAATCTGTTAATAATGATATTAGTT



CGTGGACAAAAGGTAGATCCGGTGAAGCTTTGAGCCTCCTGAAAAAACGTTTTAG



CCATAGGCCTGTCCAAGAAAAGTTTGTATGTTTAAACTGTGGTTTTGAGACCCAT



GCAGACGAGCAGGCCGCTCTTAATATTGCTAGATCATGGTTATTTTTAAGATCTC



AGGAATACAAGAAGTACCAGACTAACAAGACAACAGGCAACACAGATAAGCGAGC



ATTCGTTGAGACTTGGCAATCTTTTTATAGAAAGAAATTGAAGGAAGTCTGGAAA



CCA





SEQ ID NO: 147
ATGGGAAAAATGTATTATCTAGGCCTGGACATAGGGACCAATTCAGTAGGCTACG



CTGTCACTGACCCCTCCTACCATTTGCTGAAGTTCAAGGGGGAACCCATGTGGGG



AGCACACGTGTTTGCGGCCGGCAACCAGAGCGCAGAGCGGAGAAGCTTCCGCACC



TCCAGGAGAAGGCTGGATCGCAGGCAGCAGCGTGTGAAGCTGGTCCAAGAGATAT



TTGCCCCAGTGATTTCCCCCATCGATCCGCGCTTCTTTATTAGGCTCCACGAGTC



CGCTCTCTGGCGCGACGACGTGGCCGAAACTGATAAACATATTTTCTTTAATGAC



CCAACATACACTGACAAGGAGTACTATTCAGATTACCCAACAATTCACCATTTGA



TCGTGGACCTTATGGAAAGTTCGGAGAAGCATGATCCTCGACTTGTCTATTTGGC



CGTGGCGTGGCTCGTGGCACATAGGGGCCACTTCTTGAACGAGGTGGACAAGGAT



AACATCGGGGATGTGTTATCTTTCGACGCTTTCTATCCTGAATTCCTTGCTTTTC



TGTCTGACAATGGCGTCAGCCCGTGGGTCTGCGAATCCAAGGCCCTCCAGGCTAC



GCTATTGTCAAGAAATAGCGTGAACGACAAGTACAAGGCTCTTAAGTCTTTGATT



TTTGGAAGCCAGAAGCCCGAGGACAACTTTGATGCAAATATCTCGGAGGACGGGC



TGATTCAGCTCCTCGCTGGGAAAAAGGTCAAGGTCAATAAGCTGTTTCCACAGGA



GTCAAATGACGCGAGCTTCACCCTTAACGACAAAGAGGATGCCATTGAAGAGATC



CTGGGGACACTCACCCCAGACGAGTGCGAGTGGATAGCCCATATTAGGCGCCTCT



TTGATTGGGCCATAATGAAACATGCGCTTAAGGACGGGCGCACGATATCCGAAAG



CAAGGTCAAATTGTACGAGCAGCACCACCATGATCTGACCCAGCTAAAATATTTT



GTAAAAACATATCTGGCCAAGGAGTACGATGATATCTTCCGCAACGTGGATAGTG



AGACCACCAAAAACTACGTCGCGTACTCATACCACGTGAAAGAAGTTAAGGGCAC



GCTGCCTAAGAACAAGGCAACACAAGAGGAGTTCTGCAAGTACGTTCTCGGGAAA



GTTAAAAATATAGAGTGCAGCGAGGCCGACAAAGTGGATTTTGACGAGATGATTC



AACGCCTGACCGACAATTCGTTTATGCCTAAACAGGTGAGTGGAGAGAATCGCGT



GATTCCATATCAGCTCTATTACTATGAACTCAAGACTATTCTGAATAAGGCCGCT



AGCTATTTACCCTTCCTTACGCAGTGCGGGAAGGATGCCATTTCTAACCAGGATA



AACTCTTGAGTATAATGACATTTCGAATTCCCTATTTCGTGGGTCCGCTTCGTAA



GGATAACAGTGAGCACGCTTGGCTGGAGCGGAAGGCTGGCAAAATTTATCCATGG



AATTTCAACGACAAGGTGGATCTGGACAAATCCGAAGAAGCCTTTATCCGCAGGA



TGACCAATACTTGCACATACTATCCTGGGGAGGATGTCCTTCCACTGGACTCTCT



GATCTACGAAAAGTTCATGATTTTGAATGAAATTAACAACATAAGGATCGATGGG



TATCCTATTTCCGTCGACGTGAAGCAGCAGGTGTTCGGGCTCTTTGAGAAGAAGC



GACGGGTGACCGTGAAGGATATTCAGAATCTTCTCTTATCGCTGGGAGCCCTGGA



TAAACACGGAAAACTGACCGGGATAGATACTACGATTCATTCTAATTACAACACG



TATCACCATTTTAAGTCACTGATGGAGAGGGGCGTCCTAACAAGAGATGACGTGG



AGAGAATAGTGGAACGAATGACATATTCTGATGACACCAAGAGAGTGCGGCTTTG



GCTGAATAACAACTACGGCACTCTGACGGCGGATGATGTAAAGCATATTTCCCGA



CTCCGTAAGCATGACTTCGGGCGGCTGTCTAAGATGTTTCTAACAGGCCTCAAGG



GTGTGCATAAGGAAACTGGGGAGCGCGCTAGCATCCTGGATTTTATGTGGAACAC



CAATGATAACCTGATGCAGCTCCTGTCAGAATGCTACACATTTTCGGACGAAATC



ACCAAGCTGCAGGAGGCTTACTATGCCAAGGCCCAACTAAGCTTGAATGATTTCC



TGGATTCTATGTACATCAGCAACGCCGTAAAACGACCAATTTATAGGACACTGGC



AGTGGTTAACGACATTAGGAAAGCATGCGGAACAGCTCCCAAGCGAATCTTTATC



GAGATGGCCCGCGACGGCGAGAGTAAGAAGAAAAGGTCAGTGACTAGGCGGGAGC



AGATCAAGAACCTTTACCGCTCTATCCGAAAAGACTTCCAGCAAGAGGTTGATTT



CCTTGAGAAGATCTTAGAGAACAAGTCAGATGGACAGCTCCAATCCGATGCTCTG



TATCTGTACTTCGCTCAGCTGGGACGAGATATGTACACTGGCGACCCCATTAAAC



TAGAACATATCAAGGACCAATCGTTTTATAATATCGACCACATCTACCCTCAGTC



CATGGTGAAAGACGATAGTCTGGACAATAAGGTGCTCGTCCAAAGTGAGATTAAC



GGAGAAAAGTCGAGCAGATATCCTTTGGACGCTGCGATCCGCAACAAGATGAAGC



CCCTGTGGGATGCTTACTACAATCATGGACTGATCAGCCTGAAGAAGTATCAGAG



ACTGACCCGGAGTACCCCTTTCACAGACGATGAGAAGTGGGATTTTATCAATAGA



CAACTGGTGGAAACCAGGCAGTCCACGAAAGCTCTGGCCATTCTTCTGAAGAGAA



AGTTTCCAGACACAGAGATCGTCTATTCAAAGGCCGGCCTCAGTTCCGACTTTAG



ACATGAGTTCGGACTCGTTAAATCACGAAATATAAACGATCTCCACCATGCAAAG



GACGCATTCCTCGCGATTGTGACTGGAAATGTCTATCACGAAAGATTTAATAGGC



GGTGGTTCATGGTTAACCAGCCATACTCAGTGAAGACCAAGACCCTTTTCACTCA



CTCTATTAAAAATGGCAACTTCGTGGCTTGGAATGGTGAGGAGGATCTTGGAAGA



ATTGTGAAGATGTTAAAACAGAATAAGAATACCATCCACTTTACTAGATTCAGCT



TTGACCGAAAAGAGGGGCTATTCGATATTCAACCGTTAAAGGCTTCAACAGGTCT



CGTTCCACGAAAGGCCGGACTGGACGTAGTGAAATACGGCGGCTATGATAAGAGC



ACCGCAGCTTACTACCTCCTTGTGCGATTTACGCTCGAGGATAAGAAGACCCAAC



ACAAGCTGATGATGATTCCCGTGGAGGGACTGTACAAAGCTCGAATTGACCATGA



TAAAGAGTTTCTCACAGATTACGCACAAACCACCATCTCTGAGATTCTCCAGAAA



GACAAACAAAAAGTTATAAACATAATGTTTCCAATGGGTACAAGGCATATTAAAC



TGAACAGCATGATCTCCATTGATGGCTTTTATTTGTCCATTGGAGGAAAGTCTAG



TAAAGGCAAGTCTGTCCTCTGCCATGCCATGGTACCCCTAATCGTCCCACACAAG



ATTGAATGCTACATCAAGGCTATGGAGAGTTTTGCTCGGAAATTTAAAGAGAATA



ATAAGCTGCGTATTGTGGAAAAATTCGACAAGATAACCGTTGAAGACAATCTGAA



TCTGTACGAGCTCTTTCTGCAGAAGCTGCAGCATAACCCCTATAATAAGTTCTTC



TCCACACAGTTCGATGTACTGACCAACGGGCGATCAACTTTCACAAAGCTAAGTC



CTGAGGAACAGGTGCAAACACTCCTAAACATTCTTTCCATTTTTAAGACCTGCAG



ATCTTCAGGATGCGACTTGAAGAGCATTAACGGGAGCGCACAGGCAGCTAGGATC



ATGATCTCAGCTGACCTGACAGGGCTGAGTAAAAAATACTCCGACATTCGGCTTG



TAGAGCAAAGCGCCAGTGGGTTGTTCGTTAGTAAGTCGCAGAACCTGCTGGAATA



CCTGTAA





SEQ ID NO: 148
ATGTCTTCTTTGACGAAGTTTACAAACAAATACTCTAAGCAGCTTACAATTAAGA



ACGAACTGATTCCCGTAGGAAAGACTCTGGAAAACATCAAAGAGAATGGGCTGAT



AGACGGCGACGAACAACTGAATGAGAACTATCAGAAGGCCAAAATTATCGTGGAT



GACTTCCTGAGGGATTTTATTAACAAGGCCCTGAATAATACCCAGATCGGCAATT



GGCGGGAACTGGCCGACGCTCTGAACAAAGAAGATGAGGACAATATCGAAAAATT



ACAAGACAAAATCAGGGGCATTATTGTCAGTAAGTTCGAGACATTCGATCTGTTC



TCTTCGTACTCCATTAAGAAGGACGAGAAAATCATCGATGATGACAATGACGTTG



AGGAAGAAGAACTGGACTTGGGTAAAAAGACCTCATCCTTCAAGTATATTTTTAA



AAAAAATCTGTTTAAATTAGTGCTCCCCAGTTATTTAAAGACAACTAACCAGGAC



AAGCTTAAGATTATCTCCTCTTTTGACAACTTTAGCACCTATTTTAGAGGCTTCT



TTGAAAATCGCAAGAATATTTTCACTAAGAAGCCCATAAGCACCTCTATTGCCTA



CAGAATCGTACATGATAACTTCCCAAAATTTTTGGATAACATTAGATGTTTTAAT



GTATGGCAGACCGAATGTCCTCAGTTAATTGTGAAGGCGGATAACTACCTCAAAT



CCAAGAATGTGATCGCCAAAGATAAGTCTCTTGCTAACTACTTTACGGTCGGAGC



CTACGATTACTTCTTATCTCAAAACGGTATTGACTTTTACAATAACATTATCGGG



GGATTGCCTGCCTTCGCCGGCCATGAGAAAATTCAGGGCTTAAACGAGTTCATAA



ATCAGGAATGTCAAAAGGACTCAGAGCTGAAATCAAAGCTTAAGAATCGACACGC



ATTTAAAATGGCGGTCTTGTTCAAACAGATCCTCAGCGATAGAGAGAAAAGCTTC



GTTATTGATGAATTCGAGAGCGACGCACAGGTGATTGATGCCGTGAAGAACTTCT



ATGCGGAACAGTGTAAAGACAATAATGTTATTTTCAACCTATTAAACTTGATTAA



GAATATCGCGTTTTTAAGTGACGATGAACTCGACGGTATCTTTATAGAAGGCAAG



TACCTGTCCTCTGTCAGCCAAAAACTCTACTCAGATTGGTCCAAGCTAAGAAATG



ACATCGAGGACAGTGCTAACAGCAAACAGGGCAATAAAGAGCTGGCAAAGAAAAT



CAAGACTAATAAAGGGGATGTGGAGAAGGCGATATCTAAATATGAGTTCTCCCTC



TCCGAACTGAACTCCATCGTCCACGATAATACCAAGTTTAGTGATCTGTTGTCGT



GTACACTGCACAAAGTGGCCAGTGAAAAACTCGTCAAGGTGAACGAAGGCGATTG



GCCCAAACACCTGAAAAATAATGAGGAGAAACAGAAGATCAAAGAACCTTTGGAT



GCGTTGCTCGAAATATATAACACACTGTTGATCTTCAACTGTAAAAGCTTCAACA



AGAACGGGAACTTTTATGTAGACTACGATCGATGTATAAATGAACTGAGCAGCGT



CGTTTACCTGTACAACAAGACTCGCAATTATTGTACGAAAAAACCATATAACACC



GATAAGTTCAAGCTTAATTTCAACAGTCCCCAGCTGGGAGAAGGGTTCAGCAAAT



CAAAAGAAAACGATTGCCTGACATTACTCTTTAAAAAGGATGATAATTATTATGT



TGGGATTATTAGGAAAGGCGCTAAGATCAACTTTGACGACACACAGGCCATAGCT



GACAACACTGATAACTGCATCTTTAAAATGAATTACTTTCTGTTGAAGGACGCCA



AAAAATTCATTCCAAAATGCTCTATTCAGCTCAAGGAGGTTAAGGCCCATTTCAA



GAAGTCTGAAGATGACTACATCCTCTCTGACAAGGAAAAATTCGCTAGTCCTCTG



GTTATCAAAAAAAGTACCTTCTTGCTGGCTACAGCTCACGTGAAAGGCAAGAAAG



GGAACATTAAGAAGTTCCAAAAGGAATACAGCAAAGAGAATCCAACCGAGTACAG



AAATTCTCTGAACGAATGGATCGCATTCTGTAAAGAATTTCTAAAGACGTACAAG



GCCGCTACCATTTTCGATATTACCACCTTGAAAAAAGCCGAGGAGTACGCCGACA



TCGTCGAATTCTATAAAGACGTGGATAACCTGTGTTACAAATTGGAATTCTGCCC



AATTAAGACCTCTTTCATTGAAAACCTCATCGACAATGGGGACCTCTACTTATTT



AGAATTAACAATAAGGATTTTTCTTCGAAATCTACCGGAACTAAAAATCTGCACA



CACTGTATCTGCAAGCAATCTTCGATGAACGTAATCTCAACAACCCTACAATAAT



GCTGAACGGCGGTGCTGAACTGTTCTACCGTAAAGAGAGTATTGAACAGAAGAAT



CGAATCACACACAAAGCGGGCAGTATTCTCGTCAATAAGGTGTGCAAAGACGGGA



CCAGCCTGGACGATAAGATCAGGAATGAAATATATCAGTATGAGAACAAGTTTAT



CGACACCTTGTCGGATGAGGCAAAGAAGGTGCTACCTAACGTTATCAAGAAGGAA



GCTACCCATGACATAACCAAGGATAAGCGGTTCACTTCTGACAAGTTCTTCTTCC



ACTGTCCTCTGACCATTAACTACAAGGAAGGAGACACTAAACAATTCAATAATGA



AGTACTTAGCTTTTTGCGGGGTAATCCCGATATTAACATAATTGGTATCGACCGG



GGAGAACGGAACCTGATATACGTGACAGTAATTAATCAGAAAGGAGAAATCCTGG



ATTCCGTATCCTTCAATACCGTGACTAATAAATCTAGTAAAATCGAGCAGACGGT



CGACTACGAGGAAAAGTTAGCAGTCAGAGAGAAGGAGAGAATCGAGGCCAAACGT



TCCTGGGATAGTATCAGCAAGATTGCTACTCTGAAAGAAGGATATCTGTCCGCTA



TCGTCCATGAGATCTGTTTGTTGATGATCAAGCACAATGCTATAGTGGTTCTGGA



GAACCTGAACGCAGGCTTCAAGCGAATTAGAGGGGGCCTGTCGGAAAAAAGCGTT



TACCAGAAGTTTGAAAAGATGCTAATCAATAAGTTAAATTACTTTGTAAGTAAAA



AAGAAAGCGATTGGAATAAGCCATCAGGACTTTTAAACGGGCTGCAACTGAGCGA



CCAGTTTGAGTCATTCGAAAAACTGGGTATTCAGAGTGGTTTCATATTCTACGTA



CCTGCCGCTTACACTTCAAAGATCGATCCTACAACTGGTTTTGCGAATGTCCTGA



ATCTGTCTAAGGTGAGGAATGTGGACGCAATCAAGTCTTTCTTCAGCAACTTCAA



CGAGATATCTTACAGCAAGAAAGAGGCTCTGTTTAAATTCAGTTTTGATCTGGAT



AGCCTGAGCAAGAAAGGATTCTCTTCTTTCGTAAAGTTTTCTAAGTCCAAATGGA



ACGTCTACACGTTCGGAGAGAGAATCATTAAACCAAAGAACAAGCAGGGGTATCG



GGAAGACAAAAGGATCAATCTGACTTTCGAAATGAAGAAACTATTGAATGAGTAC



AAAGTCTCATTCGATTTGGAGAACAATCTGATCCCCAATCTGACCAGCGCTAACC



TCAAAGACACATTCTGGAAGGAGCTGTTTTTCATCTTTAAGACCACCCTGCAGCT



ACGGAATAGTGTCACAAATGGGAAAGAGGATGTACTGATCTCACCTGTGAAAAAC



GCCAAGGGGGAGTTCTTTGTGTCCGGCACCCATAACAAAACCCTGCCTCAGGACT



GTGACGCGAACGGGGCCTACCACATCGCGCTAAAGGGGTTAATGATTCTCGAACG



TAATAATCTGGTGCGCGAAGAAAAAGACACAAAGAAAATTATGGCCATCAGCAAC



GTTGACTGGTTTGAGTACGTGCAGAAGCGTCGAGGAGTTTTGTAA





SEQ ID NO: 149
ATGAACAACTATGACGAGTTCACTAAACTTTACCCCATTCAGAAAACCATCAGAT



TTGAACTGAAGCCTCAGGGTCGTACCATGGAACACTTGGAAACTTTCAACTTTTT



CGAGGAGGACAGGGATAGAGCTGAGAAATACAAGATCTTGAAAGAGGCCATCGAC



GAGTATCACAAAAAATTCATCGATGAGCATCTCACCAACATGTCGCTGGATTGGA



ACAGTCTCAAGCAGATTTCCGAGAAGTACTATAAATCTCGGGAGGAGAAAGATAA



AAAGGTGTTTTTGAGCGAGCAAAAGCGAATGCGACAGGAGATAGTCTCTGAATTT



AAGAAAGATGATCGGTTTAAAGACCTATTTTCCAAAAAGCTTTTTTCAGAGCTGC



TGAAGGAAGAGATCTATAAAAAAGGCAATCACCAAGAAATTGATGCCCTGAAATC



ATTCGACAAATTCAGTGGGTATTTCATAGGACTGCATGAGAACCGGAAGAATATG



TATAGTGATGGAGACGAGATCACAGCCATAAGCAATCGAATCGTTAACGAGAATT



TCCCGAAGTTCCTGGATAACCTGCAGAAGTATCAAGAGGCTAGGAAAAAGTACCC



TGAGTGGATCATCAAGGCTGAATCAGCTCTGGTGGCTCACAATATCAAGATGGAT



GAAGTCTTTAGTCTTGAGTACTTTAATAAAGTCCTTAACCAGGAGGGCATCCAGC



GCTATAACCTGGCTCTCGGTGGCTACGTCACAAAAAGCGGAGAAAAGATGATGGG



TCTCAACGATGCACTGAATTTGGCTCATCAGTCGGAGAAGTCATCTAAGGGACGC



ATACACATGACACCACTGTTTAAACAAATCCTGAGCGAAAAGGAATCATTTTCCT



ACATTCCCGACGTATTCACCGAGGACTCACAACTGCTGCCTAGTATAGGGGGGTT



TTTCGCTCAGATAGAGAACGACAAAGATGGCAACATTTTTGACAGAGCCTTGGAG



TTGATTTCATCTTACGCCGAGTACGATACGGAGCGCATTTATATTCGCCAGGCGG



ATATCAACAGGGTTTCCAATGTGATCTTTGGCGAGTGGGGAACGCTGGGCGGGCT



GATGCGGGAATACAAAGCCGACTCGATCAATGACATCAACCTGGAGAGAACATGC



AAGAAGGTCGATAAATGGTTGGATAGCAAAGAGTTCGCCCTGAGTGACGTCTTGG



AAGCTATCAAAAGAACCGGAAATAATGACGCGTTCAACGAGTATATCTCTAAAAT



GAGGACCGCGAGAGAAAAAATTGATGCAGCAAGGAAGGAGATGAAGTTTATATCT



GAGAAGATCTCAGGCGATGAAGAGTCCATCCATATTATTAAAACTCTTCTGGACT



CAGTGCAGCAATTCCTGCACTTTTTTAACCTCTTCAAGGCCAGGCAGGATATACC



GTTAGACGGGGCTTTTTATGCCGAGTTTGATGAAGTTCATTCGAAACTTTTTGCT



ATAGTGCCTCTCTATAATAAAGTTCGCAATTACCTGACAAAGAATAACTTAAACA



CAAAGAAAATCAAGCTCAACTTCAAAAACCCAACACTGGCAAACGGATGGGATCA



GAACAAGGTATATGATTACGCCTCATTGATTTTCCTCCGGGACGGGAATTACTAT



CTGGGGATCATCAACCCTAAGCGCAAAAAGAACATTAAGTTCGAACAGGGATCTG



GCAATGGTCCCTTCTATAGGAAAATGGTATACAAACAGATTCCTGGCCCCAACAA



GAATCTCCCACGCGTCTTTCTGACGTCCACTAAGGGAAAGAAGGAGTACAAGCCG



TCTAAAGAAATTATCGAGGGCTATGAGGCAGACAAGCATATTAGGGGTGACAAGT



TTGACCTAGACTTTTGTCATAAGCTTATCGACTTTTTCAAGGAGTCCATAGAGAA



GCACAAAGATTGGTCAAAGTTTAATTTCTATTTTTCTCCAACAGAGTCCTACGGG



GATATCTCTGAGTTCTATCTGGATGTTGAAAAGCAGGGGTACAGAATGCACTTCG



AAAATATCTCAGCAGAAACTATCGATGAGTACGTAGAGAAAGGAGATCTGTTTCT



TTTCCAAATCTACAATAAGGATTTTGTGAAGGCCGCCACTGGGAAGAAGGACATG



CACACTATTTACTGGAACGCTGCATTTTCCCCTGAAAATCTGCAGGACGTAGTAG



TGAAATTAAATGGTGAGGCAGAACTGTTTTACCGCGATAAATCAGACATCAAGGA



AATAGTGCACCGGGAAGGCGAGATTCTTGTTAACCGAACATATAATGGCAGGACA



CCTGTCCCTGATAAAATTCATAAGAAACTGACCGATTACCACAACGGTCGAACCA



AGGATCTGGGCGAGGCCAAGGAATACCTCGATAAGGTGAGGTACTTCAAAGCCCA



TTATGACATCACCAAGGACCGAAGATACCTTAACGACAAAATCTACTTCCATGTC



CCACTCACCTTGAACTTCAAAGCTAACGGTAAGAAGAACCTCAATAAAATGGTGA



TTGAAAAATTTCTGTCCGATGAGAAGGCCCATATCATCGGCATTGATCGCGGCGA



GAGAAATCTCCTTTACTATTCTATCATTGATCGGTCGGGAAAGATTATCGACCAA



CAATCACTGAATGTCATCGACGGATTCGACTATAGAGAGAAGCTGAACCAACGGG



AAATCGAGATGAAGGACGCGCGCCAGTCCTGGAACGCTATCGGCAAAATTAAAGA



TTTGAAAGAAGGTTACCTCTCCAAAGCAGTGCACGAAATTACCAAAATGGCAATC



CAGTACAATGCTATTGTGGTAATGGAGGAGTTAAATTACGGATTTAAGCGCGGGA



GGTTCAAGGTTGAAAAGCAAATTTACCAAAAATTTGAGAACATGTTGATTGATAA



GATGAACTACCTGGTGTTCAAGGACGCACCTGACGAGTCGCCAGGCGGCGTGTTA



AATGCATATCAGCTGACAAATCCACTGGAGAGCTTTGCCAAGCTAGGAAAGCAGA



CTGGCATTCTCTTTTACGTCCCTGCAGCGTATACATCCAAAATTGACCCCACCAC



TGGCTTCGTCAATCTGTTTAACACCTCCTCCAAAACCAACGCACAAGAACGGAAA



GAATTTTTGCAAAAGTTTGAGTCCATTAGCTACTCTGCCAAAGACGGCGGGATCT



TTGCTTTCGCATTCGACTACAGGAAATTCGGGACGAGTAAGACAGACCACAAGAA



CGTCTGGACCGCGTACACTAATGGGGAACGCATGCGCTACATCAAAGAGAAAAAG



AGGAATGAACTTTTTGACCCTTCAAAGGAAATCAAGGAAGCTCTCACCTCAAGCG



GTATCAAATACGATGGCGGGCAGAATATTTTGCCAGATATCCTCAGATCGAACAA



TAATGGACTTATCTATACTATGTACTCCTCCTTCATTGCAGCAATTCAAATGAGA



GTGTACGATGGAAAGGAGGATTACATTATATCGCCAATTAAGAACTCCAAAGGCG



AATTCTTCCGCACGGATCCTAAGCGAAGAGAACTCCCAATCGACGCTGATGCGAA



CGGCGCCTATAATATAGCCCTGCGGGGTGAATTAACAATGCGCGCTATTGCCGAG



AAGTTCGACCCCGATTCAGAAAAAATGGCTAAGCTTGAGCTGAAACACAAAGATT



GGTTCGAATTCATGCAGACAAGAGGCGACTAA





SEQ ID NO: 150
ATGACTAAGACCTTCGATTCCGAGTTCTTCAACCTTTATTCCCTGCAGAAAACT



GTAAGGTTTGAGCTGAAGCCGGTGGGCGAGACAGCCAGCTTCGTAGAGGATTTCA



AGAATGAGGGTCTCAAACGGGTAGTTAGTGAGGATGAGAGGAGAGCAGTGGACTA



TCAGAAGGTGAAAGAGATCATCGATGACTATCACCGGGATTTCATAGAGGAGTCG



TTGAATTACTTCCCTGAGCAAGTATCCAAAGACGCGCTGGAACAGGCCTTTCATC



TTTACCAGAAACTGAAGGCAGCGAAGGTTGAGGAGCGGGAAAAGGCCTTGAAAGA



GTGGGAAGCCCTGCAGAAAAAGCTCAGAGAAAAGGTTGTCAAATGCTTCAGCGAC



AGCAACAAAGCCAGGTTCAGTAGGATCGATAAGAAAGAACTGATCAAAGAAGACT



TGATCAATTGGCTGGTTGCACAGAACCGGGAAGATGATATTCCCACCGTAGAGAC



CTTCAACAACTTCACAACTTACTTCACCGGCTTCCATGAGAATCGTAAAAACATC



TACAGTAAAGATGATCATGCAACCGCCATCTCCTTCCGGTTGATCCACGAGAATC



TCCCCAAGTTCTTTGACAACGTGATAAGTTTCAATAAGTTGAAAGAGGGATTTCC



CGAACTCAAGTTCGATAAAGTGAAGGAGGATCTGGAAGTGGATTATGACCTTAAG



CACGCTTTCGAGATAGAGTACTTCGTGAACTTTGTGACTCAGGCCGGCATCGATC



AGTATAACTACCTCCTCGGGGGTAAGACGCTCGAGGACGGTACTAAGAAGCAAGG



AATGAATGAGCAAATTAATCTATTTAAACAGCAGCAGACCAGGGATAAGGCTAGA



CAGATCCCCAAGCTTATTCCTCTTTTTAAACAGATCCTAAGTGAAAGGACAGAAA



GTCAAAGCTTCATACCTAAGCAATTTGAAAGTGATCAGGAGCTGTTTGACTCCCT



GCAAAAGCTGCACAACAATTGCCAGGACAAGTTTACCGTGCTGCAGCAGGCTATC



CTCGGACTGGCTGAGGCGGATCTTAAGAAGGTATTCATTAAGACTAGCGACCTCA



ATGCCCTTAGTAACACCATCTTTGGAAATTACTCCGTTTTCAGCGATGCCCTCAA



TCTATACAAAGAGAGCTTGAAGACTAAAAAAGCTCAGGAAGCTTTTGAAAAATTA



CCGGCACATTCTATACACGACCTTATACAATACTTAGAGCAGTTCAACAGCAGCC



TCGACGCTGAGAAACAGCAATCCACAGACACCGTCCTGAATTACTTCATCAAAAC



CGATGAACTGTACTCCCGATTTATCAAGAGCACTTCAGAAGCCTTCACGCAAGTT



CAGCCTCTGTTCGAGCTGGAGGCACTGTCCAGCAAGAGACGACCGCCAGAGTCTG



AAGACGAGGGAGCCAAGGGTCAAGAGGGGTTTGAACAGATAAAGCGAATTAAGGC



TTACTTGGATACTCTCATGGAGGCGGTGCATTTCGCTAAGCCTTTGTACCTGGTT



AAAGGCCGAAAAATGATTGAGGGGCTAGATAAGGATCAGTCTTTTTACGAGGCTT



TTGAAATGGCCTACCAGGAATTGGAATCCTTGATCATTCCAATCTATAATAAAGC



CCGGAGTTATCTGAGCAGGAAGCCCTTCAAAGCCGACAAGTTCAAAATAAATTTT



GACAATAATACGCTACTGTCTGGTTGGGACGCTAACAAGGAAACAGCCAATGCTT



CCATCCTGTTTAAGAAAGACGGCCTGTACTACCTGGGAATTATGCCAAAAGGCAA



AACTTTTTTGTTCGATTACTTTGTGTCATCAGAGGATAGCGAGAAGTTAAAGCAA



AGACGGCAGAAGACCGCCGAAGAAGCCCTCGCACAAGACGGAGAATCATATTTCG



AGAAAATTCGATATAAGCTCCTGCCTGGCGCATCAAAGATGTTGCCAAAAGTCTT



CTTTTCCAACAAAAACATCGGCTTTTATAACCCCAGCGATGATATCCTTCGCATC



CGGAACACCGCCTCACATACCAAAAATGGAACTCCACAGAAGGGCCACTCGAAGG



TTGAATTCAACCTTAACGATTGTCACAAAATGATTGATTTTTTTAAGAGCTCCAT



TCAGAAACACCCCGAATGGGGGTCCTTTGGCTTCACCTTTTCTGATACTTCAGAC



TTCGAGGACATGTCCGCCTTCTACAGGGAGGTGGAGAACCAGGGCTATGTCATCT



CCTTCGACAAAATAAAAGAGACATACATTCAGAGCCAGGTCGAGCAGGGAAATCT



GTACCTGTTTCAGATCTATAACAAGGATTTCAGTCCCTATAGCAAGGGCAAGCCC



AATTTACATACCCTGTACTGGAAGGCCCTGTTCGAAGAGGCAAACCTTAACAATG



TAGTTGCTAAGCTGAATGGGGAAGCAGAGATCTTCTTCCGAAGGCACAGCATCAA



GGCAAGCGACAAAGTTGTACATCCTGCTAACCAGGCCATCGATAACAAGAACCCG



CATACAGAAAAGACACAGTCAACCTTTGAATACGACCTCGTGAAGGACAAGAGGT



ACACACAAGATAAATTCTTCTTCCACGTGCCCATCAGCTTGAATTTTAAAGCGCA



GGGAGTGAGCAAATTTAACGACAAGGTCAACGGCTTCCTGAAGGGAAACCCCGAC



GTGAATATCATCGGAATTGATCGCGGTGAAAGACATCTCCTCTACTTTACTGTGG



TGAACCAGAAGGGTGAGATCCTAGTACAGGAGAGCCTGAACACCCTTATGAGTGA



TAAGGGCCATGTGAATGATTACCAGCAGAAGCTGGACAAGAAGGAACAGGAAAGG



GACGCAGCGCGGAAGTCCTGGACCACTGTTGAGAATATCAAAGAACTGAAGGAGG



GATATCTTAGCCATGTGGTACACAAACTTGCACATCTGATTATCAAGTATAATGC



CATAGTCTGCCTGGAAGACTTGAACTTCGGTTTCAAGCGAGGAAGGTTTAAAGTG



GAGAAGCAGGTGTACCAGAAGTTTGAGAAAGCCCTTATTGATAAGCTAAACTACC



TTGTCTTTAAGGAAAAAGAACTCGGCGAAGTTGGCCACTATTTAACCGCCTACCA



ACTAACCGCCCCTTTCGAGTCTTTTAAGAAACTGGGAAAGCAGAGCGGAATACTC



TTCTATGTGCCTGCAGACTACACCTCTAAGATCGACCCCACTACCGGCTTTGTAA



ACTTTCTAGATCTCCGCTATCAGTCAGTAGAAAAAGCCAAACAGCTCTTGTCAGA



TTTTAACGCCATCCGATTTAATTCCGTCCAAAATTACTTCGAGTTCGAAATCGAC



TATAAAAAACTTACCCCCAAGAGAAAGGTTGGGACGCAGTCTAAGTGGGTAATCT



GCACTTACGGTGACGTGAGATACCAGAACCGCCGAAACCAGAAAGGTCATTGGGA



AACCGAGGAAGTGAATGTGACTGAGAAGCTCAAGGCCCTCTTCGCTAGCGACAGT



AAAACAACAACAGTTATCGATTACGCCAATGACGATAATCTTATAGACGTGATCT



TGGAACAAGACAAAGCCTCTTTTTTTAAGGAATTGTTGTGGTTGCTGAAACTTAC



AATGACCCTTAGGCACAGCAAGATCAAATCAGAGGATGACTTCATCCTCAGCCCG



GTGAAGAATGAACAGGGAGAGTTCTACGATTCACGGAAGGCTGGAGAGGTGTGGC



CCAAGGATGCCGACGCGAACGGGGCCTACCACATAGCTCTAAAAGGTCTGTGGAA



CCTGCAACAAATCAATCAATGGGAGAAAGGTAAGACACTGAACCTGGCCATCAAA



AATCAAGATTGGTTCTCATTCATCCAGGAAAAGCCTTATCAAGAGTGA





SEQ ID NO: 151
ATGCATACGGGAGGCCTTTTATCAATGGACGCAAAAGAGTTCACCGGGCAGTATC



CATTATCTAAGACACTCCGCTTCGAGCTGAGGCCCATTGGCAGGACCTGGGACAA



CCTGGAGGCGTCGGGCTACCTGGCTGAGGACAGACATCGCGCAGAATGCTATCCG



AGAGCTAAGGAGCTTTTGGACGACAATCATCGCGCGTTCCTTAACCGGGTGCTCC



CACAGATCGATATGGACTGGCACCCGATCGCTGAGGCTTTTTGCAAGGTCCATAA



GAACCCTGGGAACAAAGAGCTCGCCCAGGACTACAACTTGCAGCTGAGCAAGCGA



CGGAAAGAGATTTCTGCCTACCTTCAAGACGCCGATGGCTACAAAGGGCTCTTCG



CAAAGCCCGCATTGGATGAGGCCATGAAAATCGCCAAGGAGAACGGGAATGAAAG



TGACATCGAAGTTCTCGAAGCGTTTAACGGATTTAGCGTGTACTTTACCGGCTAT



CATGAGTCAAGGGAGAATATTTATAGCGATGAGGACATGGTCTCTGTGGCCTACC



GGATTACCGAGGATAATTTCCCGAGGTTTGTTTCAAATGCACTAATATTCGACAA



GTTAAATGAGAGCCACCCAGACATCATCTCGGAGGTCAGCGGCAACCTCGGAGTT



GACGATATTGGCAAATACTTCGACGTGAGCAACTATAACAACTTCCTCTCACAGG



CTGGCATCGACGACTATAATCATATTATAGGCGGCCACACTACTGAGGATGGTCT



CATTCAGGCATTCAATGTAGTCTTGAATCTTAGGCACCAGAAGGACCCTGGGTTT



GAAAAGATACAGTTCAAGCAGCTGTATAAGCAGATATTATCCGTGCGAACATCTA



AAAGTTACATCCCCAAACAGTTTGATAACTCAAAGGAGATGGTGGATTGCATATG



CGATTATGTGTCAAAAATTGAAAAGAGCGAGACTGTGGAGCGGGCTCTGAAGCTC



GTCAGGAACATTAGCTCCTTTGACCTTAGAGGAATTTTCGTCAATAAAAAGAATC



TGAGGATCCTGAGCAATAAGCTAATAGGAGATTGGGACGCCATAGAGACAGCATT



GATGCATTCCAGCTCAAGCGAGAATGATAAGAAGTCTGTCTACGATAGCGCTGAA



GCCTTCACGCTGGACGATATCTTCTCTTCCGTGAAAAAATTTAGTGATGCGTCCG



CAGAAGATATCGGGAATCGAGCCGAAGATATCTGCAGGGTAATTTCAGAGACCGC



CCCTTTCATCAATGACCTGCGCGCCGTGGACCTGGATAGCCTGAATGACGATGGT



TACGAAGCTGCAGTTTCTAAGATCAGGGAGTCTCTGGAGCCATATATGGACTTGT



TTCACGAACTTGAGATCTTTAGCGTGGGCGACGAGTTCCCGAAATGCGCAGCTTT



CTATAGCGAGTTAGAGGAGGTCAGCGAGCAATTAATCGAGATCATACCCCTGTTT



AATAAGGCACGGAGCTTTTGTACTCGCAAGCGCTACAGCACCGACAAGATTAAAG



TTAATCTGAAATTTCCAACTCTCGCAGACGGGTGGGACCTAAACAAGGAACGCGA



TAATAAAGCCGCCATCCTTAGAAAGGACGGAAAGTACTATCTTGCCATCCTAGAT



ATGAAAAAAGATCTGAGTTCCATTCGTACTAGCGATGAAGACGAATCTTCTTTCG



AAAAAATGGAGTATAAGCTGCTCCCCTCGCCAGTCAAGATGCTACCCAAGATCTT



TGTGAAGAGCAAAGCAGCCAAGGAAAAGTACGGGCTGACGGACAGGATGCTGGAG



TGCTACGATAAGGGAATGCATAAATCAGGGTCAGCTTTTGACTTGGGCTTTTGCC



ATGAGCTAATCGATTACTACAAGCGCTGTATCGCCGAGTATCCAGGATGGGACGT



TTTCGACTTTAAATTTCGGGAGACTTCTGATTATGGTTCAATGAAGGAGTTCAAC



GAAGATGTCGCTGGTGCCGGTTACTACATGAGCCTTCGCAAGATTCCTTGTTCCG



AAGTCTACCGGCTACTGGACGAGAAATCTATATATTTGTTCCAGATATATAACAA



GGACTACAGTGAGAATGCACATGGGAATAAGAATATGCATACTATGTATTGGGAA



GGTCTCTTTTCACCCCAAAATTTGGAGTCACCCGTGTTCAAACTTAGCGGTGGCG



CAGAGCTGTTCTTTAGGAAATCCAGTATACCCAATGACGCCAAGACAGTCCACCC



AAAGGGTAGCGTCCTGGTGCCCAGAAACGATGTGAACGGCAGGAGAATCCCTGAC



AGCATTTACCGAGAACTTACCAGGTACTTCAACCGCGGCGACTGTAGAATCTCTG



ATGAGGCAAAGTCTTATCTGGATAAGGTGAAGACTAAGAAGGCAGATCATGACAT



TGTGAAAGACCGCCGCTTTACTGTCGACAAAATGATGTTTCACGTGCCTATCGCA



ATGAATTTTAAGGCAATCTCAAAACCGAATCTGAACAAGAAGGTGATAGATGGCA



TTATCGATGACCAGGACCTCAAGATCATCGGAATCGACAGAGGTGAGCGAAACCT



GATATACGTCACAATGGTAGATCGGAAGGGTAATATTCTGTACCAGGATTCACTA



AACATCCTCAATGGATATGACTATCGAAAAGCTCTCGATGTCAGGGAATACGACA



ACAAGGAGGCGCGACGGAATTGGACAAAGGTGGAAGGCATACGGAAGATGAAGGA



AGGCTATCTGTCACTAGCTGTCTCCAAATTGGCTGATATGATTATAGAGAACAAC



GCCATTATCGTGATGGAAGATCTCAACCATGGATTCAAGGCAGGAAGAAGTAAAA



TTGAGAAGCAGGTGTATCAGAAGTTCGAAAGCATGCTTATTAATAAGTTGGGTTA



TATGGTCTTAAAGGACAAGTCTATCGATCAGAGCGGCGGCGCACTCCATGGGTAT



CAGCTGGCTAACCATGTCACCACACTAGCATCCGTAGGCAAACAGTGTGGCGTGA



TTTTCTACATTCCTGCTGCGTTCACTTCTAAGATCGATCCTACCACGGGATTCGC



AGACCTGTTCGCACTGAGCAATGTTAAAAACGTGGCCTCCATGAGGGAGTTCTTT



AGCAAAATGAAAAGCGTGATTTATGACAAGGCCGAGGGCAAGTTCGCTTTCACAT



TTGACTACCTGGACTACAATGTGAAATCAGAGTGCGGGAGAACCCTGTGGACCGT



ATACACGGTAGGGGAAAGATTCACTTACAGTCGAGTTAATCGGGAGTATGTCCGT



AAAGTGCCAACTGACATCATCTACGATGCCCTTCAGAAGGCTGGCATAAGTGTTG



AGGGGGATCTAAGGGACAGGATCGCTGAATCGGATGGCGATACTCTCAAATCAAT



CTTCTACGCCTTCAAGTATGCCCTCGACATGAGGGTAGAGAACCGGGAGGAGGAC



TATATACAGTCTCCCGTGAAGAATGCGTCGGGAGAGTTCTTCTGCTCAAAAAACG



CCGGGAAATCTTTGCCGCAGGATTCTGATGCAAATGGGGCTTATAACATTGCTCT



CAAAGGCATCCTGCAGCTGCGCATGCTATCTGAACAATATGACCCAAACGCTGAA



AGCATTAGATTGCCATTGATCACCAATAAGGCTTGGCTGACTTTCATGCAGAGCG



GTATGAAGACATGGAAAAACTAA





SEQ ID NO: 152
ATGGATTCCCTTAAGGACTTCACAAATCTTTACCCCGTGAGTAAAACCCTGAGAT



TTGAACTCAAGCCCGTGGGAAAGACTCTCGAGAATATCGAGAAGGCCGGGATTTT



GAAGGAAGACGAGCATCGGGCGGAAAGTTACAGACGGGTGAAGAAGATTATAGAT



ACTTATCACAAGGTCTTTATAGACAGCTCTTTAGAGAACATGGCAAAGATGGGCA



TCGAGAACGAAATCAAGGCCATGCTGCAGTCCTTCTGCGAGCTGTATAAAAAGGA



TCATCGGACCGAAGGCGAAGACAAGGCGCTGGATAAGATCAGGGCAGTGCTGCGC



GGCCTCATTGTGGGTGCCTTCACTGGGGTGTGCGGGCGGAGAGAGAACACTGTGC



AGAATGAGAAATACGAGAGTTTGTTCAAAGAGAAACTCATCAAGGAAATCCTGCC



CGACTTCGTCTTAAGCACAGAAGCCGAATCTCTCCCATTTTCTGTCGAGGAGGCC



ACGCGTTCCCTTAAAGAGTTCGACAGTTTCACTTCATACTTTGCCGGATTTTATG



AAAACCGTAAAAATATATACTCCACTAAACCACAGTCAACTGCAATAGCTTACAG



GTTAATCCACGAAAACCTGCCAAAATTCATCGACAATATACTCGTCTTTCAAAAA



ATCAAGGAACCAATCGCGAAGGAACTTGAACACATCCGGGCTGACTTTAGTGCGG



GAGGATACATCAAAAAAGACGAGCGCCTGGAGGATATATTTTCACTAAATTATTA



TATTCATGTACTGAGCCAGGCTGGCATAGAAAAGTACAACGCTCTAATTGGGAAA



ATCGTGACAGAAGGTGACGGGGAAATGAAAGGGCTAAACGAACATATTAACTTAT



ATAACCAACAGCGGGGTCGAGAAGATCGTCTGCCCCTGTTCAGACCTCTGTATAA



GCAAATACTCTCCGACAGAGAGCAGCTATCATATCTGCCCGAGTCCTTTGAGAAA



GATGAAGAGCTGCTCCGGGCGCTCAAGGAGTTCTATGATCATATAGCCGAGGACA



TTTTGGGCAGAACTCAGCAACTCATGACGTCTATTTCTGAATATGATCTGTCTCG



TATCTATGTCAGGAATGATAGCCAGCTGACCGATATATCCAAGAAGATGCTGGGG



GACTGGAACGCCATTTATATGGCGAGGGAGCGAGCATACGATCACGAGCAGGCAC



CCAAGAGAATCACAGCCAAATATGAGAGAGACCGCATTAAGGCGCTGAAGGGCGA



AGAAAGTATCAGTCTGGCCAATCTGAACTCCTGCATAGCTTTCCTTGATAACGTG



AGGGATTGCAGAGTTGATACTTACCTGAGTACCCTGGGCCAGAAGGAAGGGCCTC



ACGGCCTCTCTAATCTAGTGGAGAATGTATTTGCCTCCTACCACGAAGCTGAGCA



GCTGCTGTCATTTCCGTACCCAGAGGAAAATAATTTAATACAGGATAAGGACAAC



GTAGTGCTTATCAAAAATCTACTGGATAACATTTCCGACCTCCAGCGCTTTCTCA



AACCACTTTGGGGGATGGGCGACGAGCCTGATAAGGATGAGCGCTTTTACGGCGA



GTACAACTACATCAGGGGCGCCTTGGACCAGGTGATTCCCCTCTATAATAAAGTC



AGGAATTACCTGACCCGAAAGCCATACAGTACAAGAAAGGTGAAATTAAATTTCG



GCAATAGTCAGCTGCTGTCTGGTTGGGACCGAAATAAGGAGAAAGACAACAGCTG



CGTAATTCTCAGAAAAGGACAGAACTTTTATTTGGCCATCATGAATAACAGACAC



AAGAGATCTTTCGAGAACAAAGTGCTCCCTGAGTATAAGGAGGGGGAACCCTACT



TCGAGAAGATGGACTATAAATTCCTTCCTGATCCAAATAAAATGCTGCCTAAAGT



ATTTCTGTCAAAAAAAGGTATAGAAATCTACAAACCTTCACCTAAGCTACTTGAA



CAGTATGGCCACGGCACCCATAAAAAAGGGGACACGTTCAGCATGGACGACCTAC



ACGAACTGATTGACTTCTTTAAGCACAGCATAGAAGCTCATGAGGACTGGAAACA



GTTCGGATTCAAATTCTCAGATACCGCGACCTACGAAAACGTGTCTAGTTTTTAC



CGGGAAGTCGAGGACCAGGGCTACAAGCTCAGCTTCAGAAAAGTTAGCGAATCTT



ACGTCTACTCCCTTATAGATCAAGGTAAGCTGTATCTCTTTCAAATCTACAACAA



GGACTTTTCCCCATGTAGCAAGGGCACCCCCAATCTGCACACTCTCTACTGGCGG



ATGCTGTTCGACGAGCGTAACCTGGCAGACGTGATCTACAAATTAGATGGTAAAG



CTGAGATCTTCTTTCGTGAAAAGAGCCTAAAGAACGATCACCCCACTCACCCCGC



CGGAAAGCCCATTAAGAAGAAAAGTAGGCAGAAGAAAGGAGAAGAATCGCTATTT



GAGTACGACCTCGTCAAGGATCGGCATTATACAATGGATAAGTTCCAGTTCCATG



TGCCAATAACTATGAATTTCAAGTGCAGTGCTGGCAGTAAGGTGAATGACATGGT



AAACGCTCATATCCGGGAGGCAAAGGACATGCATGTTATTGGAATTGATAGGGGT



GAGCGTAATCTCCTCTACATCTGTGTTATTGACTCCCGCGGCACAATCCTCGATC



AGATTTCCTTGAATACAATTAATGATATAGACTACCATGACTTGCTTGAGTCTCG



CGACAAAGATAGACAGCAGGAGAGAAGAAATTGGCAGACCATCGAAGGCATCAAG



GAACTCAAGCAAGGCTACCTTTCTCAGGCAGTGCATCGAATAGCCGAGCTGATGG



TGGCTTATAAAGCCGTCGTGGCACTAGAAGACCTAAATATGGGATTTAAACGAGG



CAGGCAGAAGGTGGAATCATCCGTATACCAGCAGTTCGAAAAACAGTTGATAGAC



AAACTCAATTACCTTGTAGACAAGAAGAAGCGGCCTGAGGACATAGGGGGCCTGC



TTAGAGCGTATCAATTTACAGCCCCATTCAAGTCTTTCAAAGAAATGGGTAAACA



GAACGGTTTTCTGTTTTACATCCCAGCGTGGAACACCAGCAATATAGATCCAACC



ACTGGCTTCGTCAATCTGTTTCATGCTCAGTATGAAAATGTGGACAAGGCCAAAT



CCTTCTTTCAGAAATTTGACAGCATCTCCTATAACCCAAAGAAAGACTGGTTTGA



ATTCGCCTTTGACTATAAGAATTTCACTAAGAAGGCCGAGGGATCAAGAAGCATG



TGGATATTGTGCACGCATGGCTCACGTATAAAGAACTTTAGAAACTCGCAAAAAA



ACGGGCAGTGGGACTCAGAAGAATTCGCACTCACCGAGGCTTTCAAATCCCTCTT



CGTCCGGTATGAGATCGATTACACCGCCGATCTGAAGACGGCAATCGTCGACGAG



AAACAGAAAGACTTCTTTGTAGATCTACTTAAGCTCTTTAAGCTAACCGTTCAGA



TGCGAAACAGTTGGAAAGAAAAGGATCTCGACTATCTCATTAGTCCAGTGGCTGG



CGCGGATGGTAGATTTTTCGATACCCGGGAAGGTAACAAGTCCCTTCCCAAAGAC



GCCGACGCGAATGGTGCCTACAATATTGCACTAAAGGGGCTCTGGGCGCTGCGGC



AAATTAGACAGACATCTGAAGGGGGCAAGCTTAAGCTGGCTATTTCTAATAAAGA



GTGGTTGCAGTTTGTGCAGGAAAGGAGTTATGAGAAGGACTAG





SEQ ID NO: 153
ATGAACAACGGCACCAACAACTTCCAGAACTTCATCGGCATATCGTCTCTGCAGA



AAACACTTAGGAATGCCCTGATTCCAACTGAGACAACACAGCAGTTTATTGTGAA



GAATGGGATCATCAAAGAGGACGAATTGCGCGGGGAGAATAGGCAGATCCTGAAG



GACATCATGGACGATTACTACAGGGGTTTTATCTCCGAAACGCTGAGCTCGATTG



ACGATATTGACTGGACGTCCCTCTTTGAGAAGATGGAAATCCAACTTAAAAATGG



CGATAATAAAGATACCCTGATAAAGGAACAAACCGAATATAGAAAGGCTATACAC



AAAAAATTCGCAAATGACGACCGCTTTAAGAACATGTTTTCTGCAAAACTGATTA



GCGATATTCTGCCCGAGTTTGTGATTCACAATAATAACTATTCCGCTTCGGAGAA



GGAGGAAAAGACTCAGGTGATTAAACTGTTTTCTCGGTTCGCCACTTCTTTCAAA



GATTATTTCAAAAATCGCGCCAACTGTTTTTCCGCTGACGACATCTCCTCCTCTT



CCTGCCACCGGATCGTAAACGACAATGCCGAGATCTTTTTTAGTAACGCCCTTGT



GTATCGGAGGATAGTGAAGAGCCTGTCCAATGATGACATAAACAAAATTTCTGGC



GATATGAAGGATAGCCTCAAAGAGATGAGCCTTGAAGAAATTTACTCCTACGAGA



AGTATGGGGAGTTCATCACCCAGGAGGGGATTTCCTTCTATAATGACATCTGTGG



CAAGGTGAACAGCTTCATGAACCTGTACTGCCAGAAGAATAAGGAAAACAAAAAT



CTGTACAAGCTTCAGAAGTTACATAAGCAGATCCTGTGTATCGCGGATACCTCAT



ATGAGGTTCCTTATAAGTTCGAGAGTGATGAAGAAGTGTACCAGTCTGTAAATGG



ATTCTTAGACAATATTTCGTCCAAACATATAGTGGAGAGACTGAGAAAGATCGGG



GACAATTACAATGGGTACAATCTCGACAAGATTTATATCGTGTCGAAGTTTTACG



AATCTGTGAGCCAGAAAACATACAGGGATTGGGAAACCATTAATACCGCGCTTGA



AATTCACTACAATAATATTCTGCCTGGCAACGGAAAAAGCAAGGCCGATAAGGTA



AAAAAGGCAGTCAAAAATGACCTTCAGAAAAGTATCACCGAAATCAATGAGTTGG



TGAGCAACTACAAATTGTGTTCAGACGATAATATTAAAGCGGAAACGTACATACA



TGAAATTAGCCATATTCTGAATAACTTTGAGGCGCAGGAACTTAAGTACAACCCT



GAAATTCATCTCGTCGAAAGCGAATTGAAGGCCTCTGAATTGAAAAACGTTCTTG



ACGTGATAATGAACGCTTTCCATTGGTGCTCTGTGTTTATGACTGAAGAGCTGGT



TGATAAGGACAACAACTTTTATGCTGAACTTGAGGAAATCTACGACGAGATCTAC



CCTGTGATTAGCTTGTATAACCTCGTCAGAAACTACGTTACCCAGAAGCCGTACA



GCACGAAAAAAATAAAGCTGAACTTTGGTATTCCGACTCTCGCCGATGGATGGAG



CAAGTCGAAGGAATATTCCAACAATGCCATCATTCTTATGCGAGACAATCTGTAT



TACCTCGGCATCTTTAACGCCAAAAACAAGCCGGATAAGAAAATCATTGAAGGGA



ATACGAGCGAGAATAAGGGCGACTATAAGAAAATGATCTACAACTTACTGCCAGG



TCCCAATAAAATGATTCCTAAGGTGTTTCTGTCATCGAAAACAGGTGTAGAAACA



TATAAGCCCAGCGCATACATCCTGGAAGGCTACAAGCAAAACAAACACATCAAAA



GCAGCAAGGACTTTGATATCACATTCTGCCACGATCTAATCGACTACTTCAAAAA



TTGCATCGCCATTCACCCTGAGTGGAAGAACTTCGGCTTTGACTTCTCCGACACC



AGTACCTACGAAGACATTTCTGGATTCTACCGTGAGGTTGAGCTGCAGGGTTATA



AAATTGACTGGACATACATCAGTGAAAAAGACATCGATCTACTGCAGGAGAAGGG



GCAGCTCTATCTCTTCCAGATTTATAATAAGGATTTCAGCAAGAAGTCCACTGGA



AACGACAATCTGCATACAATGTATCTTAAGAACTTGTTTAGCGAAGAGAATTTGA



AAGATATCGTTCTAAAGTTAAACGGGGAAGCCGAGATTTTCTTTCGAAAGTCTTC



CATTAAGAATCCAATTATTCACAAGAAGGGCAGTATCCTGGTCAACAGAACCTAT



GAGGCCGAGGAAAAGGACCAGTTCGGTAATATACAAATTGTGCGCAAGAACATCC



CCGAGAACATTTACCAGGAGCTCTATAAATACTTCAACGACAAAAGCGATAAGGA



GCTTTCCGACGAGGCTGCCAAGCTGAAAAACGTGGTGGGACACCATGAAGCAGCC



ACCAACATCGTCAAAGATTATCGTTATACATATGACAAATATTTTCTGCACATGC



CTATTACAATAAACTTTAAGGCAAACAAGACCGGGTTCATCAATGACCGGATACT



CCAGTACATCGCAAAAGAGAAGGACCTGCATGTGATCGGCATCGACCGCGGTGAA



AGAAATCTCATTTACGTCAGCGTTATCGACACTTGTGGAAACATTGTGGAGCAGA



AGTCCTTCAACATTGTTAACGGCTATGACTATCAGATCAAGCTCAAACAGCAGGA



AGGTGCTCGTCAGATTGCGAGGAAAGAATGGAAAGAGATCGGCAAGATCAAGGAG



ATCAAAGAAGGGTATCTGAGCTTGGTCATTCACGAGATCTCCAAAATGGTCATCA



AGTACAACGCTATTATCGCGATGGAAGACCTCTCTTACGGCTTTAAGAAGGGGCG



CTTTAAAGTGGAGCGCCAGGTCTATCAGAAGTTCGAGACTATGCTTATCAATAAG



CTGAATTACTTGGTCTTTAAGGATATCAGTATCACCGAGAACGGAGGACTGCTGA



AAGGTTACCAGCTCACATATATTCCCGATAAGCTCAAGAATGTGGGCCACCAATG



CGGTTGTATTTTTTACGTTCCAGCTGCCTACACATCTAAGATCGATCCTACCACC



GGATTCGTCAATATATTTAAATTTAAAGATCTAACCGTTGATGCCAAGCGTGAGT



TTATTAAGAAATTTGATTCAATCAGGTACGACAGCGAAAAGAACCTCTTCTGTTT



CACTTTCGACTACAACAACTTCATCACACAAAATACTGTGATGAGCAAGTCATCA



TGGAGCGTTTATACTTATGGTGTAAGGATAAAAAGGCGCTTTGTTAATGGAAGGT



TTTCCAATGAAAGCGATACAATAGACATCACAAAAGACATGGAGAAGACACTGGA



GATGACAGATATTAATTGGAGGGACGGGCATGACCTTAGACAGGACATCATCGAC



TACGAAATCGTCCAACACATTTTTGAGATATTCAGACTCACTGTCCAGATGCGAA



ACAGCCTGTCGGAACTCGAAGACCGGGACTACGATAGACTGATCTCCCCGGTGTT



AAACGAAAATAATATTTTCTACGATTCTGCTAAGGCAGGAGACGCTCTTCCTAAA



GATGCGGACGCCAATGGCGCTTACTGTATAGCGTTGAAGGGATTGTATGAGATTA



AACAGATCACTGAGAATTGGAAAGAAGACGGTAAATTCTCCAGAGACAAGCTGAA



AATCTCCAACAAAGACTGGTTTGATTTTATTCAAAATAAGCGCTACCTGTAA





SEQ ID NO: 154
ATGACAAACAAATTTACTAATCAGTACAGCCTGTCAAAGACCCTCCGCTTCGAAC



TGATTCCACAAGGGAAGACCCTTGAATTCATCCAGGAAAAGGGTTTATTATCCCA



GGATAAACAACGCGCAGAAAGCTATCAAGAGATGAAGAAGACGATCGATAAATTT



CATAAGTATTTCATAGATTTAGCCCTGAGCAACGCTAAATTGACCCACCTGGAAA



CCTATTTGGAGCTGTACAACAAGTCAGCCGAGACAAAGAAAGAGCAGAAGTTTAA



GGACGACCTGAAAAAAGTACAGGACAATTTGCGAAAAGAGATCGTCAAGTCTTTT



TCCGACGGAGACGCCAAGTCAATATTTGCCATCCTGGACAAAAAGGAACTCATCA



CTGTGGAGTTGGAGAAGTGGTTTGAGAATAATGAGCAGAAGGACATCTATTTTGA



CGAAAAGTTCAAGACATTTACTACTTACTTCACCGGATTTCACCAAAACCGGAAG



AACATGTACTCTGTTGAGCCGAACTCAACCGCCATCGCCTACCGCCTTATTCACG



AAAATCTGCCAAAGTTTCTCGAGAATGCTAAAGCCTTTGAGAAAATTAAGCAGGT



CGAGTCGCTCCAGGTGAACTTTCGAGAGCTGATGGGTGAATTCGGGGACGAGGGC



CTGATTTTCGTGAATGAACTCGAAGAGATGTTTCAGATCAACTACTATAATGATG



TACTCTCACAGAACGGGATCACTATCTACAACAGCATTATCTCTGGATTCACTAA



GAACGATATCAAGTATAAAGGGCTGAATGAATACATCAACAATTATAATCAGACT



AAGGACAAAAAGGACAGGCTGCCTAAATTGAAACAGCTGTATAAGCAGATCCTCA



GTGATAGAATTAGCTTGTCATTTCTCCCAGATGCCTTCACTGACGGAAAGCAGGT



GCTTAAGGCGATATTCGATTTCTATAAGATCAACCTCCTCTCTTATACAATCGAG



GGCCAGGAGGAGTCACAGAACCTCCTGCTCCTGATTCGACAAACTATTGAAAATC



TGTCCTCTTTCGATACGCAGAAGATATACCTGAAAAATGACACCCATCTCACTAC



AATATCCCAACAGGTATTCGGAGATTTCTCCGTCTTCAGTACAGCCCTGAATTAC



TGGTACGAGACAAAGGTGAACCCTAAGTTCGAAACAGAGTACAGCAAGGCGAACG



AAAAGAAGAGGGAGATCCTGGACAAAGCCAAAGCCGTTTTCACCAAGCAAGATTA



CTTTAGCATCGCATTTCTGCAGGAAGTCCTGTCTGAGTACATACTGACACTCGAT



CACACAAGCGACATAGTTAAGAAGCACTCTTCCAATTGTATCGCGGACTACTTCA



AAAATCATTTTGTCGCGAAAAAGGAGAACGAGACAGATAAGACCTTCGATTTTAT



CGCGAATATTACCGCAAAGTATCAATGCATTCAGGGTATCTTGGAGAACGCCGAC



CAGTACGAAGACGAGCTTAAACAGGATCAGAAGCTCATCGACAACCTAAAGTTCT



TTTTGGACGCTATACTGGAACTCCTTCATTTTATTAAGCCACTACATCTGAAGAG



TGAGTCTATCACTGAGAAGGACACTGCTTTTTACGACGTTTTCGAGAATTACTAC



GAAGCACTGTCTCTGCTAACCCCTCTGTATAACATGGTGAGAAACTATGTGACAC



AGAAACCTTATAGTACCGAGAAGATTAAGTTGAACTTCGAGAACGCACAATTGCT



GAATGGGTGGGATGCAAACAAAGAGGGTGATTACCTCACAACAATCCTCAAGAAA



GATGGCAATTACTTCCTGGCCATTATGGATAAAAAACATAACAAGGCATTTCAGA



AATTTCCCGAGGGGAAGGAAAATTATGAAAAGATGGTATACAAGTTGCTGCCCGG



GGTGAACAAAATGCTCCCGAAGGTGTTTTTCTCGAATAAGAATATCGCGTACTTT



AACCCGTCCAAGGAACTGTTGGAAAATTATAAAAAGGAAACACACAAGAAGGGGG



ACACTTTTAATTTGGAGCACTGCCACACACTCATTGACTTCTTTAAAGATAGTCT



CAACAAACATGAGGATTGGAAATATTTTGACTTTCAGTTTAGCGAGACCAAGTCT



TATCAGGATCTGTCGGGATTTTATAGGGAAGTTGAGCACCAGGGTTACAAGATAA



ATTTCAAGAACATCGATAGCGAGTACATTGACGGACTGGTGAACGAAGGGAAGCT



GTTCCTGTTTCAGATTTACAGCAAAGATTTCTCTCCTTTCTCAAAAGGCAAGCCG



AACATGCATACCCTGTATTGGAAGGCCCTGTTCGAGGAGCAAAACCTTCAGAATG



TGATTTACAAGCTGAACGGTCAGGCCGAGATTTTTTTTAGGAAGGCCTCTATCAA



GCCCAAAAACATCATTCTGCACAAGAAAAAGATAAAGATCGCCAAAAAACACTTC



ATTGATAAAAAGACAAAGACTTCTGAGATCGTACCTGTTCAGACAATCAAGAATC



TCAACATGTATTATCAGGGGAAGATTAGCGAGAAAGAGCTGACACAGGACGATTT



GAGGTACATCGACAACTTCTCTATCTTTAACGAGAAGAACAAGACAATCGATATC



ATCAAGGACAAGCGGTTTACCGTCGATAAATTCCAGTTCCATGTGCCTATCACGA



TGAATTTCAAGGCCACCGGTGGGAGTTATATCAACCAGACTGTGCTGGAGTATCT



GCAGAACAACCCCGAAGTAAAAATTATTGGCCTGGACAGAGGAGAGCGGCATCTG



GTGTACTTGACCCTCATCGATCAGCAGGGAAATATCCTGAAACAAGAATCTCTGA



ATACTATTACGGACTCCAAAATCAGCACACCTTACCACAAGCTGCTTGATAATAA



AGAGAATGAGAGGGACTTGGCCCGCAAAAATTGGGGCACCGTCGAGAATATTAAG



GAATTGAAAGAAGGATACATCTCACAGGTGGTTCACAAAATCGCAACCCTGATGT



TAGAAGAGAACGCTATTGTGGTGATGGAGGACTTAAACTTCGGATTTAAAAGAGG



AAGATTTAAAGTCGAGAAACAGATTTATCAGAAACTGGAAAAAATGCTCATTGAC



AAATTAAATTACCTGGTGCTGAAAGATAAACAGCCACAGGAGCTGGGTGGCCTGT



ATAATGCTCTGCAGCTGACCAACAAGTTCGAGTCGTTTCAGAAAATGGGCAAGCA



GTCAGGCTTCCTTTTTTACGTGCCCGCTTGGAACACCTCAAAAATCGACCCTACA



ACAGGCTTTGTGAATTATTTCTATACCAAGTATGAAAACGTGGACAAGGCAAAGG



CCTTTTTCGAGAAGTTTGAAGCAATCAGGTTCAATGCCGAGAAAAAATACTTTGA



GTTCGAGGTCAAAAAATATAGCGACTTCAACCCTAAGGCCGAAGGCACGCAACAA



GCCTGGACAATATGCACGTATGGGGAGAGAATTGAGACTAAGCGGCAGAAGGATC



AGAATAACAAATTCGTGAGCACACCGATTAACCTGACAGAGAAGATAGAGGACTT



CCTCGGCAAGAATCAGATCGTGTACGGCGACGGCAATTGCATCAAGTCACAAATT



GCATCTAAAGATGACAAAGCATTCTTCGAAACACTGCTGTATTGGTTCAAGATGA



CACTCCAGATGCGAAATAGCGAAACAAGAACAGATATTGACTACCTCATCAGCCC



TGTGATGAATGATAACGGCACGTTTTACAATTCCCGGGACTATGAAAAATTAGAG



AACCCGACACTGCCAAAAGACGCCGACGCAAATGGTGCATATCACATCGCAAAGA



AAGGTTTGATGCTGTTGAACAAAATTGATCAGGCTGATCTGACAAAAAAGGTCGA



TCTGAGTATCAGTAACCGCGACTGGTTGCAGTTTGTCCAGAAGAACAAATAA





SEQ ID NO: 155
ATGGAACAAGAGTACTATCTGGGCCTGGACATGGGCACCGGGAGTGTCGGATGGG



CAGTCACCGACTCAGAGTACCACGTCCTCAGAAAGCACGGTAAGGCACTTTGGGG



AGTGCGACTCTTCGAGTCCGCTAGTACTGCTGAAGAGAGGAGGATGTTTCGAACT



TCCAGGCGCAGGCTGGATCGGCGAAACTGGAGAATAGAGATTCTCCAGGAGATAT



TTGCTGAAGAGATTTCAAAGAAGGATCCTGGTTTTTTCCTGCGCATGAAAGAATC



TAAGTATTACCCCGAAGATAAACGCGACATCAACGGCAATTGTCCTGAACTGCCC



TATGCTCTGTTTGTCGACGACGATTTCACCGACAAAGATTACCACAAGAAATTCC



CCACCATATACCACCTGAGAAAGATGTTGATGAACACCGAGGAGACACCCGACAT



ACGTCTGGTTTACCTGGCTATCCATCATATGATGAAGCACCGCGGGCATTTCCTG



CTGTCTGGAGACATCAATGAGATAAAGGAATTTGGTACTACGTTCTCCAAGTTGT



TAGAAAACATTAAGAATGAAGAGTTGGACTGGAATCTTGAACTGGGAAAGGAAGA



GTATGCAGTTGTAGAGTCGATTTTGAAAGATAACATGTTAAACCGGTCAACTAAG



AAAACCAGGTTAATTAAGGCACTAAAGGCCAAATCGATATGCGAGAAGGCTGTGC



TAAATCTGCTGGCTGGAGGCACCGTGAAACTGTCTGATATTTTCGGCCTGGAAGA



GCTCAATGAAACCGAGCGGCCTAAAATTTCTTTCGCCGATAACGGATACGATGAC



TATATTGGGGAGGTGGAAAACGAGCTCGGAGAACAATTCTACATTATTGAAACCG



CTAAGGCAGTCTATGACTGGGCCGTGCTCGTCGAGATTTTAGGCAAGTACACCAG



CATTAGCGAAGCAAAGGTGGCTACCTATGAAAAGCACAAATCTGACCTCCAGTTT



CTGAAAAAGATTGTGCGCAAATACTTAACAAAAGAAGAGTACAAGGACATCTTTG



TGAGCACATCAGATAAGCTCAAGAATTACTCAGCATACATTGGAATGACAAAGAT



TAACGGGAAGAAGGTGGATCTCCAAAGCAAACGTTGTTCAAAGGAGGAGTTTTAC



GATTTCATAAAGAAGAACGTGCTGAAGAAACTGGAGGGACAACCGGAGTACGAGT



ATTTAAAGGAGGAGCTCGAGCGAGAAACTTTCCTGCCCAAGCAAGTGAACAGAGA



CAATGGTGTCATTCCTTACCAGATTCACTTATATGAGCTGAAGAAAATCCTGGGG



AACTTGAGAGACAAGATAGACCTCATCAAGGAAAATGAAGATAAGTTGGTCCAGT



TGTTCGAATTCAGAATCCCATATTACGTCGGCCCGCTCAATAAGATCGACGACGG



CAAGGAAGGCAAATTCACTTGGGCGGTGCGAAAAAGCAACGAAAAAATATACCCA



TGGAACTTTGAGAACGTCGTTGACATCGAGGCCAGCGCCGAGAAATTTATAAGAC



GCATGACTAATAAGTGTACTTACCTCATGGGCGAGGATGTTCTGCCCAAGGACAG



CCTGCTGTATTCCAAGTACATGGTGCTTAACGAGCTGAATAATGTAAAGTTAGAT



GGTGAGAAGCTCAGCGTGGAGCTTAAACAGAGGCTGTACACTGATGTGTTTTGCA



AGTATCGGAAAGTTACCGTTAAGAAGATAAAGAATTACCTGAAATGCGAAGGGAT



CATTTCCGGCAACGTGGAAATTACCGGAATCGACGGCGATTTTAAGGCGTCGTTG



ACCGCTTATCATGATTTCAAGGAGATTTTAACCGGCACGGAGCTCGCGAAGAAAG



ACAAGGAGAACATAATCACGAATATAGTTCTGTTTGGGGACGATAAAAAACTTCT



TAAAAAACGACTCAATCGACTGTATCCGCAGATTACCCCCAACCAGCTGAAGAAG



ATTTGCGCTCTGAGCTATACCGGGTGGGGCCGGTTCTCTAAGAAATTCCTCGAGG



AGATCACAGCACCAGACCCAGAGACTGGTGAGGTGTGGAATATTATTACAGCTCT



GTGGGAATCCAATAATAACCTTATGCAATTGTTGAGCAATGAATATAGGTTCATG



GAGGAAGTGGAAACCTACAATATGGGCAAGCAGACAAAGACCCTATCTTACGAGA



CCGTTGAGAATATGTATGTCTCCCCTTCAGTGAAACGGCAAATCTGGCAAACTTT



GAAGATCGTGAAGGAGCTCGAAAAGGTGATGAAAGAGAGCCCGAAGAGGGTTTTT



ATTGAAATGGCCAGAGAGAAACAGGAGAGCAAGAGAACAGAGTCTAGGAAGAAGC



AGCTAATCGATTTGTATAAAGCCTGCAAGAACGAGGAAAAAGACTGGGTCAAGGA



GCTAGGCGATCAGGAAGAACAGAAGTTGCGCTCTGATAAGCTGTACTTATATTAT



ACCCAGAAAGGACGGTGCATGTACTCAGGTGAGGTCATTGAGCTGAAAGATCTGT



GGGACAATACTAAGTATGATATTGATCACATCTACCCTCAGTCAAAAACTATGGA



CGACTCCCTCAACAACAGGGTGTTGGTTAAGAAGAAATACAATGCTACAAAGTCC



GATAAATACCCTCTTAACGAAAACATCCGGCACGAAAGAAAGGGCTTCTGGAAGT



CCCTGCTGGATGGGGGTTTTATCAGTAAAGAAAAGTATGAGAGGCTGATCCGAAA



TACCGAGCTCTCCCCCGAGGAACTGGCTGGCTTTATCGAAAGGCAGATCGTAGAG



ACTAGGCAATCTACAAAGGCAGTCGCTGAGATCCTGAAGCAAGTGTTTCCTGAGT



CAGAAATCGTGTACGTCAAAGCTGGCACAGTGTCACGGTTCCGAAAGGACTTTGA



GTTGTTAAAAGTTCGGGAGGTGAATGACCTGCACCACGCTAAAGACGCCTATCTG



AATATCGTTGTGGGGAACTCCTATTATGTTAAGTTTACTAAGAATGCGTCCTGGT



TTATTAAGGAGAACCCGGGGCGCACCTATAACCTGAAGAAGATGTTCACCTCCGG



CTGGAACATAGAACGGAACGGAGAAGTCGCGTGGGAGGTGGGTAAGAAAGGGACC



ATTGTGACCGTCAAACAGATTATGAACAAAAACAACATATTGGTAACTCGCCAGG



TGCATGAGGCCAAAGGGGGCCTCTTTGATCAGCAGATTATGAAAAAGGGCAAAGG



ACAGATCGCAATCAAGGAAACCGACGAGCGCCTGGCATCCATTGAGAAGTACGGA



GGCTACAACAAGGCGGCAGGTGCGTACTTCATGCTCGTCGAGTCCAAAGATAAGA



AAGGCAAAACTATTAGAACAATCGAGTTCATCCCTCTATATTTGAAAAATAAGAT



CGAAAGTGACGAAAGCATCGCCCTTAACTTCTTGGAGAAGGGCCGGGGCTTAAAG



GAACCAAAGATTCTGCTCAAGAAGATCAAGATCGACACACTCTTCGATGTGGATG



GTTTTAAGATGTGGCTGTCAGGCAGGACAGGGGATCGCTTGCTGTTCAAATGCGC



AAATCAGTTGATTCTGGACGAAAAGATCATTGTGACGATGAAGAAGATCGTTAAA



TTCATTCAGCGGAGACAGGAAAACAGAGAACTGAAACTCTCCGATAAGGATGGAA



TTGACAATGAAGTCCTCATGGAGATTTACAATACCTTTGTGGACAAGCTTGAGAA



CACAGTCTATCGGATCCGACTGTCCGAACAGGCAAAGACTCTGATCGACAAACAG



AAAGAATTCGAAAGACTAAGCTTAGAGGACAAAAGTTCAACTCTCTTTGAAATTC



TCCACATCTTCCAATGTCAAAGTAGTGCAGCCAACTTGAAGATGATCGGGGGTCC



CGGCAAGGCTGGAATCTTAGTCATGAACAACAACATCTCCAAATGTAACAAAATC



TCCATCATAAACCAGTCTCCCACCGGCATTTTCGAGAACGAAATTGATTTACTCA



AG





SEQ ID NO: 156
ATGAAATCTTTCGATTCTTTCACCAACCTCTACTCCCTTAGCAAAACCCTTAAGT



TTGAAATGAGGCCGGTGGGGAATACACAGAAGATGCTTGACAATGCTGGCGTCTT



TGAAAAGGACAAATTAATCCAGAAGAAGTATGGTAAAACAAAGCCATATTTTGAC



CGATTGCATCGGGAATTCATTGAAGAGGCTCTTACAGGAGTAGAATTGATCGGAC



TGGACGAGAACTTCCGTACCTTAGTAGACTGGCAGAAGGACAAGAAGAACAACGT



GGCAATGAAGGCCTATGAGAACTCACTCCAGCGCCTTAGAACCGAGATCGGAAAG



ATCTTTAATCTTAAGGCGGAAGATTGGGTAAAAAATAAGTACCCGATCCTGGGAC



TGAAAAACAAAAACACAGACATCCTGTTTGAAGAAGCCGTCTTTGGTATCTTGAA



GGCCAGGTATGGAGAGGAGAAAGACACGTTTATAGAGGTAGAGGAGATTGATAAA



ACAGGCAAGAGTAAGATTAATCAGATCAGTATCTTTGATTCTTGGAAGGGGTTCA



CAGGCTACTTTAAGAAGTTTTTCGAAACCAGGAAAAATTTCTATAAGAACGATGG



CACCTCCACAGCTATCGCGACACGCATCATAGATCAGAATCTGAAACGGTTCATT



GATAATCTGAGCATTGTTGAATCCGTGCGCCAGAAGGTCGACCTAGCTGAGACTG



AGAAGTCTTTCTCTATATCACTCTCCCAGTTCTTCTCAATAGATTTTTATAATAA



GTGCCTTCTGCAAGATGGCATAGACTACTATAACAAGATCATCGGCGGCGAAACT



CTCAAAAACGGTGAAAAGCTCATTGGCCTGAATGAGCTCATCAACCAATATAGAC



AAAATAACAAGGATCAGAAAATCCCATTCTTTAAGCTGCTAGATAAACAGATCCT



ATCAGAAAAAATCCTGTTCCTCGACGAAATCAAAAACGACACCGAACTCATCGAG



GCTCTCTCGCAGTTTGCCAAGACGGCTGAGGAGAAGACGAAGATTGTGAAAAAGC



TGTTTGCAGACTTTGTGGAGAACAACTCTAAATACGATTTGGCTCAGATTTATAT



CTCCCAGGAAGCATTTAACACAATCTCCAATAAGTGGACTAGCGAGACTGAAACC



TTCGCCAAATACCTGTTCGAGGCCATGAAAAGCGGCAAGCTCGCCAAATACGAGA



AGAAGGACAATTCCTATAAGTTTCCCGATTTCATCGCATTATCTCAGATGAAGTC



CGCGCTACTTAGCATTAGCCTGGAAGGCCATTTTTGGAAGGAGAAATACTATAAG



ATTTCCAAATTCCAAGAAAAGACCAATTGGGAGCAGTTCTTGGCTATTTTTCTAT



ACGAGTTCAACTCTTTGTTCAGTGACAAGATCAACACTAAGGACGGTGAGACCAA



ACAAGTGGGGTACTACCTCTTCGCCAAAGATCTTCATAACCTGATACTGTCCGAA



CAGATCGACATACCCAAGGATTCAAAGGTGACCATCAAGGATTTTGCGGATTCGG



TATTGACGATCTATCAGATGGCGAAGTATTTCGCTGTCGAGAAAAAGCGGGCATG



GCTGGCCGAATACGAGTTGGACTCCTTCTATACTCAACCCGATACAGGGTACCTG



CAGTTTTACGATAATGCATACGAGGATATAGTCCAGGTGTACAATAAACTCAGGA



ACTACCTCACTAAGAAACCATACTCCGAAGAAAAATGGAAACTTAATTTTGAGAA



TAGTACACTGGCCAATGGATGGGACAAGAACAAGGAATCAGACAACTCCGCTGTA



ATTCTCCAGAAGGGTGGCAAGTATTATCTGGGACTGATAACAAAGGGCCATAACA



AGATTTTCGATGACCGTTTTCAGGAGAAGTTTATAGTGGGCATAGAGGGTGGCAA



GTATGAAAAAATAGTCTACAAGTTCTTTCCCGATCAGGCGAAGATGTTCCCCAAA



GTATGCTTCAGTGCTAAAGGCCTCGAGTTTTTCCGGCCATCTGAAGAGATACTCC



GCATCTATAATAACGCAGAGTTTAAAAAGGGAGAGACGTACTCAATCGACTCGAT



GCAGAAACTCATTGACTTCTACAAAGATTGTCTCACAAAATACGAGGGCTGGGCT



TGCTACACGTTTCGGCACTTGAAGCCAACCGAGGAATATCAAAACAACATCGGGG



AGTTCTTCCGTGACGTCGCCGAAGACGGCTATAGAATTGACTTTCAGGGCATAAG



TGATCAGTATATTCACGAGAAGAATGAGAAAGGTGAGTTGCATCTTTTCGAAATC



CACAATAAAGACTGGAATCTTGACAAGGCTCGCGATGGAAAATCAAAGACTACCC



AGAAGAATCTTCATACACTTTACTTCGAGTCCCTCTTTTCCAACGACAACGTCGT



ACAGAATTTCCCAATAAAACTGAACGGCCAGGCCGAAATTTTTTACAGGCCCAAA



ACCGAAAAAGATAAACTGGAATCCAAGAAAGACAAGAAGGGAAATAAGGTGATAG



ATCACAAAAGGTATTCCGAGAACAAGATTTTTTTCCACGTACCTCTTACCCTGAA



CAGAACGAAGAACGACTCTTATAGATTCAATGCCCAGATAAACAACTTTCTCGCA



AACAACAAAGATATCAATATTATCGGCGTCGATAGAGGTGAGAAGCACTTGGTAT



ATTATTCTGTGATCACGCAAGCATCCGATATCTTGGAGTCCGGTTCTTTGAACGA



ACTGAATGGTGTCAACTACGCCGAGAAACTCGGTAAGAAAGCTGAGAATCGGGAG



CAGGCTAGAAGGGACTGGCAGGACGTTCAGGGTATCAAGGACCTGAAGAAGGGCT



ACATTTCTCAGGTGGTTCGAAAACTGGCTGATTTGGCCATTAAGCACAATGCAAT



CATCATTTTAGAAGATTTGAACATGCGGTTTAAACAAGTCAGGGGGGGGATAGAG



AAATCAATTTACCAACAGCTGGAAAAAGCTCTGATTGATAAACTCTCTTTTTTGG



TTGATAAGGGCGAAAAGAACCCCGAGCAAGCAGGACATCTCCTTAAAGCCTATCA



ACTGAGCGCACCTTTCGAGACATTCCAGAAGATGGGAAAGCAAACCGGCATCATT



TTCTATACCCAGGCTTCCTATACATCCAAGTCTGATCCAGTGACTGGGTGGAGAC



CCCATCTCTACCTCAAGTACTTTTCTGCCAAAAAAGCTAAGGACGACATTGCTAA



GTTCACAAAAATCGAGTTCGTGAACGACAGGTTCGAGCTGACTTATGACATAAAA



GATTTCCAGCAGGCCAAGGAGTACCCAAACAAGACAGTTTGGAAAGTGTGTTCCA



ATGTGGAGAGGTTTCGGTGGGACAAGAATCTGAATCAGAATAAAGGGGGATATAC



TCACTACACCAACATTACCGAGAACATCCAAGAGTTGTTCACCAAATACGGCATC



GACATTACTAAAGATCTGCTGACACAGATCTCCACCATCGATGAGAAGCAGAACA



CATCTTTCTTCCGGGATTTCATCTTTTATTTTAACTTGATCTGTCAGATTAGAAA



TACCGACGACAGTGAGATAGCTAAAAAAAACGGGAAAGACGATTTCATTCTCTCT



CCCGTGGAGCCGTTTTTTGACTCCCGCAAAGACAATGGCAATAAGCTTCCGGAAA



ACGGGGACGATAACGGCGCCTACAACATCGCTCGTAAGGGAATCGTTATCCTCAA



TAAAATAAGCCAGTATTCCGAGAAGAACGAGAATTGTGAAAAAATGAAGTGGGGG



GACCTTTACGTCAGCAACATCGATTGGGATAACTTTGTGACACAAGCCAATGCGA



GACACTAG





SEQ ID NO: 157
ATGGAAAACTTCAAAAACCTCTACCCCATCAACAAGACCTTGAGGTTTGAGCTCC



GGCCATATGGGAAGACACTGGAGAACTTCAAAAAGTCCGGTCTGCTGGAAAAGGA



TGCTTTTAAGGCTAACTCTAGGAGGTCTATGCAGGCCATTATCGATGAGAAATTC



AAGGAGACCATAGAGGAGCGTCTGAAATATACTGAGTTTTCCGAGTGTGACCTAG



GAAATATGACCAGTAAGGACAAAAAGATCACCGACAAGGCAGCGACAAACCTGAA



GAAACAGGTGATTTTAAGCTTTGATGATGAGATTTTCAATAACTACTTGAAGCCG



GACAAAAACATCGACGCTCTGTTCAAGAATGATCCAAGCAACCCGGTCATCTCTA



CTTTCAAGGGCTTCACCACATACTTTGTAAATTTCTTCGAAATACGGAAACACAT



CTTCAAGGGAGAGTCTTCCGGTAGCATGGCTTACAGAATAATCGATGAGAACCTA



ACTACATATCTAAACAATATCGAGAAGATCAAGAAATTGCCTGAAGAACTGAAAT



CTCAGCTTGAGGGAATCGATCAAATTGACAAACTGAACAACTATAACGAGTTCAT



CACCCAGTCCGGCATTACTCATTATAACGAAATTATTGGAGGGATTTCGAAGTCT



GAAAATGTCAAAATTCAAGGCATTAACGAAGGGATTAATCTTTACTGTCAAAAGA



ATAAAGTGAAGCTACCACGCTTAACTCCTCTGTATAAGATGATTCTCTCTGATCG



GGTCTCTAATTCCTTTGTGCTGGATACCATTGAAAATGATACCGAGTTAATTGAA



ATGATCTCTGATCTGATAAATAAGACAGAGATAAGTCAGGATGTTATTATGTCCG



ACATCCAAAATATTTTCATCAAATATAAACAACTCGGCAACTTGCCGGGGATTAG



CTACTCATCTATAGTGAATGCTATCTGTTCGGATTACGACAATAACTTTGGTGAC



GGCAAACGTAAAAAAAGCTATGAGAATGATCGCAAAAAACACCTCGAGACTAACG



TGTATAGCATTAACTATATCTCAGAGTTACTGACAGACACCGACGTCTCCAGCAA



CATAAAGATGCGGTACAAAGAGCTGGAGCAGAATTATCAGGTATGCAAGGAAAAT



TTCAACGCCACTAACTGGATGAACATCAAAAACATTAAGCAGTCTGAGAAAACCA



ATCTGATCAAGGACCTTCTTGACATCCTCAAGAGCATCCAGCGGTTTTATGATTT



GTTTGACATCGTGGATGAAGACAAAAATCCTAGTGCTGAGTTCTATACCTGGCTG



TCTAAAAACGCGGAGAAACTGGACTTCGAGTTTAATTCAGTGTACAACAAGAGCA



GGAACTACCTCACGAGAAAGCAGTACTCCGATAAAAAGATTAAGTTGAACTTCGA



TAGTCCTACTCTCGCCAAGGGGTGGGATGCGAACAAAGAAATTGATAATAGCACA



ATTATCATGAGGAAGTTCAACAACGACCGGGGCGATTACGATTACTTCTTGGGGA



TCTGGAATAAGAGCACACCTGCCAACGAAAAGATCATCCCATTAGAGGATAATGG



ACTGTTTGAAAAAATGCAATATAAGCTGTATCCCGATCCTAGTAAAATGCTGCCA



AAGCAATTCCTTTCTAAGATCTGGAAAGCTAAACATCCAACTACACCCGAGTTTG



ATAAGAAGTACAAAGAAGGTCGGCACAAGAAGGGGCCTGATTTTGAGAAAGAGTT



TCTGCACGAGTTGATCGATTGCTTTAAGCATGGATTGGTAAACCACGACGAAAAA



TATCAGGATGTGTTCGGGTTCAATCTGCGCAACACGGAAGACTACAACTCTTATA



CAGAGTTTCTGGAGGACGTCGAAAGGTGCAACTATAATCTTAGTTTCAATAAAAT



CGCTGACACGTCTAACTTGATAAATGATGGGAAACTCTATGTTTTTCAGATCTGG



AGCAAGGATTTCAGCATAGATAGCAAGGGAACAAAAAACTTGAACACAATATACT



TTGAATCCCTCTTCTCGGAGGAAAATATGATCGAGAAGATGTTCAAGCTCTCAGG



GGAAGCCGAAATATTCTATCGTCCAGCAAGTTTGAATTATTGTGAAGATATTATC



AAGAAGGGACACCACCACGCCGAACTGAAGGACAAATTCGACTATCCCATCATCA



AGGACAAGCGATATAGCCAGGACAAATTTTTTTTTCATGTCCCCATGGTTATCAA



CTACAAAAGCGAGAAGTTAAACTCCAAATCACTTAACAATAGGACGAACGAAAAT



TTAGGCCAATTCACGCACATCATCGGTATCGACCGCGGAGAGCGACATCTCATCT



ACCTGACCGTGGTGGATGTGTCCACCGGTGAGATCGTTGAGCAAAAGCACCTGGA



TGAAATTATAAATACAGATACAAAAGGCGTCGAGCATAAAACTCATTATCTCAAT



AAATTAGAAGAGAAGTCCAAGACGCGGGATAATGAAAGAAAGTCCTGGGAAGCAA



TCGAGACGATTAAGGAGCTGAAAGAAGGCTATATTAGCCACGTGATCAATGAAAT



CCAGAAATTGCAGGAAAAGTATAACGCACTGATAGTGATGGAGAACCTCAATTAT



GGGTTTAAGAACTCGCGTATCAAAGTGGAAAAGCAGGTCTACCAGAAATTCGAGA



CCGCCCTGATTAAAAAGTTTAATTACATCATTGACAAGAAAGATCCTGAAACCTA



CATTCATGGATACCAACTGACGAATCCAATCACTACACTCGATAAAATTGGTAAC



CAGAGCGGTATTGTGTTGTACATTCCGGCTTGGAATACAAGCAAGATTGATCCAG



TCACTGGTTTCGTTAACCTCCTGTATGCAGACGATTTGAAATACAAGAACCAGGA



GCAGGCTAAAAGCTTTATCCAGAAAATCGATAATATCTACTTCGAAAATGGTGAG



TTTAAATTTGATATAGATTTCAGCAAATGGAACAACCGCTACTCAATTAGCAAGA



CGAAATGGACACTGACAAGCTACGGAACCCGGATACAGACGTTCCGAAACCCCCA



GAAAAATAACAAGTGGGACAGCGCCGAGTATGACCTGACCGAAGAGTTTAAATTA



ATCCTGAACATCGATGGTACTCTGAAATCTCAGGATGTGGAAACCTATAAGAAAT



TCATGTCTTTATTCAAGCTGATGTTGCAGCTGCGAAACTCCGTTACTGGAACAGA



CATTGACTACATGATTAGCCCTGTGACAGATAAAACTGGAACCCACTTTGATTCA



CGGGAGAATATCAAGAACCTGCCCGCCGATGCTGATGCGAACGGAGCTTACAACA



TTGCTAGGAAGGGCATCATGGCAATCGAGAATATTATGAACGGCATTAGCGACCC



TCTGAAGATCAGTAATGAGGACTACCTGAAGTACATTCAGAACCAACAAGAGTAA





SEQ ID NO: 158
ATGACCCAGTTTGAGGGTTTCACCAATCTTTATCAGGTGTCAAAAACACTCAGAT



TTGAGCTCATCCCACAGGGTAAAACTTTAAAGCATATTCAAGAGCAGGGCTTTAT



AGAGGAAGACAAAGCCAGAAACGACCATTATAAGGAACTAAAACCGATCATTGAC



CGCATCTACAAAACCTATGCCGACCAATGCCTTCAGCTCGTCCAACTCGATTGGG



AGAATCTGAGCGCCGCTATTGACAGCTACAGGAAGGAGAAGACCGAGGAGACTAG



AAACGCCCTGATCGAGGAGCAGGCGACCTATAGAAACGCTATTCACGATTATTTT



ATCGGCCGCACCGACAATTTGACAGATGCCATCAACAAGCGGCACGCCGAAATTT



ATAAGGGGTTATTTAAGGCCGAGCTGTTCAATGGAAAAGTACTGAAACAGCTGGG



CACCGTAACAACCACCGAACACGAGAATGCTCTGTTGAGGTCCTTCGACAAGTTT



ACTACCTACTTTAGCGGCTTCTACGAAAACCGTAAAAACGTGTTTTCCGCGGAGG



ATATTTCAACAGCCATTCCTCATAGGATCGTGCAGGATAATTTCCCCAAGTTTAA



GGAGAACTGCCATATCTTTACCAGACTTATCACTGCTGTGCCAAGTTTACGAGAA



CACTTCGAGAATGTTAAGAAGGCTATAGGCATATTCGTTTCCACCTCCATCGAAG



AAGTATTCAGTTTTCCATTCTACAATCAGTTACTCACGCAGACCCAGATAGATCT



CTACAATCAGCTGCTCGGAGGCATTTCTAGAGAAGCAGGCACGGAAAAGATCAAG



GGCTTAAATGAAGTACTCAATCTTGCAATTCAGAAGAACGATGAGACAGCACACA



TTATTGCATCTCTCCCTCACAGATTCATTCCCCTGTTCAAACAGATCCTGTCCGA



TCGCAACACACTAAGCTTTATACTTGAGGAGTTTAAGTCAGATGAGGAAGTGATC



CAGAGCTTCTGTAAGTATAAGACTTTGCTCCGTAATGAAAACGTGCTTGAGACAG



CAGAGGCTCTCTTTAACGAGTTGAATTCCATCGACCTGACACACATTTTTATCAG



CCATAAAAAGCTGGAAACGATTAGCTCTGCCTTGTGCGACCACTGGGACACCCTG



CGTAACGCCCTCTATGAAAGGCGCATTTCCGAGCTCACCGGGAAGATCACAAAAA



GTGCCAAGGAAAAAGTCCAGAGGTCCCTTAAACATGAAGACATCAACCTACAAGA



GATCATCTCTGCGGCTGGGAAAGAGCTGTCAGAAGCATTTAAACAGAAGACTTCC



GAGATCCTGAGCCACGCACACGCCGCATTAGACCAGCCCCTGCCTACAACTCTTA



AAAAACAGGAGGAGAAGGAGATTTTAAAGAGCCAGCTGGACTCATTACTCGGCCT



GTATCATCTCCTGGACTGGTTCGCCGTGGACGAATCCAACGAGGTGGACCCAGAA



TTTAGCGCCAGGCTGACAGGAATTAAACTGGAAATGGAGCCAAGTTTGAGCTTTT



ACAACAAGGCTCGGAACTATGCCACTAAAAAGCCCTACAGCGTGGAAAAGTTCAA



GCTGAATTTTCAGATGCCGACCCTGGCTTCCGGGTGGGATGTTAATAAGGAAAAG



AATAATGGGGCTATACTGTTCGTCAAAAATGGTCTCTACTACCTGGGAATCATGC



CCAAACAGAAGGGCAGGTACAAAGCCCTTTCGTTTGAGCCGACCGAAAAAACCAG



CGAAGGCTTTGATAAGATGTATTACGACTATTTCCCAGATGCAGCCAAGATGATC



CCAAAATGTAGCACTCAGTTGAAGGCGGTAACCGCTCACTTTCAGACACACACCA



CTCCTATCTTGCTCTCCAACAACTTTATTGAGCCGCTGGAGATCACGAAGGAAAT



CTACGACCTTAACAACCCAGAGAAGGAACCCAAGAAATTCCAAACAGCTTATGCT



AAGAAGACTGGGGATCAAAAGGGCTATCGAGAGGCTTTGTGTAAGTGGATTGACT



TTACACGGGATTTCCTGAGTAAGTATACCAAGACCACATCTATTGACCTGTCCTC



ACTGAGACCTTCCTCACAATATAAGGATCTCGGAGAGTATTATGCCGAACTCAAC



CCTCTACTCTATCACATCTCTTTCCAGAGGATCGCCGAAAAGGAAATTATGGACG



CCGTCGAGACAGGCAAGCTGTACCTCTTCCAGATTTACAACAAGGATTTCGCAAA



GGGCCACCACGGAAAACCCAATTTGCACACTTTGTACTGGACAGGGCTCTTCTCT



CCCGAAAATTTGGCCAAAACTTCAATAAAACTGAACGGGCAAGCCGAGCTGTTCT



ATCGGCCCAAGTCACGTATGAAGCGGATGGCCCACCGGCTGGGCGAGAAGATGCT



CAACAAGAAACTGAAGGATCAGAAGACGCCCATACCAGACACTCTTTACCAAGAG



CTGTATGACTACGTGAATCACAGACTGAGTCACGACCTGTCTGATGAAGCCCGGG



CTCTTCTTCCAAATGTGATTACCAAAGAAGTTTCCCACGAAATTATCAAGGACCG



GCGCTTCACCTCTGACAAATTCTTTTTCCACGTCCCAATCACCCTCAACTACCAG



GCAGCCAATTCCCCTTCAAAGTTTAACCAGCGTGTGAATGCCTACCTGAAAGAGC



ATCCGGAGACCCCCATCATAGGGATAGACAGAGGAGAGCGGAATCTTATCTACAT



TACTGTGATTGACAGCACAGGTAAGATCTTGGAGCAGAGATCTTTAAATACAATC



CAGCAGTTTGACTACCAGAAGAAACTGGATAACCGAGAGAAGGAAAGGGTTGCTG



CAAGACAGGCCTGGTCAGTGGTCGGCACCATCAAAGACCTGAAGCAGGGCTACTT



ATCCCAAGTAATTCACGAAATTGTCGATCTTATGATTCATTATCAAGCCGTTGTT



GTGCTGGAGAACCTGAATTTTGGCTTCAAAAGCAAACGAACAGGTATCGCCGAGA



AAGCCGTGTATCAGCAGTTCGAAAAGATGCTCATAGACAAGCTGAACTGCTTAGT



GCTGAAGGATTATCCTGCTGAGAAGGTCGGCGGCGTACTTAACCCATACCAGCTG



ACCGATCAGTTCACTAGTTTCGCCAAGATGGGAACGCAAAGTGGCTTCCTTTTCT



ACGTGCCCGCTCCCTACACGAGTAAGATCGACCCTCTGACCGGCTTCGTCGACCC



ATTCGTCTGGAAGACCATCAAGAATCACGAATCACGGAAACACTTCTTAGAGGGG



TTTGACTTCCTGCACTACGACGTGAAGACAGGGGACTTCATCTTACACTTTAAGA



TGAATCGAAACCTCTCCTTCCAGCGGGGCCTGCCTGGTTTCATGCCCGCATGGGA



CATCGTGTTTGAGAAAAACGAGACACAGTTTGACGCTAAGGGAACCCCCTTTATT



GCGGGGAAGCGGATTGTCCCAGTCATCGAAAACCATCGGTTCACCGGGCGATACC



GGGATCTGTACCCGGCCAACGAGCTCATCGCGCTGCTGGAGGAGAAGGGTATTGT



GTTTAGGGATGGATCCAACATTCTGCCTAAGTTGCTGGAAAATGATGATTCGCAC



GCCATTGATACCATGGTTGCACTGATTAGATCCGTACTGCAGATGAGGAATAGCA



ATGCTGCAACCGGGGAGGATTATATTAATTCCCCAGTGCGAGATCTGAATGGTGT



CTGTTTTGACTCGCGCTTTCAGAATCCAGAATGGCCAATGGATGCAGACGCTAAC



GGGGCGTACCACATTGCTCTGAAAGGCCAGCTACTCCTGAACCACCTCAAGGAGA



GCAAAGATCTGAAGCTGCAGAACGGCATTTCCAACCAAGACTGGCTCGCCTACAT



ACAAGAACTGCGCAATTAA





SEQ ID NO: 159
ATGGCTGTCAAATCCATCAAGGTTAAATTACGGCTTGATGACATGCCCGAGATCC



GCGCCGGGCTCTGGAAACTCCATAAAGAAGTGAATGCTGGCGTTAGATACTACAC



AGAATGGCTCTCCCTGCTGCGCCAGGAAAATTTGTACCGCCGGTCACCTAATGGA



GATGGAGAGCAGGAATGCGATAAAACAGCAGAAGAGTGCAAAGCCGAATTGCTGG



AGCGACTGCGGGCACGGCAGGTTGAGAATGGACACCGAGGTCCGGCGGGATCGGA



CGACGAGCTGCTCCAGCTCGCCAGACAATTATATGAACTGCTGGTGCCTCAGGCT



ATTGGGGCAAAGGGTGACGCACAGCAGATTGCTAGAAAATTTCTGTCTCCCCTCG



CCGACAAAGACGCTGTCGGCGGCCTTGGGATAGCCAAAGCCGGCAACAAACCCCG



ATGGGTGCGCATGAGGGAGGCTGGTGAGCCTGGCTGGGAGGAAGAAAAGGAAAAG



GCCGAAACCAGAAAGTCCGCCGACAGGACCGCGGACGTACTCCGAGCATTGGCCG



ATTTTGGGCTGAAGCCCTTAATGCGAGTCTACACCGATAGTGAAATGTCTAGCGT



GGAGTGGAAGCCATTACGCAAAGGGCAGGCAGTGCGGACGTGGGACCGTGACATG



TTCCAGCAAGCCATCGAGCGAATGATGAGCTGGGAGAGCTGGAACCAGAGAGTGG



GGCAGGAGTATGCCAAGCTGGTCGAGCAGAAAAACCGGTTTGAGCAAAAAAATTT



TGTAGGTCAGGAACACCTGGTGCATCTCGTTAACCAGCTCCAGCAAGATATGAAG



GAAGCTTCGCCTGGATTAGAGAGCAAAGAGCAGACTGCACACTATGTAACCGGAA



GAGCACTGAGGGGCAGTGACAAAGTGTTCGAAAAATGGGGAAAACTGGCTCCCGA



TGCCCCCTTTGACCTGTACGACGCAGAAATAAAAAACGTGCAGCGGCGAAACACC



AGGCGATTTGGTAGCCATGATCTGTTCGCCAAATTGGCAGAGCCGGAATATCAGG



CTCTTTGGCGAGAAGACGCATCATTTCTCACTAGGTACGCGGTCTATAACTCCAT



TTTGAGGAAATTGAACCACGCAAAAATGTTTGCCACCTTCACGTTGCCTGACGCC



ACCGCTCATCCCATTTGGACACGGTTTGATAAGCTGGGCGGCAATCTGCATCAGT



ATACATTCCTGTTTAACGAGTTTGGAGAGCGAAGACATGCGATACGATTCCACAA



GCTACTGAAGGTCGAAAATGGCGTGGCACGTGAGGTGGACGATGTCACCGTGCCC



ATCAGCATGAGCGAACAGCTGGATAATTTGTTGCCGCGGGACCCAAATGAACCTA



TAGCCCTTTATTTTAGGGACTACGGGGCGGAGCAACATTTCACTGGGGAGTTTGG



CGGCGCAAAAATTCAGTGCCGACGCGACCAGCTCGCCCACATGCATAGAAGACGC



GGGGCCCGGGACGTATACCTTAACGTCTCTGTGAGGGTGCAGTCCCAGTCAGAGG



CAAGAGGGGAACGCAGACCACCTTACGCAGCAGTATTCAGGCTGGTAGGCGATAA



CCACCGGGCGTTTGTACACTTTGATAAACTTTCTGACTACCTGGCCGAACACCCG



GATGACGGCAAATTAGGATCGGAGGGGCTGCTTAGCGGCCTGCGTGTGATGAGCG



TCGATCTGGGGCTACGGACCTCTGCTTCCATCTCTGTGTTCCGTGTGGCCCGAAA



GGACGAGTTGAAACCTAATTCGAAGGGCCGTGTACCATTCTTTTTCCCTATTAAG



GGAAATGATAATCTCGTCGCGGTGCACGAGCGTTCCCAACTGCTGAAACTGCCTG



GCGAGACCGAGTCCAAAGATCTCAGAGCAATCCGGGAGGAGCGACAACGTACACT



TAGGCAACTCCGCACCCAGCTGGCCTATCTGCGCTTGCTGGTGCGGTGCGGCTCC



GAGGATGTAGGGAGAAGAGAGCGAAGCTGGGCAAAGCTGATAGAGCAACCAGTTG



ACGCCGCGAATCACATGACCCCCGACTGGCGCGAAGCGTTTGAAAATGAGCTGCA



GAAGTTGAAATCTCTGCATGGGATTTGCTCAGATAAGGAGTGGATGGACGCCGTA



TACGAGTCTGTTCGCCGGGTATGGCGGCACATGGGGAAGCAGGTGAGAGATTGGA



GAAAGGACGTTCGCTCTGGGGAACGGCCGAAAATTCGGGGATACGCAAAGGATGT



CGTGGGCGGCAATAGCATTGAGCAGATCGAGTACCTGGAAAGGCAATACAAATTT



CTGAAATCTTGGTCTTTCTTTGGGAAGGTAAGCGGACAAGTTATCAGAGCCGAAA



AGGGATCTCGCTTTGCTATCACATTGAGGGAACACATTGATCACGCCAAAGAAGA



CAGGTTGAAAAAGTTGGCTGATCGCATTATCATGGAAGCACTCGGTTACGTCTAC



GCCCTTGATGAGCGCGGTAAAGGGAAGTGGGTAGCCAAGTATCCCCCATGTCAGC



TGATCCTGCTCGAGGAACTTTCTGAGTATCAGTTCAATAACGACCGTCCTCCCTC



CGAAAATAATCAGCTCATGCAATGGTCCCACCGGGGTGTGTTCCAAGAACTGATC



AATCAGGCTCAGGTGCACGACCTCCTCGTAGGCACTATGTATGCAGCCTTTAGCT



CCCGTTTTGACGCGCGCACAGGCGCCCCTGGAATACGATGTAGGCGAGTTCCCGC



ACGGTGCACTCAAGAACATAACCCGGAGCCTTTCCCATGGTGGCTCAATAAGTTT



GTTGTGGAGCATACCCTCGACGCTTGCCCATTGAGGGCGGATGACTTGATTCCCA



CAGGCGAGGGGGAGATCTTCGTGAGCCCATTTTCTGCCGAAGAAGGGGATTTCCA



CCAAATACATGCCGACTTGAATGCTGCCCAAAATCTGCAGCAAAGGCTGTGGTCA



GACTTCGACATCTCGCAAATCAGACTGCGGTGTGACTGGGGCGAAGTAGACGGCG



AGCTGGTGCTGATACCTAGACTGACGGGTAAGCGTACCGCCGATAGCTATAGTAA



TAAGGTTTTTTATACGAATACGGGGGTGACATATTACGAGCGTGAGAGAGGCAAG



AAGCGTCGGAAGGTGTTCGCGCAGGAGAAGCTGAGCGAAGAGGAGGCGGAGCTAC



TGGTAGAGGCAGATGAGGCAAGAGAAAAGTCCGTCGTCCTGATGCGGGATCCTAG



CGGGATTATTAACAGAGGTAATTGGACACGGCAGAAAGAATTCTGGAGCATGGTG



AATCAAAGAATCGAGGGTTACCTGGTGAAGCAAATTCGAAGCCGGGTGCCCCTTC



AAGACAGCGCATGTGAAAACACTGGGGACATCTAG





SEQ ID NO: 160
ATGGCTACTCGGTCCTTCATCCTGAAAATCGAGCCAAATGAAGAGGTGAAAAAGG



GCCTGTGGAAGACCCATGAGGTACTTAACCACGGCATAGCATACTATATGAATAT



CCTAAAACTTATACGGCAGGAGGCTATCTACGAGCATCACGAGCAAGATCCTAAA



AATCCAAAGAAGGTTAGTAAGGCTGAAATCCAGGCTGAATTGTGGGACTTCGTGC



TGAAGATGCAGAAATGCAACAGTTTCACGCATGAAGTTGATAAGGACGTCGTGTT



TAATATACTCCGGGAGCTGTACGAAGAACTGGTACCAAGCTCTGTGGAAAAGAAA



GGAGAGGCCAACCAGCTAAGTAATAAGTTCCTCTATCCTCTCGTGGACCCCAATT



CACAGAGCGGCAAAGGTACCGCATCTTCTGGGAGGAAACCACGCTGGTACAACTT



GAAGATCGCTGGCGATCCCAGCTGGGAGGAGGAAAAGAAGAAATGGGAAGAGGAT



AAAAAGAAAGACCCCCTGGCCAAAATCTTAGGCAAGCTCGCCGAGTACGGTCTGA



TTCCACTTTTCATCCCGTTCACAGATAGCAATGAGCCGATCGTCAAGGAGATTAA



GTGGATGGAAAAGAGCCGCAATCAGAGTGTGCGGAGGCTGGACAAAGACATGTTT



ATTCAGGCCCTGGAACGCTTCCTTAGCTGGGAAAGCTGGAACCTGAAGGTTAAGG



AAGAGTACGAAAAAGTCGAGAAGGAGCATAAGACTTTGGAGGAGCGCATCAAAGA



AGACATCCAGGCCTTTAAGTCTCTAGAACAGTATGAGAAAGAACGGCAGGAACAG



CTGCTGCGTGATACACTGAACACAAACGAATATCGCCTGAGCAAGAGGGGACTCA



GAGGCTGGAGAGAAATCATTCAAAAGTGGCTCAAAATGGATGAAAATGAGCCGTC



TGAAAAATACCTTGAAGTTTTCAAGGACTACCAGCGGAAGCACCCTAGAGAAGCC



GGCGACTATAGTGTTTACGAATTCTTGAGCAAGAAGGAGAATCATTTTATATGGA



GGAATCACCCGGAGTACCCATATCTGTACGCAACCTTCTGCGAAATCGACAAGAA



AAAAAAAGACGCCAAGCAACAGGCTACATTTACTCTGGCCGACCCTATCAATCAC



CCTCTATGGGTCCGGTTTGAGGAGCGCTCCGGAAGCAATCTGAATAAATATCGTA



TTCTGACTGAACAGTTACACACAGAGAAGCTCAAGAAGAAACTTACGGTGCAGCT



GGACCGCCTGATATACCCAACAGAGTCCGGAGGATGGGAAGAGAAAGGAAAGGTT



GACATCGTACTGCTTCCATCTCGTCAGTTTTACAACCAGATATTCCTGGACATCG



AGGAGAAGGGGAAACACGCCTTCACATACAAGGACGAGTCCATAAAGTTCCCACT



GAAGGGTACTTTAGGCGGTGCTAGGGTGCAGTTCGACCGCGATCACCTGAGACGG



TACCCCCACAAGGTGGAGAGCGGGAACGTGGGACGAATCTACTTTAATATGACAG



TGAACATTGAACCCACAGAGAGTCCAGTTAGTAAATCCCTGAAAATTCACCGTGA



CGACTTTCCGAAATTTGTGAATTTCAAGCCAAAGGAGCTTACGGAGTGGATCAAG



GATTCAAAGGGAAAGAAGCTGAAATCTGGTATCGAATCTCTCGAGATCGGTCTCC



GTGTCATGAGCATCGATCTGGGACAGCGCCAGGCAGCTGCCGCCAGTATATTCGA



GGTGGTAGACCAAAAGCCTGACATCGAGGGAAAGCTCTTCTTCCCAATCAAAGGC



ACAGAGCTGTATGCGGTGCACCGGGCGTCCTTTAATATAAAGCTGCCCGGTGAAA



CCCTGGTGAAGTCACGGGAGGTGCTTAGAAAAGCGCGAGAGGATAACCTCAAACT



GATGAACCAAAAACTGAACTTTCTGAGGAACGTCCTGCACTTTCAGCAGTTCGAA



GATATTACCGAACGCGAAAAGAGAGTAACCAAGTGGATATCTCGTCAAGAGAACA



GCGACGTCCCGTTAGTCTATCAGGACGAACTCATCCAAATACGGGAGTTGATGTA



TAAGCCCTACAAGGATTGGGTCGCCTTTCTTAAGCAGCTTCACAAACGCCTAGAG



GTCGAAATAGGTAAAGAGGTGAAACATTGGCGGAAGTCGCTCAGCGACGGGAGGA



AGGGACTTTATGGCATCTCTTTGAAGAACATTGACGAAATCGATAGAACCAGAAA



ATTTTTGTTGAGATGGTCCCTCCGACCCACCGAGCCTGGAGAGGTGAGGCGGTTA



GAACCAGGACAGAGGTTCGCTATCGATCAGCTGAATCACCTCAATGCTCTGAAGG



AGGACCGCCTCAAGAAAATGGCCAATACAATCATAATGCACGCCCTTGGCTACTG



CTACGACGTCCGAAAGAAGAAGTGGCAGGCCAAGAATCCCGCCTGTCAAATTATC



CTTTTTGAGGATCTTAGCAATTACAACCCCTATGAAGAGCGGTCCAGATTCGAAA



ATAGTAAGCTCATGAAGTGGAGCCGCAGGGAGATCCCGCGCCAAGTGGCCCTTCA



GGGGGAAATTTATGGGCTGCAGGTAGGCGAGGTCGGGGCCCAATTCTCCTCGCGC



TTTCATGCGAAAACTGGAAGTCCTGGAATCCGGTGCTCAGTGGTGACAAAGGAGA



AGTTGCAAGACAATCGGTTTTTTAAAAACTTACAGCGGGAGGGAAGGCTGACCCT



GGATAAGATAGCCGTACTTAAGGAAGGAGATCTGTACCCTGACAAAGGCGGTGAA



AAGTTCATTAGCTTGAGCAAGGACCGAAAACTTGTGACCACCCACGCTGACATCA



ATGCGGCACAGAACCTGCAGAAGAGATTTTGGACTCGCACCCACGGATTCTACAA



AGTTTACTGCAAAGCATATCAAGTAGACGGACAGACCGTATACATCCCCGAGTCC



AAAGATCAGAAGCAGAAAATTATTGAAGAGTTTGGGGAAGGGTACTTTATCCTGA



AGGATGGTGTCTACGAATGGGGCAACGCTGGTAAACTTAAAATTAAGAAGGGCAG



CTCTAAACAGTCCTCCAGCGAGTTAGTTGATTCTGATATTCTGAAAGACAGTTTC



GACCTGGCCAGCGAACTTAAAGGGGAAAAATTAATGCTGTACCGGGACCCCAGCG



GAAACGTCTTTCCATCCGATAAGTGGATGGCCGCTGGAGTGTTCTTTGGCAAGTT



AGAGAGGATTCTCATAAGTAAGCTGACCAACCAATACTCAATCTCCACAATCGAG



GATGACTCATCCAAGCAGTCTATGTGA





SEQ ID NO: 161
ATGCCTACACGCACTATCAACCTGAAACTGGTTCTTGGCAAGAATCCAGAGAATG



CTACCCTTCGTCGGGCACTATTTTCAACGCATAGACTGGTGAATCAGGCTACCAA



ACGGATTGAAGAGTTCCTCTTGCTTTGTCGGGGGGAAGCATATAGGACGGTGGAT



AATGAGGGGAAAGAGGCTGAAATTCCGAGACACGCCGTGCAGGAGGAAGCTCTTG



CGTTTGCAAAGGCCGCTCAACGGCACAATGGTTGCATCTCTACTTATGAAGACCA



GGAAATCCTGGATGTGCTCCGGCAACTGTATGAAAGGCTGGTGCCTTCTGTGAAT



GAAAATAATGAAGCAGGGGACGCTCAAGCCGCAAACGCGTGGGTGTCGCCACTGA



TGTCCGCCGAGTCCGAGGGAGGGCTCAGCGTTTACGACAAGGTGCTGGACCCACC



CCCAGTGTGGATGAAACTCAAAGAGGAAAAAGCTCCGGGCTGGGAGGCTGCTTCC



CAGATCTGGATCCAGTCCGACGAAGGGCAGTCCCTTCTTAACAAGCCTGGTTCGC



CCCCGCGGTGGATTAGGAAACTGAGGTCAGGCCAGCCTTGGCAGGACGATTTTGT



TAGCGACCAGAAAAAGAAGCAGGACGAGCTGACAAAGGGGAATGCGCCACTGATC



AAACAATTAAAGGAAATGGGCTTATTGCCTCTTGTGAATCCCTTTTTTAGACATC



TGCTTGACCCGGAGGGGAAGGGGGTGTCACCTTGGGACAGACTCGCTGTTAGGGC



CGCTGTCGCTCATTTCATATCATGGGAATCATGGAACCACCGGACACGCGCCGAA



TACAATAGTTTGAAGCTGCGGAGGGATGAGTTCGAAGCAGCTTCCGACGAATTCA



AGGACGACTTCACGCTGCTTCGGCAGTACGAGGCTAAGAGGCACTCCACACTGAA



GAGTATAGCTTTAGCCGATGATTCAAACCCTTATAGGATCGGCGTACGCTCCCTC



CGCGCTTGGAACCGCGTCCGCGAGGAGTGGATCGACAAGGGAGCGACCGAGGAGC



AGCGGGTCACCATTCTCAGCAAGTTGCAGACCCAACTAAGGGGCAAATTTGGAGA



TCCTGACTTGTTCAACTGGCTGGCGCAGGACCGGCACGTGCACCTCTGGAGCCCT



AGAGATAGTGTTACCCCACTGGTTAGGATCAACGCTGTTGACAAAGTATTGCGAC



GGAGAAAACCGTACGCCTTGATGACTTTTGCCCACCCAAGATTCCACCCTCGGTG



GATACTTTACGAAGCCCCAGGGGGCAGCAATCTCCGCCAGTATGCACTGGATTGT



ACCGAAAATGCTCTGCACATTACACTGCCTCTGCTGGTTGACGATGCACATGGCA



CATGGATTGAGAAAAAAATTAGGGTTCCTCTTGCCCCCAGCGGCCAGATTCAGGA



CCTGACACTAGAAAAGCTCGAGAAGAAGAAAAATCGTCTCTACTACCGTTCTGGG



TTCCAGCAGTTTGCCGGCCTGGCCGGAGGTGCCGAGGTGCTTTTCCATCGACCAT



ACATGGAGCACGATGAGAGGAGCGAGGAGAGCTTATTAGAACGCCCTGGTGCTGT



TTGGTTCAAACTCACCTTGGACGTGGCAACCCAGGCCCCTCCAAACTGGTTGGAC



GGAAAGGGCCGCGTCCGAACGCCCCCCGAGGTTCACCACTTCAAGACAGCCCTCA



GTAACAAGTCTAAGCACACACGGACCCTCCAGCCCGGACTCAGAGTGTTATCCGT



GGATCTGGGAATGCGCACCTTCGCCTCTTGCTCCGTATTTGAGCTGATCGAGGGC



AAACCAGAGACTGGCAGAGCGTTCCCTGTGGCCGACGAACGTTCCATGGATTCAC



CAAACAAGCTGTGGGCCAAGCACGAAAGATCCTTTAAACTCACGCTCCCCGGCGA



AACCCCCAGTCGGAAAGAAGAGGAGGAACGGAGCATTGCAAGAGCCGAAATCTAT



GCGTTGAAAAGAGATATTCAGAGATTAAAAAGTCTTCTGCGCCTGGGGGAAGAGG



ATAACGATAATAGACGCGATGCACTTCTTGAGCAATTTTTCAAGGGCTGGGGCGA



GGAAGACGTGGTTCCAGGTCAGGCCTTTCCCCGGAGTCTGTTCCAGGGGCTGGGG



GCCGCCCCATTCAGATCCACCCCTGAGTTGTGGAGACAACACTGTCAAACCTATT



ATGATAAAGCAGAGGCGTGCCTGGCTAAACACATCAGCGATTGGCGCAAGAGAAC



CAGGCCTAGGCCTACCTCACGTGAGATGTGGTACAAGACACGCTCTTATCACGGC



GGAAAGTCAATCTGGATGCTGGAATACCTCGACGCTGTGAGGAAACTGCTCTTAT



CCTGGAGCCTCAGAGGCCGGACCTACGGGGCTATCAACAGACAGGACACAGCAAG



GTTCGGGAGCTTAGCCAGCCGGCTCCTTCACCACATTAACTCACTCAAAGAGGAT



CGAATAAAGACCGGAGCCGACTCGATCGTGCAGGCAGCCCGAGGGTACATCCCCC



TGCCTCATGGGAAGGGCTGGGAGCAGCGATATGAACCCTGCCAGCTGATCTTGTT



TGAGGACCTTGCCCGTTATAGATTTCGCGTTGATAGACCTCGCCGTGAGAATTCT



CAGCTGATGCAGTGGAACCACAGAGCGATCGTGGCTGAGACCACTATGCAGGCCG



AGCTGTATGGACAGATCGTGGAGAACACCGCCGCAGGGTTCAGTTCTCGGTTTCA



TGCTGCCACCGGAGCTCCCGGCGTCCGGTGCCGCTTCCTCTTAGAGCGTGATTTT



GACAATGACCTCCCAAAGCCCTATCTGCTGAGGGAACTGAGCTGGATGCTGGGGA



ACACAAAAGTAGAATCGGAGGAGGAGAAGCTACGGCTCCTCTCCGAAAAGATACG



TCCAGGCTCTCTGGTACCATGGGACGGAGGAGAGCAGTTCGCGACACTGCATCCT



AAGAGACAGACGTTATGTGTGATTCACGCCGATATGAACGCCGCTCAGAATCTGC



AGCGAAGATTCTTTGGCCGCTGCGGCGAAGCCTTCAGGCTGGTATGTCAGCCCCA



CGGGGATGATGTGCTGCGGCTGGCCTCAACCCCTGGGGCTAGACTCTTGGGGGCA



CTCCAGCAGCTGGAAAATGGCCAAGGGGCTTTCGAACTCGTTCGGGACATGGGCA



GCACAAGCCAGATGAACAGATTCGTCATGAAGAGCCTGGGAAAGAAAAAGATCAA



ACCCTTACAGGACAATAATGGCGACGACGAACTGGAGGACGTGTTGTCCGTGCTG



CCAGAGGAAGACGACACAGGCCGCATCACTGTCTTCCGCGACTCAAGTGGGATAT



TCTTTCCTTGCAACGTGTGGATTCCGGCCAAACAGTTCTGGCCTGCCGTCAGAGC



CATGATTTGGAAAGTGATGGCTAGTCATTCATTGGGATGA





SEQ ID NO: 162
ATGACAAAGCTGAGGCACAGACAAAAGAAGCTTACACACGACTGGGCAGGGAGCA



AGAAACGTGAGGTCCTTGGGTCAAATGGAAAACTGCAGAACCCCTTGCTCATGCC



TGTAAAGAAGGGGCAGGTAACAGAATTTAGAAAAGCATTCTCCGCGTACGCTCGG



GCAACTAAGGGGGAAATGACCGATGGACGGAAGAACATGTTCACCCATTCTTTCG



AGCCATTCAAAACAAAGCCGTCATTGCACCAATGCGAGCTGGCCGATAAGGCTTA



CCAGTCTTTGCATAGTTACCTCCCCGGTTCCCTGGCCCATTTCTTGCTTTCCGCA



CACGCACTGGGCTTTCGTATTTTCTCTAAATCTGGGGAGGCAACTGCCTTCCAGG



CCAGCTCAAAAATCGAGGCCTATGAGTCCAAGCTCGCTTCGGAGCTAGCCTGTGT



CGATTTGAGTATCCAGAATTTGACGATTAGTACTCTTTTCAACGCTCTCACAACT



TCAGTTCGGGGCAAGGGGGAGGAAACTTCAGCAGATCCCCTTATCGCACGGTTCT



ACACTCTCCTGACGGGCAAGCCCCTGAGCCGAGACACACAGGGCCCAGAACGGGA



CTTGGCAGAGGTCATCTCCAGAAAGATCGCCTCGTCCTTCGGCACATGGAAGGAA



ATGACTGCCAACCCTCTGCAGAGCCTCCAGTTCTTCGAAGAAGAGCTTCATGCAC



TAGATGCCAACGTGTCTTTATCTCCAGCTTTTGATGTGTTAATCAAGATGAATGA



TCTCCAAGGTGATCTGAAGAACCGTACTATAGTGTTCGACCCAGATGCACCCGTG



TTCGAGTACAACGCTGAGGATCCAGCCGATATCATCATAAAGCTGACAGCTCGGT



ATGCGAAGGAGGCCGTCATCAAGAATCAGAACGTGGGCAATTATGTGAAAAACGC



CATTACCACCACTAATGCCAATGGGCTGGGGTGGCTCCTCAATAAAGGGCTTTCA



CTACTGCCAGTTTCTACTGACGATGAGCTGCTCGAATTCATTGGGGTGGAGAGAA



GCCATCCCAGCTGTCACGCGCTGATAGAGCTGATTGCCCAGCTAGAGGCGCCGGA



ACTGTTTGAGAAGAATGTGTTTAGTGACACCCGTTCCGAGGTTCAGGGTATGATC



GACAGTGCAGTGTCGAACCACATTGCTCGGCTGTCCAGCAGCCGAAACTCCCTGA



GCATGGACAGCGAGGAATTGGAACGCTTGATTAAATCTTTCCAGATTCATACTCC



CCATTGTTCTCTGTTCATAGGCGCTCAGTCCTTATCTCAGCAGCTGGAGAGCTTA



CCTGAGGCGCTGCAGTCCGGAGTGAACAGCGCTGATATCTTATTAGGCAGCACAC



AGTATATGCTGACCAACTCTCTCGTTGAAGAGTCAATTGCAACATATCAAAGGAC



ATTAAATAGGATCAATTACCTGAGTGGGGTGGCTGGGCAGATTAACGGTGCTATC



AAAAGAAAGGCAATCGACGGCGAAAAAATACACCTGCCTGCCGCCTGGAGTGAGC



TCATCTCCTTACCTTTCATTGGACAGCCGGTGATTGATGTGGAGAGCGACCTGGC



ACACTTAAAAAACCAGTACCAGACCCTGTCCAATGAATTTGACACCCTCATTTCG



GCCCTGCAGAAGAACTTCGATTTGAATTTCAACAAAGCACTCCTTAACCGCACGC



AGCATTTCGAGGCAATGTGCCGGAGCACAAAAAAAAATGCTTTATCTAAGCCCGA



GATCGTGTCCTACAGAGATCTGCTGGCGCGGCTGACCAGTTGCCTTTATCGAGGC



TCGCTGGTTCTCAGAAGGGCGGGAATCGAAGTTCTGAAAAAGCACAAAATCTTTG



AGTCGAATAGTGAGCTGAGAGAACACGTCCACGAGCGAAAGCACTTCGTGTTCGT



TAGTCCATTGGACAGAAAGGCAAAAAAACTGTTGCGCCTGACCGATTCCCGCCCT



GACTTGCTCCATGTGATCGATGAGATCCTGCAACATGACAATCTGGAGAATAAGG



ACAGAGAGTCCCTTTGGCTGGTCCGGTCTGGGTACCTCCTTGCTGGTCTGCCGGA



CCAGCTGAGTTCTTCGTTTATCAATCTCCCCATAATCACGCAAAAGGGCGATCGC



CGGCTGATTGACCTGATTCAGTATGACCAGATCAATCGCGATGCTTTCGTAATGT



TGGTGACAAGTGCTTTCAAAAGCAATCTCTCTGGGTTGCAGTACCGCGCTAACAA



GCAGTCTTTCGTGGTCACCCGCACCCTGTCTCCTTACCTGGGTAGTAAGCTCGTA



TACGTCCCTAAAGACAAAGATTGGCTGGTCCCATCCCAGATGTTTGAGGGAAGAT



TCGCCGATATTCTGCAGAGTGACTACATGGTCTGGAAGGATGCCGGACGCCTGTG



CGTGATCGACACTGCCAAACATCTCTCTAACATTAAAAAAAGCGTGTTTAGTAGC



GAAGAAGTCCTTGCTTTTCTTCGAGAGCTGCCTCACCGGACCTTCATCCAGACCG



AGGTACGGGGGTTAGGAGTGAACGTCGATGGAATCGCATTTAATAACGGGGATAT



CCCGAGCTTGAAGACATTCTCGAATTGTGTGCAGGTGAAGGTGAGTAGGACTAAT



ACTAGTCTCGTGCAGACTCTAAACAGGTGGTTCGAGGGTGGCAAAGTGTCACCTC



CCTCTATTCAGTTCGAAAGAGCTTACTACAAAAAAGACGATCAGATTCACGAGGA



CGCAGCCAAGAGAAAGATACGCTTCCAGATGCCAGCAACGGAATTAGTGCACGCC



AGCGATGACGCTGGTTGGACCCCCAGCTACCTGCTGGGCATCGACCCCGGTGAGT



ACGGAATGGGTCTCAGTTTGGTGTCCATCAACAATGGAGAGGTCCTGGATTCTGG



ATTCATCCACATTAATTCCCTGATCAATTTCGCGTCCAAAAAAAGCAATCACCAG



ACCAAAGTAGTCCCCCGCCAGCAGTACAAGTCCCCCTACGCGAATTATCTCGAGC



AGTCAAAGGATTCAGCAGCAGGGGATATAGCTCACATTCTGGATCGGCTAATCTA



CAAATTGAACGCCTTGCCTGTGTTCGAGGCGCTGTCTGGCAACAGTCAGAGTGCT



GCTGATCAGGTATGGACCAAAGTTCTATCCTTCTATACATGGGGAGACAACGACG



CACAGAACAGTATACGGAAGCAGCACTGGTTCGGTGCCTCACACTGGGATATTAA



GGGGATGCTGCGCCAACCCCCAACCGAAAAAAAACCCAAACCATATATAGCCTTT



CCCGGGAGTCAAGTGTCATCCTATGGAAATAGTCAAAGGTGTAGTTGTTGCGGCC



GCAATCCCATTGAGCAGTTGCGTGAGATGGCAAAGGACACGAGTATCAAGGAGCT



GAAAATCCGAAATAGTGAGATCCAACTATTCGATGGTACAATCAAGCTGTTTAAC



CCCGACCCTTCCACCGTCATCGAGAGGCGGCGGCATAACCTAGGACCCTCACGCA



TTCCTGTGGCAGACCGAACTTTCAAGAATATTAGCCCTTCTTCGTTAGAGTTCAA



GGAGCTCATTACTATCGTTTCTCGAAGCATCCGCCATAGCCCCGAATTTATTGCT



AAGAAACGGGGTATCGGGTCTGAGTACTTTTGTGCTTATTCTGACTGCAACTCCT



CACTGAACTCAGAGGCCAATGCCGCGGCCAATGTGGCACAGAAGTTTCAGAAGCA



ACTCTTTTTCGAACTCTGA





SEQ ID NO: 163
ATGAAACGTATTCTGAACTCTCTGAAAGTCGCCGCACTGAGGCTGCTGTTTCGAG



GAAAGGGCTCAGAGCTGGTGAAGACCGTCAAGTACCCTCTGGTTTCGCCCGTCCA



GGGTGCTGTGGAAGAACTCGCCGAAGCAATACGCCACGACAACCTACATTTATTT



GGGCAGAAGGAAATCGTAGATCTGATGGAGAAGGACGAGGGCACCCAGGTCTACT



CGGTGGTGGACTTTTGGCTCGACACACTCCGTCTAGGGATGTTCTTCAGTCCAAG



TGCTAATGCCCTTAAGATCACTCTGGGGAAGTTTAACAGCGACCAAGTTTCCCCT



TTCAGGAAGGTTCTGGAGCAGTCCCCTTTCTTTCTCGCGGGTAGACTCAAAGTGG



AGCCCGCTGAACGTATCCTCAGCGTGGAGATCCGCAAGATCGGTAAGAGGGAGAA



TAGAGTGGAGAACTACGCCGCAGATGTAGAGACTTGTTTTATCGGTCAGCTGTCT



AGTGATGAAAAGCAGTCTATCCAGAAGCTCGCTAACGATATCTGGGACTCTAAGG



ATCACGAAGAGCAAAGGATGCTTAAGGCGGATTTCTTTGCCATTCCCCTCATCAA



AGACCCAAAGGCAGTGACCGAGGAAGATCCCGAGAATGAAACCGCAGGCAAACAG



AAGCCTCTCGAATTATGTGTGTGCTTAGTGCCCGAGTTGTACACCCGCGGGTTCG



GTTCAATAGCGGACTTCCTGGTCCAGCGTCTGACACTATTAAGAGACAAAATGAG



CACAGACACAGCAGAAGACTGCCTTGAGTATGTCGGCATAGAGGAGGAGAAGGGT



AATGGGATGAACTCGCTGCTGGGGACGTTCCTCAAGAACCTGCAGGGAGACGGGT



TCGAACAGATCTTCCAATTTATGCTCGGCAGTTACGTGGGATGGCAAGGTAAGGA



AGACGTCCTACGCGAACGGCTTGATTTGCTAGCGGAGAAGGTTAAAAGACTGCCG



AAACCTAAGTTTGCCGGCGAGTGGTCCGGCCATCGGATGTTCCTGCATGGTCAAT



TGAAGAGCTGGTCCTCTAACTTTTTCCGCCTGTTTAACGAGACTAGGGAGCTCCT



CGAAAGCATAAAATCCGACATCCAACACGCGACCATGTTAATCAGCTACGTCGAA



GAGAAAGGGGGATACCACCCACAACTCTTGTCACAGTACAGGAAACTAATGGAGC



AGCTGCCAGCTCTCAGAACAAAGGTGTTAGATCCAGAGATAGAAATGACTCACAT



GAGCGAGGCGGTAAGGTCGTACATTATGATCCACAAGTCGGTAGCAGGATTTCTG



CCTGACTTACTCGAGTCCCTCGATAGGGACAAGGACAGGGAATTCCTGCTGAGTA



TATTTCCAAGGATCCCCAAAATTGACAAAAAAACTAAGGAAATCGTGGCCTGGGA



GCTCCCAGGCGAGCCCGAAGAAGGATACCTGTTCACTGCCAATAATCTTTTTCGC



AACTTTCTGGAGAATCCTAAACATGTTCCACGTTTCATGGCAGAAAGGATCCCGG



AAGATTGGACGCGCCTGCGGTCCGCTCCCGTATGGTTTGACGGCATGGTGAAACA



ATGGCAGAAAGTGGTAAACCAGCTGGTGGAGTCACCTGGAGCATTGTATCAGTTC



AATGAAAGCTTTCTCCGACAACGTTTACAGGCAATGCTGACAGTGTATAAGAGAG



ACCTGCAGACAGAGAAATTCCTTAAGTTGTTGGCTGATGTCTGCAGGCCTCTGGT



GGACTTCTTTGGGCTGGGGGGAAACGATATCATCTTCAAAAGCTGCCAGGACCCG



AGGAAACAATGGCAAACTGTCATTCCCTTGAGTGTCCCCGCTGATGTGTACACCG



CGTGTGAGGGGCTGGCAATCCGGCTTCGTGAGACATTGGGATTTGAGTGGAAGAA



CCTTAAGGGCCATGAAAGGGAGGACTTTCTAAGACTGCACCAGCTTTTAGGGAAT



CTGCTTTTCTGGATTCGAGATGCCAAACTGGTGGTGAAATTGGAAGATTGGATGA



ATAATCCCTGTGTTCAGGAGTACGTTGAGGCTCGTAAGGCCATTGATCTCCCACT



GGAGATCTTCGGCTTTGAGGTCCCCATCTTCCTGAACGGATATCTGTTTAGTGAA



CTGAGGCAGTTAGAACTGCTGCTCCGCCGTAAGTCGGTTATGACCAGCTATTCGG



TTAAGACAACTGGCAGTCCAAACAGGCTTTTCCAGTTAGTCTACCTGCCATTAAA



TCCTTCCGACCCTGAGAAAAAAAATTCTAATAACTTTCAGGAACGCCTGGACACC



CCCACTGGCTTATCACGTCGCTTCCTGGACCTTACTCTGGACGCCTTCGCCGGCA



AGTTGCTGACAGACCCCGTGACTCAAGAGCTTAAAACTATGGCTGGGTTCTACGA



TCACCTGTTTGGTTTCAAGCTCCCATGTAAGCTGGCAGCCATGTCTAACCACCCT



GGCTCTAGCAGCAAGATGGTCGTGTTGGCCAAACCTAAAAAAGGGGTTGCATCTA



ATATAGGATTCGAACCAATCCCTGATCCCGCGCACCCCGTATTCCGGGTGAGATC



ATCATGGCCAGAGCTGAAGTATCTGGAGGGGTTACTGTATCTTCCAGAAGACACT



CCACTGACAATAGAGCTCGCAGAGACAAGTGTTAGTTGTCAGAGCGTCAGTAGCG



TGGCATTCGATCTGAAAAATCTGACTACTATCCTTGGACGCGTGGGTGAGTTCCG



TGTGACCGCAGACCAGCCTTTTAAGTTGACCCCCATCATCCCTGAGAAGGAGGAG



TCCTTCATAGGAAAAACATATCTAGGCCTTGATGCCGGGGAACGCTCAGGCGTAG



GGTTCGCTATCGTCACAGTCGACGGGGATGGGTACGAGGTACAGCGCCTGGGGGT



GCATGAAGATACACAGCTGATGGCCCTACAGCAGGTGGCCTCTAAAAGCTTGAAG



GAGCCGGTGTTCCAGCCGCTCAGAAAGGGTACTTTTCGGCAGCAGGAACGTATTA



GAAAATCTCTCAGAGGATGTTATTGGAACTTCTATCACGCTCTGATGATTAAGTA



CCGCGCCAAGGTAGTGCACGAAGAGAGCGTGGGCAGTTCCGGCCTGGTTGGGCAG



TGGTTACGAGCATTCCAGAAGGACCTCAAGAAAGCCGATGTGTTGCCAAAAAAGG



GAGGCAAAAACGGAGTCGATAAGAAAAAGAGAGAGTCTTCTGCACAAGACACATT



GTGGGGAGGGGCTTTTAGCAAGAAGGAAGAACAGCAGATAGCTTTCGAAGTCCAA



GCTGCTGGTTCTAGCCAGTTCTGCCTGAAGTGCGGATGGTGGTTCCAACTCGGAA



TGCGTGAGGTTAATCGCGTGCAGGAATCCGGCGTCGTGCTGGATTGGAATCGGAG



TATTGTCACATTCCTGATTGAGAGCTCTGGCGAGAAAGTGTATGGGTTCTCCCCT



CAGCAACTCGAAAAGGGGTTCAGACCAGACATTGAAACCTTCAAGAAGATGGTTC



GGGATTTCATGCGCCCGCCTATGTTTGACCGGAAGGGTCGCCCAGCAGCTGCCTA



CGAAAGGTTTGTCTTGGGACGCCGGCATCGGCGGTATAGATTCGACAAGGTTTTT



GAAGAACGATTCGGACGATCCGCGCTATTCATTTGCCCGAGGGTTGGCTGTGGCA



ACTTTGACCACAGCAGCGAGCAGTCAGCCGTAGTGCTGGCTCTAATCGGATATAT



TGCCGACAAAGAGGGGATGAGCGGAAAAAAGCTAGTCTACGTGCGTCTGGCAGAA



CTAATGGCGGAATGGAAATTGAAGAAACTGGAGAGGAGTAGAGTTGAGGAGCAAA



GCTCCGCTCAGTGA





SEQ ID NO: 164
ATGGCGGAGTCGAAGCAAATGCAGTGCAGGAAGTGTGGAGCCTCTATGAAGTACG



AAGTGATCGGCCTCGGGAAGAAAAGCTGCAGATATATGTGTCCCGACTGCGGGAA



TCACACATCTGCAAGAAAGATTCAGAATAAGAAGAAAAGGGACAAGAAGTATGGA



TCTGCCAGTAAAGCACAAAGCCAACGAATCGCAGTTGCAGGGGCCTTATACCCGG



ATAAAAAGGTTCAGACCATCAAGACTTATAAGTATCCAGCCGACCTGAATGGTGA



GGTCCATGACTCAGGGGTGGCCGAAAAAATAGCCCAAGCAATCCAGGAGGATGAA



ATAGGGCTCCTCGGCCCCTCTTCCGAGTACGCCTGTTGGATCGCTAGCCAGAAAC



AGAGCGAGCCCTACAGTGTTGTAGACTTTTGGTTTGACGCTGTGTGCGCCGGAGG



CGTGTTCGCCTATTCTGGGGCTAGATTGCTGTCTACCGTCCTGCAGCTATCTGGG



GAGGAGAGCGTCCTACGCGCAGCCCTGGCATCCTCCCCTTTTGTCGACGATATCA



ATCTGGCACAGGCCGAAAAATTTCTGGCGGTGTCCAGGCGAACCGGCCAAGATAA



GCTGGGGAAGCGCATTGGAGAGTGCTTCGCAGAGGGCCGACTTGAGGCCCTAGGC



ATCAAGGACCGGATGCGTGAATTTGTCCAGGCTATCGATGTCGCTCAGACCGCTG



GGCAGCGTTTTGCCGCGAAACTGAAAATCTTTGGGATTTCTCAGATGCCCGAGGC



AAAGCAGTGGAACAATGACAGCGGACTCACCGTGTGCATCCTGCCCGACTATTAC



GTCCCAGAAGAAAATCGCGCAGATCAGTTGGTCGTCCTGCTAAGACGACTGAGAG



AGATAGCATACTGTATGGGGATCGAAGATGAGGCCGGTTTTGAACATCTTGGAAT



TGATCCTGGCGCACTATCAAATTTTTCCAATGGCAATCCTAAACGCGGATTTTTG



GGCCGCCTGCTGAACAATGATATTATTGCCTTAGCGAACAACATGTCCGCCATGA



CGCCTTACTGGGAGGGCAGGAAGGGAGAACTGATTGAAAGATTGGCTTGGCTGAA



GCACCGTGCAGAGGGGCTTTATCTGAAGGAACCGCATTTTGGAAATAGTTGGGCC



GACCATAGGTCTAGAATTTTTTCCAGAATAGCCGGGTGGCTTTCTGGGTGCGCTG



GGAAGCTAAAGATCGCCAAAGACCAGATCAGCGGAGTGCGTACTGATCTGTTCCT



TCTGAAGAGACTGCTGGATGCGGTCCCGCAGTCCGCCCCTTCTCCCGACTTCATA



GCCTCTATCTCTGCCTTGGATCGCTTCCTGGAGGCCGCAGAATCTAGTCAGGATC



CTGCCGAACAGGTGAGGGCCCTATACGCCTTTCATCTGAACGCACCCGCGGTGCG



AAGCATCGCCAACAAGGCAGTCCAGCGATCCGACAGCCAAGAATGGCTTATAAAG



GAACTGGACGCTGTGGACCACCTGGAGTTTAACAAGGCCTTTCCCTTCTTCTCTG



ATACGGGAAAGAAGAAAAAGAAAGGGGCTAACTCGAATGGCGCTCCGTCCGAGGA



GGAGTACACCGAGACTGAGAGCATCCAGCAGCCCGAGGACGCTGAGCAAGAGGTT



AATGGTCAGGAAGGCAACGGGGCCTCGAAGAACCAGAAGAAGTTTCAGAGAATCC



CCCGATTCTTCGGCGAGGGGAGTCGCAGCGAGTATCGCATCCTCACTGAAGCCCC



GCAGTACTTCGACATGTTCTGTAACAACATGCGGGCCATCTTTATGCAATTAGAA



TCCCAACCGCGTAAAGCTCCCAGGGATTTTAAGTGTTTCCTGCAGAATCGGCTGC



AGAAATTGTATAAGCAGACATTCCTGAACGCTCGATCCAACAAGTGCCGGGCATT



ACTAGAGTCCGTATTGATTAGTTGGGGAGAGTTTTACACCTACGGGGCTAACGAG



AAAAAATTTCGACTGCGTCATGAAGCTTCTGAGCGCTCCTCGGACCCAGATTACG



TGGTGCAACAGGCGCTGGAGATCGCTCGGAGGCTGTTTCTCTTCGGCTTTGAGTG



GAGGGACTGTAGCGCAGGTGAAAGAGTGGATCTGGTCGAAATACATAAGAAAGCC



ATATCTTTCCTGTTGGCCATCACTCAGGCTGAGGTGTCTGTGGGCAGCTATAACT



GGCTGGGCAATTCTACCGTGAGTCGGTACCTGTCCGTGGCAGGGACTGATACCCT



TTACGGCACCCAGCTGGAAGAATTCTTAAATGCAACCGTGTTATCTCAGATGCGG



GGGCTGGCTATCAGGTTATCATCTCAGGAACTGAAGGATGGATTTGACGTACAGC



TGGAGTCTAGTTGCCAGGATAATCTGCAACACTTGCTCGTGTACAGGGCTTCACG



AGACCTTGCCGCCTGCAAGCGCGCTACTTGTCCAGCTGAGTTGGATCCTAAGATT



CTGGTACTGCCCGTGGGGGCCTTTATCGCTAGCGTGATGAAAATGATTGAAAGAG



GGGATGAGCCTTTAGCTGGAGCTTATCTGAGACACAGACCCCATAGTTTCGGGTG



GCAGATCCGCGTTCGAGGTGTGGCAGAGGTGGGAATGGACCAAGGGACCGCCCTG



GCGTTCCAGAAACCGACCGAGAGCGAACCCTTCAAGATAAAGCCGTTTTCCGCTC



AATACGGCCCCGTTCTATGGCTGAACAGCTCCAGTTATAGCCAGAGCCAGTACCT



GGACGGGTTCCTATCACAGCCCAAGAACTGGAGTATGCGGGTGCTGCCACAGGCC



GGCTCAGTGCGGGTAGAACAGCGCGTCGCCTTGATTTGGAATCTCCAGGCCGGAA



AGATGAGGCTGGAACGGAGCGGAGCGCGGGCTTTCTTCATGCCCGTCCCATTCAG



TTTCCGCCCCAGTGGCAGCGGCGACGAGGCAGTCCTGGCTCCAAATAGGTACCTG



GGACTCTTTCCACACAGCGGCGGCATAGAGTACGCTGTGGTCGATGTTCTTGACT



CTGCCGGCTTCAAAATACTCGAGAGAGGAACAATAGCCGTCAATGGCTTCTCCCA



GAAACGAGGAGAAAGACAAGAGGAAGCCCATCGCGAAAAACAAAGACGCGGTATC



TCCGATATTGGGCGCAAGAAGCCAGTCCAGGCCGAAGTCGATGCGGCCAACGAGC



TCCATCGAAAATACACCGATGTTGCTACTCGGCTGGGGTGTCGAATTGTCGTTCA



ATGGGCACCCCAACCCAAACCAGGCACTGCGCCGACCGCTCAGACTGTGTACGCT



AGGGCCGTGAGGACTGAAGCACCAAGATCCGGCAATCAGGAAGATCACGCCAGGA



TGAAATCTTCCTGGGGATACACATGGGGTACGTATTGGGAAAAAAGGAAGCCCGA



GGACATCCTCGGCATTAGTACCCAGGTGTATTGGACAGGCGGGATCGGCGAGTCC



TGCCCGGCTGTCGCCGTCGCGCTATTGGGACACATCAGGGCCACCTCAACCCAGA



CTGAATGGGAGAAAGAGGAAGTCGTGTTTGGGCGATTGAAAAAGTTCTTCCCATC



CTGA





SEQ ID NO: 165
ATGGAGAAGCGCATCAATAAAATTCGCAAGAAGCTGTCTGCCGATAACGCCACAA



AACCAGTTAGTCGAAGCGGCCCAATGAAGACCCTGCTAGTTCGAGTGATGACTGA



TGATCTGAAGAAAAGGCTCGAAAAGCGACGCAAGAAGCCTGAGGTAATGCCTCAG



GTTATAAGTAACAATGCAGCAAACAATCTGCGGATGCTGCTTGACGATTACACAA



AGATGAAGGAAGCCATTCTCCAGGTGTATTGGCAGGAGTTCAAGGATGATCACGT



AGGCCTGATGTGTAAATTCGCGCAACCTGCAAGCAAGAAGATCGACCAAAACAAG



CTGAAACCCGAGATGGATGAAAAAGGCAATTTAACAACCGCCGGATTCGCTTGTT



CCCAGTGTGGGCAGCCACTGTTCGTGTACAAGTTAGAACAGGTGTCGGAAAAAGG



AAAGGCATACACTAACTACTTTGGACGGTGCAATGTTGCAGAACACGAAAAGCTG



ATACTGCTTGCCCAGCTTAAGCCCGAAAAAGACAGCGACGAAGCGGTGACCTACA



GCCTGGGAAAATTCGGGCAGCGGGCACTGGACTTCTATTCTATCCACGTTACCAA



GGAGAGCACCCACCCAGTGAAGCCGTTGGCCCAAATCGCTGGAAACCGGTACGCC



AGCGGACCAGTCGGCAAGGCCCTGTCCGATGCCTGTATGGGCACAATTGCTTCTT



TCCTGTCCAAGTACCAGGACATCATAATCGAGCACCAAAAAGTTGTGAAAGGGAA



TCAGAAACGCCTGGAATCCCTTCGAGAACTGGCCGGCAAGGAGAACCTTGAGTAC



CCGTCCGTGACCCTGCCTCCACAGCCACATACCAAAGAGGGCGTAGACGCGTATA



ATGAGGTCATTGCCCGCGTTCGCATGTGGGTTAATTTAAACCTGTGGCAGAAATT



AAAACTAAGCCGAGATGATGCTAAACCGTTACTGAGATTGAAGGGATTCCCTAGC



TTTCCTGTGGTGGAGAGAAGGGAAAACGAGGTTGATTGGTGGAATACTATTAATG



AGGTGAAAAAGCTTATTGACGCCAAGAGGGATATGGGCAGGGTGTTCTGGAGCGG



GGTGACTGCCGAAAAGAGAAATACCATCCTCGAGGGATACAATTACCTCCCCAAC



GAGAATGATCATAAGAAAAGAGAGGGGAGCTTAGAGAATCCAAAGAAACCTGCAA



AGAGGCAATTCGGTGATCTCCTGCTCTACCTCGAGAAGAAATACGCGGGGGACTG



GGGAAAAGTTTTTGACGAAGCCTGGGAGCGCATTGACAAGAAGATCGCCGGGCTG



ACGTCTCACATTGAACGGGAAGAGGCACGGAATGCAGAGGACGCCCAGTCTAAGG



CCGTGCTGACTGACTGGCTGCGCGCAAAGGCCTCCTTCGTGCTCGAACGTCTGAA



GGAAATGGATGAGAAAGAGTTTTACGCGTGTGAAATACAGCTGCAGAAGTGGTAC



GGCGATCTAAGGGGAAATCCCTTCGCAGTGGAAGCCGAGAATAGGGTAGTTGACA



TCAGTGGGTTCTCCATCGGCAGTGATGGACATTCTATCCAGTATAGAAACCTGCT



CGCCTGGAAGTACTTAGAGAACGGCAAGAGAGAGTTCTATCTGCTGATGAACTAC



GGGAAAAAAGGTAGAATTCGCTTTACAGATGGCACCGACATAAAGAAGTCCGGAA



AGTGGCAAGGCCTCTTATACGGAGGCGGCAAAGCAAAGGTGATAGACTTGACTTT



TGACCCTGACGACGAACAGCTGATAATCTTGCCGCTGGCCTTTGGCACAAGACAA



GGTAGGGAATTTATCTGGAATGATCTTCTTTCTCTCGAGACCGGACTCATCAAGC



TCGCAAACGGAAGGGTCATCGAGAAGACAATCTACAATAAAAAGATAGGCCGAGA



CGAGCCAGCCCTGTTTGTGGCTTTGACATTTGAGCGGAGAGAGGTCGTAGATCCC



AGCAACATCAAACCCGTGAACCTGATCGGTGTTGACAGGGGCGAGAACATCCCGG



CGGTTATCGCACTGACGGATCCAGAAGGATGTCCTCTGCCCGAGTTCAAAGATTC



ATCGGGAGGGCCAACCGACATTTTGAGGATAGGGGAGGGGTACAAGGAGAAGCAG



CGAGCTATCCAGGCGGCCAAAGAAGTGGAGCAACGAAGAGCTGGTGGTTATTCTC



GCAAGTTCGCTTCCAAAAGTCGTAACCTGGCTGACGATATGGTGCGCAATTCTGC



CCGTGACCTTTTCTACCACGCCGTTACACACGACGCCGTGTTAGTGTTTGAAAAT



CTTAGTCGAGGCTTCGGGCGACAGGGGAAGCGGACCTTTATGACCGAGAGACAGT



ATACAAAAATGGAGGATTGGCTGACCGCCAAACTGGCGTATGAAGGACTCACATC



CAAGACCTATCTCTCAAAAACTTTGGCCCAGTATACATCTAAGACGTGCAGTAAC



TGTGGCTTCACCATTACCACAGCTGACTACGATGGCATGCTGGTCCGCTTAAAAA



AGACATCTGACGGCTGGGCTACTACCCTCAACAATAAAGAGCTCAAAGCCGAAGG



ACAAATTACCTATTATAACAGGTATAAAAGACAGACTGTCGAGAAGGAGTTGAGC



GCGGAGCTGGACCGCCTATCAGAGGAGTCAGGGAACAACGATATCTCTAAGTGGA



CTAAGGGACGCCGAGACGAGGCGTTGTTCTTGCTGAAAAAGCGGTTCTCTCATCG



ACCCGTGCAGGAGCAGTTCGTGTGTCTGGACTGCGGCCACGAGGTTCATGCTGAT



GAGCAAGCTGCTCTAAATATTGCCCGTAGTTGGTTGTTCCTGAACAGCAATTCAA



CAGAGTTCAAGTCATACAAGAGCGGAAAGCAGCCGTTTGTGGGCGCATGGCAGGC



ATTTTACAAAAGACGCCTGAAGGAAGTGTGGAAGCCAAACGCC





SEQ ID NO: 166
ATGAAAAGGATTAACAAAATCCGAAGGCGGCTTGTAAAGGATTCTAACACCAAAA



AGGCTGGCAAGACGGGGCCCATGAAAACATTACTCGTTAGAGTTATGACCCCCGA



CCTCAGAGAGCGACTGGAAAATTTACGCAAGAAGCCAGAGAACATACCTCAGCCA



ATTAGTAATACCTCTCGGGCAAACCTAAACAAGTTGCTTACTGATTACACGGAGA



TGAAAAAGGCCATACTGCATGTGTACTGGGAGGAGTTTCAAAAGGACCCTGTCGG



GCTAATGAGCAGGGTGGCTCAGCCTGCACCTAAAAACATCGACCAGCGGAAACTC



ATCCCAGTTAAGGACGGAAATGAGAGATTGACAAGTTCAGGTTTCGCCTGCTCAC



AGTGCTGTCAACCGCTGTACGTTTATAAGTTAGAACAAGTGAATGACAAAGGAAA



GCCTCACACAAATTATTTTGGCCGGTGTAATGTCTCTGAGCATGAGCGTCTGATT



CTGTTGTCCCCGCATAAACCGGAAGCTAATGACGAGCTCGTAACCTACAGCTTGG



GGAAGTTTGGCCAAAGAGCATTGGACTTCTATTCAATCCATGTGACCCGCGAATC



CAATCATCCCGTCAAGCCCTTGGAGCAGATAGGGGGCAATAGTTGCGCTTCTGGC



CCTGTGGGCAAAGCCCTGTCCGACGCCTGTATGGGAGCCGTGGCTTCATTCCTGA



CCAAATATCAGGATATCATCTTGGAGCACCAGAAAGTGATCAAGAAAAATGAAAA



AAGGTTAGCAAACCTCAAGGATATTGCAAGCGCTAACGGCTTGGCTTTTCCTAAA



ATCACACTTCCACCTCAGCCTCACACAAAGGAAGGCATCGAGGCATACAACAATG



TGGTGGCCCAGATCGTCATCTGGGTTAACTTAAACCTGTGGCAGAAACTTAAAAT



TGGCAGGGATGAGGCAAAACCCTTACAGCGCCTGAAAGGATTCCCCAGCTTTCCA



CTGGTGGAGCGCCAGGCTAACGAAGTGGACTGGTGGGATATGGTGTGTAACGTCA



AGAAGCTCATCAATGAAAAGAAAGAGGACGGTAAAGTCTTCTGGCAGAACCTCGC



CGGTTACAAACGGCAGGAGGCGCTGTTACCTTATCTGTCGAGTGAAGAGGACCGG



AAAAAAGGCAAGAAATTTGCTCGTTATCAGTTTGGTGATTTGCTCCTACATTTGG



AGAAGAAGCACGGCGAGGACTGGGGAAAAGTATACGATGAGGCCTGGGAGAGGAT



TGACAAAAAGGTGGAGGGACTGTCAAAGCACATCAAGCTCGAAGAAGAGCGCAGA



AGCGAGGACGCCCAATCCAAAGCAGCGCTGACTGACTGGCTGCGGGCGAAGGCCA



GTTTTGTAATCGAAGGCCTTAAAGAAGCCGACAAGGATGAATTCTGCAGATGCGA



ATTAAAACTCCAGAAGTGGTACGGCGATCTCCGAGGTAAGCCTTTCGCAATCGAG



GCCGAGAATTCCATACTGGACATTAGTGGATTCAGTAAACAGTATAATTGTGCCT



TTATATGGCAGAAGGATGGTGTCAAGAAACTCAACCTGTACCTTATTATTAATTA



TTTCAAAGGCGGGAAACTGAGATTTAAGAAGATAAAGCCTGAAGCCTTTGAGGCG



AACCGATTCTACACAGTTATTAACAAGAAATCTGGTGAAATTGTACCCATGGAGG



TAAACTTCAACTTCGATGATCCCAATCTGATTATATTGCCACTAGCTTTTGGCAA



GCGGCAGGGTAGGGAATTCATTTGGAACGATTTGCTTTCACTGGAAACAGGGTCC



CTTAAGCTGGCAAACGGGAGAGTGATTGAAAAGACATTGTACAATCGGAGGACAC



GTCAGGATGAACCTGCCCTTTTCGTGGCTCTGACATTCGAGCGCAGGGAGGTTCT



GGACTCTAGCAATATCAAGCCAATGAACCTGATCGGCATAGACCGAGGAGAGAAT



ATTCCGGCTGTGATCGCACTCACCGATCCCGAAGGATGTCCCCTTTCTCGGTTCA



AGGACTCCTTAGGCAATCCAACTCATATCCTGAGAATCGGCGAGTCATACAAGGA



GAAGCAGCGAACAATTCAGGCCGCCAAGGAAGTCGAGCAGAGGCGAGCTGGCGGC



TACAGCCGTAAATACGCTAGTAAAGCTAAGAACCTGGCCGACGATATGGTGCGCA



ATACTGCTAGAGACCTGCTGTACTATGCAGTGACGCAGGACGCAATGCTGATATT



CGAGAATCTGTCCAGAGGATTCGGAAGGCAGGGCAAGCGGACGTTCATGGCCGAG



CGCCAGTATACAAGGATGGAGGATTGGTTAACGGCCAAGCTTGCCTATGAGGGGC



TACCTAGTAAGACCTATCTGTCTAAGACGCTGGCTCAATACACCAGTAAGACCTG



CTCAAACTGTGGCTTTACAATCACTTCTGCTGATTATGATAGAGTGCTCGAGAAG



CTAAAAAAAACTGCCACCGGCTGGATGACTACTATTAATGGGAAGGAACTGAAAG



TGGAAGGACAGATTACCTATTATAATCGCTACAAGCGTCAAAACGTCGTCAAGGA



CCTGTCGGTGGAATTGGACAGACTCAGTGAAGAGTCCGTGAACAATGATATCAGC



TCCTGGACAAAAGGGCGCAGTGGGGAGGCACTCAGCTTGCTTAAAAAGAGGTTTT



CACATCGGCCGGTCCAGGAGAAATTTGTCTGCCTGAACTGCGGATTCGAGACACA



CGCCGACGAGCAGGCAGCACTGAACATTGCCAGATCCTGGCTGTTCCTTAGGTCC



CAGGAATATAAGAAGTACCAGACTAACAAAACCACGGGAAACACAGATAAAAGGG



CCTTTGTCGAAACTTGGCAATCCTTTTACCGGAAGAAGTTAAAGGAAGTGTGGAA



GCCC





SEQ ID NO: 167
ATGGATAAGAAATACTCAATAGGCTTAGCAATCGGCACAAATAGCGTCGGATGGG



CGGTGATCACTGATGAATATAAGGTTCCGTCTAAAAAGTTCAAGGTTCTGGGAAA



TACAGACCGCCACAGTATCAAAAAAAATCTTATAGGGGCTCTTTTATTTGACAGT



GGAGAGACAGCGGAAGCGACTCGTCTCAAACGGACAGCTCGTAGAAGGTATACAC



GTCGGAAGAATCGTATTTGTTATCTACAGGAGATTTTTTCAAATGAGATGGCGAA



AGTAGATGATAGTTTCTTTCATCGACTTGAAGAGTCTTTTTTGGTGGAAGAAGAC



AAGAAGCATGAACGTCATCCTATTTTTGGAAATATAGTAGATGAAGTTGCTTATC



ATGAGAAATATCCAACTATCTATCATCTGCGAAAAAAATTGGTAGATTCTACTGA



TAAAGCGGATTTGCGCTTAATCTATTTGGCCTTAGCGCATATGATTAAGTTTCGT



GGTCATTTTTTGATTGAGGGAGATTTAAATCCTGATAATAGTGATGTGGACAAAC



TATTTATCCAGTTGGTACAAACCTACAATCAATTATTTGAAGAAAACCCTATTAA



CGCAAGTGGAGTAGATGCTAAAGCGATTCTTTCTGCACGATTGAGTAAATCAAGA



CGATTAGAAAATCTCATTGCTCAGCTCCCCGGTGAGAAGAAAAATGGCTTATTTG



GGAATCTCATTGCTTTGTCATTGGGTTTGACCCCTAATTTTAAATCAAATTTTGA



TTTGGCAGAAGATGCTAAATTACAGCTTTCAAAAGATACTTACGATGATGATTTA



GATAATTTATTGGCGCAAATTGGAGATCAATATGCTGATTTGTTTTTGGCAGCTA



AGAATTTATCAGATGCTATTTTACTTTCAGATATCCTAAGAGTAAATACTGAAAT



AACTAAGGCTCCCCTATCAGCTTCAATGATTAAACGCTACGATGAACATCATCAA



GACTTGACTCTTTTAAAAGCTTTAGTTCGACAACAACTTCCAGAAAAGTATAAAG



AAATCTTTTTTGATCAATCAAAAAACGGATATGCAGGTTATATTGATGGGGGAGC



TAGCCAAGAAGAATTTTATAAATTTATCAAACCAATTTTAGAAAAAATGGATGGT



ACTGAGGAATTATTGGTGAAACTAAATCGTGAAGATTTGCTGCGCAAGCAACGGA



CCTTTGACAACGGCTCTATTCCCCATCAAATTCACTTGGGTGAGCTGCATGCTAT



TTTGAGAAGACAAGAAGACTTTTATCCATTTTTAAAAGACAATCGTGAGAAGATT



GAAAAAATCTTGACTTTTCGAATTCCTTATTATGTTGGTCCATTGGCGCGTGGCA



ATAGTCGTTTTGCATGGATGACTCGGAAGTCTGAAGAAACAATTACCCCATGGAA



TTTTGAAGAAGTTGTCGATAAAGGTGCTTCAGCTCAATCATTTATTGAACGCATG



ACAAACTTTGATAAAAATCTTCCAAATGAAAAAGTACTACCAAAACATAGTTTGC



TTTATGAGTATTTTACGGTTTATAACGAATTGACAAAGGTCAAATATGTTACTGA



AGGAATGCGAAAACCAGCATTTCTTTCAGGTGAACAGAAGAAAGCCATTGTTGAT



TTACTCTTCAAAACAAATCGAAAAGTAACCGTTAAGCAATTAAAAGAAGATTATT



TCAAAAAAATAGAATGTTTTGATAGTGTTGAAATTTCAGGAGTTGAAGATAGATT



TAATGCTTCATTAGGTACCTACCATGATTTGCTAAAAATTATTAAAGATAAAGAT



TTTTTGGATAATGAAGAAAATGAAGATATCTTAGAGGATATTGTTTTAACATTGA



CCTTATTTGAAGATAGGGAGATGATTGAGGAAAGACTTAAAACATATGCTCACCT



CTTTGATGATAAGGTGATGAAACAGCTTAAACGTCGCCGTTATACTGGTTGGGGA



CGTTTGTCTCGAAAATTGATTAATGGTATTAGGGATAAGCAATCTGGCAAAACAA



TATTAGATTTTTTGAAATCAGATGGTTTTGCCAATCGCAATTTTATGCAGCTGAT



CCATGATGATAGTTTGACATTTAAAGAAGACATTCAAAAAGCACAAGTGTCTGGA



CAAGGCGATAGTTTACATGAACATATTGCAAATTTAGCTGGTAGCCCTGCTATTA



AAAAAGGTATTTTACAGACTGTAAAAGTTGTTGATGAATTGGTCAAAGTAATGGG



GCGGCATAAGCCAGAAAATATCGTTATTGAAATGGCACGTGAAAATCAGACAACT



CAAAAGGGCCAGAAAAATTCGCGAGAGCGTATGAAACGAATCGAAGAAGGTATCA



AAGAATTAGGAAGTCAGATTCTTAAAGAGCATCCTGTTGAAAATACTCAATTGCA



AAATGAAAAGCTCTATCTCTATTATCTCCAAAATGGAAGAGACATGTATGTGGAC



CAAGAATTAGATATTAATCGTTTAAGTGATTATGATGTCGATCACATTGTTCCAC



AAAGTTTCCTTAAAGACGATTCAATAGACAATAAGGTCTTAACGCGTTCTGATAA



AAATCGTGGTAAATCGGATAACGTTCCAAGTGAAGAAGTAGTCAAAAAGATGAAA



AACTATTGGAGACAACTTCTAAACGCCAAGTTAATCACTCAACGTAAGTTTGATA



ATTTAACGAAAGCTGAACGTGGAGGTTTGAGTGAACTTGATAAAGCTGGTTTTAT



CAAACGCCAATTGGTTGAAACTCGCCAAATCACTAAGCATGTGGCACAAATTTTG



GATAGTCGCATGAATACTAAATACGATGAAAATGATAAACTTATTCGAGAGGTTA



AAGTGATTACCTTAAAATCTAAATTAGTTTCTGACTTCCGAAAAGATTTCCAATT



CTATAAAGTACGTGAGATTAACAATTACCATCATGCCCATGATGCGTATCTAAAT



GCCGTCGTTGGAACTGCTTTGATTAAGAAATATCCAAAACTTGAATCGGAGTTTG



TCTATGGTGATTATAAAGTTTATGATGTTCGTAAAATGATTGCTAAGTCTGAGCA



AGAAATAGGCAAAGCAACCGCAAAATATTTCTTTTACTCTAATATCATGAACTTC



TTCAAAACAGAAATTACACTTGCAAATGGAGAGATTCGCAAACGCCCTCTAATCG



AAACTAATGGGGAAACTGGAGAAATTGTCTGGGATAAAGGGCGAGATTTTGCCAC



AGTGCGCAAAGTATTGTCCATGCCCCAAGTCAATATTGTCAAGAAAACAGAAGTA



CAGACAGGCGGATTCTCCAAGGAGTCAATTTTACCAAAAAGAAATTCGGACAAGC



TTATTGCTCGTAAAAAAGACTGGGATCCAAAAAAATATGGTGGTTTTGATAGTCC



AACGGTAGCTTATTCAGTCCTAGTGGTTGCTAAGGTGGAAAAAGGGAAATCGAAG



AAGTTAAAATCCGTTAAAGAGTTACTAGGGATCACAATTATGGAAAGAAGTTCCT



TTGAAAAAAATCCGATTGACTTTTTAGAAGCTAAAGGATATAAGGAAGTTAAAAA



AGACTTAATCATTAAACTACCTAAATATAGTCTTTTTGAGTTAGAAAACGGTCGT



AAACGGATGCTGGCTAGTGCCGGAGAATTACAAAAAGGAAATGAGCTGGCTCTGC



CAAGCAAATATGTGAATTTTTTATATTTAGCTAGTCATTATGAAAAGTTGAAGGG



TAGTCCAGAAGATAACGAACAAAAACAATTGTTTGTGGAGCAGCATAAGCATTAT



TTAGATGAGATTATTGAGCAAATCAGTGAATTTTCTAAGCGTGTTATTTTAGCAG



ATGCCAATTTAGATAAAGTTCTTAGTGCATATAACAAACATAGAGACAAACCAAT



ACGTGAACAAGCAGAAAATATTATTCATTTATTTACGTTGACGAATCTTGGAGCT



CCCGCTGCTTTTAAATATTTTGATACAACAATTGATCGTAAACGATATACGTCTA



CAAAAGAAGTTTTAGATGCCACTCTTATCCATCAATCCATCACTGGTCTTTATGA



AACACGCATTGATTTGAGTCAGCTAGGAGGTGACTGA





SEQ ID NO: 168
ATGGATAAGAAGTATTCAATTGGACTTGCGATTGGCACTAACAGTGTGGGCTGGG



CGGTGATTACAGACGAGTATAAGGTGCCGTCAAAAAAGTTTAAAGTTCTGGGCAA



CACTGATCGCCATTCCATCAAGAAAAACCTAATCGGGGCCCTTCTTTTTGATAGT



GGCGAAACGGCCGAGGCGACGCGTCTAAAACGTACCGCGCGGCGTCGCTACACCC



GACGAAAAAACCGTATTTGTTACCTTCAGGAGATCTTCAGTAACGAAATGGCTAA



GGTGGACGATTCATTCTTCCACCGTCTGGAGGAGTCCTTTTTAGTTGAAGAAGAC



AAGAAGCATGAGCGACACCCAATTTTTGGTAACATTGTCGACGAAGTCGCCTATC



ACGAAAAATATCCGACCATTTATCACCTGCGCAAAAAACTGGTCGATAGCACGGA



TAAAGCGGATCTGCGGCTTATTTACCTGGCGCTTGCCCACATGATCAAGTTCCGC



GGCCACTTCCTGATAGAAGGAGACCTGAACCCGGATAATAGCGATGTAGACAAAC



TGTTTATTCAGCTGGTCCAGACCTACAACCAGCTGTTTGAAGAAAATCCGATTAA



TGCGTCAGGCGTGGATGCGAAAGCGATACTGAGTGCCCGCCTGTCGAAATCTCGC



CGTCTCGAAAATCTGATTGCACAGCTGCCCGGCGAAAAAAAAAACGGTCTTTTTG



GCAATCTGATCGCGCTGTCACTGGGCCTGACACCAAATTTTAAGAGCAACTTCGA



CCTGGCAGAGGATGCGAAGCTTCAACTGTCGAAGGACACCTATGACGATGATCTG



GATAATCTTCTGGCACAAATCGGTGATCAGTATGCGGATTTATTCCTTGCAGCGA



AAAACCTATCTGACGCAATTCTGTTGAGCGATATCCTCCGCGTCAACACCGAAAT



CACTAAAGCCCCCCTGTCAGCGTCGATGATTAAACGTTATGATGAGCACCATCAG



GATCTGACCTTGCTAAAGGCGCTGGTGCGACAGCAGCTTCCCGAAAAATATAAAG



AGATCTTTTTTGATCAATCGAAGAATGGTTATGCCGGATACATTGATGGCGGAGC



CAGTCAGGAAGAATTTTACAAATTCATCAAACCGATCCTGGAAAAAATGGATGGC



ACAGAAGAACTGCTTGTGAAATTGAACCGGGAAGATTTACTGCGCAAACAGCGTA



CGTTCGACAACGGCTCCATACCCCATCAGATTCACTTAGGTGAGCTGCATGCAAT



ACTCCGTCGCCAGGAAGATTTTTATCCATTTTTAAAAGACAACCGTGAGAAGATT



GAAAAAATTTTAACTTTTCGTATTCCATATTACGTCGGGCCTTTGGCCCGAGGTA



ACTCTCGATTCGCCTGGATGACGAGAAAAAGCGAGGAGACCATCACTCCGTGGAA



TTTTGAAGAGGTTGTTGATAAAGGCGCGAGCGCCCAGTCGTTTATCGAACGTATG



ACCAACTTTGATAAAAATCTGCCGAATGAAAAAGTGCTTCCGAAGCATTCTCTGT



TGTATGAATATTTCACTGTGTACAATGAGTTAACGAAAGTGAAATATGTGACCGA



AGGCATGCGGAAACCTGCTTTTCTGTCCGGAGAACAGAAAAAAGCAATTGTGGAC



CTGCTGTTCAAAACGAACCGGAAAGTAACTGTGAAGCAGCTGAAAGAGGACTACT



TCAAAAAAATCGAATGCTTCGACTCAGTAGAGATCTCTGGTGTTGAAGATCGCTT



CAACGCGAGTCTGGGAACGTACCATGATTTGTTGAAAATCATCAAAGATAAAGAC



TTTCTGGATAACGAAGAGAATGAGGACATTCTTGAAGATATTGTTTTGACACTGA



CTCTGTTTGAGGATCGCGAAATGATTGAAGAGCGCCTGAAAACGTATGCCCATTT



ATTCGATGACAAAGTCATGAAGCAGCTGAAACGTCGCCGCTATACTGGGTGGGGC



AGACTTTCACGTAAATTGATCAATGGTATAAGAGACAAACAGAGCGGCAAAACTA



TCTTAGATTTCCTGAAGAGTGATGGATTTGCCAACCGGAATTTTATGCAGCTTAT



ACATGATGACTCGCTAACGTTTAAAGAAGACATTCAGAAGGCGCAGGTCAGCGGC



CAGGGTGATTCGCTGCATGAACACATTGCAAATCTTGCCGGATCGCCAGCGATCA



AAAAAGGCATCCTTCAGACAGTAAAAGTTGTGGATGAACTGGTGAAAGTAATGGG



TCGTCACAAGCCAGAAAATATTGTGATCGAAATGGCCCGGGAAAATCAGACTACT



CAAAAAGGTCAGAAAAATTCTCGCGAGCGTATGAAACGTATTGAAGAAGGCATCA



AAGAGCTAGGCAGCCAGATATTAAAGGAACATCCGGTTGAGAACACTCAGCTGCA



GAATGAAAAACTGTATCTGTATTATCTTCAGAACGGCCGTGACATGTATGTTGAT



CAAGAACTGGATATCAATCGCTTGTCCGATTATGACGTGGATCATATTGTTCCGC



AAAGCTTTCTGAAAGACGATTCTATTGACAATAAAGTACTGACACGTTCGGACAA



AAACCGTGGTAAAAGCGATAACGTACCGTCGGAAGAAGTTGTTAAGAAAATGAAA



AATTATTGGCGCCAACTCCTGAATGCTAAATTGATTACCCAGCGGAAATTTGATA



ACTTAACCAAAGCCGAGCGGGGTGGCTTAAGTGAACTGGATAAAGCGGGTTTTAT



TAAACGCCAACTGGTAGAAACCCGCCAGATAACGAAACATGTAGCTCAAATCCTC



GATAGTCGCATGAATACGAAATATGACGAAAATGATAAATTGATCCGTGAAGTAA



AAGTGATTACTCTTAAAAGCAAATTGGTATCTGATTTTCGGAAAGATTTCCAATT



CTATAAGGTGAGAGAAATTAACAATTACCATCATGCACATGATGCGTATTTAAAT



GCAGTTGTTGGCACCGCCTTAATCAAAAAATATCCGAAATTAGAATCTGAGTTCG



TGTATGGTGATTATAAAGTTTATGATGTTCGAAAAATGATTGCTAAGTCTGAACA



GGAAATCGGCAAAGCGACCGCAAAGTATTTTTTTTATAGCAATATTATGAATTTT



TTTAAAACTGAGATTACCCTGGCGAATGGCGAAATTCGCAAACGTCCTCTGATTG



AAACCAATGGCGAAACCGGCGAGATAGTATGGGACAAGGGCCGTGATTTTGCGAC



CGTCCGGAAAGTCCTGTCAATGCCGCAGGTGAATATTGTCAAGAAAACAGAAGTT



CAGACAGGCGGTTTTAGTAAAGAGTCTATTCTGCCCAAACGTAATTCGGATAAAT



TGATTGCCCGCAAGAAAGATTGGGATCCGAAGAAATATGGTGGATTCGATTCTCC



GACGGTCGCCTATAGCGTTCTAGTCGTCGCCAAGGTCGAAAAAGGTAAATCCAAA



AAACTGAAATCTGTGAAAGAACTGTTAGGCATTACAATCATGGAACGTAGTAGTT



TTGAAAAGAACCCGATCGACTTCCTCGAGGCGAAAGGCTACAAAGAAGTCAAGAA



GGATTTGATTATTAAACTCCCAAAATATTCATTATTTGAGTTAGAAAACGGTAGG



AAGCGTATGCTGGCGAGTGCTGGGGAATTACAGAAAGGGAATGAGTTAGCACTGC



CGTCAAAATATGTGAACTTTCTGTATCTGGCCTCCCATTACGAGAAACTGAAAGG



TAGCCCGGAAGATAATGAACAGAAACAACTATTTGTCGAGCAACACAAACATTAT



CTGGATGAAATTATTGAACAGATTAGTGAATTCTCTAAACGTGTTATTTTAGCGG



ATGCCAACCTTGACAAGGTGCTGAGCGCATATAATAAACACCGTGATAAACCCAT



TCGTGAACAGGCTGAAAATATCATACATCTGTTCACGTTAACCAACTTGGGAGCT



CCTGCCGCTTTTAAATATTTCGATACCACAATTGACCGCAAACGTTATACGTCTA



CAAAAGAGGTGCTCGATGCGACCCTGATCCACCAGTCTATTACAGGCCTGTATGA



AACTCGTATCGACCTGTCACAACTGGGCGGCGACTGA





SEQ ID NO: 169
ATGGACAAGAAATATTCAATCGGTTTAGCAATAGGAACTAACTCAGTAGGTTGGG



CTGTAATTACAGACGAATACAAGGTACCGTCCAAAAAGTTTAAGGTGTTGGGGAA



CACAGATAGACACTCTATAAAAAAAAATTTAATAGGCGCTTTACTTTTCGATTCA



GGCGAAACTGCAGAAGCGACACGTCTGAAGAGAACCGCTAGACGTAGATACACGA



GGAGAAAGAACAGAATATGTTACCTACAAGAAATTTTTTCTAATGAGATGGCTAA



GGTGGATGATTCGTTTTTTCATAGACTCGAAGAATCTTTCTTAGTTGAAGAAGAT



AAAAAACACGAAAGGCATCCTATCTTTGGAAACATAGTTGATGAGGTGGCTTACC



ATGAAAAATATCCCACTATATATCACCTTAGAAAAAAGTTGGTTGATTCAACCGA



CAAAGCGGATCTAAGGTTAATTTACCTCGCGTTGGCTCACATGATAAAATTTAGA



GGACATTTCTTGATCGAAGGTGATTTAAATCCCGATAACTCTGATGTAGATAAAC



TGTTCATCCAGTTGGTTCAAACATATAATCAGTTGTTCGAAGAGAACCCCATTAA



CGCATCAGGTGTTGATGCTAAAGCAATCTTATCAGCAAGGTTGAGCAAGAGCAGA



CGTCTGGAAAACTTGATTGCCCAATTGCCAGGTGAAAAGAAGAACGGTCTTTTTG



GAAATTTAATTGCACTTTCACTTGGGTTGACACCGAATTTTAAAAGCAATTTCGA



CCTCGCTGAGGATGCTAAACTCCAGTTATCTAAGGATACATATGACGATGATTTG



GATAATCTATTGGCCCAGATAGGTGATCAGTATGCAGATTTGTTTTTGGCAGCTA



AGAATTTATCAGATGCAATTCTACTGAGCGATATTTTAAGGGTGAATACAGAAAT



AACTAAAGCACCTTTGTCTGCATCTATGATAAAAAGATACGATGAACACCATCAA



GATCTCACACTATTAAAAGCTTTAGTTAGACAACAATTACCAGAAAAATATAAAG



AAATCTTTTTCGATCAGTCCAAGAACGGATACGCCGGCTATATAGATGGCGGTGC



CTCCCAAGAAGAATTTTACAAATTTATCAAACCCATTTTGGAAAAGATGGATGGT



ACTGAAGAATTATTGGTCAAATTAAACAGGGAAGATTTATTAAGAAAACAAAGGA



CCTTTGATAATGGTTCTATTCCACACCAAATCCATCTAGGGGAATTACATGCGAT



TCTTAGAAGACAAGAAGATTTTTATCCATTCTTGAAAGATAACAGGGAAAAGATA



GAGAAAATCTTAACTTTTAGAATTCCCTACTACGTCGGGCCCTTAGCTAGGGGGA



ATTCTAGATTCGCCTGGATGACACGCAAATCAGAAGAAACAATTACGCCTTGGAA



TTTTGAAGAAGTTGTTGATAAAGGAGCCTCTGCTCAATCTTTTATTGAACGAATG



ACCAATTTTGATAAGAATTTACCCAATGAAAAGGTCTTACCCAAACATTCACTCC



TATACGAGTACTTTACTGTTTACAATGAGTTGACAAAAGTGAAGTATGTTACCGA



GGGTATGCGAAAACCTGCTTTCTTGAGTGGTGAACAAAAGAAGGCCATTGTTGAC



TTGTTATTCAAAACTAACAGAAAGGTCACTGTGAAGCAGCTTAAAGAAGATTATT



TCAAAAAGATCGAATGTTTCGACTCGGTAGAAATTAGTGGTGTGGAAGATAGATT



TAATGCTTCTCTTGGAACATATCATGATCTACTAAAGATCATCAAAGATAAAGAT



TTCTTGGACAATGAAGAAAATGAAGATATTCTTGAAGACATCGTGTTGACACTTA



CATTGTTTGAGGACAGAGAAATGATTGAAGAAAGGCTGAAGACCTACGCCCATTT



GTTTGATGATAAAGTCATGAAACAGTTAAAGAGGAGAAGGTATACCGGATGGGGT



AGGCTGTCTCGCAAATTGATTAATGGTATTCGTGATAAACAATCGGGTAAAACAA



TCCTAGATTTCCTGAAGTCCGATGGTTTCGCCAACAGGAATTTTATGCAATTGAT



TCATGACGATTCTTTGACTTTTAAAGAGGATATTCAGAAAGCACAGGTCTCAGGA



CAGGGCGATTCACTCCATGAACATATAGCTAACCTGGCTGGCTCCCCTGCTATTA



AGAAAGGTATCTTGCAAACCGTCAAAGTAGTAGACGAACTTGTTAAAGTTATGGG



AAGACACAAACCTGAAAATATCGTTATTGAAATGGCTCGCGAAAACCAGACAACA



CAAAAGGGTCAAAAGAATTCGAGAGAGAGAATGAAGCGTATCGAAGAAGGTATTA



AAGAACTTGGGTCCCAAATACTTAAAGAACATCCAGTAGAAAACACTCAGCTTCA



AAATGAAAAATTATACTTATATTATCTTCAGAATGGCCGCGATATGTATGTTGAC



CAAGAGTTAGATATAAATAGGTTGTCTGATTACGACGTGGATCATATTGTACCTC



AATCTTTTCTAAAAGATGATTCAATTGATAATAAGGTATTAACGAGAAGTGATAA



AAATAGAGGTAAATCTGACAACGTGCCAAGCGAAGAGGTGGTGAAGAAAATGAAA



AATTATTGGCGTCAACTGTTGAACGCCAAGTTAATTACGCAGAGAAAGTTTGATA



ATCTAACAAAAGCTGAAAGAGGAGGCCTATCTGAGTTAGATAAGGCCGGTTTTAT



CAAACGTCAGTTAGTTGAAACCAGGCAAATCACGAAGCACGTTGCCCAAATTCTA



GATTCAAGGATGAATACCAAATACGATGAAAACGATAAACTGATTCGGGAAGTCA



AGGTTATAACTCTAAAAAGCAAACTAGTTTCAGATTTTCGCAAAGATTTTCAATT



TTACAAAGTTCGAGAAATCAATAATTATCATCATGCTCACGACGCGTACTTGAAC



GCGGTCGTTGGTACAGCTTTAATAAAGAAATATCCTAAACTGGAATCGGAATTTG



TATATGGGGATTACAAAGTATACGACGTGAGAAAGATGATCGCTAAATCTGAACA



AGAAATTGGGAAAGCAACTGCCAAATATTTTTTTTACAGCAACATAATGAATTTT



TTTAAAACGGAAATTACATTGGCAAATGGCGAAATTAGAAAGCGCCCATTGATAG



AGACCAATGGAGAGACTGGGGAAATCGTGTGGGATAAAGGACGTGATTTTGCCAC



AGTGAGGAAAGTGTTAAGTATGCCACAAGTTAATATTGTAAAAAAGACCGAGGTC



CAAACGGGTGGATTTAGCAAAGAATCAATTTTACCTAAGAGAAATTCAGATAAAT



TAATTGCCCGCAAAAAGGATTGGGATCCTAAAAAATATGGTGGTTTTGATTCCCC



AACAGTTGCTTACTCCGTCCTAGTTGTTGCTAAGGTTGAAAAAGGAAAGTCTAAG



AAACTTAAATCCGTAAAAGAGTTACTGGGAATTACAATAATGGAAAGATCCTCTT



TCGAAAAGAACCCTATTGACTTCTTGGAGGCGAAAGGTTATAAAGAAGTCAAAAA



AGATTTGATCATAAAACTACCAAAGTATTCTCTATTTGAATTGGAAAACGGCAGA



AAAAGGATGTTGGCAAGCGCTGGTGAACTACAAAAGGGTAACGAATTGGCATTGC



CGAGTAAATACGTGAATTTTCTATATTTGGCATCACATTACGAAAAGTTAAAGGG



ATCACCCGAGGATAACGAGCAGAAACAACTGTTTGTTGAACAACACAAACATTAT



CTTGATGAAATTATAGAACAAATTAGTGAGTTCAGTAAGAGAGTTATTTTAGCCG



ATGCAAATTTAGACAAAGTTTTATCTGCTTATAACAAACATAGAGATAAGCCTAT



AAGGGAACAAGCCGAAAATATTATTCATTTGTTTACGTTAACAAATTTAGGGGCA



CCAGCAGCATTCAAGTACTTCGATACGACTATCGATCGTAAGCGTTACACATCTA



CCAAAGAAGTTCTTGATGCAACTTTGATTCATCAATCTATAACAGGCTTATATGA



AACTAGAATCGATCTGTCACAACTTGGTGGTGACTAA





SEQ ID NO: 170
ATGGACAAGAAGTACTCAATTGGGCTTGCTATCGGCACTAACAGCGTTGGCTGGG



CGGTCATCACAGACGAATATAAGGTCCCATCAAAGAAATTCAAAGTCCTTGGCAA



TACGGACCGACATTCAATCAAGAAGAACCTGATTGGAGCTCTGCTGTTTGATTCC



GGTGAAACCGCCGAGGCAACACGATTGAAACGTACCGCTCGTAGGAGGTATACGC



GGCGGAAAAATAGGATCTGCTATCTGCAGGAAATATTTAGCAACGAAATGGCCAA



GGTAGACGACAGCTTCTTCCACCGGCTCGAGGAATCTTTCCTCGTGGAAGAAGAC



AAAAAGCACGAGCGCCACCCCATTTTCGGCAATATCGTGGACGAGGTAGCTTACC



ATGAAAAGTATCCAACTATTTACCACTTACGTAAGAAGTTAGTGGACAGCACCGA



TAAAGCCGACCTTCGCCTGATTTACCTAGCACTTGCACACATGATTAAGTTCCGA



GGCCACTTCTTGATAGAGGGAGACCTGAATCCTGACAATTCCGATGTGGATAAAT



TGTTCATCCAGCTGGTACAGACATACAATCAGTTGTTTGAGGAAAATCCGATTAA



TGCCAGTGGCGTGGACGCCAAGGCTATCCTGTCTGCTCGGCTTAGTAAGAGTAGA



CGCCTGGAAAATCTAATCGCACAGCTGCCCGGCGAAAAGAAAAATGGACTGTTCG



GTAATTTGATCGCCCTGAGCCTGGGCCTCACCCCTAACTTTAAGTCTAACTTCGA



CCTGGCCGAAGATGCTAAGCTCCAGCTGTCCAAAGATACT



TACGATGACGATCTCGATAATCTACTGGCTCAGATCGGGGACCAGTACGCTGACC



TGTTTCTAGCTGCCAAGAACCTCAGTGACGCCATTCTCCTGTCCGATATTCTGAG



GGTTAACACTGAAATTACAAAGGCCCCGCTGAGCGCGAGCATGATCAAAAGGTAC



GACGAGCATCACCAGGACCTCACGCTGCTGAAGGCCTTAGTCAGACAGCAACTGC



CCGAAAAGTACAAAGAAATCTTTTTCGACCAATCCAAGAACGGGTACGCCGGCTA



CATTGATGGCGGGGCTTCACAAGAGGAGTTTTACAAGTTTATCAAGCCCATCCTG



GAGAAAATGGACGGCACTGAAGAACTGCTTGTGAAACTCAATAGGGAAGACTTAC



TGAGGAAACAGCGCACATTCGATAATGGCTCCATACCCCACCAAATCCATCTGGG



AGAGTTGCATGCCATCTTGCGAAGGCAGGAGGACTTCTACCCCTTTCTTAAGGAC



AACAGGGAGAAAATCGAGAAAATTCTGACTTTCCGTATCCCCTACTACGTGGGCC



CACTTGCTCGCGGAAACTCACGATTCGCATGGATGACCAGAAAGTCCGAGGAAAC



AATTACACCCTGGAATTTTGAGGAGGTAGTAGACAAGGGAGCCAGCGCTCAATCT



TTCATTGAGAGGATGACGAATTTCGACAAGAACCTTCCAAACGAGAAAGTGCTTC



CTAAGCACAGCCTGCTGTATGAGTATTTCACGGTGTACAACGAACTTACGAAGGT



CAAGTATGTGACAGAGGGTATGCGGAAACCTGCTTTTCTGTCTGGTGAACAGAAG



AAAGCTATCGTCGATCTCCTGTTTAAAACCAACCGAAAGGTGACGGTGAAACAGT



TGAAGGAGGATTACTTCAAGAAGATCGAGTGTTTTGATTCTGTTGAAATTTCTGG



GGTCGAGGATAGATTCAACGCCAGCCTGGGCACCTACCATGATTTGCTGAAGATT



ATCAAGGATAAGGATTTTCTGGATAATGAGGAGAATGAAGACATTTTGGAGGATA



TAGTGCTGACCCTCACCCTGTTCGAGGACCGGGAGATGATCGAGGAGAGACTGAA



AACATACGCTCACCTGTTTGACGACAAGGTCATGAAGCAGCTTAAGAGACGCCGT



TACACAGGCTGGGGAAGATTATCCCGCAAATTAATCAACGGGATACGCGATAAAC



AAAGTGGCAAGACCATACTCGACTTCCTAAAGAGCGATGGATTCGCAAATCGCAA



TTTCATGCAGTTGATCCACGACGATAGCCTGACCTTCAAAGAGGACATTCAGAAA



GCGCAGGTGAGTGGTCAAGGGGATTCCCTGCACGAACACATTGCTAACTTGGCTG



GATCACCAGCCATTAAGAAAGGCATACTGCAGACCGTTAAAGTGGTAGATGAGCT



TGTGAAAGTCATGGGAAGACATAAGCCAGAGAACATAGTGATCGAAATGGCCAGG



GAAAATCAGACCACGCAAAAGGGGCAGAAGAACTCAAGAGAGCGTATGAAGAGGA



TCGAGGAGGGCATCAAGGAGCTGGGTAGCCAGATCCTTAAAGAGCACCCAGTTGA



GAATACCCAGCTGCAGAATGAGAAACTTTATCTCTATTATCTCCAGAACGGAAGG



GATATGTATGTCGACCAGGAACTGGACATCAATCGGCTGAGTGATTATGACGTCG



ACCACATTGTGCCTCAAAGCTTTCTGAAGGATGATTCCATCGACAATAAAGTTCT



GACCCGGTCTGATAAAAATAGAGGCAAATCCGACAACGTACCTAGCGAAGAAGTC



GTCAAAAAAATGAAGAACTATTGGAGGCAGTTGCTGAATGCCAAGCTGATTACAC



AACGCAAGTTTGACAATCTCACCAAGGCAGAAAGGGGGGGCCTGTCAGAACTCGA



CAAAGCAGGTTTCATTAAAAGGCAGCTAGTTGAAACTAGGCAGATTACTAAGCAC



GTGGCCCAGATCCTCGACTCACGGATGAATACAAAGTATGATGAGAATGATAAGC



TAATCCGGGAGGTGAAGGTGATTACTCTGAAATCTAAGCTGGTGTCAGATTTCAG



AAAAGACTTCCAGTTCTACAAAGTCAGAGAGATCAACAATTATCACCATGCCCAC



GATGCATATCTTAATGCAGTAGTGGGGACAGCTCTGATCAAAAAATATCCTAAAC



TGGAGTCTGAATTCGTTTATGGTGACTATAAAGTCTATGACGTCAGAAAAATGAT



CGCAAAGAGCGAGCAGGAGATAGGGAAGGCCACAGCAAAGTACTTCTTTTACAGT



AATATCATGAACTTTTTCAAAACTGAGATTACATTGGCTAACGGCGAGATCCGCA



AGCGGCCACTGATAGAGACTAACGGAGAGACAGGGGAGATTGTTTGGGATAAGGG



CCGTGACTTCGCCACCGTTAGGAAAGTGCTGTCCATGCCCCAGGTGAACATTGTG



AAGAAGACAGAAGTGCAGACGGGTGGGTTCTCAAAAGAGTCTATTCTGCCTAAGC



GGAATAGTGACAAACTGATCGCACGTAAAAAGGACTGGGATCCAAAAAAGTACGG



CGGATTCGACAGTCCTACCGTTGCATATTCCGTGCTTGTGGTCGCTAAGGTGGAG



AAGGGAAAAAGCAAGAAACTGAAGTCAGTCAAAGAACTACTGGGCATAACGATCA



TGGAGCGCTCCAGTTTCGAAAAAAACCCAATCGATTTTCTTGAAGCCAAGGGATA



CAAGGAGGTAAAGAAAGACCTTATCATTAAGCTGCCTAAGTACAGTCTGTTCGAA



CTGGAGAATGGGAGGAAGCGCATGCTGGCATCAGCTGGAGAACTCCAAAAAGGGA



ACGAGTTGGCCCTCCCCTCAAAGTATGTCAATTTTCTCTACCTGGCTTCTCACTA



CGAGAAGTTAAAGGGGTCTCCAGAGGATAATGAGCAGAAACAGCTGTTTGTGGAA



CAGCACAAGCACTATTTGGACGAAATCATCGAACAAATTTCCGAGTTCAGTAAGA



GGGTGATTCTGGCCGACGCAAACCTTGACAAAGTTCTGTCCGCATACAATAAGCA



CAGAGACAAACCAATCCGCGAGCAAGCCGAGAATATAATTCACCTTTTCACTCTG



ACTAATCTGGGGGCCCCCGCAGCATTTAAATATTTCGATACAACAATCGACCGGA



AGCGGTATACATCTACTAAGGAAGTCCTCGATGCGACACTGATCCACCAGTCAAT



TACAGGTTTATATGAAACAAGAATCGACCTGTCCCAGCTGGGCGGCGACTAG





SEQ ID NO: 171
AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAA



GCATTGATAATTGAGATCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGA



CAAAAATAAATTATTTATTTATCCAGAAAATGAATTGGAAAATCAGGAGAGCGTT



TTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAGGTtgccgtcactgcgtc



ttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattc



tgtaacaaaggggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataat



cacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgcca



tagcatttttatccataagattagcggatcctacctgacgctttttatcgcaact



ctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtga



gataggcggagatacgaactttaagAAGGAGatataccATGGAACAGGAATATTA



TCTGGGCTTGGACATGGGCACCGGTTCCGTCGGCTGGGCTGTTACTGACAGTGAA



TATCACGTTCTAAGAAAGCATGGTAAGGCATTGTGGGGTGTAAGACTTTTCGAAT



CTGCTTCCACTGCTGAAGAGCGTAGAATGTTTAGAACGAGTCGACGTAGGCTAGA



CAGGCGCAATTGGAGAATCGAAATTTTACAAGAAATTTTTGCGGAAGAGATATCT



AAGAAAGACCCAGGCTTTTTCCTGAGAATGAAGGAATCTAAGTATTACCCTGAGG



ATAAAAGAGATATAAATGGTAACTGTCCCGAATTGCCTTACGCATTATTTGTGGA



CGATGATTTTACCGATAAGGATTACCATAAAAAGTTCCCAACTATCTACCATTTA



CGCAAAATGTTAATGAATACAGAGGAAACCCCAGACATAAGACTAGTTTATCTGG



CAATACACCATATGATGAAACATAGAGGCCATTTCTTACTTTCCGGGGATATCAA



CGAAATCAAAGAGTTTGGTACCACATTTAGTAAGTTACTGGAAAACATAAAGAAT



GAAGAATTGGATTGGAACTTAGAACTCGGAAAAGAAGAATACGCGGTTGTCGAAT



CTATCCTGAAGGATAATATGCTGAATAGGTCGACCAAAAAAACTAGGCTGATCAA



AGCACTGAAAGCCAAATCTATCTGCGAAAAAGCTGTTTTAAATTTACTTGCTGGT



GGCACTGTTAAGTTATCAGACATTTTTGGTTTGGAAGAATTGAACGAAACCGAGC



GTCCAAAAATTAGTTTCGCTGATAATGGCTACGATGATTACATTGGTGAGGTGGA



AAACGAGTTGGGCGAACAATTTTATATTATAGAGACAGCTAAGGCAGTCTATGAC



TGGGCTGTTTTAGTAGAAATCCTTGGTAAATACACATCTATCTCCGAAGCGAAAG



TTGCTACTTACGAAAAGCACAAGTCCGATCTCCAGTTTTTGAAGAAAATTGTCAG



GAAATATCTGACTAAGGAAGAATATAAAGATATTTTCGTTAGTACCTCTGACAAA



CTGAAAAATTACTCCGCTTACATCGGGATGACCAAGATTAATGGCAAAAAAGTTG



ATCTGCAAAGCAAAAGGTGTTCGAAGGAAGAATTTTATGATTTCATTAAAAAGAA



TGTCTTAAAAAAATTAGAAGGTCAGCCAGAATACGAATATTTGAAAGAAGAACTG



GAAAGAGAGACATTCTTACCAAAACAAGTCAACAGAGATAATGGGGTAATTCCAT



ATCAAATTCACCTCTACGAATTAAAAAAAATTTTAGGCAATTTACGCGATAAAAT



TGACCTTATCAAAGAAAATGAGGATAAGCTGGTTCAACTCTTTGAATTCAGAATA



CCCTATTATGTGGGCCCACTGAACAAGATTGATGACGGCAAAGAAGGTAAATTCA



CATGGGCCGTCCGCAAATCCAATGAAAAAATTTACCCATGGAACTTTGAAAATGT



AGTAGATATTGAAGCGTCTGCGGAGAAATTTATTCGAAGAATGACTAATAAATGC



ACTTACTTGATGGGAGAGGATGTTCTGCCTAAAGACAGCTTATTATACAGCAAGT



ACATGGTTCTAAACGAACTTAACAACGTTAAGTTGGACGGTGAGAAATTAAGTGT



AGAATTGAAACAAAGATTGTATACTGACGTCTTCTGCAAGTACAGAAAAGTGACA



GTTAAAAAAATTAAGAATTACTTGAAGTGCGAAGGTATAATTTCTGGAAACGTAG



AGATTACTGGTATTGATGGTGATTTCAAAGCATCCCTAACAGCTTACCACGATTT



CAAGGAAATCCTGACAGGAACTGAACTCGCAAAAAAAGATAAAGAAAACATTATT



ACTAATATTGTTCTTTTCGGTGATGACAAGAAATTGTTGAAGAAAAGACTGAATA



GACTTTACCCCCAGATTACTCCCAATCAACTTAAGAAAATTTGTGCTTTGTCTTA



CACAGGATGGGGTCGTTTTTCAAAAAAGTTCTTAGAAGAGATTACCGCACCTGAT



CCAGAAACAGGCGAAGTATGGAATATAATTACCGCCTTATGGGAATCGAACAATA



ATCTTATGCAACTTCTGAGCAATGAATATCGTTTCATGGAAGAAGTTGAGACTTA



CAACATGGGCAAACAGACGAAGACTTTATCCTATGAAACTGTGGAAAATATGTAT



GTATCACCTTCTGTCAAGAGACAAATTTGGCAAACCTTAAAAATTGTCAAAGAAT



TAGAAAAGGTAATGAAGGAGTCTCCTAAACGTGTGTTTATTGAAATGGCTAGAGA



AAAACAAGAGTCAAAAAGAACCGAGTCAAGAAAGAAGCAGTTAATCGATTTATAT



AAGGCTTGTAAAAACGAAGAGAAAGATTGGGTTAAAGAATTGGGGGACCAAGAGG



AACAAAAACTACGGTCGGATAAGTTGTATTTATACTATACGCAAAAGGGACGATG



TATGTATTCCGGCGAGGTAATAGAATTGAAGGATTTATGGGACAATACAAAATAT



GACATAGACCATATATATCCCCAATCAAAAACGATGGACGATAGCTTGAACAATA



GAGTACTCGTGAAAAAAAAATATAATGCGACCAAATCTGATAAGTATCCTCTGAA



TGAAAATATCAGACATGAAAGAAAGGGGTTCTGGAAGTCCTTGTTAGATGGTGGG



TTTATAAGCAAAGAAAAGTACGAGCGTCTAATAAGAAACACGGAGTTATCGCCAG



AAGAACTCGCTGGTTTTATTGAGAGGCAAATCGTGGAAACGAGACAATCTACCAA



AGCCGTTGCTGAGATCCTAAAGCAAGTTTTCCCAGAGTCGGAGATTGTCTATGTC



AAAGCTGGCACAGTGAGCAGGTTTAGGAAAGACTTCGAACTATTAAAGGTAAGAG



AAGTGAACGATTTACATCACGCAAAGGACGCTTACCTAAATATCGTTGTAGGTAA



CTCATATTATGTTAAATTTACCAAGAACGCCTCTTGGTTTATAAAGGAGAACCCA



GGTAGAACATATAACCTGAAAAAGATGTTCACCTCTGGTTGGAATATTGAGAGAA



ACGGAGAAGTCGCATGGGAAGTTGGTAAGAAAGGGACTATAGTGACAGTAAAGCA



AATTATGAACAAAAATAATATCCTCGTTACAAGGCAGGTTCATGAAGCAAAGGGC



GGCCTTTTTGACCAACAAATTATGAAGAAAGGGAAAGGTCAAATTGCAATAAAAG



AAACCGATGAGAGACTAGCGTCAATAGAAAAGTATGGTGGCTATAATAAAGCTGC



GGGTGCATACTTTATGCTTGTTGAATCAAAAGACAAGAAAGGTAAGACTATTAGA



ACTATAGAATTTATACCCCTGTACCTTAAAAACAAAATTGAATCGGATGAGTCAA



TCGCGTTAAATTTTCTAGAGAAAGGAAGGGGTTTAAAAGAACCAAAGATCCTGTT



AAAAAAGATTAAGATTGACACCTTGTTCGATGTAGATGGATTTAAAATGTGGTTA



TCTGGCAGAACAGGCGATAGACTTTTGTTTAAGTGCGCTAATCAATTAATTTTGG



ATGAGAAAATCATTGTCACAATGAAAAAAATAGTTAAGTTTATTCAGAGAAGACA



AGAAAACAGGGAGTTGAAATTATCTGATAAAGATGGTATCGACAATGAAGTTTTA



ATGGAAATCTACAATACATTCGTTGATAAACTTGAAAATACCGTATATCGAATCA



GGTTAAGTGAACAAGCCAAAACATTAATTGATAAACAAAAAGAATTTGAAAGGCT



ATCACTGGAAGACAAATCCTCCACCCTATTTGAAATTTTGCATATATTCCAGTGC



CAATCTTCAGCAGCTAATTTAAAAATGATTGGCGGACCTGGGAAAGCCGGCATCC



TAGTGATGAACAATAATATCTCCAAGTGTAACAAAATATCAATTATTAACCAATC



TCCGACAGGTATTTTTGAAAATGAAATAGACTTGCTTAAGATATAAGAAATCATC



CTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATTTATTATATCGCGTTGATTA



TTGATGCTGTTTTTAGTTTTAACGGCAATTAATATATGTGTTATTAATTGAATGA



ATTTTATCATTCATAATAAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACT



CAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACAGAATTATCTCATAACAAGT



GTTAAGGGATGTTATTTCC





SEQ ID NO: 172
AATTCAAAGGATAATCAAAC






SEQ ID NO: 173
AATCTCTACTCTTTGTAGAT






SEQ ID NO: 174
AATTTCTACTGTTGTAGAT






SEQ ID NO: 175
AATTTCTACTAGTGTAGAT






SEQ ID NO: 176
AATTTCTACTATTGT






SEQ ID NO: 177
AATTTCTACTGTTGTAGA






SEQ ID NO: 178
AATTTCTACTATTGTA






SEQ ID NO: 179
AATTTCTACTTTTGTAGAT






SEQ ID NO: 180
AATTTCTACTGTTGTAGAT






SEQ ID NO: 181
AATTTCTACTCTTGTAGAT








Claims
  • 1.-20. (canceled)
  • 21. A nucleic acid-guided nuclease system comprising: (a) (i) a nucleic acid-guided nuclease comprising an amino acid sequence having at least 90% sequence identity to the amino acid sequence of SEQ ID NO: 4, or (ii) a nucleic acid molecule encoding the nucleic acid-guided nuclease of (i); and (b) (i) an engineered guide nucleic acid capable of complexing with the nucleic acid-guided nuclease, or (ii) a nucleic acid molecule encoding the engineered guide nucleic acid of (i), wherein the engineered guide nucleic acid recognizes a target region in a genome of a cell and a protospacer adjacent motif (PAM) sequence of TTTN.
  • 22. The nucleic acid-guided nuclease system of claim 21, wherein the nucleic acid-guided nuclease is encoded by a nucleic acid molecule having at least 80% sequence identity to the nucleotide sequences of SEQ ID NO: 44 or SEQ ID NO: 24.
  • 23. The nucleic acid-guided nuclease system of claim 21, wherein the nucleic acid molecule encoding the nucleic acid-guided nuclease is codon optimized for Escherichia coli.
  • 24. The nucleic acid-guided nuclease system of claim 21, wherein the nucleic acid molecule encoding the nucleic acid-guided nuclease is codon optimized for Saccharomyces cerevisiae.
  • 25. The nucleic acid-guided nuclease system of claim 21, wherein the nucleic acid molecule encoding the nucleic acid-guided nuclease is codon optimized for mammalian cells.
  • 26. The nucleic acid-guided nuclease system of claim 21, wherein the target region is within a coding region of a protein.
  • 27. The nucleic acid-guided nuclease system of claim 21, wherein the target region is within a non-coding region of a protein.
  • 28. The nucleic acid-guided nuclease system of claim 21, wherein the nucleic acid-guided nuclease comprises an amino acid sequence having at least 95% sequence identity to the amino acid sequence of SEQ ID NO: 4.
  • 29. The nucleic acid-guided nuclease system of claim 21, wherein the nucleic acid-guided nuclease comprises the amino acid sequence of SEQ ID NO: 4.
  • 30. A method of modifying a target region in the genome of a cell, the method comprising: (a) contacting a cell with the nucleic acid-guided nuclease system of claim 21; and (b) allowing the nucleic acid-guided nuclease system to create a genome edit in a target region of the genome of the cell.
  • 31. The method of claim 30, wherein the method results in cell death.
  • 32. The method of claim 30, wherein the nucleic acid-guided nuclease is encoded by a nucleic acid molecule having at least 80% sequence identity to the nucleotide sequences of SEQ ID NO: 44 or SEQ ID NO: 24.
  • 33. The method of claim 30, wherein the nucleic acid-guided nuclease is encoded by a nucleic acid molecule having at least 85% sequence identity to the nucleotide sequences of SEQ ID NO: 44 or SEQ ID NO: 24.
  • 34. The method of claim 30, wherein the nucleic acid molecule encoding the nucleic acid-guided nuclease is codon optimized for bacteria.
  • 35. The method of claim 30, wherein the nucleic acid molecule encoding the nucleic acid-guided nuclease is codon optimized for mammalian cells.
  • 36. The method of claim 30, wherein the nucleic acid-guided nuclease comprises an amino acid sequence having at least 95% sequence identity to the amino acid sequence of SEQ ID NO: 4.
  • 37. The method of claim 30, wherein the nucleic acid-guided nuclease comprises the amino acid sequence of SEQ ID NO: 4.
  • 38. The method of claim 30, wherein the target region is within a bacterial cell.
  • 39. The method of claim 30, wherein the target region is within a plant cell.
  • 40. The method of claim 30, wherein the target region is within a mammalian cell.
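
For illustration only, the degenerate PAM sequence TTTN recited in claim 21 can be read with N as the IUPAC code for any of A, C, G, or T. The following is a minimal sketch, not part of the claimed subject matter, of how candidate TTTN sites might be located on one strand of a DNA string. The function name `find_tttn_pam_sites` and the example sequence are hypothetical, and the sketch does not model the position of the target region relative to the PAM or the opposite strand.

```python
import re
from typing import List, Tuple

# N is treated as the IUPAC code for any of A, C, G, or T.
# A lookahead is used so that overlapping sites (e.g. within "TTTTA") are all reported.
PAM_PATTERN = re.compile(r"(?=(TTT[ACGT]))")


def find_tttn_pam_sites(sequence: str) -> List[Tuple[int, str]]:
    """Return (0-based start, matched 4-mer) for each TTTN site on the given strand."""
    seq = sequence.upper()
    return [(m.start(), m.group(1)) for m in PAM_PATTERN.finditer(seq)]


# Hypothetical input, not taken from the sequence listing.
example = "ACGTTTGACCTTTTAG"
sites = find_tttn_pam_sites(example)
```

A fuller scan would typically also search the reverse complement, since a PAM may lie on either strand of the target.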
RELATED APPLICATIONS

This application is a Continuation of patent application U.S. Ser. No. 17/554,736, entitled “Nucleic Acid-Guided Nucleases” filed Dec. 17, 2021, now allowed; which is a Continuation of patent application U.S. Ser. No. 17/387,860, entitled “Nucleic Acid-guided Nucleases” filed Jul. 28, 2021, now U.S. Pat. No. 11,220,697; which is a Continuation of patent application U.S. Ser. No. 17/179,193, entitled “Nucleic Acid-Guided Nucleases” filed Feb. 18, 2021, now U.S. Pat. No. 11,130,970; which is a Continuation of patent application U.S. Ser. No. 16/819,896, entitled “Nucleic Acid-Guided Nucleases” filed Mar. 16, 2020; which is a Continuation of patent application U.S. Ser. No. 16/548,631, entitled “Nucleic Acid-Guided Nucleases” filed Aug. 22, 2019, now U.S. Pat. No. 10,626,416; which is a Continuation of patent application U.S. Ser. No. 15/896,433, entitled “Nucleic Acid-Guided Nucleases” filed Feb. 14, 2018, now U.S. Pat. No. 10,435,714; which is a Continuation of patent application U.S. Ser. No. 15/631,989, entitled “Nucleic Acid-Guided Nucleases” filed Jun. 23, 2017, now U.S. Pat. No. 10,011,849.

Continuations (8)
Relation Number Date Country
Parent 17692069 Mar 2022 US
Child 18945973 US
Parent 17554736 Dec 2021 US
Child 17692069 US
Parent 17387860 Jul 2021 US
Child 17554736 US
Parent 17179193 Feb 2021 US
Child 17387860 US
Parent 16819896 Mar 2020 US
Child 17179193 US
Parent 16548631 Aug 2019 US
Child 16819896 US
Parent 15896433 Feb 2018 US
Child 16548631 US
Parent 15631989 Jun 2017 US
Child 15896433 US