Nucleic acid-guided nucleases

Information

  • Patent Grant
  • 11220697
  • Patent Number
    11,220,697
  • Date Filed
    Wednesday, July 28, 2021
  • Date Issued
    Tuesday, January 11, 2022
Abstract
Disclosed herein are nucleic acid-guided nucleases, guide nucleic acids, and targetable nuclease systems, and methods of use. Disclosed herein are engineered non-naturally occurring nucleic acid-guided nucleases, guide nucleic acids, and targetable nuclease systems, and methods of use. Targetable nuclease systems can be used to edit genetic targets, including recursive genetic engineering and trackable genetic engineering methods.
Description
INCORPORATION BY REFERENCE

A revised Sequence Listing is being submitted on Sep. 1, 2021 as an electronically filed sequence listing via EFS-Web as an ASCII formatted sequence listing, entitled “INSC104US6_seqlist_20210825”, created Aug. 25, 2021, and 791,000 bytes in size. The sequence listing is part of the specification filed herewith and is incorporated by reference in its entirety.


BACKGROUND OF THE DISCLOSURE

Nucleic acid-guided nucleases have become important tools for research and genome engineering. The applicability of these tools can be limited by the sequence specificity requirements, expression, or delivery issues.


SEQUENCE LISTING

This application contains a sequence list in Table 6.


SUMMARY OF THE DISCLOSURE

Disclosed herein are methods of modifying a target region in the genome of a cell, the method comprising: (a) contacting a cell with: a non-naturally occurring nucleic acid-guided nuclease encoded by a nucleic acid having at least 80% identity to SEQ ID NO: 22; an engineered guide nucleic acid capable of complexing with the nucleic acid-guided nuclease; and an editing sequence encoding a nucleic acid complementary to said target region having a change in sequence relative to the target region; and (b) allowing the nuclease, guide nucleic acid, and editing sequence to create a genome edit in a target region of the genome of the cell. In some aspects, the engineered guide nucleic acid and the editing sequence are provided as a single nucleic acid. In some aspects, the single nucleic acid further comprises a mutation in a protospacer adjacent motif (PAM) site. In some aspects, the nucleic acid-guided nuclease is encoded by a nucleic acid with at least 85% identity to SEQ ID NO: 42. In some aspects, the nucleic acid-guided nuclease is encoded by a nucleic acid with at least 85% identity to SEQ ID NO: 128.


Disclosed herein are nucleic acid-guided nuclease systems comprising: (a) a non-naturally occurring nuclease encoded by a nucleic acid having at least 80% identity to SEQ ID NO: 22; (b) an engineered guide nucleic acid capable of complexing with the nucleic acid-guided nuclease; and (c) an editing sequence having a change in sequence relative to the sequence of a target region in a genome of a cell; wherein the system results in a genome edit in the target region in the genome of the cell facilitated by the nuclease, the engineered guide nucleic acid, and the editing sequence. In some aspects, the nucleic acid-guided nuclease is encoded by a nucleic acid with at least 85% identity to SEQ ID NO: 42. In some aspects, the nucleic acid-guided nuclease is encoded by a nucleic acid with at least 85% identity to SEQ ID NO: 128. In some aspects, the nucleic acid-guided nuclease is codon optimized for the cell to be edited. In some aspects, the engineered guide nucleic acid and the editing sequence are provided as a single nucleic acid. In some aspects, the single nucleic acid further comprises a mutation in a protospacer adjacent motif (PAM) site.


Disclosed herein are compositions for use in genome editing comprising a non-naturally occurring nuclease encoded by a nucleic acid having at least 75% identity to SEQ ID NO: 22. In some aspects, the nucleic acid has at least 80% identity to SEQ ID NO: 22. In some aspects, the nucleic acid has at least 90% identity to SEQ ID NO: 22. In some aspects, the nuclease is further codon optimized for use in cells from a particular organism. In some aspects, the nuclease is codon optimized for E. coli. In some aspects, the nuclease is codon optimized for S. cerevisiae. In some aspects, the nuclease is codon optimized for mammalian cells. In some aspects, the nucleic acid-guided nuclease has less than 40% protein identity to SEQ ID NO: 12. In some aspects, the nucleic acid-guided nuclease has less than 40% protein identity to SEQ ID NO: 108.


INCORPORATION BY REFERENCE

All publications and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication or patent application was specifically and individually indicated to be incorporated by reference.





BRIEF DESCRIPTION OF THE DRAWINGS

This patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.



FIG. 1A depicts a partial sequence alignment of MAD1-8 (SEQ ID NO: 1-8) and MAD10-12 (SEQ ID NO: 10-12).



FIG. 1B depicts a phylogenetic tree of nucleases including MAD1-8.



FIG. 2 depicts an example protein expression construct.



FIG. 3 depicts an example editing cassette.



FIG. 4 depicts an example screening or selection experiment workflow.



FIG. 5A depicts an example protein expression construct.



FIG. 5B depicts an example editing cassette.



FIG. 5C depicts an example screening or selection experiment workflow.



FIG. 6A depicts an example protein expression construct.



FIG. 6B depicts an example editing cassette.



FIG. 6C depicts an example screening or selection experiment workflow.



FIGS. 7A-7B depict example data from a functional nuclease complex screening or selection experiment.



FIG. 8 depicts example data from a targetable nuclease complex-based editing experiment.



FIG. 9 depicts example data from a targetable nuclease complex-based editing experiment.



FIGS. 10A-10C depict example data from a targetable nuclease complex-based editing experiment.



FIG. 11 depicts an example sequence alignment of select sequences from an editing experiment.



FIG. 12 depicts example data from a targetable nuclease complex-based editing experiment.



FIG. 13A depicts an example alignment of scaffold sequences.



FIG. 13B depicts an example model of a nucleic acid-guided nuclease complexed with a guide nucleic acid and a target sequence.



FIGS. 14A-14B depict example data from a primer validation experiment.



FIG. 15 depicts example data from a targetable nuclease complex-based editing experiment.



FIG. 16 depicts example validation data comparing results from two different assays.



FIGS. 17A-17C depict an example trackable genetic engineering workflow, including a plasmid comprising an editing cassette and a recording cassette, and downstream sequencing of barcodes in order to identify the incorporated edit or mutation.



FIG. 18 depicts an example trackable genetic engineering workflow, including iterative rounds of engineering with a different editing cassette and recorder cassette with unique barcode (BC) at each round, which can be followed by selection and tracking to confirm the successful engineering step at each round.



FIG. 19 depicts an example recursive engineering workflow.





DETAILED DESCRIPTION OF THE DISCLOSURE

The present disclosure provides nucleic acid-guided nucleases and methods of use. Often, the subject nucleic acid-guided nucleases are part of a targetable nuclease system comprising a nucleic acid-guided nuclease and a guide nucleic acid. A subject targetable nuclease system can be used to cleave, modify, and/or edit a target polynucleotide sequence, often referred to as a target sequence. A subject targetable nuclease system refers collectively to transcripts and other elements involved in the expression of or directing the activity of genes, which may include sequences encoding a subject nucleic acid-guided nuclease protein and a guide nucleic acid as disclosed herein.


Methods, systems, vectors, polynucleotides, and compositions described herein may be used in various applications including altering or modifying synthesis of a gene product, such as a protein, polynucleotide cleavage, polynucleotide editing, polynucleotide splicing; trafficking of target polynucleotide, tracing of target polynucleotide, isolation of target polynucleotide, visualization of target polynucleotide, etc. Aspects of the invention also encompass methods and uses of the compositions and systems described herein in genome engineering, e.g. for altering or manipulating the expression of one or more genes or the one or more gene products, in prokaryotic, archaeal, or eukaryotic cells, in vitro, in vivo or ex vivo.


Nucleic Acid-Guided Nucleases


Bacterial and archaeal targetable nuclease systems have emerged as powerful tools for precision genome editing. However, naturally occurring nucleases have some limitations, including expression and delivery challenges due to the nucleic acid sequence and protein size. Targetable nucleases that require PAM recognition are also limited in the sequences they can target throughout a genetic sequence. Other challenges include processivity, target recognition specificity and efficiency, and nuclease activity efficiency, which often affect genetic editing efficiency.


Non-naturally occurring targetable nucleases and non-naturally occurring targetable nuclease systems can address many of these challenges and limitations.


Disclosed herein are non-naturally occurring targetable nuclease systems. Such targetable nuclease systems are engineered to address one or more of the challenges described above and can be referred to as engineered nuclease systems. Engineered nuclease systems can comprise one or more of an engineered nuclease, such as an engineered nucleic acid-guided nuclease, an engineered guide nucleic acid, an engineered polynucleotide encoding said nuclease, or an engineered polynucleotide encoding said guide nucleic acid. Engineered nucleases, engineered guide nucleic acids, and engineered polynucleotides encoding the engineered nuclease or engineered guide nucleic acid are not naturally occurring and are not found in nature. It follows that engineered nuclease systems including one or more of these elements are non-naturally occurring.


Non-limiting examples of types of engineering that can be done to obtain a non-naturally occurring nuclease system are as follows. Engineering can include codon optimization to facilitate expression or improve expression in a host cell, such as a heterologous host cell. Engineering can reduce the size or molecular weight of the nuclease in order to facilitate expression or delivery. Engineering can alter PAM selection in order to change PAM specificity or to broaden the range of recognized PAMs. Engineering can alter, increase, or decrease stability, processivity, specificity, or efficiency of a targetable nuclease system. Engineering can alter, increase, or decrease protein stability. Engineering can alter, increase, or decrease processivity of nucleic acid scanning. Engineering can alter, increase, or decrease target sequence specificity. Engineering can alter, increase, or decrease nuclease activity. Engineering can alter, increase, or decrease editing efficiency. Engineering can alter, increase, or decrease transformation efficiency. Engineering can alter, increase, or decrease nuclease or guide nucleic acid expression.


Examples of non-naturally occurring nucleic acid sequences which are disclosed herein include sequences codon optimized for expression in bacteria, such as E. coli (e.g., SEQ ID NO: 41-60), sequences codon optimized for expression in single cell eukaryotes, such as yeast (e.g., SEQ ID NO: 127-146), sequences codon optimized for expression in multi cell eukaryotes, such as human cells (e.g., SEQ ID NO: 147-166), polynucleotides used for cloning or expression of any sequences disclosed herein (e.g., SEQ ID NO: 61-80), plasmids comprising nucleic acid sequences (e.g., SEQ ID NO: 21-40) operably linked to a heterologous promoter or nuclear localization signal or other heterologous element, proteins generated from engineered or codon optimized nucleic acid sequences (e.g., SEQ ID NO: 1-20), or engineered guide nucleic acids comprising any one of SEQ ID NO: 84-107. Such non-naturally occurring nucleic acid sequences can be amplified, cloned, assembled, synthesized, generated from synthesized oligonucleotides or dNTPs, or otherwise obtained using methods known by those skilled in the art.


Disclosed herein are nucleic acid-guided nucleases. Subject nucleases are functional in vitro, or in prokaryotic, archaeal, or eukaryotic cells for in vitro, in vivo, or ex vivo applications. Suitable nucleic acid-guided nucleases can be from an organism from a genus which includes but is not limited to Thiomicrospira, Succinivibrio, Candidatus, Porphyromonas, Acidaminococcus, Acidomonococcus, Prevotella, Smithella, Moraxella, Synergistes, Francisella, Leptospira, Catenibacterium, Kandleria, Clostridium, Dorea, Coprococcus, Enterococcus, Fructobacillus, Weissella, Pediococcus, Corynebacter, Sutterella, Legionella, Treponema, Roseburia, Filifactor, Eubacterium, Streptococcus, Lactobacillus, Mycoplasma, Bacteroides, Flaviivola, Flavobacterium, Sphaerochaeta, Azospirillum, Gluconacetobacter, Neisseria, Roseburia, Parvibaculum, Staphylococcus, Nitratifractor, Mycoplasma, Alicyclobacillus, Brevibacillus, Bacillus, Bacteroidetes, Brevibacillus, Carnobacterium, Clostridiaridium, Clostridium, Desulfonatronum, Desulfovibrio, Helcococcus, Leptotrichia, Listeria, Methanomethylophilus, Methylobacterium, Opitutaceae, Paludibacter, Rhodobacter, Sphaerochaeta, Tuberibacillus, Oleiphilus, Omnitrophica, Parcubacteria, and Campylobacter. Species of organism of such a genus can be as otherwise herein discussed. Suitable nucleic acid-guided nucleases can be from an organism from a genus or unclassified genus within a kingdom which includes but is not limited to Firmicutes, Actinobacteria, Bacteroidetes, Proteobacteria, Spirochaetes, and Tenericutes. Suitable nucleic acid-guided nucleases can be from an organism from a genus or unclassified genus within a phylum which includes but is not limited to Erysipelotrichia, Clostridia, Bacilli, Actinobacteria, Bacteroidetes, Flavobacteria, Alphaproteobacteria, Betaproteobacteria, Gammaproteobacteria, Deltaproteobacteria, Epsilonproteobacteria, Spirochaetes, and Mollicutes.
Suitable nucleic acid-guided nucleases can be from an organism from a genus or unclassified genus within an order which includes but is not limited to Clostridiales, Lactobacillales, Actinomycetales, Bacteroidales, Flavobacteriales, Rhizobiales, Rhodospirillales, Burkholderiales, Neisseriales, Legionellales, Nautiliales, Campylobacterales, Spirochaetales, Mycoplasmatales, and Thiotrichales. Suitable nucleic acid-guided nucleases can be from an organism from a genus or unclassified genus within a family which includes but is not limited to Lachnospiraceae, Enterococcaceae, Leuconostocaceae, Lactobacillaceae, Streptococcaceae, Peptostreptococcaceae, Staphylococcaceae, Eubacteriaceae, Corynebacterineae, Bacteroidaceae, Flavobacterium, Cryomorphaceae, Rhodobiaceae, Rhodospirillaceae, Acetobacteraceae, Sutterellaceae, Neisseriaceae, Legionellaceae, Nautiliaceae, Campylobacteraceae, Spirochaetaceae, Mycoplasmataceae, Piscirickettsiaceae, and Francisellaceae. Other nucleic acid-guided nucleases have been described in US Patent Application Publication No. US20160208243 filed Dec. 18, 2015, US Application Publication No. US20140068797 filed Mar. 15, 2013, U.S. Pat. No. 8,697,359 filed Oct. 15, 2013, and Zetsche et al., Cell 2015 Oct. 22; 163(3):759-71, each of which is incorporated herein by reference in its entirety.


Some nucleic acid-guided nucleases suitable for use in the methods, systems, and compositions of the present disclosure include those derived from an organism such as, but not limited to, Thiomicrospira sp. XS5, Eubacterium rectale, Succinivibrio dextrinosolvens, Candidatus Methanoplasma termitum, Candidatus Methanomethylophilus alvus, Porphyromonas crevioricanis, Flavobacterium branchiophilum, Acidaminococcus sp., Acidomonococcus sp., Lachnospiraceae bacterium COE1, Prevotella brevis ATCC 19188, Smithella sp. SCADC, Moraxella bovoculi, Synergistes jonesii, Bacteroidetes oral taxon 274, Francisella tularensis, Leptospira inadai serovar Lyme str. 10, Acidomonococcus sp. crystal structure (5B43), S. mutans, S. agalactiae, S. equisimilis, S. sanguinis, S. pneumoniae; C. jejuni, C. coli; N. salsuginis, N. tergarcus; S. auricularis, S. carnosus; N. meningitidis, N. gonorrhoeae; L. monocytogenes, L. ivanovii; C. botulinum, C. difficile, C. tetani, C. sordellii; Francisella tularensis 1, Prevotella albensis, Lachnospiraceae bacterium MC2017 1, Butyrivibrio proteoclasticus, Butyrivibrio proteoclasticus B316, Peregrinibacteria bacterium GW2011_GWA2_33_10, Parcubacteria bacterium GW2011_GWC2_44_17, Smithella sp. SCADC, Acidaminococcus sp. BV3L6, Lachnospiraceae bacterium MA2020, Candidatus Methanoplasma termitum, Eubacterium eligens, Moraxella bovoculi 237, Leptospira inadai, Lachnospiraceae bacterium ND2006, Porphyromonas crevioricanis 3, Prevotella disiens, Porphyromonas macacae, Catenibacterium sp. CAG:290, Kandleria vitulina, Clostridiales bacterium KA00274, Lachnospiraceae bacterium 3-2, Dorea longicatena, Coprococcus catus GD/7, Enterococcus columbae DSM 7374, Fructobacillus sp. EFB-N1, Weissella halotolerans, Pediococcus acidilactici, Lactobacillus curvatus, Streptococcus pyogenes, Lactobacillus versmoldensis, Filifactor alocis ATCC 35896, Alicyclobacillus acidoterrestris, Alicyclobacillus acidoterrestris ATCC 49025, Desulfovibrio inopinatus, Desulfovibrio inopinatus DSM 10711, Oleiphilus sp., Oleiphilus sp. HI0009, Candidatus Kefeldibacteria, Parcubacteria CasY.4, Omnitrophica WOR 2 bacterium GWF2, Bacillus sp. NSP2.1, and Bacillus thermoamylovorans.


In some instances, a nucleic acid-guided nuclease disclosed herein comprises an amino acid sequence comprising at least 50% amino acid identity to any one of SEQ ID NO: 1-20. In some instances, a nuclease comprises an amino acid sequence comprising at least about 10%, 20%, 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, or 100% amino acid identity to any one of SEQ ID NO: 1-20. In some cases, the nucleic acid-guided nuclease comprises at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, amino acid identity to any one of SEQ ID NO: 1-20. In some cases, the nucleic acid-guided nuclease comprises at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, amino acid identity to any one of SEQ ID NO: 1-8 or 10-12. In some cases, the nucleic acid-guided nuclease comprises at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, amino acid identity to any one of SEQ ID NO: 1-8 or 10-11. In some cases, the nucleic acid-guided nuclease comprises at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, amino acid identity to SEQ ID NO: 2. In some cases, the nucleic acid-guided nuclease comprises at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, amino acid identity to SEQ ID NO: 7.


In some cases, the nucleic acid-guided nuclease comprises any one of SEQ ID NO: 1-20. In some cases, the nucleic acid-guided nuclease comprises any one of SEQ ID NO: 1-8 or 10-12. In some cases, the nucleic acid-guided nuclease comprises any one of SEQ ID NO: 1-8 or 10-11. In some cases, the nucleic acid-guided nuclease comprises SEQ ID NO: 2. In some cases, the nucleic acid-guided nuclease comprises SEQ ID NO: 7.


In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110. In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 50% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110. In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 45% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110. In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 40% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110. In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 35% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110. In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 30% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110.
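The identity thresholds recited above can be illustrated with a short calculation. The following is a minimal sketch only; the disclosure does not specify an alignment tool or an identity formula. It assumes the two sequences have already been aligned by some suitable algorithm, that gaps are written as '-', and that identity is counted over alignment columns (other denominators, such as the length of the shorter sequence, are also in common use). The function names are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over the columns of a pairwise alignment.

    Assumptions (not taken from the disclosure): inputs are pre-aligned,
    equal-length strings with '-' marking gaps, and the denominator is the
    number of columns in which at least one sequence has a residue.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    columns = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" and b == "-":
            continue  # column carries no information for this pair
        columns += 1
        if a == b:
            matches += 1  # identical residues; a gap never matches a residue
    if columns == 0:
        raise ValueError("alignment has no comparable columns")
    return 100.0 * matches / columns


def meets_at_least(identity: float, threshold: float) -> bool:
    """True if identity satisfies an 'at least X%' limitation."""
    return identity >= threshold


def meets_at_most(identity: float, threshold: float) -> bool:
    """True if identity satisfies an 'at most X%' limitation."""
    return identity <= threshold


if __name__ == "__main__":
    # Toy sequences, not taken from the Sequence Listing.
    identity = percent_identity("MKT-LLV", "MKTALIV")
    print(f"{identity:.1f}% identity")
    print("at least 50%:", meets_at_least(identity, 50.0))
    print("at most 40%:", meets_at_most(identity, 40.0))
```

In the toy alignment, five of the seven columns are identical, so the result is about 71.4%. A candidate would therefore satisfy an "at least 50%" limitation relative to that reference but not an "at most 40%" limitation.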


In some instances, a nucleic acid-guided nuclease disclosed herein is encoded by a nucleic acid sequence comprising at least 50% sequence identity to any one of SEQ ID NO: 21-40. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 10%, 20%, 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, or 100% sequence identity to any one of SEQ ID NO: 21-40. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, sequence identity to any one of SEQ ID NO: 21-40. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, sequence identity to any one of SEQ ID NO: 21-40. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, sequence identity to any one of SEQ ID NO: 21-28 or 30-32. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, sequence identity to any one of SEQ ID NO: 21-28 or 30-31. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, sequence identity to SEQ ID NO: 22. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, sequence identity to SEQ ID NO: 27.


In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 21-40. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 21-28 or 30-32. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 21-28 or 30-31. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 22. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 27.


In some instances, a nucleic acid-guided nuclease disclosed herein is encoded on a nucleic acid sequence. Such a nucleic acid can be codon optimized for expression in a desired host cell. Suitable host cells can include, as non-limiting examples, prokaryotic cells such as E. coli, P. aeruginosa, B. subtilis, and V. natriegens, and eukaryotic cells such as S. cerevisiae, plant cells, insect cells, nematode cells, amphibian cells, fish cells, or mammalian cells, including human cells.
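As an illustration of codon optimization in its simplest form, the sketch below back-translates a protein sequence using one chosen codon per amino acid. It is not the method used to generate the sequences disclosed herein. The table `PREFERRED_CODONS` is a hypothetical placeholder: each entry is a valid codon for its amino acid under the standard genetic code, but the choices are illustrative and are not measured usage frequencies for any host. Practical optimizers typically also weigh factors such as GC content, repeats, and mRNA structure.

```python
# Hypothetical host preference table: one codon per amino acid.
# Every codon is valid under the standard genetic code; the particular
# choices are placeholders, not measured codon-usage data.
PREFERRED_CODONS = {
    "A": "GCG", "R": "CGT", "N": "AAC", "D": "GAT", "C": "TGC",
    "Q": "CAG", "E": "GAA", "G": "GGC", "H": "CAT", "I": "ATT",
    "L": "CTG", "K": "AAA", "M": "ATG", "F": "TTT", "P": "CCG",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAT", "V": "GTG",
    "*": "TAA",  # stop
}


def back_translate(protein: str, table: dict = PREFERRED_CODONS) -> str:
    """Return a coding DNA sequence using the table's codon for each residue."""
    codons = []
    for residue in protein.upper():
        if residue not in table:
            raise ValueError(f"no codon available for residue {residue!r}")
        codons.append(table[residue])
    return "".join(codons)


if __name__ == "__main__":
    # Toy peptide, not taken from the Sequence Listing.
    print(back_translate("MKT*"))
```

Because different hosts favor different synonymous codons, the same protein can be encoded by nucleic acids that differ substantially in sequence, which is why separate nucleic acid sequences are provided for different host cell types.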


A nucleic acid sequence encoding a nucleic acid-guided nuclease can be codon optimized for expression in gram positive bacteria, e.g., Bacillus subtilis, or gram negative bacteria, e.g., E. coli. In some instances, a nucleic acid-guided nuclease disclosed herein is encoded by a nucleic acid sequence comprising at least 50% sequence identity to any one of SEQ ID NO: 41-60. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 10%, 20%, 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, or 100% sequence identity to any one of SEQ ID NO: 41-60. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 41-60. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 41-48 or 50-52. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 41-48 or 50-51. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to SEQ ID NO: 42. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to SEQ ID NO: 47.


In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 41-60. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 41-48 or 50-52. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 41-48 or 50-51. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 42. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 47.


A nucleic acid sequence encoding a nucleic acid-guided nuclease can be codon optimized for expression in a species of yeast, e.g., S. cerevisiae. In some instances, a nucleic acid-guided nuclease disclosed herein is encoded by a nucleic acid sequence comprising at least 50% sequence identity to any one of SEQ ID NO: 127-146. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 10%, 20%, 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, or 100% sequence identity to any one of SEQ ID NO: 127-146. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 127-146. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 127-146. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 127-134 or 136-138. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 127-134 or 136-137. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to SEQ ID NO: 128. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to SEQ ID NO: 133.


In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 127-146. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 127-134 or 136-138. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 127-134 or 136-137. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 128. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 133.


A nucleic acid sequence encoding a nucleic acid-guided nuclease can be codon optimized for expression in mammalian cells. In some instances, a nucleic acid-guided nuclease disclosed herein is encoded by a nucleic acid sequence comprising at least 50% sequence identity to any one of SEQ ID NO: 147-166. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 10%, 20%, 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, or 100% sequence identity to any one of SEQ ID NO: 147-166. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 147-166. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 147-154 or 156-158. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 147-154 or 156-157. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to SEQ ID NO: 148. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to SEQ ID NO: 153.


In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 147-166. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 147-154 or 156-158. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 147-154 or 156-157. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 148. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 153.


A nucleic acid sequence encoding a nucleic acid-guided nuclease can be operably linked to a promoter. Such nucleic acid sequences can be linear or circular. The nucleic acid sequences can be comprised on a larger linear or circular nucleic acid sequence that comprises additional elements such as an origin of replication, selectable or screenable marker, terminator, other components of a targetable nuclease system, such as a guide nucleic acid, or an editing or recorder cassette as disclosed herein. These larger nucleic acid sequences can be recombinant expression vectors, as are described in more detail later.


Guide Nucleic Acid


In general, a guide nucleic acid can complex with a compatible nucleic acid-guided nuclease and can hybridize with a target sequence, thereby directing the nuclease to the target sequence. A subject nucleic acid-guided nuclease capable of complexing with a guide nucleic acid can be referred to as a nucleic acid-guided nuclease that is compatible with the guide nucleic acid. Likewise, a guide nucleic acid capable of complexing with a nucleic acid-guided nuclease can be referred to as a guide nucleic acid that is compatible with the nucleic acid-guided nuclease.


A guide nucleic acid can be DNA. A guide nucleic acid can be RNA. A guide nucleic acid can comprise both DNA and RNA. A guide nucleic acid can comprise modified or non-naturally occurring nucleotides. In cases where the guide nucleic acid comprises RNA, the RNA guide nucleic acid can be encoded by a DNA sequence on a polynucleotide molecule such as a plasmid, linear construct, or editing cassette as disclosed herein.


A guide nucleic acid can comprise a guide sequence. A guide sequence is a polynucleotide sequence having sufficient complementarity with a target polynucleotide sequence to hybridize with the target sequence and direct sequence-specific binding of a complexed nucleic acid-guided nuclease to the target sequence. The degree of complementarity between a guide sequence and its corresponding target sequence, when optimally aligned using a suitable alignment algorithm, is about or more than about 50%, 60%, 75%, 80%, 85%, 90%, 95%, 97.5%, 99%, or more. Optimal alignment may be determined with the use of any suitable algorithm for aligning sequences. In some embodiments, a guide sequence is about or more than about 5, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, 75, or more nucleotides in length. In some embodiments, a guide sequence is less than about 75, 50, 45, 40, 35, 30, 25, 20 nucleotides in length. Preferably the guide sequence is 10-30 nucleotides long. The guide sequence can be 15-20 nucleotides in length. The guide sequence can be 15 nucleotides in length. The guide sequence can be 16 nucleotides in length. The guide sequence can be 17 nucleotides in length. The guide sequence can be 18 nucleotides in length. The guide sequence can be 19 nucleotides in length. The guide sequence can be 20 nucleotides in length.
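
By way of non-limiting illustration, the degree of complementarity and the preferred length range described above can be sketched in Python. The function names, the DNA-only pairing table, and the ungapped, equal-length comparison are assumptions made for this sketch; any suitable alignment algorithm could be substituted.

```python
# Watson-Crick pairing for DNA; an RNA guide would pair A with U instead of T.
PAIR = {"A": "T", "T": "A", "G": "C", "C": "G"}


def percent_complementarity(guide: str, target: str) -> float:
    """Percent of guide positions that base-pair with the target.

    Both sequences are written 5'->3'. The target is read in reverse so
    that the two strands are compared antiparallel. Gaps are not modeled.
    """
    guide, target = guide.upper(), target.upper()
    if len(guide) != len(target):
        raise ValueError("this sketch only handles equal-length sequences")
    paired = sum(PAIR[g] == t for g, t in zip(guide, reversed(target)))
    return 100.0 * paired / len(guide)


def length_in_preferred_range(guide: str, low: int = 10, high: int = 30) -> bool:
    """True when the guide length falls in the preferred 10-30 nt range."""
    return low <= len(guide) <= high


target = "ATGCATGCATGCATGCATGC"        # hypothetical 20-nt target
perfect = "GCATGCATGCATGCATGCAT"       # its reverse complement
one_mismatch = "A" + perfect[1:]       # first guide position no longer pairs
print(percent_complementarity(perfect, target))
print(percent_complementarity(one_mismatch, target))
```

In this sketch a single unpaired position in a 20-nucleotide guide lowers the computed complementarity from 100% to 95%.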


A guide nucleic acid can comprise a scaffold sequence. In general, a “scaffold sequence” includes any sequence that has sufficient sequence to promote formation of a targetable nuclease complex, wherein the targetable nuclease complex comprises a nucleic acid-guided nuclease and a guide nucleic acid comprising a scaffold sequence and a guide sequence. Sufficient sequence within the scaffold sequence to promote formation of a targetable nuclease complex may include a degree of complementarity along the length of two sequence regions within the scaffold sequence, such as one or two sequence regions involved in forming a secondary structure. In some cases, the one or two sequence regions are comprised or encoded on the same polynucleotide. In some cases, the one or two sequence regions are comprised or encoded on separate polynucleotides. Optimal alignment may be determined by any suitable alignment algorithm, and may further account for secondary structures, such as self-complementarity within either the one or two sequence regions. In some embodiments, the degree of complementarity between the one or two sequence regions along the length of the shorter of the two when optimally aligned is about or more than about 25%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 97.5%, 99%, or higher. In some embodiments, at least one of the two sequence regions is about or more than about 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 40, 50, or more nucleotides in length.


A scaffold sequence of a subject guide nucleic acid can comprise a secondary structure. A secondary structure can comprise a pseudoknot region. In some cases, binding kinetics of a guide nucleic acid to a nucleic acid-guided nuclease is determined in part by secondary structures within the scaffold sequence. In some cases, binding kinetics of a guide nucleic acid to a nucleic acid-guided nuclease is determined in part by nucleic acid sequence within the scaffold sequence.


A scaffold sequence can comprise the sequence of any one of SEQ ID NO: 84-107. A scaffold sequence can comprise the sequence of any one of SEQ ID NO: 84-103. A scaffold sequence can comprise the sequence of any one of SEQ ID NO: 84-91 or 93-95. A scaffold sequence can comprise the sequence of any one of SEQ ID NO: 88, 93, 94, or 95. A scaffold sequence can comprise the sequence of SEQ ID NO: 88. A scaffold sequence can comprise the sequence of SEQ ID NO: 93. A scaffold sequence can comprise the sequence of SEQ ID NO: 94. A scaffold sequence can comprise the sequence of SEQ ID NO: 95.


In some aspects, the invention provides a nuclease that binds to a guide nucleic acid comprising a conserved scaffold sequence. For example, the nucleic acid-guided nucleases for use in the present disclosure can bind to a conserved pseudoknot region as shown in FIG. 13A. Specifically, the nucleic acid-guided nucleases for use in the present disclosure can bind to a guide nucleic acid comprising a conserved pseudoknot region as shown in FIG. 13A. Certain nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-1 (SEQ ID NO: 172). Other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-3 (SEQ ID NO: 173). Still other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-4 (SEQ ID NO: 174). Other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-5 (SEQ ID NO: 175). Other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-6 (SEQ ID NO: 176). Still other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-7 (SEQ ID NO: 177). 
Other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-8 (SEQ ID NO: 178). Other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-10 (SEQ ID NO: 179). Still other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-11 (SEQ ID NO: 180). Certain nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-12 (SEQ ID NO: 181). Additional sequences in FIG. 13A include those for the consensus sequence (SEQ ID No: 190); frame 1 (SEQ ID No: 191); scaffold-1 (SEQ ID No: 192); scaffold-2 (SEQ ID No: 193); scaffold-3 (SEQ ID No: 194); scaffold-4 (SEQ ID No: 195); scaffold-5 (SEQ ID No: 196); scaffold-6 (SEQ ID No: 197); scaffold-7 (SEQ ID No: 198); scaffold-8 (SEQ ID No: 199); scaffold-10 (SEQ ID No: 200); scaffold-11 (SEQ ID No: 201); and scaffold-12 (SEQ ID No: 202).


A guide nucleic acid can comprise the sequence of any one of SEQ ID NO: 84-107. A guide nucleic acid can comprise the sequence of any one of SEQ ID NO: 84-103. A guide nucleic acid can comprise the sequence of any one of SEQ ID NO: 84-91 or 93-95. A guide nucleic acid can comprise the sequence of any one of SEQ ID NO: 88, 93, 94, or 95. A guide nucleic acid can comprise the sequence of SEQ ID NO: 88. A guide nucleic acid can comprise the sequence of SEQ ID NO: 93. A guide nucleic acid can comprise the sequence of SEQ ID NO: 94. A guide nucleic acid can comprise the sequence of SEQ ID NO: 95.


In aspects of the invention the term “guide nucleic acid” refers to one or more polynucleotides comprising 1) a guide sequence capable of hybridizing to a target sequence and 2) a scaffold sequence capable of interacting with or complexing with a nucleic acid-guided nuclease as described herein. A guide nucleic acid may be provided as one or more nucleic acids. In specific embodiments, the guide sequence and the scaffold sequence are provided as a single polynucleotide.


A guide nucleic acid can be compatible with a nucleic acid-guided nuclease when the two elements can form a functional targetable nuclease complex capable of cleaving a target sequence. Often, a compatible scaffold sequence for a compatible guide nucleic acid can be found by scanning sequences adjacent to a native nucleic acid-guided nuclease locus. In other words, native nucleic acid-guided nucleases can be encoded on a genome within proximity to a corresponding compatible guide nucleic acid or scaffold sequence.


Nucleic acid-guided nucleases can be compatible with guide nucleic acids that are not found within the nuclease's endogenous host. Such orthogonal guide nucleic acids can be determined by empirical testing. Orthogonal guide nucleic acids can come from different bacterial species or be synthetic or otherwise engineered to be non-naturally occurring.


Orthogonal guide nucleic acids that are compatible with a common nucleic acid-guided nuclease can comprise one or more common features. Common features can include sequence outside a pseudoknot region. Common features can include a pseudoknot region. Common features can include a primary sequence or secondary structure.


A guide nucleic acid can be engineered to target a desired target sequence by altering the guide sequence such that the guide sequence is complementary to the target sequence, thereby allowing hybridization between the guide sequence and the target sequence. A guide nucleic acid with an engineered guide sequence can be referred to as an engineered guide nucleic acid. Engineered guide nucleic acids are often non-naturally occurring and are not found in nature.
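
As a minimal sketch of the engineering step described above, the guide sequence can be derived as the reverse complement of the desired target sequence and joined to a scaffold sequence. The scaffold string below is a hypothetical placeholder rather than any of the disclosed SEQ ID NOs, and the `scaffold_first` parameter is an assumption of this sketch, since the relative order of scaffold and guide sequence depends on the nuclease used.

```python
# Translation table mapping each DNA base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def engineer_guide(target: str, scaffold: str, scaffold_first: bool = True) -> str:
    """Join a scaffold to a guide sequence complementary to the target.

    Whether the scaffold precedes or follows the guide sequence is
    nuclease-specific, so it is left as a parameter here.
    """
    guide_sequence = reverse_complement(target)
    if scaffold_first:
        return scaffold + guide_sequence
    return guide_sequence + scaffold


PLACEHOLDER_SCAFFOLD = "NNNNNNNN"  # hypothetical stand-in for a real scaffold
print(engineer_guide("ATGGCC", PLACEHOLDER_SCAFFOLD))
```

Retargeting the guide nucleic acid in this sketch only requires passing a different target sequence; the scaffold is left unchanged.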


Targetable Nuclease System


Disclosed herein are targetable nuclease systems. A targetable nuclease system can comprise a nucleic acid-guided nuclease and a compatible guide nucleic acid. A targetable nuclease system can comprise a nucleic acid-guided nuclease or a polynucleotide sequence encoding the nucleic acid-guided nuclease. A targetable nuclease system can comprise a guide nucleic acid or a polynucleotide sequence encoding the guide nucleic acid.


In general, a targetable nuclease system as disclosed herein is characterized by elements that promote the formation of a targetable nuclease complex at the site of a target sequence, wherein the targetable nuclease complex comprises a nucleic acid-guided nuclease and a guide nucleic acid.


A guide nucleic acid together with a nucleic acid-guided nuclease forms a targetable nuclease complex which is capable of binding to a target sequence within a target polynucleotide, as determined by the guide sequence of the guide nucleic acid.


In general, to generate a double-stranded break, a targetable nuclease complex must in most cases bind to a target sequence as determined by the guide nucleic acid, and the nuclease must recognize a protospacer adjacent motif (PAM) sequence adjacent to the target sequence.


A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-20 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-8 or 10-12 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-8 or 10-11 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid. In any of these cases, the guide nucleic acid can comprise a scaffold sequence compatible with the nucleic acid-guided nuclease. In any of these cases, the guide nucleic acid can further comprise a guide sequence. The guide sequence can be engineered to target any desired target sequence. The guide sequence can be engineered to be complementary to any desired target sequence. The guide sequence can be engineered to hybridize to any desired target sequence.


A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-20 and a compatible guide nucleic acid comprising any one of SEQ ID NO: 84-107. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-8 or 10-12 and a compatible guide nucleic acid comprising any one of SEQ ID NO: 84-95. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-8 or 10-11 and a compatible guide nucleic acid comprising any one of SEQ ID NO: 84-91 or 93-95. In any of these cases, the guide nucleic acid can further comprise a guide sequence. The guide sequence can be engineered to target any desired target sequence. The guide sequence can be engineered to be complementary to any desired target sequence. The guide sequence can be engineered to hybridize to any desired target sequence.


A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid comprising any one of SEQ ID NO: 88, 93, 94, or 95. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid comprising SEQ ID NO: 88. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid comprising SEQ ID NO: 93. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid comprising SEQ ID NO: 94. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid comprising SEQ ID NO: 95. In any of these cases, the guide nucleic acid can further comprise a guide sequence. The guide sequence can be engineered to target any desired target sequence. The guide sequence can be engineered to be complementary to any desired target sequence. The guide sequence can be engineered to hybridize to any desired target sequence.


A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid comprising any one of SEQ ID NO: 88, 93, 94, or 95. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid comprising SEQ ID NO: 88. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid comprising SEQ ID NO: 93. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid comprising SEQ ID NO: 94. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid comprising SEQ ID NO: 95. In any of these cases, the guide nucleic acid can further comprise a guide sequence. The guide sequence can be engineered to target any desired target sequence. The guide sequence can be engineered to be complementary to any desired target sequence. The guide sequence can be engineered to hybridize to any desired target sequence.


A target sequence of a targetable nuclease complex can be any polynucleotide endogenous or exogenous to a prokaryotic or eukaryotic cell, or in vitro. For example, the target sequence can be a polynucleotide residing in the nucleus of the eukaryotic cell. A target sequence can be a sequence coding a gene product (e.g., a protein) or a non-coding sequence (e.g., a regulatory polynucleotide or junk DNA). Without wishing to be bound by theory, it is believed that the target sequence should be associated with a PAM; that is, a short sequence recognized by a targetable nuclease complex. The precise sequence and length requirements for a PAM differ depending on the nucleic acid-guided nuclease used, but PAMs are typically 2-5 base pair sequences adjacent to the target sequence. Examples of PAM sequences are given in the examples section below, and the skilled person will be able to identify further PAM sequences for use with a given nucleic acid-guided nuclease. Further, engineering of the PAM Interacting (PI) domain may allow programming of PAM specificity, improve target site recognition fidelity, and increase the versatility of a nucleic acid-guided nuclease genome engineering platform. Nucleic acid-guided nucleases may be engineered to alter their PAM specificity, for example as described in Kleinstiver B P et al. Engineered CRISPR-Cas9 nucleases with altered PAM specificities. Nature. 2015 Jul. 23; 523 (7561): 481-5. doi: 10.1038/nature14592.


A PAM site is a nucleotide sequence in proximity to a target sequence. In most cases, a nucleic acid-guided nuclease can only cleave a target sequence if an appropriate PAM is present. PAMs are nucleic acid-guided nuclease-specific and can be different between two different nucleic acid-guided nucleases. A PAM can be 5′ or 3′ of a target sequence. A PAM can be upstream or downstream of a target sequence. A PAM can be 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides in length. Often, a PAM is between 2-6 nucleotides in length.
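
For illustration only, candidate target sequences adjacent to a PAM can be located by scanning a sequence for the PAM motif. In the sketch below the motifs `TTN` and `NGG`, the example sequences, and the function name `find_targets` are assumptions chosen for the example and are not taken from the examples section; only the strand as written is scanned, so a fuller search would also scan the reverse complement.

```python
import re

# IUPAC nucleotide codes expanded to regular-expression character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}


def find_targets(seq: str, pam: str, target_len: int = 20, pam_side: str = "5prime"):
    """Return (pam_start, target) pairs for every PAM found in seq.

    pam_side="5prime" means the PAM lies 5' of the target, so the target
    follows the PAM; pam_side="3prime" means the target precedes the PAM.
    A lookahead is used so that overlapping PAM occurrences are reported.
    """
    seq = seq.upper()
    pattern = re.compile("(?=(" + "".join(IUPAC[b] for b in pam.upper()) + "))")
    hits = []
    for match in pattern.finditer(seq):
        start = match.start()
        if pam_side == "5prime":
            t0 = start + len(pam)
            t1 = t0 + target_len
        else:
            t1 = start
            t0 = t1 - target_len
        # Keep only targets that fit entirely inside the scanned sequence.
        if t0 >= 0 and t1 <= len(seq):
            hits.append((start, seq[t0:t1]))
    return hits


print(find_targets("GGTTTACGTACGTACC", "TTN", target_len=5, pam_side="5prime"))
print(find_targets("ACGTACGTAGGT", "NGG", target_len=5, pam_side="3prime"))
```

Because PAMs are nuclease-specific, both the motif and the side on which the PAM lies are left as parameters in this sketch.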


In some examples, a PAM can be provided on a separate oligonucleotide. In such cases, providing the PAM on an oligonucleotide allows cleavage of a target sequence that otherwise could not be cleaved because no adjacent PAM is present on the same polynucleotide as the target sequence.


Polynucleotide sequences encoding a component of a targetable nuclease system can comprise one or more vectors. In general, the term “vector” refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. Vectors include, but are not limited to, nucleic acid molecules that are single-stranded, double-stranded, or partially double-stranded; nucleic acid molecules that comprise one or more free ends, no free ends (e.g. circular); nucleic acid molecules that comprise DNA, RNA, or both; and other varieties of polynucleotides known in the art. One type of vector is a “plasmid,” which refers to a circular double stranded DNA loop into which additional DNA segments can be inserted, such as by standard molecular cloning techniques. Another type of vector is a viral vector, wherein virally-derived DNA or RNA sequences are present in the vector for packaging into a virus (e.g. retroviruses, replication defective retroviruses, adenoviruses, replication defective adenoviruses, and adeno-associated viruses). Viral vectors also include polynucleotides carried by a virus for transfection into a host cell. Certain vectors are capable of autonomous replication in a host cell into which they are introduced (e.g. bacterial vectors having a bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome. Moreover, certain vectors are capable of directing the expression of genes to which they are operatively-linked. Such vectors are referred to herein as “expression vectors.” Common expression vectors of utility in recombinant DNA techniques are often in the form of plasmids. Further discussion of vectors is provided herein.


Recombinant expression vectors can comprise a nucleic acid of the invention in a form suitable for expression of the nucleic acid in a host cell, which means that the recombinant expression vectors include one or more regulatory elements, which may be selected on the basis of the host cells to be used for expression, that are operatively-linked to the nucleic acid sequence to be expressed. Within a recombinant expression vector, “operably linked” is intended to mean that the nucleotide sequence of interest is linked to the regulatory element(s) in a manner that allows for expression of the nucleotide sequence (e.g. in an in vitro transcription/translation system or in a host cell when the vector is introduced into the host cell). With regards to recombination and cloning methods, mention is made of U.S. patent application Ser. No. 10/815,730, published Sep. 2, 2004 as US 2004-0171156 A1, the contents of which are herein incorporated by reference in their entirety.


In some embodiments, a regulatory element is operably linked to one or more elements of a targetable nuclease system so as to drive expression of the one or more components of the targetable nuclease system.


In some embodiments, a vector comprises a regulatory element operably linked to a polynucleotide sequence encoding a nucleic acid-guided nuclease. The polynucleotide sequence encoding the nucleic acid-guided nuclease can be codon optimized for expression in particular cells, such as prokaryotic or eukaryotic cells. Eukaryotic cells can be yeast, fungi, algae, plant, animal, or human cells. Eukaryotic cells may be those of or derived from a particular organism, such as a mammal, including but not limited to human, mouse, rat, rabbit, dog, or non-human mammal including non-human primate.


In general, codon optimization refers to a process of modifying a nucleic acid sequence for enhanced expression in the host cells of interest by replacing at least one codon (e.g. about or more than about 1, 2, 3, 4, 5, 10, 15, 20, 25, 50, or more codons) of the native sequence with codons that are more frequently or most frequently used in the genes of that host cell while maintaining the native amino acid sequence. Various species exhibit particular bias for certain codons of a particular amino acid. Codon bias (differences in codon usage between organisms) often correlates with the efficiency of translation of messenger RNA (mRNA), which is in turn believed to be dependent on, among other things, the properties of the codons being translated and the availability of particular transfer RNA (tRNA) molecules. The predominance of selected tRNAs in a cell is generally a reflection of the codons used most frequently in peptide synthesis. Accordingly, genes can be tailored for optimal gene expression in a given organism based on codon optimization. Codon usage tables are readily available, for example, at the “Codon Usage Database” available at www.kazusa.or.jp/codon/ (visited Jul. 9, 2002), and these tables can be adapted in a number of ways. See Nakamura, Y., et al. “Codon usage tabulated from the international DNA sequence databases: status for the year 2000” Nucl. Acids Res. 28:292 (2000). Computer algorithms for codon optimizing a particular sequence for expression in a particular host cell, such as Gene Forge (Aptagen; Jacobus, Pa.), are also available. In some embodiments, one or more codons (e.g. 1, 2, 3, 4, 5, 10, 15, 20, 25, 50, or more, or all codons) in a sequence encoding an engineered nuclease correspond to the most frequently used codon for a particular amino acid.
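
The codon replacement described above can be sketched as follows. The usage table is a hypothetical, deliberately tiny table for an unnamed host, with illustrative frequencies that were not taken from the Codon Usage Database; it covers only the few amino acids needed for the example. The sketch simply substitutes the most frequently used codon for each amino acid and is not the algorithm of any particular software package.

```python
# Hypothetical codon usage (frequency per thousand) for a made-up host.
# Only a handful of amino acids are included to keep the sketch short.
USAGE = {
    "M": {"ATG": 22.0},
    "K": {"AAA": 24.0, "AAG": 32.0},
    "L": {"CTG": 40.0, "CTC": 20.0, "CTT": 13.0, "TTG": 13.0, "TTA": 7.0, "CTA": 7.0},
    "E": {"GAA": 29.0, "GAG": 40.0},
    "*": {"TAA": 1.0, "TGA": 1.6, "TAG": 0.8},
}

# Reverse lookup (codon -> amino acid) and the preferred codon per amino acid.
CODON_TO_AA = {codon: aa for aa, codons in USAGE.items() for codon in codons}
PREFERRED = {aa: max(codons, key=codons.get) for aa, codons in USAGE.items()}


def split_codons(cds: str):
    """Split a coding sequence into consecutive three-nucleotide codons."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("coding sequence length must be a multiple of 3")
    return [cds[i:i + 3] for i in range(0, len(cds), 3)]


def translate(cds: str) -> str:
    """Translate a coding sequence using the small table above."""
    return "".join(CODON_TO_AA[c] for c in split_codons(cds))


def codon_optimize(cds: str) -> str:
    """Replace every codon with the host's most frequent synonymous codon."""
    return "".join(PREFERRED[CODON_TO_AA[c]] for c in split_codons(cds))


native = "ATGAAATTAGAATAA"          # hypothetical sequence encoding M-K-L-E-stop
optimized = codon_optimize(native)
print(optimized, translate(native) == translate(optimized))
```

The encoded amino acid sequence is unchanged by the substitution, which is the constraint stated above; a practical optimizer might replace only some codons or weigh additional factors.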


In some embodiments, a vector encodes a nucleic acid-guided nuclease comprising one or more nuclear localization sequences (NLSs), such as about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more NLSs. In some embodiments, the engineered nuclease comprises about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more NLSs at or near the amino-terminus, about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more NLSs at or near the carboxy-terminus, or a combination of these (e.g. one or more NLS at the amino-terminus and one or more NLS at the carboxy terminus). When more than one NLS is present, each may be selected independently of the others, such that a single NLS may be present in more than one copy and/or in combination with one or more other NLSs present in one or more copies. In a preferred embodiment of the invention, the engineered nuclease comprises at most 6 NLSs. In some embodiments, an NLS is considered near the N- or C-terminus when the nearest amino acid of the NLS is within about 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50, or more amino acids along the polypeptide chain from the N- or C-terminus. Non-limiting examples of NLSs include an NLS sequence derived from: the NLS of the SV40 virus large T-antigen, having the amino acid sequence PKKKRKV (SEQ ID NO: 111); the NLS from nucleoplasmin (e.g. the nucleoplasmin bipartite NLS with the sequence KRPAATKKAGQAKKKK (SEQ ID NO: 112)); the c-myc NLS having the amino acid sequence PAAKRVKLD (SEQ ID NO: 113) or RQRRNELKRSP (SEQ ID NO: 114); the hRNPA1 M9 NLS having the sequence NQSSNFGPMKGGNFGGRSSGPYGGGGQYFAKPRNQGGY (SEQ ID NO: 115); the sequence RMRIZFKNKGKDTAELRRRRVEVSVELRKAKKDEQILKRRNV (SEQ ID NO: 116) of the IBB domain from importin-alpha; the sequences VSRKRPRP (SEQ ID NO: 117) and PPKKARED (SEQ ID NO:115) of the myoma T protein; the sequence PQPKKKPL (SEQ ID NO: 119) of human p53; the sequence SALIKKKKKMAP (SEQ ID NO: 120) of mouse c-abl IV; the sequences DRLRR (SEQ ID NO: 121) and PKQKKRK (SEQ ID NO: 122) of the influenza virus NS1; the sequence RKLKKKIKKL (SEQ ID NO: 123) of the Hepatitis virus delta antigen; the sequence REKKKFLKRR (SEQ ID NO: 124) of the mouse Mx1 protein; the sequence KRKGDEVDGVDEVAKKKSKK (SEQ ID NO: 125) of the human poly(ADP-ribose) polymerase; and the sequence RKCLQAGMNLEARKTKK (SEQ ID NO: 126) of the steroid hormone receptors (human) glucocorticoid.
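
As a hedged illustration of the "near the N- or C-terminus" criterion, the sketch below locates each copy of an NLS in a polypeptide and reports how many residues separate it from either terminus. The example polypeptide is hypothetical filler, the function names are assumptions of this sketch, and counting the residues lying between the terminus and the nearest NLS residue is one possible reading of the distance described above.

```python
SV40_NLS = "PKKKRKV"  # SV40 large T-antigen NLS recited above


def nls_distances(protein: str, nls: str):
    """Return (residues_before, residues_after) for each NLS occurrence.

    residues_before counts amino acids between the N-terminus and the
    NLS; residues_after counts amino acids between the NLS and the
    C-terminus. Overlapping occurrences are also reported.
    """
    distances = []
    start = protein.find(nls)
    while start != -1:
        end = start + len(nls)
        distances.append((start, len(protein) - end))
        start = protein.find(nls, start + 1)
    return distances


def has_nls_near_terminus(protein: str, nls: str, window: int = 10) -> bool:
    """True if any NLS copy lies within `window` residues of either terminus."""
    return any(n <= window or c <= window for n, c in nls_distances(protein, nls))


# Hypothetical polypeptide: 6 filler residues, one NLS, then 50 filler residues.
example = "M" + "A" * 5 + SV40_NLS + "G" * 50
print(nls_distances(example, SV40_NLS))
print(has_nls_near_terminus(example, SV40_NLS, window=10))
```

The window size corresponds to the "within about 1, 2, 3, 4, 5, 10, ... amino acids" values recited above and is left as a parameter.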


In general, the one or more NLSs are of sufficient strength to drive accumulation of the nucleic acid-guided nuclease in a detectable amount in the nucleus of a eukaryotic cell. In general, strength of nuclear localization activity may derive from the number of NLSs, the particular NLS(s) used, or a combination of these factors. Detection of accumulation in the nucleus may be performed by any suitable technique. For example, a detectable marker may be fused to the nucleic acid-guided nuclease, such that location within a cell may be visualized, such as in combination with a means for detecting the location of the nucleus (e.g. a stain specific for the nucleus such as DAPI). Cell nuclei may also be isolated from cells, the contents of which may then be analyzed by any suitable process for detecting protein, such as immunohistochemistry, Western blot, or enzyme activity assay. Accumulation in the nucleus may also be determined indirectly, such as by an assay for the effect of the nucleic acid-guided nuclease complex formation (e.g. assay for DNA cleavage or mutation at the target sequence, or assay for altered gene expression activity affected by targetable nuclease complex formation and/or nucleic acid-guided nuclease activity), as compared to a control not exposed to the nucleic acid-guided nuclease or targetable nuclease complex, or exposed to a nucleic acid-guided nuclease lacking the one or more NLSs.


A nucleic acid-guided nuclease and one or more guide nucleic acids can be delivered either as DNA or RNA. Delivery of a nucleic acid-guided nuclease and guide nucleic acid both as RNA (unmodified or containing base or backbone modifications) molecules can be used to reduce the amount of time that the nucleic acid-guided nuclease persists in the cell. This may reduce the level of off-target cleavage activity in the target cell. Since a nucleic acid-guided nuclease delivered as mRNA takes time to be translated into protein, it might be advantageous to deliver the guide nucleic acid several hours following the delivery of the nucleic acid-guided nuclease mRNA, to maximize the level of guide nucleic acid available for interaction with the nucleic acid-guided nuclease protein. In other cases, the nucleic acid-guided nuclease mRNA and guide nucleic acid are delivered concomitantly. In other examples, the guide nucleic acid is delivered sequentially, such as 0.5, 1, 2, 3, 4, or more hours after the nucleic acid-guided nuclease mRNA.


In situations where guide nucleic acid amount is limiting, it may be desirable to introduce a nucleic acid-guided nuclease as mRNA and guide nucleic acid in the form of a DNA expression cassette with a promoter driving the expression of the guide nucleic acid. This way the amount of guide nucleic acid available will be amplified via transcription.


Guide nucleic acid in the form of RNA or encoded on a DNA expression cassette can be introduced into a host cell comprising a nucleic acid-guided nuclease encoded on a vector or chromosome. The guide nucleic acid may be provided in the cassette as one or more polynucleotides, which may be contiguous or non-contiguous in the cassette. In specific embodiments, the guide nucleic acid is provided in the cassette as a single contiguous polynucleotide.


A variety of delivery systems can be used to introduce a nucleic acid-guided nuclease (DNA or RNA) and guide nucleic acid (DNA or RNA) into a host cell. These include the use of yeast systems, lipofection systems, microinjection systems, biolistic systems, virosomes, liposomes, immunoliposomes, polycations, lipid:nucleic acid conjugates, virions, artificial virions, viral vectors, electroporation, cell permeable peptides, nanoparticles, nanowires (Shalek et al., Nano Letters, 2012), and exosomes. Molecular Trojan horse liposomes (Pardridge et al., Cold Spring Harb Protoc; 2010; doi:10.1101/pdb.prot5407) may be used to deliver an engineered nuclease and guide nucleic acid across the blood-brain barrier.


In some embodiments, an editing template is also provided. An editing template may be a component of a vector as described herein, contained in a separate vector, or provided as a separate polynucleotide, such as an oligonucleotide, linear polynucleotide, or synthetic polynucleotide. In some cases, an editing template is on the same polynucleotide as a guide nucleic acid. In some embodiments, an editing template is designed to serve as a template in homologous recombination, such as within or near a target sequence nicked or cleaved by a nucleic acid-guided nuclease as a part of a complex as disclosed herein. An editing template polynucleotide may be of any suitable length, such as about or more than about 10, 15, 20, 25, 50, 75, 100, 150, 200, 500, 1000, or more nucleotides in length. In some embodiments, the editing template polynucleotide is complementary to a portion of a polynucleotide comprising the target sequence. When optimally aligned, an editing template polynucleotide might overlap with one or more nucleotides of a target sequence (e.g. about or more than about 1, 5, 10, 15, 20, 25, 30, 35, 40, or more nucleotides). In some embodiments, when an editing template sequence and a polynucleotide comprising a target sequence are optimally aligned, the nearest nucleotide of the template polynucleotide is within about 1, 5, 10, 15, 20, 25, 50, 75, 100, 200, 300, 400, 500, 1000, 5000, 10000, or more nucleotides from the target sequence.


In many examples, an editing template comprises at least one mutation compared to the target sequence. An editing template can comprise an insertion, deletion, modification, or any combination thereof compared to the target sequence. Examples of some editing templates are described in more detail in a later section.
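
A minimal sketch of an editing template carrying a single substitution is given below. The genomic sequence, the edit position, the arm length, and the function names are hypothetical, and the template is written on the same strand as the genomic sequence for readability; insertions, deletions, and other designs are not modeled.

```python
def build_editing_template(genome: str, edit_pos: int, new_base: str, arm: int = 10) -> str:
    """Return flanking sequence around edit_pos with one base substituted.

    `arm` is the number of unchanged nucleotides kept on each side of the
    edit so that the template remains homologous to the target region.
    """
    if not 0 <= edit_pos < len(genome):
        raise ValueError("edit position lies outside the sequence")
    left = genome[max(0, edit_pos - arm):edit_pos]
    right = genome[edit_pos + 1:edit_pos + 1 + arm]
    return left + new_base + right


def count_differences(a: str, b: str) -> int:
    """Count positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x != y for x, y in zip(a, b))


genome = "ACGTACGTACGTACGTACGTACGT"   # hypothetical target region
template = build_editing_template(genome, edit_pos=12, new_base="G", arm=5)
original_window = genome[7:18]         # the same window before editing
print(template, count_differences(template, original_window))
```

In this example the template differs from the corresponding genomic window at exactly one position, which is the edit to be introduced.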


In some aspects, the invention provides methods comprising delivering one or more polynucleotides, such as one or more vectors or linear polynucleotides as described herein, one or more transcripts thereof, and/or one or more proteins transcribed therefrom, to a host cell. In some aspects, the invention further provides cells produced by such methods, and organisms comprising or produced from such cells. In some embodiments, an engineered nuclease in combination with (and optionally complexed with) a guide nucleic acid is delivered to a cell.


Conventional viral and non-viral based gene transfer methods can be used to introduce nucleic acids in cells, such as prokaryotic cells, eukaryotic cells, mammalian cells, or target tissues. Such methods can be used to administer nucleic acids encoding components of an engineered nucleic acid-guided nuclease system to cells in culture, or in a host organism. Non-viral vector delivery systems include DNA plasmids, RNA (e.g. a transcript of a vector described herein), naked nucleic acid, and nucleic acid complexed with a delivery vehicle, such as a liposome. Viral vector delivery systems include DNA and RNA viruses, which have either episomal or integrated genomes after delivery to the cell. For a review of gene therapy procedures, see Anderson, Science 256:808-813 (1992); Nabel & Felgner, TIBTECH 11:211-217 (1993); Mitani & Caskey, TIBTECH 11:162-166 (1993); Dillon, TIBTECH 11:167-175 (1993); Miller, Nature 357:455-460 (1992); Van Brunt, Biotechnology 6(10):1149-1154 (1988); Vigne, Restorative Neurology and Neuroscience 8:35-36 (1995); Kremer & Perricaudet, British Medical Bulletin 51(1):31-44 (1995); Haddada et al., in Current Topics in Microbiology and Immunology Doerfler and Bohm (eds) (1995); and Yu et al., Gene Therapy 1:13-26 (1994).


Methods of non-viral delivery of nucleic acids include lipofection, microinjection, biolistics, virosomes, liposomes, immunoliposomes, polycation or lipid:nucleic acid conjugates, naked DNA, artificial virions, and agent-enhanced uptake of DNA. Lipofection is described in, e.g., U.S. Pat. Nos. 5,049,386, 4,946,787, and 4,897,355, and lipofection reagents are sold commercially (e.g., Transfectam™ and Lipofectin™). Cationic and neutral lipids that are suitable for efficient receptor-recognition lipofection of polynucleotides include those of Felgner, WO 91/17424; WO 91/16024. Delivery can be to cells (e.g. in vitro or ex vivo administration) or target tissues (e.g. in vivo administration).


The preparation of lipid:nucleic acid complexes, including targeted liposomes such as immunolipid complexes, is well known to one of skill in the art (see, e.g., Crystal, Science 270:404-410 (1995); Blaese et al., Cancer Gene Ther. 2:291-297 (1995); Behr et al., Bioconjugate Chem. 5:382-389 (1994); Remy et al., Bioconjugate Chem. 5:647-654 (1994); Gao et al., Gene Therapy 2:710-722 (1995); Ahmad et al., Cancer Res. 52:4817-4820 (1992); U.S. Pat. Nos. 4,186,183, 4,217,344, 4,235,871, 4,261,975, 4,485,054, 4,501,728, 4,774,085, 4,837,028, and 4,946,787).


The use of RNA or DNA viral based systems for the delivery of nucleic acids takes advantage of highly evolved processes for targeting a virus to specific cells in culture or in the host and trafficking the viral payload to the nucleus or host cell genome. Viral vectors can be administered directly to cells in culture or to patients (in vivo), or they can be used to treat cells in vitro, and the modified cells may optionally be administered to patients (ex vivo). Conventional viral based systems could include retroviral, lentivirus, adenoviral, adeno-associated and herpes simplex virus vectors for gene transfer. Integration in the host genome is possible with the retrovirus, lentivirus, and adeno-associated virus gene transfer methods, often resulting in long term expression of the inserted transgene. Additionally, high transduction efficiencies have been observed in many different cell types and target tissues.


The tropism of a retrovirus can be altered by incorporating foreign envelope proteins, expanding the potential target population of target cells. Lentiviral vectors are retroviral vectors that are able to transduce or infect non-dividing cells and typically produce high viral titers. Selection of a retroviral gene transfer system would therefore depend on the target tissue. Retroviral vectors are comprised of cis-acting long terminal repeats with packaging capacity for up to 6-10 kb of foreign sequence. The minimum cis-acting LTRs are sufficient for replication and packaging of the vectors, which are then used to integrate the therapeutic gene into the target cell to provide permanent transgene expression. Widely used retroviral vectors include those based upon murine leukemia virus (MuLV), gibbon ape leukemia virus (GaLV), simian immunodeficiency virus (SIV), human immunodeficiency virus (HIV), and combinations thereof (see, e.g., Buchscher et al., J. Virol. 66:2731-2739 (1992); Johann et al., J. Virol. 66:1635-1640 (1992); Sommerfelt et al., Virol. 176:58-59 (1990); Wilson et al., J. Virol. 63:2374-2378 (1989); Miller et al., J. Virol. 65:2220-2224 (1991); PCT/US94/05700).


In applications where transient expression is preferred, adenoviral based systems may be used. Adenoviral based vectors are capable of very high transduction efficiency in many cell types and do not require cell division. With such vectors, high titer and levels of expression have been obtained. This vector can be produced in large quantities in a relatively simple system.


Adeno-associated virus (“AAV”) vectors may also be used to transduce cells with target nucleic acids, e.g., in the in vitro production of nucleic acids and peptides, and for in vivo and ex vivo gene therapy procedures (see, e.g., West et al., Virology 160:38-47 (1987); U.S. Pat. No. 4,797,368; WO 93/24641; Kotin, Human Gene Therapy 5:793-801 (1994); Muzyczka, J. Clin. Invest. 94:1351 (1994)). Construction of recombinant AAV vectors is described in a number of publications, including U.S. Pat. No. 5,173,414; Tratschin et al., Mol. Cell. Biol. 5:3251-3260 (1985); Tratschin, et al., Mol. Cell. Biol. 4:2072-2081 (1984); Hermonat & Muzyczka, PNAS 81:6466-6470 (1984); and Samulski et al., J. Virol. 63:3822-3828 (1989).


In some embodiments, a host cell is transiently or non-transiently transfected with one or more vectors, linear polynucleotides, polypeptides, nucleic acid-protein complexes, or any combination thereof as described herein. In some embodiments, a cell is transfected in vitro, in culture, or ex vivo. In some embodiments, a cell is transfected as it naturally occurs in a subject. In some embodiments, a cell that is transfected is taken from a subject. In some embodiments, the cell is derived from cells taken from a subject, such as a cell line.


In some embodiments, a cell transfected with one or more vectors, linear polynucleotides, polypeptides, nucleic acid-protein complexes, or any combination thereof as described herein is used to establish a new cell line comprising one or more transfection-derived sequences. In some embodiments, a cell transiently transfected with the components of an engineered nucleic acid-guided nuclease system as described herein (such as by transient transfection of one or more vectors, or transfection with RNA), and modified through the activity of an engineered nuclease complex, is used to establish a new cell line comprising cells containing the modification but lacking any other exogenous sequence.


In some embodiments, one or more vectors described herein are used to produce a non-human transgenic cell, organism, animal, or plant. In some embodiments, the transgenic animal is a mammal, such as a mouse, rat, or rabbit. Methods for producing transgenic cells, organisms, plants, and animals are known in the art, and generally begin with a method of cell transformation or transfection, such as described herein.


Methods of Use


In the context of formation of an engineered nuclease complex, “target sequence” refers to a sequence to which a guide sequence is designed to have complementarity, where hybridization between a target sequence and a guide sequence promotes the formation of an engineered nuclease complex. A target sequence may comprise any polynucleotide, such as DNA, RNA, or a DNA-RNA hybrid. A target sequence can be located in the nucleus or cytoplasm of a cell. A target sequence can be located in vitro or in a cell-free environment.


Typically, formation of an engineered nuclease complex comprising a guide nucleic acid hybridized to a target sequence and complexed with one or more engineered nucleases as disclosed herein results in cleavage of one or both strands in or near (e.g. within 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 50, or more base pairs from) the target sequence. Cleavage can occur within a target sequence, 5′ of the target sequence, upstream of a target sequence, 3′ of the target sequence, or downstream of a target sequence.


In some embodiments, one or more vectors driving expression of one or more components of a targetable nuclease system are introduced into a host cell or in vitro such that a targetable nuclease complex forms at one or more target sites. For example, a nucleic acid-guided nuclease and a guide nucleic acid could each be operably linked to separate regulatory elements on separate vectors. Alternatively, two or more of the elements expressed from the same or different regulatory elements may be combined in a single vector, with one or more additional vectors providing any components of the targetable nuclease system not included in the first vector. Targetable nuclease system elements that are combined in a single vector may be arranged in any suitable orientation, such as one element located 5′ with respect to (“upstream” of) or 3′ with respect to (“downstream” of) a second element. The coding sequence of one element may be located on the same or opposite strand of the coding sequence of a second element, and oriented in the same or opposite direction. In some embodiments, a single promoter drives expression of a transcript encoding a nucleic acid-guided nuclease and one or more guide nucleic acids. In some embodiments, a nucleic acid-guided nuclease and one or more guide nucleic acids are operably linked to and expressed from the same promoter. In other embodiments, one or more guide nucleic acids or polynucleotides encoding the one or more guide nucleic acids are introduced into a cell or in vitro environment already comprising a nucleic acid-guided nuclease or polynucleotide sequence encoding the nucleic acid-guided nuclease.


When multiple different guide sequences are used, a single expression construct may be used to target nuclease activity to multiple different, corresponding target sequences within a cell or in vitro. For example, a single vector may comprise about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, or more guide sequences. In some embodiments, about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more such guide-sequence-containing vectors may be provided, and optionally delivered to a cell or in vitro.


Methods and compositions disclosed herein may comprise more than one guide nucleic acid, wherein each guide nucleic acid has a different guide sequence, thereby targeting a different target sequence. In such cases, multiple guide nucleic acids can be used in multiplexing, wherein multiple targets are targeted simultaneously. Additionally or alternatively, the multiple guide nucleic acids are introduced into a population of cells, such that each cell in the population receives a different or random guide nucleic acid, thereby targeting multiple different target sequences across a population of cells. In such cases, the collection of subsequently altered cells can be referred to as a library.


Methods and compositions disclosed herein may comprise multiple different nucleic acid-guided nucleases, each with one or more different corresponding guide nucleic acids, thereby allowing targeting of different target sequences by different nucleic acid-guided nucleases. In some such cases, each nucleic acid-guided nuclease can correspond to a distinct plurality of guide nucleic acids, allowing two or more non-overlapping, partially overlapping, or completely overlapping multiplexing events.


In some embodiments, the nucleic acid-guided nuclease has DNA cleavage activity or RNA cleavage activity. In some embodiments, the nucleic acid-guided nuclease directs cleavage of one or both strands at the location of a target sequence, such as within the target sequence and/or within the complement of the target sequence. In some embodiments, the nucleic acid-guided nuclease directs cleavage of one or both strands within about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 50, 100, 200, 500, or more base pairs from the first or last nucleotide of a target sequence.
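As a rough illustration of the distance criterion above, the following Python sketch checks whether a cleavage position lies within a given number of base pairs of the first or last nucleotide of a target sequence. The function name, the 0-based inclusive coordinate convention, and the example coordinates are assumptions made for illustration only and are not part of the disclosure.

```python
def cleavage_within(cut_pos, target_start, target_end, max_bp):
    """Return True if cut_pos is within max_bp of the first or last
    nucleotide of the target.

    Hypothetical helper: coordinates are 0-based and inclusive, and
    target_start/target_end mark the first and last target nucleotides.
    """
    if target_start > target_end:
        raise ValueError("target_start must not exceed target_end")
    dist_first = abs(cut_pos - target_start)
    dist_last = abs(cut_pos - target_end)
    return min(dist_first, dist_last) <= max_bp


# Illustrative coordinates only: a 20-nt target occupying positions 100..119.
near = cleavage_within(123, 100, 119, 5)    # cut 4 bp past the last nucleotide
far = cleavage_within(150, 100, 119, 20)    # cut 31 bp past the last nucleotide
print(near, far)
```

A cut inside the target is handled the same way, because the distance is measured to whichever end of the target is closer.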


In some embodiments, a nucleic acid-guided nuclease may form a component of an inducible system. The inducible nature of the system would allow for spatiotemporal control of gene editing or gene expression using a form of energy. The form of energy may include but is not limited to electromagnetic radiation, sound energy, chemical energy, light energy, temperature, and thermal energy. Examples of inducible systems include tetracycline inducible promoters (Tet-On or Tet-Off), small molecule two-hybrid transcription activation systems (FKBP, ABA, etc), or light inducible systems (Phytochrome, LOV domains, or cryptochrome). In one embodiment, the nucleic acid-guided nuclease may be a part of a Light Inducible Transcriptional Effector (LITE) to direct changes in transcriptional activity in a sequence-specific manner. The components of a light inducible system may include a nucleic acid-guided nuclease, a light-responsive cytochrome heterodimer (e.g. from Arabidopsis thaliana), and a transcriptional activation/repression domain. Further examples of inducible DNA binding proteins and methods for their use are provided in U.S. 61/736,465 and U.S. 61/721,283, each of which is hereby incorporated by reference in its entirety. An inducible system can be temperature inducible such that the system is turned on or off by increasing or decreasing the temperature. In some temperature inducible systems, increasing the temperature turns the system on. In some temperature inducible systems, increasing the temperature turns the system off.


In some aspects, the invention provides for methods of modifying a target sequence in vitro, or in a prokaryotic or eukaryotic cell, which may be in vivo, ex vivo, or in vitro. In some embodiments, the method comprises sampling a cell or population of cells such as prokaryotic cells, or those from a human or non-human animal or plant (including micro-algae), and modifying the cell or cells. Culturing may occur at any stage in vitro or ex vivo. The cell or cells may even be re-introduced into the host, such as a non-human animal or plant (including micro-algae). For re-introduced cells it is particularly preferred that the cells are stem cells.


In some embodiments, the method comprises allowing a targetable nuclease complex to bind to the target sequence to effect cleavage of said target sequence, thereby modifying the target sequence, wherein the targetable nuclease complex comprises a nucleic acid-guided nuclease complexed with a guide nucleic acid wherein the guide sequence of the guide nucleic acid is hybridized to a target sequence within a target polynucleotide.


In some aspects, the invention provides a method of modifying expression of a target polynucleotide in vitro or in a prokaryotic or eukaryotic cell. In some embodiments, the method comprises allowing a targetable nuclease complex to bind to a target sequence within the target polynucleotide such that said binding results in increased or decreased expression of said target polynucleotide; wherein the targetable nuclease complex comprises a nucleic acid-guided nuclease complexed with a guide nucleic acid, and wherein the guide sequence of the guide nucleic acid is hybridized to a target sequence within said target polynucleotide. Similar considerations apply as above for methods of modifying a target polynucleotide. In fact, these sampling, culturing and re-introduction options apply across the aspects of the present invention.


In some aspects, the invention provides kits containing any one or more of the elements disclosed in the above methods and compositions. Elements may be provided individually or in combinations, and may be provided in any suitable container, such as a vial, a bottle, or a tube. In some embodiments, the kit includes instructions in one or more languages, for example in more than one language.


In some embodiments, a kit comprises one or more reagents for use in a process utilizing one or more of the elements described herein. Reagents may be provided in any suitable container. For example, a kit may provide one or more reaction or storage buffers. Reagents may be provided in a form that is usable in a particular assay, or in a form that requires addition of one or more other components before use (e.g. in concentrate or lyophilized form). A buffer can be any buffer, including but not limited to a sodium carbonate buffer, a sodium bicarbonate buffer, a borate buffer, a Tris buffer, a MOPS buffer, a HEPES buffer, and combinations thereof. In some embodiments, the buffer is alkaline. In some embodiments, the buffer has a pH from about 7 to about 10. In some embodiments, the kit comprises one or more oligonucleotides corresponding to a guide sequence for insertion into a vector so as to operably link the guide sequence and a regulatory element. In some embodiments, the kit comprises an editing template.


In some aspects, the invention provides methods for using one or more elements of an engineered targetable nuclease system. A targetable nuclease complex of the disclosure provides an effective means for modifying a target sequence within a target polynucleotide. A targetable nuclease complex of the disclosure has a wide variety of utility including modifying (e.g., deleting, inserting, translocating, inactivating, activating) a target sequence in a multiplicity of cell types. As such, a targetable nuclease complex of the invention has a broad spectrum of applications in, e.g., biochemical pathway optimization, genome-wide studies, genome engineering, gene therapy, drug screening, disease diagnosis, and prognosis. An exemplary targetable nuclease complex comprises a nucleic acid-guided nuclease as disclosed herein complexed with a guide nucleic acid, wherein the guide sequence of the guide nucleic acid can hybridize to a target sequence within the target polynucleotide. A guide nucleic acid can comprise a guide sequence linked to a scaffold sequence. A scaffold sequence can comprise one or more sequence regions with a degree of complementarity such that together they form a secondary structure. In some cases, the one or more sequence regions are comprised or encoded on the same polynucleotide. In some cases, the one or more sequence regions are comprised or encoded on separate polynucleotides.


Provided herein are methods of cleaving a target polynucleotide. The method comprises cleaving a target polynucleotide using a targetable nuclease complex that binds to a target sequence within a target polynucleotide and effects cleavage of said target polynucleotide. Typically, the targetable nuclease complex of the invention, when introduced into a cell, creates a break (e.g., a single or a double strand break) in the target sequence. For example, the method can be used to cleave a target gene in a cell, or to replace a wildtype sequence with a modified sequence.


The break created by the targetable nuclease complex can be repaired by a repair process such as the error prone non-homologous end joining (NHEJ) pathway, the high fidelity homology-directed repair (HDR) pathway, or by recombination pathways. During these repair processes, an editing template can be introduced into the genome sequence. In some methods, the HDR or recombination process is used to modify a target sequence. For example, an editing template comprising a sequence to be integrated flanked by an upstream sequence and a downstream sequence is introduced into a cell. The upstream and downstream sequences share sequence similarity with either side of the site of integration in the chromosome, target vector, or target polynucleotide.
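The layout of such an editing template (upstream sequence, sequence to be integrated, downstream sequence) can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions: the function name, the 0-based site convention, the arm length parameter, and the toy sequences are hypothetical, and the arms are simply copied from the target so that they are 100% identical to the flanking sequence.

```python
def build_editing_template(target, site, insert, arm_len):
    """Assemble upstream arm + insert + downstream arm (hypothetical helper).

    target  : sequence containing the integration site
    site    : 0-based index; the insert goes between target[site-1] and target[site]
    insert  : sequence to be integrated
    arm_len : number of nucleotides taken on each side of the site
    """
    if not 0 <= site <= len(target):
        raise ValueError("site lies outside the target sequence")
    if arm_len < 0:
        raise ValueError("arm_len must be non-negative")
    upstream = target[max(0, site - arm_len):site]   # flank 5' of the site
    downstream = target[site:site + arm_len]         # flank 3' of the site
    return upstream + insert + downstream


# Toy example: integrate "TAG" at position 8 with 4-nt arms.
template = build_editing_template("AAAACCCCGGGGTTTT", 8, "TAG", 4)
print(template)  # CCCCTAGGGGG
```

In practice the arms would be far longer than in this toy example and need not be perfectly identical to the flanking sequence.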


An editing template can be DNA or RNA, e.g., a DNA plasmid, a bacterial artificial chromosome (BAC), a yeast artificial chromosome (YAC), a viral vector, a linear piece of DNA, a PCR fragment, oligonucleotide, synthetic polynucleotide, a naked nucleic acid, or a nucleic acid complexed with a delivery vehicle such as a liposome or poloxamer.


An editing template polynucleotide can comprise a sequence to be integrated (e.g., a mutated gene). A sequence for integration may be a sequence endogenous or exogenous to the cell. Examples of a sequence to be integrated include polynucleotides encoding a protein or a non-coding RNA (e.g., a microRNA). Thus, the sequence for integration may be operably linked to an appropriate control sequence or sequences. Alternatively, the sequence to be integrated may provide a regulatory function. The sequence to be integrated may be a mutated form or variant of an endogenous wildtype sequence. Alternatively, the sequence to be integrated may be a wildtype version of an endogenous mutated sequence. Additionally or alternatively, the sequence to be integrated may be a variant or mutated form of an endogenous mutated or variant sequence.


Upstream and downstream sequences in an editing template polynucleotide can be selected to promote recombination between the target polynucleotide of interest and the editing template polynucleotide. The upstream sequence can be a nucleic acid sequence having sequence similarity with the sequence upstream of the targeted site for integration. Similarly, the downstream sequence can be a nucleic acid sequence having similarity with the sequence downstream of the targeted site of integration. The upstream and downstream sequences in an editing template can have 75%, 80%, 85%, 90%, 95%, or 100% sequence identity with the targeted polynucleotide. Preferably, the upstream and downstream sequences in the editing template polynucleotide have about 95%, 96%, 97%, 98%, 99%, or 100% sequence identity with the targeted polynucleotide. In some methods, the upstream and downstream sequences in the editing template polynucleotide have about 99% or 100% sequence identity with the targeted polynucleotide.
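To make the identity thresholds above concrete, the following Python sketch computes percent identity between a homology arm and the corresponding flanking sequence. It is a simplification: it compares two equal-length sequences position by position without gaps, whereas identity is normally computed over an optimal alignment. The function name and the example sequences are assumptions for illustration.

```python
def percent_identity(arm, flank):
    """Ungapped, position-by-position percent identity (hypothetical helper).

    Both sequences must be non-empty and of equal length; no alignment
    is performed, so insertions or deletions are not handled.
    """
    if len(arm) != len(flank):
        raise ValueError("sequences must have equal length for this ungapped comparison")
    if not arm:
        raise ValueError("sequences must be non-empty")
    matches = sum(a == b for a, b in zip(arm.upper(), flank.upper()))
    return 100.0 * matches / len(arm)


# Toy 20-nt arm with a single mismatch at the last position.
identity = percent_identity("ACGTACGTACGTACGTACGT", "ACGTACGTACGTACGTACGA")
print(identity)  # 95.0
```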


An upstream or downstream sequence may comprise from about 20 bp to about 2500 bp, for example, about 50, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700, 1800, 1900, 2000, 2100, 2200, 2300, 2400, or 2500 bp. In some methods, the exemplary upstream or downstream sequence has about 15 bp to about 50 bp, about 30 bp to about 100 bp, about 200 bp to about 2000 bp, about 600 bp to about 1000 bp, or more particularly about 700 bp to about 1000 bp.


In some methods, the editing template polynucleotide may further comprise a marker. Such a marker may make it easy to screen for targeted integrations. Examples of suitable markers include restriction sites, fluorescent proteins, or selectable markers. The exogenous polynucleotide template of the invention can be constructed using recombinant techniques (see, for example, Green and Sambrook et al., 2014 and Ausubel et al., 2017).


In an exemplary method for modifying a target polynucleotide by integrating an editing template polynucleotide, a double-stranded break is introduced into the genome sequence by an engineered nuclease complex, and the break can be repaired via homologous recombination using an editing template such that the template is integrated into the target polynucleotide. The presence of a double-stranded break can increase the efficiency of integration of the editing template.


Disclosed herein are methods for modifying expression of a polynucleotide in a cell. Some methods comprise increasing or decreasing expression of a target polynucleotide by using a targetable nuclease complex that binds to the target polynucleotide.


In some methods, a target polynucleotide can be inactivated to effect the modification of the expression in a cell. For example, upon the binding of a targetable nuclease complex to a target sequence in a cell, the target polynucleotide is inactivated such that the sequence is not transcribed, the coded protein is not produced, or the sequence does not function as the wild-type sequence does. For example, a protein or microRNA coding sequence may be inactivated such that the protein is not produced.


In some methods, a control sequence can be inactivated such that it no longer functions as a regulatory sequence. As used herein, “regulatory sequence” can refer to any nucleic acid sequence that affects the transcription, translation, or accessibility of a nucleic acid sequence. Examples of regulatory sequences include a promoter, a transcription terminator, and an enhancer.


An inactivated target sequence may include a deletion mutation (i.e., deletion of one or more nucleotides), an insertion mutation (i.e., insertion of one or more nucleotides), or a nonsense mutation (i.e., substitution of a single nucleotide for another nucleotide such that a stop codon is introduced). In some methods, the inactivation of a target sequence results in “knockout” of the target sequence.
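The nonsense mutation case can be illustrated with a short Python sketch that tests whether a single-nucleotide substitution converts a sense codon into a stop codon. The sketch assumes the standard genetic code (stop codons TAA, TAG, TGA in the DNA alphabet) and an in-frame coding sequence that begins at the first position of a codon; the function name and the toy sequence are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code, DNA alphabet


def is_nonsense_substitution(cds, pos, new_base):
    """Return True if replacing cds[pos] with new_base turns a sense codon
    into a stop codon.

    Assumes cds is in frame starting at index 0 (hypothetical helper).
    """
    cds = cds.upper()
    new_base = new_base.upper()
    if new_base not in {"A", "C", "G", "T"}:
        raise ValueError("new_base must be one of A, C, G, T")
    if not 0 <= pos < len(cds) - len(cds) % 3:
        raise ValueError("pos must fall inside a complete codon")
    offset = pos % 3
    start = pos - offset
    old_codon = cds[start:start + 3]
    new_codon = old_codon[:offset] + new_base + old_codon[offset + 1:]
    return old_codon not in STOP_CODONS and new_codon in STOP_CODONS


# Toy coding sequence ATG TGG AAA: changing the third base of TGG to A gives TGA.
print(is_nonsense_substitution("ATGTGGAAA", 5, "A"))  # True
print(is_nonsense_substitution("ATGTGGAAA", 5, "C"))  # False (TGC is a sense codon)
```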


An altered expression of one or more target polynucleotides associated with a signaling biochemical pathway can be determined by assaying for a difference in the mRNA levels of the corresponding genes between the test model cell and a control cell, when they are contacted with a candidate agent. Alternatively, the differential expression of the sequences associated with a signaling biochemical pathway is determined by detecting a difference in the level of the encoded polypeptide or gene product.


To assay for an agent-induced alteration in the level of mRNA transcripts or corresponding polynucleotides, nucleic acid contained in a sample is first extracted according to standard methods in the art. For instance, mRNA can be isolated using various lytic enzymes or chemical solutions according to the procedures set forth in Green and Sambrook (2014), or extracted by nucleic-acid-binding resins following the accompanying instructions provided by the manufacturers. The mRNA contained in the extracted nucleic acid sample is then detected by amplification procedures or conventional hybridization assays (e.g. Northern blot analysis) according to methods widely known in the art or based on the methods exemplified herein.


For purposes of this invention, amplification means any method employing a primer and a polymerase capable of replicating a target sequence with reasonable fidelity. Amplification may be carried out by natural or recombinant DNA polymerases such as TaqGold™, T7 DNA polymerase, Klenow fragment of E. coli DNA polymerase, and reverse transcriptase. A preferred amplification method is PCR. In particular, the isolated RNA can be subjected to a reverse transcription assay that is coupled with a quantitative polymerase chain reaction (RT-PCR) in order to quantify the expression level of a sequence associated with a signaling biochemical pathway.


Detection of the gene expression level can be conducted in real time in an amplification assay. In one aspect, the amplified products can be directly visualized with fluorescent DNA-binding agents including but not limited to DNA intercalators and DNA groove binders. Because the amount of the intercalators incorporated into the double-stranded DNA molecules is typically proportional to the amount of the amplified DNA products, one can conveniently determine the amount of the amplified products by quantifying the fluorescence of the intercalated dye using conventional optical systems in the art. DNA-binding dyes suitable for this application include SYBR green, SYBR blue, DAPI, propidium iodide, Hoechst, SYBR gold, ethidium bromide, acridines, proflavine, acridine orange, acriflavine, fluorcoumanin, ellipticine, daunomycin, chloroquine, distamycin D, chromomycin, homidium, mithramycin, ruthenium polypyridyls, anthramycin, and the like.
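One common way to turn real-time fluorescence measurements into a relative expression level between a test cell and a control cell is the 2^-ΔΔCt calculation. That calculation is not described in this disclosure; it is shown here only as an illustrative sketch, and it assumes roughly 100% amplification efficiency for both the target and the reference amplicon. The function name, the use of a reference gene, and the threshold-cycle (Ct) values are hypothetical.

```python
def fold_change_ddct(ct_target_test, ct_ref_test, ct_target_ctrl, ct_ref_ctrl):
    """Relative expression of a target (test vs. control) by 2^-ddCt.

    Each argument is a threshold cycle (Ct) value. Assumes about 100%
    amplification efficiency, i.e. product doubles every cycle.
    """
    d_ct_test = ct_target_test - ct_ref_test   # normalize test sample to reference gene
    d_ct_ctrl = ct_target_ctrl - ct_ref_ctrl   # normalize control sample to reference gene
    dd_ct = d_ct_test - d_ct_ctrl
    return 2.0 ** (-dd_ct)


# Made-up Ct values: the target crosses threshold two cycles earlier in the
# test sample after normalization, i.e. about four-fold higher expression.
print(fold_change_ddct(22.0, 18.0, 24.0, 18.0))  # 4.0
```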


In another aspect, other fluorescent labels such as sequence specific probes can be employed in the amplification reaction to facilitate the detection and quantification of the amplified products. Probe-based quantitative amplification relies on the sequence-specific detection of a desired amplified product. It utilizes fluorescent, target-specific probes (e.g., TaqMan™ probes) resulting in increased specificity and sensitivity. Methods for performing probe-based quantitative amplification are well established in the art and are taught in U.S. Pat. No. 5,210,015.


In yet another aspect, conventional hybridization assays using hybridization probes that share sequence homology with sequences associated with a signaling biochemical pathway can be performed. Typically, probes are allowed to form stable complexes with the sequences associated with a signaling biochemical pathway contained within the biological sample derived from the test subject in a hybridization reaction. It will be appreciated by one of skill in the art that where antisense is used as the probe nucleic acid, the target polynucleotides provided in the sample are chosen to be complementary to sequences of the antisense nucleic acids. Conversely, where the nucleotide probe is a sense nucleic acid, the target polynucleotide is selected to be complementary to sequences of the sense nucleic acid.


Hybridization can be performed under conditions of various stringency, for instance as described herein. Suitable hybridization conditions for the practice of the present invention are such that the recognition interaction between the probe and sequences associated with a signaling biochemical pathway is both sufficiently specific and sufficiently stable. Conditions that increase the stringency of a hybridization reaction are widely known and published in the art. See, for example, Green and Sambrook et al. (2014) and Nonradioactive in Situ Hybridization Application Manual, Boehringer Mannheim, second edition. The hybridization assay can be performed using probes immobilized on any solid support, including but not limited to nitrocellulose, glass, silicon, and a variety of gene arrays. A preferred hybridization assay is conducted on high-density gene chips as described in U.S. Pat. No. 5,445,934.


For a convenient detection of the probe-target complexes formed during the hybridization assay, the nucleotide probes are conjugated to a detectable label. Detectable labels suitable for use in the present invention include any composition detectable by photochemical, biochemical, spectroscopic, immunochemical, electrical, optical or chemical means. A wide variety of appropriate detectable labels are known in the art, which include fluorescent or chemiluminescent labels, radioactive isotope labels, enzymatic or other ligands. In preferred embodiments, one will likely desire to employ a fluorescent label or an enzyme tag, such as digoxigenin, β-galactosidase, urease, alkaline phosphatase or peroxidase, avidin/biotin complex.


Detection methods used to detect or quantify the hybridization intensity will typically depend upon the label selected above. For example, radiolabels may be detected using photographic film or a phosphorimager. Fluorescent markers may be detected and quantified using a photodetector to detect emitted light. Enzymatic labels are typically detected by providing the enzyme with a substrate and measuring the reaction product produced by the action of the enzyme on the substrate; and finally colorimetric labels are detected by simply visualizing the colored label.


An agent-induced change in expression of sequences associated with a signaling biochemical pathway can also be determined by examining the corresponding gene products. Determining the protein level typically involves (a) contacting the protein contained in a biological sample with an agent that specifically binds to a protein associated with a signaling biochemical pathway; and (b) identifying any agent:protein complex so formed. In one aspect of this embodiment, the agent that specifically binds a protein associated with a signaling biochemical pathway is an antibody, preferably a monoclonal antibody.


The reaction can be performed by contacting the agent with a sample of the proteins associated with a signaling biochemical pathway derived from the test samples under conditions that will allow a complex to form between the agent and the proteins associated with a signaling biochemical pathway. The formation of the complex can be detected directly or indirectly according to standard procedures in the art. In the direct detection method, the agents are supplied with a detectable label and unreacted agents may be removed from the complex; the amount of remaining label thereby indicating the amount of complex formed. For such method, it is preferable to select labels that remain attached to the agents even during stringent washing conditions. It is preferable that the label does not interfere with the binding reaction. In the alternative, an indirect detection procedure may use an agent that contains a label introduced either chemically or enzymatically. A desirable label generally does not interfere with binding or the stability of the resulting agent:polypeptide complex. However, the label is typically designed to be accessible to an antibody for an effective binding and hence generating a detectable signal.


A wide variety of labels suitable for detecting protein levels are known in the art. Non-limiting examples include radioisotopes, enzymes, colloidal metals, fluorescent compounds, bioluminescent compounds, and chemiluminescent compounds.


The amount of agent:polypeptide complexes formed during the binding reaction can be quantified by standard quantitative assays. As illustrated above, the formation of agent:polypeptide complex can be measured directly by the amount of label remaining at the site of binding. In an alternative, the protein associated with a signaling biochemical pathway is tested for its ability to compete with a labeled analog for binding sites on the specific agent. In this competitive assay, the amount of label captured is inversely proportional to the amount of protein sequences associated with a signaling biochemical pathway present in a test sample.


A number of techniques for protein analysis based on the general principles outlined above are available in the art. They include but are not limited to radioimmunoassays, ELISA (enzyme-linked immunosorbent assays), “sandwich” immunoassays, immunoradiometric assays, in situ immunoassays (using e.g., colloidal gold, enzyme or radioisotope labels), western blot analysis, immunoprecipitation assays, immunofluorescent assays, and SDS-PAGE.


Antibodies that specifically recognize or bind to proteins associated with a signaling biochemical pathway are preferable for conducting the aforementioned protein analyses. Where desired, antibodies that recognize a specific type of post-translational modification (e.g., signaling biochemical pathway inducible modifications) can be used. Post-translational modifications include but are not limited to glycosylation, lipidation, acetylation, and phosphorylation. These antibodies may be purchased from commercial vendors. For example, anti-phosphotyrosine antibodies that specifically recognize tyrosine-phosphorylated proteins are available from a number of vendors including Invitrogen and Perkin Elmer. Anti-phosphotyrosine antibodies are particularly useful in detecting proteins that are differentially phosphorylated on their tyrosine residues in response to an ER stress. Such proteins include but are not limited to eukaryotic translation initiation factor 2 alpha (eIF-2α). Alternatively, these antibodies can be generated using conventional polyclonal or monoclonal antibody technologies by immunizing a host animal or an antibody-producing cell with a target protein that exhibits the desired post-translational modification.


In practicing a subject method, it may be desirable to discern the expression pattern of a protein associated with a signaling biochemical pathway in different bodily tissues, in different cell types, and/or in different subcellular structures. These studies can be performed with the use of tissue-specific, cell-specific or subcellular structure specific antibodies capable of binding to protein markers that are preferentially expressed in certain tissues, cell types, or subcellular structures.


An altered expression of a gene associated with a signaling biochemical pathway can also be determined by examining a change in activity of the gene product relative to a control cell. The assay for an agent-induced change in the activity of a protein associated with a signaling biochemical pathway will depend on the biological activity and/or the signal transduction pathway that is under investigation. For example, where the protein is a kinase, a change in its ability to phosphorylate the downstream substrate(s) can be determined by a variety of assays known in the art. Representative assays include but are not limited to immunoblotting and immunoprecipitation with antibodies such as anti-phosphotyrosine antibodies that recognize phosphorylated proteins. In addition, kinase activity can be detected by high throughput chemiluminescent assays such as AlphaScreen™ (available from Perkin Elmer) and eTag™ assay (Chan-Hui, et al. (2003) Clinical Immunology 111: 162-174).


Where the protein associated with a signaling biochemical pathway is part of a signaling cascade leading to a fluctuation of intracellular pH condition, pH sensitive molecules such as fluorescent pH dyes can be used as the reporter molecules. In another example where the protein associated with a signaling biochemical pathway is an ion channel, fluctuations in membrane potential and/or intracellular ion concentration can be monitored. A number of commercial kits and high-throughput devices are particularly suited for a rapid and robust screening for modulators of ion channels. Representative instruments include FLIPR™ (Molecular Devices, Inc.) and VIPR (Aurora Biosciences). These instruments are capable of detecting reactions in over 1000 sample wells of a microplate simultaneously, and providing real-time measurement and functional data within a second or even a millisecond.


In practicing any of the methods disclosed herein, a suitable vector can be introduced to a cell, tissue, organism, or an embryo via one or more methods known in the art, including without limitation, microinjection, electroporation, sonoporation, biolistics, calcium phosphate-mediated transfection, cationic transfection, liposome transfection, dendrimer transfection, heat shock transfection, nucleofection transfection, magnetofection, lipofection, impalefection, optical transfection, proprietary agent-enhanced uptake of nucleic acids, and delivery via liposomes, immunoliposomes, virosomes, or artificial virions. In some methods, the vector is introduced into an embryo by microinjection. The vector or vectors may be microinjected into the nucleus or the cytoplasm of the embryo. In some methods, the vector or vectors may be introduced into a cell by nucleofection.


A target polynucleotide of a targetable nuclease complex can be any polynucleotide endogenous or exogenous to the host cell. For example, the target polynucleotide can be a polynucleotide residing in the nucleus of a eukaryotic cell, the genome of a prokaryotic cell, or an extrachromosomal vector of a host cell. The target polynucleotide can be a sequence coding a gene product (e.g., a protein) or a non-coding sequence (e.g., a regulatory polynucleotide or junk DNA).


Examples of target polynucleotides include a sequence associated with a signaling biochemical pathway, e.g., a signaling biochemical pathway-associated gene or polynucleotide. Examples of target polynucleotides include a disease-associated gene or polynucleotide. A “disease-associated” gene or polynucleotide refers to any gene or polynucleotide which is yielding transcription or translation products at an abnormal level or in an abnormal form in cells derived from disease-affected tissues compared with tissues or cells of a non-disease control. It may be a gene that becomes expressed at an abnormally high level; it may be a gene that becomes expressed at an abnormally low level, where the altered expression correlates with the occurrence and/or progression of the disease. A disease-associated gene also refers to a gene possessing mutation(s) or genetic variation that is directly responsible for, or is in linkage disequilibrium with a gene(s) that is responsible for, the etiology of a disease. The transcribed or translated products may be known or unknown, and may be at a normal or abnormal level.


Embodiments of the invention also relate to methods and compositions related to knocking out genes, editing genes, altering genes, amplifying genes, and repairing particular mutations. Altering genes may also mean the epigenetic manipulation of a target sequence. This may be the chromatin state of a target sequence, such as by modification of the methylation state of the target sequence (i.e. addition or removal of methylation or methylation patterns or CpG islands), histone modification, increasing or reducing accessibility to the target sequence, or by promoting 3D folding. It will be appreciated that where reference is made to a method of modifying a cell, organism, or mammal including human or a non-human mammal or organism by manipulation of a target sequence in a genomic locus of interest, this may apply to the organism (or mammal) as a whole or just a single cell or population of cells from that organism (if the organism is multicellular). In the case of humans, for instance, Applicants envisage, inter alia, a single cell or a population of cells and these may preferably be modified ex vivo and then re-introduced. In this case, a biopsy or other tissue or biological fluid sample may be necessary. Stem cells are also particularly preferred in this regard. But, of course, in vivo embodiments are also envisaged. And the invention is especially advantageous as to HSCs.


The functionality of a targetable nuclease complex can be assessed by any suitable assay. For example, the components of a targetable nuclease system sufficient to form a targetable nuclease complex, including a guide nucleic acid and nucleic acid-guided nuclease, can be provided to a host cell having the corresponding target sequence, such as by transfection with vectors encoding the components of the engineered nuclease system, followed by an assessment of preferential cleavage within the target sequence. Similarly, cleavage of a target sequence may be evaluated in a test tube by providing the target sequence and components of a targetable nuclease complex. Other assays are possible, and will occur to those skilled in the art. A guide sequence can be selected to target any target sequence. In some embodiments, the target sequence is a sequence within a genome of a cell. Exemplary target sequences include those that are unique in the target genome.


Editing Cassette


Disclosed herein are compositions and methods for editing a target polynucleotide sequence. Such compositions include polynucleotides containing one or more components of a targetable nuclease system. Polynucleotide sequences for use in these methods can be referred to as editing cassettes.


An editing cassette can comprise one or more primer sites. Primer sites can be used to amplify an editing cassette by using oligonucleotide primers comprising reverse complementary sequences that can hybridize to the one or more primer sites. An editing cassette can comprise two or more primer sites. Sometimes, an editing cassette comprises a primer site on each end of the editing cassette, said primer sites flanking one or more of the other components of the editing cassette. Primer sites can be approximately 10, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26 or more nucleotides in length.
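
As a minimal sketch of the primer-site arrangement described above, the following Python derives a primer pair for a cassette that carries a primer site at each end. The function names, the toy sequences, and the default site length of 20 nucleotides are illustrative assumptions, not part of the disclosure.

```python
from typing import Tuple

# Translation table for complementing DNA bases (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def amplification_primers(cassette: str, site_len: int = 20) -> Tuple[str, str]:
    """Return (forward, reverse) primers for a cassette flanked by primer sites.

    Assumes (hypothetically) that the first and last `site_len` bases of the
    given strand are the primer sites. The forward primer matches the 5' site
    and so anneals to the opposite strand; the reverse primer is the reverse
    complement of the 3' site and so anneals to the given strand.
    """
    if len(cassette) < 2 * site_len:
        raise ValueError("cassette is shorter than two primer sites")
    forward = cassette[:site_len]
    reverse = reverse_complement(cassette[-site_len:])
    return forward, reverse
```

For a toy cassette "ACGTTTTTTTTTGGCA" with 4-nucleotide sites, the pair is "ACGT" and "TGCC".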


An editing cassette can comprise an editing template as disclosed herein. An editing cassette can comprise an editing sequence. An editing sequence can be homologous to a target sequence. An editing sequence can comprise at least one mutation relative to a target sequence. An editing sequence often comprises homology regions (or homology arms) flanking at least one mutation relative to a target sequence, such that the flanking homology regions facilitate homologous recombination of the editing sequence into a target sequence. An editing sequence can comprise an editing template as disclosed herein. For example, the editing sequence can comprise at least one mutation relative to a target sequence including one or more PAM mutations that mutate or delete a PAM site. An editing sequence can comprise one or more mutations in a codon or non-coding sequence relative to a non-editing target site.


A PAM mutation can be a silent mutation. A silent mutation can be a change to at least one nucleotide of a codon relative to the original codon that does not change the amino acid encoded by the original codon. A silent mutation can be a change to a nucleotide within a non-coding region, such as an intron, 5′ untranslated region, 3′ untranslated region, or other non-coding region.


A PAM mutation can be a non-silent mutation. Non-silent mutations can include a missense mutation. A missense mutation can be a change to at least one nucleotide of a codon relative to the original codon that changes the amino acid encoded by the original codon. Missense mutations can occur within an exon, open reading frame, or other coding region.
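
The distinction between silent and missense codon changes can be sketched in a few lines of Python. This example assumes the standard genetic code; the function name and the category labels are illustrative, and changes that create or remove a stop codon are simply reported as "other non-silent" because the text above names only missense mutations.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def classify_codon_change(original: str, edited: str) -> str:
    """Classify a codon edit as unchanged, silent, missense, or other non-silent."""
    original, edited = original.upper(), edited.upper()
    if original not in CODON_TABLE or edited not in CODON_TABLE:
        raise ValueError("codons must be three DNA bases (A, C, G, T)")
    if original == edited:
        return "unchanged"
    before, after = CODON_TABLE[original], CODON_TABLE[edited]
    if before == after:
        return "silent"            # same amino acid encoded
    if "*" in (before, after):
        return "other non-silent"  # stop codon gained or lost
    return "missense"              # different amino acid encoded
```

For instance, GGT to GGC leaves glycine unchanged and is silent, whereas GAA to GTA replaces glutamate with valine and is missense.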


An editing sequence can comprise at least one mutation relative to a target sequence. A mutation can be a silent mutation or non-silent mutation, such as a missense mutation. A mutation can include an insertion of one or more nucleotides or base pairs. A mutation can include a deletion of one or more nucleotides or base pairs. A mutation can include a substitution of one or more nucleotides or base pairs for a different one or more nucleotides or base pairs. Inserted or substituted sequences can include exogenous or heterologous sequences.


An editing cassette can comprise a polynucleotide encoding a guide nucleic acid sequence. In some cases, the guide nucleic acid sequence is optionally operably linked to a promoter. A guide nucleic acid sequence can comprise a scaffold sequence and a guide sequence as described herein.


An editing cassette can comprise a barcode. A barcode can be a unique DNA sequence that corresponds to the editing sequence such that the barcode can identify the one or more mutations of the corresponding editing sequence. In some examples, the barcode is 15 nucleotides. The barcode can comprise less than 10, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 88, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, or more than 200 nucleotides. A barcode can be a non-naturally occurring sequence. An editing cassette comprising a barcode can be a non-naturally occurring sequence.
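
One way such barcodes might be produced in silico is sketched below: a set of unique random 15-nucleotide sequences. The function name, the seeded random generator, and the uniform base choice are assumptions made for illustration; a practical design would likely add further filters (for example on homopolymer runs or similarity between barcodes), which are not described here.

```python
import random
from typing import List


def generate_barcodes(count: int, length: int = 15, seed: int = 0) -> List[str]:
    """Return `count` distinct random DNA barcodes of the given length."""
    if count > 4 ** length:
        raise ValueError("more barcodes requested than distinct sequences exist")
    rng = random.Random(seed)  # seeded so the same library can be regenerated
    barcodes = set()
    while len(barcodes) < count:
        barcodes.add("".join(rng.choice("ACGT") for _ in range(length)))
    return sorted(barcodes)
```

Each barcode can then be paired with one editing sequence so that sequencing the barcode identifies the corresponding mutation.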


An editing cassette can comprise one or more of an editing sequence and a polynucleotide encoding a guide nucleic acid optionally operably linked to a promoter, wherein the editing cassette and guide nucleic acid sequence are flanked by primer sites. An editing cassette can further comprise a barcode.


An example of an editing cassette is depicted in FIG. 3. Each editing cassette can be designed to edit a site in a target sequence. Sites to be targeted can be coding regions, non-coding regions, functionally neutral sites, or they can be a screenable or selectable marker gene. Homology regions within the editing sequence flank the one or more mutations of the editing cassette and can be inserted into the target sequence by recombination. Recombination can comprise DNA cleavage, such as by a nucleic acid-guided nuclease, and repair via homologous recombination.
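
The cassette components described in this section can be modeled as a simple record. The sketch below is illustrative only: the class name, field names, and in particular the order in which components are concatenated are assumptions, and are not taken from FIG. 3.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EditingCassette:
    """Hypothetical container for the parts of an editing cassette."""
    primer_site_5: str      # 5' primer site used for amplification
    editing_sequence: str   # homology regions flanking the mutation(s)
    guide_encoding: str     # sequence encoding the guide nucleic acid
    barcode: str            # optional trackable sequence ("" if absent)
    primer_site_3: str      # 3' primer site used for amplification

    def sequence(self) -> str:
        """Concatenate the parts in one assumed 5'-to-3' order."""
        return "".join([
            self.primer_site_5,
            self.editing_sequence,
            self.guide_encoding,
            self.barcode,
            self.primer_site_3,
        ])
```

Keeping the parts as separate fields lets the primer sites stay constant across a library while the editing sequence, guide, and barcode vary per cassette.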


Editing cassettes can be generated by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof.


Trackable sequences, such as barcodes or recorder sequences, can be designed in silico via standard code with a degenerate mutation at the target codon. The degenerate mutation can comprise 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, or more than 30 nucleic acid residues. In some examples, the degenerate mutations can comprise 15 nucleic acid residues (N15).


Homology arms can be added to an editing sequence to allow incorporation of the editing sequence into the desired location via homologous recombination or homology-driven repair. Homology arms can be added by synthesis, in vitro assembly, PCR, or other known methods in the art. For example, homology arms can be added by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof. A homology arm can be added to both ends of a barcode, recorder sequence, and/or editing sequence, thereby flanking the sequence with two distinct homology arms, for example, a 5′ homology arm and a 3′ homology arm.
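
As an in silico illustration of flanking an edit with a 5′ and a 3′ homology arm, the sketch below takes the arms directly from the sequence on either side of the edited region. The function name, the 0-based half-open coordinates, and the default arm length of 50 nucleotides are assumptions for this example only.

```python
def build_editing_sequence(target: str, start: int, end: int,
                           replacement: str, arm_len: int = 50) -> str:
    """Return 5' arm + replacement + 3' arm for an edit of target[start:end].

    An insertion uses start == end; a deletion uses replacement == "".
    """
    if not 0 <= start <= end <= len(target):
        raise ValueError("edit coordinates fall outside the target")
    if start < arm_len or len(target) - end < arm_len:
        raise ValueError("not enough flanking sequence for the homology arms")
    five_prime_arm = target[start - arm_len:start]  # homologous to upstream sequence
    three_prime_arm = target[end:end + arm_len]     # homologous to downstream sequence
    return five_prime_arm + replacement + three_prime_arm
```

With the toy target "AAAAACCCCCGGGGGTTTTT", replacing positions 10 to 13 with "TTT" and using 5-nucleotide arms gives "CCCCCTTTGGTTT".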


A homology arm can comprise sequence homologous to a target sequence. A homology arm can comprise sequence homologous to sequence adjacent to a target sequence. A homology arm can comprise sequence homologous to sequence upstream or downstream of a target sequence. A homology arm can comprise sequence homologous to sequence within the same gene or open reading frame as a target sequence. A homology arm can comprise sequence homologous to sequence upstream or downstream of a gene or open reading frame the target sequence is within. A homology arm can comprise sequence homologous to a 5′ UTR or 3′ UTR of a gene or open reading frame within which is a target sequence. A homology arm can comprise sequence homologous to a different gene, open reading frame, promoter, terminator, or nucleic acid sequence than that which the target sequence is within.


The same 5′ and 3′ homology arms can be added to a plurality of distinct editing sequences, thereby generating a library of unique editing sequences that each have the same targeted insertion site. The same 5′ and 3′ homology arms can be added to a plurality of distinct editing templates, thereby generating a library of unique editing templates that each have the same targeted insertion site. In alternative examples, different or a variety of 5′ or 3′ homology arms can be added to a plurality of editing sequences or editing templates.


A barcode library or recorder sequence library comprising flanking homology arms can be cloned into a vector backbone. In some examples, the barcode comprising flanking homology arms is cloned into an editing cassette. Cloning can occur by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof.


An editing sequence library comprising flanking homology arms can be cloned into a vector backbone. In some examples, the editing sequence and homology arms are cloned into an editing cassette. Editing cassettes can, in some cases, further comprise a nucleic acid sequence encoding a guide nucleic acid or gRNA engineered to target the desired site of editing sequence insertion, e.g. the target sequence. Editing cassettes can, in some cases, further comprise a barcode or recorder sequence. Cloning can occur by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof.


Gene-wide or genome-wide editing libraries can be cloned into a vector backbone. A barcode or recorder sequence library can be inserted or assembled into a second site to generate competent trackable plasmids that can embed the recording barcode at a fixed locus while integrating the editing libraries at a wide variety of user defined sites. Cloning can occur by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof.


A guide nucleic acid or sequence encoding the same can be assembled or inserted into a vector backbone first, followed by insertion of an editing sequence and/or cassette. In other cases, an editing sequence and/or cassette can be inserted or assembled into a vector backbone first, followed by insertion of a guide nucleic acid or sequence encoding the same. In other cases, a guide nucleic acid or sequence encoding the same and an editing sequence and/or cassette are simultaneously inserted or assembled into a vector. A recorder sequence or barcode can be inserted before or after any of these steps. In other words, it should be understood that there are many possible permutations to the order in which elements of the disclosure are assembled. The vector can be linear or circular and can be generated by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof.


A nucleic acid molecule can be synthesized which comprises one or more elements disclosed herein. A nucleic acid molecule can be synthesized that comprises an editing cassette. A nucleic acid molecule can be synthesized that comprises a guide nucleic acid. A nucleic acid molecule can be synthesized that comprises a recorder cassette. A nucleic acid molecule can be synthesized that comprises a barcode. A nucleic acid molecule can be synthesized that comprises a homology arm. A nucleic acid molecule can be synthesized that comprises an editing cassette and a guide nucleic acid. A nucleic acid molecule can be synthesized that comprises an editing cassette and a barcode. A nucleic acid molecule can be synthesized that comprises an editing cassette, a guide nucleic acid, and a recorder cassette. A nucleic acid molecule can be synthesized that comprises an editing cassette, a recorder cassette, and two guide nucleic acids. A nucleic acid molecule can be synthesized that comprises a recorder cassette and a guide nucleic acid. In any of these cases, the guide nucleic acid can optionally be operably linked to a promoter. In any of these cases, the nucleic acid molecule can further include one or more barcodes.


Synthesis can occur by any nucleic acid synthesis method known in the art. Synthesis can occur by enzymatic nucleic acid synthesis. Synthesis can occur by chemical synthesis. Synthesis can occur by array-based synthesis. Synthesis can occur by solid-phase synthesis or phosphoramidite methods. Synthesis can occur by column or multi-well methods. Synthesized nucleic acid molecules can be non-naturally occurring nucleic acid molecules.


Software and automation methods can be used for multiplex synthesis and generation. For example, software and automation can be used to create 10, 10², 10³, 10⁴, 10⁵, 10⁶, or more synthesized polynucleotides, cassettes, or plasmids. An automation method can generate desired sequences and libraries in rapid fashion that can be processed through a workflow with minimal steps to produce precisely defined libraries, such as gene-wide or genome-wide editing libraries.


Polynucleotides or libraries can be generated which comprise two or more nucleic acid molecules or plasmids comprising any combination disclosed herein of recorder sequence, editing sequence, guide nucleic acid, and optional barcode, including combinations of one or more of any of the previously mentioned elements. For example, such a library can comprise at least 2, 3, 4, 5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 200, 300, 400, 500, 600, 700, 800, 1000, 1500, 2000, 2500, 3000, 3500, 4000, 4500, 5000, 5500, 6000, 6500, 7000, 7500, 8000, 8500, 9000, 9500, 10⁴, 10⁵, 10⁶, 10⁷, 10⁸, 10⁹, 10¹⁰, or more nucleic acid molecules or plasmids of the present disclosure. It should be understood that such a library can include any number of nucleic acid molecules or plasmids, even if the specific number is not explicitly listed above.


Trackable plasmid libraries or nucleic acid molecule libraries can be sequenced in order to determine the recorder sequence and editing sequence pair that is comprised on each trackable plasmid. In other cases, a known recorder sequence is paired with a known editing sequence during the library generation process. Other methods of determining the association between a recorder sequence and editing sequence comprised on a common nucleic acid molecule or plasmid are envisioned such that the editing sequence can be identified by identification or sequencing of the recorder sequence.
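
The recorder-to-edit association described above amounts to a lookup table. The following sketch builds one from (recorder sequence, editing sequence) pairs, such as might be obtained by sequencing a library; the function name and the input format are hypothetical, and the check that each recorder maps to a single editing sequence is an assumption about how such a table would be validated.

```python
from typing import Dict, Iterable, Tuple


def build_recorder_map(pairs: Iterable[Tuple[str, str]]) -> Dict[str, str]:
    """Map each recorder sequence to the editing sequence on the same plasmid."""
    mapping: Dict[str, str] = {}
    for recorder, edit in pairs:
        previous = mapping.setdefault(recorder, edit)
        if previous != edit:
            # A recorder seen with two different edits cannot identify either one.
            raise ValueError(
                f"recorder {recorder!r} is paired with more than one editing sequence"
            )
    return mapping
```

Once the table exists, sequencing only the recorder is enough to look up which editing sequence a given plasmid or cell carries.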


Methods and compositions for tracking edited episomal libraries that are shuttled between E. coli and other organisms/cell lines are provided herein. The libraries can be comprised on plasmids, bacterial artificial chromosomes (BACs), yeast artificial chromosomes (YACs), synthetic chromosomes, or viral or phage genomes. These methods and compositions can be used to generate portable barcoded libraries in host organisms, such as E. coli. Library generation in such organisms can offer the advantage of established techniques for performing homologous recombination. Barcoded plasmid libraries can be deep-sequenced at one site to track mutational diversity targeted across the remaining portions of the plasmid, allowing dramatic improvements in the depth of library coverage.
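
Sequencing a single barcode site reduces tracking to counting barcodes in reads. The sketch below extracts a fixed-length barcode located between two constant flanking sequences and tallies it; the flank sequences, the exact-match requirement, and the function name are assumptions for illustration, and real pipelines typically also handle sequencing errors and reverse-strand reads.

```python
from collections import Counter
from typing import Iterable


def count_barcodes(reads: Iterable[str], left_flank: str, right_flank: str,
                   barcode_len: int = 15) -> Counter:
    """Count barcodes found between exact-match flanks in each read."""
    counts: Counter = Counter()
    for read in reads:
        pos = read.find(left_flank)
        if pos == -1:
            continue  # read does not cover the barcode site
        start = pos + len(left_flank)
        barcode = read[start:start + barcode_len]
        # Accept the barcode only if it is full length and followed by the right flank.
        if len(barcode) == barcode_len and read.startswith(right_flank, start + barcode_len):
            counts[barcode] += 1
    return counts
```

The resulting counts give the relative abundance of each barcode, and through the barcode the abundance of the edit it is paired with.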


Any nucleic acid molecule disclosed herein can be an isolated nucleic acid. Isolated nucleic acids may be made by any method known in the art, for example using standard recombinant methods, assembly methods, synthesis techniques, or combinations thereof. In some embodiments, the nucleic acids may be cloned, amplified, assembled, or otherwise constructed.


Isolated nucleic acids may be obtained from cellular, bacterial, or other sources using any number of cloning methodologies known in the art. In some embodiments, oligonucleotide probes which selectively hybridize, under stringent conditions, to other oligonucleotides or to the nucleic acids of an organism or cell can be used to isolate or identify an isolated nucleic acid.


Cellular genomic DNA, RNA, or cDNA may be screened for the presence of an identified genetic element of interest using a probe based upon one or more sequences. Various degrees of stringency of hybridization may be employed in the assay.


High stringency conditions for nucleic acid hybridization are well known in the art. For example, conditions may comprise low salt and/or high temperature conditions, such as provided by about 0.02 M to about 0.15 M NaCl at temperatures of about 50° C. to about 70° C. It is understood that the temperature and ionic strength of a desired stringency are determined in part by the length of the particular nucleic acid(s), the length and nucleotide content of the target sequence(s), the charge composition of the nucleic acid(s), and by the presence or concentration of formamide, tetramethylammonium chloride or other solvent(s) in a hybridization mixture. Nucleic acids may be completely complementary to a target sequence or may exhibit one or more mismatches.
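
To illustrate how length and nucleotide content bear on hybridization temperature, the sketch below applies the Wallace rule, a common rule of thumb for short oligonucleotides (2 °C per A or T, 4 °C per G or C). This rule is not part of the disclosure; it ignores salt and formamide concentration, which the paragraph above notes also matter, so the result is only a rough estimate.

```python
def wallace_tm(oligo: str) -> int:
    """Rough melting temperature (degrees C) of a short oligo by the Wallace rule."""
    seq = oligo.upper()
    if not seq or set(seq) - set("ACGT"):
        raise ValueError("oligo must be a non-empty DNA sequence")
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc
```

Under this estimate a GC-rich probe needs a higher temperature than an AT-rich probe of the same length to reach comparable stringency.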


Nucleic acids of interest may also be amplified using a variety of known amplification techniques. For instance, polymerase chain reaction (PCR) technology may be used to amplify target sequences directly from DNA, RNA, or cDNA. PCR and other in vitro amplification methods may also be useful, for example, to clone nucleic acid sequences, to make nucleic acids to use as probes for detecting the presence of a target nucleic acid in samples, for nucleic acid sequencing, or for other purposes.


Isolated nucleic acids may be prepared by direct chemical synthesis by methods such as the phosphotriester method, or using an automated synthesizer. Chemical synthesis generally produces a single stranded oligonucleotide. This may be converted into double stranded DNA by hybridization with a complementary sequence or by polymerization with a DNA polymerase using the single strand as a template.


Recorder


In some examples, two editing cassettes can be used together to track a genetic engineering step. For example, one editing cassette can comprise an editing template and an encoded guide nucleic acid, and a second editing cassette, referred to as a recorder cassette, can comprise an editing template comprising a recorder sequence and an encoded guide nucleic acid which has a distinct guide sequence compared to that of the first editing cassette. In such cases, the editing sequence and the recorder sequence can be inserted into separate target sequences and determined by their corresponding guide nucleic acids. A recorder sequence can comprise a barcode, trackable or traceable sequence, and/or a regulatory element operable with a screenable or selectable marker.


Through a multiplexed cloning approach, the recorder cassette can be covalently coupled to at least one editing cassette in a plasmid (e.g., FIG. 17A, green cassette) to generate plasmid libraries that have a unique recorder and editing cassette combination. This library can be sequenced to generate the recorder/edit mapping and used to track editing libraries across large segments of the target DNA (e.g., FIG. 17C). Recorder and editing sequences can be comprised on the same cassette, in which case they are both incorporated into the target nucleic acid sequence, such as a genome or plasmid, by the same recombination event. In other examples, the recorder and editing sequences can be comprised on separate cassettes within the same plasmid, in which case the recorder and editing sequences are incorporated into the target nucleic acid sequence by separate recombination events, either simultaneously or sequentially.


Methods are provided herein for combining multiplex oligonucleotide synthesis with recombineering, to create libraries of specifically designed and trackable mutations. Screens and/or selections followed by high-throughput sequencing and/or barcode microarray methods can allow for rapid mapping of mutations leading to a phenotype of interest.


Methods and compositions disclosed herein can be used to simultaneously engineer and track engineering events in a target nucleic acid sequence.


Such plasmids can be generated using in vitro assembly or cloning techniques. For example, the plasmids can be generated using chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, other in vitro oligo assembly techniques, traditional ligation-based cloning, or any combination thereof.


Such plasmids can comprise at least one recording sequence, such as a barcode, and at least one editing sequence. In most cases, the recording sequence is used to record and track engineering events. Each editing sequence can be used to incorporate a desired edit into a target nucleic acid sequence. The desired edit can include insertion, deletion, substitution, or alteration of the target nucleic acid sequence. In some examples, the one or more recording sequence and editing sequences are comprised on a single cassette comprised within the plasmid such that they are incorporated into the target nucleic acid sequence by the same engineering event. In other examples, the recording and editing sequences are comprised on separate cassettes within the plasmid such that they are each incorporated into the target nucleic acid by distinct engineering events. In some examples, the plasmid comprises two or more editing sequences. For example, one editing sequence can be used to alter or silence a PAM sequence while a second editing sequence can be used to incorporate a mutation into a distinct sequence.


Recorder sequences can be inserted into a site separated from the editing sequence insertion site. The inserted recorder sequence can be separated from the editing sequence by 1 bp to 1 Mbp. For example, the separation distance can be about 1 bp, 10 bp, 50 bp, 100 bp, 500 bp, 1 kb, 2 kb, 5 kb, 10 kb, or greater. The separation distance can be any discrete integer between 1 bp and 10 Mbp. In some examples, the maximum distance of separation depends on the size of the target nucleic acid or genome.


Recorder sequences can be inserted adjacent to editing sequences, or within proximity to the editing sequence. For example, the recorder sequence can be inserted outside of the open reading frame within which the editing sequence is inserted. The recorder sequence can be inserted into an untranslated region adjacent to an open reading frame within which an editing sequence has been inserted. The recorder sequence can be inserted into a functionally neutral or non-functional site. The recorder sequence can be inserted into a screenable or selectable marker gene.


In some examples, the target nucleic acid sequence is comprised within a genome, artificial chromosome, synthetic chromosome, or episomal plasmid. In various examples, the target nucleic acid sequence can be in vitro or in vivo. When the target nucleic acid sequence is in vivo, the plasmid can be introduced into the host organisms by transformation, transfection, conjugation, biolistics, nanoparticles, cell-permeable technologies, or other known methods for DNA delivery, or any combination thereof. In such examples, the host organism can be a eukaryote, prokaryote, bacterium, archaea, yeast, or other fungi.


The engineering event can comprise recombineering, non-homologous end joining, homologous recombination, or homology-driven repair. In some examples, the engineering event is performed in vitro or in vivo.


The methods described herein can be carried out in any type of cell in which a targetable nuclease system can function (e.g., target and cleave DNA), including prokaryotic and eukaryotic cells. In some embodiments the cell is a bacterial cell, such as Escherichia spp. (e.g., E. coli). In other embodiments, the cell is a fungal cell, such as a yeast cell, e.g., Saccharomyces spp. In other embodiments, the cell is an algal cell, a plant cell, an insect cell, or a mammalian cell, including a human cell.


In some examples, the cell is a recombinant organism. For example, the cell can comprise a non-native targetable nuclease system. Additionally or alternatively, the cell can comprise recombination system machinery. Such recombination systems can include lambda red recombination system, Cre/Lox, attB/attP, or other integrase systems. Where appropriate, the plasmid can have the complementary components or machinery required for the selected recombination system to work correctly and efficiently.


A method for genome editing can comprise: (a) introducing a vector that encodes at least one editing cassette and at least one guide nucleic acid into a first population of cells, thereby producing a second population of cells comprising the vector; (b) maintaining the second population of cells under conditions in which a nucleic acid-guided nuclease is expressed or maintained, wherein the nucleic acid-guided nuclease is encoded on the vector, a second vector, on the genome of cells of the second population of cells, or otherwise introduced into the cell, resulting in DNA cleavage and incorporation of the editing cassette; (c) obtaining viable cells; and (d) sequencing the target DNA molecule in at least one cell of the second population of cells to identify the mutation of at least one codon.


A method for genome editing can comprise: (a) introducing a vector that encodes at least one editing cassette comprising a PAM mutation as disclosed herein and at least one guide nucleic acid into a first population of cells, thereby producing a second population of cells comprising the vector; (b) maintaining the second population of cells under conditions in which a nucleic acid-guided nuclease is expressed or maintained, wherein the nucleic acid-guided nuclease is encoded on the vector, a second vector, on the genome of cells of the second population of cells, or otherwise introduced into the cell, resulting in DNA cleavage, incorporation of the editing cassette, and death of cells of the second population of cells that do not comprise the PAM mutation, whereas cells of the second population of cells that comprise the PAM mutation are viable; (c) obtaining viable cells; and (d) sequencing the target DNA in at least one cell of the second population of cells to identify the mutation of at least one codon.
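The PAM-based selection in step (b) can be illustrated with a minimal sketch: cells whose target site still carries a PAM recognized by the nuclease are cleaved and lost, while cells that incorporated the PAM mutation survive. The PAM strings, cell records, and function name below are hypothetical and do not correspond to any particular nuclease disclosed herein.

```python
def select_viable(cells, recognized_pam):
    """Keep cells whose target-site PAM is no longer recognized by the nuclease.

    Simplifying assumption: cleavage of an unmodified target site is always
    lethal, and a mutated PAM always prevents cleavage.
    """
    return [cell for cell in cells if cell["pam"] != recognized_pam]


# Hypothetical population after introduction of the editing cassette.
population = [
    {"id": "cell_1", "pam": "TTTG"},  # PAM unchanged, target still cleavable
    {"id": "cell_2", "pam": "TCTG"},  # PAM mutated by the editing cassette
    {"id": "cell_3", "pam": "TTTG"},
]

viable = select_viable(population, recognized_pam="TTTG")
print([cell["id"] for cell in viable])
```

Because the PAM mutation is delivered on the same editing cassette as the desired edit, the surviving cells are expected to be enriched for that edit.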


A method for trackable genome editing can comprise: (a) introducing a vector that encodes at least one editing cassette, at least one recorder cassette, and at least two guide nucleic acids into a first population of cells, thereby producing a second population of cells comprising the vector; (b) maintaining the second population of cells under conditions in which a nucleic acid-guided nuclease is expressed or maintained, wherein the nucleic acid-guided nuclease is encoded on the vector, a second vector, on the genome of cells of the second population of cells, or otherwise introduced into the cell, resulting in DNA cleavage and incorporation of the editing and recorder cassettes; (c) obtaining viable cells; and (d) sequencing the recorder sequence of the target DNA molecule in at least one cell of the second population of cells to identify the mutation of at least one codon.


In some examples where the plasmid comprises a second editing sequence designed to silence a PAM, a method for trackable genome editing can comprise: (a) introducing a vector that encodes at least one editing cassette, a recorder cassette, and at least two guide nucleic acids into a first population of cells, thereby producing a second population of cells comprising the vector; (b) maintaining the second population of cells under conditions in which a nucleic acid-guided nuclease is expressed or maintained, wherein the nucleic acid-guided nuclease is encoded on the vector, a second vector, on the genome of cells of the second population of cells, or otherwise introduced into the cell, resulting in DNA cleavage, incorporation of the editing and recorder cassettes, and death of cells of the second population of cells that do not comprise the PAM mutation, whereas cells of the second population of cells that comprise the PAM mutation are viable; (c) obtaining viable cells; and (d) sequencing the recorder sequence of the target DNA in at least one cell of the second population of cells to identify the mutation of at least one codon.


In some examples, transformation efficiency is determined by using a non-targeting control guide nucleic acid, which allows for validation of the recombineering procedure and CFU/ng calculations. In some cases, absolute efficiency is obtained by counting the total number of colonies on each transformation plate, for example, by counting both red and white colonies from a galK control. In some examples, relative efficiency is calculated as the number of successful transformants (for example, white colonies) out of all colonies from a control (for example, a galK control).
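The efficiency calculations described above can be sketched as follows. This is an illustrative example only: the colony counts, DNA mass, plated fraction, and function names are hypothetical, and the CFU/ng formula assumes that the colony count is corrected for the fraction of the transformation that was plated.

```python
def cfu_per_ng(colony_count, dna_ng, plated_fraction=1.0):
    """Colony-forming units per ng of DNA, corrected for the fraction plated."""
    if dna_ng <= 0 or not 0 < plated_fraction <= 1:
        raise ValueError("dna_ng must be > 0 and plated_fraction in (0, 1]")
    return colony_count / (dna_ng * plated_fraction)


def absolute_efficiency(white, red):
    """Total colonies on the plate (both red and white)."""
    return white + red


def relative_efficiency(white, red):
    """Fraction of colonies that are successful transformants (white)."""
    total = white + red
    if total == 0:
        raise ValueError("no colonies counted")
    return white / total


# Hypothetical galK control plate: 90 white and 10 red colonies,
# from 10 ng of DNA with half of the transformation plated.
print(cfu_per_ng(absolute_efficiency(90, 10), dna_ng=10, plated_fraction=0.5))
print(relative_efficiency(90, 10))
```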


The methods of the disclosure can provide, for example, greater than 1000× improvements in the efficiency, scale, cost of generating a combinatorial library, and/or precision of such library generation.


The methods of the disclosure can provide, for example, greater than: 10×, 50×, 100×, 200×, 300×, 400×, 500×, 600×, 700×, 800×, 900×, 1000×, 1100×, 1200×, 1300×, 1400×, 1500×, 1600×, 1700×, 1800×, 1900×, 2000×, or greater improvements in the efficiency of generating genomic or combinatorial libraries.


The methods of the disclosure can provide, for example, greater than: 10×, 50×, 100×, 200×, 300×, 400×, 500×, 600×, 700×, 800×, 900×, 1000×, 1100×, 1200×, 1300×, 1400×, 1500×, 1600×, 1700×, 1800×, 1900×, 2000×, or greater improvements in the scale of generating genomic or combinatorial libraries.


The methods of the disclosure can provide, for example, greater than: 10×, 50×, 100×, 200×, 300×, 400×, 500×, 600×, 700×, 800×, 900×, 1000×, 1100×, 1200×, 1300×, 1400×, 1500×, 1600×, 1700×, 1800×, 1900×, 2000×, or greater decrease in the cost of generating genomic or combinatorial libraries.


The methods of the disclosure can provide, for example, greater than: 10×, 50×, 100×, 200×, 300×, 400×, 500×, 600×, 700×, 800×, 900×, 1000×, 1100×, 1200×, 1300×, 1400×, 1500×, 1600×, 1700×, 1800×, 1900×, 2000×, or greater improvements in the precision of genomic or combinatorial library generation.


Recursive Tracking for Combinatorial Engineering


Disclosed herein are methods and compositions for iterative rounds of engineering. Disclosed herein are recursive engineering strategies that allow implementation of CREATE recording at the single cell level through several serial engineering cycles (e.g., FIG. 18 and FIG. 19). These disclosed methods and compositions can enable search-based technologies that can effectively construct and explore complex genotypic space. The terms recursive and iterative can be used interchangeably.


Combinatorial engineering methods can comprise multiple rounds of engineering. Methods disclosed herein can comprise 2 or more rounds of engineering. For example, a method can comprise 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, or more than 30 rounds of engineering.


In some examples, during each round of engineering a new recorder sequence, such as a barcode, is incorporated at the same locus in nearby sites (e.g., FIG. 18, green bars or FIG. 19, black bars) such that following multiple engineering cycles to construct combinatorial diversity throughout the genome (e.g., FIG. 18, green bars or FIG. 19, grey bars) a simple PCR of the recording locus can be used to reconstruct each combinatorial genotype or to confirm that the engineered edit from each round has been incorporated into the target site.
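A minimal sketch of how a sequenced amplicon of the recording locus could be decoded into a combinatorial genotype is given below. It assumes, purely for illustration, that barcodes of equal length are laid out in round order and separated by a fixed number of bases; the barcode sequences, edit labels, and function name are hypothetical.

```python
def reconstruct_genotype(amplicon, barcode_len, spacer_len, round_maps):
    """Read one barcode per engineering round from a recording-locus amplicon.

    round_maps: list with one dict per round, mapping barcode -> edit label.
    Returns one edit label per round; None marks a missing or unknown barcode.
    """
    genotype = []
    step = barcode_len + spacer_len
    for round_index, mapping in enumerate(round_maps):
        start = round_index * step
        barcode = amplicon[start:start + barcode_len]
        genotype.append(mapping.get(barcode))
    return genotype


# Hypothetical two-round experiment with 4 bp barcodes and a 2 bp spacer.
round_maps = [
    {"AAAC": "edit_1a", "AAAG": "edit_1b"},
    {"CCCA": "edit_2a", "CCCT": "edit_2b"},
]
amplicon = "AAAG" + "TT" + "CCCA"
print(reconstruct_genotype(amplicon, 4, 2, round_maps))
```

A `None` entry indicates that the corresponding round did not leave a recognizable recorder sequence, which can be used to confirm whether the edit from each round was incorporated.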


Disclosed herein are methods for selecting for successive rounds of engineering. Selection can occur by a PAM mutation incorporated by an editing cassette. Selection can occur by a PAM mutation incorporated by a recorder cassette. Selection can occur using a screenable, selectable, or counter-selectable marker. Selection can occur by targeting a site for editing or recording that was incorporated by a prior round of engineering, thereby selecting for variants that successfully incorporated edits and recorder sequences from both rounds or all prior rounds of engineering.


Quantitation of these genotypes can be used for understanding combinatorial mutational effects on large populations and investigation of important biological phenomena such as epistasis.


Serial editing and combinatorial tracking can be implemented using recursive vector systems as disclosed herein. These recursive vector systems can be used to move rapidly through the transformation procedure. In some examples, these systems consist of two or more plasmids, each containing an orthogonal replication origin, an antibiotic marker, and an encoded guide nucleic acid. The encoded guide nucleic acid in each vector can be designed to target one of the other resistance markers for destruction by nucleic acid-guided nuclease-mediated cleavage. These systems can be used, in some examples, to perform transformations in which the antibiotic selection pressure is switched to remove the previous plasmid and drive enrichment of the next round of engineered genomes. Two or more passages through the transformation loop can be performed, or in other words, multiple rounds of engineering can be performed. Introducing the requisite recording cassettes and editing cassettes into recursive vectors as disclosed herein can be used for simultaneous genome editing and plasmid curing in each transformation step with high efficiencies.


In some examples, the recursive vector system disclosed herein comprises 2, 3, 4, 5, 6, 7, 8, 9, 10, or more than 10 unique plasmids. In some examples, the recursive vector system can use a particular plasmid more than once as long as a distinct plasmid is used in the previous round and in the subsequent round.
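The plasmid-switching logic of a recursive vector system can be sketched as below. The plasmid names, marker names, and function names are hypothetical; the sketch assumes that each incoming plasmid carries a guide nucleic acid targeting the resistance marker of the plasmid used in the preceding round, so that the resident plasmid is cured when the next one is introduced.

```python
# Hypothetical two-plasmid system: each guide targets the other plasmid's marker.
PLASMIDS = {
    "pA": {"marker": "carb", "guide_targets": "kan"},
    "pB": {"marker": "kan", "guide_targets": "carb"},
}


def valid_schedule(schedule):
    """A plasmid may be reused, but never in two consecutive rounds."""
    return all(a != b for a, b in zip(schedule, schedule[1:]))


def run_rounds(schedule, plasmids=PLASMIDS):
    """Return, for each round, the incoming plasmid and the plasmids it cures."""
    if not valid_schedule(schedule):
        raise ValueError("the same plasmid is used in consecutive rounds")
    resident = set()
    log = []
    for name in schedule:
        incoming = plasmids[name]
        # The incoming guide directs cleavage of any resident plasmid
        # carrying the targeted resistance marker.
        cured = {p for p in resident
                 if plasmids[p]["marker"] == incoming["guide_targets"]}
        resident = (resident - cured) | {name}
        log.append((name, sorted(cured)))
    return log


print(run_rounds(["pA", "pB", "pA"]))
```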


Recursive methods and compositions disclosed herein can be used to restore function to a selectable or screenable element in a targeted genome or plasmid. The selectable or screenable element can include an antibiotic resistance gene, a fluorescent gene, a unique DNA sequence or watermark, or other known reporter, screenable, or selectable gene. In some examples, each successive round of engineering can incorporate a fragment of the selectable or screenable element, such that at the end of the engineering rounds, the entire selectable or screenable element has been incorporated into the target genome or plasmid. In such examples, only those genomes or plasmids which have successfully incorporated all of the fragments, and therefore all of the desired corresponding mutations, can be selected or screened for. In this way, the selected or screened cells will be enriched for those that have incorporated the edits from each and every iterative round of engineering.


Recursive methods can be used to switch a selectable or screenable marker between an on and an off position, or between an off and an on position, with each successive round of engineering. Using such a method allows conservation of available selectable or screenable markers by requiring, for example, the use of only one screenable or selectable marker. Furthermore, short regulatory sequences, start codons, or non-start codons can be used to turn the screenable or selectable marker on and off. Such short sequences can easily fit within a synthesized cassette or polynucleotide.


One or more rounds of engineering can be performed using the methods and compositions disclosed herein. In some examples, each round of engineering is used to incorporate an edit unique from that of previous rounds. Each round of engineering can incorporate a unique recording sequence. Each round of engineering can result in removal or curing of the plasmid used in the previous round of engineering. In some examples, successful incorporation of the recording sequence of each round of engineering results in a complete and functional screenable or selectable marker or unique sequence combination.


Unique recorder cassettes comprising recording sequences such as barcodes or screenable or selectable markers can be inserted with each round of engineering, thereby generating a recorder sequence that is indicative of the combination of edits or engineering steps performed. Successive recording sequences can be inserted adjacent to one another. Successive recording sequences can be inserted within proximity to one another.


Successive sequences can be inserted at a distance from one another. For example, successive recorder sequences can be inserted and separated by 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, or greater than 100 bp. In some examples, successive recorder sequences are separated by about 10, 50, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1100, 1200, 1300, 1400, 1500, or greater than 1500 bp.


Successive recorder sequences can be separated by any desired number of base pairs, and the separation can depend on and be limited by the number of successive recorder sequences to be inserted, the size of the target nucleic acid or target genomes, and/or the design of the desired final recorder sequence. For example, if the compiled recorder sequence is a functional screenable or selectable marker, then the successive recording sequences can be inserted within proximity and within the same reading frame from one another. If the compiled recorder sequence is a unique set of barcodes to be identified by sequencing and has no coding sequence element, then the successive recorder sequences can be inserted with any desired number of base pairs separating them. In these cases, the separation distance can be dependent on the sequencing technology to be used and the read length limit.
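The dependence of the separation distance on read length can be illustrated with a short calculation. The sketch below assumes equal-length recorder sequences with uniform spacing and a single read that must cover the whole array; the numeric values and function names are hypothetical.

```python
def recorder_array_span(n_recorders, recorder_len, spacing):
    """Total length in bp of n recorder sequences with uniform spacing between them."""
    if n_recorders < 1 or recorder_len < 1 or spacing < 0:
        raise ValueError("invalid recorder array parameters")
    return n_recorders * recorder_len + (n_recorders - 1) * spacing


def fits_in_read(n_recorders, recorder_len, spacing, read_length):
    """True if the whole recorder array can be covered by one read."""
    return recorder_array_span(n_recorders, recorder_len, spacing) <= read_length


# Hypothetical design: 15 bp barcodes separated by 10 bp, 150 bp reads.
print(recorder_array_span(5, 15, 10))
print(fits_in_read(5, 15, 10, read_length=150))
print(fits_in_read(10, 15, 10, read_length=150))
```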


While preferred embodiments of the present invention have been shown and described herein, it will be obvious to those skilled in the art that such embodiments are provided by way of example only. Numerous variations, changes, and substitutions will now occur to those skilled in the art without departing from the invention. It should be understood that various alternatives to the embodiments of the invention described herein may be employed in practicing the invention. It is intended that the following claims define the scope of the invention and that methods and structures within the scope of these claims and their equivalents be covered thereby.


Some Definitions


As used herein the term “wild type” is a term of the art understood by skilled persons and means the typical form of an organism, strain, gene or characteristic as it occurs in nature as distinguished from mutant or variant forms.


As used herein the term “variant” should be taken to mean the exhibition of qualities that have a pattern that deviates from what occurs in nature.


The terms “orthologue” (also referred to as “ortholog” herein) and “homologue” (also referred to as “homolog” herein) are well known in the art. By means of further guidance, a “homologue” of a protein as used herein is a protein of the same species which performs the same or a similar function as the protein it is a homologue of. Homologous proteins may but need not be structurally related, or are only partially structurally related. An “orthologue” of a protein as used herein is a protein of a different species which performs the same or a similar function as the protein it is an orthologue of. Orthologous proteins may but need not be structurally related, or are only partially structurally related. Homologs and orthologs may be identified by homology modelling (see, e.g., Greer, Science vol. 228 (1985) 1055, and Blundell et al. Eur J Biochem vol 172 (1988), 513) or “structural BLAST” (Dey F, Cliff Zhang Q, Petrey D, Honig B. Toward a “structural BLAST”: using structural relationships to infer function. Protein Sci. 2013 April; 22(4):359-66. doi: 10.1002/pro.2225.).


The terms “polynucleotide”, “nucleotide”, “nucleotide sequence”, “nucleic acid” and “oligonucleotide” are used interchangeably. They refer to a polymeric form of nucleotides of any length, either deoxyribonucleotides or ribonucleotides, or analogs thereof. Polynucleotides may have any three dimensional structure, and may perform any function, known or unknown. The following are non-limiting examples of polynucleotides: coding or non-coding regions of a gene or gene fragment, loci (locus) defined from linkage analysis, exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, short interfering RNA (siRNA), short-hairpin RNA (shRNA), micro-RNA (miRNA), ribozymes, cDNA, recombinant polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, isolated RNA of any sequence, nucleic acid probes, and primers. The term also encompasses nucleic-acid-like structures with synthetic backbones, see, e.g., Eckstein, 1991; Baserga et al., 1992; Milligan, 1993; WO 97/03211; WO 96/39154; Mata, 1997; Strauss-Soukup, 1997; and Samstag, 1996. A polynucleotide may comprise one or more modified nucleotides, such as methylated nucleotides and nucleotide analogs. If present, modifications to the nucleotide structure may be imparted before or after assembly of the polymer. The sequence of nucleotides may be interrupted by non-nucleotide components. A polynucleotide may be further modified after polymerization, such as by conjugation with a labeling component.


“Complementarity” refers to the ability of a nucleic acid to form hydrogen bond(s) with another nucleic acid sequence by either traditional Watson-Crick base pairing or other non-traditional types. A percent complementarity indicates the percentage of residues in a nucleic acid molecule which can form hydrogen bonds (e.g., Watson-Crick base pairing) with a second nucleic acid sequence (e.g., 5, 6, 7, 8, 9, 10 out of 10 being 50%, 60%, 70%, 80%, 90%, and 100% complementary). “Perfectly complementary” means that all the contiguous residues of a nucleic acid sequence will hydrogen bond with the same number of contiguous residues in a second nucleic acid sequence. “Substantially complementary” as used herein refers to a degree of complementarity that is at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, 99%, or 100% over a region of 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50, or more nucleotides, or refers to two nucleic acids that hybridize under stringent conditions.
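The percent complementarity calculation described above can be sketched as follows. This illustration considers only Watson-Crick pairs between two equal-length strands aligned antiparallel without gaps; the sequences and function name are hypothetical.

```python
# Watson-Crick pairs for DNA and RNA.
PAIRS = {("A", "T"), ("T", "A"), ("G", "C"), ("C", "G"), ("A", "U"), ("U", "A")}


def percent_complementarity(query, target):
    """Percentage of query residues that pair with the target.

    Both strands are written 5'->3', so the target is read in reverse
    to line the two strands up antiparallel.
    """
    if len(query) != len(target) or not query:
        raise ValueError("sequences must be non-empty and of equal length")
    paired = sum((q, t) in PAIRS
                 for q, t in zip(query.upper(), reversed(target.upper())))
    return 100.0 * paired / len(query)


print(percent_complementarity("ACGTACGTAC", "GTACGTACGT"))  # perfect reverse complement
print(percent_complementarity("ACGTACGTAC", "GTACGTACAA"))  # two mismatched positions
```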


As used herein, “stringent conditions” for hybridization refer to conditions under which a nucleic acid having complementarity to a target sequence predominantly hybridizes with the target sequence, and substantially does not hybridize to non-target sequences. Stringent conditions are generally sequence-dependent, and vary depending on a number of factors. In general, the longer the sequence, the higher the temperature at which the sequence specifically hybridizes to its target sequence. Non-limiting examples of stringent conditions are described in detail in Tijssen (1993). Laboratory Techniques In Biochemistry And Molecular Biology-Hybridization With Nucleic Acid Probes Part I, Second Chapter “Overview of principles of hybridization and the strategy of nucleic acid probe assay”, Elsevier, N.Y. Where reference is made to a polynucleotide sequence, then complementary or partially complementary sequences are also envisaged. These are preferably capable of hybridizing to the reference sequence under highly stringent conditions. Generally, in order to maximize the hybridization rate, relatively low-stringency hybridization conditions are selected: about 20 to 25 degrees Celsius lower than the thermal melting point (Tm). The Tm is the temperature at which 50% of specific target sequence hybridizes to a perfectly complementary probe in solution at a defined ionic strength and pH. Generally, in order to require at least about 85% nucleotide complementarity of hybridized sequences, highly stringent washing conditions are selected to be about 5 to 15 degrees Celsius lower than the Tm. In order to require at least about 70% nucleotide complementarity of hybridized sequences, moderately-stringent washing conditions are selected to be about 15 to 30 degrees Celsius lower than the Tm. Highly permissive (very low stringency) washing conditions may be as low as 50 degrees Celsius below the Tm, allowing a high level of mis-matching between hybridized sequences.
Those skilled in the art will recognize that other physical and chemical parameters in the hybridization and wash stages can also be altered to affect the outcome of a detectable hybridization signal from a specific level of homology between target and probe sequences. “Hybridization” refers to a reaction in which one or more polynucleotides react to form a complex that is stabilized via hydrogen bonding between the bases of the nucleotide residues. The hydrogen bonding may occur by Watson-Crick base pairing, Hoogsteen binding, or in any other sequence-specific manner. The complex may comprise two strands forming a duplex structure, three or more strands forming a multi-stranded complex, a single self-hybridizing strand, or any combination of these. A hybridization reaction may constitute a step in a more extensive process, such as the initiation of PCR, or the cleavage of a polynucleotide by an enzyme. A sequence capable of hybridizing with a given sequence is referred to as the “complement” of the given sequence.
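The temperature offsets given above for hybridization and washing can be turned into temperature windows for a given Tm, as in the following sketch. The offsets are those stated in the text; the Tm value, dictionary keys, and function name are hypothetical, and the Tm itself is assumed to have been determined separately.

```python
# Offsets in degrees Celsius below the Tm, as (nearest, farthest) from the Tm.
OFFSETS_BELOW_TM = {
    "hybridization": (20, 25),
    "high_stringency_wash": (5, 15),
    "moderate_stringency_wash": (15, 30),
}


def temperature_window(tm_celsius, condition):
    """Return (lowest, highest) temperature in degrees Celsius for a condition."""
    nearest, farthest = OFFSETS_BELOW_TM[condition]
    return (tm_celsius - farthest, tm_celsius - nearest)


# Hypothetical probe with a Tm of 70 degrees Celsius.
for name in OFFSETS_BELOW_TM:
    print(name, temperature_window(70, name))
```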


As used herein, the term “genomic locus” or “locus” (plural loci) is the specific location of a gene or DNA sequence on a chromosome. A “gene” refers to stretches of DNA or RNA that encode a polypeptide or an RNA chain that has a functional role to play in an organism and hence is the molecular unit of heredity in living organisms. For the purpose of this invention it may be considered that genes include regions which regulate the production of the gene product, whether or not such regulatory sequences are adjacent to coding and/or transcribed sequences. Accordingly, a gene includes, but is not necessarily limited to, promoter sequences, terminators, translational regulatory sequences such as ribosome binding sites and internal ribosome entry sites, enhancers, silencers, insulators, boundary elements, replication origins, matrix attachment sites and locus control regions.


As used herein, “expression of a genomic locus” or “gene expression” is the process by which information from a gene is used in the synthesis of a functional gene product. The products of gene expression are often proteins, but in non-protein coding genes such as rRNA genes or tRNA genes, the product is functional RNA. The process of gene expression is used by all known life, including eukaryotes (including multicellular organisms), prokaryotes (bacteria and archaea), and viruses, to generate functional products to survive. As used herein “expression” of a gene or nucleic acid encompasses not only cellular gene expression, but also the transcription and translation of nucleic acid(s) in cloning systems and in any other context. As used herein, “expression” also refers to the process by which a polynucleotide is transcribed from a DNA template (such as into an mRNA or other RNA transcript) and/or the process by which a transcribed mRNA is subsequently translated into peptides, polypeptides, or proteins. Transcripts and encoded polypeptides may be collectively referred to as “gene product.” If the polynucleotide is derived from genomic DNA, expression may include splicing of the mRNA in a eukaryotic cell.


The terms “polypeptide”, “peptide” and “protein” are used interchangeably herein to refer to polymers of amino acids of any length. The polymer may be linear or branched, it may comprise modified amino acids, and it may be interrupted by non-amino acids. The terms also encompass an amino acid polymer that has been modified; for example, disulfide bond formation, glycosylation, lipidation, acetylation, phosphorylation, or any other manipulation, such as conjugation with a labeling component.


As used herein the term “amino acid” includes natural and/or unnatural or synthetic amino acids, including glycine and both the D and L optical isomers, and amino acid analogs and peptidomimetics.


As used herein, the term “domain” or “protein domain” refers to a part of a protein sequence that may exist and function independently of the rest of the protein chain. As described in aspects of the invention, sequence identity is related to sequence homology. Homology comparisons may be conducted by eye, or more usually, with the aid of readily available sequence comparison programs. These commercially available computer programs may calculate percent (%) homology between two or more sequences and may also calculate the sequence identity shared by two or more amino acid or nucleic acid sequences. Sequence homologies may be generated by any of a number of computer programs known in the art, for example BLAST or FASTA, etc. A suitable computer program for carrying out such an alignment is the GCG Wisconsin Bestfit package (University of Wisconsin, U.S.A.; Devereux et al., 1984, Nucleic Acids Research 12:387). Examples of other software that may perform sequence comparisons include, but are not limited to, the BLAST package (see Ausubel et al., 1999 ibid, Chapter 18), FASTA (Altschul et al., 1990, J. Mol. Biol., 403-410) and the GENEWORKS suite of comparison tools. Both BLAST and FASTA are available for offline and online searching (see Ausubel et al., 1999 ibid, pages 7-58 to 7-60). However, it is preferred to use the GCG Bestfit program.


Percent homology may be calculated over contiguous sequences, i.e., one sequence is aligned with the other sequence and each amino acid or nucleotide in one sequence is directly compared with the corresponding amino acid or nucleotide in the other sequence, one residue at a time. This is called an “ungapped” alignment. Typically, such ungapped alignments are performed only over a relatively short number of residues.


Although this is a very simple and consistent method, it fails to take into consideration that, for example, in an otherwise identical pair of sequences, one insertion or deletion may cause the following amino acid residues to be put out of alignment, thus potentially resulting in a large reduction in % homology when a global alignment is performed. Consequently, most sequence comparison methods are designed to produce optimal alignments that take into consideration possible insertions and deletions without unduly penalizing the overall homology or identity score. This is achieved by inserting “gaps” in the sequence alignment to try to maximize local homology or identity.
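The effect described above can be seen in a minimal sketch of an ungapped comparison. The peptide sequences and function name are hypothetical; the function simply compares residues position by position over the shorter of the two sequences.

```python
def ungapped_identity(seq_a, seq_b):
    """Percent identity from a position-by-position comparison without gaps."""
    compared = min(len(seq_a), len(seq_b))
    if compared == 0:
        raise ValueError("sequences must be non-empty")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / compared


original = "MKTAYIAKQR"
with_insertion = "MKGTAYIAKQR"  # same sequence with one G inserted after position 2

print(ungapped_identity(original, original))
print(ungapped_identity(original, with_insertion))
```

Every residue after the insertion is shifted by one position, so only the first two residues still match even though the two sequences differ by a single inserted residue. This is the situation that gapped alignment methods are designed to handle.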


However, these more complex methods assign “gap penalties” to each gap that occurs in the alignment so that, for the same number of identical amino acids, a sequence alignment with as few gaps as possible, reflecting higher relatedness between the two compared sequences, may achieve a higher score than one with many gaps. “Affine gap costs” are typically used that charge a relatively high cost for the existence of a gap and a smaller penalty for each subsequent residue in the gap. This is the most commonly used gap scoring system. High gap penalties may, of course, produce optimized alignments with fewer gaps. Most alignment programs allow the gap penalties to be modified. However, it is preferred to use the default values when using such software for sequence comparisons. For example, when using the GCG Wisconsin Bestfit package the default gap penalty for amino acid sequences is −12 for a gap and −4 for each extension.
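The gap scoring scheme described above can be sketched as follows, using the stated default values of −12 for a gap and −4 for each extension. The sketch follows the description in the text, in which the opening penalty accounts for the existence of the gap and the extension penalty applies to each subsequent residue; individual alignment programs may count the first residue differently, so this convention is an assumption, as is the function name.

```python
def gap_penalty(gap_length, gap_open=-12, gap_extend=-4):
    """Penalty for one gap: opening cost plus a cost per subsequent residue."""
    if gap_length < 0:
        raise ValueError("gap_length must be non-negative")
    if gap_length == 0:
        return 0
    return gap_open + gap_extend * (gap_length - 1)


# One gap of three residues versus three separate single-residue gaps.
one_long_gap = gap_penalty(3)
three_short_gaps = 3 * gap_penalty(1)
print(one_long_gap, three_short_gaps)
```

Under this scheme a single three-residue gap is penalized less than three separate one-residue gaps, which favors alignments with fewer gaps.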


Calculation of maximum % homology therefore first requires the production of an optimal alignment, taking into consideration gap penalties. A suitable computer program for carrying out such an alignment is the GCG Wisconsin Bestfit package (Devereux et al., 1984, Nucleic Acids Research 12:387). Examples of other software that may perform sequence comparisons include, but are not limited to, the BLAST package (see Ausubel et al., 1999 Short Protocols in Molecular Biology, 4th Ed., Chapter 18), FASTA (Altschul et al., 1990 J. Mol. Biol. 403-410) and the GENEWORKS suite of comparison tools. Both BLAST and FASTA are available for offline and online searching (see Ausubel et al., 1999, Short Protocols in Molecular Biology, pages 7-58 to 7-60). However, for some applications, it is preferred to use the GCG Bestfit program. A new tool, called BLAST 2 Sequences, is also available for comparing protein and nucleotide sequences (see FEMS Microbiol Lett. 1999 174(2): 247-50; FEMS Microbiol Lett. 1999 177(1): 187-8 and the website of the National Center for Biotechnology Information at the website of the National Institutes of Health).


Although the final % homology may be measured in terms of identity, the alignment process itself is typically not based on an all-or-nothing pair comparison. Instead, a scaled similarity score matrix is generally used that assigns scores to each pair-wise comparison based on chemical similarity or evolutionary distance. An example of such a matrix commonly used is the BLOSUM62 matrix—the default matrix for the BLAST suite of programs. GCG Wisconsin programs generally use either the public default values or a custom symbol comparison table, if supplied (see user manual for further details). For some applications, it is preferred to use the public default values for the GCG package, or in the case of other software, the default matrix, such as BLOSUM62.


Alternatively, percentage homologies may be calculated using the multiple alignment feature in DNASIS™ (Hitachi Software), based on an algorithm analogous to CLUSTAL (Higgins D G & Sharp P M (1988), Gene 73(1), 237-244). Once the software has produced an optimal alignment, it is possible to calculate % homology, preferably % sequence identity. The software typically does this as part of the sequence comparison and generates a numerical result.
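By way of a non-limiting sketch, % sequence identity may be computed from an existing pairwise alignment as shown below. The function name and the example sequences are hypothetical. The choice of denominator (all alignment columns except those in which both sequences carry a gap) is an assumption; alignment packages differ on this point and may report different values for the same alignment.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over the columns of a pairwise alignment.

    Both inputs must already be aligned (equal length, gaps marked with `gap`).
    Columns where both sequences have a gap are ignored (assumed convention).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    compared = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == gap and y == gap:
            continue  # column carries no information for this pair
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns in alignment")
    return 100.0 * matches / compared


# Hypothetical six-column alignment: four identical columns, one gap, one mismatch.
example_identity = percent_identity("MKV-LT", "MKVALS")
```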


Sequences may also have deletions, insertions or substitutions of amino acid residues which produce a silent change and result in a functionally equivalent substance. Deliberate amino acid substitutions may be made on the basis of similarity in amino acid properties (such as polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues) and it is therefore useful to group amino acids together in functional groups. Amino acids may be grouped together based on the properties of their side chains alone. However, it is more useful to include mutation data as well. The sets of amino acids thus derived are likely to be conserved for structural reasons. These sets may be described in the form of a Venn diagram (Livingstone C. D. and Barton G. J. (1993) “Protein sequence alignments: a strategy for the hierarchical analysis of residue conservation” Comput. Appl. Biosci. 9: 745-756) (Taylor W. R. (1986) “The classification of amino acid conservation” J. Theor. Biol. 119; 205-218). Conservative substitutions may be made, for example according to the table below which describes a generally accepted Venn diagram grouping of amino acids.


Embodiments of the invention include sequences (both polynucleotide and polypeptide) which may comprise homologous substitution (substitution and replacement are both used herein to mean the interchange of an existing amino acid residue or nucleotide with an alternative residue or nucleotide), i.e., like-for-like substitution in the case of amino acids, such as basic for basic, acidic for acidic, polar for polar, etc. Non-homologous substitution may also occur, i.e., from one class of residue to another, or alternatively involving the inclusion of unnatural amino acids such as ornithine (hereinafter referred to as Z), diaminobutyric acid (hereinafter referred to as B), norleucine (hereinafter referred to as O), pyridylalanine, thienylalanine, naphthylalanine and phenylglycine.


Variant amino acid sequences may include suitable spacer groups that may be inserted between any two amino acid residues of the sequence including alkyl groups such as methyl, ethyl or propyl groups in addition to amino acid spacers such as glycine or β-alanine residues. A further form of variation, which involves the presence of one or more amino acid residues in peptoid form, may be well understood by those skilled in the art. For the avoidance of doubt, “the peptoid form” is used to refer to variant amino acid residues wherein the α-carbon substituent group is on the residue's nitrogen atom rather than the α-carbon. Processes for preparing peptides in the peptoid form are known in the art, for example Simon R J et al., PNAS (1992) 89(20), 9367-9371 and Horwell D C, Trends Biotechnol. (1995) 13(4), 132-134.


The practice of the present invention employs, unless otherwise indicated, conventional techniques of immunology, biochemistry, chemistry, molecular biology, microbiology, cell biology, genomics and recombinant DNA, which are within the skill of the art. See Green and Sambrook (Molecular Cloning: A Laboratory Manual, 4th ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 2014); CURRENT PROTOCOLS IN MOLECULAR BIOLOGY (F. M. Ausubel, et al. eds. (2017)); Short Protocols in Molecular Biology (Ausubel et al., 1999); the series METHODS IN ENZYMOLOGY (Academic Press, Inc.); PCR 2: A PRACTICAL APPROACH (M. J. MacPherson, B. D. Hames and G. R. Taylor eds. (1995)); ANTIBODIES, A LABORATORY MANUAL, SECOND EDITION (Harlow and Lane, eds. (2014)); and CULTURE OF ANIMAL CELLS: A MANUAL OF BASIC TECHNIQUE, 7TH EDITION (R. I. Freshney, ed. (2016)).


EXAMPLES

The following examples are given for the purpose of illustrating various embodiments of the invention and are not meant to limit the present invention in any fashion. The present examples, along with the methods described herein are presently representative of preferred embodiments, are exemplary, and are not intended as limitations on the scope of the invention. Changes therein and other uses which are encompassed within the spirit of the invention as defined by the scope of the claims will occur to those skilled in the art.


Example 1
Nucleic Acid-Guided Nucleases

Sequences for twenty nucleic acid-guided nucleases, termed MAD1-MAD20 (SEQ ID NOs 1-20), were aligned and compared to other nucleic acid-guided nucleases. A partial alignment and phylogenetic tree are depicted in FIG. 1A and FIG. 1B respectively. Key residues that may be involved in the recognition of a PAM site are shown in FIG. 1A. These include amino acids at positions 167, 539, 548, 599, 603, 604, 605, 606, and 607.


Sequence alignments were built using PSI-BLAST to search for MAD nuclease homologs in the NCBI non-redundant databases. Multiple sequence alignments were further refined using the MUSCLE alignment algorithm with default settings as implemented in Geneious 10. The percent identity of each homolog to the SpCas9 and AsCpf1 reference sequences was computed based on the pairwise alignment matching from these global alignments.


Genomic source sequences were identified using UniProt linkage information or TBLASTN searches of NCBI using the default parameters and searching all possible frames for translational matches.


Percent identities of MAD1-8 and MAD10-12 to various other nucleases are summarized in Table 1. These percent identities represent the shared amino acid sequence identity between the indicated proteins.




















TABLE 1

Protein identifier or accession number  MAD1   MAD2   MAD3   MAD4   MAD5   MAD6   MAD7   MAD8   MAD10  MAD11  MAD12
gi|1025734861|pdb|5B43|A                6.4    32.8   33.2   29.7   29.4   31.1   30.3   31.7   26.7   27.9   98.8
gi|1052245173|pdb|5KK5|A                6.4    32.7   33.1   29.7   29.3   31     30.3   31.7   26.7   27.8   98.7
gi|1086216683|emb|SDC16215.1|           6.1    33     34.4   29.6   30.1   33.5   32.3   32.1   26.2   27.2   46.8
gi|1120175333|ref|WP_073043853.1|       5.9    30.9   37.2   32.8   33.6   34.4   35.7   35.1   26.3   28.3   34.9
Cpf1.SjIWP_081839471                    6.6    33.6   41.7   37.2   33.4   37.6   40.1   37.7   29.1   30.3   34.1
Cpf1.Ss|KFO67989                        6.9    32.3   35.7   43     33.7   45.9   34.8   48     33.2   33.4   33.8
MAD3                                    5.8    31     100    32.9   35.9   35     35.6   34.3   28     27.6   33.1
gi|1082474576|gb|OFY19591.1|            7      31.4   35.9   43.2   31.4   45     33.6   48.6   30.8   33.5   33
MAD2                                    6.1    100    31     30.7   30.2   31     31.2   31.2   25.8   27.7   32.6
Cpf1.Lb5IWP_016301126                   7.8    32.8   36.5   38.2   34.2   45.5   35.8   43.6   30.7   35.7   32.5
gi|1088286736|gb|OHB41002.1|            6.7    30.6   35.3   42.4   33.2   44.7   32.1   46.8   30.7   32.6   32.4
gi|1094423310|emb|SER03894.1|           6.8    30.8   36.1   40.4   31.8   50.4   35.2   46.6   30.4   36.8   32.3
gi|493326531|ref|WP_006283774.1|        6.8    30.8   36.1   40.3   31.8   50.3   35.1   46.6   30.4   36.8   32.3
MAD8                                    7.6    31.2   34.3   40.4   32     41.6   32.8   100    30.1   32.1   31.7
Cpf1.Bot|WP_009217842                   6.9    30.1   36.6   41.5   32.5   50.2   35.4   45.5   29.8   34.1   31.6
Cpf1.Li|WP_020988726                    7.3    30.2   34.6   39.3   30.3   40.7   31.8   39.4   32.1   31.3   31.5
Cpf1.Pb1WP_044110123                    6.3    31.4   31.8   36.1   30.8   45.7   30.4   39.4   27.7   33.5   31.5
gi|817911372|gb|AKG08867.1|             7.3    29.8   35     40.7   32.1   40.3   32.6   41.7   29.1   31     31.4
gi|1052838533|emb|SCH45297.1|           6.6    30.8   35.5   32     31.5   34.4   51.9   33.4   26.1   29     31.3
gi|1053713332|ref|WP_066040075.1|       7.2    29.6   33.2   39.6   29.8   49.1   32.2   41.4   30.1   32.4   31.3
gi|817909002|gb|AKG06878.1|             7.3    29.8   35     40.7   32     40.3   32.5   41.6   29.1   30.9   31.3
gi|1042201477|ref|WP_065256572.1|       7.2    29.5   35.2   40.6   31.9   40.1   32.7   41.6   29     30.8   31.2
MAD6                                    7.5    31     35     38.9   33.1   100    34.3   41.6   30.5   33.6   31
gi|490468773|ref|WP_004339290.1|        6.8    31.8   31.7   36.2   28.6   36.5   31.4   38.4   28.5   31.4   31
gi|565853704|ref|WP_023936172.1|        7.5    30.8   34.9   38.9   33.1   99.7   34.1   41.6   30.4   33.6   31
gi|739005707|ref|WP_036887416.1|        7.5    30.9   35     38.9   33     99.9   34.2   41.5   30.4   33.5   31
gi|739008549|ref|WP_036890108.1|        7.5    31     35     38.8   33     99.8   34.2   41.5   30.4   33.5   31
Cpf1.Ft|WP_014550095                    7.1    31.9   33.8   40.3   29.7   39.4   34.1   41     29.8   32.5   30.8
gi|504362993|ref|WP_014550095.1|        7.2    32.4   33.8   40.3   29.6   39.4   33.8   40.9   30.1   32.5   30.8
gi|640557447|ref|WP_024988992.1|        6.6    31.4   34.8   40.7   31.2   48     34.1   45.1   28.8   35.2   30.8
gi|1098944113|ref|WP_071304624.1|       7.1    32.3   33.5   40.3   29.6   39.2   33.8   40.9   30.1   32.5   30.6
gi|489124848|ref|WP_003034647.1|        7.1    32.3   33.9   40.9   29.9   39.2   33.9   40.9   29.9   32.2   30.6
gi|738967776|ref|WP_036851563.1|        6.8    29.4   33.1   35.5   28.9   40.3   30.7   35.9   28.7   31.3   30.5
MAD7                                    5.9    31.2   35.6   30.8   33.9   34.3   100    32.8   24.2   28.9   30.5
Cpf1.Lb6IWP_044910713                   6.7    29.8   33.7   36.6   30.9   43     34     39.8   29.1   32.1   30.4
gi|1052961977|emb|SCH47915.1|           5.5    30.5   35.8   32.3   34     35     53.8   33.4   26.2   27.4   30.4
gi|817918353|gb|AKG14689.1|             7      29.1   34.4   39.8   31.7   40     32.4   41.1   28.4   30.1   30.3
gi|917059416|ref|WP_051666128.1|        6.9    29.9   31.5   35.7   31.6   41.8   32.9   39.1   30.1   34     30.2
gi|1011649201|ref|WP_062499108.1|       6.8    29     34.7   40.3   31.4   40.1   33.1   41.6   28.5   30.4   30.1
Cpf1.Pm|WP_018359861                    6.3    29.2   32.3   34.2   27.4   38.7   29.4   35     27.2   30.1   30
gi|817922537|gb|AKG18099.1|             6.8    29.1   34.5   39.6   31.5   39.9   32.7   40.7   28.3   29.8   30
gi|769142322|ref|WP_044919442.1|        6.7    31     34.6   37.8   31.5   41.4   33.3   39.2   28     31.9   29.9
gi|1023176441|pdb|5ID6|A                6.7    29.7   31.3   35.5   31.3   41     32.6   38.5   29.7   33.3   29.8
gi|491540987|ref|WP_005398606.1|        5.9    28.3   30.4   29.7   28.5   29     30.7   29.8   25.8   27.8   29.8
gi|652820612|ref|WP_027109509.1|        6.4    31.1   34     35.3   31.7   40.3   33.4   37.5   28.5   33.3   29.8
gi|502240446|ref|WP_012739647.1|        5.9    31.6   36.1   31.2   33     35.4   49.4   34     26.6   29.4   29.7
gi|524278046|emb|CDA41776.1|            5.8    31.6   36     31     33     35.4   50     34     26.6   29.5   29.7
gi|737831580|ref|WP_035798880.1|        6.2    31.3   34.8   38.1   31.5   42.1   33     39.6   28.4   32.4   29.7
gi|909652572|ref|WP_049895985.1|        6.9    30.7   34.2   37.2   30.8   41.5   34.2   38.7   28     32     29.7
MAD4                                    6.7    30.7   32.9   100    30.7   38.9   30.8   40.4   28.8   29.4   29.7
gi|942073049|ref|WP_055286279.1|        5.9    31.6   36.1   31.1   32.7   35     49.7   33.9   27.1   29.5   29.6
gi|654794505|ref|WP_028248456.1|        7.4    30.5   35.9   37.4   31.3   42.8   34.2   40.2   27.9   33.5   29.5
gi|933014786|emb|CU047728.1|            5.6    31.3   34.9   31.2   31.5   32.4   46.7   30.6   25.4   27.7   29.4
gi|941887450|ref|WP_055224182.1|        5.6    31.4   35     31.3   31.6   32.5   46.6   30.7   25.3   27.8   29.4
gi|920071674|ref|WP_052943011.1|        6.3    31     31.8   38.8   31.8   41.3   33.8   42.6   29.8   34.7   29
MAD5                                    5.1    30.2   35.9   30.7   100    33.1   33.9   32     24.3   28.7   29
gi|1081462674|emb|SCZ76797.1|           6.9    30.4   33.5   34.7   29.7   40.1   30.5   37.4   27.3   32.5   28.9
gi|918722523|ref|WP_052585281.1|        7.4    27.5   30.5   35.7   28.3   35.2   28.5   36     26     27.1   28.8
gi|524816323|emb|CDF09621.1|            6.2    30     34.1   29.3   31.2   32.7   47.6   32.2   25.5   25.9   28.4
gi|941782328|ref|WP_055176369.1|        6.2    30.2   33.1   28.9   30.9   32     46.9   32.1   26     27.1   28.4
gi|942113296|ref|WP_055306762.1|        6.4    29.8   33.8   29.7   31.3   33.1   48     32.5   25.8   26.2   28.4
MAD11                                   6.4    27.7   27.6   29.4   28.7   33.6   28.9   32.1   26.2   100    27.8
gi|653158548|ref|WP_027407524.1|        5.9    26.4   28.1   33.5   27.4   32.5   27.8   32     27     26.8   27.6
gi|652963004|ref|WP_027216152.1|        6.6    30.3   32.5   33.2   30.4   38.2   29.6   34.6   25.9   30.5   27.2
gi|1083069650|gb|OGD68774.1|            6.2    25     24.3   26.6   23.1   28.1   23.2   26.4   45     24.9   27.1
gi|302483275|gb|EFL46285.1|             5.6    24.7   26.8   30.3   24.9   34.8   26     30.4   24.4   27.5   27.1
gi|915400855|ref|WP_050786240.1|        5.6    24.7   26.8   30.3   24.9   34.8   26     30.4   24.4   27.5   27.1
MAD10                                   5.6    25.8   28     28.8   24.3   30.5   24.2   30.1   100    26.2   26.6
gi|1101117967|gb|OIO75780.1|            6.1    26.8   26     27.3   24.3   28.1   24.4   28.2   44.1   25.4   26.1
gi|1088204458|gb|OHA63117.1|            6.5    25.2   23.5   25.8   22.9   27     22     26.1   36.5   24.2   24.7
gi|809198071|ref|WP_046328599.1|        4.9    25.6   26.5   22.2   23.9   23.8   25.8   23.9   20.3   25.1   24
gi|1088079929|gb|OGZ45678.1|            5.6    21.9   23.8   26.9   23.4   27.8   23.3   26.7   28.8   24.7   23.5
gi|1101053499|gb|OIO15737.1|            5.9    23.1   26.2   25.2   23     26.4   25.1   26.5   29.2   23.2   23.4
gi|1101058058|gb|OIO19978.1|            5.4    21.2   22.8   23.6   20.6   25     20.7   25     25.9   22.2   23
gi|1088000848|gb|OGY73485.1|            5.7    23.5   25.2   25.5   23.9   27     25.1   25.6   31.6   23.6   22.9
gi|407014433|gb|EKE28449.1|             5.2    23.5   25.9   26.7   24.3   25.8   23     27.8   29.9   25.3   22.9
gi|818249855|gb|KKP36646.1|             6      21     20.7   23.5   20     24.2   21     24     24.6   21.8   22.6
gi|818703647|gb|KKT48220.1|             5.8    23.3   25     25.1   23.5   26.5   24.7   25.3   31.2   23.3   22.6
gi|818705786|gb|KKT50231.1|             5.8    23.1   24.6   24.7   22.9   26.2   24.2   24.8   30.8   22.9   22.2
gi|1083950632|gb|OGJ66851.1|            4.5    20     22.1   23.5   20.6   24.6   20     24     23.5   20.7   22.1
gi|1083932199|gb|OGJ49885.1|            6      20.4   20.2   22.6   19.3   23.3   20.6   23.2   23.9   21     21.8
gi|1083410735|gb|OGF20863.1|            5      21.7   23.3   25.5   23     25     22.7   25.9   27.2   22.4   21.5
gi|1011480927|ref|WP_062376669.1|       4.7    20.1   20.1   21.4   19.3   23.3   21.4   22     20.2   19.7   20.9
gi|818539593|gb|KKR91555.1|             5.1    19.8   21.6   22.1   20.5   22.9   21.2   22.8   24     20.5   19.9
gi|503048015|ref|WP_013282991.1|        5.1    18.8   20.7   15.3   19.7   18.9   19.3   17.7   15.9   19     19.2
gi|1096232746|ref|WP_071177645.1|       5      19.1   20.5   17.4   20.1   19.7   20.4   20.4   17.5   18.5   18.9
gi|769130404|ref|WP_044910712.1|        4.6    19.4   18.2   16.1   18.1   17.1   18.7   17.9   14.5   16.8   17.5
gi|1085569500|gb|OGX23684.1|            2.6    11.6   12.1   12.7   10.2   12.1   12.7   11.6   10.9   11.1   10.5
gi|818357062|gb|KKQ38176.1|             3.3    10     11.1   10.6   11.1   11.8   12.1   11.5   12.2   10.8   9.8
gi|745626763|gb|KIE18642.1|             3.7    9.4    11.7   11.1   11.1   12.5   11.9   11.9   10.2   10.6   8.8
MAD1                                    100    6.1    5.8    6.7    5.1    7.5    5.9    7.6    5.6    6.4    6.4
SpCas9                                  4      6.3    6.5    8.3    5.6    8.1    6.9    7.7    6.9    6.3    6.3
MAD12                                   6.4    32.6   33.1   29.7   29     31     30.5   31.7   26.6   27.8   100









Example 2
Expression of MAD Nucleases

Wild-type nucleic acid sequences for MAD1-MAD20 include SEQ ID NOs: 21-40, respectively. These MAD nucleases were codon optimized for expression in E. coli and the codon optimized sequences are listed as SEQ ID NOs: 41-60, respectively (summarized in Table 2). Codon optimized MAD1-MAD20 were cloned into an expression construct comprising a constitutive or inducible promoter (e.g., proB promoter SEQ ID NO: 83, or pBAD promoter SEQ ID NO: 81 or SEQ ID NO: 82) and an optional 6×-His tag (e.g., FIG. 2). The generated MAD1-MAD20 expression constructs are provided as SEQ ID NOs: 61-80, respectively. The expression constructs as depicted in FIG. 2 were generated either by restriction/ligation-based cloning or homology-based cloning.
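For convenience, the "respectively" numbering stated above (amino acid sequences as SEQ ID NOs: 1-20, wild-type nucleic acids as 21-40, codon optimized nucleic acids as 41-60, and expression constructs as 61-80) may be expressed as a simple lookup. The function and key names below are hypothetical and serve only as an illustrative sketch of that numbering.

```python
def mad_seq_ids(mad_number: int) -> dict:
    """Return the SEQ ID NOs associated with MAD<mad_number> (1 through 20)."""
    if not 1 <= mad_number <= 20:
        raise ValueError("MAD number must be between 1 and 20")
    return {
        "amino_acid": mad_number,                          # SEQ ID NOs: 1-20
        "wild_type_nucleic_acid": mad_number + 20,         # SEQ ID NOs: 21-40
        "codon_optimized_nucleic_acid": mad_number + 40,   # SEQ ID NOs: 41-60
        "expression_construct": mad_number + 60,           # SEQ ID NOs: 61-80
    }


mad7_ids = mad_seq_ids(7)
```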


Example 3
Testing Guide Nucleic Acid Sequences Compatible with MAD Nucleases

In order to have a functioning targetable nuclease complex, a nucleic acid-guided nuclease and a compatible guide nucleic acid are needed. To determine the compatible guide nucleic acid sequence, specifically the scaffold sequence portion of the guide nucleic acid, multiple approaches were taken. First, scaffold sequences were sought near the endogenous loci of each MAD nuclease. In some cases, such as with MAD2, no endogenous scaffold sequence was found. Therefore, we tested the compatibility of MAD2 with scaffold sequences found near the endogenous loci of the other MAD nucleases. The MAD nucleases and the corresponding endogenous scaffold sequences that were tested are listed in Table 2.













TABLE 2

MAD nuclease  WT nucleic acid sequence  Codon optimized nucleic acid sequence  Amino acid sequence  Endogenous scaffold sequence for guide nucleic acid
MAD1          SEQ ID NO: 21             SEQ ID NO: 41                          SEQ ID NO: 1         SEQ ID NO: 84
MAD2          SEQ ID NO: 22             SEQ ID NO: 42                          SEQ ID NO: 2         None identified
MAD3          SEQ ID NO: 23             SEQ ID NO: 43                          SEQ ID NO: 3         SEQ ID NO: 86
MAD4          SEQ ID NO: 24             SEQ ID NO: 44                          SEQ ID NO: 4         SEQ ID NO: 87
MAD5          SEQ ID NO: 25             SEQ ID NO: 45                          SEQ ID NO: 5         SEQ ID NO: 88
MAD6          SEQ ID NO: 26             SEQ ID NO: 46                          SEQ ID NO: 6         SEQ ID NO: 89
MAD7          SEQ ID NO: 27             SEQ ID NO: 47                          SEQ ID NO: 7         SEQ ID NO: 90
MAD8          SEQ ID NO: 28             SEQ ID NO: 48                          SEQ ID NO: 8         SEQ ID NO: 91
MAD9          SEQ ID NO: 29             SEQ ID NO: 49                          SEQ ID NO: 9         SEQ ID NO: 92; SEQ ID NO: 103; SEQ ID NO: 106
MAD10         SEQ ID NO: 30             SEQ ID NO: 50                          SEQ ID NO: 10        SEQ ID NO: 93
MAD11         SEQ ID NO: 31             SEQ ID NO: 51                          SEQ ID NO: 11        SEQ ID NO: 94
MAD12         SEQ ID NO: 32             SEQ ID NO: 52                          SEQ ID NO: 12        SEQ ID NO: 95
MAD13         SEQ ID NO: 33             SEQ ID NO: 53                          SEQ ID NO: 13        SEQ ID NO: 96; SEQ ID NO: 105; SEQ ID NO: 107
MAD14         SEQ ID NO: 34             SEQ ID NO: 54                          SEQ ID NO: 14        SEQ ID NO: 97
MAD15         SEQ ID NO: 35             SEQ ID NO: 55                          SEQ ID NO: 15        SEQ ID NO: 98
MAD16         SEQ ID NO: 36             SEQ ID NO: 56                          SEQ ID NO: 16        SEQ ID NO: 99
MAD17         SEQ ID NO: 37             SEQ ID NO: 57                          SEQ ID NO: 17        SEQ ID NO: 100
MAD18         SEQ ID NO: 38             SEQ ID NO: 58                          SEQ ID NO: 18        SEQ ID NO: 101
MAD19         SEQ ID NO: 39             SEQ ID NO: 59                          SEQ ID NO: 19        SEQ ID NO: 102
MAD20         SEQ ID NO: 40             SEQ ID NO: 60                          SEQ ID NO: 20        SEQ ID NO: 103









Editing cassettes as depicted in FIG. 3 were generated to assess the functionality of the MAD nucleases and corresponding guide nucleic acids. Each editing cassette comprises an editing sequence and a promoter operably linked to an encoded guide nucleic acid. The editing cassettes further comprise primer sites (P1 and P2) on flanking ends. The guide nucleic acids comprised various scaffold sequences to be tested, as well as a guide sequence to guide the MAD nuclease to the target sequence for editing. The editing sequences comprised a PAM mutation and/or codon mutation relative to the target sequence. The mutations were flanked by regions of homology (homology arms or HA) which would allow recombination into the cleaved target sequence (agcagctttatcatctgccg (SEQ ID No: 183); QQLYHLP (SEQ ID No: 184); agcagttataataactgccg (SEQ ID No: 186); and QQLLP (SEQ ID No: 206)).



FIG. 4 depicts an experimental design to test different MAD nuclease and guide nucleic acid combinations. An expression cassette encoding the MAD nuclease, or the MAD nuclease protein, was added to host cells along with various editing cassettes as described above. In this example, the guide nucleic acids were engineered to target the galK gene in the host cell, and the editing sequence was designed to mutate the targeted galK gene in order to turn the gene off, thereby allowing for screening of successfully edited cells. This design was used for identification of functional or compatible MAD nuclease and guide nucleic acid combinations. Editing efficiency was determined by qPCR to measure the editing plasmid in the recovered cells in a high-throughput manner. Validation of MAD11 and Cas9 primers is shown in FIGS. 14A and 14B. These results show that the selected primer pairs are orthogonal and allow quantitative measurement of input plasmid DNA.



FIGS. 5A-5B depict a similar experimental design. In this case, the editing cassette (FIG. 5B) further comprises a selectable marker, here kanamycin resistance (kan), and the MAD nuclease expression vector (FIG. 5A) further comprises a selectable marker, here chloramphenicol resistance (Cm), and the lambda RED recombination system to aid homologous recombination (HR) of the editing sequence into the target sequence. A compatible MAD nuclease and guide nucleic acid combination will cause a double strand break in the target sequence if a PAM sequence is present. Since the editing sequence (e.g., FIG. 3) contains a PAM mutation that is not recognized by the MAD nuclease, edited cells that contain the PAM mutation survive cleavage by the MAD nuclease, while wild-type non-edited cells die (FIG. 5C). The editing sequence further comprises a mutation in the galK gene that allows for screening of edited cells, while the MAD nuclease expression vector and editing cassette contain drug selection markers, allowing for selection of edited cells.


Using these methods, compatible guide nucleic acids for MAD1-MAD20 were tested. Twenty scaffold sequences were tested. The guide nucleic acids used in the experiments contained one of the twenty scaffold sequences, referred to as scaffold-1, scaffold-2, etc., and a guide sequence that targets the galK gene. Sequences for Scaffold-1 through Scaffold-20 are listed as SEQ ID NOs: 84-103, respectively. It should be understood that the guide sequence of the guide nucleic acid is variable and can be engineered or designed to target any desired target sequence. Since MAD2 does not have an endogenous scaffold sequence to test, a scaffold sequence from a close homolog (scaffold-2, SEQ ID NO: 85) was tested and found to be a non-functional pair, meaning MAD2 and scaffold-2 were not compatible. Therefore, MAD2 was tested with the other nineteen scaffold sequences, despite the low sequence homology between MAD2 and the other MAD nucleases.


This workflow could also be used to identify or test PAM sequences compatible with a given MAD nuclease. Another method for identifying a PAM site is described in the next example.


In general, for the assays described, transformations were carried out as follows. E. coli strains expressing the codon optimized MAD nucleases were grown overnight. Saturated cultures were diluted 1/100, grown to an OD600 of 0.6, and induced by adding arabinose at a final concentration of 0.4% and (if a temperature sensitive plasmid was used) shifting the culture to 42 degrees Celsius in a shaking water bath. Following induction, cells were chilled on ice for 15 min prior to washing thrice with ¼ the initial culture volume of 10% glycerol (for example, 50 mL washes for a 200 mL culture). Cells were resuspended in 1/100 the initial volume (for example, 2 mL for a 200 mL culture) and stored at −90 degrees Celsius until ready to use. To perform the compatibility and editing efficiency screens described here, 50 ng of editing cassette was transformed into cell aliquots by electroporation. Following electroporation, the cells were recovered in LB for 3 hours and 100 μL of cells were plated on MacConkey plates containing 1% galactose.
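As a worked illustration of the volume ratios given above, the wash and resuspension volumes may be scaled to any starting culture volume. The function and key names are hypothetical; only the ¼ and 1/100 ratios are taken from the protocol.

```python
def competent_cell_volumes(culture_ml: float) -> dict:
    """Scale the wash and resuspension volumes to the initial culture volume."""
    if culture_ml <= 0:
        raise ValueError("culture volume must be positive")
    return {
        "glycerol_wash_ml": culture_ml / 4,    # each of the three 10% glycerol washes
        "resuspension_ml": culture_ml / 100,   # final resuspension before storage
    }


# Reproduces the worked example in the text for a 200 mL culture.
volumes_200 = competent_cell_volumes(200)
```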


Editing efficiencies were determined by dividing the number of white colonies (edited cells) by the total number of white and red colonies (edited and non-edited cells).
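The colony-count calculation described above may be sketched as follows. The function name and the example colony counts are hypothetical and are not experimental results.

```python
def editing_efficiency(white_colonies: int, red_colonies: int) -> float:
    """Fraction of edited cells: white / (white + red) colonies."""
    if white_colonies < 0 or red_colonies < 0:
        raise ValueError("colony counts must be non-negative")
    total = white_colonies + red_colonies
    if total == 0:
        raise ValueError("no colonies counted")
    return white_colonies / total


# Hypothetical plate: 98 white (edited) and 2 red (non-edited) colonies.
example_efficiency = editing_efficiency(98, 2)
```

Multiplying the returned fraction by 100 gives the efficiency as a percentage.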


Example 4
PAM Selection Assay

In order to generate a double strand break in a target sequence, a guide nucleic acid must hybridize to a target sequence, and the MAD nuclease must recognize a PAM sequence adjacent to the target sequence. If the guide nucleic acid hybridizes to the target sequence, but the MAD nuclease does not recognize a PAM site, then cleavage does not occur.


A PAM is MAD nuclease-specific and not all MAD nucleases necessarily recognize the same PAM. In order to assess the PAM site requirements for the MAD nucleases, an assay as depicted in FIGS. 6A-6C was performed.



FIG. 6A depicts a MAD nuclease expression vector as described elsewhere, which also contains a chloramphenicol resistance gene and the lambda RED recombination system.



FIG. 6B depicts a self-targeting editing cassette. The guide nucleic acid is designed to target the target sequence which is contained on the same nucleic acid molecule. The target sequence is flanked by random nucleotides, depicted by N4, meaning four random nucleotides on either end of the target sequence. It should be understood that any number of random nucleotides could also be used (for example, 3, 5, 6, 7, 8, etc.). The random nucleotides serve as a library of potential PAMs.
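To illustrate the size of such a library, the possible sequences of one randomized flank may be enumerated as in the sketch below. The function name is hypothetical, and the sketch assumes that each random position can be any of the four DNA bases with no bias.

```python
from itertools import product


def random_flank_library(n: int = 4, alphabet: str = "ACGT") -> list:
    """Enumerate every possible sequence for a flank of n random nucleotides."""
    if n < 0:
        raise ValueError("n must be non-negative")
    return ["".join(bases) for bases in product(alphabet, repeat=n)]


# An N4 flank yields 4**4 = 256 candidate PAM sequences on one side of the target.
n4_library = random_flank_library(4)
```

Changing `n` to 3, 5, 6, 7, or 8 gives the library sizes for the other flank lengths mentioned above.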



FIG. 6C depicts the experimental design. Basically, the MAD nuclease expression vector and editing cassette comprising the random PAM sites were transformed into a host cell. If a functional targetable nuclease complex was formed and the MAD nuclease recognized a PAM site, then the editing cassette vector was cleaved, which led to cell death. If a functional targetable complex was not formed or if the MAD nuclease did not recognize the PAM, then the target sequence was not cleaved and the cell survived. Next-generation sequencing (NGS) was then used to sequence the starting and final cell populations in order to determine what PAM sites were recognized by a given MAD nuclease. These recognized PAM sites were then used to determine a consensus or non-consensus PAM for a given MAD nuclease.
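One way the comparison of starting and final populations could be analyzed is sketched below: PAM sequences whose relative read frequency drops sharply after selection are treated as recognized. This is an illustrative assumption, not the analysis actually used; the function name, the pseudocount, the log2 depletion threshold, and the read counts are all hypothetical.

```python
from math import log2


def depleted_pams(start_counts, final_counts, min_log2_depletion=2.0, pseudocount=1.0):
    """Return {PAM: log2 depletion} for PAMs depleted in the final population.

    Frequencies are normalized within each population; a pseudocount avoids
    division by zero for PAMs that disappear entirely after selection.
    """
    n_pams = len(start_counts)
    start_total = sum(start_counts.values()) + pseudocount * n_pams
    final_total = sum(final_counts.get(p, 0) for p in start_counts) + pseudocount * n_pams
    depleted = {}
    for pam, start in start_counts.items():
        start_freq = (start + pseudocount) / start_total
        final_freq = (final_counts.get(pam, 0) + pseudocount) / final_total
        depletion = log2(start_freq / final_freq)
        if depletion >= min_log2_depletion:
            depleted[pam] = depletion
    return depleted


# Hypothetical read counts for four PAM candidates before and after selection.
start = {"TTTA": 100, "TTTC": 100, "GGGA": 100, "ACGT": 100}
final = {"TTTA": 2, "TTTC": 3, "GGGA": 150, "ACGT": 145}
recognized = depleted_pams(start, final)
```

In this made-up example only the two TTT-prefixed candidates pass the threshold, because cells carrying them were cleaved and lost from the final population.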


The consensus PAM for MAD1-MAD8 and MAD10-MAD12 was determined to be TTTN. The consensus PAM for MAD9 was determined to be NNG. The consensus PAM for MAD13-MAD15 was determined to be TTN. The consensus PAM for MAD16-MAD18 was determined to be TA. The consensus PAM for MAD19-MAD20 was determined to be TTCN.
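The consensus PAMs above use N to denote any nucleotide. A minimal sketch of checking a candidate sequence against such a pattern is given below. The function and dictionary names are hypothetical, and only the symbols A, C, G, T, and N are supported in this sketch.

```python
# Nucleotide codes used by the consensus PAMs listed above (N = any base).
NUCLEOTIDE_CODES = {"A": "A", "C": "C", "G": "G", "T": "T", "N": "ACGT"}

# Consensus PAMs as reported in this example.
CONSENSUS_PAMS = {
    "MAD1-MAD8 and MAD10-MAD12": "TTTN",
    "MAD9": "NNG",
    "MAD13-MAD15": "TTN",
    "MAD16-MAD18": "TA",
    "MAD19-MAD20": "TTCN",
}


def matches_pam(candidate: str, pattern: str) -> bool:
    """True if `candidate` has the pattern's length and fits it base by base."""
    if len(candidate) != len(pattern):
        return False
    return all(
        base in NUCLEOTIDE_CODES[symbol]
        for base, symbol in zip(candidate.upper(), pattern.upper())
    )


ttta_fits_tttn = matches_pam("TTTA", CONSENSUS_PAMS["MAD1-MAD8 and MAD10-MAD12"])
```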


Example 5
Testing Heterologous Guide Nucleic Acids

Editing efficiencies were tested for MAD1, MAD2, MAD4, and MAD7 and are depicted in FIG. 7A and FIG. 7B. Experiment details and editing efficiencies are summarized in Table 3. Editing efficiency was determined by dividing the number of edited cells by the total number of recovered cells. Various editing cassettes targeting the galK gene were used to allow screening of edited cells. The guide nucleic acids encoded on the editing cassette contained a guide sequence targeting the galK gene and one of various scaffold sequences in order to test the compatibility of the indicated MAD nuclease with the indicated scaffold sequence, as summarized in Table 3.


Compatible pairs of MAD nuclease and guide nucleic acid (comprising the indicated scaffold sequences) showed editing efficiencies between 75% and 100%. MAD2 had an editing efficiency between 75% and 100%, and MAD7 had an editing efficiency between 97% and 100%.


MAD2 combined with scaffold-1, scaffold-2, scaffold-4, or scaffold-13 in these experiments resulted in 0% editing efficiency. These data imply that MAD2 did not form a functional complex with these tested guide nucleic acids and that MAD2 is not compatible with these scaffold sequences.


MAD7 combined with scaffold-1, scaffold-2, scaffold-4, or scaffold-13 in these experiments resulted in 0% editing efficiency. These data imply that MAD7 did not form a functional complex with these tested guide nucleic acids and that MAD7 is not compatible with these scaffold sequences.


For MAD1 and MAD4, all tested guide nucleic acid combinations resulted in 0% editing efficiency, implying that MAD1 and MAD4 did not form a functional complex with any of the tested guide nucleic acids. These data also imply that MAD1 and MAD4 are not compatible with the tested scaffold sequences.


Combined, these data highlight the unpredictability of finding a compatible MAD nuclease and scaffold sequence pair in order to form a functional targetable nuclease complex. Some tested MAD nucleases did not function with any tested scaffold sequence. Some tested MAD nucleases only functioned with some tested scaffold sequences and not with others.














TABLE 3

#   Nucleic acid-guided nuclease  Guide nucleic acid scaffold sequence  Editing sequence mutation  Target gene  Editing efficiency
1   MAD1                          Scaffold-1; SEQ ID NO: 84             L80**                      galK         0%
2   MAD1                          Scaffold-2; SEQ ID NO: 85             Y145**                     galK         0%
3   MAD1                          Scaffold-4; SEQ ID NO: 87             Y145**                     galK         0%
4   MAD1                          Scaffold-10; SEQ ID NO: 93            Y145**                     galK         0%
5   MAD1                          Scaffold-11; SEQ ID NO: 94            L80**                      galK         0%
6   MAD1                          Scaffold-12; SEQ ID NO: 95            L10KpnI                    galK         0%
7   MAD1                          Scaffold-13; SEQ ID NO: 96            Y145**                     galK         0%
8   MAD1                          Scaffold-12; SEQ ID NO: 95            L10KpnI                    galK         0%
9   MAD2                          Scaffold-10; SEQ ID NO: 93            L80**                      galK         0%
10  MAD2                          Scaffold-10; SEQ ID NO: 93            Y145**                     galK         100%
11  MAD2                          Scaffold-11; SEQ ID NO: 94            L80**                      galK         98%
12  MAD2                          Scaffold-11; SEQ ID NO: 94            Y145**                     galK         99%
13  MAD2                          Scaffold-12; SEQ ID NO: 95            Y145**                     galK         98%
14  MAD2                          Scaffold-12; SEQ ID NO: 95            Y145**                     galK         0%
15  MAD2                          Scaffold-13; SEQ ID NO: 96            Y145**                     galK         0%
16  MAD2                          Scaffold-1; SEQ ID NO: 84             L80**                      galK         0%
17  MAD2                          Scaffold-2; SEQ ID NO: 85             Y145**                     galK         0%
18  MAD2                          Scaffold-2; SEQ ID NO: 85             Y145**                     galK         0%
19  MAD2                          Scaffold-4; SEQ ID NO: 87             Y145**                     galK         0%
20  MAD2                          Scaffold-5; SEQ ID NO: 88             L80**                      galK         99%
21  MAD2                          Scaffold-12; SEQ ID NO: 95            89**                       galK         0%
22  MAD2                          Scaffold-12; SEQ ID NO: 95            70**                       galK         75%
23  MAD2                          Scaffold-12; SEQ ID NO: 95            L10KpnI                    galK         79%
24  MAD4                          Scaffold-1; SEQ ID NO: 84             L80**                      galK         0%
25  MAD4                          Scaffold-2; SEQ ID NO: 85             Y145**                     galK         0%
26  MAD4                          Scaffold-4; SEQ ID NO: 87             Y145**                     galK         0%
27  MAD4                          Scaffold-10; SEQ ID NO: 93            Y145**                     galK         0%
28  MAD4                          Scaffold-11; SEQ ID NO: 94            L80**                      galK         0%
29  MAD4                          Scaffold-12; SEQ ID NO: 95            L10KpnI                    galK         0%
30  MAD4                          Scaffold-13; SEQ ID NO: 96            Y145**                     galK         0%
31  MAD4                          Scaffold-12; SEQ ID NO: 95            L10KpnI                    galK         0%
32  MAD7                          Scaffold-1; SEQ ID NO: 84             L80**                      galK         0%
33  MAD7                          Scaffold-2; SEQ ID NO: 85             Y145**                     galK         0%
34  MAD7                          Scaffold-4; SEQ ID NO: 87             Y145**                     galK         0%
35  MAD7                          Scaffold-10; SEQ ID NO: 93            Y145**                     galK         100%
36  MAD7                          Scaffold-11; SEQ ID NO: 94            L80**                      galK         97%
37  MAD7                          Scaffold-12; SEQ ID NO: 95            L10KpnI                    galK         0%
38  MAD7                          Scaffold-13; SEQ ID NO: 96            Y145**                     galK         0%
39  MAD7                          Scaffold-12; SEQ ID NO: 95            L10KpnI                    galK         0%









Example 6
Assessment of MAD2 and MAD7

The ability of MAD2 and MAD7 to function with heterologous guide nucleic acids was tested using an experimental design similar to that described above.


The compatibility of MAD2 with other scaffold sequences was tested and the results of an experiment are depicted in FIG. 8. The MAD nucleases, guide nucleic acid scaffold sequences, and editing sequences used in this experiment are summarized in Table 4.


The compatibility of MAD7 with other scaffold sequences was tested and the results of an experiment are depicted in FIG. 9. The MAD nucleases, guide nucleic acid scaffold sequences, and editing sequences used in this experiment are summarized in Table 5.













TABLE 4

#   Nucleic acid-guided nuclease  Guide nucleic acid scaffold sequence  Editing sequence mutation  Target gene
1   MAD2                          Scaffold-12; SEQ ID NO: 95            N89KpnI                    galK
2   MAD2                          Scaffold-10; SEQ ID NO: 93            L80**                      galK
3   MAD2                          Scaffold-5; SEQ ID NO: 88             L80**                      galK
4   MAD2                          Scaffold-12; SEQ ID NO: 95            D70KpnI                    galK
5   MAD2                          Scaffold-12; SEQ ID NO: 95            Y145**                     galK
6   MAD2                          Scaffold-11; SEQ ID NO: 94            Y145**                     galK
7   MAD2                          Scaffold-10; SEQ ID NO: 93            Y145**                     galK
8   MAD2                          Scaffold-12; SEQ ID NO: 95            L10KpnI                    galK
9   MAD2                          Scaffold-11; SEQ ID NO: 94            L80**                      galK
10  SpCas9                        S. pyogenes gRNA                      Y145**                     galK
11  MAD2                          Scaffold-2; SEQ ID NO: 85             Y145**                     galK
12  MAD2                          Scaffold-4; SEQ ID NO: 87             Y145**                     galK
13  MAD2                          Scaffold-1; SEQ ID NO: 84             L80**                      galK
14  MAD2                          Scaffold-13; SEQ ID NO: 96            Y145**                     galK




















TABLE 5

#   Nucleic acid-guided nuclease  Guide nucleic acid scaffold sequence  Editing sequence mutation  Target gene
1   MAD7                          Scaffold-1; SEQ ID NO: 84             L80**                      galK
2   MAD7                          Scaffold-2; SEQ ID NO: 85             Y145**                     galK
3   MAD7                          Scaffold-4; SEQ ID NO: 87             Y145**                     galK
4   MAD7                          Scaffold-10; SEQ ID NO: 93            Y145**                     galK
5   MAD7                          Scaffold-11; SEQ ID NO: 94            L80**                      galK









In another experiment, transformation efficiencies (FIG. 10B) were determined by calculating the total number of recovered cells compared to the starting number of cells. An example plate image is depicted in FIG. 10C. Editing efficiencies (FIG. 10A) were determined by calculating the ratio of edited colonies (white colonies, edited galK gene) to total colonies.
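The two ratios described above can be sketched in a few lines of Python. This is an illustrative sketch only: the function names and the example counts are assumptions introduced here and are not taken from the figures or from any protocol in this disclosure.

```python
def editing_efficiency(white_colonies: int, total_colonies: int) -> float:
    """Ratio of edited (white) colonies to total colonies on a plate."""
    if total_colonies <= 0:
        raise ValueError("total_colonies must be positive")
    if not 0 <= white_colonies <= total_colonies:
        raise ValueError("white_colonies must be between 0 and total_colonies")
    return white_colonies / total_colonies


def transformation_efficiency(recovered_cells: float, starting_cells: float) -> float:
    """Recovered cells expressed as a fraction of the starting number of cells."""
    if starting_cells <= 0:
        raise ValueError("starting_cells must be positive")
    if recovered_cells < 0:
        raise ValueError("recovered_cells must be non-negative")
    return recovered_cells / starting_cells


if __name__ == "__main__":
    # Hypothetical counts chosen for illustration; not data from FIG. 10A-10C.
    print(f"editing efficiency: {editing_efficiency(98, 100):.0%}")
    print(f"transformation efficiency: {transformation_efficiency(5e4, 1e8):.1e}")
```

Keeping the two measures as separate functions mirrors the text, in which a nuclease can show a lower transformation efficiency and a higher editing efficiency at the same time.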


In this example (FIGS. 10A-10C), cells expressing galK were transformed with expression constructs expressing either MAD2 or MAD7 and a corresponding editing cassette comprising a guide nucleic acid targeting the galK gene. The guide nucleic acid comprised a guide sequence targeting the galK gene and the scaffold-12 sequence (SEQ ID NO: 95).


In the depicted example, MAD2 and MAD7 had a lower transformation efficiency than S. pyogenes Cas9, though the editing efficiency of MAD2 and MAD7 was slightly higher than that of S. pyogenes Cas9.



FIG. 11 depicts the sequencing results from select colonies recovered from the assay described above. The target sequence was in the galK coding sequence (CDS). The TTTN PAM is shown as the reverse complement (wild-type NAAA, mutated NGAA). The mutations targeted by the editing sequence are labeled as target codons. Changes compared to the wild-type sequence are highlighted. In these experiments, the scaffold-12 sequence (SEQ ID NO: 95) was used. The guide sequence of the guide nucleic acid targeted the galK gene.
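The relationship between the TTTN PAM and its displayed reverse complement (NAAA) can be checked with a short Python sketch. The helper names below are assumptions made for illustration; the only facts taken from the text are the motifs TTTN, NAAA, and NGAA.

```python
# Complement table for the four bases plus the IUPAC wildcard N.
_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string (A, C, G, T, N only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def matches_motif(site: str, motif: str) -> bool:
    """True if `site` matches `motif`, where N in the motif matches any base."""
    site, motif = site.upper(), motif.upper()
    return len(site) == len(motif) and all(
        m == "N" or m == s for s, m in zip(site, motif)
    )


if __name__ == "__main__":
    displayed_pam = reverse_complement("TTTN")  # the motif as shown on the opposite strand
    print(displayed_pam)
    # A hypothetical wild-type site and a hypothetical mutated site on the displayed strand.
    print(matches_motif("CAAA", displayed_pam))
    print(matches_motif("CGAA", displayed_pam))
```

Under these assumptions, a site that reads NGAA on the displayed strand no longer matches the NAAA motif, which is the sense in which the designed PAM mutation removes the PAM.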


Six of the seven depicted sequences from the MAD2 experiment contained the designed PAM mutation and the designed mutations in the target codons of galK. One sequenced colony maintained the wild-type PAM and wild-type target codons but also contained an unintended mutation upstream of the target site.


Two of the four depicted sequences from the MAD7 experiment contained the designed PAM mutation and mutated target codons. One colony contained a wild-type sequence, while another contained a deletion of eight nucleotides upstream of the target sequence.



FIG. 12 depicts results from another experiment testing the ability to recover edited cells. In Experiment 0, the MAD2 nuclease was used with a guide nucleic acid comprising the scaffold-11 sequence and a guide sequence targeting galK. The editing cassette comprised an editing sequence designed to incorporate an L80** mutation into galK, thereby allowing screening of the edited cells. In Experiment 1, the MAD2 nuclease was used with a guide nucleic acid comprising the scaffold-12 sequence and a guide sequence targeting galK. The editing cassette comprised an editing sequence designed to incorporate an L10KpnI mutation into galK. In both experiments, a negative control plasmid comprising a guide nucleic acid that is not compatible with MAD2 was included in the transformations. Following transformation, the ratio of the compatible editing cassette (those containing scaffold-11 or scaffold-12 guide nucleic acids) to the non-compatible editing cassette (negative control) was measured. The experiments were done in the presence or absence of selection. The results show that more cells containing the compatible editing cassette were recovered than cells containing the non-compatible editing cassette, and this difference is magnified when selection is used.


Example 7
Guide Nucleic Acid Characterization

The sequences of scaffolds 1-8 and 10-12 (SEQ ID NO: 84-91 and 93-95) were aligned and are depicted in FIG. 13A. Nucleotides that match the consensus sequence are faded, while those diverging from the consensus sequence are visible. The predicted pseudoknot region is indicated. Without being bound by theory, the region 5′ of the pseudoknot may influence binding and/or kinetics of the nucleic acid-guided nuclease. As is shown in FIG. 13A, in general, there appears to be less variability in the pseudoknot region (e.g., SEQ ID NO: 172-181) as compared to the sequence outside of the pseudoknot region.
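The kind of display described for FIG. 13A (consensus-matching positions faded, divergent positions visible) can be sketched as follows. This is a simplified illustration under stated assumptions: the input sequences are short toy strings invented for the example, not the scaffold sequences of SEQ ID NO: 84-95; they are assumed to be pre-aligned and of equal length; and the consensus is taken as the most common character in each column.

```python
from collections import Counter
from typing import List


def consensus(aligned: List[str]) -> str:
    """Most common character per column of equal-length, pre-aligned sequences."""
    if len({len(s) for s in aligned}) != 1:
        raise ValueError("aligned sequences must all have the same length")
    return "".join(Counter(col).most_common(1)[0][0] for col in zip(*aligned))


def mask_matches(aligned: List[str], fade: str = ".") -> List[str]:
    """Replace positions that match the consensus with `fade`; keep divergent ones."""
    cons = consensus(aligned)
    return ["".join(fade if c == k else c for c, k in zip(s, cons)) for s in aligned]


def column_variability(aligned: List[str]) -> List[int]:
    """Number of distinct characters observed in each alignment column."""
    return [len(set(col)) for col in zip(*aligned)]


if __name__ == "__main__":
    # Toy aligned RNA strings (hypothetical, for illustration only).
    toy = ["GAUUUCUACU", "GAAUUCUACU", "CAUUUCUACU"]
    print(consensus(toy))
    for row in mask_matches(toy):
        print(row)
    print(column_variability(toy))
```

Comparing the per-column variability inside and outside a region of interest is one simple way to express the observation that a region such as the pseudoknot shows less variability than its surroundings.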



FIG. 13B shows a preliminary model of MAD2 and MAD12 complexed with a guide nucleic acid (in this example, a guide RNA) and target sequence (DNA).


Example 8
Editing Efficiency of the MAD Nucleases

A plate-based editing efficiency assay and a molecular editing efficiency assay were used to test editing efficiency of various MAD nuclease and guide nucleic acid combinations.



FIG. 15 depicts quantification of the data obtained using the molecular editing efficiency assay using MAD2 nuclease with a guide nucleic acid comprising scaffold-12 and a guide sequence targeting galK. The indicated mutations were incorporated into galK using corresponding editing cassettes containing the mutation. FIG. 16 shows the comparison of the editing efficiencies determined by the plate-based assay using white and red colonies as described previously, and the molecular editing efficiency assay. As shown in FIG. 16, the editing efficiencies as determined by the two separate assays are consistent.


Example 9
Trackable Editing

Genetic edits can be tracked by the use of a barcode. A barcode can be incorporated into or near the edit site as described in the present specification. When multiple rounds of engineering are being performed, with a different edit being made in each round, it may be beneficial to insert a barcode in a common region during each round of engineering. In this way, one could sequence a single site and obtain the sequences of all of the barcodes from each round without the need to sequence each edited site individually. FIGS. 17A-17C, 18, and 19 depict examples of such trackable engineering workflows.


As depicted in FIG. 17A, a cell expressing a MAD nuclease is transformed with a plasmid containing an editing cassette and a recording cassette. The editing cassette contains a PAM mutation and a gene edit. The recorder cassette comprises a barcode, in this case 15N. The editing cassette and the recording cassette each comprise a guide nucleic acid to a distinct target sequence. Within a library of such plasmids, the recorder cassette for each round can contain the same guide nucleic acid, such that the first round barcode is inserted into the same location across all variants, regardless of which editing cassette and corresponding gene edit is used. The correlation between the barcode and the editing cassette is determined beforehand, such that the edit can be identified by sequencing the barcode. FIG. 17B shows an example of a recording cassette designed to delete a PAM site while incorporating a 15N barcode (actatcaatgggctaactnnnnnnnnnnnnnnntgaaacatctgcaactgcg (SEQ ID No: 203); actatcaatgggctaactacgttcgtggcgtggtgaaacatctgcaactgcg (SEQ ID No: 204)). The deleted PAM is used to enrich for edited cells, since cells with the mutated PAM escape cell death while cells containing a wild-type PAM sequence are killed. FIG. 17C depicts how sequencing the barcode region can be used to identify which edit is comprised within each cell.
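A minimal Python sketch of reading a barcode from a sequenced recorder site is given below. The flanking sequences are taken from the constant portions of SEQ ID NO: 203, and the example read is SEQ ID NO: 204; everything else (the function name, the lookup table, and the edit label) is a hypothetical illustration and not a described implementation.

```python
import re
from typing import Optional

# Constant regions on either side of the 15N barcode in SEQ ID NO: 203.
LEFT_FLANK = "actatcaatgggctaact"
RIGHT_FLANK = "tgaaacatctgcaactgcg"
BARCODE_RE = re.compile(LEFT_FLANK + "([acgt]{15})" + RIGHT_FLANK)


def extract_barcode(read: str) -> Optional[str]:
    """Return the 15-nt barcode between the two flanks, or None if absent."""
    cleaned = read.lower().replace(" ", "")  # tolerate case and spacing differences
    match = BARCODE_RE.search(cleaned)
    return match.group(1) if match else None


if __name__ == "__main__":
    # Hypothetical barcode-to-edit table; in practice this correlation is
    # determined beforehand when the plasmid library is designed.
    barcode_to_edit = {"acgttcgtggcgtgg": "hypothetical edit A"}

    read = "actatcaatgggctaactacgttcgtggcgtggtgaaacatctgcaactgcg"  # SEQ ID NO: 204
    barcode = extract_barcode(read)
    print(barcode, "->", barcode_to_edit.get(barcode, "unknown barcode"))
```

Because the barcode sits at a fixed location between known flanks, a single amplicon and a table lookup are enough to assign an edit to each cell in this sketch.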


A similar approach is depicted in FIG. 18. In this case, the recorder cassette from each round is designed to target a sequence adjacent to the previous round, and each time, a new PAM site is deleted by the recorder cassette. The result is a barcode array with the barcodes from each round that can be sequenced to confirm that each round of engineering took place and to determine which combination of mutations is contained in the cell, and in which order the mutations were made. Each successive recorder cassette can be designed to be homologous on one end to the region comprising the mutated PAM from the previous round, which could increase the efficiency of getting fully edited cells at the end of the experiment. In other examples, the recorder cassette is designed to target a unique landing site that was incorporated by the previous recorder cassette. This increases the efficiency of recovering cells containing all of the desired mutations since the subsequent recorder cassette and barcode can only target a cell that has successfully completed the previous round of engineering.
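Decoding such a barcode array into an ordered edit history can be sketched as below. The sketch assumes the barcodes have already been extracted from the array in round order and that one lookup table per round was defined beforehand; the table contents, barcode strings, and function names are hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Optional


def decode_barcode_array(
    barcodes: List[str], round_tables: List[Dict[str, str]]
) -> List[Optional[str]]:
    """Map the barcode from each round to its edit; None marks an unknown barcode."""
    if len(barcodes) > len(round_tables):
        raise ValueError("more barcodes than engineering rounds")
    return [round_tables[i].get(barcode) for i, barcode in enumerate(barcodes)]


def completed_all_rounds(history: List[Optional[str]], n_rounds: int) -> bool:
    """True only if every round contributed a recognised barcode."""
    return len(history) == n_rounds and all(edit is not None for edit in history)


if __name__ == "__main__":
    # Hypothetical per-round lookup tables (barcode -> edit label).
    round_tables = [
        {"aaaccc": "round 1: edit X", "gggttt": "round 1: edit Y"},
        {"acacac": "round 2: edit Z"},
    ]
    history = decode_barcode_array(["gggttt", "acacac"], round_tables)
    print(history)
    print(completed_all_rounds(history, n_rounds=len(round_tables)))
```

The position of each barcode in the array gives the order in which the edits were made, so the decoded list doubles as a record of which rounds took place.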



FIG. 19 depicts another approach that allows the recycling of selectable markers or otherwise cures the cell of the plasmid from the previous round of engineering. In this case, the transformed plasmid contains a guide nucleic acid designed to target a selectable marker or other unique sequence in the plasmid from the previous round of engineering.


Sequence Listing









TABLE 6





SEQ ID NO:  Sequence







SEQ
MGKMYYLGLDIGTNSVGYAVTDPSYHLLKFKGEPMWGAHVFAAGNQSAERRSFRTSRRRLDRRQQRVK


ID
LVQEIFAPVISPIDPRFFIRLHESALWRDDVAETDKHIFFNDPTYTDKEYYSDYPTIHHLIVDLMESSEKHDP


NO:
RLVYLAVAWLVAHRGHFLNEVDKDNIGDVLSFDAFYPEFLAFLSDNGVSPWVCESKALQATLLSRNSVN


1
DKYKALKSLIFGSQKPEDNFDANISEDGLIQLLAGKKVKVNKLFPQESNDASFTLNDKEDAIEEILGTLTPD



ECEWIAHIRRLFDWAIMKHALKDGRTISESKVKLYEQHHHDLTQLKYFVKTYLAKEYDDIFRNVDSETTK



NYVAYSYHVKEVKGTLPKNKATQEEFCKYVLGKVKNIECSEADKVDFDEMIQRLTDNSFMPKQVSGENR



VIPYQLYYYELKTILNKAASYLPFLTQCGKDAISNQDKLLSIMTFRIPYFVGPLRKDNSEHAWLERKAGKIY



PWNFNDKVDLDKSEEAFIRRMTNTCTYYPGEDVLPLDSLIYEKFMILNEINNIRIDGYPISVDVKQQVFGLF



EKKRRVTVKDIQNLLLSLGALDKHGKLTGIDTTIHSNYNTYHHFKSLMERGVLTRDDVERIVERMTYSDD



TKRVRLWLNNNYGTLTADDVKHISRLRKHDFGRLSKMFLTGLKGVHKETGERASILDFMWNTNDNLMQ



LLSECYTFSDEITKLQEAYYAKAQLSLNDFLDSMYISNAVKRPIYRTLAVVNDIRKACGTAPKRIFIEMARD



GESKKKRSVTRREQIKNLYRSIRKDFQQEVDFLEKILENKSDGQLQSDALYLYFAQLGRDMYTGDPIKLEH



IKDQSFYNIDHIYPQSMVKDDSLDNKVLVQSEINGEKSSRYPLDAAIRNKMKPLWDAYYNHGLISLKKYQ



RLTRSTPFTDDEKWDFINRQLVETRQSTKALAILLKRKFPDTEIVYSKAGLSSDFRHEFGLVKSRNINDLHH



AKDAFLAIVTGNVYHERFNRRWFMVNQPYSVKTKTLFTHSIKNGNFVAWNGEEDLGRIVKMLKQNKNTI



HFTRFSFDRKEGLFDIQPLKASTGLVPRKAGLDVVKYGGYDKSTAAYYLLVRFTLEDKKTQHKLMMIPVE



GLYKARIDHDKEFLTDYAQTTISEILQKDKQKVINIMFPMGTRHIKLNSMISIDGFYLSIGGKSSKGKSVLCH



AMVPLIVPHKIECYIKAMESFARKFKENNKLRIVEKFDKITVEDNLNLYELFLQKLQHNPYNKFFSTQFDVL



TNGRSTFTKLSPEEQVQTLLNILSIFKTCRSSGCDLKSINGSAQAARIMISADLTGLSKKYSDIRLVEQSASGL



FVSKSQNLLEYL*





SEQ
MSSLTKFTNKYSKQLTIKNELIPVGKTLENIKENGLIDGDEQLNENYQKAKIIVDDFLRDFINKALNNTQIG


ID
NWRELADALNKEDEDNIEKLQDKIRGIIVSKFETFDLFSSYSIKKDEKIIDDDNDVEEEELDLGKKTSSFKYI


NO:
FKKNLFKLVLPSYLKTTNQDKLKIISSFDNFSTYFRGFFENRKNIFTKKPISTSIAYRIVHDNFPKFLDNIRCFN


2
VWQTECPQLIVKADNYLKSKNVIAKDKSLANYFTVGAYDYFLSQNGIDFYNNIIGGLPAFAGHEKIQGLNE



FINQECQKDSELKSKLKNRHAFKMAVLFKQILSDREKSFVIDEFESDAQVIDAVKNFYAEQCKDNNVIFNL



LNLIKNIAFLSDDELDGIFIEGKYLSSVSQKLYSDWSKLRNDIEDSANSKQGNKELAKKIKTNKGDVEKAIS



KYEFSLSELNSIVHDNTKFSDLLSCTLHKVASEKLVKVNEGDWPKHLKNNEEKQKIKEPLDALLEIYNTLLI



FNCKSFNKNGNFYVDYDRCINELSSVVYLYNKTRNYCTKKPYNTDKFKLNFNSPQLGEGFSKSKENDCLT



LLFKKDDNYYVGIIRKGAKINFDDTQAIADNTDNCIFKMNYFLLKDAKKFIPKCSIQLKEVKAHFKKSEDD



YILSDKEKFASPLVIKKSTFLLATAHVKGKKGNIKKFQKEYSKENPTEYRNSLNEWIAFCKEFLKTYKAATI



FDITTLKKAEEYADIVEFYKDVDNLCYKLEFCPIKTSFIENLIDNGDLYLFRINNKDFSSKSTGTKNLHTLYL



QAIFDERNLNNPTIMLNGGAELFYRKESIEQKNRITHKAGSILVNKVCKDGTSLDDKIRNEIYQYENKFIDT



LSDEAKKVLPNVIKKEATHDITKDKRFTSDKFFFHCPLTINYKEGDTKQFNNEVLSFLRGNPDINIIGIDRGE



RNLIYVTVINQKGEILDSVSFNTVTNKSSKIEQTVDYEEKLAVREKERIEAKRSWDSISKIATLKEGYLSAIV



HEICLLMIKHNAIVVLENLNAGFKRIRGGLSEKSVYQKFEKMLINKLNYFVSKKESDWNKPSGLLNGLQLS



DQFESFEKLGIQSGFIFYVPAAYTSKIDPTTGFANVLNLSKVRNVDAIKSFFSNFNEISYSKKEALFKFSFDLD



SLSKKGFSSFVKFSKSKWNVYTFGERIIKPKNKQGYREDKRINLTFEMKKLLNEYKVSFDLENNLIPNLTSA



NLKDTFWKELFFIFKTTLQLRNSVTNGKEDVLISPVKNAKGEFFVSGTHNKTLPQDCDANGAYHIALKGL



MILERNNLVREEKDTKKIMAISNVDWFEYVQKRRGVL*





SEQ
MNNYDEFTKLYPIQKTIRFELKPQGRTMEHLETFNFFEEDRDRAEKYKILKEAIDEYHKKFIDEHLTNMSLD


ID
WNSLKQISEKYYKSREEKDKKVFLSEQKRMRQEIVSEFKKDDRFKDLFSKKLFSELLKEEIYKKGNHQEID


NO:
ALKSFDKFSGYFIGLHENRKNMYSDGDEITAISNRIVNENFPKFLDNLQKYQEARKKYPEWIIKAESALVA


3
HNIKMDEVFSLEYFNKVLNQEGIQRYNLALGGYVTKSGEKMMGLNDALNLAHQSEKSSKGRIHMTPLFK



QILSEKESFSYIPDVFTEDSQLLPSIGGFFAQIENDKDGNIFDRALELISSYAEYDTERIYIRQADINRVSNVIF



GEWGTLGGLMREYKADSINDINLERTCKKVDKWLDSKEFALSDVLEAIKRTGNNDAFNEYISKMRTARE



KIDAARKEMKFISEKISGDEESIHIIKTLLDSVQQFLHFFNLFKARQDIPLDGAFYAEFDEVHSKLFAIVPLYN



KVRNYLTKNNLNTKKIKLNFKNPTLANGWDQNKVYDYASLIFLRDGNYYLGIINPKRKKNIKFEQGSGNG



PFYRKMVYKQIPGPNKNLPRVFLTSTKGKKEYKPSKEIIEGYEADKHIRGDKFDLDFCHKLIDFFKESIEKH



KDWSKFNFYFSPTESYGDISEFYLDVEKQGYRMHFENISAETIDEYVEKGDLFLFQIYNKDFVKAATGKKD



MHTIYWNAAFSPENLQDVVVKLNGEAELFYRDKSDIKEIVHREGEILVNRTYNGRTPVPDKIHKKLTDYH



NGRTKDLGEAKEYLDKVRYFKAHYDITKDRRYLNDKIYFHVPLTLNFKANGKKNLNKMVIEKFLSDEKA



HIIGIDRGERNLLYYSIIDRSGKIIDQQSLNVIDGFDYREKLNQREIEMKDARQSWNAIGKIKDLKEGYLSKA



VHEITKMAIQYNAIVVMEELNYGFKRGRFKVEKQIYQKFENMLIDKMNYLVFKDAPDESPGGVLNAYQL



TNPLESFAKLGKQTGILFYVPAAYTSKIDPTTGFVNLFNTSSKTNAQERKEFLQKFESISYSAKDGGIFAFAF



DYRKFGTSKTDHKNVWTAYTNGERMRYIKEKKRNELFDPSKEIKEALTSSGIKYDGGQNILPDILRSNNNG



LIYTMYSSFIAAIQMRVYDGKEDYIISPIKNSKGEFFRTDPKRRELPIDADANGAYNIALRGELTMRAIAEKF



DPDSEKMAKLELKHKDWFEFMQTRGD*





SEQ
MTKTFDSEFFNLYSLQKTVRFELKPVGETASFVEDFKNEGLKRVVSEDERRAVDYQKVKEIIDDYHRDFIE


ID
ESLNYFPEQVSKDALEQAFHLYQKLKAAKVEEREKALKEWEALQKKLREKVVKCFSDSNKARFSRIDKK


NO:
ELIKEDLINWLVAQNREDDIPTVETFNNFTTYFTGFHENRKNIYSKDDHATAISFRLIHENLPKFFDNVISFN


4
KLKEGFPELKFDKVKEDLEVDYDLKHAFEIEYFVNFVTQAGIDQYNYLLGGKTLEDGTKKQGMNEQINLF



KQQQTRDKARQIPKLIPLFKQILSERTESQSFIPKQFESDQELFDSLQKLHNNCQDKFTVLQQAILGLAEADL



KKVFIKTSDLNALSNTIFGNYSVFSDALNLYKESLKTKKAQEAFEKLPAHSIHDLIQYLEQFNSSLDAEKQQ



STDTVLNYFIKTDELYSRFIKSTSEAFTQVQPLFELEALSSKRRPPESEDEGAKGQEGFEQIKRIKAYLDTLM



EAVHFAKPLYLVKGRKMIEGLDKDQSFYEAFEMAYQELESLIIPIYNKARSYLSRKPFKADKFKINFDNNTL



LSGWDANKETANASILFKKDGLYYLGIMPKGKTFLFDYFVSSEDSEKLKQRRQKTAEEALAQDGESYFEKI



RYKLLPGASKMLPKVFFSNKNIGFYNPSDDILRIRNTASHTKNGTPQKGHSKVEFNLNDCHKMIDFFKSSIQ



KHPEWGSFGFTFSDTSDFEDMSAFYREVENQGYVISFDKIKETYIQSQVEQGNLYLFQIYNKDFSPYSKGKP



NLHTLYWKALFEEANLNNVVAKLNGEAEIFFRRHSIKASDKVVHPANQAIDNKNPHTEKTQSTFEYDLVK



DKRYTQDKFFFHVPISLNFKAQGVSKFNDKVNGFLKGNPDVNIIGIDRGERHLLYFTVVNQKGEILVQESL



NTLMSDKGHVNDYQQKLDKKEQERDAARKSWTTVENIKELKEGYLSHVVHKLAHLIIKYNAIVCLEDLN



FGFKRGRFKVEKQVYQKFEKALIDKLNYLVFKEKELGEVGHYLTAYQLTAPFESFKKLGKQSGILFYVPA



DYTSKIDPTTGFVNFLDLRYQSVEKAKQLLSDFNAIRFNSVQNYFEFEIDYKKLTPKRKVGTQSKWVICTY



GDVRYQNRRNQKGHWETEEVNVTEKLKALFASDSKTTTVIDYANDDNLIDVILEQDKASFFKELLWLLKL



TMTLRHSKIKSEDDFILSPVKNEQGEFYDSRKAGEVWPKDADANGAYHIALKGLWNLQQINQWEKGKTL



NLAIKNQDWFSFIQEKPYQE*





SEQ
MHTGGLLSMDAKEFTGQYPLSKTLRFELRPIGRTWDNLEASGYLAEDRHRAECYPRAKELLDDNHRAFL


ID
NRVLPQIDMDWHPIAEAFCKVHKNPGNKELAQDYNLQLSKRRKEISAYLQDADGYKGLFAKPALDEAMK


NO:
IAKENGNESDIEVLEAFNGFSVYFTGYHESRENIYSDEDMVSVAYRITEDNFPRFVSNALIFDKLNESHPDIIS


5
EVSGNLGVDDIGKYFDVSNYNNFLSQAGIDDYNHIIGGHTTEDGLIQAFNVVLNLRHQKDPGFEKIQFKQL



YKQILSVRTSKSYIPKQFDNSKEMVDCICDYVSKIEKSETVERALKLVRNISSFDLRGIFVNKKNLRILSNKLI



GDWDAIETALMHSSSSENDKKSVYDSAEAFTLDDIFSSVKKFSDASAEDIGNRAEDICRVISETAPFINDLR



AVDLDSLNDDGYEAAVSKIRESLEPYMDLFHELEIFSVGDEFPKCAAFYSELEEVSEQLIEIIPLFNKARSFC



TRKRYSTDKIKVNLKFPTLADGWDLNKERDNKAAILRKDGKYYLAILDMKKDLSSIRTSDEDESSFEKME



YKLLPSPVKMLPKIFVKSKAAKEKYGLTDRMLECYDKGMHKSGSAFDLGFCHELIDYYKRCIAEYPGWD



VFDFKFRETSDYGSMKEFNEDVAGAGYYMSLRKIPCSEVYRLLDEKSIYLFQIYNKDYSENAHGNKNMHT



MYWEGLFSPQNLESPVFKLSGGAELFFRKSSIPNDAKTVHPKGSVLVPRNDVNGRRIPDSIYRELTRYFNRG



DCRISDEAKSYLDKVKTKKADHDIVKDRRFTVDKMMFHVPIAMNFKAISKPNLNKKVIDGIIDDQDLKIIGI



DRGERNLIYVTMVDRKGNILYQDSLNILNGYDYRKALDVREYDNKEARRNWTKVEGIRKMKEGYLSLAV



SKLADMIIENNAIIVMEDLNHGFKAGRSKIEKQVYQKFESMLINKLGYMVLKDKSIDQSGGALHGYQLAN



HVTTLASVGKQCGVIFYIPAAFTSKIDPTTGFADLFALSNVKNVASMREFFSKMKSVIYDKAEGKFAFTFD



YLDYNVKSECGRTLWTVYTVGERFTYSRVNREYVRKVPTDIIYDALQKAGISVEGDLRDRIAESDGDTLK



SIFYAFKYALDMRVENREEDYIQSPVKNASGEFFCSKNAGKSLPQDSDANGAYNIALKGILQLRMLSEQYD



PNAESIRLPLITNKAWLTFMQSGMKTWKN*





SEQ
MDSLKDFTNLYPVSKTLRFELKPVGKTLENIEKAGILKEDEHRAESYRRVKKIIDTYHKVFIDSSLENMAK


ID
MGIENEIKAMLQSFCELYKKDHRTEGEDKALDKIRAVLRGLIVGAFTGVCGRRENTVQNEKYESLFKEKLI


NO:
KEILPDFVLSTEAESLPFSVEEATRSLKEFDSFTSYFAGFYENRKNIYSTKPQSTAIAYRLIHENLPKFIDNILV


6
FQKIKEPIAKELEHIRADFSAGGYIKKDERLEDIFSLNYYIHVLSQAGIEKYNALIGKIVTEGDGEMKGLNEH



INLYNQQRGREDRLPLFRPLYKQILSDREQLSYLPESFEKDEELLRALKEFYDHIAEDILGRTQQLMTSISEY



DLSRIYVRNDSQLTDISKKMLGDWNAIYMARERAYDHEQAPKRITAKYERDRIKALKGEESISLANLNSCI



AFLDNVRDCRVDTYLSTLGQKEGPHGLSNLVENVFASYHEAEQLLSFPYPEENNLIQDKDNVVLIKNLLD



NISDLQRFLKPLWGMGDEPDKDERFYGEYNYIRGALDQVIPLYNKVRNYLTRKPYSTRKVKLNFGNSQLL



SGWDRNKEKDNSCVILRKGQNFYLAIMNNRHKRSFENKVLPEYKEGEPYFEKMDYKFLPDPNKMLPKVF



LSKKGIEIYKPSPKLLEQYGHGTHKKGDTFSMDDLHELIDFFKHSIEAHEDWKQFGFKFSDTATYENVSSFY



REVEDQGYKLSFRKVSESYVYSLIDQGKLYLFQIYNKDFSPCSKGTPNLHTLYWRMLFDERNLADVIYKL



DGKAEIFFREKSLKNDHPTHPAGKPIKKKSRQKKGEESLFEYDLVKDRHYTMDKFQFHVPITMNFKCSAGS



KVNDMVNAHIREAKDMHVIGIDRGERNLLYICVIDSRGTILDQISLNTINDIDYHDLLESRDKDRQQERRN



WQTIEGIKELKQGYLSQAVHRIAELMVAYKAVVALEDLNMGFKRGRQKVESSVYQQFEKQLIDKLNYLV



DKKKRPEDIGGLLRAYQFTAPFKSFKEMGKQNGFLFYIPAWNTSNIDPTTGFVNLFHAQYENVDKAKSFFQ



KFDSISYNPKKDWFEFAFDYKNFTKKAEGSRSMWILCTHGSRIKNFRNSQKNGQWDSEEFALTEAFKSLFV



RYEIDYTADLKTAIVDEKQKDFFVDLLKLFKLTVQMRNSWKEKDLDYLISPVAGADGRFFDTREGNKSLP



KDADANGAYNIALKGLWALRQIRQTSEGGKLKLAISNKEWLQFVQERSYEKD*





SEQ
MNNGTNNFQNFIGISSLQKTLRNALIPTETTQQFIVKNGIIKEDELRGENRQILKDIMDDYYRGFISETLSSID


ID
DIDWTSLFEKMEIQLKNGDNKDTLIKEQTEYRKAIHKKFANDDRFKNMFSAKLISDILPEFVIHNNNYSASE


NO:
KEEKTQVIKLFSRFATSFKDYFKNRANCFSADDISSSSCHRIVNDNAEIFFSNALVYRRIVKSLSNDDINKISG


7
DMKDSLKEMSLEEIYSYEKYGEFITQEGISFYNDICGKVNSFMNLYCQKNKENKNLYKLQKLHKQILCIAD



TSYEVPYKFESDEEVYQSVNGFLDNISSKHIVERLRKIGDNYNGYNLDKIYIVSKFYESVSQKTYRDWETIN



TALEIHYNNILPGNGKSKADKVKKAVKNDLQKSITEINELVSNYKLCSDDNIKAETYIHEISHILNNFEAQEL



KYNPEIHLVESELKASELKNVLDVIMNAFHWCSVFMTEELVDKDNNFYAELEEIYDEIYPVISLYNLVRNY



VTQKPYSTKKIKLNFGIPTLADGWSKSKEYSNNAIILMRDNLYYLGIFNAKNKPDKKIIEGNTSENKGDYK



KMIYNLLPGPNKMIPKVFLSSKTGVETYKPSAYILEGYKQNKHIKSSKDFDITFCHDLIDYFKNCIAIHPEWK



NFGFDFSDTSTYEDISGFYREVELQGYKIDWTYISEKDIDLLQEKGQLYLFQIYNKDFSKKSTGNDNLHTM



YLKNLFSEENLKDIVLKLNGEAEIFFRKSSIKNPIIHKKGSILVNRTYEAEEKDQFGNIQIVRKNIPENIYQEL



YKYFNDKSDKELSDEAAKLKNVVGHHEAATNIVKDYRYTYDKYFLHMPITINFKANKTGFINDRILQYIA



KEKDLHVIGIDRGERNLIYVSVIDTCGNIVEQKSFNIVNGYDYQIKLKQQEGARQIARKEWKEIGKIKEIKE



GYLSLVIHEISKMVIKYNAIIAMEDLSYGFKKGRFKVERQVYQKFETMLINKLNYLVFKDISITENGGLLKG



YQLTYIPDKLKNVGHQCGCIFYVPAAYTSKIDPTTGFVNIFKFKDLTVDAKREFIKKFDSIRYDSEKNLFCFT



FDYNNFITQNTVMSKSSWSVYTYGVRIKRRFVNGRFSNESDTIDITKDMEKTLEMTDINWRDGHDLRQDII



DYEIVQHIFEIFRLTVQMRNSLSELEDRDYDRLISPVLNENNIFYDSAKAGDALPKDADANGAYCIALKGLY



EIKQITENWKEDGKFSRDKLKISNKDWFDFIQNKRYL*





SEQ
MTNKFTNQYSLSKTLRFELIPQGKTLEFIQEKGLLSQDKQRAESYQEMKKTIDKFHKYFIDLALSNAKLTHL


ID
ETYLELYNKSAETKKEQKFKDDLKKVQDNLRKEIVKSFSDGDAKSIFAILDKKELITVELEKWFENNEQKD


NO:
IYFDEKFKTFTTYFTGFHQNRKNMYSVEPNSTAIAYRLIHENLPKFLENAKAFEKIKQVESLQVNFRELMGE


8
FGDEGLIFVNELEEMFQINYYNDVLSQNGITIYNSIISGFTKNDIKYKGLNEYINNYNQTKDKKDRLPKLKQ



LYKQILSDRISLSFLPDAFTDGKQVLKAIFDFYKINLLSYTIEGQEESQNLLLLIRQTIENLSSFDTQKIYLKND



THLTTISQQVFGDFSVFSTALNYWYETKVNPKFETEYSKANEKKREILDKAKAVFTKQDYFSIAFLQEVLS



EYILTLDHTSDIVKKHSSNCIADYFKNHFVAKKENETDKTFDFIANITAKYQCIQGILENADQYEDELKQDQ



KLIDNLKFFLDAILELLHFIKPLHLKSESITEKDTAFYDVFENYYEALSLLTPLYNMVRNYVTQKPYSTEKIK



LNFENAQLLNGWDANKEGDYLTTILKKDGNYFLAIMDKKHNKAFQKFPEGKENYEKMVYKLLPGVNKM



LPKVFFSNKNIAYFNPSKELLENYKKETHKKGDTFNLEHCHTLIDFFKDSLNKHEDWKYFDFQFSETKSYQ



DLSGFYREVEHQGYKINFKNIDSEYIDGLVNEGKLFLFQIYSKDFSPFSKGKPNMHTLYWKALFEEQNLQN



VIYKLNGQAEIFFRKASIKPKNIILHKKKIKIAKKHFIDKKTKTSEIVPVQTIKNLNMYYQGKISEKELTQDDL



RYIDNFSIFNEKNKTIDIIKDKRFTVDKFQFHVPITMNFKATGGSYINQTVLEYLQNNPEVKIIGLDRGERHL



VYLTLIDQQGNILKQESLNTITDSKISTPYHKLLDNKENERDLARKNWGTVENIKELKEGYISQVVHKIATL



MLEENAIVVMEDLNFGFKRGRFKVEKQIYQKLEKMLIDKLNYLVLKDKQPQELGGLYNALQLTNKFESFQ



KMGKQSGFLFYVPAWNTSKIDPTTGFVNYFYTKYENVDKAKAFFEKFEAIRFNAEKKYFEFEVKKYSDFN



PKAEGTQQAWTICTYGERIETKRQKDQNNKFVSTPINLTEKIEDFLGKNQIVYGDGNCIKSQIASKDDKAFF



ETLLYWFKMTLQMRNSETRTDIDYLISPVMNDNGTFYNSRDYEKLENPTLPKDADANGAYHIAKKGLML



LNKIDQADLTKKVDLSISNRDWLQFVQKNK*





SEQ
MEQEYYLGLDMGTGSVGWAVTDSEYHVLRKHGKALWGVRLFESASTAEERRMFRTSRRRLDRRNWRIE


ID
ILQEIFAEEISKKDPGFFLRMKESKYYPEDKRDINGNCPELPYALFVDDDFTDKDYHKKFPTIYHLRKMLM


NO:
NTEETPDIRLVYLAIHHMMKHRGHFLLSGDINEIKEFGTTFSKLLENIKNEELDWNLELGKEEYAVVESILK


9
DNMLNRSTKKTRLIKALKAKSICEKAVLNLLAGGTVKLSDIFGLEELNETERPKISFADNGYDDYIGEVENE



LGEQFYIIETAKAVYDWAVLVEILGKYTSISEAKVATYEKHKSDLQFLKKIVRKYLTKEEYKDIFVSTSDKL



KNYSAYIGMTKINGKKVDLQSKRCSKEEFYDFIKKNVLKKLEGQPEYEYLKEELERETFLPKQVNRDNGVI



PYQIHLYELKKILGNLRDKIDLIKENEDKLVQLFEFRIPYYVGPLNKIDDGKEGKFTWAVRKSNEKIYPWNF



ENVVDIEASAEKFIRRMTNKCTYLMGEDVLPKDSLLYSKYMVLNELNNVKLDGEKLSVELKQRLYTDVF



CKYRKVTVKKIKNYLKCEGIISGNVEITGIDGDFKASLTAYHDFKEILTGTELAKKDKENIITNIVLFGDDKK



LLKKRLNRLYPQITPNQLKKICALSYTGWGRFSKKFLEEITAPDPETGEVWNIITALWESNNNLMQLLSNE



YRFMEEVETYNMGKQTKTLSYETVENMYVSPSVKRQIWQTLKIVKELEKVMKESPKRVFIEMAREKQES



KRTESRKKQLIDLYKACKNEEKDWVKELGDQEEQKLRSDKLYLYYTQKGRCMYSGEVIELKDLWDNTK



YDIDHIYPQSKTMDDSLNNRVLVKKKYNATKSDKYPLNENIRHERKGFWKSLLDGGFISKEKYERLIRNTE



LSPEELAGFIERQIVETRQSTKAVAEILKQVFPESEIVYVKAGTVSRFRKDFELLKVREVNDLHHAKDAYLN



IVVGNSYYVKFTKNASWFIKENPGRTYNLKKMFTSGWNIERNGEVAWEVGKKGTIVTVKQIMNKNNILV



TRQVHEAKGGLFDQQIMKKGKGQIAIKETDERLASIEKYGGYNKAAGAYFMLVESKDKKGKTIRTIEFIPL



YLKNKIESDESIALNFLEKGRGLKEPKILLKKIKIDTLFDVDGFKMWLSGRTGDRLLFKCANQLILDEKIIVT



MKKIVKFIQRRQENRELKLSDKDGIDNEVLMEIYNTFVDKLENTVYRIRLSEQAKTLIDKQKEFERLSLEDK



SSTLFEILHIFQCQSSAANLKMIGGPGKAGILVMNNNISKCNKISIINQSPTGIFENEIDLLK





SEQ
MNKFENFTGLYPISKTLRFELIPQGKTLEYIEKSEILENDNYRAEKYEEVKDIIDGYHKWFINETLHDLHINW


ID
SELKVALENNRIEKSDASKKELQRVQKIKREEIYNAFIEHEAFQYLFKENLLSDLLPIQIEQSEDLDAEKKKQ


NO:
AVETFNRFSTYFTGFHENRKNIYSKEGISTSVTYRIVHDNFPKFLENMKVFEILRNECPEVISDTANELAPFID


10
GVRIEDIFLIDFFNSTFSQNGIDYYNRILGGVTTETGEKYRGINEFTNLYRQQHPEFGKSKKATKMVVLFKQI



LSDRDTLSFIPEMFGNDKQVQNSIQLFYNREISQFENEGVKTDVCTALATLTSKIAEFDTEKIYIQQPELPNV



SQRLFGSWNELNACLFKYAELKFGTAEKVANRKKIDKWLKSDLFSFTELNKALEFSGKDERIENYFSETGI



FAQLVKTGFDEAQSILETEYTSEVHLKDQQTDIEKIKTFLDALQNLMHLLKSLCVSEEADRDAAFYNEFDM



LYNQLKLVVPLYNKVRNYITQKLFRSDKIKIYFENKGQFLGGWVDSQTENSDNGTQAGGYIFRKENVINE



YDYYLGICSDPKLFRRTTIVSENDRSSFERLDYYQLKTASVYGNSYCGKHPYTEDKNELVNSIDRFVHLSG



NNILIEKIAKDKVKSNPTTNTPSGYLNFIHREAPNTYECLLQDENFVSLNQRVVSALKATLATLVRVPKALV



YAKKDYHLFSEIINDIDELSYEKAFSYFPVSQTEFENSSNRTIKPLLLFKISNKDLSFAENFEKGNRQKIGKKN



LHTLYFEALMKGNQDTIDIGTGMVFHRVKSLNYNEKTLKYGHHSTQLNEKFSYPIIKDKRFASDKFLFHLS



TEINYKEKRKPLNNSIIEFLTNNPDINIIGLDRGERHLIYLTLINQKGEILRQKTFNIVGNTNYHEKLNQREKE



RDNARKSWATIGKIKELKEGFLSLVIHEIAKIMVENNAIVVLEDLNFGFKRGRFKVEKQIYQKFEKMLIDKL



NYLVFKDKKANEAGGVLKGYQLAEKFESFQKMGKQSGFLFYVPAAYTSKIDPTTGFVNMLNLNYTNMK



DAQTLLSGMDKISFNADANYFEFELDYEKFKTNQTDHTNKWTICTVGEKRFTYNSATKETTTVNVTEDLK



KLLDKFEVKYSNGDNIKDEICRQTDAKFFEIILWLLKLTMQMRNSNTKTEEDFILSPVKNSNGEFFRSNDDA



NGIWPADADANGAYHIALKGLYLVKECFNKNEKSLKIEHKNWFKFAQTRFNGSLTKNG*





SEQ
MENFKNLYPINKTLRFELRPYGKTLENFKKSGLLEKDAFKANSRRSMQAIIDEKFKETIEERLKYTEFSECD


ID
LGNMTSKDKKITDKAATNLKKQVILSFDDEIFNNYLKPDKNIDALFKNDPSNPVISTFKGFTTYFVNFFEIR


NO:
KHIFKGESSGSMAYRIIDENLTTYLNNIEKIKKLPEELKSQLEGIDQIDKLNNYNEFITQSGITHYNEIIGGISK


11
SENVKIQGINEGINLYCQKNKVKLPRLTPLYKMILSDRVSNSFVLDTIENDTELIEMISDLINKTEISQDVIMS



DIQNIFIKYKQLGNLPGISYSSIVNAICSDYDNNFGDGKRKKSYENDRKKHLETNVYSINYISELLTDTDVSS



NIKMRYKELEQNYQVCKENFNATNWMNIKNIKQSEKTNLIKDLLDILKSIQRFYDLFDIVDEDKNPSAEFY



TWLSKNAEKLDFEFNSVYNKSRNYLTRKQYSDKKIKLNFDSPTLAKGWDANKEIDNSTIIMRKFNNDRGD



YDYFLGIWNKSTPANEKIIPLEDNGLFEKMQYKLYPDPSKMLPKQFLSKIWKAKHPTTPEFDKKYKEGRH



KKGPDFEKEFLHELIDCFKHGLVNHDEKYQDVFGFNLRNTEDYNSYTEFLEDVERCNYNLSFNKIADTSNL



INDGKLYVFQIWSKDFSIDSKGTKNLNTIYFESLFSEENMIEKMFKLSGEAEIFYRPASLNYCEDIIKKGHHH



AELKDKFDYPIIKDKRYSQDKFFFHVPMVINYKSEKLNSKSLNNRTNENLGQFTHIIGIDRGERHLIYLTVV



DVSTGEIVEQKHLDEIINTDTKGVEHKTHYLNKLEEKSKTRDNERKSWEAIETIKELKEGYISHVINEIQKLQ



EKYNALIVMENLNYGFKNSRIKVEKQVYQKFETALIKKFNYIIDKKDPETYIHGYQLTNPITTLDKIGNQSGI



VLYIPAWNTSKIDPVTGFVNLLYADDLKYKNQEQAKSFIQKIDNIYFENGEFKFDIDFSKWNNRYSISKTK



WTLTSYGTRIQTFRNPQKNNKWDSAEYDLTEEFKLILNIDGTLKSQDVETYKKFMSLFKLMLQLRNSVTG



TDIDYMISPVTDKTGTHFDSRENIKNLPADADANGAYNIARKGIMAIENIMNGISDPLKISNEDYLKYIQNQ



QE





SEQ
MTQFEGFTNLYQVSKTLRFELIPQGKTLKHIQEQGFIEEDKARNDHYKELKPIIDRIYKTYADQCLQLVQLD


ID
WENLSAAIDSYRKEKTEETRNALIEEQATYRNAIHDYFIGRTDNLTDAINKRHAEIYKGLFKAELFNGKVL


NO:
KQLGTVTTTEHENALLRSFDKFTTYFSGFYENRKNVFSAEDISTAIPHRIVQDNFPKFKENCHIFTRLITAVP


12
SLREHFENVKKAIGIFVSTSIEEVFSFPFYNQLLTQTQIDLYNQLLGGISREAGTEKIKGLNEVLNLAIQKNDE



TAHIIASLPHRFIPLFKQILSDRNTLSFILEEFKSDEEVIQSFCKYKTLLRNENVLETAEALFNELNSIDLTHIFI



SHKKLETISSALCDHWDTLRNALYERRISELTGKITKSAKEKVQRSLKHEDINLQEIISAAGKELSEAFKQKT



SEILSHAHAALDQPLPTTLKKQEEKEILKSQLDSLLGLYHLLDWFAVDESNEVDPEFSARLTGIKLEMEPSL



SFYNKARNYATKKPYSVEKFKLNFQMPTLASGWDVNKEKNNGAILFVKNGLYYLGIMPKQKGRYKALSF



EPTEKTSEGFDKMYYDYFPDAAKMIPKCSTQLKAVTAHFQTHTTPILLSNNFIEPLEITKEIYDLNNPEKEPK



KFQTAYAKKTGDQKGYREALCKWIDFTRDFLSKYTKTTSIDLSSLRPSSQYKDLGEYYAELNPLLYHISFQ



RIAEKEIMDAVETGKLYLFQIYNKDFAKGHHGKPNLHTLYWTGLFSPENLAKTSIKLNGQAELFYRPKSR



MKRMAHRLGEKMLNKKLKDQKTPIPDTLYQELYDYVNHRLSHDLSDEARALLPNVITKEVSHEIIKDRRF



TSDKFFFHVPITLNYQAANSPSKFNQRVNAYLKEHPETPIIGIDRGERNLIYITVIDSTGKILEQRSLNTIQQFD



YQKKLDNREKERVAARQAWSVVGTIKDLKQGYLSQVIHEIVDLMIHYQAVVVLENLNFGFKSKRTGIAEK



AVYQQFEKMLIDKLNCLVLKDYPAEKVGGVLNPYQLTDQFTSFAKMGTQSGFLFYVPAPYTSKIDPLTGF



VDPFVWKTIKNHESRKHFLEGFDFLHYDVKTGDFILHFKMNRNLSFQRGLPGFMPAWDIVFEKNETQFDA



KGTPFIAGKRIVPVIENHRFTGRYRDLYPANELIALLEEKGIVFRDGSNILPKLLENDDSHAIDTMVALIRSV



LQMRNSNAATGEDYINSPVRDLNGVCFDSRFQNPEWPMDADANGAYHIALKGQLLLNHLKESKDLKLQN



GISNQDWLAYIQELRN*





SEQ
MAVKSIKVKLRLDDMPEIRAGLWKLHKEVNAGVRYYTEWLSLLRQENLYRRSPNGDGEQECDKTAEEC


ID
KAELLERLRARQVENGHRGPAGSDDELLQLARQLYELLVPQAIGAKGDAQQIARKFLSPLADKDAVGGL


NO:
GIAKAGNKPRWVRMREAGEPGWEEEKEKAETRKSADRTADVLRALADFGLKPLMRVYTDSEMSSVEWK


13
PLRKGQAVRTWDRDMFQQAIERMMSWESWNQRVGQEYAKLVEQKNRFEQKNFVGQEHLVHLVNQLQQ



DMKEASPGLESKEQTAHYVTGRALRGSDKVFEKWGKLAPDAPFDLYDAEIKNVQRRNTRRFGSHDLFAK



LAEPEYQALWREDASFLTRYAVYNSILRKLNHAKMFATFTLPDATAHPIWTRFDKLGGNLHQYTFLFNEF



GERRHAIRFHKLLKVENGVAREVDDVTVPISMSEQLDNLLPRDPNEPIALYFRDYGAEQHFTGEFGGAKIQ



CRRDQLAHMHRRRGARDVYLNVSVRVQSQSEARGERRPPYAAVFRLVGDNHRAFVHFDKLSDYLAEHP



DDGKLGSEGLLSGLRVMSVDLGLRTSASISVFRVARKDELKPNSKGRVPFFFPIKGNDNLVAVHERSQLLK



LPGETESKDLRAIREERQRTLRQLRTQLAYLRLLVRCGSEDVGRRERSWAKLIEQPVDAANHMTPDWREA



FENELQKLKSLHGICSDKEWMDAVYESVRRVWRHMGKQVRDWRKDVRSGERPKIRGYAKDVVGGNSIE



QIEYLERQYKFLKSWSFFGKVSGQVIRAEKGSRFAITLREHIDHAKEDRLKKLADRIIMEALGYVYALDER



GKGKWVAKYPPCQLILLEELSEYQFNNDRPPSENNQLMQWSHRGVFQELINQAQVHDLLVGTMYAAFSS



RFDARTGAPGIRCRRVPARCTQEHNPEPFPWWLNKFVVEHTLDACPLRADDLIPTGEGEIFVSPFSAEEGDF



HQIHADLNAAQNLQQRLWSDFDISQIRLRCDWGEVDGELVLIPRLTGKRTADSYSNKVFYTNTGVTYYER



ERGKKRRKVFAQEKLSEEEAELLVEADEAREKSVVLMRDPSGIINRGNWTRQKEFWSMVNQRIEGYLVK



QIRSRVPLQDSACENTGDI*





SEQ
MATRSFILKIEPNEEVKKGLWKTHEVLNHGIAYYMNILKLIRQEAIYEHHEQDPKNPKKVSKAEIQAELWD


ID
FVLKMQKCNSFTHEVDKDVVFNILRELYEELVPSSVEKKGEANQLSNKFLYPLVDPNSQSGKGTASSGRK


NO:
PRWYNLKIAGDPSWEEEKKKWEEDKKKDPLAKILGKLAEYGLIPLFIPFTDSNEPIVKEIKWMEKSRNQSV


14
RRLDKDMFIQALERFLSWESWNLKVKEEYEKVEKEHKTLEERIKEDIQAFKSLEQYEKERQEQLLRDTLNT



NEYRLSKRGLRGWREIIQKWLKMDENEPSEKYLEVFKDYQRKHPREAGDYSVYEFLSKKENHFIWRNHPE



YPYLYATFCEIDKKKKDAKQQATFTLADPINHPLWVRFEERSGSNLNKYRILTEQLHTEKLKKKLTVQLDR



LIYPTESGGWEEKGKVDIVLLPSRQFYNQIFLDIEEKGKHAFTYKDESIKFPLKGTLGGARVQFDRDHLRRY



PHKVESGNVGRIYFNMTVNIEPTESPVSKSLKIHRDDFPKFVNFKPKELTEWIKDSKGKKLKSGIESLEIGLR



VMSIDLGQRQAAAASIFEVVDQKPDIEGKLFFPIKGTELYAVHRASFNIKLPGETLVKSREVLRKAREDNLK



LMNQKLNFLRNVLHFQQFEDITEREKRVTKWISRQENSDVPLVYQDELIQIRELMYKPYKDWVAFLKQLH



KRLEVEIGKEVKHWRKSLSDGRKGLYGISLKNIDEIDRTRKFLLRWSLRPTEPGEVRRLEPGQRFAIDQLNH



LNALKEDRLKKMANTIIMHALGYCYDVRKKKWQAKNPACQIILFEDLSNYNPYEERSRFENSKLMKWSR



REIPRQVALQGEIYGLQVGEVGAQFSSRFHAKTGSPGIRCSVVTKEKLQDNRFFKNLQREGRLTLDKIAVL



KEGDLYPDKGGEKFISLSKDRKLVTTHADINAAQNLQKRFWTRTHGFYKVYCKAYQVDGQTVYIPESKD



QKQKIIEEFGEGYFILKDGVYEWGNAGKLKIKKGSSKQSSSELVDSDILKDSFDLASELKGEKLMLYRDPS



GNVFPSDKWMAAGVFFGKLERILISKLTNQYSISTIEDDSSKQSM*





SEQ
MPTRTINLKLVLGKNPENATLRRALFSTHRLVNQATKRIEEFLLLCRGEAYRTVDNEGKEAEIPRHAVQEE


ID
ALAFAKAAQRHNGCISTYEDQEILDVLRQLYERLVPSVNENNEAGDAQAANAWVSPLMSAESEGGLSVY


NO:
DKVLDPPPVWMKLKEEKAPGWEAASQIWIQSDEGQSLLNKPGSPPRWIRKLRSGQPWQDDFVSDQKKKQ


15
DELTKGNAPLIKQLKEMGLLPLVNPFFRHLLDPEGKGVSPWDRLAVRAAVAHFISWESWNHRTRAEYNSL



KLRRDEFEAASDEFKDDFTLLRQYEAKRHSTLKSIALADDSNPYRIGVRSLRAWNRVREEWIDKGATEEQ



RVTILSKLQTQLRGKFGDPDLFNWLAQDRHVHLWSPRDSVTPLVRINAVDKVLRRRKPYALMTFAHPRFH



PRWILYEAPGGSNLRQYALDCTENALHITLPLLVDDAHGTWIEKKIRVPLAPSGQIQDLTLEKLEKKKNRL



YYRSGFQQFAGLAGGAEVLFHRPYMEHDERSEESLLERPGAVWFKLTLDVATQAPPNWLDGKGRVRTPP



EVHHFKTALSNKSKHTRTLQPGLRVLSVDLGMRTFASCSVFELIEGKPETGRAFPVADERSMDSPNKLWA



KHERSFKLTLPGETPSRKEEEERSIARAEIYALKRDIQRLKSLLRLGEEDNDNRRDALLEQFFKGWGEEDVV



PGQAFPRSLFQGLGAAPFRSTPELWRQHCQTYYDKAEACLAKHISDWRKRTRPRPTSREMWYKTRSYHG



GKSIWMLEYLDAVRKLLLSWSLRGRTYGAINRQDTARFGSLASRLLHHINSLKEDRIKTGADSIVQAARGY



IPLPHGKGWEQRYEPCQLILFEDLARYRFRVDRPRRENSQLMQWNHRAIVAETTMQAELYGQIVENTAAG



FSSRFHAATGAPGVRCRFLLERDFDNDLPKPYLLRELSWMLGNTKVESEEEKLRLLSEKIRPGSLVPWDGG



EQFATLHPKRQTLCVIHADMNAAQNLQRRFFGRCGEAFRLVCQPHGDDVLRLASTPGARLLGALQQLEN



GQGAFELVRDMGSTSQMNRFVMKSLGKKKIKPLQDNNGDDELEDVLSVLPEEDDTGRITVFRDSSGIFFPC



NVWIPAKQFWPAVRAMIWKVMASHSLG*





SEQ
MTKLRHRQKKLTHDWAGSKKREVLGSNGKLQNPLLMPVKKGQVTEFRKAFSAYARATKGEMTDGRKN


ID
MFTHSFEPFKTKPSLHQCELADKAYQSLHSYLPGSLAHFLLSAHALGFRIFSKSGEATAFQASSKIEAYESK


NO:
LASELACVDLSIQNLTISTLFNALTTSVRGKGEETSADPLIARFYTLLTGKPLSRDTQGPERDLAEVISRKIAS


16
SFGTWKEMTANPLQSLQFFEEELHALDANVSLSPAFDVLIKMNDLQGDLKNRTIVFDPDAPVFEYNAEDP



ADIIIKLTARYAKEAVIKNQNVGNYVKNAITTTNANGLGWLLNKGLSLLPVSTDDELLEFIGVERSHPSCH



ALIELIAQLEAPELFEKNVFSDTRSEVQGMIDSAVSNHIARLSSSRNSLSMDSEELERLIKSFQIHTPHCSLFIG



AQSLSQQLESLPEALQSGVNSADILLGSTQYMLTNSLVEESIATYQRTLNRINYLSGVAGQINGAIKRKAID



GEKIHLPAAWSELISLPFIGQPVIDVESDLAHLKNQYQTLSNEFDTLISALQKNFDLNFNKALLNRTQHFEA



MCRSTKKNALSKPEIVSYRDLLARLTSCLYRGSLVLRRAGIEVLKKHKIFESNSELREHVHERKHFVFVSPL



DRKAKKLLRLTDSRPDLLHVIDEILQHDNLENKDRESLWLVRSGYLLAGLPDQLSSSFINLPIITQKGDRRLI



DLIQYDQINRDAFVMLVTSAFKSNLSGLQYRANKQSFVVTRTLSPYLGSKLVYVPKDKDWLVPSQMFEGR



FADILQSDYMVWKDAGRLCVIDTAKHLSNIKKSVFSSEEVLAFLRELPHRTFIQTEVRGLGVNVDGIAFNN



GDIPSLKTFSNCVQVKVSRTNTSLVQTLNRWFEGGKVSPPSIQFERAYYKKDDQIHEDAAKRKIRFQMPAT



ELVHASDDAGWTPSYLLGIDPGEYGMGLSLVSINNGEVLDSGFIHINSLINFASKKSNHQTKVVPRQQYKS



PYANYLEQSKDSAAGDIAHILDRLIYKLNALPVFEALSGNSQSAADQVWTKVLSFYTWGDNDAQNSIRKQ



HWFGASHWDIKGMLRQPPTEKKPKPYIAFPGSQVSSYGNSQRCSCCGRNPIEQLREMAKDTSIKELKIRNS



EIQLFDGTIKLFNPDPSTVIERRRHNLGPSRIPVADRTFKNISPSSLEFKELITIVSRSIRHSPEFIAKKRGIGSEY



FCAYSDCNSSLNSEANAAANVAQKFQKQLFFEL*





SEQ
MKRILNSLKVAALRLLFRGKGSELVKTVKYPLVSPVQGAVEELAEAIRHDNLHLFGQKEIVDLMEKDEGT


ID
QVYSVVDFWLDTLRLGMFFSPSANALKITLGKFNSDQVSPFRKVLEQSPFFLAGRLKVEPAERILSVEIRKI


NO:
GKRENRVENYAADVETCFIGQLSSDEKQSIQKLANDIWDSKDHEEQRMLKADFFAIPLIKDPKAVTEEDPE


17
NETAGKQKPLELCVCLVPELYTRGFGSIADFLVQRLTLLRDKMSTDTAEDCLEYVGIEEEKGNGMNSLLG



TFLKNLQGDGFEQIFQFMLGSYVGWQGKEDVLRERLDLLAEKVKRLPKPKFAGEWSGHRMFLHGQLKS



WSSNFFRLFNETRELLESIKSDIQHATMLISYVEEKGGYHPQLLSQYRKLMEQLPALRTKVLDPEIEMTHMS



EAVRSYIMIHKSVAGFLPDLLESLDRDKDREFLLSIFPRIPKIDKKTKEIVAWELPGEPEEGYLFTANNLFRN



FLENPKHVPRFMAERIPEDWTRLRSAPVWFDGMVKQWQKVVNQLVESPGALYQFNESFLRQRLQAMLT



VYKRDLQTEKFLKLLADVCRPLVDFFGLGGNDIIFKSCQDPRKQWQTVIPLSVPADVYTACEGLAIRLRET



LGFEWKNLKGHEREDFLRLHQLLGNLLFWIRDAKLVVKLEDWMNNPCVQEYVEARKAIDLPLEIFGFEVP



IFLNGYLFSELRQLELLLRRKSVMTSYSVKTTGSPNRLFQLVYLPLNPSDPEKKNSNNFQERLDTPTGLSRR



FLDLTLDAFAGKLLTDPVTQELKTMAGFYDHLFGFKLPCKLAAMSNHPGSSSKMVVLAKPKKGVASNIGF



EPIPDPAHPVFRVRSSWPELKYLEGLLYLPEDTPLTIELAETSVSCQSVSSVAFDLKNLTTILGRVGEFRVTA



DQPFKLTPIIPEKEESFIGKTYLGLDAGERSGVGFAIVTVDGDGYEVQRLGVHEDTQLMALQQVASKSLKE



PVFQPLRKGTFRQQERIRKSLRGCYWNFYHALMIKYRAKVVHEESVGSSGLVGQWLRAFQKDLKKADVL



PKKGGKNGVDKKKRESSAQDTLWGGAFSKKEEQQIAFEVQAAGSSQFCLKCGWWFQLGMREVNRVQES



GVVLDWNRSIVTFLIESSGEKVYGFSPQQLEKGFRPDIETFKKMVRDFMRPPMFDRKGRPAAAYERFVLGR



RHRRYRFDKVFEERFGRSALFICPRVGCGNFDHSSEQSAVVLALIGYIADKEGMSGKKLVYVRLAELMAE



WKLKKLERSRVEEQSSAQ*





SEQ ID NO: 18



MAESKQMQCRKCGASMKYEVIGLGKKSCRYMCPDCGNHTSARKIQNKKKRDKKYGSASKAQSQRIAVA



GALYPDKKVQTIKTYKYPADLNGEVHDSGVAEKIAQAIQEDEIGLLGPSSEYACWIASQKQSEPYSVVDF



WFDAVCAGGVFAYSGARLLSTVLQLSGEESVLRAALASSPFVDDINLAQAEKFLAVSRRTGQDKLGKRIG



ECFAEGRLEALGIKDRMREFVQAIDVAQTAGQRFAAKLKIFGISQMPEAKQWNNDSGLTVCILPDYYVPEE



NRADQLVVLLRRLREIAYCMGIEDEAGFEHLGIDPGALSNFSNGNPKRGFLGRLLNNDIIALANNMSAMTP



YWEGRKGELIERLAWLKHRAEGLYLKEPHFGNSWADHRSRIFSRIAGWLSGCAGKLKIAKDQISGVRTDL



FLLKRLLDAVPQSAPSPDFIASISALDRFLEAAESSQDPAEQVRALYAFHLNAPAVRSIANKAVQRSDSQEW



LIKELDAVDHLEFNKAFPFFSDTGKKKKKGANSNGAPSEEEYTETESIQQPEDAEQEVNGQEGNGASKNQ



KKFQRIPRFFGEGSRSEYRILTEAPQYFDMFCNNMRAIFMQLESQPRKAPRDFKCFLQNRLQKLYKQTFLN



ARSNKCRALLESVLISWGEFYTYGANEKKFRLRHEASERSSDPDYVVQQALEIARRLFLFGFEWRDCSAGE



RVDLVEIHKKAISFLLAITQAEVSVGSYNWLGNSTVSRYLSVAGTDTLYGTQLEEFLNATVLSQMRGLAIR



LSSQELKDGFDVQLESSCQDNLQHLLVYRASRDLAACKRATCPAELDPKILVLPVGAFIASVMKMIERGDE



PLAGAYLRHRPHSFGWQIRVRGVAEVGMDQGTALAFQKPTESEPFKIKPFSAQYGPVLWLNSSSYSQSQY



LDGFLSQPKNWSMRVLPQAGSVRVEQRVALIWNLQAGKMRLERSGARAFFMPVPFSFRPSGSGDEAVLA



PNRYLGLFPHSGGIEYAVVDVLDSAGFKILERGTIAVNGFSQKRGERQEEAHREKQRRGISDIGRKKPVQA



EVDAANELHRKYTDVATRLGCRIVVQWAPQPKPGTAPTAQTVYARAVRTEAPRSGNQEDHARMKSSWG



YTWGTYWEKRKPEDILGISTQVYWTGGIGESCPAVAVALLGHIRATSTQTEWEKEEVVFGRLKKFFPS*





SEQ ID NO: 19



MEKRINKIRKKLSADNATKPVSRSGPMKTLLVRVMTDDLKKRLEKRRKKPEVMPQVISNNAANNLRMLL



DDYTKMKEAILQVYWQEFKDDHVGLMCKFAQPASKKIDQNKLKPEMDEKGNLTTAGFACSQCGQPLFV



YKLEQVSEKGKAYTNYFGRCNVAEHEKLILLAQLKPEKDSDEAVTYSLGKFGQRALDFYSIHVTKESTHP



VKPLAQIAGNRYASGPVGKALSDACMGTIASFLSKYQDIIIEHQKVVKGNQKRLESLRELAGKENLEYPSV



TLPPQPHTKEGVDAYNEVIARVRMWVNLNLWQKLKLSRDDAKPLLRLKGFPSFPVVERRENEVDWWNTI



NEVKKLIDAKRDMGRVFWSGVTAEKRNTILEGYNYLPNENDHKKREGSLENPKKPAKRQFGDLLLYLEK



KYAGDWGKVFDEAWERIDKKIAGLTSHIEREEARNAEDAQSKAVLTDWLRAKASFVLERLKEMDEKEFY



ACEIQLQKWYGDLRGNPFAVEAENRVVDISGFSIGSDGHSIQYRNLLAWKYLENGKREFYLLMNYGKKG



RIRFTDGTDIKKSGKWQGLLYGGGKAKVIDLTFDPDDEQLIILPLAFGTRQGREFIWNDLLSLETGLIKLAN



GRVIEKTIYNKKIGRDEPALFVALTFERREVVDPSNIKPVNLIGVDRGENIPAVIALTDPEGCPLPEFKDSSG



GPTDILRIGEGYKEKQRAIQAAKEVEQRRAGGYSRKFASKSRNLADDMVRNSARDLFYHAVTHDAVLVF



ENLSRGFGRQGKRTFMTERQYTKMEDWLTAKLAYEGLTSKTYLSKTLAQYTSKTCSNCGFTITTADYDG



MLVRLKKTSDGWATTLNNKELKAEGQITYYNRYKRQTVEKELSAELDRLSEESGNNDISKWTKGRRDEA



LFLLKKRFSHRPVQEQFVCLDCGHEVHADEQAALNIARSWLFLNSNSTEFKSYKSGKQPFVGAWQAFYKR



RLKEVWKPNA





SEQ ID NO: 20



MKRINKIRRRLVKDSNTKKAGKTGPMKTLLVRVMTPDLRERLENLRKKPENIPQPISNTSRANLNKLLTDY



TEMKKAILHVYWEEFQKDPVGLMSRVAQPAPKNIDQRKLIPVKDGNERLTSSGFACSQCCQPLYVYKLEQ



VNDKGKPHTNYFGRCNVSEHERLILLSPHKPEANDELVTYSLGKFGQRALDFYSIHVTRESNHPVKPLEQI



GGNSCASGPVGKALSDACMGAVASFLTKYQDIILEHQKVIKKNEKRLANLKDIASANGLAFPKITLPPQPH



TKEGIEAYNNVVAQIVIWVNLNLWQKLKIGRDEAKPLQRLKGFPSFPLVERQANEVDWWDMVCNVKKLI



NEKKEDGKVFWQNLAGYKRQEALLPYLSSEEDRKKGKKFARYQFGDLLLHLEKKHGEDWGKVYDEAW



ERIDKKVEGLSKHIKLEEERRSEDAQSKAALTDWLRAKASFVIEGLKEADKDEFCRCELKLQKWYGDLRG



KPFAIEAENSILDISGFSKQYNCAFIWQKDGVKKLNLYLIINYFKGGKLRFKKIKPEAFEANRFYTVINKKSG



EIVPMEVNFNFDDPNLIILPLAFGKRQGREFIWNDLLSLETGSLKLANGRVIEKTLYNRRTRQDEPALFVAL



TFERREVLDSSNIKPMNLIGIDRGENIPAVIALTDPEGCPLSRFKDSLGNPTHILRIGESYKEKQRTIQAAKEV



EQRRAGGYSRKYASKAKNLADDMVRNTARDLLYYAVTQDAMLIFENLSRGFGRQGKRTFMAERQYTRM



EDWLTAKLAYEGLPSKTYLSKTLAQYTSKTCSNCGFTITSADYDRVLEKLKKTATGWMTTINGKELKVEG



QITYYNRYKRQNVVKDLSVELDRLSEESVNNDISSWTKGRSGEALSLLKKRFSHRPVQEKFVCLNCGFETH



ADEQAALNIARSWLFLRSQEYKKYQTNKTTGNTDKRAFVETWQSFYRKKLKEVWKP





SEQ ID NO: 21



atgGGAAAAATGTATTATCTTGGTCTGGATATAGGAACAAATTCTGTTGGATATGCCGTAACCGACCCA



TCGTACCATTTGCTCAAATTTAAAGGCGAACCGATGTGGGGTGCCCACGTGTTTGCTGCGGGGAATCA



ATCAGCTGAACGGAGAAGCTTTCGTACGAGCCGCAGACGCCTTGACCGCAGGCAACAGCGTGTCAAA



CTGGTTCAAGAAATCTTTGCTCCCGTGATTAGTCCCATTGATCCACGTTTTTTTATCAGACTTCATGAG



AGCGCTTTATGGCGGGATGATGTGGCTGAAACGGATAAACATATTTTCTTTAATGACCCGACCTATAC



GGATAAGGAATATTATTCTGACTATCCAACCATCCATCATCTCATTGTGGACCTTATGGAAAGCAGTG



AAAAGCATGACCCGCGGCTTGTTTATTTGGCTGTTGCCTGGCTGGTTGCTCATCGTGGTCATTTCCTCA



ATGAAGTGGATAAGGATAATATTGGGGATGTCCTGAGTTTTGACGCCTTTTATCCTGAGTTTCTGGCA



TTTCTTTCCGATAATGGGGTGTCACCTTGGGTATGTGAGTCAAAAGCACTCCAAGCGACCCTGCTTTC



ACGAAACTCCGTCAACGATAAGTATAAAGCCTTGAAGTCTCTGATCTTTGGCAGCCAAAAGCCGGAG



GATAATTTTGATGCCAATATCAGTGAAGATGGACTTATCCAACTTTTAGCAGGAAAAAAGGTCAAGGT



CAATAAACTTTTTCCTCAAGAAAGTAATGATGCTTCCTTTACACTCAATGATAAGGAAGATGCAATTG



AGGAAATCTTAGGAACGCTTACACCGGATGAGTGTGAATGGATTGCGCATATTAGGAGGCTGTTTGAT



TGGGCCATCATGAAACATGCTCTCAAAGATGGCAGAACAATCTCCGAATCGAAAGTAAAGCTCTATG



AACAGCATCACCATGACTTGACACAGCTCAAGTATTTTGTGAAGACCTATCTAGCAAAGGAATATGAT



GACATTTTTCGAAACGTAGATAGTGAAACAACCAAAAACTATGTCGCATATTCCTATCATGTAAAAGA



AGTCAAGGGTACATTGCCCAAAAATAAGGCAACCCAAGAAGAATTTTGCAAGTATGTCCTTGGAAAG



GTAAAGAACATCGAATGCAGTGAAGCTGATAAGGTTGATTTTGATGAAATGATTCAGCGTCTTACAG



ACAATTCCTTTATGCCGAAACAAGTATCAGGTGAAAACAGGGTTATCCCTTACCAGCTTTACTATTAT



GAACTAAAGACTATTTTGAATAAAGCCGCTTCTTATCTGCCTTTTTTGACCCAATGCGGAAAAGATGC



CATCTCCAATCAAGATAAGCTCCTTTCCATCATGACCTTTCGGATTCCGTATTTCGTTGGGCCCTTGCG



CAAGGACAATTCAGAGCATGCCTGGCTGGAACGAAAAGCAGGGAAAATCTATCCGTGGAATTTTAAC



GACAAAGTTGACCTTGATAAAAGTGAAGAAGCGTTCATTCGGAGAATGACGAATACCTGCACTTATT



ATCCCGGTGAAGATGTTTTGCCACTTGACTCCCTTATTTATGAAAAATTCATGATCCTCAATGAAATCA



ATAATATCCGAATTGATGGTTATCCTATTTCTGTAGATGTAAAACAGCAGGTTTTTGGCCTCTTTGAAA



AGAAGAGAAGAGTGACCGTAAAGGATATCCAGAATCTCCTGCTTTCCTTGGGTGCCTTGGATAAGCAT



GGTAAATTGACGGGAATCGATACTACCATCCATAGCAATTACAATACATACCATCATTTTAAATCGCT



CATGGAGCGTGGCGTTCTTACTCGTGATGATGTGGAACGCATTGTGGAGCGTATGACCTATAGTGATG



ATACAAAACGCGTCCGTCTTTGGCTGAACAATAATTATGGAACGCTCACTGCTGACGACGTAAAGCAT



ATTTCAAGGCTCCGAAAGCATGATTTTGGCCGGCTTTCCAAAATGTTCCTCACAGGCCTAAAGGGAGT



TCATAAGGAAACGGGGGAACGAGCTTCCATTTTGGATTTTATGTGGAATACCAATGATAACTTGATGC



AGCTTTTATCTGAATGTTATACTTTTTCGGATGAAATTACCAAGCTGCAGGAAGCATACTATGCCAAG



GCGCAGCTTTCCCTGAATGATTTTCTGGACTCCATGTATATTTCAAATGCTGTCAAACGTCCTATCTAT



CGAACTCTTGCCGTTGTAAATGACATACGCAAAGCCTGTGGGACGGCGCCAAAACGCATTTTTATCGA



AATGGCAAGAGATGGGGAAAGCAAAAAGAAAAGGAGCGTAACAAGAAGAGAACAAATCAAGAATC



TTTATAGGTCCATCCGCAAGGATTTTCAGCAGGAGGTAGATTTCCTTGAAAAAATCCTTGAAAACAAA



AGCGATGGACAGCTGCAAAGCGATGCGCTCTATCTATACTTTGCGCAGCTTGGAAGGGATATGTATAC



CGGGGACCCTATCAAGTTGGAGCATATCAAGGACCAGTCCTTCTATAATATTGATCATATCTATCCCC



AAAGCATGGTCAAGGACGATAGTCTTGATAACAAGGTGTTGGTTCAATCGGAAATTAATGGAGAGAA



GAGCAGTCGATATCCTCTTGATGCTGCTATCCGTAATAAAATGAAGCCTCTTTGGGATGCTTATTATA



ACCATGGCCTGATTTCCCTCAAGAAGTATCAGCGTTTGACGCGGAGCACTCCCTTTACAGATGATGAA



AAGTGGGATTTCATCAATCGGCAGCTTGTTGAGACAAGACAATCCACGAAGGCCTTGGCAATCTTACT



AAAAAGGAAGTTCCCTGATACGGAGATTGTCTACTCCAAGGCAGGGCTTTCTTCTGATTTTCGGCATG



AGTTTGGTCTCGTAAAATCGAGGAATATCAATGACCTGCACCATGCAAAGGACGCATTTCTTGCGATT



GTAACAGGAAATGTCTATCATGAACGCTTTAATCGCCGGTGGTTTATGGTGAACCAGCCCTATTCCGT



CAAGACCAAGACGTTGTTTACGCATTCTATTAAAAATGGTAATTTTGTAGCTTGGAATGGAGAAGAGG



ATCTTGGCCGCATTGTTAAAATGTTAAAGCAAAATAAGAACACTATTCATTTCACGCGGTTCTCTTTTG



ATCGAAAGGAAGGCCTGTTTGATATTCAGCCACTAAAAGCGTCAACCGGTCTTGTACCAAGAAAAGC



CGGACTAGACGTGGTAAAATATGGTGGCTATGACAAATCGACAGCAGCTTATTATCTCCTTGTTCGAT



TTACACTAGAAGATAAAAAGACTCAACATAAATTGATGATGATTCCTGTAGAAGGCTTGTATAAAGCT



CGAATTGACCATGATAAGGAATTCTTAACGGACTATGCACAAACTACAATCAGTGAAATCCTACAAA



AAGATAAACAAAAGGTGATAAATATAATGTTTCCAATGGGAACAAGGCACATTAAACTGAATTCCAT



GATTTCAATCGATGGTTTTTATCTTTCCATTGGAGGAAAGTCTAGTAAGGGAAAATCGGTGTTGTGTC



ATGCTATGGTACCTCTTATTGTACCTCATAAGATAGAATGTTATATTAAGGCGATGGAGTCTTTTGCAC



GTAAATTTAAAGAAAATAATAAATTAAGGATTGTGGAAAAGTTTGATAAGATTACGGTGGAAGATAA



CTTGAACCTATACGAACTATTTTTACAAAAACTTCAACATAACCCATATAATAAGTTCTTCTCCACACA



ATTTGATGTGCTGACTAATGGAAGAAGTACATTTACTAAATTATCTCCAGAGGAACAAGTTCAAACGT



TATTGAATATCTTATCAATTTTTAAAACTTGTCGGAGCTCTGGCTGCGATTTAAAATCCATTAACGGTT



CTGCTCAAGCTGCCAGAATTATGATCAGCGCAGATTTAACTGGACTCTCAAAAAAATATTCCGATATT



CGGCTTGTTGAGCAATCAGCATCTGGACTTTTTGTTAGTAAATCACAAAATCTTTTGGAGTATTTAtga





SEQ ID NO: 22



atgtcttcattaacaaaatttacaaataaatacagtaagcagctaaccataaaaaatgaactcatccca



gtaggaaagactctcgagaacattaaggaaaacggtctcatagatggagatgaacagctaaacgagaat



tcaaaaagcaaagataatcgttgatgattttctacgagatttcataaataaagctttaaataataccca



aataggaaattggagagaattagcagatgctttaaataaagaagatgaagataacatagaaaagctcca



agacaaaatcagaggaataattgtaagtaaattcgagacatttgatttgttttcttcttactcgataaa



gaaagacgaaaagataatagatgatgataatgatgttgaagaagaggagctagatctaggaaaaaaaac



ttcctcatttaaatatatttttaagaaaaacctttttaaattagtacttccttcttatttaaagacaac



aaatcaggataaactgaaaataatctcttcttttgataatttttctacctatttcagaggattctttga



gaacagaaaaaatattttcactaagaagcctatatctacgtcaattgcctacagaattgtccatgataa



ctttccaaagtttctagataacatcagatgttttaatgtgtggcaaacagaatgcccacagttaattgt



aaaggctgataattatttaaaatcaaagaacgtcatagctaaagataaatctttagcaaactattttac



tgtaggagcatatgattacttcttatcccagaatggcattgatttctacaacaacattatcggcggtct



accagcatttgctggtcatgagaaaatccaaggacttaatgaatttataaatcaagaatgccaaaagga



cagcgaactaaaatctaaactgaaaaacagacatgctttcaaaatggctgttctatttaagcaaattct



ttcagatagagaaaaaagttttgttatagacgagttcgaatctgatgctcaggtcatagatgcggttaa



gaacttctatgcagaacaatgtaaggataataatgttatttttaaccttctaaatcttatcaagaatat



agcgttcttatctgatgatgaattagatggaatttttatagaaggcaagtatttaagctctgtttccca



aaagctatattcagattggtcgaagcttcgaaatgatattgaagatagtgcaaacagtaaacaaggaaa



taaagagttagcaaagaaaattaaaacaaataaaggcgatgttgaaaaggccataagtaaatatgagtt



ttctttatcagaacttaactcaattgtacatgataatacaaaattcagtgaccttctttcttgtacgtt



acataaagtggctagcgaaaaactagtgaaagttaatgaaggggactggccaaaacacctgaaaaataa



tgaagaaaaacaaaagataaaagagcctttagatgcattgttagaaatttataatacattgctgatatt



caactgcaagtcatttaataagaacggtaatttctatgttgattatgacagatgcataaatgagctttc



tagtgttgtttatttatataacaaaacaagaaattactgtacaaagaaaccttataacacagacaaatt



caaattaaactttaacagtcctcaattaggagagggctttagtaagtcgaaagaaaatgactgtctgac



attattatttaaaaaagacgacaattactatgttggaattatcagaaaaggggcaaaaattaactttga



tgatacacaagccattgcagacaatacagataactgtatatttaagatgaattatttcctattaaaaga



tgctaaaaagtttattcctaaatgttcaattcagttaaaagaagtaaaagcacattttaaaaaatcaga



ggatgattatatcctgagtgacaaagaaaaatttgcctctccccttgttattaagaaatcaacattttt



attagcaacagcacatgtaaaaggaaagaaaggaaacataaaaaaattccaaaaggaatattctaagga



aaatccaacagaatatagaaattctctgaatgaatggattgcattttgtaaagaatttctaaaaacata



taaggcggcaacaatctttgacattacaacgttaaaaaaagctgaagaatatgctgatattgttgagtt



ttataaggatgtagataatctttgttataaactagagttttgccctattaaaacatctttcattgagaa



tcttattgataatggggacttatatttattcagaatcaataataaagatttcagttcaaaatctactgg



tacaaagaatcttcatacgctctatcttcaggcaatctttgatgaaagaaacctcaataatcctactat



tatgttaaatggcggagcagagttattttatcgaaaagaaagcattgaacagaaaaataggataactca



taaggcaggatcaattcttgtaaacaaggtttgtaaggatggaacaagtctagatgacaaaatcagaaa



cgaaatatatcaatatgaaaacaagtttattgatacattgtctgatgaagctaaaaaagttttacctaa



tgtaataaaaaaagaagcaactcacgacataacaaaagataagcgatttacatcagataagttcttttt



ccattgcccattaacaattaactataaggaaggagatacaaaacaatttaacaatgaggttttatcttt



ccttagaggtaatccagacattaatatcatcggaattgacagaggagaaagaaaccttatatacgtaac



tgttattaatcagaaaggcgaaatacttgacagcgtttcgtttaacacagtaacaaacaagtcgagcaa



aattgaacaaactgttgattatgaggaaaagcttgctgttagggaaaaagaaagaatagaagcaaaaag



atcctgggattcaatatcaaagatagcaaccttaaaagaaggttatctatcagctattgttcatgagat



atgcctactgatgatcaaacacaacgcaatcgttgtacttgagaatctaaatgcaggatttaagagaat



tagaggaggattatcagaaaagtctgtttatcagaaattcgagaagatgcttattaacaaactaaatta



ctttgtatctaaaaaagaatcagactggaataaacctagtggacttttaaatggtttacaactttcaga



ccagttcgagtcatttgagaaattaggaattcaatctgggttcatcttctatgttcctgcagcatatac



atctaagattgatcctacaacaggatttgcaaatgttcttaacttatccaaggtaagaaatgttgatgc



aataaagagttttttcagtaatttcaatgaaatttcatatagcaaaaaagaagctctctttaaattctc



ttttgatttagattccttatcaaagaagggcttcagctcatttgtaaaattcagtaaatctaaatggaa



tgtatatacatttggagagagaataataaaaccaaagaataagcaagggtatcgtgaagataagagaat



taatttaacatttgaaatgaaaaaacttctgaatgaatataaagtaagttttgatcttgaaaacaactt



aattccaaatctaacctctgcaaatctgaaagataccttctggaaagaactattctttatttttaaaac



aactctgcagcttagaaacagtgtaacaaatggcaaagaagatgtactgatttctccagtaaagaacgc



taaaggagagttctttgtatcaggaactcataacaagacattacctcaagactgtgatgcaaatggagc



atatcatatcgccctaaaaggtctgatgattcttgaacgtaacaatcttgttagagaagaaaaagacac



aaagaagataatggcaatttctaatgttgactggtttgagtatgttcaaaaaaggagaggtgtcctgta



a





SEQ ID NO: 23



ATGAACAACTATGATGAGTTTACCAAACTGTACCCAATACAGAAAACGATAAGGTTCGAATTGAAGC



CGCAGGGAAGAACGATGGAACACCTCGAAACATTCAACTTTTTCGAAGAGGACAGGGATAGAGCGG



AGAAATATAAGATTTTAAAGGAAGCAATCGACGAGTATCATAAGAAGTTTATAGACGAACATCTAAC



AAATATGTCTCTTGACTGGAATTCTTTAAAACAGATTTCAGAGAAATACTATAAGAGTAGAGAGGAA



AAAGACAAGAAAGTTTTTCTGTCAGAACAGAAACGCATGAGGCAAGAGATAGTTTCTGAGTTCAAAA



AAGACGATCGGTTTAAAGATCTTTTTTCAAAAAAATTGTTTTCTGAACTTCTCAAGGAAGAGATTTAC



AAAAAAGGAAACCATCAGGAAATTGACGCATTGAAAAGTTTTGATAAATTCTCAGGCTATTTTATTGG



GTTGCATGAGAACCGAAAAAATATGTATTCTGACGGAGACGAGATCACGGCTATCTCTAACCGTATTG



TAAATGAGAATTTCCCGAAGTTCCTCGACAACCTTCAGAAATATCAGGAAGCTCGTAAAAAATATCCA



GAGTGGATCATTAAGGCAGAATCTGCTTTAGTTGCACATAATATCAAGATGGATGAAGTCTTTTCCTT



AGAGTATTTCAACAAAGTCCTGAATCAAGAAGGAATACAGAGATACAATCTCGCCCTAGGTGGCTAT



GTGACCAAAAGTGGTGAGAAAATGATGGGGCTTAATGATGCACTTAATCTTGCCCATCAAAGTGAAA



AAAGCAGCAAGGGAAGGATACACATGACTCCACTCTTCAAACAGATTCTGAGTGAAAAAGAGTCCTT



TTCTTATATACCAGATGTTTTTACAGAAGACTCTCAACTTTTACCATCCATTGGTGGGTTCTTTGCACA



AATAGAAAATGATAAGGACGGGAATATTTTTGACAGAGCATTAGAATTGATATCTTCTTATGCAGAAT



ACGATACAGAAAGGATATATATCAGGCAAGCGGACATAAACAGAGTTTCTAATGTTATTTTCGGGGA



GTGGGGAACACTGGGGGGGTTAATGAGGGAATACAAAGCAGACTCTATCAACGACATCAATTTGGAG



AGAACATGCAAGAAGGTAGACAAGTGGCTCGACTCAAAGGAGTTTGCGTTATCAGATGTATTAGAGG



CAATAAAAAGAACCGGCAATAATGATGCTTTTAATGAATATATCTCAAAGATGCGCACTGCCAGGGA



AAAGATTGACGCTGCAAGAAAGGAAATGAAATTCATTTCGGAAAAAATATCTGGAGACGAAGAATCG



ATCCATATTATCAAAACCTTATTGGACTCGGTGCAACAGTTTTTACATTTTTTCAATTTATTCAAAGCG



CGTCAGGACATTCCTCTTGATGGAGCATTCTATGCGGAGTTCGATGAAGTCCATAGCAAACTGTTTGC



TATTGTTCCGTTGTATAATAAGGTTAGGAACTATCTTACGAAAAATAACCTTAACACGAAAAAGATAA



AGCTAAACTTCAAGAATCCAACTCTGGCAAACGGATGGGATCAAAACAAGGTATATGACTACGCCTC



CTTAATCTTTCTCCGCGATGGTAATTATTATCTCGGAATAATAAATCCAAAAAGGAAAAAGAATATTA



AATTCGAACAAGGGTCTGGAAATGGCCCATTCTACCGGAAGATGGTGTACAAACAAATTCCAGGGCC



GAACAAGAACTTACCAAGAGTCTTCCTCACATCTACGAAAGGCAAAAAAGAGTACAAGCCGTCAAAG



GAGATAATAGAAGGATATGAAGCGGACAAACACATAAGAGGAGATAAATTCGATCTGGATTTCTGTC



ATAAGCTGATAGACTTCTTCAAGGAATCCATCGAGAAGCACAAGGACTGGAGTAAGTTCAACTTCTAT



TTCTCTCCAACTGAATCATATGGAGACATCAGCGAATTCTATCTGGATGTAGAAAAACAGGGATACCG



GATGCATTTTGAGAATATTTCTGCCGAGACGATTGATGAGTATGTCGAAAAGGGGGACTTATTCCTCT



TCCAGATATACAACAAAGACTTTGTGAAAGCGGCAACCGGAAAAAAAGATATGCACACCATTTATTG



GAACGCGGCATTCTCGCCCGAGAACCTTCAGGATGTGGTAGTGAAACTGAACGGTGAAGCAGAACTT



TTCTACAGAGACAAGAGCGACATCAAGGAGATAGTTCACAGGGAGGGAGAGATACTGGTCAATCGTA



CCTACAACGGCAGGACACCTGTGCCTGACAAGATCCACAAAAAATTAACAGATTATCATAATGGCCG



TACCAAAGATCTCGGAGAAGCAAAAGAATACCTCGATAAGGTCAGATATTTCAAAGCGCACTACGAC



ATCACAAAGGATCGCAGATACCTGAATGATAAAATATACTTCCATGTGCCTCTGACATTGAATTTCAA



AGCAAACGGGAAGAAGAATCTCAATAAGATGGTAATTGAAAAGTTCCTCTCGGACGAAAAAGCGCAT



ATTATTGGGATTGATCGCGGGGAAAGGAATCTTCTTTACTATTCTATCATTGACAGGTCAGGTAAAAT



AATCGATCAACAGAGCCTCAACGTCATCGATGGATTCGATTACCGAGAGAAACTGAATCAGAGGGAG



ATCGAGATGAAGGATGCCAGACAAAGCTGGAATGCTATCGGGAAGATAAAGGACCTCAAGGAAGGG



TATCTTTCAAAAGCGGTCCACGAAATTACCAAGATGGCGATACAATACAATGCCATTGTTGTCATGGA



GGAACTCAATTATGGGTTCAAACGCGGACGTTTCAAAGTTGAGAAGCAGATATATCAGAAATTCGAG



AATATGCTGATTGACAAGATGAATTATCTGGTATTCAAGGATGCTCCGGATGAAAGTCCGGGAGGAG



TCCTCAATGCATATCAGCTTACTAATCCGCTTGAAAGTTTCGCTAAACTTGGGAAACAGACAGGAATT



CTTTTCTATGTTCCGGCAGCCTATACTTCGAAGATAGATCCGACGACCGGGTTTGTCAATCTTTTCAAT



ACTTCAAGTAAAACGAACGCACAGGAAAGAAAAGAATTCTTGCAAAAATTCGAGTCGATCTCCTATT



CCGCTAAAGACGGAGGAATATTCGCATTCGCGTTCGATTATCGGAAGTTCGGAACGTCAAAAACAGA



CCACAAAAATGTATGGACCGCATACACGAACGGGGAAAGGATGAGGTACATAAAAGAGAAAAAACG



CAACGAACTGTTCGACCCCTCGAAGGAGATCAAAGAGGCTCTCACTTCATCAGGAATCAAATATGAC



GGCGGACAGAACATATTGCCAGATATCCTGAGGAGCAACAATAACGGTCTGATCTACACAATGTATT



CCTCTTTCATAGCGGCCATTCAAATGAGGGTCTATGACGGGAAAGAAGACTATATCATCTCGCCGATA



AAGAACAGCAAGGGAGAGTTCTTCAGGACCGATCCGAAAAGAAGGGAACTTCCGATAGACGCGGAT



GCGAACGGCGCGTATAACATTGCTCTCAGGGGCGAATTGACGATGCGTGCGATAGCGGAGAAGTTCG



ATCCGGACTCGGAAAAGATGGCGAAGCTAGAACTGAAACATAAGGACTGGTTCGAATTCATGCAGAC



AAGGGGGGATTGA





SEQ ID NO: 24



ATGACAAAAACATTTGATTCAGAATTTTTTAATTTATATTCTCTTCAAAAAACAGTTCGTTTTGAACTC



AAGCCGGTTGGTGAAACAGCCTCGTTTGTTGAAGATTTTAAAAACGAAGGTTTGAAACGAGTTGTTTC



AGAGGATGAACGGCGTGCGGTTGATTACCAAAAAGTGAAAGAAATTATTGATGACTACCACCGAGAT



TTTATTGAAGAATCGCTGAACTATTTTCCTGAGCAGGTCTCAAAAGACGCTTTGGAACAAGCTTTTCA



CCTTTATCAAAAACTAAAAGCCGCTAAGGTTGAAGAGCGTGAAAAAGCATTGAAAGAATGGGAAGCC



CTTCAGAAAAAACTGCGCGAAAAAGTTGTTAAATGTTTTTCAGATTCAAACAAAGCACGCTTTTCCCG



CATTGATAAAAAAGAACTGATTAAAGAAGATTTAATTAACTGGTTGGTTGCACAAAATCGCGAAGAT



GACATTCCAACCGTTGAAACCTTTAACAACTTTACGACTTATTTTACGGGGTTTCATGAAAACCGAAA



AAACATTTATTCAAAAGACGATCATGCCACAGCCATTTCATTTCGACTCATTCATGAAAACCTGCCTA



AGTTTTTTGATAATGTGATCAGCTTTAATAAATTGAAGGAAGGATTTCCAGAGCTGAAATTTGATAAG



GTTAAGGAAGATTTAGAAGTTGATTATGACTTGAAACATGCCTTTGAAATCGAATACTTTGTCAATTT



TGTTACCCAAGCCGGAATTGACCAATATAACTATCTTTTGGGGGGTAAAACCTTAGAAGACGGCACCA



AAAAGCAAGGCATGAATGAACAAATCAATCTGTTCAAGCAACAGCAAACCCGAGACAAAGCCCGAC



AAATTCCCAAACTCATACCATTGTTTAAACAAATTCTAAGCGAACGAACGGAAAGCCAATCGTTTATT



CCAAAACAATTTGAATCAGACCAAGAGCTATTTGACTCACTGCAAAAACTGCATAACAACTGCCAAG



ATAAATTTACCGTACTGCAACAAGCCATTTTAGGCTTAGCCGAAGCAGATCTGAAAAAAGTATTCATT



AAAACATCTGATCTTAATGCGCTATCAAATACCATTTTTGGAAATTACAGTGTGTTTTCGGATGCGTTG



AATTTATACAAAGAATCGCTCAAAACAAAAAAGGCGCAAGAAGCGTTTGAAAAACTACCCGCTCACA



GCATTCATGACTTGATTCAATATTTGGAGCAATTTAATAGCTCTTTGGATGCAGAAAAACAGCAATCA



ACTGACACCGTACTGAATTACTTTATTAAAACAGACGAGCTGTATTCTCGGTTCATAAAATCAACGAG



CGAAGCCTTCACACAAGTACAACCACTCTTTGAATTGGAAGCATTAAGCTCAAAACGTCGTCCACCGG



AAAGTGAAGACGAAGGCGCAAAAGGTCAGGAAGGGTTTGAGCAAATTAAACGCATAAAAGCCTATT



TGGATACCTTGATGGAGGCGGTGCATTTTGCAAAACCACTTTATCTGGTGAAGGGGCGCAAAATGATT



GAAGGTCTGGACAAAGACCAAAGTTTCTATGAAGCCTTTGAAATGGCTTACCAAGAACTAGAAAGTC



TGATTATTCCAATCTACAACAAAGCTCGTAGTTATTTAAGTCGTAAACCGTTTAAAGCGGACAAATTC



AAAATTAATTTTGATAATAATACATTGCTTTCCGGTTGGGATGCTAATAAAGAAACGGCTAACGCTTC



AATTTTGTTTAAGAAGGATGGTTTGTATTATTTAGGAATCATGCCTAAAGGAAAAACGTTTTTGTTCG



ATTACTTCGTTTCATCGGAAGATTCTGAAAAGTTAAAACAAAGAAGACAAAAAACCGCCGAAGAAGC



GCTTGCGCAAGATGGCGAAAGCTACTTTGAAAAAATTCGTTACAAGCTGTTACCTGGCGCCAGCAAA



ATGTTGCCGAAAGTATTTTTTTCCAACAAAAACATAGGGTTTTACAACCCAAGTGATGACATACTTCG



TATCAGGAATACAGCCTCTCACACTAAAAACGGAACACCGCAAAAAGGGCACTCTAAAGTAGAGTTT



AATTTGAATGATTGTCATAAGATGATTGATTTCTTTAAATCAAGCATTCAAAAGCATCCAGAGTGGGG



AAGTTTTGGATTCACCTTTTCAGATACATCAGATTTTGAAGATATGAGCGCCTTTTATCGAGAAGTCG



AAAACCAAGGTTATGTCATTAGTTTCGATAAAATAAAAGAAACTTACATTCAGAGTCAAGTTGAACA



GGGGAACCTATATTTATTCCAAATCTACAATAAAGACTTCTCGCCCTACAGCAAAGGCAAACCAAATT



TACACACGCTTTACTGGAAAGCGTTGTTTGAGGAAGCCAACCTAAATAATGTGGTGGCAAAACTCAAT



GGTGAAGCTGAAATTTTCTTTAGGCGACACTCAATCAAAGCATCTGATAAAGTGGTGCACCCAGCGA



ATCAAGCCATTGACAATAAAAACCCGCATACCGAAAAAACGCAAAGCACCTTTGAATATGATCTTGT



AAAAGACAAGCGCTATACCCAAGACAAATTCTTCTTCCATGTACCGATTTCATTGAACTTTAAGGCAC



AAGGTGTTTCAAAATTTAACGATAAAGTGAATGGATTTTTAAAGGGTAACCCAGATGTCAATATTATT



GGCATTGACCGAGGCGAACGACACCTTCTGTATTTCACTGTGGTGAATCAGAAAGGTGAAATTTTGGT



TCAAGAGTCGCTTAATACCCTAATGAGTGATAAAGGGCATGTGAATGACTACCAGCAAAAACTCGAC



AAAAAAGAACAAGAACGCGATGCCGCTCGCAAAAGCTGGACGACGGTTGAAAATATCAAAGAATTA



AAAGAAGGCTATTTATCTCATGTTGTTCATAAGTTGGCACACCTGATTATTAAATACAATGCCATTGTT



TGCTTGGAAGACCTGAATTTTGGTTTCAAACGCGGGCGTTTTAAAGTGGAAAAACAAGTTTATCAGAA



ATTTGAAAAAGCGCTTATTGATAAGCTTAACTACTTGGTATTTAAAGAAAAAGAGTTAGGCGAGGTG



GGCCATTATCTAACCGCCTATCAGTTGACCGCACCGTTTGAAAGTTTCAAGAAGTTAGGCAAGCAAAG



TGGCATATTGTTTTATGTTCCGGCGGATTACACCTCCAAAATTGACCCAACCACCGGGTTTGTCAACTT



TCTTGATCTGCGTTATCAGAGTGTCGAAAAAGCCAAACAGCTCTTAAGCGACTTTAATGCCATTCGTT



TTAATTCAGTACAAAACTATTTTGAGTTCGAAATAGATTACAAAAAACTCACACCCAAACGTAAAGTT



GGTACTCAGAGTAAATGGGTGATTTGTACCTATGGAGATGTCCGCTATCAAAATCGGCGTAATCAAAA



AGGTCACTGGGAAACGGAAGAAGTCAATGTGACTGAAAAACTAAAAGCCCTTTTCGCCAGTGATTCC



AAAACTACAACCGTAATCGATTACGCCAATGACGACAACCTAATTGACGTCATTCTGGAACAGGACA



AAGCCAGCTTCTTCAAAGAACTGTTATGGTTATTAAAACTCACCATGACGCTCCGCCACAGCAAAATC



AAAAGTGAAGACGACTTTATTCTTTCACCCGTTAAAAACGAACAAGGCGAGTTTTACGATAGTCGAA



AAGCGGGCGAGGTGTGGCCTAAAGATGCAGACGCCAATGGCGCTTATCACATAGCGTTGAAAGGCTT



GTGGAATCTGCAACAGATCAATCAGTGGGAAAAGGGTAAAACACTTAATCTGGCGATTAAAAACCAG



GATTGGTTCAGTTTTATTCAAGAAAAGCCCTATCAAGAATAA





SEQ ID NO: 25



ATGCACACAGGCGGATTACTTAGCATGGATGCCAAGGAGTTTACCGGACAGTACCCCCTTTCGAAGA



CTCTGCGTTTTGAACTGAGACCGATAGGCAGAACGTGGGACAATCTCGAAGCATCGGGGTATCTTGCG



GAGGACAGACACCGTGCAGAATGCTATCCCAGGGCAAAAGAGCTCTTGGACGACAACCATCGTGCAT



TCCTCAACCGTGTCCTGCCTCAGATCGATATGGATTGGCACCCGATCGCAGAGGCATTCTGCAAAGTC



CACAAGAATCCGGGAAACAAGGAATTGGCTCAGGATTACAATCTTCAGCTGTCCAAACGCAGAAAGG



AGATTTCGGCCTATCTGCAGGATGCGGACGGCTATAAAGGTCTGTTTGCCAAACCTGCATTGGATGAA



GCAATGAAGATCGCGAAAGAAAACGGAAATGAATCGGACATAGAGGTTCTTGAGGCATTCAACGGTT



TCTCCGTATACTTCACCGGATATCATGAGAGCAGGGAGAACATCTATTCGGACGAGGATATGGTGTCG



GTAGCTTATCGCATCACCGAAGACAATTTCCCGAGATTCGTTTCCAATGCGCTTATATTCGATAAGCT



GAATGAGTCGCACCCCGATATAATCTCGGAAGTATCCGGAAATCTGGGCGTAGACGACATCGGAAAA



TATTTTGATGTGTCTAACTACAATAATTTCCTGTCGCAGGCCGGTATAGATGACTACAATCACATCATC



GGCGGCCATACGACGGAGGACGGTCTGATCCAGGCATTCAATGTTGTTCTGAATCTCAGGCATCAGA



AAGACCCCGGATTCGAAAAAATCCAATTCAAACAGCTGTACAAACAGATACTCAGCGTCCGTACATC



CAAATCCTATATCCCGAAACAGTTCGATAATTCGAAGGAGATGGTGGACTGCATCTGCGACTATGTGT



CCAAGATCGAAAAATCCGAAACGGTCGAGAGAGCATTGAAGCTGGTAAGGAACATATCTTCTTTTGA



TTTGCGCGGAATATTCGTAAACAAGAAGAATCTCCGCATTCTTTCCAACAAACTGATTGGTGATTGGG



ACGCGATCGAAACCGCGCTGATGCACTCCTCCTCTTCGGAAAATGATAAGAAATCCGTCTACGACAGC



GCCGAGGCATTTACGCTGGATGATATCTTTTCGTCCGTTAAAAAATTCTCAGATGCATCTGCAGAGGA



TATCGGAAACCGGGCGGAGGACATATGCAGAGTCATATCTGAGACCGCTCCGTTCATAAACGATCTG



AGGGCTGTCGATTTGGACAGTTTGAATGACGACGGTTACGAGGCGGCGGTTTCCAAGATAAGGGAAT



CTCTGGAACCATATATGGATCTGTTTCATGAACTGGAGATATTCTCCGTAGGCGATGAATTCCCGAAA



TGTGCAGCTTTCTACAGTGAACTTGAAGAAGTCTCCGAACAGCTAATCGAGATTATACCGTTATTCAA



CAAGGCCCGTTCGTTCTGTACGCGCAAGAGATACAGTACGGACAAGATAAAGGTCAATTTGAAATTC



CCGACACTCGCCGACGGATGGGATCTCAACAAAGAACGCGACAACAAAGCCGCAATACTCAGGAAA



GACGGAAAGTACTACCTGGCCATACTGGATATGAAGAAAGATCTTTCTTCGATCAGAACTTCGGATGA



AGACGAATCCAGTTTTGAGAAAATGGAGTACAAGCTTCTTCCGAGTCCGGTAAAGATGCTGCCAAAG



ATCTTCGTAAAATCGAAGGCGGCCAAGGAGAAGTACGGTCTGACCGACCGTATGCTGGAGTGCTACG



ATAAAGGGATGCACAAGAGCGGCAGTGCATTCGATCTCGGATTTTGTCACGAATTGATCGATTACTAC



AAGAGGTGCATCGCAGAATATCCCGGCTGGGACGTCTTCGATTTCAAGTTCAGGGAAACATCGGATT



ATGGCAGCATGAAGGAGTTCAATGAGGATGTTGCAGGGGCCGGATACTATATGTCCCTCAGAAAGAT



CCCTTGTTCGGAGGTCTACAGGCTTCTTGATGAGAAATCGATATATCTTTTCCAGATCTACAACAAAG



ATTATTCGGAAAACGCTCATGGGAATAAGAACATGCATACCATGTATTGGGAAGGGCTCTTTTCCCCC



CAGAATCTGGAATCCCCTGTGTTTAAACTCAGCGGCGGTGCGGAGCTTTTCTTCCGTAAATCCTCCAT



ACCCAATGACGCCAAAACGGTCCATCCGAAGGGAAGCGTCCTGGTTCCGCGCAATGATGTAAACGGC



CGCAGGATACCTGACAGCATATATCGGGAGCTCACCAGATATTTCAACCGCGGAGATTGCCGCATAA



GCGACGAGGCAAAGAGTTATCTGGACAAGGTGAAAACCAAGAAAGCTGACCACGATATCGTGAAAG



ACAGGAGGTTCACGGTGGACAAGATGATGTTCCACGTCCCTATCGCCATGAATTTCAAAGCGATTTCG



AAGCCGAATCTCAATAAAAAGGTGATTGACGGCATAATCGACGACCAAGATCTGAAGATCATCGGCA



TAGACCGCGGAGAGCGCAACCTCATCTACGTAACCATGGTGGATCGCAAAGGGAACATCCTCTATCA



GGATAGCCTCAATATTCTGAACGGATACGATTACCGTAAGGCCCTCGACGTCCGCGAATATGACAATA



AAGAGGCTCGGAGGAACTGGACGAAGGTCGAAGGCATCCGTAAGATGAAAGAGGGGTATCTGTCGC



TTGCAGTCAGCAAATTGGCAGATATGATCATAGAGAACAATGCGATTATCGTCATGGAGGATCTCAAT



CACGGATTCAAGGCAGGGCGTTCGAAGATAGAGAAACAGGTCTATCAGAAGTTCGAATCCATGCTCA



TAAACAAACTCGGTTACATGGTCCTCAAGGATAAGTCTATCGATCAGAGCGGCGGAGCTCTCCACGG



ATACCAGCTTGCCAACCATGTGACAACATTGGCATCTGTAGGTAAACAATGTGGAGTGATATTCTACA



TCCCTGCTGCATTTACATCCAAGATAGATCCGACAACAGGATTTGCAGATCTGTTCGCCCTCAGCAAT



GTTAAAAACGTGGCATCTATGAGAGAATTTTTCTCCAAGATGAAGTCTGTAATCTATGATAAGGCGGA



GGGAAAATTCGCATTTACCTTCGACTATCTTGATTATAATGTGAAATCCGAGTGCGGAAGGACCCTTT



GGACCGTGTATACGGTCGGAGAGAGATTCACATACAGCAGGGTCAATAGAGAATATGTCAGAAAAGT



TCCGACAGACATAATCTACGACGCATTGCAAAAGGCAGGAATATCTGTTGAAGGGGATCTCAGGGAC



AGGATTGCTGAATCGGATGGCGACACTCTGAAGAGCATATTCTATGCATTCAAGTATGCATTGGATAT



GAGAGTAGAGAACCGCGAAGAGGATTACATACAGTCTCCTGTCAAAAATGCCTCCGGAGAATTCTTC



TGTTCCAAGAACGCAGGCAAATCGCTCCCTCAGGATTCCGATGCGAACGGTGCATACAATATCGCACT



CAAGGGGATCCTGCAGCTACGTATGCTTTCCGAGCAGTATGATCCGAATGCAGAGAGCATACGGTTG



CCACTGATAACCAACAAGGCCTGGCTGACCTTTATGCAGTCCGGTATGAAGACATGGAAGAACTGA





SEQ ID NO: 26



atgGATAGTTTGAAAGATTTCACCAATCTGTACCCTGTCAGTAAGACATTGAGATTTGAATTAAAGCCC



GTTGGAAAGACTTTAGAAAATATCGAGAAAGCAGGTATTTTGAAAGAGGATGAGCATCGTGCAGAAA



GTTATCGGAGGGTGAAGAAAATAATTGATACTTATCATAAGGTATTTATCGATTCTTCTCTTGAAAAT



ATGGCTAAAATGGGTATTGAGAATGAAATAAAAGCAATGCTCCAAAGTTTCTGCGAATTGTATAAAA



AAGATCATCGCACTGAGGGTGAAGACAAGGCATTAGATAAAATTCGAGCAGTACTTCGTGGCCTGAT



TGTTGGGGCTTTCACTGGTGTTTGCGGAAGACGGGAAAATACAGTCCAAAACGAGAAGTACGAGAGT



TTGTTCAAAGAAAAGTTGATAAAAGAAATTTTACCTGATTTTGTGCTCTCTACTGAGGCTGAAAGCTT



GCCTTTCTCTGTTGAAGAAGCTACGAGGTCACTGAAGGAGTTTGATAGCTTTACATCCTACTTTGCTGG



TTTTTACGAGAATAGAAAGAATATATACTCGACGAAACCTCAATCCACTGCCATTGCTTATCGTCTTA



TTCATGAGAACTTGCCGAAGTTCATTGATAATATTCTTGTTTTTCAGAAGATCAAAGAGCCTATAGCC



AAAGAGCTGGAACATATTCGTGCGGACTTTTCTGCCGGGGGGTACATAAAAAAGGATGAGAGATTGG



AGGATATTTTTTCGTTGAACTATTATATCCACGTGTTATCTCAGGCTGGGATCGAAAAATATAACGCA



TTGATTGGGAAGATTGTGACAGAAGGAGATGGAGAGATGAAAGGGCTCAATGAACACATCAACCTTT



ACAACCAACAAAGAGGCAGAGAGGATCGGCTCCCTCTTTTTAGGCCTCTTTATAAACAGATATTGAGT



GACAGAGAGCAATTATCATACTTGCCTGAGAGTTTTGAAAAAGATGAGGAGCTCCTCAGGGCTCTAA



AAGAGTTCTATGATCATATCGCAGAAGACATTCTCGGACGTACTCAACAGTTGATGACTTCTATTTCA



GAATATGATTTATCTCGGATATACGTAAGGAACGATAGCCAATTGACTGATATATCAAAAAAAATGTT



GGGAGATTGGAATGCTATCTACATGGCTAGAGAACGAGCATATGACCACGAGCAGGCTCCCAAAAGA



ATCACGGCGAAATACGAGAGGGACAGGATTAAAGCTCTTAAAGGAGAAGAGAGTATAAGTCTGGCA



AATCTTAATAGTTGTATTGCCTTTCTGGACAATGTTAGAGATTGCCGTGTAGATACTTATCTTTCCACA



CTGGGCCAGAAGGAAGGACCACATGGTCTATCTAATCTCGTTGAGAACGTTTTTGCCTCATACCATGA



AGCAGAGCAATTGTTGAGCTTTCCATACCCCGAAGAGAATAATCTGATTCAGGACAAGGACAATGTG



GTGTTAATTAAGAATCTTCTCGACAATATCAGTGATCTGCAGAGGTTCTTGAAACCTCTTTGGGGTAT



GGGAGACGAACCCGATAAAGATGAAAGATTTTATGGAGAGTATAATTATATCCGAGGAGCTCTAGAT



CAGGTGATCCCTCTGTACAATAAGGTAAGGAACTACCTCACTCGGAAGCCTTATTCGACCAGAAAAGT



AAAACTCAATTTTGGGAATTCTCAATTGCTTAGTGGTTGGGATAGAAATAAGGAAAAGGATAATAGC



TGTGTGATTTTGCGTAAGGGGCAGAACTTCTATTTGGCTATTATGAACAATAGGCACAAAAGAAGTTT



CGAAAACAAGGTGTTGCCCGAGTATAAGGAGGGAGAACCTTACTTCGAAAAGATGGATTATAAATTT



TTGCCTGATCCTAATAAAATGCTTCCTAAGGTTTTTCTTTCGAAAAAAGGAATAGAGATATACAAACC



AAGTCCGAAGCTTTTAGAACAATATGGACATGGAACTCACAAAAAGGGAGATACCTTTAGTATGGAT



GATTTGCACGAACTGATCGATTTCTTCAAACACTCAATCGAGGCTCATGAAGATTGGAAGCAATTCGG



ATTCAAATTTTCTGATACGGCTACTTATGAGAATGTATCTAGTTTCTATAGAGAAGTTGAGGATCAGG



GGTATAAGCTCTCTTTCCGAAAAGTTTCGGAATCTTATGTCTATTCATTAATAGATCAAGGCAAGTTGT



ATTTATTTCAGATATACAACAAGGACTTTTCTCCCTGCAGCAAAGGGACACCTAATCTGCATACCTTG



TATTGGAGAATGCTTTTTGACGAGCGCAATTTGGCAGATGTCATATACAAACTGGATGGGAAGGCTGA



AATCTTTTTCCGAGAGAAGAGTTTGAAAAATGATCATCCCACGCATCCGGCTGGTAAGCCTATCAAAA



AGAAAAGTCGACAAAAAAAAGGAGAGGAGAGTCTGTTTGAGTATGATTTAGTCAAGGATAGGCACTA



TACGATGGATAAGTTCCAGTTTCATGTGCCTATTACTATGAATTTTAAATGTTCTGCAGGAAGCAAAG



TCAATGATATGGTTAATGCTCATATTCGAGAGGCAAAGGATATGCATGTCATTGGAATTGATCGTGGA



GAACGCAATCTGCTGTATATATGCGTGATAGATAGTCGAGGGACGATTTTGGATCAAATTTCTCTGAA



TACGATTAACGATATAGACTATCATGATTTATTGGAGAGTCGAGACAAAGACCGTCAGCAGGAGCGC



CGAAACTGGCAAACTATCGAAGGGATCAAGGAGCTAAAACAAGGCTACCTTAGTCAGGCGGTTCATC



GGATAGCCGAACTGATGGTGGCTTATAAGGCTGTAGTTGCTTTGGAGGATTTGAATATGGGGTTCAAA



CGTGGGCGGCAGAAAGTAGAAAGTTCTGTTTATCAGCAGTTTGAGAAACAGCTGATAGATAAGCTCA



ACTATCTTGTGGACAAGAAGAAAAGGCCTGAAGATATTGGAGGATTGTTGAGAGCCTATCAATTTAC



GGCCCCATTTAAGAGTTTTAAGGAAATGGGAAAGCAAAACGGCTTCTTGTTTTATATCCCGGCTTGGA



ACACGAGCAACATAGATCCGACTACTGGATTTGTTAATTTATTTCATGCCCAGTATGAAAATGTAGAT



AAAGCGAAGAGCTTCTTTCAAAAGTTTGATTCAATTAGTTACAACCCGAAGAAAGACTGGTTTGAGTT



TGCATTCGATTATAAAAACTTTACTAAAAAGGCTGAAGGAAGTCGTTCTATGTGGATATTATGCACAC



ATGGTTCCCGAATAAAGAATTTTAGAAATTCCCAGAAGAATGGTCAATGGGATTCCGAAGAATTCGC



CTTGACGGAGGCTTTTAAGTCTCTTTTTGTGCGATATGAGATAGATTATACCGCTGATTTGAAAACAG



CTATTGTGGACGAAAAGCAAAAAGACTTCTTCGTGGATCTTCTGAAGCTATTCAAATTGACAGTACAG



ATGCGCAACAGCTGGAAAGAGAAGGATTTGGATTATCTAATCTCTCCTGTAGCAGGGGCTGATGGCC



GTTTCTTCGATACAAGAGAGGGAAATAAAAGTCTGCCTAAGGATGCAGATGCCAATGGAGCTTATAA



TATTGCCCTAAAAGGACTTTGGGCTCTACGCCAGATTCGGCAAACTTCAGAAGGCGGTAAACTCAAAT



TGGCGATTTCCAATAAGGAATGGCTACAGTTTGTGCAAGAGAGATCTTACGAGAAAGACtga





SEQ ID NO: 27



atgaataatggaacaaataactttcagaattttatcggaatttcttctttgcagaagactcttaggaat



gctctcattccaaccgaaacaacacagcaatttattgttaaaaacggaataattaaagaagatgagcta



agaggagaaaatcgtcagatacttaaagatatcatggatgattattacagaggtttcatttcagaaact



ttatcgtcaattgatgatattgactggacttctttatttgagaaaatggaaattcagttaaaaaatgga



gataacaaagacactcttataaaagaacagactgaataccgtaaggcaattcataaaaaatttgcaaat



gatgatagatttaaaaatatgttcagtgcaaaattaatctcagatattcttcctgaatttgtcattcat



aacaataattattctgcatcagaaaaggaagaaaaaacacaggtaattaaattattttccagatttgca



acgtcattcaaggactattttaaaaacagggctaattgtttttcggctgatgatatatcttcatcttct



tgtcatagaatagttaatgataatgcagagatattttttagtaatgcattggtgtataggagaattgta



aaaagtctttcaaatgatgatataaataaaatatccggagatatgaaggattcattaaaggaaatgtct



ctggaagaaatttattcttatgaaaaatatggggaatttattacacaggaaggtatatctttttataat



gatatatgtggtaaagtaaattcatttatgaatttatattgccagaaaaataaagaaaacaaaaatctc



tataagctgcaaaagcttcataaacagatactgtgcatagcagatacttcttatgaggtgccgtataaa



tttgaatcagatgaagaggtttatcaatcagtgaatggatttttggacaatattagttcgaaacatatc



gttgaaagattgcgtaagattggagacaactataacggctacaatcttgataagatttatattgttagt



aaattctatgaatcagtttcacaaaagacatatagagattgggaaacaataaatactgcattagaaatt



cattacaacaatatattacccggaaatggtaaatctaaagctgacaaggtaaaaaaagcggtaaagaat



gatctgcaaaaaagcattactgaaatcaatgagcttgttagcaattataaattatgttcggatgataat



attaaagctgagacatatatacatgaaatatcacatattttgaataattttgaagcacaggagcttaag



tataatcctgaaattcatctggtggaaagtgaattgaaagcatctgaattaaaaaatgttctcgatgta



ataatgaatgcttttcattggtgttcggttttcatgacagaggagctggtagataaagataataatttt



tatgccgagttagaagagatatatgacgaaatatatccggtaatttcattgtataatcttgtgcgtaat



tatgtaacgcagaagccatatagtacaaaaaaaattaaattgaattttggtattcctacactagcggat



ggatggagtaaaagtaaagaatatagtaataatgcaattattctcatgcgtgataatttgtactattta



ggaatatttaatgcaaaaaataagcctgacaaaaagataattgaaggtaatacatcagaaaataaaggg



gattataagaagatgatttataatcttctgccaggaccaaataaaatgatccccaaggtattcctctct



tcaaaaaccggagtggaaacatataagccgtctgcctatatattggagggctataaacaaaacaagcat



attaaatcctctaaggattttgatataacattttgtcacgatttgattgattattttaagaactgtata



gcaatacatcctgaatggaagaattttggctttgatttttctgacacctccacatatgaagatatcagc



ggattttacagagaagtcgaattacaaggttataaaatcgactggacatatatcagcgaaaaggatatt



gatttgttgcaggaaaaaggacagttatatttattccaaatatataacaaagatttttccaagaaaagt



accggaaatgataatcttcatactatgtatttgaagaatttgtttagtgaagagaatttaaaggatatt



gtactgaaattaaacggtgaggcggaaatcttctttagaaaatcaagcataaagaatccaataattcat



aaaaaaggctctattcttgttaatagaacatatgaagcagaggaaaaagatcaatttggaaatatccag



atagtcagaaaaaacataccggaaaatatatatcaggagctttataaatatttcaatgataaaagtgat



aaagaactttcggatgaagcagctaagcttaagaatgtagtaggtcatcatgaggctgctacaaacata



gtaaaagattatagatatacatatgataaatattttcttcatatgcctattacaatcaattttaaagcc



aataagacaggctttattaatgacagaatattacaatatattgctaaagaaaaggatttgcatgtaata



ggcattgatcgtggtgaaagaaacctgatatatgtttcagtaattgatacttgtggaaatattgttgaa



caaaaatcgtttaacattgttaatggatatgattatcagattaagctcaagcagcaggagggggcgcga



caaatcgcacgaaaagaatggaaagaaatcggcaaaataaaagaaattaaagaaggctatttatctctt



gtaattcatgaaatttcaaagatggttattaaatataatgccataattgcaatggaggatttaagctac



ggatttaaaaaaggtcgtttcaaggttgagcgacaggtttaccagaagtttgagacaatgcttatcaac



aaactcaactatctggtatttaaagatatatccataacggaaaacggtggtcttctaaagggataccag



cttacatatattccagataaactgaaaaatgtgggtcatcaatgtggctgtatattttatgtacctgct



gcctatacatcaaaaatagatcctacaaccggatttgtaaatatattcaaatttaaagatttaacagtt



gatgcgaagagagaatttataaaaaaatttgacagtatcagatatgattcagaaaaaaatctgttttgt



tttacattcgattataataactttattacgcaaaatactgttatgtcaaagtcaagctggagtgtatat



acgtacggagttaggataaaaagaagatttgtcaatggcaggttctcaaatgaatcggatacaattgat



ataacaaaagatatggaaaaaacactcgaaatgacagatataaattggagagatggtcatgatctgagg



caggatattattgattatgaaatcgtacaacacatatttgagatttttagattgactgtacaaatgaga



aacagtttaagtgaattagaagacagggattatgaccgtttgatttctccggtgctcaatgaaaataat



atattttatgattcagctaaagcaggagatgcgttacctaaagacgcagatgctaatggtgcatattgt



atagctctaaaaggcttgtatgaaatcaaacaaattacagagaattggaaagaagacggtaagttttca



agagataaacttaaaatttccaataaggactggtttgactttattcaaaataaaaggtatttataa





SEQ ID NO: 28
atgacaaacaaatttacaaaccagtactcgctttccaaaacacttcgatttgagttgattccacaagga



aaaacattggaatttattcaagaaaaaggattgctctctcaagataaacaacgagcggagagttatcaa



gaaatgaaaaaaactattgataaatttcataaatactttatcgatttagctttaagcaatgctaaacta



actcatttagaaacttacttggaattatacaataaaagtgctgaaacaaaaaaagaacaaaaatttaaa



gacgatttaaagaaagtacaagacaatttacgaaaagaaatcgttaaatctttttcagatggtgatgca



aaatcaatttttgcaattttggataaaaaagaactgattaccgtagaacttgaaaaatggtttgaaaac



aacgaacaaaaagacatttattttgacgaaaaattcaaaacgtttactacttattttactggttttcat



caaaacagaaaaaacatgtattcggttgaacccaattctacagcaattgcttatcgattgattcatgaa



aatttacctaaatttttagaaaatgctaaagcatttgaaaaaataaaacaagtagaaagtttgcaagtt



aattttagagaattaatgggggaatttggagatgaagggctaattttcgtaaatgaattagaagaaatg



tttcaaatcaattattataatgatgtgctttcacaaaatggaattacaatttataatagtataatttca



ggatttaccaaaaatgatataaaatataaaggtctaaatgaatacataaataattacaatcaaaccaaa



gacaaaaaagaccgtttgccaaaattaaaacaattgtataaacagattttgagtgataggatttcactt



tcgtttttgcccgatgcttttacggatgggaaacaagttttgaaagccatatttgacttttataaaatc



aacttactttcttataccattgaaggacaggaagaaagccaaaatcttttactattaattcgtcagaca



attgaaaacctttctagttttgatacccaaaaaatttatctaaaaaatgatacccatttaaccactatt



tcacaacaagtatttggcgatttttcggtgttttcaactgctttaaattattggtatgaaactaaagta



aatccaaaatttgaaacggaatatagcaaagccaacgaaaaaaaacgagaaattttagataaagccaaa



gcggtatttacaaaacaagattatttttcaattgcttttttacaagaagtactttcggaatacattctt



accttagatcacacttctgatattgtaaaaaagcattcctccaactgtattgcggattattttaaaaat



cattttgtagccaaaaaagaaaatgaaaccgacaaaacctttgattttattgctaatattactgcaaaa



taccaatgtattcaaggtattttagaaaatgcagaccaatacgaagacgaactcaaacaagaccaaaaa



ttaattgataatttgaaattctttttagatgctattttagaattgttgcattttattaaacctttgcat



ttaaaatcagaaagcattaccgaaaaagacactgctttttatgatgtgtttgaaaattattacgaagca



ttgagtttgttgaccccattatataatatggtgcgaaactatgtaacgcaaaagccgtacagcaccgaa



aaaataaaattaaattttgaaaatgcacaattattgaatggttgggatgccaataaagaaggtgattac



ctaactaccattttgaaaaaagacggtaattattttttagccataatggataaaaagcataacaaagcg



tttcaaaagtttccagaaggaaaagaaaattatgaaaaaatggtgtataaactattgcctggagtaaat



aagatgttgccaaaagtatttttttccaataaaaatattgcttacttcaacccatcaaaagagttatta



gaaaactataaaaaagagacgcacaaaaaaggagacacattcaatttagaacattgtcatacgttgatc



gattttttcaaggactctttaaacaaacatgaagactggaaatactttgattttcaattttctgaaaca



aaatcgtatcaagatttgagtggtttttatagagaagtagaacatcaaggctacaaaatcaattttaaa



aatatcgattcagaatatattgatggtttggtgaacgaaggtaaattgtttctatttcaaatttacagc



aaagatttttcgcctttttccaaagggaaaccgaacatgcacactttgtattggaaagccttatttgaa



gaacaaaatttgcaaaatgtaatctataaattgaatggacaagccgaaatattttttagaaaagcctct



ataaaacctaaaaatataatattgcacaaaaagaaaattaaaattgccaaaaagcattttattgataaa



aaaacaaaaacatctgaaattgttcctgttcaaacaataaaaaacctcaatatgtactaccaaggaaaa



ataagtgaaaaagaattaacacaagatgatttaaggtatattgataattttagcattttcaatgaaaaa



aataaaacaattgatattataaaagacaaacgatttacggttgataaatttcagtttcatgtgccgatt



accatgaactttaaagcaacgggcggaagttatatcaatcaaaccgtattagaatatttgcaaaacaat



cccgaagttaagattattggattggatagaggcgaacgccatttggtatatctgacactgatagaccag



caaggaaacatcttgaaacaagaaagtttgaatacaatcaccgattctaaaatctcgacaccttatcat



aagttgttggataacaaggaaaacgagcgtgacttggctcgaaaaaattggggaacggtggaaaacatc



aaagaactcaaagaaggctacatcagtcaagtggtgcataaaattgctacgttgatgctggaagaaaat



gccattgtggtaatggaagatttgaattttggatttaaacgtggacgttttaaagtggaaaaacaaatt



tatcaaaagctggaaaaaatgttgattgacaaattgaattatttggttttaaaagacaaacaacctcag



gaattaggcggattgtacaacgcattacaactcaccaataaatttgaaagtttccaaaaaatgggtaaa



caatcgggctttttgttttatgtacccgcttggaacacctccaaaatagacccaaccacagggtttgtc



aattatttttataccaaatatgaaaatgttgacaaagccaaagccttttttgaaaaatttgaggcgatt



cgtttcaatgcagaaaagaagtattttgaatttgaagtaaaaaaatatagcgattttaacccaaaagcc



gaaggcactcaacaagcctggaccatttgcacgtatggcgaacgaatagaaaccaaacgacaaaaagac



caaaacaacaaatttgtaagcactccaattaatctaaccgaaaagatagaagactttttgggtaaaaac



caaattgtttatggtgatggtaattgcatcaaatctcaaattgctagcaaagacgacaaggcttttttt



gaaaccttattgtattggttcaaaatgactttacaaatgcgaaacagcgaaacaagaacagatatagat



tatctaatttcgcccgtgatgaatgacaacggaacattttacaacagccgagattatgaaaaattagaa



aatccaactttgcccaaagatgccgatgccaacggagcgtatcatattgccaaaaaaggattgatgctt



ttgaataaaatagaccaagccgacttgacaaaaaaagtggatttatctattagtaacagagattggttg



caatttgtacaaaaaaataaataa





SEQ ID NO: 29
atggaacaggagtactatttaggactggatatgggaaccggatctgtaggatgggctgttacagattcg



gaatatcatgtcttgcgtaaacatggaaaagcactatggggagtccgattatttgaaagtgcatcgaca



gcagaagaacgaagaatgttccgaacatcaagaagaagactagatcgaagaaactggagaattgaaatt



ttacaggaaatttttgcagaggaaataagtaagaaagatccaggatttttcttgcgaatgaaagaaagc



aaatattatccagaagataagcgagatatcaatggaaattgtccggaactgccatatgcattatttgtt



gatgacgattttacagataaagattatcataaaaaatttccgacaatttatcatctcaggaaaatgttg



atgaatacagaggagacaccggatatccggttggtgtatctggcaattcatcatatgatgaagcatagg



ggccatttcttgttatctggtgacattaatgagattaaggagttcggaacgacattttcaaaattgttg



gagaatatcaaaaatgaggaattggattggaatcttgaactgggaaaagaagaatatgctgttgtagaa



agtattttaaaagataacatgttaaaccgatccacaaagaaaaccagattaataaaagcattaaaagca



aaatcaatatgtgaaaaggctgtactgaatttattggctggtggaacggtgaaattgagtgatatattt



ggtcttgaagaattaaatgagacagaaagaccgaagatttcctttgctgataatggatacgatgattat



atcggagaagttgaaaatgagctgggagaacaattctatattatagagacggcaaaagcagtgtatgac



tgggcggtattagttgaaatattgggaaaatatacgtcaatttcagaagcgaaagtagcaacgtatgaa



aaacataaatcggatttacaatttttgaaaaagatagttcggaaatatctgacaaaggaggaatataaa



gatatttttgtaagtacgagtgacaaattgaaaaattactctgcttatataggaatgacgaaaataaat



ggaaaaaaggttgatttgcagagcaaacggtgcagtaaagaagaattctatgattttattaagaaaaac



gtacttaaaaagctagaaggacaacctgaatatgaatatttgaaagaagagctagaaagagaaacattt



ctaccaaaacaggtgaacagggataatggtgtaataccgtatcagattcatttgtacgagttgaaaaag



atattaggaaatttacgggataaaatagacctcattaaagagaacgaagataaactggttcaattattt



gaattcagaattccgtattatgttggtccgctgaataagatagatgacggaaaagagggaaaatttaca



tgggctgtacggaaaagtaatgaaaagatatatccatggaattttgaaaatgtagttgatatagaagca



agtgcagaaaaatttatccggagaatgacaaataagtgtacatatctgatgggcgaagatgtattgccg



aaggattcattgctttacagtaaatatatggttttaaatgaattaaataatgtaaagttggatggcgaa



aaattatctgtagaattgaaacaacggttgtatacagatgtattttgtaagtatcggaaagtaactgta



aagaagataaaaaattacttgaaatgtgaaggtatcatatccggcaatgtcgaaataactggaattgat



ggtgattttaaggcatcgttaacggcatatcatgattttaaagaaatcttgacaggaacagaattggct



aaaaaggacaaagaaaatattattaccaatatagtattgtttggagatgataaaaagctgctgaaaaag



agactgaatcgattatatcctcagattacgccgaatcagttgaagaaaatatgtgcgctatcctataca



ggctggggaagattttctaaaaagttcttagaagaaataacagctccagatccggaaacgggagaggta



tggaatatcattacggcattgtgggaatcgaataataatctgatgcaattattaagtaatgaatatcgg



tttatggaagaagtcgaaacatacaatatgggaaaacagactaaaacattgtcgtacgaaacagtagag



aatatgtatgtttctccatctgtgaaaagacagatatggcagacgctgaaaatcgtgaaagaattagaa



aaagtaatgaaagaatctccgaaacgtgtatttattgagatggcgagagaaaagcaagaaagtaagaga



accgaatcgcgtaaaaaacaactaatagatttgtataaggcttgtaaaaatgaagaaaaagattgggta



aaagaactgggagatcaggaagaacagaaattacgaagcgataagttgtacctatattatacgcaaaag



ggtcgttgtatgtattctggcgaggtaatagaactgaaagacttatgggataatacaaaatatgatatt



gatcatatatatccacaatctaaaacgatggatgacagtcttaataatcgcgtattggtaaaaaagaaa



tataatgcaacaaaatcagataagtatccattaaatgaaaatatacgacatgagagaaaaggcttttgg



aagtcactgttagatggagggtttataagtaaagaaaaatatgaacgcttaataagaaatacagaattg



agtccggaagaattagcaggatttattgaaaggcagattgttgaaacgaggcagagtacaaaagctgta



gcggaaatattaaagcaagtgtttccggaaagtgaaattgtatatgtcaaagcaggtacggtttcaaga



ttcagaaaagattttgaattactgaaagttcgagaagtgaatgatttgcatcacgcaaaggatgcgtat



ttaaatattgtagttggtaatagttattatgtgaaatttactaagaatgcatcatggtttataaaagaa



aatccgggacgtacttacaacttaaaaaagatgtttacatcaggttggaatattgaacgaaatggagaa



gttgcatgggaagtcgggaaaaaaggaacaattgtaacggtaaaacaaataatgaataaaaataatata



ttggtgacaagacaggttcatgaagcgaaaggtgggctgtttgatcagcagattatgaaaaaaggaaaa



ggtcagattgctataaaggaaactgatgaacgtcttgcatcaatagaaaagtatggaggctataataaa



gctgccggggcatattttatgctggtagaatctaaagataaaaaaggaaaaacaattcgaacgatagaa



tttataccattatatttaaagaataaaatcgagtcggatgaatcaatagcattgaactttttagaaaaa



ggcagaggtttgaaagaaccaaagatactattgaaaaaaattaagattgatacattatttgatgtggac



ggattcaaaatgtggttgtctggaagaacaggggacagactactatttaaatgtgcaaatcaattgatt



ttggatgagaaaataattgtaacaatgaaaaaaattgtaaagtttattcaaaggagacaagaaaataga



gaattaaaattatctgataaagatggaattgataatgaagtacttatggaaatatataacacttttgtg



gataagttagaaaacacagtgtatagaatacgattatccgaacaggcaaaaacgcttatagataaacaa



aaagaatttgaaaggttatcactagaggataaaagtagtactttgtttgaaattttacatatttttcag



tgtcaaagtagtgcggccaatttaaaaatgataggcggacctggaaaagcaggaatattagttatgaat



aataatataagtaagtgtaacaaaatttctattataaatcagtctccaacaggaattttcgaaaatgag



attgatttgttaaagat





SEQ ID NO: 30
ATGAAATCTTTCGATTCATTCACAAATCTTTATTCTCTTTCAAAAACCTTGAAATTTGAGATGAGACCT



GTCGGAAATACCCAAAAAATGCTCGACAATGCAGGAGTATTTGAAAAAGACAAACTAATTCAAAAAA



AGTACGGAAAAACAAAGCCGTATTTCGACAGACTCCACAGAGAATTTATAGAAGAAGCGCTCACGGG



GGTAGAGCTAATAGGACTAGATGAGAACTTTAGGACACTTGTTGACTGGCAAAAAGATAAGAAAAAT



AATGTCGCAATGAAAGCGTATGAAAATAGTTTGCAGCGGCTGAGAACGGAAATAGGTAAAATATTTA



ACCTAAAGGCTGAGGATTGGGTAAAGAACAAATATCCAATATTAGGGCTGAAAAATAAAAATACCGA



TATTTTATTCGAAGAGGCTGTATTCGGGATATTGAAAGCCCGATATGGAGAAGAAAAAGATACTTTTA



TAGAAGTAGAGGAAATAGATAAAACCGGCAAATCAAAGATCAATCAAATATCAATTTTCGATAGTTG



GAAAGGATTTACAGGATATTTCAAAAAATTTTTTGAAACCAGAAAGAATTTTTACAAAAACGACGGA



ACTTCTACAGCAATTGCTACAAGGATCATTGATCAAAATCTGAAAAGATTCATAGATAATCTGTCAAT



AGTTGAAAGTGTGAGACAAAAGGTTGATCTCGCCGAGACAGAAAAATCTTTCAGCATATCTCTATCGC



AATTCTTCTCAATAGACTTTTATAACAAGTGTCTCCTTCAAGATGGTATTGATTACTACAACAAGATAA



TCGGTGGAGAAACTCTCAAAAATGGCGAAAAACTAATAGGTCTCAATGAACTAATAAATCAATATAG



GCAGAATAATAAGGATCAGAAAATCCCATTTTTCAAACTTCTTGATAAACAAATTTTGAGTGAAAAGA



TATTATTTTTGGATGAAATAAAAAATGACACAGAACTGATCGAGGCGCTGAGTCAGTTCGCAAAAAC



AGCCGAAGAAAAAACAAAAATTGTCAAAAAGCTTTTTGCCGATTTTGTAGAAAATAATTCCAAATAC



GATCTTGCACAGATTTATATTTCCCAAGAAGCATTCAATACTATATCAAACAAGTGGACAAGCGAAAC



TGAGACGTTCGCTAAATATCTATTCGAAGCAATGAAGAGTGGAAAACTTGCAAAGTATGAGAAAAAA



GATAATAGCTATAAATTTCCTGATTTTATTGCCCTTTCACAGATGAAGAGTGCTTTATTAAGTATCAGC



CTTGAGGGACATTTTTGGAAAGAGAAATACTACAAAATTTCAAAATTCCAAGAGAAGACCAATTGGG



AGCAGTTTCTTGCAATTTTTCTATACGAGTTTAACTCTCTTTTCAGCGACAAAATAAATACAAAAGATG



GAGAAACAAAGCAAGTTGGATACTATCTATTTGCCAAAGACCTGCATAATCTTATCTTAAGTGAGCAG



ATTGATATTCCAAAAGATTCAAAAGTCACAATAAAAGATTTTGCCGATTCTGTACTCACAATCTACCA



AATGGCAAAATATTTTGCGGTAGAAAAAAAACGAGCGTGGCTTGCCGAGTATGAACTAGATTCATTTT



ATACCCAGCCAGACACAGGCTATTTACAGTTTTATGATAACGCCTACGAGGATATTGTGCAGGTATAC



AACAAGCTTCGAAACTATCTGACCAAAAAGCCATATAGCGAGGAGAAATGGAAGTTGAATTTTGAAA



ATTCTACGCTGGCAAATGGATGGGATAAGAACAAAGAATCTGATAATTCAGCAGTTATTCTACAAAA



AGGTGGAAAATATTATTTGGGACTGATTACTAAAGGACACAACAAAATTTTTGATGACCGTTTTCAAG



AAAAATTTATTGTGGGAATTGAAGGTGGAAAATATGAAAAAATAGTCTATAAATTTTTCCCCGACCAG



GCAAAAATGTTTCCCAAAGTGTGCTTTTCTGCAAAAGGACTCGAATTTTTTAGACCGTCTGAAGAAAT



TTTAAGAATTTATAACAATGCAGAGTTTAAAAAAGGAGAAACTTATTCAATAGATAGTATGCAGAAG



TTGATTGATTTTTATAAAGATTGCTTGACTAAATATGAAGGCTGGGCATGTTATACCTTTCGGCATCTA



AAACCCACAGAAGAATACCAAAACAATATTGGAGAGTTTTTTCGAGATGTTGCAGAGGACGGATACA



GGATTGATTTTCAAGGCATTTCAGATCAATATATTCATGAAAAAAACGAGAAAGGCGAACTTCACCTT



TTTGAAATCCACAATAAAGATTGGAATTTGGATAAGGCACGAGACGGAAAGTCAAAAACAACACAAA



AAAACCTTCATACACTCTATTTCGAATCGCTCTTTTCAAACGATAATGTTGTTCAAAACTTTCCAATAA



AACTCAATGGTCAAGCTGAAATTTTTTATAGACCGAAAACGGAAAAAGACAAATTAGAATCAAAAAA



AGATAAGAAAGGGAATAAAGTGATTGACCATAAACGCTATAGTGAGAATAAGATTTTTTTTCATGTTC



CTCTCACACTAAACCGCACTAAAAATGACTCATATCGCTTTAATGCTCAAATCAACAACTTTCTCGCA



AATAATAAAGATATCAACATCATCGGTGTAGATAGGGGAGAAAAGCATTTAGTCTATTATTCGGTGAT



TACACAAGCTAGTGACATCTTAGAAAGTGGCTCACTAAATGAGCTAAATGGCGTGAATTATGCTGAA



AAACTGGGAAAAAAGGCAGAAAATCGAGAACAAGCACGCAGAGACTGGCAAGACGTACAAGGGATC



AAAGACCTCAAGAAAGGATATATTTCACAGGTGGTGCGAAAGCTTGCTGATTTAGCAATTAAACACA



ATGCCATTATCATTCTTGAAGATTTGAATATGAGATTTAAACAAGTTCGGGGCGGTATCGAAAAATCC



ATTTATCAACAGTTAGAAAAAGCACTGATAGATAAATTAAGCTTTCTTGTAGACAAAGGTGAAAAAA



ATCCCGAGCAAGCAGGACATCTTCTGAAAGCATATCAGCTTTCGGCCCCATTTGAGACATTTCAAAAA



ATGGGCAAACAGACGGGTATAATCTTTTATACACAAGCTTCGTATACCTCAAAAAGTGACCCTGTAAC



AGGTTGGCGACCACACCTGTATCTCAAATATTTCAGTGCCAAAAAAGCAAAAGACGATATTGCAAAG



TTTACAAAAATAGAATTTGTAAACGATAGGTTTGAGCTTACCTATGATATAAAGGACTTTCAGCAAGC



AAAAGAATATCCAAATAAAACTGTTTGGAAAGTTTGCTCAAATGTAGAGAGATTCAGGTGGGACAAA



AACCTCAATCAAAACAAAGGCGGATATACTCACTACACAAATATAACTGAGAATATCCAAGAGCTTT



TTACAAAATATGGAATTGATATCACAAAAGATTTGCTCACACAGATTTCTACAATTGATGAAAAACAA



AATACCTCATTTTTTAGAGATTTTATTTTTTATTTCAACCTTATTTGCCAAATCAGAAATACCGATGATT



CTGAGATTGCTAAAAAGAATGGGAAAGATGATTTTATACTGTCACCTGTTGAGCCGTTTTTCGATAGC



CGAAAAGACAATGGAAATAAACTTCCTGAGAATGGAGATGATAACGGCGCGTATAACATAGCAAGA



AAAGGGATTGTCATACTCAACAAAATCTCACAATATTCAGAGAAAAACGAAAATTGCGAGAAAATGA



AATGGGGGGATTTGTATGTATCAAACATTGACTGGGACAATTTTGTAACCCAAGCTAATGCACGGCAT



TAA





SEQ ID NO: 31
ATGATTATCTTATATATTAGTACCTCGAATATGAACATGGAAGGAGTATTTATGGAAAATTTTAAAAA



CTTGTATCCAATAAACAAAACACTTCGATTTGAATTAAGACCCTATGGAAAAACATTGGAAAATTTTA



AAAAATCCGGACTTTTAGAAAAAGATGCCTTTAAGGCAAATAGTAGACGAAGTATGCAAGCTATAAT



CGATGAAAAATTCAAAGAGACTATCGAAGAACGCTTAAAGTACACTGAATTCAGTGAATGTGATCTT



GGAAACATGACATCAAAAGATAAAAAAATAACTGATAAAGCAGCTACAAATTTAAAAAAGCAAGTT



ATCTTATCTTTTGACGATGAAATATTTAATAATTACCTAAAACCTGATAAAAATATTGACGCATTATTT



AAAAATGATCCTTCAAATCCTGTAATCTCTACATTTAAAGGTTTTACGACATATTTTGTGAATTTTTTT



GAAATTCGAAAACATATTTTCAAGGGAGAATCATCAGGCTCAATGGCATACCGAATTATAGATGAAA



ACCTGACAACATACTTGAATAATATTGAAAAAATAAAAAAACTGCCAGAAGAATTAAAATCACAGCT



AGAAGGCATTGATCAGATTGATAAACTTAATAATTATAATGAGTTCATTACACAGTCAGGTATAACAC



ACTATAATGAAATCATCGGCGGTATATCAAAATCAGAGAATGTCAAAATACAGGGAATTAATGAAGG



AATTAATCTATACTGTCAGAAGAACAAAGTTAAACTTCCTCGACTGACTCCGCTATACAAAATGATAT



TATCAGACAGAGTTTCCAACTCTTTTGTATTAGACACTATTGAAAATGACACAGAATTAATTGAAATG



ATAAGTGATTTGATTAATAAGACTGAGATTTCGCAAGATGTTATAATGTCAGATATTCAAAATATTTT



CATAAAATACAAACAACTTGGTAATTTGCCGGGTATCTCATATTCTTCAATAGTTAATGCTATTTGCTC



GGATTATGACAACAATTTCGGAGATGGGAAGCGAAAAAAATCTTACGAAAATGATCGCAAAAAGCAT



TTGGAGACTAATGTATACTCCATAAATTATATTTCTGAATTGCTTACAGATACCGATGTTTCATCAAAT



ATCAAGATGAGATATAAAGAGCTTGAGCAAAATTATCAGGTTTGCAAAGAAAATTTTAATGCCACAA



ACTGGATGAATATTAAAAATATAAAACAATCTGAAAAAACAAACCTTATTAAAGATTTGTTAGATAT



ACTTAAATCGATTCAACGTTTCTATGATTTGTTTGATATTGTTGACGAAGATAAAAATCCAAGTGCTG



AATTTTATACCTGGTTATCAAAAAATGCTGAAAAGCTTGACTTTGAATTCAATTCTGTATATAACAAG



TCACGAAACTATCTCACCAGGAAACAATACTCTGATAAAAAAATCAAGCTGAATTTTGATTCTCCAAC



ATTGGCCAAAGGGTGGGATGCTAACAAAGAAATAGATAACTCCACGATTATAATGCGTAAATTTAAT



AATGACAGAGGCGATTATGATTACTTCCTTGGCATATGGAATAAATCCACACCTGCAAATGAAAAAA



TAATCCCACTGGAGGATAATGGATTATTCGAAAAAATGCAATATAAGCTGTATCCAGATCCTAGTAAG



ATGTTACCGAAACAATTTCTATCAAAAATATGGAAGGCAAAGCATCCTACGACACCTGAATTTGATAA



AAAATATAAAGAGGGAAGACATAAAAAAGGTCCTGATTTCGAAAAAGAATTCCTGCATGAATTGATT



GATTGCTTCAAACATGGTCTTGTTAATCACGATGAAAAATATCAGGATGTTTTTGGCTTCAATCTCCGT



AACACTGAAGATTATAATTCATATACAGAGTTTCTCGAAGATGTGGAAAGATGCAATTACAATCTTTC



ATTTAACAAAATTGCTGATACTTCAAACCTTATTAATGATGGGAAATTGTATGTATTTCAGATATGGTC



AAAAGACTTTTCTATTGATTCAAAAGGTACTAAAAACTTGAATACAATCTATTTTGAATCACTATTTTC



AGAAGAAAACATGATAGAAAAAATGTTCAAGCTTTCTGGAGAGGCTGAGATATTCTATCGACCAGCA



TCGTTGAATTATTGTGAAGATATCATAAAAAAAGGTCATCACCATGCAGAATTAAAAGATAAGTTTGA



CTATCCTATAATAAAAGATAAGCGATATTCACAAGATAAGTTTTTCTTTCATGTGCCAATGGTTATAA



ATTATAAATCTGAGAAACTGAATTCCAAAAGCCTTAACAACCGAACAAATGAAAACCTGGGACAGTT



TACACATATTATAGGTATAGACAGGGGCGAGCGGCACTTGATTTATTTAACTGTTGTTGATGTTTCCA



CTGGTGAAATCGTTGAACAGAAACATCTGGACGAAATTATCAATACTGATACCAAGGGAGTTGAACA



CAAAACCCATTATTTGAATAAATTGGAAGAAAAATCTAAAACAAGAGATAACGAGCGTAAATCATGG



GAAGCTATTGAAACTATCAAAGAATTAAAAGAAGGCTATATTTCTCATGTAATTAATGAAATACAAA



AGCTGCAAGAAAAATATAATGCCTTAATCGTAATGGAAAATCTTAACTATGGGTTCAAAAACTCACG



AATCAAAGTTGAAAAACAGGTTTATCAAAAATTCGAGACAGCATTGATTAAAAAGTTCAATTATATTA



TTGATAAAAAAGATCCAGAAACCTATATACATGGTTACCAGCTTACAAATCCTATTACCACTCTGGAT



AAGATTGGAAATCAATCTGGAATAGTGCTGTATATTCCTGCGTGGAATACTTCTAAGATAGATCCCGT



CACAGGATTTGTAAACCTTCTGTACGCAGATGATTTGAAGTATAAAAATCAGGAGCAGGCCAAATCA



TTCATTCAGAAAATAGACAACATATATTTTGAAAATGGAGAGTTTAAATTTGATATTGATTTTTCCAA



ATGGAATAATCGCTACTCAATAAGTAAAACTAAATGGACGTTAACAAGTTATGGGACTCGCATCCAG



ACATTTAGAAATCCCCAGAAAAACAATAAGTGGGATTCTGCTGAATATGATTTGACAGAAGAGTTTA



AATTAATTTTAAATATAGACGGAACGTTAAAGTCACAGGACGTAGAAACATACAAAAAATTCATGTC



TTTATTTAAACTAATGCTACAGCTTCGAAACTCTGTTACAGGAACCGACATTGATTATATGATCTCTCC



TGTCACTGATAAAACAGGAACACATTTCGATTCAAGAGAAAATATTAAAAATCTTCCTGCCGATGCAG



ATGCCAATGGTGCCTACAACATTGCGCGCAAAGGAATAATGGCTATTGAAAATATAATGAACGGTAT



AAGCGATCCACTAAAAATAAGCAACGAAGACTATTTAAAGTATATTCAGAATCAACAGGAATAA





SEQ ID NO: 32
ATGACCCAATTTGAAGGTTTTACCAATTTATACCAAGTTTCGAAGACCCTTCGTTTTGAACTGATTCCC



CAAGGAAAAACACTCAAACATATCCAGGAGCAAGGGTTCATTGAGGAGGATAAAGCTCGCAATGACC



ATTACAAAGAGTTAAAACCAATCATTGACCGCATCTATAAGACTTATGCTGATCAATGTCTCCAACTG



GTACAGCTTGACTGGGAGAATCTATCTGCAGCCATAGACTCCTATCGTAAGGAAAAAACCGAAGAAA



CACGAAATGCGCTGATTGAGGAGCAAGCAACATATAGAAATGCGATTCATGACTACTTTATAGGTCG



GACGGATAATCTGACAGATGCCATAAATAAGCGCCATGCTGAAATCTATAAAGGACTTTTTAAAGCT



GAACTTTTCAATGGAAAAGTTTTAAAGCAATTAGGGACCGTAACCACGACAGAACATGAAAATGCTC



TACTCCGTTCGTTTGACAAATTTACGACCTATTTTTCCGGCTTTTATGAAAACCGAAAAAATGTCTTTA



GCGCTGAAGATATCAGCACGGCAATTCCCCATCGAATCGTCCAGGACAATTTCCCTAAATTTAAGGAA



AACTGCCATATTTTTACAAGATTGATAACCGCAGTTCCTTCTTTGCGGGAGCATTTTGAAAATGTCAA



AAAGGCCATTGGAATCTTTGTTAGTACGTCTATTGAAGAAGTCTTTTCCTTTCCCTTTTATAATCAACT



TCTAACCCAAACGCAAATTGATCTTTATAATCAACTTCTCGGCGGCATATCTAGGGAAGCAGGCACAG



AAAAAATCAAGGGACTTAATGAAGTTCTCAATCTGGCTATCCAAAAAAATGATGAAACAGCCCATAT



AATCGCGTCCCTGCCGCATCGTTTTATTCCTCTTTTTAAACAAATTCTTTCCGATCGAAATACGTTATC



CTTTATTTTGGAAGAATTCAAAAGCGATGAGGAAGTCATCCAATCCTTCTGCAAATATAAAACCCTCT



TGAGAAACGAAAATGTACTGGAGACTGCAGAAGCCCTTTTCAATGAATTAAATTCCATTGATTTGACT



CATATCTTTATTTCCCATAAAAAGTTAGAAACCATCTCTTCAGCGCTTTGTGACCATTGGGATACCTTG



CGCAATGCACTTTACGAAAGACGGATTTCTGAACTCACTGGCAAAATAACAAAAAGTGCCAAAGAAA



AAGTTCAAAGGTCATTAAAACATGAGGATATAAATCTCCAAGAAATTATTTCTGCTGCAGGAAAAGA



ACTATCAGAAGCATTCAAACAAAAAACAAGTGAAATTCTTTCCCATGCCCATGCTGCACTTGACCAGC



CTCTTCCCACAACATTAAAAAAACAGGAAGAAAAAGAAATCCTCAAATCACAGCTCGATTCGCTTTTA



GGCCTTTATCATCTTCTTGATTGGTTTGCTGTCGATGAAAGCAATGAAGTCGACCCAGAATTCTCAGC



ACGGCTGACAGGCATTAAACTAGAAATGGAACCAAGCCTTTCGTTTTATAATAAAGCAAGAAATTAT



GCGACAAAAAAGCCCTATTCGGTGGAAAAATTTAAATTGAATTTTCAAATGCCAACCCTTGCCTCTGG



TTGGGATGTCAATAAAGAAAAAAATAATGGAGCTATTTTATTCGTAAAAAATGGTCTCTATTACCTTG



GTATCATGCCTAAACAGAAGGGGCGCTATAAAGCCCTGTCTTTTGAGCCGACAGAAAAAACATCAGA



AGGATTCGATAAGATGTACTATGACTACTTCCCAGATGCCGCAAAAATGATTCCTAAGTGTTCCACTC



AGCTAAAGGCTGTAACCGCTCATTTTCAAACTCATACCACCCCCATTCTTCTCTCAAATAATTTCATTG



AACCTCTTGAAATCACAAAAGAAATTTATGACCTGAACAATCCTGAAAAGGAGCCTAAAAAGTTTCA



AACGGCTTATGCAAAGAAGACAGGCGATCAAAAAGGCTATAGAGAAGCGCTTTGCAAATGGATTGAC



TTTACGCGGGATTTTCTCTCTAAATATACGAAAACAACTTCAATCGATTTATCTTCACTCCGCCCTTCT



TCGCAATATAAAGATTTAGGGGAATATTACGCCGAACTGAATCCGCTTCTCTATCATATCTCCTTCCA



ACGAATTGCTGAAAAGGAAATCATGGATGCTGTAGAAACGGGAAAATTGTATCTGTTCCAAATCTAC



AATAAGGATTTTGCGAAGGGCCATCACGGGAAACCAAATCTCCACACCCTGTATTGGACAGGTCTCTT



CAGTCCTGAAAACCTTGCGAAAACCAGCATCAAACTTAATGGTCAAGCAGAATTGTTCTATCGACCTA



AAAGCCGCATGAAGCGGATGGCCCATCGTCTTGGGGAAAAAATGCTGAACAAAAAACTAAAGGACC



AGAAGACACCGATTCCAGATACCCTCTACCAAGAACTGTACGATTATGTCAACCACCGGCTAAGCCAT



GATCTTTCCGATGAAGCAAGGGCCCTGCTTCCAAATGTTATCACCAAAGAAGTCTCCCATGAAATTAT



AAAGGATCGGCGGTTTACTTCCGATAAATTTTTCTTCCATGTTCCCATTACACTGAATTATCAAGCAGC



CAATAGTCCCAGTAAATTCAACCAGCGTGTCAATGCCTACCTTAAGGAGCATCCGGAAACGCCCATCA



TTGGTATCGATCGTGGAGAACGCAATCTAATCTATATTACCGTCATTGACAGTACTGGGAAAATTTTG



GAGCAGCGTTCCCTGAATACCATCCAGCAATTTGACTACCAAAAAAAATTGGACAACAGGGAAAAAG



AGCGTGTTGCCGCCCGTCAAGCCTGGTCCGTCGTCGGAACGATCAAAGACCTTAAACAAGGCTACTTG



TCACAGGTCATCCATGAAATTGTAGACCTGATGATTCATTACCAAGCTGTTGTCGTCCTTGAAAACCT



CAACTTCGGATTTAAATCAAAACGGACAGGCATTGCCGAAAAAGCAGTCTACCAACAATTTGAAAAG



ATGCTAATAGATAAACTCAACTGTTTGGTTCTCAAAGATTATCCTGCTGAGAAAGTGGGAGGCGTCTT



AAACCCGTATCAACTTACAGATCAGTTCACGAGCTTTGCAAAAATGGGCACGCAAAGCGGCTTCCTTT



TCTATGTACCGGCCCCTTATACCTCAAAGATTGATCCCCTGACTGGTTTTGTCGATCCCTTTGTATGGA



AGACCATTAAAAATCATGAAAGTCGGAAGCATTTCCTAGAAGGATTTGATTTCCTGCATTATGATGTC



AAAACAGGTGATTTTATCCTCCATTTTAAAATGAATCGGAATCTCTCTTTCCAGAGAGGGCTTCCTGG



CTTCATGCCAGCTTGGGATATTGTTTTCGAAAAGAATGAAACCCAATTTGATGCAAAAGGGACGCCCT



TCATTGCAGGAAAACGAATTGTTCCTGTAATCGAAAATCATCGTTTTACGGGTCGTTACAGAGACCTC



TATCCCGCTAATGAACTCATTGCCCTTCTGGAAGAAAAAGGCATTGTCTTTAGAGACGGAAGTAATAT



ATTACCCAAACTTTTAGAAAATGATGATTCTCATGCAATTGATACGATGGTCGCCTTGATTCGCAGTG



TACTCCAAATGAGAAACAGCAATGCCGCAACGGGGGAAGACTACATCAACTCTCCCGTTAGGGATCT



GAACGGGGTGTGTTTCGACAGTCGATTCCAAAATCCAGAATGGCCAATGGATGCGGATGCCAACGGA



GCTTATCATATTGCCTTAAAAGGGCAGCTTCTTCTGAACCACCTCAAAGAAAGCAAAGATCTGAAATT



ACAAAACGGCATCAGCAACCAAGATTGGCTGGCCTACATTCAGGAACTGAGAAACTGA





SEQ ID NO: 33
ATGGCCGTCAAATCCATCAAAGTGAAACTTCGTCTCGACGATATGCCGGAGATTCGGGCCGGTCTATG



GAAACTTCATAAGGAAGTCAATGCGGGGGTTCGATATTACACGGAATGGCTCAGTCTTCTCCGTCAAG



AGAACTTGTATCGAAGAAGTCCGAATGGGGACGGAGAGCAAGAATGTGATAAGACTGCAGAAGAAT



GCAAAGCCGAATTGTTGGAGCGGCTGCGCGCGCGTCAAGTGGAGAATGGACACCGTGGTCCGGCGGG



ATCGGACGATGAATTGCTGCAGTTGGCGCGTCAACTCTATGAGTTGTTGGTTCCGCAGGCGATAGGTG



CGAAAGGCGACGCGCAGCAAATTGCCCGCAAATTTTTGAGCCCCTTGGCCGACAAGGACGCAGTTGG



TGGGCTTGGAATCGCGAAGGCGGGGAACAAACCGCGGTGGGTTCGCATGCGCGAAGCGGGGGAACC



AGGCTGGGAAGAGGAGAAGGAGAAGGCTGAGACGAGGAAATCTGCGGATCGGACTGCGGATGTTTT



GCGCGCGCTCGCGGATTTTGGGTTAAAGCCACTGATGCGCGTATACACCGATTCTGAGATGTCATCGG



TGGAGTGGAAACCGCTTCGGAAGGGACAAGCCGTTCGGACGTGGGATAGGGACATGTTCCAACAAGC



TATCGAACGGATGATGTCGTGGGAGTCGTGGAATCAGCGCGTTGGGCAAGAGTACGCGAAACTCGTA



GAACAAAAAAATCGATTTGAGCAGAAGAATTTCGTCGGCCAGGAACATCTGGTCCATCTCGTCAATC



AGTTGCAACAAGATATGAAAGAAGCATCGCCCGGACTCGAATCGAAAGAGCAAACCGCGCACTATGT



GACGGGACGGGCATTGCGCGGATCGGACAAGGTATTTGAGAAGTGGGGGAAACTCGCCCCCGATGCA



CCTTTCGATTTGTACGACGCCGAAATCAAGAATGTGCAGAGACGTAACACGAGACGATTCGGATCAC



ATGACTTGTTCGCAAAATTGGCAGAGCCAGAGTATCAGGCCCTGTGGCGCGAAGATGCTTCGTTTCTC



ACGCGTTACGCGGTGTACAACAGCATCCTTCGCAAACTGAATCACGCCAAAATGTTCGCGACGTTTAC



TTTGCCGGATGCAACGGCGCACCCGATTTGGACTCGCTTCGATAAATTGGGTGGGAATTTGCACCAGT



ACACCTTTTTGTTCAACGAATTTGGAGAACGCAGGCACGCGATTCGTTTTCACAAGCTATTGAAAGTC



GAGAATGGTGTCGCAAGAGAAGTTGATGATGTCACCGTGCCCATTTCAATGTCAGAGCAATTGGATA



ATCTGCTTCCCAGAGATCCCAATGAACCGATTGCGCTATATTTTCGAGATTACGGAGCCGAACAGCAT



TTCACAGGTGAATTTGGTGGCGCGAAGATCCAGTGCCGCCGGGATCAGCTGGCTCATATGCACCGAC



GCAGAGGGGCGAGGGATGTTTATCTCAATGTCAGCGTACGTGTGCAGAGTCAGTCTGAGGCGCGGGG



AGAACGTCGCCCGCCGTATGCGGCAGTATTTCGTCTGGTCGGGGACAACCATCGCGCGTTTGTCCATT



TCGATAAACTATCGGATTATCTTGCGGAACATCCGGATGATGGGAAGCTCGGGTCGGAGGGGTTGCTT



TCCGGGCTGCGGGTGATGAGTGTCGATCTCGGCCTTCGCACATCTGCATCGATTTCCGTTTTTCGCGTT



GCCCGGAAGGACGAGTTGAAGCCGAACTCAAAAGGTCGTGTACCGTTTTTCTTTCCGATAAAAGGGA



ATGACAATCTCGTCGCGGTTCATGAGCGATCACAACTCTTGAAGCTGCCTGGCGAAACGGAGTCGAA



GGACCTGCGTGCTATCCGAGAAGAACGCCAACGGACATTGCGGCAGTTGCGGACGCAACTGGCGTAT



TTGCGGCTGCTCGTGCGGTGTGGGTCGGAAGATGTGGGGCGGCGTGAACGGAGTTGGGCAAAGCTTA



TCGAGCAGCCGGTGGATGCGGCCAATCACATGACACCGGATTGGCGCGAGGCTTTTGAAAACGAACT



TCAGAAGCTTAAGTCACTCCATGGTATCTGTAGCGACAAGGAATGGATGGATGCTGTCTACGAGAGC



GTTCGCCGCGTGTGGCGTCACATGGGCAAACAGGTTCGCGATTGGCGAAAGGACGTACGAAGCGGAG



AGCGGCCCAAGATTCGCGGCTATGCGAAAGACGTGGTCGGTGGAAACTCGATTGAGCAAATCGAGTA



TCTGGAACGTCAGTACAAGTTCCTCAAGAGTTGGAGCTTCTTTGGTAAGGTGTCGGGACAAGTGATTC



GTGCGGAGAAGGGATCTCGTTTTGCGATCACGCTGCGCGAACACATTGATCACGCGAAGGAAGATCG



GCTGAAGAAATTGGCGGATCGCATCATTATGGAGGCTCTCGGCTATGTGTACGCGTTGGATGAGCGCG



GCAAAGGAAAGTGGGTTGCGAAGTATCCGCCGTGCCAGCTCATCCTGCTGGAGGAATTGAGCGAGTA



CCAGTTCAATAACGACAGGCCTCCGAGCGAAAACAACCAGTTGATGCAATGGAGTCATCGCGGCGTG



TTCCAGGAGTTGATAAATCAGGCCCAAGTCCATGATTTACTCGTTGGGACGATGTATGCAGCGTTCTC



GTCGCGATTCGACGCGCGAACTGGGGCACCGGGTATCCGCTGTCGCCGGGTTCCGGCGCGTTGCACCC



AGGAGCACAATCCAGAACCATTTCCTTGGTGGCTGAACAAGTTTGTGGTGGAACATACGTTGGATGCT



TGTCCCCTACGCGCAGACGACCTCATCCCAACGGGTGAAGGAGAGATTTTTGTCTCGCCGTTCAGCGC



GGAGGAGGGGGACTTTCATCAGATTCACGCCGACCTGAATGCGGCGCAAAATCTGCAGCAGCGACTC



TGGTCTGATTTTGATATCAGTCAAATTCGGTTGCGGTGTGATTGGGGTGAAGTGGACGGTGAACTCGT



TCTGATCCCAAGGCTTACAGGAAAACGAACGGCGGATTCATATAGCAACAAGGTGTTTTATACCAAT



ACAGGTGTCACCTATTATGAGCGAGAGCGGGGGAAGAAGCGGAGAAAGGTTTTCGCGCAAGAGAAA



TTGTCGGAGGAAGAGGCGGAGTTGCTCGTGGAAGCAGACGAGGCGAGGGAGAAATCGGTCGTTTTGA



TGCGTGATCCGTCTGGCATCATCAATCGGGGAAATTGGACCAGGCAAAAGGAATTTTGGTCGATGGT



GAACCAGCGGATCGAAGGATACTTGGTCAAGCAGATTCGCTCGCGCGTTCCATTACAAGATAGTGCG



TGTGAAAACACGGGGGATATTTAA





SEQ ID NO: 34
ATGGCGACACGCAGTTTTATTTTAAAAATTGAACCAAATGAAGAAGTTAAAAAGGGATTATGGAAGA



CGCATGAGGTATTGAATCATGGAATTGCCTACTACATGAATATTCTGAAACTAATTAGACAGGAAGCT



ATTTATGAACATCATGAACAAGATCCTAAAAATCCGAAAAAAGTTTCAAAAGCAGAAATACAAGCCG



AGTTATGGGATTTTGTTTTAAAAATGCAAAAATGTAATAGTTTTACACATGAAGTTGACAAAGATGTT



GTTTTTAACATCCTGCGTGAACTATATGAAGAGTTGGTCCCTAGTTCAGTCGAGAAAAAGGGTGAAGC



CAATCAATTATCGAATAAGTTTCTGTACCCGCTAGTTGATCCGAACAGTCAAAGTGGGAAAGGGACG



GCATCATCCGGACGTAAACCTCGGTGGTATAATTTAAAAATAGCAGGCGACCCATCGTGGGAGGAAG



AAAAGAAAAAATGGGAAGAGGATAAAAAGAAAGATCCCCTTGCTAAAATCTTAGGTAAGTTAGCAG



AATATGGGCTTATTCCGCTATTTATTCCATTTACTGACAGCAACGAACCAATTGTAAAAGAAATTAAA



TGGATGGAAAAAAGTCGTAATCAAAGTGTCCGGCGACTTGATAAGGATATGTTTATCCAAGCATTAG



AGCGTTTTCTTTCATGGGAAAGCTGGAACCTTAAAGTAAAGGAAGAGTATGAAAAAGTTGAAAAGGA



ACACAAAACACTAGAGGAAAGGATAAAAGAGGACATTCAAGCATTTAAATCCCTTGAACAATATGAA



AAAGAACGGCAGGAGCAACTTCTTAGAGATACATTGAATACAAATGAATACCGATTAAGCAAAAGAG



GATTACGTGGTTGGCGTGAAATTATCCAAAAATGGCTAAAGATGGATGAAAATGAACCATCAGAAAA



ATATTTAGAAGTATTTAAAGATTATCAACGGAAACATCCACGAGAAGCCGGGGACTATTCTGTCTATG



AATTTTTAAGCAAGAAAGAAAATCATTTTATTTGGCGAAATCATCCTGAATATCCTTATTTGTATGCTA



CATTTTGTGAAATTGACAAAAAAAAGAAAGACGCTAAGCAACAGGCAACTTTTACTTTGGCTGACCC



GATTAACCATCCGTTATGGGTACGATTTGAAGAAAGAAGCGGTTCGAACTTAAACAAATATCGAATTT



TAACAGAGCAATTACACACTGAAAAGTTAAAAAAGAAATTAACAGTTCAACTTGATCGTTTAATTTAT



CCAACTGAATCCGGCGGTTGGGAGGAAAAAGGTAAAGTAGATATCGTTTTGTTGCCGTCAAGACAAT



TTTATAATCAAATCTTCCTTGATATAGAAGAAAAGGGGAAACATGCTTTTACTTATAAGGATGAAAGT



ATTAAATTCCCCCTTAAAGGTACACTTGGTGGTGCAAGAGTGCAGTTTGACCGTGACCATTTGCGGAG



ATATCCGCATAAAGTAGAATCAGGAAATGTTGGACGGATTTATTTTAACATGACAGTAAATATTGAAC



CAACTGAGAGCCCTGTTAGTAAGTCTTTGAAAATACATAGGGACGATTTCCCCAAGTTCGTTAATTTT



AAACCGAAAGAGCTCACCGAATGGATAAAAGATAGTAAAGGGAAAAAATTAAAAAGTGGTATAGAA



TCCCTTGAAATTGGTCTACGGGTGATGAGTATCGACTTAGGTCAACGTCAAGCGGCTGCTGCATCGAT



TTTTGAAGTAGTTGATCAGAAACCGGATATTGAAGGGAAGTTATTTTTTCCAATCAAAGGAACTGAGC



TTTATGCTGTTCACCGGGCAAGTTTTAACATTAAATTACCGGGTGAAACATTAGTAAAATCACGGGAA



GTATTGCGGAAAGCTCGGGAGGACAACTTAAAATTAATGAATCAAAAGTTAAACTTTCTAAGAAATG



TTCTACATTTCCAACAGTTTGAAGATATCACAGAAAGAGAGAAGCGTGTAACTAAATGGATTTCTAGA



CAAGAAAATAGTGATGTTCCTCTTGTATATCAAGATGAGCTAATTCAAATTCGTGAATTAATGTATAA



ACCCTATAAAGATTGGGTTGCCTTTTTAAAACAACTCCATAAACGGCTAGAAGTCGAGATTGGCAAAG



AGGTTAAGCATTGGCGAAAATCATTAAGTGACGGGAGAAAAGGTCTTTACGGAATCTCCCTAAAAAA



TATTGATGAAATTGATCGAACAAGGAAATTCCTTTTAAGATGGAGCTTACGTCCAACAGAACCTGGGG



AAGTAAGACGCTTGGAACCAGGACAGCGTTTTGCGATTGATCAATTAAACCACCTAAATGCATTAAA



AGAAGATCGATTAAAAAAGATGGCAAATACGATTATCATGCATGCCTTAGGTTACTGTTATGATGTAA



GAAAGAAAAAGTGGCAGGCAAAAAATCCAGCATGTCAAATTATTTTATTTGAAGATTTATCTAACTAC



AATCCTTACGAGGAAAGGTCCCGTTTTGAAAACTCAAAACTGATGAAGTGGTCACGGAGAGAAATTC



CACGACAAGTCGCCTTACAAGGTGAAATTTACGGATTACAAGTTGGGGAAGTAGGTGCCCAATTCAG



TTCAAGATTCCATGCGAAAACCGGGTCGCCGGGAATTCGTTGCAGTGTTGTAACGAAAGAAAAATTG



CAGGATAATCGCTTTTTTAAAAATTTACAAAGAGAAGGACGACTTACTCTTGATAAAATCGCAGTTTT



AAAAGAAGGAGACTTATATCCAGATAAAGGTGGAGAAAAGTTTATTTCTTTATCAAAGGATCGAAAG



TTGGTAACTACGCATGCTGATATTAACGCGGCCCAAAATTTACAGAAGCGTTTTTGGACAAGAACACA



TGGATTTTATAAAGTTTACTGCAAAGCCTATCAGGTTGATGGACAAACTGTTTATATTCCGGAGAGCA



AGGACCAAAAACAAAAAATAATTGAAGAATTTGGGGAAGGCTATTTTATTTTAAAAGATGGTGTATA



TGAATGGGGTAATGCGGGGAAACTAAAAATTAAAAAAGGTTCCTCTAAACAATCATCGAGTGAATTA



GTAGATTCGGACATACTGAAAGATTCATTTGATTTAGCAAGTGAACTTAAGGGAGAGAAACTCATGTT



ATATCGAGATCCGAGTGGAAACGTATTTCCTTCCGACAAGTGGATGGCAGCAGGAGTATTTTTTGGCA



AATTAGAAAGAATATTGATTTCTAAGTTAACAAATCAATACTCAATATCAACAATAGAAGATGATTCT



TCAAAACAATCAATGTAA





SEQ ID NO: 35
ATGCCCACCCGCACCATCAATCTGAAACTTGTTCTTGGGAAAAATCCTGAAAACGCAACATTGCGACG



CGCCCTATTTTCGACACACCGTTTGGTTAACCAAGCGACGAAACGTATTGAGGAATTCTTGTTGCTGT



GTCGTGGAGAAGCCTACAGAACAGTGGATAATGAGGGGAAGGAAGCCGAGATTCCACGTCATGCAGT



CCAAGAAGAAGCTCTTGCCTTTGCCAAAGCTGCTCAACGCCACAACGGCTGTATATCCACCTATGAAG



ACCAAGAGATTCTTGATGTACTGCGGCAACTGTACGAACGTCTTGTTCCTTCGGTCAACGAAAACAAC



GAGGCAGGCGATGCTCAAGCTGCTAACGCCTGGGTCAGTCCGCTCATGTCGGCAGAAAGCGAAGGAG



GCTTGTCGGTCTACGACAAGGTGCTTGATCCACCGCCGGTTTGGATGAAGCTTAAAGAAGAAAAGGC



TCCAGGATGGGAAGCCGCTTCTCAAATTTGGATTCAGAGTGATGAGGGACAGTCGTTACTTAATAAGC



CAGGTAGCCCTCCCCGCTGGATTCGAAAACTGCGATCTGGGCAACCGTGGCAAGATGATTTCGTCAGT



GACCAAAAGAAAAAGCAAGATGAGCTGACCAAAGGGAACGCACCACTTATAAAACAACTCAAAGAA



ATGGGGTTGTTGCCTCTTGTTAACCCATTTTTTAGACATCTTCTTGACCCTGAAGGTAAAGGCGTGAGT



CCATGGGACCGTCTTGCTGTACGCGCTGCAGTGGCTCACTTTATCTCCTGGGAAAGTTGGAATCATAG



AACACGTGCAGAATACAATTCCTTGAAACTACGGCGAGACGAGTTTGAGGCAGCATCCGACGAATTC



AAAGACGATTTTACTTTGCTCCGACAATATGAAGCCAAACGCCATAGTACATTGAAAAGCATCGCGCT



GGCCGACGATTCGAACCCTTACCGGATTGGAGTACGTTCTCTGCGTGCCTGGAACCGCGTTCGTGAAG



AATGGATAGACAAGGGTGCAACAGAAGAACAACGCGTGACCATATTGTCAAAGCTTCAAACACAACT



TCGGGGAAAATTCGGCGATCCCGATCTGTTCAACTGGCTAGCTCAGGATAGGCATGTCCATTTGTGGT



CTCCTCGGGACAGCGTGACACCATTGGTTCGCATCAATGCGGTAGATAAAGTTCTGCGTCGACGAAAA



CCGTATGCATTGATGACCTTTGCCCATCCCCGCTTCCACCCTCGATGGATACTGTACGAGGCTCCAGG



AGGAAGCAATCTCCGTCAATATGCATTGGATTGTACAGAAAACGCTCTACACATCACGTTGCCTTTGC



TTGTCGACGATGCGCACGGAACCTGGATTGAAAAAAAGATCAGGGTGCCGCTGGCACCATCCGGACA



AATTCAAGATTTAACTCTGGAAAAACTTGAGAAGAAAAAAAATCGTTTATACTACCGTTCCGGTTTTC



AGCAGTTTGCCGGCTTGGCTGGCGGAGCTGAGGTTCTTTTCCACAGACCCTATATGGAACACGACGAA



CGCAGCGAGGAGTCTCTTTTGGAACGTCCGGGAGCCGTTTGGTTCAAATTGACCCTGGATGTGGCAAC



ACAGGCTCCCCCGAACTGGCTTGATGGTAAGGGCCGTGTCCGTACACCGCCGGAGGTACATCATTTTA



AAACCGCATTGTCGAATAAAAGCAAACATACACGTACGCTGCAGCCGGGTCTCCGTGTCTTGTCAGTA



GACTTGGGCATGCGAACATTCGCCTCCTGCTCAGTATTTGAACTCATCGAGGGAAAGCCTGAGACAGG



CCGTGCCTTCCCTGTTGCCGATGAGAGATCAATGGACAGCCCGAATAAACTGTGGGCCAAGCATGAA



CGTAGTTTTAAACTGACGCTCCCCGGCGAAACCCCTTCTCGAAAGGAAGAGGAAGAGCGTAGCATAG



CAAGAGCGGAAATTTATGCACTGAAACGCGACATACAACGCCTCAAAAGCCTACTCCGCTTAGGTGA



AGAAGATAACGATAACCGTCGTGATGCATTGCTTGAACAGTTCTTTAAAGGATGGGGAGAAGAAGAC



GTTGTGCCTGGACAAGCGTTTCCACGCTCTCTTTTCCAAGGGTTGGGAGCTGCCCCGTTTCGCTCAACT



CCAGAGTTATGGCGTCAGCATTGCCAAACATATTATGACAAAGCGGAAGCCTGTCTGGCTAAACATAT



CAGTGATTGGCGCAAGCGAACTCGTCCCCGTCCGACATCGCGGGAGATGTGGTACAAAACACGTTCC



TATCATGGCGGCAAGTCCATTTGGATGTTGGAATATCTTGATGCCGTTCGAAAACTGCTTCTCAGTTG



GAGCTTACGTGGTCGTACTTACGGTGCCATTAATCGCCAGGATACAGCCCGGTTTGGTTCTTTGGCAT



CACGGCTGCTCCACCATATCAATTCCCTAAAGGAAGACCGCATCAAAACAGGAGCCGACTCTATCGTT



CAGGCTGCTCGCGGGTATATTCCTCTCCCTCATGGCAAGGGTTGGGAACAAAGATATGAGCCTTGTCA



GCTCATATTATTTGAAGACCTCGCACGATATCGCTTTCGCGTGGATCGACCTCGTCGAGAGAACAGCC



AACTCATGCAGTGGAACCATCGAGCCATCGTGGCAGAAACAACGATGCAAGCCGAACTCTACGGACA



AATTGTCGAAAATACTGCAGCGGGGTTCAGCAGTCGTTTTCACGCGGCGACAGGTGCCCCCGGTGTAC



GTTGTCGTTTTCTTCTAGAAAGAGACTTTGATAACGATTTGCCCAAACCGTACCTTCTCAGGGAACTTT



CTTGGATGCTCGGCAATACAAAAGTCGAGTCTGAAGAAGAAAAGCTTCGATTGCTGTCTGAAAAAAT



CAGGCCAGGCAGTCTTGTTCCTTGGGATGGAGGCGAACAGTTCGCTACCCTGCATCCCAAAAGACAA



ACACTTTGCGTCATTCATGCCGATATGAATGCTGCCCAAAATTTACAACGCCGGTTTTTCGGTCGATGC



GGCGAGGCCTTTCGGCTTGTTTGTCAACCCCACGGTGACGACGTGTTACGACTCGCATCCACCCCAGG



AGCTCGTCTTCTTGGAGCCCTGCAGCAGCTTGAAAATGGACAAGGAGCTTTCGAGTTGGTTCGAGACA



TGGGGTCAACAAGTCAAATGAACCGGTTCGTCATGAAGTCTTTGGGAAAAAAGAAAATAAAACCCCT



TCAGGACAACAATGGAGACGACGAGCTTGAAGACGTGTTGTCCGTACTCCCGGAGGAAGACGACACA



GGACGTATCACAGTCTTCCGCGATTCATCAGGAATCTTTTTTCCTTGCAACGTCTGGATACCGGCCAA



ACAGTTTTGGCCAGCAGTACGCGCCATGATTTGGAAGGTCATGGCTTCCCATTCTTTGGGGTGA





SEQ ID NO: 36
ATGACAAAGTTAAGACACCGACAGAAAAAATTAACACACGACTGGGCTGGCTCCAAAAAGAGGGAA



GTATTAGGCTCAAATGGCAAGCTTCAGAATCCGTTGTTAATGCCGGTTAAAAAAGGTCAGGTTACTGA



GTTCCGGAAAGCGTTTTCTGCGTATGCTCGCGCAACGAAAGGAGAAATGACTGACGGCCGAAAGAAT



ATGTTTACGCATAGTTTCGAGCCATTTAAGACAAAGCCCTCGCTTCATCAGTGTGAATTGGCAGATAA



AGCATATCAATCTTTACATTCGTATCTGCCTGGTTCTCTTGCTCATTTTCTATTATCTGCTCACGCATTA



GGTTTTCGTATTTTTTCAAAATCTGGTGAAGCAACTGCATTCCAGGCATCCTCTAAAATTGAAGCTTAC



GAATCAAAATTGGCAAGCGAATTAGCTTGTGTAGATTTATCTATTCAAAACTTGACTATTTCAACGCT



TTTTAATGCGCTTACAACGTCTGTAAGAGGGAAGGGCGAAGAAACTAGCGCTGACCCCTTAATTGCAC



GATTTTACACCTTACTTACTGGCAAGCCTCTGTCTCGAGACACTCAAGGGCCTGAACGTGATTTAGCA



GAAGTTATCTCGCGTAAGATAGCTAGTTCTTTTGGCACATGGAAAGAAATGACGGCAAACCCTCTTCA



GTCATTACAATTTTTTGAAGAGGAACTCCATGCGCTGGATGCCAATGTCTCGCTCTCACCCGCCTTCGA



CGTTTTAATTAAAATGAATGATTTGCAGGGCGATTTAAAAAATCGAACCATTGTTTTTGATCCTGACG



CCCCTGTTTTTGAATATAACGCAGAAGACCCTGCCGACATAATTATTAAACTTACAGCTCGTTACGCT



AAAGAAGCTGTCATCAAAAATCAAAACGTAGGAAATTACGTTAAAAACGCTATTACTACCACAAATG



CCAATGGTCTTGGTTGGCTTTTGAACAAAGGTTTGTCGTTACTCCCTGTCTCGACCGATGACGAATTGC



TAGAGTTTATTGGCGTTGAACGATCTCATCCCTCATGCCATGCCTTAATTGAATTGATTGCACAATTAG



AAGCCCCCGAGCTCTTTGAGAAGAACGTATTTTCAGATACTCGTTCTGAAGTTCAAGGTATGATTGAT



TCAGCTGTTTCTAATCATATTGCTCGTCTTTCCAGCTCTAGAAATAGCTTGTCAATGGATAGTGAAGAA



TTAGAACGTTTAATCAAAAGCTTTCAGATACACACACCTCATTGCTCACTTTTTATTGGCGCCCAATCA



CTTTCACAGCAGTTAGAATCTTTGCCTGAAGCCCTTCAATCGGGCGTTAATTCAGCCGATATTTTACTA



GGCTCTACTCAATATATGCTCACCAATTCTTTGGTTGAAGAGTCAATTGCAACTTATCAAAGAACACT



TAATCGCATCAATTACTTGTCAGGTGTTGCAGGTCAGATTAACGGCGCAATAAAGCGAAAAGCGATA



GATGGAGAAAAAATTCACTTGCCTGCAGCTTGGTCAGAGTTGATATCTTTACCATTTATAGGCCAGCC



TGTTATAGATGTTGAAAGCGATTTAGCTCATCTAAAAAATCAATACCAAACACTTTCAAATGAGTTTG



ATACTCTTATATCTGCTTTGCAAAAGAATTTTGATTTGAACTTTAATAAAGCGCTCCTTAATCGTACTC



AGCATTTTGAAGCCATGTGTAGAAGCACTAAGAAAAACGCTTTATCCAAACCAGAGATCGTTTCCTAT



CGCGACCTGCTTGCTCGATTAACTTCTTGTTTGTATCGAGGCTCTTTAGTTTTGCGTCGTGCCGGCATT



GAAGTGTTAAAAAAACATAAAATATTTGAGTCAAACAGCGAACTTCGTGAACATGTTCATGAAAGAA



AGCATTTCGTGTTTGTTAGTCCTCTAGATCGCAAAGCCAAGAAACTCCTTCGATTAACTGATTCGCGTC



CAGACTTGTTACATGTTATTGATGAAATATTGCAGCACGATAATCTTGAAAACAAAGACCGCGAGTCA



CTTTGGCTAGTTCGCTCTGGTTATTTGCTTGCAGGACTTCCAGATCAACTTTCTTCATCTTTTATTAACT



TGCCTATCATTACTCAAAAAGGAGATAGACGCCTTATAGACCTGATTCAGTATGATCAAATTAATCGT



GATGCTTTTGTTATGTTAGTGACCTCTGCATTCAAGTCTAATTTGTCTGGTCTGCAGTATCGTGCCAAT



AAGCAATCGTTCGTTGTTACTCGCACGCTAAGCCCTTATCTCGGCTCAAAACTTGTCTACGTACCCAA



GGATAAAGATTGGTTAGTTCCTTCTCAAATGTTTGAAGGACGATTTGCTGACATTCTTCAATCAGATTA



TATGGTCTGGAAAGATGCCGGTCGTCTTTGTGTTATTGATACTGCAAAACACCTTTCTAATATAAAGA



AGTCTGTATTTTCATCCGAAGAAGTTCTCGCTTTTTTAAGAGAACTCCCTCACCGCACATTTATCCAGA



CCGAAGTTCGCGGCCTTGGCGTTAATGTCGATGGAATTGCATTTAATAATGGTGATATTCCGTCATTA



AAAACCTTTTCAAATTGCGTTCAGGTAAAAGTTTCTCGGACTAATACATCCCTAGTTCAAACACTTAA



TCGTTGGTTTGAAGGAGGAAAAGTTTCTCCTCCGAGCATTCAATTTGAACGGGCGTATTATAAAAAAG



ACGATCAAATTCATGAAGACGCAGCGAAAAGAAAGATACGATTCCAGATGCCCGCAACTGAGTTGGT



TCATGCTTCTGACGATGCGGGGTGGACACCAAGTTATTTGCTCGGCATTGATCCTGGCGAGTATGGAA



TGGGTCTTTCATTGGTTTCGATTAATAACGGAGAAGTCTTAGATTCAGGCTTTATTCATATTAATTCTC



TGATCAATTTTGCCTCTAAAAAGAGCAACCATCAAACTAAGGTTGTTCCGCGTCAGCAGTACAAATCT



CCTTATGCAAATTATTTAGAACAATCTAAAGATTCTGCTGCTGGTGATATTGCGCATATACTCGATCG



ACTTATATACAAATTAAATGCGTTGCCTGTTTTTGAGGCTCTTTCAGGTAATTCTCAGAGTGCTGCTGA



TCAAGTTTGGACGAAAGTCTTATCGTTTTACACTTGGGGTGATAATGACGCTCAGAATTCTATTAGAA



AGCAGCATTGGTTTGGAGCCAGTCATTGGGATATCAAAGGTATGTTAAGGCAACCCCCTACGGAGAA



GAAGCCTAAACCGTATATTGCTTTTCCTGGCTCTCAGGTTTCTTCGTATGGTAATTCCCAACGTTGCTC



TTGCTGCGGTCGCAATCCTATTGAACAACTTCGAGAAATGGCAAAGGATACCTCTATTAAAGAGCTAA



AAATTCGCAATTCTGAGATACAGCTTTTTGACGGAACCATTAAATTATTTAATCCAGACCCATCCACT



GTGATAGAGAGAAGGCGACATAATCTTGGTCCATCAAGAATTCCTGTTGCTGACCGTACTTTCAAAAA



CATCAGTCCATCAAGTCTAGAATTTAAAGAATTGATTACTATCGTGTCTCGATCTATCCGTCATTCACC



TGAGTTTATCGCTAAAAAACGCGGCATAGGGTCTGAGTATTTTTGCGCTTATTCCGATTGCAACTCATC



CTTAAATTCTGAAGCTAACGCAGCTGCTAACGTAGCGCAAAAATTTCAAAAACAGTTATTTTTTGAGT



TATAA





SEQ ID NO: 37
ATGAAGAGAATTCTGAACAGTCTGAAAGTTGCTGCCTTGAGACTTCTGTTTCGAGGCAAAGGTTCTGA



ATTAGTGAAGACAGTCAAATATCCATTGGTTTCCCCGGTTCAAGGCGCGGTTGAAGAACTTGCTGAAG



CAATTCGGCACGACAACCTGCACCTTTTTGGGCAGAAGGAAATAGTGGATCTTATGGAGAAAGACGA



AGGAACCCAGGTGTATTCGGTTGTGGATTTTTGGTTGGATACCCTGCGTTTAGGGATGTTTTTCTCACC



ATCAGCGAATGCGTTGAAAATCACGCTGGGAAAATTCAATTCTGATCAGGTTTCACCTTTTCGTAAGG



TTTTGGAGCAGTCACCTTTTTTTCTTGCGGGTCGCTTGAAGGTTGAACCTGCGGAAAGGATACTTTCTG



TTGAAATCAGAAAGATTGGTAAAAGAGAAAACAGAGTTGAGAACTATGCCGCCGATGTGGAGACATG



CTTCATTGGTCAGCTTTCTTCAGATGAGAAACAGAGTATCCAGAAGCTGGCAAATGATATCTGGGATA



GCAAGGATCATGAGGAACAGAGAATGTTGAAGGCGGATTTTTTTGCTATACCTCTTATAAAAGACCCC



AAAGCTGTCACAGAAGAAGATCCTGAAAATGAAACGGCGGGAAAACAGAAACCGCTTGAATTATGT



GTTTGTCTTGTTCCTGAGTTGTATACCCGAGGTTTCGGCTCCATTGCTGATTTTCTGGTTCAGCGACTTA



CCTTGCTGCGTGACAAAATGAGTACCGACACGGCGGAAGATTGCCTCGAGTATGTTGGCATTGAGGA



AGAAAAAGGCAATGGAATGAATTCCTTGCTCGGCACTTTTTTGAAGAACCTGCAGGGTGATGGTTTTG



AACAGATTTTTCAGTTTATGCTTGGGTCTTATGTTGGCTGGCAGGGGAAGGAAGATGTACTGCGCGAA



CGATTGGATTTGCTGGCCGAAAAAGTCAAAAGATTACCAAAGCCAAAATTTGCCGGAGAATGGAGTG



GTCATCGTATGTTTCTCCATGGTCAGCTGAAAAGCTGGTCGTCGAATTTCTTCCGTCTTTTTAATGAGA



CGCGGGAACTTCTGGAAAGTATCAAGAGTGATATTCAACATGCCACCATGCTCATTAGCTATGTGGAA



GAGAAAGGAGGCTATCATCCACAGCTGTTGAGTCAGTATCGGAAGTTAATGGAACAATTACCGGCGT



TGCGGACTAAGGTTTTGGATCCTGAGATTGAGATGACGCATATGTCCGAGGCTGTTCGAAGTTACATT



ATGATACACAAGTCTGTAGCGGGATTTCTGCCGGATTTACTCGAGTCTTTGGATCGAGATAAGGATAG



GGAATTTTTGCTTTCCATCTTTCCTCGTATTCCAAAGATAGATAAGAAGACGAAAGAGATCGTTGCAT



GGGAGCTACCGGGCGAGCCAGAGGAAGGCTATTTGTTCACAGCAAACAACCTTTTCCGGAATTTTCTT



GAGAATCCGAAACATGTGCCACGATTTATGGCAGAGAGGATTCCCGAGGATTGGACGCGTTTGCGCT



CGGCCCCTGTGTGGTTTGATGGGATGGTGAAGCAATGGCAGAAGGTGGTGAATCAGTTGGTTGAATCT



CCAGGCGCCCTTTATCAGTTCAATGAAAGTTTTTTGCGTCAAAGACTGCAAGCAATGCTTACGGTCTA



TAAGCGGGATCTCCAGACTGAGAAGTTTCTGAAGCTGCTGGCTGATGTCTGTCGTCCACTCGTTGATT



TTTTCGGACTTGGAGGAAATGATATTATCTTCAAGTCATGTCAGGATCCAAGAAAGCAATGGCAGACT



GTTATTCCACTCAGTGTCCCAGCGGATGTTTATACAGCATGTGAAGGCTTGGCTATTCGTCTCCGCGA



AACTCTTGGATTCGAATGGAAAAATCTGAAAGGACACGAGCGGGAAGATTTTTTACGGCTGCATCAG



TTGCTGGGAAATCTGCTGTTCTGGATCAGGGATGCGAAACTTGTCGTGAAGCTGGAAGACTGGATGA



ACAATCCTTGTGTTCAGGAGTATGTGGAAGCACGAAAAGCCATTGATCTTCCCTTGGAGATTTTCGGA



TTTGAGGTGCCGATTTTTCTCAATGGCTATCTCTTTTCGGAACTGCGCCAGCTGGAATTGTTGCTGAGG



CGTAAGTCGGTGATGACGTCTTACAGCGTCAAAACGACAGGCTCGCCAAATAGGCTCTTCCAGTTGGT



TTACCTACCTCTAAACCCTTCAGATCCGGAAAAGAAAAATTCCAACAACTTTCAGGAGCGCCTCGATA



CACCTACCGGTTTGTCGCGTCGTTTTCTGGATCTTACGCTGGATGCATTTGCTGGCAAACTCTTGACGG



ATCCGGTAACTCAGGAACTGAAGACGATGGCCGGTTTTTACGATCATCTCTTTGGCTTCAAGTTGCCG



TGTAAACTGGCGGCGATGAGTAACCATCCAGGATCCTCTTCCAAAATGGTGGTTCTGGCAAAACCAA



AGAAGGGTGTTGCTAGTAACATCGGCTTTGAACCTATTCCCGATCCTGCTCATCCTGTGTTCCGGGTG



AGAAGTTCCTGGCCGGAGTTGAAGTACCTGGAGGGGTTGTTGTATCTTCCCGAAGATACACCACTGAC



CATTGAACTGGCGGAAACGTCGGTCAGTTGTCAGTCTGTGAGTTCAGTCGCTTTCGATTTGAAGAATC



TGACGACTATCTTGGGTCGTGTTGGTGAATTCAGGGTGACGGCAGATCAACCTTTCAAGCTGACGCCC



ATTATTCCTGAGAAAGAGGAATCCTTCATCGGGAAGACCTACCTCGGTCTTGATGCTGGAGAGCGATC



TGGCGTTGGTTTCGCGATTGTGACGGTTGACGGCGATGGGTATGAGGTGCAGAGGTTGGGTGTGCATG



AAGATACTCAGCTTATGGCGCTTCAGCAAGTCGCCAGCAAGTCTCTTAAGGAGCCGGTTTTCCAGCCA



CTCCGTAAGGGCACATTTCGTCAGCAGGAGCGCATTCGCAAAAGCCTCCGCGGTTGCTACTGGAATTT



CTATCATGCATTGATGATCAAGTACCGAGCTAAAGTTGTGCATGAGGAATCGGTGGGTTCATCCGGTC



TGGTGGGGCAGTGGCTGCGTGCATTTCAGAAGGATCTCAAAAAGGCTGATGTTCTGCCCAAGAAGGG



TGGAAAAAATGGTGTAGACAAAAAAAAGAGAGAAAGCAGCGCTCAGGATACCTTATGGGGAGGAGC



TTTCTCGAAGAAGGAAGAGCAGCAGATAGCCTTTGAGGTTCAGGCAGCTGGATCAAGCCAGTTTTGTC



TGAAGTGTGGTTGGTGGTTTCAGTTGGGGATGCGGGAAGTAAATCGTGTGCAGGAGAGTGGCGTGGT



GCTGGACTGGAACCGGTCCATTGTAACCTTCCTCATCGAATCCTCAGGAGAAAAGGTATATGGTTTCA



GTCCTCAGCAACTGGAAAAAGGCTTTCGTCCTGACATCGAAACGTTCAAAAAAATGGTAAGGGATTTT



ATGAGACCCCCCATGTTTGATCGCAAAGGTCGGCCGGCCGCGGCGTATGAAAGATTCGTACTGGGAC



GTCGTCACCGTCGTTATCGCTTTGATAAAGTTTTTGAAGAGAGATTTGGTCGCAGTGCTCTTTTCATCT



GCCCGCGGGTCGGGTGTGGGAATTTCGATCACTCCAGTGAGCAGTCAGCCGTTGTCCTTGCCCTTATT



GGTTACATTGCTGATAAGGAAGGGATGAGTGGTAAGAAGCTTGTTTATGTGAGGCTGGCTGAACTTAT



GGCTGAGTGGAAGCTGAAGAAACTGGAGAGATCAAGGGTGGAAGAACAGAGCTCGGCACAATAA





SEQ ID NO: 38
ATGGCAGAAAGCAAGCAGATGCAATGCCGCAAGTGCGGCGCAAGCATGAAGTATGAAGTAATTGGA



TTGGGCAAGAAGTCATGCAGATATATGTGCCCAGATTGCGGCAATCACACCAGCGCGCGCAAGATTC



AGAACAAGAAAAAGCGCGACAAAAAGTATGGATCCGCAAGCAAAGCGCAGAGCCAGAGGATAGCTG



TGGCTGGCGCGCTTTATCCAGACAAAAAAGTGCAGACCATAAAGACCTACAAATACCCAGCGGATCT



TAATGGCGAAGTTCATGACAGCGGCGTCGCAGAGAAGATTGCGCAGGCGATTCAGGAAGATGAGATC



GGCCTGCTTGGCCCGTCCAGCGAATACGCTTGCTGGATTGCTTCACAAAAACAGAGCGAGCCGTATTC



AGTTGTAGATTTTTGGTTTGACGCGGTGTGCGCAGGCGGAGTATTCGCGTATTCTGGCGCGCGCCTGC



TTTCCACAGTCCTCCAGTTGAGTGGCGAGGAAAGCGTTTTGCGCGCTGCTTTAGCATCTAGCCCGTTTG



TAGATGACATTAATTTGGCGCAAGCGGAAAAGTTCCTAGCCGTTAGCCGGCGCACAGGCCAAGATAA



GCTAGGCAAGCGCATTGGAGAATGTTTTGCGGAAGGCCGGCTTGAAGCGCTTGGCATCAAAGATCGC



ATGCGCGAATTCGTGCAAGCGATTGATGTGGCCCAAACCGCGGGCCAGCGGTTCGCGGCCAAGCTAA



AGATATTCGGCATCAGTCAGATGCCTGAAGCCAAGCAATGGAACAATGATTCCGGGCTCACTGTATGT



ATTTTGCCGGATTATTATGTCCCGGAAGAAAACCGCGCGGACCAGCTGGTTGTTTTGCTTCGGCGCTT



ACGCGAGATCGCGTATTGCATGGGAATTGAGGATGAAGCAGGATTTGAGCATCTAGGCATTGACCCT



GGTGCTCTTTCCAATTTTTCCAATGGCAATCCAAAGCGAGGATTTCTCGGCCGCCTGCTCAATAATGA



CATTATAGCGCTGGCAAACAACATGTCAGCCATGACGCCGTATTGGGAAGGCAGAAAAGGCGAGTTG



ATTGAGCGCCTTGCATGGCTTAAACATCGCGCTGAAGGATTGTATTTGAAAGAGCCACATTTCGGCAA



CTCCTGGGCAGACCACCGCAGCAGGATTTTCAGTCGCATTGCGGGCTGGCTTTCCGGATGCGCGGGCA



AGCTCAAGATTGCCAAGGATCAGATTTCAGGCGTGCGTACGGATTTGTTTCTGCTCAAGCGCCTTCTG



GATGCGGTACCGCAAAGCGCGCCGTCGCCGGACTTTATTGCTTCCATCAGCGCGCTGGATCGGTTTTT



GGAAGCGGCAGAAAGCAGCCAGGATCCGGCAGAACAGGTACGCGCTTTGTACGCGTTTCATCTGAAC



GCGCCTGCGGTCCGATCCATCGCCAACAAGGCGGTACAGAGGTCTGATTCCCAGGAGTGGCTTATCA



AGGAACTGGATGCTGTAGATCACCTTGAATTCAACAAAGCATTTCCGTTTTTTTCGGATACAGGAAAG



AAAAAGAAGAAAGGAGCGAATAGCAACGGAGCGCCTTCTGAAGAAGAATACACGGAAACAGAATCC



ATTCAACAACCAGAAGATGCAGAGCAGGAAGTGAATGGTCAAGAAGGAAATGGCGCTTCAAAGAAC



CAGAAAAAGTTTCAGCGCATTCCTCGATTTTTCGGGGAAGGGTCAAGGAGTGAGTATCGAATTTTAAC



AGAAGCGCCGCAATATTTTGACATGTTCTGCAATAATATGCGCGCGATCTTTATGCAGCTAGAGAGTC



AGCCGCGCAAGGCGCCTCGTGATTTCAAATGCTTTCTGCAGAATCGTTTGCAGAAGCTTTACAAGCAA



ACCTTTCTCAATGCTCGCAGTAATAAATGCCGCGCGCTTCTGGAATCCGTCCTTATTTCATGGGGAGA



ATTTTATACTTATGGCGCGAATGAAAAGAAGTTTCGTCTGCGCCATGAAGCGAGCGAGCGCAGCTCG



GATCCGGACTATGTGGTTCAGCAGGCATTGGAAATCGCGCGCCGGCTTTTCTTGTTCGGATTTGAGTG



GCGCGATTGCTCTGCTGGAGAGCGCGTGGATTTGGTTGAAATCCACAAAAAAGCAATCTCATTTTTGC



TTGCAATCACTCAGGCCGAGGTTTCAGTTGGTTCCTATAACTGGCTTGGGAATAGCACCGTGAGCCGG



TATCTTTCGGTTGCTGGCACAGACACATTGTACGGCACTCAACTGGAGGAGTTTTTGAACGCCACAGT



GCTTTCACAGATGCGTGGGCTGGCGATTCGGCTTTCATCTCAGGAGTTAAAAGACGGATTTGATGTTC



AGTTGGAGAGTTCGTGCCAGGACAATCTCCAGCATCTGCTGGTGTATCGCGCTTCGCGCGACTTGGCT



GCGTGCAAACGCGCTACATGCCCGGCTGAATTGGATCCGAAAATTCTTGTTCTGCCGGTTGGTGCGTT



TATCGCGAGCGTAATGAAAATGATTGAGCGTGGCGATGAACCATTAGCAGGCGCGTATTTGCGTCATC



GGCCGCATTCATTCGGCTGGCAGATACGGGTTCGTGGAGTGGCGGAAGTAGGCATGGATCAGGGCAC



AGCGCTAGCATTCCAGAAGCCGACTGAATCAGAGCCGTTTAAAATAAAGCCGTTTTCCGCTCAATACG



GCCCAGTACTTTGGCTTAATTCTTCATCCTATAGCCAGAGCCAGTATCTGGATGGATTTTTAAGCCAGC



CAAAGAATTGGTCTATGCGGGTGCTACCTCAAGCCGGATCAGTGCGCGTGGAACAGCGCGTTGCTCTG



ATATGGAATTTGCAGGCAGGCAAGATGCGGCTGGAGCGCTCTGGAGCGCGCGCGTTTTTCATGCCAGT



GCCATTCAGCTTCAGGCCGTCTGGTTCAGGAGATGAAGCAGTATTGGCGCCGAATCGGTACTTGGGAC



TTTTTCCGCATTCCGGAGGAATAGAATACGCGGTGGTGGATGTATTAGATTCCGCGGGTTTCAAAATT



CTTGAGCGCGGTACGATTGCGGTAAATGGCTTTTCCCAGAAGCGCGGCGAACGCCAAGAGGAGGCAC



ACAGAGAAAAACAGAGACGCGGAATTTCTGATATAGGCCGCAAGAAGCCGGTGCAAGCTGAAGTTG



ACGCAGCCAATGAATTGCACCGCAAATACACCGATGTTGCCACTCGTTTAGGGTGCAGAATTGTGGTT



CAGTGGGCGCCCCAGCCAAAGCCGGGCACAGCGCCGACCGCGCAAACAGTATACGCGCGCGCAGTGC



GGACCGAAGCGCCGCGATCTGGAAATCAAGAGGATCATGCTCGTATGAAATCCTCTTGGGGATATAC



CTGGGGCACCTATTGGGAGAAGCGCAAACCAGAGGATATTTTGGGCATCTCAACCCAAGTATACTGG



ACCGGCGGTATAGGCGAGTCATGTCCCGCAGTCGCGGTTGCGCTTTTGGGGCACATTAGGGCAACATC



CACTCAAACTGAATGGGAAAAAGAGGAGGTTGTATTCGGTCGACTGAAGAAGTTCTTTCCAAGCTAG





SEQ ID NO: 39
ATGGAAAAGAGAATAAACAAGATACGAAAGAAACTATCGGCCGATAATGCCACAAAGCCTGTGAGC



AGGAGCGGCCCCATGAAAACACTCCTTGTCCGGGTCATGACGGACGACTTGAAAAAAAGACTGGAGA



AGCGTCGGAAAAAGCCGGAAGTTATGCCGCAGGTTATTTCAAATAACGCAGCAAACAATCTTAGAAT



GCTCCTTGATGACTATACAAAGATGAAGGAGGCGATACTACAAGTTTACTGGCAGGAATTTAAGGAC



GACCATGTGGGCTTGATGTGCAAATTTGCCCAGCCTGCTTCCAAAAAAATTGACCAGAACAAACTAA



AACCGGAAATGGATGAAAAAGGAAATCTAACAACTGCCGGTTTTGCATGTTCTCAATGCGGTCAGCC



GCTATTTGTTTATAAGCTTGAACAGGTGAGTGAAAAAGGCAAGGCTTATACAAATTACTTCGGCCGGT



GTAATGTGGCCGAGCATGAGAAATTGATTCTTCTTGCTCAATTAAAACCTGAAAAAGACAGTGACGA



AGCAGTGACATACTCCCTTGGCAAATTCGGCCAGAGGGCATTGGACTTTTATTCAATCCACGTAACAA



AAGAATCCACCCATCCAGTAAAGCCCCTGGCACAGATTGCGGGCAACCGCTATGCAAGCGGACCTGT



TGGCAAGGCCCTTTCCGATGCCTGTATGGGCACTATAGCCAGTTTTCTTTCGAAATATCAAGACATCA



TCATAGAACATCAAAAGGTTGTGAAGGGTAATCAAAAGAGGTTAGAGAGTCTCAGGGAATTGGCAGG



GAAAGAAAATCTTGAGTACCCATCGGTTACACTGCCGCCGCAGCCGCATACGAAAGAAGGGGTTGAC



GCTTATAACGAAGTTATTGCAAGGGTACGTATGTGGGTTAATCTTAATCTGTGGCAAAAGCTGAAGCT



CAGCCGTGATGACGCAAAACCGCTACTGCGGCTAAAAGGATTCCCATCTTTCCCTGTTGTGGAGCGGC



GTGAAAACGAAGTTGACTGGTGGAATACGATTAATGAAGTAAAAAAACTGATTGACGCTAAACGAGA



TATGGGACGGGTATTCTGGAGCGGCGTTACCGCAGAAAAGAGAAATACCATCCTTGAAGGATACAAC



TATCTGCCAAATGAGAATGACCATAAAAAGAGAGAGGGCAGTTTGGAAAACCCTAAGAAGCCTGCCA



AACGCCAGTTTGGAGACCTCTTGCTGTATCTTGAAAAGAAATATGCCGGAGACTGGGGAAAGGTCTTC



GATGAGGCATGGGAGAGGATAGATAAGAAAATAGCCGGACTCACAAGCCATATAGAGCGCGAAGAA



GCAAGAAACGCGGAAGACGCTCAATCCAAAGCCGTACTTACAGACTGGCTAAGGGCAAAGGCATCAT



TTGTTCTTGAAAGACTGAAGGAAATGGATGAAAAGGAATTCTATGCGTGTGAAATCCAACTTCAAAA



ATGGTATGGCGATCTTCGAGGCAACCCGTTTGCCGTTGAAGCTGAGAATAGAGTTGTTGATATAAGCG



GGTTTTCTATCGGAAGCGATGGCCATTCAATCCAATACAGAAATCTCCTTGCCTGGAAATATCTGGAG



AACGGCAAGCGTGAATTCTATCTGTTAATGAATTATGGCAAGAAAGGGCGCATCAGATTTACAGATG



GAACAGATATTAAAAAGAGCGGCAAATGGCAGGGACTATTATATGGCGGTGGCAAGGCAAAGGTTAT



TGATCTGACTTTCGACCCCGATGATGAACAGTTGATAATCCTGCCGCTGGCCTTTGGCACAAGGCAAG



GCCGCGAGTTTATCTGGAACGATTTGCTGAGTCTTGAAACAGGCCTGATAAAGCTCGCAAACGGAAG



AGTTATCGAAAAAACAATCTATAACAAAAAAATAGGGCGGGATGAACCGGCTCTATTCGTTGCCTTA



ACATTTGAGCGCCGGGAAGTTGTTGATCCATCAAATATAAAGCCTGTAAACCTTATAGGCGTTGACCG



CGGCGAAAACATCCCGGCGGTTATTGCATTGACAGACCCTGAAGGTTGTCCTTTACCGGAATTCAAGG



ATTCATCAGGGGGCCCAACAGACATCCTGCGAATAGGAGAAGGATATAAGGAAAAGCAGAGGGCTA



TTCAGGCAGCAAAGGAGGTAGAGCAAAGGCGGGCTGGCGGTTATTCACGGAAGTTTGCATCCAAGTC



GAGGAACCTGGCGGACGACATGGTGAGAAATTCAGCGCGAGACCTTTTTTACCATGCCGTTACCCAC



GATGCCGTCCTTGTCTTTGAAAACCTGAGCAGGGGTTTTGGAAGGCAGGGCAAAAGGACCTTCATGA



CGGAAAGACAATATACAAAGATGGAAGACTGGCTGACAGCGAAGCTCGCATACGAAGGTCTTACGTC



AAAAACCTACCTTTCAAAGACGCTGGCGCAATATACGTCAAAAACATGCTCCAACTGCGGGTTTACTA



TAACGACTGCCGATTATGACGGGATGTTGGTAAGGCTTAAAAAGACTTCTGATGGATGGGCAACTAC



CCTCAACAACAAAGAATTAAAAGCCGAAGGCCAGATAACGTATTATAACCGGTATAAAAGGCAAACC



GTGGAAAAAGAACTCTCCGCAGAGCTTGACAGGCTTTCAGAAGAGTCGGGCAATAATGATATTTCTA



AGTGGACCAAGGGTCGCCGGGACGAGGCATTATTTTTGTTAAAGAAAAGATTCAGCCATCGGCCTGTT



CAGGAACAGTTTGTTTGCCTCGATTGCGGCCATGAAGTCCACGCCGATGAACAGGCAGCCTTGAATAT



TGCAAGGTCATGGCTTTTTCTAAACTCAAATTCAACAGAATTCAAAAGTTATAAATCGGGTAAACAGC



CCTTCGTTGGTGCTTGGCAGGCCTTTTACAAAAGGAGGCTTAAAGAGGTATGGAAGCCCAACGCC





SEQ ID NO: 40
ATGAAAAGGATAAATAAAATACGAAGGAGATTGGTAAAGGATAGCAACACGAAAAAAGCCGGCAAA



ACCGGCCCTATGAAAACCTTGCTCGTTCGGGTTATGACACCTGACCTGAGAGAAAGGTTAGAGAATCT



TCGCAAAAAGCCGGAAAACATTCCTCAGCCCATTTCAAATACTTCACGTGCAAATTTAAATAAACTCC



TCACTGACTATACGGAAATGAAGAAAGCAATCCTGCATGTTTATTGGGAAGAGTTCCAAAAAGACCC



TGTCGGATTGATGAGCAGGGTTGCACAACCAGCGCCCAAGAATATTGATCAGAGAAAATTGATTCCG



GTGAAGGACGGAAATGAGAGACTAACAAGTTCTGGATTTGCCTGTTCTCAGTGCTGTCAACCCCTCTA



TGTTTATAAGCTTGAACAAGTGAATGACAAGGGTAAGCCCCATACAAATTACTTTGGCCGTTGTAATG



TCTCCGAGCATGAACGTTTGATATTGCTCTCGCCGCATAAACCGGAGGCAAATGACGAGCTAGTAACG



TATTCGTTGGGGAAGTTCGGTCAAAGGGCATTGGACTTTTATTCAATCCACGTAACAAGAGAATCGAA



CCATCCTGTAAAGCCGCTAGAACAGATCGGTGGCAATAGCTGCGCAAGTGGTCCCGTTGGTAAGGCTT



TATCTGATGCCTGTATGGGAGCAGTAGCCAGTTTCCTTACAAAGTACCAGGACATCATCCTCGAACAC



CAAAAGGTTATAAAAAAAAACGAAAAGAGATTGGCAAATCTAAAGGATATAGCAAGTGCAAACGGG



CTTGCATTTCCTAAAATCACTCTTCCACCGCAACCGCATACAAAAGAAGGGATTGAAGCTTATAACAA



TGTTGTTGCTCAGATAGTGATCTGGGTAAACCTGAATCTTTGGCAGAAACTCAAAATTGGCAGGGATG



AGGCAAAGCCCTTACAGCGGCTTAAGGGTTTTCCGTCCTTCCCTCTTGTTGAACGCCAGGCGAATGAG



GTTGATTGGTGGGATATGGTCTGTAATGTCAAAAAGTTGATTAACGAAAAGAAAGAGGACGGGAAGG



TCTTCTGGCAAAATCTTGCTGGATATAAAAGGCAGGAAGCCTTGCTTCCATATCTTTCGTCTGAAGAA



GACCGTAAAAAAGGAAAAAAGTTTGCGCGTTATCAGTTTGGTGACCTTTTGCTTCACCTTGAAAAGAA



ACACGGTGAAGATTGGGGCAAAGTTTATGATGAGGCATGGGAAAGAATAGATAAAAAAGTTGAAGG



TCTGAGTAAGCACATAAAGTTGGAGGAAGAAAGAAGGTCTGAAGATGCTCAATCAAAGGCTGCCCTC



ACTGATTGGCTCAGGGCAAAGGCCTCTTTTGTTATTGAAGGGCTCAAAGAAGCTGATAAGGATGAGTT



TTGCAGGTGTGAGTTAAAGCTTCAAAAGTGGTATGGAGATTTGAGAGGAAAACCATTTGCTATAGAA



GCAGAGAACAGCATTTTAGATATAAGCGGATTTTCTAAACAGTATAATTGTGCATTTATATGGCAGAA



AGACGGCGTAAAGAAGTTAAATCTTTATTTAATAATAAATTACTTCAAAGGTGGTAAGCTACGCTTCA



AAAAAATCAAGCCAGAAGCTTTTGAAGCAAATAGGTTTTATACAGTAATTAATAAAAAAAGCGGTGA



GATTGTGCCTATGGAGGTCAACTTCAATTTTGATGACCCGAATTTGATAATTCTGCCTTTGGCCTTTGG



AAAAAGGCAGGGGAGGGAGTTTATCTGGAACGACCTATTGAGCCTTGAGACGGGTTCATTGAAACTC



GCCAATGGCAGGGTTATTGAAAAAACGCTCTATAACAGAAGGACGAGACAGGATGAACCAGCACTTT



TTGTTGCCCTGACATTTGAAAGAAGAGAGGTGCTTGACTCATCGAATATAAAACCGATGAATCTGATA



GGAATAGACCGGGGAGAAAATATCCCGGCAGTCATAGCATTAACAGACCCGGAAGGATGCCCCTTGT



CAAGATTCAAAGATTCATTGGGCAATCCAACGCATATTTTGCGAATAGGAGAAAGTTATAAGGAAAA



ACAACGGACTATTCAGGCTGCTAAAGAAGTTGAACAAAGGCGGGCAGGCGGATATTCGAGAAAATAT



GCATCAAAGGCGAAGAATCTGGCGGACGATATGGTAAGAAATACAGCTCGTGACCTCTTATATTATG



CTGTTACTCAAGATGCAATGCTCATTTTTGAAAATCTTTCCCGCGGTTTTGGTAGACAAGGCAAGAGG



ACTTTTATGGCGGAAAGGCAGTACACGAGGATGGAAGACTGGCTGACTGCAAAGCTTGCCTATGAAG



GTCTGCCATCAAAAACCTATCTTTCAAAGACTCTGGCACAGTATACCTCAAAGACATGTTCTAATTGT



GGTTTTACAATCACAAGTGCAGATTATGACAGGGTGCTCGAAAAGCTCAAGAAGACGGCTACTGGAT



GGATGACTACAATCAATGGAAAAGAGTTAAAAGTTGAAGGACAGATAACATACTATAACCGGTATAA



AAGGCAGAATGTGGTAAAAGACCTCTCTGTAGAGCTGGATAGACTTTCGGAAGAGTCGGTAAATAAT



GATATTTCTAGTTGGACAAAAGGCCGCAGTGGTGAAGCTTTATCTCTGCTAAAAAAGAGATTTAGTCA



CAGGCCGGTGCAGGAAAAGTTTGTTTGCCTGAACTGTGGTTTTGAAACCCATGCAGACGAACAAGCA



GCACTGAATATTGCAAGGTCGTGGCTCTTTCTCCGTTCTCAAGAATATAAGAAGTATCAAACCAATAA



AACGACCGGAAATACTGACAAAAGGGCATTTGTTGAAACATGGCAATCCTTTTACAGAAAGAAGCTC



AAAGAAGTATGGAAACCA





SEQ ID NO: 41
ATGGGTAAAATGTATTACCTTGGTTTAGACATTGGCACGAATTCCGTGGGCTACGCGGTGACCGACCC



CTCATACCACCTGCTGAAGTTTAAGGGGGAACCAATGTGGGGTGCGCACGTATTTGCCGCCGGTAATC



AGAGCGCGGAACGACGCTCGTTCCGCACATCGCGTCGTCGTTTGGACCGACGCCAACAGCGCGTTAA



ACTGGTACAGGAGATTTTTGCCCCGGTGATTAGTCCGATCGACCCACGCTTCTTCATTCGTCTGCATGA



ATCCGCCCTGTGGCGCGATGACGTCGCGGAGACGGATAAACATATCTTTTTCAATGATCCTACCTATA



CCGATAAGGAATATTATAGCGATTACCCGACTATCCATCACCTGATCGTTGATCTGATGGAAAGCTCT



GAGAAACACGATCCGCGGCTGGTGTACCTTGCAGTGGCGTGGTTAGTGGCACACCGTGGTCATTTTCT



GAACGAGGTGGACAAGGATAATATTGGAGATGTGTTGTCGTTCGACGCATTTTATCCGGAGTTTCTCG



CGTTCCTGTCGGACAACGGTGTATCACCGTGGGTGTGCGAAAGCAAAGCGCTGCAGGCGACCTTGCT



GAGCCGTAACTCAGTGAACGACAAATATAAAGCCCTTAAGTCTCTGATCTTCGGATCCCAGAAACCTG



AAGATAACTTCGATGCCAATATTTCGGAAGATGGACTCATTCAACTGCTGGCCGGCAAAAAGGTAAA



AGTTAACAAACTGTTCCCTCAGGAATCGAACGATGCATCCTTCACATTGAATGATAAAGAAGACGCG



ATAGAAGAAATCCTGGGTACGCTTACACCAGATGAATGTGAATGGATTGCGCATATACGCCGCCTTTT



TGACTGGGCTATCATGAAACATGCTCTGAAAGATGGCAGGACTATTAGCGAGTCAAAAGTCAAACTG



TATGAGCAGCACCATCACGATCTGACCCAACTTAAATACTTCGTGAAAACCTACCTTGCAAAAGAATA



CGACGATATTTTCCGCAACGTGGATAGCGAAACAACGAAAAACTATGTAGCGTATTCCTATCATGTGA



AAGAGGTGAAAGGCACTCTGCCTAAAAATAAGGCAACGCAAGAAGAGTTTTGTAAGTATGTCCTGGG



CAAGGTTAAAAACATTGAATGCTCTGAAGCAGACAAGGTTGACTTTGATGAGATGATTCAGCGTCTTA



CCGACAACTCTTTTATGCCTAAGCAGGTTTCGGGCGAAAACCGCGTTATTCCTTATCAGTTATATTATT



ATGAACTGAAGACAATTCTGAATAAAGCAGCCTCGTACCTGCCTTTCCTGACGCAGTGTGGAAAAGAT



GCAATTTCGAACCAGGACAAACTACTGTCGATCATGACGTTCCGTATTCCTTACTTCGTCGGACCCTTG



CGAAAAGATAATTCGGAACATGCATGGCTCGAACGAAAGGCCGGTAAGATTTATCCGTGGAACTTTA



ACGACAAAGTGGACTTGGATAAATCAGAAGAAGCGTTCATTCGCCGAATGACCAATACCTGTACCTA



TTATCCCGGCGAAGATGTTTTACCGTTGGATTCGCTGATCTATGAGAAATTTATGATTTTAAATGAAAT



CAATAATATTCGTATTGACGGCTACCCGATTAGTGTTGACGTTAAACAGCAGGTTTTTGGCTTGTTCGA



AAAAAAACGACGCGTAACCGTGAAAGATATTCAGAACCTGCTGCTGTCTCTCGGAGCTCTGGACAAA



CACGGGAAGCTGACAGGCATCGATACCACTATCCACTCAAACTATAATACGTATCACCATTTTAAATC



TCTCATGGAACGCGGCGTCCTGACCCGGGATGACGTGGAACGCATCGTTGAAAGGATGACCTACAGC



GACGATACTAAGCGTGTGCGTCTGTGGCTGAATAACAACTATGGTACTTTAACCGCCGACGATGTGAA



ACACATTTCGCGTCTGCGCAAACACGATTTTGGCCGTTTATCCAAAATGTTCTTAACAGGTCTGAAGG



GTGTCCATAAGGAGACCGGTGAACGTGCCTCCATACTGGATTTCATGTGGAACACGAACGATAACCT



GATGCAGCTCCTTTCCGAATGCTACACGTTCAGTGATGAAATCACAAAGCTGCAAGAGGCGTATTATG



CAAAAGCCCAGTTGTCTTTAAACGATTTTTTAGACTCGATGTACATCTCTAACGCGGTGAAACGTCCG



ATTTACAGAACTCTGGCAGTGGTGAACGATATTCGAAAAGCATGTGGGACGGCCCCTAAACGCATTTT



CATCGAAATGGCTCGTGATGGTGAATCAAAAAAAAAGAGAAGTGTTACACGTCGCGAGCAGATCAAA



AACCTGTACCGCTCGATTCGTAAAGATTTCCAGCAGGAAGTTGATTTTCTGGAAAAGATCCTGGAAAA



TAAATCTGATGGTCAACTTCAGTCAGATGCTTTGTATCTTTACTTTGCACAATTAGGGCGCGATATGTA



CACGGGCGATCCAATAAAGCTGGAGCACATCAAAGATCAGAGTTTCTATAACATAGACCATATTTAC



CCGCAGTCTATGGTGAAAGACGATTCCCTAGATAACAAAGTGCTGGTGCAAAGCGAAATTAACGGCG



AGAAAAGCTCGCGATACCCTTTGGACGCCGCGATCCGCAATAAAATGAAGCCCCTTTGGGACGCTTA



CTATAATCATGGCCTGATCTCCTTAAAGAAATACCAGCGTCTAACGCGCTCGACCCCGTTTACCGATG



ATGAAAAATGGGACTTTATTAATCGCCAGTTAGTGGAAACCCGTCAATCTACCAAAGCGCTGGCCATT



TTGTTGAAGCGTAAGTTTCCAGACACCGAAATTGTGTATTCGAAGGCGGGGTTATCGTCCGACTTCAG



ACATGAATTCGGCCTTGTAAAAAGTCGCAATATTAATGATTTGCACCACGCTAAAGACGCATTCTTGG



CTATCGTTACCGGCAATGTGTACCATGAAAGATTCAATCGCAGATGGTTTATGGTGAACCAGCCGTAC



TCAGTTAAAACTAAAACTCTTTTTACCCACAGCATAAAGAATGGCAACTTCGTTGCCTGGAACGGCGA



AGAAGATCTCGGTCGTATTGTAAAAATGCTGAAGCAAAACAAAAATACCATTCACTTCACGCGCTTCT



CCTTCGATCGCAAAGAAGGATTATTTGATATCCAACCTCTGAAAGCCAGCACCGGCTTAGTCCCACGA



AAAGCCGGTCTGGATGTCGTTAAATACGGCGGATATGACAAATCTACCGCGGCCTATTACCTGCTGGT



GAGGTTCACGCTCGAGGACAAGAAAACCCAGCACAAGCTGATGATGATTCCTGTAGAAGGCCTGTAC



AAGGCTCGCATTGATCATGACAAGGAATTTCTTACCGATTATGCGCAAACGACTATAAGCGAAATCCT



ACAGAAAGATAAACAGAAAGTGATCAATATTATGTTTCCAATGGGTACGAGGCATATAAAACTCAAT



TCAATGATTAGTATCGATGGCTTCTATCTTAGTATCGGCGGAAAGTCCTCTAAAGGTAAGTCAGTTCT



ATGTCACGCAATGGTTCCACTGATCGTCCCTCACAAAATCGAATGTTACATTAAAGCAATGGAAAGCT



TCGCCCGGAAGTTTAAAGAAAACAACAAGCTGCGCATCGTAGAAAAATTCGATAAAATCACCGTTGA



AGACAACCTGAATCTCTACGAGCTCTTTCTCCAAAAACTGCAGCATAATCCCTATAATAAGTTTTTTTC



GACACAGTTTGACGTACTGACGAACGGCCGTTCTACTTTCACAAAACTGTCGCCGGAGGAACAGGTA



CAGACGCTCTTGAACATTTTAAGTATCTTTAAAACATGCCGCAGTTCGGGTTGCGACCTGAAATCCAT



CAACGGCAGTGCCCAGGCAGCGCGCATCATGATTAGCGCTGACTTAACTGGACTGTCGAAAAAATAT



TCAGATATTAGGTTGGTTGAACAGTCAGCTTCTGGTTTGTTCGTATCCAAAAGTCAGAACTTACTGGA



GTATCTCTAA





SEQ ID NO: 42
ATGTCATCGCTCACGAAATTCACTAACAAATACTCTAAACAGCTCACCATTAAGAATGAACTCATCCC



AGTTGGCAAAACACTGGAGAACATCAAAGAGAATGGTCTGATAGATGGCGACGAACAGCTGAATGA



GAATTATCAGAAGGCGAAAATTATTGTGGATGATTTTCTGCGGGACTTCATTAATAAAGCACTGAATA



ATACGCAGATCGGGAACTGGCGCGAACTGGCGGATGCCCTTAATAAAGAGGATGAAGATAACATCGA



GAAATTGCAGGATAAAATTCGGGGAATCATTGTATCCAAATTTGAAACGTTTGATCTGTTTAGCAGCT



ATTCTATTAAGAAAGATGAAAAGATTATTGACGACGACAATGATGTTGAAGAAGAGGAACTGGATCT



GGGCAAGAAGACCAGCTCATTTAAATACATATTTAAAAAAAACCTGTTTAAGTTAGTGTTGCCATCCT



ACCTGAAAACCACAAACCAGGACAAGCTGAAGATTATTAGCTCGTTTGATAATTTTTCAACGTACTTC



CGCGGGTTCTTTGAAAACCGGAAAAACATTTTTACCAAGAAACCGATCTCCACAAGTATTGCGTATCG



CATTGTTCATGATAACTTCCCGAAATTCCTTGATAACATTCGTTGTTTTAATGTGTGGCAGACGGAATG



CCCGCAACTAATCGTGAAAGCAGATAACTATCTGAAAAGCAAAAATGTTATAGCGAAAGATAAAAGT



TTGGCAAACTATTTTACCGTGGGCGCGTATGACTATTTCCTGTCTCAGAATGGTATAGATTTTTACAAC



AATATTATAGGTGGACTGCCAGCGTTCGCCGGCCATGAGAAAATCCAAGGTCTCAATGAATTCATCAA



TCAAGAGTGCCAAAAAGACAGCGAGCTGAAAAGTAAGCTGAAAAACCGTCACGCGTTCAAAATGGC



GGTACTGTTCAAACAGATACTCAGCGATCGTGAAAAAAGTTTTGTAATTGATGAGTTCGAGTCGGATG



CTCAAGTTATTGACGCCGTTAAAAACTTTTACGCCGAACAGTGCAAAGATAACAATGTTATTTTTAAC



TTATTAAATCTTATCAAGAATATCGCTTTCTTAAGTGATGACGAACTGGACGGCATATTCATTGAAGG



GAAATACCTGTCGAGCGTTAGTCAAAAACTCTATAGCGATTGGTCAAAATTACGTAACGACATTGAG



GATTCGGCTAACTCTAAACAAGGCAATAAAGAGCTGGCCAAGAAGATCAAAACCAACAAAGGGGAT



GTAGAAAAAGCGATCTCGAAATATGAGTTCTCGCTGTCGGAACTGAACTCGATTGTACATGATAACAC



CAAGTTTTCTGACCTCCTTAGTTGTACACTGCATAAGGTGGCTTCTGAGAAACTGGTGAAGGTCAATG



AAGGCGACTGGCCGAAACATCTCAAGAATAATGAAGAGAAACAAAAAATCAAAGAGCCGCTTGATG



CTCTGCTGGAGATCTATAATACACTTCTGATTTTTAACTGCAAAAGCTTCAATAAAAACGGCAACTTC



TATGTCGACTATGATCGTTGCATCAATGAACTGAGTTCGGTCGTGTATCTGTATAATAAAACACGTAA



CTATTGCACTAAAAAACCCTATAACACGGACAAGTTCAAACTCAATTTTAACAGTCCGCAGCTCGGTG



AAGGCTTTTCCAAGTCGAAAGAAAATGACTGTCTGACTCTTTTGTTTAAAAAAGACGACAACTATTAT



GTAGGCATTATCCGCAAAGGTGCAAAAATCAATTTTGATGATACACAAGCAATCGCCGATAACACCG



ACAATTGCATCTTTAAAATGAATTATTTCCTACTTAAAGACGCAAAAAAATTTATCCCGAAATGTAGC



ATTCAGCTGAAAGAAGTCAAGGCCCATTTTAAGAAATCTGAAGATGATTACATTTTGTCTGATAAAGA



GAAATTTGCTAGCCCGCTGGTCATTAAAAAGAGCACATTTTTGCTGGCAACTGCACATGTGAAAGGGA



AAAAAGGCAATATCAAGAAATTTCAGAAAGAATATTCGAAAGAAAACCCCACTGAGTATCGCAATTC



TTTAAACGAATGGATTGCTTTTTGTAAAGAGTTCTTAAAAACTTATAAAGCGGCTACCATTTTTGATAT



AACCACATTGAAAAAGGCAGAGGAATATGCTGATATTGTAGAATTCTACAAGGATGTCGATAATCTG



TGCTACAAACTGGAGTTCTGCCCGATTAAAACCTCGTTTATAGAAAACCTGATAGATAACGGCGACCT



GTATCTGTTTCGCATCAATAACAAAGACTTCAGCAGTAAATCGACCGGCACCAAGAACCTTCATACGT



TATATTTACAAGCTATATTCGATGAACGTAATCTGAACAATCCGACAATTATGCTGAATGGGGGAGCA



GAACTGTTCTATCGTAAAGAAAGTATTGAGCAGAAAAACCGTATCACACACAAAGCCGGTTCAATTC



TCGTGAATAAGGTGTGTAAAGACGGTACAAGCCTGGATGATAAGATACGTAATGAAATTTATCAATA



TGAGAATAAATTTATTGATACCCTGTCTGATGAAGCTAAAAAGGTGTTACCGAATGTCATTAAAAAGG



AAGCTACCCATGACATTACAAAAGATAAACGTTTCACTAGTGACAAATTCTTCTTTCACTGCCCCCTG



ACAATTAATTATAAGGAAGGCGATACCAAGCAGTTCAATAACGAAGTGCTGAGTTTTCTGCGTGGAA



ATCCTGACATCAACATTATCGGCATTGACCGCGGAGAGCGTAATTTAATCTATGTAACGGTTATAAAC



CAGAAAGGCGAGATTCTGGATTCGGTTTCATTCAATACCGTGACCAACAAGAGTTCAAAAATCGAGC



AGACAGTCGATTATGAAGAGAAATTGGCAGTCCGCGAGAAAGAGAGGATTGAAGCAAAACGTTCCTG



GGACTCTATCTCAAAAATTGCGACACTAAAGGAAGGTTATCTGAGCGCAATAGTTCACGAGATCTGTC



TGTTAATGATTAAACACAACGCGATCGTTGTCTTAGAGAATCTTAATGCAGGCTTTAAGCGTATTCGT



GGCGGTTTATCAGAAAAAAGTGTTTATCAAAAATTCGAAAAAATGTTGATTAACAAACTGAACTATTT



TGTCAGCAAGAAGGAATCCGACTGGAATAAACCGTCTGGTCTGCTGAATGGACTGCAGCTTTCGGATC



AGTTTGAAAGCTTCGAAAAACTGGGTATTCAGTCTGGTTTTATTTTTTACGTGCCGGCTGCATATACCT



CAAAGATTGATCCGACCACGGGCTTCGCCAATGTTCTGAATCTGTCGAAGGTACGCAATGTTGATGCG



ATCAAAAGCTTTTTTTCTAACTTCAACGAAATTAGTTATAGCAAGAAAGAAGCCCTTTTCAAATTCTC



ATTCGATCTGGATTCACTGAGTAAGAAAGGCTTTAGTAGCTTTGTGAAATTTAGTAAGAGTAAATGGA



ACGTCTACACCTTTGGAGAACGTATCATAAAGCCAAAGAATAAGCAAGGTTATCGGGAGGACAAAAG



AATCAACTTGACCTTCGAGATGAAGAAGTTACTTAACGAGTATAAGGTTTCTTTTGATCTTGAAAATA



ACTTGATTCCGAATCTCACGAGTGCCAACCTGAAGGATACTTTTTGGAAAGAGCTATTCTTTATCTTCA



AGACTACGCTGCAGCTCCGTAACAGCGTTACTAACGGTAAAGAAGATGTGCTCATCTCTCCGGTCAAA



AATGCGAAGGGTGAATTCTTCGTTTCGGGAACGCATAACAAGACTCTTCCGCAAGATTGCGATGCGA



ACGGTGCATACCATATTGCGTTGAAAGGTCTGATGATACTCGAACGTAACAACCTTGTACGTGAGGAG



AAAGATACGAAAAAGATTATGGCGATTTCAAACGTGGATTGGTTCGAGTACGTGCAGAAACGTAGAG



GCGTTCTGTAA





SEQ ID NO: 43
ATGAACAACTACGACGAATTCACCAAACTGTACCCGATCCAGAAAACCATCCGTTTCGAACTGAAAC



CGCAGGGTCGTACCATGGAACACCTGGAAACCTTCAACTTCTTCGAAGAAGACCGTGACCGTGCGGA



AAAATACAAAATCCTGAAAGAAGCGATCGACGAATACCACAAAAAATTCATCGACGAACACCTGACC



AACATGTCTCTGGACTGGAACTCTCTGAAACAGATCTCTGAAAAATACTACAAATCTCGTGAAGAAA



AAGACAAAAAAGTTTTCCTGTCTGAACAGAAACGTATGCGTCAGGAAATCGTTTCTGAATTCAAAAA



AGACGACCGTTTCAAAGACCTGTTCTCTAAAAAACTGTTCTCTGAACTGCTGAAAGAAGAAATCTACA



AAAAAGGTAACCACCAGGAAATCGACGCGCTGAAATCTTTCGACAAATTCTCTGGTTACTTCATCGGT



CTGCACGAAAACCGTAAAAACATGTACTCTGACGGTGACGAAATCACCGCGATCTCTAACCGTATCGT



TAACGAAAACTTCCCGAAATTCCTGGACAACCTGCAGAAATACCAGGAAGCGCGTAAAAAATACCCG



GAATGGATCATCAAAGCGGAATCTGCGCTGGTTGCGCACAACATCAAAATGGACGAAGTTTTCTCTCT



GGAATACTTCAACAAAGTTCTGAACCAGGAAGGTATCCAGCGTTACAACCTGGCGCTGGGTGGTTAC



GTTACCAAATCTGGTGAAAAAATGATGGGTCTGAACGACGCGCTGAACCTGGCGCACCAGTCTGAAA



AATCTTCTAAAGGTCGTATCCACATGACCCCGCTGTTCAAACAGATCCTGTCTGAAAAAGAATCTTTC



TCTTACATCCCGGACGTTTTCACCGAAGACTCTCAGCTGCTGCCGTCTATCGGTGGTTTCTTCGCGCAG



ATCGAAAACGACAAAGACGGTAACATCTTCGACCGTGCGCTGGAACTGATCTCTTCTTACGCGGAATA



CGACACCGAACGTATCTACATCCGTCAGGCGGACATCAACCGTGTTTCTAACGTTATCTTCGGTGAAT



GGGGTACCCTGGGTGGTCTGATGCGTGAATACAAAGCGGACTCTATCAACGACATCAACCTGGAACG



TACCTGCAAAAAAGTTGACAAATGGCTGGACTCTAAAGAATTCGCGCTGTCTGACGTTCTGGAAGCG



ATCAAACGTACCGGTAACAACGACGCGTTCAACGAATACATCTCTAAAATGCGTACCGCGCGTGAAA



AAATCGACGCGGCGCGTAAAGAAATGAAATTCATCTCTGAAAAAATCTCTGGTGACGAAGAATCTAT



CCACATCATCAAAACCCTGCTGGACTCTGTTCAGCAGTTCCTGCACTTCTTCAACCTGTTCAAAGCGCG



TCAGGACATCCCGCTGGACGGTGCGTTCTACGCGGAATTCGACGAAGTTCACTCTAAACTGTTCGCGA



TCGTTCCGCTGTACAACAAAGTTCGTAACTACCTGACCAAAAACAACCTGAACACCAAAAAAATCAA



ACTGAACTTCAAAAACCCGACCCTGGCGAACGGTTGGGACCAGAACAAAGTTTACGACTACGCGTCT



CTGATCTTCCTGCGTGACGGTAACTACTACCTGGGTATCATCAACCCGAAACGTAAAAAAAACATCAA



ATTCGAACAGGGTTCTGGTAACGGTCCGTTCTACCGTAAAATGGTTTACAAACAGATCCCGGGTCCGA



ACAAAAACCTGCCGCGTGTTTTCCTGACCTCTACCAAAGGTAAAAAAGAATACAAACCGTCTAAAGA



AATCATCGAAGGTTACGAAGCGGACAAACACATCCGTGGTGACAAATTCGACCTGGACTTCTGCCAC



AAACTGATCGACTTCTTCAAAGAATCTATCGAAAAACACAAAGACTGGTCTAAATTCAACTTCTACTT



CTCTCCGACCGAATCTTACGGTGACATCTCTGAATTCTACCTGGACGTTGAAAAACAGGGTTACCGTA



TGCACTTCGAAAACATCTCTGCGGAAACCATCGACGAATACGTTGAAAAAGGTGACCTGTTCCTGTTC



CAGATCTACAACAAAGACTTCGTTAAAGCGGCGACCGGTAAAAAAGACATGCACACCATCTACTGGA



ACGCGGCGTTCTCTCCGGAAAACCTGCAGGACGTTGTTGTTAAACTGAACGGTGAAGCGGAACTGTTC



TACCGTGACAAATCTGACATCAAAGAAATCGTTCACCGTGAAGGTGAAATCCTGGTTAACCGTACCTA



CAACGGTCGTACCCCGGTTCCGGACAAAATCCACAAAAAACTGACCGACTACCACAACGGTCGTACC



AAAGACCTGGGTGAAGCGAAAGAATACCTGGACAAAGTTCGTTACTTCAAAGCGCACTACGACATCA



CCAAAGACCGTCGTTACCTGAACGACAAAATCTACTTCCACGTTCCGCTGACCCTGAACTTCAAAGCG



AACGGTAAAAAAAACCTGAACAAAATGGTTATCGAAAAATTCCTGTCTGACGAAAAAGCGCACATCA



TCGGTATCGACCGTGGTGAACGTAACCTGCTGTACTACTCTATCATCGACCGTTCTGGTAAAATCATC



GACCAGCAGTCTCTGAACGTTATCGACGGTTTCGACTACCGTGAAAAACTGAACCAGCGTGAAATCG



AAATGAAAGACGCGCGTCAGTCTTGGAACGCGATCGGTAAAATCAAAGACCTGAAAGAAGGTTACCT



GTCTAAAGCGGTTCACGAAATCACCAAAATGGCGATCCAGTACAACGCGATCGTTGTTATGGAAGAA



CTGAACTACGGTTTCAAACGTGGTCGTTTCAAAGTTGAAAAACAGATCTACCAGAAATTCGAAAACAT



GCTGATCGACAAAATGAACTACCTGGTTTTCAAAGACGCGCCGGACGAATCTCCGGGTGGTGTTCTGA



ACGCGTACCAGCTGACCAACCCGCTGGAATCTTTCGCGAAACTGGGTAAACAGACCGGTATCCTGTTC



TACGTTCCGGCGGCGTACACCTCTAAAATCGACCCGACCACCGGTTTCGTTAACCTGTTCAACACCTC



TTCTAAAACCAACGCGCAGGAACGTAAAGAATTCCTGCAGAAATTCGAATCTATCTCTTACTCTGCGA



AAGACGGTGGTATCTTCGCGTTCGCGTTCGACTACCGTAAATTCGGTACCTCTAAAACCGACCACAAA



AACGTTTGGACCGCGTACACCAACGGTGAACGTATGCGTTACATCAAAGAAAAAAAACGTAACGAAC



TGTTCGACCCGTCTAAAGAAATCAAAGAAGCGCTGACCTCTTCTGGTATCAAATACGACGGTGGTCAG



AACATCCTGCCGGACATCCTGCGTTCTAACAACAACGGTCTGATCTACACCATGTACTCTTCTTTCATC



GCGGCGATCCAGATGCGTGTTTACGACGGTAAAGAAGACTACATCATCTCTCCGATCAAAAACTCTAA



AGGTGAATTCTTCCGTACCGACCCGAAACGTCGTGAACTGCCGATCGACGCGGACGCGAACGGTGCG



TACAACATCGCGCTGCGTGGTGAACTGACCATGCGTGCGATCGCGGAAAAATTCGACCCGGACTCTG



AAAAAATGGCGAAACTGGAACTGAAACACAAAGACTGGTTCGAATTCATGCAGACCCGTGGTGACTA



A





SEQ ID NO: 44
ATGACTAAAACATTTGATTCAGAGTTTTTTAATTTGTACTCGCTGCAAAAAACGGTACGCTTTGAGTTA



AAACCCGTGGGAGAAACCGCGTCATTTGTGGAAGACTTTAAAAACGAGGGCTTGAAACGTGTTGTGA



GCGAAGATGAAAGGCGAGCCGTCGATTACCAGAAAGTTAAGGAAATAATTGACGATTACCATCGGGA



TTTCATTGAAGAAAGTTTAAATTATTTTCCGGAACAGGTGAGTAAAGATGCTCTTGAGCAGGCGTTTC



ATCTTTATCAGAAACTGAAGGCAGCAAAAGTTGAGGAAAGGGAAAAAGCGCTGAAAGAATGGGAAG



CGCTGCAGAAAAAGCTACGTGAAAAAGTGGTGAAATGCTTCTCGGACTCGAATAAAGCCCGCTTCTC



AAGGATTGATAAAAAGGAACTGATTAAGGAAGACCTGATAAATTGGTTGGTCGCCCAGAATCGCGAG



GATGATATCCCTACGGTCGAAACGTTTAACAACTTCACCACATATTTTACCGGCTTCCATGAGAATCG



TAAAAATATTTACTCCAAAGATGATCACGCCACCGCTATTAGCTTTCGCCTTATTCATGAAAATCTTCC



AAAGTTTTTTGACAACGTGATTAGCTTCAATAAGTTGAAAGAGGGTTTCCCTGAATTAAAATTTGATA



AAGTGAAAGAGGATTTAGAAGTAGATTATGATCTGAAGCATGCGTTTGAAATAGAATATTTCGTTAAC



TTCGTGACCCAAGCGGGCATAGATCAGTATAATTATCTGTTAGGAGGGAAAACCCTGGAGGACGGGA



CGAAAAAACAAGGGATGAATGAGCAAATTAATCTGTTCAAACAACAGCAAACGCGAGATAAAGCGC



GTCAGATTCCCAAACTGATCCCCCTGTTCAAACAGATTCTTAGCGAAAGGACTGAAAGCCAGTCCTTT



ATTCCTAAACAATTTGAAAGTGATCAGGAGTTGTTCGATTCACTGCAGAAGTTACATAATAACTGCCA



GGATAAATTCACCGTGCTGCAACAAGCCATTCTCGGTCTGGCAGAGGCGGATCTTAAGAAGGTCTTCA



TCAAAACCTCTGATTTAAATGCCTTATCTAACACCATTTTCGGGAATTACAGCGTCTTTTCCGATGCAC



TGAACCTGTATAAAGAAAGCCTGAAAACGAAAAAAGCGCAGGAGGCTTTTGAGAAACTACCGGCCCA



TTCTATTCACGACCTCATTCAATACTTGGAACAGTTCAATTCCAGCCTGGACGCGGAAAAACAACAGA



GCACCGACACCGTCCTGAACTACTTCATCAAGACCGATGAATTATATTCTCGCTTCATTAAATCCACT



AGCGAGGCTTTCACTCAGGTGCAGCCTTTGTTCGAACTGGAAGCCCTGTCATCTAAGCGCCGCCCACC



GGAATCGGAAGATGAAGGGGCAAAAGGGCAGGAAGGCTTCGAGCAGATCAAGCGTATTAAAGCTTA



CCTGGATACGCTTATGGAAGCGGTACACTTTGCAAAGCCGTTGTATCTTGTTAAGGGTCGTAAAATGA



TCGAAGGGCTCGATAAAGACCAGTCCTTTTATGAAGCGTTTGAAATGGCGTACCAAGAACTTGAATCG



TTAATCATTCCTATCTATAACAAAGCGCGGAGCTATCTGTCGCGGAAACCTTTCAAGGCCGATAAATT



CAAGATTAATTTTGACAACAACACGCTACTGAGCGGATGGGATGCGAACAAGGAAACTGCTAACGCG



TCCATTCTGTTTAAGAAAGACGGGTTATATTACCTTGGAATTATGCCGAAAGGTAAGACCTTTCTCTTT



GACTACTTTGTATCGAGCGAGGATTCAGAGAAACTGAAACAGCGTCGCCAGAAGACCGCCGAAGAAG



CTCTGGCGCAGGATGGTGAAAGTTACTTCGAAAAAATTCGTTATAAACTGTTACCAGGGGCTTCAAAG



ATGTTACCGAAAGTCTTTTTTAGCAACAAAAATATTGGCTTTTACAACCCGTCGGATGACATTTTACGC



ATTCGCAACACAGCCTCTCACACCAAAAACGGGACCCCTCAGAAAGGCCACTCAAAAGTTGAGTTTA



ACCTGAATGATTGTCATAAGATGATTGATTTCTTCAAATCATCAATTCAGAAACACCCGGAATGGGGG



TCTTTTGGCTTTACGTTTTCTGATACCAGTGATTTTGAAGACATGAGTGCCTTCTACCGGGAAGTAGAA



AACCAGGGTTACGTAATTAGCTTTGACAAAATCAAAGAGACCTATATACAGAGCCAGGTGGAACAGG



GTAATCTCTACTTATTCCAGATTTATAACAAGGATTTCTCGCCCTACAGCAAAGGCAAACCAAACCTG



CATACTCTGTACTGGAAAGCCCTGTTTGAAGAAGCGAACCTGAATAACGTAGTGGCGAAGTTGAACG



GTGAAGCGGAAATCTTCTTCCGTCGTCACTCCATTAAGGCCTCTGATAAAGTTGTCCATCCGGCAAAT



CAGGCCATTGATAATAAGAATCCACACACGGAAAAAACGCAGTCAACCTTTGAATATGACCTCGTTA



AAGACAAACGCTACACGCAAGATAAGTTCTTTTTCCACGTCCCAATCAGCCTCAACTTTAAAGCACAA



GGGGTTTCAAAGTTTAATGATAAAGTCAATGGGTTCCTCAAGGGCAACCCGGATGTCAACATTATAGG



TATAGACAGGGGCGAACGCCATCTGCTTTACTTTACCGTAGTGAATCAGAAAGGTGAAATACTGGTTC



AGGAATCATTAAATACCTTGATGTCGGACAAAGGGCACGTTAATGATTACCAGCAGAAACTGGATAA



AAAAGAACAGGAACGTGATGCTGCGCGTAAATCGTGGACCACGGTTGAGAACATTAAAGAGCTGAA



AGAGGGGTATCTAAGCCATGTGGTACACAAACTGGCGCACCTCATCATTAAATATAACGCAATAGTCT



GCCTAGAAGACTTGAATTTTGGCTTTAAACGCGGCCGCTTCAAAGTGGAAAAACAAGTTTATCAAAA



ATTTGAAAAGGCGCTTATAGATAAACTGAATTATCTGGTTTTTAAAGAAAAGGAACTTGGTGAGGTAG



GGCACTACTTGACAGCTTATCAACTGACGGCCCCGTTCGAATCATTCAAAAAACTGGGCAAACAGTCT



GGCATTCTGTTTTACGTGCCGGCAGATTATACTTCAAAAATCGATCCAACAACTGGCTTTGTGAACTTC



CTGGACCTGAGATATCAGTCTGTAGAAAAAGCTAAACAACTTCTTAGCGATTTTAATGCCATTCGTTT



TAACAGCGTTCAGAATTACTTTGAATTCGAAATTGACTATAAAAAACTTACTCCGAAACGTAAAGTCG



GAACCCAAAGTAAATGGGTAATTTGTACGTATGGCGATGTCAGGTATCAGAACCGTCGGAATCAAAA



AGGTCATTGGGAGACCGAAGAAGTGAACGTGACCGAAAAGCTGAAGGCTCTGTTCGCCAGCGATTCA



AAAACTACAACTGTGATCGATTACGCAAATGATGATAACCTGATAGATGTGATTTTAGAGCAGGATA



AAGCCAGCTTTTTTAAAGAACTGTTGTGGCTCCTGAAACTTACGATGACCTTACGACATTCCAAGATC



AAATCGGAAGATGATTTTATTCTGTCACCGGTCAAGAATGAGCAGGGTGAATTCTATGATAGTAGGA



AAGCCGGCGAAGTGTGGCCGAAAGACGCCGACGCCAATGGCGCCTATCATATCGCGCTCAAAGGGCT



TTGGAATTTGCAGCAGATTAACCAGTGGGAAAAAGGTAAAACCCTGAATCTGGCTATCAAAAACCAG



GATTGGTTTAGCTTTATCCAAGAGAAACCGTATCAGGAATGA





SEQ ID NO: 45

ATGCATACAGGCGGTCTTCTTAGTATGGACGCGAAAGAGTTCACAGGTCAGTATCCGTTGTCGAAAAC



ATTACGATTCGAACTTCGGCCCATCGGCCGCACGTGGGATAACCTGGAGGCCTCAGGCTACTTAGCGG



AAGACCGCCATCGTGCCGAATGTTATCCTCGTGCGAAAGAGTTATTGGATGACAACCATCGTGCCTTC



CTGAATCGTGTGTTGCCACAAATCGATATGGATTGGCACCCGATTGCGGAGGCCTTTTGTAAGGTACA



TAAAAACCCTGGTAATAAAGAACTTGCCCAGGATTACAACCTTCAGTTGTCAAAGCGCCGTAAGGAG



ATCAGCGCATATCTTCAGGATGCAGATGGCTATAAAGGCCTGTTCGCGAAGCCCGCCTTAGACGAAG



CTATGAAAATTGCGAAAGAAAACGGGAACGAAAGTGATATTGAGGTTCTCGAAGCGTTTAACGGTTT



TAGCGTATACTTCACCGGTTATCATGAGTCACGCGAGAACATTTATAGCGATGAGGATATGGTGAGCG



TAGCCTACCGAATTACTGAGGATAATTTCCCGCGCTTTGTCTCAAACGCTTTGATCTTTGATAAATTAA



ACGAAAGCCATCCGGATATTATCTCTGAAGTATCGGGCAATCTTGGAGTTGATGACATTGGTAAGTAC



TTTGACGTGTCGAACTATAACAATTTTCTTTCCCAGGCCGGTATAGATGACTACAATCACATTATTGGC



GGCCATACAACCGAAGACGGACTGATACAAGCGTTTAATGTCGTATTGAACTTACGTCACCAAAAAG



ACCCTGGCTTTGAAAAAATTCAGTTCAAACAGCTCTACAAACAAATCCTGAGCGTGCGTACCAGCAA



AAGCTACATCCCGAAACAGTTTGACAACTCTAAGGAGATGGTTGACTGCATTTGCGATTATGTCAGCA



AAATAGAGAAATCCGAAACAGTAGAACGGGCCCTGAAACTAGTCCGTAATATCAGTTCTTTCGACTT



GCGCGGGATCTTTGTCAATAAAAAGAACTTGCGCATACTGAGCAACAAACTGATAGGAGATTGGGAC



GCGATCGAAACCGCATTGATGCATAGTTCTTCATCAGAAAACGATAAGAAAAGCGTATATGATAGCG



CGGAGGCTTTTACGTTGGATGACATCTTTTCAAGCGTGAAAAAATTTTCTGATGCCTCTGCCGAAGAT



ATTGGCAACAGGGCGGAAGACATCTGTAGAGTGATAAGTGAGACGGCCCCTTTTATCAACGATCTGC



GAGCGGTGGACCTGGATAGCCTGAACGACGATGGTTATGAAGCGGCCGTCTCAAAAATTCGGGAGTC



GCTGGAGCCTTATATGGATCTTTTCCATGAACTGGAAATTTTCTCGGTTGGCGATGAGTTCCCAAAAT



GCGCAGCATTTTACAGCGAACTGGAGGAAGTCAGCGAACAGCTGATCGAAATTATTCCGTTATTCAAC



AAGGCGCGTTCGTTCTGCACCCGGAAACGCTATAGCACCGATAAGATTAAAGTGAACTTAAAATTCCC



GACCTTGGCGGACGGGTGGGACCTGAACAAAGAGAGAGACAACAAAGCCGCGATTCTGCGGAAAGA



CGGTAAGTATTATCTGGCAATTCTGGATATGAAGAAAGATCTGTCAAGCATTAGGACCAGCGACGAA



GATGAATCCAGCTTCGAAAAGATGGAGTATAAACTGTTACCGAGTCCAGTAAAAATGCTGCCAAAGA



TATTCGTAAAATCGAAAGCCGCTAAGGAAAAATATGGCCTGACAGATCGTATGCTTGAATGCTACGA



TAAAGGTATGCATAAGTCGGGTAGTGCGTTTGATCTTGGCTTTTGCCATGAACTCATTGATTATTACAA



GCGTTGTATCGCGGAGTACCCAGGCTGGGATGTGTTCGATTTCAAGTTTCGCGAAACTTCCGATTATG



GGTCCATGAAAGAGTTCAATGAAGATGTGGCCGGAGCCGGTTACTATATGAGTCTGAGAAAAATTCC



GTGCAGCGAAGTGTACCGTCTGTTAGACGAGAAATCGATTTATCTATTTCAAATTTATAACAAAGATT



ACTCTGAAAATGCACATGGTAATAAGAACATGCATACCATGTACTGGGAGGGTCTCTTTTCCCCGCAA



AACCTGGAGTCGCCCGTTTTCAAGTTGTCGGGTGGGGCAGAACTTTTCTTTCGAAAATCCTCAATCCCT



AACGATGCCAAAACAGTACACCCGAAAGGCTCAGTGCTGGTTCCACGTAATGATGTTAACGGTCGGC



GTATTCCAGATTCAATCTACCGCGAACTGACACGCTATTTTAACCGTGGCGATTGCCGAATCAGTGAC



GAAGCCAAAAGTTATCTTGACAAGGTTAAGACTAAAAAAGCGGACCATGACATTGTGAAAGATCGCC



GCTTTACCGTGGATAAAATGATGTTCCACGTCCCGATTGCGATGAACTTTAAGGCGATCAGTAAACCG



AACTTAAACAAAAAAGTCATTGATGGCATCATTGATGATCAGGATCTGAAAATCATTGGTATTGATCG



TGGCGAGCGGAACTTAATTTACGTCACGATGGTTGACAGAAAAGGGAATATCTTATATCAGGATTCTC



TTAACATCCTCAATGGCTACGACTATCGTAAAGCTCTGGATGTGCGCGAATATGACAACAAGGAAGC



GCGTCGTAACTGGACTAAAGTGGAGGGCATTCGCAAAATGAAGGAAGGCTATCTGTCATTAGCGGTC



TCGAAATTAGCGGATATGATTATCGAAAATAACGCCATCATCGTTATGGAGGACCTGAACCACGGATT



CAAAGCGGGCCGCTCAAAGATTGAAAAACAAGTTTATCAGAAATTTGAGAGTATGCTGATTAACAAA



CTGGGCTATATGGTGTTAAAAGACAAGTCAATTGACCAATCAGGTGGCGCGCTGCATGGATACCAGC



TGGCGAACCATGTTACCACCTTAGCATCAGTTGGAAAGCAGTGTGGGGTTATCTTTTATATACCGGCA



GCGTTCACTAGTAAAATAGATCCGACCACTGGTTTCGCCGATCTCTTTGCCCTGAGTAACGTTAAAAA



CGTAGCGAGCATGCGTGAATTCTTTTCCAAAATGAAATCTGTCATTTATGATAAAGCTGAAGGCAAAT



TCGCATTCACCTTTGATTACTTGGATTACAACGTGAAGAGCGAATGTGGTCGTACGCTGTGGACCGTT



TACACCGTTGGTGAGCGCTTCACCTATTCCCGTGTGAACCGCGAATATGTACGTAAAGTCCCCACCGA



TATTATCTATGATGCCCTCCAGAAAGCAGGCATTAGCGTCGAAGGAGACTTAAGGGACAGAATTGCC



GAAAGCGATGGCGATACGCTGAAGTCTATTTTTTACGCATTCAAATACGCGCTAGATATGCGCGTTGA



GAATCGCGAGGAAGACTACATTCAATCACCTGTGAAAAATGCCTCTGGGGAATTTTTTTGTTCAAAAA



ATGCTGGTAAAAGCCTCCCACAAGATAGCGATGCAAACGGTGCATATAACATTGCCCTGAAAGGTAT



TCTTCAATTACGCATGCTGTCTGAGCAGTACGACCCCAACGCGGAATCTATTAGACTTCCGCTGATAA



CCAATAAAGCCTGGCTGACATTCATGCAGTCTGGCATGAAGACCTGGAAAAATTAG





SEQ ID NO: 46

ATGGATAGTTTAAAAGATTTTACGAATCTATATCCCGTAAGCAAAACTCTTCGTTTTGAACTGAAACC



TGTTGGAAAAACGTTGGAGAATATCGAGAAAGCGGGCATCCTGAAAGAAGACGAGCACCGTGCCGA



AAGCTACAGGCGTGTCAAAAAGATTATCGATACTTATCACAAAGTGTTCATTGATAGCAGTCTGGAGA



ACATGGCAAAAATGGGCATAGAAAATGAAATCAAAGCAATGCTGCAGAGCTTTTGCGAGCTCTACAA



GAAAGATCACCGAACGGAAGGTGAAGATAAAGCACTGGACAAAATTCGCGCCGTTCTTCGCGGTCTG



ATTGTTGGCGCGTTCACCGGCGTGTGCGGCCGCCGTGAAAACACCGTGCAGAACGAAAAGTACGAGT



CGCTGTTCAAAGAAAAACTGATAAAAGAAATTTTGCCTGACTTTGTGCTTTCGACCGAAGCGGAATCC



CTGCCATTTTCTGTCGAAGAAGCGACCCGCAGCCTGAAAGAATTTGACTCATTCACAAGTTACTTTGC



AGGCTTCTACGAAAACCGTAAAAACATCTACAGCACGAAGCCACAGAGCACGGCTATTGCTTATCGC



CTGATTCATGAGAACCTGCCGAAGTTCATCGATAACATCCTTGTTTTTCAAAAAATTAAAGAGCCGAT



TGCGAAAGAGTTAGAACATATTCGAGCTGACTTTTCTGCGGGTGGGTACATTAAAAAAGATGAGCGG



CTGGAAGACATCTTCAGTCTAAACTATTATATCCACGTTCTGTCGCAGGCAGGCATTGAGAAATATAA



TGCGCTGATTGGTAAGATTGTCACAGAAGGCGATGGTGAGATGAAAGGTCTTAATGAACATATCAAT



CTGTATAACCAGCAGCGTGGTCGCGAAGACCGTCTTCCACTGTTCCGCCCACTGTATAAACAGATCCT



GTCTGACCGGGAACAGCTGTCCTACCTGCCGGAAAGCTTTGAAAAGGATGAAGAGCTACTTCGCGCA



TTAAAGGAGTTTTACGACCATATTGCGGAAGACATTTTGGGTAGAACGCAGCAACTGATGACGTCAAT



TTCTGAATACGATCTGAGTAGAATCTACGTTAGGAATGATAGCCAGCTGACCGATATTAGCAAAAAA



ATGCTGGGCGACTGGAACGCTATCTATATGGCACGTGAACGTGCATATGATCATGAACAAGCACCGA



AACGTATAACCGCGAAATATGAGCGTGATCGCATTAAGGCGCTAAAGGGAGAAGAAAGCATCTCACT



CGCAAACCTGAACTCCTGTATCGCTTTCTTAGATAACGTGCGCGATTGTCGCGTCGACACGTATCTGTC



AACCCTTGGGCAGAAAGAGGGTCCACATGGTCTGTCTAACCTGGTGGAAAATGTCTTTGCGAGTTACC



ATGAAGCGGAACAACTGCTGTCTTTTCCATACCCCGAAGAAAACAATCTAATACAGGATAAAGATAA



CGTGGTGTTAATCAAAAACCTGCTGGACAACATCAGCGATCTGCAACGTTTCCTGAAACCTTTGTGGG



GTATGGGTGACGAGCCAGACAAAGACGAACGTTTTTATGGTGAGTATAATTATATACGTGGCGCCCTT



GACCAAGTTATTCCGCTGTATAACAAAGTACGGAACTATCTGACCCGTAAGCCATATTCTACCCGTAA



AGTGAAACTGAACTTTGGCAACTCGCAACTGCTGTCGGGTTGGGATCGTAACAAAGAAAAAGATAAT



AGTTGTGTTATCCTGCGTAAGGGACAAAATTTTTACCTCGCGATTATGAACAACAGACACAAGCGTTC



ATTTGAAAATAAGGTTCTGCCGGAGTATAAAGAGGGCGAACCGTACTTCGAGAAAATGGATTATAAG



TTCTTACCAGACCCTAATAAGATGTTACCGAAAGTCTTTCTTTCGAAAAAAGGCATAGAAATCTATAA



GCCGTCCCCGAAATTACTCGAACAGTATGGGCACGGGACCCACAAGAAAGGGGATACTTTTAGCATG



GACGATCTGCACGAACTGATCGATTTTTTTAAACACTCCATCGAAGCCCATGAAGACTGGAAACAGTT



TGGGTTCAAGTTCTCTGATACAGCCACATACGAGAATGTGTCTAGTTTTTATCGGGAAGTGGAGGATC



AGGGCTACAAACTTAGTTTTCGTAAAGTTTCAGAGAGTTATGTTTATAGTTTAATTGATCAGGGAAAA



CTTTACCTGTTCCAGATCTACAACAAAGATTTCTCGCCATGTAGTAAGGGTACCCCGAATCTGCATAC



ACTCTATTGGAGAATGTTATTCGATGAGCGTAACTTAGCGGATGTCATTTATAAATTGGACGGGAAAG



CAGAGATCTTTTTTCGTGAAAAATCACTGAAGAATGACCACCCGACTCATCCGGCCGGGAAACCGATC



AAAAAAAAATCCCGCCAGAAAAAAGGAGAAGAGTCTCTGTTTGAATATGATCTGGTGAAAGACCGTC



ATTACACTATGGATAAATTTCAATTTCATGTTCCAATTACAATGAACTTCAAATGTTCGGCGGGTTCCA



AAGTAAATGATATGGTAAACGCCCATATTCGCGAAGCGAAAGATATGCATGTTATTGGCATCGATAG



AGGCGAAAGAAACCTGCTTTATATTTGCGTAATTGACAGCCGTGGTACCATTCTGGACCAGATCTCTT



TAAACACCATCAATGACATCGATTATCACGACCTGTTGGAGTCTCGGGACAAGGACCGCCAGCAGGA



GCGCCGTAATTGGCAGACAATTGAAGGCATAAAAGAATTAAAACAGGGTTACCTTTCCCAGGCCGTA



CACCGCATAGCGGAACTGATGGTGGCCTACAAAGCCGTAGTTGCCCTGGAAGACTTGAATATGGGGT



TTAAACGTGGCCGTCAAAAAGTCGAGAGCAGCGTGTATCAGCAATTTGAAAAACAGTTGATTGACAA



GTTGAATTATTTGGTTGATAAAAAGAAACGTCCAGAAGATATTGGTGGCTTACTGCGTGCATACCAGT



TTACGGCACCTTTTAAGTCCTTCAAAGAAATGGGTAAACAGAACGGGTTTCTGTTTTACATCCCGGCC



TGGAATACATCCAACATCGATCCTACCACCGGGTTTGTCAACCTGTTTCATGCACAATATGAAAACGT



GGATAAAGCGAAGAGTTTTTTCCAAAAATTCGATAGTATTTCGTATAACCCAAAAAAAGATTGGTTTG



AGTTTGCGTTCGATTATAAAAATTTTACTAAAAAGGCTGAGGGATCCCGCAGTATGTGGATCCTCTGC



ACCCATGGCAGTCGTATTAAAAATTTTCGTAATTCGCAAAAGAATGGCCAGTGGGACTCGGAAGAGT



TTGCCCTGACCGAAGCGTTCAAATCGCTGTTTGTACGCTACGAAATTGACTACACAGCAGATCTGAAA



ACAGCCATCGTCGATGAAAAACAGAAAGATTTTTTTGTAGATCTCCTAAAACTGTTCAAACTGACTGT



TCAGATGCGCAATTCCTGGAAAGAGAAAGACCTGGATTATCTGATTAGCCCGGTAGCCGGTGCTGAT



GGACGATTTTTCGATACTCGTGAAGGTAACAAAAGTCTCCCGAAAGATGCTGATGCCAATGGTGCATA



CAATATTGCATTAAAGGGGCTATGGGCCTTGCGACAGATCCGCCAGACCAGCGAAGGCGGCAAGCTG



AAATTGGCCATATCGAATAAGGAATGGTTACAATTTGTTCAGGAACGTAGCTATGAAAAAGATTGA





SEQ ID NO: 47

ATGAACAACGGCACAAATAATTTTCAGAACTTCATCGGGATCTCAAGTTTGCAGAAAACGCTGCGCA



ATGCTCTGATCCCCACGGAAACCACGCAACAGTTCATCGTCAAGAACGGAATAATTAAAGAAGATGA



GTTACGTGGCGAGAACCGCCAGATTCTGAAAGATATCATGGATGACTACTACCGCGGATTCATCTCTG



AGACTCTGAGTTCTATTGATGACATAGATTGGACTAGCCTGTTCGAAAAAATGGAAATTCAGCTGAAA



AATGGTGATAATAAAGATACCTTAATTAAGGAACAGACAGAGTATCGGAAAGCAATCCATAAAAAAT



TTGCGAACGACGATCGGTTTAAGAACATGTTTAGCGCCAAACTGATTAGTGACATATTACCTGAATTT



GTCATCCACAACAATAATTATTCGGCATCAGAGAAAGAGGAAAAAACCCAGGTGATAAAATTGTTTT



CGCGCTTTGCGACTAGCTTTAAAGATTACTTCAAGAACCGTGCAAATTGCTTTTCAGCGGACGATATT



TCATCAAGCAGCTGCCATCGCATCGTCAACGACAATGCAGAGATATTCTTTTCAAATGCGCTGGTCTA



CCGCCGGATCGTAAAATCGCTGAGCAATGACGATATCAACAAAATTTCGGGCGATATGAAAGATTCA



TTAAAAGAAATGAGTCTGGAAGAAATATATTCTTACGAGAAGTATGGGGAATTTATTACCCAGGAAG



GCATTAGCTTCTATAATGATATCTGTGGGAAAGTGAATTCTTTTATGAACCTGTATTGTCAGAAAAAT



AAAGAAAACAAAAATTTATACAAACTTCAGAAACTTCACAAACAGATTCTATGCATTGCGGACACTA



GCTATGAGGTCCCGTATAAATTTGAAAGTGACGAGGAAGTGTACCAATCAGTTAACGGCTTCCTTGAT



AACATTAGCAGCAAACATATAGTCGAAAGATTACGCAAAATCGGCGATAACTATAACGGCTACAACC



TGGATAAAATTTATATCGTGTCCAAATTTTACGAGAGCGTTAGCCAAAAAACCTACCGCGACTGGGAA



ACAATTAATACCGCCCTCGAAATTCATTACAATAATATCTTGCCGGGTAACGGTAAAAGTAAAGCCGA



CAAAGTAAAAAAAGCGGTTAAGAATGATTTACAGAAATCCATCACCGAAATAAATGAACTAGTGTCA



AACTATAAGCTGTGCAGTGACGACAACATCAAAGCGGAGACTTATATACATGAGATTAGCCATATCTT



GAATAACTTTGAAGCACAGGAATTGAAATACAATCCGGAAATTCACCTAGTTGAATCCGAGCTCAAA



GCGAGTGAGCTTAAAAACGTGCTGGACGTGATCATGAATGCGTTTCATTGGTGTTCGGTTTTTATGAC



TGAGGAACTTGTTGATAAAGACAACAATTTTTATGCGGAACTGGAGGAGATTTACGATGAAATTTATC



CAGTAATTAGTCTGTACAACCTGGTTCGTAACTACGTTACCCAGAAACCGTACAGCACGAAAAAGATT



AAATTGAACTTTGGAATACCGACGTTAGCAGACGGTTGGTCAAAGTCCAAAGAGTATTCTAATAACG



CTATCATACTGATGCGCGACAATCTGTATTATCTGGGCATCTTTAATGCGAAGAATAAACCGGACAAG



AAGATTATCGAGGGTAATACGTCAGAAAATAAGGGTGACTACAAAAAGATGATTTATAATTTGCTCC



CGGGTCCCAACAAAATGATCCCGAAAGTTTTCTTGAGCAGCAAGACGGGGGTGGAAACGTATAAACC



GAGCGCCTATATCCTAGAGGGGTATAAACAGAATAAACATATCAAGTCTTCAAAAGACTTTGATATC



ACTTTCTGTCATGATCTGATCGACTACTTCAAAAACTGTATTGCAATTCATCCCGAGTGGAAAAACTTC



GGTTTTGATTTTAGCGACACCAGTACTTATGAAGACATTTCCGGGTTTTATCGTGAGGTAGAGTTACA



AGGTTACAAGATTGATTGGACATACATTAGCGAAAAAGACATTGATCTGCTGCAGGAAAAAGGTCAA



CTGTATCTGTTCCAGATATATAACAAAGATTTTTCGAAAAAATCAACCGGGAATGACAACCTTCACAC



CATGTACCTGAAAAATCTTTTCTCAGAAGAAAATCTTAAGGATATCGTCCTGAAACTTAACGGCGAAG



CGGAAATCTTCTTCAGGAAGAGCAGCATAAAGAACCCAATCATTCATAAAAAAGGCTCGATTTTAGT



CAACCGTACCTACGAAGCAGAAGAAAAAGACCAGTTTGGCAACATTCAAATTGTGCGTAAAAATATT



CCGGAAAACATTTATCAGGAGCTGTACAAATACTTCAACGATAAAAGCGACAAAGAGCTGTCTGATG



AAGCAGCCAAACTGAAGAATGTAGTGGGACACCACGAGGCAGCGACGAATATAGTCAAGGACTATC



GCTACACGTATGATAAATACTTCCTTCATATGCCTATTACGATCAATTTCAAAGCCAATAAAACGGGT



TTTATTAATGATAGGATCTTACAGTATATCGCTAAAGAAAAAGACTTACATGTGATCGGCATTGATCG



GGGCGAGCGTAACCTGATCTACGTGTCCGTGATTGATACTTGTGGTAATATAGTTGAACAGAAAAGCT



TTAACATTGTAAACGGCTACGACTATCAGATAAAACTGAAACAACAGGAGGGCGCTAGACAGATTGC



GCGGAAAGAATGGAAAGAAATTGGTAAAATTAAAGAGATCAAAGAGGGCTACCTGAGCTTAGTAAT



CCACGAGATCTCTAAAATGGTAATCAAATACAATGCAATTATAGCGATGGAGGATTTGTCTTATGGTT



TTAAAAAAGGGCGCTTTAAGGTCGAACGGCAAGTTTACCAGAAATTTGAAACCATGCTCATCAATAA



ACTCAACTATCTGGTATTTAAAGATATTTCGATTACCGAGAATGGCGGTCTCCTGAAAGGTTATCAGC



TGACATACATTCCTGATAAACTTAAAAACGTGGGTCATCAGTGCGGCTGCATTTTTTATGTGCCTGCTG



CATACACGAGCAAAATTGATCCGACCACCGGCTTTGTGAATATCTTTAAATTTAAAGACCTGACAGTG



GACGCAAAACGTGAATTCATTAAAAAATTTGACTCAATTCGTTATGACAGTGAAAAAAATCTGTTCTG



CTTTACATTTGACTACAATAACTTTATTACGCAAAACACGGTCATGAGCAAATCATCGTGGAGTGTGT



ATACATACGGCGTGCGCATCAAACGTCGCTTTGTGAACGGCCGCTTCTCAAACGAAAGTGATACCATT



GACATAACCAAAGATATGGAGAAAACGTTGGAAATGACGGACATTAACTGGCGCGATGGCCACGATC



TTCGTCAAGACATTATAGATTATGAAATTGTTCAGCACATATTCGAAATTTTCCGTTTAACAGTGCAA



ATGCGTAACTCCTTGTCTGAACTGGAGGACCGTGATTACGATCGTCTCATTTCACCTGTACTGAACGA



AAATAACATTTTTTATGACAGCGCGAAAGCGGGGGATGCACTTCCTAAGGATGCCGATGCAAATGGT



GCGTATTGTATTGCATTAAAAGGGTTATATGAAATTAAACAAATTACCGAAAATTGGAAAGAAGATG



GTAAATTTTCGCGCGATAAACTCAAAATCAGCAATAAAGATTGGTTCGACTTTATCCAGAATAAGCGC



TATCTCTAA





SEQ ID NO: 48

ATGACCAATAAATTCACTAACCAGTATTCTCTCTCTAAGACCCTGCGCTTTGAACTGATTCCGCAGGG



GAAAACCTTGGAGTTCATTCAAGAAAAAGGCCTCTTGTCTCAGGATAAACAGAGGGCTGAATCTTAC



CAAGAAATGAAGAAAACTATTGATAAGTTTCATAAATATTTCATTGATTTAGCCTTGTCTAACGCCAA



ATTAACTCACTTGGAAACGTATCTGGAGTTATACAACAAATCTGCCGAAACTAAGAAAGAACAGAAA



TTTAAAGACGATTTGAAAAAAGTACAGGACAATCTGCGTAAAGAAATTGTCAAATCCTTCAGTGACG



GCGATGCTAAAAGCATTTTTGCCATTCTGGACAAAAAAGAGTTGATTACTGTGGAATTAGAAAAGTG



GTTTGAAAACAATGAGCAGAAAGACATCTACTTCGATGAGAAATTCAAAACTTTCACCACCTATTTTA



CAGGATTTCATCAAAACCGGAAGAACATGTACTCAGTAGAACCGAACTCCACGGCCATTGCGTATCG



TTTGATCCATGAGAATCTGCCTAAATTTCTGGAGAATGCGAAAGCCTTTGAAAAGATTAAGCAGGTCG



AATCGCTGCAAGTGAATTTTCGTGAACTCATGGGCGAATTTGGTGACGAAGGTCTAATCTTCGTTAAC



GAACTGGAAGAAATGTTTCAGATTAATTACTACAATGACGTGCTATCGCAGAACGGTATCACAATCTA



CAATAGTATTATCTCAGGGTTCACAAAAAACGATATAAAATACAAAGGCCTGAACGAGTATATCAAT



AACTACAACCAAACAAAGGACAAAAAGGATAGGCTTCCGAAACTGAAGCAGTTATACAAACAGATTT



TATCTGACAGAATCTCCCTGAGCTTTCTGCCGGATGCTTTCACTGATGGGAAGCAGGTTCTGAAAGCG



ATTTTCGATTTTTATAAGATTAACTTACTGAGCTACACGATTGAAGGTCAAGAAGAATCTCAAAACTT



ACTGCTCTTGATCCGTCAAACCATTGAAAATCTATCATCGTTCGATACGCAGAAAATCTACCTCAAAA



ACGATACTCACCTGACTACGATCTCTCAGCAGGTTTTCGGGGATTTTAGTGTATTTTCAACAGCTCTGA



ACTACTGGTATGAAACCAAAGTCAATCCGAAATTCGAGACGGAATATTCTAAGGCCAACGAAAAAAA



ACGTGAGATTCTTGATAAAGCTAAAGCCGTATTTACTAAACAGGATTACTTTTCTATTGCTTTCCTGCA



GGAAGTTTTATCGGAGTATATCCTGACCCTGGATCATACATCTGATATCGTTAAAAAACACAGCAGCA



ATTGCATCGCTGACTATTTCAAAAACCACTTTGTCGCCAAAAAAGAAAACGAAACAGACAAGACTTT



CGATTTCATTGCTAACATCACCGCAAAATACCAGTGTATTCAGGGTATCTTGGAAAACGCCGACCAAT



ACGAAGACGAACTGAAACAAGATCAGAAGCTGATCGATAATTTAAAATTCTTCTTAGATGCAATCCT



GGAGCTGCTGCACTTCATCAAACCGCTTCATTTAAAGAGCGAGTCCATTACCGAAAAGGACACCGCCT



TCTATGACGTTTTTGAAAATTATTATGAAGCCCTCTCCTTGCTGACTCCGCTGTATAATATGGTACGCA



ATTACGTAACCCAGAAACCATATTCTACCGAAAAAATTAAACTGAACTTTGAAAACGCACAGCTGCTC



AACGGTTGGGACGCGAATAAAGAAGGTGACTACCTCACCACCATCCTGAAAAAAGATGGTAACTATT



TTCTGGCAATTATGGATAAGAAACATAATAAAGCATTCCAGAAATTTCCTGAAGGGAAAGAAAATTA



CGAAAAGATGGTGTACAAACTCTTACCTGGAGTTAACAAAATGTTGCCGAAAGTATTTTTTAGTAATA



AGAACATCGCGTACTTTAACCCGTCCAAAGAACTGCTGGAAAATTATAAAAAGGAGACGCATAAGAA



AGGGGATACCTTTAACCTGGAACATTGCCATACCTTAATAGACTTCTTCAAGGATTCCCTGAATAAAC



ACGAGGATTGGAAATATTTCGATTTTCAGTTTAGTGAGACCAAGTCATACCAGGATCTTAGCGGCTTT



TATCGCGAAGTAGAACACCAAGGCTATAAAATTAACTTCAAAAACATCGACAGCGAATACATCGACG



GTTTAGTTAACGAGGGCAAACTGTTTCTGTTCCAGATCTATTCAAAGGATTTTAGCCCGTTCTCTAAAG



GCAAACCAAATATGCATACGTTGTACTGGAAAGCACTGTTTGAAGAGCAAAACCTGCAGAATGTGAT



TTATAAACTGAACGGCCAAGCTGAGATTTTTTTCCGTAAAGCCTCGATTAAACCGAAAAATATCATCC



TTCATAAGAAGAAAATAAAGATCGCTAAAAAACACTTCATAGATAAAAAAACCAAAACCTCCGAAAT



AGTGCCTGTTCAAACAATTAAGAACTTGAATATGTACTACCAGGGCAAGATATCGGAAAAGGAGTTG



ACTCAAGACGATCTTCGCTATATCGATAACTTTTCGATTTTTAACGAAAAAAACAAGACGATCGACAT



CATCAAAGATAAACGCTTCACTGTAGATAAGTTCCAGTTTCATGTGCCGATTACTATGAACTTCAAAG



CTACCGGGGGTAGCTATATCAACCAAACGGTGTTGGAATACCTGCAGAATAACCCGGAAGTCAAAAT



CATTGGGCTGGACCGCGGAGAACGTCACCTTGTGTACTTGACCTTAATCGATCAGCAAGGCAACATCT



TAAAACAAGAATCGCTGAATACCATTACGGATTCAAAGATTAGCACCCCGTATCATAAGCTGCTCGAT



AACAAGGAGAATGAGCGCGACCTGGCCCGTAAAAACTGGGGCACGGTGGAAAACATTAAGGAGTTA



AAGGAGGGTTATATTTCCCAGGTAGTGCATAAGATCGCCACTCTCATGCTCGAGGAAAATGCGATCGT



TGTCATGGAAGACTTAAACTTCGGATTTAAACGTGGGCGATTTAAAGTAGAGAAACAAATCTACCAG



AAGTTAGAAAAAATGCTGATTGACAAATTAAATTACTTGGTCCTAAAAGACAAACAGCCGCAAGAAT



TGGGTGGATTATACAACGCCCTCCAACTTACCAATAAATTCGAAAGTTTTCAGAAAATGGGTAAACAG



TCAGGCTTTCTTTTTTATGTTCCTGCGTGGAACACATCCAAAATCGACCCTACAACCGGCTTCGTCAAT



TACTTCTATACTAAATATGAAAACGTCGACAAAGCAAAAGCATTCTTTGAAAAGTTCGAAGCAATAC



GTTTTAACGCTGAGAAAAAATATTTCGAGTTCGAAGTCAAGAAATACTCAGACTTTAACCCCAAAGCT



GAGGGCACACAGCAAGCGTGGACAATCTGCACCTACGGCGAGCGCATCGAAACGAAGCGTCAAAAA



GATCAGAATAACAAATTTGTTTCAACACCTATCAACCTGACCGAGAAGATTGAAGACTTCTTAGGTAA



AAATCAGATTGTTTATGGCGACGGTAACTGTATAAAATCTCAAATAGCCTCAAAGGATGATAAAGCA



TTTTTCGAAACATTATTATATTGGTTCAAAATGACACTGCAGATGCGCAATAGTGAGACGCGTACAGA



TATTGATTATCTTATCAGCCCGGTCATGAACGACAACGGTACTTTTTACAACTCCAGAGACTATGAAA



AACTTGAGAATCCAACTCTCCCCAAAGATGCTGATGCGAACGGTGCTTATCACATCGCGAAAAAAGG



TCTGATGCTGCTGAACAAAATCGACCAAGCCGATCTGACTAAGAAAGTTGACCTAAGCATTTCAAATC



GGGACTGGTTACAGTTTGTTCAAAAGAACAAATGA





SEQ ID NO: 49

ATGGAACAGGAATATTATCTGGGCTTGGACATGGGCACCGGTTCCGTCGGCTGGGCTGTTACTGACAG



TGAATATCACGTTCTAAGAAAGCATGGTAAGGCATTGTGGGGTGTAAGACTTTTCGAATCTGCTTCCA



CTGCTGAAGAGCGTAGAATGTTTAGAACGAGTCGACGTAGGCTAGACAGGCGCAATTGGAGAATCGA



AATTTTACAAGAAATTTTTGCGGAAGAGATATCTAAGAAAGACCCAGGCTTTTTCCTGAGAATGAAGG



AATCTAAGTATTACCCTGAGGATAAAAGAGATATAAATGGTAACTGTCCCGAATTGCCTTACGCATTA



TTTGTGGACGATGATTTTACCGATAAGGATTACCATAAAAAGTTCCCAACTATCTACCATTTACGCAA



AATGTTAATGAATACAGAGGAAACCCCAGACATAAGACTAGTTTATCTGGCAATACACCATATGATG



AAACATAGAGGCCATTTCTTACTTTCCGGGGATATCAACGAAATCAAAGAGTTTGGTACCACATTTAG



TAAGTTACTGGAAAACATAAAGAATGAAGAATTGGATTGGAACTTAGAACTCGGAAAAGAAGAATAC



GCGGTTGTCGAATCTATCCTGAAGGATAATATGCTGAATAGGTCGACCAAAAAAACTAGGCTGATCA



AAGCACTGAAAGCCAAATCTATCTGCGAAAAAGCTGTTTTAAATTTACTTGCTGGTGGCACTGTTAAG



TTATCAGACATTTTTGGTTTGGAAGAATTGAACGAAACCGAGCGTCCAAAAATTAGTTTCGCTGATAA



TGGCTACGATGATTACATTGGTGAGGTGGAAAACGAGTTGGGCGAACAATTTTATATTATAGAGACA



GCTAAGGCAGTCTATGACTGGGCTGTTTTAGTAGAAATCCTTGGTAAATACACATCTATCTCCGAAGC



GAAAGTTGCTACTTACGAAAAGCACAAGTCCGATCTCCAGTTTTTGAAGAAAATTGTCAGGAAATATC



TGACTAAGGAAGAATATAAAGATATTTTCGTTAGTACCTCTGACAAACTGAAAAATTACTCCGCTTAC



ATCGGGATGACCAAGATTAATGGCAAAAAAGTTGATCTGCAAAGCAAAAGGTGTTCGAAGGAAGAAT



TTTATGATTTCATTAAAAAGAATGTCTTAAAAAAATTAGAAGGTCAGCCAGAATACGAATATTTGAAA



GAAGAACTGGAAAGAGAGACATTCTTACCAAAACAAGTCAACAGAGATAATGGGGTAATTCCATATC



AAATTCACCTCTACGAATTAAAAAAAATTTTAGGCAATTTACGCGATAAAATTGACCTTATCAAAGAA



AATGAGGATAAGCTGGTTCAACTCTTTGAATTCAGAATACCCTATTATGTGGGCCCACTGAACAAGAT



TGATGACGGCAAAGAAGGTAAATTCACATGGGCCGTCCGCAAATCCAATGAAAAAATTTACCCATGG



AACTTTGAAAATGTAGTAGATATTGAAGCGTCTGCGGAGAAATTTATTCGAAGAATGACTAATAAAT



GCACTTACTTGATGGGAGAGGATGTTCTGCCTAAAGACAGCTTATTATACAGCAAGTACATGGTTCTA



AACGAACTTAACAACGTTAAGTTGGACGGTGAGAAATTAAGTGTAGAATTGAAACAAAGATTGTATA



CTGACGTCTTCTGCAAGTACAGAAAAGTGACAGTTAAAAAAATTAAGAATTACTTGAAGTGCGAAGG



TATAATTTCTGGAAACGTAGAGATTACTGGTATTGATGGTGATTTCAAAGCATCCCTAACAGCTTACC



ACGATTTCAAGGAAATCCTGACAGGAACTGAACTCGCAAAAAAAGATAAAGAAAACATTATTACTAA



TATTGTTCTTTTCGGTGATGACAAGAAATTGTTGAAGAAAAGACTGAATAGACTTTACCCCCAGATTA



CTCCCAATCAACTTAAGAAAATTTGTGCTTTGTCTTACACAGGATGGGGTCGTTTTTCAAAAAAGTTCT



TAGAAGAGATTACCGCACCTGATCCAGAAACAGGCGAAGTATGGAATATAATTACCGCCTTATGGGA



ATCGAACAATAATCTTATGCAACTTCTGAGCAATGAATATCGTTTCATGGAAGAAGTTGAGACTTACA



ACATGGGCAAACAGACGAAGACTTTATCCTATGAAACTGTGGAAAATATGTATGTATCACCTTCTGTC



AAGAGACAAATTTGGCAAACCTTAAAAATTGTCAAAGAATTAGAAAAGGTAATGAAGGAGTCTCCTA



AACGTGTGTTTATTGAAATGGCTAGAGAAAAACAAGAGTCAAAAAGAACCGAGTCAAGAAAGAAGC



AGTTAATCGATTTATATAAGGCTTGTAAAAACGAAGAGAAAGATTGGGTTAAAGAATTGGGGGACCA



AGAGGAACAAAAACTACGGTCGGATAAGTTGTATTTATACTATACGCAAAAGGGACGATGTATGTAT



TCCGGCGAGGTAATAGAATTGAAGGATTTATGGGACAATACAAAATATGACATAGACCATATATATC



CCCAATCAAAAACGATGGACGATAGCTTGAACAATAGAGTACTCGTGAAAAAAAAATATAATGCGAC



CAAATCTGATAAGTATCCTCTGAATGAAAATATCAGACATGAAAGAAAGGGGTTCTGGAAGTCCTTG



TTAGATGGTGGGTTTATAAGCAAAGAAAAGTACGAGCGTCTAATAAGAAACACGGAGTTATCGCCAG



AAGAACTCGCTGGTTTTATTGAGAGGCAAATCGTGGAAACGAGACAATCTACCAAAGCCGTTGCTGA



GATCCTAAAGCAAGTTTTCCCAGAGTCGGAGATTGTCTATGTCAAAGCTGGCACAGTGAGCAGGTTTA



GGAAAGACTTCGAACTATTAAAGGTAAGAGAAGTGAACGATTTACATCACGCAAAGGACGCTTACCT



AAATATCGTTGTAGGTAACTCATATTATGTTAAATTTACCAAGAACGCCTCTTGGTTTATAAAGGAGA



ACCCAGGTAGAACATATAACCTGAAAAAGATGTTCACCTCTGGTTGGAATATTGAGAGAAACGGAGA



AGTCGCATGGGAAGTTGGTAAGAAAGGGACTATAGTGACAGTAAAGCAAATTATGAACAAAAATAAT



ATCCTCGTTACAAGGCAGGTTCATGAAGCAAAGGGCGGCCTTTTTGACCAACAAATTATGAAGAAAG



GGAAAGGTCAAATTGCAATAAAAGAAACCGATGAGAGACTAGCGTCAATAGAAAAGTATGGTGGCT



ATAATAAAGCTGCGGGTGCATACTTTATGCTTGTTGAATCAAAAGACAAGAAAGGTAAGACTATTAG



AACTATAGAATTTATACCCCTGTACCTTAAAAACAAAATTGAATCGGATGAGTCAATCGCGTTAAATT



TTCTAGAGAAAGGAAGGGGTTTAAAAGAACCAAAGATCCTGTTAAAAAAGATTAAGATTGACACCTT



GTTCGATGTAGATGGATTTAAAATGTGGTTATCTGGCAGAACAGGCGATAGACTTTTGTTTAAGTGCG



CTAATCAATTAATTTTGGATGAGAAAATCATTGTCACAATGAAAAAAATAGTTAAGTTTATTCAGAGA



AGACAAGAAAACAGGGAGTTGAAATTATCTGATAAAGATGGTATCGACAATGAAGTTTTAATGGAAA



TCTACAATACATTCGTTGATAAACTTGAAAATACCGTATATCGAATCAGGTTAAGTGAACAAGCCAAA



ACATTAATTGATAAACAAAAAGAATTTGAAAGGCTATCACTGGAAGACAAATCCTCCACCCTATTTGA



AATTTTGCATATATTCCAGTGCCAATCTTCAGCAGCTAATTTAAAAATGATTGGCGGACCTGGGAAAG



CCGGCATCCTAGTGATGAACAATAATATCTCCAAGTGTAACAAAATATCAATTATTAACCAATCTCCG



ACAGGTATTTTTGAAAATGAAATAGACTTGCTTAAGATATAA





SEQ ID NO: 50

ATGTCTTTCGACTCTTTCACCAACCTGTACTCTCTGTCTAAAACCCTGAAATTCGAAATGCGTCCGGTT



GGTAACACCCAGAAAATGCTGGACAACGCGGGTGTTTTCGAAAAAGACAAACTGATCCAGAAAAAAT



ACGGTAAAACCAAACCGTACTTCGACCGTCTGCACCGTGAATTCATCGAAGAAGCGCTGACCGGTGTT



GAACTGATCGGTCTGGACGAAAACTTCCGTACCCTGGTTGACTGGCAGAAAGACAAAAAAAACAACG



TTGCGATGAAAGCGTACGAAAACTCTCTGCAGCGTCTGCGTACCGAAATCGGTAAAATCTTCAACCTG



AAAGCGGAAGACTGGGTTAAAAACAAATACCCGATCCTGGGTCTGAAAAACAAAAACACCGACATCC



TGTTCGAAGAAGCGGTTTTCGGTATCCTGAAAGCGCGTTACGGTGAAGAAAAAGACACCTTCATCGA



AGTTGAAGAAATCGACAAAACCGGTAAATCTAAAATCAACCAGATCTCTATCTTCGACTCTTGGAAA



GGTTTCACCGGTTACTTCAAAAAATTCTTCGAAACCCGTAAAAACTTCTACAAAAACGACGGTACCTC



TACCGCGATCGCGACCCGTATCATCGACCAGAACCTGAAACGTTTCATCGACAACCTGTCTATCGTTG



AATCTGTTCGTCAGAAAGTTGACCTGGCGGAAACCGAAAAATCTTTCTCTATCTCTCTGTCTCAGTTCT



TCTCTATCGACTTCTACAACAAATGCCTGCTGCAGGACGGTATCGACTACTACAACAAAATCATCGGT



GGTGAAACCCTGAAAAACGGTGAAAAACTGATCGGTCTGAACGAACTGATCAACCAGTACCGTCAGA



ACAACAAAGACCAGAAAATCCCGTTCTTCAAACTGCTGGACAAACAGATCCTGTCTGAAAAAATCCT



GTTCCTGGACGAAATCAAAAACGACACCGAACTGATCGAAGCGCTGTCTCAGTTCGCGAAAACCGCG



GAAGAAAAAACCAAAATCGTTAAAAAACTGTTCGCGGACTTCGTTGAAAACAACTCTAAATACGACC



TGGCGCAGATCTACATCTCTCAGGAAGCGTTCAACACCATCTCTAACAAATGGACCTCTGAAACCGAA



ACCTTCGCGAAATACCTGTTCGAAGCGATGAAATCTGGTAAACTGGCGAAATACGAAAAAAAAGACA



ACTCTTACAAATTCCCGGACTTCATCGCGCTGTCTCAGATGAAATCTGCGCTGCTGTCTATCTCTCTGG



AAGGTCACTTCTGGAAAGAAAAATACTACAAAATCTCTAAATTCCAGGAAAAAACCAACTGGGAACA



GTTCCTGGCGATCTTCCTGTACGAATTCAACTCTCTGTTCTCTGACAAAATCAACACCAAAGACGGTG



AAACCAAACAGGTTGGTTACTACCTGTTCGCGAAAGACCTGCACAACCTGATCCTGTCTGAACAGATC



GACATCCCGAAAGACTCTAAAGTTACCATCAAAGACTTCGCGGACTCTGTTCTGACCATCTACCAGAT



GGCGAAATACTTCGCGGTTGAAAAAAAACGTGCGTGGCTGGCGGAATACGAACTGGACTCTTTCTAC



ACCCAGCCGGACACCGGTTACCTGCAGTTCTACGACAACGCGTACGAAGACATCGTTCAGGTTTACAA



CAAACTGCGTAACTACCTGACCAAAAAACCGTACTCTGAAGAAAAATGGAAACTGAACTTCGAAAAC



TCTACCCTGGCGAACGGTTGGGACAAAAACAAAGAATCTGACAACTCTGCGGTTATCCTGCAGAAAG



GTGGTAAATACTACCTGGGTCTGATCACCAAAGGTCACAACAAAATCTTCGACGACCGTTTCCAGGAA



AAATTCATCGTTGGTATCGAAGGTGGTAAATACGAAAAAATCGTTTACAAATTCTTCCCGGACCAGGC



GAAAATGTTCCCGAAAGTTTGCTTCTCTGCGAAAGGTCTGGAATTCTTCCGTCCGTCTGAAGAAATCC



TGCGTATCTACAACAACGCGGAATTCAAAAAAGGTGAAACCTACTCTATCGACTCTATGCAGAAACT



GATCGACTTCTACAAAGACTGCCTGACCAAATACGAAGGTTGGGCGTGCTACACCTTCCGTCACCTGA



AACCGACCGAAGAATACCAGAACAACATCGGTGAATTCTTCCGTGACGTTGCGGAAGACGGTTACCG



TATCGACTTCCAGGGTATCTCTGACCAGTACATCCACGAAAAAAACGAAAAAGGTGAACTGCACCTG



TTCGAAATCCACAACAAAGACTGGAACCTGGACAAAGCGCGTGACGGTAAATCTAAAACCACCCAGA



AAAACCTGCACACCCTGTACTTCGAATCTCTGTTCTCTAACGACAACGTTGTTCAGAACTTCCCGATCA



AACTGAACGGTCAGGCGGAAATCTTCTACCGTCCGAAAACCGAAAAAGACAAACTGGAATCTAAAAA



AGACAAAAAAGGTAACAAAGTTATCGACCACAAACGTTACTCTGAAAACAAAATCTTCTTCCACGTT



CCGCTGACCCTGAACCGTACCAAAAACGACTCTTACCGTTTCAACGCGCAGATCAACAACTTCCTGGC



GAACAACAAAGACATCAACATCATCGGTGTTGACCGTGGTGAAAAACACCTGGTTTACTACTCTGTTA



TCACCCAGGCGTCTGACATCCTGGAATCTGGTTCTCTGAACGAACTGAACGGTGTTAACTACGCGGAA



AAACTGGGTAAAAAAGCGGAAAACCGTGAACAGGCGCGTCGTGACTGGCAGGACGTTCAGGGTATC



AAAGACCTGAAAAAAGGTTACATCTCTCAGGTTGTTCGTAAACTGGCGGACCTGGCGATCAAACACA



ACGCGATCATCATCCTGGAAGACCTGAACATGCGTTTCAAACAGGTTCGTGGTGGTATCGAAAAATCT



ATCTACCAGCAGCTGGAAAAAGCGCTGATCGACAAACTGTCTTTCCTGGTTGACAAAGGTGAAAAAA



ACCCGGAACAGGCGGGTCACCTGCTGAAAGCGTACCAGCTGTCTGCGCCGTTCGAAACCTTCCAGAA



AATGGGTAAACAGACCGGTATCATCTTCTACACCCAGGCGTCTTACACCTCTAAATCTGACCCGGTTA



CCGGTTGGCGTCCGCACCTGTACCTGAAATACTTCTCTGCGAAAAAAGCGAAAGACGACATCGCGAA



ATTCACCAAAATCGAATTCGTTAACGACCGTTTCGAACTGACCTACGACATCAAAGACTTCCAGCAGG



CGAAAGAATACCCGAACAAAACCGTTTGGAAAGTTTGCTCTAACGTTGAACGTTTCCGTTGGGACAA



AAACCTGAACCAGAACAAAGGTGGTTACACCCACTACACCAACATCACCGAAAACATCCAGGAACTG



TTCACCAAATACGGTATCGACATCACCAAAGACCTGCTGACCCAGATCTCTACCATCGACGAAAAAC



AGAACACCTCTTTCTTCCGTGACTTCATCTTCTACTTCAACCTGATCTGCCAGATCCGTAACACCGACG



ACTCTGAAATCGCGAAAAAAAACGGTAAAGACGACTTCATCCTGTCTCCGGTTGAACCGTTCTTCGAC



TCTCGTAAAGACAACGGTAACAAACTGCCGGAAAACGGTGACGACAACGGTGCGTACAACATCGCGC



GTAAAGGTATCGTTATCCTGAACAAAATCTCTCAGTACTCTGAAAAAAACGAAAACTGCGAAAAAAT



GAAATGGGGTGACCTGTACGTTTCTAACATCGACTGGGACAACTTCGTT





SEQ ID NO: 51

ATGGAAAACTTTAAAAACTTATACCCAATAAACAAAACGTTACGTTTTGAACTGCGTCCATATGGTAA



AACACTGGAAAACTTTAAAAAAAGCGGTTTGTTGGAGAAGGATGCATTTAAAGCGAACTCTCGCAGA



TCCATGCAGGCCATCATTGATGAAAAATTTAAAGAGACGATCGAAGAACGTCTGAAATACACGGAAT



TTAGTGAGTGTGACTTAGGTAATATGACTTCTAAAGATAAGAAAATCACCGATAAGGCGGCGACCAA



CCTGAAGAAGCAAGTCATTTTATCTTTTGATGATGAAATCTTTAACAACTATTTGAAACCGGACAAAA



ACATCGATGCCTTATTTAAAAATGACCCTTCGAACCCGGTGATTAGCACATTTAAGGGCTTCACAACG



TATTTTGTCAATTTTTTTGAAATTCGTAAACATATCTTCAAAGGAGAATCAAGCGGCTCTATGGCTTAT



CGCATTATTGATGAAAACCTGACGACCTATTTGAATAACATTGAAAAAATCAAAAAACTGCCAGAGG



AATTAAAGTCTCAGTTAGAAGGCATCGACCAGATCGACAAACTCAACAACTATAACGAATTTATTAC



GCAGTCTGGTATCACCCACTATAATGAAATTATTGGAGGTATCAGTAAATCAGAAAATGTGAAAATCC



AAGGGATTAATGAAGGCATTAACCTCTATTGCCAGAAAAATAAAGTGAAACTGCCGAGGCTGACTCC



ACTCTACAAAATGATCCTGTCTGACCGCGTCTCGAATAGCTTTGTCCTGGACACAATTGAAAACGATA



CGGAATTGATTGAGATGATAAGCGATCTGATTAACAAAACCGAAATTTCACAGGATGTAATCATGAG



TGATATACAAAACATCTTTATTAAATATAAACAGCTTGGTAATCTGCCTGGAATTAGCTATTCGTCAA



TAGTGAACGCAATCTGTTCTGATTATGATAACAATTTTGGCGACGGTAAGCGTAAAAAGAGTTATGAA



AACGATAGGAAAAAACACCTGGAAACTAACGTGTATTCTATCAACTATATCAGCGAACTGCTTACGG



ACACCGATGTGAGTTCAAACATTAAGATGCGGTATAAGGAGCTTGAACAGAACTACCAGGTCTGTAA



GGAAAACTTCAACGCAACCAACTGGATGAACATTAAAAATATCAAACAATCCGAGAAGACCAACTTA



ATCAAAGATCTGCTGGATATTTTGAAGAGCATTCAACGTTTTTATGATCTGTTCGATATCGTTGATGAA



GACAAGAATCCTAGTGCGGAATTTTATACATGGCTGTCTAAAAATGCGGAGAAATTGGATTTCGAATT



CAATTCTGTTTATAATAAATCACGCAACTATTTGACCCGCAAACAATACAGCGACAAAAAGATAAAA



CTAAACTTCGACAGTCCGACATTGGCAAAGGGCTGGGACGCAAATAAGGAAATCGATAACTCTACGA



TAATTATGCGTAAGTTCAATAATGATCGAGGTGATTATGATTATTTCTTAGGCATTTGGAACAAAAGC



ACCCCGGCCAACGAAAAGATAATTCCACTGGAGGATAACGGTCTGTTCGAAAAAATGCAGTACAAAT



TATATCCGGATCCAAGCAAGATGCTTCCAAAGCAGTTTCTGTCTAAAATTTGGAAAGCTAAGCATCCG



ACCACCCCAGAATTTGACAAGAAATATAAGGAAGGCCGCCATAAGAAAGGTCCCGATTTTGAAAAAG



AATTCTTGCACGAACTGATTGATTGCTTTAAACATGGCTTAGTCAATCACGATGAAAAGTATCAAGAT



GTTTTTGGATTCAATTTGAGAAACACAGAAGACTACAATTCCTACACTGAGTTTCTCGAAGATGTGGA



ACGATGTAATTATAATCTGAGCTTTAACAAAATCGCGGACACCTCGAATCTGATTAACGATGGTAAAC



TTTATGTTTTCCAGATCTGGAGCAAGGATTTCTCTATTGACAGCAAAGGCACCAAAAACCTGAACACC



ATTTACTTTGAAAGTCTCTTCAGCGAAGAAAATATGATTGAGAAAATGTTTAAACTTAGCGGTGAAGC



TGAAATATTCTATCGCCCGGCAAGCCTGAACTATTGCGAAGACATTATCAAAAAGGGTCATCACCACG



CTGAACTGAAAGATAAATTTGATTATCCTATCATAAAAGATAAACGCTATAGCCAGGATAAATTTTTT



TTTCATGTTCCTATGGTCATTAACTACAAATCAGAAAAACTGAACTCTAAAAGCCTCAATAATCGAAC



CAATGAAAACCTTGGGCAGTTTACCCATATAATTGGAATTGATCGCGGAGAGCGTCATTTAATCTACC



TGACCGTAGTCGATGTATCGACCGGCGAGATCGTCGAGCAGAAGCACTTAGACGAGATTATCAACAC



TGATACCAAAGGTGTTGAGCATAAGACGCACTATCTAAACAAGCTGGAGGAAAAATCGAAAACCCGT



GATAATGAACGTAAGAGTTGGGAGGCAATTGAAACGATTAAAGAACTGAAGGAGGGTTATATCAGCC



ACGTAATCAATGAAATTCAAAAACTGCAGGAAAAATACAACGCCCTGATCGTTATGGAAAATCTGAA



TTACGGTTTCAAAAATTCTCGCATCAAAGTGGAAAAACAGGTATATCAGAAGTTCGAGACGGCATTA



ATTAAAAAGTTTAATTACATCATTGACAAAAAAGATCCGGAAACTTATATTCATGGCTATCAGCTGAC



GAACCCGATCACCACACTGGATAAAATTGGTAACCAGTCTGGTATCGTGCTTTACATCCCTGCCTGGA



ATACCAGTAAAATCGATCCGGTAACGGGATTCGTCAACCTTCTATATGCAGATGACCTCAAATATAAG



AATCAGGAACAGGCCAAGTCTTTTATTCAGAAAATCGATAACATTTACTTTGAGAATGGGGAATTCAA



ATTTGATATTGATTTTTCTAAATGGAACAATCGTTATAGTATATCTAAGACGAAATGGACGCTCACCT



CGTACGGAACCCGAATCCAGACATTCCGCAATCCGCAGAAGAACAATAAATGGGACAGCGCCGAGTA



TGATCTCACTGAAGAATTCAAATTGATTCTGAACATTGACGGTACCCTGAAAAGCCAGGATGTCGAAA



CCTATAAAAAATTTATGTCTCTGTTCAAGCTGATGCTGCAACTTAGGAACTCTGTTACCGGCACTGAT



ATCGATTATATGATCTCCCCTGTCACTGATAAAACAGGTACGCATTTCGATTCGCGCGAAAATATCAA



AAATCTGCCCGCAGATGCCGACGCCAATGGGGCGTACAATATTGCACGCAAGGGTATCATGGCGATC



GAAAACATTATGAATGGTATCAGCGACCCGCTGAAAATCTCAAACGAAGATTATTTGAAATATATCC



AAAACCAGCAGGAATAA





SEQ ID NO: 52

ATGACCCAGTTCGAAGGTTTCACCAACCTGTACCAGGTTTCTAAAACCCTGCGTTTCGAACTGATCCC



GCAGGGTAAAACCCTGAAACACATCCAGGAACAGGGTTTCATCGAAGAAGACAAAGCGCGTAACGA



CCACTACAAAGAACTGAAACCGATCATCGACCGTATCTACAAAACCTACGCGGACCAGTGCCTGCAG



CTGGTTCAGCTGGACTGGGAAAACCTGTCTGCGGCGATCGACTCTTACCGTAAAGAAAAAACCGAAG



AAACCCGTAACGCGCTGATCGAAGAACAGGCGACCTACCGTAACGCGATCCACGACTACTTCATCGG



TCGTACCGACAACCTGACCGACGCGATCAACAAACGTCACGCGGAAATCTACAAAGGTCTGTTCAAA



GCGGAACTGTTCAACGGTAAAGTTCTGAAACAGCTGGGTACCGTTACCACCACCGAACACGAAAACG



CGCTGCTGCGTTCTTTCGACAAATTCACCACCTACTTCTCTGGTTTCTACGAAAACCGTAAAAACGTTT



TCTCTGCGGAAGACATCTCTACCGCGATCCCGCACCGTATCGTTCAGGACAACTTCCCGAAATTCAAA



GAAAACTGCCACATCTTCACCCGTCTGATCACCGCGGTTCCGTCTCTGCGTGAACACTTCGAAAACGT



TAAAAAAGCGATCGGTATCTTCGTTTCTACCTCTATCGAAGAAGTTTTCTCTTTCCCGTTCTACAACCA



GCTGCTGACCCAGACCCAGATCGACCTGTACAACCAGCTGCTGGGTGGTATCTCTCGTGAAGCGGGTA



CCGAAAAAATCAAAGGTCTGAACGAAGTTCTGAACCTGGCGATCCAGAAAAACGACGAAACCGCGC



ACATCATCGCGTCTCTGCCGCACCGTTTCATCCCGCTGTTCAAACAGATCCTGTCTGACCGTAACACCC



TGTCTTTCATCCTGGAAGAATTCAAATCTGACGAAGAAGTTATCCAGTCTTTCTGCAAATACAAAACC



CTGCTGCGTAACGAAAACGTTCTGGAAACCGCGGAAGCGCTGTTCAACGAACTGAACTCTATCGACCT



GACCCACATCTTCATCTCTCACAAAAAACTGGAAACCATCTCTTCTGCGCTGTGCGACCACTGGGACA



CCCTGCGTAACGCGCTGTACGAACGTCGTATCTCTGAACTGACCGGTAAAATCACCAAATCTGCGAAA



GAAAAAGTTCAGCGTTCTCTGAAACACGAAGACATCAACCTGCAGGAAATCATCTCTGCGGCGGGTA



AAGAACTGTCTGAAGCGTTCAAACAGAAAACCTCTGAAATCCTGTCTCACGCGCACGCGGCGCTGGA



CCAGCCGCTGCCGACCACCCTGAAAAAACAGGAAGAAAAAGAAATCCTGAAATCTCAGCTGGACTCT



CTGCTGGGTCTGTACCACCTGCTGGACTGGTTCGCGGTTGACGAATCTAACGAAGTTGACCCGGAATT



CTCTGCGCGTCTGACCGGTATCAAACTGGAAATGGAACCGTCTCTGTCTTTCTACAACAAAGCGCGTA



ACTACGCGACCAAAAAACCGTACTCTGTTGAAAAATTCAAACTGAACTTCCAGATGCCGACCCTGGC



GTCTGGTTGGGACGTTAACAAAGAAAAAAACAACGGTGCGATCCTGTTCGTTAAAAACGGTCTGTAC



TACCTGGGTATCATGCCGAAACAGAAAGGTCGTTACAAAGCGCTGTCTTTCGAACCGACCGAAAAAA



CCTCTGAAGGTTTCGACAAAATGTACTACGACTACTTCCCGGACGCGGCGAAAATGATCCCGAAATGC



TCTACCCAGCTGAAAGCGGTTACCGCGCACTTCCAGACCCACACCACCCCGATCCTGCTGTCTAACAA



CTTCATCGAACCGCTGGAAATCACCAAAGAAATCTACGACCTGAACAACCCGGAAAAAGAACCGAAA



AAATTCCAGACCGCGTACGCGAAAAAAACCGGTGACCAGAAAGGTTACCGTGAAGCGCTGTGCAAAT



GGATCGACTTCACCCGTGACTTCCTGTCTAAATACACCAAAACCACCTCTATCGACCTGTCTTCTCTGC



GTCCGTCTTCTCAGTACAAAGACCTGGGTGAATACTACGCGGAACTGAACCCGCTGCTGTACCACATC



TCTTTCCAGCGTATCGCGGAAAAAGAAATCATGGACGCGGTTGAAACCGGTAAACTGTACCTGTTCCA



GATCTACAACAAAGACTTCGCGAAAGGTCACCACGGTAAACCGAACCTGCACACCCTGTACTGGACC



GGTCTGTTCTCTCCGGAAAACCTGGCGAAAACCTCTATCAAACTGAACGGTCAGGCGGAACTGTTCTA



CCGTCCGAAATCTCGTATGAAACGTATGGCGCACCGTCTGGGTGAAAAAATGCTGAACAAAAAACTG



AAAGACCAGAAAACCCCGATCCCGGACACCCTGTACCAGGAACTGTACGACTACGTTAACCACCGTC



TGTCTCACGACCTGTCTGACGAAGCGCGTGCGCTGCTGCCGAACGTTATCACCAAAGAAGTTTCTCAC



GAAATCATCAAAGACCGTCGTTTCACCTCTGACAAATTCTTCTTCCACGTTCCGATCACCCTGAACTAC



CAGGCGGCGAACTCTCCGTCTAAATTCAACCAGCGTGTTAACGCGTACCTGAAAGAACACCCGGAAA



CCCCGATCATCGGTATCGACCGTGGTGAACGTAACCTGATCTACATCACCGTTATCGACTCTACCGGT



AAAATCCTGGAACAGCGTTCTCTGAACACCATCCAGCAGTTCGACTACCAGAAAAAACTGGACAACC



GTGAAAAAGAACGTGTTGCGGCGCGTCAGGCGTGGTCTGTTGTTGGTACCATCAAAGACCTGAAACA



GGGTTACCTGTCTCAGGTTATCCACGAAATCGTTGACCTGATGATCCACTACCAGGCGGTTGTTGTTCT



GGAAAACCTGAACTTCGGTTTCAAATCTAAACGTACCGGTATCGCGGAAAAAGCGGTTTACCAGCAG



TTCGAAAAAATGCTGATCGACAAACTGAACTGCCTGGTTCTGAAAGACTACCCGGCGGAAAAAGTTG



GTGGTGTTCTGAACCCGTACCAGCTGACCGACCAGTTCACCTCTTTCGCGAAAATGGGTACCCAGTCT



GGTTTCCTGTTCTACGTTCCGGCGCCGTACACCTCTAAAATCGACCCGCTGACCGGTTTCGTTGACCCG



TTCGTTTGGAAAACCATCAAAAACCACGAATCTCGTAAACACTTCCTGGAAGGTTTCGACTTCCTGCA



CTACGACGTTAAAACCGGTGACTTCATCCTGCACTTCAAAATGAACCGTAACCTGTCTTTCCAGCGTG



GTCTGCCGGGTTTCATGCCGGCGTGGGACATCGTTTTCGAAAAAAACGAAACCCAGTTCGACGCGAA



AGGTACCCCGTTCATCGCGGGTAAACGTATCGTTCCGGTTATCGAAAACCACCGTTTCACCGGTCGTT



ACCGTGACCTGTACCCGGCGAACGAACTGATCGCGCTGCTGGAAGAAAAAGGTATCGTTTTCCGTGA



CGGTTCTAACATCCTGCCGAAACTGCTGGAAAACGACGACTCTCACGCGATCGACACCATGGTTGCGC



TGATCCGTTCTGTTCTGCAGATGCGTAACTCTAACGCGGCGACCGGTGAAGACTACATCAACTCTCCG



GTTCGTGACCTGAACGGTGTTTGCTTCGACTCTCGTTTCCAGAACCCGGAATGGCCGATGGACGCGGA



CGCGAACGGTGCGTACCACATCGCGCTGAAAGGTCAGCTGCTGCTGAACCACCTGAAAGAATCTAAA



GACCTGAAACTGCAGAACGGTATCTCTAACCAGGACTGGCTGGCGTACATCCAGGAACTGCGTAACT



A





SEQ ID NO: 53
ATGGCGGTTAAATCTATCAAAGTTAAACTGCGTCTGGACGACATGCCGGAAATCCGTGCGGGTCTGTG



GAAACTGCACAAAGAAGTTAACGCGGGTGTTCGTTACTACACCGAATGGCTGTCTCTGCTGCGTCAGG



AAAACCTGTACCGTCGTTCTCCGAACGGTGACGGTGAACAGGAATGCGACAAAACCGCGGAAGAATG


CAAAGCGGAACTGCTGGAACGTCTGCGTGCGCGTCAGGTTGAAAACGGTCACCGTGGTCCGGCGGGT



TCTGACGACGAACTGCTGCAGCTGGCGCGTCAGCTGTACGAACTGCTGGTTCCGCAGGCGATCGGTGC



GAAAGGTGACGCGCAGCAGATCGCGCGTAAATTCCTGTCTCCGCTGGCGGACAAAGACGCGGTTGGT



GGTCTGGGTATCGCGAAAGCGGGTAACAAACCGCGTTGGGTTCGTATGCGTGAAGCGGGTGAACCGG



GTTGGGAAGAAGAAAAAGAAAAAGCGGAAACCCGTAAATCTGCGGACCGTACCGCGGACGTTCTGC



GTGCGCTGGCGGACTTCGGTCTGAAACCGCTGATGCGTGTTTACACCGACTCTGAAATGTCTTCTGTT



GAATGGAAACCGCTGCGTAAAGGTCAGGCGGTTCGTACCTGGGACCGTGACATGTTCCAGCAGGCGA



TCGAACGTATGATGTCTTGGGAATCTTGGAACCAGCGTGTTGGTCAGGAATACGCGAAACTGGTTGAA



CAGAAAAACCGTTTCGAACAGAAAAACTTCGTTGGTCAGGAACACCTGGTTCACCTGGTTAACCAGCT



GCAGCAGGACATGAAAGAAGCGTCTCCGGGTCTGGAATCTAAAGAACAGACCGCGCACTACGTTACC



GGTCGTGCGCTGCGTGGTTCTGACAAAGTTTTCGAAAAATGGGGTAAACTGGCGCCGGACGCGCCGTT



CGACCTGTACGACGCGGAAATCAAAAACGTTCAGCGTCGTAACACCCGTCGTTTCGGTTCTCACGACC



TGTTCGCGAAACTGGCGGAACCGGAATACCAGGCGCTGTGGCGTGAAGACGCGTCTTTCCTGACCCGT



TACGCGGTTTACAACTCTATCCTGCGTAAACTGAACCACGCGAAAATGTTCGCGACCTTCACCCTGCC



GGACGCGACCGCGCACCCGATCTGGACCCGTTTCGACAAACTGGGTGGTAACCTGCACCAGTACACC



TTCCTGTTCAACGAATTCGGTGAACGTCGTCACGCGATCCGTTTCCACAAACTGCTGAAAGTTGAAAA



CGGTGTTGCGCGTGAAGTTGACGACGTTACCGTTCCGATCTCTATGTCTGAACAGCTGGACAACCTGC



TGCCGCGTGACCCGAACGAACCGATCGCGCTGTACTTCCGTGACTACGGTGCGGAACAGCACTTCACC



GGTGAATTCGGTGGTGCGAAAATCCAGTGCCGTCGTGACCAGCTGGCGCACATGCACCGTCGTCGTG



GTGCGCGTGACGTTTACCTGAACGTTTCTGTTCGTGTTCAGTCTCAGTCTGAAGCGCGTGGTGAACGTC



GTCCGCCGTACGCGGCGGTTTTCCGTCTGGTTGGTGACAACCACCGTGCGTTCGTTCACTTCGACAAA



CTGTCTGACTACCTGGCGGAACACCCGGACGACGGTAAACTGGGTTCTGAAGGTCTGCTGTCTGGTCT



GCGTGTTATGTCTGTTGACCTGGGTCTGCGTACCTCTGCGTCTATCTCTGTTTTCCGTGTTGCGCGTAA



AGACGAACTGAAACCGAACTCTAAAGGTCGTGTTCCGTTCTTCTTCCCGATCAAAGGTAACGACAACC



TGGTTGCGGTTCACGAACGTTCTCAGCTGCTGAAACTGCCGGGTGAAACCGAATCTAAAGACCTGCGT



GCGATCCGTGAAGAACGTCAGCGTACCCTGCGTCAGCTGCGTACCCAGCTGGCGTACCTGCGTCTGCT



GGTTCGTTGCGGTTCTGAAGACGTTGGTCGTCGTGAACGTTCTTGGGCGAAACTGATCGAACAGCCGG



TTGACGCGGCGAACCACATGACCCCGGACTGGCGTGAAGCGTTCGAAAACGAACTGCAGAAACTGAA



ATCTCTGCACGGTATCTGCTCTGACAAAGAATGGATGGACGCGGTTTACGAATCTGTTCGTCGTGTTT



GGCGTCACATGGGTAAACAGGTTCGTGACTGGCGTAAAGACGTTCGTTCTGGTGAACGTCCGAAAAT



CCGTGGTTACGCGAAAGACGTTGTTGGTGGTAACTCTATCGAACAGATCGAATACCTGGAACGTCAGT



ACAAATTCCTGAAATCTTGGTCTTTCTTCGGTAAAGTTTCTGGTCAGGTTATCCGTGCGGAAAAAGGTT



CTCGTTTCGCGATCACCCTGCGTGAACACATCGACCACGCGAAAGAAGACCGTCTGAAAAAACTGGC



GGACCGTATCATCATGGAAGCGCTGGGTTACGTTTACGCGCTGGACGAACGTGGTAAAGGTAAATGG



GTTGCGAAATACCCGCCGTGCCAGCTGATCCTGCTGGAAGAACTGTCTGAATACCAGTTCAACAACGA



CCGTCCGCCGTCTGAAAACAACCAGCTGATGCAGTGGTCTCACCGTGGTGTTTTCCAGGAACTGATCA



ACCAGGCGCAGGTTCACGACCTGCTGGTTGGTACCATGTACGCGGCGTTCTCTTCTCGTTTCGACGCG



CGTACCGGTGCGCCGGGTATCCGTTGCCGTCGTGTTCCGGCGCGTTGCACCCAGGAACACAACCCGGA



ACCGTTCCCGTGGTGGCTGAACAAATTCGTTGTTGAACACACCCTGGACGCGTGCCCGCTGCGTGCGG



ACGACCTGATCCCGACCGGTGAAGGTGAAATCTTCGTTTCTCCGTTCTCTGCGGAAGAAGGTGACTTC



CACCAGATCCACGCGGACCTGAACGCGGCGCAGAACCTGCAGCAGCGTCTGTGGTCTGACTTCGACA



TCTCTCAGATCCGTCTGCGTTGCGACTGGGGTGAAGTTGACGGTGAACTGGTTCTGATCCCGCGTCTG



ACCGGTAAACGTACCGCGGACTCTTACTCTAACAAAGTTTTCTACACCAACACCGGTGTTACCTACTA



CGAACGTGAACGTGGTAAAAAACGTCGTAAAGTTTTCGCGCAGGAAAAACTGTCTGAAGAAGAAGCG



GAACTGCTGGTTGAAGCGGACGAAGCGCGTGAAAAATCTGTTGTTCTGATGCGTGACCCGTCTGGTAT



CATCAACCGTGGTAACTGGACCCGTCAGAAAGAATTCTGGTCTATGGTTAACCAGCGTATCGAAGGTT



ACCTGGTTAAACAGATCCGTTCTCGTGTTCCGCTGCAGGACTCTGCGTGCGAAAACACCGGTGACATC



TAA





SEQ ID NO: 54
ATGGCGACCCGTTCTTTCATCCTGAAAATCGAACCGAACGAAGAAGTTAAAAAAGGTCTGTGGAAAA



CCCACGAAGTTCTGAACCACGGTATCGCGTACTACATGAACATCCTGAAACTGATCCGTCAGGAAGC



GATCTACGAACACCACGAACAGGACCCGAAAAACCCGAAAAAAGTTTCTAAAGCGGAAATCCAGGC


GGAACTGTGGGACTTCGTTCTGAAAATGCAGAAATGCAACTCTTTCACCCACGAAGTTGACAAAGAC



GTTGTTTTCAACATCCTGCGTGAACTGTACGAAGAACTGGTTCCGTCTTCTGTTGAAAAAAAAGGTGA



AGCGAACCAGCTGTCTAACAAATTCCTGTACCCGCTGGTTGACCCGAACTCTCAGTCTGGTAAAGGTA



CCGCGTCTTCTGGTCGTAAACCGCGTTGGTACAACCTGAAAATCGCGGGTGACCCGTCTTGGGAAGAA



GAAAAAAAAAAATGGGAAGAAGACAAAAAAAAAGACCCGCTGGCGAAAATCCTGGGTAAACTGGCG



GAATACGGTCTGATCCCGCTGTTCATCCCGTTCACCGACTCTAACGAACCGATCGTTAAAGAAATCAA



ATGGATGGAAAAATCTCGTAACCAGTCTGTTCGTCGTCTGGACAAAGACATGTTCATCCAGGCGCTGG



AACGTTTCCTGTCTTGGGAATCTTGGAACCTGAAAGTTAAAGAAGAATACGAAAAAGTTGAAAAAGA



ACACAAAACCCTGGAAGAACGTATCAAAGAAGACATCCAGGCGTTCAAATCTCTGGAACAGTACGAA



AAAGAACGTCAGGAACAGCTGCTGCGTGACACCCTGAACACCAACGAATACCGTCTGTCTAAACGTG



GTCTGCGTGGTTGGCGTGAAATCATCCAGAAATGGCTGAAAATGGACGAAAACGAACCGTCTGAAAA



ATACCTGGAAGTTTTCAAAGACTACCAGCGTAAACACCCGCGTGAAGCGGGTGACTACTCTGTTTACG



AATTCCTGTCTAAAAAAGAAAACCACTTCATCTGGCGTAACCACCCGGAATACCCGTACCTGTACGCG



ACCTTCTGCGAAATCGACAAAAAAAAAAAAGACGCGAAACAGCAGGCGACCTTCACCCTGGCGGACC



CGATCAACCACCCGCTGTGGGTTCGTTTCGAAGAACGTTCTGGTTCTAACCTGAACAAATACCGTATC



CTGACCGAACAGCTGCACACCGAAAAACTGAAAAAAAAACTGACCGTTCAGCTGGACCGTCTGATCT



ACCCGACCGAATCTGGTGGTTGGGAAGAAAAAGGTAAAGTTGACATCGTTCTGCTGCCGTCTCGTCAG



TTCTACAACCAGATCTTCCTGGACATCGAAGAAAAAGGTAAACACGCGTTCACCTACAAAGACGAAT



CTATCAAATTCCCGCTGAAAGGTACCCTGGGTGGTGCGCGTGTTCAGTTCGACCGTGACCACCTGCGT



CGTTACCCGCACAAAGTTGAATCTGGTAACGTTGGTCGTATCTACTTCAACATGACCGTTAACATCGA



ACCGACCGAATCTCCGGTTTCTAAATCTCTGAAAATCCACCGTGACGACTTCCCGAAATTCGTTAACT



TCAAACCGAAAGAACTGACCGAATGGATCAAAGACTCTAAAGGTAAAAAACTGAAATCTGGTATCGA



ATCTCTGGAAATCGGTCTGCGTGTTATGTCTATCGACCTGGGTCAGCGTCAGGCGGCGGCGGCGTCTA



TCTTCGAAGTTGTTGACCAGAAACCGGACATCGAAGGTAAACTGTTCTTCCCGATCAAAGGTACCGAA



CTGTACGCGGTTCACCGTGCGTCTTTCAACATCAAACTGCCGGGTGAAACCCTGGTTAAATCTCGTGA



AGTTCTGCGTAAAGCGCGTGAAGACAACCTGAAACTGATGAACCAGAAACTGAACTTCCTGCGTAAC



GTTCTGCACTTCCAGCAGTTCGAAGACATCACCGAACGTGAAAAACGTGTTACCAAATGGATCTCTCG



TCAGGAAAACTCTGACGTTCCGCTGGTTTACCAGGACGAACTGATCCAGATCCGTGAACTGATGTACA



AACCGTACAAAGACTGGGTTGCGTTCCTGAAACAGCTGCACAAACGTCTGGAAGTTGAAATCGGTAA



AGAAGTTAAACACTGGCGTAAATCTCTGTCTGACGGTCGTAAAGGTCTGTACGGTATCTCTCTGAAAA



ACATCGACGAAATCGACCGTACCCGTAAATTCCTGCTGCGTTGGTCTCTGCGTCCGACCGAACCGGGT



GAAGTTCGTCGTCTGGAACCGGGTCAGCGTTTCGCGATCGACCAGCTGAACCACCTGAACGCGCTGA



AAGAAGACCGTCTGAAAAAAATGGCGAACACCATCATCATGCACGCGCTGGGTTACTGCTACGACGT



TCGTAAAAAAAAATGGCAGGCGAAAAACCCGGCGTGCCAGATCATCCTGTTCGAAGACCTGTCTAAC



TACAACCCGTACGAAGAACGTTCTCGTTTCGAAAACTCTAAACTGATGAAATGGTCTCGTCGTGAAAT



CCCGCGTCAGGTTGCGCTGCAGGGTGAAATCTACGGTCTGCAGGTTGGTGAAGTTGGTGCGCAGTTCT



CTTCTCGTTTCCACGCGAAAACCGGTTCTCCGGGTATCCGTTGCTCTGTTGTTACCAAAGAAAAACTGC



AGGACAACCGTTTCTTCAAAAACCTGCAGCGTGAAGGTCGTCTGACCCTGGACAAAATCGCGGTTCTG



AAAGAAGGTGACCTGTACCCGGACAAAGGTGGTGAAAAATTCATCTCTCTGTCTAAAGACCGTAAAC



TGGTTACCACCCACGCGGACATCAACGCGGCGCAGAACCTGCAGAAACGTTTCTGGACCCGTACCCA



CGGTTTCTACAAAGTTTACTGCAAAGCGTACCAGGTTGACGGTCAGACCGTTTACATCCCGGAATCTA



AAGACCAGAAACAGAAAATCATCGAAGAATTCGGTGAAGGTTACTTCATCCTGAAAGACGGTGTTTA



CGAATGGGGTAACGCGGGTAAACTGAAAATCAAAAAAGGTTCTTCTAAACAGTCTTCTTCTGAACTG



GTTGACTCTGACATCCTGAAAGACTCTTTCGACCTGGCGTCTGAACTGAAAGGTGAAAAACTGATGCT



GTACCGTGACCCGTCTGGTAACGTTTTCCCGTCTGACAAATGGATGGCGGCGGGTGTTTTCTTCGGTA



AACTGGAACGTATCCTGATCTCTAAACTGACCAACCAGTACTCTATCTCTACCATCGAAGACGACTCT



TCTAAACAGTCTATGTAA





SEQ ID NO: 55
ATGCCGACCCGTACCATCAACCTGAAACTGGTTCTGGGTAAAAACCCGGAAAACGCGACCCTGCGTC



GTGCGCTGTTCTCTACCCACCGTCTGGTTAACCAGGCGACCAAACGTATCGAAGAATTCCTGCTGCTG



TGCCGTGGTGAAGCGTACCGTACCGTTGACAACGAAGGTAAAGAAGCGGAAATCCCGCGTCACGCGG


TTCAGGAAGAAGCGCTGGCGTTCGCGAAAGCGGCGCAGCGTCACAACGGTTGCATCTCTACCTACGA



AGACCAGGAAATCCTGGACGTTCTGCGTCAGCTGTACGAACGTCTGGTTCCGTCTGTTAACGAAAACA



ACGAAGCGGGTGACGCGCAGGCGGCGAACGCGTGGGTTTCTCCGCTGATGTCTGCGGAATCTGAAGG



TGGTCTGTCTGTTTACGACAAAGTTCTGGACCCGCCGCCGGTTTGGATGAAACTGAAAGAAGAAAAA



GCGCCGGGTTGGGAAGCGGCGTCTCAGATCTGGATCCAGTCTGACGAAGGTCAGTCTCTGCTGAACA



AACCGGGTTCTCCGCCGCGTTGGATCCGTAAACTGCGTTCTGGTCAGCCGTGGCAGGACGACTTCGTT



TCTGACCAGAAAAAAAAACAGGACGAACTGACCAAAGGTAACGCGCCGCTGATCAAACAGCTGAAA



GAAATGGGTCTGCTGCCGCTGGTTAACCCGTTCTTCCGTCACCTGCTGGACCCGGAAGGTAAAGGTGT



TTCTCCGTGGGACCGTCTGGCGGTTCGTGCGGCGGTTGCGCACTTCATCTCTTGGGAATCTTGGAACC



ACCGTACCCGTGCGGAATACAACTCTCTGAAACTGCGTCGTGACGAATTCGAAGCGGCGTCTGACGA



ATTCAAAGACGACTTCACCCTGCTGCGTCAGTACGAAGCGAAACGTCACTCTACCCTGAAATCTATCG



CGCTGGCGGACGACTCTAACCCGTACCGTATCGGTGTTCGTTCTCTGCGTGCGTGGAACCGTGTTCGT



GAAGAATGGATCGACAAAGGTGCGACCGAAGAACAGCGTGTTACCATCCTGTCTAAACTGCAGACCC



AGCTGCGTGGTAAATTCGGTGACCCGGACCTGTTCAACTGGCTGGCGCAGGACCGTCACGTTCACCTG



TGGTCTCCGCGTGACTCTGTTACCCCGCTGGTTCGTATCAACGCGGTTGACAAAGTTCTGCGTCGTCGT



AAACCGTACGCGCTGATGACCTTCGCGCACCCGCGTTTCCACCCGCGTTGGATCCTGTACGAAGCGCC



GGGTGGTTCTAACCTGCGTCAGTACGCGCTGGACTGCACCGAAAACGCGCTGCACATCACCCTGCCGC



TGCTGGTTGACGACGCGCACGGTACCTGGATCGAAAAAAAAATCCGTGTTCCGCTGGCGCCGTCTGGT



CAGATCCAGGACCTGACCCTGGAAAAACTGGAAAAAAAAAAAAACCGTCTGTACTACCGTTCTGGTT



TCCAGCAGTTCGCGGGTCTGGCGGGTGGTGCGGAAGTTCTGTTCCACCGTCCGTACATGGAACACGAC



GAACGTTCTGAAGAATCTCTGCTGGAACGTCCGGGTGCGGTTTGGTTCAAACTGACCCTGGACGTTGC



GACCCAGGCGCCGCCGAACTGGCTGGACGGTAAAGGTCGTGTTCGTACCCCGCCGGAAGTTCACCAC



TTCAAAACCGCGCTGTCTAACAAATCTAAACACACCCGTACCCTGCAGCCGGGTCTGCGTGTTCTGTC



TGTTGACCTGGGTATGCGTACCTTCGCGTCTTGCTCTGTTTTCGAACTGATCGAAGGTAAACCGGAAA



CCGGTCGTGCGTTCCCGGTTGCGGACGAACGTTCTATGGACTCTCCGAACAAACTGTGGGCGAAACAC



GAACGTTCTTTCAAACTGACCCTGCCGGGTGAAACCCCGTCTCGTAAAGAAGAAGAAGAACGTTCTAT



CGCGCGTGCGGAAATCTACGCGCTGAAACGTGACATCCAGCGTCTGAAATCTCTGCTGCGTCTGGGTG



AAGAAGACAACGACAACCGTCGTGACGCGCTGCTGGAACAGTTCTTCAAAGGTTGGGGTGAAGAAGA



CGTTGTTCCGGGTCAGGCGTTCCCGCGTTCTCTGTTCCAGGGTCTGGGTGCGGCGCCGTTCCGTTCTAC



CCCGGAACTGTGGCGTCAGCACTGCCAGACCTACTACGACAAAGCGGAAGCGTGCCTGGCGAAACAC



ATCTCTGACTGGCGTAAACGTACCCGTCCGCGTCCGACCTCTCGTGAAATGTGGTACAAAACCCGTTC



TTACCACGGTGGTAAATCTATCTGGATGCTGGAATACCTGGACGCGGTTCGTAAACTGCTGCTGTCTT



GGTCTCTGCGTGGTCGTACCTACGGTGCGATCAACCGTCAGGACACCGCGCGTTTCGGTTCTCTGGCG



TCTCGTCTGCTGCACCACATCAACTCTCTGAAAGAAGACCGTATCAAAACCGGTGCGGACTCTATCGT



TCAGGCGGCGCGTGGTTACATCCCGCTGCCGCACGGTAAAGGTTGGGAACAGCGTTACGAACCGTGC



CAGCTGATCCTGTTCGAAGACCTGGCGCGTTACCGTTTCCGTGTTGACCGTCCGCGTCGTGAAAACTC



TCAGCTGATGCAGTGGAACCACCGTGCGATCGTTGCGGAAACCACCATGCAGGCGGAACTGTACGGT



CAGATCGTTGAAAACACCGCGGCGGGTTTCTCTTCTCGTTTCCACGCGGCGACCGGTGCGCCGGGTGT



TCGTTGCCGTTTCCTGCTGGAACGTGACTTCGACAACGACCTGCCGAAACCGTACCTGCTGCGTGAAC



TGTCTTGGATGCTGGGTAACACCAAAGTTGAATCTGAAGAAGAAAAACTGCGTCTGCTGTCTGAAAA



AATCCGTCCGGGTTCTCTGGTTCCGTGGGACGGTGGTGAACAGTTCGCGACCCTGCACCCGAAACGTC



AGACCCTGTGCGTTATCCACGCGGACATGAACGCGGCGCAGAACCTGCAGCGTCGTTTCTTCGGTCGT



TGCGGTGAAGCGTTCCGTCTGGTTTGCCAGCCGCACGGTGACGACGTTCTGCGTCTGGCGTCTACCCC



GGGTGCGCGTCTGCTGGGTGCGCTGCAGCAGCTGGAAAACGGTCAGGGTGCGTTCGAACTGGTTCGT



GACATGGGTTCTACCTCTCAGATGAACCGTTTCGTTATGAAATCTCTGGGTAAAAAAAAAATCAAACC



GCTGCAGGACAACAACGGTGACGACGAACTGGAAGACGTTCTGTCTGTTCTGCCGGAAGAAGACGAC



ACCGGTCGTATCACCGTTTTCCGTGACTCTTCTGGTATCTTCTTCCCGTGCAACGTTTGGATCCCGGCG



AAACAGTTCTGGCCGGCGGTTCGTGCGATGATCTGGAAAGTTATGGCGTCTCACTCTCTGGGTTAA





SEQ ID NO: 56
ATGACCAAACTGCGTCACCGTCAGAAAAAACTGACCCACGACTGGGCGGGTTCTAAAAAACGTGAAG



TTCTGGGTTCTAACGGTAAACTGCAGAACCCGCTGCTGATGCCGGTTAAAAAAGGTCAGGTTACCGAA



TTCCGTAAAGCGTTCTCTGCGTACGCGCGTGCGACCAAAGGTGAAATGACCGACGGTCGTAAAAACA


TGTTCACCCACTCTTTCGAACCGTTCAAAACCAAACCGTCTCTGCACCAGTGCGAACTGGCGGACAAA



GCGTACCAGTCTCTGCACTCTTACCTGCCGGGTTCTCTGGCGCACTTCCTGCTGTCTGCGCACGCGCTG



GGTTTCCGTATCTTCTCTAAATCTGGTGAAGCGACCGCGTTCCAGGCGTCTTCTAAAATCGAAGCGTA



CGAATCTAAACTGGCGTCTGAACTGGCGTGCGTTGACCTGTCTATCCAGAACCTGACCATCTCTACCC



TGTTCAACGCGCTGACCACCTCTGTTCGTGGTAAAGGTGAAGAAACCTCTGCGGACCCGCTGATCGCG



CGTTTCTACACCCTGCTGACCGGTAAACCGCTGTCTCGTGACACCCAGGGTCCGGAACGTGACCTGGC



GGAAGTTATCTCTCGTAAAATCGCGTCTTCTTTCGGTACCTGGAAAGAAATGACCGCGAACCCGCTGC



AGTCTCTGCAGTTCTTCGAAGAAGAACTGCACGCGCTGGACGCGAACGTTTCTCTGTCTCCGGCGTTC



GACGTTCTGATCAAAATGAACGACCTGCAGGGTGACCTGAAAAACCGTACCATCGTTTTCGACCCGG



ACGCGCCGGTTTTCGAATACAACGCGGAAGACCCGGCGGACATCATCATCAAACTGACCGCGCGTTA



CGCGAAAGAAGCGGTTATCAAAAACCAGAACGTTGGTAACTACGTTAAAAACGCGATCACCACCACC



AACGCGAACGGTCTGGGTTGGCTGCTGAACAAAGGTCTGTCTCTGCTGCCGGTTTCTACCGACGACGA



ACTGCTGGAATTCATCGGTGTTGAACGTTCTCACCCGTCTTGCCACGCGCTGATCGAACTGATCGCGC



AGCTGGAAGCGCCGGAACTGTTCGAAAAAAACGTTTTCTCTGACACCCGTTCTGAAGTTCAGGGTATG



ATCGACTCTGCGGTTTCTAACCACATCGCGCGTCTGTCTTCTTCTCGTAACTCTCTGTCTATGGACTCT



GAAGAACTGGAACGTCTGATCAAATCTTTCCAGATCCACACCCCGCACTGCTCTCTGTTCATCGGTGC



GCAGTCTCTGTCTCAGCAGCTGGAATCTCTGCCGGAAGCGCTGCAGTCTGGTGTTAACTCTGCGGACA



TCCTGCTGGGTTCTACCCAGTACATGCTGACCAACTCTCTGGTTGAAGAATCTATCGCGACCTACCAG



CGTACCCTGAACCGTATCAACTACCTGTCTGGTGTTGCGGGTCAGATCAACGGTGCGATCAAACGTAA



AGCGATCGACGGTGAAAAAATCCACCTGCCGGCGGCGTGGTCTGAACTGATCTCTCTGCCGTTCATCG



GTCAGCCGGTTATCGACGTTGAATCTGACCTGGCGCACCTGAAAAACCAGTACCAGACCCTGTCTAAC



GAATTCGACACCCTGATCTCTGCGCTGCAGAAAAACTTCGACCTGAACTTCAACAAAGCGCTGCTGAA



CCGTACCCAGCACTTCGAAGCGATGTGCCGTTCTACCAAAAAAAACGCGCTGTCTAAACCGGAAATC



GTTTCTTACCGTGACCTGCTGGCGCGTCTGACCTCTTGCCTGTACCGTGGTTCTCTGGTTCTGCGTCGT



GCGGGTATCGAAGTTCTGAAAAAACACAAAATCTTCGAATCTAACTCTGAACTGCGTGAACACGTTCA



CGAACGTAAACACTTCGTTTTCGTTTCTCCGCTGGACCGTAAAGCGAAAAAACTGCTGCGTCTGACCG



ACTCTCGTCCGGACCTGCTGCACGTTATCGACGAAATCCTGCAGCACGACAACCTGGAAAACAAAGA



CCGTGAATCTCTGTGGCTGGTTCGTTCTGGTTACCTGCTGGCGGGTCTGCCGGACCAGCTGTCTTCTTC



TTTCATCAACCTGCCGATCATCACCCAGAAAGGTGACCGTCGTCTGATCGACCTGATCCAGTACGACC



AGATCAACCGTGACGCGTTCGTTATGCTGGTTACCTCTGCGTTCAAATCTAACCTGTCTGGTCTGCAGT



ACCGTGCGAACAAACAGTCTTTCGTTGTTACCCGTACCCTGTCTCCGTACCTGGGTTCTAAACTGGTTT



ACGTTCCGAAAGACAAAGACTGGCTGGTTCCGTCTCAGATGTTCGAAGGTCGTTTCGCGGACATCCTG



CAGTCTGACTACATGGTTTGGAAAGACGCGGGTCGTCTGTGCGTTATCGACACCGCGAAACACCTGTC



TAACATCAAAAAATCTGTTTTCTCTTCTGAAGAAGTTCTGGCGTTCCTGCGTGAACTGCCGCACCGTAC



CTTCATCCAGACCGAAGTTCGTGGTCTGGGTGTTAACGTTGACGGTATCGCGTTCAACAACGGTGACA



TCCCGTCTCTGAAAACCTTCTCTAACTGCGTTCAGGTTAAAGTTTCTCGTACCAACACCTCTCTGGTTC



AGACCCTGAACCGTTGGTTCGAAGGTGGTAAAGTTTCTCCGCCGTCTATCCAGTTCGAACGTGCGTAC



TACAAAAAAGACGACCAGATCCACGAAGACGCGGCGAAACGTAAAATCCGTTTCCAGATGCCGGCGA



CCGAACTGGTTCACGCGTCTGACGACGCGGGTTGGACCCCGTCTTACCTGCTGGGTATCGACCCGGGT



GAATACGGTATGGGTCTGTCTCTGGTTTCTATCAACAACGGTGAAGTTCTGGACTCTGGTTTCATCCAC



ATCAACTCTCTGATCAACTTCGCGTCTAAAAAATCTAACCACCAGACCAAAGTTGTTCCGCGTCAGCA



GTACAAATCTCCGTACGCGAACTACCTGGAACAGTCTAAAGACTCTGCGGCGGGTGACATCGCGCAC



ATCCTGGACCGTCTGATCTACAAACTGAACGCGCTGCCGGTTTTCGAAGCGCTGTCTGGTAACTCTCA



GTCTGCGGCGGACCAGGTTTGGACCAAAGTTCTGTCTTTCTACACCTGGGGTGACAACGACGCGCAGA



ACTCTATCCGTAAACAGCACTGGTTCGGTGCGTCTCACTGGGACATCAAAGGTATGCTGCGTCAGCCG



CCGACCGAAAAAAAACCGAAACCGTACATCGCGTTCCCGGGTTCTCAGGTTTCTTCTTACGGTAACTC



TCAGCGTTGCTCTTGCTGCGGTCGTAACCCGATCGAACAGCTGCGTGAAATGGCGAAAGACACCTCTA



TCAAAGAACTGAAAATCCGTAACTCTGAAATCCAGCTGTTCGACGGTACCATCAAACTGTTCAACCCG



GACCCGTCTACCGTTATCGAACGTCGTCGTCACAACCTGGGTCCGTCTCGTATCCCGGTTGCGGACCG



TACCTTCAAAAACATCTCTCCGTCTTCTCTGGAATTCAAAGAACTGATCACCATCGTTTCTCGTTCTAT



CCGTCACTCTCCGGAATTCATCGCGAAAAAACGTGGTATCGGTTCTGAATACTTCTGCGCGTACTCTG



ACTGCAACTCTTCTCTGAACTCTGAAGCGAACGCGGCGGCGAACGTTGCGCAGAAATTCCAGAAACA



GCTGTTCTTCGAACTGTAA





SEQ ID NO: 57
ATGAAACGTATCCTGAACTCTCTGAAAGTTGCGGCGCTGCGTCTGCTGTTCCGTGGTAAAGGTTCTGA



ACTGGTTAAAACCGTTAAATACCCGCTGGTTTCTCCGGTTCAGGGTGCGGTTGAAGAACTGGCGGAAG



CGATCCGTCACGACAACCTGCACCTGTTCGGTCAGAAAGAAATCGTTGACCTGATGGAAAAAGACGA


AGGTACCCAGGTTTACTCTGTTGTTGACTTCTGGCTGGACACCCTGCGTCTGGGTATGTTCTTCTCTCC



GTCTGCGAACGCGCTGAAAATCACCCTGGGTAAATTCAACTCTGACCAGGTTTCTCCGTTCCGTAAAG



TTCTGGAACAGTCTCCGTTCTTCCTGGCGGGTCGTCTGAAAGTTGAACCGGCGGAACGTATCCTGTCT



GTTGAAATCCGTAAAATCGGTAAACGTGAAAACCGTGTTGAAAACTACGCGGCGGACGTTGAAACCT



GCTTCATCGGTCAGCTGTCTTCTGACGAAAAACAGTCTATCCAGAAACTGGCGAACGACATCTGGGAC



TCTAAAGACCACGAAGAACAGCGTATGCTGAAAGCGGACTTCTTCGCGATCCCGCTGATCAAAGACC



CGAAAGCGGTTACCGAAGAAGACCCGGAAAACGAAACCGCGGGTAAACAGAAACCGCTGGAACTGT



GCGTTTGCCTGGTTCCGGAACTGTACACCCGTGGTTTCGGTTCTATCGCGGACTTCCTGGTTCAGCGTC



TGACCCTGCTGCGTGACAAAATGTCTACCGACACCGCGGAAGACTGCCTGGAATACGTTGGTATCGA



AGAAGAAAAAGGTAACGGTATGAACTCTCTGCTGGGTACCTTCCTGAAAAACCTGCAGGGTGACGGT



TTCGAACAGATCTTCCAGTTCATGCTGGGTTCTTACGTTGGTTGGCAGGGTAAAGAAGACGTTCTGCG



TGAACGTCTGGACCTGCTGGCGGAAAAAGTTAAACGTCTGCCGAAACCGAAATTCGCGGGTGAATGG



TCTGGTCACCGTATGTTCCTGCACGGTCAGCTGAAATCTTGGTCTTCTAACTTCTTCCGTCTGTTCAAC



GAAACCCGTGAACTGCTGGAATCTATCAAATCTGACATCCAGCACGCGACCATGCTGATCTCTTACGT



TGAAGAAAAAGGTGGTTACCACCCGCAGCTGCTGTCTCAGTACCGTAAACTGATGGAACAGCTGCCG



GCGCTGCGTACCAAAGTTCTGGACCCGGAAATCGAAATGACCCACATGTCTGAAGCGGTTCGTTCTTA



CATCATGATCCACAAATCTGTTGCGGGTTTCCTGCCGGACCTGCTGGAATCTCTGGACCGTGACAAAG



ACCGTGAATTCCTGCTGTCTATCTTCCCGCGTATCCCGAAAATCGACAAAAAAACCAAAGAAATCGTT



GCGTGGGAACTGCCGGGTGAACCGGAAGAAGGTTACCTGTTCACCGCGAACAACCTGTTCCGTAACT



TCCTGGAAAACCCGAAACACGTTCCGCGTTTCATGGCGGAACGTATCCCGGAAGACTGGACCCGTCTG



CGTTCTGCGCCGGTTTGGTTCGACGGTATGGTTAAACAGTGGCAGAAAGTTGTTAACCAGCTGGTTGA



ATCTCCGGGTGCGCTGTACCAGTTCAACGAATCTTTCCTGCGTCAGCGTCTGCAGGCGATGCTGACCG



TTTACAAACGTGACCTGCAGACCGAAAAATTCCTGAAACTGCTGGCGGACGTTTGCCGTCCGCTGGTT



GACTTCTTCGGTCTGGGTGGTAACGACATCATCTTCAAATCTTGCCAGGACCCGCGTAAACAGTGGCA



GACCGTTATCCCGCTGTCTGTTCCGGCGGACGTTTACACCGCGTGCGAAGGTCTGGCGATCCGTCTGC



GTGAAACCCTGGGTTTCGAATGGAAAAACCTGAAAGGTCACGAACGTGAAGACTTCCTGCGTCTGCA



CCAGCTGCTGGGTAACCTGCTGTTCTGGATCCGTGACGCGAAACTGGTTGTTAAACTGGAAGACTGGA



TGAACAACCCGTGCGTTCAGGAATACGTTGAAGCGCGTAAAGCGATCGACCTGCCGCTGGAAATCTT



CGGTTTCGAAGTTCCGATCTTCCTGAACGGTTACCTGTTCTCTGAACTGCGTCAGCTGGAACTGCTGCT



GCGTCGTAAATCTGTTATGACCTCTTACTCTGTTAAAACCACCGGTTCTCCGAACCGTCTGTTCCAGCT



GGTTTACCTGCCGCTGAACCCGTCTGACCCGGAAAAAAAAAACTCTAACAACTTCCAGGAACGTCTG



GACACCCCGACCGGTCTGTCTCGTCGTTTCCTGGACCTGACCCTGGACGCGTTCGCGGGTAAACTGCT



GACCGACCCGGTTACCCAGGAACTGAAAACCATGGCGGGTTTCTACGACCACCTGTTCGGTTTCAAAC



TGCCGTGCAAACTGGCGGCGATGTCTAACCACCCGGGTTCTTCTTCTAAAATGGTTGTTCTGGCGAAA



CCGAAAAAAGGTGTTGCGTCTAACATCGGTTTCGAACCGATCCCGGACCCGGCGCACCCGGTTTTCCG



TGTTCGTTCTTCTTGGCCGGAACTGAAATACCTGGAAGGTCTGCTGTACCTGCCGGAAGACACCCCGC



TGACCATCGAACTGGCGGAAACCTCTGTTTCTTGCCAGTCTGTTTCTTCTGTTGCGTTCGACCTGAAAA



ACCTGACCACCATCCTGGGTCGTGTTGGTGAATTCCGTGTTACCGCGGACCAGCCGTTCAAACTGACC



CCGATCATCCCGGAAAAAGAAGAATCTTTCATCGGTAAAACCTACCTGGGTCTGGACGCGGGTGAAC



GTTCTGGTGTTGGTTTCGCGATCGTTACCGTTGACGGTGACGGTTACGAAGTTCAGCGTCTGGGTGTTC



ACGAAGACACCCAGCTGATGGCGCTGCAGCAGGTTGCGTCTAAATCTCTGAAAGAACCGGTTTTCCA



GCCGCTGCGTAAAGGTACCTTCCGTCAGCAGGAACGTATCCGTAAATCTCTGCGTGGTTGCTACTGGA



ACTTCTACCACGCGCTGATGATCAAATACCGTGCGAAAGTTGTTCACGAAGAATCTGTTGGTTCTTCT



GGTCTGGTTGGTCAGTGGCTGCGTGCGTTCCAGAAAGACCTGAAAAAAGCGGACGTTCTGCCGAAAA



AAGGTGGTAAAAACGGTGTTGACAAAAAAAAACGTGAATCTTCTGCGCAGGACACCCTGTGGGGTGG



TGCGTTCTCTAAAAAAGAAGAACAGCAGATCGCGTTCGAAGTTCAGGCGGCGGGTTCTTCTCAGTTCT



GCCTGAAATGCGGTTGGTGGTTCCAGCTGGGTATGCGTGAAGTTAACCGTGTTCAGGAATCTGGTGTT



GTTCTGGACTGGAACCGTTCTATCGTTACCTTCCTGATCGAATCTTCTGGTGAAAAAGTTTACGGTTTC



TCTCCGCAGCAGCTGGAAAAAGGTTTCCGTCCGGACATCGAAACCTTCAAAAAAATGGTTCGTGACTT



CATGCGTCCGCCGATGTTCGACCGTAAAGGTCGTCCGGCGGCGGCGTACGAACGTTTCGTTCTGGGTC



GTCGTCACCGTCGTTACCGTTTCGACAAAGTTTTCGAAGAACGTTTCGGTCGTTCTGCGCTGTTCATCT



GCCCGCGTGTTGGTTGCGGTAACTTCGACCACTCTTCTGAACAGTCTGCGGTTGTTCTGGCGCTGATCG



GTTACATCGCGGACAAAGAAGGTATGTCTGGTAAAAAACTGGTTTACGTTCGTCTGGCGGAACTGATG



GCGGAATGGAAACTGAAAAAACTGGAACGTTCTCGTGTTGAAGAACAGTCTTCTGCGCAGTAA





SEQ ID NO: 58
ATGGCGGAATCTAAACAGATGCAGTGCCGTAAATGCGGTGCGTCTATGAAATACGAAGTTATCGGTC



TGGGTAAAAAATCTTGCCGTTACATGTGCCCGGACTGCGGTAACCACACCTCTGCGCGTAAAATCCAG



AACAAAAAAAAACGTGACAAAAAATACGGTTCTGCGTCTAAAGCGCAGTCTCAGCGTATCGCGGTTG


CGGGTGCGCTGTACCCGGACAAAAAAGTTCAGACCATCAAAACCTACAAATACCCGGCGGACCTGAA



CGGTGAAGTTCACGACTCTGGTGTTGCGGAAAAAATCGCGCAGGCGATCCAGGAAGACGAAATCGGT



CTGCTGGGTCCGTCTTCTGAATACGCGTGCTGGATCGCGTCTCAGAAACAGTCTGAACCGTACTCTGT



TGTTGACTTCTGGTTCGACGCGGTTTGCGCGGGTGGTGTTTTCGCGTACTCTGGTGCGCGTCTGCTGTC



TACCGTTCTGCAGCTGTCTGGTGAAGAATCTGTTCTGCGTGCGGCGCTGGCGTCTTCTCCGTTCGTTGA



CGACATCAACCTGGCGCAGGCGGAAAAATTCCTGGCGGTTTCTCGTCGTACCGGTCAGGACAAACTG



GGTAAACGTATCGGTGAATGCTTCGCGGAAGGTCGTCTGGAAGCGCTGGGTATCAAAGACCGTATGC



GTGAATTCGTTCAGGCGATCGACGTTGCGCAGACCGCGGGTCAGCGTTTCGCGGCGAAACTGAAAAT



CTTCGGTATCTCTCAGATGCCGGAAGCGAAACAGTGGAACAACGACTCTGGTCTGACCGTTTGCATCC



TGCCGGACTACTACGTTCCGGAAGAAAACCGTGCGGACCAGCTGGTTGTTCTGCTGCGTCGTCTGCGT



GAAATCGCGTACTGCATGGGTATCGAAGACGAAGCGGGTTTCGAACACCTGGGTATCGACCCGGGTG



CGCTGTCTAACTTCTCTAACGGTAACCCGAAACGTGGTTTCCTGGGTCGTCTGCTGAACAACGACATC



ATCGCGCTGGCGAACAACATGTCTGCGATGACCCCGTACTGGGAAGGTCGTAAAGGTGAACTGATCG



AACGTCTGGCGTGGCTGAAACACCGTGCGGAAGGTCTGTACCTGAAAGAACCGCACTTCGGTAACTC



TTGGGCGGACCACCGTTCTCGTATCTTCTCTCGTATCGCGGGTTGGCTGTCTGGTTGCGCGGGTAAACT



GAAAATCGCGAAAGACCAGATCTCTGGTGTTCGTACCGACCTGTTCCTGCTGAAACGTCTGCTGGACG



CGGTTCCGCAGTCTGCGCCGTCTCCGGACTTCATCGCGTCTATCTCTGCGCTGGACCGTTTCCTGGAAG



CGGCGGAATCTTCTCAGGACCCGGCGGAACAGGTTCGTGCGCTGTACGCGTTCCACCTGAACGCGCCG



GCGGTTCGTTCTATCGCGAACAAAGCGGTTCAGCGTTCTGACTCTCAGGAATGGCTGATCAAAGAACT



GGACGCGGTTGACCACCTGGAATTCAACAAAGCGTTCCCGTTCTTCTCTGACACCGGTAAAAAAAAA



AAAAAAGGTGCGAACTCTAACGGTGCGCCGTCTGAAGAAGAATACACCGAAACCGAATCTATCCAGC



AGCCGGAAGACGCGGAACAGGAAGTTAACGGTCAGGAAGGTAACGGTGCGTCTAAAAACCAGAAAA



AATTCCAGCGTATCCCGCGTTTCTTCGGTGAAGGTTCTCGTTCTGAATACCGTATCCTGACCGAAGCGC



CGCAGTACTTCGACATGTTCTGCAACAACATGCGTGCGATCTTCATGCAGCTGGAATCTCAGCCGCGT



AAAGCGCCGCGTGACTTCAAATGCTTCCTGCAGAACCGTCTGCAGAAACTGTACAAACAGACCTTCCT



GAACGCGCGTTCTAACAAATGCCGTGCGCTGCTGGAATCTGTTCTGATCTCTTGGGGTGAATTCTACA



CCTACGGTGCGAACGAAAAAAAATTCCGTCTGCGTCACGAAGCGTCTGAACGTTCTTCTGACCCGGAC



TACGTTGTTCAGCAGGCGCTGGAAATCGCGCGTCGTCTGTTCCTGTTCGGTTTCGAATGGCGTGACTG



CTCTGCGGGTGAACGTGTTGACCTGGTTGAAATCCACAAAAAAGCGATCTCTTTCCTGCTGGCGATCA



CCCAGGCGGAAGTTTCTGTTGGTTCTTACAACTGGCTGGGTAACTCTACCGTTTCTCGTTACCTGTCTG



TTGCGGGTACCGACACCCTGTACGGTACCCAGCTGGAAGAATTCCTGAACGCGACCGTTCTGTCTCAG



ATGCGTGGTCTGGCGATCCGTCTGTCTTCTCAGGAACTGAAAGACGGTTTCGACGTTCAGCTGGAATC



TTCTTGCCAGGACAACCTGCAGCACCTGCTGGTTTACCGTGCGTCTCGTGACCTGGCGGCGTGCAAAC



GTGCGACCTGCCCGGCGGAACTGGACCCGAAAATCCTGGTTCTGCCGGTTGGTGCGTTCATCGCGTCT



GTTATGAAAATGATCGAACGTGGTGACGAACCGCTGGCGGGTGCGTACCTGCGTCACCGTCCGCACTC



TTTCGGTTGGCAGATCCGTGTTCGTGGTGTTGCGGAAGTTGGTATGGACCAGGGTACCGCGCTGGCGT



TCCAGAAACCGACCGAATCTGAACCGTTCAAAATCAAACCGTTCTCTGCGCAGTACGGTCCGGTTCTG



TGGCTGAACTCTTCTTCTTACTCTCAGTCTCAGTACCTGGACGGTTTCCTGTCTCAGCCGAAAAACTGG



TCTATGCGTGTTCTGCCGCAGGCGGGTTCTGTTCGTGTTGAACAGCGTGTTGCGCTGATCTGGAACCTG



CAGGCGGGTAAAATGCGTCTGGAACGTTCTGGTGCGCGTGCGTTCTTCATGCCGGTTCCGTTCTCTTTC



CGTCCGTCTGGTTCTGGTGACGAAGCGGTTCTGGCGCCGAACCGTTACCTGGGTCTGTTCCCGCACTCT



GGTGGTATCGAATACGCGGTTGTTGACGTTCTGGACTCTGCGGGTTTCAAAATCCTGGAACGTGGTAC



CATCGCGGTTAACGGTTTCTCTCAGAAACGTGGTGAACGTCAGGAAGAAGCGCACCGTGAAAAACAG



CGTCGTGGTATCTCTGACATCGGTCGTAAAAAACCGGTTCAGGCGGAAGTTGACGCGGCGAACGAAC



TGCACCGTAAATACACCGACGTTGCGACCCGTCTGGGTTGCCGTATCGTTGTTCAGTGGGCGCCGCAG



CCGAAACCGGGTACCGCGCCGACCGCGCAGACCGTTTACGCGCGTGCGGTTCGTACCGAAGCGCCGC



GTTCTGGTAACCAGGAAGACCACGCGCGTATGAAATCTTCTTGGGGTTACACCTGGGGTACCTACTGG



GAAAAACGTAAACCGGAAGACATCCTGGGTATCTCTACCCAGGTTTACTGGACCGGTGGTATCGGTG



AATCTTGCCCGGCGGTTGCGGTTGCGCTGCTGGGTCACATCCGTGCGACCTCTACCCAGACCGAATGG



GAAAAAGAAGAAGTTGTTTTCGGTCGTCTGAAAAAATTCTTCCCGTCTTAA





SEQ ID NO: 59
ATGGAAAAACGTATCAACAAAATCCGTAAAAAACTGTCTGCGGACAACGCGACCAAACCGGTTTCTC



GTTCTGGTCCGATGAAAACCCTGCTGGTTCGTGTTATGACCGACGACCTGAAAAAACGTCTGGAAAAA



CGTCGTAAAAAACCGGAAGTTATGCCGCAGGTTATCTCTAACAACGCGGCGAACAACCTGCGTATGC


TGCTGGACGACTACACCAAAATGAAAGAAGCGATCCTGCAGGTTTACTGGCAGGAATTCAAAGACGA



CCACGTTGGTCTGATGTGCAAATTCGCGCAGCCGGCGTCTAAAAAAATCGACCAGAACAAACTGAAA



CCGGAAATGGACGAAAAAGGTAACCTGACCACCGCGGGTTTCGCGTGCTCTCAGTGCGGTCAGCCGC



TGTTCGTTTACAAACTGGAACAGGTTTCTGAAAAAGGTAAAGCGTACACCAACTACTTCGGTCGTTGC



AACGTTGCGGAACACGAAAAACTGATCCTGCTGGCGCAGCTGAAACCGGAAAAAGACTCTGACGAAG



CGGTTACCTACTCTCTGGGTAAATTCGGTCAGCGTGCGCTGGACTTCTACTCTATCCACGTTACCAAAG



AATCTACCCACCCGGTTAAACCGCTGGCGCAGATCGCGGGTAACCGTTACGCGTCTGGTCCGGTTGGT



AAAGCGCTGTCTGACGCGTGCATGGGTACCATCGCGTCTTTCCTGTCTAAATACCAGGACATCATCAT



CGAACACCAGAAAGTTGTTAAAGGTAACCAGAAACGTCTGGAATCTCTGCGTGAACTGGCGGGTAAA



GAAAACCTGGAATACCCGTCTGTTACCCTGCCGCCGCAGCCGCACACCAAAGAAGGTGTTGACGCGT



ACAACGAAGTTATCGCGCGTGTTCGTATGTGGGTTAACCTGAACCTGTGGCAGAAACTGAAACTGTCT



CGTGACGACGCGAAACCGCTGCTGCGTCTGAAAGGTTTCCCGTCTTTCCCGGTTGTTGAACGTCGTGA



AAACGAAGTTGACTGGTGGAACACCATCAACGAAGTTAAAAAACTGATCGACGCGAAACGTGACATG



GGTCGTGTTTTCTGGTCTGGTGTTACCGCGGAAAAACGTAACACCATCCTGGAAGGTTACAACTACCT



GCCGAACGAAAACGACCACAAAAAACGTGAAGGTTCTCTGGAAAACCCGAAAAAACCGGCGAAACG



TCAGTTCGGTGACCTGCTGCTGTACCTGGAAAAAAAATACGCGGGTGACTGGGGTAAAGTTTTCGACG



AAGCGTGGGAACGTATCGACAAAAAAATCGCGGGTCTGACCTCTCACATCGAACGTGAAGAAGCGCG



TAACGCGGAAGACGCGCAGTCTAAAGCGGTTCTGACCGACTGGCTGCGTGCGAAAGCGTCTTTCGTTC



TGGAACGTCTGAAAGAAATGGACGAAAAAGAATTCTACGCGTGCGAAATCCAGCTGCAGAAATGGTA



CGGTGACCTGCGTGGTAACCCGTTCGCGGTTGAAGCGGAAAACCGTGTTGTTGACATCTCTGGTTTCT



CTATCGGTTCTGACGGTCACTCTATCCAGTACCGTAACCTGCTGGCGTGGAAATACCTGGAAAACGGT



AAACGTGAATTCTACCTGCTGATGAACTACGGTAAAAAAGGTCGTATCCGTTTCACCGACGGTACCGA



CATCAAAAAATCTGGTAAATGGCAGGGTCTGCTGTACGGTGGTGGTAAAGCGAAAGTTATCGACCTG



ACCTTCGACCCGGACGACGAACAGCTGATCATCCTGCCGCTGGCGTTCGGTACCCGTCAGGGTCGTGA



ATTCATCTGGAACGACCTGCTGTCTCTGGAAACCGGTCTGATCAAACTGGCGAACGGTCGTGTTATCG



AAAAAACCATCTACAACAAAAAAATCGGTCGTGACGAACCGGCGCTGTTCGTTGCGCTGACCTTCGA



ACGTCGTGAAGTTGTTGACCCGTCTAACATCAAACCGGTTAACCTGATCGGTGTTGACCGTGGTGAAA



ACATCCCGGCGGTTATCGCGCTGACCGACCCGGAAGGTTGCCCGCTGCCGGAATTCAAAGACTCTTCT



GGTGGTCCGACCGACATCCTGCGTATCGGTGAAGGTTACAAAGAAAAACAGCGTGCGATCCAGGCGG



CGAAAGAAGTTGAACAGCGTCGTGCGGGTGGTTACTCTCGTAAATTCGCGTCTAAATCTCGTAACCTG



GCGGACGACATGGTTCGTAACTCTGCGCGTGACCTGTTCTACCACGCGGTTACCCACGACGCGGTTCT



GGTTTTCGAAAACCTGTCTCGTGGTTTCGGTCGTCAGGGTAAACGTACCTTCATGACCGAACGTCAGT



ACACCAAAATGGAAGACTGGCTGACCGCGAAACTGGCGTACGAAGGTCTGACCTCTAAAACCTACCT



GTCTAAAACCCTGGCGCAGTACACCTCTAAAACCTGCTCTAACTGCGGTTTCACCATCACCACCGCGG



ACTACGACGGTATGCTGGTTCGTCTGAAAAAAACCTCTGACGGTTGGGCGACCACCCTGAACAACAA



AGAACTGAAAGCGGAAGGTCAGATCACCTACTACAACCGTTACAAACGTCAGACCGTTGAAAAAGAA



CTGTCTGCGGAACTGGACCGTCTGTCTGAAGAATCTGGTAACAACGACATCTCTAAATGGACCAAAG



GTCGTCGTGACGAAGCGCTGTTCCTGCTGAAAAAACGTTTCTCTCACCGTCCGGTTCAGGAACAGTTC



GTTTGCCTGGACTGCGGTCACGAAGTTCACGCGGACGAACAGGCGGCGCTGAACATCGCGCGTTCTTG



GCTGTTCCTGAACTCTAACTCTACCGAATTCAAATCTTACAAATCTGGTAAACAGCCGTTCGTTGGTGC



GTGGCAGGCGTTCTACAAACGTCGTCTGAAAGAAGTTTGGAAACCGAACGCG





SEQ ID NO: 60
ATGAAACGTATCAACAAAATCCGTCGTCGTCTGGTTAAAGACTCTAACACCAAAAAAGCGGGTAAAA



CCGGTCCGATGAAAACCCTGCTGGTTCGTGTTATGACCCCGGACCTGCGTGAACGTCTGGAAAACCTG



CGTAAAAAACCGGAAAACATCCCGCAGCCGATCTCTAACACCTCTCGTGCGAACCTGAACAAACTGC


TGACCGACTACACCGAAATGAAAAAAGCGATCCTGCACGTTTACTGGGAAGAATTCCAGAAAGACCC



GGTTGGTCTGATGTCTCGTGTTGCGCAGCCGGCGCCGAAAAACATCGACCAGCGTAAACTGATCCCGG



TTAAAGACGGTAACGAACGTCTGACCTCTTCTGGTTTCGCGTGCTCTCAGTGCTGCCAGCCGCTGTAC



GTTTACAAACTGGAACAGGTTAACGACAAAGGTAAACCGCACACCAACTACTTCGGTCGTTGCAACG



TTTCTGAACACGAACGTCTGATCCTGCTGTCTCCGCACAAACCGGAAGCGAACGACGAACTGGTTACC



TACTCTCTGGGTAAATTCGGTCAGCGTGCGCTGGACTTCTACTCTATCCACGTTACCCGTGAATCTAAC



CACCCGGTTAAACCGCTGGAACAGATCGGTGGTAACTCTTGCGCGTCTGGTCCGGTTGGTAAAGCGCT



GTCTGACGCGTGCATGGGTGCGGTTGCGTCTTTCCTGACCAAATACCAGGACATCATCCTGGAACACC



AGAAAGTTATCAAAAAAAACGAAAAACGTCTGGCGAACCTGAAAGACATCGCGTCTGCGAACGGTCT



GGCGTTCCCGAAAATCACCCTGCCGCCGCAGCCGCACACCAAAGAAGGTATCGAAGCGTACAACAAC



GTTGTTGCGCAGATCGTTATCTGGGTTAACCTGAACCTGTGGCAGAAACTGAAAATCGGTCGTGACGA



AGCGAAACCGCTGCAGCGTCTGAAAGGTTTCCCGTCTTTCCCGCTGGTTGAACGTCAGGCGAACGAAG



TTGACTGGTGGGACATGGTTTGCAACGTTAAAAAACTGATCAACGAAAAAAAAGAAGACGGTAAAGT



TTTCTGGCAGAACCTGGCGGGTTACAAACGTCAGGAAGCGCTGCTGCCGTACCTGTCTTCTGAAGAAG



ACCGTAAAAAAGGTAAAAAATTCGCGCGTTACCAGTTCGGTGACCTGCTGCTGCACCTGGAAAAAAA



ACACGGTGAAGACTGGGGTAAAGTTTACGACGAAGCGTGGGAACGTATCGACAAAAAAGTTGAAGG



TCTGTCTAAACACATCAAACTGGAAGAAGAACGTCGTTCTGAAGACGCGCAGTCTAAAGCGGCGCTG



ACCGACTGGCTGCGTGCGAAAGCGTCTTTCGTTATCGAAGGTCTGAAAGAAGCGGACAAAGACGAAT



TCTGCCGTTGCGAACTGAAACTGCAGAAATGGTACGGTGACCTGCGTGGTAAACCGTTCGCGATCGA



AGCGGAAAACTCTATCCTGGACATCTCTGGTTTCTCTAAACAGTACAACTGCGCGTTCATCTGGCAGA



AAGACGGTGTTAAAAAACTGAACCTGTACCTGATCATCAACTACTTCAAAGGTGGTAAACTGCGTTTC



AAAAAAATCAAACCGGAAGCGTTCGAAGCGAACCGTTTCTACACCGTTATCAACAAAAAATCTGGTG



AAATCGTTCCGATGGAAGTTAACTTCAACTTCGACGACCCGAACCTGATCATCCTGCCGCTGGCGTTC



GGTAAACGTCAGGGTCGTGAATTCATCTGGAACGACCTGCTGTCTCTGGAAACCGGTTCTCTGAAACT



GGCGAACGGTCGTGTTATCGAAAAAACCCTGTACAACCGTCGTACCCGTCAGGACGAACCGGCGCTG



TTCGTTGCGCTGACCTTCGAACGTCGTGAAGTTCTGGACTCTTCTAACATCAAACCGATGAACCTGAT



CGGTATCGACCGTGGTGAAAACATCCCGGCGGTTATCGCGCTGACCGACCCGGAAGGTTGCCCGCTGT



CTCGTTTCAAAGACTCTCTGGGTAACCCGACCCACATCCTGCGTATCGGTGAATCTTACAAAGAAAAA



CAGCGTACCATCCAGGCGGCGAAAGAAGTTGAACAGCGTCGTGCGGGTGGTTACTCTCGTAAATACG



CGTCTAAAGCGAAAAACCTGGCGGACGACATGGTTCGTAACACCGCGCGTGACCTGCTGTACTACGC



GGTTACCCAGGACGCGATGCTGATCTTCGAAAACCTGTCTCGTGGTTTCGGTCGTCAGGGTAAACGTA



CCTTCATGGCGGAACGTCAGTACACCCGTATGGAAGACTGGCTGACCGCGAAACTGGCGTACGAAGG



TCTGCCGTCTAAAACCTACCTGTCTAAAACCCTGGCGCAGTACACCTCTAAAACCTGCTCTAACTGCG



GTTTCACCATCACCTCTGCGGACTACGACCGTGTTCTGGAAAAACTGAAAAAAACCGCGACCGGTTGG



ATGACCACCATCAACGGTAAAGAACTGAAAGTTGAAGGTCAGATCACCTACTACAACCGTTACAAAC



GTCAGAACGTTGTTAAAGACCTGTCTGTTGAACTGGACCGTCTGTCTGAAGAATCTGTTAACAACGAC



ATCTCTTCTTGGACCAAAGGTCGTTCTGGTGAAGCGCTGTCTCTGCTGAAAAAACGTTTCTCTCACCGT



CCGGTTCAGGAAAAATTCGTTTGCCTGAACTGCGGTTTCGAAACCCACGCGGACGAACAGGCGGCGC



TGAACATCGCGCGTTCTTGGCTGTTCCTGCGTTCTCAGGAATACAAAAAATACCAGACCAACAAAACC



ACCGGTAACACCGACAAACGTGCGTTCGTTGAAACCTGGCAGTCTTTCTACCGTAAAAAACTGAAAG



AAGTTTGGAAACCG





SEQ ID NO: 61

AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAAGCATTGATAATTGA



GATCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGACAAAAATAAATTATTTATTTATCCAGA



AAATGAATTGGAAAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAGGTtg



ccgtcactgcgtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattctg



taacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataatcacggcagaaaagtcc



acattgattatttgcacggcgtcacactttgctatgccatagcatttttatccataagattagcggatcc



tacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggctagcaccgcctat



ctcgtgtgagataggcggagatacgaactttaagAAGGAGatataccATGGGT



AAAATGTATTACCTTGGTTTAGACATTGGCACGAATTCCGTGGGCTACGCGGTGACCGACCCCTCATA



CCACCTGCTGAAGTTTAAGGGGGAACCAATGTGGGGTGCGCACGTATTTGCCGCCGGTAATCAGAGC



GCGGAACGACGCTCGTTCCGCACATCGCGTCGTCGTTTGGACCGACGCCAACAGCGCGTTAAACTGGT



ACAGGAGATTTTTGCCCCGGTGATTAGTCCGATCGACCCACGCTTCTTCATTCGTCTGCATGAATCCGC



CCTGTGGCGCGATGACGTCGCGGAGACGGATAAACATATCTTTTTCAATGATCCTACCTATACCGATA



AGGAATATTATAGCGATTACCCGACTATCCATCACCTGATCGTTGATCTGATGGAAAGCTCTGAGAAA



CACGATCCGCGGCTGGTGTACCTTGCAGTGGCGTGGTTAGTGGCACACCGTGGTCATTTTCTGAACGA



GGTGGACAAGGATAATATTGGAGATGTGTTGTCGTTCGACGCATTTTATCCGGAGTTTCTCGCGTTCCT



GTCGGACAACGGTGTATCACCGTGGGTGTGCGAAAGCAAAGCGCTGCAGGCGACCTTGCTGAGCCGT



AACTCAGTGAACGACAAATATAAAGCCCTTAAGTCTCTGATCTTCGGATCCCAGAAACCTGAAGATA



ACTTCGATGCCAATATTTCGGAAGATGGACTCATTCAACTGCTGGCCGGCAAAAAGGTAAAAGTTAA



CAAACTGTTCCCTCAGGAATCGAACGATGCATCCTTCACATTGAATGATAAAGAAGACGCGATAGAA



GAAATCCTGGGTACGCTTACACCAGATGAATGTGAATGGATTGCGCATATACGCCGCCTTTTTGACTG



GGCTATCATGAAACATGCTCTGAAAGATGGCAGGACTATTAGCGAGTCAAAAGTCAAACTGTATGAG



CAGCACCATCACGATCTGACCCAACTTAAATACTTCGTGAAAACCTACCTTGCAAAAGAATACGACG



ATATTTTCCGCAACGTGGATAGCGAAACAACGAAAAACTATGTAGCGTATTCCTATCATGTGAAAGA



GGTGAAAGGCACTCTGCCTAAAAATAAGGCAACGCAAGAAGAGTTTTGTAAGTATGTCCTGGGCAAG



GTTAAAAACATTGAATGCTCTGAAGCAGACAAGGTTGACTTTGATGAGATGATTCAGCGTCTTACCGA



CAACTCTTTTATGCCTAAGCAGGTTTCGGGCGAAAACCGCGTTATTCCTTATCAGTTATATTATTATGA



ACTGAAGACAATTCTGAATAAAGCAGCCTCGTACCTGCCTTTCCTGACGCAGTGTGGAAAAGATGCA



ATTTCGAACCAGGACAAACTACTGTCGATCATGACGTTCCGTATTCCTTACTTCGTCGGACCCTTGCGA



AAAGATAATTCGGAACATGCATGGCTCGAACGAAAGGCCGGTAAGATTTATCCGTGGAACTTTAACG



ACAAAGTGGACTTGGATAAATCAGAAGAAGCGTTCATTCGCCGAATGACCAATACCTGTACCTATTAT



CCCGGCGAAGATGTTTTACCGTTGGATTCGCTGATCTATGAGAAATTTATGATTTTAAATGAAATCAA



TAATATTCGTATTGACGGCTACCCGATTAGTGTTGACGTTAAACAGCAGGTTTTTGGCTTGTTCGAAA



AAAAACGACGCGTAACCGTGAAAGATATTCAGAACCTGCTGCTGTCTCTCGGAGCTCTGGACAAACA



CGGGAAGCTGACAGGCATCGATACCACTATCCACTCAAACTATAATACGTATCACCATTTTAAATCTC



TCATGGAACGCGGCGTCCTGACCCGGGATGACGTGGAACGCATCGTTGAAAGGATGACCTACAGCGA



CGATACTAAGCGTGTGCGTCTGTGGCTGAATAACAACTATGGTACTTTAACCGCCGACGATGTGAAAC



ACATTTCGCGTCTGCGCAAACACGATTTTGGCCGTTTATCCAAAATGTTCTTAACAGGTCTGAAGGGT



GTCCATAAGGAGACCGGTGAACGTGCCTCCATACTGGATTTCATGTGGAACACGAACGATAACCTGA



TGCAGCTCCTTTCCGAATGCTACACGTTCAGTGATGAAATCACAAAGCTGCAAGAGGCGTATTATGCA



AAAGCCCAGTTGTCTTTAAACGATTTTTTAGACTCGATGTACATCTCTAACGCGGTGAAACGTCCGAT



TTACAGAACTCTGGCAGTGGTGAACGATATTCGAAAAGCATGTGGGACGGCCCCTAAACGCATTTTCA



TCGAAATGGCTCGTGATGGTGAATCAAAAAAAAAGAGAAGTGTTACACGTCGCGAGCAGATCAAAAA



CCTGTACCGCTCGATTCGTAAAGATTTCCAGCAGGAAGTTGATTTTCTGGAAAAGATCCTGGAAAATA



AATCTGATGGTCAACTTCAGTCAGATGCTTTGTATCTTTACTTTGCACAATTAGGGCGCGATATGTACA



CGGGCGATCCAATAAAGCTGGAGCACATCAAAGATCAGAGTTTCTATAACATAGACCATATTTACCC



GCAGTCTATGGTGAAAGACGATTCCCTAGATAACAAAGTGCTGGTGCAAAGCGAAATTAACGGCGAG



AAAAGCTCGCGATACCCTTTGGACGCCGCGATCCGCAATAAAATGAAGCCCCTTTGGGACGCTTACTA



TAATCATGGCCTGATCTCCTTAAAGAAATACCAGCGTCTAACGCGCTCGACCCCGTTTACCGATGATG



AAAAATGGGACTTTATTAATCGCCAGTTAGTGGAAACCCGTCAATCTACCAAAGCGCTGGCCATTTTG



TTGAAGCGTAAGTTTCCAGACACCGAAATTGTGTATTCGAAGGCGGGGTTATCGTCCGACTTCAGACA



TGAATTCGGCCTTGTAAAAAGTCGCAATATTAATGATTTGCACCACGCTAAAGACGCATTCTTGGCTA



TCGTTACCGGCAATGTGTACCATGAAAGATTCAATCGCAGATGGTTTATGGTGAACCAGCCGTACTCA



GTTAAAACTAAAACTCTTTTTACCCACAGCATAAAGAATGGCAACTTCGTTGCCTGGAACGGCGAAGA



AGATCTCGGTCGTATTGTAAAAATGCTGAAGCAAAACAAAAATACCATTCACTTCACGCGCTTCTCCT



TCGATCGCAAAGAAGGATTATTTGATATCCAACCTCTGAAAGCCAGCACCGGCTTAGTCCCACGAAA



AGCCGGTCTGGATGTCGTTAAATACGGCGGATATGACAAATCTACCGCGGCCTATTACCTGCTGGTGA



GGTTCACGCTCGAGGACAAGAAAACCCAGCACAAGCTGATGATGATTCCTGTAGAAGGCCTGTACAA



GGCTCGCATTGATCATGACAAGGAATTTCTTACCGATTATGCGCAAACGACTATAAGCGAAATCCTAC



AGAAAGATAAACAGAAAGTGATCAATATTATGTTTCCAATGGGTACGAGGCATATAAAACTCAATTC



AATGATTAGTATCGATGGCTTCTATCTTAGTATCGGCGGAAAGTCCTCTAAAGGTAAGTCAGTTCTAT



GTCACGCAATGGTTCCACTGATCGTCCCTCACAAAATCGAATGTTACATTAAAGCAATGGAAAGCTTC



GCCCGGAAGTTTAAAGAAAACAACAAGCTGCGCATCGTAGAAAAATTCGATAAAATCACCGTTGAAG



ACAACCTGAATCTCTACGAGCTCTTTCTCCAAAAACTGCAGCATAATCCCTATAATAAGTTTTTTTCGA



CACAGTTTGACGTACTGACGAACGGCCGTTCTACTTTCACAAAACTGTCGCCGGAGGAACAGGTACA



GACGCTCTTGAACATTTTAAGTATCTTTAAAACATGCCGCAGTTCGGGTTGCGACCTGAAATCCATCA



ACGGCAGTGCCCAGGCAGCGCGCATCATGATTAGCGCTGACTTAACTGGACTGTCGAAAAAATATTC



AGATATTAGGTTGGTTGAACAGTCAGCTTCTGGTTTGTTCGTATCCAAAAGTCAGAACTTACTGGAGT



ATCTCTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATTTATTATATCGCGTTGA



TTATTGATGCTGTTTTTAGTTTTAACGGCAATTAATATATGTGTTATTAATTGAATGAATTTTATCATTC



ATAATAAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAA



AGAGGATTACAGAATTATCTCATAACAAGTGTTAAGGGATGTTATTTCC





SEQ ID NO: 62

AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAAGCATTGATAATTGA



GATCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGACAAAAATAAATTATTTATTTATCCAGA



AAATGAATTGGAAAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAGGTtg



ccgtcactgcgtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattctg



taacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataatcacggcagaaaagtcc



acattgattatttgcacggcgtcacactttgctatgccatagcatttttatccataagattagcggatcc



tacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggctagcaccgcctat



ctcgtgtgagataggcggagatacgaactttaagAAGGAGatataccATGTCA



TCGCTCACGAAATTCACTAACAAATACTCTAAACAGCTCACCATTAAGAATGAACTCATCCCAGTTGG



CAAAACACTGGAGAACATCAAAGAGAATGGTCTGATAGATGGCGACGAACAGCTGAATGAGAATTAT



CAGAAGGCGAAAATTATTGTGGATGATTTTCTGCGGGACTTCATTAATAAAGCACTGAATAATACGCA



GATCGGGAACTGGCGCGAACTGGCGGATGCCCTTAATAAAGAGGATGAAGATAACATCGAGAAATTG



CAGGATAAAATTCGGGGAATCATTGTATCCAAATTTGAAACGTTTGATCTGTTTAGCAGCTATTCTATT



AAGAAAGATGAAAAGATTATTGACGACGACAATGATGTTGAAGAAGAGGAACTGGATCTGGGCAAG



AAGACCAGCTCATTTAAATACATATTTAAAAAAAACCTGTTTAAGTTAGTGTTGCCATCCTACCTGAA



AACCACAAACCAGGACAAGCTGAAGATTATTAGCTCGTTTGATAATTTTTCAACGTACTTCCGCGGGT



TCTTTGAAAACCGGAAAAACATTTTTACCAAGAAACCGATCTCCACAAGTATTGCGTATCGCATTGTT



CATGATAACTTCCCGAAATTCCTTGATAACATTCGTTGTTTTAATGTGTGGCAGACGGAATGCCCGCA



ACTAATCGTGAAAGCAGATAACTATCTGAAAAGCAAAAATGTTATAGCGAAAGATAAAAGTTTGGCA



AACTATTTTACCGTGGGCGCGTATGACTATTTCCTGTCTCAGAATGGTATAGATTTTTACAACAATATT



ATAGGTGGACTGCCAGCGTTCGCCGGCCATGAGAAAATCCAAGGTCTCAATGAATTCATCAATCAAG



AGTGCCAAAAAGACAGCGAGCTGAAAAGTAAGCTGAAAAACCGTCACGCGTTCAAAATGGCGGTACT



GTTCAAACAGATACTCAGCGATCGTGAAAAAAGTTTTGTAATTGATGAGTTCGAGTCGGATGCTCAAG



TTATTGACGCCGTTAAAAACTTTTACGCCGAACAGTGCAAAGATAACAATGTTATTTTTAACTTATTA



AATCTTATCAAGAATATCGCTTTCTTAAGTGATGACGAACTGGACGGCATATTCATTGAAGGGAAATA



CCTGTCGAGCGTTAGTCAAAAACTCTATAGCGATTGGTCAAAATTACGTAACGACATTGAGGATTCGG



CTAACTCTAAACAAGGCAATAAAGAGCTGGCCAAGAAGATCAAAACCAACAAAGGGGATGTAGAAA



AAGCGATCTCGAAATATGAGTTCTCGCTGTCGGAACTGAACTCGATTGTACATGATAACACCAAGTTT



TCTGACCTCCTTAGTTGTACACTGCATAAGGTGGCTTCTGAGAAACTGGTGAAGGTCAATGAAGGCGA



CTGGCCGAAACATCTCAAGAATAATGAAGAGAAACAAAAAATCAAAGAGCCGCTTGATGCTCTGCTG



GAGATCTATAATACACTTCTGATTTTTAACTGCAAAAGCTTCAATAAAAACGGCAACTTCTATGTCGA



CTATGATCGTTGCATCAATGAACTGAGTTCGGTCGTGTATCTGTATAATAAAACACGTAACTATTGCA



CTAAAAAACCCTATAACACGGACAAGTTCAAACTCAATTTTAACAGTCCGCAGCTCGGTGAAGGCTTT



TCCAAGTCGAAAGAAAATGACTGTCTGACTCTTTTGTTTAAAAAAGACGACAACTATTATGTAGGCAT



TATCCGCAAAGGTGCAAAAATCAATTTTGATGATACACAAGCAATCGCCGATAACACCGACAATTGC



ATCTTTAAAATGAATTATTTCCTACTTAAAGACGCAAAAAAATTTATCCCGAAATGTAGCATTCAGCT



GAAAGAAGTCAAGGCCCATTTTAAGAAATCTGAAGATGATTACATTTTGTCTGATAAAGAGAAATTTG



CTAGCCCGCTGGTCATTAAAAAGAGCACATTTTTGCTGGCAACTGCACATGTGAAAGGGAAAAAAGG



CAATATCAAGAAATTTCAGAAAGAATATTCGAAAGAAAACCCCACTGAGTATCGCAATTCTTTAAAC



GAATGGATTGCTTTTTGTAAAGAGTTCTTAAAAACTTATAAAGCGGCTACCATTTTTGATATAACCAC



ATTGAAAAAGGCAGAGGAATATGCTGATATTGTAGAATTCTACAAGGATGTCGATAATCTGTGCTAC



AAACTGGAGTTCTGCCCGATTAAAACCTCGTTTATAGAAAACCTGATAGATAACGGCGACCTGTATCT



GTTTCGCATCAATAACAAAGACTTCAGCAGTAAATCGACCGGCACCAAGAACCTTCATACGTTATATT



TACAAGCTATATTCGATGAACGTAATCTGAACAATCCGACAATTATGCTGAATGGGGGAGCAGAACT



GTTCTATCGTAAAGAAAGTATTGAGCAGAAAAACCGTATCACACACAAAGCCGGTTCAATTCTCGTG



AATAAGGTGTGTAAAGACGGTACAAGCCTGGATGATAAGATACGTAATGAAATTTATCAATATGAGA



ATAAATTTATTGATACCCTGTCTGATGAAGCTAAAAAGGTGTTACCGAATGTCATTAAAAAGGAAGCT



ACCCATGACATTACAAAAGATAAACGTTTCACTAGTGACAAATTCTTCTTTCACTGCCCCCTGACAAT



TAATTATAAGGAAGGCGATACCAAGCAGTTCAATAACGAAGTGCTGAGTTTTCTGCGTGGAAATCCTG



ACATCAACATTATCGGCATTGACCGCGGAGAGCGTAATTTAATCTATGTAACGGTTATAAACCAGAAA



GGCGAGATTCTGGATTCGGTTTCATTCAATACCGTGACCAACAAGAGTTCAAAAATCGAGCAGACAG



TCGATTATGAAGAGAAATTGGCAGTCCGCGAGAAAGAGAGGATTGAAGCAAAACGTTCCTGGGACTC



TATCTCAAAAATTGCGACACTAAAGGAAGGTTATCTGAGCGCAATAGTTCACGAGATCTGTCTGTTAA



TGATTAAACACAACGCGATCGTTGTCTTAGAGAATCTTAATGCAGGCTTTAAGCGTATTCGTGGCGGT



TTATCAGAAAAAAGTGTTTATCAAAAATTCGAAAAAATGTTGATTAACAAACTGAACTATTTTGTCAG



CAAGAAGGAATCCGACTGGAATAAACCGTCTGGTCTGCTGAATGGACTGCAGCTTTCGGATCAGTTTG



AAAGCTTCGAAAAACTGGGTATTCAGTCTGGTTTTATTTTTTACGTGCCGGCTGCATATACCTCAAAG



ATTGATCCGACCACGGGCTTCGCCAATGTTCTGAATCTGTCGAAGGTACGCAATGTTGATGCGATCAA



AAGCTTTTTTTCTAACTTCAACGAAATTAGTTATAGCAAGAAAGAAGCCCTTTTCAAATTCTCATTCGA



TCTGGATTCACTGAGTAAGAAAGGCTTTAGTAGCTTTGTGAAATTTAGTAAGAGTAAATGGAACGTCT



ACACCTTTGGAGAACGTATCATAAAGCCAAAGAATAAGCAAGGTTATCGGGAGGACAAAAGAATCA



ACTTGACCTTCGAGATGAAGAAGTTACTTAACGAGTATAAGGTTTCTTTTGATCTTGAAAATAACTTG



ATTCCGAATCTCACGAGTGCCAACCTGAAGGATACTTTTTGGAAAGAGCTATTCTTTATCTTCAAGAC



TACGCTGCAGCTCCGTAACAGCGTTACTAACGGTAAAGAAGATGTGCTCATCTCTCCGGTCAAAAATG



CGAAGGGTGAATTCTTCGTTTCGGGAACGCATAACAAGACTCTTCCGCAAGATTGCGATGCGAACGGT



GCATACCATATTGCGTTGAAAGGTCTGATGATACTCGAACGTAACAACCTTGTACGTGAGGAGAAAG



ATACGAAAAAGATTATGGCGATTTCAAACGTGGATTGGTTCGAGTACGTGCAGAAACGTAGAGGCGT



TCTGTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATTTATTATATCGCGTTGAT



TATTGATGCTGTTTTTAGTTTTAACGGCAATTAATATATGTGTTATTAATTGAATGAATTTTATCATTCA



TAATAAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAA



GAGGATTACAGAATTATCTCATAACAAGTGTTAAGGGATGTTATTTCC





SEQ ID NO: 63

AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCG



TCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCG



GGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGA



TTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTG



ACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCA



CTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCA



CCATCATCATCACCATAACAACTACGACGAATTCACCAAACTGTACCCGATCCAGAAAACCATCCGTT



TCGAACTGAAACCGCAGGGTCGTACCATGGAACACCTGGAAACCTTCAACTTCTTCGAAGAAGACCG



TGACCGTGCGGAAAAATACAAAATCCTGAAAGAAGCGATCGACGAATACCACAAAAAATTCATCGAC



GAACACCTGACCAACATGTCTCTGGACTGGAACTCTCTGAAACAGATCTCTGAAAAATACTACAAATC



TCGTGAAGAAAAAGACAAAAAAGTTTTCCTGTCTGAACAGAAACGTATGCGTCAGGAAATCGTTTCT



GAATTCAAAAAAGACGACCGTTTCAAAGACCTGTTCTCTAAAAAACTGTTCTCTGAACTGCTGAAAGA



AGAAATCTACAAAAAAGGTAACCACCAGGAAATCGACGCGCTGAAATCTTTCGACAAATTCTCTGGT



TACTTCATCGGTCTGCACGAAAACCGTAAAAACATGTACTCTGACGGTGACGAAATCACCGCGATCTC



TAACCGTATCGTTAACGAAAACTTCCCGAAATTCCTGGACAACCTGCAGAAATACCAGGAAGCGCGT



AAAAAATACCCGGAATGGATCATCAAAGCGGAATCTGCGCTGGTTGCGCACAACATCAAAATGGACG



AAGTTTTCTCTCTGGAATACTTCAACAAAGTTCTGAACCAGGAAGGTATCCAGCGTTACAACCTGGCG



CTGGGTGGTTACGTTACCAAATCTGGTGAAAAAATGATGGGTCTGAACGACGCGCTGAACCTGGCGC



ACCAGTCTGAAAAATCTTCTAAAGGTCGTATCCACATGACCCCGCTGTTCAAACAGATCCTGTCTGAA



AAAGAATCTTTCTCTTACATCCCGGACGTTTTCACCGAAGACTCTCAGCTGCTGCCGTCTATCGGTGGT



TTCTTCGCGCAGATCGAAAACGACAAAGACGGTAACATCTTCGACCGTGCGCTGGAACTGATCTCTTC



TTACGCGGAATACGACACCGAACGTATCTACATCCGTCAGGCGGACATCAACCGTGTTTCTAACGTTA



TCTTCGGTGAATGGGGTACCCTGGGTGGTCTGATGCGTGAATACAAAGCGGACTCTATCAACGACATC



AACCTGGAACGTACCTGCAAAAAAGTTGACAAATGGCTGGACTCTAAAGAATTCGCGCTGTCTGACG



TTCTGGAAGCGATCAAACGTACCGGTAACAACGACGCGTTCAACGAATACATCTCTAAAATGCGTAC



CGCGCGTGAAAAAATCGACGCGGCGCGTAAAGAAATGAAATTCATCTCTGAAAAAATCTCTGGTGAC



GAAGAATCTATCCACATCATCAAAACCCTGCTGGACTCTGTTCAGCAGTTCCTGCACTTCTTCAACCTG



TTCAAAGCGCGTCAGGACATCCCGCTGGACGGTGCGTTCTACGCGGAATTCGACGAAGTTCACTCTAA



ACTGTTCGCGATCGTTCCGCTGTACAACAAAGTTCGTAACTACCTGACCAAAAACAACCTGAACACCA



AAAAAATCAAACTGAACTTCAAAAACCCGACCCTGGCGAACGGTTGGGACCAGAACAAAGTTTACGA



CTACGCGTCTCTGATCTTCCTGCGTGACGGTAACTACTACCTGGGTATCATCAACCCGAAACGTAAAA



AAAACATCAAATTCGAACAGGGTTCTGGTAACGGTCCGTTCTACCGTAAAATGGTTTACAAACAGATC



CCGGGTCCGAACAAAAACCTGCCGCGTGTTTTCCTGACCTCTACCAAAGGTAAAAAAGAATACAAAC



CGTCTAAAGAAATCATCGAAGGTTACGAAGCGGACAAACACATCCGTGGTGACAAATTCGACCTGGA



CTTCTGCCACAAACTGATCGACTTCTTCAAAGAATCTATCGAAAAACACAAAGACTGGTCTAAATTCA



ACTTCTACTTCTCTCCGACCGAATCTTACGGTGACATCTCTGAATTCTACCTGGACGTTGAAAAACAG



GGTTACCGTATGCACTTCGAAAACATCTCTGCGGAAACCATCGACGAATACGTTGAAAAAGGTGACC



TGTTCCTGTTCCAGATCTACAACAAAGACTTCGTTAAAGCGGCGACCGGTAAAAAAGACATGCACAC



CATCTACTGGAACGCGGCGTTCTCTCCGGAAAACCTGCAGGACGTTGTTGTTAAACTGAACGGTGAAG



CGGAACTGTTCTACCGTGACAAATCTGACATCAAAGAAATCGTTCACCGTGAAGGTGAAATCCTGGTT



AACCGTACCTACAACGGTCGTACCCCGGTTCCGGACAAAATCCACAAAAAACTGACCGACTACCACA



ACGGTCGTACCAAAGACCTGGGTGAAGCGAAAGAATACCTGGACAAAGTTCGTTACTTCAAAGCGCA



CTACGACATCACCAAAGACCGTCGTTACCTGAACGACAAAATCTACTTCCACGTTCCGCTGACCCTGA



ACTTCAAAGCGAACGGTAAAAAAAACCTGAACAAAATGGTTATCGAAAAATTCCTGTCTGACGAAAA



AGCGCACATCATCGGTATCGACCGTGGTGAACGTAACCTGCTGTACTACTCTATCATCGACCGTTCTG



GTAAAATCATCGACCAGCAGTCTCTGAACGTTATCGACGGTTTCGACTACCGTGAAAAACTGAACCAG



CGTGAAATCGAAATGAAAGACGCGCGTCAGTCTTGGAACGCGATCGGTAAAATCAAAGACCTGAAAG



AAGGTTACCTGTCTAAAGCGGTTCACGAAATCACCAAAATGGCGATCCAGTACAACGCGATCGTTGTT



ATGGAAGAACTGAACTACGGTTTCAAACGTGGTCGTTTCAAAGTTGAAAAACAGATCTACCAGAAAT



TCGAAAACATGCTGATCGACAAAATGAACTACCTGGTTTTCAAAGACGCGCCGGACGAATCTCCGGG



TGGTGTTCTGAACGCGTACCAGCTGACCAACCCGCTGGAATCTTTCGCGAAACTGGGTAAACAGACCG



GTATCCTGTTCTACGTTCCGGCGGCGTACACCTCTAAAATCGACCCGACCACCGGTTTCGTTAACCTGT



TCAACACCTCTTCTAAAACCAACGCGCAGGAACGTAAAGAATTCCTGCAGAAATTCGAATCTATCTCT



TACTCTGCGAAAGACGGTGGTATCTTCGCGTTCGCGTTCGACTACCGTAAATTCGGTACCTCTAAAAC



CGACCACAAAAACGTTTGGACCGCGTACACCAACGGTGAACGTATGCGTTACATCAAAGAAAAAAAA



CGTAACGAACTGTTCGACCCGTCTAAAGAAATCAAAGAAGCGCTGACCTCTTCTGGTATCAAATACGA



CGGTGGTCAGAACATCCTGCCGGACATCCTGCGTTCTAACAACAACGGTCTGATCTACACCATGTACT



CTTCTTTCATCGCGGCGATCCAGATGCGTGTTTACGACGGTAAAGAAGACTACATCATCTCTCCGATC



AAAAACTCTAAAGGTGAATTCTTCCGTACCGACCCGAAACGTCGTGAACTGCCGATCGACGCGGACG



CGAACGGTGCGTACAACATCGCGCTGCGTGGTGAACTGACCATGCGTGCGATCGCGGAAAAATTCGA



CCCGGACTCTGAAAAAATGGCGAAACTGGAACTGAAACACAAAGACTGGTTCGAATTCATGCAGACC



CGTGGTGACTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCC



TCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA





SEQ ID NO: 64

AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAAGCATTGATAATTGA



GATCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGACAAAAATAAATTATTTATTTATCCAGA



AAATGAATTGGAAAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAGGTtg



ccgtcactgcgtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattctg



taacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataatcacggcagaaaagtcc



acattgattatttgcacggcgtcacactttgctatgccatagcatttttatccataagattagcggatcc



tacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggctagcaccgcctat



ctcgtgtgagataggcggagatacgaactttaagAAGGAGatataccATGACT



AAAACATTTGATTCAGAGTTTTTTAATTTGTACTCGCTGCAAAAAACGGTACGCTTTGAGTTAAAACC



CGTGGGAGAAACCGCGTCATTTGTGGAAGACTTTAAAAACGAGGGCTTGAAACGTGTTGTGAGCGAA



GATGAAAGGCGAGCCGTCGATTACCAGAAAGTTAAGGAAATAATTGACGATTACCATCGGGATTTCA



TTGAAGAAAGTTTAAATTATTTTCCGGAACAGGTGAGTAAAGATGCTCTTGAGCAGGCGTTTCATCTT



TATCAGAAACTGAAGGCAGCAAAAGTTGAGGAAAGGGAAAAAGCGCTGAAAGAATGGGAAGCGCTG



CAGAAAAAGCTACGTGAAAAAGTGGTGAAATGCTTCTCGGACTCGAATAAAGCCCGCTTCTCAAGGA



TTGATAAAAAGGAACTGATTAAGGAAGACCTGATAAATTGGTTGGTCGCCCAGAATCGCGAGGATGA



TATCCCTACGGTCGAAACGTTTAACAACTTCACCACATATTTTACCGGCTTCCATGAGAATCGTAAAA



ATATTTACTCCAAAGATGATCACGCCACCGCTATTAGCTTTCGCCTTATTCATGAAAATCTTCCAAAGT



TTTTTGACAACGTGATTAGCTTCAATAAGTTGAAAGAGGGTTTCCCTGAATTAAAATTTGATAAAGTG



AAAGAGGATTTAGAAGTAGATTATGATCTGAAGCATGCGTTTGAAATAGAATATTTCGTTAACTTCGT



GACCCAAGCGGGCATAGATCAGTATAATTATCTGTTAGGAGGGAAAACCCTGGAGGACGGGACGAAA



AAACAAGGGATGAATGAGCAAATTAATCTGTTCAAACAACAGCAAACGCGAGATAAAGCGCGTCAG



ATTCCCAAACTGATCCCCCTGTTCAAACAGATTCTTAGCGAAAGGACTGAAAGCCAGTCCTTTATTCC



TAAACAATTTGAAAGTGATCAGGAGTTGTTCGATTCACTGCAGAAGTTACATAATAACTGCCAGGATA



AATTCACCGTGCTGCAACAAGCCATTCTCGGTCTGGCAGAGGCGGATCTTAAGAAGGTCTTCATCAAA



ACCTCTGATTTAAATGCCTTATCTAACACCATTTTCGGGAATTACAGCGTCTTTTCCGATGCACTGAAC



CTGTATAAAGAAAGCCTGAAAACGAAAAAAGCGCAGGAGGCTTTTGAGAAACTACCGGCCCATTCTA



TTCACGACCTCATTCAATACTTGGAACAGTTCAATTCCAGCCTGGACGCGGAAAAACAACAGAGCAC



CGACACCGTCCTGAACTACTTCATCAAGACCGATGAATTATATTCTCGCTTCATTAAATCCACTAGCG



AGGCTTTCACTCAGGTGCAGCCTTTGTTCGAACTGGAAGCCCTGTCATCTAAGCGCCGCCCACCGGAA



TCGGAAGATGAAGGGGCAAAAGGGCAGGAAGGCTTCGAGCAGATCAAGCGTATTAAAGCTTACCTG



GATACGCTTATGGAAGCGGTACACTTTGCAAAGCCGTTGTATCTTGTTAAGGGTCGTAAAATGATCGA



AGGGCTCGATAAAGACCAGTCCTTTTATGAAGCGTTTGAAATGGCGTACCAAGAACTTGAATCGTTAA



TCATTCCTATCTATAACAAAGCGCGGAGCTATCTGTCGCGGAAACCTTTCAAGGCCGATAAATTCAAG



ATTAATTTTGACAACAACACGCTACTGAGCGGATGGGATGCGAACAAGGAAACTGCTAACGCGTCCA



TTCTGTTTAAGAAAGACGGGTTATATTACCTTGGAATTATGCCGAAAGGTAAGACCTTTCTCTTTGACT



ACTTTGTATCGAGCGAGGATTCAGAGAAACTGAAACAGCGTCGCCAGAAGACCGCCGAAGAAGCTCT



GGCGCAGGATGGTGAAAGTTACTTCGAAAAAATTCGTTATAAACTGTTACCAGGGGCTTCAAAGATG



TTACCGAAAGTCTTTTTTAGCAACAAAAATATTGGCTTTTACAACCCGTCGGATGACATTTTACGCATT



CGCAACACAGCCTCTCACACCAAAAACGGGACCCCTCAGAAAGGCCACTCAAAAGTTGAGTTTAACC



TGAATGATTGTCATAAGATGATTGATTTCTTCAAATCATCAATTCAGAAACACCCGGAATGGGGGTCT



TTTGGCTTTACGTTTTCTGATACCAGTGATTTTGAAGACATGAGTGCCTTCTACCGGGAAGTAGAAAA



CCAGGGTTACGTAATTAGCTTTGACAAAATCAAAGAGACCTATATACAGAGCCAGGTGGAACAGGGT



AATCTCTACTTATTCCAGATTTATAACAAGGATTTCTCGCCCTACAGCAAAGGCAAACCAAACCTGCA



TACTCTGTACTGGAAAGCCCTGTTTGAAGAAGCGAACCTGAATAACGTAGTGGCGAAGTTGAACGGT



GAAGCGGAAATCTTCTTCCGTCGTCACTCCATTAAGGCCTCTGATAAAGTTGTCCATCCGGCAAATCA



GGCCATTGATAATAAGAATCCACACACGGAAAAAACGCAGTCAACCTTTGAATATGACCTCGTTAAA



GACAAACGCTACACGCAAGATAAGTTCTTTTTCCACGTCCCAATCAGCCTCAACTTTAAAGCACAAGG



GGTTTCAAAGTTTAATGATAAAGTCAATGGGTTCCTCAAGGGCAACCCGGATGTCAACATTATAGGTA



TAGACAGGGGCGAACGCCATCTGCTTTACTTTACCGTAGTGAATCAGAAAGGTGAAATACTGGTTCAG



GAATCATTAAATACCTTGATGTCGGACAAAGGGCACGTTAATGATTACCAGCAGAAACTGGATAAAA



AAGAACAGGAACGTGATGCTGCGCGTAAATCGTGGACCACGGTTGAGAACATTAAAGAGCTGAAAG



AGGGGTATCTAAGCCATGTGGTACACAAACTGGCGCACCTCATCATTAAATATAACGCAATAGTCTGC



CTAGAAGACTTGAATTTTGGCTTTAAACGCGGCCGCTTCAAAGTGGAAAAACAAGTTTATCAAAAATT



TGAAAAGGCGCTTATAGATAAACTGAATTATCTGGTTTTTAAAGAAAAGGAACTTGGTGAGGTAGGG



CACTACTTGACAGCTTATCAACTGACGGCCCCGTTCGAATCATTCAAAAAACTGGGCAAACAGTCTGG



CATTCTGTTTTACGTGCCGGCAGATTATACTTCAAAAATCGATCCAACAACTGGCTTTGTGAACTTCCT



GGACCTGAGATATCAGTCTGTAGAAAAAGCTAAACAACTTCTTAGCGATTTTAATGCCATTCGTTTTA



ACAGCGTTCAGAATTACTTTGAATTCGAAATTGACTATAAAAAACTTACTCCGAAACGTAAAGTCGGA



ACCCAAAGTAAATGGGTAATTTGTACGTATGGCGATGTCAGGTATCAGAACCGTCGGAATCAAAAAG



GTCATTGGGAGACCGAAGAAGTGAACGTGACCGAAAAGCTGAAGGCTCTGTTCGCCAGCGATTCAAA



AACTACAACTGTGATCGATTACGCAAATGATGATAACCTGATAGATGTGATTTTAGAGCAGGATAAA



GCCAGCTTTTTTAAAGAACTGTTGTGGCTCCTGAAACTTACGATGACCTTACGACATTCCAAGATCAA



ATCGGAAGATGATTTTATTCTGTCACCGGTCAAGAATGAGCAGGGTGAATTCTATGATAGTAGGAAA



GCCGGCGAAGTGTGGCCGAAAGACGCCGACGCCAATGGCGCCTATCATATCGCGCTCAAAGGGCTTT



GGAATTTGCAGCAGATTAACCAGTGGGAAAAAGGTAAAACCCTGAATCTGGCTATCAAAAACCAGGA



TTGGTTTAGCTTTATCCAAGAGAAACCGTATCAGGAATGAGAAATCATCCTTAGCGAAAGCTAAGGAT



TTTTTTTATCTGAAATTTATTATATCGCGTTGATTATTGATGCTGTTTTTAGTTTTAACGGCAATTAATA



TATGTGTTATTAATTGAATGAATTTTATCATTCATAATAAGTATGTGTAGGATCAAGCTCAGGTTAAAT



ATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACAGAATTATCTCATAACAAGTGTTAA



GGGATGTTATTTCC





SEQ ID NO: 65

AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCG



TCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCG



GGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGA



TTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTG



ACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCA



CTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCA



CCATCATCATCACCATCATACAGGCGGTCTTCTTAGTATGGACGCGAAAGAGTTCACAGGTCAGTATC



CGTTGTCGAAAACATTACGATTCGAACTTCGGCCCATCGGCCGCACGTGGGATAACCTGGAGGCCTCA



GGCTACTTAGCGGAAGACCGCCATCGTGCCGAATGTTATCCTCGTGCGAAAGAGTTATTGGATGACAA



CCATCGTGCCTTCCTGAATCGTGTGTTGCCACAAATCGATATGGATTGGCACCCGATTGCGGAGGCCT



TTTGTAAGGTACATAAAAACCCTGGTAATAAAGAACTTGCCCAGGATTACAACCTTCAGTTGTCAAAG



CGCCGTAAGGAGATCAGCGCATATCTTCAGGATGCAGATGGCTATAAAGGCCTGTTCGCGAAGCCCG



CCTTAGACGAAGCTATGAAAATTGCGAAAGAAAACGGGAACGAAAGTGATATTGAGGTTCTCGAAGC



GTTTAACGGTTTTAGCGTATACTTCACCGGTTATCATGAGTCACGCGAGAACATTTATAGCGATGAGG



ATATGGTGAGCGTAGCCTACCGAATTACTGAGGATAATTTCCCGCGCTTTGTCTCAAACGCTTTGATCT



TTGATAAATTAAACGAAAGCCATCCGGATATTATCTCTGAAGTATCGGGCAATCTTGGAGTTGATGAC



ATTGGTAAGTACTTTGACGTGTCGAACTATAACAATTTTCTTTCCCAGGCCGGTATAGATGACTACAA



TCACATTATTGGCGGCCATACAACCGAAGACGGACTGATACAAGCGTTTAATGTCGTATTGAACTTAC



GTCACCAAAAAGACCCTGGCTTTGAAAAAATTCAGTTCAAACAGCTCTACAAACAAATCCTGAGCGT



GCGTACCAGCAAAAGCTACATCCCGAAACAGTTTGACAACTCTAAGGAGATGGTTGACTGCATTTGC



GATTATGTCAGCAAAATAGAGAAATCCGAAACAGTAGAACGGGCCCTGAAACTAGTCCGTAATATCA



GTTCTTTCGACTTGCGCGGGATCTTTGTCAATAAAAAGAACTTGCGCATACTGAGCAACAAACTGATA



GGAGATTGGGACGCGATCGAAACCGCATTGATGCATAGTTCTTCATCAGAAAACGATAAGAAAAGCG



TATATGATAGCGCGGAGGCTTTTACGTTGGATGACATCTTTTCAAGCGTGAAAAAATTTTCTGATGCC



TCTGCCGAAGATATTGGCAACAGGGCGGAAGACATCTGTAGAGTGATAAGTGAGACGGCCCCTTTTA



TCAACGATCTGCGAGCGGTGGACCTGGATAGCCTGAACGACGATGGTTATGAAGCGGCCGTCTCAAA



AATTCGGGAGTCGCTGGAGCCTTATATGGATCTTTTCCATGAACTGGAAATTTTCTCGGTTGGCGATG



AGTTCCCAAAATGCGCAGCATTTTACAGCGAACTGGAGGAAGTCAGCGAACAGCTGATCGAAATTAT



TCCGTTATTCAACAAGGCGCGTTCGTTCTGCACCCGGAAACGCTATAGCACCGATAAGATTAAAGTGA



ACTTAAAATTCCCGACCTTGGCGGACGGGTGGGACCTGAACAAAGAGAGAGACAACAAAGCCGCGAT



TCTGCGGAAAGACGGTAAGTATTATCTGGCAATTCTGGATATGAAGAAAGATCTGTCAAGCATTAGG



ACCAGCGACGAAGATGAATCCAGCTTCGAAAAGATGGAGTATAAACTGTTACCGAGTCCAGTAAAAA



TGCTGCCAAAGATATTCGTAAAATCGAAAGCCGCTAAGGAAAAATATGGCCTGACAGATCGTATGCT



TGAATGCTACGATAAAGGTATGCATAAGTCGGGTAGTGCGTTTGATCTTGGCTTTTGCCATGAACTCA



TTGATTATTACAAGCGTTGTATCGCGGAGTACCCAGGCTGGGATGTGTTCGATTTCAAGTTTCGCGAA



ACTTCCGATTATGGGTCCATGAAAGAGTTCAATGAAGATGTGGCCGGAGCCGGTTACTATATGAGTCT



GAGAAAAATTCCGTGCAGCGAAGTGTACCGTCTGTTAGACGAGAAATCGATTTATCTATTTCAAATTT



ATAACAAAGATTACTCTGAAAATGCACATGGTAATAAGAACATGCATACCATGTACTGGGAGGGTCT



CTTTTCCCCGCAAAACCTGGAGTCGCCCGTTTTCAAGTTGTCGGGTGGGGCAGAACTTTTCTTTCGAAA



ATCCTCAATCCCTAACGATGCCAAAACAGTACACCCGAAAGGCTCAGTGCTGGTTCCACGTAATGATG



TTAACGGTCGGCGTATTCCAGATTCAATCTACCGCGAACTGACACGCTATTTTAACCGTGGCGATTGC



CGAATCAGTGACGAAGCCAAAAGTTATCTTGACAAGGTTAAGACTAAAAAAGCGGACCATGACATTG



TGAAAGATCGCCGCTTTACCGTGGATAAAATGATGTTCCACGTCCCGATTGCGATGAACTTTAAGGCG



ATCAGTAAACCGAACTTAAACAAAAAAGTCATTGATGGCATCATTGATGATCAGGATCTGAAAATCA



TTGGTATTGATCGTGGCGAGCGGAACTTAATTTACGTCACGATGGTTGACAGAAAAGGGAATATCTTA



TATCAGGATTCTCTTAACATCCTCAATGGCTACGACTATCGTAAAGCTCTGGATGTGCGCGAATATGA



CAACAAGGAAGCGCGTCGTAACTGGACTAAAGTGGAGGGCATTCGCAAAATGAAGGAAGGCTATCTG



TCATTAGCGGTCTCGAAATTAGCGGATATGATTATCGAAAATAACGCCATCATCGTTATGGAGGACCT



GAACCACGGATTCAAAGCGGGCCGCTCAAAGATTGAAAAACAAGTTTATCAGAAATTTGAGAGTATG



CTGATTAACAAACTGGGCTATATGGTGTTAAAAGACAAGTCAATTGACCAATCAGGTGGCGCGCTGC



ATGGATACCAGCTGGCGAACCATGTTACCACCTTAGCATCAGTTGGAAAGCAGTGTGGGGTTATCTTT



TATATACCGGCAGCGTTCACTAGTAAAATAGATCCGACCACTGGTTTCGCCGATCTCTTTGCCCTGAG



TAACGTTAAAAACGTAGCGAGCATGCGTGAATTCTTTTCCAAAATGAAATCTGTCATTTATGATAAAG



CTGAAGGCAAATTCGCATTCACCTTTGATTACTTGGATTACAACGTGAAGAGCGAATGTGGTCGTACG



CTGTGGACCGTTTACACCGTTGGTGAGCGCTTCACCTATTCCCGTGTGAACCGCGAATATGTACGTAA



AGTCCCCACCGATATTATCTATGATGCCCTCCAGAAAGCAGGCATTAGCGTCGAAGGAGACTTAAGG



GACAGAATTGCCGAAAGCGATGGCGATACGCTGAAGTCTATTTTTTACGCATTCAAATACGCGCTAGA



TATGCGCGTTGAGAATCGCGAGGAAGACTACATTCAATCACCTGTGAAAAATGCCTCTGGGGAATTTT



TTTGTTCAAAAAATGCTGGTAAAAGCCTCCCACAAGATAGCGATGCAAACGGTGCATATAACATTGCC



CTGAAAGGTATTCTTCAATTACGCATGCTGTCTGAGCAGTACGACCCCAACGCGGAATCTATTAGACT



TCCGCTGATAACCAATAAAGCCTGGCTGACATTCATGCAGTCTGGCATGAAGACCTGGAAAAATTAG



GAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATA



TTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA





SEQ ID NO: 66

AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAAGCATTGATAATTGA



GATCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGACAAAAATAAATTATTTATTTATCCAGA



AAATGAATTGGAAAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAGGTtg



ccgtcactgcgtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattctg



taacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataatcacggcagaaaagtcc



acattgattatttgcacggcgtcacactttgctatgccatagcatttttatccataagattagcggatcc



tacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggctagcaccgcctat



ctcgtgtgagataggcggagatacgaactttaagAAGGAGatataccatgGATA



GTTTGAAAGATTTCACCAATCTGTACCCTGTCAGTAAGACATTGAGATTTGAATTAAAGCCCGTTGGA



AAGACTTTAGAAAATATCGAGAAAGCAGGTATTTTGAAAGAGGATGAGCATCGTGCAGAAAGTTATC



GGAGGGTGAAGAAAATAATTGATACTTATCATAAGGTATTTATCGATTCTTCTCTTGAAAATATGGCT



AAAATGGGTATTGAGAATGAAATAAAAGCAATGCTCCAAAGTTTCTGCGAATTGTATAAAAAAGATC



ATCGCACTGAGGGTGAAGACAAGGCATTAGATAAAATTCGAGCAGTACTTCGTGGCCTGATTGTTGG



GGCTTTCACTGGTGTTTGCGGAAGACGGGAAAATACAGTCCAAAACGAGAAGTACGAGAGTTTGTTC



AAAGAAAAGTTGATAAAAGAAATTTTACCTGATTTTGTGCTCTCTACTGAGGCTGAAAGCTTGCCTTT



CTCTGTTGAAGAAGCTACGAGGTCACTGAAGGAGTTTGATAGCTTTACATCCTACTTTGCTGGTTTTTA



CGAGAATAGAAAGAATATATACTCGACGAAACCTCAATCCACTGCCATTGCTTATCGTCTTATTCATG



AGAACTTGCCGAAGTTCATTGATAATATTCTTGTTTTTCAGAAGATCAAAGAGCCTATAGCCAAAGAG



CTGGAACATATTCGTGCGGACTTTTCTGCCGGGGGGTACATAAAAAAGGATGAGAGATTGGAGGATA



TTTTTTCGTTGAACTATTATATCCACGTGTTATCTCAGGCTGGGATCGAAAAATATAACGCATTGATTG



GGAAGATTGTGACAGAAGGAGATGGAGAGATGAAAGGGCTCAATGAACACATCAACCTTTACAACC



AACAAAGAGGCAGAGAGGATCGGCTCCCTCTTTTTAGGCCTCTTTATAAACAGATATTGAGTGACAGA



GAGCAATTATCATACTTGCCTGAGAGTTTTGAAAAAGATGAGGAGCTCCTCAGGGCTCTAAAAGAGTT



CTATGATCATATCGCAGAAGACATTCTCGGACGTACTCAACAGTTGATGACTTCTATTTCAGAATATG



ATTTATCTCGGATATACGTAAGGAACGATAGCCAATTGACTGATATATCAAAAAAAATGTTGGGAGA



TTGGAATGCTATCTACATGGCTAGAGAACGAGCATATGACCACGAGCAGGCTCCCAAAAGAATCACG



GCGAAATACGAGAGGGACAGGATTAAAGCTCTTAAAGGAGAAGAGAGTATAAGTCTGGCAAATCTTA



ATAGTTGTATTGCCTTTCTGGACAATGTTAGAGATTGCCGTGTAGATACTTATCTTTCCACACTGGGCC



AGAAGGAAGGACCACATGGTCTATCTAATCTCGTTGAGAACGTTTTTGCCTCATACCATGAAGCAGAG



CAATTGTTGAGCTTTCCATACCCCGAAGAGAATAATCTGATTCAGGACAAGGACAATGTGGTGTTAAT



TAAGAATCTTCTCGACAATATCAGTGATCTGCAGAGGTTCTTGAAACCTCTTTGGGGTATGGGAGACG



AACCCGATAAAGATGAAAGATTTTATGGAGAGTATAATTATATCCGAGGAGCTCTAGATCAGGTGAT



CCCTCTGTACAATAAGGTAAGGAACTACCTCACTCGGAAGCCTTATTCGACCAGAAAAGTAAAACTC



AATTTTGGGAATTCTCAATTGCTTAGTGGTTGGGATAGAAATAAGGAAAAGGATAATAGCTGTGTGAT



TTTGCGTAAGGGGCAGAACTTCTATTTGGCTATTATGAACAATAGGCACAAAAGAAGTTTCGAAAAC



AAGGTGTTGCCCGAGTATAAGGAGGGAGAACCTTACTTCGAAAAGATGGATTATAAATTTTTGCCTGA



TCCTAATAAAATGCTTCCTAAGGTTTTTCTTTCGAAAAAAGGAATAGAGATATACAAACCAAGTCCGA



AGCTTTTAGAACAATATGGACATGGAACTCACAAAAAGGGAGATACCTTTAGTATGGATGATTTGCA



CGAACTGATCGATTTCTTCAAACACTCAATCGAGGCTCATGAAGATTGGAAGCAATTCGGATTCAAAT



TTTCTGATACGGCTACTTATGAGAATGTATCTAGTTTCTATAGAGAAGTTGAGGATCAGGGGTATAAG



CTCTCTTTCCGAAAAGTTTCGGAATCTTATGTCTATTCATTAATAGATCAAGGCAAGTTGTATTTATTT



CAGATATACAACAAGGACTTTTCTCCCTGCAGCAAAGGGACACCTAATCTGCATACCTTGTATTGGAG



AATGCTTTTTGACGAGCGCAATTTGGCAGATGTCATATACAAACTGGATGGGAAGGCTGAAATCTTTT



TCCGAGAGAAGAGTTTGAAAAATGATCATCCCACGCATCCGGCTGGTAAGCCTATCAAAAAGAAAAG



TCGACAAAAAAAAGGAGAGGAGAGTCTGTTTGAGTATGATTTAGTCAAGGATAGGCACTATACGATG



GATAAGTTCCAGTTTCATGTGCCTATTACTATGAATTTTAAATGTTCTGCAGGAAGCAAAGTCAATGA



TATGGTTAATGCTCATATTCGAGAGGCAAAGGATATGCATGTCATTGGAATTGATCGTGGAGAACGCA



ATCTGCTGTATATATGCGTGATAGATAGTCGAGGGACGATTTTGGATCAAATTTCTCTGAATACGATT



AACGATATAGACTATCATGATTTATTGGAGAGTCGAGACAAAGACCGTCAGCAGGAGCGCCGAAACT



GGCAAACTATCGAAGGGATCAAGGAGCTAAAACAAGGCTACCTTAGTCAGGCGGTTCATCGGATAGC



CGAACTGATGGTGGCTTATAAGGCTGTAGTTGCTTTGGAGGATTTGAATATGGGGTTCAAACGTGGGC



GGCAGAAAGTAGAAAGTTCTGTTTATCAGCAGTTTGAGAAACAGCTGATAGATAAGCTCAACTATCTT



GTGGACAAGAAGAAAAGGCCTGAAGATATTGGAGGATTGTTGAGAGCCTATCAATTTACGGCCCCAT



TTAAGAGTTTTAAGGAAATGGGAAAGCAAAACGGCTTCTTGTTTTATATCCCGGCTTGGAACACGAGC



AACATAGATCCGACTACTGGATTTGTTAATTTATTTCATGCCCAGTATGAAAATGTAGATAAAGCGAA



GAGCTTCTTTCAAAAGTTTGATTCAATTAGTTACAACCCGAAGAAAGACTGGTTTGAGTTTGCATTCG



ATTATAAAAACTTTACTAAAAAGGCTGAAGGAAGTCGTTCTATGTGGATATTATGCACACATGGTTCC



CGAATAAAGAATTTTAGAAATTCCCAGAAGAATGGTCAATGGGATTCCGAAGAATTCGCCTTGACGG



AGGCTTTTAAGTCTCTTTTTGTGCGATATGAGATAGATTATACCGCTGATTTGAAAACAGCTATTGTGG



ACGAAAAGCAAAAAGACTTCTTCGTGGATCTTCTGAAGCTATTCAAATTGACAGTACAGATGCGCAA



CAGCTGGAAAGAGAAGGATTTGGATTATCTAATCTCTCCTGTAGCAGGGGCTGATGGCCGTTTCTTCG



ATACAAGAGAGGGAAATAAAAGTCTGCCTAAGGATGCAGATGCCAATGGAGCTTATAATATTGCCCT



AAAAGGACTTTGGGCTCTACGCCAGATTCGGCAAACTTCAGAAGGCGGTAAACTCAAATTGGCGATT



TCCAATAAGGAATGGCTACAGTTTGTGCAAGAGAGATCTTACGAGAAAGACtgaGAAATCATCCTTAG



CGAAAGCTAAGGATTTTTTTTATCTGAAATTTATTATATCGCGTTGATTATTGATGCTGTTTTTAGTTTT



AACGGCAATTAATATATGTGTTATTAATTGAATGAATTTTATCATTCATAATAAGTATGTGTAGGATC



AAGCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACAGAATTATCTC



ATAACAAGTGTTAAGGGATGTTATTTCC





SEQ ID NO: 67

AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAAGCATTGATAATTGA



GATCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGACAAAAATAAATTATTTATTTATCCAGA



AAATGAATTGGAAAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAGGTtg



ccgtcactgcgtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattctg



taacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataatcacggcagaaaagtcc



acattgattatttgcacggcgtcacactttgctatgccatagcatttttatccataagattagcggatcc



tacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggctagcaccgcctat



ctcgtgtgagataggcggagatacgaactttaagAAGGAGatataccATGAAC



AACGGCACAAATAATTTTCAGAACTTCATCGGGATCTCAAGTTTGCAGAAAACGCTGCGCAATGCTCT



GATCCCCACGGAAACCACGCAACAGTTCATCGTCAAGAACGGAATAATTAAAGAAGATGAGTTACGT



GGCGAGAACCGCCAGATTCTGAAAGATATCATGGATGACTACTACCGCGGATTCATCTCTGAGACTCT



GAGTTCTATTGATGACATAGATTGGACTAGCCTGTTCGAAAAAATGGAAATTCAGCTGAAAAATGGT



GATAATAAAGATACCTTAATTAAGGAACAGACAGAGTATCGGAAAGCAATCCATAAAAAATTTGCGA



ACGACGATCGGTTTAAGAACATGTTTAGCGCCAAACTGATTAGTGACATATTACCTGAATTTGTCATC



CACAACAATAATTATTCGGCATCAGAGAAAGAGGAAAAAACCCAGGTGATAAAATTGTTTTCGCGCT



TTGCGACTAGCTTTAAAGATTACTTCAAGAACCGTGCAAATTGCTTTTCAGCGGACGATATTTCATCA



AGCAGCTGCCATCGCATCGTCAACGACAATGCAGAGATATTCTTTTCAAATGCGCTGGTCTACCGCCG



GATCGTAAAATCGCTGAGCAATGACGATATCAACAAAATTTCGGGCGATATGAAAGATTCATTAAAA



GAAATGAGTCTGGAAGAAATATATTCTTACGAGAAGTATGGGGAATTTATTACCCAGGAAGGCATTA



GCTTCTATAATGATATCTGTGGGAAAGTGAATTCTTTTATGAACCTGTATTGTCAGAAAAATAAAGAA



AACAAAAATTTATACAAACTTCAGAAACTTCACAAACAGATTCTATGCATTGCGGACACTAGCTATGA



GGTCCCGTATAAATTTGAAAGTGACGAGGAAGTGTACCAATCAGTTAACGGCTTCCTTGATAACATTA



GCAGCAAACATATAGTCGAAAGATTACGCAAAATCGGCGATAACTATAACGGCTACAACCTGGATAA



AATTTATATCGTGTCCAAATTTTACGAGAGCGTTAGCCAAAAAACCTACCGCGACTGGGAAACAATTA



ATACCGCCCTCGAAATTCATTACAATAATATCTTGCCGGGTAACGGTAAAAGTAAAGCCGACAAAGT



AAAAAAAGCGGTTAAGAATGATTTACAGAAATCCATCACCGAAATAAATGAACTAGTGTCAAACTAT



AAGCTGTGCAGTGACGACAACATCAAAGCGGAGACTTATATACATGAGATTAGCCATATCTTGAATA



ACTTTGAAGCACAGGAATTGAAATACAATCCGGAAATTCACCTAGTTGAATCCGAGCTCAAAGCGAG



TGAGCTTAAAAACGTGCTGGACGTGATCATGAATGCGTTTCATTGGTGTTCGGTTTTTATGACTGAGG



AACTTGTTGATAAAGACAACAATTTTTATGCGGAACTGGAGGAGATTTACGATGAAATTTATCCAGTA



ATTAGTCTGTACAACCTGGTTCGTAACTACGTTACCCAGAAACCGTACAGCACGAAAAAGATTAAATT



GAACTTTGGAATACCGACGTTAGCAGACGGTTGGTCAAAGTCCAAAGAGTATTCTAATAACGCTATCA



TACTGATGCGCGACAATCTGTATTATCTGGGCATCTTTAATGCGAAGAATAAACCGGACAAGAAGATT



ATCGAGGGTAATACGTCAGAAAATAAGGGTGACTACAAAAAGATGATTTATAATTTGCTCCCGGGTC



CCAACAAAATGATCCCGAAAGTTTTCTTGAGCAGCAAGACGGGGGTGGAAACGTATAAACCGAGCGC



CTATATCCTAGAGGGGTATAAACAGAATAAACATATCAAGTCTTCAAAAGACTTTGATATCACTTTCT



GTCATGATCTGATCGACTACTTCAAAAACTGTATTGCAATTCATCCCGAGTGGAAAAACTTCGGTTTT



GATTTTAGCGACACCAGTACTTATGAAGACATTTCCGGGTTTTATCGTGAGGTAGAGTTACAAGGTTA



CAAGATTGATTGGACATACATTAGCGAAAAAGACATTGATCTGCTGCAGGAAAAAGGTCAACTGTAT



CTGTTCCAGATATATAACAAAGATTTTTCGAAAAAATCAACCGGGAATGACAACCTTCACACCATGTA



CCTGAAAAATCTTTTCTCAGAAGAAAATCTTAAGGATATCGTCCTGAAACTTAACGGCGAAGCGGAA



ATCTTCTTCAGGAAGAGCAGCATAAAGAACCCAATCATTCATAAAAAAGGCTCGATTTTAGTCAACCG



TACCTACGAAGCAGAAGAAAAAGACCAGTTTGGCAACATTCAAATTGTGCGTAAAAATATTCCGGAA



AACATTTATCAGGAGCTGTACAAATACTTCAACGATAAAAGCGACAAAGAGCTGTCTGATGAAGCAG



CCAAACTGAAGAATGTAGTGGGACACCACGAGGCAGCGACGAATATAGTCAAGGACTATCGCTACAC



GTATGATAAATACTTCCTTCATATGCCTATTACGATCAATTTCAAAGCCAATAAAACGGGTTTTATTAA



TGATAGGATCTTACAGTATATCGCTAAAGAAAAAGACTTACATGTGATCGGCATTGATCGGGGCGAG



CGTAACCTGATCTACGTGTCCGTGATTGATACTTGTGGTAATATAGTTGAACAGAAAAGCTTTAACAT



TGTAAACGGCTACGACTATCAGATAAAACTGAAACAACAGGAGGGCGCTAGACAGATTGCGCGGAA



AGAATGGAAAGAAATTGGTAAAATTAAAGAGATCAAAGAGGGCTACCTGAGCTTAGTAATCCACGAG



ATCTCTAAAATGGTAATCAAATACAATGCAATTATAGCGATGGAGGATTTGTCTTATGGTTTTAAAAA



AGGGCGCTTTAAGGTCGAACGGCAAGTTTACCAGAAATTTGAAACCATGCTCATCAATAAACTCAACT



ATCTGGTATTTAAAGATATTTCGATTACCGAGAATGGCGGTCTCCTGAAAGGTTATCAGCTGACATAC



ATTCCTGATAAACTTAAAAACGTGGGTCATCAGTGCGGCTGCATTTTTTATGTGCCTGCTGCATACAC



GAGCAAAATTGATCCGACCACCGGCTTTGTGAATATCTTTAAATTTAAAGACCTGACAGTGGACGCAA



AACGTGAATTCATTAAAAAATTTGACTCAATTCGTTATGACAGTGAAAAAAATCTGTTCTGCTTTACA



TTTGACTACAATAACTTTATTACGCAAAACACGGTCATGAGCAAATCATCGTGGAGTGTGTATACATA



CGGCGTGCGCATCAAACGTCGCTTTGTGAACGGCCGCTTCTCAAACGAAAGTGATACCATTGACATAA



CCAAAGATATGGAGAAAACGTTGGAAATGACGGACATTAACTGGCGCGATGGCCACGATCTTCGTCA



AGACATTATAGATTATGAAATTGTTCAGCACATATTCGAAATTTTCCGTTTAACAGTGCAAATGCGTA



ACTCCTTGTCTGAACTGGAGGACCGTGATTACGATCGTCTCATTTCACCTGTACTGAACGAAAATAAC



ATTTTTTATGACAGCGCGAAAGCGGGGGATGCACTTCCTAAGGATGCCGATGCAAATGGTGCGTATTG



TATTGCATTAAAAGGGTTATATGAAATTAAACAAATTACCGAAAATTGGAAAGAAGATGGTAAATTT



TCGCGCGATAAACTCAAAATCAGCAATAAAGATTGGTTCGACTTTATCCAGAATAAGCGCTATCTCTA



AGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATTTATTATATCGCGTTGATTATTGA



TGCTGTTTTTAGTTTTAACGGCAATTAATATATGTGTTATTAATTGAATGAATTTTATCATTCATAATA



AGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGG



ATTACAGAATTATCTCATAACAAGTGTTAAGGGATGTTATTTCC





SEQ ID NO: 68
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCG



TCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCG



GGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGA



TTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTG



ACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCA



CTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCA



CCATCATCATCACCATACCAATAAATTCACTAACCAGTATTCTCTCTCTAAGACCCTGCGCTTTGAACT



GATTCCGCAGGGGAAAACCTTGGAGTTCATTCAAGAAAAAGGCCTCTTGTCTCAGGATAAACAGAGG



GCTGAATCTTACCAAGAAATGAAGAAAACTATTGATAAGTTTCATAAATATTTCATTGATTTAGCCTT



GTCTAACGCCAAATTAACTCACTTGGAAACGTATCTGGAGTTATACAACAAATCTGCCGAAACTAAGA



AAGAACAGAAATTTAAAGACGATTTGAAAAAAGTACAGGACAATCTGCGTAAAGAAATTGTCAAATC



CTTCAGTGACGGCGATGCTAAAAGCATTTTTGCCATTCTGGACAAAAAAGAGTTGATTACTGTGGAAT



TAGAAAAGTGGTTTGAAAACAATGAGCAGAAAGACATCTACTTCGATGAGAAATTCAAAACTTTCAC



CACCTATTTTACAGGATTTCATCAAAACCGGAAGAACATGTACTCAGTAGAACCGAACTCCACGGCCA



TTGCGTATCGTTTGATCCATGAGAATCTGCCTAAATTTCTGGAGAATGCGAAAGCCTTTGAAAAGATT



AAGCAGGTCGAATCGCTGCAAGTGAATTTTCGTGAACTCATGGGCGAATTTGGTGACGAAGGTCTAAT



CTTCGTTAACGAACTGGAAGAAATGTTTCAGATTAATTACTACAATGACGTGCTATCGCAGAACGGTA



TCACAATCTACAATAGTATTATCTCAGGGTTCACAAAAAACGATATAAAATACAAAGGCCTGAACGA



GTATATCAATAACTACAACCAAACAAAGGACAAAAAGGATAGGCTTCCGAAACTGAAGCAGTTATAC



AAACAGATTTTATCTGACAGAATCTCCCTGAGCTTTCTGCCGGATGCTTTCACTGATGGGAAGCAGGT



TCTGAAAGCGATTTTCGATTTTTATAAGATTAACTTACTGAGCTACACGATTGAAGGTCAAGAAGAAT



CTCAAAACTTACTGCTCTTGATCCGTCAAACCATTGAAAATCTATCATCGTTCGATACGCAGAAAATC



TACCTCAAAAACGATACTCACCTGACTACGATCTCTCAGCAGGTTTTCGGGGATTTTAGTGTATTTTCA



ACAGCTCTGAACTACTGGTATGAAACCAAAGTCAATCCGAAATTCGAGACGGAATATTCTAAGGCCA



ACGAAAAAAAACGTGAGATTCTTGATAAAGCTAAAGCCGTATTTACTAAACAGGATTACTTTTCTATT



GCTTTCCTGCAGGAAGTTTTATCGGAGTATATCCTGACCCTGGATCATACATCTGATATCGTTAAAAA



ACACAGCAGCAATTGCATCGCTGACTATTTCAAAAACCACTTTGTCGCCAAAAAAGAAAACGAAACA



GACAAGACTTTCGATTTCATTGCTAACATCACCGCAAAATACCAGTGTATTCAGGGTATCTTGGAAAA



CGCCGACCAATACGAAGACGAACTGAAACAAGATCAGAAGCTGATCGATAATTTAAAATTCTTCTTA



GATGCAATCCTGGAGCTGCTGCACTTCATCAAACCGCTTCATTTAAAGAGCGAGTCCATTACCGAAAA



GGACACCGCCTTCTATGACGTTTTTGAAAATTATTATGAAGCCCTCTCCTTGCTGACTCCGCTGTATAA



TATGGTACGCAATTACGTAACCCAGAAACCATATTCTACCGAAAAAATTAAACTGAACTTTGAAAAC



GCACAGCTGCTCAACGGTTGGGACGCGAATAAAGAAGGTGACTACCTCACCACCATCCTGAAAAAAG



ATGGTAACTATTTTCTGGCAATTATGGATAAGAAACATAATAAAGCATTCCAGAAATTTCCTGAAGGG



AAAGAAAATTACGAAAAGATGGTGTACAAACTCTTACCTGGAGTTAACAAAATGTTGCCGAAAGTAT



TTTTTAGTAATAAGAACATCGCGTACTTTAACCCGTCCAAAGAACTGCTGGAAAATTATAAAAAGGAG



ACGCATAAGAAAGGGGATACCTTTAACCTGGAACATTGCCATACCTTAATAGACTTCTTCAAGGATTC



CCTGAATAAACACGAGGATTGGAAATATTTCGATTTTCAGTTTAGTGAGACCAAGTCATACCAGGATC



TTAGCGGCTTTTATCGCGAAGTAGAACACCAAGGCTATAAAATTAACTTCAAAAACATCGACAGCGA



ATACATCGACGGTTTAGTTAACGAGGGCAAACTGTTTCTGTTCCAGATCTATTCAAAGGATTTTAGCC



CGTTCTCTAAAGGCAAACCAAATATGCATACGTTGTACTGGAAAGCACTGTTTGAAGAGCAAAACCT



GCAGAATGTGATTTATAAACTGAACGGCCAAGCTGAGATTTTTTTCCGTAAAGCCTCGATTAAACCGA



AAAATATCATCCTTCATAAGAAGAAAATAAAGATCGCTAAAAAACACTTCATAGATAAAAAAACCAA



AACCTCCGAAATAGTGCCTGTTCAAACAATTAAGAACTTGAATATGTACTACCAGGGCAAGATATCG



GAAAAGGAGTTGACTCAAGACGATCTTCGCTATATCGATAACTTTTCGATTTTTAACGAAAAAAACAA



GACGATCGACATCATCAAAGATAAACGCTTCACTGTAGATAAGTTCCAGTTTCATGTGCCGATTACTA



TGAACTTCAAAGCTACCGGGGGTAGCTATATCAACCAAACGGTGTTGGAATACCTGCAGAATAACCC



GGAAGTCAAAATCATTGGGCTGGACCGCGGAGAACGTCACCTTGTGTACTTGACCTTAATCGATCAGC



AAGGCAACATCTTAAAACAAGAATCGCTGAATACCATTACGGATTCAAAGATTAGCACCCCGTATCA



TAAGCTGCTCGATAACAAGGAGAATGAGCGCGACCTGGCCCGTAAAAACTGGGGCACGGTGGAAAA



CATTAAGGAGTTAAAGGAGGGTTATATTTCCCAGGTAGTGCATAAGATCGCCACTCTCATGCTCGAGG



AAAATGCGATCGTTGTCATGGAAGACTTAAACTTCGGATTTAAACGTGGGCGATTTAAAGTAGAGAA



ACAAATCTACCAGAAGTTAGAAAAAATGCTGATTGACAAATTAAATTACTTGGTCCTAAAAGACAAA



CAGCCGCAAGAATTGGGTGGATTATACAACGCCCTCCAACTTACCAATAAATTCGAAAGTTTTCAGAA



AATGGGTAAACAGTCAGGCTTTCTTTTTTATGTTCCTGCGTGGAACACATCCAAAATCGACCCTACAA



CCGGCTTCGTCAATTACTTCTATACTAAATATGAAAACGTCGACAAAGCAAAAGCATTCTTTGAAAAG



TTCGAAGCAATACGTTTTAACGCTGAGAAAAAATATTTCGAGTTCGAAGTCAAGAAATACTCAGACTT



TAACCCCAAAGCTGAGGGCACACAGCAAGCGTGGACAATCTGCACCTACGGCGAGCGCATCGAAACG



AAGCGTCAAAAAGATCAGAATAACAAATTTGTTTCAACACCTATCAACCTGACCGAGAAGATTGAAG



ACTTCTTAGGTAAAAATCAGATTGTTTATGGCGACGGTAACTGTATAAAATCTCAAATAGCCTCAAAG



GATGATAAAGCATTTTTCGAAACATTATTATATTGGTTCAAAATGACACTGCAGATGCGCAATAGTGA



GACGCGTACAGATATTGATTATCTTATCAGCCCGGTCATGAACGACAACGGTACTTTTTACAACTCCA



GAGACTATGAAAAACTTGAGAATCCAACTCTCCCCAAAGATGCTGATGCGAACGGTGCTTATCACATC



GCGAAAAAAGGTCTGATGCTGCTGAACAAAATCGACCAAGCCGATCTGACTAAGAAAGTTGACCTAA



GCATTTCAAATCGGGACTGGTTACAGTTTGTTCAAAAGAACAAATGAGAAATCATCCTTAGCGAAAG



CTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACT



CAGGAAGCAAAGAGGATTACA





SEQ ID NO: 69
AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAAGCATTGATAATTGA



GATCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGACAAAAATAAATTATTTATTTATCCAGA



AAATGAATTGGAAAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAGGTtg



ccgtcactgcgtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattctg



taacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataatcacggcagaaaagtcc



acattgattatttgcacggcgtcacactttgctatgccatagcatttttatccataagattagcggatcc



tacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggctagcaccgcctat



ctcgtgtgagataggcggagatacgaactttaagAAGGAGatataccATGGA



ACAGGAATATTATCTGGGCTTGGACATGGGCACCGGTTCCGTCGGCTGGGCTGTTACTGACAGTGAAT



ATCACGTTCTAAGAAAGCATGGTAAGGCATTGTGGGGTGTAAGACTTTTCGAATCTGCTTCCACTGCT



GAAGAGCGTAGAATGTTTAGAACGAGTCGACGTAGGCTAGACAGGCGCAATTGGAGAATCGAAATTT



TACAAGAAATTTTTGCGGAAGAGATATCTAAGAAAGACCCAGGCTTTTTCCTGAGAATGAAGGAATC



TAAGTATTACCCTGAGGATAAAAGAGATATAAATGGTAACTGTCCCGAATTGCCTTACGCATTATTTG



TGGACGATGATTTTACCGATAAGGATTACCATAAAAAGTTCCCAACTATCTACCATTTACGCAAAATG



TTAATGAATACAGAGGAAACCCCAGACATAAGACTAGTTTATCTGGCAATACACCATATGATGAAAC



ATAGAGGCCATTTCTTACTTTCCGGGGATATCAACGAAATCAAAGAGTTTGGTACCACATTTAGTAAG



TTACTGGAAAACATAAAGAATGAAGAATTGGATTGGAACTTAGAACTCGGAAAAGAAGAATACGCG



GTTGTCGAATCTATCCTGAAGGATAATATGCTGAATAGGTCGACCAAAAAAACTAGGCTGATCAAAG



CACTGAAAGCCAAATCTATCTGCGAAAAAGCTGTTTTAAATTTACTTGCTGGTGGCACTGTTAAGTTA



TCAGACATTTTTGGTTTGGAAGAATTGAACGAAACCGAGCGTCCAAAAATTAGTTTCGCTGATAATGG



CTACGATGATTACATTGGTGAGGTGGAAAACGAGTTGGGCGAACAATTTTATATTATAGAGACAGCT



AAGGCAGTCTATGACTGGGCTGTTTTAGTAGAAATCCTTGGTAAATACACATCTATCTCCGAAGCGAA



AGTTGCTACTTACGAAAAGCACAAGTCCGATCTCCAGTTTTTGAAGAAAATTGTCAGGAAATATCTGA



CTAAGGAAGAATATAAAGATATTTTCGTTAGTACCTCTGACAAACTGAAAAATTACTCCGCTTACATC



GGGATGACCAAGATTAATGGCAAAAAAGTTGATCTGCAAAGCAAAAGGTGTTCGAAGGAAGAATTTT



ATGATTTCATTAAAAAGAATGTCTTAAAAAAATTAGAAGGTCAGCCAGAATACGAATATTTGAAAGA



AGAACTGGAAAGAGAGACATTCTTACCAAAACAAGTCAACAGAGATAATGGGGTAATTCCATATCAA



ATTCACCTCTACGAATTAAAAAAAATTTTAGGCAATTTACGCGATAAAATTGACCTTATCAAAGAAAA



TGAGGATAAGCTGGTTCAACTCTTTGAATTCAGAATACCCTATTATGTGGGCCCACTGAACAAGATTG



ATGACGGCAAAGAAGGTAAATTCACATGGGCCGTCCGCAAATCCAATGAAAAAATTTACCCATGGAA



CTTTGAAAATGTAGTAGATATTGAAGCGTCTGCGGAGAAATTTATTCGAAGAATGACTAATAAATGCA



CTTACTTGATGGGAGAGGATGTTCTGCCTAAAGACAGCTTATTATACAGCAAGTACATGGTTCTAAAC



GAACTTAACAACGTTAAGTTGGACGGTGAGAAATTAAGTGTAGAATTGAAACAAAGATTGTATACTG



ACGTCTTCTGCAAGTACAGAAAAGTGACAGTTAAAAAAATTAAGAATTACTTGAAGTGCGAAGGTAT



AATTTCTGGAAACGTAGAGATTACTGGTATTGATGGTGATTTCAAAGCATCCCTAACAGCTTACCACG



ATTTCAAGGAAATCCTGACAGGAACTGAACTCGCAAAAAAAGATAAAGAAAACATTATTACTAATAT



TGTTCTTTTCGGTGATGACAAGAAATTGTTGAAGAAAAGACTGAATAGACTTTACCCCCAGATTACTC



CCAATCAACTTAAGAAAATTTGTGCTTTGTCTTACACAGGATGGGGTCGTTTTTCAAAAAAGTTCTTA



GAAGAGATTACCGCACCTGATCCAGAAACAGGCGAAGTATGGAATATAATTACCGCCTTATGGGAAT



CGAACAATAATCTTATGCAACTTCTGAGCAATGAATATCGTTTCATGGAAGAAGTTGAGACTTACAAC



ATGGGCAAACAGACGAAGACTTTATCCTATGAAACTGTGGAAAATATGTATGTATCACCTTCTGTCAA



GAGACAAATTTGGCAAACCTTAAAAATTGTCAAAGAATTAGAAAAGGTAATGAAGGAGTCTCCTAAA



CGTGTGTTTATTGAAATGGCTAGAGAAAAACAAGAGTCAAAAAGAACCGAGTCAAGAAAGAAGCAG



TTAATCGATTTATATAAGGCTTGTAAAAACGAAGAGAAAGATTGGGTTAAAGAATTGGGGGACCAAG



AGGAACAAAAACTACGGTCGGATAAGTTGTATTTATACTATACGCAAAAGGGACGATGTATGTATTC



CGGCGAGGTAATAGAATTGAAGGATTTATGGGACAATACAAAATATGACATAGACCATATATATCCC



CAATCAAAAACGATGGACGATAGCTTGAACAATAGAGTACTCGTGAAAAAAAAATATAATGCGACCA



AATCTGATAAGTATCCTCTGAATGAAAATATCAGACATGAAAGAAAGGGGTTCTGGAAGTCCTTGTTA



GATGGTGGGTTTATAAGCAAAGAAAAGTACGAGCGTCTAATAAGAAACACGGAGTTATCGCCAGAAG



AACTCGCTGGTTTTATTGAGAGGCAAATCGTGGAAACGAGACAATCTACCAAAGCCGTTGCTGAGAT



CCTAAAGCAAGTTTTCCCAGAGTCGGAGATTGTCTATGTCAAAGCTGGCACAGTGAGCAGGTTTAGGA



AAGACTTCGAACTATTAAAGGTAAGAGAAGTGAACGATTTACATCACGCAAAGGACGCTTACCTAAA



TATCGTTGTAGGTAACTCATATTATGTTAAATTTACCAAGAACGCCTCTTGGTTTATAAAGGAGAACC



CAGGTAGAACATATAACCTGAAAAAGATGTTCACCTCTGGTTGGAATATTGAGAGAAACGGAGAAGT



CGCATGGGAAGTTGGTAAGAAAGGGACTATAGTGACAGTAAAGCAAATTATGAACAAAAATAATATC



CTCGTTACAAGGCAGGTTCATGAAGCAAAGGGCGGCCTTTTTGACCAACAAATTATGAAGAAAGGGA



AAGGTCAAATTGCAATAAAAGAAACCGATGAGAGACTAGCGTCAATAGAAAAGTATGGTGGCTATAA



TAAAGCTGCGGGTGCATACTTTATGCTTGTTGAATCAAAAGACAAGAAAGGTAAGACTATTAGAACT



ATAGAATTTATACCCCTGTACCTTAAAAACAAAATTGAATCGGATGAGTCAATCGCGTTAAATTTTCT



AGAGAAAGGAAGGGGTTTAAAAGAACCAAAGATCCTGTTAAAAAAGATTAAGATTGACACCTTGTTC



GATGTAGATGGATTTAAAATGTGGTTATCTGGCAGAACAGGCGATAGACTTTTGTTTAAGTGCGCTAA



TCAATTAATTTTGGATGAGAAAATCATTGTCACAATGAAAAAAATAGTTAAGTTTATTCAGAGAAGAC



AAGAAAACAGGGAGTTGAAATTATCTGATAAAGATGGTATCGACAATGAAGTTTTAATGGAAATCTA



CAATACATTCGTTGATAAACTTGAAAATACCGTATATCGAATCAGGTTAAGTGAACAAGCCAAAACA



TTAATTGATAAACAAAAAGAATTTGAAAGGCTATCACTGGAAGACAAATCCTCCACCCTATTTGAAAT



TTTGCATATATTCCAGTGCCAATCTTCAGCAGCTAATTTAAAAATGATTGGCGGACCTGGGAAAGCCG



GCATCCTAGTGATGAACAATAATATCTCCAAGTGTAACAAAATATCAATTATTAACCAATCTCCGACA



GGTATTTTTGAAAATGAAATAGACTTGCTTAAGATATAAGAAATCATCCTTAGCGAAAGCTAAGGATT



TTTTTTATCTGAAATTTATTATATCGCGTTGATTATTGATGCTGTTTTTAGTTTTAACGGCAATTAATAT



ATGTGTTATTAATTGAATGAATTTTATCATTCATAATAAGTATGTGTAGGATCAAGCTCAGGTTAAAT



ATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACAGAATTATCTCATAACAAGTGTTAA



GGGATGTTATTTCC





SEQ ID NO: 70
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCG



TCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCG



GGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGA



TTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTG



ACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCA



CTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCA



CCATCATCATCACCATTCTTTCGACTCTTTCACCAACCTGTACTCTCTGTCTAAAACCCTGAAATTCGA



AATGCGTCCGGTTGGTAACACCCAGAAAATGCTGGACAACGCGGGTGTTTTCGAAAAAGACAAACTG



ATCCAGAAAAAATACGGTAAAACCAAACCGTACTTCGACCGTCTGCACCGTGAATTCATCGAAGAAG



CGCTGACCGGTGTTGAACTGATCGGTCTGGACGAAAACTTCCGTACCCTGGTTGACTGGCAGAAAGAC



AAAAAAAACAACGTTGCGATGAAAGCGTACGAAAACTCTCTGCAGCGTCTGCGTACCGAAATCGGTA



AAATCTTCAACCTGAAAGCGGAAGACTGGGTTAAAAACAAATACCCGATCCTGGGTCTGAAAAACAA



AAACACCGACATCCTGTTCGAAGAAGCGGTTTTCGGTATCCTGAAAGCGCGTTACGGTGAAGAAAAA



GACACCTTCATCGAAGTTGAAGAAATCGACAAAACCGGTAAATCTAAAATCAACCAGATCTCTATCTT



CGACTCTTGGAAAGGTTTCACCGGTTACTTCAAAAAATTCTTCGAAACCCGTAAAAACTTCTACAAAA



ACGACGGTACCTCTACCGCGATCGCGACCCGTATCATCGACCAGAACCTGAAACGTTTCATCGACAAC



CTGTCTATCGTTGAATCTGTTCGTCAGAAAGTTGACCTGGCGGAAACCGAAAAATCTTTCTCTATCTCT



CTGTCTCAGTTCTTCTCTATCGACTTCTACAACAAATGCCTGCTGCAGGACGGTATCGACTACTACAAC



AAAATCATCGGTGGTGAAACCCTGAAAAACGGTGAAAAACTGATCGGTCTGAACGAACTGATCAACC



AGTACCGTCAGAACAACAAAGACCAGAAAATCCCGTTCTTCAAACTGCTGGACAAACAGATCCTGTC



TGAAAAAATCCTGTTCCTGGACGAAATCAAAAACGACACCGAACTGATCGAAGCGCTGTCTCAGTTC



GCGAAAACCGCGGAAGAAAAAACCAAAATCGTTAAAAAACTGTTCGCGGACTTCGTTGAAAACAACT



CTAAATACGACCTGGCGCAGATCTACATCTCTCAGGAAGCGTTCAACACCATCTCTAACAAATGGACC



TCTGAAACCGAAACCTTCGCGAAATACCTGTTCGAAGCGATGAAATCTGGTAAACTGGCGAAATACG



AAAAAAAAGACAACTCTTACAAATTCCCGGACTTCATCGCGCTGTCTCAGATGAAATCTGCGCTGCTG



TCTATCTCTCTGGAAGGTCACTTCTGGAAAGAAAAATACTACAAAATCTCTAAATTCCAGGAAAAAAC



CAACTGGGAACAGTTCCTGGCGATCTTCCTGTACGAATTCAACTCTCTGTTCTCTGACAAAATCAACA



CCAAAGACGGTGAAACCAAACAGGTTGGTTACTACCTGTTCGCGAAAGACCTGCACAACCTGATCCT



GTCTGAACAGATCGACATCCCGAAAGACTCTAAAGTTACCATCAAAGACTTCGCGGACTCTGTTCTGA



CCATCTACCAGATGGCGAAATACTTCGCGGTTGAAAAAAAACGTGCGTGGCTGGCGGAATACGAACT



GGACTCTTTCTACACCCAGCCGGACACCGGTTACCTGCAGTTCTACGACAACGCGTACGAAGACATCG



TTCAGGTTTACAACAAACTGCGTAACTACCTGACCAAAAAACCGTACTCTGAAGAAAAATGGAAACT



GAACTTCGAAAACTCTACCCTGGCGAACGGTTGGGACAAAAACAAAGAATCTGACAACTCTGCGGTT



ATCCTGCAGAAAGGTGGTAAATACTACCTGGGTCTGATCACCAAAGGTCACAACAAAATCTTCGACG



ACCGTTTCCAGGAAAAATTCATCGTTGGTATCGAAGGTGGTAAATACGAAAAAATCGTTTACAAATTC



TTCCCGGACCAGGCGAAAATGTTCCCGAAAGTTTGCTTCTCTGCGAAAGGTCTGGAATTCTTCCGTCC



GTCTGAAGAAATCCTGCGTATCTACAACAACGCGGAATTCAAAAAAGGTGAAACCTACTCTATCGAC



TCTATGCAGAAACTGATCGACTTCTACAAAGACTGCCTGACCAAATACGAAGGTTGGGCGTGCTACAC



CTTCCGTCACCTGAAACCGACCGAAGAATACCAGAACAACATCGGTGAATTCTTCCGTGACGTTGCGG



AAGACGGTTACCGTATCGACTTCCAGGGTATCTCTGACCAGTACATCCACGAAAAAAACGAAAAAGG



TGAACTGCACCTGTTCGAAATCCACAACAAAGACTGGAACCTGGACAAAGCGCGTGACGGTAAATCT



AAAACCACCCAGAAAAACCTGCACACCCTGTACTTCGAATCTCTGTTCTCTAACGACAACGTTGTTCA



GAACTTCCCGATCAAACTGAACGGTCAGGCGGAAATCTTCTACCGTCCGAAAACCGAAAAAGACAAA



CTGGAATCTAAAAAAGACAAAAAAGGTAACAAAGTTATCGACCACAAACGTTACTCTGAAAACAAAA



TCTTCTTCCACGTTCCGCTGACCCTGAACCGTACCAAAAACGACTCTTACCGTTTCAACGCGCAGATC



AACAACTTCCTGGCGAACAACAAAGACATCAACATCATCGGTGTTGACCGTGGTGAAAAACACCTGG



TTTACTACTCTGTTATCACCCAGGCGTCTGACATCCTGGAATCTGGTTCTCTGAACGAACTGAACGGTG



TTAACTACGCGGAAAAACTGGGTAAAAAAGCGGAAAACCGTGAACAGGCGCGTCGTGACTGGCAGG



ACGTTCAGGGTATCAAAGACCTGAAAAAAGGTTACATCTCTCAGGTTGTTCGTAAACTGGCGGACCTG



GCGATCAAACACAACGCGATCATCATCCTGGAAGACCTGAACATGCGTTTCAAACAGGTTCGTGGTG



GTATCGAAAAATCTATCTACCAGCAGCTGGAAAAAGCGCTGATCGACAAACTGTCTTTCCTGGTTGAC



AAAGGTGAAAAAAACCCGGAACAGGCGGGTCACCTGCTGAAAGCGTACCAGCTGTCTGCGCCGTTCG



AAACCTTCCAGAAAATGGGTAAACAGACCGGTATCATCTTCTACACCCAGGCGTCTTACACCTCTAAA



TCTGACCCGGTTACCGGTTGGCGTCCGCACCTGTACCTGAAATACTTCTCTGCGAAAAAAGCGAAAGA



CGACATCGCGAAATTCACCAAAATCGAATTCGTTAACGACCGTTTCGAACTGACCTACGACATCAAAG



ACTTCCAGCAGGCGAAAGAATACCCGAACAAAACCGTTTGGAAAGTTTGCTCTAACGTTGAACGTTTC



CGTTGGGACAAAAACCTGAACCAGAACAAAGGTGGTTACACCCACTACACCAACATCACCGAAAACA



TCCAGGAACTGTTCACCAAATACGGTATCGACATCACCAAAGACCTGCTGACCCAGATCTCTACCATC



GACGAAAAACAGAACACCTCTTTCTTCCGTGACTTCATCTTCTACTTCAACCTGATCTGCCAGATCCGT



AACACCGACGACTCTGAAATCGCGAAAAAAAACGGTAAAGACGACTTCATCCTGTCTCCGGTTGAAC



CGTTCTTCGACTCTCGTAAAGACAACGGTAACAAACTGCCGGAAAACGGTGACGACAACGGTGCGTA



CAACATCGCGCGTAAAGGTATCGTTATCCTGAACAAAATCTCTCAGTACTCTGAAAAAAACGAAAAC



TGCGAAAAAATGAAATGGGGTGACCTGTACGTTTCTAACATCGACTGGGACAACTTCGTTGAAATCAT



CCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCA



GGAAGTTATTACTCAGGAAGCAAAGAGGATTACA





SEQ ID NO: 71
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCG



TCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCG



GGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGA



TTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTG



ACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCA



CTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCA



CCATCATCATCACCATAACAAATTCGAAAACTTCACCGGTCTGTACCCGATCTCTAAAACCCTGCGTT



TCGAACTGATCCCGCAGGGTAAAACCCTGGAATACATCGAAAAATCTGAAATCCTGGAAAACGACAA



CTACCGTGCGGAAAAATACGAAGAAGTTAAAGACATCATCGACGGTTACCACAAATGGTTCATCAAC



GAAACCCTGCACGACCTGCACATCAACTGGTCTGAACTGAAAGTTGCGCTGGAAAACAACCGTATCG



AAAAATCTGACGCGTCTAAAAAAGAACTGCAGCGTGTTCAGAAAATCAAACGTGAAGAAATCTACAA



CGCGTTCATCGAACACGAAGCGTTCCAGTACCTGTTCAAAGAAAACCTGCTGTCTGACCTGCTGCCGA



TCCAGATCGAACAGTCTGAAGACCTGGACGCGGAAAAAAAAAAACAGGCGGTTGAAACCTTCAACCG



TTTCTCTACCTACTTCACCGGTTTCCACGAAAACCGTAAAAACATCTACTCTAAAGAAGGTATCTCTAC



CTCTGTTACCTACCGTATCGTTCACGACAACTTCCCGAAATTCCTGGAAAACATGAAAGTTTTCGAAA



TCCTGCGTAACGAATGCCCGGAAGTTATCTCTGACACCGCGAACGAACTGGCGCCGTTCATCGACGGT



GTTCGTATCGAAGACATCTTCCTGATCGACTTCTTCAACTCTACCTTCTCTCAGAACGGTATCGACTAC



TACAACCGTATCCTGGGTGGTGTTACCACCGAAACCGGTGAAAAATACCGTGGTATCAACGAATTCAC



CAACCTGTACCGTCAGCAGCACCCGGAATTCGGTAAATCTAAAAAAGCGACCAAAATGGTTGTTCTGT



TCAAACAGATCCTGTCTGACCGTGACACCCTGTCTTTCATCCCGGAAATGTTCGGTAACGACAAACAG



GTTCAGAACTCTATCCAGCTGTTCTACAACCGTGAAATCTCTCAGTTCGAAAACGAAGGTGTTAAAAC



CGACGTTTGCACCGCGCTGGCGACCCTGACCTCTAAAATCGCGGAATTCGACACCGAAAAAATCTAC



ATCCAGCAGCCGGAACTGCCGAACGTTTCTCAGCGTCTGTTCGGTTCTTGGAACGAACTGAACGCGTG



CCTGTTCAAATACGCGGAACTGAAATTCGGTACCGCGGAAAAAGTTGCGAACCGTAAAAAAATCGAC



AAATGGCTGAAATCTGACCTGTTCTCTTTCACCGAACTGAACAAAGCGCTGGAATTCTCTGGTAAAGA



CGAACGTATCGAAAACTACTTCTCTGAAACCGGTATCTTCGCGCAGCTGGTTAAAACCGGTTTCGACG



AAGCGCAGTCTATCCTGGAAACCGAATACACCTCTGAAGTTCACCTGAAAGACCAGCAGACCGACAT



CGAAAAAATCAAAACCTTCCTGGACGCGCTGCAGAACCTGATGCACCTGCTGAAATCTCTGTGCGTTT



CTGAAGAAGCGGACCGTGACGCGGCGTTCTACAACGAATTCGACATGCTGTACAACCAGCTGAAACT



GGTTGTTCCGCTGTACAACAAAGTTCGTAACTACATCACCCAGAAACTGTTCCGTTCTGACAAAATCA



AAATCTACTTCGAAAACAAAGGTCAGTTCCTGGGTGGTTGGGTTGACTCTCAGACCGAAAACTCTGAC



AACGGTACCCAGGCGGGTGGTTACATCTTCCGTAAAGAAAACGTTATCAACGAATACGACTACTACCT



GGGTATCTGCTCTGACCCGAAACTGTTCCGTCGTACCACCATCGTTTCTGAAAACGACCGTTCTTCTTT



CGAACGTCTGGACTACTACCAGCTGAAAACCGCGTCTGTTTACGGTAACTCTTACTGCGGTAAACACC



CGTACACCGAAGACAAAAACGAACTGGTTAACTCTATCGACCGTTTCGTTCACCTGTCTGGTAACAAC



ATCCTGATCGAAAAAATCGCGAAAGACAAAGTTAAATCTAACCCGACCACCAACACCCCGTCTGGTT



ACCTGAACTTCATCCACCGTGAAGCGCCGAACACCTACGAATGCCTGCTGCAGGACGAAAACTTCGTT



TCTCTGAACCAGCGTGTTGTTTCTGCGCTGAAAGCGACCCTGGCGACCCTGGTTCGTGTTCCGAAAGC



GCTGGTTTACGCGAAAAAAGACTACCACCTGTTCTCTGAAATCATCAACGACATCGACGAACTGTCTT



ACGAAAAAGCGTTCTCTTACTTCCCGGTTTCTCAGACCGAATTCGAAAACTCTTCTAACCGTACCATC



AAACCGCTGCTGCTGTTCAAAATCTCTAACAAAGACCTGTCTTTCGCGGAAAACTTCGAAAAAGGTAA



CCGTCAGAAAATCGGTAAAAAAAACCTGCACACCCTGTACTTCGAAGCGCTGATGAAAGGTAACCAG



GACACCATCGACATCGGTACCGGTATGGTTTTCCACCGTGTTAAATCTCTGAACTACAACGAAAAAAC



CCTGAAATACGGTCACCACTCTACCCAGCTGAACGAAAAATTCTCTTACCCGATCATCAAAGACAAAC



GTTTCGCGTCTGACAAATTCCTGTTCCACCTGTCTACCGAAATCAACTACAAAGAAAAACGTAAACCG



CTGAACAACTCTATCATCGAATTCCTGACCAACAACCCGGACATCAACATCATCGGTCTGGACCGTGG



TGAACGTCACCTGATCTACCTGACCCTGATCAACCAGAAAGGTGAAATCCTGCGTCAGAAAACCTTCA



ACATCGTTGGTAACACCAACTACCACGAAAAACTGAACCAGCGTGAAAAAGAACGTGACAACGCGCG



TAAATCTTGGGCGACCATCGGTAAAATCAAAGAACTGAAAGAAGGTTTCCTGTCTCTGGTTATCCACG



AAATCGCGAAAATCATGGTTGAAAACAACGCGATCGTTGTTCTGGAAGACCTGAACTTCGGTTTCAAA



CGTGGTCGTTTCAAAGTTGAAAAACAGATCTACCAGAAATTCGAAAAAATGCTGATCGACAAACTGA



ACTACCTGGTTTTCAAAGACAAAAAAGCGAACGAAGCGGGTGGTGTTCTGAAAGGTTACCAGCTGGC



GGAAAAATTCGAATCTTTCCAGAAAATGGGTAAACAGTCTGGTTTCCTGTTCTACGTTCCGGCGGCGT



ACACCTCTAAAATCGACCCGACCACCGGTTTCGTTAACATGCTGAACCTGAACTACACCAACATGAAA



GACGCGCAGACCCTGCTGTCTGGTATGGACAAAATCTCTTTCAACGCGGACGCGAACTACTTCGAATT



CGAACTGGACTACGAAAAATTCAAAACCAACCAGACCGACCACACCAACAAATGGACCATCTGCACC



GTTGGTGAAAAACGTTTCACCTACAACTCTGCGACCAAAGAAACCACCACCGTTAACGTTACCGAAG



ACCTGAAAAAACTGCTGGACAAATTCGAAGTTAAATACTCTAACGGTGACAACATCAAAGACGAAAT



CTGCCGTCAGACCGACGCGAAATTCTTCGAAATCATCCTGTGGCTGCTGAAACTGACCATGCAGATGC



GTAACTCTAACACCAAAACCGAAGAAGACTTCATCCTGTCTCCGGTTAAAAACTCTAACGGTGAATTC



TTCCGTTCTAACGACGACGCGAACGGTATCTGGCCGGCGGACGCGGACGCGAACGGTGCGTACCACA



TCGCGCTGAAAGGTCTGTACCTGGTTAAAGAATGCTTCAACAAAAACGAAAAATCTCTGAAAATCGA



ACACAAAAACTGGTTCAAATTCGCGCAGACCCGTTTCAACGGTTCTCTGACCAAAAACGGTTAAGAA



ATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTC



ACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA





SEQ ID NO: 72
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCG



TCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCG



GGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGA



TTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTG



ACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCA



CTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCA



CCATCATCATCACCATACCCAGTTCGAAGGTTTCACCAACCTGTACCAGGTTTCTAAAACCCTGCGTTT



CGAACTGATCCCGCAGGGTAAAACCCTGAAACACATCCAGGAACAGGGTTTCATCGAAGAAGACAAA



GCGCGTAACGACCACTACAAAGAACTGAAACCGATCATCGACCGTATCTACAAAACCTACGCGGACC



AGTGCCTGCAGCTGGTTCAGCTGGACTGGGAAAACCTGTCTGCGGCGATCGACTCTTACCGTAAAGAA



AAAACCGAAGAAACCCGTAACGCGCTGATCGAAGAACAGGCGACCTACCGTAACGCGATCCACGACT



ACTTCATCGGTCGTACCGACAACCTGACCGACGCGATCAACAAACGTCACGCGGAAATCTACAAAGG



TCTGTTCAAAGCGGAACTGTTCAACGGTAAAGTTCTGAAACAGCTGGGTACCGTTACCACCACCGAAC



ACGAAAACGCGCTGCTGCGTTCTTTCGACAAATTCACCACCTACTTCTCTGGTTTCTACGAAAACCGT



AAAAACGTTTTCTCTGCGGAAGACATCTCTACCGCGATCCCGCACCGTATCGTTCAGGACAACTTCCC



GAAATTCAAAGAAAACTGCCACATCTTCACCCGTCTGATCACCGCGGTTCCGTCTCTGCGTGAACACT



TCGAAAACGTTAAAAAAGCGATCGGTATCTTCGTTTCTACCTCTATCGAAGAAGTTTTCTCTTTCCCGT



TCTACAACCAGCTGCTGACCCAGACCCAGATCGACCTGTACAACCAGCTGCTGGGTGGTATCTCTCGT



GAAGCGGGTACCGAAAAAATCAAAGGTCTGAACGAAGTTCTGAACCTGGCGATCCAGAAAAACGAC



GAAACCGCGCACATCATCGCGTCTCTGCCGCACCGTTTCATCCCGCTGTTCAAACAGATCCTGTCTGA



CCGTAACACCCTGTCTTTCATCCTGGAAGAATTCAAATCTGACGAAGAAGTTATCCAGTCTTTCTGCA



AATACAAAACCCTGCTGCGTAACGAAAACGTTCTGGAAACCGCGGAAGCGCTGTTCAACGAACTGAA



CTCTATCGACCTGACCCACATCTTCATCTCTCACAAAAAACTGGAAACCATCTCTTCTGCGCTGTGCGA



CCACTGGGACACCCTGCGTAACGCGCTGTACGAACGTCGTATCTCTGAACTGACCGGTAAAATCACCA



AATCTGCGAAAGAAAAAGTTCAGCGTTCTCTGAAACACGAAGACATCAACCTGCAGGAAATCATCTC



TGCGGCGGGTAAAGAACTGTCTGAAGCGTTCAAACAGAAAACCTCTGAAATCCTGTCTCACGCGCAC



GCGGCGCTGGACCAGCCGCTGCCGACCACCCTGAAAAAACAGGAAGAAAAAGAAATCCTGAAATCTC



AGCTGGACTCTCTGCTGGGTCTGTACCACCTGCTGGACTGGTTCGCGGTTGACGAATCTAACGAAGTT



GACCCGGAATTCTCTGCGCGTCTGACCGGTATCAAACTGGAAATGGAACCGTCTCTGTCTTTCTACAA



CAAAGCGCGTAACTACGCGACCAAAAAACCGTACTCTGTTGAAAAATTCAAACTGAACTTCCAGATG



CCGACCCTGGCGTCTGGTTGGGACGTTAACAAAGAAAAAAACAACGGTGCGATCCTGTTCGTTAAAA



ACGGTCTGTACTACCTGGGTATCATGCCGAAACAGAAAGGTCGTTACAAAGCGCTGTCTTTCGAACCG



ACCGAAAAAACCTCTGAAGGTTTCGACAAAATGTACTACGACTACTTCCCGGACGCGGCGAAAATGA



TCCCGAAATGCTCTACCCAGCTGAAAGCGGTTACCGCGCACTTCCAGACCCACACCACCCCGATCCTG



CTGTCTAACAACTTCATCGAACCGCTGGAAATCACCAAAGAAATCTACGACCTGAACAACCCGGAAA



AAGAACCGAAAAAATTCCAGACCGCGTACGCGAAAAAAACCGGTGACCAGAAAGGTTACCGTGAAG



CGCTGTGCAAATGGATCGACTTCACCCGTGACTTCCTGTCTAAATACACCAAAACCACCTCTATCGAC



CTGTCTTCTCTGCGTCCGTCTTCTCAGTACAAAGACCTGGGTGAATACTACGCGGAACTGAACCCGCT



GCTGTACCACATCTCTTTCCAGCGTATCGCGGAAAAAGAAATCATGGACGCGGTTGAAACCGGTAAA



CTGTACCTGTTCCAGATCTACAACAAAGACTTCGCGAAAGGTCACCACGGTAAACCGAACCTGCACA



CCCTGTACTGGACCGGTCTGTTCTCTCCGGAAAACCTGGCGAAAACCTCTATCAAACTGAACGGTCAG



GCGGAACTGTTCTACCGTCCGAAATCTCGTATGAAACGTATGGCGCACCGTCTGGGTGAAAAAATGCT



GAACAAAAAACTGAAAGACCAGAAAACCCCGATCCCGGACACCCTGTACCAGGAACTGTACGACTAC



GTTAACCACCGTCTGTCTCACGACCTGTCTGACGAAGCGCGTGCGCTGCTGCCGAACGTTATCACCAA



AGAAGTTTCTCACGAAATCATCAAAGACCGTCGTTTCACCTCTGACAAATTCTTCTTCCACGTTCCGAT



CACCCTGAACTACCAGGCGGCGAACTCTCCGTCTAAATTCAACCAGCGTGTTAACGCGTACCTGAAAG



AACACCCGGAAACCCCGATCATCGGTATCGACCGTGGTGAACGTAACCTGATCTACATCACCGTTATC



GACTCTACCGGTAAAATCCTGGAACAGCGTTCTCTGAACACCATCCAGCAGTTCGACTACCAGAAAA



AACTGGACAACCGTGAAAAAGAACGTGTTGCGGCGCGTCAGGCGTGGTCTGTTGTTGGTACCATCAA



AGACCTGAAACAGGGTTACCTGTCTCAGGTTATCCACGAAATCGTTGACCTGATGATCCACTACCAGG



CGGTTGTTGTTCTGGAAAACCTGAACTTCGGTTTCAAATCTAAACGTACCGGTATCGCGGAAAAAGCG



GTTTACCAGCAGTTCGAAAAAATGCTGATCGACAAACTGAACTGCCTGGTTCTGAAAGACTACCCGGC



GGAAAAAGTTGGTGGTGTTCTGAACCCGTACCAGCTGACCGACCAGTTCACCTCTTTCGCGAAAATGG



GTACCCAGTCTGGTTTCCTGTTCTACGTTCCGGCGCCGTACACCTCTAAAATCGACCCGCTGACCGGTT



TCGTTGACCCGTTCGTTTGGAAAACCATCAAAAACCACGAATCTCGTAAACACTTCCTGGAAGGTTTC



GACTTCCTGCACTACGACGTTAAAACCGGTGACTTCATCCTGCACTTCAAAATGAACCGTAACCTGTC



TTTCCAGCGTGGTCTGCCGGGTTTCATGCCGGCGTGGGACATCGTTTTCGAAAAAAACGAAACCCAGT



TCGACGCGAAAGGTACCCCGTTCATCGCGGGTAAACGTATCGTTCCGGTTATCGAAAACCACCGTTTC



ACCGGTCGTTACCGTGACCTGTACCCGGCGAACGAACTGATCGCGCTGCTGGAAGAAAAAGGTATCG



TTTTCCGTGACGGTTCTAACATCCTGCCGAAACTGCTGGAAAACGACGACTCTCACGCGATCGACACC



ATGGTTGCGCTGATCCGTTCTGTTCTGCAGATGCGTAACTCTAACGCGGCGACCGGTGAAGACTACAT



CAACTCTCCGGTTCGTGACCTGAACGGTGTTTGCTTCGACTCTCGTTTCCAGAACCCGGAATGGCCGA



TGGACGCGGACGCGAACGGTGCGTACCACATCGCGCTGAAAGGTCAGCTGCTGCTGAACCACCTGAA



AGAATCTAAAGACCTGAAACTGCAGAACGGTATCTCTAACCAGGACTGGCTGGCGTACATCCAGGAA



CTGCGTAACTAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCT



CAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA





SEQ ID NO: 73
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCG



TCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCG



GGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGA



TTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTG



ACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCA



CTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCA



CCATCATCATCACCATGCGGTTAAATCTATCAAAGTTAAACTGCGTCTGGACGACATGCCGGAAATCC



GTGCGGGTCTGTGGAAACTGCACAAAGAAGTTAACGCGGGTGTTCGTTACTACACCGAATGGCTGTCT



CTGCTGCGTCAGGAAAACCTGTACCGTCGTTCTCCGAACGGTGACGGTGAACAGGAATGCGACAAAA



CCGCGGAAGAATGCAAAGCGGAACTGCTGGAACGTCTGCGTGCGCGTCAGGTTGAAAACGGTCACCG



TGGTCCGGCGGGTTCTGACGACGAACTGCTGCAGCTGGCGCGTCAGCTGTACGAACTGCTGGTTCCGC



AGGCGATCGGTGCGAAAGGTGACGCGCAGCAGATCGCGCGTAAATTCCTGTCTCCGCTGGCGGACAA



AGACGCGGTTGGTGGTCTGGGTATCGCGAAAGCGGGTAACAAACCGCGTTGGGTTCGTATGCGTGAA



GCGGGTGAACCGGGTTGGGAAGAAGAAAAAGAAAAAGCGGAAACCCGTAAATCTGCGGACCGTACC



GCGGACGTTCTGCGTGCGCTGGCGGACTTCGGTCTGAAACCGCTGATGCGTGTTTACACCGACTCTGA



AATGTCTTCTGTTGAATGGAAACCGCTGCGTAAAGGTCAGGCGGTTCGTACCTGGGACCGTGACATGT



TCCAGCAGGCGATCGAACGTATGATGTCTTGGGAATCTTGGAACCAGCGTGTTGGTCAGGAATACGC



GAAACTGGTTGAACAGAAAAACCGTTTCGAACAGAAAAACTTCGTTGGTCAGGAACACCTGGTTCAC



CTGGTTAACCAGCTGCAGCAGGACATGAAAGAAGCGTCTCCGGGTCTGGAATCTAAAGAACAGACCG



CGCACTACGTTACCGGTCGTGCGCTGCGTGGTTCTGACAAAGTTTTCGAAAAATGGGGTAAACTGGCG



CCGGACGCGCCGTTCGACCTGTACGACGCGGAAATCAAAAACGTTCAGCGTCGTAACACCCGTCGTTT



CGGTTCTCACGACCTGTTCGCGAAACTGGCGGAACCGGAATACCAGGCGCTGTGGCGTGAAGACGCG



TCTTTCCTGACCCGTTACGCGGTTTACAACTCTATCCTGCGTAAACTGAACCACGCGAAAATGTTCGC



GACCTTCACCCTGCCGGACGCGACCGCGCACCCGATCTGGACCCGTTTCGACAAACTGGGTGGTAACC



TGCACCAGTACACCTTCCTGTTCAACGAATTCGGTGAACGTCGTCACGCGATCCGTTTCCACAAACTG



CTGAAAGTTGAAAACGGTGTTGCGCGTGAAGTTGACGACGTTACCGTTCCGATCTCTATGTCTGAACA



GCTGGACAACCTGCTGCCGCGTGACCCGAACGAACCGATCGCGCTGTACTTCCGTGACTACGGTGCGG



AACAGCACTTCACCGGTGAATTCGGTGGTGCGAAAATCCAGTGCCGTCGTGACCAGCTGGCGCACAT



GCACCGTCGTCGTGGTGCGCGTGACGTTTACCTGAACGTTTCTGTTCGTGTTCAGTCTCAGTCTGAAGC



GCGTGGTGAACGTCGTCCGCCGTACGCGGCGGTTTTCCGTCTGGTTGGTGACAACCACCGTGCGTTCG



TTCACTTCGACAAACTGTCTGACTACCTGGCGGAACACCCGGACGACGGTAAACTGGGTTCTGAAGGT



CTGCTGTCTGGTCTGCGTGTTATGTCTGTTGACCTGGGTCTGCGTACCTCTGCGTCTATCTCTGTTTTCC



GTGTTGCGCGTAAAGACGAACTGAAACCGAACTCTAAAGGTCGTGTTCCGTTCTTCTTCCCGATCAAA



GGTAACGACAACCTGGTTGCGGTTCACGAACGTTCTCAGCTGCTGAAACTGCCGGGTGAAACCGAAT



CTAAAGACCTGCGTGCGATCCGTGAAGAACGTCAGCGTACCCTGCGTCAGCTGCGTACCCAGCTGGC



GTACCTGCGTCTGCTGGTTCGTTGCGGTTCTGAAGACGTTGGTCGTCGTGAACGTTCTTGGGCGAAAC



TGATCGAACAGCCGGTTGACGCGGCGAACCACATGACCCCGGACTGGCGTGAAGCGTTCGAAAACGA



ACTGCAGAAACTGAAATCTCTGCACGGTATCTGCTCTGACAAAGAATGGATGGACGCGGTTTACGAA



TCTGTTCGTCGTGTTTGGCGTCACATGGGTAAACAGGTTCGTGACTGGCGTAAAGACGTTCGTTCTGG



TGAACGTCCGAAAATCCGTGGTTACGCGAAAGACGTTGTTGGTGGTAACTCTATCGAACAGATCGAAT



ACCTGGAACGTCAGTACAAATTCCTGAAATCTTGGTCTTTCTTCGGTAAAGTTTCTGGTCAGGTTATCC



GTGCGGAAAAAGGTTCTCGTTTCGCGATCACCCTGCGTGAACACATCGACCACGCGAAAGAAGACCG



TCTGAAAAAACTGGCGGACCGTATCATCATGGAAGCGCTGGGTTACGTTTACGCGCTGGACGAACGT



GGTAAAGGTAAATGGGTTGCGAAATACCCGCCGTGCCAGCTGATCCTGCTGGAAGAACTGTCTGAAT



ACCAGTTCAACAACGACCGTCCGCCGTCTGAAAACAACCAGCTGATGCAGTGGTCTCACCGTGGTGTT



TTCCAGGAACTGATCAACCAGGCGCAGGTTCACGACCTGCTGGTTGGTACCATGTACGCGGCGTTCTC



TTCTCGTTTCGACGCGCGTACCGGTGCGCCGGGTATCCGTTGCCGTCGTGTTCCGGCGCGTTGCACCCA



GGAACACAACCCGGAACCGTTCCCGTGGTGGCTGAACAAATTCGTTGTTGAACACACCCTGGACGCG



TGCCCGCTGCGTGCGGACGACCTGATCCCGACCGGTGAAGGTGAAATCTTCGTTTCTCCGTTCTCTGC



GGAAGAAGGTGACTTCCACCAGATCCACGCGGACCTGAACGCGGCGCAGAACCTGCAGCAGCGTCTG



TGGTCTGACTTCGACATCTCTCAGATCCGTCTGCGTTGCGACTGGGGTGAAGTTGACGGTGAACTGGT



TCTGATCCCGCGTCTGACCGGTAAACGTACCGCGGACTCTTACTCTAACAAAGTTTTCTACACCAACA



CCGGTGTTACCTACTACGAACGTGAACGTGGTAAAAAACGTCGTAAAGTTTTCGCGCAGGAAAAACT



GTCTGAAGAAGAAGCGGAACTGCTGGTTGAAGCGGACGAAGCGCGTGAAAAATCTGTTGTTCTGATG



CGTGACCCGTCTGGTATCATCAACCGTGGTAACTGGACCCGTCAGAAAGAATTCTGGTCTATGGTTAA



CCAGCGTATCGAAGGTTACCTGGTTAAACAGATCCGTTCTCGTGTTCCGCTGCAGGACTCTGCGTGCG



AAAACACCGGTGACATCTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAG



GGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA





SEQ ID NO: 74
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCG



TCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCG



GGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGA



TTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTG



ACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCA



CTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCA



CCATCATCATCACCATGCGACCCGTTCTTTCATCCTGAAAATCGAACCGAACGAAGAAGTTAAAAAAG



GTCTGTGGAAAACCCACGAAGTTCTGAACCACGGTATCGCGTACTACATGAACATCCTGAAACTGATC



CGTCAGGAAGCGATCTACGAACACCACGAACAGGACCCGAAAAACCCGAAAAAAGTTTCTAAAGCG



GAAATCCAGGCGGAACTGTGGGACTTCGTTCTGAAAATGCAGAAATGCAACTCTTTCACCCACGAAG



TTGACAAAGACGTTGTTTTCAACATCCTGCGTGAACTGTACGAAGAACTGGTTCCGTCTTCTGTTGAA



AAAAAAGGTGAAGCGAACCAGCTGTCTAACAAATTCCTGTACCCGCTGGTTGACCCGAACTCTCAGTC



TGGTAAAGGTACCGCGTCTTCTGGTCGTAAACCGCGTTGGTACAACCTGAAAATCGCGGGTGACCCGT



CTTGGGAAGAAGAAAAAAAAAAATGGGAAGAAGACAAAAAAAAAGACCCGCTGGCGAAAATCCTGG



GTAAACTGGCGGAATACGGTCTGATCCCGCTGTTCATCCCGTTCACCGACTCTAACGAACCGATCGTT



AAAGAAATCAAATGGATGGAAAAATCTCGTAACCAGTCTGTTCGTCGTCTGGACAAAGACATGTTCA



TCCAGGCGCTGGAACGTTTCCTGTCTTGGGAATCTTGGAACCTGAAAGTTAAAGAAGAATACGAAAA



AGTTGAAAAAGAACACAAAACCCTGGAAGAACGTATCAAAGAAGACATCCAGGCGTTCAAATCTCTG



GAACAGTACGAAAAAGAACGTCAGGAACAGCTGCTGCGTGACACCCTGAACACCAACGAATACCGTC



TGTCTAAACGTGGTCTGCGTGGTTGGCGTGAAATCATCCAGAAATGGCTGAAAATGGACGAAAACGA



ACCGTCTGAAAAATACCTGGAAGTTTTCAAAGACTACCAGCGTAAACACCCGCGTGAAGCGGGTGAC



TACTCTGTTTACGAATTCCTGTCTAAAAAAGAAAACCACTTCATCTGGCGTAACCACCCGGAATACCC



GTACCTGTACGCGACCTTCTGCGAAATCGACAAAAAAAAAAAAGACGCGAAACAGCAGGCGACCTTC



ACCCTGGCGGACCCGATCAACCACCCGCTGTGGGTTCGTTTCGAAGAACGTTCTGGTTCTAACCTGAA



CAAATACCGTATCCTGACCGAACAGCTGCACACCGAAAAACTGAAAAAAAAACTGACCGTTCAGCTG



GACCGTCTGATCTACCCGACCGAATCTGGTGGTTGGGAAGAAAAAGGTAAAGTTGACATCGTTCTGCT



GCCGTCTCGTCAGTTCTACAACCAGATCTTCCTGGACATCGAAGAAAAAGGTAAACACGCGTTCACCT



ACAAAGACGAATCTATCAAATTCCCGCTGAAAGGTACCCTGGGTGGTGCGCGTGTTCAGTTCGACCGT



GACCACCTGCGTCGTTACCCGCACAAAGTTGAATCTGGTAACGTTGGTCGTATCTACTTCAACATGAC



CGTTAACATCGAACCGACCGAATCTCCGGTTTCTAAATCTCTGAAAATCCACCGTGACGACTTCCCGA



AATTCGTTAACTTCAAACCGAAAGAACTGACCGAATGGATCAAAGACTCTAAAGGTAAAAAACTGAA



ATCTGGTATCGAATCTCTGGAAATCGGTCTGCGTGTTATGTCTATCGACCTGGGTCAGCGTCAGGCGG



CGGCGGCGTCTATCTTCGAAGTTGTTGACCAGAAACCGGACATCGAAGGTAAACTGTTCTTCCCGATC



AAAGGTACCGAACTGTACGCGGTTCACCGTGCGTCTTTCAACATCAAACTGCCGGGTGAAACCCTGGT



TAAATCTCGTGAAGTTCTGCGTAAAGCGCGTGAAGACAACCTGAAACTGATGAACCAGAAACTGAAC



TTCCTGCGTAACGTTCTGCACTTCCAGCAGTTCGAAGACATCACCGAACGTGAAAAACGTGTTACCAA



ATGGATCTCTCGTCAGGAAAACTCTGACGTTCCGCTGGTTTACCAGGACGAACTGATCCAGATCCGTG



AACTGATGTACAAACCGTACAAAGACTGGGTTGCGTTCCTGAAACAGCTGCACAAACGTCTGGAAGT



TGAAATCGGTAAAGAAGTTAAACACTGGCGTAAATCTCTGTCTGACGGTCGTAAAGGTCTGTACGGTA



TCTCTCTGAAAAACATCGACGAAATCGACCGTACCCGTAAATTCCTGCTGCGTTGGTCTCTGCGTCCG



ACCGAACCGGGTGAAGTTCGTCGTCTGGAACCGGGTCAGCGTTTCGCGATCGACCAGCTGAACCACCT



GAACGCGCTGAAAGAAGACCGTCTGAAAAAAATGGCGAACACCATCATCATGCACGCGCTGGGTTAC



TGCTACGACGTTCGTAAAAAAAAATGGCAGGCGAAAAACCCGGCGTGCCAGATCATCCTGTTCGAAG



ACCTGTCTAACTACAACCCGTACGAAGAACGTTCTCGTTTCGAAAACTCTAAACTGATGAAATGGTCT



CGTCGTGAAATCCCGCGTCAGGTTGCGCTGCAGGGTGAAATCTACGGTCTGCAGGTTGGTGAAGTTGG



TGCGCAGTTCTCTTCTCGTTTCCACGCGAAAACCGGTTCTCCGGGTATCCGTTGCTCTGTTGTTACCAA



AGAAAAACTGCAGGACAACCGTTTCTTCAAAAACCTGCAGCGTGAAGGTCGTCTGACCCTGGACAAA



ATCGCGGTTCTGAAAGAAGGTGACCTGTACCCGGACAAAGGTGGTGAAAAATTCATCTCTCTGTCTAA



AGACCGTAAACTGGTTACCACCCACGCGGACATCAACGCGGCGCAGAACCTGCAGAAACGTTTCTGG



ACCCGTACCCACGGTTTCTACAAAGTTTACTGCAAAGCGTACCAGGTTGACGGTCAGACCGTTTACAT



CCCGGAATCTAAAGACCAGAAACAGAAAATCATCGAAGAATTCGGTGAAGGTTACTTCATCCTGAAA



GACGGTGTTTACGAATGGGGTAACGCGGGTAAACTGAAAATCAAAAAAGGTTCTTCTAAACAGTCTT



CTTCTGAACTGGTTGACTCTGACATCCTGAAAGACTCTTTCGACCTGGCGTCTGAACTGAAAGGTGAA



AAACTGATGCTGTACCGTGACCCGTCTGGTAACGTTTTCCCGTCTGACAAATGGATGGCGGCGGGTGT



TTTCTTCGGTAAACTGGAACGTATCCTGATCTCTAAACTGACCAACCAGTACTCTATCTCTACCATCGA



AGACGACTCTTCTAAACAGTCTATGTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTG



AAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATT



ACA





SEQ ID NO: 75
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCG



TCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCG



GGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGA



TTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTG



ACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCA



CTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCA



CCATCATCATCACCATCCGACCCGTACCATCAACCTGAAACTGGTTCTGGGTAAAAACCCGGAAAACG



CGACCCTGCGTCGTGCGCTGTTCTCTACCCACCGTCTGGTTAACCAGGCGACCAAACGTATCGAAGAA



TTCCTGCTGCTGTGCCGTGGTGAAGCGTACCGTACCGTTGACAACGAAGGTAAAGAAGCGGAAATCC



CGCGTCACGCGGTTCAGGAAGAAGCGCTGGCGTTCGCGAAAGCGGCGCAGCGTCACAACGGTTGCAT



CTCTACCTACGAAGACCAGGAAATCCTGGACGTTCTGCGTCAGCTGTACGAACGTCTGGTTCCGTCTG



TTAACGAAAACAACGAAGCGGGTGACGCGCAGGCGGCGAACGCGTGGGTTTCTCCGCTGATGTCTGC



GGAATCTGAAGGTGGTCTGTCTGTTTACGACAAAGTTCTGGACCCGCCGCCGGTTTGGATGAAACTGA



AAGAAGAAAAAGCGCCGGGTTGGGAAGCGGCGTCTCAGATCTGGATCCAGTCTGACGAAGGTCAGTC



TCTGCTGAACAAACCGGGTTCTCCGCCGCGTTGGATCCGTAAACTGCGTTCTGGTCAGCCGTGGCAGG



ACGACTTCGTTTCTGACCAGAAAAAAAAACAGGACGAACTGACCAAAGGTAACGCGCCGCTGATCAA



ACAGCTGAAAGAAATGGGTCTGCTGCCGCTGGTTAACCCGTTCTTCCGTCACCTGCTGGACCCGGAAG



GTAAAGGTGTTTCTCCGTGGGACCGTCTGGCGGTTCGTGCGGCGGTTGCGCACTTCATCTCTTGGGAA



TCTTGGAACCACCGTACCCGTGCGGAATACAACTCTCTGAAACTGCGTCGTGACGAATTCGAAGCGGC



GTCTGACGAATTCAAAGACGACTTCACCCTGCTGCGTCAGTACGAAGCGAAACGTCACTCTACCCTGA



AATCTATCGCGCTGGCGGACGACTCTAACCCGTACCGTATCGGTGTTCGTTCTCTGCGTGCGTGGAAC



CGTGTTCGTGAAGAATGGATCGACAAAGGTGCGACCGAAGAACAGCGTGTTACCATCCTGTCTAAAC



TGCAGACCCAGCTGCGTGGTAAATTCGGTGACCCGGACCTGTTCAACTGGCTGGCGCAGGACCGTCAC



GTTCACCTGTGGTCTCCGCGTGACTCTGTTACCCCGCTGGTTCGTATCAACGCGGTTGACAAAGTTCTG



CGTCGTCGTAAACCGTACGCGCTGATGACCTTCGCGCACCCGCGTTTCCACCCGCGTTGGATCCTGTA



CGAAGCGCCGGGTGGTTCTAACCTGCGTCAGTACGCGCTGGACTGCACCGAAAACGCGCTGCACATC



ACCCTGCCGCTGCTGGTTGACGACGCGCACGGTACCTGGATCGAAAAAAAAATCCGTGTTCCGCTGGC



GCCGTCTGGTCAGATCCAGGACCTGACCCTGGAAAAACTGGAAAAAAAAAAAAACCGTCTGTACTAC



CGTTCTGGTTTCCAGCAGTTCGCGGGTCTGGCGGGTGGTGCGGAAGTTCTGTTCCACCGTCCGTACAT



GGAACACGACGAACGTTCTGAAGAATCTCTGCTGGAACGTCCGGGTGCGGTTTGGTTCAAACTGACCC



TGGACGTTGCGACCCAGGCGCCGCCGAACTGGCTGGACGGTAAAGGTCGTGTTCGTACCCCGCCGGA



AGTTCACCACTTCAAAACCGCGCTGTCTAACAAATCTAAACACACCCGTACCCTGCAGCCGGGTCTGC



GTGTTCTGTCTGTTGACCTGGGTATGCGTACCTTCGCGTCTTGCTCTGTTTTCGAACTGATCGAAGGTA



AACCGGAAACCGGTCGTGCGTTCCCGGTTGCGGACGAACGTTCTATGGACTCTCCGAACAAACTGTGG



GCGAAACACGAACGTTCTTTCAAACTGACCCTGCCGGGTGAAACCCCGTCTCGTAAAGAAGAAGAAG



AACGTTCTATCGCGCGTGCGGAAATCTACGCGCTGAAACGTGACATCCAGCGTCTGAAATCTCTGCTG



CGTCTGGGTGAAGAAGACAACGACAACCGTCGTGACGCGCTGCTGGAACAGTTCTTCAAAGGTTGGG



GTGAAGAAGACGTTGTTCCGGGTCAGGCGTTCCCGCGTTCTCTGTTCCAGGGTCTGGGTGCGGCGCCG



TTCCGTTCTACCCCGGAACTGTGGCGTCAGCACTGCCAGACCTACTACGACAAAGCGGAAGCGTGCCT



GGCGAAACACATCTCTGACTGGCGTAAACGTACCCGTCCGCGTCCGACCTCTCGTGAAATGTGGTACA



AAACCCGTTCTTACCACGGTGGTAAATCTATCTGGATGCTGGAATACCTGGACGCGGTTCGTAAACTG



CTGCTGTCTTGGTCTCTGCGTGGTCGTACCTACGGTGCGATCAACCGTCAGGACACCGCGCGTTTCGG



TTCTCTGGCGTCTCGTCTGCTGCACCACATCAACTCTCTGAAAGAAGACCGTATCAAAACCGGTGCGG



ACTCTATCGTTCAGGCGGCGCGTGGTTACATCCCGCTGCCGCACGGTAAAGGTTGGGAACAGCGTTAC



GAACCGTGCCAGCTGATCCTGTTCGAAGACCTGGCGCGTTACCGTTTCCGTGTTGACCGTCCGCGTCG



TGAAAACTCTCAGCTGATGCAGTGGAACCACCGTGCGATCGTTGCGGAAACCACCATGCAGGCGGAA



CTGTACGGTCAGATCGTTGAAAACACCGCGGCGGGTTTCTCTTCTCGTTTCCACGCGGCGACCGGTGC



GCCGGGTGTTCGTTGCCGTTTCCTGCTGGAACGTGACTTCGACAACGACCTGCCGAAACCGTACCTGC



TGCGTGAACTGTCTTGGATGCTGGGTAACACCAAAGTTGAATCTGAAGAAGAAAAACTGCGTCTGCT



GTCTGAAAAAATCCGTCCGGGTTCTCTGGTTCCGTGGGACGGTGGTGAACAGTTCGCGACCCTGCACC



CGAAACGTCAGACCCTGTGCGTTATCCACGCGGACATGAACGCGGCGCAGAACCTGCAGCGTCGTTT



CTTCGGTCGTTGCGGTGAAGCGTTCCGTCTGGTTTGCCAGCCGCACGGTGACGACGTTCTGCGTCTGG



CGTCTACCCCGGGTGCGCGTCTGCTGGGTGCGCTGCAGCAGCTGGAAAACGGTCAGGGTGCGTTCGA



ACTGGTTCGTGACATGGGTTCTACCTCTCAGATGAACCGTTTCGTTATGAAATCTCTGGGTAAAAAAA



AAATCAAACCGCTGCAGGACAACAACGGTGACGACGAACTGGAAGACGTTCTGTCTGTTCTGCCGGA



AGAAGACGACACCGGTCGTATCACCGTTTTCCGTGACTCTTCTGGTATCTTCTTCCCGTGCAACGTTTG



GATCCCGGCGAAACAGTTCTGGCCGGCGGTTCGTGCGATGATCTGGAAAGTTATGGCGTCTCACTCTC



TGGGTTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAG



GTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA





SEQ ID NO: 76
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCG



TCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCG



GGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGA



TTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTG



ACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCA



CTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCA



CCATCATCATCACCATACCAAACTGCGTCACCGTCAGAAAAAACTGACCCACGACTGGGCGGGTTCTA



AAAAACGTGAAGTTCTGGGTTCTAACGGTAAACTGCAGAACCCGCTGCTGATGCCGGTTAAAAAAGG



TCAGGTTACCGAATTCCGTAAAGCGTTCTCTGCGTACGCGCGTGCGACCAAAGGTGAAATGACCGAC



GGTCGTAAAAACATGTTCACCCACTCTTTCGAACCGTTCAAAACCAAACCGTCTCTGCACCAGTGCGA



ACTGGCGGACAAAGCGTACCAGTCTCTGCACTCTTACCTGCCGGGTTCTCTGGCGCACTTCCTGCTGTC



TGCGCACGCGCTGGGTTTCCGTATCTTCTCTAAATCTGGTGAAGCGACCGCGTTCCAGGCGTCTTCTAA



AATCGAAGCGTACGAATCTAAACTGGCGTCTGAACTGGCGTGCGTTGACCTGTCTATCCAGAACCTGA



CCATCTCTACCCTGTTCAACGCGCTGACCACCTCTGTTCGTGGTAAAGGTGAAGAAACCTCTGCGGAC



CCGCTGATCGCGCGTTTCTACACCCTGCTGACCGGTAAACCGCTGTCTCGTGACACCCAGGGTCCGGA



ACGTGACCTGGCGGAAGTTATCTCTCGTAAAATCGCGTCTTCTTTCGGTACCTGGAAAGAAATGACCG



CGAACCCGCTGCAGTCTCTGCAGTTCTTCGAAGAAGAACTGCACGCGCTGGACGCGAACGTTTCTCTG



TCTCCGGCGTTCGACGTTCTGATCAAAATGAACGACCTGCAGGGTGACCTGAAAAACCGTACCATCGT



TTTCGACCCGGACGCGCCGGTTTTCGAATACAACGCGGAAGACCCGGCGGACATCATCATCAAACTG



ACCGCGCGTTACGCGAAAGAAGCGGTTATCAAAAACCAGAACGTTGGTAACTACGTTAAAAACGCGA



TCACCACCACCAACGCGAACGGTCTGGGTTGGCTGCTGAACAAAGGTCTGTCTCTGCTGCCGGTTTCT



ACCGACGACGAACTGCTGGAATTCATCGGTGTTGAACGTTCTCACCCGTCTTGCCACGCGCTGATCGA



ACTGATCGCGCAGCTGGAAGCGCCGGAACTGTTCGAAAAAAACGTTTTCTCTGACACCCGTTCTGAAG



TTCAGGGTATGATCGACTCTGCGGTTTCTAACCACATCGCGCGTCTGTCTTCTTCTCGTAACTCTCTGT



CTATGGACTCTGAAGAACTGGAACGTCTGATCAAATCTTTCCAGATCCACACCCCGCACTGCTCTCTG



TTCATCGGTGCGCAGTCTCTGTCTCAGCAGCTGGAATCTCTGCCGGAAGCGCTGCAGTCTGGTGTTAA



CTCTGCGGACATCCTGCTGGGTTCTACCCAGTACATGCTGACCAACTCTCTGGTTGAAGAATCTATCG



CGACCTACCAGCGTACCCTGAACCGTATCAACTACCTGTCTGGTGTTGCGGGTCAGATCAACGGTGCG



ATCAAACGTAAAGCGATCGACGGTGAAAAAATCCACCTGCCGGCGGCGTGGTCTGAACTGATCTCTC



TGCCGTTCATCGGTCAGCCGGTTATCGACGTTGAATCTGACCTGGCGCACCTGAAAAACCAGTACCAG



ACCCTGTCTAACGAATTCGACACCCTGATCTCTGCGCTGCAGAAAAACTTCGACCTGAACTTCAACAA



AGCGCTGCTGAACCGTACCCAGCACTTCGAAGCGATGTGCCGTTCTACCAAAAAAAACGCGCTGTCTA



AACCGGAAATCGTTTCTTACCGTGACCTGCTGGCGCGTCTGACCTCTTGCCTGTACCGTGGTTCTCTGG



TTCTGCGTCGTGCGGGTATCGAAGTTCTGAAAAAACACAAAATCTTCGAATCTAACTCTGAACTGCGT



GAACACGTTCACGAACGTAAACACTTCGTTTTCGTTTCTCCGCTGGACCGTAAAGCGAAAAAACTGCT



GCGTCTGACCGACTCTCGTCCGGACCTGCTGCACGTTATCGACGAAATCCTGCAGCACGACAACCTGG



AAAACAAAGACCGTGAATCTCTGTGGCTGGTTCGTTCTGGTTACCTGCTGGCGGGTCTGCCGGACCAG



CTGTCTTCTTCTTTCATCAACCTGCCGATCATCACCCAGAAAGGTGACCGTCGTCTGATCGACCTGATC



CAGTACGACCAGATCAACCGTGACGCGTTCGTTATGCTGGTTACCTCTGCGTTCAAATCTAACCTGTCT



GGTCTGCAGTACCGTGCGAACAAACAGTCTTTCGTTGTTACCCGTACCCTGTCTCCGTACCTGGGTTCT



AAACTGGTTTACGTTCCGAAAGACAAAGACTGGCTGGTTCCGTCTCAGATGTTCGAAGGTCGTTTCGC



GGACATCCTGCAGTCTGACTACATGGTTTGGAAAGACGCGGGTCGTCTGTGCGTTATCGACACCGCGA



AACACCTGTCTAACATCAAAAAATCTGTTTTCTCTTCTGAAGAAGTTCTGGCGTTCCTGCGTGAACTGC



CGCACCGTACCTTCATCCAGACCGAAGTTCGTGGTCTGGGTGTTAACGTTGACGGTATCGCGTTCAAC



AACGGTGACATCCCGTCTCTGAAAACCTTCTCTAACTGCGTTCAGGTTAAAGTTTCTCGTACCAACAC



CTCTCTGGTTCAGACCCTGAACCGTTGGTTCGAAGGTGGTAAAGTTTCTCCGCCGTCTATCCAGTTCGA



ACGTGCGTACTACAAAAAAGACGACCAGATCCACGAAGACGCGGCGAAACGTAAAATCCGTTTCCAG



ATGCCGGCGACCGAACTGGTTCACGCGTCTGACGACGCGGGTTGGACCCCGTCTTACCTGCTGGGTAT



CGACCCGGGTGAATACGGTATGGGTCTGTCTCTGGTTTCTATCAACAACGGTGAAGTTCTGGACTCTG



GTTTCATCCACATCAACTCTCTGATCAACTTCGCGTCTAAAAAATCTAACCACCAGACCAAAGTTGTT



CCGCGTCAGCAGTACAAATCTCCGTACGCGAACTACCTGGAACAGTCTAAAGACTCTGCGGCGGGTG



ACATCGCGCACATCCTGGACCGTCTGATCTACAAACTGAACGCGCTGCCGGTTTTCGAAGCGCTGTCT



GGTAACTCTCAGTCTGCGGCGGACCAGGTTTGGACCAAAGTTCTGTCTTTCTACACCTGGGGTGACAA



CGACGCGCAGAACTCTATCCGTAAACAGCACTGGTTCGGTGCGTCTCACTGGGACATCAAAGGTATGC



TGCGTCAGCCGCCGACCGAAAAAAAACCGAAACCGTACATCGCGTTCCCGGGTTCTCAGGTTTCTTCT



TACGGTAACTCTCAGCGTTGCTCTTGCTGCGGTCGTAACCCGATCGAACAGCTGCGTGAAATGGCGAA



AGACACCTCTATCAAAGAACTGAAAATCCGTAACTCTGAAATCCAGCTGTTCGACGGTACCATCAAAC



TGTTCAACCCGGACCCGTCTACCGTTATCGAACGTCGTCGTCACAACCTGGGTCCGTCTCGTATCCCG



GTTGCGGACCGTACCTTCAAAAACATCTCTCCGTCTTCTCTGGAATTCAAAGAACTGATCACCATCGTT



TCTCGTTCTATCCGTCACTCTCCGGAATTCATCGCGAAAAAACGTGGTATCGGTTCTGAATACTTCTGC



GCGTACTCTGACTGCAACTCTTCTCTGAACTCTGAAGCGAACGCGGCGGCGAACGTTGCGCAGAAATT



CCAGAAACAGCTGTTCTTCGAACTGTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTG



AAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATT



ACA





SEQ ID NO: 77
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCG



TCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCG



GGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGA



TTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTG



ACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCA



CTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCA



CCATCATCATCACCATAAACGTATCCTGAACTCTCTGAAAGTTGCGGCGCTGCGTCTGCTGTTCCGTG



GTAAAGGTTCTGAACTGGTTAAAACCGTTAAATACCCGCTGGTTTCTCCGGTTCAGGGTGCGGTTGAA



GAACTGGCGGAAGCGATCCGTCACGACAACCTGCACCTGTTCGGTCAGAAAGAAATCGTTGACCTGA



TGGAAAAAGACGAAGGTACCCAGGTTTACTCTGTTGTTGACTTCTGGCTGGACACCCTGCGTCTGGGT



ATGTTCTTCTCTCCGTCTGCGAACGCGCTGAAAATCACCCTGGGTAAATTCAACTCTGACCAGGTTTCT



CCGTTCCGTAAAGTTCTGGAACAGTCTCCGTTCTTCCTGGCGGGTCGTCTGAAAGTTGAACCGGCGGA



ACGTATCCTGTCTGTTGAAATCCGTAAAATCGGTAAACGTGAAAACCGTGTTGAAAACTACGCGGCG



GACGTTGAAACCTGCTTCATCGGTCAGCTGTCTTCTGACGAAAAACAGTCTATCCAGAAACTGGCGAA



CGACATCTGGGACTCTAAAGACCACGAAGAACAGCGTATGCTGAAAGCGGACTTCTTCGCGATCCCG



CTGATCAAAGACCCGAAAGCGGTTACCGAAGAAGACCCGGAAAACGAAACCGCGGGTAAACAGAAA



CCGCTGGAACTGTGCGTTTGCCTGGTTCCGGAACTGTACACCCGTGGTTTCGGTTCTATCGCGGACTTC



CTGGTTCAGCGTCTGACCCTGCTGCGTGACAAAATGTCTACCGACACCGCGGAAGACTGCCTGGAATA



CGTTGGTATCGAAGAAGAAAAAGGTAACGGTATGAACTCTCTGCTGGGTACCTTCCTGAAAAACCTG



CAGGGTGACGGTTTCGAACAGATCTTCCAGTTCATGCTGGGTTCTTACGTTGGTTGGCAGGGTAAAGA



AGACGTTCTGCGTGAACGTCTGGACCTGCTGGCGGAAAAAGTTAAACGTCTGCCGAAACCGAAATTC



GCGGGTGAATGGTCTGGTCACCGTATGTTCCTGCACGGTCAGCTGAAATCTTGGTCTTCTAACTTCTTC



CGTCTGTTCAACGAAACCCGTGAACTGCTGGAATCTATCAAATCTGACATCCAGCACGCGACCATGCT



GATCTCTTACGTTGAAGAAAAAGGTGGTTACCACCCGCAGCTGCTGTCTCAGTACCGTAAACTGATGG



AACAGCTGCCGGCGCTGCGTACCAAAGTTCTGGACCCGGAAATCGAAATGACCCACATGTCTGAAGC



GGTTCGTTCTTACATCATGATCCACAAATCTGTTGCGGGTTTCCTGCCGGACCTGCTGGAATCTCTGGA



CCGTGACAAAGACCGTGAATTCCTGCTGTCTATCTTCCCGCGTATCCCGAAAATCGACAAAAAAACCA



AAGAAATCGTTGCGTGGGAACTGCCGGGTGAACCGGAAGAAGGTTACCTGTTCACCGCGAACAACCT



GTTCCGTAACTTCCTGGAAAACCCGAAACACGTTCCGCGTTTCATGGCGGAACGTATCCCGGAAGACT



GGACCCGTCTGCGTTCTGCGCCGGTTTGGTTCGACGGTATGGTTAAACAGTGGCAGAAAGTTGTTAAC



CAGCTGGTTGAATCTCCGGGTGCGCTGTACCAGTTCAACGAATCTTTCCTGCGTCAGCGTCTGCAGGC



GATGCTGACCGTTTACAAACGTGACCTGCAGACCGAAAAATTCCTGAAACTGCTGGCGGACGTTTGCC



GTCCGCTGGTTGACTTCTTCGGTCTGGGTGGTAACGACATCATCTTCAAATCTTGCCAGGACCCGCGT



AAACAGTGGCAGACCGTTATCCCGCTGTCTGTTCCGGCGGACGTTTACACCGCGTGCGAAGGTCTGGC



GATCCGTCTGCGTGAAACCCTGGGTTTCGAATGGAAAAACCTGAAAGGTCACGAACGTGAAGACTTC



CTGCGTCTGCACCAGCTGCTGGGTAACCTGCTGTTCTGGATCCGTGACGCGAAACTGGTTGTTAAACT



GGAAGACTGGATGAACAACCCGTGCGTTCAGGAATACGTTGAAGCGCGTAAAGCGATCGACCTGCCG



CTGGAAATCTTCGGTTTCGAAGTTCCGATCTTCCTGAACGGTTACCTGTTCTCTGAACTGCGTCAGCTG



GAACTGCTGCTGCGTCGTAAATCTGTTATGACCTCTTACTCTGTTAAAACCACCGGTTCTCCGAACCGT



CTGTTCCAGCTGGTTTACCTGCCGCTGAACCCGTCTGACCCGGAAAAAAAAAACTCTAACAACTTCCA



GGAACGTCTGGACACCCCGACCGGTCTGTCTCGTCGTTTCCTGGACCTGACCCTGGACGCGTTCGCGG



GTAAACTGCTGACCGACCCGGTTACCCAGGAACTGAAAACCATGGCGGGTTTCTACGACCACCTGTTC



GGTTTCAAACTGCCGTGCAAACTGGCGGCGATGTCTAACCACCCGGGTTCTTCTTCTAAAATGGTTGT



TCTGGCGAAACCGAAAAAAGGTGTTGCGTCTAACATCGGTTTCGAACCGATCCCGGACCCGGCGCAC



CCGGTTTTCCGTGTTCGTTCTTCTTGGCCGGAACTGAAATACCTGGAAGGTCTGCTGTACCTGCCGGAA



GACACCCCGCTGACCATCGAACTGGCGGAAACCTCTGTTTCTTGCCAGTCTGTTTCTTCTGTTGCGTTC



GACCTGAAAAACCTGACCACCATCCTGGGTCGTGTTGGTGAATTCCGTGTTACCGCGGACCAGCCGTT



CAAACTGACCCCGATCATCCCGGAAAAAGAAGAATCTTTCATCGGTAAAACCTACCTGGGTCTGGAC



GCGGGTGAACGTTCTGGTGTTGGTTTCGCGATCGTTACCGTTGACGGTGACGGTTACGAAGTTCAGCG



TCTGGGTGTTCACGAAGACACCCAGCTGATGGCGCTGCAGCAGGTTGCGTCTAAATCTCTGAAAGAAC



CGGTTTTCCAGCCGCTGCGTAAAGGTACCTTCCGTCAGCAGGAACGTATCCGTAAATCTCTGCGTGGT



TGCTACTGGAACTTCTACCACGCGCTGATGATCAAATACCGTGCGAAAGTTGTTCACGAAGAATCTGT



TGGTTCTTCTGGTCTGGTTGGTCAGTGGCTGCGTGCGTTCCAGAAAGACCTGAAAAAAGCGGACGTTC



TGCCGAAAAAAGGTGGTAAAAACGGTGTTGACAAAAAAAAACGTGAATCTTCTGCGCAGGACACCCT



GTGGGGTGGTGCGTTCTCTAAAAAAGAAGAACAGCAGATCGCGTTCGAAGTTCAGGCGGCGGGTTCT



TCTCAGTTCTGCCTGAAATGCGGTTGGTGGTTCCAGCTGGGTATGCGTGAAGTTAACCGTGTTCAGGA



ATCTGGTGTTGTTCTGGACTGGAACCGTTCTATCGTTACCTTCCTGATCGAATCTTCTGGTGAAAAAGT



TTACGGTTTCTCTCCGCAGCAGCTGGAAAAAGGTTTCCGTCCGGACATCGAAACCTTCAAAAAAATGG



TTCGTGACTTCATGCGTCCGCCGATGTTCGACCGTAAAGGTCGTCCGGCGGCGGCGTACGAACGTTTC



GTTCTGGGTCGTCGTCACCGTCGTTACCGTTTCGACAAAGTTTTCGAAGAACGTTTCGGTCGTTCTGCG



CTGTTCATCTGCCCGCGTGTTGGTTGCGGTAACTTCGACCACTCTTCTGAACAGTCTGCGGTTGTTCTG



GCGCTGATCGGTTACATCGCGGACAAAGAAGGTATGTCTGGTAAAAAACTGGTTTACGTTCGTCTGGC



GGAACTGATGGCGGAATGGAAACTGAAAAAACTGGAACGTTCTCGTGTTGAAGAACAGTCTTCTGCG



CAGTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGT



TAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA





SEQ ID NO: 78
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCG



TCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCG



GGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGA



TTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTG



ACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCA



CTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCA



CCATCATCATCACCATGCGGAATCTAAACAGATGCAGTGCCGTAAATGCGGTGCGTCTATGAAATACG



AAGTTATCGGTCTGGGTAAAAAATCTTGCCGTTACATGTGCCCGGACTGCGGTAACCACACCTCTGCG



CGTAAAATCCAGAACAAAAAAAAACGTGACAAAAAATACGGTTCTGCGTCTAAAGCGCAGTCTCAGC



GTATCGCGGTTGCGGGTGCGCTGTACCCGGACAAAAAAGTTCAGACCATCAAAACCTACAAATACCC



GGCGGACCTGAACGGTGAAGTTCACGACTCTGGTGTTGCGGAAAAAATCGCGCAGGCGATCCAGGAA



GACGAAATCGGTCTGCTGGGTCCGTCTTCTGAATACGCGTGCTGGATCGCGTCTCAGAAACAGTCTGA



ACCGTACTCTGTTGTTGACTTCTGGTTCGACGCGGTTTGCGCGGGTGGTGTTTTCGCGTACTCTGGTGC



GCGTCTGCTGTCTACCGTTCTGCAGCTGTCTGGTGAAGAATCTGTTCTGCGTGCGGCGCTGGCGTCTTC



TCCGTTCGTTGACGACATCAACCTGGCGCAGGCGGAAAAATTCCTGGCGGTTTCTCGTCGTACCGGTC



AGGACAAACTGGGTAAACGTATCGGTGAATGCTTCGCGGAAGGTCGTCTGGAAGCGCTGGGTATCAA



AGACCGTATGCGTGAATTCGTTCAGGCGATCGACGTTGCGCAGACCGCGGGTCAGCGTTTCGCGGCG



AAACTGAAAATCTTCGGTATCTCTCAGATGCCGGAAGCGAAACAGTGGAACAACGACTCTGGTCTGA



CCGTTTGCATCCTGCCGGACTACTACGTTCCGGAAGAAAACCGTGCGGACCAGCTGGTTGTTCTGCTG



CGTCGTCTGCGTGAAATCGCGTACTGCATGGGTATCGAAGACGAAGCGGGTTTCGAACACCTGGGTAT



CGACCCGGGTGCGCTGTCTAACTTCTCTAACGGTAACCCGAAACGTGGTTTCCTGGGTCGTCTGCTGA



ACAACGACATCATCGCGCTGGCGAACAACATGTCTGCGATGACCCCGTACTGGGAAGGTCGTAAAGG



TGAACTGATCGAACGTCTGGCGTGGCTGAAACACCGTGCGGAAGGTCTGTACCTGAAAGAACCGCAC



TTCGGTAACTCTTGGGCGGACCACCGTTCTCGTATCTTCTCTCGTATCGCGGGTTGGCTGTCTGGTTGC



GCGGGTAAACTGAAAATCGCGAAAGACCAGATCTCTGGTGTTCGTACCGACCTGTTCCTGCTGAAACG



TCTGCTGGACGCGGTTCCGCAGTCTGCGCCGTCTCCGGACTTCATCGCGTCTATCTCTGCGCTGGACCG



TTTCCTGGAAGCGGCGGAATCTTCTCAGGACCCGGCGGAACAGGTTCGTGCGCTGTACGCGTTCCACC



TGAACGCGCCGGCGGTTCGTTCTATCGCGAACAAAGCGGTTCAGCGTTCTGACTCTCAGGAATGGCTG



ATCAAAGAACTGGACGCGGTTGACCACCTGGAATTCAACAAAGCGTTCCCGTTCTTCTCTGACACCGG



TAAAAAAAAAAAAAAAGGTGCGAACTCTAACGGTGCGCCGTCTGAAGAAGAATACACCGAAACCGA



ATCTATCCAGCAGCCGGAAGACGCGGAACAGGAAGTTAACGGTCAGGAAGGTAACGGTGCGTCTAAA



AACCAGAAAAAATTCCAGCGTATCCCGCGTTTCTTCGGTGAAGGTTCTCGTTCTGAATACCGTATCCT



GACCGAAGCGCCGCAGTACTTCGACATGTTCTGCAACAACATGCGTGCGATCTTCATGCAGCTGGAAT



CTCAGCCGCGTAAAGCGCCGCGTGACTTCAAATGCTTCCTGCAGAACCGTCTGCAGAAACTGTACAAA



CAGACCTTCCTGAACGCGCGTTCTAACAAATGCCGTGCGCTGCTGGAATCTGTTCTGATCTCTTGGGG



TGAATTCTACACCTACGGTGCGAACGAAAAAAAATTCCGTCTGCGTCACGAAGCGTCTGAACGTTCTT



CTGACCCGGACTACGTTGTTCAGCAGGCGCTGGAAATCGCGCGTCGTCTGTTCCTGTTCGGTTTCGAA



TGGCGTGACTGCTCTGCGGGTGAACGTGTTGACCTGGTTGAAATCCACAAAAAAGCGATCTCTTTCCT



GCTGGCGATCACCCAGGCGGAAGTTTCTGTTGGTTCTTACAACTGGCTGGGTAACTCTACCGTTTCTCG



TTACCTGTCTGTTGCGGGTACCGACACCCTGTACGGTACCCAGCTGGAAGAATTCCTGAACGCGACCG



TTCTGTCTCAGATGCGTGGTCTGGCGATCCGTCTGTCTTCTCAGGAACTGAAAGACGGTTTCGACGTTC



AGCTGGAATCTTCTTGCCAGGACAACCTGCAGCACCTGCTGGTTTACCGTGCGTCTCGTGACCTGGCG



GCGTGCAAACGTGCGACCTGCCCGGCGGAACTGGACCCGAAAATCCTGGTTCTGCCGGTTGGTGCGTT



CATCGCGTCTGTTATGAAAATGATCGAACGTGGTGACGAACCGCTGGCGGGTGCGTACCTGCGTCACC



GTCCGCACTCTTTCGGTTGGCAGATCCGTGTTCGTGGTGTTGCGGAAGTTGGTATGGACCAGGGTACC



GCGCTGGCGTTCCAGAAACCGACCGAATCTGAACCGTTCAAAATCAAACCGTTCTCTGCGCAGTACGG



TCCGGTTCTGTGGCTGAACTCTTCTTCTTACTCTCAGTCTCAGTACCTGGACGGTTTCCTGTCTCAGCC



GAAAAACTGGTCTATGCGTGTTCTGCCGCAGGCGGGTTCTGTTCGTGTTGAACAGCGTGTTGCGCTGA



TCTGGAACCTGCAGGCGGGTAAAATGCGTCTGGAACGTTCTGGTGCGCGTGCGTTCTTCATGCCGGTT



CCGTTCTCTTTCCGTCCGTCTGGTTCTGGTGACGAAGCGGTTCTGGCGCCGAACCGTTACCTGGGTCTG



TTCCCGCACTCTGGTGGTATCGAATACGCGGTTGTTGACGTTCTGGACTCTGCGGGTTTCAAAATCCTG



GAACGTGGTACCATCGCGGTTAACGGTTTCTCTCAGAAACGTGGTGAACGTCAGGAAGAAGCGCACC



GTGAAAAACAGCGTCGTGGTATCTCTGACATCGGTCGTAAAAAACCGGTTCAGGCGGAAGTTGACGC



GGCGAACGAACTGCACCGTAAATACACCGACGTTGCGACCCGTCTGGGTTGCCGTATCGTTGTTCAGT



GGGCGCCGCAGCCGAAACCGGGTACCGCGCCGACCGCGCAGACCGTTTACGCGCGTGCGGTTCGTAC



CGAAGCGCCGCGTTCTGGTAACCAGGAAGACCACGCGCGTATGAAATCTTCTTGGGGTTACACCTGG



GGTACCTACTGGGAAAAACGTAAACCGGAAGACATCCTGGGTATCTCTACCCAGGTTTACTGGACCG



GTGGTATCGGTGAATCTTGCCCGGCGGTTGCGGTTGCGCTGCTGGGTCACATCCGTGCGACCTCTACC



CAGACCGAATGGGAAAAAGAAGAAGTTGTTTTCGGTCGTCTGAAAAAATTCTTCCCGTCTTAAGAAAT



CATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCAC



TCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA


SEQ ID NO: 79
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCG



TCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCG



GGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGA



TTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTG



ACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCA



CTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCA



CCATCATCATCACCATGAAAAACGTATCAACAAAATCCGTAAAAAACTGTCTGCGGACAACGCGACC



AAACCGGTTTCTCGTTCTGGTCCGATGAAAACCCTGCTGGTTCGTGTTATGACCGACGACCTGAAAAA



ACGTCTGGAAAAACGTCGTAAAAAACCGGAAGTTATGCCGCAGGTTATCTCTAACAACGCGGCGAAC



AACCTGCGTATGCTGCTGGACGACTACACCAAAATGAAAGAAGCGATCCTGCAGGTTTACTGGCAGG



AATTCAAAGACGACCACGTTGGTCTGATGTGCAAATTCGCGCAGCCGGCGTCTAAAAAAATCGACCA



GAACAAACTGAAACCGGAAATGGACGAAAAAGGTAACCTGACCACCGCGGGTTTCGCGTGCTCTCAG



TGCGGTCAGCCGCTGTTCGTTTACAAACTGGAACAGGTTTCTGAAAAAGGTAAAGCGTACACCAACTA



CTTCGGTCGTTGCAACGTTGCGGAACACGAAAAACTGATCCTGCTGGCGCAGCTGAAACCGGAAAAA



GACTCTGACGAAGCGGTTACCTACTCTCTGGGTAAATTCGGTCAGCGTGCGCTGGACTTCTACTCTAT



CCACGTTACCAAAGAATCTACCCACCCGGTTAAACCGCTGGCGCAGATCGCGGGTAACCGTTACGCGT



CTGGTCCGGTTGGTAAAGCGCTGTCTGACGCGTGCATGGGTACCATCGCGTCTTTCCTGTCTAAATAC



CAGGACATCATCATCGAACACCAGAAAGTTGTTAAAGGTAACCAGAAACGTCTGGAATCTCTGCGTG



AACTGGCGGGTAAAGAAAACCTGGAATACCCGTCTGTTACCCTGCCGCCGCAGCCGCACACCAAAGA



AGGTGTTGACGCGTACAACGAAGTTATCGCGCGTGTTCGTATGTGGGTTAACCTGAACCTGTGGCAGA



AACTGAAACTGTCTCGTGACGACGCGAAACCGCTGCTGCGTCTGAAAGGTTTCCCGTCTTTCCCGGTT



GTTGAACGTCGTGAAAACGAAGTTGACTGGTGGAACACCATCAACGAAGTTAAAAAACTGATCGACG



CGAAACGTGACATGGGTCGTGTTTTCTGGTCTGGTGTTACCGCGGAAAAACGTAACACCATCCTGGAA



GGTTACAACTACCTGCCGAACGAAAACGACCACAAAAAACGTGAAGGTTCTCTGGAAAACCCGAAAA



AACCGGCGAAACGTCAGTTCGGTGACCTGCTGCTGTACCTGGAAAAAAAATACGCGGGTGACTGGGG



TAAAGTTTTCGACGAAGCGTGGGAACGTATCGACAAAAAAATCGCGGGTCTGACCTCTCACATCGAA



CGTGAAGAAGCGCGTAACGCGGAAGACGCGCAGTCTAAAGCGGTTCTGACCGACTGGCTGCGTGCGA



AAGCGTCTTTCGTTCTGGAACGTCTGAAAGAAATGGACGAAAAAGAATTCTACGCGTGCGAAATCCA



GCTGCAGAAATGGTACGGTGACCTGCGTGGTAACCCGTTCGCGGTTGAAGCGGAAAACCGTGTTGTT



GACATCTCTGGTTTCTCTATCGGTTCTGACGGTCACTCTATCCAGTACCGTAACCTGCTGGCGTGGAAA



TACCTGGAAAACGGTAAACGTGAATTCTACCTGCTGATGAACTACGGTAAAAAAGGTCGTATCCGTTT



CACCGACGGTACCGACATCAAAAAATCTGGTAAATGGCAGGGTCTGCTGTACGGTGGTGGTAAAGCG



AAAGTTATCGACCTGACCTTCGACCCGGACGACGAACAGCTGATCATCCTGCCGCTGGCGTTCGGTAC



CCGTCAGGGTCGTGAATTCATCTGGAACGACCTGCTGTCTCTGGAAACCGGTCTGATCAAACTGGCGA



ACGGTCGTGTTATCGAAAAAACCATCTACAACAAAAAAATCGGTCGTGACGAACCGGCGCTGTTCGT



TGCGCTGACCTTCGAACGTCGTGAAGTTGTTGACCCGTCTAACATCAAACCGGTTAACCTGATCGGTG



TTGACCGTGGTGAAAACATCCCGGCGGTTATCGCGCTGACCGACCCGGAAGGTTGCCCGCTGCCGGA



ATTCAAAGACTCTTCTGGTGGTCCGACCGACATCCTGCGTATCGGTGAAGGTTACAAAGAAAAACAG



CGTGCGATCCAGGCGGCGAAAGAAGTTGAACAGCGTCGTGCGGGTGGTTACTCTCGTAAATTCGCGT



CTAAATCTCGTAACCTGGCGGACGACATGGTTCGTAACTCTGCGCGTGACCTGTTCTACCACGCGGTT



ACCCACGACGCGGTTCTGGTTTTCGAAAACCTGTCTCGTGGTTTCGGTCGTCAGGGTAAACGTACCTT



CATGACCGAACGTCAGTACACCAAAATGGAAGACTGGCTGACCGCGAAACTGGCGTACGAAGGTCTG



ACCTCTAAAACCTACCTGTCTAAAACCCTGGCGCAGTACACCTCTAAAACCTGCTCTAACTGCGGTTT



CACCATCACCACCGCGGACTACGACGGTATGCTGGTTCGTCTGAAAAAAACCTCTGACGGTTGGGCG



ACCACCCTGAACAACAAAGAACTGAAAGCGGAAGGTCAGATCACCTACTACAACCGTTACAAACGTC



AGACCGTTGAAAAAGAACTGTCTGCGGAACTGGACCGTCTGTCTGAAGAATCTGGTAACAACGACAT



CTCTAAATGGACCAAAGGTCGTCGTGACGAAGCGCTGTTCCTGCTGAAAAAACGTTTCTCTCACCGTC



CGGTTCAGGAACAGTTCGTTTGCCTGGACTGCGGTCACGAAGTTCACGCGGACGAACAGGCGGCGCT



GAACATCGCGCGTTCTTGGCTGTTCCTGAACTCTAACTCTACCGAATTCAAATCTTACAAATCTGGTAA



ACAGCCGTTCGTTGGTGCGTGGCAGGCGTTCTACAAACGTCGTCTGAAAGAAGTTTGGAAACCGAAC



GCGTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGT



TAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA





SEQ ID NO: 80
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCG



TCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCG



GGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGA



TTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTG



ACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCA



CTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCA



CCATCATCATCACCATAAACGTATCAACAAAATCCGTCGTCGTCTGGTTAAAGACTCTAACACCAAAA



AAGCGGGTAAAACCGGTCCGATGAAAACCCTGCTGGTTCGTGTTATGACCCCGGACCTGCGTGAACG



TCTGGAAAACCTGCGTAAAAAACCGGAAAACATCCCGCAGCCGATCTCTAACACCTCTCGTGCGAAC



CTGAACAAACTGCTGACCGACTACACCGAAATGAAAAAAGCGATCCTGCACGTTTACTGGGAAGAAT



TCCAGAAAGACCCGGTTGGTCTGATGTCTCGTGTTGCGCAGCCGGCGCCGAAAAACATCGACCAGCG



TAAACTGATCCCGGTTAAAGACGGTAACGAACGTCTGACCTCTTCTGGTTTCGCGTGCTCTCAGTGCT



GCCAGCCGCTGTACGTTTACAAACTGGAACAGGTTAACGACAAAGGTAAACCGCACACCAACTACTT



CGGTCGTTGCAACGTTTCTGAACACGAACGTCTGATCCTGCTGTCTCCGCACAAACCGGAAGCGAACG



ACGAACTGGTTACCTACTCTCTGGGTAAATTCGGTCAGCGTGCGCTGGACTTCTACTCTATCCACGTTA



CCCGTGAATCTAACCACCCGGTTAAACCGCTGGAACAGATCGGTGGTAACTCTTGCGCGTCTGGTCCG



GTTGGTAAAGCGCTGTCTGACGCGTGCATGGGTGCGGTTGCGTCTTTCCTGACCAAATACCAGGACAT



CATCCTGGAACACCAGAAAGTTATCAAAAAAAACGAAAAACGTCTGGCGAACCTGAAAGACATCGCG



TCTGCGAACGGTCTGGCGTTCCCGAAAATCACCCTGCCGCCGCAGCCGCACACCAAAGAAGGTATCG



AAGCGTACAACAACGTTGTTGCGCAGATCGTTATCTGGGTTAACCTGAACCTGTGGCAGAAACTGAA



AATCGGTCGTGACGAAGCGAAACCGCTGCAGCGTCTGAAAGGTTTCCCGTCTTTCCCGCTGGTTGAAC



GTCAGGCGAACGAAGTTGACTGGTGGGACATGGTTTGCAACGTTAAAAAACTGATCAACGAAAAAAA



AGAAGACGGTAAAGTTTTCTGGCAGAACCTGGCGGGTTACAAACGTCAGGAAGCGCTGCTGCCGTAC



CTGTCTTCTGAAGAAGACCGTAAAAAAGGTAAAAAATTCGCGCGTTACCAGTTCGGTGACCTGCTGCT



GCACCTGGAAAAAAAACACGGTGAAGACTGGGGTAAAGTTTACGACGAAGCGTGGGAACGTATCGA



CAAAAAAGTTGAAGGTCTGTCTAAACACATCAAACTGGAAGAAGAACGTCGTTCTGAAGACGCGCAG



TCTAAAGCGGCGCTGACCGACTGGCTGCGTGCGAAAGCGTCTTTCGTTATCGAAGGTCTGAAAGAAG



CGGACAAAGACGAATTCTGCCGTTGCGAACTGAAACTGCAGAAATGGTACGGTGACCTGCGTGGTAA



ACCGTTCGCGATCGAAGCGGAAAACTCTATCCTGGACATCTCTGGTTTCTCTAAACAGTACAACTGCG



CGTTCATCTGGCAGAAAGACGGTGTTAAAAAACTGAACCTGTACCTGATCATCAACTACTTCAAAGGT



GGTAAACTGCGTTTCAAAAAAATCAAACCGGAAGCGTTCGAAGCGAACCGTTTCTACACCGTTATCA



ACAAAAAATCTGGTGAAATCGTTCCGATGGAAGTTAACTTCAACTTCGACGACCCGAACCTGATCATC



CTGCCGCTGGCGTTCGGTAAACGTCAGGGTCGTGAATTCATCTGGAACGACCTGCTGTCTCTGGAAAC



CGGTTCTCTGAAACTGGCGAACGGTCGTGTTATCGAAAAAACCCTGTACAACCGTCGTACCCGTCAGG



ACGAACCGGCGCTGTTCGTTGCGCTGACCTTCGAACGTCGTGAAGTTCTGGACTCTTCTAACATCAAA



CCGATGAACCTGATCGGTATCGACCGTGGTGAAAACATCCCGGCGGTTATCGCGCTGACCGACCCGG



AAGGTTGCCCGCTGTCTCGTTTCAAAGACTCTCTGGGTAACCCGACCCACATCCTGCGTATCGGTGAA



TCTTACAAAGAAAAACAGCGTACCATCCAGGCGGCGAAAGAAGTTGAACAGCGTCGTGCGGGTGGTT



ACTCTCGTAAATACGCGTCTAAAGCGAAAAACCTGGCGGACGACATGGTTCGTAACACCGCGCGTGA



CCTGCTGTACTACGCGGTTACCCAGGACGCGATGCTGATCTTCGAAAACCTGTCTCGTGGTTTCGGTC



GTCAGGGTAAACGTACCTTCATGGCGGAACGTCAGTACACCCGTATGGAAGACTGGCTGACCGCGAA



ACTGGCGTACGAAGGTCTGCCGTCTAAAACCTACCTGTCTAAAACCCTGGCGCAGTACACCTCTAAAA



CCTGCTCTAACTGCGGTTTCACCATCACCTCTGCGGACTACGACCGTGTTCTGGAAAAACTGAAAAAA



ACCGCGACCGGTTGGATGACCACCATCAACGGTAAAGAACTGAAAGTTGAAGGTCAGATCACCTACT



ACAACCGTTACAAACGTCAGAACGTTGTTAAAGACCTGTCTGTTGAACTGGACCGTCTGTCTGAAGAA



TCTGTTAACAACGACATCTCTTCTTGGACCAAAGGTCGTTCTGGTGAAGCGCTGTCTCTGCTGAAAAA



ACGTTTCTCTCACCGTCCGGTTCAGGAAAAATTCGTTTGCCTGAACTGCGGTTTCGAAACCCACGCGG



ACGAACAGGCGGCGCTGAACATCGCGCGTTCTTGGCTGTTCCTGCGTTCTCAGGAATACAAAAAATAC



CAGACCAACAAAACCACCGGTAACACCGACAAACGTGCGTTCGTTGAAACCTGGCAGTCTTTCTACC



GTAAAAAACTGAAAGAAGTTTGGAAACCGGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCT



GAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGAT



TACA





SEQ ID NO: 81
tgccgtcactgcgtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcat



tctgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataatcacggcagaa



aagtccacattgattatttgcacggcgtcacactttgctatgccatagcatttttatccataagatta



gcggatcctacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggctag



caccgcctatctcgtgtgagataggcggagatacgaactttaagAAGGAGatatacc





SEQ ID NO: 82
TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATT



CTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGA



AAAGTCCACATTGATTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTA



GCGGATCCTACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGTAGCG



GATCCTACCTGAC





SEQ ID NO: 83
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTTCTAGAGCACAG



CTAACACCACGTCGTCCCTATCTGCTGCCCTAGGTCTATGAGTGGTTGCTGGATAACTTTACGGGCAT



GCATAAGGCTCGTAATATATATTCAGGGAGACCACAACGGTTTCCCTCTACAAATAATTTTGTTTAAC



TTTTACTAGAGCTAGCAGTAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATA



CGAACTTTAAGAGGAGGATATACCA





SEQ ID NO: 84
GTTTGAGAGATATGTAAATTCAAAGGATAATCAAAC






SEQ ID NO: 85
actacattttttaagacctaattttgagt






SEQ ID NO: 86
ctcaaaactcattcgaatctctactctttgtagat






SEQ ID NO: 87
CTCTAGCAGGCCTGGCAAATTTCTACTGTTGTAGAT






SEQ ID NO: 88
CCGTCTAAAACTCATTCAGAATTTCTACTAGTGTAGAT






SEQ ID NO: 89
GTCTAGGTACTCTCTTTAATTTCTACTATTGT






SEQ ID NO: 90
gttaagttatatagaataatttctactgttgtaga






SEQ ID NO: 91
gtttaaaaccactttaaaatttctactattgta






SEQ ID NO: 92
GTTTGAGAATGATGTAAAAATGTATGGTACACAGAAATGTTTTAATACCATATTTTTACATCACTCTC



AAACATACATCTCTTGTTACTGTTTATCGTATCCAGATTAAATTTCACGTTTTT






SEQ ID NO: 93
CTCTACAACTGATAAAGAATTTCTACTTTTGTAGAT






SEQ ID NO: 94
GTCTGGCCCCAAATTTTAATTTCTACTGTTGTAGAT






SEQ ID NO: 95
GTCAAAAGACCTTTTTAATTTCTACTCTTGTAGAT






SEQ ID NO: 96
GTCTAGAGGACAGAATTTTTCAACGGGTGTGCCAATGGCCACTTTCCAGGTGGCAAAGCCCGTTGAGC



TTCTACGGAAGTGGCAC






SEQ ID NO: 97
CGAGGTTCTGTCTTTTGGTCAGGACAACCGTCTAGCTATAAGTGCTGCAGGGGTGTGAGAAACTCCTA



TTGCTGGACGATGTCTCTTTTAACGAGGCATTAGCAC






SEQ ID NO: 98
GAACGAGGGACGTTTTGTCTCCAATGATTTTGCTATGACGACCTCGAACTGTGCCTTCAAGTCTGAGG



CGAAAAAGAAATGGAAAAAAGTGTCTCATCGCTCTACCTCGTAGTTAGAGG






SEQ ID NO: 99
AATTACTGATGTTGTGATGAAGG






SEQ ID NO: 100
TATACCATAAGGATTTAAAGACT






SEQ ID NO: 101
GTCTTTACTCTCACCTTTCCACCTG






SEQ ID NO: 102
ATTTGAAGGTATCTCCGATAAGTAAAACGCATCAAAG






SEQ ID NO: 103
GTTTGAAGATATCTCCGATAAATAAGAAGCATCAAAG






SEQ ID NO: 104
TTGTTTTAATACCATATTTTTACATCACTCTCAAAC






SEQ ID NO: 105
AAAGAACGCTCGCTCAGTGTTCTGACCTTTCGAGCGCCTGTTCAGGGCGAAAACCCTGGGAGGCGCTC



GAATCATAGGTGGGACAAGGGATTCGCGGCGAAAA






SEQ ID NO: 106
GTTTGAGAATGATGTAAAAATGTATGGTACACAGAAATGTTTTAATACCATATTTTTACATCACTCTC



AAACATACATCTCTTGTTACTGTTTATCGTATCCAGATTAAATTTCACGTTTTT






SEQ ID NO: 107
GTCTAGAGGACAGAATTTTTCAACGGGTGTGCCAATGGCCACTTTCCAGGTGGCAAAGCCCGTTGAGC



TTCTACGGAAGTGGCAC






SEQ ID NO: 108
MSIYQEFVNKYSLSKTLRFELIPQGKTLENIKARGLILDDEKRAKDYKKAKQIIDKYHQFFIEEILSSVC



ISEDLLQNYSDVYFKLKKSDDDNLQKDFKSAKDTIKKQISEYIKDSEKFKNLFNQNLIDAKKGQESDLIL



WLKQSKDNGIELFKANSDITDIDEALEIIKSFKGWTTYFKGFHENRKNVYSSNDIPTSIIYRIVDDNLPK



FLENKAKYESLKDKAPEAINYEQIKKDLAEELTFDIDYKTSEVNQRVFSLDEVFEIANFNNYLNQSGITK



FNTIIGGKFVNGENTKRKGINEYINLYSQQINDKTLKKYKMSVLFKQILSDTESKSFVIDKLEDDSDVVT



TMQSFYEQIAAFKTVEEKSIKETLSLLFDDLKAQKLDLSKIYFKNDKSLTDLSQQVFDDYSVIGTAVLEY



ITQQIAPKNLDNPSKKEQELIAKKTEKAKYLSLETIKLALEEFNKHRDIDKQCRFEEILANFAAIPMIFD



EIAQNKDNLAQISIKYQNQGKKDLLQASAEDDVKAIKDLLDQTNNLLHKLKIFHISQSEDKANILDKDEH



FYLVFEECYFELANIVPLYNKIRNYITQKPYSDEKFKLNFENSTLANGWDKNKEPDNTAILFIKDDKYYL



GVMNKKNNKIFDDKAIKENKGEGYKKIVYKLLPGANKMLPKVFFSAKSIKFYNPSEDILRIRNHSTHTKN



GSPQKGYEKFEFNIEDCRKFIDFYKQSISKHPEWKDFGFRFSDTQRYNSIDEFYREVENQGYKLTFENIS



ESYIDSVVNQGKLYLFQIYNKDFSAYSKGRPNLHTLYWKALFDERNLQDVVYKLNGEAELFYRKQSIPKK



ITHPAKEAIANKNKDNPKKESVFEYDLIKDKRFTEDKFFFHCPITINFKSSGANKFNDEINLLLKEKAND



VHILSIDRGERHLAYYTLVDGKGNIIKQDTFNIIGNDRMKTNYHDKLAAIEKDRDSARKDWKKINNIKEM



KEGYLSQVVHEIAKLVIEYNAIVVFEDLNFGFKRGRFKVEKQVYQKLEKMLIEKLNYLVFKDNEFDKTGG



VLRAYQLTAPFETFKKMGKQTGIIYYVPAGFTSKICPVTGFVNQLYPKYESVSKSQEFFSKFDKICYNLD



KGYFEFSFDYKNFGDKAAKGKWTIASFGSRLINFRNSDKNHNWDTREVYPTKELEKLLKDYSIEYGHGEC



IKAAICGESDKKFFAKLTSVLNTILQMRNSKTGTELDYLISPVADVNGNFFDSRQAPKNMPQDADANGAY



HIGLKGLMLLGRIKNNQEGKKLNLVIKNEEYFEFVQNRNN





SEQ ID NO: 109
MSIYQEFVNKYSLSKTLRFELIPQGKTLENIKARGLILDDEKRAKDYKKAKQIIDKYHQFFIEEILSSVC



ISEDLLQNYSDVYFKLKKSDDDNLQKDFKSAKDTIKKQISEYIKDSEKFKNLFNQNLIDAKKGQESDLIL



WLKQSKDNGIELFKANSDITDIDEALEIIKSFKGWTTYFKGFHENRKNVYSSDDIPTSIIYRIVDDNLPK



FLENKAKYESLKDKAPEAINYEQIKKDLAEELTFDIDYKTSEVNQRVFSLDEVFEIANFNNYLNQSGITK



FNTIIGGKFVNGENTKRKGINEYINLYSQQINDKTLKKYKMSVLFKQILSDTESKSFVIDKLEDDSDVVT



TMQSFYEQIAAFKTVEEKSIKETLSLLFDDLKAQKLDLSKIYFKNDKSLTDLSQQVFDDYSVIGTAVLEY



ITQQVAPKNLDNPSKKEQDLIAKKTEKAKYLSLETIKLALEEFNKHRDIDKQCRFEEILANFAAIPMIFD



EIAQNKDNLAQISLKYQNQGKKDLLQASAEEDVKAIKDLLDQTNNLLHRLKIFHISQSEDKANILDKDEH



FYLVFEECYFELANIVPLYNKIRNYITQKPYSDEKFKLNFENSTLANGWDKNKEPDNTAILFIKDDKYYL



GVMNKKNNKIFDDKAIKENKGEGYKKIVYKLLPGANKMLPKVFFSAKSIKFYNPSEDILRIRNHSTHTKN



GNPQKGYEKFEFNIEDCRKFIDFYKESISKHPEWKDFGFRFSDTQRYNSIDEFYREVENQGYKLTFENIS



ESYIDSVVNQGKLYLFQIYNKDFSAYSKGRPNLHTLYWKALFDERNLQDVVYKLNGEAELFYRKQSIPKK



ITHPAKEAIANKNKDNPKKESVFEYDLIKDKRFTEDKFFFHCPITINFKSSGANKFNDEINLLLKEKAND



VHILSIDRGERHLAYYTLVDGKGNIIKQDTFNIIGNDRMKTNYHDKLAAIEKDRDSARKDWKKINNIKEM



KEGYLSQVVHEIAKLVIEHNAIVVFEDLNFGFKRGRFKVEKQVYQKLEKMLIEKLNYLVFKDNEFDKTGG



VLRAYQLTAPFETFKKMGKQTGIIYYVPAGFTSKICPVTGFVNQLYPKYESVSKSQEFFSKFDKICYNLD



KGYFEFSFDYKNFGDKAAKGKWTIASFGSRLINFRNSDKNHNWDTREVYPTKELEKLLKDYSIEYGHGEC



IKAAICGESDKKFFAKLTSVLNTILQMRNSKTGTELDYLISPVADVNGNFFDSRQAPKNMPQDADANGAY



HIGLKGLMLLDRIKNNQEGKKLNLVIKNEEYFEFVQNRNN





SEQ ID NO: 110
MDKKYSIGLDIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTARRR
YTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHLRKKLV
DSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINASGVDAKAILSAR
LSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDLDNLLAQIGDQY



ADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEHHQDLTLLKALVRQQLPEKYKEIFFDQSKNG



YAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNREDLLRKQRTFDNGSIPHQIHLGELHAILRRQEDFY



PFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNL



PNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKI



ECFDSVEISGVEDRFNASLGTYHDLLKIIKDKDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAHLFDD



KVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSG



QGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENIVIEMARENQTTQKGQKNSRERMKRIE



EGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVDQELDINRLSDYDVDHIVPQSFLKDDSIDNKV



LTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRKFDNLTKAERGGLSELDKAGFIKRQLVETR



QITKHVAQILDSRMNTKYDENDKLIREVKVITLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGT



ALIKKYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNG



ETGEIVWDKGRDFATVRKVLSMPQVNIVKKTEVQTGGFSKESILPKRNSDKLIARKKDWDPKKYGGFDSP



TVAYSVLVVAKVEKGKSKKLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLIIKLPKYSLFELENG



RKRMLASAGELQKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEIIEQISEFSKRV



ILADANLDKVLSAYNKHRDKPIREQAENIIHLFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLDATLIHQSIT



GLYETRIDLSQLGGD





SEQ ID NO: 111
PKKKRKV






SEQ ID NO: 112
KRPAATKKAGQAKKKK






SEQ ID NO: 113
PAAKRVKLD






SEQ ID NO: 114
RQRRNELKRSP






SEQ ID NO: 115
NQSSNFGPMKGGNFGGRSSGPYGGGGQYFAKPRNQGGY






SEQ ID NO: 116
RMRIZFKNKGKDTAELRRRRVEVSVELRKAKKDEQILKRRNV






SEQ ID NO: 117
VSRKRPRP






SEQ ID NO: 118
PPKKARED






SEQ ID NO: 119
PQPKKKPL






SEQ ID NO: 120
SALIKKKKKMAP






SEQ ID NO: 121
DRLRR






SEQ ID NO: 122
PKQKKRK






SEQ ID NO: 123
RKLKKKIKKL






SEQ ID NO: 124
REKKKFLKRR






SEQ ID NO: 125
KRKGDEVDGVDEVAKKKSKK






SEQ ID NO: 126
RKCLQAGMNLEARKTKK






SEQ ID NO: 127
ATGGGTAAGATGTATTATCTGGGTTTGGATATAGGCACTAACTCTGTGGGATATGCAGTAACTGATCC
CTCGTATCACTTGTTAAAGTTCAAAGGCGAACCCATGTGGGGAGCACATGTATTTGCTGCGGGTAATC
AGAGTGCCGAAAGGCGATCTTTCAGAACATCCAGGAGGCGATTAGATAGGAGACAGCAAAGAGTAA
AGCTTGTGCAAGAGATCTTTGCTCCTGTCATTTCACCTATAGACCCTCGTTTTTTTATAAGATTGCACG



AATCGGCTCTATGGAGAGACGATGTTGCCGAAACAGATAAACATATCTTTTTCAATGATCCCACTTAT



ACAGACAAGGAATACTACTCCGACTACCCGACAATTCATCATTTGATCGTCGATCTTATGGAGAGCTC



TGAAAAGCATGACCCCCGACTTGTCTATTTGGCTGTAGCTTGGTTAGTTGCTCATAGAGGTCATTTCTT



GAATGAAGTAGATAAAGACAATATAGGTGATGTACTTTCTTTTGATGCTTTCTACCCGGAATTTTTGG



CCTTTTTGTCAGACAATGGCGTCAGTCCCTGGGTCTGTGAGTCGAAGGCCCTTCAAGCTACTCTGCTGT



CTAGGAATAGCGTCAACGACAAATATAAAGCATTAAAATCGCTGATATTCGGATCGCAAAAACCGGA



AGATAACTTTGACGCTAACATCTCTGAAGATGGTTTAATCCAATTGCTGGCGGGTAAGAAAGTTAAAG



TAAACAAACTATTCCCACAAGAGTCCAACGATGCTAGCTTTACGTTGAATGATAAAGAAGACGCTATT



GAAGAAATTCTAGGTACTTTAACGCCTGACGAGTGCGAATGGATCGCTCATATTCGCAGATTGTTCGA



TTGGGCCATCATGAAACACGCGCTAAAGGATGGCAGGACGATATCTGAATCAAAAGTGAAGCTATAC



GAGCAGCATCATCATGACTTGACTCAGTTAAAGTACTTTGTGAAGACCTACCTAGCTAAAGAGTATGA



TGATATCTTCAGAAACGTAGACTCCGAGACAACTAAAAATTATGTAGCTTATTCTTACCATGTGAAGG



AAGTGAAAGGCACATTACCAAAAAATAAAGCAACGCAAGAAGAATTTTGTAAATACGTCCTTGGCAA



AGTCAAAAACATTGAATGTTCCGAAGCAGACAAGGTTGATTTTGATGAAATGATACAACGACTTACG



GACAATTCTTTTATGCCAAAGCAAGTCTCAGGTGAAAATAGAGTAATACCATACCAGTTGTACTACTA



TGAATTAAAGACAATTTTAAACAAAGCCGCCTCATATCTACCTTTTTTGACACAATGCGGTAAAGATG



CTATTTCTAACCAAGACAAATTACTGTCTATAATGACATTTCGCATACCATATTTCGTCGGCCCTTTAA



GGAAAGATAATTCAGAACATGCCTGGTTGGAACGTAAAGCGGGTAAAATTTACCCGTGGAACTTTAA



TGATAAAGTAGATCTTGATAAATCGGAGGAAGCCTTTATCCGTAGGATGACCAATACTTGCACGTATT



ACCCAGGAGAAGACGTGTTACCATTAGATTCACTTATCTATGAAAAGTTTATGATCTTGAATGAGATA



AACAATATTAGGATTGACGGATACCCCATTTCTGTTGATGTGAAACAACAAGTATTTGGTTTATTTGA



GAAGAAAAGGCGAGTAACAGTTAAGGATATTCAAAATCTACTATTATCTCTTGGAGCGTTGGATAAA



CACGGTAAGCTGACTGGTATTGACACGACAATACACTCTAATTATAACACTTATCATCATTTTAAATC



TCTTATGGAGCGGGGAGTATTGACCAGAGATGATGTGGAAAGAATAGTGGAAAGAATGACATATTCT



GACGATACTAAGAGGGTCAGACTGTGGTTAAATAATAATTATGGAACTCTAACAGCTGACGATGTTA



AGCATATCTCAAGACTCAGAAAACACGATTTCGGCCGTTTGTCTAAAATGTTTTTGACAGGATTGAAA



GGTGTTCATAAGGAGACAGGCGAGAGAGCAAGTATACTGGATTTTATGTGGAATACTAACGACAATT



TAATGCAACTACTGTCCGAATGTTACACATTCTCGGATGAGATCACCAAATTACAAGAGGCCTACTAC



GCAAAAGCTCAATTATCGCTAAATGACTTCTTGGACTCTATGTATATATCAAACGCCGTTAAGAGACC



TATTTATCGGACCTTAGCGGTAGTAAATGATATTAGAAAGGCATGCGGGACGGCACCTAAAAGAATT



TTCATCGAGATGGCGCGAGATGGAGAGTCTAAGAAGAAAAGATCTGTGACTCGTAGAGAGCAAATTA



AAAATCTCTATAGATCAATTCGTAAAGACTTTCAACAAGAAGTTGATTTTCTGGAAAAGATATTGGAA



AATAAGAGTGACGGGCAGCTTCAGTCTGACGCTTTATATTTGTATTTTGCTCAATTAGGCAGAGACAT



GTACACAGGTGATCCAATCAAATTAGAACATATTAAAGACCAATCTTTTTACAACATTGATCATATTT



ATCCTCAATCGATGGTGAAAGATGACAGTTTGGATAACAAGGTACTAGTCCAAAGCGAAATCAATGG



CGAAAAGAGTTCGCGCTATCCATTAGACGCAGCCATTAGAAACAAAATGAAGCCGTTGTGGGATGCC



TACTATAATCATGGATTAATTTCTCTTAAGAAATACCAGCGTTTGACGAGATCTACTCCATTTACGGAC



GACGAGAAGTGGGATTTTATCAATCGTCAGCTAGTTGAAACTAGGCAATCTACTAAAGCTTTAGCAAT



ATTGTTAAAGCGTAAGTTTCCAGATACTGAAATAGTTTACTCAAAGGCTGGACTATCCAGCGATTTTA



GACATGAATTCGGCCTGGTTAAGAGTAGGAATATTAATGATCTACACCATGCTAAAGATGCCTTTCTC



GCAATAGTTACTGGGAACGTTTATCATGAAAGATTTAATAGAAGATGGTTTATGGTTAACCAGCCATA



CTCTGTGAAAACTAAGACATTGTTTACCCATTCAATTAAGAATGGCAACTTTGTCGCTTGGAATGGAG



AAGAAGATCTTGGACGTATCGTAAAGATGTTGAAACAAAACAAGAACACAATCCACTTCACCAGGTT



TTCCTTTGATAGGAAGGAGGGATTGTTCGATATTCAACCTCTCAAAGCTTCTACCGGATTGGTTCCAC



GAAAAGCAGGGTTGGATGTTGTTAAATATGGAGGATACGATAAAAGCACTGCCGCGTATTATTTATTA



GTACGTTTTACACTCGAGGATAAGAAGACTCAACACAAATTGATGATGATTCCTGTTGAAGGTCTCTA



CAAAGCACGTATTGACCATGATAAAGAGTTTTTAACAGATTATGCTCAGACCACGATCAGCGAAATTC



TTCAAAAGGACAAGCAGAAAGTGATCAACATCATGTTCCCTATGGGCACGAGACATATCAAACTGAA



TTCGATGATTTCTATTGATGGATTCTATCTTTCTATTGGTGGGAAGAGTAGCAAAGGTAAGTCAGTACT



ATGTCATGCTATGGTGCCATTAATCGTCCCACACAAGATAGAATGTTATATCAAGGCTATGGAATCGT



TTGCAAGAAAATTCAAAGAAAATAATAAATTGAGGATCGTTGAAAAGTTTGATAAAATAACTGTTGA



AGATAACTTGAACTTATACGAGCTTTTTCTACAAAAGTTGCAACATAACCCATATAATAAATTTTTCTC



TACACAATTTGATGTGTTGACGAACGGTAGAAGTACATTCACCAAATTGTCTCCAGAGGAGCAAGTCC



AGACTTTACTTAATATACTGAGTATATTTAAAACTTGTCGTTCTTCTGGGTGTGATTTAAAATCAATAA



ATGGTTCCGCTCAAGCGGCTAGAATTATGATATCCGCTGATTTAACTGGCTTATCAAAAAAGTATTCA



GATATTAGATTAGTTGAGCAAAGCGCATCAGGTCTATTTGTTTCAAAATCTCAAAATCTCTTGGAATA



CTTGCCAAAAAAGAAAAGGAAAGTTTAG





SEQ ID NO: 128
ATGAGTAGTTTAACAAAGTTTACCAATAAATATAGTAAGCAACTAACTATAAAGAACGAATTGATAC
CGGTCGGTAAGACTTTGGAAAACATAAAAGAAAATGGGTTGATTGATGGAGACGAGCAATTGAATGA
GAATTATCAAAAAGCAAAGATAATAGTAGATGATTTTTTGAGAGACTTTATTAATAAAGCTCTAAATA
ACACTCAAATTGGTAACTGGAGAGAGCTAGCCGACGCCTTGAACAAGGAAGATGAGGATAATATTGA



GAAATTACAAGATAAGATTAGAGGGATTATCGTGTCTAAGTTTGAGACTTTTGATCTGTTCAGTTCGT



ATTCGATTAAAAAGGACGAGAAAATCATCGATGATGATAACGATGTGGAAGAAGAGGAGCTAGACCT



TGGGAAGAAGACATCTAGCTTCAAATACATATTCAAGAAAAATTTGTTCAAACTTGTCCTTCCTTCAT



ATTTAAAAACAACAAATCAAGATAAGTTAAAAATCATTTCTTCCTTCGATAATTTTAGTACTTATTTTC



GTGGTTTTTTCGAAAACAGGAAAAATATATTCACTAAAAAGCCTATATCTACCTCTATAGCTTATAGA



ATTGTTCACGATAATTTCCCAAAATTTCTAGATAATATCAGGTGTTTTAATGTTTGGCAAACCGAGTGT



CCTCAGTTAATAGTCAAGGCCGACAACTACCTTAAAAGCAAGAATGTGATTGCAAAAGATAAGTCTTT



GGCTAACTATTTTACAGTCGGTGCCTATGATTATTTTCTGAGTCAAAATGGTATCGATTTCTATAACAA



CATTATTGGCGGCTTACCAGCTTTTGCCGGGCATGAGAAGATTCAGGGTTTGAACGAATTTATCAATC



AAGAATGTCAAAAGGATTCTGAATTAAAGTCTAAGCTCAAGAATAGGCACGCTTTCAAAATGGCAGT



CTTATTCAAACAAATCCTTTCAGACAGAGAAAAGTCATTTGTGATTGACGAGTTCGAATCAGACGCTC



AGGTAATTGATGCTGTTAAAAATTTTTACGCGGAACAATGCAAAGATAATAACGTCATATTTAATTTA



TTGAATCTGATCAAGAATATTGCTTTTTTGTCGGATGATGAGTTAGACGGCATTTTCATAGAGGGTAA



ATACCTGTCCTCTGTGTCTCAAAAATTGTATAGTGATTGGTCAAAGTTGAGAAATGATATTGAAGATT



CGGCTAATTCTAAACAGGGTAACAAAGAATTAGCGAAGAAAATCAAAACTAACAAGGGTGATGTTGA



AAAGGCTATAAGTAAGTACGAGTTCAGTTTATCTGAACTAAATTCAATTGTTCATGATAACACAAAAT



TTTCCGATCTTTTATCATGCACATTACATAAAGTTGCAAGTGAAAAATTAGTCAAAGTAAACGAAGGT



GATTGGCCAAAACATCTAAAAAACAACGAGGAAAAACAGAAGATAAAAGAACCTCTTGACGCTTTAT



TGGAAATATACAATACTCTATTAATATTTAACTGTAAAAGTTTTAACAAAAATGGTAATTTCTATGTC



GACTACGATCGCTGCATTAATGAGTTGTCCAGTGTTGTGTACTTGTATAATAAAACTCGTAATTATTGT



ACGAAAAAGCCGTACAACACTGACAAATTTAAGTTGAATTTCAACTCCCCACAACTGGGTGAGGGCT



TCTCTAAAAGTAAAGAGAATGATTGCCTTACATTATTATTTAAAAAAGATGATAATTATTATGTCGGA



ATCATAAGAAAGGGGGCAAAGATCAACTTCGATGACACTCAGGCCATAGCAGACAACACAGATAACT



GTATATTCAAAATGAATTATTTTTTGCTGAAGGATGCTAAAAAATTTATCCCCAAATGTTCAATACAA



TTAAAAGAGGTTAAGGCCCATTTCAAAAAGTCGGAAGATGACTATATTTTGTCCGATAAGGAAAAAT



TCGCTAGTCCGCTTGTTATTAAAAAATCCACATTTCTTCTCGCTACGGCTCATGTGAAAGGAAAGAAG



GGCAATATTAAGAAATTTCAGAAAGAATACTCCAAAGAAAATCCTACGGAGTATAGAAATAGTCTGA



ACGAATGGATAGCATTCTGCAAAGAGTTCTTGAAGACCTATAAAGCTGCCACCATCTTTGATATTACA



ACTTTGAAAAAGGCCGAGGAATACGCTGACATTGTGGAATTCTATAAGGATGTAGATAATCTTTGTTA



CAAGTTAGAATTTTGCCCTATCAAAACTTCTTTTATCGAAAATCTTATAGATAATGGCGATTTATACCT



GTTTAGAATTAATAACAAGGACTTTTCTTCAAAAAGTACAGGCACGAAAAACTTACACACATTATACT



TGCAGGCTATATTTGACGAGCGAAACTTAAACAACCCCACGATAATGTTGAATGGAGGTGCAGAGTT



ATTCTACAGAAAAGAATCTATAGAACAGAAAAATCGGATCACGCACAAAGCCGGTAGTATCTTAGTG



AATAAAGTGTGCAAAGATGGTACAAGTCTAGATGACAAAATCCGTAACGAAATTTACCAGTATGAAA



ACAAATTCATTGATACTCTTTCGGACGAAGCTAAAAAGGTTCTGCCAAACGTTATTAAGAAAGAGGCT



ACGCATGATATAACAAAAGATAAACGTTTCACTAGCGACAAATTCTTCTTTCATTGTCCTTTAACAAT



CAACTACAAGGAAGGTGACACCAAACAATTTAATAATGAAGTGCTCTCATTCCTTAGAGGTAACCCC



GATATCAATATTATCGGCATTGATAGAGGAGAAAGAAACCTAATCTATGTAACAGTCATTAACCAAA



AAGGCGAAATATTGGATAGCGTCTCCTTCAATACTGTCACCAATAAGTCATCGAAGATAGAACAAAC



TGTTGATTACGAAGAAAAATTGGCCGTTAGAGAAAAGGAACGTATCGAAGCGAAGAGATCTTGGGAT



AGCATATCCAAGATTGCCACCTTGAAGGAGGGTTATCTAAGCGCGATCGTACATGAAATCTGCTTATT



AATGATTAAGCATAATGCTATTGTCGTGTTAGAAAACCTGAATGCCGGTTTTAAAAGGATTAGAGGTG



GTTTGTCAGAAAAGTCAGTATATCAAAAGTTTGAAAAGATGCTTATTAATAAACTCAACTACTTCGTT



AGCAAGAAAGAAAGTGATTGGAATAAACCGTCAGGTTTGCTCAATGGTCTTCAGTTAAGTGATCAATT



TGAGTCTTTCGAAAAATTAGGAATTCAAAGTGGATTCATTTTTTATGTACCAGCCGCGTACACTTCAA



AAATTGACCCTACGACCGGATTTGCCAACGTCTTGAATTTGTCCAAGGTCAGAAATGTTGACGCCATC



AAAAGTTTTTTTAGCAACTTCAATGAAATCTCTTATTCCAAAAAGGAAGCCCTTTTCAAGTTTTCTTTT



GACCTAGACTCGTTATCGAAGAAAGGATTTTCATCTTTCGTAAAGTTTAGCAAGTCCAAGTGGAATGT



ATACACATTCGGCGAGAGAATTATCAAGCCCAAGAACAAACAGGGCTATAGAGAAGACAAGAGAAT



CAACTTGACTTTTGAGATGAAAAAATTACTCAACGAATACAAGGTTTCATTTGATTTGGAGAACAACT



TGATTCCCAATTTGACATCAGCTAACTTGAAGGATACGTTCTGGAAGGAGTTATTCTTTATATTCAAA



ACGACATTACAACTGCGTAATAGTGTTACAAACGGTAAAGAAGATGTATTAATCTCACCTGTAAAGA



ATGCCAAAGGAGAATTTTTCGTATCCGGTACTCACAATAAGACACTACCACAGGATTGCGACGCTAAC



GGTGCGTATCATATTGCGTTGAAAGGATTAATGATACTTGAAAGAAATAACCTTGTTCGCGAAGAAA



AAGACACCAAGAAGATCATGGCTATTAGCAATGTTGATTGGTTTGAATACGTGCAAAAGAGGAGAGG



TGTTTTGTAA





SEQ ID NO: 129
ATGAACAATTATGACGAGTTCACAAAGCTATACCCTATCCAAAAAACTATCAGGTTCGAATTGAAACC
ACAAGGGAGAACAATGGAACATCTGGAGACATTCAACTTTTTTGAAGAGGACAGAGACAGAGCGGA
GAAATACAAAATTTTAAAAGAGGCCATCGATGAATATCACAAAAAGTTTATCGACGAGCATTTAACA
AACATGTCTTTGGACTGGAATTCACTTAAACAAATTTCTGAGAAATATTATAAGTCTCGGGAGGAAAA



AGACAAAAAGGTCTTTTTGTCCGAGCAAAAGAGAATGAGACAAGAAATTGTCTCGGAGTTTAAAAAA



GATGATCGGTTCAAAGATTTGTTTAGCAAGAAATTGTTTTCTGAATTGTTGAAGGAGGAGATATACAA



GAAAGGCAACCATCAAGAAATAGATGCTTTGAAATCGTTTGACAAGTTCAGCGGTTACTTCATTGGTT



TACATGAAAATAGGAAGAACATGTATAGCGACGGCGATGAGATCACCGCTATATCGAATAGAATCGT



TAACGAAAATTTTCCGAAATTTTTGGATAATTTGCAAAAATACCAGGAAGCTAGGAAAAAGTACCCT



GAATGGATAATAAAGGCGGAATCAGCTTTGGTGGCTCACAACATAAAGATGGATGAAGTCTTCTCGC



TGGAATATTTTAACAAAGTATTAAATCAGGAAGGAATCCAAAGATACAACTTAGCCTTGGGTGGATA



CGTAACCAAATCAGGTGAGAAAATGATGGGCTTAAATGATGCACTTAATCTAGCTCACCAATCCGAA



AAGTCCTCTAAAGGGAGGATACACATGACACCATTGTTTAAGCAAATCCTTTCGGAGAAAGAATCTTT



TTCATATATCCCCGATGTTTTCACTGAGGATAGTCAATTGTTGCCCAGCATTGGTGGATTTTTTGCACA



AATAGAAAATGATAAAGATGGTAACATCTTCGATAGAGCCTTGGAATTGATAAGCTCCTATGCAGAA



TACGATACGGAACGAATATACATTAGACAAGCTGACATCAACAGAGTAAGCAATGTTATTTTTGGTG



AGTGGGGAACTTTAGGTGGATTAATGCGGGAGTACAAAGCTGACTCAATCAATGATATTAATTTGGA



ACGTACGTGCAAAAAAGTCGATAAGTGGCTTGATAGTAAGGAGTTTGCTCTGTCGGATGTACTAGAA



GCAATTAAGAGAACAGGAAACAATGATGCATTTAATGAATATATTAGTAAAATGAGGACGGCTAGAG



AAAAGATAGACGCCGCACGTAAGGAAATGAAGTTTATTTCCGAGAAAATATCTGGCGATGAAGAGTC



GATTCACATCATCAAGACCCTACTCGATTCTGTTCAGCAATTTCTCCATTTTTTTAACCTCTTCAAAGC



AAGACAAGACATTCCCTTAGATGGGGCTTTTTATGCCGAATTTGATGAAGTTCATTCAAAGTTGTTTG



CTATTGTTCCTCTTTACAATAAGGTCCGTAATTACCTTACTAAAAATAACTTGAACACCAAGAAAATA



AAGTTAAACTTCAAGAATCCGACTCTTGCCAACGGGTGGGATCAGAATAAAGTTTATGATTATGCTAG



CTTAATATTTCTAAGAGATGGGAATTATTACTTAGGAATCATCAATCCAAAGCGTAAGAAAAACATTA



AATTTGAACAAGGGTCAGGCAATGGCCCATTCTATAGAAAAATGGTGTATAAGCAAATACCAGGACC



TAACAAGAACTTGCCTCGCGTATTTTTAACTTCAACAAAGGGTAAAAAAGAATATAAACCAAGCAAA



GAAATTATTGAAGGTTACGAAGCAGATAAACACATCAGAGGTGATAAGTTCGATCTGGATTTCTGCC



ATAAATTGATTGACTTTTTTAAGGAATCTATAGAAAAACATAAGGACTGGTCCAAATTTAATTTCTAC



TTCTCACCTACAGAAAGTTATGGTGACATTTCAGAATTTTATTTAGACGTTGAGAAACAAGGATATAG



GATGCATTTTGAAAATATTTCAGCGGAAACCATCGACGAATACGTTGAGAAGGGTGATTTATTCTTGT



TCCAAATTTACAATAAAGACTTCGTTAAAGCTGCAACCGGAAAGAAGGATATGCATACCATATATTG



GAACGCTGCATTCTCGCCAGAAAACTTACAAGATGTCGTTGTAAAGCTTAATGGAGAAGCTGAGCTGT



TCTATAGAGACAAGAGTGATATAAAAGAGATTGTGCATCGGGAAGGTGAAATTCTGGTGAACAGAAC



TTACAATGGTCGTACACCCGTTCCAGACAAAATACATAAAAAACTGACCGATTATCATAATGGTAGG



ACAAAGGACTTGGGCGAGGCCAAGGAGTACCTCGATAAAGTTAGATATTTCAAGGCACACTATGATA



TTACGAAAGACAGGAGATATTTAAACGATAAAATTTACTTTCATGTCCCTTTGACCCTTAACTTTAAA



GCTAATGGTAAAAAGAATTTGAACAAAATGGTAATTGAGAAGTTTTTATCGGACGAAAAAGCTCACA



TAATCGGAATCGACCGCGGAGAGAGAAATTTACTGTATTATAGTATCATCGACAGAAGTGGAAAGAT



TATTGATCAGCAATCTTTGAACGTCATTGATGGGTTTGACTATCGGGAAAAGTTAAATCAAAGGGAAA



TTGAAATGAAGGATGCGAGACAATCATGGAATGCCATTGGTAAAATTAAAGATCTCAAGGAGGGGTA



CTTATCAAAAGCTGTACACGAGATAACTAAAATGGCTATCCAATATAATGCAATTGTTGTAATGGAAG



AATTGAATTATGGTTTTAAACGCGGCAGGTTTAAAGTCGAAAAACAAATATACCAAAAGTTTGAAAA



CATGTTAATTGATAAGATGAACTATCTTGTTTTCAAAGATGCACCTGATGAGAGTCCTGGCGGTGTGC



TGAACGCCTATCAATTAACAAACCCATTAGAGTCCTTTGCTAAACTGGGTAAACAAACTGGCATTCTA



TTTTATGTTCCAGCCGCTTACACCTCAAAGATCGATCCAACGACCGGTTTTGTAAACTTATTTAATACT



TCTTCCAAAACAAACGCGCAAGAACGCAAAGAATTCCTACAAAAATTTGAATCAATATCCTATAGCG



CAAAAGATGGAGGTATATTCGCTTTCGCTTTTGACTACAGAAAGTTTGGCACTTCCAAGACAGATCAT



AAAAATGTGTGGACCGCTTATACCAACGGAGAAAGGATGCGTTATATTAAAGAAAAAAAGAGGAAC



GAACTATTTGATCCATCGAAAGAAATTAAAGAAGCTTTGACAAGCAGCGGAATCAAATATGATGGAG



GTCAAAACATACTTCCAGATATTCTCAGATCTAATAATAACGGTCTTATTTACACGATGTATTCATCTT



TTATCGCTGCCATCCAAATGCGTGTGTATGATGGCAAGGAAGATTATATTATATCTCCTATTAAAAAT



TCAAAGGGTGAATTTTTTCGCACGGATCCAAAAAGAAGAGAGCTTCCAATTGACGCCGATGCTAACG



GTGCTTACAATATTGCATTGCGTGGTGAACTTACTATGAGAGCCATCGCCGAAAAGTTTGATCCGGAC



AGTGAAAAAATGGCGAAATTGGAGCTAAAGCACAAGGATTGGTTTGAATTCATGCAGACCCGTGGCG



ATTGA





SEQ ID NO: 130
ATGACTAAAACGTTCGACTCCGAGTTTTTTAATCTCTATTCCTTGCAAAAGACCGTTAGGTTTGAATTG
AAACCAGTTGGTGAAACTGCCTCATTTGTCGAAGACTTTAAAAACGAGGGATTGAAAAGAGTGGTTA
GTGAAGATGAAAGAAGGGCAGTAGACTATCAAAAGGTTAAAGAAATCATTGACGATTACCACAGAG
ATTTTATAGAAGAATCTCTGAACTATTTTCCAGAGCAGGTTTCAAAAGATGCTCTAGAGCAAGCGTTT



CATTTGTATCAAAAGTTGAAAGCAGCGAAGGTGGAAGAAAGGGAAAAAGCTTTAAAAGAATGGGAA



GCATTACAGAAAAAATTGCGAGAAAAAGTCGTCAAATGTTTCAGCGACTCTAATAAAGCTCGCTTTTC



TAGAATCGATAAAAAAGAATTGATTAAGGAAGATTTAATAAATTGGCTGGTAGCACAAAACAGAGAG



GATGATATTCCTACTGTTGAAACGTTCAATAATTTTACTACTTACTTCACTGGTTTCCATGAGAACAGG



AAGAATATTTACTCTAAAGATGATCACGCTACTGCTATAAGTTTTAGGTTGATTCACGAAAACTTGCC



TAAATTTTTTGACAATGTCATCAGTTTTAACAAGTTGAAAGAAGGTTTCCCGGAATTAAAATTCGACA



AAGTTAAAGAAGATTTAGAAGTAGATTACGACTTGAAGCATGCGTTTGAAATTGAATATTTCGTTAAT



TTCGTCACACAAGCTGGTATCGACCAATATAATTACCTGCTTGGAGGCAAAACTCTAGAAGACGGTAC



GAAGAAACAAGGAATGAATGAACAGATTAATTTATTTAAGCAACAACAAACTCGCGATAAAGCTAGA



CAGATTCCAAAACTGATTCCACTTTTCAAACAGATTCTATCTGAGAGAACTGAATCTCAGAGTTTTAT



CCCTAAGCAGTTCGAGTCTGATCAGGAACTATTCGATTCCCTGCAGAAATTGCATAACAACTGTCAAG



ATAAGTTTACCGTTTTGCAACAGGCGATCTTGGGATTGGCTGAGGCAGATCTTAAAAAGGTCTTTATT



AAAACTAGTGATCTAAACGCATTGTCTAACACTATTTTTGGAAATTATTCTGTGTTCTCAGACGCGCTC



AATTTATATAAAGAGTCGCTAAAAACTAAAAAGGCTCAAGAAGCTTTTGAAAAGTTGCCTGCACATA



GTATTCATGATTTAATCCAATACTTAGAACAATTTAATTCGTCTCTCGATGCTGAAAAGCAACAGTCT



ACCGATACTGTATTAAACTACTTTATTAAAACCGACGAATTATATAGTCGTTTCATTAAATCCACCTCT



GAGGCATTCACCCAAGTACAACCTCTCTTTGAACTGGAAGCTTTGAGCTCCAAAAGAAGACCCCCAG



AAAGTGAAGATGAGGGGGCTAAAGGCCAAGAAGGTTTCGAACAAATTAAGAGAATCAAAGCTTATCT



AGACACTCTAATGGAGGCTGTCCACTTTGCTAAGCCTTTGTATCTTGTCAAGGGTAGAAAGATGATAG



AGGGTCTAGACAAGGATCAAAGCTTCTACGAAGCGTTTGAAATGGCCTACCAGGAGTTGGAGTCTTT



AATCATCCCCATTTACAATAAGGCCAGATCTTACCTGTCTAGGAAGCCATTTAAAGCGGATAAATTCA



AAATTAATTTTGACAATAATACACTTCTATCTGGGTGGGATGCTAACAAGGAGACGGCTAACGCCAGC



ATATTGTTTAAGAAGGATGGTTTATACTACCTGGGAATCATGCCAAAAGGCAAAACTTTCTTGTTCGA



TTATTTCGTTAGTTCAGAAGATTCTGAAAAGTTGAAACAACGGAGACAGAAAACCGCAGAGGAAGCG



CTCGCACAGGATGGAGAATCCTATTTTGAAAAAATACGGTATAAACTCCTACCAGGTGCTAGTAAGAT



GTTGCCAAAGGTATTTTTTAGCAATAAAAATATTGGGTTTTACAATCCCTCAGATGATATTCTACGAAT



TCGGAATACGGCCTCTCATACTAAGAATGGTACTCCCCAGAAGGGTCATTCCAAGGTAGAATTTAACT



TGAATGACTGTCACAAAATGATTGATTTTTTTAAATCTTCCATACAGAAACATCCCGAGTGGGGATCC



TTTGGTTTCACTTTTTCTGATACGTCGGACTTTGAAGATATGAGTGCTTTCTACCGAGAAGTTGAAAAT



CAAGGTTACGTTATAAGTTTTGATAAAATAAAAGAAACTTACATTCAGTCTCAAGTTGAGCAAGGTAA



CTTATATTTATTTCAAATTTACAACAAAGATTTTAGTCCGTATTCAAAGGGAAAGCCAAACCTGCACA



CTTTATACTGGAAAGCTCTGTTTGAAGAGGCTAATTTGAATAACGTAGTGGCTAAGCTAAACGGCGAA



GCAGAAATCTTTTTCAGAAGACACAGTATCAAAGCATCTGATAAAGTGGTACATCCTGCTAATCAAGC



TATAGATAATAAGAATCCCCATACTGAGAAGACGCAGTCCACATTTGAATATGACTTGGTCAAAGAC



AAAAGATATACCCAAGACAAATTTTTTTTTCATGTACCGATATCTTTAAACTTTAAGGCTCAGGGCGTT



TCAAAGTTTAATGATAAGGTAAATGGATTCTTAAAGGGCAATCCCGACGTTAATATAATCGGTATAGA



TCGAGGTGAGAGACATCTTTTATACTTTACCGTGGTGAATCAAAAAGGAGAAATATTAGTGCAAGAG



TCCTTGAATACATTAATGTCTGACAAGGGTCATGTCAACGATTATCAACAGAAATTGGACAAGAAGG



AACAGGAAAGGGACGCTGCCAGGAAGTCCTGGACGACAGTAGAAAATATTAAAGAATTAAAAGAAG



GTTATTTATCACATGTGGTTCATAAACTTGCACATTTAATCATCAAATATAACGCAATAGTGTGCTTGG



AAGATCTTAATTTTGGCTTCAAGAGGGGTAGGTTCAAGGTCGAAAAACAGGTCTACCAGAAGTTCGA



GAAAGCTCTGATCGATAAATTGAATTATCTTGTTTTCAAAGAAAAAGAATTAGGAGAAGTTGGTCATT



ATCTTACAGCATACCAACTCACTGCACCATTTGAAAGCTTCAAAAAGCTAGGCAAGCAATCTGGGATT



TTGTTCTATGTTCCGGCTGATTATACATCAAAGATAGATCCTACCACAGGCTTTGTAAATTTTTTAGAT



CTTAGGTACCAATCCGTTGAAAAAGCTAAACAGTTGCTGTCCGATTTTAATGCGATAAGATTTAATAG



TGTTCAGAATTATTTTGAGTTCGAAATTGATTATAAAAAATTGACACCAAAACGTAAAGTAGGAACAC



AATCTAAATGGGTTATTTGTACCTATGGAGATGTTAGATACCAAAACAGAAGAAATCAGAAAGGTCA



CTGGGAAACTGAAGAAGTTAACGTTACTGAAAAACTTAAAGCTCTATTTGCGAGCGATTCAAAAACG



ACGACGGTGATCGATTATGCAAATGATGATAACCTTATTGATGTAATTCTGGAACAAGATAAGGCATC



ATTTTTTAAAGAACTACTATGGTTGTTAAAGCTAACCATGACCCTAAGGCACTCCAAGATAAAGTCAG



AGGATGATTTTATCCTCTCTCCAGTGAAAAACGAACAAGGTGAGTTTTACGACTCAAGAAAGGCGGG



TGAAGTCTGGCCTAAGGATGCTGATGCCAATGGAGCTTATCACATCGCTCTGAAGGGGCTATGGAACT



TACAGCAAATTAACCAATGGGAAAAAGGTAAAACTTTAAACCTCGCCATAAAGAACCAGGATTGGTT



CAGCTTTATCCAAGAAAAACCATATCAAGAATAA





SEQ ID NO: 131
ATGCACACAGGAGGTCTACTCTCGATGGATGCTAAGGAATTTACCGGTCAATATCCGCTGTCCAAAAC
TTTGCGTTTTGAGCTTAGACCTATTGGCCGAACGTGGGATAACCTAGAGGCTTCTGGTTATTTGGCGG
AAGATAGACATAGAGCTGAGTGTTATCCCCGAGCTAAAGAATTGCTGGATGATAACCACAGGGCGTT
CCTGAATAGAGTTCTACCGCAAATCGATATGGATTGGCATCCAATTGCTGAAGCTTTCTGCAAGGTGC



ACAAAAATCCAGGTAATAAAGAATTGGCTCAGGATTATAATTTGCAGCTTAGTAAGAGAAGAAAAGA



AATTTCCGCTTATTTGCAGGATGCTGATGGATACAAGGGGTTGTTCGCGAAACCTGCCCTGGACGAAG



CTATGAAAATAGCTAAGGAAAACGGCAATGAATCTGATATTGAAGTTTTGGAAGCCTTCAATGGATTT



TCCGTTTATTTCACTGGTTATCATGAGAGTAGGGAGAATATATACTCAGACGAAGATATGGTATCCGT



CGCCTATCGCATAACTGAAGATAATTTTCCAAGGTTCGTGTCGAACGCGTTAATTTTTGATAAACTAA



ATGAATCGCACCCGGATATTATTTCGGAAGTGTCCGGTAATCTGGGGGTAGACGATATTGGTAAATAT



TTTGATGTGTCCAACTACAATAATTTCCTTAGTCAAGCAGGAATTGATGACTACAACCATATTATAGG



AGGGCATACAACTGAAGACGGTCTCATTCAAGCTTTTAACGTAGTGTTAAACCTAAGGCACCAAAAA



GACCCAGGTTTTGAGAAAATTCAATTTAAGCAACTCTACAAGCAGATACTGAGCGTTAGGACTAGTA



AGTCATATATCCCAAAGCAATTCGATAACTCAAAGGAAATGGTCGACTGTATATGCGACTACGTCTCA



AAAATAGAAAAATCTGAAACAGTAGAAAGAGCTCTGAAATTGGTAAGAAATATATCTTCTTTTGATTT



AAGAGGTATTTTCGTAAATAAAAAAAACCTTCGAATTTTGTCTAATAAGTTAATTGGAGACTGGGACG



CAATAGAGACAGCTTTGATGCACAGTTCCAGCAGTGAAAACGATAAGAAATCAGTGTATGACTCTGC



AGAGGCATTCACCCTTGATGATATCTTCAGTTCTGTGAAAAAGTTCAGCGACGCCTCCGCTGAGGATA



TAGGAAACCGCGCTGAAGACATATGTCGTGTTATCTCAGAAACAGCTCCTTTCATTAACGACTTAAGG



GCTGTAGATTTGGATTCTTTAAATGATGACGGCTATGAAGCGGCCGTGTCTAAAATACGGGAATCTCT



TGAACCCTACATGGATCTATTTCACGAATTGGAGATCTTTAGCGTGGGTGATGAGTTTCCTAAATGTG



CTGCCTTTTATAGCGAGTTGGAAGAGGTCTCAGAACAACTGATTGAAATCATTCCTTTATTTAACAAA



GCAAGAAGTTTTTGCACAAGGAAAAGGTATTCAACCGACAAAATCAAAGTCAATTTAAAATTCCCTA



CTCTGGCAGATGGATGGGATCTAAATAAAGAAAGGGATAACAAAGCCGCAATTCTAAGAAAAGACG



GTAAATACTACCTGGCAATTTTAGACATGAAGAAAGATCTCAGTAGTATTCGTACGAGCGATGAGGA



CGAGTCTTCTTTTGAAAAGATGGAATATAAATTGCTCCCTTCTCCTGTGAAAATGCTTCCAAAAATTTT



TGTTAAATCGAAAGCCGCCAAAGAAAAGTACGGGTTGACCGATAGAATGTTAGAATGCTACGATAAA



GGTATGCATAAGTCGGGTAGTGCTTTTGATTTGGGTTTTTGTCATGAATTGATCGATTACTATAAGCGC



TGCATTGCCGAGTACCCAGGCTGGGATGTTTTCGACTTTAAATTTCGTGAGACAAGCGATTACGGATC



CATGAAAGAATTTAATGAAGACGTCGCTGGCGCAGGTTACTATATGTCACTTAGAAAGATTCCATGTT



CCGAAGTTTATCGTTTACTGGACGAGAAGTCAATTTACTTGTTTCAAATATATAATAAGGATTATAGC



GAAAACGCACATGGGAATAAGAATATGCATACGATGTATTGGGAGGGCTTGTTCTCACCACAAAATT



TGGAATCACCAGTCTTCAAATTGTCCGGAGGCGCAGAACTTTTTTTCAGAAAGTCATCTATTCCTAAT



GACGCTAAAACGGTACATCCGAAAGGTTCAGTTCTTGTTCCCAGAAACGACGTCAATGGTAGAAGAA



TACCAGACTCGATCTACAGAGAGTTGACAAGGTATTTTAACCGTGGGGATTGCAGGATCAGTGATGA



AGCTAAGTCTTACCTGGACAAGGTCAAGACAAAAAAAGCGGACCATGACATTGTTAAGGATAGAAGA



TTTACTGTAGATAAGATGATGTTCCATGTTCCGATTGCCATGAATTTTAAAGCTATAAGTAAACCAAA



TCTTAATAAGAAAGTTATTGATGGCATAATAGATGATCAAGATTTGAAAATCATCGGTATCGATCGTG



GTGAGAGAAATCTTATTTATGTGACCATGGTCGATAGGAAGGGGAATATATTGTATCAAGACAGTCTT



AATATTTTAAATGGATACGATTACCGCAAAGCTTTAGACGTGAGGGAATATGATAACAAAGAAGCTA



GAAGGAATTGGACTAAAGTAGAAGGTATTAGAAAAATGAAAGAAGGTTATTTATCTTTAGCTGTTAG



TAAATTGGCCGATATGATCATCGAAAATAATGCTATAATCGTAATGGAAGATTTGAATCACGGGTTTA



AGGCAGGTCGTTCCAAAATTGAAAAGCAGGTGTATCAAAAATTCGAATCAATGTTAATCAACAAGTT



AGGATACATGGTGCTAAAAGACAAGTCCATTGACCAGTCTGGTGGAGCCCTTCATGGTTACCAATTAG



CCAATCATGTTACGACCTTAGCTAGCGTGGGTAAACAATGTGGAGTAATTTTTTACATACCTGCAGCT



TTTACTTCGAAGATTGATCCCACCACGGGCTTTGCTGATTTATTCGCTCTCTCTAATGTGAAGAATGTC



GCTTCTATGAGAGAGTTCTTCTCCAAAATGAAGTCAGTAATATATGACAAGGCGGAAGGCAAATTCG



CCTTTACATTTGATTATTTGGATTATAACGTTAAAAGCGAATGTGGACGTACCTTATGGACTGTGTATA



CAGTTGGTGAACGCTTCACCTACTCTAGAGTAAACCGAGAGTATGTTCGGAAAGTCCCAACAGATATC



ATCTATGATGCATTACAAAAAGCTGGTATTAGCGTCGAAGGTGACCTTAGAGATAGAATCGCGGAAA



GCGACGGTGACACATTAAAGTCTATATTCTACGCTTTTAAATACGCGTTGGATATGAGAGTCGAAAAC



AGAGAGGAAGACTATATACAGTCACCTGTGAAGAATGCTTCTGGTGAGTTCTTTTGTTCAAAAAACGC



CGGAAAGTCTTTGCCGCAGGATTCAGATGCAAATGGTGCCTATAATATAGCTCTGAAAGGGATCCTAC



AACTCAGAATGTTGAGCGAACAATACGATCCAAATGCAGAATCGATTAGATTGCCACTTATAACTAA



CAAGGCATGGTTAACTTTTATGCAATCCGGTATGAAAACTTGGAAGAATTAA





SEQ ID NO: 132
ATGGATTCTCTTAAGGATTTCACTAATTTATATCCAGTCTCGAAAACATTGCGGTTCGAATTGAAACC
AGTTGGGAAAACTCTAGAAAACATTGAAAAAGCCGGTATATTGAAAGAAGATGAACACAGAGCGGA
ATCCTACCGCCGGGTAAAAAAGATAATTGACACATACCATAAAGTGTTTATTGACAGCTCCTTAGAGA
ACATGGCTAAAATGGGGATAGAAAATGAAATCAAGGCTATGCTGCAGTCTTTTTGTGAACTCTATAAG



AAAGACCACAGGACAGAAGGAGAAGATAAAGCTCTTGATAAAATTAGAGCTGTTCTTAGAGGTTTAA



TCGTTGGGGCTTTCACTGGTGTATGTGGAAGACGAGAAAACACAGTACAAAATGAAAAGTACGAGAG



TTTGTTCAAAGAAAAATTGATAAAGGAAATTTTGCCAGATTTCGTGTTGTCCACCGAGGCTGAGTCTC



TTCCATTCAGCGTTGAAGAAGCAACAAGGAGCTTAAAAGAGTTTGACTCATTCACTTCTTATTTTGCT



GGTTTTTACGAAAATAGAAAGAATATTTATTCCACGAAACCGCAAAGTACTGCGATAGCCTACAGATT



AATTCATGAAAACTTGCCTAAATTTATAGATAATATTTTGGTCTTCCAGAAGATTAAAGAACCAATCG



CTAAAGAACTTGAGCACATAAGAGCAGATTTTAGCGCAGGCGGATATATCAAAAAAGATGAACGGCT



AGAAGACATATTCTCATTAAATTACTACATTCATGTCCTTTCTCAAGCTGGTATAGAAAAATATAATG



CTTTAATCGGGAAGATAGTGACGGAAGGTGATGGTGAAATGAAAGGTCTTAATGAACATATTAACTT



ATATAACCAACAGAGGGGTCGAGAGGATAGGTTGCCCTTGTTTAGGCCTCTATACAAGCAAATCCTGT



CCGATAGAGAGCAATTGTCTTATTTACCTGAATCATTTGAAAAAGATGAAGAGCTGCTTAGAGCACTT



AAGGAATTTTACGATCACATCGCCGAAGACATCTTGGGTAGAACACAGCAATTGATGACTTCAATTTC



TGAATACGACTTGTCCCGTATTTATGTCAGAAATGATTCTCAACTTACAGACATCTCGAAGAAAATGC



TAGGAGATTGGAACGCCATTTATATGGCTAGAGAACGAGCCTACGACCACGAACAGGCTCCTAAACG



TATTACTGCTAAATACGAACGTGATAGAATCAAGGCCTTAAAAGGTGAAGAGTCAATTTCATTGGCG



AATCTGAACAGCTGTATAGCTTTCTTGGACAATGTAAGGGATTGTCGAGTTGACACATACCTATCAAC



TTTGGGGCAGAAAGAGGGTCCTCATGGCTTAAGTAACTTGGTGGAAAACGTCTTCGCCTCATATCATG



AAGCAGAACAGTTATTGTCGTTTCCTTACCCCGAAGAGAACAACCTTATTCAGGACAAAGACAATGTA



GTTTTGATCAAAAACCTATTGGATAATATAAGTGATTTACAACGTTTCCTTAAACCTTTGTGGGGAAT



GGGCGATGAACCTGACAAAGACGAAAGGTTTTACGGTGAATACAACTATATTAGAGGAGCGCTTGAC



CAGGTAATACCTTTGTACAATAAAGTAAGGAACTACTTGACTCGTAAACCATATTCTACTAGAAAAGT



TAAATTGAACTTTGGTAATTCACAGCTGCTGAGTGGTTGGGATCGTAATAAAGAAAAAGATAACTCCT



GTGTTATCTTGCGAAAAGGACAAAACTTTTACTTGGCAATTATGAACAACCGTCACAAAAGGTCCTTC



GAGAACAAAGTTCTGCCTGAATACAAAGAAGGTGAACCATATTTTGAAAAAATGGACTATAAATTCC



TGCCAGATCCTAATAAAATGTTGCCTAAGGTCTTCTTGTCTAAAAAAGGTATAGAAATATATAAACCA



TCCCCGAAGTTGCTGGAGCAATATGGTCATGGAACGCACAAAAAAGGTGACACTTTTAGTATGGATG



ACTTGCACGAGTTGATTGATTTTTTTAAACATTCCATTGAAGCGCACGAAGATTGGAAACAATTTGGT



TTCAAGTTCTCTGACACAGCCACTTACGAAAATGTATCGTCCTTTTATAGAGAAGTGGAAGATCAGGG



TTATAAACTGTCATTCCGTAAGGTTAGTGAAAGCTATGTGTACTCGTTGATCGATCAAGGGAAGCTTT



ATCTTTTTCAAATCTATAATAAAGATTTCTCTCCTTGTTCAAAGGGCACACCTAATCTTCATACACTAT



ACTGGAGAATGCTTTTCGATGAAAGAAATTTGGCTGATGTGATCTATAAATTAGACGGTAAAGCTGAG



ATTTTTTTCAGAGAGAAATCCCTGAAAAACGACCATCCAACTCATCCGGCAGGTAAACCGATTAAAA



AGAAATCCCGGCAAAAAAAGGGCGAAGAGAGTTTATTCGAGTATGATTTAGTTAAGGACAGACATTA



TACAATGGACAAATTTCAATTTCATGTGCCCATTACTATGAACTTTAAGTGTAGTGCAGGGTCTAAGG



TTAATGATATGGTAAACGCACATATTAGAGAAGCTAAAGATATGCACGTCATCGGTATTGATCGCGG



AGAAAGAAATTTACTTTACATTTGCGTTATCGATTCTAGGGGCACCATCTTGGATCAAATCTCTTTGAA



CACTATAAATGATATTGACTATCATGATCTACTAGAGAGTCGGGATAAAGACAGGCAACAAGAAAGA



AGAAATTGGCAAACAATTGAAGGTATTAAAGAATTAAAGCAAGGCTATCTAAGCCAGGCTGTACACA



GAATTGCCGAATTAATGGTAGCATATAAAGCTGTCGTAGCTCTAGAAGACTTGAACATGGGTTTCAAA



AGAGGGCGCCAGAAGGTCGAAAGTAGTGTTTATCAACAATTTGAAAAACAGTTAATAGATAAGTTGA



ATTATCTAGTGGATAAAAAAAAGCGTCCTGAGGACATTGGCGGTTTATTAAGAGCCTACCAATTCACT



GCGCCATTTAAATCGTTCAAAGAAATGGGTAAACAAAACGGTTTTCTATTCTACATCCCCGCATGGAA



TACCTCAAATATAGATCCAACTACCGGTTTCGTCAACTTATTTCATGCTCAATATGAGAATGTGGACA



AAGCAAAATCATTCTTTCAAAAATTTGATAGCATTAGCTACAATCCTAAAAAAGATTGGTTTGAATTT



GCGTTCGATTATAAAAATTTCACCAAGAAGGCTGAAGGTTCCAGATCTATGTGGATATTGTGCACCCA



CGGAAGTAGAATTAAGAACTTCCGTAATTCACAGAAAAACGGCCAGTGGGACAGCGAAGAATTCGCC



CTAACCGAAGCTTTCAAAAGTCTTTTCGTAAGATACGAGATAGACTATACAGCTGATCTAAAGACAGC



TATTGTGGATGAGAAGCAAAAAGACTTCTTTGTCGACCTTCTTAAGTTGTTCAAGTTAACTGTGCAGA



TGAGAAATAGTTGGAAGGAAAAAGACCTAGATTACTTGATTAGCCCAGTCGCTGGTGCAGATGGCAG



ATTTTTTGATACACGTGAAGGCAATAAATCACTACCAAAAGACGCGGACGCTAATGGCGCATACAAC



ATCGCATTGAAGGGTTTGTGGGCTCTCAGGCAGATTAGGCAGACAAGTGAGGGTGGTAAGCTTAAGC



TGGCGATTTCTAATAAGGAATGGTTACAGTTTGTTCAAGAAAGATCCTACGAAAAAGATTAA





SEQ ID NO: 133
ATGAACAATGGTACTAATAATTTTCAAAACTTCATAGGGATTTCTAGCCTTCAAAAGACATTGAGAAA
TGCTTTAATTCCAACAGAAACGACTCAACAATTCATAGTGAAAAATGGTATTATAAAAGAAGACGAG
TTGCGTGGCGAGAATAGACAAATTTTGAAAGATATCATGGATGACTACTACAGAGGGTTCATCTCCGA
AACATTGTCTTCTATTGACGACATTGACTGGACCAGCTTATTCGAAAAAATGGAAATACAGCTGAAGA



ACGGAGATAACAAGGACACTCTTATAAAGGAGCAAACGGAATATAGAAAGGCTATACACAAAAAGT



TTGCTAATGACGATAGATTTAAAAACATGTTTAGTGCGAAGTTAATTTCTGATATTCTACCCGAGTTTG



TCATTCATAATAATAACTACTCTGCATCTGAAAAAGAGGAGAAGACCCAGGTTATAAAGTTGTTTTCA



AGATTTGCCACATCATTTAAAGACTACTTCAAGAACAGGGCGAATTGCTTCTCTGCTGATGATATTAG



CTCTTCCAGCTGTCATAGAATTGTTAACGATAATGCCGAAATTTTTTTTAGTAATGCCTTGGTATATAG



ACGCATAGTCAAGTCACTAAGCAATGATGATATAAACAAGATTAGTGGTGATATGAAAGATAGCCTT



AAAGAAATGAGCCTTGAAGAGATATATTCATATGAGAAGTACGGTGAATTTATAACTCAAGAAGGAA



TTTCTTTTTATAACGATATTTGTGGTAAGGTTAATTCTTTTATGAATTTGTATTGCCAGAAGAACAAGG



AAAATAAGAATCTATATAAACTACAAAAGTTGCATAAACAGATTTTGTGTATAGCTGATACATCCTAC



GAAGTTCCGTATAAATTTGAATCTGATGAGGAAGTTTATCAATCGGTAAACGGTTTTCTTGACAACAT



TTCCAGCAAACATATCGTTGAGAGACTACGTAAAATTGGAGACAACTATAATGGTTACAATCTAGATA



AAATATACATAGTGTCCAAGTTTTATGAGTCTGTCTCTCAAAAGACATATCGTGATTGGGAGACCATT



AATACTGCACTTGAAATTCATTATAACAACATATTGCCTGGTAACGGGAAGAGTAAAGCTGATAAGG



TTAAAAAGGCCGTCAAAAACGACTTGCAAAAGTCTATTACCGAGATAAATGAATTAGTGTCAAACTA



CAAACTATGCTCAGATGATAATATTAAAGCGGAAACATACATCCACGAAATTTCCCACATACTGAATA



ACTTTGAAGCTCAGGAGCTTAAATATAACCCGGAAATACACTTGGTTGAGAGCGAGTTAAAAGCATC



TGAGTTGAAAAATGTATTAGACGTCATCATGAATGCGTTTCATTGGTGTTCAGTTTTCATGACTGAAG



AATTAGTCGACAAAGATAACAATTTTTATGCCGAATTAGAGGAAATATATGATGAAATTTATCCCGTA



ATTAGTTTATACAATCTAGTTAGAAATTATGTTACACAAAAGCCGTATAGTACCAAGAAAATAAAGCT



TAATTTCGGAATACCTACGCTTGCTGATGGTTGGTCAAAAAGTAAAGAATATAGCAATAATGCAATAA



TTTTAATGAGAGATAACCTATATTATTTGGGTATTTTTAACGCTAAGAACAAACCAGACAAGAAAATA



ATTGAAGGTAATACATCTGAAAACAAGGGCGACTATAAAAAGATGATATACAATTTGCTCCCAGGTC



CTAATAAAATGATTCCTAAGGTTTTCCTGAGTAGCAAGACTGGCGTTGAAACTTACAAGCCTAGTGCG



TATATCCTGGAGGGTTATAAACAGAACAAGCATATCAAATCCTCTAAGGACTTCGATATCACCTTTTG



CCATGACTTAATCGATTATTTTAAAAATTGTATCGCAATTCATCCAGAATGGAAAAATTTCGGATTTG



ATTTTAGTGATACCAGCACTTACGAGGATATCTCTGGGTTCTACAGAGAAGTGGAGTTGCAGGGCTAC



AAAATCGATTGGACTTACATATCTGAAAAGGACATAGATTTGCTGCAGGAGAAAGGTCAGCTATATTT



GTTTCAAATCTACAACAAAGACTTTTCTAAAAAGTCTACCGGTAATGACAATCTGCACACAATGTACT



TGAAGAACTTATTCTCCGAGGAGAACTTAAAGGACATTGTACTCAAGTTGAATGGAGAAGCCGAGAT



TTTTTTTAGAAAGAGCAGTATAAAGAATCCTATAATCCACAAGAAGGGCTCAATTCTCGTGAATAGGA



CGTATGAGGCAGAAGAAAAGGACCAATTTGGGAATATACAAATTGTAAGAAAAAACATCCCAGAAA



ATATCTACCAGGAATTATATAAGTATTTTAATGACAAATCTGATAAGGAACTGTCTGACGAAGCCGCT



AAGCTCAAGAATGTTGTGGGCCACCATGAAGCTGCTACTAATATAGTGAAGGACTACAGATATACCT



ACGATAAATATTTCCTGCATATGCCAATTACTATAAACTTCAAAGCAAATAAAACAGGTTTTATAAAT



GATAGAATCCTGCAGTATATTGCTAAAGAAAAGGATTTACATGTAATTGGGATTGATAGAGGTGAAC



GCAATCTGATCTATGTCAGCGTAATAGATACTTGTGGTAATATTGTGGAACAAAAGTCCTTTAATATT



GTGAACGGATATGATTACCAAATCAAGTTGAAACAACAAGAGGGAGCACGCCAAATTGCCCGTAAGG



AATGGAAAGAGATAGGTAAGATCAAGGAAATTAAGGAAGGTTATCTTTCATTAGTTATTCACGAAAT



TTCGAAGATGGTAATCAAATACAACGCAATAATTGCTATGGAGGACCTGTCATATGGATTTAAGAAA



GGTAGATTCAAGGTTGAGAGACAGGTATACCAGAAATTTGAAACTATGTTGATCAACAAATTAAATT



ACTTAGTCTTTAAGGACATATCAATAACGGAAAACGGCGGGCTTTTAAAAGGGTATCAACTTACATAC



ATACCTGATAAGTTGAAAAATGTGGGTCATCAGTGTGGGTGCATCTTTTATGTTCCAGCCGCTTACAC



ATCAAAAATCGATCCTACTACTGGGTTCGTAAACATATTTAAATTTAAAGATCTAACCGTTGATGCAA



AAAGAGAGTTTATCAAGAAATTTGATAGCATTAGGTACGATTCAGAAAAAAATCTATTCTGTTTTACT



TTTGACTACAACAACTTTATAACGCAGAATACAGTGATGTCAAAATCGTCCTGGTCAGTGTATACTTA



TGGTGTTAGAATTAAGAGACGTTTCGTAAACGGTCGTTTTTCTAACGAGTCCGATACAATCGACATCA



CTAAAGATATGGAAAAAACTTTGGAAATGACAGATATAAACTGGAGAGATGGTCACGACCTTAGACA



AGATATAATCGATTATGAAATCGTACAGCATATTTTTGAAATTTTTCGCTTAACAGTTCAGATGCGTA



ACTCTCTTAGTGAGCTAGAAGATAGAGATTATGATAGACTTATCTCGCCTGTTCTTAACGAAAATAAT



ATCTTCTATGACTCGGCAAAAGCCGGTGATGCACTTCCAAAAGATGCTGATGCAAATGGCGCGTACTG



CATCGCATTGAAGGGGCTCTACGAGATTAAACAAATCACCGAAAACTGGAAAGAAGATGGTAAATTT



TCTAGGGATAAGTTGAAAATCAGTAATAAAGATTGGTTCGATTTTATACAAAATAAGCGATACTTATA



G





SEQ ID NO: 134

ATGACCAATAAGTTTACTAATCAATACTCATTGTCTAAAACGTTAAGATTCGAGTTAATTCCCCAGGG



AAAGACACTAGAATTTATTCAAGAAAAAGGTCTTCTCTCTCAGGATAAACAAAGAGCAGAATCATAC



CAGGAGATGAAAAAAACCATAGATAAATTTCATAAGTACTTCATCGACTTGGCACTATCGAACGCCA



AGCTAACACATTTGGAAACCTACCTGGAGTTGTATAATAAATCGGCAGAGACGAAAAAGGAACAAAA



ATTCAAGGATGACCTGAAGAAGGTTCAAGATAATCTGCGAAAGGAAATAGTGAAGTCGTTTAGTGAT



GGTGATGCAAAGTCAATCTTTGCTATTTTAGACAAGAAGGAATTAATAACCGTGGAACTTGAAAAGT



GGTTTGAAAATAACGAACAGAAAGATATTTACTTCGACGAAAAATTTAAAACGTTTACTACGTACTTT



ACAGGGTTCCATCAGAACCGCAAAAACATGTACTCCGTTGAACCAAACTCTACTGCAATCGCCTACAG



ATTAATACACGAAAATTTGCCTAAGTTTTTAGAAAATGCAAAGGCTTTTGAAAAGATAAAGCAAGTC



GAATCGTTACAGGTAAACTTTCGCGAATTAATGGGCGAATTTGGAGATGAAGGTCTTATTTTTGTCAA



TGAATTAGAGGAAATGTTTCAAATTAATTATTATAACGATGTCTTGAGTCAGAACGGCATTACTATCT



ACAACTCAATTATCAGTGGTTTCACTAAGAATGATATAAAATATAAAGGTTTGAATGAATACATTAAT



AATTATAATCAAACTAAAGATAAGAAGGACAGGCTTCCGAAATTGAAGCAATTGTACAAGCAGATTC



TAAGTGATAGGATTAGTTTGTCTTTCTTGCCAGACGCATTTACTGATGGCAAGCAAGTCTTAAAGGCT



ATATTCGATTTCTACAAGATTAACCTACTTTCGTACACAATTGAAGGTCAAGAAGAATCTCAAAATCT



GCTGCTTTTGATTAGGCAAACTATAGAAAATTTGTCGTCCTTTGACACTCAAAAAATTTACCTGAAGA



ATGATACACACCTGACTACAATATCACAGCAGGTCTTTGGGGATTTTTCTGTCTTCTCCACGGCCCTAA



ACTATTGGTATGAGACAAAAGTTAATCCAAAATTTGAAACAGAATATAGTAAGGCGAATGAAAAAAA



GAGAGAAATTTTGGATAAAGCGAAGGCAGTATTCACAAAACAAGACTATTTTTCTATCGCATTTCTCC



AAGAAGTCTTATCCGAATATATTTTGACACTCGATCACACCTCTGATATAGTTAAGAAACATTCGTCC



AACTGCATCGCAGATTACTTCAAGAATCACTTCGTGGCTAAGAAAGAAAACGAAACGGATAAAACTT



TTGACTTCATTGCTAACATAACCGCTAAATACCAATGTATTCAGGGCATATTAGAAAATGCAGACCAG



TACGAAGACGAGTTAAAACAGGACCAAAAGTTAATAGATAATCTAAAGTTTTTCTTAGATGCTATACT



TGAGTTATTACATTTTATAAAGCCATTGCATCTAAAATCGGAAAGTATTACTGAAAAAGACACTGCGT



TCTATGATGTGTTCGAAAATTATTATGAGGCTTTATCTTTATTGACCCCCCTTTACAACATGGTCCGCA



ATTATGTTACTCAGAAGCCTTACTCTACTGAAAAGATCAAATTAAACTTTGAAAATGCTCAGTTGCTG



AATGGTTGGGATGCCAATAAGGAAGGTGACTACCTGACGACTATTCTAAAAAAAGACGGTAATTATT



TCTTAGCAATCATGGATAAAAAACATAACAAGGCATTTCAAAAATTTCCAGAAGGAAAAGAAAACTA



TGAAAAGATGGTTTATAAATTGTTGCCTGGAGTTAATAAAATGTTGCCAAAAGTTTTTTTTAGCAATA



AGAACATAGCTTACTTTAATCCATCTAAGGAACTGCTCGAGAACTACAAGAAGGAAACACATAAAAA



AGGTGATACATTTAATTTGGAACATTGCCATACTCTGATTGATTTTTTTAAGGACTCTCTTAATAAACA



TGAAGACTGGAAATATTTTGATTTTCAATTTTCGGAAACTAAATCATACCAAGATCTAAGTGGATTTT



ACAGAGAAGTTGAACACCAAGGTTATAAGATTAACTTCAAGAATATAGATTCTGAATACATTGATGG



TCTTGTAAACGAGGGTAAACTATTCCTGTTCCAAATCTACTCTAAGGACTTCTCACCTTTTTCCAAAGG



AAAACCTAATATGCATACGTTGTACTGGAAGGCTCTATTTGAAGAACAAAATTTGCAAAATGTAATCT



ACAAACTGAACGGCCAAGCTGAAATATTCTTCAGAAAAGCCTCAATTAAGCCAAAAAACATTATTCTT



CATAAAAAGAAGATCAAGATTGCGAAGAAACATTTTATTGATAAGAAGACCAAGACTTCCGAAATTG



TACCAGTACAAACAATCAAGAATCTCAATATGTATTATCAAGGCAAGATAAGTGAGAAAGAGTTAAC



CCAGGATGATTTACGTTATATAGACAATTTCTCTATATTCAACGAGAAGAACAAAACAATAGACATTA



TCAAAGATAAAAGGTTTACTGTTGACAAATTTCAATTTCATGTGCCTATCACAATGAACTTTAAGGCC



ACAGGTGGTTCGTACATTAATCAAACTGTTTTAGAATATCTGCAAAATAACCCAGAGGTCAAGATCAT



CGGTCTTGATAGGGGTGAGAGACATCTGGTGTATCTAACACTCATTGATCAACAAGGCAACATCTTGA



AGCAAGAATCATTGAACACTATCACAGACTCCAAGATCTCGACTCCATATCACAAACTCCTTGACAAT



AAAGAAAACGAAAGGGATCTTGCCAGAAAAAATTGGGGTACAGTTGAAAATATTAAGGAACTAAAA



GAAGGTTACATTTCGCAAGTAGTTCACAAGATTGCAACACTCATGTTGGAAGAAAACGCAATCGTTGT



CATGGAAGATTTAAATTTCGGATTTAAGAGAGGAAGATTTAAAGTAGAAAAGCAAATCTACCAGAAG



TTGGAGAAGATGTTAATTGACAAATTGAACTACTTAGTGCTGAAAGACAAACAGCCTCAAGAATTGG



GCGGTCTATACAACGCTTTACAACTGACAAATAAATTTGAGTCATTCCAAAAGATGGGTAAGCAGAG



TGGTTTTTTGTTTTATGTTCCGGCATGGAACACATCCAAAATCGATCCAACTACAGGCTTCGTGAATTA



TTTCTACACTAAATATGAAAATGTGGATAAAGCAAAAGCTTTCTTTGAGAAGTTCGAGGCGATCCGTT



TTAACGCTGAAAAGAAGTACTTCGAGTTCGAGGTCAAAAAGTATTCAGATTTTAACCCCAAGGCTGA



AGGCACCCAGCAAGCATGGACTATTTGCACGTACGGTGAGCGAATCGAAACTAAAAGGCAAAAGGAT



CAAAATAATAAGTTTGTAAGCACACCCATTAACTTGACAGAAAAGATAGAAGATTTTCTTGGAAAAA



ACCAAATTGTATATGGTGACGGTAACTGTATCAAGTCACAAATTGCTTCTAAAGACGATAAGGCCTTC



TTCGAAACTCTGCTATACTGGTTTAAAATGACGTTGCAAATGAGAAACAGTGAAACTAGAACTGATAT



CGACTATTTAATATCACCCGTGATGAACGATAATGGTACCTTTTACAATTCAAGAGATTACGAGAAAT



TGGAGAACCCCACACTACCAAAAGACGCAGACGCTAATGGTGCCTACCATATTGCTAAAAAGGGACT



GATGTTGTTGAACAAGATAGATCAAGCCGACTTAACTAAAAAAGTTGATTTGTCAATTTCGAATAGAG



ATTGGTTGCAATTCGTCCAGAAAAATAAGTAA





SEQ ID NO: 135

ATGGAACAGGAATACTACTTGGGTTTGGATATGGGAACTGGTTCAGTCGGTTGGGCTGTTACGGACTC



CGAGTACCACGTGTTGAGAAAACACGGAAAGGCTTTATGGGGTGTCAGACTATTCGAATCAGCATCG



ACCGCGGAAGAGAGAAGAATGTTTAGAACTTCAAGAAGAAGGCTGGATCGTAGGAATTGGCGGATA



GAAATTTTACAAGAAATATTCGCCGAAGAAATCTCTAAAAAAGATCCAGGATTTTTTCTACGTATGAA



GGAATCCAAATACTATCCGGAAGATAAACGTGATATTAATGGCAATTGTCCAGAGTTACCCTATGCTT



TATTTGTGGACGACGATTTCACCGATAAAGATTACCATAAGAAGTTCCCAACAATTTACCATCTGAGA



AAGATGTTAATGAACACTGAAGAAACCCCGGATATAAGACTGGTCTATCTAGCCATTCATCATATGAT



GAAACACAGGGGACACTTCTTGCTATCAGGGGATATAAATGAAATTAAAGAATTTGGTACAACATTTT



CTAAATTATTGGAAAATATTAAAAACGAAGAATTAGATTGGAATTTAGAATTAGGCAAGGAGGAATA



CGCAGTTGTCGAATCGATTCTGAAAGATAACATGTTGAACAGATCAACGAAAAAAACAAGGCTGATC



AAGGCTTTAAAAGCGAAATCAATATGCGAAAAAGCAGTATTGAATTTGTTAGCTGGGGGGACTGTCA



AGTTGTCTGATATTTTCGGATTGGAAGAATTGAATGAAACAGAGAGACCGAAGATATCCTTCGCCGAT



AATGGCTACGATGATTATATAGGCGAAGTCGAAAATGAGCTGGGCGAACAATTCTACATTATCGAGA



CTGCCAAGGCTGTTTATGATTGGGCGGTGTTAGTCGAAATCCTTGGCAAATACACTTCCATCTCCGAA



GCTAAGGTGGCAACCTACGAAAAGCATAAAAGTGATTTGCAATTCCTTAAGAAAATTGTCCGAAAGT



ACTTGACCAAAGAAGAGTACAAGGATATTTTCGTATCAACATCGGACAAACTGAAGAATTATTCAGC



TTATATTGGCATGACGAAAATTAATGGTAAGAAAGTTGATTTGCAATCCAAGAGATGTTCTAAAGAA



GAATTTTACGATTTCATTAAAAAAAATGTCCTAAAAAAGTTGGAGGGACAACCTGAATATGAGTATTT



AAAGGAAGAACTGGAAAGAGAAACTTTCCTACCAAAGCAAGTTAATCGTGATAATGGCGTTATTCCA



TACCAAATACACTTGTACGAATTAAAGAAGATCTTGGGTAACTTGAGGGACAAAATTGATTTAATCAA



GGAAAATGAAGACAAACTGGTACAATTATTTGAATTTAGAATACCTTACTACGTGGGCCCTTTAAACA



AAATAGACGATGGTAAGGAAGGGAAGTTCACATGGGCAGTCAGAAAGTCCAATGAAAAAATTTACCC



ATGGAATTTCGAAAACGTTGTAGATATTGAAGCTTCTGCTGAGAAATTTATTAGGAGAATGACAAATA



AATGCACTTATCTTATGGGGGAAGACGTGTTGCCTAAAGATAGTTTATTATATTCAAAGTATATGGTC



TTAAATGAATTAAACAATGTTAAATTAGATGGTGAAAAACTTTCCGTCGAATTGAAACAAAGATTGTA



TACAGATGTATTCTGCAAATATAGAAAAGTAACTGTAAAGAAGATTAAAAACTACCTTAAATGTGAA



GGCATTATCAGCGGAAATGTTGAGATCACTGGTATCGATGGTGATTTTAAGGCATCTTTAACCGCATA



TCACGACTTTAAGGAAATATTGACGGGTACTGAGCTTGCTAAAAAAGACAAAGAGAACATTATCACC



AATATCGTGCTCTTCGGAGACGACAAGAAATTATTGAAAAAGAGATTGAACCGCCTATACCCTCAGA



TTACCCCTAACCAATTGAAGAAAATCTGCGCTCTGTCTTATACTGGATGGGGTCGTTTTAGCAAGAAG



TTTCTAGAAGAAATTACTGCTCCGGATCCTGAAACTGGGGAAGTCTGGAATATAATTACCGCGCTATG



GGAATCGAATAATAATTTAATGCAATTACTATCTAATGAATACAGATTTATGGAAGAAGTCGAAACTT



ACAATATGGGAAAACAAACAAAAACTTTGAGCTACGAAACAGTAGAGAATATGTATGTCTCACCATC



TGTAAAGCGGCAGATCTGGCAAACCTTGAAGATAGTTAAAGAATTAGAAAAAGTGATGAAGGAAAGT



CCAAAAAGGGTTTTTATTGAAATGGCCCGAGAAAAACAAGAATCTAAAAGGACGGAAAGTAGGAAA



AAGCAACTTATAGATCTATATAAAGCCTGCAAAAATGAAGAAAAAGATTGGGTAAAGGAATTAGGTG



ACCAGGAAGAGCAAAAATTGAGATCTGACAAGCTGTACTTGTATTATACGCAAAAGGGCCGGTGTAT



GTATTCGGGTGAGGTAATAGAATTGAAAGATTTATGGGATAACACTAAGTATGACATTGACCATATTT



ACCCCCAGTCTAAGACAATGGACGATTCATTAAATAACCGAGTTCTTGTCAAAAAGAAGTACAATGC



CACAAAGAGCGATAAGTACCCATTGAACGAAAATATAAGACATGAACGAAAAGGTTTCTGGAAATCA



TTGTTGGACGGTGGATTTATTTCCAAAGAAAAATACGAGAGATTGATTAGAAACACTGAACTATCTCC



AGAGGAGTTAGCTGGCTTTATCGAAAGACAAATTGTTGAAACTAGACAGTCTACAAAAGCAGTTGCA



GAAATCTTAAAACAAGTATTTCCAGAATCCGAAATTGTGTACGTCAAAGCCGGAACAGTAAGTAGAT



TTAGAAAAGACTTTGAATTATTGAAAGTACGAGAGGTTAACGACCTACATCATGCTAAGGATGCTTAT



TTAAATATAGTCGTTGGTAATTCGTATTACGTGAAATTCACAAAAAACGCATCTTGGTTCATCAAGGA



GAATCCTGGTAGGACATACAACTTGAAAAAGATGTTTACATCAGGATGGAATATCGAAAGAAATGGT



GAGGTTGCGTGGGAGGTAGGCAAGAAGGGAACCATTGTTACTGTAAAGCAAATTATGAATAAAAACA



ATATACTTGTTACGAGACAGGTGCACGAAGCCAAAGGAGGGTTGTTTGACCAGCAAATCATGAAGAA



AGGTAAAGGTCAGATAGCAATAAAAGAGACTGATGAGCGTTTAGCTAGTATAGAAAAATATGGGGGC



TACAATAAGGCAGCTGGTGCTTACTTCATGTTGGTCGAATCAAAGGATAAAAAAGGGAAGACGATCC



GGACCATAGAGTTTATCCCTCTGTACTTGAAGAATAAGATTGAGTCTGACGAAAGCATCGCATTGAAT



TTCTTGGAAAAGGGGCGCGGTCTAAAGGAGCCAAAAATATTGTTAAAGAAAATTAAAATAGACACCC



TATTCGACGTCGATGGGTTTAAGATGTGGCTTAGTGGTCGTACTGGGGACAGATTATTATTCAAGTGT



GCCAATCAGTTAATCCTTGACGAGAAAATCATTGTTACAATGAAAAAAATTGTTAAGTTTATTCAAAG



GCGACAAGAAAATAGAGAACTAAAGTTGAGTGATAAGGATGGAATCGATAATGAAGTGTTAATGGA



GATTTATAACACTTTTGTCGACAAATTGGAGAATACGGTGTACAGAATTAGGCTATCTGAACAGGCTA



AAACCCTAATTGATAAACAGAAGGAGTTTGAGCGACTTTCTCTTGAAGACAAATCTTCAACTCTTTTC



GAGATCCTACATATCTTTCAGTGTCAATCTTCTGCAGCTAATTTGAAAATGATTGGAGGTCCTGGTAA



GGCTGGTATATTAGTCATGAACAACAACATATCTAAGTGTAATAAGATTAGTATAATTAACCAATCAC



CGACAGGTATCTTTGAAAATGAAATTGATTTACTTAAA





SEQ ID NO: 136

ATGAAATCATTCGACTCGTTCACCAACTTGTACTCCCTGTCTAAAACATTGAAATTTGAAATGCGACC



TGTTGGTAACACCCAAAAGATGTTAGATAATGCAGGAGTTTTCGAAAAGGATAAACTGATCCAGAAA



AAATACGGTAAAACGAAACCATATTTCGATAGGTTGCATCGGGAATTTATAGAAGAAGCTTTGACTG



GTGTAGAATTAATTGGCTTAGATGAGAATTTCCGTACTCTAGTCGATTGGCAAAAAGATAAAAAGAA



CAATGTTGCCATGAAGGCATACGAAAATAGTCTACAAAGACTAAGAACAGAGATCGGGAAAATTTTC



AATTTGAAGGCAGAAGACTGGGTGAAGAACAAATATCCAATATTGGGTCTTAAGAATAAGAATACTG



ATATATTGTTCGAGGAGGCCGTTTTCGGTATTCTTAAGGCAAGATATGGTGAAGAGAAAGACACGTTT



ATTGAAGTTGAGGAGATTGATAAAACCGGTAAGTCCAAAATCAACCAGATCTCTATCTTCGACAGTTG



GAAGGGCTTCACTGGTTATTTTAAGAAGTTCTTCGAAACTAGGAAGAACTTCTATAAAAACGATGGTA



CTTCCACGGCTATTGCTACAAGAATTATCGACCAAAACCTTAAGCGTTTTATTGATAACCTATCAATTG



TTGAAAGTGTTCGACAGAAAGTAGATTTGGCTGAAACTGAAAAATCTTTTAGTATCTCCTTATCCCAG



TTTTTCTCTATAGATTTTTATAATAAATGTTTGCTGCAAGATGGCATTGACTACTATAATAAAATAATT



GGTGGAGAGACATTGAAAAACGGAGAGAAGCTGATTGGCCTTAATGAGTTGATAAATCAATATAGAC



AAAATAATAAGGACCAGAAAATCCCTTTCTTTAAATTGCTAGACAAACAGATTTTGTCTGAAAAGATC



CTATTCTTGGATGAAATAAAGAACGATACTGAATTGATTGAAGCTTTGTCCCAGTTTGCTAAAACAGC



TGAAGAAAAGACAAAGATTGTGAAAAAATTGTTTGCTGATTTCGTAGAAAACAATTCTAAATATGAT



CTAGCCCAGATTTATATAAGTCAAGAAGCTTTCAATACAATAAGTAATAAGTGGACAAGTGAAACAG



AAACTTTTGCTAAGTATTTATTCGAAGCCATGAAGTCTGGTAAACTTGCCAAATACGAAAAAAAAGAT



AACAGTTATAAATTTCCAGACTTTATAGCCCTTTCACAGATGAAGTCTGCCTTATTGTCGATATCCTTA



GAAGGTCATTTTTGGAAGGAAAAATATTATAAGATAAGCAAGTTCCAAGAAAAGACTAATTGGGAAC



AATTTTTGGCTATATTTCTATATGAGTTCAATTCATTATTTTCCGATAAAATCAACACTAAGGATGGAG



AGACTAAGCAAGTTGGCTACTATTTGTTCGCAAAAGATCTGCACAATTTGATTCTATCAGAACAAATA



GATATACCAAAAGATTCAAAGGTAACTATAAAGGATTTCGCAGATTCCGTCCTCACCATTTATCAAAT



GGCTAAATATTTTGCCGTTGAAAAAAAGAGAGCGTGGTTAGCAGAATACGAGTTGGACTCGTTTTATA



CTCAGCCAGATACTGGATACTTGCAATTCTACGATAATGCATACGAAGACATTGTACAGGTATACAAT



AAACTTAGAAATTACTTAACCAAGAAGCCCTACAGTGAAGAAAAATGGAAGCTGAACTTTGAAAATT



CGACTTTGGCAAATGGTTGGGATAAAAATAAAGAAAGTGACAACTCCGCAGTGATTTTGCAAAAGGG



TGGGAAATATTACTTGGGTTTAATCACAAAAGGCCACAATAAGATTTTTGATGATAGATTTCAAGAAA



AATTCATAGTTGGTATAGAAGGTGGCAAATACGAGAAAATTGTCTATAAATTCTTCCCTGATCAAGCC



AAAATGTTCCCAAAAGTTTGCTTTTCTGCTAAAGGATTGGAGTTTTTCCGGCCTAGCGAGGAGATCCT



TCGTATCTACAACAATGCTGAATTCAAAAAAGGAGAAACCTATAGCATAGATTCTATGCAAAAACTG



ATAGATTTTTATAAGGATTGTTTAACAAAGTACGAAGGCTGGGCCTGCTATACATTTAGACATTTAAA



GCCCACAGAAGAATACCAAAATAACATTGGTGAATTCTTTCGGGACGTTGCCGAAGACGGCTATAGG



ATCGATTTTCAAGGTATCTCAGATCAATATATCCACGAAAAGAACGAGAAGGGTGAGCTGCACCTTTT



CGAAATTCATAATAAGGACTGGAATTTGGATAAGGCGAGAGATGGTAAATCGAAGACCACTCAAAAG



AACTTGCATACTTTATATTTTGAGTCCTTGTTTTCTAATGATAACGTCGTCCAAAATTTTCCAATAAAG



TTGAATGGACAAGCGGAAATTTTCTATCGGCCTAAGACAGAGAAAGACAAATTAGAATCAAAGAAAG



ATAAAAAGGGAAATAAAGTCATTGATCACAAACGATACTCTGAGAATAAAATATTTTTCCACGTACC



ATTGACACTCAACAGGACTAAGAATGACTCTTATAGATTTAATGCTCAGATTAATAATTTTTTGGCAA



ATAACAAGGATATTAACATAATTGGGGTGGATAGAGGTGAAAAGCACTTGGTATATTACTCTGTCATC



ACTCAGGCTTCTGATATATTGGAAAGCGGGTCTCTAAATGAATTGAACGGTGTTAACTACGCCGAAAA



GCTAGGTAAAAAAGCTGAAAACAGAGAGCAGGCTCGGCGCGATTGGCAAGATGTTCAAGGAATTAA



AGACCTTAAAAAAGGCTACATTAGTCAAGTAGTTAGAAAGTTAGCCGATCTTGCTATTAAACATAACG



CAATCATTATTCTGGAGGACCTAAATATGCGTTTTAAGCAAGTTAGGGGTGGCATAGAAAAAAGTATT



TATCAGCAGCTTGAGAAGGCTTTGATAGATAAGTTATCGTTCCTAGTTGACAAAGGTGAAAAAAATCC



TGAACAAGCTGGTCATCTGTTGAAAGCTTATCAGCTGAGCGCACCTTTTGAAACATTTCAAAAAATGG



GAAAACAAACAGGTATTATTTTCTATACTCAAGCGAGTTATACAAGTAAATCTGACCCAGTGACAGG



ATGGAGACCACACCTTTATCTAAAATATTTTTCTGCTAAAAAGGCCAAAGATGACATCGCTAAGTTTA



CAAAAATAGAATTTGTCAACGATAGATTTGAATTGACTTACGATATTAAAGATTTTCAGCAAGCAAAA



GAATACCCAAATAAGACAGTGTGGAAAGTATGCTCCAATGTGGAGAGATTTAGATGGGATAAAAATC



TCAATCAAAACAAGGGTGGTTACACACATTATACTAATATAACTGAAAATATTCAAGAATTGTTTACT



AAGTACGGAATTGACATAACCAAAGACTTACTAACTCAGATTTCAACTATTGACGAAAAACAAAATA



CCTCATTTTTCCGCGACTTTATTTTTTATTTCAACTTGATCTGTCAAATTCGTAACACGGATGATTCCGA



AATTGCCAAGAAGAACGGAAAAGATGATTTCATCCTATCTCCAGTGGAACCATTTTTTGACTCAAGAA



AAGATAATGGTAATAAGTTGCCTGAGAACGGAGATGATAACGGCGCTTATAATATCGCTCGGAAGGG



TATTGTAATTCTTAATAAAATATCTCAGTACTCTGAAAAGAACGAAAACTGCGAGAAAATGAAGTGG



GGCGACTTGTATGTATCTAATATAGATTGGGATAATTTCGTTACTCAAGCCAACGCGAGACATTGA





SEQ ID NO: 137

ATGGAAAATTTTAAAAACCTATATCCAATTAATAAGACACTTAGATTCGAGCTTAGGCCATACGGCAA



AACACTAGAAAATTTTAAGAAGTCAGGCCTATTAGAAAAAGACGCCTTTAAGGCAAATTCCAGAAGA



TCAATGCAGGCAATTATTGATGAGAAATTTAAAGAGACTATCGAGGAAAGGTTGAAATACACTGAAT



TCTCTGAGTGCGATCTGGGAAACATGACTTCCAAGGATAAAAAGATTACCGATAAGGCTGCTACCAA



CCTCAAAAAGCAAGTCATCTTATCGTTTGATGATGAAATTTTTAATAACTACTTAAAGCCGGACAAAA



ACATTGACGCCCTATTCAAAAATGATCCGTCCAACCCCGTAATTTCAACTTTTAAGGGTTTTACCACGT



ACTTTGTAAATTTTTTTGAGATTCGTAAACATATCTTCAAAGGAGAATCGTCGGGTTCCATGGCCTATA



GGATAATTGATGAAAATCTTACGACTTACTTAAACAATATCGAAAAGATAAAAAAGTTACCAGAAGA



ATTAAAGTCTCAATTGGAAGGTATTGACCAAATAGACAAATTAAATAACTATAATGAGTTCATAACTC



AAAGCGGTATCACACATTACAATGAAATTATCGGTGGTATATCTAAAAGTGAGAACGTAAAAATACA



GGGAATAAACGAGGGGATCAATCTATACTGTCAGAAGAATAAAGTAAAATTACCAAGACTAACGCCA



TTATACAAAATGATTCTGTCTGATAGAGTTTCCAACTCGTTCGTGCTTGATACTATAGAAAATGATACT



GAATTAATTGAGATGATTAGCGACTTGATTAATAAAACAGAAATATCTCAAGACGTAATAATGTCAG



ACATTCAGAACATTTTCATAAAATATAAACAGCTTGGTAATTTACCGGGGATAAGTTACTCTAGCATC



GTGAATGCTATTTGCTCCGATTATGACAATAATTTTGGTGACGGAAAAAGAAAAAAATCATATGAGA



ACGATAGGAAGAAACACCTTGAAACAAACGTATACTCAATTAACTATATATCGGAACTGTTAACAGA



CACCGATGTATCATCTAATATAAAAATGAGATATAAGGAACTTGAACAAAATTACCAGGTGTGTAAG



GAGAATTTCAATGCTACCAACTGGATGAACATTAAGAATATTAAACAGAGTGAAAAGACAAACTTGA



TTAAAGATCTACTAGATATACTGAAATCAATACAGAGATTCTACGATCTGTTTGATATAGTTGATGAA



GACAAAAATCCTAGTGCTGAGTTTTACACGTGGCTAAGTAAAAATGCGGAAAAGTTAGATTTCGAGTT



CAACTCTGTTTATAATAAATCTAGGAATTATTTAACTAGAAAGCAGTATTCTGATAAAAAGATAAAAT



TGAACTTCGACTCCCCTACGTTGGCAAAGGGTTGGGATGCAAACAAAGAAATCGATAACTCCACCAT



AATAATGCGTAAGTTTAACAATGATAGGGGGGATTACGATTATTTTTTGGGAATTTGGAACAAATCTA



CCCCAGCGAATGAAAAAATTATTCCCCTTGAAGACAATGGTCTTTTTGAAAAAATGCAGTATAAATTA



TATCCAGACCCATCCAAGATGCTTCCAAAGCAATTTCTGTCAAAAATTTGGAAGGCTAAACACCCTAC



TACTCCTGAATTTGATAAGAAGTATAAGGAGGGCCGACACAAAAAGGGTCCAGATTTTGAAAAAGAA



TTCCTGCATGAATTGATAGATTGTTTTAAGCATGGTTTGGTAAATCATGATGAAAAATATCAGGATGT



CTTTGGATTCAATTTGAGAAATACAGAGGATTACAACTCATATACAGAATTTCTCGAGGACGTCGAAC



GTTGCAATTATAATCTCAGTTTCAACAAGATCGCAGACACTTCAAACTTAATTAACGACGGAAAATTG



TACGTTTTTCAAATCTGGTCGAAAGACTTTAGTATTGATTCAAAGGGTACAAAAAACCTAAATACAAT



ATATTTCGAAAGTCTATTCTCGGAAGAAAACATGATCGAAAAAATGTTCAAACTGTCAGGCGAAGCT



GAAATATTCTACCGTCCCGCAAGCCTTAATTATTGTGAGGATATCATTAAAAAAGGACATCACCATGC



AGAGTTAAAAGATAAATTCGATTACCCAATAATTAAAGATAAAAGATACTCCCAGGATAAGTTCTTTT



TCCATGTACCTATGGTTATTAACTACAAGTCGGAAAAACTAAACTCGAAGTCATTAAATAATAGAACT



AACGAGAACTTGGGACAATTCACACATATAATTGGTATTGATCGTGGCGAAAGACATTTAATATATCT



GACTGTTGTTGATGTTTCAACAGGAGAAATTGTTGAACAGAAACATCTTGATGAAATTATAAACACAG



ATACAAAAGGCGTTGAGCATAAAACTCATTATCTAAATAAATTGGAGGAAAAGTCGAAGACTCGCGA



TAACGAGAGAAAGAGTTGGGAAGCAATTGAAACCATAAAAGAGCTTAAAGAAGGTTACATTAGTCAC



GTCATCAATGAAATACAAAAGTTACAAGAAAAGTATAACGCTTTGATTGTAATGGAAAATCTAAATT



ATGGTTTTAAGAATTCAAGAATCAAAGTCGAAAAGCAGGTCTATCAGAAATTTGAAACGGCACTTATT



AAAAAGTTTAACTACATTATTGATAAAAAGGACCCAGAAACTTATATTCATGGTTACCAACTGACGAA



CCCAATCACAACATTGGACAAAATTGGAAACCAAAGTGGAATTGTTTTATACATTCCAGCTTGGAATA



CATCCAAAATAGACCCTGTCACGGGGTTTGTCAACTTGTTATATGCCGACGATTTAAAGTATAAAAAC



CAAGAACAAGCAAAGTCTTTTATTCAAAAGATTGATAATATTTATTTCGAAAACGGTGAATTTAAATT



CGACATAGATTTTTCTAAATGGAACAACCGTTATTCAATAAGTAAAACTAAATGGACACTCACCTCAT



ACGGCACTCGTATCCAAACCTTTCGGAATCCCCAAAAAAATAACAAATGGGATTCTGCAGAATACGA



CTTGACCGAGGAATTTAAATTAATTCTTAATATAGACGGTACACTCAAAAGTCAAGACGTGGAGACAT



ACAAGAAGTTTATGTCGTTATTCAAGCTTATGCTTCAGTTGAGGAACTCCGTTACAGGCACTGATATT



GATTACATGATTTCACCAGTAACGGATAAGACTGGGACTCATTTCGATTCTAGGGAAAATATTAAAAA



TTTACCTGCTGACGCAGACGCAAACGGCGCATACAATATAGCAAGAAAAGGGATTATGGCCATTGAG



AATATTATGAATGGCATATCAGATCCATTAAAGATAAGCAATGAAGACTACTTAAAATACATTCAGA



ATCAGCAAGAATAA





SEQ ID NO: 138

ATGACCCAGTTTGAAGGTTTCACCAATTTGTACCAAGTAAGTAAAACCTTGAGGTTCGAATTGATCCC



ACAGGGCAAGACATTGAAGCATATTCAAGAGCAAGGATTTATAGAAGAAGATAAAGCGAGAAACGA



TCACTATAAAGAGTTAAAACCCATTATTGACAGGATCTATAAAACATACGCCGATCAATGCCTTCAAT



TAGTGCAATTAGATTGGGAAAACTTGAGCGCTGCCATCGATTCCTACAGGAAGGAAAAAACAGAAGA



AACAAGAAATGCCTTAATCGAGGAACAAGCAACCTATAGAAACGCTATACACGATTACTTCATCGGT



AGAACTGATAATCTAACAGATGCAATAAATAAGAGACATGCTGAGATATATAAAGGACTATTTAAAG



CAGAATTATTCAACGGAAAGGTGTTGAAACAGTTAGGTACCGTTACAACTACTGAGCATGAAAATGC



CTTGCTGAGAAGCTTTGACAAGTTTACTACCTACTTTTCGGGTTTCTACGAAAATCGCAAAAATGTATT



TTCTGCGGAAGATATTTCAACTGCAATCCCTCATAGGATTGTTCAAGATAATTTCCCTAAGTTTAAAG



AGAACTGTCACATTTTTACAAGGTTAATTACTGCGGTTCCAAGTCTAAGAGAACATTTTGAGAATGTA



AAAAAAGCGATTGGTATATTTGTATCCACTAGCATTGAAGAGGTTTTCAGCTTCCCTTTTTATAACCAA



TTACTTACCCAAACACAGATCGACCTGTACAACCAATTGTTAGGTGGTATATCGAGGGAGGCTGGTAC



GGAAAAGATTAAAGGATTAAATGAAGTTCTTAATTTGGCCATACAAAAAAATGATGAAACCGCGCAC



ATTATCGCATCTTTACCACATAGGTTTATACCGTTATTCAAGCAAATATTATCTGATCGTAATACCTTA



TCGTTCATATTAGAGGAGTTTAAATCTGACGAAGAAGTTATACAATCTTTTTGCAAGTATAAGACGCT



ATTGAGAAACGAAAACGTTCTGGAAACAGCCGAAGCACTGTTCAATGAATTAAACAGTATCGACTTG



ACTCATATTTTTATATCGCATAAAAAGTTGGAGACAATTTCTTCAGCATTGTGCGATCACTGGGACAC



TTTAAGGAACGCACTATATGAACGTAGGATCTCAGAATTGACAGGTAAGATAACGAAGTCTGCTAAA



GAGAAAGTGCAGAGATCCCTAAAACACGAGGATATAAATTTGCAGGAGATAATTTCAGCTGCAGGTA



AAGAGTTGTCTGAAGCGTTCAAGCAAAAGACTTCCGAAATCTTGTCACACGCACACGCCGCATTAGAT



CAACCTTTACCCACTACTTTGAAAAAACAAGAAGAGAAGGAGATATTAAAATCACAACTTGATTCTTT



ACTTGGCCTTTATCATCTTTTAGATTGGTTCGCTGTTGACGAGAGCAATGAAGTGGATCCAGAGTTTTC



CGCAAGATTGACCGGTATAAAGTTGGAAATGGAACCTTCGTTATCATTTTACAACAAAGCTAGGAACT



ATGCTACAAAAAAACCTTATTCTGTCGAAAAATTTAAACTGAACTTCCAAATGCCTACTCTAGCAAGT



GGCTGGGATGTTAATAAAGAAAAGAACAATGGCGCTATTTTGTTTGTAAAAAATGGCCTATACTATCT



TGGAATTATGCCTAAACAAAAAGGTCGCTACAAGGCTTTGTCATTTGAACCTACTGAAAAGACTAGCG



AAGGTTTCGATAAGATGTATTACGATTATTTCCCGGATGCCGCTAAAATGATCCCCAAGTGCTCTACT



CAATTGAAGGCAGTAACTGCTCATTTCCAAACGCATACCACGCCAATACTGCTTTCTAACAACTTTAT



AGAACCACTAGAAATAACGAAAGAAATTTACGACCTAAATAACCCAGAGAAAGAACCAAAAAAGTT



CCAGACGGCCTACGCCAAAAAGACAGGGGACCAAAAAGGTTACCGCGAGGCGTTATGTAAATGGATT



GATTTTACTAGGGACTTTTTATCAAAATACACTAAAACGACGTCTATTGATCTTAGCTCCTTACGCCCG



TCCTCCCAATACAAGGATCTAGGTGAGTATTACGCAGAGTTGAACCCGCTATTATACCATATTTCCTTC



CAAAGGATTGCTGAAAAGGAAATTATGGACGCTGTTGAAACTGGGAAATTGTACCTGTTTCAGATTTA



TAATAAGGACTTCGCAAAGGGTCACCATGGTAAGCCTAACCTTCACACTTTGTACTGGACCGGACTAT



TCTCGCCTGAAAATTTGGCTAAAACAAGTATCAAGTTAAACGGTCAGGCCGAGTTATTTTATAGACCC



AAATCTAGAATGAAAAGAATGGCCCATAGATTAGGCGAAAAGATGTTAAACAAGAAATTAAAGGAC



CAAAAAACCCCGATACCAGACACTCTATACCAAGAACTGTACGACTATGTGAATCACAGGCTTAGTC



ACGATTTATCAGATGAAGCGAGGGCTTTATTGCCAAATGTCATCACCAAGGAAGTATCACATGAAAT



AATTAAGGATAGAAGGTTCACATCTGATAAATTCTTTTTTCATGTCCCAATTACATTGAATTATCAAGC



AGCGAACTCACCATCTAAATTTAATCAGCGCGTCAACGCCTATTTGAAAGAACATCCCGAAACACCA



ATCATCGGCATAGATCGAGGTGAGAGAAACTTAATATATATAACTGTGATTGATTCTACAGGAAAAA



TCCTGGAGCAACGATCTTTAAATACCATACAACAGTTTGATTATCAAAAAAAGTTGGATAACAGAGA



AAAAGAACGTGTTGCCGCTAGGCAGGCTTGGTCTGTGGTAGGAACAATTAAGGACTTAAAGCAGGGC



TATCTGTCCCAAGTTATTCATGAAATAGTCGATCTGATGATACATTATCAGGCAGTTGTCGTGTTGGA



AAATTTGAATTTTGGCTTTAAATCAAAAAGAACTGGCATAGCAGAAAAAGCTGTGTACCAGCAGTTTG



AAAAGATGTTAATCGATAAGCTAAACTGCCTTGTTCTTAAAGATTACCCCGCAGAAAAAGTAGGTGGT



GTTCTTAATCCATATCAGTTGACAGACCAATTTACATCCTTTGCGAAAATGGGTACGCAAAGCGGGTT



CTTATTCTACGTACCGGCCCCCTATACTTCTAAGATCGACCCACTAACAGGTTTTGTGGACCCTTTTGT



TTGGAAGACGATAAAGAACCACGAGTCACGCAAACATTTCTTAGAGGGCTTTGATTTCTTGCACTACG



ACGTGAAAACTGGTGATTTTATCTTACACTTTAAAATGAACAGAAATCTCTCTTTCCAACGTGGACTG



CCCGGATTCATGCCGGCTTGGGACATCGTTTTTGAAAAGAATGAAACGCAGTTTGACGCCAAAGGTAC



ACCATTTATAGCGGGTAAGAGAATTGTGCCGGTCATAGAAAACCATAGATTTACAGGTAGATATAGG



GATCTGTACCCTGCTAATGAATTGATTGCATTACTCGAAGAGAAAGGAATTGTGTTTCGAGATGGATC



GAATATTTTACCTAAGTTGTTGGAAAATGATGATTCACACGCAATTGATACTATGGTTGCCCTCATAA



GATCGGTATTGCAAATGAGAAACTCAAATGCTGCTACGGGAGAGGATTATATAAACAGCCCCGTTCG



CGATCTTAATGGTGTTTGTTTTGATTCACGTTTTCAGAACCCCGAATGGCCAATGGATGCCGACGCAA



ACGGAGCATATCATATTGCTCTTAAAGGCCAACTACTATTAAATCACTTAAAGGAATCCAAAGACCTA



AAATTGCAAAACGGGATATCTAATCAGGATTGGCTGGCTTACATACAAGAACTACGTAACTAG





SEQ ID NO: 139

ATGGCCGTTAAGTCAATCAAAGTGAAACTTAGACTGGATGACATGCCAGAGATTCGTGCGGGGTTAT



GGAAACTTCATAAGGAAGTTAACGCAGGGGTAAGATATTATACCGAATGGTTATCATTACTTCGACA



AGAGAATTTGTACAGAAGGTCCCCGAACGGCGACGGTGAGCAAGAATGCGATAAGACGGCTGAAGA



ATGTAAGGCAGAACTTTTGGAGCGCCTGAGAGCCCGTCAGGTTGAAAATGGCCATAGAGGTCCTGCG



GGATCTGATGATGAGCTTTTACAGCTAGCTAGACAATTGTATGAATTGTTGGTCCCTCAGGCTATTGG



GGCTAAAGGAGACGCTCAACAAATCGCCAGAAAGTTCTTGTCACCTCTGGCTGACAAAGATGCCGTG



GGAGGATTAGGTATCGCTAAAGCAGGTAATAAACCAAGATGGGTTAGAATGAGAGAAGCAGGCGAA



CCTGGTTGGGAAGAAGAGAAAGAAAAGGCCGAAACTAGAAAAAGCGCTGACAGAACCGCAGATGTT



TTACGGGCCTTGGCTGATTTTGGACTGAAGCCTTTGATGAGAGTGTATACTGATTCAGAAATGTCTTCC



GTTGAATGGAAGCCCCTAAGGAAGGGACAAGCGGTCAGAACCTGGGATAGGGATATGTTTCAACAGG



CTATTGAAAGGATGATGTCATGGGAATCCTGGAATCAAAGAGTAGGTCAAGAATACGCTAAACTGGT



CGAACAAAAGAATAGATTTGAACAAAAAAATTTTGTAGGTCAAGAACATTTAGTACATTTGGTTAATC



AACTTCAACAAGATATGAAAGAGGCATCTCCTGGTTTGGAATCAAAAGAACAAACAGCACACTATGT



TACCGGCCGAGCTTTGCGAGGTTCTGACAAAGTATTTGAAAAGTGGGGGAAATTAGCTCCCGATGCCC



CCTTTGATCTATATGATGCTGAAATTAAAAACGTTCAAAGAAGGAACACTAGACGTTTTGGATCCCAT



GATCTTTTTGCAAAGCTAGCTGAGCCAGAATACCAGGCTCTATGGCGTGAAGACGCCTCGTTTTTGAC



TAGATACGCAGTATACAATTCAATACTCAGAAAACTAAACCATGCCAAGATGTTTGCTACATTCACCC



TGCCCGATGCTACCGCTCATCCTATTTGGACTAGATTTGACAAGTTGGGGGGGAATCTACATCAGTAC



ACATTTTTATTTAATGAATTCGGTGAAAGAAGACACGCTATTAGATTCCACAAGCTCCTAAAGGTTGA



AAACGGCGTTGCGAGAGAAGTTGATGATGTAACAGTTCCCATTTCTATGTCGGAGCAATTGGATAATC



TATTGCCTAGAGACCCTAATGAACCAATTGCTTTGTACTTTCGTGACTACGGTGCAGAACAACACTTT



ACAGGTGAATTCGGCGGAGCCAAGATTCAATGTAGACGTGATCAACTCGCACACATGCATAGAAGAA



GAGGCGCTCGTGATGTTTATTTAAATGTGTCTGTTAGAGTTCAATCCCAATCGGAGGCTAGAGGTGAA



AGAAGGCCACCATACGCAGCAGTTTTTAGGTTAGTAGGTGATAATCATAGGGCATTTGTCCACTTCGA



CAAATTAAGTGATTATTTAGCAGAGCACCCTGATGATGGAAAGTTGGGCAGTGAGGGATTATTAAGT



GGGTTGAGGGTAATGTCTGTAGATCTTGGTCTTCGTACTTCTGCGAGTATCTCTGTCTTTAGAGTAGCA



CGTAAGGATGAGTTGAAACCTAATAGCAAAGGAAGAGTCCCGTTTTTTTTTCCTATTAAGGGTAACGA



TAACCTGGTGGCCGTGCATGAAAGATCACAACTTTTGAAATTGCCAGGAGAAACGGAGTCCAAGGAC



TTGAGGGCAATTAGAGAGGAACGTCAGCGTACATTGCGACAGCTGAGAACTCAATTGGCTTATTTGA



GGTTGTTGGTTAGGTGTGGTTCCGAGGATGTTGGCAGAAGAGAAAGGTCTTGGGCCAAATTGATAGA



ACAACCAGTGGACGCCGCAAATCACATGACACCAGATTGGAGAGAAGCTTTCGAAAATGAACTCCAG



AAATTAAAGAGCCTACATGGCATATGCTCTGATAAAGAGTGGATGGATGCCGTATACGAATCCGTTC



GTAGAGTCTGGCGCCACATGGGTAAGCAAGTACGGGACTGGAGAAAGGATGTTCGTTCCGGCGAAAG



ACCGAAGATAAGGGGGTATGCAAAGGACGTTGTAGGCGGTAATTCTATTGAACAGATTGAGTATTTG



GAAAGGCAGTACAAATTTCTTAAATCCTGGAGCTTCTTCGGCAAAGTGTCAGGACAAGTCATCAGGG



CTGAAAAAGGTTCCAGATTTGCTATTACGCTAAGGGAACATATTGATCATGCGAAAGAAGATAGACT



GAAAAAACTAGCAGATAGAATAATTATGGAAGCACTTGGTTACGTCTATGCACTTGATGAAAGAGGC



AAGGGGAAATGGGTAGCTAAATACCCGCCTTGTCAACTTATTTTATTAGAAGAATTAAGCGAGTACCA



ATTTAACAACGATAGACCTCCATCCGAAAATAATCAGCTGATGCAATGGTCCCATAGGGGTGTTTTTC



AAGAATTGATAAATCAAGCTCAAGTACACGATTTGCTGGTAGGTACTATGTACGCAGCGTTTTCGAGC



CGTTTTGATGCAAGAACTGGTGCCCCAGGTATCAGATGTCGACGTGTTCCGGCCAGATGTACACAGGA



ACATAACCCTGAGCCATTTCCGTGGTGGCTTAATAAGTTTGTTGTCGAGCACACATTAGACGCATGCC



CTCTGAGAGCAGATGACCTTATACCCACTGGAGAAGGCGAAATATTTGTTAGTCCATTCTCTGCAGAA



GAAGGTGACTTTCACCAGATACATGCAGACTTAAATGCAGCACAGAATCTCCAACAAAGGTTGTGGT



CGGATTTTGATATTTCGCAAATAAGACTAAGATGCGATTGGGGAGAGGTTGATGGAGAATTGGTGCT



GATTCCAAGATTAACCGGAAAGCGAACTGCCGATTCCTATTCTAACAAGGTGTTTTACACAAATACTG



GTGTTACCTATTACGAAAGAGAAAGGGGTAAGAAGAGACGTAAAGTATTTGCTCAAGAAAAATTGTC



AGAAGAGGAGGCAGAACTGTTAGTAGAAGCAGACGAAGCCAGAGAAAAATCAGTTGTGCTTATGCG



TGACCCTTCCGGCATTATAAATCGTGGTAATTGGACACGACAAAAAGAATTTTGGTCTATGGTCAATC



AACGTATCGAAGGCTACCTAGTTAAGCAAATCAGGTCTAGGGTTCCACTACAAGATAGCGCATGTGA



AAATACGGGTGATATATAA





SEQ ID NO: 140

ATGGCTACTAGATCTTTCATTTTAAAAATTGAACCTAATGAAGAAGTGAAGAAGGGTCTCTGGAAAAC



TCACGAAGTACTTAATCATGGCATTGCCTATTATATGAATATCCTGAAGCTTATTCGTCAAGAAGCTA



TATACGAGCATCATGAGCAAGATCCTAAGAACCCTAAGAAAGTAAGCAAAGCGGAAATTCAGGCTGA



ATTGTGGGACTTCGTCTTGAAGATGCAGAAGTGTAACAGTTTTACGCACGAAGTTGATAAAGATGTGG



TGTTTAATATTTTGAGGGAGCTATATGAGGAGTTGGTGCCCTCGAGTGTCGAAAAAAAAGGAGAAGC



TAATCAGCTGTCAAATAAATTTTTATATCCTCTGGTGGATCCAAACTCTCAATCAGGTAAAGGCACTG



CCAGTAGTGGTCGAAAACCGAGATGGTATAATTTGAAAATCGCAGGTGATCCATCGTGGGAAGAAGA



AAAAAAAAAATGGGAAGAAGATAAAAAAAAAGATCCCCTTGCCAAAATACTAGGTAAGCTAGCCGA



GTATGGACTTATACCATTATTCATTCCTTTCACGGACTCTAATGAACCAATTGTGAAGGAAATCAAAT



GGATGGAAAAATCACGTAATCAGTCTGTTAGGAGGTTGGACAAAGATATGTTTATACAGGCTCTTGA



GAGGTTTTTGTCGTGGGAGTCCTGGAATTTGAAAGTGAAAGAAGAATATGAAAAAGTGGAAAAGGAG



CATAAGACGTTGGAAGAAAGGATTAAGGAAGATATTCAGGCCTTTAAGAGTCTGGAACAGTACGAAA



AAGAAAGACAGGAACAGTTATTGAGAGATACTCTAAACACTAATGAATATAGGCTTTCCAAGAGGGG



CTTGCGAGGATGGAGAGAGATAATTCAGAAATGGTTGAAAATGGATGAGAACGAGCCATCGGAGAA



ATATCTAGAGGTGTTTAAAGATTACCAAAGAAAGCACCCTCGCGAAGCTGGTGATTACTCTGTTTATG



AATTCCTTTCGAAGAAGGAAAATCACTTCATCTGGCGAAATCATCCAGAGTACCCATATTTATATGCT



ACATTTTGCGAAATTGACAAGAAAAAAAAAGATGCTAAACAGCAAGCGACATTCACCCTCGCTGATC



CCATCAACCACCCATTATGGGTCAGGTTCGAAGAGAGATCAGGCTCGAACCTGAATAAGTACAGGAT



CTTGACTGAGCAATTGCATACTGAGAAGTTAAAAAAGAAATTGACGGTCCAACTTGACAGATTGATTT



ATCCCACTGAATCTGGTGGATGGGAGGAGAAAGGTAAGGTTGATATTGTCCTATTGCCTTCTCGTCAA



TTTTACAACCAAATATTTCTGGACATCGAAGAGAAGGGTAAACATGCTTTTACCTATAAGGATGAGAG



TATTAAATTTCCATTGAAGGGAACGCTTGGCGGCGCTAGAGTTCAGTTCGATAGAGATCATTTGAGAA



GATACCCGCATAAAGTGGAATCTGGTAATGTAGGTCGGATCTACTTTAACATGACGGTAAATATTGAA



CCTACCGAGTCACCAGTCAGTAAGTCTTTAAAGATTCATAGGGATGATTTCCCTAAATTTGTCAACTTC



AAGCCTAAGGAACTAACCGAGTGGATCAAAGACAGTAAAGGCAAAAAGTTAAAGAGCGGTATTGAG



TCCCTGGAGATAGGTCTTAGAGTCATGTCTATCGATTTGGGTCAAAGACAAGCAGCCGCAGCATCTAT



TTTCGAAGTTGTTGACCAAAAACCGGATATCGAGGGGAAATTATTTTTTCCAATAAAAGGAACTGAGC



TATACGCTGTGCATCGCGCATCCTTCAATATAAAACTGCCAGGAGAAACACTAGTAAAATCTAGAGA



GGTCTTGCGTAAAGCACGTGAGGACAATCTCAAATTAATGAATCAGAAGTTAAATTTCCTTAGGAACG



TGTTGCATTTCCAACAGTTCGAGGACATAACTGAACGCGAGAAAAGAGTCACTAAGTGGATCTCAAG



ACAAGAAAATAGTGATGTGCCATTAGTGTATCAAGACGAACTTATTCAAATAAGAGAGCTAATGTAT



AAACCATATAAAGACTGGGTGGCATTCTTAAAACAATTACACAAGCGGCTTGAAGTAGAAATAGGAA



AAGAAGTAAAGCATTGGAGGAAGAGTCTGTCCGATGGTCGCAAAGGCCTGTACGGGATATCACTTAA



AAATATTGATGAAATTGACAGAACACGAAAATTTTTGTTAAGATGGTCATTGAGACCAACCGAACCA



GGTGAGGTTAGAAGGTTGGAACCAGGCCAAAGGTTTGCCATCGATCAATTAAACCATCTTAACGCAC



TGAAAGAAGATAGATTGAAGAAGATGGCGAACACTATTATTATGCACGCTCTAGGTTATTGCTATGAT



GTGAGAAAGAAAAAATGGCAAGCCAAGAACCCTGCATGCCAAATTATTTTGTTTGAAGATCTTTCTAA



TTACAATCCATACGAAGAGCGTTCACGTTTTGAAAACTCTAAATTGATGAAATGGTCTAGAAGAGAG



ATTCCGAGACAGGTCGCTCTACAAGGGGAGATTTACGGTCTTCAAGTCGGTGAGGTTGGTGCTCAATT



TTCTTCCAGATTTCATGCAAAAACTGGGTCTCCAGGCATTAGGTGTTCGGTCGTTACTAAGGAAAAGT



TACAGGACAACCGTTTCTTCAAAAATTTGCAACGTGAAGGCCGTTTAACACTTGATAAGATAGCTGTC



CTTAAGGAAGGCGATCTGTACCCAGATAAAGGTGGTGAGAAATTCATATCTTTGAGTAAAGACAGGA



AACTGGTTACAACACACGCCGACATTAACGCAGCTCAGAACTTGCAAAAGAGATTCTGGACAAGGAC



CCACGGCTTCTATAAGGTGTACTGTAAAGCTTATCAAGTAGATGGACAAACGGTTTATATTCCTGAAT



CAAAGGACCAGAAACAAAAAATTATAGAAGAATTTGGTGAAGGATACTTTATCTTGAAGGATGGAGT



TTATGAGTGGGGCAATGCAGGTAAGTTAAAGATAAAGAAAGGTTCATCAAAGCAATCAAGTAGCGAA



CTGGTCGATTCGGATATTTTAAAGGATAGCTTTGATCTAGCTAGTGAATTGAAGGGAGAAAAGTTAAT



GTTATACAGAGATCCCAGTGGGAATGTATTTCCATCTGATAAGTGGATGGCCGCCGGAGTGTTTTTTG



GCAAATTAGAGAGAATCTTGATTTCTAAACTGACCAATCAATACTCAATTTCGACCATCGAAGACGAC



TCTTCAAAACAATCCATGTGA





SEQ ID NO: 141

ATGCCTACTCGCACCATCAATCTGAAGTTAGTTTTGGGGAAGAACCCAGAAAATGCGACTCTAAGAC



GGGCACTATTCTCTACACATAGACTTGTCAACCAAGCGACTAAGAGAATTGAAGAATTTTTACTGTTG



TGTAGAGGAGAAGCTTATCGTACCGTAGATAATGAAGGTAAAGAAGCTGAGATCCCACGCCATGCTG



TTCAAGAAGAGGCGCTTGCTTTTGCAAAAGCTGCACAACGACATAACGGCTGTATCTCCACATATGAG



GACCAGGAAATCTTGGATGTGCTTAGACAATTGTATGAAAGATTAGTACCTAGCGTCAATGAAAACA



ACGAGGCTGGGGATGCCCAAGCCGCTAACGCTTGGGTGAGTCCATTAATGAGTGCAGAGTCCGAAGG



TGGACTATCGGTCTATGATAAAGTGTTAGACCCGCCGCCAGTATGGATGAAACTCAAAGAAGAGAAA



GCGCCTGGTTGGGAAGCTGCTTCTCAGATTTGGATACAGTCCGACGAAGGTCAATCGCTGCTAAATAA



ACCGGGTAGCCCACCACGTTGGATTAGAAAACTTAGATCTGGTCAACCGTGGCAAGATGACTTCGTTT



CAGACCAAAAAAAAAAGCAAGATGAACTAACGAAAGGTAACGCACCACTCATAAAACAATTGAAAG



AGATGGGCCTCTTGCCTTTAGTTAATCCCTTTTTTAGACATTTGTTGGATCCCGAGGGTAAGGGTGTAT



CCCCATGGGACAGATTGGCCGTAAGGGCCGCGGTGGCGCACTTCATCTCTTGGGAAAGTTGGAACCA



CAGAACAAGAGCTGAGTATAACAGTTTGAAACTGCGAAGAGATGAATTTGAGGCCGCATCTGATGAA



TTCAAGGACGATTTTACATTGCTACGACAATATGAGGCTAAGCGACATAGTACGCTTAAGTCAATTGC



CTTAGCTGATGACTCTAACCCGTACCGAATTGGTGTAAGGTCCTTGAGAGCCTGGAATAGGGTTAGAG



AAGAATGGATTGACAAAGGCGCAACCGAGGAACAAAGGGTTACCATCCTTAGTAAGCTTCAAACACA



ATTACGGGGTAAATTCGGTGATCCAGACCTATTTAATTGGCTAGCCCAAGATAGACACGTACACCTGT



GGTCCCCGAGAGATTCCGTCACGCCCCTCGTAAGGATTAATGCCGTCGACAAAGTGCTTAGAAGACGT



AAGCCTTATGCACTGATGACTTTTGCACATCCGAGATTCCATCCAAGATGGATTCTATACGAAGCGCC



TGGTGGTTCTAACTTGCGACAATACGCTTTAGATTGTACTGAAAATGCTCTGCATATTACACTTCCATT



ACTCGTCGACGACGCCCATGGTACATGGATTGAGAAAAAAATCCGCGTACCACTCGCTCCTAGTGGA



CAAATACAAGATTTAACTTTAGAAAAACTTGAAAAGAAAAAAAACAGATTATACTATAGATCAGGAT



TCCAACAATTTGCTGGATTAGCCGGTGGTGCTGAGGTGTTGTTTCATAGGCCGTATATGGAACATGAT



GAGAGATCAGAAGAATCTCTGTTGGAAAGGCCAGGCGCTGTGTGGTTCAAATTAACCTTAGATGTTGC



TACCCAAGCACCACCTAACTGGTTAGATGGTAAAGGCAGAGTTAGGACACCTCCAGAAGTTCATCATT



TCAAAACCGCTCTGTCAAATAAATCTAAACATACGAGAACCTTGCAACCAGGATTGAGAGTCCTTTCT



GTTGATTTGGGTATGAGAACATTTGCTTCTTGTTCTGTTTTCGAATTGATCGAAGGTAAACCTGAAACA



GGTAGAGCATTCCCTGTTGCTGACGAAAGATCAATGGATAGTCCAAATAAGTTATGGGCCAAGCACG



AGAGAAGCTTTAAACTAACTCTGCCTGGAGAAACACCGAGCAGAAAGGAGGAAGAAGAGAGAAGCA



TTGCTAGGGCAGAGATTTACGCGCTGAAAAGAGATATTCAAAGACTGAAATCACTCCTAAGATTAGG



TGAGGAAGATAATGATAATAGAAGAGATGCTTTGTTAGAGCAATTCTTTAAAGGATGGGGTGAAGAG



GACGTAGTTCCTGGTCAAGCTTTCCCTAGAAGCCTCTTTCAGGGATTAGGCGCTGCACCCTTTAGGTC



AACACCCGAATTGTGGAGACAGCACTGTCAGACGTATTACGACAAAGCGGAAGCTTGCCTGGCAAAG



CATATTTCCGACTGGAGGAAGAGAACTAGACCTCGTCCGACTTCGAGAGAGATGTGGTATAAGACAA



GATCTTACCATGGTGGCAAAAGTATTTGGATGCTAGAATACTTAGATGCTGTCCGCAAATTACTACTT



TCATGGTCGTTAAGAGGTCGTACTTACGGAGCTATTAATAGACAAGACACCGCTCGTTTTGGTTCCTT



AGCTTCTAGATTGTTGCATCATATCAACTCTTTAAAGGAAGACCGCATCAAAACCGGTGCAGATAGTA



TTGTGCAGGCCGCAAGGGGCTATATTCCTCTCCCACATGGCAAGGGTTGGGAACAGCGTTATGAACCC



TGTCAGTTGATATTATTTGAAGATCTAGCTAGGTACAGATTTCGTGTAGACAGACCTCGGAGAGAGAA



TTCGCAATTGATGCAGTGGAATCATCGAGCTATAGTAGCAGAAACGACGATGCAAGCTGAACTATAC



GGTCAAATAGTCGAAAATACCGCTGCTGGTTTCTCCTCAAGATTTCATGCTGCAACTGGTGCTCCTGG



TGTCAGATGTCGCTTTTTGTTAGAACGAGATTTCGATAATGACCTACCAAAGCCGTACTTACTGAGAG



AACTAAGTTGGATGTTAGGTAACACAAAGGTTGAATCAGAGGAAGAAAAATTGCGTCTTCTAAGCGA



GAAAATTAGACCAGGTTCATTAGTCCCTTGGGATGGGGGTGAACAATTCGCGACATTACACCCGAAA



AGACAAACTCTTTGTGTCATTCACGCAGATATGAACGCTGCTCAAAACCTGCAACGCAGATTTTTCGG



AAGGTGTGGGGAAGCCTTTCGCCTTGTGTGTCAGCCACATGGTGATGATGTTTTGAGGCTAGCGTCTA



CACCAGGTGCAAGACTTTTGGGTGCATTACAACAACTGGAAAATGGTCAGGGAGCTTTCGAATTAGTT



CGTGATATGGGTAGCACATCACAAATGAATCGTTTCGTCATGAAGTCGTTGGGCAAAAAAAAGATCA



AGCCATTACAAGACAATAACGGGGATGATGAACTAGAAGACGTGCTATCTGTTTTACCTGAAGAAGA



TGATACCGGACGAATTACTGTATTTCGGGACTCTTCGGGTATATTCTTCCCTTGTAACGTTTGGATCCC



GGCAAAACAGTTCTGGCCTGCGGTCCGTGCTATGATTTGGAAGGTTATGGCATCACATTCATTGGGTT



AG





SEQ ID NO: 142

ATGACAAAGTTAAGGCATAGACAGAAGAAGTTAACTCACGATTGGGCGGGGTCTAAAAAGAGAGAA



GTTCTAGGGAGCAATGGTAAATTACAGAATCCATTGCTAATGCCCGTCAAAAAAGGTCAGGTGACAG



AATTTCGAAAAGCATTTTCCGCATACGCCCGAGCAACCAAAGGGGAAATGACGGATGGCAGAAAAAA



TATGTTTACTCACTCATTTGAACCATTCAAGACCAAGCCTTCGTTACATCAGTGCGAACTGGCTGACA



AAGCCTACCAGAGCTTGCATTCATATTTACCGGGTTCTTTGGCGCATTTTCTTTTATCTGCCCATGCAC



TTGGTTTTAGGATTTTTAGCAAATCAGGGGAAGCCACTGCATTCCAAGCGTCCTCAAAGATTGAAGCT



TACGAAAGCAAGTTAGCTAGCGAGCTTGCTTGTGTTGATTTGTCTATTCAGAACTTGACTATTTCAACT



TTGTTCAACGCATTAACGACTTCCGTAAGAGGTAAAGGTGAGGAGACATCGGCAGATCCACTGATAG



CTAGATTTTACACCTTACTTACCGGTAAACCACTAAGCAGAGACACTCAGGGCCCAGAACGAGATTTA



GCCGAGGTGATAAGCAGAAAAATTGCAAGTTCTTTTGGAACTTGGAAGGAGATGACTGCCAATCCAC



TTCAATCTCTTCAATTTTTTGAAGAGGAGTTGCATGCGCTAGATGCAAATGTTAGTTTGTCACCTGCCT



TCGATGTTCTGATTAAGATGAACGACCTGCAGGGTGACTTGAAGAACAGAACGATAGTTTTTGATCCA



GATGCTCCTGTGTTTGAATATAATGCTGAGGATCCTGCTGACATCATCATTAAACTGACAGCTAGATA



TGCGAAAGAAGCAGTGATTAAAAATCAAAATGTCGGGAATTATGTTAAGAACGCTATTACGACAACT



AACGCAAACGGACTAGGTTGGTTGCTGAACAAAGGCCTTTCCTTATTGCCTGTCTCCACTGATGACGA



ACTATTGGAGTTTATTGGGGTCGAGAGATCCCATCCTAGCTGTCATGCGTTGATAGAACTTATCGCTC



AGTTAGAAGCACCTGAACTGTTCGAAAAAAATGTTTTTTCTGATACTCGTTCCGAGGTTCAAGGTATG



ATAGATTCAGCTGTAAGCAATCATATCGCCAGGCTGTCAAGCTCTCGTAATTCATTGAGCATGGACTC



AGAGGAACTTGAGAGATTGATAAAATCTTTTCAAATTCATACACCACATTGTTCATTATTTATAGGGG



CTCAATCCTTATCTCAACAATTGGAAAGCCTACCCGAAGCATTGCAGTCAGGAGTGAACAGTGCTGAT



ATTCTGCTCGGCTCAACCCAATACATGTTGACAAATTCTTTGGTCGAGGAGTCAATCGCTACGTATCA



GAGAACCTTAAATAGAATTAACTACCTGTCCGGCGTTGCAGGACAGATTAACGGTGCTATTAAGAGG



AAAGCTATTGATGGTGAGAAGATACATTTACCCGCTGCTTGGTCAGAGTTAATTTCTTTACCCTTTATT



GGGCAACCAGTGATTGATGTTGAATCAGATTTAGCCCACTTAAAGAACCAATACCAGACATTGTCTAA



CGAATTTGATACGCTGATTTCCGCACTGCAAAAGAATTTCGACTTAAATTTTAATAAAGCCTTGCTTA



ATCGAACACAACATTTCGAGGCTATGTGTAGATCAACAAAAAAGAATGCCCTTTCTAAGCCTGAGATC



GTTAGTTATAGAGATTTGCTAGCCAGGTTGACTTCTTGTCTTTATAGGGGCTCTCTAGTCTTGAGGAGG



GCGGGTATAGAAGTACTGAAAAAGCACAAGATATTTGAGTCCAACTCTGAATTAAGAGAGCACGTTC



ATGAAAGAAAACACTTCGTATTTGTTTCTCCGCTCGATAGAAAAGCCAAGAAGCTCCTACGTTTGACT



GACTCTAGGCCTGATTTATTGCACGTAATTGATGAAATACTACAACATGATAATTTAGAGAACAAGGA



TAGAGAATCTTTGTGGTTAGTTCGATCTGGTTATTTACTGGCCGGCCTACCAGACCAACTCTCCTCTTC



CTTTATAAATCTTCCAATCATTACTCAAAAAGGCGATCGTCGCTTGATAGATCTCATTCAATACGACC



AAATTAATAGAGATGCTTTTGTGATGTTGGTAACTTCCGCTTTTAAGTCGAACTTAAGTGGGCTGCAG



TACAGAGCAAACAAACAATCTTTTGTGGTTACGCGCACTTTGTCACCATATTTGGGATCTAAATTGGT



TTATGTGCCCAAAGATAAAGATTGGCTGGTCCCTTCCCAAATGTTCGAGGGGAGATTTGCGGACATTT



TGCAATCCGATTATATGGTGTGGAAGGACGCTGGAAGATTGTGTGTTATTGACACAGCTAAGCATTTG



TCTAACATTAAAAAATCTGTATTCTCAAGTGAAGAAGTCCTCGCGTTTTTAAGAGAATTGCCACACCG



TACGTTTATCCAAACTGAGGTCAGGGGTTTAGGGGTGAATGTGGACGGTATTGCATTTAATAACGGGG



ATATACCCTCTCTGAAGACGTTTAGCAATTGCGTGCAAGTCAAAGTGAGTCGGACAAACACTAGTCTG



GTCCAAACATTAAATAGATGGTTTGAAGGCGGTAAGGTCTCGCCGCCTAGCATCCAATTTGAGAGAG



CATATTACAAAAAAGATGATCAAATCCACGAGGACGCTGCAAAAAGGAAGATAAGGTTTCAAATGCC



AGCTACAGAGTTGGTACACGCGTCAGACGACGCAGGATGGACCCCCTCCTATTTACTTGGTATCGATC



CCGGTGAATATGGTATGGGTTTGTCATTGGTCTCAATAAATAATGGCGAAGTTTTAGATAGCGGATTT



ATACACATAAATTCATTGATAAATTTCGCTTCTAAGAAATCAAATCATCAAACCAAAGTTGTTCCGAG



GCAGCAATACAAGTCACCATACGCCAACTATCTAGAACAATCTAAAGATTCTGCAGCAGGAGACATA



GCTCATATTTTGGATAGACTTATCTACAAGTTGAACGCCCTACCCGTTTTCGAAGCTCTATCTGGCAAT



AGTCAAAGCGCAGCGGATCAGGTTTGGACAAAAGTCCTCAGCTTCTACACCTGGGGAGATAATGATG



CACAAAATTCAATTCGTAAGCAACATTGGTTCGGTGCTTCACACTGGGACATTAAAGGCATGTTGAGG



CAACCGCCAACAGAAAAAAAGCCCAAACCATACATTGCCTTTCCCGGTTCACAAGTTTCTTCTTATGG



TAATTCTCAAAGGTGTTCATGTTGTGGACGTAACCCAATTGAACAATTGCGCGAAATGGCGAAGGAC



ACATCCATTAAGGAGTTGAAGATTAGAAATTCAGAAATTCAATTGTTCGACGGTACTATAAAGTTATT



TAATCCAGACCCGTCAACGGTCATAGAAAGAAGAAGACATAATTTAGGGCCATCAAGAATTCCTGTA



GCTGATAGAACTTTCAAAAATATAAGTCCAAGCTCACTAGAATTCAAAGAACTAATAACGATTGTGTC



ACGGTCTATACGTCATTCCCCAGAATTTATTGCTAAAAAAAGAGGTATAGGTAGTGAGTACTTTTGTG



CTTATAGTGATTGTAATTCCTCCTTAAATTCAGAAGCAAATGCGGCTGCGAACGTTGCCCAAAAGTTC



CAAAAGCAATTGTTTTTCGAATTATAG





SEQ ID NO: 143
ATGAAAAGAATCTTGAACTCTTTAAAGGTTGCCGCCCTGCGTTTGTTATTTAGAGGTAAAGGATCTGA



ACTTGTCAAGACTGTTAAATACCCTTTGGTCTCGCCGGTTCAGGGTGCAGTTGAGGAGTTAGCTGAGG



CGATCCGCCATGATAACCTACATCTGTTTGGTCAAAAAGAAATTGTTGACCTTATGGAAAAGGATGAA



GGTACGCAAGTTTACTCAGTGGTTGATTTCTGGTTAGATACCCTTCGTTTGGGGATGTTTTTCAGTCCA



TCAGCAAACGCATTAAAAATCACGCTGGGTAAGTTTAATTCTGATCAGGTTAGCCCTTTTAGGAAAGT



GTTAGAGCAGTCTCCATTCTTCTTGGCTGGTAGGCTGAAGGTTGAACCGGCAGAACGTATATTATCTG



TCGAGATCCGTAAGATTGGGAAGAGGGAAAACAGAGTTGAGAACTATGCTGCTGACGTAGAAACGTG



TTTTATAGGCCAATTAAGTTCAGATGAGAAACAGTCAATACAAAAATTAGCTAATGATATCTGGGATA



GTAAAGATCATGAAGAGCAAAGAATGTTAAAGGCAGATTTCTTCGCTATCCCTTTGATTAAGGATCCA



AAGGCTGTGACCGAAGAGGATCCTGAAAATGAAACTGCTGGTAAACAAAAACCCTTGGAGTTGTGTG



TCTGCCTTGTCCCAGAACTTTACACAAGAGGATTCGGGTCAATAGCCGATTTTTTGGTTCAACGCTTAA



CTCTTTTAAGGGATAAAATGTCTACAGATACTGCAGAAGATTGTTTAGAATATGTCGGGATTGAGGAG



GAAAAAGGTAACGGCATGAACTCATTGTTGGGAACGTTCTTAAAGAATTTGCAAGGCGATGGATTTG



AGCAGATTTTCCAATTTATGTTAGGGAGCTATGTCGGTTGGCAAGGGAAGGAAGATGTTTTAAGAGA



GAGATTAGACTTATTGGCTGAAAAAGTGAAGAGGTTACCGAAACCAAAATTTGCTGGCGAATGGTCT



GGTCATAGGATGTTCTTGCATGGCCAATTGAAGTCTTGGTCTTCAAATTTTTTTAGACTATTTAACGAG



ACAAGGGAACTTCTAGAGTCTATTAAGTCAGATATACAGCATGCCACAATGCTAATATCATATGTAGA



AGAAAAAGGTGGTTATCATCCTCAATTACTTAGTCAATATAGAAAACTTATGGAACAACTACCAGCTT



TGCGTACCAAGGTATTGGACCCTGAGATTGAAATGACACATATGTCCGAAGCAGTTCGCTCTTATATA



ATGATACATAAATCTGTTGCGGGTTTTTTACCGGATTTATTAGAATCATTAGATAGAGACAAGGATCG



TGAGTTTCTGCTTAGTATTTTTCCAAGAATCCCAAAAATTGATAAAAAAACCAAGGAAATTGTAGCTT



GGGAACTGCCGGGAGAACCAGAAGAAGGTTATTTATTTACTGCTAATAACTTGTTCAGAAACTTCTTA



GAGAATCCGAAACATGTCCCGAGATTTATGGCCGAAAGGATCCCAGAAGATTGGACTCGATTACGCT



CTGCTCCTGTCTGGTTCGATGGAATGGTAAAACAATGGCAAAAAGTCGTTAACCAGTTAGTAGAATCA



CCAGGTGCTTTATATCAATTTAACGAATCCTTCTTGAGACAAAGGTTACAGGCCATGTTAACTGTGTA



TAAGAGGGACTTACAAACTGAAAAATTTCTTAAACTTTTGGCGGATGTTTGTAGGCCTCTTGTAGATT



TTTTTGGTTTGGGTGGAAATGATATTATTTTTAAGAGCTGTCAAGACCCAAGAAAACAATGGCAAACC



GTTATTCCTCTCTCTGTTCCGGCAGATGTCTATACTGCTTGCGAAGGTTTGGCGATTAGACTAAGGGAG



ACATTAGGATTCGAATGGAAGAATTTGAAAGGTCACGAGAGAGAAGATTTCTTAAGATTGCACCAGT



TATTGGGCAATTTACTTTTCTGGATTCGTGATGCTAAATTGGTAGTAAAATTAGAGGATTGGATGAAC



AACCCATGTGTTCAGGAATATGTAGAAGCCCGGAAAGCTATCGATCTTCCACTAGAAATATTCGGTTT



TGAAGTGCCTATCTTCCTGAATGGCTATCTATTTTCGGAGTTGAGACAATTAGAACTTTTGCTTAGGAG



AAAAAGTGTGATGACTAGCTACAGTGTAAAGACTACTGGATCTCCTAATAGGCTATTTCAGCTAGTTT



ATTTACCTCTAAACCCTAGTGACCCCGAAAAGAAGAACTCAAATAACTTTCAAGAACGTTTGGATACC



CCAACTGGTTTGTCCCGTCGTTTCCTAGACCTAACCCTTGATGCATTCGCAGGTAAGTTACTTACCGAT



CCAGTTACACAAGAATTGAAGACAATGGCAGGTTTTTACGATCATCTTTTTGGATTCAAATTGCCATG



TAAACTCGCCGCCATGTCGAATCATCCAGGTTCTTCTTCAAAGATGGTTGTGTTAGCGAAACCCAAAA



AAGGTGTTGCTTCTAATATAGGGTTTGAACCGATCCCAGATCCCGCTCATCCCGTATTTAGGGTTAGA



TCCAGTTGGCCAGAGTTGAAGTACCTCGAGGGGCTATTGTATTTGCCAGAAGACACACCTTTGACCAT



CGAATTAGCAGAGACCTCCGTATCGTGCCAAAGTGTCTCGTCAGTTGCATTCGATTTGAAAAACTTGA



CAACGATCTTAGGTCGTGTGGGAGAATTTAGGGTCACAGCTGATCAACCCTTTAAACTAACGCCTATA



ATCCCGGAGAAAGAAGAATCTTTTATTGGTAAAACTTATTTGGGTCTCGACGCGGGTGAAAGGAGCG



GCGTCGGTTTCGCTATTGTTACAGTGGACGGAGATGGGTACGAAGTGCAAAGATTGGGGGTCCACGA



GGATACACAGCTTATGGCCTTGCAGCAAGTTGCTAGTAAATCCTTAAAAGAGCCAGTATTTCAGCCTC



TAAGAAAAGGCACCTTTAGACAACAAGAAAGAATACGGAAATCCTTACGTGGTTGCTACTGGAATTT



TTATCATGCCTTGATGATAAAATATAGGGCCAAAGTAGTACATGAGGAATCTGTCGGAAGTAGTGGTC



TTGTGGGTCAATGGTTGAGGGCTTTTCAGAAGGATTTGAAGAAAGCCGATGTTCTCCCCAAGAAGGGC



GGTAAAAACGGTGTAGATAAGAAGAAGAGAGAGTCCTCAGCTCAAGACACTCTTTGGGGTGGTGCTT



TCTCTAAAAAGGAGGAGCAACAGATTGCGTTTGAGGTGCAAGCTGCAGGTTCTTCGCAATTTTGTTTG



AAGTGCGGATGGTGGTTCCAACTAGGCATGCGTGAAGTAAACAGGGTACAAGAATCGGGCGTCGTGT



TAGATTGGAATAGAAGCATAGTTACCTTTTTAATAGAATCATCCGGCGAAAAAGTTTATGGTTTCTCC



CCACAGCAATTAGAGAAGGGTTTCAGACCAGACATCGAAACTTTTAAAAAGATGGTAAGAGACTTTA



TGAGACCTCCTATGTTTGATAGAAAAGGCAGACCGGCCGCAGCTTACGAGAGATTTGTTTTAGGAAG



GAGACATCGAAGGTACAGGTTTGATAAAGTATTTGAGGAAAGATTTGGGAGGTCTGCTCTTTTCATTT



GTCCTAGAGTAGGTTGTGGAAATTTTGACCACAGCTCCGAACAGTCCGCGGTTGTTTTGGCCTTGATC



GGATATATTGCCGATAAGGAGGGAATGTCAGGTAAGAAGTTGGTTTATGTACGGCTGGCCGAACTTA



TGGCCGAATGGAAACTAAAAAAATTAGAAAGATCCAGAGTTGAAGAACAATCATCCGCTCAATAA





SEQ ID NO: 144
ATGGCAGAAAGCAAACAAATGCAGTGTAGGAAATGTGGAGCTAGTATGAAGTACGAAGTCATCGGTT



TGGGTAAAAAGTCATGTAGATACATGTGTCCCGATTGTGGCAACCATACCTCGGCAAGAAAGATACA



AAACAAAAAAAAAAGAGATAAAAAATATGGGTCAGCCAGTAAAGCCCAATCTCAAAGAATTGCTGT



AGCAGGTGCTCTTTACCCTGACAAAAAAGTACAAACTATCAAAACCTATAAATATCCAGCAGACTTG



AATGGTGAGGTGCATGATAGCGGTGTTGCCGAGAAAATCGCACAAGCAATACAAGAGGACGAGATTG



GACTTTTGGGACCAAGCTCAGAATATGCATGCTGGATTGCATCTCAAAAACAGTCTGAGCCTTACAGT



GTAGTCGATTTCTGGTTTGATGCAGTGTGCGCAGGGGGAGTCTTCGCCTACTCTGGCGCTAGATTATT



GAGTACAGTTTTACAGTTATCCGGTGAGGAATCGGTGCTTAGAGCTGCCTTAGCCTCGTCTCCATTCGT



TGACGATATAAACTTAGCGCAAGCCGAAAAGTTTTTGGCGGTTAGCAGGCGTACAGGTCAAGATAAG



TTAGGTAAGAGAATTGGGGAGTGCTTTGCAGAAGGAAGATTGGAAGCTTTAGGGATAAAAGATAGAA



TGAGGGAATTTGTTCAAGCTATCGATGTTGCACAGACCGCCGGACAACGTTTCGCTGCCAAATTGAAG



ATATTCGGTATAAGTCAGATGCCAGAAGCTAAGCAATGGAATAACGATTCCGGACTGACTGTCTGTAT



ACTACCTGATTATTATGTTCCCGAAGAGAATCGCGCGGACCAACTTGTAGTGTTGTTAAGAAGACTTC



GCGAGATTGCATATTGCATGGGTATTGAAGATGAAGCGGGTTTCGAACATCTTGGAATAGATCCTGGT



GCTCTTTCGAATTTTTCAAACGGTAACCCTAAGAGAGGATTTCTAGGGAGGCTGTTAAATAACGATAT



TATTGCGTTGGCAAACAATATGAGTGCGATGACTCCATATTGGGAAGGGCGTAAGGGTGAACTCATA



GAAAGGCTTGCGTGGTTAAAGCACAGGGCAGAAGGGCTGTATCTTAAAGAACCTCATTTCGGTAACT



CCTGGGCCGATCATAGGTCACGAATTTTCTCAAGGATCGCAGGCTGGTTATCTGGTTGCGCTGGCAAG



TTGAAAATTGCGAAAGACCAAATTTCTGGAGTACGTACAGATCTATTTCTGCTAAAAAGACTGCTGGA



CGCAGTTCCGCAATCGGCGCCATCCCCCGATTTTATTGCGTCAATTTCGGCACTTGACAGGTTTTTAGA



AGCTGCAGAATCGAGCCAGGACCCTGCTGAACAAGTGAGGGCTCTCTACGCTTTTCACTTGAACGCAC



CTGCAGTCCGAAGTATAGCCAATAAAGCAGTGCAAAGGTCCGACAGCCAAGAATGGCTGATAAAAGA



ACTAGACGCTGTTGACCATTTAGAATTTAACAAAGCGTTCCCATTTTTCTCTGACACAGGAAAAAAAA



AAAAAAAAGGTGCTAATAGCAACGGTGCTCCATCGGAAGAAGAGTACACTGAAACGGAATCAATAC



AACAACCTGAGGACGCGGAACAGGAAGTAAACGGACAAGAAGGGAACGGAGCGTCTAAAAATCAAA



AGAAATTTCAAAGAATACCTAGATTCTTCGGTGAAGGCTCCAGATCTGAATACAGAATTTTAACGGAA



GCTCCACAGTATTTCGATATGTTTTGTAATAACATGAGGGCTATATTTATGCAGTTAGAAAGTCAACC



CCGTAAAGCTCCCAGAGATTTTAAATGTTTCCTACAAAATCGATTACAAAAATTATACAAACAGACTT



TCTTGAATGCACGAAGCAACAAGTGTCGCGCTCTGCTTGAGTCAGTTTTAATCTCTTGGGGAGAATTT



TATACATACGGTGCCAACGAAAAGAAATTTAGATTAAGACATGAAGCTTCAGAACGCAGCAGTGACC



CAGATTACGTAGTTCAGCAAGCCTTGGAAATCGCGCGTCGTCTATTCCTTTTTGGCTTCGAATGGAGA



GATTGCTCCGCTGGTGAAAGAGTGGATTTGGTTGAAATTCACAAAAAGGCTATCAGTTTTTTGTTGGC



TATTACTCAAGCTGAGGTCTCTGTTGGTTCATACAATTGGCTTGGCAACTCAACAGTATCGAGATATTT



ATCCGTTGCGGGAACTGATACCTTATACGGTACCCAATTGGAAGAATTCCTGAACGCTACAGTGTTGA



GTCAAATGCGTGGTCTGGCCATTAGATTGAGTTCTCAAGAACTTAAGGACGGTTTTGATGTGCAGCTC



GAGTCTTCCTGCCAGGACAATCTGCAACACCTATTGGTGTATAGGGCTTCGAGAGATTTGGCGGCTTG



CAAGCGCGCTACTTGTCCAGCCGAACTCGATCCTAAGATTTTAGTTTTACCGGTAGGTGCATTCATCG



CTTCCGTAATGAAAATGATAGAAAGAGGTGACGAACCTTTAGCTGGTGCTTATTTACGGCATAGGCCA



CACTCTTTCGGATGGCAAATTAGGGTCCGCGGTGTTGCTGAGGTAGGGATGGATCAGGGTACAGCATT



GGCCTTTCAAAAGCCAACAGAGTCAGAACCTTTTAAAATTAAGCCCTTCTCTGCACAGTATGGACCAG



TTCTGTGGTTGAACAGTAGTAGTTATTCTCAATCACAATATTTGGACGGTTTTCTATCTCAACCAAAAA



ATTGGAGTATGAGGGTGTTGCCTCAGGCGGGTTCAGTTCGCGTCGAACAACGAGTTGCTTTGATATGG



AACTTACAAGCAGGCAAGATGAGACTAGAACGCTCCGGTGCGAGGGCCTTTTTCATGCCTGTACCGTT



TTCATTTAGGCCATCCGGCAGTGGGGACGAAGCAGTTTTGGCGCCCAACCGGTACTTGGGTCTGTTCC



CTCATTCCGGAGGTATAGAATACGCTGTAGTGGATGTCCTGGATTCTGCTGGATTTAAAATTCTTGAA



AGAGGCACTATTGCTGTCAATGGTTTCTCTCAGAAAAGGGGAGAGCGCCAAGAAGAAGCCCATCGTG



AAAAACAAAGAAGGGGGATAAGTGATATAGGGCGAAAGAAGCCTGTGCAGGCAGAAGTCGATGCGG



CGAACGAATTGCATAGAAAGTACACTGATGTTGCCACAAGATTAGGTTGTAGAATCGTCGTTCAATGG



GCACCACAACCTAAACCAGGGACAGCACCGACAGCGCAAACTGTTTACGCGAGGGCTGTTAGGACAG



AAGCTCCGAGGAGCGGCAACCAAGAAGATCATGCAAGAATGAAAAGTTCTTGGGGTTACACCTGGGG



TACGTATTGGGAGAAACGAAAACCAGAAGATATTTTAGGGATTTCTACACAGGTGTATTGGACAGGA



GGTATAGGCGAATCCTGTCCTGCTGTAGCAGTCGCTTTATTAGGTCATATTAGAGCAACTTCAACACA



AACGGAGTGGGAAAAGGAAGAAGTTGTCTTTGGAAGACTGAAGAAGTTCTTTCCGAGTTAA





SEQ ID NO: 145
ATGGAGAAGAGAATTAATAAGATACGGAAAAAATTATCTGCGGATAATGCAACAAAGCCAGTCTCTC



GTTCAGGCCCCATGAAAACCCTGCTTGTAAGAGTAATGACGGATGATTTAAAAAAGAGGTTGGAAAA



GCGTAGAAAAAAACCAGAAGTGATGCCGCAAGTGATCTCAAATAACGCAGCTAATAATCTAAGGATG



CTACTTGATGATTATACAAAAATGAAAGAAGCAATCCTGCAAGTTTACTGGCAGGAATTCAAGGATG



ACCATGTTGGACTAATGTGCAAATTCGCACAACCAGCGTCTAAGAAAATTGACCAAAATAAATTGAA



ACCCGAAATGGACGAAAAAGGGAATTTAACAACTGCCGGGTTTGCCTGCTCGCAATGTGGGCAACCA



TTATTTGTTTATAAATTAGAGCAGGTTTCGGAAAAAGGAAAGGCTTACACAAATTACTTCGGCAGATG



TAATGTTGCCGAACACGAAAAACTCATATTGTTAGCTCAGTTGAAGCCTGAGAAAGACTCTGATGAG



GCCGTTACTTACTCGTTGGGGAAGTTTGGTCAAAGAGCTCTCGATTTTTATTCTATTCATGTGACAAAG



GAGTCCACACATCCCGTCAAGCCCTTGGCACAAATTGCGGGTAATAGATACGCTTCGGGTCCAGTTGG



GAAGGCCCTTTCTGATGCATGTATGGGCACAATTGCTAGCTTTCTTAGTAAATACCAGGATATCATAA



TAGAGCATCAAAAAGTTGTAAAGGGTAACCAAAAGAGATTAGAATCGCTGCGTGAGTTGGCGGGTAA



AGAAAACTTGGAATATCCATCTGTCACTCTGCCTCCTCAACCTCATACTAAGGAAGGTGTAGATGCGT



ACAATGAAGTTATCGCTAGAGTCCGTATGTGGGTGAATTTAAATTTGTGGCAAAAATTGAAGTTATCG



CGTGATGATGCAAAACCTCTTCTTAGACTAAAGGGCTTTCCTAGCTTCCCTGTAGTGGAAAGACGCGA



AAATGAAGTCGATTGGTGGAATACAATTAACGAAGTCAAAAAACTGATCGATGCAAAGCGAGATATG



GGTCGAGTTTTTTGGTCTGGTGTTACAGCTGAAAAAAGGAATACGATCTTAGAAGGTTACAACTACTT



GCCAAATGAGAACGATCATAAAAAAAGAGAAGGCAGTTTAGAAAATCCAAAAAAGCCAGCTAAGAG



ACAATTTGGTGATTTGCTACTTTACCTAGAAAAAAAGTACGCCGGAGATTGGGGGAAAGTCTTTGACG



AAGCTTGGGAGAGAATAGATAAAAAAATAGCAGGATTGACGTCACACATTGAAAGAGAAGAGGCGA



GAAATGCAGAAGATGCTCAGTCCAAAGCTGTCCTCACCGACTGGTTGAGAGCCAAAGCGTCCTTTGTT



CTCGAACGCCTAAAAGAAATGGATGAGAAGGAATTTTATGCCTGCGAAATCCAGCTACAAAAATGGT



ACGGAGACTTGAGAGGTAACCCCTTTGCCGTGGAAGCAGAGAACCGTGTTGTAGATATCTCCGGTTTC



TCAATCGGTAGCGATGGACACTCCATTCAGTATCGCAACTTGTTGGCCTGGAAATATTTGGAAAACGG



TAAGAGGGAATTCTATTTACTTATGAATTATGGCAAGAAAGGTAGAATCAGGTTTACTGACGGAACA



GACATTAAAAAGAGTGGTAAGTGGCAAGGCCTTTTGTACGGTGGTGGCAAGGCCAAAGTAATAGACT



TAACATTTGACCCCGACGACGAACAACTGATAATACTGCCTTTAGCTTTTGGTACTCGACAGGGGCGA



GAGTTCATTTGGAATGATCTTTTGTCACTCGAGACTGGTTTGATAAAACTTGCAAATGGAAGAGTCAT



CGAGAAGACAATTTACAACAAAAAGATAGGTCGCGATGAGCCTGCACTATTTGTGGCCTTGACCTTTG



AGAGAAGGGAAGTTGTCGACCCATCCAATATTAAACCAGTCAACCTAATCGGTGTAGATAGAGGTGA



AAACATCCCAGCTGTTATCGCTCTGACAGACCCTGAAGGTTGCCCTTTGCCAGAATTTAAAGATTCGT



CTGGTGGACCAACAGATATATTACGTATTGGGGAAGGCTATAAAGAGAAACAACGTGCTATTCAGGC



TGCAAAAGAAGTTGAACAGAGGAGAGCTGGAGGTTACAGTAGAAAATTCGCCAGTAAAAGTAGAAA



CTTAGCAGATGACATGGTTAGAAACTCTGCCCGGGATTTGTTCTATCATGCGGTTACTCACGATGCAG



TCTTAGTCTTTGAAAATCTATCGCGCGGTTTTGGTAGGCAAGGCAAGAGGACTTTTATGACAGAGAGA



CAATATACAAAAATGGAAGATTGGTTAACCGCGAAGCTCGCATATGAAGGTCTTACTTCGAAAACGT



ACCTCAGCAAAACGCTGGCTCAATATACTTCTAAAACTTGTTCAAATTGTGGTTTTACTATTACCACGG



CAGACTACGACGGGATGTTGGTGAGATTGAAGAAGACGAGCGATGGTTGGGCAACAACATTGAATAA



TAAGGAATTAAAAGCAGAAGGACAGATTACGTATTACAATCGTTATAAACGCCAAACGGTTGAGAAA



GAGTTGTCAGCCGAGTTGGATAGACTAAGTGAAGAGAGCGGTAACAATGATATCTCAAAGTGGACTA



AAGGGAGGCGGGATGAAGCCCTCTTTTTACTAAAGAAGAGATTCTCACATAGACCTGTGCAAGAACA



ATTCGTTTGTTTAGATTGTGGCCATGAGGTTCATGCAGACGAACAGGCTGCGTTAAATATTGCGAGAA



GCTGGCTATTTCTAAATTCTAATTCAACAGAGTTCAAGAGCTATAAATCCGGAAAACAACCTTTCGTA



GGCGCGTGGCAAGCCTTCTATAAAAGGAGATTAAAAGAGGTTTGGAAACCAAATGCA





SEQ ID NO: 146
ATGAAAAGAATTAACAAAATTAGAAGGAGGCTGGTCAAAGATTCTAATACCAAGAAAGCTGGTAAG



ACTGGTCCGATGAAAACCCTATTAGTCAGAGTTATGACCCCAGATTTGAGAGAAAGATTGGAGAACC



TCAGGAAAAAGCCCGAAAACATCCCACAACCCATTAGTAACACATCAAGAGCTAATTTAAACAAGTT



ATTAACTGACTACACTGAAATGAAAAAAGCAATATTGCATGTTTACTGGGAAGAGTTCCAGAAAGAT



CCTGTTGGGTTGATGTCTAGAGTTGCTCAACCGGCCCCAAAGAATATAGATCAAAGGAAACTTATTCC



TGTGAAGGACGGCAATGAAAGATTAACCAGCTCCGGTTTCGCTTGCTCCCAGTGCTGCCAACCCCTGT



ATGTATACAAACTGGAACAAGTAAATGATAAAGGTAAGCCACATACTAACTACTTTGGTAGGTGTAA



TGTATCCGAGCATGAAAGATTGATCTTGTTAAGTCCCCATAAACCAGAAGCTAATGATGAGTTAGTAA



CTTATAGTTTAGGTAAGTTCGGACAACGAGCTTTAGATTTCTATAGCATCCATGTTACAAGAGAAAGC



AATCACCCCGTCAAACCACTGGAACAAATCGGTGGTAATAGTTGTGCGTCAGGTCCAGTAGGCAAAG



CTTTATCAGACGCTTGCATGGGTGCCGTGGCTAGTTTTTTGACGAAATACCAAGATATTATACTGGAA



CATCAAAAGGTAATTAAAAAGAATGAAAAGAGACTCGCTAACTTAAAAGATATTGCAAGTGCCAATG



GTTTAGCTTTTCCTAAAATTACCTTGCCACCTCAGCCACATACAAAGGAGGGAATTGAAGCTTACAAT



AATGTAGTAGCCCAAATAGTTATTTGGGTGAACCTTAACCTATGGCAAAAGTTAAAAATTGGTAGAG



ACGAAGCCAAACCCCTGCAGAGGCTGAAGGGTTTTCCCTCCTTCCCCTTAGTAGAGAGACAAGCTAAT



GAAGTGGACTGGTGGGATATGGTGTGCAATGTTAAAAAATTGATTAATGAGAAGAAAGAGGATGGTA



AAGTGTTTTGGCAGAATCTTGCTGGCTACAAGAGACAGGAAGCTTTACTGCCTTATTTATCTTCTGAG



GAAGATAGGAAAAAAGGTAAAAAATTTGCTAGATATCAATTCGGAGACCTACTTCTGCATTTAGAAA



AAAAACATGGCGAAGATTGGGGTAAAGTTTATGATGAAGCCTGGGAAAGAATTGATAAGAAGGTAG



AAGGTCTCTCCAAACATATTAAATTAGAGGAAGAACGTAGGTCCGAAGACGCTCAATCAAAGGCAGC



ATTAACTGATTGGTTGAGAGCAAAAGCCTCTTTCGTTATTGAAGGATTAAAAGAAGCCGACAAAGAT



GAATTTTGTAGATGTGAGTTAAAGTTGCAAAAGTGGTATGGAGACCTCCGTGGTAAACCTTTTGCTAT



TGAGGCTGAAAATTCTATACTCGATATCTCTGGATTTTCAAAACAATATAACTGCGCATTTATATGGC



AGAAAGATGGTGTTAAAAAGCTAAATCTATACTTAATTATCAATTACTTTAAAGGTGGTAAATTGCGT



TTTAAGAAGATAAAGCCTGAAGCCTTTGAGGCAAACCGTTTTTACACTGTTATCAATAAAAAATCTGG



GGAAATCGTACCAATGGAAGTTAATTTCAATTTCGATGATCCTAATCTTATTATTTTACCTCTTGCTTT



CGGCAAAAGGCAAGGTAGGGAGTTTATTTGGAATGATTTATTGTCGCTGGAAACGGGGTCTCTCAAA



CTCGCAAACGGTAGGGTGATAGAAAAAACATTATACAACAGGAGAACTCGGCAGGATGAGCCAGCTC



TTTTTGTGGCTCTGACATTCGAGAGAAGGGAAGTTTTAGATTCATCTAACATCAAACCAATGAATTTA



ATAGGTATTGACCGGGGTGAAAATATACCTGCAGTTATTGCTTTAACTGATCCTGAGGGATGTCCTCT



TAGCAGATTCAAGGACTCGTTGGGTAACCCTACTCACATCTTAAGGATTGGAGAAAGTTACAAGGAG



AAACAAAGGACAATACAAGCTGCTAAAGAAGTAGAACAAAGGAGGGCGGGTGGATATAGTCGGAAA



TATGCCAGCAAGGCCAAGAATTTAGCTGACGACATGGTTAGGAATACAGCTAGAGACCTTTTATACTA



TGCCGTCACCCAGGATGCCATGTTGATATTTGAAAATTTAAGTAGAGGCTTCGGTAGACAAGGTAAGC



GCACCTTCATGGCAGAGAGACAATATACTAGAATGGAAGATTGGTTGACTGCCAAATTGGCATACGA



AGGTCTACCTAGTAAGACGTACTTATCTAAAACACTAGCGCAGTATACTTCCAAGACATGCAGTAATT



GTGGTTTCACAATCACTTCTGCCGATTACGATCGCGTCTTGGAAAAACTAAAAAAAACAGCGACAGGT



TGGATGACTACTATTAATGGGAAAGAATTGAAGGTCGAAGGACAAATAACTTACTATAATAGATATA



AACGGCAAAACGTTGTAAAAGACCTGTCAGTCGAACTCGATCGACTTAGTGAAGAATCTGTTAATAA



TGATATTAGTTCGTGGACAAAAGGTAGATCCGGTGAAGCTTTGAGCCTCCTGAAAAAACGTTTTAGCC



ATAGGCCTGTCCAAGAAAAGTTTGTATGTTTAAACTGTGGTTTTGAGACCCATGCAGACGAGCAGGCC



GCTCTTAATATTGCTAGATCATGGTTATTTTTAAGATCTCAGGAATACAAGAAGTACCAGACTAACAA



GACAACAGGCAACACAGATAAGCGAGCATTCGTTGAGACTTGGCAATCTTTTTATAGAAAGAAATTG



AAGGAAGTCTGGAAACCA





SEQ ID NO: 147
ATGGGAAAAATGTATTATCTAGGCCTGGACATAGGGACCAATTCAGTAGGCTACGCTGTCACTGACCC



CTCCTACCATTTGCTGAAGTTCAAGGGGGAACCCATGTGGGGAGCACACGTGTTTGCGGCCGGCAACC



AGAGCGCAGAGCGGAGAAGCTTCCGCACCTCCAGGAGAAGGCTGGATCGCAGGCAGCAGCGTGTGA



AGCTGGTCCAAGAGATATTTGCCCCAGTGATTTCCCCCATCGATCCGCGCTTCTTTATTAGGCTCCACG



AGTCCGCTCTCTGGCGCGACGACGTGGCCGAAACTGATAAACATATTTTCTTTAATGACCCAACATAC



ACTGACAAGGAGTACTATTCAGATTACCCAACAATTCACCATTTGATCGTGGACCTTATGGAAAGTTC



GGAGAAGCATGATCCTCGACTTGTCTATTTGGCCGTGGCGTGGCTCGTGGCACATAGGGGCCACTTCT



TGAACGAGGTGGACAAGGATAACATCGGGGATGTGTTATCTTTCGACGCTTTCTATCCTGAATTCCTT



GCTTTTCTGTCTGACAATGGCGTCAGCCCGTGGGTCTGCGAATCCAAGGCCCTCCAGGCTACGCTATT



GTCAAGAAATAGCGTGAACGACAAGTACAAGGCTCTTAAGTCTTTGATTTTTGGAAGCCAGAAGCCC



GAGGACAACTTTGATGCAAATATCTCGGAGGACGGGCTGATTCAGCTCCTCGCTGGGAAAAAGGTCA



AGGTCAATAAGCTGTTTCCACAGGAGTCAAATGACGCGAGCTTCACCCTTAACGACAAAGAGGATGC



CATTGAAGAGATCCTGGGGACACTCACCCCAGACGAGTGCGAGTGGATAGCCCATATTAGGCGCCTC



TTTGATTGGGCCATAATGAAACATGCGCTTAAGGACGGGCGCACGATATCCGAAAGCAAGGTCAAAT



TGTACGAGCAGCACCACCATGATCTGACCCAGCTAAAATATTTTGTAAAAACATATCTGGCCAAGGA



GTACGATGATATCTTCCGCAACGTGGATAGTGAGACCACCAAAAACTACGTCGCGTACTCATACCACG



TGAAAGAAGTTAAGGGCACGCTGCCTAAGAACAAGGCAACACAAGAGGAGTTCTGCAAGTACGTTCT



CGGGAAAGTTAAAAATATAGAGTGCAGCGAGGCCGACAAAGTGGATTTTGACGAGATGATTCAACGC



CTGACCGACAATTCGTTTATGCCTAAACAGGTGAGTGGAGAGAATCGCGTGATTCCATATCAGCTCTA



TTACTATGAACTCAAGACTATTCTGAATAAGGCCGCTAGCTATTTACCCTTCCTTACGCAGTGCGGGA



AGGATGCCATTTCTAACCAGGATAAACTCTTGAGTATAATGACATTTCGAATTCCCTATTTCGTGGGTC



CGCTTCGTAAGGATAACAGTGAGCACGCTTGGCTGGAGCGGAAGGCTGGCAAAATTTATCCATGGAA



TTTCAACGACAAGGTGGATCTGGACAAATCCGAAGAAGCCTTTATCCGCAGGATGACCAATACTTGC



ACATACTATCCTGGGGAGGATGTCCTTCCACTGGACTCTCTGATCTACGAAAAGTTCATGATTTTGAA



TGAAATTAACAACATAAGGATCGATGGGTATCCTATTTCCGTCGACGTGAAGCAGCAGGTGTTCGGGC



TCTTTGAGAAGAAGCGACGGGTGACCGTGAAGGATATTCAGAATCTTCTCTTATCGCTGGGAGCCCTG



GATAAACACGGAAAACTGACCGGGATAGATACTACGATTCATTCTAATTACAACACGTATCACCATTT



TAAGTCACTGATGGAGAGGGGCGTCCTAACAAGAGATGACGTGGAGAGAATAGTGGAACGAATGAC



ATATTCTGATGACACCAAGAGAGTGCGGCTTTGGCTGAATAACAACTACGGCACTCTGACGGCGGAT



GATGTAAAGCATATTTCCCGACTCCGTAAGCATGACTTCGGGCGGCTGTCTAAGATGTTTCTAACAGG



CCTCAAGGGTGTGCATAAGGAAACTGGGGAGCGCGCTAGCATCCTGGATTTTATGTGGAACACCAAT



GATAACCTGATGCAGCTCCTGTCAGAATGCTACACATTTTCGGACGAAATCACCAAGCTGCAGGAGG



CTTACTATGCCAAGGCCCAACTAAGCTTGAATGATTTCCTGGATTCTATGTACATCAGCAACGCCGTA



AAACGACCAATTTATAGGACACTGGCAGTGGTTAACGACATTAGGAAAGCATGCGGAACAGCTCCCA



AGCGAATCTTTATCGAGATGGCCCGCGACGGCGAGAGTAAGAAGAAAAGGTCAGTGACTAGGCGGG



AGCAGATCAAGAACCTTTACCGCTCTATCCGAAAAGACTTCCAGCAAGAGGTTGATTTCCTTGAGAAG



ATCTTAGAGAACAAGTCAGATGGACAGCTCCAATCCGATGCTCTGTATCTGTACTTCGCTCAGCTGGG



ACGAGATATGTACACTGGCGACCCCATTAAACTAGAACATATCAAGGACCAATCGTTTTATAATATCG



ACCACATCTACCCTCAGTCCATGGTGAAAGACGATAGTCTGGACAATAAGGTGCTCGTCCAAAGTGA



GATTAACGGAGAAAAGTCGAGCAGATATCCTTTGGACGCTGCGATCCGCAACAAGATGAAGCCCCTG



TGGGATGCTTACTACAATCATGGACTGATCAGCCTGAAGAAGTATCAGAGACTGACCCGGAGTACCC



CTTTCACAGACGATGAGAAGTGGGATTTTATCAATAGACAACTGGTGGAAACCAGGCAGTCCACGAA



AGCTCTGGCCATTCTTCTGAAGAGAAAGTTTCCAGACACAGAGATCGTCTATTCAAAGGCCGGCCTCA



GTTCCGACTTTAGACATGAGTTCGGACTCGTTAAATCACGAAATATAAACGATCTCCACCATGCAAAG



GACGCATTCCTCGCGATTGTGACTGGAAATGTCTATCACGAAAGATTTAATAGGCGGTGGTTCATGGT



TAACCAGCCATACTCAGTGAAGACCAAGACCCTTTTCACTCACTCTATTAAAAATGGCAACTTCGTGG



CTTGGAATGGTGAGGAGGATCTTGGAAGAATTGTGAAGATGTTAAAACAGAATAAGAATACCATCCA



CTTTACTAGATTCAGCTTTGACCGAAAAGAGGGGCTATTCGATATTCAACCGTTAAAGGCTTCAACAG



GTCTCGTTCCACGAAAGGCCGGACTGGACGTAGTGAAATACGGCGGCTATGATAAGAGCACCGCAGC



TTACTACCTCCTTGTGCGATTTACGCTCGAGGATAAGAAGACCCAACACAAGCTGATGATGATTCCCG



TGGAGGGACTGTACAAAGCTCGAATTGACCATGATAAAGAGTTTCTCACAGATTACGCACAAACCAC



CATCTCTGAGATTCTCCAGAAAGACAAACAAAAAGTTATAAACATAATGTTTCCAATGGGTACAAGG



CATATTAAACTGAACAGCATGATCTCCATTGATGGCTTTTATTTGTCCATTGGAGGAAAGTCTAGTAA



AGGCAAGTCTGTCCTCTGCCATGCCATGGTACCCCTAATCGTCCCACACAAGATTGAATGCTACATCA



AGGCTATGGAGAGTTTTGCTCGGAAATTTAAAGAGAATAATAAGCTGCGTATTGTGGAAAAATTCGA



CAAGATAACCGTTGAAGACAATCTGAATCTGTACGAGCTCTTTCTGCAGAAGCTGCAGCATAACCCCT



ATAATAAGTTCTTCTCCACACAGTTCGATGTACTGACCAACGGGCGATCAACTTTCACAAAGCTAAGT



CCTGAGGAACAGGTGCAAACACTCCTAAACATTCTTTCCATTTTTAAGACCTGCAGATCTTCAGGATG



CGACTTGAAGAGCATTAACGGGAGCGCACAGGCAGCTAGGATCATGATCTCAGCTGACCTGACAGGG



CTGAGTAAAAAATACTCCGACATTCGGCTTGTAGAGCAAAGCGCCAGTGGGTTGTTCGTTAGTAAGTC



GCAGAACCTGCTGGAATACCTGTAA





SEQ ID NO: 148
ATGTCTTCTTTGACGAAGTTTACAAACAAATACTCTAAGCAGCTTACAATTAAGAACGAACTGATTCC



CGTAGGAAAGACTCTGGAAAACATCAAAGAGAATGGGCTGATAGACGGCGACGAACAACTGAATGA



GAACTATCAGAAGGCCAAAATTATCGTGGATGACTTCCTGAGGGATTTTATTAACAAGGCCCTGAATA



ATACCCAGATCGGCAATTGGCGGGAACTGGCCGACGCTCTGAACAAAGAAGATGAGGACAATATCGA



AAAATTACAAGACAAAATCAGGGGCATTATTGTCAGTAAGTTCGAGACATTCGATCTGTTCTCTTCGT



ACTCCATTAAGAAGGACGAGAAAATCATCGATGATGACAATGACGTTGAGGAAGAAGAACTGGACTT



GGGTAAAAAGACCTCATCCTTCAAGTATATTTTTAAAAAAAATCTGTTTAAATTAGTGCTCCCCAGTT



ATTTAAAGACAACTAACCAGGACAAGCTTAAGATTATCTCCTCTTTTGACAACTTTAGCACCTATTTTA



GAGGCTTCTTTGAAAATCGCAAGAATATTTTCACTAAGAAGCCCATAAGCACCTCTATTGCCTACAGA



ATCGTACATGATAACTTCCCAAAATTTTTGGATAACATTAGATGTTTTAATGTATGGCAGACCGAATG



TCCTCAGTTAATTGTGAAGGCGGATAACTACCTCAAATCCAAGAATGTGATCGCCAAAGATAAGTCTC



TTGCTAACTACTTTACGGTCGGAGCCTACGATTACTTCTTATCTCAAAACGGTATTGACTTTTACAATA



ACATTATCGGGGGATTGCCTGCCTTCGCCGGCCATGAGAAAATTCAGGGCTTAAACGAGTTCATAAAT



CAGGAATGTCAAAAGGACTCAGAGCTGAAATCAAAGCTTAAGAATCGACACGCATTTAAAATGGCGG



TCTTGTTCAAACAGATCCTCAGCGATAGAGAGAAAAGCTTCGTTATTGATGAATTCGAGAGCGACGCA



CAGGTGATTGATGCCGTGAAGAACTTCTATGCGGAACAGTGTAAAGACAATAATGTTATTTTCAACCT



ATTAAACTTGATTAAGAATATCGCGTTTTTAAGTGACGATGAACTCGACGGTATCTTTATAGAAGGCA



AGTACCTGTCCTCTGTCAGCCAAAAACTCTACTCAGATTGGTCCAAGCTAAGAAATGACATCGAGGAC



AGTGCTAACAGCAAACAGGGCAATAAAGAGCTGGCAAAGAAAATCAAGACTAATAAAGGGGATGTG



GAGAAGGCGATATCTAAATATGAGTTCTCCCTCTCCGAACTGAACTCCATCGTCCACGATAATACCAA



GTTTAGTGATCTGTTGTCGTGTACACTGCACAAAGTGGCCAGTGAAAAACTCGTCAAGGTGAACGAA



GGCGATTGGCCCAAACACCTGAAAAATAATGAGGAGAAACAGAAGATCAAAGAACCTTTGGATGCGT



TGCTCGAAATATATAACACACTGTTGATCTTCAACTGTAAAAGCTTCAACAAGAACGGGAACTTTTAT



GTAGACTACGATCGATGTATAAATGAACTGAGCAGCGTCGTTTACCTGTACAACAAGACTCGCAATTA



TTGTACGAAAAAACCATATAACACCGATAAGTTCAAGCTTAATTTCAACAGTCCCCAGCTGGGAGAA



GGGTTCAGCAAATCAAAAGAAAACGATTGCCTGACATTACTCTTTAAAAAGGATGATAATTATTATGT



TGGGATTATTAGGAAAGGCGCTAAGATCAACTTTGACGACACACAGGCCATAGCTGACAACACTGAT



AACTGCATCTTTAAAATGAATTACTTTCTGTTGAAGGACGCCAAAAAATTCATTCCAAAATGCTCTAT



TCAGCTCAAGGAGGTTAAGGCCCATTTCAAGAAGTCTGAAGATGACTACATCCTCTCTGACAAGGAA



AAATTCGCTAGTCCTCTGGTTATCAAAAAAAGTACCTTCTTGCTGGCTACAGCTCACGTGAAAGGCAA



GAAAGGGAACATTAAGAAGTTCCAAAAGGAATACAGCAAAGAGAATCCAACCGAGTACAGAAATTC



TCTGAACGAATGGATCGCATTCTGTAAAGAATTTCTAAAGACGTACAAGGCCGCTACCATTTTCGATA



TTACCACCTTGAAAAAAGCCGAGGAGTACGCCGACATCGTCGAATTCTATAAAGACGTGGATAACCT



GTGTTACAAATTGGAATTCTGCCCAATTAAGACCTCTTTCATTGAAAACCTCATCGACAATGGGGACC



TCTACTTATTTAGAATTAACAATAAGGATTTTTCTTCGAAATCTACCGGAACTAAAAATCTGCACACA



CTGTATCTGCAAGCAATCTTCGATGAACGTAATCTCAACAACCCTACAATAATGCTGAACGGCGGTGC



TGAACTGTTCTACCGTAAAGAGAGTATTGAACAGAAGAATCGAATCACACACAAAGCGGGCAGTATT



CTCGTCAATAAGGTGTGCAAAGACGGGACCAGCCTGGACGATAAGATCAGGAATGAAATATATCAGT



ATGAGAACAAGTTTATCGACACCTTGTCGGATGAGGCAAAGAAGGTGCTACCTAACGTTATCAAGAA



GGAAGCTACCCATGACATAACCAAGGATAAGCGGTTCACTTCTGACAAGTTCTTCTTCCACTGTCCTC



TGACCATTAACTACAAGGAAGGAGACACTAAACAATTCAATAATGAAGTACTTAGCTTTTTGCGGGGT



AATCCCGATATTAACATAATTGGTATCGACCGGGGAGAACGGAACCTGATATACGTGACAGTAATTA



ATCAGAAAGGAGAAATCCTGGATTCCGTATCCTTCAATACCGTGACTAATAAATCTAGTAAAATCGAG



CAGACGGTCGACTACGAGGAAAAGTTAGCAGTCAGAGAGAAGGAGAGAATCGAGGCCAAACGTTCC



TGGGATAGTATCAGCAAGATTGCTACTCTGAAAGAAGGATATCTGTCCGCTATCGTCCATGAGATCTG



TTTGTTGATGATCAAGCACAATGCTATAGTGGTTCTGGAGAACCTGAACGCAGGCTTCAAGCGAATTA



GAGGGGGCCTGTCGGAAAAAAGCGTTTACCAGAAGTTTGAAAAGATGCTAATCAATAAGTTAAATTA



CTTTGTAAGTAAAAAAGAAAGCGATTGGAATAAGCCATCAGGACTTTTAAACGGGCTGCAACTGAGC



GACCAGTTTGAGTCATTCGAAAAACTGGGTATTCAGAGTGGTTTCATATTCTACGTACCTGCCGCTTA



CACTTCAAAGATCGATCCTACAACTGGTTTTGCGAATGTCCTGAATCTGTCTAAGGTGAGGAATGTGG



ACGCAATCAAGTCTTTCTTCAGCAACTTCAACGAGATATCTTACAGCAAGAAAGAGGCTCTGTTTAAA



TTCAGTTTTGATCTGGATAGCCTGAGCAAGAAAGGATTCTCTTCTTTCGTAAAGTTTTCTAAGTCCAAA



TGGAACGTCTACACGTTCGGAGAGAGAATCATTAAACCAAAGAACAAGCAGGGGTATCGGGAAGAC



AAAAGGATCAATCTGACTTTCGAAATGAAGAAACTATTGAATGAGTACAAAGTCTCATTCGATTTGGA



GAACAATCTGATCCCCAATCTGACCAGCGCTAACCTCAAAGACACATTCTGGAAGGAGCTGTTTTTCA



TCTTTAAGACCACCCTGCAGCTACGGAATAGTGTCACAAATGGGAAAGAGGATGTACTGATCTCACCT



GTGAAAAACGCCAAGGGGGAGTTCTTTGTGTCCGGCACCCATAACAAAACCCTGCCTCAGGACTGTG



ACGCGAACGGGGCCTACCACATCGCGCTAAAGGGGTTAATGATTCTCGAACGTAATAATCTGGTGCG



CGAAGAAAAAGACACAAAGAAAATTATGGCCATCAGCAACGTTGACTGGTTTGAGTACGTGCAGAAG



CGTCGAGGAGTTTTGTAA





SEQ ID NO: 149
ATGAACAACTATGACGAGTTCACTAAACTTTACCCCATTCAGAAAACCATCAGATTTGAACTGAAGCC



TCAGGGTCGTACCATGGAACACTTGGAAACTTTCAACTTTTTCGAGGAGGACAGGGATAGAGCTGAG



AAATACAAGATCTTGAAAGAGGCCATCGACGAGTATCACAAAAAATTCATCGATGAGCATCTCACCA



ACATGTCGCTGGATTGGAACAGTCTCAAGCAGATTTCCGAGAAGTACTATAAATCTCGGGAGGAGAA



AGATAAAAAGGTGTTTTTGAGCGAGCAAAAGCGAATGCGACAGGAGATAGTCTCTGAATTTAAGAAA



GATGATCGGTTTAAAGACCTATTTTCCAAAAAGCTTTTTTCAGAGCTGCTGAAGGAAGAGATCTATAA



AAAAGGCAATCACCAAGAAATTGATGCCCTGAAATCATTCGACAAATTCAGTGGGTATTTCATAGGA



CTGCATGAGAACCGGAAGAATATGTATAGTGATGGAGACGAGATCACAGCCATAAGCAATCGAATCG



TTAACGAGAATTTCCCGAAGTTCCTGGATAACCTGCAGAAGTATCAAGAGGCTAGGAAAAAGTACCC



TGAGTGGATCATCAAGGCTGAATCAGCTCTGGTGGCTCACAATATCAAGATGGATGAAGTCTTTAGTC



TTGAGTACTTTAATAAAGTCCTTAACCAGGAGGGCATCCAGCGCTATAACCTGGCTCTCGGTGGCTAC



GTCACAAAAAGCGGAGAAAAGATGATGGGTCTCAACGATGCACTGAATTTGGCTCATCAGTCGGAGA



AGTCATCTAAGGGACGCATACACATGACACCACTGTTTAAACAAATCCTGAGCGAAAAGGAATCATT



TTCCTACATTCCCGACGTATTCACCGAGGACTCACAACTGCTGCCTAGTATAGGGGGGTTTTTCGCTCA



GATAGAGAACGACAAAGATGGCAACATTTTTGACAGAGCCTTGGAGTTGATTTCATCTTACGCCGAGT



ACGATACGGAGCGCATTTATATTCGCCAGGCGGATATCAACAGGGTTTCCAATGTGATCTTTGGCGAG



TGGGGAACGCTGGGCGGGCTGATGCGGGAATACAAAGCCGACTCGATCAATGACATCAACCTGGAGA



GAACATGCAAGAAGGTCGATAAATGGTTGGATAGCAAAGAGTTCGCCCTGAGTGACGTCTTGGAAGC



TATCAAAAGAACCGGAAATAATGACGCGTTCAACGAGTATATCTCTAAAATGAGGACCGCGAGAGAA



AAAATTGATGCAGCAAGGAAGGAGATGAAGTTTATATCTGAGAAGATCTCAGGCGATGAAGAGTCCA



TCCATATTATTAAAACTCTTCTGGACTCAGTGCAGCAATTCCTGCACTTTTTTAACCTCTTCAAGGCCA



GGCAGGATATACCGTTAGACGGGGCTTTTTATGCCGAGTTTGATGAAGTTCATTCGAAACTTTTTGCT



ATAGTGCCTCTCTATAATAAAGTTCGCAATTACCTGACAAAGAATAACTTAAACACAAAGAAAATCA



AGCTCAACTTCAAAAACCCAACACTGGCAAACGGATGGGATCAGAACAAGGTATATGATTACGCCTC



ATTGATTTTCCTCCGGGACGGGAATTACTATCTGGGGATCATCAACCCTAAGCGCAAAAAGAACATTA



AGTTCGAACAGGGATCTGGCAATGGTCCCTTCTATAGGAAAATGGTATACAAACAGATTCCTGGCCCC



AACAAGAATCTCCCACGCGTCTTTCTGACGTCCACTAAGGGAAAGAAGGAGTACAAGCCGTCTAAAG



AAATTATCGAGGGCTATGAGGCAGACAAGCATATTAGGGGTGACAAGTTTGACCTAGACTTTTGTCAT



AAGCTTATCGACTTTTTCAAGGAGTCCATAGAGAAGCACAAAGATTGGTCAAAGTTTAATTTCTATTT



TTCTCCAACAGAGTCCTACGGGGATATCTCTGAGTTCTATCTGGATGTTGAAAAGCAGGGGTACAGAA



TGCACTTCGAAAATATCTCAGCAGAAACTATCGATGAGTACGTAGAGAAAGGAGATCTGTTTCTTTTC



CAAATCTACAATAAGGATTTTGTGAAGGCCGCCACTGGGAAGAAGGACATGCACACTATTTACTGGA



ACGCTGCATTTTCCCCTGAAAATCTGCAGGACGTAGTAGTGAAATTAAATGGTGAGGCAGAACTGTTT



TACCGCGATAAATCAGACATCAAGGAAATAGTGCACCGGGAAGGCGAGATTCTTGTTAACCGAACAT



ATAATGGCAGGACACCTGTCCCTGATAAAATTCATAAGAAACTGACCGATTACCACAACGGTCGAAC



CAAGGATCTGGGCGAGGCCAAGGAATACCTCGATAAGGTGAGGTACTTCAAAGCCCATTATGACATC



ACCAAGGACCGAAGATACCTTAACGACAAAATCTACTTCCATGTCCCACTCACCTTGAACTTCAAAGC



TAACGGTAAGAAGAACCTCAATAAAATGGTGATTGAAAAATTTCTGTCCGATGAGAAGGCCCATATC



ATCGGCATTGATCGCGGCGAGAGAAATCTCCTTTACTATTCTATCATTGATCGGTCGGGAAAGATTAT



CGACCAACAATCACTGAATGTCATCGACGGATTCGACTATAGAGAGAAGCTGAACCAACGGGAAATC



GAGATGAAGGACGCGCGCCAGTCCTGGAACGCTATCGGCAAAATTAAAGATTTGAAAGAAGGTTACC



TCTCCAAAGCAGTGCACGAAATTACCAAAATGGCAATCCAGTACAATGCTATTGTGGTAATGGAGGA



GTTAAATTACGGATTTAAGCGCGGGAGGTTCAAGGTTGAAAAGCAAATTTACCAAAAATTTGAGAAC



ATGTTGATTGATAAGATGAACTACCTGGTGTTCAAGGACGCACCTGACGAGTCGCCAGGCGGCGTGTT



AAATGCATATCAGCTGACAAATCCACTGGAGAGCTTTGCCAAGCTAGGAAAGCAGACTGGCATTCTC



TTTTACGTCCCTGCAGCGTATACATCCAAAATTGACCCCACCACTGGCTTCGTCAATCTGTTTAACACC



TCCTCCAAAACCAACGCACAAGAACGGAAAGAATTTTTGCAAAAGTTTGAGTCCATTAGCTACTCTGC



CAAAGACGGCGGGATCTTTGCTTTCGCATTCGACTACAGGAAATTCGGGACGAGTAAGACAGACCAC



AAGAACGTCTGGACCGCGTACACTAATGGGGAACGCATGCGCTACATCAAAGAGAAAAAGAGGAAT



GAACTTTTTGACCCTTCAAAGGAAATCAAGGAAGCTCTCACCTCAAGCGGTATCAAATACGATGGCG



GGCAGAATATTTTGCCAGATATCCTCAGATCGAACAATAATGGACTTATCTATACTATGTACTCCTCCT



TCATTGCAGCAATTCAAATGAGAGTGTACGATGGAAAGGAGGATTACATTATATCGCCAATTAAGAA



CTCCAAAGGCGAATTCTTCCGCACGGATCCTAAGCGAAGAGAACTCCCAATCGACGCTGATGCGAAC



GGCGCCTATAATATAGCCCTGCGGGGTGAATTAACAATGCGCGCTATTGCCGAGAAGTTCGACCCCG



ATTCAGAAAAAATGGCTAAGCTTGAGCTGAAACACAAAGATTGGTTCGAATTCATGCAGACAAGAGG



CGACTAA





SEQ ID NO: 150
ATGACTAAGACCTTCGATTCCGAGTTCTTCAACCTTTATTCCCTGCAGAAAACTGTAAGGTTTGAGCTG



AAGCCGGTGGGCGAGACAGCCAGCTTCGTAGAGGATTTCAAGAATGAGGGTCTCAAACGGGTAGTTA



GTGAGGATGAGAGGAGAGCAGTGGACTATCAGAAGGTGAAAGAGATCATCGATGACTATCACCGGG



ATTTCATAGAGGAGTCGTTGAATTACTTCCCTGAGCAAGTATCCAAAGACGCGCTGGAACAGGCCTTT



CATCTTTACCAGAAACTGAAGGCAGCGAAGGTTGAGGAGCGGGAAAAGGCCTTGAAAGAGTGGGAA



GCCCTGCAGAAAAAGCTCAGAGAAAAGGTTGTCAAATGCTTCAGCGACAGCAACAAAGCCAGGTTCA



GTAGGATCGATAAGAAAGAACTGATCAAAGAAGACTTGATCAATTGGCTGGTTGCACAGAACCGGGA



AGATGATATTCCCACCGTAGAGACCTTCAACAACTTCACAACTTACTTCACCGGCTTCCATGAGAATC



GTAAAAACATCTACAGTAAAGATGATCATGCAACCGCCATCTCCTTCCGGTTGATCCACGAGAATCTC



CCCAAGTTCTTTGACAACGTGATAAGTTTCAATAAGTTGAAAGAGGGATTTCCCGAACTCAAGTTCGA



TAAAGTGAAGGAGGATCTGGAAGTGGATTATGACCTTAAGCACGCTTTCGAGATAGAGTACTTCGTG



AACTTTGTGACTCAGGCCGGCATCGATCAGTATAACTACCTCCTCGGGGGTAAGACGCTCGAGGACG



GTACTAAGAAGCAAGGAATGAATGAGCAAATTAATCTATTTAAACAGCAGCAGACCAGGGATAAGGC



TAGACAGATCCCCAAGCTTATTCCTCTTTTTAAACAGATCCTAAGTGAAAGGACAGAAAGTCAAAGCT



TCATACCTAAGCAATTTGAAAGTGATCAGGAGCTGTTTGACTCCCTGCAAAAGCTGCACAACAATTGC



CAGGACAAGTTTACCGTGCTGCAGCAGGCTATCCTCGGACTGGCTGAGGCGGATCTTAAGAAGGTATT



CATTAAGACTAGCGACCTCAATGCCCTTAGTAACACCATCTTTGGAAATTACTCCGTTTTCAGCGATG



CCCTCAATCTATACAAAGAGAGCTTGAAGACTAAAAAAGCTCAGGAAGCTTTTGAAAAATTACCGGC



ACATTCTATACACGACCTTATACAATACTTAGAGCAGTTCAACAGCAGCCTCGACGCTGAGAAACAGC



AATCCACAGACACCGTCCTGAATTACTTCATCAAAACCGATGAACTGTACTCCCGATTTATCAAGAGC



ACTTCAGAAGCCTTCACGCAAGTTCAGCCTCTGTTCGAGCTGGAGGCACTGTCCAGCAAGAGACGACC



GCCAGAGTCTGAAGACGAGGGAGCCAAGGGTCAAGAGGGGTTTGAACAGATAAAGCGAATTAAGGC



TTACTTGGATACTCTCATGGAGGCGGTGCATTTCGCTAAGCCTTTGTACCTGGTTAAAGGCCGAAAAA



TGATTGAGGGGCTAGATAAGGATCAGTCTTTTTACGAGGCTTTTGAAATGGCCTACCAGGAATTGGAA



TCCTTGATCATTCCAATCTATAATAAAGCCCGGAGTTATCTGAGCAGGAAGCCCTTCAAAGCCGACAA



GTTCAAAATAAATTTTGACAATAATACGCTACTGTCTGGTTGGGACGCTAACAAGGAAACAGCCAAT



GCTTCCATCCTGTTTAAGAAAGACGGCCTGTACTACCTGGGAATTATGCCAAAAGGCAAAACTTTTTT



GTTCGATTACTTTGTGTCATCAGAGGATAGCGAGAAGTTAAAGCAAAGACGGCAGAAGACCGCCGAA



GAAGCCCTCGCACAAGACGGAGAATCATATTTCGAGAAAATTCGATATAAGCTCCTGCCTGGCGCAT



CAAAGATGTTGCCAAAAGTCTTCTTTTCCAACAAAAACATCGGCTTTTATAACCCCAGCGATGATATC



CTTCGCATCCGGAACACCGCCTCACATACCAAAAATGGAACTCCACAGAAGGGCCACTCGAAGGTTG



AATTCAACCTTAACGATTGTCACAAAATGATTGATTTTTTTAAGAGCTCCATTCAGAAACACCCCGAA



TGGGGGTCCTTTGGCTTCACCTTTTCTGATACTTCAGACTTCGAGGACATGTCCGCCTTCTACAGGGAG



GTGGAGAACCAGGGCTATGTCATCTCCTTCGACAAAATAAAAGAGACATACATTCAGAGCCAGGTCG



AGCAGGGAAATCTGTACCTGTTTCAGATCTATAACAAGGATTTCAGTCCCTATAGCAAGGGCAAGCCC



AATTTACATACCCTGTACTGGAAGGCCCTGTTCGAAGAGGCAAACCTTAACAATGTAGTTGCTAAGCT



GAATGGGGAAGCAGAGATCTTCTTCCGAAGGCACAGCATCAAGGCAAGCGACAAAGTTGTACATCCT



GCTAACCAGGCCATCGATAACAAGAACCCGCATACAGAAAAGACACAGTCAACCTTTGAATACGACC



TCGTGAAGGACAAGAGGTACACACAAGATAAATTCTTCTTCCACGTGCCCATCAGCTTGAATTTTAAA



GCGCAGGGAGTGAGCAAATTTAACGACAAGGTCAACGGCTTCCTGAAGGGAAACCCCGACGTGAATA



TCATCGGAATTGATCGCGGTGAAAGACATCTCCTCTACTTTACTGTGGTGAACCAGAAGGGTGAGATC



CTAGTACAGGAGAGCCTGAACACCCTTATGAGTGATAAGGGCCATGTGAATGATTACCAGCAGAAGC



TGGACAAGAAGGAACAGGAAAGGGACGCAGCGCGGAAGTCCTGGACCACTGTTGAGAATATCAAAG



AACTGAAGGAGGGATATCTTAGCCATGTGGTACACAAACTTGCACATCTGATTATCAAGTATAATGCC



ATAGTCTGCCTGGAAGACTTGAACTTCGGTTTCAAGCGAGGAAGGTTTAAAGTGGAGAAGCAGGTGT



ACCAGAAGTTTGAGAAAGCCCTTATTGATAAGCTAAACTACCTTGTCTTTAAGGAAAAAGAACTCGGC



GAAGTTGGCCACTATTTAACCGCCTACCAACTAACCGCCCCTTTCGAGTCTTTTAAGAAACTGGGAAA



GCAGAGCGGAATACTCTTCTATGTGCCTGCAGACTACACCTCTAAGATCGACCCCACTACCGGCTTTG



TAAACTTTCTAGATCTCCGCTATCAGTCAGTAGAAAAAGCCAAACAGCTCTTGTCAGATTTTAACGCC



ATCCGATTTAATTCCGTCCAAAATTACTTCGAGTTCGAAATCGACTATAAAAAACTTACCCCCAAGAG



AAAGGTTGGGACGCAGTCTAAGTGGGTAATCTGCACTTACGGTGACGTGAGATACCAGAACCGCCGA



AACCAGAAAGGTCATTGGGAAACCGAGGAAGTGAATGTGACTGAGAAGCTCAAGGCCCTCTTCGCTA



GCGACAGTAAAACAACAACAGTTATCGATTACGCCAATGACGATAATCTTATAGACGTGATCTTGGA



ACAAGACAAAGCCTCTTTTTTTAAGGAATTGTTGTGGTTGCTGAAACTTACAATGACCCTTAGGCACA



GCAAGATCAAATCAGAGGATGACTTCATCCTCAGCCCGGTGAAGAATGAACAGGGAGAGTTCTACGA



TTCACGGAAGGCTGGAGAGGTGTGGCCCAAGGATGCCGACGCGAACGGGGCCTACCACATAGCTCTA



AAAGGTCTGTGGAACCTGCAACAAATCAATCAATGGGAGAAAGGTAAGACACTGAACCTGGCCATCA



AAAATCAAGATTGGTTCTCATTCATCCAGGAAAAGCCTTATCAAGAGTGA





SEQ ID NO: 151
ATGCATACGGGAGGCCTTTTATCAATGGACGCAAAAGAGTTCACCGGGCAGTATCCATTATCTAAGAC



ACTCCGCTTCGAGCTGAGGCCCATTGGCAGGACCTGGGACAACCTGGAGGCGTCGGGCTACCTGGCT



GAGGACAGACATCGCGCAGAATGCTATCCGAGAGCTAAGGAGCTTTTGGACGACAATCATCGCGCGT



TCCTTAACCGGGTGCTCCCACAGATCGATATGGACTGGCACCCGATCGCTGAGGCTTTTTGCAAGGTC



CATAAGAACCCTGGGAACAAAGAGCTCGCCCAGGACTACAACTTGCAGCTGAGCAAGCGACGGAAA



GAGATTTCTGCCTACCTTCAAGACGCCGATGGCTACAAAGGGCTCTTCGCAAAGCCCGCATTGGATGA



GGCCATGAAAATCGCCAAGGAGAACGGGAATGAAAGTGACATCGAAGTTCTCGAAGCGTTTAACGGA



TTTAGCGTGTACTTTACCGGCTATCATGAGTCAAGGGAGAATATTTATAGCGATGAGGACATGGTCTC



TGTGGCCTACCGGATTACCGAGGATAATTTCCCGAGGTTTGTTTCAAATGCACTAATATTCGACAAGT



TAAATGAGAGCCACCCAGACATCATCTCGGAGGTCAGCGGCAACCTCGGAGTTGACGATATTGGCAA



ATACTTCGACGTGAGCAACTATAACAACTTCCTCTCACAGGCTGGCATCGACGACTATAATCATATTA



TAGGCGGCCACACTACTGAGGATGGTCTCATTCAGGCATTCAATGTAGTCTTGAATCTTAGGCACCAG



AAGGACCCTGGGTTTGAAAAGATACAGTTCAAGCAGCTGTATAAGCAGATATTATCCGTGCGAACAT



CTAAAAGTTACATCCCCAAACAGTTTGATAACTCAAAGGAGATGGTGGATTGCATATGCGATTATGTG



TCAAAAATTGAAAAGAGCGAGACTGTGGAGCGGGCTCTGAAGCTCGTCAGGAACATTAGCTCCTTTG



ACCTTAGAGGAATTTTCGTCAATAAAAAGAATCTGAGGATCCTGAGCAATAAGCTAATAGGAGATTG



GGACGCCATAGAGACAGCATTGATGCATTCCAGCTCAAGCGAGAATGATAAGAAGTCTGTCTACGAT



AGCGCTGAAGCCTTCACGCTGGACGATATCTTCTCTTCCGTGAAAAAATTTAGTGATGCGTCCGCAGA



AGATATCGGGAATCGAGCCGAAGATATCTGCAGGGTAATTTCAGAGACCGCCCCTTTCATCAATGACC



TGCGCGCCGTGGACCTGGATAGCCTGAATGACGATGGTTACGAAGCTGCAGTTTCTAAGATCAGGGA



GTCTCTGGAGCCATATATGGACTTGTTTCACGAACTTGAGATCTTTAGCGTGGGCGACGAGTTCCCGA



AATGCGCAGCTTTCTATAGCGAGTTAGAGGAGGTCAGCGAGCAATTAATCGAGATCATACCCCTGTTT



AATAAGGCACGGAGCTTTTGTACTCGCAAGCGCTACAGCACCGACAAGATTAAAGTTAATCTGAAAT



TTCCAACTCTCGCAGACGGGTGGGACCTAAACAAGGAACGCGATAATAAAGCCGCCATCCTTAGAAA



GGACGGAAAGTACTATCTTGCCATCCTAGATATGAAAAAAGATCTGAGTTCCATTCGTACTAGCGATG



AAGACGAATCTTCTTTCGAAAAAATGGAGTATAAGCTGCTCCCCTCGCCAGTCAAGATGCTACCCAAG



ATCTTTGTGAAGAGCAAAGCAGCCAAGGAAAAGTACGGGCTGACGGACAGGATGCTGGAGTGCTACG



ATAAGGGAATGCATAAATCAGGGTCAGCTTTTGACTTGGGCTTTTGCCATGAGCTAATCGATTACTAC



AAGCGCTGTATCGCCGAGTATCCAGGATGGGACGTTTTCGACTTTAAATTTCGGGAGACTTCTGATTA



TGGTTCAATGAAGGAGTTCAACGAAGATGTCGCTGGTGCCGGTTACTACATGAGCCTTCGCAAGATTC



CTTGTTCCGAAGTCTACCGGCTACTGGACGAGAAATCTATATATTTGTTCCAGATATATAACAAGGAC



TACAGTGAGAATGCACATGGGAATAAGAATATGCATACTATGTATTGGGAAGGTCTCTTTTCACCCCA



AAATTTGGAGTCACCCGTGTTCAAACTTAGCGGTGGCGCAGAGCTGTTCTTTAGGAAATCCAGTATAC



CCAATGACGCCAAGACAGTCCACCCAAAGGGTAGCGTCCTGGTGCCCAGAAACGATGTGAACGGCAG



GAGAATCCCTGACAGCATTTACCGAGAACTTACCAGGTACTTCAACCGCGGCGACTGTAGAATCTCTG



ATGAGGCAAAGTCTTATCTGGATAAGGTGAAGACTAAGAAGGCAGATCATGACATTGTGAAAGACCG



CCGCTTTACTGTCGACAAAATGATGTTTCACGTGCCTATCGCAATGAATTTTAAGGCAATCTCAAAAC



CGAATCTGAACAAGAAGGTGATAGATGGCATTATCGATGACCAGGACCTCAAGATCATCGGAATCGA



CAGAGGTGAGCGAAACCTGATATACGTCACAATGGTAGATCGGAAGGGTAATATTCTGTACCAGGAT



TCACTAAACATCCTCAATGGATATGACTATCGAAAAGCTCTCGATGTCAGGGAATACGACAACAAGG



AGGCGCGACGGAATTGGACAAAGGTGGAAGGCATACGGAAGATGAAGGAAGGCTATCTGTCACTAG



CTGTCTCCAAATTGGCTGATATGATTATAGAGAACAACGCCATTATCGTGATGGAAGATCTCAACCAT



GGATTCAAGGCAGGAAGAAGTAAAATTGAGAAGCAGGTGTATCAGAAGTTCGAAAGCATGCTTATTA



ATAAGTTGGGTTATATGGTCTTAAAGGACAAGTCTATCGATCAGAGCGGCGGCGCACTCCATGGGTAT



CAGCTGGCTAACCATGTCACCACACTAGCATCCGTAGGCAAACAGTGTGGCGTGATTTTCTACATTCC



TGCTGCGTTCACTTCTAAGATCGATCCTACCACGGGATTCGCAGACCTGTTCGCACTGAGCAATGTTA



AAAACGTGGCCTCCATGAGGGAGTTCTTTAGCAAAATGAAAAGCGTGATTTATGACAAGGCCGAGGG



CAAGTTCGCTTTCACATTTGACTACCTGGACTACAATGTGAAATCAGAGTGCGGGAGAACCCTGTGGA



CCGTATACACGGTAGGGGAAAGATTCACTTACAGTCGAGTTAATCGGGAGTATGTCCGTAAAGTGCC



AACTGACATCATCTACGATGCCCTTCAGAAGGCTGGCATAAGTGTTGAGGGGGATCTAAGGGACAGG



ATCGCTGAATCGGATGGCGATACTCTCAAATCAATCTTCTACGCCTTCAAGTATGCCCTCGACATGAG



GGTAGAGAACCGGGAGGAGGACTATATACAGTCTCCCGTGAAGAATGCGTCGGGAGAGTTCTTCTGC



TCAAAAAACGCCGGGAAATCTTTGCCGCAGGATTCTGATGCAAATGGGGCTTATAACATTGCTCTCAA



AGGCATCCTGCAGCTGCGCATGCTATCTGAACAATATGACCCAAACGCTGAAAGCATTAGATTGCCAT



TGATCACCAATAAGGCTTGGCTGACTTTCATGCAGAGCGGTATGAAGACATGGAAAAACTAA





SEQ ID NO: 152
ATGGATTCCCTTAAGGACTTCACAAATCTTTACCCCGTGAGTAAAACCCTGAGATTTGAACTCAAGCC



CGTGGGAAAGACTCTCGAGAATATCGAGAAGGCCGGGATTTTGAAGGAAGACGAGCATCGGGCGGA



AAGTTACAGACGGGTGAAGAAGATTATAGATACTTATCACAAGGTCTTTATAGACAGCTCTTTAGAGA



ACATGGCAAAGATGGGCATCGAGAACGAAATCAAGGCCATGCTGCAGTCCTTCTGCGAGCTGTATAA



AAAGGATCATCGGACCGAAGGCGAAGACAAGGCGCTGGATAAGATCAGGGCAGTGCTGCGCGGCCT



CATTGTGGGTGCCTTCACTGGGGTGTGCGGGCGGAGAGAGAACACTGTGCAGAATGAGAAATACGAG



AGTTTGTTCAAAGAGAAACTCATCAAGGAAATCCTGCCCGACTTCGTCTTAAGCACAGAAGCCGAATC



TCTCCCATTTTCTGTCGAGGAGGCCACGCGTTCCCTTAAAGAGTTCGACAGTTTCACTTCATACTTTGC



CGGATTTTATGAAAACCGTAAAAATATATACTCCACTAAACCACAGTCAACTGCAATAGCTTACAGGT



TAATCCACGAAAACCTGCCAAAATTCATCGACAATATACTCGTCTTTCAAAAAATCAAGGAACCAATC



GCGAAGGAACTTGAACACATCCGGGCTGACTTTAGTGCGGGAGGATACATCAAAAAAGACGAGCGCC



TGGAGGATATATTTTCACTAAATTATTATATTCATGTACTGAGCCAGGCTGGCATAGAAAAGTACAAC



GCTCTAATTGGGAAAATCGTGACAGAAGGTGACGGGGAAATGAAAGGGCTAAACGAACATATTAACT



TATATAACCAACAGCGGGGTCGAGAAGATCGTCTGCCCCTGTTCAGACCTCTGTATAAGCAAATACTC



TCCGACAGAGAGCAGCTATCATATCTGCCCGAGTCCTTTGAGAAAGATGAAGAGCTGCTCCGGGCGC



TCAAGGAGTTCTATGATCATATAGCCGAGGACATTTTGGGCAGAACTCAGCAACTCATGACGTCTATT



TCTGAATATGATCTGTCTCGTATCTATGTCAGGAATGATAGCCAGCTGACCGATATATCCAAGAAGAT



GCTGGGGGACTGGAACGCCATTTATATGGCGAGGGAGCGAGCATACGATCACGAGCAGGCACCCAAG



AGAATCACAGCCAAATATGAGAGAGACCGCATTAAGGCGCTGAAGGGCGAAGAAAGTATCAGTCTG



GCCAATCTGAACTCCTGCATAGCTTTCCTTGATAACGTGAGGGATTGCAGAGTTGATACTTACCTGAG



TACCCTGGGCCAGAAGGAAGGGCCTCACGGCCTCTCTAATCTAGTGGAGAATGTATTTGCCTCCTACC



ACGAAGCTGAGCAGCTGCTGTCATTTCCGTACCCAGAGGAAAATAATTTAATACAGGATAAGGACAA



CGTAGTGCTTATCAAAAATCTACTGGATAACATTTCCGACCTCCAGCGCTTTCTCAAACCACTTTGGG



GGATGGGCGACGAGCCTGATAAGGATGAGCGCTTTTACGGCGAGTACAACTACATCAGGGGCGCCTT



GGACCAGGTGATTCCCCTCTATAATAAAGTCAGGAATTACCTGACCCGAAAGCCATACAGTACAAGA



AAGGTGAAATTAAATTTCGGCAATAGTCAGCTGCTGTCTGGTTGGGACCGAAATAAGGAGAAAGACA



ACAGCTGCGTAATTCTCAGAAAAGGACAGAACTTTTATTTGGCCATCATGAATAACAGACACAAGAG



ATCTTTCGAGAACAAAGTGCTCCCTGAGTATAAGGAGGGGGAACCCTACTTCGAGAAGATGGACTAT



AAATTCCTTCCTGATCCAAATAAAATGCTGCCTAAAGTATTTCTGTCAAAAAAAGGTATAGAAATCTA



CAAACCTTCACCTAAGCTACTTGAACAGTATGGCCACGGCACCCATAAAAAAGGGGACACGTTCAGC



ATGGACGACCTACACGAACTGATTGACTTCTTTAAGCACAGCATAGAAGCTCATGAGGACTGGAAAC



AGTTCGGATTCAAATTCTCAGATACCGCGACCTACGAAAACGTGTCTAGTTTTTACCGGGAAGTCGAG



GACCAGGGCTACAAGCTCAGCTTCAGAAAAGTTAGCGAATCTTACGTCTACTCCCTTATAGATCAAGG



TAAGCTGTATCTCTTTCAAATCTACAACAAGGACTTTTCCCCATGTAGCAAGGGCACCCCCAATCTGC



ACACTCTCTACTGGCGGATGCTGTTCGACGAGCGTAACCTGGCAGACGTGATCTACAAATTAGATGGT



AAAGCTGAGATCTTCTTTCGTGAAAAGAGCCTAAAGAACGATCACCCCACTCACCCCGCCGGAAAGC



CCATTAAGAAGAAAAGTAGGCAGAAGAAAGGAGAAGAATCGCTATTTGAGTACGACCTCGTCAAGG



ATCGGCATTATACAATGGATAAGTTCCAGTTCCATGTGCCAATAACTATGAATTTCAAGTGCAGTGCT



GGCAGTAAGGTGAATGACATGGTAAACGCTCATATCCGGGAGGCAAAGGACATGCATGTTATTGGAA



TTGATAGGGGTGAGCGTAATCTCCTCTACATCTGTGTTATTGACTCCCGCGGCACAATCCTCGATCAG



ATTTCCTTGAATACAATTAATGATATAGACTACCATGACTTGCTTGAGTCTCGCGACAAAGATAGACA



GCAGGAGAGAAGAAATTGGCAGACCATCGAAGGCATCAAGGAACTCAAGCAAGGCTACCTTTCTCAG



GCAGTGCATCGAATAGCCGAGCTGATGGTGGCTTATAAAGCCGTCGTGGCACTAGAAGACCTAAATA



TGGGATTTAAACGAGGCAGGCAGAAGGTGGAATCATCCGTATACCAGCAGTTCGAAAAACAGTTGAT



AGACAAACTCAATTACCTTGTAGACAAGAAGAAGCGGCCTGAGGACATAGGGGGCCTGCTTAGAGCG



TATCAATTTACAGCCCCATTCAAGTCTTTCAAAGAAATGGGTAAACAGAACGGTTTTCTGTTTTACATC



CCAGCGTGGAACACCAGCAATATAGATCCAACCACTGGCTTCGTCAATCTGTTTCATGCTCAGTATGA



AAATGTGGACAAGGCCAAATCCTTCTTTCAGAAATTTGACAGCATCTCCTATAACCCAAAGAAAGACT



GGTTTGAATTCGCCTTTGACTATAAGAATTTCACTAAGAAGGCCGAGGGATCAAGAAGCATGTGGAT



ATTGTGCACGCATGGCTCACGTATAAAGAACTTTAGAAACTCGCAAAAAAACGGGCAGTGGGACTCA



GAAGAATTCGCACTCACCGAGGCTTTCAAATCCCTCTTCGTCCGGTATGAGATCGATTACACCGCCGA



TCTGAAGACGGCAATCGTCGACGAGAAACAGAAAGACTTCTTTGTAGATCTACTTAAGCTCTTTAAGC



TAACCGTTCAGATGCGAAACAGTTGGAAAGAAAAGGATCTCGACTATCTCATTAGTCCAGTGGCTGG



CGCGGATGGTAGATTTTTCGATACCCGGGAAGGTAACAAGTCCCTTCCCAAAGACGCCGACGCGAAT



GGTGCCTACAATATTGCACTAAAGGGGCTCTGGGCGCTGCGGCAAATTAGACAGACATCTGAAGGGG



GCAAGCTTAAGCTGGCTATTTCTAATAAAGAGTGGTTGCAGTTTGTGCAGGAAAGGAGTTATGAGAA



GGACTAG





SEQ ID NO: 153
ATGAACAACGGCACCAACAACTTCCAGAACTTCATCGGCATATCGTCTCTGCAGAAAACACTTAGGA



ATGCCCTGATTCCAACTGAGACAACACAGCAGTTTATTGTGAAGAATGGGATCATCAAAGAGGACGA



ATTGCGCGGGGAGAATAGGCAGATCCTGAAGGACATCATGGACGATTACTACAGGGGTTTTATCTCC



GAAACGCTGAGCTCGATTGACGATATTGACTGGACGTCCCTCTTTGAGAAGATGGAAATCCAACTTAA



AAATGGCGATAATAAAGATACCCTGATAAAGGAACAAACCGAATATAGAAAGGCTATACACAAAAA



ATTCGCAAATGACGACCGCTTTAAGAACATGTTTTCTGCAAAACTGATTAGCGATATTCTGCCCGAGT



TTGTGATTCACAATAATAACTATTCCGCTTCGGAGAAGGAGGAAAAGACTCAGGTGATTAAACTGTTT



TCTCGGTTCGCCACTTCTTTCAAAGATTATTTCAAAAATCGCGCCAACTGTTTTTCCGCTGACGACATC



TCCTCCTCTTCCTGCCACCGGATCGTAAACGACAATGCCGAGATCTTTTTTAGTAACGCCCTTGTGTAT



CGGAGGATAGTGAAGAGCCTGTCCAATGATGACATAAACAAAATTTCTGGCGATATGAAGGATAGCC



TCAAAGAGATGAGCCTTGAAGAAATTTACTCCTACGAGAAGTATGGGGAGTTCATCACCCAGGAGGG



GATTTCCTTCTATAATGACATCTGTGGCAAGGTGAACAGCTTCATGAACCTGTACTGCCAGAAGAATA



AGGAAAACAAAAATCTGTACAAGCTTCAGAAGTTACATAAGCAGATCCTGTGTATCGCGGATACCTC



ATATGAGGTTCCTTATAAGTTCGAGAGTGATGAAGAAGTGTACCAGTCTGTAAATGGATTCTTAGACA



ATATTTCGTCCAAACATATAGTGGAGAGACTGAGAAAGATCGGGGACAATTACAATGGGTACAATCT



CGACAAGATTTATATCGTGTCGAAGTTTTACGAATCTGTGAGCCAGAAAACATACAGGGATTGGGAA



ACCATTAATACCGCGCTTGAAATTCACTACAATAATATTCTGCCTGGCAACGGAAAAAGCAAGGCCG



ATAAGGTAAAAAAGGCAGTCAAAAATGACCTTCAGAAAAGTATCACCGAAATCAATGAGTTGGTGAG



CAACTACAAATTGTGTTCAGACGATAATATTAAAGCGGAAACGTACATACATGAAATTAGCCATATTC



TGAATAACTTTGAGGCGCAGGAACTTAAGTACAACCCTGAAATTCATCTCGTCGAAAGCGAATTGAA



GGCCTCTGAATTGAAAAACGTTCTTGACGTGATAATGAACGCTTTCCATTGGTGCTCTGTGTTTATGAC



TGAAGAGCTGGTTGATAAGGACAACAACTTTTATGCTGAACTTGAGGAAATCTACGACGAGATCTAC



CCTGTGATTAGCTTGTATAACCTCGTCAGAAACTACGTTACCCAGAAGCCGTACAGCACGAAAAAAAT



AAAGCTGAACTTTGGTATTCCGACTCTCGCCGATGGATGGAGCAAGTCGAAGGAATATTCCAACAAT



GCCATCATTCTTATGCGAGACAATCTGTATTACCTCGGCATCTTTAACGCCAAAAACAAGCCGGATAA



GAAAATCATTGAAGGGAATACGAGCGAGAATAAGGGCGACTATAAGAAAATGATCTACAACTTACTG



CCAGGTCCCAATAAAATGATTCCTAAGGTGTTTCTGTCATCGAAAACAGGTGTAGAAACATATAAGCC



CAGCGCATACATCCTGGAAGGCTACAAGCAAAACAAACACATCAAAAGCAGCAAGGACTTTGATATC



ACATTCTGCCACGATCTAATCGACTACTTCAAAAATTGCATCGCCATTCACCCTGAGTGGAAGAACTT



CGGCTTTGACTTCTCCGACACCAGTACCTACGAAGACATTTCTGGATTCTACCGTGAGGTTGAGCTGC



AGGGTTATAAAATTGACTGGACATACATCAGTGAAAAAGACATCGATCTACTGCAGGAGAAGGGGCA



GCTCTATCTCTTCCAGATTTATAATAAGGATTTCAGCAAGAAGTCCACTGGAAACGACAATCTGCATA



CAATGTATCTTAAGAACTTGTTTAGCGAAGAGAATTTGAAAGATATCGTTCTAAAGTTAAACGGGGAA



GCCGAGATTTTCTTTCGAAAGTCTTCCATTAAGAATCCAATTATTCACAAGAAGGGCAGTATCCTGGT



CAACAGAACCTATGAGGCCGAGGAAAAGGACCAGTTCGGTAATATACAAATTGTGCGCAAGAACATC



CCCGAGAACATTTACCAGGAGCTCTATAAATACTTCAACGACAAAAGCGATAAGGAGCTTTCCGACG



AGGCTGCCAAGCTGAAAAACGTGGTGGGACACCATGAAGCAGCCACCAACATCGTCAAAGATTATCG



TTATACATATGACAAATATTTTCTGCACATGCCTATTACAATAAACTTTAAGGCAAACAAGACCGGGT



TCATCAATGACCGGATACTCCAGTACATCGCAAAAGAGAAGGACCTGCATGTGATCGGCATCGACCG



CGGTGAAAGAAATCTCATTTACGTCAGCGTTATCGACACTTGTGGAAACATTGTGGAGCAGAAGTCCT



TCAACATTGTTAACGGCTATGACTATCAGATCAAGCTCAAACAGCAGGAAGGTGCTCGTCAGATTGCG



AGGAAAGAATGGAAAGAGATCGGCAAGATCAAGGAGATCAAAGAAGGGTATCTGAGCTTGGTCATT



CACGAGATCTCCAAAATGGTCATCAAGTACAACGCTATTATCGCGATGGAAGACCTCTCTTACGGCTT



TAAGAAGGGGCGCTTTAAAGTGGAGCGCCAGGTCTATCAGAAGTTCGAGACTATGCTTATCAATAAG



CTGAATTACTTGGTCTTTAAGGATATCAGTATCACCGAGAACGGAGGACTGCTGAAAGGTTACCAGCT



CACATATATTCCCGATAAGCTCAAGAATGTGGGCCACCAATGCGGTTGTATTTTTTACGTTCCAGCTG



CCTACACATCTAAGATCGATCCTACCACCGGATTCGTCAATATATTTAAATTTAAAGATCTAACCGTT



GATGCCAAGCGTGAGTTTATTAAGAAATTTGATTCAATCAGGTACGACAGCGAAAAGAACCTCTTCTG



TTTCACTTTCGACTACAACAACTTCATCACACAAAATACTGTGATGAGCAAGTCATCATGGAGCGTTT



ATACTTATGGTGTAAGGATAAAAAGGCGCTTTGTTAATGGAAGGTTTTCCAATGAAAGCGATACAATA



GACATCACAAAAGACATGGAGAAGACACTGGAGATGACAGATATTAATTGGAGGGACGGGCATGAC



CTTAGACAGGACATCATCGACTACGAAATCGTCCAACACATTTTTGAGATATTCAGACTCACTGTCCA



GATGCGAAACAGCCTGTCGGAACTCGAAGACCGGGACTACGATAGACTGATCTCCCCGGTGTTAAAC



GAAAATAATATTTTCTACGATTCTGCTAAGGCAGGAGACGCTCTTCCTAAAGATGCGGACGCCAATGG



CGCTTACTGTATAGCGTTGAAGGGATTGTATGAGATTAAACAGATCACTGAGAATTGGAAAGAAGAC



GGTAAATTCTCCAGAGACAAGCTGAAAATCTCCAACAAAGACTGGTTTGATTTTATTCAAAATAAGCG



CTACCTGTAA





SEQ ID NO: 154
ATGACAAACAAATTTACTAATCAGTACAGCCTGTCAAAGACCCTCCGCTTCGAACTGATTCCACAAGG



GAAGACCCTTGAATTCATCCAGGAAAAGGGTTTATTATCCCAGGATAAACAACGCGCAGAAAGCTAT



CAAGAGATGAAGAAGACGATCGATAAATTTCATAAGTATTTCATAGATTTAGCCCTGAGCAACGCTA



AATTGACCCACCTGGAAACCTATTTGGAGCTGTACAACAAGTCAGCCGAGACAAAGAAAGAGCAGAA



GTTTAAGGACGACCTGAAAAAAGTACAGGACAATTTGCGAAAAGAGATCGTCAAGTCTTTTTCCGAC



GGAGACGCCAAGTCAATATTTGCCATCCTGGACAAAAAGGAACTCATCACTGTGGAGTTGGAGAAGT



GGTTTGAGAATAATGAGCAGAAGGACATCTATTTTGACGAAAAGTTCAAGACATTTACTACTTACTTC



ACCGGATTTCACCAAAACCGGAAGAACATGTACTCTGTTGAGCCGAACTCAACCGCCATCGCCTACCG



CCTTATTCACGAAAATCTGCCAAAGTTTCTCGAGAATGCTAAAGCCTTTGAGAAAATTAAGCAGGTCG



AGTCGCTCCAGGTGAACTTTCGAGAGCTGATGGGTGAATTCGGGGACGAGGGCCTGATTTTCGTGAAT



GAACTCGAAGAGATGTTTCAGATCAACTACTATAATGATGTACTCTCACAGAACGGGATCACTATCTA



CAACAGCATTATCTCTGGATTCACTAAGAACGATATCAAGTATAAAGGGCTGAATGAATACATCAAC



AATTATAATCAGACTAAGGACAAAAAGGACAGGCTGCCTAAATTGAAACAGCTGTATAAGCAGATCC



TCAGTGATAGAATTAGCTTGTCATTTCTCCCAGATGCCTTCACTGACGGAAAGCAGGTGCTTAAGGCG



ATATTCGATTTCTATAAGATCAACCTCCTCTCTTATACAATCGAGGGCCAGGAGGAGTCACAGAACCT



CCTGCTCCTGATTCGACAAACTATTGAAAATCTGTCCTCTTTCGATACGCAGAAGATATACCTGAAAA



ATGACACCCATCTCACTACAATATCCCAACAGGTATTCGGAGATTTCTCCGTCTTCAGTACAGCCCTG



AATTACTGGTACGAGACAAAGGTGAACCCTAAGTTCGAAACAGAGTACAGCAAGGCGAACGAAAAG



AAGAGGGAGATCCTGGACAAAGCCAAAGCCGTTTTCACCAAGCAAGATTACTTTAGCATCGCATTTCT



GCAGGAAGTCCTGTCTGAGTACATACTGACACTCGATCACACAAGCGACATAGTTAAGAAGCACTCTT



CCAATTGTATCGCGGACTACTTCAAAAATCATTTTGTCGCGAAAAAGGAGAACGAGACAGATAAGAC



CTTCGATTTTATCGCGAATATTACCGCAAAGTATCAATGCATTCAGGGTATCTTGGAGAACGCCGACC



AGTACGAAGACGAGCTTAAACAGGATCAGAAGCTCATCGACAACCTAAAGTTCTTTTTGGACGCTAT



ACTGGAACTCCTTCATTTTATTAAGCCACTACATCTGAAGAGTGAGTCTATCACTGAGAAGGACACTG



CTTTTTACGACGTTTTCGAGAATTACTACGAAGCACTGTCTCTGCTAACCCCTCTGTATAACATGGTGA



GAAACTATGTGACACAGAAACCTTATAGTACCGAGAAGATTAAGTTGAACTTCGAGAACGCACAATT



GCTGAATGGGTGGGATGCAAACAAAGAGGGTGATTACCTCACAACAATCCTCAAGAAAGATGGCAAT



TACTTCCTGGCCATTATGGATAAAAAACATAACAAGGCATTTCAGAAATTTCCCGAGGGGAAGGAAA



ATTATGAAAAGATGGTATACAAGTTGCTGCCCGGGGTGAACAAAATGCTCCCGAAGGTGTTTTTCTCG



AATAAGAATATCGCGTACTTTAACCCGTCCAAGGAACTGTTGGAAAATTATAAAAAGGAAACACACA



AGAAGGGGGACACTTTTAATTTGGAGCACTGCCACACACTCATTGACTTCTTTAAAGATAGTCTCAAC



AAACATGAGGATTGGAAATATTTTGACTTTCAGTTTAGCGAGACCAAGTCTTATCAGGATCTGTCGGG



ATTTTATAGGGAAGTTGAGCACCAGGGTTACAAGATAAATTTCAAGAACATCGATAGCGAGTACATT



GACGGACTGGTGAACGAAGGGAAGCTGTTCCTGTTTCAGATTTACAGCAAAGATTTCTCTCCTTTCTC



AAAAGGCAAGCCGAACATGCATACCCTGTATTGGAAGGCCCTGTTCGAGGAGCAAAACCTTCAGAAT



GTGATTTACAAGCTGAACGGTCAGGCCGAGATTTTTTTTAGGAAGGCCTCTATCAAGCCCAAAAACAT



CATTCTGCACAAGAAAAAGATAAAGATCGCCAAAAAACACTTCATTGATAAAAAGACAAAGACTTCT



GAGATCGTACCTGTTCAGACAATCAAGAATCTCAACATGTATTATCAGGGGAAGATTAGCGAGAAAG



AGCTGACACAGGACGATTTGAGGTACATCGACAACTTCTCTATCTTTAACGAGAAGAACAAGACAAT



CGATATCATCAAGGACAAGCGGTTTACCGTCGATAAATTCCAGTTCCATGTGCCTATCACGATGAATT



TCAAGGCCACCGGTGGGAGTTATATCAACCAGACTGTGCTGGAGTATCTGCAGAACAACCCCGAAGT



AAAAATTATTGGCCTGGACAGAGGAGAGCGGCATCTGGTGTACTTGACCCTCATCGATCAGCAGGGA



AATATCCTGAAACAAGAATCTCTGAATACTATTACGGACTCCAAAATCAGCACACCTTACCACAAGCT



GCTTGATAATAAAGAGAATGAGAGGGACTTGGCCCGCAAAAATTGGGGCACCGTCGAGAATATTAAG



GAATTGAAAGAAGGATACATCTCACAGGTGGTTCACAAAATCGCAACCCTGATGTTAGAAGAGAACG



CTATTGTGGTGATGGAGGACTTAAACTTCGGATTTAAAAGAGGAAGATTTAAAGTCGAGAAACAGAT



TTATCAGAAACTGGAAAAAATGCTCATTGACAAATTAAATTACCTGGTGCTGAAAGATAAACAGCCA



CAGGAGCTGGGTGGCCTGTATAATGCTCTGCAGCTGACCAACAAGTTCGAGTCGTTTCAGAAAATGG



GCAAGCAGTCAGGCTTCCTTTTTTACGTGCCCGCTTGGAACACCTCAAAAATCGACCCTACAACAGGC



TTTGTGAATTATTTCTATACCAAGTATGAAAACGTGGACAAGGCAAAGGCCTTTTTCGAGAAGTTTGA



AGCAATCAGGTTCAATGCCGAGAAAAAATACTTTGAGTTCGAGGTCAAAAAATATAGCGACTTCAAC



CCTAAGGCCGAAGGCACGCAACAAGCCTGGACAATATGCACGTATGGGGAGAGAATTGAGACTAAG



CGGCAGAAGGATCAGAATAACAAATTCGTGAGCACACCGATTAACCTGACAGAGAAGATAGAGGAC



TTCCTCGGCAAGAATCAGATCGTGTACGGCGACGGCAATTGCATCAAGTCACAAATTGCATCTAAAG



ATGACAAAGCATTCTTCGAAACACTGCTGTATTGGTTCAAGATGACACTCCAGATGCGAAATAGCGA



AACAAGAACAGATATTGACTACCTCATCAGCCCTGTGATGAATGATAACGGCACGTTTTACAATTCCC



GGGACTATGAAAAATTAGAGAACCCGACACTGCCAAAAGACGCCGACGCAAATGGTGCATATCACAT



CGCAAAGAAAGGTTTGATGCTGTTGAACAAAATTGATCAGGCTGATCTGACAAAAAAGGTCGATCTG



AGTATCAGTAACCGCGACTGGTTGCAGTTTGTCCAGAAGAACAAATAA





SEQ ID NO: 155
ATGGAACAAGAGTACTATCTGGGCCTGGACATGGGCACCGGGAGTGTCGGATGGGCAGTCACCGACT



CAGAGTACCACGTCCTCAGAAAGCACGGTAAGGCACTTTGGGGAGTGCGACTCTTCGAGTCCGCTAG



TACTGCTGAAGAGAGGAGGATGTTTCGAACTTCCAGGCGCAGGCTGGATCGGCGAAACTGGAGAATA



GAGATTCTCCAGGAGATATTTGCTGAAGAGATTTCAAAGAAGGATCCTGGTTTTTTCCTGCGCATGAA



AGAATCTAAGTATTACCCCGAAGATAAACGCGACATCAACGGCAATTGTCCTGAACTGCCCTATGCTC



TGTTTGTCGACGACGATTTCACCGACAAAGATTACCACAAGAAATTCCCCACCATATACCACCTGAGA



AAGATGTTGATGAACACCGAGGAGACACCCGACATACGTCTGGTTTACCTGGCTATCCATCATATGAT



GAAGCACCGCGGGCATTTCCTGCTGTCTGGAGACATCAATGAGATAAAGGAATTTGGTACTACGTTCT



CCAAGTTGTTAGAAAACATTAAGAATGAAGAGTTGGACTGGAATCTTGAACTGGGAAAGGAAGAGTA



TGCAGTTGTAGAGTCGATTTTGAAAGATAACATGTTAAACCGGTCAACTAAGAAAACCAGGTTAATTA



AGGCACTAAAGGCCAAATCGATATGCGAGAAGGCTGTGCTAAATCTGCTGGCTGGAGGCACCGTGAA



ACTGTCTGATATTTTCGGCCTGGAAGAGCTCAATGAAACCGAGCGGCCTAAAATTTCTTTCGCCGATA



ACGGATACGATGACTATATTGGGGAGGTGGAAAACGAGCTCGGAGAACAATTCTACATTATTGAAAC



CGCTAAGGCAGTCTATGACTGGGCCGTGCTCGTCGAGATTTTAGGCAAGTACACCAGCATTAGCGAA



GCAAAGGTGGCTACCTATGAAAAGCACAAATCTGACCTCCAGTTTCTGAAAAAGATTGTGCGCAAAT



ACTTAACAAAAGAAGAGTACAAGGACATCTTTGTGAGCACATCAGATAAGCTCAAGAATTACTCAGC



ATACATTGGAATGACAAAGATTAACGGGAAGAAGGTGGATCTCCAAAGCAAACGTTGTTCAAAGGAG



GAGTTTTACGATTTCATAAAGAAGAACGTGCTGAAGAAACTGGAGGGACAACCGGAGTACGAGTATT



TAAAGGAGGAGCTCGAGCGAGAAACTTTCCTGCCCAAGCAAGTGAACAGAGACAATGGTGTCATTCC



TTACCAGATTCACTTATATGAGCTGAAGAAAATCCTGGGGAACTTGAGAGACAAGATAGACCTCATC



AAGGAAAATGAAGATAAGTTGGTCCAGTTGTTCGAATTCAGAATCCCATATTACGTCGGCCCGCTCAA



TAAGATCGACGACGGCAAGGAAGGCAAATTCACTTGGGCGGTGCGAAAAAGCAACGAAAAAATATA



CCCATGGAACTTTGAGAACGTCGTTGACATCGAGGCCAGCGCCGAGAAATTTATAAGACGCATGACT



AATAAGTGTACTTACCTCATGGGCGAGGATGTTCTGCCCAAGGACAGCCTGCTGTATTCCAAGTACAT



GGTGCTTAACGAGCTGAATAATGTAAAGTTAGATGGTGAGAAGCTCAGCGTGGAGCTTAAACAGAGG



CTGTACACTGATGTGTTTTGCAAGTATCGGAAAGTTACCGTTAAGAAGATAAAGAATTACCTGAAATG



CGAAGGGATCATTTCCGGCAACGTGGAAATTACCGGAATCGACGGCGATTTTAAGGCGTCGTTGACC



GCTTATCATGATTTCAAGGAGATTTTAACCGGCACGGAGCTCGCGAAGAAAGACAAGGAGAACATAA



TCACGAATATAGTTCTGTTTGGGGACGATAAAAAACTTCTTAAAAAACGACTCAATCGACTGTATCCG



CAGATTACCCCCAACCAGCTGAAGAAGATTTGCGCTCTGAGCTATACCGGGTGGGGCCGGTTCTCTAA



GAAATTCCTCGAGGAGATCACAGCACCAGACCCAGAGACTGGTGAGGTGTGGAATATTATTACAGCT



CTGTGGGAATCCAATAATAACCTTATGCAATTGTTGAGCAATGAATATAGGTTCATGGAGGAAGTGG



AAACCTACAATATGGGCAAGCAGACAAAGACCCTATCTTACGAGACCGTTGAGAATATGTATGTCTC



CCCTTCAGTGAAACGGCAAATCTGGCAAACTTTGAAGATCGTGAAGGAGCTCGAAAAGGTGATGAAA



GAGAGCCCGAAGAGGGTTTTTATTGAAATGGCCAGAGAGAAACAGGAGAGCAAGAGAACAGAGTCT



AGGAAGAAGCAGCTAATCGATTTGTATAAAGCCTGCAAGAACGAGGAAAAAGACTGGGTCAAGGAG



CTAGGCGATCAGGAAGAACAGAAGTTGCGCTCTGATAAGCTGTACTTATATTATACCCAGAAAGGAC



GGTGCATGTACTCAGGTGAGGTCATTGAGCTGAAAGATCTGTGGGACAATACTAAGTATGATATTGAT



CACATCTACCCTCAGTCAAAAACTATGGACGACTCCCTCAACAACAGGGTGTTGGTTAAGAAGAAAT



ACAATGCTACAAAGTCCGATAAATACCCTCTTAACGAAAACATCCGGCACGAAAGAAAGGGCTTCTG



GAAGTCCCTGCTGGATGGGGGTTTTATCAGTAAAGAAAAGTATGAGAGGCTGATCCGAAATACCGAG



CTCTCCCCCGAGGAACTGGCTGGCTTTATCGAAAGGCAGATCGTAGAGACTAGGCAATCTACAAAGG



CAGTCGCTGAGATCCTGAAGCAAGTGTTTCCTGAGTCAGAAATCGTGTACGTCAAAGCTGGCACAGTG



TCACGGTTCCGAAAGGACTTTGAGTTGTTAAAAGTTCGGGAGGTGAATGACCTGCACCACGCTAAAG



ACGCCTATCTGAATATCGTTGTGGGGAACTCCTATTATGTTAAGTTTACTAAGAATGCGTCCTGGTTTA



TTAAGGAGAACCCGGGGCGCACCTATAACCTGAAGAAGATGTTCACCTCCGGCTGGAACATAGAACG



GAACGGAGAAGTCGCGTGGGAGGTGGGTAAGAAAGGGACCATTGTGACCGTCAAACAGATTATGAA



CAAAAACAACATATTGGTAACTCGCCAGGTGCATGAGGCCAAAGGGGGCCTCTTTGATCAGCAGATT



ATGAAAAAGGGCAAAGGACAGATCGCAATCAAGGAAACCGACGAGCGCCTGGCATCCATTGAGAAG



TACGGAGGCTACAACAAGGCGGCAGGTGCGTACTTCATGCTCGTCGAGTCCAAAGATAAGAAAGGCA



AAACTATTAGAACAATCGAGTTCATCCCTCTATATTTGAAAAATAAGATCGAAAGTGACGAAAGCAT



CGCCCTTAACTTCTTGGAGAAGGGCCGGGGCTTAAAGGAACCAAAGATTCTGCTCAAGAAGATCAAG



ATCGACACACTCTTCGATGTGGATGGTTTTAAGATGTGGCTGTCAGGCAGGACAGGGGATCGCTTGCT



GTTCAAATGCGCAAATCAGTTGATTCTGGACGAAAAGATCATTGTGACGATGAAGAAGATCGTTAAA



TTCATTCAGCGGAGACAGGAAAACAGAGAACTGAAACTCTCCGATAAGGATGGAATTGACAATGAAG



TCCTCATGGAGATTTACAATACCTTTGTGGACAAGCTTGAGAACACAGTCTATCGGATCCGACTGTCC



GAACAGGCAAAGACTCTGATCGACAAACAGAAAGAATTCGAAAGACTAAGCTTAGAGGACAAAAGT



TCAACTCTCTTTGAAATTCTCCACATCTTCCAATGTCAAAGTAGTGCAGCCAACTTGAAGATGATCGG



GGGTCCCGGCAAGGCTGGAATCTTAGTCATGAACAACAACATCTCCAAATGTAACAAAATCTCCATC



ATAAACCAGTCTCCCACCGGCATTTTCGAGAACGAAATTGATTTACTCAAG





SEQ ID NO: 156
ATGAAATCTTTCGATTCTTTCACCAACCTCTACTCCCTTAGCAAAACCCTTAAGTTTGAAATGAGGCCG



GTGGGGAATACACAGAAGATGCTTGACAATGCTGGCGTCTTTGAAAAGGACAAATTAATCCAGAAGA



AGTATGGTAAAACAAAGCCATATTTTGACCGATTGCATCGGGAATTCATTGAAGAGGCTCTTACAGGA



GTAGAATTGATCGGACTGGACGAGAACTTCCGTACCTTAGTAGACTGGCAGAAGGACAAGAAGAACA



ACGTGGCAATGAAGGCCTATGAGAACTCACTCCAGCGCCTTAGAACCGAGATCGGAAAGATCTTTAA



TCTTAAGGCGGAAGATTGGGTAAAAAATAAGTACCCGATCCTGGGACTGAAAAACAAAAACACAGAC



ATCCTGTTTGAAGAAGCCGTCTTTGGTATCTTGAAGGCCAGGTATGGAGAGGAGAAAGACACGTTTAT



AGAGGTAGAGGAGATTGATAAAACAGGCAAGAGTAAGATTAATCAGATCAGTATCTTTGATTCTTGG



AAGGGGTTCACAGGCTACTTTAAGAAGTTTTTCGAAACCAGGAAAAATTTCTATAAGAACGATGGCA



CCTCCACAGCTATCGCGACACGCATCATAGATCAGAATCTGAAACGGTTCATTGATAATCTGAGCATT



GTTGAATCCGTGCGCCAGAAGGTCGACCTAGCTGAGACTGAGAAGTCTTTCTCTATATCACTCTCCCA



GTTCTTCTCAATAGATTTTTATAATAAGTGCCTTCTGCAAGATGGCATAGACTACTATAACAAGATCAT



CGGCGGCGAAACTCTCAAAAACGGTGAAAAGCTCATTGGCCTGAATGAGCTCATCAACCAATATAGA



CAAAATAACAAGGATCAGAAAATCCCATTCTTTAAGCTGCTAGATAAACAGATCCTATCAGAAAAAA



TCCTGTTCCTCGACGAAATCAAAAACGACACCGAACTCATCGAGGCTCTCTCGCAGTTTGCCAAGACG



GCTGAGGAGAAGACGAAGATTGTGAAAAAGCTGTTTGCAGACTTTGTGGAGAACAACTCTAAATACG



ATTTGGCTCAGATTTATATCTCCCAGGAAGCATTTAACACAATCTCCAATAAGTGGACTAGCGAGACT



GAAACCTTCGCCAAATACCTGTTCGAGGCCATGAAAAGCGGCAAGCTCGCCAAATACGAGAAGAAGG



ACAATTCCTATAAGTTTCCCGATTTCATCGCATTATCTCAGATGAAGTCCGCGCTACTTAGCATTAGCC



TGGAAGGCCATTTTTGGAAGGAGAAATACTATAAGATTTCCAAATTCCAAGAAAAGACCAATTGGGA



GCAGTTCTTGGCTATTTTTCTATACGAGTTCAACTCTTTGTTCAGTGACAAGATCAACACTAAGGACGG



TGAGACCAAACAAGTGGGGTACTACCTCTTCGCCAAAGATCTTCATAACCTGATACTGTCCGAACAGA



TCGACATACCCAAGGATTCAAAGGTGACCATCAAGGATTTTGCGGATTCGGTATTGACGATCTATCAG



ATGGCGAAGTATTTCGCTGTCGAGAAAAAGCGGGCATGGCTGGCCGAATACGAGTTGGACTCCTTCT



ATACTCAACCCGATACAGGGTACCTGCAGTTTTACGATAATGCATACGAGGATATAGTCCAGGTGTAC



AATAAACTCAGGAACTACCTCACTAAGAAACCATACTCCGAAGAAAAATGGAAACTTAATTTTGAGA



ATAGTACACTGGCCAATGGATGGGACAAGAACAAGGAATCAGACAACTCCGCTGTAATTCTCCAGAA



GGGTGGCAAGTATTATCTGGGACTGATAACAAAGGGCCATAACAAGATTTTCGATGACCGTTTTCAGG



AGAAGTTTATAGTGGGCATAGAGGGTGGCAAGTATGAAAAAATAGTCTACAAGTTCTTTCCCGATCA



GGCGAAGATGTTCCCCAAAGTATGCTTCAGTGCTAAAGGCCTCGAGTTTTTCCGGCCATCTGAAGAGA



TACTCCGCATCTATAATAACGCAGAGTTTAAAAAGGGAGAGACGTACTCAATCGACTCGATGCAGAA



ACTCATTGACTTCTACAAAGATTGTCTCACAAAATACGAGGGCTGGGCTTGCTACACGTTTCGGCACT



TGAAGCCAACCGAGGAATATCAAAACAACATCGGGGAGTTCTTCCGTGACGTCGCCGAAGACGGCTA



TAGAATTGACTTTCAGGGCATAAGTGATCAGTATATTCACGAGAAGAATGAGAAAGGTGAGTTGCAT



CTTTTCGAAATCCACAATAAAGACTGGAATCTTGACAAGGCTCGCGATGGAAAATCAAAGACTACCC



AGAAGAATCTTCATACACTTTACTTCGAGTCCCTCTTTTCCAACGACAACGTCGTACAGAATTTCCCAA



TAAAACTGAACGGCCAGGCCGAAATTTTTTACAGGCCCAAAACCGAAAAAGATAAACTGGAATCCAA



GAAAGACAAGAAGGGAAATAAGGTGATAGATCACAAAAGGTATTCCGAGAACAAGATTTTTTTCCAC



GTACCTCTTACCCTGAACAGAACGAAGAACGACTCTTATAGATTCAATGCCCAGATAAACAACTTTCT



CGCAAACAACAAAGATATCAATATTATCGGCGTCGATAGAGGTGAGAAGCACTTGGTATATTATTCTG



TGATCACGCAAGCATCCGATATCTTGGAGTCCGGTTCTTTGAACGAACTGAATGGTGTCAACTACGCC



GAGAAACTCGGTAAGAAAGCTGAGAATCGGGAGCAGGCTAGAAGGGACTGGCAGGACGTTCAGGGT



ATCAAGGACCTGAAGAAGGGCTACATTTCTCAGGTGGTTCGAAAACTGGCTGATTTGGCCATTAAGCA



CAATGCAATCATCATTTTAGAAGATTTGAACATGCGGTTTAAACAAGTCAGGGGGGGGATAGAGAAA



TCAATTTACCAACAGCTGGAAAAAGCTCTGATTGATAAACTCTCTTTTTTGGTTGATAAGGGCGAAAA



GAACCCCGAGCAAGCAGGACATCTCCTTAAAGCCTATCAACTGAGCGCACCTTTCGAGACATTCCAG



AAGATGGGAAAGCAAACCGGCATCATTTTCTATACCCAGGCTTCCTATACATCCAAGTCTGATCCAGT



GACTGGGTGGAGACCCCATCTCTACCTCAAGTACTTTTCTGCCAAAAAAGCTAAGGACGACATTGCTA



AGTTCACAAAAATCGAGTTCGTGAACGACAGGTTCGAGCTGACTTATGACATAAAAGATTTCCAGCA



GGCCAAGGAGTACCCAAACAAGACAGTTTGGAAAGTGTGTTCCAATGTGGAGAGGTTTCGGTGGGAC



AAGAATCTGAATCAGAATAAAGGGGGATATACTCACTACACCAACATTACCGAGAACATCCAAGAGT



TGTTCACCAAATACGGCATCGACATTACTAAAGATCTGCTGACACAGATCTCCACCATCGATGAGAAG



CAGAACACATCTTTCTTCCGGGATTTCATCTTTTATTTTAACTTGATCTGTCAGATTAGAAATACCGAC



GACAGTGAGATAGCTAAAAAAAACGGGAAAGACGATTTCATTCTCTCTCCCGTGGAGCCGTTTTTTGA



CTCCCGCAAAGACAATGGCAATAAGCTTCCGGAAAACGGGGACGATAACGGCGCCTACAACATCGCT



CGTAAGGGAATCGTTATCCTCAATAAAATAAGCCAGTATTCCGAGAAGAACGAGAATTGTGAAAAAA



TGAAGTGGGGGGACCTTTACGTCAGCAACATCGATTGGGATAACTTTGTGACACAAGCCAATGCGAG



ACACTAG





SEQ ID NO: 157
ATGGAAAACTTCAAAAACCTCTACCCCATCAACAAGACCTTGAGGTTTGAGCTCCGGCCATATGGGA



AGACACTGGAGAACTTCAAAAAGTCCGGTCTGCTGGAAAAGGATGCTTTTAAGGCTAACTCTAGGAG



GTCTATGCAGGCCATTATCGATGAGAAATTCAAGGAGACCATAGAGGAGCGTCTGAAATATACTGAG



TTTTCCGAGTGTGACCTAGGAAATATGACCAGTAAGGACAAAAAGATCACCGACAAGGCAGCGACAA



ACCTGAAGAAACAGGTGATTTTAAGCTTTGATGATGAGATTTTCAATAACTACTTGAAGCCGGACAAA



AACATCGACGCTCTGTTCAAGAATGATCCAAGCAACCCGGTCATCTCTACTTTCAAGGGCTTCACCAC



ATACTTTGTAAATTTCTTCGAAATACGGAAACACATCTTCAAGGGAGAGTCTTCCGGTAGCATGGCTT



ACAGAATAATCGATGAGAACCTAACTACATATCTAAACAATATCGAGAAGATCAAGAAATTGCCTGA



AGAACTGAAATCTCAGCTTGAGGGAATCGATCAAATTGACAAACTGAACAACTATAACGAGTTCATC



ACCCAGTCCGGCATTACTCATTATAACGAAATTATTGGAGGGATTTCGAAGTCTGAAAATGTCAAAAT



TCAAGGCATTAACGAAGGGATTAATCTTTACTGTCAAAAGAATAAAGTGAAGCTACCACGCTTAACTC



CTCTGTATAAGATGATTCTCTCTGATCGGGTCTCTAATTCCTTTGTGCTGGATACCATTGAAAATGATA



CCGAGTTAATTGAAATGATCTCTGATCTGATAAATAAGACAGAGATAAGTCAGGATGTTATTATGTCC



GACATCCAAAATATTTTCATCAAATATAAACAACTCGGCAACTTGCCGGGGATTAGCTACTCATCTAT



AGTGAATGCTATCTGTTCGGATTACGACAATAACTTTGGTGACGGCAAACGTAAAAAAAGCTATGAG



AATGATCGCAAAAAACACCTCGAGACTAACGTGTATAGCATTAACTATATCTCAGAGTTACTGACAG



ACACCGACGTCTCCAGCAACATAAAGATGCGGTACAAAGAGCTGGAGCAGAATTATCAGGTATGCAA



GGAAAATTTCAACGCCACTAACTGGATGAACATCAAAAACATTAAGCAGTCTGAGAAAACCAATCTG



ATCAAGGACCTTCTTGACATCCTCAAGAGCATCCAGCGGTTTTATGATTTGTTTGACATCGTGGATGA



AGACAAAAATCCTAGTGCTGAGTTCTATACCTGGCTGTCTAAAAACGCGGAGAAACTGGACTTCGAG



TTTAATTCAGTGTACAACAAGAGCAGGAACTACCTCACGAGAAAGCAGTACTCCGATAAAAAGATTA



AGTTGAACTTCGATAGTCCTACTCTCGCCAAGGGGTGGGATGCGAACAAAGAAATTGATAATAGCAC



AATTATCATGAGGAAGTTCAACAACGACCGGGGCGATTACGATTACTTCTTGGGGATCTGGAATAAG



AGCACACCTGCCAACGAAAAGATCATCCCATTAGAGGATAATGGACTGTTTGAAAAAATGCAATATA



AGCTGTATCCCGATCCTAGTAAAATGCTGCCAAAGCAATTCCTTTCTAAGATCTGGAAAGCTAAACAT



CCAACTACACCCGAGTTTGATAAGAAGTACAAAGAAGGTCGGCACAAGAAGGGGCCTGATTTTGAGA



AAGAGTTTCTGCACGAGTTGATCGATTGCTTTAAGCATGGATTGGTAAACCACGACGAAAAATATCAG



GATGTGTTCGGGTTCAATCTGCGCAACACGGAAGACTACAACTCTTATACAGAGTTTCTGGAGGACGT



CGAAAGGTGCAACTATAATCTTAGTTTCAATAAAATCGCTGACACGTCTAACTTGATAAATGATGGGA



AACTCTATGTTTTTCAGATCTGGAGCAAGGATTTCAGCATAGATAGCAAGGGAACAAAAAACTTGAA



CACAATATACTTTGAATCCCTCTTCTCGGAGGAAAATATGATCGAGAAGATGTTCAAGCTCTCAGGGG



AAGCCGAAATATTCTATCGTCCAGCAAGTTTGAATTATTGTGAAGATATTATCAAGAAGGGACACCAC



CACGCCGAACTGAAGGACAAATTCGACTATCCCATCATCAAGGACAAGCGATATAGCCAGGACAAAT



TTTTTTTTCATGTCCCCATGGTTATCAACTACAAAAGCGAGAAGTTAAACTCCAAATCACTTAACAAT



AGGACGAACGAAAATTTAGGCCAATTCACGCACATCATCGGTATCGACCGCGGAGAGCGACATCTCA



TCTACCTGACCGTGGTGGATGTGTCCACCGGTGAGATCGTTGAGCAAAAGCACCTGGATGAAATTATA



AATACAGATACAAAAGGCGTCGAGCATAAAACTCATTATCTCAATAAATTAGAAGAGAAGTCCAAGA



CGCGGGATAATGAAAGAAAGTCCTGGGAAGCAATCGAGACGATTAAGGAGCTGAAAGAAGGCTATA



TTAGCCACGTGATCAATGAAATCCAGAAATTGCAGGAAAAGTATAACGCACTGATAGTGATGGAGAA



CCTCAATTATGGGTTTAAGAACTCGCGTATCAAAGTGGAAAAGCAGGTCTACCAGAAATTCGAGACC



GCCCTGATTAAAAAGTTTAATTACATCATTGACAAGAAAGATCCTGAAACCTACATTCATGGATACCA



ACTGACGAATCCAATCACTACACTCGATAAAATTGGTAACCAGAGCGGTATTGTGTTGTACATTCCGG



CTTGGAATACAAGCAAGATTGATCCAGTCACTGGTTTCGTTAACCTCCTGTATGCAGACGATTTGAAA



TACAAGAACCAGGAGCAGGCTAAAAGCTTTATCCAGAAAATCGATAATATCTACTTCGAAAATGGTG



AGTTTAAATTTGATATAGATTTCAGCAAATGGAACAACCGCTACTCAATTAGCAAGACGAAATGGAC



ACTGACAAGCTACGGAACCCGGATACAGACGTTCCGAAACCCCCAGAAAAATAACAAGTGGGACAG



CGCCGAGTATGACCTGACCGAAGAGTTTAAATTAATCCTGAACATCGATGGTACTCTGAAATCTCAGG



ATGTGGAAACCTATAAGAAATTCATGTCTTTATTCAAGCTGATGTTGCAGCTGCGAAACTCCGTTACT



GGAACAGACATTGACTACATGATTAGCCCTGTGACAGATAAAACTGGAACCCACTTTGATTCACGGG



AGAATATCAAGAACCTGCCCGCCGATGCTGATGCGAACGGAGCTTACAACATTGCTAGGAAGGGCAT



CATGGCAATCGAGAATATTATGAACGGCATTAGCGACCCTCTGAAGATCAGTAATGAGGACTACCTG



AAGTACATTCAGAACCAACAAGAGTAA





SEQ ID NO: 158
ATGACCCAGTTTGAGGGTTTCACCAATCTTTATCAGGTGTCAAAAACACTCAGATTTGAGCTCATCCC



ACAGGGTAAAACTTTAAAGCATATTCAAGAGCAGGGCTTTATAGAGGAAGACAAAGCCAGAAACGA



CCATTATAAGGAACTAAAACCGATCATTGACCGCATCTACAAAACCTATGCCGACCAATGCCTTCAGC



TCGTCCAACTCGATTGGGAGAATCTGAGCGCCGCTATTGACAGCTACAGGAAGGAGAAGACCGAGGA



GACTAGAAACGCCCTGATCGAGGAGCAGGCGACCTATAGAAACGCTATTCACGATTATTTTATCGGCC



GCACCGACAATTTGACAGATGCCATCAACAAGCGGCACGCCGAAATTTATAAGGGGTTATTTAAGGC



CGAGCTGTTCAATGGAAAAGTACTGAAACAGCTGGGCACCGTAACAACCACCGAACACGAGAATGCT



CTGTTGAGGTCCTTCGACAAGTTTACTACCTACTTTAGCGGCTTCTACGAAAACCGTAAAAACGTGTTT



TCCGCGGAGGATATTTCAACAGCCATTCCTCATAGGATCGTGCAGGATAATTTCCCCAAGTTTAAGGA



GAACTGCCATATCTTTACCAGACTTATCACTGCTGTGCCAAGTTTACGAGAACACTTCGAGAATGTTA



AGAAGGCTATAGGCATATTCGTTTCCACCTCCATCGAAGAAGTATTCAGTTTTCCATTCTACAATCAGT



TACTCACGCAGACCCAGATAGATCTCTACAATCAGCTGCTCGGAGGCATTTCTAGAGAAGCAGGCAC



GGAAAAGATCAAGGGCTTAAATGAAGTACTCAATCTTGCAATTCAGAAGAACGATGAGACAGCACAC



ATTATTGCATCTCTCCCTCACAGATTCATTCCCCTGTTCAAACAGATCCTGTCCGATCGCAACACACTA



AGCTTTATACTTGAGGAGTTTAAGTCAGATGAGGAAGTGATCCAGAGCTTCTGTAAGTATAAGACTTT



GCTCCGTAATGAAAACGTGCTTGAGACAGCAGAGGCTCTCTTTAACGAGTTGAATTCCATCGACCTGA



CACACATTTTTATCAGCCATAAAAAGCTGGAAACGATTAGCTCTGCCTTGTGCGACCACTGGGACACC



CTGCGTAACGCCCTCTATGAAAGGCGCATTTCCGAGCTCACCGGGAAGATCACAAAAAGTGCCAAGG



AAAAAGTCCAGAGGTCCCTTAAACATGAAGACATCAACCTACAAGAGATCATCTCTGCGGCTGGGAA



AGAGCTGTCAGAAGCATTTAAACAGAAGACTTCCGAGATCCTGAGCCACGCACACGCCGCATTAGAC



CAGCCCCTGCCTACAACTCTTAAAAAACAGGAGGAGAAGGAGATTTTAAAGAGCCAGCTGGACTCAT



TACTCGGCCTGTATCATCTCCTGGACTGGTTCGCCGTGGACGAATCCAACGAGGTGGACCCAGAATTT



AGCGCCAGGCTGACAGGAATTAAACTGGAAATGGAGCCAAGTTTGAGCTTTTACAACAAGGCTCGGA



ACTATGCCACTAAAAAGCCCTACAGCGTGGAAAAGTTCAAGCTGAATTTTCAGATGCCGACCCTGGCT



TCCGGGTGGGATGTTAATAAGGAAAAGAATAATGGGGCTATACTGTTCGTCAAAAATGGTCTCTACTA



CCTGGGAATCATGCCCAAACAGAAGGGCAGGTACAAAGCCCTTTCGTTTGAGCCGACCGAAAAAACC



AGCGAAGGCTTTGATAAGATGTATTACGACTATTTCCCAGATGCAGCCAAGATGATCCCAAAATGTAG



CACTCAGTTGAAGGCGGTAACCGCTCACTTTCAGACACACACCACTCCTATCTTGCTCTCCAACAACT



TTATTGAGCCGCTGGAGATCACGAAGGAAATCTACGACCTTAACAACCCAGAGAAGGAACCCAAGAA



ATTCCAAACAGCTTATGCTAAGAAGACTGGGGATCAAAAGGGCTATCGAGAGGCTTTGTGTAAGTGG



ATTGACTTTACACGGGATTTCCTGAGTAAGTATACCAAGACCACATCTATTGACCTGTCCTCACTGAG



ACCTTCCTCACAATATAAGGATCTCGGAGAGTATTATGCCGAACTCAACCCTCTACTCTATCACATCTC



TTTCCAGAGGATCGCCGAAAAGGAAATTATGGACGCCGTCGAGACAGGCAAGCTGTACCTCTTCCAG



ATTTACAACAAGGATTTCGCAAAGGGCCACCACGGAAAACCCAATTTGCACACTTTGTACTGGACAG



GGCTCTTCTCTCCCGAAAATTTGGCCAAAACTTCAATAAAACTGAACGGGCAAGCCGAGCTGTTCTAT



CGGCCCAAGTCACGTATGAAGCGGATGGCCCACCGGCTGGGCGAGAAGATGCTCAACAAGAAACTGA



AGGATCAGAAGACGCCCATACCAGACACTCTTTACCAAGAGCTGTATGACTACGTGAATCACAGACT



GAGTCACGACCTGTCTGATGAAGCCCGGGCTCTTCTTCCAAATGTGATTACCAAAGAAGTTTCCCACG



AAATTATCAAGGACCGGCGCTTCACCTCTGACAAATTCTTTTTCCACGTCCCAATCACCCTCAACTACC



AGGCAGCCAATTCCCCTTCAAAGTTTAACCAGCGTGTGAATGCCTACCTGAAAGAGCATCCGGAGAC



CCCCATCATAGGGATAGACAGAGGAGAGCGGAATCTTATCTACATTACTGTGATTGACAGCACAGGT



AAGATCTTGGAGCAGAGATCTTTAAATACAATCCAGCAGTTTGACTACCAGAAGAAACTGGATAACC



GAGAGAAGGAAAGGGTTGCTGCAAGACAGGCCTGGTCAGTGGTCGGCACCATCAAAGACCTGAAGC



AGGGCTACTTATCCCAAGTAATTCACGAAATTGTCGATCTTATGATTCATTATCAAGCCGTTGTTGTGC



TGGAGAACCTGAATTTTGGCTTCAAAAGCAAACGAACAGGTATCGCCGAGAAAGCCGTGTATCAGCA



GTTCGAAAAGATGCTCATAGACAAGCTGAACTGCTTAGTGCTGAAGGATTATCCTGCTGAGAAGGTC



GGCGGCGTACTTAACCCATACCAGCTGACCGATCAGTTCACTAGTTTCGCCAAGATGGGAACGCAAA



GTGGCTTCCTTTTCTACGTGCCCGCTCCCTACACGAGTAAGATCGACCCTCTGACCGGCTTCGTCGACC



CATTCGTCTGGAAGACCATCAAGAATCACGAATCACGGAAACACTTCTTAGAGGGGTTTGACTTCCTG



CACTACGACGTGAAGACAGGGGACTTCATCTTACACTTTAAGATGAATCGAAACCTCTCCTTCCAGCG



GGGCCTGCCTGGTTTCATGCCCGCATGGGACATCGTGTTTGAGAAAAACGAGACACAGTTTGACGCTA



AGGGAACCCCCTTTATTGCGGGGAAGCGGATTGTCCCAGTCATCGAAAACCATCGGTTCACCGGGCG



ATACCGGGATCTGTACCCGGCCAACGAGCTCATCGCGCTGCTGGAGGAGAAGGGTATTGTGTTTAGG



GATGGATCCAACATTCTGCCTAAGTTGCTGGAAAATGATGATTCGCACGCCATTGATACCATGGTTGC



ACTGATTAGATCCGTACTGCAGATGAGGAATAGCAATGCTGCAACCGGGGAGGATTATATTAATTCCC



CAGTGCGAGATCTGAATGGTGTCTGTTTTGACTCGCGCTTTCAGAATCCAGAATGGCCAATGGATGCA



GACGCTAACGGGGCGTACCACATTGCTCTGAAAGGCCAGCTACTCCTGAACCACCTCAAGGAGAGCA



AAGATCTGAAGCTGCAGAACGGCATTTCCAACCAAGACTGGCTCGCCTACATACAAGAACTGCGCAA



TTAA





SEQ ID NO: 159
ATGGCTGTCAAATCCATCAAGGTTAAATTACGGCTTGATGACATGCCCGAGATCCGCGCCGGGCTCTG



GAAACTCCATAAAGAAGTGAATGCTGGCGTTAGATACTACACAGAATGGCTCTCCCTGCTGCGCCAG



GAAAATTTGTACCGCCGGTCACCTAATGGAGATGGAGAGCAGGAATGCGATAAAACAGCAGAAGAG



TGCAAAGCCGAATTGCTGGAGCGACTGCGGGCACGGCAGGTTGAGAATGGACACCGAGGTCCGGCGG



GATCGGACGACGAGCTGCTCCAGCTCGCCAGACAATTATATGAACTGCTGGTGCCTCAGGCTATTGGG



GCAAAGGGTGACGCACAGCAGATTGCTAGAAAATTTCTGTCTCCCCTCGCCGACAAAGACGCTGTCG



GCGGCCTTGGGATAGCCAAAGCCGGCAACAAACCCCGATGGGTGCGCATGAGGGAGGCTGGTGAGCC



TGGCTGGGAGGAAGAAAAGGAAAAGGCCGAAACCAGAAAGTCCGCCGACAGGACCGCGGACGTACT



CCGAGCATTGGCCGATTTTGGGCTGAAGCCCTTAATGCGAGTCTACACCGATAGTGAAATGTCTAGCG



TGGAGTGGAAGCCATTACGCAAAGGGCAGGCAGTGCGGACGTGGGACCGTGACATGTTCCAGCAAGC



CATCGAGCGAATGATGAGCTGGGAGAGCTGGAACCAGAGAGTGGGGCAGGAGTATGCCAAGCTGGT



CGAGCAGAAAAACCGGTTTGAGCAAAAAAATTTTGTAGGTCAGGAACACCTGGTGCATCTCGTTAAC



CAGCTCCAGCAAGATATGAAGGAAGCTTCGCCTGGATTAGAGAGCAAAGAGCAGACTGCACACTATG



TAACCGGAAGAGCACTGAGGGGCAGTGACAAAGTGTTCGAAAAATGGGGAAAACTGGCTCCCGATG



CCCCCTTTGACCTGTACGACGCAGAAATAAAAAACGTGCAGCGGCGAAACACCAGGCGATTTGGTAG



CCATGATCTGTTCGCCAAATTGGCAGAGCCGGAATATCAGGCTCTTTGGCGAGAAGACGCATCATTTC



TCACTAGGTACGCGGTCTATAACTCCATTTTGAGGAAATTGAACCACGCAAAAATGTTTGCCACCTTC



ACGTTGCCTGACGCCACCGCTCATCCCATTTGGACACGGTTTGATAAGCTGGGCGGCAATCTGCATCA



GTATACATTCCTGTTTAACGAGTTTGGAGAGCGAAGACATGCGATACGATTCCACAAGCTACTGAAGG



TCGAAAATGGCGTGGCACGTGAGGTGGACGATGTCACCGTGCCCATCAGCATGAGCGAACAGCTGGA



TAATTTGTTGCCGCGGGACCCAAATGAACCTATAGCCCTTTATTTTAGGGACTACGGGGCGGAGCAAC



ATTTCACTGGGGAGTTTGGCGGCGCAAAAATTCAGTGCCGACGCGACCAGCTCGCCCACATGCATAG



AAGACGCGGGGCCCGGGACGTATACCTTAACGTCTCTGTGAGGGTGCAGTCCCAGTCAGAGGCAAGA



GGGGAACGCAGACCACCTTACGCAGCAGTATTCAGGCTGGTAGGCGATAACCACCGGGCGTTTGTAC



ACTTTGATAAACTTTCTGACTACCTGGCCGAACACCCGGATGACGGCAAATTAGGATCGGAGGGGCT



GCTTAGCGGCCTGCGTGTGATGAGCGTCGATCTGGGGCTACGGACCTCTGCTTCCATCTCTGTGTTCCG



TGTGGCCCGAAAGGACGAGTTGAAACCTAATTCGAAGGGCCGTGTACCATTCTTTTTCCCTATTAAGG



GAAATGATAATCTCGTCGCGGTGCACGAGCGTTCCCAACTGCTGAAACTGCCTGGCGAGACCGAGTC



CAAAGATCTCAGAGCAATCCGGGAGGAGCGACAACGTACACTTAGGCAACTCCGCACCCAGCTGGCC



TATCTGCGCTTGCTGGTGCGGTGCGGCTCCGAGGATGTAGGGAGAAGAGAGCGAAGCTGGGCAAAGC



TGATAGAGCAACCAGTTGACGCCGCGAATCACATGACCCCCGACTGGCGCGAAGCGTTTGAAAATGA



GCTGCAGAAGTTGAAATCTCTGCATGGGATTTGCTCAGATAAGGAGTGGATGGACGCCGTATACGAG



TCTGTTCGCCGGGTATGGCGGCACATGGGGAAGCAGGTGAGAGATTGGAGAAAGGACGTTCGCTCTG



GGGAACGGCCGAAAATTCGGGGATACGCAAAGGATGTCGTGGGCGGCAATAGCATTGAGCAGATCG



AGTACCTGGAAAGGCAATACAAATTTCTGAAATCTTGGTCTTTCTTTGGGAAGGTAAGCGGACAAGTT



ATCAGAGCCGAAAAGGGATCTCGCTTTGCTATCACATTGAGGGAACACATTGATCACGCCAAAGAAG



ACAGGTTGAAAAAGTTGGCTGATCGCATTATCATGGAAGCACTCGGTTACGTCTACGCCCTTGATGAG



CGCGGTAAAGGGAAGTGGGTAGCCAAGTATCCCCCATGTCAGCTGATCCTGCTCGAGGAACTTTCTGA



GTATCAGTTCAATAACGACCGTCCTCCCTCCGAAAATAATCAGCTCATGCAATGGTCCCACCGGGGTG



TGTTCCAAGAACTGATCAATCAGGCTCAGGTGCACGACCTCCTCGTAGGCACTATGTATGCAGCCTTT



AGCTCCCGTTTTGACGCGCGCACAGGCGCCCCTGGAATACGATGTAGGCGAGTTCCCGCACGGTGCAC



TCAAGAACATAACCCGGAGCCTTTCCCATGGTGGCTCAATAAGTTTGTTGTGGAGCATACCCTCGACG



CTTGCCCATTGAGGGCGGATGACTTGATTCCCACAGGCGAGGGGGAGATCTTCGTGAGCCCATTTTCT



GCCGAAGAAGGGGATTTCCACCAAATACATGCCGACTTGAATGCTGCCCAAAATCTGCAGCAAAGGC



TGTGGTCAGACTTCGACATCTCGCAAATCAGACTGCGGTGTGACTGGGGCGAAGTAGACGGCGAGCT



GGTGCTGATACCTAGACTGACGGGTAAGCGTACCGCCGATAGCTATAGTAATAAGGTTTTTTATACGA



ATACGGGGGTGACATATTACGAGCGTGAGAGAGGCAAGAAGCGTCGGAAGGTGTTCGCGCAGGAGA



AGCTGAGCGAAGAGGAGGCGGAGCTACTGGTAGAGGCAGATGAGGCAAGAGAAAAGTCCGTCGTCC



TGATGCGGGATCCTAGCGGGATTATTAACAGAGGTAATTGGACACGGCAGAAAGAATTCTGGAGCAT



GGTGAATCAAAGAATCGAGGGTTACCTGGTGAAGCAAATTCGAAGCCGGGTGCCCCTTCAAGACAGC



GCATGTGAAAACACTGGGGACATCTAG





SEQ ID NO: 160
ATGGCTACTCGGTCCTTCATCCTGAAAATCGAGCCAAATGAAGAGGTGAAAAAGGGCCTGTGGAAGA



CCCATGAGGTACTTAACCACGGCATAGCATACTATATGAATATCCTAAAACTTATACGGCAGGAGGCT



ATCTACGAGCATCACGAGCAAGATCCTAAAAATCCAAAGAAGGTTAGTAAGGCTGAAATCCAGGCTG



AATTGTGGGACTTCGTGCTGAAGATGCAGAAATGCAACAGTTTCACGCATGAAGTTGATAAGGACGT



CGTGTTTAATATACTCCGGGAGCTGTACGAAGAACTGGTACCAAGCTCTGTGGAAAAGAAAGGAGAG



GCCAACCAGCTAAGTAATAAGTTCCTCTATCCTCTCGTGGACCCCAATTCACAGAGCGGCAAAGGTAC



CGCATCTTCTGGGAGGAAACCACGCTGGTACAACTTGAAGATCGCTGGCGATCCCAGCTGGGAGGAG



GAAAAGAAGAAATGGGAAGAGGATAAAAAGAAAGACCCCCTGGCCAAAATCTTAGGCAAGCTCGCC



GAGTACGGTCTGATTCCACTTTTCATCCCGTTCACAGATAGCAATGAGCCGATCGTCAAGGAGATTAA



GTGGATGGAAAAGAGCCGCAATCAGAGTGTGCGGAGGCTGGACAAAGACATGTTTATTCAGGCCCTG



GAACGCTTCCTTAGCTGGGAAAGCTGGAACCTGAAGGTTAAGGAAGAGTACGAAAAAGTCGAGAAG



GAGCATAAGACTTTGGAGGAGCGCATCAAAGAAGACATCCAGGCCTTTAAGTCTCTAGAACAGTATG



AGAAAGAACGGCAGGAACAGCTGCTGCGTGATACACTGAACACAAACGAATATCGCCTGAGCAAGA



GGGGACTCAGAGGCTGGAGAGAAATCATTCAAAAGTGGCTCAAAATGGATGAAAATGAGCCGTCTGA



AAAATACCTTGAAGTTTTCAAGGACTACCAGCGGAAGCACCCTAGAGAAGCCGGCGACTATAGTGTT



TACGAATTCTTGAGCAAGAAGGAGAATCATTTTATATGGAGGAATCACCCGGAGTACCCATATCTGTA



CGCAACCTTCTGCGAAATCGACAAGAAAAAAAAAGACGCCAAGCAACAGGCTACATTTACTCTGGCC



GACCCTATCAATCACCCTCTATGGGTCCGGTTTGAGGAGCGCTCCGGAAGCAATCTGAATAAATATCG



TATTCTGACTGAACAGTTACACACAGAGAAGCTCAAGAAGAAACTTACGGTGCAGCTGGACCGCCTG



ATATACCCAACAGAGTCCGGAGGATGGGAAGAGAAAGGAAAGGTTGACATCGTACTGCTTCCATCTC



GTCAGTTTTACAACCAGATATTCCTGGACATCGAGGAGAAGGGGAAACACGCCTTCACATACAAGGA



CGAGTCCATAAAGTTCCCACTGAAGGGTACTTTAGGCGGTGCTAGGGTGCAGTTCGACCGCGATCACC



TGAGACGGTACCCCCACAAGGTGGAGAGCGGGAACGTGGGACGAATCTACTTTAATATGACAGTGAA



CATTGAACCCACAGAGAGTCCAGTTAGTAAATCCCTGAAAATTCACCGTGACGACTTTCCGAAATTTG



TGAATTTCAAGCCAAAGGAGCTTACGGAGTGGATCAAGGATTCAAAGGGAAAGAAGCTGAAATCTGG



TATCGAATCTCTCGAGATCGGTCTCCGTGTCATGAGCATCGATCTGGGACAGCGCCAGGCAGCTGCCG



CCAGTATATTCGAGGTGGTAGACCAAAAGCCTGACATCGAGGGAAAGCTCTTCTTCCCAATCAAAGG



CACAGAGCTGTATGCGGTGCACCGGGCGTCCTTTAATATAAAGCTGCCCGGTGAAACCCTGGTGAAGT



CACGGGAGGTGCTTAGAAAAGCGCGAGAGGATAACCTCAAACTGATGAACCAAAAACTGAACTTTCT



GAGGAACGTCCTGCACTTTCAGCAGTTCGAAGATATTACCGAACGCGAAAAGAGAGTAACCAAGTGG



ATATCTCGTCAAGAGAACAGCGACGTCCCGTTAGTCTATCAGGACGAACTCATCCAAATACGGGAGTT



GATGTATAAGCCCTACAAGGATTGGGTCGCCTTTCTTAAGCAGCTTCACAAACGCCTAGAGGTCGAAA



TAGGTAAAGAGGTGAAACATTGGCGGAAGTCGCTCAGCGACGGGAGGAAGGGACTTTATGGCATCTC



TTTGAAGAACATTGACGAAATCGATAGAACCAGAAAATTTTTGTTGAGATGGTCCCTCCGACCCACCG



AGCCTGGAGAGGTGAGGCGGTTAGAACCAGGACAGAGGTTCGCTATCGATCAGCTGAATCACCTCAA



TGCTCTGAAGGAGGACCGCCTCAAGAAAATGGCCAATACAATCATAATGCACGCCCTTGGCTACTGCT



ACGACGTCCGAAAGAAGAAGTGGCAGGCCAAGAATCCCGCCTGTCAAATTATCCTTTTTGAGGATCTT



AGCAATTACAACCCCTATGAAGAGCGGTCCAGATTCGAAAATAGTAAGCTCATGAAGTGGAGCCGCA



GGGAGATCCCGCGCCAAGTGGCCCTTCAGGGGGAAATTTATGGGCTGCAGGTAGGCGAGGTCGGGGC



CCAATTCTCCTCGCGCTTTCATGCGAAAACTGGAAGTCCTGGAATCCGGTGCTCAGTGGTGACAAAGG



AGAAGTTGCAAGACAATCGGTTTTTTAAAAACTTACAGCGGGAGGGAAGGCTGACCCTGGATAAGAT



AGCCGTACTTAAGGAAGGAGATCTGTACCCTGACAAAGGCGGTGAAAAGTTCATTAGCTTGAGCAAG



GACCGAAAACTTGTGACCACCCACGCTGACATCAATGCGGCACAGAACCTGCAGAAGAGATTTTGGA



CTCGCACCCACGGATTCTACAAAGTTTACTGCAAAGCATATCAAGTAGACGGACAGACCGTATACATC



CCCGAGTCCAAAGATCAGAAGCAGAAAATTATTGAAGAGTTTGGGGAAGGGTACTTTATCCTGAAGG



ATGGTGTCTACGAATGGGGCAACGCTGGTAAACTTAAAATTAAGAAGGGCAGCTCTAAACAGTCCTC



CAGCGAGTTAGTTGATTCTGATATTCTGAAAGACAGTTTCGACCTGGCCAGCGAACTTAAAGGGGAA



AAATTAATGCTGTACCGGGACCCCAGCGGAAACGTCTTTCCATCCGATAAGTGGATGGCCGCTGGAGT



GTTCTTTGGCAAGTTAGAGAGGATTCTCATAAGTAAGCTGACCAACCAATACTCAATCTCCACAATCG



AGGATGACTCATCCAAGCAGTCTATGTGA





SEQ ID NO: 161
ATGCCTACACGCACTATCAACCTGAAACTGGTTCTTGGCAAGAATCCAGAGAATGCTACCCTTCGTCG



GGCACTATTTTCAACGCATAGACTGGTGAATCAGGCTACCAAACGGATTGAAGAGTTCCTCTTGCTTT



GTCGGGGGGAAGCATATAGGACGGTGGATAATGAGGGGAAAGAGGCTGAAATTCCGAGACACGCCG



TGCAGGAGGAAGCTCTTGCGTTTGCAAAGGCCGCTCAACGGCACAATGGTTGCATCTCTACTTATGAA



GACCAGGAAATCCTGGATGTGCTCCGGCAACTGTATGAAAGGCTGGTGCCTTCTGTGAATGAAAATA



ATGAAGCAGGGGACGCTCAAGCCGCAAACGCGTGGGTGTCGCCACTGATGTCCGCCGAGTCCGAGGG



AGGGCTCAGCGTTTACGACAAGGTGCTGGACCCACCCCCAGTGTGGATGAAACTCAAAGAGGAAAAA



GCTCCGGGCTGGGAGGCTGCTTCCCAGATCTGGATCCAGTCCGACGAAGGGCAGTCCCTTCTTAACAA



GCCTGGTTCGCCCCCGCGGTGGATTAGGAAACTGAGGTCAGGCCAGCCTTGGCAGGACGATTTTGTTA



GCGACCAGAAAAAGAAGCAGGACGAGCTGACAAAGGGGAATGCGCCACTGATCAAACAATTAAAGG



AAATGGGCTTATTGCCTCTTGTGAATCCCTTTTTTAGACATCTGCTTGACCCGGAGGGGAAGGGGGTG



TCACCTTGGGACAGACTCGCTGTTAGGGCCGCTGTCGCTCATTTCATATCATGGGAATCATGGAACCA



CCGGACACGCGCCGAATACAATAGTTTGAAGCTGCGGAGGGATGAGTTCGAAGCAGCTTCCGACGAA



TTCAAGGACGACTTCACGCTGCTTCGGCAGTACGAGGCTAAGAGGCACTCCACACTGAAGAGTATAG



CTTTAGCCGATGATTCAAACCCTTATAGGATCGGCGTACGCTCCCTCCGCGCTTGGAACCGCGTCCGC



GAGGAGTGGATCGACAAGGGAGCGACCGAGGAGCAGCGGGTCACCATTCTCAGCAAGTTGCAGACC



CAACTAAGGGGCAAATTTGGAGATCCTGACTTGTTCAACTGGCTGGCGCAGGACCGGCACGTGCACC



TCTGGAGCCCTAGAGATAGTGTTACCCCACTGGTTAGGATCAACGCTGTTGACAAAGTATTGCGACGG



AGAAAACCGTACGCCTTGATGACTTTTGCCCACCCAAGATTCCACCCTCGGTGGATACTTTACGAAGC



CCCAGGGGGCAGCAATCTCCGCCAGTATGCACTGGATTGTACCGAAAATGCTCTGCACATTACACTGC



CTCTGCTGGTTGACGATGCACATGGCACATGGATTGAGAAAAAAATTAGGGTTCCTCTTGCCCCCAGC



GGCCAGATTCAGGACCTGACACTAGAAAAGCTCGAGAAGAAGAAAAATCGTCTCTACTACCGTTCTG



GGTTCCAGCAGTTTGCCGGCCTGGCCGGAGGTGCCGAGGTGCTTTTCCATCGACCATACATGGAGCAC



GATGAGAGGAGCGAGGAGAGCTTATTAGAACGCCCTGGTGCTGTTTGGTTCAAACTCACCTTGGACGT



GGCAACCCAGGCCCCTCCAAACTGGTTGGACGGAAAGGGCCGCGTCCGAACGCCCCCCGAGGTTCAC



CACTTCAAGACAGCCCTCAGTAACAAGTCTAAGCACACACGGACCCTCCAGCCCGGACTCAGAGTGT



TATCCGTGGATCTGGGAATGCGCACCTTCGCCTCTTGCTCCGTATTTGAGCTGATCGAGGGCAAACCA



GAGACTGGCAGAGCGTTCCCTGTGGCCGACGAACGTTCCATGGATTCACCAAACAAGCTGTGGGCCA



AGCACGAAAGATCCTTTAAACTCACGCTCCCCGGCGAAACCCCCAGTCGGAAAGAAGAGGAGGAACG



GAGCATTGCAAGAGCCGAAATCTATGCGTTGAAAAGAGATATTCAGAGATTAAAAAGTCTTCTGCGC



CTGGGGGAAGAGGATAACGATAATAGACGCGATGCACTTCTTGAGCAATTTTTCAAGGGCTGGGGCG



AGGAAGACGTGGTTCCAGGTCAGGCCTTTCCCCGGAGTCTGTTCCAGGGGCTGGGGGCCGCCCCATTC



AGATCCACCCCTGAGTTGTGGAGACAACACTGTCAAACCTATTATGATAAAGCAGAGGCGTGCCTGG



CTAAACACATCAGCGATTGGCGCAAGAGAACCAGGCCTAGGCCTACCTCACGTGAGATGTGGTACAA



GACACGCTCTTATCACGGCGGAAAGTCAATCTGGATGCTGGAATACCTCGACGCTGTGAGGAAACTG



CTCTTATCCTGGAGCCTCAGAGGCCGGACCTACGGGGCTATCAACAGACAGGACACAGCAAGGTTCG



GGAGCTTAGCCAGCCGGCTCCTTCACCACATTAACTCACTCAAAGAGGATCGAATAAAGACCGGAGC



CGACTCGATCGTGCAGGCAGCCCGAGGGTACATCCCCCTGCCTCATGGGAAGGGCTGGGAGCAGCGA



TATGAACCCTGCCAGCTGATCTTGTTTGAGGACCTTGCCCGTTATAGATTTCGCGTTGATAGACCTCGC



CGTGAGAATTCTCAGCTGATGCAGTGGAACCACAGAGCGATCGTGGCTGAGACCACTATGCAGGCCG



AGCTGTATGGACAGATCGTGGAGAACACCGCCGCAGGGTTCAGTTCTCGGTTTCATGCTGCCACCGGA



GCTCCCGGCGTCCGGTGCCGCTTCCTCTTAGAGCGTGATTTTGACAATGACCTCCCAAAGCCCTATCTG



CTGAGGGAACTGAGCTGGATGCTGGGGAACACAAAAGTAGAATCGGAGGAGGAGAAGCTACGGCTC



CTCTCCGAAAAGATACGTCCAGGCTCTCTGGTACCATGGGACGGAGGAGAGCAGTTCGCGACACTGC



ATCCTAAGAGACAGACGTTATGTGTGATTCACGCCGATATGAACGCCGCTCAGAATCTGCAGCGAAG



ATTCTTTGGCCGCTGCGGCGAAGCCTTCAGGCTGGTATGTCAGCCCCACGGGGATGATGTGCTGCGGC



TGGCCTCAACCCCTGGGGCTAGACTCTTGGGGGCACTCCAGCAGCTGGAAAATGGCCAAGGGGCTTT



CGAACTCGTTCGGGACATGGGCAGCACAAGCCAGATGAACAGATTCGTCATGAAGAGCCTGGGAAAG



AAAAAGATCAAACCCTTACAGGACAATAATGGCGACGACGAACTGGAGGACGTGTTGTCCGTGCTGC



CAGAGGAAGACGACACAGGCCGCATCACTGTCTTCCGCGACTCAAGTGGGATATTCTTTCCTTGCAAC



GTGTGGATTCCGGCCAAACAGTTCTGGCCTGCCGTCAGAGCCATGATTTGGAAAGTGATGGCTAGTCA



TTCATTGGGATGA





SEQ ID NO: 162
ATGACAAAGCTGAGGCACAGACAAAAGAAGCTTACACACGACTGGGCAGGGAGCAAGAAACGTGAG



GTCCTTGGGTCAAATGGAAAACTGCAGAACCCCTTGCTCATGCCTGTAAAGAAGGGGCAGGTAACAG



AATTTAGAAAAGCATTCTCCGCGTACGCTCGGGCAACTAAGGGGGAAATGACCGATGGACGGAAGAA



CATGTTCACCCATTCTTTCGAGCCATTCAAAACAAAGCCGTCATTGCACCAATGCGAGCTGGCCGATA



AGGCTTACCAGTCTTTGCATAGTTACCTCCCCGGTTCCCTGGCCCATTTCTTGCTTTCCGCACACGCAC



TGGGCTTTCGTATTTTCTCTAAATCTGGGGAGGCAACTGCCTTCCAGGCCAGCTCAAAAATCGAGGCC



TATGAGTCCAAGCTCGCTTCGGAGCTAGCCTGTGTCGATTTGAGTATCCAGAATTTGACGATTAGTAC



TCTTTTCAACGCTCTCACAACTTCAGTTCGGGGCAAGGGGGAGGAAACTTCAGCAGATCCCCTTATCG



CACGGTTCTACACTCTCCTGACGGGCAAGCCCCTGAGCCGAGACACACAGGGCCCAGAACGGGACTT



GGCAGAGGTCATCTCCAGAAAGATCGCCTCGTCCTTCGGCACATGGAAGGAAATGACTGCCAACCCT



CTGCAGAGCCTCCAGTTCTTCGAAGAAGAGCTTCATGCACTAGATGCCAACGTGTCTTTATCTCCAGC



TTTTGATGTGTTAATCAAGATGAATGATCTCCAAGGTGATCTGAAGAACCGTACTATAGTGTTCGACC



CAGATGCACCCGTGTTCGAGTACAACGCTGAGGATCCAGCCGATATCATCATAAAGCTGACAGCTCG



GTATGCGAAGGAGGCCGTCATCAAGAATCAGAACGTGGGCAATTATGTGAAAAACGCCATTACCACC



ACTAATGCCAATGGGCTGGGGTGGCTCCTCAATAAAGGGCTTTCACTACTGCCAGTTTCTACTGACGA



TGAGCTGCTCGAATTCATTGGGGTGGAGAGAAGCCATCCCAGCTGTCACGCGCTGATAGAGCTGATTG



CCCAGCTAGAGGCGCCGGAACTGTTTGAGAAGAATGTGTTTAGTGACACCCGTTCCGAGGTTCAGGGT



ATGATCGACAGTGCAGTGTCGAACCACATTGCTCGGCTGTCCAGCAGCCGAAACTCCCTGAGCATGG



ACAGCGAGGAATTGGAACGCTTGATTAAATCTTTCCAGATTCATACTCCCCATTGTTCTCTGTTCATAG



GCGCTCAGTCCTTATCTCAGCAGCTGGAGAGCTTACCTGAGGCGCTGCAGTCCGGAGTGAACAGCGCT



GATATCTTATTAGGCAGCACACAGTATATGCTGACCAACTCTCTCGTTGAAGAGTCAATTGCAACATA



TCAAAGGACATTAAATAGGATCAATTACCTGAGTGGGGTGGCTGGGCAGATTAACGGTGCTATCAAA



AGAAAGGCAATCGACGGCGAAAAAATACACCTGCCTGCCGCCTGGAGTGAGCTCATCTCCTTACCTTT



CATTGGACAGCCGGTGATTGATGTGGAGAGCGACCTGGCACACTTAAAAAACCAGTACCAGACCCTG



TCCAATGAATTTGACACCCTCATTTCGGCCCTGCAGAAGAACTTCGATTTGAATTTCAACAAAGCACT



CCTTAACCGCACGCAGCATTTCGAGGCAATGTGCCGGAGCACAAAAAAAAATGCTTTATCTAAGCCC



GAGATCGTGTCCTACAGAGATCTGCTGGCGCGGCTGACCAGTTGCCTTTATCGAGGCTCGCTGGTTCT



CAGAAGGGCGGGAATCGAAGTTCTGAAAAAGCACAAAATCTTTGAGTCGAATAGTGAGCTGAGAGA



ACACGTCCACGAGCGAAAGCACTTCGTGTTCGTTAGTCCATTGGACAGAAAGGCAAAAAAACTGTTG



CGCCTGACCGATTCCCGCCCTGACTTGCTCCATGTGATCGATGAGATCCTGCAACATGACAATCTGGA



GAATAAGGACAGAGAGTCCCTTTGGCTGGTCCGGTCTGGGTACCTCCTTGCTGGTCTGCCGGACCAGC



TGAGTTCTTCGTTTATCAATCTCCCCATAATCACGCAAAAGGGCGATCGCCGGCTGATTGACCTGATT



CAGTATGACCAGATCAATCGCGATGCTTTCGTAATGTTGGTGACAAGTGCTTTCAAAAGCAATCTCTC



TGGGTTGCAGTACCGCGCTAACAAGCAGTCTTTCGTGGTCACCCGCACCCTGTCTCCTTACCTGGGTA



GTAAGCTCGTATACGTCCCTAAAGACAAAGATTGGCTGGTCCCATCCCAGATGTTTGAGGGAAGATTC



GCCGATATTCTGCAGAGTGACTACATGGTCTGGAAGGATGCCGGACGCCTGTGCGTGATCGACACTGC



CAAACATCTCTCTAACATTAAAAAAAGCGTGTTTAGTAGCGAAGAAGTCCTTGCTTTTCTTCGAGAGC



TGCCTCACCGGACCTTCATCCAGACCGAGGTACGGGGGTTAGGAGTGAACGTCGATGGAATCGCATTT



AATAACGGGGATATCCCGAGCTTGAAGACATTCTCGAATTGTGTGCAGGTGAAGGTGAGTAGGACTA



ATACTAGTCTCGTGCAGACTCTAAACAGGTGGTTCGAGGGTGGCAAAGTGTCACCTCCCTCTATTCAG



TTCGAAAGAGCTTACTACAAAAAAGACGATCAGATTCACGAGGACGCAGCCAAGAGAAAGATACGCT



TCCAGATGCCAGCAACGGAATTAGTGCACGCCAGCGATGACGCTGGTTGGACCCCCAGCTACCTGCT



GGGCATCGACCCCGGTGAGTACGGAATGGGTCTCAGTTTGGTGTCCATCAACAATGGAGAGGTCCTG



GATTCTGGATTCATCCACATTAATTCCCTGATCAATTTCGCGTCCAAAAAAAGCAATCACCAGACCAA



AGTAGTCCCCCGCCAGCAGTACAAGTCCCCCTACGCGAATTATCTCGAGCAGTCAAAGGATTCAGCA



GCAGGGGATATAGCTCACATTCTGGATCGGCTAATCTACAAATTGAACGCCTTGCCTGTGTTCGAGGC



GCTGTCTGGCAACAGTCAGAGTGCTGCTGATCAGGTATGGACCAAAGTTCTATCCTTCTATACATGGG



GAGACAACGACGCACAGAACAGTATACGGAAGCAGCACTGGTTCGGTGCCTCACACTGGGATATTAA



GGGGATGCTGCGCCAACCCCCAACCGAAAAAAAACCCAAACCATATATAGCCTTTCCCGGGAGTCAA



GTGTCATCCTATGGAAATAGTCAAAGGTGTAGTTGTTGCGGCCGCAATCCCATTGAGCAGTTGCGTGA



GATGGCAAAGGACACGAGTATCAAGGAGCTGAAAATCCGAAATAGTGAGATCCAACTATTCGATGGT



ACAATCAAGCTGTTTAACCCCGACCCTTCCACCGTCATCGAGAGGCGGCGGCATAACCTAGGACCCTC



ACGCATTCCTGTGGCAGACCGAACTTTCAAGAATATTAGCCCTTCTTCGTTAGAGTTCAAGGAGCTCA



TTACTATCGTTTCTCGAAGCATCCGCCATAGCCCCGAATTTATTGCTAAGAAACGGGGTATCGGGTCT



GAGTACTTTTGTGCTTATTCTGACTGCAACTCCTCACTGAACTCAGAGGCCAATGCCGCGGCCAATGT



GGCACAGAAGTTTCAGAAGCAACTCTTTTTCGAACTCTGA





SEQ ID NO: 163
ATGAAACGTATTCTGAACTCTCTGAAAGTCGCCGCACTGAGGCTGCTGTTTCGAGGAAAGGGCTCAGA



GCTGGTGAAGACCGTCAAGTACCCTCTGGTTTCGCCCGTCCAGGGTGCTGTGGAAGAACTCGCCGAAG



CAATACGCCACGACAACCTACATTTATTTGGGCAGAAGGAAATCGTAGATCTGATGGAGAAGGACGA



GGGCACCCAGGTCTACTCGGTGGTGGACTTTTGGCTCGACACACTCCGTCTAGGGATGTTCTTCAGTC



CAAGTGCTAATGCCCTTAAGATCACTCTGGGGAAGTTTAACAGCGACCAAGTTTCCCCTTTCAGGAAG



GTTCTGGAGCAGTCCCCTTTCTTTCTCGCGGGTAGACTCAAAGTGGAGCCCGCTGAACGTATCCTCAG



CGTGGAGATCCGCAAGATCGGTAAGAGGGAGAATAGAGTGGAGAACTACGCCGCAGATGTAGAGAC



TTGTTTTATCGGTCAGCTGTCTAGTGATGAAAAGCAGTCTATCCAGAAGCTCGCTAACGATATCTGGG



ACTCTAAGGATCACGAAGAGCAAAGGATGCTTAAGGCGGATTTCTTTGCCATTCCCCTCATCAAAGAC



CCAAAGGCAGTGACCGAGGAAGATCCCGAGAATGAAACCGCAGGCAAACAGAAGCCTCTCGAATTA



TGTGTGTGCTTAGTGCCCGAGTTGTACACCCGCGGGTTCGGTTCAATAGCGGACTTCCTGGTCCAGCG



TCTGACACTATTAAGAGACAAAATGAGCACAGACACAGCAGAAGACTGCCTTGAGTATGTCGGCATA



GAGGAGGAGAAGGGTAATGGGATGAACTCGCTGCTGGGGACGTTCCTCAAGAACCTGCAGGGAGAC



GGGTTCGAACAGATCTTCCAATTTATGCTCGGCAGTTACGTGGGATGGCAAGGTAAGGAAGACGTCCT



ACGCGAACGGCTTGATTTGCTAGCGGAGAAGGTTAAAAGACTGCCGAAACCTAAGTTTGCCGGCGAG



TGGTCCGGCCATCGGATGTTCCTGCATGGTCAATTGAAGAGCTGGTCCTCTAACTTTTTCCGCCTGTTT



AACGAGACTAGGGAGCTCCTCGAAAGCATAAAATCCGACATCCAACACGCGACCATGTTAATCAGCT



ACGTCGAAGAGAAAGGGGGATACCACCCACAACTCTTGTCACAGTACAGGAAACTAATGGAGCAGCT



GCCAGCTCTCAGAACAAAGGTGTTAGATCCAGAGATAGAAATGACTCACATGAGCGAGGCGGTAAGG



TCGTACATTATGATCCACAAGTCGGTAGCAGGATTTCTGCCTGACTTACTCGAGTCCCTCGATAGGGA



CAAGGACAGGGAATTCCTGCTGAGTATATTTCCAAGGATCCCCAAAATTGACAAAAAAACTAAGGAA



ATCGTGGCCTGGGAGCTCCCAGGCGAGCCCGAAGAAGGATACCTGTTCACTGCCAATAATCTTTTTCG



CAACTTTCTGGAGAATCCTAAACATGTTCCACGTTTCATGGCAGAAAGGATCCCGGAAGATTGGACGC



GCCTGCGGTCCGCTCCCGTATGGTTTGACGGCATGGTGAAACAATGGCAGAAAGTGGTAAACCAGCT



GGTGGAGTCACCTGGAGCATTGTATCAGTTCAATGAAAGCTTTCTCCGACAACGTTTACAGGCAATGC



TGACAGTGTATAAGAGAGACCTGCAGACAGAGAAATTCCTTAAGTTGTTGGCTGATGTCTGCAGGCCT



CTGGTGGACTTCTTTGGGCTGGGGGGAAACGATATCATCTTCAAAAGCTGCCAGGACCCGAGGAAAC



AATGGCAAACTGTCATTCCCTTGAGTGTCCCCGCTGATGTGTACACCGCGTGTGAGGGGCTGGCAATC



CGGCTTCGTGAGACATTGGGATTTGAGTGGAAGAACCTTAAGGGCCATGAAAGGGAGGACTTTCTAA



GACTGCACCAGCTTTTAGGGAATCTGCTTTTCTGGATTCGAGATGCCAAACTGGTGGTGAAATTGGAA



GATTGGATGAATAATCCCTGTGTTCAGGAGTACGTTGAGGCTCGTAAGGCCATTGATCTCCCACTGGA



GATCTTCGGCTTTGAGGTCCCCATCTTCCTGAACGGATATCTGTTTAGTGAACTGAGGCAGTTAGAAC



TGCTGCTCCGCCGTAAGTCGGTTATGACCAGCTATTCGGTTAAGACAACTGGCAGTCCAAACAGGCTT



TTCCAGTTAGTCTACCTGCCATTAAATCCTTCCGACCCTGAGAAAAAAAATTCTAATAACTTTCAGGA



ACGCCTGGACACCCCCACTGGCTTATCACGTCGCTTCCTGGACCTTACTCTGGACGCCTTCGCCGGCA



AGTTGCTGACAGACCCCGTGACTCAAGAGCTTAAAACTATGGCTGGGTTCTACGATCACCTGTTTGGT



TTCAAGCTCCCATGTAAGCTGGCAGCCATGTCTAACCACCCTGGCTCTAGCAGCAAGATGGTCGTGTT



GGCCAAACCTAAAAAAGGGGTTGCATCTAATATAGGATTCGAACCAATCCCTGATCCCGCGCACCCC



GTATTCCGGGTGAGATCATCATGGCCAGAGCTGAAGTATCTGGAGGGGTTACTGTATCTTCCAGAAGA



CACTCCACTGACAATAGAGCTCGCAGAGACAAGTGTTAGTTGTCAGAGCGTCAGTAGCGTGGCATTC



GATCTGAAAAATCTGACTACTATCCTTGGACGCGTGGGTGAGTTCCGTGTGACCGCAGACCAGCCTTT



TAAGTTGACCCCCATCATCCCTGAGAAGGAGGAGTCCTTCATAGGAAAAACATATCTAGGCCTTGATG



CCGGGGAACGCTCAGGCGTAGGGTTCGCTATCGTCACAGTCGACGGGGATGGGTACGAGGTACAGCG



CCTGGGGGTGCATGAAGATACACAGCTGATGGCCCTACAGCAGGTGGCCTCTAAAAGCTTGAAGGAG



CCGGTGTTCCAGCCGCTCAGAAAGGGTACTTTTCGGCAGCAGGAACGTATTAGAAAATCTCTCAGAG



GATGTTATTGGAACTTCTATCACGCTCTGATGATTAAGTACCGCGCCAAGGTAGTGCACGAAGAGAGC



GTGGGCAGTTCCGGCCTGGTTGGGCAGTGGTTACGAGCATTCCAGAAGGACCTCAAGAAAGCCGATG



TGTTGCCAAAAAAGGGAGGCAAAAACGGAGTCGATAAGAAAAAGAGAGAGTCTTCTGCACAAGACA



CATTGTGGGGAGGGGCTTTTAGCAAGAAGGAAGAACAGCAGATAGCTTTCGAAGTCCAAGCTGCTGG



TTCTAGCCAGTTCTGCCTGAAGTGCGGATGGTGGTTCCAACTCGGAATGCGTGAGGTTAATCGCGTGC



AGGAATCCGGCGTCGTGCTGGATTGGAATCGGAGTATTGTCACATTCCTGATTGAGAGCTCTGGCGAG



AAAGTGTATGGGTTCTCCCCTCAGCAACTCGAAAAGGGGTTCAGACCAGACATTGAAACCTTCAAGA



AGATGGTTCGGGATTTCATGCGCCCGCCTATGTTTGACCGGAAGGGTCGCCCAGCAGCTGCCTACGAA



AGGTTTGTCTTGGGACGCCGGCATCGGCGGTATAGATTCGACAAGGTTTTTGAAGAACGATTCGGACG



ATCCGCGCTATTCATTTGCCCGAGGGTTGGCTGTGGCAACTTTGACCACAGCAGCGAGCAGTCAGCCG



TAGTGCTGGCTCTAATCGGATATATTGCCGACAAAGAGGGGATGAGCGGAAAAAAGCTAGTCTACGT



GCGTCTGGCAGAACTAATGGCGGAATGGAAATTGAAGAAACTGGAGAGGAGTAGAGTTGAGGAGCA



AAGCTCCGCTCAGTGA





SEQ ID NO: 164
ATGGCGGAGTCGAAGCAAATGCAGTGCAGGAAGTGTGGAGCCTCTATGAAGTACGAAGTGATCGGCC



TCGGGAAGAAAAGCTGCAGATATATGTGTCCCGACTGCGGGAATCACACATCTGCAAGAAAGATTCA



GAATAAGAAGAAAAGGGACAAGAAGTATGGATCTGCCAGTAAAGCACAAAGCCAACGAATCGCAGT



TGCAGGGGCCTTATACCCGGATAAAAAGGTTCAGACCATCAAGACTTATAAGTATCCAGCCGACCTG



AATGGTGAGGTCCATGACTCAGGGGTGGCCGAAAAAATAGCCCAAGCAATCCAGGAGGATGAAATA



GGGCTCCTCGGCCCCTCTTCCGAGTACGCCTGTTGGATCGCTAGCCAGAAACAGAGCGAGCCCTACAG



TGTTGTAGACTTTTGGTTTGACGCTGTGTGCGCCGGAGGCGTGTTCGCCTATTCTGGGGCTAGATTGCT



GTCTACCGTCCTGCAGCTATCTGGGGAGGAGAGCGTCCTACGCGCAGCCCTGGCATCCTCCCCTTTTG



TCGACGATATCAATCTGGCACAGGCCGAAAAATTTCTGGCGGTGTCCAGGCGAACCGGCCAAGATAA



GCTGGGGAAGCGCATTGGAGAGTGCTTCGCAGAGGGCCGACTTGAGGCCCTAGGCATCAAGGACCGG



ATGCGTGAATTTGTCCAGGCTATCGATGTCGCTCAGACCGCTGGGCAGCGTTTTGCCGCGAAACTGAA



AATCTTTGGGATTTCTCAGATGCCCGAGGCAAAGCAGTGGAACAATGACAGCGGACTCACCGTGTGC



ATCCTGCCCGACTATTACGTCCCAGAAGAAAATCGCGCAGATCAGTTGGTCGTCCTGCTAAGACGACT



GAGAGAGATAGCATACTGTATGGGGATCGAAGATGAGGCCGGTTTTGAACATCTTGGAATTGATCCT



GGCGCACTATCAAATTTTTCCAATGGCAATCCTAAACGCGGATTTTTGGGCCGCCTGCTGAACAATGA



TATTATTGCCTTAGCGAACAACATGTCCGCCATGACGCCTTACTGGGAGGGCAGGAAGGGAGAACTG



ATTGAAAGATTGGCTTGGCTGAAGCACCGTGCAGAGGGGCTTTATCTGAAGGAACCGCATTTTGGAA



ATAGTTGGGCCGACCATAGGTCTAGAATTTTTTCCAGAATAGCCGGGTGGCTTTCTGGGTGCGCTGGG



AAGCTAAAGATCGCCAAAGACCAGATCAGCGGAGTGCGTACTGATCTGTTCCTTCTGAAGAGACTGC



TGGATGCGGTCCCGCAGTCCGCCCCTTCTCCCGACTTCATAGCCTCTATCTCTGCCTTGGATCGCTTCC



TGGAGGCCGCAGAATCTAGTCAGGATCCTGCCGAACAGGTGAGGGCCCTATACGCCTTTCATCTGAAC



GCACCCGCGGTGCGAAGCATCGCCAACAAGGCAGTCCAGCGATCCGACAGCCAAGAATGGCTTATAA



AGGAACTGGACGCTGTGGACCACCTGGAGTTTAACAAGGCCTTTCCCTTCTTCTCTGATACGGGAAAG



AAGAAAAAGAAAGGGGCTAACTCGAATGGCGCTCCGTCCGAGGAGGAGTACACCGAGACTGAGAGC



ATCCAGCAGCCCGAGGACGCTGAGCAAGAGGTTAATGGTCAGGAAGGCAACGGGGCCTCGAAGAAC



CAGAAGAAGTTTCAGAGAATCCCCCGATTCTTCGGCGAGGGGAGTCGCAGCGAGTATCGCATCCTCA



CTGAAGCCCCGCAGTACTTCGACATGTTCTGTAACAACATGCGGGCCATCTTTATGCAATTAGAATCC



CAACCGCGTAAAGCTCCCAGGGATTTTAAGTGTTTCCTGCAGAATCGGCTGCAGAAATTGTATAAGCA



GACATTCCTGAACGCTCGATCCAACAAGTGCCGGGCATTACTAGAGTCCGTATTGATTAGTTGGGGAG



AGTTTTACACCTACGGGGCTAACGAGAAAAAATTTCGACTGCGTCATGAAGCTTCTGAGCGCTCCTCG



GACCCAGATTACGTGGTGCAACAGGCGCTGGAGATCGCTCGGAGGCTGTTTCTCTTCGGCTTTGAGTG



GAGGGACTGTAGCGCAGGTGAAAGAGTGGATCTGGTCGAAATACATAAGAAAGCCATATCTTTCCTG



TTGGCCATCACTCAGGCTGAGGTGTCTGTGGGCAGCTATAACTGGCTGGGCAATTCTACCGTGAGTCG



GTACCTGTCCGTGGCAGGGACTGATACCCTTTACGGCACCCAGCTGGAAGAATTCTTAAATGCAACCG



TGTTATCTCAGATGCGGGGGCTGGCTATCAGGTTATCATCTCAGGAACTGAAGGATGGATTTGACGTA



CAGCTGGAGTCTAGTTGCCAGGATAATCTGCAACACTTGCTCGTGTACAGGGCTTCACGAGACCTTGC



CGCCTGCAAGCGCGCTACTTGTCCAGCTGAGTTGGATCCTAAGATTCTGGTACTGCCCGTGGGGGCCT



TTATCGCTAGCGTGATGAAAATGATTGAAAGAGGGGATGAGCCTTTAGCTGGAGCTTATCTGAGACA



CAGACCCCATAGTTTCGGGTGGCAGATCCGCGTTCGAGGTGTGGCAGAGGTGGGAATGGACCAAGGG



ACCGCCCTGGCGTTCCAGAAACCGACCGAGAGCGAACCCTTCAAGATAAAGCCGTTTTCCGCTCAATA



CGGCCCCGTTCTATGGCTGAACAGCTCCAGTTATAGCCAGAGCCAGTACCTGGACGGGTTCCTATCAC



AGCCCAAGAACTGGAGTATGCGGGTGCTGCCACAGGCCGGCTCAGTGCGGGTAGAACAGCGCGTCGC



CTTGATTTGGAATCTCCAGGCCGGAAAGATGAGGCTGGAACGGAGCGGAGCGCGGGCTTTCTTCATG



CCCGTCCCATTCAGTTTCCGCCCCAGTGGCAGCGGCGACGAGGCAGTCCTGGCTCCAAATAGGTACCT



GGGACTCTTTCCACACAGCGGCGGCATAGAGTACGCTGTGGTCGATGTTCTTGACTCTGCCGGCTTCA



AAATACTCGAGAGAGGAACAATAGCCGTCAATGGCTTCTCCCAGAAACGAGGAGAAAGACAAGAGG



AAGCCCATCGCGAAAAACAAAGACGCGGTATCTCCGATATTGGGCGCAAGAAGCCAGTCCAGGCCGA



AGTCGATGCGGCCAACGAGCTCCATCGAAAATACACCGATGTTGCTACTCGGCTGGGGTGTCGAATTG



TCGTTCAATGGGCACCCCAACCCAAACCAGGCACTGCGCCGACCGCTCAGACTGTGTACGCTAGGGC



CGTGAGGACTGAAGCACCAAGATCCGGCAATCAGGAAGATCACGCCAGGATGAAATCTTCCTGGGGA



TACACATGGGGTACGTATTGGGAAAAAAGGAAGCCCGAGGACATCCTCGGCATTAGTACCCAGGTGT



ATTGGACAGGCGGGATCGGCGAGTCCTGCCCGGCTGTCGCCGTCGCGCTATTGGGACACATCAGGGC



CACCTCAACCCAGACTGAATGGGAGAAAGAGGAAGTCGTGTTTGGGCGATTGAAAAAGTTCTTCCCA



TCCTGA





SEQ ID NO: 165
ATGGAGAAGCGCATCAATAAAATTCGCAAGAAGCTGTCTGCCGATAACGCCACAAAACCAGTTAGTC



GAAGCGGCCCAATGAAGACCCTGCTAGTTCGAGTGATGACTGATGATCTGAAGAAAAGGCTCGAAAA



GCGACGCAAGAAGCCTGAGGTAATGCCTCAGGTTATAAGTAACAATGCAGCAAACAATCTGCGGATG



CTGCTTGACGATTACACAAAGATGAAGGAAGCCATTCTCCAGGTGTATTGGCAGGAGTTCAAGGATG



ATCACGTAGGCCTGATGTGTAAATTCGCGCAACCTGCAAGCAAGAAGATCGACCAAAACAAGCTGAA



ACCCGAGATGGATGAAAAAGGCAATTTAACAACCGCCGGATTCGCTTGTTCCCAGTGTGGGCAGCCA



CTGTTCGTGTACAAGTTAGAACAGGTGTCGGAAAAAGGAAAGGCATACACTAACTACTTTGGACGGT



GCAATGTTGCAGAACACGAAAAGCTGATACTGCTTGCCCAGCTTAAGCCCGAAAAAGACAGCGACGA



AGCGGTGACCTACAGCCTGGGAAAATTCGGGCAGCGGGCACTGGACTTCTATTCTATCCACGTTACCA



AGGAGAGCACCCACCCAGTGAAGCCGTTGGCCCAAATCGCTGGAAACCGGTACGCCAGCGGACCAGT



CGGCAAGGCCCTGTCCGATGCCTGTATGGGCACAATTGCTTCTTTCCTGTCCAAGTACCAGGACATCA



TAATCGAGCACCAAAAAGTTGTGAAAGGGAATCAGAAACGCCTGGAATCCCTTCGAGAACTGGCCGG



CAAGGAGAACCTTGAGTACCCGTCCGTGACCCTGCCTCCACAGCCACATACCAAAGAGGGCGTAGAC



GCGTATAATGAGGTCATTGCCCGCGTTCGCATGTGGGTTAATTTAAACCTGTGGCAGAAATTAAAACT



AAGCCGAGATGATGCTAAACCGTTACTGAGATTGAAGGGATTCCCTAGCTTTCCTGTGGTGGAGAGA



AGGGAAAACGAGGTTGATTGGTGGAATACTATTAATGAGGTGAAAAAGCTTATTGACGCCAAGAGGG



ATATGGGCAGGGTGTTCTGGAGCGGGGTGACTGCCGAAAAGAGAAATACCATCCTCGAGGGATACAA



TTACCTCCCCAACGAGAATGATCATAAGAAAAGAGAGGGGAGCTTAGAGAATCCAAAGAAACCTGCA



AAGAGGCAATTCGGTGATCTCCTGCTCTACCTCGAGAAGAAATACGCGGGGGACTGGGGAAAAGTTT



TTGACGAAGCCTGGGAGCGCATTGACAAGAAGATCGCCGGGCTGACGTCTCACATTGAACGGGAAGA



GGCACGGAATGCAGAGGACGCCCAGTCTAAGGCCGTGCTGACTGACTGGCTGCGCGCAAAGGCCTCC



TTCGTGCTCGAACGTCTGAAGGAAATGGATGAGAAAGAGTTTTACGCGTGTGAAATACAGCTGCAGA



AGTGGTACGGCGATCTAAGGGGAAATCCCTTCGCAGTGGAAGCCGAGAATAGGGTAGTTGACATCAG



TGGGTTCTCCATCGGCAGTGATGGACATTCTATCCAGTATAGAAACCTGCTCGCCTGGAAGTACTTAG



AGAACGGCAAGAGAGAGTTCTATCTGCTGATGAACTACGGGAAAAAAGGTAGAATTCGCTTTACAGA



TGGCACCGACATAAAGAAGTCCGGAAAGTGGCAAGGCCTCTTATACGGAGGCGGCAAAGCAAAGGT



GATAGACTTGACTTTTGACCCTGACGACGAACAGCTGATAATCTTGCCGCTGGCCTTTGGCACAAGAC



AAGGTAGGGAATTTATCTGGAATGATCTTCTTTCTCTCGAGACCGGACTCATCAAGCTCGCAAACGGA



AGGGTCATCGAGAAGACAATCTACAATAAAAAGATAGGCCGAGACGAGCCAGCCCTGTTTGTGGCTT



TGACATTTGAGCGGAGAGAGGTCGTAGATCCCAGCAACATCAAACCCGTGAACCTGATCGGTGTTGA



CAGGGGCGAGAACATCCCGGCGGTTATCGCACTGACGGATCCAGAAGGATGTCCTCTGCCCGAGTTC



AAAGATTCATCGGGAGGGCCAACCGACATTTTGAGGATAGGGGAGGGGTACAAGGAGAAGCAGCGA



GCTATCCAGGCGGCCAAAGAAGTGGAGCAACGAAGAGCTGGTGGTTATTCTCGCAAGTTCGCTTCCA



AAAGTCGTAACCTGGCTGACGATATGGTGCGCAATTCTGCCCGTGACCTTTTCTACCACGCCGTTACA



CACGACGCCGTGTTAGTGTTTGAAAATCTTAGTCGAGGCTTCGGGCGACAGGGGAAGCGGACCTTTAT



GACCGAGAGACAGTATACAAAAATGGAGGATTGGCTGACCGCCAAACTGGCGTATGAAGGACTCACA



TCCAAGACCTATCTCTCAAAAACTTTGGCCCAGTATACATCTAAGACGTGCAGTAACTGTGGCTTCAC



CATTACCACAGCTGACTACGATGGCATGCTGGTCCGCTTAAAAAAGACATCTGACGGCTGGGCTACTA



CCCTCAACAATAAAGAGCTCAAAGCCGAAGGACAAATTACCTATTATAACAGGTATAAAAGACAGAC



TGTCGAGAAGGAGTTGAGCGCGGAGCTGGACCGCCTATCAGAGGAGTCAGGGAACAACGATATCTCT



AAGTGGACTAAGGGACGCCGAGACGAGGCGTTGTTCTTGCTGAAAAAGCGGTTCTCTCATCGACCCG



TGCAGGAGCAGTTCGTGTGTCTGGACTGCGGCCACGAGGTTCATGCTGATGAGCAAGCTGCTCTAAAT



ATTGCCCGTAGTTGGTTGTTCCTGAACAGCAATTCAACAGAGTTCAAGTCATACAAGAGCGGAAAGC



AGCCGTTTGTGGGCGCATGGCAGGCATTTTACAAAAGACGCCTGAAGGAAGTGTGGAAGCCAAACGC



C





SEQ ID NO: 166
ATGAAAAGGATTAACAAAATCCGAAGGCGGCTTGTAAAGGATTCTAACACCAAAAAGGCTGGCAAG



ACGGGGCCCATGAAAACATTACTCGTTAGAGTTATGACCCCCGACCTCAGAGAGCGACTGGAAAATT



TACGCAAGAAGCCAGAGAACATACCTCAGCCAATTAGTAATACCTCTCGGGCAAACCTAAACAAGTT



GCTTACTGATTACACGGAGATGAAAAAGGCCATACTGCATGTGTACTGGGAGGAGTTTCAAAAGGAC



CCTGTCGGGCTAATGAGCAGGGTGGCTCAGCCTGCACCTAAAAACATCGACCAGCGGAAACTCATCC



CAGTTAAGGACGGAAATGAGAGATTGACAAGTTCAGGTTTCGCCTGCTCACAGTGCTGTCAACCGCTG



TACGTTTATAAGTTAGAACAAGTGAATGACAAAGGAAAGCCTCACACAAATTATTTTGGCCGGTGTA



ATGTCTCTGAGCATGAGCGTCTGATTCTGTTGTCCCCGCATAAACCGGAAGCTAATGACGAGCTCGTA



ACCTACAGCTTGGGGAAGTTTGGCCAAAGAGCATTGGACTTCTATTCAATCCATGTGACCCGCGAATC



CAATCATCCCGTCAAGCCCTTGGAGCAGATAGGGGGCAATAGTTGCGCTTCTGGCCCTGTGGGCAAA



GCCCTGTCCGACGCCTGTATGGGAGCCGTGGCTTCATTCCTGACCAAATATCAGGATATCATCTTGGA



GCACCAGAAAGTGATCAAGAAAAATGAAAAAAGGTTAGCAAACCTCAAGGATATTGCAAGCGCTAA



CGGCTTGGCTTTTCCTAAAATCACACTTCCACCTCAGCCTCACACAAAGGAAGGCATCGAGGCATACA



ACAATGTGGTGGCCCAGATCGTCATCTGGGTTAACTTAAACCTGTGGCAGAAACTTAAAATTGGCAGG



GATGAGGCAAAACCCTTACAGCGCCTGAAAGGATTCCCCAGCTTTCCACTGGTGGAGCGCCAGGCTA



ACGAAGTGGACTGGTGGGATATGGTGTGTAACGTCAAGAAGCTCATCAATGAAAAGAAAGAGGACG



GTAAAGTCTTCTGGCAGAACCTCGCCGGTTACAAACGGCAGGAGGCGCTGTTACCTTATCTGTCGAGT



GAAGAGGACCGGAAAAAAGGCAAGAAATTTGCTCGTTATCAGTTTGGTGATTTGCTCCTACATTTGGA



GAAGAAGCACGGCGAGGACTGGGGAAAAGTATACGATGAGGCCTGGGAGAGGATTGACAAAAAGGT



GGAGGGACTGTCAAAGCACATCAAGCTCGAAGAAGAGCGCAGAAGCGAGGACGCCCAATCCAAAGC



AGCGCTGACTGACTGGCTGCGGGCGAAGGCCAGTTTTGTAATCGAAGGCCTTAAAGAAGCCGACAAG



GATGAATTCTGCAGATGCGAATTAAAACTCCAGAAGTGGTACGGCGATCTCCGAGGTAAGCCTTTCGC



AATCGAGGCCGAGAATTCCATACTGGACATTAGTGGATTCAGTAAACAGTATAATTGTGCCTTTATAT



GGCAGAAGGATGGTGTCAAGAAACTCAACCTGTACCTTATTATTAATTATTTCAAAGGCGGGAAACTG



AGATTTAAGAAGATAAAGCCTGAAGCCTTTGAGGCGAACCGATTCTACACAGTTATTAACAAGAAAT



CTGGTGAAATTGTACCCATGGAGGTAAACTTCAACTTCGATGATCCCAATCTGATTATATTGCCACTA



GCTTTTGGCAAGCGGCAGGGTAGGGAATTCATTTGGAACGATTTGCTTTCACTGGAAACAGGGTCCCT



TAAGCTGGCAAACGGGAGAGTGATTGAAAAGACATTGTACAATCGGAGGACACGTCAGGATGAACCT



GCCCTTTTCGTGGCTCTGACATTCGAGCGCAGGGAGGTTCTGGACTCTAGCAATATCAAGCCAATGAA



CCTGATCGGCATAGACCGAGGAGAGAATATTCCGGCTGTGATCGCACTCACCGATCCCGAAGGATGT



CCCCTTTCTCGGTTCAAGGACTCCTTAGGCAATCCAACTCATATCCTGAGAATCGGCGAGTCATACAA



GGAGAAGCAGCGAACAATTCAGGCCGCCAAGGAAGTCGAGCAGAGGCGAGCTGGCGGCTACAGCCG



TAAATACGCTAGTAAAGCTAAGAACCTGGCCGACGATATGGTGCGCAATACTGCTAGAGACCTGCTG



TACTATGCAGTGACGCAGGACGCAATGCTGATATTCGAGAATCTGTCCAGAGGATTCGGAAGGCAGG



GCAAGCGGACGTTCATGGCCGAGCGCCAGTATACAAGGATGGAGGATTGGTTAACGGCCAAGCTTGC



CTATGAGGGGCTACCTAGTAAGACCTATCTGTCTAAGACGCTGGCTCAATACACCAGTAAGACCTGCT



CAAACTGTGGCTTTACAATCACTTCTGCTGATTATGATAGAGTGCTCGAGAAGCTAAAAAAAACTGCC



ACCGGCTGGATGACTACTATTAATGGGAAGGAACTGAAAGTGGAAGGACAGATTACCTATTATAATC



GCTACAAGCGTCAAAACGTCGTCAAGGACCTGTCGGTGGAATTGGACAGACTCAGTGAAGAGTCCGT



GAACAATGATATCAGCTCCTGGACAAAAGGGCGCAGTGGGGAGGCACTCAGCTTGCTTAAAAAGAGG



TTTTCACATCGGCCGGTCCAGGAGAAATTTGTCTGCCTGAACTGCGGATTCGAGACACACGCCGACGA



GCAGGCAGCACTGAACATTGCCAGATCCTGGCTGTTCCTTAGGTCCCAGGAATATAAGAAGTACCAG



ACTAACAAAACCACGGGAAACACAGATAAAAGGGCCTTTGTCGAAACTTGGCAATCCTTTTACCGGA



AGAAGTTAAAGGAAGTGTGGAAGCCC





SEQ ID NO: 167
ATGGATAAGAAATACTCAATAGGCTTAGCAATCGGCACAAATAGCGTCGGATGGGCGGTGATCACTG



ATGAATATAAGGTTCCGTCTAAAAAGTTCAAGGTTCTGGGAAATACAGACCGCCACAGTATCAAAAA



AAATCTTATAGGGGCTCTTTTATTTGACAGTGGAGAGACAGCGGAAGCGACTCGTCTCAAACGGACA



GCTCGTAGAAGGTATACACGTCGGAAGAATCGTATTTGTTATCTACAGGAGATTTTTTCAAATGAGAT



GGCGAAAGTAGATGATAGTTTCTTTCATCGACTTGAAGAGTCTTTTTTGGTGGAAGAAGACAAGAAGC



ATGAACGTCATCCTATTTTTGGAAATATAGTAGATGAAGTTGCTTATCATGAGAAATATCCAACTATC



TATCATCTGCGAAAAAAATTGGTAGATTCTACTGATAAAGCGGATTTGCGCTTAATCTATTTGGCCTT



AGCGCATATGATTAAGTTTCGTGGTCATTTTTTGATTGAGGGAGATTTAAATCCTGATAATAGTGATGT



GGACAAACTATTTATCCAGTTGGTACAAACCTACAATCAATTATTTGAAGAAAACCCTATTAACGCAA



GTGGAGTAGATGCTAAAGCGATTCTTTCTGCACGATTGAGTAAATCAAGACGATTAGAAAATCTCATT



GCTCAGCTCCCCGGTGAGAAGAAAAATGGCTTATTTGGGAATCTCATTGCTTTGTCATTGGGTTTGAC



CCCTAATTTTAAATCAAATTTTGATTTGGCAGAAGATGCTAAATTACAGCTTTCAAAAGATACTTACG



ATGATGATTTAGATAATTTATTGGCGCAAATTGGAGATCAATATGCTGATTTGTTTTTGGCAGCTAAG



AATTTATCAGATGCTATTTTACTTTCAGATATCCTAAGAGTAAATACTGAAATAACTAAGGCTCCCCT



ATCAGCTTCAATGATTAAACGCTACGATGAACATCATCAAGACTTGACTCTTTTAAAAGCTTTAGTTC



GACAACAACTTCCAGAAAAGTATAAAGAAATCTTTTTTGATCAATCAAAAAACGGATATGCAGGTTA



TATTGATGGGGGAGCTAGCCAAGAAGAATTTTATAAATTTATCAAACCAATTTTAGAAAAAATGGAT



GGTACTGAGGAATTATTGGTGAAACTAAATCGTGAAGATTTGCTGCGCAAGCAACGGACCTTTGACA



ACGGCTCTATTCCCCATCAAATTCACTTGGGTGAGCTGCATGCTATTTTGAGAAGACAAGAAGACTTT



TATCCATTTTTAAAAGACAATCGTGAGAAGATTGAAAAAATCTTGACTTTTCGAATTCCTTATTATGTT



GGTCCATTGGCGCGTGGCAATAGTCGTTTTGCATGGATGACTCGGAAGTCTGAAGAAACAATTACCCC



ATGGAATTTTGAAGAAGTTGTCGATAAAGGTGCTTCAGCTCAATCATTTATTGAACGCATGACAAACT



TTGATAAAAATCTTCCAAATGAAAAAGTACTACCAAAACATAGTTTGCTTTATGAGTATTTTACGGTT



TATAACGAATTGACAAAGGTCAAATATGTTACTGAAGGAATGCGAAAACCAGCATTTCTTTCAGGTG



AACAGAAGAAAGCCATTGTTGATTTACTCTTCAAAACAAATCGAAAAGTAACCGTTAAGCAATTAAA



AGAAGATTATTTCAAAAAAATAGAATGTTTTGATAGTGTTGAAATTTCAGGAGTTGAAGATAGATTTA



ATGCTTCATTAGGTACCTACCATGATTTGCTAAAAATTATTAAAGATAAAGATTTTTTGGATAATGAA



GAAAATGAAGATATCTTAGAGGATATTGTTTTAACATTGACCTTATTTGAAGATAGGGAGATGATTGA



GGAAAGACTTAAAACATATGCTCACCTCTTTGATGATAAGGTGATGAAACAGCTTAAACGTCGCCGTT



ATACTGGTTGGGGACGTTTGTCTCGAAAATTGATTAATGGTATTAGGGATAAGCAATCTGGCAAAACA



ATATTAGATTTTTTGAAATCAGATGGTTTTGCCAATCGCAATTTTATGCAGCTGATCCATGATGATAGT



TTGACATTTAAAGAAGACATTCAAAAAGCACAAGTGTCTGGACAAGGCGATAGTTTACATGAACATA



TTGCAAATTTAGCTGGTAGCCCTGCTATTAAAAAAGGTATTTTACAGACTGTAAAAGTTGTTGATGAA



TTGGTCAAAGTAATGGGGCGGCATAAGCCAGAAAATATCGTTATTGAAATGGCACGTGAAAATCAGA



CAACTCAAAAGGGCCAGAAAAATTCGCGAGAGCGTATGAAACGAATCGAAGAAGGTATCAAAGAAT



TAGGAAGTCAGATTCTTAAAGAGCATCCTGTTGAAAATACTCAATTGCAAAATGAAAAGCTCTATCTC



TATTATCTCCAAAATGGAAGAGACATGTATGTGGACCAAGAATTAGATATTAATCGTTTAAGTGATTA



TGATGTCGATCACATTGTTCCACAAAGTTTCCTTAAAGACGATTCAATAGACAATAAGGTCTTAACGC



GTTCTGATAAAAATCGTGGTAAATCGGATAACGTTCCAAGTGAAGAAGTAGTCAAAAAGATGAAAAA



CTATTGGAGACAACTTCTAAACGCCAAGTTAATCACTCAACGTAAGTTTGATAATTTAACGAAAGCTG



AACGTGGAGGTTTGAGTGAACTTGATAAAGCTGGTTTTATCAAACGCCAATTGGTTGAAACTCGCCAA



ATCACTAAGCATGTGGCACAAATTTTGGATAGTCGCATGAATACTAAATACGATGAAAATGATAAAC



TTATTCGAGAGGTTAAAGTGATTACCTTAAAATCTAAATTAGTTTCTGACTTCCGAAAAGATTTCCAAT



TCTATAAAGTACGTGAGATTAACAATTACCATCATGCCCATGATGCGTATCTAAATGCCGTCGTTGGA



ACTGCTTTGATTAAGAAATATCCAAAACTTGAATCGGAGTTTGTCTATGGTGATTATAAAGTTTATGA



TGTTCGTAAAATGATTGCTAAGTCTGAGCAAGAAATAGGCAAAGCAACCGCAAAATATTTCTTTTACT



CTAATATCATGAACTTCTTCAAAACAGAAATTACACTTGCAAATGGAGAGATTCGCAAACGCCCTCTA



ATCGAAACTAATGGGGAAACTGGAGAAATTGTCTGGGATAAAGGGCGAGATTTTGCCACAGTGCGCA



AAGTATTGTCCATGCCCCAAGTCAATATTGTCAAGAAAACAGAAGTACAGACAGGCGGATTCTCCAA



GGAGTCAATTTTACCAAAAAGAAATTCGGACAAGCTTATTGCTCGTAAAAAAGACTGGGATCCAAAA



AAATATGGTGGTTTTGATAGTCCAACGGTAGCTTATTCAGTCCTAGTGGTTGCTAAGGTGGAAAAAGG



GAAATCGAAGAAGTTAAAATCCGTTAAAGAGTTACTAGGGATCACAATTATGGAAAGAAGTTCCTTT



GAAAAAAATCCGATTGACTTTTTAGAAGCTAAAGGATATAAGGAAGTTAAAAAAGACTTAATCATTA



AACTACCTAAATATAGTCTTTTTGAGTTAGAAAACGGTCGTAAACGGATGCTGGCTAGTGCCGGAGAA



TTACAAAAAGGAAATGAGCTGGCTCTGCCAAGCAAATATGTGAATTTTTTATATTTAGCTAGTCATTA



TGAAAAGTTGAAGGGTAGTCCAGAAGATAACGAACAAAAACAATTGTTTGTGGAGCAGCATAAGCAT



TATTTAGATGAGATTATTGAGCAAATCAGTGAATTTTCTAAGCGTGTTATTTTAGCAGATGCCAATTTA



GATAAAGTTCTTAGTGCATATAACAAACATAGAGACAAACCAATACGTGAACAAGCAGAAAATATTA



TTCATTTATTTACGTTGACGAATCTTGGAGCTCCCGCTGCTTTTAAATATTTTGATACAACAATTGATC



GTAAACGATATACGTCTACAAAAGAAGTTTTAGATGCCACTCTTATCCATCAATCCATCACTGGTCTTT



ATGAAACACGCATTGATTTGAGTCAGCTAGGAGGTGACTGA





SEQ ID NO: 168
ATGGATAAGAAGTATTCAATTGGACTTGCGATTGGCACTAACAGTGTGGGCTGGGCGGTGATTACAG



ACGAGTATAAGGTGCCGTCAAAAAAGTTTAAAGTTCTGGGCAACACTGATCGCCATTCCATCAAGAA



AAACCTAATCGGGGCCCTTCTTTTTGATAGTGGCGAAACGGCCGAGGCGACGCGTCTAAAACGTACC



GCGCGGCGTCGCTACACCCGACGAAAAAACCGTATTTGTTACCTTCAGGAGATCTTCAGTAACGAAAT



GGCTAAGGTGGACGATTCATTCTTCCACCGTCTGGAGGAGTCCTTTTTAGTTGAAGAAGACAAGAAGC



ATGAGCGACACCCAATTTTTGGTAACATTGTCGACGAAGTCGCCTATCACGAAAAATATCCGACCATT



TATCACCTGCGCAAAAAACTGGTCGATAGCACGGATAAAGCGGATCTGCGGCTTATTTACCTGGCGCT



TGCCCACATGATCAAGTTCCGCGGCCACTTCCTGATAGAAGGAGACCTGAACCCGGATAATAGCGAT



GTAGACAAACTGTTTATTCAGCTGGTCCAGACCTACAACCAGCTGTTTGAAGAAAATCCGATTAATGC



GTCAGGCGTGGATGCGAAAGCGATACTGAGTGCCCGCCTGTCGAAATCTCGCCGTCTCGAAAATCTG



ATTGCACAGCTGCCCGGCGAAAAAAAAAACGGTCTTTTTGGCAATCTGATCGCGCTGTCACTGGGCCT



GACACCAAATTTTAAGAGCAACTTCGACCTGGCAGAGGATGCGAAGCTTCAACTGTCGAAGGACACC



TATGACGATGATCTGGATAATCTTCTGGCACAAATCGGTGATCAGTATGCGGATTTATTCCTTGCAGC



GAAAAACCTATCTGACGCAATTCTGTTGAGCGATATCCTCCGCGTCAACACCGAAATCACTAAAGCCC



CCCTGTCAGCGTCGATGATTAAACGTTATGATGAGCACCATCAGGATCTGACCTTGCTAAAGGCGCTG



GTGCGACAGCAGCTTCCCGAAAAATATAAAGAGATCTTTTTTGATCAATCGAAGAATGGTTATGCCGG



ATACATTGATGGCGGAGCCAGTCAGGAAGAATTTTACAAATTCATCAAACCGATCCTGGAAAAAATG



GATGGCACAGAAGAACTGCTTGTGAAATTGAACCGGGAAGATTTACTGCGCAAACAGCGTACGTTCG



ACAACGGCTCCATACCCCATCAGATTCACTTAGGTGAGCTGCATGCAATACTCCGTCGCCAGGAAGAT



TTTTATCCATTTTTAAAAGACAACCGTGAGAAGATTGAAAAAATTTTAACTTTTCGTATTCCATATTAC



GTCGGGCCTTTGGCCCGAGGTAACTCTCGATTCGCCTGGATGACGAGAAAAAGCGAGGAGACCATCA



CTCCGTGGAATTTTGAAGAGGTTGTTGATAAAGGCGCGAGCGCCCAGTCGTTTATCGAACGTATGACC



AACTTTGATAAAAATCTGCCGAATGAAAAAGTGCTTCCGAAGCATTCTCTGTTGTATGAATATTTCAC



TGTGTACAATGAGTTAACGAAAGTGAAATATGTGACCGAAGGCATGCGGAAACCTGCTTTTCTGTCCG



GAGAACAGAAAAAAGCAATTGTGGACCTGCTGTTCAAAACGAACCGGAAAGTAACTGTGAAGCAGCT



GAAAGAGGACTACTTCAAAAAAATCGAATGCTTCGACTCAGTAGAGATCTCTGGTGTTGAAGATCGC



TTCAACGCGAGTCTGGGAACGTACCATGATTTGTTGAAAATCATCAAAGATAAAGACTTTCTGGATAA



CGAAGAGAATGAGGACATTCTTGAAGATATTGTTTTGACACTGACTCTGTTTGAGGATCGCGAAATGA



TTGAAGAGCGCCTGAAAACGTATGCCCATTTATTCGATGACAAAGTCATGAAGCAGCTGAAACGTCG



CCGCTATACTGGGTGGGGCAGACTTTCACGTAAATTGATCAATGGTATAAGAGACAAACAGAGCGGC



AAAACTATCTTAGATTTCCTGAAGAGTGATGGATTTGCCAACCGGAATTTTATGCAGCTTATACATGA



TGACTCGCTAACGTTTAAAGAAGACATTCAGAAGGCGCAGGTCAGCGGCCAGGGTGATTCGCTGCAT



GAACACATTGCAAATCTTGCCGGATCGCCAGCGATCAAAAAAGGCATCCTTCAGACAGTAAAAGTTG



TGGATGAACTGGTGAAAGTAATGGGTCGTCACAAGCCAGAAAATATTGTGATCGAAATGGCCCGGGA



AAATCAGACTACTCAAAAAGGTCAGAAAAATTCTCGCGAGCGTATGAAACGTATTGAAGAAGGCATC



AAAGAGCTAGGCAGCCAGATATTAAAGGAACATCCGGTTGAGAACACTCAGCTGCAGAATGAAAAA



CTGTATCTGTATTATCTTCAGAACGGCCGTGACATGTATGTTGATCAAGAACTGGATATCAATCGCTT



GTCCGATTATGACGTGGATCATATTGTTCCGCAAAGCTTTCTGAAAGACGATTCTATTGACAATAAAG



TACTGACACGTTCGGACAAAAACCGTGGTAAAAGCGATAACGTACCGTCGGAAGAAGTTGTTAAGAA



AATGAAAAATTATTGGCGCCAACTCCTGAATGCTAAATTGATTACCCAGCGGAAATTTGATAACTTAA



CCAAAGCCGAGCGGGGTGGCTTAAGTGAACTGGATAAAGCGGGTTTTATTAAACGCCAACTGGTAGA



AACCCGCCAGATAACGAAACATGTAGCTCAAATCCTCGATAGTCGCATGAATACGAAATATGACGAA



AATGATAAATTGATCCGTGAAGTAAAAGTGATTACTCTTAAAAGCAAATTGGTATCTGATTTTCGGAA



AGATTTCCAATTCTATAAGGTGAGAGAAATTAACAATTACCATCATGCACATGATGCGTATTTAAATG



CAGTTGTTGGCACCGCCTTAATCAAAAAATATCCGAAATTAGAATCTGAGTTCGTGTATGGTGATTAT



AAAGTTTATGATGTTCGAAAAATGATTGCTAAGTCTGAACAGGAAATCGGCAAAGCGACCGCAAAGT



ATTTTTTTTATAGCAATATTATGAATTTTTTTAAAACTGAGATTACCCTGGCGAATGGCGAAATTCGCA



AACGTCCTCTGATTGAAACCAATGGCGAAACCGGCGAGATAGTATGGGACAAGGGCCGTGATTTTGC



GACCGTCCGGAAAGTCCTGTCAATGCCGCAGGTGAATATTGTCAAGAAAACAGAAGTTCAGACAGGC



GGTTTTAGTAAAGAGTCTATTCTGCCCAAACGTAATTCGGATAAATTGATTGCCCGCAAGAAAGATTG



GGATCCGAAGAAATATGGTGGATTCGATTCTCCGACGGTCGCCTATAGCGTTCTAGTCGTCGCCAAGG



TCGAAAAAGGTAAATCCAAAAAACTGAAATCTGTGAAAGAACTGTTAGGCATTACAATCATGGAACG



TAGTAGTTTTGAAAAGAACCCGATCGACTTCCTCGAGGCGAAAGGCTACAAAGAAGTCAAGAAGGAT



TTGATTATTAAACTCCCAAAATATTCATTATTTGAGTTAGAAAACGGTAGGAAGCGTATGCTGGCGAG



TGCTGGGGAATTACAGAAAGGGAATGAGTTAGCACTGCCGTCAAAATATGTGAACTTTCTGTATCTGG



CCTCCCATTACGAGAAACTGAAAGGTAGCCCGGAAGATAATGAACAGAAACAACTATTTGTCGAGCA



ACACAAACATTATCTGGATGAAATTATTGAACAGATTAGTGAATTCTCTAAACGTGTTATTTTAGCGG



ATGCCAACCTTGACAAGGTGCTGAGCGCATATAATAAACACCGTGATAAACCCATTCGTGAACAGGC



TGAAAATATCATACATCTGTTCACGTTAACCAACTTGGGAGCTCCTGCCGCTTTTAAATATTTCGATAC



CACAATTGACCGCAAACGTTATACGTCTACAAAAGAGGTGCTCGATGCGACCCTGATCCACCAGTCTA



TTACAGGCCTGTATGAAACTCGTATCGACCTGTCACAACTGGGCGGCGACTGA





SEQ ID NO: 169
ATGGACAAGAAATATTCAATCGGTTTAGCAATAGGAACTAACTCAGTAGGTTGGGCTGTAATTACAG



ACGAATACAAGGTACCGTCCAAAAAGTTTAAGGTGTTGGGGAACACAGATAGACACTCTATAAAAAA



AAATTTAATAGGCGCTTTACTTTTCGATTCAGGCGAAACTGCAGAAGCGACACGTCTGAAGAGAACC



GCTAGACGTAGATACACGAGGAGAAAGAACAGAATATGTTACCTACAAGAAATTTTTTCTAATGAGA



TGGCTAAGGTGGATGATTCGTTTTTTCATAGACTCGAAGAATCTTTCTTAGTTGAAGAAGATAAAAAA



CACGAAAGGCATCCTATCTTTGGAAACATAGTTGATGAGGTGGCTTACCATGAAAAATATCCCACTAT



ATATCACCTTAGAAAAAAGTTGGTTGATTCAACCGACAAAGCGGATCTAAGGTTAATTTACCTCGCGT



TGGCTCACATGATAAAATTTAGAGGACATTTCTTGATCGAAGGTGATTTAAATCCCGATAACTCTGAT



GTAGATAAACTGTTCATCCAGTTGGTTCAAACATATAATCAGTTGTTCGAAGAGAACCCCATTAACGC



ATCAGGTGTTGATGCTAAAGCAATCTTATCAGCAAGGTTGAGCAAGAGCAGACGTCTGGAAAACTTG



ATTGCCCAATTGCCAGGTGAAAAGAAGAACGGTCTTTTTGGAAATTTAATTGCACTTTCACTTGGGTT



GACACCGAATTTTAAAAGCAATTTCGACCTCGCTGAGGATGCTAAACTCCAGTTATCTAAGGATACAT



ATGACGATGATTTGGATAATCTATTGGCCCAGATAGGTGATCAGTATGCAGATTTGTTTTTGGCAGCT



AAGAATTTATCAGATGCAATTCTACTGAGCGATATTTTAAGGGTGAATACAGAAATAACTAAAGCAC



CTTTGTCTGCATCTATGATAAAAAGATACGATGAACACCATCAAGATCTCACACTATTAAAAGCTTTA



GTTAGACAACAATTACCAGAAAAATATAAAGAAATCTTTTTCGATCAGTCCAAGAACGGATACGCCG



GCTATATAGATGGCGGTGCCTCCCAAGAAGAATTTTACAAATTTATCAAACCCATTTTGGAAAAGATG



GATGGTACTGAAGAATTATTGGTCAAATTAAACAGGGAAGATTTATTAAGAAAACAAAGGACCTTTG



ATAATGGTTCTATTCCACACCAAATCCATCTAGGGGAATTACATGCGATTCTTAGAAGACAAGAAGAT



TTTTATCCATTCTTGAAAGATAACAGGGAAAAGATAGAGAAAATCTTAACTTTTAGAATTCCCTACTA



CGTCGGGCCCTTAGCTAGGGGGAATTCTAGATTCGCCTGGATGACACGCAAATCAGAAGAAACAATT



ACGCCTTGGAATTTTGAAGAAGTTGTTGATAAAGGAGCCTCTGCTCAATCTTTTATTGAACGAATGAC



CAATTTTGATAAGAATTTACCCAATGAAAAGGTCTTACCCAAACATTCACTCCTATACGAGTACTTTA



CTGTTTACAATGAGTTGACAAAAGTGAAGTATGTTACCGAGGGTATGCGAAAACCTGCTTTCTTGAGT



GGTGAACAAAAGAAGGCCATTGTTGACTTGTTATTCAAAACTAACAGAAAGGTCACTGTGAAGCAGC



TTAAAGAAGATTATTTCAAAAAGATCGAATGTTTCGACTCGGTAGAAATTAGTGGTGTGGAAGATAG



ATTTAATGCTTCTCTTGGAACATATCATGATCTACTAAAGATCATCAAAGATAAAGATTTCTTGGACA



ATGAAGAAAATGAAGATATTCTTGAAGACATCGTGTTGACACTTACATTGTTTGAGGACAGAGAAAT



GATTGAAGAAAGGCTGAAGACCTACGCCCATTTGTTTGATGATAAAGTCATGAAACAGTTAAAGAGG



AGAAGGTATACCGGATGGGGTAGGCTGTCTCGCAAATTGATTAATGGTATTCGTGATAAACAATCGG



GTAAAACAATCCTAGATTTCCTGAAGTCCGATGGTTTCGCCAACAGGAATTTTATGCAATTGATTCAT



GACGATTCTTTGACTTTTAAAGAGGATATTCAGAAAGCACAGGTCTCAGGACAGGGCGATTCACTCCA



TGAACATATAGCTAACCTGGCTGGCTCCCCTGCTATTAAGAAAGGTATCTTGCAAACCGTCAAAGTAG



TAGACGAACTTGTTAAAGTTATGGGAAGACACAAACCTGAAAATATCGTTATTGAAATGGCTCGCGA



AAACCAGACAACACAAAAGGGTCAAAAGAATTCGAGAGAGAGAATGAAGCGTATCGAAGAAGGTAT



TAAAGAACTTGGGTCCCAAATACTTAAAGAACATCCAGTAGAAAACACTCAGCTTCAAAATGAAAAA



TTATACTTATATTATCTTCAGAATGGCCGCGATATGTATGTTGACCAAGAGTTAGATATAAATAGGTT



GTCTGATTACGACGTGGATCATATTGTACCTCAATCTTTTCTAAAAGATGATTCAATTGATAATAAGGT



ATTAACGAGAAGTGATAAAAATAGAGGTAAATCTGACAACGTGCCAAGCGAAGAGGTGGTGAAGAA



AATGAAAAATTATTGGCGTCAACTGTTGAACGCCAAGTTAATTACGCAGAGAAAGTTTGATAATCTAA



CAAAAGCTGAAAGAGGAGGCCTATCTGAGTTAGATAAGGCCGGTTTTATCAAACGTCAGTTAGTTGA



AACCAGGCAAATCACGAAGCACGTTGCCCAAATTCTAGATTCAAGGATGAATACCAAATACGATGAA



AACGATAAACTGATTCGGGAAGTCAAGGTTATAACTCTAAAAAGCAAACTAGTTTCAGATTTTCGCAA



AGATTTTCAATTTTACAAAGTTCGAGAAATCAATAATTATCATCATGCTCACGACGCGTACTTGAACG



CGGTCGTTGGTACAGCTTTAATAAAGAAATATCCTAAACTGGAATCGGAATTTGTATATGGGGATTAC



AAAGTATACGACGTGAGAAAGATGATCGCTAAATCTGAACAAGAAATTGGGAAAGCAACTGCCAAAT



ATTTTTTTTACAGCAACATAATGAATTTTTTTAAAACGGAAATTACATTGGCAAATGGCGAAATTAGA



AAGCGCCCATTGATAGAGACCAATGGAGAGACTGGGGAAATCGTGTGGGATAAAGGACGTGATTTTG



CCACAGTGAGGAAAGTGTTAAGTATGCCACAAGTTAATATTGTAAAAAAGACCGAGGTCCAAACGGG



TGGATTTAGCAAAGAATCAATTTTACCTAAGAGAAATTCAGATAAATTAATTGCCCGCAAAAAGGATT



GGGATCCTAAAAAATATGGTGGTTTTGATTCCCCAACAGTTGCTTACTCCGTCCTAGTTGTTGCTAAGG



TTGAAAAAGGAAAGTCTAAGAAACTTAAATCCGTAAAAGAGTTACTGGGAATTACAATAATGGAAAG



ATCCTCTTTCGAAAAGAACCCTATTGACTTCTTGGAGGCGAAAGGTTATAAAGAAGTCAAAAAAGATT



TGATCATAAAACTACCAAAGTATTCTCTATTTGAATTGGAAAACGGCAGAAAAAGGATGTTGGCAAG



CGCTGGTGAACTACAAAAGGGTAACGAATTGGCATTGCCGAGTAAATACGTGAATTTTCTATATTTGG



CATCACATTACGAAAAGTTAAAGGGATCACCCGAGGATAACGAGCAGAAACAACTGTTTGTTGAACA



ACACAAACATTATCTTGATGAAATTATAGAACAAATTAGTGAGTTCAGTAAGAGAGTTATTTTAGCCG



ATGCAAATTTAGACAAAGTTTTATCTGCTTATAACAAACATAGAGATAAGCCTATAAGGGAACAAGC



CGAAAATATTATTCATTTGTTTACGTTAACAAATTTAGGGGCACCAGCAGCATTCAAGTACTTCGATA



CGACTATCGATCGTAAGCGTTACACATCTACCAAAGAAGTTCTTGATGCAACTTTGATTCATCAATCT



ATAACAGGCTTATATGAAACTAGAATCGATCTGTCACAACTTGGTGGTGACTAA





SEQ ID NO: 170
ATGGACAAGAAGTACTCAATTGGGCTTGCTATCGGCACTAACAGCGTTGGCTGGGCGGTCATCACAG



ACGAATATAAGGTCCCATCAAAGAAATTCAAAGTCCTTGGCAATACGGACCGACATTCAATCAAGAA



GAACCTGATTGGAGCTCTGCTGTTTGATTCCGGTGAAACCGCCGAGGCAACACGATTGAAACGTACCG



CTCGTAGGAGGTATACGCGGCGGAAAAATAGGATCTGCTATCTGCAGGAAATATTTAGCAACGAAAT



GGCCAAGGTAGACGACAGCTTCTTCCACCGGCTCGAGGAATCTTTCCTCGTGGAAGAAGACAAAAAG



CACGAGCGCCACCCCATTTTCGGCAATATCGTGGACGAGGTAGCTTACCATGAAAAGTATCCAACTAT



TTACCACTTACGTAAGAAGTTAGTGGACAGCACCGATAAAGCCGACCTTCGCCTGATTTACCTAGCAC



TTGCACACATGATTAAGTTCCGAGGCCACTTCTTGATAGAGGGAGACCTGAATCCTGACAATTCCGAT



GTGGATAAATTGTTCATCCAGCTGGTACAGACATACAATCAGTTGTTTGAGGAAAATCCGATTAATGC



CAGTGGCGTGGACGCCAAGGCTATCCTGTCTGCTCGGCTTAGTAAGAGTAGACGCCTGGAAAATCTA



ATCGCACAGCTGCCCGGCGAAAAGAAAAATGGACTGTTCGGTAATTTGATCGCCCTGAGCCTGGGCC



TCACCCCTAACTTTAAGTCTAACTTCGACCTGGCCGAAGATGCTAAGCTCCAGCTGTCCAAAGATACT



TACGATGACGATCTCGATAATCTACTGGCTCAGATCGGGGACCAGTACGCTGACCTGTTTCTAGCTGC



CAAGAACCTCAGTGACGCCATTCTCCTGTCCGATATTCTGAGGGTTAACACTGAAATTACAAAGGCCC



CGCTGAGCGCGAGCATGATCAAAAGGTACGACGAGCATCACCAGGACCTCACGCTGCTGAAGGCCTT



AGTCAGACAGCAACTGCCCGAAAAGTACAAAGAAATCTTTTTCGACCAATCCAAGAACGGGTACGCC



GGCTACATTGATGGCGGGGCTTCACAAGAGGAGTTTTACAAGTTTATCAAGCCCATCCTGGAGAAAAT



GGACGGCACTGAAGAACTGCTTGTGAAACTCAATAGGGAAGACTTACTGAGGAAACAGCGCACATTC



GATAATGGCTCCATACCCCACCAAATCCATCTGGGAGAGTTGCATGCCATCTTGCGAAGGCAGGAGG



ACTTCTACCCCTTTCTTAAGGACAACAGGGAGAAAATCGAGAAAATTCTGACTTTCCGTATCCCCTAC



TACGTGGGCCCACTTGCTCGCGGAAACTCACGATTCGCATGGATGACCAGAAAGTCCGAGGAAACAA



TTACACCCTGGAATTTTGAGGAGGTAGTAGACAAGGGAGCCAGCGCTCAATCTTTCATTGAGAGGAT



GACGAATTTCGACAAGAACCTTCCAAACGAGAAAGTGCTTCCTAAGCACAGCCTGCTGTATGAGTATT



TCACGGTGTACAACGAACTTACGAAGGTCAAGTATGTGACAGAGGGTATGCGGAAACCTGCTTTTCTG



TCTGGTGAACAGAAGAAAGCTATCGTCGATCTCCTGTTTAAAACCAACCGAAAGGTGACGGTGAAAC



AGTTGAAGGAGGATTACTTCAAGAAGATCGAGTGTTTTGATTCTGTTGAAATTTCTGGGGTCGAGGAT



AGATTCAACGCCAGCCTGGGCACCTACCATGATTTGCTGAAGATTATCAAGGATAAGGATTTTCTGGA



TAATGAGGAGAATGAAGACATTTTGGAGGATATAGTGCTGACCCTCACCCTGTTCGAGGACCGGGAG



ATGATCGAGGAGAGACTGAAAACATACGCTCACCTGTTTGACGACAAGGTCATGAAGCAGCTTAAGA



GACGCCGTTACACAGGCTGGGGAAGATTATCCCGCAAATTAATCAACGGGATACGCGATAAACAAAG



TGGCAAGACCATACTCGACTTCCTAAAGAGCGATGGATTCGCAAATCGCAATTTCATGCAGTTGATCC



ACGACGATAGCCTGACCTTCAAAGAGGACATTCAGAAAGCGCAGGTGAGTGGTCAAGGGGATTCCCT



GCACGAACACATTGCTAACTTGGCTGGATCACCAGCCATTAAGAAAGGCATACTGCAGACCGTTAAA



GTGGTAGATGAGCTTGTGAAAGTCATGGGAAGACATAAGCCAGAGAACATAGTGATCGAAATGGCCA



GGGAAAATCAGACCACGCAAAAGGGGCAGAAGAACTCAAGAGAGCGTATGAAGAGGATCGAGGAG



GGCATCAAGGAGCTGGGTAGCCAGATCCTTAAAGAGCACCCAGTTGAGAATACCCAGCTGCAGAATG



AGAAACTTTATCTCTATTATCTCCAGAACGGAAGGGATATGTATGTCGACCAGGAACTGGACATCAAT



CGGCTGAGTGATTATGACGTCGACCACATTGTGCCTCAAAGCTTTCTGAAGGATGATTCCATCGACAA



TAAAGTTCTGACCCGGTCTGATAAAAATAGAGGCAAATCCGACAACGTACCTAGCGAAGAAGTCGTC



AAAAAAATGAAGAACTATTGGAGGCAGTTGCTGAATGCCAAGCTGATTACACAACGCAAGTTTGACA



ATCTCACCAAGGCAGAAAGGGGGGGCCTGTCAGAACTCGACAAAGCAGGTTTCATTAAAAGGCAGCT



AGTTGAAACTAGGCAGATTACTAAGCACGTGGCCCAGATCCTCGACTCACGGATGAATACAAAGTAT



GATGAGAATGATAAGCTAATCCGGGAGGTGAAGGTGATTACTCTGAAATCTAAGCTGGTGTCAGATT



TCAGAAAAGACTTCCAGTTCTACAAAGTCAGAGAGATCAACAATTATCACCATGCCCACGATGCATAT



CTTAATGCAGTAGTGGGGACAGCTCTGATCAAAAAATATCCTAAACTGGAGTCTGAATTCGTTTATGG



TGACTATAAAGTCTATGACGTCAGAAAAATGATCGCAAAGAGCGAGCAGGAGATAGGGAAGGCCAC



AGCAAAGTACTTCTTTTACAGTAATATCATGAACTTTTTCAAAACTGAGATTACATTGGCTAACGGCG



AGATCCGCAAGCGGCCACTGATAGAGACTAACGGAGAGACAGGGGAGATTGTTTGGGATAAGGGCC



GTGACTTCGCCACCGTTAGGAAAGTGCTGTCCATGCCCCAGGTGAACATTGTGAAGAAGACAGAAGT



GCAGACGGGTGGGTTCTCAAAAGAGTCTATTCTGCCTAAGCGGAATAGTGACAAACTGATCGCACGT



AAAAAGGACTGGGATCCAAAAAAGTACGGCGGATTCGACAGTCCTACCGTTGCATATTCCGTGCTTGT



GGTCGCTAAGGTGGAGAAGGGAAAAAGCAAGAAACTGAAGTCAGTCAAAGAACTACTGGGCATAAC



GATCATGGAGCGCTCCAGTTTCGAAAAAAACCCAATCGATTTTCTTGAAGCCAAGGGATACAAGGAG



GTAAAGAAAGACCTTATCATTAAGCTGCCTAAGTACAGTCTGTTCGAACTGGAGAATGGGAGGAAGC



GCATGCTGGCATCAGCTGGAGAACTCCAAAAAGGGAACGAGTTGGCCCTCCCCTCAAAGTATGTCAA



TTTTCTCTACCTGGCTTCTCACTACGAGAAGTTAAAGGGGTCTCCAGAGGATAATGAGCAGAAACAGC



TGTTTGTGGAACAGCACAAGCACTATTTGGACGAAATCATCGAACAAATTTCCGAGTTCAGTAAGAG



GGTGATTCTGGCCGACGCAAACCTTGACAAAGTTCTGTCCGCATACAATAAGCACAGAGACAAACCA



ATCCGCGAGCAAGCCGAGAATATAATTCACCTTTTCACTCTGACTAATCTGGGGGCCCCCGCAGCATT



TAAATATTTCGATACAACAATCGACCGGAAGCGGTATACATCTACTAAGGAAGTCCTCGATGCGACA



CTGATCCACCAGTCAATTACAGGTTTATATGAAACAAGAATCGACCTGTCCCAGCTGGGCGGCGACTA



G





SEQ ID NO: 171
AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAAGCATTGATAATTGA



GATCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGACAAAAATAAATTATTTATTTATCCAGA



AAATGAATTGGAAAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAGGTtg



ccgtcactgcgtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattctg



taacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataatcacggcagaaaagtcc



acattgattatttgcacggcgtcacactttgctatgccatagcatttttatccataagattagcggatcc



tacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggctagcaccgcctat



ctcgtgtgagataggcggagatacgaactttaagAAGGAGatataccATGGA



ACAGGAATATTATCTGGGCTTGGACATGGGCACCGGTTCCGTCGGCTGGGCTGTTACTGACAGTGAAT



ATCACGTTCTAAGAAAGCATGGTAAGGCATTGTGGGGTGTAAGACTTTTCGAATCTGCTTCCACTGCT



GAAGAGCGTAGAATGTTTAGAACGAGTCGACGTAGGCTAGACAGGCGCAATTGGAGAATCGAAATTT



TACAAGAAATTTTTGCGGAAGAGATATCTAAGAAAGACCCAGGCTTTTTCCTGAGAATGAAGGAATC



TAAGTATTACCCTGAGGATAAAAGAGATATAAATGGTAACTGTCCCGAATTGCCTTACGCATTATTTG



TGGACGATGATTTTACCGATAAGGATTACCATAAAAAGTTCCCAACTATCTACCATTTACGCAAAATG



TTAATGAATACAGAGGAAACCCCAGACATAAGACTAGTTTATCTGGCAATACACCATATGATGAAAC



ATAGAGGCCATTTCTTACTTTCCGGGGATATCAACGAAATCAAAGAGTTTGGTACCACATTTAGTAAG



TTACTGGAAAACATAAAGAATGAAGAATTGGATTGGAACTTAGAACTCGGAAAAGAAGAATACGCG



GTTGTCGAATCTATCCTGAAGGATAATATGCTGAATAGGTCGACCAAAAAAACTAGGCTGATCAAAG



CACTGAAAGCCAAATCTATCTGCGAAAAAGCTGTTTTAAATTTACTTGCTGGTGGCACTGTTAAGTTA



TCAGACATTTTTGGTTTGGAAGAATTGAACGAAACCGAGCGTCCAAAAATTAGTTTCGCTGATAATGG



CTACGATGATTACATTGGTGAGGTGGAAAACGAGTTGGGCGAACAATTTTATATTATAGAGACAGCT



AAGGCAGTCTATGACTGGGCTGTTTTAGTAGAAATCCTTGGTAAATACACATCTATCTCCGAAGCGAA



AGTTGCTACTTACGAAAAGCACAAGTCCGATCTCCAGTTTTTGAAGAAAATTGTCAGGAAATATCTGA



CTAAGGAAGAATATAAAGATATTTTCGTTAGTACCTCTGACAAACTGAAAAATTACTCCGCTTACATC



GGGATGACCAAGATTAATGGCAAAAAAGTTGATCTGCAAAGCAAAAGGTGTTCGAAGGAAGAATTTT



ATGATTTCATTAAAAAGAATGTCTTAAAAAAATTAGAAGGTCAGCCAGAATACGAATATTTGAAAGA



AGAACTGGAAAGAGAGACATTCTTACCAAAACAAGTCAACAGAGATAATGGGGTAATTCCATATCAA



ATTCACCTCTACGAATTAAAAAAAATTTTAGGCAATTTACGCGATAAAATTGACCTTATCAAAGAAAA



TGAGGATAAGCTGGTTCAACTCTTTGAATTCAGAATACCCTATTATGTGGGCCCACTGAACAAGATTG



ATGACGGCAAAGAAGGTAAATTCACATGGGCCGTCCGCAAATCCAATGAAAAAATTTACCCATGGAA



CTTTGAAAATGTAGTAGATATTGAAGCGTCTGCGGAGAAATTTATTCGAAGAATGACTAATAAATGCA



CTTACTTGATGGGAGAGGATGTTCTGCCTAAAGACAGCTTATTATACAGCAAGTACATGGTTCTAAAC



GAACTTAACAACGTTAAGTTGGACGGTGAGAAATTAAGTGTAGAATTGAAACAAAGATTGTATACTG



ACGTCTTCTGCAAGTACAGAAAAGTGACAGTTAAAAAAATTAAGAATTACTTGAAGTGCGAAGGTAT



AATTTCTGGAAACGTAGAGATTACTGGTATTGATGGTGATTTCAAAGCATCCCTAACAGCTTACCACG



ATTTCAAGGAAATCCTGACAGGAACTGAACTCGCAAAAAAAGATAAAGAAAACATTATTACTAATAT



TGTTCTTTTCGGTGATGACAAGAAATTGTTGAAGAAAAGACTGAATAGACTTTACCCCCAGATTACTC



CCAATCAACTTAAGAAAATTTGTGCTTTGTCTTACACAGGATGGGGTCGTTTTTCAAAAAAGTTCTTA



GAAGAGATTACCGCACCTGATCCAGAAACAGGCGAAGTATGGAATATAATTACCGCCTTATGGGAAT



CGAACAATAATCTTATGCAACTTCTGAGCAATGAATATCGTTTCATGGAAGAAGTTGAGACTTACAAC



ATGGGCAAACAGACGAAGACTTTATCCTATGAAACTGTGGAAAATATGTATGTATCACCTTCTGTCAA



GAGACAAATTTGGCAAACCTTAAAAATTGTCAAAGAATTAGAAAAGGTAATGAAGGAGTCTCCTAAA



CGTGTGTTTATTGAAATGGCTAGAGAAAAACAAGAGTCAAAAAGAACCGAGTCAAGAAAGAAGCAG



TTAATCGATTTATATAAGGCTTGTAAAAACGAAGAGAAAGATTGGGTTAAAGAATTGGGGGACCAAG



AGGAACAAAAACTACGGTCGGATAAGTTGTATTTATACTATACGCAAAAGGGACGATGTATGTATTC



CGGCGAGGTAATAGAATTGAAGGATTTATGGGACAATACAAAATATGACATAGACCATATATATCCC



CAATCAAAAACGATGGACGATAGCTTGAACAATAGAGTACTCGTGAAAAAAAAATATAATGCGACCA



AATCTGATAAGTATCCTCTGAATGAAAATATCAGACATGAAAGAAAGGGGTTCTGGAAGTCCTTGTTA



GATGGTGGGTTTATAAGCAAAGAAAAGTACGAGCGTCTAATAAGAAACACGGAGTTATCGCCAGAAG



AACTCGCTGGTTTTATTGAGAGGCAAATCGTGGAAACGAGACAATCTACCAAAGCCGTTGCTGAGAT



CCTAAAGCAAGTTTTCCCAGAGTCGGAGATTGTCTATGTCAAAGCTGGCACAGTGAGCAGGTTTAGGA



AAGACTTCGAACTATTAAAGGTAAGAGAAGTGAACGATTTACATCACGCAAAGGACGCTTACCTAAA



TATCGTTGTAGGTAACTCATATTATGTTAAATTTACCAAGAACGCCTCTTGGTTTATAAAGGAGAACC



CAGGTAGAACATATAACCTGAAAAAGATGTTCACCTCTGGTTGGAATATTGAGAGAAACGGAGAAGT



CGCATGGGAAGTTGGTAAGAAAGGGACTATAGTGACAGTAAAGCAAATTATGAACAAAAATAATATC



CTCGTTACAAGGCAGGTTCATGAAGCAAAGGGCGGCCTTTTTGACCAACAAATTATGAAGAAAGGGA



AAGGTCAAATTGCAATAAAAGAAACCGATGAGAGACTAGCGTCAATAGAAAAGTATGGTGGCTATAA



TAAAGCTGCGGGTGCATACTTTATGCTTGTTGAATCAAAAGACAAGAAAGGTAAGACTATTAGAACT



ATAGAATTTATACCCCTGTACCTTAAAAACAAAATTGAATCGGATGAGTCAATCGCGTTAAATTTTCT



AGAGAAAGGAAGGGGTTTAAAAGAACCAAAGATCCTGTTAAAAAAGATTAAGATTGACACCTTGTTC



GATGTAGATGGATTTAAAATGTGGTTATCTGGCAGAACAGGCGATAGACTTTTGTTTAAGTGCGCTAA



TCAATTAATTTTGGATGAGAAAATCATTGTCACAATGAAAAAAATAGTTAAGTTTATTCAGAGAAGAC



AAGAAAACAGGGAGTTGAAATTATCTGATAAAGATGGTATCGACAATGAAGTTTTAATGGAAATCTA



CAATACATTCGTTGATAAACTTGAAAATACCGTATATCGAATCAGGTTAAGTGAACAAGCCAAAACA



TTAATTGATAAACAAAAAGAATTTGAAAGGCTATCACTGGAAGACAAATCCTCCACCCTATTTGAAAT



TTTGCATATATTCCAGTGCCAATCTTCAGCAGCTAATTTAAAAATGATTGGCGGACCTGGGAAAGCCG



GCATCCTAGTGATGAACAATAATATCTCCAAGTGTAACAAAATATCAATTATTAACCAATCTCCGACA



GGTATTTTTGAAAATGAAATAGACTTGCTTAAGATATAAGAAATCATCCTTAGCGAAAGCTAAGGATT



TTTTTTATCTGAAATTTATTATATCGCGTTGATTATTGATGCTGTTTTTAGTTTTAACGGCAATTAATAT



ATGTGTTATTAATTGAATGAATTTTATCATTCATAATAAGTATGTGTAGGATCAAGCTCAGGTTAAAT



ATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACAGAATTATCTCATAACAAGTGTTAA



GGGATGTTATTTCC





SEQ ID NO: 172
AATTCAAAGGATAATCAAAC






SEQ ID NO: 173
AATCTCTACTCTTTGTAGAT






SEQ ID NO: 174
AATTTCTACTGTTGTAGAT






SEQ ID NO: 175
AATTTCTACTAGTGTAGAT






SEQ ID NO: 176
AATTTCTACTATTGT






SEQ ID NO: 177
AATTTCTACTGTTGTAGA






SEQ ID NO: 178
AATTTCTACTATTGTA






SEQ ID NO: 179
AATTTCTACTTTTGTAGAT






SEQ ID NO: 180
AATTTCTACTGTTGTAGAT






SEQ ID NO: 181
AATTTCTACTCTTGTAGAT








Claims
  • 1. A nucleic acid-guided nuclease system comprising: (a) a nucleic acid having at least 95% identity to SEQ ID NO: 24 or SEQ ID NO: 44, wherein the nucleic acid encodes a nucleic acid-guided nuclease comprising the amino acid sequence of SEQ ID NO: 4; (b) an engineered guide nucleic acid sequence capable of complexing with the nucleic acid-guided nuclease, wherein the engineered guide nucleic acid sequence is configured to hybridize to a TTTN sequence of a target region in a genome of a cell; (c) an editing sequence having a change in sequence relative to the sequence of a target region in a genome of a cell; wherein the system results in a genome edit in the target region in the genome of the cell facilitated by the nuclease, the engineered guide nucleic acid, and the editing sequence.
  • 2. The system of claim 1, wherein the engineered guide nucleic acid and the editing sequence are provided as a single nucleic acid.
  • 3. The system of claim 1, wherein the nucleic acid encoding the nucleic acid-guided nuclease is codon optimized for E. coli.
  • 4. The system of claim 1, wherein the nucleic acid encoding the nucleic acid-guided nuclease is codon optimized for S. cerevisiae.
  • 5. The system of claim 1, wherein the nucleic acid encoding the nucleic acid-guided nuclease is codon optimized for mammalian cells.
  • 6. The system of claim 1, wherein the target region is within a coding region of a protein.
  • 7. The system of claim 1, wherein the target region is within a non-coding region of a protein.
  • 8. The system of claim 7, wherein the target region is within a regulatory region of a protein.
  • 9. The system of claim 1, wherein the editing sequence having a change in sequence relative to the sequence of a target region in a genome of a cell further comprises a mutation in a protospacer adjacent motif (PAM) sequence.
  • 10. A method of modifying a target region in the genome of a cell, the method comprising: (a) contacting a cell with: the nucleic acid-guided nuclease system of claim 1; and (b) allowing the nucleic acid-guided nuclease system to create a genome edit in a target region of the genome of the cell.
  • 11. The method of claim 10, wherein the engineered guide nucleic acid sequence and the editing sequence are provided as a single nucleic acid.
  • 12. The method of claim 1, wherein the nucleic acid having at least 95% identity to SEQ ID NO: 44 or SEQ ID NO: 24 is codon optimized for the cell to be edited.
  • 13. The method of claim 12, wherein the nucleic acid-guided nuclease is codon optimized for E. coli.
  • 14. The method of claim 12, wherein the nucleic acid-guided nuclease is codon optimized for S. cerevisiae.
  • 15. The method of claim 12, wherein the nucleic acid-guided nuclease is codon optimized for mammalian cells.
  • 16. The method of claim 10, wherein the editing sequence encoding a nucleic acid complementary to the target region having a change in sequence relative to the target region further comprises a mutation in a protospacer adjacent motif (PAM) site.
  • 17. The method of claim 10, wherein the target region is within a eukaryotic cell.
  • 18. The method of claim 10, wherein the target region is within a bacterial cell.
  • 19. The method of claim 10, wherein the target region is within a plant cell.
  • 20. The method of claim 10, wherein the target region is within a mammalian cell.
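
As an illustration only, and not part of the claims, the TTTN motif recited in claim 1 and the percent-identity thresholds recited in claims 1 and 12 can be sketched in Python. The function names `find_tttn_sites` and `percent_identity` are hypothetical. The sketch assumes that N stands for any of A, C, G, or T, and it computes identity over two pre-aligned sequences of equal length, which is a simplification of the alignment-based identity normally used for sequence comparisons.

```python
import re


def find_tttn_sites(seq: str) -> list[int]:
    """Return 0-based start positions of every TTTN motif in seq.

    Assumption: N means any one of A, C, G or T.
    """
    seq = seq.upper()
    # A lookahead is used so that overlapping motifs (e.g. in TTTTT)
    # are each reported.
    return [m.start() for m in re.finditer(r"(?=TTT[ACGT])", seq)]


def percent_identity(a: str, b: str) -> float:
    """Percent identity of two pre-aligned, equal-length sequences.

    Assumption: the inputs are already aligned and contain no gaps.
    """
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a.upper(), b.upper()))
    return 100.0 * matches / len(a)
```

For example, `find_tttn_sites("ACTTTGACC")` reports the single TTTG motif starting at position 2, and `percent_identity("ACGT", "ACGA")` gives 75.0, which would fall below a 95% threshold.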
RELATED APPLICATIONS

This application is a Continuation Patent Application of U.S. Ser. No. 17/179,193, entitled “Nucleic Acid-Guided Nucleases” filed Feb. 18, 2021, now allowed; which is a Continuation Patent Application of U.S. Ser. No. 16/819,896, entitled “Nucleic Acid-Guided Nucleases” filed Mar. 16, 2020; which is a Continuation Patent Application of U.S. Ser. No. 16/548,631, entitled “Nucleic Acid-Guided Nucleases” filed Aug. 22, 2019, now U.S. Pat. No. 10,626,416; which is a Continuation Patent Application of U.S. Ser. No. 15/896,433, entitled “Nucleic Acid-Guided Nucleases” filed Feb. 14, 2018, now U.S. Pat. No. 10,435,714; which is a Continuation Patent Application of U.S. Ser. No. 15/631,989, entitled “Nucleic Acid-Guided Nucleases” filed Jun. 23, 2017, now U.S. Pat. No. 10,011,849.

US Referenced Citations (75)
Number Name Date Kind
6391582 Ying et al. May 2002 B2
6837995 Vassarotti et al. Jan 2005 B1
7166443 Walker et al. Jan 2007 B2
8332160 Platt et al. Dec 2012 B1
8697359 Zhang et al. Apr 2014 B1
8926977 Miller et al. Jan 2015 B2
9260505 Weir et al. Feb 2016 B2
9361427 Hillson Jun 2016 B2
9499855 Hyde et al. Nov 2016 B2
9776138 Innings et al. Oct 2017 B2
9790490 Zhang et al. Oct 2017 B2
9896696 Begemann et al. Feb 2018 B2
9982279 Gill et al. May 2018 B1
9988624 Serber et al. Jun 2018 B2
10011849 Gill et al. Jul 2018 B1
10017760 Gill et al. Jul 2018 B2
10227576 Cameron et al. Mar 2019 B1
10266851 Chen Apr 2019 B2
10704033 Kim et al. Jul 2020 B1
10724021 Kim et al. Jul 2020 B1
10745678 Kim et al. Aug 2020 B1
10767169 Kim et al. Sep 2020 B1
10837021 Tian et al. Nov 2020 B1
10927385 Kannan et al. Feb 2021 B2
20020139741 Kopf Oct 2002 A1
20040110253 Kappler et al. Jun 2004 A1
20070020761 Yu et al. Jan 2007 A1
20100076057 Sontheimer et al. Mar 2010 A1
20110294217 McConnell-Smith et al. Dec 2011 A1
20130236970 Anneren et al. Sep 2013 A1
20140068797 Doudna et al. Mar 2014 A1
20140199767 Barrangou et al. Jul 2014 A1
20140242033 Gruber et al. Aug 2014 A1
20140273226 Wu et al. Sep 2014 A1
20150071898 Liu et al. Mar 2015 A1
20150098954 Hyde et al. Apr 2015 A1
20150159174 Frendewey et al. Jun 2015 A1
20150176013 Musunuru et al. Jun 2015 A1
20150191719 Hudson et al. Jul 2015 A1
20150225732 Williams et al. Aug 2015 A1
20150344549 Muir et al. Dec 2015 A1
20160024529 Carstens et al. Jan 2016 A1
20160053272 Wurzel et al. Feb 2016 A1
20160053304 Wurzel et al. Feb 2016 A1
20160076093 Shendure et al. Mar 2016 A1
20160102322 Ravinder et al. Apr 2016 A1
20160168592 Church et al. Jun 2016 A1
20160264981 Yang et al. Sep 2016 A1
20160281053 Sorek et al. Sep 2016 A1
20160289673 Huang et al. Oct 2016 A1
20160298134 Chen et al. Oct 2016 A1
20160354487 Zhang et al. Dec 2016 A1
20170002339 Barrangou et al. Jan 2017 A1
20170022499 Lu et al. Jan 2017 A1
20170051310 Doudna et al. Feb 2017 A1
20170073705 Chen et al. Mar 2017 A1
20170191123 Kim et al. Jul 2017 A1
20170211078 Kamineni et al. Jul 2017 A1
20170240922 Gill et al. Aug 2017 A1
20170369870 Gill et al. Dec 2017 A1
20180028567 Li et al. Feb 2018 A1
20180052176 Holt et al. Feb 2018 A1
20180073013 Lorenz et al. Mar 2018 A1
20180112235 Li et al. Apr 2018 A1
20180187149 Ma et al. Jul 2018 A1
20180200342 Bikard et al. Jul 2018 A1
20180230460 Gill et al. Aug 2018 A1
20180230461 Gill et al. Aug 2018 A1
20190017072 Ditommaso et al. Jan 2019 A1
20190136230 Sather et al. May 2019 A1
20190169605 Masquelier et al. Jun 2019 A1
20190194650 Gill et al. Jun 2019 A1
20190225928 Masquelier et al. Jul 2019 A1
20190270987 Masquelier et al. Sep 2019 A1
20200263197 Cheng et al. Aug 2020 A1
Foreign Referenced Citations (35)
Number Date Country
2395087 Dec 2011 EP
3199632 Aug 2017 EP
WO2002010183 Feb 2002 WO
WO 2003087341 Oct 2003 WO
WO 2010079430 Jul 2010 WO
WO 2011072246 Jun 2011 WO
WO 2011143124 Nov 2011 WO
WO 2013142578 Sep 2013 WO
WO 2013176772 Nov 2013 WO
WO 2014018423 Jan 2014 WO
WO 2014144495 Sep 2014 WO
WO 2016110453 Jul 2016 WO
WO2016110453 Jul 2016 WO
WO 2017053902 Mar 2017 WO
WO 2017078631 May 2017 WO
WO 2017083722 May 2017 WO
WO 2017106414 Jun 2017 WO
WO2017106414 Jun 2017 WO
WO 2017161371 Sep 2017 WO
WO 2017174329 Oct 2017 WO
WO 2017186718 Nov 2017 WO
WO2017212400 Dec 2017 WO
WO2017216392 Dec 2017 WO
WO 2017216392 Dec 2017 WO
WO2017223330 Dec 2017 WO
WO 2017223330 Dec 2017 WO
WO 2018031950 Feb 2018 WO
WO 2018071672 Apr 2018 WO
WO 2018083339 May 2018 WO
WO 2018191715 Oct 2018 WO
WO2019006436 Jan 2019 WO
WO 2019200004 Oct 2019 WO
WO2019209926 Oct 2019 WO
WO 2020005383 Jan 2020 WO
WO2020021045 Jan 2020 WO
Non-Patent Literature Citations (75)
Entry
International Search Report and Written Opinion for International Application No. PCT/US20/19379, dated Jul. 22, 2020, p. 1-10.
International Search Report and Written Opinion for International Application No. PCT/US20/36064, dated Sep. 18, 2020, p. 1-16.
International Search Report and Written Opinion for International Application No. PCT/US20/40389, dated Oct. 13, 2020, p. 1-12.
Arnak, et al., “Yeast Artificial Chromosomes”, John Wiley & Sons, Ltd., doi:10.1002/9780470015902.a0000379.pub3, pp. 1-10 (2012).
Woo, et al., “Dual roles of yeast Rad51 N-terminal domain in repairing DNA double-strand breaks”, Nucleic Acids Research, doi:10.1093/nar/gkaa587, vol. 48, No. 15, pp. 8474-8489 (2020).
International Search Report and Written Opinion for International Application No. PCT/US2021/012868, dated Mar. 26, 2021, p. 1-15.
Anzalone et al., “Search-and-replace genome editing without double-strand breaks or donor DNA,” Nature, Oct. 21, 2019, vol. 576, No. 7785, pp. 149-157.
Alvarez, et al., “In vivo diversification of target genomic sites using processive T7 RNA polymerase-base deaminase fusions blocked by RNA-guided dCas9”, Dept. of Microbial Biotechnology and Systems Biology Program, Madrid, Spain, Jan. 1, 2019, p. 1-33.
International Search Report and Written Opinion for International Application No. PCT/US20/65168, dated Mar. 17, 2021, p. 1-15.
International Search Report and Written Opinion for International Application No. PCT/US2020/038345, dated Nov. 23, 2020, p. 1-13.
International Search Report and Written Opinion for International Application No. PCT/US21/12867, dated May 12, 2021, p. 1-17.
International Search Report and Written Opinion for International Application No. PCT/US2020/064727, dated Apr. 28, 2021, p. 1-13.
International Search Report and Written Opinion for International Application No. PCT/US21/29008, dated Aug. 24, 2021, p. 1-19.
International Search Report and Written Opinion for International Application No. PCT/US21/29011, dated Aug. 24, 2021, p. 1-20.
Bauer, et al., “Cell-microcarrier Adhesion to Gas-Liquid Interfaces and Foam”, Biotechnol. Prog. 2000, 16, 125-132, Oct. 19, 1999.
Datlinger, et al., “Pooled CRISPR screening with single-cell transcriptome readout”, Nature Methods, Jan. 10, 2017; p. 1-10, doi:10.1038/nmeth.4177.
Dixit, et al., “Perturb-Seq: Dissecting Molecular Circuits with Scalable Single-Cell RNA Profiling of Pooled Genetic Screens”, Cell 167, p. 1853-1866, Dec. 15, 2016.
GE Healthcare Life Sciences, “Microcarrier Cell Culture Principles and Methods”, 18-1140-62 AC, p. 1-23, Nov. 2013.
Jacobi, et al., “Simplified CRISPR tools for efficient genome editing and streamlined protocols for their delivery into mammalian cells and mouse zygotes”, Methods 121-122, p. 16-28, Mar. 23, 2017.
Jaitin, et al., “Dissecting Immune Circuits by Linking CRISPR-Pooled Screens with Single-Cell RNA-Seq”, Cell 167, p. 1883-1896, Dec. 15, 2016.
Kim, et al., “Formation of Thermoresponsive Poly(N-isopropylacrylamide)/Dextran Particles by Atom Transfer Radical Polymerization”, Macromol. Rapid Commun., 24, p. 517-521, 2003.
Kimple, et al., “Overview of Affinity Tags for Protein Purification”, Curr Protoc Protein Sci.; 73: Unit-9-9. Doi:10.1002/0471140864.ps0909s73, p. 1-26, Aug. 6, 2015.
Nienow, et al., “A potentially scalable method for the harvesting of hMSCs from microcarriers”, Biochemical Engineering Journal 85, p. 79-88, Feb. 4, 2014.
Replogle, et al., “Direct capture of CRISPR guides enables scalable, multiplexed, and multi-omic Perturb-Seq”, bioRxiv; doi:http://dx.doi.org/10.1101/503367, p. 1-26, Dec. 21, 2018.
Sivalingam, et al., “Superior Red Blood Cell Generation from Human Pluripotent Stem Cells Through a Novel Microcarrier-Based Embryoid Body Platform”, Tissue Engineering: Part C, vol. 22, No. 8, p. 765-780, Jun. 9, 2016.
Bao, et al., “Genome-scale engineering of Saccharomyces cerevisiae with single-nucleotide precision”, Nature Biotechnology, doi:10.1038/nbt.4132, pp. 1-6 (May 7, 2018).
Dicarlo, et al., “Genome engineering in Saccharomyces cerevisiae using CRISPR-Cas systems”, Nucleic Acids Research, 41(7):4336-43 (2013).
Garst, et al., “Genome-wide mapping of mutations at single-nucleotide resolution for protein, metabolic and genome engineering”, Nature Biotechnology, 35(1):48-59 (2017).
Hsu, et al., “DNA targeting specificity of RNA-guided Cas9 nucleases”, Nature Biotechnology, 31(9):827-32 (2013).
Jiang, et al., “RNA-guided editing of bacterial genomes using CRISPR-Cas systems”, Nature Biotechnology, 31(3):233-41 (2013).
Jinek, et al., “A Programmable Dual-RNA-Guided DNA Endonuclease in Adaptive Bacterial Immunity”, Science, 337:816-20 (2012).
Verwaal, et al., “CRISPR/Cpf1 enables fast and simple genome editing of Saccharomyces cerevisiae”, Yeast, 35:201-11 (2018).
Lian, et al., “Combinatorial metabolic engineering using an orthogonal tri-functional CRISPR system”, Nature Communications, doi:10.1038/s41467-017-01695-x, pp. 1-9 (2017).
Roy, et al., “Multiplexed precision genome editing with trackable genomic barcodes in yeast”, Nature Biotechnology, doi:10.1038/nbt.4137, pp. 1-16 (2018).
Dong, “Establishment of a highly efficient virus-inducible CRISPR/Cas9 system in insect cells,” Antiviral Res., 130:50-7(2016).
Epinat et al., “A novel engineered meganuclease induces homologous recombination in eukaryotic cells, e.g., yeast and mammalian cells”, Nucleic Acids Research, 31(11): 2952-2962.
Farasat et al., “A Biophysical Model of CRISPR/Cas9 Activity for Rational Design of Genome Editing and Gene Regulation,” PLoS Comput Biol., 29:12(1):e1004724 (2016).
Liu et al., “A chemical-inducible CRISPR-Cas9 system for rapid control of genome editing”, Nature Chemical Biology, 12:980-987(2016).
Eklund, et al., “Altered target site specificity variants of the I-PpoI His-Cys box homing endonuclease”, Nucleic Acids Research, 35(17):5839-50 (2007).
Boles, et al., “Digital-to-biological converter for on-demand production of biologics”, Nature Biotechnology, doi:10.1038/nbt.3859 (May 29, 2017).
Pines, et al., “Codon Compression Algorithms for Saturation Mutagenesis”, ACS Synthetic Biology, 4:604-14 (2015).
Bessa et al., “Improved gap repair cloning in yeast: treatment of the gapped vector with Taq DNA polymerase avoids vector self-ligation,” Yeast, 29(10):419-23 (2012).
Boch, “TALEs of genome targeting,” Nature Biotechnology vol. 29, pp. 135-136 (2011).
Campbell et al., “Targeting protein function: the expanding toolkit for conditional disruption,” Biochem J., 473(17):2573-2589 (2016).
Casini et al., “Bricks and blueprints: methods and standards for DNA assembly,” Nat Rev Mol Cell Biol., (9):568-76 (2015).
Chica et al., “Semi-rational approaches to engineering enzyme activity: combining the benefits of directed evolution and rational design,” Current Opinion in Biotechnology, 16(4): 378-384 (2005).
Durai et al., “Zinc finger nucleases: custom-designed molecular scissors for genome engineering of plant and mammalian cells”, Nucleic Acids Res., 33(18):5978-90 (2005).
Kadonaga et al., “Regulation of RNA polymerase II transcription by sequence-specific DNA binding factors”, Cell, 116(2):247-57 (2004).
Lee et al., “Targeted chromosomal deletions in human cells using zinc finger nucleases”, Genome Res., 20(1): 81-9 (2009).
Miller et al., “A TALE nuclease architecture for efficient genome editing”, Nature Biotechnology, 29 (2): 143-8 (2011).
Mittelman et al., “Zinc-finger directed double-strand breaks within CAG repeat tracts promote repeat instability in human cells”, PNAS USA, 106 (24): 9607-12 (2009).
Shivange, “Advances in generating functional diversity for directed protein evolution”, Current Opinion in Chemical Biology, 13 (1): 19-25 (2009).
Udo, “An Alternative Method to Facilitate cDNA Cloning for Expression Studies in Mammalian Cells by Introducing Positive Blue White Selection in Vaccinia Topoisomerase I-Mediated Recombination,” PLoS One, 10(9):e0139349 (2015).
Urnov et al., “Genome editing with engineered zinc finger nucleases”, Nature Reviews Genetics, 11:636-646 (2010).
International Search Report and Written Opinion for International Application No. PCT/US2018/053608, dated Dec. 13, 2018, p. 1-9.
International Search Report and Written Opinion for International Application No. PCT/US2018/053670, dated Jan. 3, 2019, p. 1-13.
International Search Report and Written Opinion for International Application No. PCT/US2018/053671, dated Sep. 26, 2018, p. 1-12.
International Search Report and Written Opinion for International Application No. PCT/US2018/040519, dated Sep. 26, 2018, p. 1-8.
International Search Report and Written Opinion for International Application No. PCT/US2019/026836, dated Jul. 2, 2019, p. 1-10.
International Search Report and Written Opinion for International Application No. PCT/US2019/023342, dated Jun. 6, 2019, p. 1-34.
International Search Report and Written Opinion for International Application No. PCT/US2019/030085, dated Jul. 23, 2019, p. 1-14.
International Search Report and Written Opinion for International Application No. PCT/US20/24341, dated Jun. 19, 2020, p. 1-9.
NonFinal Office Action for U.S. Appl. No. 16/399,988, dated Jul. 31, 2019, p. 1-20.
First Office Action Interview Pilot Program Pre-Interview Communication for U.S. Appl. No. 16/024,831, dated Feb. 12, 2019, p. 1-37.
NonFinal Office Action for U.S. Appl. No. 16/024,816 dated Sep. 4, 2018, p. 1-10.
Final Office Action for U.S. Appl. No. 16/024,816 dated Nov. 26, 2018, p. 1-12.
First Office Action Interview Pilot Program Pre-Interview Communication for U.S. Appl. No. 16/454,865 dated Aug. 16, 2019, p. 1-36.
Yoshioka, et al., “Development of a mono-promoter-driven CRISPR/Cas9 system in mammalian cells”, Scientific Reports, Jul. 3, 2015, p. 1-8.
Remaut, et al., “Plasmid vectors for high-efficiency expression controlled by the PL promoter of coliphage lambda,” Laboratory of Molecular Biology, Apr. 15, 1981, p. 81-93.
International Search Report and Written Opinion for International Application No. PCT/US2019/028821, dated Aug. 2, 2019, p. 1-14.
International Search Report and Written Opinion for International Application No. PCT/US2019/028883, dated Aug. 16, 2019, p. 1-12.
International Search Report and Written Opinion for International Application No. PCT/US2019/46526, dated Dec. 18, 2019, p. 1-17.
International Search Report and Written Opinion for International Application No. PCT/US2018/34779, dated Nov. 26, 2018, p. 1-39.
International Search Report and Written Opinion for International Application No. PCT/US19/57250, dated Feb. 25, 2020, p. 1-16.
International Search Report and Written Opinion for International Application No. PCT/US19/47135, dated Jun. 11, 2020, p. 1-15.
Related Publications (1)
Number Date Country
20210388391 A1 Dec 2021 US
Continuations (5)
Number Date Country
Parent 17179193 Feb 2021 US
Child 17387860 US
Parent 16819896 Mar 2020 US
Child 17179193 US
Parent 16548631 Aug 2019 US
Child 16819896 US
Parent 15896433 Feb 2018 US
Child 16548631 US
Parent 15631989 Jun 2017 US
Child 15896433 US