Nucleic acid-guided nucleases

Information

  • Patent Grant
  • 11408012
  • Patent Number
    11,408,012
  • Date Filed
    Saturday, January 29, 2022
  • Date Issued
    Tuesday, August 9, 2022
Abstract
Disclosed herein are nucleic acid-guided nucleases, guide nucleic acids, and targetable nuclease systems, and methods of use. Disclosed herein are engineered non-naturally occurring nucleic acid-guided nucleases, guide nucleic acids, and targetable nuclease systems, and methods of use. Targetable nuclease systems can be used to edit genetic targets, including recursive genetic engineering and trackable genetic engineering methods.
Description
BACKGROUND OF THE DISCLOSURE

Nucleic acid-guided nucleases have become important tools for research and genome engineering. The applicability of these tools can be limited by the sequence specificity requirements, expression, or delivery issues.


INCORPORATION BY REFERENCE

Submitted with the present application is an electronically filed sequence listing via EFS-Web as an ASCII formatted sequence listing, entitled “INSC105US4_seqlist_20220128”, created 28 Jan. 2022, and 791,000 bytes in size. The sequence listing is part of the specification filed herewith and is incorporated by reference in its entirety.


SUMMARY OF THE DISCLOSURE

Disclosed herein are methods of modifying a target region in the genome of a cell, the method comprising: (a) contacting a cell with: a non-naturally occurring nucleic-acid-guided nuclease encoded by a nucleic acid having at least 80% identity to SEQ ID NO: 27; an engineered guide nucleic acid capable of complexing with the nucleic acid-guided nuclease; and an editing sequence encoding a nucleic acid complementary to said target region having a change in sequence relative to the target region; and (b) allowing the nuclease, guide nucleic acid, and editing sequence to create a genome edit in a target region of the genome of the cell. In some aspects, the engineered guide nucleic acid and the editing sequence are provided as a single nucleic acid. In some aspects, the single nucleic acid further comprises a mutation in a protospacer adjacent motif (PAM) site. In some aspects, the nucleic acid-guided nuclease is encoded by a nucleic acid with at least 85% identity to SEQ ID NO: 47. In some aspects, the nucleic acid-guided nuclease is encoded by a nucleic acid with at least 85% identity to SEQ ID NO: 133.


Disclosed herein are nucleic acid-guided nuclease systems comprising: (a) a non-naturally occurring nuclease encoded by a nucleic acid having at least 80% identity to SEQ ID NO: 27; (b) an engineered guide nucleic acid capable of complexing with the nucleic acid-guided nuclease; and (c) an editing sequence having a change in sequence relative to the sequence of a target region in a genome of a cell; wherein the system results in a genome edit in the target region in the genome of the cell facilitated by the nuclease, the engineered guide nucleic acid, and the editing sequence. In some aspects, the nucleic acid-guided nuclease is encoded by a nucleic acid with at least 85% identity to SEQ ID NO: 47. In some aspects, the nucleic acid-guided nuclease is encoded by a nucleic acid with at least 85% identity to SEQ ID NO: 133. In some aspects, the nucleic acid-guided nuclease is codon optimized for the cell to be edited. In some aspects, the engineered guide nucleic acid and the editing sequence are provided as a single nucleic acid. In some aspects, the single nucleic acid further comprises a mutation in a protospacer adjacent motif (PAM) site.


Disclosed herein are compositions for use in genome editing comprising a non-naturally occurring nuclease encoded by a nucleic acid having at least 75% identity to SEQ ID NO: 27. In some aspects, the nucleic acid has at least 80% identity to SEQ ID NO: 27. In some aspects, the nucleic acid has at least 90% identity to SEQ ID NO: 27. In some aspects, the nuclease is further codon optimized for use in cells from a particular organism. In some aspects, the nuclease is codon optimized for E. coli. In some aspects, the nuclease is codon optimized for S. cerevisiae. In some aspects, the nuclease is codon optimized for mammalian cells. In some aspects, the nucleic acid-guided nuclease has less than 40% protein identity to SEQ ID NO: 12. In some aspects, the nucleic acid-guided nuclease has less than 40% protein identity to SEQ ID NO: 108.


INCORPORATION BY REFERENCE

All publications and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication or patent application was specifically and individually indicated to be incorporated by reference.





BRIEF DESCRIPTION OF THE DRAWINGS

This patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.



FIG. 1A depicts a partial sequence alignment of MAD1-8 (SEQ ID NO: 1-8) and MAD10-12 (SEQ ID NO: 10-12).



FIG. 1B depicts a phylogenetic tree of nucleases including MAD1-8.



FIG. 2 depicts an example protein expression construct.



FIG. 3 depicts an example editing cassette.



FIG. 4 depicts an example screening or selection experiment workflow.



FIG. 5A depicts an example protein expression construct.



FIG. 5B depicts an example editing cassette.



FIG. 5C depicts an example screening or selection experiment workflow.



FIG. 6A depicts an example protein expression construct.



FIG. 6B depicts an example editing cassette.



FIG. 6C depicts an example screening or selection experiment workflow.



FIGS. 7A-7B depict example data from a functional nuclease complex screening or selection experiment.



FIG. 8 depicts example data from a targetable nuclease complex-based editing experiment.



FIG. 9 depicts example data from a targetable nuclease complex-based editing experiment.



FIGS. 10A-10C depict example data from a targetable nuclease complex-based editing experiment.



FIG. 11 depicts an example sequence alignment of select sequences from an editing experiment.



FIG. 12 depicts example data from a targetable nuclease complex-based editing experiment.



FIG. 13A depicts an example alignment of scaffold sequences.



FIG. 13B depicts an example model of a nucleic acid-guided nuclease complexed with a guide nucleic acid and a target sequence.



FIGS. 14A-14B depict example data from a primer validation experiment.



FIG. 15 depicts example data from a targetable nuclease complex-based editing experiment.



FIG. 16 depicts example validation data comparing results from two different assays.



FIGS. 17A-17C depict an example trackable genetic engineering workflow, including a plasmid comprising an editing cassette and a recording cassette, and downstream sequencing of barcodes in order to identify the incorporated edit or mutation.



FIG. 18 depicts an example trackable genetic engineering workflow, including iterative rounds of engineering with a different editing cassette and recorder cassette with unique barcode (BC) at each round, which can be followed by selection and tracking to confirm the successful engineering step at each round.



FIG. 19 depicts an example recursive engineering workflow.





DETAILED DESCRIPTION OF THE DISCLOSURE

The present disclosure provides nucleic acid-guided nucleases and methods of use. Often, the subject nucleic-acid guided nucleases are part of a targetable nuclease system comprising a nucleic acid-guided nuclease and a guide nucleic acid. A subject targetable nuclease system can be used to cleave, modify, and/or edit a target polynucleotide sequence, often referred to as a target sequence. A subject targetable nuclease system refers collectively to transcripts and other elements involved in the expression of or directing the activity of genes, which may include sequences encoding a subject nucleic acid-guided nuclease protein and a guide nucleic acid as disclosed herein.


Methods, systems, vectors, polynucleotides, and compositions described herein may be used in various applications including altering or modifying synthesis of a gene product, such as a protein, polynucleotide cleavage, polynucleotide editing, polynucleotide splicing; trafficking of target polynucleotide, tracing of target polynucleotide, isolation of target polynucleotide, visualization of target polynucleotide, etc. Aspects of the invention also encompass methods and uses of the compositions and systems described herein in genome engineering, e.g. for altering or manipulating the expression of one or more genes or the one or more gene products, in prokaryotic, archaeal, or eukaryotic cells, in vitro, in vivo or ex vivo.


Nucleic Acid-Guided Nucleases


Bacterial and archaeal targetable nuclease systems have emerged as powerful tools for precision genome editing. However, naturally occurring nucleases have some limitations, including expression and delivery challenges due to the nucleic acid sequence and protein size. Targetable nucleases that require PAM recognition are also limited in the sequences they can target throughout a genetic sequence. Other challenges include processivity, target recognition specificity and efficiency, and nuclease activity efficiency, which often affect genetic editing efficiency.


Non-naturally occurring targetable nucleases and non-naturally occurring targetable nuclease systems can address many of these challenges and limitations.


Disclosed herein are non-naturally occurring targetable nuclease systems. Such targetable nuclease systems are engineered to address one or more of the challenges described above and can be referred to as engineered nuclease systems. Engineered nuclease systems can comprise one or more of an engineered nuclease, such as an engineered nucleic acid-guided nuclease, an engineered guide nucleic acid, an engineered polynucleotide encoding said nuclease, or an engineered polynucleotide encoding said guide nucleic acid. Engineered nucleases, engineered guide nucleic acids, and engineered polynucleotides encoding the engineered nuclease or engineered guide nucleic acid are not naturally occurring and are not found in nature. It follows that engineered nuclease systems including one or more of these elements are non-naturally occurring.


Non-limiting examples of types of engineering that can be done to obtain a non-naturally occurring nuclease system are as follows. Engineering can include codon optimization to facilitate expression or improve expression in a host cell, such as a heterologous host cell. Engineering can reduce the size or molecular weight of the nuclease in order to facilitate expression or delivery. Engineering can alter PAM selection in order to change PAM specificity or to broaden the range of recognized PAMs. Engineering can alter, increase, or decrease stability, processivity, specificity, or efficiency of a targetable nuclease system. Engineering can alter, increase, or decrease protein stability. Engineering can alter, increase, or decrease processivity of nucleic acid scanning. Engineering can alter, increase, or decrease target sequence specificity. Engineering can alter, increase, or decrease nuclease activity. Engineering can alter, increase, or decrease editing efficiency. Engineering can alter, increase, or decrease transformation efficiency. Engineering can alter, increase, or decrease nuclease or guide nucleic acid expression.


Examples of non-naturally occurring nucleic acid sequences which are disclosed herein include sequences codon optimized for expression in bacteria, such as E. coli (e.g., SEQ ID NO: 41-60), sequences codon optimized for expression in single cell eukaryotes, such as yeast (e.g., SEQ ID NO: 127-146), sequences codon optimized for expression in multi cell eukaryotes, such as human cells (e.g., SEQ ID NO: 147-166), polynucleotides used for cloning or expression of any sequences disclosed herein (e.g., SEQ ID NO: 61-80), plasmids comprising nucleic acid sequences (e.g., SEQ ID NO: 21-40) operably linked to a heterologous promoter or nuclear localization signal or other heterologous element, proteins generated from engineered or codon optimized nucleic acid sequences (e.g., SEQ ID NO: 1-20), or engineered guide nucleic acids comprising any one of SEQ ID NO: 84-107. Such non-naturally occurring nucleic acid sequences can be amplified, cloned, assembled, synthesized, generated from synthesized oligonucleotides or dNTPs, or otherwise obtained using methods known by those skilled in the art.


Disclosed herein are nucleic acid-guided nucleases. Subject nucleases are functional in vitro, or in prokaryotic, archaeal, or eukaryotic cells for in vitro, in vivo, or ex vivo applications. Suitable nucleic acid-guided nucleases can be from an organism from a genus which includes but is not limited to Thiomicrospira, Succinivibrio, Candidatus, Porphyromonas, Acidaminococcus, Acidomonococcus, Prevotella, Smithella, Moraxella, Synergistes, Francisella, Leptospira, Catenibacterium, Kandleria, Clostridium, Dorea, Coprococcus, Enterococcus, Fructobacillus, Weissella, Pediococcus, Corynebacter, Sutterella, Legionella, Treponema, Roseburia, Filifactor, Eubacterium, Streptococcus, Lactobacillus, Mycoplasma, Bacteroides, Flaviivola, Flavobacterium, Sphaerochaeta, Azospirillum, Gluconacetobacter, Neisseria, Roseburia, Parvibaculum, Staphylococcus, Nitratifractor, Mycoplasma, Alicyclobacillus, Brevibacilus, Bacillus, Bacteroidetes, Brevibacilus, Carnobacterium, Clostridiaridium, Clostridium, Desulfonatronum, Desulfovibrio, Helcococcus, Leptotrichia, Listeria, Methanomethyophilus, Methylobacterium, Opitutaceae, Paludibacter, Rhodobacter, Sphaerochaeta, Tuberibacillus, Oleiphilus, Omnitrophica, Parcubacteria, and Campylobacter. Species of organism of such a genus can be as otherwise herein discussed. Suitable nucleic acid-guided nucleases can be from an organism from a genus or unclassified genus within a kingdom which includes but is not limited to Firmicute, Actinobacteria, Bacteroidetes, Proteobacteria, Spirochaetes, and Tenericutes. Suitable nucleic acid-guided nucleases can be from an organism from a genus or unclassified genus within a phylum which includes but is not limited to Erysipelotrichia, Clostridia, Bacilli, Actinobacteria, Bacteroidetes, Flavobacteria, Alphaproteobacteria, Betaproteobacteria, Gammaproteobacteria, Deltaproteobacteria, Epsilonproteobacteria, Spirochaetes, and Mollicutes. Suitable nucleic acid-guided nucleases can be from an organism from a genus or unclassified genus within an order which includes but is not limited to Clostridiales, Lactobacillales, Actinomycetales, Bacteroidales, Flavobacteriales, Rhizobiales, Rhodospirillales, Burkholderiales, Neisseriales, Legionellales, Nautiliales, Campylobacterales, Spirochaetales, Mycoplasmatales, and Thiotrichales. Suitable nucleic acid-guided nucleases can be from an organism from a genus or unclassified genus within a family which includes but is not limited to Lachnospiraceae, Enterococcaceae, Leuconostocaceae, Lactobacillaceae, Streptococcaceae, Peptostreptococcaceae, Staphylococcaceae, Eubacteriaceae, Corynebacterineae, Bacteroidaceae, Flavobacterium, Cryomoorphaceae, Rhodobiaceae, Rhodospirillaceae, Acetobacteraceae, Sutterellaceae, Neisseriaceae, Legionellaceae, Nautiliaceae, Campylobacteraceae, Spirochaetaceae, Mycoplasmataceae, Pisciririckettsiaceae, and Francisellaceae. Other nucleic acid-guided nucleases have been described in US Patent Application Publication No. US20160208243 filed Dec. 18, 2015, US Application Publication No. US20140068797 filed Mar. 15, 2013, U.S. Pat. No. 8,697,359 filed Oct. 15, 2013, and Zetsche et al., Cell 2015 Oct. 22; 163(3):759-71, each of which is incorporated herein by reference in its entirety.


Some nucleic acid-guided nucleases suitable for use in the methods, systems, and compositions of the present disclosure include those derived from an organism such as, but not limited to, Thiomicrospira sp. XS5, Eubacterium rectale, Succinivibrio dextrinosolvens, Candidatus Methanoplasma termitum, Candidatus Methanomethylophilus alvus, Porphyromonas crevioricanis, Flavobacterium branchiophilum, Acidaminococcus sp., Acidomonococcus sp., Lachnospiraceae bacterium COE1, Prevotella brevis ATCC 19188, Smithella sp. SCADC, Moraxella bovoculi, Synergistes jonesii, Bacteroidetes oral taxon 274, Francisella tularensis, Leptospira inadai serovar Lyme str. 10, Acidomonococcus sp. crystal structure (5B43), S. mutans, S. agalactiae, S. equisimilis, S. sanguinis, S. pneumoniae; C. jejuni, C. coli; N. salsuginis, N. tergarcus; S. auricularis, S. carnosus; N. meningitidis, N. gonorrhoeae; L. monocytogenes, L. ivanovii; C. botulinum, C. difficile, C. tetani, C. sordellii; Francisella tularensis 1, Prevotella albensis, Lachnospiraceae bacterium MC2017 1, Butyrivibrio proteoclasticus, Butyrivibrio proteoclasticus B316, Peregrinibacteria bacterium GW2011_GWA2_33_10, Parcubacteria bacterium GW2011_GWC2_44_17, Smithella sp. SCADC, Acidaminococcus sp. BV3L6, Lachnospiraceae bacterium MA2020, Candidatus methanoplasma termitum, Eubacterium eligens, Moraxella bovoculi 237, Leptospira inadai, Lachnospiraceae bacterium ND2006, Porphyromonas crevioricanis 3, Prevotella disiens, Porphyromonas macacae, Catenibacterium sp. CAG:290, Kandleria vitulina, Clostridiales bacterium KA00274, Lachnospiraceae bacterium 3-2, Dorea longicatena, Coprococcus catus GD/7, Enterococcus columbae DSM 7374, Fructobacillus sp. EFB-N1, Weissella halotolerans, Pediococcus acidilactici, Lactobacillus curvatus, Streptococcus pyogenes, Lactobacillus versmoldensis, Filifactor alocis ATCC 35896, Alicyclobacillus acidoterrestris, Alicyclobacillus acidoterrestris ATCC 49025, Desulfovibrio inopinatus, Desulfovibrio inopinatus DSM 10711, Oleiphilus sp., Oleiphilus sp. HI0009, Candidatus Kefeldibacteria, Parcubacteria CasY.4, Omnitrophica WOR 2 bacterium GWF2, Bacillus sp. NSP2.1, and Bacillus thermoamylovorans.


In some instances, a nucleic acid-guided nuclease disclosed herein comprises an amino acid sequence comprising at least 50% amino acid identity to any one of SEQ ID NO: 1-20. In some instances, a nuclease comprises an amino acid sequence comprising at least about 10%, 20%, 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, or 100% amino acid identity to any one of SEQ ID NO: 1-20. In some cases, the nucleic acid-guided nuclease comprises at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, amino acid identity to any one of SEQ ID NO: 1-20. In some cases, the nucleic acid-guided nuclease comprises at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, amino acid identity to any one of SEQ ID NO: 1-8 or 10-12. In some cases, the nucleic acid-guided nuclease comprises at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, amino acid identity to any one of SEQ ID NO: 1-8 or 10-11. In some cases, the nucleic acid-guided nuclease comprises at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, amino acid identity to SEQ ID NO: 2. In some cases, the nucleic acid-guided nuclease comprises at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, amino acid identity to SEQ ID NO: 7.
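The identity thresholds recited above can be illustrated with a minimal Python sketch. The disclosure does not state which definition of percent identity or which alignment tool is used, so the sketch assumes two sequences that have already been aligned and counts only columns in which neither sequence has a gap. The function names and the short toy sequences are hypothetical and are not taken from the sequence listing.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over aligned columns where neither sequence has a gap.

    Assumption: the inputs are rows of an existing pairwise alignment.
    Other definitions (e.g. dividing by the full alignment length) give
    different values for the same alignment.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue  # skip gap columns
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no aligned residue pairs to compare")
    return 100.0 * matches / compared


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 50.0) -> bool:
    """True when percent identity is at least the given threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Toy aligned fragments (hypothetical, for illustration only).
    print(percent_identity("MKT-LL", "MKSALL"))
    print(meets_identity_threshold("MKT-LL", "MKSALL", threshold=50.0))
```

The same comparison applies to the nucleic acid identity thresholds, with nucleotide rather than amino acid alignments as input.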


In some cases, the nucleic acid-guided nuclease comprises any one of SEQ ID NO: 1-20. In some cases, the nucleic acid-guided nuclease comprises any one of SEQ ID NO: 1-8 or 10-12. In some cases, the nucleic acid-guided nuclease comprises any one of SEQ ID NO: 1-8 or 10-11. In some cases, the nucleic acid-guided nuclease comprises SEQ ID NO: 2. In some cases, the nucleic acid-guided nuclease comprises SEQ ID NO: 7.


In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110. In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 50% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110. In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 45% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110. In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 40% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110. In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 35% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110. In some instances, a nucleic acid-guided nuclease comprises an amino acid sequence comprising at most 30% amino acid identity to any one of SEQ ID NO: 12 or SEQ ID NO: 108-110.


In some instances, a nucleic acid-guided nuclease disclosed herein is encoded by a nucleic acid sequence comprising at least 50% sequence identity to any one of SEQ ID NO: 21-40. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 10%, 20%, 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, or 100% sequence identity to any one of SEQ ID NO: 21-40. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, sequence identity to any one of SEQ ID NO: 21-40. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, sequence identity to any one of SEQ ID NO: 21-40. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, sequence identity to any one of SEQ ID NO: 21-28 or 30-32. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, sequence identity to any one of SEQ ID NO: 21-28 or 30-31. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, sequence identity to SEQ ID NO: 22. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, sequence identity to SEQ ID NO: 27.


In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 21-40. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 21-28 or 30-32. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 21-28 or 30-31. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 22. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 27.


In some instances, a nucleic acid-guided nuclease disclosed herein is encoded on a nucleic acid sequence. Such a nucleic acid can be codon optimized for expression in a desired host cell. Suitable host cells can include, as non-limiting examples, prokaryotic cells such as E. coli, P. aeruginosa, B. subtilis, and V. natriegens, and eukaryotic cells such as S. cerevisiae, plant cells, insect cells, nematode cells, amphibian cells, fish cells, or mammalian cells, including human cells.
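As a rough illustration of codon optimization, the following Python sketch back-translates a protein sequence using one chosen codon per amino acid. The disclosure does not describe the optimization procedure used, so this is only one simple approach. The table PREFERRED_CODON is a hypothetical placeholder covering a handful of residues, not measured codon usage for any host, and practical optimization typically weighs further factors beyond codon choice.

```python
# Hypothetical placeholder table: one codon per residue, for illustration only.
# Each codon does encode the listed residue in the standard genetic code.
PREFERRED_CODON = {
    "M": "ATG",
    "K": "AAA",
    "L": "CTG",
    "S": "AGC",
    "A": "GCG",
    "*": "TAA",  # stop
}


def back_translate(protein: str, table: dict = PREFERRED_CODON) -> str:
    """Replace each residue with the single codon listed for it in `table`."""
    try:
        return "".join(table[aa] for aa in protein.upper())
    except KeyError as exc:
        raise ValueError(f"no codon listed for residue {exc.args[0]!r}") from None


def translate(dna: str, table: dict = PREFERRED_CODON) -> str:
    """Round-trip check: translate DNA built only from codons in `table`."""
    codon_to_aa = {codon: aa for aa, codon in table.items()}
    return "".join(codon_to_aa[dna[i:i + 3]] for i in range(0, len(dna), 3))


if __name__ == "__main__":
    dna = back_translate("MKLSA*")
    print(dna)
    print(translate(dna))
```

The round trip through translate confirms that the recoded nucleic acid still encodes the same protein, which is the invariant any codon optimization must preserve.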


A nucleic acid sequence encoding a nucleic acid-guided nuclease can be codon optimized for expression in gram positive bacteria, e.g., Bacillus subtilis, or gram negative bacteria, e.g., E. coli. In some instances, a nucleic acid-guided nuclease disclosed herein is encoded by a nucleic acid sequence comprising at least 50% sequence identity to any one of SEQ ID NO: 41-60. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 10%, 20%, 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, or 100% sequence identity to any one of SEQ ID NO: 41-60. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 41-60. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 41-48 or 50-52. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 41-48 or 50-51. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to SEQ ID NO: 42. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to SEQ ID NO: 47.


In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 41-60. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 41-48 or 50-52. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 41-48 or 50-51. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 42. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 47.


A nucleic acid sequence encoding a nucleic acid-guided nuclease can be codon optimized for expression in a species of yeast, e.g., S. cerevisiae. In some instances, a nucleic acid-guided nuclease disclosed herein is encoded by a nucleic acid sequence comprising at least 50% sequence identity to any one of SEQ ID NO: 127-146. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 10%, 20%, 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, or 100% sequence identity to any one of SEQ ID NO: 127-146. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 127-146. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 127-146. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 127-134 or 136-138. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 127-134 or 136-137. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to SEQ ID NO: 128. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to SEQ ID NO: 133.


In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 127-146. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 127-134 or 136-138. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 127-134 or 136-137. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 128. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 133.


A nucleic acid sequence encoding a nucleic acid-guided nuclease can be codon optimized for expression in mammalian cells. In some instances, a nucleic acid-guided nuclease disclosed herein is encoded by a nucleic acid sequence comprising at least 50% sequence identity to any one of SEQ ID NO: 147-166. In some instances, a nuclease is encoded by a nucleic acid sequence comprising at least about 10%, 20%, 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95%, or 100% sequence identity to any one of SEQ ID NO: 147-166. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 147-166. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 147-154 or 156-158. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to any one of SEQ ID NO: 147-154 or 156-157. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to SEQ ID NO: 148. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence comprising at least about 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, greater than 95% sequence identity to SEQ ID NO: 153.


In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 147-166. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 147-154 or 156-158. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of any one of SEQ ID NO: 147-154 or 156-157. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 148. In some cases, the nucleic acid-guided nuclease is encoded by the nucleic acid sequence of SEQ ID NO: 153.


A nucleic acid sequence encoding a nucleic acid-guided nuclease can be operably linked to a promoter. Such nucleic acid sequences can be linear or circular. The nucleic acid sequences can be comprised on a larger linear or circular nucleic acid sequence that comprises additional elements such as an origin of replication, selectable or screenable marker, terminator, other components of a targetable nuclease system, such as a guide nucleic acid, or an editing or recorder cassette as disclosed herein. These larger nucleic acid sequences can be recombinant expression vectors, as are described in more detail later.


Guide Nucleic Acid


In general, a guide nucleic acid can complex with a compatible nucleic acid-guided nuclease and can hybridize with a target sequence, thereby directing the nuclease to the target sequence. A subject nucleic acid-guided nuclease capable of complexing with a guide nucleic acid can be referred to as a nucleic acid-guided nuclease that is compatible with the guide nucleic acid. Likewise, a guide nucleic acid capable of complexing with a nucleic acid-guided nuclease can be referred to as a guide nucleic acid that is compatible with the nucleic acid-guided nuclease.


A guide nucleic acid can be DNA. A guide nucleic acid can be RNA. A guide nucleic acid can comprise both DNA and RNA. A guide nucleic acid can comprise modified or non-naturally occurring nucleotides. In cases where the guide nucleic acid comprises RNA, the RNA guide nucleic acid can be encoded by a DNA sequence on a polynucleotide molecule such as a plasmid, linear construct, or editing cassette as disclosed herein.


A guide nucleic acid can comprise a guide sequence. A guide sequence is a polynucleotide sequence having sufficient complementarity with a target polynucleotide sequence to hybridize with the target sequence and direct sequence-specific binding of a complexed nucleic acid-guided nuclease to the target sequence. The degree of complementarity between a guide sequence and its corresponding target sequence, when optimally aligned using a suitable alignment algorithm, is about or more than about 50%, 60%, 75%, 80%, 85%, 90%, 95%, 97.5%, 99%, or more. Optimal alignment may be determined with the use of any suitable algorithm for aligning sequences. In some embodiments, a guide sequence is about or more than about 5, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, 75, or more nucleotides in length. In some embodiments, a guide sequence is less than about 75, 50, 45, 40, 35, 30, 25, 20 nucleotides in length. Preferably the guide sequence is 10-30 nucleotides long. The guide sequence can be 15-20 nucleotides in length. The guide sequence can be 15 nucleotides in length. The guide sequence can be 16 nucleotides in length. The guide sequence can be 17 nucleotides in length. The guide sequence can be 18 nucleotides in length. The guide sequence can be 19 nucleotides in length. The guide sequence can be 20 nucleotides in length.
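
The degree of complementarity and the preferred length range described above can be illustrated with a short sketch. It assumes an ungapped, equal-length comparison between the guide sequence and the strand it hybridizes to, treats U as T, and uses hypothetical function names; a real workflow would rely on a suitable alignment algorithm as the text indicates.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA or RNA sequence (U is treated as T)."""
    return seq.upper().replace("U", "T").translate(COMPLEMENT)[::-1]


def percent_complementarity(guide: str, target: str) -> float:
    """Percent of guide positions that base-pair with the target strand.

    Assumption: `guide` and `target` are both written 5'->3', have the same
    length, and are compared without gaps.
    """
    guide = guide.upper().replace("U", "T")
    if not guide or len(guide) != len(target):
        raise ValueError("guide and target must be non-empty and equal in length")
    perfect_guide = reverse_complement(target)
    paired = sum(1 for g, p in zip(guide, perfect_guide) if g == p)
    return 100.0 * paired / len(guide)


def in_preferred_length_range(guide: str, low: int = 10, high: int = 30) -> bool:
    """True when the guide length falls in the 10-30 nucleotide range."""
    return low <= len(guide) <= high
```

A guide with a single mismatch over 20 positions would score 95% under this convention, which still falls within several of the recited complementarity levels.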


A guide nucleic acid can comprise a scaffold sequence. In general, a “scaffold sequence” includes any sequence that has sufficient sequence to promote formation of a targetable nuclease complex, wherein the targetable nuclease complex comprises a nucleic acid-guided nuclease and a guide nucleic acid comprising a scaffold sequence and a guide sequence. Sufficient sequence within the scaffold sequence to promote formation of a targetable nuclease complex may include a degree of complementarity along the length of two sequence regions within the scaffold sequence, such as one or two sequence regions involved in forming a secondary structure. In some cases, the one or two sequence regions are comprised or encoded on the same polynucleotide. In some cases, the one or two sequence regions are comprised or encoded on separate polynucleotides. Optimal alignment may be determined by any suitable alignment algorithm, and may further account for secondary structures, such as self-complementarity within either the one or two sequence regions. In some embodiments, the degree of complementarity between the one or two sequence regions along the length of the shorter of the two when optimally aligned is about or more than about 25%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 97.5%, 99%, or higher. In some embodiments, at least one of the two sequence regions is about or more than about 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 40, 50, or more nucleotides in length.


A scaffold sequence of a subject guide nucleic acid can comprise a secondary structure. A secondary structure can comprise a pseudoknot region. In some cases, binding kinetics of a guide nucleic acid to a nucleic acid-guided nuclease is determined in part by secondary structures within the scaffold sequence. In some cases, binding kinetics of a guide nucleic acid to a nucleic acid-guided nuclease is determined in part by nucleic acid sequence within the scaffold sequence.


A scaffold sequence can comprise the sequence of any one of SEQ ID NO: 84-107. A scaffold sequence can comprise the sequence of any one of SEQ ID NO: 84-103. A scaffold sequence can comprise the sequence of any one of SEQ ID NO: 84-91 or 93-95. A scaffold sequence can comprise the sequence of any one of SEQ ID NO: 88, 93, 94, or 95. A scaffold sequence can comprise the sequence of SEQ ID NO: 88. A scaffold sequence can comprise the sequence of SEQ ID NO: 93. A scaffold sequence can comprise the sequence of SEQ ID NO: 94. A scaffold sequence can comprise the sequence of SEQ ID NO: 95.


In some aspects, the invention provides a nuclease that binds to a guide nucleic acid comprising a conserved scaffold sequence. For example, the nucleic acid-guided nucleases for use in the present disclosure can bind to a conserved pseudoknot region as shown in FIG. 13A. Specifically, the nucleic acid-guided nucleases for use in the present disclosure can bind to a guide nucleic acid comprising a conserved pseudoknot region as shown in FIG. 13A. Certain nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-1 (SEQ ID NO: 172). Other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-3 (SEQ ID NO: 173). Still other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-4 (SEQ ID NO: 174). Other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-5 (SEQ ID NO: 175). Other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-6 (SEQ ID NO: 176). Still other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-7 (SEQ ID NO: 177). 
Other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-8 (SEQ ID NO: 178). Other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-10 (SEQ ID NO: 179). Still other nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-11 (SEQ ID NO: 180). Certain nucleic acid-guided nucleases for use in the present disclosure can bind to a pseudoknot region having at least 75%, 80%, 85%, 90%, 95%, or 100% sequence identity to the pseudoknot region of Scaffold-12 (SEQ ID NO: 181).


A guide nucleic acid can comprise the sequence of any one of SEQ ID NO: 84-107. A guide nucleic acid can comprise the sequence of any one of SEQ ID NO: 84-103. A guide nucleic acid can comprise the sequence of any one of SEQ ID NO: 84-91 or 93-95. A guide nucleic acid can comprise the sequence of any one of SEQ ID NO: 88, 93, 94, or 95. A guide nucleic acid can comprise the sequence of SEQ ID NO: 88. A guide nucleic acid can comprise the sequence of SEQ ID NO: 93. A guide nucleic acid can comprise the sequence of SEQ ID NO: 94. A guide nucleic acid can comprise the sequence of SEQ ID NO: 95.


In aspects of the invention the term “guide nucleic acid” refers to one or more polynucleotides comprising 1) a guide sequence capable of hybridizing to a target sequence and 2) a scaffold sequence capable of interacting with or complexing with a nucleic acid-guided nuclease as described herein. A guide nucleic acid may be provided as one or more nucleic acids. In specific embodiments, the guide sequence and the scaffold sequence are provided as a single polynucleotide.


A guide nucleic acid can be compatible with a nucleic acid-guided nuclease when the two elements can form a functional targetable nuclease complex capable of cleaving a target sequence. Often, a compatible scaffold sequence for a compatible guide nucleic acid can be found by scanning sequences adjacent to a native nucleic acid-guided nuclease locus. In other words, native nucleic acid-guided nucleases can be encoded on a genome within proximity to a corresponding compatible guide nucleic acid or scaffold sequence.


Nucleic acid-guided nucleases can be compatible with guide nucleic acids that are not found within the nuclease's endogenous host. Such orthogonal guide nucleic acids can be determined by empirical testing. Orthogonal guide nucleic acids can come from different bacterial species or be synthetic or otherwise engineered to be non-naturally occurring.


Orthogonal guide nucleic acids that are compatible with a common nucleic acid-guided nuclease can comprise one or more common features. Common features can include sequence outside a pseudoknot region. Common features can include a pseudoknot region. Common features can include a primary sequence or secondary structure.


A guide nucleic acid can be engineered to target a desired target sequence by altering the guide sequence such that the guide sequence is complementary to the target sequence, thereby allowing hybridization between the guide sequence and the target sequence. A guide nucleic acid with an engineered guide sequence can be referred to as an engineered guide nucleic acid. Engineered guide nucleic acids are often non-naturally occurring and are not found in nature.
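
Retargeting by altering only the guide sequence can be sketched as a small data structure. This is an illustration under stated assumptions: the class and function names are hypothetical, the scaffold string used in any example is a placeholder rather than a sequence from the sequence listing, and placing the scaffold 5' of the guide sequence is an assumption, since the orientation depends on the nuclease.

```python
from dataclasses import dataclass, replace

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


@dataclass(frozen=True)
class GuideNucleicAcid:
    scaffold: str        # promotes complex formation with a compatible nuclease
    guide_sequence: str  # hybridizes with the target sequence

    def full_sequence(self) -> str:
        # Assumption: scaffold is 5' of the guide sequence.
        return self.scaffold + self.guide_sequence


def retarget(guide: GuideNucleicAcid, target_strand: str) -> GuideNucleicAcid:
    """Return an engineered guide whose guide sequence is complementary to
    `target_strand` (written 5'->3'); the scaffold is left unchanged."""
    return replace(guide, guide_sequence=reverse_complement(target_strand))
```

Because the dataclass is frozen, `retarget` returns a new object and leaves the starting guide untouched, mirroring the idea that one compatible scaffold can be reused across many engineered guide sequences.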


Targetable Nuclease System


Disclosed herein are targetable nuclease systems. A targetable nuclease system can comprise a nucleic acid-guided nuclease and a compatible guide nucleic acid. A targetable nuclease system can comprise a nucleic acid-guided nuclease or a polynucleotide sequence encoding the nucleic acid-guided nuclease. A targetable nuclease system can comprise a guide nucleic acid or a polynucleotide sequence encoding the guide nucleic acid.


In general, a targetable nuclease system as disclosed herein is characterized by elements that promote the formation of a targetable nuclease complex at the site of a target sequence, wherein the targetable nuclease complex comprises a nucleic acid-guided nuclease and a guide nucleic acid.


A guide nucleic acid together with a nucleic acid-guided nuclease forms a targetable nuclease complex which is capable of binding to a target sequence within a target polynucleotide, as determined by the guide sequence of the guide nucleic acid.


In general, to generate a double-stranded break, a targetable nuclease complex in most cases must bind to a target sequence as determined by the guide nucleic acid, and the nuclease must recognize a protospacer adjacent motif (PAM) sequence adjacent to the target sequence.


A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-20 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-8 or 10-12 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-8 or 10-11 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid. In any of these cases, the guide nucleic acid can comprise a scaffold sequence compatible with the nucleic acid-guided nuclease. In any of these cases, the guide nucleic acid can further comprise a guide sequence. The guide sequence can be engineered to target any desired target sequence. The guide sequence can be engineered to be complementary to any desired target sequence. The guide sequence can be engineered to hybridize to any desired target sequence.


A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-20 and a compatible guide nucleic acid comprising any one of SEQ ID NO: 84-107. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-8 or 10-12 and a compatible guide nucleic acid comprising any one of SEQ ID NO: 84-95. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of any one of SEQ ID NO: 1-8 or 10-11 and a compatible guide nucleic acid comprising any one of SEQ ID NO: 84-91 or 93-95. In any of these cases, the guide nucleic acid can further comprise a guide sequence. The guide sequence can be engineered to target any desired target sequence. The guide sequence can be engineered to be complementary to any desired target sequence. The guide sequence can be engineered to hybridize to any desired target sequence.


A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid comprising any one of SEQ ID NO: 88, 93, 94, or 95. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid comprising SEQ ID NO: 88. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid comprising SEQ ID NO: 93. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid comprising SEQ ID NO: 94. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 2 and a compatible guide nucleic acid comprising SEQ ID NO: 95. In any of these cases, the guide nucleic acid can further comprise a guide sequence. The guide sequence can be engineered to target any desired target sequence. The guide sequence can be engineered to be complementary to any desired target sequence. The guide sequence can be engineered to hybridize to any desired target sequence.


A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid comprising any one of SEQ ID NO: 88, 93, 94, or 95. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid comprising SEQ ID NO: 88. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid comprising SEQ ID NO: 93. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid comprising SEQ ID NO: 94. A targetable nuclease complex can comprise a nucleic acid-guided nuclease of SEQ ID NO: 7 and a compatible guide nucleic acid comprising SEQ ID NO: 95. In any of these cases, the guide nucleic acid can further comprise a guide sequence. The guide sequence can be engineered to target any desired target sequence. The guide sequence can be engineered to be complementary to any desired target sequence. The guide sequence can be engineered to hybridize to any desired target sequence.


A target sequence of a targetable nuclease complex can be any polynucleotide endogenous or exogenous to a prokaryotic or eukaryotic cell, or in vitro. For example, the target sequence can be a polynucleotide residing in the nucleus of the eukaryotic cell. A target sequence can be a sequence coding a gene product (e.g., a protein) or a non-coding sequence (e.g., a regulatory polynucleotide or junk DNA). Without wishing to be bound by theory, it is believed that the target sequence should be associated with a PAM; that is, a short sequence recognized by a targetable nuclease complex. The precise sequence and length requirements for a PAM differ depending on the nucleic acid-guided nuclease used, but PAMs are typically 2-5 base pair sequences adjacent to the target sequence. Examples of PAM sequences are given in the examples section below, and the skilled person will be able to identify further PAM sequences for use with a given nucleic acid-guided nuclease. Further, engineering of the PAM Interacting (PI) domain may allow programming of PAM specificity, improve target site recognition fidelity, and increase the versatility of a nucleic acid-guided nuclease genome engineering platform. Nucleic acid-guided nucleases may be engineered to alter their PAM specificity, for example as described in Kleinstiver B P et al. Engineered CRISPR-Cas9 nucleases with altered PAM specificities. Nature. 2015 Jul. 23; 523 (7561): 481-5. doi: 10.1038/nature14592.


A PAM site is a nucleotide sequence in proximity to a target sequence. In most cases, a nucleic acid-guided nuclease can only cleave a target sequence if an appropriate PAM is present. PAMs are nucleic acid-guided nuclease-specific and can be different between two different nucleic acid-guided nucleases. A PAM can be 5′ or 3′ of a target sequence. A PAM can be upstream or downstream of a target sequence. A PAM can be 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nucleotides in length. Often, a PAM is between 2-6 nucleotides in length.
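
Locating candidate target sequences next to a PAM, on either the 5′ or the 3′ side, can be sketched as follows. This is an illustration only: the function name, the default 20-nucleotide target length, and the PAM patterns used in any call are assumptions, and the patterns are placeholders rather than the PAM of any particular nuclease disclosed herein. Only the strand supplied is scanned.

```python
import re

# IUPAC nucleotide ambiguity codes expressed as regular-expression classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}


def find_targets(sequence: str, pam: str, target_len: int = 20,
                 pam_side: str = "5prime"):
    """Return (pam_start, pam, target) tuples for every PAM occurrence.

    pam_side="5prime": the PAM lies immediately 5' of the target sequence.
    pam_side="3prime": the PAM lies immediately 3' of the target sequence.
    """
    if pam_side not in ("5prime", "3prime"):
        raise ValueError("pam_side must be '5prime' or '3prime'")
    seq = sequence.upper()
    pam_regex = "".join(IUPAC[base] for base in pam.upper())
    # A lookahead keeps overlapping PAM occurrences from being skipped.
    pattern = re.compile(f"(?=({pam_regex}))")
    hits = []
    for match in pattern.finditer(seq):
        pam_start = match.start()
        pam_end = pam_start + len(pam)
        if pam_side == "5prime":
            start, end = pam_end, pam_end + target_len
        else:
            start, end = pam_start - target_len, pam_start
        # Keep only sites where a full-length target fits in the sequence.
        if start >= 0 and end <= len(seq):
            hits.append((pam_start, seq[pam_start:pam_end], seq[start:end]))
    return hits
```

To cover both strands, the same function could be applied to the reverse complement of the input, with coordinates mapped back to the original strand.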


In some examples, a PAM can be provided on a separate oligonucleotide. In such cases, providing the PAM on an oligonucleotide allows cleavage of a target sequence that otherwise could not be cleaved because no adjacent PAM is present on the same polynucleotide as the target sequence.


Polynucleotide sequences encoding a component of a targetable nuclease system can comprise one or more vectors. In general, the term “vector” refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. Vectors include, but are not limited to, nucleic acid molecules that are single-stranded, double-stranded, or partially double-stranded; nucleic acid molecules that comprise one or more free ends, no free ends (e.g. circular); nucleic acid molecules that comprise DNA, RNA, or both; and other varieties of polynucleotides known in the art. One type of vector is a “plasmid,” which refers to a circular double stranded DNA loop into which additional DNA segments can be inserted, such as by standard molecular cloning techniques. Another type of vector is a viral vector, wherein virally-derived DNA or RNA sequences are present in the vector for packaging into a virus (e.g. retroviruses, replication defective retroviruses, adenoviruses, replication defective adenoviruses, and adeno-associated viruses). Viral vectors also include polynucleotides carried by a virus for transfection into a host cell. Certain vectors are capable of autonomous replication in a host cell into which they are introduced (e.g. bacterial vectors having a bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome. Moreover, certain vectors are capable of directing the expression of genes to which they are operatively-linked. Such vectors are referred to herein as “expression vectors.” Common expression vectors of utility in recombinant DNA techniques are often in the form of plasmids. Further discussion of vectors is provided herein.


Recombinant expression vectors can comprise a nucleic acid of the invention in a form suitable for expression of the nucleic acid in a host cell, which means that the recombinant expression vectors include one or more regulatory elements, which may be selected on the basis of the host cells to be used for expression, that are operatively-linked to the nucleic acid sequence to be expressed. Within a recombinant expression vector, “operably linked” is intended to mean that the nucleotide sequence of interest is linked to the regulatory element(s) in a manner that allows for expression of the nucleotide sequence (e.g. in an in vitro transcription/translation system or in a host cell when the vector is introduced into the host cell). With regard to recombination and cloning methods, mention is made of U.S. patent application Ser. No. 10/815,730, published Sep. 2, 2004 as US 2004-0171156 A1, the contents of which are herein incorporated by reference in their entirety.


In some embodiments, a regulatory element is operably linked to one or more elements of a targetable nuclease system so as to drive expression of the one or more components of the targetable nuclease system.


In some embodiments, a vector comprises a regulatory element operably linked to a polynucleotide sequence encoding a nucleic acid-guided nuclease. The polynucleotide sequence encoding the nucleic acid-guided nuclease can be codon optimized for expression in particular cells, such as prokaryotic or eukaryotic cells. Eukaryotic cells can be yeast, fungi, algae, plant, animal, or human cells. Eukaryotic cells may be those of or derived from a particular organism, such as a mammal, including but not limited to human, mouse, rat, rabbit, dog, or non-human mammal including non-human primate.


In general, codon optimization refers to a process of modifying a nucleic acid sequence for enhanced expression in the host cells of interest by replacing at least one codon (e.g. about or more than about 1, 2, 3, 4, 5, 10, 15, 20, 25, 50, or more codons) of the native sequence with codons that are more frequently or most frequently used in the genes of that host cell while maintaining the native amino acid sequence. Various species exhibit particular bias for certain codons of a particular amino acid. Codon bias (differences in codon usage between organisms) often correlates with the efficiency of translation of messenger RNA (mRNA), which is in turn believed to be dependent on, among other things, the properties of the codons being translated and the availability of particular transfer RNA (tRNA) molecules. The predominance of selected tRNAs in a cell is generally a reflection of the codons used most frequently in peptide synthesis. Accordingly, genes can be tailored for optimal gene expression in a given organism based on codon optimization. Codon usage tables are readily available, for example, at the “Codon Usage Database” available at www.kazusa.or.jp/codon/ (visited Jul. 9, 2002), and these tables can be adapted in a number of ways. See Nakamura, Y., et al. “Codon usage tabulated from the international DNA sequence databases: status for the year 2000” Nucl. Acids Res. 28:292 (2000). Computer algorithms for codon optimizing a particular sequence for expression in a particular host cell, such as Gene Forge (Aptagen; Jacobus, Pa.), are also available. In some embodiments, one or more codons (e.g. 1, 2, 3, 4, 5, 10, 15, 20, 25, 50, or more, or all codons) in a sequence encoding an engineered nuclease correspond to the most frequently used codon for a particular amino acid.
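
The core idea of codon optimization, replacing codons with the host's most frequently used synonymous codons while keeping the amino acid sequence, can be sketched as below. This is a simplified illustration and not the algorithm of any named tool: it assumes the standard genetic code, derives codon preferences by counting codons in a user-supplied set of host coding sequences instead of reading a published usage table, and always picks the single most frequent codon. All function names are hypothetical.

```python
from collections import Counter
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def codons(cds: str):
    """Split a coding sequence into complete codons (trailing bases dropped)."""
    cds = cds.upper()
    return [cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3)]


def translate(cds: str) -> str:
    return "".join(CODON_TABLE[c] for c in codons(cds))


def preferred_codons(reference_cds):
    """Most frequent codon per amino acid, counted over host coding sequences.

    Ties, and amino acids absent from the reference, fall back to the first
    codon in table order.
    """
    counts = Counter(c for cds in reference_cds for c in codons(cds))
    best = {}
    for codon, aa in CODON_TABLE.items():
        if aa not in best or counts[codon] > counts[best[aa]]:
            best[aa] = codon
    return best


def codon_optimize(cds: str, preferred) -> str:
    """Replace every codon with the preferred synonymous codon."""
    return "".join(preferred[CODON_TABLE[c]] for c in codons(cds))
```

Because every replacement codon is looked up through the amino acid it encodes, `translate` gives the same protein before and after optimization; practical tools typically also weigh factors such as GC content and mRNA secondary structure, which this sketch ignores.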


In some embodiments, a vector encodes a nucleic acid-guided nuclease comprising one or more nuclear localization sequences (NLSs), such as about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more NLSs. In some embodiments, the engineered nuclease comprises about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more NLSs at or near the amino-terminus, about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more NLSs at or near the carboxy-terminus, or a combination of these (e.g. one or more NLS at the amino-terminus and one or more NLS at the carboxy terminus). When more than one NLS is present, each may be selected independently of the others, such that a single NLS may be present in more than one copy and/or in combination with one or more other NLSs present in one or more copies. In a preferred embodiment of the invention, the engineered nuclease comprises at most 6 NLSs. In some embodiments, an NLS is considered near the N- or C-terminus when the nearest amino acid of the NLS is within about 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50, or more amino acids along the polypeptide chain from the N- or C-terminus. Non-limiting examples of NLSs include an NLS sequence derived from: the NLS of the SV40 virus large T-antigen, having the amino acid sequence PKKKRKV (SEQ ID NO: 111); the NLS from nucleoplasmin (e.g. the nucleoplasmin bipartite NLS with the sequence KRPAATKKAGQAKKKK (SEQ ID NO: 112)); the c-myc NLS having the amino acid sequence PAAKRVKLD (SEQ ID NO: 113) or RQRRNELKRSP (SEQ ID NO: 114); the hRNPA1 M9 NLS having the sequence NQSSNFGPMKGGNFGGRSSGPYGGGGQYFAKPRNQGGY (SEQ ID NO: 115); the sequence RMRIZFKNKGKDTAELRRRRVEVSVELRKAKKDEQILKRRNV (SEQ ID NO: 116) of the IBB domain from importin-alpha; the sequences VSRKRPRP (SEQ ID NO: 117) and PPKKARED (SEQ ID NO: 118) of the myoma T protein; the sequence PQPKKKPL (SEQ ID NO: 119) of human p53; the sequence SALIKKKKKMAP (SEQ ID NO: 120) of mouse c-abl IV; the sequences DRLRR (SEQ ID NO: 121) and PKQKKRK (SEQ ID NO: 122) of the influenza virus NS1; the sequence RKLKKKIKKL (SEQ ID NO: 123) of the Hepatitis virus delta antigen; the sequence REKKKFLKRR (SEQ ID NO: 124) of the mouse Mx1 protein; the sequence KRKGDEVDGVDEVAKKKSKK (SEQ ID NO: 125) of the human poly(ADP-ribose) polymerase; and the sequence RKCLQAGMNLEARKTKK (SEQ ID NO: 126) of the steroid hormone receptors (human) glucocorticoid.
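
Whether an NLS counts as "near" a terminus can be checked with a short sketch. It assumes that distance is measured as the number of residues lying between the terminus and the nearest residue of the NLS, that only the first occurrence of the NLS is considered, and that the window size is chosen by the user; the function names are hypothetical.

```python
def nls_distances(protein: str, nls: str):
    """Return (residues before the NLS, residues after the NLS), or None
    when the NLS does not occur. Only the first occurrence is examined."""
    start = protein.find(nls)
    if start < 0:
        return None
    n_distance = start                                # residues between N-terminus and NLS
    c_distance = len(protein) - (start + len(nls))    # residues between NLS and C-terminus
    return n_distance, c_distance


def near_terminus(protein: str, nls: str, window: int = 50):
    """Return 'N', 'C', 'both', or None depending on which terminus lies
    within `window` residues of the NLS."""
    distances = nls_distances(protein, nls)
    if distances is None:
        return None
    near_n = distances[0] <= window
    near_c = distances[1] <= window
    if near_n and near_c:
        return "both"
    if near_n:
        return "N"
    if near_c:
        return "C"
    return None
```

With the SV40 large T-antigen NLS PKKKRKV placed directly after an initiator methionine, `near_terminus` reports the N-terminus for any window of at least one residue.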


In general, the one or more NLSs are of sufficient strength to drive accumulation of the nucleic acid-guided nuclease in a detectable amount in the nucleus of a eukaryotic cell. In general, strength of nuclear localization activity may derive from the number of NLSs, the particular NLS(s) used, or a combination of these factors. Detection of accumulation in the nucleus may be performed by any suitable technique. For example, a detectable marker may be fused to the nucleic acid-guided nuclease, such that location within a cell may be visualized, such as in combination with a means for detecting the location of the nucleus (e.g. a stain specific for the nucleus such as DAPI). Cell nuclei may also be isolated from cells, the contents of which may then be analyzed by any suitable process for detecting protein, such as immunohistochemistry, Western blot, or enzyme activity assay. Accumulation in the nucleus may also be determined indirectly, such as by an assay for the effect of the nucleic acid-guided nuclease complex formation (e.g. assay for DNA cleavage or mutation at the target sequence, or assay for altered gene expression activity affected by targetable nuclease complex formation and/or nucleic acid-guided nuclease activity), as compared to a control not exposed to the nucleic acid-guided nuclease or targetable nuclease complex, or exposed to a nucleic acid-guided nuclease lacking the one or more NLSs.


A nucleic acid-guided nuclease and one or more guide nucleic acids can be delivered either as DNA or RNA. Delivery of a nucleic acid-guided nuclease and guide nucleic acid both as RNA (unmodified or containing base or backbone modifications) molecules can be used to reduce the amount of time that the nucleic acid-guided nuclease persists in the cell. This may reduce the level of off-target cleavage activity in the target cell. Since a nucleic acid-guided nuclease delivered as mRNA takes time to be translated into protein, it might be advantageous to deliver the guide nucleic acid several hours following the delivery of the nucleic acid-guided nuclease mRNA, to maximize the level of guide nucleic acid available for interaction with the nucleic acid-guided nuclease protein. In other cases, the nucleic acid-guided nuclease mRNA and guide nucleic acid are delivered concomitantly. In other examples, the guide nucleic acid is delivered sequentially, such as 0.5, 1, 2, 3, 4, or more hours after the nucleic acid-guided nuclease mRNA.


In situations where guide nucleic acid amount is limiting, it may be desirable to introduce a nucleic acid-guided nuclease as mRNA and guide nucleic acid in the form of a DNA expression cassette with a promoter driving the expression of the guide nucleic acid. This way the amount of guide nucleic acid available will be amplified via transcription.


Guide nucleic acid in the form of RNA or encoded on a DNA expression cassette can be introduced into a host cell comprising a nucleic acid-guided nuclease encoded on a vector or chromosome. The guide nucleic acid may be provided in the cassette as one or more polynucleotides, which may be contiguous or non-contiguous in the cassette. In specific embodiments, the guide nucleic acid is provided in the cassette as a single contiguous polynucleotide.


A variety of delivery systems can be used to introduce a nucleic acid-guided nuclease (DNA or RNA) and guide nucleic acid (DNA or RNA) into a host cell. These include the use of yeast systems, lipofection systems, microinjection systems, biolistic systems, virosomes, liposomes, immunoliposomes, polycations, lipid:nucleic acid conjugates, virions, artificial virions, viral vectors, electroporation, cell permeable peptides, nanoparticles, nanowires (Shalek et al., Nano Letters, 2012), and exosomes. Molecular Trojan horse liposomes (Pardridge et al., Cold Spring Harb Protoc; 2010; doi:10.1101/pdb.prot5407) may be used to deliver an engineered nuclease and guide nucleic acid across the blood brain barrier.


In some embodiments, an editing template is also provided. An editing template may be a component of a vector as described herein, contained in a separate vector, or provided as a separate polynucleotide, such as an oligonucleotide, linear polynucleotide, or synthetic polynucleotide. In some cases, an editing template is on the same polynucleotide as a guide nucleic acid. In some embodiments, an editing template is designed to serve as a template in homologous recombination, such as within or near a target sequence nicked or cleaved by a nucleic acid-guided nuclease as a part of a complex as disclosed herein. An editing template polynucleotide may be of any suitable length, such as about or more than about 10, 15, 20, 25, 50, 75, 100, 150, 200, 500, 1000, or more nucleotides in length. In some embodiments, the editing template polynucleotide is complementary to a portion of a polynucleotide comprising the target sequence. When optimally aligned, an editing template polynucleotide might overlap with one or more nucleotides of a target sequence (e.g. about or more than about 1, 5, 10, 15, 20, 25, 30, 35, 40, or more nucleotides). In some embodiments, when an editing template sequence and a polynucleotide comprising a target sequence are optimally aligned, the nearest nucleotide of the template polynucleotide is within about 1, 5, 10, 15, 20, 25, 50, 75, 100, 200, 300, 400, 500, 1000, 5000, 10000, or more nucleotides from the target sequence.


In many examples, an editing template comprises at least one mutation compared to the target sequence. An editing template can comprise an insertion, deletion, modification, or any combination thereof compared to the target sequence. Examples of some editing templates are described in more detail in a later section.
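
An editing template carrying a single mutation flanked by sequence matching the target region can be sketched as follows. This is a simplified illustration that handles only a single-base substitution (insertions and deletions are not covered); the function names and the default flank length of 30 nucleotides are assumptions rather than values taken from the disclosure.

```python
def build_editing_template(region: str, position: int, new_base: str,
                           flank: int = 30):
    """Return (start, template): a copy of `region` around `position`
    (0-based) with the base at `position` replaced by `new_base`.

    `start` is the index in `region` where the template begins, so the
    template can be compared against region[start:start + len(template)].
    """
    region = region.upper()
    new_base = new_base.upper()
    if not 0 <= position < len(region):
        raise ValueError("position lies outside the region")
    if len(new_base) != 1 or new_base not in "ACGT":
        raise ValueError("new_base must be a single A, C, G, or T")
    if region[position] == new_base:
        raise ValueError("substitution would not change the sequence")
    start = max(0, position - flank)
    end = min(len(region), position + flank + 1)
    template = region[start:position] + new_base + region[position + 1:end]
    return start, template


def count_differences(a: str, b: str) -> int:
    """Number of mismatched positions between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(1 for x, y in zip(a.upper(), b.upper()) if x != y)
```

Comparing the returned template with the corresponding slice of the original region should give exactly one difference, which is the "at least one mutation compared to the target sequence" in its simplest form.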


In some aspects, the invention provides methods comprising delivering one or more polynucleotides, such as one or more vectors or linear polynucleotides as described herein, one or more transcripts thereof, and/or one or more proteins expressed therefrom, to a host cell. In some aspects, the invention further provides cells produced by such methods, and organisms comprising or produced from such cells. In some embodiments, an engineered nuclease in combination with (and optionally complexed with) a guide nucleic acid is delivered to a cell.


Conventional viral and non-viral based gene transfer methods can be used to introduce nucleic acids in cells, such as prokaryotic cells, eukaryotic cells, mammalian cells, or target tissues. Such methods can be used to administer nucleic acids encoding components of an engineered nucleic acid-guided nuclease system to cells in culture, or in a host organism. Non-viral vector delivery systems include DNA plasmids, RNA (e.g. a transcript of a vector described herein), naked nucleic acid, and nucleic acid complexed with a delivery vehicle, such as a liposome. Viral vector delivery systems include DNA and RNA viruses, which have either episomal or integrated genomes after delivery to the cell. For a review of gene therapy procedures, see Anderson, Science 256:808-813 (1992); Nabel & Felgner, TIBTECH 11:211-217 (1993); Mitani & Caskey, TIBTECH 11:162-166 (1993); Dillon, TIBTECH 11:167-175 (1993); Miller, Nature 357:455-460 (1992); Van Brunt, Biotechnology 6(10):1149-1154 (1988); Vigne, Restorative Neurology and Neuroscience 8:35-36 (1995); Kremer & Perricaudet, British Medical Bulletin 51(1):31-44 (1995); Haddada et al., in Current Topics in Microbiology and Immunology, Doerfler and Böhm (eds) (1995); and Yu et al., Gene Therapy 1:13-26 (1994).


Methods of non-viral delivery of nucleic acids include lipofection, microinjection, biolistics, virosomes, liposomes, immunoliposomes, polycation or lipid:nucleic acid conjugates, naked DNA, artificial virions, and agent-enhanced uptake of DNA. Lipofection is described in, e.g., U.S. Pat. Nos. 5,049,386; 4,946,787; and 4,897,355, and lipofection reagents are sold commercially (e.g., Transfectam™ and Lipofectin™). Cationic and neutral lipids that are suitable for efficient receptor-recognition lipofection of polynucleotides include those of Felgner, WO 91/17424; WO 91/16024. Delivery can be to cells (e.g. in vitro or ex vivo administration) or target tissues (e.g. in vivo administration).


The preparation of lipid:nucleic acid complexes, including targeted liposomes such as immunolipid complexes, is well known to one of skill in the art (see, e.g., Crystal, Science 270:404-410 (1995); Blaese et al., Cancer Gene Ther. 2:291-297 (1995); Behr et al., Bioconjugate Chem. 5:382-389 (1994); Remy et al., Bioconjugate Chem. 5:647-654 (1994); Gao et al., Gene Therapy 2:710-722 (1995); Ahmad et al., Cancer Res. 52:4817-4820 (1992); U.S. Pat. Nos. 4,186,183, 4,217,344, 4,235,871, 4,261,975, 4,485,054, 4,501,728, 4,774,085, 4,837,028, and 4,946,787).


The use of RNA or DNA viral based systems for the delivery of nucleic acids takes advantage of highly evolved processes for targeting a virus to specific cells in culture or in the host and trafficking the viral payload to the nucleus or host cell genome. Viral vectors can be administered directly to cells in culture or to patients (in vivo), or they can be used to treat cells in vitro, and the modified cells may optionally be administered to patients (ex vivo). Conventional viral based systems could include retroviral, lentivirus, adenoviral, adeno-associated and herpes simplex virus vectors for gene transfer. Integration in the host genome is possible with the retrovirus, lentivirus, and adeno-associated virus gene transfer methods, often resulting in long term expression of the inserted transgene. Additionally, high transduction efficiencies have been observed in many different cell types and target tissues.


The tropism of a retrovirus can be altered by incorporating foreign envelope proteins, expanding the potential population of target cells. Lentiviral vectors are retroviral vectors that are able to transduce or infect non-dividing cells and typically produce high viral titers. Selection of a retroviral gene transfer system would therefore depend on the target tissue. Retroviral vectors are comprised of cis-acting long terminal repeats (LTRs) with packaging capacity for up to 6-10 kb of foreign sequence. The minimum cis-acting LTRs are sufficient for replication and packaging of the vectors, which are then used to integrate the therapeutic gene into the target cell to provide permanent transgene expression. Widely used retroviral vectors include those based upon murine leukemia virus (MuLV), gibbon ape leukemia virus (GaLV), simian immunodeficiency virus (SIV), human immunodeficiency virus (HIV), and combinations thereof (see, e.g., Buchscher et al., J. Virol. 66:2731-2739 (1992); Johann et al., J. Virol. 66:1635-1640 (1992); Sommerfelt et al., Virol. 176:58-59 (1990); Wilson et al., J. Virol. 63:2374-2378 (1989); Miller et al., J. Virol. 65:2220-2224 (1991); PCT/US94/05700).


In applications where transient expression is preferred, adenoviral based systems may be used. Adenoviral based vectors are capable of very high transduction efficiency in many cell types and do not require cell division. With such vectors, high titer and levels of expression have been obtained. This vector can be produced in large quantities in a relatively simple system.


Adeno-associated virus (“AAV”) vectors may also be used to transduce cells with target nucleic acids, e.g., in the in vitro production of nucleic acids and peptides, and for in vivo and ex vivo gene therapy procedures (see, e.g., West et al., Virology 160:38-47 (1987); U.S. Pat. No. 4,797,368; WO 93/24641; Kotin, Human Gene Therapy 5:793-801 (1994); Muzyczka, J. Clin. Invest. 94:1351 (1994)). Construction of recombinant AAV vectors is described in a number of publications, including U.S. Pat. No. 5,173,414; Tratschin et al., Mol. Cell. Biol. 5:3251-3260 (1985); Tratschin et al., Mol. Cell. Biol. 4:2072-2081 (1984); Hermonat & Muzyczka, PNAS 81:6466-6470 (1984); and Samulski et al., J. Virol. 63:3822-3828 (1989).


In some embodiments, a host cell is transiently or non-transiently transfected with one or more vectors, linear polynucleotides, polypeptides, nucleic acid-protein complexes, or any combination thereof as described herein. In some embodiments, a cell is transfected in vitro, in culture, or ex vivo. In some embodiments, a cell is transfected as it naturally occurs in a subject. In some embodiments, a cell that is transfected is taken from a subject. In some embodiments, the cell is derived from cells taken from a subject, such as a cell line.


In some embodiments, a cell transfected with one or more vectors, linear polynucleotides, polypeptides, nucleic acid-protein complexes, or any combination thereof as described herein is used to establish a new cell line comprising one or more transfection-derived sequences. In some embodiments, a cell transiently transfected with the components of an engineered nucleic acid-guided nuclease system as described herein (such as by transient transfection of one or more vectors, or transfection with RNA), and modified through the activity of an engineered nuclease complex, is used to establish a new cell line comprising cells containing the modification but lacking any other exogenous sequence.


In some embodiments, one or more vectors described herein are used to produce a non-human transgenic cell, organism, animal, or plant. In some embodiments, the transgenic animal is a mammal, such as a mouse, rat, or rabbit. Methods for producing transgenic cells, organisms, plants, and animals are known in the art, and generally begin with a method of cell transformation or transfection, such as described herein.


Methods of Use


In the context of formation of an engineered nuclease complex, “target sequence” refers to a sequence to which a guide sequence is designed to have complementarity, where hybridization between a target sequence and a guide sequence promotes the formation of an engineered nuclease complex. A target sequence may comprise any polynucleotide, such as DNA, RNA, or a DNA-RNA hybrid. A target sequence can be located in the nucleus or cytoplasm of a cell. A target sequence can be located in vitro or in a cell-free environment.
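

The complementarity relationship between a guide sequence and a target sequence can be illustrated with a minimal Python sketch. It is not part of the disclosed methods: the function names `reverse_complement` and `find_hybridization_sites` are hypothetical, only exact matches on DNA are considered, and PAM requirements and mismatch tolerance are deliberately not modeled.

```python
# Illustrative sketch only (hypothetical helper names, exact matches only).
# A guide hybridizes to a stretch of DNA that is its reverse complement.

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper- or lower-case DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def find_hybridization_sites(guide: str, top_strand: str) -> list[tuple[int, str]]:
    """Return (0-based start on the top strand, strand the guide pairs with).

    "+" means the guide is complementary to the top strand as written;
    "-" means the guide is complementary to the bottom strand.
    """
    guide = guide.upper()
    top_strand = top_strand.upper()
    # Top strand contains rc(guide) -> guide pairs with the top strand.
    # Top strand contains guide     -> guide pairs with the bottom strand.
    queries = (("+", reverse_complement(guide)), ("-", guide))
    hits = []
    for strand, query in queries:
        start = top_strand.find(query)
        while start != -1:
            hits.append((start, strand))
            start = top_strand.find(query, start + 1)
    return sorted(hits)
```

For example, `find_hybridization_sites("GACCTA", "TTGACCTAGCATAGGTCAA")` reports one site on each strand.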


Typically, formation of an engineered nuclease complex comprising a guide nucleic acid hybridized to a target sequence and complexed with one or more engineered nucleases as disclosed herein results in cleavage of one or both strands in or near (e.g. within 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 50, or more base pairs from) the target sequence. Cleavage can occur within a target sequence, 5′ of the target sequence, upstream of a target sequence, 3′ of the target sequence, or downstream of a target sequence.


In some embodiments, one or more vectors driving expression of one or more components of a targetable nuclease system are introduced into a host cell or in vitro such that a targetable nuclease complex forms at one or more target sites. For example, a nucleic acid-guided nuclease and a guide nucleic acid could each be operably linked to separate regulatory elements on separate vectors. Alternatively, two or more of the elements, expressed from the same or different regulatory elements, may be combined in a single vector, with one or more additional vectors providing any components of the targetable nuclease system not included in the first vector. Targetable nuclease system elements that are combined in a single vector may be arranged in any suitable orientation, such as one element located 5′ with respect to (“upstream” of) or 3′ with respect to (“downstream” of) a second element. The coding sequence of one element may be located on the same or opposite strand of the coding sequence of a second element, and oriented in the same or opposite direction. In some embodiments, a single promoter drives expression of a transcript encoding a nucleic acid-guided nuclease and one or more guide nucleic acids. In some embodiments, a nucleic acid-guided nuclease and one or more guide nucleic acids are operably linked to and expressed from the same promoter. In other embodiments, one or more guide nucleic acids or polynucleotides encoding the one or more guide nucleic acids are introduced into a cell or in vitro environment already comprising a nucleic acid-guided nuclease or polynucleotide sequence encoding the nucleic acid-guided nuclease.


When multiple different guide sequences are used, a single expression construct may be used to target nuclease activity to multiple different, corresponding target sequences within a cell or in vitro. For example, a single vector may comprise about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, or more guide sequences. In some embodiments, about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more such guide-sequence-containing vectors may be provided, and optionally delivered to a cell or in vitro.


Methods and compositions disclosed herein may comprise more than one guide nucleic acid, wherein each guide nucleic acid has a different guide sequence, thereby targeting a different target sequence. In such cases, multiple guide nucleic acids can be used in multiplexing, wherein multiple targets are targeted simultaneously. Additionally or alternatively, the multiple guide nucleic acids are introduced into a population of cells, such that each cell in a population receives a different or random guide nucleic acid, thereby targeting multiple different target sequences across a population of cells. In such cases, the collection of subsequently altered cells can be referred to as a library.


Methods and compositions disclosed herein may comprise multiple different nucleic acid-guided nucleases, each with one or more different corresponding guide nucleic acids, thereby allowing targeting of different target sequences by different nucleic acid-guided nucleases. In some such cases, each nucleic acid-guided nuclease can correspond to a distinct plurality of guide nucleic acids, allowing two or more non-overlapping, partially overlapping, or completely overlapping multiplexing events.


In some embodiments, the nucleic acid-guided nuclease has DNA cleavage activity or RNA cleavage activity. In some embodiments, the nucleic acid-guided nuclease directs cleavage of one or both strands at the location of a target sequence, such as within the target sequence and/or within the complement of the target sequence. In some embodiments, the nucleic acid-guided nuclease directs cleavage of one or both strands within about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 50, 100, 200, 500, or more base pairs from the first or last nucleotide of a target sequence.


In some embodiments, a nucleic acid-guided nuclease may form a component of an inducible system. The inducible nature of the system would allow for spatiotemporal control of gene editing or gene expression using a form of energy. The form of energy may include but is not limited to electromagnetic radiation, sound energy, chemical energy, light energy, temperature, and thermal energy. Examples of inducible systems include tetracycline inducible promoters (Tet-On or Tet-Off), small molecule two-hybrid transcription activation systems (FKBP, ABA, etc.), or light inducible systems (phytochrome, LOV domains, or cryptochrome). In one embodiment, the nucleic acid-guided nuclease may be a part of a Light Inducible Transcriptional Effector (LITE) to direct changes in transcriptional activity in a sequence-specific manner. The components of a light inducible system may include a nucleic acid-guided nuclease, a light-responsive cryptochrome heterodimer (e.g. from Arabidopsis thaliana), and a transcriptional activation/repression domain. Further examples of inducible DNA binding proteins and methods for their use are provided in U.S. 61/736,465 and U.S. 61/721,283, each of which is hereby incorporated by reference in its entirety. An inducible system can be temperature inducible such that the system is turned on or off by increasing or decreasing the temperature. In some temperature inducible systems, increasing the temperature turns the system on. In some temperature inducible systems, increasing the temperature turns the system off.


In some aspects, the invention provides for methods of modifying a target sequence in vitro, or in a prokaryotic or eukaryotic cell, which may be in vivo, ex vivo, or in vitro. In some embodiments, the method comprises sampling a cell or population of cells such as prokaryotic cells, or those from a human or non-human animal or plant (including micro-algae), and modifying the cell or cells. Culturing may occur at any stage in vitro or ex vivo. The cell or cells may even be re-introduced into the host, such as a non-human animal or plant (including micro-algae). For re-introduced cells it is particularly preferred that the cells are stem cells.


In some embodiments, the method comprises allowing a targetable nuclease complex to bind to the target sequence to effect cleavage of said target sequence, thereby modifying the target sequence, wherein the targetable nuclease complex comprises a nucleic acid-guided nuclease complexed with a guide nucleic acid wherein the guide sequence of the guide nucleic acid is hybridized to a target sequence within a target polynucleotide.


In some aspects, the invention provides a method of modifying expression of a target polynucleotide in vitro or in a prokaryotic or eukaryotic cell. In some embodiments, the method comprises allowing a targetable nuclease complex to bind to a target sequence within the target polynucleotide such that said binding results in increased or decreased expression of said target polynucleotide; wherein the targetable nuclease complex comprises a nucleic acid-guided nuclease complexed with a guide nucleic acid, and wherein the guide sequence of the guide nucleic acid is hybridized to a target sequence within said target polynucleotide. Similar considerations apply as above for methods of modifying a target polynucleotide. In fact, these sampling, culturing and re-introduction options apply across the aspects of the present invention.


In some aspects, the invention provides kits containing any one or more of the elements disclosed in the above methods and compositions. Elements may be provided individually or in combinations, and may be provided in any suitable container, such as a vial, a bottle, or a tube. In some embodiments, the kit includes instructions in one or more languages, for example in more than one language.


In some embodiments, a kit comprises one or more reagents for use in a process utilizing one or more of the elements described herein. Reagents may be provided in any suitable container. For example, a kit may provide one or more reaction or storage buffers. Reagents may be provided in a form that is usable in a particular assay, or in a form that requires addition of one or more other components before use (e.g. in concentrate or lyophilized form). A buffer can be any buffer, including but not limited to a sodium carbonate buffer, a sodium bicarbonate buffer, a borate buffer, a Tris buffer, a MOPS buffer, a HEPES buffer, and combinations thereof. In some embodiments, the buffer is alkaline. In some embodiments, the buffer has a pH from about 7 to about 10. In some embodiments, the kit comprises one or more oligonucleotides corresponding to a guide sequence for insertion into a vector so as to operably link the guide sequence and a regulatory element. In some embodiments, the kit comprises an editing template.


In some aspects, the invention provides methods for using one or more elements of an engineered targetable nuclease system. A targetable nuclease complex of the disclosure provides an effective means for modifying a target sequence within a target polynucleotide. A targetable nuclease complex of the disclosure has a wide variety of utility including modifying (e.g., deleting, inserting, translocating, inactivating, activating) a target sequence in a multiplicity of cell types. As such, a targetable nuclease complex of the invention has a broad spectrum of applications in, e.g., biochemical pathway optimization, genome-wide studies, genome engineering, gene therapy, drug screening, disease diagnosis, and prognosis. An exemplary targetable nuclease complex comprises a nucleic acid-guided nuclease as disclosed herein complexed with a guide nucleic acid, wherein the guide sequence of the guide nucleic acid can hybridize to a target sequence within the target polynucleotide. A guide nucleic acid can comprise a guide sequence linked to a scaffold sequence. A scaffold sequence can comprise one or more sequence regions with a degree of complementarity such that together they form a secondary structure. In some cases, the one or more sequence regions are comprised or encoded on the same polynucleotide. In some cases, the one or more sequence regions are comprised or encoded on separate polynucleotides.


Provided herein are methods of cleaving a target polynucleotide. The method comprises cleaving a target polynucleotide using a targetable nuclease complex that binds to a target sequence within a target polynucleotide and effects cleavage of said target polynucleotide. Typically, the targetable nuclease complex of the invention, when introduced into a cell, creates a break (e.g., a single or a double strand break) in the target sequence. For example, the method can be used to cleave a target gene in a cell, or to replace a wildtype sequence with a modified sequence.


The break created by the targetable nuclease complex can be repaired by a repair process such as the error prone non-homologous end joining (NHEJ) pathway, the high fidelity homology-directed repair (HDR) pathway, or by recombination pathways. During these repair processes, an editing template can be introduced into the genome sequence. In some methods, the HDR or recombination process is used to modify a target sequence. For example, an editing template comprising a sequence to be integrated flanked by an upstream sequence and a downstream sequence is introduced into a cell. The upstream and downstream sequences share sequence similarity with either side of the site of integration in the chromosome, target vector, or target polynucleotide.


An editing template can be DNA or RNA, e.g., a DNA plasmid, a bacterial artificial chromosome (BAC), a yeast artificial chromosome (YAC), a viral vector, a linear piece of DNA, a PCR fragment, oligonucleotide, synthetic polynucleotide, a naked nucleic acid, or a nucleic acid complexed with a delivery vehicle such as a liposome or poloxamer.


An editing template polynucleotide can comprise a sequence to be integrated (e.g., a mutated gene). A sequence for integration may be a sequence endogenous or exogenous to the cell. Examples of a sequence to be integrated include polynucleotides encoding a protein or a non-coding RNA (e.g., a microRNA). Thus, the sequence for integration may be operably linked to an appropriate control sequence or sequences. Alternatively, the sequence to be integrated may provide a regulatory function. The sequence to be integrated may be a mutated form or variant of an endogenous wildtype sequence. Alternatively, the sequence to be integrated may be a wildtype version of an endogenous mutated sequence. Additionally or alternatively, the sequence to be integrated may be a variant or mutated form of an endogenous mutated or variant sequence.


Upstream and downstream sequences in an editing template polynucleotide can be selected to promote recombination between the target polynucleotide of interest and the editing template polynucleotide. The upstream sequence can be a nucleic acid sequence having sequence similarity with the sequence upstream of the targeted site for integration. Similarly, the downstream sequence can be a nucleic acid sequence having similarity with the sequence downstream of the targeted site of integration. The upstream and downstream sequences in an editing template can have 75%, 80%, 85%, 90%, 95%, or 100% sequence identity with the targeted polynucleotide. Preferably, the upstream and downstream sequences in the editing template polynucleotide have about 95%, 96%, 97%, 98%, 99%, or 100% sequence identity with the targeted polynucleotide. In some methods, the upstream and downstream sequences in the editing template polynucleotide have about 99% or 100% sequence identity with the targeted polynucleotide.
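

As an illustration of the sequence identity figures above, the following Python sketch computes a simple percent identity between a homology arm and the corresponding genomic sequence. It is a simplification offered under stated assumptions: the two sequences are assumed to be pre-aligned and of equal length with no gaps, and the function names `percent_identity` and `arm_meets_identity` are hypothetical. The source does not prescribe any particular alignment or scoring method.

```python
# Illustrative sketch only: ungapped percent identity between two
# equal-length, pre-aligned sequences (hypothetical helper names).

def percent_identity(arm: str, genomic: str) -> float:
    """Return the percentage of positions at which the two sequences match."""
    if not arm or len(arm) != len(genomic):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(arm.upper(), genomic.upper()))
    return 100.0 * matches / len(arm)


def arm_meets_identity(arm: str, genomic: str, threshold: float = 95.0) -> bool:
    """Check an arm against a chosen identity threshold (e.g. 95%)."""
    return percent_identity(arm, genomic) >= threshold
```

A gapped alignment would be needed for arms that contain insertions or deletions relative to the target; that case is outside this sketch.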


An upstream or downstream sequence may comprise from about 20 bp to about 2500 bp, for example, about 50, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700, 1800, 1900, 2000, 2100, 2200, 2300, 2400, or 2500 bp. In some methods, the exemplary upstream or downstream sequence has about 15 bp to about 50 bp, about 30 bp to about 100 bp, about 200 bp to about 2000 bp, about 600 bp to about 1000 bp, or more particularly about 700 bp to about 1000 bp.
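

The arrangement described above, a sequence to be integrated flanked by upstream and downstream sequences, can be sketched in Python as follows. This is a minimal illustration, not the disclosed design procedure: `build_editing_template` is a hypothetical helper, the arms are copied verbatim from the target (100% identity), and the length check uses the about 20 bp to about 2500 bp range stated above as a hard bound purely for demonstration.

```python
# Illustrative sketch only (hypothetical helper): assemble an editing
# template as upstream arm + sequence to integrate + downstream arm.

MIN_ARM_BP = 20     # lower bound taken from the range stated in the text
MAX_ARM_BP = 2500   # upper bound taken from the range stated in the text


def build_editing_template(target: str, site: int, insert: str,
                           arm_len: int = 50) -> str:
    """Flank `insert` with arms copied from `target` around `site`.

    `site` is a 0-based index; the insert lands between target[site - 1]
    and target[site].
    """
    if not MIN_ARM_BP <= arm_len <= MAX_ARM_BP:
        raise ValueError("arm length outside the 20-2500 bp range")
    if site - arm_len < 0 or site + arm_len > len(target):
        raise ValueError("target too short for this arm length at this site")
    upstream = target[site - arm_len:site]
    downstream = target[site:site + arm_len]
    return upstream + insert + downstream
```

Copying the arms directly from the target is the simplest choice; a real design might also alter a PAM site or add a marker, neither of which is modeled here.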


In some methods, the editing template polynucleotide may further comprise a marker. Such a marker may make it easy to screen for targeted integrations. Examples of suitable markers include restriction sites, fluorescent proteins, or selectable markers. The exogenous polynucleotide template of the invention can be constructed using recombinant techniques (see, for example, Green and Sambrook et al., 2014 and Ausubel et al., 2017).


In an exemplary method for modifying a target polynucleotide by integrating an editing template polynucleotide, a double-stranded break is introduced into the genome sequence by an engineered nuclease complex, and the break can be repaired via homologous recombination using an editing template such that the template is integrated into the target polynucleotide. The presence of a double-stranded break can increase the efficiency of integration of the editing template.


Disclosed herein are methods for modifying expression of a polynucleotide in a cell. Some methods comprise increasing or decreasing expression of a target polynucleotide by using a targetable nuclease complex that binds to the target polynucleotide.


In some methods, a target polynucleotide can be inactivated to effect the modification of the expression in a cell. For example, upon the binding of a targetable nuclease complex to a target sequence in a cell, the target polynucleotide is inactivated such that the sequence is not transcribed, the coded protein is not produced, or the sequence does not function as the wild-type sequence does. For example, a protein or microRNA coding sequence may be inactivated such that the protein is not produced.


In some methods, a control sequence can be inactivated such that it no longer functions as a regulatory sequence. As used herein, “regulatory sequence” can refer to any nucleic acid sequence that affects the transcription, translation, or accessibility of a nucleic acid sequence. Examples of regulatory sequences include a promoter, a transcription terminator, and an enhancer.


An inactivated target sequence may include a deletion mutation (i.e., deletion of one or more nucleotides), an insertion mutation (i.e., insertion of one or more nucleotides), or a nonsense mutation (i.e., substitution of a single nucleotide for another nucleotide such that a stop codon is introduced). In some methods, the inactivation of a target sequence results in “knockout” of the target sequence.
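

The three mutation types named above can be distinguished with a short Python sketch. It is an illustration under simplifying assumptions: both inputs are in-frame DNA coding sequences starting at the first codon, a length difference is read directly as a deletion or insertion, and the function name `classify_inactivating_mutation` is hypothetical.

```python
# Illustrative sketch only (hypothetical helper): classify an edit to a
# coding sequence as a deletion, insertion, or nonsense mutation.

STOP_CODONS = {"TAA", "TAG", "TGA"}


def classify_inactivating_mutation(wild_type: str, edited: str) -> str:
    """Return "none", "deletion", "insertion", "nonsense", or "other"."""
    wt, ed = wild_type.upper(), edited.upper()
    if wt == ed:
        return "none"
    if len(ed) < len(wt):
        return "deletion"
    if len(ed) > len(wt):
        return "insertion"
    # Same length: look for a single substitution that creates a stop codon.
    diffs = [i for i, (a, b) in enumerate(zip(wt, ed)) if a != b]
    if len(diffs) == 1:
        codon_start = diffs[0] - diffs[0] % 3
        old_codon = wt[codon_start:codon_start + 3]
        new_codon = ed[codon_start:codon_start + 3]
        if new_codon in STOP_CODONS and old_codon not in STOP_CODONS:
            return "nonsense"
    return "other"
```

For instance, changing `CAG` to `TAG` in the second codon of `ATGCAGTGGTAA` is reported as a nonsense mutation.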


An altered expression of one or more target polynucleotides associated with a signaling biochemical pathway can be determined by assaying for a difference in the mRNA levels of the corresponding genes between the test model cell and a control cell, when they are contacted with a candidate agent. Alternatively, the differential expression of the sequences associated with a signaling biochemical pathway is determined by detecting a difference in the level of the encoded polypeptide or gene product.


To assay for an agent-induced alteration in the level of mRNA transcripts or corresponding polynucleotides, nucleic acid contained in a sample is first extracted according to standard methods in the art. For instance, mRNA can be isolated using various lytic enzymes or chemical solutions according to the procedures set forth in Green and Sambrook (2014), or extracted by nucleic-acid-binding resins following the accompanying instructions provided by the manufacturers. The mRNA contained in the extracted nucleic acid sample is then detected by amplification procedures or conventional hybridization assays (e.g. Northern blot analysis) according to methods widely known in the art or based on the methods exemplified herein.


For purposes of this invention, amplification means any method employing a primer and a polymerase capable of replicating a target sequence with reasonable fidelity. Amplification may be carried out by natural or recombinant DNA polymerases such as TaqGold™, T7 DNA polymerase, Klenow fragment of E. coli DNA polymerase, and reverse transcriptase. A preferred amplification method is PCR. In particular, the isolated RNA can be subjected to a reverse transcription assay that is coupled with a quantitative polymerase chain reaction (RT-PCR) in order to quantify the expression level of a sequence associated with a signaling biochemical pathway.
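

The text above does not state how expression levels are calculated from quantitative RT-PCR data. One widely used convention, shown here purely as an assumption, is the comparative threshold-cycle (2^-ΔΔCt) calculation, which assumes roughly 100% amplification efficiency and a stably expressed reference gene. The function name `relative_expression` and its parameters are hypothetical.

```python
# Illustrative sketch only: comparative Ct (2^-ddCt) relative quantification.
# Assumes ~100% amplification efficiency; not specified by the source text.

def relative_expression(ct_target_test: float, ct_ref_test: float,
                        ct_target_control: float, ct_ref_control: float) -> float:
    """Return the fold change of the target in test cells versus control cells.

    Each Ct is the threshold cycle measured for the target or reference
    transcript in the test or control sample.
    """
    delta_test = ct_target_test - ct_ref_test            # normalize test sample
    delta_control = ct_target_control - ct_ref_control   # normalize control sample
    delta_delta = delta_test - delta_control
    return 2.0 ** (-delta_delta)
```

A result above 1 indicates higher expression in the test cells than in the control cells, and a result below 1 indicates lower expression.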


Detection of the gene expression level can be conducted in real time in an amplification assay. In one aspect, the amplified products can be directly visualized with fluorescent DNA-binding agents including but not limited to DNA intercalators and DNA groove binders. Because the amount of the intercalators incorporated into the double-stranded DNA molecules is typically proportional to the amount of the amplified DNA products, one can conveniently determine the amount of the amplified products by quantifying the fluorescence of the intercalated dye using conventional optical systems in the art. DNA-binding dyes suitable for this application include SYBR green, SYBR blue, DAPI, propidium iodide, Hoechst, SYBR gold, ethidium bromide, acridines, proflavine, acridine orange, acriflavine, fluorcoumanin, ellipticine, daunomycin, chloroquine, distamycin D, chromomycin, homidium, mithramycin, ruthenium polypyridyls, anthramycin, and the like.


In another aspect, other fluorescent labels such as sequence specific probes can be employed in the amplification reaction to facilitate the detection and quantification of the amplified products. Probe-based quantitative amplification relies on the sequence-specific detection of a desired amplified product. It utilizes fluorescent, target-specific probes (e.g., TaqMan™ probes) resulting in increased specificity and sensitivity. Methods for performing probe-based quantitative amplification are well established in the art and are taught in U.S. Pat. No. 5,210,015.


In yet another aspect, conventional hybridization assays using hybridization probes that share sequence homology with sequences associated with a signaling biochemical pathway can be performed. Typically, probes are allowed to form stable complexes with the sequences associated with a signaling biochemical pathway contained within the biological sample derived from the test subject in a hybridization reaction. It will be appreciated by one of skill in the art that where antisense is used as the probe nucleic acid, the target polynucleotides provided in the sample are chosen to be complementary to sequences of the antisense nucleic acids. Conversely, where the nucleotide probe is a sense nucleic acid, the target polynucleotide is selected to be complementary to sequences of the sense nucleic acid.


Hybridization can be performed under conditions of various stringency, for instance as described herein. Suitable hybridization conditions for the practice of the present invention are such that the recognition interaction between the probe and sequences associated with a signaling biochemical pathway is both sufficiently specific and sufficiently stable. Conditions that increase the stringency of a hybridization reaction are widely known and published in the art. See, for example, Green and Sambrook et al. (2014) and Nonradioactive In Situ Hybridization Application Manual, Boehringer Mannheim, second edition. The hybridization assay can be performed using probes immobilized on any solid support, including but not limited to nitrocellulose, glass, silicon, and a variety of gene arrays. A preferred hybridization assay is conducted on high-density gene chips as described in U.S. Pat. No. 5,445,934.


For convenient detection of the probe-target complexes formed during the hybridization assay, the nucleotide probes are conjugated to a detectable label. Detectable labels suitable for use in the present invention include any composition detectable by photochemical, biochemical, spectroscopic, immunochemical, electrical, optical or chemical means. A wide variety of appropriate detectable labels are known in the art, which include fluorescent or chemiluminescent labels, radioactive isotope labels, and enzymatic or other ligands. In preferred embodiments, one will likely desire to employ a fluorescent label or an enzyme tag, such as digoxigenin, β-galactosidase, urease, alkaline phosphatase or peroxidase, or an avidin/biotin complex.


Detection methods used to detect or quantify the hybridization intensity will typically depend upon the label selected above. For example, radiolabels may be detected using photographic film or a phosphorimager. Fluorescent markers may be detected and quantified using a photodetector to detect emitted light. Enzymatic labels are typically detected by providing the enzyme with a substrate and measuring the reaction product produced by the action of the enzyme on the substrate; and finally, colorimetric labels are detected by simply visualizing the colored label.


An agent-induced change in expression of sequences associated with a signaling biochemical pathway can also be determined by examining the corresponding gene products. Determining the protein level typically involves (a) contacting the protein contained in a biological sample with an agent that specifically binds to a protein associated with a signaling biochemical pathway; and (b) identifying any agent:protein complex so formed. In one aspect of this embodiment, the agent that specifically binds a protein associated with a signaling biochemical pathway is an antibody, preferably a monoclonal antibody.


The reaction can be performed by contacting the agent with a sample of the proteins associated with a signaling biochemical pathway derived from the test samples under conditions that will allow a complex to form between the agent and the proteins associated with a signaling biochemical pathway. The formation of the complex can be detected directly or indirectly according to standard procedures in the art. In the direct detection method, the agents are supplied with a detectable label and unreacted agents may be removed from the complex; the amount of remaining label thereby indicating the amount of complex formed. For such method, it is preferable to select labels that remain attached to the agents even during stringent washing conditions. It is preferable that the label does not interfere with the binding reaction. In the alternative, an indirect detection procedure may use an agent that contains a label introduced either chemically or enzymatically. A desirable label generally does not interfere with binding or the stability of the resulting agent:polypeptide complex. However, the label is typically designed to be accessible to an antibody for an effective binding and hence generating a detectable signal.


A wide variety of labels suitable for detecting protein levels are known in the art. Non-limiting examples include radioisotopes, enzymes, colloidal metals, fluorescent compounds, bioluminescent compounds, and chemiluminescent compounds.


The amount of agent:polypeptide complexes formed during the binding reaction can be quantified by standard quantitative assays. As illustrated above, the formation of agent:polypeptide complex can be measured directly by the amount of label remaining at the site of binding. In an alternative, the protein associated with a signaling biochemical pathway is tested for its ability to compete with a labeled analog for binding sites on the specific agent. In this competitive assay, the amount of label captured is inversely proportional to the amount of protein sequences associated with a signaling biochemical pathway present in a test sample.


A number of techniques for protein analysis based on the general principles outlined above are available in the art. They include but are not limited to radioimmunoassays, ELISA (enzyme-linked immunosorbent assays), “sandwich” immunoassays, immunoradiometric assays, in situ immunoassays (using e.g., colloidal gold, enzyme or radioisotope labels), western blot analysis, immunoprecipitation assays, immunofluorescent assays, and SDS-PAGE.


Antibodies that specifically recognize or bind to proteins associated with a signaling biochemical pathway are preferable for conducting the aforementioned protein analyses. Where desired, antibodies that recognize a specific type of post-translational modifications (e.g., signaling biochemical pathway inducible modifications) can be used. Post-translational modifications include but are not limited to glycosylation, lipidation, acetylation, and phosphorylation. These antibodies may be purchased from commercial vendors. For example, anti-phosphotyrosine antibodies that specifically recognize tyrosine-phosphorylated proteins are available from a number of vendors including Invitrogen and Perkin Elmer. Anti-phosphotyrosine antibodies are particularly useful in detecting proteins that are differentially phosphorylated on their tyrosine residues in response to ER stress. Such proteins include but are not limited to eukaryotic translation initiation factor 2 alpha (eIF-2α). Alternatively, these antibodies can be generated using conventional polyclonal or monoclonal antibody technologies by immunizing a host animal or an antibody-producing cell with a target protein that exhibits the desired post-translational modification.


In practicing a subject method, it may be desirable to discern the expression pattern of a protein associated with a signaling biochemical pathway in different bodily tissues, in different cell types, and/or in different subcellular structures. These studies can be performed with the use of tissue-specific, cell-specific or subcellular structure specific antibodies capable of binding to protein markers that are preferentially expressed in certain tissues, cell types, or subcellular structures.


An altered expression of a gene associated with a signaling biochemical pathway can also be determined by examining a change in activity of the gene product relative to a control cell. The assay for an agent-induced change in the activity of a protein associated with a signaling biochemical pathway will depend on the biological activity and/or the signal transduction pathway that is under investigation. For example, where the protein is a kinase, a change in its ability to phosphorylate the downstream substrate(s) can be determined by a variety of assays known in the art. Representative assays include but are not limited to immunoblotting and immunoprecipitation with antibodies such as anti-phosphotyrosine antibodies that recognize phosphorylated proteins. In addition, kinase activity can be detected by high throughput chemiluminescent assays such as AlphaScreen™ (available from Perkin Elmer) and eTag™ assay (Chan-Hui, et al. (2003) Clinical Immunology 111: 162-174).


Where the protein associated with a signaling biochemical pathway is part of a signaling cascade leading to a fluctuation of intracellular pH condition, pH sensitive molecules such as fluorescent pH dyes can be used as the reporter molecules. In another example where the protein associated with a signaling biochemical pathway is an ion channel, fluctuations in membrane potential and/or intracellular ion concentration can be monitored. A number of commercial kits and high-throughput devices are particularly suited for a rapid and robust screening for modulators of ion channels. Representative instruments include FLIPR™ (Molecular Devices, Inc.) and VIPR (Aurora Biosciences). These instruments are capable of detecting reactions in over 1000 sample wells of a microplate simultaneously, and providing real-time measurement and functional data within a second or even a millisecond.


In practicing any of the methods disclosed herein, a suitable vector can be introduced to a cell, tissue, organism, or an embryo via one or more methods known in the art, including without limitation, microinjection, electroporation, sonoporation, biolistics, calcium phosphate-mediated transfection, cationic transfection, liposome transfection, dendrimer transfection, heat shock transfection, nucleofection transfection, magnetofection, lipofection, impalefection, optical transfection, proprietary agent-enhanced uptake of nucleic acids, and delivery via liposomes, immunoliposomes, virosomes, or artificial virions. In some methods, the vector is introduced into an embryo by microinjection. The vector or vectors may be microinjected into the nucleus or the cytoplasm of the embryo. In some methods, the vector or vectors may be introduced into a cell by nucleofection.


A target polynucleotide of a targetable nuclease complex can be any polynucleotide endogenous or exogenous to the host cell. For example, the target polynucleotide can be a polynucleotide residing in the nucleus of the eukaryotic cell, the genome of a prokaryotic cell, or an extrachromosomal vector of a host cell. The target polynucleotide can be a sequence coding a gene product (e.g., a protein) or a non-coding sequence (e.g., a regulatory polynucleotide or a junk DNA).


Examples of target polynucleotides include a sequence associated with a signaling biochemical pathway, e.g., a signaling biochemical pathway-associated gene or polynucleotide. Examples of target polynucleotides include a disease-associated gene or polynucleotide. A “disease-associated” gene or polynucleotide refers to any gene or polynucleotide which is yielding transcription or translation products at an abnormal level or in an abnormal form in cells derived from disease-affected tissues compared with tissues or cells of a non-disease control. It may be a gene that becomes expressed at an abnormally high level; it may be a gene that becomes expressed at an abnormally low level, where the altered expression correlates with the occurrence and/or progression of the disease. A disease-associated gene also refers to a gene possessing mutation(s) or genetic variation that is directly responsible for, or is in linkage disequilibrium with a gene(s) that is responsible for, the etiology of a disease. The transcribed or translated products may be known or unknown, and may be at a normal or abnormal level.


Embodiments of the invention also relate to methods and compositions related to knocking out genes, editing genes, altering genes, amplifying genes, and repairing particular mutations. Altering genes may also mean the epigenetic manipulation of a target sequence. This may be the chromatin state of a target sequence, such as by modification of the methylation state of the target sequence (i.e. addition or removal of methylation or methylation patterns or CpG islands), histone modification, increasing or reducing accessibility to the target sequence, or by promoting 3D folding. It will be appreciated that where reference is made to a method of modifying a cell, organism, or mammal including human or a non-human mammal or organism by manipulation of a target sequence in a genomic locus of interest, this may apply to the organism (or mammal) as a whole or just a single cell or population of cells from that organism (if the organism is multicellular). In the case of humans, for instance, Applicants envisage, inter alia, a single cell or a population of cells and these may preferably be modified ex vivo and then re-introduced. In this case, a biopsy or other tissue or biological fluid sample may be necessary. Stem cells are also particularly preferred in this regard. But, of course, in vivo embodiments are also envisaged. And the invention is especially advantageous as to HSCs.


The functionality of a targetable nuclease complex can be assessed by any suitable assay. For example, the components of a targetable nuclease system sufficient to form a targetable nuclease complex, including a guide nucleic acid and nucleic acid-guided nuclease, can be provided to a host cell having the corresponding target sequence, such as by transfection with vectors encoding the components of the engineered nuclease system, followed by an assessment of preferential cleavage within the target sequence. Similarly, cleavage of a target sequence may be evaluated in a test tube by providing the target sequence and components of a targetable nuclease complex. Other assays are possible, and will occur to those skilled in the art. A guide sequence can be selected to target any target sequence. In some embodiments, the target sequence is a sequence within a genome of a cell. Exemplary target sequences include those that are unique in the target genome.


Editing Cassette


Disclosed herein are compositions and methods for editing a target polynucleotide sequence. Such compositions include polynucleotides containing one or more components of a targetable nuclease system. Polynucleotide sequences for use in these methods can be referred to as editing cassettes.


An editing cassette can comprise one or more primer sites. Primer sites can be used to amplify an editing cassette by using oligonucleotide primers comprising reverse complementary sequences that can hybridize to the one or more primer sites. An editing cassette can comprise two or more primer sites. Sometimes, an editing cassette comprises a primer site on each end of the editing cassette, said primer sites flanking one or more of the other components of the editing cassette. Primer sites can be approximately 10, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26 or more nucleotides in length.
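
As a minimal sketch of the primer-site arrangement described above, the following Python derives a primer pair for a cassette that carries a primer site on each end. The function names, the 20-nucleotide default site length, and the toy cassette are illustrative assumptions and are not part of the disclosure.

```python
# Illustrative sketch only; names and the default site length are assumptions.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def primers_for_cassette(cassette: str, site_len: int = 20) -> tuple[str, str]:
    """Return (forward, reverse) primers for a cassette flanked by primer sites.

    The first and last `site_len` nucleotides of the top strand are treated
    as the primer sites. Both primers are written 5' to 3'.
    """
    if site_len <= 0 or len(cassette) < 2 * site_len:
        raise ValueError("cassette is too short for two primer sites")
    forward = cassette[:site_len]                       # anneals to the bottom strand
    reverse = reverse_complement(cassette[-site_len:])  # anneals to the top strand
    return forward, reverse


if __name__ == "__main__":
    # Hypothetical cassette: 10-nt site, payload, 10-nt site.
    toy = "ACGTACGTAC" + "TTTTGGGGCCCCAAAA" + "GATTACAGAT"
    print(primers_for_cassette(toy, site_len=10))
```

The reverse primer is the reverse complement of the 3′ primer site, which is what allows it to hybridize to the top strand and extend back across the cassette.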


An editing cassette can comprise an editing template as disclosed herein. An editing cassette can comprise an editing sequence. An editing sequence can be homologous to a target sequence. An editing sequence can comprise at least one mutation relative to a target sequence. An editing sequence often comprises homology regions (or homology arms) flanking at least one mutation relative to a target sequence, such that the flanking homology regions facilitate homologous recombination of the editing sequence into a target sequence. An editing sequence can comprise an editing template as disclosed herein. For example, the editing sequence can comprise at least one mutation relative to a target sequence including one or more PAM mutations that mutate or delete a PAM site. An editing sequence can comprise one or more mutations in a codon or non-coding sequence relative to a non-editing target site.


A PAM mutation can be a silent mutation. A silent mutation can be a change to at least one nucleotide of a codon relative to the original codon that does not change the amino acid encoded by the original codon. A silent mutation can be a change to a nucleotide within a non-coding region, such as an intron, 5′ untranslated region, 3′ untranslated region, or other non-coding region.


A PAM mutation can be a non-silent mutation. Non-silent mutations can include a missense mutation. A missense mutation can be a change to at least one nucleotide of a codon relative to the original codon that changes the amino acid encoded by the original codon. Missense mutations can occur within an exon, open reading frame, or other coding region.
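
The distinction between silent and missense codon changes drawn above can be sketched in Python as follows. The sketch assumes the standard genetic code and DNA codons; the function name and the returned labels are hypothetical, and stop-codon changes (which the text does not discuss) are simply reported as a separate non-silent category.

```python
from itertools import product

# Standard genetic code, enumerated with bases in T, C, A, G order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def classify_codon_change(original: str, mutated: str) -> str:
    """Classify a codon substitution as silent, missense, or other non-silent."""
    original, mutated = original.upper(), mutated.upper()
    if original == mutated:
        raise ValueError("codons are identical; there is no mutation to classify")
    before, after = CODON_TABLE[original], CODON_TABLE[mutated]
    if before == after:
        return "silent"      # encoded amino acid is unchanged
    if "*" in (before, after):
        return "non-silent (stop codon gained or lost)"
    return "missense"        # encoded amino acid changes


if __name__ == "__main__":
    print(classify_codon_change("CTG", "CTA"))  # Leu to Leu
    print(classify_codon_change("GAA", "GTA"))  # Glu to Val
```

A check of this kind only applies to mutations inside a coding region; a PAM mutation in a non-coding region is treated as silent in the text without reference to any codon.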


An editing sequence can comprise at least one mutation relative to a target sequence. A mutation can be a silent mutation or non-silent mutation, such as a missense mutation. A mutation can include an insertion of one or more nucleotides or base pairs. A mutation can include a deletion of one or more nucleotides or base pairs. A mutation can include a substitution of one or more nucleotides or base pairs for a different one or more nucleotides or base pairs. Inserted or substituted sequences can include exogenous or heterologous sequences.


An editing cassette can comprise a polynucleotide encoding a guide nucleic acid sequence. In some cases, the guide nucleic acid sequence is optionally operably linked to a promoter. A guide nucleic acid sequence can comprise a scaffold sequence and a guide sequence as described herein.


An editing cassette can comprise a barcode. A barcode can be a unique DNA sequence that corresponds to the editing sequence such that the barcode can identify the one or more mutations of the corresponding editing sequence. In some examples, the barcode is 15 nucleotides. The barcode can comprise less than 10, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 88, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, or more than 200 nucleotides. A barcode can be a non-naturally occurring sequence. An editing cassette comprising a barcode can be a non-naturally occurring sequence.


An editing cassette can comprise one or more of an editing sequence and a polynucleotide encoding a guide nucleic acid optionally operably linked to a promoter, wherein the editing cassette and guide nucleic acid sequence are flanked by primer sites. An editing cassette can further comprise a barcode.
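
One way to picture the cassette layout described above is as an ordered concatenation of its components. The sketch below is only an illustration: the class name, field names, and in particular the order in which the internal elements are joined are assumptions, since the text fixes only that primer sites flank the other components.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EditingCassette:
    """Hypothetical container for the components of an editing cassette."""

    primer_site_5: str
    editing_sequence: str
    guide_encoding: str      # polynucleotide encoding the guide nucleic acid
    primer_site_3: str
    promoter: str = ""       # optional; operably linked to the guide if present
    barcode: str = ""        # optional

    def assemble(self) -> str:
        """Join the components, with primer sites on the outside (assumed order)."""
        inner = self.editing_sequence + self.promoter + self.guide_encoding + self.barcode
        return self.primer_site_5 + inner + self.primer_site_3


if __name__ == "__main__":
    cassette = EditingCassette(
        primer_site_5="ACGTACGTAC",
        editing_sequence="TTTTGGGGCCCC",
        guide_encoding="GATTACAGATTACA",
        primer_site_3="CATGCATGCA",
        barcode="AACCGGTTAACCGGT",
    )
    print(cassette.assemble())
```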


An example of an editing cassette is depicted in FIG. 3. Each editing cassette can be designed to edit a site in a target sequence. Sites to be targeted can be coding regions, non-coding regions, functionally neutral sites, or they can be a screenable or selectable marker gene. Homology regions within the editing sequence flank the one or more mutations of the editing cassette and can be inserted into the target sequence by recombination. Recombination can comprise DNA cleavage, such as by a nucleic acid-guided nuclease, and repair via homologous recombination.


Editing cassettes can be generated by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof.


Trackable sequences, such as barcodes or recorder sequences, can be designed in silico via standard code with a degenerate mutation at the target codon. The degenerate mutation can comprise 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, or more than 30 nucleic acid residues. In some examples, the degenerate mutations can comprise 15 nucleic acid residues (N15).
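
As a rough illustration of in silico barcode design, the sketch below draws unique 15-nucleotide sequences, i.e. individual instances of a degenerate N15 stretch. Uniform random sampling, the function name, and the seed parameter are assumptions made for the example; the text does not specify how individual barcodes are chosen.

```python
import random


def generate_barcodes(count: int, length: int = 15, seed=None) -> list[str]:
    """Return `count` distinct random DNA barcodes of the given length."""
    if count < 0 or length <= 0:
        raise ValueError("count must be non-negative and length positive")
    if count > 4 ** length:
        raise ValueError("more barcodes requested than distinct sequences exist")
    rng = random.Random(seed)  # seeded for reproducible designs
    seen: set[str] = set()
    barcodes: list[str] = []
    while len(barcodes) < count:
        candidate = "".join(rng.choice("ACGT") for _ in range(length))
        if candidate not in seen:  # enforce uniqueness
            seen.add(candidate)
            barcodes.append(candidate)
    return barcodes


if __name__ == "__main__":
    for barcode in generate_barcodes(5, seed=0):
        print(barcode)
```

A practical design would likely add filters this sketch omits, for example rejecting homopolymer runs or barcodes that differ from one another at too few positions.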


Homology arms can be added to an editing sequence to allow incorporation of the editing sequence into the desired location via homologous recombination or homology-driven repair. Homology arms can be added by synthesis, in vitro assembly, PCR, or other known methods in the art. For example, homology arms can be added by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof. A homology arm can be added to both ends of a barcode, recorder sequence, and/or editing sequence, thereby flanking the sequence with two distinct homology arms, for example, a 5′ homology arm and a 3′ homology arm.


A homology arm can comprise sequence homologous to a target sequence. A homology arm can comprise sequence homologous to sequence adjacent to a target sequence. A homology arm can comprise sequence homologous to sequence upstream or downstream of a target sequence. A homology arm can comprise sequence homologous to sequence within the same gene or open reading frame as a target sequence. A homology arm can comprise sequence homologous to sequence upstream or downstream of a gene or open reading frame the target sequence is within. A homology arm can comprise sequence homologous to a 5′ UTR or 3′ UTR of a gene or open reading frame within which is a target sequence. A homology arm can comprise sequence homologous to a different gene, open reading frame, promoter, terminator, or nucleic acid sequence than that which the target sequence is within.


The same 5′ and 3′ homology arms can be added to a plurality of distinct editing sequences, thereby generating a library of unique editing sequences that each have the same targeted insertion site. The same 5′ and 3′ homology arms can be added to a plurality of distinct editing templates, thereby generating a library of unique editing templates that each have the same targeted insertion site. In alternative examples, different or a variety of 5′ or 3′ homology arms can be added to a plurality of editing sequences or editing templates.
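
The flanking of editing sequences with a shared pair of homology arms, as described above, can be sketched as simple string operations. The function names, the 0-based coordinates, and the fixed arm length are assumptions for illustration only.

```python
def arms_from_reference(reference: str, start: int, end: int, arm_len: int) -> tuple[str, str]:
    """Slice the 5' and 3' arms flanking reference[start:end] (end exclusive)."""
    if not 0 <= start <= end <= len(reference):
        raise ValueError("target coordinates fall outside the reference")
    if start - arm_len < 0 or end + arm_len > len(reference):
        raise ValueError("reference is too short for arms of this length")
    return reference[start - arm_len:start], reference[end:end + arm_len]


def add_homology_arms(insert: str, arm_5: str, arm_3: str) -> str:
    """Flank one editing sequence with a 5' and a 3' homology arm."""
    return arm_5 + insert + arm_3


def build_library(inserts: list[str], arm_5: str, arm_3: str) -> list[str]:
    """Give every insert the same arms, so all share one insertion site."""
    return [add_homology_arms(insert, arm_5, arm_3) for insert in inserts]


if __name__ == "__main__":
    reference = "A" * 10 + "CCC" + "G" * 10   # toy target: replace the CCC codon
    arm_5, arm_3 = arms_from_reference(reference, start=10, end=13, arm_len=5)
    print(build_library(["TTT", "TAT"], arm_5, arm_3))
```

Using a different pair of arms per insert, the alternative mentioned in the text, would amount to calling `add_homology_arms` with arms sliced from a different site for each editing sequence.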


A barcode library or recorder sequence library comprising flanking homology arms can be cloned into a vector backbone. In some examples, the barcodes comprising flanking homology arms are cloned into an editing cassette. Cloning can occur by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof.


An editing sequence library comprising flanking homology arms can be cloned into a vector backbone. In some examples, the editing sequence and homology arms are cloned into an editing cassette. Editing cassettes can, in some cases, further comprise a nucleic acid sequence encoding a guide nucleic acid or gRNA engineered to target the desired site of editing sequence insertion, e.g. the target sequence. Editing cassettes can, in some cases, further comprise a barcode or recorder sequence. Cloning can occur by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof.


Gene-wide or genome-wide editing libraries can be cloned into a vector backbone. A barcode or recorder sequence library can be inserted or assembled into a second site to generate competent trackable plasmids that can embed the recording barcode at a fixed locus while integrating the editing libraries at a wide variety of user defined sites. Cloning can occur by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof.


A guide nucleic acid or sequence encoding the same can be assembled or inserted into a vector backbone first, followed by insertion of an editing sequence and/or cassette. In other cases, an editing sequence and/or cassette can be inserted or assembled into a vector backbone first, followed by insertion of a guide nucleic acid or sequence encoding the same. In other cases, a guide nucleic acid or sequence encoding the same and an editing sequence and/or cassette are simultaneously inserted or assembled into a vector. A recorder sequence or barcode can be inserted before or after any of these steps. In other words, it should be understood that there are many possible permutations to the order in which elements of the disclosure are assembled. The vector can be linear or circular and can be generated by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, overlapping oligo extension, in vitro assembly, in vitro oligo assembly, PCR, traditional ligation-based cloning, other known methods in the art, or any combination thereof.


A nucleic acid molecule can be synthesized which comprises one or more elements disclosed herein. A nucleic acid molecule can be synthesized that comprises an editing cassette. A nucleic acid molecule can be synthesized that comprises a guide nucleic acid. A nucleic acid molecule can be synthesized that comprises a recorder cassette. A nucleic acid molecule can be synthesized that comprises a barcode. A nucleic acid molecule can be synthesized that comprises a homology arm. A nucleic acid molecule can be synthesized that comprises an editing cassette and a guide nucleic acid. A nucleic acid molecule can be synthesized that comprises an editing cassette and a barcode. A nucleic acid molecule can be synthesized that comprises an editing cassette, a guide nucleic acid, and a recorder cassette. A nucleic acid molecule can be synthesized that comprises an editing cassette, a recorder cassette, and two guide nucleic acids. A nucleic acid molecule can be synthesized that comprises a recorder cassette and a guide nucleic acid. In any of these cases, the guide nucleic acid can optionally be operably linked to a promoter. In any of these cases, the nucleic acid molecule can further include one or more barcodes.


Synthesis can occur by any nucleic acid synthesis method known in the art. Synthesis can occur by enzymatic nucleic acid synthesis. Synthesis can occur by chemical synthesis. Synthesis can occur by array-based synthesis. Synthesis can occur by solid-phase synthesis or phosphoramidite methods. Synthesis can occur by column or multi-well methods. Synthesized nucleic acid molecules can be non-naturally occurring nucleic acid molecules.


Software and automation methods can be used for multiplex synthesis and generation. For example, software and automation can be used to create 10, 10², 10³, 10⁴, 10⁵, 10⁶, or more synthesized polynucleotides, cassettes, or plasmids. An automation method can generate desired sequences and libraries in rapid fashion that can be processed through a workflow with minimal steps to produce precisely defined libraries, such as gene-wide or genome-wide editing libraries.


Polynucleotides or libraries can be generated which comprise two or more nucleic acid molecules or plasmids comprising any combination disclosed herein of recorder sequence, editing sequence, guide nucleic acid, and optional barcode, including combinations of one or more of any of the previously mentioned elements. For example, such a library can comprise at least 2, 3, 4, 5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 200, 300, 400, 500, 600, 700, 800, 1000, 1500, 2000, 2500, 3000, 3500, 4000, 4500, 5000, 5500, 6000, 6500, 7000, 7500, 8000, 8500, 9000, 9500, 10⁴, 10⁵, 10⁶, 10⁷, 10⁸, 10⁹, 10¹⁰, or more nucleic acid molecules or plasmids of the present disclosure. It should be understood that such a library can include any number of nucleic acid molecules or plasmids, even if the specific number is not explicitly listed above.


Trackable plasmid libraries or nucleic acid molecule libraries can be sequenced in order to determine the recorder sequence and editing sequence pair that is comprised on each trackable plasmid. In other cases, a known recorder sequence is paired with a known editing sequence during the library generation process. Other methods of determining the association between a recorder sequence and editing sequence comprised on a common nucleic acid molecule or plasmid are envisioned such that the editing sequence can be identified by identification or sequencing of the recorder sequence.
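
The recorder-to-edit association described above can be sketched as a lookup table built from sequenced plasmids and then used to infer edits from recorder reads alone. The function names and the toy sequences are hypothetical; real reads would first need to be parsed and error-corrected, which this sketch does not attempt.

```python
from collections import Counter


def build_recorder_map(pairs):
    """Map each recorder sequence to its editing sequence.

    `pairs` is an iterable of (recorder, editing_sequence) tuples, e.g. taken
    from sequencing each plasmid. A recorder seen with two different edits is
    rejected, because it could no longer identify a single edit.
    """
    mapping = {}
    for recorder, edit in pairs:
        if recorder in mapping and mapping[recorder] != edit:
            raise ValueError(f"recorder {recorder!r} maps to more than one edit")
        mapping[recorder] = edit
    return mapping


def tally_edits(observed_recorders, recorder_map):
    """Count edits inferred from recorder reads; unknown recorders are skipped."""
    counts = Counter()
    for recorder in observed_recorders:
        edit = recorder_map.get(recorder)
        if edit is not None:
            counts[edit] += 1
    return counts


if __name__ == "__main__":
    recorder_map = build_recorder_map([("AAAC", "edit_1"), ("GGGT", "edit_2")])
    print(tally_edits(["AAAC", "GGGT", "AAAC", "TTTT"], recorder_map))
```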


Methods and compositions for tracking edited episomal libraries that are shuttled between E. coli and other organisms/cell lines are provided herein. The libraries can be comprised on plasmids, bacterial artificial chromosomes (BACs), yeast artificial chromosomes (YACs), synthetic chromosomes, or viral or phage genomes. These methods and compositions can be used to generate portable barcoded libraries in host organisms, such as E. coli. Library generation in such organisms can offer the advantage of established techniques for performing homologous recombination. Barcoded plasmid libraries can be deep-sequenced at one site to track mutational diversity targeted across the remaining portions of the plasmid, allowing dramatic improvements in the depth of library coverage.


Any nucleic acid molecule disclosed herein can be an isolated nucleic acid. Isolated nucleic acids may be made by any method known in the art, for example using standard recombinant methods, assembly methods, synthesis techniques, or combinations thereof. In some embodiments, the nucleic acids may be cloned, amplified, assembled, or otherwise constructed.


Isolated nucleic acids may be obtained from cellular, bacterial, or other sources using any number of cloning methodologies known in the art. In some embodiments, oligonucleotide probes which selectively hybridize, under stringent conditions, to other oligonucleotides or to the nucleic acids of an organism or cell can be used to isolate or identify an isolated nucleic acid.


Cellular genomic DNA, RNA, or cDNA may be screened for the presence of an identified genetic element of interest using a probe based upon one or more sequences. Various degrees of stringency of hybridization may be employed in the assay.


High stringency conditions for nucleic acid hybridization are well known in the art. For example, conditions may comprise low salt and/or high temperature conditions, such as provided by about 0.02 M to about 0.15 M NaCl at temperatures of about 50° C. to about 70° C. It is understood that the temperature and ionic strength of a desired stringency are determined in part by the length of the particular nucleic acid(s), the length and nucleotide content of the target sequence(s), the charge composition of the nucleic acid(s), and by the presence or concentration of formamide, tetramethylammonium chloride or other solvent(s) in a hybridization mixture. Nucleic acids may be completely complementary to a target sequence or may exhibit one or more mismatches.
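
To make the salt dependence concrete, the sketch below uses a commonly quoted empirical approximation for the melting temperature of a DNA duplex, Tm ≈ 81.5 + 16.6·log10[Na+] + 0.41·(%GC) − 675/N. This formula is an assumption brought in for illustration and is not taken from the present disclosure; it ignores formamide, mismatches, and the other factors listed above, and it is generally considered unreliable for very short oligonucleotides.

```python
import math


def estimate_tm_celsius(seq: str, sodium_molar: float) -> float:
    """Rough duplex melting temperature from an empirical approximation.

    Tm = 81.5 + 16.6*log10([Na+]) + 0.41*(%GC) - 675/N
    Assumed formula, for illustration only.
    """
    n = len(seq)
    if n == 0 or sodium_molar <= 0:
        raise ValueError("need a non-empty sequence and a positive salt concentration")
    gc_percent = 100.0 * sum(seq.upper().count(base) for base in "GC") / n
    return 81.5 + 16.6 * math.log10(sodium_molar) + 0.41 * gc_percent - 675.0 / n


if __name__ == "__main__":
    probe = "ATGCGTACGTTAGCCTAGGCTAACGTAGCTAGGCTA"  # hypothetical probe
    for salt in (0.15, 0.02):  # the NaCl range mentioned in the text
        print(f"{salt} M Na+: about {estimate_tm_celsius(probe, salt):.1f} C")
```

Under this approximation, lowering the salt concentration lowers the estimated melting temperature, so a fixed wash temperature sits closer to it and fewer mismatched duplexes survive, which is the sense in which low salt and high temperature raise stringency.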


Nucleic acids of interest may also be amplified using a variety of known amplification techniques. For instance, polymerase chain reaction (PCR) technology may be used to amplify target sequences directly from DNA, RNA, or cDNA. PCR and other in vitro amplification methods may also be useful, for example, to clone nucleic acid sequences, to make nucleic acids to use as probes for detecting the presence of a target nucleic acid in samples, for nucleic acid sequencing, or for other purposes.


Isolated nucleic acids may be prepared by direct chemical synthesis by methods such as the phosphotriester method, or using an automated synthesizer. Chemical synthesis generally produces a single stranded oligonucleotide. This may be converted into double stranded DNA by hybridization with a complementary sequence or by polymerization with a DNA polymerase using the single strand as a template.


Recorder


In some examples, two editing cassettes can be used together to track a genetic engineering step. For example, one editing cassette can comprise an editing template and an encoded guide nucleic acid, and a second editing cassette, referred to as a recorder cassette, can comprise an editing template comprising a recorder sequence and an encoded guide nucleic acid which has a distinct guide sequence compared to that of the first editing cassette. In such cases, the editing sequence and the recorder sequence can be inserted into separate target sequences, as determined by their corresponding guide nucleic acids. A recorder sequence can comprise a barcode, trackable or traceable sequence, and/or a regulatory element operable with a screenable or selectable marker.


Through a multiplexed cloning approach, the recorder cassette can be covalently coupled to at least one editing cassette in a plasmid (e.g., FIG. 17A, green cassette) to generate plasmid libraries that have a unique recorder and editing cassette combination. This library can be sequenced to generate the recorder/edit mapping and used to track editing libraries across large segments of the target DNA (e.g., FIG. 17C). Recorder and editing sequences can be comprised on the same cassette, in which case they are both incorporated into the target nucleic acid sequence, such as a genome or plasmid, by the same recombination event. In other examples, the recorder and editing sequences can be comprised on separate cassettes within the same plasmid, in which case the recorder and editing sequences are incorporated into the target nucleic acid sequence by separate recombination events, either simultaneously or sequentially.


Methods are provided herein for combining multiplex oligonucleotide synthesis with recombineering, to create libraries of specifically designed and trackable mutations. Screens and/or selections followed by high-throughput sequencing and/or barcode microarray methods can allow for rapid mapping of mutations leading to a phenotype of interest.


Methods and compositions disclosed herein can be used to simultaneously engineer and track engineering events in a target nucleic acid sequence.


Such plasmids can be generated using in vitro assembly or cloning techniques. For example, the plasmids can be generated using chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, other in vitro oligo assembly techniques, traditional ligation-based cloning, or any combination thereof.


Such plasmids can comprise at least one recording sequence, such as a barcode, and at least one editing sequence. In most cases, the recording sequence is used to record and track engineering events. Each editing sequence can be used to incorporate a desired edit into a target nucleic acid sequence. The desired edit can include insertion, deletion, substitution, or alteration of the target nucleic acid sequence. In some examples, the one or more recording sequence and editing sequences are comprised on a single cassette comprised within the plasmid such that they are incorporated into the target nucleic acid sequence by the same engineering event. In other examples, the recording and editing sequences are comprised on separate cassettes within the plasmid such that they are each incorporated into the target nucleic acid by distinct engineering events. In some examples, the plasmid comprises two or more editing sequences. For example, one editing sequence can be used to alter or silence a PAM sequence while a second editing sequence can be used to incorporate a mutation into a distinct sequence.


Recorder sequences can be inserted into a site separated from the editing sequence insertion site. The inserted recorder sequence can be separated from the editing sequence by 1 bp to 1 Mbp. For example, the separation distance can be about 1 bp, 10 bp, 50 bp, 100 bp, 500 bp, 1 kb, 2 kb, 5 kb, 10 kb, or greater. The separation distance can be any discrete integer between 1 bp and 10 Mbp. In some examples, the maximum distance of separation depends on the size of the target nucleic acid or genome.


Recorder sequences can be inserted adjacent to editing sequences, or within proximity to the editing sequence. For example, the recorder sequence can be inserted outside of the open reading frame within which the editing sequence is inserted. The recorder sequence can be inserted into an untranslated region adjacent to an open reading frame within which an editing sequence has been inserted. The recorder sequence can be inserted into a functionally neutral or non-functional site. The recorder sequence can be inserted into a screenable or selectable marker gene.


In some examples, the target nucleic acid sequence is comprised within a genome, artificial chromosome, synthetic chromosome, or episomal plasmid. In various examples, the target nucleic acid sequence can be in vitro or in vivo. When the target nucleic acid sequence is in vivo, the plasmid can be introduced into the host organisms by transformation, transfection, conjugation, biolistics, nanoparticles, cell-permeable technologies, or other known methods for DNA delivery, or any combination thereof. In such examples, the host organism can be a eukaryote, prokaryote, bacterium, archaea, yeast, or other fungi.


The engineering event can comprise recombineering, non-homologous end joining, homologous recombination, or homology-driven repair. In some examples, the engineering event is performed in vitro or in vivo.


The methods described herein can be carried out in any type of cell in which a targetable nuclease system can function (e.g., target and cleave DNA), including prokaryotic and eukaryotic cells. In some embodiments the cell is a bacterial cell, such as Escherichia spp. (e.g., E. coli). In other embodiments, the cell is a fungal cell, such as a yeast cell, e.g., Saccharomyces spp. In other embodiments, the cell is an algal cell, a plant cell, an insect cell, or a mammalian cell, including a human cell.


In some examples, the cell is a recombinant organism. For example, the cell can comprise a non-native targetable nuclease system. Additionally or alternatively, the cell can comprise recombination system machinery. Such recombination systems can include lambda red recombination system, Cre/Lox, attB/attP, or other integrase systems. Where appropriate, the plasmid can have the complementary components or machinery required for the selected recombination system to work correctly and efficiently.


A method for genome editing can comprise: (a) introducing a vector that encodes at least one editing cassette and at least one guide nucleic acid into a first population of cells, thereby producing a second population of cells comprising the vector; (b) maintaining the second population of cells under conditions in which a nucleic acid-guided nuclease is expressed or maintained, wherein the nucleic acid-guided nuclease is encoded on the vector, a second vector, on the genome of cells of the second population of cells, or otherwise introduced into the cell, resulting in DNA cleavage and incorporation of the editing cassette; (c) obtaining viable cells; and (d) sequencing the target DNA molecule in at least one cell of the second population of cells to identify the mutation of at least one codon.


A method for genome editing can comprise: (a) introducing a vector that encodes at least one editing cassette comprising a PAM mutation as disclosed herein and at least one guide nucleic acid into a first population of cells, thereby producing a second population of cells comprising the vector; (b) maintaining the second population of cells under conditions in which a nucleic acid-guided nuclease is expressed or maintained, wherein the nucleic acid-guided nuclease is encoded on the vector, a second vector, on the genome of cells of the second population of cells, or otherwise introduced into the cell, resulting in DNA cleavage, incorporation of the editing cassette, and death of cells of the second population of cells that do not comprise the PAM mutation, whereas cells of the second population of cells that comprise the PAM mutation are viable; (c) obtaining viable cells; and (d) sequencing the target DNA in at least one cell of the second population of cells to identify the mutation of at least one codon.
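The selection step in the PAM-mutation method above can be illustrated with a toy model. This is a minimal sketch rather than the disclosed procedure: the PAM `TTTA`, the 5' PAM orientation, the exact-match cleavage rule, and the names `is_cleavable` and `select_viable` are all assumptions made for illustration.

```python
def is_cleavable(target_site: str, pam: str, protospacer: str) -> bool:
    """Toy rule: the site is cut only if an intact PAM directly precedes the protospacer."""
    return (pam + protospacer) in target_site


def select_viable(cells: dict, pam: str, protospacer: str) -> list:
    """Return the names of cells whose target site escapes cleavage."""
    return [name for name, site in cells.items()
            if not is_cleavable(site, pam, protospacer)]


PAM = "TTTA"                          # hypothetical PAM, for illustration only
PROTOSPACER = "ACGTACGTACGTACGTACGT"  # hypothetical 20-nt protospacer
cells = {
    "unedited": "GG" + "TTTA" + PROTOSPACER + "CC",  # intact PAM: cleaved, cell dies
    "edited": "GG" + "TCTA" + PROTOSPACER + "CC",    # PAM mutated by the editing cassette
}
viable = select_viable(cells, PAM, PROTOSPACER)
```

In this model only the cell carrying the PAM mutation remains in `viable`, mirroring the enrichment for edited cells described in step (b).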


A method for trackable genome editing can comprise: (a) introducing a vector that encodes at least one editing cassette, at least one recorder cassette, and at least two guide nucleic acids into a first population of cells, thereby producing a second population of cells comprising the vector; (b) maintaining the second population of cells under conditions in which a nucleic acid-guided nuclease is expressed or maintained, wherein the nucleic acid-guided nuclease is encoded on the vector, a second vector, on the genome of cells of the second population of cells, or otherwise introduced into the cell, resulting in DNA cleavage and incorporation of the editing and recorder cassettes; (c) obtaining viable cells; and (d) sequencing the recorder sequence of the target DNA molecule in at least one cell of the second population of cells to identify the mutation of at least one codon.


In some examples where the plasmid comprises a second editing sequence designed to silence a PAM, a method for trackable genome editing can comprise: (a) introducing a vector that encodes at least one editing cassette, a recorder cassette, and at least two guide nucleic acids into a first population of cells, thereby producing a second population of cells comprising the vector; (b) maintaining the second population of cells under conditions in which a nucleic acid-guided nuclease is expressed or maintained, wherein the nucleic acid-guided nuclease is encoded on the vector, a second vector, on the genome of cells of the second population of cells, or otherwise introduced into the cell, resulting in DNA cleavage, incorporation of the editing and recorder cassettes, and death of cells of the second population of cells that do not comprise the PAM mutation, whereas cells of the second population of cells that comprise the PAM mutation are viable; (c) obtaining viable cells; and (d) sequencing the recorder sequence of the target DNA in at least one cell of the second population of cells to identify the mutation of at least one codon.


In some examples, transformation efficiency is determined by using a non-targeting control guide nucleic acid, which allows for validation of the recombineering procedure and CFU/ng calculations. In some cases, absolute efficiency is obtained by counting the total number of colonies on each transformation plate, for example, by counting both red and white colonies from a galK control. In some examples, relative efficiency is calculated as the number of successful transformants (for example, white colonies) out of all colonies from a control (for example, galK control).
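The two efficiency measures described above reduce to simple arithmetic. The following is a minimal sketch under stated assumptions: the function names are hypothetical, and the `plated_fraction` correction is an added assumption that does not appear in the text.

```python
def cfu_per_ng(total_colonies: int, dna_ng: float, plated_fraction: float = 1.0) -> float:
    """Absolute efficiency: all colonies on the plate per ng of DNA transformed.

    plated_fraction is an assumed correction for plating only part of the recovery culture.
    """
    if dna_ng <= 0 or not 0 < plated_fraction <= 1:
        raise ValueError("dna_ng must be > 0 and plated_fraction must be in (0, 1]")
    return total_colonies / (dna_ng * plated_fraction)


def relative_efficiency(white_colonies: int, red_colonies: int) -> float:
    """Fraction of successful transformants (white) among all colonies of a galK-type control."""
    total = white_colonies + red_colonies
    if total == 0:
        raise ValueError("no colonies counted")
    return white_colonies / total


absolute = cfu_per_ng(total_colonies=200, dna_ng=100.0, plated_fraction=0.5)
relative = relative_efficiency(white_colonies=30, red_colonies=70)
```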


The methods of the disclosure can provide, for example, greater than 1000× improvements in the efficiency, scale, cost of generating a combinatorial library, and/or precision of such library generation.


The methods of the disclosure can provide, for example, greater than: 10×, 50×, 100×, 200×, 300×, 400×, 500×, 600×, 700×, 800×, 900×, 1000×, 1100×, 1200×, 1300×, 1400×, 1500×, 1600×, 1700×, 1800×, 1900×, 2000×, or greater improvements in the efficiency of generating genomic or combinatorial libraries.


The methods of the disclosure can provide, for example, greater than: 10×, 50×, 100×, 200×, 300×, 400×, 500×, 600×, 700×, 800×, 900×, 1000×, 1100×, 1200×, 1300×, 1400×, 1500×, 1600×, 1700×, 1800×, 1900×, 2000×, or greater improvements in the scale of generating genomic or combinatorial libraries.


The methods of the disclosure can provide, for example, greater than: 10×, 50×, 100×, 200×, 300×, 400×, 500×, 600×, 700×, 800×, 900×, 1000×, 1100×, 1200×, 1300×, 1400×, 1500×, 1600×, 1700×, 1800×, 1900×, 2000×, or greater decrease in the cost of generating genomic or combinatorial libraries.


The methods of the disclosure can provide, for example, greater than: 10×, 50×, 100×, 200×, 300×, 400×, 500×, 600×, 700×, 800×, 900×, 1000×, 1100×, 1200×, 1300×, 1400×, 1500×, 1600×, 1700×, 1800×, 1900×, 2000×, or greater improvements in the precision of genomic or combinatorial library generation.


Recursive Tracking for Combinatorial Engineering


Disclosed herein are methods and compositions for iterative rounds of engineering. Disclosed herein are recursive engineering strategies that allow implementation of CREATE recording at the single cell level through several serial engineering cycles (e.g., FIG. 18 and FIG. 19). These disclosed methods and compositions can enable search-based technologies that can effectively construct and explore complex genotypic space. The terms recursive and iterative can be used interchangeably.


Combinatorial engineering methods can comprise multiple rounds of engineering. Methods disclosed herein can comprise 2 or more rounds of engineering. For example, a method can comprise 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, or more than 30 rounds of engineering.


In some examples, during each round of engineering a new recorder sequence, such as a barcode, is incorporated at the same locus in nearby sites (e.g., FIG. 18, green bars or FIG. 19, black bars) such that following multiple engineering cycles to construct combinatorial diversity throughout the genome (e.g., FIG. 18, green bars or FIG. 19, grey bars) a simple PCR of the recording locus can be used to reconstruct each combinatorial genotype or to confirm that the engineered edit from each round has been incorporated into the target site.
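Reconstructing a combinatorial genotype from the recording locus can be sketched as a barcode lookup. This is an illustrative sketch only: the barcodes, edit labels, and the function `reconstruct_genotype` are hypothetical, and it assumes the recorder-to-edit map has already been obtained by sequencing the plasmid library and that barcodes are matched exactly.

```python
# Hypothetical recorder-to-edit map (barcode -> edit label), for illustration only.
RECORDER_TO_EDIT = {
    "ACGTACGT": "edit_A",
    "TGCATGCA": "edit_B",
    "GATTACAG": "edit_C",
}


def reconstruct_genotype(amplicon: str, recorder_to_edit: dict) -> list:
    """Return the edits whose barcodes occur in the amplicon, ordered by barcode position."""
    hits = []
    for barcode, edit in recorder_to_edit.items():
        position = amplicon.find(barcode)
        if position != -1:
            hits.append((position, edit))
    return [edit for _, edit in sorted(hits)]


# A PCR product of the recording locus after two rounds of engineering.
amplicon = "GGG" + "ACGTACGT" + "TTTT" + "TGCATGCA" + "CCC"
genotype = reconstruct_genotype(amplicon, RECORDER_TO_EDIT)
```

A real pipeline would also need to tolerate sequencing errors and reads in either orientation; both are omitted here for brevity.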


Disclosed herein are methods for selecting for successive rounds of engineering. Selection can occur by a PAM mutation incorporated by an editing cassette. Selection can occur by a PAM mutation incorporated by a recorder cassette. Selection can occur using a screenable, selectable, or counter-selectable marker. Selection can occur by targeting a site for editing or recording that was incorporated by a prior round of engineering, thereby selecting for variants that successfully incorporated edits and recorder sequences from both rounds or all prior rounds of engineering.


Quantitation of these genotypes can be used for understanding combinatorial mutational effects on large populations and investigation of important biological phenomena such as epistasis.


Serial editing and combinatorial tracking can be implemented using recursive vector systems as disclosed herein. These recursive vector systems can be used to move rapidly through the transformation procedure. In some examples, these systems consist of two or more plasmids containing orthogonal replication origins, antibiotic markers, and an encoded guide nucleic acid. The encoded guide nucleic acid in each vector can be designed to target one of the other resistance markers for destruction by nucleic acid-guided nuclease-mediated cleavage. These systems can be used, in some examples, to perform transformations in which the antibiotic selection pressure is switched to remove the previous plasmid and drive enrichment of the next round of engineered genomes. Two or more passages through the transformation loop can be performed, or in other words, multiple rounds of engineering can be performed. Introducing the requisite recording cassettes and editing cassettes into recursive vectors as disclosed herein can be used for simultaneous genome editing and plasmid curing in each transformation step with high efficiencies.


In some examples, the recursive vector system disclosed herein comprises 2, 3, 4, 5, 6, 7, 8, 9, 10, or more than 10 unique plasmids. In some examples, the recursive vector system can use a particular plasmid more than once as long as a distinct plasmid is used in the previous round and in the subsequent round.
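The plasmid-switching logic of a recursive vector system can be sketched as a small validity check. This is a toy model under stated assumptions: the `Plasmid` class, the marker names, and `selection_schedule` are hypothetical, and curing is modeled simply as the incoming plasmid's guide matching the resident plasmid's marker.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Plasmid:
    name: str
    marker: str        # antibiotic resistance marker carried by this plasmid
    guide_target: str  # marker that this plasmid's guide nucleic acid targets for cleavage


def selection_schedule(rounds: list) -> list:
    """Return the antibiotic selection used in each round, or raise if the loop is invalid."""
    resident = None
    selections = []
    for plasmid in rounds:
        if resident is not None:
            if plasmid.name == resident.name:
                raise ValueError("consecutive rounds must use distinct plasmids")
            if plasmid.guide_target != resident.marker:
                raise ValueError("incoming guide does not target the resident plasmid's marker")
        resident = plasmid  # the previous plasmid is treated as cured
        selections.append(plasmid.marker)
    return selections


p_a = Plasmid("pA", marker="amp", guide_target="kan")
p_b = Plasmid("pB", marker="kan", guide_target="amp")
selections = selection_schedule([p_a, p_b, p_a])  # pA is reused, but never twice in a row
```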


Recursive methods and compositions disclosed herein can be used to restore function to a selectable or screenable element in a targeted genome or plasmid. The selectable or screenable element can include an antibiotic resistance gene, a fluorescent gene, a unique DNA sequence or watermark, or other known reporter, screenable, or selectable gene. In some examples, each successive round of engineering can incorporate a fragment of the selectable or screenable element, such that at the end of the engineering rounds, the entire selectable or screenable element has been incorporated into the target genome or plasmid. In such examples, only those genomes or plasmids which have successfully incorporated all of the fragments, and therefore all of the desired corresponding mutations, can be selected or screened for. In this way, the selected or screened cells will be enriched for those that have incorporated the edits from each and every iterative round of engineering.
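The fragment-by-fragment restoration of a marker can be illustrated as follows. This is a hypothetical sketch: the fragment sequences, the round numbering, and the function names are invented for illustration, and a marker is treated as functional only when the compiled sequence matches the full-length element exactly.

```python
def compiled_marker(fragments_by_round: dict, incorporated_rounds: set) -> str:
    """Concatenate, in round order, the fragments from rounds that were successfully incorporated."""
    return "".join(fragments_by_round[r] for r in sorted(incorporated_rounds))


def is_selectable(full_marker: str, fragments_by_round: dict, incorporated_rounds: set) -> bool:
    """A cell is selectable only if every fragment was incorporated."""
    return compiled_marker(fragments_by_round, incorporated_rounds) == full_marker


# Hypothetical marker split into three fragments, one delivered per round.
FRAGMENTS = {1: "ATGAAA", 2: "CCCGGG", 3: "TTTTAA"}
FULL_MARKER = "ATGAAACCCGGGTTTTAA"

all_rounds = is_selectable(FULL_MARKER, FRAGMENTS, {1, 2, 3})
missed_round_two = is_selectable(FULL_MARKER, FRAGMENTS, {1, 3})
```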


Recursive methods can be used to switch a selectable or screenable marker between an on and an off position, or between an off and an on position, with each successive round of engineering. Using such a method allows conservation of available selectable or screenable markers by requiring, for example, the use of only one screenable or selectable marker. Furthermore, short regulatory sequences, start codons, or non-start codons can be used to turn the screenable or selectable marker on and off. Such short sequences can easily fit within a synthesized cassette or polynucleotide.


One or more rounds of engineering can be performed using the methods and compositions disclosed herein. In some examples, each round of engineering is used to incorporate an edit unique from that of previous rounds. Each round of engineering can incorporate a unique recording sequence. Each round of engineering can result in removal or curing of the plasmid used in the previous round of engineering. In some examples, successful incorporation of the recording sequence of each round of engineering results in a complete and functional screenable or selectable marker or unique sequence combination.


Unique recorder cassettes comprising recording sequences such as barcodes or screenable or selectable markers can be inserted with each round of engineering, thereby generating a recorder sequence that is indicative of the combination of edits or engineering steps performed. Successive recording sequences can be inserted adjacent to one another. Successive recording sequences can be inserted within proximity to one another. Successive sequences can be inserted at a distance from one another.


For example, successive recorder sequences can be inserted and separated by 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, or greater than 100 bp. In some examples, successive recorder sequences are separated by about 10, 50, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1100, 1200, 1300, 1400, 1500, or greater than 1500 bp.


Successive recorder sequences can be separated by any desired number of base pairs, and the separation can be dependent on and limited by the number of successive recorder sequences to be inserted, the size of the target nucleic acid or target genomes, and/or the design of the desired final recorder sequence. For example, if the compiled recorder sequence is a functional screenable or selectable marker, then the successive recording sequences can be inserted within proximity of, and within the same reading frame as, one another. If the compiled recorder sequence is a unique set of barcodes to be identified by sequencing and has no coding sequence element, then the successive recorder sequences can be inserted with any desired number of base pairs separating them. In these cases, the separation distance can be dependent on the sequencing technology to be used and the read length limit.
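The read-length constraint mentioned above can be checked with simple arithmetic. The sketch below assumes equally sized recorder sequences separated by a uniform spacing and read in a single sequencing read; the function names and the example values are hypothetical.

```python
def recorder_array_span(n_recorders: int, recorder_len_bp: int, spacing_bp: int) -> int:
    """Total length in bp from the start of the first recorder to the end of the last."""
    if n_recorders < 1:
        raise ValueError("at least one recorder sequence is required")
    return n_recorders * recorder_len_bp + (n_recorders - 1) * spacing_bp


def fits_in_single_read(n_recorders: int, recorder_len_bp: int,
                        spacing_bp: int, read_length_bp: int) -> bool:
    """True if the whole recorder array can be covered by one read of the given length."""
    return recorder_array_span(n_recorders, recorder_len_bp, spacing_bp) <= read_length_bp


span = recorder_array_span(n_recorders=3, recorder_len_bp=15, spacing_bp=10)
short_array_ok = fits_in_single_read(3, 15, 10, read_length_bp=150)
long_array_ok = fits_in_single_read(10, 20, 100, read_length_bp=300)
```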


While preferred embodiments of the present invention have been shown and described herein, it will be obvious to those skilled in the art that such embodiments are provided by way of example only. Numerous variations, changes, and substitutions will now occur to those skilled in the art without departing from the invention. It should be understood that various alternatives to the embodiments of the invention described herein may be employed in practicing the invention. It is intended that the following claims define the scope of the invention and that methods and structures within the scope of these claims and their equivalents be covered thereby.


Some Definitions

As used herein the term “wild type” is a term of the art understood by skilled persons and means the typical form of an organism, strain, gene or characteristic as it occurs in nature as distinguished from mutant or variant forms.


As used herein the term “variant” should be taken to mean the exhibition of qualities that have a pattern that deviates from what occurs in nature.


The terms “orthologue” (also referred to as “ortholog” herein) and “homologue” (also referred to as “homolog” herein) are well known in the art. By means of further guidance, a “homologue” of a protein as used herein is a protein of the same species which performs the same or a similar function as the protein it is a homologue of. Homologous proteins may but need not be structurally related, or are only partially structurally related. An “orthologue” of a protein as used herein is a protein of a different species which performs the same or a similar function as the protein it is an orthologue of. Orthologous proteins may but need not be structurally related, or are only partially structurally related. Homologs and orthologs may be identified by homology modelling (see, e.g., Greer, Science vol. 228 (1985) 1055, and Blundell et al. Eur J Biochem vol 172 (1988), 513) or “structural BLAST” (Dey F, Cliff Zhang Q, Petrey D, Honig B. Toward a “structural BLAST”: using structural relationships to infer function. Protein Sci. 2013 April; 22(4):359-66. doi: 10.1002/pro.2225.).


The terms “polynucleotide”, “nucleotide”, “nucleotide sequence”, “nucleic acid” and “oligonucleotide” are used interchangeably. They refer to a polymeric form of nucleotides of any length, either deoxyribonucleotides or ribonucleotides, or analogs thereof. Polynucleotides may have any three dimensional structure, and may perform any function, known or unknown. The following are non-limiting examples of polynucleotides: coding or non-coding regions of a gene or gene fragment, loci (locus) defined from linkage analysis, exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, short interfering RNA (siRNA), short-hairpin RNA (shRNA), micro-RNA (miRNA), ribozymes, cDNA, recombinant polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, isolated RNA of any sequence, nucleic acid probes, and primers. The term also encompasses nucleic-acid-like structures with synthetic backbones, see, e.g., Eckstein, 1991; Baserga et al., 1992; Milligan, 1993; WO 97/03211; WO 96/39154; Mata, 1997; Strauss-Soukup, 1997; and Samstag, 1996. A polynucleotide may comprise one or more modified nucleotides, such as methylated nucleotides and nucleotide analogs. If present, modifications to the nucleotide structure may be imparted before or after assembly of the polymer. The sequence of nucleotides may be interrupted by non-nucleotide components. A polynucleotide may be further modified after polymerization, such as by conjugation with a labeling component.


“Complementarity” refers to the ability of a nucleic acid to form hydrogen bond(s) with another nucleic acid sequence by either traditional Watson-Crick base pairing or other non-traditional types. A percent complementarity indicates the percentage of residues in a nucleic acid molecule which can form hydrogen bonds (e.g., Watson-Crick base pairing) with a second nucleic acid sequence (e.g., 5, 6, 7, 8, 9, 10 out of 10 being 50%, 60%, 70%, 80%, 90%, and 100% complementary). “Perfectly complementary” means that all the contiguous residues of a nucleic acid sequence will hydrogen bond with the same number of contiguous residues in a second nucleic acid sequence. “Substantially complementary” as used herein refers to a degree of complementarity that is at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, 99%, or 100% over a region of 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50, or more nucleotides, or refers to two nucleic acids that hybridize under stringent conditions.
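Percent complementarity as defined above can be computed directly. The following is a minimal sketch that assumes DNA sequences of equal length written 5' to 3', antiparallel pairing, and Watson-Crick pairs only; non-traditional pairing is not modeled, and the function name is hypothetical.

```python
WATSON_CRICK = {"A": "T", "T": "A", "G": "C", "C": "G"}


def percent_complementarity(seq: str, target: str) -> float:
    """Percentage of residues in seq that can pair with target in antiparallel orientation."""
    if len(seq) != len(target) or not seq:
        raise ValueError("sequences must be non-empty and of equal length")
    reversed_target = target[::-1]  # align the 5' end of seq with the 3' end of target
    paired = sum(1 for a, b in zip(seq, reversed_target) if WATSON_CRICK.get(a) == b)
    return 100.0 * paired / len(seq)


perfect = percent_complementarity("ACGTACGTAC", "GTACGTACGT")  # exact reverse complement
partial = percent_complementarity("ACGTACGTAC", "CAACGTACGT")  # two mismatched positions
```

Here 10 of 10 paired residues gives 100% and 8 of 10 gives 80%, matching the worked percentages in the definition.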


As used herein, “stringent conditions” for hybridization refer to conditions under which a nucleic acid having complementarity to a target sequence predominantly hybridizes with the target sequence, and substantially does not hybridize to non-target sequences. Stringent conditions are generally sequence-dependent, and vary depending on a number of factors. In general, the longer the sequence, the higher the temperature at which the sequence specifically hybridizes to its target sequence. Non-limiting examples of stringent conditions are described in detail in Tijssen (1993), Laboratory Techniques In Biochemistry And Molecular Biology-Hybridization With Nucleic Acid Probes Part I, Second Chapter “Overview of principles of hybridization and the strategy of nucleic acid probe assay”, Elsevier, N.Y. Where reference is made to a polynucleotide sequence, then complementary or partially complementary sequences are also envisaged. These are preferably capable of hybridising to the reference sequence under highly stringent conditions. Generally, in order to maximize the hybridization rate, relatively low-stringency hybridization conditions are selected: about 20 to 25 degrees Celsius lower than the thermal melting point (Tm). The Tm is the temperature at which 50% of specific target sequence hybridizes to a perfectly complementary probe in solution at a defined ionic strength and pH. Generally, in order to require at least about 85% nucleotide complementarity of hybridized sequences, highly stringent washing conditions are selected to be about 5 to 15 degrees Celsius lower than the Tm. In order to require at least about 70% nucleotide complementarity of hybridized sequences, moderately-stringent washing conditions are selected to be about 15 to 30 degrees Celsius lower than the Tm. Highly permissive (very low stringency) washing conditions may be as low as 50 degrees Celsius below the Tm, allowing a high level of mis-matching between hybridized sequences.
Those skilled in the art will recognize that other physical and chemical parameters in the hybridization and wash stages can also be altered to affect the outcome of a detectable hybridization signal from a specific level of homology between target and probe sequences.
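The temperature windows stated above follow directly from the Tm. The sketch below only encodes the offsets given in the text; the function and constant names are hypothetical, and the Tm itself is assumed to be supplied by the user rather than calculated.

```python
# Offsets below the Tm, in degrees Celsius, as (nearest to Tm, farthest from Tm).
WASH_OFFSETS_C = {
    "high": (5.0, 15.0),       # requires at least about 85% complementarity
    "moderate": (15.0, 30.0),  # requires at least about 70% complementarity
}
HYBRIDIZATION_OFFSETS_C = (20.0, 25.0)  # chosen to maximize hybridization rate


def wash_temperature_range(tm_c: float, stringency: str) -> tuple:
    """Return (lowest, highest) wash temperature in degrees Celsius for a stringency level."""
    nearest, farthest = WASH_OFFSETS_C[stringency]
    return (tm_c - farthest, tm_c - nearest)


def hybridization_temperature_range(tm_c: float) -> tuple:
    """Return (lowest, highest) hybridization temperature in degrees Celsius."""
    nearest, farthest = HYBRIDIZATION_OFFSETS_C
    return (tm_c - farthest, tm_c - nearest)


high = wash_temperature_range(70.0, "high")
moderate = wash_temperature_range(70.0, "moderate")
hybridization = hybridization_temperature_range(70.0)
```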


“Hybridization” refers to a reaction in which one or more polynucleotides react to form a complex that is stabilized via hydrogen bonding between the bases of the nucleotide residues. The hydrogen bonding may occur by Watson-Crick base pairing, Hoogsteen binding, or in any other sequence specific manner. The complex may comprise two strands forming a duplex structure, three or more strands forming a multi stranded complex, a single self-hybridizing strand, or any combination of these. A hybridization reaction may constitute a step in a more extensive process, such as the initiation of PCR, or the cleavage of a polynucleotide by an enzyme. A sequence capable of hybridizing with a given sequence is referred to as the “complement” of the given sequence.


As used herein, the term “genomic locus” or “locus” (plural loci) is the specific location of a gene or DNA sequence on a chromosome. A “gene” refers to stretches of DNA or RNA that encode a polypeptide or an RNA chain that has a functional role to play in an organism and hence is the molecular unit of heredity in living organisms. For the purpose of this invention it may be considered that genes include regions which regulate the production of the gene product, whether or not such regulatory sequences are adjacent to coding and/or transcribed sequences. Accordingly, a gene includes, but is not necessarily limited to, promoter sequences, terminators, translational regulatory sequences such as ribosome binding sites and internal ribosome entry sites, enhancers, silencers, insulators, boundary elements, replication origins, matrix attachment sites and locus control regions.


As used herein, “expression of a genomic locus” or “gene expression” is the process by which information from a gene is used in the synthesis of a functional gene product. The products of gene expression are often proteins, but in non-protein coding genes such as rRNA genes or tRNA genes, the product is functional RNA. The process of gene expression is used by all known life, including eukaryotes (including multicellular organisms), prokaryotes (bacteria and archaea), and viruses, to generate functional products to survive. As used herein “expression” of a gene or nucleic acid encompasses not only cellular gene expression, but also the transcription and translation of nucleic acid(s) in cloning systems and in any other context. As used herein, “expression” also refers to the process by which a polynucleotide is transcribed from a DNA template (such as into an mRNA or other RNA transcript) and/or the process by which a transcribed mRNA is subsequently translated into peptides, polypeptides, or proteins. Transcripts and encoded polypeptides may be collectively referred to as “gene product.” If the polynucleotide is derived from genomic DNA, expression may include splicing of the mRNA in a eukaryotic cell.


The terms “polypeptide”, “peptide” and “protein” are used interchangeably herein to refer to polymers of amino acids of any length. The polymer may be linear or branched, it may comprise modified amino acids, and it may be interrupted by non-amino acids. The terms also encompass an amino acid polymer that has been modified, for example, by disulfide bond formation, glycosylation, lipidation, acetylation, phosphorylation, or any other manipulation, such as conjugation with a labeling component. As used herein the term “amino acid” includes natural and/or unnatural or synthetic amino acids, including glycine and both the D and L optical isomers, and amino acid analogs and peptidomimetics.


As used herein, the term “domain” or “protein domain” refers to a part of a protein sequence that may exist and function independently of the rest of the protein chain.


As described in aspects of the invention, sequence identity is related to sequence homology. Homology comparisons may be conducted by eye, or more usually, with the aid of readily available sequence comparison programs. These commercially available computer programs may calculate percent (%) homology between two or more sequences and may also calculate the sequence identity shared by two or more amino acid or nucleic acid sequences. Sequence homologies may be generated by any of a number of computer programs known in the art, for example BLAST or FASTA, etc. A suitable computer program for carrying out such an alignment is the GCG Wisconsin Bestfit package (University of Wisconsin, U.S.A.; Devereux et al., 1984, Nucleic Acids Research 12:387). Examples of other software that may perform sequence comparisons include, but are not limited to, the BLAST package (see Ausubel et al., 1999 ibid, Chapter 18), FASTA (Altschul et al., 1990, J. Mol. Biol., 403-410) and the GENEWORKS suite of comparison tools. Both BLAST and FASTA are available for offline and online searching (see Ausubel et al., 1999 ibid, pages 7-58 to 7-60). However, it is preferred to use the GCG Bestfit program.


Percent homology may be calculated over contiguous sequences, i.e., one sequence is aligned with the other sequence and each amino acid or nucleotide in one sequence is directly compared with the corresponding amino acid or nucleotide in the other sequence, one residue at a time. This is called an “ungapped” alignment. Typically, such ungapped alignments are performed only over a relatively short number of residues.


Although this is a very simple and consistent method, it fails to take into consideration that, for example, in an otherwise identical pair of sequences, one insertion or deletion may cause the following amino acid residues to be put out of alignment, thus potentially resulting in a large reduction in % homology when a global alignment is performed. Consequently, most sequence comparison methods are designed to produce optimal alignments that take into consideration possible insertions and deletions without unduly penalizing the overall homology or identity score. This is achieved by inserting “gaps” in the sequence alignment to try to maximize local homology or identity.
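The sensitivity of ungapped comparison to a single deletion can be seen in a short example. This is an illustrative sketch: the sequences and the function name are hypothetical, and when the two sequences differ in length it is assumed that only the overlapping positions are compared.

```python
def ungapped_percent_identity(seq_a: str, seq_b: str) -> float:
    """Compare residues position by position, with no gaps allowed.

    Assumption: when lengths differ, only the overlapping leading positions are compared.
    """
    n = min(len(seq_a), len(seq_b))
    if n == 0:
        raise ValueError("both sequences must be non-empty")
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b)
    return 100.0 * matches / n


reference = "MKTAYIAKQR"
one_deletion = "MKAYIAKQR"  # the reference with its third residue deleted

identical = ungapped_percent_identity(reference, reference)
shifted = ungapped_percent_identity(reference, one_deletion)
```

Although the second sequence differs from the reference by a single deleted residue, every position after the deletion is shifted out of register, so only the first two positions still match (about 22% identity). A gapped alignment would place one gap at the deleted position and recover the remaining matches.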


However, these more complex methods assign “gap penalties” to each gap that occurs in the alignment so that, for the same number of identical amino acids, a sequence alignment with as few gaps as possible (reflecting higher relatedness between the two compared sequences) may achieve a higher score than one with many gaps. “Affine gap costs” are typically used that charge a relatively high cost for the existence of a gap and a smaller penalty for each subsequent residue in the gap. This is the most commonly used gap scoring system. High gap penalties may, of course, produce optimized alignments with fewer gaps. Most alignment programs allow the gap penalties to be modified. However, it is preferred to use the default values when using such software for sequence comparisons. For example, when using the GCG Wisconsin Bestfit package the default gap penalty for amino acid sequences is −12 for a gap and −4 for each extension.
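The gap scoring scheme described above (commonly called affine gap costs) can be written as a one-line formula. The sketch below uses the default values quoted in the text and assumes, following the wording of the paragraph, that the opening cost covers the first gapped residue and the extension cost applies to each subsequent residue; some programs instead charge the extension cost for every residue in the gap. The function name is hypothetical.

```python
def affine_gap_penalty(gap_length: int, gap_open: int = 12, gap_extend: int = 4) -> int:
    """Score contribution (zero or negative) of one contiguous gap of the given length."""
    if gap_length < 0:
        raise ValueError("gap_length must be non-negative")
    if gap_length == 0:
        return 0
    # Assumed convention: opening cost for the first residue, extension cost for the rest.
    return -(gap_open + gap_extend * (gap_length - 1))


single_residue_gap = affine_gap_penalty(1)
one_long_gap = affine_gap_penalty(4)             # one gap spanning four residues
four_short_gaps = 4 * affine_gap_penalty(1)      # four separate single-residue gaps
```

Under this convention one four-residue gap is penalized less than four separate one-residue gaps, which is why alignments with few, longer gaps tend to score higher.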


Calculation of maximum % homology therefore first requires the production of an optimal alignment, taking into consideration gap penalties. A suitable computer program for carrying out such an alignment is the GCG Wisconsin Bestfit package (Devereux et al., 1984 Nuc. Acids Research 12 p387). Examples of other software that may perform sequence comparisons include, but are not limited to, the BLAST package (see Ausubel et al., 1999 Short Protocols in Molecular Biology, 4th Ed., Chapter 18), FASTA (Altschul et al., 1990 J. Mol. Biol. 403-410) and the GENEWORKS suite of comparison tools. Both BLAST and FASTA are available for offline and online searching (see Ausubel et al., 1999, Short Protocols in Molecular Biology, pages 7-58 to 7-60). However, for some applications, it is preferred to use the GCG Bestfit program. A new tool, called BLAST 2 Sequences, is also available for comparing protein and nucleotide sequences (see FEMS Microbiol Lett. 1999 174(2): 247-50; FEMS Microbiol Lett. 1999 177(1): 187-8 and the website of the National Center for Biotechnology Information at the website of the National Institutes of Health).


Although the final % homology may be measured in terms of identity, the alignment process itself is typically not based on an all-or-nothing pair comparison. Instead, a scaled similarity score matrix is generally used that assigns scores to each pair-wise comparison based on chemical similarity or evolutionary distance. An example of such a matrix commonly used is the BLOSUM62 matrix—the default matrix for the BLAST suite of programs. GCG Wisconsin programs generally use either the public default values or a custom symbol comparison table, if supplied (see user manual for further details). For some applications, it is preferred to use the public default values for the GCG package, or in the case of other software, the default matrix, such as BLOSUM62.


Alternatively, percentage homologies may be calculated using the multiple alignment feature in DNASIS™ (Hitachi Software), based on an algorithm, analogous to CLUSTAL (Higgins D G & Sharp P M (1988), Gene 73(1), 237-244). Once the software has produced an optimal alignment, it is possible to calculate % homology, preferably % sequence identity. The software typically does this as part of the sequence comparison and generates a numerical result.


Sequences may also have deletions, insertions or substitutions of amino acid residues which produce a silent change and result in a functionally equivalent substance. Deliberate amino acid substitutions may be made on the basis of similarity in amino acid properties (such as polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues) and it is therefore useful to group amino acids together in functional groups. Amino acids may be grouped together based on the properties of their side chains alone. However, it is more useful to include mutation data as well. The sets of amino acids thus derived are likely to be conserved for structural reasons. These sets may be described in the form of a Venn diagram (Livingstone C. D. and Barton G. J. (1993) “Protein sequence alignments: a strategy for the hierarchical analysis of residue conservation” Comput. Appl. Biosci. 9: 745-756) (Taylor W. R. (1986) “The classification of amino acid conservation” J. Theor. Biol. 119; 205-218). Conservative substitutions may be made, for example according to the table below which describes a generally accepted Venn diagram grouping of amino acids.


Embodiments of the invention include sequences (both polynucleotide and polypeptide) which may comprise homologous substitution (substitution and replacement are both used herein to mean the interchange of an existing amino acid residue or nucleotide with an alternative residue or nucleotide), i.e., like-for-like substitution, which in the case of amino acids means basic for basic, acidic for acidic, polar for polar, etc. Non-homologous substitution may also occur, i.e., from one class of residue to another, or alternatively may involve the inclusion of unnatural amino acids such as ornithine (hereinafter referred to as Z), diaminobutyric acid (hereinafter referred to as B), norleucine (hereinafter referred to as O), pyridylalanine, thienylalanine, naphthylalanine and phenylglycine.


Variant amino acid sequences may include suitable spacer groups that may be inserted between any two amino acid residues of the sequence including alkyl groups such as methyl, ethyl or propyl groups in addition to amino acid spacers such as glycine or β-alanine residues. A further form of variation, which involves the presence of one or more amino acid residues in peptoid form, may be well understood by those skilled in the art. For the avoidance of doubt, “the peptoid form” is used to refer to variant amino acid residues wherein the α-carbon substituent group is on the residue's nitrogen atom rather than the α-carbon. Processes for preparing peptides in the peptoid form are known in the art, for example Simon R J et al., PNAS (1992) 89(20), 9367-9371 and Horwell D C, Trends Biotechnol. (1995) 13(4), 132-134.


The practice of the present invention employs, unless otherwise indicated, conventional techniques of immunology, biochemistry, chemistry, molecular biology, microbiology, cell biology, genomics and recombinant DNA, which are within the skill of the art. See Green and Sambrook, Molecular Cloning: A Laboratory Manual, 4th ed. (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 2014); CURRENT PROTOCOLS IN MOLECULAR BIOLOGY (F. M. Ausubel, et al. eds. (2017)); Short Protocols in Molecular Biology (Ausubel et al., 1999); the series METHODS IN ENZYMOLOGY (Academic Press, Inc.); PCR 2: A PRACTICAL APPROACH (M. J. MacPherson, B. D. Hames and G. R. Taylor eds. (1995)); ANTIBODIES, A LABORATORY MANUAL, SECOND EDITION (Harlow and Lane, eds. (2014)); and CULTURE OF ANIMAL CELLS: A MANUAL OF BASIC TECHNIQUE, 7TH EDITION (R. I. Freshney, ed. (2016)).


EXAMPLES

The following examples are given for the purpose of illustrating various embodiments of the invention and are not meant to limit the present invention in any fashion. The present examples, along with the methods described herein are presently representative of preferred embodiments, are exemplary, and are not intended as limitations on the scope of the invention. Changes therein and other uses which are encompassed within the spirit of the invention as defined by the scope of the claims will occur to those skilled in the art.


Example 1. Nucleic Acid-Guided Nucleases

Sequences for twenty nucleic acid-guided nucleases, termed MAD1-MAD20 (SEQ ID NOs 1-20), were aligned and compared to other nucleic acid-guided nucleases. A partial alignment and phylogenetic tree are depicted in FIG. 1A and FIG. 1B, respectively. Key residues that may be involved in the recognition of a PAM site are shown in FIG. 1A. These include amino acids at positions 167, 539, 548, 599, 603, 604, 605, 606, and 607.


Sequence alignments were built using PSI-BLAST to search for MAD nuclease homologs in the NCBI non-redundant databases. Multiple sequence alignments were further refined using the MUSCLE alignment algorithm with default settings as implemented in Geneious 10. The percent identity of each homolog to the SpCas9 and AsCpf1 reference sequences was computed based on the pairwise alignment matching from these global alignments.
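
As an illustration of how a pairwise percent identity can be read off two rows of a global multiple sequence alignment, the following is a minimal sketch. It is not the Geneious implementation; the function name `percent_identity` and the choice of denominator (all columns in which at least one of the two sequences has a residue) are assumptions, and other tools may divide by the shorter sequence length or by the number of ungapped columns instead.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity between two rows taken from the same alignment.

    Columns that are gaps in both rows are ignored (they only exist because
    of other sequences in the multiple alignment). A column counts as a
    match only when both rows carry the same residue.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    compared = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap and b == gap:
            continue  # column is empty for this pair
        compared += 1
        if a == b:
            matches += 1  # neither is a gap here, so this is a true match
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


# Toy aligned fragments (hypothetical, not taken from the sequence listing).
print(round(percent_identity("MKT-LLV", "MRTALLV"), 1))
```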


Genomic source sequences were identified using Uniprot linkage information or TBLASTN searches of NCBI using the default parameters and searching all possible frames for translational matches.


Percent identities of MAD1-8 and MAD10-12 to various other nucleases are summarized in Table 1. These percent identities represent the shared amino acid sequence identity between the indicated proteins.




















TABLE 1

Protein identifier or accession number  MAD1   MAD2   MAD3   MAD4   MAD5   MAD6   MAD7   MAD8   MAD10  MAD11  MAD12
gi|1025734861|pdb|5B43|A                6.4    32.8   33.2   29.7   29.4   31.1   30.3   31.7   26.7   27.9   98.8
gi|1052245173|pdb|5KK5|A                6.4    32.7   33.1   29.7   29.3   31     30.3   31.7   26.7   27.8   98.7
gi|1086216683|emb|SDC16215.1|           6.1    33     34.4   29.6   30.1   33.5   32.3   32.1   26.2   27.2   46.8
gi|1120175333|ref|WP_073043853.1|       5.9    30.9   37.2   32.8   33.6   34.4   35.7   35.1   26.3   28.3   34.9
Cpf1.Sj|WP_081839471                    6.6    33.6   41.7   37.2   33.4   37.6   40.1   37.7   29.1   30.3   34.1
Cpf1.Ss|KFO67989                        6.9    32.3   35.7   43     33.7   45.9   34.8   48     33.2   33.4   33.8
MAD3                                    5.8    31     100    32.9   35.9   35     35.6   34.3   28     27.6   33.1
gi|1082474576|gb|OFY19591.1|            7      31.4   35.9   43.2   31.4   45     33.6   48.6   30.8   33.5   33
MAD2                                    6.1    100    31     30.7   30.2   31     31.2   31.2   25.8   27.7   32.6
Cpf1.Lb5|WP_016301126                   7.8    32.8   36.5   38.2   34.2   45.5   35.8   43.6   30.7   35.7   32.5
gi|1088286736|gb|OHB41002.1|            6.7    30.6   35.3   42.4   33.2   44.7   32.1   46.8   30.7   32.6   32.4
gi|1094423310|emb|SER03894.1|           6.8    30.8   36.1   40.4   31.8   50.4   35.2   46.6   30.4   36.8   32.3
gi|493326531|ref|WP_006283774.1|        6.8    30.8   36.1   40.3   31.8   50.3   35.1   46.6   30.4   36.8   32.3
MAD8                                    7.6    31.2   34.3   40.4   32     41.6   32.8   100    30.1   32.1   31.7
Cpf1.Bot|WP_009217842                   6.9    30.1   36.6   41.5   32.5   50.2   35.4   45.5   29.8   34.1   31.6
Cpf1.Li|WP_020988726                    7.3    30.2   34.6   39.3   30.3   40.7   31.8   39.4   32.1   31.3   31.5
Cpf1.Pb|WP_044110123                    6.3    31.4   31.8   36.1   30.8   45.7   30.4   39.4   27.7   33.5   31.5
gi|817911372|gb|AKG08867.1|             7.3    29.8   35     40.7   32.1   40.3   32.6   41.7   29.1   31     31.4
gi|1052838533|emb|SCH45297.1|           6.6    30.8   35.5   32     31.5   34.4   51.9   33.4   26.1   29     31.3
gi|1053713332|ref|WP_066040075.1|       7.2    29.6   33.2   39.6   29.8   49.1   32.2   41.4   30.1   32.4   31.3
gi|817909002|gb|AKG06878.1|             7.3    29.8   35     40.7   32     40.3   32.5   41.6   29.1   30.9   31.3
gi|1042201477|ref|WP_065256572.1|       7.2    29.5   35.2   40.6   31.9   40.1   32.7   41.6   29     30.8   31.2
MAD6                                    7.5    31     35     38.9   33.1   100    34.3   41.6   30.5   33.6   31
gi|490468773|ref|WP_004339290.1|        6.8    31.8   31.7   36.2   28.6   36.5   31.4   38.4   28.5   31.4   31
gi|565853704|ref|WP_023936172.1|        7.5    30.8   34.9   38.9   33.1   99.7   34.1   41.6   30.4   33.6   31
gi|739005707|ref|WP_036887416.1|        7.5    30.9   35     38.9   33     99.9   34.2   41.5   30.4   33.5   31
gi|739008549|ref|WP_036890108.1|        7.5    31     35     38.8   33     99.8   34.2   41.5   30.4   33.5   31
Cpf1.Ft|WP_014550095                    7.1    31.9   33.8   40.3   29.7   39.4   34.1   41     29.8   32.5   30.8
gi|504362993|ref|WP_014550095.1|        7.2    32.4   33.8   40.3   29.6   39.4   33.8   40.9   30.1   32.5   30.8
gi|640557447|ref|WP_024988992.1|        6.6    31.4   34.8   40.7   31.2   48     34.1   45.1   28.8   35.2   30.8
gi|1098944113|ref|WP_071304624.1|       7.1    32.3   33.5   40.3   29.6   39.2   33.8   40.9   30.1   32.5   30.6
gi|489124848|ref|WP_003034647.1|        7.1    32.3   33.9   40.9   29.9   39.2   33.9   40.9   29.9   32.2   30.6
gi|738967776|ref|WP_036851563.1|        6.8    29.4   33.1   35.5   28.9   40.3   30.7   35.9   28.7   31.3   30.5
MAD7                                    5.9    31.2   35.6   30.8   33.9   34.3   100    32.8   24.2   28.9   30.5
Cpf1.Lb6|WP_044910713                   6.7    29.8   33.7   36.6   30.9   43     34     39.8   29.1   32.1   30.4
gi|1052961977|emb|SCH47915.1|           5.5    30.5   35.8   32.3   34     35     53.8   33.4   26.2   27.4   30.4
gi|817918353|gb|AKG14689.1|             7      29.1   34.4   39.8   31.7   40     32.4   41.1   28.4   30.1   30.3
gi|917059416|ref|WP_051666128.1|        6.9    29.9   31.5   35.7   31.6   41.8   32.9   39.1   30.1   34     30.2
gi|1011649201|ref|WP_062499108.1|       6.8    29     34.7   40.3   31.4   40.1   33.1   41.6   28.5   30.4   30.1
Cpf1.Pm|WP_018359861                    6.3    29.2   32.3   34.2   27.4   38.7   29.4   35     27.2   30.1   30
gi|817922537|gb|AKG18099.1|             6.8    29.1   34.5   39.6   31.5   39.9   32.7   40.7   28.3   29.8   30
gi|769142322|ref|WP_044919442.1|        6.7    31     34.6   37.8   31.5   41.4   33.3   39.2   28     31.9   29.9
gi|1023176441|pdb|5ID6|A                6.7    29.7   31.3   35.5   31.3   41     32.6   38.5   29.7   33.3   29.8
gi|491540987|ref|WP_005398606.1|        5.9    28.3   30.4   29.7   28.5   29     30.7   29.8   25.8   27.8   29.8
gi|652820612|ref|WP_027109509.1|        6.4    31.1   34     35.3   31.7   40.3   33.4   37.5   28.5   33.3   29.8
gi|502240446|ref|WP_012739647.1|        5.9    31.6   36.1   31.2   33     35.4   49.4   34     26.6   29.4   29.7
gi|524278046|emb|CDA41776.1|            5.8    31.6   36     31     33     35.4   50     34     26.6   29.5   29.7
gi|737831580|ref|WP_035798880.1|        6.2    31.3   34.8   38.1   31.5   42.1   33     39.6   28.4   32.4   29.7
gi|909652572|ref|WP_049895985.1|        6.9    30.7   34.2   37.2   30.8   41.5   34.2   38.7   28     32     29.7
MAD4                                    6.7    30.7   32.9   100    30.7   38.9   30.8   40.4   28.8   29.4   29.7
gi|942073049|ref|WP_055286279.1|        5.9    31.6   36.1   31.1   32.7   35     49.7   33.9   27.1   29.5   29.6
gi|654794505|ref|WP_028248456.1|        7.4    30.5   35.9   37.4   31.3   42.8   34.2   40.2   27.9   33.5   29.5
gi|933014786|emb|CUO47728.1|            5.6    31.3   34.9   31.2   31.5   32.4   46.7   30.6   25.4   27.7   29.4
gi|941887450|ref|WP_055224182.1|        5.6    31.4   35     31.3   31.6   32.5   46.6   30.7   25.3   27.8   29.4
gi|920071674|ref|WP_052943011.1|        6.3    31     31.8   38.8   31.8   41.3   33.8   42.6   29.8   34.7   29
MAD5                                    5.1    30.2   35.9   30.7   100    33.1   33.9   32     24.3   28.7   29
gi|1081462674|emb|SCZ76797.1|           6.9    30.4   33.5   34.7   29.7   40.1   30.5   37.4   27.3   32.5   28.9
gi|918722523|ref|WP_052585281.1|        7.4    27.5   30.5   35.7   28.3   35.2   28.5   36     26     27.1   28.8
gi|524816323|emb|CDF09621.1|            6.2    30     34.1   29.3   31.2   32.7   47.6   32.2   25.5   25.9   28.4
gi|941782328|ref|WP_055176369.1|        6.2    30.2   33.1   28.9   30.9   32     46.9   32.1   26     27.1   28.4
gi|942113296|ref|WP_055306762.1|        6.4    29.8   33.8   29.7   31.3   33.1   48     32.5   25.8   26.2   28.4
MAD11                                   6.4    27.7   27.6   29.4   28.7   33.6   28.9   32.1   26.2   100    27.8
gi|653158548|ref|WP_027407524.1|        5.9    26.4   28.1   33.5   27.4   32.5   27.8   32     27     26.8   27.6
gi|652963004|ref|WP_027216152.1|        6.6    30.3   32.5   33.2   30.4   38.2   29.6   34.6   25.9   30.5   27.2
gi|1083069650|gb|OGD68774.1|            6.2    25     24.3   26.6   23.1   28.1   23.2   26.4   45     24.9   27.1
gi|302483275|gb|EFL46285.1|             5.6    24.7   26.8   30.3   24.9   34.8   26     30.4   24.4   27.5   27.1
gi|915400855|ref|WP_050786240.1|        5.6    24.7   26.8   30.3   24.9   34.8   26     30.4   24.4   27.5   27.1
MAD10                                   5.6    25.8   28     28.8   24.3   30.5   24.2   30.1   100    26.2   26.6
gi|1101117967|gb|OIO75780.1|            6.1    26.8   26     27.3   24.3   28.1   24.4   28.2   44.1   25.4   26.1
gi|1088204458|gb|OHA63117.1|            6.5    25.2   23.5   25.8   22.9   27     22     26.1   36.5   24.2   24.7
gi|809198071|ref|WP_046328599.1|        4.9    25.6   26.5   22.2   23.9   23.8   25.8   23.9   20.3   25.1   24
gi|1088079929|gb|OGZ45678.1|            5.6    21.9   23.8   26.9   23.4   27.8   23.3   26.7   28.8   24.7   23.5
gi|1101053499|gb|OIO15737.1|            5.9    23.1   26.2   25.2   23     26.4   25.1   26.5   29.2   23.2   23.4
gi|1101058058|gb|OIO19978.1|            5.4    21.2   22.8   23.6   20.6   25     20.7   25     25.9   22.2   23
gi|1088000848|gb|OGY73485.1|            5.7    23.5   25.2   25.5   23.9   27     25.1   25.6   31.6   23.6   22.9
gi|407014433|gb|EKE28449.1|             5.2    23.5   25.9   26.7   24.3   25.8   23     27.8   29.9   25.3   22.9
gi|818249855|gb|KKP36646.1|             6      21     20.7   23.5   20     24.2   21     24     24.6   21.8   22.6
gi|818703647|gb|KKT48220.1|             5.8    23.3   25     25.1   23.5   26.5   24.7   25.3   31.2   23.3   22.6
gi|818705786|gb|KKT50231.1|             5.8    23.1   24.6   24.7   22.9   26.2   24.2   24.8   30.8   22.9   22.2
gi|1083950632|gb|OGJ66851.1|            4.5    20     22.1   23.5   20.6   24.6   20     24     23.5   20.7   22.1
gi|1083932199|gb|OGJ49885.1|            6      20.4   20.2   22.6   19.3   23.3   20.6   23.2   23.9   21     21.8
gi|1083410735|gb|OGF20863.1|            5      21.7   23.3   25.5   23     25     22.7   25.9   27.2   22.4   21.5
gi|1011480927|ref|WP_062376669.1|       4.7    20.1   20.1   21.4   19.3   23.3   21.4   22     20.2   19.7   20.9
gi|818539593|gb|KKR91555.1|             5.1    19.8   21.6   22.1   20.5   22.9   21.2   22.8   24     20.5   19.9
gi|503048015|ref|WP_013282991.1|        5.1    18.8   20.7   15.3   19.7   18.9   19.3   17.7   15.9   19     19.2
gi|1096232746|ref|WP_071177645.1|       5      19.1   20.5   17.4   20.1   19.7   20.4   20.4   17.5   18.5   18.9
gi|769130404|ref|WP_044910712.1|        4.6    19.4   18.2   16.1   18.1   17.1   18.7   17.9   14.5   16.8   17.5
gi|1085569500|gb|OGX23684.1|            2.6    11.6   12.1   12.7   10.2   12.1   12.7   11.6   10.9   11.1   10.5
gi|818357062|gb|KKQ38176.1|             3.3    10     11.1   10.6   11.1   11.8   12.1   11.5   12.2   10.8   9.8
gi|745626763|gb|KIE18642.1|             3.7    9.4    11.7   11.1   11.1   12.5   11.9   11.9   10.2   10.6   8.8
MAD1                                    100    6.1    5.8    6.7    5.1    7.5    5.9    7.6    5.6    6.4    6.4
SpCas9                                  4      6.3    6.5    8.3    5.6    8.1    6.9    7.7    6.9    6.3    6.3
MAD12                                   6.4    32.6   33.1   29.7   29     31     30.5   31.7   26.6   27.8   100









Example 2: Expression of MAD Nucleases

Wild-type nucleic acid sequences for MAD1-MAD20 include SEQ ID NOs 21-40, respectively. These MAD nucleases were codon optimized for expression in E. coli and the codon optimized sequences are listed as SEQ ID NO: 41-60, respectively (summarized in Table 2).


Codon optimized MAD1-MAD20 were cloned into an expression construct comprising a constitutive or inducible promoter (e.g., proB promoter SEQ ID NO: 83, or pBAD promoter SEQ ID NO: 81 or SEQ ID NO: 82) and an optional 6×-His tag (e.g., FIG. 2). The generated MAD1-MAD20 expression constructs are provided as SEQ ID NOs: 61-80, respectively. The expression constructs as depicted in FIG. 2 were generated either by restriction/ligation-based cloning or homology-based cloning.


Example 3. Testing Guide Nucleic Acid Sequences Compatible with MAD Nucleases

In order to have a functioning targetable nuclease complex, a nucleic acid-guided nuclease and a compatible guide nucleic acid are needed. To determine the compatible guide nucleic acid sequence, specifically the scaffold sequence portion of the guide nucleic acid, multiple approaches were taken. First, scaffold sequences were sought near the endogenous loci of each MAD nuclease. In some cases, such as with MAD2, no endogenous scaffold sequence was found. Therefore, we tested the compatibility of MAD2 with scaffold sequences found near the endogenous loci of the other MAD nucleases. The MAD nucleases and corresponding endogenous scaffold sequences that were tested are listed in Table 2.













TABLE 2

MAD nuclease  WT nucleic acid sequence  Codon optimized nucleic acid sequence  Amino acid sequence  Endogenous scaffold sequence for guide nucleic acid
MAD1          SEQ ID NO: 21             SEQ ID NO: 41                          SEQ ID NO: 1         SEQ ID NO: 84
MAD2          SEQ ID NO: 22             SEQ ID NO: 42                          SEQ ID NO: 2         None identified
MAD3          SEQ ID NO: 23             SEQ ID NO: 43                          SEQ ID NO: 3         SEQ ID NO: 86
MAD4          SEQ ID NO: 24             SEQ ID NO: 44                          SEQ ID NO: 4         SEQ ID NO: 87
MAD5          SEQ ID NO: 25             SEQ ID NO: 45                          SEQ ID NO: 5         SEQ ID NO: 88
MAD6          SEQ ID NO: 26             SEQ ID NO: 46                          SEQ ID NO: 6         SEQ ID NO: 89
MAD7          SEQ ID NO: 27             SEQ ID NO: 47                          SEQ ID NO: 7         SEQ ID NO: 90
MAD8          SEQ ID NO: 28             SEQ ID NO: 48                          SEQ ID NO: 8         SEQ ID NO: 91
MAD9          SEQ ID NO: 29             SEQ ID NO: 49                          SEQ ID NO: 9         SEQ ID NO: 92; SEQ ID NO: 103; SEQ ID NO: 106
MAD10         SEQ ID NO: 30             SEQ ID NO: 50                          SEQ ID NO: 10        SEQ ID NO: 93
MAD11         SEQ ID NO: 31             SEQ ID NO: 51                          SEQ ID NO: 11        SEQ ID NO: 94
MAD12         SEQ ID NO: 32             SEQ ID NO: 52                          SEQ ID NO: 12        SEQ ID NO: 95
MAD13         SEQ ID NO: 33             SEQ ID NO: 53                          SEQ ID NO: 13        SEQ ID NO: 96; SEQ ID NO: 105; SEQ ID NO: 107
MAD14         SEQ ID NO: 34             SEQ ID NO: 54                          SEQ ID NO: 14        SEQ ID NO: 97
MAD15         SEQ ID NO: 35             SEQ ID NO: 55                          SEQ ID NO: 15        SEQ ID NO: 98
MAD16         SEQ ID NO: 36             SEQ ID NO: 56                          SEQ ID NO: 16        SEQ ID NO: 99
MAD17         SEQ ID NO: 37             SEQ ID NO: 57                          SEQ ID NO: 17        SEQ ID NO: 100
MAD18         SEQ ID NO: 38             SEQ ID NO: 58                          SEQ ID NO: 18        SEQ ID NO: 101
MAD19         SEQ ID NO: 39             SEQ ID NO: 59                          SEQ ID NO: 19        SEQ ID NO: 102
MAD20         SEQ ID NO: 40             SEQ ID NO: 60                          SEQ ID NO: 20        SEQ ID NO: 103









Editing cassettes as depicted in FIG. 3 were generated to assess the functionality of the MAD nucleases and corresponding guide nucleic acids. Each editing cassette comprises an editing sequence and a promoter operably linked to an encoded guide nucleic acid. The editing cassettes further comprise primer sites (P1 and P2) on flanking ends. The guide nucleic acids comprised various scaffold sequences to be tested, as well as a guide sequence to guide the MAD nuclease to the target sequence for editing. The editing sequences comprised a PAM mutation and/or codon mutation relative to the target sequence. The mutations were flanked by regions of homology (homology arms or HA) which would allow recombination into the cleaved target sequence.



FIG. 4 depicts an experimental design for testing different MAD nuclease and guide nucleic acid combinations. An expression cassette encoding the MAD nuclease, or the MAD nuclease protein itself, was added to host cells along with various editing cassettes as described above. In this example, the guide nucleic acids were engineered to target the galK gene in the host cell, and the editing sequence was designed to mutate the targeted galK gene in order to turn the gene off, thereby allowing for screening of successfully edited cells. This design was used for identification of functional or compatible MAD nuclease and guide nucleic acid combinations. Editing efficiency was determined by qPCR to measure the editing plasmid in the recovered cells in a high-throughput manner. Validation of MAD11 and Cas9 primers is shown in FIGS. 14A and 14B. These results show that the selected primer pairs are orthogonal and allow quantitative measurement of input plasmid DNA.



FIGS. 5A-5B depict a similar experimental design. In this case, the editing cassette (FIG. 5B) further comprises a selectable marker, in this case kanamycin resistance (kan), and the MAD nuclease expression vector (FIG. 5A) further comprises a selectable marker, in this case chloramphenicol resistance (Cm), and the lambda RED recombination system to aid homologous recombination (HR) of the editing sequence into the target sequence. A compatible MAD nuclease and guide nucleic acid combination will cause a double strand break in the target sequence if a PAM sequence is present. Since the editing sequence (e.g., FIG. 3) contains a PAM mutation that is not recognized by the MAD nuclease, edited cells that contain the PAM mutation survive cleavage by the MAD nuclease, while wild-type non-edited cells die (FIG. 5C). The editing sequence further comprises a mutation in the galK gene that allows for screening of edited cells, while the MAD nuclease expression vector and editing cassette contain drug selection markers, allowing for selection of edited cells.


Using these methods, compatible guide nucleic acids for MAD1-MAD20 were tested. Twenty scaffold sequences were tested. The guide nucleic acids used in the experiments contained one of the twenty scaffold sequences, referred to as scaffold-1, scaffold-2, etc., and a guide sequence that targets the galK gene. Sequences for Scaffold-1 through Scaffold-20 are listed as SEQ ID NO: 84-103, respectively. It should be understood that the guide sequence of the guide nucleic acid is variable and can be engineered or designed to target any desired target sequence. Since MAD2 does not have an endogenous scaffold sequence to test, a scaffold sequence from a close homolog (scaffold-2, SEQ ID NO: 85) was tested and found to be a non-functional pair, meaning MAD2 and scaffold-2 were not compatible. Therefore, MAD2 was tested with the other nineteen scaffold sequences, despite the low sequence homology between MAD2 and the other MAD nucleases.


This workflow could also be used to identify or test PAM sequences compatible with a given MAD nuclease. Another method for identifying a PAM site is described in the next example.


In general, for the assays described, transformations were carried out as follows. E. coli strains expressing the codon optimized MAD nucleases were grown overnight. Saturated cultures were diluted 1/100, grown to an OD600 of 0.6, and induced by adding arabinose to a final concentration of 0.4% and (if a temperature sensitive plasmid was used) shifting the culture to 42 degrees Celsius in a shaking water bath. Following induction, cells were chilled on ice for 15 min prior to washing thrice with 10% glycerol at ¼ the initial culture volume (for example, 50 mL washes for a 200 mL culture). Cells were resuspended in 1/100 the initial volume (for example, 2 mL for a 200 mL culture) and stored at −90 degrees Celsius until ready to use. To perform the compatibility and editing efficiency screens described here, 50 ng of editing cassette was transformed into cell aliquots by electroporation. Following electroporation, the cells were recovered in LB for 3 hours and 100 μL of cells were plated on MacConkey plates containing 1% galactose.


Editing efficiencies were determined by dividing the number of white colonies (edited cells) by the total number of white and red colonies (edited and non-edited cells).
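
The colony-count calculation above can be written as a short sketch. The function name `editing_efficiency` and the colony counts in the usage line are hypothetical and are not data from these experiments.

```python
def editing_efficiency(white_colonies, red_colonies):
    """Fraction of colonies that are edited.

    White colonies are scored as edited (galK disrupted) and red colonies as
    non-edited, following the MacConkey/galactose screen described above.
    """
    if white_colonies < 0 or red_colonies < 0:
        raise ValueError("colony counts must be non-negative")
    total = white_colonies + red_colonies
    if total == 0:
        raise ValueError("no colonies were counted")
    return white_colonies / total


# Hypothetical plate: 49 white and 1 red colony.
print(f"{editing_efficiency(49, 1):.0%}")
```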


Example 4. PAM Selection Assay

In order to generate a double strand break in a target sequence, a guide nucleic acid must hybridize to a target sequence, and the MAD nuclease must recognize a PAM sequence adjacent to the target sequence. If the guide nucleic acid hybridizes to the target sequence, but the MAD nuclease does not recognize a PAM site, then cleavage does not occur.


A PAM is MAD nuclease-specific and not all MAD nucleases necessarily recognize the same PAM. In order to assess the PAM site requirements for the MAD nucleases, an assay as depicted in FIGS. 6A-6C was performed.



FIG. 6A depicts a MAD nuclease expression vector as described elsewhere, which also contains a chloramphenicol resistance gene and the lambda RED recombination system.



FIG. 6B depicts a self-targeting editing cassette. The guide nucleic acid is designed to target the target sequence which is contained on the same nucleic acid molecule. The target sequence is flanked by random nucleotides, depicted by N4, meaning four random nucleotides on either end of the target sequence. It should be understood that any number of random nucleotides could also be used (for example, 3, 5, 6, 7, 8, etc.). The random nucleotides serve as a library of potential PAMs.



FIG. 6C depicts the experimental design. The MAD nuclease expression vector and the editing cassette comprising the random PAM sites were transformed into a host cell. If a functional targetable nuclease complex was formed and the MAD nuclease recognized a PAM site, then the editing cassette vector was cleaved, which led to cell death. If a functional targetable complex was not formed or if the MAD nuclease did not recognize the PAM, then the target sequence was not cleaved and the cell survived. Next-generation sequencing (NGS) was then used to sequence the starting and final cell populations in order to determine which PAM sites were recognized by a given MAD nuclease. These recognized PAM sites were then used to determine a consensus or non-consensus PAM for a given MAD nuclease.
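
One way to compare the starting and final populations is to look for PAM sequences that are depleted after selection, since cells carrying a recognized PAM are cleaved and lost. The sketch below is an illustration only: the pseudocount, the log2 threshold of −2, the function names, and the read counts are all assumptions and do not describe the analysis actually used.

```python
import math


def pam_depletion(start_counts, final_counts, pseudocount=1.0):
    """log2 change in each PAM's read frequency, final versus start.

    Strongly negative values mean the PAM was lost during selection, which is
    the signature expected for a PAM recognized by the nuclease.
    """
    pams = set(start_counts) | set(final_counts)
    n = len(pams)
    start_total = sum(start_counts.values()) + pseudocount * n
    final_total = sum(final_counts.values()) + pseudocount * n
    scores = {}
    for pam in pams:
        f_start = (start_counts.get(pam, 0) + pseudocount) / start_total
        f_final = (final_counts.get(pam, 0) + pseudocount) / final_total
        scores[pam] = math.log2(f_final / f_start)
    return scores


def recognized_pams(scores, threshold=-2.0):
    """PAMs whose depletion score is at or below the (assumed) threshold."""
    return sorted(pam for pam, score in scores.items() if score <= threshold)


# Hypothetical read counts for four members of an N4 library.
start = {"TTTA": 1000, "TTTC": 1000, "GGGA": 1000, "CCCA": 1000}
final = {"TTTA": 10, "TTTC": 20, "GGGA": 1500, "CCCA": 1400}
print(recognized_pams(pam_depletion(start, final)))
```

The depleted sequences returned by such a comparison would then be aligned or tallied by position to derive a consensus PAM.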


The consensus PAM for MAD1-MAD8, and MAD10-MAD12 was determined to be TTTN. The consensus PAM for MAD9 was determined to be NNG. The consensus PAM for MAD13-MAD15 was determined to be TTN. The consensus PAM for MAD16-MAD18 was determined to be TA. The consensus PAM for MAD19-MAD20 was determined to be TTCN.
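
The consensus PAMs above use N to denote any nucleotide. A minimal sketch of checking a candidate site against those consensus patterns follows; the dictionary and function names are hypothetical, only the symbols A, C, G, T and N are handled, and the orientation and position of the PAM relative to the target sequence are not modeled.

```python
# Nucleotides allowed by each pattern symbol (subset of the IUPAC codes).
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "N": "ACGT"}

# Consensus PAMs as stated in the text above.
CONSENSUS_PAM = {}
for i in list(range(1, 9)) + [10, 11, 12]:
    CONSENSUS_PAM[f"MAD{i}"] = "TTTN"
CONSENSUS_PAM["MAD9"] = "NNG"
for i in (13, 14, 15):
    CONSENSUS_PAM[f"MAD{i}"] = "TTN"
for i in (16, 17, 18):
    CONSENSUS_PAM[f"MAD{i}"] = "TA"
for i in (19, 20):
    CONSENSUS_PAM[f"MAD{i}"] = "TTCN"


def matches_pam(site, pattern):
    """True if `site` has the pattern's length and fits it base by base."""
    if len(site) != len(pattern):
        return False
    return all(
        base in IUPAC[symbol]
        for base, symbol in zip(site.upper(), pattern.upper())
    )


print(matches_pam("TTTA", CONSENSUS_PAM["MAD7"]))  # fits TTTN
print(matches_pam("TTCA", CONSENSUS_PAM["MAD7"]))  # third base is not T
```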


Example 5: Testing Heterologous Guide Nucleic Acids

Editing efficiencies were tested for MAD1, MAD2, MAD4, and MAD7 and are depicted in FIG. 7A and FIG. 7B. Experiment details and editing efficiencies are summarized in Table 3. Editing efficiency was determined by dividing the number of edited cells by the total number of recovered cells. Various editing cassettes targeting the galK gene were used to allow screening of edited cells. The guide nucleic acids encoded on the editing cassette contained a guide sequence targeting the galK gene and one of various scaffold sequences in order to test the compatibility of the indicated MAD nuclease with the indicated scaffold sequence, as summarized in Table 3.


Compatible MAD nuclease and guide nucleic acid combinations (comprising the indicated scaffold sequences) were observed to have editing efficiencies of 75-100%. MAD2 had an editing efficiency of 75-100% and MAD7 had an editing efficiency of 97-100%.


MAD2 combined with scaffold-1, scaffold-2, scaffold-4, or scaffold-13 in these experiments resulted in 0% editing efficiency. These data imply that MAD2 did not form a functional complex with these tested guide nucleic acids and that MAD2 is not compatible with these scaffold sequences.


MAD7 combined with scaffold-1, scaffold-2, scaffold-4, or scaffold-13 in these experiments resulted in 0% editing efficiency. These data imply that MAD7 did not form a functional complex with these tested guide nucleic acids and that MAD7 is not compatible with these scaffold sequences.


For MAD1 and MAD4, all tested guide nucleic acid combinations resulted in 0% editing efficiency, implying that MAD1 and MAD4 did not form a functional complex with any of the tested guide nucleic acids. These data also imply that MAD1 and MAD4 are not compatible with the tested scaffold sequences.


Combined, these data highlight the unpredictability of finding a compatible MAD nuclease and scaffold sequence pair in order to form a functional targetable nuclease complex. Some tested MAD nucleases did not function with any tested scaffold sequence. Some tested MAD nucleases only functioned with some tested scaffold sequences and not with others.
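
The per-scaffold pattern can be tallied directly from the MAD2 entries of Table 3 (rows 9-23). The sketch below treats a scaffold as compatible if any tested edit gave a non-zero editing efficiency; that criterion and all names in the code are assumptions made for illustration, not a definition from this disclosure.

```python
from collections import defaultdict

# (scaffold, editing sequence mutation, editing efficiency in %)
# MAD2 entries copied from Table 3, rows 9-23.
MAD2_ROWS = [
    ("Scaffold-10", "L80**", 0), ("Scaffold-10", "Y145**", 100),
    ("Scaffold-11", "L80**", 98), ("Scaffold-11", "Y145**", 99),
    ("Scaffold-12", "Y145**", 98), ("Scaffold-12", "Y145**", 0),
    ("Scaffold-13", "Y145**", 0), ("Scaffold-1", "L80**", 0),
    ("Scaffold-2", "Y145**", 0), ("Scaffold-2", "Y145**", 0),
    ("Scaffold-4", "Y145**", 0), ("Scaffold-5", "L80**", 99),
    ("Scaffold-12", "89**", 0), ("Scaffold-12", "70**", 75),
    ("Scaffold-12", "L10KpnI", 79),
]

# Collect every efficiency observed for each scaffold.
by_scaffold = defaultdict(list)
for scaffold, _mutation, efficiency in MAD2_ROWS:
    by_scaffold[scaffold].append(efficiency)

# Assumed criterion: compatible if at least one edit worked.
compatible = {s for s, effs in by_scaffold.items() if max(effs) > 0}
incompatible = set(by_scaffold) - compatible

print(sorted(compatible))
print(sorted(incompatible))
```

Under this criterion the scaffolds with no editing in any row are scaffold-1, scaffold-2, scaffold-4 and scaffold-13, matching the MAD2 statement above, while some compatible scaffolds still show 0% for individual edits.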














TABLE 3

#   Nucleic acid-guided nuclease  Guide nucleic acid scaffold sequence  Editing sequence mutation  Target gene  Editing efficiency
1   MAD1                          Scaffold-1; SEQ ID NO: 84             L80**                      galK         0%
2   MAD1                          Scaffold-2; SEQ ID NO: 85             Y145**                     galK         0%
3   MAD1                          Scaffold-4; SEQ ID NO: 87             Y145**                     galK         0%
4   MAD1                          Scaffold-10; SEQ ID NO: 93            Y145**                     galK         0%
5   MAD1                          Scaffold-11; SEQ ID NO: 94            L80**                      galK         0%
6   MAD1                          Scaffold-12; SEQ ID NO: 95            L10KpnI                    galK         0%
7   MAD1                          Scaffold-13; SEQ ID NO: 96            Y145**                     galK         0%
8   MAD1                          Scaffold-12; SEQ ID NO: 95            L10KpnI                    galK         0%
9   MAD2                          Scaffold-10; SEQ ID NO: 93            L80**                      galK         0%
10  MAD2                          Scaffold-10; SEQ ID NO: 93            Y145**                     galK         100%
11  MAD2                          Scaffold-11; SEQ ID NO: 94            L80**                      galK         98%
12  MAD2                          Scaffold-11; SEQ ID NO: 94            Y145**                     galK         99%
13  MAD2                          Scaffold-12; SEQ ID NO: 95            Y145**                     galK         98%
14  MAD2                          Scaffold-12; SEQ ID NO: 95            Y145**                     galK         0%
15  MAD2                          Scaffold-13; SEQ ID NO: 96            Y145**                     galK         0%
16  MAD2                          Scaffold-1; SEQ ID NO: 84             L80**                      galK         0%
17  MAD2                          Scaffold-2; SEQ ID NO: 85             Y145**                     galK         0%
18  MAD2                          Scaffold-2; SEQ ID NO: 85             Y145**                     galK         0%
19  MAD2                          Scaffold-4; SEQ ID NO: 87             Y145**                     galK         0%
20  MAD2                          Scaffold-5; SEQ ID NO: 88             L80**                      galK         99%
21  MAD2                          Scaffold-12; SEQ ID NO: 95            89**                       galK         0%
22  MAD2                          Scaffold-12; SEQ ID NO: 95            70**                       galK         75%
23  MAD2                          Scaffold-12; SEQ ID NO: 95            L10KpnI                    galK         79%
24  MAD4                          Scaffold-1; SEQ ID NO: 84             L80**                      galK         0%
25  MAD4                          Scaffold-2; SEQ ID NO: 85             Y145**                     galK         0%
26  MAD4                          Scaffold-4; SEQ ID NO: 87             Y145**                     galK         0%
27  MAD4                          Scaffold-10; SEQ ID NO: 93            Y145**                     galK         0%
28  MAD4                          Scaffold-11; SEQ ID NO: 94            L80**                      galK         0%
29  MAD4                          Scaffold-12; SEQ ID NO: 95            L10KpnI                    galK         0%
30  MAD4                          Scaffold-13; SEQ ID NO: 96            Y145**                     galK         0%
31  MAD4                          Scaffold-12; SEQ ID NO: 95            L10KpnI                    galK         0%
32  MAD7                          Scaffold-1; SEQ ID NO: 84             L80**                      galK         0%
33  MAD7                          Scaffold-2; SEQ ID NO: 85             Y145**                     galK         0%
34  MAD7                          Scaffold-4; SEQ ID NO: 87             Y145**                     galK         0%
35  MAD7                          Scaffold-10; SEQ ID NO: 93            Y145**                     galK         100%
36  MAD7                          Scaffold-11; SEQ ID NO: 94            L80**                      galK         97%
37  MAD7                          Scaffold-12; SEQ ID NO: 95            L10KpnI                    galK         0%
38  MAD7                          Scaffold-13; SEQ ID NO: 96            Y145**                     galK         0%
39  MAD7                          Scaffold-12; SEQ ID NO: 95            L10KpnI                    galK         0%









Example 6. Assessment of MAD2 and MAD7

The ability of MAD2 and MAD7 to function with heterologous guide nucleic acids was tested using an experimental design similar to that described above.


The compatibility of MAD2 with other scaffold sequences was tested and the results of an experiment are depicted in FIG. 8. The MAD nucleases, guide nucleic acid scaffold sequences, and editing sequences used in this experiment are summarized in Table 4.


The compatibility of MAD7 with other scaffold sequences was tested and the results of an experiment are depicted in FIG. 9. The MAD nucleases, guide nucleic acid scaffold sequences, and editing sequences used in this experiment are summarized in Table 5.













TABLE 4

#   Nucleic acid-guided nuclease  Guide nucleic acid scaffold sequence  Editing sequence mutation  Target gene
1   MAD2                          Scaffold-12; SEQ ID NO: 95            N89KpnI                    galK
2   MAD2                          Scaffold-10; SEQ ID NO: 93            L80**                      galK
3   MAD2                          Scaffold-5; SEQ ID NO: 88             L80**                      galK
4   MAD2                          Scaffold-12; SEQ ID NO: 95            D70KpnI                    galK
5   MAD2                          Scaffold-12; SEQ ID NO: 95            Y145**                     galK
6   MAD2                          Scaffold-11; SEQ ID NO: 94            Y145**                     galK
7   MAD2                          Scaffold-10; SEQ ID NO: 93            Y145**                     galK
8   MAD2                          Scaffold-12; SEQ ID NO: 95            L10KpnI                    galK
9   MAD2                          Scaffold-11; SEQ ID NO: 94            L80**                      galK
10  SpCas9                        S. pyogenes gRNA                      Y145**                     galK
11  MAD2                          Scaffold-2; SEQ ID NO: 85             Y145**                     galK
12  MAD2                          Scaffold-4; SEQ ID NO: 87             Y145**                     galK
13  MAD2                          Scaffold-1; SEQ ID NO: 84             L80**                      galK
14  MAD2                          Scaffold-13; SEQ ID NO: 96            Y145**                     galK




















TABLE 5

# | Nucleic acid-guided nuclease | Guide nucleic acid scaffold sequence | Editing sequence mutation | Target gene
1 | MAD7 | Scaffold-1; SEQ ID NO: 84 | L80** | galK
2 | MAD7 | Scaffold-2; SEQ ID NO: 85 | Y145** | galK
3 | MAD7 | Scaffold-4; SEQ ID NO: 87 | Y145** | galK
4 | MAD7 | Scaffold-10; SEQ ID NO: 93 | Y145** | galK
5 | MAD7 | Scaffold-11; SEQ ID NO: 94 | L80** | galK









In another experiment, transformation efficiencies (FIG. 10B) were determined by calculating the total number of recovered cells compared to the starting number of cells. An example plate image is depicted in FIG. 10C. Editing efficiencies (FIG. 10A) were determined by calculating the ratio of edited colonies (white colonies, edited galK gene) to total colonies.


In this example (FIG. 10A-10C), cells expressing galK were transformed with expression constructs expressing either MAD2 or MAD7 and a corresponding editing cassette comprising a guide nucleic acid targeting the galK gene. The guide nucleic acid was comprised of a guide sequence targeting the galK gene and the scaffold-12 sequence (SEQ ID NO: 95).


In the depicted example, MAD2 and MAD7 had a lower transformation efficiency compared to S. pyogenes Cas9, though the editing efficiency of MAD2 and MAD7 was slightly higher than that of S. pyogenes Cas9.
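As a rough illustration of the two ratios described for FIGS. 10A and 10B, the following Python sketch computes an editing efficiency from colony counts and a transformation efficiency from cell counts. The function names and the example counts are hypothetical and are not taken from the depicted data.

```python
def editing_efficiency(white_colonies: int, total_colonies: int) -> float:
    """Fraction of colonies scored as edited (white) out of all colonies counted."""
    if total_colonies <= 0:
        raise ValueError("total_colonies must be positive")
    if not 0 <= white_colonies <= total_colonies:
        raise ValueError("white_colonies must be between 0 and total_colonies")
    return white_colonies / total_colonies


def transformation_efficiency(recovered_cells: float, starting_cells: float) -> float:
    """Recovered cells relative to the number of cells at the start."""
    if starting_cells <= 0:
        raise ValueError("starting_cells must be positive")
    if recovered_cells < 0:
        raise ValueError("recovered_cells must be non-negative")
    return recovered_cells / starting_cells


# Hypothetical counts, for illustration only.
print(f"editing efficiency: {editing_efficiency(90, 100):.0%}")
print(f"transformation efficiency: {transformation_efficiency(5e4, 1e8):.1e}")
```

Under these definitions, a nuclease can score lower on transformation efficiency and still score higher on editing efficiency, since the two ratios use different denominators.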



FIG. 11 depicts the sequencing results from select colonies recovered from the assay described above. The target sequence was in the galK coding sequence (CDS). The TTTN PAM is shown as the reverse complement (wild-type NAAA, mutated NGAA). The mutations targeted by the editing sequence are labeled as target codons. Changes compared to the wild-type sequence are highlighted. In these experiments, the scaffold-12 sequence (SEQ ID NO: 95) was used. The guide sequence of the guide nucleic acid targeted the galK gene.


Six of the seven depicted sequences from the MAD2 experiment contained the designed PAM mutation and designed mutations in the target codons of galK. The remaining sequenced colony maintained the wild-type PAM and wild-type target codons but also contained an unintended mutation upstream of the target site.


Two of the four depicted sequences from the MAD7 experiment contained the designed PAM mutation and mutated target codons. One colony comprised a wild-type sequence, while another contained a deletion of eight nucleotides upstream of the target sequence.
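The PAM notation used for FIG. 11 (TTTN shown as its reverse complement NAAA, with the mutated form shown as NGAA) can be checked with a short Python sketch. The helper names below are hypothetical, and N is treated simply as a wildcard matching any base.

```python
# Complement table for DNA; N (any base) maps to N.
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def matches_pam(site: str, pam: str = "TTTN") -> bool:
    """True if `site` matches `pam`, where N in the PAM matches any base."""
    site = site.upper()
    return len(site) == len(pam) and all(
        p == "N" or p == s for p, s in zip(pam, site)
    )


# A site displayed as CAAA on the depicted strand reads TTTG on the other strand.
wild_type_displayed = "CAAA"
mutated_displayed = "CGAA"
print(reverse_complement("TTTN"))                            # PAM as displayed
print(matches_pam(reverse_complement(wild_type_displayed)))  # wild-type site
print(matches_pam(reverse_complement(mutated_displayed)))    # mutated site
```

In this sketch the mutated site no longer matches TTTN once read on the PAM strand, which is the property the designed PAM mutation relies on.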



FIG. 12 depicts results from another experiment testing the ability to recover edited cells. In Experiment 0, the MAD2 nuclease was used with a guide nucleic acid comprising scaffold-11 sequence and a guide sequence targeting galK. The editing cassette comprised an editing sequence designed to incorporate an L80** mutation into galK, thereby allowing screening of the edited cells. In Experiment 1, the MAD2 nuclease was used with a guide nucleic acid comprising scaffold-12 sequence and a guide sequence targeting galK. The editing cassette comprised an editing sequence designed to incorporate an L10KpnI mutation into galK. In both experiments, a negative control plasmid comprising a guide nucleic acid that is not compatible with MAD2 was included in the transformations. Following transformation, the ratio of the compatible editing cassette (those containing scaffold-11 or scaffold-12 guide nucleic acids) to the non-compatible editing cassette (negative control) was measured. The experiments were done in the presence or absence of selection. The results show that more cells containing the compatible editing cassette were recovered than cells containing the non-compatible editing cassette, and this difference is magnified when selection is used.


Example 7. Guide Nucleic Acid Characterization

The sequences of scaffolds 1-8, and 10-12 (SEQ ID NO: 84-91, and 93-95) were aligned and are depicted in FIG. 13A. Nucleotides that match the consensus sequence are faded, while those diverging from the consensus sequence are visible. The predicted pseudoknot region is indicated. Without being bound by theory, the region 5′ of the pseudoknot may influence binding and/or kinetics of the nucleic acid-guided nuclease. As is shown in FIG. 13A, in general, there appears to be less variability in the pseudoknot region (e.g., SEQ ID NO: 172-181) as compared to the sequence outside of the pseudoknot region.
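The kind of display described for FIG. 13A, in which consensus-matching positions are faded and diverging positions stay visible, can be approximated with a small Python sketch. It assumes the sequences are already aligned to equal length. The three toy strings are hypothetical and are not the scaffold sequences of SEQ ID NO: 84-95.

```python
from collections import Counter


def consensus(aligned: list[str]) -> str:
    """Most common character in each column of an equal-length alignment."""
    if len({len(s) for s in aligned}) != 1:
        raise ValueError("all aligned sequences must have the same length")
    return "".join(Counter(col).most_common(1)[0][0] for col in zip(*aligned))


def mask_matches(aligned: list[str], fade: str = ".") -> list[str]:
    """Replace consensus-matching positions with `fade`; keep diverging ones."""
    cons = consensus(aligned)
    return [
        "".join(fade if base == ref else base for base, ref in zip(seq, cons))
        for seq in aligned
    ]


def variable_positions(aligned: list[str]) -> list[int]:
    """Zero-based column indices where the sequences do not all agree."""
    return [i for i, col in enumerate(zip(*aligned)) if len(set(col)) > 1]


# Toy pre-aligned sequences (hypothetical).
toy = ["ACGUACGU", "ACGAACGU", "UCGUACGU"]
print(consensus(toy))
for row in mask_matches(toy):
    print(row)
print(variable_positions(toy))
```

Counting variable positions inside and outside a region of interest gives one simple way to compare variability between, for example, a pseudoknot region and its flanks.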



FIG. 13B shows a preliminary model of MAD2 and MAD12 complexed with a guide nucleic acid (in this example, a guide RNA) and target sequence (DNA).


Example 8. Editing Efficiency of the MAD Nucleases

A plate-based editing efficiency assay and a molecular editing efficiency assay were used to test editing efficiency of various MAD nuclease and guide nucleic acid combinations.



FIG. 15 depicts quantification of the data obtained using the molecular editing efficiency assay using MAD2 nuclease with a guide nucleic acid comprising scaffold-12 and a guide sequence targeting galK. The indicated mutations were incorporated into galK using corresponding editing cassettes containing the mutation. FIG. 16 shows the comparison of the editing efficiencies determined by the plate-based assay using white and red colonies as described previously, and the molecular editing efficiency assay. As shown in FIG. 16, the editing efficiencies as determined by the two separate assays are consistent.


Example 9. Trackable Editing

Genetic edits can be tracked by the use of a barcode. A barcode can be incorporated into or near the edit site as described in the present specification. When multiple rounds of engineering are being performed, with a different edit being made in each round, it may be beneficial to insert a barcode in a common region during each round of engineering. This way, one could sequence a single site and get the sequences of all of the barcodes from each round without the need to sequence each edited site individually. FIGS. 17A-17C, 18, and 19 depict examples of such trackable engineering workflows.


As depicted in FIG. 17A, a cell expressing a MAD nuclease is transformed with a plasmid containing an editing cassette and a recording cassette. The editing cassette contains a PAM mutation and a gene edit. The recorder cassette comprises a barcode, in this case 15N. The editing cassette and recording cassette each comprise a guide nucleic acid to a distinct target sequence. Within a library of such plasmids, the recorder cassette for each round can contain the same guide nucleic acid, such that the first round barcode is inserted into the same location across all variants, regardless of what editing cassette and corresponding gene edit is used. The correlation between the barcode and editing cassette is determined beforehand, however, such that the edit can be identified by sequencing the barcode. FIG. 17B shows an example of a recording cassette designed to delete a PAM site while incorporating a 15N barcode. The deleted PAM is used to enrich for edited cells since mutated PAM cells escape cell death while cells containing a wild-type PAM sequence are killed. FIG. 17C depicts how sequencing the barcode region can be used to identify which edit is comprised within each cell.
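The barcode-to-edit lookup described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions: barcodes are random 15-nucleotide strings assigned before the experiment, and each sequencing read contains the barcode between two known flanking sequences. The flank sequences, function names, and edit labels in the example are hypothetical.

```python
import random

BASES = "ACGT"


def make_barcode(rng: random.Random, length: int = 15) -> str:
    """Draw one random barcode of the given length (15N by default)."""
    return "".join(rng.choice(BASES) for _ in range(length))


def assign_barcodes(edits: list[str], seed: int = 0, length: int = 15) -> dict[str, str]:
    """Fix the barcode -> edit correlation beforehand, one unique barcode per edit."""
    rng = random.Random(seed)
    barcode_to_edit: dict[str, str] = {}
    for edit in edits:
        barcode = make_barcode(rng, length)
        while barcode in barcode_to_edit:  # redraw on the rare collision
            barcode = make_barcode(rng, length)
        barcode_to_edit[barcode] = edit
    return barcode_to_edit


def identify_edit(read: str, left_flank: str, right_flank: str,
                  barcode_to_edit: dict[str, str], length: int = 15):
    """Extract the barcode between the flanks and look up its edit, else None."""
    start = read.find(left_flank)
    if start == -1:
        return None
    start += len(left_flank)
    barcode = read[start:start + length]
    if len(barcode) != length or not read.startswith(right_flank, start + length):
        return None
    return barcode_to_edit.get(barcode)


# Hypothetical library of two edits and one simulated read.
table = assign_barcodes(["L80**", "Y145**"])
some_barcode = next(iter(table))
read = "GG" + "ACCTGA" + some_barcode + "TTGACC" + "AA"
print(identify_edit(read, "ACCTGA", "TTGACC", table))
```

Because the lookup table is fixed before transformation, only the barcode site needs to be sequenced to infer which editing cassette a cell received.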


A similar approach is depicted in FIG. 18. In this case, the recorder cassette from each round is designed to target a sequence adjacent to the previous round, and each time, a new PAM site is deleted by the recorder cassette. The result is a barcode array with the barcodes from each round that can be sequenced to confirm each round of engineering took place and to determine which combination of mutations are contained in the cell, and in which order the mutations were made. Each successive recorder cassette can be designed to be homologous on one end to the region comprising the mutated PAM from the previous round, which could increase the efficiency of getting fully edited cells at the end of the experiment. In other examples, the recorder cassette is designed to target a unique landing site that was incorporated by the previous recorder cassette. This increases the efficiency of recovering cells containing all of the desired mutations since the subsequent recorder cassette and barcode can only target a cell that has successfully completed the previous round of engineering.
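Reading such a barcode array can be sketched as below. This Python example makes simplifying assumptions that are not stated in the specification: barcodes from successive rounds sit directly next to each other with no spacer, all barcodes share one length, and each round has its own predetermined barcode-to-edit table. The short 4-nucleotide barcodes and edit labels are hypothetical and chosen only for readability.

```python
def decode_barcode_array(array_seq: str, round_tables: list[dict[str, str]],
                         barcode_len: int = 15) -> list:
    """Split a barcode array into per-round barcodes and look up each edit.

    Returns one entry per round, in round order; an entry is None when the
    barcode at that position is not in that round's table.
    """
    expected = barcode_len * len(round_tables)
    if len(array_seq) != expected:
        raise ValueError(f"expected array of length {expected}, got {len(array_seq)}")
    edits = []
    for i, table in enumerate(round_tables):
        barcode = array_seq[i * barcode_len:(i + 1) * barcode_len]
        edits.append(table.get(barcode))
    return edits


# Hypothetical two-round experiment with 4-nt barcodes.
round1 = {"AAAC": "edit_A", "AAAG": "edit_B"}
round2 = {"CCCA": "edit_C", "CCCT": "edit_D"}
print(decode_barcode_array("AAAGCCCA", [round1, round2], barcode_len=4))
```

A None entry in the result flags a round whose barcode is missing or unrecognized, which is one way to detect cells that did not complete every round of engineering.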



FIG. 19 depicts another approach that allows the recycling of selectable markers or otherwise cures the cell of the plasmid from the previous round of engineering. In this case, the transformed plasmid contains a guide nucleic acid designed to target a selectable marker or other unique sequence in the plasmid from the previous round of engineering.









TABLE 6







SEQUENCE LISTING








SEQ



ID



NO:
Sequence





SEQ
MGKMYYLGLDIGTNSVGYAVTDPSYHLLKFKGEPMWGAHVFAAGNQSAERRSFRTSRRRLDRRQQRV


ID
KLVQEIFAPVISPIDPRFFIRLHESALWRDDVAETDKHIFFNDPTYTDKEYYSDYPTIHHLIVDLMESSEKH


NO:
DPRLVYLAVAWLVAHRGHFLNEVDKDNIGDVLSFDAFYPEFLAFLSDNGVSPWVCESKALQATLLSRN


1
SVNDKYKALKSLIFGSQKPEDNFDANISEDGLIQLLAGKKVKVNKLFPQESNDASFTLNDKEDAIEEILGT



LTPDECEWIAHIRRLFDWAIMKHALKDGRTISESKVKLYEQHHHDLTQLKYFVKTYLAKEYDDIFRNVD



SETTKNYVAYSYHVKEVKGTLPKNKATQEEFCKYVLGKVKNIECSEADKVDFDEMIQRLTDNSFMPKQ



VSGENRVIPYQLYYYELKTILNKAASYLPFLTQCGKDAISNQDKLLSIMTFRIPYFVGPLRKDNSEHAWL



ERKAGKIYPWNFNDKVDLDKSEEAFIRRMTNTCTYYPGEDVLPLDSLIYEKFMILNEINNIRIDGYPISVD



VKQQVFGLFEKKRRVTVKDIQNLLLSLGALDKHGKLTGIDTTIHSNYNTYHHFKSLMERGVLTRDDVER



IVERMTYSDDTKRVRLWLNNNYGTLTADDVKHISRLRKHDFGRLSKMFLTGLKGVHKETGERASILDF



MWNTNDNLMQLLSECYTFSDEITKLQEAYYAKAQLSLNDFLDSMYISNAVKRPIYRTLAVVNDIRKAC



GTAPKRIFIEMARDGESKKKRSVTRREQIKNLYRSIRKDFQQEVDFLEKILENKSDGQLQSDALYLYFAQ



LGRDMYTGDPIKLEHIKDQSFYNIDHIYPQSMVKDDSLDNKVLVQSEINGEKSSRYPLDAAIRNKMKPL



WDAYYNHGLISLKKYQRLTRSTPFTDDEKWDFINRQLVETRQSTKALAILLKRKFPDTEIVYSKAGLSSD



FRHEFGLVKSRNINDLHHAKDAFLAIVTGNVYHERFNRRWFMVNQPYSVKTKTLFTHSIKNGNFVAWN



GEEDLGRIVKMLKQNKNTIHFTRFSFDRKEGLFDIQPLKASTGLVPRKAGLDVVKYGGYDKSTAAYYLL



VRFTLEDKKTQHKLMMIPVEGLYKARIDHDKEFLTDYAQTTISEILQKDKQKVINIMFPMGTRHIKLNSM



ISIDGFYLSIGGKSSKGKSVLCHAMVPLIVPHKIECYIKAMESFARKFKENNKLRIVEKFDKITVEDNLNL



YELFLQKLQHNPYNKFFSTQFDVLTNGRSTFTKLSPEEQVQTLLNILSIFKTCRSSGCDLKSINGSAQAARI



MISADLTGLSKKYSDIRLVEQSASGLFVSKSQNLLEYL*





SEQ
MSSLTKFTNKYSKQLTIKNELIPVGKTLENIKENGLIDGDEQLNENYQKAKIIVDDFLRDFINKALNNTQI


ID
GNWRELADALNKEDEDNIEKLQDKIRGIIVSKFETFDLFSSYSIKKDEKIIDDDNDVEEEELDLGKKTSSF


NO:
KYIFKKNLFKLVLPSYLKTTNQDKLKIISSFDNFSTYFRGFFENRKNIFTKKPISTSIAYRIVHDNFPKFLDN


2
IRCFNVWQTECPQLIVKADNYLKSKNVIAKDKSLANYFTVGAYDYFLSQNGIDFYNNIIGGLPAFAGHEK



IQGLNEFINQECQKDSELKSKLKNRHAFKMAVLFKQILSDREKSFVIDEFESDAQVIDAVKNFYAEQCKD



NNVIFNLLNLIKNIAFLSDDELDGIFIEGKYLSSVSQKLYSDWSKLRNDIEDSANSKQGNKELAKKIKTNK



GDVEKAISKYEFSLSELNSIVHDNTKFSDLLSCTLHKVASEKLVKVNEGDWPKHLKNNEEKQKIKEPLD



ALLEIYNTLLIFNCKSFNKNGNFYVDYDRCINELSSVVYLYNKTRNYCTKKPYNTDKFKLNFNSPQLGEG



FSKSKENDCLTLLFKKDDNYYVGIIRKGAKINFDDTQAIADNTDNCIFKMNYFLLKDAKKFIPKCSIQLKE



VKAHFKKSEDDYILSDKEKFASPLVIKKSTFLLATAHVKGKKGNIKKFQKEYSKENPTEYRNSLNEWIAF



CKEFLKTYKAATIFDITTLKKAEEYADIVEFYKDVDNLCYKLEFCPIKTSFIENLIDNGDLYLFRINNKDFS



SKSTGTKNLHTLYLQAIFDERNLNNPTIMLNGGAELFYRKESIEQKNRITHKAGSILVNKVCKDGTSLDD



KIRNEIYQYENKFIDTLSDEAKKVLPNVIKKEATHDITKDKRFTSDKFFFHCPLTINYKEGDTKQFNNEVL



SFLRGNPDINIIGIDRGERNLIYVTVINQKGEILDSVSFNTVTNKSSKIEQTVDYEEKLAVREKERIEAKRS



WDSISKIATLKEGYLSAIVHEICLLMIKHNAIVVLENLNAGFKRIRGGLSEKSVYQKFEKMLINKLNYFVS



KKESDWNKPSGLLNGLQLSDQFESFEKLGIQSGFIFYVPAAYTSKIDPTTGFANVLNLSKVRNVDAIKSFF



SNFNEISYSKKEALFKFSFDLDSLSKKGFSSFVKFSKSKWNVYTFGERIIKPKNKQGYREDKRINLTFEMK



KLLNEYKVSFDLENNLIPNLTSANLKDTFWKELFFIFKTTLQLRNSVTNGKEDVLISPVKNAKGEFFVSGT



HNKTLPQDCDANGAYHIALKGLMILERNNLVREEKDTKKIMAISNVDWFEYVQKRRGVL*





SEQ
MNNYDEFTKLYPIQKTIRFELKPQGRTMEHLETFNFFEEDRDRAEKYKILKEAIDEYHKKFIDEHLTNMS


ID
LDWNSLKQISEKYYKSREEKDKKVFLSEQKRMRQEIVSEFKKDDRFKDLFSKKLFSELLKEEIYKKGNH


NO:
QEIDALKSFDKFSGYFIGLHENRKNMYSDGDEITAISNRIVNENFPKFLDNLQKYQEARKKYPEWIIKAES


3
ALVAHNIKMDEVFSLEYFNKVLNQEGIQRYNLALGGYVTKSGEKMMGLNDALNLAHQSEKSSKGRIH



MTPLFKQILSEKESFSYIPDVFTEDSQLLPSIGGFFAQIENDKDGNIFDRALELISSYAEYDTERIYIRQADIN



RVSNVIFGEWGTLGGLMREYKADSINDINLERTCKKVDKWLDSKEFALSDVLEAIKRTGNNDAFNEYIS



KMRTAREKIDAARKEMKFISEKISGDEESIHIIKTLLDSVQQFLHFFNLFKARQDIPLDGAFYAEFDEVHS



KLFAIVPLYNKVRNYLTKNNLNTKKIKLNFKNPTLANGWDQNKVYDYASLIFLRDGNYYLGIINPKRKK



NIKFEQGSGNGPFYRKMVYKQIPGPNKNLPRVFLTSTKGKKEYKPSKEIIEGYEADKHIRGDKFDLDFCH



KLIDFFKESIEKHKDWSKFNFYFSPTESYGDISEFYLDVEKQGYRMHFENISAETIDEYVEKGDLFLFQIY



NKDFVKAATGKKDMHTIYWNAAFSPENLQDVVVKLNGEAELFYRDKSDIKEIVHREGEILVNRTYNGR



TPVPDKIHKKLTDYHNGRTKDLGEAKEYLDKVRYFKAHYDITKDRRYLNDKIYFHVPLTLNFKANGKK



NLNKMVIEKFLSDEKAHIIGIDRGERNLLYYSIIDRSGKIIDQQSLNVIDGFDYREKLNQREIEMKDARQS



WNAIGKIKDLKEGYLSKAVHEITKMAIQYNAIVVMEELNYGFKRGRFKVEKQIYQKFENMLIDKMNYL



VFKDAPDESPGGVLNAYQLTNPLESFAKLGKQTGILFYVPAAYTSKIDPTTGFVNLFNTSSKTNAQERKE



FLQKFESISYSAKDGGIFAFAFDYRKFGTSKTDHKNVWTAYTNGERMRYIKEKKRNELFDPSKEIKEALT



SSGIKYDGGQNILPDILRSNNNGLIYTMYSSFIAAIQMRVYDGKEDYIISPIKNSKGEFFRTDPKRRELPIDA



DANGAYNIALRGELTMRAIAEKFDPDSEKMAKLELKHKDWFEFMQTRGD*





SEQ
MTKTFDSEFFNLYSLQKTVRFELKPVGETASFVEDFKNEGLKRVVSEDERRAVDYQKVKEIIDDYHRDFI


ID
EESLNYFPEQVSKDALEQAFHLYQKLKAAKVEEREKALKEWEALQKKLREKVVKCFSDSNKARFSRID


NO:
KKELIKEDLINWLVAQNREDDIPTVETFNNFTTYFTGFHENRKNIYSKDDHATAISFRLIHENLPKFFDNVI


4
SFNKLKEGFPELKFDKVKEDLEVDYDLKHAFEIEYFVNFVTQAGIDQYNYLLGGKTLEDGTKKQGMNE



QINLFKQQQTRDKARQIPKLIPLFKQILSERTESQSFIPKQFESDQELFDSLQKLHNNCQDKFTVLQQAILG



LAEADLKKVFIKTSDLNALSNTIFGNYSVFSDALNLYKESLKTKKAQEAFEKLPAHSIHDLIQYLEQFNSS



LDAEKQQSTDTVLNYFIKTDELYSRFIKSTSEAFTQVQPLFELEALSSKRRPPESEDEGAKGQEGFEQIKRI



KAYLDTLMEAVHFAKPLYLVKGRKMIEGLDKDQSFYEAFEMAYQELESLIIPIYNKARSYLSRKPFKAD



KFKINFDNNTLLSGWDANKETANASILFKKDGLYYLGIMPKGKTFLFDYFVSSEDSEKLKQRRQKTAEE



ALAQDGESYFEKIRYKLLPGASKMLPKVFFSNKNIGFYNPSDDILRIRNTASHTKNGTPQKGHSKVEFNL



NDCHKMIDFFKSSIQKHPEWGSFGFTFSDTSDFEDMSAFYREVENQGYVISFDKIKETYIQSQVEQGNLY



LFQIYNKDFSPYSKGKPNLHTLYWKALFEEANLNNVVAKLNGEAEIFFRRHSIKASDKVVHPANQAIDN



KNPHTEKTQSTFEYDLVKDKRYTQDKFFFHVPISLNFKAQGVSKFNDKVNGFLKGNPDVNIIGIDRGERH



LLYFTVVNQKGEILVQESLNTLMSDKGHVNDYQQKLDKKEQERDAARKSWTTVENIKELKEGYLSHV



VHKLAHLIIKYNAIVCLEDLNFGFKRGRFKVEKQVYQKFEKALIDKLNYLVFKEKELGEVGHYLTAYQL



TAPFESFKKLGKQSGILFYVPADYTSKIDPTTGFVNFLDLRYQSVEKAKQLLSDFNAIRFNSVQNYFEFEI



DYKKLTPKRKVGTQSKWVICTYGDVRYQNRRNQKGHWETEEVNVTEKLKALFASDSKTTTVIDYAND



DNLIDVILEQDKASFFKELLWLLKLTMTLRHSKIKSEDDFILSPVKNEQGEFYDSRKAGEVWPKDADAN



GAYHIALKGLWNLQQINQWEKGKTLNLAIKNQDWFSFIQEKPYQE*





SEQ
MHTGGLLSMDAKEFTGQYPLSKTLRFELRPIGRTWDNLEASGYLAEDRHRAECYPRAKELLDDNHRAF


ID
LNRVLPQIDMDWHPIAEAFCKVHKNPGNKELAQDYNLQLSKRRKEISAYLQDADGYKGLFAKPALDEA


NO:
MKIAKENGNESDIEVLEAFNGFSVYFTGYHESRENIYSDEDMVSVAYRITEDNFPRFVSNALIFDKLNESH


5
PDIISEVSGNLGVDDIGKYFDVSNYNNFLSQAGIDDYNHIIGGHTTEDGLIQAFNVVLNLRHQKDPGFEKI



QFKQLYKQILSVRTSKSYIPKQFDNSKEMVDCICDYVSKIEKSETVERALKLVRNISSFDLRGIFVNKKNL



RILSNKLIGDWDAIETALMHSSSSENDKKSVYDSAEAFTLDDIFSSVKKFSDASAEDIGNRAEDICRVISET



APFINDLRAVDLDSLNDDGYEAAVSKIRESLEPYMDLFHELEIFSVGDEFPKCAAFYSELEEVSEQLIEIIP



LFNKARSFCTRKRYSTDKIKVNLKFPTLADGWDLNKERDNKAAILRKDGKYYLAILDMKKDLSSIRTSD



EDESSFEKMEYKLLPSPVKMLPKIFVKSKAAKEKYGLTDRMLECYDKGMHKSGSAFDLGFCHELIDYY



KRCIAEYPGWDVFDFKFRETSDYGSMKEFNEDVAGAGYYMSLRKIPCSEVYRLLDEKSIYLFQIYNKDY



SENAHGNKNMHTMYWEGLFSPQNLESPVFKLSGGAELFFRKSSIPNDAKTVHPKGSVLVPRNDVNGRRI



PDSIYRELTRYFNRGDCRISDEAKSYLDKVKTKKADHDIVKDRRFTVDKMMFHVPIAMNFKAISKPNLN



KKVIDGIIDDQDLKIIGIDRGERNLIYVTMVDRKGNILYQDSLNILNGYDYRKALDVREYDNKEARRNW



TKVEGIRKMKEGYLSLAVSKLADMIIENNAIIVMEDLNHGFKAGRSKIEKQVYQKFESMLINKLGYMVL



KDKSIDQSGGALHGYQLANHVTTLASVGKQCGVIFYIPAAFTSKIDPTTGFADLFALSNVKNVASMREFF



SKMKSVIYDKAEGKFAFTFDYLDYNVKSECGRTLWTVYTVGERFTYSRVNREYVRKVPTDIIYDALQK



AGISVEGDLRDRIAESDGDTLKSIFYAFKYALDMRVENREEDYIQSPVKNASGEFFCSKNAGKSLPQDSD



ANGAYNIALKGILQLRMLSEQYDPNAESIRLPLITNKAWLTFMQSGMKTWKN*





SEQ
MDSLKDFTNLYPVSKTLRFELKPVGKTLENIEKAGILKEDEHRAESYRRVKKIIDTYHKVFIDSSLENMA


ID
KMGIENEIKAMLQSFCELYKKDHRTEGEDKALDKIRAVLRGLIVGAFTGVCGRRENTVQNEKYESLFKE


NO:
KLIKEILPDFVLSTEAESLPFSVEEATRSLKEFDSFTSYFAGFYENRKNIYSTKPQSTAIAYRLIHENLPKFID


6
NILVFQKIKEPIAKELEHIRADFSAGGYIKKDERLEDIFSLNYYIHVLSQAGIEKYNALIGKIVTEGDGEMK



GLNEHINLYNQQRGREDRLPLFRPLYKQILSDREQLSYLPESFEKDEELLRALKEFYDHIAEDILGRTQQL



MTSISEYDLSRIYVRNDSQLTDISKKMLGDWNAIYMARERAYDHEQAPKRITAKYERDRIKALKGEESIS



LANLNSCIAFLDNVRDCRVDTYLSTLGQKEGPHGLSNLVENVFASYHEAEQLLSFPYPEENNLIQDKDN



VVLIKNLLDNISDLQRFLKPLWGMGDEPDKDERFYGEYNYIRGALDQVIPLYNKVRNYLTRKPYSTRKV



KLNFGNSQLLSGWDRNKEKDNSCVILRKGQNFYLAIMNNRHKRSFENKVLPEYKEGEPYFEKMDYKFL



PDPNKMLPKVFLSKKGIEIYKPSPKLLEQYGHGTHKKGDTFSMDDLHELIDFFKHSIEAHEDWKQFGFKF



SDTATYENVSSFYREVEDQGYKLSFRKVSESYVYSLIDQGKLYLFQIYNKDFSPCSKGTPNLHTLYWRM



LFDERNLADVIYKLDGKAEIFFREKSLKNDHPTHPAGKPIKKKSRQKKGEESLFEYDLVKDRHYTMDKF



QFHVPITMNFKCSAGSKVNDMVNAHIREAKDMHVIGIDRGERNLLYICVIDSRGTILDQISLNTINDIDYH



DLLESRDKDRQQERRNWQTIEGIKELKQGYLSQAVHRIAELMVAYKAVVALEDLNMGFKRGRQKVESS



VYQQFEKQLIDKLNYLVDKKKRPEDIGGLLRAYQFTAPFKSFKEMGKQNGFLFYIPAWNTSNIDPTTGF



VNLFHAQYENVDKAKSFFQKFDSISYNPKKDWFEFAFDYKNFTKKAEGSRSMWILCTHGSRIKNFRNSQ



KNGQWDSEEFALTEAFKSLFVRYEIDYTADLKTAIVDEKQKDFFVDLLKLFKLTVQMRNSWKEKDLDY



LISPVAGADGRFFDTREGNKSLPKDADANGAYNIALKGLWALRQIRQTSEGGKLKLAISNKEWLQFVQE



RSYEKD*





SEQ
MNNGTNNFQNFIGISSLQKTLRNALIPTETTQQFIVKNGIIKEDELRGENRQILKDIMDDYYRGFISETLSSI


ID
DDIDWTSLFEKMEIQLKNGDNKDTLIKEQTEYRKAIHKKFANDDRFKNMFSAKLISDILPEFVIHNNNYS


NO:
ASEKEEKTQVIKLFSRFATSFKDYFKNRANCFSADDISSSSCHRIVNDNAEIFFSNALVYRRIVKSLSNDDI


7
NKISGDMKDSLKEMSLEEIYSYEKYGEFITQEGISFYNDICGKVNSFMNLYCQKNKENKNLYKLQKLHK



QILCIADTSYEVPYKFESDEEVYQSVNGFLDNISSKHIVERLRKIGDNYNGYNLDKIYIVSKFYESVSQKT



YRDWETINTALEIHYNNILPGNGKSKADKVKKAVKNDLQKSITEINELVSNYKLCSDDNIKAETYIHEISH



ILNNFEAQELKYNPEIHLVESELKASELKNVLDVIMNAFHWCSVFMTEELVDKDNNFYAELEEIYDEIYP



VISLYNLVRNYVTQKPYSTKKIKLNFGIPTLADGWSKSKEYSNNAIILMRDNLYYLGIFNAKNKPDKKIIE



GNTSENKGDYKKMIYNLLPGPNKMIPKVFLSSKTGVETYKPSAYILEGYKQNKHIKSSKDFDITFCHDLI



DYFKNCIAIHPEWKNFGFDFSDTSTYEDISGFYREVELQGYKIDWTYISEKDIDLLQEKGQLYLFQIYNKD



FSKKSTGNDNLHTMYLKNLFSEENLKDIVLKLNGEAEIFFRKSSIKNPIIHKKGSILVNRTYEAEEKDQFG



NIQIVRKNIPENIYQELYKYFNDKSDKELSDEAAKLKNVVGHHEAATNIVKDYRYTYDKYFLHMPITINF



KANKTGFINDRILQYIAKEKDLHVIGIDRGERNLIYVSVIDTCGNIVEQKSFNIVNGYDYQIKLKQQEGAR



QIARKEWKEIGKIKEIKEGYLSLVIHEISKMVIKYNAIIAMEDLSYGFKKGRFKVERQVYQKFETMLINKL



NYLVFKDISITENGGLLKGYQLTYIPDKLKNVGHQCGCIFYVPAAYTSKIDPTTGFVNIFKFKDLTVDAK



REFIKKFDSIRYDSEKNLFCFTFDYNNFITQNTVMSKSSWSVYTYGVRIKRRFVNGRFSNESDTIDITKDM



EKTLEMTDINWRDGHDLRQDIIDYEIVQHIFEIFRLTVQMRNSLSELEDRDYDRLISPVLNENNIFYDSAK



AGDALPKDADANGAYCIALKGLYEIKQITENWKEDGKFSRDKLKISNKDWFDFIQNKRYL*





SEQ
MTNKFTNQYSLSKTLRFELIPQGKTLEFIQEKGLLSQDKQRAESYQEMKKTIDKFHKYFIDLALSNAKLT


ID
HLETYLELYNKSAETKKEQKFKDDLKKVQDNLRKEIVKSFSDGDAKSIFAILDKKELITVELEKWFENNE


NO:
QKDIYFDEKFKTFTTYFTGFHQNRKNMYSVEPNSTAIAYRLIHENLPKFLENAKAFEKIKQVESLQVNFR


8
ELMGEFGDEGLIFVNELEEMFQINYYNDVLSQNGITIYNSIISGFTKNDIKYKGLNEYINNYNQTKDKKDR



LPKLKQLYKQILSDRISLSFLPDAFTDGKQVLKAIFDFYKINLLSYTIEGQEESQNLLLLIRQTIENLSSFDT



QKIYLKNDTHLTTISQQVFGDFSVFSTALNYWYETKVNPKFETEYSKANEKKREILDKAKAVFTKQDYF



SIAFLQEVLSEYILTLDHTSDIVKKHSSNCIADYFKNHFVAKKENETDKTFDFIANITAKYQCIQGILENAD



QYEDELKQDQKLIDNLKFFLDAILELLHFIKPLHLKSESITEKDTAFYDVFENYYEALSLLTPLYNMVRNY



VTQKPYSTEKIKLNFENAQLLNGWDANKEGDYLTTILKKDGNYFLAIMDKKHNKAFQKFPEGKENYEK



MVYKLLPGVNKMLPKVFFSNKNIAYFNPSKELLENYKKETHKKGDTFNLEHCHTLIDFFKDSLNKHED



WKYFDFQFSETKSYQDLSGFYREVEHQGYKINFKNIDSEYIDGLVNEGKLFLFQIYSKDFSPFSKGKPNM



HTLYWKALFEEQNLQNVIYKLNGQAEIFFRKASIKPKNIILHKKKIKIAKKHFIDKKTKTSEIVPVQTIKNL



NMYYQGKISEKELTQDDLRYIDNFSIFNEKNKTIDIIKDKRFTVDKFQFHVPITMNFKATGGSYINQTVLE



YLQNNPEVKIIGLDRGERHLVYLTLIDQQGNILKQESLNTITDSKISTPYHKLLDNKENERDLARKNWGT



VENIKELKEGYISQVVHKIATLMLEENAIVVMEDLNFGFKRGRFKVEKQIYQKLEKMLIDKLNYLVLKD



KQPQELGGLYNALQLTNKFESFQKMGKQSGFLFYVPAWNTSKIDPTTGFVNYFYTKYENVDKAKAFFE



KFEAIRFNAEKKYFEFEVKKYSDFNPKAEGTQQAWTICTYGERIETKRQKDQNNKFVSTPINLTEKIEDFL



GKNQIVYGDGNCIKSQIASKDDKAFFETLLYWFKMTLQMRNSETRTDIDYLISPVMNDNGTFYNSRDYE



KLENPTLPKDADANGAYHIAKKGLMLLNKIDQADLTKKVDLSISNRDWLQFVQKNK*





SEQ
MEQEYYLGLDMGTGSVGWAVTDSEYHVLRKHGKALWGVRLFESASTAEERRMFRTSRRRLDRRNWRI


ID
EILQEIFAEEISKKDPGFFLRMKESKYYPEDKRDINGNCPELPYALFVDDDFTDKDYHKKFPTIYHLRKML


NO:
MNTEETPDIRLVYLAIHHMMKHRGHFLLSGDINEIKEFGTTFSKLLENIKNEELDWNLELGKEEYAVVES


9
ILKDNMLNRSTKKTRLIKALKAKSICEKAVLNLLAGGTVKLSDIFGLEELNETERPKISFADNGYDDYIGE



VENELGEQFYIIETAKAVYDWAVLVEILGKYTSISEAKVATYEKHKSDLQFLKKIVRKYLTKEEYKDIFV



STSDKLKNYSAYIGMTKINGKKVDLQSKRCSKEEFYDFIKKNVLKKLEGQPEYEYLKEELERETFLPKQV



NRDNGVIPYQIHLYELKKILGNLRDKIDLIKENEDKLVQLFEFRIPYYVGPLNKIDDGKEGKFTWAVRKS



NEKIYPWNFENVVDIEASAEKFIRRMTNKCTYLMGEDVLPKDSLLYSKYMVLNELNNVKLDGEKLSVE



LKQRLYTDVFCKYRKVTVKKIKNYLKCEGIISGNVEITGIDGDFKASLTAYHDFKEILTGTELAKKDKENI



ITNIVLFGDDKKLLKKRLNRLYPQITPNQLKKICALSYTGWGRFSKKFLEEITAPDPETGEVWNIITALWE



SNNNLMQLLSNEYRFMEEVETYNMGKQTKTLSYETVENMYVSPSVKRQIWQTLKIVKELEKVMKESPK



RVFIEMAREKQESKRTESRKKQLIDLYKACKNEEKDWVKELGDQEEQKLRSDKLYLYYTQKGRCMYS



GEVIELKDLWDNTKYDIDHIYPQSKTMDDSLNNRVLVKKKYNATKSDKYPLNENIRHERKGFWKSLLD



GGFISKEKYERLIRNTELSPEELAGFIERQIVETRQSTKAVAEILKQVFPESEIVYVKAGTVSRFRKDFELL



KVREVNDLHHAKDAYLNIVVGNSYYVKFTKNASWFIKENPGRTYNLKKMFTSGWNIERNGEVAWEVG



KKGTIVTVKQIMNKNNILVTRQVHEAKGGLFDQQIMKKGKGQIAIKETDERLASIEKYGGYNKAAGAYF



MLVESKDKKGKTIRTIEFIPLYLKNKIESDESIALNFLEKGRGLKEPKILLKKIKIDTLFDVDGFKMWLSGR



TGDRLLFKCANQLILDEKIIVTMKKIVKFIQRRQENRELKLSDKDGIDNEVLMEIYNTFVDKLENTVYRIR



LSEQAKTLIDKQKEFERLSLEDKSSTLFEILHIFQCQSSAANLKMIGGPGKAGILVMNNNISKCNKISIINQS



PTGIFENEIDLLK





SEQ
MNKFENFTGLYPISKTLRFELIPQGKTLEYIEKSEILENDNYRAEKYEEVKDIIDGYHKWFINETLHDLHIN


ID
WSELKVALENNRIEKSDASKKELQRVQKIKREEIYNAFIEHEAFQYLFKENLLSDLLPIQIEQSEDLDAEK


NO:
KKQAVETFNRFSTYFTGFHENRKNIYSKEGISTSVTYRIVHDNFPKFLENMKVFEILRNECPEVISDTANE


10
LAPFIDGVRIEDIFLIDFFNSTFSQNGIDYYNRILGGVTTETGEKYRGINEFTNLYRQQHPEFGKSKKATKM



VVLFKQILSDRDTLSFIPEMFGNDKQVQNSIQLFYNREISQFENEGVKTDVCTALATLTSKIAEFDTEKIYI



QQPELPNVSQRLFGSWNELNACLFKYAELKFGTAEKVANRKKIDKWLKSDLFSFTELNKALEFSGKDER



IENYFSETGIFAQLVKTGFDEAQSILETEYTSEVHLKDQQTDIEKIKTFLDALQNLMHLLKSLCVSEEADR



DAAFYNEFDMLYNQLKLVVPLYNKVRNYITQKLFRSDKIKIYFENKGQFLGGWVDSQTENSDNGTQAG



GYIFRKENVINEYDYYLGICSDPKLFRRTTIVSENDRSSFERLDYYQLKTASVYGNSYCGKHPYTEDKNE



LVNSIDRFVHLSGNNILIEKIAKDKVKSNPTTNTPSGYLNFIHREAPNTYECLLQDENFVSLNQRVVSALK



ATLATLVRVPKALVYAKKDYHLFSEIINDIDELSYEKAFSYFPVSQTEFENSSNRTIKPLLLFKISNKDLSF



AENFEKGNRQKIGKKNLHTLYFEALMKGNQDTIDIGTGMVFHRVKSLNYNEKTLKYGHHSTQLNEKFS



YPIIKDKRFASDKFLFHLSTEINYKEKRKPLNNSIIEFLTNNPDINIIGLDRGERHLIYLTLINQKGEILRQKT



FNIVGNTNYHEKLNQREKERDNARKSWATIGKIKELKEGFLSLVIHEIAKIMVENNAIVVLEDLNFGFKR



GRFKVEKQIYQKFEKMLIDKLNYLVFKDKKANEAGGVLKGYQLAEKFESFQKMGKQSGFLFYVPAAYT



SKIDPTTGFVNMLNLNYTNMKDAQTLLSGMDKISFNADANYFEFELDYEKFKTNQTDHTNKWTICTVG



EKRFTYNSATKETTTVNVTEDLKKLLDKFEVKYSNGDNIKDEICRQTDAKFFEIILWLLKLTMQMRNSN



TKTEEDFILSPVKNSNGEFFRSNDDANGIWPADADANGAYHIALKGLYLVKECFNKNEKSLKIEHKNWF



KFAQTRFNGSLTKNG*





SEQ
MENFKNLYPINKTLRFELRPYGKTLENFKKSGLLEKDAFKANSRRSMQAIIDEKFKETIEERLKYTEFSEC


ID
DLGNMTSKDKKITDKAATNLKKQVILSFDDEIFNNYLKPDKNIDALFKNDPSNPVISTFKGFTTYFVNFFE


NO:
IRKHIFKGESSGSMAYRIIDENLTTYLNNIEKIKKLPEELKSQLEGIDQIDKLNNYNEFITQSGITHYNEIIGG


11
ISKSENVKIQGINEGINLYCQKNKVKLPRLTPLYKMILSDRVSNSFVLDTIENDTELIEMISDLINKTEISQD



VIMSDIQNIFIKYKQLGNLPGISYSSIVNAICSDYDNNFGDGKRKKSYENDRKKHLETNVYSINYISELLTD



TDVSSNIKMRYKELEQNYQVCKENFNATNWMNIKNIKQSEKTNLIKDLLDILKSIQRFYDLFDIVDEDKN



PSAEFYTWLSKNAEKLDFEFNSVYNKSRNYLTRKQYSDKKIKLNFDSPTLAKGWDANKEIDNSTIIMRK



FNNDRGDYDYFLGIWNKSTPANEKIIPLEDNGLFEKMQYKLYPDPSKMLPKQFLSKIWKAKHPTTPEFD



KKYKEGRHKKGPDFEKEFLHELIDCFKHGLVNHDEKYQDVFGFNLRNTEDYNSYTEFLEDVERCNYNL



SFNKIADTSNLINDGKLYVFQIWSKDFSIDSKGTKNLNTIYFESLFSEENMIEKMFKLSGEAEIFYRPASLN



YCEDIIKKGHHHAELKDKFDYPIIKDKRYSQDKFFFHVPMVINYKSEKLNSKSLNNRTNENLGQFTHIIGI



DRGERHLIYLTVVDVSTGEIVEQKHLDEIINTDTKGVEHKTHYLNKLEEKSKTRDNERKSWEAIETIKEL



KEGYISHVINEIQKLQEKYNALIVMENLNYGFKNSRIKVEKQVYQKFETALIKKFNYIIDKKDPETYIHGY



QLTNPITTLDKIGNQSGIVLYIPAWNTSKIDPVTGFVNLLYADDLKYKNQEQAKSFIQKIDNIYFENGEFK



FDIDFSKWNNRYSISKTKWTLTSYGTRIQTFRNPQKNNKWDSAEYDLTEEFKLILNIDGTLKSQDVETYK



KFMSLFKLMLQLRNSVTGTDIDYMISPVTDKTGTHFDSRENIKNLPADADANGAYNIARKGIMAIENIM



NGISDPLKISNEDYLKYIQNQQE





SEQ
MTQFEGFTNLYQVSKTLRFELIPQGKTLKHIQEQGFIEEDKARNDHYKELKPIIDRIYKTYADQCLQLVQL


ID
DWENLSAAIDSYRKEKTEETRNALIEEQATYRNAIHDYFIGRTDNLTDAINKRHAEIYKGLFKAELFNGK


NO:
VLKQLGTVTTTEHENALLRSFDKFTTYFSGFYENRKNVFSAEDISTAIPHRIVQDNFPKFKENCHIFTRLIT


12
AVPSLREHFENVKKAIGIFVSTSIEEVFSFPFYNQLLTQTQIDLYNQLLGGISREAGTEKIKGLNEVLNLAI



QKNDETAHIIASLPHRFIPLFKQILSDRNTLSFILEEFKSDEEVIQSFCKYKTLLRNENVLETAEALFNELNS



IDLTHIFISHKKLETISSALCDHWDTLRNALYERRISELTGKITKSAKEKVQRSLKHEDINLQEIISAAGKEL



SEAFKQKTSEILSHAHAALDQPLPTTLKKQEEKEILKSQLDSLLGLYHLLDWFAVDESNEVDPEFSARLT



GIKLEMEPSLSFYNKARNYATKKPYSVEKFKLNFQMPTLASGWDVNKEKNNGAILFVKNGLYYLGIMP



KQKGRYKALSFEPTEKTSEGFDKMYYDYFPDAAKMIPKCSTQLKAVTAHFQTHTTPILLSNNFIEPLEITK



EIYDLNNPEKEPKKFQTAYAKKTGDQKGYREALCKWIDFTRDFLSKYTKTTSIDLSSLRPSSQYKDLGEY



YAELNPLLYHISFQRIAEKEIMDAVETGKLYLFQIYNKDFAKGHHGKPNLHTLYWTGLFSPENLAKTSIK



LNGQAELFYRPKSRMKRMAHRLGEKMLNKKLKDQKTPIPDTLYQELYDYVNHRLSHDLSDEARALLP



NVITKEVSHEIIKDRRFTSDKFFFHVPITLNYQAANSPSKFNQRVNAYLKEHPETPIIGIDRGERNLIYITVI



DSTGKILEQRSLNTIQQFDYQKKLDNREKERVAARQAWSVVGTIKDLKQGYLSQVIHEIVDLMIHYQAV



VVLENLNFGFKSKRTGIAEKAVYQQFEKMLIDKLNCLVLKDYPAEKVGGVLNPYQLTDQFTSFAKMGT



QSGFLFYVPAPYTSKIDPLTGFVDPFVWKTIKNHESRKHFLEGFDFLHYDVKTGDFILHFKMNRNLSFQR



GLPGFMPAWDIVFEKNETQFDAKGTPFIAGKRIVPVIENHRFTGRYRDLYPANELIALLEEKGIVFRDGSN



ILPKLLENDDSHAIDTMVALIRSVLQMRNSNAATGEDYINSPVRDLNGVCFDSRFQNPEWPMDADANG



AYHIALKGQLLLNHLKESKDLKLQNGISNQDWLAYIQELRN*





SEQ
MAVKSIKVKLRLDDMPEIRAGLWKLHKEVNAGVRYYTEWLSLLRQENLYRRSPNGDGEQECDKTAEE


ID
CKAELLERLRARQVENGHRGPAGSDDELLQLARQLYELLVPQAIGAKGDAQQIARKFLSPLADKDAVG


NO:
GLGIAKAGNKPRWVRMREAGEPGWEEEKEKAETRKSADRTADVLRALADFGLKPLMRVYTDSEMSSV


13
EWKPLRKGQAVRTWDRDMFQQAIERMMSWESWNQRVGQEYAKLVEQKNRFEQKNFVGQEHLVHLV



NQLQQDMKEASPGLESKEQTAHYVTGRALRGSDKVFEKWGKLAPDAPFDLYDAEIKNVQRRNTRRFG



SHDLFAKLAEPEYQALWREDASFLTRYAVYNSILRKLNHAKMFATFTLPDATAHPIWTRFDKLGGNLH



QYTFLFNEFGERRHAIRFHKLLKVENGVAREVDDVTVPISMSEQLDNLLPRDPNEPIALYFRDYGAEQHF



TGEFGGAKIQCRRDQLAHMHRRRGARDVYLNVSVRVQSQSEARGERRPPYAAVFRLVGDNHRAFVHF



DKLSDYLAEHPDDGKLGSEGLLSGLRVMSVDLGLRTSASISVFRVARKDELKPNSKGRVPFFFPIKGNDN



LVAVHERSQLLKLPGETESKDLRAIREERQRTLRQLRTQLAYLRLLVRCGSEDVGRRERSWAKLIEQPV



DAANHMTPDWREAFENELQKLKSLHGICSDKEWMDAVYESVRRVWRHMGKQVRDWRKDVRSGERP



KIRGYAKDVVGGNSIEQIEYLERQYKFLKSWSFFGKVSGQVIRAEKGSRFAITLREHIDHAKEDRLKKLA



DRIIMEALGYVYALDERGKGKWVAKYPPCQLILLEELSEYQFNNDRPPSENNQLMQWSHRGVFQELINQ



AQVHDLLVGTMYAAFSSRFDARTGAPGIRCRRVPARCTQEHNPEPFPWWLNKFVVEHTLDACPLRADD



LIPTGEGEIFVSPFSAEEGDFHQIHADLNAAQNLQQRLWSDFDISQIRLRCDWGEVDGELVLIPRLTGKRT



ADSYSNKVFYTNTGVTYYERERGKKRRKVFAQEKLSEEEAELLVEADEAREKSVVLMRDPSGIINRGN



WTRQKEFWSMVNQRIEGYLVKQIRSRVPLQDSACENTGDI*





SEQ
MATRSFILKIEPNEEVKKGLWKTHEVLNHGIAYYMNILKLIRQEAIYEHHEQDPKNPKKVSKAEIQAELW


ID
DFVLKMQKCNSFTHEVDKDVVFNILRELYEELVPSSVEKKGEANQLSNKFLYPLVDPNSQSGKGTASSG


NO:
RKPRWYNLKIAGDPSWEEEKKKWEEDKKKDPLAKILGKLAEYGLIPLFIPFTDSNEPIVKEIKWMEKSRN


14
QSVRRLDKDMFIQALERFLSWESWNLKVKEEYEKVEKEHKTLEERIKEDIQAFKSLEQYEKERQEQLLR



DTLNTNEYRLSKRGLRGWREIIQKWLKMDENEPSEKYLEVFKDYQRKHPREAGDYSVYEFLSKKENHFI



WRNHPEYPYLYATFCEIDKKKKDAKQQATFTLADPINHPLWVRFEERSGSNLNKYRILTEQLHTEKLKK



KLTVQLDRLIYPTESGGWEEKGKVDIVLLPSRQFYNQIFLDIEEKGKHAFTYKDESIKFPLKGTLGGARV



QFDRDHLRRYPHKVESGNVGRIYFNMTVNIEPTESPVSKSLKIHRDDFPKFVNFKPKELTEWIKDSKGKK



LKSGIESLEIGLRVMSIDLGQRQAAAASIFEVVDQKPDIEGKLFFPIKGTELYAVHRASFNIKLPGETLVKS



REVLRKAREDNLKLMNQKLNFLRNVLHFQQFEDITEREKRVTKWISRQENSDVPLVYQDELIQIRELMY



KPYKDWVAFLKQLHKRLEVEIGKEVKHWRKSLSDGRKGLYGISLKNIDEIDRTRKFLLRWSLRPTEPGE



VRRLEPGQRFAIDQLNHLNALKEDRLKKMANTIIMHALGYCYDVRKKKWQAKNPACQIILFEDLSNYN



PYEERSRFENSKLMKWSRREIPRQVALQGEIYGLQVGEVGAQFSSRFHAKTGSPGIRCSVVTKEKLQDN



RFFKNLQREGRLTLDKIAVLKEGDLYPDKGGEKFISLSKDRKLVTTHADINAAQNLQKRFWTRTHGFYK



VYCKAYQVDGQTVYIPESKDQKQKIIEEFGEGYFILKDGVYEWGNAGKLKIKKGSSKQSSSELVDSDILK



DSFDLASELKGEKLMLYRDPSGNVFPSDKWMAAGVFFGKLERILISKLTNQYSISTIEDDSSKQSM*





SEQ
MPTRTINLKLVLGKNPENATLRRALFSTHRLVNQATKRIEEFLLLCRGEAYRTVDNEGKEAEIPRHAVQE


ID
EALAFAKAAQRHNGCISTYEDQEILDVLRQLYERLVPSVNENNEAGDAQAANAWVSPLMSAESEGGLS


NO:
VYDKVLDPPPVWMKLKEEKAPGWEAASQIWIQSDEGQSLLNKPGSPPRWIRKLRSGQPWQDDFVSDQK


15
KKQDELTKGNAPLIKQLKEMGLLPLVNPFFRHLLDPEGKGVSPWDRLAVRAAVAHFISWESWNHRTRA



EYNSLKLRRDEFEAASDEFKDDFTLLRQYEAKRHSTLKSIALADDSNPYRIGVRSLRAWNRVREEWIDK



GATEEQRVTILSKLQTQLRGKFGDPDLFNWLAQDRHVHLWSPRDSVTPLVRINAVDKVLRRRKPYALM



TFAHPRFHPRWILYEAPGGSNLRQYALDCTENALHITLPLLVDDAHGTWIEKKIRVPLAPSGQIQDLTLE



KLEKKKNRLYYRSGFQQFAGLAGGAEVLFHRPYMEHDERSEESLLERPGAVWFKLTLDVATQAPPNWL



DGKGRVRTPPEVHHFKTALSNKSKHTRTLQPGLRVLSVDLGMRTFASCSVFELIEGKPETGRAFPVADE



RSMDSPNKLWAKHERSFKLTLPGETPSRKEEEERSIARAEIYALKRDIQRLKSLLRLGEEDNDNRRDALL



EQFFKGWGEEDVVPGQAFPRSLFQGLGAAPFRSTPELWRQHCQTYYDKAEACLAKHISDWRKRTRPRP



TSREMWYKTRSYHGGKSIWMLEYLDAVRKLLLSWSLRGRTYGAINRQDTARFGSLASRLLHHINSLKE



DRIKTGADSIVQAARGYIPLPHGKGWEQRYEPCQLILFEDLARYRFRVDRPRRENSQLMQWNHRAIVAE



TTMQAELYGQIVENTAAGFSSRFHAATGAPGVRCRFLLERDFDNDLPKPYLLRELSWMLGNTKVESEEE



KLRLLSEKIRPGSLVPWDGGEQFATLHPKRQTLCVIHADMNAAQNLQRRFFGRCGEAFRLVCQPHGDD



VLRLASTPGARLLGALQQLENGQGAFELVRDMGSTSQMNRFVMKSLGKKKIKPLQDNNGDDELEDVLS



VLPEEDDTGRITVFRDSSGIFFPCNVWIPAKQFWPAVRAMIWKVMASHSLG*





SEQ
MTKLRHRQKKLTHDWAGSKKREVLGSNGKLQNPLLMPVKKGQVTEFRKAFSAYARATKGEMTDGRK


ID
NMFTHSFEPFKTKPSLHQCELADKAYQSLHSYLPGSLAHFLLSAHALGFRIFSKSGEATAFQASSKIEAYE


NO:
SKLASELACVDLSIQNLTISTLFNALTTSVRGKGEETSADPLIARFYTLLTGKPLSRDTQGPERDLAEVISR


16
KIASSFGTWKEMTANPLQSLQFFEEELHALDANVSLSPAFDVLIKMNDLQGDLKNRTIVFDPDAPVFEYN



AEDPADIIIKLTARYAKEAVIKNQNVGNYVKNAITTTNANGLGWLLNKGLSLLPVSTDDELLEFIGVERS



HPSCHALIELIAQLEAPELFEKNVFSDTRSEVQGMIDSAVSNHIARLSSSRNSLSMDSEELERLIKSFQIHTP



HCSLFIGAQSLSQQLESLPEALQSGVNSADILLGSTQYMLTNSLVEESIATYQRTLNRINYLSGVAGQING



AIKRKAIDGEKIHLPAAWSELISLPFIGQPVIDVESDLAHLKNQYQTLSNEFDTLISALQKNFDLNFNKALL



NRTQHFEAMCRSTKKNALSKPEIVSYRDLLARLTSCLYRGSLVLRRAGIEVLKKHKIFESNSELREHVHE



RKHFVFVSPLDRKAKKLLRLTDSRPDLLHVIDEILQHDNLENKDRESLWLVRSGYLLAGLPDQLSSSFIN



LPIITQKGDRRLIDLIQYDQINRDAFVMLVTSAFKSNLSGLQYRANKQSFVVTRTLSPYLGSKLVYVPKD



KDWLVPSQMFEGRFADILQSDYMVWKDAGRLCVIDTAKHLSNIKKSVFSSEEVLAFLRELPHRTFIQTE



VRGLGVNVDGIAFNNGDIPSLKTFSNCVQVKVSRTNTSLVQTLNRWFEGGKVSPPSIQFERAYYKKDDQ



IHEDAAKRKIRFQMPATELVHASDDAGWTPSYLLGIDPGEYGMGLSLVSINNGEVLDSGFIHINSLINFAS



KKSNHQTKVVPRQQYKSPYANYLEQSKDSAAGDIAHILDRLIYKLNALPVFEALSGNSQSAADQVWTK



VLSFYTWGDNDAQNSIRKQHWFGASHWDIKGMLRQPPTEKKPKPYIAFPGSQVSSYGNSQRCSCCGRN



PIEQLREMAKDTSIKELKIRNSEIQLFDGTIKLFNPDPSTVIERRRHNLGPSRIPVADRTFKNISPSSLE



FKELITIVSRSIRHSPEFIAKKRGIGSEYFCAYSDCNSSLNSEANAAANVAQKFQKQLFFEL*





SEQ
MKRILNSLKVAALRLLFRGKGSELVKTVKYPLVSPVQGAVEELAEAIRHDNLHLFGQKEIVDLMEKDEG


ID
TQVYSVVDFWLDTLRLGMFFSPSANALKITLGKFNSDQVSPFRKVLEQSPFFLAGRLKVEPAERILSVEIR


NO:
KIGKRENRVENYAADVETCFIGQLSSDEKQSIQKLANDIWDSKDHEEQRMLKADFFAIPLIKDPKAVTEE


17
DPENETAGKQKPLELCVCLVPELYTRGFGSIADFLVQRLTLLRDKMSTDTAEDCLEYVGIEEEKGNGMN



SLLGTFLKNLQGDGFEQIFQFMLGSYVGWQGKEDVLRERLDLLAEKVKRLPKPKFAGEWSGHRMFLHG



QLKSWSSNFFRLFNETRELLESIKSDIQHATMLISYVEEKGGYHPQLLSQYRKLMEQLPALRTKVLDPEIE



MTHMSEAVRSYIMIHKSVAGFLPDLLESLDRDKDREFLLSIFPRIPKIDKKTKEIVAWELPGEPEEGYLFT



ANNLFRNFLENPKHVPRFMAERIPEDWTRLRSAPVWFDGMVKQWQKVVNQLVESPGALYQFNESFLR



QRLQAMLTVYKRDLQTEKFLKLLADVCRPLVDFFGLGGNDIIFKSCQDPRKQWQTVIPLSVPADVYTAC



EGLAIRLRETLGFEWKNLKGHEREDFLRLHQLLGNLLFWIRDAKLVVKLEDWMNNPCVQEYVEARKAI



DLPLEIFGFEVPIFLNGYLFSELRQLELLLRRKSVMTSYSVKTTGSPNRLFQLVYLPLNPSDPEKKNSNNF



QERLDTPTGLSRRFLDLTLDAFAGKLLTDPVTQELKTMAGFYDHLFGFKLPCKLAAMSNHPGSSSKMV



VLAKPKKGVASNIGFEPIPDPAHPVFRVRSSWPELKYLEGLLYLPEDTPLTIELAETSVSCQSVSSVAFDL



KNLTTILGRVGEFRVTADQPFKLTPIIPEKEESFIGKTYLGLDAGERSGVGFAIVTVDGDGYEVQRLGVHE



DTQLMALQQVASKSLKEPVFQPLRKGTFRQQERIRKSLRGCYWNFYHALMIKYRAKVVHEESVGSSGL



VGQWLRAFQKDLKKADVLPKKGGKNGVDKKKRESSAQDTLWGGAFSKKEEQQIAFEVQAAGSSQFCL



KCGWWFQLGMREVNRVQESGVVLDWNRSIVTFLIESSGEKVYGFSPQQLEKGFRPDIETFKKMVRDFM



RPPMFDRKGRPAAAYERFVLGRRHRRYRFDKVFEERFGRSALFICPRVGCGNFDHSSEQSAVVLALIGYI



ADKEGMSGKKLVYVRLAELMAEWKLKKLERSRVEEQSSAQ*





SEQ ID NO: 18
MAESKQMQCRKCGASMKYEVIGLGKKSCRYMCPDCGNHTSARKIQNKKKRDKKYGSASKAQSQRIAV



AGALYPDKKVQTIKTYKYPADLNGEVHDSGVAEKIAQAIQEDEIGLLGPSSEYACWIASQKQSEPYSVV



DFWFDAVCAGGVFAYSGARLLSTVLQLSGEESVLRAALASSPFVDDINLAQAEKFLAVSRRTGQDKLG



KRIGECFAEGRLEALGIKDRMREFVQAIDVAQTAGQRFAAKLKIFGISQMPEAKQWNNDSGLTVCILPD



YYVPEENRADQLVVLLRRLREIAYCMGIEDEAGFEHLGIDPGALSNFSNGNPKRGFLGRLLNNDIIALAN



NMSAMTPYWEGRKGELIERLAWLKHRAEGLYLKEPHFGNSWADHRSRIFSRIAGWLSGCAGKLKIAKD



QISGVRTDLFLLKRLLDAVPQSAPSPDFIASISALDRFLEAAESSQDPAEQVRALYAFHLNAPAVRSIANK



AVQRSDSQEWLIKELDAVDHLEFNKAFPFFSDTGKKKKKGANSNGAPSEEEYTETESIQQPEDAEQEVN



GQEGNGASKNQKKFQRIPRFFGEGSRSEYRILTEAPQYFDMFCNNMRAIFMQLESQPRKAPRDFKCFLQ



NRLQKLYKQTFLNARSNKCRALLESVLISWGEFYTYGANEKKFRLRHEASERSSDPDYVVQQALEIARR



LFLFGFEWRDCSAGERVDLVEIHKKAISFLLAITQAEVSVGSYNWLGNSTVSRYLSVAGTDTLYGTQLEE



FLNATVLSQMRGLAIRLSSQELKDGFDVQLESSCQDNLQHLLVYRASRDLAACKRATCPAELDPKILVL



PVGAFIASVMKMIERGDEPLAGAYLRHRPHSFGWQIRVRGVAEVGMDQGTALAFQKPTESEPFKIKPFS



AQYGPVLWLNSSSYSQSQYLDGFLSQPKNWSMRVLPQAGSVRVEQRVALIWNLQAGKMRLERSGARA



FFMPVPFSFRPSGSGDEAVLAPNRYLGLFPHSGGIEYAVVDVLDSAGFKILERGTIAVNGFSQKRGERQE



EAHREKQRRGISDIGRKKPVQAEVDAANELHRKYTDVATRLGCRIVVQWAPQPKPGTAPTAQTVYARA



VRTEAPRSGNQEDHARMKSSWGYTWGTYWEKRKPEDILGISTQVYWTGGIGESCPAVAVALLGHIRAT



STQTEWEKEEVVFGRLKKFFPS*





SEQ ID NO: 19
MEKRINKIRKKLSADNATKPVSRSGPMKTLLVRVMTDDLKKRLEKRRKKPEVMPQVISNNAANNLRML



LDDYTKMKEAILQVYWQEFKDDHVGLMCKFAQPASKKIDQNKLKPEMDEKGNLTTAGFACSQCGQPL



FVYKLEQVSEKGKAYTNYFGRCNVAEHEKLILLAQLKPEKDSDEAVTYSLGKFGQRALDFYSIHVTKES



THPVKPLAQIAGNRYASGPVGKALSDACMGTIASFLSKYQDIIIEHQKVVKGNQKRLESLRELAGKENLE



YPSVTLPPQPHTKEGVDAYNEVIARVRMWVNLNLWQKLKLSRDDAKPLLRLKGFPSFPVVERRENEVD



WWNTINEVKKLIDAKRDMGRVFWSGVTAEKRNTILEGYNYLPNENDHKKREGSLENPKKPAKRQFGD



LLLYLEKKYAGDWGKVFDEAWERIDKKIAGLTSHIEREEARNAEDAQSKAVLTDWLRAKASFVLERLK



EMDEKEFYACEIQLQKWYGDLRGNPFAVEAENRVVDISGFSIGSDGHSIQYRNLLAWKYLENGKREFYL



LMNYGKKGRIRFTDGTDIKKSGKWQGLLYGGGKAKVIDLTFDPDDEQLIILPLAFGTRQGREFIWNDLLS



LETGLIKLANGRVIEKTIYNKKIGRDEPALFVALTFERREVVDPSNIKPVNLIGVDRGENIPAVIALTDP



EGCPLPEFKDSSGGPTDILRIGEGYKEKQRAIQAAKEVEQRRAGGYSRKFASKSRNLADDMVRNSARDL



FYHAVTHDAVLVFENLSRGFGRQGKRTFMTERQYTKMEDWLTAKLAYEGLTSKTYLSKTLAQYTSKTCS



NCGFTITTADYDGMLVRLKKTSDGWATTLNNKELKAEGQITYYNRYKRQTVEKELSAELDRLSEESGN



NDISKWTKGRRDEALFLLKKRFSHRPVQEQFVCLDCGHEVHADEQAALNIARSWLFLNSNSTEFKSYKS



GKQPFVGAWQAFYKRRLKEVWKPNA





SEQ ID NO: 20
MKRINKIRRRLVKDSNTKKAGKTGPMKTLLVRVMTPDLRERLENLRKKPENIPQPISNTSRANLNKLLT



DYTEMKKAILHVYWEEFQKDPVGLMSRVAQPAPKNIDQRKLIPVKDGNERLTSSGFACSQCCQPLYVY



KLEQVNDKGKPHTNYFGRCNVSEHERLILLSPHKPEANDELVTYSLGKFGQRALDFYSIHVTRESNHPV



KPLEQIGGNSCASGPVGKALSDACMGAVASFLTKYQDIILEHQKVIKKNEKRLANLKDIASANGLAFPKI



TLPPQPHTKEGIEAYNNVVAQIVIWVNLNLWQKLKIGRDEAKPLQRLKGFPSFPLVERQANEVDWWDM



VCNVKKLINEKKEDGKVFWQNLAGYKRQEALLPYLSSEEDRKKGKKFARYQFGDLLLHLEKKHGEDW



GKVYDEAWERIDKKVEGLSKHIKLEEERRSEDAQSKAALTDWLRAKASFVIEGLKEADKDEFCRCELKL



QKWYGDLRGKPFAIEAENSILDISGFSKQYNCAFIWQKDGVKKLNLYLIINYFKGGKLRFKKIKPEAFEA



NRFYTVINKKSGEIVPMEVNFNFDDPNLIILPLAFGKRQGREFIWNDLLSLETGSLKLANGRVIEKTLYNR



RTRQDEPALFVALTFERREVLDSSNIKPMNLIGIDRGENIPAVIALTDPEGCPLSRFKDSLGNPTHILRI



GESYKEKQRTIQAAKEVEQRRAGGYSRKYASKAKNLADDMVRNTARDLLYYAVTQDAMLIFENLSRGFGR



QGKRTFMAERQYTRMEDWLTAKLAYEGLPSKTYLSKTLAQYTSKTCSNCGFTITSADYDRVLEKLKKT



ATGWMTTINGKELKVEGQITYYNRYKRQNVVKDLSVELDRLSEESVNNDISSWTKGRSGEALSLLKKR



FSHRPVQEKFVCLNCGFETHADEQAALNIARSWLFLRSQEYKKYQTNKTTGNTDKRAFVETWQSFYRK



KLKEVWKP





SEQ ID NO: 21
atgGGAAAAATGTATTATCTTGGTCTGGATATAGGAACAAATTCTGTTGGATATGCCGTAACCGACCC



ATCGTACCATTTGCTCAAATTTAAAGGCGAACCGATGTGGGGTGCCCACGTGTTTGCTGCGGGGAAT



CAATCAGCTGAACGGAGAAGCTTTCGTACGAGCCGCAGACGCCTTGACCGCAGGCAACAGCGTGTC



AAACTGGTTCAAGAAATCTTTGCTCCCGTGATTAGTCCCATTGATCCACGTTTTTTTATCAGACTTCA



TGAGAGCGCTTTATGGCGGGATGATGTGGCTGAAACGGATAAACATATTTTCTTTAATGACCCGACC



TATACGGATAAGGAATATTATTCTGACTATCCAACCATCCATCATCTCATTGTGGACCTTATGGAAA



GCAGTGAAAAGCATGACCCGCGGCTTGTTTATTTGGCTGTTGCCTGGCTGGTTGCTCATCGTGGTCA



TTTCCTCAATGAAGTGGATAAGGATAATATTGGGGATGTCCTGAGTTTTGACGCCTTTTATCCTGAG



TTTCTGGCATTTCTTTCCGATAATGGGGTGTCACCTTGGGTATGTGAGTCAAAAGCACTCCAAGCGA



CCCTGCTTTCACGAAACTCCGTCAACGATAAGTATAAAGCCTTGAAGTCTCTGATCTTTGGCAGCCA



AAAGCCGGAGGATAATTTTGATGCCAATATCAGTGAAGATGGACTTATCCAACTTTTAGCAGGAAA



AAAGGTCAAGGTCAATAAACTTTTTCCTCAAGAAAGTAATGATGCTTCCTTTACACTCAATGATAAG



GAAGATGCAATTGAGGAAATCTTAGGAACGCTTACACCGGATGAGTGTGAATGGATTGCGCATATT



AGGAGGCTGTTTGATTGGGCCATCATGAAACATGCTCTCAAAGATGGCAGAACAATCTCCGAATCG



AAAGTAAAGCTCTATGAACAGCATCACCATGACTTGACACAGCTCAAGTATTTTGTGAAGACCTAT



CTAGCAAAGGAATATGATGACATTTTTCGAAACGTAGATAGTGAAACAACCAAAAACTATGTCGCA



TATTCCTATCATGTAAAAGAAGTCAAGGGTACATTGCCCAAAAATAAGGCAACCCAAGAAGAATTT



TGCAAGTATGTCCTTGGAAAGGTAAAGAACATCGAATGCAGTGAAGCTGATAAGGTTGATTTTGAT



GAAATGATTCAGCGTCTTACAGACAATTCCTTTATGCCGAAACAAGTATCAGGTGAAAACAGGGTT



ATCCCTTACCAGCTTTACTATTATGAACTAAAGACTATTTTGAATAAAGCCGCTTCTTATCTGCCTTT



TTTGACCCAATGCGGAAAAGATGCCATCTCCAATCAAGATAAGCTCCTTTCCATCATGACCTTTCGG



ATTCCGTATTTCGTTGGGCCCTTGCGCAAGGACAATTCAGAGCATGCCTGGCTGGAACGAAAAGCA



GGGAAAATCTATCCGTGGAATTTTAACGACAAAGTTGACCTTGATAAAAGTGAAGAAGCGTTCATT



CGGAGAATGACGAATACCTGCACTTATTATCCCGGTGAAGATGTTTTGCCACTTGACTCCCTTATTT



ATGAAAAATTCATGATCCTCAATGAAATCAATAATATCCGAATTGATGGTTATCCTATTTCTGTAGA



TGTAAAACAGCAGGTTTTTGGCCTCTTTGAAAAGAAGAGAAGAGTGACCGTAAAGGATATCCAGAA



TCTCCTGCTTTCCTTGGGTGCCTTGGATAAGCATGGTAAATTGACGGGAATCGATACTACCATCCAT



AGCAATTACAATACATACCATCATTTTAAATCGCTCATGGAGCGTGGCGTTCTTACTCGTGATGATG



TGGAACGCATTGTGGAGCGTATGACCTATAGTGATGATACAAAACGCGTCCGTCTTTGGCTGAACA



ATAATTATGGAACGCTCACTGCTGACGACGTAAAGCATATTTCAAGGCTCCGAAAGCATGATTTTG



GCCGGCTTTCCAAAATGTTCCTCACAGGCCTAAAGGGAGTTCATAAGGAAACGGGGGAACGAGCTT



CCATTTTGGATTTTATGTGGAATACCAATGATAACTTGATGCAGCTTTTATCTGAATGTTATACTTTT



TCGGATGAAATTACCAAGCTGCAGGAAGCATACTATGCCAAGGCGCAGCTTTCCCTGAATGATTTTC



TGGACTCCATGTATATTTCAAATGCTGTCAAACGTCCTATCTATCGAACTCTTGCCGTTGTAAATGA



CATACGCAAAGCCTGTGGGACGGCGCCAAAACGCATTTTTATCGAAATGGCAAGAGATGGGGAAA



GCAAAAAGAAAAGGAGCGTAACAAGAAGAGAACAAATCAAGAATCTTTATAGGTCCATCCGCAAG



GATTTTCAGCAGGAGGTAGATTTCCTTGAAAAAATCCTTGAAAACAAAAGCGATGGACAGCTGCAA



AGCGATGCGCTCTATCTATACTTTGCGCAGCTTGGAAGGGATATGTATACCGGGGACCCTATCAAGT



TGGAGCATATCAAGGACCAGTCCTTCTATAATATTGATCATATCTATCCCCAAAGCATGGTCAAGGA



CGATAGTCTTGATAACAAGGTGTTGGTTCAATCGGAAATTAATGGAGAGAAGAGCAGTCGATATCC



TCTTGATGCTGCTATCCGTAATAAAATGAAGCCTCTTTGGGATGCTTATTATAACCATGGCCTGATTT



CCCTCAAGAAGTATCAGCGTTTGACGCGGAGCACTCCCTTTACAGATGATGAAAAGTGGGATTTCA



TCAATCGGCAGCTTGTTGAGACAAGACAATCCACGAAGGCCTTGGCAATCTTACTAAAAAGGAAGT



TCCCTGATACGGAGATTGTCTACTCCAAGGCAGGGCTTTCTTCTGATTTTCGGCATGAGTTTGGTCTC



GTAAAATCGAGGAATATCAATGACCTGCACCATGCAAAGGACGCATTTCTTGCGATTGTAACAGGA



AATGTCTATCATGAACGCTTTAATCGCCGGTGGTTTATGGTGAACCAGCCCTATTCCGTCAAGACCA



AGACGTTGTTTACGCATTCTATTAAAAATGGTAATTTTGTAGCTTGGAATGGAGAAGAGGATCTTGG



CCGCATTGTTAAAATGTTAAAGCAAAATAAGAACACTATTCATTTCACGCGGTTCTCTTTTGATCGA



AAGGAAGGCCTGTTTGATATTCAGCCACTAAAAGCGTCAACCGGTCTTGTACCAAGAAAAGCCGGA



CTAGACGTGGTAAAATATGGTGGCTATGACAAATCGACAGCAGCTTATTATCTCCTTGTTCGATTTA



CACTAGAAGATAAAAAGACTCAACATAAATTGATGATGATTCCTGTAGAAGGCTTGTATAAAGCTC



GAATTGACCATGATAAGGAATTCTTAACGGACTATGCACAAACTACAATCAGTGAAATCCTACAAA



AAGATAAACAAAAGGTGATAAATATAATGTTTCCAATGGGAACAAGGCACATTAAACTGAATTCCA



TGATTTCAATCGATGGTTTTTATCTTTCCATTGGAGGAAAGTCTAGTAAGGGAAAATCGGTGTTGTG



TCATGCTATGGTACCTCTTATTGTACCTCATAAGATAGAATGTTATATTAAGGCGATGGAGTCTTTT



GCACGTAAATTTAAAGAAAATAATAAATTAAGGATTGTGGAAAAGTTTGATAAGATTACGGTGGAA



GATAACTTGAACCTATACGAACTATTTTTACAAAAACTTCAACATAACCCATATAATAAGTTCTTCT



CCACACAATTTGATGTGCTGACTAATGGAAGAAGTACATTTACTAAATTATCTCCAGAGGAACAAG



TTCAAACGTTATTGAATATCTTATCAATTTTTAAAACTTGTCGGAGCTCTGGCTGCGATTTAAAATCC



ATTAACGGTTCTGCTCAAGCTGCCAGAATTATGATCAGCGCAGATTTAACTGGACTCTCAAAAAAAT



ATTCCGATATTCGGCTTGTTGAGCAATCAGCATCTGGACTTTTTGTTAGTAAATCACAAAATCTTTTG



GAGTATTTAtga





SEQ ID NO: 22
atgtcttcattaacaaaatttacaaataaatacagtaagcagctaaccataaaaaatgaactcatccc



agtaggaaagactctcgagaacattaaggaaaacggtctcatagatggagatgaacagctaaacgaga



attatcaaaaagcaaagataatcgttgatgattttctacgagatttcataaataaagctttaaataat



acccaaataggaaattggagagaattagcagatgctttaaataaagaagatgaagataacatagaaaa



gctccaagacaaaatcagaggaataattgtaagtaaattcgagacatttgatttgttttcttcttact



cgataaagaaagacgaaaagataatagatgatgataatgatgttgaagaagaggagctagatctagga



aaaaaaacttcctcatttaaatatatttttaagaaaaacctttttaaattagtacttccttcttattt



aaagacaacaaatcaggataaactgaaaataatctcttcttttgataatttttctacctatttcagag



gattctttgagaacagaaaaaatattttcactaagaagcctatatctacgtcaattgcctacagaatt



gtccatgataactttccaaagtttctagataacatcagatgttttaatgtgtggcaaacagaatgccc



acagttaattgtaaaggctgataattatttaaaatcaaagaacgtcatagctaaagataaatctttag



caaactattttactgtaggagcatatgattacttcttatcccagaatggcattgatttctacaacaac



attatcggcggtctaccagcatttgctggtcatgagaaaatccaaggacttaatgaatttataaatca



agaatgccaaaaggacagcgaactaaaatctaaactgaaaaacagacatgctttcaaaatggctgttc



tatttaagcaaattctttcagatagagaaaaaagttttgttatagacgagttcgaatctgatgctcag



gtcatagatgcggttaagaacttctatgcagaacaatgtaaggataataatgttatttttaaccttct



aaatcttatcaagaatatagcgttcttatctgatgatgaattagatggaatttttatagaaggcaagt



atttaagctctgtttcccaaaagctatattcagattggtcgaagcttcgaaatgatattgaagatagt



gcaaacagtaaacaaggaaataaagagttagcaaagaaaattaaaacaaataaaggcgatgttgaaaa



ggccataagtaaatatgagttttctttatcagaacttaactcaattgtacatgataatacaaaattca



gtgaccttctttcttgtacgttacataaagtggctagcgaaaaactagtgaaagttaatgaaggggac



tggccaaaacacctgaaaaataatgaagaaaaacaaaagataaaagagcctttagatgcattgttaga



aatttataatacattgctgatattcaactgcaagtcatttaataagaacggtaatttctatgttgatt



atgacagatgcataaatgagctttctagtgttgtttatttatataacaaaacaagaaattactgtaca



aagaaaccttataacacagacaaattcaaattaaactttaacagtcctcaattaggagagggctttag



taagtcgaaagaaaatgactgtctgacattattatttaaaaaagacgacaattactatgttggaatta



tcagaaaaggggcaaaaattaactttgatgatacacaagccattgcagacaatacagataactgtata



tttaagatgaattatttcctattaaaagatgctaaaaagtttattcctaaatgttcaattcagttaaa



agaagtaaaagcacattttaaaaaatcagaggatgattatatcctgagtgacaaagaaaaatttgcct



ctccccttgttattaagaaatcaacatttttattagcaacagcacatgtaaaaggaaagaaaggaaac



ataaaaaaattccaaaaggaatattctaaggaaaatccaacagaatatagaaattctctgaatgaatg



gattgcattttgtaaagaatttctaaaaacatataaggcggcaacaatcatgacattacaacgttaaa



aaaagctgaagaatatgctgatattgagagattataaggatgtagataatctagttataaactagagt



tttgccctattaaaacatctttcattgagaatcttattgataatggggacttatatttattcagaatc



aataataaagatttcagttcaaaatctactggtacaaagaatcttcatacgctctatcttcaggcaat



ctttgatgaaagaaacctcaataatcctactattatgttaaatggcggagcagagttattttatcgaa



aagaaagcattgaacagaaaaataggataactcataaggcaggatcaattcttgtaaacaaggtttgt



aaggatggaacaagtctagatgacaaaatcagaaacgaaatatatcaatatgaaaacaagtttattga



tacattgtctgatgaagctaaaaaagttttacctaatgtaataaaaaaagaagcaactcacgacataa



caaaagataagcgatttacatcagataagttctttttccattgcccattaacaattaactataaggaa



ggagatacaaaacaatttaacaatgaggttttatctttccttagaggtaatccagacattaatatcat



cggaattgacagaggagaaagaaaccttatatacgtaactgttattaatcagaaaggcgaaatacttg



acagcgtttcgtttaacacagtaacaaacaagtcgagcaaaattgaacaaactgttgattatgaggaa



aagcttgctgttagggaaaaagaaagaatagaagcaaaaagatcctgggattcaatatcaaagatagc



aaccttaaaagaaggttatctatcagctattgttcatgagatatgcctactgatgatcaaacacaacg



caatcgttgtacttgagaatctaaatgcaggatttaagagaattagaggaggattatcagaaaagtct



gtttatcagaaattcgagaagatgcttattaacaaactaaattactttgtatctaaaaaagaatcaga



ctggaataaacctagtggacttttaaatggtttacaactttcagaccagttcgagtcatttgagaaat



taggaattcaatctgggttcatcttctatgttcctgcagcatatacatctaagattgatcctacaaca



ggatttgcaaatgttcttaacttatccaaggtaagaaatgttgatgcaataaagagttttttcagtaa



tttcaatgaaatttcatatagcaaaaaagaagctctctttaaattctcttttgatttagattccttat



caaagaagggcttcagctcatttgtaaaattcagtaaatctaaatggaatgtatatacatttggagag



agaataataaaaccaaagaataagcaagggtatcgtgaagataagagaattaatttaacatttgaaat



gaaaaaacttctgaatgaatataaagtaagttttgatcttgaaaacaacttaattccaaatctaacct



ctgcaaatctgaaagataccttctggaaagaactattctttatttttaaaacaactctgcagcttaga



aacagtgtaacaaatggcaaagaagatgtactgatttctccagtaaagaacgctaaaggagagttctt



tgtatcaggaactcataacaagacattacctcaagactgtgatgcaaatggagcatatcatatcgccc



taaaaggtctgatgattcttgaacgtaacaatcttgttagagaagaaaaagacacaaagaagataatg



gcaatttctaatgttgactggtttgagtatgttcaaaaaaggagaggtgtcctgtaa





SEQ ID NO: 23
ATGAACAACTATGATGAGTTTACCAAACTGTACCCAATACAGAAAACGATAAGGTTCGAATTGAAG



CCGCAGGGAAGAACGATGGAACACCTCGAAACATTCAACTTTTTCGAAGAGGACAGGGATAGAGC



GGAGAAATATAAGATTTTAAAGGAAGCAATCGACGAGTATCATAAGAAGTTTATAGACGAACATCT



AACAAATATGTCTCTTGACTGGAATTCTTTAAAACAGATTTCAGAGAAATACTATAAGAGTAGAGA



GGAAAAAGACAAGAAAGTTTTTCTGTCAGAACAGAAACGCATGAGGCAAGAGATAGTTTCTGAGTT



CAAAAAAGACGATCGGTTTAAAGATCTTTTTTCAAAAAAATTGTTTTCTGAACTTCTCAAGGAAGAG



ATTTACAAAAAAGGAAACCATCAGGAAATTGACGCATTGAAAAGTTTTGATAAATTCTCAGGCTAT



TTTATTGGGTTGCATGAGAACCGAAAAAATATGTATTCTGACGGAGACGAGATCACGGCTATCTCT



AACCGTATTGTAAATGAGAATTTCCCGAAGTTCCTCGACAACCTTCAGAAATATCAGGAAGCTCGT



AAAAAATATCCAGAGTGGATCATTAAGGCAGAATCTGCTTTAGTTGCACATAATATCAAGATGGAT



GAAGTCTTTTCCTTAGAGTATTTCAACAAAGTCCTGAATCAAGAAGGAATACAGAGATACAATCTC



GCCCTAGGTGGCTATGTGACCAAAAGTGGTGAGAAAATGATGGGGCTTAATGATGCACTTAATCTT



GCCCATCAAAGTGAAAAAAGCAGCAAGGGAAGGATACACATGACTCCACTCTTCAAACAGATTCTG



AGTGAAAAAGAGTCCTTTTCTTATATACCAGATGTTTTTACAGAAGACTCTCAACTTTTACCATCCA



TTGGTGGGTTCTTTGCACAAATAGAAAATGATAAGGACGGGAATATTTTTGACAGAGCATTAGAAT



TGATATCTTCTTATGCAGAATACGATACAGAAAGGATATATATCAGGCAAGCGGACATAAACAGAG



TTTCTAATGTTATTTTCGGGGAGTGGGGAACACTGGGGGGGTTAATGAGGGAATACAAAGCAGACT



CTATCAACGACATCAATTTGGAGAGAACATGCAAGAAGGTAGACAAGTGGCTCGACTCAAAGGAG



TTTGCGTTATCAGATGTATTAGAGGCAATAAAAAGAACCGGCAATAATGATGCTTTTAATGAATAT



ATCTCAAAGATGCGCACTGCCAGGGAAAAGATTGACGCTGCAAGAAAGGAAATGAAATTCATTTCG



GAAAAAATATCTGGAGACGAAGAATCGATCCATATTATCAAAACCTTATTGGACTCGGTGCAACAG



TTTTTACATTTTTTCAATTTATTCAAAGCGCGTCAGGACATTCCTCTTGATGGAGCATTCTATGCGGA



GTTCGATGAAGTCCATAGCAAACTGTTTGCTATTGTTCCGTTGTATAATAAGGTTAGGAACTATCTT



ACGAAAAATAACCTTAACACGAAAAAGATAAAGCTAAACTTCAAGAATCCAACTCTGGCAAACGG



ATGGGATCAAAACAAGGTATATGACTACGCCTCCTTAATCTTTCTCCGCGATGGTAATTATTATCTC



GGAATAATAAATCCAAAAAGGAAAAAGAATATTAAATTCGAACAAGGGTCTGGAAATGGCCCATT



CTACCGGAAGATGGTGTACAAACAAATTCCAGGGCCGAACAAGAACTTACCAAGAGTCTTCCTCAC



ATCTACGAAAGGCAAAAAAGAGTACAAGCCGTCAAAGGAGATAATAGAAGGATATGAAGCGGACA



AACACATAAGAGGAGATAAATTCGATCTGGATTTCTGTCATAAGCTGATAGACTTCTTCAAGGAAT



CCATCGAGAAGCACAAGGACTGGAGTAAGTTCAACTTCTATTTCTCTCCAACTGAATCATATGGAG



ACATCAGCGAATTCTATCTGGATGTAGAAAAACAGGGATACCGGATGCATTTTGAGAATATTTCTG



CCGAGACGATTGATGAGTATGTCGAAAAGGGGGACTTATTCCTCTTCCAGATATACAACAAAGACT



TTGTGAAAGCGGCAACCGGAAAAAAAGATATGCACACCATTTATTGGAACGCGGCATTCTCGCCCG



AGAACCTTCAGGATGTGGTAGTGAAACTGAACGGTGAAGCAGAACTTTTCTACAGAGACAAGAGCG



ACATCAAGGAGATAGTTCACAGGGAGGGAGAGATACTGGTCAATCGTACCTACAACGGCAGGACA



CCTGTGCCTGACAAGATCCACAAAAAATTAACAGATTATCATAATGGCCGTACCAAAGATCTCGGA



GAAGCAAAAGAATACCTCGATAAGGTCAGATATTTCAAAGCGCACTACGACATCACAAAGGATCGC



AGATACCTGAATGATAAAATATACTTCCATGTGCCTCTGACATTGAATTTCAAAGCAAACGGGAAG



AAGAATCTCAATAAGATGGTAATTGAAAAGTTCCTCTCGGACGAAAAAGCGCATATTATTGGGATT



GATCGCGGGGAAAGGAATCTTCTTTACTATTCTATCATTGACAGGTCAGGTAAAATAATCGATCAAC



AGAGCCTCAACGTCATCGATGGATTCGATTACCGAGAGAAACTGAATCAGAGGGAGATCGAGATG



AAGGATGCCAGACAAAGCTGGAATGCTATCGGGAAGATAAAGGACCTCAAGGAAGGGTATCTTTC



AAAAGCGGTCCACGAAATTACCAAGATGGCGATACAATACAATGCCATTGTTGTCATGGAGGAACT



CAATTATGGGTTCAAACGCGGACGTTTCAAAGTTGAGAAGCAGATATATCAGAAATTCGAGAATAT



GCTGATTGACAAGATGAATTATCTGGTATTCAAGGATGCTCCGGATGAAAGTCCGGGAGGAGTCCT



CAATGCATATCAGCTTACTAATCCGCTTGAAAGTTTCGCTAAACTTGGGAAACAGACAGGAATTCTT



TTCTATGTTCCGGCAGCCTATACTTCGAAGATAGATCCGACGACCGGGTTTGTCAATCTTTTCAATA



CTTCAAGTAAAACGAACGCACAGGAAAGAAAAGAATTCTTGCAAAAATTCGAGTCGATCTCCTATT



CCGCTAAAGACGGAGGAATATTCGCATTCGCGTTCGATTATCGGAAGTTCGGAACGTCAAAAACAG



ACCACAAAAATGTATGGACCGCATACACGAACGGGGAAAGGATGAGGTACATAAAAGAGAAAAAA



CGCAACGAACTGTTCGACCCCTCGAAGGAGATCAAAGAGGCTCTCACTTCATCAGGAATCAAATAT



GACGGCGGACAGAACATATTGCCAGATATCCTGAGGAGCAACAATAACGGTCTGATCTACACAATG



TATTCCTCTTTCATAGCGGCCATTCAAATGAGGGTCTATGACGGGAAAGAAGACTATATCATCTCGC



CGATAAAGAACAGCAAGGGAGAGTTCTTCAGGACCGATCCGAAAAGAAGGGAACTTCCGATAGAC



GCGGATGCGAACGGCGCGTATAACATTGCTCTCAGGGGCGAATTGACGATGCGTGCGATAGCGGAG



AAGTTCGATCCGGACTCGGAAAAGATGGCGAAGCTAGAACTGAAACATAAGGACTGGTTCGAATTC



ATGCAGACAAGGGGGGATTGA





SEQ ID NO: 24
ATGACAAAAACATTTGATTCAGAATTTTTTAATTTATATTCTCTTCAAAAAACAGTTCGTTTTGAACT



CAAGCCGGTTGGTGAAACAGCCTCGTTTGTTGAAGATTTTAAAAACGAAGGTTTGAAACGAGTTGT



TTCAGAGGATGAACGGCGTGCGGTTGATTACCAAAAAGTGAAAGAAATTATTGATGACTACCACCG



AGATTTTATTGAAGAATCGCTGAACTATTTTCCTGAGCAGGTCTCAAAAGACGCTTTGGAACAAGCT



TTTCACCTTTATCAAAAACTAAAAGCCGCTAAGGTTGAAGAGCGTGAAAAAGCATTGAAAGAATGG



GAAGCCCTTCAGAAAAAACTGCGCGAAAAAGTTGTTAAATGTTTTTCAGATTCAAACAAAGCACGC



TTTTCCCGCATTGATAAAAAAGAACTGATTAAAGAAGATTTAATTAACTGGTTGGTTGCACAAAATC



GCGAAGATGACATTCCAACCGTTGAAACCTTTAACAACTTTACGACTTATTTTACGGGGTTTCATGA



AAACCGAAAAAACATTTATTCAAAAGACGATCATGCCACAGCCATTTCATTTCGACTCATTCATGAA



AACCTGCCTAAGTTTTTTGATAATGTGATCAGCTTTAATAAATTGAAGGAAGGATTTCCAGAGCTGA



AATTTGATAAGGTTAAGGAAGATTTAGAAGTTGATTATGACTTGAAACATGCCTTTGAAATCGAAT



ACTTTGTCAATTTTGTTACCCAAGCCGGAATTGACCAATATAACTATCTTTTGGGGGGTAAAACCTT



AGAAGACGGCACCAAAAAGCAAGGCATGAATGAACAAATCAATCTGTTCAAGCAACAGCAAACCC



GAGACAAAGCCCGACAAATTCCCAAACTCATACCATTGTTTAAACAAATTCTAAGCGAACGAACGG



AAAGCCAATCGTTTATTCCAAAACAATTTGAATCAGACCAAGAGCTATTTGACTCACTGCAAAAAC



TGCATAACAACTGCCAAGATAAATTTACCGTACTGCAACAAGCCATTTTAGGCTTAGCCGAAGCAG



ATCTGAAAAAAGTATTCATTAAAACATCTGATCTTAATGCGCTATCAAATACCATTTTTGGAAATTA



CAGTGTGTTTTCGGATGCGTTGAATTTATACAAAGAATCGCTCAAAACAAAAAAGGCGCAAGAAGC



GTTTGAAAAACTACCCGCTCACAGCATTCATGACTTGATTCAATATTTGGAGCAATTTAATAGCTCT



TTGGATGCAGAAAAACAGCAATCAACTGACACCGTACTGAATTACTTTATTAAAACAGACGAGCTG



TATTCTCGGTTCATAAAATCAACGAGCGAAGCCTTCACACAAGTACAACCACTCTTTGAATTGGAAG



CATTAAGCTCAAAACGTCGTCCACCGGAAAGTGAAGACGAAGGCGCAAAAGGTCAGGAAGGGTTT



GAGCAAATTAAACGCATAAAAGCCTATTTGGATACCTTGATGGAGGCGGTGCATTTTGCAAAACCA



CTTTATCTGGTGAAGGGGCGCAAAATGATTGAAGGTCTGGACAAAGACCAAAGTTTCTATGAAGCC



TTTGAAATGGCTTACCAAGAACTAGAAAGTCTGATTATTCCAATCTACAACAAAGCTCGTAGTTATT



TAAGTCGTAAACCGTTTAAAGCGGACAAATTCAAAATTAATTTTGATAATAATACATTGCTTTCCGG



TTGGGATGCTAATAAAGAAACGGCTAACGCTTCAATTTTGTTTAAGAAGGATGGTTTGTATTATTTA



GGAATCATGCCTAAAGGAAAAACGTTTTTGTTCGATTACTTCGTTTCATCGGAAGATTCTGAAAAGT



TAAAACAAAGAAGACAAAAAACCGCCGAAGAAGCGCTTGCGCAAGATGGCGAAAGCTACTTTGAA



AAAATTCGTTACAAGCTGTTACCTGGCGCCAGCAAAATGTTGCCGAAAGTATTTTTTTCCAACAAAA



ACATAGGGTTTTACAACCCAAGTGATGACATACTTCGTATCAGGAATACAGCCTCTCACACTAAAA



ACGGAACACCGCAAAAAGGGCACTCTAAAGTAGAGTTTAATTTGAATGATTGTCATAAGATGATTG



ATTTCTTTAAATCAAGCATTCAAAAGCATCCAGAGTGGGGAAGTTTTGGATTCACCTTTTCAGATAC



ATCAGATTTTGAAGATATGAGCGCCTTTTATCGAGAAGTCGAAAACCAAGGTTATGTCATTAGTTTC



GATAAAATAAAAGAAACTTACATTCAGAGTCAAGTTGAACAGGGGAACCTATATTTATTCCAAATC



TACAATAAAGACTTCTCGCCCTACAGCAAAGGCAAACCAAATTTACACACGCTTTACTGGAAAGCG



TTGTTTGAGGAAGCCAACCTAAATAATGTGGTGGCAAAACTCAATGGTGAAGCTGAAATTTTCTTTA



GGCGACACTCAATCAAAGCATCTGATAAAGTGGTGCACCCAGCGAATCAAGCCATTGACAATAAAA



ACCCGCATACCGAAAAAACGCAAAGCACCTTTGAATATGATCTTGTAAAAGACAAGCGCTATACCC



AAGACAAATTCTTCTTCCATGTACCGATTTCATTGAACTTTAAGGCACAAGGTGTTTCAAAATTTAA



CGATAAAGTGAATGGATTTTTAAAGGGTAACCCAGATGTCAATATTATTGGCATTGACCGAGGCGA



ACGACACCTTCTGTATTTCACTGTGGTGAATCAGAAAGGTGAAATTTTGGTTCAAGAGTCGCTTAAT



ACCCTAATGAGTGATAAAGGGCATGTGAATGACTACCAGCAAAAACTCGACAAAAAAGAACAAGA



ACGCGATGCCGCTCGCAAAAGCTGGACGACGGTTGAAAATATCAAAGAATTAAAAGAAGGCTATTT



ATCTCATGTTGTTCATAAGTTGGCACACCTGATTATTAAATACAATGCCATTGTTTGCTTGGAAGAC



CTGAATTTTGGTTTCAAACGCGGGCGTTTTAAAGTGGAAAAACAAGTTTATCAGAAATTTGAAAAA



GCGCTTATTGATAAGCTTAACTACTTGGTATTTAAAGAAAAAGAGTTAGGCGAGGTGGGCCATTAT



CTAACCGCCTATCAGTTGACCGCACCGTTTGAAAGTTTCAAGAAGTTAGGCAAGCAAAGTGGCATA



TTGTTTTATGTTCCGGCGGATTACACCTCCAAAATTGACCCAACCACCGGGTTTGTCAACTTTCTTGA



TCTGCGTTATCAGAGTGTCGAAAAAGCCAAACAGCTCTTAAGCGACTTTAATGCCATTCGTTTTAAT



TCAGTACAAAACTATTTTGAGTTCGAAATAGATTACAAAAAACTCACACCCAAACGTAAAGTTGGT



ACTCAGAGTAAATGGGTGATTTGTACCTATGGAGATGTCCGCTATCAAAATCGGCGTAATCAAAAA



GGTCACTGGGAAACGGAAGAAGTCAATGTGACTGAAAAACTAAAAGCCCTTTTCGCCAGTGATTCC



AAAACTACAACCGTAATCGATTACGCCAATGACGACAACCTAATTGACGTCATTCTGGAACAGGAC



AAAGCCAGCTTCTTCAAAGAACTGTTATGGTTATTAAAACTCACCATGACGCTCCGCCACAGCAAA



ATCAAAAGTGAAGACGACTTTATTCTTTCACCCGTTAAAAACGAACAAGGCGAGTTTTACGATAGT



CGAAAAGCGGGCGAGGTGTGGCCTAAAGATGCAGACGCCAATGGCGCTTATCACATAGCGTTGAA



AGGCTTGTGGAATCTGCAACAGATCAATCAGTGGGAAAAGGGTAAAACACTTAATCTGGCGATTAA



AAACCAGGATTGGTTCAGTTTTATTCAAGAAAAGCCCTATCAAGAATAA





SEQ ID NO: 25
ATGCACACAGGCGGATTACTTAGCATGGATGCCAAGGAGTTTACCGGACAGTACCCCCTTTCGAAG



ACTCTGCGTTTTGAACTGAGACCGATAGGCAGAACGTGGGACAATCTCGAAGCATCGGGGTATCTT



GCGGAGGACAGACACCGTGCAGAATGCTATCCCAGGGCAAAAGAGCTCTTGGACGACAACCATCG



TGCATTCCTCAACCGTGTCCTGCCTCAGATCGATATGGATTGGCACCCGATCGCAGAGGCATTCTGC



AAAGTCCACAAGAATCCGGGAAACAAGGAATTGGCTCAGGATTACAATCTTCAGCTGTCCAAACGC



AGAAAGGAGATTTCGGCCTATCTGCAGGATGCGGACGGCTATAAAGGTCTGTTTGCCAAACCTGCA



TTGGATGAAGCAATGAAGATCGCGAAAGAAAACGGAAATGAATCGGACATAGAGGTTCTTGAGGC



ATTCAACGGTTTCTCCGTATACTTCACCGGATATCATGAGAGCAGGGAGAACATCTATTCGGACGA



GGATATGGTGTCGGTAGCTTATCGCATCACCGAAGACAATTTCCCGAGATTCGTTTCCAATGCGCTT



ATATTCGATAAGCTGAATGAGTCGCACCCCGATATAATCTCGGAAGTATCCGGAAATCTGGGCGTA



GACGACATCGGAAAATATTTTGATGTGTCTAACTACAATAATTTCCTGTCGCAGGCCGGTATAGATG



ACTACAATCACATCATCGGCGGCCATACGACGGAGGACGGTCTGATCCAGGCATTCAATGTTGTTCT



GAATCTCAGGCATCAGAAAGACCCCGGATTCGAAAAAATCCAATTCAAACAGCTGTACAAACAGAT



ACTCAGCGTCCGTACATCCAAATCCTATATCCCGAAACAGTTCGATAATTCGAAGGAGATGGTGGA



CTGCATCTGCGACTATGTGTCCAAGATCGAAAAATCCGAAACGGTCGAGAGAGCATTGAAGCTGGT



AAGGAACATATCTTCTTTTGATTTGCGCGGAATATTCGTAAACAAGAAGAATCTCCGCATTCTTTCC



AACAAACTGATTGGTGATTGGGACGCGATCGAAACCGCGCTGATGCACTCCTCCTCTTCGGAAAAT



GATAAGAAATCCGTCTACGACAGCGCCGAGGCATTTACGCTGGATGATATCTTTTCGTCCGTTAAAA



AATTCTCAGATGCATCTGCAGAGGATATCGGAAACCGGGCGGAGGACATATGCAGAGTCATATCTG



AGACCGCTCCGTTCATAAACGATCTGAGGGCTGTCGATTTGGACAGTTTGAATGACGACGGTTACG



AGGCGGCGGTTTCCAAGATAAGGGAATCTCTGGAACCATATATGGATCTGTTTCATGAACTGGAGA



TATTCTCCGTAGGCGATGAATTCCCGAAATGTGCAGCTTTCTACAGTGAACTTGAAGAAGTCTCCGA



ACAGCTAATCGAGATTATACCGTTATTCAACAAGGCCCGTTCGTTCTGTACGCGCAAGAGATACAGT



ACGGACAAGATAAAGGTCAATTTGAAATTCCCGACACTCGCCGACGGATGGGATCTCAACAAAGAA



CGCGACAACAAAGCCGCAATACTCAGGAAAGACGGAAAGTACTACCTGGCCATACTGGATATGAA



GAAAGATCTTTCTTCGATCAGAACTTCGGATGAAGACGAATCCAGTTTTGAGAAAATGGAGTACAA



GCTTCTTCCGAGTCCGGTAAAGATGCTGCCAAAGATCTTCGTAAAATCGAAGGCGGCCAAGGAGAA



GTACGGTCTGACCGACCGTATGCTGGAGTGCTACGATAAAGGGATGCACAAGAGCGGCAGTGCATT



CGATCTCGGATTTTGTCACGAATTGATCGATTACTACAAGAGGTGCATCGCAGAATATCCCGGCTGG



GACGTCTTCGATTTCAAGTTCAGGGAAACATCGGATTATGGCAGCATGAAGGAGTTCAATGAGGAT



GTTGCAGGGGCCGGATACTATATGTCCCTCAGAAAGATCCCTTGTTCGGAGGTCTACAGGCTTCTTG



ATGAGAAATCGATATATCTTTTCCAGATCTACAACAAAGATTATTCGGAAAACGCTCATGGGAATA



AGAACATGCATACCATGTATTGGGAAGGGCTCTTTTCCCCCCAGAATCTGGAATCCCCTGTGTTTAA



ACTCAGCGGCGGTGCGGAGCTTTTCTTCCGTAAATCCTCCATACCCAATGACGCCAAAACGGTCCAT



CCGAAGGGAAGCGTCCTGGTTCCGCGCAATGATGTAAACGGCCGCAGGATACCTGACAGCATATAT



CGGGAGCTCACCAGATATTTCAACCGCGGAGATTGCCGCATAAGCGACGAGGCAAAGAGTTATCTG



GACAAGGTGAAAACCAAGAAAGCTGACCACGATATCGTGAAAGACAGGAGGTTCACGGTGGACAA



GATGATGTTCCACGTCCCTATCGCCATGAATTTCAAAGCGATTTCGAAGCCGAATCTCAATAAAAAG



GTGATTGACGGCATAATCGACGACCAAGATCTGAAGATCATCGGCATAGACCGCGGAGAGCGCAA



CCTCATCTACGTAACCATGGTGGATCGCAAAGGGAACATCCTCTATCAGGATAGCCTCAATATTCTG



AACGGATACGATTACCGTAAGGCCCTCGACGTCCGCGAATATGACAATAAAGAGGCTCGGAGGAA



CTGGACGAAGGTCGAAGGCATCCGTAAGATGAAAGAGGGGTATCTGTCGCTTGCAGTCAGCAAATT



GGCAGATATGATCATAGAGAACAATGCGATTATCGTCATGGAGGATCTCAATCACGGATTCAAGGC



AGGGCGTTCGAAGATAGAGAAACAGGTCTATCAGAAGTTCGAATCCATGCTCATAAACAAACTCGG



TTACATGGTCCTCAAGGATAAGTCTATCGATCAGAGCGGCGGAGCTCTCCACGGATACCAGCTTGC



CAACCATGTGACAACATTGGCATCTGTAGGTAAACAATGTGGAGTGATATTCTACATCCCTGCTGCA



TTTACATCCAAGATAGATCCGACAACAGGATTTGCAGATCTGTTCGCCCTCAGCAATGTTAAAAACG



TGGCATCTATGAGAGAATTTTTCTCCAAGATGAAGTCTGTAATCTATGATAAGGCGGAGGGAAAAT



TCGCATTTACCTTCGACTATCTTGATTATAATGTGAAATCCGAGTGCGGAAGGACCCTTTGGACCGT



GTATACGGTCGGAGAGAGATTCACATACAGCAGGGTCAATAGAGAATATGTCAGAAAAGTTCCGAC



AGACATAATCTACGACGCATTGCAAAAGGCAGGAATATCTGTTGAAGGGGATCTCAGGGACAGGAT



TGCTGAATCGGATGGCGACACTCTGAAGAGCATATTCTATGCATTCAAGTATGCATTGGATATGAG



AGTAGAGAACCGCGAAGAGGATTACATACAGTCTCCTGTCAAAAATGCCTCCGGAGAATTCTTCTG



TTCCAAGAACGCAGGCAAATCGCTCCCTCAGGATTCCGATGCGAACGGTGCATACAATATCGCACT



CAAGGGGATCCTGCAGCTACGTATGCTTTCCGAGCAGTATGATCCGAATGCAGAGAGCATACGGTT



GCCACTGATAACCAACAAGGCCTGGCTGACCTTTATGCAGTCCGGTATGAAGACATGGAAGAACTG



A





SEQ ID NO: 26
atgGATAGTTTGAAAGATTTCACCAATCTGTACCCTGTCAGTAAGACATTGAGATTTGAATTAAAGCC



CGTTGGAAAGACTTTAGAAAATATCGAGAAAGCAGGTATTTTGAAAGAGGATGAGCATCGTGCAGA



AAGTTATCGGAGGGTGAAGAAAATAATTGATACTTATCATAAGGTATTTATCGATTCTTCTCTTGAA



AATATGGCTAAAATGGGTATTGAGAATGAAATAAAAGCAATGCTCCAAAGTTTCTGCGAATTGTAT



AAAAAAGATCATCGCACTGAGGGTGAAGACAAGGCATTAGATAAAATTCGAGCAGTACTTCGTGGC



CTGATTGTTGGGGCTTTCACTGGTGTTTGCGGAAGACGGGAAAATACAGTCCAAAACGAGAAGTAC



AGAGTTTGTTCAAAGAAAAGTTGATAAAAGAAATTTTACCTGATTTTGTGCTCTCTACTGAGGCTG



AAAGCTTGCCTTTCTCTGTTGAAGAAGCTACGAGGTCACTGAAGGAGTTTGATAGCTTTACATCCTA



CTTTGCTGGTTTTTACGAGAATAGAAAGAATATATACTCGACGAAACCTCAATCCACTGCCATTGCT



TATCGTCTTATTCATGAGAACTTGCCGAAGTTCATTGATAATATTCTTGTTTTTCAGAAGATCAAAG



AGCCTATAGCCAAAGAGCTGGAACATATTCGTGCGGACTTTTCTGCCGGGGGGTACATAAAAAAGG



ATGAGAGATTGGAGGATATTTTTTCGTTGAACTATTATATCCACGTGTTATCTCAGGCTGGGATCGA



AAAATATAACGCATTGATTGGGAAGATTGTGACAGAAGGAGATGGAGAGATGAAAGGGCTCAATG



AACACATCAACCTTTACAACCAACAAAGAGGCAGAGAGGATCGGCTCCCTCTTTTTAGGCCTCTTTA



TAAACAGATATTGAGTGACAGAGAGCAATTATCATACTTGCCTGAGAGTTTTGAAAAAGATGAGGA



GCTCCTCAGGGCTCTAAAAGAGTTCTATGATCATATCGCAGAAGACATTCTCGGACGTACTCAACA



GTTGATGACTTCTATTTCAGAATATGATTTATCTCGGATATACGTAAGGAACGATAGCCAATTGACT



GATATATCAAAAAAAATGTTGGGAGATTGGAATGCTATCTACATGGCTAGAGAACGAGCATATGAC



CACGAGCAGGCTCCCAAAAGAATCACGGCGAAATACGAGAGGGACAGGATTAAAGCTCTTAAAGG



AGAAGAGAGTATAAGTCTGGCAAATCTTAATAGTTGTATTGCCTTTCTGGACAATGTTAGAGATTGC



CGTGTAGATACTTATCTTTCCACACTGGGCCAGAAGGAAGGACCACATGGTCTATCTAATCTCGTTG



AGAACGTTTTTGCCTCATACCATGAAGCAGAGCAATTGTTGAGCTTTCCATACCCCGAAGAGAATA



ATCTGATTCAGGACAAGGACAATGTGGTGTTAATTAAGAATCTTCTCGACAATATCAGTGATCTGCA



GAGGTTCTTGAAACCTCTTTGGGGTATGGGAGACGAACCCGATAAAGATGAAAGATTTTATGGAGA



GTATAATTATATCCGAGGAGCTCTAGATCAGGTGATCCCTCTGTACAATAAGGTAAGGAACTACCTC



ACTCGGAAGCCTTATTCGACCAGAAAAGTAAAACTCAATTTTGGGAATTCTCAATTGCTTAGTGGTT



GGGATAGAAATAAGGAAAAGGATAATAGCTGTGTGATTTTGCGTAAGGGGCAGAACTTCTATTTGG



CTATTATGAACAATAGGCACAAAAGAAGTTTCGAAAACAAGGTGTTGCCCGAGTATAAGGAGGGA



GAACCTTACTTCGAAAAGATGGATTATAAATTTTTGCCTGATCCTAATAAAATGCTTCCTAAGGTTT



TTCTTTCGAAAAAAGGAATAGAGATATACAAACCAAGTCCGAAGCTTTTAGAACAATATGGACATG



GAACTCACAAAAAGGGAGATACCTTTAGTATGGATGATTTGCACGAACTGATCGATTTCTTCAAAC



ACTCAATCGAGGCTCATGAAGATTGGAAGCAATTCGGATTCAAATTTTCTGATACGGCTACTTATGA



GAATGTATCTAGTTTCTATAGAGAAGTTGAGGATCAGGGGTATAAGCTCTCTTTCCGAAAAGTTTCG



GAATCTTATGTCTATTCATTAATAGATCAAGGCAAGTTGTATTTATTTCAGATATACAACAAGGACT



TTTCTCCCTGCAGCAAAGGGACACCTAATCTGCATACCTTGTATTGGAGAATGCTTTTTGACGAGCG



CAATTTGGCAGATGTCATATACAAACTGGATGGGAAGGCTGAAATCTTTTTCCGAGAGAAGAGTTT



GAAAAATGATCATCCCACGCATCCGGCTGGTAAGCCTATCAAAAAGAAAAGTCGACAAAAAAAAG



GAGAGGAGAGTCTGTTTGAGTATGATTTAGTCAAGGATAGGCACTATACGATGGATAAGTTCCAGT



TTCATGTGCCTATTACTATGAATTTTAAATGTTCTGCAGGAAGCAAAGTCAATGATATGGTTAATGC



TCATATTCGAGAGGCAAAGGATATGCATGTCATTGGAATTGATCGTGGAGAACGCAATCTGCTGTA



TATATGCGTGATAGATAGTCGAGGGACGATTTTGGATCAAATTTCTCTGAATACGATTAACGATATA



GACTATCATGATTTATTGGAGAGTCGAGACAAAGACCGTCAGCAGGAGCGCCGAAACTGGCAAACT



ATCGAAGGGATCAAGGAGCTAAAACAAGGCTACCTTAGTCAGGCGGTTCATCGGATAGCCGAACTG



ATGGTGGCTTATAAGGCTGTAGTTGCTTTGGAGGATTTGAATATGGGGTTCAAACGTGGGCGGCAG



AAAGTAGAAAGTTCTGTTTATCAGCAGTTTGAGAAACAGCTGATAGATAAGCTCAACTATCTTGTG



GACAAGAAGAAAAGGCCTGAAGATATTGGAGGATTGTTGAGAGCCTATCAATTTACGGCCCCATTT



AAGAGTTTTAAGGAAATGGGAAAGCAAAACGGCTTCTTGTTTTATATCCCGGCTTGGAACACGAGC



AACATAGATCCGACTACTGGATTTGTTAATTTATTTCATGCCCAGTATGAAAATGTAGATAAAGCGA



AGAGCTTCTTTCAAAAGTTTGATTCAATTAGTTACAACCCGAAGAAAGACTGGTTTGAGTTTGCATT



CGATTATAAAAACTTTACTAAAAAGGCTGAAGGAAGTCGTTCTATGTGGATATTATGCACACATGG



TTCCCGAATAAAGAATTTTAGAAATTCCCAGAAGAATGGTCAATGGGATTCCGAAGAATTCGCCTT



GACGGAGGCTTTTAAGTCTCTTTTTGTGCGATATGAGATAGATTATACCGCTGATTTGAAAACAGCT



ATTGTGGACGAAAAGCAAAAAGACTTCTTCGTGGATCTTCTGAAGCTATTCAAATTGACAGTACAG



ATGCGCAACAGCTGGAAAGAGAAGGATTTGGATTATCTAATCTCTCCTGTAGCAGGGGCTGATGGC



CGTTTCTTCGATACAAGAGAGGGAAATAAAAGTCTGCCTAAGGATGCAGATGCCAATGGAGCTTAT



AATATTGCCCTAAAAGGACTTTGGGCTCTACGCCAGATTCGGCAAACTTCAGAAGGCGGTAAACTC



AAATTGGCGATTTCCAATAAGGAATGGCTACAGTTTGTGCAAGAGAGATCTTACGAGAAAGACtga





SEQ ID NO: 27
atgaataatggaacaaataactttcagaattttatcggaatttcttctttgcagaagactcttaggaat



gctctcattccaaccgaaacaacacagcaatttattgttaaaaacggaataattaaagaagatgagcta



agaggagaaaatcgtcagatacttaaagatatcatggatgattattacagaggtttcatttcagaaact



ttatcgtcaattgatgatattgactggacttctttatttgagaaaatggaaattcagttaaaaaatgga



gataacaaagacactcttataaaagaacagactgaataccgtaaggcaattcataaaaaatttgcaaat



gatgatagatttaaaaatatgttcagtgcaaaattaatctcagatattcttcctgaatttgtcattcat



aacaataattattctgcatcagaaaaggaagaaaaaacacaggtaattaaattattttccagatttgca



acgtcattcaaggactattttaaaaacagggctaattgtttttcggctgatgatatatcttcatcttct



tgtcatagaatagttaatgataatgcagagatattttttagtaatgcattggtgtataggagaattgta



aaaagtctttcaaatgatgatataaataaaatatccggagatatgaaggattcattaaaggaaatgtct



ctggaagaaatttattcttatgaaaaatatggggaatttattacacaggaaggtatatctttttataat



gatatatgtggtaaagtaaattcatttatgaatttatattgccagaaaaataaagaaaacaaaaatctc



tataagctgcaaaagcttcataaacagatactgtgcatagcagatacttcttatgaggtgccgtataaa



tttgaatcagatgaagaggtttatcaatcagtgaatggatttttggacaatattagttcgaaacatatc



gttgaaagattgcgtaagattggagacaactataacggctacaatcttgataagatttatattgttagt



aaattctatgaatcagtttcacaaaagacatatagagattgggaaacaataaatactgcattagaaatt



cattacaacaatatattacccggaaatggtaaatctaaagctgacaaggtaaaaaaagcggtaaagaat



gatctgcaaaaaagcattactgaaatcaatgagcttgttagcaattataaattatgttcggatgataat



attaaagctgagacatatatacatgaaatatcacatattttgaataattttgaagcacaggagcttaag



tataatcctgaaattcatctggtggaaagtgaattgaaagcatctgaattaaaaaatgttctcgatgta



ataatgaatgcttttcattggtgttcggttttcatgacagaggagctggtagataaagataataatttt



tatgccgagttagaagagatatatgacgaaatatatccggtaatttcattgtataatcttgtgcgtaat



tatgtaacgcagaagccatatagtacaaaaaaaattaaattgaattttggtattcctacactagcggat



ggatggagtaaaagtaaagaatatagtaataatgcaattattctcatgcgtgataatttgtactattta



ggaatatttaatgcaaaaaataagcctgacaaaaagataattgaaggtaatacatcagaaaataaaggg



gattataagaagatgatttataatcttctgccaggaccaaataaaatgatccccaaggtattcctctct



tcaaaaaccggagtggaaacatataagccgtctgcctatatattggagggctataaacaaaacaagcat



attaaatcctctaaggattttgatataacattttgtcacgatttgattgattattttaagaactgtata



gcaatacatcctgaatggaagaattttggctttgatttttctgacacctccacatatgaagatatcagc



ggattttacagagaagtcgaattacaaggttataaaatcgactggacatatatcagcgaaaaggatatt



gatttgttgcaggaaaaaggacagttatatttattccaaatatataacaaagatttttccaagaaaagt



accggaaatgataatcttcatactatgtatttgaagaatttgtttagtgaagagaatttaaaggatatt



gtactgaaattaaacggtgaggcggaaatcttctttagaaaatcaagcataaagaatccaataattcat



aaaaaaggctctattcttgttaatagaacatatgaagcagaggaaaaagatcaatttggaaatatccag



atagtcagaaaaaacataccggaaaatatatatcaggagctttataaatatttcaatgataaaagtgat



aaagaactttcggatgaagcagctaagcttaagaatgtagtaggtcatcatgaggctgctacaaacata



gtaaaagattatagatatacatatgataaatattttcttcatatgcctattacaatcaattttaaagcc



aataagacaggctttattaatgacagaatattacaatatattgctaaagaaaaggatttgcatgtaata



ggcattgatcgtggtgaaagaaacctgatatatgtttcagtaattgatacttgtggaaatattgttgaa



caaaaatcgtttaacattgttaatggatatgattatcagattaagctcaagcagcaggagggggcgcga



caaatcgcacgaaaagaatggaaagaaatcggcaaaataaaagaaattaaagaaggctatttatctctt



gtaattcatgaaatttcaaagatggttattaaatataatgccataattgcaatggaggatttaagctac



ggatttaaaaaaggtcgtttcaaggttgagcgacaggtttaccagaagtttgagacaatgcttatcaac



aaactcaactatctggtatttaaagatatatccataacggaaaacggtggtcttctaaagggataccag



cttacatatattccagataaactgaaaaatgtgggtcatcaatgtggctgtatattttatgtacctgct



gcctatacatcaaaaatagatcctacaaccggatttgtaaatatattcaaatttaaagatttaacagtt



gatgcgaagagagaatttataaaaaaatttgacagtatcagatatgattcagaaaaaaatctgttttgt



tttacattcgattataataactttattacgcaaaatactgttatgtcaaagtcaagctggagtgtatat



acgtacggagttaggataaaaagaagatttgtcaatggcaggttctcaaatgaatcggatacaattgat



ataacaaaagatatggaaaaaacactcgaaatgacagatataaattggagagatggtcatgatctgagg



caggatattattgattatgaaatcgtacaacacatatttgagatttttagattgactgtacaaatgaga



aacagtttaagtgaattagaagacagggattatgaccgtttgatttctccggtgctcaatgaaaataat



atattttatgattcagctaaagcaggagatgcgttacctaaagacgcagatgctaatggtgcatattgt



atagctctaaaaggcttgtatgaaatcaaacaaattacagagaattggaaagaagacggtaagttttca



agagataaacttaaaatttccaataaggactggtttgactttattcaaaataaaaggtatttataa





SEQ ID NO: 28
atgacaaacaaatttacaaaccagtactcgctttccaaaacacttcgatttgagttgattccacaagga



aaaacattggaatttattcaagaaaaaggattgctctctcaagataaacaacgagcggagagttatcaa



gaaatgaaaaaaactattgataaatttcataaatactttatcgatttagctttaagcaatgctaaacta



actcatttagaaacttacttggaattatacaataaaagtgctgaaacaaaaaaagaacaaaaatttaaa



gacgatttaaagaaagtacaagacaatttacgaaaagaaatcgttaaatctttttcagatggtgatgca



aaatcaatttttgcaattttggataaaaaagaactgattaccgtagaacttgaaaaatggtttgaaaac



aacgaacaaaaagacatttattttgacgaaaaattcaaaacgtttactacttattttactggttttcat



caaaacagaaaaaacatgtattcggttgaacccaattctacagcaattgcttatcgattgattcatgaa



aatttacctaaatttttagaaaatgctaaagcatttgaaaaaataaaacaagtagaaagtttgcaagtt



aattttagagaattaatgggggaatttggagatgaagggctaattttcgtaaatgaattagaagaaatg



tttcaaatcaattattataatgatgtgctttcacaaaatggaattacaatttataatagtataatttca



ggatttaccaaaaatgatataaaatataaaggtctaaatgaatacataaataattacaatcaaaccaaa



gacaaaaaagaccgtttgccaaaattaaaacaattgtataaacagattttgagtgataggatttcactt



tcgtttttgcccgatgcttttacggatgggaaacaagttttgaaagccatatttgacttttataaaatc



aacttactttcttataccattgaaggacaggaagaaagccaaaatcttttactattaattcgtcagaca



attgaaaacctttctagttttgatacccaaaaaatttatctaaaaaatgatacccatttaaccactatt



tcacaacaagtatttggcgatttttcggtgttttcaactgctttaaattattggtatgaaactaaagta



aatccaaaatttgaaacggaatatagcaaagccaacgaaaaaaaacgagaaattttagataaagccaaa



gcggtatttacaaaacaagattatttttcaattgcttttttacaagaagtactttcggaatacattctt



accttagatcacacttctgatattgtaaaaaagcattcctccaactgtattgcggattattttaaaaat



cattttgtagccaaaaaagaaaatgaaaccgacaaaacctttgattttattgctaatattactgcaaaa



taccaatgtattcaaggtattttagaaaatgcagaccaatacgaagacgaactcaaacaagaccaaaaa



ttaattgataatttgaaattctttttagatgctattttagaattgttgcattttattaaacctttgcat



ttaaaatcagaaagcattaccgaaaaagacactgctttttatgatgtgtttgaaaattattacgaagca



ttgagtttgttgaccccattatataatatggtgcgaaactatgtaacgcaaaagccgtacagcaccgaa



aaaataaaattaaattttgaaaatgcacaattattgaatggttgggatgccaataaagaaggtgattac



ctaactaccattttgaaaaaagacggtaattattttttagccataatggataaaaagcataacaaagcg



tttcaaaagtttccagaaggaaaagaaaattatgaaaaaatggtgtataaactattgcctggagtaaat



aagatgttgccaaaagtatttttttccaataaaaatattgcttacttcaacccatcaaaagagttatta



gaaaactataaaaaagagacgcacaaaaaaggagacacattcaatttagaacattgtcatacgttgatc



gattttttcaaggactctttaaacaaacatgaagactggaaatactttgattttcaattttctgaaaca



aaatcgtatcaagatttgagtggtttttatagagaagtagaacatcaaggctacaaaatcaattttaaa



aatatcgattcagaatatattgatggtttggtgaacgaaggtaaattgtttctatttcaaatttacagc



aaagatttttcgcctttttccaaagggaaaccgaacatgcacactttgtattggaaagccttatttgaa



gaacaaaatttgcaaaatgtaatctataaattgaatggacaagccgaaatattttttagaaaagcctct



ataaaacctaaaaatataatattgcacaaaaagaaaattaaaattgccaaaaagcattttattgataaa



aaaacaaaaacatctgaaattgttcctgttcaaacaataaaaaacctcaatatgtactaccaaggaaaa



ataagtgaaaaagaattaacacaagatgatttaaggtatattgataattttagcattttcaatgaaaaa



aataaaacaattgatattataaaagacaaacgatttacggttgataaatttcagtttcatgtgccgatt



accatgaactttaaagcaacgggcggaagttatatcaatcaaaccgtattagaatatttgcaaaacaat



cccgaagttaagattattggattggatagaggcgaacgccatttggtatatctgacactgatagaccag



caaggaaacatcttgaaacaagaaagtttgaatacaatcaccgattctaaaatctcgacaccttatcat



aagttgttggataacaaggaaaacgagcgtgacttggctcgaaaaaattggggaacggtggaaaacatc



aaagaactcaaagaaggctacatcagtcaagtggtgcataaaattgctacgttgatgctggaagaaaat



gccattgtggtaatggaagatttgaattttggatttaaacgtggacgttttaaagtggaaaaacaaatt



tatcaaaagctggaaaaaatgttgattgacaaattgaattatttggttttaaaagacaaacaacctcag



gaattaggcggattgtacaacgcattacaactcaccaataaatttgaaagtttccaaaaaatgggtaaa



caatcgggctttttgttttatgtacccgcttggaacacctccaaaatagacccaaccacagggtttgtc



aattatttttataccaaatatgaaaatgttgacaaagccaaagccttttttgaaaaatttgaggcgatt



cgtttcaatgcagaaaagaagtattttgaatttgaagtaaaaaaatatagcgattttaacccaaaagcc



gaaggcactcaacaagcctggaccatttgcacgtatggcgaacgaatagaaaccaaacgacaaaaagac



caaaacaacaaatttgtaagcactccaattaatctaaccgaaaagatagaagactttttgggtaaaaac



caaattgtttatggtgatggtaattgcatcaaatctcaaattgctagcaaagacgacaaggcttttttt



gaaaccttattgtattggttcaaaatgactttacaaatgcgaaacagcgaaacaagaacagatatagat



tatctaatttcgcccgtgatgaatgacaacggaacattttacaacagccgagattatgaaaaattagaa



aatccaactttgcccaaagatgccgatgccaacggagcgtatcatattgccaaaaaaggattgatgctt



ttgaataaaatagaccaagccgacttgacaaaaaaagtggatttatctattagtaacagagattggttg



caatttgtacaaaaaaataaataa





SEQ ID NO: 29
atggaacaggagtactatttaggactggatatgggaaccggatctgtaggatgggctgttacagattcg



gaatatcatgtcttgcgtaaacatggaaaagcactatggggagtccgattatttgaaagtgcatcgaca



gcagaagaacgaagaatgttccgaacatcaagaagaagactagatcgaagaaactggagaattgaaatt



ttacaggaaatttttgcagaggaaataagtaagaaagatccaggatttttcttgcgaatgaaagaaagc



aaatattatccagaagataagcgagatatcaatggaaattgtccggaactgccatatgcattatttgtt



gatgacgattttacagataaagattatcataaaaaatttccgacaatttatcatctcaggaaaatgttg



atgaatacagaggagacaccggatatccggttggtgtatctggcaattcatcatatgatgaagcatagg



ggccatttcttgttatctggtgacattaatgagattaaggagttcggaacgacattttcaaaattgttg



gagaatatcaaaaatgaggaattggattggaatcttgaactgggaaaagaagaatatgctgttgtagaa



agtattttaaaagataacatgttaaaccgatccacaaagaaaaccagattaataaaagcattaaaagca



aaatcaatatgtgaaaaggctgtactgaatttattggctggtggaacggtgaaattgagtgatatattt



ggtcttgaagaattaaatgagacagaaagaccgaagatttcctttgctgataatggatacgatgattat



atcggagaagttgaaaatgagctgggagaacaattctatattatagagacggcaaaagcagtgtatgac



tgggcggtattagttgaaatattgggaaaatatacgtcaatttcagaagcgaaagtagcaacgtatgaa



aaacataaatcggatttacaatttttgaaaaagatagttcggaaatatctgacaaaggaggaatataaa



gatatttttgtaagtacgagtgacaaattgaaaaattactctgcttatataggaatgacgaaaataaat



ggaaaaaaggttgatttgcagagcaaacggtgcagtaaagaagaattctatgattttattaagaaaaac



gtacttaaaaagctagaaggacaacctgaatatgaatatttgaaagaagagctagaaagagaaacattt



ctaccaaaacaggtgaacagggataatggtgtaataccgtatcagattcatttgtacgagttgaaaaag



atattaggaaatttacgggataaaatagacctcattaaagagaacgaagataaactggttcaattattt



gaattcagaattccgtattatgttggtccgctgaataagatagatgacggaaaagagggaaaatttaca



tgggctgtacggaaaagtaatgaaaagatatatccatggaattttgaaaatgtagttgatatagaagca



agtgcagaaaaatttatccggagaatgacaaataagtgtacatatctgatgggcgaagatgtattgccg



aaggattcattgctttacagtaaatatatggttttaaatgaattaaataatgtaaagttggatggcgaa



aaattatctgtagaattgaaacaacggttgtatacagatgtattttgtaagtatcggaaagtaactgta



aagaagataaaaaattacttgaaatgtgaaggtatcatatccggcaatgtcgaaataactggaattgat



ggtgattttaaggcatcgttaacggcatatcatgattttaaagaaatcttgacaggaacagaattggct



aaaaaggacaaagaaaatattattaccaatatagtattgtttggagatgataaaaagctgctgaaaaag



agactgaatcgattatatcctcagattacgccgaatcagttgaagaaaatatgtgcgctatcctataca



ggctggggaagattttctaaaaagttcttagaagaaataacagctccagatccggaaacgggagaggta



tggaatatcattacggcattgtgggaatcgaataataatctgatgcaattattaagtaatgaatatcgg



tttatggaagaagtcgaaacatacaatatgggaaaacagactaaaacattgtcgtacgaaacagtagag



aatatgtatgtttctccatctgtgaaaagacagatatggcagacgctgaaaatcgtgaaagaattagaa



aaagtaatgaaagaatctccgaaacgtgtatttattgagatggcgagagaaaagcaagaaagtaagaga



accgaatcgcgtaaaaaacaactaatagatttgtataaggcttgtaaaaatgaagaaaaagattgggta



aaagaactgggagatcaggaagaacagaaattacgaagcgataagttgtacctatattatacgcaaaag



ggtcgttgtatgtattctggcgaggtaatagaactgaaagacttatgggataatacaaaatatgatatt



gatcatatatatccacaatctaaaacgatggatgacagtcttaataatcgcgtattggtaaaaaagaaa



tataatgcaacaaaatcagataagtatccattaaatgaaaatatacgacatgagagaaaaggcttttgg



aagtcactgttagatggagggtttataagtaaagaaaaatatgaacgcttaataagaaatacagaattg



agtccggaagaattagcaggatttattgaaaggcagattgttgaaacgaggcagagtacaaaagctgta



gcggaaatattaaagcaagtgtttccggaaagtgaaattgtatatgtcaaagcaggtacggtttcaaga



ttcagaaaagattttgaattactgaaagttcgagaagtgaatgatttgcatcacgcaaaggatgcgtat



ttaaatattgtagttggtaatagttattatgtgaaatttactaagaatgcatcatggtttataaaagaa



aatccgggacgtacttacaacttaaaaaagatgtttacatcaggttggaatattgaacgaaatggagaa



gttgcatgggaagtcgggaaaaaaggaacaattgtaacggtaaaacaaataatgaataaaaataatata



ttggtgacaagacaggttcatgaagcgaaaggtgggctgtttgatcagcagattatgaaaaaaggaaaa



ggtcagattgctataaaggaaactgatgaacgtcttgcatcaatagaaaagtatggaggctataataaa



gctgccggggcatattttatgctggtagaatctaaagataaaaaaggaaaaacaattcgaacgatagaa



tttataccattatatttaaagaataaaatcgagtcggatgaatcaatagcattgaactttttagaaaaa



ggcagaggtttgaaagaaccaaagatactattgaaaaaaattaagattgatacattatttgatgtggac



ggattcaaaatgtggttgtctggaagaacaggggacagactactatttaaatgtgcaaatcaattgatt



ttggatgagaaaataattgtaacaatgaaaaaaattgtaaagtttattcaaaggagacaagaaaataga



gaattaaaattatctgataaagatggaattgataatgaagtacttatggaaatatataacacttttgtg



gataagttagaaaacacagtgtatagaatacgattatccgaacaggcaaaaacgcttatagataaacaa



aaagaatttgaaaggttatcactagaggataaaagtagtactttgtttgaaattttacatatttttcag



tgtcaaagtagtgcggccaatttaaaaatgataggcggacctggaaaagcaggaatattagttatgaat



aataatataagtaagtgtaacaaaatttctattataaatcagtctccaacaggaattttcgaaaatgag



attgatttgttaaagat





SEQ ID NO: 30
ATGAAATCTTTCGATTCATTCACAAATCTTTATTCTCTTTCAAAAACCTTGAAATTTGAGATGAGACC



TGTCGGAAATACCCAAAAAATGCTCGACAATGCAGGAGTATTTGAAAAAGACAAACTAATTCAAAA



AAAGTACGGAAAAACAAAGCCGTATTTCGACAGACTCCACAGAGAATTTATAGAAGAAGCGCTCA



CGGGGGTAGAGCTAATAGGACTAGATGAGAACTTTAGGACACTTGTTGACTGGCAAAAAGATAAG



AAAAATAATGTCGCAATGAAAGCGTATGAAAATAGTTTGCAGCGGCTGAGAACGGAAATAGGTAA



AATATTTAACCTAAAGGCTGAGGATTGGGTAAAGAACAAATATCCAATATTAGGGCTGAAAAATAA



AAATACCGATATTTTATTCGAAGAGGCTGTATTCGGGATATTGAAAGCCCGATATGGAGAAGAAAA



AGATACTTTTATAGAAGTAGAGGAAATAGATAAAACCGGCAAATCAAAGATCAATCAAATATCAAT



TTTCGATAGTTGGAAAGGATTTACAGGATATTTCAAAAAATTTTTTGAAACCAGAAAGAATTTTTAC



AAAAACGACGGAACTTCTACAGCAATTGCTACAAGGATCATTGATCAAAATCTGAAAAGATTCATA



GATAATCTGTCAATAGTTGAAAGTGTGAGACAAAAGGTTGATCTCGCCGAGACAGAAAAATCTTTC



AGCATATCTCTATCGCAATTCTTCTCAATAGACTTTTATAACAAGTGTCTCCTTCAAGATGGTATTGA



TTACTACAACAAGATAATCGGTGGAGAAACTCTCAAAAATGGCGAAAAACTAATAGGTCTCAATGA



ACTAATAAATCAATATAGGCAGAATAATAAGGATCAGAAAATCCCATTTTTCAAACTTCTTGATAA



ACAAATTTTGAGTGAAAAGATATTATTTTTGGATGAAATAAAAAATGACACAGAACTGATCGAGGC



GCTGAGTCAGTTCGCAAAAACAGCCGAAGAAAAAACAAAAATTGTCAAAAAGCTTTTTGCCGATTT



TGTAGAAAATAATTCCAAATACGATCTTGCACAGATTTATATTTCCCAAGAAGCATTCAATACTATA



TCAAACAAGTGGACAAGCGAAACTGAGACGTTCGCTAAATATCTATTCGAAGCAATGAAGAGTGGA



AAACTTGCAAAGTATGAGAAAAAAGATAATAGCTATAAATTTCCTGATTTTATTGCCCTTTCACAGA



TGAAGAGTGCTTTATTAAGTATCAGCCTTGAGGGACATTTTTGGAAAGAGAAATACTACAAAATTTC



AAAATTCCAAGAGAAGACCAATTGGGAGCAGTTTCTTGCAATTTTTCTATACGAGTTTAACTCTCTT



TTCAGCGACAAAATAAATACAAAAGATGGAGAAACAAAGCAAGTTGGATACTATCTATTTGCCAAA



GACCTGCATAATCTTATCTTAAGTGAGCAGATTGATATTCCAAAAGATTCAAAAGTCACAATAAAA



GATTTTGCCGATTCTGTACTCACAATCTACCAAATGGCAAAATATTTTGCGGTAGAAAAAAAACGA



GCGTGGCTTGCCGAGTATGAACTAGATTCATTTTATACCCAGCCAGACACAGGCTATTTACAGTTTT



ATGATAACGCCTACGAGGATATTGTGCAGGTATACAACAAGCTTCGAAACTATCTGACCAAAAAGC



CATATAGCGAGGAGAAATGGAAGTTGAATTTTGAAAATTCTACGCTGGCAAATGGATGGGATAAGA



ACAAAGAATCTGATAATTCAGCAGTTATTCTACAAAAAGGTGGAAAATATTATTTGGGACTGATTA



CTAAAGGACACAACAAAATTTTTGATGACCGTTTTCAAGAAAAATTTATTGTGGGAATTGAAGGTG



GAAAATATGAAAAAATAGTCTATAAATTTTTCCCCGACCAGGCAAAAATGTTTCCCAAAGTGTGCTT



TTCTGCAAAAGGACTCGAATTTTTTAGACCGTCTGAAGAAATTTTAAGAATTTATAACAATGCAGAG



TTTAAAAAAGGAGAAACTTATTCAATAGATAGTATGCAGAAGTTGATTGATTTTTATAAAGATTGCT



TGACTAAATATGAAGGCTGGGCATGTTATACCTTTCGGCATCTAAAACCCACAGAAGAATACCAAA



ACAATATTGGAGAGTTTTTTCGAGATGTTGCAGAGGACGGATACAGGATTGATTTTCAAGGCATTTC



AGATCAATATATTCATGAAAAAAACGAGAAAGGCGAACTTCACCTTTTTGAAATCCACAATAAAGA



TTGGAATTTGGATAAGGCACGAGACGGAAAGTCAAAAACAACACAAAAAAACCTTCATACACTCTA



TTTCGAATCGCTCTTTTCAAACGATAATGTTGTTCAAAACTTTCCAATAAAACTCAATGGTCAAGCT



GAAATTTTTTATAGACCGAAAACGGAAAAAGACAAATTAGAATCAAAAAAAGATAAGAAAGGGAA



TAAAGTGATTGACCATAAACGCTATAGTGAGAATAAGATTTTTTTTCATGTTCCTCTCACACTAAAC



CGCACTAAAAATGACTCATATCGCTTTAATGCTCAAATCAACAACTTTCTCGCAAATAATAAAGATA



TCAACATCATCGGTGTAGATAGGGGAGAAAAGCATTTAGTCTATTATTCGGTGATTACACAAGCTA



GTGACATCTTAGAAAGTGGCTCACTAAATGAGCTAAATGGCGTGAATTATGCTGAAAAACTGGGAA



AAAAGGCAGAAAATCGAGAACAAGCACGCAGAGACTGGCAAGACGTACAAGGGATCAAAGACCTC



AAGAAAGGATATATTTCACAGGTGGTGCGAAAGCTTGCTGATTTAGCAATTAAACACAATGCCATT



ATCATTCTTGAAGATTTGAATATGAGATTTAAACAAGTTCGGGGCGGTATCGAAAAATCCATTTATC



AACAGTTAGAAAAAGCACTGATAGATAAATTAAGCTTTCTTGTAGACAAAGGTGAAAAAAATCCCG



AGCAAGCAGGACATCTTCTGAAAGCATATCAGCTTTCGGCCCCATTTGAGACATTTCAAAAAATGG



GCAAACAGACGGGTATAATCTTTTATACACAAGCTTCGTATACCTCAAAAAGTGACCCTGTAACAG



GTTGGCGACCACACCTGTATCTCAAATATTTCAGTGCCAAAAAAGCAAAAGACGATATTGCAAAGT



TTACAAAAATAGAATTTGTAAACGATAGGTTTGAGCTTACCTATGATATAAAGGACTTTCAGCAAG



CAAAAGAATATCCAAATAAAACTGTTTGGAAAGTTTGCTCAAATGTAGAGAGATTCAGGTGGGACA



AAAACCTCAATCAAAACAAAGGCGGATATACTCACTACACAAATATAACTGAGAATATCCAAGAGC



TTTTTACAAAATATGGAATTGATATCACAAAAGATTTGCTCACACAGATTTCTACAATTGATGAAAA



ACAAAATACCTCATTTTTTAGAGATTTTATTTTTTATTTCAACCTTATTTGCCAAATCAGAAATACCG



ATGATTCTGAGATTGCTAAAAAGAATGGGAAAGATGATTTTATACTGTCACCTGTTGAGCCGTTTTT



CGATAGCCGAAAAGACAATGGAAATAAACTTCCTGAGAATGGAGATGATAACGGCGCGTATAACA



TAGCAAGAAAAGGGATTGTCATACTCAACAAAATCTCACAATATTCAGAGAAAAACGAAAATTGCG



AGAAAATGAAATGGGGGGATTTGTATGTATCAAACATTGACTGGGACAATTTTGTAACCCAAGCTA



ATGCACGGCATTAA





SEQ ID NO: 31
ATGATTATCTTATATATTAGTACCTCGAATATGAACATGGAAGGAGTATTTATGGAAAATTTTAAAA



ACTTGTATCCAATAAACAAAACACTTCGATTTGAATTAAGACCCTATGGAAAAACATTGGAAAATT



TTAAAAAATCCGGACTTTTAGAAAAAGATGCCTTTAAGGCAAATAGTAGACGAAGTATGCAAGCTA



TAATCGATGAAAAATTCAAAGAGACTATCGAAGAACGCTTAAAGTACACTGAATTCAGTGAATGTG



ATCTTGGAAACATGACATCAAAAGATAAAAAAATAACTGATAAAGCAGCTACAAATTTAAAAAAG



CAAGTTATCTTATCTTTTGACGATGAAATATTTAATAATTACCTAAAACCTGATAAAAATATTGACG



CATTATTTAAAAATGATCCTTCAAATCCTGTAATCTCTACATTTAAAGGTTTTACGACATATTTTGTG



AATTTTTTTGAAATTCGAAAACATATTTTCAAGGGAGAATCATCAGGCTCAATGGCATACCGAATTA



TAGATGAAAACCTGACAACATACTTGAATAATATTGAAAAAATAAAAAAACTGCCAGAAGAATTA



AAATCACAGCTAGAAGGCATTGATCAGATTGATAAACTTAATAATTATAATGAGTTCATTACACAG



TCAGGTATAACACACTATAATGAAATCATCGGCGGTATATCAAAATCAGAGAATGTCAAAATACAG



GGAATTAATGAAGGAATTAATCTATACTGTCAGAAGAACAAAGTTAAACTTCCTCGACTGACTCCG



CTATACAAAATGATATTATCAGACAGAGTTTCCAACTCTTTTGTATTAGACACTATTGAAAATGACA



CAGAATTAATTGAAATGATAAGTGATTTGATTAATAAGACTGAGATTTCGCAAGATGTTATAATGTC



AGATATTCAAAATATTTTCATAAAATACAAACAACTTGGTAATTTGCCGGGTATCTCATATTCTTCA



ATAGTTAATGCTATTTGCTCGGATTATGACAACAATTTCGGAGATGGGAAGCGAAAAAAATCTTAC



GAAAATGATCGCAAAAAGCATTTGGAGACTAATGTATACTCCATAAATTATATTTCTGAATTGCTTA



CAGATACCGATGTTTCATCAAATATCAAGATGAGATATAAAGAGCTTGAGCAAAATTATCAGGTTT



GCAAAGAAAATTTTAATGCCACAAACTGGATGAATATTAAAAATATAAAACAATCTGAAAAAACA



AACCTTATTAAAGATTTGTTAGATATACTTAAATCGATTCAACGTTTCTATGATTTGTTTGATATTGT



TGACGAAGATAAAAATCCAAGTGCTGAATTTTATACCTGGTTATCAAAAAATGCTGAAAAGCTTGA



CTTTGAATTCAATTCTGTATATAACAAGTCACGAAACTATCTCACCAGGAAACAATACTCTGATAAA



AAAATCAAGCTGAATTTTGATTCTCCAACATTGGCCAAAGGGTGGGATGCTAACAAAGAAATAGAT



AACTCCACGATTATAATGCGTAAATTTAATAATGACAGAGGCGATTATGATTACTTCCTTGGCATAT



GGAATAAATCCACACCTGCAAATGAAAAAATAATCCCACTGGAGGATAATGGATTATTCGAAAAAA



TGCAATATAAGCTGTATCCAGATCCTAGTAAGATGTTACCGAAACAATTTCTATCAAAAATATGGA



AGGCAAAGCATCCTACGACACCTGAATTTGATAAAAAATATAAAGAGGGAAGACATAAAAAAGGT



CCTGATTTCGAAAAAGAATTCCTGCATGAATTGATTGATTGCTTCAAACATGGTCTTGTTAATCACG



ATGAAAAATATCAGGATGTTTTTGGCTTCAATCTCCGTAACACTGAAGATTATAATTCATATACAGA



GTTTCTCGAAGATGTGGAAAGATGCAATTACAATCTTTCATTTAACAAAATTGCTGATACTTCAAAC



CTTATTAATGATGGGAAATTGTATGTATTTCAGATATGGTCAAAAGACTTTTCTATTGATTCAAAAG



GTACTAAAAACTTGAATACAATCTATTTTGAATCACTATTTTCAGAAGAAAACATGATAGAAAAAA



TGTTCAAGCTTTCTGGAGAGGCTGAGATATTCTATCGACCAGCATCGTTGAATTATTGTGAAGATAT



CATAAAAAAAGGTCATCACCATGCAGAATTAAAAGATAAGTTTGACTATCCTATAATAAAAGATAA



GCGATATTCACAAGATAAGTTTTTCTTTCATGTGCCAATGGTTATAAATTATAAATCTGAGAAACTG



AATTCCAAAAGCCTTAACAACCGAACAAATGAAAACCTGGGACAGTTTACACATATTATAGGTATA



GACAGGGGCGAGCGGCACTTGATTTATTTAACTGTTGTTGATGTTTCCACTGGTGAAATCGTTGAAC



AGAAACATCTGGACGAAATTATCAATACTGATACCAAGGGAGTTGAACACAAAACCCATTATTTGA



ATAAATTGGAAGAAAAATCTAAAACAAGAGATAACGAGCGTAAATCATGGGAAGCTATTGAAACT



ATCAAAGAATTAAAAGAAGGCTATATTTCTCATGTAATTAATGAAATACAAAAGCTGCAAGAAAAA



TATAATGCCTTAATCGTAATGGAAAATCTTAACTATGGGTTCAAAAACTCACGAATCAAAGTTGAA



AAACAGGTTTATCAAAAATTCGAGACAGCATTGATTAAAAAGTTCAATTATATTATTGATAAAAAA



GATCCAGAAACCTATATACATGGTTACCAGCTTACAAATCCTATTACCACTCTGGATAAGATTGGAA



ATCAATCTGGAATAGTGCTGTATATTCCTGCGTGGAATACTTCTAAGATAGATCCCGTCACAGGATT



TGTAAACCTTCTGTACGCAGATGATTTGAAGTATAAAAATCAGGAGCAGGCCAAATCATTCATTCA



GAAAATAGACAACATATATTTTGAAAATGGAGAGTTTAAATTTGATATTGATTTTTCCAAATGGAAT



AATCGCTACTCAATAAGTAAAACTAAATGGACGTTAACAAGTTATGGGACTCGCATCCAGACATTT



AGAAATCCCCAGAAAAACAATAAGTGGGATTCTGCTGAATATGATTTGACAGAAGAGTTTAAATTA



ATTTTAAATATAGACGGAACGTTAAAGTCACAGGACGTAGAAACATACAAAAAATTCATGTCTTTA



TTTAAACTAATGCTACAGCTTCGAAACTCTGTTACAGGAACCGACATTGATTATATGATCTCTCCTG



TCACTGATAAAACAGGAACACATTTCGATTCAAGAGAAAATATTAAAAATCTTCCTGCCGATGCAG



ATGCCAATGGTGCCTACAACATTGCGCGCAAAGGAATAATGGCTATTGAAAATATAATGAACGGTA



TAAGCGATCCACTAAAAATAAGCAACGAAGACTATTTAAAGTATATTCAGAATCAACAGGAATAA





SEQ ID NO: 32
ATGACCCAATTTGAAGGTTTTACCAATTTATACCAAGTTTCGAAGACCCTTCGTTTTGAACTGATTCC



CCAAGGAAAAACACTCAAACATATCCAGGAGCAAGGGTTCATTGAGGAGGATAAAGCTCGCAATG



ACCATTACAAAGAGTTAAAACCAATCATTGACCGCATCTATAAGACTTATGCTGATCAATGTCTCCA



ACTGGTACAGCTTGACTGGGAGAATCTATCTGCAGCCATAGACTCCTATCGTAAGGAAAAAACCGA



AGAAACACGAAATGCGCTGATTGAGGAGCAAGCAACATATAGAAATGCGATTCATGACTACTTTAT



AGGTCGGACGGATAATCTGACAGATGCCATAAATAAGCGCCATGCTGAAATCTATAAAGGACTTTT



TAAAGCTGAACTTTTCAATGGAAAAGTTTTAAAGCAATTAGGGACCGTAACCACGACAGAACATGA



AAATGCTCTACTCCGTTCGTTTGACAAATTTACGACCTATTTTTCCGGCTTTTATGAAAACCGAAAA



AATGTCTTTAGCGCTGAAGATATCAGCACGGCAATTCCCCATCGAATCGTCCAGGACAATTTCCCTA



AATTTAAGGAAAACTGCCATATTTTTACAAGATTGATAACCGCAGTTCCTTCTTTGCGGGAGCATTT



TGAAAATGTCAAAAAGGCCATTGGAATCTTTGTTAGTACGTCTATTGAAGAAGTCTTTTCCTTTCCC



TTTTATAATCAACTTCTAACCCAAACGCAAATTGATCTTTATAATCAACTTCTCGGCGGCATATCTA



GGGAAGCAGGCACAGAAAAAATCAAGGGACTTAATGAAGTTCTCAATCTGGCTATCCAAAAAAAT



GATGAAACAGCCCATATAATCGCGTCCCTGCCGCATCGTTTTATTCCTCTTTTTAAACAAATTCTTTC



CGATCGAAATACGTTATCCTTTATTTTGGAAGAATTCAAAAGCGATGAGGAAGTCATCCAATCCTTC



TGCAAATATAAAACCCTCTTGAGAAACGAAAATGTACTGGAGACTGCAGAAGCCCTTTTCAATGAA



TTAAATTCCATTGATTTGACTCATATCTTTATTTCCCATAAAAAGTTAGAAACCATCTCTTCAGCGCT



TTGTGACCATTGGGATACCTTGCGCAATGCACTTTACGAAAGACGGATTTCTGAACTCACTGGCAAA



ATAACAAAAAGTGCCAAAGAAAAAGTTCAAAGGTCATTAAAACATGAGGATATAAATCTCCAAGA



AATTATTTCTGCTGCAGGAAAAGAACTATCAGAAGCATTCAAACAAAAAACAAGTGAAATTCTTTC



CCATGCCCATGCTGCACTTGACCAGCCTCTTCCCACAACATTAAAAAAACAGGAAGAAAAAGAAAT



CCTCAAATCACAGCTCGATTCGCTTTTAGGCCTTTATCATCTTCTTGATTGGTTTGCTGTCGATGAAA



GCAATGAAGTCGACCCAGAATTCTCAGCACGGCTGACAGGCATTAAACTAGAAATGGAACCAAGCC



TTTCGTTTTATAATAAAGCAAGAAATTATGCGACAAAAAAGCCCTATTCGGTGGAAAAATTTAAATT



GAATTTTCAAATGCCAACCCTTGCCTCTGGTTGGGATGTCAATAAAGAAAAAAATAATGGAGCTAT



TTTATTCGTAAAAAATGGTCTCTATTACCTTGGTATCATGCCTAAACAGAAGGGGCGCTATAAAGCC



CTGTCTTTTGAGCCGACAGAAAAAACATCAGAAGGATTCGATAAGATGTACTATGACTACTTCCCA



GATGCCGCAAAAATGATTCCTAAGTGTTCCACTCAGCTAAAGGCTGTAACCGCTCATTTTCAAACTC



ATACCACCCCCATTCTTCTCTCAAATAATTTCATTGAACCTCTTGAAATCACAAAAGAAATTTATGA



CCTGAACAATCCTGAAAAGGAGCCTAAAAAGTTTCAAACGGCTTATGCAAAGAAGACAGGCGATC



AAAAAGGCTATAGAGAAGCGCTTTGCAAATGGATTGACTTTACGCGGGATTTTCTCTCTAAATATAC



GAAAACAACTTCAATCGATTTATCTTCACTCCGCCCTTCTTCGCAATATAAAGATTTAGGGGAATAT



TACGCCGAACTGAATCCGCTTCTCTATCATATCTCCTTCCAACGAATTGCTGAAAAGGAAATCATGG



ATGCTGTAGAAACGGGAAAATTGTATCTGTTCCAAATCTACAATAAGGATTTTGCGAAGGGCCATC



ACGGGAAACCAAATCTCCACACCCTGTATTGGACAGGTCTCTTCAGTCCTGAAAACCTTGCGAAAA



CCAGCATCAAACTTAATGGTCAAGCAGAATTGTTCTATCGACCTAAAAGCCGCATGAAGCGGATGG



CCCATCGTCTTGGGGAAAAAATGCTGAACAAAAAACTAAAGGACCAGAAGACACCGATTCCAGAT



ACCCTCTACCAAGAACTGTACGATTATGTCAACCACCGGCTAAGCCATGATCTTTCCGATGAAGCAA



GGGCCCTGCTTCCAAATGTTATCACCAAAGAAGTCTCCCATGAAATTATAAAGGATCGGCGGTTTAC



TTCCGATAAATTTTTCTTCCATGTTCCCATTACACTGAATTATCAAGCAGCCAATAGTCCCAGTAAAT



TCAACCAGCGTGTCAATGCCTACCTTAAGGAGCATCCGGAAACGCCCATCATTGGTATCGATCGTG



GAGAACGCAATCTAATCTATATTACCGTCATTGACAGTACTGGGAAAATTTTGGAGCAGCGTTCCCT



GAATACCATCCAGCAATTTGACTACCAAAAAAAATTGGACAACAGGGAAAAAGAGCGTGTTGCCG



CCCGTCAAGCCTGGTCCGTCGTCGGAACGATCAAAGACCTTAAACAAGGCTACTTGTCACAGGTCA



TCCATGAAATTGTAGACCTGATGATTCATTACCAAGCTGTTGTCGTCCTTGAAAACCTCAACTTCGG



ATTTAAATCAAAACGGACAGGCATTGCCGAAAAAGCAGTCTACCAACAATTTGAAAAGATGCTAAT



AGATAAACTCAACTGTTTGGTTCTCAAAGATTATCCTGCTGAGAAAGTGGGAGGCGTCTTAAACCC



GTATCAACTTACAGATCAGTTCACGAGCTTTGCAAAAATGGGCACGCAAAGCGGCTTCCTTTTCTAT



GTACCGGCCCCTTATACCTCAAAGATTGATCCCCTGACTGGTTTTGTCGATCCCTTTGTATGGAAGA



CCATTAAAAATCATGAAAGTCGGAAGCATTTCCTAGAAGGATTTGATTTCCTGCATTATGATGTCAA



AACAGGTGATTTTATCCTCCATTTTAAAATGAATCGGAATCTCTCTTTCCAGAGAGGGCTTCCTGGC



TTCATGCCAGCTTGGGATATTGTTTTCGAAAAGAATGAAACCCAATTTGATGCAAAAGGGACGCCC



TTCATTGCAGGAAAACGAATTGTTCCTGTAATCGAAAATCATCGTTTTACGGGTCGTTACAGAGACC



TCTATCCCGCTAATGAACTCATTGCCCTTCTGGAAGAAAAAGGCATTGTCTTTAGAGACGGAAGTAA



TATATTACCCAAACTTTTAGAAAATGATGATTCTCATGCAATTGATACGATGGTCGCCTTGATTCGC



AGTGTACTCCAAATGAGAAACAGCAATGCCGCAACGGGGGAAGACTACATCAACTCTCCCGTTAGG



GATCTGAACGGGGTGTGTTTCGACAGTCGATTCCAAAATCCAGAATGGCCAATGGATGCGGATGCC



AACGGAGCTTATCATATTGCCTTAAAAGGGCAGCTTCTTCTGAACCACCTCAAAGAAAGCAAAGAT



CTGAAATTACAAAACGGCATCAGCAACCAAGATTGGCTGGCCTACATTCAGGAACTGAGAAACTGA





SEQ ID NO: 33
ATGGCCGTCAAATCCATCAAAGTGAAACTTCGTCTCGACGATATGCCGGAGATTCGGGCCGGTCTA



TGGAAACTTCATAAGGAAGTCAATGCGGGGGTTCGATATTACACGGAATGGCTCAGTCTTCTCCGTC



AAGAGAACTTGTATCGAAGAAGTCCGAATGGGGACGGAGAGCAAGAATGTGATAAGACTGCAGAA



GAATGCAAAGCCGAATTGTTGGAGCGGCTGCGCGCGCGTCAAGTGGAGAATGGACACCGTGGTCCG



GCGGGATCGGACGATGAATTGCTGCAGTTGGCGCGTCAACTCTATGAGTTGTTGGTTCCGCAGGCG



ATAGGTGCGAAAGGCGACGCGCAGCAAATTGCCCGCAAATTTTTGAGCCCCTTGGCCGACAAGGAC



GCAGTTGGTGGGCTTGGAATCGCGAAGGCGGGGAACAAACCGCGGTGGGTTCGCATGCGCGAAGC



GGGGGAACCAGGCTGGGAAGAGGAGAAGGAGAAGGCTGAGACGAGGAAATCTGCGGATCGGACT



GCGGATGTTTTGCGCGCGCTCGCGGATTTTGGGTTAAAGCCACTGATGCGCGTATACACCGATTCTG



AGATGTCATCGGTGGAGTGGAAACCGCTTCGGAAGGGACAAGCCGTTCGGACGTGGGATAGGGAC



ATGTTCCAACAAGCTATCGAACGGATGATGTCGTGGGAGTCGTGGAATCAGCGCGTTGGGCAAGAG



TACGCGAAACTCGTAGAACAAAAAAATCGATTTGAGCAGAAGAATTTCGTCGGCCAGGAACATCTG



GTCCATCTCGTCAATCAGTTGCAACAAGATATGAAAGAAGCATCGCCCGGACTCGAATCGAAAGAG



CAAACCGCGCACTATGTGACGGGACGGGCATTGCGCGGATCGGACAAGGTATTTGAGAAGTGGGG



GAAACTCGCCCCCGATGCACCTTTCGATTTGTACGACGCCGAAATCAAGAATGTGCAGAGACGTAA



CACGAGACGATTCGGATCACATGACTTGTTCGCAAAATTGGCAGAGCCAGAGTATCAGGCCCTGTG



GCGCGAAGATGCTTCGTTTCTCACGCGTTACGCGGTGTACAACAGCATCCTTCGCAAACTGAATCAC



GCCAAAATGTTCGCGACGTTTACTTTGCCGGATGCAACGGCGCACCCGATTTGGACTCGCTTCGATA



AATTGGGTGGGAATTTGCACCAGTACACCTTTTTGTTCAACGAATTTGGAGAACGCAGGCACGCGA



TTCGTTTTCACAAGCTATTGAAAGTCGAGAATGGTGTCGCAAGAGAAGTTGATGATGTCACCGTGCC



CATTTCAATGTCAGAGCAATTGGATAATCTGCTTCCCAGAGATCCCAATGAACCGATTGCGCTATAT



TTTCGAGATTACGGAGCCGAACAGCATTTCACAGGTGAATTTGGTGGCGCGAAGATCCAGTGCCGC



CGGGATCAGCTGGCTCATATGCACCGACGCAGAGGGGCGAGGGATGTTTATCTCAATGTCAGCGTA



CGTGTGCAGAGTCAGTCTGAGGCGCGGGGAGAACGTCGCCCGCCGTATGCGGCAGTATTTCGTCTG



GTCGGGGACAACCATCGCGCGTTTGTCCATTTCGATAAACTATCGGATTATCTTGCGGAACATCCGG



ATGATGGGAAGCTCGGGTCGGAGGGGTTGCTTTCCGGGCTGCGGGTGATGAGTGTCGATCTCGGCC



TTCGCACATCTGCATCGATTTCCGTTTTTCGCGTTGCCCGGAAGGACGAGTTGAAGCCGAACTCAAA



AGGTCGTGTACCGTTTTTCTTTCCGATAAAAGGGAATGACAATCTCGTCGCGGTTCATGAGCGATCA



CAACTCTTGAAGCTGCCTGGCGAAACGGAGTCGAAGGACCTGCGTGCTATCCGAGAAGAACGCCAA



CGGACATTGCGGCAGTTGCGGACGCAACTGGCGTATTTGCGGCTGCTCGTGCGGTGTGGGTCGGAA



GATGTGGGGCGGCGTGAACGGAGTTGGGCAAAGCTTATCGAGCAGCCGGTGGATGCGGCCAATCA



CATGACACCGGATTGGCGCGAGGCTTTTGAAAACGAACTTCAGAAGCTTAAGTCACTCCATGGTAT



CTGTAGCGACAAGGAATGGATGGATGCTGTCTACGAGAGCGTTCGCCGCGTGTGGCGTCACATGGG



CAAACAGGTTCGCGATTGGCGAAAGGACGTACGAAGCGGAGAGCGGCCCAAGATTCGCGGCTATG



CGAAAGACGTGGTCGGTGGAAACTCGATTGAGCAAATCGAGTATCTGGAACGTCAGTACAAGTTCC



TCAAGAGTTGGAGCTTCTTTGGTAAGGTGTCGGGACAAGTGATTCGTGCGGAGAAGGGATCTCGTT



TTGCGATCACGCTGCGCGAACACATTGATCACGCGAAGGAAGATCGGCTGAAGAAATTGGCGGATC



GCATCATTATGGAGGCTCTCGGCTATGTGTACGCGTTGGATGAGCGCGGCAAAGGAAAGTGGGTTG



CGAAGTATCCGCCGTGCCAGCTCATCCTGCTGGAGGAATTGAGCGAGTACCAGTTCAATAACGACA



GGCCTCCGAGCGAAAACAACCAGTTGATGCAATGGAGTCATCGCGGCGTGTTCCAGGAGTTGATAA



ATCAGGCCCAAGTCCATGATTTACTCGTTGGGACGATGTATGCAGCGTTCTCGTCGCGATTCGACGC



GCGAACTGGGGCACCGGGTATCCGCTGTCGCCGGGTTCCGGCGCGTTGCACCCAGGAGCACAATCC



AGAACCATTTCCTTGGTGGCTGAACAAGTTTGTGGTGGAACATACGTTGGATGCTTGTCCCCTACGC



GCAGACGACCTCATCCCAACGGGTGAAGGAGAGATTTTTGTCTCGCCGTTCAGCGCGGAGGAGGGG



GACTTTCATCAGATTCACGCCGACCTGAATGCGGCGCAAAATCTGCAGCAGCGACTCTGGTCTGATT



TTGATATCAGTCAAATTCGGTTGCGGTGTGATTGGGGTGAAGTGGACGGTGAACTCGTTCTGATCCC



AAGGCTTACAGGAAAACGAACGGCGGATTCATATAGCAACAAGGTGTTTTATACCAATACAGGTGT



CACCTATTATGAGCGAGAGCGGGGGAAGAAGCGGAGAAAGGTTTTCGCGCAAGAGAAATTGTCGG



AGGAAGAGGCGGAGTTGCTCGTGGAAGCAGACGAGGCGAGGGAGAAATCGGTCGTTTTGATGCGT



GATCCGTCTGGCATCATCAATCGGGGAAATTGGACCAGGCAAAAGGAATTTTGGTCGATGGTGAAC



CAGCGGATCGAAGGATACTTGGTCAAGCAGATTCGCTCGCGCGTTCCATTACAAGATAGTGCGTGT



GAAAACACGGGGGATATTTAA





SEQ ID NO: 34
ATGGCGACACGCAGTTTTATTTTAAAAATTGAACCAAATGAAGAAGTTAAAAAGGGATTATGGAAG



ACGCATGAGGTATTGAATCATGGAATTGCCTACTACATGAATATTCTGAAACTAATTAGACAGGAA



GCTATTTATGAACATCATGAACAAGATCCTAAAAATCCGAAAAAAGTTTCAAAAGCAGAAATACAA



GCCGAGTTATGGGATTTTGTTTTAAAAATGCAAAAATGTAATAGTTTTACACATGAAGTTGACAAAG



ATGTTGTTTTTAACATCCTGCGTGAACTATATGAAGAGTTGGTCCCTAGTTCAGTCGAGAAAAAGGG



TGAAGCCAATCAATTATCGAATAAGTTTCTGTACCCGCTAGTTGATCCGAACAGTCAAAGTGGGAA



AGGGACGGCATCATCCGGACGTAAACCTCGGTGGTATAATTTAAAAATAGCAGGCGACCCATCGTG



GGAGGAAGAAAAGAAAAAATGGGAAGAGGATAAAAAGAAAGATCCCCTTGCTAAAATCTTAGGTA



AGTTAGCAGAATATGGGCTTATTCCGCTATTTATTCCATTTACTGACAGCAACGAACCAATTGTAAA



AGAAATTAAATGGATGGAAAAAAGTCGTAATCAAAGTGTCCGGCGACTTGATAAGGATATGTTTAT



CCAAGCATTAGAGCGTTTTCTTTCATGGGAAAGCTGGAACCTTAAAGTAAAGGAAGAGTATGAAAA



AGTTGAAAAGGAACACAAAACACTAGAGGAAAGGATAAAAGAGGACATTCAAGCATTTAAATCCC



TTGAACAATATGAAAAAGAACGGCAGGAGCAACTTCTTAGAGATACATTGAATACAAATGAATACC



GATTAAGCAAAAGAGGATTACGTGGTTGGCGTGAAATTATCCAAAAATGGCTAAAGATGGATGAA



AATGAACCATCAGAAAAATATTTAGAAGTATTTAAAGATTATCAACGGAAACATCCACGAGAAGCC



GGGGACTATTCTGTCTATGAATTTTTAAGCAAGAAAGAAAATCATTTTATTTGGCGAAATCATCCTG



AATATCCTTATTTGTATGCTACATTTTGTGAAATTGACAAAAAAAAGAAAGACGCTAAGCAACAGG



CAACTTTTACTTTGGCTGACCCGATTAACCATCCGTTATGGGTACGATTTGAAGAAAGAAGCGGTTC



GAACTTAAACAAATATCGAATTTTAACAGAGCAATTACACACTGAAAAGTTAAAAAAGAAATTAAC



AGTTCAACTTGATCGTTTAATTTATCCAACTGAATCCGGCGGTTGGGAGGAAAAAGGTAAAGTAGA



TATCGTTTTGTTGCCGTCAAGACAATTTTATAATCAAATCTTCCTTGATATAGAAGAAAAGGGGAAA



CATGCTTTTACTTATAAGGATGAAAGTATTAAATTCCCCCTTAAAGGTACACTTGGTGGTGCAAGAG



TGCAGTTTGACCGTGACCATTTGCGGAGATATCCGCATAAAGTAGAATCAGGAAATGTTGGACGGA



TTTATTTTAACATGACAGTAAATATTGAACCAACTGAGAGCCCTGTTAGTAAGTCTTTGAAAATACA



TAGGGACGATTTCCCCAAGTTCGTTAATTTTAAACCGAAAGAGCTCACCGAATGGATAAAAGATAG



TAAAGGGAAAAAATTAAAAAGTGGTATAGAATCCCTTGAAATTGGTCTACGGGTGATGAGTATCGA



CTTAGGTCAACGTCAAGCGGCTGCTGCATCGATTTTTGAAGTAGTTGATCAGAAACCGGATATTGAA



GGGAAGTTATTTTTTCCAATCAAAGGAACTGAGCTTTATGCTGTTCACCGGGCAAGTTTTAACATTA



AATTACCGGGTGAAACATTAGTAAAATCACGGGAAGTATTGCGGAAAGCTCGGGAGGACAACTTA



AAATTAATGAATCAAAAGTTAAACTTTCTAAGAAATGTTCTACATTTCCAACAGTTTGAAGATATCA



CAGAAAGAGAGAAGCGTGTAACTAAATGGATTTCTAGACAAGAAAATAGTGATGTTCCTCTTGTAT



ATCAAGATGAGCTAATTCAAATTCGTGAATTAATGTATAAACCCTATAAAGATTGGGTTGCCTTTTT



AAAACAACTCCATAAACGGCTAGAAGTCGAGATTGGCAAAGAGGTTAAGCATTGGCGAAAATCATT



AAGTGACGGGAGAAAAGGTCTTTACGGAATCTCCCTAAAAAATATTGATGAAATTGATCGAACAAG



GAAATTCCTTTTAAGATGGAGCTTACGTCCAACAGAACCTGGGGAAGTAAGACGCTTGGAACCAGG



ACAGCGTTTTGCGATTGATCAATTAAACCACCTAAATGCATTAAAAGAAGATCGATTAAAAAAGAT



GGCAAATACGATTATCATGCATGCCTTAGGTTACTGTTATGATGTAAGAAAGAAAAAGTGGCAGGC



AAAAAATCCAGCATGTCAAATTATTTTATTTGAAGATTTATCTAACTACAATCCTTACGAGGAAAGG



TCCCGTTTTGAAAACTCAAAACTGATGAAGTGGTCACGGAGAGAAATTCCACGACAAGTCGCCTTA



CAAGGTGAAATTTACGGATTACAAGTTGGGGAAGTAGGTGCCCAATTCAGTTCAAGATTCCATGCG



AAAACCGGGTCGCCGGGAATTCGTTGCAGTGTTGTAACGAAAGAAAAATTGCAGGATAATCGCTTT



TTTAAAAATTTACAAAGAGAAGGACGACTTACTCTTGATAAAATCGCAGTTTTAAAAGAAGGAGAC



TTATATCCAGATAAAGGTGGAGAAAAGTTTATTTCTTTATCAAAGGATCGAAAGTTGGTAACTACGC



ATGCTGATATTAACGCGGCCCAAAATTTACAGAAGCGTTTTTGGACAAGAACACATGGATTTTATA



AAGTTTACTGCAAAGCCTATCAGGTTGATGGACAAACTGTTTATATTCCGGAGAGCAAGGACCAAA



AACAAAAAATAATTGAAGAATTTGGGGAAGGCTATTTTATTTTAAAAGATGGTGTATATGAATGGG



GTAATGCGGGGAAACTAAAAATTAAAAAAGGTTCCTCTAAACAATCATCGAGTGAATTAGTAGATT



CGGACATACTGAAAGATTCATTTGATTTAGCAAGTGAACTTAAGGGAGAGAAACTCATGTTATATC



GAGATCCGAGTGGAAACGTATTTCCTTCCGACAAGTGGATGGCAGCAGGAGTATTTTTTGGCAAAT



TAGAAAGAATATTGATTTCTAAGTTAACAAATCAATACTCAATATCAACAATAGAAGATGATTCTTC



AAAACAATCAATGTAA





SEQ ID NO: 35
ATGCCCACCCGCACCATCAATCTGAAACTTGTTCTTGGGAAAAATCCTGAAAACGCAACATTGCGA



CGCGCCCTATTTTCGACACACCGTTTGGTTAACCAAGCGACGAAACGTATTGAGGAATTCTTGTTGC



TGTGTCGTGGAGAAGCCTACAGAACAGTGGATAATGAGGGGAAGGAAGCCGAGATTCCACGTCAT



GCAGTCCAAGAAGAAGCTCTTGCCTTTGCCAAAGCTGCTCAACGCCACAACGGCTGTATATCCACCT



ATGAAGACCAAGAGATTCTTGATGTACTGCGGCAACTGTACGAACGTCTTGTTCCTTCGGTCAACGA



AAACAACGAGGCAGGCGATGCTCAAGCTGCTAACGCCTGGGTCAGTCCGCTCATGTCGGCAGAAAG



CGAAGGAGGCTTGTCGGTCTACGACAAGGTGCTTGATCCACCGCCGGTTTGGATGAAGCTTAAAGA



AGAAAAGGCTCCAGGATGGGAAGCCGCTTCTCAAATTTGGATTCAGAGTGATGAGGGACAGTCGTT



ACTTAATAAGCCAGGTAGCCCTCCCCGCTGGATTCGAAAACTGCGATCTGGGCAACCGTGGCAAGA



TGATTTCGTCAGTGACCAAAAGAAAAAGCAAGATGAGCTGACCAAAGGGAACGCACCACTTATAA



AACAACTCAAAGAAATGGGGTTGTTGCCTCTTGTTAACCCATTTTTTAGACATCTTCTTGACCCTGA



AGGTAAAGGCGTGAGTCCATGGGACCGTCTTGCTGTACGCGCTGCAGTGGCTCACTTTATCTCCTGG



GAAAGTTGGAATCATAGAACACGTGCAGAATACAATTCCTTGAAACTACGGCGAGACGAGTTTGAG



GCAGCATCCGACGAATTCAAAGACGATTTTACTTTGCTCCGACAATATGAAGCCAAACGCCATAGT



ACATTGAAAAGCATCGCGCTGGCCGACGATTCGAACCCTTACCGGATTGGAGTACGTTCTCTGCGTG



CCTGGAACCGCGTTCGTGAAGAATGGATAGACAAGGGTGCAACAGAAGAACAACGCGTGACCATA



TTGTCAAAGCTTCAAACACAACTTCGGGGAAAATTCGGCGATCCCGATCTGTTCAACTGGCTAGCTC



AGGATAGGCATGTCCATTTGTGGTCTCCTCGGGACAGCGTGACACCATTGGTTCGCATCAATGCGGT



AGATAAAGTTCTGCGTCGACGAAAACCGTATGCATTGATGACCTTTGCCCATCCCCGCTTCCACCCT



CGATGGATACTGTACGAGGCTCCAGGAGGAAGCAATCTCCGTCAATATGCATTGGATTGTACAGAA



AACGCTCTACACATCACGTTGCCTTTGCTTGTCGACGATGCGCACGGAACCTGGATTGAAAAAAAG



ATCAGGGTGCCGCTGGCACCATCCGGACAAATTCAAGATTTAACTCTGGAAAAACTTGAGAAGAAA



AAAAATCGTTTATACTACCGTTCCGGTTTTCAGCAGTTTGCCGGCTTGGCTGGCGGAGCTGAGGTTC



TTTTCCACAGACCCTATATGGAACACGACGAACGCAGCGAGGAGTCTCTTTTGGAACGTCCGGGAG



CCGTTTGGTTCAAATTGACCCTGGATGTGGCAACACAGGCTCCCCCGAACTGGCTTGATGGTAAGG



GCCGTGTCCGTACACCGCCGGAGGTACATCATTTTAAAACCGCATTGTCGAATAAAAGCAAACATA



CACGTACGCTGCAGCCGGGTCTCCGTGTCTTGTCAGTAGACTTGGGCATGCGAACATTCGCCTCCTG



CTCAGTATTTGAACTCATCGAGGGAAAGCCTGAGACAGGCCGTGCCTTCCCTGTTGCCGATGAGAG



ATCAATGGACAGCCCGAATAAACTGTGGGCCAAGCATGAACGTAGTTTTAAACTGACGCTCCCCGG



CGAAACCCCTTCTCGAAAGGAAGAGGAAGAGCGTAGCATAGCAAGAGCGGAAATTTATGCACTGA



AACGCGACATACAACGCCTCAAAAGCCTACTCCGCTTAGGTGAAGAAGATAACGATAACCGTCGTG



ATGCATTGCTTGAACAGTTCTTTAAAGGATGGGGAGAAGAAGACGTTGTGCCTGGACAAGCGTTTC



CACGCTCTCTTTTCCAAGGGTTGGGAGCTGCCCCGTTTCGCTCAACTCCAGAGTTATGGCGTCAGCA



TTGCCAAACATATTATGACAAAGCGGAAGCCTGTCTGGCTAAACATATCAGTGATTGGCGCAAGCG



AACTCGTCCCCGTCCGACATCGCGGGAGATGTGGTACAAAACACGTTCCTATCATGGCGGCAAGTC



CATTTGGATGTTGGAATATCTTGATGCCGTTCGAAAACTGCTTCTCAGTTGGAGCTTACGTGGTCGT



ACTTACGGTGCCATTAATCGCCAGGATACAGCCCGGTTTGGTTCTTTGGCATCACGGCTGCTCCACC



ATATCAATTCCCTAAAGGAAGACCGCATCAAAACAGGAGCCGACTCTATCGTTCAGGCTGCTCGCG



GGTATATTCCTCTCCCTCATGGCAAGGGTTGGGAACAAAGATATGAGCCTTGTCAGCTCATATTATT



TGAAGACCTCGCACGATATCGCTTTCGCGTGGATCGACCTCGTCGAGAGAACAGCCAACTCATGCA



GTGGAACCATCGAGCCATCGTGGCAGAAACAACGATGCAAGCCGAACTCTACGGACAAATTGTCGA



AAATACTGCAGCGGGGTTCAGCAGTCGTTTTCACGCGGCGACAGGTGCCCCCGGTGTACGTTGTCGT



TTTCTTCTAGAAAGAGACTTTGATAACGATTTGCCCAAACCGTACCTTCTCAGGGAACTTTCTTGGA



TGCTCGGCAATACAAAAGTCGAGTCTGAAGAAGAAAAGCTTCGATTGCTGTCTGAAAAAATCAGGC



CAGGCAGTCTTGTTCCTTGGGATGGAGGCGAACAGTTCGCTACCCTGCATCCCAAAAGACAAACAC



TTTGCGTCATTCATGCCGATATGAATGCTGCCCAAAATTTACAACGCCGGTTTTTCGGTCGATGCGG



CGAGGCCTTTCGGCTTGTTTGTCAACCCCACGGTGACGACGTGTTACGACTCGCATCCACCCCAGGA



GCTCGTCTTCTTGGAGCCCTGCAGCAGCTTGAAAATGGACAAGGAGCTTTCGAGTTGGTTCGAGAC



ATGGGGTCAACAAGTCAAATGAACCGGTTCGTCATGAAGTCTTTGGGAAAAAAGAAAATAAAACCC



CTTCAGGACAACAATGGAGACGACGAGCTTGAAGACGTGTTGTCCGTACTCCCGGAGGAAGACGAC



ACAGGACGTATCACAGTCTTCCGCGATTCATCAGGAATCTTTTTTCCTTGCAACGTCTGGATACCGG



CCAAACAGTTTTGGCCAGCAGTACGCGCCATGATTTGGAAGGTCATGGCTTCCCATTCTTTGGGGTGA





SEQ ID NO: 36
ATGACAAAGTTAAGACACCGACAGAAAAAATTAACACACGACTGGGCTGGCTCCAAAAAGAGGGA



AGTATTAGGCTCAAATGGCAAGCTTCAGAATCCGTTGTTAATGCCGGTTAAAAAAGGTCAGGTTAC



TGAGTTCCGGAAAGCGTTTTCTGCGTATGCTCGCGCAACGAAAGGAGAAATGACTGACGGCCGAAA



GAATATGTTTACGCATAGTTTCGAGCCATTTAAGACAAAGCCCTCGCTTCATCAGTGTGAATTGGCA



GATAAAGCATATCAATCTTTACATTCGTATCTGCCTGGTTCTCTTGCTCATTTTCTATTATCTGCTCA



CGCATTAGGTTTTCGTATTTTTTCAAAATCTGGTGAAGCAACTGCATTCCAGGCATCCTCTAAAATT



GAAGCTTACGAATCAAAATTGGCAAGCGAATTAGCTTGTGTAGATTTATCTATTCAAAACTTGACTA



TTTCAACGCTTTTTAATGCGCTTACAACGTCTGTAAGAGGGAAGGGCGAAGAAACTAGCGCTGACC



CCTTAATTGCACGATTTTACACCTTACTTACTGGCAAGCCTCTGTCTCGAGACACTCAAGGGCCTGA



ACGTGATTTAGCAGAAGTTATCTCGCGTAAGATAGCTAGTTCTTTTGGCACATGGAAAGAAATGAC



GGCAAACCCTCTTCAGTCATTACAATTTTTTGAAGAGGAACTCCATGCGCTGGATGCCAATGTCTCG



CTCTCACCCGCCTTCGACGTTTTAATTAAAATGAATGATTTGCAGGGCGATTTAAAAAATCGAACCA



TTGTTTTTGATCCTGACGCCCCTGTTTTTGAATATAACGCAGAAGACCCTGCCGACATAATTATTAA



ACTTACAGCTCGTTACGCTAAAGAAGCTGTCATCAAAAATCAAAACGTAGGAAATTACGTTAAAAA



CGCTATTACTACCACAAATGCCAATGGTCTTGGTTGGCTTTTGAACAAAGGTTTGTCGTTACTCCCT



GTCTCGACCGATGACGAATTGCTAGAGTTTATTGGCGTTGAACGATCTCATCCCTCATGCCATGCCT



TAATTGAATTGATTGCACAATTAGAAGCCCCCGAGCTCTTTGAGAAGAACGTATTTTCAGATACTCG



TTCTGAAGTTCAAGGTATGATTGATTCAGCTGTTTCTAATCATATTGCTCGTCTTTCCAGCTCTAGAA



ATAGCTTGTCAATGGATAGTGAAGAATTAGAACGTTTAATCAAAAGCTTTCAGATACACACACCTC



ATTGCTCACTTTTTATTGGCGCCCAATCACTTTCACAGCAGTTAGAATCTTTGCCTGAAGCCCTTCAA



TCGGGCGTTAATTCAGCCGATATTTTACTAGGCTCTACTCAATATATGCTCACCAATTCTTTGGTTGA



AGAGTCAATTGCAACTTATCAAAGAACACTTAATCGCATCAATTACTTGTCAGGTGTTGCAGGTCAG



ATTAACGGCGCAATAAAGCGAAAAGCGATAGATGGAGAAAAAATTCACTTGCCTGCAGCTTGGTCA



GAGTTGATATCTTTACCATTTATAGGCCAGCCTGTTATAGATGTTGAAAGCGATTTAGCTCATCTAA



AAAATCAATACCAAACACTTTCAAATGAGTTTGATACTCTTATATCTGCTTTGCAAAAGAATTTTGA



TTTGAACTTTAATAAAGCGCTCCTTAATCGTACTCAGCATTTTGAAGCCATGTGTAGAAGCACTAAG



AAAAACGCTTTATCCAAACCAGAGATCGTTTCCTATCGCGACCTGCTTGCTCGATTAACTTCTTGTTT



GTATCGAGGCTCTTTAGTTTTGCGTCGTGCCGGCATTGAAGTGTTAAAAAAACATAAAATATTTGAG



TCAAACAGCGAACTTCGTGAACATGTTCATGAAAGAAAGCATTTCGTGTTTGTTAGTCCTCTAGATC



GCAAAGCCAAGAAACTCCTTCGATTAACTGATTCGCGTCCAGACTTGTTACATGTTATTGATGAAAT



ATTGCAGCACGATAATCTTGAAAACAAAGACCGCGAGTCACTTTGGCTAGTTCGCTCTGGTTATTTG



CTTGCAGGACTTCCAGATCAACTTTCTTCATCTTTTATTAACTTGCCTATCATTACTCAAAAAGGAGA



TAGACGCCTTATAGACCTGATTCAGTATGATCAAATTAATCGTGATGCTTTTGTTATGTTAGTGACCT



CTGCATTCAAGTCTAATTTGTCTGGTCTGCAGTATCGTGCCAATAAGCAATCGTTCGTTGTTACTCGC



ACGCTAAGCCCTTATCTCGGCTCAAAACTTGTCTACGTACCCAAGGATAAAGATTGGTTAGTTCCTT



CTCAAATGTTTGAAGGACGATTTGCTGACATTCTTCAATCAGATTATATGGTCTGGAAAGATGCCGG



TCGTCTTTGTGTTATTGATACTGCAAAACACCTTTCTAATATAAAGAAGTCTGTATTTTCATCCGAAG



AAGTTCTCGCTTTTTTAAGAGAACTCCCTCACCGCACATTTATCCAGACCGAAGTTCGCGGCCTTGG



CGTTAATGTCGATGGAATTGCATTTAATAATGGTGATATTCCGTCATTAAAAACCTTTTCAAATTGC



GTTCAGGTAAAAGTTTCTCGGACTAATACATCCCTAGTTCAAACACTTAATCGTTGGTTTGAAGGAG



GAAAAGTTTCTCCTCCGAGCATTCAATTTGAACGGGCGTATTATAAAAAAGACGATCAAATTCATG



AAGACGCAGCGAAAAGAAAGATACGATTCCAGATGCCCGCAACTGAGTTGGTTCATGCTTCTGACG



ATGCGGGGTGGACACCAAGTTATTTGCTCGGCATTGATCCTGGCGAGTATGGAATGGGTCTTTCATT



GGTTTCGATTAATAACGGAGAAGTCTTAGATTCAGGCTTTATTCATATTAATTCTCTGATCAATTTTG



CCTCTAAAAAGAGCAACCATCAAACTAAGGTTGTTCCGCGTCAGCAGTACAAATCTCCTTATGCAA



ATTATTTAGAACAATCTAAAGATTCTGCTGCTGGTGATATTGCGCATATACTCGATCGACTTATATA



CAAATTAAATGCGTTGCCTGTTTTTGAGGCTCTTTCAGGTAATTCTCAGAGTGCTGCTGATCAAGTTT



GGACGAAAGTCTTATCGTTTTACACTTGGGGTGATAATGACGCTCAGAATTCTATTAGAAAGCAGC



ATTGGTTTGGAGCCAGTCATTGGGATATCAAAGGTATGTTAAGGCAACCCCCTACGGAGAAGAAGC



CTAAACCGTATATTGCTTTTCCTGGCTCTCAGGTTTCTTCGTATGGTAATTCCCAACGTTGCTCTTGC



TGCGGTCGCAATCCTATTGAACAACTTCGAGAAATGGCAAAGGATACCTCTATTAAAGAGCTAAAA



ATTCGCAATTCTGAGATACAGCTTTTTGACGGAACCATTAAATTATTTAATCCAGACCCATCCACTG



TGATAGAGAGAAGGCGACATAATCTTGGTCCATCAAGAATTCCTGTTGCTGACCGTACTTTCAAAA



ACATCAGTCCATCAAGTCTAGAATTTAAAGAATTGATTACTATCGTGTCTCGATCTATCCGTCATTC



ACCTGAGTTTATCGCTAAAAAACGCGGCATAGGGTCTGAGTATTTTTGCGCTTATTCCGATTGCAAC



TCATCCTTAAATTCTGAAGCTAACGCAGCTGCTAACGTAGCGCAAAAATTTCAAAAACAGTTATTTT



TTGAGTTATAA





SEQ ID NO: 37
ATGAAGAGAATTCTGAACAGTCTGAAAGTTGCTGCCTTGAGACTTCTGTTTCGAGGCAAAGGTTCTG



AATTAGTGAAGACAGTCAAATATCCATTGGTTTCCCCGGTTCAAGGCGCGGTTGAAGAACTTGCTG



AAGCAATTCGGCACGACAACCTGCACCTTTTTGGGCAGAAGGAAATAGTGGATCTTATGGAGAAAG



ACGAAGGAACCCAGGTGTATTCGGTTGTGGATTTTTGGTTGGATACCCTGCGTTTAGGGATGTTTTT



CTCACCATCAGCGAATGCGTTGAAAATCACGCTGGGAAAATTCAATTCTGATCAGGTTTCACCTTTT



CGTAAGGTTTTGGAGCAGTCACCTTTTTTTCTTGCGGGTCGCTTGAAGGTTGAACCTGCGGAAAGGA



TACTTTCTGTTGAAATCAGAAAGATTGGTAAAAGAGAAAACAGAGTTGAGAACTATGCCGCCGATG



TGGAGACATGCTTCATTGGTCAGCTTTCTTCAGATGAGAAACAGAGTATCCAGAAGCTGGCAAATG



ATATCTGGGATAGCAAGGATCATGAGGAACAGAGAATGTTGAAGGCGGATTTTTTTGCTATACCTC



TTATAAAAGACCCCAAAGCTGTCACAGAAGAAGATCCTGAAAATGAAACGGCGGGAAAACAGAAA



CCGCTTGAATTATGTGTTTGTCTTGTTCCTGAGTTGTATACCCGAGGTTTCGGCTCCATTGCTGATTT



TCTGGTTCAGCGACTTACCTTGCTGCGTGACAAAATGAGTACCGACACGGCGGAAGATTGCCTCGA



GTATGTTGGCATTGAGGAAGAAAAAGGCAATGGAATGAATTCCTTGCTCGGCACTTTTTTGAAGAA



CCTGCAGGGTGATGGTTTTGAACAGATTTTTCAGTTTATGCTTGGGTCTTATGTTGGCTGGCAGGGG



AAGGAAGATGTACTGCGCGAACGATTGGATTTGCTGGCCGAAAAAGTCAAAAGATTACCAAAGCC



AAAATTTGCCGGAGAATGGAGTGGTCATCGTATGTTTCTCCATGGTCAGCTGAAAAGCTGGTCGTCG



AATTTCTTCCGTCTTTTTAATGAGACGCGGGAACTTCTGGAAAGTATCAAGAGTGATATTCAACATG



CCACCATGCTCATTAGCTATGTGGAAGAGAAAGGAGGCTATCATCCACAGCTGTTGAGTCAGTATC



GGAAGTTAATGGAACAATTACCGGCGTTGCGGACTAAGGTTTTGGATCCTGAGATTGAGATGACGC



ATATGTCCGAGGCTGTTCGAAGTTACATTATGATACACAAGTCTGTAGCGGGATTTCTGCCGGATTT



ACTCGAGTCTTTGGATCGAGATAAGGATAGGGAATTTTTGCTTTCCATCTTTCCTCGTATTCCAAAG



ATAGATAAGAAGACGAAAGAGATCGTTGCATGGGAGCTACCGGGCGAGCCAGAGGAAGGCTATTT



GTTCACAGCAAACAACCTTTTCCGGAATTTTCTTGAGAATCCGAAACATGTGCCACGATTTATGGCA



GAGAGGATTCCCGAGGATTGGACGCGTTTGCGCTCGGCCCCTGTGTGGTTTGATGGGATGGTGAAG



CAATGGCAGAAGGTGGTGAATCAGTTGGTTGAATCTCCAGGCGCCCTTTATCAGTTCAATGAAAGTT



TTTTGCGTCAAAGACTGCAAGCAATGCTTACGGTCTATAAGCGGGATCTCCAGACTGAGAAGTTTCT



GAAGCTGCTGGCTGATGTCTGTCGTCCACTCGTTGATTTTTTCGGACTTGGAGGAAATGATATTATC



TTCAAGTCATGTCAGGATCCAAGAAAGCAATGGCAGACTGTTATTCCACTCAGTGTCCCAGCGGAT



GTTTATACAGCATGTGAAGGCTTGGCTATTCGTCTCCGCGAAACTCTTGGATTCGAATGGAAAAATC



TGAAAGGACACGAGCGGGAAGATTTTTTACGGCTGCATCAGTTGCTGGGAAATCTGCTGTTCTGGA



TCAGGGATGCGAAACTTGTCGTGAAGCTGGAAGACTGGATGAACAATCCTTGTGTTCAGGAGTATG



TGGAAGCACGAAAAGCCATTGATCTTCCCTTGGAGATTTTCGGATTTGAGGTGCCGATTTTTCTCAA



TGGCTATCTCTTTTCGGAACTGCGCCAGCTGGAATTGTTGCTGAGGCGTAAGTCGGTGATGACGTCT



TACAGCGTCAAAACGACAGGCTCGCCAAATAGGCTCTTCCAGTTGGTTTACCTACCTCTAAACCCTT



CAGATCCGGAAAAGAAAAATTCCAACAACTTTCAGGAGCGCCTCGATACACCTACCGGTTTGTCGC



GTCGTTTTCTGGATCTTACGCTGGATGCATTTGCTGGCAAACTCTTGACGGATCCGGTAACTCAGGA



ACTGAAGACGATGGCCGGTTTTTACGATCATCTCTTTGGCTTCAAGTTGCCGTGTAAACTGGCGGCG



ATGAGTAACCATCCAGGATCCTCTTCCAAAATGGTGGTTCTGGCAAAACCAAAGAAGGGTGTTGCT



AGTAACATCGGCTTTGAACCTATTCCCGATCCTGCTCATCCTGTGTTCCGGGTGAGAAGTTCCTGGC



CGGAGTTGAAGTACCTGGAGGGGTTGTTGTATCTTCCCGAAGATACACCACTGACCATTGAACTGG



CGGAAACGTCGGTCAGTTGTCAGTCTGTGAGTTCAGTCGCTTTCGATTTGAAGAATCTGACGACTAT



CTTGGGTCGTGTTGGTGAATTCAGGGTGACGGCAGATCAACCTTTCAAGCTGACGCCCATTATTCCT



GAGAAAGAGGAATCCTTCATCGGGAAGACCTACCTCGGTCTTGATGCTGGAGAGCGATCTGGCGTT



GGTTTCGCGATTGTGACGGTTGACGGCGATGGGTATGAGGTGCAGAGGTTGGGTGTGCATGAAGAT



ACTCAGCTTATGGCGCTTCAGCAAGTCGCCAGCAAGTCTCTTAAGGAGCCGGTTTTCCAGCCACTCC



GTAAGGGCACATTTCGTCAGCAGGAGCGCATTCGCAAAAGCCTCCGCGGTTGCTACTGGAATTTCT



ATCATGCATTGATGATCAAGTACCGAGCTAAAGTTGTGCATGAGGAATCGGTGGGTTCATCCGGTCT



GGTGGGGCAGTGGCTGCGTGCATTTCAGAAGGATCTCAAAAAGGCTGATGTTCTGCCCAAGAAGGG



TGGAAAAAATGGTGTAGACAAAAAAAAGAGAGAAAGCAGCGCTCAGGATACCTTATGGGGAGGAG



CTTTCTCGAAGAAGGAAGAGCAGCAGATAGCCTTTGAGGTTCAGGCAGCTGGATCAAGCCAGTTTT



GTCTGAAGTGTGGTTGGTGGTTTCAGTTGGGGATGCGGGAAGTAAATCGTGTGCAGGAGAGTGGCG



TGGTGCTGGACTGGAACCGGTCCATTGTAACCTTCCTCATCGAATCCTCAGGAGAAAAGGTATATG



GTTTCAGTCCTCAGCAACTGGAAAAAGGCTTTCGTCCTGACATCGAAACGTTCAAAAAAATGGTAA



GGGATTTTATGAGACCCCCCATGTTTGATCGCAAAGGTCGGCCGGCCGCGGCGTATGAAAGATTCG



TACTGGGACGTCGTCACCGTCGTTATCGCTTTGATAAAGTTTTTGAAGAGAGATTTGGTCGCAGTGC



TCTTTTCATCTGCCCGCGGGTCGGGTGTGGGAATTTCGATCACTCCAGTGAGCAGTCAGCCGTTGTC



CTTGCCCTTATTGGTTACATTGCTGATAAGGAAGGGATGAGTGGTAAGAAGCTTGTTTATGTGAGGC



TGGCTGAACTTATGGCTGAGTGGAAGCTGAAGAAACTGGAGAGATCAAGGGTGGAAGAACAGAGC



TCGGCACAATAA





SEQ ID NO: 38
ATGGCAGAAAGCAAGCAGATGCAATGCCGCAAGTGCGGCGCAAGCATGAAGTATGAAGTAATTGG



ATTGGGCAAGAAGTCATGCAGATATATGTGCCCAGATTGCGGCAATCACACCAGCGCGCGCAAGAT



TCAGAACAAGAAAAAGCGCGACAAAAAGTATGGATCCGCAAGCAAAGCGCAGAGCCAGAGGATA



GCTGTGGCTGGCGCGCTTTATCCAGACAAAAAAGTGCAGACCATAAAGACCTACAAATACCCAGCG



GATCTTAATGGCGAAGTTCATGACAGCGGCGTCGCAGAGAAGATTGCGCAGGCGATTCAGGAAGAT



GAGATCGGCCTGCTTGGCCCGTCCAGCGAATACGCTTGCTGGATTGCTTCACAAAAACAGAGCGAG



CCGTATTCAGTTGTAGATTTTTGGTTTGACGCGGTGTGCGCAGGCGGAGTATTCGCGTATTCTGGCG



CGCGCCTGCTTTCCACAGTCCTCCAGTTGAGTGGCGAGGAAAGCGTTTTGCGCGCTGCTTTAGCATC



TAGCCCGTTTGTAGATGACATTAATTTGGCGCAAGCGGAAAAGTTCCTAGCCGTTAGCCGGCGCAC



AGGCCAAGATAAGCTAGGCAAGCGCATTGGAGAATGTTTTGCGGAAGGCCGGCTTGAAGCGCTTGG



CATCAAAGATCGCATGCGCGAATTCGTGCAAGCGATTGATGTGGCCCAAACCGCGGGCCAGCGGTT



CGCGGCCAAGCTAAAGATATTCGGCATCAGTCAGATGCCTGAAGCCAAGCAATGGAACAATGATTC



CGGGCTCACTGTATGTATTTTGCCGGATTATTATGTCCCGGAAGAAAACCGCGCGGACCAGCTGGTT



GTTTTGCTTCGGCGCTTACGCGAGATCGCGTATTGCATGGGAATTGAGGATGAAGCAGGATTTGAG



CATCTAGGCATTGACCCTGGTGCTCTTTCCAATTTTTCCAATGGCAATCCAAAGCGAGGATTTCTCG



GCCGCCTGCTCAATAATGACATTATAGCGCTGGCAAACAACATGTCAGCCATGACGCCGTATTGGG



AAGGCAGAAAAGGCGAGTTGATTGAGCGCCTTGCATGGCTTAAACATCGCGCTGAAGGATTGTATT



TGAAAGAGCCACATTTCGGCAACTCCTGGGCAGACCACCGCAGCAGGATTTTCAGTCGCATTGCGG



GCTGGCTTTCCGGATGCGCGGGCAAGCTCAAGATTGCCAAGGATCAGATTTCAGGCGTGCGTACGG



ATTTGTTTCTGCTCAAGCGCCTTCTGGATGCGGTACCGCAAAGCGCGCCGTCGCCGGACTTTATTGC



TTCCATCAGCGCGCTGGATCGGTTTTTGGAAGCGGCAGAAAGCAGCCAGGATCCGGCAGAACAGGT



ACGCGCTTTGTACGCGTTTCATCTGAACGCGCCTGCGGTCCGATCCATCGCCAACAAGGCGGTACAG



AGGTCTGATTCCCAGGAGTGGCTTATCAAGGAACTGGATGCTGTAGATCACCTTGAATTCAACAAA



GCATTTCCGTTTTTTTCGGATACAGGAAAGAAAAAGAAGAAAGGAGCGAATAGCAACGGAGCGCCT



TCTGAAGAAGAATACACGGAAACAGAATCCATTCAACAACCAGAAGATGCAGAGCAGGAAGTGAA



TGGTCAAGAAGGAAATGGCGCTTCAAAGAACCAGAAAAAGTTTCAGCGCATTCCTCGATTTTTCGG



GGAAGGGTCAAGGAGTGAGTATCGAATTTTAACAGAAGCGCCGCAATATTTTGACATGTTCTGCAA



TAATATGCGCGCGATCTTTATGCAGCTAGAGAGTCAGCCGCGCAAGGCGCCTCGTGATTTCAAATG



CTTTCTGCAGAATCGTTTGCAGAAGCTTTACAAGCAAACCTTTCTCAATGCTCGCAGTAATAAATGC



CGCGCGCTTCTGGAATCCGTCCTTATTTCATGGGGAGAATTTTATACTTATGGCGCGAATGAAAAGA



AGTTTCGTCTGCGCCATGAAGCGAGCGAGCGCAGCTCGGATCCGGACTATGTGGTTCAGCAGGCAT



TGGAAATCGCGCGCCGGCTTTTCTTGTTCGGATTTGAGTGGCGCGATTGCTCTGCTGGAGAGCGCGT



GGATTTGGTTGAAATCCACAAAAAAGCAATCTCATTTTTGCTTGCAATCACTCAGGCCGAGGTTTCA



GTTGGTTCCTATAACTGGCTTGGGAATAGCACCGTGAGCCGGTATCTTTCGGTTGCTGGCACAGACA



CATTGTACGGCACTCAACTGGAGGAGTTTTTGAACGCCACAGTGCTTTCACAGATGCGTGGGCTGGC



GATTCGGCTTTCATCTCAGGAGTTAAAAGACGGATTTGATGTTCAGTTGGAGAGTTCGTGCCAGGAC



AATCTCCAGCATCTGCTGGTGTATCGCGCTTCGCGCGACTTGGCTGCGTGCAAACGCGCTACATGCC



CGGCTGAATTGGATCCGAAAATTCTTGTTCTGCCGGTTGGTGCGTTTATCGCGAGCGTAATGAAAAT



GATTGAGCGTGGCGATGAACCATTAGCAGGCGCGTATTTGCGTCATCGGCCGCATTCATTCGGCTGG



CAGATACGGGTTCGTGGAGTGGCGGAAGTAGGCATGGATCAGGGCACAGCGCTAGCATTCCAGAA



GCCGACTGAATCAGAGCCGTTTAAAATAAAGCCGTTTTCCGCTCAATACGGCCCAGTACTTTGGCTT



AATTCTTCATCCTATAGCCAGAGCCAGTATCTGGATGGATTTTTAAGCCAGCCAAAGAATTGGTCTA



TGCGGGTGCTACCTCAAGCCGGATCAGTGCGCGTGGAACAGCGCGTTGCTCTGATATGGAATTTGC



AGGCAGGCAAGATGCGGCTGGAGCGCTCTGGAGCGCGCGCGTTTTTCATGCCAGTGCCATTCAGCT



TCAGGCCGTCTGGTTCAGGAGATGAAGCAGTATTGGCGCCGAATCGGTACTTGGGACTTTTTCCGCA



TTCCGGAGGAATAGAATACGCGGTGGTGGATGTATTAGATTCCGCGGGTTTCAAAATTCTTGAGCG



CGGTACGATTGCGGTAAATGGCTTTTCCCAGAAGCGCGGCGAACGCCAAGAGGAGGCACACAGAG



AAAAACAGAGACGCGGAATTTCTGATATAGGCCGCAAGAAGCCGGTGCAAGCTGAAGTTGACGCA



GCCAATGAATTGCACCGCAAATACACCGATGTTGCCACTCGTTTAGGGTGCAGAATTGTGGTTCAGT



GGGCGCCCCAGCCAAAGCCGGGCACAGCGCCGACCGCGCAAACAGTATACGCGCGCGCAGTGCGG



ACCGAAGCGCCGCGATCTGGAAATCAAGAGGATCATGCTCGTATGAAATCCTCTTGGGGATATACC



TGGGGCACCTATTGGGAGAAGCGCAAACCAGAGGATATTTTGGGCATCTCAACCCAAGTATACTGG



ACCGGCGGTATAGGCGAGTCATGTCCCGCAGTCGCGGTTGCGCTTTTGGGGCACATTAGGGCAACA



TCCACTCAAACTGAATGGGAAAAAGAGGAGGTTGTATTCGGTCGACTGAAGAAGTTCTTTCCAAGC



TAG





SEQ ID NO: 39
ATGGAAAAGAGAATAAACAAGATACGAAAGAAACTATCGGCCGATAATGCCACAAAGCCTGTGAG



CAGGAGCGGCCCCATGAAAACACTCCTTGTCCGGGTCATGACGGACGACTTGAAAAAAAGACTGGA



GAAGCGTCGGAAAAAGCCGGAAGTTATGCCGCAGGTTATTTCAAATAACGCAGCAAACAATCTTAG



AATGCTCCTTGATGACTATACAAAGATGAAGGAGGCGATACTACAAGTTTACTGGCAGGAATTTAA



GGACGACCATGTGGGCTTGATGTGCAAATTTGCCCAGCCTGCTTCCAAAAAAATTGACCAGAACAA



ACTAAAACCGGAAATGGATGAAAAAGGAAATCTAACAACTGCCGGTTTTGCATGTTCTCAATGCGG



TCAGCCGCTATTTGTTTATAAGCTTGAACAGGTGAGTGAAAAAGGCAAGGCTTATACAAATTACTTC



GGCCGGTGTAATGTGGCCGAGCATGAGAAATTGATTCTTCTTGCTCAATTAAAACCTGAAAAAGAC



AGTGACGAAGCAGTGACATACTCCCTTGGCAAATTCGGCCAGAGGGCATTGGACTTTTATTCAATCC



ACGTAACAAAAGAATCCACCCATCCAGTAAAGCCCCTGGCACAGATTGCGGGCAACCGCTATGCAA



GCGGACCTGTTGGCAAGGCCCTTTCCGATGCCTGTATGGGCACTATAGCCAGTTTTCTTTCGAAATA



TCAAGACATCATCATAGAACATCAAAAGGTTGTGAAGGGTAATCAAAAGAGGTTAGAGAGTCTCAG



GGAATTGGCAGGGAAAGAAAATCTTGAGTACCCATCGGTTACACTGCCGCCGCAGCCGCATACGAA



AGAAGGGGTTGACGCTTATAACGAAGTTATTGCAAGGGTACGTATGTGGGTTAATCTTAATCTGTG



GCAAAAGCTGAAGCTCAGCCGTGATGACGCAAAACCGCTACTGCGGCTAAAAGGATTCCCATCTTT



CCCTGTTGTGGAGCGGCGTGAAAACGAAGTTGACTGGTGGAATACGATTAATGAAGTAAAAAAACT



GATTGACGCTAAACGAGATATGGGACGGGTATTCTGGAGCGGCGTTACCGCAGAAAAGAGAAATA



CCATCCTTGAAGGATACAACTATCTGCCAAATGAGAATGACCATAAAAAGAGAGAGGGCAGTTTGG



AAAACCCTAAGAAGCCTGCCAAACGCCAGTTTGGAGACCTCTTGCTGTATCTTGAAAAGAAATATG



CCGGAGACTGGGGAAAGGTCTTCGATGAGGCATGGGAGAGGATAGATAAGAAAATAGCCGGACTC



ACAAGCCATATAGAGCGCGAAGAAGCAAGAAACGCGGAAGACGCTCAATCCAAAGCCGTACTTAC



AGACTGGCTAAGGGCAAAGGCATCATTTGTTCTTGAAAGACTGAAGGAAATGGATGAAAAGGAATT



CTATGCGTGTGAAATCCAACTTCAAAAATGGTATGGCGATCTTCGAGGCAACCCGTTTGCCGTTGAA



GCTGAGAATAGAGTTGTTGATATAAGCGGGTTTTCTATCGGAAGCGATGGCCATTCAATCCAATAC



AGAAATCTCCTTGCCTGGAAATATCTGGAGAACGGCAAGCGTGAATTCTATCTGTTAATGAATTATG



GCAAGAAAGGGCGCATCAGATTTACAGATGGAACAGATATTAAAAAGAGCGGCAAATGGCAGGGA



CTATTATATGGCGGTGGCAAGGCAAAGGTTATTGATCTGACTTTCGACCCCGATGATGAACAGTTGA



TAATCCTGCCGCTGGCCTTTGGCACAAGGCAAGGCCGCGAGTTTATCTGGAACGATTTGCTGAGTCT



TGAAACAGGCCTGATAAAGCTCGCAAACGGAAGAGTTATCGAAAAAACAATCTATAACAAAAAAA



TAGGGCGGGATGAACCGGCTCTATTCGTTGCCTTAACATTTGAGCGCCGGGAAGTTGTTGATCCATC



AAATATAAAGCCTGTAAACCTTATAGGCGTTGACCGCGGCGAAAACATCCCGGCGGTTATTGCATT



GACAGACCCTGAAGGTTGTCCTTTACCGGAATTCAAGGATTCATCAGGGGGCCCAACAGACATCCT



GCGAATAGGAGAAGGATATAAGGAAAAGCAGAGGGCTATTCAGGCAGCAAAGGAGGTAGAGCAA



AGGCGGGCTGGCGGTTATTCACGGAAGTTTGCATCCAAGTCGAGGAACCTGGCGGACGACATGGTG



AGAAATTCAGCGCGAGACCTTTTTTACCATGCCGTTACCCACGATGCCGTCCTTGTCTTTGAAAACC



TGAGCAGGGGTTTTGGAAGGCAGGGCAAAAGGACCTTCATGACGGAAAGACAATATACAAAGATG



GAAGACTGGCTGACAGCGAAGCTCGCATACGAAGGTCTTACGTCAAAAACCTACCTTTCAAAGACG



CTGGCGCAATATACGTCAAAAACATGCTCCAACTGCGGGTTTACTATAACGACTGCCGATTATGAC



GGGATGTTGGTAAGGCTTAAAAAGACTTCTGATGGATGGGCAACTACCCTCAACAACAAAGAATTA



AAAGCCGAAGGCCAGATAACGTATTATAACCGGTATAAAAGGCAAACCGTGGAAAAAGAACTCTC



CGCAGAGCTTGACAGGCTTTCAGAAGAGTCGGGCAATAATGATATTTCTAAGTGGACCAAGGGTCG



CCGGGACGAGGCATTATTTTTGTTAAAGAAAAGATTCAGCCATCGGCCTGTTCAGGAACAGTTTGTT



TGCCTCGATTGCGGCCATGAAGTCCACGCCGATGAACAGGCAGCCTTGAATATTGCAAGGTCATGG



CTTTTTCTAAACTCAAATTCAACAGAATTCAAAAGTTATAAATCGGGTAAACAGCCCTTCGTTGGTG



CTTGGCAGGCCTTTTACAAAAGGAGGCTTAAAGAGGTATGGAAGCCCAACGCC





SEQ ID NO: 40
ATGAAAAGGATAAATAAAATACGAAGGAGATTGGTAAAGGATAGCAACACGAAAAAAGCCGGCA



AAACCGGCCCTATGAAAACCTTGCTCGTTCGGGTTATGACACCTGACCTGAGAGAAAGGTTAGAGA



ATCTTCGCAAAAAGCCGGAAAACATTCCTCAGCCCATTTCAAATACTTCACGTGCAAATTTAAATAA



ACTCCTCACTGACTATACGGAAATGAAGAAAGCAATCCTGCATGTTTATTGGGAAGAGTTCCAAAA



AGACCCTGTCGGATTGATGAGCAGGGTTGCACAACCAGCGCCCAAGAATATTGATCAGAGAAAATT



GATTCCGGTGAAGGACGGAAATGAGAGACTAACAAGTTCTGGATTTGCCTGTTCTCAGTGCTGTCA



ACCCCTCTATGTTTATAAGCTTGAACAAGTGAATGACAAGGGTAAGCCCCATACAAATTACTTTGGC



CGTTGTAATGTCTCCGAGCATGAACGTTTGATATTGCTCTCGCCGCATAAACCGGAGGCAAATGACG



AGCTAGTAACGTATTCGTTGGGGAAGTTCGGTCAAAGGGCATTGGACTTTTATTCAATCCACGTAAC



AAGAGAATCGAACCATCCTGTAAAGCCGCTAGAACAGATCGGTGGCAATAGCTGCGCAAGTGGTCC



CGTTGGTAAGGCTTTATCTGATGCCTGTATGGGAGCAGTAGCCAGTTTCCTTACAAAGTACCAGGAC



ATCATCCTCGAACACCAAAAGGTTATAAAAAAAAACGAAAAGAGATTGGCAAATCTAAAGGATAT



AGCAAGTGCAAACGGGCTTGCATTTCCTAAAATCACTCTTCCACCGCAACCGCATACAAAAGAAGG



GATTGAAGCTTATAACAATGTTGTTGCTCAGATAGTGATCTGGGTAAACCTGAATCTTTGGCAGAAA



CTCAAAATTGGCAGGGATGAGGCAAAGCCCTTACAGCGGCTTAAGGGTTTTCCGTCCTTCCCTCTTG



TTGAACGCCAGGCGAATGAGGTTGATTGGTGGGATATGGTCTGTAATGTCAAAAAGTTGATTAACG



AAAAGAAAGAGGACGGGAAGGTCTTCTGGCAAAATCTTGCTGGATATAAAAGGCAGGAAGCCTTG



CTTCCATATCTTTCGTCTGAAGAAGACCGTAAAAAAGGAAAAAAGTTTGCGCGTTATCAGTTTGGTG



ACCTTTTGCTTCACCTTGAAAAGAAACACGGTGAAGATTGGGGCAAAGTTTATGATGAGGCATGGG



AAAGAATAGATAAAAAAGTTGAAGGTCTGAGTAAGCACATAAAGTTGGAGGAAGAAAGAAGGTCT



GAAGATGCTCAATCAAAGGCTGCCCTCACTGATTGGCTCAGGGCAAAGGCCTCTTTTGTTATTGAAG



GGCTCAAAGAAGCTGATAAGGATGAGTTTTGCAGGTGTGAGTTAAAGCTTCAAAAGTGGTATGGAG



ATTTGAGAGGAAAACCATTTGCTATAGAAGCAGAGAACAGCATTTTAGATATAAGCGGATTTTCTA



AACAGTATAATTGTGCATTTATATGGCAGAAAGACGGCGTAAAGAAGTTAAATCTTTATTTAATAAT



AAATTACTTCAAAGGTGGTAAGCTACGCTTCAAAAAAATCAAGCCAGAAGCTTTTGAAGCAAATAG



GTTTTATACAGTAATTAATAAAAAAAGCGGTGAGATTGTGCCTATGGAGGTCAACTTCAATTTTGAT



GACCCGAATTTGATAATTCTGCCTTTGGCCTTTGGAAAAAGGCAGGGGAGGGAGTTTATCTGGAAC



GACCTATTGAGCCTTGAGACGGGTTCATTGAAACTCGCCAATGGCAGGGTTATTGAAAAAACGCTC



TATAACAGAAGGACGAGACAGGATGAACCAGCACTTTTTGTTGCCCTGACATTTGAAAGAAGAGAG



GTGCTTGACTCATCGAATATAAAACCGATGAATCTGATAGGAATAGACCGGGGAGAAAATATCCCG



GCAGTCATAGCATTAACAGACCCGGAAGGATGCCCCTTGTCAAGATTCAAAGATTCATTGGGCAAT



CCAACGCATATTTTGCGAATAGGAGAAAGTTATAAGGAAAAACAACGGACTATTCAGGCTGCTAAA



GAAGTTGAACAAAGGCGGGCAGGCGGATATTCGAGAAAATATGCATCAAAGGCGAAGAATCTGGC



GGACGATATGGTAAGAAATACAGCTCGTGACCTCTTATATTATGCTGTTACTCAAGATGCAATGCTC



ATTTTTGAAAATCTTTCCCGCGGTTTTGGTAGACAAGGCAAGAGGACTTTTATGGCGGAAAGGCAGT



ACACGAGGATGGAAGACTGGCTGACTGCAAAGCTTGCCTATGAAGGTCTGCCATCAAAAACCTATC



TTTCAAAGACTCTGGCACAGTATACCTCAAAGACATGTTCTAATTGTGGTTTTACAATCACAAGTGC



AGATTATGACAGGGTGCTCGAAAAGCTCAAGAAGACGGCTACTGGATGGATGACTACAATCAATGG



AAAAGAGTTAAAAGTTGAAGGACAGATAACATACTATAACCGGTATAAAAGGCAGAATGTGGTAA



AAGACCTCTCTGTAGAGCTGGATAGACTTTCGGAAGAGTCGGTAAATAATGATATTTCTAGTTGGAC



AAAAGGCCGCAGTGGTGAAGCTTTATCTCTGCTAAAAAAGAGATTTAGTCACAGGCCGGTGCAGGA



AAAGTTTGTTTGCCTGAACTGTGGTTTTGAAACCCATGCAGACGAACAAGCAGCACTGAATATTGC



AAGGTCGTGGCTCTTTCTCCGTTCTCAAGAATATAAGAAGTATCAAACCAATAAAACGACCGGAAA



TACTGACAAAAGGGCATTTGTTGAAACATGGCAATCCTTTTACAGAAAGAAGCTCAAAGAAGTATG



GAAACCA





SEQ ID NO: 41
ATGGGTAAAATGTATTACCTTGGTTTAGACATTGGCACGAATTCCGTGGGCTACGCGGTGACCGACC



CCTCATACCACCTGCTGAAGTTTAAGGGGGAACCAATGTGGGGTGCGCACGTATTTGCCGCCGGTA



ATCAGAGCGCGGAACGACGCTCGTTCCGCACATCGCGTCGTCGTTTGGACCGACGCCAACAGCGCG



TTAAACTGGTACAGGAGATTTTTGCCCCGGTGATTAGTCCGATCGACCCACGCTTCTTCATTCGTCT



GCATGAATCCGCCCTGTGGCGCGATGACGTCGCGGAGACGGATAAACATATCTTTTTCAATGATCCT



ACCTATACCGATAAGGAATATTATAGCGATTACCCGACTATCCATCACCTGATCGTTGATCTGATGG



AAAGCTCTGAGAAACACGATCCGCGGCTGGTGTACCTTGCAGTGGCGTGGTTAGTGGCACACCGTG



GTCATTTTCTGAACGAGGTGGACAAGGATAATATTGGAGATGTGTTGTCGTTCGACGCATTTTATCC



GGAGTTTCTCGCGTTCCTGTCGGACAACGGTGTATCACCGTGGGTGTGCGAAAGCAAAGCGCTGCA



GGCGACCTTGCTGAGCCGTAACTCAGTGAACGACAAATATAAAGCCCTTAAGTCTCTGATCTTCGG



ATCCCAGAAACCTGAAGATAACTTCGATGCCAATATTTCGGAAGATGGACTCATTCAACTGCTGGC



CGGCAAAAAGGTAAAAGTTAACAAACTGTTCCCTCAGGAATCGAACGATGCATCCTTCACATTGAA



TGATAAAGAAGACGCGATAGAAGAAATCCTGGGTACGCTTACACCAGATGAATGTGAATGGATTGC



GCATATACGCCGCCTTTTTGACTGGGCTATCATGAAACATGCTCTGAAAGATGGCAGGACTATTAGC



GAGTCAAAAGTCAAACTGTATGAGCAGCACCATCACGATCTGACCCAACTTAAATACTTCGTGAAA



ACCTACCTTGCAAAAGAATACGACGATATTTTCCGCAACGTGGATAGCGAAACAACGAAAAACTAT



GTAGCGTATTCCTATCATGTGAAAGAGGTGAAAGGCACTCTGCCTAAAAATAAGGCAACGCAAGAA



GAGTTTTGTAAGTATGTCCTGGGCAAGGTTAAAAACATTGAATGCTCTGAAGCAGACAAGGTTGAC



TTTGATGAGATGATTCAGCGTCTTACCGACAACTCTTTTATGCCTAAGCAGGTTTCGGGCGAAAACC



GCGTTATTCCTTATCAGTTATATTATTATGAACTGAAGACAATTCTGAATAAAGCAGCCTCGTACCT



GCCTTTCCTGACGCAGTGTGGAAAAGATGCAATTTCGAACCAGGACAAACTACTGTCGATCATGAC



GTTCCGTATTCCTTACTTCGTCGGACCCTTGCGAAAAGATAATTCGGAACATGCATGGCTCGAACGA



AAGGCCGGTAAGATTTATCCGTGGAACTTTAACGACAAAGTGGACTTGGATAAATCAGAAGAAGCG



TTCATTCGCCGAATGACCAATACCTGTACCTATTATCCCGGCGAAGATGTTTTACCGTTGGATTCGC



TGATCTATGAGAAATTTATGATTTTAAATGAAATCAATAATATTCGTATTGACGGCTACCCGATTAG



TGTTGACGTTAAACAGCAGGTTTTTGGCTTGTTCGAAAAAAAACGACGCGTAACCGTGAAAGATAT



TCAGAACCTGCTGCTGTCTCTCGGAGCTCTGGACAAACACGGGAAGCTGACAGGCATCGATACCAC



TATCCACTCAAACTATAATACGTATCACCATTTTAAATCTCTCATGGAACGCGGCGTCCTGACCCGG



GATGACGTGGAACGCATCGTTGAAAGGATGACCTACAGCGACGATACTAAGCGTGTGCGTCTGTGG



CTGAATAACAACTATGGTACTTTAACCGCCGACGATGTGAAACACATTTCGCGTCTGCGCAAACAC



GATTTTGGCCGTTTATCCAAAATGTTCTTAACAGGTCTGAAGGGTGTCCATAAGGAGACCGGTGAAC



GTGCCTCCATACTGGATTTCATGTGGAACACGAACGATAACCTGATGCAGCTCCTTTCCGAATGCTA



CACGTTCAGTGATGAAATCACAAAGCTGCAAGAGGCGTATTATGCAAAAGCCCAGTTGTCTTTAAA



CGATTTTTTAGACTCGATGTACATCTCTAACGCGGTGAAACGTCCGATTTACAGAACTCTGGCAGTG



GTGAACGATATTCGAAAAGCATGTGGGACGGCCCCTAAACGCATTTTCATCGAAATGGCTCGTGAT



GGTGAATCAAAAAAAAAGAGAAGTGTTACACGTCGCGAGCAGATCAAAAACCTGTACCGCTCGATT



CGTAAAGATTTCCAGCAGGAAGTTGATTTTCTGGAAAAGATCCTGGAAAATAAATCTGATGGTCAA



CTTCAGTCAGATGCTTTGTATCTTTACTTTGCACAATTAGGGCGCGATATGTACACGGGCGATCCAA



TAAAGCTGGAGCACATCAAAGATCAGAGTTTCTATAACATAGACCATATTTACCCGCAGTCTATGGT



GAAAGACGATTCCCTAGATAACAAAGTGCTGGTGCAAAGCGAAATTAACGGCGAGAAAAGCTCGC



GATACCCTTTGGACGCCGCGATCCGCAATAAAATGAAGCCCCTTTGGGACGCTTACTATAATCATGG



CCTGATCTCCTTAAAGAAATACCAGCGTCTAACGCGCTCGACCCCGTTTACCGATGATGAAAAATG



GGACTTTATTAATCGCCAGTTAGTGGAAACCCGTCAATCTACCAAAGCGCTGGCCATTTTGTTGAAG



CGTAAGTTTCCAGACACCGAAATTGTGTATTCGAAGGCGGGGTTATCGTCCGACTTCAGACATGAAT



TCGGCCTTGTAAAAAGTCGCAATATTAATGATTTGCACCACGCTAAAGACGCATTCTTGGCTATCGT



TACCGGCAATGTGTACCATGAAAGATTCAATCGCAGATGGTTTATGGTGAACCAGCCGTACTCAGTT



AAAACTAAAACTCTTTTTACCCACAGCATAAAGAATGGCAACTTCGTTGCCTGGAACGGCGAAGAA



GATCTCGGTCGTATTGTAAAAATGCTGAAGCAAAACAAAAATACCATTCACTTCACGCGCTTCTCCT



TCGATCGCAAAGAAGGATTATTTGATATCCAACCTCTGAAAGCCAGCACCGGCTTAGTCCCACGAA



AAGCCGGTCTGGATGTCGTTAAATACGGCGGATATGACAAATCTACCGCGGCCTATTACCTGCTGGT



GAGGTTCACGCTCGAGGACAAGAAAACCCAGCACAAGCTGATGATGATTCCTGTAGAAGGCCTGTA



CAAGGCTCGCATTGATCATGACAAGGAATTTCTTACCGATTATGCGCAAACGACTATAAGCGAAAT



CCTACAGAAAGATAAACAGAAAGTGATCAATATTATGTTTCCAATGGGTACGAGGCATATAAAACT



CAATTCAATGATTAGTATCGATGGCTTCTATCTTAGTATCGGCGGAAAGTCCTCTAAAGGTAAGTCA



GTTCTATGTCACGCAATGGTTCCACTGATCGTCCCTCACAAAATCGAATGTTACATTAAAGCAATGG



AAAGCTTCGCCCGGAAGTTTAAAGAAAACAACAAGCTGCGCATCGTAGAAAAATTCGATAAAATCA



CCGTTGAAGACAACCTGAATCTCTACGAGCTCTTTCTCCAAAAACTGCAGCATAATCCCTATAATAA



GTTTTTTTCGACACAGTTTGACGTACTGACGAACGGCCGTTCTACTTTCACAAAACTGTCGCCGGAG



GAACAGGTACAGACGCTCTTGAACATTTTAAGTATCTTTAAAACATGCCGCAGTTCGGGTTGCGACC



TGAAATCCATCAACGGCAGTGCCCAGGCAGCGCGCATCATGATTAGCGCTGACTTAACTGGACTGT



CGAAAAAATATTCAGATATTAGGTTGGTTGAACAGTCAGCTTCTGGTTTGTTCGTATCCAAAAGTCA



GAACTTACTGGAGTATCTCTAA





SEQ ID NO: 42
ATGTCATCGCTCACGAAATTCACTAACAAATACTCTAAACAGCTCACCATTAAGAATGAACTCATCC



CAGTTGGCAAAACACTGGAGAACATCAAAGAGAATGGTCTGATAGATGGCGACGAACAGCTGAAT



GAGAATTATCAGAAGGCGAAAATTATTGTGGATGATTTTCTGCGGGACTTCATTAATAAAGCACTG



AATAATACGCAGATCGGGAACTGGCGCGAACTGGCGGATGCCCTTAATAAAGAGGATGAAGATAA



CATCGAGAAATTGCAGGATAAAATTCGGGGAATCATTGTATCCAAATTTGAAACGTTTGATCTGTTT



AGCAGCTATTCTATTAAGAAAGATGAAAAGATTATTGACGACGACAATGATGTTGAAGAAGAGGA



ACTGGATCTGGGCAAGAAGACCAGCTCATTTAAATACATATTTAAAAAAAACCTGTTTAAGTTAGT



GTTGCCATCCTACCTGAAAACCACAAACCAGGACAAGCTGAAGATTATTAGCTCGTTTGATAATTTT



TCAACGTACTTCCGCGGGTTCTTTGAAAACCGGAAAAACATTTTTACCAAGAAACCGATCTCCACAA



GTATTGCGTATCGCATTGTTCATGATAACTTCCCGAAATTCCTTGATAACATTCGTTGTTTTAATGTG



TGGCAGACGGAATGCCCGCAACTAATCGTGAAAGCAGATAACTATCTGAAAAGCAAAAATGTTATA



GCGAAAGATAAAAGTTTGGCAAACTATTTTACCGTGGGCGCGTATGACTATTTCCTGTCTCAGAATG



GTATAGATTTTTACAACAATATTATAGGTGGACTGCCAGCGTTCGCCGGCCATGAGAAAATCCAAG



GTCTCAATGAATTCATCAATCAAGAGTGCCAAAAAGACAGCGAGCTGAAAAGTAAGCTGAAAAAC



CGTCACGCGTTCAAAATGGCGGTACTGTTCAAACAGATACTCAGCGATCGTGAAAAAAGTTTTGTA



ATTGATGAGTTCGAGTCGGATGCTCAAGTTATTGACGCCGTTAAAAACTTTTACGCCGAACAGTGCA



AAGATAACAATGTTATTTTTAACTTATTAAATCTTATCAAGAATATCGCTTTCTTAAGTGATGACGA



ACTGGACGGCATATTCATTGAAGGGAAATACCTGTCGAGCGTTAGTCAAAAACTCTATAGCGATTG



GTCAAAATTACGTAACGACATTGAGGATTCGGCTAACTCTAAACAAGGCAATAAAGAGCTGGCCAA



GAAGATCAAAACCAACAAAGGGGATGTAGAAAAAGCGATCTCGAAATATGAGTTCTCGCTGTCGG



AACTGAACTCGATTGTACATGATAACACCAAGTTTTCTGACCTCCTTAGTTGTACACTGCATAAGGT



GGCTTCTGAGAAACTGGTGAAGGTCAATGAAGGCGACTGGCCGAAACATCTCAAGAATAATGAAG



AGAAACAAAAAATCAAAGAGCCGCTTGATGCTCTGCTGGAGATCTATAATACACTTCTGATTTTTAA



CTGCAAAAGCTTCAATAAAAACGGCAACTTCTATGTCGACTATGATCGTTGCATCAATGAACTGAGT



TCGGTCGTGTATCTGTATAATAAAACACGTAACTATTGCACTAAAAAACCCTATAACACGGACAAG



TTCAAACTCAATTTTAACAGTCCGCAGCTCGGTGAAGGCTTTTCCAAGTCGAAAGAAAATGACTGTC



TGACTCTTTTGTTTAAAAAAGACGACAACTATTATGTAGGCATTATCCGCAAAGGTGCAAAAATCA



ATTTTGATGATACACAAGCAATCGCCGATAACACCGACAATTGCATCTTTAAAATGAATTATTTCCT



ACTTAAAGACGCAAAAAAATTTATCCCGAAATGTAGCATTCAGCTGAAAGAAGTCAAGGCCCATTT



TAAGAAATCTGAAGATGATTACATTTTGTCTGATAAAGAGAAATTTGCTAGCCCGCTGGTCATTAAA



AAGAGCACATTTTTGCTGGCAACTGCACATGTGAAAGGGAAAAAAGGCAATATCAAGAAATTTCAG



AAAGAATATTCGAAAGAAAACCCCACTGAGTATCGCAATTCTTTAAACGAATGGATTGCTTTTTGTA



AAGAGTTCTTAAAAACTTATAAAGCGGCTACCATTTTTGATATAACCACATTGAAAAAGGCAGAGG



AATATGCTGATATTGTAGAATTCTACAAGGATGTCGATAATCTGTGCTACAAACTGGAGTTCTGCCC



GATTAAAACCTCGTTTATAGAAAACCTGATAGATAACGGCGACCTGTATCTGTTTCGCATCAATAAC



AAAGACTTCAGCAGTAAATCGACCGGCACCAAGAACCTTCATACGTTATATTTACAAGCTATATTCG



ATGAACGTAATCTGAACAATCCGACAATTATGCTGAATGGGGGAGCAGAACTGTTCTATCGTAAAG



AAAGTATTGAGCAGAAAAACCGTATCACACACAAAGCCGGTTCAATTCTCGTGAATAAGGTGTGTA



AAGACGGTACAAGCCTGGATGATAAGATACGTAATGAAATTTATCAATATGAGAATAAATTTATTG



ATACCCTGTCTGATGAAGCTAAAAAGGTGTTACCGAATGTCATTAAAAAGGAAGCTACCCATGACA



TTACAAAAGATAAACGTTTCACTAGTGACAAATTCTTCTTTCACTGCCCCCTGACAATTAATTATAA



GGAAGGCGATACCAAGCAGTTCAATAACGAAGTGCTGAGTTTTCTGCGTGGAAATCCTGACATCAA



CATTATCGGCATTGACCGCGGAGAGCGTAATTTAATCTATGTAACGGTTATAAACCAGAAAGGCGA



GATTCTGGATTCGGTTTCATTCAATACCGTGACCAACAAGAGTTCAAAAATCGAGCAGACAGTCGA



TTATGAAGAGAAATTGGCAGTCCGCGAGAAAGAGAGGATTGAAGCAAAACGTTCCTGGGACTCTAT



CTCAAAAATTGCGACACTAAAGGAAGGTTATCTGAGCGCAATAGTTCACGAGATCTGTCTGTTAAT



GATTAAACACAACGCGATCGTTGTCTTAGAGAATCTTAATGCAGGCTTTAAGCGTATTCGTGGCGGT



TTATCAGAAAAAAGTGTTTATCAAAAATTCGAAAAAATGTTGATTAACAAACTGAACTATTTTGTCA



GCAAGAAGGAATCCGACTGGAATAAACCGTCTGGTCTGCTGAATGGACTGCAGCTTTCGGATCAGT



TTGAAAGCTTCGAAAAACTGGGTATTCAGTCTGGTTTTATTTTTTACGTGCCGGCTGCATATACCTCA



AAGATTGATCCGACCACGGGCTTCGCCAATGTTCTGAATCTGTCGAAGGTACGCAATGTTGATGCG



ATCAAAAGCTTTTTTTCTAACTTCAACGAAATTAGTTATAGCAAGAAAGAAGCCCTTTTCAAATTCT



CATTCGATCTGGATTCACTGAGTAAGAAAGGCTTTAGTAGCTTTGTGAAATTTAGTAAGAGTAAATG



GAACGTCTACACCTTTGGAGAACGTATCATAAAGCCAAAGAATAAGCAAGGTTATCGGGAGGACA



AAAGAATCAACTTGACCTTCGAGATGAAGAAGTTACTTAACGAGTATAAGGTTTCTTTTGATCTTGA



AAATAACTTGATTCCGAATCTCACGAGTGCCAACCTGAAGGATACTTTTTGGAAAGAGCTATTCTTT



ATCTTCAAGACTACGCTGCAGCTCCGTAACAGCGTTACTAACGGTAAAGAAGATGTGCTCATCTCTC



CGGTCAAAAATGCGAAGGGTGAATTCTTCGTTTCGGGAACGCATAACAAGACTCTTCCGCAAGATT



GCGATGCGAACGGTGCATACCATATTGCGTTGAAAGGTCTGATGATACTCGAACGTAACAACCTTG



TACGTGAGGAGAAAGATACGAAAAAGATTATGGCGATTTCAAACGTGGATTGGTTCGAGTACGTGC



AGAAACGTAGAGGCGTTCTGTAA





SEQ ID NO: 43
ATGAACAACTACGACGAATTCACCAAACTGTACCCGATCCAGAAAACCATCCGTTTCGAACTGAAA



CCGCAGGGTCGTACCATGGAACACCTGGAAACCTTCAACTTCTTCGAAGAAGACCGTGACCGTGCG



GAAAAATACAAAATCCTGAAAGAAGCGATCGACGAATACCACAAAAAATTCATCGACGAACACCT



GACCAACATGTCTCTGGACTGGAACTCTCTGAAACAGATCTCTGAAAAATACTACAAATCTCGTGA



AGAAAAAGACAAAAAAGTTTTCCTGTCTGAACAGAAACGTATGCGTCAGGAAATCGTTTCTGAATT



CAAAAAAGACGACCGTTTCAAAGACCTGTTCTCTAAAAAACTGTTCTCTGAACTGCTGAAAGAAGA



AATCTACAAAAAAGGTAACCACCAGGAAATCGACGCGCTGAAATCTTTCGACAAATTCTCTGGTTA



CTTCATCGGTCTGCACGAAAACCGTAAAAACATGTACTCTGACGGTGACGAAATCACCGCGATCTC



TAACCGTATCGTTAACGAAAACTTCCCGAAATTCCTGGACAACCTGCAGAAATACCAGGAAGCGCG



TAAAAAATACCCGGAATGGATCATCAAAGCGGAATCTGCGCTGGTTGCGCACAACATCAAAATGGA



CGAAGTTTTCTCTCTGGAATACTTCAACAAAGTTCTGAACCAGGAAGGTATCCAGCGTTACAACCTG



GCGCTGGGTGGTTACGTTACCAAATCTGGTGAAAAAATGATGGGTCTGAACGACGCGCTGAACCTG



GCGCACCAGTCTGAAAAATCTTCTAAAGGTCGTATCCACATGACCCCGCTGTTCAAACAGATCCTGT



CTGAAAAAGAATCTTTCTCTTACATCCCGGACGTTTTCACCGAAGACTCTCAGCTGCTGCCGTCTAT



CGGTGGTTTCTTCGCGCAGATCGAAAACGACAAAGACGGTAACATCTTCGACCGTGCGCTGGAACT



GATCTCTTCTTACGCGGAATACGACACCGAACGTATCTACATCCGTCAGGCGGACATCAACCGTGTT



TCTAACGTTATCTTCGGTGAATGGGGTACCCTGGGTGGTCTGATGCGTGAATACAAAGCGGACTCTA



TCAACGACATCAACCTGGAACGTACCTGCAAAAAAGTTGACAAATGGCTGGACTCTAAAGAATTCG



CGCTGTCTGACGTTCTGGAAGCGATCAAACGTACCGGTAACAACGACGCGTTCAACGAATACATCT



CTAAAATGCGTACCGCGCGTGAAAAAATCGACGCGGCGCGTAAAGAAATGAAATTCATCTCTGAAA



AAATCTCTGGTGACGAAGAATCTATCCACATCATCAAAACCCTGCTGGACTCTGTTCAGCAGTTCCT



GCACTTCTTCAACCTGTTCAAAGCGCGTCAGGACATCCCGCTGGACGGTGCGTTCTACGCGGAATTC



GACGAAGTTCACTCTAAACTGTTCGCGATCGTTCCGCTGTACAACAAAGTTCGTAACTACCTGACCA



AAAACAACCTGAACACCAAAAAAATCAAACTGAACTTCAAAAACCCGACCCTGGCGAACGGTTGG



GACCAGAACAAAGTTTACGACTACGCGTCTCTGATCTTCCTGCGTGACGGTAACTACTACCTGGGTA



TCATCAACCCGAAACGTAAAAAAAACATCAAATTCGAACAGGGTTCTGGTAACGGTCCGTTCTACC



GTAAAATGGTTTACAAACAGATCCCGGGTCCGAACAAAAACCTGCCGCGTGTTTTCCTGACCTCTAC



CAAAGGTAAAAAAGAATACAAACCGTCTAAAGAAATCATCGAAGGTTACGAAGCGGACAAACACA



TCCGTGGTGACAAATTCGACCTGGACTTCTGCCACAAACTGATCGACTTCTTCAAAGAATCTATCGA



AAAACACAAAGACTGGTCTAAATTCAACTTCTACTTCTCTCCGACCGAATCTTACGGTGACATCTCT



GAATTCTACCTGGACGTTGAAAAACAGGGTTACCGTATGCACTTCGAAAACATCTCTGCGGAAACC



ATCGACGAATACGTTGAAAAAGGTGACCTGTTCCTGTTCCAGATCTACAACAAAGACTTCGTTAAA



GCGGCGACCGGTAAAAAAGACATGCACACCATCTACTGGAACGCGGCGTTCTCTCCGGAAAACCTG



CAGGACGTTGTTGTTAAACTGAACGGTGAAGCGGAACTGTTCTACCGTGACAAATCTGACATCAAA



GAAATCGTTCACCGTGAAGGTGAAATCCTGGTTAACCGTACCTACAACGGTCGTACCCCGGTTCCG



GACAAAATCCACAAAAAACTGACCGACTACCACAACGGTCGTACCAAAGACCTGGGTGAAGCGAA



AGAATACCTGGACAAAGTTCGTTACTTCAAAGCGCACTACGACATCACCAAAGACCGTCGTTACCT



GAACGACAAAATCTACTTCCACGTTCCGCTGACCCTGAACTTCAAAGCGAACGGTAAAAAAAACCT



GAACAAAATGGTTATCGAAAAATTCCTGTCTGACGAAAAAGCGCACATCATCGGTATCGACCGTGG



TGAACGTAACCTGCTGTACTACTCTATCATCGACCGTTCTGGTAAAATCATCGACCAGCAGTCTCTG



AACGTTATCGACGGTTTCGACTACCGTGAAAAACTGAACCAGCGTGAAATCGAAATGAAAGACGCG



CGTCAGTCTTGGAACGCGATCGGTAAAATCAAAGACCTGAAAGAAGGTTACCTGTCTAAAGCGGTT



CACGAAATCACCAAAATGGCGATCCAGTACAACGCGATCGTTGTTATGGAAGAACTGAACTACGGT



TTCAAACGTGGTCGTTTCAAAGTTGAAAAACAGATCTACCAGAAATTCGAAAACATGCTGATCGAC



AAAATGAACTACCTGGTTTTCAAAGACGCGCCGGACGAATCTCCGGGTGGTGTTCTGAACGCGTAC



CAGCTGACCAACCCGCTGGAATCTTTCGCGAAACTGGGTAAACAGACCGGTATCCTGTTCTACGTTC



CGGCGGCGTACACCTCTAAAATCGACCCGACCACCGGTTTCGTTAACCTGTTCAACACCTCTTCTAA



AACCAACGCGCAGGAACGTAAAGAATTCCTGCAGAAATTCGAATCTATCTCTTACTCTGCGAAAGA



CGGTGGTATCTTCGCGTTCGCGTTCGACTACCGTAAATTCGGTACCTCTAAAACCGACCACAAAAAC



GTTTGGACCGCGTACACCAACGGTGAACGTATGCGTTACATCAAAGAAAAAAAACGTAACGAACTG



TTCGACCCGTCTAAAGAAATCAAAGAAGCGCTGACCTCTTCTGGTATCAAATACGACGGTGGTCAG



AACATCCTGCCGGACATCCTGCGTTCTAACAACAACGGTCTGATCTACACCATGTACTCTTCTTTCA



TCGCGGCGATCCAGATGCGTGTTTACGACGGTAAAGAAGACTACATCATCTCTCCGATCAAAAACT



CTAAAGGTGAATTCTTCCGTACCGACCCGAAACGTCGTGAACTGCCGATCGACGCGGACGCGAACG



GTGCGTACAACATCGCGCTGCGTGGTGAACTGACCATGCGTGCGATCGCGGAAAAATTCGACCCGG



ACTCTGAAAAAATGGCGAAACTGGAACTGAAACACAAAGACTGGTTCGAATTCATGCAGACCCGTG



GTGACTAA





SEQ ID NO: 44
ATGACTAAAACATTTGATTCAGAGTTTTTTAATTTGTACTCGCTGCAAAAAACGGTACGCTTTGAGT



TAAAACCCGTGGGAGAAACCGCGTCATTTGTGGAAGACTTTAAAAACGAGGGCTTGAAACGTGTTG



TGAGCGAAGATGAAAGGCGAGCCGTCGATTACCAGAAAGTTAAGGAAATAATTGACGATTACCATC



GGGATTTCATTGAAGAAAGTTTAAATTATTTTCCGGAACAGGTGAGTAAAGATGCTCTTGAGCAGG



CGTTTCATCTTTATCAGAAACTGAAGGCAGCAAAAGTTGAGGAAAGGGAAAAAGCGCTGAAAGAA



TGGGAAGCGCTGCAGAAAAAGCTACGTGAAAAAGTGGTGAAATGCTTCTCGGACTCGAATAAAGC



CCGCTTCTCAAGGATTGATAAAAAGGAACTGATTAAGGAAGACCTGATAAATTGGTTGGTCGCCCA



GAATCGCGAGGATGATATCCCTACGGTCGAAACGTTTAACAACTTCACCACATATTTTACCGGCTTC



CATGAGAATCGTAAAAATATTTACTCCAAAGATGATCACGCCACCGCTATTAGCTTTCGCCTTATTC



ATGAAAATCTTCCAAAGTTTTTTGACAACGTGATTAGCTTCAATAAGTTGAAAGAGGGTTTCCCTGA



ATTAAAATTTGATAAAGTGAAAGAGGATTTAGAAGTAGATTATGATCTGAAGCATGCGTTTGAAAT



AGAATATTTCGTTAACTTCGTGACCCAAGCGGGCATAGATCAGTATAATTATCTGTTAGGAGGGAA



AACCCTGGAGGACGGGACGAAAAAACAAGGGATGAATGAGCAAATTAATCTGTTCAAACAACAGC



AAACGCGAGATAAAGCGCGTCAGATTCCCAAACTGATCCCCCTGTTCAAACAGATTCTTAGCGAAA



GGACTGAAAGCCAGTCCTTTATTCCTAAACAATTTGAAAGTGATCAGGAGTTGTTCGATTCACTGCA



GAAGTTACATAATAACTGCCAGGATAAATTCACCGTGCTGCAACAAGCCATTCTCGGTCTGGCAGA



GGCGGATCTTAAGAAGGTCTTCATCAAAACCTCTGATTTAAATGCCTTATCTAACACCATTTTCGGG



AATTACAGCGTCTTTTCCGATGCACTGAACCTGTATAAAGAAAGCCTGAAAACGAAAAAAGCGCAG



GAGGCTTTTGAGAAACTACCGGCCCATTCTATTCACGACCTCATTCAATACTTGGAACAGTTCAATT



CCAGCCTGGACGCGGAAAAACAACAGAGCACCGACACCGTCCTGAACTACTTCATCAAGACCGATG



AATTATATTCTCGCTTCATTAAATCCACTAGCGAGGCTTTCACTCAGGTGCAGCCTTTGTTCGAACTG



GAAGCCCTGTCATCTAAGCGCCGCCCACCGGAATCGGAAGATGAAGGGGCAAAAGGGCAGGAAGG



CTTCGAGCAGATCAAGCGTATTAAAGCTTACCTGGATACGCTTATGGAAGCGGTACACTTTGCAAA



GCCGTTGTATCTTGTTAAGGGTCGTAAAATGATCGAAGGGCTCGATAAAGACCAGTCCTTTTATGAA



GCGTTTGAAATGGCGTACCAAGAACTTGAATCGTTAATCATTCCTATCTATAACAAAGCGCGGAGCT



ATCTGTCGCGGAAACCTTTCAAGGCCGATAAATTCAAGATTAATTTTGACAACAACACGCTACTGA



GCGGATGGGATGCGAACAAGGAAACTGCTAACGCGTCCATTCTGTTTAAGAAAGACGGGTTATATT



ACCTTGGAATTATGCCGAAAGGTAAGACCTTTCTCTTTGACTACTTTGTATCGAGCGAGGATTCAGA



GAAACTGAAACAGCGTCGCCAGAAGACCGCCGAAGAAGCTCTGGCGCAGGATGGTGAAAGTTACT



TCGAAAAAATTCGTTATAAACTGTTACCAGGGGCTTCAAAGATGTTACCGAAAGTCTTTTTTAGCAA



CAAAAATATTGGCTTTTACAACCCGTCGGATGACATTTTACGCATTCGCAACACAGCCTCTCACACC



AAAAACGGGACCCCTCAGAAAGGCCACTCAAAAGTTGAGTTTAACCTGAATGATTGTCATAAGATG



ATTGATTTCTTCAAATCATCAATTCAGAAACACCCGGAATGGGGGTCTTTTGGCTTTACGTTTTCTGA



TACCAGTGATTTTGAAGACATGAGTGCCTTCTACCGGGAAGTAGAAAACCAGGGTTACGTAATTAG



CTTTGACAAAATCAAAGAGACCTATATACAGAGCCAGGTGGAACAGGGTAATCTCTACTTATTCCA



GATTTATAACAAGGATTTCTCGCCCTACAGCAAAGGCAAACCAAACCTGCATACTCTGTACTGGAA



AGCCCTGTTTGAAGAAGCGAACCTGAATAACGTAGTGGCGAAGTTGAACGGTGAAGCGGAAATCTT



CTTCCGTCGTCACTCCATTAAGGCCTCTGATAAAGTTGTCCATCCGGCAAATCAGGCCATTGATAAT



AAGAATCCACACACGGAAAAAACGCAGTCAACCTTTGAATATGACCTCGTTAAAGACAAACGCTAC



ACGCAAGATAAGTTCTTTTTCCACGTCCCAATCAGCCTCAACTTTAAAGCACAAGGGGTTTCAAAGT



TTAATGATAAAGTCAATGGGTTCCTCAAGGGCAACCCGGATGTCAACATTATAGGTATAGACAGGG



GCGAACGCCATCTGCTTTACTTTACCGTAGTGAATCAGAAAGGTGAAATACTGGTTCAGGAATCATT



AAATACCTTGATGTCGGACAAAGGGCACGTTAATGATTACCAGCAGAAACTGGATAAAAAAGAAC



AGGAACGTGATGCTGCGCGTAAATCGTGGACCACGGTTGAGAACATTAAAGAGCTGAAAGAGGGG



TATCTAAGCCATGTGGTACACAAACTGGCGCACCTCATCATTAAATATAACGCAATAGTCTGCCTAG



AAGACTTGAATTTTGGCTTTAAACGCGGCCGCTTCAAAGTGGAAAAACAAGTTTATCAAAAATTTG



AAAAGGCGCTTATAGATAAACTGAATTATCTGGTTTTTAAAGAAAAGGAACTTGGTGAGGTAGGGC



ACTACTTGACAGCTTATCAACTGACGGCCCCGTTCGAATCATTCAAAAAACTGGGCAAACAGTCTG



GCATTCTGTTTTACGTGCCGGCAGATTATACTTCAAAAATCGATCCAACAACTGGCTTTGTGAACTT



CCTGGACCTGAGATATCAGTCTGTAGAAAAAGCTAAACAACTTCTTAGCGATTTTAATGCCATTCGT



TTTAACAGCGTTCAGAATTACTTTGAATTCGAAATTGACTATAAAAAACTTACTCCGAAACGTAAAG



TCGGAACCCAAAGTAAATGGGTAATTTGTACGTATGGCGATGTCAGGTATCAGAACCGTCGGAATC



AAAAAGGTCATTGGGAGACCGAAGAAGTGAACGTGACCGAAAAGCTGAAGGCTCTGTTCGCCAGC



GATTCAAAAACTACAACTGTGATCGATTACGCAAATGATGATAACCTGATAGATGTGATTTTAGAG



CAGGATAAAGCCAGCTTTTTTAAAGAACTGTTGTGGCTCCTGAAACTTACGATGACCTTACGACATT



CCAAGATCAAATCGGAAGATGATTTTATTCTGTCACCGGTCAAGAATGAGCAGGGTGAATTCTATG



ATAGTAGGAAAGCCGGCGAAGTGTGGCCGAAAGACGCCGACGCCAATGGCGCCTATCATATCGCG



CTCAAAGGGCTTTGGAATTTGCAGCAGATTAACCAGTGGGAAAAAGGTAAAACCCTGAATCTGGCT



ATCAAAAACCAGGATTGGTTTAGCTTTATCCAAGAGAAACCGTATCAGGAATGA





SEQ ID NO: 45
ATGCATACAGGCGGTCTTCTTAGTATGGACGCGAAAGAGTTCACAGGTCAGTATCCGTTGTCGAAA



ACATTACGATTCGAACTTCGGCCCATCGGCCGCACGTGGGATAACCTGGAGGCCTCAGGCTACTTA



GCGGAAGACCGCCATCGTGCCGAATGTTATCCTCGTGCGAAAGAGTTATTGGATGACAACCATCGT



GCCTTCCTGAATCGTGTGTTGCCACAAATCGATATGGATTGGCACCCGATTGCGGAGGCCTTTTGTA



AGGTACATAAAAACCCTGGTAATAAAGAACTTGCCCAGGATTACAACCTTCAGTTGTCAAAGCGCC



GTAAGGAGATCAGCGCATATCTTCAGGATGCAGATGGCTATAAAGGCCTGTTCGCGAAGCCCGCCT



TAGACGAAGCTATGAAAATTGCGAAAGAAAACGGGAACGAAAGTGATATTGAGGTTCTCGAAGCG



TTTAACGGTTTTAGCGTATACTTCACCGGTTATCATGAGTCACGCGAGAACATTTATAGCGATGAGG



ATATGGTGAGCGTAGCCTACCGAATTACTGAGGATAATTTCCCGCGCTTTGTCTCAAACGCTTTGAT



CTTTGATAAATTAAACGAAAGCCATCCGGATATTATCTCTGAAGTATCGGGCAATCTTGGAGTTGAT



GACATTGGTAAGTACTTTGACGTGTCGAACTATAACAATTTTCTTTCCCAGGCCGGTATAGATGACT



ACAATCACATTATTGGCGGCCATACAACCGAAGACGGACTGATACAAGCGTTTAATGTCGTATTGA



ACTTACGTCACCAAAAAGACCCTGGCTTTGAAAAAATTCAGTTCAAACAGCTCTACAAACAAATCC



TGAGCGTGCGTACCAGCAAAAGCTACATCCCGAAACAGTTTGACAACTCTAAGGAGATGGTTGACT



GCATTTGCGATTATGTCAGCAAAATAGAGAAATCCGAAACAGTAGAACGGGCCCTGAAACTAGTCC



GTAATATCAGTTCTTTCGACTTGCGCGGGATCTTTGTCAATAAAAAGAACTTGCGCATACTGAGCAA



CAAACTGATAGGAGATTGGGACGCGATCGAAACCGCATTGATGCATAGTTCTTCATCAGAAAACGA



TAAGAAAAGCGTATATGATAGCGCGGAGGCTTTTACGTTGGATGACATCTTTTCAAGCGTGAAAAA



ATTTTCTGATGCCTCTGCCGAAGATATTGGCAACAGGGCGGAAGACATCTGTAGAGTGATAAGTGA



GACGGCCCCTTTTATCAACGATCTGCGAGCGGTGGACCTGGATAGCCTGAACGACGATGGTTATGA



AGCGGCCGTCTCAAAAATTCGGGAGTCGCTGGAGCCTTATATGGATCTTTTCCATGAACTGGAAATT



TTCTCGGTTGGCGATGAGTTCCCAAAATGCGCAGCATTTTACAGCGAACTGGAGGAAGTCAGCGAA



CAGCTGATCGAAATTATTCCGTTATTCAACAAGGCGCGTTCGTTCTGCACCCGGAAACGCTATAGCA



CCGATAAGATTAAAGTGAACTTAAAATTCCCGACCTTGGCGGACGGGTGGGACCTGAACAAAGAGA



GAGACAACAAAGCCGCGATTCTGCGGAAAGACGGTAAGTATTATCTGGCAATTCTGGATATGAAGA



AAGATCTGTCAAGCATTAGGACCAGCGACGAAGATGAATCCAGCTTCGAAAAGATGGAGTATAAA



CTGTTACCGAGTCCAGTAAAAATGCTGCCAAAGATATTCGTAAAATCGAAAGCCGCTAAGGAAAAA



TATGGCCTGACAGATCGTATGCTTGAATGCTACGATAAAGGTATGCATAAGTCGGGTAGTGCGTTTG



ATCTTGGCTTTTGCCATGAACTCATTGATTATTACAAGCGTTGTATCGCGGAGTACCCAGGCTGGGA



TGTGTTCGATTTCAAGTTTCGCGAAACTTCCGATTATGGGTCCATGAAAGAGTTCAATGAAGATGTG



GCCGGAGCCGGTTACTATATGAGTCTGAGAAAAATTCCGTGCAGCGAAGTGTACCGTCTGTTAGAC



GAGAAATCGATTTATCTATTTCAAATTTATAACAAAGATTACTCTGAAAATGCACATGGTAATAAGA



ACATGCATACCATGTACTGGGAGGGTCTCTTTTCCCCGCAAAACCTGGAGTCGCCCGTTTTCAAGTT



GTCGGGTGGGGCAGAACTTTTCTTTCGAAAATCCTCAATCCCTAACGATGCCAAAACAGTACACCC



GAAAGGCTCAGTGCTGGTTCCACGTAATGATGTTAACGGTCGGCGTATTCCAGATTCAATCTACCGC



GAACTGACACGCTATTTTAACCGTGGCGATTGCCGAATCAGTGACGAAGCCAAAAGTTATCTTGAC



AAGGTTAAGACTAAAAAAGCGGACCATGACATTGTGAAAGATCGCCGCTTTACCGTGGATAAAATG



ATGTTCCACGTCCCGATTGCGATGAACTTTAAGGCGATCAGTAAACCGAACTTAAACAAAAAAGTC



ATTGATGGCATCATTGATGATCAGGATCTGAAAATCATTGGTATTGATCGTGGCGAGCGGAACTTA



ATTTACGTCACGATGGTTGACAGAAAAGGGAATATCTTATATCAGGATTCTCTTAACATCCTCAATG



GCTACGACTATCGTAAAGCTCTGGATGTGCGCGAATATGACAACAAGGAAGCGCGTCGTAACTGGA



CTAAAGTGGAGGGCATTCGCAAAATGAAGGAAGGCTATCTGTCATTAGCGGTCTCGAAATTAGCGG



ATATGATTATCGAAAATAACGCCATCATCGTTATGGAGGACCTGAACCACGGATTCAAAGCGGGCC



GCTCAAAGATTGAAAAACAAGTTTATCAGAAATTTGAGAGTATGCTGATTAACAAACTGGGCTATA



TGGTGTTAAAAGACAAGTCAATTGACCAATCAGGTGGCGCGCTGCATGGATACCAGCTGGCGAACC



ATGTTACCACCTTAGCATCAGTTGGAAAGCAGTGTGGGGTTATCTTTTATATACCGGCAGCGTTCAC



TAGTAAAATAGATCCGACCACTGGTTTCGCCGATCTCTTTGCCCTGAGTAACGTTAAAAACGTAGCG



AGCATGCGTGAATTCTTTTCCAAAATGAAATCTGTCATTTATGATAAAGCTGAAGGCAAATTCGCAT



TCACCTTTGATTACTTGGATTACAACGTGAAGAGCGAATGTGGTCGTACGCTGTGGACCGTTTACAC



CGTTGGTGAGCGCTTCACCTATTCCCGTGTGAACCGCGAATATGTACGTAAAGTCCCCACCGATATT



ATCTATGATGCCCTCCAGAAAGCAGGCATTAGCGTCGAAGGAGACTTAAGGGACAGAATTGCCGAA



AGCGATGGCGATACGCTGAAGTCTATTTTTTACGCATTCAAATACGCGCTAGATATGCGCGTTGAGA



ATCGCGAGGAAGACTACATTCAATCACCTGTGAAAAATGCCTCTGGGGAATTTTTTTGTTCAAAAAA



TGCTGGTAAAAGCCTCCCACAAGATAGCGATGCAAACGGTGCATATAACATTGCCCTGAAAGGTAT



TCTTCAATTACGCATGCTGTCTGAGCAGTACGACCCCAACGCGGAATCTATTAGACTTCCGCTGATA



ACCAATAAAGCCTGGCTGACATTCATGCAGTCTGGCATGAAGACCTGGAAAAATTAG





SEQ ID NO: 46
ATGGATAGTTTAAAAGATTTTACGAATCTATATCCCGTAAGCAAAACTCTTCGTTTTGAACTGAAAC



CTGTTGGAAAAACGTTGGAGAATATCGAGAAAGCGGGCATCCTGAAAGAAGACGAGCACCGTGCC



GAAAGCTACAGGCGTGTCAAAAAGATTATCGATACTTATCACAAAGTGTTCATTGATAGCAGTCTG



GAGAACATGGCAAAAATGGGCATAGAAAATGAAATCAAAGCAATGCTGCAGAGCTTTTGCGAGCT



CTACAAGAAAGATCACCGAACGGAAGGTGAAGATAAAGCACTGGACAAAATTCGCGCCGTTCTTCG



CGGTCTGATTGTTGGCGCGTTCACCGGCGTGTGCGGCCGCCGTGAAAACACCGTGCAGAACGAAAA



GTACGAGTCGCTGTTCAAAGAAAAACTGATAAAAGAAATTTTGCCTGACTTTGTGCTTTCGACCGAA



GCGGAATCCCTGCCATTTTCTGTCGAAGAAGCGACCCGCAGCCTGAAAGAATTTGACTCATTCACA



AGTTACTTTGCAGGCTTCTACGAAAACCGTAAAAACATCTACAGCACGAAGCCACAGAGCACGGCT



ATTGCTTATCGCCTGATTCATGAGAACCTGCCGAAGTTCATCGATAACATCCTTGTTTTTCAAAAAA



TTAAAGAGCCGATTGCGAAAGAGTTAGAACATATTCGAGCTGACTTTTCTGCGGGTGGGTACATTA



AAAAAGATGAGCGGCTGGAAGACATCTTCAGTCTAAACTATTATATCCACGTTCTGTCGCAGGCAG



GCATTGAGAAATATAATGCGCTGATTGGTAAGATTGTCACAGAAGGCGATGGTGAGATGAAAGGTC



TTAATGAACATATCAATCTGTATAACCAGCAGCGTGGTCGCGAAGACCGTCTTCCACTGTTCCGCCC



ACTGTATAAACAGATCCTGTCTGACCGGGAACAGCTGTCCTACCTGCCGGAAAGCTTTGAAAAGGA



TGAAGAGCTACTTCGCGCATTAAAGGAGTTTTACGACCATATTGCGGAAGACATTTTGGGTAGAAC



GCAGCAACTGATGACGTCAATTTCTGAATACGATCTGAGTAGAATCTACGTTAGGAATGATAGCCA



GCTGACCGATATTAGCAAAAAAATGCTGGGCGACTGGAACGCTATCTATATGGCACGTGAACGTGC



ATATGATCATGAACAAGCACCGAAACGTATAACCGCGAAATATGAGCGTGATCGCATTAAGGCGCT



AAAGGGAGAAGAAAGCATCTCACTCGCAAACCTGAACTCCTGTATCGCTTTCTTAGATAACGTGCG



CGATTGTCGCGTCGACACGTATCTGTCAACCCTTGGGCAGAAAGAGGGTCCACATGGTCTGTCTAAC



CTGGTGGAAAATGTCTTTGCGAGTTACCATGAAGCGGAACAACTGCTGTCTTTTCCATACCCCGAAG



AAAACAATCTAATACAGGATAAAGATAACGTGGTGTTAATCAAAAACCTGCTGGACAACATCAGCG



ATCTGCAACGTTTCCTGAAACCTTTGTGGGGTATGGGTGACGAGCCAGACAAAGACGAACGTTTTT



ATGGTGAGTATAATTATATACGTGGCGCCCTTGACCAAGTTATTCCGCTGTATAACAAAGTACGGAA



CTATCTGACCCGTAAGCCATATTCTACCCGTAAAGTGAAACTGAACTTTGGCAACTCGCAACTGCTG



TCGGGTTGGGATCGTAACAAAGAAAAAGATAATAGTTGTGTTATCCTGCGTAAGGGACAAAATTTT



TACCTCGCGATTATGAACAACAGACACAAGCGTTCATTTGAAAATAAGGTTCTGCCGGAGTATAAA



GAGGGCGAACCGTACTTCGAGAAAATGGATTATAAGTTCTTACCAGACCCTAATAAGATGTTACCG



AAAGTCTTTCTTTCGAAAAAAGGCATAGAAATCTATAAGCCGTCCCCGAAATTACTCGAACAGTAT



GGGCACGGGACCCACAAGAAAGGGGATACTTTTAGCATGGACGATCTGCACGAACTGATCGATTTT



TTTAAACACTCCATCGAAGCCCATGAAGACTGGAAACAGTTTGGGTTCAAGTTCTCTGATACAGCCA



CATACGAGAATGTGTCTAGTTTTTATCGGGAAGTGGAGGATCAGGGCTACAAACTTAGTTTTCGTAA



AGTTTCAGAGAGTTATGTTTATAGTTTAATTGATCAGGGAAAACTTTACCTGTTCCAGATCTACAAC



AAAGATTTCTCGCCATGTAGTAAGGGTACCCCGAATCTGCATACACTCTATTGGAGAATGTTATTCG



ATGAGCGTAACTTAGCGGATGTCATTTATAAATTGGACGGGAAAGCAGAGATCTTTTTTCGTGAAA



AATCACTGAAGAATGACCACCCGACTCATCCGGCCGGGAAACCGATCAAAAAAAAATCCCGCCAG



AAAAAAGGAGAAGAGTCTCTGTTTGAATATGATCTGGTGAAAGACCGTCATTACACTATGGATAAA



TTTCAATTTCATGTTCCAATTACAATGAACTTCAAATGTTCGGCGGGTTCCAAAGTAAATGATATGG



TAAACGCCCATATTCGCGAAGCGAAAGATATGCATGTTATTGGCATCGATAGAGGCGAAAGAAACC



TGCTTTATATTTGCGTAATTGACAGCCGTGGTACCATTCTGGACCAGATCTCTTTAAACACCATCAA



TGACATCGATTATCACGACCTGTTGGAGTCTCGGGACAAGGACCGCCAGCAGGAGCGCCGTAATTG



GCAGACAATTGAAGGCATAAAAGAATTAAAACAGGGTTACCTTTCCCAGGCCGTACACCGCATAGC



GGAACTGATGGTGGCCTACAAAGCCGTAGTTGCCCTGGAAGACTTGAATATGGGGTTTAAACGTGG



CCGTCAAAAAGTCGAGAGCAGCGTGTATCAGCAATTTGAAAAACAGTTGATTGACAAGTTGAATTA



TTTGGTTGATAAAAAGAAACGTCCAGAAGATATTGGTGGCTTACTGCGTGCATACCAGTTTACGGC



ACCTTTTAAGTCCTTCAAAGAAATGGGTAAACAGAACGGGTTTCTGTTTTACATCCCGGCCTGGAAT



ACATCCAACATCGATCCTACCACCGGGTTTGTCAACCTGTTTCATGCACAATATGAAAACGTGGATA



AAGCGAAGAGTTTTTTCCAAAAATTCGATAGTATTTCGTATAACCCAAAAAAAGATTGGTTTGAGTT



TGCGTTCGATTATAAAAATTTTACTAAAAAGGCTGAGGGATCCCGCAGTATGTGGATCCTCTGCACC



CATGGCAGTCGTATTAAAAATTTTCGTAATTCGCAAAAGAATGGCCAGTGGGACTCGGAAGAGTTT



GCCCTGACCGAAGCGTTCAAATCGCTGTTTGTACGCTACGAAATTGACTACACAGCAGATCTGAAA



ACAGCCATCGTCGATGAAAAACAGAAAGATTTTTTTGTAGATCTCCTAAAACTGTTCAAACTGACTG



TTCAGATGCGCAATTCCTGGAAAGAGAAAGACCTGGATTATCTGATTAGCCCGGTAGCCGGTGCTG



ATGGACGATTTTTCGATACTCGTGAAGGTAACAAAAGTCTCCCGAAAGATGCTGATGCCAATGGTG



CATACAATATTGCATTAAAGGGGCTATGGGCCTTGCGACAGATCCGCCAGACCAGCGAAGGCGGCA



AGCTGAAATTGGCCATATCGAATAAGGAATGGTTACAATTTGTTCAGGAACGTAGCTATGAAAAAG



ATTGA





SEQ ID NO: 47
ATGAACAACGGCACAAATAATTTTCAGAACTTCATCGGGATCTCAAGTTTGCAGAAAACGCTGCGC



AATGCTCTGATCCCCACGGAAACCACGCAACAGTTCATCGTCAAGAACGGAATAATTAAAGAAGAT



GAGTTACGTGGCGAGAACCGCCAGATTCTGAAAGATATCATGGATGACTACTACCGCGGATTCATC



TCTGAGACTCTGAGTTCTATTGATGACATAGATTGGACTAGCCTGTTCGAAAAAATGGAAATTCAGC



TGAAAAATGGTGATAATAAAGATACCTTAATTAAGGAACAGACAGAGTATCGGAAAGCAATCCAT



AAAAAATTTGCGAACGACGATCGGTTTAAGAACATGTTTAGCGCCAAACTGATTAGTGACATATTA



CCTGAATTTGTCATCCACAACAATAATTATTCGGCATCAGAGAAAGAGGAAAAAACCCAGGTGATA



AAATTGTTTTCGCGCTTTGCGACTAGCTTTAAAGATTACTTCAAGAACCGTGCAAATTGCTTTTCAG



CGGACGATATTTCATCAAGCAGCTGCCATCGCATCGTCAACGACAATGCAGAGATATTCTTTTCAAA



TGCGCTGGTCTACCGCCGGATCGTAAAATCGCTGAGCAATGACGATATCAACAAAATTTCGGGCGA



TATGAAAGATTCATTAAAAGAAATGAGTCTGGAAGAAATATATTCTTACGAGAAGTATGGGGAATT



TATTACCCAGGAAGGCATTAGCTTCTATAATGATATCTGTGGGAAAGTGAATTCTTTTATGAACCTG



TATTGTCAGAAAAATAAAGAAAACAAAAATTTATACAAACTTCAGAAACTTCACAAACAGATTCTA



TGCATTGCGGACACTAGCTATGAGGTCCCGTATAAATTTGAAAGTGACGAGGAAGTGTACCAATCA



GTTAACGGCTTCCTTGATAACATTAGCAGCAAACATATAGTCGAAAGATTACGCAAAATCGGCGAT



AACTATAACGGCTACAACCTGGATAAAATTTATATCGTGTCCAAATTTTACGAGAGCGTTAGCCAA



AAAACCTACCGCGACTGGGAAACAATTAATACCGCCCTCGAAATTCATTACAATAATATCTTGCCG



GGTAACGGTAAAAGTAAAGCCGACAAAGTAAAAAAAGCGGTTAAGAATGATTTACAGAAATCCAT



CACCGAAATAAATGAACTAGTGTCAAACTATAAGCTGTGCAGTGACGACAACATCAAAGCGGAGA



CTTATATACATGAGATTAGCCATATCTTGAATAACTTTGAAGCACAGGAATTGAAATACAATCCGG



AAATTCACCTAGTTGAATCCGAGCTCAAAGCGAGTGAGCTTAAAAACGTGCTGGACGTGATCATGA



ATGCGTTTCATTGGTGTTCGGTTTTTATGACTGAGGAACTTGTTGATAAAGACAACAATTTTTATGC



GGAACTGGAGGAGATTTACGATGAAATTTATCCAGTAATTAGTCTGTACAACCTGGTTCGTAACTAC



GTTACCCAGAAACCGTACAGCACGAAAAAGATTAAATTGAACTTTGGAATACCGACGTTAGCAGAC



GGTTGGTCAAAGTCCAAAGAGTATTCTAATAACGCTATCATACTGATGCGCGACAATCTGTATTATC



TGGGCATCTTTAATGCGAAGAATAAACCGGACAAGAAGATTATCGAGGGTAATACGTCAGAAAATA



AGGGTGACTACAAAAAGATGATTTATAATTTGCTCCCGGGTCCCAACAAAATGATCCCGAAAGTTT



TCTTGAGCAGCAAGACGGGGGTGGAAACGTATAAACCGAGCGCCTATATCCTAGAGGGGTATAAAC



AGAATAAACATATCAAGTCTTCAAAAGACTTTGATATCACTTTCTGTCATGATCTGATCGACTACTT



CAAAAACTGTATTGCAATTCATCCCGAGTGGAAAAACTTCGGTTTTGATTTTAGCGACACCAGTACT



TATGAAGACATTTCCGGGTTTTATCGTGAGGTAGAGTTACAAGGTTACAAGATTGATTGGACATACA



TTAGCGAAAAAGACATTGATCTGCTGCAGGAAAAAGGTCAACTGTATCTGTTCCAGATATATAACA



AAGATTTTTCGAAAAAATCAACCGGGAATGACAACCTTCACACCATGTACCTGAAAAATCTTTTCTC



AGAAGAAAATCTTAAGGATATCGTCCTGAAACTTAACGGCGAAGCGGAAATCTTCTTCAGGAAGAG



CAGCATAAAGAACCCAATCATTCATAAAAAAGGCTCGATTTTAGTCAACCGTACCTACGAAGCAGA



AGAAAAAGACCAGTTTGGCAACATTCAAATTGTGCGTAAAAATATTCCGGAAAACATTTATCAGGA



GCTGTACAAATACTTCAACGATAAAAGCGACAAAGAGCTGTCTGATGAAGCAGCCAAACTGAAGA



ATGTAGTGGGACACCACGAGGCAGCGACGAATATAGTCAAGGACTATCGCTACACGTATGATAAAT



ACTTCCTTCATATGCCTATTACGATCAATTTCAAAGCCAATAAAACGGGTTTTATTAATGATAGGAT



CTTACAGTATATCGCTAAAGAAAAAGACTTACATGTGATCGGCATTGATCGGGGCGAGCGTAACCT



GATCTACGTGTCCGTGATTGATACTTGTGGTAATATAGTTGAACAGAAAAGCTTTAACATTGTAAAC



GGCTACGACTATCAGATAAAACTGAAACAACAGGAGGGCGCTAGACAGATTGCGCGGAAAGAATG



GAAAGAAATTGGTAAAATTAAAGAGATCAAAGAGGGCTACCTGAGCTTAGTAATCCACGAGATCTC



TAAAATGGTAATCAAATACAATGCAATTATAGCGATGGAGGATTTGTCTTATGGTTTTAAAAAAGG



GCGCTTTAAGGTCGAACGGCAAGTTTACCAGAAATTTGAAACCATGCTCATCAATAAACTCAACTA



TCTGGTATTTAAAGATATTTCGATTACCGAGAATGGCGGTCTCCTGAAAGGTTATCAGCTGACATAC



ATTCCTGATAAACTTAAAAACGTGGGTCATCAGTGCGGCTGCATTTTTTATGTGCCTGCTGCATACA



CGAGCAAAATTGATCCGACCACCGGCTTTGTGAATATCTTTAAATTTAAAGACCTGACAGTGGACG



CAAAACGTGAATTCATTAAAAAATTTGACTCAATTCGTTATGACAGTGAAAAAAATCTGTTCTGCTT



TACATTTGACTACAATAACTTTATTACGCAAAACACGGTCATGAGCAAATCATCGTGGAGTGTGTAT



ACATACGGCGTGCGCATCAAACGTCGCTTTGTGAACGGCCGCTTCTCAAACGAAAGTGATACCATT



GACATAACCAAAGATATGGAGAAAACGTTGGAAATGACGGACATTAACTGGCGCGATGGCCACGA



TCTTCGTCAAGACATTATAGATTATGAAATTGTTCAGCACATATTCGAAATTTTCCGTTTAACAGTG



CAAATGCGTAACTCCTTGTCTGAACTGGAGGACCGTGATTACGATCGTCTCATTTCACCTGTACTGA



ACGAAAATAACATTTTTTATGACAGCGCGAAAGCGGGGGATGCACTTCCTAAGGATGCCGATGCAA



ATGGTGCGTATTGTATTGCATTAAAAGGGTTATATGAAATTAAACAAATTACCGAAAATTGGAAAG



AAGATGGTAAATTTTCGCGCGATAAACTCAAAATCAGCAATAAAGATTGGTTCGACTTTATCCAGA



ATAAGCGCTATCTCTAA





SEQ ID NO: 48
ATGACCAATAAATTCACTAACCAGTATTCTCTCTCTAAGACCCTGCGCTTTGAACTGATTCCGCAGG



GGAAAACCTTGGAGTTCATTCAAGAAAAAGGCCTCTTGTCTCAGGATAAACAGAGGGCTGAATCTT



ACCAAGAAATGAAGAAAACTATTGATAAGTTTCATAAATATTTCATTGATTTAGCCTTGTCTAACGC



CAAATTAACTCACTTGGAAACGTATCTGGAGTTATACAACAAATCTGCCGAAACTAAGAAAGAACA



GAAATTTAAAGACGATTTGAAAAAAGTACAGGACAATCTGCGTAAAGAAATTGTCAAATCCTTCAG



TGACGGCGATGCTAAAAGCATTTTTGCCATTCTGGACAAAAAAGAGTTGATTACTGTGGAATTAGA



AAAGTGGTTTGAAAACAATGAGCAGAAAGACATCTACTTCGATGAGAAATTCAAAACTTTCACCAC



CTATTTTACAGGATTTCATCAAAACCGGAAGAACATGTACTCAGTAGAACCGAACTCCACGGCCAT



TGCGTATCGTTTGATCCATGAGAATCTGCCTAAATTTCTGGAGAATGCGAAAGCCTTTGAAAAGATT



AAGCAGGTCGAATCGCTGCAAGTGAATTTTCGTGAACTCATGGGCGAATTTGGTGACGAAGGTCTA



ATCTTCGTTAACGAACTGGAAGAAATGTTTCAGATTAATTACTACAATGACGTGCTATCGCAGAACG



GTATCACAATCTACAATAGTATTATCTCAGGGTTCACAAAAAACGATATAAAATACAAAGGCCTGA



ACGAGTATATCAATAACTACAACCAAACAAAGGACAAAAAGGATAGGCTTCCGAAACTGAAGCAG



TTATACAAACAGATTTTATCTGACAGAATCTCCCTGAGCTTTCTGCCGGATGCTTTCACTGATGGGA



AGCAGGTTCTGAAAGCGATTTTCGATTTTTATAAGATTAACTTACTGAGCTACACGATTGAAGGTCA



AGAAGAATCTCAAAACTTACTGCTCTTGATCCGTCAAACCATTGAAAATCTATCATCGTTCGATACG



CAGAAAATCTACCTCAAAAACGATACTCACCTGACTACGATCTCTCAGCAGGTTTTCGGGGATTTTA



GTGTATTTTCAACAGCTCTGAACTACTGGTATGAAACCAAAGTCAATCCGAAATTCGAGACGGAAT



ATTCTAAGGCCAACGAAAAAAAACGTGAGATTCTTGATAAAGCTAAAGCCGTATTTACTAAACAGG



ATTACTTTTCTATTGCTTTCCTGCAGGAAGTTTTATCGGAGTATATCCTGACCCTGGATCATACATCT



GATATCGTTAAAAAACACAGCAGCAATTGCATCGCTGACTATTTCAAAAACCACTTTGTCGCCAAA



AAAGAAAACGAAACAGACAAGACTTTCGATTTCATTGCTAACATCACCGCAAAATACCAGTGTATT



CAGGGTATCTTGGAAAACGCCGACCAATACGAAGACGAACTGAAACAAGATCAGAAGCTGATCGA



TAATTTAAAATTCTTCTTAGATGCAATCCTGGAGCTGCTGCACTTCATCAAACCGCTTCATTTAAAG



AGCGAGTCCATTACCGAAAAGGACACCGCCTTCTATGACGTTTTTGAAAATTATTATGAAGCCCTCT



CCTTGCTGACTCCGCTGTATAATATGGTACGCAATTACGTAACCCAGAAACCATATTCTACCGAAAA



AATTAAACTGAACTTTGAAAACGCACAGCTGCTCAACGGTTGGGACGCGAATAAAGAAGGTGACTA



CCTCACCACCATCCTGAAAAAAGATGGTAACTATTTTCTGGCAATTATGGATAAGAAACATAATAA



AGCATTCCAGAAATTTCCTGAAGGGAAAGAAAATTACGAAAAGATGGTGTACAAACTCTTACCTGG



AGTTAACAAAATGTTGCCGAAAGTATTTTTTAGTAATAAGAACATCGCGTACTTTAACCCGTCCAAA



GAACTGCTGGAAAATTATAAAAAGGAGACGCATAAGAAAGGGGATACCTTTAACCTGGAACATTG



CCATACCTTAATAGACTTCTTCAAGGATTCCCTGAATAAACACGAGGATTGGAAATATTTCGATTTT



CAGTTTAGTGAGACCAAGTCATACCAGGATCTTAGCGGCTTTTATCGCGAAGTAGAACACCAAGGC



TATAAAATTAACTTCAAAAACATCGACAGCGAATACATCGACGGTTTAGTTAACGAGGGCAAACTG



TTTCTGTTCCAGATCTATTCAAAGGATTTTAGCCCGTTCTCTAAAGGCAAACCAAATATGCATACGT



TGTACTGGAAAGCACTGTTTGAAGAGCAAAACCTGCAGAATGTGATTTATAAACTGAACGGCCAAG



CTGAGATTTTTTTCCGTAAAGCCTCGATTAAACCGAAAAATATCATCCTTCATAAGAAGAAAATAAA



GATCGCTAAAAAACACTTCATAGATAAAAAAACCAAAACCTCCGAAATAGTGCCTGTTCAAACAAT



TAAGAACTTGAATATGTACTACCAGGGCAAGATATCGGAAAAGGAGTTGACTCAAGACGATCTTCG



CTATATCGATAACTTTTCGATTTTTAACGAAAAAAACAAGACGATCGACATCATCAAAGATAAACG



CTTCACTGTAGATAAGTTCCAGTTTCATGTGCCGATTACTATGAACTTCAAAGCTACCGGGGGTAGC



TATATCAACCAAACGGTGTTGGAATACCTGCAGAATAACCCGGAAGTCAAAATCATTGGGCTGGAC



CGCGGAGAACGTCACCTTGTGTACTTGACCTTAATCGATCAGCAAGGCAACATCTTAAAACAAGAA



TCGCTGAATACCATTACGGATTCAAAGATTAGCACCCCGTATCATAAGCTGCTCGATAACAAGGAG



AATGAGCGCGACCTGGCCCGTAAAAACTGGGGCACGGTGGAAAACATTAAGGAGTTAAAGGAGGG



TTATATTTCCCAGGTAGTGCATAAGATCGCCACTCTCATGCTCGAGGAAAATGCGATCGTTGTCATG



GAAGACTTAAACTTCGGATTTAAACGTGGGCGATTTAAAGTAGAGAAACAAATCTACCAGAAGTTA



GAAAAAATGCTGATTGACAAATTAAATTACTTGGTCCTAAAAGACAAACAGCCGCAAGAATTGGGT



GGATTATACAACGCCCTCCAACTTACCAATAAATTCGAAAGTTTTCAGAAAATGGGTAAACAGTCA



GGCTTTCTTTTTTATGTTCCTGCGTGGAACACATCCAAAATCGACCCTACAACCGGCTTCGTCAATTA



CTTCTATACTAAATATGAAAACGTCGACAAAGCAAAAGCATTCTTTGAAAAGTTCGAAGCAATACG



TTTTAACGCTGAGAAAAAATATTTCGAGTTCGAAGTCAAGAAATACTCAGACTTTAACCCCAAAGC



TGAGGGCACACAGCAAGCGTGGACAATCTGCACCTACGGCGAGCGCATCGAAACGAAGCGTCAAA



AAGATCAGAATAACAAATTTGTTTCAACACCTATCAACCTGACCGAGAAGATTGAAGACTTCTTAG



GTAAAAATCAGATTGTTTATGGCGACGGTAACTGTATAAAATCTCAAATAGCCTCAAAGGATGATA



AAGCATTTTTCGAAACATTATTATATTGGTTCAAAATGACACTGCAGATGCGCAATAGTGAGACGC



GTACAGATATTGATTATCTTATCAGCCCGGTCATGAACGACAACGGTACTTTTTACAACTCCAGAGA



CTATGAAAAACTTGAGAATCCAACTCTCCCCAAAGATGCTGATGCGAACGGTGCTTATCACATCGC



GAAAAAAGGTCTGATGCTGCTGAACAAAATCGACCAAGCCGATCTGACTAAGAAAGTTGACCTAAG



CATTTCAAATCGGGACTGGTTACAGTTTGTTCAAAAGAACAAATGA





SEQ ID NO: 49
ATGGAACAGGAATATTATCTGGGCTTGGACATGGGCACCGGTTCCGTCGGCTGGGCTGTTACTGAC



AGTGAATATCACGTTCTAAGAAAGCATGGTAAGGCATTGTGGGGTGTAAGACTTTTCGAATCTGCTT



CCACTGCTGAAGAGCGTAGAATGTTTAGAACGAGTCGACGTAGGCTAGACAGGCGCAATTGGAGA



ATCGAAATTTTACAAGAAATTTTTGCGGAAGAGATATCTAAGAAAGACCCAGGCTTTTTCCTGAGA



ATGAAGGAATCTAAGTATTACCCTGAGGATAAAAGAGATATAAATGGTAACTGTCCCGAATTGCCT



TACGCATTATTTGTGGACGATGATTTTACCGATAAGGATTACCATAAAAAGTTCCCAACTATCTACC



ATTTACGCAAAATGTTAATGAATACAGAGGAAACCCCAGACATAAGACTAGTTTATCTGGCAATAC



ACCATATGATGAAACATAGAGGCCATTTCTTACTTTCCGGGGATATCAACGAAATCAAAGAGTTTG



GTACCACATTTAGTAAGTTACTGGAAAACATAAAGAATGAAGAATTGGATTGGAACTTAGAACTCG



GAAAAGAAGAATACGCGGTTGTCGAATCTATCCTGAAGGATAATATGCTGAATAGGTCGACCAAAA



AAACTAGGCTGATCAAAGCACTGAAAGCCAAATCTATCTGCGAAAAAGCTGTTTTAAATTTACTTG



CTGGTGGCACTGTTAAGTTATCAGACATTTTTGGTTTGGAAGAATTGAACGAAACCGAGCGTCCAA



AAATTAGTTTCGCTGATAATGGCTACGATGATTACATTGGTGAGGTGGAAAACGAGTTGGGCGAAC



AATTTTATATTATAGAGACAGCTAAGGCAGTCTATGACTGGGCTGTTTTAGTAGAAATCCTTGGTAA



ATACACATCTATCTCCGAAGCGAAAGTTGCTACTTACGAAAAGCACAAGTCCGATCTCCAGTTTTTG



AAGAAAATTGTCAGGAAATATCTGACTAAGGAAGAATATAAAGATATTTTCGTTAGTACCTCTGAC



AAACTGAAAAATTACTCCGCTTACATCGGGATGACCAAGATTAATGGCAAAAAAGTTGATCTGCAA



AGCAAAAGGTGTTCGAAGGAAGAATTTTATGATTTCATTAAAAAGAATGTCTTAAAAAAATTAGAA



GGTCAGCCAGAATACGAATATTTGAAAGAAGAACTGGAAAGAGAGACATTCTTACCAAAACAAGT



CAACAGAGATAATGGGGTAATTCCATATCAAATTCACCTCTACGAATTAAAAAAAATTTTAGGCAA



TTTACGCGATAAAATTGACCTTATCAAAGAAAATGAGGATAAGCTGGTTCAACTCTTTGAATTCAGA



ATACCCTATTATGTGGGCCCACTGAACAAGATTGATGACGGCAAAGAAGGTAAATTCACATGGGCC



GTCCGCAAATCCAATGAAAAAATTTACCCATGGAACTTTGAAAATGTAGTAGATATTGAAGCGTCT



GCGGAGAAATTTATTCGAAGAATGACTAATAAATGCACTTACTTGATGGGAGAGGATGTTCTGCCT



AAAGACAGCTTATTATACAGCAAGTACATGGTTCTAAACGAACTTAACAACGTTAAGTTGGACGGT



GAGAAATTAAGTGTAGAATTGAAACAAAGATTGTATACTGACGTCTTCTGCAAGTACAGAAAAGTG



ACAGTTAAAAAAATTAAGAATTACTTGAAGTGCGAAGGTATAATTTCTGGAAACGTAGAGATTACT



GGTATTGATGGTGATTTCAAAGCATCCCTAACAGCTTACCACGATTTCAAGGAAATCCTGACAGGA



ACTGAACTCGCAAAAAAAGATAAAGAAAACATTATTACTAATATTGTTCTTTTCGGTGATGACAAG



AAATTGTTGAAGAAAAGACTGAATAGACTTTACCCCCAGATTACTCCCAATCAACTTAAGAAAATT



TGTGCTTTGTCTTACACAGGATGGGGTCGTTTTTCAAAAAAGTTCTTAGAAGAGATTACCGCACCTG



ATCCAGAAACAGGCGAAGTATGGAATATAATTACCGCCTTATGGGAATCGAACAATAATCTTATGC



AACTTCTGAGCAATGAATATCGTTTCATGGAAGAAGTTGAGACTTACAACATGGGCAAACAGACGA



AGACTTTATCCTATGAAACTGTGGAAAATATGTATGTATCACCTTCTGTCAAGAGACAAATTTGGCA



AACCTTAAAAATTGTCAAAGAATTAGAAAAGGTAATGAAGGAGTCTCCTAAACGTGTGTTTATTGA



AATGGCTAGAGAAAAACAAGAGTCAAAAAGAACCGAGTCAAGAAAGAAGCAGTTAATCGATTTAT



ATAAGGCTTGTAAAAACGAAGAGAAAGATTGGGTTAAAGAATTGGGGGACCAAGAGGAACAAAAA



CTACGGTCGGATAAGTTGTATTTATACTATACGCAAAAGGGACGATGTATGTATTCCGGCGAGGTA



ATAGAATTGAAGGATTTATGGGACAATACAAAATATGACATAGACCATATATATCCCCAATCAAAA



ACGATGGACGATAGCTTGAACAATAGAGTACTCGTGAAAAAAAAATATAATGCGACCAAATCTGAT



AAGTATCCTCTGAATGAAAATATCAGACATGAAAGAAAGGGGTTCTGGAAGTCCTTGTTAGATGGT



GGGTTTATAAGCAAAGAAAAGTACGAGCGTCTAATAAGAAACACGGAGTTATCGCCAGAAGAACT



CGCTGGTTTTATTGAGAGGCAAATCGTGGAAACGAGACAATCTACCAAAGCCGTTGCTGAGATCCT



AAAGCAAGTTTTCCCAGAGTCGGAGATTGTCTATGTCAAAGCTGGCACAGTGAGCAGGTTTAGGAA



AGACTTCGAACTATTAAAGGTAAGAGAAGTGAACGATTTACATCACGCAAAGGACGCTTACCTAAA



TATCGTTGTAGGTAACTCATATTATGTTAAATTTACCAAGAACGCCTCTTGGTTTATAAAGGAGAAC



CCAGGTAGAACATATAACCTGAAAAAGATGTTCACCTCTGGTTGGAATATTGAGAGAAACGGAGAA



GTCGCATGGGAAGTTGGTAAGAAAGGGACTATAGTGACAGTAAAGCAAATTATGAACAAAAATAA



TATCCTCGTTACAAGGCAGGTTCATGAAGCAAAGGGCGGCCTTTTTGACCAACAAATTATGAAGAA



AGGGAAAGGTCAAATTGCAATAAAAGAAACCGATGAGAGACTAGCGTCAATAGAAAAGTATGGTG



GCTATAATAAAGCTGCGGGTGCATACTTTATGCTTGTTGAATCAAAAGACAAGAAAGGTAAGACTA



TTAGAACTATAGAATTTATACCCCTGTACCTTAAAAACAAAATTGAATCGGATGAGTCAATCGCGTT



AAATTTTCTAGAGAAAGGAAGGGGTTTAAAAGAACCAAAGATCCTGTTAAAAAAGATTAAGATTGA



CACCTTGTTCGATGTAGATGGATTTAAAATGTGGTTATCTGGCAGAACAGGCGATAGACTTTTGTTT



AAGTGCGCTAATCAATTAATTTTGGATGAGAAAATCATTGTCACAATGAAAAAAATAGTTAAGTTT



ATTCAGAGAAGACAAGAAAACAGGGAGTTGAAATTATCTGATAAAGATGGTATCGACAATGAAGT



TTTAATGGAAATCTACAATACATTCGTTGATAAACTTGAAAATACCGTATATCGAATCAGGTTAAGT



GAACAAGCCAAAACATTAATTGATAAACAAAAAGAATTTGAAAGGCTATCACTGGAAGACAAATC



CTCCACCCTATTTGAAATTTTGCATATATTCCAGTGCCAATCTTCAGCAGCTAATTTAAAAATGATTG



GCGGACCTGGGAAAGCCGGCATCCTAGTGATGAACAATAATATCTCCAAGTGTAACAAAATATCAA



TTATTAACCAATCTCCGACAGGTATTTTTGAAAATGAAATAGACTTGCTTAAGATATAA





SEQ ID NO: 50
ATGTCTTTCGACTCTTTCACCAACCTGTACTCTCTGTCTAAAACCCTGAAATTCGAAATGCGTCCGGT



TGGTAACACCCAGAAAATGCTGGACAACGCGGGTGTTTTCGAAAAAGACAAACTGATCCAGAAAA



AATACGGTAAAACCAAACCGTACTTCGACCGTCTGCACCGTGAATTCATCGAAGAAGCGCTGACCG



GTGTTGAACTGATCGGTCTGGACGAAAACTTCCGTACCCTGGTTGACTGGCAGAAAGACAAAAAAA



ACAACGTTGCGATGAAAGCGTACGAAAACTCTCTGCAGCGTCTGCGTACCGAAATCGGTAAAATCT



TCAACCTGAAAGCGGAAGACTGGGTTAAAAACAAATACCCGATCCTGGGTCTGAAAAACAAAAAC



ACCGACATCCTGTTCGAAGAAGCGGTTTTCGGTATCCTGAAAGCGCGTTACGGTGAAGAAAAAGAC



ACCTTCATCGAAGTTGAAGAAATCGACAAAACCGGTAAATCTAAAATCAACCAGATCTCTATCTTC



GACTCTTGGAAAGGTTTCACCGGTTACTTCAAAAAATTCTTCGAAACCCGTAAAAACTTCTACAAAA



ACGACGGTACCTCTACCGCGATCGCGACCCGTATCATCGACCAGAACCTGAAACGTTTCATCGACA



ACCTGTCTATCGTTGAATCTGTTCGTCAGAAAGTTGACCTGGCGGAAACCGAAAAATCTTTCTCTAT



CTCTCTGTCTCAGTTCTTCTCTATCGACTTCTACAACAAATGCCTGCTGCAGGACGGTATCGACTACT



ACAACAAAATCATCGGTGGTGAAACCCTGAAAAACGGTGAAAAACTGATCGGTCTGAACGAACTG



ATCAACCAGTACCGTCAGAACAACAAAGACCAGAAAATCCCGTTCTTCAAACTGCTGGACAAACAG



ATCCTGTCTGAAAAAATCCTGTTCCTGGACGAAATCAAAAACGACACCGAACTGATCGAAGCGCTG



TCTCAGTTCGCGAAAACCGCGGAAGAAAAAACCAAAATCGTTAAAAAACTGTTCGCGGACTTCGTT



GAAAACAACTCTAAATACGACCTGGCGCAGATCTACATCTCTCAGGAAGCGTTCAACACCATCTCT



AACAAATGGACCTCTGAAACCGAAACCTTCGCGAAATACCTGTTCGAAGCGATGAAATCTGGTAAA



CTGGCGAAATACGAAAAAAAAGACAACTCTTACAAATTCCCGGACTTCATCGCGCTGTCTCAGATG



AAATCTGCGCTGCTGTCTATCTCTCTGGAAGGTCACTTCTGGAAAGAAAAATACTACAAAATCTCTA



AATTCCAGGAAAAAACCAACTGGGAACAGTTCCTGGCGATCTTCCTGTACGAATTCAACTCTCTGTT



CTCTGACAAAATCAACACCAAAGACGGTGAAACCAAACAGGTTGGTTACTACCTGTTCGCGAAAGA



CCTGCACAACCTGATCCTGTCTGAACAGATCGACATCCCGAAAGACTCTAAAGTTACCATCAAAGA



CTTCGCGGACTCTGTTCTGACCATCTACCAGATGGCGAAATACTTCGCGGTTGAAAAAAAACGTGC



GTGGCTGGCGGAATACGAACTGGACTCTTTCTACACCCAGCCGGACACCGGTTACCTGCAGTTCTAC



GACAACGCGTACGAAGACATCGTTCAGGTTTACAACAAACTGCGTAACTACCTGACCAAAAAACCG



TACTCTGAAGAAAAATGGAAACTGAACTTCGAAAACTCTACCCTGGCGAACGGTTGGGACAAAAAC



AAAGAATCTGACAACTCTGCGGTTATCCTGCAGAAAGGTGGTAAATACTACCTGGGTCTGATCACC



AAAGGTCACAACAAAATCTTCGACGACCGTTTCCAGGAAAAATTCATCGTTGGTATCGAAGGTGGT



AAATACGAAAAAATCGTTTACAAATTCTTCCCGGACCAGGCGAAAATGTTCCCGAAAGTTTGCTTCT



CTGCGAAAGGTCTGGAATTCTTCCGTCCGTCTGAAGAAATCCTGCGTATCTACAACAACGCGGAATT



CAAAAAAGGTGAAACCTACTCTATCGACTCTATGCAGAAACTGATCGACTTCTACAAAGACTGCCT



GACCAAATACGAAGGTTGGGCGTGCTACACCTTCCGTCACCTGAAACCGACCGAAGAATACCAGAA



CAACATCGGTGAATTCTTCCGTGACGTTGCGGAAGACGGTTACCGTATCGACTTCCAGGGTATCTCT



GACCAGTACATCCACGAAAAAAACGAAAAAGGTGAACTGCACCTGTTCGAAATCCACAACAAAGA



CTGGAACCTGGACAAAGCGCGTGACGGTAAATCTAAAACCACCCAGAAAAACCTGCACACCCTGTA



CTTCGAATCTCTGTTCTCTAACGACAACGTTGTTCAGAACTTCCCGATCAAACTGAACGGTCAGGCG



GAAATCTTCTACCGTCCGAAAACCGAAAAAGACAAACTGGAATCTAAAAAAGACAAAAAAGGTAA



CAAAGTTATCGACCACAAACGTTACTCTGAAAACAAAATCTTCTTCCACGTTCCGCTGACCCTGAAC



CGTACCAAAAACGACTCTTACCGTTTCAACGCGCAGATCAACAACTTCCTGGCGAACAACAAAGAC



ATCAACATCATCGGTGTTGACCGTGGTGAAAAACACCTGGTTTACTACTCTGTTATCACCCAGGCGT



CTGACATCCTGGAATCTGGTTCTCTGAACGAACTGAACGGTGTTAACTACGCGGAAAAACTGGGTA



AAAAAGCGGAAAACCGTGAACAGGCGCGTCGTGACTGGCAGGACGTTCAGGGTATCAAAGACCTG



AAAAAAGGTTACATCTCTCAGGTTGTTCGTAAACTGGCGGACCTGGCGATCAAACACAACGCGATC



ATCATCCTGGAAGACCTGAACATGCGTTTCAAACAGGTTCGTGGTGGTATCGAAAAATCTATCTACC



AGCAGCTGGAAAAAGCGCTGATCGACAAACTGTCTTTCCTGGTTGACAAAGGTGAAAAAAACCCGG



AACAGGCGGGTCACCTGCTGAAAGCGTACCAGCTGTCTGCGCCGTTCGAAACCTTCCAGAAAATGG



GTAAACAGACCGGTATCATCTTCTACACCCAGGCGTCTTACACCTCTAAATCTGACCCGGTTACCGG



TTGGCGTCCGCACCTGTACCTGAAATACTTCTCTGCGAAAAAAGCGAAAGACGACATCGCGAAATT



CACCAAAATCGAATTCGTTAACGACCGTTTCGAACTGACCTACGACATCAAAGACTTCCAGCAGGC



GAAAGAATACCCGAACAAAACCGTTTGGAAAGTTTGCTCTAACGTTGAACGTTTCCGTTGGGACAA



AAACCTGAACCAGAACAAAGGTGGTTACACCCACTACACCAACATCACCGAAAACATCCAGGAACT



GTTCACCAAATACGGTATCGACATCACCAAAGACCTGCTGACCCAGATCTCTACCATCGACGAAAA



ACAGAACACCTCTTTCTTCCGTGACTTCATCTTCTACTTCAACCTGATCTGCCAGATCCGTAACACCG



ACGACTCTGAAATCGCGAAAAAAAACGGTAAAGACGACTTCATCCTGTCTCCGGTTGAACCGTTCT



TCGACTCTCGTAAAGACAACGGTAACAAACTGCCGGAAAACGGTGACGACAACGGTGCGTACAAC



ATCGCGCGTAAAGGTATCGTTATCCTGAACAAAATCTCTCAGTACTCTGAAAAAAACGAAAACTGC



GAAAAAATGAAATGGGGTGACCTGTACGTTTCTAACATCGACTGGGACAACTTCGTT





SEQ ID NO: 51
ATGGAAAACTTTAAAAACTTATACCCAATAAACAAAACGTTACGTTTTGAACTGCGTCCATATGGTA



AAACACTGGAAAACTTTAAAAAAAGCGGTTTGTTGGAGAAGGATGCATTTAAAGCGAACTCTCGCA



GATCCATGCAGGCCATCATTGATGAAAAATTTAAAGAGACGATCGAAGAACGTCTGAAATACACGG



AATTTAGTGAGTGTGACTTAGGTAATATGACTTCTAAAGATAAGAAAATCACCGATAAGGCGGCGA



CCAACCTGAAGAAGCAAGTCATTTTATCTTTTGATGATGAAATCTTTAACAACTATTTGAAACCGGA



CAAAAACATCGATGCCTTATTTAAAAATGACCCTTCGAACCCGGTGATTAGCACATTTAAGGGCTTC



ACAACGTATTTTGTCAATTTTTTTGAAATTCGTAAACATATCTTCAAAGGAGAATCAAGCGGCTCTA



TGGCTTATCGCATTATTGATGAAAACCTGACGACCTATTTGAATAACATTGAAAAAATCAAAAAAC



TGCCAGAGGAATTAAAGTCTCAGTTAGAAGGCATCGACCAGATCGACAAACTCAACAACTATAACG



AATTTATTACGCAGTCTGGTATCACCCACTATAATGAAATTATTGGAGGTATCAGTAAATCAGAAAA



TGTGAAAATCCAAGGGATTAATGAAGGCATTAACCTCTATTGCCAGAAAAATAAAGTGAAACTGCC



GAGGCTGACTCCACTCTACAAAATGATCCTGTCTGACCGCGTCTCGAATAGCTTTGTCCTGGACACA



ATTGAAAACGATACGGAATTGATTGAGATGATAAGCGATCTGATTAACAAAACCGAAATTTCACAG



GATGTAATCATGAGTGATATACAAAACATCTTTATTAAATATAAACAGCTTGGTAATCTGCCTGGAA



TTAGCTATTCGTCAATAGTGAACGCAATCTGTTCTGATTATGATAACAATTTTGGCGACGGTAAGCG



TAAAAAGAGTTATGAAAACGATAGGAAAAAACACCTGGAAACTAACGTGTATTCTATCAACTATAT



CAGCGAACTGCTTACGGACACCGATGTGAGTTCAAACATTAAGATGCGGTATAAGGAGCTTGAACA



GAACTACCAGGTCTGTAAGGAAAACTTCAACGCAACCAACTGGATGAACATTAAAAATATCAAACA



ATCCGAGAAGACCAACTTAATCAAAGATCTGCTGGATATTTTGAAGAGCATTCAACGTTTTTATGAT



CTGTTCGATATCGTTGATGAAGACAAGAATCCTAGTGCGGAATTTTATACATGGCTGTCTAAAAATG



CGGAGAAATTGGATTTCGAATTCAATTCTGTTTATAATAAATCACGCAACTATTTGACCCGCAAACA



ATACAGCGACAAAAAGATAAAACTAAACTTCGACAGTCCGACATTGGCAAAGGGCTGGGACGCAA



ATAAGGAAATCGATAACTCTACGATAATTATGCGTAAGTTCAATAATGATCGAGGTGATTATGATT



ATTTCTTAGGCATTTGGAACAAAAGCACCCCGGCCAACGAAAAGATAATTCCACTGGAGGATAACG



GTCTGTTCGAAAAAATGCAGTACAAATTATATCCGGATCCAAGCAAGATGCTTCCAAAGCAGTTTCT



GTCTAAAATTTGGAAAGCTAAGCATCCGACCACCCCAGAATTTGACAAGAAATATAAGGAAGGCCG



CCATAAGAAAGGTCCCGATTTTGAAAAAGAATTCTTGCACGAACTGATTGATTGCTTTAAACATGGC



TTAGTCAATCACGATGAAAAGTATCAAGATGTTTTTGGATTCAATTTGAGAAACACAGAAGACTAC



AATTCCTACACTGAGTTTCTCGAAGATGTGGAACGATGTAATTATAATCTGAGCTTTAACAAAATCG



CGGACACCTCGAATCTGATTAACGATGGTAAACTTTATGTTTTCCAGATCTGGAGCAAGGATTTCTC



TATTGACAGCAAAGGCACCAAAAACCTGAACACCATTTACTTTGAAAGTCTCTTCAGCGAAGAAAA



TATGATTGAGAAAATGTTTAAACTTAGCGGTGAAGCTGAAATATTCTATCGCCCGGCAAGCCTGAA



CTATTGCGAAGACATTATCAAAAAGGGTCATCACCACGCTGAACTGAAAGATAAATTTGATTATCC



TATCATAAAAGATAAACGCTATAGCCAGGATAAATTTTTTTTTCATGTTCCTATGGTCATTAACTAC



AAATCAGAAAAACTGAACTCTAAAAGCCTCAATAATCGAACCAATGAAAACCTTGGGCAGTTTACC



CATATAATTGGAATTGATCGCGGAGAGCGTCATTTAATCTACCTGACCGTAGTCGATGTATCGACCG



GCGAGATCGTCGAGCAGAAGCACTTAGACGAGATTATCAACACTGATACCAAAGGTGTTGAGCATA



AGACGCACTATCTAAACAAGCTGGAGGAAAAATCGAAAACCCGTGATAATGAACGTAAGAGTTGG



GAGGCAATTGAAACGATTAAAGAACTGAAGGAGGGTTATATCAGCCACGTAATCAATGAAATTCAA



AAACTGCAGGAAAAATACAACGCCCTGATCGTTATGGAAAATCTGAATTACGGTTTCAAAAATTCT



CGCATCAAAGTGGAAAAACAGGTATATCAGAAGTTCGAGACGGCATTAATTAAAAAGTTTAATTAC



ATCATTGACAAAAAAGATCCGGAAACTTATATTCATGGCTATCAGCTGACGAACCCGATCACCACA



CTGGATAAAATTGGTAACCAGTCTGGTATCGTGCTTTACATCCCTGCCTGGAATACCAGTAAAATCG



ATCCGGTAACGGGATTCGTCAACCTTCTATATGCAGATGACCTCAAATATAAGAATCAGGAACAGG



CCAAGTCTTTTATTCAGAAAATCGATAACATTTACTTTGAGAATGGGGAATTCAAATTTGATATTGA



TTTTTCTAAATGGAACAATCGTTATAGTATATCTAAGACGAAATGGACGCTCACCTCGTACGGAACC



CGAATCCAGACATTCCGCAATCCGCAGAAGAACAATAAATGGGACAGCGCCGAGTATGATCTCACT



GAAGAATTCAAATTGATTCTGAACATTGACGGTACCCTGAAAAGCCAGGATGTCGAAACCTATAAA



AAATTTATGTCTCTGTTCAAGCTGATGCTGCAACTTAGGAACTCTGTTACCGGCACTGATATCGATT



ATATGATCTCCCCTGTCACTGATAAAACAGGTACGCATTTCGATTCGCGCGAAAATATCAAAAATCT



GCCCGCAGATGCCGACGCCAATGGGGCGTACAATATTGCACGCAAGGGTATCATGGCGATCGAAAA



CATTATGAATGGTATCAGCGACCCGCTGAAAATCTCAAACGAAGATTATTTGAAATATATCCAAAA



CCAGCAGGAATAA





SEQ ID NO: 52
ATGACCCAGTTCGAAGGTTTCACCAACCTGTACCAGGTTTCTAAAACCCTGCGTTTCGAACTGATCC



CGCAGGGTAAAACCCTGAAACACATCCAGGAACAGGGTTTCATCGAAGAAGACAAAGCGCGTAAC



GACCACTACAAAGAACTGAAACCGATCATCGACCGTATCTACAAAACCTACGCGGACCAGTGCCTG



CAGCTGGTTCAGCTGGACTGGGAAAACCTGTCTGCGGCGATCGACTCTTACCGTAAAGAAAAAACC



GAAGAAACCCGTAACGCGCTGATCGAAGAACAGGCGACCTACCGTAACGCGATCCACGACTACTTC



ATCGGTCGTACCGACAACCTGACCGACGCGATCAACAAACGTCACGCGGAAATCTACAAAGGTCTG



TTCAAAGCGGAACTGTTCAACGGTAAAGTTCTGAAACAGCTGGGTACCGTTACCACCACCGAACAC



GAAAACGCGCTGCTGCGTTCTTTCGACAAATTCACCACCTACTTCTCTGGTTTCTACGAAAACCGTA



AAAACGTTTTCTCTGCGGAAGACATCTCTACCGCGATCCCGCACCGTATCGTTCAGGACAACTTCCC



GAAATTCAAAGAAAACTGCCACATCTTCACCCGTCTGATCACCGCGGTTCCGTCTCTGCGTGAACAC



TTCGAAAACGTTAAAAAAGCGATCGGTATCTTCGTTTCTACCTCTATCGAAGAAGTTTTCTCTTTCCC



GTTCTACAACCAGCTGCTGACCCAGACCCAGATCGACCTGTACAACCAGCTGCTGGGTGGTATCTCT



CGTGAAGCGGGTACCGAAAAAATCAAAGGTCTGAACGAAGTTCTGAACCTGGCGATCCAGAAAAA



CGACGAAACCGCGCACATCATCGCGTCTCTGCCGCACCGTTTCATCCCGCTGTTCAAACAGATCCTG



TCTGACCGTAACACCCTGTCTTTCATCCTGGAAGAATTCAAATCTGACGAAGAAGTTATCCAGTCTT



TCTGCAAATACAAAACCCTGCTGCGTAACGAAAACGTTCTGGAAACCGCGGAAGCGCTGTTCAACG



AACTGAACTCTATCGACCTGACCCACATCTTCATCTCTCACAAAAAACTGGAAACCATCTCTTCTGC



GCTGTGCGACCACTGGGACACCCTGCGTAACGCGCTGTACGAACGTCGTATCTCTGAACTGACCGG



TAAAATCACCAAATCTGCGAAAGAAAAAGTTCAGCGTTCTCTGAAACACGAAGACATCAACCTGCA



GGAAATCATCTCTGCGGCGGGTAAAGAACTGTCTGAAGCGTTCAAACAGAAAACCTCTGAAATCCT



GTCTCACGCGCACGCGGCGCTGGACCAGCCGCTGCCGACCACCCTGAAAAAACAGGAAGAAAAAG



AAATCCTGAAATCTCAGCTGGACTCTCTGCTGGGTCTGTACCACCTGCTGGACTGGTTCGCGGTTGA



CGAATCTAACGAAGTTGACCCGGAATTCTCTGCGCGTCTGACCGGTATCAAACTGGAAATGGAACC



GTCTCTGTCTTTCTACAACAAAGCGCGTAACTACGCGACCAAAAAACCGTACTCTGTTGAAAAATTC



AAACTGAACTTCCAGATGCCGACCCTGGCGTCTGGTTGGGACGTTAACAAAGAAAAAAACAACGGT



GCGATCCTGTTCGTTAAAAACGGTCTGTACTACCTGGGTATCATGCCGAAACAGAAAGGTCGTTAC



AAAGCGCTGTCTTTCGAACCGACCGAAAAAACCTCTGAAGGTTTCGACAAAATGTACTACGACTAC



TTCCCGGACGCGGCGAAAATGATCCCGAAATGCTCTACCCAGCTGAAAGCGGTTACCGCGCACTTC



CAGACCCACACCACCCCGATCCTGCTGTCTAACAACTTCATCGAACCGCTGGAAATCACCAAAGAA



ATCTACGACCTGAACAACCCGGAAAAAGAACCGAAAAAATTCCAGACCGCGTACGCGAAAAAAAC



CGGTGACCAGAAAGGTTACCGTGAAGCGCTGTGCAAATGGATCGACTTCACCCGTGACTTCCTGTCT



AAATACACCAAAACCACCTCTATCGACCTGTCTTCTCTGCGTCCGTCTTCTCAGTACAAAGACCTGG



GTGAATACTACGCGGAACTGAACCCGCTGCTGTACCACATCTCTTTCCAGCGTATCGCGGAAAAAG



AAATCATGGACGCGGTTGAAACCGGTAAACTGTACCTGTTCCAGATCTACAACAAAGACTTCGCGA



AAGGTCACCACGGTAAACCGAACCTGCACACCCTGTACTGGACCGGTCTGTTCTCTCCGGAAAACC



TGGCGAAAACCTCTATCAAACTGAACGGTCAGGCGGAACTGTTCTACCGTCCGAAATCTCGTATGA



AACGTATGGCGCACCGTCTGGGTGAAAAAATGCTGAACAAAAAACTGAAAGACCAGAAAACCCCG



ATCCCGGACACCCTGTACCAGGAACTGTACGACTACGTTAACCACCGTCTGTCTCACGACCTGTCTG



ACGAAGCGCGTGCGCTGCTGCCGAACGTTATCACCAAAGAAGTTTCTCACGAAATCATCAAAGACC



GTCGTTTCACCTCTGACAAATTCTTCTTCCACGTTCCGATCACCCTGAACTACCAGGCGGCGAACTC



TCCGTCTAAATTCAACCAGCGTGTTAACGCGTACCTGAAAGAACACCCGGAAACCCCGATCATCGG



TATCGACCGTGGTGAACGTAACCTGATCTACATCACCGTTATCGACTCTACCGGTAAAATCCTGGAA



CAGCGTTCTCTGAACACCATCCAGCAGTTCGACTACCAGAAAAAACTGGACAACCGTGAAAAAGAA



CGTGTTGCGGCGCGTCAGGCGTGGTCTGTTGTTGGTACCATCAAAGACCTGAAACAGGGTTACCTGT



CTCAGGTTATCCACGAAATCGTTGACCTGATGATCCACTACCAGGCGGTTGTTGTTCTGGAAAACCT



GAACTTCGGTTTCAAATCTAAACGTACCGGTATCGCGGAAAAAGCGGTTTACCAGCAGTTCGAAAA



AATGCTGATCGACAAACTGAACTGCCTGGTTCTGAAAGACTACCCGGCGGAAAAAGTTGGTGGTGT



TCTGAACCCGTACCAGCTGACCGACCAGTTCACCTCTTTCGCGAAAATGGGTACCCAGTCTGGTTTC



CTGTTCTACGTTCCGGCGCCGTACACCTCTAAAATCGACCCGCTGACCGGTTTCGTTGACCCGTTCG



TTTGGAAAACCATCAAAAACCACGAATCTCGTAAACACTTCCTGGAAGGTTTCGACTTCCTGCACTA



CGACGTTAAAACCGGTGACTTCATCCTGCACTTCAAAATGAACCGTAACCTGTCTTTCCAGCGTGGT



CTGCCGGGTTTCATGCCGGCGTGGGACATCGTTTTCGAAAAAAACGAAACCCAGTTCGACGCGAAA



GGTACCCCGTTCATCGCGGGTAAACGTATCGTTCCGGTTATCGAAAACCACCGTTTCACCGGTCGTT



ACCGTGACCTGTACCCGGCGAACGAACTGATCGCGCTGCTGGAAGAAAAAGGTATCGTTTTCCGTG



ACGGTTCTAACATCCTGCCGAAACTGCTGGAAAACGACGACTCTCACGCGATCGACACCATGGTTG



CGCTGATCCGTTCTGTTCTGCAGATGCGTAACTCTAACGCGGCGACCGGTGAAGACTACATCAACTC



TCCGGTTCGTGACCTGAACGGTGTTTGCTTCGACTCTCGTTTCCAGAACCCGGAATGGCCGATGGAC



GCGGACGCGAACGGTGCGTACCACATCGCGCTGAAAGGTCAGCTGCTGCTGAACCACCTGAAAGAA



TCTAAAGACCTGAAACTGCAGAACGGTATCTCTAACCAGGACTGGCTGGCGTACATCCAGGAACTG



CGTAACTA





SEQ ID NO: 53

ATGGCGGTTAAATCTATCAAAGTTAAACTGCGTCTGGACGACATGCCGGAAATCCGTGCGGGTCTG



TGGAAACTGCACAAAGAAGTTAACGCGGGTGTTCGTTACTACACCGAATGGCTGTCTCTGCTGCGTC



AGGAAAACCTGTACCGTCGTTCTCCGAACGGTGACGGTGAACAGGAATGCGACAAAACCGCGGAA



GAATGCAAAGCGGAACTGCTGGAACGTCTGCGTGCGCGTCAGGTTGAAAACGGTCACCGTGGTCCG



GCGGGTTCTGACGACGAACTGCTGCAGCTGGCGCGTCAGCTGTACGAACTGCTGGTTCCGCAGGCG



ATCGGTGCGAAAGGTGACGCGCAGCAGATCGCGCGTAAATTCCTGTCTCCGCTGGCGGACAAAGAC



GCGGTTGGTGGTCTGGGTATCGCGAAAGCGGGTAACAAACCGCGTTGGGTTCGTATGCGTGAAGCG



GGTGAACCGGGTTGGGAAGAAGAAAAAGAAAAAGCGGAAACCCGTAAATCTGCGGACCGTACCGC



GGACGTTCTGCGTGCGCTGGCGGACTTCGGTCTGAAACCGCTGATGCGTGTTTACACCGACTCTGAA



ATGTCTTCTGTTGAATGGAAACCGCTGCGTAAAGGTCAGGCGGTTCGTACCTGGGACCGTGACATGT



TCCAGCAGGCGATCGAACGTATGATGTCTTGGGAATCTTGGAACCAGCGTGTTGGTCAGGAATACG



CGAAACTGGTTGAACAGAAAAACCGTTTCGAACAGAAAAACTTCGTTGGTCAGGAACACCTGGTTC



ACCTGGTTAACCAGCTGCAGCAGGACATGAAAGAAGCGTCTCCGGGTCTGGAATCTAAAGAACAGA



CCGCGCACTACGTTACCGGTCGTGCGCTGCGTGGTTCTGACAAAGTTTTCGAAAAATGGGGTAAACT



GGCGCCGGACGCGCCGTTCGACCTGTACGACGCGGAAATCAAAAACGTTCAGCGTCGTAACACCCG



TCGTTTCGGTTCTCACGACCTGTTCGCGAAACTGGCGGAACCGGAATACCAGGCGCTGTGGCGTGA



AGACGCGTCTTTCCTGACCCGTTACGCGGTTTACAACTCTATCCTGCGTAAACTGAACCACGCGAAA



ATGTTCGCGACCTTCACCCTGCCGGACGCGACCGCGCACCCGATCTGGACCCGTTTCGACAAACTGG



GTGGTAACCTGCACCAGTACACCTTCCTGTTCAACGAATTCGGTGAACGTCGTCACGCGATCCGTTT



CCACAAACTGCTGAAAGTTGAAAACGGTGTTGCGCGTGAAGTTGACGACGTTACCGTTCCGATCTCT



ATGTCTGAACAGCTGGACAACCTGCTGCCGCGTGACCCGAACGAACCGATCGCGCTGTACTTCCGT



GACTACGGTGCGGAACAGCACTTCACCGGTGAATTCGGTGGTGCGAAAATCCAGTGCCGTCGTGAC



CAGCTGGCGCACATGCACCGTCGTCGTGGTGCGCGTGACGTTTACCTGAACGTTTCTGTTCGTGTTC



AGTCTCAGTCTGAAGCGCGTGGTGAACGTCGTCCGCCGTACGCGGCGGTTTTCCGTCTGGTTGGTGA



CAACCACCGTGCGTTCGTTCACTTCGACAAACTGTCTGACTACCTGGCGGAACACCCGGACGACGG



TAAACTGGGTTCTGAAGGTCTGCTGTCTGGTCTGCGTGTTATGTCTGTTGACCTGGGTCTGCGTACCT



CTGCGTCTATCTCTGTTTTCCGTGTTGCGCGTAAAGACGAACTGAAACCGAACTCTAAAGGTCGTGT



TCCGTTCTTCTTCCCGATCAAAGGTAACGACAACCTGGTTGCGGTTCACGAACGTTCTCAGCTGCTG



AAACTGCCGGGTGAAACCGAATCTAAAGACCTGCGTGCGATCCGTGAAGAACGTCAGCGTACCCTG



CGTCAGCTGCGTACCCAGCTGGCGTACCTGCGTCTGCTGGTTCGTTGCGGTTCTGAAGACGTTGGTC



GTCGTGAACGTTCTTGGGCGAAACTGATCGAACAGCCGGTTGACGCGGCGAACCACATGACCCCGG



ACTGGCGTGAAGCGTTCGAAAACGAACTGCAGAAACTGAAATCTCTGCACGGTATCTGCTCTGACA



AAGAATGGATGGACGCGGTTTACGAATCTGTTCGTCGTGTTTGGCGTCACATGGGTAAACAGGTTC



GTGACTGGCGTAAAGACGTTCGTTCTGGTGAACGTCCGAAAATCCGTGGTTACGCGAAAGACGTTG



TTGGTGGTAACTCTATCGAACAGATCGAATACCTGGAACGTCAGTACAAATTCCTGAAATCTTGGTC



TTTCTTCGGTAAAGTTTCTGGTCAGGTTATCCGTGCGGAAAAAGGTTCTCGTTTCGCGATCACCCTG



CGTGAACACATCGACCACGCGAAAGAAGACCGTCTGAAAAAACTGGCGGACCGTATCATCATGGA



AGCGCTGGGTTACGTTTACGCGCTGGACGAACGTGGTAAAGGTAAATGGGTTGCGAAATACCCGCC



GTGCCAGCTGATCCTGCTGGAAGAACTGTCTGAATACCAGTTCAACAACGACCGTCCGCCGTCTGA



AAACAACCAGCTGATGCAGTGGTCTCACCGTGGTGTTTTCCAGGAACTGATCAACCAGGCGCAGGT



TCACGACCTGCTGGTTGGTACCATGTACGCGGCGTTCTCTTCTCGTTTCGACGCGCGTACCGGTGCG



CCGGGTATCCGTTGCCGTCGTGTTCCGGCGCGTTGCACCCAGGAACACAACCCGGAACCGTTCCCGT



GGTGGCTGAACAAATTCGTTGTTGAACACACCCTGGACGCGTGCCCGCTGCGTGCGGACGACCTGA



TCCCGACCGGTGAAGGTGAAATCTTCGTTTCTCCGTTCTCTGCGGAAGAAGGTGACTTCCACCAGAT



CCACGCGGACCTGAACGCGGCGCAGAACCTGCAGCAGCGTCTGTGGTCTGACTTCGACATCTCTCA



GATCCGTCTGCGTTGCGACTGGGGTGAAGTTGACGGTGAACTGGTTCTGATCCCGCGTCTGACCGGT



AAACGTACCGCGGACTCTTACTCTAACAAAGTTTTCTACACCAACACCGGTGTTACCTACTACGAAC



GTGAACGTGGTAAAAAACGTCGTAAAGTTTTCGCGCAGGAAAAACTGTCTGAAGAAGAAGCGGAA



CTGCTGGTTGAAGCGGACGAAGCGCGTGAAAAATCTGTTGTTCTGATGCGTGACCCGTCTGGTATCA



TCAACCGTGGTAACTGGACCCGTCAGAAAGAATTCTGGTCTATGGTTAACCAGCGTATCGAAGGTT



ACCTGGTTAAACAGATCCGTTCTCGTGTTCCGCTGCAGGACTCTGCGTGCGAAAACACCGGTGACAT



CTAA





SEQ ID NO: 54

ATGGCGACCCGTTCTTTCATCCTGAAAATCGAACCGAACGAAGAAGTTAAAAAAGGTCTGTGGAAA



ACCCACGAAGTTCTGAACCACGGTATCGCGTACTACATGAACATCCTGAAACTGATCCGTCAGGAA



GCGATCTACGAACACCACGAACAGGACCCGAAAAACCCGAAAAAAGTTTCTAAAGCGGAAATCCA



GGCGGAACTGTGGGACTTCGTTCTGAAAATGCAGAAATGCAACTCTTTCACCCACGAAGTTGACAA



AGACGTTGTTTTCAACATCCTGCGTGAACTGTACGAAGAACTGGTTCCGTCTTCTGTTGAAAAAAAA



GGTGAAGCGAACCAGCTGTCTAACAAATTCCTGTACCCGCTGGTTGACCCGAACTCTCAGTCTGGTA



AAGGTACCGCGTCTTCTGGTCGTAAACCGCGTTGGTACAACCTGAAAATCGCGGGTGACCCGTCTTG



GGAAGAAGAAAAAAAAAAATGGGAAGAAGACAAAAAAAAAGACCCGCTGGCGAAAATCCTGGGT



AAACTGGCGGAATACGGTCTGATCCCGCTGTTCATCCCGTTCACCGACTCTAACGAACCGATCGTTA



AAGAAATCAAATGGATGGAAAAATCTCGTAACCAGTCTGTTCGTCGTCTGGACAAAGACATGTTCA



TCCAGGCGCTGGAACGTTTCCTGTCTTGGGAATCTTGGAACCTGAAAGTTAAAGAAGAATACGAAA



AAGTTGAAAAAGAACACAAAACCCTGGAAGAACGTATCAAAGAAGACATCCAGGCGTTCAAATCT



CTGGAACAGTACGAAAAAGAACGTCAGGAACAGCTGCTGCGTGACACCCTGAACACCAACGAATA



CCGTCTGTCTAAACGTGGTCTGCGTGGTTGGCGTGAAATCATCCAGAAATGGCTGAAAATGGACGA



AAACGAACCGTCTGAAAAATACCTGGAAGTTTTCAAAGACTACCAGCGTAAACACCCGCGTGAAGC



GGGTGACTACTCTGTTTACGAATTCCTGTCTAAAAAAGAAAACCACTTCATCTGGCGTAACCACCCG



GAATACCCGTACCTGTACGCGACCTTCTGCGAAATCGACAAAAAAAAAAAAGACGCGAAACAGCA



GGCGACCTTCACCCTGGCGGACCCGATCAACCACCCGCTGTGGGTTCGTTTCGAAGAACGTTCTGGT



TCTAACCTGAACAAATACCGTATCCTGACCGAACAGCTGCACACCGAAAAACTGAAAAAAAAACTG



ACCGTTCAGCTGGACCGTCTGATCTACCCGACCGAATCTGGTGGTTGGGAAGAAAAAGGTAAAGTT



GACATCGTTCTGCTGCCGTCTCGTCAGTTCTACAACCAGATCTTCCTGGACATCGAAGAAAAAGGTA



AACACGCGTTCACCTACAAAGACGAATCTATCAAATTCCCGCTGAAAGGTACCCTGGGTGGTGCGC



GTGTTCAGTTCGACCGTGACCACCTGCGTCGTTACCCGCACAAAGTTGAATCTGGTAACGTTGGTCG



TATCTACTTCAACATGACCGTTAACATCGAACCGACCGAATCTCCGGTTTCTAAATCTCTGAAAATC



CACCGTGACGACTTCCCGAAATTCGTTAACTTCAAACCGAAAGAACTGACCGAATGGATCAAAGAC



TCTAAAGGTAAAAAACTGAAATCTGGTATCGAATCTCTGGAAATCGGTCTGCGTGTTATGTCTATCG



ACCTGGGTCAGCGTCAGGCGGCGGCGGCGTCTATCTTCGAAGTTGTTGACCAGAAACCGGACATCG



AAGGTAAACTGTTCTTCCCGATCAAAGGTACCGAACTGTACGCGGTTCACCGTGCGTCTTTCAACAT



CAAACTGCCGGGTGAAACCCTGGTTAAATCTCGTGAAGTTCTGCGTAAAGCGCGTGAAGACAACCT



GAAACTGATGAACCAGAAACTGAACTTCCTGCGTAACGTTCTGCACTTCCAGCAGTTCGAAGACAT



CACCGAACGTGAAAAACGTGTTACCAAATGGATCTCTCGTCAGGAAAACTCTGACGTTCCGCTGGT



TTACCAGGACGAACTGATCCAGATCCGTGAACTGATGTACAAACCGTACAAAGACTGGGTTGCGTT



CCTGAAACAGCTGCACAAACGTCTGGAAGTTGAAATCGGTAAAGAAGTTAAACACTGGCGTAAATC



TCTGTCTGACGGTCGTAAAGGTCTGTACGGTATCTCTCTGAAAAACATCGACGAAATCGACCGTACC



CGTAAATTCCTGCTGCGTTGGTCTCTGCGTCCGACCGAACCGGGTGAAGTTCGTCGTCTGGAACCGG



GTCAGCGTTTCGCGATCGACCAGCTGAACCACCTGAACGCGCTGAAAGAAGACCGTCTGAAAAAAA



TGGCGAACACCATCATCATGCACGCGCTGGGTTACTGCTACGACGTTCGTAAAAAAAAATGGCAGG



CGAAAAACCCGGCGTGCCAGATCATCCTGTTCGAAGACCTGTCTAACTACAACCCGTACGAAGAAC



GTTCTCGTTTCGAAAACTCTAAACTGATGAAATGGTCTCGTCGTGAAATCCCGCGTCAGGTTGCGCT



GCAGGGTGAAATCTACGGTCTGCAGGTTGGTGAAGTTGGTGCGCAGTTCTCTTCTCGTTTCCACGCG



AAAACCGGTTCTCCGGGTATCCGTTGCTCTGTTGTTACCAAAGAAAAACTGCAGGACAACCGTTTCT



TCAAAAACCTGCAGCGTGAAGGTCGTCTGACCCTGGACAAAATCGCGGTTCTGAAAGAAGGTGACC



TGTACCCGGACAAAGGTGGTGAAAAATTCATCTCTCTGTCTAAAGACCGTAAACTGGTTACCACCC



ACGCGGACATCAACGCGGCGCAGAACCTGCAGAAACGTTTCTGGACCCGTACCCACGGTTTCTACA



AAGTTTACTGCAAAGCGTACCAGGTTGACGGTCAGACCGTTTACATCCCGGAATCTAAAGACCAGA



AACAGAAAATCATCGAAGAATTCGGTGAAGGTTACTTCATCCTGAAAGACGGTGTTTACGAATGGG



GTAACGCGGGTAAACTGAAAATCAAAAAAGGTTCTTCTAAACAGTCTTCTTCTGAACTGGTTGACTC



TGACATCCTGAAAGACTCTTTCGACCTGGCGTCTGAACTGAAAGGTGAAAAACTGATGCTGTACCG



TGACCCGTCTGGTAACGTTTTCCCGTCTGACAAATGGATGGCGGCGGGTGTTTTCTTCGGTAAACTG



GAACGTATCCTGATCTCTAAACTGACCAACCAGTACTCTATCTCTACCATCGAAGACGACTCTTCTA



AACAGTCTATGTAA





SEQ ID NO: 55

ATGCCGACCCGTACCATCAACCTGAAACTGGTTCTGGGTAAAAACCCGGAAAACGCGACCCTGCGT



CGTGCGCTGTTCTCTACCCACCGTCTGGTTAACCAGGCGACCAAACGTATCGAAGAATTCCTGCTGC



TGTGCCGTGGTGAAGCGTACCGTACCGTTGACAACGAAGGTAAAGAAGCGGAAATCCCGCGTCACG



CGGTTCAGGAAGAAGCGCTGGCGTTCGCGAAAGCGGCGCAGCGTCACAACGGTTGCATCTCTACCT



ACGAAGACCAGGAAATCCTGGACGTTCTGCGTCAGCTGTACGAACGTCTGGTTCCGTCTGTTAACG



AAAACAACGAAGCGGGTGACGCGCAGGCGGCGAACGCGTGGGTTTCTCCGCTGATGTCTGCGGAAT



CTGAAGGTGGTCTGTCTGTTTACGACAAAGTTCTGGACCCGCCGCCGGTTTGGATGAAACTGAAAG



AAGAAAAAGCGCCGGGTTGGGAAGCGGCGTCTCAGATCTGGATCCAGTCTGACGAAGGTCAGTCTC



TGCTGAACAAACCGGGTTCTCCGCCGCGTTGGATCCGTAAACTGCGTTCTGGTCAGCCGTGGCAGG



ACGACTTCGTTTCTGACCAGAAAAAAAAACAGGACGAACTGACCAAAGGTAACGCGCCGCTGATCA



AACAGCTGAAAGAAATGGGTCTGCTGCCGCTGGTTAACCCGTTCTTCCGTCACCTGCTGGACCCGGA



AGGTAAAGGTGTTTCTCCGTGGGACCGTCTGGCGGTTCGTGCGGCGGTTGCGCACTTCATCTCTTGG



GAATCTTGGAACCACCGTACCCGTGCGGAATACAACTCTCTGAAACTGCGTCGTGACGAATTCGAA



GCGGCGTCTGACGAATTCAAAGACGACTTCACCCTGCTGCGTCAGTACGAAGCGAAACGTCACTCT



ACCCTGAAATCTATCGCGCTGGCGGACGACTCTAACCCGTACCGTATCGGTGTTCGTTCTCTGCGTG



CGTGGAACCGTGTTCGTGAAGAATGGATCGACAAAGGTGCGACCGAAGAACAGCGTGTTACCATCC



TGTCTAAACTGCAGACCCAGCTGCGTGGTAAATTCGGTGACCCGGACCTGTTCAACTGGCTGGCGC



AGGACCGTCACGTTCACCTGTGGTCTCCGCGTGACTCTGTTACCCCGCTGGTTCGTATCAACGCGGT



TGACAAAGTTCTGCGTCGTCGTAAACCGTACGCGCTGATGACCTTCGCGCACCCGCGTTTCCACCCG



CGTTGGATCCTGTACGAAGCGCCGGGTGGTTCTAACCTGCGTCAGTACGCGCTGGACTGCACCGAA



AACGCGCTGCACATCACCCTGCCGCTGCTGGTTGACGACGCGCACGGTACCTGGATCGAAAAAAAA



ATCCGTGTTCCGCTGGCGCCGTCTGGTCAGATCCAGGACCTGACCCTGGAAAAACTGGAAAAAAAA



AAAAACCGTCTGTACTACCGTTCTGGTTTCCAGCAGTTCGCGGGTCTGGCGGGTGGTGCGGAAGTTC



TGTTCCACCGTCCGTACATGGAACACGACGAACGTTCTGAAGAATCTCTGCTGGAACGTCCGGGTG



CGGTTTGGTTCAAACTGACCCTGGACGTTGCGACCCAGGCGCCGCCGAACTGGCTGGACGGTAAAG



GTCGTGTTCGTACCCCGCCGGAAGTTCACCACTTCAAAACCGCGCTGTCTAACAAATCTAAACACAC



CCGTACCCTGCAGCCGGGTCTGCGTGTTCTGTCTGTTGACCTGGGTATGCGTACCTTCGCGTCTTGCT



CTGTTTTCGAACTGATCGAAGGTAAACCGGAAACCGGTCGTGCGTTCCCGGTTGCGGACGAACGTT



CTATGGACTCTCCGAACAAACTGTGGGCGAAACACGAACGTTCTTTCAAACTGACCCTGCCGGGTG



AAACCCCGTCTCGTAAAGAAGAAGAAGAACGTTCTATCGCGCGTGCGGAAATCTACGCGCTGAAAC



GTGACATCCAGCGTCTGAAATCTCTGCTGCGTCTGGGTGAAGAAGACAACGACAACCGTCGTGACG



CGCTGCTGGAACAGTTCTTCAAAGGTTGGGGTGAAGAAGACGTTGTTCCGGGTCAGGCGTTCCCGC



GTTCTCTGTTCCAGGGTCTGGGTGCGGCGCCGTTCCGTTCTACCCCGGAACTGTGGCGTCAGCACTG



CCAGACCTACTACGACAAAGCGGAAGCGTGCCTGGCGAAACACATCTCTGACTGGCGTAAACGTAC



CCGTCCGCGTCCGACCTCTCGTGAAATGTGGTACAAAACCCGTTCTTACCACGGTGGTAAATCTATC



TGGATGCTGGAATACCTGGACGCGGTTCGTAAACTGCTGCTGTCTTGGTCTCTGCGTGGTCGTACCT



ACGGTGCGATCAACCGTCAGGACACCGCGCGTTTCGGTTCTCTGGCGTCTCGTCTGCTGCACCACAT



CAACTCTCTGAAAGAAGACCGTATCAAAACCGGTGCGGACTCTATCGTTCAGGCGGCGCGTGGTTA



CATCCCGCTGCCGCACGGTAAAGGTTGGGAACAGCGTTACGAACCGTGCCAGCTGATCCTGTTCGA



AGACCTGGCGCGTTACCGTTTCCGTGTTGACCGTCCGCGTCGTGAAAACTCTCAGCTGATGCAGTGG



AACCACCGTGCGATCGTTGCGGAAACCACCATGCAGGCGGAACTGTACGGTCAGATCGTTGAAAAC



ACCGCGGCGGGTTTCTCTTCTCGTTTCCACGCGGCGACCGGTGCGCCGGGTGTTCGTTGCCGTTTCCT



GCTGGAACGTGACTTCGACAACGACCTGCCGAAACCGTACCTGCTGCGTGAACTGTCTTGGATGCT



GGGTAACACCAAAGTTGAATCTGAAGAAGAAAAACTGCGTCTGCTGTCTGAAAAAATCCGTCCGGG



TTCTCTGGTTCCGTGGGACGGTGGTGAACAGTTCGCGACCCTGCACCCGAAACGTCAGACCCTGTGC



GTTATCCACGCGGACATGAACGCGGCGCAGAACCTGCAGCGTCGTTTCTTCGGTCGTTGCGGTGAA



GCGTTCCGTCTGGTTTGCCAGCCGCACGGTGACGACGTTCTGCGTCTGGCGTCTACCCCGGGTGCGC



GTCTGCTGGGTGCGCTGCAGCAGCTGGAAAACGGTCAGGGTGCGTTCGAACTGGTTCGTGACATGG



GTTCTACCTCTCAGATGAACCGTTTCGTTATGAAATCTCTGGGTAAAAAAAAAATCAAACCGCTGCA



GGACAACAACGGTGACGACGAACTGGAAGACGTTCTGTCTGTTCTGCCGGAAGAAGACGACACCG



GTCGTATCACCGTTTTCCGTGACTCTTCTGGTATCTTCTTCCCGTGCAACGTTTGGATCCCGGCGAAA



CAGTTCTGGCCGGCGGTTCGTGCGATGATCTGGAAAGTTATGGCGTCTCACTCTCTGGGTTAA





SEQ ID NO: 56

ATGACCAAACTGCGTCACCGTCAGAAAAAACTGACCCACGACTGGGCGGGTTCTAAAAAACGTGAA



GTTCTGGGTTCTAACGGTAAACTGCAGAACCCGCTGCTGATGCCGGTTAAAAAAGGTCAGGTTACC



GAATTCCGTAAAGCGTTCTCTGCGTACGCGCGTGCGACCAAAGGTGAAATGACCGACGGTCGTAAA



AACATGTTCACCCACTCTTTCGAACCGTTCAAAACCAAACCGTCTCTGCACCAGTGCGAACTGGCGG



ACAAAGCGTACCAGTCTCTGCACTCTTACCTGCCGGGTTCTCTGGCGCACTTCCTGCTGTCTGCGCA



CGCGCTGGGTTTCCGTATCTTCTCTAAATCTGGTGAAGCGACCGCGTTCCAGGCGTCTTCTAAAATC



GAAGCGTACGAATCTAAACTGGCGTCTGAACTGGCGTGCGTTGACCTGTCTATCCAGAACCTGACC



ATCTCTACCCTGTTCAACGCGCTGACCACCTCTGTTCGTGGTAAAGGTGAAGAAACCTCTGCGGACC



CGCTGATCGCGCGTTTCTACACCCTGCTGACCGGTAAACCGCTGTCTCGTGACACCCAGGGTCCGGA



ACGTGACCTGGCGGAAGTTATCTCTCGTAAAATCGCGTCTTCTTTCGGTACCTGGAAAGAAATGACC



GCGAACCCGCTGCAGTCTCTGCAGTTCTTCGAAGAAGAACTGCACGCGCTGGACGCGAACGTTTCT



CTGTCTCCGGCGTTCGACGTTCTGATCAAAATGAACGACCTGCAGGGTGACCTGAAAAACCGTACC



ATCGTTTTCGACCCGGACGCGCCGGTTTTCGAATACAACGCGGAAGACCCGGCGGACATCATCATC



AAACTGACCGCGCGTTACGCGAAAGAAGCGGTTATCAAAAACCAGAACGTTGGTAACTACGTTAAA



AACGCGATCACCACCACCAACGCGAACGGTCTGGGTTGGCTGCTGAACAAAGGTCTGTCTCTGCTG



CCGGTTTCTACCGACGACGAACTGCTGGAATTCATCGGTGTTGAACGTTCTCACCCGTCTTGCCACG



CGCTGATCGAACTGATCGCGCAGCTGGAAGCGCCGGAACTGTTCGAAAAAAACGTTTTCTCTGACA



CCCGTTCTGAAGTTCAGGGTATGATCGACTCTGCGGTTTCTAACCACATCGCGCGTCTGTCTTCTTCT



CGTAACTCTCTGTCTATGGACTCTGAAGAACTGGAACGTCTGATCAAATCTTTCCAGATCCACACCC



CGCACTGCTCTCTGTTCATCGGTGCGCAGTCTCTGTCTCAGCAGCTGGAATCTCTGCCGGAAGCGCT



GCAGTCTGGTGTTAACTCTGCGGACATCCTGCTGGGTTCTACCCAGTACATGCTGACCAACTCTCTG



GTTGAAGAATCTATCGCGACCTACCAGCGTACCCTGAACCGTATCAACTACCTGTCTGGTGTTGCGG



GTCAGATCAACGGTGCGATCAAACGTAAAGCGATCGACGGTGAAAAAATCCACCTGCCGGCGGCGT



GGTCTGAACTGATCTCTCTGCCGTTCATCGGTCAGCCGGTTATCGACGTTGAATCTGACCTGGCGCA



CCTGAAAAACCAGTACCAGACCCTGTCTAACGAATTCGACACCCTGATCTCTGCGCTGCAGAAAAA



CTTCGACCTGAACTTCAACAAAGCGCTGCTGAACCGTACCCAGCACTTCGAAGCGATGTGCCGTTCT



ACCAAAAAAAACGCGCTGTCTAAACCGGAAATCGTTTCTTACCGTGACCTGCTGGCGCGTCTGACCT



CTTGCCTGTACCGTGGTTCTCTGGTTCTGCGTCGTGCGGGTATCGAAGTTCTGAAAAAACACAAAAT



CTTCGAATCTAACTCTGAACTGCGTGAACACGTTCACGAACGTAAACACTTCGTTTTCGTTTCTCCG



CTGGACCGTAAAGCGAAAAAACTGCTGCGTCTGACCGACTCTCGTCCGGACCTGCTGCACGTTATC



GACGAAATCCTGCAGCACGACAACCTGGAAAACAAAGACCGTGAATCTCTGTGGCTGGTTCGTTCT



GGTTACCTGCTGGCGGGTCTGCCGGACCAGCTGTCTTCTTCTTTCATCAACCTGCCGATCATCACCC



AGAAAGGTGACCGTCGTCTGATCGACCTGATCCAGTACGACCAGATCAACCGTGACGCGTTCGTTA



TGCTGGTTACCTCTGCGTTCAAATCTAACCTGTCTGGTCTGCAGTACCGTGCGAACAAACAGTCTTT



CGTTGTTACCCGTACCCTGTCTCCGTACCTGGGTTCTAAACTGGTTTACGTTCCGAAAGACAAAGAC



TGGCTGGTTCCGTCTCAGATGTTCGAAGGTCGTTTCGCGGACATCCTGCAGTCTGACTACATGGTTT



GGAAAGACGCGGGTCGTCTGTGCGTTATCGACACCGCGAAACACCTGTCTAACATCAAAAAATCTG



TTTTCTCTTCTGAAGAAGTTCTGGCGTTCCTGCGTGAACTGCCGCACCGTACCTTCATCCAGACCGA



AGTTCGTGGTCTGGGTGTTAACGTTGACGGTATCGCGTTCAACAACGGTGACATCCCGTCTCTGAAA



ACCTTCTCTAACTGCGTTCAGGTTAAAGTTTCTCGTACCAACACCTCTCTGGTTCAGACCCTGAACC



GTTGGTTCGAAGGTGGTAAAGTTTCTCCGCCGTCTATCCAGTTCGAACGTGCGTACTACAAAAAAGA



CGACCAGATCCACGAAGACGCGGCGAAACGTAAAATCCGTTTCCAGATGCCGGCGACCGAACTGGT



TCACGCGTCTGACGACGCGGGTTGGACCCCGTCTTACCTGCTGGGTATCGACCCGGGTGAATACGGT



ATGGGTCTGTCTCTGGTTTCTATCAACAACGGTGAAGTTCTGGACTCTGGTTTCATCCACATCAACTC



TCTGATCAACTTCGCGTCTAAAAAATCTAACCACCAGACCAAAGTTGTTCCGCGTCAGCAGTACAA



ATCTCCGTACGCGAACTACCTGGAACAGTCTAAAGACTCTGCGGCGGGTGACATCGCGCACATCCT



GGACCGTCTGATCTACAAACTGAACGCGCTGCCGGTTTTCGAAGCGCTGTCTGGTAACTCTCAGTCT



GCGGCGGACCAGGTTTGGACCAAAGTTCTGTCTTTCTACACCTGGGGTGACAACGACGCGCAGAAC



TCTATCCGTAAACAGCACTGGTTCGGTGCGTCTCACTGGGACATCAAAGGTATGCTGCGTCAGCCGC



CGACCGAAAAAAAACCGAAACCGTACATCGCGTTCCCGGGTTCTCAGGTTTCTTCTTACGGTAACTC



TCAGCGTTGCTCTTGCTGCGGTCGTAACCCGATCGAACAGCTGCGTGAAATGGCGAAAGACACCTC



TATCAAAGAACTGAAAATCCGTAACTCTGAAATCCAGCTGTTCGACGGTACCATCAAACTGTTCAA



CCCGGACCCGTCTACCGTTATCGAACGTCGTCGTCACAACCTGGGTCCGTCTCGTATCCCGGTTGCG



GACCGTACCTTCAAAAACATCTCTCCGTCTTCTCTGGAATTCAAAGAACTGATCACCATCGTTTCTC



GTTCTATCCGTCACTCTCCGGAATTCATCGCGAAAAAACGTGGTATCGGTTCTGAATACTTCTGCGC



GTACTCTGACTGCAACTCTTCTCTGAACTCTGAAGCGAACGCGGCGGCGAACGTTGCGCAGAAATT



CCAGAAACAGCTGTTCTTCGAACTGTAA





SEQ ID NO: 57

ATGAAACGTATCCTGAACTCTCTGAAAGTTGCGGCGCTGCGTCTGCTGTTCCGTGGTAAAGGTTCTG



AACTGGTTAAAACCGTTAAATACCCGCTGGTTTCTCCGGTTCAGGGTGCGGTTGAAGAACTGGCGG



AAGCGATCCGTCACGACAACCTGCACCTGTTCGGTCAGAAAGAAATCGTTGACCTGATGGAAAAAG



ACGAAGGTACCCAGGTTTACTCTGTTGTTGACTTCTGGCTGGACACCCTGCGTCTGGGTATGTTCTTC



TCTCCGTCTGCGAACGCGCTGAAAATCACCCTGGGTAAATTCAACTCTGACCAGGTTTCTCCGTTCC



GTAAAGTTCTGGAACAGTCTCCGTTCTTCCTGGCGGGTCGTCTGAAAGTTGAACCGGCGGAACGTAT



CCTGTCTGTTGAAATCCGTAAAATCGGTAAACGTGAAAACCGTGTTGAAAACTACGCGGCGGACGT



TGAAACCTGCTTCATCGGTCAGCTGTCTTCTGACGAAAAACAGTCTATCCAGAAACTGGCGAACGA



CATCTGGGACTCTAAAGACCACGAAGAACAGCGTATGCTGAAAGCGGACTTCTTCGCGATCCCGCT



GATCAAAGACCCGAAAGCGGTTACCGAAGAAGACCCGGAAAACGAAACCGCGGGTAAACAGAAAC



CGCTGGAACTGTGCGTTTGCCTGGTTCCGGAACTGTACACCCGTGGTTTCGGTTCTATCGCGGACTT



CCTGGTTCAGCGTCTGACCCTGCTGCGTGACAAAATGTCTACCGACACCGCGGAAGACTGCCTGGA



ATACGTTGGTATCGAAGAAGAAAAAGGTAACGGTATGAACTCTCTGCTGGGTACCTTCCTGAAAAA



CCTGCAGGGTGACGGTTTCGAACAGATCTTCCAGTTCATGCTGGGTTCTTACGTTGGTTGGCAGGGT



AAAGAAGACGTTCTGCGTGAACGTCTGGACCTGCTGGCGGAAAAAGTTAAACGTCTGCCGAAACCG



AAATTCGCGGGTGAATGGTCTGGTCACCGTATGTTCCTGCACGGTCAGCTGAAATCTTGGTCTTCTA



ACTTCTTCCGTCTGTTCAACGAAACCCGTGAACTGCTGGAATCTATCAAATCTGACATCCAGCACGC



GACCATGCTGATCTCTTACGTTGAAGAAAAAGGTGGTTACCACCCGCAGCTGCTGTCTCAGTACCGT



AAACTGATGGAACAGCTGCCGGCGCTGCGTACCAAAGTTCTGGACCCGGAAATCGAAATGACCCAC



ATGTCTGAAGCGGTTCGTTCTTACATCATGATCCACAAATCTGTTGCGGGTTTCCTGCCGGACCTGC



TGGAATCTCTGGACCGTGACAAAGACCGTGAATTCCTGCTGTCTATCTTCCCGCGTATCCCGAAAAT



CGACAAAAAAACCAAAGAAATCGTTGCGTGGGAACTGCCGGGTGAACCGGAAGAAGGTTACCTGT



TCACCGCGAACAACCTGTTCCGTAACTTCCTGGAAAACCCGAAACACGTTCCGCGTTTCATGGCGGA



ACGTATCCCGGAAGACTGGACCCGTCTGCGTTCTGCGCCGGTTTGGTTCGACGGTATGGTTAAACAG



TGGCAGAAAGTTGTTAACCAGCTGGTTGAATCTCCGGGTGCGCTGTACCAGTTCAACGAATCTTTCC



TGCGTCAGCGTCTGCAGGCGATGCTGACCGTTTACAAACGTGACCTGCAGACCGAAAAATTCCTGA



AACTGCTGGCGGACGTTTGCCGTCCGCTGGTTGACTTCTTCGGTCTGGGTGGTAACGACATCATCTT



CAAATCTTGCCAGGACCCGCGTAAACAGTGGCAGACCGTTATCCCGCTGTCTGTTCCGGCGGACGTT



TACACCGCGTGCGAAGGTCTGGCGATCCGTCTGCGTGAAACCCTGGGTTTCGAATGGAAAAACCTG



AAAGGTCACGAACGTGAAGACTTCCTGCGTCTGCACCAGCTGCTGGGTAACCTGCTGTTCTGGATCC



GTGACGCGAAACTGGTTGTTAAACTGGAAGACTGGATGAACAACCCGTGCGTTCAGGAATACGTTG



AAGCGCGTAAAGCGATCGACCTGCCGCTGGAAATCTTCGGTTTCGAAGTTCCGATCTTCCTGAACGG



TTACCTGTTCTCTGAACTGCGTCAGCTGGAACTGCTGCTGCGTCGTAAATCTGTTATGACCTCTTACT



CTGTTAAAACCACCGGTTCTCCGAACCGTCTGTTCCAGCTGGTTTACCTGCCGCTGAACCCGTCTGA



CCCGGAAAAAAAAAACTCTAACAACTTCCAGGAACGTCTGGACACCCCGACCGGTCTGTCTCGTCG



TTTCCTGGACCTGACCCTGGACGCGTTCGCGGGTAAACTGCTGACCGACCCGGTTACCCAGGAACTG



AAAACCATGGCGGGTTTCTACGACCACCTGTTCGGTTTCAAACTGCCGTGCAAACTGGCGGCGATGT



CTAACCACCCGGGTTCTTCTTCTAAAATGGTTGTTCTGGCGAAACCGAAAAAAGGTGTTGCGTCTAA



CATCGGTTTCGAACCGATCCCGGACCCGGCGCACCCGGTTTTCCGTGTTCGTTCTTCTTGGCCGGAA



CTGAAATACCTGGAAGGTCTGCTGTACCTGCCGGAAGACACCCCGCTGACCATCGAACTGGCGGAA



ACCTCTGTTTCTTGCCAGTCTGTTTCTTCTGTTGCGTTCGACCTGAAAAACCTGACCACCATCCTGGG



TCGTGTTGGTGAATTCCGTGTTACCGCGGACCAGCCGTTCAAACTGACCCCGATCATCCCGGAAAAA



GAAGAATCTTTCATCGGTAAAACCTACCTGGGTCTGGACGCGGGTGAACGTTCTGGTGTTGGTTTCG



CGATCGTTACCGTTGACGGTGACGGTTACGAAGTTCAGCGTCTGGGTGTTCACGAAGACACCCAGC



TGATGGCGCTGCAGCAGGTTGCGTCTAAATCTCTGAAAGAACCGGTTTTCCAGCCGCTGCGTAAAG



GTACCTTCCGTCAGCAGGAACGTATCCGTAAATCTCTGCGTGGTTGCTACTGGAACTTCTACCACGC



GCTGATGATCAAATACCGTGCGAAAGTTGTTCACGAAGAATCTGTTGGTTCTTCTGGTCTGGTTGGT



CAGTGGCTGCGTGCGTTCCAGAAAGACCTGAAAAAAGCGGACGTTCTGCCGAAAAAAGGTGGTAA



AAACGGTGTTGACAAAAAAAAACGTGAATCTTCTGCGCAGGACACCCTGTGGGGTGGTGCGTTCTC



TAAAAAAGAAGAACAGCAGATCGCGTTCGAAGTTCAGGCGGCGGGTTCTTCTCAGTTCTGCCTGAA



ATGCGGTTGGTGGTTCCAGCTGGGTATGCGTGAAGTTAACCGTGTTCAGGAATCTGGTGTTGTTCTG



GACTGGAACCGTTCTATCGTTACCTTCCTGATCGAATCTTCTGGTGAAAAAGTTTACGGTTTCTCTCC



GCAGCAGCTGGAAAAAGGTTTCCGTCCGGACATCGAAACCTTCAAAAAAATGGTTCGTGACTTCAT



GCGTCCGCCGATGTTCGACCGTAAAGGTCGTCCGGCGGCGGCGTACGAACGTTTCGTTCTGGGTCGT



CGTCACCGTCGTTACCGTTTCGACAAAGTTTTCGAAGAACGTTTCGGTCGTTCTGCGCTGTTCATCTG



CCCGCGTGTTGGTTGCGGTAACTTCGACCACTCTTCTGAACAGTCTGCGGTTGTTCTGGCGCTGATC



GGTTACATCGCGGACAAAGAAGGTATGTCTGGTAAAAAACTGGTTTACGTTCGTCTGGCGGAACTG



ATGGCGGAATGGAAACTGAAAAAACTGGAACGTTCTCGTGTTGAAGAACAGTCTTCTGCGCAGTAA





SEQ ID NO: 58

ATGGCGGAATCTAAACAGATGCAGTGCCGTAAATGCGGTGCGTCTATGAAATACGAAGTTATCGGT



CTGGGTAAAAAATCTTGCCGTTACATGTGCCCGGACTGCGGTAACCACACCTCTGCGCGTAAAATCC



AGAACAAAAAAAAACGTGACAAAAAATACGGTTCTGCGTCTAAAGCGCAGTCTCAGCGTATCGCG



GTTGCGGGTGCGCTGTACCCGGACAAAAAAGTTCAGACCATCAAAACCTACAAATACCCGGCGGAC



CTGAACGGTGAAGTTCACGACTCTGGTGTTGCGGAAAAAATCGCGCAGGCGATCCAGGAAGACGA



AATCGGTCTGCTGGGTCCGTCTTCTGAATACGCGTGCTGGATCGCGTCTCAGAAACAGTCTGAACCG



TACTCTGTTGTTGACTTCTGGTTCGACGCGGTTTGCGCGGGTGGTGTTTTCGCGTACTCTGGTGCGCG



TCTGCTGTCTACCGTTCTGCAGCTGTCTGGTGAAGAATCTGTTCTGCGTGCGGCGCTGGCGTCTTCTC



CGTTCGTTGACGACATCAACCTGGCGCAGGCGGAAAAATTCCTGGCGGTTTCTCGTCGTACCGGTCA



GGACAAACTGGGTAAACGTATCGGTGAATGCTTCGCGGAAGGTCGTCTGGAAGCGCTGGGTATCAA



AGACCGTATGCGTGAATTCGTTCAGGCGATCGACGTTGCGCAGACCGCGGGTCAGCGTTTCGCGGC



GAAACTGAAAATCTTCGGTATCTCTCAGATGCCGGAAGCGAAACAGTGGAACAACGACTCTGGTCT



GACCGTTTGCATCCTGCCGGACTACTACGTTCCGGAAGAAAACCGTGCGGACCAGCTGGTTGTTCTG



CTGCGTCGTCTGCGTGAAATCGCGTACTGCATGGGTATCGAAGACGAAGCGGGTTTCGAACACCTG



GGTATCGACCCGGGTGCGCTGTCTAACTTCTCTAACGGTAACCCGAAACGTGGTTTCCTGGGTCGTC



TGCTGAACAACGACATCATCGCGCTGGCGAACAACATGTCTGCGATGACCCCGTACTGGGAAGGTC



GTAAAGGTGAACTGATCGAACGTCTGGCGTGGCTGAAACACCGTGCGGAAGGTCTGTACCTGAAAG



AACCGCACTTCGGTAACTCTTGGGCGGACCACCGTTCTCGTATCTTCTCTCGTATCGCGGGTTGGCT



GTCTGGTTGCGCGGGTAAACTGAAAATCGCGAAAGACCAGATCTCTGGTGTTCGTACCGACCTGTTC



CTGCTGAAACGTCTGCTGGACGCGGTTCCGCAGTCTGCGCCGTCTCCGGACTTCATCGCGTCTATCT



CTGCGCTGGACCGTTTCCTGGAAGCGGCGGAATCTTCTCAGGACCCGGCGGAACAGGTTCGTGCGC



TGTACGCGTTCCACCTGAACGCGCCGGCGGTTCGTTCTATCGCGAACAAAGCGGTTCAGCGTTCTGA



CTCTCAGGAATGGCTGATCAAAGAACTGGACGCGGTTGACCACCTGGAATTCAACAAAGCGTTCCC



GTTCTTCTCTGACACCGGTAAAAAAAAAAAAAAAGGTGCGAACTCTAACGGTGCGCCGTCTGAAGA



AGAATACACCGAAACCGAATCTATCCAGCAGCCGGAAGACGCGGAACAGGAAGTTAACGGTCAGG



AAGGTAACGGTGCGTCTAAAAACCAGAAAAAATTCCAGCGTATCCCGCGTTTCTTCGGTGAAGGTT



CTCGTTCTGAATACCGTATCCTGACCGAAGCGCCGCAGTACTTCGACATGTTCTGCAACAACATGCG



TGCGATCTTCATGCAGCTGGAATCTCAGCCGCGTAAAGCGCCGCGTGACTTCAAATGCTTCCTGCAG



AACCGTCTGCAGAAACTGTACAAACAGACCTTCCTGAACGCGCGTTCTAACAAATGCCGTGCGCTG



CTGGAATCTGTTCTGATCTCTTGGGGTGAATTCTACACCTACGGTGCGAACGAAAAAAAATTCCGTC



TGCGTCACGAAGCGTCTGAACGTTCTTCTGACCCGGACTACGTTGTTCAGCAGGCGCTGGAAATCGC



GCGTCGTCTGTTCCTGTTCGGTTTCGAATGGCGTGACTGCTCTGCGGGTGAACGTGTTGACCTGGTT



GAAATCCACAAAAAAGCGATCTCTTTCCTGCTGGCGATCACCCAGGCGGAAGTTTCTGTTGGTTCTT



ACAACTGGCTGGGTAACTCTACCGTTTCTCGTTACCTGTCTGTTGCGGGTACCGACACCCTGTACGG



TACCCAGCTGGAAGAATTCCTGAACGCGACCGTTCTGTCTCAGATGCGTGGTCTGGCGATCCGTCTG



TCTTCTCAGGAACTGAAAGACGGTTTCGACGTTCAGCTGGAATCTTCTTGCCAGGACAACCTGCAGC



ACCTGCTGGTTTACCGTGCGTCTCGTGACCTGGCGGCGTGCAAACGTGCGACCTGCCCGGCGGAACT



GGACCCGAAAATCCTGGTTCTGCCGGTTGGTGCGTTCATCGCGTCTGTTATGAAAATGATCGAACGT



GGTGACGAACCGCTGGCGGGTGCGTACCTGCGTCACCGTCCGCACTCTTTCGGTTGGCAGATCCGTG



TTCGTGGTGTTGCGGAAGTTGGTATGGACCAGGGTACCGCGCTGGCGTTCCAGAAACCGACCGAAT



CTGAACCGTTCAAAATCAAACCGTTCTCTGCGCAGTACGGTCCGGTTCTGTGGCTGAACTCTTCTTC



TTACTCTCAGTCTCAGTACCTGGACGGTTTCCTGTCTCAGCCGAAAAACTGGTCTATGCGTGTTCTGC



CGCAGGCGGGTTCTGTTCGTGTTGAACAGCGTGTTGCGCTGATCTGGAACCTGCAGGCGGGTAAAA



TGCGTCTGGAACGTTCTGGTGCGCGTGCGTTCTTCATGCCGGTTCCGTTCTCTTTCCGTCCGTCTGGT



TCTGGTGACGAAGCGGTTCTGGCGCCGAACCGTTACCTGGGTCTGTTCCCGCACTCTGGTGGTATCG



AATACGCGGTTGTTGACGTTCTGGACTCTGCGGGTTTCAAAATCCTGGAACGTGGTACCATCGCGGT



TAACGGTTTCTCTCAGAAACGTGGTGAACGTCAGGAAGAAGCGCACCGTGAAAAACAGCGTCGTGG



TATCTCTGACATCGGTCGTAAAAAACCGGTTCAGGCGGAAGTTGACGCGGCGAACGAACTGCACCG



TAAATACACCGACGTTGCGACCCGTCTGGGTTGCCGTATCGTTGTTCAGTGGGCGCCGCAGCCGAA



ACCGGGTACCGCGCCGACCGCGCAGACCGTTTACGCGCGTGCGGTTCGTACCGAAGCGCCGCGTTC



TGGTAACCAGGAAGACCACGCGCGTATGAAATCTTCTTGGGGTTACACCTGGGGTACCTACTGGGA



AAAACGTAAACCGGAAGACATCCTGGGTATCTCTACCCAGGTTTACTGGACCGGTGGTATCGGTGA



ATCTTGCCCGGCGGTTGCGGTTGCGCTGCTGGGTCACATCCGTGCGACCTCTACCCAGACCGAATGG



GAAAAAGAAGAAGTTGTTTTCGGTCGTCTGAAAAAATTCTTCCCGTCTTAA





SEQ ID NO: 59

ATGGAAAAACGTATCAACAAAATCCGTAAAAAACTGTCTGCGGACAACGCGACCAAACCGGTTTCT



CGTTCTGGTCCGATGAAAACCCTGCTGGTTCGTGTTATGACCGACGACCTGAAAAAACGTCTGGAA



AAACGTCGTAAAAAACCGGAAGTTATGCCGCAGGTTATCTCTAACAACGCGGCGAACAACCTGCGT



ATGCTGCTGGACGACTACACCAAAATGAAAGAAGCGATCCTGCAGGTTTACTGGCAGGAATTCAAA



GACGACCACGTTGGTCTGATGTGCAAATTCGCGCAGCCGGCGTCTAAAAAAATCGACCAGAACAAA



CTGAAACCGGAAATGGACGAAAAAGGTAACCTGACCACCGCGGGTTTCGCGTGCTCTCAGTGCGGT



CAGCCGCTGTTCGTTTACAAACTGGAACAGGTTTCTGAAAAAGGTAAAGCGTACACCAACTACTTC



GGTCGTTGCAACGTTGCGGAACACGAAAAACTGATCCTGCTGGCGCAGCTGAAACCGGAAAAAGA



CTCTGACGAAGCGGTTACCTACTCTCTGGGTAAATTCGGTCAGCGTGCGCTGGACTTCTACTCTATC



CACGTTACCAAAGAATCTACCCACCCGGTTAAACCGCTGGCGCAGATCGCGGGTAACCGTTACGCG



TCTGGTCCGGTTGGTAAAGCGCTGTCTGACGCGTGCATGGGTACCATCGCGTCTTTCCTGTCTAAAT



ACCAGGACATCATCATCGAACACCAGAAAGTTGTTAAAGGTAACCAGAAACGTCTGGAATCTCTGC



GTGAACTGGCGGGTAAAGAAAACCTGGAATACCCGTCTGTTACCCTGCCGCCGCAGCCGCACACCA



AAGAAGGTGTTGACGCGTACAACGAAGTTATCGCGCGTGTTCGTATGTGGGTTAACCTGAACCTGT



GGCAGAAACTGAAACTGTCTCGTGACGACGCGAAACCGCTGCTGCGTCTGAAAGGTTTCCCGTCTTT



CCCGGTTGTTGAACGTCGTGAAAACGAAGTTGACTGGTGGAACACCATCAACGAAGTTAAAAAACT



GATCGACGCGAAACGTGACATGGGTCGTGTTTTCTGGTCTGGTGTTACCGCGGAAAAACGTAACAC



CATCCTGGAAGGTTACAACTACCTGCCGAACGAAAACGACCACAAAAAACGTGAAGGTTCTCTGGA



AAACCCGAAAAAACCGGCGAAACGTCAGTTCGGTGACCTGCTGCTGTACCTGGAAAAAAAATACGC



GGGTGACTGGGGTAAAGTTTTCGACGAAGCGTGGGAACGTATCGACAAAAAAATCGCGGGTCTGAC



CTCTCACATCGAACGTGAAGAAGCGCGTAACGCGGAAGACGCGCAGTCTAAAGCGGTTCTGACCGA



CTGGCTGCGTGCGAAAGCGTCTTTCGTTCTGGAACGTCTGAAAGAAATGGACGAAAAAGAATTCTA



CGCGTGCGAAATCCAGCTGCAGAAATGGTACGGTGACCTGCGTGGTAACCCGTTCGCGGTTGAAGC



GGAAAACCGTGTTGTTGACATCTCTGGTTTCTCTATCGGTTCTGACGGTCACTCTATCCAGTACCGTA



ACCTGCTGGCGTGGAAATACCTGGAAAACGGTAAACGTGAATTCTACCTGCTGATGAACTACGGTA



AAAAAGGTCGTATCCGTTTCACCGACGGTACCGACATCAAAAAATCTGGTAAATGGCAGGGTCTGC



TGTACGGTGGTGGTAAAGCGAAAGTTATCGACCTGACCTTCGACCCGGACGACGAACAGCTGATCA



TCCTGCCGCTGGCGTTCGGTACCCGTCAGGGTCGTGAATTCATCTGGAACGACCTGCTGTCTCTGGA



AACCGGTCTGATCAAACTGGCGAACGGTCGTGTTATCGAAAAAACCATCTACAACAAAAAAATCGG



TCGTGACGAACCGGCGCTGTTCGTTGCGCTGACCTTCGAACGTCGTGAAGTTGTTGACCCGTCTAAC



ATCAAACCGGTTAACCTGATCGGTGTTGACCGTGGTGAAAACATCCCGGCGGTTATCGCGCTGACC



GACCCGGAAGGTTGCCCGCTGCCGGAATTCAAAGACTCTTCTGGTGGTCCGACCGACATCCTGCGT



ATCGGTGAAGGTTACAAAGAAAAACAGCGTGCGATCCAGGCGGCGAAAGAAGTTGAACAGCGTCG



TGCGGGTGGTTACTCTCGTAAATTCGCGTCTAAATCTCGTAACCTGGCGGACGACATGGTTCGTAAC



TCTGCGCGTGACCTGTTCTACCACGCGGTTACCCACGACGCGGTTCTGGTTTTCGAAAACCTGTCTC



GTGGTTTCGGTCGTCAGGGTAAACGTACCTTCATGACCGAACGTCAGTACACCAAAATGGAAGACT



GGCTGACCGCGAAACTGGCGTACGAAGGTCTGACCTCTAAAACCTACCTGTCTAAAACCCTGGCGC



AGTACACCTCTAAAACCTGCTCTAACTGCGGTTTCACCATCACCACCGCGGACTACGACGGTATGCT



GGTTCGTCTGAAAAAAACCTCTGACGGTTGGGCGACCACCCTGAACAACAAAGAACTGAAAGCGG



AAGGTCAGATCACCTACTACAACCGTTACAAACGTCAGACCGTTGAAAAAGAACTGTCTGCGGAAC



TGGACCGTCTGTCTGAAGAATCTGGTAACAACGACATCTCTAAATGGACCAAAGGTCGTCGTGACG



AAGCGCTGTTCCTGCTGAAAAAACGTTTCTCTCACCGTCCGGTTCAGGAACAGTTCGTTTGCCTGGA



CTGCGGTCACGAAGTTCACGCGGACGAACAGGCGGCGCTGAACATCGCGCGTTCTTGGCTGTTCCT



GAACTCTAACTCTACCGAATTCAAATCTTACAAATCTGGTAAACAGCCGTTCGTTGGTGCGTGGCAG



GCGTTCTACAAACGTCGTCTGAAAGAAGTTTGGAAACCGAACGCG





SEQ ID NO: 60

ATGAAACGTATCAACAAAATCCGTCGTCGTCTGGTTAAAGACTCTAACACCAAAAAAGCGGGTAAA



ACCGGTCCGATGAAAACCCTGCTGGTTCGTGTTATGACCCCGGACCTGCGTGAACGTCTGGAAAAC



CTGCGTAAAAAACCGGAAAACATCCCGCAGCCGATCTCTAACACCTCTCGTGCGAACCTGAACAAA



CTGCTGACCGACTACACCGAAATGAAAAAAGCGATCCTGCACGTTTACTGGGAAGAATTCCAGAAA



GACCCGGTTGGTCTGATGTCTCGTGTTGCGCAGCCGGCGCCGAAAAACATCGACCAGCGTAAACTG



ATCCCGGTTAAAGACGGTAACGAACGTCTGACCTCTTCTGGTTTCGCGTGCTCTCAGTGCTGCCAGC



CGCTGTACGTTTACAAACTGGAACAGGTTAACGACAAAGGTAAACCGCACACCAACTACTTCGGTC



GTTGCAACGTTTCTGAACACGAACGTCTGATCCTGCTGTCTCCGCACAAACCGGAAGCGAACGACG



AACTGGTTACCTACTCTCTGGGTAAATTCGGTCAGCGTGCGCTGGACTTCTACTCTATCCACGTTAC



CCGTGAATCTAACCACCCGGTTAAACCGCTGGAACAGATCGGTGGTAACTCTTGCGCGTCTGGTCCG



GTTGGTAAAGCGCTGTCTGACGCGTGCATGGGTGCGGTTGCGTCTTTCCTGACCAAATACCAGGACA



TCATCCTGGAACACCAGAAAGTTATCAAAAAAAACGAAAAACGTCTGGCGAACCTGAAAGACATC



GCGTCTGCGAACGGTCTGGCGTTCCCGAAAATCACCCTGCCGCCGCAGCCGCACACCAAAGAAGGT



ATCGAAGCGTACAACAACGTTGTTGCGCAGATCGTTATCTGGGTTAACCTGAACCTGTGGCAGAAA



CTGAAAATCGGTCGTGACGAAGCGAAACCGCTGCAGCGTCTGAAAGGTTTCCCGTCTTTCCCGCTG



GTTGAACGTCAGGCGAACGAAGTTGACTGGTGGGACATGGTTTGCAACGTTAAAAAACTGATCAAC



GAAAAAAAAGAAGACGGTAAAGTTTTCTGGCAGAACCTGGCGGGTTACAAACGTCAGGAAGCGCT



GCTGCCGTACCTGTCTTCTGAAGAAGACCGTAAAAAAGGTAAAAAATTCGCGCGTTACCAGTTCGG



TGACCTGCTGCTGCACCTGGAAAAAAAACACGGTGAAGACTGGGGTAAAGTTTACGACGAAGCGTG



GGAACGTATCGACAAAAAAGTTGAAGGTCTGTCTAAACACATCAAACTGGAAGAAGAACGTCGTTC



TGAAGACGCGCAGTCTAAAGCGGCGCTGACCGACTGGCTGCGTGCGAAAGCGTCTTTCGTTATCGA



AGGTCTGAAAGAAGCGGACAAAGACGAATTCTGCCGTTGCGAACTGAAACTGCAGAAATGGTACG



GTGACCTGCGTGGTAAACCGTTCGCGATCGAAGCGGAAAACTCTATCCTGGACATCTCTGGTTTCTC



TAAACAGTACAACTGCGCGTTCATCTGGCAGAAAGACGGTGTTAAAAAACTGAACCTGTACCTGAT



CATCAACTACTTCAAAGGTGGTAAACTGCGTTTCAAAAAAATCAAACCGGAAGCGTTCGAAGCGAA



CCGTTTCTACACCGTTATCAACAAAAAATCTGGTGAAATCGTTCCGATGGAAGTTAACTTCAACTTC



GACGACCCGAACCTGATCATCCTGCCGCTGGCGTTCGGTAAACGTCAGGGTCGTGAATTCATCTGG



AACGACCTGCTGTCTCTGGAAACCGGTTCTCTGAAACTGGCGAACGGTCGTGTTATCGAAAAAACC



CTGTACAACCGTCGTACCCGTCAGGACGAACCGGCGCTGTTCGTTGCGCTGACCTTCGAACGTCGTG



AAGTTCTGGACTCTTCTAACATCAAACCGATGAACCTGATCGGTATCGACCGTGGTGAAAACATCCC



GGCGGTTATCGCGCTGACCGACCCGGAAGGTTGCCCGCTGTCTCGTTTCAAAGACTCTCTGGGTAAC



CCGACCCACATCCTGCGTATCGGTGAATCTTACAAAGAAAAACAGCGTACCATCCAGGCGGCGAAA



GAAGTTGAACAGCGTCGTGCGGGTGGTTACTCTCGTAAATACGCGTCTAAAGCGAAAAACCTGGCG



GACGACATGGTTCGTAACACCGCGCGTGACCTGCTGTACTACGCGGTTACCCAGGACGCGATGCTG



ATCTTCGAAAACCTGTCTCGTGGTTTCGGTCGTCAGGGTAAACGTACCTTCATGGCGGAACGTCAGT



ACACCCGTATGGAAGACTGGCTGACCGCGAAACTGGCGTACGAAGGTCTGCCGTCTAAAACCTACC



TGTCTAAAACCCTGGCGCAGTACACCTCTAAAACCTGCTCTAACTGCGGTTTCACCATCACCTCTGC



GGACTACGACCGTGTTCTGGAAAAACTGAAAAAAACCGCGACCGGTTGGATGACCACCATCAACGG



TAAAGAACTGAAAGTTGAAGGTCAGATCACCTACTACAACCGTTACAAACGTCAGAACGTTGTTAA



AGACCTGTCTGTTGAACTGGACCGTCTGTCTGAAGAATCTGTTAACAACGACATCTCTTCTTGGACC



AAAGGTCGTTCTGGTGAAGCGCTGTCTCTGCTGAAAAAACGTTTCTCTCACCGTCCGGTTCAGGAAA



AATTCGTTTGCCTGAACTGCGGTTTCGAAACCCACGCGGACGAACAGGCGGCGCTGAACATCGCGC



GTTCTTGGCTGTTCCTGCGTTCTCAGGAATACAAAAAATACCAGACCAACAAAACCACCGGTAACA



CCGACAAACGTGCGTTCGTTGAAACCTGGCAGTCTTTCTACCGTAAAAAACTGAAAGAAGTTTGGA



AACCG





SEQ ID NO: 61

AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAAGCATTGATAATTG



AGATCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGACAAAAATAAATTATTTATTTATCCA



GAAAATGAATTGGAAAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAG



GTtgccgtcactgcgtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagc



attctgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataatcacggcag



aaaagtccacattgattatttgcacggcgtcacactttgctatgccatagcatttttatccataagat



tagcggatcctacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggct



agcaccgcctatctcgtgtgagataggcggagatacgaactttaagAAGGAGatatacc



ATGGGTAAAATGTATTACCTTGGTTTAGACATTGGCACGAATTCCGTGGGCTACGCGGTGACCGA



CCCCTCATACCACCTGCTGAAGTTTAAGGGGGAACCAATGTGGGGTGCGCACGTATTTGCCGCCGG



TAATCAGAGCGCGGAACGACGCTCGTTCCGCACATCGCGTCGTCGTTTGGACCGACGCCAACAGCG



CGTTAAACTGGTACAGGAGATTTTTGCCCCGGTGATTAGTCCGATCGACCCACGCTTCTTCATTCGT



CTGCATGAATCCGCCCTGTGGCGCGATGACGTCGCGGAGACGGATAAACATATCTTTTTCAATGATC



CTACCTATACCGATAAGGAATATTATAGCGATTACCCGACTATCCATCACCTGATCGTTGATCTGAT



GGAAAGCTCTGAGAAACACGATCCGCGGCTGGTGTACCTTGCAGTGGCGTGGTTAGTGGCACACCG



TGGTCATTTTCTGAACGAGGTGGACAAGGATAATATTGGAGATGTGTTGTCGTTCGACGCATTTTAT



CCGGAGTTTCTCGCGTTCCTGTCGGACAACGGTGTATCACCGTGGGTGTGCGAAAGCAAAGCGCTG



CAGGCGACCTTGCTGAGCCGTAACTCAGTGAACGACAAATATAAAGCCCTTAAGTCTCTGATCTTCG



GATCCCAGAAACCTGAAGATAACTTCGATGCCAATATTTCGGAAGATGGACTCATTCAACTGCTGG



CCGGCAAAAAGGTAAAAGTTAACAAACTGTTCCCTCAGGAATCGAACGATGCATCCTTCACATTGA



ATGATAAAGAAGACGCGATAGAAGAAATCCTGGGTACGCTTACACCAGATGAATGTGAATGGATTG



CGCATATACGCCGCCTTTTTGACTGGGCTATCATGAAACATGCTCTGAAAGATGGCAGGACTATTAG



CGAGTCAAAAGTCAAACTGTATGAGCAGCACCATCACGATCTGACCCAACTTAAATACTTCGTGAA



AACCTACCTTGCAAAAGAATACGACGATATTTTCCGCAACGTGGATAGCGAAACAACGAAAAACTA



TGTAGCGTATTCCTATCATGTGAAAGAGGTGAAAGGCACTCTGCCTAAAAATAAGGCAACGCAAGA



AGAGTTTTGTAAGTATGTCCTGGGCAAGGTTAAAAACATTGAATGCTCTGAAGCAGACAAGGTTGA



CTTTGATGAGATGATTCAGCGTCTTACCGACAACTCTTTTATGCCTAAGCAGGTTTCGGGCGAAAAC



CGCGTTATTCCTTATCAGTTATATTATTATGAACTGAAGACAATTCTGAATAAAGCAGCCTCGTACC



TGCCTTTCCTGACGCAGTGTGGAAAAGATGCAATTTCGAACCAGGACAAACTACTGTCGATCATGA



CGTTCCGTATTCCTTACTTCGTCGGACCCTTGCGAAAAGATAATTCGGAACATGCATGGCTCGAACG



AAAGGCCGGTAAGATTTATCCGTGGAACTTTAACGACAAAGTGGACTTGGATAAATCAGAAGAAGC



GTTCATTCGCCGAATGACCAATACCTGTACCTATTATCCCGGCGAAGATGTTTTACCGTTGGATTCG



CTGATCTATGAGAAATTTATGATTTTAAATGAAATCAATAATATTCGTATTGACGGCTACCCGATTA



GTGTTGACGTTAAACAGCAGGTTTTTGGCTTGTTCGAAAAAAAACGACGCGTAACCGTGAAAGATA



TTCAGAACCTGCTGCTGTCTCTCGGAGCTCTGGACAAACACGGGAAGCTGACAGGCATCGATACCA



CTATCCACTCAAACTATAATACGTATCACCATTTTAAATCTCTCATGGAACGCGGCGTCCTGACCCG



GGATGACGTGGAACGCATCGTTGAAAGGATGACCTACAGCGACGATACTAAGCGTGTGCGTCTGTG



GCTGAATAACAACTATGGTACTTTAACCGCCGACGATGTGAAACACATTTCGCGTCTGCGCAAACA



CGATTTTGGCCGTTTATCCAAAATGTTCTTAACAGGTCTGAAGGGTGTCCATAAGGAGACCGGTGAA



CGTGCCTCCATACTGGATTTCATGTGGAACACGAACGATAACCTGATGCAGCTCCTTTCCGAATGCT



ACACGTTCAGTGATGAAATCACAAAGCTGCAAGAGGCGTATTATGCAAAAGCCCAGTTGTCTTTAA



ACGATTTTTTAGACTCGATGTACATCTCTAACGCGGTGAAACGTCCGATTTACAGAACTCTGGCAGT



GGTGAACGATATTCGAAAAGCATGTGGGACGGCCCCTAAACGCATTTTCATCGAAATGGCTCGTGA



TGGTGAATCAAAAAAAAAGAGAAGTGTTACACGTCGCGAGCAGATCAAAAACCTGTACCGCTCGAT



TCGTAAAGATTTCCAGCAGGAAGTTGATTTTCTGGAAAAGATCCTGGAAAATAAATCTGATGGTCA



ACTTCAGTCAGATGCTTTGTATCTTTACTTTGCACAATTAGGGCGCGATATGTACACGGGCGATCCA



ATAAAGCTGGAGCACATCAAAGATCAGAGTTTCTATAACATAGACCATATTTACCCGCAGTCTATG



GTGAAAGACGATTCCCTAGATAACAAAGTGCTGGTGCAAAGCGAAATTAACGGCGAGAAAAGCTC



GCGATACCCTTTGGACGCCGCGATCCGCAATAAAATGAAGCCCCTTTGGGACGCTTACTATAATCAT



GGCCTGATCTCCTTAAAGAAATACCAGCGTCTAACGCGCTCGACCCCGTTTACCGATGATGAAAAA



TGGGACTTTATTAATCGCCAGTTAGTGGAAACCCGTCAATCTACCAAAGCGCTGGCCATTTTGTTGA



AGCGTAAGTTTCCAGACACCGAAATTGTGTATTCGAAGGCGGGGTTATCGTCCGACTTCAGACATG



AATTCGGCCTTGTAAAAAGTCGCAATATTAATGATTTGCACCACGCTAAAGACGCATTCTTGGCTAT



CGTTACCGGCAATGTGTACCATGAAAGATTCAATCGCAGATGGTTTATGGTGAACCAGCCGTACTC



AGTTAAAACTAAAACTCTTTTTACCCACAGCATAAAGAATGGCAACTTCGTTGCCTGGAACGGCGA



AGAAGATCTCGGTCGTATTGTAAAAATGCTGAAGCAAAACAAAAATACCATTCACTTCACGCGCTT



CTCCTTCGATCGCAAAGAAGGATTATTTGATATCCAACCTCTGAAAGCCAGCACCGGCTTAGTCCCA



CGAAAAGCCGGTCTGGATGTCGTTAAATACGGCGGATATGACAAATCTACCGCGGCCTATTACCTG



CTGGTGAGGTTCACGCTCGAGGACAAGAAAACCCAGCACAAGCTGATGATGATTCCTGTAGAAGGC



CTGTACAAGGCTCGCATTGATCATGACAAGGAATTTCTTACCGATTATGCGCAAACGACTATAAGC



GAAATCCTACAGAAAGATAAACAGAAAGTGATCAATATTATGTTTCCAATGGGTACGAGGCATATA



AAACTCAATTCAATGATTAGTATCGATGGCTTCTATCTTAGTATCGGCGGAAAGTCCTCTAAAGGTA



AGTCAGTTCTATGTCACGCAATGGTTCCACTGATCGTCCCTCACAAAATCGAATGTTACATTAAAGC



AATGGAAAGCTTCGCCCGGAAGTTTAAAGAAAACAACAAGCTGCGCATCGTAGAAAAATTCGATA



AAATCACCGTTGAAGACAACCTGAATCTCTACGAGCTCTTTCTCCAAAAACTGCAGCATAATCCCTA



TAATAAGTTTTTTTCGACACAGTTTGACGTACTGACGAACGGCCGTTCTACTTTCACAAAACTGTCG



CCGGAGGAACAGGTACAGACGCTCTTGAACATTTTAAGTATCTTTAAAACATGCCGCAGTTCGGGTT



GCGACCTGAAATCCATCAACGGCAGTGCCCAGGCAGCGCGCATCATGATTAGCGCTGACTTAACTG



GACTGTCGAAAAAATATTCAGATATTAGGTTGGTTGAACAGTCAGCTTCTGGTTTGTTCGTATCCAA



AAGTCAGAACTTACTGGAGTATCTCTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCT



GAAATTTATTATATCGCGTTGATTATTGATGCTGTTTTTAGTTTTAACGGCAATTAATATATGTGTTA



TTAATTGAATGAATTTTATCATTCATAATAAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACT



CAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACAGAATTATCTCATAACAAGTGTTAAGGGATG



TTATTTCC





SEQ ID NO: 62
AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAAGCATTGATAATTG



AGATCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGACAAAAATAAATTATTTATTTATCCA



GAAAATGAATTGGAAAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAG



GTtgccgtcactgcgtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagc



attctgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataatcacggcag



aaaagtccacattgattatttgcacggcgtcacactttgctatgccatagcatttttatccataagat



tagcggatcctacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggct



agcaccgcctatctcgtgtgagataggcggagatacgaactttaagAAGGAGatatacc



ATGTCATCGCTCACGAAATTCACTAACAAATACTCTAAACAGCTCACCATTAAGAATGAACTCAT



CCCAGTTGGCAAAACACTGGAGAACATCAAAGAGAATGGTCTGATAGATGGCGACGAACAGCTGA



ATGAGAATTATCAGAAGGCGAAAATTATTGTGGATGATTTTCTGCGGGACTTCATTAATAAAGCACT



GAATAATACGCAGATCGGGAACTGGCGCGAACTGGCGGATGCCCTTAATAAAGAGGATGAAGATA



ACATCGAGAAATTGCAGGATAAAATTCGGGGAATCATTGTATCCAAATTTGAAACGTTTGATCTGTT



TAGCAGCTATTCTATTAAGAAAGATGAAAAGATTATTGACGACGACAATGATGTTGAAGAAGAGGA



ACTGGATCTGGGCAAGAAGACCAGCTCATTTAAATACATATTTAAAAAAAACCTGTTTAAGTTAGT



GTTGCCATCCTACCTGAAAACCACAAACCAGGACAAGCTGAAGATTATTAGCTCGTTTGATAATTTT



TCAACGTACTTCCGCGGGTTCTTTGAAAACCGGAAAAACATTTTTACCAAGAAACCGATCTCCACAA



GTATTGCGTATCGCATTGTTCATGATAACTTCCCGAAATTCCTTGATAACATTCGTTGTTTTAATGTG



TGGCAGACGGAATGCCCGCAACTAATCGTGAAAGCAGATAACTATCTGAAAAGCAAAAATGTTATA



GCGAAAGATAAAAGTTTGGCAAACTATTTTACCGTGGGCGCGTATGACTATTTCCTGTCTCAGAATG



GTATAGATTTTTACAACAATATTATAGGTGGACTGCCAGCGTTCGCCGGCCATGAGAAAATCCAAG



GTCTCAATGAATTCATCAATCAAGAGTGCCAAAAAGACAGCGAGCTGAAAAGTAAGCTGAAAAAC



CGTCACGCGTTCAAAATGGCGGTACTGTTCAAACAGATACTCAGCGATCGTGAAAAAAGTTTTGTA



ATTGATGAGTTCGAGTCGGATGCTCAAGTTATTGACGCCGTTAAAAACTTTTACGCCGAACAGTGCA



AAGATAACAATGTTATTTTTAACTTATTAAATCTTATCAAGAATATCGCTTTCTTAAGTGATGACGA



ACTGGACGGCATATTCATTGAAGGGAAATACCTGTCGAGCGTTAGTCAAAAACTCTATAGCGATTG



GTCAAAATTACGTAACGACATTGAGGATTCGGCTAACTCTAAACAAGGCAATAAAGAGCTGGCCAA



GAAGATCAAAACCAACAAAGGGGATGTAGAAAAAGCGATCTCGAAATATGAGTTCTCGCTGTCGG



AACTGAACTCGATTGTACATGATAACACCAAGTTTTCTGACCTCCTTAGTTGTACACTGCATAAGGT



GGCTTCTGAGAAACTGGTGAAGGTCAATGAAGGCGACTGGCCGAAACATCTCAAGAATAATGAAG



AGAAACAAAAAATCAAAGAGCCGCTTGATGCTCTGCTGGAGATCTATAATACACTTCTGATTTTTAA



CTGCAAAAGCTTCAATAAAAACGGCAACTTCTATGTCGACTATGATCGTTGCATCAATGAACTGAGT



TCGGTCGTGTATCTGTATAATAAAACACGTAACTATTGCACTAAAAAACCCTATAACACGGACAAG



TTCAAACTCAATTTTAACAGTCCGCAGCTCGGTGAAGGCTTTTCCAAGTCGAAAGAAAATGACTGTC



TGACTCTTTTGTTTAAAAAAGACGACAACTATTATGTAGGCATTATCCGCAAAGGTGCAAAAATCA



ATTTTGATGATACACAAGCAATCGCCGATAACACCGACAATTGCATCTTTAAAATGAATTATTTCCT



ACTTAAAGACGCAAAAAAATTTATCCCGAAATGTAGCATTCAGCTGAAAGAAGTCAAGGCCCATTT



TAAGAAATCTGAAGATGATTACATTTTGTCTGATAAAGAGAAATTTGCTAGCCCGCTGGTCATTAAA



AAGAGCACATTTTTGCTGGCAACTGCACATGTGAAAGGGAAAAAAGGCAATATCAAGAAATTTCAG



AAAGAATATTCGAAAGAAAACCCCACTGAGTATCGCAATTCTTTAAACGAATGGATTGCTTTTTGTA



AAGAGTTCTTAAAAACTTATAAAGCGGCTACCATTTTTGATATAACCACATTGAAAAAGGCAGAGG



AATATGCTGATATTGTAGAATTCTACAAGGATGTCGATAATCTGTGCTACAAACTGGAGTTCTGCCC



GATTAAAACCTCGTTTATAGAAAACCTGATAGATAACGGCGACCTGTATCTGTTTCGCATCAATAAC



AAAGACTTCAGCAGTAAATCGACCGGCACCAAGAACCTTCATACGTTATATTTACAAGCTATATTCG



ATGAACGTAATCTGAACAATCCGACAATTATGCTGAATGGGGGAGCAGAACTGTTCTATCGTAAAG



AAAGTATTGAGCAGAAAAACCGTATCACACACAAAGCCGGTTCAATTCTCGTGAATAAGGTGTGTA



AAGACGGTACAAGCCTGGATGATAAGATACGTAATGAAATTTATCAATATGAGAATAAATTTATTG



ATACCCTGTCTGATGAAGCTAAAAAGGTGTTACCGAATGTCATTAAAAAGGAAGCTACCCATGACA



TTACAAAAGATAAACGTTTCACTAGTGACAAATTCTTCTTTCACTGCCCCCTGACAATTAATTATAA



GGAAGGCGATACCAAGCAGTTCAATAACGAAGTGCTGAGTTTTCTGCGTGGAAATCCTGACATCAA



CATTATCGGCATTGACCGCGGAGAGCGTAATTTAATCTATGTAACGGTTATAAACCAGAAAGGCGA



GATTCTGGATTCGGTTTCATTCAATACCGTGACCAACAAGAGTTCAAAAATCGAGCAGACAGTCGA



TTATGAAGAGAAATTGGCAGTCCGCGAGAAAGAGAGGATTGAAGCAAAACGTTCCTGGGACTCTAT



CTCAAAAATTGCGACACTAAAGGAAGGTTATCTGAGCGCAATAGTTCACGAGATCTGTCTGTTAAT



GATTAAACACAACGCGATCGTTGTCTTAGAGAATCTTAATGCAGGCTTTAAGCGTATTCGTGGCGGT



TTATCAGAAAAAAGTGTTTATCAAAAATTCGAAAAAATGTTGATTAACAAACTGAACTATTTTGTCA



GCAAGAAGGAATCCGACTGGAATAAACCGTCTGGTCTGCTGAATGGACTGCAGCTTTCGGATCAGT



TTGAAAGCTTCGAAAAACTGGGTATTCAGTCTGGTTTTATTTTTTACGTGCCGGCTGCATATACCTCA



AAGATTGATCCGACCACGGGCTTCGCCAATGTTCTGAATCTGTCGAAGGTACGCAATGTTGATGCG



ATCAAAAGCTTTTTTTCTAACTTCAACGAAATTAGTTATAGCAAGAAAGAAGCCCTTTTCAAATTCT



CATTCGATCTGGATTCACTGAGTAAGAAAGGCTTTAGTAGCTTTGTGAAATTTAGTAAGAGTAAATG



GAACGTCTACACCTTTGGAGAACGTATCATAAAGCCAAAGAATAAGCAAGGTTATCGGGAGGACA



AAAGAATCAACTTGACCTTCGAGATGAAGAAGTTACTTAACGAGTATAAGGTTTCTTTTGATCTTGA



AAATAACTTGATTCCGAATCTCACGAGTGCCAACCTGAAGGATACTTTTTGGAAAGAGCTATTCTTT



ATCTTCAAGACTACGCTGCAGCTCCGTAACAGCGTTACTAACGGTAAAGAAGATGTGCTCATCTCTC



CGGTCAAAAATGCGAAGGGTGAATTCTTCGTTTCGGGAACGCATAACAAGACTCTTCCGCAAGATT



GCGATGCGAACGGTGCATACCATATTGCGTTGAAAGGTCTGATGATACTCGAACGTAACAACCTTG



TACGTGAGGAGAAAGATACGAAAAAGATTATGGCGATTTCAAACGTGGATTGGTTCGAGTACGTGC



AGAAACGTAGAGGCGTTCTGTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAAT



TTATTATATCGCGTTGATTATTGATGCTGTTTTTAGTTTTAACGGCAATTAATATATGTGTTATTAATT



GAATGAATTTTATCATTCATAATAAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAA



GTTATTACTCAGGAAGCAAAGAGGATTACAGAATTATCTCATAACAAGTGTTAAGGGATGTTATTTC



C





SEQ ID NO: 63
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGC



GTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAA



GCGGGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCAC



ATTGATTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCC



TACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATA



CGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATAT



ACCATGCACCATCATCATCACCATAACAACTACGACGAATTCACCAAACTGTACCCGATCCAGAAA



ACCATCCGTTTCGAACTGAAACCGCAGGGTCGTACCATGGAACACCTGGAAACCTTCAACTTCTTCG



AAGAAGACCGTGACCGTGCGGAAAAATACAAAATCCTGAAAGAAGCGATCGACGAATACCACAAA



AAATTCATCGACGAACACCTGACCAACATGTCTCTGGACTGGAACTCTCTGAAACAGATCTCTGAA



AAATACTACAAATCTCGTGAAGAAAAAGACAAAAAAGTTTTCCTGTCTGAACAGAAACGTATGCGT



CAGGAAATCGTTTCTGAATTCAAAAAAGACGACCGTTTCAAAGACCTGTTCTCTAAAAAACTGTTCT



CTGAACTGCTGAAAGAAGAAATCTACAAAAAAGGTAACCACCAGGAAATCGACGCGCTGAAATCT



TTCGACAAATTCTCTGGTTACTTCATCGGTCTGCACGAAAACCGTAAAAACATGTACTCTGACGGTG



ACGAAATCACCGCGATCTCTAACCGTATCGTTAACGAAAACTTCCCGAAATTCCTGGACAACCTGC



AGAAATACCAGGAAGCGCGTAAAAAATACCCGGAATGGATCATCAAAGCGGAATCTGCGCTGGTT



GCGCACAACATCAAAATGGACGAAGTTTTCTCTCTGGAATACTTCAACAAAGTTCTGAACCAGGAA



GGTATCCAGCGTTACAACCTGGCGCTGGGTGGTTACGTTACCAAATCTGGTGAAAAAATGATGGGT



CTGAACGACGCGCTGAACCTGGCGCACCAGTCTGAAAAATCTTCTAAAGGTCGTATCCACATGACC



CCGCTGTTCAAACAGATCCTGTCTGAAAAAGAATCTTTCTCTTACATCCCGGACGTTTTCACCGAAG



ACTCTCAGCTGCTGCCGTCTATCGGTGGTTTCTTCGCGCAGATCGAAAACGACAAAGACGGTAACAT



CTTCGACCGTGCGCTGGAACTGATCTCTTCTTACGCGGAATACGACACCGAACGTATCTACATCCGT



CAGGCGGACATCAACCGTGTTTCTAACGTTATCTTCGGTGAATGGGGTACCCTGGGTGGTCTGATGC



GTGAATACAAAGCGGACTCTATCAACGACATCAACCTGGAACGTACCTGCAAAAAAGTTGACAAAT



GGCTGGACTCTAAAGAATTCGCGCTGTCTGACGTTCTGGAAGCGATCAAACGTACCGGTAACAACG



ACGCGTTCAACGAATACATCTCTAAAATGCGTACCGCGCGTGAAAAAATCGACGCGGCGCGTAAAG



AAATGAAATTCATCTCTGAAAAAATCTCTGGTGACGAAGAATCTATCCACATCATCAAAACCCTGCT



GGACTCTGTTCAGCAGTTCCTGCACTTCTTCAACCTGTTCAAAGCGCGTCAGGACATCCCGCTGGAC



GGTGCGTTCTACGCGGAATTCGACGAAGTTCACTCTAAACTGTTCGCGATCGTTCCGCTGTACAACA



AAGTTCGTAACTACCTGACCAAAAACAACCTGAACACCAAAAAAATCAAACTGAACTTCAAAAACC



CGACCCTGGCGAACGGTTGGGACCAGAACAAAGTTTACGACTACGCGTCTCTGATCTTCCTGCGTG



ACGGTAACTACTACCTGGGTATCATCAACCCGAAACGTAAAAAAAACATCAAATTCGAACAGGGTT



CTGGTAACGGTCCGTTCTACCGTAAAATGGTTTACAAACAGATCCCGGGTCCGAACAAAAACCTGC



CGCGTGTTTTCCTGACCTCTACCAAAGGTAAAAAAGAATACAAACCGTCTAAAGAAATCATCGAAG



GTTACGAAGCGGACAAACACATCCGTGGTGACAAATTCGACCTGGACTTCTGCCACAAACTGATCG



ACTTCTTCAAAGAATCTATCGAAAAACACAAAGACTGGTCTAAATTCAACTTCTACTTCTCTCCGAC



CGAATCTTACGGTGACATCTCTGAATTCTACCTGGACGTTGAAAAACAGGGTTACCGTATGCACTTC



GAAAACATCTCTGCGGAAACCATCGACGAATACGTTGAAAAAGGTGACCTGTTCCTGTTCCAGATC



TACAACAAAGACTTCGTTAAAGCGGCGACCGGTAAAAAAGACATGCACACCATCTACTGGAACGCG



GCGTTCTCTCCGGAAAACCTGCAGGACGTTGTTGTTAAACTGAACGGTGAAGCGGAACTGTTCTACC



GTGACAAATCTGACATCAAAGAAATCGTTCACCGTGAAGGTGAAATCCTGGTTAACCGTACCTACA



ACGGTCGTACCCCGGTTCCGGACAAAATCCACAAAAAACTGACCGACTACCACAACGGTCGTACCA



AAGACCTGGGTGAAGCGAAAGAATACCTGGACAAAGTTCGTTACTTCAAAGCGCACTACGACATCA



CCAAAGACCGTCGTTACCTGAACGACAAAATCTACTTCCACGTTCCGCTGACCCTGAACTTCAAAGC



GAACGGTAAAAAAAACCTGAACAAAATGGTTATCGAAAAATTCCTGTCTGACGAAAAAGCGCACA



TCATCGGTATCGACCGTGGTGAACGTAACCTGCTGTACTACTCTATCATCGACCGTTCTGGTAAAAT



CATCGACCAGCAGTCTCTGAACGTTATCGACGGTTTCGACTACCGTGAAAAACTGAACCAGCGTGA



AATCGAAATGAAAGACGCGCGTCAGTCTTGGAACGCGATCGGTAAAATCAAAGACCTGAAAGAAG



GTTACCTGTCTAAAGCGGTTCACGAAATCACCAAAATGGCGATCCAGTACAACGCGATCGTTGTTAT



GGAAGAACTGAACTACGGTTTCAAACGTGGTCGTTTCAAAGTTGAAAAACAGATCTACCAGAAATT



CGAAAACATGCTGATCGACAAAATGAACTACCTGGTTTTCAAAGACGCGCCGGACGAATCTCCGGG



TGGTGTTCTGAACGCGTACCAGCTGACCAACCCGCTGGAATCTTTCGCGAAACTGGGTAAACAGAC



CGGTATCCTGTTCTACGTTCCGGCGGCGTACACCTCTAAAATCGACCCGACCACCGGTTTCGTTAAC



CTGTTCAACACCTCTTCTAAAACCAACGCGCAGGAACGTAAAGAATTCCTGCAGAAATTCGAATCT



ATCTCTTACTCTGCGAAAGACGGTGGTATCTTCGCGTTCGCGTTCGACTACCGTAAATTCGGTACCT



CTAAAACCGACCACAAAAACGTTTGGACCGCGTACACCAACGGTGAACGTATGCGTTACATCAAAG



AAAAAAAACGTAACGAACTGTTCGACCCGTCTAAAGAAATCAAAGAAGCGCTGACCTCTTCTGGTA



TCAAATACGACGGTGGTCAGAACATCCTGCCGGACATCCTGCGTTCTAACAACAACGGTCTGATCT



ACACCATGTACTCTTCTTTCATCGCGGCGATCCAGATGCGTGTTTACGACGGTAAAGAAGACTACAT



CATCTCTCCGATCAAAAACTCTAAAGGTGAATTCTTCCGTACCGACCCGAAACGTCGTGAACTGCCG



ATCGACGCGGACGCGAACGGTGCGTACAACATCGCGCTGCGTGGTGAACTGACCATGCGTGCGATC



GCGGAAAAATTCGACCCGGACTCTGAAAAAATGGCGAAACTGGAACTGAAACACAAAGACTGGTT



CGAATTCATGCAGACCCGTGGTGACTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCT



GAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGA



TTACA





SEQ ID NO: 64
AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAAGCATTGATAATTG



AGATCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGACAAAAATAAATTATTTATTTATCCA



GAAAATGAATTGGAAAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAG



GTtgccgtcactgcgtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagc



attctgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataatcacggcag



aaaagtccacattgattatttgcacggcgtcacactttgctatgccatagcatttttatccataagat



tagcggatcctacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggct



agcaccgcctatctcgtgtgagataggcggagatacgaactttaagAAGGAGatatacc



ATGACTAAAACATTTGATTCAGAGTTTTTTAATTTGTACTCGCTGCAAAAAACGGTACGCTTTGAG



TTAAAACCCGTGGGAGAAACCGCGTCATTTGTGGAAGACTTTAAAAACGAGGGCTTGAAACGTGTT



GTGAGCGAAGATGAAAGGCGAGCCGTCGATTACCAGAAAGTTAAGGAAATAATTGACGATTACCA



TCGGGATTTCATTGAAGAAAGTTTAAATTATTTTCCGGAACAGGTGAGTAAAGATGCTCTTGAGCAG



GCGTTTCATCTTTATCAGAAACTGAAGGCAGCAAAAGTTGAGGAAAGGGAAAAAGCGCTGAAAGA



ATGGGAAGCGCTGCAGAAAAAGCTACGTGAAAAAGTGGTGAAATGCTTCTCGGACTCGAATAAAG



CCCGCTTCTCAAGGATTGATAAAAAGGAACTGATTAAGGAAGACCTGATAAATTGGTTGGTCGCCC



AGAATCGCGAGGATGATATCCCTACGGTCGAAACGTTTAACAACTTCACCACATATTTTACCGGCTT



CCATGAGAATCGTAAAAATATTTACTCCAAAGATGATCACGCCACCGCTATTAGCTTTCGCCTTATT



CATGAAAATCTTCCAAAGTTTTTTGACAACGTGATTAGCTTCAATAAGTTGAAAGAGGGTTTCCCTG



AATTAAAATTTGATAAAGTGAAAGAGGATTTAGAAGTAGATTATGATCTGAAGCATGCGTTTGAAA



TAGAATATTTCGTTAACTTCGTGACCCAAGCGGGCATAGATCAGTATAATTATCTGTTAGGAGGGAA



AACCCTGGAGGACGGGACGAAAAAACAAGGGATGAATGAGCAAATTAATCTGTTCAAACAACAGC



AAACGCGAGATAAAGCGCGTCAGATTCCCAAACTGATCCCCCTGTTCAAACAGATTCTTAGCGAAA



GGACTGAAAGCCAGTCCTTTATTCCTAAACAATTTGAAAGTGATCAGGAGTTGTTCGATTCACTGCA



GAAGTTACATAATAACTGCCAGGATAAATTCACCGTGCTGCAACAAGCCATTCTCGGTCTGGCAGA



GGCGGATCTTAAGAAGGTCTTCATCAAAACCTCTGATTTAAATGCCTTATCTAACACCATTTTCGGG



AATTACAGCGTCTTTTCCGATGCACTGAACCTGTATAAAGAAAGCCTGAAAACGAAAAAAGCGCAG



GAGGCTTTTGAGAAACTACCGGCCCATTCTATTCACGACCTCATTCAATACTTGGAACAGTTCAATT



CCAGCCTGGACGCGGAAAAACAACAGAGCACCGACACCGTCCTGAACTACTTCATCAAGACCGATG



AATTATATTCTCGCTTCATTAAATCCACTAGCGAGGCTTTCACTCAGGTGCAGCCTTTGTTCGAACTG



GAAGCCCTGTCATCTAAGCGCCGCCCACCGGAATCGGAAGATGAAGGGGCAAAAGGGCAGGAAGG



CTTCGAGCAGATCAAGCGTATTAAAGCTTACCTGGATACGCTTATGGAAGCGGTACACTTTGCAAA



GCCGTTGTATCTTGTTAAGGGTCGTAAAATGATCGAAGGGCTCGATAAAGACCAGTCCTTTTATGAA



GCGTTTGAAATGGCGTACCAAGAACTTGAATCGTTAATCATTCCTATCTATAACAAAGCGCGGAGCT



ATCTGTCGCGGAAACCTTTCAAGGCCGATAAATTCAAGATTAATTTTGACAACAACACGCTACTGA



GCGGATGGGATGCGAACAAGGAAACTGCTAACGCGTCCATTCTGTTTAAGAAAGACGGGTTATATT



ACCTTGGAATTATGCCGAAAGGTAAGACCTTTCTCTTTGACTACTTTGTATCGAGCGAGGATTCAGA



GAAACTGAAACAGCGTCGCCAGAAGACCGCCGAAGAAGCTCTGGCGCAGGATGGTGAAAGTTACT



TCGAAAAAATTCGTTATAAACTGTTACCAGGGGCTTCAAAGATGTTACCGAAAGTCTTTTTTAGCAA



CAAAAATATTGGCTTTTACAACCCGTCGGATGACATTTTACGCATTCGCAACACAGCCTCTCACACC



AAAAACGGGACCCCTCAGAAAGGCCACTCAAAAGTTGAGTTTAACCTGAATGATTGTCATAAGATG



ATTGATTTCTTCAAATCATCAATTCAGAAACACCCGGAATGGGGGTCTTTTGGCTTTACGTTTTCTGA



TACCAGTGATTTTGAAGACATGAGTGCCTTCTACCGGGAAGTAGAAAACCAGGGTTACGTAATTAG



CTTTGACAAAATCAAAGAGACCTATATACAGAGCCAGGTGGAACAGGGTAATCTCTACTTATTCCA



GATTTATAACAAGGATTTCTCGCCCTACAGCAAAGGCAAACCAAACCTGCATACTCTGTACTGGAA



AGCCCTGTTTGAAGAAGCGAACCTGAATAACGTAGTGGCGAAGTTGAACGGTGAAGCGGAAATCTT



CTTCCGTCGTCACTCCATTAAGGCCTCTGATAAAGTTGTCCATCCGGCAAATCAGGCCATTGATAAT



AAGAATCCACACACGGAAAAAACGCAGTCAACCTTTGAATATGACCTCGTTAAAGACAAACGCTAC



ACGCAAGATAAGTTCTTTTTCCACGTCCCAATCAGCCTCAACTTTAAAGCACAAGGGGTTTCAAAGT



TTAATGATAAAGTCAATGGGTTCCTCAAGGGCAACCCGGATGTCAACATTATAGGTATAGACAGGG



GCGAACGCCATCTGCTTTACTTTACCGTAGTGAATCAGAAAGGTGAAATACTGGTTCAGGAATCATT



AAATACCTTGATGTCGGACAAAGGGCACGTTAATGATTACCAGCAGAAACTGGATAAAAAAGAAC



AGGAACGTGATGCTGCGCGTAAATCGTGGACCACGGTTGAGAACATTAAAGAGCTGAAAGAGGGG



TATCTAAGCCATGTGGTACACAAACTGGCGCACCTCATCATTAAATATAACGCAATAGTCTGCCTAG



AAGACTTGAATTTTGGCTTTAAACGCGGCCGCTTCAAAGTGGAAAAACAAGTTTATCAAAAATTTG



AAAAGGCGCTTATAGATAAACTGAATTATCTGGTTTTTAAAGAAAAGGAACTTGGTGAGGTAGGGC



ACTACTTGACAGCTTATCAACTGACGGCCCCGTTCGAATCATTCAAAAAACTGGGCAAACAGTCTG



GCATTCTGTTTTACGTGCCGGCAGATTATACTTCAAAAATCGATCCAACAACTGGCTTTGTGAACTT



CCTGGACCTGAGATATCAGTCTGTAGAAAAAGCTAAACAACTTCTTAGCGATTTTAATGCCATTCGT



TTTAACAGCGTTCAGAATTACTTTGAATTCGAAATTGACTATAAAAAACTTACTCCGAAACGTAAAG



TCGGAACCCAAAGTAAATGGGTAATTTGTACGTATGGCGATGTCAGGTATCAGAACCGTCGGAATC



AAAAAGGTCATTGGGAGACCGAAGAAGTGAACGTGACCGAAAAGCTGAAGGCTCTGTTCGCCAGC



GATTCAAAAACTACAACTGTGATCGATTACGCAAATGATGATAACCTGATAGATGTGATTTTAGAG



CAGGATAAAGCCAGCTTTTTTAAAGAACTGTTGTGGCTCCTGAAACTTACGATGACCTTACGACATT



CCAAGATCAAATCGGAAGATGATTTTATTCTGTCACCGGTCAAGAATGAGCAGGGTGAATTCTATG



ATAGTAGGAAAGCCGGCGAAGTGTGGCCGAAAGACGCCGACGCCAATGGCGCCTATCATATCGCG



CTCAAAGGGCTTTGGAATTTGCAGCAGATTAACCAGTGGGAAAAAGGTAAAACCCTGAATCTGGCT



ATCAAAAACCAGGATTGGTTTAGCTTTATCCAAGAGAAACCGTATCAGGAATGAGAAATCATCCTT



AGCGAAAGCTAAGGATTTTTTTTATCTGAAATTTATTATATCGCGTTGATTATTGATGCTGTTTTTAG



TTTTAACGGCAATTAATATATGTGTTATTAATTGAATGAATTTTATCATTCATAATAAGTATGTGTAG



GATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACAGAAT



TATCTCATAACAAGTGTTAAGGGATGTTATTTCC





SEQ ID NO: 65
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGC



GTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAA



GCGGGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCAC



ATTGATTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCC



TACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATA



CGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATAT



ACCATGCACCATCATCATCACCATCATACAGGCGGTCTTCTTAGTATGGACGCGAAAGAGTTCACA



GGTCAGTATCCGTTGTCGAAAACATTACGATTCGAACTTCGGCCCATCGGCCGCACGTGGGATAAC



CTGGAGGCCTCAGGCTACTTAGCGGAAGACCGCCATCGTGCCGAATGTTATCCTCGTGCGAAAGAG



TTATTGGATGACAACCATCGTGCCTTCCTGAATCGTGTGTTGCCACAAATCGATATGGATTGGCACC



CGATTGCGGAGGCCTTTTGTAAGGTACATAAAAACCCTGGTAATAAAGAACTTGCCCAGGATTACA



ACCTTCAGTTGTCAAAGCGCCGTAAGGAGATCAGCGCATATCTTCAGGATGCAGATGGCTATAAAG



GCCTGTTCGCGAAGCCCGCCTTAGACGAAGCTATGAAAATTGCGAAAGAAAACGGGAACGAAAGT



GATATTGAGGTTCTCGAAGCGTTTAACGGTTTTAGCGTATACTTCACCGGTTATCATGAGTCACGCG



AGAACATTTATAGCGATGAGGATATGGTGAGCGTAGCCTACCGAATTACTGAGGATAATTTCCCGC



GCTTTGTCTCAAACGCTTTGATCTTTGATAAATTAAACGAAAGCCATCCGGATATTATCTCTGAAGT



ATCGGGCAATCTTGGAGTTGATGACATTGGTAAGTACTTTGACGTGTCGAACTATAACAATTTTCTT



TCCCAGGCCGGTATAGATGACTACAATCACATTATTGGCGGCCATACAACCGAAGACGGACTGATA



CAAGCGTTTAATGTCGTATTGAACTTACGTCACCAAAAAGACCCTGGCTTTGAAAAAATTCAGTTCA



AACAGCTCTACAAACAAATCCTGAGCGTGCGTACCAGCAAAAGCTACATCCCGAAACAGTTTGACA



ACTCTAAGGAGATGGTTGACTGCATTTGCGATTATGTCAGCAAAATAGAGAAATCCGAAACAGTAG



AACGGGCCCTGAAACTAGTCCGTAATATCAGTTCTTTCGACTTGCGCGGGATCTTTGTCAATAAAAA



GAACTTGCGCATACTGAGCAACAAACTGATAGGAGATTGGGACGCGATCGAAACCGCATTGATGCA



TAGTTCTTCATCAGAAAACGATAAGAAAAGCGTATATGATAGCGCGGAGGCTTTTACGTTGGATGA



CATCTTTTCAAGCGTGAAAAAATTTTCTGATGCCTCTGCCGAAGATATTGGCAACAGGGCGGAAGA



CATCTGTAGAGTGATAAGTGAGACGGCCCCTTTTATCAACGATCTGCGAGCGGTGGACCTGGATAG



CCTGAACGACGATGGTTATGAAGCGGCCGTCTCAAAAATTCGGGAGTCGCTGGAGCCTTATATGGA



TCTTTTCCATGAACTGGAAATTTTCTCGGTTGGCGATGAGTTCCCAAAATGCGCAGCATTTTACAGC



GAACTGGAGGAAGTCAGCGAACAGCTGATCGAAATTATTCCGTTATTCAACAAGGCGCGTTCGTTC



TGCACCCGGAAACGCTATAGCACCGATAAGATTAAAGTGAACTTAAAATTCCCGACCTTGGCGGAC



GGGTGGGACCTGAACAAAGAGAGAGACAACAAAGCCGCGATTCTGCGGAAAGACGGTAAGTATTA



TCTGGCAATTCTGGATATGAAGAAAGATCTGTCAAGCATTAGGACCAGCGACGAAGATGAATCCAG



CTTCGAAAAGATGGAGTATAAACTGTTACCGAGTCCAGTAAAAATGCTGCCAAAGATATTCGTAAA



ATCGAAAGCCGCTAAGGAAAAATATGGCCTGACAGATCGTATGCTTGAATGCTACGATAAAGGTAT



GCATAAGTCGGGTAGTGCGTTTGATCTTGGCTTTTGCCATGAACTCATTGATTATTACAAGCGTTGT



ATCGCGGAGTACCCAGGCTGGGATGTGTTCGATTTCAAGTTTCGCGAAACTTCCGATTATGGGTCCA



TGAAAGAGTTCAATGAAGATGTGGCCGGAGCCGGTTACTATATGAGTCTGAGAAAAATTCCGTGCA



GCGAAGTGTACCGTCTGTTAGACGAGAAATCGATTTATCTATTTCAAATTTATAACAAAGATTACTC



TGAAAATGCACATGGTAATAAGAACATGCATACCATGTACTGGGAGGGTCTCTTTTCCCCGCAAAA



CCTGGAGTCGCCCGTTTTCAAGTTGTCGGGTGGGGCAGAACTTTTCTTTCGAAAATCCTCAATCCCT



AACGATGCCAAAACAGTACACCCGAAAGGCTCAGTGCTGGTTCCACGTAATGATGTTAACGGTCGG



CGTATTCCAGATTCAATCTACCGCGAACTGACACGCTATTTTAACCGTGGCGATTGCCGAATCAGTG



ACGAAGCCAAAAGTTATCTTGACAAGGTTAAGACTAAAAAAGCGGACCATGACATTGTGAAAGATC



GCCGCTTTACCGTGGATAAAATGATGTTCCACGTCCCGATTGCGATGAACTTTAAGGCGATCAGTAA



ACCGAACTTAAACAAAAAAGTCATTGATGGCATCATTGATGATCAGGATCTGAAAATCATTGGTAT



TGATCGTGGCGAGCGGAACTTAATTTACGTCACGATGGTTGACAGAAAAGGGAATATCTTATATCA



GGATTCTCTTAACATCCTCAATGGCTACGACTATCGTAAAGCTCTGGATGTGCGCGAATATGACAAC



AAGGAAGCGCGTCGTAACTGGACTAAAGTGGAGGGCATTCGCAAAATGAAGGAAGGCTATCTGTC



ATTAGCGGTCTCGAAATTAGCGGATATGATTATCGAAAATAACGCCATCATCGTTATGGAGGACCT



GAACCACGGATTCAAAGCGGGCCGCTCAAAGATTGAAAAACAAGTTTATCAGAAATTTGAGAGTAT



GCTGATTAACAAACTGGGCTATATGGTGTTAAAAGACAAGTCAATTGACCAATCAGGTGGCGCGCT



GCATGGATACCAGCTGGCGAACCATGTTACCACCTTAGCATCAGTTGGAAAGCAGTGTGGGGTTAT



CTTTTATATACCGGCAGCGTTCACTAGTAAAATAGATCCGACCACTGGTTTCGCCGATCTCTTTGCC



CTGAGTAACGTTAAAAACGTAGCGAGCATGCGTGAATTCTTTTCCAAAATGAAATCTGTCATTTATG



ATAAAGCTGAAGGCAAATTCGCATTCACCTTTGATTACTTGGATTACAACGTGAAGAGCGAATGTG



GTCGTACGCTGTGGACCGTTTACACCGTTGGTGAGCGCTTCACCTATTCCCGTGTGAACCGCGAATA



TGTACGTAAAGTCCCCACCGATATTATCTATGATGCCCTCCAGAAAGCAGGCATTAGCGTCGAAGG



AGACTTAAGGGACAGAATTGCCGAAAGCGATGGCGATACGCTGAAGTCTATTTTTTACGCATTCAA



ATACGCGCTAGATATGCGCGTTGAGAATCGCGAGGAAGACTACATTCAATCACCTGTGAAAAATGC



CTCTGGGGAATTTTTTTGTTCAAAAAATGCTGGTAAAAGCCTCCCACAAGATAGCGATGCAAACGG



TGCATATAACATTGCCCTGAAAGGTATTCTTCAATTACGCATGCTGTCTGAGCAGTACGACCCCAAC



GCGGAATCTATTAGACTTCCGCTGATAACCAATAAAGCCTGGCTGACATTCATGCAGTCTGGCATGA



AGACCTGGAAAAATTAGGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGG



GAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA





SEQ ID NO: 66
AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAAGCATTGATAATTG



AGATCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGACAAAAATAAATTATTTATTTATCCA



GAAAATGAATTGGAAAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAG



GTtgccgtcactgcgtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagc



attctgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataatcacggcag



aaaagtccacattgattatttgcacggcgtcacactttgctatgccatagcatttttatccataagat



tagcggatcctacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggct



agcaccgcctatctcgtgtgagataggcggagatacgaactttaagAAGGAGatataccatg



GATAGTTTGAAAGATTTCACCAATCTGTACCCTGTCAGTAAGACATTGAGATTTGAATTAAAGC



CCGTTGGAAAGACTTTAGAAAATATCGAGAAAGCAGGTATTTTGAAAGAGGATGAGCATCGTGCAG



AAAGTTATCGGAGGGTGAAGAAAATAATTGATACTTATCATAAGGTATTTATCGATTCTTCTCTTGA



AAATATGGCTAAAATGGGTATTGAGAATGAAATAAAAGCAATGCTCCAAAGTTTCTGCGAATTGTA



TAAAAAAGATCATCGCACTGAGGGTGAAGACAAGGCATTAGATAAAATTCGAGCAGTACTTCGTGG



CCTGATTGTTGGGGCTTTCACTGGTGTTTGCGGAAGACGGGAAAATACAGTCCAAAACGAGAAGTA



CGAGAGTTTGTTCAAAGAAAAGTTGATAAAAGAAATTTTACCTGATTTTGTGCTCTCTACTGAGGCT



GAAAGCTTGCCTTTCTCTGTTGAAGAAGCTACGAGGTCACTGAAGGAGTTTGATAGCTTTACATCCT



ACTTTGCTGGTTTTTACGAGAATAGAAAGAATATATACTCGACGAAACCTCAATCCACTGCCATTGC



TTATCGTCTTATTCATGAGAACTTGCCGAAGTTCATTGATAATATTCTTGTTTTTCAGAAGATCAAAG



AGCCTATAGCCAAAGAGCTGGAACATATTCGTGCGGACTTTTCTGCCGGGGGGTACATAAAAAAGG



ATGAGAGATTGGAGGATATTTTTTCGTTGAACTATTATATCCACGTGTTATCTCAGGCTGGGATCGA



AAAATATAACGCATTGATTGGGAAGATTGTGACAGAAGGAGATGGAGAGATGAAAGGGCTCAATG



AACACATCAACCTTTACAACCAACAAAGAGGCAGAGAGGATCGGCTCCCTCTTTTTAGGCCTCTTTA



TAAACAGATATTGAGTGACAGAGAGCAATTATCATACTTGCCTGAGAGTTTTGAAAAAGATGAGGA



GCTCCTCAGGGCTCTAAAAGAGTTCTATGATCATATCGCAGAAGACATTCTCGGACGTACTCAACA



GTTGATGACTTCTATTTCAGAATATGATTTATCTCGGATATACGTAAGGAACGATAGCCAATTGACT



GATATATCAAAAAAAATGTTGGGAGATTGGAATGCTATCTACATGGCTAGAGAACGAGCATATGAC



CACGAGCAGGCTCCCAAAAGAATCACGGCGAAATACGAGAGGGACAGGATTAAAGCTCTTAAAGG



AGAAGAGAGTATAAGTCTGGCAAATCTTAATAGTTGTATTGCCTTTCTGGACAATGTTAGAGATTGC



CGTGTAGATACTTATCTTTCCACACTGGGCCAGAAGGAAGGACCACATGGTCTATCTAATCTCGTTG



AGAACGTTTTTGCCTCATACCATGAAGCAGAGCAATTGTTGAGCTTTCCATACCCCGAAGAGAATA



ATCTGATTCAGGACAAGGACAATGTGGTGTTAATTAAGAATCTTCTCGACAATATCAGTGATCTGCA



GAGGTTCTTGAAACCTCTTTGGGGTATGGGAGACGAACCCGATAAAGATGAAAGATTTTATGGAGA



GTATAATTATATCCGAGGAGCTCTAGATCAGGTGATCCCTCTGTACAATAAGGTAAGGAACTACCTC



ACTCGGAAGCCTTATTCGACCAGAAAAGTAAAACTCAATTTTGGGAATTCTCAATTGCTTAGTGGTT



GGGATAGAAATAAGGAAAAGGATAATAGCTGTGTGATTTTGCGTAAGGGGCAGAACTTCTATTTGG



CTATTATGAACAATAGGCACAAAAGAAGTTTCGAAAACAAGGTGTTGCCCGAGTATAAGGAGGGA



GAACCTTACTTCGAAAAGATGGATTATAAATTTTTGCCTGATCCTAATAAAATGCTTCCTAAGGTTT



TTCTTTCGAAAAAAGGAATAGAGATATACAAACCAAGTCCGAAGCTTTTAGAACAATATGGACATG



GAACTCACAAAAAGGGAGATACCTTTAGTATGGATGATTTGCACGAACTGATCGATTTCTTCAAAC



ACTCAATCGAGGCTCATGAAGATTGGAAGCAATTCGGATTCAAATTTTCTGATACGGCTACTTATGA



GAATGTATCTAGTTTCTATAGAGAAGTTGAGGATCAGGGGTATAAGCTCTCTTTCCGAAAAGTTTCG



GAATCTTATGTCTATTCATTAATAGATCAAGGCAAGTTGTATTTATTTCAGATATACAACAAGGACT



TTTCTCCCTGCAGCAAAGGGACACCTAATCTGCATACCTTGTATTGGAGAATGCTTTTTGACGAGCG



CAATTTGGCAGATGTCATATACAAACTGGATGGGAAGGCTGAAATCTTTTTCCGAGAGAAGAGTTT



GAAAAATGATCATCCCACGCATCCGGCTGGTAAGCCTATCAAAAAGAAAAGTCGACAAAAAAAAG



GAGAGGAGAGTCTGTTTGAGTATGATTTAGTCAAGGATAGGCACTATACGATGGATAAGTTCCAGT



TTCATGTGCCTATTACTATGAATTTTAAATGTTCTGCAGGAAGCAAAGTCAATGATATGGTTAATGC



TCATATTCGAGAGGCAAAGGATATGCATGTCATTGGAATTGATCGTGGAGAACGCAATCTGCTGTA



TATATGCGTGATAGATAGTCGAGGGACGATTTTGGATCAAATTTCTCTGAATACGATTAACGATATA



GACTATCATGATTTATTGGAGAGTCGAGACAAAGACCGTCAGCAGGAGCGCCGAAACTGGCAAACT



ATCGAAGGGATCAAGGAGCTAAAACAAGGCTACCTTAGTCAGGCGGTTCATCGGATAGCCGAACTG



ATGGTGGCTTATAAGGCTGTAGTTGCTTTGGAGGATTTGAATATGGGGTTCAAACGTGGGCGGCAG



AAAGTAGAAAGTTCTGTTTATCAGCAGTTTGAGAAACAGCTGATAGATAAGCTCAACTATCTTGTG



GACAAGAAGAAAAGGCCTGAAGATATTGGAGGATTGTTGAGAGCCTATCAATTTACGGCCCCATTT



AAGAGTTTTAAGGAAATGGGAAAGCAAAACGGCTTCTTGTTTTATATCCCGGCTTGGAACACGAGC



AACATAGATCCGACTACTGGATTTGTTAATTTATTTCATGCCCAGTATGAAAATGTAGATAAAGCGA



AGAGCTTCTTTCAAAAGTTTGATTCAATTAGTTACAACCCGAAGAAAGACTGGTTTGAGTTTGCATT



CGATTATAAAAACTTTACTAAAAAGGCTGAAGGAAGTCGTTCTATGTGGATATTATGCACACATGG



TTCCCGAATAAAGAATTTTAGAAATTCCCAGAAGAATGGTCAATGGGATTCCGAAGAATTCGCCTT



GACGGAGGCTTTTAAGTCTCTTTTTGTGCGATATGAGATAGATTATACCGCTGATTTGAAAACAGCT



ATTGTGGACGAAAAGCAAAAAGACTTCTTCGTGGATCTTCTGAAGCTATTCAAATTGACAGTACAG



ATGCGCAACAGCTGGAAAGAGAAGGATTTGGATTATCTAATCTCTCCTGTAGCAGGGGCTGATGGC



CGTTTCTTCGATACAAGAGAGGGAAATAAAAGTCTGCCTAAGGATGCAGATGCCAATGGAGCTTAT



AATATTGCCCTAAAAGGACTTTGGGCTCTACGCCAGATTCGGCAAACTTCAGAAGGCGGTAAACTC



AAATTGGCGATTTCCAATAAGGAATGGCTACAGTTTGTGCAAGAGAGATCTTACGAGAAAGACtgaG



AAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATTTATTATATCGCGTTGATTATTGAT



GCTGTTTTTAGTTTTAACGGCAATTAATATATGTGTTATTAATTGAATGAATTTTATCATTCATAATA



AGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAG



GATTACAGAATTATCTCATAACAAGTGTTAAGGGATGTTATTTCC





SEQ ID NO: 67
AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAAGCATTGATAATTG



AGATCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGACAAAAATAAATTATTTATTTATCCA



GAAAATGAATTGGAAAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAG



GTtgccgtcactgcgtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagc



attctgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataatcacggcag



aaaagtccacattgattatttgcacggcgtcacactttgctatgccatagcatttttatccataagat



tagcggatcctacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggct



agcaccgcctatctcgtgtgagataggcggagatacgaactttaagAAGGAGatatacc



ATGAACAACGGCACAAATAATTTTCAGAACTTCATCGGGATCTCAAGTTTGCAGAAAACGCTGCG



CAATGCTCTGATCCCCACGGAAACCACGCAACAGTTCATCGTCAAGAACGGAATAATTAAAGAAGA



TGAGTTACGTGGCGAGAACCGCCAGATTCTGAAAGATATCATGGATGACTACTACCGCGGATTCAT



CTCTGAGACTCTGAGTTCTATTGATGACATAGATTGGACTAGCCTGTTCGAAAAAATGGAAATTCAG



CTGAAAAATGGTGATAATAAAGATACCTTAATTAAGGAACAGACAGAGTATCGGAAAGCAATCCAT



AAAAAATTTGCGAACGACGATCGGTTTAAGAACATGTTTAGCGCCAAACTGATTAGTGACATATTA



CCTGAATTTGTCATCCACAACAATAATTATTCGGCATCAGAGAAAGAGGAAAAAACCCAGGTGATA



AAATTGTTTTCGCGCTTTGCGACTAGCTTTAAAGATTACTTCAAGAACCGTGCAAATTGCTTTTCAG



CGGACGATATTTCATCAAGCAGCTGCCATCGCATCGTCAACGACAATGCAGAGATATTCTTTTCAAA



TGCGCTGGTCTACCGCCGGATCGTAAAATCGCTGAGCAATGACGATATCAACAAAATTTCGGGCGA



TATGAAAGATTCATTAAAAGAAATGAGTCTGGAAGAAATATATTCTTACGAGAAGTATGGGGAATT



TATTACCCAGGAAGGCATTAGCTTCTATAATGATATCTGTGGGAAAGTGAATTCTTTTATGAACCTG



TATTGTCAGAAAAATAAAGAAAACAAAAATTTATACAAACTTCAGAAACTTCACAAACAGATTCTA



TGCATTGCGGACACTAGCTATGAGGTCCCGTATAAATTTGAAAGTGACGAGGAAGTGTACCAATCA



GTTAACGGCTTCCTTGATAACATTAGCAGCAAACATATAGTCGAAAGATTACGCAAAATCGGCGAT



AACTATAACGGCTACAACCTGGATAAAATTTATATCGTGTCCAAATTTTACGAGAGCGTTAGCCAA



AAAACCTACCGCGACTGGGAAACAATTAATACCGCCCTCGAAATTCATTACAATAATATCTTGCCG



GGTAACGGTAAAAGTAAAGCCGACAAAGTAAAAAAAGCGGTTAAGAATGATTTACAGAAATCCAT



CACCGAAATAAATGAACTAGTGTCAAACTATAAGCTGTGCAGTGACGACAACATCAAAGCGGAGA



CTTATATACATGAGATTAGCCATATCTTGAATAACTTTGAAGCACAGGAATTGAAATACAATCCGG



AAATTCACCTAGTTGAATCCGAGCTCAAAGCGAGTGAGCTTAAAAACGTGCTGGACGTGATCATGA



ATGCGTTTCATTGGTGTTCGGTTTTTATGACTGAGGAACTTGTTGATAAAGACAACAATTTTTATGC



GGAACTGGAGGAGATTTACGATGAAATTTATCCAGTAATTAGTCTGTACAACCTGGTTCGTAACTAC



GTTACCCAGAAACCGTACAGCACGAAAAAGATTAAATTGAACTTTGGAATACCGACGTTAGCAGAC



GGTTGGTCAAAGTCCAAAGAGTATTCTAATAACGCTATCATACTGATGCGCGACAATCTGTATTATC



TGGGCATCTTTAATGCGAAGAATAAACCGGACAAGAAGATTATCGAGGGTAATACGTCAGAAAATA



AGGGTGACTACAAAAAGATGATTTATAATTTGCTCCCGGGTCCCAACAAAATGATCCCGAAAGTTT



TCTTGAGCAGCAAGACGGGGGTGGAAACGTATAAACCGAGCGCCTATATCCTAGAGGGGTATAAAC



AGAATAAACATATCAAGTCTTCAAAAGACTTTGATATCACTTTCTGTCATGATCTGATCGACTACTT



CAAAAACTGTATTGCAATTCATCCCGAGTGGAAAAACTTCGGTTTTGATTTTAGCGACACCAGTACT



TATGAAGACATTTCCGGGTTTTATCGTGAGGTAGAGTTACAAGGTTACAAGATTGATTGGACATACA



TTAGCGAAAAAGACATTGATCTGCTGCAGGAAAAAGGTCAACTGTATCTGTTCCAGATATATAACA



AAGATTTTTCGAAAAAATCAACCGGGAATGACAACCTTCACACCATGTACCTGAAAAATCTTTTCTC



AGAAGAAAATCTTAAGGATATCGTCCTGAAACTTAACGGCGAAGCGGAAATCTTCTTCAGGAAGAG



CAGCATAAAGAACCCAATCATTCATAAAAAAGGCTCGATTTTAGTCAACCGTACCTACGAAGCAGA



AGAAAAAGACCAGTTTGGCAACATTCAAATTGTGCGTAAAAATATTCCGGAAAACATTTATCAGGA



GCTGTACAAATACTTCAACGATAAAAGCGACAAAGAGCTGTCTGATGAAGCAGCCAAACTGAAGA



ATGTAGTGGGACACCACGAGGCAGCGACGAATATAGTCAAGGACTATCGCTACACGTATGATAAAT



ACTTCCTTCATATGCCTATTACGATCAATTTCAAAGCCAATAAAACGGGTTTTATTAATGATAGGAT



CTTACAGTATATCGCTAAAGAAAAAGACTTACATGTGATCGGCATTGATCGGGGCGAGCGTAACCT



GATCTACGTGTCCGTGATTGATACTTGTGGTAATATAGTTGAACAGAAAAGCTTTAACATTGTAAAC



GGCTACGACTATCAGATAAAACTGAAACAACAGGAGGGCGCTAGACAGATTGCGCGGAAAGAATG



GAAAGAAATTGGTAAAATTAAAGAGATCAAAGAGGGCTACCTGAGCTTAGTAATCCACGAGATCTC



TAAAATGGTAATCAAATACAATGCAATTATAGCGATGGAGGATTTGTCTTATGGTTTTAAAAAAGG



GCGCTTTAAGGTCGAACGGCAAGTTTACCAGAAATTTGAAACCATGCTCATCAATAAACTCAACTA



TCTGGTATTTAAAGATATTTCGATTACCGAGAATGGCGGTCTCCTGAAAGGTTATCAGCTGACATAC



ATTCCTGATAAACTTAAAAACGTGGGTCATCAGTGCGGCTGCATTTTTTATGTGCCTGCTGCATACA



CGAGCAAAATTGATCCGACCACCGGCTTTGTGAATATCTTTAAATTTAAAGACCTGACAGTGGACG



CAAAACGTGAATTCATTAAAAAATTTGACTCAATTCGTTATGACAGTGAAAAAAATCTGTTCTGCTT



TACATTTGACTACAATAACTTTATTACGCAAAACACGGTCATGAGCAAATCATCGTGGAGTGTGTAT



ACATACGGCGTGCGCATCAAACGTCGCTTTGTGAACGGCCGCTTCTCAAACGAAAGTGATACCATT



GACATAACCAAAGATATGGAGAAAACGTTGGAAATGACGGACATTAACTGGCGCGATGGCCACGA



TCTTCGTCAAGACATTATAGATTATGAAATTGTTCAGCACATATTCGAAATTTTCCGTTTAACAGTG



CAAATGCGTAACTCCTTGTCTGAACTGGAGGACCGTGATTACGATCGTCTCATTTCACCTGTACTGA



ACGAAAATAACATTTTTTATGACAGCGCGAAAGCGGGGGATGCACTTCCTAAGGATGCCGATGCAA



ATGGTGCGTATTGTATTGCATTAAAAGGGTTATATGAAATTAAACAAATTACCGAAAATTGGAAAG



AAGATGGTAAATTTTCGCGCGATAAACTCAAAATCAGCAATAAAGATTGGTTCGACTTTATCCAGA



ATAAGCGCTATCTCTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATTTATTA



TATCGCGTTGATTATTGATGCTGTTTTTAGTTTTAACGGCAATTAATATATGTGTTATTAATTGAATG



AATTTTATCATTCATAATAAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTAT



TACTCAGGAAGCAAAGAGGATTACAGAATTATCTCATAACAAGTGTTAAGGGATGTTATTTCC





SEQ ID NO: 68
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGC



GTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAA



GCGGGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCAC



ATTGATTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCC



TACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATA



CGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATAT



ACCATGCACCATCATCATCACCATACCAATAAATTCACTAACCAGTATTCTCTCTCTAAGACCCTGC



GCTTTGAACTGATTCCGCAGGGGAAAACCTTGGAGTTCATTCAAGAAAAAGGCCTCTTGTCTCAGG



ATAAACAGAGGGCTGAATCTTACCAAGAAATGAAGAAAACTATTGATAAGTTTCATAAATATTTCA



TTGATTTAGCCTTGTCTAACGCCAAATTAACTCACTTGGAAACGTATCTGGAGTTATACAACAAATC



TGCCGAAACTAAGAAAGAACAGAAATTTAAAGACGATTTGAAAAAAGTACAGGACAATCTGCGTA



AAGAAATTGTCAAATCCTTCAGTGACGGCGATGCTAAAAGCATTTTTGCCATTCTGGACAAAAAAG



AGTTGATTACTGTGGAATTAGAAAAGTGGTTTGAAAACAATGAGCAGAAAGACATCTACTTCGATG



AGAAATTCAAAACTTTCACCACCTATTTTACAGGATTTCATCAAAACCGGAAGAACATGTACTCAGT



AGAACCGAACTCCACGGCCATTGCGTATCGTTTGATCCATGAGAATCTGCCTAAATTTCTGGAGAAT



GCGAAAGCCTTTGAAAAGATTAAGCAGGTCGAATCGCTGCAAGTGAATTTTCGTGAACTCATGGGC



GAATTTGGTGACGAAGGTCTAATCTTCGTTAACGAACTGGAAGAAATGTTTCAGATTAATTACTACA



ATGACGTGCTATCGCAGAACGGTATCACAATCTACAATAGTATTATCTCAGGGTTCACAAAAAACG



ATATAAAATACAAAGGCCTGAACGAGTATATCAATAACTACAACCAAACAAAGGACAAAAAGGAT



AGGCTTCCGAAACTGAAGCAGTTATACAAACAGATTTTATCTGACAGAATCTCCCTGAGCTTTCTGC



CGGATGCTTTCACTGATGGGAAGCAGGTTCTGAAAGCGATTTTCGATTTTTATAAGATTAACTTACT



GAGCTACACGATTGAAGGTCAAGAAGAATCTCAAAACTTACTGCTCTTGATCCGTCAAACCATTGA



AAATCTATCATCGTTCGATACGCAGAAAATCTACCTCAAAAACGATACTCACCTGACTACGATCTCT



CAGCAGGTTTTCGGGGATTTTAGTGTATTTTCAACAGCTCTGAACTACTGGTATGAAACCAAAGTCA



ATCCGAAATTCGAGACGGAATATTCTAAGGCCAACGAAAAAAAACGTGAGATTCTTGATAAAGCTA



AAGCCGTATTTACTAAACAGGATTACTTTTCTATTGCTTTCCTGCAGGAAGTTTTATCGGAGTATATC



CTGACCCTGGATCATACATCTGATATCGTTAAAAAACACAGCAGCAATTGCATCGCTGACTATTTCA



AAAACCACTTTGTCGCCAAAAAAGAAAACGAAACAGACAAGACTTTCGATTTCATTGCTAACATCA



CCGCAAAATACCAGTGTATTCAGGGTATCTTGGAAAACGCCGACCAATACGAAGACGAACTGAAAC



AAGATCAGAAGCTGATCGATAATTTAAAATTCTTCTTAGATGCAATCCTGGAGCTGCTGCACTTCAT



CAAACCGCTTCATTTAAAGAGCGAGTCCATTACCGAAAAGGACACCGCCTTCTATGACGTTTTTGAA



AATTATTATGAAGCCCTCTCCTTGCTGACTCCGCTGTATAATATGGTACGCAATTACGTAACCCAGA



AACCATATTCTACCGAAAAAATTAAACTGAACTTTGAAAACGCACAGCTGCTCAACGGTTGGGACG



CGAATAAAGAAGGTGACTACCTCACCACCATCCTGAAAAAAGATGGTAACTATTTTCTGGCAATTA



TGGATAAGAAACATAATAAAGCATTCCAGAAATTTCCTGAAGGGAAAGAAAATTACGAAAAGATG



GTGTACAAACTCTTACCTGGAGTTAACAAAATGTTGCCGAAAGTATTTTTTAGTAATAAGAACATCG



CGTACTTTAACCCGTCCAAAGAACTGCTGGAAAATTATAAAAAGGAGACGCATAAGAAAGGGGAT



ACCTTTAACCTGGAACATTGCCATACCTTAATAGACTTCTTCAAGGATTCCCTGAATAAACACGAGG



ATTGGAAATATTTCGATTTTCAGTTTAGTGAGACCAAGTCATACCAGGATCTTAGCGGCTTTTATCG



CGAAGTAGAACACCAAGGCTATAAAATTAACTTCAAAAACATCGACAGCGAATACATCGACGGTTT



AGTTAACGAGGGCAAACTGTTTCTGTTCCAGATCTATTCAAAGGATTTTAGCCCGTTCTCTAAAGGC



AAACCAAATATGCATACGTTGTACTGGAAAGCACTGTTTGAAGAGCAAAACCTGCAGAATGTGATT



TATAAACTGAACGGCCAAGCTGAGATTTTTTTCCGTAAAGCCTCGATTAAACCGAAAAATATCATCC



TTCATAAGAAGAAAATAAAGATCGCTAAAAAACACTTCATAGATAAAAAAACCAAAACCTCCGAA



ATAGTGCCTGTTCAAACAATTAAGAACTTGAATATGTACTACCAGGGCAAGATATCGGAAAAGGAG



TTGACTCAAGACGATCTTCGCTATATCGATAACTTTTCGATTTTTAACGAAAAAAACAAGACGATCG



ACATCATCAAAGATAAACGCTTCACTGTAGATAAGTTCCAGTTTCATGTGCCGATTACTATGAACTT



CAAAGCTACCGGGGGTAGCTATATCAACCAAACGGTGTTGGAATACCTGCAGAATAACCCGGAAGT



CAAAATCATTGGGCTGGACCGCGGAGAACGTCACCTTGTGTACTTGACCTTAATCGATCAGCAAGG



CAACATCTTAAAACAAGAATCGCTGAATACCATTACGGATTCAAAGATTAGCACCCCGTATCATAA



GCTGCTCGATAACAAGGAGAATGAGCGCGACCTGGCCCGTAAAAACTGGGGCACGGTGGAAAACA



TTAAGGAGTTAAAGGAGGGTTATATTTCCCAGGTAGTGCATAAGATCGCCACTCTCATGCTCGAGG



AAAATGCGATCGTTGTCATGGAAGACTTAAACTTCGGATTTAAACGTGGGCGATTTAAAGTAGAGA



AACAAATCTACCAGAAGTTAGAAAAAATGCTGATTGACAAATTAAATTACTTGGTCCTAAAAGACA



AACAGCCGCAAGAATTGGGTGGATTATACAACGCCCTCCAACTTACCAATAAATTCGAAAGTTTTC



AGAAAATGGGTAAACAGTCAGGCTTTCTTTTTTATGTTCCTGCGTGGAACACATCCAAAATCGACCC



TACAACCGGCTTCGTCAATTACTTCTATACTAAATATGAAAACGTCGACAAAGCAAAAGCATTCTTT



GAAAAGTTCGAAGCAATACGTTTTAACGCTGAGAAAAAATATTTCGAGTTCGAAGTCAAGAAATAC



TCAGACTTTAACCCCAAAGCTGAGGGCACACAGCAAGCGTGGACAATCTGCACCTACGGCGAGCGC



ATCGAAACGAAGCGTCAAAAAGATCAGAATAACAAATTTGTTTCAACACCTATCAACCTGACCGAG



AAGATTGAAGACTTCTTAGGTAAAAATCAGATTGTTTATGGCGACGGTAACTGTATAAAATCTCAA



ATAGCCTCAAAGGATGATAAAGCATTTTTCGAAACATTATTATATTGGTTCAAAATGACACTGCAGA



TGCGCAATAGTGAGACGCGTACAGATATTGATTATCTTATCAGCCCGGTCATGAACGACAACGGTA



CTTTTTACAACTCCAGAGACTATGAAAAACTTGAGAATCCAACTCTCCCCAAAGATGCTGATGCGA



ACGGTGCTTATCACATCGCGAAAAAAGGTCTGATGCTGCTGAACAAAATCGACCAAGCCGATCTGA



CTAAGAAAGTTGACCTAAGCATTTCAAATCGGGACTGGTTACAGTTTGTTCAAAAGAACAAATGAG



AAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATA



TTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA





SEQ ID NO: 69
AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAAGCATTGATAATTG



AGATCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGACAAAAATAAATTATTTATTTATCCA



GAAAATGAATTGGAAAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAG



GTtgccgtcactgcgtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagc



attctgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataatcacggcag



aaaagtccacattgattatttgcacggcgtcacactttgctatgccatagcatttttatccataagat



tagcggatcctacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggct



agcaccgcctatctcgtgtgagataggcggagatacgaactttaagAAGGAGatatacc



ATGGAACAGGAATATTATCTGGGCTTGGACATGGGCACCGGTTCCGTCGGCTGGGCTGTTACTGA



CAGTGAATATCACGTTCTAAGAAAGCATGGTAAGGCATTGTGGGGTGTAAGACTTTTCGAATCTGCT



TCCACTGCTGAAGAGCGTAGAATGTTTAGAACGAGTCGACGTAGGCTAGACAGGCGCAATTGGAGA



ATCGAAATTTTACAAGAAATTTTTGCGGAAGAGATATCTAAGAAAGACCCAGGCTTTTTCCTGAGA



ATGAAGGAATCTAAGTATTACCCTGAGGATAAAAGAGATATAAATGGTAACTGTCCCGAATTGCCT



TACGCATTATTTGTGGACGATGATTTTACCGATAAGGATTACCATAAAAAGTTCCCAACTATCTACC



ATTTACGCAAAATGTTAATGAATACAGAGGAAACCCCAGACATAAGACTAGTTTATCTGGCAATAC



ACCATATGATGAAACATAGAGGCCATTTCTTACTTTCCGGGGATATCAACGAAATCAAAGAGTTTG



GTACCACATTTAGTAAGTTACTGGAAAACATAAAGAATGAAGAATTGGATTGGAACTTAGAACTCG



GAAAAGAAGAATACGCGGTTGTCGAATCTATCCTGAAGGATAATATGCTGAATAGGTCGACCAAAA



AAACTAGGCTGATCAAAGCACTGAAAGCCAAATCTATCTGCGAAAAAGCTGTTTTAAATTTACTTG



CTGGTGGCACTGTTAAGTTATCAGACATTTTTGGTTTGGAAGAATTGAACGAAACCGAGCGTCCAA



AAATTAGTTTCGCTGATAATGGCTACGATGATTACATTGGTGAGGTGGAAAACGAGTTGGGCGAAC



AATTTTATATTATAGAGACAGCTAAGGCAGTCTATGACTGGGCTGTTTTAGTAGAAATCCTTGGTAA



ATACACATCTATCTCCGAAGCGAAAGTTGCTACTTACGAAAAGCACAAGTCCGATCTCCAGTTTTTG



AAGAAAATTGTCAGGAAATATCTGACTAAGGAAGAATATAAAGATATTTTCGTTAGTACCTCTGAC



AAACTGAAAAATTACTCCGCTTACATCGGGATGACCAAGATTAATGGCAAAAAAGTTGATCTGCAA



AGCAAAAGGTGTTCGAAGGAAGAATTTTATGATTTCATTAAAAAGAATGTCTTAAAAAAATTAGAA



GGTCAGCCAGAATACGAATATTTGAAAGAAGAACTGGAAAGAGAGACATTCTTACCAAAACAAGT



CAACAGAGATAATGGGGTAATTCCATATCAAATTCACCTCTACGAATTAAAAAAAATTTTAGGCAA



TTTACGCGATAAAATTGACCTTATCAAAGAAAATGAGGATAAGCTGGTTCAACTCTTTGAATTCAGA



ATACCCTATTATGTGGGCCCACTGAACAAGATTGATGACGGCAAAGAAGGTAAATTCACATGGGCC



GTCCGCAAATCCAATGAAAAAATTTACCCATGGAACTTTGAAAATGTAGTAGATATTGAAGCGTCT



GCGGAGAAATTTATTCGAAGAATGACTAATAAATGCACTTACTTGATGGGAGAGGATGTTCTGCCT



AAAGACAGCTTATTATACAGCAAGTACATGGTTCTAAACGAACTTAACAACGTTAAGTTGGACGGT



GAGAAATTAAGTGTAGAATTGAAACAAAGATTGTATACTGACGTCTTCTGCAAGTACAGAAAAGTG



ACAGTTAAAAAAATTAAGAATTACTTGAAGTGCGAAGGTATAATTTCTGGAAACGTAGAGATTACT



GGTATTGATGGTGATTTCAAAGCATCCCTAACAGCTTACCACGATTTCAAGGAAATCCTGACAGGA



ACTGAACTCGCAAAAAAAGATAAAGAAAACATTATTACTAATATTGTTCTTTTCGGTGATGACAAG



AAATTGTTGAAGAAAAGACTGAATAGACTTTACCCCCAGATTACTCCCAATCAACTTAAGAAAATT



TGTGCTTTGTCTTACACAGGATGGGGTCGTTTTTCAAAAAAGTTCTTAGAAGAGATTACCGCACCTG



ATCCAGAAACAGGCGAAGTATGGAATATAATTACCGCCTTATGGGAATCGAACAATAATCTTATGC



AACTTCTGAGCAATGAATATCGTTTCATGGAAGAAGTTGAGACTTACAACATGGGCAAACAGACGA



AGACTTTATCCTATGAAACTGTGGAAAATATGTATGTATCACCTTCTGTCAAGAGACAAATTTGGCA



AACCTTAAAAATTGTCAAAGAATTAGAAAAGGTAATGAAGGAGTCTCCTAAACGTGTGTTTATTGA



AATGGCTAGAGAAAAACAAGAGTCAAAAAGAACCGAGTCAAGAAAGAAGCAGTTAATCGATTTAT



ATAAGGCTTGTAAAAACGAAGAGAAAGATTGGGTTAAAGAATTGGGGGACCAAGAGGAACAAAAA



CTACGGTCGGATAAGTTGTATTTATACTATACGCAAAAGGGACGATGTATGTATTCCGGCGAGGTA



ATAGAATTGAAGGATTTATGGGACAATACAAAATATGACATAGACCATATATATCCCCAATCAAAA



ACGATGGACGATAGCTTGAACAATAGAGTACTCGTGAAAAAAAAATATAATGCGACCAAATCTGAT



AAGTATCCTCTGAATGAAAATATCAGACATGAAAGAAAGGGGTTCTGGAAGTCCTTGTTAGATGGT



GGGTTTATAAGCAAAGAAAAGTACGAGCGTCTAATAAGAAACACGGAGTTATCGCCAGAAGAACT



CGCTGGTTTTATTGAGAGGCAAATCGTGGAAACGAGACAATCTACCAAAGCCGTTGCTGAGATCCT



AAAGCAAGTTTTCCCAGAGTCGGAGATTGTCTATGTCAAAGCTGGCACAGTGAGCAGGTTTAGGAA



AGACTTCGAACTATTAAAGGTAAGAGAAGTGAACGATTTACATCACGCAAAGGACGCTTACCTAAA



TATCGTTGTAGGTAACTCATATTATGTTAAATTTACCAAGAACGCCTCTTGGTTTATAAAGGAGAAC



CCAGGTAGAACATATAACCTGAAAAAGATGTTCACCTCTGGTTGGAATATTGAGAGAAACGGAGAA



GTCGCATGGGAAGTTGGTAAGAAAGGGACTATAGTGACAGTAAAGCAAATTATGAACAAAAATAA



TATCCTCGTTACAAGGCAGGTTCATGAAGCAAAGGGCGGCCTTTTTGACCAACAAATTATGAAGAA



AGGGAAAGGTCAAATTGCAATAAAAGAAACCGATGAGAGACTAGCGTCAATAGAAAAGTATGGTG



GCTATAATAAAGCTGCGGGTGCATACTTTATGCTTGTTGAATCAAAAGACAAGAAAGGTAAGACTA



TTAGAACTATAGAATTTATACCCCTGTACCTTAAAAACAAAATTGAATCGGATGAGTCAATCGCGTT



AAATTTTCTAGAGAAAGGAAGGGGTTTAAAAGAACCAAAGATCCTGTTAAAAAAGATTAAGATTGA



CACCTTGTTCGATGTAGATGGATTTAAAATGTGGTTATCTGGCAGAACAGGCGATAGACTTTTGTTT



AAGTGCGCTAATCAATTAATTTTGGATGAGAAAATCATTGTCACAATGAAAAAAATAGTTAAGTTT



ATTCAGAGAAGACAAGAAAACAGGGAGTTGAAATTATCTGATAAAGATGGTATCGACAATGAAGT



TTTAATGGAAATCTACAATACATTCGTTGATAAACTTGAAAATACCGTATATCGAATCAGGTTAAGT



GAACAAGCCAAAACATTAATTGATAAACAAAAAGAATTTGAAAGGCTATCACTGGAAGACAAATC



CTCCACCCTATTTGAAATTTTGCATATATTCCAGTGCCAATCTTCAGCAGCTAATTTAAAAATGATTG



GCGGACCTGGGAAAGCCGGCATCCTAGTGATGAACAATAATATCTCCAAGTGTAACAAAATATCAA



TTATTAACCAATCTCCGACAGGTATTTTTGAAAATGAAATAGACTTGCTTAAGATATAAGAAATCAT



CCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATTTATTATATCGCGTTGATTATTGATGCTGTTT



TTAGTTTTAACGGCAATTAATATATGTGTTATTAATTGAATGAATTTTATCATTCATAATAAGTATGT



GTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA



GAATTATCTCATAACAAGTGTTAAGGGATGTTATTTCC





SEQ ID NO: 70
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGC



GTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAA



GCGGGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCAC



ATTGATTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCC



TACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATA



CGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATAT



ACCATGCACCATCATCATCACCATTCTTTCGACTCTTTCACCAACCTGTACTCTCTGTCTAAAACCCT



GAAATTCGAAATGCGTCCGGTTGGTAACACCCAGAAAATGCTGGACAACGCGGGTGTTTTCGAAAA



AGACAAACTGATCCAGAAAAAATACGGTAAAACCAAACCGTACTTCGACCGTCTGCACCGTGAATT



CATCGAAGAAGCGCTGACCGGTGTTGAACTGATCGGTCTGGACGAAAACTTCCGTACCCTGGTTGA



CTGGCAGAAAGACAAAAAAAACAACGTTGCGATGAAAGCGTACGAAAACTCTCTGCAGCGTCTGC



GTACCGAAATCGGTAAAATCTTCAACCTGAAAGCGGAAGACTGGGTTAAAAACAAATACCCGATCC



TGGGTCTGAAAAACAAAAACACCGACATCCTGTTCGAAGAAGCGGTTTTCGGTATCCTGAAAGCGC



GTTACGGTGAAGAAAAAGACACCTTCATCGAAGTTGAAGAAATCGACAAAACCGGTAAATCTAAA



ATCAACCAGATCTCTATCTTCGACTCTTGGAAAGGTTTCACCGGTTACTTCAAAAAATTCTTCGAAA



CCCGTAAAAACTTCTACAAAAACGACGGTACCTCTACCGCGATCGCGACCCGTATCATCGACCAGA



ACCTGAAACGTTTCATCGACAACCTGTCTATCGTTGAATCTGTTCGTCAGAAAGTTGACCTGGCGGA



AACCGAAAAATCTTTCTCTATCTCTCTGTCTCAGTTCTTCTCTATCGACTTCTACAACAAATGCCTGC



TGCAGGACGGTATCGACTACTACAACAAAATCATCGGTGGTGAAACCCTGAAAAACGGTGAAAAA



CTGATCGGTCTGAACGAACTGATCAACCAGTACCGTCAGAACAACAAAGACCAGAAAATCCCGTTC



TTCAAACTGCTGGACAAACAGATCCTGTCTGAAAAAATCCTGTTCCTGGACGAAATCAAAAACGAC



ACCGAACTGATCGAAGCGCTGTCTCAGTTCGCGAAAACCGCGGAAGAAAAAACCAAAATCGTTAA



AAAACTGTTCGCGGACTTCGTTGAAAACAACTCTAAATACGACCTGGCGCAGATCTACATCTCTCAG



GAAGCGTTCAACACCATCTCTAACAAATGGACCTCTGAAACCGAAACCTTCGCGAAATACCTGTTC



GAAGCGATGAAATCTGGTAAACTGGCGAAATACGAAAAAAAAGACAACTCTTACAAATTCCCGGA



CTTCATCGCGCTGTCTCAGATGAAATCTGCGCTGCTGTCTATCTCTCTGGAAGGTCACTTCTGGAAA



GAAAAATACTACAAAATCTCTAAATTCCAGGAAAAAACCAACTGGGAACAGTTCCTGGCGATCTTC



CTGTACGAATTCAACTCTCTGTTCTCTGACAAAATCAACACCAAAGACGGTGAAACCAAACAGGTT



GGTTACTACCTGTTCGCGAAAGACCTGCACAACCTGATCCTGTCTGAACAGATCGACATCCCGAAA



GACTCTAAAGTTACCATCAAAGACTTCGCGGACTCTGTTCTGACCATCTACCAGATGGCGAAATACT



TCGCGGTTGAAAAAAAACGTGCGTGGCTGGCGGAATACGAACTGGACTCTTTCTACACCCAGCCGG



ACACCGGTTACCTGCAGTTCTACGACAACGCGTACGAAGACATCGTTCAGGTTTACAACAAACTGC



GTAACTACCTGACCAAAAAACCGTACTCTGAAGAAAAATGGAAACTGAACTTCGAAAACTCTACCC



TGGCGAACGGTTGGGACAAAAACAAAGAATCTGACAACTCTGCGGTTATCCTGCAGAAAGGTGGTA



AATACTACCTGGGTCTGATCACCAAAGGTCACAACAAAATCTTCGACGACCGTTTCCAGGAAAAAT



TCATCGTTGGTATCGAAGGTGGTAAATACGAAAAAATCGTTTACAAATTCTTCCCGGACCAGGCGA



AAATGTTCCCGAAAGTTTGCTTCTCTGCGAAAGGTCTGGAATTCTTCCGTCCGTCTGAAGAAATCCT



GCGTATCTACAACAACGCGGAATTCAAAAAAGGTGAAACCTACTCTATCGACTCTATGCAGAAACT



GATCGACTTCTACAAAGACTGCCTGACCAAATACGAAGGTTGGGCGTGCTACACCTTCCGTCACCTG



AAACCGACCGAAGAATACCAGAACAACATCGGTGAATTCTTCCGTGACGTTGCGGAAGACGGTTAC



CGTATCGACTTCCAGGGTATCTCTGACCAGTACATCCACGAAAAAAACGAAAAAGGTGAACTGCAC



CTGTTCGAAATCCACAACAAAGACTGGAACCTGGACAAAGCGCGTGACGGTAAATCTAAAACCACC



CAGAAAAACCTGCACACCCTGTACTTCGAATCTCTGTTCTCTAACGACAACGTTGTTCAGAACTTCC



CGATCAAACTGAACGGTCAGGCGGAAATCTTCTACCGTCCGAAAACCGAAAAAGACAAACTGGAA



TCTAAAAAAGACAAAAAAGGTAACAAAGTTATCGACCACAAACGTTACTCTGAAAACAAAATCTTC



TTCCACGTTCCGCTGACCCTGAACCGTACCAAAAACGACTCTTACCGTTTCAACGCGCAGATCAACA



ACTTCCTGGCGAACAACAAAGACATCAACATCATCGGTGTTGACCGTGGTGAAAAACACCTGGTTT



ACTACTCTGTTATCACCCAGGCGTCTGACATCCTGGAATCTGGTTCTCTGAACGAACTGAACGGTGT



TAACTACGCGGAAAAACTGGGTAAAAAAGCGGAAAACCGTGAACAGGCGCGTCGTGACTGGCAGG



ACGTTCAGGGTATCAAAGACCTGAAAAAAGGTTACATCTCTCAGGTTGTTCGTAAACTGGCGGACC



TGGCGATCAAACACAACGCGATCATCATCCTGGAAGACCTGAACATGCGTTTCAAACAGGTTCGTG



GTGGTATCGAAAAATCTATCTACCAGCAGCTGGAAAAAGCGCTGATCGACAAACTGTCTTTCCTGG



TTGACAAAGGTGAAAAAAACCCGGAACAGGCGGGTCACCTGCTGAAAGCGTACCAGCTGTCTGCGC



CGTTCGAAACCTTCCAGAAAATGGGTAAACAGACCGGTATCATCTTCTACACCCAGGCGTCTTACAC



CTCTAAATCTGACCCGGTTACCGGTTGGCGTCCGCACCTGTACCTGAAATACTTCTCTGCGAAAAAA



GCGAAAGACGACATCGCGAAATTCACCAAAATCGAATTCGTTAACGACCGTTTCGAACTGACCTAC



GACATCAAAGACTTCCAGCAGGCGAAAGAATACCCGAACAAAACCGTTTGGAAAGTTTGCTCTAAC



GTTGAACGTTTCCGTTGGGACAAAAACCTGAACCAGAACAAAGGTGGTTACACCCACTACACCAAC



ATCACCGAAAACATCCAGGAACTGTTCACCAAATACGGTATCGACATCACCAAAGACCTGCTGACC



CAGATCTCTACCATCGACGAAAAACAGAACACCTCTTTCTTCCGTGACTTCATCTTCTACTTCAACCT



GATCTGCCAGATCCGTAACACCGACGACTCTGAAATCGCGAAAAAAAACGGTAAAGACGACTTCAT



CCTGTCTCCGGTTGAACCGTTCTTCGACTCTCGTAAAGACAACGGTAACAAACTGCCGGAAAACGG



TGACGACAACGGTGCGTACAACATCGCGCGTAAAGGTATCGTTATCCTGAACAAAATCTCTCAGTA



CTCTGAAAAAAACGAAAACTGCGAAAAAATGAAATGGGGTGACCTGTACGTTTCTAACATCGACTG



GGACAACTTCGTTGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGAC



CCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA





SEQ ID NO: 71
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGC



GTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAA



GCGGGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCAC



ATTGATTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCC



TACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATA



CGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATAT



ACCATGCACCATCATCATCACCATAACAAATTCGAAAACTTCACCGGTCTGTACCCGATCTCTAAAA



CCCTGCGTTTCGAACTGATCCCGCAGGGTAAAACCCTGGAATACATCGAAAAATCTGAAATCCTGG



AAAACGACAACTACCGTGCGGAAAAATACGAAGAAGTTAAAGACATCATCGACGGTTACCACAAA



TGGTTCATCAACGAAACCCTGCACGACCTGCACATCAACTGGTCTGAACTGAAAGTTGCGCTGGAA



AACAACCGTATCGAAAAATCTGACGCGTCTAAAAAAGAACTGCAGCGTGTTCAGAAAATCAAACGT



GAAGAAATCTACAACGCGTTCATCGAACACGAAGCGTTCCAGTACCTGTTCAAAGAAAACCTGCTG



TCTGACCTGCTGCCGATCCAGATCGAACAGTCTGAAGACCTGGACGCGGAAAAAAAAAAACAGGC



GGTTGAAACCTTCAACCGTTTCTCTACCTACTTCACCGGTTTCCACGAAAACCGTAAAAACATCTAC



TCTAAAGAAGGTATCTCTACCTCTGTTACCTACCGTATCGTTCACGACAACTTCCCGAAATTCCTGG



AAAACATGAAAGTTTTCGAAATCCTGCGTAACGAATGCCCGGAAGTTATCTCTGACACCGCGAACG



AACTGGCGCCGTTCATCGACGGTGTTCGTATCGAAGACATCTTCCTGATCGACTTCTTCAACTCTAC



CTTCTCTCAGAACGGTATCGACTACTACAACCGTATCCTGGGTGGTGTTACCACCGAAACCGGTGAA



AAATACCGTGGTATCAACGAATTCACCAACCTGTACCGTCAGCAGCACCCGGAATTCGGTAAATCT



AAAAAAGCGACCAAAATGGTTGTTCTGTTCAAACAGATCCTGTCTGACCGTGACACCCTGTCTTTCA



TCCCGGAAATGTTCGGTAACGACAAACAGGTTCAGAACTCTATCCAGCTGTTCTACAACCGTGAAA



TCTCTCAGTTCGAAAACGAAGGTGTTAAAACCGACGTTTGCACCGCGCTGGCGACCCTGACCTCTAA



AATCGCGGAATTCGACACCGAAAAAATCTACATCCAGCAGCCGGAACTGCCGAACGTTTCTCAGCG



TCTGTTCGGTTCTTGGAACGAACTGAACGCGTGCCTGTTCAAATACGCGGAACTGAAATTCGGTACC



GCGGAAAAAGTTGCGAACCGTAAAAAAATCGACAAATGGCTGAAATCTGACCTGTTCTCTTTCACC



GAACTGAACAAAGCGCTGGAATTCTCTGGTAAAGACGAACGTATCGAAAACTACTTCTCTGAAACC



GGTATCTTCGCGCAGCTGGTTAAAACCGGTTTCGACGAAGCGCAGTCTATCCTGGAAACCGAATAC



ACCTCTGAAGTTCACCTGAAAGACCAGCAGACCGACATCGAAAAAATCAAAACCTTCCTGGACGCG



CTGCAGAACCTGATGCACCTGCTGAAATCTCTGTGCGTTTCTGAAGAAGCGGACCGTGACGCGGCG



TTCTACAACGAATTCGACATGCTGTACAACCAGCTGAAACTGGTTGTTCCGCTGTACAACAAAGTTC



GTAACTACATCACCCAGAAACTGTTCCGTTCTGACAAAATCAAAATCTACTTCGAAAACAAAGGTC



AGTTCCTGGGTGGTTGGGTTGACTCTCAGACCGAAAACTCTGACAACGGTACCCAGGCGGGTGGTT



ACATCTTCCGTAAAGAAAACGTTATCAACGAATACGACTACTACCTGGGTATCTGCTCTGACCCGAA



ACTGTTCCGTCGTACCACCATCGTTTCTGAAAACGACCGTTCTTCTTTCGAACGTCTGGACTACTACC



AGCTGAAAACCGCGTCTGTTTACGGTAACTCTTACTGCGGTAAACACCCGTACACCGAAGACAAAA



ACGAACTGGTTAACTCTATCGACCGTTTCGTTCACCTGTCTGGTAACAACATCCTGATCGAAAAAAT



CGCGAAAGACAAAGTTAAATCTAACCCGACCACCAACACCCCGTCTGGTTACCTGAACTTCATCCA



CCGTGAAGCGCCGAACACCTACGAATGCCTGCTGCAGGACGAAAACTTCGTTTCTCTGAACCAGCG



TGTTGTTTCTGCGCTGAAAGCGACCCTGGCGACCCTGGTTCGTGTTCCGAAAGCGCTGGTTTACGCG



AAAAAAGACTACCACCTGTTCTCTGAAATCATCAACGACATCGACGAACTGTCTTACGAAAAAGCG



TTCTCTTACTTCCCGGTTTCTCAGACCGAATTCGAAAACTCTTCTAACCGTACCATCAAACCGCTGCT



GCTGTTCAAAATCTCTAACAAAGACCTGTCTTTCGCGGAAAACTTCGAAAAAGGTAACCGTCAGAA



AATCGGTAAAAAAAACCTGCACACCCTGTACTTCGAAGCGCTGATGAAAGGTAACCAGGACACCAT



CGACATCGGTACCGGTATGGTTTTCCACCGTGTTAAATCTCTGAACTACAACGAAAAAACCCTGAA



ATACGGTCACCACTCTACCCAGCTGAACGAAAAATTCTCTTACCCGATCATCAAAGACAAACGTTTC



GCGTCTGACAAATTCCTGTTCCACCTGTCTACCGAAATCAACTACAAAGAAAAACGTAAACCGCTG



AACAACTCTATCATCGAATTCCTGACCAACAACCCGGACATCAACATCATCGGTCTGGACCGTGGT



GAACGTCACCTGATCTACCTGACCCTGATCAACCAGAAAGGTGAAATCCTGCGTCAGAAAACCTTC



AACATCGTTGGTAACACCAACTACCACGAAAAACTGAACCAGCGTGAAAAAGAACGTGACAACGC



GCGTAAATCTTGGGCGACCATCGGTAAAATCAAAGAACTGAAAGAAGGTTTCCTGTCTCTGGTTAT



CCACGAAATCGCGAAAATCATGGTTGAAAACAACGCGATCGTTGTTCTGGAAGACCTGAACTTCGG



TTTCAAACGTGGTCGTTTCAAAGTTGAAAAACAGATCTACCAGAAATTCGAAAAAATGCTGATCGA



CAAACTGAACTACCTGGTTTTCAAAGACAAAAAAGCGAACGAAGCGGGTGGTGTTCTGAAAGGTTA



CCAGCTGGCGGAAAAATTCGAATCTTTCCAGAAAATGGGTAAACAGTCTGGTTTCCTGTTCTACGTT



CCGGCGGCGTACACCTCTAAAATCGACCCGACCACCGGTTTCGTTAACATGCTGAACCTGAACTAC



ACCAACATGAAAGACGCGCAGACCCTGCTGTCTGGTATGGACAAAATCTCTTTCAACGCGGACGCG



AACTACTTCGAATTCGAACTGGACTACGAAAAATTCAAAACCAACCAGACCGACCACACCAACAAA



TGGACCATCTGCACCGTTGGTGAAAAACGTTTCACCTACAACTCTGCGACCAAAGAAACCACCACC



GTTAACGTTACCGAAGACCTGAAAAAACTGCTGGACAAATTCGAAGTTAAATACTCTAACGGTGAC



AACATCAAAGACGAAATCTGCCGTCAGACCGACGCGAAATTCTTCGAAATCATCCTGTGGCTGCTG



AAACTGACCATGCAGATGCGTAACTCTAACACCAAAACCGAAGAAGACTTCATCCTGTCTCCGGTT



AAAAACTCTAACGGTGAATTCTTCCGTTCTAACGACGACGCGAACGGTATCTGGCCGGCGGACGCG



GACGCGAACGGTGCGTACCACATCGCGCTGAAAGGTCTGTACCTGGTTAAAGAATGCTTCAACAAA



AACGAAAAATCTCTGAAAATCGAACACAAAAACTGGTTCAAATTCGCGCAGACCCGTTTCAACGGT



TCTCTGACCAAAAACGGTTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGT



AGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA





SEQ ID NO: 72
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGC



GTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAA



GCGGGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCAC



ATTGATTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCC



TACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATA



CGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATAT



ACCATGCACCATCATCATCACCATACCCAGTTCGAAGGTTTCACCAACCTGTACCAGGTTTCTAAAA



CCCTGCGTTTCGAACTGATCCCGCAGGGTAAAACCCTGAAACACATCCAGGAACAGGGTTTCATCG



AAGAAGACAAAGCGCGTAACGACCACTACAAAGAACTGAAACCGATCATCGACCGTATCTACAAA



ACCTACGCGGACCAGTGCCTGCAGCTGGTTCAGCTGGACTGGGAAAACCTGTCTGCGGCGATCGAC



TCTTACCGTAAAGAAAAAACCGAAGAAACCCGTAACGCGCTGATCGAAGAACAGGCGACCTACCG



TAACGCGATCCACGACTACTTCATCGGTCGTACCGACAACCTGACCGACGCGATCAACAAACGTCA



CGCGGAAATCTACAAAGGTCTGTTCAAAGCGGAACTGTTCAACGGTAAAGTTCTGAAACAGCTGGG



TACCGTTACCACCACCGAACACGAAAACGCGCTGCTGCGTTCTTTCGACAAATTCACCACCTACTTC



TCTGGTTTCTACGAAAACCGTAAAAACGTTTTCTCTGCGGAAGACATCTCTACCGCGATCCCGCACC



GTATCGTTCAGGACAACTTCCCGAAATTCAAAGAAAACTGCCACATCTTCACCCGTCTGATCACCGC



GGTTCCGTCTCTGCGTGAACACTTCGAAAACGTTAAAAAAGCGATCGGTATCTTCGTTTCTACCTCT



ATCGAAGAAGTTTTCTCTTTCCCGTTCTACAACCAGCTGCTGACCCAGACCCAGATCGACCTGTACA



ACCAGCTGCTGGGTGGTATCTCTCGTGAAGCGGGTACCGAAAAAATCAAAGGTCTGAACGAAGTTC



TGAACCTGGCGATCCAGAAAAACGACGAAACCGCGCACATCATCGCGTCTCTGCCGCACCGTTTCA



TCCCGCTGTTCAAACAGATCCTGTCTGACCGTAACACCCTGTCTTTCATCCTGGAAGAATTCAAATC



TGACGAAGAAGTTATCCAGTCTTTCTGCAAATACAAAACCCTGCTGCGTAACGAAAACGTTCTGGA



AACCGCGGAAGCGCTGTTCAACGAACTGAACTCTATCGACCTGACCCACATCTTCATCTCTCACAAA



AAACTGGAAACCATCTCTTCTGCGCTGTGCGACCACTGGGACACCCTGCGTAACGCGCTGTACGAA



CGTCGTATCTCTGAACTGACCGGTAAAATCACCAAATCTGCGAAAGAAAAAGTTCAGCGTTCTCTG



AAACACGAAGACATCAACCTGCAGGAAATCATCTCTGCGGCGGGTAAAGAACTGTCTGAAGCGTTC



AAACAGAAAACCTCTGAAATCCTGTCTCACGCGCACGCGGCGCTGGACCAGCCGCTGCCGACCACC



CTGAAAAAACAGGAAGAAAAAGAAATCCTGAAATCTCAGCTGGACTCTCTGCTGGGTCTGTACCAC



CTGCTGGACTGGTTCGCGGTTGACGAATCTAACGAAGTTGACCCGGAATTCTCTGCGCGTCTGACCG



GTATCAAACTGGAAATGGAACCGTCTCTGTCTTTCTACAACAAAGCGCGTAACTACGCGACCAAAA



AACCGTACTCTGTTGAAAAATTCAAACTGAACTTCCAGATGCCGACCCTGGCGTCTGGTTGGGACGT



TAACAAAGAAAAAAACAACGGTGCGATCCTGTTCGTTAAAAACGGTCTGTACTACCTGGGTATCAT



GCCGAAACAGAAAGGTCGTTACAAAGCGCTGTCTTTCGAACCGACCGAAAAAACCTCTGAAGGTTT



CGACAAAATGTACTACGACTACTTCCCGGACGCGGCGAAAATGATCCCGAAATGCTCTACCCAGCT



GAAAGCGGTTACCGCGCACTTCCAGACCCACACCACCCCGATCCTGCTGTCTAACAACTTCATCGAA



CCGCTGGAAATCACCAAAGAAATCTACGACCTGAACAACCCGGAAAAAGAACCGAAAAAATTCCA



GACCGCGTACGCGAAAAAAACCGGTGACCAGAAAGGTTACCGTGAAGCGCTGTGCAAATGGATCG



ACTTCACCCGTGACTTCCTGTCTAAATACACCAAAACCACCTCTATCGACCTGTCTTCTCTGCGTCCG



TCTTCTCAGTACAAAGACCTGGGTGAATACTACGCGGAACTGAACCCGCTGCTGTACCACATCTCTT



TCCAGCGTATCGCGGAAAAAGAAATCATGGACGCGGTTGAAACCGGTAAACTGTACCTGTTCCAGA



TCTACAACAAAGACTTCGCGAAAGGTCACCACGGTAAACCGAACCTGCACACCCTGTACTGGACCG



GTCTGTTCTCTCCGGAAAACCTGGCGAAAACCTCTATCAAACTGAACGGTCAGGCGGAACTGTTCTA



CCGTCCGAAATCTCGTATGAAACGTATGGCGCACCGTCTGGGTGAAAAAATGCTGAACAAAAAACT



GAAAGACCAGAAAACCCCGATCCCGGACACCCTGTACCAGGAACTGTACGACTACGTTAACCACCG



TCTGTCTCACGACCTGTCTGACGAAGCGCGTGCGCTGCTGCCGAACGTTATCACCAAAGAAGTTTCT



CACGAAATCATCAAAGACCGTCGTTTCACCTCTGACAAATTCTTCTTCCACGTTCCGATCACCCTGA



ACTACCAGGCGGCGAACTCTCCGTCTAAATTCAACCAGCGTGTTAACGCGTACCTGAAAGAACACC



CGGAAACCCCGATCATCGGTATCGACCGTGGTGAACGTAACCTGATCTACATCACCGTTATCGACTC



TACCGGTAAAATCCTGGAACAGCGTTCTCTGAACACCATCCAGCAGTTCGACTACCAGAAAAAACT



GGACAACCGTGAAAAAGAACGTGTTGCGGCGCGTCAGGCGTGGTCTGTTGTTGGTACCATCAAAGA



CCTGAAACAGGGTTACCTGTCTCAGGTTATCCACGAAATCGTTGACCTGATGATCCACTACCAGGCG



GTTGTTGTTCTGGAAAACCTGAACTTCGGTTTCAAATCTAAACGTACCGGTATCGCGGAAAAAGCG



GTTTACCAGCAGTTCGAAAAAATGCTGATCGACAAACTGAACTGCCTGGTTCTGAAAGACTACCCG



GCGGAAAAAGTTGGTGGTGTTCTGAACCCGTACCAGCTGACCGACCAGTTCACCTCTTTCGCGAAA



ATGGGTACCCAGTCTGGTTTCCTGTTCTACGTTCCGGCGCCGTACACCTCTAAAATCGACCCGCTGA



CCGGTTTCGTTGACCCGTTCGTTTGGAAAACCATCAAAAACCACGAATCTCGTAAACACTTCCTGGA



AGGTTTCGACTTCCTGCACTACGACGTTAAAACCGGTGACTTCATCCTGCACTTCAAAATGAACCGT



AACCTGTCTTTCCAGCGTGGTCTGCCGGGTTTCATGCCGGCGTGGGACATCGTTTTCGAAAAAAACG



AAACCCAGTTCGACGCGAAAGGTACCCCGTTCATCGCGGGTAAACGTATCGTTCCGGTTATCGAAA



ACCACCGTTTCACCGGTCGTTACCGTGACCTGTACCCGGCGAACGAACTGATCGCGCTGCTGGAAG



AAAAAGGTATCGTTTTCCGTGACGGTTCTAACATCCTGCCGAAACTGCTGGAAAACGACGACTCTC



ACGCGATCGACACCATGGTTGCGCTGATCCGTTCTGTTCTGCAGATGCGTAACTCTAACGCGGCGAC



CGGTGAAGACTACATCAACTCTCCGGTTCGTGACCTGAACGGTGTTTGCTTCGACTCTCGTTTCCAG



AACCCGGAATGGCCGATGGACGCGGACGCGAACGGTGCGTACCACATCGCGCTGAAAGGTCAGCT



GCTGCTGAACCACCTGAAAGAATCTAAAGACCTGAAACTGCAGAACGGTATCTCTAACCAGGACTG



GCTGGCGTACATCCAGGAACTGCGTAACTAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTAT



CTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAG



GATTACA





SEQ ID NO: 73
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGC



GTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAA



GCGGGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCAC



ATTGATTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCC



TACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATA



CGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATAT



ACCATGCACCATCATCATCACCATGCGGTTAAATCTATCAAAGTTAAACTGCGTCTGGACGACATGC



CGGAAATCCGTGCGGGTCTGTGGAAACTGCACAAAGAAGTTAACGCGGGTGTTCGTTACTACACCG



AATGGCTGTCTCTGCTGCGTCAGGAAAACCTGTACCGTCGTTCTCCGAACGGTGACGGTGAACAGG



AATGCGACAAAACCGCGGAAGAATGCAAAGCGGAACTGCTGGAACGTCTGCGTGCGCGTCAGGTT



GAAAACGGTCACCGTGGTCCGGCGGGTTCTGACGACGAACTGCTGCAGCTGGCGCGTCAGCTGTAC



GAACTGCTGGTTCCGCAGGCGATCGGTGCGAAAGGTGACGCGCAGCAGATCGCGCGTAAATTCCTG



TCTCCGCTGGCGGACAAAGACGCGGTTGGTGGTCTGGGTATCGCGAAAGCGGGTAACAAACCGCGT



TGGGTTCGTATGCGTGAAGCGGGTGAACCGGGTTGGGAAGAAGAAAAAGAAAAAGCGGAAACCCG



TAAATCTGCGGACCGTACCGCGGACGTTCTGCGTGCGCTGGCGGACTTCGGTCTGAAACCGCTGAT



GCGTGTTTACACCGACTCTGAAATGTCTTCTGTTGAATGGAAACCGCTGCGTAAAGGTCAGGCGGTT



CGTACCTGGGACCGTGACATGTTCCAGCAGGCGATCGAACGTATGATGTCTTGGGAATCTTGGAAC



CAGCGTGTTGGTCAGGAATACGCGAAACTGGTTGAACAGAAAAACCGTTTCGAACAGAAAAACTTC



GTTGGTCAGGAACACCTGGTTCACCTGGTTAACCAGCTGCAGCAGGACATGAAAGAAGCGTCTCCG



GGTCTGGAATCTAAAGAACAGACCGCGCACTACGTTACCGGTCGTGCGCTGCGTGGTTCTGACAAA



GTTTTCGAAAAATGGGGTAAACTGGCGCCGGACGCGCCGTTCGACCTGTACGACGCGGAAATCAAA



AACGTTCAGCGTCGTAACACCCGTCGTTTCGGTTCTCACGACCTGTTCGCGAAACTGGCGGAACCGG



AATACCAGGCGCTGTGGCGTGAAGACGCGTCTTTCCTGACCCGTTACGCGGTTTACAACTCTATCCT



GCGTAAACTGAACCACGCGAAAATGTTCGCGACCTTCACCCTGCCGGACGCGACCGCGCACCCGAT



CTGGACCCGTTTCGACAAACTGGGTGGTAACCTGCACCAGTACACCTTCCTGTTCAACGAATTCGGT



GAACGTCGTCACGCGATCCGTTTCCACAAACTGCTGAAAGTTGAAAACGGTGTTGCGCGTGAAGTT



GACGACGTTACCGTTCCGATCTCTATGTCTGAACAGCTGGACAACCTGCTGCCGCGTGACCCGAACG



AACCGATCGCGCTGTACTTCCGTGACTACGGTGCGGAACAGCACTTCACCGGTGAATTCGGTGGTG



CGAAAATCCAGTGCCGTCGTGACCAGCTGGCGCACATGCACCGTCGTCGTGGTGCGCGTGACGTTT



ACCTGAACGTTTCTGTTCGTGTTCAGTCTCAGTCTGAAGCGCGTGGTGAACGTCGTCCGCCGTACGC



GGCGGTTTTCCGTCTGGTTGGTGACAACCACCGTGCGTTCGTTCACTTCGACAAACTGTCTGACTAC



CTGGCGGAACACCCGGACGACGGTAAACTGGGTTCTGAAGGTCTGCTGTCTGGTCTGCGTGTTATGT



CTGTTGACCTGGGTCTGCGTACCTCTGCGTCTATCTCTGTTTTCCGTGTTGCGCGTAAAGACGAACTG



AAACCGAACTCTAAAGGTCGTGTTCCGTTCTTCTTCCCGATCAAAGGTAACGACAACCTGGTTGCGG



TTCACGAACGTTCTCAGCTGCTGAAACTGCCGGGTGAAACCGAATCTAAAGACCTGCGTGCGATCC



GTGAAGAACGTCAGCGTACCCTGCGTCAGCTGCGTACCCAGCTGGCGTACCTGCGTCTGCTGGTTCG



TTGCGGTTCTGAAGACGTTGGTCGTCGTGAACGTTCTTGGGCGAAACTGATCGAACAGCCGGTTGAC



GCGGCGAACCACATGACCCCGGACTGGCGTGAAGCGTTCGAAAACGAACTGCAGAAACTGAAATC



TCTGCACGGTATCTGCTCTGACAAAGAATGGATGGACGCGGTTTACGAATCTGTTCGTCGTGTTTGG



CGTCACATGGGTAAACAGGTTCGTGACTGGCGTAAAGACGTTCGTTCTGGTGAACGTCCGAAAATC



CGTGGTTACGCGAAAGACGTTGTTGGTGGTAACTCTATCGAACAGATCGAATACCTGGAACGTCAG



TACAAATTCCTGAAATCTTGGTCTTTCTTCGGTAAAGTTTCTGGTCAGGTTATCCGTGCGGAAAAAG



GTTCTCGTTTCGCGATCACCCTGCGTGAACACATCGACCACGCGAAAGAAGACCGTCTGAAAAAAC



TGGCGGACCGTATCATCATGGAAGCGCTGGGTTACGTTTACGCGCTGGACGAACGTGGTAAAGGTA



AATGGGTTGCGAAATACCCGCCGTGCCAGCTGATCCTGCTGGAAGAACTGTCTGAATACCAGTTCA



ACAACGACCGTCCGCCGTCTGAAAACAACCAGCTGATGCAGTGGTCTCACCGTGGTGTTTTCCAGG



AACTGATCAACCAGGCGCAGGTTCACGACCTGCTGGTTGGTACCATGTACGCGGCGTTCTCTTCTCG



TTTCGACGCGCGTACCGGTGCGCCGGGTATCCGTTGCCGTCGTGTTCCGGCGCGTTGCACCCAGGAA



CACAACCCGGAACCGTTCCCGTGGTGGCTGAACAAATTCGTTGTTGAACACACCCTGGACGCGTGC



CCGCTGCGTGCGGACGACCTGATCCCGACCGGTGAAGGTGAAATCTTCGTTTCTCCGTTCTCTGCGG



AAGAAGGTGACTTCCACCAGATCCACGCGGACCTGAACGCGGCGCAGAACCTGCAGCAGCGTCTGT



GGTCTGACTTCGACATCTCTCAGATCCGTCTGCGTTGCGACTGGGGTGAAGTTGACGGTGAACTGGT



TCTGATCCCGCGTCTGACCGGTAAACGTACCGCGGACTCTTACTCTAACAAAGTTTTCTACACCAAC



ACCGGTGTTACCTACTACGAACGTGAACGTGGTAAAAAACGTCGTAAAGTTTTCGCGCAGGAAAAA



CTGTCTGAAGAAGAAGCGGAACTGCTGGTTGAAGCGGACGAAGCGCGTGAAAAATCTGTTGTTCTG



ATGCGTGACCCGTCTGGTATCATCAACCGTGGTAACTGGACCCGTCAGAAAGAATTCTGGTCTATGG



TTAACCAGCGTATCGAAGGTTACCTGGTTAAACAGATCCGTTCTCGTGTTCCGCTGCAGGACTCTGC



GTGCGAAAACACCGGTGACATCTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAA



ATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTA



CA





SEQ ID NO: 74
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGC



GTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAA



GCGGGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCAC



ATTGATTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCC



TACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATA



CGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATAT



ACCATGCACCATCATCATCACCATGCGACCCGTTCTTTCATCCTGAAAATCGAACCGAACGAAGAA



GTTAAAAAAGGTCTGTGGAAAACCCACGAAGTTCTGAACCACGGTATCGCGTACTACATGAACATC



CTGAAACTGATCCGTCAGGAAGCGATCTACGAACACCACGAACAGGACCCGAAAAACCCGAAAAA



AGTTTCTAAAGCGGAAATCCAGGCGGAACTGTGGGACTTCGTTCTGAAAATGCAGAAATGCAACTC



TTTCACCCACGAAGTTGACAAAGACGTTGTTTTCAACATCCTGCGTGAACTGTACGAAGAACTGGTT



CCGTCTTCTGTTGAAAAAAAAGGTGAAGCGAACCAGCTGTCTAACAAATTCCTGTACCCGCTGGTTG



ACCCGAACTCTCAGTCTGGTAAAGGTACCGCGTCTTCTGGTCGTAAACCGCGTTGGTACAACCTGAA



AATCGCGGGTGACCCGTCTTGGGAAGAAGAAAAAAAAAAATGGGAAGAAGACAAAAAAAAAGAC



CCGCTGGCGAAAATCCTGGGTAAACTGGCGGAATACGGTCTGATCCCGCTGTTCATCCCGTTCACCG



ACTCTAACGAACCGATCGTTAAAGAAATCAAATGGATGGAAAAATCTCGTAACCAGTCTGTTCGTC



GTCTGGACAAAGACATGTTCATCCAGGCGCTGGAACGTTTCCTGTCTTGGGAATCTTGGAACCTGAA



AGTTAAAGAAGAATACGAAAAAGTTGAAAAAGAACACAAAACCCTGGAAGAACGTATCAAAGAAG



ACATCCAGGCGTTCAAATCTCTGGAACAGTACGAAAAAGAACGTCAGGAACAGCTGCTGCGTGACA



CCCTGAACACCAACGAATACCGTCTGTCTAAACGTGGTCTGCGTGGTTGGCGTGAAATCATCCAGA



AATGGCTGAAAATGGACGAAAACGAACCGTCTGAAAAATACCTGGAAGTTTTCAAAGACTACCAGC



GTAAACACCCGCGTGAAGCGGGTGACTACTCTGTTTACGAATTCCTGTCTAAAAAAGAAAACCACT



TCATCTGGCGTAACCACCCGGAATACCCGTACCTGTACGCGACCTTCTGCGAAATCGACAAAAAAA



AAAAAGACGCGAAACAGCAGGCGACCTTCACCCTGGCGGACCCGATCAACCACCCGCTGTGGGTTC



GTTTCGAAGAACGTTCTGGTTCTAACCTGAACAAATACCGTATCCTGACCGAACAGCTGCACACCG



AAAAACTGAAAAAAAAACTGACCGTTCAGCTGGACCGTCTGATCTACCCGACCGAATCTGGTGGTT



GGGAAGAAAAAGGTAAAGTTGACATCGTTCTGCTGCCGTCTCGTCAGTTCTACAACCAGATCTTCCT



GGACATCGAAGAAAAAGGTAAACACGCGTTCACCTACAAAGACGAATCTATCAAATTCCCGCTGAA



AGGTACCCTGGGTGGTGCGCGTGTTCAGTTCGACCGTGACCACCTGCGTCGTTACCCGCACAAAGTT



GAATCTGGTAACGTTGGTCGTATCTACTTCAACATGACCGTTAACATCGAACCGACCGAATCTCCGG



TTTCTAAATCTCTGAAAATCCACCGTGACGACTTCCCGAAATTCGTTAACTTCAAACCGAAAGAACT



GACCGAATGGATCAAAGACTCTAAAGGTAAAAAACTGAAATCTGGTATCGAATCTCTGGAAATCGG



TCTGCGTGTTATGTCTATCGACCTGGGTCAGCGTCAGGCGGCGGCGGCGTCTATCTTCGAAGTTGTT



GACCAGAAACCGGACATCGAAGGTAAACTGTTCTTCCCGATCAAAGGTACCGAACTGTACGCGGTT



CACCGTGCGTCTTTCAACATCAAACTGCCGGGTGAAACCCTGGTTAAATCTCGTGAAGTTCTGCGTA



AAGCGCGTGAAGACAACCTGAAACTGATGAACCAGAAACTGAACTTCCTGCGTAACGTTCTGCACT



TCCAGCAGTTCGAAGACATCACCGAACGTGAAAAACGTGTTACCAAATGGATCTCTCGTCAGGAAA



ACTCTGACGTTCCGCTGGTTTACCAGGACGAACTGATCCAGATCCGTGAACTGATGTACAAACCGTA



CAAAGACTGGGTTGCGTTCCTGAAACAGCTGCACAAACGTCTGGAAGTTGAAATCGGTAAAGAAGT



TAAACACTGGCGTAAATCTCTGTCTGACGGTCGTAAAGGTCTGTACGGTATCTCTCTGAAAAACATC



GACGAAATCGACCGTACCCGTAAATTCCTGCTGCGTTGGTCTCTGCGTCCGACCGAACCGGGTGAA



GTTCGTCGTCTGGAACCGGGTCAGCGTTTCGCGATCGACCAGCTGAACCACCTGAACGCGCTGAAA



GAAGACCGTCTGAAAAAAATGGCGAACACCATCATCATGCACGCGCTGGGTTACTGCTACGACGTT



CGTAAAAAAAAATGGCAGGCGAAAAACCCGGCGTGCCAGATCATCCTGTTCGAAGACCTGTCTAAC



TACAACCCGTACGAAGAACGTTCTCGTTTCGAAAACTCTAAACTGATGAAATGGTCTCGTCGTGAA



ATCCCGCGTCAGGTTGCGCTGCAGGGTGAAATCTACGGTCTGCAGGTTGGTGAAGTTGGTGCGCAG



TTCTCTTCTCGTTTCCACGCGAAAACCGGTTCTCCGGGTATCCGTTGCTCTGTTGTTACCAAAGAAAA



ACTGCAGGACAACCGTTTCTTCAAAAACCTGCAGCGTGAAGGTCGTCTGACCCTGGACAAAATCGC



GGTTCTGAAAGAAGGTGACCTGTACCCGGACAAAGGTGGTGAAAAATTCATCTCTCTGTCTAAAGA



CCGTAAACTGGTTACCACCCACGCGGACATCAACGCGGCGCAGAACCTGCAGAAACGTTTCTGGAC



CCGTACCCACGGTTTCTACAAAGTTTACTGCAAAGCGTACCAGGTTGACGGTCAGACCGTTTACATC



CCGGAATCTAAAGACCAGAAACAGAAAATCATCGAAGAATTCGGTGAAGGTTACTTCATCCTGAAA



GACGGTGTTTACGAATGGGGTAACGCGGGTAAACTGAAAATCAAAAAAGGTTCTTCTAAACAGTCT



TCTTCTGAACTGGTTGACTCTGACATCCTGAAAGACTCTTTCGACCTGGCGTCTGAACTGAAAGGTG



AAAAACTGATGCTGTACCGTGACCCGTCTGGTAACGTTTTCCCGTCTGACAAATGGATGGCGGCGG



GTGTTTTCTTCGGTAAACTGGAACGTATCCTGATCTCTAAACTGACCAACCAGTACTCTATCTCTACC



ATCGAAGACGACTCTTCTAAACAGTCTATGTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTT



TATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAA



GAGGATTACA





SEQ ID NO: 75
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGC



GTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAA



GCGGGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCAC



ATTGATTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCC



TACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATA



CGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATAT



ACCATGCACCATCATCATCACCATCCGACCCGTACCATCAACCTGAAACTGGTTCTGGGTAAAAACC



CGGAAAACGCGACCCTGCGTCGTGCGCTGTTCTCTACCCACCGTCTGGTTAACCAGGCGACCAAAC



GTATCGAAGAATTCCTGCTGCTGTGCCGTGGTGAAGCGTACCGTACCGTTGACAACGAAGGTAAAG



AAGCGGAAATCCCGCGTCACGCGGTTCAGGAAGAAGCGCTGGCGTTCGCGAAAGCGGCGCAGCGT



CACAACGGTTGCATCTCTACCTACGAAGACCAGGAAATCCTGGACGTTCTGCGTCAGCTGTACGAA



CGTCTGGTTCCGTCTGTTAACGAAAACAACGAAGCGGGTGACGCGCAGGCGGCGAACGCGTGGGTT



TCTCCGCTGATGTCTGCGGAATCTGAAGGTGGTCTGTCTGTTTACGACAAAGTTCTGGACCCGCCGC



CGGTTTGGATGAAACTGAAAGAAGAAAAAGCGCCGGGTTGGGAAGCGGCGTCTCAGATCTGGATC



CAGTCTGACGAAGGTCAGTCTCTGCTGAACAAACCGGGTTCTCCGCCGCGTTGGATCCGTAAACTGC



GTTCTGGTCAGCCGTGGCAGGACGACTTCGTTTCTGACCAGAAAAAAAAACAGGACGAACTGACCA



AAGGTAACGCGCCGCTGATCAAACAGCTGAAAGAAATGGGTCTGCTGCCGCTGGTTAACCCGTTCT



TCCGTCACCTGCTGGACCCGGAAGGTAAAGGTGTTTCTCCGTGGGACCGTCTGGCGGTTCGTGCGGC



GGTTGCGCACTTCATCTCTTGGGAATCTTGGAACCACCGTACCCGTGCGGAATACAACTCTCTGAAA



CTGCGTCGTGACGAATTCGAAGCGGCGTCTGACGAATTCAAAGACGACTTCACCCTGCTGCGTCAG



TACGAAGCGAAACGTCACTCTACCCTGAAATCTATCGCGCTGGCGGACGACTCTAACCCGTACCGT



ATCGGTGTTCGTTCTCTGCGTGCGTGGAACCGTGTTCGTGAAGAATGGATCGACAAAGGTGCGACC



GAAGAACAGCGTGTTACCATCCTGTCTAAACTGCAGACCCAGCTGCGTGGTAAATTCGGTGACCCG



GACCTGTTCAACTGGCTGGCGCAGGACCGTCACGTTCACCTGTGGTCTCCGCGTGACTCTGTTACCC



CGCTGGTTCGTATCAACGCGGTTGACAAAGTTCTGCGTCGTCGTAAACCGTACGCGCTGATGACCTT



CGCGCACCCGCGTTTCCACCCGCGTTGGATCCTGTACGAAGCGCCGGGTGGTTCTAACCTGCGTCAG



TACGCGCTGGACTGCACCGAAAACGCGCTGCACATCACCCTGCCGCTGCTGGTTGACGACGCGCAC



GGTACCTGGATCGAAAAAAAAATCCGTGTTCCGCTGGCGCCGTCTGGTCAGATCCAGGACCTGACC



CTGGAAAAACTGGAAAAAAAAAAAAACCGTCTGTACTACCGTTCTGGTTTCCAGCAGTTCGCGGGT



CTGGCGGGTGGTGCGGAAGTTCTGTTCCACCGTCCGTACATGGAACACGACGAACGTTCTGAAGAA



TCTCTGCTGGAACGTCCGGGTGCGGTTTGGTTCAAACTGACCCTGGACGTTGCGACCCAGGCGCCGC



CGAACTGGCTGGACGGTAAAGGTCGTGTTCGTACCCCGCCGGAAGTTCACCACTTCAAAACCGCGC



TGTCTAACAAATCTAAACACACCCGTACCCTGCAGCCGGGTCTGCGTGTTCTGTCTGTTGACCTGGG



TATGCGTACCTTCGCGTCTTGCTCTGTTTTCGAACTGATCGAAGGTAAACCGGAAACCGGTCGTGCG



TTCCCGGTTGCGGACGAACGTTCTATGGACTCTCCGAACAAACTGTGGGCGAAACACGAACGTTCTT



TCAAACTGACCCTGCCGGGTGAAACCCCGTCTCGTAAAGAAGAAGAAGAACGTTCTATCGCGCGTG



CGGAAATCTACGCGCTGAAACGTGACATCCAGCGTCTGAAATCTCTGCTGCGTCTGGGTGAAGAAG



ACAACGACAACCGTCGTGACGCGCTGCTGGAACAGTTCTTCAAAGGTTGGGGTGAAGAAGACGTTG



TTCCGGGTCAGGCGTTCCCGCGTTCTCTGTTCCAGGGTCTGGGTGCGGCGCCGTTCCGTTCTACCCC



GGAACTGTGGCGTCAGCACTGCCAGACCTACTACGACAAAGCGGAAGCGTGCCTGGCGAAACACAT



CTCTGACTGGCGTAAACGTACCCGTCCGCGTCCGACCTCTCGTGAAATGTGGTACAAAACCCGTTCT



TACCACGGTGGTAAATCTATCTGGATGCTGGAATACCTGGACGCGGTTCGTAAACTGCTGCTGTCTT



GGTCTCTGCGTGGTCGTACCTACGGTGCGATCAACCGTCAGGACACCGCGCGTTTCGGTTCTCTGGC



GTCTCGTCTGCTGCACCACATCAACTCTCTGAAAGAAGACCGTATCAAAACCGGTGCGGACTCTATC



GTTCAGGCGGCGCGTGGTTACATCCCGCTGCCGCACGGTAAAGGTTGGGAACAGCGTTACGAACCG



TGCCAGCTGATCCTGTTCGAAGACCTGGCGCGTTACCGTTTCCGTGTTGACCGTCCGCGTCGTGAAA



ACTCTCAGCTGATGCAGTGGAACCACCGTGCGATCGTTGCGGAAACCACCATGCAGGCGGAACTGT



ACGGTCAGATCGTTGAAAACACCGCGGCGGGTTTCTCTTCTCGTTTCCACGCGGCGACCGGTGCGCC



GGGTGTTCGTTGCCGTTTCCTGCTGGAACGTGACTTCGACAACGACCTGCCGAAACCGTACCTGCTG



CGTGAACTGTCTTGGATGCTGGGTAACACCAAAGTTGAATCTGAAGAAGAAAAACTGCGTCTGCTG



TCTGAAAAAATCCGTCCGGGTTCTCTGGTTCCGTGGGACGGTGGTGAACAGTTCGCGACCCTGCACC



CGAAACGTCAGACCCTGTGCGTTATCCACGCGGACATGAACGCGGCGCAGAACCTGCAGCGTCGTT



TCTTCGGTCGTTGCGGTGAAGCGTTCCGTCTGGTTTGCCAGCCGCACGGTGACGACGTTCTGCGTCT



GGCGTCTACCCCGGGTGCGCGTCTGCTGGGTGCGCTGCAGCAGCTGGAAAACGGTCAGGGTGCGTT



CGAACTGGTTCGTGACATGGGTTCTACCTCTCAGATGAACCGTTTCGTTATGAAATCTCTGGGTAAA



AAAAAAATCAAACCGCTGCAGGACAACAACGGTGACGACGAACTGGAAGACGTTCTGTCTGTTCTG



CCGGAAGAAGACGACACCGGTCGTATCACCGTTTTCCGTGACTCTTCTGGTATCTTCTTCCCGTGCA



ACGTTTGGATCCCGGCGAAACAGTTCTGGCCGGCGGTTCGTGCGATGATCTGGAAAGTTATGGCGT



CTCACTCTCTGGGTTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGG



AGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA





SEQ ID NO: 76
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGC



GTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAA



GCGGGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCAC



ATTGATTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCC



TACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATA



CGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATAT



ACCATGCACCATCATCATCACCATACCAAACTGCGTCACCGTCAGAAAAAACTGACCCACGACTGG



GCGGGTTCTAAAAAACGTGAAGTTCTGGGTTCTAACGGTAAACTGCAGAACCCGCTGCTGATGCCG



GTTAAAAAAGGTCAGGTTACCGAATTCCGTAAAGCGTTCTCTGCGTACGCGCGTGCGACCAAAGGT



GAAATGACCGACGGTCGTAAAAACATGTTCACCCACTCTTTCGAACCGTTCAAAACCAAACCGTCT



CTGCACCAGTGCGAACTGGCGGACAAAGCGTACCAGTCTCTGCACTCTTACCTGCCGGGTTCTCTGG



CGCACTTCCTGCTGTCTGCGCACGCGCTGGGTTTCCGTATCTTCTCTAAATCTGGTGAAGCGACCGC



GTTCCAGGCGTCTTCTAAAATCGAAGCGTACGAATCTAAACTGGCGTCTGAACTGGCGTGCGTTGAC



CTGTCTATCCAGAACCTGACCATCTCTACCCTGTTCAACGCGCTGACCACCTCTGTTCGTGGTAAAG



GTGAAGAAACCTCTGCGGACCCGCTGATCGCGCGTTTCTACACCCTGCTGACCGGTAAACCGCTGTC



TCGTGACACCCAGGGTCCGGAACGTGACCTGGCGGAAGTTATCTCTCGTAAAATCGCGTCTTCTTTC



GGTACCTGGAAAGAAATGACCGCGAACCCGCTGCAGTCTCTGCAGTTCTTCGAAGAAGAACTGCAC



GCGCTGGACGCGAACGTTTCTCTGTCTCCGGCGTTCGACGTTCTGATCAAAATGAACGACCTGCAGG



GTGACCTGAAAAACCGTACCATCGTTTTCGACCCGGACGCGCCGGTTTTCGAATACAACGCGGAAG



ACCCGGCGGACATCATCATCAAACTGACCGCGCGTTACGCGAAAGAAGCGGTTATCAAAAACCAGA



ACGTTGGTAACTACGTTAAAAACGCGATCACCACCACCAACGCGAACGGTCTGGGTTGGCTGCTGA



ACAAAGGTCTGTCTCTGCTGCCGGTTTCTACCGACGACGAACTGCTGGAATTCATCGGTGTTGAACG



TTCTCACCCGTCTTGCCACGCGCTGATCGAACTGATCGCGCAGCTGGAAGCGCCGGAACTGTTCGAA



AAAAACGTTTTCTCTGACACCCGTTCTGAAGTTCAGGGTATGATCGACTCTGCGGTTTCTAACCACA



TCGCGCGTCTGTCTTCTTCTCGTAACTCTCTGTCTATGGACTCTGAAGAACTGGAACGTCTGATCAA



ATCTTTCCAGATCCACACCCCGCACTGCTCTCTGTTCATCGGTGCGCAGTCTCTGTCTCAGCAGCTGG



AATCTCTGCCGGAAGCGCTGCAGTCTGGTGTTAACTCTGCGGACATCCTGCTGGGTTCTACCCAGTA



CATGCTGACCAACTCTCTGGTTGAAGAATCTATCGCGACCTACCAGCGTACCCTGAACCGTATCAAC



TACCTGTCTGGTGTTGCGGGTCAGATCAACGGTGCGATCAAACGTAAAGCGATCGACGGTGAAAAA



ATCCACCTGCCGGCGGCGTGGTCTGAACTGATCTCTCTGCCGTTCATCGGTCAGCCGGTTATCGACG



TTGAATCTGACCTGGCGCACCTGAAAAACCAGTACCAGACCCTGTCTAACGAATTCGACACCCTGA



TCTCTGCGCTGCAGAAAAACTTCGACCTGAACTTCAACAAAGCGCTGCTGAACCGTACCCAGCACTT



CGAAGCGATGTGCCGTTCTACCAAAAAAAACGCGCTGTCTAAACCGGAAATCGTTTCTTACCGTGA



CCTGCTGGCGCGTCTGACCTCTTGCCTGTACCGTGGTTCTCTGGTTCTGCGTCGTGCGGGTATCGAA



GTTCTGAAAAAACACAAAATCTTCGAATCTAACTCTGAACTGCGTGAACACGTTCACGAACGTAAA



CACTTCGTTTTCGTTTCTCCGCTGGACCGTAAAGCGAAAAAACTGCTGCGTCTGACCGACTCTCGTC



CGGACCTGCTGCACGTTATCGACGAAATCCTGCAGCACGACAACCTGGAAAACAAAGACCGTGAAT



CTCTGTGGCTGGTTCGTTCTGGTTACCTGCTGGCGGGTCTGCCGGACCAGCTGTCTTCTTCTTTCATC



AACCTGCCGATCATCACCCAGAAAGGTGACCGTCGTCTGATCGACCTGATCCAGTACGACCAGATC



AACCGTGACGCGTTCGTTATGCTGGTTACCTCTGCGTTCAAATCTAACCTGTCTGGTCTGCAGTACC



GTGCGAACAAACAGTCTTTCGTTGTTACCCGTACCCTGTCTCCGTACCTGGGTTCTAAACTGGTTTAC



GTTCCGAAAGACAAAGACTGGCTGGTTCCGTCTCAGATGTTCGAAGGTCGTTTCGCGGACATCCTGC



AGTCTGACTACATGGTTTGGAAAGACGCGGGTCGTCTGTGCGTTATCGACACCGCGAAACACCTGT



CTAACATCAAAAAATCTGTTTTCTCTTCTGAAGAAGTTCTGGCGTTCCTGCGTGAACTGCCGCACCG



TACCTTCATCCAGACCGAAGTTCGTGGTCTGGGTGTTAACGTTGACGGTATCGCGTTCAACAACGGT



GACATCCCGTCTCTGAAAACCTTCTCTAACTGCGTTCAGGTTAAAGTTTCTCGTACCAACACCTCTCT



GGTTCAGACCCTGAACCGTTGGTTCGAAGGTGGTAAAGTTTCTCCGCCGTCTATCCAGTTCGAACGT



GCGTACTACAAAAAAGACGACCAGATCCACGAAGACGCGGCGAAACGTAAAATCCGTTTCCAGAT



GCCGGCGACCGAACTGGTTCACGCGTCTGACGACGCGGGTTGGACCCCGTCTTACCTGCTGGGTATC



GACCCGGGTGAATACGGTATGGGTCTGTCTCTGGTTTCTATCAACAACGGTGAAGTTCTGGACTCTG



GTTTCATCCACATCAACTCTCTGATCAACTTCGCGTCTAAAAAATCTAACCACCAGACCAAAGTTGT



TCCGCGTCAGCAGTACAAATCTCCGTACGCGAACTACCTGGAACAGTCTAAAGACTCTGCGGCGGG



TGACATCGCGCACATCCTGGACCGTCTGATCTACAAACTGAACGCGCTGCCGGTTTTCGAAGCGCTG



TCTGGTAACTCTCAGTCTGCGGCGGACCAGGTTTGGACCAAAGTTCTGTCTTTCTACACCTGGGGTG



ACAACGACGCGCAGAACTCTATCCGTAAACAGCACTGGTTCGGTGCGTCTCACTGGGACATCAAAG



GTATGCTGCGTCAGCCGCCGACCGAAAAAAAACCGAAACCGTACATCGCGTTCCCGGGTTCTCAGG



TTTCTTCTTACGGTAACTCTCAGCGTTGCTCTTGCTGCGGTCGTAACCCGATCGAACAGCTGCGTGA



AATGGCGAAAGACACCTCTATCAAAGAACTGAAAATCCGTAACTCTGAAATCCAGCTGTTCGACGG



TACCATCAAACTGTTCAACCCGGACCCGTCTACCGTTATCGAACGTCGTCGTCACAACCTGGGTCCG



TCTCGTATCCCGGTTGCGGACCGTACCTTCAAAAACATCTCTCCGTCTTCTCTGGAATTCAAAGAAC



TGATCACCATCGTTTCTCGTTCTATCCGTCACTCTCCGGAATTCATCGCGAAAAAACGTGGTATCGG



TTCTGAATACTTCTGCGCGTACTCTGACTGCAACTCTTCTCTGAACTCTGAAGCGAACGCGGCGGCG



AACGTTGCGCAGAAATTCCAGAAACAGCTGTTCTTCGAACTGTAAGAAATCATCCTTAGCGAAAGC



TAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTAC



TCAGGAAGCAAAGAGGATTACA





SEQ ID NO: 77
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGC



GTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAA



GCGGGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCAC



ATTGATTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCC



TACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATA



CGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATAT



ACCATGCACCATCATCATCACCATAAACGTATCCTGAACTCTCTGAAAGTTGCGGCGCTGCGTCTGC



TGTTCCGTGGTAAAGGTTCTGAACTGGTTAAAACCGTTAAATACCCGCTGGTTTCTCCGGTTCAGGG



TGCGGTTGAAGAACTGGCGGAAGCGATCCGTCACGACAACCTGCACCTGTTCGGTCAGAAAGAAAT



CGTTGACCTGATGGAAAAAGACGAAGGTACCCAGGTTTACTCTGTTGTTGACTTCTGGCTGGACACC



CTGCGTCTGGGTATGTTCTTCTCTCCGTCTGCGAACGCGCTGAAAATCACCCTGGGTAAATTCAACT



CTGACCAGGTTTCTCCGTTCCGTAAAGTTCTGGAACAGTCTCCGTTCTTCCTGGCGGGTCGTCTGAA



AGTTGAACCGGCGGAACGTATCCTGTCTGTTGAAATCCGTAAAATCGGTAAACGTGAAAACCGTGT



TGAAAACTACGCGGCGGACGTTGAAACCTGCTTCATCGGTCAGCTGTCTTCTGACGAAAAACAGTC



TATCCAGAAACTGGCGAACGACATCTGGGACTCTAAAGACCACGAAGAACAGCGTATGCTGAAAG



CGGACTTCTTCGCGATCCCGCTGATCAAAGACCCGAAAGCGGTTACCGAAGAAGACCCGGAAAACG



AAACCGCGGGTAAACAGAAACCGCTGGAACTGTGCGTTTGCCTGGTTCCGGAACTGTACACCCGTG



GTTTCGGTTCTATCGCGGACTTCCTGGTTCAGCGTCTGACCCTGCTGCGTGACAAAATGTCTACCGA



CACCGCGGAAGACTGCCTGGAATACGTTGGTATCGAAGAAGAAAAAGGTAACGGTATGAACTCTCT



GCTGGGTACCTTCCTGAAAAACCTGCAGGGTGACGGTTTCGAACAGATCTTCCAGTTCATGCTGGGT



TCTTACGTTGGTTGGCAGGGTAAAGAAGACGTTCTGCGTGAACGTCTGGACCTGCTGGCGGAAAAA



GTTAAACGTCTGCCGAAACCGAAATTCGCGGGTGAATGGTCTGGTCACCGTATGTTCCTGCACGGTC



AGCTGAAATCTTGGTCTTCTAACTTCTTCCGTCTGTTCAACGAAACCCGTGAACTGCTGGAATCTAT



CAAATCTGACATCCAGCACGCGACCATGCTGATCTCTTACGTTGAAGAAAAAGGTGGTTACCACCC



GCAGCTGCTGTCTCAGTACCGTAAACTGATGGAACAGCTGCCGGCGCTGCGTACCAAAGTTCTGGA



CCCGGAAATCGAAATGACCCACATGTCTGAAGCGGTTCGTTCTTACATCATGATCCACAAATCTGTT



GCGGGTTTCCTGCCGGACCTGCTGGAATCTCTGGACCGTGACAAAGACCGTGAATTCCTGCTGTCTA



TCTTCCCGCGTATCCCGAAAATCGACAAAAAAACCAAAGAAATCGTTGCGTGGGAACTGCCGGGTG



AACCGGAAGAAGGTTACCTGTTCACCGCGAACAACCTGTTCCGTAACTTCCTGGAAAACCCGAAAC



ACGTTCCGCGTTTCATGGCGGAACGTATCCCGGAAGACTGGACCCGTCTGCGTTCTGCGCCGGTTTG



GTTCGACGGTATGGTTAAACAGTGGCAGAAAGTTGTTAACCAGCTGGTTGAATCTCCGGGTGCGCT



GTACCAGTTCAACGAATCTTTCCTGCGTCAGCGTCTGCAGGCGATGCTGACCGTTTACAAACGTGAC



CTGCAGACCGAAAAATTCCTGAAACTGCTGGCGGACGTTTGCCGTCCGCTGGTTGACTTCTTCGGTC



TGGGTGGTAACGACATCATCTTCAAATCTTGCCAGGACCCGCGTAAACAGTGGCAGACCGTTATCC



CGCTGTCTGTTCCGGCGGACGTTTACACCGCGTGCGAAGGTCTGGCGATCCGTCTGCGTGAAACCCT



GGGTTTCGAATGGAAAAACCTGAAAGGTCACGAACGTGAAGACTTCCTGCGTCTGCACCAGCTGCT



GGGTAACCTGCTGTTCTGGATCCGTGACGCGAAACTGGTTGTTAAACTGGAAGACTGGATGAACAA



CCCGTGCGTTCAGGAATACGTTGAAGCGCGTAAAGCGATCGACCTGCCGCTGGAAATCTTCGGTTTC



GAAGTTCCGATCTTCCTGAACGGTTACCTGTTCTCTGAACTGCGTCAGCTGGAACTGCTGCTGCGTC



GTAAATCTGTTATGACCTCTTACTCTGTTAAAACCACCGGTTCTCCGAACCGTCTGTTCCAGCTGGTT



TACCTGCCGCTGAACCCGTCTGACCCGGAAAAAAAAAACTCTAACAACTTCCAGGAACGTCTGGAC



ACCCCGACCGGTCTGTCTCGTCGTTTCCTGGACCTGACCCTGGACGCGTTCGCGGGTAAACTGCTGA



CCGACCCGGTTACCCAGGAACTGAAAACCATGGCGGGTTTCTACGACCACCTGTTCGGTTTCAAACT



GCCGTGCAAACTGGCGGCGATGTCTAACCACCCGGGTTCTTCTTCTAAAATGGTTGTTCTGGCGAAA



CCGAAAAAAGGTGTTGCGTCTAACATCGGTTTCGAACCGATCCCGGACCCGGCGCACCCGGTTTTCC



GTGTTCGTTCTTCTTGGCCGGAACTGAAATACCTGGAAGGTCTGCTGTACCTGCCGGAAGACACCCC



GCTGACCATCGAACTGGCGGAAACCTCTGTTTCTTGCCAGTCTGTTTCTTCTGTTGCGTTCGACCTGA



AAAACCTGACCACCATCCTGGGTCGTGTTGGTGAATTCCGTGTTACCGCGGACCAGCCGTTCAAACT



GACCCCGATCATCCCGGAAAAAGAAGAATCTTTCATCGGTAAAACCTACCTGGGTCTGGACGCGGG



TGAACGTTCTGGTGTTGGTTTCGCGATCGTTACCGTTGACGGTGACGGTTACGAAGTTCAGCGTCTG



GGTGTTCACGAAGACACCCAGCTGATGGCGCTGCAGCAGGTTGCGTCTAAATCTCTGAAAGAACCG



GTTTTCCAGCCGCTGCGTAAAGGTACCTTCCGTCAGCAGGAACGTATCCGTAAATCTCTGCGTGGTT



GCTACTGGAACTTCTACCACGCGCTGATGATCAAATACCGTGCGAAAGTTGTTCACGAAGAATCTGT



TGGTTCTTCTGGTCTGGTTGGTCAGTGGCTGCGTGCGTTCCAGAAAGACCTGAAAAAAGCGGACGTT



CTGCCGAAAAAAGGTGGTAAAAACGGTGTTGACAAAAAAAAACGTGAATCTTCTGCGCAGGACAC



CCTGTGGGGTGGTGCGTTCTCTAAAAAAGAAGAACAGCAGATCGCGTTCGAAGTTCAGGCGGCGGG



TTCTTCTCAGTTCTGCCTGAAATGCGGTTGGTGGTTCCAGCTGGGTATGCGTGAAGTTAACCGTGTT



CAGGAATCTGGTGTTGTTCTGGACTGGAACCGTTCTATCGTTACCTTCCTGATCGAATCTTCTGGTGA



AAAAGTTTACGGTTTCTCTCCGCAGCAGCTGGAAAAAGGTTTCCGTCCGGACATCGAAACCTTCAA



AAAAATGGTTCGTGACTTCATGCGTCCGCCGATGTTCGACCGTAAAGGTCGTCCGGCGGCGGCGTA



CGAACGTTTCGTTCTGGGTCGTCGTCACCGTCGTTACCGTTTCGACAAAGTTTTCGAAGAACGTTTC



GGTCGTTCTGCGCTGTTCATCTGCCCGCGTGTTGGTTGCGGTAACTTCGACCACTCTTCTGAACAGTC



TGCGGTTGTTCTGGCGCTGATCGGTTACATCGCGGACAAAGAAGGTATGTCTGGTAAAAAACTGGT



TTACGTTCGTCTGGCGGAACTGATGGCGGAATGGAAACTGAAAAAACTGGAACGTTCTCGTGTTGA



AGAACAGTCTTCTGCGCAGTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATG



TAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA





SEQ ID NO: 78
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGC



GTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAA



GCGGGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCAC



ATTGATTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCC



TACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATA



CGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATAT



ACCATGCACCATCATCATCACCATGCGGAATCTAAACAGATGCAGTGCCGTAAATGCGGTGCGTCT



ATGAAATACGAAGTTATCGGTCTGGGTAAAAAATCTTGCCGTTACATGTGCCCGGACTGCGGTAAC



CACACCTCTGCGCGTAAAATCCAGAACAAAAAAAAACGTGACAAAAAATACGGTTCTGCGTCTAAA



GCGCAGTCTCAGCGTATCGCGGTTGCGGGTGCGCTGTACCCGGACAAAAAAGTTCAGACCATCAAA



ACCTACAAATACCCGGCGGACCTGAACGGTGAAGTTCACGACTCTGGTGTTGCGGAAAAAATCGCG



CAGGCGATCCAGGAAGACGAAATCGGTCTGCTGGGTCCGTCTTCTGAATACGCGTGCTGGATCGCG



TCTCAGAAACAGTCTGAACCGTACTCTGTTGTTGACTTCTGGTTCGACGCGGTTTGCGCGGGTGGTG



TTTTCGCGTACTCTGGTGCGCGTCTGCTGTCTACCGTTCTGCAGCTGTCTGGTGAAGAATCTGTTCTG



CGTGCGGCGCTGGCGTCTTCTCCGTTCGTTGACGACATCAACCTGGCGCAGGCGGAAAAATTCCTGG



CGGTTTCTCGTCGTACCGGTCAGGACAAACTGGGTAAACGTATCGGTGAATGCTTCGCGGAAGGTC



GTCTGGAAGCGCTGGGTATCAAAGACCGTATGCGTGAATTCGTTCAGGCGATCGACGTTGCGCAGA



CCGCGGGTCAGCGTTTCGCGGCGAAACTGAAAATCTTCGGTATCTCTCAGATGCCGGAAGCGAAAC



AGTGGAACAACGACTCTGGTCTGACCGTTTGCATCCTGCCGGACTACTACGTTCCGGAAGAAAACC



GTGCGGACCAGCTGGTTGTTCTGCTGCGTCGTCTGCGTGAAATCGCGTACTGCATGGGTATCGAAGA



CGAAGCGGGTTTCGAACACCTGGGTATCGACCCGGGTGCGCTGTCTAACTTCTCTAACGGTAACCCG



AAACGTGGTTTCCTGGGTCGTCTGCTGAACAACGACATCATCGCGCTGGCGAACAACATGTCTGCG



ATGACCCCGTACTGGGAAGGTCGTAAAGGTGAACTGATCGAACGTCTGGCGTGGCTGAAACACCGT



GCGGAAGGTCTGTACCTGAAAGAACCGCACTTCGGTAACTCTTGGGCGGACCACCGTTCTCGTATCT



TCTCTCGTATCGCGGGTTGGCTGTCTGGTTGCGCGGGTAAACTGAAAATCGCGAAAGACCAGATCTC



TGGTGTTCGTACCGACCTGTTCCTGCTGAAACGTCTGCTGGACGCGGTTCCGCAGTCTGCGCCGTCT



CCGGACTTCATCGCGTCTATCTCTGCGCTGGACCGTTTCCTGGAAGCGGCGGAATCTTCTCAGGACC



CGGCGGAACAGGTTCGTGCGCTGTACGCGTTCCACCTGAACGCGCCGGCGGTTCGTTCTATCGCGA



ACAAAGCGGTTCAGCGTTCTGACTCTCAGGAATGGCTGATCAAAGAACTGGACGCGGTTGACCACC



TGGAATTCAACAAAGCGTTCCCGTTCTTCTCTGACACCGGTAAAAAAAAAAAAAAAGGTGCGAACT



CTAACGGTGCGCCGTCTGAAGAAGAATACACCGAAACCGAATCTATCCAGCAGCCGGAAGACGCG



GAACAGGAAGTTAACGGTCAGGAAGGTAACGGTGCGTCTAAAAACCAGAAAAAATTCCAGCGTAT



CCCGCGTTTCTTCGGTGAAGGTTCTCGTTCTGAATACCGTATCCTGACCGAAGCGCCGCAGTACTTC



GACATGTTCTGCAACAACATGCGTGCGATCTTCATGCAGCTGGAATCTCAGCCGCGTAAAGCGCCG



CGTGACTTCAAATGCTTCCTGCAGAACCGTCTGCAGAAACTGTACAAACAGACCTTCCTGAACGCG



CGTTCTAACAAATGCCGTGCGCTGCTGGAATCTGTTCTGATCTCTTGGGGTGAATTCTACACCTACG



GTGCGAACGAAAAAAAATTCCGTCTGCGTCACGAAGCGTCTGAACGTTCTTCTGACCCGGACTACG



TTGTTCAGCAGGCGCTGGAAATCGCGCGTCGTCTGTTCCTGTTCGGTTTCGAATGGCGTGACTGCTC



TGCGGGTGAACGTGTTGACCTGGTTGAAATCCACAAAAAAGCGATCTCTTTCCTGCTGGCGATCACC



CAGGCGGAAGTTTCTGTTGGTTCTTACAACTGGCTGGGTAACTCTACCGTTTCTCGTTACCTGTCTGT



TGCGGGTACCGACACCCTGTACGGTACCCAGCTGGAAGAATTCCTGAACGCGACCGTTCTGTCTCA



GATGCGTGGTCTGGCGATCCGTCTGTCTTCTCAGGAACTGAAAGACGGTTTCGACGTTCAGCTGGAA



TCTTCTTGCCAGGACAACCTGCAGCACCTGCTGGTTTACCGTGCGTCTCGTGACCTGGCGGCGTGCA



AACGTGCGACCTGCCCGGCGGAACTGGACCCGAAAATCCTGGTTCTGCCGGTTGGTGCGTTCATCG



CGTCTGTTATGAAAATGATCGAACGTGGTGACGAACCGCTGGCGGGTGCGTACCTGCGTCACCGTC



CGCACTCTTTCGGTTGGCAGATCCGTGTTCGTGGTGTTGCGGAAGTTGGTATGGACCAGGGTACCGC



GCTGGCGTTCCAGAAACCGACCGAATCTGAACCGTTCAAAATCAAACCGTTCTCTGCGCAGTACGG



TCCGGTTCTGTGGCTGAACTCTTCTTCTTACTCTCAGTCTCAGTACCTGGACGGTTTCCTGTCTCAGC



CGAAAAACTGGTCTATGCGTGTTCTGCCGCAGGCGGGTTCTGTTCGTGTTGAACAGCGTGTTGCGCT



GATCTGGAACCTGCAGGCGGGTAAAATGCGTCTGGAACGTTCTGGTGCGCGTGCGTTCTTCATGCCG



GTTCCGTTCTCTTTCCGTCCGTCTGGTTCTGGTGACGAAGCGGTTCTGGCGCCGAACCGTTACCTGG



GTCTGTTCCCGCACTCTGGTGGTATCGAATACGCGGTTGTTGACGTTCTGGACTCTGCGGGTTTCAA



AATCCTGGAACGTGGTACCATCGCGGTTAACGGTTTCTCTCAGAAACGTGGTGAACGTCAGGAAGA



AGCGCACCGTGAAAAACAGCGTCGTGGTATCTCTGACATCGGTCGTAAAAAACCGGTTCAGGCGGA



AGTTGACGCGGCGAACGAACTGCACCGTAAATACACCGACGTTGCGACCCGTCTGGGTTGCCGTAT



CGTTGTTCAGTGGGCGCCGCAGCCGAAACCGGGTACCGCGCCGACCGCGCAGACCGTTTACGCGCG



TGCGGTTCGTACCGAAGCGCCGCGTTCTGGTAACCAGGAAGACCACGCGCGTATGAAATCTTCTTG



GGGTTACACCTGGGGTACCTACTGGGAAAAACGTAAACCGGAAGACATCCTGGGTATCTCTACCCA



GGTTTACTGGACCGGTGGTATCGGTGAATCTTGCCCGGCGGTTGCGGTTGCGCTGCTGGGTCACATC



CGTGCGACCTCTACCCAGACCGAATGGGAAAAAGAAGAAGTTGTTTTCGGTCGTCTGAAAAAATTC



TTCCCGTCTTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACC



CTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA





SEQ ID NO: 79
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGC



GTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAA



GCGGGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCAC



ATTGATTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCC



TACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATA



CGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATAT



ACCATGCACCATCATCATCACCATGAAAAACGTATCAACAAAATCCGTAAAAAACTGTCTGCGGAC



AACGCGACCAAACCGGTTTCTCGTTCTGGTCCGATGAAAACCCTGCTGGTTCGTGTTATGACCGACG



ACCTGAAAAAACGTCTGGAAAAACGTCGTAAAAAACCGGAAGTTATGCCGCAGGTTATCTCTAACA



ACGCGGCGAACAACCTGCGTATGCTGCTGGACGACTACACCAAAATGAAAGAAGCGATCCTGCAG



GTTTACTGGCAGGAATTCAAAGACGACCACGTTGGTCTGATGTGCAAATTCGCGCAGCCGGCGTCT



AAAAAAATCGACCAGAACAAACTGAAACCGGAAATGGACGAAAAAGGTAACCTGACCACCGCGGG



TTTCGCGTGCTCTCAGTGCGGTCAGCCGCTGTTCGTTTACAAACTGGAACAGGTTTCTGAAAAAGGT



AAAGCGTACACCAACTACTTCGGTCGTTGCAACGTTGCGGAACACGAAAAACTGATCCTGCTGGCG



CAGCTGAAACCGGAAAAAGACTCTGACGAAGCGGTTACCTACTCTCTGGGTAAATTCGGTCAGCGT



GCGCTGGACTTCTACTCTATCCACGTTACCAAAGAATCTACCCACCCGGTTAAACCGCTGGCGCAGA



TCGCGGGTAACCGTTACGCGTCTGGTCCGGTTGGTAAAGCGCTGTCTGACGCGTGCATGGGTACCAT



CGCGTCTTTCCTGTCTAAATACCAGGACATCATCATCGAACACCAGAAAGTTGTTAAAGGTAACCA



GAAACGTCTGGAATCTCTGCGTGAACTGGCGGGTAAAGAAAACCTGGAATACCCGTCTGTTACCCT



GCCGCCGCAGCCGCACACCAAAGAAGGTGTTGACGCGTACAACGAAGTTATCGCGCGTGTTCGTAT



GTGGGTTAACCTGAACCTGTGGCAGAAACTGAAACTGTCTCGTGACGACGCGAAACCGCTGCTGCG



TCTGAAAGGTTTCCCGTCTTTCCCGGTTGTTGAACGTCGTGAAAACGAAGTTGACTGGTGGAACACC



ATCAACGAAGTTAAAAAACTGATCGACGCGAAACGTGACATGGGTCGTGTTTTCTGGTCTGGTGTT



ACCGCGGAAAAACGTAACACCATCCTGGAAGGTTACAACTACCTGCCGAACGAAAACGACCACAA



AAAACGTGAAGGTTCTCTGGAAAACCCGAAAAAACCGGCGAAACGTCAGTTCGGTGACCTGCTGCT



GTACCTGGAAAAAAAATACGCGGGTGACTGGGGTAAAGTTTTCGACGAAGCGTGGGAACGTATCG



ACAAAAAAATCGCGGGTCTGACCTCTCACATCGAACGTGAAGAAGCGCGTAACGCGGAAGACGCG



CAGTCTAAAGCGGTTCTGACCGACTGGCTGCGTGCGAAAGCGTCTTTCGTTCTGGAACGTCTGAAAG



AAATGGACGAAAAAGAATTCTACGCGTGCGAAATCCAGCTGCAGAAATGGTACGGTGACCTGCGTG



GTAACCCGTTCGCGGTTGAAGCGGAAAACCGTGTTGTTGACATCTCTGGTTTCTCTATCGGTTCTGA



CGGTCACTCTATCCAGTACCGTAACCTGCTGGCGTGGAAATACCTGGAAAACGGTAAACGTGAATT



CTACCTGCTGATGAACTACGGTAAAAAAGGTCGTATCCGTTTCACCGACGGTACCGACATCAAAAA



ATCTGGTAAATGGCAGGGTCTGCTGTACGGTGGTGGTAAAGCGAAAGTTATCGACCTGACCTTCGA



CCCGGACGACGAACAGCTGATCATCCTGCCGCTGGCGTTCGGTACCCGTCAGGGTCGTGAATTCATC



TGGAACGACCTGCTGTCTCTGGAAACCGGTCTGATCAAACTGGCGAACGGTCGTGTTATCGAAAAA



ACCATCTACAACAAAAAAATCGGTCGTGACGAACCGGCGCTGTTCGTTGCGCTGACCTTCGAACGT



CGTGAAGTTGTTGACCCGTCTAACATCAAACCGGTTAACCTGATCGGTGTTGACCGTGGTGAAAAC



ATCCCGGCGGTTATCGCGCTGACCGACCCGGAAGGTTGCCCGCTGCCGGAATTCAAAGACTCTTCTG



GTGGTCCGACCGACATCCTGCGTATCGGTGAAGGTTACAAAGAAAAACAGCGTGCGATCCAGGCGG



CGAAAGAAGTTGAACAGCGTCGTGCGGGTGGTTACTCTCGTAAATTCGCGTCTAAATCTCGTAACCT



GGCGGACGACATGGTTCGTAACTCTGCGCGTGACCTGTTCTACCACGCGGTTACCCACGACGCGGTT



CTGGTTTTCGAAAACCTGTCTCGTGGTTTCGGTCGTCAGGGTAAACGTACCTTCATGACCGAACGTC



AGTACACCAAAATGGAAGACTGGCTGACCGCGAAACTGGCGTACGAAGGTCTGACCTCTAAAACCT



ACCTGTCTAAAACCCTGGCGCAGTACACCTCTAAAACCTGCTCTAACTGCGGTTTCACCATCACCAC



CGCGGACTACGACGGTATGCTGGTTCGTCTGAAAAAAACCTCTGACGGTTGGGCGACCACCCTGAA



CAACAAAGAACTGAAAGCGGAAGGTCAGATCACCTACTACAACCGTTACAAACGTCAGACCGTTGA



AAAAGAACTGTCTGCGGAACTGGACCGTCTGTCTGAAGAATCTGGTAACAACGACATCTCTAAATG



GACCAAAGGTCGTCGTGACGAAGCGCTGTTCCTGCTGAAAAAACGTTTCTCTCACCGTCCGGTTCAG



GAACAGTTCGTTTGCCTGGACTGCGGTCACGAAGTTCACGCGGACGAACAGGCGGCGCTGAACATC



GCGCGTTCTTGGCTGTTCCTGAACTCTAACTCTACCGAATTCAAATCTTACAAATCTGGTAAACAGC



CGTTCGTTGGTGCGTGGCAGGCGTTCTACAAACGTCGTCTGAAAGAAGTTTGGAAACCGAACGCGT



AAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAA



ATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA





SEQ ID NO: 80
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGC



GTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAA



GCGGGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCAC



ATTGATTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCC



TACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATA



CGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATAT



ACCATGCACCATCATCATCACCATAAACGTATCAACAAAATCCGTCGTCGTCTGGTTAAAGACTCTA



ACACCAAAAAAGCGGGTAAAACCGGTCCGATGAAAACCCTGCTGGTTCGTGTTATGACCCCGGACC



TGCGTGAACGTCTGGAAAACCTGCGTAAAAAACCGGAAAACATCCCGCAGCCGATCTCTAACACCT



CTCGTGCGAACCTGAACAAACTGCTGACCGACTACACCGAAATGAAAAAAGCGATCCTGCACGTTT



ACTGGGAAGAATTCCAGAAAGACCCGGTTGGTCTGATGTCTCGTGTTGCGCAGCCGGCGCCGAAAA



ACATCGACCAGCGTAAACTGATCCCGGTTAAAGACGGTAACGAACGTCTGACCTCTTCTGGTTTCGC



GTGCTCTCAGTGCTGCCAGCCGCTGTACGTTTACAAACTGGAACAGGTTAACGACAAAGGTAAACC



GCACACCAACTACTTCGGTCGTTGCAACGTTTCTGAACACGAACGTCTGATCCTGCTGTCTCCGCAC



AAACCGGAAGCGAACGACGAACTGGTTACCTACTCTCTGGGTAAATTCGGTCAGCGTGCGCTGGAC



TTCTACTCTATCCACGTTACCCGTGAATCTAACCACCCGGTTAAACCGCTGGAACAGATCGGTGGTA



ACTCTTGCGCGTCTGGTCCGGTTGGTAAAGCGCTGTCTGACGCGTGCATGGGTGCGGTTGCGTCTTT



CCTGACCAAATACCAGGACATCATCCTGGAACACCAGAAAGTTATCAAAAAAAACGAAAAACGTCT



GGCGAACCTGAAAGACATCGCGTCTGCGAACGGTCTGGCGTTCCCGAAAATCACCCTGCCGCCGCA



GCCGCACACCAAAGAAGGTATCGAAGCGTACAACAACGTTGTTGCGCAGATCGTTATCTGGGTTAA



CCTGAACCTGTGGCAGAAACTGAAAATCGGTCGTGACGAAGCGAAACCGCTGCAGCGTCTGAAAG



GTTTCCCGTCTTTCCCGCTGGTTGAACGTCAGGCGAACGAAGTTGACTGGTGGGACATGGTTTGCAA



CGTTAAAAAACTGATCAACGAAAAAAAAGAAGACGGTAAAGTTTTCTGGCAGAACCTGGCGGGTT



ACAAACGTCAGGAAGCGCTGCTGCCGTACCTGTCTTCTGAAGAAGACCGTAAAAAAGGTAAAAAAT



TCGCGCGTTACCAGTTCGGTGACCTGCTGCTGCACCTGGAAAAAAAACACGGTGAAGACTGGGGTA



AAGTTTACGACGAAGCGTGGGAACGTATCGACAAAAAAGTTGAAGGTCTGTCTAAACACATCAAAC



TGGAAGAAGAACGTCGTTCTGAAGACGCGCAGTCTAAAGCGGCGCTGACCGACTGGCTGCGTGCGA



AAGCGTCTTTCGTTATCGAAGGTCTGAAAGAAGCGGACAAAGACGAATTCTGCCGTTGCGAACTGA



AACTGCAGAAATGGTACGGTGACCTGCGTGGTAAACCGTTCGCGATCGAAGCGGAAAACTCTATCC



TGGACATCTCTGGTTTCTCTAAACAGTACAACTGCGCGTTCATCTGGCAGAAAGACGGTGTTAAAAA



ACTGAACCTGTACCTGATCATCAACTACTTCAAAGGTGGTAAACTGCGTTTCAAAAAAATCAAACC



GGAAGCGTTCGAAGCGAACCGTTTCTACACCGTTATCAACAAAAAATCTGGTGAAATCGTTCCGAT



GGAAGTTAACTTCAACTTCGACGACCCGAACCTGATCATCCTGCCGCTGGCGTTCGGTAAACGTCAG



GGTCGTGAATTCATCTGGAACGACCTGCTGTCTCTGGAAACCGGTTCTCTGAAACTGGCGAACGGTC



GTGTTATCGAAAAAACCCTGTACAACCGTCGTACCCGTCAGGACGAACCGGCGCTGTTCGTTGCGCT



GACCTTCGAACGTCGTGAAGTTCTGGACTCTTCTAACATCAAACCGATGAACCTGATCGGTATCGAC



CGTGGTGAAAACATCCCGGCGGTTATCGCGCTGACCGACCCGGAAGGTTGCCCGCTGTCTCGTTTCA



AAGACTCTCTGGGTAACCCGACCCACATCCTGCGTATCGGTGAATCTTACAAAGAAAAACAGCGTA



CCATCCAGGCGGCGAAAGAAGTTGAACAGCGTCGTGCGGGTGGTTACTCTCGTAAATACGCGTCTA



AAGCGAAAAACCTGGCGGACGACATGGTTCGTAACACCGCGCGTGACCTGCTGTACTACGCGGTTA



CCCAGGACGCGATGCTGATCTTCGAAAACCTGTCTCGTGGTTTCGGTCGTCAGGGTAAACGTACCTT



CATGGCGGAACGTCAGTACACCCGTATGGAAGACTGGCTGACCGCGAAACTGGCGTACGAAGGTCT



GCCGTCTAAAACCTACCTGTCTAAAACCCTGGCGCAGTACACCTCTAAAACCTGCTCTAACTGCGGT



TTCACCATCACCTCTGCGGACTACGACCGTGTTCTGGAAAAACTGAAAAAAACCGCGACCGGTTGG



ATGACCACCATCAACGGTAAAGAACTGAAAGTTGAAGGTCAGATCACCTACTACAACCGTTACAAA



CGTCAGAACGTTGTTAAAGACCTGTCTGTTGAACTGGACCGTCTGTCTGAAGAATCTGTTAACAACG



ACATCTCTTCTTGGACCAAAGGTCGTTCTGGTGAAGCGCTGTCTCTGCTGAAAAAACGTTTCTCTCA



CCGTCCGGTTCAGGAAAAATTCGTTTGCCTGAACTGCGGTTTCGAAACCCACGCGGACGAACAGGC



GGCGCTGAACATCGCGCGTTCTTGGCTGTTCCTGCGTTCTCAGGAATACAAAAAATACCAGACCAA



CAAAACCACCGGTAACACCGACAAACGTGCGTTCGTTGAAACCTGGCAGTCTTTCTACCGTAAAAA



ACTGAAAGAAGTTTGGAAACCGGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAAT



GTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA





SEQ ID NO: 81
tgccgtcactgcgtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagc



attctgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataatcacggc



agaaaagtccacattgattatttgcacggcgtcacactttgctatgccatagcatttttatccata



agattagcggatcctacctgacgctttttatcgcaactctctactgtttctccatacccgtttttt



tgggctagcaccgcctatctcgtgtgagataggcggagatacgaactttaagAAGGAGatatacc





SEQ ID NO: 82
TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCA



TTCTGTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCA



GAAAAGTCCACATTGATTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAG



ATTAGCGGATCCTACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGG



TAGCGGATCCTACCTGAC





SEQ ID NO: 83
AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTTCTAGAGCACA



GCTAACACCACGTCGTCCCTATCTGCTGCCCTAGGTCTATGAGTGGTTGCTGGATAACTTTACGGGC



ATGCATAAGGCTCGTAATATATATTCAGGGAGACCACAACGGTTTCCCTCTACAAATAATTTTGTTT



AACTTTTACTAGAGCTAGCAGTAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGG



AGATACGAACTTTAAGAGGAGGATATACCA





SEQ ID NO: 84
GTTTGAGAGATATGTAAATTCAAAGGATAATCAAAC






SEQ ID NO: 85
actacattttttaagacctaattttgagt






SEQ ID NO: 86
ctcaaaactcattcgaatctctactctttgtagat






SEQ ID NO: 87
CTCTAGCAGGCCTGGCAAATTTCTACTGTTGTAGAT






SEQ ID NO: 88
CCGTCTAAAACTCATTCAGAATTTCTACTAGTGTAGAT






SEQ ID NO: 89
GTCTAGGTACTCTCTTTAATTTCTACTATTGT






SEQ ID NO: 90
gttaagttatatagaataatttctactgttgtaga






SEQ ID NO: 91
gtttaaaaccactttaaaatttctactattgta






SEQ ID NO: 92
GTTTGAGAATGATGTAAAAATGTATGGTACACAGAAATGTTTTAATACCATATTTTTACATCACTCT



CAAACATACATCTCTTGTTACTGTTTATCGTATCCAGATTAAATTTCACGTTTTT






SEQ ID NO: 93
CTCTACAACTGATAAAGAATTTCTACTTTTGTAGAT






SEQ ID NO: 94
GTCTGGCCCCAAATTTTAATTTCTACTGTTGTAGAT






SEQ ID NO: 95
GTCAAAAGACCTTTTTAATTTCTACTCTTGTAGAT






SEQ ID NO: 96
GTCTAGAGGACAGAATTTTTCAACGGGTGTGCCAATGGCCACTTTCCAGGTGGCAAAGCCCGTTGA



GCTTCTACGGAAGTGGCAC






SEQ ID NO: 97
CGAGGTTCTGTCTTTTGGTCAGGACAACCGTCTAGCTATAAGTGCTGCAGGGGTGTGAGAAACTCCT



ATTGCTGGACGATGTCTCTTTTAACGAGGCATTAGCAC






SEQ ID NO: 98
GAACGAGGGACGTTTTGTCTCCAATGATTTTGCTATGACGACCTCGAACTGTGCCTTCAAGTCTGAG



GCGAAAAAGAAATGGAAAAAAGTGTCTCATCGCTCTACCTCGTAGTTAGAGG






SEQ ID NO: 99
AATTACTGATGTTGTGATGAAGG






SEQ ID NO: 100
TATACCATAAGGATTTAAAGACT






SEQ ID NO: 101
GTCTTTACTCTCACCTTTCCACCTG






SEQ ID NO: 102
ATTTGAAGGTATCTCCGATAAGTAAAACGCATCAAAG






SEQ ID NO: 103
GTTTGAAGATATCTCCGATAAATAAGAAGCATCAAAG






SEQ ID NO: 104
TTGTTTTAATACCATATTTTTACATCACTCTCAAAC






SEQ ID NO: 105
AAAGAACGCTCGCTCAGTGTTCTGACCTTTCGAGCGCCTGTTCAGGGCGAAAACCCTGGGAGGCGC



TCGAATCATAGGTGGGACAAGGGATTCGCGGCGAAAA






SEQ ID NO: 106
GTTTGAGAATGATGTAAAAATGTATGGTACACAGAAATGTTTTAATACCATATTTTTACATCACTCT



CAAACATACATCTCTTGTTACTGTTTATCGTATCCAGATTAAATTTCACGTTTTT






SEQ ID NO: 107
GTCTAGAGGACAGAATTTTTCAACGGGTGTGCCAATGGCCACTTTCCAGGTGGCAAAGCCCGTTGA



GCTTCTACGGAAGTGGCAC






SEQ ID NO: 108
MSIYQEFVNKYSLSKTLRFELIPQGKTLENIKARGLILDDEKRAKDYKKAKQIIDKYHQFFIEEILSSVC



ISEDLLQNYSDVYFKLKKSDDDNLQKDFKSAKDTIKKQISEYIKDSEKFKNLFNQNLIDAKKGQESDLIL



WLKQSKDNGIELFKANSDITDIDEALEIIKSFKGWTTYFKGFHENRKNVYSSNDIPTSIIYRIVDDNLPK



FLENKAKYESLKDKAPEAINYEQIKKDLAEELTFDIDYKTSEVNQRVFSLDEVFEIANFNNYLNQSGITK



FNTIIGGKFVNGENTKRKGINEYINLYSQQINDKTLKKYKMSVLFKQILSDTESKSFVIDKLEDDSDVVT



TMQSFYEQIAAFKTVEEKSIKETLSLLFDDLKAQKLDLSKIYFKNDKSLTDLSQQVFDDYSVIGTAVLEY



ITQQIAPKNLDNPSKKEQELIAKKTEKAKYLSLETIKLALEEFNKHRDIDKQCRFEEILANFAAIPMIFD



EIAQNKDNLAQISIKYQNQGKKDLLQASAEDDVKAIKDLLDQTNNLLHKLKIFHISQSEDKANILDKDEH



FYLVFEECYFELANIVPLYNKIRNYITQKPYSDEKFKLNFENSTLANGWDKNKEPDNTAILFIKDDKYYL



GVMNKKNNKIFDDKAIKENKGEGYKKIVYKLLPGANKMLPKVFFSAKSIKFYNPSEDILRIRNHSTHTKN



GSPQKGYEKFEFNIEDCRKFIDFYKQSISKHPEWKDFGFRFSDTQRYNSIDEFYREVENQGYKLTFENIS



ESYIDSVVNQGKLYLFQIYNKDFSAYSKGRPNLHTLYWKALFDERNLQDVVYKLNGEAELFYRKQSIPKK



ITHPAKEAIANKNKDNPKKESVFEYDLIKDKRFTEDKFFFHCPITINFKSSGANKFNDEINLLLKEKAND



VHILSIDRGERHLAYYTLVDGKGNIIKQDTFNIIGNDRMKTNYHDKLAAIEKDRDSARKDWKKINNIKEM



KEGYLSQVVHEIAKLVIEYNAIVVFEDLNFGFKRGRFKVEKQVYQKLEKMLIEKLNYLVFKDNEFDKTGG



VLRAYQLTAPFETFKKMGKQTGIIYYVPAGFTSKICPVTGFVNQLYPKYESVSKSQEFFSKFDKICYNLD



KGYFEFSFDYKNFGDKAAKGKWTIASFGSRLINFRNSDKNHNWDTREVYPTKELEKLLKDYSIEYGHGEC



IKAAICGESDKKFFAKLTSVLNTILQMRNSKTGTELDYLISPVADVNGNFFDSRQAPKNMPQDADANGAY



HIGLKGLMLLGRIKNNQEGKKLNLVIKNEEYFEFVQNRNN





SEQ ID NO: 109
MSIYQEFVNKYSLSKTLRFELIPQGKTLENIKARGLILDDEKRAKDYKKAKQIIDKYHQFFIEEILSSVC



ISEDLLQNYSDVYFKLKKSDDDNLQKDFKSAKDTIKKQISEYIKDSEKFKNLFNQNLIDAKKGQESDLIL



WLKQSKDNGIELFKANSDITDIDEALEIIKSFKGWTTYFKGFHENRKNVYSSDDIPTSIIYRIVDDNLPK



FLENKAKYESLKDKAPEAINYEQIKKDLAEELTFDIDYKTSEVNQRVFSLDEVFEIANFNNYLNQSGITK



FNTIIGGKFVNGENTKRKGINEYINLYSQQINDKTLKKYKMSVLFKQILSDTESKSFVIDKLEDDSDVVT



TMQSFYEQIAAFKTVEEKSIKETLSLLFDDLKAQKLDLSKIYFKNDKSLTDLSQQVFDDYSVIGTAVLEY



ITQQVAPKNLDNPSKKEQDLIAKKTEKAKYLSLETIKLALEEFNKHRDIDKQCRFEEILANFAAIPMIFD



EIAQNKDNLAQISLKYQNQGKKDLLQASAEEDVKAIKDLLDQTNNLLHRLKIFHISQSEDKANILDKDEH



FYLVFEECYFELANIVPLYNKIRNYITQKPYSDEKFKLNFENSTLANGWDKNKEPDNTAILFIKDDKYYL



GVMNKKNNKIFDDKAIKENKGEGYKKIVYKLLPGANKMLPKVFFSAKSIKFYNPSEDILRIRNHSTHTKN



GNPQKGYEKFEFNIEDCRKFIDFYKESISKHPEWKDFGFRFSDTQRYNSIDEFYREVENQGYKLTFENIS



ESYIDSVVNQGKLYLFQIYNKDFSAYSKGRPNLHTLYWKALFDERNLQDVVYKLNGEAELFYRKQSIPKK



ITHPAKEAIANKNKDNPKKESVFEYDLIKDKRFTEDKFFFHCPITINFKSSGANKFNDEINLLLKEKAND



VHILSIDRGERHLAYYTLVDGKGNIIKQDTFNIIGNDRMKTNYHDKLAAIEKDRDSARKDWKKINNIKEM



KEGYLSQVVHEIAKLVIEHNAIVVFEDLNFGFKRGRFKVEKQVYQKLEKMLIEKLNYLVFKDNEFDKTGG



VLRAYQLTAPFETFKKMGKQTGIIYYVPAGFTSKICPVTGFVNQLYPKYESVSKSQEFFSKFDKICYNLD



KGYFEFSFDYKNFGDKAAKGKWTIASFGSRLINFRNSDKNHNWDTREVYPTKELEKLLKDYSIEYGHGEC



IKAAICGESDKKFFAKLTSVLNTILQMRNSKTGTELDYLISPVADVNGNFFDSRQAPKNMPQDADANGAY



HIGLKGLMLLDRIKNNQEGKKLNLVIKNEEYFEFVQNRNN





SEQ ID NO: 110
MDKKYSIGLDIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTAR
RRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHLR
KKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINASGVDAK
AILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDLDNLL



AQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEHHQDLTLLKALVRQQLPEKYKE



IFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNREDLLRKQRTFDNGSIPHQIHLGELH



AILRRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGASAQSF



IERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVT



VKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKIIKDKDFLDNEENEDILEDIVLTLTLFEDREMI



EERLKTYAHLFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDS



LTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENIVIEMARENQTT



QKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVDQELDINRLSDYDV



DHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRKFDNLTKAERG



GLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIREVKVITLKSKLVSDFRKDFQFYKVR



EINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKATAKYFFYSNIMN



FFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPQVNIVKKTEVQTGGFSKESILPKRN



SDKLIARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKGKSKKLKSVKELLGITIMERSSFEKNPIDFLEA



KGYKEVKKDLIIKLPKYSLFELENGRKRMLASAGELQKGNELALPSKYVNFLYLASHYEKLKGSPEDNE



QKQLFVEQHKHYLDEIIEQISEFSKRVILADANLDKVLSAYNKHRDKPIREQAENIIHLFTLTNLGAPAAF



KYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRIDLSQLGGD





SEQ ID NO: 111
PKKKRKV






SEQ ID NO: 112
KRPAATKKAGQAKKKK






SEQ ID NO: 113
PAAKRVKLD






SEQ ID NO: 114
RQRRNELKRSP






SEQ ID NO: 115
NQSSNFGPMKGGNFGGRSSGPYGGGGQYFAKPRNQGGY






SEQ ID NO: 116
RMRIZFKNKGKDTAELRRRRVEVSVELRKAKKDEQILKRRNV






SEQ ID NO: 117
VSRKRPRP






SEQ ID NO: 118
PPKKARED






SEQ ID NO: 119
PQPKKKPL






SEQ ID NO: 120
SALIKKKKKMAP






SEQ ID NO: 121
DRLRR






SEQ ID NO: 122
PKQKKRK






SEQ ID NO: 123
RKLKKKIKKL






SEQ ID NO: 124
REKKKFLKRR






SEQ ID NO: 125
KRKGDEVDGVDEVAKKKSKK






SEQ ID NO: 126
RKCLQAGMNLEARKTKK






SEQ ID NO: 127
ATGGGTAAGATGTATTATCTGGGTTTGGATATAGGCACTAACTCTGTGGGATATGCAGTAACTGATC
CCTCGTATCACTTGTTAAAGTTCAAAGGCGAACCCATGTGGGGAGCACATGTATTTGCTGCGGGTAA
TCAGAGTGCCGAAAGGCGATCTTTCAGAACATCCAGGAGGCGATTAGATAGGAGACAGCAAAGAG
TAAAGCTTGTGCAAGAGATCTTTGCTCCTGTCATTTCACCTATAGACCCTCGTTTTTTTATAAGATTG



CACGAATCGGCTCTATGGAGAGACGATGTTGCCGAAACAGATAAACATATCTTTTTCAATGATCCC



ACTTATACAGACAAGGAATACTACTCCGACTACCCGACAATTCATCATTTGATCGTCGATCTTATGG



AGAGCTCTGAAAAGCATGACCCCCGACTTGTCTATTTGGCTGTAGCTTGGTTAGTTGCTCATAGAGG



TCATTTCTTGAATGAAGTAGATAAAGACAATATAGGTGATGTACTTTCTTTTGATGCTTTCTACCCG



GAATTTTTGGCCTTTTTGTCAGACAATGGCGTCAGTCCCTGGGTCTGTGAGTCGAAGGCCCTTCAAG



CTACTCTGCTGTCTAGGAATAGCGTCAACGACAAATATAAAGCATTAAAATCGCTGATATTCGGATC



GCAAAAACCGGAAGATAACTTTGACGCTAACATCTCTGAAGATGGTTTAATCCAATTGCTGGCGGG



TAAGAAAGTTAAAGTAAACAAACTATTCCCACAAGAGTCCAACGATGCTAGCTTTACGTTGAATGA



TAAAGAAGACGCTATTGAAGAAATTCTAGGTACTTTAACGCCTGACGAGTGCGAATGGATCGCTCA



TATTCGCAGATTGTTCGATTGGGCCATCATGAAACACGCGCTAAAGGATGGCAGGACGATATCTGA



ATCAAAAGTGAAGCTATACGAGCAGCATCATCATGACTTGACTCAGTTAAAGTACTTTGTGAAGAC



CTACCTAGCTAAAGAGTATGATGATATCTTCAGAAACGTAGACTCCGAGACAACTAAAAATTATGT



AGCTTATTCTTACCATGTGAAGGAAGTGAAAGGCACATTACCAAAAAATAAAGCAACGCAAGAAG



AATTTTGTAAATACGTCCTTGGCAAAGTCAAAAACATTGAATGTTCCGAAGCAGACAAGGTTGATTT



TGATGAAATGATACAACGACTTACGGACAATTCTTTTATGCCAAAGCAAGTCTCAGGTGAAAATAG



AGTAATACCATACCAGTTGTACTACTATGAATTAAAGACAATTTTAAACAAAGCCGCCTCATATCTA



CCTTTTTTGACACAATGCGGTAAAGATGCTATTTCTAACCAAGACAAATTACTGTCTATAATGACAT



TTCGCATACCATATTTCGTCGGCCCTTTAAGGAAAGATAATTCAGAACATGCCTGGTTGGAACGTAA



AGCGGGTAAAATTTACCCGTGGAACTTTAATGATAAAGTAGATCTTGATAAATCGGAGGAAGCCTT



TATCCGTAGGATGACCAATACTTGCACGTATTACCCAGGAGAAGACGTGTTACCATTAGATTCACTT



ATCTATGAAAAGTTTATGATCTTGAATGAGATAAACAATATTAGGATTGACGGATACCCCATTTCTG



TTGATGTGAAACAACAAGTATTTGGTTTATTTGAGAAGAAAAGGCGAGTAACAGTTAAGGATATTC



AAAATCTACTATTATCTCTTGGAGCGTTGGATAAACACGGTAAGCTGACTGGTATTGACACGACAAT



ACACTCTAATTATAACACTTATCATCATTTTAAATCTCTTATGGAGCGGGGAGTATTGACCAGAGAT



GATGTGGAAAGAATAGTGGAAAGAATGACATATTCTGACGATACTAAGAGGGTCAGACTGTGGTTA



AATAATAATTATGGAACTCTAACAGCTGACGATGTTAAGCATATCTCAAGACTCAGAAAACACGAT



TTCGGCCGTTTGTCTAAAATGTTTTTGACAGGATTGAAAGGTGTTCATAAGGAGACAGGCGAGAGA



GCAAGTATACTGGATTTTATGTGGAATACTAACGACAATTTAATGCAACTACTGTCCGAATGTTACA



CATTCTCGGATGAGATCACCAAATTACAAGAGGCCTACTACGCAAAAGCTCAATTATCGCTAAATG



ACTTCTTGGACTCTATGTATATATCAAACGCCGTTAAGAGACCTATTTATCGGACCTTAGCGGTAGT



AAATGATATTAGAAAGGCATGCGGGACGGCACCTAAAAGAATTTTCATCGAGATGGCGCGAGATG



GAGAGTCTAAGAAGAAAAGATCTGTGACTCGTAGAGAGCAAATTAAAAATCTCTATAGATCAATTC



GTAAAGACTTTCAACAAGAAGTTGATTTTCTGGAAAAGATATTGGAAAATAAGAGTGACGGGCAGC



TTCAGTCTGACGCTTTATATTTGTATTTTGCTCAATTAGGCAGAGACATGTACACAGGTGATCCAAT



CAAATTAGAACATATTAAAGACCAATCTTTTTACAACATTGATCATATTTATCCTCAATCGATGGTG



AAAGATGACAGTTTGGATAACAAGGTACTAGTCCAAAGCGAAATCAATGGCGAAAAGAGTTCGCG



CTATCCATTAGACGCAGCCATTAGAAACAAAATGAAGCCGTTGTGGGATGCCTACTATAATCATGG



ATTAATTTCTCTTAAGAAATACCAGCGTTTGACGAGATCTACTCCATTTACGGACGACGAGAAGTGG



GATTTTATCAATCGTCAGCTAGTTGAAACTAGGCAATCTACTAAAGCTTTAGCAATATTGTTAAAGC



GTAAGTTTCCAGATACTGAAATAGTTTACTCAAAGGCTGGACTATCCAGCGATTTTAGACATGAATT



CGGCCTGGTTAAGAGTAGGAATATTAATGATCTACACCATGCTAAAGATGCCTTTCTCGCAATAGTT



ACTGGGAACGTTTATCATGAAAGATTTAATAGAAGATGGTTTATGGTTAACCAGCCATACTCTGTGA



AAACTAAGACATTGTTTACCCATTCAATTAAGAATGGCAACTTTGTCGCTTGGAATGGAGAAGAAG



ATCTTGGACGTATCGTAAAGATGTTGAAACAAAACAAGAACACAATCCACTTCACCAGGTTTTCCTT



TGATAGGAAGGAGGGATTGTTCGATATTCAACCTCTCAAAGCTTCTACCGGATTGGTTCCACGAAA



AGCAGGGTTGGATGTTGTTAAATATGGAGGATACGATAAAAGCACTGCCGCGTATTATTTATTAGT



ACGTTTTACACTCGAGGATAAGAAGACTCAACACAAATTGATGATGATTCCTGTTGAAGGTCTCTAC



AAAGCACGTATTGACCATGATAAAGAGTTTTTAACAGATTATGCTCAGACCACGATCAGCGAAATT



CTTCAAAAGGACAAGCAGAAAGTGATCAACATCATGTTCCCTATGGGCACGAGACATATCAAACTG



AATTCGATGATTTCTATTGATGGATTCTATCTTTCTATTGGTGGGAAGAGTAGCAAAGGTAAGTCAG



TACTATGTCATGCTATGGTGCCATTAATCGTCCCACACAAGATAGAATGTTATATCAAGGCTATGGA



ATCGTTTGCAAGAAAATTCAAAGAAAATAATAAATTGAGGATCGTTGAAAAGTTTGATAAAATAAC



TGTTGAAGATAACTTGAACTTATACGAGCTTTTTCTACAAAAGTTGCAACATAACCCATATAATAAA



TTTTTCTCTACACAATTTGATGTGTTGACGAACGGTAGAAGTACATTCACCAAATTGTCTCCAGAGG



AGCAAGTCCAGACTTTACTTAATATACTGAGTATATTTAAAACTTGTCGTTCTTCTGGGTGTGATTTA



AAATCAATAAATGGTTCCGCTCAAGCGGCTAGAATTATGATATCCGCTGATTTAACTGGCTTATCAA



AAAAGTATTCAGATATTAGATTAGTTGAGCAAAGCGCATCAGGTCTATTTGTTTCAAAATCTCAAAA



TCTCTTGGAATACTTGCCAAAAAAGAAAAGGAAAGTTTAG





SEQ ID NO: 128
ATGAGTAGTTTAACAAAGTTTACCAATAAATATAGTAAGCAACTAACTATAAAGAACGAATTGATA
CCGGTCGGTAAGACTTTGGAAAACATAAAAGAAAATGGGTTGATTGATGGAGACGAGCAATTGAAT
GAGAATTATCAAAAAGCAAAGATAATAGTAGATGATTTTTTGAGAGACTTTATTAATAAAGCTCTA
AATAACACTCAAATTGGTAACTGGAGAGAGCTAGCCGACGCCTTGAACAAGGAAGATGAGGATAA



TATTGAGAAATTACAAGATAAGATTAGAGGGATTATCGTGTCTAAGTTTGAGACTTTTGATCTGTTC



AGTTCGTATTCGATTAAAAAGGACGAGAAAATCATCGATGATGATAACGATGTGGAAGAAGAGGA



GCTAGACCTTGGGAAGAAGACATCTAGCTTCAAATACATATTCAAGAAAAATTTGTTCAAACTTGTC



CTTCCTTCATATTTAAAAACAACAAATCAAGATAAGTTAAAAATCATTTCTTCCTTCGATAATTTTA



GTACTTATTTTCGTGGTTTTTTCGAAAACAGGAAAAATATATTCACTAAAAAGCCTATATCTACCTC



TATAGCTTATAGAATTGTTCACGATAATTTCCCAAAATTTCTAGATAATATCAGGTGTTTTAATGTTT



GGCAAACCGAGTGTCCTCAGTTAATAGTCAAGGCCGACAACTACCTTAAAAGCAAGAATGTGATTG



CAAAAGATAAGTCTTTGGCTAACTATTTTACAGTCGGTGCCTATGATTATTTTCTGAGTCAAAATGG



TATCGATTTCTATAACAACATTATTGGCGGCTTACCAGCTTTTGCCGGGCATGAGAAGATTCAGGGT



TTGAACGAATTTATCAATCAAGAATGTCAAAAGGATTCTGAATTAAAGTCTAAGCTCAAGAATAGG



CACGCTTTCAAAATGGCAGTCTTATTCAAACAAATCCTTTCAGACAGAGAAAAGTCATTTGTGATTG



ACGAGTTCGAATCAGACGCTCAGGTAATTGATGCTGTTAAAAATTTTTACGCGGAACAATGCAAAG



ATAATAACGTCATATTTAATTTATTGAATCTGATCAAGAATATTGCTTTTTTGTCGGATGATGAGTTA



GACGGCATTTTCATAGAGGGTAAATACCTGTCCTCTGTGTCTCAAAAATTGTATAGTGATTGGTCAA



AGTTGAGAAATGATATTGAAGATTCGGCTAATTCTAAACAGGGTAACAAAGAATTAGCGAAGAAA



ATCAAAACTAACAAGGGTGATGTTGAAAAGGCTATAAGTAAGTACGAGTTCAGTTTATCTGAACTA



AATTCAATTGTTCATGATAACACAAAATTTTCCGATCTTTTATCATGCACATTACATAAAGTTGCAA



GTGAAAAATTAGTCAAAGTAAACGAAGGTGATTGGCCAAAACATCTAAAAAACAACGAGGAAAAA



CAGAAGATAAAAGAACCTCTTGACGCTTTATTGGAAATATACAATACTCTATTAATATTTAACTGTA



AAAGTTTTAACAAAAATGGTAATTTCTATGTCGACTACGATCGCTGCATTAATGAGTTGTCCAGTGT



TGTGTACTTGTATAATAAAACTCGTAATTATTGTACGAAAAAGCCGTACAACACTGACAAATTTAAG



TTGAATTTCAACTCCCCACAACTGGGTGAGGGCTTCTCTAAAAGTAAAGAGAATGATTGCCTTACAT



TATTATTTAAAAAAGATGATAATTATTATGTCGGAATCATAAGAAAGGGGGCAAAGATCAACTTCG



ATGACACTCAGGCCATAGCAGACAACACAGATAACTGTATATTCAAAATGAATTATTTTTTGCTGAA



GGATGCTAAAAAATTTATCCCCAAATGTTCAATACAATTAAAAGAGGTTAAGGCCCATTTCAAAAA



GTCGGAAGATGACTATATTTTGTCCGATAAGGAAAAATTCGCTAGTCCGCTTGTTATTAAAAAATCC



ACATTTCTTCTCGCTACGGCTCATGTGAAAGGAAAGAAGGGCAATATTAAGAAATTTCAGAAAGAA



TACTCCAAAGAAAATCCTACGGAGTATAGAAATAGTCTGAACGAATGGATAGCATTCTGCAAAGAG



TTCTTGAAGACCTATAAAGCTGCCACCATCTTTGATATTACAACTTTGAAAAAGGCCGAGGAATACG



CTGACATTGTGGAATTCTATAAGGATGTAGATAATCTTTGTTACAAGTTAGAATTTTGCCCTATCAA



AACTTCTTTTATCGAAAATCTTATAGATAATGGCGATTTATACCTGTTTAGAATTAATAACAAGGAC



TTTTCTTCAAAAAGTACAGGCACGAAAAACTTACACACATTATACTTGCAGGCTATATTTGACGAGC



GAAACTTAAACAACCCCACGATAATGTTGAATGGAGGTGCAGAGTTATTCTACAGAAAAGAATCTA



TAGAACAGAAAAATCGGATCACGCACAAAGCCGGTAGTATCTTAGTGAATAAAGTGTGCAAAGAT



GGTACAAGTCTAGATGACAAAATCCGTAACGAAATTTACCAGTATGAAAACAAATTCATTGATACT



CTTTCGGACGAAGCTAAAAAGGTTCTGCCAAACGTTATTAAGAAAGAGGCTACGCATGATATAACA



AAAGATAAACGTTTCACTAGCGACAAATTCTTCTTTCATTGTCCTTTAACAATCAACTACAAGGAAG



GTGACACCAAACAATTTAATAATGAAGTGCTCTCATTCCTTAGAGGTAACCCCGATATCAATATTAT



CGGCATTGATAGAGGAGAAAGAAACCTAATCTATGTAACAGTCATTAACCAAAAAGGCGAAATATT



GGATAGCGTCTCCTTCAATACTGTCACCAATAAGTCATCGAAGATAGAACAAACTGTTGATTACGA



AGAAAAATTGGCCGTTAGAGAAAAGGAACGTATCGAAGCGAAGAGATCTTGGGATAGCATATCCA



AGATTGCCACCTTGAAGGAGGGTTATCTAAGCGCGATCGTACATGAAATCTGCTTATTAATGATTAA



GCATAATGCTATTGTCGTGTTAGAAAACCTGAATGCCGGTTTTAAAAGGATTAGAGGTGGTTTGTCA



GAAAAGTCAGTATATCAAAAGTTTGAAAAGATGCTTATTAATAAACTCAACTACTTCGTTAGCAAG



AAAGAAAGTGATTGGAATAAACCGTCAGGTTTGCTCAATGGTCTTCAGTTAAGTGATCAATTTGAGT



CTTTCGAAAAATTAGGAATTCAAAGTGGATTCATTTTTTATGTACCAGCCGCGTACACTTCAAAAAT



TGACCCTACGACCGGATTTGCCAACGTCTTGAATTTGTCCAAGGTCAGAAATGTTGACGCCATCAAA



AGTTTTTTTAGCAACTTCAATGAAATCTCTTATTCCAAAAAGGAAGCCCTTTTCAAGTTTTCTTTTGA



CCTAGACTCGTTATCGAAGAAAGGATTTTCATCTTTCGTAAAGTTTAGCAAGTCCAAGTGGAATGTA



TACACATTCGGCGAGAGAATTATCAAGCCCAAGAACAAACAGGGCTATAGAGAAGACAAGAGAAT



CAACTTGACTTTTGAGATGAAAAAATTACTCAACGAATACAAGGTTTCATTTGATTTGGAGAACAAC



TTGATTCCCAATTTGACATCAGCTAACTTGAAGGATACGTTCTGGAAGGAGTTATTCTTTATATTCA



AAACGACATTACAACTGCGTAATAGTGTTACAAACGGTAAAGAAGATGTATTAATCTCACCTGTAA



AGAATGCCAAAGGAGAATTTTTCGTATCCGGTACTCACAATAAGACACTACCACAGGATTGCGACG



CTAACGGTGCGTATCATATTGCGTTGAAAGGATTAATGATACTTGAAAGAAATAACCTTGTTCGCGA



AGAAAAAGACACCAAGAAGATCATGGCTATTAGCAATGTTGATTGGTTTGAATACGTGCAAAAGAG



GAGAGGTGTTTTGTAA





SEQ ID NO: 129
ATGAACAATTATGACGAGTTCACAAAGCTATACCCTATCCAAAAAACTATCAGGTTCGAATTGAAA
CCACAAGGGAGAACAATGGAACATCTGGAGACATTCAACTTTTTTGAAGAGGACAGAGACAGAGC
GGAGAAATACAAAATTTTAAAAGAGGCCATCGATGAATATCACAAAAAGTTTATCGACGAGCATTT
AACAAACATGTCTTTGGACTGGAATTCACTTAAACAAATTTCTGAGAAATATTATAAGTCTCGGGAG



GAAAAAGACAAAAAGGTCTTTTTGTCCGAGCAAAAGAGAATGAGACAAGAAATTGTCTCGGAGTTT



AAAAAAGATGATCGGTTCAAAGATTTGTTTAGCAAGAAATTGTTTTCTGAATTGTTGAAGGAGGAG



ATATACAAGAAAGGCAACCATCAAGAAATAGATGCTTTGAAATCGTTTGACAAGTTCAGCGGTTAC



TTCATTGGTTTACATGAAAATAGGAAGAACATGTATAGCGACGGCGATGAGATCACCGCTATATCG



AATAGAATCGTTAACGAAAATTTTCCGAAATTTTTGGATAATTTGCAAAAATACCAGGAAGCTAGG



AAAAAGTACCCTGAATGGATAATAAAGGCGGAATCAGCTTTGGTGGCTCACAACATAAAGATGGAT



GAAGTCTTCTCGCTGGAATATTTTAACAAAGTATTAAATCAGGAAGGAATCCAAAGATACAACTTA



GCCTTGGGTGGATACGTAACCAAATCAGGTGAGAAAATGATGGGCTTAAATGATGCACTTAATCTA



GCTCACCAATCCGAAAAGTCCTCTAAAGGGAGGATACACATGACACCATTGTTTAAGCAAATCCTT



TCGGAGAAAGAATCTTTTTCATATATCCCCGATGTTTTCACTGAGGATAGTCAATTGTTGCCCAGCA



TTGGTGGATTTTTTGCACAAATAGAAAATGATAAAGATGGTAACATCTTCGATAGAGCCTTGGAATT



GATAAGCTCCTATGCAGAATACGATACGGAACGAATATACATTAGACAAGCTGACATCAACAGAGT



AAGCAATGTTATTTTTGGTGAGTGGGGAACTTTAGGTGGATTAATGCGGGAGTACAAAGCTGACTC



AATCAATGATATTAATTTGGAACGTACGTGCAAAAAAGTCGATAAGTGGCTTGATAGTAAGGAGTT



TGCTCTGTCGGATGTACTAGAAGCAATTAAGAGAACAGGAAACAATGATGCATTTAATGAATATAT



TAGTAAAATGAGGACGGCTAGAGAAAAGATAGACGCCGCACGTAAGGAAATGAAGTTTATTTCCG



AGAAAATATCTGGCGATGAAGAGTCGATTCACATCATCAAGACCCTACTCGATTCTGTTCAGCAATT



TCTCCATTTTTTTAACCTCTTCAAAGCAAGACAAGACATTCCCTTAGATGGGGCTTTTTATGCCGAAT



TTGATGAAGTTCATTCAAAGTTGTTTGCTATTGTTCCTCTTTACAATAAGGTCCGTAATTACCTTACT



AAAAATAACTTGAACACCAAGAAAATAAAGTTAAACTTCAAGAATCCGACTCTTGCCAACGGGTGG



GATCAGAATAAAGTTTATGATTATGCTAGCTTAATATTTCTAAGAGATGGGAATTATTACTTAGGAA



TCATCAATCCAAAGCGTAAGAAAAACATTAAATTTGAACAAGGGTCAGGCAATGGCCCATTCTATA



GAAAAATGGTGTATAAGCAAATACCAGGACCTAACAAGAACTTGCCTCGCGTATTTTTAACTTCAA



CAAAGGGTAAAAAAGAATATAAACCAAGCAAAGAAATTATTGAAGGTTACGAAGCAGATAAACAC



ATCAGAGGTGATAAGTTCGATCTGGATTTCTGCCATAAATTGATTGACTTTTTTAAGGAATCTATAG



AAAAACATAAGGACTGGTCCAAATTTAATTTCTACTTCTCACCTACAGAAAGTTATGGTGACATTTC



AGAATTTTATTTAGACGTTGAGAAACAAGGATATAGGATGCATTTTGAAAATATTTCAGCGGAAAC



CATCGACGAATACGTTGAGAAGGGTGATTTATTCTTGTTCCAAATTTACAATAAAGACTTCGTTAAA



GCTGCAACCGGAAAGAAGGATATGCATACCATATATTGGAACGCTGCATTCTCGCCAGAAAACTTA



CAAGATGTCGTTGTAAAGCTTAATGGAGAAGCTGAGCTGTTCTATAGAGACAAGAGTGATATAAAA



GAGATTGTGCATCGGGAAGGTGAAATTCTGGTGAACAGAACTTACAATGGTCGTACACCCGTTCCA



GACAAAATACATAAAAAACTGACCGATTATCATAATGGTAGGACAAAGGACTTGGGCGAGGCCAA



GGAGTACCTCGATAAAGTTAGATATTTCAAGGCACACTATGATATTACGAAAGACAGGAGATATTT



AAACGATAAAATTTACTTTCATGTCCCTTTGACCCTTAACTTTAAAGCTAATGGTAAAAAGAATTTG



AACAAAATGGTAATTGAGAAGTTTTTATCGGACGAAAAAGCTCACATAATCGGAATCGACCGCGGA



GAGAGAAATTTACTGTATTATAGTATCATCGACAGAAGTGGAAAGATTATTGATCAGCAATCTTTG



AACGTCATTGATGGGTTTGACTATCGGGAAAAGTTAAATCAAAGGGAAATTGAAATGAAGGATGCG



AGACAATCATGGAATGCCATTGGTAAAATTAAAGATCTCAAGGAGGGGTACTTATCAAAAGCTGTA



CACGAGATAACTAAAATGGCTATCCAATATAATGCAATTGTTGTAATGGAAGAATTGAATTATGGT



TTTAAACGCGGCAGGTTTAAAGTCGAAAAACAAATATACCAAAAGTTTGAAAACATGTTAATTGAT



AAGATGAACTATCTTGTTTTCAAAGATGCACCTGATGAGAGTCCTGGCGGTGTGCTGAACGCCTATC



AATTAACAAACCCATTAGAGTCCTTTGCTAAACTGGGTAAACAAACTGGCATTCTATTTTATGTTCC



AGCCGCTTACACCTCAAAGATCGATCCAACGACCGGTTTTGTAAACTTATTTAATACTTCTTCCAAA



ACAAACGCGCAAGAACGCAAAGAATTCCTACAAAAATTTGAATCAATATCCTATAGCGCAAAAGAT



GGAGGTATATTCGCTTTCGCTTTTGACTACAGAAAGTTTGGCACTTCCAAGACAGATCATAAAAATG



TGTGGACCGCTTATACCAACGGAGAAAGGATGCGTTATATTAAAGAAAAAAAGAGGAACGAACTA



TTTGATCCATCGAAAGAAATTAAAGAAGCTTTGACAAGCAGCGGAATCAAATATGATGGAGGTCAA



AACATACTTCCAGATATTCTCAGATCTAATAATAACGGTCTTATTTACACGATGTATTCATCTTTTAT



CGCTGCCATCCAAATGCGTGTGTATGATGGCAAGGAAGATTATATTATATCTCCTATTAAAAATTCA



AAGGGTGAATTTTTTCGCACGGATCCAAAAAGAAGAGAGCTTCCAATTGACGCCGATGCTAACGGT



GCTTACAATATTGCATTGCGTGGTGAACTTACTATGAGAGCCATCGCCGAAAAGTTTGATCCGGACA



GTGAAAAAATGGCGAAATTGGAGCTAAAGCACAAGGATTGGTTTGAATTCATGCAGACCCGTGGCG



ATTGA





SEQ ID NO: 130
ATGACTAAAACGTTCGACTCCGAGTTTTTTAATCTCTATTCCTTGCAAAAGACCGTTAGGTTTGAATT
GAAACCAGTTGGTGAAACTGCCTCATTTGTCGAAGACTTTAAAAACGAGGGATTGAAAAGAGTGGT
TAGTGAAGATGAAAGAAGGGCAGTAGACTATCAAAAGGTTAAAGAAATCATTGACGATTACCACA
GAGATTTTATAGAAGAATCTCTGAACTATTTTCCAGAGCAGGTTTCAAAAGATGCTCTAGAGCAAG



CGTTTCATTTGTATCAAAAGTTGAAAGCAGCGAAGGTGGAAGAAAGGGAAAAAGCTTTAAAAGAA



TGGGAAGCATTACAGAAAAAATTGCGAGAAAAAGTCGTCAAATGTTTCAGCGACTCTAATAAAGCT



CGCTTTTCTAGAATCGATAAAAAAGAATTGATTAAGGAAGATTTAATAAATTGGCTGGTAGCACAA



AACAGAGAGGATGATATTCCTACTGTTGAAACGTTCAATAATTTTACTACTTACTTCACTGGTTTCC



ATGAGAACAGGAAGAATATTTACTCTAAAGATGATCACGCTACTGCTATAAGTTTTAGGTTGATTCA



CGAAAACTTGCCTAAATTTTTTGACAATGTCATCAGTTTTAACAAGTTGAAAGAAGGTTTCCCGGAA



TTAAAATTCGACAAAGTTAAAGAAGATTTAGAAGTAGATTACGACTTGAAGCATGCGTTTGAAATT



GAATATTTCGTTAATTTCGTCACACAAGCTGGTATCGACCAATATAATTACCTGCTTGGAGGCAAAA



CTCTAGAAGACGGTACGAAGAAACAAGGAATGAATGAACAGATTAATTTATTTAAGCAACAACAA



ACTCGCGATAAAGCTAGACAGATTCCAAAACTGATTCCACTTTTCAAACAGATTCTATCTGAGAGA



ACTGAATCTCAGAGTTTTATCCCTAAGCAGTTCGAGTCTGATCAGGAACTATTCGATTCCCTGCAGA



AATTGCATAACAACTGTCAAGATAAGTTTACCGTTTTGCAACAGGCGATCTTGGGATTGGCTGAGGC



AGATCTTAAAAAGGTCTTTATTAAAACTAGTGATCTAAACGCATTGTCTAACACTATTTTTGGAAAT



TATTCTGTGTTCTCAGACGCGCTCAATTTATATAAAGAGTCGCTAAAAACTAAAAAGGCTCAAGAA



GCTTTTGAAAAGTTGCCTGCACATAGTATTCATGATTTAATCCAATACTTAGAACAATTTAATTCGT



CTCTCGATGCTGAAAAGCAACAGTCTACCGATACTGTATTAAACTACTTTATTAAAACCGACGAATT



ATATAGTCGTTTCATTAAATCCACCTCTGAGGCATTCACCCAAGTACAACCTCTCTTTGAACTGGAA



GCTTTGAGCTCCAAAAGAAGACCCCCAGAAAGTGAAGATGAGGGGGCTAAAGGCCAAGAAGGTTT



CGAACAAATTAAGAGAATCAAAGCTTATCTAGACACTCTAATGGAGGCTGTCCACTTTGCTAAGCC



TTTGTATCTTGTCAAGGGTAGAAAGATGATAGAGGGTCTAGACAAGGATCAAAGCTTCTACGAAGC



GTTTGAAATGGCCTACCAGGAGTTGGAGTCTTTAATCATCCCCATTTACAATAAGGCCAGATCTTAC



CTGTCTAGGAAGCCATTTAAAGCGGATAAATTCAAAATTAATTTTGACAATAATACACTTCTATCTG



GGTGGGATGCTAACAAGGAGACGGCTAACGCCAGCATATTGTTTAAGAAGGATGGTTTATACTACC



TGGGAATCATGCCAAAAGGCAAAACTTTCTTGTTCGATTATTTCGTTAGTTCAGAAGATTCTGAAAA



GTTGAAACAACGGAGACAGAAAACCGCAGAGGAAGCGCTCGCACAGGATGGAGAATCCTATTTTG



AAAAAATACGGTATAAACTCCTACCAGGTGCTAGTAAGATGTTGCCAAAGGTATTTTTTAGCAATA



AAAATATTGGGTTTTACAATCCCTCAGATGATATTCTACGAATTCGGAATACGGCCTCTCATACTAA



GAATGGTACTCCCCAGAAGGGTCATTCCAAGGTAGAATTTAACTTGAATGACTGTCACAAAATGAT



TGATTTTTTTAAATCTTCCATACAGAAACATCCCGAGTGGGGATCCTTTGGTTTCACTTTTTCTGATA



CGTCGGACTTTGAAGATATGAGTGCTTTCTACCGAGAAGTTGAAAATCAAGGTTACGTTATAAGTTT



TGATAAAATAAAAGAAACTTACATTCAGTCTCAAGTTGAGCAAGGTAACTTATATTTATTTCAAATT



TACAACAAAGATTTTAGTCCGTATTCAAAGGGAAAGCCAAACCTGCACACTTTATACTGGAAAGCT



CTGTTTGAAGAGGCTAATTTGAATAACGTAGTGGCTAAGCTAAACGGCGAAGCAGAAATCTTTTTC



AGAAGACACAGTATCAAAGCATCTGATAAAGTGGTACATCCTGCTAATCAAGCTATAGATAATAAG



AATCCCCATACTGAGAAGACGCAGTCCACATTTGAATATGACTTGGTCAAAGACAAAAGATATACC



CAAGACAAATTTTTTTTTCATGTACCGATATCTTTAAACTTTAAGGCTCAGGGCGTTTCAAAGTTTAA



TGATAAGGTAAATGGATTCTTAAAGGGCAATCCCGACGTTAATATAATCGGTATAGATCGAGGTGA



GAGACATCTTTTATACTTTACCGTGGTGAATCAAAAAGGAGAAATATTAGTGCAAGAGTCCTTGAA



TACATTAATGTCTGACAAGGGTCATGTCAACGATTATCAACAGAAATTGGACAAGAAGGAACAGGA



AAGGGACGCTGCCAGGAAGTCCTGGACGACAGTAGAAAATATTAAAGAATTAAAAGAAGGTTATT



TATCACATGTGGTTCATAAACTTGCACATTTAATCATCAAATATAACGCAATAGTGTGCTTGGAAGA



TCTTAATTTTGGCTTCAAGAGGGGTAGGTTCAAGGTCGAAAAACAGGTCTACCAGAAGTTCGAGAA



AGCTCTGATCGATAAATTGAATTATCTTGTTTTCAAAGAAAAAGAATTAGGAGAAGTTGGTCATTAT



CTTACAGCATACCAACTCACTGCACCATTTGAAAGCTTCAAAAAGCTAGGCAAGCAATCTGGGATT



TTGTTCTATGTTCCGGCTGATTATACATCAAAGATAGATCCTACCACAGGCTTTGTAAATTTTTTAGA



TCTTAGGTACCAATCCGTTGAAAAAGCTAAACAGTTGCTGTCCGATTTTAATGCGATAAGATTTAAT



AGTGTTCAGAATTATTTTGAGTTCGAAATTGATTATAAAAAATTGACACCAAAACGTAAAGTAGGA



ACACAATCTAAATGGGTTATTTGTACCTATGGAGATGTTAGATACCAAAACAGAAGAAATCAGAAA



GGTCACTGGGAAACTGAAGAAGTTAACGTTACTGAAAAACTTAAAGCTCTATTTGCGAGCGATTCA



AAAACGACGACGGTGATCGATTATGCAAATGATGATAACCTTATTGATGTAATTCTGGAACAAGAT



AAGGCATCATTTTTTAAAGAACTACTATGGTTGTTAAAGCTAACCATGACCCTAAGGCACTCCAAGA



TAAAGTCAGAGGATGATTTTATCCTCTCTCCAGTGAAAAACGAACAAGGTGAGTTTTACGACTCAA



GAAAGGCGGGTGAAGTCTGGCCTAAGGATGCTGATGCCAATGGAGCTTATCACATCGCTCTGAAGG



GGCTATGGAACTTACAGCAAATTAACCAATGGGAAAAAGGTAAAACTTTAAACCTCGCCATAAAGA



ACCAGGATTGGTTCAGCTTTATCCAAGAAAAACCATATCAAGAATAA





SEQ ID NO: 131
ATGCACACAGGAGGTCTACTCTCGATGGATGCTAAGGAATTTACCGGTCAATATCCGCTGTCCAAA
ACTTTGCGTTTTGAGCTTAGACCTATTGGCCGAACGTGGGATAACCTAGAGGCTTCTGGTTATTTGG
CGGAAGATAGACATAGAGCTGAGTGTTATCCCCGAGCTAAAGAATTGCTGGATGATAACCACAGGG
CGTTCCTGAATAGAGTTCTACCGCAAATCGATATGGATTGGCATCCAATTGCTGAAGCTTTCTGCAA



GGTGCACAAAAATCCAGGTAATAAAGAATTGGCTCAGGATTATAATTTGCAGCTTAGTAAGAGAAG



AAAAGAAATTTCCGCTTATTTGCAGGATGCTGATGGATACAAGGGGTTGTTCGCGAAACCTGCCCT



GGACGAAGCTATGAAAATAGCTAAGGAAAACGGCAATGAATCTGATATTGAAGTTTTGGAAGCCTT



CAATGGATTTTCCGTTTATTTCACTGGTTATCATGAGAGTAGGGAGAATATATACTCAGACGAAGAT



ATGGTATCCGTCGCCTATCGCATAACTGAAGATAATTTTCCAAGGTTCGTGTCGAACGCGTTAATTT



TTGATAAACTAAATGAATCGCACCCGGATATTATTTCGGAAGTGTCCGGTAATCTGGGGGTAGACG



ATATTGGTAAATATTTTGATGTGTCCAACTACAATAATTTCCTTAGTCAAGCAGGAATTGATGACTA



CAACCATATTATAGGAGGGCATACAACTGAAGACGGTCTCATTCAAGCTTTTAACGTAGTGTTAAA



CCTAAGGCACCAAAAAGACCCAGGTTTTGAGAAAATTCAATTTAAGCAACTCTACAAGCAGATACT



GAGCGTTAGGACTAGTAAGTCATATATCCCAAAGCAATTCGATAACTCAAAGGAAATGGTCGACTG



TATATGCGACTACGTCTCAAAAATAGAAAAATCTGAAACAGTAGAAAGAGCTCTGAAATTGGTAAG



AAATATATCTTCTTTTGATTTAAGAGGTATTTTCGTAAATAAAAAAAACCTTCGAATTTTGTCTAATA



AGTTAATTGGAGACTGGGACGCAATAGAGACAGCTTTGATGCACAGTTCCAGCAGTGAAAACGATA



AGAAATCAGTGTATGACTCTGCAGAGGCATTCACCCTTGATGATATCTTCAGTTCTGTGAAAAAGTT



CAGCGACGCCTCCGCTGAGGATATAGGAAACCGCGCTGAAGACATATGTCGTGTTATCTCAGAAAC



AGCTCCTTTCATTAACGACTTAAGGGCTGTAGATTTGGATTCTTTAAATGATGACGGCTATGAAGCG



GCCGTGTCTAAAATACGGGAATCTCTTGAACCCTACATGGATCTATTTCACGAATTGGAGATCTTTA



GCGTGGGTGATGAGTTTCCTAAATGTGCTGCCTTTTATAGCGAGTTGGAAGAGGTCTCAGAACAACT



GATTGAAATCATTCCTTTATTTAACAAAGCAAGAAGTTTTTGCACAAGGAAAAGGTATTCAACCGA



CAAAATCAAAGTCAATTTAAAATTCCCTACTCTGGCAGATGGATGGGATCTAAATAAAGAAAGGGA



TAACAAAGCCGCAATTCTAAGAAAAGACGGTAAATACTACCTGGCAATTTTAGACATGAAGAAAGA



TCTCAGTAGTATTCGTACGAGCGATGAGGACGAGTCTTCTTTTGAAAAGATGGAATATAAATTGCTC



CCTTCTCCTGTGAAAATGCTTCCAAAAATTTTTGTTAAATCGAAAGCCGCCAAAGAAAAGTACGGGT



TGACCGATAGAATGTTAGAATGCTACGATAAAGGTATGCATAAGTCGGGTAGTGCTTTTGATTTGG



GTTTTTGTCATGAATTGATCGATTACTATAAGCGCTGCATTGCCGAGTACCCAGGCTGGGATGTTTT



CGACTTTAAATTTCGTGAGACAAGCGATTACGGATCCATGAAAGAATTTAATGAAGACGTCGCTGG



CGCAGGTTACTATATGTCACTTAGAAAGATTCCATGTTCCGAAGTTTATCGTTTACTGGACGAGAAG



TCAATTTACTTGTTTCAAATATATAATAAGGATTATAGCGAAAACGCACATGGGAATAAGAATATG



CATACGATGTATTGGGAGGGCTTGTTCTCACCACAAAATTTGGAATCACCAGTCTTCAAATTGTCCG



GAGGCGCAGAACTTTTTTTCAGAAAGTCATCTATTCCTAATGACGCTAAAACGGTACATCCGAAAG



GTTCAGTTCTTGTTCCCAGAAACGACGTCAATGGTAGAAGAATACCAGACTCGATCTACAGAGAGT



TGACAAGGTATTTTAACCGTGGGGATTGCAGGATCAGTGATGAAGCTAAGTCTTACCTGGACAAGG



TCAAGACAAAAAAAGCGGACCATGACATTGTTAAGGATAGAAGATTTACTGTAGATAAGATGATGT



TCCATGTTCCGATTGCCATGAATTTTAAAGCTATAAGTAAACCAAATCTTAATAAGAAAGTTATTGA



TGGCATAATAGATGATCAAGATTTGAAAATCATCGGTATCGATCGTGGTGAGAGAAATCTTATTTAT



GTGACCATGGTCGATAGGAAGGGGAATATATTGTATCAAGACAGTCTTAATATTTTAAATGGATAC



GATTACCGCAAAGCTTTAGACGTGAGGGAATATGATAACAAAGAAGCTAGAAGGAATTGGACTAA



AGTAGAAGGTATTAGAAAAATGAAAGAAGGTTATTTATCTTTAGCTGTTAGTAAATTGGCCGATAT



GATCATCGAAAATAATGCTATAATCGTAATGGAAGATTTGAATCACGGGTTTAAGGCAGGTCGTTC



CAAAATTGAAAAGCAGGTGTATCAAAAATTCGAATCAATGTTAATCAACAAGTTAGGATACATGGT



GCTAAAAGACAAGTCCATTGACCAGTCTGGTGGAGCCCTTCATGGTTACCAATTAGCCAATCATGTT



ACGACCTTAGCTAGCGTGGGTAAACAATGTGGAGTAATTTTTTACATACCTGCAGCTTTTACTTCGA



AGATTGATCCCACCACGGGCTTTGCTGATTTATTCGCTCTCTCTAATGTGAAGAATGTCGCTTCTATG



AGAGAGTTCTTCTCCAAAATGAAGTCAGTAATATATGACAAGGCGGAAGGCAAATTCGCCTTTACA



TTTGATTATTTGGATTATAACGTTAAAAGCGAATGTGGACGTACCTTATGGACTGTGTATACAGTTG



GTGAACGCTTCACCTACTCTAGAGTAAACCGAGAGTATGTTCGGAAAGTCCCAACAGATATCATCT



ATGATGCATTACAAAAAGCTGGTATTAGCGTCGAAGGTGACCTTAGAGATAGAATCGCGGAAAGCG



ACGGTGACACATTAAAGTCTATATTCTACGCTTTTAAATACGCGTTGGATATGAGAGTCGAAAACA



GAGAGGAAGACTATATACAGTCACCTGTGAAGAATGCTTCTGGTGAGTTCTTTTGTTCAAAAAACG



CCGGAAAGTCTTTGCCGCAGGATTCAGATGCAAATGGTGCCTATAATATAGCTCTGAAAGGGATCC



TACAACTCAGAATGTTGAGCGAACAATACGATCCAAATGCAGAATCGATTAGATTGCCACTTATAA



CTAACAAGGCATGGTTAACTTTTATGCAATCCGGTATGAAAACTTGGAAGAATTAA





SEQ ID NO: 132
ATGGATTCTCTTAAGGATTTCACTAATTTATATCCAGTCTCGAAAACATTGCGGTTCGAATTGAAAC
CAGTTGGGAAAACTCTAGAAAACATTGAAAAAGCCGGTATATTGAAAGAAGATGAACACAGAGCG
GAATCCTACCGCCGGGTAAAAAAGATAATTGACACATACCATAAAGTGTTTATTGACAGCTCCTTA
GAGAACATGGCTAAAATGGGGATAGAAAATGAAATCAAGGCTATGCTGCAGTCTTTTTGTGAACTC



TATAAGAAAGACCACAGGACAGAAGGAGAAGATAAAGCTCTTGATAAAATTAGAGCTGTTCTTAG



AGGTTTAATCGTTGGGGCTTTCACTGGTGTATGTGGAAGACGAGAAAACACAGTACAAAATGAAAA



GTACGAGAGTTTGTTCAAAGAAAAATTGATAAAGGAAATTTTGCCAGATTTCGTGTTGTCCACCGA



GGCTGAGTCTCTTCCATTCAGCGTTGAAGAAGCAACAAGGAGCTTAAAAGAGTTTGACTCATTCACT



TCTTATTTTGCTGGTTTTTACGAAAATAGAAAGAATATTTATTCCACGAAACCGCAAAGTACTGCGA



TAGCCTACAGATTAATTCATGAAAACTTGCCTAAATTTATAGATAATATTTTGGTCTTCCAGAAGAT



TAAAGAACCAATCGCTAAAGAACTTGAGCACATAAGAGCAGATTTTAGCGCAGGCGGATATATCAA



AAAAGATGAACGGCTAGAAGACATATTCTCATTAAATTACTACATTCATGTCCTTTCTCAAGCTGGT



ATAGAAAAATATAATGCTTTAATCGGGAAGATAGTGACGGAAGGTGATGGTGAAATGAAAGGTCTT



AATGAACATATTAACTTATATAACCAACAGAGGGGTCGAGAGGATAGGTTGCCCTTGTTTAGGCCT



CTATACAAGCAAATCCTGTCCGATAGAGAGCAATTGTCTTATTTACCTGAATCATTTGAAAAAGATG



AAGAGCTGCTTAGAGCACTTAAGGAATTTTACGATCACATCGCCGAAGACATCTTGGGTAGAACAC



AGCAATTGATGACTTCAATTTCTGAATACGACTTGTCCCGTATTTATGTCAGAAATGATTCTCAACTT



ACAGACATCTCGAAGAAAATGCTAGGAGATTGGAACGCCATTTATATGGCTAGAGAACGAGCCTAC



GACCACGAACAGGCTCCTAAACGTATTACTGCTAAATACGAACGTGATAGAATCAAGGCCTTAAAA



GGTGAAGAGTCAATTTCATTGGCGAATCTGAACAGCTGTATAGCTTTCTTGGACAATGTAAGGGATT



GTCGAGTTGACACATACCTATCAACTTTGGGGCAGAAAGAGGGTCCTCATGGCTTAAGTAACTTGG



TGGAAAACGTCTTCGCCTCATATCATGAAGCAGAACAGTTATTGTCGTTTCCTTACCCCGAAGAGAA



CAACCTTATTCAGGACAAAGACAATGTAGTTTTGATCAAAAACCTATTGGATAATATAAGTGATTTA



CAACGTTTCCTTAAACCTTTGTGGGGAATGGGCGATGAACCTGACAAAGACGAAAGGTTTTACGGT



GAATACAACTATATTAGAGGAGCGCTTGACCAGGTAATACCTTTGTACAATAAAGTAAGGAACTAC



TTGACTCGTAAACCATATTCTACTAGAAAAGTTAAATTGAACTTTGGTAATTCACAGCTGCTGAGTG



GTTGGGATCGTAATAAAGAAAAAGATAACTCCTGTGTTATCTTGCGAAAAGGACAAAACTTTTACT



TGGCAATTATGAACAACCGTCACAAAAGGTCCTTCGAGAACAAAGTTCTGCCTGAATACAAAGAAG



GTGAACCATATTTTGAAAAAATGGACTATAAATTCCTGCCAGATCCTAATAAAATGTTGCCTAAGGT



CTTCTTGTCTAAAAAAGGTATAGAAATATATAAACCATCCCCGAAGTTGCTGGAGCAATATGGTCAT



GGAACGCACAAAAAAGGTGACACTTTTAGTATGGATGACTTGCACGAGTTGATTGATTTTTTTAAAC



ATTCCATTGAAGCGCACGAAGATTGGAAACAATTTGGTTTCAAGTTCTCTGACACAGCCACTTACGA



AAATGTATCGTCCTTTTATAGAGAAGTGGAAGATCAGGGTTATAAACTGTCATTCCGTAAGGTTAGT



GAAAGCTATGTGTACTCGTTGATCGATCAAGGGAAGCTTTATCTTTTTCAAATCTATAATAAAGATT



TCTCTCCTTGTTCAAAGGGCACACCTAATCTTCATACACTATACTGGAGAATGCTTTTCGATGAAAG



AAATTTGGCTGATGTGATCTATAAATTAGACGGTAAAGCTGAGATTTTTTTCAGAGAGAAATCCCTG



AAAAACGACCATCCAACTCATCCGGCAGGTAAACCGATTAAAAAGAAATCCCGGCAAAAAAAGGG



CGAAGAGAGTTTATTCGAGTATGATTTAGTTAAGGACAGACATTATACAATGGACAAATTTCAATTT



CATGTGCCCATTACTATGAACTTTAAGTGTAGTGCAGGGTCTAAGGTTAATGATATGGTAAACGCAC



ATATTAGAGAAGCTAAAGATATGCACGTCATCGGTATTGATCGCGGAGAAAGAAATTTACTTTACA



TTTGCGTTATCGATTCTAGGGGCACCATCTTGGATCAAATCTCTTTGAACACTATAAATGATATTGA



CTATCATGATCTACTAGAGAGTCGGGATAAAGACAGGCAACAAGAAAGAAGAAATTGGCAAACAA



TTGAAGGTATTAAAGAATTAAAGCAAGGCTATCTAAGCCAGGCTGTACACAGAATTGCCGAATTAA



TGGTAGCATATAAAGCTGTCGTAGCTCTAGAAGACTTGAACATGGGTTTCAAAAGAGGGCGCCAGA



AGGTCGAAAGTAGTGTTTATCAACAATTTGAAAAACAGTTAATAGATAAGTTGAATTATCTAGTGG



ATAAAAAAAAGCGTCCTGAGGACATTGGCGGTTTATTAAGAGCCTACCAATTCACTGCGCCATTTA



AATCGTTCAAAGAAATGGGTAAACAAAACGGTTTTCTATTCTACATCCCCGCATGGAATACCTCAA



ATATAGATCCAACTACCGGTTTCGTCAACTTATTTCATGCTCAATATGAGAATGTGGACAAAGCAAA



ATCATTCTTTCAAAAATTTGATAGCATTAGCTACAATCCTAAAAAAGATTGGTTTGAATTTGCGTTC



GATTATAAAAATTTCACCAAGAAGGCTGAAGGTTCCAGATCTATGTGGATATTGTGCACCCACGGA



AGTAGAATTAAGAACTTCCGTAATTCACAGAAAAACGGCCAGTGGGACAGCGAAGAATTCGCCCTA



ACCGAAGCTTTCAAAAGTCTTTTCGTAAGATACGAGATAGACTATACAGCTGATCTAAAGACAGCT



ATTGTGGATGAGAAGCAAAAAGACTTCTTTGTCGACCTTCTTAAGTTGTTCAAGTTAACTGTGCAGA



TGAGAAATAGTTGGAAGGAAAAAGACCTAGATTACTTGATTAGCCCAGTCGCTGGTGCAGATGGCA



GATTTTTTGATACACGTGAAGGCAATAAATCACTACCAAAAGACGCGGACGCTAATGGCGCATACA



ACATCGCATTGAAGGGTTTGTGGGCTCTCAGGCAGATTAGGCAGACAAGTGAGGGTGGTAAGCTTA



AGCTGGCGATTTCTAATAAGGAATGGTTACAGTTTGTTCAAGAAAGATCCTACGAAAAAGATTAA





SEQ ID NO: 133
ATGAACAATGGTACTAATAATTTTCAAAACTTCATAGGGATTTCTAGCCTTCAAAAGACATTGAGAA
ATGCTTTAATTCCAACAGAAACGACTCAACAATTCATAGTGAAAAATGGTATTATAAAAGAAGACG
AGTTGCGTGGCGAGAATAGACAAATTTTGAAAGATATCATGGATGACTACTACAGAGGGTTCATCT
CCGAAACATTGTCTTCTATTGACGACATTGACTGGACCAGCTTATTCGAAAAAATGGAAATACAGCT



GAAGAACGGAGATAACAAGGACACTCTTATAAAGGAGCAAACGGAATATAGAAAGGCTATACACA



AAAAGTTTGCTAATGACGATAGATTTAAAAACATGTTTAGTGCGAAGTTAATTTCTGATATTCTACC



CGAGTTTGTCATTCATAATAATAACTACTCTGCATCTGAAAAAGAGGAGAAGACCCAGGTTATAAA



GTTGTTTTCAAGATTTGCCACATCATTTAAAGACTACTTCAAGAACAGGGCGAATTGCTTCTCTGCT



GATGATATTAGCTCTTCCAGCTGTCATAGAATTGTTAACGATAATGCCGAAATTTTTTTTAGTAATG



CCTTGGTATATAGACGCATAGTCAAGTCACTAAGCAATGATGATATAAACAAGATTAGTGGTGATA



TGAAAGATAGCCTTAAAGAAATGAGCCTTGAAGAGATATATTCATATGAGAAGTACGGTGAATTTA



TAACTCAAGAAGGAATTTCTTTTTATAACGATATTTGTGGTAAGGTTAATTCTTTTATGAATTTGTAT



TGCCAGAAGAACAAGGAAAATAAGAATCTATATAAACTACAAAAGTTGCATAAACAGATTTTGTGT



ATAGCTGATACATCCTACGAAGTTCCGTATAAATTTGAATCTGATGAGGAAGTTTATCAATCGGTAA



ACGGTTTTCTTGACAACATTTCCAGCAAACATATCGTTGAGAGACTACGTAAAATTGGAGACAACT



ATAATGGTTACAATCTAGATAAAATATACATAGTGTCCAAGTTTTATGAGTCTGTCTCTCAAAAGAC



ATATCGTGATTGGGAGACCATTAATACTGCACTTGAAATTCATTATAACAACATATTGCCTGGTAAC



GGGAAGAGTAAAGCTGATAAGGTTAAAAAGGCCGTCAAAAACGACTTGCAAAAGTCTATTACCGA



GATAAATGAATTAGTGTCAAACTACAAACTATGCTCAGATGATAATATTAAAGCGGAAACATACAT



CCACGAAATTTCCCACATACTGAATAACTTTGAAGCTCAGGAGCTTAAATATAACCCGGAAATACA



CTTGGTTGAGAGCGAGTTAAAAGCATCTGAGTTGAAAAATGTATTAGACGTCATCATGAATGCGTTT



CATTGGTGTTCAGTTTTCATGACTGAAGAATTAGTCGACAAAGATAACAATTTTTATGCCGAATTAG



AGGAAATATATGATGAAATTTATCCCGTAATTAGTTTATACAATCTAGTTAGAAATTATGTTACACA



AAAGCCGTATAGTACCAAGAAAATAAAGCTTAATTTCGGAATACCTACGCTTGCTGATGGTTGGTC



AAAAAGTAAAGAATATAGCAATAATGCAATAATTTTAATGAGAGATAACCTATATTATTTGGGTAT



TTTTAACGCTAAGAACAAACCAGACAAGAAAATAATTGAAGGTAATACATCTGAAAACAAGGGCG



ACTATAAAAAGATGATATACAATTTGCTCCCAGGTCCTAATAAAATGATTCCTAAGGTTTTCCTGAG



TAGCAAGACTGGCGTTGAAACTTACAAGCCTAGTGCGTATATCCTGGAGGGTTATAAACAGAACAA



GCATATCAAATCCTCTAAGGACTTCGATATCACCTTTTGCCATGACTTAATCGATTATTTTAAAAATT



GTATCGCAATTCATCCAGAATGGAAAAATTTCGGATTTGATTTTAGTGATACCAGCACTTACGAGGA



TATCTCTGGGTTCTACAGAGAAGTGGAGTTGCAGGGCTACAAAATCGATTGGACTTACATATCTGA



AAAGGACATAGATTTGCTGCAGGAGAAAGGTCAGCTATATTTGTTTCAAATCTACAACAAAGACTT



TTCTAAAAAGTCTACCGGTAATGACAATCTGCACACAATGTACTTGAAGAACTTATTCTCCGAGGAG



AACTTAAAGGACATTGTACTCAAGTTGAATGGAGAAGCCGAGATTTTTTTTAGAAAGAGCAGTATA



AAGAATCCTATAATCCACAAGAAGGGCTCAATTCTCGTGAATAGGACGTATGAGGCAGAAGAAAA



GGACCAATTTGGGAATATACAAATTGTAAGAAAAAACATCCCAGAAAATATCTACCAGGAATTATA



TAAGTATTTTAATGACAAATCTGATAAGGAACTGTCTGACGAAGCCGCTAAGCTCAAGAATGTTGT



GGGCCACCATGAAGCTGCTACTAATATAGTGAAGGACTACAGATATACCTACGATAAATATTTCCT



GCATATGCCAATTACTATAAACTTCAAAGCAAATAAAACAGGTTTTATAAATGATAGAATCCTGCA



GTATATTGCTAAAGAAAAGGATTTACATGTAATTGGGATTGATAGAGGTGAACGCAATCTGATCTA



TGTCAGCGTAATAGATACTTGTGGTAATATTGTGGAACAAAAGTCCTTTAATATTGTGAACGGATAT



GATTACCAAATCAAGTTGAAACAACAAGAGGGAGCACGCCAAATTGCCCGTAAGGAATGGAAAGA



GATAGGTAAGATCAAGGAAATTAAGGAAGGTTATCTTTCATTAGTTATTCACGAAATTTCGAAGAT



GGTAATCAAATACAACGCAATAATTGCTATGGAGGACCTGTCATATGGATTTAAGAAAGGTAGATT



CAAGGTTGAGAGACAGGTATACCAGAAATTTGAAACTATGTTGATCAACAAATTAAATTACTTAGT



CTTTAAGGACATATCAATAACGGAAAACGGCGGGCTTTTAAAAGGGTATCAACTTACATACATACC



TGATAAGTTGAAAAATGTGGGTCATCAGTGTGGGTGCATCTTTTATGTTCCAGCCGCTTACACATCA



AAAATCGATCCTACTACTGGGTTCGTAAACATATTTAAATTTAAAGATCTAACCGTTGATGCAAAAA



GAGAGTTTATCAAGAAATTTGATAGCATTAGGTACGATTCAGAAAAAAATCTATTCTGTTTTACTTT



TGACTACAACAACTTTATAACGCAGAATACAGTGATGTCAAAATCGTCCTGGTCAGTGTATACTTAT



GGTGTTAGAATTAAGAGACGTTTCGTAAACGGTCGTTTTTCTAACGAGTCCGATACAATCGACATCA



CTAAAGATATGGAAAAAACTTTGGAAATGACAGATATAAACTGGAGAGATGGTCACGACCTTAGAC



AAGATATAATCGATTATGAAATCGTACAGCATATTTTTGAAATTTTTCGCTTAACAGTTCAGATGCG



TAACTCTCTTAGTGAGCTAGAAGATAGAGATTATGATAGACTTATCTCGCCTGTTCTTAACGAAAAT



AATATCTTCTATGACTCGGCAAAAGCCGGTGATGCACTTCCAAAAGATGCTGATGCAAATGGCGCG



TACTGCATCGCATTGAAGGGGCTCTACGAGATTAAACAAATCACCGAAAACTGGAAAGAAGATGGT



AAATTTTCTAGGGATAAGTTGAAAATCAGTAATAAAGATTGGTTCGATTTTATACAAAATAAGCGA



TACTTATAG





SEQ ID NO: 134
ATGACCAATAAGTTTACTAATCAATACTCATTGTCTAAAACGTTAAGATTCGAGTTAATTCCCCAGG
GAAAGACACTAGAATTTATTCAAGAAAAAGGTCTTCTCTCTCAGGATAAACAAAGAGCAGAATCAT
ACCAGGAGATGAAAAAAACCATAGATAAATTTCATAAGTACTTCATCGACTTGGCACTATCGAACG
CCAAGCTAACACATTTGGAAACCTACCTGGAGTTGTATAATAAATCGGCAGAGACGAAAAAGGAAC



AAAAATTCAAGGATGACCTGAAGAAGGTTCAAGATAATCTGCGAAAGGAAATAGTGAAGTCGTTTA



GTGATGGTGATGCAAAGTCAATCTTTGCTATTTTAGACAAGAAGGAATTAATAACCGTGGAACTTG



AAAAGTGGTTTGAAAATAACGAACAGAAAGATATTTACTTCGACGAAAAATTTAAAACGTTTACTA



CGTACTTTACAGGGTTCCATCAGAACCGCAAAAACATGTACTCCGTTGAACCAAACTCTACTGCAAT



CGCCTACAGATTAATACACGAAAATTTGCCTAAGTTTTTAGAAAATGCAAAGGCTTTTGAAAAGAT



AAAGCAAGTCGAATCGTTACAGGTAAACTTTCGCGAATTAATGGGCGAATTTGGAGATGAAGGTCT



TATTTTTGTCAATGAATTAGAGGAAATGTTTCAAATTAATTATTATAACGATGTCTTGAGTCAGAAC



GGCATTACTATCTACAACTCAATTATCAGTGGTTTCACTAAGAATGATATAAAATATAAAGGTTTGA



ATGAATACATTAATAATTATAATCAAACTAAAGATAAGAAGGACAGGCTTCCGAAATTGAAGCAAT



TGTACAAGCAGATTCTAAGTGATAGGATTAGTTTGTCTTTCTTGCCAGACGCATTTACTGATGGCAA



GCAAGTCTTAAAGGCTATATTCGATTTCTACAAGATTAACCTACTTTCGTACACAATTGAAGGTCAA



GAAGAATCTCAAAATCTGCTGCTTTTGATTAGGCAAACTATAGAAAATTTGTCGTCCTTTGACACTC



AAAAAATTTACCTGAAGAATGATACACACCTGACTACAATATCACAGCAGGTCTTTGGGGATTTTTC



TGTCTTCTCCACGGCCCTAAACTATTGGTATGAGACAAAAGTTAATCCAAAATTTGAAACAGAATAT



AGTAAGGCGAATGAAAAAAAGAGAGAAATTTTGGATAAAGCGAAGGCAGTATTCACAAAACAAGA



CTATTTTTCTATCGCATTTCTCCAAGAAGTCTTATCCGAATATATTTTGACACTCGATCACACCTCTG



ATATAGTTAAGAAACATTCGTCCAACTGCATCGCAGATTACTTCAAGAATCACTTCGTGGCTAAGAA



AGAAAACGAAACGGATAAAACTTTTGACTTCATTGCTAACATAACCGCTAAATACCAATGTATTCA



GGGCATATTAGAAAATGCAGACCAGTACGAAGACGAGTTAAAACAGGACCAAAAGTTAATAGATA



ATCTAAAGTTTTTCTTAGATGCTATACTTGAGTTATTACATTTTATAAAGCCATTGCATCTAAAATCG



GAAAGTATTACTGAAAAAGACACTGCGTTCTATGATGTGTTCGAAAATTATTATGAGGCTTTATCTT



TATTGACCCCCCTTTACAACATGGTCCGCAATTATGTTACTCAGAAGCCTTACTCTACTGAAAAGAT



CAAATTAAACTTTGAAAATGCTCAGTTGCTGAATGGTTGGGATGCCAATAAGGAAGGTGACTACCT



GACGACTATTCTAAAAAAAGACGGTAATTATTTCTTAGCAATCATGGATAAAAAACATAACAAGGC



ATTTCAAAAATTTCCAGAAGGAAAAGAAAACTATGAAAAGATGGTTTATAAATTGTTGCCTGGAGT



TAATAAAATGTTGCCAAAAGTTTTTTTTAGCAATAAGAACATAGCTTACTTTAATCCATCTAAGGAA



CTGCTCGAGAACTACAAGAAGGAAACACATAAAAAAGGTGATACATTTAATTTGGAACATTGCCAT



ACTCTGATTGATTTTTTTAAGGACTCTCTTAATAAACATGAAGACTGGAAATATTTTGATTTTCAATT



TTCGGAAACTAAATCATACCAAGATCTAAGTGGATTTTACAGAGAAGTTGAACACCAAGGTTATAA



GATTAACTTCAAGAATATAGATTCTGAATACATTGATGGTCTTGTAAACGAGGGTAAACTATTCCTG



TTCCAAATCTACTCTAAGGACTTCTCACCTTTTTCCAAAGGAAAACCTAATATGCATACGTTGTACT



GGAAGGCTCTATTTGAAGAACAAAATTTGCAAAATGTAATCTACAAACTGAACGGCCAAGCTGAAA



TATTCTTCAGAAAAGCCTCAATTAAGCCAAAAAACATTATTCTTCATAAAAAGAAGATCAAGATTG



CGAAGAAACATTTTATTGATAAGAAGACCAAGACTTCCGAAATTGTACCAGTACAAACAATCAAGA



ATCTCAATATGTATTATCAAGGCAAGATAAGTGAGAAAGAGTTAACCCAGGATGATTTACGTTATA



TAGACAATTTCTCTATATTCAACGAGAAGAACAAAACAATAGACATTATCAAAGATAAAAGGTTTA



CTGTTGACAAATTTCAATTTCATGTGCCTATCACAATGAACTTTAAGGCCACAGGTGGTTCGTACAT



TAATCAAACTGTTTTAGAATATCTGCAAAATAACCCAGAGGTCAAGATCATCGGTCTTGATAGGGG



TGAGAGACATCTGGTGTATCTAACACTCATTGATCAACAAGGCAACATCTTGAAGCAAGAATCATT



GAACACTATCACAGACTCCAAGATCTCGACTCCATATCACAAACTCCTTGACAATAAAGAAAACGA



AAGGGATCTTGCCAGAAAAAATTGGGGTACAGTTGAAAATATTAAGGAACTAAAAGAAGGTTACA



TTTCGCAAGTAGTTCACAAGATTGCAACACTCATGTTGGAAGAAAACGCAATCGTTGTCATGGAAG



ATTTAAATTTCGGATTTAAGAGAGGAAGATTTAAAGTAGAAAAGCAAATCTACCAGAAGTTGGAGA



AGATGTTAATTGACAAATTGAACTACTTAGTGCTGAAAGACAAACAGCCTCAAGAATTGGGCGGTC



TATACAACGCTTTACAACTGACAAATAAATTTGAGTCATTCCAAAAGATGGGTAAGCAGAGTGGTT



TTTTGTTTTATGTTCCGGCATGGAACACATCCAAAATCGATCCAACTACAGGCTTCGTGAATTATTTC



TACACTAAATATGAAAATGTGGATAAAGCAAAAGCTTTCTTTGAGAAGTTCGAGGCGATCCGTTTT



AACGCTGAAAAGAAGTACTTCGAGTTCGAGGTCAAAAAGTATTCAGATTTTAACCCCAAGGCTGAA



GGCACCCAGCAAGCATGGACTATTTGCACGTACGGTGAGCGAATCGAAACTAAAAGGCAAAAGGA



TCAAAATAATAAGTTTGTAAGCACACCCATTAACTTGACAGAAAAGATAGAAGATTTTCTTGGAAA



AAACCAAATTGTATATGGTGACGGTAACTGTATCAAGTCACAAATTGCTTCTAAAGACGATAAGGC



CTTCTTCGAAACTCTGCTATACTGGTTTAAAATGACGTTGCAAATGAGAAACAGTGAAACTAGAACT



GATATCGACTATTTAATATCACCCGTGATGAACGATAATGGTACCTTTTACAATTCAAGAGATTACG



AGAAATTGGAGAACCCCACACTACCAAAAGACGCAGACGCTAATGGTGCCTACCATATTGCTAAAA



AGGGACTGATGTTGTTGAACAAGATAGATCAAGCCGACTTAACTAAAAAAGTTGATTTGTCAATTT



CGAATAGAGATTGGTTGCAATTCGTCCAGAAAAATAAGTAA





SEQ ID NO: 135
ATGGAACAGGAATACTACTTGGGTTTGGATATGGGAACTGGTTCAGTCGGTTGGGCTGTTACGGAC



TCCGAGTACCACGTGTTGAGAAAACACGGAAAGGCTTTATGGGGTGTCAGACTATTCGAATCAGCA



TCGACCGCGGAAGAGAGAAGAATGTTTAGAACTTCAAGAAGAAGGCTGGATCGTAGGAATTGGCG



GATAGAAATTTTACAAGAAATATTCGCCGAAGAAATCTCTAAAAAAGATCCAGGATTTTTTCTACGT



ATGAAGGAATCCAAATACTATCCGGAAGATAAACGTGATATTAATGGCAATTGTCCAGAGTTACCC



TATGCTTTATTTGTGGACGACGATTTCACCGATAAAGATTACCATAAGAAGTTCCCAACAATTTACC



ATCTGAGAAAGATGTTAATGAACACTGAAGAAACCCCGGATATAAGACTGGTCTATCTAGCCATTC



ATCATATGATGAAACACAGGGGACACTTCTTGCTATCAGGGGATATAAATGAAATTAAAGAATTTG



GTACAACATTTTCTAAATTATTGGAAAATATTAAAAACGAAGAATTAGATTGGAATTTAGAATTAG



GCAAGGAGGAATACGCAGTTGTCGAATCGATTCTGAAAGATAACATGTTGAACAGATCAACGAAA



AAAACAAGGCTGATCAAGGCTTTAAAAGCGAAATCAATATGCGAAAAAGCAGTATTGAATTTGTTA



GCTGGGGGGACTGTCAAGTTGTCTGATATTTTCGGATTGGAAGAATTGAATGAAACAGAGAGACCG



AAGATATCCTTCGCCGATAATGGCTACGATGATTATATAGGCGAAGTCGAAAATGAGCTGGGCGAA



CAATTCTACATTATCGAGACTGCCAAGGCTGTTTATGATTGGGCGGTGTTAGTCGAAATCCTTGGCA



AATACACTTCCATCTCCGAAGCTAAGGTGGCAACCTACGAAAAGCATAAAAGTGATTTGCAATTCC



TTAAGAAAATTGTCCGAAAGTACTTGACCAAAGAAGAGTACAAGGATATTTTCGTATCAACATCGG



ACAAACTGAAGAATTATTCAGCTTATATTGGCATGACGAAAATTAATGGTAAGAAAGTTGATTTGC



AATCCAAGAGATGTTCTAAAGAAGAATTTTACGATTTCATTAAAAAAAATGTCCTAAAAAAGTTGG



AGGGACAACCTGAATATGAGTATTTAAAGGAAGAACTGGAAAGAGAAACTTTCCTACCAAAGCAA



GTTAATCGTGATAATGGCGTTATTCCATACCAAATACACTTGTACGAATTAAAGAAGATCTTGGGTA



ACTTGAGGGACAAAATTGATTTAATCAAGGAAAATGAAGACAAACTGGTACAATTATTTGAATTTA



GAATACCTTACTACGTGGGCCCTTTAAACAAAATAGACGATGGTAAGGAAGGGAAGTTCACATGGG



CAGTCAGAAAGTCCAATGAAAAAATTTACCCATGGAATTTCGAAAACGTTGTAGATATTGAAGCTT



CTGCTGAGAAATTTATTAGGAGAATGACAAATAAATGCACTTATCTTATGGGGGAAGACGTGTTGC



CTAAAGATAGTTTATTATATTCAAAGTATATGGTCTTAAATGAATTAAACAATGTTAAATTAGATGG



TGAAAAACTTTCCGTCGAATTGAAACAAAGATTGTATACAGATGTATTCTGCAAATATAGAAAAGT



AACTGTAAAGAAGATTAAAAACTACCTTAAATGTGAAGGCATTATCAGCGGAAATGTTGAGATCAC



TGGTATCGATGGTGATTTTAAGGCATCTTTAACCGCATATCACGACTTTAAGGAAATATTGACGGGT



ACTGAGCTTGCTAAAAAAGACAAAGAGAACATTATCACCAATATCGTGCTCTTCGGAGACGACAAG



AAATTATTGAAAAAGAGATTGAACCGCCTATACCCTCAGATTACCCCTAACCAATTGAAGAAAATC



TGCGCTCTGTCTTATACTGGATGGGGTCGTTTTAGCAAGAAGTTTCTAGAAGAAATTACTGCTCCGG



ATCCTGAAACTGGGGAAGTCTGGAATATAATTACCGCGCTATGGGAATCGAATAATAATTTAATGC



AATTACTATCTAATGAATACAGATTTATGGAAGAAGTCGAAACTTACAATATGGGAAAACAAACAA



AAACTTTGAGCTACGAAACAGTAGAGAATATGTATGTCTCACCATCTGTAAAGCGGCAGATCTGGC



AAACCTTGAAGATAGTTAAAGAATTAGAAAAAGTGATGAAGGAAAGTCCAAAAAGGGTTTTTATTG



AAATGGCCCGAGAAAAACAAGAATCTAAAAGGACGGAAAGTAGGAAAAAGCAACTTATAGATCTA



TATAAAGCCTGCAAAAATGAAGAAAAAGATTGGGTAAAGGAATTAGGTGACCAGGAAGAGCAAAA



ATTGAGATCTGACAAGCTGTACTTGTATTATACGCAAAAGGGCCGGTGTATGTATTCGGGTGAGGT



AATAGAATTGAAAGATTTATGGGATAACACTAAGTATGACATTGACCATATTTACCCCCAGTCTAA



GACAATGGACGATTCATTAAATAACCGAGTTCTTGTCAAAAAGAAGTACAATGCCACAAAGAGCGA



TAAGTACCCATTGAACGAAAATATAAGACATGAACGAAAAGGTTTCTGGAAATCATTGTTGGACGG



TGGATTTATTTCCAAAGAAAAATACGAGAGATTGATTAGAAACACTGAACTATCTCCAGAGGAGTT



AGCTGGCTTTATCGAAAGACAAATTGTTGAAACTAGACAGTCTACAAAAGCAGTTGCAGAAATCTT



AAAACAAGTATTTCCAGAATCCGAAATTGTGTACGTCAAAGCCGGAACAGTAAGTAGATTTAGAAA



AGACTTTGAATTATTGAAAGTACGAGAGGTTAACGACCTACATCATGCTAAGGATGCTTATTTAAAT



ATAGTCGTTGGTAATTCGTATTACGTGAAATTCACAAAAAACGCATCTTGGTTCATCAAGGAGAATC



CTGGTAGGACATACAACTTGAAAAAGATGTTTACATCAGGATGGAATATCGAAAGAAATGGTGAGG



TTGCGTGGGAGGTAGGCAAGAAGGGAACCATTGTTACTGTAAAGCAAATTATGAATAAAAACAATA



TACTTGTTACGAGACAGGTGCACGAAGCCAAAGGAGGGTTGTTTGACCAGCAAATCATGAAGAAAG



GTAAAGGTCAGATAGCAATAAAAGAGACTGATGAGCGTTTAGCTAGTATAGAAAAATATGGGGGC



TACAATAAGGCAGCTGGTGCTTACTTCATGTTGGTCGAATCAAAGGATAAAAAAGGGAAGACGATC



CGGACCATAGAGTTTATCCCTCTGTACTTGAAGAATAAGATTGAGTCTGACGAAAGCATCGCATTG



AATTTCTTGGAAAAGGGGCGCGGTCTAAAGGAGCCAAAAATATTGTTAAAGAAAATTAAAATAGAC



ACCCTATTCGACGTCGATGGGTTTAAGATGTGGCTTAGTGGTCGTACTGGGGACAGATTATTATTCA



AGTGTGCCAATCAGTTAATCCTTGACGAGAAAATCATTGTTACAATGAAAAAAATTGTTAAGTTTAT



TCAAAGGCGACAAGAAAATAGAGAACTAAAGTTGAGTGATAAGGATGGAATCGATAATGAAGTGT



TAATGGAGATTTATAACACTTTTGTCGACAAATTGGAGAATACGGTGTACAGAATTAGGCTATCTGA



ACAGGCTAAAACCCTAATTGATAAACAGAAGGAGTTTGAGCGACTTTCTCTTGAAGACAAATCTTC



AACTCTTTTCGAGATCCTACATATCTTTCAGTGTCAATCTTCTGCAGCTAATTTGAAAATGATTGGAG



GTCCTGGTAAGGCTGGTATATTAGTCATGAACAACAACATATCTAAGTGTAATAAGATTAGTATAAT



TAACCAATCACCGACAGGTATCTTTGAAAATGAAATTGATTTACTTAAA





SEQ ID NO: 136
ATGAAATCATTCGACTCGTTCACCAACTTGTACTCCCTGTCTAAAACATTGAAATTTGAAATGCGAC



CTGTTGGTAACACCCAAAAGATGTTAGATAATGCAGGAGTTTTCGAAAAGGATAAACTGATCCAGA



AAAAATACGGTAAAACGAAACCATATTTCGATAGGTTGCATCGGGAATTTATAGAAGAAGCTTTGA



CTGGTGTAGAATTAATTGGCTTAGATGAGAATTTCCGTACTCTAGTCGATTGGCAAAAAGATAAAA



AGAACAATGTTGCCATGAAGGCATACGAAAATAGTCTACAAAGACTAAGAACAGAGATCGGGAAA



ATTTTCAATTTGAAGGCAGAAGACTGGGTGAAGAACAAATATCCAATATTGGGTCTTAAGAATAAG



AATACTGATATATTGTTCGAGGAGGCCGTTTTCGGTATTCTTAAGGCAAGATATGGTGAAGAGAAA



GACACGTTTATTGAAGTTGAGGAGATTGATAAAACCGGTAAGTCCAAAATCAACCAGATCTCTATC



TTCGACAGTTGGAAGGGCTTCACTGGTTATTTTAAGAAGTTCTTCGAAACTAGGAAGAACTTCTATA



AAAACGATGGTACTTCCACGGCTATTGCTACAAGAATTATCGACCAAAACCTTAAGCGTTTTATTGA



TAACCTATCAATTGTTGAAAGTGTTCGACAGAAAGTAGATTTGGCTGAAACTGAAAAATCTTTTAGT



ATCTCCTTATCCCAGTTTTTCTCTATAGATTTTTATAATAAATGTTTGCTGCAAGATGGCATTGACTA



CTATAATAAAATAATTGGTGGAGAGACATTGAAAAACGGAGAGAAGCTGATTGGCCTTAATGAGTT



GATAAATCAATATAGACAAAATAATAAGGACCAGAAAATCCCTTTCTTTAAATTGCTAGACAAACA



GATTTTGTCTGAAAAGATCCTATTCTTGGATGAAATAAAGAACGATACTGAATTGATTGAAGCTTTG



TCCCAGTTTGCTAAAACAGCTGAAGAAAAGACAAAGATTGTGAAAAAATTGTTTGCTGATTTCGTA



GAAAACAATTCTAAATATGATCTAGCCCAGATTTATATAAGTCAAGAAGCTTTCAATACAATAAGT



AATAAGTGGACAAGTGAAACAGAAACTTTTGCTAAGTATTTATTCGAAGCCATGAAGTCTGGTAAA



CTTGCCAAATACGAAAAAAAAGATAACAGTTATAAATTTCCAGACTTTATAGCCCTTTCACAGATG



AAGTCTGCCTTATTGTCGATATCCTTAGAAGGTCATTTTTGGAAGGAAAAATATTATAAGATAAGCA



AGTTCCAAGAAAAGACTAATTGGGAACAATTTTTGGCTATATTTCTATATGAGTTCAATTCATTATT



TTCCGATAAAATCAACACTAAGGATGGAGAGACTAAGCAAGTTGGCTACTATTTGTTCGCAAAAGA



TCTGCACAATTTGATTCTATCAGAACAAATAGATATACCAAAAGATTCAAAGGTAACTATAAAGGA



TTTCGCAGATTCCGTCCTCACCATTTATCAAATGGCTAAATATTTTGCCGTTGAAAAAAAGAGAGCG



TGGTTAGCAGAATACGAGTTGGACTCGTTTTATACTCAGCCAGATACTGGATACTTGCAATTCTACG



ATAATGCATACGAAGACATTGTACAGGTATACAATAAACTTAGAAATTACTTAACCAAGAAGCCCT



ACAGTGAAGAAAAATGGAAGCTGAACTTTGAAAATTCGACTTTGGCAAATGGTTGGGATAAAAATA



AAGAAAGTGACAACTCCGCAGTGATTTTGCAAAAGGGTGGGAAATATTACTTGGGTTTAATCACAA



AAGGCCACAATAAGATTTTTGATGATAGATTTCAAGAAAAATTCATAGTTGGTATAGAAGGTGGCA



AATACGAGAAAATTGTCTATAAATTCTTCCCTGATCAAGCCAAAATGTTCCCAAAAGTTTGCTTTTC



TGCTAAAGGATTGGAGTTTTTCCGGCCTAGCGAGGAGATCCTTCGTATCTACAACAATGCTGAATTC



AAAAAAGGAGAAACCTATAGCATAGATTCTATGCAAAAACTGATAGATTTTTATAAGGATTGTTTA



ACAAAGTACGAAGGCTGGGCCTGCTATACATTTAGACATTTAAAGCCCACAGAAGAATACCAAAAT



AACATTGGTGAATTCTTTCGGGACGTTGCCGAAGACGGCTATAGGATCGATTTTCAAGGTATCTCAG



ATCAATATATCCACGAAAAGAACGAGAAGGGTGAGCTGCACCTTTTCGAAATTCATAATAAGGACT



GGAATTTGGATAAGGCGAGAGATGGTAAATCGAAGACCACTCAAAAGAACTTGCATACTTTATATT



TTGAGTCCTTGTTTTCTAATGATAACGTCGTCCAAAATTTTCCAATAAAGTTGAATGGACAAGCGGA



AATTTTCTATCGGCCTAAGACAGAGAAAGACAAATTAGAATCAAAGAAAGATAAAAAGGGAAATA



AAGTCATTGATCACAAACGATACTCTGAGAATAAAATATTTTTCCACGTACCATTGACACTCAACAG



GACTAAGAATGACTCTTATAGATTTAATGCTCAGATTAATAATTTTTTGGCAAATAACAAGGATATT



AACATAATTGGGGTGGATAGAGGTGAAAAGCACTTGGTATATTACTCTGTCATCACTCAGGCTTCTG



ATATATTGGAAAGCGGGTCTCTAAATGAATTGAACGGTGTTAACTACGCCGAAAAGCTAGGTAAAA



AAGCTGAAAACAGAGAGCAGGCTCGGCGCGATTGGCAAGATGTTCAAGGAATTAAAGACCTTAAA



AAAGGCTACATTAGTCAAGTAGTTAGAAAGTTAGCCGATCTTGCTATTAAACATAACGCAATCATT



ATTCTGGAGGACCTAAATATGCGTTTTAAGCAAGTTAGGGGTGGCATAGAAAAAAGTATTTATCAG



CAGCTTGAGAAGGCTTTGATAGATAAGTTATCGTTCCTAGTTGACAAAGGTGAAAAAAATCCTGAA



CAAGCTGGTCATCTGTTGAAAGCTTATCAGCTGAGCGCACCTTTTGAAACATTTCAAAAAATGGGA



AAACAAACAGGTATTATTTTCTATACTCAAGCGAGTTATACAAGTAAATCTGACCCAGTGACAGGA



TGGAGACCACACCTTTATCTAAAATATTTTTCTGCTAAAAAGGCCAAAGATGACATCGCTAAGTTTA



CAAAAATAGAATTTGTCAACGATAGATTTGAATTGACTTACGATATTAAAGATTTTCAGCAAGCAA



AAGAATACCCAAATAAGACAGTGTGGAAAGTATGCTCCAATGTGGAGAGATTTAGATGGGATAAA



AATCTCAATCAAAACAAGGGTGGTTACACACATTATACTAATATAACTGAAAATATTCAAGAATTG



TTTACTAAGTACGGAATTGACATAACCAAAGACTTACTAACTCAGATTTCAACTATTGACGAAAAA



CAAAATACCTCATTTTTCCGCGACTTTATTTTTTATTTCAACTTGATCTGTCAAATTCGTAACACGGA



TGATTCCGAAATTGCCAAGAAGAACGGAAAAGATGATTTCATCCTATCTCCAGTGGAACCATTTTTT



GACTCAAGAAAAGATAATGGTAATAAGTTGCCTGAGAACGGAGATGATAACGGCGCTTATAATATC



GCTCGGAAGGGTATTGTAATTCTTAATAAAATATCTCAGTACTCTGAAAAGAACGAAAACTGCGAG



AAAATGAAGTGGGGCGACTTGTATGTATCTAATATAGATTGGGATAATTTCGTTACTCAAGCCAAC



GCGAGACATTGA





SEQ ID NO: 137
ATGGAAAATTTTAAAAACCTATATCCAATTAATAAGACACTTAGATTCGAGCTTAGGCCATACGGC



AAAACACTAGAAAATTTTAAGAAGTCAGGCCTATTAGAAAAAGACGCCTTTAAGGCAAATTCCAGA



AGATCAATGCAGGCAATTATTGATGAGAAATTTAAAGAGACTATCGAGGAAAGGTTGAAATACACT



GAATTCTCTGAGTGCGATCTGGGAAACATGACTTCCAAGGATAAAAAGATTACCGATAAGGCTGCT



ACCAACCTCAAAAAGCAAGTCATCTTATCGTTTGATGATGAAATTTTTAATAACTACTTAAAGCCGG



ACAAAAACATTGACGCCCTATTCAAAAATGATCCGTCCAACCCCGTAATTTCAACTTTTAAGGGTTT



TACCACGTACTTTGTAAATTTTTTTGAGATTCGTAAACATATCTTCAAAGGAGAATCGTCGGGTTCC



ATGGCCTATAGGATAATTGATGAAAATCTTACGACTTACTTAAACAATATCGAAAAGATAAAAAAG



TTACCAGAAGAATTAAAGTCTCAATTGGAAGGTATTGACCAAATAGACAAATTAAATAACTATAAT



GAGTTCATAACTCAAAGCGGTATCACACATTACAATGAAATTATCGGTGGTATATCTAAAAGTGAG



AACGTAAAAATACAGGGAATAAACGAGGGGATCAATCTATACTGTCAGAAGAATAAAGTAAAATT



ACCAAGACTAACGCCATTATACAAAATGATTCTGTCTGATAGAGTTTCCAACTCGTTCGTGCTTGAT



ACTATAGAAAATGATACTGAATTAATTGAGATGATTAGCGACTTGATTAATAAAACAGAAATATCT



CAAGACGTAATAATGTCAGACATTCAGAACATTTTCATAAAATATAAACAGCTTGGTAATTTACCG



GGGATAAGTTACTCTAGCATCGTGAATGCTATTTGCTCCGATTATGACAATAATTTTGGTGACGGAA



AAAGAAAAAAATCATATGAGAACGATAGGAAGAAACACCTTGAAACAAACGTATACTCAATTAAC



TATATATCGGAACTGTTAACAGACACCGATGTATCATCTAATATAAAAATGAGATATAAGGAACTT



GAACAAAATTACCAGGTGTGTAAGGAGAATTTCAATGCTACCAACTGGATGAACATTAAGAATATT



AAACAGAGTGAAAAGACAAACTTGATTAAAGATCTACTAGATATACTGAAATCAATACAGAGATTC



TACGATCTGTTTGATATAGTTGATGAAGACAAAAATCCTAGTGCTGAGTTTTACACGTGGCTAAGTA



AAAATGCGGAAAAGTTAGATTTCGAGTTCAACTCTGTTTATAATAAATCTAGGAATTATTTAACTAG



AAAGCAGTATTCTGATAAAAAGATAAAATTGAACTTCGACTCCCCTACGTTGGCAAAGGGTTGGGA



TGCAAACAAAGAAATCGATAACTCCACCATAATAATGCGTAAGTTTAACAATGATAGGGGGGATTA



CGATTATTTTTTGGGAATTTGGAACAAATCTACCCCAGCGAATGAAAAAATTATTCCCCTTGAAGAC



AATGGTCTTTTTGAAAAAATGCAGTATAAATTATATCCAGACCCATCCAAGATGCTTCCAAAGCAAT



TTCTGTCAAAAATTTGGAAGGCTAAACACCCTACTACTCCTGAATTTGATAAGAAGTATAAGGAGG



GCCGACACAAAAAGGGTCCAGATTTTGAAAAAGAATTCCTGCATGAATTGATAGATTGTTTTAAGC



ATGGTTTGGTAAATCATGATGAAAAATATCAGGATGTCTTTGGATTCAATTTGAGAAATACAGAGG



ATTACAACTCATATACAGAATTTCTCGAGGACGTCGAACGTTGCAATTATAATCTCAGTTTCAACAA



GATCGCAGACACTTCAAACTTAATTAACGACGGAAAATTGTACGTTTTTCAAATCTGGTCGAAAGA



CTTTAGTATTGATTCAAAGGGTACAAAAAACCTAAATACAATATATTTCGAAAGTCTATTCTCGGAA



GAAAACATGATCGAAAAAATGTTCAAACTGTCAGGCGAAGCTGAAATATTCTACCGTCCCGCAAGC



CTTAATTATTGTGAGGATATCATTAAAAAAGGACATCACCATGCAGAGTTAAAAGATAAATTCGAT



TACCCAATAATTAAAGATAAAAGATACTCCCAGGATAAGTTCTTTTTCCATGTACCTATGGTTATTA



ACTACAAGTCGGAAAAACTAAACTCGAAGTCATTAAATAATAGAACTAACGAGAACTTGGGACAAT



TCACACATATAATTGGTATTGATCGTGGCGAAAGACATTTAATATATCTGACTGTTGTTGATGTTTC



AACAGGAGAAATTGTTGAACAGAAACATCTTGATGAAATTATAAACACAGATACAAAAGGCGTTG



AGCATAAAACTCATTATCTAAATAAATTGGAGGAAAAGTCGAAGACTCGCGATAACGAGAGAAAG



AGTTGGGAAGCAATTGAAACCATAAAAGAGCTTAAAGAAGGTTACATTAGTCACGTCATCAATGAA



ATACAAAAGTTACAAGAAAAGTATAACGCTTTGATTGTAATGGAAAATCTAAATTATGGTTTTAAG



AATTCAAGAATCAAAGTCGAAAAGCAGGTCTATCAGAAATTTGAAACGGCACTTATTAAAAAGTTT



AACTACATTATTGATAAAAAGGACCCAGAAACTTATATTCATGGTTACCAACTGACGAACCCAATC



ACAACATTGGACAAAATTGGAAACCAAAGTGGAATTGTTTTATACATTCCAGCTTGGAATACATCC



AAAATAGACCCTGTCACGGGGTTTGTCAACTTGTTATATGCCGACGATTTAAAGTATAAAAACCAA



GAACAAGCAAAGTCTTTTATTCAAAAGATTGATAATATTTATTTCGAAAACGGTGAATTTAAATTCG



ACATAGATTTTTCTAAATGGAACAACCGTTATTCAATAAGTAAAACTAAATGGACACTCACCTCATA



CGGCACTCGTATCCAAACCTTTCGGAATCCCCAAAAAAATAACAAATGGGATTCTGCAGAATACGA



CTTGACCGAGGAATTTAAATTAATTCTTAATATAGACGGTACACTCAAAAGTCAAGACGTGGAGAC



ATACAAGAAGTTTATGTCGTTATTCAAGCTTATGCTTCAGTTGAGGAACTCCGTTACAGGCACTGAT



ATTGATTACATGATTTCACCAGTAACGGATAAGACTGGGACTCATTTCGATTCTAGGGAAAATATTA



AAAATTTACCTGCTGACGCAGACGCAAACGGCGCATACAATATAGCAAGAAAAGGGATTATGGCC



ATTGAGAATATTATGAATGGCATATCAGATCCATTAAAGATAAGCAATGAAGACTACTTAAAATAC



ATTCAGAATCAGCAAGAATAA





SEQ ID NO: 138
ATGACCCAGTTTGAAGGTTTCACCAATTTGTACCAAGTAAGTAAAACCTTGAGGTTCGAATTGATCC



CACAGGGCAAGACATTGAAGCATATTCAAGAGCAAGGATTTATAGAAGAAGATAAAGCGAGAAAC



GATCACTATAAAGAGTTAAAACCCATTATTGACAGGATCTATAAAACATACGCCGATCAATGCCTT



CAATTAGTGCAATTAGATTGGGAAAACTTGAGCGCTGCCATCGATTCCTACAGGAAGGAAAAAACA



GAAGAAACAAGAAATGCCTTAATCGAGGAACAAGCAACCTATAGAAACGCTATACACGATTACTTC



ATCGGTAGAACTGATAATCTAACAGATGCAATAAATAAGAGACATGCTGAGATATATAAAGGACTA



TTTAAAGCAGAATTATTCAACGGAAAGGTGTTGAAACAGTTAGGTACCGTTACAACTACTGAGCAT



GAAAATGCCTTGCTGAGAAGCTTTGACAAGTTTACTACCTACTTTTCGGGTTTCTACGAAAATCGCA



AAAATGTATTTTCTGCGGAAGATATTTCAACTGCAATCCCTCATAGGATTGTTCAAGATAATTTCCC



TAAGTTTAAAGAGAACTGTCACATTTTTACAAGGTTAATTACTGCGGTTCCAAGTCTAAGAGAACAT



TTTGAGAATGTAAAAAAAGCGATTGGTATATTTGTATCCACTAGCATTGAAGAGGTTTTCAGCTTCC



CTTTTTATAACCAATTACTTACCCAAACACAGATCGACCTGTACAACCAATTGTTAGGTGGTATATC



GAGGGAGGCTGGTACGGAAAAGATTAAAGGATTAAATGAAGTTCTTAATTTGGCCATACAAAAAA



ATGATGAAACCGCGCACATTATCGCATCTTTACCACATAGGTTTATACCGTTATTCAAGCAAATATT



ATCTGATCGTAATACCTTATCGTTCATATTAGAGGAGTTTAAATCTGACGAAGAAGTTATACAATCT



TTTTGCAAGTATAAGACGCTATTGAGAAACGAAAACGTTCTGGAAACAGCCGAAGCACTGTTCAAT



GAATTAAACAGTATCGACTTGACTCATATTTTTATATCGCATAAAAAGTTGGAGACAATTTCTTCAG



CATTGTGCGATCACTGGGACACTTTAAGGAACGCACTATATGAACGTAGGATCTCAGAATTGACAG



GTAAGATAACGAAGTCTGCTAAAGAGAAAGTGCAGAGATCCCTAAAACACGAGGATATAAATTTG



CAGGAGATAATTTCAGCTGCAGGTAAAGAGTTGTCTGAAGCGTTCAAGCAAAAGACTTCCGAAATC



TTGTCACACGCACACGCCGCATTAGATCAACCTTTACCCACTACTTTGAAAAAACAAGAAGAGAAG



GAGATATTAAAATCACAACTTGATTCTTTACTTGGCCTTTATCATCTTTTAGATTGGTTCGCTGTTGA



CGAGAGCAATGAAGTGGATCCAGAGTTTTCCGCAAGATTGACCGGTATAAAGTTGGAAATGGAACC



TTCGTTATCATTTTACAACAAAGCTAGGAACTATGCTACAAAAAAACCTTATTCTGTCGAAAAATTT



AAACTGAACTTCCAAATGCCTACTCTAGCAAGTGGCTGGGATGTTAATAAAGAAAAGAACAATGGC



GCTATTTTGTTTGTAAAAAATGGCCTATACTATCTTGGAATTATGCCTAAACAAAAAGGTCGCTACA



AGGCTTTGTCATTTGAACCTACTGAAAAGACTAGCGAAGGTTTCGATAAGATGTATTACGATTATTT



CCCGGATGCCGCTAAAATGATCCCCAAGTGCTCTACTCAATTGAAGGCAGTAACTGCTCATTTCCAA



ACGCATACCACGCCAATACTGCTTTCTAACAACTTTATAGAACCACTAGAAATAACGAAAGAAATT



TACGACCTAAATAACCCAGAGAAAGAACCAAAAAAGTTCCAGACGGCCTACGCCAAAAAGACAGG



GGACCAAAAAGGTTACCGCGAGGCGTTATGTAAATGGATTGATTTTACTAGGGACTTTTTATCAAA



ATACACTAAAACGACGTCTATTGATCTTAGCTCCTTACGCCCGTCCTCCCAATACAAGGATCTAGGT



GAGTATTACGCAGAGTTGAACCCGCTATTATACCATATTTCCTTCCAAAGGATTGCTGAAAAGGAA



ATTATGGACGCTGTTGAAACTGGGAAATTGTACCTGTTTCAGATTTATAATAAGGACTTCGCAAAGG



GTCACCATGGTAAGCCTAACCTTCACACTTTGTACTGGACCGGACTATTCTCGCCTGAAAATTTGGC



TAAAACAAGTATCAAGTTAAACGGTCAGGCCGAGTTATTTTATAGACCCAAATCTAGAATGAAAAG



AATGGCCCATAGATTAGGCGAAAAGATGTTAAACAAGAAATTAAAGGACCAAAAAACCCCGATAC



CAGACACTCTATACCAAGAACTGTACGACTATGTGAATCACAGGCTTAGTCACGATTTATCAGATG



AAGCGAGGGCTTTATTGCCAAATGTCATCACCAAGGAAGTATCACATGAAATAATTAAGGATAGAA



GGTTCACATCTGATAAATTCTTTTTTCATGTCCCAATTACATTGAATTATCAAGCAGCGAACTCACC



ATCTAAATTTAATCAGCGCGTCAACGCCTATTTGAAAGAACATCCCGAAACACCAATCATCGGCAT



AGATCGAGGTGAGAGAAACTTAATATATATAACTGTGATTGATTCTACAGGAAAAATCCTGGAGCA



ACGATCTTTAAATACCATACAACAGTTTGATTATCAAAAAAAGTTGGATAACAGAGAAAAAGAACG



TGTTGCCGCTAGGCAGGCTTGGTCTGTGGTAGGAACAATTAAGGACTTAAAGCAGGGCTATCTGTC



CCAAGTTATTCATGAAATAGTCGATCTGATGATACATTATCAGGCAGTTGTCGTGTTGGAAAATTTG



AATTTTGGCTTTAAATCAAAAAGAACTGGCATAGCAGAAAAAGCTGTGTACCAGCAGTTTGAAAAG



ATGTTAATCGATAAGCTAAACTGCCTTGTTCTTAAAGATTACCCCGCAGAAAAAGTAGGTGGTGTTC



TTAATCCATATCAGTTGACAGACCAATTTACATCCTTTGCGAAAATGGGTACGCAAAGCGGGTTCTT



ATTCTACGTACCGGCCCCCTATACTTCTAAGATCGACCCACTAACAGGTTTTGTGGACCCTTTTGTTT



GGAAGACGATAAAGAACCACGAGTCACGCAAACATTTCTTAGAGGGCTTTGATTTCTTGCACTACG



ACGTGAAAACTGGTGATTTTATCTTACACTTTAAAATGAACAGAAATCTCTCTTTCCAACGTGGACT



GCCCGGATTCATGCCGGCTTGGGACATCGTTTTTGAAAAGAATGAAACGCAGTTTGACGCCAAAGG



TACACCATTTATAGCGGGTAAGAGAATTGTGCCGGTCATAGAAAACCATAGATTTACAGGTAGATA



TAGGGATCTGTACCCTGCTAATGAATTGATTGCATTACTCGAAGAGAAAGGAATTGTGTTTCGAGAT



GGATCGAATATTTTACCTAAGTTGTTGGAAAATGATGATTCACACGCAATTGATACTATGGTTGCCC



TCATAAGATCGGTATTGCAAATGAGAAACTCAAATGCTGCTACGGGAGAGGATTATATAAACAGCC



CCGTTCGCGATCTTAATGGTGTTTGTTTTGATTCACGTTTTCAGAACCCCGAATGGCCAATGGATGC



CGACGCAAACGGAGCATATCATATTGCTCTTAAAGGCCAACTACTATTAAATCACTTAAAGGAATC



CAAAGACCTAAAATTGCAAAACGGGATATCTAATCAGGATTGGCTGGCTTACATACAAGAACTACG



TAACTAG





SEQ ID NO: 139
ATGGCCGTTAAGTCAATCAAAGTGAAACTTAGACTGGATGACATGCCAGAGATTCGTGCGGGGTTA



TGGAAACTTCATAAGGAAGTTAACGCAGGGGTAAGATATTATACCGAATGGTTATCATTACTTCGA



CAAGAGAATTTGTACAGAAGGTCCCCGAACGGCGACGGTGAGCAAGAATGCGATAAGACGGCTGA



AGAATGTAAGGCAGAACTTTTGGAGCGCCTGAGAGCCCGTCAGGTTGAAAATGGCCATAGAGGTCC



TGCGGGATCTGATGATGAGCTTTTACAGCTAGCTAGACAATTGTATGAATTGTTGGTCCCTCAGGCT



ATTGGGGCTAAAGGAGACGCTCAACAAATCGCCAGAAAGTTCTTGTCACCTCTGGCTGACAAAGAT



GCCGTGGGAGGATTAGGTATCGCTAAAGCAGGTAATAAACCAAGATGGGTTAGAATGAGAGAAGC



AGGCGAACCTGGTTGGGAAGAAGAGAAAGAAAAGGCCGAAACTAGAAAAAGCGCTGACAGAACC



GCAGATGTTTTACGGGCCTTGGCTGATTTTGGACTGAAGCCTTTGATGAGAGTGTATACTGATTCAG



AAATGTCTTCCGTTGAATGGAAGCCCCTAAGGAAGGGACAAGCGGTCAGAACCTGGGATAGGGAT



ATGTTTCAACAGGCTATTGAAAGGATGATGTCATGGGAATCCTGGAATCAAAGAGTAGGTCAAGAA



TACGCTAAACTGGTCGAACAAAAGAATAGATTTGAACAAAAAAATTTTGTAGGTCAAGAACATTTA



GTACATTTGGTTAATCAACTTCAACAAGATATGAAAGAGGCATCTCCTGGTTTGGAATCAAAAGAA



CAAACAGCACACTATGTTACCGGCCGAGCTTTGCGAGGTTCTGACAAAGTATTTGAAAAGTGGGGG



AAATTAGCTCCCGATGCCCCCTTTGATCTATATGATGCTGAAATTAAAAACGTTCAAAGAAGGAAC



ACTAGACGTTTTGGATCCCATGATCTTTTTGCAAAGCTAGCTGAGCCAGAATACCAGGCTCTATGGC



GTGAAGACGCCTCGTTTTTGACTAGATACGCAGTATACAATTCAATACTCAGAAAACTAAACCATG



CCAAGATGTTTGCTACATTCACCCTGCCCGATGCTACCGCTCATCCTATTTGGACTAGATTTGACAA



GTTGGGGGGGAATCTACATCAGTACACATTTTTATTTAATGAATTCGGTGAAAGAAGACACGCTATT



AGATTCCACAAGCTCCTAAAGGTTGAAAACGGCGTTGCGAGAGAAGTTGATGATGTAACAGTTCCC



ATTTCTATGTCGGAGCAATTGGATAATCTATTGCCTAGAGACCCTAATGAACCAATTGCTTTGTACT



TTCGTGACTACGGTGCAGAACAACACTTTACAGGTGAATTCGGCGGAGCCAAGATTCAATGTAGAC



GTGATCAACTCGCACACATGCATAGAAGAAGAGGCGCTCGTGATGTTTATTTAAATGTGTCTGTTAG



AGTTCAATCCCAATCGGAGGCTAGAGGTGAAAGAAGGCCACCATACGCAGCAGTTTTTAGGTTAGT



AGGTGATAATCATAGGGCATTTGTCCACTTCGACAAATTAAGTGATTATTTAGCAGAGCACCCTGAT



GATGGAAAGTTGGGCAGTGAGGGATTATTAAGTGGGTTGAGGGTAATGTCTGTAGATCTTGGTCTT



CGTACTTCTGCGAGTATCTCTGTCTTTAGAGTAGCACGTAAGGATGAGTTGAAACCTAATAGCAAAG



GAAGAGTCCCGTTTTTTTTTCCTATTAAGGGTAACGATAACCTGGTGGCCGTGCATGAAAGATCACA



ACTTTTGAAATTGCCAGGAGAAACGGAGTCCAAGGACTTGAGGGCAATTAGAGAGGAACGTCAGC



GTACATTGCGACAGCTGAGAACTCAATTGGCTTATTTGAGGTTGTTGGTTAGGTGTGGTTCCGAGGA



TGTTGGCAGAAGAGAAAGGTCTTGGGCCAAATTGATAGAACAACCAGTGGACGCCGCAAATCACAT



GACACCAGATTGGAGAGAAGCTTTCGAAAATGAACTCCAGAAATTAAAGAGCCTACATGGCATATG



CTCTGATAAAGAGTGGATGGATGCCGTATACGAATCCGTTCGTAGAGTCTGGCGCCACATGGGTAA



GCAAGTACGGGACTGGAGAAAGGATGTTCGTTCCGGCGAAAGACCGAAGATAAGGGGGTATGCAA



AGGACGTTGTAGGCGGTAATTCTATTGAACAGATTGAGTATTTGGAAAGGCAGTACAAATTTCTTA



AATCCTGGAGCTTCTTCGGCAAAGTGTCAGGACAAGTCATCAGGGCTGAAAAAGGTTCCAGATTTG



CTATTACGCTAAGGGAACATATTGATCATGCGAAAGAAGATAGACTGAAAAAACTAGCAGATAGA



ATAATTATGGAAGCACTTGGTTACGTCTATGCACTTGATGAAAGAGGCAAGGGGAAATGGGTAGCT



AAATACCCGCCTTGTCAACTTATTTTATTAGAAGAATTAAGCGAGTACCAATTTAACAACGATAGAC



CTCCATCCGAAAATAATCAGCTGATGCAATGGTCCCATAGGGGTGTTTTTCAAGAATTGATAAATCA



AGCTCAAGTACACGATTTGCTGGTAGGTACTATGTACGCAGCGTTTTCGAGCCGTTTTGATGCAAGA



ACTGGTGCCCCAGGTATCAGATGTCGACGTGTTCCGGCCAGATGTACACAGGAACATAACCCTGAG



CCATTTCCGTGGTGGCTTAATAAGTTTGTTGTCGAGCACACATTAGACGCATGCCCTCTGAGAGCAG



ATGACCTTATACCCACTGGAGAAGGCGAAATATTTGTTAGTCCATTCTCTGCAGAAGAAGGTGACTT



TCACCAGATACATGCAGACTTAAATGCAGCACAGAATCTCCAACAAAGGTTGTGGTCGGATTTTGA



TATTTCGCAAATAAGACTAAGATGCGATTGGGGAGAGGTTGATGGAGAATTGGTGCTGATTCCAAG



ATTAACCGGAAAGCGAACTGCCGATTCCTATTCTAACAAGGTGTTTTACACAAATACTGGTGTTACC



TATTACGAAAGAGAAAGGGGTAAGAAGAGACGTAAAGTATTTGCTCAAGAAAAATTGTCAGAAGA



GGAGGCAGAACTGTTAGTAGAAGCAGACGAAGCCAGAGAAAAATCAGTTGTGCTTATGCGTGACC



CTTCCGGCATTATAAATCGTGGTAATTGGACACGACAAAAAGAATTTTGGTCTATGGTCAATCAACG



TATCGAAGGCTACCTAGTTAAGCAAATCAGGTCTAGGGTTCCACTACAAGATAGCGCATGTGAAAA



TACGGGTGATATATAA





SEQ ID NO: 140
ATGGCTACTAGATCTTTCATTTTAAAAATTGAACCTAATGAAGAAGTGAAGAAGGGTCTCTGGAAA



ACTCACGAAGTACTTAATCATGGCATTGCCTATTATATGAATATCCTGAAGCTTATTCGTCAAGAAG



CTATATACGAGCATCATGAGCAAGATCCTAAGAACCCTAAGAAAGTAAGCAAAGCGGAAATTCAG



GCTGAATTGTGGGACTTCGTCTTGAAGATGCAGAAGTGTAACAGTTTTACGCACGAAGTTGATAAA



GATGTGGTGTTTAATATTTTGAGGGAGCTATATGAGGAGTTGGTGCCCTCGAGTGTCGAAAAAAAA



GGAGAAGCTAATCAGCTGTCAAATAAATTTTTATATCCTCTGGTGGATCCAAACTCTCAATCAGGTA



AAGGCACTGCCAGTAGTGGTCGAAAACCGAGATGGTATAATTTGAAAATCGCAGGTGATCCATCGT



GGGAAGAAGAAAAAAAAAAATGGGAAGAAGATAAAAAAAAAGATCCCCTTGCCAAAATACTAGG



TAAGCTAGCCGAGTATGGACTTATACCATTATTCATTCCTTTCACGGACTCTAATGAACCAATTGTG



AAGGAAATCAAATGGATGGAAAAATCACGTAATCAGTCTGTTAGGAGGTTGGACAAAGATATGTTT



ATACAGGCTCTTGAGAGGTTTTTGTCGTGGGAGTCCTGGAATTTGAAAGTGAAAGAAGAATATGAA



AAAGTGGAAAAGGAGCATAAGACGTTGGAAGAAAGGATTAAGGAAGATATTCAGGCCTTTAAGAG



TCTGGAACAGTACGAAAAAGAAAGACAGGAACAGTTATTGAGAGATACTCTAAACACTAATGAAT



ATAGGCTTTCCAAGAGGGGCTTGCGAGGATGGAGAGAGATAATTCAGAAATGGTTGAAAATGGAT



GAGAACGAGCCATCGGAGAAATATCTAGAGGTGTTTAAAGATTACCAAAGAAAGCACCCTCGCGA



AGCTGGTGATTACTCTGTTTATGAATTCCTTTCGAAGAAGGAAAATCACTTCATCTGGCGAAATCAT



CCAGAGTACCCATATTTATATGCTACATTTTGCGAAATTGACAAGAAAAAAAAAGATGCTAAACAG



CAAGCGACATTCACCCTCGCTGATCCCATCAACCACCCATTATGGGTCAGGTTCGAAGAGAGATCA



GGCTCGAACCTGAATAAGTACAGGATCTTGACTGAGCAATTGCATACTGAGAAGTTAAAAAAGAAA



TTGACGGTCCAACTTGACAGATTGATTTATCCCACTGAATCTGGTGGATGGGAGGAGAAAGGTAAG



GTTGATATTGTCCTATTGCCTTCTCGTCAATTTTACAACCAAATATTTCTGGACATCGAAGAGAAGG



GTAAACATGCTTTTACCTATAAGGATGAGAGTATTAAATTTCCATTGAAGGGAACGCTTGGCGGCG



CTAGAGTTCAGTTCGATAGAGATCATTTGAGAAGATACCCGCATAAAGTGGAATCTGGTAATGTAG



GTCGGATCTACTTTAACATGACGGTAAATATTGAACCTACCGAGTCACCAGTCAGTAAGTCTTTAAA



GATTCATAGGGATGATTTCCCTAAATTTGTCAACTTCAAGCCTAAGGAACTAACCGAGTGGATCAA



AGACAGTAAAGGCAAAAAGTTAAAGAGCGGTATTGAGTCCCTGGAGATAGGTCTTAGAGTCATGTC



TATCGATTTGGGTCAAAGACAAGCAGCCGCAGCATCTATTTTCGAAGTTGTTGACCAAAAACCGGA



TATCGAGGGGAAATTATTTTTTCCAATAAAAGGAACTGAGCTATACGCTGTGCATCGCGCATCCTTC



AATATAAAACTGCCAGGAGAAACACTAGTAAAATCTAGAGAGGTCTTGCGTAAAGCACGTGAGGA



CAATCTCAAATTAATGAATCAGAAGTTAAATTTCCTTAGGAACGTGTTGCATTTCCAACAGTTCGAG



GACATAACTGAACGCGAGAAAAGAGTCACTAAGTGGATCTCAAGACAAGAAAATAGTGATGTGCC



ATTAGTGTATCAAGACGAACTTATTCAAATAAGAGAGCTAATGTATAAACCATATAAAGACTGGGT



GGCATTCTTAAAACAATTACACAAGCGGCTTGAAGTAGAAATAGGAAAAGAAGTAAAGCATTGGA



GGAAGAGTCTGTCCGATGGTCGCAAAGGCCTGTACGGGATATCACTTAAAAATATTGATGAAATTG



ACAGAACACGAAAATTTTTGTTAAGATGGTCATTGAGACCAACCGAACCAGGTGAGGTTAGAAGGT



TGGAACCAGGCCAAAGGTTTGCCATCGATCAATTAAACCATCTTAACGCACTGAAAGAAGATAGAT



TGAAGAAGATGGCGAACACTATTATTATGCACGCTCTAGGTTATTGCTATGATGTGAGAAAGAAAA



AATGGCAAGCCAAGAACCCTGCATGCCAAATTATTTTGTTTGAAGATCTTTCTAATTACAATCCATA



CGAAGAGCGTTCACGTTTTGAAAACTCTAAATTGATGAAATGGTCTAGAAGAGAGATTCCGAGACA



GGTCGCTCTACAAGGGGAGATTTACGGTCTTCAAGTCGGTGAGGTTGGTGCTCAATTTTCTTCCAGA



TTTCATGCAAAAACTGGGTCTCCAGGCATTAGGTGTTCGGTCGTTACTAAGGAAAAGTTACAGGAC



AACCGTTTCTTCAAAAATTTGCAACGTGAAGGCCGTTTAACACTTGATAAGATAGCTGTCCTTAAGG



AAGGCGATCTGTACCCAGATAAAGGTGGTGAGAAATTCATATCTTTGAGTAAAGACAGGAAACTGG



TTACAACACACGCCGACATTAACGCAGCTCAGAACTTGCAAAAGAGATTCTGGACAAGGACCCACG



GCTTCTATAAGGTGTACTGTAAAGCTTATCAAGTAGATGGACAAACGGTTTATATTCCTGAATCAAA



GGACCAGAAACAAAAAATTATAGAAGAATTTGGTGAAGGATACTTTATCTTGAAGGATGGAGTTTA



TGAGTGGGGCAATGCAGGTAAGTTAAAGATAAAGAAAGGTTCATCAAAGCAATCAAGTAGCGAAC



TGGTCGATTCGGATATTTTAAAGGATAGCTTTGATCTAGCTAGTGAATTGAAGGGAGAAAAGTTAA



TGTTATACAGAGATCCCAGTGGGAATGTATTTCCATCTGATAAGTGGATGGCCGCCGGAGTGTTTTT



TGGCAAATTAGAGAGAATCTTGATTTCTAAACTGACCAATCAATACTCAATTTCGACCATCGAAGAC



GACTCTTCAAAACAATCCATGTGA





SEQ ID NO: 141
ATGCCTACTCGCACCATCAATCTGAAGTTAGTTTTGGGGAAGAACCCAGAAAATGCGACTCTAAGA



CGGGCACTATTCTCTACACATAGACTTGTCAACCAAGCGACTAAGAGAATTGAAGAATTTTTACTGT



TGTGTAGAGGAGAAGCTTATCGTACCGTAGATAATGAAGGTAAAGAAGCTGAGATCCCACGCCATG



CTGTTCAAGAAGAGGCGCTTGCTTTTGCAAAAGCTGCACAACGACATAACGGCTGTATCTCCACAT



ATGAGGACCAGGAAATCTTGGATGTGCTTAGACAATTGTATGAAAGATTAGTACCTAGCGTCAATG



AAAACAACGAGGCTGGGGATGCCCAAGCCGCTAACGCTTGGGTGAGTCCATTAATGAGTGCAGAGT



CCGAAGGTGGACTATCGGTCTATGATAAAGTGTTAGACCCGCCGCCAGTATGGATGAAACTCAAAG



AAGAGAAAGCGCCTGGTTGGGAAGCTGCTTCTCAGATTTGGATACAGTCCGACGAAGGTCAATCGC



TGCTAAATAAACCGGGTAGCCCACCACGTTGGATTAGAAAACTTAGATCTGGTCAACCGTGGCAAG



ATGACTTCGTTTCAGACCAAAAAAAAAAGCAAGATGAACTAACGAAAGGTAACGCACCACTCATA



AAACAATTGAAAGAGATGGGCCTCTTGCCTTTAGTTAATCCCTTTTTTAGACATTTGTTGGATCCCG



AGGGTAAGGGTGTATCCCCATGGGACAGATTGGCCGTAAGGGCCGCGGTGGCGCACTTCATCTCTT



GGGAAAGTTGGAACCACAGAACAAGAGCTGAGTATAACAGTTTGAAACTGCGAAGAGATGAATTT



GAGGCCGCATCTGATGAATTCAAGGACGATTTTACATTGCTACGACAATATGAGGCTAAGCGACAT



AGTACGCTTAAGTCAATTGCCTTAGCTGATGACTCTAACCCGTACCGAATTGGTGTAAGGTCCTTGA



GAGCCTGGAATAGGGTTAGAGAAGAATGGATTGACAAAGGCGCAACCGAGGAACAAAGGGTTACC



ATCCTTAGTAAGCTTCAAACACAATTACGGGGTAAATTCGGTGATCCAGACCTATTTAATTGGCTAG



CCCAAGATAGACACGTACACCTGTGGTCCCCGAGAGATTCCGTCACGCCCCTCGTAAGGATTAATG



CCGTCGACAAAGTGCTTAGAAGACGTAAGCCTTATGCACTGATGACTTTTGCACATCCGAGATTCCA



TCCAAGATGGATTCTATACGAAGCGCCTGGTGGTTCTAACTTGCGACAATACGCTTTAGATTGTACT



GAAAATGCTCTGCATATTACACTTCCATTACTCGTCGACGACGCCCATGGTACATGGATTGAGAAAA



AAATCCGCGTACCACTCGCTCCTAGTGGACAAATACAAGATTTAACTTTAGAAAAACTTGAAAAGA



AAAAAAACAGATTATACTATAGATCAGGATTCCAACAATTTGCTGGATTAGCCGGTGGTGCTGAGG



TGTTGTTTCATAGGCCGTATATGGAACATGATGAGAGATCAGAAGAATCTCTGTTGGAAAGGCCAG



GCGCTGTGTGGTTCAAATTAACCTTAGATGTTGCTACCCAAGCACCACCTAACTGGTTAGATGGTAA



AGGCAGAGTTAGGACACCTCCAGAAGTTCATCATTTCAAAACCGCTCTGTCAAATAAATCTAAACA



TACGAGAACCTTGCAACCAGGATTGAGAGTCCTTTCTGTTGATTTGGGTATGAGAACATTTGCTTCT



TGTTCTGTTTTCGAATTGATCGAAGGTAAACCTGAAACAGGTAGAGCATTCCCTGTTGCTGACGAAA



GATCAATGGATAGTCCAAATAAGTTATGGGCCAAGCACGAGAGAAGCTTTAAACTAACTCTGCCTG



GAGAAACACCGAGCAGAAAGGAGGAAGAAGAGAGAAGCATTGCTAGGGCAGAGATTTACGCGCTG



AAAAGAGATATTCAAAGACTGAAATCACTCCTAAGATTAGGTGAGGAAGATAATGATAATAGAAG



AGATGCTTTGTTAGAGCAATTCTTTAAAGGATGGGGTGAAGAGGACGTAGTTCCTGGTCAAGCTTTC



CCTAGAAGCCTCTTTCAGGGATTAGGCGCTGCACCCTTTAGGTCAACACCCGAATTGTGGAGACAG



CACTGTCAGACGTATTACGACAAAGCGGAAGCTTGCCTGGCAAAGCATATTTCCGACTGGAGGAAG



AGAACTAGACCTCGTCCGACTTCGAGAGAGATGTGGTATAAGACAAGATCTTACCATGGTGGCAAA



AGTATTTGGATGCTAGAATACTTAGATGCTGTCCGCAAATTACTACTTTCATGGTCGTTAAGAGGTC



GTACTTACGGAGCTATTAATAGACAAGACACCGCTCGTTTTGGTTCCTTAGCTTCTAGATTGTTGCA



TCATATCAACTCTTTAAAGGAAGACCGCATCAAAACCGGTGCAGATAGTATTGTGCAGGCCGCAAG



GGGCTATATTCCTCTCCCACATGGCAAGGGTTGGGAACAGCGTTATGAACCCTGTCAGTTGATATTA



TTTGAAGATCTAGCTAGGTACAGATTTCGTGTAGACAGACCTCGGAGAGAGAATTCGCAATTGATG



CAGTGGAATCATCGAGCTATAGTAGCAGAAACGACGATGCAAGCTGAACTATACGGTCAAATAGTC



GAAAATACCGCTGCTGGTTTCTCCTCAAGATTTCATGCTGCAACTGGTGCTCCTGGTGTCAGATGTC



GCTTTTTGTTAGAACGAGATTTCGATAATGACCTACCAAAGCCGTACTTACTGAGAGAACTAAGTTG



GATGTTAGGTAACACAAAGGTTGAATCAGAGGAAGAAAAATTGCGTCTTCTAAGCGAGAAAATTAG



ACCAGGTTCATTAGTCCCTTGGGATGGGGGTGAACAATTCGCGACATTACACCCGAAAAGACAAAC



TCTTTGTGTCATTCACGCAGATATGAACGCTGCTCAAAACCTGCAACGCAGATTTTTCGGAAGGTGT



GGGGAAGCCTTTCGCCTTGTGTGTCAGCCACATGGTGATGATGTTTTGAGGCTAGCGTCTACACCAG



GTGCAAGACTTTTGGGTGCATTACAACAACTGGAAAATGGTCAGGGAGCTTTCGAATTAGTTCGTG



ATATGGGTAGCACATCACAAATGAATCGTTTCGTCATGAAGTCGTTGGGCAAAAAAAAGATCAAGC



CATTACAAGACAATAACGGGGATGATGAACTAGAAGACGTGCTATCTGTTTTACCTGAAGAAGATG



ATACCGGACGAATTACTGTATTTCGGGACTCTTCGGGTATATTCTTCCCTTGTAACGTTTGGATCCCG



GCAAAACAGTTCTGGCCTGCGGTCCGTGCTATGATTTGGAAGGTTATGGCATCACATTCATTGGGTT



AG





SEQ ID NO: 142
ATGACAAAGTTAAGGCATAGACAGAAGAAGTTAACTCACGATTGGGCGGGGTCTAAAAAGAGAGA



AGTTCTAGGGAGCAATGGTAAATTACAGAATCCATTGCTAATGCCCGTCAAAAAAGGTCAGGTGAC



AGAATTTCGAAAAGCATTTTCCGCATACGCCCGAGCAACCAAAGGGGAAATGACGGATGGCAGAA



AAAATATGTTTACTCACTCATTTGAACCATTCAAGACCAAGCCTTCGTTACATCAGTGCGAACTGGC



TGACAAAGCCTACCAGAGCTTGCATTCATATTTACCGGGTTCTTTGGCGCATTTTCTTTTATCTGCCC



ATGCACTTGGTTTTAGGATTTTTAGCAAATCAGGGGAAGCCACTGCATTCCAAGCGTCCTCAAAGAT



TGAAGCTTACGAAAGCAAGTTAGCTAGCGAGCTTGCTTGTGTTGATTTGTCTATTCAGAACTTGACT



ATTTCAACTTTGTTCAACGCATTAACGACTTCCGTAAGAGGTAAAGGTGAGGAGACATCGGCAGAT



CCACTGATAGCTAGATTTTACACCTTACTTACCGGTAAACCACTAAGCAGAGACACTCAGGGCCCA



GAACGAGATTTAGCCGAGGTGATAAGCAGAAAAATTGCAAGTTCTTTTGGAACTTGGAAGGAGATG



ACTGCCAATCCACTTCAATCTCTTCAATTTTTTGAAGAGGAGTTGCATGCGCTAGATGCAAATGTTA



GTTTGTCACCTGCCTTCGATGTTCTGATTAAGATGAACGACCTGCAGGGTGACTTGAAGAACAGAAC



GATAGTTTTTGATCCAGATGCTCCTGTGTTTGAATATAATGCTGAGGATCCTGCTGACATCATCATT



AAACTGACAGCTAGATATGCGAAAGAAGCAGTGATTAAAAATCAAAATGTCGGGAATTATGTTAAG



AACGCTATTACGACAACTAACGCAAACGGACTAGGTTGGTTGCTGAACAAAGGCCTTTCCTTATTGC



CTGTCTCCACTGATGACGAACTATTGGAGTTTATTGGGGTCGAGAGATCCCATCCTAGCTGTCATGC



GTTGATAGAACTTATCGCTCAGTTAGAAGCACCTGAACTGTTCGAAAAAAATGTTTTTTCTGATACT



CGTTCCGAGGTTCAAGGTATGATAGATTCAGCTGTAAGCAATCATATCGCCAGGCTGTCAAGCTCTC



GTAATTCATTGAGCATGGACTCAGAGGAACTTGAGAGATTGATAAAATCTTTTCAAATTCATACACC



ACATTGTTCATTATTTATAGGGGCTCAATCCTTATCTCAACAATTGGAAAGCCTACCCGAAGCATTG



CAGTCAGGAGTGAACAGTGCTGATATTCTGCTCGGCTCAACCCAATACATGTTGACAAATTCTTTGG



TCGAGGAGTCAATCGCTACGTATCAGAGAACCTTAAATAGAATTAACTACCTGTCCGGCGTTGCAG



GACAGATTAACGGTGCTATTAAGAGGAAAGCTATTGATGGTGAGAAGATACATTTACCCGCTGCTT



GGTCAGAGTTAATTTCTTTACCCTTTATTGGGCAACCAGTGATTGATGTTGAATCAGATTTAGCCCA



CTTAAAGAACCAATACCAGACATTGTCTAACGAATTTGATACGCTGATTTCCGCACTGCAAAAGAA



TTTCGACTTAAATTTTAATAAAGCCTTGCTTAATCGAACACAACATTTCGAGGCTATGTGTAGATCA



ACAAAAAAGAATGCCCTTTCTAAGCCTGAGATCGTTAGTTATAGAGATTTGCTAGCCAGGTTGACTT



CTTGTCTTTATAGGGGCTCTCTAGTCTTGAGGAGGGCGGGTATAGAAGTACTGAAAAAGCACAAGA



TATTTGAGTCCAACTCTGAATTAAGAGAGCACGTTCATGAAAGAAAACACTTCGTATTTGTTTCTCC



GCTCGATAGAAAAGCCAAGAAGCTCCTACGTTTGACTGACTCTAGGCCTGATTTATTGCACGTAATT



GATGAAATACTACAACATGATAATTTAGAGAACAAGGATAGAGAATCTTTGTGGTTAGTTCGATCT



GGTTATTTACTGGCCGGCCTACCAGACCAACTCTCCTCTTCCTTTATAAATCTTCCAATCATTACTCA



AAAAGGCGATCGTCGCTTGATAGATCTCATTCAATACGACCAAATTAATAGAGATGCTTTTGTGATG



TTGGTAACTTCCGCTTTTAAGTCGAACTTAAGTGGGCTGCAGTACAGAGCAAACAAACAATCTTTTG



TGGTTACGCGCACTTTGTCACCATATTTGGGATCTAAATTGGTTTATGTGCCCAAAGATAAAGATTG



GCTGGTCCCTTCCCAAATGTTCGAGGGGAGATTTGCGGACATTTTGCAATCCGATTATATGGTGTGG



AAGGACGCTGGAAGATTGTGTGTTATTGACACAGCTAAGCATTTGTCTAACATTAAAAAATCTGTAT



TCTCAAGTGAAGAAGTCCTCGCGTTTTTAAGAGAATTGCCACACCGTACGTTTATCCAAACTGAGGT



CAGGGGTTTAGGGGTGAATGTGGACGGTATTGCATTTAATAACGGGGATATACCCTCTCTGAAGAC



GTTTAGCAATTGCGTGCAAGTCAAAGTGAGTCGGACAAACACTAGTCTGGTCCAAACATTAAATAG



ATGGTTTGAAGGCGGTAAGGTCTCGCCGCCTAGCATCCAATTTGAGAGAGCATATTACAAAAAAGA



TGATCAAATCCACGAGGACGCTGCAAAAAGGAAGATAAGGTTTCAAATGCCAGCTACAGAGTTGGT



ACACGCGTCAGACGACGCAGGATGGACCCCCTCCTATTTACTTGGTATCGATCCCGGTGAATATGGT



ATGGGTTTGTCATTGGTCTCAATAAATAATGGCGAAGTTTTAGATAGCGGATTTATACACATAAATT



CATTGATAAATTTCGCTTCTAAGAAATCAAATCATCAAACCAAAGTTGTTCCGAGGCAGCAATACA



AGTCACCATACGCCAACTATCTAGAACAATCTAAAGATTCTGCAGCAGGAGACATAGCTCATATTTT



GGATAGACTTATCTACAAGTTGAACGCCCTACCCGTTTTCGAAGCTCTATCTGGCAATAGTCAAAGC



GCAGCGGATCAGGTTTGGACAAAAGTCCTCAGCTTCTACACCTGGGGAGATAATGATGCACAAAAT



TCAATTCGTAAGCAACATTGGTTCGGTGCTTCACACTGGGACATTAAAGGCATGTTGAGGCAACCG



CCAACAGAAAAAAAGCCCAAACCATACATTGCCTTTCCCGGTTCACAAGTTTCTTCTTATGGTAATT



CTCAAAGGTGTTCATGTTGTGGACGTAACCCAATTGAACAATTGCGCGAAATGGCGAAGGACACAT



CCATTAAGGAGTTGAAGATTAGAAATTCAGAAATTCAATTGTTCGACGGTACTATAAAGTTATTTAA



TCCAGACCCGTCAACGGTCATAGAAAGAAGAAGACATAATTTAGGGCCATCAAGAATTCCTGTAGC



TGATAGAACTTTCAAAAATATAAGTCCAAGCTCACTAGAATTCAAAGAACTAATAACGATTGTGTC



ACGGTCTATACGTCATTCCCCAGAATTTATTGCTAAAAAAAGAGGTATAGGTAGTGAGTACTTTTGT



GCTTATAGTGATTGTAATTCCTCCTTAAATTCAGAAGCAAATGCGGCTGCGAACGTTGCCCAAAAGT



TCCAAAAGCAATTGTTTTTCGAATTATAG





SEQ ID NO: 143
ATGAAAAGAATCTTGAACTCTTTAAAGGTTGCCGCCCTGCGTTTGTTATTTAGAGGTAAAGGATCTG



AACTTGTCAAGACTGTTAAATACCCTTTGGTCTCGCCGGTTCAGGGTGCAGTTGAGGAGTTAGCTGA



GGCGATCCGCCATGATAACCTACATCTGTTTGGTCAAAAAGAAATTGTTGACCTTATGGAAAAGGA



TGAAGGTACGCAAGTTTACTCAGTGGTTGATTTCTGGTTAGATACCCTTCGTTTGGGGATGTTTTTCA



GTCCATCAGCAAACGCATTAAAAATCACGCTGGGTAAGTTTAATTCTGATCAGGTTAGCCCTTTTAG



GAAAGTGTTAGAGCAGTCTCCATTCTTCTTGGCTGGTAGGCTGAAGGTTGAACCGGCAGAACGTAT



ATTATCTGTCGAGATCCGTAAGATTGGGAAGAGGGAAAACAGAGTTGAGAACTATGCTGCTGACGT



AGAAACGTGTTTTATAGGCCAATTAAGTTCAGATGAGAAACAGTCAATACAAAAATTAGCTAATGA



TATCTGGGATAGTAAAGATCATGAAGAGCAAAGAATGTTAAAGGCAGATTTCTTCGCTATCCCTTTG



ATTAAGGATCCAAAGGCTGTGACCGAAGAGGATCCTGAAAATGAAACTGCTGGTAAACAAAAACC



CTTGGAGTTGTGTGTCTGCCTTGTCCCAGAACTTTACACAAGAGGATTCGGGTCAATAGCCGATTTT



TTGGTTCAACGCTTAACTCTTTTAAGGGATAAAATGTCTACAGATACTGCAGAAGATTGTTTAGAAT



ATGTCGGGATTGAGGAGGAAAAAGGTAACGGCATGAACTCATTGTTGGGAACGTTCTTAAAGAATT



TGCAAGGCGATGGATTTGAGCAGATTTTCCAATTTATGTTAGGGAGCTATGTCGGTTGGCAAGGGA



AGGAAGATGTTTTAAGAGAGAGATTAGACTTATTGGCTGAAAAAGTGAAGAGGTTACCGAAACCA



AAATTTGCTGGCGAATGGTCTGGTCATAGGATGTTCTTGCATGGCCAATTGAAGTCTTGGTCTTCAA



ATTTTTTTAGACTATTTAACGAGACAAGGGAACTTCTAGAGTCTATTAAGTCAGATATACAGCATGC



CACAATGCTAATATCATATGTAGAAGAAAAAGGTGGTTATCATCCTCAATTACTTAGTCAATATAGA



AAACTTATGGAACAACTACCAGCTTTGCGTACCAAGGTATTGGACCCTGAGATTGAAATGACACAT



ATGTCCGAAGCAGTTCGCTCTTATATAATGATACATAAATCTGTTGCGGGTTTTTTACCGGATTTATT



AGAATCATTAGATAGAGACAAGGATCGTGAGTTTCTGCTTAGTATTTTTCCAAGAATCCCAAAAATT



GATAAAAAAACCAAGGAAATTGTAGCTTGGGAACTGCCGGGAGAACCAGAAGAAGGTTATTTATTT



ACTGCTAATAACTTGTTCAGAAACTTCTTAGAGAATCCGAAACATGTCCCGAGATTTATGGCCGAAA



GGATCCCAGAAGATTGGACTCGATTACGCTCTGCTCCTGTCTGGTTCGATGGAATGGTAAAACAATG



GCAAAAAGTCGTTAACCAGTTAGTAGAATCACCAGGTGCTTTATATCAATTTAACGAATCCTTCTTG



AGACAAAGGTTACAGGCCATGTTAACTGTGTATAAGAGGGACTTACAAACTGAAAAATTTCTTAAA



CTTTTGGCGGATGTTTGTAGGCCTCTTGTAGATTTTTTTGGTTTGGGTGGAAATGATATTATTTTTAA



GAGCTGTCAAGACCCAAGAAAACAATGGCAAACCGTTATTCCTCTCTCTGTTCCGGCAGATGTCTAT



ACTGCTTGCGAAGGTTTGGCGATTAGACTAAGGGAGACATTAGGATTCGAATGGAAGAATTTGAAA



GGTCACGAGAGAGAAGATTTCTTAAGATTGCACCAGTTATTGGGCAATTTACTTTTCTGGATTCGTG



ATGCTAAATTGGTAGTAAAATTAGAGGATTGGATGAACAACCCATGTGTTCAGGAATATGTAGAAG



CCCGGAAAGCTATCGATCTTCCACTAGAAATATTCGGTTTTGAAGTGCCTATCTTCCTGAATGGCTA



TCTATTTTCGGAGTTGAGACAATTAGAACTTTTGCTTAGGAGAAAAAGTGTGATGACTAGCTACAGT



GTAAAGACTACTGGATCTCCTAATAGGCTATTTCAGCTAGTTTATTTACCTCTAAACCCTAGTGACC



CCGAAAAGAAGAACTCAAATAACTTTCAAGAACGTTTGGATACCCCAACTGGTTTGTCCCGTCGTTT



CCTAGACCTAACCCTTGATGCATTCGCAGGTAAGTTACTTACCGATCCAGTTACACAAGAATTGAAG



ACAATGGCAGGTTTTTACGATCATCTTTTTGGATTCAAATTGCCATGTAAACTCGCCGCCATGTCGA



ATCATCCAGGTTCTTCTTCAAAGATGGTTGTGTTAGCGAAACCCAAAAAAGGTGTTGCTTCTAATAT



AGGGTTTGAACCGATCCCAGATCCCGCTCATCCCGTATTTAGGGTTAGATCCAGTTGGCCAGAGTTG



AAGTACCTCGAGGGGCTATTGTATTTGCCAGAAGACACACCTTTGACCATCGAATTAGCAGAGACC



TCCGTATCGTGCCAAAGTGTCTCGTCAGTTGCATTCGATTTGAAAAACTTGACAACGATCTTAGGTC



GTGTGGGAGAATTTAGGGTCACAGCTGATCAACCCTTTAAACTAACGCCTATAATCCCGGAGAAAG



AAGAATCTTTTATTGGTAAAACTTATTTGGGTCTCGACGCGGGTGAAAGGAGCGGCGTCGGTTTCGC



TATTGTTACAGTGGACGGAGATGGGTACGAAGTGCAAAGATTGGGGGTCCACGAGGATACACAGCT



TATGGCCTTGCAGCAAGTTGCTAGTAAATCCTTAAAAGAGCCAGTATTTCAGCCTCTAAGAAAAGG



CACCTTTAGACAACAAGAAAGAATACGGAAATCCTTACGTGGTTGCTACTGGAATTTTTATCATGCC



TTGATGATAAAATATAGGGCCAAAGTAGTACATGAGGAATCTGTCGGAAGTAGTGGTCTTGTGGGT



CAATGGTTGAGGGCTTTTCAGAAGGATTTGAAGAAAGCCGATGTTCTCCCCAAGAAGGGCGGTAAA



AACGGTGTAGATAAGAAGAAGAGAGAGTCCTCAGCTCAAGACACTCTTTGGGGTGGTGCTTTCTCT



AAAAAGGAGGAGCAACAGATTGCGTTTGAGGTGCAAGCTGCAGGTTCTTCGCAATTTTGTTTGAAG



TGCGGATGGTGGTTCCAACTAGGCATGCGTGAAGTAAACAGGGTACAAGAATCGGGCGTCGTGTTA



GATTGGAATAGAAGCATAGTTACCTTTTTAATAGAATCATCCGGCGAAAAAGTTTATGGTTTCTCCC



CACAGCAATTAGAGAAGGGTTTCAGACCAGACATCGAAACTTTTAAAAAGATGGTAAGAGACTTTA



TGAGACCTCCTATGTTTGATAGAAAAGGCAGACCGGCCGCAGCTTACGAGAGATTTGTTTTAGGAA



GGAGACATCGAAGGTACAGGTTTGATAAAGTATTTGAGGAAAGATTTGGGAGGTCTGCTCTTTTCA



TTTGTCCTAGAGTAGGTTGTGGAAATTTTGACCACAGCTCCGAACAGTCCGCGGTTGTTTTGGCCTT



GATCGGATATATTGCCGATAAGGAGGGAATGTCAGGTAAGAAGTTGGTTTATGTACGGCTGGCCGA



ACTTATGGCCGAATGGAAACTAAAAAAATTAGAAAGATCCAGAGTTGAAGAACAATCATCCGCTCA



ATAA





SEQ ID NO: 144
ATGGCAGAAAGCAAACAAATGCAGTGTAGGAAATGTGGAGCTAGTATGAAGTACGAAGTCATCGG



TTTGGGTAAAAAGTCATGTAGATACATGTGTCCCGATTGTGGCAACCATACCTCGGCAAGAAAGAT



ACAAAACAAAAAAAAAAGAGATAAAAAATATGGGTCAGCCAGTAAAGCCCAATCTCAAAGAATTG



CTGTAGCAGGTGCTCTTTACCCTGACAAAAAAGTACAAACTATCAAAACCTATAAATATCCAGCAG



ACTTGAATGGTGAGGTGCATGATAGCGGTGTTGCCGAGAAAATCGCACAAGCAATACAAGAGGAC



GAGATTGGACTTTTGGGACCAAGCTCAGAATATGCATGCTGGATTGCATCTCAAAAACAGTCTGAG



CCTTACAGTGTAGTCGATTTCTGGTTTGATGCAGTGTGCGCAGGGGGAGTCTTCGCCTACTCTGGCG



CTAGATTATTGAGTACAGTTTTACAGTTATCCGGTGAGGAATCGGTGCTTAGAGCTGCCTTAGCCTC



GTCTCCATTCGTTGACGATATAAACTTAGCGCAAGCCGAAAAGTTTTTGGCGGTTAGCAGGCGTACA



GGTCAAGATAAGTTAGGTAAGAGAATTGGGGAGTGCTTTGCAGAAGGAAGATTGGAAGCTTTAGG



GATAAAAGATAGAATGAGGGAATTTGTTCAAGCTATCGATGTTGCACAGACCGCCGGACAACGTTT



CGCTGCCAAATTGAAGATATTCGGTATAAGTCAGATGCCAGAAGCTAAGCAATGGAATAACGATTC



CGGACTGACTGTCTGTATACTACCTGATTATTATGTTCCCGAAGAGAATCGCGCGGACCAACTTGTA



GTGTTGTTAAGAAGACTTCGCGAGATTGCATATTGCATGGGTATTGAAGATGAAGCGGGTTTCGAA



CATCTTGGAATAGATCCTGGTGCTCTTTCGAATTTTTCAAACGGTAACCCTAAGAGAGGATTTCTAG



GGAGGCTGTTAAATAACGATATTATTGCGTTGGCAAACAATATGAGTGCGATGACTCCATATTGGG



AAGGGCGTAAGGGTGAACTCATAGAAAGGCTTGCGTGGTTAAAGCACAGGGCAGAAGGGCTGTAT



CTTAAAGAACCTCATTTCGGTAACTCCTGGGCCGATCATAGGTCACGAATTTTCTCAAGGATCGCAG



GCTGGTTATCTGGTTGCGCTGGCAAGTTGAAAATTGCGAAAGACCAAATTTCTGGAGTACGTACAG



ATCTATTTCTGCTAAAAAGACTGCTGGACGCAGTTCCGCAATCGGCGCCATCCCCCGATTTTATTGC



GTCAATTTCGGCACTTGACAGGTTTTTAGAAGCTGCAGAATCGAGCCAGGACCCTGCTGAACAAGT



GAGGGCTCTCTACGCTTTTCACTTGAACGCACCTGCAGTCCGAAGTATAGCCAATAAAGCAGTGCA



AAGGTCCGACAGCCAAGAATGGCTGATAAAAGAACTAGACGCTGTTGACCATTTAGAATTTAACAA



AGCGTTCCCATTTTTCTCTGACACAGGAAAAAAAAAAAAAAAAGGTGCTAATAGCAACGGTGCTCC



ATCGGAAGAAGAGTACACTGAAACGGAATCAATACAACAACCTGAGGACGCGGAACAGGAAGTAA



ACGGACAAGAAGGGAACGGAGCGTCTAAAAATCAAAAGAAATTTCAAAGAATACCTAGATTCTTC



GGTGAAGGCTCCAGATCTGAATACAGAATTTTAACGGAAGCTCCACAGTATTTCGATATGTTTTGTA



ATAACATGAGGGCTATATTTATGCAGTTAGAAAGTCAACCCCGTAAAGCTCCCAGAGATTTTAAAT



GTTTCCTACAAAATCGATTACAAAAATTATACAAACAGACTTTCTTGAATGCACGAAGCAACAAGT



GTCGCGCTCTGCTTGAGTCAGTTTTAATCTCTTGGGGAGAATTTTATACATACGGTGCCAACGAAAA



GAAATTTAGATTAAGACATGAAGCTTCAGAACGCAGCAGTGACCCAGATTACGTAGTTCAGCAAGC



CTTGGAAATCGCGCGTCGTCTATTCCTTTTTGGCTTCGAATGGAGAGATTGCTCCGCTGGTGAAAGA



GTGGATTTGGTTGAAATTCACAAAAAGGCTATCAGTTTTTTGTTGGCTATTACTCAAGCTGAGGTCT



CTGTTGGTTCATACAATTGGCTTGGCAACTCAACAGTATCGAGATATTTATCCGTTGCGGGAACTGA



TACCTTATACGGTACCCAATTGGAAGAATTCCTGAACGCTACAGTGTTGAGTCAAATGCGTGGTCTG



GCCATTAGATTGAGTTCTCAAGAACTTAAGGACGGTTTTGATGTGCAGCTCGAGTCTTCCTGCCAGG



ACAATCTGCAACACCTATTGGTGTATAGGGCTTCGAGAGATTTGGCGGCTTGCAAGCGCGCTACTTG



TCCAGCCGAACTCGATCCTAAGATTTTAGTTTTACCGGTAGGTGCATTCATCGCTTCCGTAATGAAA



ATGATAGAAAGAGGTGACGAACCTTTAGCTGGTGCTTATTTACGGCATAGGCCACACTCTTTCGGAT



GGCAAATTAGGGTCCGCGGTGTTGCTGAGGTAGGGATGGATCAGGGTACAGCATTGGCCTTTCAAA



AGCCAACAGAGTCAGAACCTTTTAAAATTAAGCCCTTCTCTGCACAGTATGGACCAGTTCTGTGGTT



GAACAGTAGTAGTTATTCTCAATCACAATATTTGGACGGTTTTCTATCTCAACCAAAAAATTGGAGT



ATGAGGGTGTTGCCTCAGGCGGGTTCAGTTCGCGTCGAACAACGAGTTGCTTTGATATGGAACTTAC



AAGCAGGCAAGATGAGACTAGAACGCTCCGGTGCGAGGGCCTTTTTCATGCCTGTACCGTTTTCATT



TAGGCCATCCGGCAGTGGGGACGAAGCAGTTTTGGCGCCCAACCGGTACTTGGGTCTGTTCCCTCAT



TCCGGAGGTATAGAATACGCTGTAGTGGATGTCCTGGATTCTGCTGGATTTAAAATTCTTGAAAGAG



GCACTATTGCTGTCAATGGTTTCTCTCAGAAAAGGGGAGAGCGCCAAGAAGAAGCCCATCGTGAAA



AACAAAGAAGGGGGATAAGTGATATAGGGCGAAAGAAGCCTGTGCAGGCAGAAGTCGATGCGGCG



AACGAATTGCATAGAAAGTACACTGATGTTGCCACAAGATTAGGTTGTAGAATCGTCGTTCAATGG



GCACCACAACCTAAACCAGGGACAGCACCGACAGCGCAAACTGTTTACGCGAGGGCTGTTAGGAC



AGAAGCTCCGAGGAGCGGCAACCAAGAAGATCATGCAAGAATGAAAAGTTCTTGGGGTTACACCT



GGGGTACGTATTGGGAGAAACGAAAACCAGAAGATATTTTAGGGATTTCTACACAGGTGTATTGGA



CAGGAGGTATAGGCGAATCCTGTCCTGCTGTAGCAGTCGCTTTATTAGGTCATATTAGAGCAACTTC



AACACAAACGGAGTGGGAAAAGGAAGAAGTTGTCTTTGGAAGACTGAAGAAGTTCTTTCCGAGTTA



A





SEQ ID NO: 145
ATGGAGAAGAGAATTAATAAGATACGGAAAAAATTATCTGCGGATAATGCAACAAAGCCAGTCTCT



CGTTCAGGCCCCATGAAAACCCTGCTTGTAAGAGTAATGACGGATGATTTAAAAAAGAGGTTGGAA



AAGCGTAGAAAAAAACCAGAAGTGATGCCGCAAGTGATCTCAAATAACGCAGCTAATAATCTAAG



GATGCTACTTGATGATTATACAAAAATGAAAGAAGCAATCCTGCAAGTTTACTGGCAGGAATTCAA



GGATGACCATGTTGGACTAATGTGCAAATTCGCACAACCAGCGTCTAAGAAAATTGACCAAAATAA



ATTGAAACCCGAAATGGACGAAAAAGGGAATTTAACAACTGCCGGGTTTGCCTGCTCGCAATGTGG



GCAACCATTATTTGTTTATAAATTAGAGCAGGTTTCGGAAAAAGGAAAGGCTTACACAAATTACTTC



GGCAGATGTAATGTTGCCGAACACGAAAAACTCATATTGTTAGCTCAGTTGAAGCCTGAGAAAGAC



TCTGATGAGGCCGTTACTTACTCGTTGGGGAAGTTTGGTCAAAGAGCTCTCGATTTTTATTCTATTCA



TGTGACAAAGGAGTCCACACATCCCGTCAAGCCCTTGGCACAAATTGCGGGTAATAGATACGCTTC



GGGTCCAGTTGGGAAGGCCCTTTCTGATGCATGTATGGGCACAATTGCTAGCTTTCTTAGTAAATAC



CAGGATATCATAATAGAGCATCAAAAAGTTGTAAAGGGTAACCAAAAGAGATTAGAATCGCTGCGT



GAGTTGGCGGGTAAAGAAAACTTGGAATATCCATCTGTCACTCTGCCTCCTCAACCTCATACTAAGG



AAGGTGTAGATGCGTACAATGAAGTTATCGCTAGAGTCCGTATGTGGGTGAATTTAAATTTGTGGC



AAAAATTGAAGTTATCGCGTGATGATGCAAAACCTCTTCTTAGACTAAAGGGCTTTCCTAGCTTCCC



TGTAGTGGAAAGACGCGAAAATGAAGTCGATTGGTGGAATACAATTAACGAAGTCAAAAAACTGA



TCGATGCAAAGCGAGATATGGGTCGAGTTTTTTGGTCTGGTGTTACAGCTGAAAAAAGGAATACGA



TCTTAGAAGGTTACAACTACTTGCCAAATGAGAACGATCATAAAAAAAGAGAAGGCAGTTTAGAAA



ATCCAAAAAAGCCAGCTAAGAGACAATTTGGTGATTTGCTACTTTACCTAGAAAAAAAGTACGCCG



GAGATTGGGGGAAAGTCTTTGACGAAGCTTGGGAGAGAATAGATAAAAAAATAGCAGGATTGACG



TCACACATTGAAAGAGAAGAGGCGAGAAATGCAGAAGATGCTCAGTCCAAAGCTGTCCTCACCGA



CTGGTTGAGAGCCAAAGCGTCCTTTGTTCTCGAACGCCTAAAAGAAATGGATGAGAAGGAATTTTA



TGCCTGCGAAATCCAGCTACAAAAATGGTACGGAGACTTGAGAGGTAACCCCTTTGCCGTGGAAGC



AGAGAACCGTGTTGTAGATATCTCCGGTTTCTCAATCGGTAGCGATGGACACTCCATTCAGTATCGC



AACTTGTTGGCCTGGAAATATTTGGAAAACGGTAAGAGGGAATTCTATTTACTTATGAATTATGGCA



AGAAAGGTAGAATCAGGTTTACTGACGGAACAGACATTAAAAAGAGTGGTAAGTGGCAAGGCCTT



TTGTACGGTGGTGGCAAGGCCAAAGTAATAGACTTAACATTTGACCCCGACGACGAACAACTGATA



ATACTGCCTTTAGCTTTTGGTACTCGACAGGGGCGAGAGTTCATTTGGAATGATCTTTTGTCACTCG



AGACTGGTTTGATAAAACTTGCAAATGGAAGAGTCATCGAGAAGACAATTTACAACAAAAAGATA



GGTCGCGATGAGCCTGCACTATTTGTGGCCTTGACCTTTGAGAGAAGGGAAGTTGTCGACCCATCCA



ATATTAAACCAGTCAACCTAATCGGTGTAGATAGAGGTGAAAACATCCCAGCTGTTATCGCTCTGA



CAGACCCTGAAGGTTGCCCTTTGCCAGAATTTAAAGATTCGTCTGGTGGACCAACAGATATATTACG



TATTGGGGAAGGCTATAAAGAGAAACAACGTGCTATTCAGGCTGCAAAAGAAGTTGAACAGAGGA



GAGCTGGAGGTTACAGTAGAAAATTCGCCAGTAAAAGTAGAAACTTAGCAGATGACATGGTTAGA



AACTCTGCCCGGGATTTGTTCTATCATGCGGTTACTCACGATGCAGTCTTAGTCTTTGAAAATCTATC



GCGCGGTTTTGGTAGGCAAGGCAAGAGGACTTTTATGACAGAGAGACAATATACAAAAATGGAAG



ATTGGTTAACCGCGAAGCTCGCATATGAAGGTCTTACTTCGAAAACGTACCTCAGCAAAACGCTGG



CTCAATATACTTCTAAAACTTGTTCAAATTGTGGTTTTACTATTACCACGGCAGACTACGACGGGAT



GTTGGTGAGATTGAAGAAGACGAGCGATGGTTGGGCAACAACATTGAATAATAAGGAATTAAAAG



CAGAAGGACAGATTACGTATTACAATCGTTATAAACGCCAAACGGTTGAGAAAGAGTTGTCAGCCG



AGTTGGATAGACTAAGTGAAGAGAGCGGTAACAATGATATCTCAAAGTGGACTAAAGGGAGGCGG



GATGAAGCCCTCTTTTTACTAAAGAAGAGATTCTCACATAGACCTGTGCAAGAACAATTCGTTTGTT



TAGATTGTGGCCATGAGGTTCATGCAGACGAACAGGCTGCGTTAAATATTGCGAGAAGCTGGCTAT



TTCTAAATTCTAATTCAACAGAGTTCAAGAGCTATAAATCCGGAAAACAACCTTTCGTAGGCGCGTG



GCAAGCCTTCTATAAAAGGAGATTAAAAGAGGTTTGGAAACCAAATGCA





SEQ ID NO: 146
ATGAAAAGAATTAACAAAATTAGAAGGAGGCTGGTCAAAGATTCTAATACCAAGAAAGCTGGTAA



GACTGGTCCGATGAAAACCCTATTAGTCAGAGTTATGACCCCAGATTTGAGAGAAAGATTGGAGAA



CCTCAGGAAAAAGCCCGAAAACATCCCACAACCCATTAGTAACACATCAAGAGCTAATTTAAACAA



GTTATTAACTGACTACACTGAAATGAAAAAAGCAATATTGCATGTTTACTGGGAAGAGTTCCAGAA



AGATCCTGTTGGGTTGATGTCTAGAGTTGCTCAACCGGCCCCAAAGAATATAGATCAAAGGAAACT



TATTCCTGTGAAGGACGGCAATGAAAGATTAACCAGCTCCGGTTTCGCTTGCTCCCAGTGCTGCCAA



CCCCTGTATGTATACAAACTGGAACAAGTAAATGATAAAGGTAAGCCACATACTAACTACTTTGGT



AGGTGTAATGTATCCGAGCATGAAAGATTGATCTTGTTAAGTCCCCATAAACCAGAAGCTAATGAT



GAGTTAGTAACTTATAGTTTAGGTAAGTTCGGACAACGAGCTTTAGATTTCTATAGCATCCATGTTA



CAAGAGAAAGCAATCACCCCGTCAAACCACTGGAACAAATCGGTGGTAATAGTTGTGCGTCAGGTC



CAGTAGGCAAAGCTTTATCAGACGCTTGCATGGGTGCCGTGGCTAGTTTTTTGACGAAATACCAAG



ATATTATACTGGAACATCAAAAGGTAATTAAAAAGAATGAAAAGAGACTCGCTAACTTAAAAGATA



TTGCAAGTGCCAATGGTTTAGCTTTTCCTAAAATTACCTTGCCACCTCAGCCACATACAAAGGAGGG



AATTGAAGCTTACAATAATGTAGTAGCCCAAATAGTTATTTGGGTGAACCTTAACCTATGGCAAAA



GTTAAAAATTGGTAGAGACGAAGCCAAACCCCTGCAGAGGCTGAAGGGTTTTCCCTCCTTCCCCTTA



GTAGAGAGACAAGCTAATGAAGTGGACTGGTGGGATATGGTGTGCAATGTTAAAAAATTGATTAAT



GAGAAGAAAGAGGATGGTAAAGTGTTTTGGCAGAATCTTGCTGGCTACAAGAGACAGGAAGCTTTA



CTGCCTTATTTATCTTCTGAGGAAGATAGGAAAAAAGGTAAAAAATTTGCTAGATATCAATTCGGA



GACCTACTTCTGCATTTAGAAAAAAAACATGGCGAAGATTGGGGTAAAGTTTATGATGAAGCCTGG



GAAAGAATTGATAAGAAGGTAGAAGGTCTCTCCAAACATATTAAATTAGAGGAAGAACGTAGGTC



CGAAGACGCTCAATCAAAGGCAGCATTAACTGATTGGTTGAGAGCAAAAGCCTCTTTCGTTATTGA



AGGATTAAAAGAAGCCGACAAAGATGAATTTTGTAGATGTGAGTTAAAGTTGCAAAAGTGGTATGG



AGACCTCCGTGGTAAACCTTTTGCTATTGAGGCTGAAAATTCTATACTCGATATCTCTGGATTTTCA



AAACAATATAACTGCGCATTTATATGGCAGAAAGATGGTGTTAAAAAGCTAAATCTATACTTAATT



ATCAATTACTTTAAAGGTGGTAAATTGCGTTTTAAGAAGATAAAGCCTGAAGCCTTTGAGGCAAAC



CGTTTTTACACTGTTATCAATAAAAAATCTGGGGAAATCGTACCAATGGAAGTTAATTTCAATTTCG



ATGATCCTAATCTTATTATTTTACCTCTTGCTTTCGGCAAAAGGCAAGGTAGGGAGTTTATTTGGAA



TGATTTATTGTCGCTGGAAACGGGGTCTCTCAAACTCGCAAACGGTAGGGTGATAGAAAAAACATT



ATACAACAGGAGAACTCGGCAGGATGAGCCAGCTCTTTTTGTGGCTCTGACATTCGAGAGAAGGGA



AGTTTTAGATTCATCTAACATCAAACCAATGAATTTAATAGGTATTGACCGGGGTGAAAATATACCT



GCAGTTATTGCTTTAACTGATCCTGAGGGATGTCCTCTTAGCAGATTCAAGGACTCGTTGGGTAACC



CTACTCACATCTTAAGGATTGGAGAAAGTTACAAGGAGAAACAAAGGACAATACAAGCTGCTAAA



GAAGTAGAACAAAGGAGGGCGGGTGGATATAGTCGGAAATATGCCAGCAAGGCCAAGAATTTAGC



TGACGACATGGTTAGGAATACAGCTAGAGACCTTTTATACTATGCCGTCACCCAGGATGCCATGTTG



ATATTTGAAAATTTAAGTAGAGGCTTCGGTAGACAAGGTAAGCGCACCTTCATGGCAGAGAGACAA



TATACTAGAATGGAAGATTGGTTGACTGCCAAATTGGCATACGAAGGTCTACCTAGTAAGACGTAC



TTATCTAAAACACTAGCGCAGTATACTTCCAAGACATGCAGTAATTGTGGTTTCACAATCACTTCTG



CCGATTACGATCGCGTCTTGGAAAAACTAAAAAAAACAGCGACAGGTTGGATGACTACTATTAATG



GGAAAGAATTGAAGGTCGAAGGACAAATAACTTACTATAATAGATATAAACGGCAAAACGTTGTA



AAAGACCTGTCAGTCGAACTCGATCGACTTAGTGAAGAATCTGTTAATAATGATATTAGTTCGTGGA



CAAAAGGTAGATCCGGTGAAGCTTTGAGCCTCCTGAAAAAACGTTTTAGCCATAGGCCTGTCCAAG



AAAAGTTTGTATGTTTAAACTGTGGTTTTGAGACCCATGCAGACGAGCAGGCCGCTCTTAATATTGC



TAGATCATGGTTATTTTTAAGATCTCAGGAATACAAGAAGTACCAGACTAACAAGACAACAGGCAA



CACAGATAAGCGAGCATTCGTTGAGACTTGGCAATCTTTTTATAGAAAGAAATTGAAGGAAGTCTG



GAAACCA





SEQ ID NO: 147
ATGGGAAAAATGTATTATCTAGGCCTGGACATAGGGACCAATTCAGTAGGCTACGCTGTCACTGAC



CCCTCCTACCATTTGCTGAAGTTCAAGGGGGAACCCATGTGGGGAGCACACGTGTTTGCGGCCGGC



AACCAGAGCGCAGAGCGGAGAAGCTTCCGCACCTCCAGGAGAAGGCTGGATCGCAGGCAGCAGCG



TGTGAAGCTGGTCCAAGAGATATTTGCCCCAGTGATTTCCCCCATCGATCCGCGCTTCTTTATTAGG



CTCCACGAGTCCGCTCTCTGGCGCGACGACGTGGCCGAAACTGATAAACATATTTTCTTTAATGACC



CAACATACACTGACAAGGAGTACTATTCAGATTACCCAACAATTCACCATTTGATCGTGGACCTTAT



GGAAAGTTCGGAGAAGCATGATCCTCGACTTGTCTATTTGGCCGTGGCGTGGCTCGTGGCACATAG



GGGCCACTTCTTGAACGAGGTGGACAAGGATAACATCGGGGATGTGTTATCTTTCGACGCTTTCTAT



CCTGAATTCCTTGCTTTTCTGTCTGACAATGGCGTCAGCCCGTGGGTCTGCGAATCCAAGGCCCTCC



AGGCTACGCTATTGTCAAGAAATAGCGTGAACGACAAGTACAAGGCTCTTAAGTCTTTGATTTTTGG



AAGCCAGAAGCCCGAGGACAACTTTGATGCAAATATCTCGGAGGACGGGCTGATTCAGCTCCTCGC



TGGGAAAAAGGTCAAGGTCAATAAGCTGTTTCCACAGGAGTCAAATGACGCGAGCTTCACCCTTAA



CGACAAAGAGGATGCCATTGAAGAGATCCTGGGGACACTCACCCCAGACGAGTGCGAGTGGATAG



CCCATATTAGGCGCCTCTTTGATTGGGCCATAATGAAACATGCGCTTAAGGACGGGCGCACGATAT



CCGAAAGCAAGGTCAAATTGTACGAGCAGCACCACCATGATCTGACCCAGCTAAAATATTTTGTAA



AAACATATCTGGCCAAGGAGTACGATGATATCTTCCGCAACGTGGATAGTGAGACCACCAAAAACT



ACGTCGCGTACTCATACCACGTGAAAGAAGTTAAGGGCACGCTGCCTAAGAACAAGGCAACACAA



GAGGAGTTCTGCAAGTACGTTCTCGGGAAAGTTAAAAATATAGAGTGCAGCGAGGCCGACAAAGT



GGATTTTGACGAGATGATTCAACGCCTGACCGACAATTCGTTTATGCCTAAACAGGTGAGTGGAGA



GAATCGCGTGATTCCATATCAGCTCTATTACTATGAACTCAAGACTATTCTGAATAAGGCCGCTAGC



TATTTACCCTTCCTTACGCAGTGCGGGAAGGATGCCATTTCTAACCAGGATAAACTCTTGAGTATAA



TGACATTTCGAATTCCCTATTTCGTGGGTCCGCTTCGTAAGGATAACAGTGAGCACGCTTGGCTGGA



GCGGAAGGCTGGCAAAATTTATCCATGGAATTTCAACGACAAGGTGGATCTGGACAAATCCGAAGA



AGCCTTTATCCGCAGGATGACCAATACTTGCACATACTATCCTGGGGAGGATGTCCTTCCACTGGAC



TCTCTGATCTACGAAAAGTTCATGATTTTGAATGAAATTAACAACATAAGGATCGATGGGTATCCTA



TTTCCGTCGACGTGAAGCAGCAGGTGTTCGGGCTCTTTGAGAAGAAGCGACGGGTGACCGTGAAGG



ATATTCAGAATCTTCTCTTATCGCTGGGAGCCCTGGATAAACACGGAAAACTGACCGGGATAGATA



CTACGATTCATTCTAATTACAACACGTATCACCATTTTAAGTCACTGATGGAGAGGGGCGTCCTAAC



AAGAGATGACGTGGAGAGAATAGTGGAACGAATGACATATTCTGATGACACCAAGAGAGTGCGGC



TTTGGCTGAATAACAACTACGGCACTCTGACGGCGGATGATGTAAAGCATATTTCCCGACTCCGTAA



GCATGACTTCGGGCGGCTGTCTAAGATGTTTCTAACAGGCCTCAAGGGTGTGCATAAGGAAACTGG



GGAGCGCGCTAGCATCCTGGATTTTATGTGGAACACCAATGATAACCTGATGCAGCTCCTGTCAGA



ATGCTACACATTTTCGGACGAAATCACCAAGCTGCAGGAGGCTTACTATGCCAAGGCCCAACTAAG



CTTGAATGATTTCCTGGATTCTATGTACATCAGCAACGCCGTAAAACGACCAATTTATAGGACACTG



GCAGTGGTTAACGACATTAGGAAAGCATGCGGAACAGCTCCCAAGCGAATCTTTATCGAGATGGCC



CGCGACGGCGAGAGTAAGAAGAAAAGGTCAGTGACTAGGCGGGAGCAGATCAAGAACCTTTACCG



CTCTATCCGAAAAGACTTCCAGCAAGAGGTTGATTTCCTTGAGAAGATCTTAGAGAACAAGTCAGA



TGGACAGCTCCAATCCGATGCTCTGTATCTGTACTTCGCTCAGCTGGGACGAGATATGTACACTGGC



GACCCCATTAAACTAGAACATATCAAGGACCAATCGTTTTATAATATCGACCACATCTACCCTCAGT



CCATGGTGAAAGACGATAGTCTGGACAATAAGGTGCTCGTCCAAAGTGAGATTAACGGAGAAAAG



TCGAGCAGATATCCTTTGGACGCTGCGATCCGCAACAAGATGAAGCCCCTGTGGGATGCTTACTAC



AATCATGGACTGATCAGCCTGAAGAAGTATCAGAGACTGACCCGGAGTACCCCTTTCACAGACGAT



GAGAAGTGGGATTTTATCAATAGACAACTGGTGGAAACCAGGCAGTCCACGAAAGCTCTGGCCATT



CTTCTGAAGAGAAAGTTTCCAGACACAGAGATCGTCTATTCAAAGGCCGGCCTCAGTTCCGACTTTA



GACATGAGTTCGGACTCGTTAAATCACGAAATATAAACGATCTCCACCATGCAAAGGACGCATTCC



TCGCGATTGTGACTGGAAATGTCTATCACGAAAGATTTAATAGGCGGTGGTTCATGGTTAACCAGCC



ATACTCAGTGAAGACCAAGACCCTTTTCACTCACTCTATTAAAAATGGCAACTTCGTGGCTTGGAAT



GGTGAGGAGGATCTTGGAAGAATTGTGAAGATGTTAAAACAGAATAAGAATACCATCCACTTTACT



AGATTCAGCTTTGACCGAAAAGAGGGGCTATTCGATATTCAACCGTTAAAGGCTTCAACAGGTCTC



GTTCCACGAAAGGCCGGACTGGACGTAGTGAAATACGGCGGCTATGATAAGAGCACCGCAGCTTAC



TACCTCCTTGTGCGATTTACGCTCGAGGATAAGAAGACCCAACACAAGCTGATGATGATTCCCGTG



GAGGGACTGTACAAAGCTCGAATTGACCATGATAAAGAGTTTCTCACAGATTACGCACAAACCACC



ATCTCTGAGATTCTCCAGAAAGACAAACAAAAAGTTATAAACATAATGTTTCCAATGGGTACAAGG



CATATTAAACTGAACAGCATGATCTCCATTGATGGCTTTTATTTGTCCATTGGAGGAAAGTCTAGTA



AAGGCAAGTCTGTCCTCTGCCATGCCATGGTACCCCTAATCGTCCCACACAAGATTGAATGCTACAT



CAAGGCTATGGAGAGTTTTGCTCGGAAATTTAAAGAGAATAATAAGCTGCGTATTGTGGAAAAATT



CGACAAGATAACCGTTGAAGACAATCTGAATCTGTACGAGCTCTTTCTGCAGAAGCTGCAGCATAA



CCCCTATAATAAGTTCTTCTCCACACAGTTCGATGTACTGACCAACGGGCGATCAACTTTCACAAAG



CTAAGTCCTGAGGAACAGGTGCAAACACTCCTAAACATTCTTTCCATTTTTAAGACCTGCAGATCTT



CAGGATGCGACTTGAAGAGCATTAACGGGAGCGCACAGGCAGCTAGGATCATGATCTCAGCTGACC



TGACAGGGCTGAGTAAAAAATACTCCGACATTCGGCTTGTAGAGCAAAGCGCCAGTGGGTTGTTCG



TTAGTAAGTCGCAGAACCTGCTGGAATACCTGTAA





SEQ ID NO: 148
ATGTCTTCTTTGACGAAGTTTACAAACAAATACTCTAAGCAGCTTACAATTAAGAACGAACTGATTC



CCGTAGGAAAGACTCTGGAAAACATCAAAGAGAATGGGCTGATAGACGGCGACGAACAACTGAAT



GAGAACTATCAGAAGGCCAAAATTATCGTGGATGACTTCCTGAGGGATTTTATTAACAAGGCCCTG



AATAATACCCAGATCGGCAATTGGCGGGAACTGGCCGACGCTCTGAACAAAGAAGATGAGGACAA



TATCGAAAAATTACAAGACAAAATCAGGGGCATTATTGTCAGTAAGTTCGAGACATTCGATCTGTT



CTCTTCGTACTCCATTAAGAAGGACGAGAAAATCATCGATGATGACAATGACGTTGAGGAAGAAGA



ACTGGACTTGGGTAAAAAGACCTCATCCTTCAAGTATATTTTTAAAAAAAATCTGTTTAAATTAGTG



CTCCCCAGTTATTTAAAGACAACTAACCAGGACAAGCTTAAGATTATCTCCTCTTTTGACAACTTTA



GCACCTATTTTAGAGGCTTCTTTGAAAATCGCAAGAATATTTTCACTAAGAAGCCCATAAGCACCTC



TATTGCCTACAGAATCGTACATGATAACTTCCCAAAATTTTTGGATAACATTAGATGTTTTAATGTA



TGGCAGACCGAATGTCCTCAGTTAATTGTGAAGGCGGATAACTACCTCAAATCCAAGAATGTGATC



GCCAAAGATAAGTCTCTTGCTAACTACTTTACGGTCGGAGCCTACGATTACTTCTTATCTCAAAACG



GTATTGACTTTTACAATAACATTATCGGGGGATTGCCTGCCTTCGCCGGCCATGAGAAAATTCAGGG



CTTAAACGAGTTCATAAATCAGGAATGTCAAAAGGACTCAGAGCTGAAATCAAAGCTTAAGAATCG



ACACGCATTTAAAATGGCGGTCTTGTTCAAACAGATCCTCAGCGATAGAGAGAAAAGCTTCGTTAT



TGATGAATTCGAGAGCGACGCACAGGTGATTGATGCCGTGAAGAACTTCTATGCGGAACAGTGTAA



AGACAATAATGTTATTTTCAACCTATTAAACTTGATTAAGAATATCGCGTTTTTAAGTGACGATGAA



CTCGACGGTATCTTTATAGAAGGCAAGTACCTGTCCTCTGTCAGCCAAAAACTCTACTCAGATTGGT



CCAAGCTAAGAAATGACATCGAGGACAGTGCTAACAGCAAACAGGGCAATAAAGAGCTGGCAAAG



AAAATCAAGACTAATAAAGGGGATGTGGAGAAGGCGATATCTAAATATGAGTTCTCCCTCTCCGAA



CTGAACTCCATCGTCCACGATAATACCAAGTTTAGTGATCTGTTGTCGTGTACACTGCACAAAGTGG



CCAGTGAAAAACTCGTCAAGGTGAACGAAGGCGATTGGCCCAAACACCTGAAAAATAATGAGGAG



AAACAGAAGATCAAAGAACCTTTGGATGCGTTGCTCGAAATATATAACACACTGTTGATCTTCAAC



TGTAAAAGCTTCAACAAGAACGGGAACTTTTATGTAGACTACGATCGATGTATAAATGAACTGAGC



AGCGTCGTTTACCTGTACAACAAGACTCGCAATTATTGTACGAAAAAACCATATAACACCGATAAG



TTCAAGCTTAATTTCAACAGTCCCCAGCTGGGAGAAGGGTTCAGCAAATCAAAAGAAAACGATTGC



CTGACATTACTCTTTAAAAAGGATGATAATTATTATGTTGGGATTATTAGGAAAGGCGCTAAGATCA



ACTTTGACGACACACAGGCCATAGCTGACAACACTGATAACTGCATCTTTAAAATGAATTACTTTCT



GTTGAAGGACGCCAAAAAATTCATTCCAAAATGCTCTATTCAGCTCAAGGAGGTTAAGGCCCATTT



CAAGAAGTCTGAAGATGACTACATCCTCTCTGACAAGGAAAAATTCGCTAGTCCTCTGGTTATCAA



AAAAAGTACCTTCTTGCTGGCTACAGCTCACGTGAAAGGCAAGAAAGGGAACATTAAGAAGTTCCA



AAAGGAATACAGCAAAGAGAATCCAACCGAGTACAGAAATTCTCTGAACGAATGGATCGCATTCTG



TAAAGAATTTCTAAAGACGTACAAGGCCGCTACCATTTTCGATATTACCACCTTGAAAAAAGCCGA



GGAGTACGCCGACATCGTCGAATTCTATAAAGACGTGGATAACCTGTGTTACAAATTGGAATTCTG



CCCAATTAAGACCTCTTTCATTGAAAACCTCATCGACAATGGGGACCTCTACTTATTTAGAATTAAC



AATAAGGATTTTTCTTCGAAATCTACCGGAACTAAAAATCTGCACACACTGTATCTGCAAGCAATCT



TCGATGAACGTAATCTCAACAACCCTACAATAATGCTGAACGGCGGTGCTGAACTGTTCTACCGTA



AAGAGAGTATTGAACAGAAGAATCGAATCACACACAAAGCGGGCAGTATTCTCGTCAATAAGGTGT



GCAAAGACGGGACCAGCCTGGACGATAAGATCAGGAATGAAATATATCAGTATGAGAACAAGTTT



ATCGACACCTTGTCGGATGAGGCAAAGAAGGTGCTACCTAACGTTATCAAGAAGGAAGCTACCCAT



GACATAACCAAGGATAAGCGGTTCACTTCTGACAAGTTCTTCTTCCACTGTCCTCTGACCATTAACT



ACAAGGAAGGAGACACTAAACAATTCAATAATGAAGTACTTAGCTTTTTGCGGGGTAATCCCGATA



TTAACATAATTGGTATCGACCGGGGAGAACGGAACCTGATATACGTGACAGTAATTAATCAGAAAG



GAGAAATCCTGGATTCCGTATCCTTCAATACCGTGACTAATAAATCTAGTAAAATCGAGCAGACGG



TCGACTACGAGGAAAAGTTAGCAGTCAGAGAGAAGGAGAGAATCGAGGCCAAACGTTCCTGGGAT



AGTATCAGCAAGATTGCTACTCTGAAAGAAGGATATCTGTCCGCTATCGTCCATGAGATCTGTTTGT



TGATGATCAAGCACAATGCTATAGTGGTTCTGGAGAACCTGAACGCAGGCTTCAAGCGAATTAGAG



GGGGCCTGTCGGAAAAAAGCGTTTACCAGAAGTTTGAAAAGATGCTAATCAATAAGTTAAATTACT



TTGTAAGTAAAAAAGAAAGCGATTGGAATAAGCCATCAGGACTTTTAAACGGGCTGCAACTGAGCG



ACCAGTTTGAGTCATTCGAAAAACTGGGTATTCAGAGTGGTTTCATATTCTACGTACCTGCCGCTTA



CACTTCAAAGATCGATCCTACAACTGGTTTTGCGAATGTCCTGAATCTGTCTAAGGTGAGGAATGTG



GACGCAATCAAGTCTTTCTTCAGCAACTTCAACGAGATATCTTACAGCAAGAAAGAGGCTCTGTTTA



AATTCAGTTTTGATCTGGATAGCCTGAGCAAGAAAGGATTCTCTTCTTTCGTAAAGTTTTCTAAGTC



CAAATGGAACGTCTACACGTTCGGAGAGAGAATCATTAAACCAAAGAACAAGCAGGGGTATCGGG



AAGACAAAAGGATCAATCTGACTTTCGAAATGAAGAAACTATTGAATGAGTACAAAGTCTCATTCG



ATTTGGAGAACAATCTGATCCCCAATCTGACCAGCGCTAACCTCAAAGACACATTCTGGAAGGAGC



TGTTTTTCATCTTTAAGACCACCCTGCAGCTACGGAATAGTGTCACAAATGGGAAAGAGGATGTACT



GATCTCACCTGTGAAAAACGCCAAGGGGGAGTTCTTTGTGTCCGGCACCCATAACAAAACCCTGCC



TCAGGACTGTGACGCGAACGGGGCCTACCACATCGCGCTAAAGGGGTTAATGATTCTCGAACGTAA



TAATCTGGTGCGCGAAGAAAAAGACACAAAGAAAATTATGGCCATCAGCAACGTTGACTGGTTTGA



GTACGTGCAGAAGCGTCGAGGAGTTTTGTAA





SEQ ID NO: 149
ATGAACAACTATGACGAGTTCACTAAACTTTACCCCATTCAGAAAACCATCAGATTTGAACTGAAG



CCTCAGGGTCGTACCATGGAACACTTGGAAACTTTCAACTTTTTCGAGGAGGACAGGGATAGAGCT



GAGAAATACAAGATCTTGAAAGAGGCCATCGACGAGTATCACAAAAAATTCATCGATGAGCATCTC



ACCAACATGTCGCTGGATTGGAACAGTCTCAAGCAGATTTCCGAGAAGTACTATAAATCTCGGGAG



GAGAAAGATAAAAAGGTGTTTTTGAGCGAGCAAAAGCGAATGCGACAGGAGATAGTCTCTGAATTT



AAGAAAGATGATCGGTTTAAAGACCTATTTTCCAAAAAGCTTTTTTCAGAGCTGCTGAAGGAAGAG



ATCTATAAAAAAGGCAATCACCAAGAAATTGATGCCCTGAAATCATTCGACAAATTCAGTGGGTAT



TTCATAGGACTGCATGAGAACCGGAAGAATATGTATAGTGATGGAGACGAGATCACAGCCATAAGC



AATCGAATCGTTAACGAGAATTTCCCGAAGTTCCTGGATAACCTGCAGAAGTATCAAGAGGCTAGG



AAAAAGTACCCTGAGTGGATCATCAAGGCTGAATCAGCTCTGGTGGCTCACAATATCAAGATGGAT



GAAGTCTTTAGTCTTGAGTACTTTAATAAAGTCCTTAACCAGGAGGGCATCCAGCGCTATAACCTGG



CTCTCGGTGGCTACGTCACAAAAAGCGGAGAAAAGATGATGGGTCTCAACGATGCACTGAATTTGG



CTCATCAGTCGGAGAAGTCATCTAAGGGACGCATACACATGACACCACTGTTTAAACAAATCCTGA



GCGAAAAGGAATCATTTTCCTACATTCCCGACGTATTCACCGAGGACTCACAACTGCTGCCTAGTAT



AGGGGGGTTTTTCGCTCAGATAGAGAACGACAAAGATGGCAACATTTTTGACAGAGCCTTGGAGTT



GATTTCATCTTACGCCGAGTACGATACGGAGCGCATTTATATTCGCCAGGCGGATATCAACAGGGTT



TCCAATGTGATCTTTGGCGAGTGGGGAACGCTGGGCGGGCTGATGCGGGAATACAAAGCCGACTCG



ATCAATGACATCAACCTGGAGAGAACATGCAAGAAGGTCGATAAATGGTTGGATAGCAAAGAGTT



CGCCCTGAGTGACGTCTTGGAAGCTATCAAAAGAACCGGAAATAATGACGCGTTCAACGAGTATAT



CTCTAAAATGAGGACCGCGAGAGAAAAAATTGATGCAGCAAGGAAGGAGATGAAGTTTATATCTG



AGAAGATCTCAGGCGATGAAGAGTCCATCCATATTATTAAAACTCTTCTGGACTCAGTGCAGCAATT



CCTGCACTTTTTTAACCTCTTCAAGGCCAGGCAGGATATACCGTTAGACGGGGCTTTTTATGCCGAG



TTTGATGAAGTTCATTCGAAACTTTTTGCTATAGTGCCTCTCTATAATAAAGTTCGCAATTACCTGAC



AAAGAATAACTTAAACACAAAGAAAATCAAGCTCAACTTCAAAAACCCAACACTGGCAAACGGAT



GGGATCAGAACAAGGTATATGATTACGCCTCATTGATTTTCCTCCGGGACGGGAATTACTATCTGGG



GATCATCAACCCTAAGCGCAAAAAGAACATTAAGTTCGAACAGGGATCTGGCAATGGTCCCTTCTA



TAGGAAAATGGTATACAAACAGATTCCTGGCCCCAACAAGAATCTCCCACGCGTCTTTCTGACGTCC



ACTAAGGGAAAGAAGGAGTACAAGCCGTCTAAAGAAATTATCGAGGGCTATGAGGCAGACAAGCA



TATTAGGGGTGACAAGTTTGACCTAGACTTTTGTCATAAGCTTATCGACTTTTTCAAGGAGTCCATA



GAGAAGCACAAAGATTGGTCAAAGTTTAATTTCTATTTTTCTCCAACAGAGTCCTACGGGGATATCT



CTGAGTTCTATCTGGATGTTGAAAAGCAGGGGTACAGAATGCACTTCGAAAATATCTCAGCAGAAA



CTATCGATGAGTACGTAGAGAAAGGAGATCTGTTTCTTTTCCAAATCTACAATAAGGATTTTGTGAA



GGCCGCCACTGGGAAGAAGGACATGCACACTATTTACTGGAACGCTGCATTTTCCCCTGAAAATCT



GCAGGACGTAGTAGTGAAATTAAATGGTGAGGCAGAACTGTTTTACCGCGATAAATCAGACATCAA



GGAAATAGTGCACCGGGAAGGCGAGATTCTTGTTAACCGAACATATAATGGCAGGACACCTGTCCC



TGATAAAATTCATAAGAAACTGACCGATTACCACAACGGTCGAACCAAGGATCTGGGCGAGGCCAA



GGAATACCTCGATAAGGTGAGGTACTTCAAAGCCCATTATGACATCACCAAGGACCGAAGATACCT



TAACGACAAAATCTACTTCCATGTCCCACTCACCTTGAACTTCAAAGCTAACGGTAAGAAGAACCTC



AATAAAATGGTGATTGAAAAATTTCTGTCCGATGAGAAGGCCCATATCATCGGCATTGATCGCGGC



GAGAGAAATCTCCTTTACTATTCTATCATTGATCGGTCGGGAAAGATTATCGACCAACAATCACTGA



ATGTCATCGACGGATTCGACTATAGAGAGAAGCTGAACCAACGGGAAATCGAGATGAAGGACGCG



CGCCAGTCCTGGAACGCTATCGGCAAAATTAAAGATTTGAAAGAAGGTTACCTCTCCAAAGCAGTG



CACGAAATTACCAAAATGGCAATCCAGTACAATGCTATTGTGGTAATGGAGGAGTTAAATTACGGA



TTTAAGCGCGGGAGGTTCAAGGTTGAAAAGCAAATTTACCAAAAATTTGAGAACATGTTGATTGAT



AAGATGAACTACCTGGTGTTCAAGGACGCACCTGACGAGTCGCCAGGCGGCGTGTTAAATGCATAT



CAGCTGACAAATCCACTGGAGAGCTTTGCCAAGCTAGGAAAGCAGACTGGCATTCTCTTTTACGTCC



CTGCAGCGTATACATCCAAAATTGACCCCACCACTGGCTTCGTCAATCTGTTTAACACCTCCTCCAA



AACCAACGCACAAGAACGGAAAGAATTTTTGCAAAAGTTTGAGTCCATTAGCTACTCTGCCAAAGA



CGGCGGGATCTTTGCTTTCGCATTCGACTACAGGAAATTCGGGACGAGTAAGACAGACCACAAGAA



CGTCTGGACCGCGTACACTAATGGGGAACGCATGCGCTACATCAAAGAGAAAAAGAGGAATGAAC



TTTTTGACCCTTCAAAGGAAATCAAGGAAGCTCTCACCTCAAGCGGTATCAAATACGATGGCGGGC



AGAATATTTTGCCAGATATCCTCAGATCGAACAATAATGGACTTATCTATACTATGTACTCCTCCTT



CATTGCAGCAATTCAAATGAGAGTGTACGATGGAAAGGAGGATTACATTATATCGCCAATTAAGAA



CTCCAAAGGCGAATTCTTCCGCACGGATCCTAAGCGAAGAGAACTCCCAATCGACGCTGATGCGAA



CGGCGCCTATAATATAGCCCTGCGGGGTGAATTAACAATGCGCGCTATTGCCGAGAAGTTCGACCC



CGATTCAGAAAAAATGGCTAAGCTTGAGCTGAAACACAAAGATTGGTTCGAATTCATGCAGACAAG



AGGCGACTAA





SEQ ID NO: 150
ATGACTAAGACCTTCGATTCCGAGTTCTTCAACCTTTATTCCCTGCAGAAAACTGTAAGGTTTGAGC



TGAAGCCGGTGGGCGAGACAGCCAGCTTCGTAGAGGATTTCAAGAATGAGGGTCTCAAACGGGTA



GTTAGTGAGGATGAGAGGAGAGCAGTGGACTATCAGAAGGTGAAAGAGATCATCGATGACTATCA



CCGGGATTTCATAGAGGAGTCGTTGAATTACTTCCCTGAGCAAGTATCCAAAGACGCGCTGGAACA



GGCCTTTCATCTTTACCAGAAACTGAAGGCAGCGAAGGTTGAGGAGCGGGAAAAGGCCTTGAAAG



AGTGGGAAGCCCTGCAGAAAAAGCTCAGAGAAAAGGTTGTCAAATGCTTCAGCGACAGCAACAAA



GCCAGGTTCAGTAGGATCGATAAGAAAGAACTGATCAAAGAAGACTTGATCAATTGGCTGGTTGCA



CAGAACCGGGAAGATGATATTCCCACCGTAGAGACCTTCAACAACTTCACAACTTACTTCACCGGC



TTCCATGAGAATCGTAAAAACATCTACAGTAAAGATGATCATGCAACCGCCATCTCCTTCCGGTTGA



TCCACGAGAATCTCCCCAAGTTCTTTGACAACGTGATAAGTTTCAATAAGTTGAAAGAGGGATTTCC



CGAACTCAAGTTCGATAAAGTGAAGGAGGATCTGGAAGTGGATTATGACCTTAAGCACGCTTTCGA



GATAGAGTACTTCGTGAACTTTGTGACTCAGGCCGGCATCGATCAGTATAACTACCTCCTCGGGGGT



AAGACGCTCGAGGACGGTACTAAGAAGCAAGGAATGAATGAGCAAATTAATCTATTTAAACAGCA



GCAGACCAGGGATAAGGCTAGACAGATCCCCAAGCTTATTCCTCTTTTTAAACAGATCCTAAGTGA



AAGGACAGAAAGTCAAAGCTTCATACCTAAGCAATTTGAAAGTGATCAGGAGCTGTTTGACTCCCT



GCAAAAGCTGCACAACAATTGCCAGGACAAGTTTACCGTGCTGCAGCAGGCTATCCTCGGACTGGC



TGAGGCGGATCTTAAGAAGGTATTCATTAAGACTAGCGACCTCAATGCCCTTAGTAACACCATCTTT



GGAAATTACTCCGTTTTCAGCGATGCCCTCAATCTATACAAAGAGAGCTTGAAGACTAAAAAAGCT



CAGGAAGCTTTTGAAAAATTACCGGCACATTCTATACACGACCTTATACAATACTTAGAGCAGTTCA



ACAGCAGCCTCGACGCTGAGAAACAGCAATCCACAGACACCGTCCTGAATTACTTCATCAAAACCG



ATGAACTGTACTCCCGATTTATCAAGAGCACTTCAGAAGCCTTCACGCAAGTTCAGCCTCTGTTCGA



GCTGGAGGCACTGTCCAGCAAGAGACGACCGCCAGAGTCTGAAGACGAGGGAGCCAAGGGTCAAG



AGGGGTTTGAACAGATAAAGCGAATTAAGGCTTACTTGGATACTCTCATGGAGGCGGTGCATTTCG



CTAAGCCTTTGTACCTGGTTAAAGGCCGAAAAATGATTGAGGGGCTAGATAAGGATCAGTCTTTTTA



CGAGGCTTTTGAAATGGCCTACCAGGAATTGGAATCCTTGATCATTCCAATCTATAATAAAGCCCGG



AGTTATCTGAGCAGGAAGCCCTTCAAAGCCGACAAGTTCAAAATAAATTTTGACAATAATACGCTA



CTGTCTGGTTGGGACGCTAACAAGGAAACAGCCAATGCTTCCATCCTGTTTAAGAAAGACGGCCTG



TACTACCTGGGAATTATGCCAAAAGGCAAAACTTTTTTGTTCGATTACTTTGTGTCATCAGAGGATA



GCGAGAAGTTAAAGCAAAGACGGCAGAAGACCGCCGAAGAAGCCCTCGCACAAGACGGAGAATCA



TATTTCGAGAAAATTCGATATAAGCTCCTGCCTGGCGCATCAAAGATGTTGCCAAAAGTCTTCTTTT



CCAACAAAAACATCGGCTTTTATAACCCCAGCGATGATATCCTTCGCATCCGGAACACCGCCTCACA



TACCAAAAATGGAACTCCACAGAAGGGCCACTCGAAGGTTGAATTCAACCTTAACGATTGTCACAA



AATGATTGATTTTTTTAAGAGCTCCATTCAGAAACACCCCGAATGGGGGTCCTTTGGCTTCACCTTTT



CTGATACTTCAGACTTCGAGGACATGTCCGCCTTCTACAGGGAGGTGGAGAACCAGGGCTATGTCA



TCTCCTTCGACAAAATAAAAGAGACATACATTCAGAGCCAGGTCGAGCAGGGAAATCTGTACCTGT



TTCAGATCTATAACAAGGATTTCAGTCCCTATAGCAAGGGCAAGCCCAATTTACATACCCTGTACTG



GAAGGCCCTGTTCGAAGAGGCAAACCTTAACAATGTAGTTGCTAAGCTGAATGGGGAAGCAGAGAT



CTTCTTCCGAAGGCACAGCATCAAGGCAAGCGACAAAGTTGTACATCCTGCTAACCAGGCCATCGA



TAACAAGAACCCGCATACAGAAAAGACACAGTCAACCTTTGAATACGACCTCGTGAAGGACAAGA



GGTACACACAAGATAAATTCTTCTTCCACGTGCCCATCAGCTTGAATTTTAAAGCGCAGGGAGTGA



GCAAATTTAACGACAAGGTCAACGGCTTCCTGAAGGGAAACCCCGACGTGAATATCATCGGAATTG



ATCGCGGTGAAAGACATCTCCTCTACTTTACTGTGGTGAACCAGAAGGGTGAGATCCTAGTACAGG



AGAGCCTGAACACCCTTATGAGTGATAAGGGCCATGTGAATGATTACCAGCAGAAGCTGGACAAGA



AGGAACAGGAAAGGGACGCAGCGCGGAAGTCCTGGACCACTGTTGAGAATATCAAAGAACTGAAG



GAGGGATATCTTAGCCATGTGGTACACAAACTTGCACATCTGATTATCAAGTATAATGCCATAGTCT



GCCTGGAAGACTTGAACTTCGGTTTCAAGCGAGGAAGGTTTAAAGTGGAGAAGCAGGTGTACCAGA



AGTTTGAGAAAGCCCTTATTGATAAGCTAAACTACCTTGTCTTTAAGGAAAAAGAACTCGGCGAAG



TTGGCCACTATTTAACCGCCTACCAACTAACCGCCCCTTTCGAGTCTTTTAAGAAACTGGGAAAGCA



GAGCGGAATACTCTTCTATGTGCCTGCAGACTACACCTCTAAGATCGACCCCACTACCGGCTTTGTA



AACTTTCTAGATCTCCGCTATCAGTCAGTAGAAAAAGCCAAACAGCTCTTGTCAGATTTTAACGCCA



TCCGATTTAATTCCGTCCAAAATTACTTCGAGTTCGAAATCGACTATAAAAAACTTACCCCCAAGAG



AAAGGTTGGGACGCAGTCTAAGTGGGTAATCTGCACTTACGGTGACGTGAGATACCAGAACCGCCG



AAACCAGAAAGGTCATTGGGAAACCGAGGAAGTGAATGTGACTGAGAAGCTCAAGGCCCTCTTCG



CTAGCGACAGTAAAACAACAACAGTTATCGATTACGCCAATGACGATAATCTTATAGACGTGATCT



TGGAACAAGACAAAGCCTCTTTTTTTAAGGAATTGTTGTGGTTGCTGAAACTTACAATGACCCTTAG



GCACAGCAAGATCAAATCAGAGGATGACTTCATCCTCAGCCCGGTGAAGAATGAACAGGGAGAGT



TCTACGATTCACGGAAGGCTGGAGAGGTGTGGCCCAAGGATGCCGACGCGAACGGGGCCTACCAC



ATAGCTCTAAAAGGTCTGTGGAACCTGCAACAAATCAATCAATGGGAGAAAGGTAAGACACTGAA



CCTGGCCATCAAAAATCAAGATTGGTTCTCATTCATCCAGGAAAAGCCTTATCAAGAGTGA





SEQ ID NO: 151
ATGCATACGGGAGGCCTTTTATCAATGGACGCAAAAGAGTTCACCGGGCAGTATCCATTATCTAAG



ACACTCCGCTTCGAGCTGAGGCCCATTGGCAGGACCTGGGACAACCTGGAGGCGTCGGGCTACCTG



GCTGAGGACAGACATCGCGCAGAATGCTATCCGAGAGCTAAGGAGCTTTTGGACGACAATCATCGC



GCGTTCCTTAACCGGGTGCTCCCACAGATCGATATGGACTGGCACCCGATCGCTGAGGCTTTTTGCA



AGGTCCATAAGAACCCTGGGAACAAAGAGCTCGCCCAGGACTACAACTTGCAGCTGAGCAAGCGA



CGGAAAGAGATTTCTGCCTACCTTCAAGACGCCGATGGCTACAAAGGGCTCTTCGCAAAGCCCGCA



TTGGATGAGGCCATGAAAATCGCCAAGGAGAACGGGAATGAAAGTGACATCGAAGTTCTCGAAGC



GTTTAACGGATTTAGCGTGTACTTTACCGGCTATCATGAGTCAAGGGAGAATATTTATAGCGATGAG



GACATGGTCTCTGTGGCCTACCGGATTACCGAGGATAATTTCCCGAGGTTTGTTTCAAATGCACTAA



TATTCGACAAGTTAAATGAGAGCCACCCAGACATCATCTCGGAGGTCAGCGGCAACCTCGGAGTTG



ACGATATTGGCAAATACTTCGACGTGAGCAACTATAACAACTTCCTCTCACAGGCTGGCATCGACG



ACTATAATCATATTATAGGCGGCCACACTACTGAGGATGGTCTCATTCAGGCATTCAATGTAGTCTT



GAATCTTAGGCACCAGAAGGACCCTGGGTTTGAAAAGATACAGTTCAAGCAGCTGTATAAGCAGAT



ATTATCCGTGCGAACATCTAAAAGTTACATCCCCAAACAGTTTGATAACTCAAAGGAGATGGTGGA



TTGCATATGCGATTATGTGTCAAAAATTGAAAAGAGCGAGACTGTGGAGCGGGCTCTGAAGCTCGT



CAGGAACATTAGCTCCTTTGACCTTAGAGGAATTTTCGTCAATAAAAAGAATCTGAGGATCCTGAG



CAATAAGCTAATAGGAGATTGGGACGCCATAGAGACAGCATTGATGCATTCCAGCTCAAGCGAGAA



TGATAAGAAGTCTGTCTACGATAGCGCTGAAGCCTTCACGCTGGACGATATCTTCTCTTCCGTGAAA



AAATTTAGTGATGCGTCCGCAGAAGATATCGGGAATCGAGCCGAAGATATCTGCAGGGTAATTTCA



GAGACCGCCCCTTTCATCAATGACCTGCGCGCCGTGGACCTGGATAGCCTGAATGACGATGGTTAC



GAAGCTGCAGTTTCTAAGATCAGGGAGTCTCTGGAGCCATATATGGACTTGTTTCACGAACTTGAGA



TCTTTAGCGTGGGCGACGAGTTCCCGAAATGCGCAGCTTTCTATAGCGAGTTAGAGGAGGTCAGCG



AGCAATTAATCGAGATCATACCCCTGTTTAATAAGGCACGGAGCTTTTGTACTCGCAAGCGCTACAG



CACCGACAAGATTAAAGTTAATCTGAAATTTCCAACTCTCGCAGACGGGTGGGACCTAAACAAGGA



ACGCGATAATAAAGCCGCCATCCTTAGAAAGGACGGAAAGTACTATCTTGCCATCCTAGATATGAA



AAAAGATCTGAGTTCCATTCGTACTAGCGATGAAGACGAATCTTCTTTCGAAAAAATGGAGTATAA



GCTGCTCCCCTCGCCAGTCAAGATGCTACCCAAGATCTTTGTGAAGAGCAAAGCAGCCAAGGAAAA



GTACGGGCTGACGGACAGGATGCTGGAGTGCTACGATAAGGGAATGCATAAATCAGGGTCAGCTTT



TGACTTGGGCTTTTGCCATGAGCTAATCGATTACTACAAGCGCTGTATCGCCGAGTATCCAGGATGG



GACGTTTTCGACTTTAAATTTCGGGAGACTTCTGATTATGGTTCAATGAAGGAGTTCAACGAAGATG



TCGCTGGTGCCGGTTACTACATGAGCCTTCGCAAGATTCCTTGTTCCGAAGTCTACCGGCTACTGGA



CGAGAAATCTATATATTTGTTCCAGATATATAACAAGGACTACAGTGAGAATGCACATGGGAATAA



GAATATGCATACTATGTATTGGGAAGGTCTCTTTTCACCCCAAAATTTGGAGTCACCCGTGTTCAAA



CTTAGCGGTGGCGCAGAGCTGTTCTTTAGGAAATCCAGTATACCCAATGACGCCAAGACAGTCCAC



CCAAAGGGTAGCGTCCTGGTGCCCAGAAACGATGTGAACGGCAGGAGAATCCCTGACAGCATTTAC



CGAGAACTTACCAGGTACTTCAACCGCGGCGACTGTAGAATCTCTGATGAGGCAAAGTCTTATCTG



GATAAGGTGAAGACTAAGAAGGCAGATCATGACATTGTGAAAGACCGCCGCTTTACTGTCGACAAA



ATGATGTTTCACGTGCCTATCGCAATGAATTTTAAGGCAATCTCAAAACCGAATCTGAACAAGAAG



GTGATAGATGGCATTATCGATGACCAGGACCTCAAGATCATCGGAATCGACAGAGGTGAGCGAAAC



CTGATATACGTCACAATGGTAGATCGGAAGGGTAATATTCTGTACCAGGATTCACTAAACATCCTCA



ATGGATATGACTATCGAAAAGCTCTCGATGTCAGGGAATACGACAACAAGGAGGCGCGACGGAAT



TGGACAAAGGTGGAAGGCATACGGAAGATGAAGGAAGGCTATCTGTCACTAGCTGTCTCCAAATTG



GCTGATATGATTATAGAGAACAACGCCATTATCGTGATGGAAGATCTCAACCATGGATTCAAGGCA



GGAAGAAGTAAAATTGAGAAGCAGGTGTATCAGAAGTTCGAAAGCATGCTTATTAATAAGTTGGGT



TATATGGTCTTAAAGGACAAGTCTATCGATCAGAGCGGCGGCGCACTCCATGGGTATCAGCTGGCT



AACCATGTCACCACACTAGCATCCGTAGGCAAACAGTGTGGCGTGATTTTCTACATTCCTGCTGCGT



TCACTTCTAAGATCGATCCTACCACGGGATTCGCAGACCTGTTCGCACTGAGCAATGTTAAAAACGT



GGCCTCCATGAGGGAGTTCTTTAGCAAAATGAAAAGCGTGATTTATGACAAGGCCGAGGGCAAGTT



CGCTTTCACATTTGACTACCTGGACTACAATGTGAAATCAGAGTGCGGGAGAACCCTGTGGACCGT



ATACACGGTAGGGGAAAGATTCACTTACAGTCGAGTTAATCGGGAGTATGTCCGTAAAGTGCCAAC



TGACATCATCTACGATGCCCTTCAGAAGGCTGGCATAAGTGTTGAGGGGGATCTAAGGGACAGGAT



CGCTGAATCGGATGGCGATACTCTCAAATCAATCTTCTACGCCTTCAAGTATGCCCTCGACATGAGG



GTAGAGAACCGGGAGGAGGACTATATACAGTCTCCCGTGAAGAATGCGTCGGGAGAGTTCTTCTGC



TCAAAAAACGCCGGGAAATCTTTGCCGCAGGATTCTGATGCAAATGGGGCTTATAACATTGCTCTC



AAAGGCATCCTGCAGCTGCGCATGCTATCTGAACAATATGACCCAAACGCTGAAAGCATTAGATTG



CCATTGATCACCAATAAGGCTTGGCTGACTTTCATGCAGAGCGGTATGAAGACATGGAAAAACTAA





SEQ ID NO: 152
ATGGATTCCCTTAAGGACTTCACAAATCTTTACCCCGTGAGTAAAACCCTGAGATTTGAACTCAAGC



CCGTGGGAAAGACTCTCGAGAATATCGAGAAGGCCGGGATTTTGAAGGAAGACGAGCATCGGGCG



GAAAGTTACAGACGGGTGAAGAAGATTATAGATACTTATCACAAGGTCTTTATAGACAGCTCTTTA



GAGAACATGGCAAAGATGGGCATCGAGAACGAAATCAAGGCCATGCTGCAGTCCTTCTGCGAGCTG



TATAAAAAGGATCATCGGACCGAAGGCGAAGACAAGGCGCTGGATAAGATCAGGGCAGTGCTGCG



CGGCCTCATTGTGGGTGCCTTCACTGGGGTGTGCGGGCGGAGAGAGAACACTGTGCAGAATGAGAA



ATACGAGAGTTTGTTCAAAGAGAAACTCATCAAGGAAATCCTGCCCGACTTCGTCTTAAGCACAGA



AGCCGAATCTCTCCCATTTTCTGTCGAGGAGGCCACGCGTTCCCTTAAAGAGTTCGACAGTTTCACT



TCATACTTTGCCGGATTTTATGAAAACCGTAAAAATATATACTCCACTAAACCACAGTCAACTGCAA



TAGCTTACAGGTTAATCCACGAAAACCTGCCAAAATTCATCGACAATATACTCGTCTTTCAAAAAAT



CAAGGAACCAATCGCGAAGGAACTTGAACACATCCGGGCTGACTTTAGTGCGGGAGGATACATCAA



AAAAGACGAGCGCCTGGAGGATATATTTTCACTAAATTATTATATTCATGTACTGAGCCAGGCTGGC



ATAGAAAAGTACAACGCTCTAATTGGGAAAATCGTGACAGAAGGTGACGGGGAAATGAAAGGGCT



AAACGAACATATTAACTTATATAACCAACAGCGGGGTCGAGAAGATCGTCTGCCCCTGTTCAGACC



TCTGTATAAGCAAATACTCTCCGACAGAGAGCAGCTATCATATCTGCCCGAGTCCTTTGAGAAAGAT



GAAGAGCTGCTCCGGGCGCTCAAGGAGTTCTATGATCATATAGCCGAGGACATTTTGGGCAGAACT



CAGCAACTCATGACGTCTATTTCTGAATATGATCTGTCTCGTATCTATGTCAGGAATGATAGCCAGC



TGACCGATATATCCAAGAAGATGCTGGGGGACTGGAACGCCATTTATATGGCGAGGGAGCGAGCAT



ACGATCACGAGCAGGCACCCAAGAGAATCACAGCCAAATATGAGAGAGACCGCATTAAGGCGCTG



AAGGGCGAAGAAAGTATCAGTCTGGCCAATCTGAACTCCTGCATAGCTTTCCTTGATAACGTGAGG



GATTGCAGAGTTGATACTTACCTGAGTACCCTGGGCCAGAAGGAAGGGCCTCACGGCCTCTCTAAT



CTAGTGGAGAATGTATTTGCCTCCTACCACGAAGCTGAGCAGCTGCTGTCATTTCCGTACCCAGAGG



AAAATAATTTAATACAGGATAAGGACAACGTAGTGCTTATCAAAAATCTACTGGATAACATTTCCG



ACCTCCAGCGCTTTCTCAAACCACTTTGGGGGATGGGCGACGAGCCTGATAAGGATGAGCGCTTTT



ACGGCGAGTACAACTACATCAGGGGCGCCTTGGACCAGGTGATTCCCCTCTATAATAAAGTCAGGA



ATTACCTGACCCGAAAGCCATACAGTACAAGAAAGGTGAAATTAAATTTCGGCAATAGTCAGCTGC



TGTCTGGTTGGGACCGAAATAAGGAGAAAGACAACAGCTGCGTAATTCTCAGAAAAGGACAGAAC



TTTTATTTGGCCATCATGAATAACAGACACAAGAGATCTTTCGAGAACAAAGTGCTCCCTGAGTATA



AGGAGGGGGAACCCTACTTCGAGAAGATGGACTATAAATTCCTTCCTGATCCAAATAAAATGCTGC



CTAAAGTATTTCTGTCAAAAAAAGGTATAGAAATCTACAAACCTTCACCTAAGCTACTTGAACAGT



ATGGCCACGGCACCCATAAAAAAGGGGACACGTTCAGCATGGACGACCTACACGAACTGATTGACT



TCTTTAAGCACAGCATAGAAGCTCATGAGGACTGGAAACAGTTCGGATTCAAATTCTCAGATACCG



CGACCTACGAAAACGTGTCTAGTTTTTACCGGGAAGTCGAGGACCAGGGCTACAAGCTCAGCTTCA



GAAAAGTTAGCGAATCTTACGTCTACTCCCTTATAGATCAAGGTAAGCTGTATCTCTTTCAAATCTA



CAACAAGGACTTTTCCCCATGTAGCAAGGGCACCCCCAATCTGCACACTCTCTACTGGCGGATGCTG



TTCGACGAGCGTAACCTGGCAGACGTGATCTACAAATTAGATGGTAAAGCTGAGATCTTCTTTCGTG



AAAAGAGCCTAAAGAACGATCACCCCACTCACCCCGCCGGAAAGCCCATTAAGAAGAAAAGTAGG



CAGAAGAAAGGAGAAGAATCGCTATTTGAGTACGACCTCGTCAAGGATCGGCATTATACAATGGAT



AAGTTCCAGTTCCATGTGCCAATAACTATGAATTTCAAGTGCAGTGCTGGCAGTAAGGTGAATGAC



ATGGTAAACGCTCATATCCGGGAGGCAAAGGACATGCATGTTATTGGAATTGATAGGGGTGAGCGT



AATCTCCTCTACATCTGTGTTATTGACTCCCGCGGCACAATCCTCGATCAGATTTCCTTGAATACAAT



TAATGATATAGACTACCATGACTTGCTTGAGTCTCGCGACAAAGATAGACAGCAGGAGAGAAGAAA



TTGGCAGACCATCGAAGGCATCAAGGAACTCAAGCAAGGCTACCTTTCTCAGGCAGTGCATCGAAT



AGCCGAGCTGATGGTGGCTTATAAAGCCGTCGTGGCACTAGAAGACCTAAATATGGGATTTAAACG



AGGCAGGCAGAAGGTGGAATCATCCGTATACCAGCAGTTCGAAAAACAGTTGATAGACAAACTCA



ATTACCTTGTAGACAAGAAGAAGCGGCCTGAGGACATAGGGGGCCTGCTTAGAGCGTATCAATTTA



CAGCCCCATTCAAGTCTTTCAAAGAAATGGGTAAACAGAACGGTTTTCTGTTTTACATCCCAGCGTG



GAACACCAGCAATATAGATCCAACCACTGGCTTCGTCAATCTGTTTCATGCTCAGTATGAAAATGTG



GACAAGGCCAAATCCTTCTTTCAGAAATTTGACAGCATCTCCTATAACCCAAAGAAAGACTGGTTTG



AATTCGCCTTTGACTATAAGAATTTCACTAAGAAGGCCGAGGGATCAAGAAGCATGTGGATATTGT



GCACGCATGGCTCACGTATAAAGAACTTTAGAAACTCGCAAAAAAACGGGCAGTGGGACTCAGAA



GAATTCGCACTCACCGAGGCTTTCAAATCCCTCTTCGTCCGGTATGAGATCGATTACACCGCCGATC



TGAAGACGGCAATCGTCGACGAGAAACAGAAAGACTTCTTTGTAGATCTACTTAAGCTCTTTAAGC



TAACCGTTCAGATGCGAAACAGTTGGAAAGAAAAGGATCTCGACTATCTCATTAGTCCAGTGGCTG



GCGCGGATGGTAGATTTTTCGATACCCGGGAAGGTAACAAGTCCCTTCCCAAAGACGCCGACGCGA



ATGGTGCCTACAATATTGCACTAAAGGGGCTCTGGGCGCTGCGGCAAATTAGACAGACATCTGAAG



GGGGCAAGCTTAAGCTGGCTATTTCTAATAAAGAGTGGTTGCAGTTTGTGCAGGAAAGGAGTTATG



AGAAGGACTAG





SEQ ID NO: 153
ATGAACAACGGCACCAACAACTTCCAGAACTTCATCGGCATATCGTCTCTGCAGAAAACACTTAGG



AATGCCCTGATTCCAACTGAGACAACACAGCAGTTTATTGTGAAGAATGGGATCATCAAAGAGGAC



GAATTGCGCGGGGAGAATAGGCAGATCCTGAAGGACATCATGGACGATTACTACAGGGGTTTTATC



TCCGAAACGCTGAGCTCGATTGACGATATTGACTGGACGTCCCTCTTTGAGAAGATGGAAATCCAA



CTTAAAAATGGCGATAATAAAGATACCCTGATAAAGGAACAAACCGAATATAGAAAGGCTATACA



CAAAAAATTCGCAAATGACGACCGCTTTAAGAACATGTTTTCTGCAAAACTGATTAGCGATATTCTG



CCCGAGTTTGTGATTCACAATAATAACTATTCCGCTTCGGAGAAGGAGGAAAAGACTCAGGTGATT



AAACTGTTTTCTCGGTTCGCCACTTCTTTCAAAGATTATTTCAAAAATCGCGCCAACTGTTTTTCCGC



TGACGACATCTCCTCCTCTTCCTGCCACCGGATCGTAAACGACAATGCCGAGATCTTTTTTAGTAAC



GCCCTTGTGTATCGGAGGATAGTGAAGAGCCTGTCCAATGATGACATAAACAAAATTTCTGGCGAT



ATGAAGGATAGCCTCAAAGAGATGAGCCTTGAAGAAATTTACTCCTACGAGAAGTATGGGGAGTTC



ATCACCCAGGAGGGGATTTCCTTCTATAATGACATCTGTGGCAAGGTGAACAGCTTCATGAACCTGT



ACTGCCAGAAGAATAAGGAAAACAAAAATCTGTACAAGCTTCAGAAGTTACATAAGCAGATCCTGT



GTATCGCGGATACCTCATATGAGGTTCCTTATAAGTTCGAGAGTGATGAAGAAGTGTACCAGTCTGT



AAATGGATTCTTAGACAATATTTCGTCCAAACATATAGTGGAGAGACTGAGAAAGATCGGGGACAA



TTACAATGGGTACAATCTCGACAAGATTTATATCGTGTCGAAGTTTTACGAATCTGTGAGCCAGAAA



ACATACAGGGATTGGGAAACCATTAATACCGCGCTTGAAATTCACTACAATAATATTCTGCCTGGC



AACGGAAAAAGCAAGGCCGATAAGGTAAAAAAGGCAGTCAAAAATGACCTTCAGAAAAGTATCAC



CGAAATCAATGAGTTGGTGAGCAACTACAAATTGTGTTCAGACGATAATATTAAAGCGGAAACGTA



CATACATGAAATTAGCCATATTCTGAATAACTTTGAGGCGCAGGAACTTAAGTACAACCCTGAAAT



TCATCTCGTCGAAAGCGAATTGAAGGCCTCTGAATTGAAAAACGTTCTTGACGTGATAATGAACGC



TTTCCATTGGTGCTCTGTGTTTATGACTGAAGAGCTGGTTGATAAGGACAACAACTTTTATGCTGAA



CTTGAGGAAATCTACGACGAGATCTACCCTGTGATTAGCTTGTATAACCTCGTCAGAAACTACGTTA



CCCAGAAGCCGTACAGCACGAAAAAAATAAAGCTGAACTTTGGTATTCCGACTCTCGCCGATGGAT



GGAGCAAGTCGAAGGAATATTCCAACAATGCCATCATTCTTATGCGAGACAATCTGTATTACCTCG



GCATCTTTAACGCCAAAAACAAGCCGGATAAGAAAATCATTGAAGGGAATACGAGCGAGAATAAG



GGCGACTATAAGAAAATGATCTACAACTTACTGCCAGGTCCCAATAAAATGATTCCTAAGGTGTTTC



TGTCATCGAAAACAGGTGTAGAAACATATAAGCCCAGCGCATACATCCTGGAAGGCTACAAGCAAA



ACAAACACATCAAAAGCAGCAAGGACTTTGATATCACATTCTGCCACGATCTAATCGACTACTTCA



AAAATTGCATCGCCATTCACCCTGAGTGGAAGAACTTCGGCTTTGACTTCTCCGACACCAGTACCTA



CGAAGACATTTCTGGATTCTACCGTGAGGTTGAGCTGCAGGGTTATAAAATTGACTGGACATACATC



AGTGAAAAAGACATCGATCTACTGCAGGAGAAGGGGCAGCTCTATCTCTTCCAGATTTATAATAAG



GATTTCAGCAAGAAGTCCACTGGAAACGACAATCTGCATACAATGTATCTTAAGAACTTGTTTAGC



GAAGAGAATTTGAAAGATATCGTTCTAAAGTTAAACGGGGAAGCCGAGATTTTCTTTCGAAAGTCT



TCCATTAAGAATCCAATTATTCACAAGAAGGGCAGTATCCTGGTCAACAGAACCTATGAGGCCGAG



GAAAAGGACCAGTTCGGTAATATACAAATTGTGCGCAAGAACATCCCCGAGAACATTTACCAGGAG



CTCTATAAATACTTCAACGACAAAAGCGATAAGGAGCTTTCCGACGAGGCTGCCAAGCTGAAAAAC



GTGGTGGGACACCATGAAGCAGCCACCAACATCGTCAAAGATTATCGTTATACATATGACAAATAT



TTTCTGCACATGCCTATTACAATAAACTTTAAGGCAAACAAGACCGGGTTCATCAATGACCGGATAC



TCCAGTACATCGCAAAAGAGAAGGACCTGCATGTGATCGGCATCGACCGCGGTGAAAGAAATCTCA



TTTACGTCAGCGTTATCGACACTTGTGGAAACATTGTGGAGCAGAAGTCCTTCAACATTGTTAACGG



CTATGACTATCAGATCAAGCTCAAACAGCAGGAAGGTGCTCGTCAGATTGCGAGGAAAGAATGGA



AAGAGATCGGCAAGATCAAGGAGATCAAAGAAGGGTATCTGAGCTTGGTCATTCACGAGATCTCCA



AAATGGTCATCAAGTACAACGCTATTATCGCGATGGAAGACCTCTCTTACGGCTTTAAGAAGGGGC



GCTTTAAAGTGGAGCGCCAGGTCTATCAGAAGTTCGAGACTATGCTTATCAATAAGCTGAATTACTT



GGTCTTTAAGGATATCAGTATCACCGAGAACGGAGGACTGCTGAAAGGTTACCAGCTCACATATAT



TCCCGATAAGCTCAAGAATGTGGGCCACCAATGCGGTTGTATTTTTTACGTTCCAGCTGCCTACACA



TCTAAGATCGATCCTACCACCGGATTCGTCAATATATTTAAATTTAAAGATCTAACCGTTGATGCCA



AGCGTGAGTTTATTAAGAAATTTGATTCAATCAGGTACGACAGCGAAAAGAACCTCTTCTGTTTCAC



TTTCGACTACAACAACTTCATCACACAAAATACTGTGATGAGCAAGTCATCATGGAGCGTTTATACT



TATGGTGTAAGGATAAAAAGGCGCTTTGTTAATGGAAGGTTTTCCAATGAAAGCGATACAATAGAC



ATCACAAAAGACATGGAGAAGACACTGGAGATGACAGATATTAATTGGAGGGACGGGCATGACCT



TAGACAGGACATCATCGACTACGAAATCGTCCAACACATTTTTGAGATATTCAGACTCACTGTCCAG



ATGCGAAACAGCCTGTCGGAACTCGAAGACCGGGACTACGATAGACTGATCTCCCCGGTGTTAAAC



GAAAATAATATTTTCTACGATTCTGCTAAGGCAGGAGACGCTCTTCCTAAAGATGCGGACGCCAAT



GGCGCTTACTGTATAGCGTTGAAGGGATTGTATGAGATTAAACAGATCACTGAGAATTGGAAAGAA



GACGGTAAATTCTCCAGAGACAAGCTGAAAATCTCCAACAAAGACTGGTTTGATTTTATTCAAAAT



AAGCGCTACCTGTAA





SEQ ID NO: 154
ATGACAAACAAATTTACTAATCAGTACAGCCTGTCAAAGACCCTCCGCTTCGAACTGATTCCACAA



GGGAAGACCCTTGAATTCATCCAGGAAAAGGGTTTATTATCCCAGGATAAACAACGCGCAGAAAGC



TATCAAGAGATGAAGAAGACGATCGATAAATTTCATAAGTATTTCATAGATTTAGCCCTGAGCAAC



GCTAAATTGACCCACCTGGAAACCTATTTGGAGCTGTACAACAAGTCAGCCGAGACAAAGAAAGAG



CAGAAGTTTAAGGACGACCTGAAAAAAGTACAGGACAATTTGCGAAAAGAGATCGTCAAGTCTTTT



TCCGACGGAGACGCCAAGTCAATATTTGCCATCCTGGACAAAAAGGAACTCATCACTGTGGAGTTG



GAGAAGTGGTTTGAGAATAATGAGCAGAAGGACATCTATTTTGACGAAAAGTTCAAGACATTTACT



ACTTACTTCACCGGATTTCACCAAAACCGGAAGAACATGTACTCTGTTGAGCCGAACTCAACCGCC



ATCGCCTACCGCCTTATTCACGAAAATCTGCCAAAGTTTCTCGAGAATGCTAAAGCCTTTGAGAAAA



TTAAGCAGGTCGAGTCGCTCCAGGTGAACTTTCGAGAGCTGATGGGTGAATTCGGGGACGAGGGCC



TGATTTTCGTGAATGAACTCGAAGAGATGTTTCAGATCAACTACTATAATGATGTACTCTCACAGAA



CGGGATCACTATCTACAACAGCATTATCTCTGGATTCACTAAGAACGATATCAAGTATAAAGGGCT



GAATGAATACATCAACAATTATAATCAGACTAAGGACAAAAAGGACAGGCTGCCTAAATTGAAAC



AGCTGTATAAGCAGATCCTCAGTGATAGAATTAGCTTGTCATTTCTCCCAGATGCCTTCACTGACGG



AAAGCAGGTGCTTAAGGCGATATTCGATTTCTATAAGATCAACCTCCTCTCTTATACAATCGAGGGC



CAGGAGGAGTCACAGAACCTCCTGCTCCTGATTCGACAAACTATTGAAAATCTGTCCTCTTTCGATA



CGCAGAAGATATACCTGAAAAATGACACCCATCTCACTACAATATCCCAACAGGTATTCGGAGATT



TCTCCGTCTTCAGTACAGCCCTGAATTACTGGTACGAGACAAAGGTGAACCCTAAGTTCGAAACAG



AGTACAGCAAGGCGAACGAAAAGAAGAGGGAGATCCTGGACAAAGCCAAAGCCGTTTTCACCAAG



CAAGATTACTTTAGCATCGCATTTCTGCAGGAAGTCCTGTCTGAGTACATACTGACACTCGATCACA



CAAGCGACATAGTTAAGAAGCACTCTTCCAATTGTATCGCGGACTACTTCAAAAATCATTTTGTCGC



GAAAAAGGAGAACGAGACAGATAAGACCTTCGATTTTATCGCGAATATTACCGCAAAGTATCAATG



CATTCAGGGTATCTTGGAGAACGCCGACCAGTACGAAGACGAGCTTAAACAGGATCAGAAGCTCAT



CGACAACCTAAAGTTCTTTTTGGACGCTATACTGGAACTCCTTCATTTTATTAAGCCACTACATCTGA



AGAGTGAGTCTATCACTGAGAAGGACACTGCTTTTTACGACGTTTTCGAGAATTACTACGAAGCACT



GTCTCTGCTAACCCCTCTGTATAACATGGTGAGAAACTATGTGACACAGAAACCTTATAGTACCGAG



AAGATTAAGTTGAACTTCGAGAACGCACAATTGCTGAATGGGTGGGATGCAAACAAAGAGGGTGA



TTACCTCACAACAATCCTCAAGAAAGATGGCAATTACTTCCTGGCCATTATGGATAAAAAACATAA



CAAGGCATTTCAGAAATTTCCCGAGGGGAAGGAAAATTATGAAAAGATGGTATACAAGTTGCTGCC



CGGGGTGAACAAAATGCTCCCGAAGGTGTTTTTCTCGAATAAGAATATCGCGTACTTTAACCCGTCC



AAGGAACTGTTGGAAAATTATAAAAAGGAAACACACAAGAAGGGGGACACTTTTAATTTGGAGCA



CTGCCACACACTCATTGACTTCTTTAAAGATAGTCTCAACAAACATGAGGATTGGAAATATTTTGAC



TTTCAGTTTAGCGAGACCAAGTCTTATCAGGATCTGTCGGGATTTTATAGGGAAGTTGAGCACCAGG



GTTACAAGATAAATTTCAAGAACATCGATAGCGAGTACATTGACGGACTGGTGAACGAAGGGAAG



CTGTTCCTGTTTCAGATTTACAGCAAAGATTTCTCTCCTTTCTCAAAAGGCAAGCCGAACATGCATA



CCCTGTATTGGAAGGCCCTGTTCGAGGAGCAAAACCTTCAGAATGTGATTTACAAGCTGAACGGTC



AGGCCGAGATTTTTTTTAGGAAGGCCTCTATCAAGCCCAAAAACATCATTCTGCACAAGAAAAAGA



TAAAGATCGCCAAAAAACACTTCATTGATAAAAAGACAAAGACTTCTGAGATCGTACCTGTTCAGA



CAATCAAGAATCTCAACATGTATTATCAGGGGAAGATTAGCGAGAAAGAGCTGACACAGGACGATT



TGAGGTACATCGACAACTTCTCTATCTTTAACGAGAAGAACAAGACAATCGATATCATCAAGGACA



AGCGGTTTACCGTCGATAAATTCCAGTTCCATGTGCCTATCACGATGAATTTCAAGGCCACCGGTGG



GAGTTATATCAACCAGACTGTGCTGGAGTATCTGCAGAACAACCCCGAAGTAAAAATTATTGGCCT



GGACAGAGGAGAGCGGCATCTGGTGTACTTGACCCTCATCGATCAGCAGGGAAATATCCTGAAACA



AGAATCTCTGAATACTATTACGGACTCCAAAATCAGCACACCTTACCACAAGCTGCTTGATAATAA



AGAGAATGAGAGGGACTTGGCCCGCAAAAATTGGGGCACCGTCGAGAATATTAAGGAATTGAAAG



AAGGATACATCTCACAGGTGGTTCACAAAATCGCAACCCTGATGTTAGAAGAGAACGCTATTGTGG



TGATGGAGGACTTAAACTTCGGATTTAAAAGAGGAAGATTTAAAGTCGAGAAACAGATTTATCAGA



AACTGGAAAAAATGCTCATTGACAAATTAAATTACCTGGTGCTGAAAGATAAACAGCCACAGGAGC



TGGGTGGCCTGTATAATGCTCTGCAGCTGACCAACAAGTTCGAGTCGTTTCAGAAAATGGGCAAGC



AGTCAGGCTTCCTTTTTTACGTGCCCGCTTGGAACACCTCAAAAATCGACCCTACAACAGGCTTTGT



GAATTATTTCTATACCAAGTATGAAAACGTGGACAAGGCAAAGGCCTTTTTCGAGAAGTTTGAAGC



AATCAGGTTCAATGCCGAGAAAAAATACTTTGAGTTCGAGGTCAAAAAATATAGCGACTTCAACCC



TAAGGCCGAAGGCACGCAACAAGCCTGGACAATATGCACGTATGGGGAGAGAATTGAGACTAAGC



GGCAGAAGGATCAGAATAACAAATTCGTGAGCACACCGATTAACCTGACAGAGAAGATAGAGGAC



TTCCTCGGCAAGAATCAGATCGTGTACGGCGACGGCAATTGCATCAAGTCACAAATTGCATCTAAA



GATGACAAAGCATTCTTCGAAACACTGCTGTATTGGTTCAAGATGACACTCCAGATGCGAAATAGC



GAAACAAGAACAGATATTGACTACCTCATCAGCCCTGTGATGAATGATAACGGCACGTTTTACAAT



TCCCGGGACTATGAAAAATTAGAGAACCCGACACTGCCAAAAGACGCCGACGCAAATGGTGCATAT



CACATCGCAAAGAAAGGTTTGATGCTGTTGAACAAAATTGATCAGGCTGATCTGACAAAAAAGGTC



GATCTGAGTATCAGTAACCGCGACTGGTTGCAGTTTGTCCAGAAGAACAAATAA





SEQ ID NO: 155
ATGGAACAAGAGTACTATCTGGGCCTGGACATGGGCACCGGGAGTGTCGGATGGGCAGTCACCGAC



TCAGAGTACCACGTCCTCAGAAAGCACGGTAAGGCACTTTGGGGAGTGCGACTCTTCGAGTCCGCT



AGTACTGCTGAAGAGAGGAGGATGTTTCGAACTTCCAGGCGCAGGCTGGATCGGCGAAACTGGAG



AATAGAGATTCTCCAGGAGATATTTGCTGAAGAGATTTCAAAGAAGGATCCTGGTTTTTTCCTGCGC



ATGAAAGAATCTAAGTATTACCCCGAAGATAAACGCGACATCAACGGCAATTGTCCTGAACTGCCC



TATGCTCTGTTTGTCGACGACGATTTCACCGACAAAGATTACCACAAGAAATTCCCCACCATATACC



ACCTGAGAAAGATGTTGATGAACACCGAGGAGACACCCGACATACGTCTGGTTTACCTGGCTATCC



ATCATATGATGAAGCACCGCGGGCATTTCCTGCTGTCTGGAGACATCAATGAGATAAAGGAATTTG



GTACTACGTTCTCCAAGTTGTTAGAAAACATTAAGAATGAAGAGTTGGACTGGAATCTTGAACTGG



GAAAGGAAGAGTATGCAGTTGTAGAGTCGATTTTGAAAGATAACATGTTAAACCGGTCAACTAAGA



AAACCAGGTTAATTAAGGCACTAAAGGCCAAATCGATATGCGAGAAGGCTGTGCTAAATCTGCTGG



CTGGAGGCACCGTGAAACTGTCTGATATTTTCGGCCTGGAAGAGCTCAATGAAACCGAGCGGCCTA



AAATTTCTTTCGCCGATAACGGATACGATGACTATATTGGGGAGGTGGAAAACGAGCTCGGAGAAC



AATTCTACATTATTGAAACCGCTAAGGCAGTCTATGACTGGGCCGTGCTCGTCGAGATTTTAGGCAA



GTACACCAGCATTAGCGAAGCAAAGGTGGCTACCTATGAAAAGCACAAATCTGACCTCCAGTTTCT



GAAAAAGATTGTGCGCAAATACTTAACAAAAGAAGAGTACAAGGACATCTTTGTGAGCACATCAG



ATAAGCTCAAGAATTACTCAGCATACATTGGAATGACAAAGATTAACGGGAAGAAGGTGGATCTCC



AAAGCAAACGTTGTTCAAAGGAGGAGTTTTACGATTTCATAAAGAAGAACGTGCTGAAGAAACTGG



AGGGACAACCGGAGTACGAGTATTTAAAGGAGGAGCTCGAGCGAGAAACTTTCCTGCCCAAGCAA



GTGAACAGAGACAATGGTGTCATTCCTTACCAGATTCACTTATATGAGCTGAAGAAAATCCTGGGG



AACTTGAGAGACAAGATAGACCTCATCAAGGAAAATGAAGATAAGTTGGTCCAGTTGTTCGAATTC



AGAATCCCATATTACGTCGGCCCGCTCAATAAGATCGACGACGGCAAGGAAGGCAAATTCACTTGG



GCGGTGCGAAAAAGCAACGAAAAAATATACCCATGGAACTTTGAGAACGTCGTTGACATCGAGGC



CAGCGCCGAGAAATTTATAAGACGCATGACTAATAAGTGTACTTACCTCATGGGCGAGGATGTTCT



GCCCAAGGACAGCCTGCTGTATTCCAAGTACATGGTGCTTAACGAGCTGAATAATGTAAAGTTAGA



TGGTGAGAAGCTCAGCGTGGAGCTTAAACAGAGGCTGTACACTGATGTGTTTTGCAAGTATCGGAA



AGTTACCGTTAAGAAGATAAAGAATTACCTGAAATGCGAAGGGATCATTTCCGGCAACGTGGAAAT



TACCGGAATCGACGGCGATTTTAAGGCGTCGTTGACCGCTTATCATGATTTCAAGGAGATTTTAACC



GGCACGGAGCTCGCGAAGAAAGACAAGGAGAACATAATCACGAATATAGTTCTGTTTGGGGACGA



TAAAAAACTTCTTAAAAAACGACTCAATCGACTGTATCCGCAGATTACCCCCAACCAGCTGAAGAA



GATTTGCGCTCTGAGCTATACCGGGTGGGGCCGGTTCTCTAAGAAATTCCTCGAGGAGATCACAGC



ACCAGACCCAGAGACTGGTGAGGTGTGGAATATTATTACAGCTCTGTGGGAATCCAATAATAACCT



TATGCAATTGTTGAGCAATGAATATAGGTTCATGGAGGAAGTGGAAACCTACAATATGGGCAAGCA



GACAAAGACCCTATCTTACGAGACCGTTGAGAATATGTATGTCTCCCCTTCAGTGAAACGGCAAAT



CTGGCAAACTTTGAAGATCGTGAAGGAGCTCGAAAAGGTGATGAAAGAGAGCCCGAAGAGGGTTT



TTATTGAAATGGCCAGAGAGAAACAGGAGAGCAAGAGAACAGAGTCTAGGAAGAAGCAGCTAATC



GATTTGTATAAAGCCTGCAAGAACGAGGAAAAAGACTGGGTCAAGGAGCTAGGCGATCAGGAAGA



ACAGAAGTTGCGCTCTGATAAGCTGTACTTATATTATACCCAGAAAGGACGGTGCATGTACTCAGG



TGAGGTCATTGAGCTGAAAGATCTGTGGGACAATACTAAGTATGATATTGATCACATCTACCCTCAG



TCAAAAACTATGGACGACTCCCTCAACAACAGGGTGTTGGTTAAGAAGAAATACAATGCTACAAAG



TCCGATAAATACCCTCTTAACGAAAACATCCGGCACGAAAGAAAGGGCTTCTGGAAGTCCCTGCTG



GATGGGGGTTTTATCAGTAAAGAAAAGTATGAGAGGCTGATCCGAAATACCGAGCTCTCCCCCGAG



GAACTGGCTGGCTTTATCGAAAGGCAGATCGTAGAGACTAGGCAATCTACAAAGGCAGTCGCTGAG



ATCCTGAAGCAAGTGTTTCCTGAGTCAGAAATCGTGTACGTCAAAGCTGGCACAGTGTCACGGTTCC



GAAAGGACTTTGAGTTGTTAAAAGTTCGGGAGGTGAATGACCTGCACCACGCTAAAGACGCCTATC



TGAATATCGTTGTGGGGAACTCCTATTATGTTAAGTTTACTAAGAATGCGTCCTGGTTTATTAAGGA



GAACCCGGGGCGCACCTATAACCTGAAGAAGATGTTCACCTCCGGCTGGAACATAGAACGGAACG



GAGAAGTCGCGTGGGAGGTGGGTAAGAAAGGGACCATTGTGACCGTCAAACAGATTATGAACAAA



AACAACATATTGGTAACTCGCCAGGTGCATGAGGCCAAAGGGGGCCTCTTTGATCAGCAGATTATG



AAAAAGGGCAAAGGACAGATCGCAATCAAGGAAACCGACGAGCGCCTGGCATCCATTGAGAAGTA



CGGAGGCTACAACAAGGCGGCAGGTGCGTACTTCATGCTCGTCGAGTCCAAAGATAAGAAAGGCA



AAACTATTAGAACAATCGAGTTCATCCCTCTATATTTGAAAAATAAGATCGAAAGTGACGAAAGCA



TCGCCCTTAACTTCTTGGAGAAGGGCCGGGGCTTAAAGGAACCAAAGATTCTGCTCAAGAAGATCA



AGATCGACACACTCTTCGATGTGGATGGTTTTAAGATGTGGCTGTCAGGCAGGACAGGGGATCGCT



TGCTGTTCAAATGCGCAAATCAGTTGATTCTGGACGAAAAGATCATTGTGACGATGAAGAAGATCG



TTAAATTCATTCAGCGGAGACAGGAAAACAGAGAACTGAAACTCTCCGATAAGGATGGAATTGACA



ATGAAGTCCTCATGGAGATTTACAATACCTTTGTGGACAAGCTTGAGAACACAGTCTATCGGATCCG



ACTGTCCGAACAGGCAAAGACTCTGATCGACAAACAGAAAGAATTCGAAAGACTAAGCTTAGAGG



ACAAAAGTTCAACTCTCTTTGAAATTCTCCACATCTTCCAATGTCAAAGTAGTGCAGCCAACTTGAA



GATGATCGGGGGTCCCGGCAAGGCTGGAATCTTAGTCATGAACAACAACATCTCCAAATGTAACAA



AATCTCCATCATAAACCAGTCTCCCACCGGCATTTTCGAGAACGAAATTGATTTACTCAAG





SEQ ID NO: 156
ATGAAATCTTTCGATTCTTTCACCAACCTCTACTCCCTTAGCAAAACCCTTAAGTTTGAAATGAGGC



CGGTGGGGAATACACAGAAGATGCTTGACAATGCTGGCGTCTTTGAAAAGGACAAATTAATCCAGA



AGAAGTATGGTAAAACAAAGCCATATTTTGACCGATTGCATCGGGAATTCATTGAAGAGGCTCTTA



CAGGAGTAGAATTGATCGGACTGGACGAGAACTTCCGTACCTTAGTAGACTGGCAGAAGGACAAG



AAGAACAACGTGGCAATGAAGGCCTATGAGAACTCACTCCAGCGCCTTAGAACCGAGATCGGAAA



GATCTTTAATCTTAAGGCGGAAGATTGGGTAAAAAATAAGTACCCGATCCTGGGACTGAAAAACAA



AAACACAGACATCCTGTTTGAAGAAGCCGTCTTTGGTATCTTGAAGGCCAGGTATGGAGAGGAGAA



AGACACGTTTATAGAGGTAGAGGAGATTGATAAAACAGGCAAGAGTAAGATTAATCAGATCAGTA



TCTTTGATTCTTGGAAGGGGTTCACAGGCTACTTTAAGAAGTTTTTCGAAACCAGGAAAAATTTCTA



TAAGAACGATGGCACCTCCACAGCTATCGCGACACGCATCATAGATCAGAATCTGAAACGGTTCAT



TGATAATCTGAGCATTGTTGAATCCGTGCGCCAGAAGGTCGACCTAGCTGAGACTGAGAAGTCTTTC



TCTATATCACTCTCCCAGTTCTTCTCAATAGATTTTTATAATAAGTGCCTTCTGCAAGATGGCATAGA



CTACTATAACAAGATCATCGGCGGCGAAACTCTCAAAAACGGTGAAAAGCTCATTGGCCTGAATGA



GCTCATCAACCAATATAGACAAAATAACAAGGATCAGAAAATCCCATTCTTTAAGCTGCTAGATAA



ACAGATCCTATCAGAAAAAATCCTGTTCCTCGACGAAATCAAAAACGACACCGAACTCATCGAGGC



TCTCTCGCAGTTTGCCAAGACGGCTGAGGAGAAGACGAAGATTGTGAAAAAGCTGTTTGCAGACTT



TGTGGAGAACAACTCTAAATACGATTTGGCTCAGATTTATATCTCCCAGGAAGCATTTAACACAATC



TCCAATAAGTGGACTAGCGAGACTGAAACCTTCGCCAAATACCTGTTCGAGGCCATGAAAAGCGGC



AAGCTCGCCAAATACGAGAAGAAGGACAATTCCTATAAGTTTCCCGATTTCATCGCATTATCTCAGA



TGAAGTCCGCGCTACTTAGCATTAGCCTGGAAGGCCATTTTTGGAAGGAGAAATACTATAAGATTTC



CAAATTCCAAGAAAAGACCAATTGGGAGCAGTTCTTGGCTATTTTTCTATACGAGTTCAACTCTTTG



TTCAGTGACAAGATCAACACTAAGGACGGTGAGACCAAACAAGTGGGGTACTACCTCTTCGCCAAA



GATCTTCATAACCTGATACTGTCCGAACAGATCGACATACCCAAGGATTCAAAGGTGACCATCAAG



GATTTTGCGGATTCGGTATTGACGATCTATCAGATGGCGAAGTATTTCGCTGTCGAGAAAAAGCGG



GCATGGCTGGCCGAATACGAGTTGGACTCCTTCTATACTCAACCCGATACAGGGTACCTGCAGTTTT



ACGATAATGCATACGAGGATATAGTCCAGGTGTACAATAAACTCAGGAACTACCTCACTAAGAAAC



CATACTCCGAAGAAAAATGGAAACTTAATTTTGAGAATAGTACACTGGCCAATGGATGGGACAAGA



ACAAGGAATCAGACAACTCCGCTGTAATTCTCCAGAAGGGTGGCAAGTATTATCTGGGACTGATAA



CAAAGGGCCATAACAAGATTTTCGATGACCGTTTTCAGGAGAAGTTTATAGTGGGCATAGAGGGTG



GCAAGTATGAAAAAATAGTCTACAAGTTCTTTCCCGATCAGGCGAAGATGTTCCCCAAAGTATGCTT



CAGTGCTAAAGGCCTCGAGTTTTTCCGGCCATCTGAAGAGATACTCCGCATCTATAATAACGCAGA



GTTTAAAAAGGGAGAGACGTACTCAATCGACTCGATGCAGAAACTCATTGACTTCTACAAAGATTG



TCTCACAAAATACGAGGGCTGGGCTTGCTACACGTTTCGGCACTTGAAGCCAACCGAGGAATATCA



AAACAACATCGGGGAGTTCTTCCGTGACGTCGCCGAAGACGGCTATAGAATTGACTTTCAGGGCAT



AAGTGATCAGTATATTCACGAGAAGAATGAGAAAGGTGAGTTGCATCTTTTCGAAATCCACAATAA



AGACTGGAATCTTGACAAGGCTCGCGATGGAAAATCAAAGACTACCCAGAAGAATCTTCATACACT



TTACTTCGAGTCCCTCTTTTCCAACGACAACGTCGTACAGAATTTCCCAATAAAACTGAACGGCCAG



GCCGAAATTTTTTACAGGCCCAAAACCGAAAAAGATAAACTGGAATCCAAGAAAGACAAGAAGGG



AAATAAGGTGATAGATCACAAAAGGTATTCCGAGAACAAGATTTTTTTCCACGTACCTCTTACCCTG



AACAGAACGAAGAACGACTCTTATAGATTCAATGCCCAGATAAACAACTTTCTCGCAAACAACAAA



GATATCAATATTATCGGCGTCGATAGAGGTGAGAAGCACTTGGTATATTATTCTGTGATCACGCAAG



CATCCGATATCTTGGAGTCCGGTTCTTTGAACGAACTGAATGGTGTCAACTACGCCGAGAAACTCGG



TAAGAAAGCTGAGAATCGGGAGCAGGCTAGAAGGGACTGGCAGGACGTTCAGGGTATCAAGGACC



TGAAGAAGGGCTACATTTCTCAGGTGGTTCGAAAACTGGCTGATTTGGCCATTAAGCACAATGCAA



TCATCATTTTAGAAGATTTGAACATGCGGTTTAAACAAGTCAGGGGGGGGATAGAGAAATCAATTT



ACCAACAGCTGGAAAAAGCTCTGATTGATAAACTCTCTTTTTTGGTTGATAAGGGCGAAAAGAACC



CCGAGCAAGCAGGACATCTCCTTAAAGCCTATCAACTGAGCGCACCTTTCGAGACATTCCAGAAGA



TGGGAAAGCAAACCGGCATCATTTTCTATACCCAGGCTTCCTATACATCCAAGTCTGATCCAGTGAC



TGGGTGGAGACCCCATCTCTACCTCAAGTACTTTTCTGCCAAAAAAGCTAAGGACGACATTGCTAA



GTTCACAAAAATCGAGTTCGTGAACGACAGGTTCGAGCTGACTTATGACATAAAAGATTTCCAGCA



GGCCAAGGAGTACCCAAACAAGACAGTTTGGAAAGTGTGTTCCAATGTGGAGAGGTTTCGGTGGGA



CAAGAATCTGAATCAGAATAAAGGGGGATATACTCACTACACCAACATTACCGAGAACATCCAAGA



GTTGTTCACCAAATACGGCATCGACATTACTAAAGATCTGCTGACACAGATCTCCACCATCGATGAG



AAGCAGAACACATCTTTCTTCCGGGATTTCATCTTTTATTTTAACTTGATCTGTCAGATTAGAAATAC



CGACGACAGTGAGATAGCTAAAAAAAACGGGAAAGACGATTTCATTCTCTCTCCCGTGGAGCCGTT



TTTTGACTCCCGCAAAGACAATGGCAATAAGCTTCCGGAAAACGGGGACGATAACGGCGCCTACAA



CATCGCTCGTAAGGGAATCGTTATCCTCAATAAAATAAGCCAGTATTCCGAGAAGAACGAGAATTG



TGAAAAAATGAAGTGGGGGGACCTTTACGTCAGCAACATCGATTGGGATAACTTTGTGACACAAGC



CAATGCGAGACACTAG





SEQ ID NO: 157
ATGGAAAACTTCAAAAACCTCTACCCCATCAACAAGACCTTGAGGTTTGAGCTCCGGCCATATGGG



AAGACACTGGAGAACTTCAAAAAGTCCGGTCTGCTGGAAAAGGATGCTTTTAAGGCTAACTCTAGG



AGGTCTATGCAGGCCATTATCGATGAGAAATTCAAGGAGACCATAGAGGAGCGTCTGAAATATACT



GAGTTTTCCGAGTGTGACCTAGGAAATATGACCAGTAAGGACAAAAAGATCACCGACAAGGCAGC



GACAAACCTGAAGAAACAGGTGATTTTAAGCTTTGATGATGAGATTTTCAATAACTACTTGAAGCC



GGACAAAAACATCGACGCTCTGTTCAAGAATGATCCAAGCAACCCGGTCATCTCTACTTTCAAGGG



CTTCACCACATACTTTGTAAATTTCTTCGAAATACGGAAACACATCTTCAAGGGAGAGTCTTCCGGT



AGCATGGCTTACAGAATAATCGATGAGAACCTAACTACATATCTAAACAATATCGAGAAGATCAAG



AAATTGCCTGAAGAACTGAAATCTCAGCTTGAGGGAATCGATCAAATTGACAAACTGAACAACTAT



AACGAGTTCATCACCCAGTCCGGCATTACTCATTATAACGAAATTATTGGAGGGATTTCGAAGTCTG



AAAATGTCAAAATTCAAGGCATTAACGAAGGGATTAATCTTTACTGTCAAAAGAATAAAGTGAAGC



TACCACGCTTAACTCCTCTGTATAAGATGATTCTCTCTGATCGGGTCTCTAATTCCTTTGTGCTGGAT



ACCATTGAAAATGATACCGAGTTAATTGAAATGATCTCTGATCTGATAAATAAGACAGAGATAAGT



CAGGATGTTATTATGTCCGACATCCAAAATATTTTCATCAAATATAAACAACTCGGCAACTTGCCGG



GGATTAGCTACTCATCTATAGTGAATGCTATCTGTTCGGATTACGACAATAACTTTGGTGACGGCAA



ACGTAAAAAAAGCTATGAGAATGATCGCAAAAAACACCTCGAGACTAACGTGTATAGCATTAACTA



TATCTCAGAGTTACTGACAGACACCGACGTCTCCAGCAACATAAAGATGCGGTACAAAGAGCTGGA



GCAGAATTATCAGGTATGCAAGGAAAATTTCAACGCCACTAACTGGATGAACATCAAAAACATTAA



GCAGTCTGAGAAAACCAATCTGATCAAGGACCTTCTTGACATCCTCAAGAGCATCCAGCGGTTTTAT



GATTTGTTTGACATCGTGGATGAAGACAAAAATCCTAGTGCTGAGTTCTATACCTGGCTGTCTAAAA



ACGCGGAGAAACTGGACTTCGAGTTTAATTCAGTGTACAACAAGAGCAGGAACTACCTCACGAGAA



AGCAGTACTCCGATAAAAAGATTAAGTTGAACTTCGATAGTCCTACTCTCGCCAAGGGGTGGGATG



CGAACAAAGAAATTGATAATAGCACAATTATCATGAGGAAGTTCAACAACGACCGGGGCGATTAC



GATTACTTCTTGGGGATCTGGAATAAGAGCACACCTGCCAACGAAAAGATCATCCCATTAGAGGAT



AATGGACTGTTTGAAAAAATGCAATATAAGCTGTATCCCGATCCTAGTAAAATGCTGCCAAAGCAA



TTCCTTTCTAAGATCTGGAAAGCTAAACATCCAACTACACCCGAGTTTGATAAGAAGTACAAAGAA



GGTCGGCACAAGAAGGGGCCTGATTTTGAGAAAGAGTTTCTGCACGAGTTGATCGATTGCTTTAAG



CATGGATTGGTAAACCACGACGAAAAATATCAGGATGTGTTCGGGTTCAATCTGCGCAACACGGAA



GACTACAACTCTTATACAGAGTTTCTGGAGGACGTCGAAAGGTGCAACTATAATCTTAGTTTCAATA



AAATCGCTGACACGTCTAACTTGATAAATGATGGGAAACTCTATGTTTTTCAGATCTGGAGCAAGG



ATTTCAGCATAGATAGCAAGGGAACAAAAAACTTGAACACAATATACTTTGAATCCCTCTTCTCGG



AGGAAAATATGATCGAGAAGATGTTCAAGCTCTCAGGGGAAGCCGAAATATTCTATCGTCCAGCAA



GTTTGAATTATTGTGAAGATATTATCAAGAAGGGACACCACCACGCCGAACTGAAGGACAAATTCG



ACTATCCCATCATCAAGGACAAGCGATATAGCCAGGACAAATTTTTTTTTCATGTCCCCATGGTTAT



CAACTACAAAAGCGAGAAGTTAAACTCCAAATCACTTAACAATAGGACGAACGAAAATTTAGGCC



AATTCACGCACATCATCGGTATCGACCGCGGAGAGCGACATCTCATCTACCTGACCGTGGTGGATG



TGTCCACCGGTGAGATCGTTGAGCAAAAGCACCTGGATGAAATTATAAATACAGATACAAAAGGCG



TCGAGCATAAAACTCATTATCTCAATAAATTAGAAGAGAAGTCCAAGACGCGGGATAATGAAAGA



AAGTCCTGGGAAGCAATCGAGACGATTAAGGAGCTGAAAGAAGGCTATATTAGCCACGTGATCAAT



GAAATCCAGAAATTGCAGGAAAAGTATAACGCACTGATAGTGATGGAGAACCTCAATTATGGGTTT



AAGAACTCGCGTATCAAAGTGGAAAAGCAGGTCTACCAGAAATTCGAGACCGCCCTGATTAAAAA



GTTTAATTACATCATTGACAAGAAAGATCCTGAAACCTACATTCATGGATACCAACTGACGAATCC



AATCACTACACTCGATAAAATTGGTAACCAGAGCGGTATTGTGTTGTACATTCCGGCTTGGAATACA



AGCAAGATTGATCCAGTCACTGGTTTCGTTAACCTCCTGTATGCAGACGATTTGAAATACAAGAACC



AGGAGCAGGCTAAAAGCTTTATCCAGAAAATCGATAATATCTACTTCGAAAATGGTGAGTTTAAAT



TTGATATAGATTTCAGCAAATGGAACAACCGCTACTCAATTAGCAAGACGAAATGGACACTGACAA



GCTACGGAACCCGGATACAGACGTTCCGAAACCCCCAGAAAAATAACAAGTGGGACAGCGCCGAG



TATGACCTGACCGAAGAGTTTAAATTAATCCTGAACATCGATGGTACTCTGAAATCTCAGGATGTGG



AAACCTATAAGAAATTCATGTCTTTATTCAAGCTGATGTTGCAGCTGCGAAACTCCGTTACTGGAAC



AGACATTGACTACATGATTAGCCCTGTGACAGATAAAACTGGAACCCACTTTGATTCACGGGAGAA



TATCAAGAACCTGCCCGCCGATGCTGATGCGAACGGAGCTTACAACATTGCTAGGAAGGGCATCAT



GGCAATCGAGAATATTATGAACGGCATTAGCGACCCTCTGAAGATCAGTAATGAGGACTACCTGAA



GTACATTCAGAACCAACAAGAGTAA





SEQ ID NO: 158
ATGACCCAGTTTGAGGGTTTCACCAATCTTTATCAGGTGTCAAAAACACTCAGATTTGAGCTCATCC



CACAGGGTAAAACTTTAAAGCATATTCAAGAGCAGGGCTTTATAGAGGAAGACAAAGCCAGAAAC



GACCATTATAAGGAACTAAAACCGATCATTGACCGCATCTACAAAACCTATGCCGACCAATGCCTT



CAGCTCGTCCAACTCGATTGGGAGAATCTGAGCGCCGCTATTGACAGCTACAGGAAGGAGAAGACC



GAGGAGACTAGAAACGCCCTGATCGAGGAGCAGGCGACCTATAGAAACGCTATTCACGATTATTTT



ATCGGCCGCACCGACAATTTGACAGATGCCATCAACAAGCGGCACGCCGAAATTTATAAGGGGTTA



TTTAAGGCCGAGCTGTTCAATGGAAAAGTACTGAAACAGCTGGGCACCGTAACAACCACCGAACAC



GAGAATGCTCTGTTGAGGTCCTTCGACAAGTTTACTACCTACTTTAGCGGCTTCTACGAAAACCGTA



AAAACGTGTTTTCCGCGGAGGATATTTCAACAGCCATTCCTCATAGGATCGTGCAGGATAATTTCCC



CAAGTTTAAGGAGAACTGCCATATCTTTACCAGACTTATCACTGCTGTGCCAAGTTTACGAGAACAC



TTCGAGAATGTTAAGAAGGCTATAGGCATATTCGTTTCCACCTCCATCGAAGAAGTATTCAGTTTTC



CATTCTACAATCAGTTACTCACGCAGACCCAGATAGATCTCTACAATCAGCTGCTCGGAGGCATTTC



TAGAGAAGCAGGCACGGAAAAGATCAAGGGCTTAAATGAAGTACTCAATCTTGCAATTCAGAAGA



ACGATGAGACAGCACACATTATTGCATCTCTCCCTCACAGATTCATTCCCCTGTTCAAACAGATCCT



GTCCGATCGCAACACACTAAGCTTTATACTTGAGGAGTTTAAGTCAGATGAGGAAGTGATCCAGAG



CTTCTGTAAGTATAAGACTTTGCTCCGTAATGAAAACGTGCTTGAGACAGCAGAGGCTCTCTTTAAC



GAGTTGAATTCCATCGACCTGACACACATTTTTATCAGCCATAAAAAGCTGGAAACGATTAGCTCTG



CCTTGTGCGACCACTGGGACACCCTGCGTAACGCCCTCTATGAAAGGCGCATTTCCGAGCTCACCGG



GAAGATCACAAAAAGTGCCAAGGAAAAAGTCCAGAGGTCCCTTAAACATGAAGACATCAACCTAC



AAGAGATCATCTCTGCGGCTGGGAAAGAGCTGTCAGAAGCATTTAAACAGAAGACTTCCGAGATCC



TGAGCCACGCACACGCCGCATTAGACCAGCCCCTGCCTACAACTCTTAAAAAACAGGAGGAGAAGG



AGATTTTAAAGAGCCAGCTGGACTCATTACTCGGCCTGTATCATCTCCTGGACTGGTTCGCCGTGGA



CGAATCCAACGAGGTGGACCCAGAATTTAGCGCCAGGCTGACAGGAATTAAACTGGAAATGGAGC



CAAGTTTGAGCTTTTACAACAAGGCTCGGAACTATGCCACTAAAAAGCCCTACAGCGTGGAAAAGT



TCAAGCTGAATTTTCAGATGCCGACCCTGGCTTCCGGGTGGGATGTTAATAAGGAAAAGAATAATG



GGGCTATACTGTTCGTCAAAAATGGTCTCTACTACCTGGGAATCATGCCCAAACAGAAGGGCAGGT



ACAAAGCCCTTTCGTTTGAGCCGACCGAAAAAACCAGCGAAGGCTTTGATAAGATGTATTACGACT



ATTTCCCAGATGCAGCCAAGATGATCCCAAAATGTAGCACTCAGTTGAAGGCGGTAACCGCTCACT



TTCAGACACACACCACTCCTATCTTGCTCTCCAACAACTTTATTGAGCCGCTGGAGATCACGAAGGA



AATCTACGACCTTAACAACCCAGAGAAGGAACCCAAGAAATTCCAAACAGCTTATGCTAAGAAGAC



TGGGGATCAAAAGGGCTATCGAGAGGCTTTGTGTAAGTGGATTGACTTTACACGGGATTTCCTGAG



TAAGTATACCAAGACCACATCTATTGACCTGTCCTCACTGAGACCTTCCTCACAATATAAGGATCTC



GGAGAGTATTATGCCGAACTCAACCCTCTACTCTATCACATCTCTTTCCAGAGGATCGCCGAAAAGG



AAATTATGGACGCCGTCGAGACAGGCAAGCTGTACCTCTTCCAGATTTACAACAAGGATTTCGCAA



AGGGCCACCACGGAAAACCCAATTTGCACACTTTGTACTGGACAGGGCTCTTCTCTCCCGAAAATTT



GGCCAAAACTTCAATAAAACTGAACGGGCAAGCCGAGCTGTTCTATCGGCCCAAGTCACGTATGAA



GCGGATGGCCCACCGGCTGGGCGAGAAGATGCTCAACAAGAAACTGAAGGATCAGAAGACGCCCA



TACCAGACACTCTTTACCAAGAGCTGTATGACTACGTGAATCACAGACTGAGTCACGACCTGTCTGA



TGAAGCCCGGGCTCTTCTTCCAAATGTGATTACCAAAGAAGTTTCCCACGAAATTATCAAGGACCG



GCGCTTCACCTCTGACAAATTCTTTTTCCACGTCCCAATCACCCTCAACTACCAGGCAGCCAATTCC



CCTTCAAAGTTTAACCAGCGTGTGAATGCCTACCTGAAAGAGCATCCGGAGACCCCCATCATAGGG



ATAGACAGAGGAGAGCGGAATCTTATCTACATTACTGTGATTGACAGCACAGGTAAGATCTTGGAG



CAGAGATCTTTAAATACAATCCAGCAGTTTGACTACCAGAAGAAACTGGATAACCGAGAGAAGGA



AAGGGTTGCTGCAAGACAGGCCTGGTCAGTGGTCGGCACCATCAAAGACCTGAAGCAGGGCTACTT



ATCCCAAGTAATTCACGAAATTGTCGATCTTATGATTCATTATCAAGCCGTTGTTGTGCTGGAGAAC



CTGAATTTTGGCTTCAAAAGCAAACGAACAGGTATCGCCGAGAAAGCCGTGTATCAGCAGTTCGAA



AAGATGCTCATAGACAAGCTGAACTGCTTAGTGCTGAAGGATTATCCTGCTGAGAAGGTCGGCGGC



GTACTTAACCCATACCAGCTGACCGATCAGTTCACTAGTTTCGCCAAGATGGGAACGCAAAGTGGC



TTCCTTTTCTACGTGCCCGCTCCCTACACGAGTAAGATCGACCCTCTGACCGGCTTCGTCGACCCATT



CGTCTGGAAGACCATCAAGAATCACGAATCACGGAAACACTTCTTAGAGGGGTTTGACTTCCTGCA



CTACGACGTGAAGACAGGGGACTTCATCTTACACTTTAAGATGAATCGAAACCTCTCCTTCCAGCGG



GGCCTGCCTGGTTTCATGCCCGCATGGGACATCGTGTTTGAGAAAAACGAGACACAGTTTGACGCT



AAGGGAACCCCCTTTATTGCGGGGAAGCGGATTGTCCCAGTCATCGAAAACCATCGGTTCACCGGG



CGATACCGGGATCTGTACCCGGCCAACGAGCTCATCGCGCTGCTGGAGGAGAAGGGTATTGTGTTT



AGGGATGGATCCAACATTCTGCCTAAGTTGCTGGAAAATGATGATTCGCACGCCATTGATACCATG



GTTGCACTGATTAGATCCGTACTGCAGATGAGGAATAGCAATGCTGCAACCGGGGAGGATTATATT



AATTCCCCAGTGCGAGATCTGAATGGTGTCTGTTTTGACTCGCGCTTTCAGAATCCAGAATGGCCAA



TGGATGCAGACGCTAACGGGGCGTACCACATTGCTCTGAAAGGCCAGCTACTCCTGAACCACCTCA



AGGAGAGCAAAGATCTGAAGCTGCAGAACGGCATTTCCAACCAAGACTGGCTCGCCTACATACAAG



AACTGCGCAATTAA





SEQ ID NO: 159



ATGGCTGTCAAATCCATCAAGGTTAAATTACGGCTTGATGACATGCCCGAGATCCGCGCCGGGCTCT



GGAAACTCCATAAAGAAGTGAATGCTGGCGTTAGATACTACACAGAATGGCTCTCCCTGCTGCGCC



AGGAAAATTTGTACCGCCGGTCACCTAATGGAGATGGAGAGCAGGAATGCGATAAAACAGCAGAA



GAGTGCAAAGCCGAATTGCTGGAGCGACTGCGGGCACGGCAGGTTGAGAATGGACACCGAGGTCC



GGCGGGATCGGACGACGAGCTGCTCCAGCTCGCCAGACAATTATATGAACTGCTGGTGCCTCAGGC



TATTGGGGCAAAGGGTGACGCACAGCAGATTGCTAGAAAATTTCTGTCTCCCCTCGCCGACAAAGA



CGCTGTCGGCGGCCTTGGGATAGCCAAAGCCGGCAACAAACCCCGATGGGTGCGCATGAGGGAGG



CTGGTGAGCCTGGCTGGGAGGAAGAAAAGGAAAAGGCCGAAACCAGAAAGTCCGCCGACAGGACC



GCGGACGTACTCCGAGCATTGGCCGATTTTGGGCTGAAGCCCTTAATGCGAGTCTACACCGATAGT



GAAATGTCTAGCGTGGAGTGGAAGCCATTACGCAAAGGGCAGGCAGTGCGGACGTGGGACCGTGA



CATGTTCCAGCAAGCCATCGAGCGAATGATGAGCTGGGAGAGCTGGAACCAGAGAGTGGGGCAGG



AGTATGCCAAGCTGGTCGAGCAGAAAAACCGGTTTGAGCAAAAAAATTTTGTAGGTCAGGAACACC



TGGTGCATCTCGTTAACCAGCTCCAGCAAGATATGAAGGAAGCTTCGCCTGGATTAGAGAGCAAAG



AGCAGACTGCACACTATGTAACCGGAAGAGCACTGAGGGGCAGTGACAAAGTGTTCGAAAAATGG



GGAAAACTGGCTCCCGATGCCCCCTTTGACCTGTACGACGCAGAAATAAAAAACGTGCAGCGGCGA



AACACCAGGCGATTTGGTAGCCATGATCTGTTCGCCAAATTGGCAGAGCCGGAATATCAGGCTCTTT



GGCGAGAAGACGCATCATTTCTCACTAGGTACGCGGTCTATAACTCCATTTTGAGGAAATTGAACC



ACGCAAAAATGTTTGCCACCTTCACGTTGCCTGACGCCACCGCTCATCCCATTTGGACACGGTTTGA



TAAGCTGGGCGGCAATCTGCATCAGTATACATTCCTGTTTAACGAGTTTGGAGAGCGAAGACATGC



GATACGATTCCACAAGCTACTGAAGGTCGAAAATGGCGTGGCACGTGAGGTGGACGATGTCACCGT



GCCCATCAGCATGAGCGAACAGCTGGATAATTTGTTGCCGCGGGACCCAAATGAACCTATAGCCCT



TTATTTTAGGGACTACGGGGCGGAGCAACATTTCACTGGGGAGTTTGGCGGCGCAAAAATTCAGTG



CCGACGCGACCAGCTCGCCCACATGCATAGAAGACGCGGGGCCCGGGACGTATACCTTAACGTCTC



TGTGAGGGTGCAGTCCCAGTCAGAGGCAAGAGGGGAACGCAGACCACCTTACGCAGCAGTATTCA



GGCTGGTAGGCGATAACCACCGGGCGTTTGTACACTTTGATAAACTTTCTGACTACCTGGCCGAACA



CCCGGATGACGGCAAATTAGGATCGGAGGGGCTGCTTAGCGGCCTGCGTGTGATGAGCGTCGATCT



GGGGCTACGGACCTCTGCTTCCATCTCTGTGTTCCGTGTGGCCCGAAAGGACGAGTTGAAACCTAAT



TCGAAGGGCCGTGTACCATTCTTTTTCCCTATTAAGGGAAATGATAATCTCGTCGCGGTGCACGAGC



GTTCCCAACTGCTGAAACTGCCTGGCGAGACCGAGTCCAAAGATCTCAGAGCAATCCGGGAGGAGC



GACAACGTACACTTAGGCAACTCCGCACCCAGCTGGCCTATCTGCGCTTGCTGGTGCGGTGCGGCTC



CGAGGATGTAGGGAGAAGAGAGCGAAGCTGGGCAAAGCTGATAGAGCAACCAGTTGACGCCGCGA



ATCACATGACCCCCGACTGGCGCGAAGCGTTTGAAAATGAGCTGCAGAAGTTGAAATCTCTGCATG



GGATTTGCTCAGATAAGGAGTGGATGGACGCCGTATACGAGTCTGTTCGCCGGGTATGGCGGCACA



TGGGGAAGCAGGTGAGAGATTGGAGAAAGGACGTTCGCTCTGGGGAACGGCCGAAAATTCGGGGA



TACGCAAAGGATGTCGTGGGCGGCAATAGCATTGAGCAGATCGAGTACCTGGAAAGGCAATACAA



ATTTCTGAAATCTTGGTCTTTCTTTGGGAAGGTAAGCGGACAAGTTATCAGAGCCGAAAAGGGATCT



CGCTTTGCTATCACATTGAGGGAACACATTGATCACGCCAAAGAAGACAGGTTGAAAAAGTTGGCT



GATCGCATTATCATGGAAGCACTCGGTTACGTCTACGCCCTTGATGAGCGCGGTAAAGGGAAGTGG



GTAGCCAAGTATCCCCCATGTCAGCTGATCCTGCTCGAGGAACTTTCTGAGTATCAGTTCAATAACG



ACCGTCCTCCCTCCGAAAATAATCAGCTCATGCAATGGTCCCACCGGGGTGTGTTCCAAGAACTGAT



CAATCAGGCTCAGGTGCACGACCTCCTCGTAGGCACTATGTATGCAGCCTTTAGCTCCCGTTTTGAC



GCGCGCACAGGCGCCCCTGGAATACGATGTAGGCGAGTTCCCGCACGGTGCACTCAAGAACATAAC



CCGGAGCCTTTCCCATGGTGGCTCAATAAGTTTGTTGTGGAGCATACCCTCGACGCTTGCCCATTGA



GGGCGGATGACTTGATTCCCACAGGCGAGGGGGAGATCTTCGTGAGCCCATTTTCTGCCGAAGAAG



GGGATTTCCACCAAATACATGCCGACTTGAATGCTGCCCAAAATCTGCAGCAAAGGCTGTGGTCAG



ACTTCGACATCTCGCAAATCAGACTGCGGTGTGACTGGGGCGAAGTAGACGGCGAGCTGGTGCTGA



TACCTAGACTGACGGGTAAGCGTACCGCCGATAGCTATAGTAATAAGGTTTTTTATACGAATACGG



GGGTGACATATTACGAGCGTGAGAGAGGCAAGAAGCGTCGGAAGGTGTTCGCGCAGGAGAAGCTG



AGCGAAGAGGAGGCGGAGCTACTGGTAGAGGCAGATGAGGCAAGAGAAAAGTCCGTCGTCCTGAT



GCGGGATCCTAGCGGGATTATTAACAGAGGTAATTGGACACGGCAGAAAGAATTCTGGAGCATGGT



GAATCAAAGAATCGAGGGTTACCTGGTGAAGCAAATTCGAAGCCGGGTGCCCCTTCAAGACAGCGC



ATGTGAAAACACTGGGGACATCTAG





SEQ ID NO: 160



ATGGCTACTCGGTCCTTCATCCTGAAAATCGAGCCAAATGAAGAGGTGAAAAAGGGCCTGTGGAAG



ACCCATGAGGTACTTAACCACGGCATAGCATACTATATGAATATCCTAAAACTTATACGGCAGGAG



GCTATCTACGAGCATCACGAGCAAGATCCTAAAAATCCAAAGAAGGTTAGTAAGGCTGAAATCCAG



GCTGAATTGTGGGACTTCGTGCTGAAGATGCAGAAATGCAACAGTTTCACGCATGAAGTTGATAAG



GACGTCGTGTTTAATATACTCCGGGAGCTGTACGAAGAACTGGTACCAAGCTCTGTGGAAAAGAAA



GGAGAGGCCAACCAGCTAAGTAATAAGTTCCTCTATCCTCTCGTGGACCCCAATTCACAGAGCGGC



AAAGGTACCGCATCTTCTGGGAGGAAACCACGCTGGTACAACTTGAAGATCGCTGGCGATCCCAGC



TGGGAGGAGGAAAAGAAGAAATGGGAAGAGGATAAAAAGAAAGACCCCCTGGCCAAAATCTTAG



GCAAGCTCGCCGAGTACGGTCTGATTCCACTTTTCATCCCGTTCACAGATAGCAATGAGCCGATCGT



CAAGGAGATTAAGTGGATGGAAAAGAGCCGCAATCAGAGTGTGCGGAGGCTGGACAAAGACATGT



TTATTCAGGCCCTGGAACGCTTCCTTAGCTGGGAAAGCTGGAACCTGAAGGTTAAGGAAGAGTACG



AAAAAGTCGAGAAGGAGCATAAGACTTTGGAGGAGCGCATCAAAGAAGACATCCAGGCCTTTAAG



TCTCTAGAACAGTATGAGAAAGAACGGCAGGAACAGCTGCTGCGTGATACACTGAACACAAACGA



ATATCGCCTGAGCAAGAGGGGACTCAGAGGCTGGAGAGAAATCATTCAAAAGTGGCTCAAAATGG



ATGAAAATGAGCCGTCTGAAAAATACCTTGAAGTTTTCAAGGACTACCAGCGGAAGCACCCTAGAG



AAGCCGGCGACTATAGTGTTTACGAATTCTTGAGCAAGAAGGAGAATCATTTTATATGGAGGAATC



ACCCGGAGTACCCATATCTGTACGCAACCTTCTGCGAAATCGACAAGAAAAAAAAAGACGCCAAGC



AACAGGCTACATTTACTCTGGCCGACCCTATCAATCACCCTCTATGGGTCCGGTTTGAGGAGCGCTC



CGGAAGCAATCTGAATAAATATCGTATTCTGACTGAACAGTTACACACAGAGAAGCTCAAGAAGAA



ACTTACGGTGCAGCTGGACCGCCTGATATACCCAACAGAGTCCGGAGGATGGGAAGAGAAAGGAA



AGGTTGACATCGTACTGCTTCCATCTCGTCAGTTTTACAACCAGATATTCCTGGACATCGAGGAGAA



GGGGAAACACGCCTTCACATACAAGGACGAGTCCATAAAGTTCCCACTGAAGGGTACTTTAGGCGG



TGCTAGGGTGCAGTTCGACCGCGATCACCTGAGACGGTACCCCCACAAGGTGGAGAGCGGGAACGT



GGGACGAATCTACTTTAATATGACAGTGAACATTGAACCCACAGAGAGTCCAGTTAGTAAATCCCT



GAAAATTCACCGTGACGACTTTCCGAAATTTGTGAATTTCAAGCCAAAGGAGCTTACGGAGTGGAT



CAAGGATTCAAAGGGAAAGAAGCTGAAATCTGGTATCGAATCTCTCGAGATCGGTCTCCGTGTCAT



GAGCATCGATCTGGGACAGCGCCAGGCAGCTGCCGCCAGTATATTCGAGGTGGTAGACCAAAAGCC



TGACATCGAGGGAAAGCTCTTCTTCCCAATCAAAGGCACAGAGCTGTATGCGGTGCACCGGGCGTC



CTTTAATATAAAGCTGCCCGGTGAAACCCTGGTGAAGTCACGGGAGGTGCTTAGAAAAGCGCGAGA



GGATAACCTCAAACTGATGAACCAAAAACTGAACTTTCTGAGGAACGTCCTGCACTTTCAGCAGTT



CGAAGATATTACCGAACGCGAAAAGAGAGTAACCAAGTGGATATCTCGTCAAGAGAACAGCGACG



TCCCGTTAGTCTATCAGGACGAACTCATCCAAATACGGGAGTTGATGTATAAGCCCTACAAGGATT



GGGTCGCCTTTCTTAAGCAGCTTCACAAACGCCTAGAGGTCGAAATAGGTAAAGAGGTGAAACATT



GGCGGAAGTCGCTCAGCGACGGGAGGAAGGGACTTTATGGCATCTCTTTGAAGAACATTGACGAAA



TCGATAGAACCAGAAAATTTTTGTTGAGATGGTCCCTCCGACCCACCGAGCCTGGAGAGGTGAGGC



GGTTAGAACCAGGACAGAGGTTCGCTATCGATCAGCTGAATCACCTCAATGCTCTGAAGGAGGACC



GCCTCAAGAAAATGGCCAATACAATCATAATGCACGCCCTTGGCTACTGCTACGACGTCCGAAAGA



AGAAGTGGCAGGCCAAGAATCCCGCCTGTCAAATTATCCTTTTTGAGGATCTTAGCAATTACAACCC



CTATGAAGAGCGGTCCAGATTCGAAAATAGTAAGCTCATGAAGTGGAGCCGCAGGGAGATCCCGC



GCCAAGTGGCCCTTCAGGGGGAAATTTATGGGCTGCAGGTAGGCGAGGTCGGGGCCCAATTCTCCT



CGCGCTTTCATGCGAAAACTGGAAGTCCTGGAATCCGGTGCTCAGTGGTGACAAAGGAGAAGTTGC



AAGACAATCGGTTTTTTAAAAACTTACAGCGGGAGGGAAGGCTGACCCTGGATAAGATAGCCGTAC



TTAAGGAAGGAGATCTGTACCCTGACAAAGGCGGTGAAAAGTTCATTAGCTTGAGCAAGGACCGAA



AACTTGTGACCACCCACGCTGACATCAATGCGGCACAGAACCTGCAGAAGAGATTTTGGACTCGCA



CCCACGGATTCTACAAAGTTTACTGCAAAGCATATCAAGTAGACGGACAGACCGTATACATCCCCG



AGTCCAAAGATCAGAAGCAGAAAATTATTGAAGAGTTTGGGGAAGGGTACTTTATCCTGAAGGATG



GTGTCTACGAATGGGGCAACGCTGGTAAACTTAAAATTAAGAAGGGCAGCTCTAAACAGTCCTCCA



GCGAGTTAGTTGATTCTGATATTCTGAAAGACAGTTTCGACCTGGCCAGCGAACTTAAAGGGGAAA



AATTAATGCTGTACCGGGACCCCAGCGGAAACGTCTTTCCATCCGATAAGTGGATGGCCGCTGGAG



TGTTCTTTGGCAAGTTAGAGAGGATTCTCATAAGTAAGCTGACCAACCAATACTCAATCTCCACAAT



CGAGGATGACTCATCCAAGCAGTCTATGTGA





SEQ ID NO: 161



ATGCCTACACGCACTATCAACCTGAAACTGGTTCTTGGCAAGAATCCAGAGAATGCTACCCTTCGTC



GGGCACTATTTTCAACGCATAGACTGGTGAATCAGGCTACCAAACGGATTGAAGAGTTCCTCTTGCT



TTGTCGGGGGGAAGCATATAGGACGGTGGATAATGAGGGGAAAGAGGCTGAAATTCCGAGACACG



CCGTGCAGGAGGAAGCTCTTGCGTTTGCAAAGGCCGCTCAACGGCACAATGGTTGCATCTCTACTTA



TGAAGACCAGGAAATCCTGGATGTGCTCCGGCAACTGTATGAAAGGCTGGTGCCTTCTGTGAATGA



AAATAATGAAGCAGGGGACGCTCAAGCCGCAAACGCGTGGGTGTCGCCACTGATGTCCGCCGAGTC



CGAGGGAGGGCTCAGCGTTTACGACAAGGTGCTGGACCCACCCCCAGTGTGGATGAAACTCAAAGA



GGAAAAAGCTCCGGGCTGGGAGGCTGCTTCCCAGATCTGGATCCAGTCCGACGAAGGGCAGTCCCT



TCTTAACAAGCCTGGTTCGCCCCCGCGGTGGATTAGGAAACTGAGGTCAGGCCAGCCTTGGCAGGA



CGATTTTGTTAGCGACCAGAAAAAGAAGCAGGACGAGCTGACAAAGGGGAATGCGCCACTGATCA



AACAATTAAAGGAAATGGGCTTATTGCCTCTTGTGAATCCCTTTTTTAGACATCTGCTTGACCCGGA



GGGGAAGGGGGTGTCACCTTGGGACAGACTCGCTGTTAGGGCCGCTGTCGCTCATTTCATATCATG



GGAATCATGGAACCACCGGACACGCGCCGAATACAATAGTTTGAAGCTGCGGAGGGATGAGTTCG



AAGCAGCTTCCGACGAATTCAAGGACGACTTCACGCTGCTTCGGCAGTACGAGGCTAAGAGGCACT



CCACACTGAAGAGTATAGCTTTAGCCGATGATTCAAACCCTTATAGGATCGGCGTACGCTCCCTCCG



CGCTTGGAACCGCGTCCGCGAGGAGTGGATCGACAAGGGAGCGACCGAGGAGCAGCGGGTCACCA



TTCTCAGCAAGTTGCAGACCCAACTAAGGGGCAAATTTGGAGATCCTGACTTGTTCAACTGGCTGGC



GCAGGACCGGCACGTGCACCTCTGGAGCCCTAGAGATAGTGTTACCCCACTGGTTAGGATCAACGC



TGTTGACAAAGTATTGCGACGGAGAAAACCGTACGCCTTGATGACTTTTGCCCACCCAAGATTCCAC



CCTCGGTGGATACTTTACGAAGCCCCAGGGGGCAGCAATCTCCGCCAGTATGCACTGGATTGTACC



GAAAATGCTCTGCACATTACACTGCCTCTGCTGGTTGACGATGCACATGGCACATGGATTGAGAAA



AAAATTAGGGTTCCTCTTGCCCCCAGCGGCCAGATTCAGGACCTGACACTAGAAAAGCTCGAGAAG



AAGAAAAATCGTCTCTACTACCGTTCTGGGTTCCAGCAGTTTGCCGGCCTGGCCGGAGGTGCCGAG



GTGCTTTTCCATCGACCATACATGGAGCACGATGAGAGGAGCGAGGAGAGCTTATTAGAACGCCCT



GGTGCTGTTTGGTTCAAACTCACCTTGGACGTGGCAACCCAGGCCCCTCCAAACTGGTTGGACGGA



AAGGGCCGCGTCCGAACGCCCCCCGAGGTTCACCACTTCAAGACAGCCCTCAGTAACAAGTCTAAG



CACACACGGACCCTCCAGCCCGGACTCAGAGTGTTATCCGTGGATCTGGGAATGCGCACCTTCGCCT



CTTGCTCCGTATTTGAGCTGATCGAGGGCAAACCAGAGACTGGCAGAGCGTTCCCTGTGGCCGACG



AACGTTCCATGGATTCACCAAACAAGCTGTGGGCCAAGCACGAAAGATCCTTTAAACTCACGCTCC



CCGGCGAAACCCCCAGTCGGAAAGAAGAGGAGGAACGGAGCATTGCAAGAGCCGAAATCTATGCG



TTGAAAAGAGATATTCAGAGATTAAAAAGTCTTCTGCGCCTGGGGGAAGAGGATAACGATAATAGA



CGCGATGCACTTCTTGAGCAATTTTTCAAGGGCTGGGGCGAGGAAGACGTGGTTCCAGGTCAGGCC



TTTCCCCGGAGTCTGTTCCAGGGGCTGGGGGCCGCCCCATTCAGATCCACCCCTGAGTTGTGGAGAC



AACACTGTCAAACCTATTATGATAAAGCAGAGGCGTGCCTGGCTAAACACATCAGCGATTGGCGCA



AGAGAACCAGGCCTAGGCCTACCTCACGTGAGATGTGGTACAAGACACGCTCTTATCACGGCGGAA



AGTCAATCTGGATGCTGGAATACCTCGACGCTGTGAGGAAACTGCTCTTATCCTGGAGCCTCAGAG



GCCGGACCTACGGGGCTATCAACAGACAGGACACAGCAAGGTTCGGGAGCTTAGCCAGCCGGCTCC



TTCACCACATTAACTCACTCAAAGAGGATCGAATAAAGACCGGAGCCGACTCGATCGTGCAGGCAG



CCCGAGGGTACATCCCCCTGCCTCATGGGAAGGGCTGGGAGCAGCGATATGAACCCTGCCAGCTGA



TCTTGTTTGAGGACCTTGCCCGTTATAGATTTCGCGTTGATAGACCTCGCCGTGAGAATTCTCAGCT



GATGCAGTGGAACCACAGAGCGATCGTGGCTGAGACCACTATGCAGGCCGAGCTGTATGGACAGAT



CGTGGAGAACACCGCCGCAGGGTTCAGTTCTCGGTTTCATGCTGCCACCGGAGCTCCCGGCGTCCG



GTGCCGCTTCCTCTTAGAGCGTGATTTTGACAATGACCTCCCAAAGCCCTATCTGCTGAGGGAACTG



AGCTGGATGCTGGGGAACACAAAAGTAGAATCGGAGGAGGAGAAGCTACGGCTCCTCTCCGAAAA



GATACGTCCAGGCTCTCTGGTACCATGGGACGGAGGAGAGCAGTTCGCGACACTGCATCCTAAGAG



ACAGACGTTATGTGTGATTCACGCCGATATGAACGCCGCTCAGAATCTGCAGCGAAGATTCTTTGGC



CGCTGCGGCGAAGCCTTCAGGCTGGTATGTCAGCCCCACGGGGATGATGTGCTGCGGCTGGCCTCA



ACCCCTGGGGCTAGACTCTTGGGGGCACTCCAGCAGCTGGAAAATGGCCAAGGGGCTTTCGAACTC



GTTCGGGACATGGGCAGCACAAGCCAGATGAACAGATTCGTCATGAAGAGCCTGGGAAAGAAAAA



GATCAAACCCTTACAGGACAATAATGGCGACGACGAACTGGAGGACGTGTTGTCCGTGCTGCCAGA



GGAAGACGACACAGGCCGCATCACTGTCTTCCGCGACTCAAGTGGGATATTCTTTCCTTGCAACGTG



TGGATTCCGGCCAAACAGTTCTGGCCTGCCGTCAGAGCCATGATTTGGAAAGTGATGGCTAGTCATT



CATTGGGATGA





SEQ ID NO: 162



ATGACAAAGCTGAGGCACAGACAAAAGAAGCTTACACACGACTGGGCAGGGAGCAAGAAACGTGA



GGTCCTTGGGTCAAATGGAAAACTGCAGAACCCCTTGCTCATGCCTGTAAAGAAGGGGCAGGTAAC



AGAATTTAGAAAAGCATTCTCCGCGTACGCTCGGGCAACTAAGGGGGAAATGACCGATGGACGGA



AGAACATGTTCACCCATTCTTTCGAGCCATTCAAAACAAAGCCGTCATTGCACCAATGCGAGCTGGC



CGATAAGGCTTACCAGTCTTTGCATAGTTACCTCCCCGGTTCCCTGGCCCATTTCTTGCTTTCCGCAC



ACGCACTGGGCTTTCGTATTTTCTCTAAATCTGGGGAGGCAACTGCCTTCCAGGCCAGCTCAAAAAT



CGAGGCCTATGAGTCCAAGCTCGCTTCGGAGCTAGCCTGTGTCGATTTGAGTATCCAGAATTTGACG



ATTAGTACTCTTTTCAACGCTCTCACAACTTCAGTTCGGGGCAAGGGGGAGGAAACTTCAGCAGATC



CCCTTATCGCACGGTTCTACACTCTCCTGACGGGCAAGCCCCTGAGCCGAGACACACAGGGCCCAG



AACGGGACTTGGCAGAGGTCATCTCCAGAAAGATCGCCTCGTCCTTCGGCACATGGAAGGAAATGA



CTGCCAACCCTCTGCAGAGCCTCCAGTTCTTCGAAGAAGAGCTTCATGCACTAGATGCCAACGTGTC



TTTATCTCCAGCTTTTGATGTGTTAATCAAGATGAATGATCTCCAAGGTGATCTGAAGAACCGTACT



ATAGTGTTCGACCCAGATGCACCCGTGTTCGAGTACAACGCTGAGGATCCAGCCGATATCATCATA



AAGCTGACAGCTCGGTATGCGAAGGAGGCCGTCATCAAGAATCAGAACGTGGGCAATTATGTGAA



AAACGCCATTACCACCACTAATGCCAATGGGCTGGGGTGGCTCCTCAATAAAGGGCTTTCACTACT



GCCAGTTTCTACTGACGATGAGCTGCTCGAATTCATTGGGGTGGAGAGAAGCCATCCCAGCTGTCA



CGCGCTGATAGAGCTGATTGCCCAGCTAGAGGCGCCGGAACTGTTTGAGAAGAATGTGTTTAGTGA



CACCCGTTCCGAGGTTCAGGGTATGATCGACAGTGCAGTGTCGAACCACATTGCTCGGCTGTCCAGC



AGCCGAAACTCCCTGAGCATGGACAGCGAGGAATTGGAACGCTTGATTAAATCTTTCCAGATTCAT



ACTCCCCATTGTTCTCTGTTCATAGGCGCTCAGTCCTTATCTCAGCAGCTGGAGAGCTTACCTGAGG



CGCTGCAGTCCGGAGTGAACAGCGCTGATATCTTATTAGGCAGCACACAGTATATGCTGACCAACT



CTCTCGTTGAAGAGTCAATTGCAACATATCAAAGGACATTAAATAGGATCAATTACCTGAGTGGGG



TGGCTGGGCAGATTAACGGTGCTATCAAAAGAAAGGCAATCGACGGCGAAAAAATACACCTGCCT



GCCGCCTGGAGTGAGCTCATCTCCTTACCTTTCATTGGACAGCCGGTGATTGATGTGGAGAGCGACC



TGGCACACTTAAAAAACCAGTACCAGACCCTGTCCAATGAATTTGACACCCTCATTTCGGCCCTGCA



GAAGAACTTCGATTTGAATTTCAACAAAGCACTCCTTAACCGCACGCAGCATTTCGAGGCAATGTG



CCGGAGCACAAAAAAAAATGCTTTATCTAAGCCCGAGATCGTGTCCTACAGAGATCTGCTGGCGCG



GCTGACCAGTTGCCTTTATCGAGGCTCGCTGGTTCTCAGAAGGGCGGGAATCGAAGTTCTGAAAAA



GCACAAAATCTTTGAGTCGAATAGTGAGCTGAGAGAACACGTCCACGAGCGAAAGCACTTCGTGTT



CGTTAGTCCATTGGACAGAAAGGCAAAAAAACTGTTGCGCCTGACCGATTCCCGCCCTGACTTGCTC



CATGTGATCGATGAGATCCTGCAACATGACAATCTGGAGAATAAGGACAGAGAGTCCCTTTGGCTG



GTCCGGTCTGGGTACCTCCTTGCTGGTCTGCCGGACCAGCTGAGTTCTTCGTTTATCAATCTCCCCAT



AATCACGCAAAAGGGCGATCGCCGGCTGATTGACCTGATTCAGTATGACCAGATCAATCGCGATGC



TTTCGTAATGTTGGTGACAAGTGCTTTCAAAAGCAATCTCTCTGGGTTGCAGTACCGCGCTAACAAG



CAGTCTTTCGTGGTCACCCGCACCCTGTCTCCTTACCTGGGTAGTAAGCTCGTATACGTCCCTAAAG



ACAAAGATTGGCTGGTCCCATCCCAGATGTTTGAGGGAAGATTCGCCGATATTCTGCAGAGTGACT



ACATGGTCTGGAAGGATGCCGGACGCCTGTGCGTGATCGACACTGCCAAACATCTCTCTAACATTA



AAAAAAGCGTGTTTAGTAGCGAAGAAGTCCTTGCTTTTCTTCGAGAGCTGCCTCACCGGACCTTCAT



CCAGACCGAGGTACGGGGGTTAGGAGTGAACGTCGATGGAATCGCATTTAATAACGGGGATATCCC



GAGCTTGAAGACATTCTCGAATTGTGTGCAGGTGAAGGTGAGTAGGACTAATACTAGTCTCGTGCA



GACTCTAAACAGGTGGTTCGAGGGTGGCAAAGTGTCACCTCCCTCTATTCAGTTCGAAAGAGCTTAC



TACAAAAAAGACGATCAGATTCACGAGGACGCAGCCAAGAGAAAGATACGCTTCCAGATGCCAGC



AACGGAATTAGTGCACGCCAGCGATGACGCTGGTTGGACCCCCAGCTACCTGCTGGGCATCGACCC



CGGTGAGTACGGAATGGGTCTCAGTTTGGTGTCCATCAACAATGGAGAGGTCCTGGATTCTGGATTC



ATCCACATTAATTCCCTGATCAATTTCGCGTCCAAAAAAAGCAATCACCAGACCAAAGTAGTCCCCC



GCCAGCAGTACAAGTCCCCCTACGCGAATTATCTCGAGCAGTCAAAGGATTCAGCAGCAGGGGATA



TAGCTCACATTCTGGATCGGCTAATCTACAAATTGAACGCCTTGCCTGTGTTCGAGGCGCTGTCTGG



CAACAGTCAGAGTGCTGCTGATCAGGTATGGACCAAAGTTCTATCCTTCTATACATGGGGAGACAA



CGACGCACAGAACAGTATACGGAAGCAGCACTGGTTCGGTGCCTCACACTGGGATATTAAGGGGAT



GCTGCGCCAACCCCCAACCGAAAAAAAACCCAAACCATATATAGCCTTTCCCGGGAGTCAAGTGTC



ATCCTATGGAAATAGTCAAAGGTGTAGTTGTTGCGGCCGCAATCCCATTGAGCAGTTGCGTGAGAT



GGCAAAGGACACGAGTATCAAGGAGCTGAAAATCCGAAATAGTGAGATCCAACTATTCGATGGTA



CAATCAAGCTGTTTAACCCCGACCCTTCCACCGTCATCGAGAGGCGGCGGCATAACCTAGGACCCT



CACGCATTCCTGTGGCAGACCGAACTTTCAAGAATATTAGCCCTTCTTCGTTAGAGTTCAAGGAGCT



CATTACTATCGTTTCTCGAAGCATCCGCCATAGCCCCGAATTTATTGCTAAGAAACGGGGTATCGGG



TCTGAGTACTTTTGTGCTTATTCTGACTGCAACTCCTCACTGAACTCAGAGGCCAATGCCGCGGCCA



ATGTGGCACAGAAGTTTCAGAAGCAACTCTTTTTCGAACTCTGA





SEQ ID NO: 163



ATGAAACGTATTCTGAACTCTCTGAAAGTCGCCGCACTGAGGCTGCTGTTTCGAGGAAAGGGCTCA



GAGCTGGTGAAGACCGTCAAGTACCCTCTGGTTTCGCCCGTCCAGGGTGCTGTGGAAGAACTCGCC



GAAGCAATACGCCACGACAACCTACATTTATTTGGGCAGAAGGAAATCGTAGATCTGATGGAGAAG



GACGAGGGCACCCAGGTCTACTCGGTGGTGGACTTTTGGCTCGACACACTCCGTCTAGGGATGTTCT



TCAGTCCAAGTGCTAATGCCCTTAAGATCACTCTGGGGAAGTTTAACAGCGACCAAGTTTCCCCTTT



CAGGAAGGTTCTGGAGCAGTCCCCTTTCTTTCTCGCGGGTAGACTCAAAGTGGAGCCCGCTGAACGT



ATCCTCAGCGTGGAGATCCGCAAGATCGGTAAGAGGGAGAATAGAGTGGAGAACTACGCCGCAGA



TGTAGAGACTTGTTTTATCGGTCAGCTGTCTAGTGATGAAAAGCAGTCTATCCAGAAGCTCGCTAAC



GATATCTGGGACTCTAAGGATCACGAAGAGCAAAGGATGCTTAAGGCGGATTTCTTTGCCATTCCC



CTCATCAAAGACCCAAAGGCAGTGACCGAGGAAGATCCCGAGAATGAAACCGCAGGCAAACAGAA



GCCTCTCGAATTATGTGTGTGCTTAGTGCCCGAGTTGTACACCCGCGGGTTCGGTTCAATAGCGGAC



TTCCTGGTCCAGCGTCTGACACTATTAAGAGACAAAATGAGCACAGACACAGCAGAAGACTGCCTT



GAGTATGTCGGCATAGAGGAGGAGAAGGGTAATGGGATGAACTCGCTGCTGGGGACGTTCCTCAA



GAACCTGCAGGGAGACGGGTTCGAACAGATCTTCCAATTTATGCTCGGCAGTTACGTGGGATGGCA



AGGTAAGGAAGACGTCCTACGCGAACGGCTTGATTTGCTAGCGGAGAAGGTTAAAAGACTGCCGA



AACCTAAGTTTGCCGGCGAGTGGTCCGGCCATCGGATGTTCCTGCATGGTCAATTGAAGAGCTGGTC



CTCTAACTTTTTCCGCCTGTTTAACGAGACTAGGGAGCTCCTCGAAAGCATAAAATCCGACATCCAA



CACGCGACCATGTTAATCAGCTACGTCGAAGAGAAAGGGGGATACCACCCACAACTCTTGTCACAG



TACAGGAAACTAATGGAGCAGCTGCCAGCTCTCAGAACAAAGGTGTTAGATCCAGAGATAGAAAT



GACTCACATGAGCGAGGCGGTAAGGTCGTACATTATGATCCACAAGTCGGTAGCAGGATTTCTGCC



TGACTTACTCGAGTCCCTCGATAGGGACAAGGACAGGGAATTCCTGCTGAGTATATTTCCAAGGAT



CCCCAAAATTGACAAAAAAACTAAGGAAATCGTGGCCTGGGAGCTCCCAGGCGAGCCCGAAGAAG



GATACCTGTTCACTGCCAATAATCTTTTTCGCAACTTTCTGGAGAATCCTAAACATGTTCCACGTTTC



ATGGCAGAAAGGATCCCGGAAGATTGGACGCGCCTGCGGTCCGCTCCCGTATGGTTTGACGGCATG



GTGAAACAATGGCAGAAAGTGGTAAACCAGCTGGTGGAGTCACCTGGAGCATTGTATCAGTTCAAT



GAAAGCTTTCTCCGACAACGTTTACAGGCAATGCTGACAGTGTATAAGAGAGACCTGCAGACAGAG



AAATTCCTTAAGTTGTTGGCTGATGTCTGCAGGCCTCTGGTGGACTTCTTTGGGCTGGGGGGAAACG



ATATCATCTTCAAAAGCTGCCAGGACCCGAGGAAACAATGGCAAACTGTCATTCCCTTGAGTGTCC



CCGCTGATGTGTACACCGCGTGTGAGGGGCTGGCAATCCGGCTTCGTGAGACATTGGGATTTGAGT



GGAAGAACCTTAAGGGCCATGAAAGGGAGGACTTTCTAAGACTGCACCAGCTTTTAGGGAATCTGC



TTTTCTGGATTCGAGATGCCAAACTGGTGGTGAAATTGGAAGATTGGATGAATAATCCCTGTGTTCA



GGAGTACGTTGAGGCTCGTAAGGCCATTGATCTCCCACTGGAGATCTTCGGCTTTGAGGTCCCCATC



TTCCTGAACGGATATCTGTTTAGTGAACTGAGGCAGTTAGAACTGCTGCTCCGCCGTAAGTCGGTTA



TGACCAGCTATTCGGTTAAGACAACTGGCAGTCCAAACAGGCTTTTCCAGTTAGTCTACCTGCCATT



AAATCCTTCCGACCCTGAGAAAAAAAATTCTAATAACTTTCAGGAACGCCTGGACACCCCCACTGG



CTTATCACGTCGCTTCCTGGACCTTACTCTGGACGCCTTCGCCGGCAAGTTGCTGACAGACCCCGTG



ACTCAAGAGCTTAAAACTATGGCTGGGTTCTACGATCACCTGTTTGGTTTCAAGCTCCCATGTAAGC



TGGCAGCCATGTCTAACCACCCTGGCTCTAGCAGCAAGATGGTCGTGTTGGCCAAACCTAAAAAAG



GGGTTGCATCTAATATAGGATTCGAACCAATCCCTGATCCCGCGCACCCCGTATTCCGGGTGAGATC



ATCATGGCCAGAGCTGAAGTATCTGGAGGGGTTACTGTATCTTCCAGAAGACACTCCACTGACAAT



AGAGCTCGCAGAGACAAGTGTTAGTTGTCAGAGCGTCAGTAGCGTGGCATTCGATCTGAAAAATCT



GACTACTATCCTTGGACGCGTGGGTGAGTTCCGTGTGACCGCAGACCAGCCTTTTAAGTTGACCCCC



ATCATCCCTGAGAAGGAGGAGTCCTTCATAGGAAAAACATATCTAGGCCTTGATGCCGGGGAACGC



TCAGGCGTAGGGTTCGCTATCGTCACAGTCGACGGGGATGGGTACGAGGTACAGCGCCTGGGGGTG



CATGAAGATACACAGCTGATGGCCCTACAGCAGGTGGCCTCTAAAAGCTTGAAGGAGCCGGTGTTC



CAGCCGCTCAGAAAGGGTACTTTTCGGCAGCAGGAACGTATTAGAAAATCTCTCAGAGGATGTTAT



TGGAACTTCTATCACGCTCTGATGATTAAGTACCGCGCCAAGGTAGTGCACGAAGAGAGCGTGGGC



AGTTCCGGCCTGGTTGGGCAGTGGTTACGAGCATTCCAGAAGGACCTCAAGAAAGCCGATGTGTTG



CCAAAAAAGGGAGGCAAAAACGGAGTCGATAAGAAAAAGAGAGAGTCTTCTGCACAAGACACATT



GTGGGGAGGGGCTTTTAGCAAGAAGGAAGAACAGCAGATAGCTTTCGAAGTCCAAGCTGCTGGTTC



TAGCCAGTTCTGCCTGAAGTGCGGATGGTGGTTCCAACTCGGAATGCGTGAGGTTAATCGCGTGCA



GGAATCCGGCGTCGTGCTGGATTGGAATCGGAGTATTGTCACATTCCTGATTGAGAGCTCTGGCGA



GAAAGTGTATGGGTTCTCCCCTCAGCAACTCGAAAAGGGGTTCAGACCAGACATTGAAACCTTCAA



GAAGATGGTTCGGGATTTCATGCGCCCGCCTATGTTTGACCGGAAGGGTCGCCCAGCAGCTGCCTA



CGAAAGGTTTGTCTTGGGACGCCGGCATCGGCGGTATAGATTCGACAAGGTTTTTGAAGAACGATT



CGGACGATCCGCGCTATTCATTTGCCCGAGGGTTGGCTGTGGCAACTTTGACCACAGCAGCGAGCA



GTCAGCCGTAGTGCTGGCTCTAATCGGATATATTGCCGACAAAGAGGGGATGAGCGGAAAAAAGCT



AGTCTACGTGCGTCTGGCAGAACTAATGGCGGAATGGAAATTGAAGAAACTGGAGAGGAGTAGAG



TTGAGGAGCAAAGCTCCGCTCAGTGA





SEQ ID NO: 164



ATGGCGGAGTCGAAGCAAATGCAGTGCAGGAAGTGTGGAGCCTCTATGAAGTACGAAGTGATCGG



CCTCGGGAAGAAAAGCTGCAGATATATGTGTCCCGACTGCGGGAATCACACATCTGCAAGAAAGAT



TCAGAATAAGAAGAAAAGGGACAAGAAGTATGGATCTGCCAGTAAAGCACAAAGCCAACGAATCG



CAGTTGCAGGGGCCTTATACCCGGATAAAAAGGTTCAGACCATCAAGACTTATAAGTATCCAGCCG



ACCTGAATGGTGAGGTCCATGACTCAGGGGTGGCCGAAAAAATAGCCCAAGCAATCCAGGAGGAT



GAAATAGGGCTCCTCGGCCCCTCTTCCGAGTACGCCTGTTGGATCGCTAGCCAGAAACAGAGCGAG



CCCTACAGTGTTGTAGACTTTTGGTTTGACGCTGTGTGCGCCGGAGGCGTGTTCGCCTATTCTGGGG



CTAGATTGCTGTCTACCGTCCTGCAGCTATCTGGGGAGGAGAGCGTCCTACGCGCAGCCCTGGCATC



CTCCCCTTTTGTCGACGATATCAATCTGGCACAGGCCGAAAAATTTCTGGCGGTGTCCAGGCGAACC



GGCCAAGATAAGCTGGGGAAGCGCATTGGAGAGTGCTTCGCAGAGGGCCGACTTGAGGCCCTAGG



CATCAAGGACCGGATGCGTGAATTTGTCCAGGCTATCGATGTCGCTCAGACCGCTGGGCAGCGTTTT



GCCGCGAAACTGAAAATCTTTGGGATTTCTCAGATGCCCGAGGCAAAGCAGTGGAACAATGACAGC



GGACTCACCGTGTGCATCCTGCCCGACTATTACGTCCCAGAAGAAAATCGCGCAGATCAGTTGGTC



GTCCTGCTAAGACGACTGAGAGAGATAGCATACTGTATGGGGATCGAAGATGAGGCCGGTTTTGAA



CATCTTGGAATTGATCCTGGCGCACTATCAAATTTTTCCAATGGCAATCCTAAACGCGGATTTTTGG



GCCGCCTGCTGAACAATGATATTATTGCCTTAGCGAACAACATGTCCGCCATGACGCCTTACTGGGA



GGGCAGGAAGGGAGAACTGATTGAAAGATTGGCTTGGCTGAAGCACCGTGCAGAGGGGCTTTATCT



GAAGGAACCGCATTTTGGAAATAGTTGGGCCGACCATAGGTCTAGAATTTTTTCCAGAATAGCCGG



GTGGCTTTCTGGGTGCGCTGGGAAGCTAAAGATCGCCAAAGACCAGATCAGCGGAGTGCGTACTGA



TCTGTTCCTTCTGAAGAGACTGCTGGATGCGGTCCCGCAGTCCGCCCCTTCTCCCGACTTCATAGCCT



CTATCTCTGCCTTGGATCGCTTCCTGGAGGCCGCAGAATCTAGTCAGGATCCTGCCGAACAGGTGAG



GGCCCTATACGCCTTTCATCTGAACGCACCCGCGGTGCGAAGCATCGCCAACAAGGCAGTCCAGCG



ATCCGACAGCCAAGAATGGCTTATAAAGGAACTGGACGCTGTGGACCACCTGGAGTTTAACAAGGC



CTTTCCCTTCTTCTCTGATACGGGAAAGAAGAAAAAGAAAGGGGCTAACTCGAATGGCGCTCCGTC



CGAGGAGGAGTACACCGAGACTGAGAGCATCCAGCAGCCCGAGGACGCTGAGCAAGAGGTTAATG



GTCAGGAAGGCAACGGGGCCTCGAAGAACCAGAAGAAGTTTCAGAGAATCCCCCGATTCTTCGGCG



AGGGGAGTCGCAGCGAGTATCGCATCCTCACTGAAGCCCCGCAGTACTTCGACATGTTCTGTAACA



ACATGCGGGCCATCTTTATGCAATTAGAATCCCAACCGCGTAAAGCTCCCAGGGATTTTAAGTGTTT



CCTGCAGAATCGGCTGCAGAAATTGTATAAGCAGACATTCCTGAACGCTCGATCCAACAAGTGCCG



GGCATTACTAGAGTCCGTATTGATTAGTTGGGGAGAGTTTTACACCTACGGGGCTAACGAGAAAAA



ATTTCGACTGCGTCATGAAGCTTCTGAGCGCTCCTCGGACCCAGATTACGTGGTGCAACAGGCGCTG



GAGATCGCTCGGAGGCTGTTTCTCTTCGGCTTTGAGTGGAGGGACTGTAGCGCAGGTGAAAGAGTG



GATCTGGTCGAAATACATAAGAAAGCCATATCTTTCCTGTTGGCCATCACTCAGGCTGAGGTGTCTG



TGGGCAGCTATAACTGGCTGGGCAATTCTACCGTGAGTCGGTACCTGTCCGTGGCAGGGACTGATA



CCCTTTACGGCACCCAGCTGGAAGAATTCTTAAATGCAACCGTGTTATCTCAGATGCGGGGGCTGGC



TATCAGGTTATCATCTCAGGAACTGAAGGATGGATTTGACGTACAGCTGGAGTCTAGTTGCCAGGA



TAATCTGCAACACTTGCTCGTGTACAGGGCTTCACGAGACCTTGCCGCCTGCAAGCGCGCTACTTGT



CCAGCTGAGTTGGATCCTAAGATTCTGGTACTGCCCGTGGGGGCCTTTATCGCTAGCGTGATGAAAA



TGATTGAAAGAGGGGATGAGCCTTTAGCTGGAGCTTATCTGAGACACAGACCCCATAGTTTCGGGT



GGCAGATCCGCGTTCGAGGTGTGGCAGAGGTGGGAATGGACCAAGGGACCGCCCTGGCGTTCCAG



AAACCGACCGAGAGCGAACCCTTCAAGATAAAGCCGTTTTCCGCTCAATACGGCCCCGTTCTATGG



CTGAACAGCTCCAGTTATAGCCAGAGCCAGTACCTGGACGGGTTCCTATCACAGCCCAAGAACTGG



AGTATGCGGGTGCTGCCACAGGCCGGCTCAGTGCGGGTAGAACAGCGCGTCGCCTTGATTTGGAAT



CTCCAGGCCGGAAAGATGAGGCTGGAACGGAGCGGAGCGCGGGCTTTCTTCATGCCCGTCCCATTC



AGTTTCCGCCCCAGTGGCAGCGGCGACGAGGCAGTCCTGGCTCCAAATAGGTACCTGGGACTCTTT



CCACACAGCGGCGGCATAGAGTACGCTGTGGTCGATGTTCTTGACTCTGCCGGCTTCAAAATACTCG



AGAGAGGAACAATAGCCGTCAATGGCTTCTCCCAGAAACGAGGAGAAAGACAAGAGGAAGCCCAT



CGCGAAAAACAAAGACGCGGTATCTCCGATATTGGGCGCAAGAAGCCAGTCCAGGCCGAAGTCGA



TGCGGCCAACGAGCTCCATCGAAAATACACCGATGTTGCTACTCGGCTGGGGTGTCGAATTGTCGTT



CAATGGGCACCCCAACCCAAACCAGGCACTGCGCCGACCGCTCAGACTGTGTACGCTAGGGCCGTG



AGGACTGAAGCACCAAGATCCGGCAATCAGGAAGATCACGCCAGGATGAAATCTTCCTGGGGATA



CACATGGGGTACGTATTGGGAAAAAAGGAAGCCCGAGGACATCCTCGGCATTAGTACCCAGGTGTA



TTGGACAGGCGGGATCGGCGAGTCCTGCCCGGCTGTCGCCGTCGCGCTATTGGGACACATCAGGGC



CACCTCAACCCAGACTGAATGGGAGAAAGAGGAAGTCGTGTTTGGGCGATTGAAAAAGTTCTTCCC



ATCCTGA





SEQ ID NO: 165



ATGGAGAAGCGCATCAATAAAATTCGCAAGAAGCTGTCTGCCGATAACGCCACAAAACCAGTTAGT



CGAAGCGGCCCAATGAAGACCCTGCTAGTTCGAGTGATGACTGATGATCTGAAGAAAAGGCTCGAA



AAGCGACGCAAGAAGCCTGAGGTAATGCCTCAGGTTATAAGTAACAATGCAGCAAACAATCTGCG



GATGCTGCTTGACGATTACACAAAGATGAAGGAAGCCATTCTCCAGGTGTATTGGCAGGAGTTCAA



GGATGATCACGTAGGCCTGATGTGTAAATTCGCGCAACCTGCAAGCAAGAAGATCGACCAAAACAA



GCTGAAACCCGAGATGGATGAAAAAGGCAATTTAACAACCGCCGGATTCGCTTGTTCCCAGTGTGG



GCAGCCACTGTTCGTGTACAAGTTAGAACAGGTGTCGGAAAAAGGAAAGGCATACACTAACTACTT



TGGACGGTGCAATGTTGCAGAACACGAAAAGCTGATACTGCTTGCCCAGCTTAAGCCCGAAAAAGA



CAGCGACGAAGCGGTGACCTACAGCCTGGGAAAATTCGGGCAGCGGGCACTGGACTTCTATTCTAT



CCACGTTACCAAGGAGAGCACCCACCCAGTGAAGCCGTTGGCCCAAATCGCTGGAAACCGGTACGC



CAGCGGACCAGTCGGCAAGGCCCTGTCCGATGCCTGTATGGGCACAATTGCTTCTTTCCTGTCCAAG



TACCAGGACATCATAATCGAGCACCAAAAAGTTGTGAAAGGGAATCAGAAACGCCTGGAATCCCTT



CGAGAACTGGCCGGCAAGGAGAACCTTGAGTACCCGTCCGTGACCCTGCCTCCACAGCCACATACC



AAAGAGGGCGTAGACGCGTATAATGAGGTCATTGCCCGCGTTCGCATGTGGGTTAATTTAAACCTG



TGGCAGAAATTAAAACTAAGCCGAGATGATGCTAAACCGTTACTGAGATTGAAGGGATTCCCTAGC



TTTCCTGTGGTGGAGAGAAGGGAAAACGAGGTTGATTGGTGGAATACTATTAATGAGGTGAAAAAG



CTTATTGACGCCAAGAGGGATATGGGCAGGGTGTTCTGGAGCGGGGTGACTGCCGAAAAGAGAAA



TACCATCCTCGAGGGATACAATTACCTCCCCAACGAGAATGATCATAAGAAAAGAGAGGGGAGCTT



AGAGAATCCAAAGAAACCTGCAAAGAGGCAATTCGGTGATCTCCTGCTCTACCTCGAGAAGAAATA



CGCGGGGGACTGGGGAAAAGTTTTTGACGAAGCCTGGGAGCGCATTGACAAGAAGATCGCCGGGC



TGACGTCTCACATTGAACGGGAAGAGGCACGGAATGCAGAGGACGCCCAGTCTAAGGCCGTGCTG



ACTGACTGGCTGCGCGCAAAGGCCTCCTTCGTGCTCGAACGTCTGAAGGAAATGGATGAGAAAGAG



TTTTACGCGTGTGAAATACAGCTGCAGAAGTGGTACGGCGATCTAAGGGGAAATCCCTTCGCAGTG



GAAGCCGAGAATAGGGTAGTTGACATCAGTGGGTTCTCCATCGGCAGTGATGGACATTCTATCCAG



TATAGAAACCTGCTCGCCTGGAAGTACTTAGAGAACGGCAAGAGAGAGTTCTATCTGCTGATGAAC



TACGGGAAAAAAGGTAGAATTCGCTTTACAGATGGCACCGACATAAAGAAGTCCGGAAAGTGGCA



AGGCCTCTTATACGGAGGCGGCAAAGCAAAGGTGATAGACTTGACTTTTGACCCTGACGACGAACA



GCTGATAATCTTGCCGCTGGCCTTTGGCACAAGACAAGGTAGGGAATTTATCTGGAATGATCTTCTT



TCTCTCGAGACCGGACTCATCAAGCTCGCAAACGGAAGGGTCATCGAGAAGACAATCTACAATAAA



AAGATAGGCCGAGACGAGCCAGCCCTGTTTGTGGCTTTGACATTTGAGCGGAGAGAGGTCGTAGAT



CCCAGCAACATCAAACCCGTGAACCTGATCGGTGTTGACAGGGGCGAGAACATCCCGGCGGTTATC



GCACTGACGGATCCAGAAGGATGTCCTCTGCCCGAGTTCAAAGATTCATCGGGAGGGCCAACCGAC



ATTTTGAGGATAGGGGAGGGGTACAAGGAGAAGCAGCGAGCTATCCAGGCGGCCAAAGAAGTGGA



GCAACGAAGAGCTGGTGGTTATTCTCGCAAGTTCGCTTCCAAAAGTCGTAACCTGGCTGACGATAT



GGTGCGCAATTCTGCCCGTGACCTTTTCTACCACGCCGTTACACACGACGCCGTGTTAGTGTTTGAA



AATCTTAGTCGAGGCTTCGGGCGACAGGGGAAGCGGACCTTTATGACCGAGAGACAGTATACAAAA



ATGGAGGATTGGCTGACCGCCAAACTGGCGTATGAAGGACTCACATCCAAGACCTATCTCTCAAAA



ACTTTGGCCCAGTATACATCTAAGACGTGCAGTAACTGTGGCTTCACCATTACCACAGCTGACTACG



ATGGCATGCTGGTCCGCTTAAAAAAGACATCTGACGGCTGGGCTACTACCCTCAACAATAAAGAGC



TCAAAGCCGAAGGACAAATTACCTATTATAACAGGTATAAAAGACAGACTGTCGAGAAGGAGTTG



AGCGCGGAGCTGGACCGCCTATCAGAGGAGTCAGGGAACAACGATATCTCTAAGTGGACTAAGGG



ACGCCGAGACGAGGCGTTGTTCTTGCTGAAAAAGCGGTTCTCTCATCGACCCGTGCAGGAGCAGTT



CGTGTGTCTGGACTGCGGCCACGAGGTTCATGCTGATGAGCAAGCTGCTCTAAATATTGCCCGTAGT



TGGTTGTTCCTGAACAGCAATTCAACAGAGTTCAAGTCATACAAGAGCGGAAAGCAGCCGTTTGTG



GGCGCATGGCAGGCATTTTACAAAAGACGCCTGAAGGAAGTGTGGAAGCCAAACGCC





SEQ ID NO: 166



ATGAAAAGGATTAACAAAATCCGAAGGCGGCTTGTAAAGGATTCTAACACCAAAAAGGCTGGCAA



GACGGGGCCCATGAAAACATTACTCGTTAGAGTTATGACCCCCGACCTCAGAGAGCGACTGGAAAA



TTTACGCAAGAAGCCAGAGAACATACCTCAGCCAATTAGTAATACCTCTCGGGCAAACCTAAACAA



GTTGCTTACTGATTACACGGAGATGAAAAAGGCCATACTGCATGTGTACTGGGAGGAGTTTCAAAA



GGACCCTGTCGGGCTAATGAGCAGGGTGGCTCAGCCTGCACCTAAAAACATCGACCAGCGGAAACT



CATCCCAGTTAAGGACGGAAATGAGAGATTGACAAGTTCAGGTTTCGCCTGCTCACAGTGCTGTCA



ACCGCTGTACGTTTATAAGTTAGAACAAGTGAATGACAAAGGAAAGCCTCACACAAATTATTTTGG



CCGGTGTAATGTCTCTGAGCATGAGCGTCTGATTCTGTTGTCCCCGCATAAACCGGAAGCTAATGAC



GAGCTCGTAACCTACAGCTTGGGGAAGTTTGGCCAAAGAGCATTGGACTTCTATTCAATCCATGTGA



CCCGCGAATCCAATCATCCCGTCAAGCCCTTGGAGCAGATAGGGGGCAATAGTTGCGCTTCTGGCC



CTGTGGGCAAAGCCCTGTCCGACGCCTGTATGGGAGCCGTGGCTTCATTCCTGACCAAATATCAGG



ATATCATCTTGGAGCACCAGAAAGTGATCAAGAAAAATGAAAAAAGGTTAGCAAACCTCAAGGAT



ATTGCAAGCGCTAACGGCTTGGCTTTTCCTAAAATCACACTTCCACCTCAGCCTCACACAAAGGAAG



GCATCGAGGCATACAACAATGTGGTGGCCCAGATCGTCATCTGGGTTAACTTAAACCTGTGGCAGA



AACTTAAAATTGGCAGGGATGAGGCAAAACCCTTACAGCGCCTGAAAGGATTCCCCAGCTTTCCAC



TGGTGGAGCGCCAGGCTAACGAAGTGGACTGGTGGGATATGGTGTGTAACGTCAAGAAGCTCATCA



ATGAAAAGAAAGAGGACGGTAAAGTCTTCTGGCAGAACCTCGCCGGTTACAAACGGCAGGAGGCG



CTGTTACCTTATCTGTCGAGTGAAGAGGACCGGAAAAAAGGCAAGAAATTTGCTCGTTATCAGTTT



GGTGATTTGCTCCTACATTTGGAGAAGAAGCACGGCGAGGACTGGGGAAAAGTATACGATGAGGCC



TGGGAGAGGATTGACAAAAAGGTGGAGGGACTGTCAAAGCACATCAAGCTCGAAGAAGAGCGCAG



AAGCGAGGACGCCCAATCCAAAGCAGCGCTGACTGACTGGCTGCGGGCGAAGGCCAGTTTTGTAAT



CGAAGGCCTTAAAGAAGCCGACAAGGATGAATTCTGCAGATGCGAATTAAAACTCCAGAAGTGGT



ACGGCGATCTCCGAGGTAAGCCTTTCGCAATCGAGGCCGAGAATTCCATACTGGACATTAGTGGAT



TCAGTAAACAGTATAATTGTGCCTTTATATGGCAGAAGGATGGTGTCAAGAAACTCAACCTGTACCT



TATTATTAATTATTTCAAAGGCGGGAAACTGAGATTTAAGAAGATAAAGCCTGAAGCCTTTGAGGC



GAACCGATTCTACACAGTTATTAACAAGAAATCTGGTGAAATTGTACCCATGGAGGTAAACTTCAA



CTTCGATGATCCCAATCTGATTATATTGCCACTAGCTTTTGGCAAGCGGCAGGGTAGGGAATTCATT



TGGAACGATTTGCTTTCACTGGAAACAGGGTCCCTTAAGCTGGCAAACGGGAGAGTGATTGAAAAG



ACATTGTACAATCGGAGGACACGTCAGGATGAACCTGCCCTTTTCGTGGCTCTGACATTCGAGCGCA



GGGAGGTTCTGGACTCTAGCAATATCAAGCCAATGAACCTGATCGGCATAGACCGAGGAGAGAATA



TTCCGGCTGTGATCGCACTCACCGATCCCGAAGGATGTCCCCTTTCTCGGTTCAAGGACTCCTTAGG



CAATCCAACTCATATCCTGAGAATCGGCGAGTCATACAAGGAGAAGCAGCGAACAATTCAGGCCGC



CAAGGAAGTCGAGCAGAGGCGAGCTGGCGGCTACAGCCGTAAATACGCTAGTAAAGCTAAGAACC



TGGCCGACGATATGGTGCGCAATACTGCTAGAGACCTGCTGTACTATGCAGTGACGCAGGACGCAA



TGCTGATATTCGAGAATCTGTCCAGAGGATTCGGAAGGCAGGGCAAGCGGACGTTCATGGCCGAGC



GCCAGTATACAAGGATGGAGGATTGGTTAACGGCCAAGCTTGCCTATGAGGGGCTACCTAGTAAGA



CCTATCTGTCTAAGACGCTGGCTCAATACACCAGTAAGACCTGCTCAAACTGTGGCTTTACAATCAC



TTCTGCTGATTATGATAGAGTGCTCGAGAAGCTAAAAAAAACTGCCACCGGCTGGATGACTACTAT



TAATGGGAAGGAACTGAAAGTGGAAGGACAGATTACCTATTATAATCGCTACAAGCGTCAAAACGT



CGTCAAGGACCTGTCGGTGGAATTGGACAGACTCAGTGAAGAGTCCGTGAACAATGATATCAGCTC



CTGGACAAAAGGGCGCAGTGGGGAGGCACTCAGCTTGCTTAAAAAGAGGTTTTCACATCGGCCGGT



CCAGGAGAAATTTGTCTGCCTGAACTGCGGATTCGAGACACACGCCGACGAGCAGGCAGCACTGAA



CATTGCCAGATCCTGGCTGTTCCTTAGGTCCCAGGAATATAAGAAGTACCAGACTAACAAAACCAC



GGGAAACACAGATAAAAGGGCCTTTGTCGAAACTTGGCAATCCTTTTACCGGAAGAAGTTAAAGGA



AGTGTGGAAGCCC





SEQ ID NO: 167



ATGGATAAGAAATACTCAATAGGCTTAGCAATCGGCACAAATAGCGTCGGATGGGCGGTGATCACT



GATGAATATAAGGTTCCGTCTAAAAAGTTCAAGGTTCTGGGAAATACAGACCGCCACAGTATCAAA



AAAAATCTTATAGGGGCTCTTTTATTTGACAGTGGAGAGACAGCGGAAGCGACTCGTCTCAAACGG



ACAGCTCGTAGAAGGTATACACGTCGGAAGAATCGTATTTGTTATCTACAGGAGATTTTTTCAAATG



AGATGGCGAAAGTAGATGATAGTTTCTTTCATCGACTTGAAGAGTCTTTTTTGGTGGAAGAAGACA



AGAAGCATGAACGTCATCCTATTTTTGGAAATATAGTAGATGAAGTTGCTTATCATGAGAAATATCC



AACTATCTATCATCTGCGAAAAAAATTGGTAGATTCTACTGATAAAGCGGATTTGCGCTTAATCTAT



TTGGCCTTAGCGCATATGATTAAGTTTCGTGGTCATTTTTTGATTGAGGGAGATTTAAATCCTGATA



ATAGTGATGTGGACAAACTATTTATCCAGTTGGTACAAACCTACAATCAATTATTTGAAGAAAACCC



TATTAACGCAAGTGGAGTAGATGCTAAAGCGATTCTTTCTGCACGATTGAGTAAATCAAGACGATT



AGAAAATCTCATTGCTCAGCTCCCCGGTGAGAAGAAAAATGGCTTATTTGGGAATCTCATTGCTTTG



TCATTGGGTTTGACCCCTAATTTTAAATCAAATTTTGATTTGGCAGAAGATGCTAAATTACAGCTTTC



AAAAGATACTTACGATGATGATTTAGATAATTTATTGGCGCAAATTGGAGATCAATATGCTGATTTG



TTTTTGGCAGCTAAGAATTTATCAGATGCTATTTTACTTTCAGATATCCTAAGAGTAAATACTGAAA



TAACTAAGGCTCCCCTATCAGCTTCAATGATTAAACGCTACGATGAACATCATCAAGACTTGACTCT



TTTAAAAGCTTTAGTTCGACAACAACTTCCAGAAAAGTATAAAGAAATCTTTTTTGATCAATCAAAA



AACGGATATGCAGGTTATATTGATGGGGGAGCTAGCCAAGAAGAATTTTATAAATTTATCAAACCA



ATTTTAGAAAAAATGGATGGTACTGAGGAATTATTGGTGAAACTAAATCGTGAAGATTTGCTGCGC



AAGCAACGGACCTTTGACAACGGCTCTATTCCCCATCAAATTCACTTGGGTGAGCTGCATGCTATTT



TGAGAAGACAAGAAGACTTTTATCCATTTTTAAAAGACAATCGTGAGAAGATTGAAAAAATCTTGA



CTTTTCGAATTCCTTATTATGTTGGTCCATTGGCGCGTGGCAATAGTCGTTTTGCATGGATGACTCGG



AAGTCTGAAGAAACAATTACCCCATGGAATTTTGAAGAAGTTGTCGATAAAGGTGCTTCAGCTCAA



TCATTTATTGAACGCATGACAAACTTTGATAAAAATCTTCCAAATGAAAAAGTACTACCAAAACAT



AGTTTGCTTTATGAGTATTTTACGGTTTATAACGAATTGACAAAGGTCAAATATGTTACTGAAGGAA



TGCGAAAACCAGCATTTCTTTCAGGTGAACAGAAGAAAGCCATTGTTGATTTACTCTTCAAAACAA



ATCGAAAAGTAACCGTTAAGCAATTAAAAGAAGATTATTTCAAAAAAATAGAATGTTTTGATAGTG



TTGAAATTTCAGGAGTTGAAGATAGATTTAATGCTTCATTAGGTACCTACCATGATTTGCTAAAAAT



TATTAAAGATAAAGATTTTTTGGATAATGAAGAAAATGAAGATATCTTAGAGGATATTGTTTTAACA



TTGACCTTATTTGAAGATAGGGAGATGATTGAGGAAAGACTTAAAACATATGCTCACCTCTTTGATG



ATAAGGTGATGAAACAGCTTAAACGTCGCCGTTATACTGGTTGGGGACGTTTGTCTCGAAAATTGAT



TAATGGTATTAGGGATAAGCAATCTGGCAAAACAATATTAGATTTTTTGAAATCAGATGGTTTTGCC



AATCGCAATTTTATGCAGCTGATCCATGATGATAGTTTGACATTTAAAGAAGACATTCAAAAAGCA



CAAGTGTCTGGACAAGGCGATAGTTTACATGAACATATTGCAAATTTAGCTGGTAGCCCTGCTATTA



AAAAAGGTATTTTACAGACTGTAAAAGTTGTTGATGAATTGGTCAAAGTAATGGGGCGGCATAAGC



CAGAAAATATCGTTATTGAAATGGCACGTGAAAATCAGACAACTCAAAAGGGCCAGAAAAATTCG



CGAGAGCGTATGAAACGAATCGAAGAAGGTATCAAAGAATTAGGAAGTCAGATTCTTAAAGAGCA



TCCTGTTGAAAATACTCAATTGCAAAATGAAAAGCTCTATCTCTATTATCTCCAAAATGGAAGAGAC



ATGTATGTGGACCAAGAATTAGATATTAATCGTTTAAGTGATTATGATGTCGATCACATTGTTCCAC



AAAGTTTCCTTAAAGACGATTCAATAGACAATAAGGTCTTAACGCGTTCTGATAAAAATCGTGGTA



AATCGGATAACGTTCCAAGTGAAGAAGTAGTCAAAAAGATGAAAAACTATTGGAGACAACTTCTAA



ACGCCAAGTTAATCACTCAACGTAAGTTTGATAATTTAACGAAAGCTGAACGTGGAGGTTTGAGTG



AACTTGATAAAGCTGGTTTTATCAAACGCCAATTGGTTGAAACTCGCCAAATCACTAAGCATGTGGC



ACAAATTTTGGATAGTCGCATGAATACTAAATACGATGAAAATGATAAACTTATTCGAGAGGTTAA



AGTGATTACCTTAAAATCTAAATTAGTTTCTGACTTCCGAAAAGATTTCCAATTCTATAAAGTACGT



GAGATTAACAATTACCATCATGCCCATGATGCGTATCTAAATGCCGTCGTTGGAACTGCTTTGATTA



AGAAATATCCAAAACTTGAATCGGAGTTTGTCTATGGTGATTATAAAGTTTATGATGTTCGTAAAAT



GATTGCTAAGTCTGAGCAAGAAATAGGCAAAGCAACCGCAAAATATTTCTTTTACTCTAATATCATG



AACTTCTTCAAAACAGAAATTACACTTGCAAATGGAGAGATTCGCAAACGCCCTCTAATCGAAACT



AATGGGGAAACTGGAGAAATTGTCTGGGATAAAGGGCGAGATTTTGCCACAGTGCGCAAAGTATTG



TCCATGCCCCAAGTCAATATTGTCAAGAAAACAGAAGTACAGACAGGCGGATTCTCCAAGGAGTCA



ATTTTACCAAAAAGAAATTCGGACAAGCTTATTGCTCGTAAAAAAGACTGGGATCCAAAAAAATAT



GGTGGTTTTGATAGTCCAACGGTAGCTTATTCAGTCCTAGTGGTTGCTAAGGTGGAAAAAGGGAAA



TCGAAGAAGTTAAAATCCGTTAAAGAGTTACTAGGGATCACAATTATGGAAAGAAGTTCCTTTGAA



AAAAATCCGATTGACTTTTTAGAAGCTAAAGGATATAAGGAAGTTAAAAAAGACTTAATCATTAAA



CTACCTAAATATAGTCTTTTTGAGTTAGAAAACGGTCGTAAACGGATGCTGGCTAGTGCCGGAGAA



TTACAAAAAGGAAATGAGCTGGCTCTGCCAAGCAAATATGTGAATTTTTTATATTTAGCTAGTCATT



ATGAAAAGTTGAAGGGTAGTCCAGAAGATAACGAACAAAAACAATTGTTTGTGGAGCAGCATAAG



CATTATTTAGATGAGATTATTGAGCAAATCAGTGAATTTTCTAAGCGTGTTATTTTAGCAGATGCCA



ATTTAGATAAAGTTCTTAGTGCATATAACAAACATAGAGACAAACCAATACGTGAACAAGCAGAAA



ATATTATTCATTTATTTACGTTGACGAATCTTGGAGCTCCCGCTGCTTTTAAATATTTTGATACAACA



ATTGATCGTAAACGATATACGTCTACAAAAGAAGTTTTAGATGCCACTCTTATCCATCAATCCATCA



CTGGTCTTTATGAAACACGCATTGATTTGAGTCAGCTAGGAGGTGACTGA





SEQ ID NO: 168
ATGGATAAGAAGTATTCAATTGGACTTGCGATTGGCACTAACAGTGTGGGCTGGGCGGTGATTACA



GACGAGTATAAGGTGCCGTCAAAAAAGTTTAAAGTTCTGGGCAACACTGATCGCCATTCCATCAAG



AAAAACCTAATCGGGGCCCTTCTTTTTGATAGTGGCGAAACGGCCGAGGCGACGCGTCTAAAACGT



ACCGCGCGGCGTCGCTACACCCGACGAAAAAACCGTATTTGTTACCTTCAGGAGATCTTCAGTAAC



GAAATGGCTAAGGTGGACGATTCATTCTTCCACCGTCTGGAGGAGTCCTTTTTAGTTGAAGAAGACA



AGAAGCATGAGCGACACCCAATTTTTGGTAACATTGTCGACGAAGTCGCCTATCACGAAAAATATC



CGACCATTTATCACCTGCGCAAAAAACTGGTCGATAGCACGGATAAAGCGGATCTGCGGCTTATTT



ACCTGGCGCTTGCCCACATGATCAAGTTCCGCGGCCACTTCCTGATAGAAGGAGACCTGAACCCGG



ATAATAGCGATGTAGACAAACTGTTTATTCAGCTGGTCCAGACCTACAACCAGCTGTTTGAAGAAA



ATCCGATTAATGCGTCAGGCGTGGATGCGAAAGCGATACTGAGTGCCCGCCTGTCGAAATCTCGCC



GTCTCGAAAATCTGATTGCACAGCTGCCCGGCGAAAAAAAAAACGGTCTTTTTGGCAATCTGATCG



CGCTGTCACTGGGCCTGACACCAAATTTTAAGAGCAACTTCGACCTGGCAGAGGATGCGAAGCTTC



AACTGTCGAAGGACACCTATGACGATGATCTGGATAATCTTCTGGCACAAATCGGTGATCAGTATG



CGGATTTATTCCTTGCAGCGAAAAACCTATCTGACGCAATTCTGTTGAGCGATATCCTCCGCGTCAA



CACCGAAATCACTAAAGCCCCCCTGTCAGCGTCGATGATTAAACGTTATGATGAGCACCATCAGGA



TCTGACCTTGCTAAAGGCGCTGGTGCGACAGCAGCTTCCCGAAAAATATAAAGAGATCTTTTTTGAT



CAATCGAAGAATGGTTATGCCGGATACATTGATGGCGGAGCCAGTCAGGAAGAATTTTACAAATTC



ATCAAACCGATCCTGGAAAAAATGGATGGCACAGAAGAACTGCTTGTGAAATTGAACCGGGAAGA



TTTACTGCGCAAACAGCGTACGTTCGACAACGGCTCCATACCCCATCAGATTCACTTAGGTGAGCTG



CATGCAATACTCCGTCGCCAGGAAGATTTTTATCCATTTTTAAAAGACAACCGTGAGAAGATTGAA



AAAATTTTAACTTTTCGTATTCCATATTACGTCGGGCCTTTGGCCCGAGGTAACTCTCGATTCGCCTG



GATGACGAGAAAAAGCGAGGAGACCATCACTCCGTGGAATTTTGAAGAGGTTGTTGATAAAGGCG



CGAGCGCCCAGTCGTTTATCGAACGTATGACCAACTTTGATAAAAATCTGCCGAATGAAAAAGTGC



TTCCGAAGCATTCTCTGTTGTATGAATATTTCACTGTGTACAATGAGTTAACGAAAGTGAAATATGT



GACCGAAGGCATGCGGAAACCTGCTTTTCTGTCCGGAGAACAGAAAAAAGCAATTGTGGACCTGCT



GTTCAAAACGAACCGGAAAGTAACTGTGAAGCAGCTGAAAGAGGACTACTTCAAAAAAATCGAAT



GCTTCGACTCAGTAGAGATCTCTGGTGTTGAAGATCGCTTCAACGCGAGTCTGGGAACGTACCATG



ATTTGTTGAAAATCATCAAAGATAAAGACTTTCTGGATAACGAAGAGAATGAGGACATTCTTGAAG



ATATTGTTTTGACACTGACTCTGTTTGAGGATCGCGAAATGATTGAAGAGCGCCTGAAAACGTATGC



CCATTTATTCGATGACAAAGTCATGAAGCAGCTGAAACGTCGCCGCTATACTGGGTGGGGCAGACT



TTCACGTAAATTGATCAATGGTATAAGAGACAAACAGAGCGGCAAAACTATCTTAGATTTCCTGAA



GAGTGATGGATTTGCCAACCGGAATTTTATGCAGCTTATACATGATGACTCGCTAACGTTTAAAGAA



GACATTCAGAAGGCGCAGGTCAGCGGCCAGGGTGATTCGCTGCATGAACACATTGCAAATCTTGCC



GGATCGCCAGCGATCAAAAAAGGCATCCTTCAGACAGTAAAAGTTGTGGATGAACTGGTGAAAGTA



ATGGGTCGTCACAAGCCAGAAAATATTGTGATCGAAATGGCCCGGGAAAATCAGACTACTCAAAAA



GGTCAGAAAAATTCTCGCGAGCGTATGAAACGTATTGAAGAAGGCATCAAAGAGCTAGGCAGCCA



GATATTAAAGGAACATCCGGTTGAGAACACTCAGCTGCAGAATGAAAAACTGTATCTGTATTATCT



TCAGAACGGCCGTGACATGTATGTTGATCAAGAACTGGATATCAATCGCTTGTCCGATTATGACGTG



GATCATATTGTTCCGCAAAGCTTTCTGAAAGACGATTCTATTGACAATAAAGTACTGACACGTTCGG



ACAAAAACCGTGGTAAAAGCGATAACGTACCGTCGGAAGAAGTTGTTAAGAAAATGAAAAATTAT



TGGCGCCAACTCCTGAATGCTAAATTGATTACCCAGCGGAAATTTGATAACTTAACCAAAGCCGAG



CGGGGTGGCTTAAGTGAACTGGATAAAGCGGGTTTTATTAAACGCCAACTGGTAGAAACCCGCCAG



ATAACGAAACATGTAGCTCAAATCCTCGATAGTCGCATGAATACGAAATATGACGAAAATGATAAA



TTGATCCGTGAAGTAAAAGTGATTACTCTTAAAAGCAAATTGGTATCTGATTTTCGGAAAGATTTCC



AATTCTATAAGGTGAGAGAAATTAACAATTACCATCATGCACATGATGCGTATTTAAATGCAGTTGT



TGGCACCGCCTTAATCAAAAAATATCCGAAATTAGAATCTGAGTTCGTGTATGGTGATTATAAAGTT



TATGATGTTCGAAAAATGATTGCTAAGTCTGAACAGGAAATCGGCAAAGCGACCGCAAAGTATTTT



TTTTATAGCAATATTATGAATTTTTTTAAAACTGAGATTACCCTGGCGAATGGCGAAATTCGCAAAC



GTCCTCTGATTGAAACCAATGGCGAAACCGGCGAGATAGTATGGGACAAGGGCCGTGATTTTGCGA



CCGTCCGGAAAGTCCTGTCAATGCCGCAGGTGAATATTGTCAAGAAAACAGAAGTTCAGACAGGCG



GTTTTAGTAAAGAGTCTATTCTGCCCAAACGTAATTCGGATAAATTGATTGCCCGCAAGAAAGATTG



GGATCCGAAGAAATATGGTGGATTCGATTCTCCGACGGTCGCCTATAGCGTTCTAGTCGTCGCCAAG



GTCGAAAAAGGTAAATCCAAAAAACTGAAATCTGTGAAAGAACTGTTAGGCATTACAATCATGGAA



CGTAGTAGTTTTGAAAAGAACCCGATCGACTTCCTCGAGGCGAAAGGCTACAAAGAAGTCAAGAAG



GATTTGATTATTAAACTCCCAAAATATTCATTATTTGAGTTAGAAAACGGTAGGAAGCGTATGCTGG



CGAGTGCTGGGGAATTACAGAAAGGGAATGAGTTAGCACTGCCGTCAAAATATGTGAACTTTCTGT



ATCTGGCCTCCCATTACGAGAAACTGAAAGGTAGCCCGGAAGATAATGAACAGAAACAACTATTTG



TCGAGCAACACAAACATTATCTGGATGAAATTATTGAACAGATTAGTGAATTCTCTAAACGTGTTAT



TTTAGCGGATGCCAACCTTGACAAGGTGCTGAGCGCATATAATAAACACCGTGATAAACCCATTCG



TGAACAGGCTGAAAATATCATACATCTGTTCACGTTAACCAACTTGGGAGCTCCTGCCGCTTTTAAA



TATTTCGATACCACAATTGACCGCAAACGTTATACGTCTACAAAAGAGGTGCTCGATGCGACCCTG



ATCCACCAGTCTATTACAGGCCTGTATGAAACTCGTATCGACCTGTCACAACTGGGCGGCGACTGA





SEQ ID NO: 169
ATGGACAAGAAATATTCAATCGGTTTAGCAATAGGAACTAACTCAGTAGGTTGGGCTGTAATTACA



GACGAATACAAGGTACCGTCCAAAAAGTTTAAGGTGTTGGGGAACACAGATAGACACTCTATAAAA



AAAAATTTAATAGGCGCTTTACTTTTCGATTCAGGCGAAACTGCAGAAGCGACACGTCTGAAGAGA



ACCGCTAGACGTAGATACACGAGGAGAAAGAACAGAATATGTTACCTACAAGAAATTTTTTCTAAT



GAGATGGCTAAGGTGGATGATTCGTTTTTTCATAGACTCGAAGAATCTTTCTTAGTTGAAGAAGATA



AAAAACACGAAAGGCATCCTATCTTTGGAAACATAGTTGATGAGGTGGCTTACCATGAAAAATATC



CCACTATATATCACCTTAGAAAAAAGTTGGTTGATTCAACCGACAAAGCGGATCTAAGGTTAATTTA



CCTCGCGTTGGCTCACATGATAAAATTTAGAGGACATTTCTTGATCGAAGGTGATTTAAATCCCGAT



AACTCTGATGTAGATAAACTGTTCATCCAGTTGGTTCAAACATATAATCAGTTGTTCGAAGAGAACC



CCATTAACGCATCAGGTGTTGATGCTAAAGCAATCTTATCAGCAAGGTTGAGCAAGAGCAGACGTC



TGGAAAACTTGATTGCCCAATTGCCAGGTGAAAAGAAGAACGGTCTTTTTGGAAATTTAATTGCACT



TTCACTTGGGTTGACACCGAATTTTAAAAGCAATTTCGACCTCGCTGAGGATGCTAAACTCCAGTTA



TCTAAGGATACATATGACGATGATTTGGATAATCTATTGGCCCAGATAGGTGATCAGTATGCAGATT



TGTTTTTGGCAGCTAAGAATTTATCAGATGCAATTCTACTGAGCGATATTTTAAGGGTGAATACAGA



AATAACTAAAGCACCTTTGTCTGCATCTATGATAAAAAGATACGATGAACACCATCAAGATCTCAC



ACTATTAAAAGCTTTAGTTAGACAACAATTACCAGAAAAATATAAAGAAATCTTTTTCGATCAGTCC



AAGAACGGATACGCCGGCTATATAGATGGCGGTGCCTCCCAAGAAGAATTTTACAAATTTATCAAA



CCCATTTTGGAAAAGATGGATGGTACTGAAGAATTATTGGTCAAATTAAACAGGGAAGATTTATTA



AGAAAACAAAGGACCTTTGATAATGGTTCTATTCCACACCAAATCCATCTAGGGGAATTACATGCG



ATTCTTAGAAGACAAGAAGATTTTTATCCATTCTTGAAAGATAACAGGGAAAAGATAGAGAAAATC



TTAACTTTTAGAATTCCCTACTACGTCGGGCCCTTAGCTAGGGGGAATTCTAGATTCGCCTGGATGA



CACGCAAATCAGAAGAAACAATTACGCCTTGGAATTTTGAAGAAGTTGTTGATAAAGGAGCCTCTG



CTCAATCTTTTATTGAACGAATGACCAATTTTGATAAGAATTTACCCAATGAAAAGGTCTTACCCAA



ACATTCACTCCTATACGAGTACTTTACTGTTTACAATGAGTTGACAAAAGTGAAGTATGTTACCGAG



GGTATGCGAAAACCTGCTTTCTTGAGTGGTGAACAAAAGAAGGCCATTGTTGACTTGTTATTCAAAA



CTAACAGAAAGGTCACTGTGAAGCAGCTTAAAGAAGATTATTTCAAAAAGATCGAATGTTTCGACT



CGGTAGAAATTAGTGGTGTGGAAGATAGATTTAATGCTTCTCTTGGAACATATCATGATCTACTAAA



GATCATCAAAGATAAAGATTTCTTGGACAATGAAGAAAATGAAGATATTCTTGAAGACATCGTGTT



GACACTTACATTGTTTGAGGACAGAGAAATGATTGAAGAAAGGCTGAAGACCTACGCCCATTTGTT



TGATGATAAAGTCATGAAACAGTTAAAGAGGAGAAGGTATACCGGATGGGGTAGGCTGTCTCGCA



AATTGATTAATGGTATTCGTGATAAACAATCGGGTAAAACAATCCTAGATTTCCTGAAGTCCGATGG



TTTCGCCAACAGGAATTTTATGCAATTGATTCATGACGATTCTTTGACTTTTAAAGAGGATATTCAG



AAAGCACAGGTCTCAGGACAGGGCGATTCACTCCATGAACATATAGCTAACCTGGCTGGCTCCCCT



GCTATTAAGAAAGGTATCTTGCAAACCGTCAAAGTAGTAGACGAACTTGTTAAAGTTATGGGAAGA



CACAAACCTGAAAATATCGTTATTGAAATGGCTCGCGAAAACCAGACAACACAAAAGGGTCAAAA



GAATTCGAGAGAGAGAATGAAGCGTATCGAAGAAGGTATTAAAGAACTTGGGTCCCAAATACTTA



AAGAACATCCAGTAGAAAACACTCAGCTTCAAAATGAAAAATTATACTTATATTATCTTCAGAATG



GCCGCGATATGTATGTTGACCAAGAGTTAGATATAAATAGGTTGTCTGATTACGACGTGGATCATAT



TGTACCTCAATCTTTTCTAAAAGATGATTCAATTGATAATAAGGTATTAACGAGAAGTGATAAAAAT



AGAGGTAAATCTGACAACGTGCCAAGCGAAGAGGTGGTGAAGAAAATGAAAAATTATTGGCGTCA



ACTGTTGAACGCCAAGTTAATTACGCAGAGAAAGTTTGATAATCTAACAAAAGCTGAAAGAGGAGG



CCTATCTGAGTTAGATAAGGCCGGTTTTATCAAACGTCAGTTAGTTGAAACCAGGCAAATCACGAA



GCACGTTGCCCAAATTCTAGATTCAAGGATGAATACCAAATACGATGAAAACGATAAACTGATTCG



GGAAGTCAAGGTTATAACTCTAAAAAGCAAACTAGTTTCAGATTTTCGCAAAGATTTTCAATTTTAC



AAAGTTCGAGAAATCAATAATTATCATCATGCTCACGACGCGTACTTGAACGCGGTCGTTGGTACA



GCTTTAATAAAGAAATATCCTAAACTGGAATCGGAATTTGTATATGGGGATTACAAAGTATACGAC



GTGAGAAAGATGATCGCTAAATCTGAACAAGAAATTGGGAAAGCAACTGCCAAATATTTTTTTTAC



AGCAACATAATGAATTTTTTTAAAACGGAAATTACATTGGCAAATGGCGAAATTAGAAAGCGCCCA



TTGATAGAGACCAATGGAGAGACTGGGGAAATCGTGTGGGATAAAGGACGTGATTTTGCCACAGTG



AGGAAAGTGTTAAGTATGCCACAAGTTAATATTGTAAAAAAGACCGAGGTCCAAACGGGTGGATTT



AGCAAAGAATCAATTTTACCTAAGAGAAATTCAGATAAATTAATTGCCCGCAAAAAGGATTGGGAT



CCTAAAAAATATGGTGGTTTTGATTCCCCAACAGTTGCTTACTCCGTCCTAGTTGTTGCTAAGGTTG



AAAAAGGAAAGTCTAAGAAACTTAAATCCGTAAAAGAGTTACTGGGAATTACAATAATGGAAAGA



TCCTCTTTCGAAAAGAACCCTATTGACTTCTTGGAGGCGAAAGGTTATAAAGAAGTCAAAAAAGAT



TTGATCATAAAACTACCAAAGTATTCTCTATTTGAATTGGAAAACGGCAGAAAAAGGATGTTGGCA



AGCGCTGGTGAACTACAAAAGGGTAACGAATTGGCATTGCCGAGTAAATACGTGAATTTTCTATAT



TTGGCATCACATTACGAAAAGTTAAAGGGATCACCCGAGGATAACGAGCAGAAACAACTGTTTGTT



GAACAACACAAACATTATCTTGATGAAATTATAGAACAAATTAGTGAGTTCAGTAAGAGAGTTATT



TTAGCCGATGCAAATTTAGACAAAGTTTTATCTGCTTATAACAAACATAGAGATAAGCCTATAAGG



GAACAAGCCGAAAATATTATTCATTTGTTTACGTTAACAAATTTAGGGGCACCAGCAGCATTCAAGT



ACTTCGATACGACTATCGATCGTAAGCGTTACACATCTACCAAAGAAGTTCTTGATGCAACTTTGAT



TCATCAATCTATAACAGGCTTATATGAAACTAGAATCGATCTGTCACAACTTGGTGGTGACTAA





SEQ ID NO: 170
ATGGACAAGAAGTACTCAATTGGGCTTGCTATCGGCACTAACAGCGTTGGCTGGGCGGTCATCACA



GACGAATATAAGGTCCCATCAAAGAAATTCAAAGTCCTTGGCAATACGGACCGACATTCAATCAAG



AAGAACCTGATTGGAGCTCTGCTGTTTGATTCCGGTGAAACCGCCGAGGCAACACGATTGAAACGT



ACCGCTCGTAGGAGGTATACGCGGCGGAAAAATAGGATCTGCTATCTGCAGGAAATATTTAGCAAC



GAAATGGCCAAGGTAGACGACAGCTTCTTCCACCGGCTCGAGGAATCTTTCCTCGTGGAAGAAGAC



AAAAAGCACGAGCGCCACCCCATTTTCGGCAATATCGTGGACGAGGTAGCTTACCATGAAAAGTAT



CCAACTATTTACCACTTACGTAAGAAGTTAGTGGACAGCACCGATAAAGCCGACCTTCGCCTGATTT



ACCTAGCACTTGCACACATGATTAAGTTCCGAGGCCACTTCTTGATAGAGGGAGACCTGAATCCTG



ACAATTCCGATGTGGATAAATTGTTCATCCAGCTGGTACAGACATACAATCAGTTGTTTGAGGAAA



ATCCGATTAATGCCAGTGGCGTGGACGCCAAGGCTATCCTGTCTGCTCGGCTTAGTAAGAGTAGAC



GCCTGGAAAATCTAATCGCACAGCTGCCCGGCGAAAAGAAAAATGGACTGTTCGGTAATTTGATCG



CCCTGAGCCTGGGCCTCACCCCTAACTTTAAGTCTAACTTCGACCTGGCCGAAGATGCTAAGCTCCA



GCTGTCCAAAGATACTTACGATGACGATCTCGATAATCTACTGGCTCAGATCGGGGACCAGTACGC



TGACCTGTTTCTAGCTGCCAAGAACCTCAGTGACGCCATTCTCCTGTCCGATATTCTGAGGGTTAAC



ACTGAAATTACAAAGGCCCCGCTGAGCGCGAGCATGATCAAAAGGTACGACGAGCATCACCAGGA



CCTCACGCTGCTGAAGGCCTTAGTCAGACAGCAACTGCCCGAAAAGTACAAAGAAATCTTTTTCGA



CCAATCCAAGAACGGGTACGCCGGCTACATTGATGGCGGGGCTTCACAAGAGGAGTTTTACAAGTT



TATCAAGCCCATCCTGGAGAAAATGGACGGCACTGAAGAACTGCTTGTGAAACTCAATAGGGAAGA



CTTACTGAGGAAACAGCGCACATTCGATAATGGCTCCATACCCCACCAAATCCATCTGGGAGAGTT



GCATGCCATCTTGCGAAGGCAGGAGGACTTCTACCCCTTTCTTAAGGACAACAGGGAGAAAATCGA



GAAAATTCTGACTTTCCGTATCCCCTACTACGTGGGCCCACTTGCTCGCGGAAACTCACGATTCGCA



TGGATGACCAGAAAGTCCGAGGAAACAATTACACCCTGGAATTTTGAGGAGGTAGTAGACAAGGG



AGCCAGCGCTCAATCTTTCATTGAGAGGATGACGAATTTCGACAAGAACCTTCCAAACGAGAAAGT



GCTTCCTAAGCACAGCCTGCTGTATGAGTATTTCACGGTGTACAACGAACTTACGAAGGTCAAGTAT



GTGACAGAGGGTATGCGGAAACCTGCTTTTCTGTCTGGTGAACAGAAGAAAGCTATCGTCGATCTC



CTGTTTAAAACCAACCGAAAGGTGACGGTGAAACAGTTGAAGGAGGATTACTTCAAGAAGATCGA



GTGTTTTGATTCTGTTGAAATTTCTGGGGTCGAGGATAGATTCAACGCCAGCCTGGGCACCTACCAT



GATTTGCTGAAGATTATCAAGGATAAGGATTTTCTGGATAATGAGGAGAATGAAGACATTTTGGAG



GATATAGTGCTGACCCTCACCCTGTTCGAGGACCGGGAGATGATCGAGGAGAGACTGAAAACATAC



GCTCACCTGTTTGACGACAAGGTCATGAAGCAGCTTAAGAGACGCCGTTACACAGGCTGGGGAAGA



TTATCCCGCAAATTAATCAACGGGATACGCGATAAACAAAGTGGCAAGACCATACTCGACTTCCTA



AAGAGCGATGGATTCGCAAATCGCAATTTCATGCAGTTGATCCACGACGATAGCCTGACCTTCAAA



GAGGACATTCAGAAAGCGCAGGTGAGTGGTCAAGGGGATTCCCTGCACGAACACATTGCTAACTTG



GCTGGATCACCAGCCATTAAGAAAGGCATACTGCAGACCGTTAAAGTGGTAGATGAGCTTGTGAAA



GTCATGGGAAGACATAAGCCAGAGAACATAGTGATCGAAATGGCCAGGGAAAATCAGACCACGCA



AAAGGGGCAGAAGAACTCAAGAGAGCGTATGAAGAGGATCGAGGAGGGCATCAAGGAGCTGGGT



AGCCAGATCCTTAAAGAGCACCCAGTTGAGAATACCCAGCTGCAGAATGAGAAACTTTATCTCTAT



TATCTCCAGAACGGAAGGGATATGTATGTCGACCAGGAACTGGACATCAATCGGCTGAGTGATTAT



GACGTCGACCACATTGTGCCTCAAAGCTTTCTGAAGGATGATTCCATCGACAATAAAGTTCTGACCC



GGTCTGATAAAAATAGAGGCAAATCCGACAACGTACCTAGCGAAGAAGTCGTCAAAAAAATGAAG



AACTATTGGAGGCAGTTGCTGAATGCCAAGCTGATTACACAACGCAAGTTTGACAATCTCACCAAG



GCAGAAAGGGGGGGCCTGTCAGAACTCGACAAAGCAGGTTTCATTAAAAGGCAGCTAGTTGAAAC



TAGGCAGATTACTAAGCACGTGGCCCAGATCCTCGACTCACGGATGAATACAAAGTATGATGAGAA



TGATAAGCTAATCCGGGAGGTGAAGGTGATTACTCTGAAATCTAAGCTGGTGTCAGATTTCAGAAA



AGACTTCCAGTTCTACAAAGTCAGAGAGATCAACAATTATCACCATGCCCACGATGCATATCTTAAT



GCAGTAGTGGGGACAGCTCTGATCAAAAAATATCCTAAACTGGAGTCTGAATTCGTTTATGGTGAC



TATAAAGTCTATGACGTCAGAAAAATGATCGCAAAGAGCGAGCAGGAGATAGGGAAGGCCACAGC



AAAGTACTTCTTTTACAGTAATATCATGAACTTTTTCAAAACTGAGATTACATTGGCTAACGGCGAG



ATCCGCAAGCGGCCACTGATAGAGACTAACGGAGAGACAGGGGAGATTGTTTGGGATAAGGGCCG



TGACTTCGCCACCGTTAGGAAAGTGCTGTCCATGCCCCAGGTGAACATTGTGAAGAAGACAGAAGT



GCAGACGGGTGGGTTCTCAAAAGAGTCTATTCTGCCTAAGCGGAATAGTGACAAACTGATCGCACG



TAAAAAGGACTGGGATCCAAAAAAGTACGGCGGATTCGACAGTCCTACCGTTGCATATTCCGTGCT



TGTGGTCGCTAAGGTGGAGAAGGGAAAAAGCAAGAAACTGAAGTCAGTCAAAGAACTACTGGGCA



TAACGATCATGGAGCGCTCCAGTTTCGAAAAAAACCCAATCGATTTTCTTGAAGCCAAGGGATACA



AGGAGGTAAAGAAAGACCTTATCATTAAGCTGCCTAAGTACAGTCTGTTCGAACTGGAGAATGGGA



GGAAGCGCATGCTGGCATCAGCTGGAGAACTCCAAAAAGGGAACGAGTTGGCCCTCCCCTCAAAGT



ATGTCAATTTTCTCTACCTGGCTTCTCACTACGAGAAGTTAAAGGGGTCTCCAGAGGATAATGAGCA



GAAACAGCTGTTTGTGGAACAGCACAAGCACTATTTGGACGAAATCATCGAACAAATTTCCGAGTT



CAGTAAGAGGGTGATTCTGGCCGACGCAAACCTTGACAAAGTTCTGTCCGCATACAATAAGCACAG



AGACAAACCAATCCGCGAGCAAGCCGAGAATATAATTCACCTTTTCACTCTGACTAATCTGGGGGC



CCCCGCAGCATTTAAATATTTCGATACAACAATCGACCGGAAGCGGTATACATCTACTAAGGAAGT



CCTCGATGCGACACTGATCCACCAGTCAATTACAGGTTTATATGAAACAAGAATCGACCTGTCCCA



GCTGGGCGGCGACTAG





SEQ ID NO: 171
AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAAGCATTGATAATTG



AGATCCCTCTCCCTGACAGGATGATTACATAAATAATAGTGACAAAAATAAATTATTTATTTATCCA



GAAAATGAATTGGAAAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTcaaaCAG



GTtgccgtcactgcgtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagc



attctgtaacaaagcgggaccaaagccatgacaaaaacgcgtaacaaaagtgtctataatcacggcag



aaaagtccacattgattatttgcacggcgtcacactttgctatgccatagcatttttatccataagat



tagcggatcctacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggct



agcaccgcctatctcgtgtgagataggcggagatacgaactttaagAAGGAGatatacc



ATGGAACAGGAATATTATCTGGGCTTGGACATGGGCACCGGTTCCGTCGGCTGGGCTGTTACTGA



CAGTGAATATCACGTTCTAAGAAAGCATGGTAAGGCATTGTGGGGTGTAAGACTTTTCGAATCTGCT



TCCACTGCTGAAGAGCGTAGAATGTTTAGAACGAGTCGACGTAGGCTAGACAGGCGCAATTGGAGA



ATCGAAATTTTACAAGAAATTTTTGCGGAAGAGATATCTAAGAAAGACCCAGGCTTTTTCCTGAGA



ATGAAGGAATCTAAGTATTACCCTGAGGATAAAAGAGATATAAATGGTAACTGTCCCGAATTGCCT



TACGCATTATTTGTGGACGATGATTTTACCGATAAGGATTACCATAAAAAGTTCCCAACTATCTACC



ATTTACGCAAAATGTTAATGAATACAGAGGAAACCCCAGACATAAGACTAGTTTATCTGGCAATAC



ACCATATGATGAAACATAGAGGCCATTTCTTACTTTCCGGGGATATCAACGAAATCAAAGAGTTTG



GTACCACATTTAGTAAGTTACTGGAAAACATAAAGAATGAAGAATTGGATTGGAACTTAGAACTCG



GAAAAGAAGAATACGCGGTTGTCGAATCTATCCTGAAGGATAATATGCTGAATAGGTCGACCAAAA



AAACTAGGCTGATCAAAGCACTGAAAGCCAAATCTATCTGCGAAAAAGCTGTTTTAAATTTACTTG



CTGGTGGCACTGTTAAGTTATCAGACATTTTTGGTTTGGAAGAATTGAACGAAACCGAGCGTCCAA



AAATTAGTTTCGCTGATAATGGCTACGATGATTACATTGGTGAGGTGGAAAACGAGTTGGGCGAAC



AATTTTATATTATAGAGACAGCTAAGGCAGTCTATGACTGGGCTGTTTTAGTAGAAATCCTTGGTAA



ATACACATCTATCTCCGAAGCGAAAGTTGCTACTTACGAAAAGCACAAGTCCGATCTCCAGTTTTTG



AAGAAAATTGTCAGGAAATATCTGACTAAGGAAGAATATAAAGATATTTTCGTTAGTACCTCTGAC



AAACTGAAAAATTACTCCGCTTACATCGGGATGACCAAGATTAATGGCAAAAAAGTTGATCTGCAA



AGCAAAAGGTGTTCGAAGGAAGAATTTTATGATTTCATTAAAAAGAATGTCTTAAAAAAATTAGAA



GGTCAGCCAGAATACGAATATTTGAAAGAAGAACTGGAAAGAGAGACATTCTTACCAAAACAAGT



CAACAGAGATAATGGGGTAATTCCATATCAAATTCACCTCTACGAATTAAAAAAAATTTTAGGCAA



TTTACGCGATAAAATTGACCTTATCAAAGAAAATGAGGATAAGCTGGTTCAACTCTTTGAATTCAGA



ATACCCTATTATGTGGGCCCACTGAACAAGATTGATGACGGCAAAGAAGGTAAATTCACATGGGCC



GTCCGCAAATCCAATGAAAAAATTTACCCATGGAACTTTGAAAATGTAGTAGATATTGAAGCGTCT



GCGGAGAAATTTATTCGAAGAATGACTAATAAATGCACTTACTTGATGGGAGAGGATGTTCTGCCT



AAAGACAGCTTATTATACAGCAAGTACATGGTTCTAAACGAACTTAACAACGTTAAGTTGGACGGT



GAGAAATTAAGTGTAGAATTGAAACAAAGATTGTATACTGACGTCTTCTGCAAGTACAGAAAAGTG



ACAGTTAAAAAAATTAAGAATTACTTGAAGTGCGAAGGTATAATTTCTGGAAACGTAGAGATTACT



GGTATTGATGGTGATTTCAAAGCATCCCTAACAGCTTACCACGATTTCAAGGAAATCCTGACAGGA



ACTGAACTCGCAAAAAAAGATAAAGAAAACATTATTACTAATATTGTTCTTTTCGGTGATGACAAG



AAATTGTTGAAGAAAAGACTGAATAGACTTTACCCCCAGATTACTCCCAATCAACTTAAGAAAATT



TGTGCTTTGTCTTACACAGGATGGGGTCGTTTTTCAAAAAAGTTCTTAGAAGAGATTACCGCACCTG



ATCCAGAAACAGGCGAAGTATGGAATATAATTACCGCCTTATGGGAATCGAACAATAATCTTATGC



AACTTCTGAGCAATGAATATCGTTTCATGGAAGAAGTTGAGACTTACAACATGGGCAAACAGACGA



AGACTTTATCCTATGAAACTGTGGAAAATATGTATGTATCACCTTCTGTCAAGAGACAAATTTGGCA



AACCTTAAAAATTGTCAAAGAATTAGAAAAGGTAATGAAGGAGTCTCCTAAACGTGTGTTTATTGA



AATGGCTAGAGAAAAACAAGAGTCAAAAAGAACCGAGTCAAGAAAGAAGCAGTTAATCGATTTAT



ATAAGGCTTGTAAAAACGAAGAGAAAGATTGGGTTAAAGAATTGGGGGACCAAGAGGAACAAAAA



CTACGGTCGGATAAGTTGTATTTATACTATACGCAAAAGGGACGATGTATGTATTCCGGCGAGGTA



ATAGAATTGAAGGATTTATGGGACAATACAAAATATGACATAGACCATATATATCCCCAATCAAAA



ACGATGGACGATAGCTTGAACAATAGAGTACTCGTGAAAAAAAAATATAATGCGACCAAATCTGAT



AAGTATCCTCTGAATGAAAATATCAGACATGAAAGAAAGGGGTTCTGGAAGTCCTTGTTAGATGGT



GGGTTTATAAGCAAAGAAAAGTACGAGCGTCTAATAAGAAACACGGAGTTATCGCCAGAAGAACT



CGCTGGTTTTATTGAGAGGCAAATCGTGGAAACGAGACAATCTACCAAAGCCGTTGCTGAGATCCT



AAAGCAAGTTTTCCCAGAGTCGGAGATTGTCTATGTCAAAGCTGGCACAGTGAGCAGGTTTAGGAA



AGACTTCGAACTATTAAAGGTAAGAGAAGTGAACGATTTACATCACGCAAAGGACGCTTACCTAAA



TATCGTTGTAGGTAACTCATATTATGTTAAATTTACCAAGAACGCCTCTTGGTTTATAAAGGAGAAC



CCAGGTAGAACATATAACCTGAAAAAGATGTTCACCTCTGGTTGGAATATTGAGAGAAACGGAGAA



GTCGCATGGGAAGTTGGTAAGAAAGGGACTATAGTGACAGTAAAGCAAATTATGAACAAAAATAA



TATCCTCGTTACAAGGCAGGTTCATGAAGCAAAGGGCGGCCTTTTTGACCAACAAATTATGAAGAA



AGGGAAAGGTCAAATTGCAATAAAAGAAACCGATGAGAGACTAGCGTCAATAGAAAAGTATGGTG



GCTATAATAAAGCTGCGGGTGCATACTTTATGCTTGTTGAATCAAAAGACAAGAAAGGTAAGACTA



TTAGAACTATAGAATTTATACCCCTGTACCTTAAAAACAAAATTGAATCGGATGAGTCAATCGCGTT



AAATTTTCTAGAGAAAGGAAGGGGTTTAAAAGAACCAAAGATCCTGTTAAAAAAGATTAAGATTGA



CACCTTGTTCGATGTAGATGGATTTAAAATGTGGTTATCTGGCAGAACAGGCGATAGACTTTTGTTT



AAGTGCGCTAATCAATTAATTTTGGATGAGAAAATCATTGTCACAATGAAAAAAATAGTTAAGTTT



ATTCAGAGAAGACAAGAAAACAGGGAGTTGAAATTATCTGATAAAGATGGTATCGACAATGAAGT



TTTAATGGAAATCTACAATACATTCGTTGATAAACTTGAAAATACCGTATATCGAATCAGGTTAAGT



GAACAAGCCAAAACATTAATTGATAAACAAAAAGAATTTGAAAGGCTATCACTGGAAGACAAATC



CTCCACCCTATTTGAAATTTTGCATATATTCCAGTGCCAATCTTCAGCAGCTAATTTAAAAATGATTG



GCGGACCTGGGAAAGCCGGCATCCTAGTGATGAACAATAATATCTCCAAGTGTAACAAAATATCAA



TTATTAACCAATCTCCGACAGGTATTTTTGAAAATGAAATAGACTTGCTTAAGATATAAGAAATCAT



CCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATTTATTATATCGCGTTGATTATTGATGCTGTTT



TTAGTTTTAACGGCAATTAATATATGTGTTATTAATTGAATGAATTTTATCATTCATAATAAGTATGT



GTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA



GAATTATCTCATAACAAGTGTTAAGGGATGTTATTTCC





SEQ ID NO: 172
AATTCAAAGGATAATCAAAC

SEQ ID NO: 173
AATCTCTACTCTTTGTAGAT

SEQ ID NO: 174
AATTTCTACTGTTGTAGAT

SEQ ID NO: 175
AATTTCTACTAGTGTAGAT

SEQ ID NO: 176
AATTTCTACTATTGT

SEQ ID NO: 177
AATTTCTACTGTTGTAGA

SEQ ID NO: 178
AATTTCTACTATTGTA

SEQ ID NO: 179
AATTTCTACTTTTGTAGAT

SEQ ID NO: 180
AATTTCTACTGTTGTAGAT

SEQ ID NO: 181
AATTTCTACTCTTGTAGAT
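
Several of the short sequences listed above (SEQ ID NOs: 174, 175, 179 and 180) have the same length, so they can be compared position by position. The following is a minimal illustrative sketch, not part of the patent text, that computes an ungapped percent identity for equal-length sequences. The function name `percent_identity` is hypothetical, and the position-wise comparison is an assumption made for simplicity; percent identity between sequences of different lengths is normally obtained from an alignment, which this sketch does not attempt.

```python
def percent_identity(a: str, b: str) -> float:
    """Ungapped, position-by-position identity of two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("this sketch only handles equal-length sequences")
    a, b = a.upper(), b.upper()
    # Count positions at which both sequences carry the same base.
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


# Sequences copied from the listing above.
SEQ_174 = "AATTTCTACTGTTGTAGAT"
SEQ_175 = "AATTTCTACTAGTGTAGAT"
SEQ_179 = "AATTTCTACTTTTGTAGAT"
SEQ_180 = "AATTTCTACTGTTGTAGAT"

for label, other in (("175", SEQ_175), ("179", SEQ_179), ("180", SEQ_180)):
    print(f"SEQ ID NO: 174 vs {label}: {percent_identity(SEQ_174, other):.1f}% identical")
```

As listed, SEQ ID NOs: 174 and 180 are identical, while SEQ ID NOs: 174 and 179 differ at a single position (18 of 19 positions match).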








Claims
  • 1. A nucleic acid-guided nuclease system comprising: an engineered guide nucleic acid complexed with a nucleic acid-guided nuclease comprising the amino acid sequence of SEQ ID NO: 7 and an editing sequence comprising a nucleic acid sequence complementary to a target region having a change in sequence relative to the target region, wherein the engineered guide nucleic acid comprises the nucleotide sequence of SEQ ID NO: 88, SEQ ID NO: 93, or SEQ ID NO: 94.
  • 2. The nucleic acid-guided nuclease system of claim 1, wherein the nucleic acid-guided nuclease is encoded by a nucleotide sequence comprising the nucleotide sequence of SEQ ID NO: 27, SEQ ID NO: 47, SEQ ID NO: 133, or SEQ ID NO: 153.
  • 3. The nucleic acid-guided nuclease system of claim 1, wherein a nucleotide sequence encoding the nucleic acid-guided nuclease is codon optimized.
  • 4. The nucleic acid-guided nuclease system of claim 3, wherein the nucleotide sequence encoding the nucleic acid-guided nuclease is codon optimized for E. coli.
  • 5. The nucleic acid-guided nuclease system of claim 3, wherein the nucleotide sequence encoding the nucleic acid-guided nuclease is codon optimized for S. cerevisiae.
  • 6. The nucleic acid-guided nuclease system of claim 3, wherein the nucleotide sequence encoding the nucleic acid-guided nuclease is codon optimized for mammalian cells.
  • 7. The nucleic acid-guided nuclease system of claim 3, wherein the nucleotide sequence encoding the nucleic acid-guided nuclease is codon optimized for plant cells.
  • 8. The nucleic acid-guided nuclease system of claim 3, wherein the codon optimized nucleotide sequence encoding the nucleic acid-guided nuclease comprises a nucleotide sequence having at least 95% sequence identity to the nucleotide sequence of SEQ ID NO: 47, SEQ ID NO: 133, or SEQ ID NO: 153.
  • 9. The nucleic acid-guided nuclease system of claim 1, wherein an endogenous scaffold sequence comprises the nucleotide sequence of SEQ ID NO: 90.
  • 10. The nucleic acid-guided nuclease system of claim 1, wherein the engineered guide nucleic acid comprises a pseudoknot region comprising nucleotide residues 17-26 of SEQ ID NO: 200 or nucleotide residues 17-26 of SEQ ID NO: 201.
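
Claims 3 to 7 recite codon-optimized coding sequences, that is, different nucleotide sequences that encode the same amino acids. As a hedged illustration that is not part of the patent text, the sketch below translates the first 30 nucleotides of SEQ ID NOs: 167 to 170 from the sequence listing and collects the resulting peptides. It assumes the standard genetic code and a reading frame starting at the first nucleotide; the names `CODON_TABLE`, `translate` and `PREFIXES` are hypothetical. It checks only the first ten codons and says nothing about which host any listed sequence was optimized for.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, ignoring any trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 nucleotides of each sequence, copied from the listing.
PREFIXES = {
    167: "ATGGATAAGAAATACTCAATAGGCTTAGCA",
    168: "ATGGATAAGAAGTATTCAATTGGACTTGCG",
    169: "ATGGACAAGAAATATTCAATCGGTTTAGCA",
    170: "ATGGACAAGAAGTACTCAATTGGGCTTGCT",
}

peptides = {seq_id: translate(dna) for seq_id, dna in PREFIXES.items()}
for seq_id, peptide in peptides.items():
    print(f"SEQ ID NO: {seq_id}: {PREFIXES[seq_id]} -> {peptide}")
```

The four prefixes differ at the nucleotide level but use synonymous codons, so under the stated assumptions they translate to the same ten residues.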
RELATED APPLICATIONS

This application is a Continuation of U.S. Ser. No. 16/421,783, entitled “Nucleic Acid-Guided Nucleases” filed 24 May 2019; which is a Continuation of U.S. Ser. No. 15/896,444, entitled “Nucleic Acid-Guided Nucleases” filed 14 Feb. 2018, now U.S. Pat. No. 10,337,028; which is a Continuation patent application of U.S. Ser. No. 15/632,001, entitled “Nucleic Acid-Guided Nucleases” filed 23 Jun. 2017, now U.S. Pat. No. 9,982,279.

US Referenced Citations (89)
Number Name Date Kind
6391582 Ying et al. May 2002 B2
6837995 Vassarotti et al. Jan 2005 B1
7166443 Walker et al. Jan 2007 B2
8332160 Platt et al. Dec 2012 B1
8697359 Zhang et al. Apr 2014 B1
8926977 Miller et al. Jan 2015 B2
9260505 Weir et al. Feb 2016 B2
9361427 Hillson Jun 2016 B2
9499855 Hyde et al. Nov 2016 B2
9776138 Innings et al. Oct 2017 B2
9790490 Zhang et al. Oct 2017 B2
9896696 Begemann et al. Feb 2018 B2
9982279 Gill et al. May 2018 B1
9988624 Serber et al. Jun 2018 B2
10011849 Gill et al. Jul 2018 B1
10017760 Gill et al. Jul 2018 B2
10227576 Cameron et al. Mar 2019 B1
10266851 Chen Apr 2019 B2
10704033 Kim et al. Jul 2020 B1
10724021 Kim et al. Jul 2020 B1
10745678 Kim et al. Aug 2020 B1
10767169 Kim et al. Sep 2020 B1
10837021 Tian et al. Nov 2020 B1
10927385 Kannan et al. Feb 2021 B2
20020139741 Kopf Oct 2002 A1
20040110253 Kappler et al. Jun 2004 A1
20060014137 Ghosh et al. Jan 2006 A1
20070020761 Yu et al. Jan 2007 A1
20100076057 Sontheimer et al. Mar 2010 A1
20110294217 McConnell-Smith et al. Dec 2011 A1
20130236970 Anneren et al. Sep 2013 A1
20140068797 Doudna et al. Mar 2014 A1
20140199767 Barrangou et al. Jul 2014 A1
20140242033 Gruber et al. Aug 2014 A1
20140273226 Wu et al. Sep 2014 A1
20150024464 Lippow et al. Jan 2015 A1
20150071898 Liu et al. Mar 2015 A1
20150098954 Hyde et al. Apr 2015 A1
20150159174 Frendewey et al. Jun 2015 A1
20150176013 Musunuru et al. Jun 2015 A1
20150191719 Hudson et al. Jul 2015 A1
20150225732 Williams et al. Aug 2015 A1
20150344549 Muir et al. Dec 2015 A1
20160024529 Carstens et al. Jan 2016 A1
20160053272 Wurzel et al. Feb 2016 A1
20160053304 Wurzel et al. Feb 2016 A1
20160076093 Shendure et al. Mar 2016 A1
20160102322 Ravinder et al. Apr 2016 A1
20160130608 Doudna et al. May 2016 A1
20160168592 Church et al. Jun 2016 A1
20160264981 Yang et al. Sep 2016 A1
20160281053 Sorek et al. Sep 2016 A1
20160289673 Huang et al. Oct 2016 A1
20160298134 Chen et al. Oct 2016 A1
20160354487 Zhang et al. Dec 2016 A1
20170002339 Barrangou et al. Jan 2017 A1
20170022499 Lu et al. Jan 2017 A1
20170044525 Kaper et al. Feb 2017 A1
20170051310 Doudna et al. Feb 2017 A1
20170073705 Chen et al. Mar 2017 A1
20170191123 Kim et al. Jul 2017 A1
20170211078 Kamineni et al. Jul 2017 A1
20170240922 Gill et al. Aug 2017 A1
20170369870 Gill et al. Dec 2017 A1
20180028567 Li et al. Feb 2018 A1
20180052176 Holt et al. Feb 2018 A1
20180073013 Lorenz et al. Mar 2018 A1
20180112235 Li et al. Apr 2018 A1
20180187149 Ma et al. Jul 2018 A1
20180200342 Bikard et al. Jul 2018 A1
20180203017 Ting et al. Jul 2018 A1
20180230460 Gill et al. Aug 2018 A1
20180230461 Gill et al. Aug 2018 A1
20180284125 Gordon et al. Oct 2018 A1
20180327761 Duchateau et al. Nov 2018 A1
20180371498 Gill et al. Dec 2018 A1
20190017072 Ditommaso et al. Jan 2019 A1
20190085324 Regev et al. Mar 2019 A1
20190136230 Sather et al. May 2019 A1
20190169605 Masquelier et al. Jun 2019 A1
20190194650 Gill et al. Jun 2019 A1
20190225928 Masquelier et al. Jul 2019 A1
20190270987 Masquelier et al. Sep 2019 A1
20200071660 Spindler et al. Mar 2020 A1
20200095533 Garst et al. Mar 2020 A1
20200109398 Rubens et al. Apr 2020 A1
20200216794 Belgrader et al. Jul 2020 A1
20200263197 Cheng et al. Aug 2020 A1
20200270632 Roy et al. Aug 2020 A1
Foreign Referenced Citations (44)
Number Date Country
2395087 Dec 2011 EP
3199632 Aug 2017 EP
WO2002010183 Feb 2002 WO
WO 2003087341 Oct 2003 WO
WO 2010079430 Jul 2010 WO
WO 2011072246 Jun 2011 WO
WO 2011143124 Nov 2011 WO
WO 2013142578 Sep 2013 WO
WO 2013176772 Nov 2013 WO
WO 2014018423 Jan 2014 WO
WO2014143381 Sep 2014 WO
WO 2014144495 Sep 2014 WO
WO 2016110453 Jul 2016 WO
WO 2017053902 Mar 2017 WO
WO2017075265 May 2017 WO
WO 2017078631 May 2017 WO
WO 2017083722 May 2017 WO
WO2017106414 Jun 2017 WO
WO 2017106414 Jun 2017 WO
WO 2017161371 Sep 2017 WO
WO 2017174329 Oct 2017 WO
WO 2017186718 Nov 2017 WO
WO-2017189308 Nov 2017 WO
WO2017212400 Dec 2017 WO
WO 2017216392 Dec 2017 WO
WO 2017223330 Dec 2017 WO
WO 2018031950 Feb 2018 WO
WO 2018071672 Apr 2018 WO
WO2018073391 Apr 2018 WO
WO 2018083339 May 2018 WO
WO2018152325 Aug 2018 WO
WO2018172556 Sep 2018 WO
WO 2018191715 Oct 2018 WO
WO2019006436 Jan 2019 WO
WO2019055878 Mar 2019 WO
WO2019200004 Oct 2019 WO
WO2019209926 Oct 2019 WO
WO2020005383 Jan 2020 WO
WO2020021045 Jan 2020 WO
WO2020074906 Apr 2020 WO
WO2020191102 Sep 2020 WO
WO2020191153 Sep 2020 WO
WO2020217057 Oct 2020 WO
WO2021207541 Oct 2021 WO
Non-Patent Literature Citations (119)
Entry
GenBank CP004049.1. 2014. GenBank. p. 1-322 (Year: 2014).
GenBank KU516160.1. 2016. GenBank. p. 1-8 (Year: 2016).
GenBank KU516198.1. 2016. GenBank. p. 1-85 (Year: 2016).
Singh RK et al. Protein Engineering Approaches in the Post-Genomic Era. 2017. Current Protein and Peptide Science. 2017. 18, 1-11 (Year: 2017).
Zhang M et al. Propagated Perturbations from a Peripheral Mutation Show Interactions Supporting WW Domain Thermostability. 2018. Structure. 26, 1474-1485 (Year: 2018).
Bao, et al., “Genome-scale engineering of Saccharomyces cerevisiae with single-nucleotide precision”, Nature Biotechnology, doi:10.1038/nbt.4132, pp. 1-6 (May 7, 2018).
DiCarlo, et al., “Genome engineering in Saccharomyces cerevisiae using CRISPR-Cas systems”, Nucleic Acids Research, 41(7):4336-43 (2013).
Garst, et al., “Genome-wide mapping of mutations at single-nucleotide resolution for protein, metabolic and genome engineering”, Nature Biotechnology, 35(1):48-59 (2017).
Hsu, et al., “DNA targeting specificity of RNA-guided Cas9 nucleases”, Nature Biotechnology, 31(9):827-32 (2013).
Jiang, et al., “RNA-guided editing of bacterial genomes using CRISPR-Cas systems”, Nature Biotechnology, 31(3):233-41 (2013).
Jinek, et al., “A Programmable Dual-RNA-Guided DNA Endonuclease in Adaptive Bacterial Immunity”, Science, 337:816-20 (2012).
Verwaal, et al., “CRISPR/Cpf1 enables fast and simple genome editing of Saccharomyces cerevisiae”, Yeast, 35:201-11 (2018).
Lian, et al., “Combinatorial metabolic engineering using an orthogonal tri-functional CRISPR system”, Nature Communications, doi:10.1038/s41467-017-01695-x, pp. 1-9 (2017).
Roy, et al., “Multiplexed precision genome editing with trackable genomic barcodes in yeast”, Nature Biotechnology, doi:10.1038/nbt.4137, pp. 1-16 (2018).
Dong, “Establishment of a highly efficient virus-inducible CRISPR/Cas9 system in insect cells,” Antiviral Res., 130:50-7(2016).
Epinat et al., “A novel engineered meganuclease induces homologous recombination in eukaryotic cells, e.g., yeast and mammalian cells”, Nucleic Acids Research, 31(11): 2952-2962.
Farasat et al., “A Biophysical Model of CRISPR/Cas9 Activity for Rational Design of Genome Editing and Gene Regulation,” PLoS Comput Biol., 12(1):e1004724 (2016).
Liu et al., “A chemical-inducible CRISPR-Cas9 system for rapid control of genome editing”, Nature Chemical Biology, 12:980-987(2016).
Eklund, et al., “Altered target site specificity variants of the I-PpoI His-Cys box homing endonuclease” Nucleic Acids Research, 35(17):5839-50 (2007).
Boles, et al., “Digital-to-biological converter for on-demand production of biologics”, Nature Biotechnology, doi:10.1038/nbt.3859 (May 29, 2017).
Pines, et al., “Codon Compression Algorithms for Saturation Mutagenesis”, ACS Synthetic Biology, 4:604-14 (2015).
Bessa et al., “Improved gap repair cloning in yeast: treatment of the gapped vector with Taq DNA polymerase avoids vector self-ligation,” Yeast, 29(10):419-23 (2012).
Boch, “TALEs of genome targeting,” Nature Biotechnology vol. 29, pp. 135-136 (2011).
Campbell et al., “Targeting protein function: the expanding toolkit for conditional disruption,” Biochem J., 473(17):2573-2589 (2016).
Casini et al., “Bricks and blueprints: methods and standards for DNA assembly,” Nat Rev Mol Cell Biol., (9):568-76 (2015).
Chica et al., “Semi-rational approaches to engineering enzyme activity: combining the benefits of directed evolution and rational design,” Current Opinion in Biotechnology, 16(4): 378-384 (2005).
Durai et al., “Zinc finger nucleases: custom-designed molecular scissors for genome engineering of plant and mammalian cells”, Nucleic Acids Res., 33(18):5978-90 (2005).
Kadonaga et al., “Regulation of RNA polymerase II transcription by sequence-specific DNA binding factors”, Cell, 116(2):247-57 (2004).
Lee et al., “Targeted chromosomal deletions in human cells using zinc finger nucleases”, Genome Res., 20(1): 81-9 (2009).
Miller et al., “A TALE nuclease architecture for efficient genome editing”, Nature Biotechnology, 29 (2): 143-8 (2011).
Mittelman et al., “Zinc-finger directed double-strand breaks within CAG repeat tracts promote repeat instability in human cells”, PNAS USA, 106 (24): 9607-12 (2009).
Shivange, “Advances in generating functional diversity for directed protein evolution”, Current Opinion in Chemical Biology, 13 (1): 19-25 (2009).
Udo, “An Alternative Method to Facilitate cDNA Cloning for Expression Studies in Mammalian Cells by Introducing Positive Blue White Selection in Vaccinia Topoisomerase I-Mediated Recombination,” PLoS One, 10(9):e0139349 (2015).
Urnov et al., “Genome editing with engineered zinc finger nucleases”, Nature Reviews Genetics, 11:636-646 (2010).
International Search Report and Written Opinion for International Application No. PCT/US2018/053608, dated Dec. 13, 2018, p. 1-9.
International Search Report and Written Opinion for International Application No. PCT/US2018/053670, dated Jan. 3, 2019, p. 1-13.
International Search Report and Written Opinion for International Application No. PCT/US2018/053671, dated Sep. 26, 2018, p. 1-12.
International Search Report and Written Opinion for International Application No. PCT/US2018/040519, dated Sep. 26, 2018, p. 1-8.
International Search Report and Written Opinion for International Application No. PCT/US2019/026836, dated Jul. 2, 2019, p. 1-10.
International Search Report and Written Opinion for International Application No. PCT/US2019/023342, dated Jun. 6, 2019, p. 1-34.
International Search Report and Written Opinion for International Application No. PCT/US2019/030085, dated Jul. 23, 2019, p. 1-14.
International Search Report and Written Opinion for International Application No. PCT/US20/24341, dated Jun. 19, 2020, p. 1-9.
NonFinal Office Action for U.S. Appl. No. 16/399,988, dated Jul. 31, 2019, p. 1-20.
First Office Action Interview Pilot Program Pre-Interview Communication for U.S. Appl. No. 16/024,831, dated Feb. 12, 2019, p. 1-37.
NonFinal Office Action for U.S. Appl. No. 16/024,816 dated Sep. 4, 2018, p. 1-10.
Final Office Action for U.S. Appl. No. 16/024,816 dated Nov. 26, 2018, p. 1-12.
First Office Action Interview Pilot Program Pre-Interview Communication for U.S. Appl. No. 16/454,865 dated Aug. 16, 2019, p. 1-36.
Yoshioka, et al., “Development of a mono-promoter-driven CRISPR/Cas9 system in mammalian cells”, Scientific Reports, Jul. 3, 2015, p. 1-8.
Remaut, et al., “Plasmid vectors for high-efficiency expression controlled by the PL promoter of coliphage lambda,” Laboratory of Molecular Biology, Apr. 15, 1981, p. 81-93.
International Search Report and Written Opinion for International Application No. PCT/US2019/028821, dated Aug. 2, 2019, p. 1-14.
International Search Report and Written Opinion for International Application No. PCT/US2019/028883, dated Aug. 16, 2019, p. 1-12.
International Search Report and Written Opinion for International Application No. PCT/US2019/46526, dated Dec. 18, 2019, p. 1-17.
International Search Report and Written Opinion for International Application No. PCT/US2018/34779, dated Nov. 26, 2018, p. 1-39.
International Search Report and Written Opinion for International Application No. PCT/US19/57250, dated Feb. 25, 2020, p. 1-16.
International Search Report and Written Opinion for International Application No. PCT/US19/47135, dated Jun. 11, 2020, p. 1-15.
International Search Report and Written Opinion for International Application No. PCT/US20/19379, dated Jul. 22, 2020, p. 1-10.
International Search Report and Written Opinion for International Application No. PCT/US20/36064, dated Sep. 18, 2020, p. 1-16.
International Search Report and Written Opinion for International Application No. PCT/US20/40389, dated Oct. 13, 2020, p. 1-12.
Arnak, et al., “Yeast Artificial Chromosomes”, John Wiley & Sons, Ltd., doi:10.1002/9780470015902.a0000379.pub3, pp. 1-10 (2012).
Woo, et al., “Dual roles of yeast Rad51 N-terminal domain in repairing DNA double-strand breaks”, Nucleic Acids Research, doi:10.1093/nar/gkaa587, vol. 48, No. 15, pp. 8474-8489 (2020).
International Search Report and Written Opinion for International Application No. PCT/US2021/012868, dated Mar. 26, 2021, p. 1-15.
Anzalone et al., “Search-and-replace genome editing without double-strand breaks or donor DNA,” Nature, Oct. 21, 2019, vol. 576, No. 7785, pp. 149-157.
Alvarez, et al., “In vivo diversification of target genomic sites using processive T7 RNA polymerase-base deaminase fusions blocked by RNA-guided dCas9”, Dept. of Microbial Biotechnology and Systems Biology Program, Madrid, Spain, Jan. 1, 2019, p. 1-33.
International Search Report and Written Opinion for International Application No. PCT/US20/65168, dated Mar. 17, 2021, p. 1-15.
International Search Report and Written Opinion for International Application No. PCT/US2020/038345, dated Nov. 23, 2020, p. 1-13.
International Search Report and Written Opinion for International Application No. PCT/US21/12867, dated May 12, 2021, p. 1-17.
International Search Report and Written Opinion for International Application No. PCT/US2020/064727, dated Apr. 28, 2021, p. 1-13.
International Search Report and Written Opinion for International Application No. PCT/US21/29008, dated Aug. 24, 2021, p. 1-19.
International Search Report and Written Opinion for International Application No. PCT/US21/29011, dated Aug. 24, 2021, p. 1-20.
Bauer, et al., “Cell-microcarrier Adhesion to Gas-Liquid Interfaces and Foam”, Biotechnol. Prog. 2000, 16, 125-132, Oct. 19, 1999.
Datlinger, et al., “Pooled CRISPR screening with single-cell transcriptome readout”, Nature Methods, Jan. 10, 2017; p. 1-10, doi:10.1038/nmeth.4177.
Dixit, et al., “Perturb-Seq: Dissecting Molecular Circuits with Scalable Single-Cell RNA Profiling of Pooled Genetic Screens”, Cell 167, p. 1853-1866, Dec. 15, 2016.
GE Healthcare Life Sciences, “Microcarrier Cell Culture Principles and Methods”, 18-1140-62 AC, p. 1-23, Nov. 2013.
Jacobi, et al., “Simplified CRISPR tools for efficient genome editing and streamlined protocols for their delivery into mammalian cells and mouse zygotes”, Methods 121-122, p. 16-28, Mar. 23, 2017.
Jaitin, et al., “Dissecting Immune Circuits by Linking CRISPR-Pooled Screens with Single-Cell RNA-Seq”, Cell 167, p. 1883-1896, Dec. 15, 2016.
Kim, et al., “Formation of Thermoresponsive Poly(N-isopropylacrylamide)/Dextran Particles by Atom Transfer Radical Polymerization”, Macromol. Rapid Commun., 24, p. 517-521, 2003.
Kimple, et al., “Overview of Affinity Tags for Protein Purification”, Curr Protoc Protein Sci., 73: Unit 9.9, doi:10.1002/0471140864.ps0909s73, p. 1-26, Aug. 6, 2015.
Nienow, et al., “A potentially scalable method for the harvesting of hMSCs from microcarriers”, Biochemical Engineering Journal 85, p. 79-88, Feb. 4, 2014.
Replogle, et al., “Direct capture of CRISPR guides enables scalable, multiplexed, and multi-omic Perturb-Seq”, bioRxiv; doi:http://dx.doi.org/10.1101/503367, p. 1-26, Dec. 21, 2018.
Sivalingam, et al., “Superior Red Blood Cell Generation from Human Pluripotent Stem Cells Through a Novel Microcarrier-Based Embryoid Body Platform”, Tissue Engineering: Part C, vol. 22, No. 8, p. 765-780, Jun. 9, 2016.
International Search Report and Written Opinion for International Application No. PCT/US21/35807, dated Nov. 24, 2021, p. 1-21.
International Search Report and Written Opinion for International Application No. PCT/US21/50338, dated Dec. 10, 2021, p. 1-17.
International Search Report and Written Opinion for International Application No. PCT/US21/43097, dated Nov. 19, 2021, p. 1-12.
International Search Report and Written Opinion for International Application No. PCT/US21/39872, dated Oct. 27, 2021, p. 1-14.
International Search Report and Written Opinion for International Application No. PCT/US21/48566, dated Dec. 10, 2021, p. 1-10.
Filsinger, et al., “Characterizing the portability of RecT-mediated oligonucleotide recombination”, bioRxiv, Apr. 15, 2020, doi:10.1101/2020.04.14.041095, p. 1-25.
Nelson, et al., “Engineered pegRNAs improve prime editing efficiency”, Nature Biotechnology, Jul. 25, 2021, doi.org/10.1038/s41587-021-01039-7, p. 1-14.
Yu, et al., “Improved delivery of Cas9 protein/gRNA complexes using lipofectamine CRISPRMAX”, Biotechnol Lett, Feb. 18, 2016, doi:10.1007/s10529-016-2064-9, p. 919-929.
Bengali, et al., “Gene Delivery Through Cell Culture Substrate Adsorbed DNA Complexes”, Biotechnol Bioeng., May 5, 2005, doi:10.1002/bit.20393, p. 1-23.
Segura, et al., “Substrate-mediated DNA delivery: role of the cationic polymer structure and extent of modification”, Journal of Controlled Release, Aug. 9, 2003, doi:10.1016/j.jconrel.2003.08.003, p. 69-84.
Takahashi, et al., “Integration of CpG-free DNA induces de novo methylation of CpG islands in pluripotent stem cells,” Science, May 5, 2017, vol. 356, No. 6337, pp. 1-7.
Chen, et al., “Human Pluripotent Stem Cell Culture: Considerations for Maintenance, Expansion, and Therapeutics”, Cell Stem Cell, Jan. 2, 2014, doi.org/10.1016/j.stem.2013.12.005, p. 13-26.
Fayazpour, F., “Exploring New Applications For Photophysically Encoded Microcarriers”, Ghent University Faculty of Pharmaceutical Sciences, Thesis Submission, Sep. 2008, 169 pages.
Cheung, et al., “Unlinking the methylome pattern from nucleotide sequence, revealed by large-scale in vivo genome engineering and methylome editing in medaka fish,” PLoS Genetics, Dec. 21, 2017, vol. 13, No. 12, pp. 1-25.
Elvin, et al., “Modified bacteriophage lambda promoter vectors for overproduction of proteins in Escherichia coli”, Gene, 87, Sep. 15, 1989, p. 123-126.
Segall-Shapiro, et al., “Engineered promoters enable constant gene expression at any copy number in bacteria”, Nature Biotechnology, vol. 36, No. 4, Mar. 19, 2018, p. 352-363.
Xing, et al., “A CRISPR/Cas9 toolkit for multiplex genome editing in plants”, BMC Plant Biology, 2014, p. 1-12.
Sun, et al., “A Single Multiplex crRNA Array for FnCpf1-Mediated Human Genome Editing,” Molecular Therapy, Aug. 1, 2018, vol. 26, No. 8, pp. 2070-2076.
Kurata, et al., “Highly multiplexed genome engineering using CRISPR/Cas9 gRNA arrays,” PLoS ONE, Sep. 17, 2018, vol. 13, No. 9, pp. 1-17.
Hubmann, et al., “Natural and Modified Promoters for Tailored Metabolic Engineering of the Yeast Saccharomyces cerevisiae”, Methods in Molecular Biology, vol. 1152, doi:10.1007/978-1-4939-0563-8_2, p. 17-42.
Unciti-Broceta, et al., “Combining Nebulization-Mediated Transfection and Polymer Microarrays for the Rapid Determination of Optimal Transfection Substrates”, Journal of Combinatorial Chemistry, vol. 10, No. 2, Feb. 5, 2008, p. 179-184.
Fayazpour, et al., “Evaluation of Digitally Encoded Layer-by-layer Coated Microparticles as Cell Carriers”, Advanced Functional Materials, Sep. 1, 2008, p. 2716-2723.
UniProtKB/TrEMBL, “A0A1G4WF58_9FIRM”, Nov. 22, 2017, retrieved from Internet: https://www.uniprot.org/uniprot/A0A1G4WF58.txt, pp. 1-3.
Natsume, et al., “Conditional Degrons for Controlling Protein Expression at the Protein Level”, Annual Review of Genetics, vol. 51, 2017, doi.org/10.1146/annurev-genet-120116-024656, p. 83-104.
Chen, et al., “Enhancing the copy number of episomal plasmids in Saccharomyces cerevisiae for improved protein production”, FEMS Yeast Research, Apr. 25, 2012, doi:10.1111/j.1567-1364.2012.00809.x; p. 598-607.
Price, et al., “Expanding and understanding the CRISPR toolbox for Bacillus subtilis with MAD7 and dMAD7”, Biotechnology and Bioengineering, Feb. 19, 2020, doi:10.1002/bit.27312, p. 1805-1816.
International Search Report and Written Opinion for International Application No. PCT/US21/43534, dated Nov. 10, 2021, p. 1-16.
International Search Report and Written Opinion for International Application No. PCT/US20/26095, dated Jul. 17, 2020, p. 1-10.
Anzalone, et al., “Programmable large DNA deletion, replacement, integration, and inversion with twin prime editing and site-specific recombinases”, bioRxiv, Nov. 2, 2021, doi:10.1101/2021.11.01.466790, p. 1-51.
Horwitz, et al., “Efficient Multiplexed Integration of Synergistic Alleles and Metabolic Pathways in Yeasts via CRISPR-Cas”, Cell Systems 1, Jul. 29, 2015, doi:10.1016/j.cels.2015.02.001, p. 88-96.
Jillette, et al., “Split Selectable Markers”, Nature Communications, Oct. 31, 2019, doi:10.1038/s41467-019-12891-2, p. 1-8.
Pavankumar, “Inteins: Localized Distribution, Gene Regulation, and Protein Engineering for Biological Applications”, Microorganisms, Feb. 28, 2018, doi:10.3390/microorganisms6010019, p. 1-15.
Choi, et al., “Precise genomic deletions using paired prime editing”, bioRxiv, Jan. 2, 2021, doi:10.1101/2020.12.30.424891, p. 1-32.
Lin, et al., “High-efficiency prime editing with optimized, paired pegRNAs in plants”, Nature Biotechnology, Mar. 25, 2021, doi:10.1038/s41587-021-00868-w, p. 1-12.
Bolukbasi, et al., “Orthogonal Cas9-Cas9 chimeras provide a versatile platform for genome editing”, Nature Communications, Nov. 19, 2018, doi:10.1038/s41467-018-07310-x, p. 1-12.
Kweon, et al., “Fusion guide RNAs for orthogonal gene manipulation with Cas9 and Cpf1”, Nature Communications, Nov. 23, 2017, doi:10.1038/s41467-017-01650-w, p. 1-6.
International Search Report and Written Opinion for International Application No. PCT/US21/48578, dated Feb. 15, 2022, p. 1-12.
International Search Report and Written Opinion for International Application No. PCT/US21/61156, dated Mar. 3, 2022, p. 1-13.
International Search Report and Written Opinion for International Application No. PCT/US20/53873, dated Jan. 14, 2021, p. 1-16.
Related Publications (1)
Number Date Country
20220162646 A1 May 2022 US
Continuations (3)
Number Date Country
Parent 16421783 May 2019 US
Child 17588256 US
Parent 15896444 Feb 2018 US
Child 16421783 US
Parent 15632001 Jun 2017 US
Child 15896444 US