TETRAHYDROCANNABINOLIC ACID (THCA) SYNTHASE VARIANTS, AND MANUFACTURE AND USE THEREOF

Information

  • Patent Application
  • 20230407239
  • Publication Number
    20230407239
  • Date Filed
    November 11, 2021
    3 years ago
  • Date Published
    December 21, 2023
    11 months ago
Abstract
Provided herein is a recombinant vector that comprises a promoter, a secretion tag gene of interest (GOI), and optionally a purification and/or a detection tag. The GOI comprises a polynucleotide sequence as set forth in any one of SEQ ID NOs: 2-57 or a polynucleotide sequence with at least 90%, 95%, 98%, 99% or 99.9% identity with any one of SEQ ID NOs: 2-57. Also, disclosed are the sequences of polypeptides that result from expression of a GOI by a host expression system, and a method of producing cannabinoid compounds.
Description
BACKGROUND

Cannabinoids, the terpenophenolic compounds found in Cannabis sativa, have proven therapeutic potential. For example, cannabidiol (CBD) is a potent antioxidant and anti-inflammatory compound and may provide protection against acute and chronic neurodegeneration. It is found in high concentrations in hemp and acts as a high-affinity α2-adrenergic receptor agonist, a moderate-affinity 5-HT1A receptor antagonist, and a low-affinity CB1 receptor antagonist. CBD may also have anti-depressant activity. Cannabichromene (CBC) possesses anti-inflammatory, anti-fungal, and anti-viral properties. Thus, cannabinoids are regarded as promising therapeutic agents for the treatment of various diseases.


The varins are a class of cannabinoids that are structurally different from the classical cannabinoids (e.g., Tetrahydrocannabinol (THC), CBD, Cannabigerol (CBG), or CBC and their corresponding carboxylic acids). The varins have a 3-carbon propyl side chain instead of a pentyl (5-carbon) side chain attached to the aromatic ring. Many of the varins are found in very low amounts in the Cannabis plant. Tetrahydrocannabivarin (THCV) is one widely studied cannabinoid varin compounds. THCV can function as an antagonist of THC at CB1 receptors and thus attenuate the psychoactive effects of THC. THCV is being evaluated as a potential therapeutic for the treatment for type 2 diabetes, by increasing insulin sensitivity and improving glucose tolerance in patients with type-2 diabetes (Wargent et al., Nutr Diabetes., May; 3(5): e68 (2013)). THCV has also shown promise for the treatment of epilepsy and in reducing tremors associated with Parkinson's disease.


However, traditional chemical synthesis of many cannabinoids, particularly the low abundance varins is either suboptimal, non-existing, and/or does not permit the cost-effective manufacture of cannabinoids in high yields and at commercial scales. Thus, there is a need to develop a more effective system for industrial-scale manufacture of cannabinoid compounds. Bio-catalysis provides one cost-effective method for producing many cannabinoids found in the Cannabis plant. The process requires contacting a cannabinoid synthase enzyme with a substrate of the enzyme to produce cannabinoids and the low abundance varins at commercial scale and in pharmaceutical grade purity. While existing protocols, such as the one described in U.S. Pat. No. 10,336,978 is focused on using cannabinoid synthase enzymes, namely, THCA synthase and CBDA synthase found in the Cannabis plant, for the bio-catalytic manufacture of cannabinoids, the focus of this application is on mutants of THCA synthase that have improved yields and enhanced kinetic rates compared to the plant (botanical) THCA synthase enzyme.


SUMMARY OF THE INVENTION

In one aspect, the present invention provides a recombinant vector that comprises a promoter, secretion tag, gene of interest (GOI), and optionally a purification and/or a detection tag, wherein the gene of interest comprises a polynucleotide sequence encoding a polypeptide with at least 90%, 95%, 98%, or 99.9% sequence identity to the sequence as set forth in any one of SEQ ID NOs: 84-139.


In another aspect, the present invention provides a polynucleotide comprising the polynucleotide sequence as set forth in any one of SEQ ID NO: 2-57, or a polynucleotide comprising a polynucleotide sequence comprising at least 90%, 95%, 98%, 99%, or 99.9% sequence identity to the sequence as set forth in any one of SEQ ID NOs: 2-57.


In some aspects, the present invention provides a polypeptide comprising the amino acid sequence of any one of SEQ ID NOs: 84-139 or a polypeptide comprising an amino acid sequence having at least 90%, 95%, 98%, or 99.9% sequence identity to a sequence as set forth in any one of SEQ ID NOs: 84-139.


In other aspects, the present invention provides a gene of interest operably linked to a promoter which is a polynucleotide as set forth in any one of SEQ ID NOs: 71-80.


In one aspect, the present invention provides a promoter having a polynucleotide sequence comprising at least 90%, 95%, 98%, or 99% sequence identity to the sequence of SEQ ID NOs: 71-80.


In another aspect, the present invention provides a promoter that is a constitutive promoter or an inducible promoter. In one embodiment, the promoter is selected from a group consisting of an alcohol oxidase 1 promoter (pAOX1), an alcohol oxidase 2 promoter (pAOX2), a dihydroxyacetone kinase promoter (pDAK), an S-hydroxymethyl-glutathione hydrolase promoter (pFGH), an NAD+-dependent formylglutathione dehydrogenase promoter (pFDH), a fructose 1,6-bisphosphate aldolase promoter (pFBA2), a peroxisomal membrane signal receptor PTS1 promoter (pPEX5), a peroxisomal protein Pex8p promoter (pPEX8), an alcohol dehydrogenase 2 promoter (pADH2) and a peroxin Pex14p promoter (pPEX14).


In some aspects, present invention provides a recombinant vector comprising a secretion tag as set forth in any one of SEQ ID NOs: 58-67 and 154 or a secretion tag whose sequence is at least 90%, 95%, 98%, 99%, or 100% identical to SEQ ID NOs: 58-67 and 154.


In other aspects, the present invention provides a recombinant vector comprising a purification/detection tag as set forth in any one of SEQ ID NOs: 68-70 and 153. In one embodiment, the purification/detection tag is selected from the group consisting of a human influenza hemagglutinin (HA) tag, a His 6 tag and an HN tag.


In one aspect, the present invention provides a host expression system comprising a recombinant vector that comprises a promoter, secretion tag, gene of interest (GOI), and optionally a purification and/or a detection tag.


In another aspect, the present invention provides a host expression system comprising a microbial cell, a yeast cell, a plant cell, or an animal cell.


In some aspects, the present invention provides a host expression system comprising a yeast cell is selected from the group consisting of Pichia pastoris, Pichia angusta, Pichia guillermordii, Pichia methanolica, Pichia inositovera, Hansenula polymorpha, Candida boidinii, and Yarrowia lipolytica.


In one aspect, the present invention provides a method of producing a Tetrahydrocannabinolic acid (THCA) synthase or its variants. In another embodiment, the invention provides a method for producing a cannabinoid by contacting a cannabinoid precursor with an enzyme that is expressed by the recombinant vector of this disclosure.





BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 shows the expression cassette for a recombinant vector according to one embodiment of the invention.



FIGS. 2A-2I show the high-pressure liquid chromatography (HPLC) plots of select cannabinoids from the enzymatic assay as well as those of cannabinoid references standards.





DETAILED DESCRIPTION

It is to be understood that the invention is not limited to methodologies, protocols, cell lines, animal species or genera, compounds, polymers, and reagents described, as such may vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to limit the scope of the present invention, which will be limited only by the appended claims.


Terms as set forth hereinafter are generally to be understood in their common sense unless indicated otherwise.


Unless otherwise defined, scientific and technical terms used herein have the meanings that are commonly understood by those of ordinary skill in the art. In the event of any latent ambiguity, definitions provided herein take precedent over any dictionary or extrinsic definition. Unless otherwise required by context, singular terms shall include pluralities and plural terms shall include the singular. The use of “or” means “and/or” unless stated otherwise. The use of the term “including,” as well as other forms, such as “includes” and “included,” is not limiting.


That the disclosure may be more readily understood, select terms are defined below.


The articles “a” and “an” are used herein to refer to one or to more than one (i.e., to at least one) of the grammatical object of the article. By way of example, “an element” means one element or more than one element.


“About” as used herein when referring to a measurable value such as an amount, a temporal duration, and the like, is meant to encompass variations of ±20% or ±10%, more preferably ±5%, even more preferably ±1%, and still more preferably ±0.1% from the specified value, as such variations are appropriate to perform the disclosed methods.


The term “comprising” or “comprises” is intended to mean that the compositions and methods include the recited elements, but do not exclude others. “Consisting essentially of” when used to define compositions and methods, shall mean excluding other elements of any essential significance to the combination. For example, a composition consisting essentially of the elements as defined herein would not exclude other elements that do not materially affect the basic and novel characteristic(s) of the claimed invention. “Consisting of” shall mean excluding more than a trace amount of other ingredients and substantial method steps recited. Embodiments defined by each of these transition terms are within the scope of this invention.


The term “isolated” means altered or removed from the natural state. For example, a nucleic acid or a peptide naturally present in a living animal is not “isolated,” but the same nucleic acid or peptide partially or completely separated from the coexisting materials of its natural state is “isolated.” An isolated nucleic acid or protein can exist in substantially purified form, or can exist in a non-native environment such as, for example, a host cell.


As used herein, the term “precursor” refers to a compound that participates in a chemical reaction that produces another compound. In one embodiment, the cannabinoid precursor refers to a compound that participates in a reaction to produce another compound. For example, CBGA is a precursor to THCA, CBDA, and CBCA. In another example, CBGVA is a precursor to THCVA, CBDVA, and CBCVA.


The terms “nucleic acid” and “polynucleotide” as used herein are interchangeable. Nucleic acids and polynucleotides are polymers of nucleotides.


The nucleic acids of the present disclosure may include naturally occurring bases including adenine, guanine, cytosine, thymidine, and uracil. The sequences of nucleic acids may also contain modified bases. Examples of such modified bases include aza- and deaza-adenine, guanine, cytosine, thymidine and uracil; and xanthine and hypoxanthine. In one embodiment, the polynucleotide or nucleic acid molecule is a complementary DNA (cDNA).


A polypeptide, in one embodiment, may include various structural forms of the primary protein. For example, a polypeptide of the disclosure may be in the form of acidic or basic salts or in neutral form. In addition, individual amino acid residues may be modified by oxidation or reduction. The proteins and polypeptides of the present disclosure may also include truncations, analogs and homologs of the proteins and polypeptides, as described herein, having substantially the same function as the proteins or polypeptides of the present disclosure, such as having THCA synthase activity.


The term “sequence identity” or “identity” refers to the maximum correspondence relation of the amino acid or nucleotide residues in the two sequences that are aligned for comparison. The length for identity comparison may extend to the full length of a nucleotide or polypeptide sequence, or a fragment of a nucleotide or polypeptide sequence that is at least about nine residues, at least about 20 to 24 residues, at least about 28 to 32 residues, and preferably at least about 36 or greater than 36 residues. Several different algorithms known in the art that can be used to measure sequence identity.


In one embodiment, the sequence identity is in a range from about 50% to about 99.9%, about 55% to about 99.9%, about 60% to about 99.9%, about 65% to about 99.9%, about 70% to about 99.9%, about 75% to about 99.9%, about 80% to about 99.9%, about 85% to about 99.9%, about 90% to about 99.9%, and about 95% to about 99.9%.


In one embodiment, the sequence identity between two polynucleotides or two polypeptides is about 99.5%, about 98.5%, about 97.5%, about 96.5%, about 95.5%, about 94.5%, about 93.5%, about 92.5%, about 91.5%, about 90.5%, about 89.5%, about 88.5%, about 87.5%, about 86.5%, about 85.5%, about 84.5%, about 84.5%, about 83.5%, about 82.5%, about 81.5%, or about 80.5%.


In one embodiment, the sequence identity between two polynucleotides or two polypeptides is about 99%, about 98%, about 97%, about 96%, about 95%, about 94%, about 93%, about 92%, about 91%, about 90%, about 89%, about 88%, about 87%, about 86%, about 85%, about 84%, about 83%, about 82%, about 81%, or about 80%.


In one embodiment, the sequence identity between two polynucleotides or two polypeptides is about 99.9%, about 99.8%, about 99.7%, about 99.6%, about 99.5%, about 99.4%, about 99.3%, about 99.2%, about 99.1%, or about 99%.


The term “cannabinoid product” or “cannabinoid compound” is intended to mean any simple or complex substance or compound of natural, semi-synthetic, or synthetic origin, which can act on or modulate the activity of a cannabinoid receptor in a subject. In some embodiments, the cannabinoid product is an agonist of a cannabinoid receptor. In some embodiments, the cannabinoid product is an antagonist of a cannabinoid receptor. In yet another embodiment, the cannabinoid product is an inverse-agonist or an inverse-antagonist of a cannabinoid receptor. In one embodiment, the cannabinoid product comprises phytocannabinoids, endogenous cannabinoids (endocannabinoids), bio-synthetic cannabinoids, or synthetic cannabinoids. In one embodiment, the cannabinoid product comprises a pentyl side chain on the aromatic ring. Certain cannabinoids have a propyl side chain. In this application, this class of cannabinoids may be referred to as “varin.”


Ranges: throughout this disclosure, various aspects of the embodiments can be presented in a range format. It should be understood that the description in range format is merely for convenience and brevity and should not be construed as an inflexible limitation. Accordingly, the description of a range should be considered to have specifically disclosed all the possible subranges as well as individual numerical values within that range. For example, description of a range such as from 1 to 6 should be considered to have specifically disclosed subranges such as from 1 to 3, from 1 to 4, from 1 to 5, from 2 to 4, from 2 to 6, from 3 to 6 etc., as well as individual numbers within that range, for example, 1, 2, 2.7, 3, 4, 5, 5.3, and 6. This applies regardless of the breadth of the range. Unless otherwise explicitly stated to the contrary, a range that is disclosed also includes the endpoints of the range.


The present invention relates to mutants of THCA synthase as well as recombinant vectors that comprises a polynucleotide sequences that code for THCA synthase mutants. The THCA synthase mutants of the invention have advantages over THCA synthase found in the Cannabis plant. For instance, the mutant THCA synthases described here have better catalytic output compared to THCA synthase in the Cannabis plant. In one embodiment, the catalytic activity of the mutant THCA synthases are at least 1-50 fold greater, at least 5-fold greater, at least 10-fold greater, at least 15-fold greater, at least 20-fold greater, at least 25-fold greater, at least 30-fold greater, at least 35-fold greater, at least 40-fold greater, or at least 45-fold greater, than the catalytic activity of the native THCA synthase from the Cannabis plant.


In some embodiment, for certain mutants, the catalytic activity is between 5-10 fold greater than the catalytic activity of THCA synthase enzyme from the Cannabis plant. In one embodiment the catalytic activity of the mutants is between 11-15 fold greater, between 16-20-fold greater, between 21-25 fold greater, between 26-30 fold greater, between 31-35 fold greater, between 36-40 fold greater, between 41-45 fold greater, or between 46-50 fold greater than the catalytic activity of native THCA synthase from the Cannabis plant.


In other embodiments, for certain mutants, the catalytic activity is about 5 to about 10 fold greater than the catalytic activity of a THCA synthase having the sequence of SEQ ID NO: 83 or THCA synthase of the Cannabis plant (e.g., native THCA synthase from a wild-type Cannabis plant). In one embodiment, the catalytic activity of the mutant is about 11 to about 15 fold greater, about 16 to about 20 fold greater, about 21 to about 25 fold greater, about 26 to about 30 fold greater, about 31 to about 35 fold greater, about 36 to about 40 fold greater, about 41 to about 45 fold greater, or about 46 to about 50 fold greater than the catalytic activity of a THCA synthase having the sequence of SEQ ID NO: 83 or THCA synthase of the Cannabis plant (e.g., native THCA synthase from a wild-type Cannabis plant).


Non-limiting cannabinoid products include tetrahydrocannabinol (THC), cannabidiol (CBD), olivetol, cannabinol (CBN), cannabigerol (CBG), cannabichromene (CBC), cannabicyclol (CBCL), nabilone, tetrahydrocannabinolic acid (THCA), cannabichromenic acid (CBCA), cannabicyclol carboxylic acid (CBCLA), cannabigerolic acid (CBGA), cannabidiolic acid (CBDA), cannabinolic acid (CBNA), tetrahydrocannabivarin (THCV), cannabivarin (CBV), cannabidivarin (CBDV), cannabigerovarin (CBGV), cannabichromevarin (CBCV), cannabicyclovarin (CBCLV), cannabicyclovarinic acid (CBCLVA), cannabigerovarinic acid (CBGVA), tetrahydrocannabivarinic acid (THCVA), cannabichrome varinic acid (CBCVA), cannabidivarinic acid (CBDVA), as well as the prodrugs and pharmaceutically acceptable salts of these cannabinoids. Exemplary prodrugs include alkyl ethers, haloalkyl ethers, alkyl esters, haloalkyl esters, and aromatic esters, for example CBD difluoromethyl ether or CBD methyl ether.


As used herein, the term “cannabinoid varin compound” refers to cannabinoid compounds comprising a propyl side chain attached to an aromatic ring. In one embodiment, the cannabinoid varin compound is psychoactive. In another embodiment, the cannabinoid varin compound is non-psychoactive. Non-limiting examples of cannabinoid varin compounds include tetrahydrocannabivarin (THCV), cannabivarin (CBV), cannabidivarin (CBDV), cannabigerovarin (CBGV), cannabichromevarin (CBCV), cannabicyclovarin (CBCLV), cannabicyclovarinic acid (CBCLVA), cannabigerovarinic acid (CBGVA), tetrahydrocannabivarinic acid (THCVA), cannabichromevarinic acid (CBCVA), and cannabidivarinic acid (CBDVA), as well as natural or synthetic molecules that have a basic cannabinoid varin structure and are modified synthetically to provide a cannabinoid analog.


The polynucleotide of interest can be cloned into a vector of interest to produce a construct. In an embodiment, the vector is an expression vector. Examples of expression vectors include but are not limited to cosmids, plasmids, or modified viruses (e.g. replication defective retroviruses, adenoviruses, and adeno-associated viruses), so long as the vector is compatible with the host cell used. The expression vectors are suitable for transformation of a host cell, which means that the expression vectors contain a polynucleotide of the application and regulatory sequences selected on the basis of the host cells to be used for expression. “Operably linked” is intended to mean that the nucleic acid is linked to regulatory sequences in a manner which allows expression of the nucleic acid. In some embodiments, the isolated and/or purified nucleic acid molecules, polynucleotides or vectors, constructs or in vitro expression systems comprising these isolated and/or purified nucleic acid molecules, may be used to create transgenic or recombinant organisms or recombinant cells (e.g. optionally cells of recombinant organisms) that produce polypeptides with cannabinoid synthase activity and/or modulated levels of polypeptides with cannabinoid synthase activity.


In some embodiments, a polynucleotide comprising a sequence encoding a polypeptide described herein, or a vector comprising the polynucleotide, can be codon-optimized e.g., for use in a particular host cell, such as, for example, a bacterial or plant cell.



FIG. 1 schematically shows an expression cassette used for the manufacture of a THCA synthase enzyme. In one embodiment, the expression cassette comprises a gene of interest (GOI) comprising the sequence as set forth in any one of nucleotide sequences of SEQ ID NOs: 2-57 are operably linked to a secretion tag according SEQ ID NOs: 58-67 and 154, which in turn is operably linked to a promoter sequence as set forth in any one of SEQ ID NOs: 71-80. The expression cassette is inserted into the yeast genome and is used to express a cannabinoid synthase enzyme. In some cases, a purification/detection tag as set forth in any one of SEQ ID NOs: 68-70 and 153 is operably linked downstream of the gene of interest to enable purification and/or detection of the protein expressed by the gene of interest.


In one embodiment, the disclosure also provides a recombinant vector comprising a polynucleotide sequence for the gene of interest (GOI) as set forth in any one of SEQ ID NOs: 2-57. In one embodiment, the disclosure also provides a recombinant vector comprising a polynucleotide sequence with at least 90%, 95%, 98%, or 99.9% identity with any one of SEQ ID NOs: 2-57.


In one embodiment, a polynucleotide comprising the polynucleotide sequence as set forth in any one of SEQ ID NOs: 2-57 encodes a polypeptide of SEQ ID NOs: 84-139. In one embodiment, the polynucleotide sequence encodes a polypeptide with at least 90%, 95%, 98%, or 99.9% identity with any one of SEQ ID NOs: 84-139.


In one embodiment, the polynucleotide sequence is operably linked to a regulatory sequence for the transcription and translation of the gene of interest (GOI). Suitable regulatory sequences may be derived from a variety of sources, including bacterial, fungal, viral, mammalian, or insect genes (for example, see the regulatory sequences described in Goeddel, Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, CA (1990), incorporated herein by reference in its entirety). Selection of appropriate regulatory sequences is dependent on the host cell chosen, as discussed below, and may be readily accomplished by one of ordinary skill in the art. Examples of such regulatory sequences include: a transcriptional promoter and enhancer or RNA polymerase binding sequence, a ribosomal binding sequence, including a translation initiation signal. Additionally, depending on the host cell chosen and the vector employed, other sequences, such as an origin of replication, additional DNA restriction sites, enhancers, and sequences conferring inducibility of transcription may be incorporated into the expression vector.


In one embodiment, the regulatory sequence is a promoter. The promoter is functional in the host cell or host expression system. In another embodiment, the promoter is a constitutive promoter or an inducible promoter. In one embodiment, the promoter is selected from a group consisting of an alcohol oxidase 1 promoter (pAOX1), an alcohol oxidase 2 promoter (pAOX2), a dihydroxyacetone kinase promoter (pDAK), an S-hydroxymethyl-glutathione hydrolase promoter (pFGH), a NAD+- dependent formylglutathione dehydrogenase promoter (pFDH), a peroxin Pex14p promoter (pPEX14p), a fructose 1,6-bisphosphate aldolase promoter (pFBA), a peroxisomal membrane signal receptor PTS1 promoter (pPEX5), an alcohol dehydrogenase 2 promoter (pADH2), and a peroxisomal protein Pex8p promoter (pPEX8).


In one embodiment, the promoter is a polynucleotide of SEQ ID NOs: 71-80 or a polynucleotide that has at least 90%, 95%, 98%, or 99% identity with any one of SEQ ID NOs: 71-80.


For some embodiments, the promoter is a polynucleotide that has at least 99% identity to a polynucleotide as set forth in any one of SEQ ID NOs: 71-80.


According to one embodiment, the promoter is a polynucleotide that has at least 98% identity to a polynucleotide as set forth in any one of SEQ ID NOs: 71-80.


In one embodiment, the promoter is a polynucleotide that has at least 97% identity to a polynucleotide as set forth in any one of SEQ ID NOs: 71-80.


In one embodiment, the promoter is a polynucleotide that has at least 96% identity to a polynucleotide as set forth in any one of SEQ ID NOs: 71-80.


In one embodiment, the promoter is a polynucleotide that has at least 95% identity to a polynucleotide as set forth in any one of SEQ ID NOs: 71-80.


In one embodiment, the promoter is a polynucleotide that has at least 90%, 91%, 92%, 93%, 94%, or 95% identity to a polynucleotide as set forth in any one of SEQ ID NOs: 71-80.


In one embodiment, the promoter is a polynucleotide sequence selected from a group consisting of SEQ ID NOs: 71-80.


In one embodiment, the vector encodes a polynucleotide for a secretion peptide. In one embodiment, the secretion peptide is encoded by a polynucleotide sequence of SEQ ID NOs: 58-67 and 154 or having at least 90%, 95%, 98%, 99%, or 100% identity with a polynucleotide as set forth in any one of SEQ ID NOs: 58-67 and 154.


In one embodiment, the secretion peptide is encoded by a polynucleotide sequence having at least 99% identity with any one of SEQ ID NOs: 58-67 and 154.


In one embodiment, the secretion peptide is encoded by a polynucleotide sequence having at least 98% identity with any one of SEQ ID NOs: 58-67 and 154.


In one embodiment, the secretion peptide is encoded by a polynucleotide sequence having at least 97% identity with any one of SEQ ID NOs: 58-67 and 154.


In one embodiment, the secretion peptide is encoded by a polynucleotide sequence having at least 96% identity with any one of SEQ ID NOs: 58-67 and 154.


In one embodiment, the secretion peptide is encoded by a polynucleotide sequence having at least 95% identity with any one of SEQ ID NOs: 58-67 and 154.


In one embodiment, the secretion peptide is encoded by a polynucleotide sequence having at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, or at least 95% identity with any one of SEQ ID NOs: 58-67 and 154.


In one embodiment, the secretion tag is encoded by a polynucleotide sequence that represents a codon optimized version of SED ID NO: 58. In some embodiments, the sequence for the codon optimized polynucleotide sequence is as follows (SEQ ID NO: 154):









ATGAGATTCCCATCCATCTTCACTGCTGTTTTGTTCGCTGCTTCTTCC





GCTTTGGCTGCTCCAGTTAACACTACTACTGAGGACGAGACTGCTCAG





ATTCCAGCTGAAGCTGTTATTGGTTACTCCGACTTGGAAGGTGACTTC





GACGTTGCTGTTTTGCCATTCTCCAACTCCACTAACAACGGTTTGTTG





TTCATCAACACTACAATCGCTTCCATTGCTGCTAAAGAAGAGGGAGTT





TCCTTGGAGAAGAGAGAGGCTGAAGCT






In another embodiment, the secretion peptide comprises a polypeptide sequence as set forth in any one of SEQ ID NOs: 140-149 or a polypeptide sequence having at least 90%, 95%, 98%, 99%, or 100% identity with any one of SEQ ID NOs: 140-149.


In one embodiment, the secretion peptide comprises a polypeptide sequence having at least 99% identity with any one of SEQ ID NOs: 140-149.


In one embodiment, the secretion peptide comprises a polypeptide sequence having at least 98% identity with any one of SEQ ID NOs: 140-149.


In one embodiment, the secretion peptide comprises a polypeptide sequence having at least 97% identity with any one of SEQ ID NOs: 140-149.


In one embodiment, the secretion peptide comprises a polypeptide sequence having at least 96% identity with any one of SEQ ID NOs: 140-149.


In one embodiment, the secretion peptide comprises a polypeptide sequence having at least 95% identity with any one of SEQ ID NOs: 140-149.


In one embodiment, the secretion peptide comprises a polypeptide sequence having at least 90% identity, at least 91% identity, at least 92% identity, at least 93% identity, or at least 94% identity with any one of SEQ ID NOs: 140-149.


In one embodiment, the vector encodes a polynucleotide for a purification and/or a detection tag. In another embodiment, the purification and/or detection tag is selected from the group consisting of a human influenza hemagglutinin (HA) tag, a His 6 tag, and an HN tag. In one embodiment, the purification tag is encoded by a polynucleotide sequence as set forth in any one of SEQ ID NOs: 68-70 and 153. In another embodiment, the purification tag encodes a polypeptide as set forth in any one of SEQ ID NOs: 150-152.


In one embodiment, the recombinant vector is a plasmid. Exemplary plasmids include the pPIC 3.5 and pL, represented by SEQ ID NOs: 81 and 82 respectively.


In one embodiment, the disclosure relates to a polynucleotide for the GOI as set forth in any one of SEQ ID NO: No: 2-57. In another aspect, the disclosure relates to a polynucleotide of the GOI comprising a sequence having at least 90%, 95%, 98%, 99%, or 99.9% identity with any one of SEQ ID NOs: 2-57.


In one embodiment, the polynucleotide comprises a sequence having at least 99.9% identity with any one of SEQ ID NOs: 2-57.


In one embodiment, the polynucleotide comprises a sequence having at least 98% identity with any one of SEQ ID NOs: 2-57.


In one embodiment, the polynucleotide comprises a sequence having at least 97% identity with any one of SEQ ID NOs: 2-57.


In one embodiment, the polynucleotide comprises a sequence having at least 96% identity with any one of SEQ ID NOs: 2-57.


In one embodiment, the polynucleotide comprises a sequence having at least 95% identity with any one of SEQ ID NOs: 2-57.


In one embodiment, the polynucleotide comprises a sequence having at least 90%, at least 91%, at least 92%, at least 93%, or at least 94% identity with any one of SEQ ID NOs: 2-57.


In another aspect, the polynucleotide for the gene of interest (GOI) is selected from SEQ ID NOs: 2-57.


In one embodiment, a polynucleotide of SEQ ID NOs: 2-57 encodes a polypeptide of SEQ ID NOs: 84-139. In one embodiment, a polynucleotide sequence having 90%, 95%, 98%, or 99.9% identity to SEQ ID NOs: 2-57 encodes a polypeptide with a sequence having at least at least 90%, 95%, 98%, or 99.9% identity with any one of SEQ ID NOs: 84-139.


In one embodiment, the polypeptide has 99.9% identity to a polypeptide of SEQ ID NOs: 84-139.


In another embodiment, the polypeptide has 98% identity to a polypeptide of SEQ ID NOs: 84-139.


In another embodiment, the polypeptide has 97% identity to a polypeptide of SEQ ID NOs: 84-139.


In another embodiment, the polypeptide has 96% identity to a polypeptide of SEQ ID NOs: 84-139.


In another embodiment, the polypeptide has 95% identity to a polypeptide of SEQ ID NOs: 84-139.


In one embodiment, the polypeptide has 90% identity, 91% identity, 92% identity, 93% identity, or 94% identity to a polypeptide of SEQ ID NOs: 84-139.


In another aspect, the present invention provides a polypeptide comprising the amino acid sequence of any one of SEQ ID NOs: 84-139 or an amino acid sequence comprising at least 90% identity, 91% identity, 92% identity, 93% identity, or 94% identity to a polypeptide of SEQ ID NOs: 84-139.


In some embodiments, a polypeptide of the present invention comprises an alteration at one or more (e.g., several) amino acids in the polypeptide, wherein at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 162, 164, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 193, 194, 195, 196, 197, 198, 199, 200, or more amino acids are altered. In one embodiment, the alteration is relative to the reference polypeptide of SEQ ID NO: 83, wherein the alteration comprises one or more substitutions, insertions, deletions, and/or additions in the polypeptide relative to the reference polypeptide of SEQ ID NO: 83.


In other embodiments, the invention includes an isolated, recombinant, substantially pure, or non-naturally occurring polypeptide having THCA synthase activity, wherein the polypeptide comprises an amino acid sequence comprising at least about 60%, 65%, 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity to any one of SEQ ID NOs: 84-139. In one embodiment, the polypeptide does not comprise the sequence of SEQ ID NO: 83. In another embodiment, the THCA synthase activity is at least about 2 to about 30-fold greater than the activity of a naturally occurring Cannabis synthase or a synthase having the sequence of SEQ ID NO: 83.


The recombinant vector of the disclosure may also contain a selectable “marker gene”, which facilitates the selection of host cells transformed or transfected with a recombinant cassette of the application. Examples of selectable marker genes are genes encoding for proteins such as G418 and hygromycin which confer resistance to certain drugs, β-galactosidase, chloramphenicol acetyltransferase, firefly luciferase, or an immunoglobulin or portion thereof such as the Fc portion of an immunoglobulin, optionally IgG. Transcription of the selectable marker gene is monitored by changes in the concentration of the selectable marker protein such as β-galactosidase, chloramphenicol acetyltransferase, or firefly luciferase. If the selectable marker gene encodes a protein conferring antibiotic resistance, such as neomycin, resistance transformant cells can be selected with G418. Cells that have incorporated the selectable marker gene will survive, while the other cells die. In another embodiment, the selectable marker is introduced to the host expression system on a separate vector from the recombinant vector of interest.


The recombinant vectors of the disclosure can be introduced into a host cell or a host expression system to produce a transformed host cell. Prokaryotic and/or eukaryotic cells can be transformed with nucleic acid by, for example, electroporation or calcium chloride-mediated transformation. For example, the vector can be introduced into mammalian cells via conventional techniques such as calcium phosphate or calcium chloride co-precipitation, DEAE-dextran mediated transfection, lipofectin, viral mediated methods, electroporation or microinjection. Suitable methods for transforming and transfecting cells can be found in Sambrook et al. (Molecular Cloning: A Laboratory Manual, 3rd Edition, Cold Spring Harbor Laboratory Press, 2001), and other laboratory textbooks.


Suitable host cells include a wide variety of eukaryotic cells and prokaryotic cells. For example, the nucleic acids and proteins of the disclosure may be expressed in plant cells, yeast cells or mammalian cells. In one embodiment, plant cells are of the genus Cannabis, for example Cannabis saliva L, Cannabis indica Lam, and Cannabis ruderalis Janisch, especially Cannabis sativa. Microorganisms are preferably bacteria (e.g. Escherichia coli) or yeast (e.g. Saccharomyces cerevisiae, Pichia pastoris). Insect cells are preferably Spodoptera frugiperda cells. The eukaryotic cell, in one embodiment, is a yeast cell.


Therefore, in another aspect, the disclosure provides a host expression system comprising the recombinant vector as disclosed herein. In one embodiment, the host expression system comprises a microbial cell, a yeast cell, a plant cell, or an animal cell. In another embodiment, the host expression system comprises a yeast cell. In one embodiment, the yeast comprises one or more of Pichia pastoris, Pichia angusta, Pichia guillermordii, Pichia methanolica, Pichia inositovera, Hansenula polymorpha, Candida boidinii, and Yarrowia lipolytica.


Accordingly, also provided herein is a recombinant cell comprising the recombinant vector or the polynucleotide of this disclosure. In one embodiment, the recombinant cell results in production of the polypeptide of this disclosure.


In another embodiment, the present disclosure provides a method of producing a Tetrahydrocannabinolic acid (THCA) synthase or its variants, which comprises expressing the recombinant vector as disclosed above in the host expression system.


In one embodiment, a method of producing a Tetrahydrocannabinolic acid (THCA) synthase or its variants comprises expressing the recombinant vector with a polynucleotide sequence as set forth in any one of SEQ ID NOs: 2-57.


In an embodiment, the method of producing a Tetrahydrocannabinolic acid (THCA) synthase or its variants comprises expressing the recombinant vector with a polynucleotide sequence having at least 90%, 95%, 98%, or 99.9% identity with any one of SEQ ID NOs: 2-57.


In an embodiment, the method of producing a Tetrahydrocannabinolic acid (THCA) synthase or its variants comprises expressing the recombinant vector with a polynucleotide sequence with at least 99.9% identity with any one of SEQ ID NOs: 2-57.


In an embodiment, the method of producing a Tetrahydrocannabinolic acid (THCA) synthase or its variants comprises expressing the recombinant vector with a polynucleotide sequence with at least 98% identity with any one of SEQ ID NOs: 2-57.


In an embodiment, the method of producing a Tetrahydrocannabinolic acid (THCA) synthase or its variants comprises expressing the recombinant vector with a polynucleotide sequence with at least 95% identity with any one of SEQ ID NOs: 2-57.


In an embodiment, the method of producing a Tetrahydrocannabinolic acid (THCA) synthase or its variants comprises expressing the recombinant vector with a polynucleotide sequence with at least 90%identity with any one of SEQ ID NOs: 2-57.


In an embodiment, the method of producing a Tetrahydrocannabinolic acid (THCA) synthase or its variants comprises expressing the recombinant vector with a polynucleotide sequence with at least 85%, 86%, 87%, 88% or 89% identity with any one of SEQ ID NOs: 2-57.


According to an embodiment, the recombinant vector with a polynucleotide sequence as set forth in any one of SEQ ID NOs: 2-57 encodes a polypeptide of SEQ ID NOs: 84-139 or a polypeptide with at least 90%, 95%, 98%, or 99% identity with any one of SEQ ID NOs: 84-139.


In one embodiment, the recombinant vector with a polynucleotide sequence as set forth in any one of SEQ ID NOs: 2-57 encodes a polypeptide with at least 99.9% identity with any one of SEQ ID NOs: 84-139.


In one embodiment, the recombinant vector with a polynucleotide sequence as set forth in any one of SEQ ID NOs: 2-57 encodes a polypeptide with at least 98% identity with any one of SEQ ID NOs: 84-139.


In one embodiment, the recombinant vector with a polynucleotide sequence as set forth in any one of SEQ ID NOs: 2-57 encodes a polypeptide with at least 95% identity with any one of SEQ ID NOs: 84-139.


In one embodiment, the recombinant vector with a polynucleotide sequence as set forth in any one of SEQ ID NOs: 2-57 encodes a polypeptide with at least 90% identity with any one of SEQ ID NOs: 84-139.


In one embodiment, the recombinant vector with a polynucleotide sequence as set forth in any one of SEQ ID NOs: 2-57 encodes a polypeptide with at least 85%, 86%, 87%, 88% or 89% identity with any one of SEQ ID NOs: 84-139.


In one embodiment the recombinant vector comprises a promoter of SEQ ID NO: 71 operably linked to a secretion tag of SEQ ID NO: 58 which is operably linked to the gene of interest of SEQ ID NO: 2.


One embodiment provides a recombinant vector comprising a promoter of SEQ ID NO: 75 operably linked to a secretion tag of SEQ ID NO: 58 which is operably linked to the gene of interest of SEQ ID NO: 2.


In one embodiment the recombinant vector comprises a promoter of SEQ ID NO: 71 operably linked to a secretion tag of SEQ ID NO: 58 which is operably linked to the gene of interest of SEQ ID NO: 3.


In one embodiment the recombinant vector comprises a promoter of SEQ ID NO: 75 operably linked to a secretion tag of SEQ ID NO: 58 which is operably linked to the gene of interest of SEQ ID NO: 3.


One embodiment provides a recombinant vector comprising a promoter of SEQ ID NO: 71 operably linked to a secretion tag of SEQ ID NO: 58 operably linked to the gene of interest of SEQ ID NO: 8.


In one embodiment the recombinant vector comprises a promoter of SEQ ID NO: 75 operably linked to a secretion tag of SEQ ID NO: 58 which is operably linked to the gene of interest of SEQ ID NO: 8.


One embodiment provides a recombinant vector comprising a promoter of SEQ ID NO: 71 operably linked to a secretion tag of SEQ ID NO: 58 which is operably linked to the gene of interest of SEQ ID NO: 9.


In one embodiment the recombinant vector comprises a promoter of SEQ ID NO: 71 operably linked to a secretion tag of SEQ ID NO: 58 which is operably linked to the gene of interest of SEQ ID NO: 9 which is operably linked to a purification/detection tag of SEQ ID NO: 68.


In one embodiment the recombinant vector comprises a promoter of SEQ ID NO: 73 operably linked to a secretion tag of SEQ ID NO: 58 which is operably linked to the gene of interest of SEQ ID NO: 9 which is operably linked to a purification/detection tag of SEQ ID NO: 68.


One embodiment provides a recombinant vector comprising a promoter of SEQ ID NO: 75 operably linked to a secretion tag of SEQ ID NO: 58 which is operably linked to the gene of interest of SEQ ID NO: 9 and which is operably linked to a purification/detection tag of SEQ ID NO: 68.


In one embodiment, the recombinant vector comprises a promoter of SEQ ID NO: 75 operably linked to a secretion tag of SEQ ID NO: 58 which is operably linked to the gene of interest of SEQ ID NO: 18.


In one embodiment, the recombinant vector comprises a promoter of SEQ ID NO: 75 operably linked to a secretion tag of SEQ ID NO: 58 which is operably linked to the gene of interest of SEQ ID NO: 20.


In one embodiment, the recombinant vector comprises a promoter of SEQ ID NO: 71 operably linked to a secretion tag of SEQ ID NO: 58 which is operably linked to the gene of interest of SEQ ID NO: 28.


In one embodiment, the recombinant vector comprises a promoter of SEQ ID NO: 73 operably linked to a secretion tag of SEQ ID NO: 58 which is operably linked to the gene of interest of SEQ ID NO: 28.


In one embodiment, the recombinant vector comprises a promoter of SEQ ID NO: 75 operably linked to a secretion tag of SEQ ID NO: 58 which is operably linked to the gene of interest of SEQ ID NO: 28.


In one embodiment, the recombinant vector comprises a promoter of SEQ ID NO: 71 operably linked to a secretion tag of SEQ ID NO: 58 which is operably linked to the gene of interest of SEQ ID NO: 29.


In one embodiment, the recombinant vector comprises a promoter of SEQ ID NO: 73 operably linked to a secretion tag of SEQ ID NO: 58 which is operably linked to the gene of interest of SEQ ID NO: 29.


In one embodiment, the recombinant vector comprises a promoter of SEQ ID NO: 75 operably linked to a secretion tag of SEQ ID NO: 58 which is operably linked to the gene of interest of SEQ ID NO: 29.


In one embodiment, the recombinant vector comprises a promoter of SEQ ID NO: 71 operably linked to a secretion tag of SEQ ID NO: 58 which is operably linked to the gene of interest of SEQ ID NO: 30.


In one embodiment, the recombinant vector comprises a promoter of SEQ ID NO: 71 operably linked to a secretion tag of SEQ ID NO: 58 which is operably linked to the gene of interest of SEQ ID NO: 31.


In one embodiment, the recombinant vector comprises a promoter of SEQ ID NO: 71 operably linked to a secretion tag of SEQ ID NO: 58 which is operably linked to the gene of interest of SEQ ID NO: 32.


One embodiment comprises a recombinant vector with a promoter of SEQ ID NO: 75 operably linked to a secretion tag of SEQ ID NO: 58 which is operably linked to the gene of interest of SEQ ID NO: 32.


In one embodiment, the recombinant vector comprises a promoter of SEQ ID NO: 71 operably linked to a secretion tag of SEQ ID NO: 58 which is operably linked to the gene of interest of SEQ ID NO: 33.


One embodiment provides a recombinant vector comprising a promoter of SEQ ID NO: 75 operably linked to a secretion tag of SEQ ID NO: 58 which is operably linked to the gene of interest of SEQ ID NO: 33.


One embodiment provides a recombinant vector comprising a promoter of SEQ ID NO: 71 operably linked to a secretion tag of SEQ ID NO: 58 which is operably linked to the gene of interest of SEQ ID NO: 34.


In one embodiment, a method for producing a cannabinoid compound comprises reacting a cannabinoid precursor with an enzyme that is expressed by a host expression system comprising a recombinant vector that has a promoter operably linked to a secretion tag that is operably linked to a GOI, wherein the cannabinoid precursor is a compound of Formula I:




embedded image


wherein R1 is H, —COOH, or an ester of the carboxylic acid and R2 is a linear or branched CH3, C2H5, C3H7, C4H9, C5H11, C6H13, C7H15 or C8H17 group.


In one embodiment, the recombinant vector comprises a GOI having a polynucleotide sequence as set forth in any one of SEQ ID NOs: 2-57.


In an embodiment, the recombinant vector comprises a polynucleotide sequence with at least 90%, 95%, 98%, or 99.9% identity with any one of SEQ ID NOs: 2-57.


In one embodiment, the recombinant vector with a polynucleotide sequence as set forth in any one of SEQ ID NOs: 2-57 encodes a polypeptide as set forth in any one of SEQ ID NOs: 84-139.


In one embodiment, the recombinant vector with a polynucleotide sequence as set forth in any one of SEQ ID NOs: 2-57 encodes a polypeptide with at least 90%, 95%, 98%, or 99.9% identity with any one of SEQ ID NOs: 84-139.


In one embodiment, the cannabinoid compound produced by a method described in this application is isolated and/or purified using chemical purification protocols well known in the art. In one embodiment, R2 is C3H7 or C5H10. In one embodiment, R2 is C3H7 and R1 is —COOH or and ester. In one embodiment, R2 is C5H11 and R1 is —COOH or and ester, for example a C1-C10 alkyl ester, an optionally substituted aryl or benzyl ester. In another embodiment, the cannabinoid precursor is CBGVA, CBGA, and ester of CBGA or CBGVA, or derivatives and analogs of CBGA and CBGVA.


EXPERIMENTAL EXAMPLES

The invention is further described in detail by reference to the following experimental examples. These examples are provided for purposes of illustration only, and are not intended to be limiting unless otherwise specified. Thus, the invention should in no way be construed as being limited to the following examples, but rather, should be construed to encompass any and all variations which become evident as a result of the teaching provided herein.


Without further description, it is believed that one of ordinary skill in the art can, using the preceding description and the following illustrative examples, make and utilize the compounds of the present invention and practice the claimed methods. The following working examples, therefore, specifically point out the preferred embodiments of the present invention, and are not to be construed as limiting in any way the remainder of the disclosure.


Example 1
Molecular Cloning

An array of promoters was employed to drive the expression of the native and mutant THCA synthase enzymes. The suitable promoters include, but are not limited to: alcohol oxidase 1 promoter (pAOX1), alcohol oxidase 2 promoter (pAOX2), NAD+-dependent formylglutathione dehydrogenase promoter (pFDH), peroxin Pex14p promoter (pPEX14), S-hydroxymethyl-glutathione hydrolase promoter (pFGH), dihydroxyacetone kinase promoter (pDAK), fructose 1,6-bisphosphate aldolase promoter (pFBA2), peroxisomal protein Pex8p promoter (pPEX8), peroxisomal membrane signal receptor PTS1 promoter (pPEX5) and alcohol dehydrogenase 2 promoter (pADH2). In one embodiment, the pAOX1 promoter was cloned to the standard plasmid pPIC3.5 (Invitrogen) with the BglII and EcoRI restriction sites, yielding a derivative called plasmid pL. DH5alpha E. coli strain was used for cloning and plasmid amplification. Transformation was performed according to a standard heat-shock protocol as known in the art.


In order to get the enzyme to secrete into the culture medium, a secretion peptide was fused to the 5′ of the GOI (gene of interest) in frame. Suitable secretion tags include but are not limited to α-MF (alpha mating factor from S. cerevisiae yeast), HSP150 (Heat shock protein 150 from P. pastoris yeast), combination of pre-α-MF (19 amino acids from α-MF) and the pro-region of HSP150, combination of pre-α-MF and a double repeat of the pro-region of HSP150, SUC2 (invertase from S. cerevisiae), PHO1 (acid phosphatase from S. cerevisiae), PIR1 (55 amino acids from Protein with internal repeats 1, from P. pastoris), OST1 (dolichyl-diphosphooligosaccharide protein glycotransferase subunit OST1), OST1 fused to the pro-region of the α-MF secretion peptide (20-85 amino acids of the α-MF secretion peptide) and DDDK (first 18 amino acids from protein A0A68487.1:GenBank accession No. from P. pastoris).


In one embodiment, the secretion tag, and the gene for THCA synthase or its mutants were synthesized as one construct. In another embodiment, a PCR based method was used to fuse the desired secretion tag to the codon optimized gene of THCA synthase and mutants of this enzyme. The genes for the THCA synthase and its mutants were chemically synthesized (Thermo Fisher Scientific, USA), and sequences of these constructs were verified with Sanger sequencing (Eurofins Genomics).


To process a secretion tagged construct (i.e., cleavage of the signal peptide from the nascent peptide chain), the cleavage recognition amino acid sequence (lysine-arginine) of Kex2 protease was introduced downstream of each secretion peptide. Thus, all secretion peptides were flanked with a lysine-arginine (KR) or lysine-lysine (KK) pair at the C-terminus, upstream of the respective GOI. To increase efficiency of cleavage by Kex2 protease, the amino acid pairs LE (leucine-glutamic acid) or ME (Methionine-glutamic acid) were cloned upstream of KR or KK flanks. Downstream of KR or KK cleavage site an EAEA repeat (glutamic acid-alanine-glutamic acid-alanine) was cloned, which is processed (sequentially cleaved) by the aminopeptidyl peptidase.


Transformation Into Yeast Strain

Plasmids were linearized (cut) within their promoter sequence using SacI/BamHI/another restriction enzyme, depending on the promoter used. The linearized plasmids were transformed individually into the GS115 strain of the yeast, Pichia pastoris. Following transformation, the yeast cells was plated onto YPD (Yeast Extract-Peptone-Dextrose) plates containing geneticin (G418≥0.25mg/ml) to enable ‘antibiotic resistance’ based selection of the transformed yeast colonies. The successful integration of the respective GOI was subsequently verified using colony PCR. FIG. 1 schematically illustrates the construct containing the GOI and its integration within the yeast genome.


Colony Screening and Growth Experiment With the Shaking Flasks

Selected clones (colonies) were streaked from the transformation plates into fresh YPD plates containing geneticin (G418≥0.25mg/ml) to obtain monoclonal colonies. Each monoclonal colony was then subjected to serial plating onto YPD plates with increasing geneticin concentration (up to 3 mg/ml) to obtain a set of clones with variable gene copy number with respect to the gene of interest. Subsequently, at least 10 such monoclonal colonies were chosen to be individually tested in shake flask studies. Note that glycerol stocks were also made for these selected monoclonal colonies using standard procedure to enable long term storage at −80° C.


Shake flask studies were initiated with at least 10 monoclonal colonies for each new construct in parallel with appropriate reference control(s). Each individual colony (grown as a biomass on YPD plates) was inoculated into a shake flask containing 10 mls of BMGY (Buffered Glycerol-complex) medium and incubated for 24 hours at 30° C., with shaking at 270 rpm. After 24-hours the cultures were induced for 3 consecutive days with a daily dose of either a final concentration of 0.2% potassium formate+0.5-1% sorbitol or 0.25-1% methanol. In some cases, a daily double dose of 0.2% potassium format+0.5-1% sorbitol was fed to the cultures.


Enzyme Assay

At the end of 70-72 hours, shake flask cultures of cells were subjected to centrifugation at 10,000 g for five minutes. The supernatant was collected and the activity of the enzyme in the supernatant was measured using an HPLC based enzyme assay. In one embodiment, the activity of the enzyme in the supernatant was measured by setting up the reaction mixture that included 125 μl of 200mM sodium citrate buffer (pH 4.8), 50 -100 μl of the supernatant and 25 μl of a 554 μM solution of Cannabigerolic acid (CBGA) in DMSO.


Following incubation at 30° C. for 30 mins, the reaction was stopped by the addition of 250 μl of 100% methanol (1:1 MeOH:vol. of reaction mixture). The stopped reaction was then centrifuged at 14000 rpm for 15 mins and the supernatant analyzed by HPLC. In one embodiment the supernatant comprising the enzyme is incubated with a substrate, for example, CBGA or CBGVA for 1-16 hours, preferably for 1-15 hours, preferably for 1-14 hours, preferably for 1-13 hours, preferably for 1-12 hours, preferably for 1-11 hours, preferably for 1-10 hours, preferably for 1-9 hours, preferably for 1-8 hours, preferably for 1-7 hours, preferably for 1-6 hours, preferably for 1-5 hours, preferably for 1-4 hours, preferably for 1-3 hours, or preferably for 1-2 hours.


HPLC

HPLC was performed on the Agilent 1260 Infinity II system. Mobile Phase A refers to water with 0.1% formic acid while the mobile Phase B refers to acetonitrile with 0.1% formic acid. Each new run on the HPLC starts with a one-hour equilibration at 25% A and 75% B. To ensure that the system was thoroughly cleaned and there was no carryover, prior to the analysis of a new set of samples the column was cleaned by injecting pure methanol followed by an injection of DMSO. Such a cleaning routine was carried out once every 20 samples. The injection volume was set at 2011.1 and the flow rate of the mobile phase was set at 40011.1/min. Samples were analyzed using the following gradient:

    • 0.0 min 25% A, 75% B,
    • 5.00 min: 4% A, 96% B,
    • 5.01 min: 0% A, 100% B, and
    • 6.00 min: 0% A, 100% B.


10 μl of 0.01 mg/mL of Δ8-THC (Cerilliant T-032 at 1 mg/mL diluted with 100% methanol) was added as an internal standard to each sample and mixed five times within the injector compartment prior to loading onto the column. Calibration curves for CBGA (Cerilliant C-142 at 1 mg/mL diluted with 100% methanol), CBDA (Cerilliant C-144 at 1 mg/mL diluted with 100% methanol), and Δ9-THCA-A (Cerilliant T-093 at 1 mg/mL diluted with 100% methanol) were generated separately and were used to calculate the amount of CBGA, CBDA and/or Δ9-THCA-A produced in a test sample by an enzyme of the present disclosure. Prior to, and following each injection, the injection needle was washed for 3 seconds with 100% MeOH. The column chamber was set to a temperature of 30.0±0.8° C. while the Diode Array Detector (DAD) was set to scan from 190 nm to 400 nm with data primarily collected at 269 nm with a peak width of 5 Hz (>0.05 min). Data was analyzed using Agilent OpenLab CDS running on Windows 10. The results of samples tested are summarized in Table 1.


The sequences of this disclosure are provided with the Reference sequence numbered as SEQ ID NO: 1 or 83.


Table 1 shows enzyme assay data.












TABLE 1







Enzyme activty relative to SEQ




ID 83 (reference)
SEQ ID Nos.









≥5 fold
84-86, 90, 91, 100-103, 106-116



<5 fold
87-89, 92-99, 104 and 105



Not determined
117-139










Table 2 correlates the SEQ ID numbers of the polynucleotide sequences for various accessory elements used to produce the THCA synthase mutants according to this disclosure to the SEQ ID numbers of their corresponding polypeptide (protein) sequences.











TABLE 2






Corresponding



DNA SEQ
Protein SEQ



ID No.
ID No.
Accessory Element







SEQ ID NO: 58
SEQ ID NO: 140
Secretion peptide alpha-MF




(alpha mating factor from S.





cerevisiae yeast)



SEQ ID NO: 59
SEQ ID NO: 141
Secretion peptide HSP150 (Heat




shock protein 150 from P.





pastoris yeast)



SEQ ID NO: 60
SEQ ID NO: 142
Secretion peptide: combination




of pre region of alpha-MF (19




amino acids from alpha-MF) and




pro-region of HSP150




from P. pastoris


SEQ ID NO: 61
SEQ ID NO: 143
Secretion peptide: combination




of pre region of alpha-MF and




double repeat of pro-region




of HSP150


SEQ ID NO: 62
SEQ ID NO: 144
Secretion peptide SUC2




(invertase from S. cerevisiae)


SEQ ID NO: 63
SEQ ID NO: 145
Secretion peptide PHO1 (acid




phosphatase from S. cerevisiae)


SEQ ID NO: 64
SEQ ID NO: 146
Secretion peptide PIR1 (55 amino




acids from Protein with internal




repeats 1, from P. pastoris


SEQ ID NO: 65
SEQ ID NO: 147
Secretion peptide OST1




(dolichyl-diphosphooligosaccharide




protein glycotransferase subunit




OST1)


SEQ ID NO: 66
SEQ ID NO: 148
Secretion peptide OST1 fused to




pro-region of alpha-MF secretion




peptide (20-85 amino acids of




alpha-MF secretion peptide)


SEQ ID NO: 67
SEQ ID NO: 149
Secretion peptide DDDK (first




18 amino acids from protein with




access number AOA68487.1




from P. pastoris)


SEQ ID NO: 68
SEQ ID NO: 150
Human influenza hemagglutinin


or 153

(HA) tag


SEQ ID NO: 69
SEQ ID NO: 151
6X Histidine tag


SEQ ID NO: 70
SEQ ID NO: 152
HN tag


SEQ ID NO: 71
Not applicable
Promoter pAOX1 (alcohol oxidase 1)


SEQ ID NO: 72
Not applicable
Promoter pAOX2 (alcohol oxidase 2)


SEQ ID NO: 73
Not applicable
Promoter pFDH




(NAD+-dependent formylglutathione




dehydrogenase)


SEQ ID NO: 74
Not applicable
Promoter pPEX14 (peroxin Pex14p)


SEQ ID NO: 75
Not applicable
Promoter pFGH (S-




hydroxymethyl-glutathione hydrolase)


SEQ ID NO: 76
Not applicable
Promoter pDAK (dihydroxyacetone




kinase)


SEQ ID NO: 77
Not applicable
Promoter pFBA (fructose




1,6-bisphosphate aldolase)


SEQ ID NO: 78
Not applicable
Promoter pPEX5 (peroxisomal




membrane signal receptor PTS1)


SEQ ID NO: 79
Not applicable
Promoter pADH2 (alcohol




dehydrogenase 2)


SEQ ID NO: 80
Not applicable
Promoter pPEX8 (peroxisomal




protein Pex8p)


SEQ ID NO: 81
Not applicable
Plasmid pPIC 3.5


SEQ ID NO: 82
Not applicable
Plasmid pL
















(REFERENCE THCA synthase)


SEQ ID NO: 1


AACCCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAAGCTGGTGCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAGACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTC





CCAGAGTTGGGTATCAAGAAAACTGACTGTAAAGAGTTCTCCTGGATC





GACACAACAATCTTCTACTCCGGTGTTGTTAACTTCAACACTGCTAAC





TTTAAGAAAGAGATCTTGTTGGACAGATCCGCTGGTAAAAAGACTGCT





TTCTCCATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGTATCATGGAAGAAATCTCCGAG





TCCGCTATTCCATTCCCACACAGAGCTGGTATTATGTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCTTACTTGAACTACAGAGACTTGGACTTGGGTAAGACT





AACCACGCTTCCCCAAACAATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGAAACGAGCAGTCCATCCCACCATTG





CCACCACATCATCATCACCATCACTAA





SEQ ID NO: 2


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCGAAC





AACGTTGCTAACCCTAAGTTGGTATACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCCTTGGTTATCGTTACTCCCTCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCCTTCGTTGTTGTTGACCTTAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAAGCAGGTGCTACGTTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACCTCTCCTTCCCG





GGTGGTTACTGTCCAACCGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGACTTGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGAGAGGACTTGTTCTGGGCTATTCGTGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGCTTGTTGCTGTTCCC





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAAACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTC





CCAGAGTTGGGCATCAAGAAAACTGACTGTAAAGAGTTCAGTTGGATC





GACACAACAATCTTCTACAGTGGTGTTGTAAACTTCAACACTGCCAAC





TTTAAGAAAGAGATCTTGTTGGACAGGTCCGCTGGTAAAAAGACAGCT





TTCTCCATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGCATCATGGAAGAAATCTCCGAG





TCCGCTATTCCGTTCCCACACCGAGCTGGTATCATGTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCGTACTTGAACTACAGAGACTTGGACTTGGGCAAGACT





AACCACGCGTCCCCAAACAATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGGAACGAGCAGTCCATCCCACCATTG





GTCAATTAA





SEQ ID NO: NO: 3


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCGGGAAT





AGGGGAAACGTATACACTCAGCACGACCAGTTGTACATGTCCTTGTTG





AACTCCACAATCCAGAACTTGAGATTCATCTCCGACACTACTCCAAAG





CCCTTGGTTATCGTTACTCCCTCCAACAACGCTCACATCCAGGCTACT





ATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGAACAAGATCCGGT





GGTCATGACGCTGAAGGTATGTCCTACATTTCCCAGGTTCCCTTCGTT





GTTGTTGACCTTAGAAACATGCACTCCATCAAGATCGACGTTCACTCC





CAAACTGCTTGGGTTGAAAGCGGTGCTACGTTGGGTGAAGTTTACTAC





TGGATCAACGAGAAGAACGAGAACCTCTCCTTCCCGGGTGGTTACTGT





CCAACCGTTGGTGTTGGTGGTCACTTTTCTGGTGGTGGTTACGGTGCT





TTGATGAGAAACTACGGACTTGCTGCTGACAACATCATCGACGCTCAC





TTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAGTCCATGGGAGAG





GACTTGTTCTGGGCTATTCGTGGTGGTGGTGGTGAGAACTTCGGTATT





ATTGCTGCTTGGAAGATCAAGCTTGTTGCTGTTCCCTCCAAGTCCACT





ATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGTTTGGTTAAGTTG





TTTAACAAGTGGCAGAACATTGCTTACAAGTACGACAAGGACTTGGTT





TTGATGACTCACTTCATCACTAAGAACATCACTGACAACCACGGTAAG





AACAAAACTACTGTTCACGGTTACTTCTCTTCCATCTTCCACGGTGGT





GTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTCCCAGAGTTGGGC





ATCAAGAAAACTGACTGTAAAGAGTTCAGTTGGATCGACACAACAATC





TTCTACAGTGGTGTTGTAAACTTCAACACTGCCAACTTTAAGAAAGAG





ATCTTGTTGGACAGGTCCGCTGGTAAAAAGACAGCTTTCTCCATTAAG





TTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCTATGGTTAAGATT





TTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGTATGTACGTTTTG





TACCCATACGGTGGCATCATGGAAGAAATCTCCGAGTCCGCTATTCCG





TTCCCACACCGAGCTGGTATCATGTACGAGTTGTGGTACACTGCTTCT





TGGGAGAAGCAAGAGGACAACGAAAAGCACATCAACTGGGTTAGATCC





GTTTACAACTTCACTACTCCTTACGTTTCCCAGAACCCAAGATTGGCG





TACTTGAACTACAGAGACTTGGACTTGGGCAAGACTAACCACGCGTCC





CCAAACAATTACACACAGGCTAGAATCTGGGGTGAAAAGTACTTCGGA





AAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAGGTTGACCCTAAC





AACTTCTTCAGGAACGAGCAGTCCATCCCACCATTGGTCAATTAA





SEQ ID NO: NO: 4


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCGAAC





AACGTAGCTAACCCTAAGTTGGTGTACACTCAGCACGACCAGTTGTAC





ATGTCATTGCTGAACTCCACAATCCAAAACCTGAGATTCATCTCCGAT





ACTACGCCAAAGCCGCTAGTAATTGTGACTCCCTCCAATAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAAATAAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGCATGTCCTACATTTCCCAG





GTTCCCTTCGTTGTAGTTGACCTTAGAAACATGCACAGCATCAAGATC





GACGTTCACTCCCAAACAGCTTGGGTTGAAGCAGGTGCTACGCTTGGT





GAAGTGTACTACTGGATCAACGAGAAGAATGAGAACCTATCCTTCCCC





GGTGGTTACTGTCCAACAGTTGGTGTTGGCGGTCACTTTAGTGGTGGT





GGATACGGTGCTTTGATGAGAAATTATGGACTAGCAGCTGACAACATC





ATCGATGCACACCTTGTAAACGTTGACGGTAAGGTTCTAGACAGGAAG





AGTATGGGTGAGGACCTTTTCTGGGCTATTCGTGGTGGTGGTGGTGAG





AACTTCGGTATCATTGCTGCATGGAAGATAAAGCTGGTTGCTGTGCCC





TCCAAGTCCACTATCTTCTCCGTCAAAAAAAACATGGAAATACACGGT





TTGGTTAAGTTATTTAACAAGTGGCAAAACATTGCATACAAGTACGAC





AAGGATTTGGTACTTATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGCAAGAACAAGACTACTGTACACGGTTACTTCTCTAGTATA





TTCCACGGTGGTGTTGATTCCTTAGTTGATTTGATGAACAAGAGCTTC





CCAGAGCTTGGCATCAAGAAAACTGACTGTAAAGAGTTCAGTTGGATC





GACACTACAATCTTCTACTCCGGTGTTGTAAACTTCAACACAGCCAAC





TTTAAAAAAGAGATCTTGTTGGACAGATCCGCTGGAAAAAAGACAGCT





TTCTCAATTAAGTTGGACTACGTGAAGAAGCCAATCCCAGAGACTGCA





ATGGTTAAGATTCTTGAGAAGTTATACGAAGAGGACGTAGGTGCTGGC





ATGTACGTTTTGTACCCTTACGGTGGCATCATGGAAGAAATCTCCGAG





TCCGCTATTCCCTTCCCACACAGGGCTGGTATCATGTACGAGTTATGG





TACACTGCTAGTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGAAGTGTTTACAACTTCACAACTCCTTACGTTTCCCAGAAC





CCAAGACTTGCGTACTTGAACTACAGAGACTTGGACTTGGGTAAGACT





AACCACGCGTCCCCCAACAATTACACACAGGCTCGTATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGGTTAGTTAAGGTTAAGACCAAG





GTTGACCCTAACAACTTCTTCAGAAACGAGCAGAGTATCCCTCCATTG





GTCAATTAA





SEQ ID NO: 5


AACGCAAGAGAAAACTTCTTGCAATGTTTTTCCGCTCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAATCTGGTGCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGCAAAACGAGAACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTTCTGTTTTGGACAGACAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAGACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTC





CCAGAGTTGGGTATCCAATCCACTGACTGTAAAGAGTTCTCCTGGATC





GACACAACAATCTTCTACTCCGGTGTTGTTAACTTCAACACTGCTAAC





TTTAAGAAAGAGATCTTGTTGGACAGATCCGCTGGTAAAAAGACTGCT





TTCTCCATTAAGTTGGACTACGTTAAGCAACCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGTATCATGGAAGAAATCTCCGAG





TCCGCTATTCCATTCCCACACAGAGCTGGTATTATGTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCTTACTTGAACTACAGAGACTTGGACTTGGGAGGCACT





AACCACGCTTCCCCAAACAATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGAAACGAGCAGTCCATCCCACCATTG





GTCAATTAA





SEQ ID NO: 6


AACGCAAGAGAAAACTTCCTTCAATGTTTTTCCGCTCACATCCCGAAC





AACGTTGCTAACCCTAAGTTGGTATACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCCTTGGTTATCGTTACTCCCTCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCCTTCGTTGTTGTTGACCTTAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAAAGCGGTGCTACGTTGGGT





GAAGTTTACTACTGGATCAACGAGCAAAACGAGAACCTCTCCTTCCCG





GGTGGTTACTGTCCAACCGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGACTTGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAGCGTTTTGGACAGACAG





TCCATGGGAGAGGACTTGTTCTGGGCTATTCGTGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGCTTGTTGCTGTTCCC





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAAACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTC





CCAGAGTTGGGCATCCAATCCACTGACTCTAAAGAGTTCAGTTGGATC





GACACAACAATCTTCTACAGTGGTGTTGTAAACTTCAACACTGCCAAC





TTTAAGAAAGAGATCTTGTTGGACAGGTCCGCTGGTAAAAAGACAGCT





TTCTCCATTAAGTTGGACTACGTTAAGCAACCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGCATCATGGAAGAAATCTCCGAG





TCCGCTATTCCGTTCCCACACCGAGCTGGTATCATGTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCGTACTTGAACTACAGAGACTTGGACTTGGGAGGCACT





AACCACGCGTCCCCAAACAATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGGAACGAGCAGTCCATCCCACCATTG





GTCAATTAA





SEQ ID NO: 7


AACGCAAGAGAAAACTTCTTGCAATGTTTTTCCGCTCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACTTGAATCGTATCTCTATC





GACGTTCACTCCCAAACTGCTTGGGTTGAATCTGGTTCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGCAAAACGAGAACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAAGTACGGATTGGCTGCTGACAACATC





ATCGACGCTATCTTGGTTAACGTTGACGGTTCTGTTTTGGACAGACAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAG





AACTTCGGTATTATTTACGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





GAAGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAGACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTC





CCAGAGTTGGGTATCCAATCCACTGACTGTAAAGAGTTCTCCTGGATC





GACACAACAATCTTCTACTCCGGTGTTGTTAACTTCAACACTGCTAAC





TTTAAGAAAGAGATCTTGTTGGACAGATCCGCTGGTAAAAAGACTGCT





TTCTCCATTAAGTTGGACTACGTTAAGCAACCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGTATCATGGAAGAAATCTCCGAG





TCCGCTACTCCATTCCCACACAGAGCTGGTATTATGTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCTTACTTGAACTACAGAGACTTGGACTTGGGTGGTACT





AACCACGCTTCCCCAAACAATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGAAACGAGCAGTCCATCCCACCATTG





GTTAACTAA





SEQ ID NO: 8


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAAGCTGGTGCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAGACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTC





CCAGAGTTGGGTATCAAGAAAACTGACTGTAAAGAGTTCTCCTGGATC





GACACAACAATCTTCTACTCCGGTGTTGTTAACTTCAACACTGCTAAC





TTTAAGAAAGAGATCTTGTTGGACAGATCCGCTGGTAAAAAGACTGCT





TTCTCCATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGTATCATGGAAGAAATCTCCGAG





TCCGCTATTCCATTCCCACACAGAGCTGGTATTATGTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCTTACTTGAACTACAGAGACTTGGACTTGGGTAAGACT





AACCACGCTTCCCCAAACAATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGAAACGAGCAGTCCATCCCACCATTG





GTCAATTAA





SEQ ID NO: 9


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGGACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAAGCTGGTGCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAGACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTC





CCAGAGTTGGGTATCAAGAAAACTGACTGTAAAGAGTTCTCCTGGATC





GACACAACAATCTTCTACTCCGGTGTTGTTAACTTCAACACTGCTAAC





TTTAAGAAAGAGATCTTGTTGGACAGATCCGCTGGTAAAAAGACTGCT





TTCTCCATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGTATCATGGAAGAAATCTCCGAG





TCCGCTATTCCATTCCCACACAGAGCTGGTATTATGTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCTTACTTGAACTACAGAGACTTGGACTTGGGTAAGACT





AACCACGCTTCCCCAAACAATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGAAACGAGCAGTCCATCCCACCATTG





GTCAATTAA





SEQ ID NO: 10


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAAGCTGGTGCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACAGCGGAAAC





ACTGTTCACGGTTACTTCTCTTCCATCTTCCACGGTGGTGTTGATTCC





TTGGTTGATTTGATGAACAAGTCTTTCCCAGAGTTGGGTATCAAGAAA





ACTGACTGTAAAGAGTTCTCCTGGATCGACACAACAATCTTCTACTCC





GGTGTTGTTAACTTCAACACTGCTAACTTTAAGAAAGAGATCTTGTTG





GACAGATCCGCTGGTAAAAAGACTGCTTTCTCCATTAAGTTGGACTAC





GTTAAGAAGCCAATCCCAGAGACTGCTATGGTTAAGATTTTGGAGAAG





TTGTACGAAGAGGACGTTGGTGCTGGTATGTACGTTTTGTACCCATAC





GGTGGTATCATGGAAGAAATCTCCGAGTCCGCTATTCCATTCCCACAC





AGAGCTGGTATTATGTACGAGTTGTGGTACACTGCTTCTTGGGAGAAG





CAAGAGGACAACGAAAAGCACATCAACTGGGTTAGATCCGTTTACAAC





TTCACTACTCCTTACGTTTCCCAGAACCCAAGATTGGCTTACTTGAAC





TACAGAGACTTGGACTTGGGTAAGACTAACCACGCTTCCCCAAACAAT





TACACACAGGCTAGAATCTGGGGTGAAAAGTACTTCGGAAAGAACTTT





AACAGATTGGTTAAGGTTAAGACTAAGGTTGACCCTAACAACTTCTTC





AGAAACGAGCAGTCCATCCCACCATTGGTCAATTAA





SEQ ID NO: 11


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAAGCTGGTGCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTIGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACAGCGGAAAC





ACTGTTCACGGTTACTTCTCTTCCATCTTCCACGGTGGTGTTGATTCC





TTGGTTGATTTGATGCAAAAGTCTTTCCCAGAGTTGGGTATCAAGAAA





ACTGACTGTAAAGAGTTCTCCTGGATCGACACAACAATCTTCTACTCC





GGTGTTGTTAACTTCAACACTGCTAACTTTAAGAAAGAGATCTTGTTG





GACAGATCCGCTGGTAAAAAGACTGCTTTCTCCATTAAGTTGGACTAC





GTTAAGAAGCCAATCCCAGAGACTGCTATGGTTAAGATTTTGGAGAAG





TTGTACGAAGAGGACGTTGGTGCTGGTATGTACGTTTTGTACCCATAC





GGTGGTATCATGGAAGAAATCTCCGAGTCCGCTATTCCATTCCCACAC





AGAGCTGGTATTATGTACGAGTTGTGGTACACTGCTTCTTGGGAGAAG





CAAGAGGACAACGAAAAGCACATCAACTGGGTTAGATCCGTTTACAAC





TTCACTACTCCTTACGTTTCCCAGAACCCAAGATTGGCTTACTTGAAC





TACAGAGACTTGGACTTGGGTAAGACTAACCACGCTTCCCCAAACAAT





TACACACAGGCTAGAATCTGGGGTGAAAAGTACTTCGGAAAGAACTTT





AACAGATTGGTTAAGGTTAAGACTAAGGTTGACCCTAACAACTTCTTC





AGAAACGAGCAGTCCATCCCACCATTGGTCAATTAA





SEQ ID NO: 12


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAAGCTGGTGCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAGACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGCAAAAGTCTTTC





CCAGAGTTGGGTATCAAGAAAACTGACTGTAAAGAGTTCTCCTGGATC





GACACAACAATCTTCTACTCCGGTGTTGTTAACTTCAACACTGCTAAC





TTTAAGAAAGAGATCTTGTTGGACAGATCCGCTGGTAAAAAGACTGCT





TTCTCCATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGTATCATGGAAGAAATCTCCGAG





TCCGCTATTCCATTCCCACACAGAGCTGGTATTATGTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCTTACTTGAACTACAGAGACTTGGACTTGGGTAAGACT





AACCACGCTTCCCCAAACAATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGAAACGAGCAGTCCATCCCACCATTG





GTCAATTAA





SEQ ID NO: 13


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAAGCTGGTGCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAGACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTTGTGATTCCGCAGTTGATTTGATGCAAAAGTCTTTC





CCAGAGTTGGGTGTCAAGAAAACTGACTGTAAAGAGTTCTCCTGGATC





GACACAACAATCTTCTACTCCGGTGTTGTTAACTTCAACACTGCTAAC





TTTAAGAAAGAGATCTTGTTGGACAGATCCGCTGGTAAAAAGACTGCT





TTCTCCATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGTATCATGGAAGAAATCTCCGAG





TCCGCTATTCCATTCCCACACAGAGCTGGTATTATGTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCTTACTTGAACTACAGAGACTTGGACTTGGGTAAGACT





AACCACGCTTCCCCAAACAATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGAAACGAGCAGTCCATCCCACCATTG





GTCAATTAA





SEQ ID NO: 14


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAAGCTGGTGCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAGACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGCAAAAGTCTTTC





CCAGAGTTGGGTATCAAGAAAACTGACAGCAAAGAGTTCTCCTGGATC





GACACAACAATCTTCTACTCCGGTGTTGTTAACTTCAACACTGCTAAC





TTTAAGAAAGAGATCTTGTTGGACAGATCCGCTGGTAAAAAGACTGCT





TTCTCCATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGTATCATGGAAGAAATCTCCGAG





TCCGCTATTCCATTCCCACACAGAGCTGGTATTATGTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCTTACTTGAACTACAGAGACTTGGACTTGGGTAAGACT





AACCACGCTTCCCCAAACAATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGAAACGAGCAGTCCATCCCACCATTG





GTCAATTAA





SEQ ID NO: 15


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAAGCTGGTGCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAGACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTC





CCAGAGTTGGGTATCAAGAAAACTGACTGTAAAGAGTTCTCCTGGATC





GACACAACAATCTTCTACTCCGGTGTTGTTAACTTCAACACTGCTAAC





TTTAAGAAAGAGATCTTGTTGGACAGATCCGCTGGTAAAAAGACTGCT





TTCTCCATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGTATCATGGAAGAAATCTCCGAG





TCCGCTATTCCATTCCCACACAGAGCTGGTATTATGTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACCAATTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCTTACTTGAACTACAGAGACTTGGACTTGGGTAAGACT





AACCACGCTTCCCCAAACAATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGAAACGAGCAGTCCATCCCACCATTG





GTCAATTAA





SEQ ID NO: 16


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAAGCTGGTGCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAGACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGCAAAAGTCTTTC





CCAGAGTTGGGTATCAAGAAAACTGACAGCAAAGAGTTCTCCTGGATC





GACACAACAATCTTCTACTCCGGTGTTGTTAACTTCAACACTGCTAAC





TTTAAGAAAGAGATCTTGTTGGACAGATCCGCTGGTAAAAAGACTGCT





TTCTCCATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGTATCATGGAAGAAATCTCCGAG





TCCGCTATTCCATTCCCACACAGAGCTGGTATTATGTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACCAATTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCTTACTTGAACTACAGAGACTTGGACTTGGGTAAGACT





AACCACGCTTCCCCAAACAATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGAAACGAGCAGTCCATCCCACCATTG





GTCAATTAA





SEQ ID NO: 17


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAAGCTGGTGCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACAGCGGAAAC





ACTGTTCACGGTTACTTCTCTTCCATCTTCCACGGTGGTGTTGATTCC





TTGGTTGATTTGATGCAAAAGTCTTTCCCAGAGTTGGGTATCAAGAAA





ACTGACAGCAAAGAGTTCTCCTGGATCGACACAACAATCTTCTACTCC





GGTGTTGTTAACTTCAACACTGCTAACTTTAAGAAAGAGATCTTGTTG





GACAGATCCGCTGGTAAAAAGACTGCTTTCTCCATTAAGTTGGACTAC





GTTAAGAAGCCAATCCCAGAGACTGCTATGGTTAAGATTTTGGAGAAG





TTGTACGAAGAGGACGTTGGTGCTGGTATGTACGTTTTGTACCCATAC





GGTGGTATCATGGAAGAAATCTCCGAGTCCGCTATTCCATTCCCACAC





AGAGCTGGTATTATGTACGAGTTGTGGTACACTGCTTCTTGGGAGAAG





CAAGAGGACAACGAAAAGCACATCAACTGGGTTAGATCCGTTTACCAA





TTCACTACTCCTTACGTTTCCCAGAACCCAAGATTGGCTTACTTGAAC





TACAGAGACTTGGACTTGGGTAAGACTAACCACGCTTCCCCAAACAAT





TACACACAGGCTAGAATCTGGGGTGAAAAGTACTTCGGAAAGAACTTT





AACAGATTGGTTAAGGTTAAGACTAAGGTTGACCCTAACAACTTCTTC





AGAAACGAGCAGTCCATCCCACCATTGGTCAATTAA





SEQ ID NO: 18


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCCAAC





AACGTTGCTAACCCTAAGTTGGTATACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGGACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCCTTGGTTATCGTTACTCCCTCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCCTTCGTTGTTGTTGACCTTAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAAAGCGGTGCTACATTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACCTCTCCTTCCCC





GGTGGTTACTGTCCAACCGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGACTTGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGAGAGGACTTGTTCTGGGCTATTCGTGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGCTTGTTGCTGTTCCC





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAAACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTC





CCAGAGTTGGGCATCAAGAAAACTGACTGTAAAGAGTTCAGTTGGATC





GACACAACAATCTTCTACAGTGGTGTTGTAAACTTCAACACTGCCAAC





TTTAAGAAAGAGATCTTGTTGGACAGGTCCGCTGGTAAAAAGACAGCT





TTCTCCATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGCATCATGGAAGAAATCTCCGAG





TCCGCTATTCCGTTCCCACACCGAGCTGGTATCATGTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCGTACTTGAACTACAGAGACTTGGACTTGGGCAAGACC





AACCACGCATCCCCTAACAATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGGAACGAGCAGTCCATCCCACCATTG





GTCAATTAA





SEQ ID NO: 19


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCCAAC





AACGTTGCTAACCCTAAGTTGGTATACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCCTTGGTTATCGTTACTCCCTCCAACAACGCACAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCCTTCGTTGTTGTTGACCTTAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAAAGCGGTGCTACATTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACCTCTCCTTCCCC





GGTGGTTACTGTCCAACCGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGACTTGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGAGAGGACTTGTTCTGGGCTATTCGTGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGCTTGTTGCTGTTCCC





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAAACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTC





CCAGAGTTGGGCATCAAGAAAACTGACTGTAAAGAGTTCAGTTGGATC





GACACAACAATCTTCTACAGTGGTGTTGTAAACTTCAACACTGCCAAC





TTTAAGAAAGAGATCTTGTTGGACAGGTCCGCTGGTAAAAAGACAGCT





TTCTCCATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGCATCATGGAAGAAATCTCCGAG





TCCGCTATTCCGTTCCCACACCGAGCTGGTATCATGTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCGTACTTGAACTACAGAGACTTGGACTTGGGCAAGACC





AACCACGCATCCCCTAACAATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGGAACGAGCAGTCCATCCCACCATTG





GTCAATTAA





SEQ ID NO: 20


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCCAAC





AACGTTGCTAACCCTAAGTTGGTATACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGGACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCCTTGGTTATCGTTACTCCCTCCAACAACGCACAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCCTTCGTTGTTGTTGACCTTAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAAAGCGGTGCTACATTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACCTCTCCTTCCCC





GGTGGTTACTGTCCAACCGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGACTTGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGAGAGGACTTGTTCTGGGCTATTCGTGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGCTTGTTGCTGTTCCC





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAAACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTC





CCAGAGTTGGGCATCAAGAAAACTGACTGTAAAGAGTTCAGTTGGATC





GACACAACAATCTTCTACAGTGGTGTTGTAAACTTCAACACTGCCAAC





TTTAAGAAAGAGATCTTGTTGGACAGGTCCGCTGGTAAAAAGACAGCT





TTCTCCATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGCATCATGGAAGAAATCTCCGAG





TCCGCTATTCCGTTCCCACACCGAGCTGGTATCATGTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCGTACTTGAACTACAGAGACTTGGACTTGGGCAAGACC





AACCACGCATCCCCTAACAATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGGAACGAGCAGTCCATCCCACCATTG





GTCAATTAA





SEQ ID NO: 21


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCCAAC





AACGTTGCTAACCCTAAGTTGGTATACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGGACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCCTTGGTTATCGTTACTCCCTCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCCTTCGTTGTTGTTGACCTTAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAAAGCGGTGCTACATTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACCTCTCCTTCCCC





GGTGGTTACTGTCCAACCGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGACTTGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTTCAGTTTTGGACAGACAA





TCCATGGGAGAGGACTTGTTCTGGGCTATTCGTGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGCTTGTTGCTGTTCCC





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAAACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTC





CCAGAGTTGGGCATCTCTAAAACTGACTGTAAAGAGTTCAGTTGGATC





GACACAACAATCTTCTACAGTGGTGTTGTAAACTTCAACACTGCCAAC





TTTAAGAAAGAGATCTTGTTGGACAGGTCCGCTGGTAAAAAGACAGCT





TTCTCCATTAAGTTGGACTACGTTAAGCAGCCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGCATCATGGAAGAAATCTCCGAG





TCCGCTATTCCGTTCCCACACCGAGCTGGTATCATGTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCGTACTTGAACTACAGAGACTTGGACTTGGGCGGTACC





AACCACGCATCCCCTAACAATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGGAACGAGCAGTCCATCCCACCATTG





GTCAATTAA





SEQ ID NO: 22


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCCAAC





AACGTTGCTAACCCTAAGTTGGTATACACTCAGCACGACCAGTTGTTC





GAGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCCTTGGTTATCGTTACTCCCTCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTTTGTCCTACATTTCCCAG





GTTCCCTTCGTTGTTGTTGACCTTAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAAAGCGGTGCTACATTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACCTCTCCTTCCCC





GGTGGTTACTGTCCAACCGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGACTTGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGAGAGGACTTGTTCTGGGCTATTCGTGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGCTTGTTGCTGTTCCC





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACGTTGAAATCCACGCT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAAACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTC





CCAGAGTTGGGCATCAAGAAAACTGACTGTAAAGAGTTCAGTTGGATC





GACACAACAATCTTCTACAGTGGTGTTGTAAACTTCAACACTGCCAAC





TTTAAGAAAGAGATCTTGTTGGACAGGTCCGCTGGTAAAAAGACAGCT





TTCTCCATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGCATCATGGAAGAAATCTCCGAG





TCCGCTATTCCGTTCCCACACCGAGCTGGTACCGTTTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCGTACTTGAACTACAGAGACTTGGACTTGGGCAAGACC





AACCACGCATCCCCTAACAATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGGAACGAGCAGTCCATCCCACCATTG





GTCAATTAA





SEQ ID NO: 23


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCCAAC





AACGTTGCTAACCCTAAGTTGGTATACACTCAGCACGACCAGTTGTTC





GAGTCCTTGTTGGACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCCTTGGTTATCGTTACTCCCTCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTTTGTCCTACATTTCCCAG





GTTCCCTTCGTTGTTGTTGACCTTAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAAAGCGGTGCTACATTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACCTCTCCTTCCCC





GGTGGTTACTGTCCAACCGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGACTTGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGAGAGGACTTGTTCTGGGCTATTCGTGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGCTTGTTGCTGTTCCC





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACGTTGAAATCCACGCT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAAACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTC





CCAGAGTTGGGCATCAAGAAAACTGACTGTAAAGAGTTCAGTTGGATC





GACACAACAATCTTCTACAGTGGTGTTGTAAACTTCAACACTGCCAAC





TTTAAGAAAGAGATCTTGTTGGACAGGTCCGCTGGTAAAAAGACAGCT





TTCTCCATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGCATCATGGAAGAAATCTCCGAG





TCCGCTATTCCGTTCCCACACCGAGCTGGTACCGTTTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCGTACTTGAACTACAGAGACTTGGACTTGGGCAAGACC





AACCACGCATCCCCTAACAATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGGAACGAGCAGTCCATCCCACCATTG





GTCAATTAA





SEQ ID NO: 24


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCCAAC





AACGTTGCTAACCCTAAGTTGGTATACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCCTTGGTTATCGTTACTCCCTCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCCTTCGTTGTTGTTGACCTTAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAAAGCGGTGCTACATTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACCTCTCCTTCCCC





GGTGGTTACTGTCCAACCGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAAGTACGGACTTGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGAGAGGACTTGTTCTGGGCTATTCGTGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGCTTGTTGCTGTTCCC





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACGGTGCTATCCACGAG





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





GAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAAACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTC





CCAGAGTTGGGCATCAAGAAAACTGACTGTAAAGAGTTCAGTTGGATC





GACACAACAATCTTCTACAGTGGTGTTGTAAACTTCAACACTGCCAAC





TTTAAGAAAGAGATCTTGTTGGACAGGTCCGCTGGTAAAAAGACAGCT





TTCTCCATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGCATCATGGAGAAGATCTCCGAG





TCCGCTATTCCGTTCCCACACCGAGCTGGTATCATGTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCGTACTTGAACTACAGAGACTTGGACTTGGGCAAGACC





AACCACGCATCCCCTAACAATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGGAACGAGCAGTCCATCCCACCATTG





GTCAATTAA





SEQ ID NO: 25


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCCAAC





AACGTTGCTAACCCTAAGTTGGTATACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCCTTGGTTATCGTTACTCCCTCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCCTTCGTTGTTGTTGACCTTAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAAAGCGGTGCTACATTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACCTCTCCTTCCCC





GGTGGTTACTGTCCAACCGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGACTTGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGAGAGGACTTGTTCTGGGCTATTCGTGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGCTTGTTGCTGTTCCC





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAAACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTC





CCAGAGTTGGGCATCAAGAAAACTGACTGTAAAGAGTTCAGTTGGATC





GACACAACAATCTTCTACAGTGGTGGAACAGGCAGTGGAACCTTTAAG





AAAGAGATCTTGTTGGACAGGTCCGCTGGTAAAAAGACAGCTTTCTCC





ATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCTATGGTT





AAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGTATGTAC





GTTTTGTACCCATACGGTGGCATCATGGAAGAAATCTCCGAGTCCGCT





ATTCCGTTCCCACACCGAGCTGGTATCATGTACGAGTTGTGGTACACT





GCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAACTGGGTT





AGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAACCCAAGA





TTGGCGTACTTGAACTACAGAGACTTGGACTTGGGCAAGACCAACCAC





GCATCCCCTAACAATTACACACAGGCTAGAATCTGGGGTGAAAAGTAC





TTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAGGTTGAC





CCTAACAACTTCTTCAGGAACGAGCAGTCCATCCCACCATTGGTCAAT





TAA





SEQ ID NO: 26


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCCAAC





AACGTTGCTAACCCTAAGTTGGTATACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCCTTGGTTATCGTTACTCCCTCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCCTTCGTTGTTGTTGACCTTAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAAAGCGGTGCTACATTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACCTCTCCTTCCCC





GGTGGTTACTGTCCAACCGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGACTTGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGAGAGGACTTGTTCTGGGCTATTCGTGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGCTTGTTGCTGTTCCC





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAAACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTC





CCAGAGTTGGGCATCAAGAAAACTGACTGTAAAGAGTTCAGTTGGATC





GACACAACAATCTTCTACAGTGGTGGAACAGGCAGTGGAACCAAGAAA





GAGATCTTGTTGGACAGGTCCGCTGGTAAAAAGACAGCTTTCTCCATT





AAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCTATGGTTAAG





ATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGTATGTACGTT





TTGTACCCATACGGTGGCATCATGGAAGAAATCTCCGAGTCCGCTATT





CCGTTCCCACACCGAGCTGGTATCATGTACGAGTTGTGGTACACTGCT





TCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAACTGGGTTAGA





TCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAACCCAAGATTG





GCGTACTTGAACTACAGAGACTTGGACTTGGGCAAGACCAACCACGCA





TCCCCTAACAATTACACACAGGCTAGAATCTGGGGTGAAAAGTACTTC





GGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAGGTTGACCCT





AACAACTTCTTCAGGAACGAGCAGTCCATCCCACCATTGGTCAATTAA








SEQ ID NO: 27


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCCAAC





AACGTTGCTAACCCTAAGTTGGTATACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCCTTGGTTATCGTTACTCCCTCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCCTTCGTTGTTGTTGACCTTAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAAAGCGGTGCTACATTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACCTCTCCTTCCCC





GGTGGTTACTGTCCAACCGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGACTTGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGAGAGGACTTGTTCTGGGCTATTCGTGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGCTTGTTGCTGTTCCC





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAAACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTC





CCAGAGTTGGGCATCAAGAAAACTGACTGTAAAGAGTTCAGTTGGATC





GACACAACAATCTTCTACAGTGGTGGAGATGGCACAGGTAAGAAAGAG





ATCTTGTTGGACAGGTCCGCTAAAGGCAAGACAGCTTTCTCCATTAAG





TTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCTATGGTTAAGATT





TTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGTATGTACGTTTTG





TACCCATACGGTGGCATCATGGAAGAAATCTCCGAGTCCGCTATTCCG





TTCCCACACCGAGCTGGTATCATGTACGAGTTGTGGTACACTGCTTCT





TGGGAGAAGCAAGAGGACAACGAAAAGCACATCAACTGGGTTAGATCC





GTTTACAACTTCACTACTCCTTACGTTTCCCAGAACCCAAGATTGGCG





TACTTGAACTACAGAGACTTGGACTTGGGCAAGACCAACCACGCATCC





CCTAACAATTACACACAGGCTAGAATCTGGGGTGAAAAGTACTTCGGA





AAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAGGTTGACCCTAAC





AACTTCTTCAGGAACGAGCAGTCCATCCCACCATTGGTCAATTAA





SEQ ID NO: 28


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCCAAC





AACGTTGCTAACCCTAAGTTGGTATACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCCTTGGTTATCGTTACTCCCTCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCCTTCGTTGTTGTTGACCTTAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAAAGCGGTGCTACATTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACCTCTCCTTCCCC





GGTGGTTACTGTCCAACCGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGACTTGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGAGAGGACTTGTTCTGGGCTATTCGTGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGCTTGTTGCTGTTCCC





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACCCCACACCT





AACACTACTGTTCACGGTTACTTCTCTTCCATCTTCCACGGTGGTGTT





GATTCCTTGGTTGATTTGATGAACAAGTCTTTCCCAGAGTTGGGCATC





AAGAAAACTGACTGTAAAGAGTTCAGTTGGATCGACACAACAATCTTC





TACAGTGGTGTTGTAAACTTCAACACTGCCAACTTTAAGAAAGAGATC





TTGTTGGACAGGTCCGCTGGTAAAAAGACAGCTTTCTCCATTAAGTTG





GACTACGTTAAGAAGCCAATCCCAGAGACTGCTATGGTTAAGATTTTG





GAGAAGTTGTACGAAGAGGACGTTGGTGCTGGTATGTACGTTTTGTAC





CCATACGGTGGCATCATGGAAGAAATCTCCGAGTCCGCTATTCCGTTC





CCACACCGAGCTGGTATCATGTACGAGTTGTGGTACACTGCTTCTTGG





GAGAAGCAAGAGGACAACGAAAAGCACATCAACTGGGTTAGATCCGTT





TACAACTTCACTACTCCTTACGTTTCCCAGAACCCAAGATTGGCGTAC





TTGAACTACAGAGACTTGGACTTGGGCAAGACCAACCACGCATCCCCT





AACAATTACACACAGGCTAGAATCTGGGGTGAAAAGTACTTCGGAAAG





AACTTTAACAGATTGGTTAAGGTTAAGACTAAGGTTGACCCTAACAAC





TTCTTCAGGAACGAGCAGTCCATCCCACCATTGGTCAATTAA





SEQ ID NO: 29


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCCAAC





GCTAACCCTAAGTTGGTATACACTCAGCACGACCAGTTGTACATGTCC





TTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGACACTACT





CCAAAGCCCTTGGTTATCGTTACTCCCTCCAACAACGCACACATCCAG





GCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGAACAAGA





TCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAGGTTCCC





TTCGTTGTTGTTGACCTTAGAAACATGCACTCCATCAAGATCGACGTT





CACTCCCAAACTGCTTGGGTTGAAAGCGGTGCTACATTGGGTGAAGTT





TACTACTGGATCAACGAGAAGAACGAGAACCTCTCCTTCCCCGGTGGT





TACTGTCCAACCGTTGGTGTTGGTGGTCACTTTTCTGGTGGTGGTTAC





GGTGCTTTGATGAGAAACTACGGACTTGCTGCTGACAACATCATCGAC





GCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAGTCCATG





GGAGAGGACTTGTTCTGGGCTATTCGTGGTGGTGGTGGTGAGAACTTC





GGTATTATTGCTGCTTGGAAGATCAAGCTTGTTGCTGTTCCCTCCAAG





TCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGTTTGGTT





AAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGACAAGGAC





TTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGACAACCAC





GGTAAGAACAAAACTACTGTTCACGGTTACTTCTCTTCCATCTTCCAC





GGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTCCCAGAG





TTGGGCATCAAGAAAACTGACTGTAAAGAGTTCAGTTGGATCGACACA





ACAATCTTCTACAGTGGTGTTGTAAACTTCAACACTGCCAACTTTAAG





AAAGAGATCTTGTTGGACAGGTCCGCTGGTAAAAAGACAGCTTTCTCC





ATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCTATGGTT





AAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGTATGTAC





GTTTTGTACCCATACGGTGGCATCATGGAAGAAATCTCCGAGTCCGCT





ATTCCGTTCCCACACCGAGCTGGTATCATGTACGAGTTGTGGTACACT





GCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAACTGGGTT





AGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAACCCAAGA





TTGGCGTACTTGAACTACAGAGACTTGGACTTGGGCAAGACCAACCAC





GCATCCCCTAACAATTACACACAGGCTAGAATCTGGGGTGAAAAGTAC





TTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAGGTTGAC





CCTAACAACTTCTTCAGGAACGAGCAGTCCATCCCACCATTGGTCAAT





TAA





SEQ ID NO: 30


AACGCAAGAGAAAACTTGTTGAAGTGTTTGTCCAAGCACGACGGCACA





GGAAAGACTGTATACACTCAGCACGACCAGTTGTACATGTCCTTGTTG





AACTCCACAATCCAGAACTTGAGATTCATCTCCGACACTACTCCAAAG





CCCTTGGTTATCGTTACTCCCTCCAACAACGCAAAGATCCAGGCTACT





ATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGAACAAGATCCGGT





GGTCATGACGCTGAAGGTATGTCCTACATTTCCCAGGTTCCCTTCGTT





GTTGTTGACCTTAGAAACATGCACTCCATCAAGATCGACGTTCACTCC





CAAACTGCTTGGGTTGAAAGCGGTGCTACATTGGGTGAAGTTTACTAC





TGGATCAACGAGAAGAACGAGAACCTCTCCTTCCCCGGTGGTTACTGT





CCAACCGTTGGTGTTGGTGGTCACTTTTCTGGTGGTGGTTACGGTGCT





TTGATGAGAAACTACGGACTTGCTGCTGACAACATCATCGACGCTCAC





TTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAGTCCATGGGAGAG





GACTTGTTCTGGGCTATTCGTGGTGGTGGTGGTGAGAACTTCGGTATT





ATTGCTGCTTGGAAGATCAAGCTTGTTGCTGTTCCCTCCAAGTCCACT





ATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGTTTGGTTAAGTTG





TTTAACAAGTGGCAGAACATTGCTTACAAGTACGACAAGGACTTGGTT





TTGATGACTCACTTCATCACTAAGAACATCACTGACAACCACGGTAAG





AACAAAACTACTGTTCACGGTTACTTCTCTTCCATCTTCCACGGTGGT





GTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTCCCAGAGTTGGGC





ATCAAGAAAACTGACTGTAAAGAGTTCAGTTGGATCGACACAACAATC





TTCTACAGTGGTGTTGTAAACTTCAACACTGCCAACTTTAAGAAAGAG





ATCTTGTTGGACAGGTCCGCTGGTAAAAAGACAGCTTTCTCCATTAAG





TTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCTATGGTTAAGATT





TTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGTATGTACGTTTTG





TACCCATACGGTGGCATCATGGAAGAAATCTCCGAGTCCGCTATTCCG





TTCCCACACCGAGCTGGTATCATGTACGAGTTGTGGTACACTGCTTCT





TGGGAGAAGCAAGAGGACAACGAAAAGCACATCAACTGGGTTAGATCC





GTTTACAACTTCACTACTCCTTACGTTTCCCAGAACCCAAGATTGGCG





TACTTGAACTACAGAGACTTGGACTTGGGCAAGACCAACCACGCATCC





CCTAACAATTACACACAGGCTAGAATCTGGGGTGAAAAGTACTTCGGA





AAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAGGTTGACCCTAAC





AACTTCTTCAGGAACGAGCAGTCCATCCCACCATTGGTCAATTAA





SEQ ID NO: 31


AACGCAAGAGAAAACTTGTTGAAGTGTTTGTCCAAGCACGGCGACGGA





ACAGGAAAGACTGTATACACTCAGCACGACCAGTTGTACATGTCCTTG





TTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGACACTACTCCA





AAGCCCTTGGTTATCGTTACTCCCTCCAACAACGCAAAGATCCAGGCT





ACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGAACAAGATCC





GGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAGGTTCCCTTC





GTTGTTGTTGACCTTAGAAACATGCACTCCATCAAGATCGACGTTCAC





TCCCAAACTGCTTGGGTTGAAAGCGGTGCTACATTGGGTGAAGTTTAC





TACTGGATCAACGAGAAGAACGAGAACCTCTCCTTCCCCGGTGGTTAC





TGTCCAACCGTTGGTGTTGGTGGTCACTTTTCTGGTGGTGGTTACGGT





GCTTTGATGAGAAACTACGGACTTGCTGCTGACAACATCATCGACGCT





CACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAGTCCATGGGA





GAGGACTTGTTCTGGGCTATTCGTGGTGGTGGTGGTGAGAACTTCGGT





ATTATTGCTGCTTGGAAGATCAAGCTTGTTGCTGTTCCCTCCAAGTCC





ACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGTTTGGTTAAG





TTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGACAAGGACTTG





GTTTTGATGACTCACTTCATCACTAAGAACATCACTGACAACCACGGT





AAGAACAAAACTACTGTTCACGGTTACTTCTCTTCCATCTTCCACGGT





GGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTCCCAGAGTTG





GGCATCAAGAAAACTGACTGTAAAGAGTTCAGTTGGATCGACACAACA





ATCTTCTACAGTGGTGTTGTAAACTTCAACACTGCCAACTTTAAGAAA





GAGATCTTGTTGGACAGGTCCGCTGGTAAAAAGACAGCTTTCTCCATT





AAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCTATGGTTAAG





ATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGTATGTACGTT





TTGTACCCATACGGTGGCATCATGGAAGAAATCTCCGAGTCCGCTATT





CCGTTCCCACACCGAGCTGGTATCATGTACGAGTTGTGGTACACTGCT





TCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAACTGGGTTAGA





TCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAACCCAAGATTG





GCGTACTTGAACTACAGAGACTTGGACTTGGGCAAGACCAACCACGCA





TCCCCTAACAATTACACACAGGCTAGAATCTGGGGTGAAAAGTACTTC





GGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAGGTTGACCCT





AACAACTTCTTCAGGAACGAGCAGTCCATCCCACCATTGGTCAATTAA








SEQ ID NO: 32


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCCAAC





AACGTTGCTAACCCTAAGTTGGTATACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCCTTGGTTATCGTTACTCCCTCCAACAACGCACAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCCTTCGTTGTTGTTGACCTTAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAAAGCGGTGCTACATTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACCTCTCCTTCCCC





GGTGGTTACTGTCCAACCGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGACTTGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGAGAGGACTTGTTCTGGGCTATTCGTGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGCTTGTTGCTGTTCCC





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAAACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTC





CCAGAGTTGGGCATCAAGAAAACTGACTGTAAAGAGTTCAGTTGGATC





GACACAACAATCTTCTACAGTGGTGTTGTAAACTTCAACACTGCCAAC





TTTAAGAAAGAGATCTTGTTGGACAGGTCCGCTGGTAAAAAGACAGCT





TTCTCCATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGCATCATGGAAGAAATCTCCGAG





TCCGCTATTCCGTTCCCACACCGAGCTGGTATCATGTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCGTACTTGAACTACAGAGACTTGGACTTGGGCAAGACC





AACCACGCATCCCCTAACGATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGGAACGAGCAGTCCATCCCACCATTG





GTCAATTAA





SEQ ID NO: 33


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCCAAC





AACGTTGCTAACCCTAAGTTGGTATACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGGACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCCTTGGTTATCGTTACTCCCTCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCCTTCGTTGTTGTTGACCTTAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAAAGCGGTGCTACATTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACCTCTCCTTCCCC





GGTGGTTACTGTCCAACCGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGACTTGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGAGAGGACTTGTTCTGGGCTATTCGTGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGCTTGTTGCTGTTCCC





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAAACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTC





CCAGAGTTGGGCATCAAGAAAACTGACTGTAAAGAGTTCAGTTGGATC





GACACAACAATCTTCTACAGTGGTGTTGTAAACTTCAACACTGCCAAC





TTTAAGAAAGAGATCTTGTTGGACAGGTCCGCTGGTAAAAAGACAGCT





TTCTCCATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGCATCATGGAAGAAATCTCCGAG





TCCGCTATTCCGTTCCCACACCGAGCTGGTATCATGTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCGTACTTGAACTACAGAGACTTGGACTTGGGCAAGACC





AACCACGCATCCCCTAACGATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGGAACGAGCAGTCCATCCCACCATTG





GTCAATTAA





SEQ ID NO: 34


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCCAAC





AACGTTGCTAACCCTAAGTTGGTATACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCCTTGGTTATCGTTACTCCCTCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCCTTCGTTGTTGTTGACCTTAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAAAGCGGTGCTACATTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGGACCTCTCCTTCCCC





GGTGGTTACTGTCCAACCGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGACTTGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGAGAGGACTTGTTCTGGGCTATTCGTGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGCTTGTTGCTGTTCCC





AAGTCCTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAAACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTC





CCAGAGTTGGGCATCAAGAAAACTGACTGTAAAGAGTTCAGTTGGATC





GACACAACAATCTTCTACAGTGGTGTTGTAAACTTCAACACTGCCAAC





TTTAAGAAAGAGATCTTGTTGGACAGGTCCGCTGGTAAAAAGACAGCT





TTCTCCATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGCATCATGGAAGAAATCTCCGAG





TCCGCTATTCCGTTCCCACACCGAGCTGGTATCATGTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCGTACTTGAACTACAGAGACTTGGACTTGGGCAAGACC





AACCACGCATCCCCTAACAATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGGAACGAGCAGTCCATCCCACCATTG





GTCAATTAA





SEQ ID NO: 35


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGGACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAATCAGGTGCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACCCCACACCT





AACACTACTGTTCACGGTTACTTCTCTTCCATCTTCCACGGTGGTGTT





GATTCCTTGGTTGATTTGATGAACAAGTCTTTCCCAGAGTTGGGTATC





AAGAAAACTGACTGTAAAGAGTTCTCCTGGATCGACACAACAATCTTC





TACTCCGGTGTTGTTAACTTCAACACTGCTAACTTTAAGAAAGAGATC





TTGTTGGACAGATCCGCTGGTAAAAAGACTGCTTTCTCCATTAAGTTG





GACTACGTTAAGAAGCCAATCCCAGAGACTGCTATGGTTAAGATTTTG





GAGAAGTTGTACGAAGAGGACGTTGGTGCTGGTATGTACGTTTTGTAC





CCATACGGTGGTATCATGGAAGAAATCTCCGAGTCCGCTATTCCATTC





CCACACAGAGCTGGTATTATGTACGAGTTGTGGTACACTGCTTCTTGG





GAGAAGCAAGAGGACAACGAAAAGCACATCAACTGGGTTAGATCCGTT





TACAACTTCACTACTCCTTACGTTTCCCAGAACCCAAGATTGGCTTAC





TTGAACTACAGAGACTTGGACTTGGGTAAGACTAACCACGCTTCCCCA





AACAATTACACACAGGCTAGAATCTGGGGTGAAAAGTACTTCGGAAAG





AACTTTAACAGATTGGTTAAGGTTAAGACTAAGGTTGACCCTAACAAC





TTCTTCAGAAACGAGCAGTCCATCCCACCATTGGTCAATTAA





SEQ ID NO: 36


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCCAAC





GCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTACATGTCC





TTGTTGGACTCCACAATCCAGAACTTGAGATTCATCTCCGACACTACT





CCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCACATCCAG





GCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGAACAAGA





TCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAGGTTCCA





TTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATCGACGTT





CACTCCCAAACTGCTTGGGTTGAATCAGGTGCTACTTTGGGTGAAGTT





TACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCAGGTGGT





TACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGTGGTTAC





GGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATCATCGAC





GCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAGTCCATG





GGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAGAACTTC





GGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCATCCAAG





TCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGTTTGGTT





AAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGACAAGGAC





TTGGTTTTGATGACTCACTTCATCACTAAGAACCCCACACCTAACACT





ACTGTTCACGGTTACTTCTCTTCCATCTTCCACGGTGGTGTTGATTCC





TTGGTTGATTTGATGAACAAGTCTTTCCCAGAGTTGGGTATCAAGAAA





ACTGACTGTAAAGAGTTCTCCTGGATCGACACAACAATCTTCTACTCC





GGTGTTGTTAACTTCAACACTGCTAACTTTAAGAAAGAGATCTTGTTG





GACAGATCCGCTGGTAAAAAGACTGCTTTCTCCATTAAGTTGGACTAC





GTTAAGAAGCCAATCCCAGAGACTGCTATGGTTAAGATTTTGGAGAAG





TTGTACGAAGAGGACGTTGGTGCTGGTATGTACGTTTTGTACCCATAC





GGTGGTATCATGGAAGAAATCTCCGAGTCCGCTATTCCATTCCCACAC





AGAGCTGGTATTATGTACGAGTTGTGGTACACTGCTTCTTGGGAGAAG





CAAGAGGACAACGAAAAGCACATCAACTGGGTTAGATCCGTTTACAAC





TTCACTACTCCTTACGTTTCCCAGAACCCAAGATTGGCTTACTTGAAC





TACAGAGACTTGGACTTGGGTAAGACTAACCACGCTTCCCCAAACAAT





TACACACAGGCTAGAATCTGGGGTGAAAAGTACTTCGGAAAGAACTTT





AACAGATTGGTTAAGGTTAAGACTAAGGTTGACCCTAACAACTTCTTC





AGAAACGAGCAGTCCATCCCACCATTGGTCAATTAA





SEQ ID NO: 37


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCCAAC





GCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTACATGTCC





TTGTTGGACTCCACAATCCAGAACTTGAGATTCATCTCCGACACTACT





CCAAAGCCATTGGTTATCGTTACTCCATCCAACAACGCACACATCCAG





GCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGAACAAGA





TCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAGGTTCCA





TTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATCGACGTT





CACTCCCAAACTGCTTGGGTTGAATCAGGTGCTACTTTGGGTGAAGTT





TACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCAGGTGGT





TACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGTGGTTAC





GGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATCATCGAC





GCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAGTCCATG





GGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAGAACTTC





GGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCATCCAAG





TCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGTTTGGTT





AAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGACAAGGAC





TTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGACAACCAC





GGTAAGAACAAGACTACTGTTCACGGTTACTTCTCTTCCATCTTCCAC





GGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTCCCAGAG





TTGGGTATCAAGAAAACTGACTGTAAAGAGTTCTCCTGGATCGACACA





ACAATCTTCTACTCCGGTGTTGTTAACTTCAACACTGCTAACTTTAAG





AAAGAGATCTTGTTGGACAGATCCGCTGGTAAAAAGACTGCTTTCTCC





ATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCTATGGTT





AAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGTATGTAC





GTTTTGTACCCATACGGTGGTATCATGGAAGAAATCTCCGAGTCCGCT





ATTCCATTCCCACACAGAGCTGGTATTATGTACGAGTTGTGGTACACT





GCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAACTGGGTT





AGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAACCCAAGA





TTGGCTTACTTGAACTACAGAGACTTGGACTTGGGTAAGACTAACCAC





GCTTCCCCAAACAATTACACACAGGCTAGAATCTGGGGTGAAAAGTAC





TTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAGGTTGAC





CCTAACAACTTCTTCAGAAACGAGCAGTCCATCCCACCATTGGTCAAT





TAA





SEQ ID NO: 38


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGGACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAATCAGGTGCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGGACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





AAGTCCTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAGACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTC





CCAGAGTTGGGTATCAAGAAAACTGACTGTAAAGAGTTCTCCTGGATC





GACACAACAATCTTCTACTCCGGTGTTGTTAACTTCAACACTGCTAAC





TTTAAGAAAGAGATCTTGTTGGACAGATCCGCTGGTAAAAAGACTGCT





TTCTCCATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGTATCATGGAAGAAATCTCCGAG





TCCGCTATTCCATTCCCACACAGAGCTGGTATTATGTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCTTACTTGAACTACAGAGACTTGGACTTGGGTAAGACT





AACCACGCTTCCCCAAACAATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGAAACGAGCAGTCCATCCCACCATTG





GTCAATTAA





SEQ ID NO: 39


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGGACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAATCAGGTGCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAGACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTC





CCAGAGTTGGGTATCAAGAAAACTGACGCTAAAGAGTTCTCCTGGATC





GACACAACAATCTTCTACTCCGGTGTTGTTAACTTCAACACTGCTAAC





TTTAAGAAAGAGATCTTGTTGGACAGATCCGCTGGTAAAAAGACTGCT





TTCTCCATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGTATCATGGAAGAAATCTCCGAG





TCCGCTATTCCATTCCCACACAGAGCTGGTATTATGTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCTTACTTGAACTACAGAGACTTGGACTTGGGTAAGACT





AACCACGCTTCCCCAAACAATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGAAACGAGCAGTCCATCCCACCATTG





GTCAATTAA





SEQ ID NO: 40


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGGACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAATCAGGTGCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAGACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTC





CCAGAGTTGGGTATCAAGAAAACTGACACTAAAGAGTTCTCCTGGATC





GACACAACAATCTTCTACTCCGGTGTTGTTAACTTCAACACTGCTAAC





TTTAAGAAAGAGATCTTGTTGGACAGATCCGCTGGTAAAAAGACTGCT





TTCTCCATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGTATCATGGAAGAAATCTCCGAG





TCCGCTATTCCATTCCCACACAGAGCTGGTATTATGTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCTTACTTGAACTACAGAGACTTGGACTTGGGTAAGACT





AACCACGCTTCCCCAAACAATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGAAACGAGCAGTCCATCCCACCATTG





GTCAATTAA





SEQ ID NO: 41


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGGACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAATCAGGTGCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAGACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTC





CCAGAGTTGGGTATCAAGAAAACTGACCCTAAAGAGTTCTCCTGGATC





GACACAACAATCTTCTACTCCGGTGTTGTTAACTTCAACACTGCTAAC





TTTAAGAAAGAGATCTTGTTGGACAGATCCGCTGGTAAAAAGACTGCT





TTCTCCATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGTATCATGGAAGAAATCTCCGAG





TCCGCTATTCCATTCCCACACAGAGCTGGTATTATGTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCTTACTTGAACTACAGAGACTTGGACTTGGGTAAGACT





AACCACGCTTCCCCAAACAATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGAAACGAGCAGTCCATCCCACCATTG





GTCAATTAA





SEQ ID NO: 42


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGGACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAATCAGGTGCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAGACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTC





CCAGAGTTGGGTATCAAGAAAACTGACGTCAAAGAGTTCTCCTGGATC





GACACAACAATCTTCTACTCCGGTGTTGTTAACTTCAACACTGCTAAC





TTTAAGAAAGAGATCTTGTTGGACAGATCCGCTGGTAAAAAGACTGCT





TTCTCCATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGTATCATGGAAGAAATCTCCGAG





TCCGCTATTCCATTCCCACACAGAGCTGGTATTATGTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCTTACTTGAACTACAGAGACTTGGACTTGGGTAAGACT





AACCACGCTTCCCCAAACAATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGAAACGAGCAGTCCATCCCACCATTG





GTCAATTAA





SEQ ID NO: 43


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGGACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAATCAGGTGCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAC





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAGACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTC





CCAGAGTTGGGTATCAAGAAAACTGACTGTAAAGAGTTCTCCTGGATC





GACACAACAATCTTCTACTCCGGTGTTGTTAACTTCAACACTGCTAAC





TTTAAGAAAGAGATCTTGTTGGACAGATCCGCTGGTAAAAAGACTGCT





TTCTCCATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGTATCATGGAAGAAATCTCCGAG





TCCGCTATTCCATTCCCACACAGAGCTGGTATTATGTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCTTACTTGAACTACAGAGACTTGGACTTGGGTAAGACT





AACCACGCTTCCCCAAACAATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGAAACGAGCAGTCCATCCCACCATTG





GTCAATTAA





SEQ ID NO: 44


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAATCAGGTGCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACCCCACACCT





AACACTACTGTTCACGGTTACTTCTCTTCCATCTTCCACGGTGGTGTT





GATTCCTTGGTTGATTTGATGAACAAGTCTTTCCCAGAGTTGGGTATC





AAGAAAACTGACTGTAAAGAGTTCTCCTGGATCGACACAACAATCTTC





TACTCCGGTGTTGTTAACTTCAACACTGCTAACTTTAAGAAAGAGATC





TTGTTGGACAGATCCGCTGGTAAAAAGACTGCTTTCTCCATTAAGTTG





GACTACGTTAAGAAGCCAATCCCAGAGACTGCTATGGTTAAGATTTTG





GAGAAGTTGTACGAAGAGGACGTTGGTGCTGGTATGTACGTTTTGTAC





CCATACGGTGGTATCATGGAAGAAATCTCCGAGTCCGCTATTCCATTC





CCACACAGAGCTGGTATTATGTACGAGTTGTGGTACACTGCTTCTTGG





GAGAAGCAAGAGGACAACGAAAAGCACATCAACTGGGTTAGATCCGTT





TACAACTTCACTACTCCTTACGTTTCCCAGAACCCAAGATTGGCTTAC





TTGAACTACAGAGACTTGGACTTGGGTAAGACTAACCACGCTTCCCCA





AACAATTACACACAGGCTAGAATCTGGGGTGAAAAGTACTTCGGAAAG





AACTTTAACAGATTGGTTAAGGTTAAGACTAAGGTTGACCCTAACAAC





TTCTTCAGAAACGAGCAGTCCATCCCACCATTGGTCAATTAA





SEQ ID NO: 45


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAATCAGGTGCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACCCCACACCT





AACACTACTGTTCACGGTTACTTCTCTTCCATCTTCCACGGTGGTGTT





GATTCCTTGGTTGATTTGATGAACAAGTCTTTCCCAGAGTTGGGTATC





AAGAAAACTGACGCTAAAGAGTTCTCCTGGATCGACACAACAATCTTC





TACTCCGGTGTTGTTAACTTCAACACTGCTAACTTTAAGAAAGAGATC





TTGTTGGACAGATCCGCTGGTAAAAAGACTGCTTTCTCCATTAAGTTG





GACTACGTTAAGAAGCCAATCCCAGAGACTGCTATGGTTAAGATTTTG





GAGAAGTTGTACGAAGAGGACGTTGGTGCTGGTATGTACGTTTTGTAC





CCATACGGTGGTATCATGGAAGAAATCTCCGAGTCCGCTATTCCATTC





CCACACAGAGCTGGTATTATGTACGAGTTGTGGTACACTGCTTCTTGG





GAGAAGCAAGAGGACAACGAAAAGCACATCAACTGGGTTAGATCCGTT





TACAACTTCACTACTCCTTACGTTTCCCAGAACCCAAGATTGGCTTAC





TTGAACTACAGAGACTTGGACTTGGGTAAGACTAACCACGCTTCCCCA





AACAATTACACACAGGCTAGAATCTGGGGTGAAAAGTACTTCGGAAAG





AACTTTAACAGATTGGTTAAGGTTAAGACTAAGGTTGACCCTAACAAC





TTCTTCAGAAACGAGCAGTCCATCCCACCATTGGTCAATTAA





SEQ ID NO: 46


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAATCAGGTGCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACCCCACACCT





AACACTACTGTTCACGGTTACTTCTCTTCCATCTTCCACGGTGGTGTT





GATTCCTTGGTTGATTTGATGAACAAGTCTTTCCCAGAGTTGGGTATC





AAGAAAACTGACACTAAAGAGTTCTCCTGGATCGACACAACAATCTTC





TACTCCGGTGTTGTTAACTTCAACACTGCTAACTTTAAGAAAGAGATC





TTGTTGGACAGATCCGCTGGTAAAAAGACTGCTTTCTCCATTAAGTTG





GACTACGTTAAGAAGCCAATCCCAGAGACTGCTATGGTTAAGATTTTG





GAGAAGTTGTACGAAGAGGACGTTGGTGCTGGTATGTACGTTTTGTAC





CCATACGGTGGTATCATGGAAGAAATCTCCGAGTCCGCTATTCCATTC





CCACACAGAGCTGGTATTATGTACGAGTTGTGGTACACTGCTTCTTGG





GAGAAGCAAGAGGACAACGAAAAGCACATCAACTGGGTTAGATCCGTT





TACAACTTCACTACTCCTTACGTTTCCCAGAACCCAAGATTGGCTTAC





TTGAACTACAGAGACTTGGACTTGGGTAAGACTAACCACGCTTCCCCA





AACAATTACACACAGGCTAGAATCTGGGGTGAAAAGTACTTCGGAAAG





AACTTTAACAGATTGGTTAAGGTTAAGACTAAGGTTGACCCTAACAAC





TTCTTCAGAAACGAGCAGTCCATCCCACCATTGGTCAATTAA





SEQ ID NO: 47


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAATCAGGTGCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACCCCACACCT





AACACTACTGTTCACGGTTACTTCTCTTCCATCTTCCACGGTGGTGTT





GATTCCTTGGTTGATTTGATGAACAAGTCTTTCCCAGAGTTGGGTATC





AAGAAAACTGACCCTAAAGAGTTCTCCTGGATCGACACAACAATCTTC





TACTCCGGTGTTGTTAACTTCAACACTGCTAACTTTAAGAAAGAGATC





TTGTTGGACAGATCCGCTGGTAAAAAGACTGCTTTCTCCATTAAGTTG





GACTACGTTAAGAAGCCAATCCCAGAGACTGCTATGGTTAAGATTTTG





GAGAAGTTGTACGAAGAGGACGTTGGTGCTGGTATGTACGTTTTGTAC





CCATACGGTGGTATCATGGAAGAAATCTCCGAGTCCGCTATTCCATTC





CCACACAGAGCTGGTATTATGTACGAGTTGTGGTACACTGCTTCTTGG





GAGAAGCAAGAGGACAACGAAAAGCACATCAACTGGGTTAGATCCGTT





TACAACTTCACTACTCCTTACGTTTCCCAGAACCCAAGATTGGCTTAC





TTGAACTACAGAGACTTGGACTTGGGTAAGACTAACCACGCTTCCCCA





AACAATTACACACAGGCTAGAATCTGGGGTGAAAAGTACTTCGGAAAG





AACTTTAACAGATTGGTTAAGGTTAAGACTAAGGTTGACCCTAACAAC





TTCTTCAGAAACGAGCAGTCCATCCCACCATTGGTCAATTAA





SEQ ID NO: 48


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAATCAGGTGCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACCCCACACCT





AACACTACTGTTCACGGTTACTTCTCTTCCATCTTCCACGGTGGTGTT





GATTCCTTGGTTGATTTGATGAACAAGTCTTTCCCAGAGTTGGGTATC





AAGAAAACTGACGTCAAAGAGTTCTCCTGGATCGACACAACAATCTTC





TACTCCGGTGTTGTTAACTTCAACACTGCTAACTTTAAGAAAGAGATC





TTGTTGGACAGATCCGCTGGTAAAAAGACTGCTTTCTCCATTAAGTTG





GACTACGTTAAGAAGCCAATCCCAGAGACTGCTATGGTTAAGATTTTG





GAGAAGTTGTACGAAGAGGACGTTGGTGCTGGTATGTACGTTTTGTAC





CCATACGGTGGTATCATGGAAGAAATCTCCGAGTCCGCTATTCCATTC





CCACACAGAGCTGGTATTATGTACGAGTTGTGGTACACTGCTTCTTGG





GAGAAGCAAGAGGACAACGAAAAGCACATCAACTGGGTTAGATCCGTT





TACAACTTCACTACTCCTTACGTTTCCCAGAACCCAAGATTGGCTTAC





TTGAACTACAGAGACTTGGACTTGGGTAAGACTAACCACGCTTCCCCA





AACAATTACACACAGGCTAGAATCTGGGGTGAAAAGTACTTCGGAAAG





AACTTTAACAGATTGGTTAAGGTTAAGACTAAGGTTGACCCTAACAAC





TTCTTCAGAAACGAGCAGTCCATCCCACCATTGGTCAATTAA





SEQ ID NO: 49


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCCAAC





GCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTACATGTCC





TTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGACACTACT





CCAAAGCCATTGGTTATCGTTACTCCATCCAACAACGCACACATCCAG





GCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGAACAAGA





TCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAGGTTCCA





TTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATCGACGTT





CACTCCCAAACTGCTTGGGTTGAATCAGGTGCTACTTTGGGTGAAGTT





TACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCAGGTGGT





TACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGTGGTTAC





GGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATCATCGAC





GCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAGTCCATG





GGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAGAACTTC





GGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCATCCAAG





TCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGTTTGGTT





AAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGACAAGGAC





TTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGACAACCAC





GGTAAGAACAAGACTACTGTTCACGGTTACTTCTCTTCCATCTTCCAC





GGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTCCCAGAG





TTGGGTATCAAGAAAACTGACTGTAAAGAGTTCTCCTGGATCGACACA





ACAATCTTCTACTCCGGTGTTGTTAACTTCAACACTGCTAACTTTAAG





AAAGAGATCTTGTTGGACAGATCCGCTGGTAAAAAGACTGCTTTCTCC





ATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCTATGGTT





AAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGTATGTAC





GTTTTGTACCCATACGGTGGTATCATGGAAGAAATCTCCGAGTCCGCT





ATTCCATTCCCACACAGAGCTGGTATTATGTACGAGTTGTGGTACACT





GCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAACTGGGTT





AGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAACCCAAGA





TTGGCTTACTTGAACTACAGAGACTTGGACTTGGGTAAGACTAACCAC





GCTTCCCCAAACAATTACACACAGGCTAGAATCTGGGGTGAAAAGTAC





TTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAGGTTGAC





CCTAACAACTTCTTCAGAAACGAGCAGTCCATCCCACCATTGGTCAAT





TAA





SEQ ID NO: 50


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCCAAC





GCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTACATGTCC





TTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGACACTACT





CCAAAGCCATTGGTTATCGTTACTCCATCCAACAACGCACACATCCAG





GCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGAACAAGA





TCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAGGTTCCA





TTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATCGACGTT





CACTCCCAAACTGCTTGGGTTGAATCAGGTGCTACTTTGGGTGAAGTT





TACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCAGGTGGT





TACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGTGGTTAC





GGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATCATCGAC





GCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAGTCCATG





GGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAGAACTTC





GGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCATCCAAG





TCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGTTTGGTT





AAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGACAAGGAC





TTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGACAACCAC





GGTAAGAACAAGACTACTGTTCACGGTTACTTCTCTTCCATCTTCCAC





GGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTCCCAGAG





TTGGGTATCAAGAAAACTGACTGTAAAGAGTTCTCCTGGATCGACACA





ACAATCTTCTACTCCGGTGTTGTTAACTTCAACACTGCTAACTTTAAG





AAAGAGATCTTGTTGGACAGATCCGCTGGTAAAAAGACTGCTTTCTCC





ATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCTATGGTT





AAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGTATGTAC





GTTTTGTACCCATACGGTGGTATCATGGAAGAAATCTCCGAGTCCGCT





ATTCCATTCCCACACAGAGCTGGTATTATGTACGAGTTGTGGTACACT





GCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAACTGGGTT





AGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAACCCAAGA





TTGGCTTACTTGAACTACAGAGACTTGGACTTGGGTAAGACTAACCAC





GCTTCCCCAAACGATTACACACAGGCTAGAATCTGGGGTGAAAAGTAC





TTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAGGTTGAC





CCTAACAACTTCTTCAGAAACGAGCAGTCCATCCCACCATTGGTCAAT





TAA





SEQ ID NO: 51


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAATCAGGTGCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGGACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





AAGTCCTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGAC





AACCACGGTAAGAACAAGACTACTGTTCACGGTTACTTCTCTTCCATC





TTCCACGGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTC





CCAGAGTTGGGTATCAAGAAAACTGACTGTAAAGAGTTCTCCTGGATC





GACACAACAATCTTCTACTCCGGTGTTGTTAACTTCAACACTGCTAAC





TTTAAGAAAGAGATCTTGTTGGACAGATCCGCTGGTAAAAAGACTGCT





TTCTCCATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCT





ATGGTTAAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGT





ATGTACGTTTTGTACCCATACGGTGGTATCATGGAAGAAATCTCCGAG





TCCGCTATTCCATTCCCACACAGAGCTGGTATTATGTACGAGTTGTGG





TACACTGCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAAC





TGGGTTAGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAAC





CCAAGATTGGCTTACTTGAACTACAGAGACTTGGACTTGGGTAAGACT





AACCACGCTTCCCCAAACAATTACACACAGGCTAGAATCTGGGGTGAA





AAGTACTTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAG





GTTGACCCTAACAACTTCTTCAGAAACGAGCAGTCCATCCCACCATTG





GTCAATTAA





SEQ ID NO: 52


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCCAAC





GCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTACATGTCC





TTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGACACTACT





CCAAAGCCATTGGTTATCGTTACTCCATCCAACAACGCACACATCCAG





GCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGAACAAGA





TCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAGGTTCCA





TTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATCGACGTT





CACTCCCAAACTGCTTGGGTTGAATCAGGTGCTACTTTGGGTGAAGTT





TACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCAGGTGGT





TACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGTGGTTAC





GGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATCATCGAC





GCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAGTCCATG





GGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAGAACTTC





GGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCATCCAAG





TCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGTTTGGTT





AAGTIGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGACAAGGAC





TTGGTTTTGATGACTCACTTCATCACTAAGAACCCCACACCTAACACT





ACTGTTCACGGTTACTTCTCTTCCATCTTCCACGGTGGTGTTGATTCC





TTGGTTGATTTGATGAACAAGTCTTTCCCAGAGTTGGGTATCAAGAAA





ACTGACTGTAAAGAGTTCTCCTGGATCGACACAACAATCTTCTACTCC





GGTGTTGTTAACTTCAACACTGCTAACTTTAAGAAAGAGATCTTGTTG





GACAGATCCGCTGGTAAAAAGACTGCTTTCTCCATTAAGTTGGACTAC





GTTAAGAAGCCAATCCCAGAGACTGCTATGGTTAAGATTTTGGAGAAG





TTGTACGAAGAGGACGTTGGTGCTGGTATGTACGTTTTGTACCCATAC





GGTGGTATCATGGAAGAAATCTCCGAGTCCGCTATTCCATTCCCACAC





AGAGCTGGTATTATGTACGAGTTGTGGTACACTGCTTCTTGGGAGAAG





CAAGAGGACAACGAAAAGCACATCAACTGGGTTAGATCCGTTTACAAC





TTCACTACTCCTTACGTTTCCCAGAACCCAAGATTGGCTTACTTGAAC





TACAGAGACTTGGACTTGGGTAAGACTAACCACGCTTCCCCAAACAAT





TACACACAGGCTAGAATCTGGGGTGAAAAGTACTTCGGAAAGAACTTT





AACAGATTGGTTAAGGTTAAGACTAAGGTTGACCCTAACAACTTCTTC





AGAAACGAGCAGTCCATCCCACCATTGGTCAATTAA





SEQ ID NO: 53


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAATCAGGTGCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGGACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAG





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





AAGTCCTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACCCCACACCT





AACACTACTGTTCACGGTTACTTCTCTTCCATCTTCCACGGTGGTGTT





GATTCCTTGGTTGATTTGATGAACAAGTCTTTCCCAGAGTTGGGTATC





AAGAAAACTGACTGTAAAGAGTTCTCCTGGATCGACACAACAATCTTC





TACTCCGGTGTTGTTAACTTCAACACTGCTAACTTTAAGAAAGAGATC





TTGTTGGACAGATCCGCTGGTAAAAAGACTGCTTTCTCCATTAAGTTG





GACTACGTTAAGAAGCCAATCCCAGAGACTGCTATGGTTAAGATTTTG





GAGAAGTTGTACGAAGAGGACGTTGGTGCTGGTATGTACGTTTTGTAC





CCATACGGTGGTATCATGGAAGAAATCTCCGAGTCCGCTATTCCATTC





CCACACAGAGCTGGTATTATGTACGAGTTGTGGTACACTGCTTCTTGG





GAGAAGCAAGAGGACAACGAAAAGCACATCAACTGGGTTAGATCCGTT





TACAACTTCACTACTCCTTACGTTTCCCAGAACCCAAGATTGGCTTAC





TTGAACTACAGAGACTTGGACTTGGGTAAGACTAACCACGCTTCCCCA





AACAATTACACACAGGCTAGAATCTGGGGTGAAAAGTACTTCGGAAAG





AACTTTAACAGATTGGTTAAGGTTAAGACTAAGGTTGACCCTAACAAC





TTCTTCAGAAACGAGCAGTCCATCCCACCATTGGTCAATTAA





SEQ ID NO: 54


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCCAAC





GCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTACATGTCC





TTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGACACTACT





CCAAAGCCATTGGTTATCGTTACTCCATCCAACAACGCACACATCCAG





GCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGAACAAGA





TCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAGGTTCCA





TTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATCGACGTT





CACTCCCAAACTGCTTGGGTTGAATCAGGTGCTACTTTGGGTGAAGTT





TACTACTGGATCAACGAGAAGAACGAGGACTTGTCCTTCCCAGGTGGT





TACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGTGGTTAC





GGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATCATCGAC





GCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAGTCCATG





GGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAGAACTTC





GGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCAAAGTCC





TCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGTTTGGTT





AAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGACAAGGAC





TTGGTTTTGATGACTCACTTCATCACTAAGAACATCACTGACAACCAC





GGTAAGAACAAGACTACTGTTCACGGTTACTTCTCTTCCATCTTCCAC





GGTGGTGTTGATTCCTTGGTTGATTTGATGAACAAGTCTTTCCCAGAG





TTGGGTATCAAGAAAACTGACTGTAAAGAGTTCTCCTGGATCGACACA





ACAATCTTCTACTCCGGTGTTGTTAACTTCAACACTGCTAACTTTAAG





AAAGAGATCTTGTTGGACAGATCCGCTGGTAAAAAGACTGCTTTCTCC





ATTAAGTTGGACTACGTTAAGAAGCCAATCCCAGAGACTGCTATGGTT





AAGATTTTGGAGAAGTTGTACGAAGAGGACGTTGGTGCTGGTATGTAC





GTTTTGTACCCATACGGTGGTATCATGGAAGAAATCTCCGAGTCCGCT





ATTCCATTCCCACACAGAGCTGGTATTATGTACGAGTTGTGGTACACT





GCTTCTTGGGAGAAGCAAGAGGACAACGAAAAGCACATCAACTGGGTT





AGATCCGTTTACAACTTCACTACTCCTTACGTTTCCCAGAACCCAAGA





TTGGCTTACTTGAACTACAGAGACTTGGACTTGGGTAAGACTAACCAC





GCTTCCCCAAACAATTACACACAGGCTAGAATCTGGGGTGAAAAGTAC





TTCGGAAAGAACTTTAACAGATTGGTTAAGGTTAAGACTAAGGTTGAC





CCTAACAACTTCTTCAGAAACGAGCAGTCCATCCCACCATTGGTCAAT





TAA





SEQ ID NO: 55


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCCAAC





GCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTACATGTCC





TTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGACACTACT





CCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCACATCCAG





GCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGAACAAGA





TCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAGGTTCCA





TTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATCGACGTT





CACTCCCAAACTGCTTGGGTTGAATCAGGTGCTACTTTGGGTGAAGTT





TACTACTGGATCAACGAGAAGAACGAGGACTTGTCCTTCCCAGGTGGT





TACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGTGGTTAC





GGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATCATCGAC





GCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAGTCCATG





GGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAGAACTTC





GGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCAAAGTCC





TCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGTTTGGTT





AAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGACAAGGAC





TTGGTTTTGATGACTCACTTCATCACTAAGAACCCCACACCTAACACT





ACTGTTCACGGTTACTTCTCTTCCATCTTCCACGGTGGTGTTGATTCC





TTGGTTGATTTGATGAACAAGTCTTTCCCAGAGTTGGGTATCAAGAAA





ACTGACTGTAAAGAGTTCTCCTGGATCGACACAACAATCTTCTACTCC





GGTGTTGTTAACTTCAACACTGCTAACTTTAAGAAAGAGATCTTGTTG





GACAGATCCGCTGGTAAAAAGACTGCTTTCTCCATTAAGTTGGACTAC





GTTAAGAAGCCAATCCCAGAGACTGCTATGGTTAAGATTTTGGAGAAG





TTGTACGAAGAGGACGTTGGTGCTGGTATGTACGTTTTGTACCCATAC





GGTGGTATCATGGAAGAAATCTCCGAGTCCGCTATTCCATTCCCACAC





AGAGCTGGTATTATGTACGAGTTGTGGTACACTGCTTCTTGGGAGAAG





CAAGAGGACAACGAAAAGCACATCAACTGGGTTAGATCCGTTTACAAC





TTCACTACTCCTTACGTTTCCCAGAACCCAAGATTGGCTTACTTGAAC





TACAGAGACTTGGACTTGGGTAAGACTAACCACGCTTCCCCAAACAAT





TACACACAGGCTAGAATCTGGGGTGAAAAGTACTTCGGAAAGAACTTT





AACAGATTGGTTAAGGTTAAGACTAAGGTTGACCCTAACAACTTCTTC





AGAAACGAGCAGTCCATCCCACCATTGGTCAATTAA





SEQ ID NO: 56


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGGACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAATCAGGTGCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAC





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTTGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACCCCACACCT





AACACTACTGTTCACGGTTACTTCTCTTCCATCTTCCACGGTGGTGTT





GATTCCTTGGTTGATTTGATGAACAAGTCTTTCCCAGAGTTGGGTATC





AAGAAAACTGACTGTAAAGAGTTCTCCTGGATCGACACAACAATCTTC





TACTCCGGTGTTGTTAACTTCAACACTGCTAACTTTAAGAAAGAGATC





TTGTTGGACAGATCCGCTGGTAAAAAGACTGCTTTCTCCATTAAGTTG





GACTACGTTAAGAAGCCAATCCCAGAGACTGCTATGGTTAAGATTTTG





GAGAAGTTGTACGAAGAGGACGTTGGTGCTGGTATGTACGTTTTGTAC





CCATACGGTGGTATCATGGAAGAAATCTCCGAGTCCGCTATTCCATTC





CCACACAGAGCTGGTATTATGTACGAGTTGTGGTACACTGCTTCTTGG





GAGAAGCAAGAGGACAACGAAAAGCACATCAACTGGGTTAGATCCGTT





TACAACTTCACTACTCCTTACGTTTCCCAGAACCCAAGATTGGCTTAC





TTGAACTACAGAGACTTGGACTTGGGTAAGACTAACCACGCTTCCCCA





AACAATTACACACAGGCTAGAATCTGGGGTGAAAAGTACTTCGGAAAG





AACTTTAACAGATTGGTTAAGGTTAAGACTAAGGTTGACCCTAACAAC





TTCTTCAGAAACGAGCAGTCCATCCCACCATTGGTCAATTAA





SEQ ID NO: 57


AACGCAAGAGAAAACTTCTTGAAGTGTTTTTCCAAGCACATCCCAAAC





AACGTTGCTAACCCTAAGTTGGTTTACACTCAGCACGACCAGTTGTAC





ATGTCCTTGTTGAACTCCACAATCCAGAACTTGAGATTCATCTCCGAC





ACTACTCCAAAGCCATTGGTTATCGTTACTCCATCCAACAACTCCCAC





ATCCAGGCTACTATCTTGTGTTCCAAGAAGGTTGGATTGCAGATCAGA





ACAAGATCCGGTGGTCATGACGCTGAAGGTATGTCCTACATTTCCCAG





GTTCCATTCGTTGTTGTTGACTTGAGAAACATGCACTCCATCAAGATC





GACGTTCACTCCCAAACTGCTTGGGTTGAATCAGGTGCTACTTTGGGT





GAAGTTTACTACTGGATCAACGAGAAGAACGAGAACTTGTCCTTCCCA





GGTGGTTACTGTCCAACTGTTGGTGTTGGTGGTCACTTTTCTGGTGGT





GGTTACGGTGCTTTGATGAGAAACTACGGATTGGCTGCTGACAACATC





ATCGACGCTCACTTGGTTAACGTTGACGGTAAGGTTTTGGACAGAAAG





TCCATGGGTGAGGACTTGTTCTGGGCTATTAGAGGTGGTGGTGGTGAC





AACTTCGGTATTATTGCTGCTTGGAAGATCAAGTTGGTTGCTGTTCCA





TCCAAGTCCACTATCTTCTCCGTTAAGAAAAACATGGAAATCCACGGT





TTGGTTAAGTIGTTTAACAAGTGGCAGAACATTGCTTACAAGTACGAC





AAGGACTTGGTTTTGATGACTCACTTCATCACTAAGAACCCCACACCT





AACACTACTGTTCACGGTTACTTCTCTTCCATCTTCCACGGTGGTGTT





GATTCCTTGGTTGATTTGATGAACAAGTCTTTCCCAGAGTTGGGTATC





AAGAAAACTGACTGTAAAGAGTTCTCCTGGATCGACACAACAATCTTC





TACTCCGGTGTTGTTAACTTCAACACTGCTAACTTTAAGAAAGAGATC





TTGTTGGACAGATCCGCTGGTAAAAAGACTGCTTTCTCCATTAAGTTG





GACTACGTTAAGAAGCCAATCCCAGAGACTGCTATGGTTAAGATTTTG





GAGAAGTTGTACGAAGAGGACGTTGGTGCTGGTATGTACGTTTTGTAC





CCATACGGTGGTATCATGGAAGAAATCTCCGAGTCCGCTATTCCATTC





CCACACAGAGCTGGTATTATGTACGAGTTGTGGTACACTGCTTCTTGG





GAGAAGCAAGAGGACAACGAAAAGCACATCAACTGGGTTAGATCCGTT





TACAACTTCACTACTCCTTACGTTTCCCAGAACCCAAGATTGGCTTAC





TTGAACTACAGAGACTTGGACTTGGGTAAGACTAACCACGCTTCCCCA





AACAATTACACACAGGCTAGAATCTGGGGTGAAAAGTACTTCGGAAAG





AACTTTAACAGATTGGTTAAGGTTAAGACTAAGGTTGACCCTAACAAC





TTCTTCAGAAACGAGCAGTCCATCCCACCATTGGTCAATTAA





SEQ ID NO: 58


ATGAGATTTCCTTCAATTTTTACTGCAGTTTTATTCGCAGCATCCTCC





GCATTAGCTGCTCCAGTCAACACTACAACAGAAGATGAAACGGCACAA





ATTCCGGCTGAAGCTGTCATCGGTTACTCAGATTTAGAAGGGGATTTC





GATGTTGCTGTTTTGCCATTTTCCAACAGCACAAATAACGGGTTATTG





TTTATAAATACTACTATTGCCAGCATTGCTGCTAAAGAAGAAGGGGTA





TCTCTCGAGAAAAGAGAGGCTGAAGCT





SEQ ID NO: 59


ATGCAATACAAAAAGACTTTGGTTGCCTCTGCTTTGGCCGCTACTACA





TTGGCCGCCTATGCTCCATCTGAGCCTTGGTCCACTTTGACTCCAACA





GCCACTTACAGCGGTGGTGTTACCGACTACGCTTCCACCTTCGGTATT





GCCGTTCAACCAATCTCCACTACATCCAGCGCATCATCTGCAGCCACC





ACAGCTAGCTTGGAGAAGAGGGAGGCTGATGCT





SEQ ID NO: 60


ATGAGATTCCCATCCATCTTCACTGCTGTTTTGTTCGCTGCTTCTTCC





GCTTTGGCTGCTAGAGCCTATGCTCCATCTGAGCCTTGGTCCACTTTG





ACTCCAACAGCCACTTACAGCGGTGGTGTTACCGACTACGCTTCCACC





TTCGGTATTGCCGTTCAACCAATCTCCACTACATCCAGCGCATCATCT





GCAGCCACCACAGCCTCATCTgCTAGCTTGGAGAAGAGGGAGGCTGATGCT





SEQ ID NO: 61


ATGAGATTCCCATCCATCTTCACTGCTGTTTTGTTCGCTGCTTCTTCC





GCTTTGGCTGctagaGCCTATGCTCCATCTGAGCCTTGGTCCACTTTG





ACTCCAACAGCCACTTACAGCGGTGGTGTTACCGACTACGCTTCCACC





TTCGGTATTGCCGTTCAACCAATCTCCACTACATCCAGCGCATCATCT





GCAGCCACCACAGCCTCATCTgctagaGCCTATGCTCCATCTGAGCCT





TGGTCCACTTTGACTCCAACAGCCACTTACAGCGGTGGTGTTACCGAC





TACGCTTCCACCTTCGGTATTGCCGTTCAACCAATCTCCACTACATCC





AGCGCATCATCTGCAGCCACCACAGCCTCATCTgCTAGCTTGGAGAAG





AGGGAGGCTGATGCT


SEQ ID NO: 62


ATGCTTTTGCAAGCTTTCCTTTTCCTTTTGGCTGGTTTTGCAGCCAAA





ATATCTGCAGGATCCGCTAGCTTGGAGAAGAGGGAGGCTGAAGCT





SEQ ID NO: 63


ATGTTTTCTCCTATTCTAAGTCTGGAAATTATTCTCGCTTTGGCTACT





CTCCAATCAGTCTTTGCGGCTAGCTTGGAGAAGAGGGAGGCTGAAGCT








SEQ ID NO: 64


TGTACAGGAACTTAATAATTGCTACTGCCCTTACTTGCGGTGCATACA





GTGCCTACGTGCCTTCCGAACCATGGAGCACACTGACACCTGATGCTA





GCCTTGAAAGTGCCCTCAAAGATTACTCACAAACTTTTGGAATAGCTA





TTAAGTCCTTAGATGCCGACGGATCCGCTAGCTTGGAGAAGAGGGAGG





CTGAAGCT





SEQ ID NO: 65


ATGAGGCAGGTTTGGTTCTCTTGGATTGTGGGATTGTTCCTATGTTTT





TTCAACGTGTCTTCTGCTGCTAGCTTGGAGAAGAGGGAGGCTGATGCT








SEQ ID NO: 66


ATGAGGCAGGTTTGGTTCTCTTGGATTGTGGGATTGTTCCTATGTTTT





TTCAACGTGTCTTCTGCTGCTCCAGTTAACACTACTACTGAGGACGAG





ACTGCTCAGATTCCAGCTGAAGCTGTTATTGGTTACTCCGACTTGGAA





GGTGACTTCGACGTTGCTGTTTTGCCATTCTCCAACTCCACTAACAAC





GGTTTGTTGTTCATCAACACTACAATCGCTTCCATTGCTGCTAAAGAA





GAGGGAGCTAGCTTGGAGAAGAGGGAGGCTGATGCT





SEQ ID NO: 67


ATGTTCAACCTGAAAACTATTCTCATCTCAACACTTGCATCGATCGCT





GTTGCCGCTAGCTTGGAGAAGAGGGAGGCTGATGCT





SEQ ID NO: 68


TACCCATACGATGTTCCAGATTACGCT





SEQ ID NO: 69


CATCACCATCACCATCAC





SEQ ID NO: 70


CATAACCATAATCACAATCACAAC





SEQ ID NO: 71


GATCTAACATCCAAAGACGAAAGGTTGAATGAAACCTTTTTGCCATCC





GACATCCACAGGTCCATTCTCACACATAAGTGCCAAACGCAACAGGAG





GGGATACACTAGCAGCAGACCGTTGCAAACGCAGGACCTCCACTCCTC





TTCTCCTCAACACCCACTTTTGCCATCGAAAAACCAGCCCAGTTATTG





GGCTTGATTGGAGCTCGCTCATTCCAATTCCTTCTATTAGGCTACTAA





CACCATGACTTTATTAGCCTGTCTATCCTGGCCCCCCTGGCGAGGTTC





ATGTTTGTTTATTTCCGAATGCAACAAGCTCCGCATTACACCCGAACA





TCACTCCAGATGAGGGCTTTCTGAGTGTGGGGTCAAATAGTTTCATGT





TCCCCAAATGGCCCAAAACTGACAGTTTAAACGCTGTCTTGGAACCTA





ATATGACAAAAGCGTGATCTCATCCAAGATGAACTAAGTTTGGTTCGT





TGAAATGCTAACGGCCAGTTGGTCAAAAAGAAACTTCCAAAAGTCGGC





ATACCGTTTGTCTTGTTTGGTATTGATTGACGAATGCTCAAAAATAAT





CTCATTAATGCTTAGCGCAGTCTCTCTATCGCTTCTGAACCCCGGTGC





ACCTGTGCCGAAACGCAAATGGGGAAACACCCGCTTTTTGGATGATTA





TGCATTGTCTCCACATTGTATGCTTCCAAGATTCTGGTGGGAATACTG





CTGATAGCCTAACGTTCATGATCAAAATTTAACTGTTCTAACCCCTAC





TTGACAGCAATATATAAACAGAAGGAAGCTGCCCTGTCTTAAACCTTT





TTTTTTATCATCATTATTAGCTTACTTTCATAATTGCGACTGGTTCCA





ATTGACAAGCTTTTGATTTTAACGACTTTTAACGACAACTTGAGAAGA





TCAAAAAACAACTAATTATTCGAAACG





SEQ ID NO: 72


TTATTTTTTGACCGAATTCTTTTTTTCAGACCATATGACCGGTCCATC





TTCTACGGGGGGATTATCTATGCTTTGACCTCTATCTTGATTCTTTTA





TGATTCAAATCACTTTTACGTTATTTATTACTTACTGGTTATTTACTT





AGCGCCTTTTCTGAAAAACATTTACTAAAAATCATACATCGGCACTCT





CAAACACGACAGATTGTGATCAAGAAGCAGAGACAATCACCACTAAGG





TTGCACATTTGAGCCAGTAGGCTCCTAATAGAGGTTCGATACTTATTT





TGATAATACGACATATTGTCTTACCTCTGAATGTGTCAATACTCTCTC





GTTCTTCGTCTCGTCAGCTAAAAATATAACACTTCGAGTAAGATACGC





CCAATTGAAGGCTACGAGATACCAGACTATCACTAGTAGAACTTTGAC





ATCTGCTAAAGCAGATCAAATATCCATTTATCCAGAATCAATTACCTT





CCTTTAGCTTGTCGAAGGCATGAAAAAGCTACATGAAAATCCCCATCC





TTGAAGTTTTGTCAGCTTAAAGGACTCCATTTCCTAAAATTTCAAGCA





GTCCTCTCAACTAAATTTTTTTCCATTCCTCTGCACCCAGCCCTCTTC





ATCAACCGTCCAGCCTTCTCAAAAGTCCAATGTAAGTAGCCTGCAAAT





TCAGGTTACAACCCCTCAATTTTCCATCCAAGGGCGATCCTTACAAAG





TTAATATCGAACAGCAGAGACTAAGCGAGTCATCATCACCACCCAACG





ATGGTGAAAAACTTTAAGCATAGATTGATGGAGGGTGTATGGCACTTG





GCGGCTGCATTAGAGTTTGAAACTATGGGGTAATACATCACATCCGGA





ACTGATCCGACTCCGAGATCATATGCAAAGCACGTGATGTACCCCGTA





AACTGCTCGGATTATCGTTGCAATTCATCGTCTTAAACAGTACAAGAA





ACTTTATTCATGGGTCATTGGACTCTGATGAGGGGCACATTTCCCCAA





TGATTTTTTGGGAAAGAAAGCCGTAAGAGGACAGTTAAGCGAAAGAGA





CAAGACAACGAACAGCAAAAGTGACAGCTGTCAGCTACCTAGTGGACA





GTTGGGAGTTTCCAATTGGTTGGTTTTGAATTTTTACCCATGTTGAGT





TGTCCTTGCTTCTCCTTGCAAACAATGCAAGTTGATAAGACATCACCT





TCCAAGATAGGCTATTTTTGTCGCATAAATTTTTGTCTCGGAGTGAAA





ACCCCTTTTATGTGAACAGATTACAGAAGCGTCCTACCCTTCACCGGT





TGAGATGGGGAGAAAATTAAGCGATGAGGAGACGATTATTGGTATAAA





AGAAGCAACCAAAATCCCTTATTGTCCTTTTCTGATCAGCATCAAAGA





ATATTGTCTTAAAACGGGCTTTTAACTACATTGTTCTTACACATTGCA





AACCTCTTCCTTCTATTTCGGATCAACTGTATTGACTACATTGATCTT





TTTTAACGAAGTTTACGACTTACTAAATCCCCACAAACAAATCAACTG





AGAAAA





SEQ ID NO: 73


ATATAAATACATCGCTCAAAGCTCCAACTTTGCTTGTTCATACAATTC





TTGATATTCACAATGTCTACCGAAGGTCAAGTAAGTTCAATCAAAGTA





ATTGTTTGGGAGGGAAGAAGATTGTTTTATTGCGAACCTTTCAATATC





TTACCCGACTAAATAACCATTACAGTGAATTTTTTACTAACTATATAG





ATCATCAAATGTAAGGCAGCTGTTGCCTGGGAGGCAGGAAAGGATCTC





TCTATTGAGGAGATTGAGGTTCTTCCTCCAAGAGCCCATGAAGTTAGA





GTGAAAGTGGAATTCACTGGTGTATGCCACACTGATGCTTACACGCTT





TCTGGTGCAGATGCAGAGGGAAGTTTCCCTGTTGTGTTCGGCCATGAA





GGTGCTGGTGTTGTCGAGTCAGTTGGAGAAGGTGTTGAGTCCGTGAAG





GTTGGGGATTCTGTAGTGCTTCTGTACACTCCTGAGTGCAGAGAGTGC





AAGTTCTGTCTGTCTGGTAAGACGAACCTCTGTGGTAAAATCAGAGCC





ACCCAGGGTAAAGGTTTGTTACCAGACGGGACTTCTCGTTTCCGTTGT





AAGGGCAAGGATTTGTTTCACTATATGGGATGTTCTTCCTTTTCTCAA





TACACTGTGGTGGCTGACATCTCAGTGGTTAAAGTCCAAGACGAAGCT





CCTAAGGACAAGACATGTCTGTTGGGTTGTGGTGTTACCACAGGGTAC





GGTGCTGCTATCAACACTGCTAAGATCTCTAAGGGTGACAAGATCGGT





GTGTTTGGTGCTGGATGTATTGGATTATCTGTCATCCAAGGTGCAGTT





TCCAAAGGTGCAAGCGAGATTATTGTAATTGACATCAATGATTCAAAG





AAGGCATGGGCGGACCAATTTGGTGCAACTAAGTTTGTCAATCCTACA





ACCTTACCAGAAGGTACCAATATTGTTGACTACTTGATTGATATCACT





GACGGAGGCTTTGACTATACCTTCGACTGTACCGGTAATGTTCAAGTA





ATGAGAAATGCA





SEQ ID NO: 74


ATTCTGGCCCTTGCACCTGATCGCGAAGGTGGAAATGGCAGAAGGATC





AGCCTGGACGAAGCAACCAGTTCCAACTGCTAAGTAAAGAAGATGCTA





GACGAAGGAGACTTCAGAGGTGAAAAGTTTGCAAGAAGAGAGCTGCGG





GAAATAAATTTTCAATTTAAGGACTTGAGTGCGTCCATATTCGTGTAC





GTGTCCAACTGTTTTCCATTACCTAAGAAAAACATAAAGATTAAAAAG





ATAAACCCAATCGGGAAACTTTAGCGTGCCGTTTCGGATTCCGAAAAA





CTTTTGGAGCGCCAGATGACTATGGAAAGAGGAGTGTACCAAAATGGC





AAGTCGGGGGCTACTCACCGGATAGCCAATACATTCTCTAGGAACCAG





GGATGAATCCAGGTTTTTGTTGTCACGGTAGGTCAAGCATTCACTTCT





TAGGAATATCTCGTTGAAAGCTACTTGAAATCCCATTGGGTGCGGAAC





CAGCTTCTAATTAAATAGTTCGATGATGTTCTCTAAGTGGGACTCTAC





GGCTCAAACTTCTACACAGCATCATCTTAGTAGTCCCTTCCCAAAACA





CCATTCTAGGTTTCGGAACGTAACGAAACAATGTTCCTCTCTTCACAT





TGGGCCGTTACTCTAGCCTTCCGAAGAACCAATAAAAGGGACCGGCTG





AAACGGGTGTGGAAACTCCTGTCCAGTTTATGGCAAAGGCTACAGAAA





TCCCAATCTTGTCGGGATGTTGCTCCTCCCAAACGCCATATTGTACTG





CAGTTGGTGCGCATTTTAGGGAAAATTTACCCCAGATGTCCTGATTTT





CGAGGGCTACCCCCAACTCCCTGTGCTTATACTTAGTCTAATTCTATT





CAGTGTGCTGACCTACACGTAATGATGTCGTAACCCAGTTAAATGGCC





GAAAAACTATTTAAGTAAGTTTATTTCTCCTCCAGATGAGACTCTCCT





TCTTTTCTCCGCTAGTTATCAAACTATAAACCTATTTTACCTCAAATA





CCTCCAACATCACCCACTTAAACA





SEQ ID NO: 75


ATCTTCATTGATGAAACGTTGTGATCGGTGTGACTTTTATAGTAAAAG





CTACAACTGTTTGAAATACCAAGATATCATTGTGAATGGCTCAAAAGG





GTAATACATCTGAAAAACCTGAAGTGTGGAAAATTCCGATGGAGCCAA





CTCATGATAACGCAGAAGTCCCATTTTGCCATCTTCTCTTGGTATGAA





ACGGTAGAAAATGATCCGAGTATGCCAATTGATACTCTTGATTCATGC





CCTATAGTTTGCGTAGGGTTTAATTGATCTCCTGGTCTATCGATCTGG





GACGCAATGTAGACCCCATTAGTGGAAACACTGAAAGGGATCCAACAC





TCTAGGCGGACCCGCTCACAGTCATTTCAGGACAATCACCACAGGAAT





CAACTACTTCTCCCAGTCTTCCTTGCGTGAAGCTTCAAGCCTACAACA





TAACACTTCTTACTTAATCTTTGATTCTCGAATTGTTTACCCAATCTT





GACAACTTAGCCTAAGCAATACTCTGGGGTTATATATAGCAATTGCTC





TTCCTCGCTGTAGCGTTCATTCCATCTTTCTA





SEQ ID NO: 76


TGTCATCTGCTGATGCTGTGAGGGAGAAAGAAGTAGGGGTGATACATG





GTTTATAGGCAAAGCATGTTTGTTTCAGATCAAAGATTAGCGTTTCAA





AGTTGTGGAAAAGTGACCATGCAACAATATGCAACACATTCGGATTAT





CTGATAAGTTTCAAAGCTACTAAGTAAGCCCGTTTCAAGTCTCCAGAC





CGACATCTGCCATCCAGTGATTTTCTTAGTCCTGAAAAATACGATGTG





TAAACATAAACCACAAAGATCGGCCTCCGAGGTTGAACCCTTACGAAA





GAGACATCTGGTAGCGCCAATGCCAAAAAAAAATCACACCAGAAGGAC





AATTCCCTTCCCCCCCAGCCCATTAAAGCTTACCATTTCCTATTCCAA





TACGTTCCATAGAGGGCATCGCTCGGCTCATTTTCGCGTGGGTCATAC





TAGAGCGGCTAGCTAGTCGGCTGTTTGAGCTCTCTAATCGAGGGGTAA





GGATGTCTAATATGTCATAATGGCTCACTATATAAAGAACCCGCTTGC





TCAACCTTCGACTCCTTTCCCGATCCTTTGCTTGTTGCTTCTTCTTTT





ATAACAGGAAACAAAGGAATTTATACACTTTAA





SEQ ID NO: 77


AAATTAATCCATAAGATAAGGCAAATGTGCTTAAGTAATTGAAAACAG





TGTTGTGATTATATAAGCATGGTATTTGAATAGAACTACTGGGGTTAA





CTTATCTAGTAGGATGGAAGTTGAGGGAGATCAAGATGCTTAAAGAAA





AGGATTGGCCAATATGAAAGCCATAATTAGCAATACTTATTTAATCAG





ATAATTGTGGGGCATTGTGACTTGACTTTTACCAGGACTTCAAACCTC





AACCATTTAAACAGTTATAGAAGACGTACCGTCACTTTTGCTTTTAAT





GTGATCTAAATGTGATCACATGAACTCAAACTAAAATGATATCTTTTA





CTGGACAAAAATGTTATCCTGCAAACAGAAAGCTTTCTTCTATTCTAA





GAAGAACATTTACATTGGTGGGAAACCTGAAAACAGAAAATAAATACT





CCCCAGTGACCCTATGAGCAGGATTTTTGCATCCCTATTGTAGGCCTT





TCAAACTCACACCTAATATTTCCCGCCACTCACACTATCAATGATCAC





TTCCCAGTTCTCTTCTTCCCCTATTCGTACCATGCAACCCTTACACGC





CTTTTCCATTTCGGTTCGGATGCGACTTCCAGTCTGTGGGGTACGTAG





CCTATTCTCTTAGCCGGTATTTAAACATACAAATTCACCCAAATTCTA





CCTTGATAAGGTAATTGATTAATTTCATAAAT





SEQ ID NO: 78


TCCAAACCAAACGGTCTAGCAAAAACGATAACTTTAAAGAACTTTTCA





ATTGGTTTTGTACACTACCACCGGTTTACTACCTCTGCCTTCGGTTCT





TCTCCTCACATTTTTCGCAACTGGGATAGCGTAGCCTAAAGTGTCACA





TGCTCGCTGCTCACATTCCCTACACAACAGAGATTGTCAGCAGAGGAA





ATTGAGCTCCACCATTCAACACTTGTGGATTTATGATAGTCTGTGCTA





TCAGCTCTCTTTTTTTTGTTGCTGTAGAATTTACCGTGCTAGCAACCT





TTTAAACTTTGTTTAGCTCTCCTTCCCTCTTCCATTCATCTGTTTCGG





TCCGATCCGTCTCTGGTCATCTCCTCCGCATTTTTTTTTTACCGTTAG





CGATAGGGGTCAGATCAATTCAATCAGTTTTGGCAAGGGTATTTAAAG





GTGGCGAAATCCCCCTCCGTTTGTTGAACACATCCAACTATTCTCAAC





CCAACCATCTAACTAATCGTA





SEQ ID NO: 79


CGCAGCGTTTTCTGACGGTACTAGAGGACTCTTAGGGGAAGGTAGAAT





CAATAAAGATCATATTAGGTAAGCAAATTTTGGATGGAATAGGAGACT





AGGTGTGGATGCGCGATCTCGCCAAATTGCACGACCAGAGTGGATGCC





GGATGGTGGTAAACCGTTTCTTCCTTTTTACCACCCAAGTGCGAGTGA





AACACCCCATGGCTGCTCTCCGATTGCCCCTCTACAGGCATAAGGGTG





TGACTTTGTGGGCTTGAATTTTACACCCCCTCCAACTTTTCTCGCATC





AATTGATCCTGTTACCAATATTGCATGCCCGGAGGAGACTTGCCCCCT





AATTTCGCGGCGTCGTCCCGGATCGCAGGGTGAGACTGTAGAGACCCC





ACATAGTGACAATGATTATGTAAGAAGAGGGGGGTGATTCGGCCGGCT





ATCGAACTCTAACAACTAGGGGGGTGAACAATGCCCAGCAGTCCTCCC





CACTCTTTGACAAATCAGTATCACCGATTAACACCCCAAATCTTATTC





TCAACGGTCCCTCATCCTTGCACCCCTCTTTGGACAAATGGCAGTTAG





CATTGGTGCACTGACTGACTGCCCAACCTTAAACCCAAATTTCTTAGA





AGGGGCCCATCTAGTTAGCGAGGGGTGAAAAATTCCTCCATCGGAGAT





GTATTGACCGTAAGTTGCTGCTTAAAAAAAATCAGTTCAGATAGCGAG





ACTTTTTTGATTTCGCAACGGGAGTGCCTGTTCCATTCGATTGCAATT





CTCACCCCTTCTGCCCAGTCCTGCCAATTGCCCATGAATCTGCTAATT





TCGTTGATTCCCACCCCCCTTTCCAACTCCACAAATTGTCCAATCTCG





TTTTCCATTTGGGAGAATCTGCATGTCGACTACATAAAGCGACCGGTG





TCCGAAAAGATCTGTGTAGTTTTCAACATTTTGTGCTCCCCCCGCTGT





TTGAAAACGGGGGTGAGCGCTCTCCGGGGTGCGAATTCGTGCCCAATT





CCTTTCACCCTGCCTATTGTAGACGTCAACCCGCATCTGGTGCGAATA





TAGCGCACCCCCAATGATCACACCAACAATTGGTCCACCCCTCCCCAA





TCTCTAATATTCACAATTCACCTCACTATAAATACCCCTGTCCTGCTC





CCAAATTCTTTTTTCCTTCTTCCATCAGCTACTAGCTTTTATCTTATT





TACTTTACGAAA





SEQ ID NO: 80


TGTGTCTGTAAATTTAAATCCAAAGAAAAATAAAGGGCACTACATAGA





ACAACACAATCAACCTACATAGCTCTCTTTTTTTTTTTTTTTTTTTTT





TTCTTTTTTGTTTTTTCTCAATTTTCTTCATTTTTTGCATTGTTACTA





CTCTCCCATTAAGGGAAATTCTACCAGGAAAACCGAAGAAGACAAATG





CAAAAGGGAAATATGAATACGCATGTATGCGCGCAAAACCGCACTTAC





AGAGGGCATTAGGACATT





SEQ ID NO: 81


AGATCTAACATCCAAAGACGAAAGGTTGAATGAAACCTTTTTGCCATC





CGACATCCACAGGTCCATTCTCACACATAAGTGCCAAACGCAACAGGA





GGGGATACACTAGCAGCAGACCGTTGCAAACGCAGGACCTCCACTCCT





CTTCTCCTCAACACCCACTTTTGCCATCGAAAAACCAGCCCAGTTATT





GGGCTTGATTGGAGCTCGCTCATTCCAATTCCTTCTATTAGGCTACTA





ACACCATGACTTTATTAGCCTGTCTATCCTGGCCCCCCTGGCGAGGTT





CATGTTTGTTTATTTCCGAATGCAACAAGCTCCGCATTACACCCGAAC





ATCACTCCAGATGAGGGCTTTCTGAGTGTGGGGTCAAATAGTTTCATG





TTCCCCAAATGGCCCAAAACTGACAGTTTAAACGCTGTCTTGGAACCT





AATATGACAAAAGCGTGATCTCATCCAAGATGAACTAAGTTTGGTTCG





TTGAAATGCTAACGGCCAGTTGGTCAAAAAGAAACTTCCAAAAGTCGG





CATACCGTTTGTCTTGTTTGGTATTGATTGACGAATGCTCAAAAATAA





TCTCATTAATGCTTAGCGCAGTCTCTCTATCGCTTCTGAACCCCGGTG





CACCTGTGCCGAAACGCAAATGGGGAAACACCCGCTTTTTGGATGATT





ATGCATTGTCTCCACATTGTATGCTTCCAAGATTCTGGTGGGAATACT





GCTGATAGCCTAACGTTCATGATCAAAATTTAACTGTTCTAACCCCTA





CTTGACAGCAATATATAAACAGAAGGAAGCTGCCCTGTCTTAAACCTT





TTTTTTTATCATCATTATTAGCTTACTTTCATAATTGCGACTGGTTCC





AATTGACAAGCTTTTGATTTTAACGACTTTTAACGACAACTTGAGAAG





ATCAAAAAACAACTAATTATTCGAAACGGAATTCGCTGGATCCTACGT





AGAATTCCCTAGGGCGGCCGCGAATTAATTCGCCTTAGACATGACTGT





TCCTCAGTTCAAGTTGGGCACTTACGAGAAGACCGGTCTTGCTAGATT





CTAATCAAGAGGATGTCAGAATGCCATTTGCCTGAGAGATGCAGGCTT





CATTTTTGATACTTTTTTATTTGTAACCTATATAGTATAGGATTTTTT





TTGTCATTTTGTTTCTTCTCGTACGAGCTTGCTCCTGATCAGCCTATC





TCGCAGCTGATGAATATCTTGTGGTAGGGGTTTGGGAAAATCATTCGA





GTTTGATGTTTTTCTTGGTATTTCCCACTCCTCTTCAGAGTACAGAAG





ATTAAGTGAGAAGTTCGTTTGTGCAAGCTTATCGATAAGCTTTAATGC





GGTAGTTTATCACAGTTAAATTGCTAACGCAGTCAGGCACCGTGTATG





AAATCTAACAATGCGCTCATCGTCATCCTCGGCACCGTCACCCTGGAT





GCTGTAGGCATAGGCTTGGTTATGCCGGTACTGCCGGGCCTCTTGCGG





GATATCGTCCATTCCGACAGCATCGCCAGTCACTATGGCGTGCTGCTA





GCGCTATATGCGTTGATGCAATTTCTATGCGCACCCGTTCTCGGAGCA





CTGTCCGACCGCTTTGGCCGCCGCCCAGTCCTGCTCGCTTCGCTACTT





GGAGCCACTATCGACTACGCGATCATGGCGACCACACCCGTCCTGTGG





ATCTATCGAATCTAAATGTAAGTTAAAATCTCTAAATAATTAAATAAG





TCCCAGTTTCTCCATACGAACCTTAACAGCATTGCGGTGAGCATCTAG





ACCTTCAACAGCAGCCAGATCCATCACTGCTTGGCCAATATGTTTCAG





TCCCTCAGGAGTTACGTCTTGTGAAGTGATGAACTTCTGGAAGGTTGC





AGTGTTAACTCCGCTGTATTGACGGGCATATCCGTACGTTGGCAAAGT





GTGGTTGGTACCGGAGGAGTAATCTCCACAACTCTCTGGAGAGTAGGC





ACCAACAAACACAGATCCAGCGTGTTGTACTTGATCAACATAAGAAGA





AGCATTCTCGATTTGCAGGATCAAGTGTTCAGGAGCGTACTGATTGGA





CATTTCCAAAGCCTGCTCGTAGGTTGCAACCGATAGGGTTGTAGAGTG





TGCAATACACTTGCGTACAATTTCAACCCTTGGCAACTGCACAGCTTG





GTTGTGAACAGCATCTTCAATTCTGGCAAGCTCCTTGTCTGTCATATC





GACAGCCAACAGAATCACCTGGGAATCAATACCATGTTCAGCTTGAGA





CAGAAGGTCTGAGGCAACGAAATCTGGATCAGCGTATTTATCAGCAAT





AACTAGAACTTCAGAAGGCCCAGCAGGCATGTCAATACTACACAGGGC





TGATGTGTCATTTTGAACCATCATCTTGGCAGCAGTAACGAACTGGTT





TCCTGGACCAAATATTTTGTCACACTTAGGAACAGTTTCTGTTCCGTA





AGCCATAGCAGCTACTGCCTGGGCGCCTCCTGCTAGCACGATACACTT





AGCACCAACCTTGTGGGCAACGTAGATGACTTCTGGGGTAAGGGTACC





ATCCTTCTTAGGTGGAGATGCAAAAACAATTTCTTTGCAACCAGCAAC





TTTGGCAGGAACACCCAGCATCAGGGAAGTGGAAGGCAGAATTGCGGT





TCCACCAGGAATATAGAGGCCAACTTTCTCAATAGGTCTTGCAAAACG





AGAGCAGACTACACCAGGGCAAGTCTCAACTTGCAACGTCTCCGTTAG





TTGAGCTTCATGGAATTTCCTGACGTTATCTATAGAGAGATCAATGGC





TCTCTTAACGTTATCTGGCAATTGCATAAGTTCCTCTGGGAAAGGAGC





TTCTAACACAGGTGTCTTCAAAGCGACTCCATCAAACTTGGCAGTTAG





TTCTAAAAGGGCTTTGTCACCATTTTGACGAACATTGTCGACAATTGG





TTTGACTAATTCCATAATCTGTTCCGTTTTCTGGATAGGACGACGAAG





GGCATCTTCAATTTCTTGTGAGGAGGCCTTAGAAACGTCAATTTTGCA





CAATTCAATACGACCTTCAGAAGGGACTTCTTTAGGTTTGGATTCTTC





TTTAGGTTGTTCCTTGGTGTATCCTGGCTTGGCATCTCCTTTCCTTCT





AGTGACCTTTAGGGACTTCATATCCAGGTTTCTCTCCACCTCGTCCAA





CGTCACACCGTACTTGGCACATCTAACTAATGCAAAATAAAATAAGTC





AGCACATTCCCAGGCTATATCTTCCTTGGATTTAGCTTCTGCAAGTTC





ATCAGCTTCCTCCCTAATTTTAGCGTTCAACAAAACTTCGTCGTCAAA





TAACCGTTTGGTATAAGAACCTTCTGGAGCATTGCTCTTACGATCCCA





CAAGGTGGCTTCCATGGCTCTAAGACCCTTTGATTGGCCAAAACAGGA





AGTGCGTTCCAAGTGACAGAAACCAACACCTGTTTGTTCAACCACAAA





TTTCAAGCAGTCTCCATCACAATCCAATTCGATACCCAGCAACTTTTG





AGTTGCTCCAGATGTAGCACCTTTATACCACAAACCGTGACGACGAGA





TTGGTAGACTCCAGTTTGTGTCCTTATAGCCTCCGGAATAGACTTTTT





GGACGAGTACACCAGGCCCAACGAGTAATTAGAAGAGTCAGCCACCAA





AGTAGTGAATAGACCATCGGGGCGGTCAGTAGTCAAAGACGCCAACAA





AATTTCACTGACAGGGAACTTTTTGACATCTTCAGAAAGTTCGTATTC





AGTAGTCAATTGCCGAGCATCAATAATGGGGATTATACCAGAAGCAAC





AGTGGAAGTCACATCTACCAACTTTGCGGTCTCAGAAAAAGCATAAAC





AGTTCTACTACCGCCATTAGTGAAACTTTTCAAATCGCCCAGTGGAGA





AGAAAAAGGCACAGCGATACTAGCATTAGCGGGCAAGGATGCAACTTT





ATCAACCAGGGTCCTATAGATAACCCTAGCGCCTGGGATCATCCTTTG





GACAACTCTTTCTGCCAAATCTAGGTCCAAAATCACTTCATTGATACC





ATTATTGTACAACTTGAGCAAGTTGTCGATCAGCTCCTCAAATTGGTC





CTCTGTAACGGATGACTCAACTTGCACATTAACTTGAAGCTCAGTCGA





TTGAGTGAACTTGATCAGGTTGTGCAGCTGGTCAGCAGCATAGGGAAA





CACGGCTTTTCCTACCAAACTCAAGGAATTATCAAACTCTGCAACACT





TGCGTATGCAGGTAGCAAGGGAAATGTCATACTTGAAGTCGGACAGTG





AGTGTAGTCTTGAGAAATTCTGAAGCCGTATTTTTATTATCAGTGAGT





CAGTCATCAGGAGATCCTCTACGCCGGACGCATCGTGGCCGACCTGCA





GGGGGGGGGGGGGCGCTGAGGTCTGCCTCGTGAAGAAGGTGTTGCTGA





CTCATACCAGGCCTGAATCGCCCCATCATCCAGCCAGAAAGTGAGGGA





GCCACGGTTGATGAGAGCTTTGTTGTAGGTGGACCAGTTGGTGATTTT





GAACTTTTGCTTTGCCACGGAACGGTCTGCGTTGTCGGGAAGATGCGT





GATCTGATCCTTCAACTCAGCAAAAGTTCGATTTATTCAACAAAGCCG





CCGTCCCGTCAAGTCAGCGTAATGCTCTGCCAGTGTTACAACCAATTA





ACCAATTCTGATTAGAAAAACTCATCGAGCATCAAATGAAACTGCAAT





TTATTCATATCAGGATTATCAATACCATATTTTTGAAAAAGCCGTTTC





TGTAATGAAGGAGAAAACTCACCGAGGCAGTTCCATAGGATGGCAAGA





TCCTGGTATCGGTCTGCGATTCCGACTCGTCCAACATCAATACAACCT





ATTAATTTCCCCTCGTCAAAAATAAGGTTATCAAGTGAGAAATCACCA





TGAGTGACGACTGAATCCGGTGAGAATGGCAAAAGCTTATGCATTTCT





TTCCAGACTTGTTCAACAGGCCAGCCATTACGCTCGTCATCAAAATCA





CTCGCATCAACCAAACCGTTATTCATTCGTGATTGCGCCTGAGCGAGA





CGAAATACGCGATCGCTGTTAAAAGGACAATTACAAACAGGAATCGAA





TGCAACCGGCGCAGGAACACTGCCAGCGCATCAACAATATTTTCACCT





GAATCAGGATATTCTTCTAATACCTGGAATGCTGTTTTCCCGGGGATC





GCAGTGGTGAGTAACCATGCATCATCAGGAGTACGGATAAAATGCTTG





ATGGTCGGAAGAGGCATAAATTCCGTCAGCCAGTTTAGTCTGACCATC





TCATCTGTAACATCATTGGCAACGCTACCTTTGCCATGTTTCAGAAAC





AACTCTGGCGCATCGGGCTTCCCATACAATCGATAGATTGTCGCACCT





GATTGCCCGACATTATCGCGAGCCCATTTATACCCATATAAATCAGCA





TCCATGTTGGAATTTAATCGCGGCCTCGAGCAAGACGTTTCCCGTTGA





ATATGGCTCATAACACCCCTTGTATTACTGTTTATGTAAGCAGACAGT





TTTATTGTTCATGATGATATATTTTTATCTTGTGCAATGTAACATCAG





AGATTTTGAGACACAACGTGGCTTTCCCCCCCCCCCCTGCAGGTCGGC





ATCACCGGCGCCACAGGTGCGGTTGCTGGCGCCTATATCGCCGACATC





ACCGATGGGGAAGATCGGGCTCGCCACTTCGGGCTCATGAGCGCTTGT





TTCGGCGTGGGTATGGTGGCAGGCCCCGTGGCCGGGGGACTGTTGGGC





GCCATCTCCTTGCATGCACCATTCCTTGCGGCGGCGGTGCTCAACGGC





CTCAACCTACTACTGGGCTGCTTCCTAATGCAGGAGTCGCATAAGGGA





GAGCGTCGAGTATCTATGATTGGAAGTATGGGAATGGTGATACCCGCA





TTCTTCAGTGTCTTGAGGTCTCCTATCAGATTATGCCCAACTAAAGCA





ACCGGAGGAGGAGATTTCATGGTAAATTTCTCTGACTTTTGGTCATCA





GTAGACTCGAACTGTGAGACTATCTCGGTTATGACAGCAGAAATGTCC





TTCTTGGAGACAGTAAATGAAGTCCCACCAATAAAGAAATCCTTGTTA





TCAGGAACAAACTTCTTGTTTCGAACTTTTTCGGTGCCTTGAACTATA





AAATGTAGAGTGGATATGTCGGGTAGGAATGGAGCGGGCAAATGCTTA





CCTTCTGGACCTTCAAGAGGTATGTAGGGTTTGTAGATACTGATGCCA





ACTTCAGTGACAACGTTGCTATTTCGTTCAAACCATTCCGAATCCAGA





GAAATCAAAGTTGTTTGTCTACTATTGATCCAAGCCAGTGCGGTCTTG





AAACTGACAATAGTGTGCTCGTGTTTTGAGGTCATCTTTGTATGAATA





AATCTAGTCTTTGATCTAAATAATCTTGACGAGCCAAGGCGATAAATA





CCCAAATCTAAAACTCTTTTAAAACGTTAAAAGGACAAGTATGTCTGC





CTGTATTAAACCCCAAATCAGCTCGTAGTCTGATCCTCATCAACTTGA





GGGGCACTATCTTGTTTTAGAGAAATTTGCGGAGATGCGATATCGAGA





AAAAGGTACGCTGATTTTAAACGTGAAATTTATCTCAAGATCTCTGCC





TCGCGCGTTTCGGTGATGACGGTGAAAACCTCTGACACATGCAGCTCC





CGGAGACGGTCACAGCTTGTCTGTAAGCGGATGCCGGGAGCAGACAAG





CCCGTCAGGGCGCGTCAGCGGGTGTTGGCGGGTGTCGGGGCGCAGCCA





TGACCCAGTCACGTAGCGATAGCGGAGTGTATACTGGCTTAACTATGC





GGCATCAGAGCAGATTGTACTGAGAGTGCACCATATGCGGTGTGAAAT





ACCGCACAGATGCGTAAGGAGAAAATACCGCATCAGGCGCTCTTCCGC





TTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGC





GGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGG





GGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAG





GAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCC





CCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAA





CCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCT





CGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGC





CTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCAATGCTCACGCTGTAG





GTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCA





CGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCG





TCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGC





CACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGA





GTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATT





TGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGG





TAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTTTT





TGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGA





TCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTC





ACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTA





GATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATA





TGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACC





TATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCC





CGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAG





TGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATC





AGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGC





AACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAG





AGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGC





TGCAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAG





CTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTG





CAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAA





GTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTC





TCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTA





CTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTC





TTGCCCGGCGTCAACACGGGATAATACCGCGCCACATAGCAGAACTTT





AAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAG





GATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACC





CAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGC





AAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACG





GAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCAT





TTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTA





GAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCC





ACCTGACGTCTAAGAAACCATTATTATCATGACATTAACCTATAAAAA





TAGGCGTATCACGAGGCCCTTTCGTCTTCAAGAATTAATTCTCATGTT





TGACAGCTTATCATCGATAAGCTGACTCATGTTGGTATTGTGAAATAG





ACGCAGATCGGGAACACTGAAAAATAACAGTTATTATTCG





SEQ ID NO: 82


AGATCTAACATCCAAAGACGAAAGGTTGAATGAAACCTTTTTGCCATC





CGACATCCACAGGTCCATTCTCACACATAAGTGCCAAACGCAACAGGA





GGGGATACACTAGCAGCAGACCGTTGCAAACGCAGGACCTCCACTCCT





CTTCTCCTCAACACCCACTTTTGCCATCGAAAAACCAGCCCAGTTATT





GGGCTTGATTGGAGCTCGCTCATTCCAATTCCTTCTATTAGGCTACTA





ACACCATGACTTTATTAGCCTGTCTATCCTGGCCCCCCTGGCGAGGTT





CATGTTTGTTTATTTCCGAATGCAACAAGCTCCGCATTACACCCGAAC





ATCACTCCAGATGAGGGCTTTCTGAGTGTGGGGTCAAATAGTTTCATG





TTCCCCAAATGGCCCAAAACTGACAGTTTAAACGCTGTCTTGGAACCT





AATATGACAAAAGCGTGATCTCATCCAAGATGAACTAAGTTTGGTTCG





TTGAAATGCTAACGGCCAGTTGGTCAAAAAGAAACTTCCAAAAGTCGG





CATACCGTTTGTCTTGTTTGGTATTGATTGACGAATGCTCAAAAATAA





TCTCATTAATGCTTAGCGCAGTCTCTCTATCGCTTCTGAACCCCGGTG





CACCTGTGCCGAAACGCAAATGGGGAAACACCCGCTTTTTGGATGATT





ATGCATTGTCTCCACATTGTATGCTTCCAAGATTCTGGTGGGAATACT





GCTGATAGCCTAACGTTCATGATCAAAATTTAACTGTTCTAACCCCTA





CTTGACAGCAATATATAAACAGAAGGAAGCTGCCCTGTCTTAAACCTT





TTTTTTTATCATCATTATTAGCTTACTTTCATAATTGCGACTGGTTCC





AATTGACAAGCTTTTGATTTTAACGACTTTTAACGACAACTTGAGAAG





ATCAAAAAACAACTAATTATTCGAAGGATCTGAATTCATAGCTAGCTT





GGAGAAGAGGCCTTAATTAACTGGGCCGCGAATTAATTCGCCTTAGAC





ATGACTGTTCCTCAGTTCAAGTTGGGCACTTACGAGAAGACCGGTCTT





GCTAGATTCTAATCAAGAGGATGTCAGAATGCCATTTGCCTGAGAGAT





GCAGGCTTCATTTTTGATACTTTTTTATTTGTAACCTATATAGTATAG





GATTTTTTTTGTCATTTTGTTTCTTCTCGTACGAGCTTGCTCCTGATC





AGCCTATCTCGCAGCTGATGAATATCTTGTGGTAGGGGTTTGGGAAAA





TCATTCGAGTTTGATGTTTTTCTTGGTATTTCCCACTCCTCTTCAGAG





TACAGAAGATTAAGTGAGACGTTCGTTTGTGCAAGCTTATCGATAAGC





TTTAATGCGGTAGTTTATCACAGTTAAATTGCTAACGCAGTCAGGCAC





CGTGTATGAAATCTAACAATGCGCTCATCGTCATCCTCGGCACCGTCA





CCCTGGATGCTGTAGGCATAGGCTTGGTTATGCCGGTACTGCCGGGCC





TCTTGCGGGATATCGTCCATTCCGACAGCATCGCCAGTCACTATGGCG





TGCTGCTAGGTACAACTTGAGCAAGTTGTCGATCAGCTCCTCAAATTG





GTCCTCTGTAACGGATGACTCAACTTGCACATTAACTTGAAGCTCAGT





CGATTGAGTGAACTTGATCAGGTTGTGCAGCTGGTCAGCAGCATAGGG





AAACACGGCTTTTCCTACCAAACTCAAGGAATTATCAAACTCTGCAAC





ACTTGCGTATGCAGGTAGCAAGGGAAATGTCATACTTGAAGTCGGACA





GTGAGTGTAGTCTTGAGAAATTCTGAAGCCGTATTTTTATTATCAGTG





AGTCAGTCATCAGGAGATCCTCTACGCCGGACGCATCGTGGCCGACCT





GCAGGGGGGGGGGGGGCGCTGAGGTCTGCCTCGTGAAGAAGGTGTTGC





TGACTCATACCAGCTGAATCGCCCCATCATCCAGCCAGAAAGTGAGGG





AGCCACGGTTGATGAGAGCTTTGTTGTAGGTGGACCAGTTGGTGATTT





TGAACTTTTGCTTTGCCACGGAACGGTCTGCGTTGTCGGGAAGATGCG





TGATCTGATCCTTCAACTCAGCAAAAGTTCGATTTATTCAACAAAGCC





GCCGTCCCGTCAAGTCAGCGTAATGCTCTGCCAGTGTTACAACCAATT





AACCAATTCTGATTAGAAAAACTCATCGAGCATCAAATGAAACTGCAA





TTTATTCATATCAGGATTATCAATACCATATTTTTGAAAAAGCCGTTT





CTGTAATGAAGGAGAAAACTCACCGAGGCAGTTCCATAGGATGGCAAG





ATCCTGGTATCGGTCTGCGATTCCGACTCGTCCAACATCAATACAACC





TATTAATTTCCCCTCGTCAAAAATAAGGTTATCAAGTGAGAAATCACC





ATGAGTGACGACTGAATCCGGTGAGAATGGCAAAAGCTTATGCATTTC





TTTCCAGACTTGTTCAACAGGCCAGCCATTACGCTCGTCATCAAAATC





ACTCGCATCAACCAAACCGTTATTCATTCGTGATTGCGCCTGAGCGAG





ACGAAATACGCGATCGCTGTTAAAAGGACAATTACAAACAGGAATCGA





ATGCAACCGGCGCAGGAACACTGCCAGCGCATCAACAATATTTTCACC





TGAATCAGGATATTCTTCTAATACCTGGAATGCTGTTTTCCCGGGGAT





CGCAGTGGTGAGTAACCATGCATCATCAGGAGTACGGATAAAATGCTT





GATGGTCGGAAGAGGCATAAATTCCGTCAGCCAGTTTAGTCTGACCAT





CTCATCTGTAACATCATTGGCAACGCTACCTTTGCCATGTTTCAGAAA





CAACTCTGGCGCATCGGGCTTCCCATACAATCGATAGATTGTCGCACC





TGATTGCCCGACATTATCGCGAGCCCATTTATACCCATATAAATCAGC





ATCCATGTTGGAATTTAATCGCGGCCTCGAGCAAGACGTTTCCCGTTG





AATATGGCTCATAACACCCCTTGTATTACTGTTTATGTAAGCAGACAG





TTTTATTGTTCATGATGATATATTTTTATCTTGTGCAATGTAACATCA





GAGATTTTGAGACACAACGTGGCTTTCCCCCCCCCCCCTGCAGGTCGG





CATCACCGGCGCCACAGGTGCGGTTGCTGGCGCCTATATCGCCGACAT





CACCGATGGGGAAGATCGGGCTCGCCACTTCGGGCTCATGAGCGCTTG





TTTCGGCGTGGGTATGGTGGCAGGCCCCGTGGCCGGGGGACTGTTGGG





CGCCATCTCCTTGCATGCACCATTCCTTGCGGCGGCGGTGCTCAACGG





CCTCAACCTACTACTGGGCTGCTTCCTAATGCAGGAGTCGCATAAGGG





AGAGCGTCGAGTATCTATGATTGGAAGTATGGGAATGGTGATACCCGC





ATTCTTCAGTGTCTTGAGGTCTCCTATCAGATTATGCCCAACTAAAGC





AACCGGAGGAGGAGATTTCATGGTAAATTTCTCTGACTTTTGGTCATC





AGTAGACTCGAACTGTGAGACTATCTCGGTTATGACAGCAGAAATGTC





CTTCTTGGAGACAGTAAATGAAGTCCCACCAATAAAGAAATCCTTGTT





ATCAGGAACAAACTTCTTGTTTCGAACTTTTTCGGTGCCTTGAACTAT





AAAATGTAGAGTGGATATGTCGGGTAGGAATGGAGCGGGCAAATGCTT





ACCTTCTGGACCTTCAAGAGGTATGTAGGGTTTGTAGATACTGATGCC





AACTTCAGTGACAACGTTGCTATTTCGTTCAAACCATTCCGAATCCAG





AGAAATCAAAGTTGTTTGTCTACTATTGATCCAAGCCAGTGCGGTCTT





GAAACTGACAATAGTGTGCTCGTGTTTTGAGGTCATCTTTGTATGAAT





AAATCTAGTCTTTGATCTAAATAATCTTGACGAGCCAAGGCGATAAAT





ACCCAAATCTAAAACTCTTTTAAAACGTTAAAAGGACAAGTATGTCTG





CCTGTATTAAACCCCAAATCAGCTCGTAGTCTGATCCTCATCAACTTG





AGGGGCACTATCTTGTTTTAGAGAAATTTGCGGAGATGCGATATCGAG





AAAAAGGTACGCTGATTTTAAACGTGAAATTTATCTCAAGATCTCTGC





CTCGCGCGTTTCGGTGATGACGGTGAAAACCTCTGACACATGCAGCTC





CCGGAGACGGTCACAGCTTGTCTGTAAGCGGATGCCGGGAGCAGACAA





GCCCGTCAGGGCGCGTCAGCGGGTGTTGGCGGGTGTCGGGGCGCAGCC





ATGACCCAGTCACGTAGCGATAGCGGAGTGTATACTGGCTTAACTATG





CGGCATCAGAGCAGATTGTACTGAGAGTGCACCATATGCGGTGTGAAA





TACCGCACAGATGCGTAAGGAGAAAATACCGCATCAGGCGCTCTTCCG





CTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAG





CGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAG





GGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCA





GGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCC





CCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAA





ACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCC





TCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCG





CCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCAATGCTCACGCTGTA





GGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGC





ACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATC





GTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAG





CCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAG





AGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTAT





TTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTG





GTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTTT





TTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAG





ATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACT





CACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCT





AGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATAT





ATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCAC





CTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCC





CCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCA





GTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTAT





CAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTG





CAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTA





GAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTG





CTGCAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCA





GCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGT





GCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTA





AGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATT





CTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGT





ACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCT





CTTGCCCGGCGTCAACACGGGATAATACCGCGCCACATAGCAGAACTT





TAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAA





GGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCAC





CCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAG





CAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACAC





GGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCA





TTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTT





AGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGC





CACCTGACGTCTAAGAAACCATTATTATCATGACATTAACCTATAAAA





ATAGGCGTATCACGAGGCCCTTTCGTCTTCAAGAATTAATTCTCATGT





TTGACAGCTTATCATCGATAAGCTGACTCATGTTGGTATTGTGAAATA





GACGCAGATCGGGAACACTGAAAAATAACAGTTATTATTCG





SEQ ID NO: 83


NPRENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLNSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVEAGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLFNKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSI





FHGGVDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNENTAN





FKKEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHIN





WVRSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLPPHHHHHH





SEQ ID NO: 84


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLNSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVEAGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLENKWONIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSI





FHGGVDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNFNTAN





FKKEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHIN





WVRSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 85


NARENFLKCFSKHIGNRGNVYTQHDQLYMSLLNSTIQNLRFISDTTPK





PLVIVTPSNNAHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQVPFV





VVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFPGGYC





PTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRKSMGE





DLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHGLVKL





FNKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSIFHGG





VDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNENTANFKKE





ILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAGMYVL





YPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHINWVRS





VYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGEKYFG





KNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 86


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLNSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVEAGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLFNKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSI





FHGGVDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNFNTAN





FKKEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHIN





WVRSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 87


NARENFLQCFSAHIPNNVANPKLVYTQHDQLYMSLLNSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEQNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGSVLDRQ





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLENKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSI





FHGGVDSLVDLMNKSFPELGIQSTDCKEFSWIDTTIFYSGVVNFNTAN





FKKEILLDRSAGKKTAFSIKLDYVKOPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHIN





WVRSVYNFTTPYVSQNPRLAYLNYRDLDLGGTNHASPNNYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 88


NARENFLQCFSAHIPNNVANPKLVYTQHDQLYMSLLNSTIONLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEQNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGSVLDRQ





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLENKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSI





FHGGVDSLVDLMNKSFPELGIQSTDSKEFSWIDTTIFYSGVVNFNTAN





FKKEILLDRSAGKKTAFSIKLDYVKQPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHIN





WVRSVYNFTTPYVSQNPRLAYLNYRDLDLGGTNHASPNNYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 89


NARENFLQCFSAHIPNNVANPKLVYTQHDQLYMSLLNSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNLNRISIDVHSQTAWVESGSTLGEVYYWINEQNENLSFP





GGYCPTVGVGGHFSGGGYGALMRKYGLAADNIIDAILVNVDGSVLDRQ





SMGEDLFWAIRGGGGENFGIIYAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLFNKWQNIAYKYDEDLVLMTHFITKNITDNHGKNKTTVHGYFSSI





FHGGVDSLVDLMNKSFPELGIQSTDCKEFSWIDTTIFYSGVVNENTAN





FKKEILLDRSAGKKTAFSIKLDYVKQPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESATPFPHRAGIMYELWYTASWEKQEDNEKHIN





WVRSVYNFTTPYVSQNPRLAYLNYRDLDLGGTNHASPNNYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 90


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLNSTIONLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVEAGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLENKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSI





FHGGVDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNFNTAN





FKKEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHIN





WVRSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 91


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLDSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVEAGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLENKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSI





FHGGVDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNFNTAN





FKKEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHIN





WVRSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 92


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLNSTIONLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVEAGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLFNKWONIAYKYDKDLVLMTHFITKNSGNTVHGYFSSIFHGGVDS





LVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNENTANFKKEILL





DRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAGMYVLYPY





GGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHINWVRSVYN





FTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGEKYFGKNF





NRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 93


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLNSTIONLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVEAGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLFNKWQNIAYKYDKDLVLMTHFITKNSGNTVHGYFSSIFHGGVDS





LVDLMQKSFPELGIKKTDCKEFSWIDTTIFYSGVVNFNTANFKKEILL





DRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAGMYVLYPY





GGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHINWVRSVYN





FTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGEKYFGKNF





NRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 94


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLNSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVEAGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLENKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSI





FHGGVDSLVDLMQKSFPELGIKKTDCKEFSWIDTTIFYSGVVNFNTAN





FKKEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHIN





WVRSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 95


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLNSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVEAGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLFNKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSI





FHGGCDSAVDLMQKSFPELGVKKTDCKEFSWIDTTIFYSGVVNFNTAN





FKKEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHIN





WVRSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 96


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLNSTIONLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVEAGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLFNKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSI





FHGGVDSLVDLMQKSFPELGIKKTDSKEFSWIDTTIFYSGVVNFNTAN





FKKEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHIN





WVRSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 97


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLNSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVEAGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLENKWONIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSI





FHGGVDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNENTAN





FKKEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHIN





WVRSVYQFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 98


NARENFLKCESKHIPNNVANPKLVYTQHDQLYMSLLNSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVEAGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLFNKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSI





FHGGVDSLVDLMQKSFPELGIKKTDSKEFSWIDTTIFYSGVVNFNTAN





FKKEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHIN





WVRSVYQFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 99


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLNSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVEAGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLFNKWQNIAYKYDKDLVLMTHFITKNSGNTVHGYFSSIFHGGVDS





LVDLMQKSFPELGIKKTDSKEFSWIDTTIFYSGVVNFNTANFKKEILL





DRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAGMYVLYPY





GGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHINWVRSVYQ





FTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGEKYFGKNF





NRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 100


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLDSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLFNKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSI





FHGGVDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNFNTAN





FKKEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHIN





WVRSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 101


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLNSTIQNLRFISD





TTPKPLVIVTPSNNAHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLENKWONIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSI





FHGGVDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNFNTAN





FKKEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHIN





WVRSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 102


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLDSTIQNLRFISD





TTPKPLVIVTPSNNAHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLFNKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSI





FHGGVDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNENTAN





FKKEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHIN





WVRSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 103


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLDSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGSVLDRQ





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLFNKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYESSI





FHGGVDSLVDLMNKSFPELGISKTDCKEFSWIDTTIFYSGVVNENTAN





FKKEILLDRSAGKKTAFSIKLDYVKQPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHIN





WVRSVYNFTTPYVSQNPRLAYLNYRDLDLGGTNHASPNNYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 104


NARENFLKCFSKHIPNNVANPKLVYTQHDQLFESLLNSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGLSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNVEIHA





LVKLFNKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSI





FHGGVDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNENTAN





FKKEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESAIPFPHRAGTVYELWYTASWEKQEDNEKHIN





WVRSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 105


NARENFLKCFSKHIPNNVANPKLVYTQHDQLFESLLDSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGLSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNVEIHA





LVKLFNKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYESSI





FHGGVDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNFNTAN





FKKEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESAIPFPHRAGTVYELWYTASWEKQEDNEKHIN





WVRSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 106


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLNSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRKYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNGAIHE





LVKLFNKWQNIAYKYDEDLVLMTHFITKNITDNHGKNKTTVHGYFSSI





FHGGVDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNFNTAN





FKKEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEKISESAIPFPHRAGIMYELWYTASWEKQEDNEKHIN





WVRSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 107


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLNSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLENKWONIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSI





FHGGVDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGGTGSGTFK





KEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAGMY





VLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHINWV





RSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGEKY





FGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 108


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLNSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLFNKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSI





FHGGVDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGGTGSGTKK





EILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAGMYV





LYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHINWVR





SVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGEKYF





GKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 109


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLNSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLFNKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSI





FHGGVDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGGDGTGKKE





ILLDRSAKGKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAGMYVL





YPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHINWVRS





VYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGEKYFG





KNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 110


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLNSTIONLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLENKWQNIAYKYDKDLVLMTHFITKNPTPNTTVHGYFSSIFHGGV





DSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNFNTANFKKEI





LLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAGMYVLY





PYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHINWVRSV





YNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGEKYFGK





NFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 111


NARENFLKCFSKHIPNANPKLVYTQHDQLYMSLLNSTIQNLRFISDTT





PKPLVIVTPSNNAHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQVP





FVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFPGG





YCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRKSM





GEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHGLV





KLFNKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSIFH





GGVDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNFNTANFK





KEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAGMY





VLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHINWV





RSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGEKY





FGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 112


NARENLLKCLSKHDGTGKTVYTQHDQLYMSLLNSTIQNLRFISDTTPK





PLVIVTPSNNAKIQATILCSKKVGLQIRTRSGGHDAEGMSYISQVPFV





VVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFPGGYC





PTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRKSMGE





DLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHGLVKL





FNKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSIFHGG





VDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNFNTANFKKE





ILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAGMYVL





YPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHINWVRS





VYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGEKYFG





KNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 113


NARENLLKCLSKHGDGTGKTVYTQHDQLYMSLLNSTIQNLRFISDTTP





KPLVIVTPSNNAKIQATILCSKKVGLQIRTRSGGHDAEGMSYISQVPF





VVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFPGGY





CPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRKSMG





EDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHGLVK





LFNKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSIFHG





GVDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNENTANFKK





EILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAGMYV





LYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHINWVR





SVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGEKYF





GKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 114


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLNSTIQNLRFISD





TTPKPLVIVTPSNNAHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLENKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYESSI





FHGGVDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNENTAN





FKKEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHIN





WVRSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNDYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 115


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLDSTIONLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLENKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSI





FHGGVDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNFNTAN





FKKEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHIN





WVRSVYNFTTPYVSONPRLAYLNYRDLDLGKTNHASPNDYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 116


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLNSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNEDLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPKSSTIFSVKKNMEIHG





LVKLFNKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSI





FHGGVDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNFNTAN





FKKEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHIN





WVRSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 117


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLDSTIONLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLFNKWQNIAYKYDKDLVLMTHFITKNPTPNTTVHGYFSSIFHGGV





DSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNENTANFKKEI





LLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAGMYVLY





PYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHINWVRSV





YNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGEKYFGK





NFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 118


NARENFLKCFSKHIPNANPKLVYTQHDQLYMSLLDSTIQNLRFISDTT





PKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQVP





FVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFPGG





YCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRKSM





GEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHGLV





KLFNKWQNIAYKYDKDLVLMTHFITKNPTPNTTVHGYFSSIFHGGVDS





LVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNFNTANFKKEILL





DRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAGMYVLYPY





GGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHINWVRSVYN





FTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGEKYFGKNF





NRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 119


NARENFLKCFSKHIPNANPKLVYTQHDQLYMSLLDSTIQNLRFISDTT





PKPLVIVTPSNNAHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQVP





FVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFPGG





YCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRKSM





GEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHGLV





KLFNKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSIFH





GGVDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNFNTANFK





KEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAGMY





VLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHINWV





RSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGEKY





FGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 120


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLDSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNEDLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPKSSTIFSVKKNMEIHG





LVKLFNKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSI





FHGGVDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNFNTAN





FKKEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHIN





WVRSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 121


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLDSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLFNKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSI





FHGGVDSLVDLMNKSFPELGIKKTDAKEFSWIDTTIFYSGVVNFNTAN





FKKEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHIN





WVRSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 122


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLDSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLFNKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYESSI





FHGGVDSLVDLMNKSFPELGIKKTDTKEFSWIDTTIFYSGVVNFNTAN





FKKEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHIN





WVRSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 123


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLDSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLFNKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSI





FHGGVDSLVDLMNKSFPELGIKKTDPKEFSWIDTTIFYSGVVNENTAN





FKKEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHIN





WVRSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 124


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLDSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLFNKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYESSI





FHGGVDSLVDLMNKSFPELGIKKTDVKEFSWIDTTIFYSGVVNENTAN





FKKEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHIN





WVRSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 125


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLDSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGDNFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLFNKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSI





FHGGVDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNFNTAN





FKKEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHIN





WVRSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 126


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLNSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLFNKWQNIAYKYDKDLVLMTHFITKNPTPNTTVHGYFSSIFHGGV





DSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNFNTANFKKEI





LLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAGMYVLY





PYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHINWVRSV





YNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGEKYFGK





NFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 127


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLNSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLENKWQNIAYKYDKDLVLMTHFITKNPTPNTTVHGYFSSIFHGGV





DSLVDLMNKSFPELGIKKTDAKEFSWIDTTIFYSGVVNFNTANFKKEI





LLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAGMYVLY





PYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHINWVRSV





YNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGEKYFGK





NFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 128


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLNSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLENKWQNIAYKYDKDLVLMTHFITKNPTPNTTVHGYFSSIFHGGV





DSLVDLMNKSFPELGIKKTDTKEFSWIDTTIFYSGVVNFNTANFKKEI





LLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAGMYVLY





PYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHINWVRSV





YNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGEKYFGK





NFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 129


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLNSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLFNKWQNIAYKYDKDLVLMTHFITKNPTPNTTVHGYFSSIFHGGV





DSLVDLMNKSFPELGIKKTDPKEFSWIDTTIFYSGVVNFNTANFKKEI





LLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAGMYVLY





PYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHINWVRSV





YNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGEKYFGK





NFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 130


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLNSTIONLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLFNKWQNIAYKYDKDLVLMTHFITKNPTPNTTVHGYFSSIFHGGV





DSLVDLMNKSFPELGIKKTDVKEFSWIDTTIFYSGVVNFNTANFKKEI





LLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAGMYVLY





PYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHINWVRSV





YNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGEKYFGK





NFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 131


NARENFLKCFSKHIPNANPKLVYTQHDQLYMSLLNSTIQNLRFISDTT





PKPLVIVTPSNNAHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQVP





FVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFPGG





YCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRKSM





GEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHGLV





KLFNKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSIFH





GGVDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNFNTANFK





KEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAGMY





VLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHINWV





RSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGEKY





FGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 132


NARENFLKCFSKHIPNANPKLVYTQHDQLYMSLLNSTIQNLRFISDTT





PKPLVIVTPSNNAHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQVP





FVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFPGG





YCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRKSM





GEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHGLV





KLENKWONIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSIFH





GGVDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNFNTANFK





KEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAGMY





VLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHINWV





RSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNDYTQARIWGEKY





FGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 133


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLNSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNEDLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPKSSTIFSVKKNMEIHG





LVKLENKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSI





FHGGVDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNFNTAN





FKKEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAG





MYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHIN





WVRSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGE





KYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 134


NARENFLKCFSKHIPNANPKLVYTQHDQLYMSLLNSTIQNLRFISDTT





PKPLVIVTPSNNAHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQVP





FVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFPGG





YCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRKSM





GEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHGLV





KLENKWQNIAYKYDKDLVLMTHFITKNPTPNTTVHGYFSSIFHGGVDS





LVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNENTANFKKEILL





DRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAGMYVLYPY





GGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHINWVRSVYN





FTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGEKYFGKNF





NRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 135


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLNSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNEDLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPKSSTIFSVKKNMEIHG





LVKLFNKWQNIAYKYDKDLVLMTHFITKNPTPNTTVHGYFSSIFHGGV





DSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNENTANFKKEI





LLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAGMYVLY





PYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHINWVRSV





YNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGEKYFGK





NFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 136


NARENFLKCFSKHIPNANPKLVYTQHDQLYMSLLNSTIQNLRFISDTT





PKPLVIVTPSNNAHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQVP





FVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNEDLSFPGG





YCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRKSM





GEDLFWAIRGGGGENFGIIAAWKIKLVAVPKSSTIFSVKKNMEIHGLV





KLFNKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSIFH





GGVDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNFNTANFK





KEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAGMY





VLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHINWV





RSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGEKY





FGKNFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 137


NARENFLKCFSKHIPNANPKLVYTQHDQLYMSLLNSTIONLRFISDTT





PKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQVP





FVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNEDLSFPGG





YCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRKSM





GEDLFWAIRGGGGENFGIIAAWKIKLVAVPKSSTIFSVKKNMEIHGLV





KLFNKWQNIAYKYDKDLVLMTHFITKNPTPNTTVHGYFSSIFHGGVDS





LVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNFNTANFKKEILL





DRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAGMYVLYPY





GGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHINWVRSVYN





FTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGEKYFGKNF





NRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 138


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLDSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGDNFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLENKWQNIAYKYDKDLVLMTHFITKNPTPNTTVHGYFSSIFHGGV





DSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNENTANFKKEI





LLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAGMYVLY





PYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHINWVRSV





YNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGEKYFGK





NFNRLVKVKTKVDPNNFFRNEQSIPPLVN





SEQ ID NO: 139


NARENFLKCFSKHIPNNVANPKLVYTQHDQLYMSLLNSTIQNLRFISD





TTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQ





VPFVVVDLRNMHSIKIDVHSQTAWVESGATLGEVYYWINEKNENLSFP





GGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK





SMGEDLFWAIRGGGGDNFGIIAAWKIKLVAVPSKSTIFSVKKNMEIHG





LVKLFNKWQNIAYKYDKDLVLMTHFITKNPTPNTTVHGYFSSIFHGGV





DSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNFNTANFKKEI





LLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAGMYVLY





PYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHINWVRSV





YNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGEKYFGK





NFNRLVKVKTKVDPNNFFRNEQSIPPLVN





Accessory elements:


SEQ ID NO: 140


MRFPSIFTAVLFAASSALAAPVNTTTEDETAQIPAEAVIGYSDLEGDF





DVAVLPFSNSTNNGLLFINTTIASIAAKEEGVSLEKREAEA





SEQ ID NO: 141


MQYKKTLVASALAATTLAAYAPSEPWSTLTPTATYSGGVTDYASTFGI





AVOPISTTSSASSAATTASLEKREADA





SEQ ID NO: 142


MRFPSIFTAVLFAASSALAARAYAPSEPWSTLTPTATYSGGVTDYAST





FGIAVQPISTTSSASSAATTASSASLEKREADA





SEQ ID NO: 143


MRFPSIFTAVLFAASSALAARAYAPSEPWSTLTPTATYSGGVTDYAST





FGIAVQPISTTSSASSAATTASSARAYAPSEPWSTLTPTATYSGGVTD





YASTFGIAVQPISTTSSASSAATTASSASLEKREADA





SEQ ID NO: 144


MLLQAFLFLLAGFAAKISAGSASLEKREADA





SEQ ID NO: 145


MFSPILSLEIILALATLOSVFAASLEKREADA





SEQ ID NO: 146


MYRNLIIATALTCGAYSAYVPSEPWSTLTPDASLESALKDYSQTFGIA





IKSLDADGSASLEKREADA





SEQ ID NO: 147


MRQVWFSWIVGLFLCFFNVSSAASLEKREADA





SEQ ID NO: 148


MRQVWFSWIVGLFLCFFNVSSAAPVNTTTEDETAQIPAEAVIGYSDLE





GDEDVAVLPFSNSTNNGLLFINTTIASIAAKEEGASLEKREADA





SEQ ID NO: 149


MFNLKTILISTLASIAVAASLEKREADA





SEQ ID NO: 150


YPYDVPDYA





SEQ ID NO: 151


HHHHHH





SEQ ID NO: 152


HNHNHNHN





SEQ ID NO: 153


TATCCATATGATGTTCCAGATTATGCT






Enumerated Embodiments

The following enumerated embodiments are provided, the numbering of which is not to be construed as designating levels of importance.


Embodiment 1 provides a recombinant vector comprising a promoter, secretion tag, gene of interest (GOI), and optionally a purification and/or a detection tag, wherein the gene of interest comprises a polynucleotide sequence encoding a polypeptide with at least 90%, 95%, 98%, or 99.9% sequence identity to the sequence as set forth in any one of SEQ ID NOs: 84-139.


Embodiment 2 provides the recombinant vector of embodiment 1, wherein the polynucleotide sequence encodes a polypeptide as set forth in any one of SEQ ID NOs: 84-139.


Embodiment 3 provides the recombinant vector of embodiment 2, wherein the polynucleotide sequence comprises the sequence as set forth in any one of SEQ ID NOs: 2-57, or a sequence comprising at least 90%, 95%, 98%, 99%, or 99.9% sequence identity to the sequence as set forth in any one of SEQ ID NOs: 2-57.


Embodiment 4 provides the recombinant vector of embodiment 1, wherein the gene of interest is operably linked to a promoter.


Embodiment 5 provides the recombinant vector of embodiment 4, wherein the promoter is a polynucleotide as set forth in any one of SEQ ID NOs: 71-80.


Embodiment 6 provides the recombinant vector of embodiment 5, wherein the promoter has a polynucleotide sequence comprising at least 90%, 95%, 98%, or 99% sequence identity to the sequence of SEQ ID NOs: 71-80.


Embodiment 7 provides the recombinant vector of embodiment 4, wherein the promoter is a constitutive promoter or an inducible promoter.


Embodiment 8 provides the recombinant vector of embodiment 4, wherein the promoter is selected from a group consisting of an alcohol oxidase 1 promoter (pAOX1), an alcohol oxidase 2 promoter (pAOX2), a dihydroxyacetone kinase promoter (pDAK), an S-hydroxymethyl-glutathione hydrolase promoter (pFGH), an NAD+-dependent formylglutathione dehydrogenase promoter (pFDH), a fructose 1,6-bisphosphate aldolase promoter (pFBA2), a peroxisomal membrane signal receptor PTS1 promoter (pPEX5), a peroxisomal protein Pex8p promoter (pPEX8), an alcohol dehydrogenase 2 promoter (pADH2) and a peroxin Pex14p promoter (pPEX14).


Embodiment 9 provides the recombinant vector of embodiment 1, comprising a secretion tag as set forth in any one of SEQ ID NOs: 58-67 and 154.


Embodiment 10 provides the recombinant vector of embodiment 9, wherein the sequence of the secretion tag is at least 90%, 95%, 98%, 99%, or 100% identical to SEQ ID NOs: 58-67 and 154.


Embodiment 11 provides the recombinant vector of embodiment 1, comprising a purification tag as set forth in any one of SEQ ID NOs: 68-70 and 153.


Embodiment 12 provides the recombinant vector of embodiment 11, wherein the purification/detection tag is selected from the group consisting of a human influenza hemagglutinin (HA) tag, a His 6 tag and an HN tag.


Embodiment 13 provides a host expression system comprising the recombinant vector of embodiment 1.


Embodiment 14 provides the host expression system of embodiment 13, comprising a microbial cell, a yeast cell, a plant cell, or an animal cell.


Embodiment 15 provides the host expression system of embodiment 14, wherein the yeast cell is selected from the group consisting of Pichia pastoris, Pichia angusta, Pichia guillermordii, Pichia methanolica, Pichia inositovera, Hansenula polymorpha, Candida boidinii, and Yarrowia lipolytica.


Embodiment 16 provides a method of producing a cannabinoid compound, comprising reacting a cannabinoid precursor with an enzyme that is expressed by the host expression system of embodiment 13, wherein the cannabinoid precursor is compound of Formula I (see below): wherein R1 is H or —COOH and R2 is a linear or branched CH3, C2H5, C3H7, C4H9, C5H10, C6H13, C7H15 or C8H17 group.




embedded image


Embodiment 17 provides a polypeptide comprising the amino acid sequence of any one of SEQ ID NOs: 84-139 or an amino acid sequence comprising at least 90%, 95%, 98%, or 99.9% sequence identity to any one of SEQ ID NOs: 84-139.


Embodiment 18 provides the polypeptide of embodiment 17, wherein the polypeptide comprises the amino acid sequence of any one of SEQ ID NOs: 91 and 110-116 or an amino acid sequence comprising at least 90%, 95%, 98%, or 99.9% sequence identity to any one of SEQ ID NOs: 91 and 110-116.


Embodiment 19 provides a nucleic acid comprising a polynucleotide sequence encoding a polypeptide comprising the amino acid sequence of any one of SEQ ID NOs: 84-139 or an amino acid sequence comprising at least 90%, 95%, 98%, or 99.9% sequence identity to any one of SEQ ID NOs: 84-139.


Embodiment 20 provides a nucleic acid of embodiment 19, wherein the nucleic acid comprises the polynucleotide sequence of any one of SEQ ID NOs: 2-57 or a polynucleotide sequence comprising at least 90%, 95%, 98%, or 99.9% sequence identity to any one of SEQ ID NOs: 2-57.


Embodiment 21 provides a nucleic acid of embodiment 20, wherein the nucleic acid comprises the polynucleotide sequence of any one of SEQ ID NOs: 9 and 28-34 or a polynucleotide sequence comprising at least 90%, 95%, 98%, or 99.9% sequence identity to any one of SEQ ID NOs: 9 and 28-34.


Other Embodiments

The disclosures of each and every patent, patent application, and publication cited herein are hereby incorporated herein by reference in their entirety. While this invention has been disclosed with reference to specific embodiments, it is apparent that other embodiments and variations of this invention may be devised by others skilled in the art without departing from the true spirit and scope of the invention. The appended claims are intended to be construed to include all such embodiments and equivalent variations.

Claims
  • 1. A recombinant vector comprising a promoter, secretion tag, gene of interest (GOI), and optionally a purification and/or a detection tag, wherein the gene of interest comprises a polynucleotide sequence encoding a polypeptide with at least 90%, 95%, 98%, or 99.9% sequence identity to the sequence as set forth in any one of SEQ ID NOs: 84-139.
  • 2. The recombinant vector of claim 1, wherein the polynucleotide sequence encodes a polypeptide as set forth in any one of SEQ ID NOs: 84-139.
  • 3. The recombinant vector of claim 1, wherein the polynucleotide sequence comprises the sequence as set forth in any one of SEQ ID NOs: 2-57, or a sequence comprising at least 90%, 95%, 98%, 99%, or 99.9% sequence identity to the sequence as set forth in any one of SEQ ID NOs: 2-57.
  • 4. The recombinant vector of claim 1, wherein the gene of interest is operably linked to a promoter.
  • 5. The recombinant vector of claim 4, wherein the promoter is a polynucleotide as set forth in any one of SEQ ID NOs: 71-80.
  • 6. The recombinant vector of claim 5, wherein the promoter has a polynucleotide sequence comprising at least 90%, 95%, 98%, or 99% sequence identity to the sequence of SEQ ID NOs: 71-80.
  • 7. The recombinant vector of claim 4, wherein the promoter is a constitutive promoter or an inducible promoter.
  • 8. The recombinant vector of claim 4, wherein the promoter is selected from a group consisting of an alcohol oxidase 1 promoter (pAOX1), an alcohol oxidase 2 promoter (pAOX2), a dihydroxyacetone kinase promoter (pDAK), an S-hydroxymethyl-glutathione hydrolase promoter (pFGH), an NAD+-dependent formylglutathione dehydrogenase promoter (pFDH), a fructose 1,6-bisphosphate aldolase promoter (pFBA2), a peroxisomal membrane signal receptor PTS1 promoter (pPEX5), a peroxisomal protein Pex8p promoter (pPEX8), an alcohol dehydrogenase 2 promoter (pADH2) and a peroxin Pex14p promoter (pPEX14).
  • 9. The recombinant vector of claim 1, comprising a secretion tag as set forth in any one of SEQ ID NOs: 58-67 and 154.
  • 10. The recombinant vector of claim 9, wherein the sequence of the secretion tag is at least 90%, 95%, 98%, 99%, or 100% identical to SEQ ID NOs: 58-67 and 154.
  • 11. The recombinant vector of claim 1, comprising a purification tag as set forth in any one of SEQ ID NOs: 68-70 and 153.
  • 12. The recombinant vector of claim 11, wherein the purification/detection tag is selected from the group consisting of a human influenza hemagglutinin (HA) tag, a His 6 tag and an HN tag.
  • 13. A host expression system comprising the recombinant vector of claim 1.
  • 14. The host expression system of claim 13, comprising a microbial cell, a yeast cell, a plant cell, or an animal cell.
  • 15. The host expression system of claim 14, wherein the yeast cell is selected from the group consisting of Pichia pastoris, Pichia angusta, Pichia guillermordii, Pichia methanolica, Pichia inositovera, Hansenula polymorpha, Candida boidinii, and Yarrowia lipolytica.
  • 16. A method of producing a cannabinoid compound, comprising reacting a cannabinoid precursor with an enzyme that is expressed by the host expression system of claim 13, wherein the cannabinoid precursor is compound of Formula I (see below): wherein R1 is H or —COOH and R2 is a linear or branched CH3, C2H5, C3H7, C4H9, C510, C6H13, C7H15 or C8H17 group.
  • 17. A polypeptide comprising the amino acid sequence of any one of SEQ ID NOs: 84-139 or an amino acid sequence comprising at least 90%, 95%, 98%, or 99.9% sequence identity to any one of SEQ ID NOs: 84-139.
  • 18. The polypeptide of claim 17, wherein the polypeptide comprises the amino acid sequence of any one of SEQ ID NOs: 91 and 110-116 or an amino acid sequence comprising at least 90%, 95%, 98%, or 99.9% sequence identity to any one of SEQ ID NOs: 91 and 110-116.
  • 19. A nucleic acid comprising a polynucleotide sequence encoding a polypeptide comprising the amino acid sequence of any one of SEQ ID NOs: 84-139 or an amino acid sequence comprising at least 90%, 95%, 98%, or 99.9% sequence identity to any one of SEQ ID NOs: 84-139. The nucleic acid of claim 19, wherein the nucleic acid comprises the polynucleotide sequence of any one of SEQ ID NOs: 2-57 or a polynucleotide sequence comprising at least 90%, 95%, 98%, or 99.9% sequence identity to any one of SEQ ID NOs: 2-57.
  • 21. The nucleic acid of claim 20, wherein the nucleic acid comprises the polynucleotide sequence of any one of SEQ ID NOs: 9 and 28-34 or a polynucleotide sequence comprising at least 90%, 95%, 98%, or 99.9% sequence identity to any one of SEQ ID NOs: 9 and 28-34.
CROSS-REFERENCE TO RELATED APPLICATIONS

The present application claims priority under 35 U.S.C. § 119(e) to U.S. Provisional Patent Application 63/113,240, filed Nov. 13, 2020, which is herein incorporated by reference in its entirety.

PCT Information
Filing Document Filing Date Country Kind
PCT/US2021/058944 11/11/2021 WO
Provisional Applications (1)
Number Date Country
63113230 Nov 2020 US