Recombinant chalcomycin polyketide synthase and modifying genes

Information

  • Patent Application
  • 20040132055
  • Publication Number
    20040132055
  • Date Filed
    August 21, 2003
    20 years ago
  • Date Published
    July 08, 2004
    20 years ago
Abstract
Domains of chalcomycin polyketide synthases and modification enzymes and polynucleotides encoding them are provided. Methods to prepare chalcomycin in pharmaceutically useful quantities are described, as are methods to prepare chalcomycin analogs and other polyketides using the polynucleotides encoding chalcomycin polyketide synthase domains or modifying enzymes.
Description


FIELD OF THE INVENTION

[0003] The invention relates to recombinant polynucleotides that encode polypeptides or domains of the chalcomycin polyketide synthase gene cluster. Accordingly, the present invention is directed to the production of chalcomycin PKS enzymes, to polynucleotides that encode such enzymes, and to host cells that contain such polynucleotides. Further enhancements in the biological activities of chalcomycin and other polyketides, through production of derivatives thereof, is also made possible according to the practice of the invention by providing P450 hydroxylases that provide attachment points on the polyketide molecule for further modifications. Thus the present invention relates to the fields of molecular biology, chemistry, recombinant DNA technology, medicine, animal health, and agriculture.



BACKGROUND OF THE INVENTION

[0004] Polyketides represent a large family of diverse compounds synthesized from 2 carbon units through a series of condensations and subsequent modifications. Polyketides occur in many types of organisms including fungi and mycelial bacteria, in particular the actinomycetes. An appreciation for the wide variety of polyketide structures and for their biological activities, may be gained upon review of the extensive art, for example, published PCT Patent Publication WO 95/08548 and U.S. Pat. Nos. 5,672,491 and 6,303,342


[0005] Polyketides are synthesized in nature by polyketide synthases (“PKS”). The Type I or modular PKS comprise a set of separate catalytic active sites; each active site is termed a “domain”, and a set thereof is termed a “module”. One module exists for each cycle of carbon chain elongation and modification. FIG. 9 of aforementioned WO95/08548 depicts a typical Type I PKS, in this case 6-deoxyerythronolide B synthase (“DEBS”), which is involved in the production of erythromycin. Six separate modules, each catalyzing a round of condensation and modification of a 2-carbon unit, are present in DEBS. The number and type of catalytic domains that are present in each module varies based on the needed chemistry, and the total of 6 modules is provided on 3 separate polypeptides (designated DEBS-1, DEBS-2, and DEBS-3, with 2 modules per each polypeptide). Each of the DEBS polypeptides is encoded by a separate open reading frame (gene), see Caffrey et al., FEBS Letters, 304, pp. 205, 1992. DEBS provides a representative example of a Type I PKS. In DEBS, modules 1 and 2 reside on DEBS-1, modules 3 and 4 on DEBS-2, and modules 5 and 6 on DEBS-3, wherein module 1 is defined as the first module to act on the growing polyketide backbone, and module 6 the last.


[0006] The minimal PKS module is typified by module 3 of DEBS which contains a ketosynthase (“KS”) domain, an acyltransferase (“AT”) domain, and an acyl carrier protein (“ACP”) domain. These three enzyme activities are sufficient to activate a 2-carbon extender unit and attach it to the growing polyketide molecule. Additional domains that may be included in a module relate to reactions other than the actual condensation, and include domains for a ketoreductase activity (“KR”), a dehydratase activity (“DH”), and an enoylreductase activity (“ER”). With respect to DEBS-1, the first module thereof also contains an additional set of the AT and ACP activities because that module catalyzes initial condensation, and so begins with a “loading domain” (sometimes referred to as a loading module) that contains an AT and ACP, that bind the starter unit. The “finishing” of the 6-deoxyerythronolide molecule is regulated by a thioesterase activity (“TE”) in module 6 that catalyzes cyclization of the macrolide ring during release of the product of the PKS.


[0007] PKS genes can be engineered in a variety of ways to achieve biosynthesis of polyketides. For instance, PKS genes can be inserted into a heterologous host to make a polyketide in a host that does not make it naturally. Polyketides can also be made by hybrid or otherwise altered PKSs or polyketide biosynthetic gene clusters. Also, polyketides can be overproduced by increasing the pools of available starting polyketide biosynthetic precursors and by other means. See U.S. Pat. Nos. 5,672,491; 5,962,290; 6,080,555; 6,391,594; and 6,221,641 and PCT Patent Publications 00/47724, 01/27306, and 01/31035.


[0008] Chalcomycin is a 16-membered macrolide antibiotic produced by some strains of Streptomyces bikiniensis and possesses a broad spectrum of antimicrobial activity. Certain naturally occurring derivatives of chalcomycin produced by other Streptomyces organisms also have antimicrobial activity. For instance, the 8-deoxy chalcomycin derivative produced by Streptomycin hirsutus has antimicrobial activity against gram-positive bacteria. Chalcomycin has two modifying sugar molecules, D-mycinose and D-chalcose, the former being subject to post-glycosylation modification by O-methylation at two positions. For additional information regarding chalcomycin, see Woo, P. W. K. et al., J.A.C.S., 1962, 84, 1512; 1964, 86, 2724; 2726; Celmer, W. D., J. A. C. S., 1965, 87, 1801; Omura, S. et al., J. A. C. S., 1975, 97, 4001; Neszmelyi, A. et al., Chem. Comm., 1976, 97; Jardim, M. E. et al., Int. J. Mass Spectrom. Ion Phys., 1983, 48, 189; Hauske, J. R. et al., J. O. C., 1986, 51, 2808; Kim, S. D. et al., J. Antibiot., 1996, 49, 955; Woo, P. W. K. et al., Tetrahedron, 1996, 52, 3857 and Goo, Y. M. et al., J. Antibiot., 1997, 50, 85.


[0009] The chemical structure of chalcomycin, shown without stereochemistry, is provided by formula I below.
1


[0010] Chalcomycin is synthesized by a Type I or modular PKS and modification enzymes. Post-PKS modification reactions include P450 oxidation at three sites to add hydroxyl groups and glycosylation at the C5 hydroxyl to add D-chalcose, and at the C20 hydroxyl to add allose, which is then methylated at two positions to yield D-mycinose.


[0011] There is a need for recombinant nucleic acids, host cells, and methods of using those host cells to produce polyketides including but not limited to chalcomycin and chalcomycin analogs. These and other needs are met by the materials and methods provided by the present invention.



SUMMARY OF THE INVENTION

[0012] The present invention provides recombinant nucleic acids encoding polyketide synthases and polyketide modification enzymes. The recombinant nucleic acids of the invention are useful in the production of polyketides, including but not limited to chalcomycin and chalcomycin analogs and derivatives in recombinant host cells. The biosynthesis of chalcomycin is performed by a modular PKS and polyketide modification enzymes. The chalcomycin synthase is made up of several proteins, each having one or more modules. The modules have domains with specific synthetic functions.


[0013] The present invention also provides domains and modules of the chalcomycin PKS and corresponding nucleic acid sequences encoding them and/or parts thereof. Such compounds are useful in the production of hybrid PKS enzymes and the recombinant genes that encode them.


[0014] The present invention also provides modifying genes of chalcomycin biosynthetic gene cluster in recombinant form, including but not limited to isolated form and incorporated into a vector or the chromosomal DNA of a host cell. The present invention also provides recombinant P450 hydroxylases that provide hydroxyl attachment points useful for further chemical modification. The P450 hydroxylases of the present invention include ChmHI, ChmPI and ChmPII hydroxylases.


[0015] The present invention also provides recombinant host cells that contain the nucleic acids of the invention. In one embodiment, the host cell provided by the invention is a Streptomyces host cell that produces a chalcomycin modification enzyme and/or a domain, module, or protein of the chalcomycin PKS. Methods for the genetic manipulation of Streptomyces are described in Kieser et al, “Practical Streptomyces Genetics,” The John Innes Foundation, Norwich (2000), which is incorporated herein by reference in its entirety.


[0016] Accordingly, there is provided a recombinant PKS wherein at least 10, 15, 20, or more consecutive amino acids in one or more domains of one or more modules thereof are derived from one or more domains of one or more modules of chalcomycin polyketide synthase. Preferabiy at least an entire domain of a module of chalcomycin synthase is included. Representative chalcomycin PKS domains useful in this aspect of the invention include, for example, KR, DH, ER, AT, ACP and KS domains. In one embodiment of the invention, the PKS is assembled from polypeptides encoded by DNA molecules that comprise coding sequences for PKS domains, wherein at least one encoded domain corresponds to a domain of chalcomycin PKS. In such DNA molecules, the coding sequences are operably linked to control sequences so that expression therefrom in host cells is effective. In this manner, chalcomycin PKS coding sequences or modules and/or domains can be made to encode PKS to biosynthesize compounds having antibiotic or other useful bioactivity other than chalcomycin.


[0017] These and other aspects of the present invention are described in more detail in the Detailed Description of the Invention, below.







BRIEF DESCRIPTION OF THE DRAWINGS

[0018]
FIG. 1 illustrates the structure of the chalcomycin PKS biosynthetic gene cluster, and cosmids pKOS146.185.1 and pKOS146.185.10, which contain insert DNA encompassing the chalcomycin PKS gene cluster and associated modification enzyme genes. Abbreviations: ACP, acyl carrier protein; chm, chalcomycin gene; Orf, open reading frame.


[0019]
FIG. 2 shows proposed pathways for post-PKS modification of the chalcomycin-spiramycin hybrid PKS macrolide product.







DETAILED DESCRIPTION OF THE INVENTION

[0020] The invention provides recombinant materials for the production of polyketides. In an aspect, the present invention provides recombinant nucleic acids encoding polyketide synthases that contain all or a portion of the chalcomycin PKS. The biosynthesis of chalcomycin is performed by a modular PKS and modification enzymes. The chalcomycin synthase is made up of five proteins, each having one or more modules, each module comprising domains with specific synthetic functions. Thus, the present invention also provides the domains and modules of the chalcomycin PKS and corresponding nucleic acid sequences encoding them in recombinant form.


[0021] Modifying genes of the chalcomycin biosynthetic gene cluster are also provided, including but not limited to the genes for the ChmHI, ChmPI and ChmPII P450 hydroxylases that provide hydroxyl attachment points useful for further chemical modification.


[0022] Methods and host cells for using these genes to produce or modify a polyketide in recombinant host cells are also provided.


[0023] The nucleotide sequences encoding chalcomycin PKS and modifying proteins of the present invention were isolated from Streptomyces bikiniensis NRRL 2737 (obtained from the Agricultural Research Service Culture Collection, National Center for Agricultural Utilization Research, Peoria, Ill. USA). The chalcomycin PKS gene cluster and modifying genes are contained in cosmids pKOS 146.185.1 and pKOS146.185.10. The cloning and characterization of the chalcomycin PKS gene cluster is described in Example 1, infra. pKOS146-185.1 was deposited under the terms of the Budapest Treaty with the American Type Culture Collection, 10801 University Blvd., Manassas, Va., 20110-2209, on Feb. 19, 2003, with accession number PTA-4961. pKOS146-185.10 was deposited under the terms of the Budapest Treaty with the American Type Culture Collection, 10801 University Blvd., Manassas, Va., 20110-2209, on Feb. 19, 2003, with accession number PTA-4962.


[0024] Given the valuable properties of chalcomycin and its modifying enzymes, means to produce useful quantities thereof and derivatives or analogs of chalcomycin are valuable. Further, the chalcomycin modifying enzymes can also be used to modify other polyketides and produce derivatives thereof with enhanced solubility and/or bioactivity, for instance as antibiotics, and/or sites for further enzymatic or chemical modification. The nucleotide sequences of the chalcomycin biosynthetic gene cluster encoding chalcomycin PKS and modifying enzymes, and domains and/or modules of the PKS can be used, for example, to make antibiotics or other useful compounds in addition to chalcomycin, and in host cells in addition to Streptomyces bikiniensis.


[0025] There is a need for recombinant nucleic acids, host cells, and methods of expressing those nucleic acids in host cells resulting in production of chalcomycin and or its analogs or derivatives, and modifying enzymes, such as the cytochrome P450 hydroxylases that specifically attach hydroxyl groups on the resulting aglycone (which can then be used as attachment points for further modifications). The modifying P450's from the chalcomycin PKS cluster of the present invention can be used to make compounds in a host that does not naturally produce such compounds. These and other needs are met by the materials and methods of the present invention


[0026] In one aspect of the invention, purified and isolated DNA molecules are provided that comprise one or more coding sequences for one or more domains or modules of chalcomycin synthase. Examples of such encoded domains include chalcomycin synthase KR, DH, ER, AT, ACP, and KS domains. In one aspect, the invention provides DNA molecules in which the complete set of chalcomycin PKS-encoding sequences are operably linked to expression control sequences that are effective in suitable host cells to produce chalcomycin and/or its analogs or derivatives. In one aspect, the invention provides polypeptides comprising a portion of the coding sequences for the proteins of the chalcomycin synthase.


[0027] Table 2 in Example 1 provides a description of genes in the chalcomycin PKS gene (i.e., SEQ ID NO:1 and subsequences encoding modules, domains and ORFs, e.g., as indicated), as well as encoded proteins (including SEQ ID. NOS: 2-43) or domains. It will be apparent from Table 2, and FIGS. 1 and 2, which DNA strand comprises the coding sequence for a protein (i.e., the strand having the sequence of SEQ ID NO:1, or its complement.


[0028] In one aspect, the invention provides an isolated or recombinant DNA molecule comprising a nucleotide sequence that encodes at least one polypeptide, alternatively at least one module, alternatively at least one domain, involved in the biosynthesis of a chalcomycin. In one aspect, the invention provides the present invention provides an isolated or recombinant DNA molecule comprising a nucleotide sequence that encodes at least one polypeptide, alternatively at least one module, alternatively at least one domain, involved in the biosynthesis of a chalcomycin. The invention also provides polypeptides comprising PKS interpolypeptide linker sequences, and polynucleotides encoding such linker sequences. Also provided by the invention are polypeptides comprising intrapolypeptide linker sequences, and polynucleotides encoding such linkers.


[0029] In one aspect, the invention provides an isolated or recombinant DNA molecule comprising a sequence identical or substantially similar to at least one subsequence of SEQ ID NO:1 or its complement. In an embodiment the subsequence comprises a sequence encoding a chalcomycin PKS domain or module. In an aspect, the invention provides a recombinant DNA molecule that encodes a polypeptide, module or domain derived from a chalcomycin polyketide synthase (PKS) gene cluster. In this context, a polypeptide, module or domain is derived from a chalcomycin polyketide synthase (PKS) gene cluster when it is encoded by a DNA with substantial sequence identity to the corresponding coding region of the S. bikiniensis chalcomycin gene cluster. For example, in an embodiment, the DNA encoding sequence of the polypeptide, module or domain hybridizes under stringent conditions to a nucleic acid having the sequence of SEQ ID NO:1 (or its complement). Generally, such a polypeptide, module or domain is biologically active, i.e., has at least one enzymatic activity chracteristic of the polypeptide, module or domain encoded exactly by corresponding sequence of SEQ ID NO:1 or its complement. The biological activity of a polypeptide of the invention can be measured by methods well known to the art.


[0030] In one aspect, the invention provides the present invention provides an isolated or recombinant DNA molecule comprising a nucleotide sequence that encodes an open reading frame, module or domain having an amino acid sequence identical or substantially similar to an ORF, module or domain encoded by SEQ ID NO:1 or its complement. Generally, a polypeptide, module or domain having a sequence substantially similar to a reference sequence has substantially the same activity as the reference protein, module or domain (e.g., when integrated into an appropriate PKS framework using methods known in the art). In certain embodiments, one or more activities of a substantially similar polypeptide, module or domain are modified or inactivated as described below.


[0031] In one aspect, the invention provides an isolated or recombinant DNA molecule comprising a nucleotide sequence that encodes at least one polypeptide, module or domain encoded by SEQ ID NO:1, e.g., a polypeptide, module or domain involved in the biosynthesis of a chalcomycin, wherein said nucleotide sequence comprises at least 20, 25, 30, 35, 40, 45, or 50 contiguous base pairs identical to a sequence of SEQ ID NO:1 or its complement. In one aspect, the invention provides an isolated or recombinant DNA molecule comprising a nucleotide sequence that encodes at least one polypeptide, module or domain encoded by SEQ ID NO:1, e.g., a polypeptide, module or domain involved in the biosynthesis of a chalcomycin, wherein said polypeptide, module or domain comprises at least 10, 15, 20, 30, or 40 contiguous residues of a corresponding polypeptide, module or domain encoded by SEQ ID NO:1 or its complement.


[0032] In a related aspect, the invention provides a recombinant DNA molecule, comprising a sequence of at least about 200, optionally at least about 500, basepairs with a sequence identical or substantially identical to a protein encoding region of SEQ ID NO:1. In an embodiment, the DNA molecule encodes a polypeptide, module or domain derived from a chalcomycin polyketide synthase (PKS) gene cluster.


[0033] It will be understood that SEQ ID NO:1 was determined using the inserts of pKOS 146.185.1 and pKOS146-185.10. Accordingly, the invention provides an isolated or recombinant DNA molecule comprising a sequence identical or substantially similar to a ORF encoding sequence of the insert of pKOS 146.185.1 or pKOS146-185.10.


[0034] Those of skill will recognize that, due to the degeneracy of the genetic code, a large number of DNA sequences encode the amino acid sequences of the domains, modules, and proteins of the chalcomycin PKS, the enzymes involved in chalcomycin modification and other polypeptides encoded by the genes of the chalcomycin biosynthetic gene cluster. The present invention contemplates all such DNAs. For example, it may be advantageous to optimize sequence to account for the codon preference of a host organism. The invention also contemplates naturally occurring genes encoding the chalcomycin PKS and tailoring enzymes that are polymorphic or other variants. In addition, it will be appreciated that polypeptide, modules and domains of the invention may comprise one or more conservative amino acid substitutions relative to the polypeptides encoded by SEQ ID NO: 1, such as, for example, conservative substitutions include aspartic-glutamic as acidic amino acids; lysine/arginine/histidine as basic amino acids; leucine/isoleucine, methionine/valine, alanine/valine as hydrophobic amino acids; serine/glycine/alanine/threonine as hydrophilic amino acids.


[0035] As used herein, the terms “substantial identity,” “substantial sequence identity,” or “substantial similarity” in the context of nucleic acids, refers to a measure of sequence similarity between two polynucleotides. Substantial sequence identity can be determined by hybridization under stringent conditions, by direct comparison, or other means. For example, two polynucleotides can be identified as having substantial sequence identity if they are capable of specifically hybridizing to each other under stringent hybridization conditions. Other degrees of sequence identity (e.g., less than “substantial”) can be characterized by hybridization under different conditions of stringency. “Stringent hybridization conditions” refers to conditions in a range from about 5° C. to about 20° C. or 25° C. below the melting temperature (Tm) of the target sequence and a probe with exact or nearly exact complementarity to the target. As used herein, the melting temperature is the temperature at which a population of double-stranded nucleic acid molecules becomes half-dissociated into single strands. Methods for calculating the Tm of nucleic acids are well known in the art (see, e.g., Berger and Kimmel, 1987, Methods In Enzymology, Vol. 152: Guide To Molecular Cloning Techniques, San Diego: Academic Press, Inc. and Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, 2nd Ed., Vols. 1-3, Cold Spring Harbor Laboratory). Typically, stringent hybridization conditions for probes greater than 50 nucleotides are salt concentrations less than about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion at pH 7.0 to 8.3, and temperatures at least about 50° C., preferably at least about 60° C. As noted, stringent conditions may also be achieved with the addition of destabilizing agents such as formamide, in which case lower temperatures may be employed. Exemplary conditions include hybridization at 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4 pH 7.0, 1 mM EDTA at 50° C. (or alternatively 65° C.); wash with 2×SSC, 1% SDS, at 50° C. (or alternatively 0.1-0.2 ×SSC, 1% SDS, at 50° C. or 65° C.). Other exemplary conditions for hybridization include (1) high stringency: 0.1×SSPE, 0.1% SDS, 65° C.; (2) medium stringency: 0.2×SSPE, 0.1% SDS, 50° C.; and (3) low stringency: 1.0×SSPE, 0.1% SDS, 50° C. Equivalent stringencies may be achieved using alternative buffers, salts and temperatures.


[0036] Alternatively, substantial sequence identity can be described as a percentage identity between two nucleotide or amino acid sequences. Two nucleic acid sequences are considered substantially identical when they are at least about 70% identical, at least about 75% identical, or at least about 80% identical, or at least about 85% identical, or at least about 90% identical, or at least about 95% or 98% identical. Two amino acid sequences are considered substantially identical when they are at least about 60%, sequence identical, more often at least about 70%, at least about 80%, or at least about 90% sequence identity to the reference sequence. Percentage sequence (nucleotide or amino acid) identity is typically calculated using art known means to determine the optimal alignment between two sequences and comparing the two sequences. Optimal alignment of sequences may be conducted using the local homology algorithm of Smith and Waterman (1981) Adv. Appl. Math. 2: 482, by the homology alignment algorithm of Needleman and Wunsch (1970) J. Mol. Biol. 48: 443, by the search for similarity method of Pearson and Lipman (1988) Proc. Natl. Acad. Sci. U.S.A. 85: 2444, by the BLAST algorithm of Altschul (1990) J. Mol. Biol. 215: 403-410; and Shpaer (1996) Genomics 38:179-191, or by the Needleham et al. (1970) J. Mol. Biol. 48: 443-453; and Sankoff et al., 1983, Time Warps, String Edits, and Macromolecules, The Theory and Practice of Sequence Comparison, Chapter One, Addison-Wesley, Reading, Mass.; generally by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, Wis.; BLAST from the National Center for Biotechnology Information (http:// www.ncbi.nlm.nih.gov/). In each case default parameters are used (for example the BLAST program uses as defaults a wordlength (W) of 11, the BLOSUM62 scoring matrix (see Henikoff (1992) Proc. Natl. Acad. Sci. USA 89: 10915-10919) alignments (B) of 50, expectation (E) of 10, M=5, N=−4, and a comparison of both strands).


[0037] As used herein the term “recombinant” has its usual meaning in the art and refers to a polynucleotide synthesized or otherwise manipulated in vitro, or to methods of using recombinant polynucleotides to produce gene products in cells or other biological systems. Thus, a “recombinant” polynucleotide is defined either by its method of production or its structure. In reference to its method of production, the process is use of recombinant nucleic acid techniques, e.g., involving human intervention in the nucleotide sequence, typically selection or production. Alternatively, a recombinant polynucleotide can be a polynucleotide made by generating a sequence comprising fusion of two fragments which are not naturally contiguous to each other, but is meant to exclude products of nature. Thus, for example, products made by transforming cells with any non-naturally occurring vector is encompassed, as are polynucleotides comprising sequence derived using any synthetic oligonucleotide process, as are polynucleotides from which a region has been deleted. A recombinant polynucleotide can also be a coding sequence that has been modified in vivo using a recombinant oligo or polynucleotide (such as a PKS in which a domain is inactivated by homologous recombination using a recombinant polynucleotide). A “recombinant” polypeptide is one expressed from a recombinant polynucleotide.


[0038] It will be immediately recognized by those of skill that recombinant polypeptides of the invention have a variety of uses, some of which are described in detail below, including but not limited to use as enzymes, or componants of enzymes, useful for the synthesis or modification of polyketides. Recombinant polypeptides encoded by the chalcomycin PKS gene cluster are also useful as antigens for production of antibodies. Such antibodies find use for purification of bacterial (e.g., Streptomyces bikiniensis) proteins, detection and typing of bacteria, and particularly, as tools for strain improvement (e.g., to assay PKS protein levels to identify “up-regulated” strains in which levels of polyketide producing or modifying proteins are elevated) or assessment of efficiency of expression of recombinant proteins. Polyclonal and monoclonal antibodies can be made by well known and routine methods (see, e.g., Harlow and Lane, 1988, ANTIBODIES: A LABORATORY MANUAL, COLD SPRING HARBOR LABORATORY, New York; Koehler and Milstein 1075, Nature 256:495). In selecting polypeptide sequences for antibody induction, it is not to retain biological activity; however, the protein fragment must be immunogenic, and preferably antigenic (as can be determined by routine methods). Generally the protein fragment is produced by recombinant expression of a DNA comprising at least about 60, more often at least about 200, or even at least about 500 or more base pairs of protein coding sequence, such as a polypeptide, module or domain derived from a chalcomycin polyketide synthase (PKS) gene cluster. Methods for expression of recombinant proteins are well known. (See, e.g., Ausubel et al., 2002, Current Protocols In Molecular Biology, Greene Publishing and Wiley-Interscience, New York.)


[0039] Further aspects of the invention include chimeric PKSs comprising a portion (in one embodiment at least a domain, optionally at least a module, or alternatively at least one polypeptide) from the chalcomycin PKS, and a portion (in one embodiment at least a domain, optionally at least a module, or alternatively at least a polypeptide) from one or more non-chalcomycin PKSs. For example, the invention provides (1) encoding DNA for a chimeric PKS that is substantially patterned on a non-chalcomycin producing enzyme, but which includes one or more functional domains or modules of chalcomycin PKS; (2) encoding DNA for a chimeric PKS that is substantially patterned on the chalcomycin PKS, but which includes one or more functional domains or modules of another PKS or NRPS; and (3) methods for making chalcomycin analogs and derivatives. With respect to item (1) above, examples include chimeric PKS enzymes wherein the genes for the erythromycin PKS, rapamycin PKS, tylosin PKS, and spiramycin PKS, or another PKS function as accepting genes, and one or more of the above-identified coding sequences for chalcomycin domains or modules are inserted as replacements for domains or modules of comparable function. With respect to item (2) above, examples include chimeric PKS enzymes wherein the chalcomycin PKS serves as an accepting gene, and genes for the erythromycin PKS, rapamycin PKS, tylosin PKS, and spiramycin PKS, or another PKS function as accepting genes, and one or more of the above-identified coding sequences for chalcomycin domains or modules are inserted as replacements for domains or modules of comparable function. A partial list of sources of PKS sequences for use in making chimeric molecules, for illustration and not limitation, includes Avermectin (U.S. Pat. No. 5,252,474; MacNeil et al., 1993, Industrial Microorganisms: Basic and Applied Molecular Genetics, Raltz. Hegeman, & Skatrud, eds. (ASM), pp. 245-256; MacNeil et al., 1992, Gene 115: 119-25); Candicidin (FRO008) (Hu et al., 1994, Mol. Microbiol. 14: 163-72); Epothilone (U.S. Pat. No. 6,303,342); Erythromycin (WO 93/13663; U.S. Pat. No. 5,824,513; Donadio et al., 1991, Science 252:675-79; Cortes et al., 1990, Nature 348:176-8); FK-506 (Motamedi et al., 1998, Eur. J. Biochem. 256:528-34; Motamedi et al., 1997, Eur. J. Biochem. 244:74-80); FK-520 (U.S. Pat. No. 6,503,737; see also Nielsen et al., 1991, Biochem. 30:5789-96); Lovastatin (U.S. Pat. No. 5,744,350); Nemadectin (MacNeil et al., 1993, supra); Niddamycin (Kakavas et al., 1997, J. Bacteriol. 179:7515-22); Oleandomycin (Swan et al., 1994, Mol. Gen. Genet. 242:358-62; U.S. Pat. No. 6,388,099; Olano et al., 1998, Mol. Gen. Genet. 259:299-308); Platenolide (EP Pat. App. 791,656 ); Rapamycin (Schwecke et al., 1995, Proc. Natl. Acad. Sci. USA 92:7839-43); Aparicio et al., 1996, Gene 169:9-16); Rifamycin (August et al., 1998, Chemistry & Biology, 5: 69-79); Soraphen (U.S. Pat. No. 5,716,849; Schupp et al., 1995, J. Bacteriology 177: 3673-79); Spiramycin (U.S. Pat. No. 5,098,837); Tylosin (EP 0 791,655; Kuhstoss et al., 1996, Gene 183:231-36; U.S. Pat. No. 5,876,991). Additional suitable PKS coding sequences remain to be discovered and characterized, but will be available to those of skill (e.g., by reference to GenBank).


[0040] Construction of such chimeric enzymes is most effectively achieved by construction of appropriate encoding polynucleotides. In preparing modified and chimeric proteins, it is not necessary, although it may be most efficient, to replace or substitute one or more entire domains or modules of one PKS (e.g., the chalcomycin PKS or another PKS) with an entire domain or module of a different PKS (e.g., the chalcomycin PKS or another PKS). Rather, peptide subsequences of a PKS domain or module that correspond to a peptide subsequence in an accepting domain or module, or which otherwise provide useful function, may be used as replacements. Accordingly, appropriate encoding DNAs for construction of such chimeric PKS include those that encode at least 10, 15, 20 or more amino acids of a selected chalcomycin domain or module. Recombinant methods for manipulating modular PKS genes to make chimeric PKS enzymes are described in U.S. Pat. Nos. 5,672,491; 5,843,718; 5,830,750; and 5,712,146; and in PCT publication Nos. 98/49315 and 97/02358. A number of genetic engineering strategies have been used with DEBS to demonstrate that the structures of polyketides can be manipulated to produce novel natural products, primarily analogs of the erythromycins (see the patent publications referenced supra and Hutchinson, 1998, Curr Opin Microbiol. 1:319-329, and Baltz, 1998, Trends Microbiol. 6:76-83).


[0041] The invention methods may be directed to the preparation of an individual polyketide. The polyketide may or may not be novel, but the method of preparation permits a more convenient or alternative method of preparing it. The resulting polyketides may be further modified to convert them to other useful compounds. Examples of chemical structures of sixteen-membered macrolides that can be made using the materials and methods of the present invention are described in PCT Patent Publication WO 02/32916; U.S. Patent Application US20020128213A (app. Ser. No. 09/969,177); and copending U.S. provisional patent application No. 60/493,966.


[0042] The recombinant DNAs and DNA vectors of the inventions can also be used to make “libraries” of polyketides. Generally, members of these polyketide libraries may themselves be novel compounds, and the invention further includes novel polyketide members of these libraries. Regardless of the naturally occurring PKS gene used as an acceptor, the invention provides libraries of polyketides by generating modifications in, or using a portion of, the chalcomycin PKS so that the protein complexes produced have altered activities in one or more respects, and thus produce polyketides other than the natural product of the PKS. Novel polyketides may thus be prepared, or polyketides in general prepared more readily, using this method. By providing a large number of different genes or gene clusters derived from a naturally occurring PKS gene cluster, each of which has been modified in a different way from the native cluster, an effectively combinatorial library of polyketides can be produced as a result of the multiple variations in these activities.


[0043] As noted, in one aspect the invention provides recombinant PKS wherein at least 10, 15, 20, or more consecutive amino acids in one or more domains of one or more modules thereof are derived from one or more domains of one or more modules of chalcomycin polyketide synthase. A polyketide synthase “derived from” a naturally occurring PKS contains the scaffolding encoded by all the portion employed of the naturally occurring synthase gene, contains at least two modules that are functional, and contains mutations, deletions, or replacements of one or more of the activities of these functional modules so that the nature of the resulting polyketide is altered. This definition applies both at the protein and genetic levels. Particular embodiments include those wherein a KS, AT, KR, DH, or ER has been deleted or replaced by a version of the activity from a different PKS or from another location within the same PKS, and derivatives where at least one noncondensation cycle enzymatic activity (KR, DH, or ER) has been deleted or wherein any of these activities has been added or mutated so as to change the ultimate polyketide synthesized.


[0044] There are at least five degrees of freedom for constructing a polyketide synthase in terms of the polyketide that will be produced. First, the polyketide chain length will be determined by the number of modules in the PKS. Second, the nature of the carbon skeleton of the PKS will be determined by the specificities of the acyl transferases which determine the nature of the extender units at each position—e.g., malonyl, methyl malonyl, methoxy malonyl, or ethyl malonyl, etc. Third, the loading domain specificity will also have an effect on the resulting carbon skeleton of the polyketide. Fourth, the oxidation state at various positions of the polyketide will be determined by the dehydratase and reductase portions of the modules. This will determine the presence and location of ketone, alcohol, alkene or alkane substituents at particular locations in the polyketide. Fifth, the stereochemistry of the resulting polyketide is a function of three aspects of the synthase. The first aspect is related to the AT/KS specificity associated with substituted malonyls as extender units, which affects stereochemistry only when the reductive cycle is missing or when it contains only a ketoreductase since the dehydratase would abolish chirality. Also, the specificity of the ketoreductase will determine the chirality of the corresponding hydroxyl group. Also, the enoyl reductase specificity for substituted malonyls as extender units will influence the result when there is a complete KR/DH/ER available.


[0045] As can be appreciated by those skilled in the art, polyketide biosynthesis can be manipulated to make a product other than the product of a naturally occurring PKS biosynthetic cluster. For example, AT domains can be altered or replaced to change specificity. For example, and not limitation, the AT domain of chalcomycin module 0 (loading domain) can be replaced by an AT with specificity for methylmalonyl-CoA to produce chalcomycin derivatives with a C-15 ethyl group in place of the C-15 methyl group. The variable domains within a module can be deleted and or inactivated or replaced with other variable domains found in other modules of the same PKS or from another PKS. See e.g., Katz & McDaniel, Med Res Rev 19: 543-558 (1999) and WO 98/49315. Similarly, entire modules can be deleted and/or replaced with other modules from the same PKS or another PKS. See e.g., Gokhale et al., Science 284:482 (1999) and WO 00/47724. For example, and not limitation, 3-hydroxy derivatives of chalcomycin can be produced using a modified chalcomycin PKS in which module 7 of the chalcomycin PKS is replaced by module 7 of the tylosin PKS (optionally with appropriate linker modifications). Similarly, protein subunits of different PKSs also can be mixed and matched to make compounds having the desired backbone and modifications. For example, subunits of 1 and 2 (encoding modules 1-4) of the pikromycin PKS were combined with the DEBS3 subunit to make a hybrid PKS product (see Tang et al., Science, 287: 640 (2001), WO 00/26349 and WO 99/6159). Also see Examples, below.


[0046] Mutations can be introduced into PKS genes such that polypeptides with altered activity are encoded. Polypeptides with “altered activity” include those in which one or more domains are inactivated or deleted, or in which a mutation changes the substrate specificity of a domain, as well as other alterations in activity. Mutations can be made to the native sequences using conventional techniques. The substrates for mutation can be an entire cluster of genes or only one or two of them; the substrate for mutation may also be portions of one or more of these genes. Techniques for mutation include preparing synthetic oligonucleotides including the mutations and inserting the mutated sequence into the gene encoding a PKS subunit using restriction endonuclease digestion. (See, e.g., Kunkel, T. A. Proc Natl Acad Sci USA (1985) 82:448; Geisselsoder et al. BioTechniques (1987) 5:786.) Alternatively, the mutations can be effected using a mismatched primer (generally 10-20 nucleotides in length) that hybridizes to the native nucleotide sequence (generally cDNA corresponding to the RNA sequence), at a temperature below the melting temperature of the mismatched duplex. The primer can be made specific by keeping primer length and base composition within relatively narrow limits and by keeping the mutant base centrally located. (See Zoller and Smith, Methods in Enzymology (1983) 100:468). Primer extension is effected using DNA polymerase. The product of the extension reaction is cloned, and those clones containing the mutated DNA are selected. Selection can be accomplished using the mutant primer as a hybridization probe. The technique is also applicable for generating multiple point mutations. (See, e.g., Dalbie-McFarland et al. Proc Natl Acad Sci USA (1982) 79:6409). PCR mutagenesis can also be used for effecting the desired mutations.


[0047] Random mutagenesis of selected portions of the nucleotide sequences encoding enzymatic activities can be accomplished by several different techniques known in the art, e.g., by inserting an oligonucleotide linker randomly into a plasmid, by irradiation with X-rays or ultraviolet light, by incorporating incorrect nucleotides during in vitro DNA synthesis, by error-prone PCR mutagenesis, and by preparing synthetic mutants or by damaging plasmid DNA in vitro with chemicals. Chemical mutagens include, for example, sodium bisulfite, nitrous acid, hydroxylamine, agents which damage or remove bases thereby preventing normal base-pairing such as hydrazine or formic acid, analogues of nucleotide precursors such as nitrosoguanidine, 5-bromouracil, 2-aminopurine, or acridine intercalating agents such as proflavine, acriflavine, quinacrine, and the like. Generally, plasmid DNA or DNA fragments are treated with chemicals, transformed into E. coli and propagated as a pool or library of mutant plasmids.


[0048] In addition to providing mutated forms of regions encoding enzymatic activity, regions encoding corresponding activities from different PKS synthases or from different locations in the same PKS synthase can be recovered, for example, using PCR techniques with appropriate primers. By “corresponding” activity encoding regions is meant those regions encoding the same general type of activity—e.g., a ketoreductase activity in one location of a gene cluster would “correspond” to a ketoreductase-encoding activity in another location in the gene cluster or in a different gene cluster; similarly, a complete reductase cycle could be considered corresponding—e.g., KR/DH/ER could correspond to KR alone.


[0049] If replacement of a particular target region in a host polyketide synthase is to be made, this replacement can be conducted in vitro using suitable restriction enzymes or can be effected in vivo using recombinant techniques involving homologous sequences framing the replacement gene. One such system involving plasmids of differing temperature sensitivities is described in PCT application WO 96/40968. Another useful method for modifying a PKS gene (e.g., making domain substitutions or “swaps”) is a RED/ET cloning procedure developed for constructing domain swaps or modifications in an expression plasmid without first introducing restriction sites. The method is related to ET cloning methods (see, Datansko & Wanner, 2000, Proc. Natl. Acad. Sci. U.S.A. 97, 6640-45; Muyrers et al, 2000, Genetic Engineering 22:77-98). The RED/ET cloning procedure is used to introduce a unique restriction site in the recipient plasmid at the location of the targeted domain. This restriction site is used to subsequently linearize the recipient plasmid in a subsequent ET cloning step to introduce the modification. This linearization step is necessary in the absence of a selectable marker, which cannot be used for domain substitutions. An advantage of using this method for PKS engineering is that restriction sites do not have to be introduced in the recipient plasmid in order to construct the swap, which makes it faster and more powerful because boundary junctions can be altered more easily.


[0050] In one embodiment, the invention provides a chimeric PKS in which one of more polypeptides are derived from a chalcomycin PKS polypeptide, and one or more peptides are derived from one or more non-chalcomycin PKS(s) that, like the chalcomysin PKS, produces a 16-membered macrolide. Examples of PKS(s) that produces a 16-membered macrolide include, for example, the tylosin PKS, the spiramycin PKS, the niddamycin PKS, and the mycinamicin PKS. All the currently known PKSs for 16-membered macrolides consists of five large polypeptides encoded by colinear genes in a single operon. The arrangement of modules on these polypeptides is conserved. Thus, for known 16-membered macrolide PKSs, the first polypeptide has a loading module and two extender modules, the second a single extender module, the third two extender modules, the fourth a single extender module, and the fifth a single extender module followed by a thioesterase domain. The different aglycone core structures produced by different 16-membered macrolide PKSs is due to differences in the catalytic domains within each of these modules.


[0051] As is illustrated in the examples, below, new hybrid 16-membered macrolides can be made by expressing combinations of PKS polypeptides from different sources in a suitable host. The hybrid PKS produces hybrid polyketides that, optionally can be further modified by the post-PKS tailoring enzymes present within the host. See Examples, infra.


[0052] By expressing particular combinations of these PKS polypeptides one can produce molecules with desired combinations of structural features based on, for example, the macrolactone structural features specified by each of the five polypeptides of different 16-membered macrolide PKSs as shown in Table 1, below. As noted, by expressing particular combinations of these PKS polypeptides one can produce molecules with desired combinations of structural features. Although, as described in the Examples and Table 1, selection of particular combinations of polypeptides provides a level of predictability as to the products formed by the hybrid PKS, the invention is not limited to any particular combinations or structures “predicted” by the table.
1TABLE 1PKS12345TylG15-ethyl10,11-ene9-keto5-hydroxy3-hydroxy14-methyl10-H8-methyl4-methyl2-H12,13-ene7-methylene12-methyl6-ethylSrmG15-methyl10,11-ene9-keto5-hydroxy3-hydroxy14-H10-H8-methyl4-methoxy2-H12,13-ene7-methylene12-H6-ethylChmG15-methyl10,11-ene9-keto5-hydroxy3-keto14-methyl10-H8-methyl4-methyl2-H12,13-ene7-methylene12-H6-methyl


[0053] In one embodiment, the components of the chimeric PKS are arranged onto polypeptides having interpolypeptide linkers that direct the assembly of the polypeptides into the functional PKS protein, such that it is not required that the PKS have the same arrangement of modules in the polypeptides as observed in natural PKSs. Suitable interpolypeptide linkers to join polypeptides and intrapolypeptide linkers to join modules within a polypeptide are described in PCT publication WO 00/47724.


[0054] In one embodiment of the invention, the components of the PKS are arranged into five polypeptides similarly to natural PKS proteins involved in the biosynthesis of tylactone, platenolide, and the like. Thus, for example, the first polypeptide comprises the loading domain, first and second extender modules, and a C-terminal interpolypeptide linker region suitable for interaction with the second polypeptide. The second polypeptide comprises an N-terminal interpolypeptide linker region suitable for interaction with the first polypeptide, the third extender module, and a C-terminal interpolypeptide linker region suitable for interaction with the third polypeptide. The third polypeptide comprises an N-terminal interpolypeptide linker region suitable for interaction with the second polypeptide, the fourth and fifth extender modules, and a C-terminal interpolypeptide linker region suitable for interaction with the fourth polypeptide. The fourth polypeptide comprises an N-terminal interpolypeptide linker region suitable for interaction with the third polypeptide, the sixth extender module, and a C-terminal interpolypeptide linker region suitable for interaction with the fifth polypeptide. The fifth polypeptide comprises an N-terminal interpolypeptide linker region suitable for interaction with the fourth polypeptide, the seventh extender module, and the terminal thioesterase domain.


[0055] In other embodiments of the invention, the components of the PKS residing on any given polypeptide are derived from the same source, and are naturally contiguous in that source, but the intrapolypeptide linkers are changed to allow proper assembly across heterologous polypeptide junctions to form a functional PKS. For example, in one embodiment of the invention, the first polypeptide is the intact first polypeptide of the chalcomycin PKS, encoded by chmGI, and comprises the loading domain and first and second extender modules from the chalcomycin PKS together with the native C-terminal interpolypeptide linker region that directs interaction of the first polypeptide with the second polypeptide of the chalcomycin PKS. The second polypeptide comprises the N-terminal interpolypeptide linker and module 3 of the chalcomycin PKS, encoded by chmGII, but with the C-terminal interpolypeptide linker replaced by the C-terminal interpolypeptide linker from the second polypeptide of the spiramycin PKS, encoded by srmG2. This replaced C-terminal interpolypeptide linker directs the second polypeptide to interact with the third polypeptide, taken from the spiramycin PKS and encoded by the srmG3 gene. The remaining polypeptides are the third, fourth, and fifth polypeptides of the spiramycin PKS, encoded by srmG3, srmG4, and srmG5, respectively. In another embodiment of the invention, the first polypeptide comprises the loading domain and first, second and third extender modules from the chalcomycin PKS, together with a C-terminal interpolypeptide linker region derived from the C-terminus of the first polypeptide of the tylosin PKS. The remaining polypeptides are the third, fourth, and fifth polypeptides of the tylosin PKS. The use of the appropriate interpolypeptide linkers directs the proper assembly of the PKS, thereby improving the catalytic activity of the resulting hybrid PKS.


[0056] As noted above, the DNA compounds of the invention can be expressed in host cells for production of known and novel compounds. Preferred hosts include fungal systems such as yeast and procaryotic hosts, but single cell cultures of, for example, mammalian cells could also be used. A variety of methods for heterologous expression of PKS genes and host cells suitable for expression of these genes and production of polyketides are described, for example, in U.S. Pat. Nos. 5,843,718 and 5,830,750; WO 01/31035, WO 01/27306, and WO 02/068613; and U.S. patent application Ser. Nos. 10/087,451 (published as US2002000087451); 60/355,211; and 60/396,513 (corresponding to published application 20020045220).


[0057] Appropriate host cells for the expression of the hybrid PKS genes include those organisms capable of producing the needed precursors, such as malonyl-CoA, methylmalonyl-CoA, ethylmalonyl-CoA, and methoxymalonyl-ACP, and having phosphopantotheinylation systems capable of activating the ACP domains of modular PKSs. See, for example, U.S. Pat. No. 6,579,695. However, as disclosed in U.S. Pat. No. 6,033,883, a wide variety of hosts can be used, even tnough some hosts natively do not contain the appropriate post-translational mechanisms to activate the acyl carrier proteins of the synthases. Also see WO 97/13845 and WO 98/27203. The host cell may natively produce none, some, or all of the required polyketide precursors, and may be genetically engineered so as to produce the required polyketide precursors. Such hosts can be modified with the appropriate recombinant enzymes to effect these modifications. Suitable host cells include Streptomyces, E. coli, yeast, and other procaryotic hosts which use control sequences compatible with Streptomyces spp. Examples of suitable hosts that either natively produce modular polyketides or have been engineered so as to produce modular polyketides include but are not limited to actinomycetes such as Streptomyces coelicolor, Streptomyces venezuelae, Streptomyces fradiae, Streptomyces ambofaciens, and Saccharopolyspora erythraea, eubacteria such as Escherichia coli, myxobacteria such as Myxococcus xanthus, and yeasts such as Saccharomyces cerevisiae.


[0058] In one embodiment, any native modular PKS genes in the host cell have been deleted to produce a “clean host,” as described in U.S. Pat. No. 5,672,491, incorporated herein by reference. The construction of the clean host S. fradiae K159-1, and the clean host S. fradiae K159-1/244-17a that produces methoxymalonyl-ACP are described below in Examples 2 and 3. Other organisms can be engineered using similar methods.


[0059] In some embodiments, the host cell expresses, or is engineered to express, a polyketide “tailoring” or “modifying” enzyme. Once a PKS product is released, it is subject to post-PKS tailoring reactions. These reactions are important for biological activity and for the diversity seen among 16-membered macrolides. Tailoring enzymes normally associated with polyketide biosynthesis include oxygenases, glycosyl- and methyltransferases, acyltransferases, halogenases, cyclases, aminotransferases, and hydroxylases. Tailoring enzymes for modification of a product of the chalcomycin PKS, a non-chalcomycin PKS, or a chimeric PKS, can be those normally associated with chalcomycin biosynthesis (including, but not limited to, proteins described in Table 2) or “heterologous” tailoring enzymes. As noted above, the P450 hydrolases encoded by the chmHI, chmPI and chmPII genes are of particular interest for production of polyketides having hydroxy groups well suited for subsequent chemical modification.


[0060] For purposes of the present invention, tailoring enzymes can be expressed in the organism in which they are naturally produced, or as recombinant proteins in heterologous hosts. For example, as shown in Examples 6 and 7, a hybrid PKSs having elements from the chalcomycin and spiramycin PKSs, or from the tylosin and chalcomysin PKSs were expressed in an engineered host derived from a tylosin producing strain of S. fradiae in which all or most of the post-PKS tailoring reactions of the tylosin biosynthetic pathway (see Baltz and Seno, 1988, “Genetics of Streptomyces fradiae and tylosin biosynthesis” Annu Rev Microbiol. 42:547-74) were expressed and which modified the polyketide product.


[0061] In some cases, the structure produced by the heterologous or hybrid PKS may be modified with different efficiencies by post-PKS tailoring enzymes from different sources. In such cases, post-PKS tailoring enzymes can be recruited from other pathways to obtain the desired compound. For example, as discussed in Example 6, a chmH gene has been used to modify the product of a chalcomycin-spiramycin hybrid PKS. Similarly, host cells can be selected, or engineered, for expression of a glycosylatation apparatus (discussed below), amide synthases, (see, for example, U.S. patent publication 20020045220 “Biosynthesis of Polyketide Synthase Substrates”). For example and not limitation, the host cell can contain the desosamine, megosamine, and/or mycarose biosynthetic genes, corresponding glycosyl transferase genes, and hydroxylase genes (e.g., picK, megK, eryK, megF, and/or eryF). Methods for glycosylating polyketides are generally known in the art and can be applied in accordance with the methods of the present invention; the glycosylation may be effected intracellularly by providing the appropriate glycosylation enzymes or may be effected in vitro using chemical synthetic means as described herein and in WO 98/49315, incorporated herein by reference. Glycosylation with desosamine, mycarose, and/or megosamine is effected in accordance with the methods of the invention in recombinant host cells provided by the invention. Alternatively and as noted, glycosylation may be effected intracellularly using endogenous or recombinantly produced intracellular glycosylases. In addition, synthetic chemical methods may be employed.


[0062] Alternatively, the aglycone compounds can be produced in the recombinant host cell, and the desired modification (e.g., glycosylation and hydroxylation) steps carried out in vitro (e.g., using purified enzymes, isolated from native sources or recombinantly produced) or in vivo in a converting cell different from the host cell (e.g., by supplying the converting cell with the aglycone).


[0063] It will be apparent to the reader that a variety of recombinant vectors can be utilized in the practice of aspects of the invention. As used herein, “vector” refers to polynucleotide elements that are used to introduce recombinant nucleic acid into cells for either expression or replication. Selection and use of such vehicles is routine in the art. An “expression vector” includes vectors capable of expressing DNAs that are operatively linked with regulatory sequences, such as promoter regions. Thus, an expression vector refers to a recombinant DNA or RNA construct, such as a plasmid, a phage, recombinant virus or other vector that, upon introduction into an appropriate host cell, results in expression of the cloned DNA. Appropriate expression vectors are well known to those of skill in the art and include those that are replicable in eukaryotic cells and/or prokaryotic cells and those that remain episomal or those which integrate into the host cell genome.


[0064] The vectors used to perform the various operations to replace the enzymatic activity in the host PKS genes or to support mutations in these regions of the host PKS genes may be chosen to contain control sequences operably linked to the resulting coding sequences in a manner that expression of the coding sequences may be effected in an appropriate host. Suitable control sequences include those which function in eucaryotic and procaryotic host cells. If the cloning vectors employed to obtain PKS genes encoding derived PKS lack control sequences for expression operably linked to the encoding nucleotide sequences, the nucleotide sequences are inserted into appropriate expression vectors. This can be done individually, or using a pool of isolated encoding nucleotide sequences, which can be inserted into host vectors, the resulting vectors transformed or transfected into host cells, and the resulting cells plated out into individual colonies.


[0065] Suitable control sequences for single cell cultures of various types of organisms are well known in the art. Control systems for expression in yeast are widely available and are routinely used. Control elements include promoters, optionally containing operator sequences, and other elements depending on the nature of the host, such as ribosome binding sites. Particularly useful promoters for procaryotic hosts include those from PKS gene clusters which result in the production of polyketides as secondary metabolites, including those from Type I or aromatic (Type II) PKS gene clusters. Examples are act promoters, tcm promoters, spiramycin promoters, tylosin promoter (e.g., tylGIp, see Rodriguez et al., “Rapid engineering of polyketide overproduction by gene transfer to industrially optimized strains” J Ind Microbiol Biotechnol. Apr. 16, 2003; and DeHoff et al., “Streptomyces fradiae tylactone synthase, starter module and modules 1-7, (tylG) gene, complete cds” Genbank Accession No. U78289), and other promoters. However, other bacterial promoters, such as those derived from sugar metabolizing enzymes, such as galactose, lactose (lac) and maltose, are also useful. Additional examples include promoters derived from biosynthetic enzymes such as for tryptophan (trp), the β-lactamase (bla), bacteriophage lambda PL, and T5. In addition, synthetic promoters, such as the tac promoter (U.S. Pat. No. 4,551,433), can be used.


[0066] As noted, particularly useful control sequences are those which themselves, or with suitable regulatory systems, activate expression during transition from growth to stationary phase in the vegetative mycelium. The system contained in the plasmid identified as pCK7, i.e., the actI/actIII promoter pair and the actII-ORF4 (an activator gene), is particularly preferred. Particularly preferred hosts are those which lack their own means for producing polyketides so that a cleaner result is obtained. Illustrative control sequences, vectors, and host cells of these types include the modified S. coelicolor CH999 and vectors described in PCT publication WO 96/40968 and similar strains of S. lividans. See U.S. Pat. Nos. 5,672,491; 5,830,750, 5,843,718; and 6,177,262, each of which is incorporated herein by reference.


[0067] Other regulatory sequences may also be desirable which allow for regulation of expression of the PKS sequences relative to the growth of the host cell. Regulatory sequences are known to those of skill in the art, and examples include those which cause the expression of a gene to be turned on or off in response to a chemical or physical stimulus, including the presence of a regulatory compound. Other types of regulatory elements may also be present in the vector, for example, enhancer sequences.


[0068] Selectable markers can also be included in the recombinant expression vectors. A variety of markers are known which are useful in selecting for transformed cell lines and generally comprise a gene whose expression confers a selectable phenotype on transformed cells when the cells are grown in an appropriate selective medium. Such markers include, for example, genes which confer antibiotic resistance or sensitivity to the plasmid. Alternatively, several polyketides are naturally colored, and this characteristic provides a built-in marker for screening cells successfully transformed by the present constructs.


[0069] The various PKS nucleotide sequences, or a mixture of such sequences, can be cloned into one or more recombinant vectors as individual cassettes, with separate control elements or under the control of a single promoter. The PKS subunits or components can include flanking restriction sites to allow for the easy deletion and insertion of other PKS subunits so that hybrid or chimeric PKSs can be generated. The design of such restriction sites is known to those of skill in the art and can be accomplished using the techniques described above, such as site-directed mutagenesis and PCR. Methods for introducing the recombinant vectors of the present invention into suitable hosts are known to those of skill in the art and typically include the use of CaCl2 or other agents, such as divalent cations, lipofection, DMSO, protoplast transformation, conjugation, and electroporation.


[0070] Expression vectors containing nucleotide sequences encoding a variety of PKS systems for the production of different polyketides can be transformed into the appropriate host cells to construct a polyketide library. In one approach, a mixture of such vectors is transformed into the selected host cells and the resulting cells plated into individual colonies and selected for successful transformants. Each individual colony has the ability to produce a particular PKS and ultimately a particular polyketide. Typically, there will be duplications in some of the colonies; the subset of the transformed colonies that contains a different PKS in each member colony can be considered the library. Alternatively, the expression vectors can be used individually to transform hosts, which transformed hosts are then assembled into a library. A variety of strategies might be devised to obtain a multiplicity of colonies each containing a PKS gene cluster derived from the naturally occurring host gene cluster so that each colony in the library produces a different PKS and ultimately a different polyketide. The number of different polyketides that are produced by the library is typically at least four, more typically at least ten, and preferably at least 20, more preferably at least 50, reflecting similar numbers of different altered PKS gene clusters and PKS gene products. The number of members in the library is arbitrarily chosen; however, the degrees of freedom outlined above with respect to the variation of starter, extender units, stereochemistry, oxidation state, and chain length is quite large. The polyketide producing colonies can be identified and isolated using known techniques and the produced polyketides further characterized. The polyketides produced by these colonies can be used collectively in a panel to represent a library or may be assessed individually for activity.


[0071] The libraries can thus be considered at four levels: (1) a multiplicity of colonies each with a different PKS encoding sequence encoding a different PKS cluster but all derived from a naturally occurring PKS cluster; (2) colonies which contain the proteins that are members of the PKS produced by the coding sequences; (3) the polyketides produced; and (4) compounds derived from the polyketides. Of course, combination libraries can also be constructed wherein members of a library derived, for example, from the erythromycin PKS can be considered as a part of the same library as those derived from, for example, the rapamycin PKS cluster.


[0072] Colonies in the library are induced to produce the relevant synthases and thus to produce the relevant polyketides to obtain a library of candidate polyketides. The polyketides secreted into the media can be screened for binding to desired targets, such as receptors, signaling proteins, and the like. The supernatants per se can be used for screening, or partial or complete purification of the polyketides can first be effected. Typically, such screening methods involve detecting the binding of each member of the library to receptor or other target ligand. Binding can be detected either directly or through a competition assay. Means to screen such libraries for binding are well known in the art. Alternatively, individual polyketide members of the library can be tested against a desired target. In this event, screens wherein the biological response of the target is measured can be included.


[0073] Thus, the present invention provides recombinant DNA molecules and vectors comprising those recombinant DNA molecules that encode all or a portion of the chalcomycin PKS and/or chalcomycin modification enzymes and that, when transformed into a host cell and the host cell is cultured under conditions that lead to the expression of said chalcomycin PKS and/or modification enzymes, results in the production of polyketides including but not limited to chalcomycin and/or analogs or derivatives thereof in useful quantities. The present invention also provides recombinant host cells comprising those recombinant vectors.


[0074] Suitable culture conditions for production of polyketides using the cells of the invention will vary according to the host cell and the nature of the polyketide being produced, but will be know to those of skill in the art. See, for example, the examples below and WO 98/27203 “Production of Polyketides in Bacteria and Yeast” and WO 01/83803 “Overproduction Hosts for Biosynthesis of Polyketides.”


[0075] The polyketide product produced by host cells of the invention can be recovered (i.e., separated from the producing cells and at least partially purified) using routine techniques (e.g., extraction from broth followed by chromatography).


[0076] The compositions, cells and methods of the invention may be directed to the preparation of an individual polyketide or a number of polyketides. The polyketide may or may not be novel, but the method of preparation permits a more convenient or alternative method of preparing it. It will be understood that the resulting polyketides may be further modified to convert them to other useful compounds. For example, an ester linkage may be added to produce a “pharmaceutically acceptable ester” (i.e., an ester that hydrolyzes under physiologically relevant conditions to produce a compound or a salt thereof). Illustrative examples of suitable ester groups include but are not limited to formates, acetates, propionates, butyrates, succinates, and ethylsuccinates.


[0077] The polyketide product produced by recombinant cells can be chemically modified in a variety of ways. For example , for example by addition of a protecting group, for example to produce prodrug forms. A variety of protecting groups are disclosed, for example, in T. H. Greene and P. G. M. Wuts, Protective Groups in Organic Synthesis, Third Edition, John Wiley & Sons, New York (1999). Prodrugs are in general functional derivatives of the compounds that are readily convertible in vivo into the required compound. Conventional procedures for the selection and preparation of suitable prodrug derivatives are described, for example, in “Design of Prodrugs,” H. Bundgaard ed., Elsevier, 1985.


[0078] Similarly, improvements in water solubility of a polyketide compound can be achieved by addition of groups containing solubilizing functionalities to the compound or by removal of hydrophobic groups from the compound, so as to decrease the lipophilicity of the compound. Typical groups containing solubilizing functionalities include, but are not limited to: 2-(dimethylaminoethyl)amino, piperidinyl, N-alkylpiperidinyl, hexahydropyranyl, furfuryl, tetrahydro furfuryl, pyrrolidinyl, N-alkylpyrrolidinyl, piperazinylamino, N-alkylpiperazinyl, morpholinyl, N-alkylaziridinylmethyl, (1-azabicyclo[1.3.0]hex-1-yl)ethyl, 2-(N-methylpyrrolidin-2-yl) ethyl, 2-(4-imidazolyl)ethyl, 2-(1-methyl-4-imidazolyl)ethyl, 2-(1-methyl-5-imidazolyl)ethyl, 2-(4-pyridyl)ethyl, and 3-(4-morpholino)-1-propyl. In the case of geldanamycin analogs, solubilizing groups can be added by reaction with amines, which results in the displacement of the 17-methoxy group by the amine (see, Schnur et al., 1995, “Inhibition of the Oncogene Product p185erbB-2 in Vitro and in Vivo by Geldanamycin and Dihydrogeldanamycin Derivatives,”, J. Med. Chem. 38, 3806-3812; Schnur et al., 1995 “erbB-2 Oncogene Inhibition by Geldanamycin Derivatives: Synthesis, Mechanism of Action, and Structure-Activity relationships,” J. Med. Chem. 38, 3813-3820; Schnur et al., “Ansamycin Derivatives as Antioncogene and Anticancer Agents,” U.S. Pat. No. 5,932,655; all of which are incorporated herein by reference). Typical amines containing solubilizing functionalities include 2-(dimethylamino)-ethylamine, 4-aminopiperidine, 4-amino-1-methylpiperidine, 4-aminohexahydropyran, furfurylamine, tetrahydrofurfurylamine, 3-(aminomethyl)-tetrahydrofuran, 2-(amino-methyl)pyrrolidone, 2-(aminomethyl)-1-methylpyrrolidine, 1-methylpiperazine, morpholine, 1-methyl-2(aminomethyl)aziridine, 1-(2-aminoethyl)-1-azabicyclo-[1.3.0]hexane, 1-(2-aminoethyl)piperazine, 4-(2-aminoethyl)morpholine, 1-(2-amino-ethyl) pyrrolidine, 2-(2-aminoethyl)pyridine, 2-fluoroethylamine, 2,2-difluoroethylamine, and the like.


[0079] In addition to post synthesis chemical or biosynthetic modifications, various polyketide forms or compositions can be produced, including but not limited to mixtures of polyketides, enantiomers, diastereomers, geometrical isomers, polymorphic crystalline forms and solvates, and combinations and mixtures thereof can be produced


[0080] Many other modifications of polyketides produced according to the invention will be apparent to those of skill, and can be accomplished using techniques of pharmaceutical chemistry.


[0081] Prior to use the PKS product (whether modified or not) can be formulated for storage, stability or administration. For example, the polyketide products can be formulated as a “pharmaceutically acceptable salt.” Suitable pharmaceutically acceptable salts of compounds include acid addition salts which may, for example, be formed by mixing a solution of the compound with a solution of a pharmaceutically acceptable acid such as hydrochloric acid, hydrobromic acid, sulfuric acid, fumaric acid, maleic acid, succinic acid, benzoic acid, acetic acid, citric acid, tartaric acid, phosphoric acid, carbonic acid, or the like. Where the compounds carry one or more acidic moieties, pharmaceutically acceptable salts may be formed by treatment of a solution of the compound with a solution of a pharmaceutically acceptable base, such as lithium hydroxide, sodium hydroxide, potassium hydroxide, tetraalkylammonium hydroxide, lithium carbonate, sodium carbonate, potassium carbonate, ammonia, alkylamines, or the like.


[0082] Prior to administration to a mammal the PKS product will be formulated as a pharmaceutical composition according to methods well known in the art, e.g., combination with a pharmaceutically acceptable carrier. The term “pharmaceutically acceptable carrier” refers to a medium that is used to prepare a desired dosage form of a compound. A pharmaceutically acceptable carrier can include one or more solvents, diluents, or other liquid vehicles; dispersion or suspension aids; surface active agents; isotonic agents; thickening or emulsifying agents; preservatives; solid binders; lubricants; and the like. Remington's Pharmaceutical Sciences, Fifteenth Edition, E. W. Martin (Mack Publishing Co., Easton, Pa., 1975) and Handbook of Pharmaceutical Excipients, Third Edition, A. H. Kibbe ed. (American Pharmaceutical Assoc. 2000), disclose various carriers used in formulating pharmaceutical compositions and known techniques for the preparation thereof.


[0083] The composition may be administered in any suitable form such as solid, semisolid, or liquid form. See Pharmaceutical Dosage Forms and Drug Delivery Systems, 5th edition, Lippicott Williams & Wilkins (1991). In an embodiment, for illustration and not limitation, the polyketide is combined in admixture with an organic or inorganic carrier or excipient suitable for external, enteral, or parenteral application. The active ingredient may be compounded, for example, with the usual non-toxic, pharmaceutically acceptable carriers for tablets, pellets, capsules, suppositories, pessaries, solutions, emulsions, suspensions, and any other form suitable for use. The carriers that can be used include water, glucose, lactose, gum acacia, gelatin, mannitol, starch paste, magnesium trisilicate, talc, corn starch, keratin, colloidal silica, potato starch, urea, and other carriers suitable for use in manufacturing preparations, in solid, semi-solid, or liquified form. In addition, auxiliary stabilizing, thickening, and coloring agents and perfumes may be used.


[0084] It will be apparent from the forgoing that the invention provides may useful compositions and methods of using them. Without intending to limit its scope, in one aspect, the invention provides a recombinant DNA molecule that encodes a polypeptide, module or domain derived from a chalcomycin polyketide synthase (PKS) gene cluster. In an embodiment, the DNA molecule (or its complement) has substantial sequence identity to SEQ ID NO:1. In an embodiment, the DNA molecule hybridizes under stringent conditions to a nucleic acid having the sequence of SEQ ID NO:1 or its complement. In a related aspect, the invention provides a recombinant DNA molecule, comprising a sequence of at least about 200, optionally at least about 500, basepairs with a sequence identical or substantially identical to a protein encoding region of SEQ ID NO:1. In an embodiment, the DNA molecule encodes a polypeptide, module or domain derived from a chalcomycin polyketide synthase (PKS) gene cluster.


[0085] In one embodiment, the recombinant DNA molecule comprises a sequence encoding at least one module of a chalcomycin polyketide synthase. In an embodiment, the recombinant DNA molecule encodes a chalcomycin polyketide synthase polypeptide selected from the group consisting of ChmGI, ChmGII, ChmGIII, ChmGIV, and ChmV.


[0086] In one aspect, the recombinant DNA molecule includes a coding sequence for a chalcomycin modifying enzyme, such as a chalcomycin P450 hydrolase enzyme selected from the group consisting of ChmHI, ChmPI, and ChmPII.


[0087] The invention also provides vector that comprise the recombinant DNA molecules of the invention. In an aspect, the invention provides a recombinant host cell comprising the vector. In a related aspect, the invention provides a recombinant host cell comprising a DNA molecule of the invention integrated into the cell chromosomal DNA.


[0088] Also provided is a chimeric PKS that comprises at least one domain of a chalcomycin PKS, and a cell containing such a chimeric PKS. In a related aspect, the invention provides a modified functional chalcomycin PKS that differs from the S. bikiniensis chalcomycin PKS by the inactivation of at least one domain of the chalcomycin PKS and/or addition of at least one domain of a non-chalcomycin PKS (for example, a loading domain, a thioesterase domain, an AT domain, a KS domain, an ACP domain, a KR domain, a DH domain, and an ER domain). The invention provides a cell comprising a modified functional PKS. The invention also provides a method to prepare an chalcomycin derivative which method comprises providing substrates including extender units to the cell.


[0089] In an aspect, the invention provides a recombinant expression system capable of producing a chalcomycin synthase domain in a host cell, said system comprising an encoding sequence for a chalcomycin polyketide synthase domain, and said encoding sequence being operably linked to control sequences effective in said cell to produce RNA that is translated into said domain, and a host cell modified to contain the recombinant expression system.


[0090] The invention provides an isolated polypeptide encoded by a recombinant chalcomycin polyketide synthase (PKS) gene, and a recombinant host cell containing or expressing such a polypeptide.


[0091] The invention also provides a recombinant S. bikiniensis cell in which a chmGI, chmGII, chmGIII, chmGIV, or chmV is disrupted so as to reduce or eliminate production of chalcomycin.


[0092] The invention also provides a recombinant DNA molecule encoding a first protein comprising one or more modules of a chalcomycin PKS and a second protein comprising one or more modules of a tylosin PKS or spiramycin PKS, optionally one or more polypeptides of a chalcomycin PKS and one or more polypeptides of a tylosin PKS or spiramycin PKS. In a related aspect, the invention provides a recombinant host cell comprising a hybrid polyketide synthase comprising one or more modules of a chalcomycin PKS and one or more modules of a tylosin PKS or spiramycin PKS.



EXAMPLES

[0093] The following examples are provided to illustrate, but are not intended to limit, the present invention.



Example 1


Isolation and Characterization of Chalcomycin PKS Cluster From Streptomyces bikiniensis NRRL 2737

[0094] Growth of organism and extraction of genomic DNA. For genomic DNA extraction, a spore stock of Streptomyces bikiniensis NRRL 2737 was used to prepare a seed culture. Spores were stored as a suspension in 25% (v/v) glycerol at −80° C. and used to inoculate 5 ml of unbuffered Trypticase Soy Broth (TSB) liquid media. The entire seed culture was transferred into 50 ml of the same growth medium in a 250 ml baffled Erlenmeyer flask and incubated with shaking for 24 h at 28° C. A 10 ml portion of the cell suspension was centrifuged (10,000×g) and the resulting pellet was washed once with 10 ml buffer 1 (Tris, 50 mM, pH7.5; 20 mM EDTA). The pellet was suspended in 3.5 ml of buffer 1 containing 150 μg/ml RNase (Sigma-Aldrich) and 1 mg/ml lysozyme. After incubation of the mixture at 37° C. for 30 min, the salt concentration was adjusted by adding 850 μl 5 M NaCl solution, then the mixture was extracted two times with phenol:chloroform:isoamylaclohol (25:24:1, vol/vol) with gentle agitation followed by centrifugation for 10 min at 3500×g. After precipitation with 1 vol of isopropanol, the genomic DNA knot was spooled on a glass rod and redissolved in 200 μl of water. This method yielded about 0.5 mg total DNA. Standard agarose gel electrophoresis using 0.7% Seakem® LE-Agarose (Bio Whiaker Molecular Applications, Rockland, Me.) at a voltage of 50 mV overnight revealed that the sample contained mainly high molecular weight DNA of 50 kb or greater.


[0095] PKS Probe design. Five degenerate PCR primers were designed (degKS1F 5′-TTCGAY SCSGVSTTCTTCGSAT-3′[SEQ ID NO:44]; degKS2F 5′-GCSATGGAYCCSCARCARCGSVT-3′[SEQ ID NO:45]; degKS3F 5′-SSCTSGTSGCSMTSCAYCWSGC-3′[SEQ ID NO:46]; degKS5R 5′-GTSCCSGTSCCRTGSSCYTCSAC-3′[SEQ ID NO:47]; degKS7R 5′-ASRTGSGCRTTSGTSCCSSWSA-3′[SEQ ID NO:48]) based on conserved regions of ketosynthase (KS) domains of type I PKS genes and codon bias of high G+C organisms. The primers were used in the following combinations: degKS1F/degKS5R, degKS2F/degKS5R and degKS3F/degKS7R. The PCR conditions for the amplification of KS domains were as follows: A total reaction volume of 50 μl contained 100 ng of S. bikiniensis total DNA, 200 pmol of each primer, 0.2 mM dNTP, 10% DMSO and 2.5 U Taq DNA polymerase (Roche Applied Science, Indianapolis, Ind.). Thirty-five cycles of PCR were performed using the following steps: denaturation (94° C.; 40 sec); annealing (55° C.; 30 sec); extension (72° C.; 60 sec), 35 cycles. The resulting PCR reactions were subjected to electrophoresis on 1% agarose gels and the PCR products of approximately 700 bp were extracted from the gels using the gel extraction kit from Quiagen (Valencia, Calif.) according to manufacturer's protocol. The fragments were treated with Pfu DNA Polymerase (Stratagene, La Jolla, Calif.) to remove the A overhangs and cloned into the plasmid vector pLitmus28 (New England Biolabs, Beverley, Mass.) cut with EcRV. Thirty-two “amplimers” (the ca. 700 bp PCR-amplified segment) for each primer pair were sequenced using standard protocols. Of the 96 inserts sequenced, 81 were found to be KS amplimers. Employing the sequence comparison program ClustalW, 15 of the 81 KS amplimers were found to be unique and were compared with the 8 KS sequences of the related tylosin PKS cluster of Streptomyces fradiae using the program ClustalW. Each KS amplimer was thus assigned to a particular KS within the putative chalcomycin PKS cluster.


[0096] Genomic library preparation. Approximately 10 μg of genomic DNA was partially digested with Sau3A1 (1 hr incubation using dilutions of the enzyme) and the digested DNA was run on an agarose gel with DNA standards. One of the conditions used was found to have generated fragments of size 35-47 kb. The DNA from this digestion was ligated with pSuperKos plasmid, a derivative of pSuperCos (Stratagene) digested with AfeI and self-ligated to eliminate the neo marker, pre-linearized with BamHI and XbaI and the ligation mixture was packaged using a Gigapack XIII (Stragene) in vitro packaging Kit and the mixture was subsequently used for infection of Escherichia coli DH5α employing protocols supplied by the manufacturer. Approximately 2000 of E. coli transductants were probed by in-situ colony hybridization with DIG labeled Sb3/7-31 (KSq), Sb1/5-75 (KS3) and and Sb1/5-78 (KS7). Plasmids from 15 colonies, which showed strong hybridization signals were isolated, digested with BamHI and subjected to Southern blotting employing the KSq or KS7 amplimers as probes. Ten plasmids showed strong hybridization with one or both amplimers. The ends of the insert in each of the 10 plasmids were sequenced using convergent primers for each (T7 promoter and T3 promoter). Two cosmids, pKOS146.185.1 and pKOS146.185.10 were found to possess high homology at one end with a segment of the PKS from the tylosin biosynthesis cluster. These two plasmids also each gave rise to DNA fragments of ca. 1 kb and 1.2 kb after BamHI-digestion.


[0097] Identification of chalcomycin biosynthetic gene cluster. Further verification that cosmids pKOS146.185.1 and pKOS146.185.10 contained the chalcomycin biosynthesis cluster was performed by PCR. Specific primer pairs were designed for the chalcomycin KSq (Sb3/7-31 forward 5′-CGTCAGCCTGATCCTCGCCGA-3′[SEQ ID NO:49]; reverse 5′-TCCAGGTGGCCGACGTTC GTC)[SEQ ID NO:50], KS3 (Sb1/5-75 forward 5′-AACGAGATCCCGCCGGG CCTC-3′[SEQ ID NO:51]; reverse 5′-ATCA CGCGTTGCTGGGCGAGG-3′[SEQ ID NO:52]) and KS7 (Sb1/5-78 forward 5′-GGACGTCTGCCGGAGG GTTCC-3′[SEQ ID NO:53]; reverse 5′-GGCCCGTTGGGCACGGACAGA-3′[SEQ ID NO:54]) amplimers and used in PCR reactions with each of the 8 KS amplimers using the following conditions: total reaction volume of 50 μl contained 20-100 ng of plasmid DNA containing an amplimer, 100 pmol of each primer, 0.2 mM dNTP,10% DMSO and 2.5 U Taq DNA polymerase. Cycle steps were as follows: denaturation (94° C.; 40 sec), annealing (55° C. for KSq and KS3 specific primers, 65° C. for KS5 specific primers; 30 sec), extension (72° C.; 60 sec), 25 cycles. Each primer set was found to amplify its cognate amplimer exclusively, with the exception of the primer set for KS7, which was also seen to give a small amount of amplification of non-cognate amplimers. Each primer set was then used for PCR with cosmids pKOS146.185.1, pKOS146.185.10 and pKOS146.185.11 employing the same conditions as described above but using cosmid DNA in place of the plasmid-containing amplimer DNA. pKOS146.185.1 gave correctly sized amplimers with KSq and KS3 primers but not with KS7 specific primers, whereas pKOS 146-185.10 gave a correctly sized amplimer with KS7 but not with KSq and KS3 specific primers, indicating that pKOS146.185.1 contained the 5′ region of the chalcomycin PKS genes.


[0098] The sequence of the insert of pKOS146.185.1 corresponds to bases 1 to 48,595 of SEQ ID No.1 and the sequence of the insert of PKOS146.185.10 corresponds to bases 44,218 to 85,915 of SEQ ID No.1. Table 2 below provides open reading frame (ORF) boundaries corresponding to the nucleotide position in SEQ ID NO:1 (Table 3) of the chalcomycin PKS as well as the nucleotide sequences encoding enzymes involved in precursor synthesis and chalcomycin modification. In addition to the ORFs listed in Table 2, SEQ ID NO:1 includes additional open reading frames of genes encoding proteins or domains thereof that may be useful in the biosynthesis of chalcomycin and/or analogs thereof in certain host cells. The various open reading frames, module-coding sequences, and domain encoding sequences shown in Table 2 and the figures are sometimes referred to as “subsequences.” Those of skill in the art will recognize, upon consideration of the sequence shown in Example 1, that the actual start locations of several of the genes could differ from the start locations shown in the table, for example due to the presence in-frame of codons utilizable by the initiator methionine tRNA in close proximity to the codon indicated as the start codon. The actual start codon can be confirmed by amino acid sequencing of the proteins expressed from the genes.
2TABLE 2Chalcomycin PKS and modifying gene cluster ORFs of SEQ ID NO: 1* “C” encoded by complement of SEQ ID NO: 1ORF boundariesNameProposed Function (homology)Strand*No. residues  1-1009Orf1Orf1 complementC 343 [SEQ ID NO: 2]integral membrane protein (homolog of S. coelicolorSCF51A.28c) 1121-1963Orf 2Orf2 complementC 280 [SEQ ID NO: 3](homolog of S. coelicolor SCF51A.29c) 3008-3889Orf3Orf 3 complementC 293 [SEQ ID NO: 4]Oxidoreductase (homolog of M. tuberculosisMLCL581.18c) 3991-5208chmCIV3,4-dehydratase, D-chalcose pathway (EryCIVC 405 [SEQ ID NO: 5]homolog) 5339-6118chmMIIID-chalcose O-methytransferase (SpnH homolog)C 259 [SEQ ID NO: 6] 6239-7696chmCVD-chalcose pathway (EryCV homolog)C 485 [SEQ ID NO: 7] 7761-10271chmRbeta-glucosidase, extracellular reactivator ofC 836 [SEQ ID NO: 8]chalcomycin (OleR, DesR, EryBI homolog)10306-11511chmPIIP450 C12, C13-epoxidase (MycG homolog)C 401 [SEQ ID NO: 9]11549-12772chmPIP450 C8-hydroxylase (OleP homolog)C 407 [SEQ ID NO: 10]12762-13610chmITEII (homolog of TEII of tylosin cluster [the predictedC 282 [SEQ ID NO: 11]product of ORF5 of GenBank accession # AF145042])13631-14602chmAIITDP-glucose 4,6-dehydrataseC 323 [SEQ ID NO: 12]14648-15562chmAITDP-glucose synthaseC 305 [SEQ ID NO: 13]15869-16459chmJ3-epimerase; D-allose pathway (TylJ homolog)C 196 [SEQ ID NO: 14]16523-17290chmMIID-mycinose 3′ OH-MT (TylF homolog)C 255 [SEQ ID NO: 15]17551-18810chmHIP450 C20-hydroxylase (TylHI homolog) 420 [SEQ ID NO: 16]18831-19052chmHIIFerredoxin (TylHII homolog) 73 [SEQ ID NO: 17]18959-20029chmDD-allose pathway 4-KR (TylD homolog) 326 [SEQ ID NO: 18]19049-20029chmDAlternate N-terminus   [SEQ ID NO: 19]20062-21273chmMID-mycinose pathway 2′ OH-MT(TylE homolog) 403 [SEQ ID NO: 20]21329-22576chmND-allose glycosyltransferase (TylN homolog) 418 [SEQ ID NO: 21]22653-23495chrBResistance determinant; 23S-rRNA N1-C 280 [SEQ ID NO: 22]methyltransferase (TlrB homolog)23622-36947chmGIPKS, Modules 0-24441 [SEQ ID NO: 23]23823-25046KSOqPKS Ketosynthase 0q loading domain25353-26396AT0PKS Acyltransferase loading domain26499-26756ACP0PKS Acyl carrier protein loading domain26808-28079KS1PKS Ketosynthase 1 domain28386-29432AT1PKS Acyl transferase 1 domain30099-30794KR1PKS Ketoreductase 1 domain30966-31220ACP1PKS Acyl carrier protein domain31296-32567KS2PKS Ketosynthase 2 domain32889-33932AT2PKS Acyl transferase 2 domain33975-34574DH2PKS Dehydrogenase 2 domain35472-36257KR2PKS Ketoreductase 2 domain36402-36659ACP2PKS Acyl carrier protein 2 domain37041-42965chmGIIPKS, Module 31974 [SEQ ID NO: 24]37143-38414KS3PKS Ketosynthase 3 domain38724-39869AT3PKS Acyl transferase 3 domain39903-40544DH3PKS Dehydrogenase 3 domain41442-42281KR3PKS Ketoreductase 3 domain42411-42668ACP3PKS Acyl carrier protein 3 domain43022-54388chmGIIIPKS, Modules 4 and 53788 [SEQ ID NO: 25]43139-44422KS4PKS Ketosynthase 4 domain44750-45796AT4PKS Acyl transferase 4 domain46436-47248KR4PKS Ketoreductase 4 domain47318-47575ACP4PKS Acyl carrier protein 4 domain47651-48925KS5PKS Ketosynthase 5 domain49226-50272AT5PKS Acyl transferase 5 domain50309-51001DH5PKS Dehydrogenase 5 domain52085-52957ER5PKS Enoylreductase 5 domain52925-53728KR5PKS Ketoreductase 5 domain54439-59277chmGIVPKS, Module 61612 [SEQ ID NO: 26]54544-55818KS6PKS Ketosynthase 6 domain56122-57168AT6PKS Acyl transferase 6 domain57844-58707KR6PKS Ketoreductase 6 domain58753-59019ACP6PKS Acyl carrier protein 6 domain59387-63439chmGVPKS, Module 7 and TE1350 [SEQ ID NO: 27]59489-60778KS7PKS Ketosynthase 7 domain61112-62209AT7PKS Acyl transferase 7 domain62276-62533ACP7PKS Acyl carrier protein 7 domain62549-63436TEThioesterase63522-64760ChmCIINDP - hexose 3,4-isomerase; D-chalcose pathway 412 [SEQ ID NO: 28]component (EryCII homolog)64804-66081ChmCIIIChalcose glycosyltransferase (EryCIII homolog) 425 [SEQ ID NO: 29]66194-66940ChmUPost PKS Ketoreductase (SimJ2, NovJ homolog) 248 [SEQ ID NO: 30]67323-68471Orf4Permease homolog (SCF6.09 homolog)C 382 [SEQ ID NO: 31]68733-70196Orf5Membrane protein homolog (SC66T3.03 homolog)C 487 [SEQ ID NO: 32]70193-70888Orf6D-alanyl-D-alanine carboxypeptidase homologC 231 [SEQ ID NO: 33](SCD6.17c homolog)71382-72542Orf7Sensory histidine kinase homolog (SCE94.10C 386 [SEQ ID NO: 34]homolog)72638-73324Orf8Two-component syst. response regulator homologC 228 [SEQ ID NO: 35](SCE94.09 homolog)73651-75081Orf9permease (xanthine/uracil permease type) (SC9G1.02,C 476 [SEQ ID NO: 36]SC9G1.04 homolog)75401-76117Orf10SC6A11.03c HomologC   [SEQ ID NO: 37]76537-78375Orf11Permease homolog (SC9G1.02, SC9G1.04 homolog) 612 [SEQ ID NO: 38]78521-79192Orf12MerR-family transcriptional regulator (SC1A4.06c 223 [SEQ ID NO: 39]homolog)79228-79983Orf13Type-II thioesterase (SanP homolog)C 251 [SEQ ID NO: 40]80489-81439Orf14Open reading frameC   [SEQ ID NO: 41]81806-82528Orf15Response regulator homolog (SCD49.02c homolog) 240 [SEQ ID NO: 42]82712-85912Orf16Open reading frame   [SEQ ID NO: 43]


[0099] Genes listed in Table 2 that encode proteins with post-PKS polyketide-modifying activities include: chmPI, chmPII, chmHI (P450 homologs), chmN, chnCIII (glycosyltransferases) and chmU (polyketide ketoreductase).


[0100] Genes listed in Table 2 that encode proteins predicted to participate in the biosynthesis of sugar residue subunits of chalcomycin or modification of sugar residues after their addition to the polyketide include: chmCIV, chmMIII, chmCV, chmAII, chmAI, chmJ, chmMII, chmD, chmMI, and chmCII. Of these, three are predicted to participate in D-chalcose residue biosynthesis (ChmCII, ChmCIV and ChmCV), two are predicted to participate in D-biosynthesis allose residue biosynthesis (ChmD and ChmJ) two are predicted to participate in conversion of the D-allose residue to D-mycinose residue after covalent linkage of the D-allose to the polyketide (ChmMI and ChmMII), two are predicted to provide precursors for both the allose and chalcose pathways (ChmAI and ChmAII), and one is predicted to O-methylate the chalcose residue (ChmMIII).


[0101] As noted above, the invention also provides inter-polypeptide linker sequences, which can be identified by the skilled reader (e.g., comprinsing the sequences between the N-terminus of the polyptide and the beginning of the first KS domain; or between the C-terminue of the polypeptide and the beginning of the last ACP domain) and polynucleotides encoding such linkers.
3TABLE 3Chalcomyin PKS cluster from Streptomyces bikiniensis NRRL 2737(SEQ ID NO:1)1GGGGCCCGCCGGACGGGGCTGCCCGGCTCTCGGCGGTGCCCGGTGGGCCGGGTGCGGGCT61CGCCCGCGGCGAGATGCTCCAGGACCTCCGCCAGTTCCCGGCAGGCGCGGCGTACCGAGC121GGCGGGTCGCACGCTGCTCCGTGATGACGGAGGCGAGAAGCAGGGCGGTCAGTGCGGCGG181AACCGTTGAACGCCTGGAGCTTGGCCATGATCTCGACGTCCGAGAGGTGGAGGAACCCTC241CCCGTCCGGCGTTCGCCTCGAAGGTGGCGAGCACGGAGGCGAAGAGCGCGCAGAGCATGC301TTCCGGTGAGCTGGAAGCGGAGCGCCGCCCAGATCAGCAGGGGGAAGACGAGGAAGAGCA361TGCCCACCGGGCTGAGCACGGCCATGGGCATGAGGATCAGGGTCGCGAGACCCAGCAGGG421CCGCCTCCTTCCAGCGCCGTACGCGGAACCGTCCGGCCGGCCCCGCGAGGACGAGGAGGA481GCGGGGCGACGAGCAGCACCCCCATCGTGTCGCCCACCCACCAGGCCAGCCAGACGGGCC541AGAACTCGGTCGTGTCCAGGGAGCTCTTCGCCACCTGCAGTCCGACCCCGGCGGTCGCGC601TGATCAGCATGGCGCCGAACCCGCCGAGGAAGACCAGGGAGAGTCCGTCCCGCAGCCGTG661CCATGTCGAGCCGGAAGCCGGCCCGTGTCAGCAGCAGGAAGGCGCAGAGCGGCGCGACGG721TGTTGCTGACCACGGTGACCACGGTGGTGGGCCCCGGCGTGGTGAGGGAGGCGATGACGA781GGAAGGAGCCGAGGGCGATCCCGGGCCAGACGCGCGCGCCGAGCAGCAGCAGGGCGGCGA841CGGCGACGCCGGTGGGAGGCCAGATGGGGGTGACCACCACGCCTTCGACGACGAGGCGGC901CCATCAGGCCGAGTCGTCCGGCCGCGTAGTAGAGCACCGCCACGGCCAGCGACATCAGCG961CCGTCGCCGCGGGGGACCGGTACTGCCGAATATCCAACACGTCTGCCATCAGACACCGAC1021CGGGTTGCCGCGCCCCTGCATGGCCGCCTGCGTGTCGGGGACACGCCGCCCGGCCCGGAC1081CGGGCCGCGGGGAGGACCGGCCCGAGGCGGGTCCCCTCGCTCACTCCCCGGTCGCCGGCC1141GGGGCGGACCGCCGTCGTGGCCGACGACGAGGACGGCCGCGTCGTCCTCGTGCCCCACCG1201TGGCCGCCAGTCGCATGACGGCGGTGGCGAGCGCGTCGACGTCCAGCCCGGCGACGGCCG1261TGATCCCGGCGAGTCTCGTCACCCGGTCGAGACCCTCGTCGATGTGGAGCGAGGGCCCCT1321CCACGACCCCGTCGGTCAGGAGGACGAAGACGCCGTCGGCGGTGAGCCGGTGGCGCGTGG1381CCGGGTAGTCGGCGCCGCGCAGCACGCCGAGAGGAGGCCCGCCCTCGCTGTCCTCGATGC1441CGGAGCGGCCGTCGGCGGTGGCCCAGATGTGGGGGATGTGGCCGGCCCGGGCGCATTCCA1501GGGTGCCGGCGGCGGGGTCGAGCCGCAGGAAGGTGCAGGTGGCGAAGAGGTCGGCGCCGA1561GGGAGACGAGCAGGTCGTTGGTGCGGCCGAGCAGCTCTCCCGGGTCCGCGGTGACGGAGG1621CCAGCGCGCGCAGTGCCACGCGGACCTGTCCCATGAAGGCGGCGGCCTCGATGTTGTGTC1681CCTGGACGTCGCCGATGGACATGCCGATCCGCCCGCCAGGCAGGGGGAAGGCGTCGTACC1741AGTCGCCGCCGACGTTGAGCCCGTGGTTGGCGGGCTCGTACCGGACGGCCAGCCGGGCAC1801CCGGAAGACTGGGCAGGTCCGAGGGCAGCATGCCGCGCTGCAGGGCCACGGCGAGCTCGA1861CACGGGACCGCTGCGTCTCGGCCAGCTCCCGCGCCCTGGCGGTCAGCGACCCGAGCCGGA1921CCAGAAGGTCCTCGCTGCTGTCGGCCGGGCGTCTGCGGAACATCGATCACTCCGACGTCA1981CGACAATCCTCGCATCACTCCGTCCCGTCTCCAGCACGCGGGACCACAGGGGACCACCCC2041GGTACGAACAGGTCCTTCCCACTGTGCCCGGAGGGGGCGGGGTCCGCATCTCATCGGCGG2101GAGAGCGCGGTGGATCCCAGGGGGCCCGCTCAGGTCACCGAAAACGAGCAAACGTTCGAT2161AATGTGGTCGCGCCGGTCTGTGCGGCCGTTCAGCGTTCGACGGTCACTGCGGCGGCCGCG2221ATGCCGTCGCGCACCAGCCACCGGCCGGTGAACCCGCCCAGCTGCCGTCCGTCGACCGTC2281GGCCCGGGCACGAGCAGCCGGGCCCTGAAACGGCCGCCCGGCTGGAAGTCGACGGCCGCC2341TCCCCGAAGTCGAGCCGGCTCCGGGTGAGCGGGAACCACGCCTTGTAGACGGCCTCCTTG2401GCGCAGAACAGCAGCCTGTCCCACGGGATGCCGGGCCGGGCCAGTGCCAGAGCGGCCAGC2461GGCCCGCGCTCATCCGCGGTCGTCACCGCCTCGAAGACGCCGTGCGGCAGCGGAAGGGCG2521GGCTCGGCGTCGATGCCCAGAGACGCCACGTCCGCCTCGGACGCCACCGTCGCGGCGCGG2581TAGCCGGCGCAGTGGGTCATGCTGCCGACGACACCGGGCGGCCATCCCGGCGCACCCCGC2641TCACCGGGCACGAGGGCCACCGGCCCCCACCCCAGACCGGCCAGCGCGCGTCGCGCGCAG2701AAGCGCACGGAGGTGAACTCCCTGCGTCGCTTGTCGACCGCGCGCCCGATCGCCTTCTCC2761TCGTCGGCGAACAGCCGCGCCTCGGGGTACCGGTCGGCATCCAGGACGTCGCCGAAGACG2821TCCACCGACACCGCGGCGGGCGGCAGCAGTGCGCAGATCAGCCCGGCCCACCGCGGCACC2881GACACGGCGGCGTCGGCCTCGGCACGGTCGTGCACGCACGCCGACGCCGCGGCGTCGGCG2941AGGGGGCCCGTGCGGCTGCCGTCCCCGGACCGCCGCGCCGAGGACGCCGCCGGCGGGAAG3001CCGTCCCTCACACGGGCACCGTGGCGTCGTCGTGCGTGCGTCCGAGCGCGTTCAGCCGGG3061CCAGCTGGCCATCGGAGAGAGCGATGCCCGACGCCGCGACGTTCTCCCTCAGATGCGCGG3121GCGAGCTGGTGCCGGGGATGGGAATGACCGCCGGCGAGCGGTGCAGCAGCCACGCGAGCG3181CCACCTGACCGGCCGACACCTCCAGCTCGGTGGCGATGTCGGCCACCGGGCTCCCCCCGG3241CGGCGAGCGCACCGCGGGCGATCGGCAGCCAGGCGATGAACGCGATCTCGTGCTTCTCGC3301AGTACTCGACCACCTCGTCGTTGCGCCGGTCGGTCAGGTTGTACACGTTCTGCACGCTGG3361CCACGGTGATGTGCTCACGGGCGGCCTCCACTTCCCGGACGGTGACCTTGGACAGCCCGA3421TGTGCCGGATCTTCCCTTCGTCCTGCAGTTCCCCGAGCGCACCGAACTGCTCGGCCGCCG3481GCACCTTCGGATCGATCCGGTGCAGCTGGAAGAGGTCGATGCGGTCGAGTCGCAGCCTGC3541GCAGGCTCAGCTCGGCCTGCTGGCGGAGGTACTCCGGTCGGCCGCACGGCACCCACTGGT3601CGGGACGGGGCCGGCACTGTCCGGCCTTCGTCGCGATCACGAGGCCGTCGGGGTACGGGC3661GCAGTGCTTCGGCCAGCAGCTCCTCGTTGCTTCCCAGCCCGTAGGAATCGGCCGTGTCGA3721TGAAGGACACACCGAGGTCCACGGCCAGCCGGGCCGTGCTGATCGCAGCCTCCCGGTCCT3781CCGGCGGGCCCCAGTACCCCGGACCGGTCAGCCGGAGGGCGCCGAAACCCAACCGGTGTA3841CCGAGAGGTCGCCTCCCAGAGGGAAGGTGGTCCTGCCGGGCTGTGCCATGCGTTCCTCCT3901GGACGACGTCCGTGCACTCGGGTGGGGCGCGTGGTGACCTCGGTGGGGGCGGGCCGATGC3961CGACCGTCCGAACACGGTGGATCCCCCGGTTCAGGGAGCCGTCGTGGAGGGCACCCGCTC4021GTCGGCCCGTCGGGTCACCTCGTGTCCCCGCTCCAGAGTGAGCCGCACGATGTCGCAGAC4081CCGACGGATGTCCTCGCGAGAGACGGTGGGGCCGGTGGGCAGGGCGAGGACCTTCCGCGC4141GAGCCGTTCGGTGTGGGGCAGGGAGACCGGCCGCTCCGAGCGGTAGGGCTCCATCTCGTG4201GCATCCGGGGGAGAAGTACCGCTGGCACATGATGTTCTCCGCCCTCAGCACCTCGTCCAG4261CAGGTCCCGGTGGACCCCGGTCACCGCGGCGTCGACCTCCACGACCAGATAGTGGTAGTT4321GTTGCGCTCGGCACGGTCGAACTCCATGACCTTCAGCCCGGCGAGCCCGGCGAGTTCCGA4381CCGGTAGTCGTCGTGGTTGGCCTCGTTCCGGCGAACCGTCTCCTCGAAGGCGTCGAGCGA4441GGTGAGTCCCATCGCGGCGGCGGCTTCGGTCATCTTCCCGTTGGTGCCGGTCTCCGTGGA4501CACCCGCCCCTGGGTGAATCCGAAGTTGTGCATCGCGCGGACCCGCTGGGCCAGCCTCTC4561GTCATCGGTCACCACCGCGCCGCCCTCGAAGGCGTTGACCACCTTGGTGGCGTGGAAGCT4621GAACACCTCGGCTTGTCCGAAGCCGCCGATCGGCTGCCCTTCCGACGTGCAGCCGAGTCC4681GTGCGCGGCATCGTAGAAGAGCCTGAGACCGTGATCGGCCGCGACCTTCGCCAGCCGGTC4741GACCGCGCAGGGCCGACCCCACAGATGGACCGGCACGACCGCACTCGTACGGGGGGTGAT4801CGCCGCCTCGATCAGATCCGGGTCCAGGCAGTTGGTGGCCGGGTCGATGTCCACGAAGAC4861CGGGGTCAGACCCAGCCAGCGGAACGCCTGGGCGGTCGCCGGGAAGGTCAGCGACGGCAT4921GATGACCTCGCCCGACAGATCGGCGGCGCGGGCCAGCAACTGGAGGGCGACGGTCGCGTT4981GCAGGTCGACACGCAGTAGCGCACGCCGGCGAGTTCCGCCACGCGCTGCTCGAACTCCCG5041GGCCAGCGGCCCGCCGTTGGTCAGCCACTGGTGGTCCAGCGCCCAGTTCACACGGTCCAG5101GAACCGCGCTCGGTCGCCCACGTTGGGCCGCCCCACGTGAAGTGGTTGCAGGAACGCCGA5161CGGGCCGCCGAACACGGCGAGATCACCGAGATTGCGTTTCATGGTCATACCTCCCCGGTG5221CACGGGACGGTGCACCCGGCAGCAGCCGGCACAGGACGTGGACGTACCGGAGGGACAGGG5281GCCGCAGCGCACGACGGCGTCTCCGTCGTGCGTACGCGAAAGGGGCGCACGGAACGGATC5341AGTGCTCGCGGCGCCAGTAGACGCCCGTCACGTCCACCGTCTCGATCGGCTCGTCGATGC5401CGTGTTCGCTCCGGTAGTCGTGGACGGCCTGCTTGCACGCCGGGATCAGGTAGTCGTCCA5461CGATCACGAACCCGCCCACGGACAGCTTGGGGTAGAGGTTGACCAGCGCGTCCCTCGTCG5521ACTCGTACAGGTCGCCGTCCACCCGCAGCACCGCGAGCCGGTCGATCGGCGCGGTCGGCA5581GCGTGTCGGCGAACATGCCGGGCAGGAACCTGACCTGCTCGTCCAGCAGGCCGTAGCGGG5641CGAAGTTCTCCCTCACCTGCTCCTCGGAACAGCTCAGTACCCAGTTCAGGTGGTGGAACT5701CCATCGCCCGGTCGAGCGGGTGGCTGTCCTCGGACGTCACCGGGACACCGGCGAACGAGT5761CGGCGAGCCACACGGTGCGGTCCTCCACACCGTGCGCCTTGAGGAGCGCGCGCATCAGGA5821TGCAGGCGCCGCCGCGCCACACGCCGGTCTCGATGAGATCACCGGGAACATCGTCCTGGA5881GGACCCTCTCCACGCACCTCTGGATGTTGTCCAGGCGTTGCAGGCCGATCATGGTGTGCG5941CGACCGTCGGCACGTCCAGTCCTTCGGCCCGGTGGTCGGCGTCGAAGGTCCCGCGCTCGA6001CGTCCGGGCCCTCCGCCCCCGGACGCATGCGGTCCGCCAGTTCCTTGTCCAGGACGTTCA6061TCCCGACCGGGAAGGTCGGGTGATCCCCGTAGATGGTGTTCGACACGACGTTCTTCATGA6121GGTCCACGTAGAGCTCGCGGGTCTGCACGGTCGACTTACCTCCGGCTAGATATCGAACAG6181AGAGAGAATGTGCGGGGCGACGGCGTCCGCGGACGCGGTGGGAGCCGTCGCGGTCGGGTC6241ACCTCAGGAAGCCGCGCGCGTCCTCCCAGCCGGCCACGACATCGGCCTCCAGCTGGTTGA6301GCCGCGCCGCCACCACCTGGTCGAACCCGTCCATGAAGTACTCGTCACCGGCGTGCGGCG6361CTATCAGACGGCCGTCGTCCACGAACCGCTCGACGACCTCCGTCAGGGAGGTGCCCGGCC6421CCACCCGGCCCGCGACGTACCGGTCCGCTCCGGTGAGATCCGGGAACCCCGCCTCCCGGT6481ACAGGTACACGTCACCCAGCAGGTCCACCTGTACGGACACCTGGGGATGCGCGGACCGGC6541GCATGGTTCCGGGCCTGATCCTGAGCAGCTCGGCGTCCGCGCCGGTCTTCAGGCTGTGCA6601ACGCGTAGCCGTAGTCGATGTTCAGCGTGGGGGTCCGCCGGCTCACCGCCTCCTCGAACG6661CGAGCAGCCCCTCCTGGAGTTCGGCGCGTTCGGCCTCGAACAGCCTGCCGTCCTCCCGGC6721CGCTGTAGTCCTCGCGCACGTTCAGGAAGTCCACCGGCCGGTCCGGCGCGGCCTCGTTCA6781GGTCGGCGATGAAGTCGACGAGGTCGAGCAGCCGGTGGACGCGCCCGGGCAGCACGATGT6841AGCTGAGCCCGAGCCGGATCGGGCTCTCCCGCTCGGACCGCAGCTGCTGGAAGCGCCGCA6901GGTTGGCGCGGACCCTTCCGAAGGCGGCGCGCTTGCCCGTGGTCTCCTCGTACTCCTCGT6961CGTTCAAGCCGTACAGCGAGGTGCGCACCGCGTGCAGGTCCCACACCCCGGGCTGCCTGT7021CCAGCGTCCGCTCGGTGAGCGCGAACGAGTTCGTGTACACGGTCGGCCGCAGCCCGCGAG7081CCGCCGCACGGCTGCTGAGCGCGCCCAGGCCCGGGTTGGTGAGGGGCTCCAGACCGCCCG7141AGAAGTACATGGCGTACGGGTTGCCGGCCGGGATCTCGTCGATGACGGACGCGAACATCG7201CGTTGCCGGACTCCAGCGCCGACGGGTCGTACCGCGCACCGGTCACCCGGACGCAGAAGT7261GGCAGCGGAACATGCAGCTCGGCCCCGGGTACAGGCCCACGACATAGGGGAAGGCCGGCT7321TGTGCGCGAGCGCCGCGTCGAAGACCCCCCGTCGCTCCAGCGGCAGCAGGGTGTTCTGCC7381AGTACTTGCCCGCCGGCCCGTTCTCCACGGCGTGGCGCAGCTCCGGGACACGGCCGAAGA7441GGCCGAGGAGCCGGCCGAAGGAGGCCCGGTCCACGTCCAGCATCCGGCGTGCCTCCTCCA7501GGGGCGTGAAGGGGTGGTTCCCGTAGCGCTCGGCCAGCCGCACGAGCCGGCGCGCGACCG7561TCGGGCCCTCGTCGACACCGACGCGGCCGTCGGAGACCAGGGCGCGGTGCACCGTGTGGA7621CCGCTGCCGGCAGGTCGGCGCCGGGGCGGGCGCACAGGGCGACCGGATCGGCCGCGGTGG7681AGGCGTGCGAGGTCATCCTTCGTGTCCAATCGTCTGCGTCCGGAGGGTCGGTCCGTTCCC7741CGGGAAACGCGGCGGGGCGGTCACCGGGACGGCACCGTGACGGATGTGGTCAGGGACGTC7801TCGGCCGCGGACGGGCCCACGTGTACGGCCCTCCTGCCGGTGCCCGGCTTCCACGTGCCG7861GCGGCGGTGTCCCAGTAGCGCAGTTGCTGCTCGTCGACGGGGACGGTGACCCGCCGCTGT7921TCTCCCGGCCGGAGGGTGACCTTGGTGAATCCGGCGAGTTTGCGCAGCGCCTGGGGGGCC7981TGGGTGTCCGGACTTGCCCCGAGGTAGACCTGGACGGTCTCCCTGCCCGCCCGGTCACCG8041GTGTTGCGGACGGTGACCGTGGCCTCCAGCCCCCGGGCGGTGCGCGCCACGGACAGGTCG8101CTGTAGGCGAAGGTGGTGTACGACAGGCCGTACCCGAACGGGAAGAGCGGGGCCACGCCG8161GTCCGGTCGTACCACCGGTAGCCGACATCCAGTCCCTCGGAATAGGTCTCCTTGCCGTCG8221ACACCCGGGTAGCGGTGCGGGTCACCGGCCATGGGATGGCCGGTCTCGGTCCCCGGGAAG8281GTCTGGGTCAGCCGGCCGCTGGGATTCGCGTCGCCGAACAGCAGGGCAGCGGTGGCCTCC8341GCGCCCTCCTGCCCCGGGTACCACATCTCCAGTACGGCGGCCGTCTCACGCAGCCAGGGC8401ATGAGCACCGAGGAACCGGTGTTGAGGACGACGATCGTGCGGGGGTTGACCTTGGTGATC8461GCCGCGATCAGCTCGTCCTGTCGGCCGGGCAGGGACAGCGAGGACCGGTCCACGCCCTCG8521GCACTGTCGTCGTGGGCGAAGACGACGGCGGTGCGGGCCTCGGCCGCCGCCGCGACGGCG8581CGGTCGAAGTCGTCCCGGGCAGCCTCGGGAGTGACCCAGCTCAGCTCCACCGACAGCGGG8641GTCTGTTCGAAGGCCCAGCCGTTCATGGTCACCGCGTGGGTGCCGCGGGTCAGCCGCATC8701GTGGTGGTGACGGTGCCGAACGCCTCGGTGCCGAGGACAGCGGACTGCCCCGCGATCTGC8761AGGTTGGCGACCCCGCCGACGGCGGTGAAGGCGATCTTGTAGTCGCCGTCGGCGGGGACG8821GTGAGCCGACCCTCGTAGAAGACGCCCTGGCCGCTGGGATCCAGTTGCTTCCCGCTCGTG8881AAGGCGGGGGTGAGCGAGCTTTCGGGCACGGGCACGCCGACCAGGTCCTCACCGGGCTCG8941TGGACGACCCGGGCCTTCTCGCCGGCGCGCCGTGCGAGGGCGTCGACGGGTGCGGTCGCC9001CGGTCGGGTATGACGTGCGCGCTGCCGTTGCCGGTGACCTTGGGGTGCTTGGCGCTGTTG9061CCGATGACGGCGATGTCCGTGGCGTCCTCACCGGTCAGGGGCAGCGTCCGGCGCTCGTTG9121CGCAGCAGCACCCCGCCGTTCTCGGCGATGGTGCGGGCCGCCGCGCGGCCGGCCTCGGGA9181TCACGCTGCGGGCGCTCGGTCGCCTTGCCGTCGAGCAGGCCGAAGCGCTCCATCTGGCCG9241AGGATGCGGACGACGGACCGATCGAGCGTCGCCTCGGGGACGCTGCCGTCCCGCACGGCG9301GCCCTGAGCGCGGAGGAGAAGTACTTCGATTCCGGCACCGGCTGCCCCAGGGTCAGCTCG9361ACGCCCAGTTCCTGGTCGAGGCCCCTGGTGATGTCGCCGGTGGCGTGGGTCGCCAGCCAG9421TCGGACACCACCCAGCCGCGGAAGTCCCACTGTTCGCGCAGCACCTCCTGCAACAGGTGC9481TCGTTCCCGCATGCGTGCGCGCCGTTGACCTTGTTGTACGAGCACATCACCGAGGCGGCG9541CCCGCCCGCACGGCGCTGCGGAAGGCCGGCAGCTCCACCTCCTGGAGCGTCTGCTCGTCG9601ACCACGGCGTCGATGGTCTCCCGCTGGTACTCCTGGTTGTTCGCGGCGAAGTGCTTGACC9661GTGGCCATCAGGCCCTGGTTCTGGATGCCGCGGACCTCGTGGGCGGCCATCCGGGACGAC9721AGCAGAGGATCCTCGCTGTAGGTCTCGAAGTTCCGCCCCGCGTGCGGCACCCGGATGACG9781TTGGTCATGGGACCGAGGACGATGTCCTGTCCGAGGGCCCGTCCCTCCCGGCCCAGGACG9841GTGCCGTACTCCTCGGCGAGGCGTTCGTCGAAGGTGGCGGCGAGCGCCACGGGGGTGGGC9901ATGGCGGTGGCGGTACCGCCGAGCAGACGGATGCCGACCGGGCCGTCGGCGGTGCGCAGC9961TCGGGGATGCCGAGCCGGGGCACGCCCGGCAGATAGCCGATGCCGGTCATGGTCGGCCCG10021CCCACGGGCCCGGTGGTCCAGTGGACGAACGAGATCTTCTCGTCGAGGGTCATCTTCGCC10081ACCAGCCCGCGCACCCGCGCCGTCCCGGGCTCGGCGGCCGGAGCACCGGCGGCGGAGGGG10141GCGGTGAGCAGACCGCCGACGCACAGGGCCGCGAGCGCGGATCCGACGGTGCGCCGGATG10201CGGCGACCCGTCCTCGTCGTGCCGAACAGCATGCTGACGGACGTCCTTTCTGCCGAGGTT10261GCCGTCCTCATCGGGGCGGGAACGTTTCTGTCCGTGCCGCCATGATCACCAGCCCACGAG10321CATCGTGCGCGGGCCCCGCACCACCATCTCCGTCTTCCACTCGACGGGCTGGGCGAGGTG10381CAGCCCGGGGAGTTCGTCGAGCAGGGTCCGCAGCGCCTCCTGGAGCTCCAGCCGCGCGAG10441GGACGCGCCGAGGCAGTGATGCGGACCGTGTCCGTACCCCAGGTGCCGGCCGGCGTCGTC10501GCGGGTGATGTCGAGCACGCCGGGCGAGCGGAAGCGCAGGGCGTCCCGGTTGGCGGCGTT10561CATCTGGACCAGCACCGGATCCCCCGCCCGCACCAGCGTCCCGCCCACCTCGACGTCCTC10621GGTGGCGTAGCGCGGGAATCCCGCCTGGCTGCCGAGCGGAACGAACCGCGTCAGCTCTTC10681CACCGCGTTGTCCAGCAGGTCAGGACGGTCCCGGAGGAGGGCCAGCTGGTCCGGCTGTTC10741GAGCAGGACCAGGACGAAGTTGCTGATCTGGCTGGCCGTGGTCTCGTGCCCGGCGAACAG10801GAGGAAGACGATCAGGTCGACCAACTCCTCCTGCGACAGCCGGCCCTGGGCGTCACGGGC10861CTCGACGAGCGCCGAGACGAGATCGTCTCGCGGCGCGGCGCGGCGGGCCGTGATCAGGTC10921CGCCAGATAGCCGGTCAGCTCGCCGGCCAGCCGCACATGGTCCTCGGCGGTGAGCGAGCT10981GGTGGACATCGCTATCTCGCACCAGCCGCGGAACAGGTCACGGTCCTCCTCGGGCACGCC11041CAGCAGGCCGCAGATGACCGCCACGGGAAGGGGGACGGCGTAGTGGTCCACGAGGTCGAC11101GGGCGATCCGAGCGCCGTCATGTCCCGGAGCAGCGACGCCGTCATCCTTCGGACGTGCGG11161CCGCATGGCCTCCACGCGGCGCGGGGTGAACGCCTTGGTGACCAGCCCGCGCAGCCTGGT11221GTGGTCCGGCGCGTCCATGCCGATGATGCCGTTGGGCGTCCGCAGCTCCCGCATGCGCGG11281CGCGTCGTTCTCCGCCTCCGCGGCGTGGCTGAACCGCCGGTCGCCGAGGACGAAACGCGC11341GTCGTCGTAACGGGAGACGAGCCAGCCCGGCTCCCCGTACGGCATGTGGACCCAGAGGAG11401TCCTTCGCTCTTCCGCGCCTGCTCGTACGCCTCGTCCAGGTCCAGTCCCTCCGGGCGGCC11461GAAGGGATAGGAGACGGGTGCGGTCTTCGTGCTGGTCAAGGGGAGCCCCATATCGCGTCG11521TTCGGGTGCTTAATCGTCGTGGGCCGGATCATGCGGCCGAGCCGCCCCAGGTGACGGGGA11581CCGCCTCGGGGGTGCGCAGGAGGGAGCCGGCCCGCCAGCGAAGCTCACTCGCGGGGACGG11641ACAGCCGGAGCTCCGGCAGGCGCTCGACCAGGGTGCTCAGCACGACCTGCAGTTCGAGCG11701GGGCCAGGGGGGCGCCCATGCAGTAGTGGGCGCCGTGACCGAAGCCGAGGTGCGGGTTCT11761GGGCACGGGCCAGGTCGAGGGAGGCGGCACCGGGAAACGCCGTGTCGTCGAGGTTCGCGG11821ACGCGATCGAGGTGATGACCGCCTCGCCCGCGCGGATGAGCGTGCCGCCGATCTCGACGT11881CCTCCAAGGCGATGCGGGCGAACCCGTCCAGGGTGCTCAGCGGGGTGAAGCGCAGCAGCT11941CCTCGACGGCGGAGGGAACCAGGTCAGGTTCGGTCACCAGGCGCTCGTAGAGGCGGCGGT12001CGTCGAGCAGGAGGAGGACGAAATTGCCGATCTGGGTGGCGGTCGTCTCGTATCCGCTGA12061TCAGCAGGCCGGAGCCCATCATGAGGAGCTCGGCCTCGGAGAGCCGGTCGCCTTCGTCAC12121GGGCCTGGACCATGGTGCTGAGGAAGTCGTCGGTGGGCCGGGCGCGGCGATCGGCCACGA12181GGTCGGCCATGTAGGCGGAGAAGTCCTCGGTCGCCCGCTGCACTTCGGCGGGCCCCAGGG12241AGGAGGACACCAGCGCCTCGGAGAAGTCCCGGAAGAGGTGGCGGTCGTCGTAGGAGACGC12301CCAGCAGTCCGCAGATCACGGAGATCGACACGGGCAGCGCCAGGTCTTCCATGACCTCGC12361CGGACGGCTCGTCGCGCGCGACCATGCCGTCGAGCAGTTCATCGGTGAACCGTGAGACCC12421GCGGGCGCAGTTCCTCCACCCGGCGGGCGGTGAACGCCTTCCCGGCGAGCCGCCGGAGCC12481TGGTGTGTCCGGGCGGGTCCATGGAGATGAGACCGCCGGCGGGCAGCGGGTACTCGGTGG12541GCCGGGGCACGTCACGTCCGGCCCCGGCCTCCATGCTGAACCGCGGGTCTCCCAGGACCG12601CCCTCACGTCGGCGTGTCGGGTCAGCAGCCAGGCCTCACCGCCGTACGGCATGCGGATCC12661GGGACACGGGCTCGTGCTCCCGCAGACGGCTGTAGAGCGGATCCAGCGCGATGCCCTCGG12721GCGGGGAGAAGGGGTACGGGCGAGTCCGGTGGACCGGACAGCTAGACGTCATGGACACTC12781TCTCGGTCGTACGGGGCGGCGGCAGAGCTCGAAGGGTGCGGGCGGCCGACGGCGAGCCCG12841GTCGCCGTCCGGATCGTCCGGCACACCGCTTCGGCCTGCTCGGTGAGGTAGAAGTGGCCT12901CCGGGGAAGGAACGGAACGCCGTCTCCCCGGTGGTCAGCCCGTGCCAGGCCAGGGCTCCG12961TCGGTGGGGACGTGCGGATCCGCGCTTCCCGTCAGCACGGTGATCGGGCAGGCGAGCGGA13021GCTCCCGGACGCCACGTGTAGGTGGCGGCGGCCCGGTAGTCGTTGCGGATCGCCGGCAGG13081GCCATCCGCAGGACCTCCTCGTCGGCCAGCACCCGGGGGTCGGTGCCTTGCAGTTCGGCC13141AGCTTGGCGACCAGCCGGTCGTCGCTCAGCAGGTGCGCGGTGGTCCGGTGGGCGACGGTC13201GGGGCGACGCGGCCGGAGACGAACAGATGCGCCAGCCTGCTCTCCGGCGCCTGTTCCGCG13261AGCCGGCGGGCGGTCTCGAAGGCCACCAGCGACCCCAGGCTGTGACCGAAGAGCGCCACC13321GGCCGGTCCAGCCAGGGGCCGAGCGCTCCGGCGACGTGCGTCACCAGAGCGTCGACGGAG13381TCCACGAACGGCTCCAGCCGCCGGTCCTGCCGCCCGGGATACTGGACGGCGAGCACCTCG13441ACCCGGTCCGGCAGCAGACAGGTGAAGGGGAGGAAGGCGCTGGCGCTGCCGCCGGCGTGC13501GGAAGGCACACGAGCCGGAGCGCGGGGTCGGCGACGGGCCGGTAGCGGCGCAGCCAGAGG13561TCGCCGGCGTTGGTGGAGCCGGGCGCCGTCGTGGCGGGCCCCTCGTTCATGCGTGATCCG13621CTTCCGGACGTCACGGTGTCAGCGCCTTCCACCACGTGCCGTTGTCCCGGTACCAGTCGA13681CGGTCTCCGCGAGCCCGCGATCGAAGGCGACGCGGGGAGCCCATCCCAGCTCCTCGGTGA13741TCTTCCGGGTGTCCACGCAGTAGCGACGGTCGTGCGCGGCCCGGTCCGGCACCCGGTCCA13801CCACGGACCAGTCGGCACCGAAGACGCCGAGAAGACGACGGGTGAGGTCGACGTTGGACA13861GTTCCGTCCCGCCGCCGATGTTGTACACCTCTCCGGCCCGCCCCCGGGCGGCGACCATGG13921CGATGCCCCGGCAGTGGTCGTCCACGTGCAGCCAGTCGCGCCGGTGGAGGCCGTCCCCGT13981ACAGGGGAAGGCGCAGGCCCTCCAGCAGATGGGTCACGAAGAGCGGGACGACCTTCTCGG14041GGTGCTGGTGGGGGCCGTAGTTGTTGGAGCAGCGCGTCACCCGGACGTCGAGGCCGTGGG14101TACGGTGGAAGGCGAGCGCCAGGAGGTCGGAGGAGGCCTTGGACGCCGCGTAGGGGGAGT14161TCGGGTCCAGCGGATCGGCCTCGGTGGAGGAGCCCTCGGGGATCGATCCGTACACCTCGT14221CGGTGGAGACGTGGACGAAGCGGCTGACGCCCGCTTCGAGCGCCGACCGCAGCAGGGTCT14281GCGTCCCCAGCACGTTGGTGGTGACGAAGGCGGCGGAGTCCGTGATCGAGCGGTCCACGT14341GCGACTCGGCGGCGAAGTGCACCACCAGGTCGGTGCCGCGCAGGGCCTGGGCCACCGTGG14401GCGGGTCGCAGATGTCGCCGTGCAGGAAGGTGTGACGGGGGTGGCCGGCGACCTCGGCGA14461GGTTCTCGGTGTTCCCGGCGTACGTGAGCTTGTCCAGCGAGAGGACGTGGGCGCCGGCCA14521GCTCCGGATAGGAGCCCGACAGGAGCTGTCTGACGAAGTGCGAGCCGATGAAGCCCGCTG14581CCCCGGTCACCAGGACGCGCATCCTTGATCCGCCTTTCTGCCGTGCTGTGTGGGGGAGGT14641GTGGTCAACCGGTGCGCGTCGCCCCGAGTCCCGAGGCGACCTCCATGAGGTAGTCCCCGT14701AGCTGGAGGCGCTCAGCTCCTCACCCAGCCGGTAACAGGTGTCCGCGTCGATGAAGCCCA14761TGCGCAGCGCGATCTCCTCGAGACAGGCGATGCGTACGCCCTGGCGCTGCTCGAGGAGCT14821GGACGTACTGACCCGCCTGGAGCAGGGAGTCGTGGGTTCCCATGTCGAGCCAGGCGAATC14881CACGTCCGAGCTGGGTCAGCCGGGCGCGGCCCTGCTCCAGATAGGACAGGTTGATGTCGG14941TGATCTCCAACTCGCCTCGGGCTGAGGGCTTGAGGTCCTTCGCCAGCTCCACCACGTCGT15001TGTCGTAGAAGTACAGCCCGGTGACGGCGAGGTTGGAGCGAGGACGGCTCGGCTTCTCCT15061CCAGGGACAACAGGTGCCCCTGCTCGTCGATCTCGGCGACGCCGTAGCGCTCGGGCGACT15121TGGACGGATAGCCGAACAGTTCACATCCGTCGAGACGTCGCAGGGAGTGCTTCAGGACGG15181TGGAGAATCCGGGGCCGTGGAAGACGTTGTCGCCCAGAATGAGCGCCACCCGGTCGTTCC15241CGATGTGCTTGTCGCCGACCAGGAACGCGTCGGCGACACCGCGCGGCTCGTCCTGGACCG15301CGTAGTCGAGCTGGATACCGAGGCGCGAGCCGTCGCCGAGCATGACCTGGAACGTCTCCA15361CGTGCTGGTGCGAGGAGATGATGAGGATGTCCTGGATTCCCGCCAGCATCAGCACCGACA15421GCGGATAATAGATCATGGGTTTGTCATAAACGGGCAACTGCTGTTTGGAAAGCGCACCGG15481TCAGCGGCTGCAGTCGAGTGGCGCTTCCCCCGGCGAGGATTATCCCCCTCACTCCGGGGC15541GTTTCAGCGGTGTCTCGAACACGGTTGGTCCTCCGTGGTCACATGGCCGATATGGGGGGT15601GAAGACACTGTCCTGAGAGGCCGGCGGACCGGCTGTCGCCTCGCGGACACAGCGGCTTAA15661TGCATTCACCCCGCCCCGGGACCGTCATCCGAGAAGAAGGAATGCGGTGTCGTGGGAACC15721GACGTCCAGGAGTTCCTTTCGGGCCGCGGAACGGCGGCGCGGAGATTCTGAACCGCGGGG15781GATTCCAGGGCGGTGGCAGGGAAGGGAACCACCGCCGCGCCATCTCTCCCGGAACGTTCC15841GCAAGCGGCGGGCCGTGCTTCGGACGGCCTATCTCTGCGCCTGTTGCTGTTCCTGCCAGG15901CCTGATAGGTCGGCAGCAGACCGAGCCGTTCGGCCGTCGCGAGGGTCGGGGCGTTCTCGT15961CCCTGTCGGAGAGCAGCGGCTCGATGTCGTCCGGCCACGCGATTCCGAGGTCGGGGTCGA16021GCGGATTGACGGAGTGCTCGCGTGCGGGAGCGTATCCGGAGGAGCAGAGGTAGACGAGCG16081TGGCGTCGTCGGTGAGGGAGAGGAAGGCACGGCCCAGTCCCGCGGTCAGATAGACGGCGG16141TGTTGCGTTCCGCGTCCATGGGCACGATCTCCCAGCGCCCGAAGGTCGGCGAACCGATGC16201GGACGTCCACGACGACGTCGAGGCCGGCTCCCCGCACGCACACGCTGTACTTGGCCTGAC16261CTGGCGGGATCTCGGTGTAGTGGATCCCGCGCAGCGCGCCGCGGTGCGACACCGCGACAT16321TGACCTGGGCCACCGGGAAGTCATGGCCGAACGCCTGACGGAAGCTCTCGCCCCGGAACC16381ACTCGTGGGACCTCCCGCGGTGGTCGGAGTGGATGACGGGTTCCTGCGACCAGGCCCCCT16441CGATGCTGAGTGGATGCATCGCGCCTTCTCCTTCGGACCGATGGGTGGGGTGCGGGGCGG16501GCCGGGGGCACGGCCGAGCCGGTCAGCTGGAGCGTCTCCAGTAGGAACCCTGGCGATCGA16561TCTGGTGGATCTCGTCCGTGATGCCGTGCTCGTCCCGGAACTCGTGGACGGCCTGCCGGC16621ACGCCGGAATGCAGTAGTCGTCGACGATGACGTACCCGCCGTCCGACACCTTGTGGTAGA16681GGTTGGTGAGAACCTCCCTGGTGGCTGCGTACGAGTCCCCGTCGAGCCTCAGCACCGCGA16741GCCTCTCGATGGGCGCGGTGGGCATCGTGTCCTTGAACCAGCCCGGGAGGAAGCGGACCT16801GGTCGTCCAGGAGTCCGTAGCGCGCGAAGTTCCCCTTCACGGTCTCCACGTCGACCGGGA16861TGCTGAGGACGTCGTTGTACTGGCCGAGGTCGATGTCGACGTCCAGCTGGTGGTCGTCCT16921CGGTGGTCTTGGGGAAACCCTGGAAGGAGTCCGCGACCCACACCTTCCGGTCCCGCACGC16981CGTGCGCCCGGAAGACGCCGCGGGCGAAGATGCAGGCCCCGCCCCGCCAGACCCCGGTCT17041CCGCGAAGTCCCCCGGCACGCCGTCGCGCAGCACGTCCTCCAGGCACTTCTGCAGGTTGT17101CGAGCCGCTTGAGACCGACCATCGAGTGCGCCACGCGCGGAAAGTCCTCACCCACCGAGC17161GCAGTTCCGCGGAATACGAGCTGCTGGTGATGAGGCCGGCGACATTGGTCTGGTCCTCGT17221AAATCATGTTCGTCACGACCTTCTTCAGAAGGTCGAGATACAGGTCCGCTTCGGCAGCTA17281TGACAGTCATTTTCCTCACTTACGGGTAGCAGTGCCCAGCGGGCGGCTCGTTCAGGACGG17341GGGCCGCCGGGGCTGAATTCCCTGTGTCCACACAGATGAGGTGGATGAGGTGGATGAGGT17401AGCCATCTAACCCCAGTGATCAGATTCGGGCAAGGGTCGAAAACGAGCCACGTCTTATGT17461CGATCCTGTCCGGAAGCGAGGGGCATATGGTGCAGTGGCGACTGCGGCCGATCTGGCTGA17521TCCTTGCTGCGGCGCTGACCGTGTGCTTGCATGTTGGACGTAGATCACCTTCTCCCGATT17581GCATTCAGGGTGAGGAAATCCATGAAATCTTCAAAAGTCGTTCACAGCCGTCCTGCGGAA17641GCGGGCGTCGCATGGCCCGTCGCGCGAACCTGCCCCTTTACGCTCCCTGATCAGTACGCA17701GAGAAGCGCAAGAACGAGCCCATATGCCGGGCTCAGGTCTGGGACGACTCCAGAACCTGG17761CTCATCACCAAGCACGAGCACGTACGAGCCCTTCTCGCCGACCCCCGGGTCACCGTCGAC17821CCGGCCAAGCTGCCCAGGCTCTCCCCCTCCGACGGCGACGGCGGCGGCTTCCGGTCCCTG17881CTGACCATGGACCCCCCGGACCACAACGCCCTCCGCCGCATGCTCATATCCGAGTTCAGC17941GTGCACCGGGTCCGGGAGATGCGCCCGGGCATCGAGCGCACCGTGCACGGGCTGCTGGAC18001GGGATCCTCGAACGGCGGGGGCCGGTGGACCTGGTGGCCGAACTCGCGCTGCCGATGTCC18061ACCCTGGTGATCTGTCAGCTCCTCGGAGTGCCCTACGAAGACCGCGAGTTCTTCCAGGAA18121CGCAGCGAACAGGCCACCCGGGTGGGCGGGAGCCAGGAATCGCTGACCGCGCTCCTGGAA18181CTACGGGACTACCTGGACCGGTTGGTCACCGCGAAGATCGAGACGCCGGGTGACGACCTG18241CTGTGCCGGCTCATCGCCAGTCGACTGCACACCGGTGAGATGCGACACGCCGAGATCGTG18301GACAACGCCGTGCTGCTGCTCGCCGCCGGCCACGAGACCAGTGCCGCCATGGTGGCACTG18361GGCATCCTGACACTGTTGCGGCACCCCGGCGCCCTGGCGGAGTTGCGGGGCGACGGTACG18421CTGATGCCGCAGACGGTCGACGAACTCCTGCGTTTCCACTCCATCGCGGACGGCCTTCGA18481CGGGCGGTCACGGAGGACATCGAACTCGGCGGCATCACGCTGCGCGCCCCAGACGGCCTC18541ATCGTCTCGCTGGCCTCCGCCAACCGCGACGAGAGCGCCTTCGCCTCCCCGGACGGCTTC18601GACCCGCACCATCCGGCGAGCCGGCACGTCGCCTTCGGCTACGGCCCCCACCAGTGCCTG18661GGCCAGAACCTGGCCCGGCTGGAGCTGGAGGTCACCCTGGGCGCGGTGGTGGAGCGCATT18721CCCACGCTCAGGCTGGCCGGCGACGCCGACGCACTGCGCGTCAAACAGGATTCGACCATC18781TTCGGGCTGCACGAGCTGCCGGTCGAGTGGTGACGGAAGGAGGACACAGCGTGCGGGTGA18841CAGTCGACCAGAGCCGGTGCCTGGGAGCCGGCCAGTGCGAGCAGCTGGCGCCGGAGGTCT18901TCCGCCAGGACGAGGAAGGACTGAGCCGGGTCCTCGTCCCCGAGCCCGATCCGGCGTCAT18961GGCCGCGGGTGCTCCAGACGGTGGACCTCTGCCCCGTACAGGCCGTCCTCATCGACGAGG19021GCCCCGGTCCCGCGCCGCAGGACACCAAGTGACCGCTGACCGCTGGGCCGGCCGCACGGT19081GCTCGTCACGGGAGCACTCGGGTTCATCGGCTCCCACTTCGTCCGACAGCTGGAGGCGCG19141CGGAGCCGAGGTGCTCGCCCTCTACCGCACCGAACGGCCGCAATTACAGGCCGAGTTGGC19201CGCGCTCGACCGAGTACGCCTGATCAGGACGGAGCTGCGGGACGAGTCGGACGTGCGAGG19261GGCCTTCAAGTACCTGGCACCCTCCATCGACACCGTCGTCCACTGCGCGGCCATGGACGG19321CAACGCGCAGTTCAAGCTGGAGCGCTCGGCCGAGATCCTCGACAGCAACCAGCGGACCAT19381CTCCCACCTGCTGAACTGCGTACGGGACTTCGGCGTCGGCGAGGCCGTGGTCATGAGCTC19441CTCCGAGCTGTACTGCGCGCCGCCCACCGCGGCGGCCCACGAGGACGACGACTTCCGCCG19501ATCCATGCGGTACACGGACAACGGCTACGTCCTGTCCAAGACCTACGGCGAGATCCTGGC19561CAGGCTCCACCGCGAGCAGTTCGGCACCAACGTCTTCCTGGTGCGACCGGGCAACGTCTA19621CGGGCCGGGAGACGGCTACGACCCCTCCCGGGGCCGGGTGATCCCCAGCATGCTGGCCAA19681GGCCGACGCCGGCGAGGAGATAGAGATCTGGGGGGACGGCAGTCAGACCCGGTCCTTCAT19741CCACGTCACCGACCTGGTACGGGCCTCACTGCGCCTGCTGGAGACCGGCAAGTACCCCGA19801GATGAACGTGGCCGGCGCGGAACAGGTCTCCATCCTGGAGCTCGCCCGGATGGTGATGGC19861CGTCCTGGGACGGCCCGAGCGCATCCGCCTCGACCCCGGCCGCCCCGTCGGCGCCCCGAG19921CAGACTTCTGGATCTGACCAGGATGTCGGAAGTGATCGACTTCGAGCCCCAGCCCCTGCG19981GACCGGGCTGGAAGAGACCGCTCGCTGGTTCCGCCACCACACGCGCTGAACCTCCTCTCA20041TACCCCCCTGGAAGGTAACTCGTGGTCACACACGCCCCGAACTCGCTGATCAGTGACATA20101ATCCGCGCCTCCGGCGGGCATGACGCCGACCTCAAGGACCTGGCCGCCCGACACGATCCG20161GCCGACATCGTCCGCGTACTCCTGGACGAGATCACCTCACGCTGCCCCGCTCCCGTGAAC20221GACGTCCCCGTCCTCGTCGAGCTGGCCGTCCGGGCGGGAGACCGCCTCTTCCCCACCTAT20281CTGTACGTCCTCAAGGGCGGCCCGGTGCGCCTCGCGGCCAAGGACGAGGCGTTCGTCGCC20341ATGCGCGTCGAGTACGAGCTGGGCGAACTGGCCCGCGAGCTGTTCGGACCGGTGCGGGAG20401AACGTCACCGGCGTCCGCGGAACGACTCTCTTCCCCTACGTCGGGGACACGGCGTCGGAA20461GGCGAGGAGGACTCGGGTGCCGAGCACATCGGCACGCACTTCCTGGCCGCGCAGCAGGGC20521TCCCAGACCGTGCTCGCCGGCTGCCATTCCCACAAGCCGGACCTCAGCGAGCTCTCCTCG20581CGCTACCTCACCCCGAAGTGGGGCTCGCTGCACTGGTTCACCCCCCACTACGACCGCCAC20641TTCAGGAGCTACCGGGACCAGCCCGTACGCGTTCTGGAGATCGGCATCGGCGGCTACAAG20701CACCCCGAGTGGGGCGGCGGCTCCCTGCGCATGTGGAAGCACTTCTTCCATCGCGGCGAG20761ATCTACGGCCTGGACATCGTCGACAAGTCGCACTTCGACGCGCCGCGCATCACGACCCTG20821CGCGGCGACCAGAGCGACCCCGACCACCTGCGGTCGATCGCCGAGAAGTACGGACCGTTC20881GACATCGTCATCGACGACGGAAGCCACATCAACGACCACATCCGGACCTCGTTCCAGGCA20941CTGTTCCCGCATGTGCGGCCCGGCGGCCTCTACGTGATCGAGGACCTGTGGACCGCCTAC21001TGGTCCGGCTTCGGCGGCGACGAGGACCCGAAGCGGTACAGCGGGACGAGCCTGGGCCTG21061CTCAAGTCCCTCGTCGACTCGATCCAGCACGAGGAACTGCCGGAGGCCGGCGACCACCGT21121CCCAGTTACGCGGACCAGCACGTGGTCGGCATGCACCTCTACCACAACCTGGCGTTCATC21181GAGAAGGGCACCAACGCCGAGGGCGGCATCCCCCCGTGGATCCCACGCGACTTCGAGACC21241CTCGTCGCCGTCTCCTCCGGGGGCCACGCATGAGGAGCCGTCGGCACCAGCCACCCGAAC21301ACACCCGACCGGACCGCAGGAGGCCCGCATGCGCGTGACCCTGCTGAGCGTCGGATCCCG21361AGGCGACGTCCAGCCGTTCGTCGCCCTCGGCATCGGCCTCAAGGCCCGCGGCCACGACGT21421CACCCTGGCCGCCCCCGCCACGCTGCGGCCACTGGTCGAGCGCGCGGGACTGACGTACAG21481GCTGTCCCCCGGGGATCCCGACGGATTCTTCACCATGCCCGAGGTCGTCGAAGCGCTGCG21541GCGCGGCCCCTCGTTCAAGAACATGCTCGCGGGGATGCCCGAGGCGCCCGAGAGCTACAC21601ACAGCAGGTCGTCGACGCGATCCACGACGCCGCCGAGGGCGCCGACCTCATAGTGAACGC21661GCCCCTCACCCTGGCCGCCGCGTACGGGCACCCGCCCGCCCCGTGGGCCTCGGTGTCCTG21721GTGGCCCAACAGCATGACCTCGGCCTTCCCGGCCGTCGAATCCGGGCAGCGCCACCTCGG21781ACCGCTGACGTCCCTGTACAACCGCTACACCCATCGCAGGGCGGCACGCGACGAGTGGGA21841GTGGCGACGCCCCGAGATCGACGGCTACCGCCGACGCCTCGGCCTCCGGCCCTTCGGCGA21901CGAGTCCCCGTTCCTCCGACTGGGGCACGACCGCCCGTACCTGTTCCCCTTCAGCCCCAG21961CGTGCTGCCCAAGCCGCGGGACTGGCCGCGCCAGAGCCACGTCACCGGCTACTGGTTCTG22021GGACCAGCACGGGGAGCCGCCCGCCGAGCTGGAGTCGTTCCTGGAGGACGGGGAGCCCCC22081GGTGGCGCTCACCTTCGGCAGCACCTGGTCACTCCACCGGCAGGAGGAGGCCCTCCAGGC22141CGCCCTCGACGCCGTCCGTGGCGTCGGACGCCGACTGGTCATGGTCGACGGACCGGACAG22201CGACCTGCCCGACGACGTGCTCCGCCTGCACCAGGTGGACTACGCCACCCTCTTCCCCAG22261GATGGCCGCGGTGATCCACCACGGCGGCGCCGGCACGACCGCCGAGGTCCTCCGGGCCGG22321AGTGCCCCAGGTCATCGTGCCGGTCTTCGCCGATCACCCCTTCTGGGCGGCTCGACTGTC22381CAGAACAGGCGTCGCCGCCCGGCCGGTCCCCTTCGCCCGCTTCAGCCGAGAGGCACTGGC22441GCAGAGCGTCCGCCAGGCGGTCACCGATCCCGCGATGGCGGGCCGGGCCAGGCGACTGGG22501CGAACGGGTCTCCAAGGAACGGGGAGTGGACACCGCCTGCGACATCCTCGAGAAGTGGGC22561GGAGACGGCACGCGCCACGGCCTGACACGGCCACCGGCGGGCGGGCCCGGAAGCCGCACG22621CCCGCCGGCCGACGGGTCCCGGGACCGCGCCGCTACGCCGACAACCGGTAGGCGGAGAGC22681CGCACGGAGAGCGTGACCCGAGTCGGCGCCGGCAGCCGCTGGATCGTCTCCAGATCCCGC22741TGCGCGTCACGGTGCCAGGCATTGGGCCCCATGCCGACCAGGTGCGCCAGAGCCTGGTGG22801TCGAGAGCCATGGTCCGGGTCAACTCCTCCGTGGCGACGGCCGAGAAGTGCGGAGCGAGC22861TGCTCCGCGAGACGCGACTCCTTGCCTTCGTCCACCTGCAGCAGGCCGAGGGCGCCGATC22921ACCTCCCGCAGGTGATCGGGCAGAGGAGTGACAACCAGGAGAACGCCACGGGGATGGAGA22981ACGCGATGCAGTTCAGGACCGTTGCGGGGAGCGAACGTGTTGATCACCATGCCGGCTGCG23041GCATCCCGCAGCGGAAGCGTCTGCCAGGCGTCGGTCACCGCGGCCGCGATCCGCGGATGC23101GCCTTCGCGGCACGCCGCACGGCGTACTTGGAGATGTCCAGCAGCAGGCCCTGGGCATCG23161GGGAACGCTTCCATGACCCCGGCGTGATAGTGGCCCGTCCCCCCACCGATGTCGACCACA23221CAGCCGGGCACGGCCGGATCGGCCGTCCGCCGCGCCAGATCGACCAGCGCATCCATCACG23281GGGTCGTAGTGACCCGCCGACAGGAATGCGTCCCGGGCCTCCACCATTTCCTTGGTGTCG23341GCCCGCAGCTTCGTCGCCCTGGGAAGCAGATTCACATAACCCTGCTTCGCGATGTCGAAG23401GAGTGTCCGGCGGGGCAGAAAAGTGCGCGGTCGCCCTGAGCCAGCGAAGCACCGCAGTGC23461GGGCAGGCGAGGTAGCGCACGATCCTGTTGAGCATGGGGCACCGTTCCTTCGGGCGAGTC23521GGCAGACCGGCCAACCCTAACGCAGGCGCCCGGCCGCCCACCGCCCGGCGCAGGGCCGAC23581GAACCACCGTGACGTGCACCAGCCGCGCGTGAGAACTCCTCATGCGCGCACCCTACGGAA23641ATCGGCAGGTCAACCGGCGATTCCTGCGGGAATTCCGAGCGAAACGGCCTCACTGTGTTT23701CCCCGCTGCATTTCCTCGCTGAATTCAGCGAATCCCGGCAGACGACCGGCTCTGCCGGCG23761TGACAGCCCCTATCGATCGACCAGGAGTTTCGATGGCCCCGAAGAGTGGTGCGCAGCGTT23821CGAGCGACATAGCCGTCGTCGGCATGTCCTGCCGCCTTCCGGGGGCACCGGGCATCGATG23881AGTTCTGGCATCTGCTGACCACCGGAGGCAGCGCGATCGAGCGTCGCGCCGACGGCACCT23941GGCGCGGCTCCCTGGACGGAGCCGCCGACTTCGACGCCGCCTTCTTCGACATGACCCCCC24001GCCAGGCCGCCGCCGCCGACCCGCAGCAACGACTCATGCTGGAACTCGGCTGGACGGCCC24061TGGAGAACGCCGGGATCGTCCCCGGCAGCCTCGCCGGCACGGACACCGGCGTCTTCGTCG24121GCATCGCGGCCGACGACTACGCGGCACTCCTGCACCGGTCCGCCACCCCCGTCAGCGGGC24181ACACCGCGACGGGCCTGAGCCGGGGCATGGCCGCCAACCGCCTCTCCTACCTCCTGGGCC24241TGCGCGGTCCCAGCCTCGCGGTGGACAGCGCGCAGTCCTCCTCGCTCGTCGCGGTCCACC24301TGGCCTGCGAGAGCCTGCGCCGCGGCGAGTCCGACCTCGCGATCGTCGGCGGCGTCAGCC24361TGATCCTCGCCGAGGACAGCACGGCGGGCATGGAGCTCATGGGCGCGCTCTCGCCGGACG24421GCCGCTGCCACACCTTCGACGCACGCGCCAACGGCTACGTACGCGGTGAGGGCGGAGCCT24481GCGTCGTCCTCAAGCCCCTGGAGCGGGCACTGGCCGACGGGGACCGCGTCCACTGCGTCG24541TCCGAGGAAGCGCGGTCAACAACGACGGCGGCGGCTCCACCCTGACCACCCCCCACCGCG24601AGGCCCAGGCCGCCGTCCTGCGGGCGGCGTACGAACGGGCCGGGGTCGGCCCGGACCAGG24661TGTCCTACGTCGAACTGCACGGTACGGGGACGCCGGTCGGCGACCCCGTCGAGGCGGCGG24721CTCTCGGCGCGGTCCTCGGCACGGCCCACGGCCGTAACGCCCCGCTGTCCGTGGGATCGG24781TCAAGACGAACGTCGGCCACCTGGAGGCGGCCGCGGGCCTCGTGGGATTCGTGAAGGCAG24841CCCTGTGCGTCCGCGAGGGCGTGGTGCCGCCGAGCCTGAACCACGCGACGCCCAACCCTG24901CCATCCCCATGGACCGGCTAAACCTGCGCGTACCCACGCGACTGGAGCCCTGGCCGCACC24961CGGACGACCGAGCGACCGGCCGGCTGCGACTGGCCGGCGTCTCGTCCTTCGGCATGGGTG25021GGACGAACGCGCATGTGGTGGTGGAGGAGGCGCCGCTTCCGGAGGCCGGGGAGCCGGTCG25081GGGCCGGTGTGCCTTTGGCTGTGGTGCCGGTGGTGGTGTCGGGTCGTTCTGCGGGTGCGG25141TGGCTGAACTGGCCTCCCGCTTGAACGAGTCGGTTCGTTCGGATCGGTTGGTGGATGTGG25201GGTTGTCGTCGGTGGTGTCGCGGTCGGTGTTCGAGCATCGGTCCGTGGTTCTGGCGGGGG25261ACTCTGCCGAGCTGAGTGCCGGTTTGGATGCTCTGGCCGCTGACGGAGTGTCTCCTGTCC25321TGGTGTCGGGTGTGGCGTCGGTCGGGGGTGGCCGGTCGGTCTTCGTGTTCCCGGGTGCGG25381GGGTGAAGTGGGCGGGGATGGCGCTCGGGTTGTGGGCGGAGTCTGCTGTGTTTGCGGAGT25441CGATGGCGCGGTGTGAGGCGGCGTTCGCCGGGTTGGTGGAGTGGCGTCTGGCGGATGTGC25501TGGGTGATGGGGCTGCGTTGGAGCGTGAGGACGTGGTGCAGCCGGCGTCGTTCGCGGTGA25561TGGTGTCGTTGGCGGCGTTGTGGCGGTCGTTGGGTGTGGTGCCGGATGCGGTGGTGGGGC25621ATTCGCAGGGGGAGATCGCTGCTGCGGTGGTGGCGGGTGGTCTGTCGTTGGAGGACGGCG25681CGCGTGTGGTGGTGTTGCGTGCGCGGGTGGCTGAGGAGGTTTTGTCGGGTGGGGGGATTG25741CTTCGGTGCGTCTTTCGCGGGCCGAGGTGGAGGAGCGGTTGGCGGGTGGGGGTGGTGGGT25801TGAGTGTGGCGGTGGTGAATGCGCCGTCGTCGACGGTGGTGGCGGGTGAGTTGGGGGATT25861TGGATCGGTTTGTTGCGGCGTGTGAGGCGGAGGGGGTGCGGGCGCGTCGGCTGGAGTTCG25921GGTATGCGTCGCATTCGAGGTTCGTGGAGCCGGTGCGTGAGCGGTTGTTGGAGGGGTTGG25981CCGATGTCCGTCCGGTGAGGGGGCGGATTCCGTTCTATTCGACGGTGGAGGCTGCGGAGT26041TCGATACGGCTGGTCTGGATGCGGAGTACTGGTTCGGGAATCTGCGTCGGCCGGTTCGCT26101TCCAGGAGACGGTCGAGCGGCTGTTGGCGGATGGTTTCCGGGTGTTCGTGGAGTGCGGCG26161CGCATCCGGTGCTGACCGGGGCGGTGCAGGAGACCGCGGAGACTGCGGGCCGGGAGATCT26221GCTCCGTCGGATCCCTGCGTCGTGACGAGGGTGGACTGCGTCGCTTCCTGACCTCTGCGG26281CGGAGGCGTTCGTCCAGGGGGTGGAGGTGTCCTGGCCGGTGCTGTTCGACGGCACCGGCG26341CCCGGACGGTCGACCTGCCCACCTACCCCTTCCAACGCCGGCACCACTGGGCACCCGACG26401GCTCCGCGTCGGCGGCGCCCACACGGGACATCCGACCGGACGAGACCGCCGCGGTTCCAG26461CGGACACGATGGACCTCGCGGGACAACTTCGCGCGGACGTGGCGTCGTTGCCCACCACCG26521AGCAGATCGCGCGGTTGCTCGACCAGGTACGCGACGGTGTCGCCACGGTCCTCGGACTGG26581ACGCCCGGGACGAGGTGCGCGCGGAGGCGACGTTCAAGGAACTGGGCGTCGAATCGCTGA26641CCGGCGTCGAGCTGAAGAACCACCTCCGTGCCCGGACCGGACTGCACGTCCCCACCTCGC26701TGATCTACGACTGCCCGACTCCCCTCGCCGCCGCTCACTACCTCCGCGACGAGCTCTTGG26761GCCGACCCGCGGAGCAGGCCGTCGTCCCCGCCGGCATCCCGGTCGACGAACCGATCGCGA26821TCGTGGGGATGGGGTGCCGTCTGCCGGGTGGGGTGTCGTCGCCGGAGGGGTTGTGGGATC26881TGGTGGCGTCGGGGGTGGATGCGGTTTCTCCGTTCCCCACGGATCGGGGTTGGGATGTGG26941GGGGTCTGTTCGATCCGGAGCCGGGGGTGCCGGGGCGTTCGTATGTGCGTGAGGGCGGGT27001TCCTTCATGAGGCGGGGGAGTTCGACGCGGGGTTCTTCGGTATCTCTCCGCGTGAGGCGT27061TGGCGATGGATCCGCAGCAGCGGTTGTTGCTGGAGACGTCGTGGGAGGCGTTGGAGCGGG27121CGGGCATCGATCCGCACACGCTTCGCGGTTCACGGACCGGCGTCTACGCCGGAGTGATGG27181CCCAGGAATACGGCCCCCGACTCCACGAAGGCGCAGACGGATACGAGGGCTATCTGCTGA27241CCGGATCCTCCAGCAGTGTCGCCTCGGGTCGTATCTCGTACGTGCTGGGTCTGGAAGGGC27301CGGCGGTGACGGTGGACACCGCGTGCTCGTCGTCGCTGGTCGCGCTGCACCTGGCCGTGC27361GGGCGCTGCGCAGCGGTGAGTGCGACCTCGCCCTCGCCGGCGGCGCGACCGTCATGGCCG27421AACCCGGCATGTTCGTGGAGTTCTCACGGCAGCGCGGGCTGTCTGCCCACGGACGGTGCA27481AGGCGTACTCGGACTCGGCCGATGGCACGGGCTGGGCCGAGGGGGCGGGTGTGCTGCTCG27541TCGAGCGGTTGTCGGATGCGGTACGTCATGGGCGTCGGGTGCTGGCGGTCGTGCGTGGTT27601CCGCGGTCAACCAGGACGGTGCGAGCAACGGACTGACGGCGCCGAACGGGCGGTCCCAGT27661CGCGTTTGATCCGTCAGGCGTTGGCGGATGCGCGGTTGGGTGTGGCTGATGTGGATGTGG27721TGGAGGGCCACGGCACGGGGACGCGTCTGGGTGATCCGATCGAGGCGCAGGCGTTGTTGG27781CGACGTATGGGCAGCGGGATGCGGGTCGGCCTCTGCGGCTTGGTTCGCTGAAGTCGAACG27841TGGGGCATACGCAGGCGGCTGCCGGTGTGGCGGGCGTGATCAAGATGGTCATGGCGATGC27901GGCACGGTGTCCTGCCGAAGACGCTGCACGTGGATGAGCCGACGGCGGAGGTGGACTGGT27961CGGCCGGCGCGGTGTCCCTGCTGAGGGAGCAGGAGGCGTGGCCGCGTGGCGAGCGTGTGC28021GGCGGGCCGGAGTCTCCTCGTTCGGCGTGAGTGGGACGAACGCGCATGTGGTGGTCGAGG28081AGGCGCCGGTTCCGGAGGACGGGGAGGCGATCGAGGGCGGTGCGCCTTTGGCTGTGGTGC28141CGGTGGTGGTGTCGGGTCGTTCTGCGGGTGCGGTGGCGGAGCTGGCGGGCCGGGTCAGCG28201AGGTCGCTGGGTCTGGTCGGTTGGTGGATGTGGGGTTGTCGTCGGTGGTGTCGCGGTCGG28261TGTTCGAGCACCGGTCCGTGGTGCTGGCGGGGGACTCTGCCGAGCTGAATGCCGGTCTGG28321ATGCTCTGGCCGCTGACGGAGTGTCTCCTGTCCTGGTGTCGGGTGTGGCGTCGGGTGAGG28381GTGGCCGGTCGGTGTTCGTGTTCCCTGGTCAGGGGACGCAGTGGGCGGGGATGGCGCTCG28441GGTTGTGGGCGGAGTCGGCGGTGTTCGCGGAGTCGATGGCGCGGTGTGAGGCGGCGTTCG28501CCGGGTTGGTGGAGTGGCGTCTGGCGGATGTGCTGGGTGACGGGTCTGCGTTGGAGCGGG28561TCGATGTGGTGCAGCCGGCGTCGTTCGCGGTGATGGTGTCGCTGGCGGAGTTGTGGCGGT28621CGTTGGGTGTGGTGCCGGATGCGGTGGTGGGGCATTCGCAGGGGGAGATCGCTGCTGCGG28681TGGTGGCGGGTGGTCTCTCGCTGGAGGATGGCGCGCGTGTGGTGGTGTTGCGTGCGCGGT28741TGATAGGCCGTGAGCTGGCCGGGCGCGGTGGGATGGCGTCGGTGGCGCTGCCCCTCGCGG28801TGGTGGAGGAGCGTCTGGCGGGGTGGGCGGGGCGTCTGGGTGTGGCGGTGGTCAACGGAC28861CCTCCGCCACGGTCGTCGCGGGTGATGTGGATGCGGTGGGGGAGTTTGTGACCGCGTGCG28921AGGTGGAGGGGGTTCGGGCGCGTGTTCTGCCGGTGGACTACGCCTCGCACTCGGCGCACG28981TGGAGGACCTGAAAGCCGAGCTTGAACAGATTCTGGCCGGCATCGGCCCGGTGACCGGTG29041GCATCCCGTTCTATTCGACGTCCGAAGCCGCGCAGATCGACACGGCTGGTCTGGACGCGG29101GGTACTGGTTCGGGAATCTGCGTCGGCCGGTGCGGTTCCAGGAGACGGTCGAGCGGTTGC29161TGGCGGATGGTTTCCGGGTGTTCGTGGAGTGTGGCGCGCATCCGGTGCTGACGGGGGCGG29221TGCAGGAGACCGCGGAATCCACCGGTCGCCAGGTGTGTGCGGTCGGATCCCTGCGTCGTG29281ACGAGGGAGGTCTGCGCCGCTTCCTGACCTCTGCGGCGGAGGCGTTCGTCCAGGGGGTCG29341GGGTGTCCTGGCCGGCACTGTTCGACGGCACCGGCGCCCGCACGGTCGACCTGCCCACCT29401ATCCCTTCCAGCGTCGGCGTTACTGGCTGGAGTCACGTCCTCCTGCGGCGGTTGTTCCGT29461CGGGGGTCCAGGACGGATTGTCGTATGAGGTGGTGTGGAAGAGCCTGCCGGTACGGGAGT29521CGTCGCGTCTTGACGGCCGGTGGCTGCTCGTCGTGCCCGAAACCCTGGACGCCGACGGCA29581CGCGGATCGCCCACGACCTCCAGCACGCCCTCACCACCCACGGCGCCACGGTCTCCCGTG29641TGTCGGTCGACGTGACGACGATCGACCGCGCCGACCTGTCGGCGCGGCTCACCACGAGCG29701CGGCCGAAGACCAGGAACCGCTTGGGCGAGTGGTGTCCCTCCTGGGGTGGGCCGAGGGAG29761TACGGGCCCATGGCCCGAACGTACCGACTTCCGTCGCCGCCTCCCTGGCACTCGTGCAGG29821CTGTCGGCGATGCCGGGTTCGGTGTTCCGGTGTGGGCGGTGACGCGGGGTGCGGTGTCCG29881TGGTGCCTGGTGAGGTGCCGGAGACGGCGGGTGCGCAACTGTGGGCGCTCGGCCGGGTCG29941CCGGTCTCGAACTTCCCGACCGTTGGGGTGGTCTGATCGATCTTCCGGCGGATGCCGATG30001CGCGTACGGCGGGGCTTGCGGTGCGGGCCCTGGCCGCCGGGATCGCCGATGGTGAGGACC30061AGGTGGCGGTGCGCCCCTCGGGTGCCTACGGCCGGCGTGTAGTTCAGGCAGCCCACCGGG30121AGCCGTCGGGAGCGAAGACGGAGTGGCGACCGCGTGGCACCGTGCTCGTCACCGGGGGAA30181TGGGCGCCATCGGCACTCGGGTGGCCCGCTGGCTGGCCCGGAACGGAGCCGAACACCTGG30241TGCTGACCGGCCGCCGCGGTGCCGGGACCCCCGGCGCGGACGAGCTGGCGGGAGAGCTGA30301GGGCGTCCGGAGTCCAGGTCACGCTCGCCGCCTGCGACGTGTCCGATCGTGCCGCGCTGG30361CCGCGCTGCTCGACGCGCATCCGCCGACCGCCGTCTTCCACACGGCCGGTGTACTGAACG30421ACGGAACGGTCGACACGCTCACTCCCGCTCACCTGGACGGGGTCCTGAGCCCCAAGGCGA30481CGGCCGCCGTTCACCTGCACGAGCTCACCGCCCACCTGGACCTGGACGCCTTCGTCCTCT30541TCGCCTCCGTCACCGGCGTATGGGGTAACGGCGGCCAGGCCGGGTACGCCATGGCCAACG30601CGGCTCTGGACGCGCTCGCCGAGCAGCGCCGTGCCGGCGGACTTGCGGCGACCTCCATCA30661GTTGGGGCCTCTGGGGTGGCGGCGGCATGGCCGAGGGTGACGGCGAGGTGAGCCTCAACC30721GTCGTGGAATCCGCGCTCTTGAGCCCGCCACCGGCATCGAGGCGCTGCAGCGGACGCTCG30781ACCAGGGCGCCACCTGCCGCACCGTCGTCGACGTGGACTGGGGTCAGTTCGCTCCTCGTA30841CGGCGGCGCTGCGGCGCGGGCGGCTCTTCGCGGATCTGCCCGAGGTGCGGCGTGTCCTGG30901AGTCCGAGGGGGTTGCACGGGAGGACGCCGGAACCGTCGAGCCCGGCGCCGTGCTCGCCG30961AGCGCCTCGCATCGCGCTCCGAGGCCGAACAGCGACGCATGCTCGTCGAGTTGGTACGAG31021CCGAAGCGGCTGCCGTCCTGCGTCACGACACGACGGACCTCCTGGCGCCGCGCAGGTCGT31081TCAAGGACGCCGGGTTCGACTCCTTGACCGCGCTGGAGCTCCGTAACCGGCTGAACACCG31141CCACCGGTGTCGTCCTTCCCGTCACCGTCGTCTTCGACCACCCGAACCCCGGTGCACTGG31201CGGACTTCCTGTACGGCGAAGCACTGGGCCTGTCCGCGGCCAGGTCTTCCGCGAGCGATA31261CGGCCGACACGACCCGCCCGGCCGCCGCCCCCGAAGAGCCGATCGCGATCGTCGGAATGG31321CCTGCCGCTACCCGGGCGAGGCCCGTTCCCCCGAGGPACTGTGGAAGTTGCTCATCGACG31381AACGGGACGTCATCGGCCCCATGCCCACGGATCGGGGTTGGGATGTGGGGGGTCTGTTCG31441ATCCGGAGCCGGGGGTGCCGGGGCGTTCGTATGTGCGTGAGGGCGGGTTCCTTCATGAGG31501CGGGGGAGTTCGACGCGGGGTTCTTCGGTATTTCTCCGCGTGAGGCGTTGGCGATGGATC31561CGCAGCAGCGGTTGTTGCTGGAGACGTCGTGGGAGGCGTTGGAGCGGGCGGGCATCGATC31621CGCACACGCTCCGCGGCTCACAGACCGGCGTCTACGCGGGGATGTTCCACCAGGAGTACG31681CGACCCGGCTGCACGAGGCACCCGTGGAGTTCGAAGGCCACTTGCTGACGGGGACGTCCG31741GGAGTGTGGCTTCGGGTCGTATCTCGTATGTGCTGGGTCTGGAGGGGCCGGCGGTGACGG31801TGGACACGGCGTGTTCGTCGTCGTTGGTGGCGCTGCATCTGGCGGTGCGGGCGTTGCGGA31861GTGGTGAGTGTGACCTTGCTCTTGCGGGTGGGGTGACGGTGATGGCGGAGCCGGGTGTGT31921TCGTGGAGTTCTCGCGGCAGCGGGGGTTGGCTGCGGACGGGCGGTGCAAGGCGTTCGCGG31981CTGCTGCCGATGGCACGGGCTGGGCCGAGGGTGTGGGCGTGCTGGCGGTGGAGCGGTTGT32041CGGATGCGGTGCGTCATGGGCGTCGGGTGCTGGCGGTGGTGCGTGGCTCGGCGGTGAATC32101AGGACGGTGCGAGCAATGGGTTGACGGCTCCGAACGGTCCGTCGCAGCAGCGGGTGATTC32161GTCAGGCGTTGGCGGATGCGCGGTTGGGTGTGGCTGATGTGGATGTGGTGGAGGGGCATG32221GGACGGGGACGCGTCTGGGTGATCCGATCGAGGCGCAGGCGTTGTTGGCGACGTATGGGC32281AGCGGGATGCGGGTCGGCCTCTGCGGCTTGGTTCGCTGAAGTCGAATGTGGGGCATACGC32341AGGCGGCTGCCGGTGTGGCGGGCGTGATCAAGATGGTCATGGCGATGCGGCACGGGGTCC32401TGCCGAAGACGCTGCACGTCGATGAGGTCTCTCCGCACGTCGACTGGTCGGCGGGCGCGG32461TGTCCCTGCTGACGGAGCAGGAGCCGTGGCCCCGTGGTGAGCGCGTACGGCGGGCTGGTG32521TCTCCGCGTTCGGTGTGAGTGGGACGAACGCGCATGTGGTGGTGGAGGAGGCACCTGCTT32581CGGAGGCGCCGGTCGCGGTGGAGCCGGTGGAGCCGGGGGCTGTGGGGCTTCTTCCGGTGG32641TGCCCGTGGTGGTGTCGGGTCGTTCTGCGGGTGCGGTGGCTGAACTGGCCTCCCGCTTGA32701ACGAGTCGGTTCGTTCGGATCGGTTGGTGGATGTGGGGTTGTCGTCGGTGGTGTCGCGGT32761CGGTGTTCGAGCATCGGTCCGTGGTTCTGGCGGGGGACTCTGCCGAGCTGAGTGCCGGTT32821TGGATGCTCTGGCCGCTGACGGAGTGTCTCCTGTCCTGGTGTCGGGTGTGGCGTCGGTCG32881GGGGTGGCCGGTCGGTGTTCGTGTTCCCGGGTGCGGGGGTGAAGTGGGCGGGGATGGCGC32941TCGGGTTGTGGGCGGAGTCTGCTGTGTTTGCGGAGTCGATGGCGCGGTGTGAGGCGGCGT33001TCGCCGGGTTGGTGGAGTGGCGTCTGGCGGATGTGCTGGGTGATGGGGCTGCGTTGGAGC33061GTGAGGACGTGGTGCAGCCGGCGTCGTTCGCGGTGATGGTGTCGTTGGCGGCGTTGTGGC33121GGTCGTTGGGTGTGGTGCCGGATGCGGTGGTGGGGCATTCGCAGGGGGAGATCGCTGCTG33181CGGTGGTGGCGGGTGGTCTGTCGTTGGAGGACGGGGCGCGTGTGGTGGTGTTGCGGGCGC33241GGGTGGCTGAGGAGGTTTTGTCGGGTGGGGGGATTGCTTCGGTGCGTCTTTCGCGGGCCG33301AGGTGGAGGAGCGGTTGGCGGGTGGGGGTGGTGGGTTGAGTGTGGCGGTGGTGAATGCGC33361CGTCGTCGACGGTGGTGGCGGGTGAGTTGGGGGAGTTGGATCGGTTTGTTGCGGCGTGTG33421AGGCGGAGGGGGTGCGGGCGCGTCGGCTGGAGTTCGGGTATGCGTCGCATTCGAGGTTCG33481TGGAGCCGGTGCGTGAGCGGTTGTTGGAGGGGTTGGCCGATGTCCGTCCGGTGAGGGGGC33541GGATTCCGTTCTATTCGACGGTGGAGGCTGGGGAGTTCGATACGGCTGGTCTGGATGCGG33601AGTACTGGTTCGGGAATCTGCGTCGGCCGGTTCGGTTCCAGGAGACGGTCGAGCGGTTGT33661TGGCGGATGGTTTCCGGGTGTTCGTGGAGTGTGGTGCGCATCCGGTGCTGACCGGGGCGG33721TGCAGGAGACCGCGGAGACTGCGGGCCGGGAGGTGTGTGCGGTTGGTTCGCTGCGTCGTG33781ACGAGGGAGGTCTCCGTCGCTTCCTGACCTCTGCGGCGGAGGCGTTCGTCCAGGGGGTGG33841AGGTGTCCTGGCCGGTGCTGTTCGACGGCACCGGCGCCCGCACGGTCGACCTGCCCACCT33901ACCCCTTCCAACGCCGCCACTACTGGGCACAGTCCTCGCCCGCCGGCGCCGGCAGCTCTG33961CTGCGGCCCGCTTCGGTATGACCTGGGAGGAGCATCCGCTCCTCGGCGGCGCGCTGCCGC34021TCGCGGACTCCGGCGAGCTGCTGCTCGTCGGGAGGATCTCCCCGGCCTCCCACTCCTGGA34081TCGCCGACCACACCGTGGCCGGGACCGCCCTGCTGCCCGGGACGGCCTTCGTCGACATGG34141CACTGCACGCCGCCGCGGTCGCGGGCTGTGCGGGTGTGGAGGAGCTGAGCATCGAGGCCC34201CGCTGCCGGTGCACGGCGGGATCCGGCTCCAGGTGGTGATCGACGAGCCCGACTCCTCCG34261CGCGGCGCCGCGTGAGCGTGTTCGCGAGGCCGGAAGAGGAAGACGGGGACGCCGGCCGCT34321GGACCCGACACGCCACCGGCGTGCTGACCCCCGACGTCGCCCCCGAGCCGGGCCGGCCGC34381AGTGGTGCCGGCAGGCCTGGCCGCCGAGCGGCTCCGTCCGGGTGGAGGCGTCGGAGCTCT34441ACGACCGGTTCTCCGCGCTGGGATACGACTACGGCGAGGTCTTCGCCGGGGTCGAGGCCG34501TCTGGCTGCGCGAGGGCGAGGCCTTCGCCGAGGTCCGCCTGCCCACGGGCGCGGCGCCCG34561ACGCCGAGCGGTTCGGGGTGCACCCCGGCCTCCTCGACGCGGCTCTGCACCCCTGGCTGC34621TGGGCGACTTCCTGTCGCGGCCCGACGGCGGATCCGTACTGCTGCCGTTCGCGTGGCGCG34681GCATCACGCTCCACACGGCCGGTGCCGACGCGCTGCGGGTCCGTCTCGGACCGGCCGGAG34741AAGGCGCTCTGTCGCTCGAAGCCGTCGACCTCTCCGGTGCCCCGGTGCTGTCGATGGACG34801CACTGGTGCTGCGTCCGCTCGCCCAGGACCGCCTGGCGGAACTGGTCGGCGGCACGACCT34861CCACCCCGCTGTACCGCGTGGACTGGCAGCGGAGCCCGATCGCGAGGACGGCGCCGTCGG34921CCACGGGGCTCTTCGGCTCCCTCCCGTCCGGTGCCGTCCGCCGCTGGGCGGTCGTCGGGC34981AGGGCGGTGTCGCCGCACGGTACGCGACGGCGGAACCCGGCACGGGGTGCGTCGGGGTCT35041TCCCCGACCTGGACGCACTGCGTACGACGCTGGACTCCGGAGCGGACGGCCCCGACATCG35101TCCTCGCCGACTTCGGGGCCCGGCCAGGCGACGCCGCGCCGCACGGGACGGATCCGGCCG35161ACGGCGCACGCGACACGGTCCGGCGGGGGCTCGCCCTCATACAGGGCTGGCTGTCCGACG35221AGCGCTTCGCCGCCGCGCGTCTCGCCGTGCTCACCGAGCACGCCGTCGCCACCGAGGCGG35281ACACCCGCCGGACGGACCTCGCGGGCTCGGCACTGTGGGGGCTGATGCGTTCGGCGCAGA35341CCGAGCACCCCGACCGCTTCGTCCTCGTCGACCACGACGGGCAGGACGCCTCGTACCGGA35401CGCTGCCCACCGCGCTCGACAGCGAAATCCCGCAACTGGCGCTCCGAGCCGGGGAGACGC35461TGGCCCCCGAACTGGCGGTCCTGCCGAGTCCGGCCGACGGGGGGCCCGCGACAAGCGCGG35521CGTTCGATCCCGAAGGCACGGTACTCGTCACCGGAGCCACCGGCACCCTCGGCAGCCTGC35581TGGCCCGGCACCTGGTCACGGCACACGGCGTACGGCATCTGCTGCTGCTCAGCCGCAGCG35641GACGCGAAGCCGCGGGGGCCGCCGAACTGGAGCGTGAACTCCGTCAACGGGGAGCCGAGT35701TCCAGCTCCTCTCCTGTGACGCGACGGACCGGGCAGCGCTGAAGGAGGCCCTCGCCACCG35761TCCCCGCCGCCCACCCGCTGACCGCGGTGATCCACACGGCCGGCGTCCTCGACGACGGCG35821TCGTCGAGGCGCTGACCCCCGAACGGCTGGACCGCGTGCTGCGCCCCAAGGCGGACGCCG35881CGCTGAACCTGCACGACCTGACCGAGGGAATGCCGCTGAAGGCGTTCGTCCTGTACTCCG35941GGGCGGTCGGACTGCTGGGTGGAGCGGGCCAGGCCAACTACGCGGCAGCCAACGCGTTCC36001TGGACGGCCTGGCCCAACACCGGCACGCGCAAGGGCTGCCCGCGGTGTCCCTGGCATGGG36061GACTCTGGAGCGCCACCAGCACGTTCACCGACCATCTCGGCGAGGTGGACCTGCGGCGCA36121TGGAGCGGTCCGGCATCACGCCGCTCACGGACGAGCAGGGCCTTGACCTGTTCGACCGGG36181CCCTCGGCGCCGCGGTGGACGCGCCGCAGCTCTGCGTGATGGGGCTGGACACGGCGGCGC36241TGCGCCGGCAGGCGGCCGAGCACGGGCCGACTTCGATGCCTCCTCTGCTGCGTACGCTGG36301CGGCGCCTCCCGTACGGCGCGGCGCGGGGCGCTCCGGCCGGGGCGGACGGGCGGCGTCCG36361CCACGGACGCGCCGAGCCGGGCGCAGGCCCTGCGCGAGCGACTGACGGGCCTGGACGCGG36421CGGCACGGCGGGACGAACTCCTGGTCCTGTCGCAGGCGCAGTTGGCCGATGTGCTGGGCT36481TCGCCGACAAGACCGCGGTGGACCCCGTTCGTTCCTTCCGCGAGATCGGTCTGGACTCGC36541TGACCGCCGTCGAGCTGCGCAACCGGCTCGGTGTCGTCACCGGACTGCGGCTGCCGCCGG36601CGCTGGTCTTCGACCACCCCAACCTCGACGCGCTCGCGGCCCACCTGGCGGAGCTCCTCG36661CGGCTGAGGGCCGGGACGACGCGGGCGCCGCGGCGCTCTCGGGAATCGACGCCCTGGACC36721GGGCGGTCCGGGAGATGGCGGCCGACGACACGCGCCGTGACGCCGTCCGCCGACGCCTCG36781CGGAACTGCTGGCGGTGGTCGGCGACGCCCCGCGGGACGGCGGCCGCGCCCCACGGGCGG36841CCGCCGACGCGGGAGGCCGCGACGCTCAAGCGGACCCCGACCTGCTGGGCCGGCTGGACT36901CCGCCTCCGACGACGATCTGTTCGCGTTCATCGAAGACCAGCTGTGAGCGGGACGCCGCG36961CGCGTTCCCCACCCGTCCCTAAGCGCCGCATCAGGCGCACTCGCACCGACACGAGCACGC37021AGGCCAGGAGGGTCCGGTCGATGACGGCCAACGATGACAAGATCCGCGACTACCTGAAGC37081GGGTCGTCGCCGAGCTGCACAGCACGCGGCAACGGCTCAACGCCTTGGAGCACGACGCCC37141GCGAGCCCATCGCCATCGTGGGGATGAGCTGCCGGCTGCCCGGCGGGGTGACCACCCCCG37201AGAGCCTGTGGAGGCTGGTCGACTCCGGCACCGACGCCGCCTCGCCGTTCCCCGACGACC37261GGGGCTGGGACCTGGACGCGCTCCACCATCCGGAGTCGGGAGCCGTCCACTCCCGCGAGG37321GCGGATTCCTCCACGACAGCGCGGACTTCGACGCGGAGTTCTTCGGCATCAGCCCGCGAG37381AGGCCCTGGCCATGGACCCGCAGCAGCGGCTGCTGCTGGAGACCGCCTGGGAGGTGTTCG37441AGCGCGCCGGCATCGACCCGGTCTCCGCCCGCGGCTCCCGCACGGGGGTGTACGCGGGCG37501TCATGTACCACGACTACGGCGCCCGGCTGAACGAGATCCCGCCGGGCCTCGAGGGCTACC37561TGGTCAACGGCAGCGCGGGCAGCATCGCCTCGGGCCGGGTGGCCTACACCCTCGGTCTGG37621AGGGCCCCGCCGTCACCGTCGACACGGCCTGCTCCTCGTCACTGGTCGCCGTGCACCTGG37681CGGCACAGGCACTGCGGCGGCGGGAGTGCGACATGGCGCTCGCGGGCGGCGCGACCGTCC37741TGTCCACCCCCGACCTGTTCATCGACTTCGCGCGACTCGGCGGCCTGGCCTCCGACGGGC37801GCTGCAAGGCGTTCTCCGACGCCGCCGACGGCACCAGCTTCGCCGAGGGCGCCGGCCTGC37861TGCTCCTCATGCGGCTGTCGGACGCGGTGGCCGAGGGCCACACCGTCCTGGCGGTCGTCC37921GAGGCTCCGCCGTCAACCAGGACGGGGCGAGCAACGGCCTGACGGCCCCCAACGGCCTCG37981CCCAGCAACGCGTGATCCGCGAGGCGCTCGCCGACGCGGACCTGGACCCCGACCAGATCG38041ACGCGGTGGAGGCGCACGGCACCGGAACCAGGCTCGGCGACCCCATCGAGGCGCAGGCCC38101TGCTGCACACGTACGGCACGAGCCGCAGCCCCGAACGACCCCTGTGGCTCGGTTCGCTGA38161AGTCCAACATCGGCCACACCCAGGCCGCCGCCGGAGTGGCCGGAGTCATCAAGACGGTGC38221TGGCGATGCGCCACGGACGGCTGCCCCGCACACTGCACGTCACCCGCCCCTCCAGCCGGG38281TGGAATGGTCGGCGGGCGCGGTGGAACTGCTCACGCGGGCACAGGACTGGCCCGGCCAGG38341GGAACGCCCCGCGCCGCGCCGGAGTGTCGTCCTTCGGTGCCAGCGGCACCAACGCACACC38401TGATCCTGGAAGGCGTCCCGGACGGCGACATCACGGTCGCGGAGACCCGACCGGCCACCG38461GCGGCGGCGCCTGGCCGCTCGCGGGCCGGACCGAAGCGGCCCTGCGCGCACAGGCCCGGC38521GGCTCCACGACCACCTCGCCGCCCGCCCCCACGTCTCACCCGCCGCGGTCGGGCGGACGC38581TGGTCCGCTCCCGCACGGCGTTCGAGCACCGGGCCGTCGTCCTCGGACAGGACACCGCCG38641ACCTCCTGAGCGGCCTCGCGGAGCTGGCGTCCGGCGGCGCTCACGGACCCGGCGTGATCA38701CCGGCCGGGCCGCGCGCGGCCGCCGTACCGCACTGCTCTTCACCGGACAGGGCAGCCAGC38761GGCCGGGTGCCGGACGGCATCTCTACGAGCGGTACGAGGTGTTCGCCCGCGCCCTGGACG38821AGACGGCCGCGGCACTCGACCGGCACCTCGACCGCCCGCTGCGCGACGTGATGTTCGCGG38881AGCCGGGCGGCGCGACGGCCGGACTCCTCGACCGCACCGAGTACACCCAGCCCGCACTGT38941TCGCCCTGGAAGTCGCCCTCTTCCGACTGGTGACCGCCGGGGGCCTGCGCCCCGACGCAC39001TCCTCGGGCACTCCGTCGGCGAACTGGCCGCCGCCCACGTCGCCGGAGTGTTCACCCTGC39061CCGACGCCGCCCGCCTGGTGACGGCGCGAGGCCGACTGATGGGCGAGCTGCCGGCCGGTG39121GCGCCATGATGGCGATCCAGGCGAGCGGCCCGGAGATCGAGGAGACGATCACGGCGCTCG39181CGGCCCACCGGTCGGCGCGCGTCGCCGTCGCCGCACTCAACGGTCCCGACGCCACCGTGA39241TCTCGGGCGACGAGGACGTGGTCGCCGAACTCGCCACGCTGTGGCGGGAGCGGGGCCGCC39301GCACCAGGGCGCTGCCCGTCAGCCACGCCTTCCACTCGCCGCACATGGACGCCGCACTGG39361AACCGTTCGCCCGGATCGCGCGCGACGTCTCCTACGCCGAACCGCGCATCCCGGTGGTCT39421CCAACCTGACCGGCGGCATCGCGTCGGCCACGACGCTGTGCGCCCCCGAGTACTGGGTGC39481GCCACGCGCGCGAGGCCGTGCGCTTCAGCGACGGCTTCCGCGCCCTGCGCGACCAGGGGA39541TCGACACCTTCATCGAGCTCGGACCGGACGGCGTGCTGTCCGCCCTGGGCCGCGACTGCC39601TCCGCGAGGAGGAAGGAGACGCCCCACGCCAGGACGGCTCGGCGGACCCCGACACGACCG39661GCTCCCGCGCCGACGGGGGGCGGCGACCCGTGCTGACGGTGCCGCTGCTGCGCCGGGACC39721GCGACGAGACGACCACCTGCCTCGGGGCCCTGGCCACCGTCCACACCCACGGCGTCCCCG39781TCGACCTCGCGGCCGTGCACGGCGCCCCCGAGGGGCCCGCCGTCGAGCTCCCCACCTACG39841CCTTCCAACGCACGCGCTACTGGCTGGACGCCCCGGCCCCCGCCGCCGGCCCCACCGCCA39901CCGGTCTGGAGGCGACGGACCAGCCCCTGCTCCCGGCCGTCATCGACCTGCCCGACGGGG39961AAGGCACCGTACGGACCGGACTGCTCTCCCTGCGCACCCATCCGTGGATCGCCGACCACC40021GCGTCCGCGACCACGCCGTGGTGCCCGGAGCGGCCCTGCTGGACGTCGCCGCCTGGGCGG40081GCACCGAAGCGGGCTGCCCCCGGGTCGCCGAACTGACCTTCGCCACGCCCCTCGTCCTGC40141CCGAGAACGGAGAAGGAGTCCGACTGCGCGTGACGGTCAGCGGCCCCGACGCGGAAGGCA40201TCCGTTCGCTACGCATCGACTCCCGGCCCGCCGACACGGTCCGTACCGCCGATGCCCCCT40261CCGACTGGACCCGCCACGCGTCCGGCACCCTCGTCCCCGCACCCGAGGAGGCCGGCGACG40321GCACCGGCGTGCCGACCGAACTGCTCGGCGCCTGGCCCCCGGCCGACGCGACCCCGGTCG40381CCCTCGACGCGGACGCCGTCGCGGCCGAGTATCAGCGCCTCGCGGCCGGCGGCGTGACGT40441ACGGCCCCGCGTTCCGGGCACTGCGCGCCGTCTGGCGCCGCGGCGCAGAGGTCTTCGCCG40501AGGTCCGGCTTCCCGGCCAGGCGGCCGCCGACGCCTCGCGGTACGGCATGCACCCGGCCC40561TGCTGGACGCCCTGACGCACGCCACCGGGTTCGGCGAGCGGTCCACCGAGGCCCGCGGCC40621TGGTCCCGTTCGCCTGGAGTGACGTCCGGATCCACGTCCGCGGCGCCGACTCCCTGCGCG40681TACGCATCGCGCCGGCCGGCCCCGACGCCGTGACCGTCGCCGCGGTCGACCCGACGGGCC40741GCCCGGTCCTCGCCGCCCGCTCGCTCACGCTGCGCCCCCTGGCGGAGAGCCGGTTCCAGG40801ACCCGGAGGCGGACAGCACGCCGCTGTACCGACTGGAGTGGACACCGGCCCCCGGTTCCG40861TGACCGGGCACGCCGGTCCAAGGCAGGCGGCAGCGGAGTGGGGCGTCCTCGGCGACCCGG40921TCCAGGCCCTCCTCGACGCCGTGCGCGACGGGGCGGAGGCACCCGTGCGAACCCATGACG40981ACCTGCTCGCGCTCGCGGCCTCCGACACGGCCCCGCCCGACCATGTGCTGGCGCTGCTGG41041GCCACGACGGGGACGCTCTCGCCACGGGCGCCCACGACCTGGCCGCACGCGCCCTGGCCC41101TGGTACAGGGCTGGCTGACCCACGCCCGCTTCGCCGACGCGCGACTGGTCGTGCTGACGC41161AGGGGGCGGTGACGGCCGGCACGAGCCCCGTCCACCCGGCCGCGGCCGCCGCCTGGGGGC41221TGCTGCGCAGCGCACAGTCCGAGCACCCGGGCCGCTTCGTCCTCGTCGACGCGGACCCCG41281CCGACCCGGCCGCCTCGTACCGTTCCCTGCCACGGGCCGTCGCCTCCGGGGCGTCCCAAC41341TCGCCCTGCGCGGCGCCGAGATCCTCGTCCCCCGGCTCGCCCGGGGAACGGACCGACAGG41401CCACCGTGCCCGGACACCCCGGCGACGTCACCGCACCGGAGACGACCGCTGCCCCCGAGC41461CGGCCCCGTCCGGCACCCCCTCCGGCCCCTGGCCCGdGGACGGCACCGTGCTCGTCACGG41521GCGGAACCGGGACCCTGGGCAAGGCCGTGGCCCGGCACCTCGTGACCAAGCACGGTGTCC41581GGCACCTGATCCTGGCCGGTCGGCGAGGCGCGGACACCCCCGGGGCGGCCGACCTGGCCA41641CCGAACTGACCGGCCTGGGCGCCACCGTGAACATCGTCCGCTGCGATGCCGCCGACCGCT41701CGGCGCTCGAAGGCGTCCTGGCCGCCGTCCCCGCCGCGCACCCGCTCACCGCCGTCGTGC41761ACACCGCCGGAGTGCTCGACGACGGCATCGTCACGGCGCAGACGCCCCGACGCCTCTCGG41821CGGTCCTGCGCGCCAAGGCGGACGCGGTCAGCCACCTGCACGAACTGACCCGCGACCTGG41881ACCTGTCCGCCTTCGTCCTCTTCTCATCGGCCGCCGGAACCCTCGGCAGCCCCGGCCAGT41941CCGGCTACGCGGCCGCCAACAGCTTCCTCGACGCGTTCGCCGCCTGGCGGCGAGCGCAGG42001GCCTCCCCGCCGTGTCCCTGGCATGGGGACTGTGGGGTGACGGCGGTGACGGTCGTGACG42061GCGGTGGCTCGGCGGCCGACGGCATGGGAGCGAGCCTGGCCGCCGCCGACCTGGCACGGC42121TGCGCCGTTCCGGCATCCTCCCGCTCGACCCGGCCGAAGCGCTGCGCCTGTTCGACGAGG42181CCTGCGACCCCGCCAGGACCGAGGCCGTACTGCTGCCGATCCGCCTCGACCTGACCGGCC42241TGCGCGCCCGTTCCGCCCGCGGCGCCGTACACGCGAGCGTGGTGCCCGAAGTGCTGCACA42301CCTTGGTGCCCCCGCCCGCCGGTGCCGGATCCCCGGCCGGTGCCGACGCGTCGGATCCCG42361CGGCGGGCCAGGCGCCCCCGGCCCCGGCGTCCGACACCCTGGCCGAACGGCTCGCCGGGA42421AGCCCCGAGGCGAACGGCTCACCGCCCTCACCCAACTGGTACGCACCGAGATCGCCTCGG42481TACTCGGGCACCCCGACTCCGGCCGCGTCCAGCTCCAGTCCTCCTTCAAGGAGTCCGGCT42541TCGACTCGCTCACCGCCGTCGAACTCCGCAACCGGCTCACCGCGGCCACCGGAACGAAGC42601TTCCCGCCACCCTCGTCTTCGACCATCCGACACCCGCGGCACTCGTCGACCACCTGGAAC42661AGGAACTGCCGAAGGCAGCACAGGAGATCCCGGCGGACCTCCCGGCCGTTCTCGACGCAC42721TCGACCGGATCCGGGACGGACTCGCCACCGCCGCCACCGACGACAGCAGCCGCGACCACA42781TCGCGGAGCGTCTCCAGGCGTTGCTCGGCACGCTCACCTCGGCTGCGGGCGTCAGCCGCC42841CGACCGGCAGCCCGGGCGAGCACGACCGGCAGGGCCCCGATGAGCTGTCGCTCGGCCAAC42901GACTCGCCGCCAGCAGCGACGACGAACTCTTCGACCTCTTCGACAGCGACTTCCGATCGA42961CGTAGCCCAGGGCCACCCTTCCGCCTCCGCCGCCCGCCCCACACCCCTGGAGAACAGCAC43021GATGTCTTCCACATCGCCCGCCACCAACGAAGAGAAGCTCCGCGACTACCTCCGCCGCGC43081CATGACCGACCTGCACGAGGCCCGCGAGCAGATCCGCCGGACCGAGTCGGCCAGGCACGA43141GCCGATCGCGATCGTGGGGATGGGGTGCCGTCTGCCGGGTGGGGTGTCGTCGCCGGAGGG43201GTTGTGGGATCTGGTGGCGTCGGGGGTGGATGCGGTTTCTCCGTTCCCCACGGATCGGGG43261TTGGGATGTGGGGGGTCTGTTCGATCCGGAGCCGGGGGTGCCGGGGCGTTCGTATGTGCG43321TGAGGGCGGGTTCCTTCATGAGGCGGGGGAGTTCGACGCGGGGTTCTTCGGTATCTCTCC43381GCGTGAGGCGTTGGCGATGGATCCGCAGCAGCGGTTGCTGTTGGAGACGTCGTGGGAGGC43441GTTGGAGCGGGCGGGCATCGATCCGCACACGCTCCGCGGCTCACGGACCGGCGTCTACGC43501CGGAGTGATGTACCACGACTACGGCAGCACCGCGACCGTCTCCGTCGCCTCCGACGACGA43561GACCGCCGGATTCCTCGGCACGGGGACGTCCGGGAGTGTGGCTTCGGGTCGTATCTCGTA43621TGTGCTGGGGCTGGAGGGGCCGGCGGTGACGGTGGACACGGCGTGTTCGTCGTCGTTGGT43681GGCGCTGCATCTGGCGGTGCGGGCGCTGCGGAGTGGTGAGTGTGACCTTGCTCTTGCGGG43741TGGGGTGACGGTGATGGCGGAGCCGGGTGTGTTCGTGGAGTTCTCGCGGCAGCGGGGGTT43801GGCTGCGGACGGGCGGTGCAAGGCGTTCGCGGCTGCTGCCGATGGCACGGGCTGGGCCGA43861GGGTGTGGGCGTGCTGGCGGTGGAGCGGTTGTCGGATGCGGTGCGTCACGGGCGCCGGGT43921CCTGGCGGTCGTGCGTGGTTCCGCGGTCAACCAGGACGGTGCGAGCAATGGGTTGACGGC43981TCCGAACGGTCCGTCGCAGCAGCGGGTGATTCGTCAGGCGTTGGCGGATGCGCGGTTGGG44041TGTGGCTGATGTGGATGTGGTGGAGGGGCATGGGACGGGGACGCGTCTGGGTGATCCGAT44101CGAGGCGCAGGCGTTGTTGGCGACGTATGGGCAGCGGGATGCGGGTCGGGCTTTGCGGCT44161TGGTTCGCTGAAGTCGAACGTGGGGCATACGCAGGCGGCTGCCGGTGTGGCGGGCGTGAT44221CAAGATGGTCATGGCGATGCGGCACGGTGTCCTGCCGAAGACGCTGCACGTGGATGAGCC44281GACGGCGGAGGTGGACTGGTCGGCGGGCGCGGTGTCCCTGCTGAGGGAGCAGGAGGCGTG44341GCCTGAGGTGGGGCGTCTGCGTAGGGCTGCGGTGTCTTCGTTCGGTGTGAGTGGGACGAA44401CGCGCATGTGGTGGTGGAGGAGGCACCTGCTTCGGAGGCGCCGGTCGCGGGGGAGCCGGT44461GGAGCCGGTGGAGCCGGGGGCTGTGGGGCTTCTTCCGGTGGTGCCGGTGGTGGTGTCGGG44521TCGTTCTGCGGGTGCGGTGGCGGAGCTGGCCTCCCGCTTGAACGAGTCGGTTCGTTCGGA44581TCGGTTGGTGGATGTGGGGTTGTCGTCGGTGGTGTCGCGGTCGGTGTTCGAGCACCGGTC44641CGTGGTTCTGGCGGAGGACTCTGCCGAGCTGCATACCGGTCTGGTTGCTGTCGGGACTGG44701GGTGCCGTCGCCTGGCGTGGTGTCGGGTGTGGCGTCGGTCGAGGGTGGCCGGTCGGTGTT44761CGTGTTCCCTGGTCAGGGGACGCAGTGGGCGGGGATGGCGCTCGGGTTGTGGGCGGAGTC44821GGCGGTGTTCGCGGAGTCGATGGCGCGGTGTGAGGCGGCGTTCGCCGGGTTGGTGGACTG44881GCGTCTGGCGGATGTGCTGGGTGACAGGTCTGCGTTGGAGCGGGTCGATGTGGTGCAGCC44941GGCGTCGTTCGCGGTGATGGTGTCGCTGGCCGAGCTGTGGCGGTCGCTGGGTGTGGTGCC45001CGATGCGGTGGTGGGGCATTCGCAGGGGGAGATCGCTGCTGCGGTGGTGGCGGGTGGTCT45061CTCGCTGGAGGACGGCGCGCGTGTGGTGGTGTTGCGTGCGCGGTTGATAGGCCGTGAGCT45121GGCCGGGCACGGTGGGATGGCGTCGGTGGCGCTGCCGGTCGCGGTGGTGGAGGAGCGTCT45181GGCGGCGTGGGCGGGGCGTCTGGGTGTGGCGGTGGTCAACGCACCCTCCGCCACGGTCGT45241CGCGGGTGATGTGGACGCGGTGGCGGAGTTTGTGACCGCGTGCGAGGTGGAGGGGGTTCG45301GGCGCGTGTTCTGCCGGTGGACTACGCCTCGCACTCGGCGCACGTGGAGGAGCTGAGGGC45361CGAGCTTGAACAGATTCTGGCCGGCATCGACCCGGTGGCCGGTGAGACCCCCCTGTACTC45421CACGGTGGAGGCGGGTGTCGTGGATACGGCGTCGATGGATGCGGGGTACTGGTTCAGGAA45481TCTGCGTCGGCCGGTTCGTTTCCAGGAGACGGTCGAGCGGTTGCTGGCGGATGGTTTCCG45541GGTGTTCGTGGAGTGCGGCGCGCATCCGGTGCTGACGGGGGCGGTGCAGGAGACCGCGGA45601ATCCACCGGTCGCCAGGTGTGTGCGGTCGGATCCCTGCGTCGTGACGAGGGTGGTCTGCG45661ACGCTTCCTGACCTCTGCGGCGGAGGCGTTCGTCCAGGGGGTGGAGGTGTCCTGGCCGGT45721GCTGTTCGATGGCACCGGCGCCCGCACGGTCGACCTGCCCACCTACCCCTTCCAGCGTCG45781GCGTTACTGGCTGGAGTCACGTCCTCCTGCGGCGGTTGTTCCGTCGGGGGTCCAGGACGG45841ATTGTCGTATGAGGTGGTGTGGAAGAGCCTGCCGGTACGGGAGTCGTCGCGTCTTGACGG45901CCGGTGGCTGCTCGTCGTGCCCGAAACCCTGGACGCCGACGGCACGCGGATCGCCCACGA45961CCTCCAGCACGCCCTCACCACCCACGGCGCCACCGTGCACACTCTTGCTCTTGACCCCAG46021CGCGGCGCACTTCGACGGTCTCTTTGACGGGATACTCCAGGAAGAAACAGATGTCACGGG46081CATCTTCTCTCTCCTCGGACTGGCATCGGGCCCGCACCCGGATCACGGCGAGGTGGAGCT46141CGCGGGAGCCGCGTCGCTGACGTTGATGCGCCAAGCCCAGCGAGACGGCTTCCGTGCTCC46201GGTGTGGGCGGTGACGCGGGGTGCGGTGTCCGTGGTGCCTGGTGAGGTGCCGGAGACCGC46261GGGTGCGCAACTGTGGGCGCTCGGCCGGGTCGCCGGTCTCGAACTCCCCGACCGTTGGGG46321TGGTCTGATCGATCTCCCGGCGGATGCCGATGCGCGTACGGCGGGGCTTGCGGTGCGGGC46381CCTGGCCGCCGGGATCGCCGATGGTGAGGACCAGCTGGCGGTGCGCCCCTCAGGTGCCTA46441CGGCCGGCGCCTCGTACGAGCCACCGCGCGCCGGGGACGGAAGGACTGGCGCCCGCAGGG46501TACGGTGCTGCTCGCCGGGCACCTCGACGCCGTCGGTGAACCACTGGCCCGATGGCTGCT46561CACCGGCGGCGCGGACCACGTCGTCCTTGCGGATCCCGCCCTGACCGAACTCCCGGCCAC46621CCTCGCGGATCTGGCCCAGACCGTGACGACCGCTGCGGCACCCGACCTTGCCGACCGTGC46681AGTCCTCGCCGCCCTGGTCACCGAGTACGTACCCGCCACCGTGGTCGTCGTTCCGCCCGC46741GGCGGAGCTCGCTCCGCTGGCGAGTATCAGCCCGGCCGACCTCGCGGCGGCCGTCACCGC46801CAAGTCCGCGACCGCGGCGCACTTCGACGCGCTGCTCGACGGACCCCACGCACCGGAGCT46861GGTGCTGATCTCCTCGGTCGCGGGGATCTGGGGTGGTGTCCGGCAGGGTGCGTACGCCGT46921CGGTGCCGCTCACCTCGATGCCCTGGCCGCCCGCCGCAGGGCCCGCGGTCTGTCGGCGGC46981CTCCGTCGCGTGGACGCCCTGGGCGGGTTCCGTCACCGCGGACGGCTCCGCCGCCGAGTC47041GCTGCGGCAGTACGGCATCGCTCCGCTGGAGCCGCAGGCGGCGCTCGCGGAGCTGGACCG47101GGCGCTGAACCAGCAGCTGCACGGCGGCGGGGGCGACGCGGCGGTGGCCGACATCGACTG47161GGAGCGGTTCCTCGCGTCGTTCACCTCCGTACGTCCCAGCGTTCTCTTCGACGAGCTGCC47221CGAGGTACGCCGTCTCCGCGAGGCGGAGGCGGCGGCCATGGCGGACCAGGCCGCCGCCCG47281GACGGGAGCGCCCGGCGGAACGGAGCTGGCGCGCTCTCTGCGGGCCAAGTCCCTGAACGC47341CCAGCGAACTGCGCTCCTGGAATTGGTCACTGCCCACGTGGCGGCCGTGCTGGGAGAGAG47401CGTTCCCGAGGCGATCGACCGGAGCCGGGCGTTCAAGGACATCGGCTTCACCTCCATGAC47461CGCGATGGAACTGCGCAACCGGCTCAAGGAGGCCACCGGGCTCGCCCTTCCTGCCTCCCT47521CGTCTTCGACCACCCCCACCCCGGCGCACTCGCCGACCACCTGCGCGAGGAACTCCTGGG47581CGAGGACGGTGCGGCGGGCGCCGACTCCGCGGCGGAGGAACCGAGCGCTACCTCTCCGAC47641GGTCCAGGACGAGCCGATCGCCATCATCGGCATGGCCTGCCGCCTCCCTGGTGACGTCGG47701AACACCCGACGAACTCTGGGAGCTGCTGGAAACCGGCCGCGACGCGATGTCGGACCTGCC47761CGTCAACCGCGGGTGGGACGTGGCGGGGCTCTACGACCCGGATCCAGACGCGGCGGGGCG47821TTCCTACGTCCGGGAGGGCGGGTTCCTCCACGACGCGGGGGAGTTCGACGCGGAGTTCTT47881CGGCATCTCGCCGCGTGAGGCGCTGGCGATGGACCCGCAGCAGCGCATCGTCCTCGAACT47941CGCCTGGGAATCGTTCGAACGTGCGGGCCTGGACCCGGCCGGCCGCCGCGGCAGCCGTAC48001CGGCGTGTTCATGGGAACCAACGGCCAGCACTACATGCCGCTGCTGCAGAACGGCAACGA48061CAGCTTCGACGGCTACCTCGGCACCGGGAACTCGGCCAGTGTCATGTCGGGCCGCATCTC48121GTACACCCTCGGCTTGGAGGGGCCGGCGCTGACGGTGGACACGGCGTGTTCGTCGTCGCT48181GGTCGCGCTGCATCTGGCGGTGCGGGCGCTGCGCAACGGTGAGTGCGACCTCGCCCTCGC48241CGGCGGCGCGACCGTGATGTCGACGCCGGAAGTCCTGGTGGAGTTCTCCCGGCAGCGTGC48301AGTCTCCGCAGACGGTCGCTGCAAGGCGTTCTCCGCCTCGGCCGACGGCTTCGGACCGGC48361CGAGGGTGCGGGCGTGTTGCTCGTCGAGCGGTTGTCGGATGCGGTGCGTCATGGGCGTCG48421GGTGTTGGCGGTCGTGCGTGGTTCGGCGGTGAATCAGGACGGTGCGAGTAATGGGTTGAC48481GGCTCCGAACGGTCCGTCGCAGCAGCGGGTGATTCGTCAGGCGTTGGCGGATGCGCGGTT48541GGGTGTGGCTGATGTGGATGTGGTGGAGGGGCATGGGACGGGGACGCGTCTGGGTGATCC48601GATCGAGGCGCAGGCGTTGTTGGCGACGTATGGGCAGCGGGATGCGGGTCGGCCGTTGCG48661GCTTGGTTCGTTGAAGTCGAACGTGGGGCATACGCAGGCGGCTGCCGGTGTGGCGGGCGT48721GATCAAGATGGTCATGGCGATGCGGCACGGTGTCCTGCCGAAGACGCTGCACGTCGATGA48781GGTCTCTCCGCACGTCGACTGGTCGGCGGGTGCGGTGTCCCTGCTGACGGAGCAGGAGCC48841GTGGCCGGAGGTGGGGCGCCCTCGCAGGGCTGCGGTCTCTTCGTTCGGGCTCAGCGGGAC48901GAATGCGCATGTGGTGGTCGAGGAGGCGCCGGTCGGGGAGGCGGGGCAGGCCGCCGGGGA48961TGCTCGGTTGGCTGTGGTGCCGGTGGTGGTGTCGGGCCGGTCTGCGGGTGCGGTTGCTGA49021ACTGGCCTCCCGCTTGAACGAGTCGATTCGTTCGGATCGGTTGGTGGATGTGGGGTTGTC49081GTCGGTGGTGTCGCGGTCGGTGTTCGAGCACCGGTCCGTGCTACTGGCGGGGGACTCTGG49141CGAGCTGCATACCGGTCTGGTTGCTGTCGGGACTGGTGTGCCGTCGCCTGGTGTGGTGTC49201GGGTGTGGCGTCGGTCGGGGGTGGCCGGTCGGTGTTCGTGTTCCCTGGTCAGGGGACGCA49261GTGGGCGGGGATGGCGCTCGGGTTGTGGGCGGAGTCGTCGGTGTTCGCGGACTCGATGGC49321GCGGTGTGAGGCGGCGTTCGAGGGGTTGGTGGACTGGAGTCTGGCGGATGTGCTGGGTGA49381CGGGTCCGCGTTGGAGCGGGTCGATGTGGTGCAGCCGGCGTCGTTCGCGGTGATGGTGTC49441GCTTGCTGAGCTGTGGCGGTCGTTGGGTGTGGTGCCGGATGCGGTGGTGGGGCATTCGCA49501GGGGGAGATCGCTGCTGCGGTGGTGGCGGGTGGTCTGTCGTTGGAAGACGGGGCGCGTGT49561GGTGGTGTTGCGTGCGCGGTTGATCGGCCGTGAGCTGGCCGGGCGCGGTGGGATGGCGTC49621GGTGGCGCTGCCGGTCGCGGTGGTGGAGGAGCGTCTGGCGGGGTGGGCGGGGCGTCTGGG49681TGTGGCGGTGGTCAACGGACCGTCCGCCACGGTCGTCGCGGGTGATGTGGATGCGGTGGC49741GGAGTTTGTGACCGCGTGCGAGGTGGAGGGGGTTCGGGCGCGTGTTCTGCCGGTGGACTA49801CGCCTCGCACTCGGCGCACGTGGAGGACCTGAAGGCCGAGCTTGAAGAGGTGCTGGCCGG49861CATCGGCCCGGTGACCGGTGGGATCCCGTTCTATTCGACGTCCGAAGCCGCGCAGATCGA49921CACGGCTGGTCTGGACGCGGGGTACTGGTTCGGGAATCTGCGTCGGCCGGTGCGGTTCCA49981GGAGACGGTCGAGCGGTTGTTGGCGGATGGTTTCCGGGTGTTCGTGGAGTGTGGTGCGCA50041TCCGGTGCTGACGGGGGCGGTGCAGGAGACCGCGGAATCCACCGGTCGCCAGGTGTGTGC50101GGTCGGATCCCTGCGTCGTGACGAGGGAGGTCTGCGCCGCTTCCTCACCTCTGCGGCGGA50161GGCGTTCGTCCAGGGGGTCGGGGTGTTCTGGCCGGCACTGTTCGACGGCACCGGCGCCCG50221TATCGTCGACCTGCCCACCTACCCCTTCCAACGACGGCACTACTGGTACAACGACCCTGC50281CCGCCGCACGGGCGATGCCACCTCCTTCGGTATGGCGCAGGCCGGTCATCCCTTGCTCGA50341TGCCGGTACGGAGCTCCCTGAATCAGGCGAGCACCTCTACACCGCTCGGCTCGCCGCCGA50401CTCGCATCCTTGGCTGCTGGAACACACCCTGCTGGGTGCGCCGTTGCTGCCCGGTGCGGC50461GTTCGTCGACCTCGTCCTGTGGGCCGGCGGGGAGGTCGGATGCGACCTGATCGAAGAGCT50521GACGCTGACCTCGCCGCTGCTGTTGTCCGACAGCGCTGCCCTTCAACTGCGGCTGGTCGT50581GGGCACGGCGGACGCCGAGGGACGTCGTACGATCACCGTCCACTCGCGGCCGGACGGAGA50641CCCGCGTACGACGCGCACACCTGCGGCATCGTCCGAGACCAGCCCGGACGCGGAGTCGGA50701CACGGAGATCCGTAGGGACACGTCCGCCTGGACGAAGCACGCTCAGGCGACGGTCGCCCC50761CGCCCCTGACGTCCCCCCTTCCGGGGTGGACGCGGAAGGGGACGCCGTTCGCCCCGCAGT50821GGAATGGAGCGTGGCGGCGACGGAGTCGGATGCCTTCCAGGCCGAGGACTTCTACGCGTC50881CTTCGCCGCACACGGCTATGGCTACGGCCCGCTGTTCCAGGGCGTACGGTCAGGCCGTCA50941GGACGGGACCGACGTCTACGCCGAAGTCGCCCTGGATCACGACCGCTTGCCGTCTGCCGA51001GCAGTTCGGCCTGCACCCCGCGCTGCTCGACGCGGCGTTCCAGACGATGCGTCTGGGATC51061GTTCTTCCCCGACGACGGACAGGCACGTGTGCCGTACACCTTCCGGGGGATTCGTCTCTA51121CGCCCCGGGAGCCGCGCGCCTGCGGGTCCGTGTCTCGGCGGTCGGGGCCGATGCCGTACG51181CGTGGAGTGCGCCGATGAGCGGGGGCGGCTCGTCTGTGAGATCGACGCCCTCGTCGTCAG51241CACGGTCTCCCCGGACCAGTTGCGGCCGGCCGGACAGGACGCGACCCAGGACATGCTGCA51301CCGGATCGAGTGGCCCGTCCTCTCCCCGCCGACCGGCAGCGCCACCTCCCCTGCTCCGCC51361CCGCTGGATCGTGGTCGGGGGCGAGGACGAGGGCCTCGGGCTCGGGGGCCTTCGACTCGA51421CGGTCCGAGGCTTGACGGTCCCGGGCTTGCGGAAGCGCTGTCCGAAGCCGGTATGGGGAC51481CGAGCGTCACCGGAACCTGGCCGACGCGCTGTCGGCCGTACGGACGCCGGTGGACACGGC51541AGGCTCCGCTGCCGCCGCCGGCACGACCTCCCTCATAGCCGTCCCCGTACCGCAGTCGCC51601CACCATGGACGCCGGTGCCGTGCGCCACGCCGTCCACCGAGCCCTGGAGCTGGTGCAGGG51661CTGGGTGGCGGCGGACGAGGCGGCGGAAGAGGGCGGGAGCGACGGTGCCGCGGCCGACCG51721GCGGCTGGTGCTGGTCACGAGCGGAGCGGTGTCCACGGGTGACGCCGACCCGCTGCGCGA51781CCCGGTGGCCGCGGCCGTCTGGGGTCTGATCAAGTCCGCCCAGTCGGAGCAGCCCGGCCG51841CATCGTCCTCGTTGACCTCGACGAGGGAGCCGTGGACGGGGCGGCCTTGGCAGCCGCGAT51901CTCGACCGGCGAACCACAACTCGCCCTGCGCGACGGCGATGTGCACGTGCCCAGGCTGGC51961ACCCCTGTCCGTGCGGGACTCGCAGACGCTGCTGCCGCCCGCCGGTACGCGCGCCTGGCA52021TCTGGTCGGCGCCGGCACCGGAACCCTTTCGGACCTCGCGCTCGTACCGGCGCAGACCGA52081CACCGTCGCGCTCGCACCTGGGCAGGTGCGGATCGCGGTGCGAGCCGCCGGACTCAACTT52141CCGGGACACGCTCATCGCGCTCGGTATGTATCCGGGCGAGGGCGTGATGGGCGCCGAGGG52201CGCCGGAGTGATCACCGAGGTCGGCCCGGACGTGGTGAGCCTCGCCGTCGGGGACCGCGT52261CCTGGGCATGTGGACCGACGGGTTCGGGCCGTACGTCGTGGCCGACCACCGCATGGTGGC52321CCCGATGCCGCGCGACTGGTCCTACGCGGAGGCCGCTTCGGTACCCGCCGTCTTCCTCAG52381TGCCTACTACGGACTCAGGCACCTGGCCGGTCTGCGCGCCGGCCAGTCGGTGCTGGTGCA52441CGCGGCGGCGGGCGGTGTGGGCATGGCTGCCGTCCAACTCGCCCGGCACTTCGGGGCCGA52501GGTCTTCGGCACGGCCGGCACGGCCAAATGGGACGCACTGCGGGCACAGGGCCTGGACGA52561CCGGCACATCGCCGGTTCACGGACGCTGGACTTCGCGGACCGGTTTCTCGACGCGACCGA52621GGGGCGCGGCGTGGACGTCGTCCTGAACTCGCTGGCGGGCGACTTCGTCGACGCCTCCCT52681GCGACTGCTGCCGCGCGGAGGGCGGTTCGTGGAACTGGGCAACCCGGACGTACGCGACGC52741CGCGCAGGTCGCCGCCGACCGGCCGGGAACCGTCTACCGGGCCTTCGAGCTGATGGAGGC52801CGGGCCGGAGCTGATCGGCCGCATGCTGAACGAACTGCTGGAACTGTTCGAGTCCGGGGC52861GCTGCGCCTGCTGCCCGTCACCCCGTACGACATCCGGCGGGCACCCGACGCCTTCCGCAC52921GCTCAGCCAGGCCGGTCACGTCGGCAAACTGGTCCTGACGATGCCACCGGCCTTCGAACC52981CCACGGCACGGTCCTGATCACCGGCGGCACCGGGAACCTGGGCGGAACACTCGCCCGCCA53041TCTCGTGACCGAACACGGAGTGCGCCACCTGCTCCTGGCCGGACGTAGGGGGCCCGAGGC53101CGAAGGCGCCGCCGAACTCGTACGGGAACTGCACGACCTGGGCGCCTCCGTCACGGTCGC53161CGCCTGTGACGTGGCCGACCGAGCGGCGCTCCGGAAACTCCTCGGCGGCATACCGCCGGA53221GCGCCCGCTCACCGGAGTCGTCCACGCGGCGGGCGTTCTCGACGACGGCGTGGTCACGTC53281ATTGACCCCGGACAGGGTCGACGGCGTCCTGCGGCCCAAGGTGGATGCCGCTCTCAACCT53341CCACGAAGCGGCCCTCGATCCCGAACTCGGTCTCGACATCACCGCGTTCGTCCTGTTCTC53401GTCCGTCGCTGCCCTGCTCGGCGGCTCGGGTCAGGGAAGCTACGCCGCCGCCAACGGCTT53461CCTCGACGGACTCGCCCAGTACCGGCGTGGCCGCTCGCTGCCCGCGCTCTCCCTCGGCTG53521GGGCCTGGCCGGGAGCGGCCGGATGACATCCCACCTGGACAGCCGGGCCCTGCTCCGGCG53581CATGGCCAGGGGCGGTGTCCTGCCGCTGTCCCCGGCGGAGAGCATGGCACTGTTCGACGC53641GGCCCAGGGCTTCGACGAGGCGCTCCAGGTGCCGGCGCGCTTCCACACCGCCGCACTGGG53701CGCCGACGGCAACGTCCCGCCGCTCTTCAACGGACTGATCCGGGGCGGGACGGCGCATGC53761CGAGGCCCGGCGCAGGACGGTCGGCGCGTCGCCCGCGGGTGGTCCTGCCGGAGGCGAGCC53821GGTGAACCTCGCCGACCGGCTGTCCGGACTGACGGAGGACGAACAGCGGGCGCTGCTCCT53881CGACACGGTGCGCACGCACGCGGCTCTCGTCCTGGGCCACACGGGCACGGACGGCATCCA53941GGCCGACCGGGCGTTCAAGGACCTGGGATTCGACTCGCTGACGGCCGTCGAGATGCGCAA54001CCGGCTCACCGCCGCCACGGGGCTGCACCTCGCGGCGACGCTGGTCTTCGACCACCCCGC54061TCCGGCGGACCTCGCCGAGCACCTCCGCTCTCGACTCGTCCCCGAGGGGACGGACGTACC54121GCCGCTCCTCGCGGAACTCGGTCGGCTCGAAACGGCGTTCAAGAAGCTGACCACCGCGGA54181CCTCGCCTCGGTCGTGCCCGACGACATCGCCCGCGACGAGATCGCCGTACGTCTCGCCGC54241CCTCGGTTCCCTGTGGAACGGGCTCCATGGCAACGGCCTCAGCGGAGACGCGGCGCAGAA54301GCACGGCGACTCGATCGTCGAGGACATCGACTCCGCCGACGACGACGAGATCTTCGCCTT54361CCTCGACGAGAGCTTCGGCGACTCCTGACCGCAGGCACCTCCGTACGGACCGACGACTCT54421GCAGACGGGTGATGAGAGATGGCGACCGAACACGAGCAGAAGCTCCGCGACTACCTCAAG54481CGGGCCACCACCGAACTCCACAAGGCCACGGAACGGCTGAAGGAGGTCGAACAACGCGCT54541CACGAGCCGGTTGCGATCGTGGGGATGGGATGCCGGTTCCCGGGCGGGGCGTCCTCGCCT54601GAGGAGTTGTGGGACCTGGTGGCGGCGGAGACGGACGCGGTCTCCCCCTTCCCGGTGGAC54661CGGGGGTGGGACGTGACGGGGCTGTACGACCCGGATCCGGACGCGGCAGGGCGTGCCTAC54721GTCCGCGAGGGCGGGTTCCTCCACGACGCGGGGGAGTTCGACGCGGGGTTCTTCGGAATC54781TCTCCGCGTGAGGCGTTGGCGATGGATCCGCAGCAGCGGTTGCTGCTGGAGACGTCGTGG54841GAGGCGTTGGAGCGGGCGGGCATCGATCCGCACACGCTGCGCGGCACGCGGACCGGGGTC54901TACATGGGTGCCTGGAACGGCGGATACGCCGAGGGGATTCCCCAACCCACGGCGGAACTG54961GAGGCCCAGCTCCTCACCGGCGGCGTGGTGAGCTTCACCTCGGGCCGTGTGTCCTACCTC55021CTGGGTCTGGAGGGGCCGGCGGTGACGGTGGACACGGCGTGTTCGTCGTCGCTGGTCGCG55081CTGCACCTGGCGGTGCGGGCGCTGCGCAGTGGTGAGTGCGACCTCGCCCTCGCCGGCGGC55141GCGACGGTGATGTCGACGCCCGACGTGTTCGTGCGCTTCTCCCGGCAGCGAGGAGTGGCC55201GCGGACGGTCGCTGCAAGGCGTTCTCCGCGTCGGCCGACGGATTCGGACCGGCTGAGGGT55261GTGGGCGTGCTGGCGGTGGAGCGGTTGTCGGATGCGGTGCGTCATGGGCGTCGGGTGCTG55321GCGGTCGTGCGTGGTTCCGCGGTCAACCAGGACGGTGCGAGCAACGGACTGACGGCGCCG55381AGCGGACGAGCTCAGGCCCTTCTGATTCGTCGAGCGTTGGCGGATGCGCGGTTGGGTGTG55441GCTGATGTGGATGTGGTGGAGGGGCATGGGACGGGGACGCGTCTGGGTGATCCGATCGAG55501GCGCAGGCGTTGTTGGCGACGTATGGGCAGCGGGATGCGGGTCGGCCGTTGCGGCTTGGT55561TCGTTGAAGTCGAATGTGGGGCATACGCAGGCGGCTGCCGGTGTGGCGGGCGTGATCAAG55621ATGGTCATGGCGATGCGGCACGGTGTCCTGCCGAAGACGCTGCACGTGGATGAGCCGACG55681GCGGAGGTGGACTGGTCGGcCGGCGCGGTGTCTTTGCTGAGGGAGCAGGAGGCGTGGCCT55741GAGGTGGGGCGTCTGCGTAGGGCTGCGGTGTCTTCGTTCGGTGTGAGTGGGACGAACGCG55801CATGTGGTGGTGGAGGAGGCGCCGGTTCCGGAGGACGGGGAGGCGGTCGGGGGCGGTGTG55861CCTTTGGCTGTGGTGCCGGTGGTGGTGTCGGGTCGTTCTGCGGGTGCGGTGGCGGAGCTG55921GCGGGCCGGGTCAGCGAGGTGGCTGCGTCTGGTCGGTTGGTGGATGTGGGGTTGTCGTCG55981GTGGTGTCGCGGTCGGTGTTCGAGCACCGGTCCGTGGTACTGGCGGGGGACTCTGCCGAG56041CTGAATGCCGGTTTGGATGCTGTGGCCGGTGGTGTGCCGTCGCCTGGTGTGGTGTCGGGT56101GTGGCGTCGGGTGAGGGTGGCCGGTCGGTGTTCGTGTTCCCTGGTCAGGGGACGCAGTGG56161GCGGGGATGGCGCTCGGGTTGTGGGCGGAGTCGTCGGTGTTCGCGGAGTCGATGGCGCGG56221TGTGAGGCGGCGTTCGTCGGCTTGGTGGACTGGCGCTTGTCGCAGGTTTTGAGCGATGGG56281TCGGCGCTGGAGCGGGTGGAGGTGGTGCAGCCGGCGTCGTTCGCGGTGATGGTGTCGCTT56341GCTGAGCTGTGGCGGTCGTTGGGTGTGGTGCCGGATGCGGTGGTGGGGCATTCGCAGGGG56401GAGATCGCTGCTGCGGTGGTGGCGGGTGGTTTGTCGCTGGAGGACGGGGCGCGTGTGGTG56461GTGTTGCGTGCGCGGTTGATCGGTCGTGAGCTGGCCGGGCGCGGTGGGATGGCGTCGGTG56521GCGCTGCCGGTCGCGGTGGTGGAGGAGCGTCTGGCGGGGTGGGCGGGGCGTCTGGGTGTG56581GCGGTGGTCAACGGACCGTCCGCCACGGTCGTCGCGGGTGATGTGGATGCGGTGGCGGAG56641TTCGTGGCCGCGTGCGAGGTGGAGGGGGTTCGGGCGCGTGTTCTGCCGGTGGACTACGCC56701TCGCACTCGGCGCACGTGGAGGACCTGAAAGCCGAGCTTGAACAGATTCTGGCCGGCATC56761GGCCCGGTGACCGGTGGGATCCCGTTCTATTCGACGTCCGAAGCCGCGCAGATCGACACG56821GCTGGTCTGGACGCGGGGTACTGGTTCGGGAATCTGCGTCGGCCGGTGCGGTTCCAGGAG56881ACGGTCGAGCGGTTGTTGGCGGATGGTTTCCGGGTGTTCGTGGAGTGTGGCGCGCATCCG56941GTGCTGACGGGGGCGGTGCAGGAGACCGCGGAATCCACCGGTCGCCAGGTGTGTGCGGTC57001GGATCCCTGCGTCGTGACGAGGGAGGTCTGCGCCGCTTCCTCACCTCGGCCGCGGAGGCA57061TTCGTCCAAGGCGTGGAGGTGTCCTGGCCGGCACTGTTCGAAGGCACCGGCGCCCGCACG57121GTCGACCTGCCCACCTACCCCTTCCAACGTCGGCGCTACTGGCTGGAGTCGCGCCCTCCC57181GCGGCGCCGATCGAGACTGCCGCAGCCTCTGGCATCGAGAGCTGGCGCTACCGCGTGGCG57241TGGAAGAGCCTGTCGCTGTCGGAGTCGTCGCGTCTTGACGGCCGGTGGCTGCTCGTCGTG57301CCCGAAACCCTGGACGCCGACGGCACGCGGATCGCCCACGACATCCAGCACGCCCTCACC57361ACCCACGGCGCCACGGTCTCCCGTCTGACGGTCGACGTGACGACGACCGACCGCGCCGAC57421CTGTCGGCGCGGCTCACCACCACCGCGGCCGAAGACCAGGGGCCTCTCCGGGGCGTCCTC57481TCCCTCCTGTCCACCGATGAACGGCAGCACCCGGATCATCCCGGTGTCGACCGTGCCACG57541GCGGGCACGATGCTGCTCGCCCAGGCGTGCGGGGATCTGGTCGTGGCCCGGGGCGTGGAG57601CCGAGGCTGTGGGTCGTGACCCGCGGGGCGGTCGCGGTGTCCCCCGCCGAGCGTCCGTCG57661TCAGCCGGCGCCCAGGTCTGGGGCCTGGGGCGCTGCGCGGCGCTCGAACTTCCCACTCGG57721TGGGGTGGGATGGTCGACCTTCCCCCGGCGGCCCGGGATGCTGGAAGGCACGTACGGCGG57781CTCGTGCGTCTGCTGTCGGAGACCTGTGCGGAGGACCAGGTGGCGCTGCGTGCGTCGGGT57841GCGTACGGCCGCAGGCTGCTGCCCGCGTCCAGCCCCTCCGTATCCGTCCCCCGGACCGCG57901AAGAGCGGCTACCAGCCGCGCGGCACGGTGCTGGTGACCGGCGGAACCGGTGCCCTCGGT57961GGCCACTTGGCACGGTGGCTGGCCCGCAACGGCGCCGAGCACATCGTTCTGGCCGGGCGT58021CGGGGCGAGGGTGCTCCAGGAGCCGCGGAACTGTCGGCGGAGCTCAAGGAGCTGGGTGCG58081GAGGTCACCGTCGCGGCCTGCGACGTGGCGGACCGGAACGCGTTGCGTGACATGCTGGAA58141TCCCTGCCGGCCGACCGGCCGCTGTCGGGGGTGTTCCACGCTGCCGGTGTCCCGCACTCG58201GCGCCGCTGGCCGAGACGGATGTGGCGGGGCTCGCCGCCGTGCTCCCGGGGAAGGTCGTC58261GGGGCACGGCACCTGCACGAACTCACCAGGGAGAAGGAACTGGACGCGTTCGTGCTGTAC58321GCGTCGGGCGCCGGGGTGTGGGGGAGCGGCGGGCAGAGCGCGTACGGAGCCGCCAACGCC58381GCACTGGACGCGCTGGCCGAACAGCGCCGGGCTGAGGGACTGCCCGCCACTTCGGTCTCC58441TGGGGCCTGTGGGACGGCGGAGGCATGGCCGGCGAGCGAGGCGAGGAGTTCCTCACCGCC58501CTCGGCCTGCGGGCCATGGAGCCCGAGTCGGCTGTCGCCGCCCTGGAGGAGGCCCTGGAT58561CGTGGGGACACCTGCGTGAGCGTGGTCGACGTCGACTGGTCCCGGTTCGCCGAGTCGTTC58621ACCGCCTTCCGGCCCAGCCCGCTGATCGGGGAGCTCCCCGGGGTACGTGCCGTGCCCGAC58681GGATCGGCGGGCGGACCGTCGGACGACCTCGCGGACGCTGCGCGGCACGGCGGGGCAGCC58741GACCGGGGTGTGCCTGCAGGGCTCGCCCGGGCGACGGGCGACGACCGGCAGGACATCCTG58801CTCGATCTCGTACGCCGCCATGCCGCCGCCGTCCTCGGTCACCCGGGACCGCAGCACATC58861GAGCCCGACGCCGGTTTCCGGACCCTGGGGTTCAGTTCGGTCACCGCGGTGGAACTGGCC58921AACAAGCTCGGTGCGGCCGTGGGAACGAAGATCCCCGCCACCTTCGCGTTCGACCACCCC58981AACGCCCGTGCCGCGGCGTCCCGCCTCGACGTCCTGTTGGCGGCGTCGAGCGATGAGACC59041GCGCAGGAGGCGGAGATCCGGCAGGCACTGCGGACTGTGCCGCTGGCCCGGCTGCGGGCT59101GCGGGGCTCCTCGACGGCCTGCTCGAACTCGCCGGGCTGGAAGCGGAGCCCGGCCTGCCG59161GGCGACGTACCGGATCGCGGTGCGGCCACGCCGGACGAGGAGTCCGCCCTGGCGGAAGTC59221GACGGCCTGGACGCCGAAGCACTGGTCGACCTCGTCCTCAACCAGTCCGACTCCTGACCG59281CCGGCGGCGGCGCCGCGGCCCGCCGTGCCGTCGCCGCCCTCGGCCGTACGAAGAACCCCA59341CAGACCTGACCGGGTCACGGCCCGGTGCTCAGCAAGGAGACCACTCATGGCTCTGTCCCA59401AGAGAAGGTACTGGAGGCACTGCGCACCTCCGTCAAGGACGCCGAACGGCTGCGCAAGCG59461CAACCGCGAACTCCTCGCGGCCCGCCACGAGCCCATCGCCGTCGTCGGCATGGCCTGCCG59521CTATCCCGGCGGGGTCCGTTCGCCCGAGGACCTCTGGGAACTCGTCGTGTCGGGCACGGA59581CGCGGTGGGTCCCTTTCCCGAGGACCGTGGCTGGGACGTGGAGCGGATCTACGACCAGGA59641CCCGTCCGTCCCGGGCACCACGTACTGCCGCGAGGGCGGATTCCTTTACGATGCGGGGGA59701CTTCGACGCGGCTTTCTTCGGGATAGGGCCGCGCGAGGCCACCGTGATGGACCCCCAGCA59761GCGCCAGCTGCTGGAGGCGTCCTGGGAAGCCCTGGAGCAGGCCGGGCTGGACCCCCGGGC59821GCTCCGCGGCAGCCAGGGGGGTGTGTTCGTCGGCGCGGCGAACCAGGGCTACGTACCGGG59881TGACGCGGCGGCGTCGGGACGTCTGCCGGAGGGTTCCGACGGCTATCTGCTCACCGGCAA59941CGCCGACGCCGTCCTGTCGGGCCGGATCAGCTACTTCCTGGGCCTGGAAGGCCCGTCCAT60001GACCGTCGAGACGGCCTGCTCCTCCTCCCTGGTGGCACTGCACCTGGCGGTGCAGGCGCT60061GCGCCGTGAGGAGTGCGAGTTCGCCCTGGCCGGAGGGGTCGCCGTGCTCGCCAACCCGGC60121CGCCTACGTGGAGTTCGCCCGGCAGCGGGGACTCGCCCCGGACGGGCGCTGCAAGGCGTT60181CGACGACGCGGCGGACGGTACGGGCTGGGCCGAGGGCGTCGGCGTCCTGGTGGTGGAGCG60241GCTGTCGGACGCGGTACGCAAGGGGCACCGGGTCCTCGCCGTCGTGCGGGGCACGGCGGT60301GAACCAGGACGGTGCCAGCAGCGGTCTGTCCGTGCCCAACGGGCCCTCCCAGCAGCGGGT60361CATCCGCCGAGCGCTGGCCGACGCCCGGCTGGAGGCCGGCCAGATCGACGCGGTGGAGGC60421CCACGGCACCGGCACTCGGCTGGGGGACCCCATCGAGGCGCAAGCCCTGCTGGACACGTA60481CGGAGAGGAGCGGAGCCCCGAACGCCCTCTGTGGGTCGGGTCGTTGAAGTCGAACTTCGG60541TCACGCACAGGCGGCAGCCGGAGTCGGCGGCGTCATCAAGACGGTGATGGCGCTCCGGCA60601CGGCCTGCTTCCCCGCACGCTCCATGTGACCAGCCCGACGCGGCACGTCGACTGGGGCGA60661CGGACAGGTGCGGCTGCTGACCGAGCCGGTCGACTGGCCGCGGACCGGCGCCCCCCGGCG60721GGCCGCGGTCTCGGCGTTCGGCGTGAGCGGCACCAACGGGCACATCATCCTCGAGGAGGC60781GCCGCCGCCCACCCGGCCCGAAGCGGTCCGGCAGGCCGGGGAGCGGCGGCCGGTCCTGGT60841CCCGTGGACGCTGTCCGGCCGTACGAGGCCGGCGCTGTGCCGGCAGGCCGCGCGCCTGGC60901GGCGCACCTCGAACAGCACCCGGACCTCGACCCGCTGGACGTCGGGTTCTCGCTCGCCAC60961GACGCGCACCCACTTCGAGCACCGGGCCGTGCTGCTCGCGGACGCCGCCACCGAGGGCGG61021CTCCCGTGCCGACGCGCTCGGGGCGTTGCGGGCGATCGCGGAGGACCGCGACCCGGGCGG61081GGCGGTACGGGACACCGCGCGGGGCGAAGGGCGTATCGCCTTCCTGTTCTGCGGGCAGGG61141CAGCCAGCGGCCCGGCATGGCGGAGCAGCTGTACGCGCAGTACCCGGCGTTCGCGCGGGA61201ACTGGACACGATCGCGACGCATCTGGACGCCCATCTGGACCGTCCGTTGGCGACGGTGAT61261GTTCGCGCCGGCCGGTACGGCGGAGGCCGCGCTGCTCGACGGCACGCAGTACGCCCAGGC61321GGCCCTGTTCGCCGTAGAGGTCGCGTTGTTCCGGCTCTTCGAGGGCTGGGGGCTGCGCCC61381CGACGTACTGCTGGGCCATTCCGTGGGCGAGCTGGCCGCCGCCCACGTGGCCGGGGTGTT61441CGGGCCGGCGGACGCCTGCTCGCTGGTCGCCGCACGCGGCCGGCTCATGCAGGAGCTGCC61501GGCCGGCGGCGCGATGCTCTCGGTCCGTGCCGCCGAGCACGAGGTGCGGGAGCTGATCGC61561CGGGCAGGAGGACCGCATCGCGGTGGCGGCCGTCAACGGGCCCCGCTCCGTGGTCGTATC61621CGGGGACGAGGACGCGGTCTCGGCGCTCGCCGAGGAGCTGACCGAATACGGCGTGCGCAC61681CAAGCGCCTCAACGTCAGCCACGCCTTCCACTCCCCACGTCTGGACTCCATGCTGGAGAC61741GTTCCGCCGGGTCGCGGAGACGGTGGAGTACCGCGAGCCGACGCTCGACGTGATCAGCGG61801CCTGACCGGCCGCCCGGCCGACGCCGGGGAACTCGCCACCGCCGACTACTGGGTCCGGCA61861GGCGAGGGAGACCGTCCGGTTCCACGACGGGGTGCGCGCCGCGCACGCGCGCGGCGTCAG61921CACCTTCGTGGAGCTGGGGCCGGACGGCGTGCTGTGCGGCCTGGCCCTGGAGACCCTGGC61981GGAGGAGACCGACGGGGAAGCGGCCGCCGAGACGCCCGGCCGGGCGCGGGCGGCGCTGGT62041GCCCGTGATGCGTCGGGAGCGGCCGGAGGGCAGTACCCTCCTGACGGCGCTCGCCACGGC62101CCACGCGCGCGGGGCGGAGGTGGACTGGTCCCGGTTCTACGCCGACACCGGCGCCCGCCA62161CACCACACTGCCCACCTATGCCTTCCAGCGCCAGCGGTTCTGGCTGGAGACGGCGGCCCC62221CGCCGCGCCCGCGGCGGGCCAGGGGGCCGGACCGGCCGACCCGCAGGACAGCACCGGTCC62281GGCCGCCCGGCCCACGCTGACGGAACAGGACCTCCTCCTGCTCGTGCGGACGGAAGCGGC62341GGCCGCACTCGGCCACGCCGAACTGGAGGACGTACCGGCCGACAGCCTCTTCGGCGACAT62401CGGCTTCGACTCGCTCGCGGCCATCGAACTGGGCGCCGCCCTGACCGGCGCCACCGGGCT62461GGAAGTGCCGTCGTCCCTCGTCCTCGACCACCCCACGCCCAGGGAGCTGGCCGCGCACCT62521GGCAGCCGCCCGGACGGCCGCCGACAGCGACGACACGTCCCCCGAAGGCCCGGACACGGC62581CGGTGAGAGCAGCCTGTCGGCGATGTACCGGCGGGCCGTGCGGCTCGGCCGGGCCGAGCC62641GTTCATCGGCACACTCGCCGAACTCGCCGCCTTCCGGCCCGTCTTCCCCGCCGATCACAC62701CCTCGCGGACGGCGAGACCGTCGGACAGGCGGCCGCCGCCTGGCAGCCGGCTCCGGTGCG62761CCTGGCCACCACGGACGGTGAGGGACCGGAGCTGATCTGCTGCGCGGGTACGGCGGTGGC62821GTCGGGACCGGAGGAGTTCACCGCGCTGGCCGCGGCCCTGGGCGACCGGCTGACCGTGTC62881GGCACTGCGCCAGCCCGGCTTCCGCGCGAACGAGTTGCTGCCCGGCTCCCTGGACGGGCT62941GCTCGACGCGCAGGCGGACGCGGTGCTGCGGCACACGGGTGACAGGCCCTACGCCCTCCT63001CGGCCACTCGGCGGGCGGGGCGCTGGCGCACGCGCTGGCCTGCCGACTGGAGGAGCTCGG63061CGCGGGTCCCGCGGCGCTGGTCCTGGCCGACGTCTATCTGCCCAGCTCGCCGGGGGCGAT63121GGGGGTGTGGCGCAACGAGATGCTCGACTGGGTCATGCGGCGTTCCGTGGTGTCCATCGA63181CGATGCCCGGTTGACGGCCATGGGCGCCTACAACCAGATGCTCCTGGAGTGGACACCGCG63241GCCCACGAAGGCGCCGGTCCTGTTCCTGCGCGCGACGGAGCCGGTGAGGCCGTGGTCCGG63301AGAACCGGAGAGCTGGCGGGCGCACTGGGACGGCGGCGACCACACCGCCGTCGACGTGCC63361CGGCACCCACCTGACGCTGATGACCGAGCACGCCCGCCACCTCGCGGCGACCCTCCACAC63421CTGGCTCGGCACCCTGTGAACCACGCCCGGGGCGGCTTCGCCGCGCGTAGGACTGCCGCC63481TCCCCCGACTTCCGTACACCGCGACACCTTGGAGGACTCCCGTGACAACGCAGTGGACCA63541CCCCGTCCGTGCTCGGCCGCAGACTGCAACGCACCTACGTGGGGCACTGGTTCGCAGGAA63601CGCAGGGAGACCCCTACGCGCTGATCCTGCGCGCCCAGCGGGACGACACCACCCCCTACG63661AGGAGGACGTCCGCGCACGCGGACCGGTGTTCCACAGCGAGGTGCTCGACACCTGGGTGA63721TCACGGACGGCGCTCTCGCCCGGTCCGTCCTGACCGACGCCCGCTTCGGCGGGCTGACGC63781GCGCGGGAGGACGGTATCGCGCGGAGCTTCTCCCTCCGGCGGGCCCCGAGGTCGGTCCGG63841CCCGCGCAGGGGTACGCGGCGGCGTGCGGGCCGACGCCGATCCGGCGGTGTCGGCGCAGG63901ACGAGGTGGTGGTGGAGGCCCTCGCCGAGCAGCTCTCACGCACCCTCCTGGGCGGACTCG63961GCGACGACTTCGACCTCGTCGCCGCCTTTGCGCGACGCCTGCCGGCACAGGTCCTGGCGG64021AATTCCTCGGGCTGCCCGCAGCCGCGCGCAGCCGGTTCGAGGAACTGCTGGCCGGCTGCG64081CCCACAGCCTCGACAGCCGGCTCTGTCCGCAGACGCTCGACATCACACGGACCGGCCTCG64141GAGCGGCGGCCGAGCTCCGGGAACTGCTCGCGCGCCACCTCGGCGGGAGCGGACCACGCT64201CCGCTCAAGCGGCAGTCTCCCTGGCAGTCGAGGTGGCCGCACCCGCCGGCGCGCTCATCT64261GCAACGCGGTCGAGGCGCTGAGCAGCTCTCCCGGGCAGTGGAACGCCCTCCGCCAGAACC64321CGGAGAAGGCCGACGCCGTCGTGGCGGAGACCTGGTGGCGACGACCGCCGGTGCGGGTGG64381AGAGCCGGATCGCCCAGGAGGACGTCGACGTGGCCGGAGTGCCCGTCCCCGCGGACGGGC64441ACGTGGCGATCCTCGTCGCCGCCGCCCAGCGCGACCCGGCGATCACCCCGGCCCCGACGA64501AGGACGACACCGGCACCCCCGGACAGGGCGACTGCGGCGTGCCCCTGGGGCTCGTCGGCG64561ACGCGCACGCCACCTCCGCCGCCCGGACGGTCCGCGCCCTCTGCCGCGGTGCGCTGCGAG64621CGCTCGCGCAGGAGGCACCGGGCCTGCGGCCGAACGGGACCCCGGTGCGCCTCAGGCGGG64681CACCCGTCACGCTCGGCCACGCCCGCTTCCCCGTCGCCCGGACGGGCCGGGGGACACCGA64741CCGACGCGGGCGCGGCATGAGCACCCGCGACGACCACCGACTGCCGAACGGGGAGACGAG64801CCGATGCGCGTCCTGATGACGTCGATCGCCCACAACACGCACTACTACCACCTGGTGCCG64861CTCGCCTGGGCCCTGAAGGCCGCGGGCCACGAGGTGCGCGTCGCCGGCCAGCCCCGCGTC64921ACGGACATCATCACCGGGTCCGGACTGACCGCCGTGCCGGTCGGTGACGACGAGGACATG64981ATGGAGCTGTTCGCCGAGATCGGCGGAGACATCACCCCCTATCAGGAGGGACTGGACTTC65041GCCGAGGAGCGGCCCGAGGCACGGTCCTGGGAACATCTGCTCGGACAGCAGACCGTTCTG65101ACCTCGCTGTGCTTCGCACCGCTCAACGGCGACTCGACGATGGACGACATCGTCGCGCTG65161GCCCGCTCCTGGCAGCCGGACCTGGTGATCTGGGAACCCTTCACCTTCGCCGGAGCGGTC65221GCCGCCCACGCCGTGGGCGCGGCGCACGCCCGCGTCCTGTGGGGTCCCGATGTCATCGGC65281CGGGCCCGGGAACGGTTCGTGGAGGCCAAGGCACAGCAGGCTCCCGAACACCGGGAGGAC65341CCGATGGCCGAGTGGCTCGGCTGGACCCTGGAGAGGCTGGGCCTCCCGGCCGCCGGAGAC65401GGGATGGAGGAGTTGCTGAACGGCCAGTGGGTCATCGACCCGGGCCCGGAGAGCGTCCGG65461CTCGACCTTCGCGAGCCGATCCTGCCCATGCGTTTCGTTCCCTACAACGGACCTGCCGTC65521GTCCCCGGATGGCTGTCCGAGAAGCCGAAGCGACCGCGCGTCTGCCTCACCCAGGGAGTG65581TCGGGACGCGAGACCCACGGCAAGGACGCCGTCCGCTTCCAGGACCTGCTCGCGGCGCTC65641GGCGACCTCGACATCGAGATCGTCGCCACCCTGGACAGCACCCAGCGGGAGAACCTGACG65701GAGGTCCCCGACAACGTCCGGATCGTCGACTTCGTCTCGATGGACGTGCTGCTGCCGAGT65761TGCGCCATGATCATCTACCACGGTGGCGCCGGCACCTCGGCGACGGCCCTCCTGCACGGC65821GTTCCGCAGGTCGTCATCGGAGCGCACTGGGACGTGCCGGTCAGGGCACGGCAGCTCGAC65881GACCTGGGCGCCGGCATCTTCATCCGGCCCGAGGACCTCGACGCCGCCACACTGCGCGCG65941GCGGTTCAGCGCGTGCTCACCGAGCCCTCCCTCCAGCGGGCCGCGGACCGGCTGCGGGCC66001GAGATGCGCTCCAACCCCACGCCGGCCGAGACCGTCACGGTGCTGGAGCGGCTCTCCCGG66061AGCCACCGACAGCCCCGCTGACCACACGCGGTACACGGTGCGGGCCCACGTGCCGGGGGC66121TCCACCGTCGCCGGCGGTCGTCGGATCGCCGTCCGGCCATGTCCCGGCACCCAAGGACGG66181AGCAGAGCAGAACATGGAATTCGAAGGTCAGGTCGCGCTCGTCACCGGGGCCGGCAGGGG66241GATCGGCCGTGCGACGGTCGTCCGCCTCGCGGAGGCCGGATGTGACATCGCCCTCCACTA66301CAACCAAGCGAAAGCGCAGGCCGAGGAAGTCGCCGAGCGCATCGCCGCACTGGGCCGCAC66361GGTCGAACTGTTCCCGGGCGACCTCTCCCGCCCCGAGACCGGGCGACAGCTCGTGGCCGC66421GGTGCAGCAGAAGTTCGACCGGATCGACATCCTGGTGAACAGCGCGGGCATCACACGGGA66481CAAACTCCTGCTGTCCATGGAGGCGGACGACATCCACCAGGTCATCGCCACCAACCTCGT66541CGGCCCGATGTTCCTCACCCAGGCGGTCGCGCTCACCATGCTGCGTCAACGCTCCGGGCG66601CATCGTCAACATCTCCTCCGCCGCCGCGAGCAGGCCCGGAAAGGGCCAGTCCAACTACGC66661CGCGTCCAAGGCCGGTCTGGAGGCCTTCACCAGGGCCATGGCGGTGGAACTCGGATCCCG66721CGGAATTCTCGTCAACGCGGTCGCTCCCGGCATCGTCAAGACCGGCCTGACCGAGGCTCT66781CCGCGAGGGGGCGGAGCCCGAACTCCTGGCCCGGCAGGTGATCGGTTCCTTCGCCGAACC66841CGAGGCGGTGGCGGAGGCGGTGGCCTACTTGGCGAGCCCGCGCAACACGCACACGACGGG66901CACGGTCCTCACCGTCGACGGCGGGCTCAAGATGGTGTGAGGCCCACCGGGCATCGGACA66961GCCGGTGGATCCGCCGGAGGCGGAGACCCGCGCACCGCGGGCGTCGACCCGCGCACCGCG67021GGCGCGCGTGGGGGCGCCCGCGGTGCGCGAAGGCCGCCTCTGGCTCGCGTCACCGGAAGA67081AGCCCGATTCCCCAGCGGGCGACCGTAGCCGGAGCGTTGGACCGCCCTGCTCCGCACGTC67141GAGCCCCACCGTGGTAGCGGCGGACATGCCCAGGGGGCGCACGCCCGGATCCTGCCGCAC67201GACCAGGCGCACCAGGGTTTCGGCGAACACGGCGGCCGACGCCGCGGGGTTGCCGCCGCG67261GCACGGCCGACGCGCGCTGGTCGCACGGGGTGAGCCCGCGGGAGCGGAGGGCGCCGGTTC67321GCTCAGGACGCCGGGATCTCCGCCGGGACGACGGACTCGGGGGCGCCGTCGGGGCCGGGC67381CTCGTGCCACCGTCGGCGTCGAGCTGCGGCGGATCGGAGAGCTGCTCGCGCACGTACCCC67441CACACCACGGCGACGAGGGCGGCGACGGGCACGGCGAGGAGGCTGCCGACGATGCCCGCC67501AGACTGCCGCCCAGGGTGACGGCGAGCAGGACGACGGCGGCGTGCAGTCCGAGTCCGCGG67561CTCTGGATCATGGGCTGGAAGACGTTCCCCTCCAGCTGCTGCACCACCACGATGATCGCG67621AGCACGATCAGGGCGTCGGTCAGGCCGTTGGAGACCAGGGCGATGAGCACGGCGACGAAT67681CCTGCTAACAGCGCGCCGATGATCGGCACGAAGGCGCTGACGAAGGTGAGTACGGCGAGC67741GGCAGGACGAGCGGGACGCCGAGGATCCACAGGCCGATGCCGATGAGGACGGCGTCGAGG67801AGGCCCACCGCGGCCTGGGAGCGGACGAAGGCGCCGAGGGTGTCCCAGCCGCGTTCGGCG67861ATCGTGGTGGCGTCGGTCGCGAGGCGGCCGGGGAGCTGACGGGCGAGCCAGGGCAGGAAG67921CGCGGCCCGTCCTTGAGGAAGAAGAACATCAGGAAGAGCGCGAGGACGGCCGTGACGACC67981CCGTTCACCACCGTGCCGACGCCGGTGGCGAGGGTGGTCAGCAGGGATCCGACGCTGTTC68041TGGAGACGGTCGGTCGCGGTGTCCAGGGCGCCGGTGATCTGGTCGTCGCCGATGTTCAGG68101GGCGGACCCGCGGTCCACTCGCGGAGACGCTGGATGCCCTCGACGACACCGTCGGCCAGT68161TCGCCGGACTGCGAGGCGACGGGCACGGCGATGAGCGCGACGGTGCCGGCGGTGACGGCG68221AGGAAGAGGACGGTGACCACCGAGGCGGCGAGCGCCGGCGGCCAGCCGAGACGGCGCAAC68281AAGCGGGCGAAGGGCCAGGTCAGCGTGGTGATCAGCAGACCGATGACGAGTGGCCACACG68341ATCGACCACATCCGGCCGAGCAGCCAGATCACCGCCGCCGTGCCCAAGAGCACCAGCAGA68401AGCTCCGTCGATATGCGGGCCGAGGCCCGTAGCGCGGCACGTGTTCTCGCAGGACTGAGA68461GAGGCAGACATGGCGATCACCCTAGAGCGGCCCGGGCGGCCCGCTGCCCCCGTGCCCCGA68521TCCTTCGCGCCGGGGTGACGCGCATCGGGGGTTCCGGCACTGCCTGAGCGCCTGCACGGA68581CGGGGCGGGTTACGCCGAGGGGAAGCAGCCGGCTCCGGATCGACAGGAGTGCGGGTGACG68641GTCGTACACCGGCGCTACGTCGCCCACTCCGCGGCGCACAAGGGCGGTCGCCGTCTGGCC68701CCTCCCCGCGTCCGACGACCGCCCACCCCTCGTCAGGGAGCGGGGTCCGGGGCGAGGTGG68761ATACGGGCCGCTACCGGGAGGTGGTCGCTGGCCGTCGCGGGAAGGGTCCATGCCGAGGCG68821GCCCGGACGCCCCCGAGAAGGATCTGGTCGATCCGGACGACCGGCAGGCGCGCCGGCCAG68881GTGAAGCCGAAGCCCGCGCCCGCCGCGGCCTGTGCGGAGACGAGACGGTCGGTGAGGGGG68941CGCAGTGCGCGGTCGTCCGTGGAGCCGTTGAGGTCGCCCAGGAGGACGACGCGCCGGACG69001GGCTCGGCCCGCACCTCCGCCGCCAGCAGCCCCAGCGCCTCGTCGCGCGCCCCGGCGGTG69061AATCCGCCCGGGCCCACGCGGACGGACGGCAGATGGGCGACATAGACGGCCAGCGGCCCG69121CCGGGCGCGTCCACCGTGGCCCGCATCGCCCGCGTCCACGGCATGATCGGCACAGCCCGC69181GCGTCGCTCAGCGGATGCACGCTCCACAAGCCCACCGTGCCCTCGTAGAAGTGGTACGGG69241TACGACTCCGCAAGGGCCCGCTCGTAGGCGGGCGCCGTGGCCGGGCTCAGCTCCTCCAAC69301GCCAGTACGTCCGCCCCGGCGGCGAGCAGGCTCCGCACCGTTCCGGCGGGGTCGGGGTTG69361GCCTGCTCGACGTTGTGACTGACCAGCGTGAGGTCCCCACCGGGCGTCGTCTTGTCGGTC69421AGCGCTCCGCCGAAGGCCGTCAGCCAGGCCACGGCCGGCACCACCAGGGCCACCACCGCC69481GCCACCCGCGCGCGCCACAGCAGGGCCGCGGTCACCAGCACGGGCACGGCCAGCGCGCTC69541CAGGGCAGCAAGGTCTCTGCCAGACTGCTCAGCCGTCCGGGCAGTCCGGGGAGCCGCCCA69601TGGCCCGCGACGCTCACCGCGAGGAAGACGGAACAGCCCGTGACCGCCGCCCCGCGGCGT69661CGCCACCACCGCCGGCGCGGACCGGCGTCCCCCACGCCGACCGCGGCTCCGGGAGCGCGG69722CCCCTGCCGTCCGCCGTGCCGTCCGCAGCCAGGGAGCAGACGGTCTGTTCCGCCCTGCCG69781TCCGCCGCAGTGGAGGGGGCGCTCTCCGCTGCCCCGCTGTCCGGCGCCGCCCGGGAGGGG69841ACAGTCTGTGCCGCCCTGCCGTCCGGCGCCGCGAGGAAGACGGAACAGCCCGTGACCGCC69901GCCCCGCGGCGTCGCCACCACCGCCGGCGCGGACCGGCGTCCCCCACGCCGACCGCGGCT69961CCGGGAGCGCGGCCCCTGCCGTCCGCCGTGCCGTCCGCAGCCAGGGAGCAGACGGTCTGT70021TCCGCCCTGCCGTCCGCCGCAGTGGAGGGGGCGCTCTCCGCTGCCCCGCTGTCCGGCGCC70081GCGGGGGAGGGGACAGTCTGTGCCGCCCTGCCGTCCGGCGCCGCGGGGGAGGGGACGGGC70141TCCGCCGTCCCGCCGTCCGGCACCGCGGCCGGATCCCGGCGTGTCGTCGCCGTCATCGTT70201CCCCGCCCTGGGTTCCGGCGGCGGCCAGCCGCTCGCGGACGGCGGTGAGCAGGCCACGGG70261CCGCCTCGACCGCGGCCCGGAGCCCCTCGGCGGGCGTCGTGTTCGCCCGCTGGTGCAGGA70321CGGCGCCCACCAGCAGCCGGGGCGCGCCGCCCACGCGCACCTCGTAGGCCCACAGGAGGT70381TCCCGCCCGCCGGGGTGCTGGAGCCCGTCTTCACACCGAGGACGCCCGGGGTGTCCAGCA70441GGGGATTGGTGTTGGTGATCGTCCCGAGGCCCGGGACGGTGGTCTCGCGGGTGGCGACGA70501CCGCCCGGAAGACAGGGTCCTCCATCGCGGCCCGCGTCAGCCGCACCTGGTCGGCCGCGG70561TGCTTGTGGTCGTCGGCTCGATGCCGCTCGCCCCCGTGTAGACCGTGTCCTTCATCCCGA70621GCCGTACGGCGGCGCGCCGCATCTTCGTCACGAAGGCCGCCTGGCTGCCGGAGTCCCAGC70681GGGCCAGCAGACGGGCGACGTTGTTGCCGGAGGGGATGAGGAGCAGTTCCAGGAGCCGGC70741GCTGGGAGTGGCGTTCGCCGGACCGTACCGGGACGGTCGACTCGCCGCCGACCCCCGCCT70801CGTGCGCGGCCGTCCGGTCCACTTCGATCAGGGGGCCGTCCTCGTCGGGCCTCAGCGGAT70861GCTCTTCGAGGATGACGTAGGCGGTCATCACCTTCGTCAGGCTCGCGATCGGTACGGGCC70921GCCGCTCGCCCCGCTCGCCCAGCGAGCCGGTTCCTTCGAGCTCGACGGCGCTCTGGCCGT70981CCTGCGGCCAGGGGAGCGGCCCGATGTCGGAGACCGGTAGTCGCTCGCCTCCCGCCGCCG71041GAGGCGCGCCGGAGGGCGAGGCGACGGTGATGCCCAGGGCCGTCAGCAGGGCCACGGACA71101GGGCACCGCCGACGAGGCGGTGACGGGGGGTGCGGGGCAGGGACACGGGCCGCCTCCAGG71161GGCTGCGGTACGGGATCGGTACGGCAGCAAGACTCCGGGGGCTACGTCTCCGCCTCACGG71221TCGGGAGAGCGCGGCCCGCGCTCGGGAACCCATCGGTCGTGTATCGGCGGGGCGGCGGCG71281ACCGGCCGCCGGGCGACGCGGAGGGAGCGCCTGAGGGGCGGGGCGTACCGACAGGCGACC71341GTCTGGGGTGGGGAGGCCCGCGGGCTGTCCCGGGGACCGGTTCACGCCTCGGACGTCTGC71401CCGTCCTCGGGCAGGCTCAGGGTCGCGACCGCTCCTCCGTCCCGGGCGTTGGCGAAGGCC71461AGGGTCGCGCCGATCACCCTCGCCTGGCCGGACGCGATCGTCAGGCCAAGGCCGTGCCCG71521TGCCCCCGTTCGGCCGAGCCCGTGCGGAACCGCTGGGGGCCGTGGGACAGCAGGTCGGCG71581GGGAAGCCGGGGCCGTGGTCCCGGACGGTCACGGTCCGGCCGGCGACCGTGACCTCCACC71641CGACCCGCGCCGTGCCGGTGGGCGTTGACGACGAGGTTGGAGACGATGCGGTCCAGGCGC71701CGGGGGTCCGACTCGACCACTGCCGCCCCCTGTCGCGTCACCTGCGCCGCGAGGCCCGTC71761CGCGCCACCGAGTCCCGGACGAGGGCGCCCAGGTCGACCGGTCCCTGCTGGGCCGTCTCG71821GCGCCCGCGTCGAGCCGGGAGACCTCCAGGAGGTCCTCCACCAGGTCGCGCAGCACCCGT71881ACCCGGCTCTGGACCATGTCCGTCACCTCGCCCTCGGGCAGCAGTTCCGCCGAGGTGACC71941AGGCCCATCAGCGGGGTGCGCAGCTCGTGTGCGACGTCGGCGGTGAAGCGCTGCTCGGTG72001TCGATCCGCTGCTGGAGGCTGTCGGCCATCGAGTCGACCACGGCCGAGATCTCGGCGACC72061TCGTCGCCTCCGCGGACCGTTCCCGTCCGGGCGTCGAGGTCGCCCGCCGTGATGCGCCGG72121GCCGTGCGGGCGACCCGGCGCAGCCGCCGGGCGGGGAGTTCCGTCGCCAGGGCCGTGGCG72181GGGACGACGACGCCCAGGGTGAGCAGCGAGTACTTCCACATGTGACGGTCCAGGGCCTGC72241CGGGTCAGCAGGTCGGCGGTCATGTCGACCTCGACCGCGTACAGCTTCCCGCCCTCCCGG72301CGGGCCGCCCGGAAGACGGGGGCCGGGGGCCCGTCCTCGTAGAGGGTGGCCTCGCCGCCG72361TGCTCGATCTGCCTCAGCAGGGCCTCGGGCAGTTCCTCGGGGGACACCCGGGGCCCTTCC72421TCACCTGCTGCGTCGGCGTCCTCCAAGGCCGTTGCCAGCGCCACGTGGGCCCTGCCCGCG72481CCCTCGTGCAGGGAGCGCCGCAGCACCGAGTCGTGCACCAGCACTCCGACGGTCAGCGCC72541ATCGAGGCACAGGCGAGCGCCACCAGGAGGACGATCTTCCAGCGCAGCGAGCGGGAGCGC72601GGTACCAGGCCCGTGACGGCTCCCCGGGGGGCCCGGCTCACCGCTTCCACTTGTAGCCGA72661AGCCCCGGACCGTCTCGACGCGCTCGGCGCCGATCTTCTTGCGCAGCCGCTGCACGCACA72721GGTCGACGACCCGGGTGTCCCCGTCCCAGCCGTAGTCCCACACCTCGCGCAGGAGGGTCT72781GACGGTCCAGCACGATCCCGGGGTGCGCGGCGAACTGGAGCAGCAGCCGCAGCTCGGTCG72841GGGCGAGCGCGATCCGCTCGCCGCCCCGGCGGACCTCCAGGGCCGCGGGGTCGAGGGAGA72901GGTCGCCGAAGAGCAGCGGGCCGGCCGGCGTCGCGGGGTCGGCGGGTCCGGGGGCGGGGG72961ACACGAAGGCGGCCCGGCGCAGCAGCGAACGGATGCGCGCCACCAGGACGGCGGTGTCGA73021CGGGCTTCACCACGTAGTCGTCGGCCCCGGCCTCCAGGCCGGACACCACGTCGAGGGCGT73081CACCGCGCGCCGACATCATCAGGATCGGGTCCGTGGCCGTCTCCCGGATGCGCCGGCACA73141GTCCGATGCCGTCCAGGCCCGGCAGCATCACGTCGAGCAGCACCAGGTCGTGCCGTCCCT73201CCCGGAAGAGTTCGAGCCCGGTCAGTCCGTCGGCCGCGACGCGCACGGGGTAGCCGTAGC73261GCTCCAGGGACATCGCGACGGACCGCCGGATCACCTCGTCGTCCTCGACCAGCAGGACGG73321TCACGGTCGCAGGCGCGGGCGCCGACACGGAATCAGACATGTCCATCTCTCGGGCACGGG73381GCGGGGCGGGCCGACGCGCGGCCCGGCCCTCCATCATGCCTCACTCGCGCGCCTCCCCCG73441GCGCCGCCGGAGAGCCGTCGGCACGCCTCTGAGCTGGTCTGATACCTGACTGATACATCA73501TGGCGCGGTGACCGACACACGCCCGAGCGGGGACGGACACGGAGGCGGCGCGCCGGACAC73561AAGATGCCGCGGGAGCCGATCCGGCGCCCACCGAGGCCGTGTGAACGACGCCGTCCGGAA73621CCGCACGCCCGCGGGGGCTCGTGGCGGGTGTCAGTGGTGTGCGGCAGGCGCGGGCAGCGT73681CTCCGGAGCGGGGTCGGTCCTGCGGCCGAGGTGGTTGAAGGCGAGGTTGAGGAGGACGGC73741GACCACACATCCGGTGCTGATGCCTGAGTCGAGGACGATGCGGGCGCCTTCGGGGAAGGC73801GTGGTAGAAGTCCGGGGCGGCGATGGGGATGATGCCGGCGCCCAGGGAGATGGCGACGAT73861GAGGACGTTGTCGCCGCGTTCGAGGGCGGCTCCGGCGAGGGTCTGGATGCCGCTGGCGGC73921GACGGTGCCGAAGAGGGCGATGCCCACTCCGCCGAGGACGGGCTGGGGGATGAGCGCGAC73981GACGGAGGCGAGCAGCGGGCAGAGGCCGAGCAGGAGGAGGATGCCGCCGGCCGCGGCTAC74041GACGAAGCGGCTGCGGACCTTGGTGATCGCGACGAGGCCGACGTTCTGGGCGAAGGCGCT74101GGCGGCGAAGCCGTTGAAGAGCGGGCTGAGGGCGGTGCCGAGGCCGTCGGCGCGGAGCGC74161GGCGGCCAGGGTCTTCTCGTCGGCCGGCCGCTCGACGATCTCACCGAGGGCGAGGACGTC74221GGCGGTGGACTCGGTCATCGACACGAGCATCACGATGCACATGGAGATGATCGCCGCGGC74281CGCGAACTGCGGAGCGCCGAAGTGGAACGGGGTGGGGAGTCCGATGACGTCGGCGTCGCC74341TACGGCGCTGAAGTCGGCGACGCCCAGCGGCAGCGAGAGGAGCGTGCCGGCGACGAGGCC74401CAGCAGGATGGAGATCTGCTTGAGGAAGCCCGTCAGGACGCGGCGCAGGACCACGGTGAT74461CAGCAGGGTGGCGGTGGCGAGGCCGATGTACGTGAGGCTGCCGTAGTCCGGTGCCTGCGC74521GTTGCCGCCCTGGGCCCAGTTGAACGCGACGGGCAGCAGGGAGACACCGATGAGGGTGAT74581GACCGTGCCCGTGACCACGGGCGGGAAGAAGCGGATCAGTTTGCAGAAGAAGGGGGCGAG74641GAGGAATCCGAAGACGCCGGCGACGATCACGGCGCCGTAGATGACGGGCAGCGCGTCGTC74701GGGGCCTTCGGCCTTCGCTATCGCGAGCATCGGTGCGACGCCGGCGAAGGAGACGCCGTT74761GACGAACGGGAGCCGTGCTCCGACCTTCCAGAAGCCGAGCGTCTGCAGGAGCGTGGCGAT74821GCCCGAGGTGAACAGGCTGGCGCTCATGAGGAACGCGATGTCCGCGGTGGACAGGCCGAC74881GCCGATGCCGACGACCAGCGGCGGCGCGACGACGCCGGCGTACATGGCGGCGACGTGCTG74941CAGACCGGCGCTGAAGAGCTTCAGCGGGGGCAGCATCTGGTCGACCGGATGGGTGTCGCC75001CGGTCTTGCGGGGGTGTCGGGACGAGCGTCGGTTCCGTCCTCGGTCTTCGTCGGTTCCGG75061GGCGGGGCGGGATTCCGGCATGGCGTGTCTCCTGGCCGCGGCGGTCTCGTGGGAGCGCGG75121GCAGCCCTGTGGGAGAGCCGGCGTCGCTGACGCACGGCTCTGCGGATGGGGGTGGTGACG75181GGGTGGGGAGGATGCGGCACCGGGCGTGGGACGAGAGGCCGTCGTGCCTGGGTGCCGCGG75241CCGGAGAGCGGGGAGTGGGGCGCCGGAGAACGGGCGGGAGGGATTCGTGTCCCGGGGTTC75301CCGCGGTCCTCCGGCCGGCCGGCGCGCAGCTGTCCGTGATGCGCGTCGTGCCACTCCGTC75361CGGCGAAGTCGGAACAGTTCCTCACGGGGCGGCGTGGGTGTCAAGAGGCGGGACGGGCCG75421GGTTGAGGACCTCCCGGACGTGGTGCCAGAGGTGCAGGTCGATGTGTTCGAGGAAGTCCG75481CCGCCCGGGCCTCCGCGCGGGCCGCCTGACGGGGCGGGCCGCCGGCCGCGCCCTCCCATT75541GCTGGTAGGTGAGGTGGAGGCGTTCGAGGGCGTGGCCGAGTACGGCGGGGTCGAGCGGCA75601CGATGCGGGAAAGGGCCCCCACGTAGGGGGGCCACCAGGTGGTGGCGGCCTCTCTGAGCA75661GGACGCGGAGCCGGCCGGTCCTGCCGCACGCGGCGACGAAGCAGGCCGGTGGCGGGCAGA75721GGACTCTCAGGAGGGCGTCCGTCTGGGCGGGGGTCCCCTGCCAGCGGTTCCGGCGTTCGG75781CGTCCTCCCACTGGCCGCGGGTGAGGCCGAGCTCGGTCGCGGTCTCGTTGACGGTGAGTC75841CGGCGACGGTACGGCACTGGGCGAGGGTGCCGGGCTCTTCGACGAGATCGACCGGAGAGC75901ACCAGAGGGCCGCCGCGAGGGCCTTCACCTGGGTGTTGTCGGGGGTGGCGGTGCCGGCTT75961CCCACGCCTCGACGAGGCCGGGGTGCGCGGGGTGTCCGCAGTAGGCGGTGACGGCCCAGG76021CGACCTGGCCGAGCGTCAGGCCGAGCTGGGCGCGGACGGCCGCGGCACGGGTGGGATCGA76081AGCGGGGCAGGGCATACGGATCGTCTGCGGGTGGCATAGCGGGACACCGTAGGGACCCCG76141CGGTCCACGGCCAAGGGCCGGAACGGTCTCGATGTCGGCTCCGCCGCCCGGCGTTCGGCT76201TCTTCGTCGGATCTCCTGAGCGGGCAGACCGTCGATGACCTGCTCTTTCATGGGAGGAGG76261CGGGCAAGCGGAGAGAAGAGGAGGCTGAGGCGACTGTCGGTTCCTCCAGAATCCGCGAGG76321AACTGGCGCCTTCTCTTGGGGGAGTTGACGAGGTTCAGGCCGGGCCATAAAGTCCCGTCC76381CGTCCACAACGGAAAACTTCTTCCGTCATGCGAAACCTGTGGTGACAGCCGCCCACCCCC76441GCGGCTCGCCCCCATCGGCCATGCTCCCGGTTCGGGCCGTCCCCCGGCACCGGTCACTCT76501GCCCACACGCCCCCGGCAGTCCGCTGCCCGCGGGCGGTGTCTCAAGGACCTGCCTTGCTG76561CCCCGCGCCACTGACCACGCATCGCCGACCCCACCCCATCCGGTCGACGAGATCCTTCCC76621GCCCGCCGCATGCTGCCCGCCGCCCTCCAACACGTGGCGAGCATGTACCCCGGCCTGACC76681GCACCACCGCTGATCATCAGCAGCGCCCTGGGGCTGACCCCGGCCCAGCTCTCCGCCCTC76741CTGGCCGCCGCGCTGCTGATCGCCGGGCTCGGCACGATCGCCCAGACCCTCGGCGTCTAC76801GGCGTCGGCGCCGGGTTGCCCCTCGTCAACGGCGTCTCGTTCGCCGTCGTGTCACCGGCG76861CTCGCCACCGCCGCCACCCAGGGGCGCGACGGCGCCCTCCCGGCGATCTTCGGGGCCACC76921CTCGTGGCGGGACTCCTCTGCCTGCTCCTCGCTCCCGTCTTCTGCCGACTGGTCAGGTTC76981TTCCCACCGGTCGTCAGCGGCTGCGTCATCACCCTGGTCGGCATCTCCCTCCTGCCGGTC77041GCCGGCACCTGGGCCCGGGGCGGAGACGCCGAAGCCGCCGGCTTCGGCTCCCCCGCCGAC77101CTGGCCCTGGCGGCGACGACCCTCGTCATCACCCTGACCGTGCACCGCATGCTCTCGGGC77161CGCTTCCTCGGGCGGGTCGCCATCCTCATCGGCATGCTCGCGGGCACCCTGATCGCGATC77221CCGCTGGGCAAGGTCGACCTCGACCCCCTCGCCCAGGCGCCCCTCTTCGCCCTGCCCACG77281CCCTTCGGCTTCGGCACCCCGCAGTTCGTCCCCACCGTGATCGCCACCGCCGCGGTCGTG77341ATGATCGTGTCCATGATGGAGTCCACCGCCGCGCTGCTGGCGCTGGGCGCGGTCGCCGAA77401CGGCCGGTCCGGGACCGGACCATCGCCGGAAGCCTCCGCGCCCTCGGCCTCGCCACGGTC77461CTCGGCGGCGTCCTCGGCTCGTTCACCAGCACGTCGTACGCGCAGAACGTCGGCCTGGTC77521TCCCTCAGCCGGATCCGCAGCCGCTATGTGGTCACGCTCTGCGGCGCCGTCCTCGTCCTG77581ATGGGCTTCGTGCCCGTCCTGGGCTCGTTCGTCGCCCTCGTCCCTCTGCCCGTCCTCGGC77641GGTGCGGGGGTGGTCTTCTTCGGCTCCGTCGCCGTCACGGGCATCCGTACGCTGGCCAAG77701GCCGCCCTCGGCACCGGACACAACGCTGTGATCGTCTCCGTCACCCTCGCCTTCGGTCTC77761TTCCCCGTCCTGGACCCGGACTTCTACGCCCGTCTTCCCGCCCCGGTGGCGACCGTGCTC77821GGCTCGGGGATCACCGCCGGCTGCCTGGTCGCGGTCCTCCTCAACTACCTCCTGAACCAC77881CTGGGCCGCGGCACCGAGGCCGACCCCGACGCGATCTCCGCGGAACAGGTCACCGCCCTC77941GACACCGCGGACACCGTCCTCGGGCCGAAGCGTTCCTCCGACTGGACGCCCTTCCAGCCC78001TCCGGCAGCCCCTCCGGCACCCCTGACCACGGCCGTCACACCAGGGGCACGGCACGGCCC78061GCTCCCGCCTGGCCCTACGTGACCGGCCCCGTGGACCCCACCGACACCGGGCGGCACCAC78121CGGCCGCACGAGGTTCCGGCGCCGCCCCACCGACCAGACGAGGTGCCCCCGCCGCTCCAC78181CCCTCTGCCGCTCACGAAGGCGAACCCCCGCCCGCCGTCACCGAGAACGCGGTCTTTCCC78241GGACCGCTCCACCCGCTCCACCCGCTCCACCCCCGGCCCACCGGTCGTCCCGACCGTCCC78301CGGCAACGGCACAGTGCGGAGGCCGACCCCTGGCAGCATCCGCAGACCCCCTCCGCATCC78361GGCGACAGCCAGTAGAGACGACCTCCCCCGACCTCTTCGCAGAGCCCGGCATCGCACAGG78421CCCGGCGGAGAGGTGGCGCCGACCCACCCGACCACCGCCGGAAGCGCCCCCGGGGACCCG78481TGTCCCACGGATTCCCCGGCGACAAGACGAGGTAGCCCCGATGACCACCGTTTCCGCCGC78541CCGCCACCGTGCGGGCGGCTCCCCGCGCGGCGGCACGTCCCGCCCGGGCCCCGACGAGAG78601AATCGCCCAGGTCGTGGCCGAGGCCCTCGGATCCGCCCGGACGGTCCTCGACCCGGATGC78661GCTGCCCGGCCTCGGCACCACGCGACTCCCGTTCGGCGACGGGAGGTTCGACGCGGCGAT78721GATGCTCTGCAACGCCCCCGGCGTCCCCGACGCGCTCTCGCGGCTCGGGGAACTGCGCCG78781CGTGACACGGGGCCCCGTCGTGGTCCTCGCGACCGACCCCTCGCGCGTCCGCTCGTTCTG78841GCTGGACCGGTACGCCCCCGAGGTCCTGGCCGTCGAAGCGCGGCGTCATCCGCCGATCGC78901CGATCTGACCGCCGTCCTCGGGGGTTCCGCCGAGGTGCGGAGCGTCCCCGTTCCCCTCGA78961CTGCACCGACACCTTCGACGAGGCGTACTACGGAAGACCCGAGAAGCTCCTGGACCCGTC79021GGCCCGCCAGGCGGGGTCGGCCTGGAGCTTCGTGGACGACCGGGTCCGCGAGGAGTTCGA79081CACGACCCTGCGCCGCGAACTCCGGTCGGGGGAGTGGGACGAGCGCTTCGGCCACCTCCG79141CCGCCGGCCCGTCTACGAGGGATCACTGGTGATCGTCCGTGCCGTCCCCTGACGTCCTCC79201CGGGGACGCGCCACCCCCGGGGTCGCGTCAGCGGTCTCCGGCCAGGTGCCCCGAGAGCTC79261GCGCATCAGCGTCTCGTGCTGCTGGAGCAGGTAGAAGTGGCCGCCGGGCAGGACCCGTAC79321CCGGAAGCCCTCGGGTGCGACATCCGCCCAGGCGTCCATGTCCCCTACCGCCACGTTCGG79381ATCCGTGTCGCCGATCCAGGCGTGCACCGGACAGCCGACGGCGGTGGGGACGCGGGGGCC79441GTAGGTGCTCACCACGGTGAAGTCCGCCCGGACCGCGGGCAGCACGAGCTGTCGGATGTC79501CGGGTCGTCAAGCAGGGCGGTATCGGTCCCGCCGAGCCCGCGCAGTACGGCGACCAGCTC79561GTCGTCCCCCTTCCGGTGCAGGTCGAGCGGGGTCAGCCGGTGCGGGGCCTTGCGGCTGGA79621GACGTGGAGGGCGGCGGGCGTGACGCGGTGCCGCTCCTCCAGGCGCAGCGCCACCTCGTA79681GGCCAGGGACGCGCCCATGCTGTGGCCGAAGAGCGTCAGGGGCACGTCCGCGAGCGGCAG79741CAGTGCCGCCGTGACCCGGTCGGCGAGCACGTCCATCCGGTCGACGAACGGCTCGTTGAA79801CCGCTCCTGGCGTCCGGGGTACTGCGCCACCAGGACCTCGGTGTCGCCGCCGAAGGCGCT79861GCCCCAGGCGTGGAAGAAGCTCGCGGAACCGCCCGCGTGCGGCAGGACCGCCAGCCGCCG79921CCGGGGCGCGGGAGTGCTGGAGTACCTGCGGAACCACGTCGTGCTGTCCGTGCCGGTCGT79981CATGTGTGCGTACACCCCGTCCTCGGGTTCTTGGGGTGCCAGTGTCCCCGCAGGGCCCGG80041TGTCCGGACGCGGTGGGGGTCCGGTGGCGAGCCGCTTACGTGTCCCCGCGCTTCCGGGAC80101CGGCGGCCGCACACGTGTCGGCCCCCACGAACACCAGGGTGCGTGGGGGCCGATGCGTGT80161TTCGAGTCCTGGTCTGACGATTTCAGGCCGAAAGATATGTCGGACTTTACAGCTGCGATC80221GAAGCCGATCGATAATGCCGTGGACGGGTAACGTCGGAATCACTCGGTGCTCTTGAGCGC80281ACCACTCACGTTGACGACCTCGTGGCACTCCCGCGCCTGCTGTCCCGTCGCGGGCGTACC80341GGGCTTCGTGCAGGTCACGTCGATGGTCACGGTGTCCTTCGCGCCGATCCTGATGGGAGT80401CACCCAGTGGTAGTCCTGGTTGCGGAACGTCTCCAGCGCGATCGTGGTGATCTTGCGGTC80461CCCGAAGGTGATCGTCATCACCCCTTCGTCACCCTGGAAGTTCGCGACAACGATGTCCGT80521GATCCCGAACACGGTCTTCTCGGGGACCGTGTACGTCCCGGTCCTGGACTGTCCCGCTCC80581CGACCTCAGGTCGATGGTGGCCGAGCTCTGCCGTCCCCCGCCGGTCGCGGTGCCGTCGCC80641CGTGCCGGCCGAACCGTCGTCCCCGCCGCCCGCTCCACCGCCCGATCCCGGCTTCTGGCT80701GCTGCCGCCGGACGGCCGGCCCGGGCCGGGGGAGGAGCCGTCCGAGCCGCTCTCCGTCGG80761GACCGGGCGGGGCTGCACCGCCTGGGTGGCCGCCTCCTCGGCGGCGCTGCGCACGGCCGG80821CCGGACCAGCGTGAACCAGGCGATCAGCAGCGCGATCAGCGCCGCGAGCAGCAGGAGCAG80881CCACTTGGGGAAGACCGGTATCTGGACGAACTCCGCGTCCAGCGTCGGCGCCGTGTGGGG80941CTCCTCGGCGCGGTCCTCGGTCTGCTCCGGCTCGCCGGTCTCGCGGGCGTCCACGGTGAA81001CGGCCAGACCACGGGGTCGCCGAACCACACCGGGCTCGCGGTGCGGACCCGCAGCCGGAG81061TTCCTTCGACTCGCCCGGCTCCAGCGCCGGCTCCGCCGGCGTGAAGGCGAACCGGAGCTC81121CTCGCCCGCCTGCCCGGGCGTGAACCCCACCCGTACCGGGGTGTTGCCCTGGTTGCGGAC81181GGCCAGCAGATAGCGGCCCCGGAGCCAGCCGCGCCGGCGGCGCGGCGAGAGGTCGGTCCG81241CAGCTCGTGGAACGCGCCGACGCGCACCACGGTCTCCAGGACCTTGACCGACTCGGGCTG81301CTCGTTCGGGAGGATCCGTACACCGAGGGGCAGCTCGCCGGCCCGTGTCTCCGGCGAGCG81361CGGCGGTGCCAGACGGAGCGTCACCGTCTCGGACGTGCCGGGATAGAGGGAGAGCCGCTC81421GGGCTCGACGGTGGTCCATTCGGCACCGTCACCGACGACCTTCAGGTCGTACGCCTCGAC81481GATGTCACTGTCGTTGCGGACGGTCAGGGTGGTGGTGGCGATGTCGCCCGGCGTCACGGA81541CACGGCCGGGATGTCGAGGCCGGGCGCACCGGGACCGGAGGAGGCTGCGGAAGGCGTCAC81601CCGCCCCACCGTAGGAGACCTGACAGATCCGTACGAGGCACGCGAGGGCAATGTCCGGGC81661AGCTCGGCTGCCCGGCAAGCACAAGTCAACTCTCCGGTAACAATGGATTTCTAGTCTGGA81721GAGCCGCCTTCGGCACACCACCGGCCCGTGGTCGGCTCGTGTCGTGTCCGCCTTCCCCCC81781ACCGACCCAGGAAAACAGGTATCCGATGTTCCGCACCGAGGAGAAGAGGCCGGTCGCGAC81841CGGCACTACGGCGCATGACGCCGTCCGGGGCCACCCGGACGCCCATGCCGCCGGCTTCGG81901CCGCCCGCGCCGCGTCACCGTGGCGGTCTACGCCGCCGACCCCGTGCTGCGGGTCGGCGT81961CGTCCAACAGCTCCGCCAGCGCCCCGAGACCGAGCTCGTCGACGACGCGGACGCGGAGAA82021CGCGCAGGTCTCCCTGGTCGTCGTCGACGCCCTCGACGACGACGTGACCGCCCTGCTGAC82081CCGGCTGAGCTACAACGGCGCCACCCGCGCGGGACTCGTGATCGGCACCCTCGGCGTCGG82141GGCGCTCCAACGCGTCGTCGAGTGCGGGGTGTCGGCGGTGCTGCGCCGCGCCGAGGCCGA82201CCAGGACCAGCTCGTCCAGCTGGTCCTGGCGGTGGCCAACGGCGAGGGCGTGCTCCCGGG82261CGACCTGCTCGGCGAGTTACTGGGACACGTCGGCAGCCTGCGCCGCGCGGCCCTCGACCC82321CGGCGCCCTGCCCCTCTCCACCCTCACCAGCAGGGAGGCGGAGATGCTGCGCCTGGTCTC82381GGAGGGCCTGGACACCGCGGCGATCGCCCGCAAGACCTCGTACTCCGAGCGGACCGTGAA82441GAACGTCCTGCACGAGATCACCACCCGCCTCCAACTGCGCAACCGCGCCCACGCCGTGGG82501CTACGCGCTCCGCAACGGGCTGATATGACCGTCCCGTCCGGACCGCGGCCCGGCGGCCGG82561CGCGACAGCCGGAGGGAAGGCGGCGCTGCCCCAAAGTGCATCCCGCCCTTCCCCCGGGTG82621CGGCCCCCGGGCCCTCCCGCCGCGTGCGCCGCCGCCGCACGATGACGGGGGGCACCTCCC82681GGTGCCGCACCGGACGGAGAAGGGCACCGTGATGAAGACCGCTGGCCCCGGTGGACGGCA82741CCGCCGGGGGAGACTCGCCTCGGCGCTCCTGCTGCTCGTCCCCCTGCTGGGCGCGACGGG82801CGTGGCCGGGCCGGACGACCCCCGGACCGCGGCGGCCGCGGCGGACGCCGCCGAGACCAC82861CCGCATCGCCTACGCGGGCACCGGCCACCGCAGCCTCGGCGAACCGGCCTCCACCGACTC82921CAGCACCCCGCTGTTCGGAGCGGGACCCACCCACTACGACACCGACCCGTCCGCCCTCGG82981CGACCGGCTGGTCTTCGCGAGCCGCCGCGACGAGAAGCACCCCCAGATCTATCTGCGGGG83041CGCCGACGGCGGAGTCCTGCGGCTCACCAGCGGCCTGGACGCGGCCCGTCCCCGGCTCAC83101CCCGGACGGCGGGTCGGTGCTCTTCGACGCCGCCGACCCGGCCGGCGGCTCCCAGCGCGA83161CCTGTGGCTGGTGCGCACCGACGGCACCGGGCTGACCCGGCTGACGGACACGCCCGCCAG83221CGAGGAGGACCCGGCGGTCTCCCCCGACGGCGCCCGGATCGCCTACTCCAGCGACGCCGA83281CCCCCTGGCCGGGCGGCAGATCTACGTCCGCGCCCTCACGGGCGGCATCCCCACCCGGCT83341CACCGACCCGGCCCGCGGCACGGCCTCCGAGCCCGCCTGGAACCCCGTCGACGACGACGT83401CAACCGCGCGTGGATCGCGTACACGTCGACCACGACCGAGGACGGGCGGACCAGGCAGCG83461GCTGCGGATCACCGACGGCACCACCGACGAGACCCTGTTCACCGGCGCGTACGCGAACTG83521GCAGGGCCACGGGGCGGCATGGCTGCCCGACGGGGACGGGATCGTGTTCCTCAGCCCCGA83581GACCACCTGCACCTGCAGGACCCCCTACGACCACGTCTTCCGGTCGGTCGTGCACGCCGA83641CCGGGAACCCTCCCTGGTGCTCGACGAGGACCGCGACGTCCTCTCGCCCACCTGGATCGG83701CACCGCCGAGGGCGGCCACGCGATCGTCGAGCGCAGCTCGGCGGCGACCGCGCACACGGC83761GACCCTCCAGGACATCCGCGCGGACGGTTCCGACCCGCGCGACCTGCAGCGGAAGATCCT83821GCGCGAGGACCCCCAGGCCGACACCAACACCGACCCCGCCAAGGATCCGCTCTTCCAGCC83881CGCGCCCCCGTTCGACCCGTGGACCGAACGGCAGAACTACACCCCCGACGGGCGCCGCCT83941CGTCCTCACCCGCTTCGAGGGCCCCGACGACGCGCGGATCGAGCGGATCTGGACGGCCGA84001CGCCGACGGTACGAACGAGGCGCCGATGCCCCTCGACGGGCGCGGCGCGCGGGACTGGGA84061CACCGACCCGACGTTCTCCCCGGACGGCACCCGCCTGGCCTTCACCCGCACCTCGCCCGG84121CGGGGTCGGCGAGGCCGCGGGAGACAGCCGCATCCTCCTCGCCGAGGTCGCCACCGGCCG84181GATCACCGGAGAGATCGTGCCCCCGGCCGGTGAACTCCGCGGCGGGGACGCCCAGCCGAC84241CTGGTCCTCCGACGGCACCACCCTGGCCTTCACCCGCGCCCGGCAGATCGCCGGGGGCGG84301CGGCAGCAAGCACGTGTGGACCGCGTCCACGGCTGACCTGACCCGGCAGCGCGACCTGAG84361CGCGACGCACTGCCCGCGCGACTGCGACGTCATCGACGACAGCCCCGCCTTCTCGCCCGA84421CGGACGCTCCCTCGCCTTCAACCGCAAGAACGGCGGCGGGCGGATCGACGAGCGCAACGG84481ACTGCTCCTGACCACCCTGTCCGGCGACGCCTGCCAGGTCCTGCTGCCCACCGCCGCCCG84541CGGCCAGGACGGCGCGTGCGAGCGGGAACTGCCGGACACCACGCTCACCGGTCCGCACCA84601GCCGCGCGACGCCGCCTGGACCGCCGACGGCAAGAGGCTGGTCTTCAGCTCCCGGGCCGC84661GGCCGCGGTCAACAGCCCGGAGAAGCTGAACGTCCTGGACGTCGGCTCCGGTGACATCAC84721CCCGCTCACCGCCGAGCTCGCCGGACGCCAGAAGGAACCCACCGTCCAGCAGTCCGTGGA84781CCTCGCCGTCGAGGCACCCGCCACGACGCCCGACGTCACCGTCGGCGCGTCCGGCACGGT84841CACCGTCCACGTGGTCAACCACGGTCCCGCCGCCTCGCCCGGCACCCGGCTCACCGTCGT84901CCCGCCGTCCGGTGTGCGGATCACCGGGATCGAGTGGCCCGGCGGCACCTGCGACGCCGC84961CTCCCTCCAGTGCGACCTGGGCGTCGTCGAGGCCGGAGCCCAGGTCCCCGTGGACGTCAC85021GCTCACCGGCGTCACCGCCGGCGACGCACCCGTCGACTGGTCGGTCACCGGCGCCGTCCT85081CGACCCCCGGCCCGGCGACAACGACGGCCGGAGCGTGATCCCCGTACGCGAGGCACCCCC85141GACGCCGACCCCCACGCCGACGCCGACCCCCACGCCCACCCCGACCCCGACTCCGACGCC85201GACCCCCACCCGGACCCCGACGCCCACCCCGACTCCGACCCGGCCCCCGCAGCCCCCCGC85261GCCGAAGGCCGGACCCGGGGTGCGGATCACCGTCCAGCCCGAGCCCGGCTACGTCGGCGG85321ACGCGTCGTCGTCACGTACAGCGTCCGCAACGGCCGCAACGCGCTCGCCACCGGACTCCG85381GCTCAGGATCGGACTGCCCGCCGGGGTGCCCCACGGCGGACTTCCGGCGGGCTGCGACCG85441GAACGGCGCGTGCGCGCTGCCCGACCTCACCCCGGGCACGACCGCCGTCCTGCGGGTCGT85501CCTCAGCCCGAAGAAGGCGATGACCGCCCGCGTCACGGCCGTGCTCGACACCACCGGCAC85561GGACGCCGACCGCAGCGACAACACCGCCCGGGAGCGGCTGCGCGTCCTCCAGCCGCGCAT85621CGTCGCCGTGCCCGACATCGGCAAGCCCGGATTCGTCACCTCCGTCCGAGGCGTGGACTT85681CCCGCCCGGCGTCCCGGTGCGCTTCAGCTGGAACCCCGGGATCACCGCCGCCGCCTCGCC85741GACCTTCCCGGAGGCCGACGGCACGTTCATCGGACAGCTCCTCATCCTCGCCAAGGACCA85801GACCGGGCCGCGCACCATCACGGCCTCGGGCCCCGGATTCTCCCCGGTGAAGACCGACTT85861CCTGGTCGTCAGCGGCACCGTCCAGCCGCCGGACGGGGTGACTCGCCGGTGATCC



Example 2


Construction of a “Clean” Host Strain, S. fradiae K159-1

[0102] This example describes the preparation of the clean host, Streptomyces fradiae K159-1, a strain in which the tylGI, tylGII, tylGIII, tylGIV, and tylGV genes have been deleted. Plasmid pKOS159-5 was first constructed as follows. Two fragments flanking the tylG genes were PCR amplified from S. fradiae genomic DNA using the following primers:


[0103] tylGI left flank:


[0104] forward 5′-TTTGCATGCGATGTTGACGATCTCCTCGTC [SEQ ID NO:_];


[0105] reverse 5′-GGAAGCTTCATATGTTCTCTCCGGAATGTG [SEQ ID NO:_];


[0106] tylGVI right flank:


[0107] forward 5′-TTAAGCTTTCTAGAGAGGAGAGGCCGTGAAC [SEQ ID NO:_];


[0108] reverse 5′- AAAGAATTCGAACTCGAGCACGGACTCGTTG [SEQ ID NO:_].


[0109] The Sph I, Hind III and EcoR I restriction sites are underlined. The two fragments were then cloned into pSET152 using the underlined restriction sites and corresponding sites in pSET152 [see Bierman, M., Logan, R., O'Brien, K., Seno, E. T., Nagaraja, R. & Schoner, B. E. (1992). Plasmid cloning vectors for the conjugal transfer of DNA from Escherichia coli to Streptomyces spp. Gene 116:43-49]. The resulting plasmid was named pKOS159-5. This plasmid no longer contains the int-φC31 gene and attP locus from pSET152 and therefore serves as a suicide vector for delivery by homologous recombination.


[0110] Spores of S. fradiae 99 (Russia) were prepared by harvesting from strain grown on 1-2 AS-1 plates [see Wilson, V. T. W. and Cundliffe, E. (1998). Characterization and targeted disruption of a glycosyltransferase gene in the tylosin producer, Streptomyces fradiea. Gene 214: 95-100], filtering the spores through sterile cotton, and resuspending in 1 ml of 20% glycerol [see Hopwood, D. A., et al. (1985). Genetic Manipulation of Streptomyces: A Laboratory Manual. The John Innes Foundation, Norwich, UK]. Spore suspensions were stored at −20° C. A 20 μl aliquot of the spore suspension was added to 5 mL of 2×YT and incubated at 30° C. with shaking. After two days, the cultures were diluted 1:50 and incubated at 30° C. with shaking for 3 h. After that 1 mL of the cultures were collected by centrifugation (recipient cells). Donor cells were prepared by transforming E. coli S17-1 with pKOS159-5 and selecting for apramycin resistance only. Several colonies were picked off the primary transformation plate and used to inoculate 5 ml of LB with chloramphenicol (10 μg/ml) kanamycin (100 μg/ml) and apramycin (60 μg/ml). The cells were grown at 37° C. for 4 h (OD600 of 0.4-0.6), collected by centrifugation, washed in 5 mL LB, centrifuged, and resuspended in 100 μl of LB. Conjugal transfer between the donor and recipient cells was performed by resuspending the recipient cells in the 100 μl donor suspension and spreading the cells on AS-1 plates. After incubated at 37° C. for 16-20 h the plates were then overlayed with 3 mL of soft nutrient agar containing 1 mg nalidixic acid and 1.5 mg apramycin. Exconjugants were observed after 48 h of further incubation at 30° C.


[0111] Apramycin resistant colonies were analyzed by PCR to confirm single crossovers at both flanking regions. One colony was selected for carrying out a double crossover as follows. The strain was grown on AS-1 plates non-selectively until well-sporulated. Spores were harvested, dilutions were plated on AS-1 plates, and single colonies were screened for loss of apramycin resistance. A single apramycin sensitive colony was isolated which did not produce tylosin. The double crossover was confirmed by PCR. This strain was named S. fradiae K159-1.


[0112]

Streptomyces fradiae
K159-1 was deposited under the terms of the Budapest Treaty with the American Type Culture Collection, 10801 University Blvd., Manassas, Va., 20110-2209, on Mar. 12, 2003, with accession number PTA-5060.



Example 3


Construction of S. fradiae K159-1/244-17a, A “Clean” Host Expressing Methoxymalonyl Biosynthetic Enzymes

[0113]

Streptomyces fradiae
K159-1/244-17a is derived from strain K159-1 (Example 2) by addition of the fkbGHIJK genes from Streptomyces hygroscopicus var. ascomyceticus ATCC 14891, which encode proteins catalyzing the biosynthesis of methoxymalonyl-ACP.


[0114] The putative methoxymalonyl-ACP biosynthetic genes from S. hygroscopicus ATCC 14891 (fkbGHIJK) are arranged with the 3′ end of fkbG (encoding an O-methyl transferase) overlapping by 6 codons the 3′ end of fkbH (encoding an unknown function), which is the last gene of a convergent operon that begins with fkbB (one of the PKS genes) and ends with the genes fkbK, J, I and H. To facilitate expression of these genes in S. fradiae, an operon was constructed beginning with fkbK and ending with fkbG, all in the same direction. This was done using PCR to clone fkbG with flanking restriction sites to allow its 5′ end, with its existing Shine-Dalgarno sequence, to be fused to the 3′ end of fkbH. This operon was then placed behind the tylG promoter in a pSAM2-based vector, which was introduced in S. fradiae clean host K159-1 by conjugation. Exconjugants were selected and named K159-1/244-17a.


[0115]

Streptomyces fradiae
K159-1/244-17a was deposited under the terms of the Budapest Treaty with the American Type Culture Collection, 10801 University Blvd., Manassas, Va., 20110-2209, on Mar. 12, 2003, with accession number PTA-5053.



Example 4


Construction of an Operon Containing All Five Chalcomycin PKS Genes and Expression in S. fradiae

[0116] A construct comprising the genes encoding chmGI-Vwas constructed as follows: The 3′ end of ChmGV was obtained by PCR with pKOS146-185.10 as the template and the following primers (Chalco-1A: GACACGGCCGGTGAGAGCAGC [SEQ ID NO:_] and Chalco-1B: CTTCTAGATGTCGCGGTGTACGG [SEQ ID NO:_]. The 942 bp PCR product was digested with NcoI and XbaI, the 309 bp fragment was gel isolated, and the fragment ligated into the same sites of Litmus29 to give pKOS342-33. That plasmid was cut with NcoI and XhoI and ligated to a 2.4 kb NcoI-XhoI fragment from pKOS146-185.10 to give pKOS342-35. That plasmid was digested with BglII and XhoI and ligated with a 6.4 kb BglII-XhoI fragment (including chmGIV and the 5′ region of chmGV) from pKOS146-185.10 to create pKOS342-36 (containing chmGIV and GV).


[0117] A 5.4 kb HindIII/ PstI fragment containing the 5′ half of chmGIII was isolated from pKOS146-185.1 and a 6.3 kb PstI/BglII fragment containing the 3′ half of chmGIII was isolated from pKOS146-185.10. These two pieces were ligated into Litmus28 cut with HindIIl and BglII to obtain pKOS342-38. Plasmid pKOS342-36 was cut with BglII and SpeI, the 9 kb fragment was gel isolated and the fragment ligated to the BglII and SpeI sites of pKOS342-38 to obtain pKOS342-39.


[0118] Plasmid pKOS232-172 (described in Example 5), containing chmGI and GII was cut with NdeI and HindIII and the 19 kb fragment was isolated. Plasmid pKOS342-39 was digested with HindIII and SpeI and the 20 kb fragment was isolated. These two fragments were then ligated into the vector portion of an expression cosmid, pKOS244-20 (gel isolated 8 kb NdeI-SpeI fragment). The resulting plasmid (pKOS342-45) was recovered using in vitro λ phage packaging (Stratagene) and infection of E. coli DH5α. The correct clone was identified by restriction enzyme analysis and the plasmid was moved into E. coli DH5α/pUB307 and conjugated into S. fradiae.


[0119] Expression of PKS genes in S. fradiae (in this and the following examples) were under the control of the tylosin PKS promoter (tylGIp, see Rodriguez et al., “Rapid engineering of polyketide overproduction by gene transfer to industrially optimized strains” J Ind Microbiol Biotechnol).


[0120] Apramycin resistant colonies were obtained, shown to secrete bioactive compounds, and grown vegetatively in 5 mL TSB medium (+30 μg/ml apramycin) at 30° C. for 48 h. Two mL of seed culture was inoculated into 40 ml Russia (R) medium (15 g/L whole wheat flour, 10 g/L corn gluten hydrolyzate (Sigma), 25 g/L beet molasses, 2.5 g/L brewer's yeast, 1 g/L (NH4)2HPO4, 1 g/L NaCl, 2 g/L CaCO3, and 34 g/L soybean oil) in 250 ml baffled shake flasks. After a 7 days growth at 30° C., the culture broth was analyzed for 16-membered macrolide production by HPLC (Metachem Metasil Basic column, 4.6×150 mm, 5 μm particle) using linear gradient from 15 to 100% organic phase (56% methanol, 5 mM ammonium acetate) at 1 ml/mm over 7 min. The HPLC used simultaneous detection by electrospray mass spectrometry (Turbo Ionspray source) and UV absorption at 282 nm. LC-MS analysis of the broth showed that several chalconolide derivatives were produced. The most abundant compounds were purified and shown to have the structures below. The 3-keto also forms the enol tautomer.
2



Example 5


Construction of Streptomyces fradiae K232-192 Expressing a Hybrid Chalcomycin-Spiramycin PKS

[0121]

Streptomyces fradiae
K232-192 is derived from strain K159-1/244-17a (Example 3) by addition of hybrid chalcomycin-spiramycin PKS genes, which encode proteins catalyzing the biosynthesis of 14-methylplatenolide. The chalcomycin genes were obtained from cosmid pKOS146-185.1, which was deposited under the terms of the Budapest Treaty with the American Type Culture Collection, 10801 University Blvd., Manassas, Va., 20110-2209, on Feb. 19, 2003, with accession number PTA-4961.


[0122] The first two genes of the chalcomycin PKS were isolated from the cosmid pKOS146-185.1 as EcoRI/XhoI and XhoI/BspHI fragments, and a coding sequence for a spiramycin PKS C-terminal linker attached to 3′ end. The EcoRI site is near the 5′ end of chmG1. The EcoRI/XhoI fragment was cloned into a modified Litmus28 with a synthetic linker inserted in order to create an appropriate translation start sequence. The altered region of the Litmus28 polylinker between the AflII and EcoRI sites in this plasmid (pKOS232-165) is given below. The plasmid with the chmG fragment was pKOS232-168A.
4AflII      PacI   SD      NdeI       EcoRICTTAAGGGTTAATTAAGGAGGACACATATGTCCGGAGAATTC[SEQ ID NO. _]                         M  S  G  E  F


[0123] The XhoI/BspHI fragment was ligated between the XhoI and NcoI sites of Litmus28 to give pKOS232-156. The two fragments were then joined to give pKOS232-172.


[0124] Starting with overlapping cosmid pKC1306 (described in U.S. Pat. No. 5,945,320 to Eli Lilly Company), a cassette containing the last three ORFs of the spiramycin PKS was constructed as follows. An AvrII site was introduced at the 3′ end of srmG5 by PCR from a natural MluI site to the 3′ end. The PCR product was cut with MluI and AvrII, gel isolated and ligated into a Litmus-based vector (pKOS232-75B) between the same sites to give pKOS231-118A. The 7 kb BamHI/MluI fragment from cosmid pKC1306 was subcloned in Litmus38 (New England Biolabs) to give pKOS231-113A. The 3.8 kb BamHI/MluI fragment of pKOS231-118A was gel isolated and ligated with the 7 kb BamHI/MluI fragment of pKOS231-113A to give pKOS231-120. The 7 kb BsrGI/BamHI fragment from pKC1306 was subcloned in Litmus38 to give pKOS231-113B. The 6.2 kb PstI/BamHI fragment from pKOS231-113B was cloned into Litmus28 to give pKOS231-122. The 7.5 kb BamHI/AvrII fragment was isolated from pKOS231-120 and ligated with pKOS231-122, which was cut with BamHI and AvrII and dephosphorylated, to give pKOS231-124. The 3.1 kb BamHI/SpeI fragment from pKOS231-118B (which contained a PCR fragment that created a 5′ end for srmG3) and the 7.5 kb BamHI/AvrII fragment from pKOS231-120 were isolated and ligated to give pKOS231-130. The 14 kb BamHI fragment was isolated from pKC1306 and subcloned in Litmus28 to give pKOS231-111B. The 14 kb BamHI fragment was isolated from pKOS231-111B and ligated to pKOS231-130 cut with BamHI and dephosphorylated, to give pKOS231-132.


[0125] To attach a coding sequence for a spiramycin PKS C-terminal linker to the 3′ end of chmGII, a HindIII site was introduced at the 3′ end of srmG2 using PCR with PKOS231-112B as template. The engineered HindIII site was positioned with respect to the reading frame to match that of the natural HindIII site in the chalcomycin chmGII gene. The resulting PCR product was cut with HindIII and BamHI (a natural site) and ligated into the same sites of pKOS231-114A to give pKOS232-178. This was then joined to pKOS231-132 at the BsrGI site to give pKOS232-182. The chmG1,2 cassette was isolated from pKOS232-172 as an 18 kb NdeI/HindIII fragment and the srmG3,4,5 cassette was isolated from pKOS232-182 as a 20 kb HindIII/AvrII fragment. These fragments plus a pSET152-based vector having the tylG promoter (the vector portion gel isolated from pKOS244-20) were joined in a three-piece ligation and recombinants were recovered by in vitro lambda phage packaging and infection of E. coli. Correct constructs were identified by restriction analysis (pKOS232-184A) and transferred into E. coli DH5α/pUB307.


[0126] The resulting E. coli DH5α/pUB307 cells were conjugated with S. fradiae K159-1/244-17a (Example 3) to produce Streptomyces fradiae K232-192. Streptomyces fradiae K232-192 was deposited under the terms of the Budapest Treaty with the American Type Culture Collection, 10801 University Blvd., Manassas, Va., 20110-2209, on Mar. 12, 2003, with accession number PTA-5052. Conjugation was performed as described in Practical Streptomyces Genetics (Kieser et al., 2000) except that plates were left overnight at 37° C. before overlaying with the selective agent (apramycin and naladixic acid). Apramycin resistant exconjugants were streaked for single colonies and a set of clones were patched onto R5 plates and inoculated into tryptic soy broth (40 ml in 250 ml shake flasks). Both the solid and liquid media contained apramycin (to select for pKOS232-184A) and kanamycin (to select for pKOS244-17A). Liquid and solid cultures were grown at 30° C. Agar plugs taken from most patches on R5 showed bioactivity when placed on an M. luteus test lawn. The agar was extracted with ethyl acetate and found to contain a compound of 730 amu. TSB seed cultures at 2-3 days were used to inoculate fermentation media and these cultures were grown for 7-10 days at 28° C. Upon extraction with ethyl acetate the 730 amu compound (730-I) was isolated and its structure verified by NMR as shown below. In addition, LC-MS analysis of the filtered culture broth showed abundant production of a 586 amu and a 730 amu (730-II) compound, and a 714 amu compound and a smaller amount of a 904 amu compound (most likely representing 14-methyl-platenolide with all three sugars attached). Thus, the chalcomycin-spiramycin hybrid PKS synthesized the predicted 14-methyl platenolide.
3



Example 6


Construction of Streptomyces fradiae K344-51 Expressing a Hybrid Chalcomycin-Spiramycin PKS and the chmH Gene

[0127] The chmH gene was cloned from cosmid pKOS146-185.1, which was deposited under the terms of the Budapest Treaty with the American Type Culture Collection, 10801 University Blvd., Manassas, Va., 20110-2209, on Feb. 19, 2003, with accession number PTA-4961.


[0128] A 6.3 Kb EcoRI fragment containing the chmH gene and a small downstream ferredoxin gene was cloned from cosmid pKOS146-185.1 into Litmus28 to give pKOS344-10B. This was then cut with SacI and religated to give pKOS344-016 having a 2.1 Kb insert. An NdeI site was introduced at the start of translation and an internal NdeI site was simultaneously replaced with a PstI site (without changing the amino acid sequence) in a three-piece ligation using two PCR products ligated between the EcoRI and BamHI sites of pKOS344-016 to give pKOS344-022B. The unique FseI site in pKOS344-022B was changed to an XbaI site with a synthetic linker and the chmH gene plus ferredoxin gene were excised with NdeI and XbaI and ligated into the expression vector, pKOS342-108D, between the NdeI and AvrII sites to give pKOS344-037B. This vector was transferred into DH5α/pUB307, and conjugated into K232-192 (Example 5). Exconjugants were selected with thiostrepton and streaked for single colonies to yield S. fradiae K344-51.


[0129] The vector for integration of chmH, pKOS342-108D, uses the int and att functions of Streptomyces phage φBT1. All S. fradiae strains were plated on AS1 agar for sporulation, R5 agar for solid media production, or grown in liquid TSB for vegetative growth and Russia medium for production. All appropriate antibiotics for selection of integrated markers were added to the media, except for the production stage.


[0130] Expression of the chmH gene (with its downstream ferredoxin) dramatically increased hydroxylation of the 14-methyl. A 906 amu compound was detected. This is the structure expected when all post-PKS tailoring of the tylosin pathway occurred, along with an additional reduction adding two hydrogen atoms. The methanol adduct characteristic of the aldehyde is not seen for the 906 amu compound, and reduction of the aldehyde most likely accounts for the addition of the two hydrogen atoms.
4


[0131] In addition, there appear to be a significant amount of 730 amu (730-I) compound that has the aldehyde and is hydroxylated on the 14-methyl. This is deduced by the presence of the methanol adduct (762 amu) and the fact that the 730 amu compound now elutes later from the C18 column compared with the 730 amu carboxylic acid seen prior to expression of chmH.


[0132] Without intending to be bound by a specific mechanism, FIG. 2 shows proposed pathways for post-PKS modification of the chalcomycin-spiramycin hybrid PKS macrolide product in the absence or presence of ChmH. When ChmH is present, the post-PKS reaction sequence from the Chm/Srm hybrid essentially follows that for tylosin and gives the 904 amu structure, which is converted by reduction of the aldehyde to a 906 amu compound. This reduction of the aldehyde has been described for tylosin (to give relomycin). Knockout of genes for allose biosynthesis or its transfer (tylJ), would give the demycinosyl compound of 730 amu (with the 14-hydroxymethyl and the aldehyde).



Example 7


Expression of a chmGI-GII Operon with the tvlG2 C-terminal Linker in S. fradiae K105-2

[0133] a) Construction of the S. fradiae tvlD Knockout Strain K168-173


[0134] The tylD knockout plasmid was constructed from two PCR products encompassing 1.8 kb regions upstream and downstream of the tyID gene using PCR primers that introduced new restriction sites. The upstream PCR product was cut with EcoRI and PstI and the downstream product was cut with PstI and SphI. These were then ligated together between the EcoRI and SphI sites of pUC19 and the sequence was verified. The resulting plasmid, pKOS168-106, has about 80% of the tylD gene deleted between the artificial PstI sites. This plasmid was introduced into S. fradiae by conjugation from E. coli DH5α/pUB307 and apramycin resistant exconjugants were obtained. Three were found by PCR to be the result of homologous recombination at the expected tylD locus and these were grown in the absence of selection and screened for the second crossover. Apramycin sensitive clones were isolated and some were found that produced demycinosyltylosin (DMT) by LC-MS analysis of the fermentation broths. The strain was designated S. fradiae K168-173.


[0135] b) Construction of S. fradiae tylD K105-2


[0136] The S. fradiae DMT (demycinosyltylosin) producer (K168-173) described above was used to introduce a KS-1 null mutation in the tylosin PKS. The plasmids pKOS168-190 and pKOS268-145 were digested with EcoRI and EcoRV and the 6.2 kb and 2.6 kb fragments, respectively, were gel isolated and ligated together to give pKOS264-65. A mutation was introduced into pKOS264-65 using PCR to change the active site cysteine of the tylosin KS1 to alanine, with the simultaneous introduction of an NheI site, to give pKOS325-8. Finally, pKOS325-8 and pKOS241-52 were digested with PvuII and XbaI and ligated together to give pKOS264-76. Plasmid pKOS264-76 was conjugated into the DMT producer strain S. fradiae K168-173 (Example 5) from E. coli DH5α/pUB307 and exconjugants were selected for apramycin resistance. Clones that underwent the correct first crossover event were identified by Southern blot analysis and one of these was propagated without selection to allow a second crossover. DNA from clones that had become apramycin sensitive was digested with XmaI/NheI and analyzed by Southern blot. Three clones had the pattern consistent with that expected for the desired second crossover to leave the KS1 -null mutation in the chromosome. This strain was designated S. fradiae K105-2, and was shown to produce no tylosin-related structure, but could convert O-mycaminosyl-tylonolide (OMT) into demycinosyl-tylosin (DMT).


[0137] c) Construction of a chmGI-GII Operon with the tylG2 C-terminal Linker


[0138] The tylG2 C-terminal linker region was isolated by PCR from a pSET-based vector encoding the including the entire tylosin PKS (pKOS168-190). The primers used were TylLink-A: 5′-TGAAGCTTCCCGCCACGCTGGTG-3′[SEQ ID NO:_] and TylLink-B: 5′-CGTCTAGACAGGGTGTGAAACCG-3′) [SEQ ID NO:_]. This created a HindIII site at the sane position of the encoded sequence corresponding to the natural HindIII site in the linker region of chmGII. The amplimer was cut with HindIII and XbaI and ligated into Litmus29 to give pKOS342-78. The tylosin linker region of pKOS342-78 was excised using HindIII and XbaI, and then ligated into HindIII and XbaI-digested pKOS232-172 (Example 5) to create pKOS342-82. This hybrid piece was then cut out with NdeI and XbaI and ligated to a pSET-based vector (pKOS232-189) cut with NdeI and SpeI to create the pSET152-based expression vector, pKOS342-84.


[0139] d) Expression of a chmGI-GII Operon with the tvlG2 C-terminal Linker in S. fradiae K105-2


[0140] The expression vector pKOS342-84 was transferred to E. coli DH5α/pUB307 and conjugated into S. fradiae K105-2 (this example, above). Apramycin resistant colonies were isolated and fermented in production medium. The broth was analyzed by LC-MS and found to contain the compound shown below. The chm/tyl hybrids differ from the chm/srm hybrids only by having a 4-methyl in place of a 4-methoxy, apparently making the chm/tyl good substrates for the TylH hydroxylase.
5


[0141] All publications and patent documents cited herein are incorporated herein by reference as if each such publication or document was specifically and individually indicated to be incorporated herein by reference.


[0142] Although the present invention has been described in detail with reference to specific embodiments, those of skill in the art will recognize that modifications and improvements are within the scope and spirit of the invention, as set forth in the claims which follow. Citation of publications and patent documents is not intended as an admission that any such document is pertinent prior art, nor does it constitute any admission as to the contents or date of the same. The invention having now been described by way of written description, those of skill in the art will recognize that the invention can be practiced in a variety of embodiments and that the foregoing description are for purposes of illustration and not limitation.


Claims
  • 1. A recombinant DNA molecule that encodes a polypeptide, module or domain derived from a chalcomycin polyketide synthase (PKS) gene cluster.
  • 2. The recombinant DNA molecule of claim 1 that comprises a sequence encoding a chalcomycin polyketide synthase module selected from the group consisting of modules 0 to 7.
  • 3. The recombinant DNA molecule of claim 2 that comprises a sequence encoding a chalcomycin polyketide synthase polypeptide selected from the group consisting of ChmGI, ChmGII, ChmGIII, ChmGIV, and ChmV.
  • 4. The recombinant DNA molecule of claim 1 that comprises a coding sequence for a chalcomycin modifying enzyme.
  • 5. The recombinant DNA molecule of claim 4 that comprises a coding sequence for a chalcomycin P450 hydrolase enzyme selected from the group consisting of ChmHI, ChmPI, and ChmPII.
  • 6. A vector that comprises a DNA molecule of claim 1.
  • 7. The vector of claim 6 that is an expression vector.
  • 8. A recombinant host cell comprising the vector of claim 6
  • 9. A recombinant host cell comprising a DNA molecule of claim 1 integrated into the cell chromosomal DNA.
  • 10. A chimeric PKS that comprises at least one domain of a chalcomycin PKS.
  • 11. A cell comprising the chimeric PKS of claim 10
  • 12. A modified functional chalcomycin PKS that differs from the S. bikiniensis chalcomycin PKS by the inactivation of at least one domain of the chalcomycin PKS and/or addition of at least one domain of a non-chalcomycin PKS.
  • 13. The modified functional chalcomycin PKS of claim 12, wherein the domain of the chalcomycin PKS or the non-chalcomycin PKS is selected from the group consisting of a loading domain, a thioesterase domain, an AT domain, a KS domain, an ACP domain, a KR domain, a DH domain, and an ER domain.
  • 14. A cell comprising the PKS of claim 12
  • 15. A method to prepare an chalcomycin derivative which method comprises providing extender units to the cell of claim 14.
  • 16. A recombinant expression system capable of producing a chalcomycin synthase domain in a host cell, said system comprising an encoding sequence for a chalcomycin polyketide synthase domain, and said encoding sequence being operably linked to control sequences effective in said cell to produce RNA that is translated into said domain.
  • 17. A host cell modified to contain a recombinant expression system of claim 16.
  • 18. An isolated polypeptide encoded by a recombinant polynucleotide of claim 1.
  • 19. A recombinant host cell comprising a S. bikiniensis chalcomycin PKS polypeptide selected from the group consisting of ChmGI, ChmGII, ChmGIII, ChmGIV, and ChmV.
  • 20. The host cell of claim 19 that is S. fradiae.
  • 21. A recombinant S. bikiniensis cell in which a chmGI, chmGII, chmGIII, chmGIV, or chmV is disrupted so as to reduce or eliminate production of chalcomycin.
  • 22. A recombinant DNA molecule encoding a first protein comprising one or more modules of a chalcomycin PKS and a second protein comprising one or more modules of a tylosin PKS or spiramycin PKS.
  • 23. The DNA molecule of claim 22 wherein the hybrid polyketide synthase comprises one or more polypeptides of a chalcomycin PKS and one or more polypeptides of a tylosin PKS or spiramycin PKS.
  • 24. A recombinant host cell comprising a hybrid polyketide synthase comprising one or more modules of a chalcomycin PKS and one or more modules of a tylosin PKS or spiramycin PKS.
  • 25. A recombinant DNA molecule, comprising a sequence of at least about 200, optionally at least about 500, basepairs with a sequence identical or substantially identical to a protein encoding region of SEQ ID NO:1.
  • 26. The DNA molecule of claim 25 that encodes a polypeptide, module or domain derived from a chalcomycin polyketide synthase (PKS) gene cluster.
  • 27. A method of producing a polyketide, which method comprises growing the recombinant host cell of claim 17 or 24 under conditions whereby a polyketide synthesized by a PKS comprising a protein encoded by said recombinant DNA molecule is produced in the cell.
  • 28. The method of claim 27 further comprising recovering the synthesized polyketide.
  • 29. The method of claim 28 further comprising chemically modifying said polyketide.
  • 30. The method of claim 28 further comprising formulating said polyketide for administration to a mammal.
CROSS-REFERENCE TO RELATED APPLICATIONS

[0001] The present application claims benefit of U.S. provisional patent application Nos. 60/405,194 (filed Aug. 21, 2002); 60/420,994 (filed Oct. 24, 2002); and 60/493,966 (filed Aug. 8, 2003) each of which is incorporated herein by reference in its entirety.

STATEMENT OF GOVERNMENT INTEREST

[0002] Subject matter disclosed in this application was made, in part, with government support under NIH Grant No. R43 CA AI50305. As such, the United States government may have certain rights in this invention.

Provisional Applications (2)
Number Date Country
60420994 Oct 2002 US
60493966 Aug 2003 US