COMPOSITIONS AND METHODS TO INHIBIT VIRAL REPLICATION

Information

  • Patent Application
  • 20180193430
  • Publication Number
    20180193430
  • Date Filed
    October 30, 2017
    7 years ago
  • Date Published
    July 12, 2018
    6 years ago
Abstract
This disclosure provides vaccine and therapeutic active against viral infections such as herpes simplex virus 1 (HSV-1) infections.
Description
BACKGROUND

Throughout this disclosure, various patent and technical publications are identified by an identifying citation or an Arabic numeral, the full citations for which are found immediately preceding the claims. These citations and the publications referenced within the present specification are incorporated by reference into the present disclosure to more fully describe the state of the art to which this invention pertains.


Innate immunity is the first line of host defense. In response to invading pathogens, pattern recognition receptors (PRRs) sense pathogen-associated molecular patterns (PAMPs) that are structural components or replication intermediates (Medzhitov, 2007; Takeuchi and Akira, 2010; Ting et al., 2010). PRRs include the cytosolic receptors (e.g., cGAS, IFI16, RIG-I-like and NOD-like receptors) and the membrane-anchored Toll-like receptors (TLRs) and C-type lectins. Upon binding to PAMPs, PRRs recruit cognate adaptor molecules that signal to activate two closely-related kinase complexes, IKKα/β and TBK-1/IKKε. IKKα/β phosphorylates and induces the degradation of the inhibitor of NF-κB (IκBs), leading to the nuclear translocation of NF-κB (Chen et al., 1996; Zandi et al., 1997). TBK-1/IKKε can directly phosphorylate interferon regulatory factors (IRFs) to induce its dimerization and translocation into the nucleus (Fitzgerald et al., 2003; Sharma et al., 2003). Along with other transcription factors, nuclear NF-κB and IRFs coordinate to up-regulate the expression of many immune genes to engender an antiviral state (Bhatt and Ghosh, 2014). The cytosolic RIG-I receptor is a genuine RNA sensor that, in response to viral infection, activates NF-κB and IRFs through the mitochondrion antiviral signaling (MAVS) protein (Kawai et al., 2005; Meylan et al., 2005; Seth et al., 2005; Xu et al., 2005). Studies entailing gene knockout mice demonstrate that loss of RIG-I or MAVS severely impairs host innate immune response and greatly increases viral replication (Kato et al., 2006; Sun et al., 2006). Not surprisingly, viruses have evolved diverse strategies to halt or hijack antiviral signaling downstream of RIG-I and MAVS (Chan and Gack, 2015; Feng et al., 2013).


Post-translational modification (PTM) is a major means to regulate protein function and underpins diverse fundamental biological processes. First reported more than five decades ago (Mycek and Waelsch, 1960), deamidation of asparagine/glutamine in protein has long been regarded as a non-specific process associated with protein “aging”. Early protein deamidation research surveyed the overall deamidation of the cellular proteome, and led to the postulate that non-enzymatic protein deamidation serves as a biological clock for protein “aging” (Robinson and Robinson, 2001; Weintraub and Deverman, 2007). As such, research in protein deamidation is scarce and accordingly Applicant's understanding is rudimentary at best. A few proteins (e.g., Bcl-xL and 4EBP2) were shown to be regulated by deamidation in mammalian cells, which was postulated to be the consequence of an increase in cellular pH (Bidinosti et al., 2010; Deverman et al., 2002; Dho et al., 2013). Recent studies demonstrate that pathogenic bacteria secrete effectors to deamidate key signaling molecules to evade host immune defenses (Cui et al., 2010; Sanada et al., 2012) and manipulate cellular signaling (Flatau et al., 1997; Schmidt et al., 1997), indicating that protein deamidation can be catalyzed by bacterial enzymes and is highly regulated. The roles of protein deamidation in metazoan remain largely unclear.


SUMMARY

It has been reported that gamma herpesviruses, including human Kaposi's sarcoma-associated herpesvirus (KSHV) and murine gamma herpesvirus 68 (γHV68), deploy vGAT pseudo-enzymes to induce RIG-I deamidation (He et al., 2015; Kolakofsky and Garcin, 2015). Though lacking intrinsic enzyme activity, vGAT proteins recruited cellular phosphoribosylformyglycinamidine synthetase (PFAS, also known as FGARAT) to deamidate and concomitantly activate RIG-I. Activated RIG-I was harnessed by γHV68 to evade antiviral cytokine production (Dong and Feng, 2011; Dong et al., 2012). Applicant reports here that herpes simplex virus 1 (HSV-1) induces RIG-I deamidation to prevent RIG-I activation by viral dsRNA. The UL37 tegument protein was sufficient to deamidate RIG-I in cells and in vitro, making it the first viral protein deamidase to be identified. Site-specific deamidation within the helicase 2i domain impaired the RNA detection and ATP hydrolysis of RIG-I. Uncoupling RIG-I deamidation from HSV-1 infection restored RIG-I activation and anti-viral cytokine production, thereby reducing HSV-1 replication. This work delineates a pivotal role of protein deamidation in sensing nucleic acid by a PRR and demonstrates that HSV-1 exploits protein deamidation to evade innate immune defense.


The therapeutic and prophylactic interventions and screens disclosed herein are derived from Applicant's disclosed discoveries. As background, RIG-I detects double-stranded RNA (dsRNA) to trigger antiviral cytokine production. Protein deamidation is emerging as a post-translational modification that chiefly regulates protein function. Applicant reports here that UL37 of herpes simplex virus 1 (HSV-1) is a protein deamidase that targets RIG-I to block RNA-induced activation. Mass spectrometry analysis identified two asparagine residues in the helicase 2i domain that were deamidated upon UL37 expression or HSV-1 infection. Deamidations in the helicase 2i domain rendered RIG-I unable to sense viral dsRNA, trigger antiviral immune responses and restrict viral replication. Purified full-length UL37 and its carboxyl terminal fragment were sufficient to deamidate RIG-I in vitro. Uncoupling RIG-I deamidation from HSV-1 infection, via engineering deamidation-resistant RIG-I or introducing deamidase-deficient UL37 into the HSV-1 genome, restored RIG-I activation and antiviral immune signaling. This work defines the first viral deamidase and a pivotal role of protein deamidation in sensing microbial pathogens by a pattern recognition receptor.


This disclosure provides an isolated polynucleotide encoding a RIG-I mutant and equivalents thereof as well as compliment thereto. In one aspect, the RIG-I mutant is RIG-I-QQ. A non-limiting example of this polypeptide is SEQ ID NO. 4, and equivalents thereof and complements thereto. An equivalent is one or more polypeptide that retains amino acids at positions 495 and/or 549 that make the protein deaminase resistant, e.g., a substitution of Q at positions 495 and/or 549. Vectors and host cells are further provided herein. Complementary polypeptides to these are further provided herein.


Further provided are polypeptides are encoded by polynucleotides.


Compositions containing one or more of the polynucleotides, proteins, vectors and host cells and one or more carriers, are further provided herein. In one aspect, the compositions contain buffers, stabilizers and/or preservatives. In a further aspect, the compositions are lyophilized for ease of transport, storage and use.


The compositions can be formulated for administration as a vaccine or therapeutic composition and contain an effective immunity-inducing amount of the active components and optionally an adjuvant. The compositions can be further formulated into dosage units that can be packaged into kits with instructions for use.


The compositions are useful in methods to inhibit viral replication by contacting the virus in a cell, tissue or subject in need thereof by with an effective amount of an agent that inhibits the deamidation activity of UL37. Also provided is a method to abolish 5′-ppp-RNA-binding and ATP hydrolysis is a cell, tissue or subject infected with the virus, as well as to inhibit or “switch off RIG-1 by contacting the cell, tissue or administering to the subject an effective amount of a composition as provided herein. In another aspect, provided herein is a method to block RNA-induced activation by a cell, tissue or subject in need thereof by contacting the cell, tissue or administering to the subject. In a further aspect, this disclosure also provides a method to induce an anti-viral immune response in a subject in need thereof by administering an effective amount of a composition as described herein.


In one aspect, the virus is an virus that exhibits these activities such as a DNA virus, e.g., a virus of the class Herpesviridae, e.g., HIV-1 or HSV-2 virus.


Methods to determine if the methods are effective are known in the art and disclosed herein, e.g., a reduction in viral load, deamination assay, ATPase activation assay, enhanced immunity, e.g., B-cell or T-cell adaptive immunity, etc.


The contacting can be in vitro or in vivo, and the administration can be effected by methods known in the art, e.g., injection or oral administration. Multiple administrations can be provided as necessary.


For in vivo methods, the subject to be treated is any subject at risk of or having a viral infection, e.g., a pet, sports animal or human patient.





BRIEF DESCRIPTION OF THE FIGURES


FIGS. 1A-1D: HSV-1 evades RNA-induced activation of RIG-I. (FIG. 1A) HEK293/Flag-RIG-I cells were mock-infected or infected with HSV-1 (MOI=2) or Sendai virus (SeV, 100 HAU/ml) for 4 hours. Whole cell lysates (WCLs) were analyzed by two-dimensional gel electrophoresis. (FIGS. 1A-1B) HEK293 cells were mock-infected or infected with HSV-1 (MOI=2) for 1 h and super-infected with SeV (100 HAU/ml) for 8 (FIG. 1B) or 16 h (FIG. 1C). The expression of the indicated antiviral genes was analyzed by real-time PCR using total RNA (FIG. 1B). Supernatant was collected to determine IFN-β by ELISA (FIG. 1C). (FIG. 1D) HEK293/Flag-RIG-I were mock-infected or infected with HSV-1 for 1 h, followed by SeV infection (100 HAU/ml) for 4 h. RIG-I was purified and analyzed by gel filtration and immunoblotting. Numbers indicate the size of RIG-I in kDa and V0 denotes void volume. For FIGS. 1B-1D, WCLs were analyzed by immunoblotting with antibodies against SeV, HSV-1 UL37 and β-actin. ***, p<0.001 was calculated in reference to cells infected with SeV. For FIGS. 1B-IC, data are presented as mean±SD.



FIGS. 2A-2C: HSV-1 UL37 interacts with RIG-I. (FIG. 2A) 293T cells were transfected with plasmids containing GST-RIG-I and the indicated open reading frames of HSV-1. WCLs were precipitated with the indicated antibody. WCLs and precipitated proteins were analyzed by immunoblotting. (FIGS. 2B-2C) 293T cells were infected with recombinant HSV-1 UL37-Flag at MOI of 30 for 1 h (FIG. 2B) or MOI of 1 for 8 and 16 h (FIG. 2C). WCLs were precipitated with anti-Flag (M2) antibody. RIG-I and WCLs were analyzed by immunoblotting with indicated antibodies.



FIGS. 3A-3J: UL37 inhibits RIG-I activation. (FIGS. 3A-3F) Whole cell lysates (WCLs) of 293T cells stably expressing UL37 were analyzed by immunoblotting with anti-V5 (UL37) and anti-β-actin antibodies (FIG. 3A). Cells were infected with Sendai virus (SeV) (100 HAU, 8 h) and total RNA was analyzed by real-time PCR with primers specific for the indicated genes (FIG. 3B). WCLs were analyzed by immunoblotting with antibodies against V5 (UL37), SeV and β-actin (FIG. 3C). Supernatant was harvested for cytokines determined by ELISA at 16 hpi, and WCLs were analyzed by immunoblotting with antibodies against SeV and β-actin (FIG. 3D). Cells were transfected with poly [I:C] and the PRDIII-luc reporter. Activation of the PRDIII promoter was determined by luciferase reporter assay (FIG. 3E). 293T stable cells were infected with SeV (100 HAU/ml) for 1 and 3 h, and WCLs were analyzed for the phosphorylation of TBK-1 and IRF3 by immunoblotting (FIG. 3F). (FIGS. 3G-3H) 293T cells stably expressing Flag-RIG-I and RIG-I-V5 were transfected with an empty or UL37-containing plasmid. At 30 h post-transfection, cells were infected with SeV (100 HAU/ml) for 4 h. WCLs were precipitated with anti-Flag. Precipitated proteins and WCLs were analyzed by immunoblotting with the indicated antibodies (FIG. 3G). 293T/Flag-RIG-1 cells, without or with UL37-V5 expression (by lentivirus), were mock-infected or infected with SeV (100 HAU/ml) for 4 h. Purified RIG-I was analyzed by gel filtration and immunoblotting. Numbers at the top indicate the size of RIG-I in kDa and V0 denotes void volume (FIG. 3H). (FIG. 3I) Diagram of key components of the RIG-I-mediated IFN induction pathway. (FIG. 3J) 293T cells were transfected with plasmids containing MAVS, TBK1 and the constitutively active IRF3-5D, along with the IFN-β reporter plasmid and a UL37-containing plasmid. Activation of the IFN-β promoter was determined by luciferase assay. WCLs were analyzed by immunoblotting with anti-Flag (M2) (MAVS, TBK-1 and IRF3-5D) and anti-V5 (UL37) antibodies (right panels). ***, p<0.001. For FIG. 3B, FIG. 3D, FIG. 3E and FIG. 3J, data are presented as mean±SD.



FIGS. 4A-4E: UL37 deamidates RIG-I in cells and in vitro. (FIG. 4A) HEK293/Flag-RIG-I cells were transfected with an empty or UL37-containing plasmid. Whole cell lysates (WCLs) were analyzed by two-dimensional gel electrophoresis and immunoblotting with the indicated antibodies. (FIGS. 4B-4C) HEK293/Flag-RIG-I cells were transfected with a UL37-expressing plasmid or infected with HSV-1 (MOI=1) for 12 h. RIG-I was purified and analyzed by tandem mass spectrometry. Two peptides containing deamidated asparagines were identified. D495 and D549 (in red) were shown (FIG. 4B). Deamidated peptides were quantitatively determined by tandem mass spectrometry analysis and data represents one of two independent experiments (FIG. 4C). (FIG. 4D) HEK293/Flag-RIG-I or HEK293/Flag-RIG-I-DD cells were transfected with an empty or UL37-containing plasmid. WCLs were analyzed by two-dimensional gel electrophoresis and immunoblotting. (FIG. 4E) GST-RIG-I and UL37 were purified from transfected 293T cells and E. coli, respectively, and analyzed by silver staining (right panels). Deamidation reaction was analyzed by two-dimensional gel electrophoresis and immunoblotting with anti-RIG-I antibody.



FIGS. 5A-5H: The deamidated RIG-I-DD mutant fails to sense viral dsRNA. (FIG. 5A) N955 and N549 are located in the helicase 2i domain of the RIG-I structure (PDB ID: 3TMI). dsRNA is shown as helices in dark yellow. (FIG. 5B) Purified RIG-I and RIG-I-DD were incubated with [32P]-labeled 5′-triphosphate 19mer dsRNA and analyzed by electrophoresis mobility shift assay. (FIGS. 5C-5D) Purified RIG-I and RIG-I-DD were used for in vitro ATP hydrolysis with increasing amount of ATP (FIG. 5C) or 5′-triphosphate 19 mer dsRNA (FIG. 5D). (FIG. 5E) 293T cells stably expressing RIG-I-WT and RIG-I-DD were mock-infected or infected with Sendai virus (SeV, 100 HAU/ml) for 4 h. Purified RIG-I was analyzed by gel filtration and immunoblotting. Numbers at the top indicate the size of RIG-I in kDa and V0 denotes void volume. (FIGS. 5F-5H) Rig-fi MEFs “reconstituted” with RIG-I-WT or RIG-I-DD were analyzed by immunoblotting with the indicated antibodies (FIG. 5F). Cells were infected with recombinant eGFP VSV (Indiana Strain, MOI=20) for 8 h, and total RNA was analyzed by real-time PCR (FIG. 5G). Cells were infected with VSV (MOI=0.05) and viral titer in the supernatant was determined by plaque assay (FIG. 5H) hpi, hours post-infection. For FIG. 5C, FIG. 5D, FIG. 5G and FIG. 5H, data are presented as mean±SD.



FIGS. 6A-6G: The deamidation-resistant RIG-I-QQ restores antiviral cytokine production in response to HSV-1 infection. (FIG. 6A) N549 forms two hydrogen bonds with the backbone of T504 within the helicase 2i domain of the RIG-I structure (PDB ID: 4A36). (FIG. 6B) HEK293/Flag-RIG-I-WT or HEK293/Flag-RIG-I-QQ cells were transfected with an empty or UL37-containing plasmid. Whole cell lysates (WCLs) were analyzed by two-dimensional gel electrophoresis. (FIG. 6C) HEK293/Flag-RIG-I-QQ cells were mock-infected or infected with HSV-1 (MOI=5) or Sendai virus (SeV, 100 HAU/ml) for 4 h. Purified RIG-I was resolved by gel filtration and analyzed by immunoblotting. (FIGS. 6D-6F) Control HEK293 (Vec) or HEK293 cells stably expressing RIG-I-WT, RIG-I-DD or RIG-I-QQ were infected with HSV-1 (MOI=5) for 4 h and WCLs were analyzed by immunoblotting with the indicated antibodies (FIG. 6D). Total RNA was analyzed by real-time PCR (FIG. 6E). Supernatant was collected at 16 hpi and cytokines (IFN-β and RANTES) were quantified by ELISA (FIG. 6F). M, mock-infected; numbers on the top indicate hours post-infection in (FIG. 6D). (FIG. 6G) Stable HEK293 cell lines as above were infected with HSV-1 at MOI=0.5 and viral titer was determined by plaque assay. For FIG. 6E, FIG. 6F and FIG. 6G, data are presented as mean±SD.



FIGS. 7A-7H: A cysteine is required for UL37 deamidase activity. (FIG. 7A) 293T cells were transfected with plasmids containing UL37 wild-type or mutants, along with the PRDIII-luciferase reporter. Cells were infected with Sendai virus (SeV, 100 HAU/mL) at 30 h later for 15 hours. Activation of the PRDIII promoter was determined by luciferase reporter assay. (FIG. 7B) UL37 and UL37 (571-1123) were purified from E. coli and analyzed by silver staining (right panel). In vitro RIG-I deamidation reaction was analyzed by two-dimensional gel electrophoresis and immunoblotting. (FIG. 7C) UL37C (571-1123) was reacted with CNM (1 and 10 μM) for 45 minutes and analyzed by tandem mass spectrometry. The ratio of peptide containing the indicated cysteines is shown as the percentage of the CNM-modified peptide to total peptide. Data represents one of two independent experiments. (FIGS. 7D-7E) HEK293/Flag-RIG-I cells were infected with recombinant HSV-1 UL37-WT or HSV-1 UL37-C819S (MOI=5) for 4 h. WCLs were analyzed by two-dimensional gel electrophoresis and immunoblotting (FIG. 7D). RIG-I was purified and analyzed by gel filtration and immunoblotting (FIG. 7E). Numbers indicate the size of RIG-I in kDa and V0 denotes void volume. (FIGS. 7F-7G) THP-1 cells were harvested at 8 h after HSV-1 infection (MOI=2) and total RNA was analyzed by real-time PCR (FIG. 7F). Supernatant was collected at 16 hpi to quantify cytokines by ELISA (FIG. 7G). (FIG. 7H) HFF cells were infected with HSV-1 UL37-WT and HSV-1 UL37-C819S at MOI of 0.1. Viral replication was determined by plaque assays. **, p<0.01 and ***, p<0.001. For FIGS. 7A, 7F, 7G and 7H, data are presented as mean±SD.



FIGS. 8A-8I: HSV-1 evades RIG-I-mediated antiviral immune responses. (FIG. 8A) HFF cells stably expressing control, IFI16 or RIG-I shRNA were prepared by lentiviral transduction. RNA was extracted and cDNA was prepared to determine IFI16 and RIG-I mRNA expression by real-time PCR analysis. Stable cells were then infected with HSV-1 (MOI=5) for the indicated hours. RNA was extracted and cDNA was prepared to determine IFN-β mRNA expression by real-time PCR analysis. (FIG. 8B) 293T cells stably expressing control or RIG-I shRNAs were prepared by lentiviral transduction. WCLs were analyzed by SDS-PAGE and immunoblotting with the indicated antibodies. (FIG. 8C) Control and RIG-I knockdown 293T cells were mock-infected or infected with HSV-1 of indicated MOI for 24 h. RNA was extracted and cDNA was prepared to determine IFN-β and ISG56 mRNA expression by real-time PCR analysis. (FIG. 8D) Control and RIG-I knockdown HeLa cells were mock-infected or infected with HSV-1 of indicated MOI for 24 h. RNA was extracted and cDNA was prepared to determine IFN-β and ISG56 mRNA expression by real-time PCR analysis. (FIG. 8E) THP1 cells stably expressing control or RIG-I shRNA were prepared by lentiviral transduction. Cells were then infected with HSV-1 (MOI=5) for the indicated hours. RNA was extracted and cDNA was prepared to determine IFN-β and ISG56 mRNA expression by real-time PCR analysis. (FIG. 8F) 293T cells were infected with HSV-1 (MOI=5) for 2 h and then transfected with LMV Poly [I:C] for 8 h. RNA was extracted and cDNA was prepared to determine IFN-β and ISG56 mRNA expression by real-time PCR analysis. (FIG. 8G) 293T cells were transfected with a plasmids containing RIG-I and those containing indicated ORFs of HSV-1. Whole cell lysates (WCLs) were harvested and RIG-I was immunoprecipitated, followed by SDS-PAGE analysis and immunoblotting with the indicated antibodies. (FIG. 8H) 293T cells were transfected with plasmids containing UL37 and RIG-I/MDA5. RIG-I/MDA5 was immunoprecipitated. Precipitated proteins and WCL were analyzed by immunoblotting with indicated antibodies. (FIG. 8I) 293T cells were infected by HSV-1 (MOI=5) for 15 h. WCLs were analyzed by gel filtration and immunoblotting with indicated antibodies. **p<0.01, ***p<0.001, error bars denote SD (n=3).



FIGS. 9A-9H: UL37 inhibits RIG-I-mediated antiviral immune responses. (FIG. 9A) 293T cells were transfected with a PRDIII (ISRE) reporter cocktail and increasing amounts of a plasmid containing UL37. Transfected cells were subsequently infected with Sendai virus (SeV, 100 [HAU]/ml) for 15 h. Fold induction of the PRDIII reporter was determined by luciferase assay. (FIG. 9B) 293T cells were transfected with an IFN-β reporter cocktail and increasing amounts of a plasmid containing UL37. Transfected cells were subsequently infected with SeV (100 [HAU]/ml) for 15 h. Fold induction of the PRDIII reporter was determined by luciferase assay. (FIG. 9C) Control 293T and 293T cells stably expressing UL37 were infected with SeV (100 [HAU]/ml) for 10 h. RNA was extracted and cDNA was prepared to determine IL8 and CXCL2 mRNA expression by real-time PCR analysis. (FIG. 9D) 293T cells were transfected with an NF-κB reporter cocktail and increasing amounts of a plasmid containing UL37. Fold induction of the NF-κB reporter was determined by luciferase assay. (FIG. 9E) 293T cells were transfected with PRDIII reporter cocktail and increasing amounts of a plasmid containing UL37. Fold induction of the PRDIII reporter was determined by luciferase assay. (FIG. 9F) (Left) Rig-i+/+ and Rig-i−/− MEF cells were transfected with an NF-κB reporter cocktail and a plasmid containing UL37 via NEON transfection system. Fold induction of the NF-κB reporter was determined by luciferase assay. (Right) 293T cells were transfected with a PRDIII reporter cocktail and a plasmid containing MDA5, with increasing amounts of a plasmid containing UL37. Fold induction of the PRDIII reporter was determined by luciferase assay. (FIG. 9G) Control 293T and 293T cells stably expressing UL37 were infected with SeV (100 [HAU]/ml) for 8 h. WCLs were analyzed by native PAGE and immunoblotting with anti-IRF3 antibody. (FIG. 9H) HeLa cells were transfected with a control plasmid or a plasmid containing UL37, and then infected with SeV (100 [HAU]/ml) for 8 h. IRF3 nuclear translocation was analyzed by immunofluorescence microscopy with anti-IRF3 antibody. **p<0.01, ***p<0.001, error bars denote SD (n=3).



FIGS. 10A-10C: UL37 deamidates RIG-I. (FIG. 10A) 293T cells were transfected with a plasmid containing MDA5 and either a vector or a plasmid containing UL37. WCLs were analyzed by 2-dimensional gel electrophoresis (2DGE) and immunoblotting with indicated antibodies. (FIG. 10B) 293T/RIG-I cells were mock-infected or infected with HSV-1 (MOI=5) or transfected with a plasmid containing UL37. RIG-I was purified and analyzed by SDS-PAGE and coommassie staining. (FIG. 10C) 293T/RIG-I cells were transfected with a plasmid containing UL37 and subsequently treated with or without DON (10 μM). WCLs were analyzed by 2DGE and immunoblotting with indicated antibodies. β-actin served as an internal control.



FIGS. 11A-11I: Deamidation inactivates RIG-I to sense dsRNA. (FIG. 11A) 293T cells were transfected with a PRDIII reporter cocktail and plasmids containing RIG-I, RIG-I-N495D, RIG-I-N549D, RIG-I-DD and RIG-I-N(1-200), respectively. Fold induction of the PRDIII reporter was determined by luciferase assay. (FIG. 11B) 293T cells were transfected with an NF-κB reporter cocktail and plasmids containing RIG-I, RIG-I-N495D, RIG-I-N549D, RIG-I-DD and RIG-I-N(1-200), respectively. Fold induction of the NF-κB reporter was determined by luciferase assay. (FIG. 11C) RIG-I WT and the indicated mutants were purified from transfected 293T cells and analyzed by SDS-PAGE and silver staining. (FIG. 11D) Purified RIG-I, RIG-I-N495D, RIG-I-N549D and RIG-I-DD were incubated with 32P-labeled 5′-triphosphate dsRNA (20 nM), with and without a 500-fold excess of cold 5′-triphosphate dsRNA. RNA-RIG-I complex was analyzed by PAGE and autoradiography. (FIG. 11E) Purified RIG-I and RIG-I-DD were incubated with 32P-labeled 5′-triphosphate dsRNA (20 nM) or 32P-labeled control dsRNA, with and without a 500-fold excess of cold dsRNA. RNA-RIG-I complex was analyzed by PAGE and autoradiography. (FIG. 11F) Purified RIG-I, RIG-I-DD and RIG-I-K270A were incubated with 32P-labeled 5′-triphosphate dsRNA (20 nM), with and without a 500-fold excess of cold 5′-triphosphate dsRNA. RNA-RIG-I complex was analyzed by PAGE and autoradiography. (FIG. 11G) Purified RIG-I, RIG-I-N495D, RIG-I-N549D and RIG-I-DD (20 nM) were incubated with increasing concentrations of ATP in the presence of 5′-triphosphate dsRNA (80 nM) and analyzed by ATP hydrolysis assay. (FIG. 11H) Purified RIG-I, RIG-I-N495D, RIG-I-N549D and RIG-I-DD (20 nM) were incubated with increasing concentrations of dsRNA in the presence of ATP (1000 μM) and analyzed by ATP hydrolysis assay. (FIG. 11I) “Reconstituted” MEFs as shown in FIG. 5G were infected with Sendai virus (100 [HAU]/ml) for 10 h. RNA was extracted and cDNA was prepared to determine IFN-β and ISG56 mRNA expression by real-time PCR analysis. **p<0.01, ***p<0.001, error bars denote SD (n=3).



FIGS. 12A-12E: The carboxyl terminal half of UL37 contains a deamidase domain. (FIG. 12A) 293T cells were transfected with a PRDIII reporter cocktail and increasing amount of a plasmid containing UL37-WT, UL37-C819S or UL37-C850S. Transfected cells were subsequently infected with Sendai virus (SeV, 100 [HAU]/ml) for 15 h. Fold induction of the PRDIII reporter was determined by luciferase assay. (FIG. 12B) 293TRex/RIG-I cells were transfected with a plasmid containing UL37-WT, UL37-C819S or UL37-C850S. WCLs were analyzed by 2-dimensional gel electrophoresis and immunoblotting with indicated antibodies. (FIG. 12C) 293T cells were transfected with the ISRE reporter plasmid cocktail and increasing amounts of a plasmid containing UL37 (1-1123), UL37 (571-1123) or UL37 (730-1123). At 16 hours later, cells were infected with SeV (100 HAU) for 14 hours. Fold induction of the ISRE promoter was determined by luciferase assay. (FIG. 12D) Recombinant HSV-1 carrying flag-tagged wild-type UL37 or C819S UL37 was generated by homologous recombination. Viral DNA was extracted from infected Vero cells and digested by BamHI. DNA fragments were analyzed by agarose gel electrophoresis. (FIG. 12E) 293T cells were infected with HSV-1 (KOS), HSV-1 UL37-WT or HSV-1 UL37-C819S (MOI=5) for 20 h, respectively. WCLs were analyzed by immunoblotting with indicated antibodies.



FIGS. 13A-13F: The carboxyl terminal half of UL37 contains a deamidase domain. (FIG. 13A) HeLa cells were infected with recombinant HSV-1 UL37-WT or HSV-1 UL37-C819S (MOI=5) for the indicated hours. RNA was extracted and cDNA was prepared to determine IFN-β and ISG56 mRNA expression by real-time PCR analysis. (FIG. 13B) HeLa and 293T cells were infected with recombinant HSV-1 as in (A). Supernatant was harvested and IFN-β was quantified by ELISA. (FIG. 13C) (Top) HFF cells stably expressing control or STING shRNA were prepared by lentiviral transduction. WCLs were analyzed by immunoblotting with the indicated antibodies. (Bottom) HFF stable cells were infected with recombinant HSV-1 (MOI=5) as in (A). RNA was extracted and cDNA was prepared to determine IFN-β mRNA expression by real-time PCR analysis. (FIG. 13D) HeLa cells were infected with recombinant HSV-1 (MOI=0.1/1) as in (A) for the indicated hours. Supernatant was harvested and HSV-1 viral titer was measured by plaque assay. (FIG. 13E) Vero cells were infected with recombinant HSV-1 (MOI=0.1) as in (A) for the indicated hours. Supernatant was harvested and HSV-1 viral titer was measured by plaque assay. (FIG. 13F) HFF cells stably expressing control or RIG-I shRNA were prepared by lentiviral transduction as in FIG. 8A. Cells were then infected with recombinant HSV-1 (MOI=0.1) as in (FIG. 13A) for the indicated hours. Supernatant was harvested and HSV-1 viral titer was measured by plaque assay. **p<0.01, ***p<0.001, error bars denote SD (n=3).



FIGS. 14A-14E show that HSV-1 carrying deamidase-deficient UL37C718S more robustly induces cytokine production in THP-1 cells than wild-type HSV-1. (FIG. 14A and FIG. 14B) Human THP-1 monocytes were infected with HSV-1 UL37 wild-type (HSV-1 WT) or HSV-1 containing the deamidase-deficient UL37C819S mutant (HSV-1 C819S) at MOI=5. The expression of inflammatory cytokines was determined by real-time PCR at the indicated time points (FIG. 14A). Medium was collected at 16 hours post-infection and IFN-β was determined by ELISA (FIG. 14B). In FIG. 14C, THP-1 monocytes were infected as described in (FIG. 14A). Whole cell lysates prepared at various time points post-infection were analyzed by immunoblotting with the indicated antibodies. p- indicates phosphorylated TBK1 or IRF3. In FIG. 14D, wild-type and cGAS knockout L929 mouse fibroblasts were infected with HSV-1 WT and HSV-1 UL37C819S at MOI=5. L929 cells were harvested at 8 hours post-infection and the expression of the indicated cytokine genes were analyzed by real-time PCR. In FIG. 14E, THP-1 monocytes were infected as described in (FIG. 14A). cGAMP was extracted and quantified using permeabilized THP-1 reporter cells. For FIGS. 14A, 14B, 14D and 14E, p*<0.5, p***<0.005.



FIGS. 15A-15H evidence that UL37 targets cGAS to inhibit innate immune activation. In FIG. 15A, THP-1 monocytes infected with control (Vector) or UL37-expressing lentivirus were selected with puromycin and whole cell lysates (WCLs) were analyzed by immunoblotting with the indicated antibodies. FIG. 15B shows stable THP-1 cell lines as described in (FIG. 15A) were transfected with HT-DNA. Cells were harvested at 6 hours post-infection (hpi) and the expression of IFNB1 and ISG56 were analyzed by real-time PCR (FIG. 15B). Medium was harvested at 16 hpi and IFN-β was determined by ELISA (FIG. 15C). In FIG. 15C, stable THP-1 cell lines were transfected with cGAMP (2 μg/ml). Cells were harvested at 8 h post-transfection, and the expression of IFNB1 and ISG56 were analyzed by real-time PCR. In FIG. 15D, control (V) or UL37-expressing (U) THP-1 cells were left non-transfected (NT), or transfected with HT-DNA or cGAMP. Cells were harvested at 3 h post-transfection and WCLs were analyzed by immunoblotting with the indicated antibodies. In FIG. 15E, THP-1 cells were infected with control (Vector) or UL37-expressing lentivirus to establish stable cells lines as in (FIG. 15A). WCLs were analyzed by immunoblotting with the indicated antibodies. In FIG. 15F, stable THP-1 cell lines were transfected with HT-DNA. Cells were harvested at 6 h post-transfection. The relative mRNA quantity of IFNB1 was determined by real-time PCR (FIG. 15G), while intracellular cGAMP was extracted and the concentration was determined (FIG. 15H).



FIGS. 16A-16G show that UL37 interacts with and deamidates cGAS. FIG. 16A shows that human THP-1 monocytes were infected with HSV-1 carrying Flag-tagged UL37 at MOI=0.5. At 16 hours post-infection (hpi), cells were harvested and whole cell lysates (WCLs) were precipitated with anti-FLAG (UL37). Precipitated proteins and WCLs were analyzed by immunoblotting with antibody against cGAS or FLAG (UL37). In FIG. 16B, 293T cells stably expressing FLAG-tagged cGAS were transfected with vector or vector containing UL37. At 30 h post-transfection, WCLs were prepared and analyzed by two-dimensional gel electrophoresis (2-DGE) and immunoblotting with the indicated antibodies. In FIG. 16C, cGAS-expressing 293T cells were mock-infected or infected with HSV-(MOI=10). At 4 hpi, WCLs were analyzed by 2-DGE and immunoblotting with the indicated antibodies. In FIGS. 16D and 16E, purified cGAS was analyzed by tandem mass spectrometry. The m/z spectrum of a deamidated peptide containing Q451 and Q454 is shown. Es in darker color indicates deamidated residues (FIG. 16D). The relative deamidation efficiency was calculated by number of deamidated peptides and total number of peptides. Data represents results of two independent experiments (FIG. 16E). In FIG. 16F, 293T cells stably expressing wild-type cGAS or the deamidated cGAS-DDEE mutant were transfected with vector or vector containing UL37. WCLs were prepared, and analyzed by 2-DGE and immunoblotting as described in (FIG. 16B). In FIG. 16G, purified cGAS or deamidated cGAS-DDEE mutant (as fusion constructs with maltose-binding protein from E. coli), UL37 and its deamidase-deficient UL37 C819S mutant were analyzed by silver staining (left panels). In vitro deamidation reactions were analyzed by 2-DGE and immunoblotting (right panels).



FIGS. 17A-17H show that deamidated cGAS fails to synthesize cGAMP, trigger innate immune response and restrict viral replication. In FIG. 17A, 293T cells were transfected with an IFN-β reporter plasmid cocktail with plasmids containing cGAS wild-type or the deamidated cGAS-DDEE mutant. At 30 h post-transfection, IFN-γ activation was determined by luciferase assay (top panel), while whole cell lysates (WCLs) were analyzed by immunoblotting with the indicated antibodies (bottom panels). In FIG. 17B, the N196 residue is located in proximity to the catalytic active site, consisting of E211, D213 (not shown) and D307 (PDB: 4K9B). cGAMP, in relation to E211 and D307, is also shown. In FIG. 17C, in vitro cGAMP synthesis was measured with purified cGAS or its deamidated mutants, including N201D and DDEE, with or without HT-DNA. Reactions were analyzed by thin layer chromatography (left panel). The relative intensity of cGAMP was determined by densitometry analysis (right panel). Data represents more than three independent experiments. In FIG. 17D, cGAS-deficient L929 cells were infected with control (Vector) lentivirus or vector containing cGAS wild-type (WT) or the deamidated cGAS-DDEE mutant (DDEE). WCLs prepared from stable L929 cells were analyzed by immunoblotting with the indicated antibodies. In FIG. 17E, “reconstituted” cGAS stable cell lines as described in FIG. 17D were transfected with HT-DNA. Cells were harvested at 6 h post-transfection and the expression of the indicated inflammatory genes was analyzed by real-time PCR. In FIGS. 17G-17H, “reconstituted” cGAS stable cell lines were infected with HSV-1 (MOI=0.01) (FIG. 17G) or murine herpesvirus 68 (MHV68) (H) (MOI=0.05). Viral titers in the supernatant at the indicated time points were determined by plaque assays. For C, E, G and H, p**<0.01; p***<0.005.



FIGS. 18A-18G show that HSV-1 carrying deamidase-deficient UL37C819S more robustly induces innate and adaptive immune responses. In FIGS. 18A-18B, age (10-12-week old) and gender-matched BL6 mice were intravenously infected with HSV-1 UL37 wild-type (WT) or HSV-1 UL37C819S (C819S) (5×107 PFU). Blood was collected at 8 hours post-infection (hpi) and cytokines in sera were determined by ELISA (FIG. 18A). In FIGS. 18C-18E, mice were sacrificed at 3 days post-infection and the viral genome copy numbers in the brain were determined by real-time PCR analysis (FIG. 18B). In FIGS. 18C-18E, mice infection with HSV-1 was carried out as in FIG. 18A. Mouse survival was recorded over time (FIG. 18C). Infected mice were sacrificed at the indicated days post-infection (dpi). The spleens were harvested and isolated T cells were analyzed by flow cytometry after staining with gB-specific tetramer (FIG. 18D). Sera were collected at the indicated time points and antibody titer was determined by ELISA. In FIG. 18F and FIG. 18G, (FIG. 18F) or STING (FIG. G), were infected with HSV-1 UL37 wild-type (WT) or the deamidase-deficient HSV-1 UL37C819S (UL37C819S) (5×107 PFU).



FIGS. 19A-19H show that vaccination with HSV-1 carrying deamidase-deficient UL37C819S protects mice from lethal HSV-1 challenge. FIG. 19A is diagram of the experimental design for immunization and challenge with HSV-1 wild-type. Wk, week. In FIG. 19B and FIG. 19C, age-(10-12-week old) and gender-matched BALB/c mice were intraperitoneally infected with HSV-1 UL37C819S (C819S) (1×106 PFU) twice at an interval of two weeks or received PBS injection (control). Mice were intravenously challenged with lethal doses of HSV-1 wild-type (5×106 PFU) and survival was recorded (FIG. 19A). No mouse died up to 20 days post-infection (dpi) in the vaccinated group. Mouse body weight was determined and recorded (FIG. 19B). In FIG. 19D and FIG. 19E, mock-infected mice and mice vaccinated with PBS or HSV-1 UL37C819S were challenged with lethal doses of HSV-1 as described in (FIG. 19A), and sacrificed at 3 dpi. Mouse brains were collected and fixed. Brain sections analyzed by Haematoxylin & Eosin staining. Representative images are shown (FIG. 19D). Boxed regions have been amplified and are shown below the original images. Scale bars denote 100 μm (top) and 50 μm (bottom). In FIG. 19F and FIG. 19G, brain sections as described in FIG. 19D, were stained with anti-UL37 rabbit serum and representative images are shown (FIG. 19F), with boxed regions amplified and displayed below. UL37-positive cells were counted from five randomly selected fields and the percentage of HSV-1-positive cells was semi-quantitatively determined (FIG. 19G). In FIG. 19H, brain sections were also stained with antibody against NeuN (a neuron marker), and representative images are shown in FIG. 19F.





SEQUENCE LISTING

Attached are nucleotide sequences that are relevant to disclosure;


SEQ ID NO.: 1 is the wild-type polynucleotide sequence of UL37.


SEQ ID NO.: 2 is the mutated polynucleotide sequence designated UL37 C819S.


SEQ ID NO.: 3 depicts wild-type RIGI polypeptide.


SEQ ID NO.: 4 depicts mutated RIG-I-QQ polypeptide.


SEQ ID NO.: 5 depicts the polynucleotide sequence of Strain KOS of HSV-1, a mutated HSV-1 having mutated UL37.


DETAILED DESCRIPTION

Before the compositions and methods are described, it is to be understood that the invention is not limited to the particular methodologies, protocols, cell lines, assays, and reagents described, as these may vary. It is also to be understood that the terminology used herein is intended to describe particular embodiments of the present invention, and is in no way intended to limit the scope of the present invention as set forth in the appended claims.


Unless defined otherwise, all technical and scientific terms used herein have the same meanings as commonly understood by one of ordinary skill in the art to which this invention belongs. Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, the preferred methods, devices, and materials are now described. All technical and patent publications cited herein are incorporated herein by reference in their entirety. Nothing herein is to be construed as an admission that the invention is not entitled to antedate such disclosure by virtue of prior invention.


The practice of the present invention will employ, unless otherwise indicated, conventional techniques of tissue culture, immunology, molecular biology, microbiology, cell biology and recombinant DNA, which are within the skill of the art. See, e.g., Sambrook and Russell eds. (2001) Molecular Cloning: A Laboratory Manual, 3rd edition; the series Ausubel et al. eds. (2007) Current Protocols in Molecular Biology; the series Methods in Enzymology (Academic Press, Inc., N.Y.); MacPherson et al. (1991) PCR 1: A Practical Approach (IRL Press at Oxford University Press); MacPherson et al. (1995) PCR 2: A Practical Approach; Harlow and Lane eds. (1999) Antibodies, A Laboratory Manual; Freshney (2005) Culture of Animal Cells: A Manual of Basic Technique, 5th edition; Gait ed. (1984) Oligonucleotide Synthesis; U.S. Pat. No. 4,683,195; Hames and Higgins eds. (1984) Nucleic Acid Hybridization; Anderson (1999) Nucleic Acid Hybridization; Hames and Higgins eds. (1984) Transcription and Translation; Immobilized Cells and Enzymes (IRL Press (1986); Perbal (1984) A Practical Guide to Molecular Cloning; Miller and Calos eds. (1987) Gene Transfer Vectors for Mammalian Cells (Cold Spring Harbor Laboratory); Makrides ed. (2003) Gene Transfer and Expression in Mammalian Cells; and Mayer and Walker eds. (1987) Immunochemical Methods in Cell and Molecular Biology (Academic Press, London).


All numerical designations, e.g., pH, temperature, time, concentration, and molecular weight, including ranges, are approximations which are varied (+) or (−) by increments of 0.1. It is to be understood, although not always explicitly stated that all numerical designations are preceded by the term “about”. It also is to be understood, although not always explicitly stated, that the reagents described herein are merely exemplary and that equivalents of such are known in the art.


As will be understood by one skilled in the art, for any and all purposes, particularly in terms of providing a written description, all ranges disclosed herein also encompass any and all possible subranges and combinations of subranges thereof. Any listed range can be easily recognized as sufficiently describing and enabling the same range being broken down into at least equal halves, thirds, quarters, fifths, tenths, etc. As a non-limiting example, each range discussed herein can be readily broken down into a lower third, middle third and upper third, etc. As will also be understood by one skilled in the art all language such as “up to,” “at least,” “greater than,” “less than,” and the like include the number recited and refer to ranges which can be subsequently broken down into subranges as discussed above.


Definitions

As used in the specification and claims, the singular form “a”, “an” and “the” include plural references unless the context clearly dictates otherwise. For example, the term “a cell” includes a plurality of cells, including mixtures thereof.


As used herein, the term “comprising” or “comprises” is intended to mean that the compositions and methods include the recited elements, but not excluding others. “Consisting essentially of” when used to define compositions and methods, shall mean excluding other elements of any essential significance to the combination for the stated purpose. Thus, a composition consisting essentially of the elements as defined herein would not exclude trace contaminants from the isolation and purification method and pharmaceutically acceptable carriers, such as phosphate buffered saline, preservatives and the like. “Consisting of” shall mean excluding more than trace elements of other ingredients and substantial method steps for administering the compositions of this invention or process steps to produce a composition or achieve an intended result. Embodiments defined by each of these transition terms are within the scope of this invention.


The term “isolated” as used herein with respect to nucleic acids, such as DNA or RNA, refers to molecules separated from other DNAs or RNAs, respectively that are present in the natural source of the macromolecule. The term “isolated peptide fragment” is meant to include peptide fragments which are not naturally occurring as fragments and would not be found in the natural state. The term “isolated” is also used herein to refer to polypeptides, antibodies, proteins, host cells and polynucleotides that are isolated from other cellular proteins or tissues and is meant to encompass both purified and recombinant polypeptides, antibodies, proteins and polynucleotides. In other embodiments, the term “isolated” means separated from constituents, cellular and otherwise, in which the cell, tissue, polynucleotide, peptide, polypeptide, protein, antibody or fragment(s) thereof, which are normally associated in nature and can include at least 80%, or alternatively at least 85%, or alternatively at least 90%, or alternatively at least 95%, or alternatively at least 98%, purified from a cell or cellular extract. For example, an isolated polynucleotide is separated from the 3′ and 5′ contiguous nucleotides with which it is normally associated in its native or natural environment, e.g., on the chromosome. An isolated cell, for example, is a cell that is separated form tissue or cells of dissimilar phenotype or genotype. As is apparent to those of skill in the art, a non-naturally occurring polynucleotide, peptide, polypeptide, protein, antibody or fragment(s) thereof, does not require “isolation” to distinguish it from its naturally occurring counterpart.


The term “binding” or “binds” as used herein are meant to include interactions between molecules that may be detected using, for example, a hybridization assay. The terms are also meant to include “binding” interactions between molecules. Interactions may be, for example, protein-protein, antibody-protein, protein-nucleic acid, protein-small molecule or small molecule-nucleic acid in nature. This binding can result in the formation of a “complex” comprising the interacting molecules. A “complex” refers to the binding of two or more molecules held together by covalent or non-covalent bonds, interactions or forces.


Hybridization reactions can be performed under conditions of different “stringency”. In general, a low stringency hybridization reaction is carried out at about 40° C. in about 10×SSC or a solution of equivalent ionic strength/temperature. A moderate stringency hybridization is typically performed at about 50° C. in about 6×SSC, and a high stringency hybridization reaction is generally performed at about 60° C. in about 1×SSC. Hybridization reactions can also be performed under “physiological conditions” which is well known to one of skill in the art. A non-limiting example of a physiological condition is the temperature, ionic strength, pH and concentration of Mg2+ normally found in a cell.


The term “polypeptide” is used interchangeably with the term “protein” and in its broadest sense refers to a compound of two or more subunit amino acids, amino acid analogs or peptidomimetics. The subunits may be linked by peptide bonds. In another embodiment, the subunit may be linked by other bonds, e.g., ester, ether, etc. As used herein the term “amino acid” refers to natural and/or unnatural or synthetic amino acids, including glycine and both the D and L optical isomers, amino acid analogs and peptidomimetics. A peptide of three or more amino acids is commonly called an oligopeptide if the peptide chain is short. If the peptide chain is long, the peptide is commonly called a polypeptide or a protein. The term “peptide fragment” as used herein, also refers to a peptide chain.


The phrase “equivalent polypeptide” or “biologically equivalent peptide or peptide fragment” or “biologically equivalent polynucleotide” refers to a protein or a peptide fragment which is homologous to the exemplified reference polynucleotide, protein or peptide fragment and which exhibit similar biological activity in vitro or in vivo, e.g., approximately 100%, or alternatively, over 90% or alternatively over 85% or alternatively over 70%, as compared to the standard or control biological activity. Additional embodiments within the scope of this invention are identified by having more than 60%, or alternatively, more than 65%, or alternatively, more than 70%, or alternatively, more than 75%, or alternatively, more than 80%, or alternatively, more than 85%, or alternatively, more than 90%, or alternatively, more than 95%, or alternatively more than 97%, or alternatively, more than 98% or 99% sequence identity or homology. Percentage homology can be determined by sequence comparison using programs such as BLAST run under appropriate conditions. In one aspect, the program is run under default parameters.


The term “polynucleotide” refers to a polymeric form of nucleotides of any length, either deoxyribonucleotides or ribonucleotides or analogs thereof. Polynucleotides can have any three-dimensional structure and may perform any function, known or unknown. The following are non-limiting examples of polynucleotides: a gene or gene fragment (for example, a probe, primer, or EST), exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, ribozymes, cDNA, RNAi, siRNA, recombinant polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, isolated RNA of any sequence, nucleic acid probes and primers. A polynucleotide can comprise modified nucleotides, such as methylated nucleotides and nucleotide analogs. If present, modifications to the nucleotide structure can be imparted before or after assembly of the polynucleotide. The sequence of nucleotides can be interrupted by non-nucleotide components. A polynucleotide can be further modified after polymerization, such as by conjugation with a labeling component. The term also refers to both double- and single-stranded molecules. Unless otherwise specified or required, any embodiment of this invention that is a polynucleotide encompasses both the double-stranded form and each of two complementary single-stranded forms known or predicted to make up the double-stranded form.


A polynucleotide is composed of a specific sequence of four nucleotide bases: adenine (A); cytosine (C); guanine (G); thymine (T); and uracil (U) for thymine when the polynucleotide is RNA. Thus, the term “polynucleotide sequence” is the alphabetical representation of a polynucleotide molecule. This alphabetical representation can be input into databases in a computer having a central processing unit and used for bioinformatics applications such as functional genomics and homology searching.


“Homology” or “identity” or “similarity” are synonymously and refers to sequence similarity between two peptides or between two nucleic acid molecules. Homology can be determined by comparing a position in each sequence which may be aligned for purposes of comparison. When a position in the compared sequence is occupied by the same base or amino acid, then the molecules are homologous at that position. A degree of homology between sequences is a function of the number of matching or homologous positions shared by the sequences. An “unrelated” or “non-homologous” sequence shares less than 40% identity, or alternatively less than 25% identity, with one of the sequences of the present invention.


A polynucleotide or polynucleotide region (or a polypeptide or polypeptide region) has a certain percentage (for example, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99%) of “sequence identity” to another sequence means that, when aligned, that percentage of bases (or amino acids) are the same in comparing the two sequences. This alignment and the percent homology or sequence identity can be determined using software programs known in the art, for example those described in Ausubel et al. eds. (2007) Current Protocols in Molecular Biology. Preferably, default parameters are used for alignment. One alignment program is BLAST, using default parameters. In particular, programs are BLASTN and BLASTP, using the following default parameters: Genetic code=standard; filter=none; strand=both; cutoff=60; expect=10; Matrix=BLOSUM62; Descriptions=50 sequences; sort by=HIGH SCORE; Databases=non-redundant, GenBank+EMBL+DDBJ+PDB+GenBank CDS translations+SwissProtein+SPupdate+PIR. Details of these programs can be found at the following Internet address: http://www.ncbi.nlm.nih.gov/blast/Blast.cgi, last accessed on Nov. 26, 2007. Biologically equivalent polynucleotides are those having the specified percent homology and encoding a polypeptide having the same or similar biological activity.


The term “non-contiguous” refers to the presence of an intervening peptide, nucleotide, polypeptide or polynucleotide between a specified region and/or sequence. For example, two polypeptide sequences are non-contiguous because the two sequences are separated by a polypeptide sequences that is not homologous to either of the two sequences. Non-limiting intervening sequences are comprised of at least a single amino acid or nucleotide.


A “gene” refers to a polynucleotide containing at least one open reading frame (ORF) that is capable of encoding a particular polypeptide or protein after being transcribed and translated. Any of the polynucleotide or polypeptide sequences described herein may be used to identify larger fragments or full-length coding sequences of the gene with which they are associated. Methods of isolating larger fragment sequences are known to those of skill in the art.


The term “express” refers to the production of a gene product such as RNA or a polypeptide or protein.


As used herein, “expression” refers to the process by which polynucleotides are transcribed into mRNA and/or the process by which the transcribed mRNA is subsequently being translated into peptides, polypeptides, or proteins. If the polynucleotide is derived from genomic DNA, expression may include splicing of the mRNA in an eukaryotic cell.


The term “encode” as it is applied to polynucleotides refers to a polynucleotide which is said to “encode” a polypeptide if, in its native state or when manipulated by methods well known to those skilled in the art, it can be transcribed and/or translated to produce the mRNA for the polypeptide and/or a fragment thereof. The antisense strand is the complement of such a nucleic acid, and the encoding sequence can be deduced there from.


Applicant have provided herein the polypeptide and/or polynucleotide sequences for use in gene and protein transfer and expression techniques described below. It should be understood, although not always explicitly stated that the sequences provided herein can be used to provide the expression product as well as substantially identical sequences that produce a protein that has the same biological properties. These “biologically equivalent” or “biologically active” polypeptides are encoded by equivalent polynucleotides as described herein. They may possess at least 60%, or alternatively, at least 65%, or alternatively, at least 70%, or alternatively, at least 75%, or alternatively, at least 80%, or alternatively at least 85%, or alternatively at least 90%, or alternatively at least 95% or alternatively at least 98%, identical primary amino acid sequence to the reference polypeptide when compared using sequence identity methods run under default conditions. Specific polypeptide sequences are provided as examples of particular embodiments. Modifications to the sequences to amino acids with alternate amino acids that have similar charge.


A polynucleotide of this invention can be delivered to a cell or tissue using a gene delivery vehicle. “Gene delivery,” “gene transfer,” “transducing,” and the like as used herein, are terms referring to the introduction of an exogenous polynucleotide (sometimes referred to as a “transgene”) into a host cell, irrespective of the method used for the introduction. Such methods include a variety of well-known techniques such as vector-mediated gene transfer (by, e.g., viral infection/transfection, or various other protein-based or lipid-based gene delivery complexes) as well as techniques facilitating the delivery of “naked” polynucleotides (such as electroporation, “gene gun” delivery and various other techniques used for the introduction of polynucleotides). The introduced polynucleotide may be stably or transiently maintained in the host cell. Stable maintenance typically requires that the introduced polynucleotide either contains an origin of replication compatible with the host cell or integrates into a replicon of the host cell such as an extrachromosomal replicon (e.g., a plasmid) or a nuclear or mitochondrial chromosome. A number of vectors are known to be capable of mediating transfer of genes to mammalian cells, as is known in the art and described herein.


A “composition” is intended to mean a combination of active polypeptide, polynucleotide or antibody and another compound or composition, inert (e.g. a detectable label) or active (e.g. a gene delivery vehicle) alone or in combination with a carrier which can in one embodiment be a simple carrier like saline or pharmaceutically acceptable or a solid support as defined below.


A “pharmaceutical composition” is intended to include the combination of an active polypeptide, polynucleotide or antibody with a carrier, inert or active such as a solid support, making the composition suitable for diagnostic or therapeutic use in vitro, in vivo or ex vivo.


As used herein, the term “pharmaceutically acceptable carrier” encompasses any of the standard pharmaceutical carriers, such as a phosphate buffered saline solution, water, and emulsions, such as an oil/water or water/oil emulsion, and various types of wetting agents. The compositions also can include stabilizers and preservatives. For examples of carriers, stabilizers and adjuvants, see Martin (1975) Remington's Pharm. Sci., 15th Ed. (Mack Publ. Co., Easton).


A “subject,” “individual” or “patient” is used interchangeably herein, and refers to a vertebrate, preferably a mammal, more preferably a human. Mammals include, but are not limited to, murines, rats, rabbits, simians, bovines, ovines, porcines, canines, felines, farm animals, sport animals, pets, equines, and primates, particularly humans.


“Cell,” “host cell” or “recombinant host cell” are terms used interchangeably herein. It is understood that such terms refer not only to the particular subject cell but to the progeny or potential progeny of such a cell. The cells can be of any one or more of the type murine, rat, rabbit, simian, bovine, ovine, porcine, canine, feline, equine, and primate, particularly human.


Because certain modifications may occur in succeeding generations due to either mutation or environmental influences, such progeny may not, in fact, be identical to the parent cell, but are still included within the scope of the term as used herein.


Herpesviridae is a large family of DNA viruses that cause diseases in animals, including humans. Non-limiting examples of the members include HSV-1, HSV-2, varicella zoster virus, Epstein-Barr virus, cytomegalovirus, varicella-zoster virus, human herpesvirus 6A and 6B, and Karposi's sarcoma-associated herpesvirus.


“Treating,” “treatment,” or “ameliorating” of a disease includes: (1) preventing the disease, i.e., causing the clinical symptoms of the disease not to develop in a patient that may be predisposed to the disease but does not yet experience or display symptoms of the disease; and/or (2) inhibiting the disease, i.e., arresting or reducing the development of the disease or its clinical symptoms; and/or (3) relieving the disease, i.e., causing regression of the disease or its clinical symptoms.


The term “suffering” as it related to the term “treatment” refers to a patient or individual who has been diagnosed with or is predisposed to a disease or infection by a virus. A patient may also be referred to being “at risk of suffering” from a disease or infection by a virus. This patient has not yet developed characteristic disease pathology, however are known to be predisposed to the disease due to family history, being genetically predispose to developing the disease, or diagnosed with a disease or disorder that predisposes them to developing the disease to be treated.


An “effective amount” is an amount sufficient to effect beneficial or desired results. An effective amount can be administered in one or more administrations, applications or dosages. Such delivery is dependent on a number of variables including the time period for which the individual dosage unit is to be used, the bioavailability of the therapeutic agent, the route of administration, etc. It is understood, however, that specific dose levels of the therapeutic agents disclosed herein for any particular subject depends upon a variety of factors including the activity of the specific compound employed, bioavailability of the compound, the route of administration, the age of the animal and its body weight, general health, sex, the diet of the animal, the time of administration, the rate of excretion, the drug combination, and the severity of the particular disorder being treated and form of administration. In general, one will desire to administer an amount of the compound that is effective to achieve a serum level commensurate with the concentrations found to be effective in vivo. These considerations, as well as effective formulations and administration procedures are well known in the art and are described in standard textbooks.


“Under transcriptional control” is a term well understood in the art and indicates that transcription of a polynucleotide sequence, usually a DNA sequence, depends on its being operatively linked to an element which contributes to the initiation of, or promotes, transcription. “Operatively linked” intends the polynucleotides are arranged in a manner that allows them to function in a cell.


A “probe” when used in the context of polynucleotide manipulation refers to an oligonucleotide that is provided as a reagent to detect a target potentially present in a sample of interest by hybridizing with the target. Usually, a probe will comprise a detectable label or a means by which a label can be attached, either before or subsequent to the hybridization reaction. Alternatively, a “probe” can be a biological compound such as a polypeptide, antibody, or fragments thereof that is capable of binding to the target potentially present in a sample of interest.


“Detectable labels” or “markers” include, but are not limited to radioisotopes, fluorochromes, chemiluminescent compounds, dyes, and proteins, including enzymes. Detectable labels can also be attached to a polynucleotide, polypeptide, antibody or composition described herein.


A “primer” is a short polynucleotide, generally with a free 3′—OH group that binds to a target or “template” potentially present in a sample of interest by hybridizing with the target, and thereafter promoting polymerization of a polynucleotide complementary to the target. A “polymerase chain reaction” (“PCR”) is a reaction in which replicate copies are made of a target polynucleotide using a “pair of primers” or a “set of primers” consisting of an “upstream” and a “downstream” primer, and a catalyst of polymerization, such as a DNA polymerase, and typically a thermally-stable polymerase enzyme. Methods for PCR are well known in the art, and taught, for example in MacPherson et al. (1991) PCR 1: A Practical Approach (IRL Press at Oxford University Press). All processes of producing replicate copies of a polynucleotide, such as PCR or gene cloning, are collectively referred to herein as “replication.” A primer can also be used as a probe in hybridization reactions, such as Southern or Northern blot analyses. Sambrook and Russell (2001), infra.


“Hybridization” refers to a reaction in which one or more polynucleotides react to form a complex that is stabilized via hydrogen bonding between the bases of the nucleotide residues. The hydrogen bonding may occur by Watson-Crick base pairing, Hoogstein binding, or in any other sequence-specific manner. The complex may comprise two strands forming a duplex structure, three or more strands forming a multi-stranded complex, a single self-hybridizing strand, or any combination of these. A hybridization reaction may constitute a step in a more extensive process, such as the initiation of a PCR reaction, or the enzymatic cleavage of a polynucleotide by a ribozyme.


Hybridization reactions can be performed under conditions of different “stringency”. In general, a low stringency hybridization reaction is carried out at about 40° C. in 10×SSC or a solution of equivalent ionic strength/temperature. A moderate stringency hybridization is typically performed at about 50° C. in 6×SSC, and a high stringency hybridization reaction is generally performed at about 60° C. in 1×SSC. Hybridization reactions can also be performed under “physiological conditions” which is well known to one of skill in the art. A non-limiting example of a physiological condition is the temperature, ionic strength, pH and concentration of Mg2 normally found in a cell.


When hybridization occurs in an antiparallel configuration between two single-stranded polynucleotides, the reaction is called “annealing” and those polynucleotides are described as “complementary”. A double-stranded polynucleotide can be “complementary” or “homologous” to another polynucleotide, if hybridization can occur between one of the strands of the first polynucleotide and the second. “Complementarity” or “homology” (the degree that one polynucleotide is complementary with another) is quantifiable in terms of the proportion of bases in opposing strands that are expected to form hydrogen bonding with each other, according to generally accepted base-pairing rules.


The term “propagate” means to grow a cell or population of cells. The term “growing” also refers to the proliferation of cells in the presence of supporting media, nutrients, growth factors, support cells, or any chemical or biological compound necessary for obtaining the desired number of cells or cell type.


The term “culturing” refers to the in vitro propagation of cells or organisms on or in media of various kinds. It is understood that the descendants of a cell grown in culture may not be completely identical (i.e., morphologically, genetically, or phenotypically) to the parent cell.


A “viral vector” is defined as a recombinantly produced virus or viral particle that comprises a polynucleotide to be delivered into a host cell, either in vivo, ex vivo or in vitro. Examples of viral vectors include retroviral vectors, lentiviral vectors, adenovirus vectors, adeno-associated virus vectors, alphavirus vectors and the like. Alphavirus vectors, such as Semliki Forest virus-based vectors and Sindbis virus-based vectors, have also been developed for use in gene therapy and immunotherapy. See, Schlesinger and Dubensky (1999) Curr. Opin. Biotechnol. 5:434-439 and Ying, et al. (1999) Nat. Med. 5(7):823-827.


In aspects where gene transfer is mediated by a lentiviral vector, a vector construct refers to the polynucleotide comprising the lentiviral genome or part thereof, and a therapeutic gene. As used herein, “lentiviral mediated gene transfer” or “lentiviral transduction” carries the same meaning and refers to the process by which a gene or nucleic acid sequences are stably transferred into the host cell by virtue of the virus entering the cell and integrating its genome into the host cell genome. The virus can enter the host cell via its normal mechanism of infection or be modified such that it binds to a different host cell surface receptor or ligand to enter the cell. Retroviruses carry their genetic information in the form of RNA; however, once the virus infects a cell, the RNA is reverse-transcribed into the DNA form which integrates into the genomic DNA of the infected cell. The integrated DNA form is called a provirus. As used herein, lentiviral vector refers to a viral particle capable of introducing exogenous nucleic acid into a cell through a viral or viral-like entry mechanism. A “lentiviral vector” is a type of retroviral vector well-known in the art that has certain advantages in transducing nondividing cells as compared to other retroviral vectors. See, Trono D. (2002) Lentiviral vectors, New York: Spring-Verlag Berlin Heidelberg.


Lentiviral vectors of this invention are based on or derived from oncoretroviruses (the sub-group of retroviruses containing MLV), and lentiviruses (the sub-group of retroviruses containing HIV). Examples include ASLV, SNV and RSV all of which have been split into packaging and vector components for lentiviral vector particle production systems. The lentiviral vector particle according to the invention may be based on a genetically or otherwise (e.g. by specific choice of packaging cell system) altered version of a particular retrovirus.


That the vector particle according to the invention is “based on” a particular retrovirus means that the vector is derived from that particular retrovirus. The genome of the vector particle comprises components from that retrovirus as a backbone. The vector particle contains essential vector components compatible with the RNA genome, including reverse transcription and integration systems. Usually these will include gag and pol proteins derived from the particular retrovirus. Thus, the majority of the structural components of the vector particle will normally be derived from that retrovirus, although they may have been altered genetically or otherwise so as to provide desired useful properties. However, certain structural components and in particular the env proteins, may originate from a different virus. The vector host range and cell types infected or transduced can be altered by using different env genes in the vector particle production system to give the vector particle a different specificity.


“RNA interference” (RNAi) refers to sequence-specific or gene specific suppression of gene expression (protein synthesis) that is mediated by short interfering RNA (siRNA).


“Short interfering RNA” (siRNA) refers to double-stranded RNA molecules (dsRNA), generally, from about 10 to about 30 nucleotides in length that are capable of mediating RNA interference (RNAi), or 11 nucleotides in length, 12 nucleotides in length, 13 nucleotides in length, 14 nucleotides in length, 15 nucleotides in length, 16 nucleotides in length, 17 nucleotides in length, 18 nucleotides in length, 19 nucleotides in length, 20 nucleotides in length, 21 nucleotides in length, 22 nucleotides in length, 23 nucleotides in length, 24 nucleotides in length, 25 nucleotides in length, 26 nucleotides in length, 27 nucleotides in length, 28 nucleotides in length, or 29 nucleotides in length. As used herein, the term siRNA includes short hairpin RNAs (shRNAs).


“Double stranded RNA” (dsRNA) refer to double stranded RNA molecules that may be of any length and may be cleaved intracellularly into smaller RNA molecules, such as siRNA. In cells that have a competent interferon response, longer dsRNA, such as those longer than about 30 base pair in length, may trigger the interferon response. In other cells that do not have a competent interferon response, dsRNA may be used to trigger specific RNAi.


The term siRNA includes short hairpin RNAs (shRNAs). shRNAs comprise a single strand of RNA that forms a stem-loop structure, where the stem consists of the complementary sense and antisense strands that comprise a double-stranded siRNA, and the loop is a linker of varying size. The stem structure of shRNAs generally is from about 10 to about 30 nucleotides in length. For example, the stem can be 10-30 nucleotides in length, or alternatively, 12-28 nucleotides in length, or alternatively, 15-25 nucleotides in length, or alternatively, 19-23 nucleotides in length, or alternatively, 21-23 nucleotides in length.


Tools to assist siRNA design are readily available to the public. For example, a computer-based siRNA design tool is available on the internet at www.dharmacon.com, Ambion-www.ambion.com/jp/techlib/misc/siRNA_finder.html; Thermo Scientific-Dharmacon-www.dharmacon.com/DesignCenter/DesignCenterPage.aspx; Bioinformatics Research Center-sysbio.kribb.re.kr:8080/AsiDesigner/menuDesigner.jsf; and Invitrogen-maidesigner.invitrogen.com/maiexpress/.


As used herein, the term “purification label” refers to at least one marker useful for purification or identification. A non-exhaustive list of this marker includes His, lacZ, GST, maltose-binding protein, NusA, BCCP, c-myc, CaM, FLAG, GFP, YFP, cherry, thioredoxin, poly(NANP), V5, Snap, HA, chitin-binding protein, Softag 1, Softag 3, Strep, or S-protein. Suitable direct or indirect fluorescence marker comprise FLAG, GFP, YFP, RFP, dTomato, cherry, Cy3, Cy 5, Cy 5.5, Cy 7, DNP, AMCA, Biotin, Digoxigenin, Tamra, Texas Red, rhodamine, Alexa fluors, FITC, TRITC or any other fluorescent dye or hapten.


Modes for Carrying Out the Aspects of the Disclosure

Applicant has identified a mechanism by which certain virus evade a host's innate immune response. Provided herein are compositions and methods that build upon this discovery. To that end, in one aspect provided herein is an isolated polynucleotide encoding a RIG-I-QQ mutant and equivalents thereof. Non-limiting examples of equivalents include polynucleotides that hybridize under stringen conditions to the polynucleotide (e.g., a polynucleotide encoding SEQ ID NO. 4) and sequences having at least 70% sequence identity to a polynucleotide encoding SEQ ID NO. 4). In one aspect, the isolated polynucleotide encodes the polypeptide shown in SEQ ID NO. 4, and equivalents that retain amino acids at positions 495 and/or 549 that make the protein deaminase resistant, e.g., substitution of Q at position positions 495 and/or 549. The isolated polynucleotides can be included within a vector or other gene delivery vehicle or isolated host cell. The RIG-I-QQ polypeptide and equivalents thereof can be combined or contained with a host cell and/or with a carrier, such as a pharmaceutically acceptable carrier. The polypeptide and proteins can be chemically and/or recombinantly produced using methods known in the art, using host cells containing the polynucleotides and culturing the cell under conditions for expression and/or replication of the polynucleotides or polypeptides. The polypeptides and polynucleotides can be further combined with a detectable label or a purification label and used for purification and/or in drug development screens. Compositions containing the RIG-I-QQ mutant or an equivalent thereof can be combined with other antiviral agents and immune enhancing compositions such as an vaccine adjuvant.


Also provided herein is an isolated mutated UL37 polynucleotide that fails to deaminate RIG-I polypeptide. Non-limiting examples of such is the polynucleotide is a modified wild-type UL 37 mutated at positions 819 and 850, e.g., identified herein as UL 37 C819S (SEQ ID NO.: 2) and C850S, and equivalents thereof. The isolated mutated UL37 polynucleotides fail to deaminate RIG-I polypeptide. The mutated polynucleotides can be combined with a label, e.g., a detectable or purification label for screening, probes, primers or other assays. The polynucleotides can be chemically or recombinantly produced using methods known in the art. In one aspect, they are combined within an HSV vector or virus and are useful in vaccine compositions. In one aspect, they are combined with a carrier, such as a pharmaceutically acceptable carrier and/or adjuvant. The compositions can be incorporated into a kit and can further contain instructions for use, e.g., in the methods disclosed herein.


The agents and compositions of the present disclosure can be used in the manufacture of medicaments and for the treatment of humans and other animals by administration in accordance with conventional procedures, such as an active ingredient in pharmaceutical compositions.


Also provided are methods for one or more of:

    • a. inhibiting viral replication in a cell or tissue;
    • b. abolishing 5′-ppp-RNA-binding and ATP hydrolysis is a cell or tissue;
    • c. switching off RIG-1 in a cell or tissue;
    • d. blocking RNA-induced activation in a cell or tissue;
    • e. inhibiting the deamidation activity of UL37 in a cell or tissue; or
    • f. inducing an anti-viral immune in a tissue;
    • g. inducing expression of anti-viral cytokine genes; or
    • h. enhancing adaptive immunity,


      by contacting the cell, virus or tissue or surface containing the virus with an effective amount of a composition as described herein, e.g., an agent that inhibits the deamidation activity of UL 37. Non-limiting examples of agents that inhibit the deamidation activity of UL 37 polynucleotide include, for example, antisense UL37 polynucleotides and UL37 interfering RNA molecules, e.g., siRNA, dsRNA, and shRNA, mutated RIG-I-QQ polynucleotide or an equivalent thereof, a mutated RIG-I-QQ polypeptide or an equivalent thereof, a mutated UL37 polynucleotide or an equivalent thereof, or a virus containing mutated UL37 polynucleotide or an equivalent thereof. Virus that are inhibited by the method includes virus that deaminate RIG-I, e.g., a virus of the class Herpesviradae, e.g., HSV-1, HSV-2, Varicella Zoster Virus and HCMV. The agents can be combined with a carrier for ease of administration.


The contacting can be performed in vitro or in vivo. When performed in vivo, the agent is administered to a subject infected with the virus or for whom prophylaxis is desired. Any suitable method of administration can be used in the method, e.g., topical, intravenous, by inhalation therapy. The subject is any animal that is susceptible to the viral infection e.g., a mammal or a human. The method can further comprise administration of an effective amount of an antiviral agent.


Administration can be effected in one dose, continuously or intermittently throughout the course of treatment. Methods of determining the most effective means and dosage of administration are known to those of skill in the art and will vary with the composition used for therapy, the purpose of the therapy, the infection being treated, and the subject being treated. Single or multiple administrations can be carried out with the dose level and pattern being selected by the treating physician. Suitable dosage formulations and methods of administering the agents are known in the art. Route of administration can also be determined and method of determining the most effective route of administration are known to those of skill in the art and will vary with the composition used for treatment, the purpose of the treatment, the health condition or disease stage of the subject being treated, and target cell or tissue. Non-limiting examples of route of administration include oral administration, nasal administration, injection, topical application, intraperitoneal, intravenous and by inhalation.


Also provided herein is a method for one or more of:

    • a. inhibiting viral replication in subject infected with a virus;
    • b. abolishing 5′-ppp-RNA-binding and ATP hydrolysis in a subject;
    • c. switching off RIG-1 in a subject;
    • d. blocking RNA-induced activation in a subject;
    • e. inhibiting the deamidation activity of UL37 in a subject; or
    • f. inducing an anti-viral immune in a subject;
    • g. inducing expression of anti-viral cytokine genes; or
    • h. enhancing adaptive immunity,


      comprising administering to the subject an effective amount of the composition as described herein. In one aspect, the composition comprises an effective amount of mutated RIG-I polypeptide as described above (e.g., RIG-I-QQ or an equivalent thereof) or a virus containing a mutated UL37 polynucleotide that lacks the ability to deaminate RIG-I polypeptide. Any suitable method of administration can be used in the method, e.g., topical, intravenous, by inhalation therapy. The subject is any animal that is susceptible to the viral infection e.g., a mammal or a human. The method can further comprise administration of an effective amount of an antiviral agent. The virus and viral infections include virus that deaminate RIG-I, e.g., a virus of the class Herpesviradae, e.g., HSV-1, HSV-2, Varicella Zoster Virus and HCMV. The agents can be combined with a carrier for ease of administration.


“Administration” can be effected in one dose, continuously or intermittently throughout the course of treatment. Methods of determining the most effective means and dosage of administration are known to those of skill in the art and will vary with the composition used for therapy, the purpose of the therapy, the virus being treated, and the subject being treated. Single or multiple administrations can be carried out with the dose level and pattern being selected by the treating physician. Suitable dosage formulations and methods of administering the agents are known in the art. Route of administration can also be determined and method of determining the most effective route of administration are known to those of skill in the art and will vary with the composition used for treatment, the purpose of the treatment, the health condition or disease stage of the subject being treated, and target cell or tissue. Non-limiting examples of route of administration include oral administration, nasal administration, injection, topical application, intraperitoneal, intravenous and by inhalation.


Materials and Methods
Experiment No. 1
Cell Lines and Viruses

HEK293T, HeLa, Vero, HCT116, BHK21, mouse embryonic fibroblasts (MEFs) and human foreskin fibroblasts (HFF) were cultured in Dulbecco's modified Eagle's medium (DMEM, Corning) supplemented with 10% heat-inactivated fetal bovine serum (FBS; HyClone), penicillin (100 U/mL) and streptomycin (100 μg/mL). Wild-type and Rig-i−/− MEFs were described previously (Kato et al., 2005). Wild-type HSV-1 (KOS strain), GFP HSV-1 and HSV-1 recombinant viruses were amplified in Vero cells, with viral titers ranging from 107 to 108 pfu/ml. eGFP VSV (Dr. Sean Whelan) was amplified in BHK21 cells, with viral titer of 109 pfu/ml. Sendai virus was purchased from Charles River Laboratories.


Constructs

Luciferase reporter plasmids for the NF-κB, IFN-β promoter, PRDIII (ISRE) promoter, mammalian expression plasmids for RIG-I and their truncated mutants, MDA5, MAVS, IKKβ, TBK1, IRF3-5D, RelA were described previously (Dong et al., 2010; Dong and Feng, 2011; Dong et al., 2012; He et al., 2015; Seth et al., 2005). The non-silencing (control) shRNA and shRNA against human RIG-I, human IFI16 and human STING were purchased from Thermo Scientific. HSV-1 expression library was described previously (Sen et al., 2013). Mammalian expression plasmids for truncated RIG-I and UL37, lentiviral expression plasmids for RIG-I and UL37 were generated by standard molecular biology techniques. All point mutants, including those of RIG-I and UL37, were generated by site-directed mutagenesis and confirm by sequencing. HSV-1ΔUL37 (KOS) and HSV-1(KOS) Bacmid was a gift from Dr. Thomas C. Mettenleiter.


Antibodies and Reagents

Antibody against UL37 was a gift from Dr. Weiming Yuan. Antibodies against GST (Z-5), IRF3 (FL-425), TRAF6 (D10) and RIG-I (H-300) were purchased from Santa Cruz Biotechnology. Antibodies against FLAG (M2, Sigma), V5 (A190-220A, Bethyl Group), RIG-I (SS1A, Enzo Life Sciences), STING (ab92605, Abcam), dsRNA-J2 (SCICONS), Sendai Virus (PD029, MBL), P-S172 TBK-1 (D52C2, Cell Signaling) and β-actin (Ab8226, Abcam) were purchased from the indicated suppliers. The glutamine analog 6-Diazo-5-oxo-L-norleucine (DON) was purchased from Sigma. Low molecular weight Poly [I:C] (31852-29-6), ppp-dsRNA (tlrl-3prna) and control-dsRNA (tlrl-3prnac) were purchased from InvivoGen Lipofectamine 2000 was purchased from Life Technologies.


DNA and RNA Transfection

For plasmid transfection in HEK293T cells, calcium phosphate transfection method was applied. 293T cells were plated at around 50%-60% confluence. For dsRNA and Poly [I:C] transfection in 293T cells and plasmid transfection in HeLa cells, Lipofectamine 2000 transfection reagent was used according to the manufacturer's instructions. Both cells were prepared at around 80%-90% confluence prior to transfection.


Lentivirus-Mediated Stable Cell Line Construction

Lentiviruses were produced as previously described (Dong and Feng, 2011; Feng et al., 2008). Briefly, HEK293T cells were transfected with the packaging plasmids VSV-G and DR8.9 and the pCDH lentiviral expression vector or lentiviral shRNA plasmids. At 48 h post transfection, supernatant was harvested and filtered (and concentrated by centrifugation if necessary). HEK293T cells, MEFs, HeLa, HCT116 or HFF cells were infected with the supernatant in the presence of polybrene (8 μg/ml) with centrifugation at 1800 rpm for 45 minutes. Cells were selected at 48 h post infection and maintained in 10% FBS DMEM supplemented with puromycin (1˜2 μg/ml).


Dual-Luciferase Reporter Assay

HEK293T cells, seeded in 24-well plates (˜50% cell density), were transfected with IFN-β, PRDIII (ISRE) or NF-κB reporter plasmid cocktail (50 ng of luciferase reporter plasmid and 5 ng of pRL Renilla luciferase control vector) and expression plasmid (empty plasmid, one or multiple plasmids depending on the experiment) by calcium phosphate precipitation. Cells were infected with SeV (100 HA/ml), HSV-1 for 16 h, transfected with Poly [I:C] for 16 h or directly harvested 30-36 h post transfection. Whole cell lysates were used to determine the activity of firefly luciferase and renilla luciferase by a microplate reader (FLUOstar Omega).


Plaque Assay

HSV-1 and VSV titer were determined by plaque assay on Vero monolayer essentially as previously described (Lieber and Bailer, 2013). Briefly, 10-fold serially-diluted virus-containing supernatant was added onto Vero cells and incubated for 2 h at 37° C. Then, DMEM containing 2% FBS and 1% methylcellulose (Sigma) was added after removing the supernatant. Plaques were counted at day 3 post-infection.


Confocal Microscopy

HFF cells were infected with HSV-1 for 8 h (MOI=50). HeLa cells were transfected with expression plasmid containing UL37 and subsequently infected with Sendai Virus for 6 h (200 HA/ml). Cells were fixed, permeabilized, stained with indicated primary antibody (1:100 dilution) and Alexa Fluor 488/594-congugated goat secondary antibody (1:200 dilution), and analyzed with confocal microscope (Leica). Representative images were shown for all analyses.


Protein Expression and Purification

HEK293T cells were transfected with expression vector containing Flag-tagged gene of interest. Cells were harvested and lysed with Triton X-100 buffer (20 mM Tris, pH 7.5, 150 mM NaCl, 1.5 mM MgCl2, 20 mM β-glycerophosphate, 1 mM sodium orthovanadate, 10% glycerol, 0.5 mM EGTA, 0.5% Triton X-100) supplemented with a protease inhibitor cocktail (Roche). Whole cell lysates were sonicated and centrifuged at 12,000 rpm for 15 min. Supernatant was harvested, filtered, pre-cleared with protein A/G agarose beads at 4° C. for 1 h and then incubated with anti-Flag agarose beads at 4° C. for 4 h. The agarose beads were washed extensively and eluted with 0.2 mg/ml 3× Flag peptide. The eluted proteins were analyzed by SDS gel electrophoresis and silver staining.


For recombinant protein expression and purification, E. coli B121(DE3) was transformed with pGEX-4T-1 or pET28 plasmid containing UL37. Recombinant GST-UL37 expression was induced by 0.1 mM IPTG at 20° C. Bacteria were harvested, lysed and incubated with glutathione sepharose 4B (GE) for 4 h at 4° C. Sepharose beads were washed extensively and GST-UL37 was eluted with 10 mM reduced glutathione. UL37 was then cleaved and purified from the fusion protein by TEV protease treatment at 4° C. overnight.


Co-Immunoprecipitation (Co-IP) and Immunoblotting

For Co-IP using exogenous protein, HEK293T cells were transfected with indicated expression plasmids for 48 h. For Co-IP using endogenous proteins, cells were directly harvested. Whole cell lysates were prepared with NP40 buffer (50 mM Tris-HCl, pH 7.4, 150 mM NaCl, 1% NP-40, 5 mM EDTA) supplemented with 20 mM β-glycerophosphate and 1 mM sodium orthovanadate. Whole cell lysates were sonicated, centrifuged and pre-cleared with protein A/G agarose for 1 h. Pre-cleared samples were then incubated with indicated antibodies overnight and protein A/G agarose for 1 h at 4° C., or with antibody/glutathione-conjugated agarose for 4 h at 4° C. The agarose beads were washed extensively and samples were eluted by boiling at 95° C. for 10 min. Precipitated proteins were analyzed by SDS gel electrophoresis and immunoblotting.


All immunoblottings were performed using the indicated primary antibodies (1:1000 dilution) and IRDye800-conjugated secondary antibodies (1:10,000 dilution, Licor). Proteins were visualized by Odyssey infrared imaging system (Licor).


Gel Filtration

Virus-infected HEK293T/Flag-RIG-I or HeLa/Flag-RIG-I stable cells were harvested and lysed in cold Triton X-100 buffer (20 mM Tris, pH 7.5, 150 mM NaCl, 1.5 mM MgCl2, 20 mM β-glycerophosphate, 1 mM sodium orthovanadate, 10% glycerol, 0.5 mM EGTA, 0.5% Triton X-100, 1 mM PMSF and 10 μg/ml leupeptin). Centrifuged supernatant was filtered and subjected to incubation with anti-Flag-conjugated agarose beads for 2 h at 4° C. Beads were then extensively washed and proteins were eluted with 3× Flag peptide at 0.2 mg/ml.


Gel filtration with superose 6 was performed as described previously. Briefly, eluted proteins (200-300 μl) were loaded to superose 6 column and subjected to gel filtration analysis with Buffer B (20 mM Tris-HCl, pH 7.6, 150 mM NaCl, 1 mM EDTA, 0.5 mM EGTA, 0.5% Triton X-100, 20 mM NaF, 20 mM β-glycerophosphate, 1 mM Na3VO4, 2.5 mM metabisulphite [sodium salt], 5 mM benzamidine). Elution was collected in 0.5 ml fractions and aliquots of fractions were analyzed by immunoblotting.


Mock- or HSV-1-infected cells (2×107) were harvested and lysed in 300 μl cold Triton X-100 buffer. Samples were sonicated briefly and centrifuged. Supernatant was filtered and loaded to superose 6 column and subjected to gel filtration analysis with Buffer B. Elution was collected in 0.5 ml fractions and aliquots of fractions were analyzed by immunoblotting.


Quantitative Real-Time PCR (qRT-PCR)


Quantitative Real-time PCR was performed as previously described. Cells were infected or treated with viruses or agents for indicated time period. Total RNA was extracted using TRIzol reagent (Invitrogen). Complementary cDNA was synthesized from DNase I-treated total RNA using reverse transcriptase (Invitrogen). cDNA was diluted and qRT-PCR was performed using SYBR Green Master Mix (Applied Biosystems) by real-time PCR instrument (Applied Biosystems). Relative mRNA expression for each target gene was calculated by the 2−ΔΔCt method using β-Actin as an internal control. The sequences of qRT-PCR primers are as follows:











Human β-actin










forward
5′-CTGGCACCCAGCACAATG-3′



reverse
5′-GCCGATCCACACGGAGTACT-3′













Human IFN-β










forward
5′-AGGACAGGATGAACTTTGAC-3′



reverse
5′-TGATAGACATTAGCCAGGAG-3′













Human ISG56










forward
5′-TCTCAGAGGAGCCTGGCTAA-3′



reverse
5′-TGACATCTCAATTGCTCCAG-3′













Human IL8










forward
5′-GGCACAAACTTTCAGAGACAG-3′



reverse
5′-ACACAGAGCTGCAGAAATCAGG-3′













HumanCXCL2










forward
5′-GGGCAGAAAGCTTGTCTCAA-3′



reverse
5′-GCTTCCTCCTTCCTTCTGGT-3′













Human IFI16










forward
5′-ACAAACCCGAGAAACAATGACC-3′



reverse
5′-GCATCTGAGGAGTCCGAAGA-3′













Mouse β-actin










forward
5′-ACGGCCAGGTCATCACTATTG-3′



reverse
5′-CAAGAAGGAAGGCTGGAAAAGA-3′













Mouse IFN-β










forward
5′-TCCGAGCAGAGATCTTCAGGAA-3′



reverse
5′-TGCAACCACCACTCATTCTGAG-3′













Mouse ISG56










forward
5′-ACC ATG GGA GAG AAT GCT GAT-3′



reverse
5′-GCC AGG AGG TTG TGC-3′






Cytokine ELISA

Commercial cytokine ELISA kits used in this study include: human IFN-β (PBL Assay Science) and human RANTES (R&D Systems). Cytokine levels in the supernatant from cultured cells were assessed according to manufacturer's instruction. Absorbance was determined with FLUOstar Omega (BMG Labtech.).


In Vitro ATPase Activity Assay

Purified RIG-I or RIG-I mutants were incubated with 5′-ppp-dsRNA (Invivogen) at 37° C. for 20 min in ATPase reaction buffer (50 mM Tris-HCl, pH 7.5, 2.5 mM MgCl2, and ATP. Released phosphates were measured using a PiColorLock™ phosphate detection reagent (Innova Biosciences). For reactions with varying concentrations of ATP, the concentrations of RIG-I proteins and RNA were 20 nM and 80 nM, respectively. For reactions with varying concentrations of the RNA, the concentrations of RIG-I proteins and ATP were 20 nM and 500 μM, respectively.


Mass Spectrometry Analysis

For identification of deamidation sites, HEK293T/Flag-RIG-I stable cell line was transfected with an expression plasmid containing UL37 or infected by HSV-1 for 10 h (MOI=10). Flag-RIG-I was purified by anti-Flag-conjugated agarose beads for 4 h at 4° C. Beads were then extensively washed and RIG-I was eluted with 3× Flag peptide at 0.2 mg/ml. Purified RIG-I was subjected to SDS page electrophoresis and gel slices were prepared for in-gel digestion and Mass Spectrometry analysis (Harvard Taplin Mass Spectrometry Facility).


For Cysteine labeling experiment, bacterial purified UL37 (571-1123) was treated with N-methylacetamide (Alfa Aesar) (1 μM) at room temperature for 45 min. Samples were then blocked with Iodoacetamide (Sigma) (50 mM) at room temperature for 1 h and subjected to Mass Spectrometry analysis (Poochon Scientific).


Statistical Analysis

Statistical analysis was performed by unpaired two-tailed Student's t-test. *, p<0.05; **, p<0.01; ***, p<0.001. A p-value less than 0.05 is considered statistically significant.


Experimental Methods
HSV-1 Evades RNA-Induced RIG-I Activation

Applicant previously reported that the vGAT proteins of human KSHV and murine γHV68 recruit PFAS to deamidate RIG-I. In addition to these gamma herpesviruses, HSV-1 infection also increased negative charge of RIG-I as analyzed by two-dimensional gel electrophoresis (2-DGE), indicative of deamidation (FIG. 1A). However, genomes of HSV-1 and other alpha-herpesviruses contain no homologue of gamma herpesvirus vGAT proteins, suggesting a distinct mechanism of RIG-I deamidation. Additionally, the antiviral roles of RIG-I against DNA viruses, such as herpesviruses, are not well defined. Thus, Applicant investigated whether HSV-1 infection induces RIG-I deamidation and determined the functional consequence of RIG-I deamidation on host immune responses.


To probe the roles of RIG-I in host defense against HSV-1, Applicant depleted RIG-I expression and assessed the IFN-β mRNA in primary human foreskin fibroblasts (HFF). Applicant found that knockdown of RIG-I reduced IFN-β mRNA at 24 h induced by HSV-1 infection (FIG. 8A). A similar effect was observed with knockdown of IF116, a DNA sensor implicated in detecting herpesviruses (Kerur et al., 2011, Unterholzner et al., 2010). In 293T and HeLa cells that HSV-1 replication is very robust, Applicant observed that RIG-I depletion impaired IFN and ISG56 induction at multiplicity of infection (MOI) of 0.1 and 0.5 (FIG. 8B, FIG. 5C). No difference in IFN and ISG56 induction was observed in RIG-I knockdown cells upon high MOI (=5) HSV-1 infection, suggesting that HSV-1 can blunt IFN induction. Moreover, RIG-I depletion in human THP-1 macrophages also reduced IFN-β and ISG56 mRNA induced by HSV-1 infection (FIG. 5D). These results indicate that RIG-I senses dsRNA produced by HSV-1-infected cells and contributes to the IFN induction by HSV-1 infection.


To determine whether HSV-1 infection inhibits RIG-I activation, Applicant sequentially infected 293T cells with HSV-1 and SeV, and determined IFN-β and ISG56 expression. Applicant found that HSV-1 infection significantly reduced IFN-β and ISG56 mRNA induced by SeV (FIG. 1B), which correlated with minimal IFN-β secretion (FIG. 1C). Importantly, HSV-1 infection did not significantly reduce SeV replication as evidenced by expression of the major protein of SeV (FIG. 1B and FIG. 1C). Furthermore, SeV infection induced the oligomerization of RIG-I that eluted in fractions corresponding to protein sizes between ˜440 kDa and ˜670 kDa, while RIG-I eluted in fraction corresponding to ˜230 kDa in mock-infected cells or cells that were infected with HSV-1 (FIG. 1D). Strikingly, RIG-I purified from HSV-1- and SeV-infected cells had an elution pattern identical to that of mock-infected cells. Finally, HSV-1 infection completely blunted the induction of IFN and ISG56 mRNA in 293T cells transfected with LMW poly [I:C] (FIG. 5E). These results show that HSV-1 inhibits RIG-I activation triggered by SeV, a prototype RIG-I activator.


HSV-1 UL37 Interacts with RIG-I


To delineate the mechanism by which HSV-1 abrogates RIG-I activation, Applicant screened for RIG-I-binding proteins by co-immunoprecipitation (Co-IP) using a HSV-1 expression library, with a particular focus on gene products that operate in the early phase of infection. Co-IP assays identified open reading frames UL21 and UL37 as RIG-I-interacting proteins (FIG. 2A and FIG. 9A). Although UL37 has no sequence homology with gamma herpesvirus vGAT proteins, it shares multiple functions with the vGAT proteins, e.g., activating NF-κB and promoting viral replication (Desai et al., 2001; Full et al., 2014; Gaspar et al., 2008; Liu et al., 2008). Thus, Applicant examined whether UL37 evades RIG-I-dependent immune defense. Indeed, UL37 was readily detected in protein complexes precipitated by antibody against RIG-I in HSV-1-infected 293T cells as early as 1 hour post-infection (hpi) at high MOI (=30) (FIG. 2B) and during late lytic replication at lower MOI (=1) (FIG. 2C). When expressed in 293T cells, UL37 co-precipitated with RIG-I (FIG. 9B), indicating that UL37 interacts with RIG-I in the absence of any other viral proteins. Interestingly, UL37 also interacted with MDA5 in transfected 293T cells (FIG. 9B). Gel filtration analyses further showed that UL37 co-eluted with RIG-I in fractions corresponding to ˜220-440 kDa, supporting that these proteins form a complex in HSV-1-infected cells (FIG. 9C). UL37 also partly co-eluted with its interacting partner TRAF6 by gel filtration analysis. Thus, UL37 interacts with RIG-I in HSV-1-infected or transfected cells.


HSV-1 UL37 Inhibits RIG-I Activation

Applicant established 293T cells stably expressing UL37 (FIG. 3A). Upon SeV infection, UL37 expression significantly reduced IFN-β and ISG56 expression by real-time PCR analysis (FIG. 3B) and reporter assays (FIG. 9A, FIG. 9B). UL37 did not reduce SeV protein expression (FIG. 3C). UL37 significantly up-regulated IL-8 and CxCL2 expression (FIG. 9C), likely due to the NF-κB activation by UL37 (Liu et al., 2008), while had a marginal effect on IL-8 expression upon SeV infection (FIG. 3B). ELISA further confirmed that UL37 expression reduced IFN-β secretion by ˜75% in response to SeV infection (FIG. 3D). Moreover, UL37 expression inhibited IFN induction upon LMW poly [I:C] transfection (FIG. 3E). Over-expression of UL37 was sufficient to activate NF-κB (FIG. 9D), but had no detectable effect on PRDIII, an IRF-responsive element of the IFN-β promoter (FIG. 9E). Loss of RIG-I in mouse embryonic fibroblasts (MEFs) had no effect on UL37-induced NF-κB activation (FIG. 9F). UL37 expression did not alter the transcription of the PRDIII promoter induced by MDA5 over-expression (FIG. 9F). These results collectively show that UL37 specifically inhibits RIG-I-dependent IFN-β induction.


To probe the effect of UL37 on signaling events downstream of RIG-I, Applicant analyzed the phosphorylation of TBK-1 (Ser172) and IRF3 (Ser396), markers of activated TBK-1 and IRF3, respectively. As shown in FIG. 3F, UL37 expression inhibited the phosphorylation of TBK-1 and IRF3 upon SeV infection. Moreover, UL37 expression reduced the dimerization and nuclear translocation of IRF3 (FIG. 9G and FIG. 9H). Using 293T cells stably expressing Flag-RIG-I and RIG-I-VS, Applicant found that UL37 expression abolished RIG-I dimerization upon SeV infection by Co-IP assay (FIG. 3G). Furthermore, UL37 diminished the SeV-induced oligomerization of RIG-I as analyzed by gel filtration (FIG. 3H). To test whether UL37 inhibits key components of the IRF-IFN pathway downstream of RIG-I, Applicant over-expressed MAVS, TBK-1 and the constitutively active IRF3-5D mutant (FIG. 3I) and examined the activation of the IFN-β reporter. Consistent with NF-κB activation by UL37 (FIG. 9D), Applicant found that UL37 enhanced, rather than inhibited, the transcription of the IFN-β reporter in a dose-dependent manner with all three components (FIG. 3J). UL37 did not alter the protein level of MAVS, TBK-1 and IRF3-5D. These results conclude that UL37 specifically targets RIG-I to block IFN induction by viral dsRNA.


UL37 Deamidates RIG-I

HSV-1 infection reduced the charge of RIG-I, suggesting that HSV-1 induces RIG-I deamidation. Applicant found that UL37 expression was sufficient to reduce the charge of RIG-I, but not that of R-actin (FIG. 4A). Furthermore, UL37 expression did not alter the charge of MDA5, an RNA sensor akin to RIG-I (FIG. 10A). Applicant thus purified RIG-I 293T stable cells upon HSV-1 infection or UL37 expression (FIG. 10B). Tandem mass spectrometry analyses of both samples identified two peptides that contained aspartates at residue 495 and 549, indicative of deamidation of N495 and N549 (FIG. 4B). HSV-1 infection and UL37 expression had similar effect on the deamidation of N495 and N549 (FIG. 4C), suggesting that UL37 is responsible for RIG-I deamidation during HSV-1 infection. When N495D and N549D were introduced into RIG-I, designated RIG-I-DD, Applicant found that RIG-I-DD migrated toward the positive end of the strip, to a position identical to that of RIG-I-WT when UL37 was expressed (FIG. 4D). Moreover, UL37 expression did not further shift RIG-I-DD by 2-DGE analysis, indicating that N495 and N549 are the two sites of deamidation by UL37.


To probe the mechanism of UL37-induced deamidation, Applicant first determined whether a specific inhibitor of glutamine amidotransferase, 6-diazo-5-oxo-L-norleucine (DON), can block UL37-induced RIG-I deamidation. Indeed, DON inhibited RIG-I deamidation in cells expressing UL37 (FIG. 10C). This result suggests that UL37-mediated deamidation of RIG-I depends on an enzymatic activity akin to glutamine amidotransferase. Thus, Applicant sought to determine whether UL37 is intrinsically a protein deamidase. Applicant purified UL37 full-length from E. co/i to homogeneity and examined RIG-I deamidation in vitro. Analysis by 2-DGE indicated that UL37 was sufficient to reduce RIG-I charge, suggestive of deamidation (FIG. 4E). These results indicate that UL37 deamidates RIG-I in cells and in vitro.


Deamidated RIG-I Fails to Sense RNA and Hydrolyze ATP

Applicant previously showed that m-vGAT induced deamidation and concomitant activation of RIG-I. However, the RIG-I-DD mutant failed to activate NF-κB and IFN-β reporters (FIG. 11A, FIG. 11B). N495 and N549 reside in the helicase 2i (Hel2i) domain that specializes in duplex RNA recognition (Luo et al., 2013). Previous structural analyses of RNA-bound RIG-I showed that these two residues flank the dsRNA-binding α-helix, α23 (FIG. 5A) (Kowalinski et al., 2011; Luo et al., 2011). Residues of the α-helix (α23), specifically K508 and Q511, make direct contact with dsRNA. While N495 precedes the α23 helix, N549 is located in the middle of a spatially adjacent α-helix (α24). Thus, Applicant opted to determine whether deamidation of N495 and N549 affects the RNA-binding ability of RIG-I, an important function for RNA detection by RIG-I. Applicant purified RIG-I-WT and its mutants to homogeneity from transfected 293T cells (FIG. 11C) and performed electrophoresis mobility shift assay (EMSA). Applicant found two distinct RIG-I:RNA complexes that correlated with increasing doses of RIG-I (FIG. 5B and FIG. 11D). The RIG-I-DD mutant was significantly impaired in forming the fast migrating RIG-I:RNA complex, while formed higher levels of the more slowly migrating RIG-I:RNA complex compared to RIG-I-WT (FIG. 5B). EMSA also showed that the deamidation of N549 had a major effect on the RNA-binding ability of RIG-I, while RIG-I-N495D demonstrated comparable RNA-binding affinity to RIG-I-WT (FIG. 11D). Using a control dsRNA 19 mer lacking the 5′-triphosphate, Applicant found that the more slowly migrating RIG-I:RNA complex consisted of RIG-I and dsRNA without 5′-triphophate (FIG. 11E). Interestingly, the K270A mutant previously shown to have impaired ATPase activity, bound to the 5′-triphosphate 19mer dsRNA (5′ppp-RNA) with affinity similar to RIG-I-WT (FIG. 11F). Although ATP hydrolysis is not required for RIG-I signaling, it has been proposed that ATPase activity is necessary for recycling of RIG-I from RNA-bound complexes and critical for RIG-I-mediated innate immune signaling against nonself RNA (Anchisi et al., 2015; Lassig et al., 2015; Luo et al., 2013). Applicant thus examined the ATP hydrolysis activity using purified RIG-I proteins. An in vitro ATPase assay showed that RIG-I-DD completely lost its ability to hydrolyze ATP (FIG. 5C). RIG-I-WT and RIG-I-DD demonstrated ATPase activity with kcat of 944 and 32.6 sec−1 at physiological ATP concentrations, respectively. Furthermore, RIG-I-DD failed to hydrolyze ATP upon 5′ppp-RNA stimulation (FIG. 5D). RIG-I-N549D and RIG-I-N495D demonstrated basal or no ATPase activity similar to RIG-I-DD, with or without 5′ppp-RNA (FIG. 11G, FIG. 11H). These results indicate that deamidation of N495 and N549 abolishes RIG-I activity to bind RNA and hydrolyze ATP.


To assess the functional consequence of RIG-I deamidation, Applicant examined RIG-I activation by gel filtration. SeV infection induced oligomerization of RIG-I-WT as evidenced by fractions corresponding to protein complexes of ˜440-670 kDa sizes, while RIG-I-WT in mock-infected cells eluted in fractions corresponding to ˜130-230 kDa (FIG. 5E). However, SeV infection failed to induce the oligomerization of RIG-I-DD. Notably, a low level of RIG-I-DD was detected in fractions corresponding to protein sizes of ˜440 kDa regardless of SeV infection. Applicant then “reconstituted” RIG-I expression in Rig-i−/− MEF with RIG-I-WT or RIG-I-DD (FIG. 5F), and examined host immune responses and viral infection. Compared to RIG-I-WT, RIG-I-DD induced basal or lower expression of IFN-β and ISG56 upon vesicular stomatitis virus (VSV) infection (FIG. 5G). Similar results were observed in SeV-infected cells (FIG. 11I). Consequently, RIG-I-WT, but not RIG-I-DD, reduced VSV replication in Rig-i−/− MEFs (FIG. 5H). These results demonstrate that deamidation of N495 and N549 eliminates RIG-I detection of viral RNA and restriction of viral replication.


A Deamidation-Resistant RIG-I-QQ Mutant Restores Antiviral Immune Responses Against HSV-1 Infection

Applicant's mutational analysis indicates that N549 is critical for the RNA-binding and ATPase activities of RIG-I. Previously solved crystal structure of RIG-I showed that the amide group of N549 (within α24) forms two hydrogen bonds with the backbone of threonine 504 of the RNA-binding α-helix (α23) (FIG. 6A) (Kowalinski et al., 2011; Luo et al., 2011). The RIG-I-N549A mutant failed to trigger IFN induction by SeV infection (data not shown), suggesting that the hydrogen bonds between N549 and T504 of the two neighboring helices are critical for RIG-I immune signaling. The side chain of glutamine contains a primary amide functional group as asparagine does. Applicant hypothesized that a glutamine residue at position 549 might conserve hydrogen bonds with T504, which translates to a predicted ˜1 angstrom short difference in hydrogen bonds formed by Q549 than N549, thereby potentially resisting deamidation. Applicant then generated a RIG-I mutant containing Q495 and Q549, designated RIG-I-QQ. In 293T cells stably expressing RIG-I-WT or the RIG-I-QQ mutant, UL37 expression shifted RIG-I-WT, but not RIG-I-QQ, toward the positive end of the strip, indicating that RIG-I-QQ is deamidation-resistant (FIG. 6B). Furthermore, RIG-I-QQ was eluted in fractions corresponding to sizes of ˜440-670 kDa in cells infected with HSV-1, demonstrating similar levels of oligomerization as RIG-I-WT (FIG. 1D) and RIG-I-QQ (FIG. 6C) induced by SeV infection. These results indicate that RIG-I-QQ is refractory to deamidation, and therefore, restores RIG-I activation induced by HSV-1 infection.


Applicant reasoned that only the deamidation-resistant RIG-I-QQ mutant will confer gain-of-function in RIG-I-mediated innate immune response, thus Applicant used wild-type HEK293 to establish stable cell lines expressing RIG-I wild-type and mutants. In resting cells, the level of phosphorylated TBK-1 (Ser172) was below detection in all four cell lines. HSV-1 infection increased the phosphorylation of TBK-1 to similar levels in control cells and cells expressing RIG-I-WT or RIG-I-DD (FIG. 6D). Remarkably, HSV-1 infection induced TBK-1 phosphorylation to much more pronounced levels in cells expressing RIG-I-QQ than the other three cell lines. Similar results were observed for phosphorylated IRF3. Consistent with this, HSV-1 infection also more significantly up-regulated IFN-β and ISG56 expression in RIG-I-QQ cells than control cells and cells expressing RIG-I-WT or RIG-I-DD (FIG. 6E). The low levels of IFN-β and ISG56 induction in the other three cell lines are likely due to activation of innate sensors other than RIG-I. Increased IFN-β and RANTES expression were detected only in the supernatant of HSV-1-infected 293T cells expressing RIG-I-QQ, but not the other three cell lines (FIG. 6F). To determine the antiviral activities of RIG-I wild-type and these mutants, Applicant examined viral replication in HEK293 stable cells. As shown in FIG. 6G, RIG-I-WT reduced HSV-1 replication by ˜50%, while RIG-I-DD had a marginal effect on HSV-1 replication. Consistent with the robust antiviral response induced by RIG-I-QQ, RIG-I-QQ reduced HSV-1 titer by ˜75-90% in HEK293 cells. These results show that the deamidation-resistant RIG-I-QQ restores RIG-I antiviral activity against HSV-1 and efficiently restricts HSV-1 replication.


The Carboxyl Terminal of UL37 Contains a Deamidase Domain

UL37 purified from E. coli is sufficient to deamidate RIG-I, implying that UL37 is a bonafide protein deamidase. Because all known protein deamidases (e.g., PFAS) are cysteine hydrolases (Zhao et al., 2016), Applicant suspect that UL37 also contains a catalytic cysteine residue. Thus, Applicant mutated all 14 cysteines of UL37 individually to serines and screened for the loss of inhibition of RIG-I-mediated activation of the PRDIII promoter upon SeV infection. The C819S and C850S mutants were identified to have greatly impaired blockade of PRDIII induction by SeV (FIG. 7A and FIG. 12A). Analysis by 2-DGE also showed that the C819S and C850S mutants of UL37 failed to induce RIG-I deamidation in transfected cells (FIG. 12B), indicating that these cysteines are required for the deamidase activity of UL37. Previous crystallography analysis showed that the N-terminus of UL37 adopts a helical bundle structure similar to multisubunit tethering complexes involved in intracellular trafficking (Pitts et al., 2014). Coupled with the observation that C819 and C850 are required for UL37 to deamidate RIG-I, Applicant reasoned that the C-terminal half (571-1123, designated UL37C) contains a protein deamidase domain. Applicant first determined whether UL37C was sufficient to block RIG-I-dependent IFN induction. Indeed, UL37C expression inhibited the SeV-induced transcription of PRDIII (FIG. 12C). Applicant then expressed and purified UL37C from E. coli to homogeneity for RIG-I deamidation studies. Consistent with results from transfected cells, UL37C was sufficient to deamidate RIG-I in vitro, demonstrating that UL37C contains intrinsic protein deamidase activity (FIG. 7B).


To pinpoint the cysteine residue of the active site. Applicant employed a small molecule electrophile for mass spectrometry analysis, an approach that was successfully used to quantitatively profile functional cysteines in proteomes (Weerapana et al., 2010). The rationale is that functional cysteines, such as those in enzymatic active sites, are hyper-reactive and react with small molecule electrophiles independent of concentration. As such, a ratio of the percentage of labeled peptides at high concentration to that at low concentration near 1 predicts functional cysteines. After reacting with 2-Chloro-N-(hydroxymethyl) acetamide (CNM), mass spectrometry analysis identified that C819 was primarily labeled by CNM within UL37C. Specifically, 38.3° % and 42.5% of C819 were labeled by CNM at 1 and 10 μM, respectively (FIG. 7C). C850 was labeled at minimal level (<10%) by CNM, suggesting that C850 is largely inaccessible. Taken together, these results support the conclusion that C819 is the active site of the catalytic triad.


To probe the roles of UL37-mediated deamidation in viral infection, Applicant introduced UL37 wild-type (UL37-WT) and UL37-C819S into the HSV-1 genome (designated HSV-1 UL37-WT and HSV-1 UL37-C819S) and examined RIG-I-mediated innate immune signaling. Gel electrophoresis of viral genomic DNA after BamHI digestion revealed identical pattern of migration, indicative of lack of large chromosome rearrangement (FIG. 12D). Immunoblotting analysis showed that UL37-WT and UL37-C819S were expressed at similar levels in 293T cells (FIG. 12E). Compared to HSV-1 UL37-WT, HSV-1 UL37-C819S failed to deamidate RIG-I by 2-DGE analysis (FIG. 7D). Infection of HSV-1 UL37-C819S, but not HSV-1 UL37-WT, induced RIG-I oligomerization corresponding to protein sizes of ˜440-670 kDa analyzed by gel filtration (FIG. 7E). Moreover, HSV-1 UL37-C819S induced higher levels of IFN-β and ISG56 expression (FIG. 7F), and IFN-β and RANTES secretion (FIG. 7G) in THP-1 macrophages. Similar results were obtained in HSV-1-infected HeLa, HFF and 293T cells (FIGS. 13A-13C). These results show that the deamidase activity of UL37 is critical for HSV-1 to evade RIG-I-mediated immune response.


Applicant then analyzed HSV-1 lytic replication and found that HSV-1 UL37-C819S produced ˜10% of virion progeny of HSV-1 UL37-WT in HFF (FIG. 7H). In HeLa cells, HSV-1 UL37-C819S was more impaired at 36 than at 24 hpi compared to HSV-1 UL37-WT with the MOI of 1, whereas the impaired replication phenotype of HSV-1 UL37-C819S was more pronounced at 12 than 24 hpi with the MOI of 0.1 (FIG. 13D). To determine whether the reduced replication of HSV-1 UL37-C819S is due to the elevated IFN response, Applicant characterized HSV-1 replication in Vero cells that are deficient in IFN induction. Compared to HSV-1 UL37-WT, HSV-1 UL37-C819S showed identical viral replication at 12 and 24 hpi (FIG. 13E). However, HSV-1 UL37-C819S produced ˜50% and 35% as many virion progeny as HSV-1 UL37-WT at 36 and 48 hpi, respectively. To further corroborate the roles of RIG-I in inhibiting HSV-1 replication, Applicant knocked down RIG-I and examined HSV-1 replication. As shown in FIG. 13F, RIG-I depletion restored the lytic replication HSV-1 UL37-C819S to levels of HSV-1 UL37-WT, at 12 and 24 hpi. However, RIG-I knockdown had no effect on the difference in lytic replication between HSV-1 UL37-WT and HSV-1 UL37-C819S, at 36 and 48 hpi. These results show that RIG-I-mediated antiviral activity suppresses HSV-1 lytic replication during early infection and that UL37 deamidase activity is important to antagonize RIG-I-mediated antiviral defense. Furthermore, UL37 deamidase activity is important for late stages of HSV-1 lytic replication.


Applicant previously reported that vGAT pseudo-enzymes of human KSHV and murine γHV68 recruited cellular PFAS to deamidate RIG-I and evade antiviral cytokine production (He et al., 2015). Interestingly, HSV-1 infection also induced RIG-I deamidation, despite the fact that genomes of alpha herpesviruses do not contain sequence homologues of vGAT proteins. Herein, Applicant identified UL37 as a viral deamidase that targets RIG-I for deamidation and inactivation, thereby preventing RIG-I from sensing viral dsRNA. To Applicant's knowledge, this is the first viral protein deamidase identified thus far. Previously reported protein deamidases contain either a cysteine-protease fold or a GAT domain (Cui et al., 2010; He et al., 2015; Sanada et al., 2012; Wang et al., 2009). UL37-mediated deamidation of RIG-I disarms downstream innate immune signaling, suggesting the critical, and likely more ubiquitous, roles of protein deamidation in signal transduction. UL37 is a large tegument protein that is implicated in viral trafficking, egress and innate immune regulation (Desai et al., 2001; Liu et al., 2008; Pitts et al., 2014). Taken together, UL37 inhibits the IRF-IFN branch of innate immune signaling through deamidation of RIG-I, while activating the NF-κB cascade, sharing functions similar to the gamma herpesvirus vGAT proteins.


Applicant's biochemical analyses show that UL37 is intrinsically a protein deamidase. UL37 and its carboxyl terminal fragment (571-1123) purified from E. coli were sufficient to deamidate RIG-I in vitro. Mutational analysis and electrophile reaction profiling of hyper-reactive cysteines identified C819 as the single residue critical for the deamidase activity, implying that C819 is the active cysteine of the catalytic triad of UL37. Interestingly, C850 is more conserved in alpha herpesviruses than C819 (data not shown). The fact that C850 is largely inaccessible suggests that it may be required for the structural integrity of the deamidase domain. It is unclear whether other UL37 homologs are deamidases. Future structural studies of the UL37 deamidase domain may define a new fold catalyzing protein deamidation and “visualize” the catalytic cysteine.


Although previous studies implicated RIG-I in sensing dsRNA produced by herpesviruses (da Silva and Jones, 2013; Jacquemont and Roizman, 1975; Rasmussen et al., 2009; Weber et al., 2006), Applicant's work provides further credence concerning the RIG-I-mediated immune defense against a model DNA virus and viral immune evasion thereof. HSV-1 infection prevents RIG-I activation and innate immune responses triggered by subsequent SeV infection. These phenotypes were recapitulated by UL37 expression, pointing to the key roles of UL37 in evading RIG-I activation by viral dsRNA. The deamidated RIG-I-DD (D495 and D549) mutant, failed to sense 5′ppp-RNA and SeV, which correlated with its inability to initiate host immune signaling and control VSV replication. Comparing HSV-1 replication kinetics in IFN-competent 293T and HeLa cells to that in IFN-deficient Vero cells, Applicant found that the deamidase activity of UL37 is critical in negating RIG-I-mediated inhibition of the early steps of HSV-1 lytic replication. The mutation abolishing UL37 deamidase activity, notably, also impaired HSV-1 replication during late stages of replication in an RIG-I-independent manner, implying the existence of other viral and cellular targets in addition to RIG-I. Nevertheless, uncoupling RIG-I deamidation from UL37, via either introducing the deamidation-resistant RIG-I-QQ into cells or engineering the C819S mutation of UL37 into the HSV-1 genome, restored RIG-I activation and downstream innate immune signaling, thereby reducing HSV-1 productive infection. These results unambiguously demonstrate the antiviral activity of RIG-I against a DNA herpesvirus and elucidate a new mechanism of viral immune evasion.


N495 and N549 reside in two α-helices that constitute the RNA-binding interface of the Hel2i domain. Interestingly, N549 forms hydrogen bonds with the backbone of T504 that ends the N495-containing α23 helix, providing a physical link between these two neighboring helices that are located immediately proximal to the RIG-I-bound dsRNA. These observations suggest that the two α-helices constitute a region responsible for regulating RNA-binding/sensing by RIG-I. The susceptibility of the hydrogen bonds between N549 and T504 to the deamidase activity of UL37 underpins the inactivation of RIG-I by HSV-1 infection. Remarkably, the N549Q mutation appears to conserve hydrogen bonds, and confers resistance to UL37-mediated deamidation, demonstrating the exquisite specificity of UL37-mediated deamidation. Deamidation of N495 and N549 within the Hel2i domain, unexpectedly, abolishes 5′ppp-RNA-binding and ATP hydrolysis of RIG-I, uncovering a simple but powerful mechanism to switch off RIG-I. Although the CTD of RIG-I is responsible for sensing viral dsRNA, emerging studies support the regulatory role of helicase domains in RNA-sensing by RIG-I. It was previously reported that Hel2i “measures” the length of dsRNA stem during RNA-binding by RIG-I (Kohlway et al., 2013). Structural analysis also highlighted the direct contact between Hel2i and dsRNA (Kowalinski et al., 2011; Luo et al., 2011). Moreover, mutations within a helicase domain reduced the ATPase activity of RIG-I, increased its association with cellular dsRNA and activated downstream signaling (Lassig et al., 2015). Together with these observations. Applicant's work further lends credence to the pivotal roles of Hel2i of RIG-I and site-specific deamidation thereof in interacting with and sensing viral dsRNA, suggesting more ubiquitous roles of protein deamidation in fundamental biological processes.


Two-Dimensional Gel Electrophoresis

Cells (1×106) were lysed in 150 μl rehydration buffer (8 M Urea, 2% CHAPS, 0.5% IPG Buffer, 0.002% bromophenol blue) by three pulses of sonication and whole cell lysates were centrifuged at 20,000 g for 15 min. Supernatants were loaded to IEF strips for focusing with a program comprising: 20 V, 10 h (rehydration); 100 V, 1 h; 500 V, 1 h; 1000 V, 1 h; 2000 V, 1 h; 4000 V, 1 h; 8000 V, 4 h. After IEF, strips were incubated with SDS equilibration buffer (50 mM Tris-HCl [pH8.8], 6 M urea, 30% glycerol, 2% SDS, 0.001% Bromophenol Blue) containing 10 mg/ml DTT for 15 min and then SDS equilibration buffer containing 2-iodoacetamide for 15 min. Strips were washed with SDS-PAGE buffer, resolved by SDS-PAGE, and analyzed by immunoblotting.


In Vitro Deamidation Assay

GST-RIG-I was purified from transfected 293T cells to homogeneity as determined by silver staining. In vitro on-column deamidation of RIG-I was performed as previously reported (He et al., 2015). Briefly, ˜0.2 μg of His-tagged UL37/UL37 (571-1123) expressed and purified from E. coli, and 0.6 μg of GST-RIG-I (bound to glutathione-conjugated agarose) were added to a total volume of 30 μl. The reaction was carried out at 30° C. for 45 min in deamidation buffer (50 mM Tris-HCl, pH 7.5, 100 mM NaCl, 5 mM MgCl2). Protein-bound GST beads were washed with deamidation buffer and GST-RIG-I was eluted with rehydration buffer (6 M Urea, 2 M Thio-urea, 2% CHAPS, 0.5% IPG Buffer, 0.002% bromophenol blue) at room temperature. Samples were then analyzed by two-dimensional gel electrophoresis and immunoblotting.


Constructing Recombinant HSV-1

Recombinant HSV-1 was engineered as previously described (Dong et al., 2010). Briefly, DNA fragments containing UL37 WT and C819S were amplified using overlapping primers. First round PCR products of ˜500 bp fragment upstream of UL37, UL37 open reading frames (WT and C819S) and ˜500 bp fragments downstream of UL37 were used as the template for second round PCR amplification. Purified PCR products of the second round, along with HSV-1 ΔUL37 (KOS) Bacmid, were transfected into 293T cells to generate recombinant HSV-1. The revertant (containing wild-type UL37, designated wild-type) and UL37-C819S mutant were plaque purified and validated by restriction digestion of viral genomic DNA and sequencing of the UL37 open reading frame.


RNA Electrophoresis Mobility Shift Assay (EMSA)

RNA EMSA was performed as previously described (Takahasi et al., 2008). 5′-ppp-dsRNA and control dsRNA were purchased from Invivogen and bottom strands were labeled with γ-[P32]ATP by T4 polynucleotide kinase (NEB). Purified RIG-I and RIG-I mutants were incubated with dsRNA at room temperature for 15 min. Binding buffer contains 20 mM Tris-HCl (pH=8.0), 1.5 mM MgCl2 and 1.5 mM DTT. Unlabeled ppp-dsRNA was used as competitor at 500-fold in excess. The reaction mixtures were run on 5% native polyacrylamide gels at a constant voltage of 200 V. Gels were dried and subjected to phosphorimaging.
















Labeled 5′-ppp-dsRNA
Top Strand
5′-ppp-GCAUGCGACCUCUGUUUGA-3′



Bottom Strand
3′-CGUACGCUGGAGACAAACU-5′-32P





Labeled 5′ control
Top Strand
5′-GCAUGCGACCUCUGUUUGA-3′


dsRNA
Bottom Strand
3′-CGUACGCUGGAGACAAACU-5′-32P









In Vitro ATPase Activity Assay

Purified RIG-I or RIG-I mutants were incubated with 5′-ppp-dsRNA (Invivogen) at 37° C. for 20 min in ATPase reaction buffer (50 mM Tris-HCl, pH 7.5, 2.5 mM MgCl2, and ATP). Released phosphates were measured using a PiColorLock™ phosphate detection reagent (Innova Biosciences). For reactions with varying concentrations of ATP, the concentrations of RIG-I proteins and RNA were 20 nM and 80 nM, respectively. For reactions with varying concentrations of the RNA, the concentrations of RIG-I proteins and ATP were 20 nM and 500 μM, respectively.


Experiment No. 2

Upon infection, eukaryotic cells immediately respond with innate immune activation to defeat the invading pathogens. Cyclic GMP-AMP (cGAMP) synthase (cGAS) is an essential cytosolic sensor that detects double-stranded (ds) DNA of microbial origin or aberrantly localized cellular DNA. Other DNA sensors, including AIM2, DAI, DDX41, RNA polymerase III, DNA-PK and IFI16, may play redundant roles in a tissue- or ligand-specific manner in detecting cytosolic dsDNA. Upon binding dsDNA, cGAS catalyzes the synthesis of cGAMP, which induces the dimerization and activation of the ER-anchored STING (also known as MITA). Within close proximity to the ER membrane, STING recruits TBK-1 and interferon regulatory factor 3 (IRF3) to assemble into a signaling complex that phosphorylates and activates IRF3. Along with NF-κB and AP-1, nuclear IRF3 potently up-regulates the gene expression of interferons (IFNs). IFNs, via autocrine and paracrine mechanisms, stimulate the expression of hundreds of genes, known as ISGs, which establish an immune defensive state of the cell. Parallel to the TBK-1-IRF3-IFN pathway, IKK kinase, consisting of IKKα, IKKβ and IKKγ (also known as NEMO), phosphorylates and induces the degradation of inhibitor of NF-κB (IκB). This enables NF-κB activation that induces the expression of inflammatory cytokines, such as interleukins and chemokines. The primary role of inflammatory cytokines is to attract professional immune cells to the site of infection. Thus, the innate immune system defends the host from infection via direct anti-microbial activities and enables the establishment of adaptive immunity in tissue local to the infection.


Though key steps of the cGAS-STING pathway are well established, the regulatory mechanisms governing cGAMP synthesis of cGAS to induce STING-dependent innate immune activation is not well understood. Studying viral immune evasion allows us to interrogate mechanisms regulating host immune responses. As one of the most successful pathogens, herpesviruses have evolved numerous intricate strategies to manipulate, evade and exploit host immune response to benefit their infection. The most common viral mechanism is to encode proteins that physically interact with central cellular signaling nodes of immune defense to derail host immune response. Viral proteins efficiently regulate cellular immune signal transduction by microbial enzyme-mediated reactions, such as proteolytic cleavage or post-translational modifications (PTMs). Virally encoded proteases cleave various adaptor molecules and effectively dampen innate immune signaling, while host cells often deploy reversible PTMs (phosphorylation, ubiquitination and sumoylation) to regulate immune response.


Protein deamidation is emerging as a key PTM that regulates immune responses against infecting microbes. First reported more than half a century ago, protein deamidation was regarded as a marker associated with protein “aging” or functional decay. Though initial studies focused on non-enzymatic protein deamidation, recent findings from bacterial effectors and mammalian cells imply that protein deamidation can be enzyme-catalyzed and thus highly regulated. Applicant has identified viral pseudo-enzymes and bona fide deamidases that target cellular innate immune RIG-I sensor to evade antiviral cytokine roduction. While gamma herpesvirus vGAT pseudoenzymes recruit cellular PFAS to deamidate RIG-I, the UL37 tegument protein of herpes simplex virus 1 (HSV-1) is a bona fide protein deamidase that deamidates RIG-I in vitro and in cells. While further characterizing the in vivo roles of UL37 deamidase in HSV-1 infection, Applicant discovered that UL37 antagonizes cGAS-mediated innate immune activation via deamidating cGAS. Moreover, HSV-1 carrying deamidase-deficient UL37 was highly attenuated, and more robustly induced innate and adaptive immune responses in mice than wild-type HSV-1. Vaccination with HSV-1 carrying deamidase-deficient UL37 protected mice from lethal dose challenge with wild-type HSV-1. These results imply that interfering with protein deamidation can boost antiviral immune responses and thwart viral infection.


Results

Applicant reports herein that the UL37 tegument protein of HSV-1 deamidates RIG-I to avoid dsRNA-induced innate immune activation. Recombinant HSV-1 carrying deamidase-deficient UL37C819S mutant (HSV-1 UL37C819S) more robustly induced antiviral cytokines than HSV-1 containing wild-type UL37 (HSV-1 UL37WT). To further characterize this recombinant HSV-1, Applicant examined antiviral immune responses in human THP-1 monocytes upon HSV-1 infection. Real-time PCR analysis of representative antiviral cytokines (IFNB1, ISG56, CXCL10, MX1, IFIT3 and IL6) indicated that HSV-1 UL37C819S virus induced ˜5-10-fold higher expression of cytokine genes than HSV-1 UL37 wild-type (WT) in THP-1 cells during early viral infection (FIG. 14A). Enzyme-linked immunosorbant assay (ELISA) further showed that THP-1 cells infected with HSV-1 UL37C819S virus secreted significantly higher IFN-β than those infected with HSV-1 UL37WT (FIG. 14B). Consistent with these results, HSV-1 UL37C819S virus more robustly induced activation of the IRF-IFN pathway than HSV-1 UL37WT virus in THP-1 cells, as evidenced by the elevated phosphorylation (and activation) of TBK-1 (Ser172) and IRF3 (Ser396) (FIG. 14C). These results clearly demonstrate that HSV-1 UL37C819S more robustly induces innate immune activation than HSV-1 UL37WT in human THP-1 monocytes.


cGAS is a crucial DNA sensor that detects cytosolic DNA of diverse human pathogens, including herpesviruses. Thus, Applicant assessed whether cGAS is required for effective antiviral immune responses against HSV-1 UL37C819S virus. Applicant infected wild-type and cGAS-deficient L929 fibroblasts with HSV-1 UL37WT and HSV-1 UL37C819S, and determined antiviral gene expression. In wild-type L929 fibroblasts, HSV-1 UL37C819S virus more robustly induced Isg56 and Cxcl10 expression than HSV-1 UL37WT virus, recapitulating the phenotype that was observed in human THP-1 monocytes. Remarkably, loss of cGAS abolished Isg56 and Cxcl10 expression in response to HSV-1 UL37WT and HSV-1 UL37C819S (FIG. 14D). Furthermore, similar levels of residual expression of Isg56 and Cxcl10 were detected in cGAS-deficient L929 cells infected by HSV-1 UL37WT and HSV-1 UL37C819S (FIG. 14D). These results indicate that induction of elevated antiviral cytokine expression by deamidase-deficient HSV-1 UL37C819S virus is dependent on cGAS.


To probe the effect of HSV-1 infection on the DNA-cGAS pathway, Applicant determined intracellular cGAMP concentrations using the THP-1/Lucia reporter cell line. Applicant applied known concentrations of cGAMP to establish a standard that demonstrated a high correlation between cGAMP concentration and luciferase activity with 0-30 ng/ml of cGAMP. Applicant determined that HSV-1 UL37WT induced approximately 3.5 ng of cGAMP per one million of THP-1 cells, while HSV-1 UL37C819S infection increased cGAMP production to ˜10.5 ng per one million of THP-1 cells (FIG. 14E). Enhanced activation of the DNA-cGAS pathway by HSV-1 UL37C819S than HSV-1 UL37WT is further supported by elevated levels of intracellular cGAMP, phosphorylated TBK-1 and IRF3, and the expression of antiviral cytokines.


UL37 Targets cGAS to Dampen Antiviral Cytokine Production


To determine whether UL37 is sufficient to inhibit cGAS-mediated innate immune responses, Applicant established a THP-1 cell line stably expressing UL37 by lentiviral transduction (FIG. 15A). When THP-1 cells were transfected with herring testis DNA (HT-DNA) that activates cGAS, Applicant found that UL37 expression reduced IFNB1 and ISG56 expression by ˜60% as analyzed by quantitative real-time PCR (FIG. 15B). A similar level of reduction in secreted IFN-β in THP-1 cells was also observed upon UL37 expression (FIG. 15C). Interestingly, the induction of IL6 and IL8 by HT-DNA transfection was not affected by UL3 expression. Moreover, CXCL10 and ISG56 expression induced by LPS was significantly increased by UL37 expression. This is likely due to the ability of UL37 to activate NF-κB, as evidenced by the slight elevation of CXCL10 and ISG56 in THP-1 cells expressing UL37 at baseline without stimulation. Upon sensing cytosolic DNA, cGAS catalyzes the synthesis of cGAMP, which subsequently activates the STING adaptor. To determine the mechanism of inhibition by UL37, Applicant assessed IFNB1 and ISG56 gene expression in THP-1 cells upon cGAMP transfection. Interestingly, UL37 expression did not significantly affect neither the expression of IFNB1 and ISG56 (FIG. 15D), nor the secretion of IFN-β, in THP-1 cells transfected with cGAMP. Applicant further examined the effect of UL37 on DNA-induced innate immune signaling in wild-type and cGAS-deficient L929 cells. UL37 expression reduced the mRNA levels of Ifnb1 and Isg56 in L929 cells transfected with HT-DNA. Consistent with previous reports, loss of cGAS abolished Ifnb1 and Isg56 gene expression induced by HT-DNA in L929 cells. UL37 expression in THP-1 cells inhibited the phsophorylation of TBK-1 and IRF3 induced by HT-DNA, but not those induced by cGAMP (FIG. 15E). Similar results were obtained in L929 cells. Interestingly, HT-DNA induced TBK-1 and IRF3 phosphorylation in cGAS-deficient L929 cells, which were not affected by UL37 expression. The cGAS-independent activation of TBK-1 and IRF3 by HT-DNA in L929 cells remains to be investigated. Similar to what was observed in THP-1 cells, UL37 expression didn't affect the induction of Ifnb1 and Isg56 by cGAMP in L929 wild type and cGAS-deficient cells. As previously reported, cGAMP-induced expression of Isg56 in cGAS-deficient L929 cells was significantly lower than that in wild-type L929 cells. These results collectively indicate that UL37 antagonizes cGAS to inhibit DNA-induced innate immune signaling.


Given that recombinant HSV-1 UL37C819S virus more robustly induced antiviral cytokine production in THP-1 cells (FIG. 14), Applicant sought to determine whether the deamidase activity of UL37 is necessary to suppress cGAS-mediated innate immune activation using the deamidase-deficient UL37C819S mutant. UL37WT potently reduced the expression of IFNB1, ISG56 and CXCL10 induced by HT-DNA in THP-1 cells, whereas UL37C819S mutant had no detectable effect on IFNB1 and ISG56 expression and increased CXCL10 expression (FIG. 15G). To test whether UL37 impacts the enzymatic activity of cGAS, Applicant determined intracellular cGAMP concentrations in THP-1 cells transfected with HT-DNA. This assay showed that HT-DNA induced approximately 60 ng of cGAMP per one million THP-1 cells (equivalent to 0.5 million molecules of cGAMP per cell), while UL37WT expression reduced cGAMP to ˜35 ng per one million THP-1 cells (FIG. 15H). The expression of UL37C819S slightly increased cGAMP production to ˜85 ng per one million THP-1 cells. These results collectively indicate that the deamidase activity of UL37 is required to suppress cGAMP synthesis catalyzed by cGAS.


UL37 Deamidates cGAS In Vitro and in Cells


UL37WT, but not the deamidase-deficient UL37C819S mutant, reduced cGAS-mediated cGAMP synthesis. Moreover, HSV-1 UL37C819S virus more robustly induced antiviral cytokines in THP-1 monocytes than HSV-1 UL37WT. These results imply that UL37 targets cGAS for deamidation. To test this hypothesis, Applicant first determined whether UL37 interacts with cGAS in HSV-1-infected cells. Using recombinant HSV-1 carrying FLAG-tagged UL37, Applicant demonstrated that cGAS precipitated with UL37 in HSV-1-infected THP-1 cells (FIG. 16A). Applicant noted that HSV-1 infection increased cGAS protein expression, consistent with established knowledge that cGAS is an interferon-inducible gene. Mutational analysis showed that the Mab21 enzyme domain (residue 162-522) of cGAS interacts with UL37, and both the N-terminal and C-terminal domains of UL37 are sufficient for binding cGAS.


To assess whether UL37 induces cGAS deamidation, Applicant analyzed the charge status of cGAS without or with UL37 expression by two-dimensional gel electrophoresis (2DGE). As shown in FIG. 16B, UL37 expression shifted cGAS toward the positive side of the gel strip, indicative of reduced charge due to potential deamidation upon UL37 expression. This directional shift of cGAS was recapitulated by HSV-1 infection (FIG. 16C). To identify sites of deamidation in cGAS, Applicant purified cGAS in 293T cells without or with UL37 expression and conducted tandem mass spectrometry (MS). Additionally, Applicant performed in vitro deamidation assays to augment the protein coverage of cGAS analyzed by tandem MS. MS analysis using purified cGAS deamidated in cells and in vitro identified a total of four sites of deamidation, all located within the Mab21 enzyme domain of cGAS: N196, N377, Q436 and Q439 (homologous to hcGAS N210, N389, Q451 and Q454) (FIG. 16D). Quantification of deamidated peptides indicated that UL37 expression increased the deamidation of these four sites by ˜2-3-fold (FIG. 16E). To validate the deamidation sites of cGAS, Applicant generated a deamidated mutant of all four deamidated residues, designated cGAS-DDEE. UL37 expression shifted cGAS-WT toward the positive side of the strip and to the position that was identical to that of cGAS-DDEE (FIG. 16F). Expression of UL37 did not further shift cGAS-DDEE, implying that there are no other deamidation sites in addition to the four identified. There was a residual amount of cGAS-WT and cGAS-DDEE that was not shifted by UL37 expression; this species may represent cGAS with other PTMs that counteract the charge change of deamidation.


Applicant has previously shown herein that UL37 is a bona fide protein deamidase of RIG-I. Thus, Applicant sought to determine whether UL37 is sufficient to deamidate cGAS in vitro. Applicant purified cGAS, UL37WT and UL37C819S mutant from bacteria to high homogeneity (FIG. 16G). There were two major species and one minor species of the purified cGAS as analyzed by 2DGE, likely due to deamidation or other modifications. Analysis of in vitro cGAS deamidation reactions by 2DGE showed that UL37WT shifted cGAS toward the positive end of the gel strip to a position of the deamidated cGAS-DDEE mutant, while the deamidase-deficient UL37C819S failed to do so (FIG. 16G). Again, UL37WT failed to further shift the deamidated cGAS-DDEE mutant. Taken together, UL37 is a bona fide protein deamidase that deamidates cGAS in cells and in vitro.


Deamidation Impairs the cGAMP Synthase Activity of cGAS


To probe the role of protein deamidation in cGAS-mediated antiviral immune response, Applicant first performed reporter assays to analyze the ability of various deamidated cGAS mutants in activating the IFN-β and NF-κB promoters. Applicant also has also generated mutations of all N and Q residues that are conserved within the Mab21 enzyme domain of human and mouse cGAS for these reporter assays. These reporter assays showed that N210D reduced cGAS-mediated gene expression by 50%, while the other three deamidations had marginal effects. The other deamidated residues did not significantly impair cGAS to activate the IFN-β promoter. However, combining the three mutations in NQQ389,451,454DEE modestly reduced cGAS-induced gene expression. When all four deamidated residues were introduced into cGAS, Applicant found that the cGAS-DDEE mutant failed to activate the IFN-β and NF-κB promoters by reporter assay (FIG. 17A). Thus, these deamidations negatively regulate cGAS-induced innate immune activation.


All four deamidation sites, N210, N389, Q451 and Q454, are conserved between mouse and human cGAS. These four sites correspond to N196, N377, Q436 and Q439 of mouse cGAS (mcGAS). Previous structural studies revealed an active site of mcGAS that catalyzes the synthesis of cGAMP, consisting of two parallel β-sheets (32 and 37, PDB: 4K9B) (FIG. 17B) that provide E211, D213 and D307 to form a catalytic triad. Additionally, 31 and 36 sheets sandwich the two core β-sheets. Linking the 31 sheet and the activation loop, N196 also is proximal to a hydrophobic pocket.


Interestingly, this hydrophobic pocket is formed by residues from core β-sheets (F212, V214 and F216 of β2, V306 and 1308 of 07) and a neighboring α-helix (V171, L175 and L179 of α2). In the cGAS structure bound to dsDNA, N196 lies between the hydrophobic cluster and the backbone of the dsDNA†(PDB:4K9B). Moreover, structural analysis by others show that, similar to other nonpolar residues with small side chain, N196 (or N210 of hcGAS) confers flexibility to the activation loop of cGAS. Thus, deamidation of N196 of mcGAS is expected to impinge on the nearby hydrophobic cluster and the flexibility of the activation loop that collectively enable the proper coordination of the catalytic triad.


This is supported by the structure wherein N196 is close to the catalytic residue D213. In fact, the three catalytically residues E211, D213 and D307 form a highly negatively charged spot on protein, whose structure and physical chemical properties are likely very sensitive to alternation of nearby electrostatic potential induced by the damindation of N196. Applicant therefore assessed the effect of N196 deamidation on the enzyme activity of cGAS. As shown in FIG. 17C, HT-DNA stimulated cGAMP synthesis catalyzed by cGAS, but had no significant effect on cGAMP synthesis catalyzed by hcGAS-N196D or cGAS-DDEE. Furthermore addition of UL37 to the cGAMP reaction greatly reduced cGAS-mediated cGAMP production, while the deamidase-deficient UL37C819S mutant had less, albeit still significant, inhibition of cGAMP production. These results collectively show that UL37-mediated deamidation inhibits the cGAMP synthase activity of cGAS.


Structural analyses also indicate that the side chain of N376 and N377 of mcGAS (corresponding to N388 and N389 of hcGAS) project toward the minor groove of the dsDNA helix, suggesting that deamidation of these residues potentially interferes cGAS ability to sense dsDNA. However, precipitation of biotinylated interferon-stimulating DNA (ISD) demonstrated that neither UL37-WT, nor UL37C819S diminished cGAS co-precipitated with ISD. In fact, UL37WT, but not UL37C819S, increased the interaction between cGAS and ISD by ˜50%. Similar results were recapitulated with the deamidated cGAS-DDEE mutant, which demonstrated slightly enhanced interaction with ISD. Given that all four deamidation sites reside in regions proximal to the dimerization interface of cGAS, Applicant sought to determine whether UL37 influences cGAS self-dimerization. Co-IP assay showed that expression of UL37WT or UL37C819S did not alter cGAS dimerization. Taken together, these results suggest that UL37-mediated deamidation does not impair either the dsDNA-binding or dimerization of cGAS.


Deamidated cGAS Fails to Activate Innate Immune Signaling and Restrict DNA Virus Replication


To probe the role of deamidation in regulating cGAS-mediated immune signaling and restricting viral replication, Applicant “reconstituted” cGAS-deficient L929 cells with cGAS wild-type and the deamidated cGAS-DDEE mutant (FIG. 17D). In response to HD-DNA transfection, L929 cells “reconstituted” with cGAS wild-type up-regulated the expression of Ifnb, Isg56, Cxcl10 and Ifit3 genes (FIG. 17E). In contrast, L929 cells “reconstituted” with the deamidated cGAS-DDEE mutant failed to induce the expression of these innate immune genes upon HD-DNA transfection. Applicant further “reconstituted” cGAS-deficient L929 cells with cGAS carrying individual deamidated residues. When these cells were transfected with HT-DNA, Applicant found that cytokine gene expression was significantly and most reduced in L929 cells “reconstituted” with cGAS-D210 compared to L929 “reconstituted” with cGAS wild-type. D389 and E454 also consistently reduced cGAS-dependent expression of cytokines, including Ifnb1 and Cxcl10. E451 had a minor effect on the expression of Ifnb 1, but had no effect on other cytokines. These results collectively show that the deamidation of N210, N389 and N454 reduces the activity of cGAS in innate immune signaling.


When infected with HSV-1 UL37WT or HSV-1 UL37C819S virus, L929 cells “reconstituted” with cGAS wild-type up-regulated the expression of inflammatory genes as potent as wild-type L929 cells. L929 cells “reconstituted” with cGAS-DDEE essentially behaved like cGAS-deficient L929 cells, demonstrating no induction of immune gene expression in response to HSV-1 infection (FIG. 17F). Applicant further tested the ability of cGAS wild-type and cGAS-DDEE in restricting DNA virus replication. Applicant found that “reconstituted” expression of cGAS wild-type reduced the replication of both HSV-1 and murine gamma herpesvirus 68 (MHV68) by >95% (FIGS. 4G and 4H). The expression of cGAS-DDEE had no effect on HSV-1 and MHV68 replication in L929 cells compared to cGAS-deficient L929 cells. These results demonstrate that deamidated cGAS fails to induce innate immune activation and to restrict DNA virus replication.


HSV-1 UL37C819S Virus More Robustly Induces Innate Immune Activation

To characterize the in vivo function of the deamidase activity of UL37, Applicant infected mice with HSV-1 UL37WT and HSV-1 UL37C819S virus. At 8 hours post-infection, HSV-1 UL37C819S virus induced ˜2-5-fold more cytokines in the sera of infected mice than HSV-1 UL37WT (FIG. 18A). For example, IFN-α and IFN-β were increased by 5- and 3.5-fold, respectively, in mice infected with HSV-1 UL37C819S than those infected with HSV-1 UL37WT. Conversely, the viral load of HSV-1 UL37C819S in the brain was reduced by >97% compared to that of HSV-1 UL37WT, as assessed by real-time PCR of viral genome copy number (FIG. 18B). When BL6 mice were inoculated with high dose (5×107 PFU) of HSV-1 UL37WT or HSV-1 UL37C819S virus, approximately 50% mice succumbed to HSV-1 UL37WT infection, while none of the mice infected with HSV-1 UL37C819S died or otherwise demonstrated apparent disease (FIG. 18C). These results show that HSV-1 carrying the deamidase-deficient UL37 is highly attenuated, while more robustly inducing innate immune responses in mice.


Previous studies have implicated the cGAS-STING pathway in promoting adaptive immune responses. Thus, Applicant tested whether the increased innate immune activation by HSV-1 UL37C819S virus translated into enhanced adaptive immunity. To quantify T cell immunity, Applicant analyzed virus-specific CD8+ T cells by tetramer staining against the most abundant epitope of glycoprotein B (gB, 498-505, SSIEFARL). This analysis showed that both HSV-1 UL37WT and HSV-1 UL37C819S induced similar CD8+ T cell response kinetics, peaking at 6 days post-infection (dpi) (FIG. 18D). At 6 dpi, HSV-1 UL37WT and HSV-1 UL37C819S induced ˜3.5% and 4.5% gB-specific CD8+ T cells, respectively (FIG. 18D). Additionally, when antibody against HSV-1 was quantified using whole virion-coated plates, Applicant found that HSV-1 UL37C819S virus induced as much 170% and 300% of HSV-1-specific antibody as HSV-1 UL37WT did at 14 and 20 dpi, respectively (FIG. 18E). These results indicate that HSV-1 containing the deamidase-deficient UL37C819S more robustly induces adaptive immunity, as evidenced by increased virus-specific CD8+ T cell response and antibody production.


To determine whether the elevated virulence of HSV-1 UL37WT is dependent on its ability to evade cGAS-mediated innate immune activation, Applicant analyzed the pathogenesis of HSV-1 UL37WT and HSV-1 UL37C819S in mice deficient in cGAS or STING. Mice deficient in cGAS or STING were highly susceptible to HSV-1 infection, demonstrating 100% lethality by 11 dpi. Importantly, cGAS-deficient mice infected with HSV-1 UL37C819S succumbed to death as rapidly as those infected with HSV-1 UL37WT (FIG. 18F). Consistent with this, HSV-1 UL37WT and HSV-1 UL37C819S mutant induced similar levels of inflammatory cytokines in the sera of cGAS-deficient mice. The concentration of these cytokines in the sera was dramatically lower than those in wild-type mice infected with either HSV-1 UL37WT or HSV-1 UL37C819S (FIG. 18A), supporting the conclusion that cGAS is critical for immediate innate immune responses against DNA viruses. Furthermore, viral loads in the brain of cGAS-deficient mice were similar to mice infected with HSV-1 UL37WT or HSV-1 UL37C819S. In mice deficient in STING, infection of HSV-1 UL37WT and HSV-1 UL37C819S resulted in mouse lethality of similar kinetics and serum levels of inflammatory cytokines as cGAS-deficient mice (FIG. 18G). Viral loads in the brain of STING-deficient mice infected with HSV-1 UL37WT and HSV-1 UL37C819S were identical as determined by plaque assay. Thus, mouse deficient in STING recapitulate phenotypes of cGAS knockout mice, when infected with HSV-1 UL37WT and HSV-1 UL37C819S. Taken together, these results show that the UL37 deamidase antagonizes the cGAS- and STING-mediated innate immune response in vivo.


Immunization with HSV-1 UL37C819S Protects Mice from HSV-1 Lethal Dose


Considering that HSV-1 UL37C819S more robustly induces immune responses and is highly attenuated in mice, Applicant explored the possibility that immunization with HSV-1 UL37C819S protects mice from pathogenesis induced by wild-type HSV-1 infection. For this experiment, Applicant used BALB/c mice, which are more susceptible to HSV-1 infection than BL/6 mice. After two rounds of HSV-1 UL37C819S infection at an interval of two weeks (FIG. 19A), Applicant challenged mice with a lethal dose of HSV-1 wild-type (5×106 PFU) via intravenous injection. Naïve mice all succumbed to HSV-1 infection by 8 dpi. All mice immunized with HSV-1 UL37C819S survived the challenge (FIG. 19B). Additionally, naïve mice demonstrated significant weight loss that peaked at ˜20% reduction when mice were euthanized at 5 dpi, while vaccinated mice had a decrease of ˜5% in body weight at 1 and 2 dpi, and quickly recovered to baseline body weight by 4 dpi (FIG. 19C). These results show that immunization with the deamidase-deficient HSV-1 UL37C819S potently protects mice from lethal challenge of wild-type HSV-1 infection.


To further characterize the pathology of HSV-1 infection, Applicant analyzed the brain of mice infected with HSV-1. Haematoxylin & Eosin (H&E) staining showed a significant fraction of cells had apparent morphology changes only in mice immunized with PBS and challenged with wild-type HSV-1 (FIG. 19D). A remarkable phenotype of the brain tissue of infected mice is the stark contrast of dark nuclear staining by H&E and the relatively small cell body compared to neighboring cells (FIG. 19D). The heavy stain by H&E and reduced cell size are likely due to the massive accumulation of viral proteins and nucleic acids from HSV-1 replication. Immunohistochemistry (IHC) staining with an antibody against UL37, a lytic protein, revealed a pattern similar to the morphological change revealed by H&E staining (FIG. 19F). Staining of the horizontal sections showed UL37-positive cells in the cortex, hippocampus and cerebellum. In the hippocampus, UL37-positive cells concentrated in CA3 and dentate gyrus (DG) regions. Consistent with the morphological analysis, IHC staining identified ˜35% brain cells expressing UL37 antigens (FIG. 19G). Immunization with HSV-1 UL37C819S mutant virus significantly reduced the number of UL37-positive cells similar to mock-infected mice (FIG. 19G). UL37-positive cells were observed in Purkinjie neurons that line the cerebellum (S6 Da), and were sporadically distributed in the cortex and stratum regions. IHC staining using anti-Vhs serum revealed significantly more viral replication in the brain of naïve mice than that of mice immunized with HSV-1 UL37C819S. Importantly, although low levels of HSV-1 replication were detected in mice immunized with HSV-1 UL37C819S, these mice were healthy and showed no diseased behavior. IHC staining with antibody against NeuN (a marker for neurons) and GFAP (a marker for astrocytes) showed that these HSV-1-infected cells were neuronal cells (FIG. 19H). Close inspection revealed that many neuron cells in the DG region of PBS-immunized mice were significantly smaller than that in HSV-1 UL37C819S immunized mice, after lethal dose challenge of wild-type HSV-1 (FIG. 19H). Comparing the IHC staining against NeuN to that against UL37 suggests that these smaller neurons are infected with HSV-1. These results collectively show that immunization with HSV-1 UL37C819S virus potently protects mice from acute HSV-1 infection and its pathogenesis.


Discussion

As innate immunity is essential to defeating pathogen infection, pathogens have evolved diverse mechanisms to evade host defense, providing a physiological system to examine host immune regulation. Employing HSV-1 for monocyte and mouse infection, Applicant discovered that the UL37 tegument protein of HSV-1 deamidates cGAS to abrogate its cGAMP synthesis activity, without diminishing the DNA-binding or dimerization. Site-specific deamidation of all four amide-containing residues distributed throughout the relatively large enzyme domain reveals an exquisite specificity of deamidation on the enzyme activity of cGAS. The physiological role of the deamidase activity of UL37 in counteracting cGAS-mediated immune defense is substantiated by significantly elevated levels of inflammatory cytokines in THP-1 monocytes and mice infected with the deamidase-deficient HSV-1 UL37C819S than those infected with HSV-1 UL37WT. Applicant further showed that elevated antiviral cytokines translated into more robust adaptive immunity against HSV-1 in mice, including CD8+ cytotoxic T cell response and serum antibody. These findings agree with a previous report that cGAMP and activation of cGAS-mediated innate immune signaling play an adjuvant role in immunization. In support of this conclusion, immunization with the highly inflammatory deamidase-deficient HSV-1 UL37C819S that had attenuated replication in vivo protected mice from challenge with lethal dose of wild-type HSV-1, representing a new vaccine candidate.


Applicant has shown that UL37 deamidates RIG-I to prevent dsRNA-induced activation. This work identifies cGAS as an additional target of UL37 in HSV-1-infected cells. In cGAS- and STING-deficient mice infected with HSV-1, Applicant found that the deamidase-deficient HSV-1 UL37C819S virus was as pathogenic as wild-type HSV-1, as measured by survival rates of mice infected with HSV-1 UL37WT and HSV-1 UL37C819S. These results clearly support the crucial role of UL37 in antagonizing the cGAS-STING pathway, but do not address the role of UL37-mediated RIG-I deamidation, previously shown to diminish antiviral cytokine production, in host defense against HSV-1 infection in mice. The identical pathogenesis of HSV-1 UL37WT and HSV-1 UL37C819S virus in mice deficient in cGAS or STING suggests that UL37 fails to antagonize mouse RIG-I in vivo. N495 of human RIG-I is not conserved in mouse, so it is possible that mouse RIG-I is resistant to UL37-mediated deamidation and inhibition. Although the roles of RIG-I in HSV-1 infection in vivo remain undefined, RIG-I is possibly important for innate immune defense against HSV-1 in cell types with limited or minimal cGAS expression, such as epithelial cells and keratinocytes. Previous studies demonstrating the antiviral activities of RIG-I against various herpesviruses primarily used mouse fibroblasts or human cells deficient in RIG-I.


Remarkably, all four cGAS deamidation sites impinge on cGAMP synthesis activity despite being located within three structurally distinct surfaces of cGAS. Two structural studies highlighted the importance of the N210 of hcGAS (or N196 of mcGAS) in regulating cGAS enzymatic activity. Specifically, others showed that N210 is located within the first half of the so-called activation loop. The sequence of this short loop features residues that have small and non-charged side chains. Additional mutational and functional analysis of G211 and S212 of hcGAS in this structural study demonstrated that the flexibility of the activation loop underpins the conformational change and subsequent coordination of the catalytic triad of cGAS upon DNA-binding and dimer formation. Thus, deamidation of N210 of hcGAS is expected to compromise the free rotation of the activation loop and proper formation of the catalytic triad.


Surprisingly, collective deamidation of N389, Q451 and Q454 reduced cGAMP synthesis, but not DNA-binding and dimerization, of cGAS. N389 and N388 lie at the center of the dsDNA-binding surface of cGAS and directly point to the minor groove of dsDNA. Deamidation of N389, and more so that of N388, are expected to diminish the DNA-binding ability of cGAS. However, Applicant's reporter assay showed that N388D and N389D mutations had no detectable effect on the ability of cGAS to activate the IFN-β promoter. Moreover, UL37WT expression and the deamidated cGAS mutant (cGAS-DDEE) appeared to slightly increase the DNA-binding of cGAS. Q451 and Q454 reside in a short α-helical structure that forms the front edge of the butterfly-shaped cGAS dimer. The expression of UL37WT and UL37C819S mutant had no detectable effect on cGAS dimer formation upon HT-DNA transfection. These results indicate that deamidation of cGAS does not impair the DNA-binding and dimerization of cGAS upon sensing dsDNA. On the other hand, cGAS-deficient L929 cells “reconstituted” with cGAS mutants harboring single deamidated residues, demonstrated lower activity to induce Ifnb 1 expression in response to transfected HT-DNA, suggesting that these sites are important for cGAS signaling. Thus, Applicant's reported methods are perhaps not sufficiently sensitive to accurately quantify the dsDNA-cGAS interaction, especially given the observation that the DNA-binding of cGAS appears to be of low affinity. If indeed the deamidation of these Gin and Asn residues do not impair the dimerization and DNA-binding of cGAS, it is possible that the deamidated surfaces of monomeric or dimeric cGAS serve as binding sites for cellular factors that regulate cGAS enzymatic activity. For example, cGAS sensing of HIV DNA requires the PQBP1 cofactor for innate immune activation. Whether deamidation impacts cGAS interaction with its cofactors remains to be determined. Nevertheless, the conformational changes induced by deamidation of these residues likely impact the active site and reduce cGAMP synthesis by cGAS. The molecular details of how these deamidations affect cGAS enzyme activity calls for further investigation. It is clear that these deamidations dampen the cGAMP synthesis activity of cGAS with explicit specificity.


cGAS is a cytosolic DNA sensor crucial for innate immune defense and aberrant activation of cGAS can lead to autoimmune diseases. Thus, the cGAS activity is tightly regulated. PTMs, such as phosphorylation, glutamylation, sumoylation and ubiquitination, play important roles in regulating the activity of cGAS. Phosphorylation of hcGAS S305 by AKT potently inhibits the enzymatic activity of cGAS. Glutamylation of cGAS by the enzymes TTLL6 and TTLL4 dampens the DNA-binding and synthase activity of cGAS, while the removal of glutamylation by CCP5 and CCP6 enhances cGAS activity.


Similarly, TRIM38 targets cGAS for sumoylation to prevent its polyubiquitination and degradation that is facilitated by SENP2-mediated desumoylation. Interestingly, sumoylation of cGAS at different residues suppresses its DNA-binding, oligomerization and synthase activities, and desumoylation by SENP7 increases cGAS activity. Thus, the activity of cGAS is dynamically regulated by sumoylation and desumoylation during infection. In this study, Applicant provides evidence that the activity of cGAS can be modified through deamidation, adding another PTM to the dynamic and complex regulation of cGAS.


Due to its core function in innate immune response against microbes, cGAS is often targeted by diverse pathogens to prevent innate immune activation. Human kaposi's sarcoma-associated herpesvirus (KSHV) notably deploys three distinct molecules, ORF52, LANA (ORF73) and vIRF1 (K9), to disable cGAS and its downstream signaling. The E7 oncogene of human papillomavirus and E1A of adenovirus utilize a common L×C×E motif to antagonize the DNA-sensing of cGAS. Recently, the protease cofactor NS2B of Dengue virus was shown to promote the lysosomal degradation of cGAS, thereby suppressing the induction of type I interferon production in infected cells. Applicant found that HSV-1 UL37 tegument protein deamidates cGAS to block cGAMP synthesis, revealing an efficient means of antagonizing the cGAS-STING pathway. Applicant and others have reported that herpesviruses and bacteria deploy deamidation to modify key signaling molecules to manipulate host immune responses.


In conclusion, Applicant has identified multiple sites of deamidation within cGAS targeted by HSV-1 UL37 deamidase. Deamidation of cGAS specifically ablates the cGAMP synthesis activity of cGAS. HSV-1 containing the deamidase-deficient UL37C819S is highly attenuated in mice and more robustly induces antiviral cytokines. Immunization with the deamidase-deficient HSV-1 virus potently protects mice from lethal dose challenge of HSV-1 wild-type. Collectively, these studies provide evidence that deamidation modifies protein function.


Experiment No. 3

The genetic data disclosed in this experiment shows that the cGAS and STING pathway is the primary target of UL37. Moreover, recombinant HSV-1 containing the deamidase-deficient UL37C819S is highly attenuated in mice, but induced much more robust antiviral immune response (innate arm and adaptive as well). Thus, these proteins are shown to be effective as a prophylactic vaccine. Indeed, mice vaccinated with the recombinant HSV-1 containing the deaminase-deficient UL37C819S potently protected mice from lethal dose infection of wild-type HSV-1.


Antibodies and Reagents

Commercially available antibodies used for this study include mouse monoclonal FLAG M2 antibody (Sigma), mouse monoclonal V5 antibody and β-actin antibody (Abcam), Phospho-TBK1 (Ser172) antibody, Phospho-IRF3 (Ser396) antibody and cGAS (D1D3G) antibody (Cell signaling), TBK1 antibody (Bethyl), His-probe antibody (H-3) and IRF3 antibody (FL-425) (Santa cruz), APC rat anti-mouse CD8a and PE hamster anti-mouse CD3ε (BD Biosciences).


Major histocompatibility complex (MHC)/peptide tetramers for HSV-1 gB 498-505/Kb (SSIEFARL) conjugated to PE were obtained from the NIH Tetramer Core Facility (Emory University, Atlanta, Ga.).


HSV-1 UL37 and VHS polyclonal antibodies were generated by repeatedly immunizing rabbit with purified proteins.


HT-DNA and LPS (Sigma-Aldrich), 2′, 3′-cGAMP (InvivoGen), streptavidin agarose (Thermo Fisher), Amylose Resin (New England Biolabs), Ni-NTA His-Bind Resin (Novagen), TEV protease (Invitrogen).


Biotin labeled ISD-45 DNA was ordered from IDT. [α-P32]-ATP was ordered from Perkin Elmer. Lipofectamine 2000 was purchased from Invitrogen.


Cells, Viruses, Mice and Viral Infections

THP1-Lucia ISG reporter cells (InvivoGen) was kindly provided by Dr. Fanxiu Zhu (Florida State University). L929 and L929 cGAS knockout cells were provided by Dr. Fanxiu Zhu.


MHV68 virus was propagated in BHK21 cells as previously described. HSV-1 WT and UL37 C819S recombinant viruses were propagated using VERO cells. cGAS knockout mice and BALB/c mice were purchased from the Jackson laboratory. STING knockout mice were provided by Dr. Jae Jung (the University of California). Six to eight-week old, gender-matched mice were used for all experiments. All animal work was performed under strict accordance with the recommendations in the Guide for the Care and Use of Laboratory Animals of the National Institutes of Health. The protocol was approved by the Institutional Animal Care and Use Committee (IACUC) of the University of Southern California.


For HSV-1 infection, BL/6, cGAS KO or STING KO mice were infected with 5×107 PFU of virus via intraperitoneal injection for survival curve analysis. For tetramer staining and antibody measurement, HSV-1 was reduced to 106 PFU per mouse.


For BALB/c mice, the mice were immunized with HSV-1 UL37 C819S virus (106) twice with an interval of two weeks and then challenged with HSV-1 (5×106) via intraperitoneal infection.


RNA Extraction and qRT-PCR


THP-1 or L929 cells were infected with HSV-1 (MOI=5) or stimulated with HT-DNA (2 μg/ml) or cGAMP (2 μg/ml) for 6 h unless specifically indicated otherwise. Cells were washed with cold PBS, and total RNA was extracted by using TRIzol Reagent (Invitrogen). RNA was digested with DNase I (New England Biolabs) to remove genomic DNA. One microgram of total RNA was used for reverse transcription with PrimeScript Reverse Transcriptase (Clontech) according to the manufacturer's instruction. Approximately 0.5% of the cDNA was used as template in each quantitative real-time PCR (qRT-PCR) reaction with SYBR master mix (Applied biosystems).


Q-PCR Primers Used in the Study:














Q-PCR primers for human genes












IFNB1
CTTTCGAAGCCTTTGCTCT
CAGGAGAGCAATTTGGAGGG



G
A





ISG56
TCTCAGAGGAGCCTGGCT
TGACATCTCAATTGCTCCAG



AA






CXCL10
CACCATGAATCAAACTGC
GCTGATGCAGGTACAGCGT



GA






IL6
AGTGAGGAACAAGCCAG
GTCAGGGGTGGTTATTGCA



AGC
T





IL8
TCCTGATTTCTGCAGCTCT
AAATTTGGGGTGGAAAGGT



GT 
T





MX1
AGCTCGGCAACAGACTCT 
GATGATCAAAGGGATGTGG



TC
C





IFIT3
TCGGAACAGCAGAGACAC
AAGTTCCAGGTGAAATGGC



AG
A





ACTB 
GTTGTCGACGACGAGCG
GCACAGAGCCTCGCCTT


(β-actin)










Q-PCR primers for mouse genes












Ifnb1
CCCTATGGAGATGACGGA
CCCAGTGCTGGAGAAATTG



GA
T





Isg56
CAAGGCAGGTTTCTGAGG
GACCTGGTCACCATCAGCA



AG
T





Cxcl10
CTCATCCTGCTGGGTCTG
CCTATGGCCCTCATTCTCAC



AG






Actb 
TCTACGAGGGCTATGCTC
TCTTTGATGTCACGCCGAT


(β-actin)
TCC
TTC









Luciferase Reporter Assay

HEK293T cells in 24-well plates were transfected with a reporter plasmid mixture that contained 50 ng of the plasmid expressing IFN-β or NF-κB firefly luciferase reporter and 20 ng of the plasmid expressing TK-renilla luciferase reporter. At 30 h post-transfection, cells were harvested and cell lysates were prepared. Cell lysates were used for dual luciferase assay according to the manufacturer's instruction (Promega).


Immunoprecipitation

Immunoprecipitation was carried out as described previously. Briefly, THP-1 cells were infected with HSV-1 FLAG-UL37 recombinant virus (MOI=0.5) for 16 h. The cells were harvested and lysed with NP40 buffer (50 mM Tris-HCl [pH 7.4], 150 mM NaCl, 1% NP-40, 1 mM EDTA, 5% glycerol) supplemented with a protease inhibitor cocktail (Roche). Centrifuged cell lysates were pre-cleared with Sepharose 4B beads and incubated with FLAG-agarose at 4° C. for 4 h. The agarose beads were washed three times with lysis buffer and precipitated proteins were released by boiling with 1×SDS sample buffer at 95° C. for 5 min. The resolved samples were applied to immunoblot analysis.


Protein Purification

Mouse MBP-cGAS fusion protein (151-522) was expressed in BL21 (DE3) and the bacteria were grown at 37° C. to an OD600 of 0.6. Then the cultures were cooled to 18° C. and protein expression was induced by adding 0.1 mM Isopropyl b-D-1-thiogalactopyranoside (IPTG) for 16 h. Cells were collected by centrifugation and lysed with lysis buffer (20 mM Tris-HCl [pH 7.4], 200 mM NaCl, 10% glycerol, 0.5% Triton X-100, 0.2 mg/ml lysozyme supplemented with protease inhibitor cocktail). Clarified lysates were mixed with amylose resin and incubated for 2 h at 4° C. The resin was washed extensively with lysis buffer and the recombinant proteins were eluted by 10 mM maltose.


For Mass spectrometry analysis, purified MBP-mcGAS was digested with TEV protease overnight at 4° C. and MBP proteins were depleted by incubation with Ni-NTA agarose.


UL37 or UL37C819S were purified as previously described. Briefly, HEK293T cells were transiently transfected with a plasmid containing UL37 or UL37C819S, harvested and lysed with lysis buffer (20 mM Tris (pH 7.4), 150 mM NaCl, 10% (vol/vol) glycerol, 0.5% Triton X-100, and 0.5 mM DTT) supplemented with a protease inhibitor cocktail (Roche), and lysates were precipitated with 20 μL of FLAG M2-conjugated agarose (Sigma). After extensive washing with lysis buffer, precipitated proteins was eluted with FLAG peptide (100 μg/ml) and used for in vitro deamidation assay.


Two-Dimensional Gel Electrophoresis

Cells or in vitro deamidation samples were resolved in 150 μl rehydration buffer (8 M urea, 2% CHAPS, 0.5% IPG buffer, and 0.002% bromophenol blue), and then the lysates were centrifuged at 20,000 g for 10 min. Supernatants were loaded to IEF strips with a program comprising 20 V, 10 hr; 100 V, 1 hr; 500 V, 1 hr; 1,000 V, 1 hr; 2,000 V, 1 hr; 4,000 V, 1 hr; and 8,000 V, 4 hr. Then strips were incubated with SDS equilibration buffer (50 mM Tris-HCl [pH 8.8], 6M urea, 30% glycerol, 2% SDS, 0.001% bromophenol blue) containing 10 mg/ml DTT for 15 min and SDS equilibration buffer containing 2-iodoacetamide for 15 min. Strips were washed with SDS-PAGE buffer, resolved by SDS-PAGE, and analyzed by immunoblotting.


In Vitro Deamidation Assay

The in vitro deamidation assay was performed as previously described. Briefly, 5 μg of purified MBP-mcGAS or MBP-mcGAS-DDEE mutant on amylose resin was incubated with 0.5 gig of purified FLAG-UL37 or FLAG-UL37C819S at 30° C. for 45 min in deamidation buffer (50 mM Tris-HCl [pH 7.5], 100 mM NaCl, and 5 mM MgCl2). Then the reaction was stopped by adding rehydration buffer and the eluted proteins were analyzed by two-dimensional gel electrophoresis.


cGAMP Reporter Assay


THP-1 Cells were transfected with HT-DNA (2 μg/ml) or infected with HSV-1 virus for 6 h. Cell extracts were prepared by heating at 95° C. for 5 min to denature most proteins, and then the precipitated proteins were removed by centrifugation. The supernatant containing cGAMP was delivered to digitonin-permeabilized (2.5 μg/ml for 30 min) THP1-Lucia cells at 37° C. for 30 min. The cells were cultured for another 20 h before Lucia reporter activity was measured according to the manufacturer's instruction (InvivoGen). Pure cGAMP was diluted and used as standard for the assay.


In Vitro cGAMP Activity Assay


1 μM of MBP-cGAS or mutant proteins was mixed with 100 μM ATP and 100 μM GTP and 10 μCi [α-P32]-ATP in reaction buffer (20 mM Tris-Cl [pH 7.5], 150 mM NaCl, 5 mM MgCl2, 1 mM dithiothreitol [DTT]). After 2 h of incubation at 37° C., 2 μl of reaction solution was spotted onto PEI-Cellulose thin layer chromatography plate (Sigma). Reaction products were resolved with running buffer (1 M (NH4)2SO4, 1.5 M KH2PO4, pH 3.8). The TLC plates were dried and scanned with Phosphoimager (Fuji).


Stable Cell Line Generation

Lentivirus production was carried out in 293T cells as described previously. THP-1 or L929 cells were infected with lentivirus containing UL37 or UL37 C819S mutant. After 36 h, THP-1 cells were selected with puromycin at 1 μg/ml and L929 cells were selected with puromycin at 5 μg/ml.


Deamidation Sites Analysis by LC/MS/MS

Purified mcGAS(141-507) was deamidated by purified FLAG-UL37 in vitro. Then the samples were resolved with SDS-PAGE and the mcGAS bands were excised and applied to LC/MS/MS analysis. The analysis of samples was carried out using a Thermo Scientific Q-Exactive hybrid Quadrupole-Orbitrap Mass Spectrometer and a Thermo Dionex UltiMate 3000 RSLCnano System. Peptide mixtures from each sample were loaded onto a peptide trap cartridge at a flow rate of 5 μL/min. The trapped peptides were eluted onto a reversed-phase PicoFrit column (New Objective, Woburn, Mass.) using a linear gradient of acetonitrile (3-36%) in 0.1% formic acid. The elution duration was 120 min at a flow rate of 0.3 μl/min. Eluted peptides from the PicoFrit column were ionized and sprayed into the mass spectrometer, using a Nanospray Flex Ion Source ES071 (Thermo) under the following settings: spray voltage, 2.0 kV, Capillary temperature, 250° C. Other settings were empirically determined. Raw data files were searched against mouse protein sequence database obtained from NCBI website using the Proteome Discoverer 1.4 software (Thermo, San Jose, Calif.) based on the SEQUEST algorithm. Carbamidomethylation (+57.021 Da) of cysteines was fixed modification, and Oxidation and Deamidation Q/N-deamidated (+0.98402 Da) were set as dynamic modifications. The minimum peptide length was specified to be five amino acids. The precursor mass tolerance was set to 15 ppm, whereas fragment mass tolerance was set to 0.05 Da. The maximum false peptide discovery rate was specified as 0.01. The resulting Proteome Discoverer Report contains all assembled proteins with peptides sequences and matched spectrum counts and peak area.


Tetramer Staining

HSV-1-infected mice were sacrificed at 4, 6, 8 days post-infection (dpi) and spleen were collected. Single cell suspension was generated by passing through 40 μM strainer on ice. Red blood cells were removed by adding 5 ml of Pharm Lysis buffer (BD Biosciences). Cells were washed once with cold PBS plus 1% FBS and subjected to tetramer staining.


Tetramer staining was carried out as previously described. Briefly, cells were incubated with anti-CD16/32 antibody for 10 min on ice, followed by staining for 1 h in the dark with tetramers (1:100). Then the cells were stained with anti-CD8 antibody for 20 min on ice. Samples were analyzed by flow cytometry using FACSCalibur and data were analyzed with FlowJo software.


H&E and Immunohistochemistry Staining

Mouse tissue samples were fixed in 10% (vol/vol) formalin solution (Sigma) overnight. Tissue specimens were dehydrated, embedded in paraffin, and cut into 8-μm sections. Tissue sections were analyzed by H&E and Images were collected with Keyence BZ-X700 microscope.


For immunohistochemistry staining, mouse tissue samples were fixed with 10% formalin solution overnight. Tissue specimens were dehydrated, embedded in paraffin, and cut into 8-μm sections. Tissue sections were analyzed by immunohistochemistry staining with antibodies against UL37, VHS and DAB substrate kit (Vector Laboratories). Images were visualized with Keyence BZ-X700 microscope.


Determining HSV-1-Specific Antibody

HSV-1-specific antibody detection was carried out as previously described. HSV-1 was purified by ultracentrifuge at 32,000 rpm for 2.5 h and concentrated viral particles were coated to ELISA plates at 4° C. overnight. Plates were then washed five times with PBS-Tween (0.05%) (PBST) and blocked with 1% BSA for 2 h at room temperature. Two-fold dilutions of sera, starting with an initial dilution of 1:10 in PBST were added to the wells and the plates were incubated at RT for 2 h. After washing, rabbit anti-mouse immunoglobulin conjugated to horseradish peroxidase (HRP) diluted at 1:5,000 was added and the plates were incubated for 1 h at room temperature. HSV-1-specific antibody was detected by adding TMB substrate (BD biosciences) and the absorbance was measured at 450 nm. Standard curve was generated by using mouse anti-HSV-1 antiserum. Antibody levels were expressed as arbitrary units against the standard.


Cytokine Measurement

THP-1 cells were stimulated with HT-DNA (2 μg/ml) or cGAMP (2 μg/ml) for 16 h or cells were infected with HSV-1 for 16 h. The medium were collected and applied to cytokine measurement. Mice were infected with HSV-1 or HSV-1 UL37 C819S virus (5×107) and the sera were collected 8 hours post-infection.


ELISA kit for human interferon-β (PBL assay science) was used to determine the concentration of human interferon-β. ELISA kits for murine interferon-α (PBL assay science), interferon-β (PBL assay science), CCL5 (R&D systems) and IL-6 (BD Biosciences) were used to determine the concentration of cytokines in the mouse serum according to the manufacturer's instructions.


EQUIVALENTS

It is to be understood that while the disclosure has been described in conjunction with the above embodiments, that the foregoing description and examples are intended to illustrate and not limit the scope of the disclosure. Other aspects, advantages and modifications within the scope of the disclosure will be apparent to those skilled in the art to which the disclosure pertains.


The embodiments illustratively described herein may suitably be practiced in the absence of any element or elements, limitation or limitations, not specifically disclosed herein. Thus, for example, the terms “comprising,” “including,” containing,” etc. shall be read expansively and without limitation. Additionally, the terms and expressions employed herein have been used as terms of description and not of limitation, and there is no intention in the use of such terms and expressions of excluding any equivalents of the features shown and described or portions thereof, but it is recognized that various modifications are possible within the scope of the disclosure.


Thus, it should be understood that although the present disclosure has been specifically disclosed by specific embodiments and optional features, modification, improvement and variation of the embodiments therein herein disclosed may be resorted to by those skilled in the art, and that such modifications, improvements and variations are considered to be within the scope of this disclosure. The materials, methods, and examples provided here are representative of particular embodiments, are exemplary, and are not intended as limitations on the scope of the disclosure.


The scoped of the disclosure has been described broadly and generically herein. Each of the narrower species and subgeneric groupings falling within the generic disclosure also form part of the disclosure. This includes the generic description with a proviso or negative limitation removing any subject matter from the genus, regardless of whether or not the excised material is specifically recited herein.


In addition, where features or aspects of the disclosure are described in terms of Markush groups, those skilled in the art will recognize that embodiments of the disclosure may also thereby be described in terms of any individual member or subgroup of members of the Markush group.


All publications, patent applications, patents, and other references mentioned herein are expressly incorporated by reference in their entirety, to the same extent as if each were incorporated by reference individually. In case of conflict, the present specification, including definitions, will control.












Sequence Listing 















Sequence ID NO.: 1


UL37 Wild-type


ATGGCAGACCGCGGTCTCCCGTCCGAGGCCCCCGTCGTCACGACCTCACCCGCCGGTCCGCCCT


CGGACGGACCTATGCAGCGCCTATTGGCGAGCCTAGCCGGCCTTCGCCAACCGCCAACCCCCAC


GGCCGAGACGGCAAACGGGGCGGACGACCCGGCGTTTCTGGCCACGGCCAAGCTGCGCGCCGCC


ATGGCGGCGTTTCTGTTGTCGGGAACGGCCATCGCCCCGGCAGACGCGCGGGACTGCTGGCGGC


CGCTGCTGGAACACCTGTGCGCGCTCCACCGGGCCCACGGGCTTCCGGAGACGGCGCTCTTGGC


CGAGAACCTCCCCGGGTTGCTCGTACACCGCTTGGTGGTGGCTCTCCCCGAGGCCCCCGACCAG


GCCTTCCGGGAGATGGAGGTCATCAAGGACACCATCCTCGCGGTCACCGGCTCCGACACGTCCC


ATGCGCTGGATTCCGCCGGCCTGCGCACCGCGGCGGCCCTGGGGCCGGTCCGCGTCCGCCAGTG


CGCCGTGGAGTGGATAGACCGCTGGCAAACCGTCACCAAGAGCTGCTTGGCCATGAGCCCGCGG


ACCTCCATCGAGGCCCTTGGGGAGACGTCGCTCAAGATGGCGCCGGTCCCGTTGGGGCAGCCCA


GCGCGAACCTTACCACCCCGGCGTACAGCCTGCTCTTCCCCGCCCCGTTCGTGCAAGAGGGCCT


CCGGTTCTTGGCCCTGGTGAGTAATCGGGTGACGCTGTTCTCGGCGCACCTCCAGCGCATAGAC


GACGCGACCCTCACTCCCCTCACACGGGCCCTCTTTACGTTGGCCCTGGTGGACGAGTACCTGA


CGACCCCCGAGCGGGGGGCTGTGGTCCCGCCGCCCCTGTTGGCGCAGTTTCAGCACACCGTGCG


GGAGATCGACCCGGCCATAATGATTCCGCCGCTCGAGGCCAACAAGATGGTTCGCAGCCGCGAG


GAGGTGCGCGTGTCGACGGCCCTCAGCCGCGTCAGCCCGCGCTCGGCCTGTGCGCCCCCGGGGA


CGCTAATGGCGCGCGTGCGGACGGACGTGGCCGTGTTTGATCCCGACGTGCCGTTCCTGAGTTC


GTCGGCACTGGCAGTCTTCCAGCCTGCCGTCTCCAGCCTGCTGCAGCTCGGGGAGCAGCCCTCC


GCCGGCGCCCAGCAGCGGCTGCTGGCTCTGCTGCAGCAGACGTGGACGTTGATCCAGAATACCA


ATTCGCCCTCCGTGGTGATCAACACCCTGATCGACGCTGGGTTCACGCCCTCGCACTGCACGCA


CTACCTTTCGGCCCTGGAGGGGTTTCTGGCGGCGGGCGTCCCCGCGCGGACGCCCACCGGCCAC


GGACTCGGCGAAGTCCAGCAGCTCTTTGGGTGCATTGCCCTCGCGGGGTCGAACGTGTTTGGGT


TGGCGCGGGAATACGGGTACTATGCCAACTACGTAAAAACTTTCAGGCGGGTCCAGGGCGCCAG


CGAGCACACGCACGGGCGGCTCTGCGAGGCGGTCGGCCTGTCGGGGGGCGTTCTAAGCCAGACG


CTGGCGCGTATCATGGGTCCGGCCGTGCCGACGGAACATCTGGCGAGCCTGCGGCGGGCGCTCG


TGGGGGAGTTTGAGACGGCCGAGCGCCGCTTTAGTTCCGGTCAACCCAGCCTTCTCCGCGAGAC


GGCGCTCATCTGGATCGACGTGTATGGTCAGACCCACTGGGACATCACCCCCACCACCCCGGCC


ACGCCGCTGTCCGCGCTTCTCCCCGTCGGGCAGCCCAGCCACGCCCCCTCTGTCCACCTGGCCG


CGGCGACCCAGATCCGCTTCCCCGCCCTCGAGGGCATTCACCCCAACGTCCTCGCCGACCCGGG


CTTCGTCCCCTACGTTCTGGCCCTGGTGGTCGGGGACGCGCTGAGGGCCACGTGTAGCGCGGCC


TACCTTCCCCGCCCGGTCGAGTTCGCCCTGCGTGTGTTGGCCTGGGCCCGGGACTTTGGGCTGG


GCTATCTCCCCACGGTTGAGGGCCATCGCACCAAACTGGGCGCGCTGATCACCCTCCTCGAACC


GGCCGCCCGGGGCGGCCTCGGCCCCACTATGCAGATGGCCGACAACATAGAGCAGCTGCTCCGG


GAGCTGTACGTGATCTCCAGGGGTGCCGTCGAGCAGCTGCGGCCCCTGGTCCAGCTGCAGCCCC


CCCCGCCCCCCGAGGTGGGCACCAGCCTCCTGTTGATTAGCATGTACGCCCTGGCCGCCCGGGG


GGTGCTGCAGGACCTCGCCGAGCGCGCAGACCCCCTGATTCGCCAACTGGAGGACGCCATCGTG


CTGCTGCGGCTGCACATGCGCACGCTCTCCGCCTTTTTCGAGTGTCGGTTCGAGAGCGACGGGC


GCCGCCTGTATGCGGTGGTCGGGGACACGCCCGACCGCCTGGGGCCCTGGCCCCCCGAGGCCAT


GGGGGACGCGGTGAGTCAGTACTGCAGCATGTATCACGACGCCAAGCGCGCGCTGGTCGCGTCC


CTCGCGAGCCTGCGTTCCGTCATCACCGAAACCACGGCGCACCTGGGGGTGTGCGACGAGCTGG


CGGCCCAGGTGTCGCACGAGGACAACGTGCTGGCCGTGGTCCGGCGCGAAATTCACGGGTTTCT


GTCCGTCGTGTCCGGCATTCACGCCCGGGCGTCGAAGCTGCTGTCGGGAGACCAGGTCCCCGGG


TTTTGCTTCATGGGTCAGTTTCTAGCGCGCTGGCGGCGTCTGTCGGCCTGCTATCAAGCCGCGC


GCGCGGCCGCGGGACCCGAGCCCGTGGCCGAGTTTGTCCAGGAACTCCACGACACGTGGAAGGG


CCTGCAGACGGAGCGCGCCGTGGTCGTGGCGCCCTTGGTCAGCTCGGCCGACCAGCGCGCCGCG


GCCATCCGAGAGGTAATGGCGCATGCGCCCGAGGACGCCCCCCCGCAAAGCCCCGCGGCCGACC


GCGTCGTGCTTACGAGCCGTCGCGACCTAGGGGCCTGGGGGGACTACAGCCTCGGCCCCCTGGG


CCAGACGACCGCGGTTCCGGACTCCGTGGATCTGTCTCGCCAGGGGCTGGCCGTTACGCTGAGT


ATGGATTGGTTACTGATGAACGAGCTCCTGCGGGTCACCGACGGCGTGTTTCGCGCTTCCGCGT


TTCGTCCGTTAGCCGGACCGGAGTCTCCCAGGGACCTGGAGGTCCGCGACGCCGGAAACAGTCT


CCCCGCGCCTATGCCCATGGACGCACAGAAGCCGGAGGCCTATGGGCACGGCCCACGCCAGGCG


GACCGCGAGGGGGCGCCTCATTCCAACACCCCCGTCGAGGACGACGAGATGATCCCGGAGGACA


CCGTCGCGCCACCCACGGACTTGCCGTTAACTAGTTACCAATAA





Sequence ID NO.: 2


UL37 C819S


ATGGCAGACCGCGGTCTCCCGTCCGAGGCCCCCGTCGTCACGACCTCACCCGCCGGTCCGCCCT


CGGACGGACCTATGCAGCGCCTATTGGCGAGCCTAGCCGGCCTTCGCCAACCGCCAACCCCCAC


GGCCGAGACGGCAAACGGGGCGGACGACCCGGCGTTTCTGGCCACGGCCAAGCTGCGCGCCGCC


ATGGCGGCGTTTCTGTTGTCGGGAACGGCCATCGCCCCGGCAGACGCGCGGGACTGCTGGCGGC


CGCTGCTGGAACACCTGTGCGCGCTCCACCGGGCCCACGGGCTTCCGGAGACGGCGCTCTTGGC


CGAGAACCTCCCCGGGTTGCTCGTACACCGCTTGGTGGTGGCTCTCCCCGAGGCCCCCGACCAG


GCCTTCCGGGAGATGGAGGTCATCAAGGACACCATCCTCGCGGTCACCGGCTCCGACACGTCCC


ATGCGCTGGATTCCGCCGGCCTGCGCACCGCGGCGGCCCTGGGGCCGGTCCGCGTCCGCCAGTG


CGCCGTGGAGTGGATAGACCGCTGGCAAACCGTCACCAAGAGCTGCTTGGCCATGAGCCCGCGG


ACCTCCATCGAGGCCCTTGGGGAGACGTCGCTCAAGATGGCGCCGGTCCCGTTGGGGCAGCCCA


GCGCGAACCTTACCACCCCGGCGTACAGCCTGCTCTTCCCCGCCCCGTTCGTGCAAGAGGGCCT


CCGGTTCTTGGCCCTGGTGAGTAATCGGGTGACGCTGTTCTCGGCGCACCTCCAGCGCATAGAC


GACGCGACCCTCACTCCCCTCACACGGGCCCTCTTTACGTTGGCCCTGGTGGACGAGTACCTGA


CGACCCCCGAGCGGGGGGCTGTGGTCCCGCCGCCCCTGTTGGCGCAGTTTCAGCACACCGTGCG


GGAGATCGACCCGGCCATAATGATTCCGCCGCTCGAGGCCAACAAGATGGTTCGCAGCCGCGAG


GAGGTGCGCGTGTCGACGGCCCTCAGCCGCGTCAGCCCGCGCTCGGCCTGTGCGCCCCCGGGGA


CGCTAATGGCGCGCGTGCGGACGGACGTGGCCGTGTTTGATCCCGACGTGCCGTTCCTGAGTTC


GTCGGCACTGGCAGTCTTCCAGCCTGCCGTCTCCAGCCTGCTGCAGCTCGGGGAGCAGCCCTCC


GCCGGCGCCCAGCAGCGGCTGCTGGCTCTGCTGCAGCAGACGTGGACGTTGATCCAGAATACCA


ATTCGCCCTCCGTGGTGATCAACACCCTGATCGACGCTGGGTTCACGCCCTCGCACTGCACGCA


CTACCTTTCGGCCCTGGAGGGGTTTCTGGCGGCGGGCGTCCCCGCGCGGACGCCCACCGGCCAC


GGACTCGGCGAAGTCCAGCAGCTCTTTGGGTGCATTGCCCTCGCGGGGTCGAACGTGTTTGGGT


TGGCGCGGGAATACGGGTACTATGCCAACTACGTAAAAACTTTCAGGCGGGTCCAGGGCGCCAG


CGAGCACACGCACGGGCGGCTCTGCGAGGCGGTCGGCCTGTCGGGGGGCGTTCTAAGCCAGACG


CTGGCGCGTATCATGGGTCCGGCCGTGCCGACGGAACATCTGGCGAGCCTGCGGCGGGCGCTCG


TGGGGGAGTTTGAGACGGCCGAGCGCCGCTTTAGTTCCGGTCAACCCAGCCTTCTCCGCGAGAC


GGCGCTCATCTGGATCGACGTGTATGGTCAGACCCACTGGGACATCACCCCCACCACCCCGGCC


ACGCCGCTGTCCGCGCTTCTCCCCGTCGGGCAGCCCAGCCACGCCCCCTCTGTCCACCTGGCCG


CGGCGACCCAGATCCGCTTCCCCGCCCTCGAGGGCATTCACCCCAACGTCCTCGCCGACCCGGG


CTTCGTCCCCTACGTTCTGGCCCTGGTGGTCGGGGACGCGCTGAGGGCCACGTGTAGCGCGGCC


TACCTTCCCCGCCCGGTCGAGTTCGCCCTGCGTGTGTTGGCCTGGGCCCGGGACTTTGGGCTGG


GCTATCTCCCCACGGTTGAGGGCCATCGCACCAAACTGGGCGCGCTGATCACCCTCCTCGAACC


GGCCGCCCGGGGCGGCCTCGGCCCCACTATGCAGATGGCCGACAACATAGAGCAGCTGCTCCGG


GAGCTGTACGTGATCTCCAGGGGTGCCGTCGAGCAGCTGCGGCCCCTGGTCCAGCTGCAGCCCC


CCCCGCCCCCCGAGGTGGGCACCAGCCTCCTGTTGATTAGCATGTACGCCCTGGCCGCCCGGGG


GGTGCTGCAGGACCTCGCCGAGCGCGCAGACCCCCTGATTCGCCAACTGGAGGACGCCATCGTG


CTGCTGCGGCTGCACATGCGCACGCTCTCCGCCTTTTTCGAGTGTCGGTTCGAGAGCGACGGGC


GCCGCCTGTATGCGGTGGTCGGGGACACGCCCGACCGCCTGGGGCCCTGGCCCCCCGAGGCCAT


GGGGGACGCGGTGAGTCAGTAC[[A]]GCAGCATGTATCACGACGCCAAGCGCGCGCTGGTCGC


GTCCCTCGCGAGCCTGCGTTCCGTCATCACCGAAACCACGGCGCACCTGGGGGTGTGCGACGAG


CTGGCGGCCCAGGTGTCGCACGAGGACAACGTGCTGGCCGTGGTCCGGCGCGAAATTCACGGGT


TTCTGTCCGTCGTGTCCGGCATTCACGCCCGGGCGTCGAAGCTGCTGTCGGGAGACCAGGTCCC


CGGGTTTTGCTTCATGGGTCAGTTTCTAGCGCGCTGGCGGCGTCTGTCGGCCTGCTATCAAGCC


GCGCGCGCGGCCGCGGGACCCGAGCCCGTGGCCGAGTTTGTCCAGGAACTCCACGACACGTGGA


AGGGCCTGCAGACGGAGCGCGCCGTGGTCGTGGCGCCCTTGGTCAGCTCGGCCGACCAGCGCGC


CGCGGCCATCCGAGAGGTAATGGCGCATGCGCCCGAGGACGCCCCCCCGCAAAGCCCCGCGGCC


GACCGCGTCGTGCTTACGAGCCGTCGCGACCTAGGGGCCTGGGGGGACTACAGCCTCGGCCCCC


TGGGCCAGACGACCGCGGTTCCGGACTCCGTGGATCTGTCTCGCCAGGGGCTGGCCGTTACGCT


GAGTATGGATTGGTTACTGATGAACGAGCTCCTGCGGGTCACCGACGGCGTGTTTCGCGCTTCC


GCGTTTCGTCCGTTAGCCGGACCGGAGTCTCCCAGGGACCTGGAGGTCCGCGACGCCGGAAACA


GTCTCCCCGCGCCTATGCCCATGGACGCACAGAAGCCGGAGGCCTATGGGCACGGCCCACGCCA


GGCGGACCGCGAGGGGGCGCCTCATTCCAACACCCCCGTCGAGGACGACGAGATGATCCCGGAG


GACACCGTCGCGCCACCCACGGACTTGCCGTTAACTAGTTACCAATAA





Sequence ID NO.: 3


RIG-I Wild-type


MTTEQRRSLQAFQDYIRKTLDPTYILSYMAPWFREEEVQYIQAEKNNKGPMEAATLFLK


FLLELQEEGWERGELDALDHAGYSGLYEAIESWDEKKIEKLEEYRLLLKRLQPEFKTRIIP


TDIISDLSECLINQECEEILQICSTKGMMAGAEKLVECLLRSDKENWPKTLKLALEKERN


KFSELWIVEKGTKDVETEDLEDKMETSDIQIFYQEDPECQNLSENSCPPSEVSDTNLYSPF


KPRNYQLELALPAMKGKNTIICAPTGCGKTFVSLLICEHHLKKFPQGQKGKVVFFANQIP


VYEQQKSVFSKYFERHGYRVTGTSGATAENVPVEQIVENNDIIILTPQILVNNLKKGTIPSL


SIFTLMIFDECHNTSKQHPYNMIMFNYLDQKLGGSSGPLPQVIGLTASVGVGDAKNTDE


ALDYICKLCASLDASVIATVKHNLEELEQVVYKPQKFFRKVESRISDKFKYIIAQLMRDT


ESLAKRICKDLENLSQIQNREFGTQKYEQWIVTVQKACMVFQMPDKDEESRICKALFLY


TSHLRKYNDALITSEHARMKDALDYLKDFFSNVRAAGFEEIEQDLTQRFEEKLQELESVS


RDPSNENPKLEDLCFILQEEYHLNPETITILFVKTRALVDALKNWIEGNPKLSFLKPGTLTG


RGKTNQNTGMTLPAQKCTLDAFKASGDHNILIATSVADEGTDIAQCNLVILYEYVGNVIK


MIQTRGRGRARGSKCFLLTSNAGVIEKEQINMYKEKMMNDSILRLQTWDEAVFREKILH


IQTHEKFIRDSQEKPKPVPDKENKKLLCRKCKALACYTADVRVIEECHYTVLGDAFKEC


FVSRPHPKPKQFSSFEKRAKIFCARQNCSHDWGTHVKYKTFEIPVIKIESFVVEDIATGVQ


TLYSKWKDFHFEKIPFDPAEMSK





Sequence ID NO.: 4


RIG-I-QQ


MTTEQRRSLQAFQDYIRKTLDPTYILSYMAPWFREEEVQYIQAEKNNKGPMEAATLFLK


FLLELQEEGWERGELDALDHAGYSGLYEAIESWDEKKIEKLEEYRLLLKRLQPEFKTRIIP


TDIISDLSECLINQECEEILQICSTKGMMAGAEKLVECLLRSDKENWPKTLKLALEKERN


KFSELWIVEKGTKDVETEDLEDKMETSDIQIFYQEDPECQNLSENSCPPSEVSDTNLYSPF


KPRNYQLELALPAMKGKNTIICAPTGCGKTFVSLLICEHHLKKFPQGQKGKVVFFANQIP


VYEQQKSVFSKYFERHGYRVTGTSGATAENVPVEQIVENNDIIILTPQILVNNLKKGTIPSL


SIFTLMIFDECHNTSKQHPYNMIMFNYLDQKLGGSSGPLPQVIGLTASVGVGDAKNTDE


ALDYICKLCASLDASVIATVKHNLEELEQVVYKPQKFFRKVESRISDKFKYIIAQLMRDT


ESLAKRICKDLE[[Q]]LSQIQNREFGTQKYEQWIVTVQKACMVFQMPDKDEESRICKALF


LYTSHLRKY[[Q]]DALITSEHARMKDALDYLKDFFSNVRAAGFEEIEQDLTQRFEEKLQEL


ESVSRDPSNENPKLEDLCFILQEEYHLNPETITILFVKTRALVDALKNWIEGNPKLSFLKP


GTLTGRGKTNQNTGMTLPAQKCTLDAFKASGDHNILIATSVADEGTDIAQCNLVILYEYV


GNVIKMIQTRGRGRARGSKCFLLTSNAGVIEKEQINMYKEKMMNDSILRLQTWDEAVF


REKILHIQTHEKFIRDSQEKPKPVPDKENKKLLCRKCKALACYTADVRVIEECHYTVLGD


AFKECFVSRPHPKPKQFSSFEKRAKIFCARQNCSHDWGTHVKYKTFEIPVIKIESFVVEDI


ATGVQTLYSKWKDFHFEKIPFDPAEMSK





Sequence ID NO.: 5


C819S KOS genome


81548 a→t 









gcagcccggg ccccccgcgg gcgcgcgcgc gcgcaaaaaa ggcgggcggc ggtccgggcg 


61
gcgtgcgcgc gcgcggcggg cgtggggggc ggggccgcgg gagcggggga ggagcggggg 


121
aggagcgggg ggaggagcgg ggggaggagc ggggggagga gcggggggag gagcgggggg 


181
aggagcgggg ggaggagcgg ggggaggagc ggggggagga gcggggggag gagcgggggg 


241
aggagcgggg ggaggagcgg ggggaggagc ggggggagga gcggggggag gagcgggggg 


301
aggagcgggg gaggagcggc cagaccccgg aaacgggccc cccccaaaac acaccccccg 


361
ggggtcgcgc gcggcccttt aaaggcgggc ggcgggcagc ccgggccccc cgcggccgag 


421
actagcgagt tagacaggca agcactactc gcctctgcac gcacatgctt gcctgtcaaa 


481
ctctaccacc ccggcacgct ctctgtctcc atggcccgcc gccgccatcg cggcccccgc 


541
cgcccccggc cgcccgggcc cacgggcgcg gtcccaaccg cacagtccca ggtaacctcc 


601
acgcccaact cggaacccgt ggtcaggagc gcgcccgcgg ccgccccgcc gccgcccccc 


661
gccagtgggc ccccgccttc ttgttcgctg ctgctgcgcc agtggctcca cgttcccgag 


721
tccgcgtccg acgacgacga cgacgactgg ccggacagcc ccccgcccga gccggcgcca 


781
gaggcccggc ccaccgccgc cgccccccgc ccccggtccc caccgcccgg cgcgggcccg 


841
gggggcgggg ctaacccctc ccaccccccc tcacgcccct tccgccttcc gccgcgcctc 


901
gccctccgcc tgcgcgtcac cgcagagcac ctggcgcgcc tgcgcctgcg acgcgcgggc 


961
ggggaggggg cgccgaagcc ccccgcgacc cccgcgaccc ccgcgacccc cacgcgggtg 


1021
cgcttctcgc cccacgtccg ggtgcgccac ctggtggtct gggcctcggc cgcccgcctg 


1081
gcgcgccgcg gctcgtgggc ccgcgagcgg gccgaccggg ctcggttccg gcgccgggtg 


1141
gcggaggccg aggcggtcat cgggccgtgc ctggggcccg aggcccgtgc ccgggccctg 


1201
gcccgcggag ccggcccggc gaactcggtc taacgttaca cccgaggcgg cctgggtctt 


1261
ccgcggagct cccgggagct ccgcaccaag ccgctctccg gagagacgat ggcaggagcc 


1321
gcgcatatat acgcttggag ccggcccgcc cccgaggcgg gcccgccctc ggagggcggg 


1381
actggccaat cggcggccgc cagcgcggcg gggcccggcc aaccagcgtc cgccgagtcg 


1441
tcggggcccg gcccactggg cggtaactcc cgcccagtgg gccgggccgc ccacttcccg 


1501
gtatggtaat taaaaacttg cagaggcctt gttccgcttc ccggtatggt aattagaaac 


1561
tcattaatgg gcggccccgg ccgcccttcc cgcttccggc aattcccgcg gcccttaatg 


1621
ggcaaccccg gtattccccg cctcccgcgc cgcgcgtaac cactcccctg gggttccggg 


1681
ttatgttaat tgcttttttg gcggaacaca cggcccctcg cgcattggcc cgcgggtcgc 


1741
tcaatgaacc cgcattggtc ccctggggtt ccgggtatgg taatgagttt cttcgggaag 


1801
gcgggaagcc ccggggcacc gacgcaggcc aagcccctgt tgcgtcggcg ggaggggcat 


1861
gctaatgggg ttctttgggg gacaccgggt tggtccccca aatcgggggc cgggccgtgc 


1921
atgctaatga tattctttgg gggcgccggg ttggtccccg gggacggggc cgccccgcgg 


1981
tgggcctgcc tcccctggga cgcgcggcca ttgggggaat cgtcactgcc gcccctttgg 


2041
ggaggggaaa ggcgtggggt ataagttagc cctggcccga cggtctggtc gcatttgcac 


2101
ctcggcactc ggagcgagac gcagcagcca ggcagactcg ggccgccccc tctccgcatc 


2161
accacagaag ccccgcctac gttgcgaccc ccagggaccc tccgtcagcg accctccagc 


2221
cgcatacgac ccccatggag ccccgccccg gagcgagtac ccgccggcct gagggccgcc 


2281
cccagcgcga ggtgaggggc cgggcgccat gtctggggcg ccatgttggg gggcgccatg 


2341
ttggggggcg ccatgttggg ggacccccga cccttacact ggaaccggcc gccatgttgg 


2401
gggaccccca ctcatacacg ggagccgggc gccatgttgg ggcgccatgt tagggggcgt 


2461
ggaaccccgt gacactatat atacagggac cgggggcgcc atgttagggg gcgcggaacc 


2521
ccctgaccct atatatacag ggaccggggt cgccctgtta ggggtcgcca tgtgaccccc 


2581
tgactttata tatacagacc cccaacacct acacatggcc cctttgactc agacgcaggg 


2641
cccggggtcg ccgtgggacc cccctgactc atacacagag acacgccccc acaacaaaca 


2701
cacagggacc ggggtcgccg tgttaggggg cgtggtcccc actgactcat acgcagggcc 


2761
cccttactca cacgcatcta ggggggtggg gaggagccgc ccgccatatt tgggggacgc 


2821
cgtgggaccc ccgactccgg tgcgtctgga gggcgggaga agagggaaga agaggggtcg 


2881
ggatccaaag gacggaccca gaccaccttt ggttgcagac ccctttctcc cccctcttcc 


2941
gaggccagca ggggggcagg actttgtgag gcgggggggg agggggaact cgtgggcgct 


3001
gattgacgcg ggaaatcccc ccattcttac ccgccccccc ttttttcccc tcagcccgcc 


3061
ccggatgtct gggtgtttcc ctgcgaccga gacctgccgg acagcagcga ctcggaggcg 


3121
gagaccgaag tgggggggcg gggggacgcc gaccaccatg acgacgactc cgcctccgag 


3181
gcggacagca cggacacgga actgttcgag acggggctgc tggggccgca gggcgtggat 


3241
gggggggcgg tctcgggggg gagccccccc cgcgaggaag accccggcag ttgcgggggc 


3301
gccccccctc gagaggacgg ggggagcgac gagggcgacg tgtgcgccgt gtgcacggat 


3361
gagatcgcgc cccacctgcg ctgcgacacc ttcccgtgca tgcaccgctt ctgcatcccg 


3421
tgcatgaaaa cctggatgca attgcgcaac acctgcccgc tgtgcaacgc caagctggtg 


3481
tacctgatag tgggcgtgac gcccagcggg tcgttcagca ccatcccgat cgtgaacgac 


3541
ccccagaccc gcatggaggc cgaggaggcc gtcagggcgg gcacggccgt ggactttatc 


3601
tggacgggca atcagcggtt cgccccgcgg tacctgaccc tgggggggca cacggtgagg 


3661
gccctgtcgc ccacccaccc tgagcccacc acggacgagg atgacgacga cctggacgac 


3721
ggtgaggcgg gggggcggcg aggaccctgg gggaggagga ggaggggggg gggagggagg 


3781
aataggcggg cgggcgggcg aggaaagggc gggccgggga gggggcgtaa cctgatcgcg 


3841
ccccccgttg tctcttgcag cagactacgt accgcccgcc ccccgccgga cgccccgcgc 


3901
ccccccacgc agaggcgccg ccgcgccccc cgtgacgggc ggggcgtctc acgcagcccc 


3961
ccagccggcc gcggctcgga cagcgccccc ctcggcgccc atcgggccac acggcagcag 


4021
taacactaac accaccacca acagcagcgg cggcggcggc tcccgccagt cgcgagccgc 


4081
ggtgccgcgg ggggcgtctg gcccctccgg gggggttggg gttgttgaag cggaggcggg 


4141
gcggccgagg ggccggacgg gcccccttgt caacagaccc gccccccttg caaacaacag 


4201
agaccccata gtgatcagcg actccccccc ggcctctccc cacaggcccc ccgcggcgcc 


4261
catgccaggc tccgcccccc gccccggtcc ccccgcgtcc gcggccgcgt cgggccccgc 


4321
gcgcccccgc gcggccgtgg ccccgtgtgt gcgggcgccg cctccggggc ccggcccccg 


4381
cgccccggcc cccggggcgg agccggccgc ccgccccgcg gacgcgcgcc gtgtgcccca 


4441
gtcgcactcg tccctggctc aggccgcgaa ccaagaacag agtctgtgcc gggcgcgtgc 


4501
gacggtggcg cgcggctcgg gggggccggg cgtggagggt ggacacgggc cctcccgcgg 


4561
cgccgccccc tccggcgccg ccccctccgg cgcccccccg ctcccctccg ccgcctctgt 


4621
cgagcaggag gcggcggtgc gtccgaggaa gaggcgcggg tcgggccagg aaaacccctc 


4681
cccccagtcc acgcgtcccc ccctcgcgcc ggcaggggcc aagagggcgg cgacgcaccc 


4741
cccctccgac tcagggccgg gggggcgcgg ccagggaggg cccgggaccc ccctgacgtc 


4801
ctcggcggcc tccgcctctt cctcctccgc ctcttcctcc tcggccccga ctcccgcggg 


4861
ggccacctct tccgccaccg gggccgcgtc ctcctccgct tccgcctcct cgggcggggc 


4921
cgtcggtgcc ctgggaggga gacaagagga aacctccctc ggcccccgcg ctgcttctgg 


4981
gccgcggggg ccgaggaagt gtgcccggaa gacgcgccac gcggagactt ccggggccgt 


5041
ccccgcgggc ggcctcacgc gctacctgcc catctcgggg gtctctagcg tggtcgccct 


5101
gtcgccttac gtgaacaaga cgatcacggg ggactgcctg cccatcctgg acatggagac 


5161
ggggaacatc ggggcgtacg tggtcctggt ggaccagacg ggaaacatgg cgacccggct 


5221
gcgggccgcg gtccccggct ggagccgccg caccctgctc cccgagaccg cgggtaacca 


5281
cgtgacgccc cccgagtacc cgacggcccc cgcgtcggag tggaacagcc tctggatgac 


5341
ccccgtgggg aacatgctgt tcgaccaggg caccctagtg ggcgccctgg acttccgcag 


5401
cctgcggtct cggcacccgt ggtccgggga gcagggggcg tcgacccggg acgagggaaa 


5461
acaataaggg acgcccccgt gtttgtgggg aggggggggt cgggcgctgg gtggtctctg 


5521
gccgcgccca ctacaccagc caatccgtgt cggggaggtg gaaagtgaaa gacacgggca 


5581
ccacacacca gcgggtcttt tgtgttggcc ctaataaaaa aaactcaggg gatttttgct 


5641
gtctgttggg aaataaaggt ttacttttgt atcttttccc tgtctgtgtt ggatgtatcg 


5701
cgggggtgcg tgggagtggg ggtgcgtggg agtgggggtg cgtgggagtg ggggtgcgtg 


5761
ggagtggggg tgcgtgggag tgggggtgcg tgggagtggg ggtgcgtggg agtgggggtg 


5821
cgtgggagtg ggggtgcgtg ggagtggggg tgccatgttg ggcaggctct ggtgttaacc 


5881
acagagccgc ggcccgggct gcctgaccac cgatccccga aagcatcctg ccactggcat 


5941
ggagccagaa ccacagtggg ttgggtgtgg gtgttaagtt tccgcgagcg cctgcccgcc 


6001
cggactgacc tggcctctgg ccgccacaaa gggcgggggg gggggttaac tacactatag 


6061
ggcaacaaag gatgggaggg gtagcggggc gggacggggc gcccaaaagg gggtcggcca 


6121
caccacagac gtgggtgttg gggggtgggg cggaggggtg gggggggaga cagaaacagg 


6181
aacatagtta gaaaacaaga atgcggtgca gccagagaat cacaggagac gaggggatgg 


6241
gcgtgttggt taccaaccca cacccaggca tgctcggtgg tatgaaggag ggggggcggt 


6301
gtttcttaga gaccgccggg ggacgtgggg ttggtgtgca aaggcacgcg cacccgcgcc 


6361
ggccaggtgg gccggtaccc catccccccc tcccccgacc cttcccaccc ccgcgtgcca 


6421
gagatcaccc cggtcccccg gcacccgcca ctcctccata tcctcgcttt aggaacaact 


6481
ttaggggggg gtacacacgc gccgtgcatt tccttccaca ccccccccct cccccgcact 


6541
cccccccccc aggcagtaag acccaagcat agagagccag gcacaaaaac acaggcgggg 


6601
tgggacacat gccttcttgg agtacgtggg tcattggcgt ggggggttac agcgacaccg 


6661
gccgaccccc tggcggtctt ccagccggcc cttagataag ggggcagttg gtggtcggac 


6721
gggtaagtaa cagagtctaa ctaagggtgg gaggggggga aaataacggg ctggtgtgct 


6781
gtaacacgag cccacccgcg agtggcgtgg ccgaccttag cctctggggc gccccctgtc 


6841
gtttgggtcc ccccccctct attggggaga agcaggtgtc taacctacct ggaaacgcgg 


6901
cgtctttgtt gaacgacacc ggggcgccct cgacgagtgg gataacgggg gaggaaggga 


6961
gggaggaggg tactgggggt gaaggggggg ggggagaagc gagaacagga aaggcgacgg 


7021
agcccggcag aacaccgagg aaaaaaaaac cacagcgcat gcgccgggcc gttgtggggc 


7081
cccgggccgg ggccccttgg gtccgccggg gccccgggcc gggccgccac gggggccggc 


7141
cgttggcggt aaccccgagt gttcatctca ggccccgggc cgggaacccg gaaaagcctc 


7201
cggggggcct ttttcgcgtc gcgtgccggc gagcgggtcc ggacggggcc cggaccgccg 


7261
cggtcggggg cccctcgtcc cgggccgtac gcggccttcg ccccgtgagg ggacagacga 


7321
acgaaacatt ccggcgacgg aacgaaaaac accccagacg ggttaaagaa acagaaaccg 


7381
caacccccac cacccccgaa acggggaaaa cgaaaaaaca gaccagcggc cggccggcgc 


7441
ttagggggag gatgtcgccg acgccccttg gccgccccgg ctgcaggggg gcccggagag 


7501
ccgcggcacc cggacgcgcc cggaaagtct ttcgcaccac cggcgatcgg cacggccgcg 


7561
cccccgcttt tataaaggct cagatgacgc agcaaaaaca ggccacagca ccacatgggt 


7621
aggtgatgta attttatttt cctcgtctgc ggcctaatgg atttccgggc gcggtgcccc 


7681
tgtctgcaga gcacttaacg gattgatatc tcgcgggcac gcgcgccctt aatggaccgg 


7741
cgcggggcgg ggggccggat acccacacgg gcgggggggg gtgtcgcggg ccgtctgctg 


7801
gcccgcggcc acataaacaa tgactcgggg cctttctgcc tctgccgctt gtgtgtgcgc 


7861
gcgccggctc tgcggtgtcg gcggcggcgg cggcggtggc cgccgtgttc ggtctcggta 


7921
gccggccggc gggtggactc gcggggggcc ggagggtgga aggcaggggg gtgtaggatg 


7981
ggtatcagga cttccacttc ccgtccttcc atcccccgtt cccctcggtt gttcctcgcc 


8041
tcccccaaca ccccgccgct ttccgttggg gttgttattg ttgtcgggat cgtgcgggcc 


8101
gggggtcgcc ggggcagggg cgggggcgtg ggcgggggtg ctcgtcgatc gaccgggctc 


8161
agtgggggcg tggggtgggt gggagaaggc gaggagactg gggtgggggt gtcggtgggt 


8221
ggttgttttt tgtggttgtt ttttgtgtct gtttccgtcc cccgtcaccc ccctccctcc 


8281
gtcccctccg tccccccgtc gcgggtgttt gtgtttgttt attccgacat cggtttattt 


8341
aaaataaaca cagccgttct gcgtgtctgt tcttgcgtgt ggctgggggc ttatatgtgg 


8401
ggtcccgggg gcgggatggg gtttagcggc ggggggcggc gcgccggacg gggcgctgga 


8461
gataacggcc cccggggaac gggggaccgg ggctgggtat cccgaggtgg gtgggtgggc 


8521
ggcggtggcc gggccgggcc gggccgggcc gggtgggcgg ggtttggaaa aacgaggagg 


8581
aggaggagaa ggcggggggg gagacggggg gaaagcaagg acacggcccg gggggtggga 


8641
gcgcgggccg ggccgctcgt aagagccgcg acccggccgc cggggagcgt tgtcgccgtc 


8701
ggtctgccgg cccccgtccc tccctttttt gaccaaccag cgccctcccc cccaccacca 


8761
ttcctactac caccaccacc accaccccca ccaccgacac ctcccgcgca cccccgccca 


8821
catcccccca ccccgcacca cgagcacggg gtgggggtag caggggatca aaggggggca 


8881
aagccggcgg ggcggttcgg gggggcggga gaccgagtag gcccgcccat acgcggcccc 


8941
tcccggcagc cacgcccccc agcgtcgggt gtcacgggga aagagcaggg gagagggggg 


9001
gagaggggag agggggggag aggggagagg gggggagagg ggagaggggg ggagagggga 


9061
gaggggggga gaggggagag ggggggagag gggagagggg gggagagggg agaggggggg 


9121
agaggggaga gggggggaga ggggagaggg ggggagaggg ggtatataaa ccaacgaaaa 


9181
gcgcgggaac ggggatacgg ggcttgtgtg gcacgacgtc gtggttgtgt tactgggcaa 


9241
acacttgggg actgtaggtt tctgtgggtg ccgaccctag gcgctatggg gattttgggt 


9301
tgggtcgggc ttattgccgt tggggttttg tgtgtgcggg ggggcttgtc ttcaaccgaa 


9361
tatgttattc ggagtcgggt ggctcgagag gtgggggata tattaaaggt gccttgtgtg 


9421
ccgctcccgt ctgacgatct tgattggcgt tacgagaccc cctcggctat aaactatgct 


9481
ttgatagacg gtatattttt gcgttatcac tgtcccggat tggacacggt cttgtgggat 


9541
aggcatgccc agaaggcata ttgggttaac ccctttttat ttgtggcggg ttttctggag 


9601
gacttgagtc accccgcgtt tcctgccaac acccaggaaa cagaaacgcg cttggccctt 


9661
tataaagaga tacgccaggc gctggacagt cgcaagcagg ccgccagcca cacacctgtg 


9721
aaggctgggt gtgtgaactt tgactattcg cgcacccgcc gctgtgtagg gcgacaggat 


9781
ttgggaccta ccaacggaac gtctggacgg accccggttc tgccgccgga cgatgaagcg 


9841
ggcctgcaac cgaagcccct caccacgccg ccgcccatca tcgccacgtc ggaccccacc 


9901
ccgcgacggg acgccgccac aaaaagcaga cgccgacgac cccactcccg gcgcctctaa 


9961
cgatgcctcg acggaaaccc gtccgggttc ggggggcgaa ccggccgcct gtcgctcgtc 


10021
agggccggcg ggcgctcctc gccgccctag aggctgtccc gctggtgtga cgttttcctc 


10081
gtccgcgccc cccgaccctc ccatggattt aacaaacggg ggggtgtcgc ctgcggcgac 


10141
ctcggcgcct ctggactgga ccacgtttcg gcgtgtgttt ctgatcgacg acgcgtggcg 


10201
gcccctgatg gagcctgagc tggcgaaccc cttaaccgcc cacctcctgg ccgaatataa 


10261
tcgtcggtgc cagaccgaag aggtgctgcc gccgcgggag gatgtgtttt cgtggactcg 


10321
ttattgcacc cccgacgagg tgcgcgtggt tatcatcggc caggacccat atcaccaccc 


10381
cggccaggcg cacggacttg cgtttagcgt gcgcgcgaac gtgccgcctc ccccgagtct 


10441
tcggaatgtc ttggtggccg tcaagaactg ttatcccgag gcacggatga gcggccacgg 


10501
ttgcctggaa aagtgggcgc gggacggcgt cctgttacta aacacgaccc tgaccgtcaa 


10561
gcgcggggcg gcggcgtccc actctagaat cggttgggac cgcttcgtgg gcggagttat 


10621
ccgccggttg gccgcgcgcc gccccggcct ggtgtttatg ctctggggcg cacacgccca 


10681
gaatgccatc aggccggacc ctcgggtcca ttgcgtcctc aagttttcgc acccgtcgcc 


10741
cctctccaag gttccgttcg gaacctgcca gcatttcctc gtggcgaacc gatacctcga 


10801
gacccggtcg atttcaccca tcgactggtc ggtttgaaag gcatcgacgt ccggggtttt 


10861
tgtcggtggg ggcttttggg tatttccgat gaataaagac ggttaatggt taaacctctg 


10921
gtctcatacg ggtcggtgat gtcgggcgtc gggggagagg gagttccctc tgcgcttgcg 


10981
attctagcct cgtggggctg gacgttcgac acgccaaacc acgagtcggg gatatcgcca 


11041
gatacgactc ccgcagattc cattcggggg gccgctgtgg cctcacctaa ccaaccttta 


11101
cacgggggcc cggaacggga ggccacagcg ccgtctttct ccccaacgcg cgcggatgac 


11161
ggcccgccct gtaccgacgg gccctacgtg acgtttgata ccctgtttat ggtgtcgtcg 


11221
atcgacgaat tagggcgtcg ccagctcacg gacaccatcc gcaaggacct gcggttgtcg 


11281
ctggccaagt ttagcattgc gtgcaccaag acctcctcgt tttcgggaaa cgccccgcgc 


11341
caccacagac gcggggcgtt ccagcgcggc acgcgggcgc cgcgcagcaa caaaagcctc 


11401
cagatgtttg tgttgtgcaa acgcgcccac gccgctcgag tgcgagagca gcttcgggtc 


11461
gttattcagt cccgcaagcc gcgcaagtat tacacgcgat cttcggacgg gcggctctgc 


11521
cccgccgtcc ccgtgttcgt ccacgagttc gtctcgtccg agccaatgcg cctccaccga 


11581
gataacgtca tgctggcctc gggggccgag taaccgcccc ccccccatgc caccctcact 


11641
gcccgtcgcg cgtgtttgat gttaataaat aacacataaa tttggctggt tgtttgttgt 


11701
ctttaatgga ccgcccgcaa gggggggggg gcgtttcagt gtcgggtgac gagcgcgatc 


11761
cggccgggat cctaggaccc caaaagtttg tctgcgtatt ccagggtggg gctcagttga 


11821
atctcccgca gcacctctac cagcaggtcc gcggtgggct ggagaaactc ggccgtcccg 


11881
gggcaggcgg ttgtcggggg tggaggcgcg gcgcccaccc cgtgtgccgc gcctggcgtc 


11941
tcctctgggg gcgacccgta aatggttgca gtgatgtaaa tggtgtccgc ggtccagacc 


12001
acggtcaaaa tgccggccgt ggcgctccgg gcgctttcgc cgcgcgagga gctgacccag 


12061
gagtcgaacg gatacgcgta catatgggcg tcccacccgc gttcgagctt ctggttgctg 


12121
tcccggccta taaagcggta ggcacaaaat tcggcgcgac agtcgataat caccaacagc 


12181
ccaatggggg tgtgctggat aacaacgcct ccgcgcggca ggcggtcctg gcgctcccgg 


12241
ccccgtacca tgatcgcgcg ggtgccgtac tcaaaaacat gcaccacctg cgcggcgtcg 


12301
ggcagtgcgc tggtcagcga ggccctggcg tggcataggc tatacgcgat ggtcgtctgt 


12361
ggattggaca tctcgcggtg ggtagtgagt cccccgggcc gggttcggtg gaactgtaag 


12421
gggacggcgg gttaatagac aatgaccacg ttcggatcgc gcagagccga tagtatgtgc 


12481
tcactaatga cgtcatcgcg ctcgtggcgc tcccggagcg gatttaagtt catgcgaagg 


12541
aattcggagg aggtggtgcg ggacatggcc acgtacgcgc tgttgaggcg caggttgccg 


12601
ggcgtaaagc agatggcgac cttgtccagg ctaaggccct gggagcgcgt gatggtcatg 


12661
gcaagcttgg agctgatgcc gtagtcggcg tttatggcca tggccagctc cgtagagtca 


12721
atggactcga caaactcgct gatgttggtg ttgacgacgg acatgaagcc gtgttggtcc 


12781
cgcaagacca cgtaaggcag gggggcctct tccagtaact cggccacgtt ggccgtcgcg 


12841
tgccgcctcc gcagctcgtc cgcaaaggca aacacccgtg tgtacgtgta tcccatgagc 


12901
gtataattgt ccgtctgcag ggcgacggac atcagccccc cgcgcggcga gccggtcagc 


12961
atctcgcagc cccggaagat aacgttgtcc acgtacgtgc taaagggggc gacttcaaat 


13021
gcctccccga agagctcttg gaggattcgg aatctcccga ggaaggcccg cttcagcagc 


13081
gcaaactggg tgtgaacggc ggcggtggtc tccggttccc cgggggtgta gtggcagtaa 


13141
aacacgtcga gctgttgttc gtccagcccc gcgaaaataa cgtcgaggtc gtcgtcggga 


13201
aaatcgtccg ggcccccgtc ccgcggcccc agttgcttaa aatcaaacgc acgctcgccg 


13261
ggggcgcctg cgtcggccat taccgacgcc tgcgtcggca cccccgaaga tttggggcgc 


13321
agagacagaa tctccgccgt tagttctccc atgcgggcgt acgcgagggt cctctgggtc 


13381
gcatccaggc ccgggcgctg cagaaagttg taaaaggaga taagcccgct aaatatgagc 


13441
cgcgacagga acctgtaggc aaactccacc gaagtctccc cctgagtctt tacaaagctg 


13501
tcgtcacgca acactgcctc gaaggcccgg aacgtcccac taaacccaaa aaccagtttt 


13561
cgcaggcgcg cggtcaccgc gatctggctg ttgaggacgt aagtgacgtc gttgcgggcc 


13621
acgaccagct gctgtttgct gtgcacctcg cagcgcatgt gccccgcgtc ctggtcctgg 


13681
ctctgcgagt agttggtgat gcggctggcg ttggccgtga gccacttttc aatagtcagg 


13741
ccgggctggt gtgtcagccg tcggtattcg tcaaactcct tgaccgacac gaacgtaagc 


13801
acggggaggg tgaacacgac gaactccccc tcacgggtca ccttcaggta ggcgtggagc 


13861
ttggccatgt acgcgctcac ctctttgtgg gaggagaaca gccgcgtcca gccggggagg 


13921
ttggcggggt tggtgatgta gttttccggg acgacgaagc gatccacgaa ctgcatgtgc 


13981
tcctcggtga tgggcaggcc gtactccagc accttcatga ggttaccgaa ctcgtgctcg 


14041
acgcaccgtt tgttgttaat aaaaatggcc cagctatacg agaggcgggc gtactcgcgc 


14101
agcgtgcggt tgcagatgag gtacgtgagc acgttctcgc tctggcggac ggaacaccgc 


14161
agtttctggt gctcgaaggt cgactccagg gacgccgtct gcgtcggcga gcccacacac 


14221
accaacacgg gccgcaggcg ggccgcgtac tggggggtgt ggtacagggc gttaatcatc 


14281
caccagcaat acaccacggc cgtgaggagg tgacgcccaa ggagcccggc ctcgtcgatg 


14341
acgatcacgt tgctgcgggt aaaggccggc agcgccccgt gggtggccgg ggccaaccgc 


14401
gtcagggcgc cctcggccaa ccccagggtc cgttccaggg cggccagggc gcgaaactcg 


14461
ttccgcaact cctcgccccc ggaggcggcc agggcgcgct tcgtgaggtc caaaatcacc 


14521
tcccagtagt acgtcagatc tcgtcgctgc aggtcctcca gcgaggcggg gttgctggtc 


14581
agggtgtacg ggtactgtcc cagttgggcc tggacgtgat tcccgcgaaa cccaaattca 


14641
tgaaagatgg tgttgatggg tcggctgaga aaggcgcccg agagtttggc gtacatgttt 


14701
tgggccgcaa tgcgcgtggc gcccgtcacc acacagtcca agacctcgtt gattgtctgc 


14761
acgcacgtgc tctttccgga gccagcgttg ccggtgataa gatacaccgc gaacggaaac 


14821
tccctgaggg gcaggcctgc gggggactct aaggccgcca cgtcccggaa ccactgcaga 


14881
cggggcactt gcgctccgtc gagctgttgt tgcgagagct ctcggatgcg cttaaggatt 


14941
ggctgcaccc cgtgcataga cgtaaaattt aaaaaggcct cggccctccc tggaacggct 


15001
ggtcggtccc cgggttgctg aaggtgcggc gggccgggtt tctgtccgtc tagctggcgc 


15061
tccccgccgg ccgccgccat gaccgcacca cgctcgtggg cccccactac gcgtgcgcgg 


15121
ggggacacgg aagcgctgtg ctcccccgag gacggctggg taaaggttca ccccaccccc 


15181
ggtacgatgc tgttccgtga gattctccac gggcagctgg ggtataccga gggccagggg 


15241
gtgtacaacg tcgtccggtc cagcgaggcg accacccggc agctgcaggc ggcgatcttt 


15301
cacgcgctcc tcaacgccac cacttaccgg gacctcgagg cggactggct cggccacgtg 


15361
gcggcccgcg gtctgcagcc ccaacggctg gttcgccggt acaggaacgc ccgggaggcg 


15421
gatatcgccg gggtggccga gcgggtgttc gacacgtggc ggaacacgct taggacgacg 


15481
ctgctggact ttgcccacgg gttggtcgcc tgctttgcgc cgggcggccc gagcggcccg 


15541
tcaagcttcc ccaaatatat cgactggctg acgtgcctgg ggctggtccc catattacgc 


15601
aagcgacaag aagggggtgt gacgcagggt ctgagggcgt ttctcaagca gcacccgctg 


15661
acccgccagc tggccacggt cgcggaggcc gcggagcgcg ccggccccgg gttttttgag 


15721
ctggcgctgg ccttcgactc cacgcgcgtg gcggactacg accgcgtgta tatctactac 


15781
aaccaccgcc ggggcgactg gctcgtgcga gaccccatca gcgggcagcg cggagaatgt 


15841
ctggtgctgt ggcccccctt gtggaccggg gaccgtctgg tcttcgattc gcccgtccag 


15901
cggctgtttc ccgagatcgt cgcgtgtcac tccctccggg aacacgcgca cgtctgccgg 


15961
ctgcgcaata ccgcgtccgt caaggtgctg ctggggcgca agagcgacag cgagcgcggg 


16021
gtggccggtg ccgcgcgggt cgttaacaag gtgttggggg aggacgacga gaccaaggcc 


16081
gggtcggccg cctcgcgcct cgtgcggctt atcatcaaca tgaagggcat gcgccacgta 


16141
ggcgacatta acgacaccgt gcgtgcctac ctcgacgagg ccggggggca cctgatagac 


16201
gccccggccg tcgacggtac cctccctgga ttcggcaagg gcggaaacag ccgcgggtct 


16261
gcgggccagg accagggggg gcgggcgccg cagcttcgcc aggccttccg cacggccgtg 


16321
gttaacaaca tcaacggcgt gttggagggc tatataaata acctgtttgg aaccatcgag 


16381
cgcctgcgcg agaccaacgc gggcctggcg acccaattgc aggagcgcga ccgcgagctc 


16441
cggcgcgcaa cagcgggggc cctggagcgc cagcagcgcg cggccgacct ggcggccgag 


16501
tccgtgaccg gtggatgcgg cagccgccct gcgggggcgg acctgctccg ggccgactat 


16561
gacattatcg acgtcagcaa gtccatggac gacgacacgt acgtcgccaa cagctttcag 


16621
cacccgtaca tcccttcgta cgcccaggac ctggagcgcc tgtcgcgcct ctgggagcac 


16681
gagctggtgc gctgttttaa aattctgtgt caccgcaaca accagggcca agagacgtcg 


16741
atctcgtact ccagcggggc gatcgccgca ttcgtcgccc cctactttga gtcagtgctt 


16801
cgggcccccc gggtaggcgc gcccatcacg ggctccgatg tcatcctggg ggaggaggag 


16861
ttatgggatg cggtgtttaa gaaaacccgc ctgcaaacgt acctgacaga catcgcggcc 


16921
ctgttcgtcg cggacgtcca gcacgcagcg ctgcccccgc ccccctcccc ggtcggcgcc 


16981
gatttccggc ccggcgcgtc cccgcggggc cggtccagat cgcggtcgcc cggaagaact 


17041
gcgccaggcg cgccggacca gggcgggggc atcgggcacc gggatggccg ccgcgacggc 


17101
cgacgatgag gggtcggccg ccaccatcct caagcaggcc atcgccgggg accgcagcct 


17161
ggtcgaggcg gccgaggcga ttagccagca gacgctgctc cgcctggcct gcgaggtgcg 


17221
ccaggtcggc gaccgccagc cgcggtttac cgccaccagc atcgcgcgcg tcgacgtcgc 


17281
gcctgggtgc cggttgcggt tcgttctgga cgggagtccc gaggacgcct atgtgacgtc 


17341
ggaggattac tttaagcgct gctgcggcca gtccagttat cgcggcttcg cggtggcggt 


17401
cctgacggcc aacgaggacc acgtgcacag cctggccgtg ccccccctcg ttctgctgca 


17461
ccggttctcc ctgttcaacc ccagggacct cctggacttt gagcttgcct gtctgctgat 


17521
gtacctggag aactgccccc gaagccacgc caccccgtcg acctttgcca aggttctggc 


17581
gtggctcggg gtcgcgggtc gccgcacgtc cccattcgaa cgcgttcgct gccttttcct 


17641
ccgcagttgc cactgggtcc taaacacact catgttcatg gtgcacgtaa aaccgttcga 


17701
cgacgagttc gtcctgcccc actggtacat ggcccggtac ctgctggcca acaacccgcc 


17761
ccccgttctc tcggccctgt tctgtgccac cccgacgagc tcctcattcc ggctgccggg 


17821
gccgcccccc cgctccgact gcgtggccta taaccccgcc gggatcatgg ggagctgctg 


17881
ggcgtcggag gaggtgcgcg cgcctctggt ctattggtgg ctttcggaga ccccaaaacg 


17941
acagacgtcg tcgctgtttt atcagttttg ttgaatttta ggaaataaac ccggttttgt 


18001
ttctgtggcc tcccgacgga tgcgcgtgtc cttcctccgt cttggtgggt gggtgtctgt 


18061
gtatcgcgtc ccatctgtgc ggagaggggg ggcatgtcgg cacgtattcg gacagactca 


18121
agcacacacg ggggagcgct cttgtctcag ggcaatgttt ttattggtca aactcaggca 


18181
aacagaaacg acatcttgtc gtcaaaggga tacacaaact tccccccctc tccccatact 


18241
cccgccagca ccccggtaaa caccaactca atctcgcgca ggatttcgcg caggtgatga 


18301
gcgcagtcca cgggggggag cacaaggggc cgcgggtata gatcgacggg gacgccgacc 


18361
gactccccgc ctccgggaca gacacgcacg acgcgccgcc agtagtgctc tgcgtccagc 


18421
aaggcgccgc cgcggaaggc agtggggggc aaggggtcgc tagcctcaaa gggggacacc 


18481
cgaacgctcc agtactccgc gtccaaccgt ttattaaacg cgtccacgat aaggcggtcg 


18541
caggcgtcct ccataaggcc ccgggccgtg agtgcgtcct cctccggcac gcctgccgtt 


18601
gtcaggccca ggacccgtcg cagcgtgtcg cgtacgaccc cggccgccgt ggtgtacgcg 


18661
ggcccgcgga gaggaaatcc cccaagatgg tcagtgttgt cgcgggagtt ccagaaccac 


18721
actcccgcct ggttccaggc gactgcgtgg gtgtagacgc cctcgagggc caggcacagt 


18781
gggtgccgca gccggaggcc gttggcccta agcacggctc ccacggccgt ctcgatggcc 


18841
cgccgggcgt cctcgatcac cccggaagcc gcatccgcgt cttgggggtc cacgttaaag 


18901
acaccccaga acgcaccccc atcgcccccg cagaccgcga acttcaccga gctggccgtc 


18961
tcctcgatct gcaggcagac ggcggccatt accccaccca ggagctgccg cagcgcaggg 


19021
caggcgtcgc acgtgtccgg gaccaggcgc tccaagacgg ccccggccca gggctctgag 


19081
ggagcggcca ccaccagcgc gtccagtctt gctaggcccg tccggccgtg ggggtccgcc 


19141
agcccgctcc ccccgaggtc ggccagggcc gccaggagct gggcgcgaag tccggggaag 


19201
caaaaccgcg ccgtccagac gggcccgacg gccgcgggcg ggtctaacag ttggatgatt 


19261
ttagtggcgg gatgccaccg cgccaccgcc tcccgcaccg cgggcaggag gcatccggct 


19321
gccgccgagg ccacgccggg ccaggctcgc ggggggagga cgaccctggc ccccaccgcg 


19381
ggccaggccc ccaggagcgc ggcgtaagcg gccgcggccc cgcgcaccag gtcccgtgcc 


19441
gactcggccg tggccggcac ggtgaacgtg ggccaacccg gaaaccccag gacggcaaag 


19501
tacgggacgg gtcccccccg gacctcaaac tcgggcccca gaaaggcaaa gacgggggcc 


19561
agggccccgg gggcggcgtg gaccgtggta tgccactgcc ggaaaagggc gacgagcgcc 


19621
ggcgcggaga acttctcgcc ggcgcttaca aagtagtcgt aatcgcgggg cagcagcacc 


19681
cgtgccgtga ctcgttgcgg gtgcccgcgt ggccgcaggc ccacctcgca cacctcgacc 


19741
aggtccccga acgcgccctc cttcttgatc ggcggaaacg caagagtctg gtattcgcgc 


19801
gcaaatagcg cggttccggt ggtgatgtta acggtcagcg aagcggcgga cgcgcactgg 


19861
ggggtgtcgc gaatggccgc caggcgcgcc cacgccagcc gcgcgtcggg atgctcggca 


19921
acgcgcgccg ccagggccat agggtcgatg tcaatgttgg cctccgcgac caggagagcg 


19981
gcgcgagggg cggcgggcgg gccccacgac gctctctcaa ctttcaccac cagtcccgtg 


20041
cgtgggtccg agccgatacg cagcggggcg aacagggcca ccggcccggt ctggcgctcc 


20101
agggccgcca ggacgcacgc gtacagcgcc cgccacagag tcgggttctc caggggctcc 


20161
agcggggagg cggccggcgt cgtcgcggcg cgggcggccg ccacgacggc ctggacggag 


20221
acgtccgcgg agccgtagaa atcccgcagc tccgtcgcgg tgacggagac ctccgcaaag 


20281
cgcgcgcgac cctcccctgc ggcgttgcga catacaaaat acaccagggc gtggaagtac 


20341
tcgcgagcgc gggggggcag ccataccgcg taaagggtaa tggcgctgac gctctcctcc 


20401
acccacacga tatctgcggt gtccatcgca cggcccctaa ggatcacggg cggtctgtgg 


20461
gtcccatgct gccgtgcctg gccgggcccg gtgggtcgcg gaaaccggtg acgggggggg 


20521
ggcggttttt ggggttgggg tgggggtggg aaacggcccg ggtccggggg ccaacttggc 


20581
ccctcggtgc gttccggcaa cagcgccgcc ggtccgcgga cgaccacgta ccgaacgagt 


20641
gcggtcccga gacttatagg gtgctaaagt tcaccgcccc ctgcatcatg ggccaggcct 


20701
cggtggggag ctccgacagc gccgcctcca ggatgatgtc agcgttgggg ttggcgctgg 


20761
atgagtgcgt gcgcaaacag cgcccccacg caggcacgcg tagcttgaag cgcgcgcccg 


20821
caaactcccg cttgtgggcc ataagcaggg cgtacagctg cctgtgggtc cggcaggcgc 


20881
tgtggtcgat gtggtgggcg tccaacaacc ccacgattgt ctgtttggtg aggtttttaa 


20941
cgcgccccgc cccgggaaac gtctgcgtgc ttttggccat ctgcacgcca aacagttcgc 


21001
cccagattat cttgaacagc gccaccgcgt ggtccgtctc gctaacggac ccgcgcgggg 


21061
gacagccgct tagggcgtcg gcgacgcgct tgacggcttc ctccgagagc agaagtccgt 


21121
cggttacgtt acagtggccc agttcgaaca ccagctgcat gtagcggtcg tagtgggggg 


21181
tcagtaggtc cagcacgtca tcggggccga aggtcctccc agatcccccg gccgccgagt 


21241
cccaatgcag gcgcgcggcc atggtgctgc acaggcacaa cagctcccag acgggggtta 


21301
cgttcagggt ggggggcagg gccacgagct ccagctctcc ggtgacgttg atcgtgggga 


21361
tgacgcccgt ggcgtagtgg tcatagatcc gccgaaatat ggcgctgctg cgggtggcca 


21421
tgggaacgcg gagacaggcc tccagcaacg ccaggtaaat aaaccgcgtg cgtcccatca 


21481
ggctgttgag gttgcgcatg agcgcgacaa tttccgccgg cgcgacatcg gaccggaggt 


21541
atttttcgac gaaaagaccc acctcctccg tctcggcggc ctgggccggc agcgacgcct 


21601
cgggatcccg gcaccgcagc tcccgtagat cgcgctgggc cctgagggcg tcgaaatgta 


21661
cgccccgcaa aaacagacag aagtcctttg gggtcagggt atcgtcgtgt ccccagaagc 


21721
gcacgcgtat gcagtttagg gtcagcagca tgtgaaggat gttaaggctg tccgagagac 


21781
acgccagcgt gcatctctca aagtagtgtt tgtaacggaa tttgttgtag atgcgcgacc 


21841
cccgccccag cgacgtgtcg catgccgacg cgtcacagcg ccccttgaac cggcgacaca 


21901
gcaggtttgt gacctgggag aactgcgcgg gccactggcc gcaggaactg accacgtgat 


21961
taaggagcat gggcgtaaag acgggctccg agcgcgcccc ggagccgtcc atgtaaatca 


22021
gtagctcccc cttgcggagg gtgcgcaccc gtcccaggga ctggtacacg gacaccatgt 


22081
ccggtccgta gttcatgggt tttacgtagg cgaacatgcc atcaaagtgc aggggatcga 


22141
agctgaggcc cacggttacg accgtcgtgt atataaccac gcggtattgg ccccacgtgg 


22201
tcacgtcccc gaggggggtg agcgagtgaa gcaacagcac gcggtccgta aactgacggc 


22261
agaaccgggc cacgatctcc gcgaaggaga ccgtcgacga aaaaatgcag atgttatcgc 


22321
ccccgccaag gcgcgcttcc agctccccaa agaacgtggc cccccgggcg tccggagagg 


22381
cgtccggaga cgggccgctc ggcggcccgg gcgggcgcag ggcagcctgc aggagctcgg 


22441
tccccagacg cgggagaaac aggcaccggc gcgccgaaaa cccgggcatg gcgtactcgc 


22501
cgaccaccac atgcacgttt ttttcgcccc ggagaccgca caggaagtcc accaactgcg 


22561
cgttggcggt tgcgtccatg gcgatgatcc gaggacaggt gcgcagcagg cgtagcatta 


22621
acgcatccac gcggcccagt tgctgcatcg ttggcgaata gagctggccc agcgtcgaca 


22681
taacctcgtc cagaacgagg acgtcgtagt tgttcagaag gttggggccc acgcgatgaa 


22741
ggctttccac ctggacgata agtcggtgga aggggcggtc gttcataatg taattggtgg 


22801
atgagaagta ggtgacaaag tcgaccaggc ctgactcagc gaaccgcgtc gccagggtct 


22861
gggtaaaact ccgacgacag gagacgacga gcacactcgt gtccggagag tggatcgctt 


22921
cccgcagcca gcggatcagc gcggtagttt ttcccgaccc cattggcgcg cggaccacag 


22981
tcacgcacct ggccgtcggg gcgctcgcgt tggggaaggt gacgggtccg tgctgctgcc 


23041
gctcgatcgt tgttttcggg tgaacccggg gcacccattc ggccaaatcc cccccgtaca 


23101
acatccgcgc tagcgatacg ctcgacgtgt actgttcgca ctcgtcgtcc ccaatgggac 


23161
gcccggcccc cagaggatct cccgactccg cgccccccac gaaaggcatg accggggcgc 


23221
ggacggcgtg gtgggtctgg tgtgtgcagg tggcgacgtt tgtggtctct gcggtctgcg 


23281
tcacggggct cctcgtcctg gcctctgtgt tccgggcacg gtttccctgc ttttacgcca 


23341
cggcgagctc ttatgccggg gtgaactcca cggccgaggt gcgcgggggt gtagccgtgc 


23401
ccctcaggtt ggacacgcag agccttgtgg gcacttatgt aatcacggcc gtgttgttgt 


23461
tggccgcggc cgtgtatgcc gtggtcggcg ccgtgacctc ccgctacgac cgcgccctgg 


23521
acgcgggccg ccgtctggct gcggcccgca tggccatgcc gcacgccacg ctgatcgccg 


23581
gaaacgtctg ctcttggttg ctgcagatca ccgtcctgtt gctggcccat cgcaccagcc 


23641
agctggccca cctggtttac gtcctgcact ttgcgtgtct ggtgtatttt gcggcccatt 


23701
tttgcaccag gggggtcctg agcgggacgt atctgcgtca ggtgcacggc ctgatggagc 


23761
cggccccgac tcatcatcgc gtcgttggcc cggctcgagc cgtgctgaca aacgccttgc 


23821
tgttgggcgt cttcctgtgc acggccgacg ccgcggtatc cctgaatacc atcgccgcgt 


23881
tcaactttaa tttttcggcc ccgggcatgc tcatatgcct gaccgtgctg ttcgcccttc 


23941
tcgtcgtatc gctgttgttg gtggtcgagg gggtgttgtg tcactacgtg cgcgtgttgg 


24001
tgggccccca cctgggggcc gtggccgcca cgggcatcgt cggcctggca tgcgagcact 


24061
attacaccaa cggctactac gttgtggaga cgcagtggcc gggggcccag acgggagtcc 


24121
gcgtcgccct cgccctggtc gccgcctttg ccctcggcat ggccgtgctc cgctgcaccc 


24181
gcgcctatct gtatcacagg cggcaccaca ccaaattttt tatgcgcatg cgcgacacgc 


24241
gacaccgcgc acattccgcc ctcaagcgcg tacgcagttc catgcgcgga tcgcgagacg 


24301
gccgccacag gcccgcaccc ggcagcccgc ccgggattcc cgaatatgcg gaagacccct 


24361
acgcgatctc atacggcggc cagctcgacc ggtacggaga ttccgacggg gagccgattt 


24421
acgacgaggt ggcggacgac caaaccgacg tattgtacgc caagatacaa cacccgcggc 


24481
acctgcccga cgacgagccc atctatgaca ccgttggggg gtacgacccc gagcccgccg 


24541
aggaccccgt gtacagcacc gtccgccgtt ggtagctgtt tggttccgtt ttaataaacc 


24601
gtttgtgttt aacccgaccg tggtgtatgt ctggtgtgtg gcgtccgatc ccgttactat 


24661
caccgttccc cccaaacccc ggcgattgtg ggttttttta aaaacgacac gcgtgcgacc 


24721
gtatacagaa cattgttgtt ttttattcgc tatcggacat ggggggtgga aactgggtgg 


24781
cggggcaggc gcctccgggg gttcgccggt gagtgtggcg cgagggggga tccgacgaac 


24841
gcaggcgctg tctccccggg gcccgcgtaa ccccgcgcat atccgggggc acgtagaaat 


24901
taccttcctc ttcggactcg atatccacga cgtcaaagtc gtgggcggtc agcgagacga 


24961
cctccccgtc gtcggtgatg aggacgttgt ttcggcagca gcagggccgg gtcccggaga 


25021
acgagaggcc catagctcgg cgagcgtgtc gtcgaacgcc aggcggctgc ttcgctgtat 


25081
ggccttatag atctccggat cgatgcggac gggggtaatg atcagggcga tcggaacggc 


25141
ctggttcggg agaatggacg ccttgctggg tcctgcggcc ccgagagccc cggcgccgtc 


25201
ctccaggcgg aacgttacgc cctcctccgc gctagtgcgg tgcctgccga taaacgtcac 


25261
cagatgcggg tggggggggc agtcggggaa gtggctgtcg agcacgtagc cctgcaccaa 


25321
gatctgctta aagttcgggt gacgggggtt cgcgaagacg ggctcgcggc gtaccagatc 


25381
cccggagctc caggacacgg gggagatggt gtggcgtccg aggtcggggg cgccaaacag 


25441
aagcacctcc gagacaacgc cgctatttaa ctccaccaag gcccgatccg cggcggagca 


25501
ccgccttttt tcgcccgagg cgtgggcctc tgaccaggcc tggtcttgcg tgacgagagc 


25561
ctcctccggg ccggggacgc gcccgggcgc gaagtatcgc acgctgggct tcgggatcga 


25621
ccggataaat gcccggaacg cctccgggga ccggtgtgcc atcaagtcct cgtacgcgga 


25681
ggccgtgggg tcgctggggt ccatggggtc gaaagcgtac ttggcccggc atttgacctc 


25741
gtaaaaggcc aggggggtct tggggactgg ggccaagtag ccgtgaatgt cccgaggaca 


25801
gacgagaata tccagggacg ccccgaccat ccccgtgtga ccgtccatga ggaccccaca 


25861
cgtatgcacg ttctcttcgg cgaggtcgcc gggttcgtgg aagataaagc gccgcgtgtc 


25921
ggcgccggcc tcgccgccgt cgtccgcgcg gcccacgcag tagcgaaaca gcaggcttcg 


25981
ggccgtcggc tcgttcaccc gcccgaacat caccgccgaa gactgtacat ccggccgcag 


26041
gctggcgttg tgcttcagcc actggggcga gaaacacgga ccctgggggc cccagcggag 


26101
ggtggatgcg gtcgtgaggc cccgccggag cagggcccat agctggcagt cggcctggtt 


26161
ttgcgtggcc gcctcgtaaa accccatgag gggccggggc gccacggcgt ccgcggcggc 


26221
cgggggcccg cggcgcgtca ggcgccatag gtgccggccg agtccgcggt ccaccatacc 


26281
cgcctcctcg aggaccacgg ccagggaaca cagataatcc aggcgggccc agaggggacc 


26341
gatggccaga ggggcgcgga cgccgcgcag caacccgcgc aggtggcgct cgaacgtctc 


26401
ggctagtata tgggagggca gcgcgttggg gatcaccgac gccgaccaca tagagtcaag 


26461
gtccggggag tcgggatcgg cgtccgggtc gcgggcgtgg gtgcccccag gagatagcgg 


26521
aatgtctggg gtcggaggcc ctgaggcgtc agaaagtgcc ggcgacgcgg cccggggctt 


26581
ttcgtctgcg gtgtcggtgg cgtgctgatc acgtgggggg ttaacgggcg aatgggagct 


26641
cgggtccaca gctgacgtcg tctggggtgg ggggggcagg ggacggaagg tggttgttag 


26701
cggaagactg ttagggcggg ggcgcttggg ggggctgtcg gggccacgag gggtgtcctc 


26761
ggccagggcc caggaacgct tagtcacggt gcgtcccggc ggacatgctg ggcctcccgt 


26821
ggactccatt tccgagacga cgtgggggga gcggtggttg agcgcgccgc cgggtgaacg 


26881
ctgattctca cgacagcgcg tgccgcgcgc acgggttggt gtgacacagg cgggacacca 


26941
gcaccaggag aggcttaagc tcgggaggca gcgccaccga cgacagtatc gccttgtgtg 


27001
tgtgctggta atttatacac cgatccgtaa acgcgcgccg aatcttggga ttgcggaggt 


27061
ggcgccggat gccctctggg acgtcatacg ccaggccgtg ggtgttggtc tcggccgagt 


27121
tgacaaacag ggctgggtgc agcacgcggc gataggcgag cagggccagg gcgaagtcca 


27181
gcgacagctg gttgttgaaa tactggtaac cgggaaaccg ggtcacgggt acgcccaggc 


27241
tcggggcgac gtacacgcta accaccaact ccagcagcgt ctggccaagg gcgtacaggt 


27301
caaccgctaa cccgacgtcg tgcttcaggc ggtggttggt aaattcggcc cgttcgttgt 


27361
taaggtattt caccaacagc tccgggggct ggttataccc gtgacccacc agggtgtgaa 


27421
agttggctgt ggttagggcg gtgggcatgc caaacatccg gggggacttg aggtccggct 


27481
cctggaggca aaactgcccc cgggcgatcg tggagttgga gttgagggtg acgaggctaa 


27541
agtcggcgag gacggcccgc cggagcgaga cggcgtccga ccgcagcatg acgaggatgt 


27601
tggcgcactt gatatccagg tggctgatcc cgcaggtggt gtttaaaaac acaacggcgc 


27661
gggccagctc cgtgaagcac tggtggaggg ccgtcgagac cgaggggttt gttgtgcgca 


27721
gggacgccag ttggccgata tacttaccga ggtccatgtc gtacgcgggg aacactatct 


27781
gtcgttgttg cagcgagaac ccgaggggcg cgatgaagcc gcggatgttg tgggtgcggc 


27841
cggcgcgtag agcgcactcc ccgaccaaca gggtcgcgat gagctcaacg gcaaaccact 


27901
ccttttcctt tatggtctta acggcaagct tatgttcgcg aatcagttgg acgtcgccgt 


27961
atcccccaga ccccccgaag cttcgggccc cggggatctc gagggtcgtg tagtgtaggg 


28021
cggggttgat ggcgaacacg gggctgcata gcttgcggat gcgcgtgagg gtaaggatgt 


28081
gcgaggggga cgaggggggt gcggttaacg ccgcctggga tctgcgcagg ggcgggcggt 


28141
tcagtttggc cgccgtaccg ggcgtctcgg gggacgcgcg gcgatgagac gagcggctca 


28201
ttcgccatcg ggatagtccc gcgcgaagcc gctcgcggag gccggatcgg tggcgggacc 


28261
cgtgggagga gcgggagccg gcggcgtcct ggagagaggg gccgctgggg cgcccggagg 


28321
ccccgtgtgg gttggagtgt atgtaggatg cgagccaatc cttgaaggac tgttggcgtg 


28381
caccttgggg gctgaggtta gctgccacat gaccagcagg tcgctgtctg cgggactcat 


28441
ccatccttcg gccaggtcgc cgtcttccca cagagaagcg ttggtcgctg cttcctcgag 


28501
ttgctcctcc tggtccgcaa gacgatcgtc cacggcgtcc aggcgctcac caagcgccgg 


28561
atcgaggtac cgtcggtgtg cggttagaaa gtcacgacgc gccgcttgct cctccacgcg 


28621
aattttaaca caggtcgcgc gctgtcgcat catctctaag cgcgcgcggg actttagccg 


28681
cgcctccaat tccaagtggg ccgcctttgc agccataaag gcgccaacaa accgaggatc 


28741
ttgggtgctg acgccctccc ggtgcagctg cagggtctgg tccttgtaaa tctcggctcg 


28801
gaggtgcgtc tcggccaggc gtcggcgcag ggccgcgtgg gcggcatctc ggtccattcc 


28861
gccaccctgc gggcgacccg gggggtgctc tgatagtctc gcgtgcccaa ggcccgtgat 


28921
cggggtactt cgccgccgcg acccgccacc cggtgtgcgc gatgtttggt cagcagctgg 


28981
cgtccgacgt ccagcagtac ctggagcgcc tcgagaaaca gaggcaactt aaggtgggcg 


29041
cggacgaggc gtcggcgggc ctcacaatgg gcggcgatgc cctacgagtg ccctttttag 


29101
atttcgcgac cgcgaccccc aagcgccacc agaccgtggt cccgggcgtc gggacgctcc 


29161
acgactgctg cgagcactcg ccgctcttct cggccgtggc gcggcggctg ctgtttaata 


29221
gcctggtgcc ggcgcaacta aaggggcgtg atttcggggg cgaccacacg gccaagctgg 


29281
aattcctggc ccccgagttg gtacgggcgg tggcgcgact gcggtttaag gagtgcgcgc 


29341
cggcggacgt ggtgcctcag cgtaacgcct actatagcgt tctgaacacg tttcaggccc 


29401
tccaccgctc cgaagccttt cgccagctgg tgcactttgt gcgggacttt gcccagctgc 


29461
ttaaaacctc cttccgggcc tccagcctca cggagaccac gggcccccca aaaaaacggg 


29521
ccaaggtgga cgtggccacc cacggccgga cgtacggcac gctggagctg ttccaaaaaa 


29581
tgatccttat gcacgccacc tactttctgg ccgccgtgct cctcggggac cacgcggagc 


29641
aggtcaacac gttcctgcgt ctcgtgtttg agatccccct gtttagcgac gcggccgtgc 


29701
gccacttccg ccagcgcgcc accgtgtttc tcgtcccccg gcgccacggc aagacctggt 


29761
ttctagtgcc cctcatcgcg ctgtcgctgg cctcctttcg ggggatcaag atcggctaca 


29821
cggcgcacat ccgcaaggcg accgagccgg tgtttgagga gatcgacgcc tgcctgcggg 


29881
gctggttcgg ttcggcccga gtggaccacg ttaaagggga aaccatctcc ttctcgtttc 


29941
cggacgggtc gcgcagtacc atcgtgtttg cctccagcca caacacaaac gtaagtcctc 


30001
ttttctttcg catggctctc ccaaggggcc ccgggtcgac ccgacccaca cccacccacc 


30061
cacccacata cacacacaac cagacgcggg aggaaagtct gccccgtggg cactgatttt 


30121
tattcgggat cgcttgagga ggcccgggca acggcccggg caacggtggg gcaactcgta 


30181
gcaaataggc gactgatgta cgaagagaag acacacaggc gccacccggc gctggtcggg 


30241
gggatgttgt ccgcgccgca ccgtcccccg acgacctctt gcagacggtc cgtgatgcaa 


30301
ggacggcggg gggcctgcag cagggtgacc gtatccacgg gatggccaaa gagaagcgga 


30361
cacaggctag catccccctg gaccgccagg gtacactggg ccatcttggc ccacagacac 


30421
ggggcgacgc agggacagga ctccgttacg acggaggaga gccacagtgc gttggcggaa 


30481
tcgatgtggg gcggcggggc gcaggactcg cagccccccg ggtggttggt gatcctggcc 


30541
aggagccatc ccagatggcg ggccctgctt cccggtggac agagcgaccc caggtcgctg 


30601
tccatggccc agcagtagat ctggccgctg gggaggtgcc accaggcccc cgggcccaag 


30661
gcgcaacacg cgcccggctc cgggggggtc ttcgcgggga ccagatacgc gccatccagc 


30721
tcgccgacca ctggctcctc cgcgagctgt tcggtggttg ggtcgggggt ttcctccggg 


30781
ggggtggccg cccgtatgcg ggcgaacgtg agggtgcaca ggagcggggt cagggggtgc 


30841
gtcacgctcc ggaggtggac gatcgcgcag tagcggcgct cgcggttaaa gaaaaagagg 


30901
gcaaagaagg tgttcggggg caaccgcagc gccttggggc gcgtcagata cagaaaaatc 


30961
tcgcagaaga gggcgcgccc ggggtctggg ttaggaaggg ccacctgaca cagaggctcg 


31021
gtgaggaccg ttagacaccg aaagatcttg agccgctcgt ccgcccgaac gacgcgccac 


31081
acaaagacgg agttgacaat gcgcgcgata gagtcgacgt ccgtccccag gtcgtcgact 


31141
ctgtcgcgcg tgccgcgagc tccggcccgg gaatccggcc ggggcaaggt ccccggggga 


31201
ccaggcggcg ccaggggccg ccggggtccc agctgcgcca tgccgggggc ggggggaggg 


31261
caaaccccag aggcgggggc caacggcgcg gggaggagtg ggtgggcgag gtggccgggg 


31321
gaaggcgccc gctagcgaga acggccgttc ccggacgaca ccttgcgaca aaacctaagg 


31381
acagcggccc gcgcgacggg gtccgagagg ctaaggtagg ccgcgatgtt aatggtgaac 


31441
gcaaagccgc cgggaaagac aactatgcca cagaggcggc gattaaaccc caggcagagg 


31501
taggcgtagc tttccccggg caggtattgc tcgcagaccc tgcgtggggc tgtggagggg 


31561
acggcctcca tgaagcgaca tttactctgc tcgcgtttac tgacgtcacc atccatcgcc 


31621
acggcgattg gacgattgtt aagccgcagc gtgtctccgc ttgtgctgta gtagtcaaaa 


31681
acgtaatggc cgtcggagtc ggcaaagcgg gccgggaggt cgtcgccgag cgggacgacc 


31741
cgccgccccc gaccgccccg tccccccagg tgtgccagga cggccagggc atacgcggtg 


31801
tgaaaaaagg cgtcgggggc ggtcccctcg acggcgcgca tcaggttctc gaggagaatg 


31861
gggaagcgcc tggtcacctc ccccagccac gcgcgttggt cggggccaaa gtcatagcgc 


31921
aggcgctgtg agattcgagg gccgccctga agcgcggccc ggatggcctg gcccagggcc 


31981
cggaggcacg ccagatgtat gcgcgcagta aaggcgacct cggcggcgat gtcaaagggc 


32041
ggcaggacgg ggcgcgggtg gcgcaggggc acctcgagcg cgggaaagcg gagcagcagc 


32101
tccgcctgcc cagcgggaga cagctggtgg gggcgcacga cgcgttctgc ggcgcaggcc 


32161
tcggtcaggg ccgtggccag cgccgaggac agcagcggag ggcgggcgcg tcgcccgccc 


32221
cacgccacgg agttctcgta ggagacgacg acgaagcgct gcttggttcc gtagtggtgg 


32281
cgcaggacca cggagataga acgacggctc cacagccagt ccggccggtc gccgccggcc 


32341
agggcttccc atccgcgatc caaccactcg accagcgacc gcggctttgc ggtaccaggg 


32401
gtcagggtta gaacgtcgtt caggatgtcc tcgcccccgg gcccgtgggg cactggggcc 


32461
acaaagcggc ccccgcctgg gggctccaga cccgccaaca ccgcatctgc gtcagccgcc 


32521
cccatggcgc ccccgctgac ggcctggtga accagggcgc cctggcggag ccccgatgca 


32581
acgccacagg ccgcacgccc ggtccgagcg cggaccgggt ggcggcgggt gacgtcctgc 


32641
actgcccgct gaaccaacgc gaggatctcc tcgttctcct gcgcgatgga cacgtcctgg 


32701
gccgcggtcg tgtcgccgcc gggggccgtc agctgctcct ccggggagat gggggggtcg 


32761
gacgccccga cgatgggcgg gtctgcgggc gcccccgcgt ggggccgggc caagggctgc 


32821
ggacgcgggg acgcgctttc ccccagaccc atggacaggt gggccgcagc ctccttcgcg 


32881
gccggcgggg cggcggcgcc aagcagagcg acgtagcggc acaaatgccg acagacgcgc 


32941
atgatgcgcg tgctgtcggc cgcgtagcgc gtgttggggg ggacgagctc gtcgtaacta 


33001
aacagaatca cgcgggcaca gctcgccccc gagccccacg caaggcgcag cgccgccacg 


33061
gcgtacgggt catagacgcc ctgtgcgtca cacaccacgg gcaaggagac gaacaacccc 


33121
ccggcgctgg acgcacgcgg aaggaggcca gggtgtgccg gcacgacggg ggccagaagc 


33181
tcccccaccg catccgcggg cacgtaggcg gcaaacgccg tgcaccacgg ggtacagtcg 


33241
ccggtggcat gagcccgagt ctggatttcg acctggaagt ttgcggccgt cccgagtccg 


33301
gggcggccgc gcatcagggc ggccagaggg attcccgcgg ccgccaggca ctcgctggat 


33361
atgatgacgt gaaccaaaga cgagggccga cccgggacgt ggccgagatc gtactggacc 


33421
tcgttggcca agtgcgcgtt catggttcgg gggtgggtgt gggtgtgtag gcgatgcggg 


33481
tcccccgagt ccgcgggaag ggcgcgggtt tggcgcgcgt atgcgtattc gccaacggag 


33541
gcgtgcgtgc ttatgcgcgg cgcgtttctt ctgtctccag ggaatccgag gccaggactt 


33601
taacctgctc tttgtcgacg aggccaactt tattcgcccg gatgcggtcc agacgattat 


33661
gggctttctc aaccaggcca actgcaagat tatcttcgtg tcgtccacca acaccgggaa 


33721
ggccagtacg agctttttgt acaacctccg cggggccgcc gacgagcttc tcaacgtggt 


33781
gacctatata tgcgatgatc acatgccgcg ggtggtgacg cacacaaacg ccacggcctg 


33841
ttcttgttat atcctcaaca agcccgtttt catcacgatg gacggggcgg ttcgccggac 


33901
cgccgatttg tttctggccg attccttcat gcaggagatc atcgggggcc aggccaggga 


33961
gaccggcgac gaccggcccg ttctgaccaa gtctgcgggg gagcggtttc tgttgtaccg 


34021
cccctcgacc accaccaaca gcggcctcat ggcccccgat ttgtacgtgt acgtggatcc 


34081
cgcgttcacg gccaacaccc gagcctccgg gaccggcgtc gctgtcgtcg ggcggtaccg 


34141
cgacgattat atcatcttcg ccctggagca cttttttctc cgcgcgctca cgggctcggc 


34201
ccccgccgac atcgcccgct gcgtcgtcca cagtctgacg caggtcctgg ccctgcatcc 


34261
cggggcgttt cgcggcgtcc gggtggcggt cgagggaaat agcagccagg actcggccgt 


34321
cgccatcgcc acgcacgtgc acacagagat gcaccgccta ctggcctcgg agggggccga 


34381
cgcgggctcg ggccccgagc ttctcttcta ccactgcgag cctcccggga gcgcggtgct 


34441
gtaccccttt ttcctgctca acaaacagaa gacgcccgcc tttgaacact ttattaaaaa 


34501
gtttaactcc gggggcgtca tggcctccca ggagatcgtt tccgcgacgg tgcgcctgca 


34561
gaccgacccg gtcgagtatc tgctcgagca gctgaataac ctcaccgaaa ccgtctcccc 


34621
caacacggac gtccgtacgt attccggaaa acggaacggc gcctcggatg accttatggt 


34681
cgccgtcatt atggccatct accttgcggc ccaggccgga cctccgcaca cattcgctcc 


34741
catcacacgc gtttcgtgag cgcccaataa acacacccag gtatgctacg cacgaccacg 


34801
gtgtcgcctg ttaagggggg gggaaggggg tgttggcggg aagcgtggga acacggggga 


34861
ttctctcacg accggcacca gtaccacccc cctgtgaaca cagaaacccc aacccaaatc 


34921
ccataaacat acgacacaca ggcatatttt ggaatttctt aggtttttat ttatttaggt 


34981
atgctggggt ttctccctgg atgcccaccc ccaccccccc ccgtgggtct agccgggcct 


35041
tagggatagc gtataacggg ggccatgtct ccggaccgca caacggccgc gccgtcaaag 


35101
gtgcacaccc gaaccacggg agccagggcc aaggtgtctc ctagttggcc cgcgtgggtc 


35161
agccaggcga cgagcgcctc gtagagcggc agccttcgct ctccatcctg catcagggcc 


35221
ggggcttcgg ggtgaatgag ctgggcggcc tcccgcgtga cactctgcat ctgcaggaga 


35281
gcgttcacgt acccgtcctg ggcacttagc gcaaagagcc gggggattag cgtaaggatg 


35341
atggtggttc cctccgtgat cgagtaaacc atgttaagga ccagcgatcg cagctcggcg 


35401
tttacggggc cgagttgttg gacgtccgcc agcagcgaga ggcgactccc gttgtagtac 


35461
agcacgttga ggtctggcag ccctccgggg tttctggggc tggggttcag gtcccggatg 


35521
cccctggcca cgagccgcgc cacgatttcg cgcgccaggg gcgatggaag cggaacggga 


35581
aaccgcaacg tgaggtccag cgaatccagg cgcacgtccg tcgcttggcc ctcgaacacg 


35641
ggcgggacga ggctgatggg gtccccgtta cagagatcta cgggggaggt gttgcgaagg 


35701
ttaacggtgc cggcgtgggt gaggcccacg tccagggggc aggcgacgat tcgcgtggga 


35761
agcacccggg tgatgaccgc ggggaagcgc cttcggtacg ccagcaacag ccccaacgtg 


35821
tcgggactga cgcctccgga gacgaaggat tcgtgcgcca cgtcggccag cgtcagttgc 


35881
cggcggatgg tcggcaggaa taccacccgc ccttcgcagc gctgcagcgc cgccgcatcg 


35941
gggcgcgaga tgcccgaggg tatcgcgatg tcagtttcaa agccgtccgc cagcatggcg 


36001
ccgatccacg cggcagggag tgcagtggtg gttcgggtgg cgggaggagc gcggtggggg 


36061
tcagcggcgt agcagagacg ggcgaccaac ctcgcatagg acggggggtg ggtcttaggg 


36121
ggttgggagg cgacagggac cccagagcat gcgcggggag gtctgtcggg cccagacgca 


36181
ccgagagcga atccgtccat ggagtcccgg cctgggtttt atggggcccg gccctcggaa 


36241
tcgcggcttg tcggcgggga caaagggggc ggggctaggg ggcttgcgga aacagaagac 


36301
gtgtgggata aaagaatcgc actaccccaa ggaagggcgg ggcggtttat tacagagcca 


36361
gtcccttgag cggggatgcg tcatagacga gatactgcgc gaagtgggtc tcccgcgcgt 


36421
gggcttcccc gttgcgggcg ctgcggagga gggcggggtc gctggcgcag gtgagcgggt 


36481
aggcctcctg aaacaggcca cacgggtcct ccacgagttc gcggcacccc ggggggcgct 


36541
taaactgtac gtcgctggcg gcggtggccg tggacaccgc cgaacccgtc tccacgatca 


36601
ggcgctccag gcagcgatgt ttggcggcga tgtcggccga cgtaaagaac ttaaagcagg 


36661
ggctgagcac cggcgaggcc ccgttgaggt ggtaggcccc gttatagagc aggtccccgt 


36721
acgaaaatcg ctgcgacgcc cacgggttgg ccgtggccgc gaaggcccgg gacgggtcgc 


36781
tctggccgtg gtcgtacatg agggcggtga catccccctc cttgtccccc gcgtaaacgc 


36841
ccccggcggc gcgtccccgg gggttgcagg gccggcggaa gtagttgacg tcggtcgaca 


36901
cgggggtggc gataaactca cacacggcgt cctggccgtg gtccatccct gcgcgccgcg 


36961
gcacctgggc gcacccgaac acggggacgg gctgggccgg ccccaggcgg tttcccgcca 


37021
cgaccgcgtt ccgcaggtac acggctgccg cgttgtccag tagaggggga gccccgcggc 


37081
ccaggtaaaa gttttgggga aggttgccca tgtcggtgac ggggttgcgg acggttgccg 


37141
tggccacgac ggcggtgtag cccacgccca ggtccacgtt cccgcgcggc tgggtgagcg 


37201
tgaagtttac ccccccgcca gtttcatgcc gggccacctg gagctggccc aggaagtacg 


37261
cctccgacgc gcgctccgag aacagcacgt tctcagtcac aaagcggtcc tgtcggacga 


37321
cggtgaaccc aaacccggga tggaggcccg tcttgagctg atgatgcaag gccacgggac 


37381
tgatcttgaa gtaccccgcc atgagcgcgt aggtcagcgc gttctccccg gccgcgctct 


37441
cgcggacgtg ctgcacgacg ggctgtcgga tcgacgaaaa gtagttggcc cccagagccg 


37501
gggggaccag ggggacctgc cgcgacaggt cgcgcagggc cggggggaaa ttgggcgcgt 


37561
tcgccacgtg gtcggccccg gcgaacagcg cgtggacggg gagggggtaa aaatagtcgc 


37621
cattttggat ggtatggtcc agatgctggg gggccatcag caggattccg gcgtgcaacg 


37681
ccccgtcgaa tatgcgcatg ttggtggtgg acgcggtgtt ggcgcccgcg tcgggcgccg 


37741
ccgagcagag cagcgccgtt gtgcgttcgg ccatgttgtg ggccagcacc tgcagcgtga 


37801
gcatggcggg cccgtccact accacgcgcc cgttgtgaaa catggcgttg accgtgttgg 


37861
ccaccagatt ggccgggtgc agggggtgcg cggggtccgt cacggggtcg ctggggcaat 


37921
cctcgccggg ggtgatctcc gggaccacca tgttctgcag ggtggcgtat acgcggtcga 


37981
agcgaacccc cgcggtgcag cagcggcccc gcgagaaggc gggcaccatc acgtagtagt 


38041
aaatcttgtg gtgcacggtc cagtccgccc cccggtgcgg ccggtcgtcc gcggcgtccg 


38101
cggctcgggc ctgggtgttg tgcagcagct ggccgtcgtt gcggttgaag tccgcggtcg 


38161
ccacgttaca cgccgctgcg tacacggggt cgtggccccc cgcgctaacc cggcagtcgc 


38221
gatggcggtc cagggccgcg cgccgcatca gggcgtcgca gtcccacacg aggggtggca 


38281
gcagcgccgg gtctcgcatt aggtgattca gttcggcttg cgcctgcccg cccagttccg 


38341
ggccggtcag ggtaaagtca tcaaccagct gggccagggc ctcgacgtgc gccaccaggt 


38401
cccggtacac ggccatgcac tcctcgggaa ggtctccccc gaggtaggtc acgacgtacg 


38461
agaccagcga gtagtcgttc acgaacgccg cgcaccgcgt gttgttccag tagctggtga 


38521
tgcactggac cacgagccgg gccagggcgc agaagacgtg ctcgctgccg tgtatggcgg 


38581
cctgcagcag gtaaaacacc gccgggtagt tgcggtcttc gaacgccccg cgaacggcgg 


38641
cgatggtggc gggggccatg gcgtggcgtc ccacccccag ctccaggccc cgggcgtccc 


38701
ggaacgccgc cggacatagc gccaggggca agttgccgtt caccacgcgc caggtggcct 


38761
ggatctcccc cgggccggcc gggggaacgt ccccccccgg cagctccacg tcggccaccc 


38821
ccacgaagaa gtcgaacgcg gggtgcagct caagagccag gttggcgttg tcgggctgca 


38881
taaactgctc cggggtcatc tggccttccg cgacccatcg gacccgcccg tgggccaggc 


38941
gctgccccca ggcgttcaaa aacagctgct gcatgtctgc ggcggggccg gccggggccg 


39001
ccacgtacgc cccgtacgga ttggcggctt cgacggggtc gcggttaagg cccccgaccg 


39061
ccgcgtcaac gttcatcagc gaagggtggc acacggtccc gatcgcgtgt tccagagaca 


39121
ggcgcagcac ctggcggtcc ttcccccaaa aaaacagctg gcggggcggg aaggcgcggg 


39181
gatccgggtg gccgggggcg gggactaggt ccccggcgtg cgcggcaaac cgttccatga 


39241
ccggattgaa caggcccagg ggcaggacga acgtcaggtc catggcgccc accagggggt 


39301
agggaacgtt ggtggcggcg tagatgcgct tctccagggc ctccaaaaag atcagcttct 


39361
cgccgatgga caccagatcc gcgcgcacgc gcgtcgtctg gggggcgctc tcgagctcgt 


39421
ccagcgtctg ccggttcagg tcgagctgct cctcctgcat ctccagcagg tggcggccca 


39481
cgtcgtccag acttcgcacg gccttgccca tcacgagcgc cgtgaccagg ttggccccgt 


39541
tcaggaccat ctcgccgtac gtcaccggca cgtcggcttc ggtgtcctcc actttcagga 


39601
aggactgcag gaggcgctgt ttgatcgggg cggtggtgac gagcaccccg tcgaccggac 


39661
gcccgcgcgt gtcggcatgc gtcagacggg gcacggccac ggagggctgc gtggccgtgg 


39721
tgaggtccac gagccaggcc tcgacggcct cccggcggtg gcccgccttg cccaggaaaa 


39781
agctcgtctc gcagaagctt cgctttagct cggcgaccag ggtcgcccgg gccaccctgg 


39841
tggccaggcg gccgttgtcc aggtatcgtt gcatcggcaa caacaaagcc aggggcggcg 


39901
ccttttccag cagcacgtgc agcatctggt cggccgtgcc gcgctcaaac gccccgagga 


39961
cggcctggac gttgcgagcg agctgttgga tggcgcgcaa ctggcgatgc gcgctgatac 


40021
ccgtcccgtc cagggcctcc cccgtgagca gggcgatggc ctcggtggcc aggctgaagg 


40081
cggcgttcag ggcccggcgg tcgataatct tggtcatgta attgtgtgtg ggttgctcga 


40141
tggggtgcgg gccgtcgcgg gcaatcagcg gctggtggac ctcgaactgt acgcgcccct 


40201
cgttcatgta ggccagctcc ggaaacttgg tacacacgca cgccaccgac aacccgagct 


40261
ccagaaagcg cacgagcgac agggtgttgc aatacgaccc caacagggcg tcgaactcga 


40321
cgtcatacag gctgtttgca tcggagcgca cgcgggaaaa aaaatcgaac aggcgtcgat 


40381
gcgacgccac ctcgatcgtg ctaaggaggg acccggtcgg caccatggcc gcggcatacc 


40441
ggtatcccgg agggtcgcgg ttgggagcgg ccatggggtc gcgtggagat cggctgtctc 


40501
tagcgatatt ggcccgggga ggctaagatc caccccaacg cccggccacc cgtgtacgtg 


40561
cccgacggcc caaggtccac cgaaagacac gacgggcccg gacccaaaaa ggcgggggat 


40621
gctgtgtgag aggccgggtg tcggtcgggg gggaaaggca ccgggagaag gctgcggcct 


40681
cgttccagga gaacccagtg tccccaacag acccggggac gtgggatccc aggccttata 


40741
tacccccccc gccccacccc cgttagaacg cgacgggtgc attcaagatg gccctggtcc 


40801
aaaagcgtgc caggaagaaa ttggcagagg cggcaaagct gtccgccgcc gccacccaca 


40861
tcgaggcccc ggccgcgcag gctatcccca gggcccgtgt gcgcagggga tcggtgggcg 


40921
gcagcatttg gttggtggcg ataaagtgga aaagcccgtc cggactgaag gtctcgtggg 


40981
cggcggcgaa caaggcacac agggccgtgc ctcccaaaaa cacggacatc ccccaaaaca 


41041
cgggcgccga caacggcaga cgatccctct tgatgttaac gtacaggagg agcgcccgca 


41101
ccgcccacgt aacgtagtag ccgacgatgg cggccaggat acaggccggc gccaccaccc 


41161
ttccggtcag cccgtaatac atgcccgctg ccaccatctc caacggcttc aggaccaaaa 


41221
acgaccaaag gaacagaatc acgcgctttg aaaagaccgg ctgggtatgg ggcggaagac 


41281
gcgagtatgc cgaactgaca aaaaagtcag aggtgccgta cgaggacaat gaaaactgtt 


41341
cctccagtgg cagttctccc tcctcccccc caaaggcggc ctcgtcgacc agatctcgat 


41401
ccaccagagg aaggtcatcc cgcatggtca tggggtgtgc ggtggaggtg gggagaccga 


41461
aaccgcaaag ggtcgcttac gtcagcagga tcccgagatc aaagacaccc gggttcttgc 


41521
acaaacacca cccgggttgc atccgcggag gcgagtgttt tgataaggcc gttccgcgcc 


41581
ttgatataac ctttgatgtt gaccacaaaa cccggaattt acgcctacgc cccaatgccc 


41641
acgcaagatg aggtaggtaa cccccccccc gtgggtgtga cgttgcgttt agttcattgg 


41701
aggccaaggg gaaaatgggg tggggaggaa acggaaaacc cagtaggccg tgttgggaac 


41761
acgcccgggg ttgtcctcaa aaggcagggt ccatactacg gaagccgtcg ttgtattcga 


41821
gacctgcctg tgcgacgcac gtcggggttg cctgtgtccg gttcggcccc accgcgtgcg 


41881
gcacgcacga ggacgagtcc gcgtgcttta ttggcgttcc aagcgttgcc ctccagtttc 


41941
tgttgtcggt gttcccccat acccacgccc acatccaccg tagggggcct ctgggccgtg 


42001
tcacgtcgcc gcccgcgatg gagcttagct acgccaccac catgcactac cgggacgttg 


42061
tgttttacgt cacaacggac cgaaaccggg cctactttgt gtgcgggggg tgtgtttatt 


42121
ccgtggggcg gccgtgtgcc tcgcagcccg gggagattgc caagtttggt ctggtcgttc 


42181
gagggacagg cccagacgac cgcgtggtcg ccaactatgt acgaagcgag ctccgacaac 


42241
gcggcctgca ggacgtgcgt cccattgggg aggacgaggt gtttctggac agcgtgtgtc 


42301
ttctaaaccc gaacgtgagc tccgagctgg atgtgattaa cacgaacgac gtggaagtgc 


42361
tggacgaatg tctggccgag tactgcacct cgctgcgaac cagcccgggt gtgctaatat 


42421
ccgggctgcg cgtgcgggcg caggacagaa tcatcgagtt gtttgaacac ccaacgatag 


42481
tcaacgtttc ctcgcacttt gtgtataccc cgtccccata cgtgttcgcc ctggcccagg 


42541
cgcacctccc ccggctcccg agctcgctgg aggccctggt gagcggcctg tttgacggca 


42601
tccccgcccc acgccagcca cttgacgccc acaacccgcg cacggatgtg gttatcacgg 


42661
gccgccgcgc cccacgaccc atcgccgggt cgggggcggg gtcggggggc gcgggcgcca 


42721
agcgggccac cgtcagcgag ttcgtgcaag tcaaacacat tgaccgcgtg ggccccgctg 


42781
gcgtttcgcc ggcgcctccg ccaaacaaca ccgactcaag ttccctggtg cccggggccc 


42841
aggattccgc cccgcccggc cccacgctaa gggagctgtg gtgggtgttt tatgccgcag 


42901
accgggcgct ggaggagccc cgcgccgact ctggcctcac ccgcgaggag gtacgtgccg 


42961
tacgtgggtt ccgggagcag gcgtggaaac tgtttggctc cgcgggggcc ccgcgggcgt 


43021
ttatcggggc cgcgttgggc ctgagccccc tccaaaagct agccgtttac tactatatca 


43081
tccaccgaga gaggcgcctg tcccccttcc ccgcgctagt ccggctcgta ggccggtaca 


43141
cacagcgcca cggcctgtac gtccctcggc ccgacgaccc agtcttggcc gatgccatca 


43201
acgggctgtt tcgcgacgcg ctggcggccg gaaccacagc cgagcagctc ctcatgttcg 


43261
accttctccc cccaaaggac gtgccggtgg gaagcgacgt gcaggccgac agcaccgctc 


43321
tgctgcgctt tatagaatcg caacgtctcg ccgtccccgg gggggtgatc tcccccgagc 


43381
acgtcgcgta ccttggtgcg ttcctgagcg tgctgtacgc tggccgcggg cgcatgtccg 


43441
cagccacgca caccgcgcgg ctgacagggg tgacctccct ggtgctagcg gtgggtgacg 


43501
tggaccgtct ttccgcgttt gaccgcggag cggcgggcgc ggccagccgc acgcgggccg 


43561
ccgggtacct ggatgtgctt cttaccgttc gtctcgctcg ctcccaacac ggacagtctg 


43621
tgtaacagac cccaataaac gtatgtcgct accacaccct tgtgtgtcaa tggacgcctc 


43681
tccggggggg aagggaaaac aaagaggggc tgggggagcg gcaccaccgg ggcctgaaca 


43741
aacaaaccac agacacggtt acagtttatt cggtcgggcg gagaaacggc cgaagccacg 


43801
ccccctttat tcgcgtctcc aaaaaaacgg gacacttgtc cggagaacct ttaggatgcc 


43861
agccagggcg gcggtaatca taaccacgcc cagcgcagag gcggccagaa acccgggcgc 


43921
aattgcggcc acgggctgcg tgtcaaaggc tagcaaatga atgacggttc cgtttggaaa 


43981
tagcaacaag gccgtggacg gcacgtcgct cgaaaacacg cttggggcgc cctccgtcgg 


44041
cccggcggcg atttgctgct gtgtgttgtc cgtatccacc agcaacacag acatgacctc 


44101
cccggccggg gtgtagcgca taaacacggc ccccacgagc cccaggtcgc gctggttttg 


44161
ggtgcgcacc agccgcttgg actcgatatc ccgggtggag ccttcgcatg tcgcggtgag 


44221
gtaggttagg aacagtgggc gtcggacgtc gacgccggtg agcttgtagc cgatcccccg 


44281
gggcagaggg gagtgggtga cgacgtagct ggcgttgtgg gtgatgggta ccaggatccg 


44341
tggctcgacg ttggcagact gccccccgca ccgatgtgag gcctcaggga cgaaggcgcg 


44401
gatcagggcg ttgtagtgtg cccagcgcgt cagggtcgag gcgaggccgt gggtctgctg 


44461
ggccaggact tcgaccgggg tctcggatcg ggtggcttga gccagcgcgt ccaggataaa 


44521
cacgctctcg tctagatcaa agcgcaggga ggccgcgcat ggcgaaaagt ggtccggaag 


44581
ccaaaagagg gttttctggt ggtcggcccg ggccagcgcg gtccggaggt cggcgttggt 


44641
cgctgcggcg acgtcggacg tacacagggc cgaggctatc agaaggctcc ggcgggcgcg 


44701
ttcccgctgc accgccgagg ggacgcccgc caagaacggc tgccggagga cagccgaggc 


44761
gtaaaatagc gcccggtgga cgaccggggt ggtcagcacg cggcccccta gaaactcggc 


44821
atacagggcg tcgatgagat gggctgcgct gggcgccact gcgtcgtacg ccgaggggct 


44881
atccagcacg aaggccagct gatagcccag cgcgtgtaat gccaagctct gttcgcgctc 


44941
cagaatctcg gccaccaggt gctggagccg agcctctagc tgcaggcggg ccgtgggatc 


45001
caagactgac acattaaaaa acacagaatc cgcggcacag cccgcggccc cgcgggcggc 


45061
caacccggca agcgcgcgcg agtgggccaa aaagcctagc aggtcggaga ggcagaccgc 


45121
gccgtttgcg tgggcggcgt tcacgaaagc aaaacccgac gtcgcgagca gccccgttag 


45181
gcgccagaag agagggggac gcgggccctg ctcggcgccc gcgtcccccg agaaaaactc 


45241
cgcgtatgcc cgcgacagga actgggcgta gttcgtgccc tcctccgggt agccgcccac 


45301
gcggcggagg gcgtccagcg cggagccgtt gtcggcccgc gtcagggacc ctaggacaaa 


45361
gacccgatac cgggggccgc ccgggggccc gggaagagcc cccggggggt tttcgtccgc 


45421
ggggtccccg acccgatcta gcgtctggcc cgcggggacc accatcactt ccaccggagg 


45481
gctgtcgtgc atggatatca cgagccccat gaattcccgc ccgtagcgcg cgcgcaccag 


45541
cgcggcatcg cacccgagca ccagctcccc cgtcgtccag atgcccacgg gccacgtcga 


45601
ggccgacggg gagaaataca cgtacctacc tggggatctc aacaggcccc gggtggccaa 


45661
ccaggtcgtg gacgcgttgt gcaggtgcgt gatgtccagc tccgtcgtcg ggtgccgccg 


45721
ggccccaacc ggcggtcggg ggggcggtgt atcacgcggc ccgctcgggt ggctcgccgt 


45781
cgccacgttg tctccccgcg ggaacgtcag ggcctcgggg tcagggacgg ccgaaaacgt 


45841
tacccaggcc cgggaacgca gcaacacgga ggcggttgga ttgtgcaaga gacccttaag 


45901
gggggcgacc gcggggggag gctgggcggt cggctcgacc gtgatggggg cgggcaggct 


45961
cgcgttcggg ggccggccga gcaggtaggt cttcgagatg taaagcagct ggccggggtc 


46021
ccgcggaaac tcggccgtgg tgaccaatac aaaacaaaag cgctcctcgt accagcgaag 


46081
aaggggcaga gatgccgtag tcaggtttag ttcgtccggc ggcgccagaa atccgcgcgg 


46141
tggtttttgg gggtcggggg tgtttggcag ccacagacgc ccggtgttcg tgtcgcgcca 


46201
gtacatgcgg tccatgccca ggccatccaa aaaccatggg tctgtctgct cagtccagtc 


46261
gtggacctga ccccacgcaa cgcccaaaag aataaccccc acgaaccata aaccattccc 


46321
catgggggac cccgtcccta acccacgggg cccgtggcta tggcagggct tgccgccccg 


46381
acgttggctg cgagccctgg gccttcaccc gaacttgggg gttggggtgg ggaaaaggaa 


46441
gaaacgcggg cgtattggcc ccaatggggt ctcggtgggg tatcgacaga gtgccagccc 


46501
tgggaccgaa ccccgcgttt atgaacaaac gacccaacac ccgtgcgttt tattctgtct 


46561
ttttatttcc gtcatagcgc gggttccttc cggtattgtc tccttccgtg tttcagttag 


46621
cctcccccat ctcccgggca aacgtgcgcg ccaggtcgca gatcgtcggt atggagcctg 


46681
gggtggtgac gtgggtctgg accatcccgg aggtaagttg cagcagggcg tcccggcagc 


46741
cggcgggcga ttggtcgtaa tccaggataa agacgtgcat gggacggagg cgtttggcca 


46801
agacgtccaa ggcccaggca aacacgttat acaggtcgcc gttgggggcc agcaactcgg 


46861
gggcccgaaa cagggtaaat aacgtgtccc cgatatgggg tcgtgggccc gcgttgctct 


46921
ggggctcggc accctggggc ggcacggccg tccccgaaag ctgtccccaa tcctcccgcc 


46981
acgacccgcc gccctgcaga taccgcaccg tattggcaag cagcccgtaa acgcggcgaa 


47041
tcgcggccaa catagccagg tcaagccgct cgccggggcg ctggcgtttg gccaggcggt 


47101
cgatgtgtct gtcctccgga agggccccca acacgatgtt tgtgccgggc aaggtcggcg 


47161
ggatgagggc cacgaacgcc agcacggcct ggggggtcat gctgcccata aggtatcgcg 


47221
cggccgggta gcacaggagg gcggcgatgg gatggcggtc gaagatgagg gtgagggccg 


47281
ggggcggggc atgtgagctc ccagcctccc ccccgatatg aggagccaga acggcgtcgg 


47341
tcacggcata aggcatgccc attgttatct gggcgcttgt cattaccacc gccgcgtccc 


47401
cggccgatat ctcaccctgg tcgaggcggt gttgtgtggt gtagatgttc gcgattgtct 


47461
cggaagcccc cagcacctgc cagtaagtca tcggctcggg tacgtagacg atatcgtcgc 


47521
gcgaacccag ggccaccagc agttgcgtgg tggtggtttt ccccatcccg tgaggaccgt 


47581
ctatataaac ccgcagtagc gtgggcattt tctgctccag gcggacttcc gtggcttctt 


47641
gctgccggcg agggcgcaac gccgtacgtc ggttgctatg gccgcgagaa cgcgcagcct 


47701
ggtcgaacgc agacgcgtgt tgatggcagg ggtacgaagc catacgcgct tctacaaggc 


47761
gcttgccaaa gaggtgcggg agtttcacgc caccaagatc tgcggcacgc tgttgacgct 


47821
gttaagcggg tcgctgcagg gtcgctcggt gttcgaggcc acacgcgtca ccttaatatg 


47881
cgaagtggac ctgggaccgc gccgccccga ctgcatctgc gtgttcgaat tcgccaatga 


47941
caagacgctg ggcggggttt gtgtcatcat agaactaaag acatgcaaat atatttcttc 


48001
cggggacacc gccagcaaac gcgagcaacg ggccacgggg atgaagcagc tgcgccactc 


48061
cctgaagctc ctgcagtccc tcgcgcctcc gggtgacaag atagtgtacc tgtgccccgt 


48121
cctggtgttt gtcgcccaac ggacgctccg cgtcagccgc gtgacccggc tcgtcccgca 


48181
gaaggtctcc ggtaatatca ccgcagtcgt gcggatgctc cagagcctgt ccacgtatac 


48241
ggtccccatg gagcctagga cccagcgagc ccgtcgccgc cgcggcggcg ctgcccgggg 


48301
gtctgcgagc agaccgaaaa ggtcacactc tggggcgcgc gacccgcccg agccagcggc 


48361
ccgccaggta ccacccgccg accaaacccc cgcctccacg gagggcgggg gggtgcttaa 


48421
gaggatcgcg gcgctcttct gcgtgcccgt ggccaccaag accaaacccc gagctgcctc 


48481
cgaatgagag tgtttcgttc cttccccctc cccccgcgtc agacaaaccc taaccaccgc 


48541
ttaagcggcc cccgcgaggt ccgaagactc atttggatcc ggcgggagcc acctgacaac 


48601
agcccccggg tttccccacg ccagacgccg gtccgctgtg ccatcgctcc ccttcatccc 


48661
acccccatct tgtccccaaa taaaacaagg tctggtagtt aggacaacga ccgcagttct 


48721
cgtgtgttat tgtcgctctc cgcctctcgc agatggaccc gtattgccca tttgacgctc 


48781
tggacgtctg ggaacacagg cgcttcatag tcgccgattc ccgaaacttc atcacccccg 


48841
agttcccccg ggacttttgg atgtcgcccg tctttaacct cccccgggag acggcggcgg 


48901
agcaggtggt cgtcctgcag gcccagcgca cagcggctgc cgctgccctg gagaacgccg 


48961
ccatgcaggc ggccgagctc cccgtcgata tcgagcgccg gttacgcccg atcgaacgga 


49021
acgtgcacga gatcgcaggc gccctggagg cgctggagac ggcggcggcc gccgccgaag 


49081
aggcggatgc cgcgcgcggg gatgagccgg cgggtggggg cgacgggggg gcgcccccgg 


49141
gtctggccgt cgcggagatg gaggtccaga tcgtgcgcaa cgacccgccg ctacgatacg 


49201
acaccaacct ccccgtggat ctgctacata tggtgtacgc gggccgcggg gcgaccggct 


49261
cgtcgggggt ggtgttcggg acctggtacc gcactatcca ggaccgcacc atcacggact 


49321
ttcccctgac cacccgcagt gccgactttc gggacggccg gatgtccaag accttcatga 


49381
cggcgctggt cctgtccctg cagtcgtgcg gccggctgta tgtgggccag cgccactatt 


49441
ccgccttcga gtgcgccgtg ttgtgtctct acctgctgta ccgaaacacg cacggggccg 


49501
ccgacgatag cgaccgcgct ccggtcacgt tcggggatct gctgggccgg ctgccccgct 


49561
acctggcgtg cctggccgcg gtgatcggga ccgagggcgg ccggccacag taccgctacc 


49621
gcgacgacaa gctccccaag acgcagttcg cggccggcgg gggccgctac gaacacggag 


49681
cgctggcgtc gcacatcgtg atcgccacgc tgatgcacca cggggtgctc ccggcggccc 


49741
cgggggacgt cccccgggac gcgagcaccc acgttaaccc cgacggcgtg gcgcaccacg 


49801
acgacataaa ccgcgccgcc gccgcgttcc tcagccgggg ccacaaccta ttcctgtggg 


49861
aggaccagac tctgctgcgg gcaaccgcga acaccataac ggccctgggc gttatccagc 


49921
ggctcctcgc gaacggcaac gtgtacgcgg accgcctcaa caaccgcctg cagctgggca 


49981
tgctgatccc cggagccgtc ccttcggagg ccatcgcccg tggggcctcc gggtccgact 


50041
cgggggccat caagagcgga gacaacaatc tggaggcgct atgtgccaat tacgtgcttc 


50101
cgctgtaccg ggccgacccg gcggtcgagc tgacccagct gtttcccggc ctggccgccc 


50161
tgtgtcttga cgcccaggcg gggcggccgg tcgggtcgac gcggcgggtg gtggatatgt 


50221
catcgggggc ccgccaggcg gcgctggtgc gcctcaccgc cctggaactc atcaaccgca 


50281
cccgcacaaa ccccaccccc gtgggggagg ttatccacgc ccacgacgcc ctggcgatcc 


50341
aatacgaaca ggggcttggc ctgctggcgc agcaggcacg cattggcttg ggctccaaca 


50401
ccaagcgttt ctccgcgttc aacgttagca gcgactacga catgttgtac tttttatgtc 


50461
tggggttcat tccacagtac ctgtcggcgg tttagtgggt ggtgggcgag gggggagggg 


50521
gcattaggga gaaagaacaa gagcctccgt tgggttttct ttgtgcctgt actcaaaagg 


50581
tcataccccg taaacggcgg gctccagtcc cggcccggcg gttggcgtga acgcaacggc 


50641
gggagctggg ttagcgttta gtttagcatt cgctctcgcc tttccgcccg cccccgaccg 


50701
ttgagccttt ttttttttcg tccaccaaag tctctgtggg tgcgcgcatg gcagccgatg 


50761
ccccgggaga ccggatggag gagcccctgc cagacagggc cgtgcccatt tacgtggctg 


50821
ggtttttggc cctgtatgac agcggggact cgggcgagtt ggcattggat ccggatacgg 


50881
tgcgggcggc cctgcctccg gataacccac tcccgattaa cgtggaccac cgcgctggct 


50941
gcgaggtggg gcgggtgctg gccgtggtcg acgacccccg cgggccgttt tttgtgggac 


51001
tgatcgcctg cgtgcaactg gagcgcgtcc tcgagacggc cgccagcgct gcgattttcg 


51061
agcgccgcgg gccgccgctc tcccgggagg agcgcctgtt gtacctgatc accaactacc 


51121
tgccctcggt ctccctggcc acaaaacgcc tggggggcga ggcgcacccc gatcgcacgc 


51181
tgttcgcgca cgtagcgctg tgcgcgatcg ggcggcgcct tggcactatc gtcacctacg 


51241
acaccggtct cgacgccgcc atcgcgccct ttcgccacct gtcgccggcg tctcgcgagg 


51301
gggcgcggcg actggccgcc gaggccgagc tcgcgctatc cggacgcacc tgggcgcccg 


51361
gcgtggaggc gctgacccac acgctgcttt ccaccgccgt taacaacatg atgctgcggg 


51421
accgctggag cctggtggcc gagcggcggc ggcaggccgg gatcgccgga cacacctacc 


51481
tccaggcgag cgaaaaattc aaaatgtggg gggcggagcc tgtttccgcg ccggcgcgcg 


51541
ggtataagaa cggggccccg gagtccacgg acataccgcc cggctcgatc gctgccgcgc 


51601
cgcagggtga ccggtgccca atcgtccgtc agcgcggggt cgcctcgccc ccggtactgc 


51661
cccccatgaa ccccgttcca acatcgggca ccccggcccc cgcgccgccc ggcgacggga 


51721
gctacctgtg gatcccggcc tcccattaca accagctcgt cgccggccac gccgcgcccc 


51781
aaccccagcc gcattccgcg tttggtttcc cggctgcggc gggggccgtg gcctatgggc 


51841
ctcacggcgc gggtctttcc cagcattacc ctccccacgt cgcccatcag tatcccgggg 


51901
tgctgttctc gggacccagc ccactcgagg cgcagatagc cgcgttggtg ggggccatag 


51961
ccgcggaccg ccaggcgggc ggtcagccgg ccgcgggaga ccctggggtc cgggggtcgg 


52021
gaaagcgtcg ccggtacgag gcggggccgt cggagtccta ctgcgaccag gacgaaccgg 


52081
acgcggacta cccgtactac cccggggagg ctcgaggcgg gccgcgcggg gtcgactctc 


52141
ggcgcgcggc ccgccagtct cccgggacca acgagaccat cacggcgctg atgggggcgg 


52201
tgacgtctct gcagcaggaa ctggcgcaca tgcgggctcg gaccagcgcc ccctatggaa 


52261
tgtacacgcc ggtggcgcac tatcgccctc aggtggggga gccggaacca acaacgaccc 


52321
acccggccct ttgtcccccg gaggccgtgt atcgcccccc accacacagc gccccctacg 


52381
gtcctcccca gggtccggcg tcccatgccc ccactccccc gtatgcccca gctgcctgcc 


52441
cgccaggccc gccaccgccc ccatgtcctt ccacccagac gcgcgcccct ctaccgacgg 


52501
agcccgcgtt cccccccgcc gccaccggat cccaaccgga ggcatccaac gcggaggccg 


52561
gggcccttgt caacgccagc agcgcagcac acgtggacgt tgacacggcc cgcgccgccg 


52621
atttgttcgt ctctcagatg atgggggccc gctgattcgc cccggtcttt ggtaccatgg 


52681
gatgtcttac tgtatatctt tttaaataaa ccaggtaata ccaaataaga cccattggtg 


52741
tatgttcttt ttttattggg aggggcgggt aggcgggtag ctttacaatg caaaagcctt 


52801
tgacgtggag gaaggcgtgg gggggaggaa atcggcactg accaaggggg tccgttttgt 


52861
cacgggaaag gaaagaggaa acaggccgcg gacacccggg ggagtttatg tgttcctttt 


52921
tctttcttcc cacacacaca caaaaggcgt accaaacaaa aaaaccaaaa gatgcgcatg 


52981
cggtttaaca cccgtggttt ttatttacaa caaacccccc gtcacaggtc gtcctcgtcg 


53041
gcgtcaccgt ctttgttggg aacttgggtg tagttggtgt tgcggcgctt gcgcatgacc 


53101
atgtcggtga ccttggcgct gagcagcgcg ctcgtgccct tcttcttggc cttgtgttcc 


53161
gtgcgctcca tggccgacac cagggccatg taccgtatca tctccctggc ctcggctagc 


53221
ttggcctcgt caaagtcgcc gccctcctcg ccctccccgg acgcgtccgg gttggtgggg 


53281
ttcttgagct ccttggtggt tagagggtac agggccttca tggggttgct ctgcagccgc 


53341
atgacgtaac gaaaggcgaa gaaggccgcc gccaggccgg ccaggaccaa cagacccacg 


53401
gccagcgccc caaaggggtt ggacatgaag gaggacacgc ccgacacggc cgataccacg 


53461
ccgcccacga tgcccatcac caccttgccg accgcgcgcc ccaggtcgcc catcccctcg 


53521
aagaacgcgc ccaggcccgc gaacatggcg gcgttggcgt cggcgtggat gaccgtgtcg 


53581
atgtcggcga agcgcaggtc gtgcagctgg ttgcggcgct ggacctccgt gtagtccagc 


53641
aggccgctgt ccttgatctc gtggcgggtg tacacctcca gggggacaaa ctcgtgatcc 


53701
tccagcatgg tgatgttgag gtcgatgaag gtgctgacgg tggtgatgtc ggcgcggctc 


53761
agctggtggg agtacgcgta ctcctcgaag tacacgtagc ccccaccgaa ggtgaagtag 


53821
cgccggtgtc ccacggtgca cggctcgatc gcatcgcgcg tcagccgcag ctcgttgttc 


53881
tcccccagct gcccctcgac caacgggccc tggtcttcgt accgaaagct gaccaggggg 


53941
cggctgtagc aggccccggg ccgcgagctg atgcgcatcg agttttggac gatcacgttg 


54001
tccgcggcga ccggcacgca cgtggagacg gccatcacgt cgccgagcat ccgcgcgctc 


54061
acccgccggc ccacggtgac cgaggcgatg gcgttggggt tcagcttgcg ggcctcgttc 


54121
cacagggtca gctcgtgatt ctgtagctcg caccacgcga tggcaacgcg gcccaacata 


54181
tcgttgacat ggcgctgtat gtggttgtac gtaaactgca gccgggcgaa ctcgatggag 


54241
gaggtggtct tgatgcgctc cacggacgcg ttggcgctgg ccccgggcgg cgggggcgtg 


54301
gggtttgggg gcttgcggct ctgctctcgg aggtgttccc gcacgtacag ctccgcgagc 


54361
gtgttgctga gaaggggctg gtacgcgatc agaaagcccc cattggccag gtagtactgc 


54421
ggctggccca ccttgatgtg cgtcgcgttg tacctgcggg cgaagatgcg gtccatggcg 


54481
tcgcgggcgt ccttgccgat gcagtccccc aggtccacgc gcgagagcgg gtactcggtc 


54541
aggttggtgg tgaaggtggt ggatatggcg tcggaggaga atcggaagga gccgccgtac 


54601
tcggagcgca gcatctcgtc cacttcctgc cacttggtca tggtgcagac cgacgggcgc 


54661
tttggcaccc agtcccaggc cacggtgaac ttgggggtcg tgagcaggtt ccgggtggtc 


54721
ggcgccgtgg cccgggcctt ggtggtgagg tcgcgcgcgt agaagccgtc gacctgcttg 


54781
aagcggtcgg cggcgtagct ggtgtgttcg gtgtgcgacc cctcccggta gccgtaaaac 


54841
ggggacatgt acacaaagtc gccagtcgcc agcacaaact cgtcgtacgg gtacaccgag 


54901
cgcgcgtcca cctcctcgac gatgcagttt accgtcgtcc cgtaccggtg gaacgcctcc 


54961
acccgcgagg ggttgtactt gaggtcggtg gtgtgccagc cccggctcgt gcgggtcgcg 


55021
gcgttggccg gtttcagctc catgtcggtc tcgtggtcgt cccggtgaaa cgcggtggtc 


55081
tccaggttgt tgcgcacgta cttggccgtg gaccgacaga cccccttggc gttgatcttg 


55141
tcgatcacct cctcgaaggg gacgggggcg cggtcctcaa agatccccat aaactgggag 


55201
tagcggtggc cgaaccacac ctgcgaaacg gtgacgtctt tgtagtacat ggtggccttg 


55261
aacttgtacg gggcgatgtt ctccttgaag accaccgcga tgccctccgt gtagttctga 


55321
ccctcgggcc gggtcgggca gcggcgcggc tgctcgaact gcaccaccgt ggcgcccgtg 


55381
gggggtgggc acacgtaaaa gtttgcatcg gtgttctccg ccttgatgtc ccgcaggtgc 


55441
tcgcgcaggg tggcgtggcc cgcggcgacg gtcgcgttgt cgccggcggg gcgtggtggc 


55501
gttgggtttt tcggtttttt gttcttcttc ggtttcgggt cccccgttgg ggcggcgcca 


55561
agggcgggcg gcgccggagt ggcagggccc ccgttcgccg cctgggtcgc ggccgcgacc 


55621
ccaggcgtgc cgggggaact cggagccgcc gacgccacca ggacccccag cgtcaacccc 


55681
aagagcgccc atacgacgaa ccaccggcgc ccccacgagg gggcgccctg gtgcatggcg 


55741
ggactacggg ggcccgtcgt gccccccgtc aggtagcctg ggggcgaggt gctggaggac 


55801
cgagtagagg atcgagaaaa cgtcgcggtc gtagaccacg accgaccggg ggccgataca 


55861
gccgtcgggg gcgctctcga cgatggccac cagcggacag tcggagtcgt acgtgagata 


55921
tacgccgggc gggtaacggt aacgaccttc ggaggtcggg cggctgcagt ccgggcggcg 


55981
caactcgagc tccccgcacc ggtagaccga ggcaaagagt gtggtggcga taatcagctc 


56041
gcgaatatat cgccaggcgg cgcgctgagt gggcgttatt ccggaaatgc cgtcaaaaca 


56101
gtaaaacctc tgaaattcgc tgacggccca atcagcaccc gagccccccg cccccatgat 


56161
gaaccgggcg agctcctcct tcaggtgcgg caggagcccc acgttctcga cgctgtaata 


56221
cagcgcggtg ttggggggct gggcgaagct gtgggtggag tgatcaaaga ggggcccgtt 


56281
gacgagctcg aagaagcgat gggtgatgct ggggagcagg gccgggtcca cctggtgtcg 


56341
caggagagac gctcgcatga accggtgcgc gtcgaacacg cccggcgccg agcggttgtc 


56401
gatgaccgtg cccgcgcccg ccgtcagggc gcagaagcgc gcgcgcgccg caaagccgtt 


56461
ggcgaccgcg gcgaacgtcg cgggcagcac ctcgccgtgg acgctgaccc gcagcatctt 


56521
ctcgagctcc ccgcgctgct cgcggacgca gcgccccagg ctggccaacg accgcttcgt 


56581
caggcggtcc gcgtacagcc gccgtcgctc ccgtacgtcc gcggccgctt gcgtggcgat 


56641
gtccccccac gtctcgggcc cctgcccccc gggcccgcgg cgacggtctt cgtcctcgcc 


56701
cccgcccccg ggagctccca acccccgtgc cccttcctct acggcgacac ggtccccgtc 


56761
gtcgtcgggg cccgcgccgc ccttgggcgc gtccgccgcg ccccccgccc ccatgcgcgc 


56821
cagcacgcga cgcagcgcct cctcgtcgca ctgttcgggg ctgacgaggc gccgcaagag 


56881
cggcgtcgtc aggtggtggt cgtagcacgc gcggatgagc gcctcgatct gatcgtcggg 


56941
tgacgtggcc tgaccgccga ttattagggc gtccaccata tccagcgccg ccaggtggct 


57001
cccgaacgcg cgatcgaaat gctccgcccg ccgcccgaac agcgccagtt ccacggccac 


57061
cgcggcggtc tcctgctgca actcgcgccg cgccagcgcg gtcaggttgc tggcaaacgc 


57121
gtccatggtg gtctggccgg cgcggtcgcc ggacgcgagc cagaatcgca attcgctgat 


57181
ggcgtacagg ccgggcgtgg tggcctgaaa cacgtcgtgc gcctccagca gggcgtcggc 


57241
ctccttgcgg accgagtcgt tctcgggcga cgggtggggc tgcccgtcgc cccccgcggt 


57301
ccgggccagc gcatggtcca acacggagag cgcccgcgcg cggtcggcgt ccgacagccc 


57361
ggcggcgtgg ggcaggtacc gccgcagctc gttggcgtcc agccgcacct gcgcctgctg 


57421
ggtgacgtgg ttacagatac ggtccgccag gcggcgggcg atcgtcgccc cctggttcgc 


57481
cgtcacacac agttcctcga aacagaccgc gcaggggtgg gacgggtcgc taagctccgg 


57541
ggggacgata aggcccgacc ccaccgcccc caccataaac tcccgaacgc gctccagcgc 


57601
ggcggtggcg ccgcgcgagg gggtgatgag gtggcagtag tttagctgct ttagaaagtt 


57661
ctcgacgtcg tgcaggaaac acagctccat atggacggtc ccgccatacg tatccagcct 


57721
gacccgttgg tgatacggac agggtcgggc caggcccatg gtctcggtga aaaacgccgc 


57781
gacgtctccc gcggtcgcga acgtctccag gctgcccagg agccgctcgc cctcgcgcca 


57841
cgcgtactct agcagcaact ccagggtgac cgacagcggg gtgagaaagg ccccggcctg 


57901
ggcctccagg cccggcctca gacgacgccg cagcgcccgc acctgaagcg cgttcagctt 


57961
cagttggggg agcttccccc gtccgatgtg ggggtcgcac cgccggagca gctctatctg 


58021
aaacacatag gtctgcacct gcccgagcag ggctaacaac ttttgacggg ccacggtggg 


58081
ctcggacacc ggggcggcca tctcgcggcg ccgatctgta ccgcggccgg agtatgcggt 


58141
ggaccgaggc ggtccgtacg ctacccggtg tctggctgag ccccggggtc cccctcttcg 


58201
gggcggcctc ccgcgggccc gccgaccggc aagccgggag tcggcggcgc gtgcgtttct 


58261
gttctattcc cagacaccgc ggagaggaat cacggcccgc ccagagatat agacacggaa 


58321
cacaaacaag cacggatgtc gtagcaataa tttattttac acacattccc cgccccgccc 


58381
taggttcccc caccccccaa cccctcacag catatccaac gtcaggtctc cctttttgtc 


58441
ggggggcccc tccccaaacg ggtcatcccc gtggaacgcc cgtttgcggc cggcaaatgc 


58501
cggtcccggg gcccccgggc cgccgaacgg cgtcgcgttg tcgtcctcgc agccaaaatc 


58561
cccaaagtta aacacctccc cggcgttgcc gagttggctg actagggcct cggcctcgtg 


58621
cgccacctcc agggccgcgt ccgtcgacca ctcgccgttg ccgcgctcca gggcacgcgc 


58681
ggtcagctcc atcatctcct cgcttaggta ctcgtcctcc aggagcgcca gccagtcctc 


58741
gatctgcagc tgctgggtgc ggggccccag gcttttcacg gtcgccacga acacgctact 


58801
ggcgacggcc gccccgccct cggagataat gccccggagc tgctcgcaca gcgagctttc 


58861
gtgcgctccg ccgccgaggt tcgaggccgc gcacacaaac ccggcccggg gacaggccag 


58921
gacgaacttg cgggtgcggt caaaaataag gagcgggcac gcgtttttgc cgcccatcag 


58981
gctggcccag ttcccggcct gaaacacacg gtcgttgccg gccatgccgt agtatttgct 


59041
gatgctcaac cccaacacga ccatggggcg cgccgccatg acgggccgca gcaggttgca 


59101
gctggcgaac atggacgtcc acgcgcccgg atgcgcgtcc acggcgtcca tcagcgcgcg 


59161
ggccccggcc tccaggcccg ccccgccctg cgcggaccac gcggccgccg cctgcacgct 


59221
ggggggacgg cgggaccccg cgatgatggc cgtgagggtg ttgatgaagt atgtcgagtg 


59281
atcgcagtac cgcagaatct ggtttgccat gtagtacatc gccagctcgc tcacgttgtt 


59341
gggggccagg ttaataaagt ttatcgcgcc gtagtccagg gaaaactttt taatgaacgc 


59401
gatggtctcg atgtcctcgc gcgacaggag ccgggcggga agctggttgc gttggagggc 


59461
cgtccagaac cactgcgggt tcggctggtt ggaccccggg ggcttgccgt tggggaagat 


59521
ggccgcgtgg aactgcttca gcagaaagcc cagcggtccg aggaggatgt ccacgcgctt 


59581
gtcgggcttc tggtaggcgc tctggaggct ggcgacccgc gccttggcgg cctcggacgc 


59641
gttggcgctc gcgcccgcga acaacacgcg gctcttgacg cgcagctcct tgggaaaccc 


59701
cagggtcacg cgggcaacgt cgccctcgaa gctgctctcg gcgggggccg tctggccggc 


59761
cgttaggctg ggggcgcaga tagccgcccc ctccgagagc gcgaccgtca gcgttttggc 


59821
cgacagaaac ccgttgttaa acatgtccat cacgcgccgc cgcagcaccg gttggaattg 


59881
attgcgaaag ttgcgcccct cgaccgactg cccggcgaac accccgtggc actggctcag 


59941
ggccaggtcc tgatacacgg cgaggttgga tcgccgcccg agaagctgaa gcagggggca 


60001
tggcccgcac gcgtacgggt ccagcgtcag ggacatggcg tggttggcct cgcccagacc 


60061
gtcgcgaaac ttgaagttcc tcccctccac caggttgcgc atcagctgct ccacctcgcg 


60121
gtccacgacc tgcctgacgt tgttcaccac cgtatgcagg gcctcgcggt tggtgatgat 


60181
ggtctccagc cgccccatgg ccgtggggac cgcctggtcc acgtactgca gggtctcgag 


60241
ttcggccatg acgcgctcgg tcgccgcgcg gtacgtctcc tgcatgatgg tccgggcggt 


60301
ctcggatccg tccgcgcgct tcagggccga gaaggcggcg tagtttccca gcacgtcgca 


60361
gtcgctgtac atgctgttca tggtcccgaa gacgccgatg gctccgcggg cggcgctggc 


60421
gaacttggga tggcgcgccc ggaggcgcat gagcgtcgtg tgtacgcagg cgtggcgcgt 


60481
gtcgaaggtg cacaggttgc agggcacgtc ggtctggttg gagtccgcga cgtatcgaaa 


60541
cacgtccatc tcctggcgcc cgacgatcac gccgccgtcg cagcgctcca ggtaaaacag 


60601
catcttggcc agcagcgccg gggaaaaccc acacagcatg gccaggtgct cgccggcaaa 


60661
ttcctgggtt ccgccgacga ggggcgcggt gggccgaccc tcgaacccgg gcaccacgtg 


60721
tccctcgcgg tccacctgtg ggttggccgc cacgtgggtc ccgggcacga ggaagaagcg 


60781
gtaaaaggag ggtttgctgt ggtcctttgg gtccgccgga ccggcgtcgt ccacctcggt 


60841
gagatggagg gccgagttgg tgctaaatac catggccccc acgagtcccg cggcgcgcgc 


60901
caggtacgcc ccgacggcgt tggcgcgggc cgcggccgtg tcctggccct cgcacagcgg 


60961
ccacgcggag atgtcggtgg gcggctcgtc gaagacggcc atcgacacga tagactcgag 


61021
ggccagggcg gcgtctccgg ccatgacgga ggccaggcgc tgttcgaacc cgcccgccgg 


61081
gcccttgccg ccgccgtcgc gcccaccccg cggggtctta ccctggctgg cttcgaaggc 


61141
cgtgaacgta atgtcggcgg ggagggcggc gccctcgtgg ttttcgtcaa acgccaggtg 


61201
ggcggccgcg cgggccacgg cgtccacgtt tcggcatcgc agtgccacgg cggcgggtcc 


61261
cacgaccgcc tcgaacagga ggcggttgag ggggcggtta aaaaacggaa gcgggtaggt 


61321
aaaattctcc ccgatcgatc ggtggttggc gttgaacggc tcggcgatga cccggctaaa 


61381
atccggcatg aacagctgca acggatacac gggtatgcgg tgcacctccg ccccgcctat 


61441
ggttaccttg tccgagcctc ccaggtgcag aaaggtgttg ttgatgcaca cggcctcctt 


61501
gaagccctcg gtaacgacca gatacaggag ggcgcggtcc gggtccaggc cgaggcgctc 


61561
acacagcgcc tcccccgtcg tctcgtgttt gaggtcgccg ggccgggggg tgtagtccga 


61621
aaagccaaaa tggcggcgtg cccgctcgca gagtcgcgtc aggtttgggg cctgggtgct 


61681
ggggtccagg tgccggccgc cgtgaaagac gtacacggac gagctgtagt gcgatggcgt 


61741
cagtttcagg gacaccgcgg tacccccgag ccccgtcgtg cgagaaccca cgaccacggc 


61801
tacgttggcc tcaaagccgc tctccacggt caggcccacg accaggggcg ccacggcgac 


61861
gtcggcatcg ccgctgcgcg ccgacagtaa cgccagaagc tcgatgcctt cggacggaca 


61921
cgcgcgagcg tacacgtatc ccaggggccc gggggggacc ttgatggtgg ttgccgtctt 


61981
gggctttgtc tccatgtcct cctggcaatc ggtccgcaaa cggaggtaat cccggcacga 


62041
cgacggacgc ccgacgaggt atgtctcccg agcgtcaaaa tccggggggg ggggcggcga 


62101
cggtcaaggg gagggtggga gaccggggtt ggggaatgaa tccctaccct tcacagacaa 


62161
cccccgggta accacggggt gccgatgaac cccggcggct ggcaacgcgg ggtccctgcg 


62221
agaggcacag atgcttacgg tcaggtgctc cgggccgggt gcgtctgata tgcggttggt 


62281
atatgtacac tttacctggg ggcgtgccgg accgccccag cccctcccac accccgcgcg 


62341
tcatcagccg gtgggcgtgg ccgctattat aaaaaaagtg agaacgcgaa gcgttcgcac 


62401
tttgtcctaa taatatatat attattagga caaagtgcga acgcttcgcg ttctcacttt 


62461
ttttataata gcggccacgc ccaccggcta cgtcacgctc ctgtcggccg ccggcggtcc 


62521
ataagcccgg ccggccgggc cgacgcgaat aaaccgggcc gccggccggg gcgccgcgca 


62581
gcagctcgcc gcccggatcc gccagacaaa caaggccctt gcacatgccg gcccgggcga 


62641
gcctgggggt ccggtaattt tgccatccca cccaagcggc ttttggggtt tttcctcttc 


62701
ccccctcccc acatcccccc tctttagggg ttcgggtggg aacaaccgcg atgttttccg 


62761
gtggcggcgg cccgctgtcc cccggaggaa agtcggcggc cagggcggcg tccgggtttt 


62821
ttgcgcccgc cggccctcgc ggagccggcc ggggaccccc gccttgtttg aggcaaaact 


62881
tttacaaccc ctacctcgcc ccagtcggga cgcaacagaa gccgaccggg ccaacccagc 


62941
gccatacgta ctatagcgaa tgcgatgaat ttcgattcat cgccccgcgg gtgctggacg 


63001
aggatgcccc cccggagaag cgcgccgggg tgcacgacgg tcacctcaag cgcgccccca 


63061
aggtgtactg cgggggggac gagcgcgacg tcctccgcgt cgggtcgggc ggcttctggc 


63121
cgcggcgctc gcgcctgtgg ggcggcgtgg accacgcccc ggcggggttc aaccccaccg 


63181
tcaccgtctt tcacgtgtac gacatcctgg agaacgtgga gcacgcgtac ggcatgcgcg 


63241
cggcccagtt ccacgcgcgg tttatggacg ccatcacacc gacggggacc gtcatcacgc 


63301
tcctgggcct gactccggaa ggccaccggg tggccgttca cgtttacggc acgcggcagt 


63361
acttttacat gaacaaggag gaggttgaca ggcacctaca atgccgcgcc ccacgagatc 


63421
tctgcgagcg catggccgcg gccctgcgcg agtccccggg cgcgtcgttc cgcggcatct 


63481
ccgcggacca cttcgaggcg gaggtggtgg agcgcaccga cgtgtactac tacgagacgc 


63541
gccccgctct gttttaccgc gtctacgtcc gaagcgggcg cgtgctgtcg tacctgtgcg 


63601
acaacttctg cccggccatc aagaagtacg agggtggggt cgacgccacc acccggttca 


63661
tcctggacaa ccccgggttc gtcaccttcg gctggtaccg tctcaaaccg ggccggaaca 


63721
acacgctagc ccagccgcgg gccccgatgg ccttcgggac atccagcgac gtcgagttta 


63781
actgtacggc ggacaacctg gccatcgagg ggggcatgag cgacctaccg gcatacaagc 


63841
tcatgtgctt cgatatcgaa tgcaaggcgg ggggggagga cgagctggcc tttccggtgg 


63901
ccgggcaccc ggaggacctg gttattcaga tatcctgtct gctctacgac ctgtccacca 


63961
ccgccctgga gcacgtcctc ctgttttcgc tcggttcctg cgacctcccc gaatcccacc 


64021
tgaacgagct ggcggccagg ggcctgccca cgcccgtggt tctggaattc gacagcgaat 


64081
tcgagatgct gttggccttc atgacccttg tgaaacagta cggccccgag ttcgtgaccg 


64141
ggtacaacat catcaacttc gactggccct tcttgctggc caagctgacg gacatttaca 


64201
aggtccccct ggacgggtac ggccgcatga acggccgggg cgtgtttcgc gtgtgggaca 


64261
taggccagag ccacttccag aagcgcagca agataaaggt gaacggcatg gtgaacatcg 


64321
acatgtacgg gatcataacc gacaagatca agctctcgag ctacaagctc aacgccgtgg 


64381
ccgaagccgt cctgaaggac aagaagaagg acctgagcta tcgcgacatc cccgcctact 


64441
acgccaccgg gcccgcgcaa cgcggggtga tcggcgagta ctgcatacag gattccctgc 


64501
tggtgggcca gctgtttttt aagtttttgc cccatctgga gctctcggcc gtcgcgcgct 


64561
tggcgggtat taacatcacc cgcaccatct acgacggcca gcagatccgc gtctttacgt 


64621
gcctgctgcg cctggccgac cagaagggct ttattctgcc ggacacccag gggcgattta 


64681
ggggcgccgg gggggaggcg cccaagcgtc cggccgcagc ccgggaggac gaggagcggc 


64741
cagaggagga gggggaggac gaggacgaac gcgaggaggg cgggggcgag cgggagccgg 


64801
agggcgcgcg ggagaccgcc ggccggcacg tggggtacca gggggccagg gtccttgacc 


64861
ccacttccgg gtttcacgtg aaccccgtgg tggtgttcga ctttgccagc ctgtacccca 


64921
gcatcatcca ggcccacaac ctgtgcttca gcacgctctc cctgagggcc gacgcagtgg 


64981
cgcacctgga ggcgggcaag gactacctgg agatcgaggt gggggggcga cggctgttct 


65041
tcgtcaaggc tcacgtgcga gagagcctcc tcagcatcct cctgcgggac tggctcgcca 


65101
tgcgaaagca gatccgctcg cggattcccc agagcagccc cgaggaggcc gtgctcctgg 


65161
acaagcagca ggccgccatc aaggtcgtgt gtaactcggt gtacgggttc acgggagtgc 


65221
agcacggact cctgccgtgc ctgcacgttg ccgcgacggt gacgaccatc ggccgcgaga 


65281
tgctgctcgc gacccgcgag tacgtccacg cgcgctgggc ggccttcgaa cagctcctgg 


65341
ccgatttccc ggaggcggcc gacatgcgcg cccccgggcc ctattccatg cgcatcatct 


65401
acggggacac ggactccata tttgtgctgt gccgcggcct cacggccgcc gggctgacgg 


65461
ccatgggcga caagatggcg agccacatct cgcgcgcgct gtttctgccc cccatcaaac 


65521
tcgagtgcga aaagacgttc accaagctgc tgctgatcgc caagaaaaag tacatcggcg 


65581
tcatctacgg gggtaagatg ctcatcaagg gcgtggatct ggtgcgcaaa aacaactgcg 


65641
cgtttatcaa ccgcacctcc agggccctgg tcgacctgct gttttacgac gataccgtat 


65701
ccggagcggc cgccgcgtta gccgagcgcc ccgcagagga gtggctggcg cgacccctgc 


65761
ccgagggact gcaggcgttc ggggccgtcc tcgtagacgc ccatcggcgc atcaccgacc 


65821
cggagaggga catccaggac tttgtcctca ccgccgaact gagcagacac ccgcgcgcgt 


65881
acaccaacaa gcgcctggcc cacctgacgg tgtattacaa gctcatggcc cgccgcgcgc 


65941
aggtcccgtc catcaaggac cggatcccgt acgtgatcgt ggcccagacc cgcgaggtag 


66001
aggagacggt cgcgcggctg gccgccctcc gcgagctaga cgccgccgcc ccaggggacg 


66061
agcccgcccc ccccgcggcc ctgccctccc cggccaagcg cccccgggag acgccgtcgc 


66121
atgccgaccc cccgggaggc gcgtccaagc cccgcaagct gctggtgtcc gagctggccg 


66181
aggatcccgc atacgccatt gcccacggcg tcgccctgaa cacggactat tacttctccc 


66241
acctgttggg ggcggcgtgc gtgacattca aggccctgtt tgggaataac gccaagatca 


66301
ccgagagtct gttaaaaagg tttattcccg aagtgtggca ccccccggac gacgtggccg 


66361
cgcggctccg ggccgcaggg ttcggggcgg tgggtgccgg cgctacggcg gaggaaactc 


66421
gtcgaatgtt gcatagagcc tttgatactc tagcatgagc cccccgtcga agctgatgtc 


66481
cctcatttta caataaatgt ctgcggccga cacggtcgga atctccgcgt ccgtgggttt 


66541
ctctgcgttg cgccggacca cgagcacaaa cgtgctctgc cacacgtggg cgacgaaccg 


66601
gtaccccggg cacgcggtga gcatccggtc tatgagccgg tagtgcaggt gggcggacgt 


66661
gccgggaaag atgacgtaca gcatgtggcc cccgtaagtg gggtccgggt aaaacaacag 


66721
ccgcgggtcg cacgccccgc ctccgcgcag gatcgtgtgg acgaaaaaaa gctcgggttg 


66781
gccaagaatc ccggccaaga ggtcctggag gggggcgttg tggcggtcgg ccaacacgac 


66841
caaggaggcc aggaaggcgc gatgctcgaa tatcgtgttg atctgctgca cgaaggccag 


66901
gattagggcc tcgcggctgg tggcggcgaa ccgcccgtct cccgcgttgc acgcgggaca 


66961
gcaacccccg atgcctaggt agtagcccat cccggagagg gtcaggcagt tgtcggccac 


67021
ggtctggtcc agacagaagg gcagcgacac gggagtggtc ttcaccaggg gcaccgagaa 


67081
cgagcgcacg atggcgatct cctcggaggg cgtctgggcg agggcggcga aaaggccccg 


67141
atagcgctgg cgctcgtgta aacacagctc ctgtttgcgg gcgtgaggcg gcaggctctt 


67201
ccgggaggcc cgacgcacca cgcccagagt cccgccggcc gcagaggagc acgaccgccg 


67261
gcgctccttg ccgtgatagg gcccgggccg ggagccgcgg cgatgggggt cggtatcata 


67321
cataggtaca cagggtgtgc tccagggaca ggagcgagat cgagtggcgt ctaagcagcg 


67381
cgcccgcctc acggacaaat gtggcgagcg cggtgggctt tggtacaaat acctgatacg 


67441
tcttgaaggt gtagatgagg gcacgcaacg ctatgcagac acgcccctcg aactcgttcc 


67501
cgcaggccag cttggccttg tggagcagca gctcgtcggg atgggtggcg gggggatggc 


67561
cgaacagaac ccaggggtca acctccatct ccgtgatggc gcacatgggg tcacagaaca 


67621
tgtgcttaaa gatggcctcg ggccccgcgg cccgcagcag gctcacaaac cggcccccgt 


67681
ccccgggctg cgtctcgggg tccgcctcga gctggtcgac gacgggtacg atacagtcga 


67741
agaggctcgt gttgttttcc gagtagcgga ccacggaggc ccggagtctg cgcagggcca 


67801
gccagtaagc ccgcaccagt aacaggttac acagcaggca ttctccgccg gtgcgcccgc 


67861
gcccccggcc gtgtttcagc acggtggcca tcagagggcc caggtcgagg tcgggctggg 


67921
catcgggttc ggtaaactgc gcaaagcgcg gagccacgtc gcgcgtgcgt gccccgcgat 


67981
gcgcttccca ggactggcgg accgtggcgc gacgggcctc cgcggcagcg cgcagctggg 


68041
gccccgactc ccagacggcg ggggtgccgg cgaggagcag caggaccaga tccgcgtacg 


68101
cccacgtatc cggcgactcc tccggctcgc ggtccccggc gaccgtctcg aattccccgt 


68161
tgcgagcggc ggcgcgcgta cagcagctgt ccccgccccc gcgccgaccc tccgtgcagt 


68221
ccaggagacg ggcgcaatcc ttccagttca tcagcgcggt ggtgagcgac ggctgcgtgc 


68281
cggatcccgc cgccgacccc gccccctcct cgcccccgga ggccaaggtt ccgatgaggg 


68341
cccgggtggc agactgcgcc aggaacgagt agttggagta ctgcaccttg gcggctcccg 


68401
gggagggcga gggcttgggt tgcttctggg catgccgccc gggcaccccg ccgtcggtac 


68461
ggaagcagca gtggagaaaa aagtgccggt ggatgtcgtt tatggtgagg gcaaagcgtg 


68521
cgaaggagcc gaccagggtc gccttcttgg tgcgcagaaa gtggcggtcc atgacgtaca 


68581
caaactcgaa cgcggccacg aagatgctag cggcgcagtg gggcgccccc aggcatttgg 


68641
cacagagaaa cgcgtaatcg gccacccact ggggcgagag gcggtaggtt tgcttgtaca 


68701
gctcgatggt gcggcagacc agacagggcc ggtccagcgc gaaggtgtcg atggccgccg 


68761
cggaaaaggg cccggtgtcc aaaagcccct ccccacaggg atccgggggc gggttgcggg 


68821
gtcctccgcg cccgcccgaa ccccctccgt cgcccgcccc cccgcgggcc cttgaggggg 


68881
cggtgaccac gtcggcggcg acgtcctcgt cgagcgtacc gacgggcggc acacctatca 


68941
cgtgactggc cgccaggagc tcggcgcaga gagcctcgtt aagagccagg aggctgggat 


69001
cgaaggccac atacgcgcgc tcgaacgccc ccgccttcca gctgctgccg ggggactctt 


69061
cgcacaccgc gacgctcgcc aggaccccgg ggggcgaagt tgccatggct gggcgggagg 


69121
ggcgcacgcg ccagcgaact ttacgggaca caatccccga ctgcgcgctg cggtcccaga 


69181
ccctggagag tctagacgcg cgctacgtct cgcgagacgg cgcgcatgac gcggccgtct 


69241
ggttcgagga tatgaccccc gccgagctgg aggttgtctt cccgactacg gacgccaagc 


69301
tgaactacct gtcgcggacg cagcggctgg cctccctcct gacgtacgcc gggcctataa 


69361
aagcgcccga cgacgccgcc gccccgcaga ccccggacac cgcgtgtgtg cacggcgagc 


69421
tgctcgcccg caagcgggaa agattcgcgg cggtcattaa ccggttcctg gacctgcacc 


69481
agattctgcg gggctgacgc gcgcgctgtt gggcgggacg gttcgcgaac cctttggtgg 


69541
gtttacgcgg gcacgcacgc tcccatcgcg ggcgccatgg cgggactggg caagccctac 


69601
cccggccacc caggtgacgc cttcgagggt ctcgttcagc gaattcggct tatcgtccca 


69661
tctacgttgc ggggcgggga cggggaggcg ggcccctact ctccctccag cctcccctcc 


69721
aggtgcgcct ttcagtttca tggccatgac gggtccgacg agtcgtttcc catcgagtat 


69781
gtactgcggc ttatgaacga ctgggccgag gtcccgtgca acccttacct gcgcatacag 


69841
aacaccggcg tgtcggtgct gtttcagggg ttttttcatc gcccacacaa cgcccccggg 


69901
ggcgcgatta cgccagagcg gaccaatgtg atcctgggct ccaccgagac gacggggctg 


69961
tccctcggcg acctggacac catcaagggg cggctcggcc tggatgcccg gccgatgatg 


70021
gccagcatgt ggatcagctg ctttgtgcgc atgccccgcg tgcagctcgc gtttcggttc 


70081
atgggccccg aagatgccgg acggacgaga cggatcctgt gccgcgccgc cgagcaggct 


70141
attacccgtc gccgccgaac ccggcggtcc cgggaggcgt acggggccga ggccgggctg 


70201
ggggtggctg gaacgggttt ccgggccagg ggggacggtt ttggcccgct ccccttgtta 


70261
acccaagggc cctcccgccc gtggcaccag gccctgcggg gtcttaagca cctacggatt 


70321
ggcccccccg cgctcgtttt ggcggcggga ctcgtcctgg gggccgctat ttggtgggtg 


70381
gttggtgctg gcgcgcgcct ataaaaaagg acgcaccgcc gccctaatcg ccagtgcgtt 


70441
ccggacgcct tcgccccaca cagccctccc gtccgacacc cccatatcgc ttcccgacct 


70501
ccggtcccga tggccgtccc gcaatttcac cgccccagca ccgttaccac cgatagcgtc 


70561
cgggcgcttg gcatgcgcgg gctcgtcttg gccaccaata actctcagtt tatcatggat 


70621
aacaaccacc cgcaccccca gggcacccaa ggggccgtgc gggagtttct ccgcggtcag 


70681
gcggcggcgc tgacggacct tggtctggcc cacgcaaaca acacgtttac cccgcagcct 


70741
atgttcgcgg gcgacgcccc ggccgcctgg ttgcggcccg cgtttggcct gcggcgcacc 


70801
tattcaccgt ttgtcgttcg agaaccttcg acgcccggga ccccgtgagg cccggggagt 


70861
tccttctggg gtgttttaat caataaaaga ccacaccaac gcacgagcct tgcgtttaat 


70921
gtcgtgttta ttcaagggag tgggataggg ttcgacggtt cgaaacttaa cacacaaaat 


70981
aatcgagcgc gtctagccca gtaacatgcg cacgtgatgt aggctggtca gcacggcgtc 


71041
gctgtgatga agcagcgccc ggcgggtccg ctgtaactgc tgttgtaggc ggtaacaggc 


71101
gcggatcagc accgccaggg cgctacgacc ggtgcgttgc acgtagcgtc gcgacagaac 


71161
tgcgtttgcc gatacgggcg gggggccgaa ttgtaagcgc gtcacctctt gggagtcatc 


71221
ggcggataac gcactgaatg gttcgttggt tatgggggag tgtggttccc gagggagtgg 


71281
gtcgagcgcc tcggcctcgg aatccgagag gaacaacgag gtggtgtcgg agtcttcgtc 


71341
gtcagagaca tacagggtct gaagcagcga cacgggcggg ggggtagcgt caatgtgtag 


71401
cgcgagggag gatgcccacg aagacacccc agacaaggag ctgcccgtgc gtggatttgt 


71461
ggacgacgcg gaagccggga cggatgggcg gttttgcggt gcccggaacc gaaccgccgg 


71521
atactccccg ggtgctacat gcccgttttg gggctggggt tggggctggg gctggggttg 


71581
gggttggggc tggggttggg gctggggttg gggctggggt tggggttggg gttggggctg 


71641
gggttggggt tggggctggg gctggggctg gggctggggc tggggctggg gctggggctg 


71701
gggctggggc tggggctggg gctggggctg gggctggggt tggggctggg gttggggctg 


71761
gggcgcggac aggcggttga cggtcaaatg cccccggggg cgcgcagatg tggtgggcgt 


71821
ggccaccggc tgccgtgtag tggggcggcg ggaaaccggg cctccgggcg taacaccgcc 


71881
ctccagcgtc aagtatgtgg ggggcgggcc tgacgtcggg ggcggggtga cgggttggac 


71941
cgcgggaggc gggggagagg gacctgcggg agaggatgag gtcggctcgg ccgggttgcg 


72001
gcctaaaaca ggggccgtgg ggtcggcggg gtcccagggt gaagggaggg attcccgcga 


72061
ttcggacagc gacgcgacag cggggcgcgt aaggcgccgc tgcggcccgc ctacgggaac 


72121
cctggggggg gttggcgcgg gacccgaggt tagcgggggg cggcggtttt cgcccccggg 


72181
caaaaccgtg ccggttgcga ccgggggcgg aacgggatcg atagggagag cgggagaagc 


72241
ctggccggcg aactggggac cgagcgggag gggcacacca gacaccaaag cgtggagcgc 


72301
tggctctggg ggtttgggag gggccggggg gcgcgcgaaa tcggtaaccg gggcgaccgt 


72361
gtcggggagg gcaggcggcc gccaaccctg ggtggtcgcg gaagcctggg tggcgcgcgc 


72421
cagggagcgt gcccggcggt gtcggcgcgc gcgcgacccg gacgaagaag cggcagaagc 


72481
gcgggaggag gcgggggggc ggggggcggt ggcatcgggg ggcgccgggg aactttgggg 


72541
ggacggcaag cgccggaagt cgtcgcgggg gcccacgggc gccggccgcg tgctttcggc 


72601
cgggacgccc ggtcgtgctt cgcgagccgt gactgccggc ccagggggcc gcggtgcaca 


72661
ctgggacgtg gggacggact gatcggcggt gggcgaaagg gggtccgggg caaggagggg 


72721
cgcggggccg ccggagtcgt cagacgcgag ctcctccagg ccgtgaatcc atgcccacat 


72781
gcgagggggg acgggctcgc cgggggtggc gtcggtgaat agcgtggggg ccaggcttcc 


72841
gggccccaac gagccctccg tcccaacaag gtccgccggg ccgggggtcg ggttcgggac 


72901
cgaggggctc tggtcgtcgg gggcgcgctg gtacaccgga tgccccggga atagctcccc 


72961
cgacaggagg gaggcgtcga acggccgccc gaggatagct cgcgcgagga aggggtcctc 


73021
gtcggtggcg ctggcggcga ggacgtcctc gccgcccgcc acaaacggga gctcctcggt 


73081
ggcctcgctg ccaacaaacc gcacgtcggg ggggccgggg gggtccgggt tttcccacaa 


73141
caccgcgacc ggggtcatgg agatgtccac gagcaccaga cacggcgggc cccgggcgag 


73201
gggccgctcg gcgatgagcg cggacaggcg cgggagctgt gccgccagac acgcgttttc 


73261
aatcgggttc aggtcggcgt gcaggaggcg gacggcccac gtctcgatgt cggacgacac 


73321
ggcatcgcgc aaggcggcgt ccggcccgcg agcgcgtgag tcaaacagcg tgagacacag 


73381
ctccagctcc gactcgcggg aaaaggccgt ggtgttgcgg agcgccacga cgacgggcgc 


73441
gcccaggagc actgccgcca gcaccaggtc catggccgta acgcgcgccg cgggggtgcg 


73501
gtgggtggcg gcggccggca cggcgacgtg ctggcccgtg ggccggtaga gggcgttggg 


73561
gggagcgggg ggtgacgcct cgcgcccccc cgaggggctc agcgtctgcc cagattccag 


73621
acgcgcggtc agaagggcgt cgaaactgtc atactctgtg tagtcgtccg gaaacatgca 


73681
ggtccaaaga gcggccagag cggtgcttgg gagacacatg cgcccgagga cgctcaccgc 


73741
cgccagcgcc tgggcgggac tcagctttcc cagcgcggcg ccgcgctcgg ttcccagctc 


73801
ggggaccgag cgccagggcg ccagggggtc ggtttcggac aacttgccgc ggcgccagtc 


73861
tgccagccgc gtgccgaaca tgaggccccg ggtcggaggg cctccggtcg aaaacactgg 


73921
cagcacgcgg atgcgggcgt ctggatgcgg ggtcaggcgc tgcacgaata gcatggaatc 


73981
tgctgcgttc tgaaacgcac gggggagggt gagatgcatg tactcgtgtt ggcggaccag 


74041
atccaggcgc caaaaggtgt aaatgtgttc cggggagctg gccaccagcg ccaccagcac 


74101
gtcgttctcg ttaaaggaaa cgcggtgcct agtggagctg tggggcccga gcggcggtcc 


74161
cggggccgcc gcgtcacccc cccattccag ctgggcccag cgacacccaa actcgcgcgt 


74221
gagagtggtc gcgacgaggg cgacgtagag ctcggccgcc gcatccatcg aggcccccca 


74281
tctcgcctgg cggtggcgca caaagcgtcc gaagagctga aagttggcgg cctgggcgtc 


74341
gctgagggcc agctgaagcc ggttgatgac ggtgatgacg tacatggccg tgacggtcga 


74401
ggccgactcc agggtgtccg tcggaagcgg ggggcgaatg catgccgcct cgggacacat 


74461
cagcagcgcg ccgagcttgt cggtcacggc cgggaagcag agcgcgtact gcagtggcgt 


74521
tccatccggg accaaaaagc tgggggcgaa cggccgatcc agcgtactgg tggcctcgcg 


74581
cagcaccagg ggccccgggc ctccgctcac tcgcaggtac gcctcgcccc ggcggcgcag 


74641
catctgcggg tcggcctctt ggccgggtgg ggcggacgcc cgggcgcggg cgtctagggc 


74701
gcgaagatcc acgagcaggg gcgcgggcgc ggcggccgcg cccgcgcccg tctggcctgt 


74761
ggccttggcg tacgcgctat ataagcccat gcggcgttgg atgagctccc gcgcgccccg 


74821
gaactcctcc accgcccatg gggccaggtc cccggccacc gcgtcgaatt ccgccaacag 


74881
gccccccagg gtgtcaaagt tcatctccca ggccaccctt ggcaccacct cgtcccgcag 


74941
ccgggcgctc aggtcggcgt gttgggccac gcgccccccg agctcctcca cggccccggc 


75001
ccgctcggcg ctcttggcgc ccaggacgcc ctggtacttg gcgggaaggc gctcgtagtc 


75061
ccgctgggct cgcagccccg acacagtgtt ggtggtgtcc tgcagggcgc gaagctgctc 


75121
gcatgccgcg cgaaatccct cgggcgattt ccaggccccc ccgcgaacgc ggccgaagcg 


75181
accccatacc tcgtcccact ccgcctcggc ctcctcgaga gacctccgca gggcctcgac 


75241
gcggcgacgg gtgtcgaaga gcgcctgcag gcgcgcgccc tgtcgcgtca ggaggcccgg 


75301
gccgtcgccg ctggccgcgt ttagcgggtg cgtctcaaag gtacgctggg catgttccaa 


75361
ccaggcgacc gcctgcacgt cgagctcgcg cgccttctcc gtctggtcca acagaatttc 


75421
gacctgatcc gcgatctcct ccgccgagcg cgcctggtcc agcgtcttgg ccacggtcgc 


75481
cgggacggcg accaccttca gcagggtctt cagattggcc agaccctcgg cctcgagctg 


75541
ggcccggcgc tcgcgcgcgg ccagcacctc ccgcagcccc gccgtgaccc gctcggtggc 


75601
ttcggcgcgc tgctgtttgg cgcgcaccac ggcgtccttg gtatcggcca ggtcctgtcg 


75661
ggtcacgaat gcgacgtagt cggcgtacgc cgtgtccttc acggggctct ggtccacgcg 


75721
ctccagcgcc gccacgcacg ccaccagcgc gtcctcgctc gggcagggca gggtgacccc 


75781
tgcccggaca agctcggcgg ccgccgccgg gtcgttgcgc accgcggata tctcctccgc 


75841
ggcggcggcc aggtccagcg ccacgcttcc gatcgcgcgc cgcgcgtcgg cccggagggc 


75901
gtccaggcga tcgcggatat ccacgtactc ggcgtagccc ttttgaaaaa acggcacgta 


75961
ctggcgcagg gccggcacgc cccccaagtc ttccgacagg tgtaggacgg cctcgtggta 


76021
gtcgataaac ccgtcgttcg cctgggcccg ctccagcagc ccccccgcca gccgcagaag 


76081
ccgcgccagg ggctcggtgt ccacccgaaa catgtcggcg tacgtgtcgg ccgcggcccc 


76141
gaaggccgcg ctccagtcga tgcggtgaat ggctgcgagc ggggggagca tggggtggcg 


76201
ctggttctcg ggggtgtatg ggttaaacgc aagggccgtc tccagggcaa gggtcaccgc 


76261
cttggcgttg gttcccagcg cctgttcggc ccgctttcgg aagtcccggg ggttgtagcc 


76321
gtgcgtgccc gccagcgcct gcaggcgacg gagctcgacc acgtcaaact cggcaccgct 


76381
ttccacgcgg tccagcacgg cctccacgtc ggcggcccag cgctcgtggc tactgcgggc 


76441
gcgctgggcc gccatcttct ctctcaggtc ggcgatggcg gcctcaagtt cgtcggcgcg 


76501
gcgtcgcgtg gcgccgatga cctttcccag ctcctgcagg gcgcgcccgc tgggggagtg 


76561
gtccccggcc gtcccttcgg cgtgcaacag gcccccgaac ctgccctcgt ggcccgcgag 


76621
gctttcccgc gcgccggtgg tcgcgcgcgt cgcggcctgg atcagggagg catgctctcc 


76681
ctccggttgg ttggcggccc ggcgcacctg gacgacaagg tcggctgccg ccgaccctaa 


76741
ggtcgtgagc tgggcgatgg ccccccgcgc gtccagggcc aaccgagtcg ccttgacgta 


76801
tcccgcggcg ctgtcggcca tggccgctag gaaggccagg ggggaggccg ggtcgctggc 


76861
ggccgcgccc agggccgtca ccgcgtcgac caggacgcgg tgcgcccgca cggccgcatc 


76921
caccgtcgac gcggggtctg ccgtcgcgac ggcggcgctg ccggcgttga tggcgttcga 


76981
gacggcgtgg gctatgatcg gggcgtgatc ggcgaagaac tgcaagagaa acggagtctc 


77041
tggggcgtcg gcgaacaggt tcttcagcac caccacgaag ctgggatgca agccagacag 


77101
agccgtcgcc gtgtccggag tcgggtgctc cagggcatct cggtactgcc ccagcagccc 


77161
ccacatgtcc gcccgcagcg ccgccgtaac ctcagggggc gccccccgaa cggcctcggg 


77221
gaggtccgac cagcccgccg gcagggaggc ccgcagggtc gccaggacgg ccggacaggc 


77281
ctttagcccc acaaagtcag ggagggggcg caggaccccc tggagtttgt gcaagaactt 


77341
ctcccgggcg tcgcgggcca ccttcgcccg ctcccgcgct ccctcgagca ttgcctccag 


77401
ggagcgcgcg cgctcccgca aacgggcacg cgcatcgggg gcgagctctg ccgtcagctt 


77461
ggcggcatcc atggcccgcg cctgccgcag cgcttcctcg gccatgcgcg tggcctctgg 


77521
cgacagcccg ccgtcgtcgg ggtagggcga cgcgccgggc gcaggaacaa aggccgcgtc 


77581
gctgtccagc tgctggccca gggccgcatc tagggcgtcg aagcgccgca gctcggccag 


77641
acccgagctg cggcgcgcct gctggtcgtt aatgtcgcgg atgctgcgcg ccagctcgtc 


77701
cagcggcttg cgttctatca gcccttggtt ggcggcgtcc gtcaggacgg agagccaggc 


77761
cgccaggtcc tcgggggcgt ccagcgtctg gccccgctgg atcagatccc gcaacaggat 


77821
ggccgtgggg ctggtcgcga tcgggggcgg ggcgggaatg gcggcgcgct gcgcgatgtc 


77881
ccgcgtgtgc tggtcgaaga caggcaggga ctcgagcagc tggaccacgg gcacgacggc 


77941
ggccgaagcc acgtgaaacc ggcggtcgtt gttgtcgctg gcctgtagag ccttggcgct 


78001
gtatacggcc ccccggtaaa agtactcctt aaccgcgccc tcgatcgccc gacgggcctg 


78061
ggtccgcacc tcctccagcc gaacctgaac ggcctcgggg cccagggggg gtgggcgcgg 


78121
agccccctgc ggggccgccc cggccggggc gggcattacg ccgaggggcc cggcgtgctg 


78181
tgagaccgcg tcgaccccgc gagcgagggc gtcgagggcc tcgcgcatct ggcgatcctc 


78241
cgcctccacc ctaatctctt cgccacgggc aaatttggcc agagcctgga ctctatacag 


78301
aagcggttct gggtgcgtcg gggtggcggg ggcaaaaagg gtgtccgggt gggcctgcga 


78361
gcgctccaga agccactcgc cgaggcgtgt atacagattg gccggcgggg ccgcgcgaag 


78421
ctgcagctcc aggtccgcga gttccccgta aaaggcgtcc gtctcccgaa tgacatccct 


78481
agccacaagg atcagcttcg ccagcgccag gcgaccgatc agagagtttt cgtccagcac 


78541
gtgctggacg aggggcagat gggcggccac gtcggccagg ctcaggcgcg tggaggccag 


78601
aaagtccccc acggccgttt tccggggcag catgctcagg gtaaactcca gcagggcggc 


78661
ggccgggccg gccaccccgg cctgggtgtg cgtccgggcc ccgttctcga tgagaaaggc 


78721
gaggacgcgt tcaaagaaaa aaataacaca gagctccagc agccccggag aagccggata 


78781
cggcgaccgt aaggcgctga tggtgagccg cgaacacgcg gcgacctcgc gggccagggt 


78841
ggcggagcac gcggtgaact taaccgccgt ggcggccacg tttgggtggg cctcgaacag 


78901
ctgggcgagg tctgcgcccg ggggctcggg tgagcggcga gtcttcagcg cctcgagggc 


78961
ctgtgaggac gccggaacca tgggcccgtc gtcctcgccc gcctcggcga ccggcggccc 


79021
ggccgggtcg gggggtgccg aggcgaggac aggctccgga acggaggcgg ggaccgcggc 


79081
cccgacgggg gttttgcctt tgggggtgga tttcttcttg gttttggcag ggggggccga 


79141
gcgtttcgtt ttctcccccg aagtcaggtc ttcgacgctg gaaggcggag tccaggtggg 


79201
tcggcggcgc ttgggaaggc cggccgagta gcgtgcccgg tgccgaccaa ccgggacgac 


79261
gcccatctcc aggacccgca tgtcgtcgtc atcttcttcg gccgcctctg cggcgggggt 


79321
cttgggggcg gagggaggcg gtggtgggat cgcggagggt gggtcggcgg aggggggatc 


79381
cgtgggtggg gtacccttta gggccaccgc ccatacatcg tcgggcgccc gattcgggcg 


79441
cttggcctct ggttttgccg acggaccggc cgtcccccgg gatgtctcgg aggccctgtc 


79501
gtcgcgacgg gcccgggtcg gtggcggcga ctgggcggct gtgggcgggt gtggccccgg 


79561
cccccctccc ccctcccggg ggcccacgcc gacgcagggc tcccccaggc ccgcgatctc 


79621
gccccgcagg gggtgcgtga tggccacgcg ccgttcgctg aacgcttcgt cctgcatgta 


79681
agtctcgctg gccccgtaaa gatgcagagc cgcggccgtc aagtccgcag gagccgcggg 


79741
ttccgggccc gacggcacga aaaacaccat ggctcccgcc caccgtacgt ccgggcgatc 


79801
gcgggtgtaa tacgtcaggt atggatacat gtcccccgcc cgcactttgg cgatgaacgc 


79861
gggggtgccc tccggaaggc catgcgggtc aaaaaggtat gcggtgtcgc cgtccctgaa 


79921
cagccccatc cctagggggc caatggttag gagcgtgtac gacagggggc gcagggccca 


79981
cgggccggcg aagaacgtgt gtgcggggca ttgtgtctcc agcaggcctg ccgcgggctc 


80041
cccgaagaag cccacctcgc cgtatacgcg cgagaagaca cagcgcagtc cgccgcgcgc 


80101
ccctgggtac tcgaggaagt tggggagctc gacgatcgaa cacatgcgcg gcggcccagg 


80161
gcccgcagtc gcgcgcgtcc actcgccccc ctcgaccaaa catccctcga tggcctccgc 


80221
ggacagaacg tcgcgagggc ccacatcaaa tatgaggctg agaaaggaca gcgacgagcg 


80281
catgcacgat accgaccccc ccggctccag gtcgggcgcg aactggttcc gagcaccggt 


80341
gaccacgatg tcgcgatccc ccccgcgttc catcgtggag tgcggtgggg tgcccgcgat 


80401
catatgtgcc ctgcgggcca gagacccggc ctgtttatgg accggacccc cggggttagt 


80461
gttgtttccg ccacccacgc ccccgtacca tggccccggt tcccctgatt aggctacgag 


80521
tcgcggtgat cgcttcccaa aaaccgagct gcgtttgtct gtcttggtct tccccccccc 


80581
cagcccgcac accataacac cgagaacaac acacgggggt gggcggaaca taataaagct 


80641
ttattggtaa ctagttaacg gcaagtccgt gggtggcgcg acggtgtcct ccgggatcat 


80701
ctcgtcgtcc tcgacggggg tgttggaatg aggcgcctcc tcgcggtcca cctggcgtgg 


80761
gccgtgccca taggcctccg gcttctgtgc gtccatgggc gtaggcgcgg ggagactgtt 


80821
tccggcgtcg cggacctcca ggtccctggg agcctccggt ccggctaacg gacgaaacgc 


80881
ggaagcgcga aacacgccgt cggtgacccg caggagctcg ttcatcagta accaatccat 


80941
actcagcgta acggccagcc cctggcgaga cagatccacg gagtccggaa ccgcggtcgt 


81001
ctggcccagg gggccgaggc tgtagtcccc ccaggcccct aggtcgcgac ggctcgtaag 


81061
cacgacgcgg tcggccgcgg ggctttgcgg gggggcgtcc tcgggcgcat gcgccattac 


81121
ctctcggatg gccgcggcgc gctggtcggc cgagctgacc aagggcgcca cgaccacggc 


81181
gcgctccgtc tgcaggccct tccacgtgtc gtggagttcc tggacaaact cggccacggg 


81241
ctcgggtccc gcggccgcgc gcgcggcttg atagcaggcc gacagacgcc gccagcgcgc 


81301
tagaaactga cccatgaaac aaaacccggg gacctggtct cccgacagca gcttcgacgc 


81361
ccgggcgtga atgccggaca cgacggacag aaacccgtga atttcgcgcc ggaccacggc 


81421
cagcacgttg tcctcgtgcg acacctgggc cgccagctcg tcgcacaccc ccaggtgcgc 


81481
cgtggtttcg gtgatgacgg aacgcaggct cgcgagggac gcgaccagcg cgcgcttggc 


81541
gtcgtgatac atgctgctgt actgactcac cgcgtccccc atggcctcgg ggggccaggg 


81601
ccccaggcgg tcgggcgtgt ccccgaccac cgcatacagg cggcgcccgt cgctctcgaa 


81661
ccgacactcg aaaaaggcgg agagcgtgcg catgtgcagc cgcagcagca cgatggcgtc 


81721
ctccagttgg cgaatcaggg ggtcggcgcg ctcggcgagg tcctgcagca ccccccgggc 


81781
agccagggcg tacatgctaa tcaacaggag gctggtgccc acctcggggg gcgggggggg 


81841
ctgcagttgg accaggggcc gcagctgctc gacggcaccc ctggagatca cgtacagctc 


81901
ccggagcagc tgctctatgt tgtcggccat ctgcatagtg gggccgaggc cgccccgggc 


81961
ggccggttcg aggagagtga tcagcgcgcc cagtttggtg cgatggccct cgaccgtggg 


82021
gagatagccc agcccaaagt cccgggccca ggccaacaca cgcagggcga actcgaccgg 


82081
gcggggaagg taggccgcgc tacacgtggc cctcagcgcg tccccaacca ccagggccag 


82141
aacgtagggg acgaagcccg ggtcggcgag gacgttgggg tgaatgccct cgagggcggg 


82201
gaagcggatc tgggtcgccg cggccaggtg gacagagggg gcatggctgg gctgcccgac 


82261
ggggagaagc gcggacagcg gcgtggccgg ggtggtgggg gtgatgtccc agtgggtctg 


82321
accatacacg tcgatccaga tgagcgccgt ctcgcggaga aggctgggtt gaccggaact 


82381
aaagcggcgc tcggccgtct caaactcccc cacgagcgcc cgccgcaggc tcgccagatg 


82441
ttccgtcggc acggccggcc ccatgatacg cgccagcgtc tggctcagaa cgccccccga 


82501
caggccgacc gcctcacaga gccgcccgtg cgtgtgctcg ctggcgccct ggacccgcct 


82561
gaaagttttt acgtagttgg catagtaccc gtattcccgc gccagaccaa acacgttcga 


82621
ccccgcgagg gcaatgcacc caaagagctg ctggacttcg ccgagtccgt ggccggcggg 


82681
cgtccgcgcg gggacgcccg ccgccagaaa cccctccagg gccgaaaggt agtgcgtgca 


82741
gtgcgagggc gtgaacccag cgtcgatcag ggtgttgatc accacggagg gcgaattggt 


82801
attctggatc aacgtccacg tctgctgcag cagagccagc agccgctgct gggcgccggc 


82861
ggagggctgc tccccgagct gcagcaggct ggagacggca ggctggaaga ctgccagtgc 


82921
cgacgaactc aggaacggca cgtcgggatc aaacacggcc acgtccgtcc gcacgcgcgc 


82981
cattagcgtc cccgggggcg cacaggccga gcgcgggctg acgcggctga gggccgtcga 


83041
cacgcgcacc tcctcgcggc tgcgaaccat cttgttggcc tccagtggcg gaatcattat 


83101
ggccgggtcg atctcccgca cggtgtgctg aaactgcgcc aacaggggcg gcgggaccac 


83161
agccccccgc tcgggggtcg tcaggtactc gtccaccagg gccaacgtaa agagggcccg 


83221
tgtgagggga gtgagggtcg cgtcgtctat gcgctggagg tgcgccgaga acagcgtcac 


83281
ccgattactc accagggcca agaaccggag gccctcttgc acgaacgggg cggggaagag 


83341
caggctgtac gccggggtgg taaggttcgc gctgggctgc cccaacggga ccggcgccag 


83401
cttgagcgac gtctccccaa gggcctcgat ggaggtccgc gggctcatgg ccaagcagct 


83461
cttggtgacg gtttgccagc ggtctatcca ctccacggcg cactggcgga cgcggaccgg 


83521
ccccagggcc gccgcggtgc gcaggccggc ggactccagc gcatgggacg tgtcggagcc 


83581
ggtgaccgcg aggatggtgt ccttgatgac ctccatctcc cggaaggcct ggtcgggggc 


83641
ctcggggaga gccaccacca agcggtgtac gagcaacccg gggaggttct cggccaagag 


83701
cgccgtctcc ggaagcccgt gggcccggtg gagcgcgcac aggtgttcca gcagcggccg 


83761
ccagcatgcc cgcgcgtctg ccggggcgat ggccgttccc gacaacagaa acgccgccat 


83821
ggcggcgcgc agcttggccg tggccagaaa cgccgggtcg tccgccccgt ttgccgtctc 


83881
ggccgtgggg gttggcggtt ggcgaaggcc ggctaggctc gccaataggc gctgcatagg 


83941
tccgtccgag ggcggaccgg cgggtgaggt cgtgacgacg ggggcctcgg acgggagacc 


84001
gcggtctgcc atgacgcccg gctcgcgtgg gtgggggaca gcgtagacca acgacgagac 


84061
cgggcgggaa tgactgtcgt gcgctgtagg gagcggcgaa ttatcgatcc cccgcggccc 


84121
tccaggaacc ccgcaggcgt tgcgagtacc ccgcgtcttc gcggggtgtt atacggccac 


84181
ttaagtcccg gcatcccgtt cgcggaccca ggcccggggg attgtccgga tgtgcgggca 


84241
gcccggacgg cgtgggttgc ggactttcgg cggggcggcc caaatggccc tttaaacgtg 


84301
tgtatacgga cgcgccgggc cagtcggcca acacaaccca ccggaggcgg tagccgcgtt 


84361
tggctgtggg gtgggtggtt ccgccttgcg tgagtgtcct ttcgaccccc cccctccccc 


84421
gggtcttgct aggtcgcgat ctgtggtcgc aatgaagacc aatccgctac ccgcaacccc 


84481
ttccgtgtgg ggcgggagta ccgtggaact cccccccacc acacgcgata ccgcggggca 


84541
gggcctgctt cggcgcgtcc tgcgcccccc gatctctcgc cgcgacggcc cagtgctccc 


84601
cagggggtcg ggaccccgga gggcggccag cacgctgtgg ttgcttggcc tggacggcac 


84661
agacgcgccc cctggggcgc tgacccccaa cgacgatacc gaacaggccc tggacaagat 


84721
cctgcggggc accatgcgcg ggggggcggc cctgatcggc tccccgcgcc atcatctaac 


84781
ccgccaagtg atcctgacgg atctgtgcca acccaacgcg gatcgtgccg ggacgctgct 


84841
tctggcgctg cggcaccccg ccgacctgcc tcacctggcc caccagcgcg ccccgccagg 


84901
ccggcagacc gagcggctgg gcgaggcctg gggccagctg atggaggcga ccgccctggg 


84961
gtcggggcga gccgagagcg ggtgcacgcg cgcgggcctc gtgtcgttta acttcctggt 


85021
ggcggcgtgt gccgcctcgt acgacgcgcg cgacgccgcc gatgcggtac gggcccacgt 


85081
cacggccaac taccgcggga cgcgggtggg ggcgcgcctg gatcgttttt ccgagtgtct 


85141
gcgcgccatg gttcacacgc acgtcttccc ccacgaggtc atgcggtttt tcggggggct 


85201
ggtgtcgtgg gtcacccagg acgagctagc gagcgtcacc gccgtgtgcg ccgggcccca 


85261
ggaggcggcg cacaccggcc acccgggccg gccccgctcg gccgtgatcc tcccggcgtg 


85321
tgcgttcgtg gacctggacg ccgagctggg gctggggggc ccgggcgcgg cgtttctgta 


85381
cctggtattc acttaccgcc agcgccggga ccaggagctg tgttgtgtgt acgtgatcaa 


85441
gagccagctc cccccgcgcg ggttggagcc ggccctggag cggctgtttg ggcgcctccg 


85501
gatcaccaac acgattcacg gcaccgagga catgacgccc ccggccccaa accgaaaccc 


85561
cgacttcccc ctcgcgggcc tggccgccaa tccccaaacc ccgcgttgct ctgctggcca 


85621
ggtcacgaac ccccagttcg ccgacaggct gtaccgctgg cagccggacc tgcgggggcg 


85681
ccccaccgca cgcacctgta cgtacgccgc ctttgcagag ctcggcatga tgcccgagga 


85741
tagtccccgc tgcctgcacc gcaccgagcg ctttggggcg gtcagcgtcc ccgttgtcat 


85801
cctggaaggc gtggtgtggc gccccggcga gtggcgggcc tgcgcgtgag cgtagcaaac 


85861
gccccgccca cacaacgctc cgcccccaac cccttccccg ctgtcactcg ttgttcgttg 


85921
acccggacgt ccgccaaata aagccactga aacccgaaac gcgagtgttg taacgtcctt 


85981
tgggcgggag gaagccacaa aatgcaaatg ggatacatgg aaggaacaca cccccgtgac 


86041
tcaggacatc ggcgtgtcct tttgggtttc actgaaactg gcccgcgccc cacccctgcg 


86101
cgatgtggat aaaaagccag cgcgggtggt ttagggtacc acaggtgggt gctttggaaa 


86161
cttgtcggtc gccgtgctcc tgtgagcttg cgtccctccc cggtttcctt tgcgctcccg 


86221
ccttccggac ctgctctcgc ctatcttctt tggctctcgg tgcgattcgt caggcagtgg 


86281
ccttgtcgaa tctcgacccc accactcgcc ggacccgccg acgtcccctc tcgagcccgc 


86341
cgaaacccgc cgcgtctgtt gaaatggcca gccgccccgc cgcatcctct cccgtcgaag 


86401
cgcgggcccc ggttggggga caggaggccg gcggccccag cgcagccacc cagggggagg 


86461
ccgccggggc ccctctcgcc cgcggccacc acgtgtactg ccagcgagtc aatggcgtga 


86521
tggtgctttc cgacaagacg cccgggtccg cgtcctaccg catcagcgat agcaactttg 


86581
tccaatgtgg ttccaactgc accatgatca tagacggaga cgtggtgcgc gggcgccccc 


86641
aggacccggg ggccgcggca tcccccgctc ccttcgttgc ggtgacaaac atcggagccg 


86701
gcagcgacgg cgggaccgcc gtcgtggcat tcgggggaac cccacgtcgc tcggcgggga 


86761
cgtctaccgg tacccagacg accgacgtcc ccaccgaggc ccttgggggc ccccctcctc 


86821
ctccccgctt caccctgggt ggcggctgtt gttcctgtcg cgacacacgg cgccgctctg 


86881
cggtattcgg gggggagggg gatcccgtcg gccccgcgga gttcgtctcg gacgaccggt 


86941
cgtccgattc cgactcggat gactcggagg acaccgactc ggagacgctg tcacacgcct 


87001
cctcggacgt gtccggcggg gccacgtacg acgacgccct tgactccgat tcgtcatcgg 


87061
atgactccct gcagatagat ggccccgtgt gtcgcccgtg gagcaatgac accgcgcccc 


87121
tggatgtttg ccccgggacc cccggcccgg gcgccgacgc cggtggtccc tcagcggtag 


87181
acccacacgc accgacgcca ggggccggcg ctggtcttgc ggccgatccc gccgtggccc 


87241
gggacgacgc ggaggggctt tcggaccccc ggccacgtct gggaacgggc acggcctacc 


87301
ccgtccccct ggaactcacg cccgagaacg cggaggccgt ggcgcgcttt ctgggagatg 


87361
ccgtgaaccg cgaacccgcg ctcatgctgg agtacttttg ccggtgcgcc cgcgaggaaa 


87421
ccaagcgtgt cccccccagg acattctgca gcccccctcg cctcacggag gacgactttg 


87481
ggcttctcaa ctacgcgctc gtggagatgc agcgcctgtg tctggacgtt cctccggtcc 


87541
tgccgaacgc atacatgccc tattatctca gggagtatgt gacgcggctg gtcaacgggt 


87601
tcaagccgct ggtgagccgg tccgctcgcc tttaccgcat cctgggggtt ctggtgcacc 


87661
tgcggatccg gacccgggag gcctcctttg aggagtggct gcgatccaag gaagtggccc 


87721
tggactttgg cctgacggaa aggcttcgcg agcacgaagc ccagctggtg atcctggccc 


87781
aggctctgga ccattacgac tgtctgatcc acagcacacc gcacacgctg gtcgagcggg 


87841
ggctgcaatc ggccctgaag tatgaggagt tttacctaaa gcgctttggc gggcactaca 


87901
tggagtccgt cttccagatg tacacccgca tcgccggctt tttggcctgc cgggccacgc 


87961
gcggcatgcg ccacatcgcc ctggggcgag aggggtcgtg gtgggaaatg ttcaagttct 


88021
ttttccaccg cctctacgac caccagatcg taccgtcgac ccccgccatg ctgaacctgg 


88081
ggacccgcaa ctactacacc tccagctgct acctggtaaa cccccaggcc accacaaaca 


88141
aggcgaccct gcgggccatc accagcaacg tcagcgccat cctcgcccgc aacgggggca 


88201
tcgggctatg cgtgcaggcg tttaacgact ccggccccgg gaccgctagc gtcatacccg 


88261
ccctcaaggt cctcgactcg ctggtggcgg cgcacaacaa agagagcgcg cgtccaaccg 


88321
gcgcgtgcgt gtacctggag ccgtggcaca ccgacgtgcg ggccgtgctc cggatgaagg 


88381
gggtcctcgc cggcgaagag gcccagcgct gcgacaatat cttcagcgcc ctctggatgc 


88441
cagacctgtt tttcaagcgc ctgattcgcc acctggacgg cgagaagaac gtcacatgga 


88501
ccctgttcga ccgggacacc agcatgtcgc tcgccgactt tcacggggag gagttcgaga 


88561
agctctacca gcacctcgag gtcatggggt tcggcgagca gatacccatc caggagctgg 


88621
cctatggcat tgtgcgcagt gcggccacga ccgggagccc cttcgtcatg ttcaaagacg 


88681
cggtgaaccg ccactacatc tacgacaccc agggggcggc catcgccggc tccaacctct 


88741
gcaccgagat cgtccatccg gcctccaagc gatccagtgg ggtctgcaat ctgggaagcg 


88801
tgaatctggc ccgatgcgtc tccaggcaga cgtttgactt tgggcggctc cgcgacgccg 


88861
tgcaggcgtg cgtgctgatg gtgaacatca tgatcgacag cacgctacaa cccacgcccc 


88921
agtgcacccg cggcaacgac aacctgcggt ccatgggaat cggcatgcag ggcctgcaca 


88981
cggcctgcct gaagctgggg ctggatctgg agtctgtcga atttcaggac ctgaacaaac 


89041
acatcgccga ggtgatgctg ctgtcggcga tgaagaccag caacgcgctg tgcgttcgcg 


89101
gggcccgtcc cttcaaccac tttaagcgca gcatgtatcg cgccggccgc tttcactggg 


89161
agcgctttcc ggacgcccgg ccgcggtacg agggcgagtg ggagatgcta cgccagagca 


89221
tgatgaaaca cggcctgcgc aacagccagt ttgtcgcgct gatgcccacc gccgcctcgg 


89281
cgcagatctc ggacgtcagc gagggctttg cccccctgtt caccaacctg ttcagcaagg 


89341
tgacccggga cggcgagacg ctgcgcccca acacgctcct gctaaaggaa ctggaacgca 


89401
cgtttagcgg gaagcgcctc ctggaggtga tggacagtct cgacgccaag cagtggtccg 


89461
tggcgcaggc gctcccgtgc ctggagccca cccaccccct ccggcgattc aagaccgcgt 


89521
ttgactacga ccagaagttg ctgatcgacc tgtgtgcgga ccgcgccccc tacgtcgacc 


89581
atagccaatc catgaccctg tatgtcacgg agaaggcgga cgggaccctc ccagcctcca 


89641
ccctggtccg ccttctggtc cacgcatata agcgcggact aaaaacaggg atgtactact 


89701
gcaaggttcg caaggcgacc aacagcgggg tctttggcgg cgacgacaac attgtctgca 


89761
cgagctgcgc gctgtgaccg acaaaccccc tccgcgccag gcccgccgcc actgtcgtcg 


89821
ccgtcccacg cgctcccccg ctgccatgga ttccgcggcc ccagccctct cccccgctct 


89881
gacggcccat acgggccaga gcgcgccggc ggacctggcg atccagattc caaagtgccc 


89941
cgaccccgag aggtacttct acacctccca gtgtcccgac attaaccacc tgcgctccct 


90001
cagcatcctt aaccgctggc tggaaaccga gcttgttttc gtgggggacg aggaggacgt 


90061
ctccaagctt tccgagggcg agctcagctt ttaccgcttc ctcttcgctt tcctgtcggc 


90121
cgccgacgac ctggttacgg aaaacctggg cggcctctcc ggcctgtttg agcagaagga 


90181
cattctccac tactacgtgg agcaggaatg catcgaagtc gtacactcgc gcgtgtacaa 


90241
catcatccag ctggtgcttt ttcacaacaa cgaccaggcg cgccgcgagt acgtggccgg 


90301
caccatcaac cacccggcca tccgcgccaa ggtggactgg ctggaagcgc gggtgcggga 


90361
atgcgcctcc gttccggaaa agttcatcct catgatcctc atcgagggca tcttttttgc 


90421
cgcctcgttt gccgccatcg cctaccttcg caccaacaac cttctgcggg tcacctgcca 


90481
gtcaaacgac ctcatcagcc gggacgaggc cgtgcacacg acggcctcgt gttacatcta 


90541
caacaactac ctcggcgggc acgccaagcc cccgcccgac cgcgtgtacg ggctgttccg 


90601
ccaggcggtc gagatcgaga tcggatttat ccgatcccag gcgccgacgg acagccatat 


90661
cctgagcccg gcggcgctgg cggccatcga aaactacgtg cgattcagcg cggatcgcct 


90721
gttgggcctt atccacatga agccactgtt ttccgcccca ccccccgacg ccagctttcc 


90781
gctgagcctc atgtccaccg acaaacacac caattttttc gagtgtcgca gcacctccta 


90841
cgccggggcg gtcgtcaacg atctgtgagg gtcgcggcgc gcttctaccc gtgtttgccc 


90901
ataataaacc tctgaaccaa actttgggtc tcattgtgat tcttgtcagg gacgcggggg 


90961
tgggagagga taaaaggcgg cgcaaaaagc agtaaccagg tccgtccaga ttctgagggc 


91021
ataggatacc ataattttat tggtgggtcg tttgttcggg gacaagcgcg ctcgtctgac 


91081
gtttgggcta ctcgtcccag aatttggcca ggacgtcctt gtagaacgcg ggtggggggg 


91141
cctgggtccg cagctgctcc agaaacctgt cggcgatatc aggggccgtg atatgccggg 


91201
tcacaataga tcgcgccagg ttttcgtcgc ggatgtcctg gtagataggc aggcgtttca 


91261
gaagagtcca cggcccccgc tccttggggc cgataagcga tatgacgtac ttaatgtagc 


91321
ggtgttccac cagctcggtg atggtcatgg gatcggggag ccagtccagg gactctgggg 


91381
cgtcgtggat gacgtggcgt cgccggctgg ccacataact gcggtgctct tccagcagct 


91441
gcgcgttcgg gacctggacg agctcgggcg gggtgagtat ctccgaggag gacgacctgg 


91501
ggccggggtg gcccccggta acgtcccggg gatccagggg gaggtcctcg tcgtcttcgt 


91561
atccgccggc gatctgttgg gttagaattt cggtccacga gacgcgcatc tcggtgccgc 


91621
cggcggccgg cggcaaaggg ggcctggttt ccgtggagcg cgagctggtg tgttcccggc 


91681
ggatggcccg ccgggtctga gagcgactcg ggggggtcca gtgacattcg cgcagcacat 


91741
cctccacgga ggcgtaggtg ttattgggat ggaggtcggt gtggcagcgg acaaagaggg 


91801
ccaggaactg ggggtagctc atcttaaagt actttagtat atcgcgacag ttgatcgtgg 


91861
gaatgtagca ggcgctaata tccaacacaa tatcacagcc catcaacagg aggtcagtgt 


91921
ctgtggtgta cacgtacgcg accgtgttgg tgtgatagag gttggcgcag gcatcgtccg 


91981
cctccagctg acccgagtta atgtaggcgt accccagggc ccggagaacg cgaatacaga 


92041
acagatgcgc cagacgcagg gccggcttcg agggcgcggc ggacggcagc gcggctccgg 


92101
acccggccgt cccccgggtc cccgaggcca gagaggtgcc gcgccggcgc atgttggaaa 


92161
aggcagagct gggtctggag tcggtgatgg gggaaggcgg tggagaggcg tccacgtcac 


92221
tggcctcctc gtccgtccgg cattgggccg tcgtgcgggc caggatggcc ttggctccaa 


92281
acacaaccgg ctccatacaa ttgaccccgc gatcggtaac gaagatgggg aaaagggact 


92341
tttgggtaaa cacctttaat aagcgacaga ggcagtgtag cgtaatggcc tcgcggtcgt 


92401
aactggggta tcggcgctga tatttgacca ccaacgtgta catgacgttc cacaggtcca 


92461
cggcgatggg ggtgaagtac ccggccgggg ccccaaggcc ctggcgcttg accagatggt 


92521
gtgtgtgggc aaacttcatc atcccgaaca aacccatgtc aggtcgattg taactgcgga 


92581
tcggcctaac taaggcgtgg ttggtgcgac ggtccgggac acccgagcct gtctctctgt 


92641
gtatggtgac ccagacaaca acaccgacac aagaggacaa taatccgtta ggggacgctc 


92701
tttataattt cgatggccca actccacgcg gattggtgca gcaccctgca tgcgccggtg 


92761
tgggccaaac ttccccccgc tcattgcctc ttccaaaagg gtgtggccta acgagctggg 


92821
ggcgtattta atcaggctag cgcggcgggc ctgccgtagt ttctggctcg gtgagcgacg 


92881
gtccggttgc ttgggtcccc tggctgccag caaaacccca ccctcgcagc ggcatacgcc 


92941
ccctccgcgt cccgcacccg agaccccggc ccggctgccc tcaccaccga agcccacctc 


93001
gtcactgtgg ggtgttccca gcccgcattg ggatgacgga ttcccctggc ggtgtggccc 


93061
ccgcctcccc cgtggaggac gcgtcggacg cgtccctcgg gcagccggag gagggggcgc 


93121
cctgccaggt ggtcctgcag ggcgccgaac ttaatggaat cctacaggcg tttgccccgc 


93181
tgcgcacgag ccttctggac tcgcttctgg ttatgggcga ccggggcatc cttatccata 


93241
acacgatctt tggggagcag gtgttcctgc ccctggaaca ctcgcaattc agtcggtatc 


93301
gctggcgcgg acccacggcg gcgttcctgt ctctcgtgga ccagaagcgc tccctcctga 


93361
gcgtgtttcg cgccaaccag tacccggacc tacgtcgggt ggagttggcg atcacgggcc 


93421
aggccccgtt tcgcacgctg gttcagcgca tatggacgac gacgtccgac ggcgaggccg 


93481
ttgagctagc cagcgagacg ctgatgaagc gcgaactgac gagctttgtg gtgctggttc 


93541
cccagggaac ccccgacgtt cagttgcgcc tgacgaggcc gcagctcacc aaggtcctta 


93601
acgcgaccgg ggccgatagt gccacgccca ccacgttcga gctcggggtt aacggcaaat 


93661
tttccgtgtt caccacgagt acctgcgtca catttgctgc ccgcgaggag ggcgtgtcgt 


93721
ccagcaccag cacccaggtc cagatcctgt ccaacgcgct caccaaggcg ggccaggcgg 


93781
ccgccaacgc caagacggtg tacggggaaa atacccatcg caccttctct gtggtcgtcg 


93841
acgattgcag catgcgggcg gtgctccggc gactgcaggt cgccgggggc accctcaagt 


93901
tcttcctcac gacccccgtc cccagtctgt gcgtcaccgc caccggtccc aacgcggtat 


93961
cggcggtatt tctcctgaaa ccccagaaga tttgcctgga ctggctgggt catagccagg 


94021
ggtctccttc agccgggagc tcggcctccc gggcctctgg gagcgagcca acagacagcc 


94081
aggactccgc gtcggacgcg gtcagccacg gcgatccgga agacctcgat ggcgctgccc 


94141
gggcgggaga ggcgggggcc tcgcacgcct gtccgatgcc gtcgtcgacc acgcgggtca 


94201
ctcccacgac caagcggggg cgctcggggg gcgaggatgc gcgcgcggac acggccctaa 


94261
agaaacctaa gacggggtcg cccaccgcac ccccgcccac agatccagtc cccctggaca 


94321
cggaggacga ctccgatgcg gcggacggga cggcggcccg tcccgccgct ccagacgccc 


94381
ggagcggaag ccgttacgcg tgttactttc gcgacctccc gaccggagaa gcaagccccg 


94441
gcgccttctc cgccttccgg gggggccccc aaaccccgta tggttttgga ttcccctgac 


94501
ggggcggggc cttggcggcc gcccaactct cgcaccatcc cgggttaatg taaataaact 


94561
tggtattgcc caacactctc ccgcgtgtcg cgtgtggttc atgtgtgtgc ctggcgtccc 


94621
ccaccctcgg gttcgtgtat ttcctttccc tgtccttata aaagccgtat gtggggcgct 


94681
gacggaacca ccccgcgtgc catcacggcc aaggcgcggg atgctccgca acgacagcca 


94741
ccgggccgcg tccccggagg acggccaggg acgggtcgac gacggacggc cacacctcgc 


94801
gtgcgtgggg gccctggcgc gggggttcat gcatatctgg cttcaggccg ccacgctggg 


94861
ttttgcggga tcggtcgtta tgtcgcgcgg gccgtacgcg aatgccgcgt ctggggcgtt 


94921
cgccgtcggg tgcgccgtgc tgggctttat gcgcgcaccc cctcccctcg cgcggcccac 


94981
cgcgcggata tacgcctggc tcaaactggc ggccggtgga gcggcccttg ttctgtggag 


95041
tctcggggag cccggaacgc agccgggggc cccgggcccg gccacccagt gcctggcgct 


95101
gggcgccgcc tatgcggcgc tcctggtgct cgccgatgac gtctatccgc tctttctcct 


95161
cgccccgggg cccctgttcg tcggcaccct ggggatggtc gtcggcgggc tgacgatcgg 


95221
aggcagcgcg cgctactggt ggatcggtgg gcccgccgcg gccgccttgg ccgcggcggt 


95281
gttggcgggc ccgggggcga ccaccgccag ggactgcttc tccagggcgt gccccgacca 


95341
ccgccgcgtc tgcgtcatcg tcgcaggcga gtctgtttcc cgccgccccc cggaggaccc 


95401
agagcgaccc ggggaccccg ggccaccgtc ccccccgaca ccccaacgat cccaggggcc 


95461
gccggccgat gaggtcgcac cggccggggt agcgcggccc gaaaacgtct gggtgcccgt 


95521
ggtcaccttt ctgggggcgg gcgcgctcgc cgtcaagacg gtgcgagaac atgcccggga 


95581
aacgccgggc ccgggcctgc cgctgtggcc ccaggtgttt ctcggaggcc atgtggcggt 


95641
ggccctgacg gagctgtgtc aggcgcttat gccctgggac cttacggacc cgctgctgtt 


95701
tgttcacgcc ggactgcagg tcatcaacct cgggttggtg tttcggtttt ccgaggttgt 


95761
cgtgtatgcg gcgctagggg gtgccgtgtg gatttcgttg gcgcaggtgc tggggctccg 


95821
gcgtcgcctg cacaggaagg accccgggga cggggcccgg ttggcggcga cgcttcgggg 


95881
cctcttcttc tccgtgtacg cgctggggtt tggggtgggg gcgctgctgt gccctccggg 


95941
gtcaacgggc gggtggtcgg gcgattgata tatttttcaa taaaaggcat tagtcccgaa 


96001
gaccgccggt gtgtgatgat ttcgccataa cacccaaacc ccggatgggg cccgggtata 


96061
aattccggaa ggggacacgg gctaccctca ctaccgaggg cgcttggtcg ggaggccgca 


96121
tcgaacgcac acccccatcc ggtggtccgt gtggaggtcg tttttcagtg cccggtctcg 


96181
ctttgccggg aacgctagcc gatccctcgc gagggggagg cgtcgggcat ggccccgggg 


96241
cgggtgggcc ttgccgtggt cctgtggagc ctgttgtggc tcggggcggg ggtggccggg 


96301
ggctcggaaa ctgcctccac cgggcccacg atcaccgcgg gagcggtgac gaacgcgagc 


96361
gaggccccca catcggggtc ccccgggtca gccgccagcc cggaagtcac ccccacatcg 


96421
accccaaacc ccaacaatgt cacacaaaac aaaaccaccc ccaccgagcc ggccagcccc 


96481
ccaacaaccc ccaagcccac ctccacgccc aaaagccccc ccacgtccac ccccgacccc 


96541
aaacccaaga acaacaccac ccccgccaag tcgggccgcc ccactaaacc ccccgggccc 


96601
gtgtggtgcg accgccgcga cccattggcc cggtacggct cgcgggtgca gatccgatgc 


96661
cggtttcgga attccacccg catggagttc cgcctccaga tatggcgtta ctccatgggt 


96721
ccgtcccccc caatcgctcc ggctcccgac ctagaggagg tcctgacgaa catcaccgcc 


96781
ccacccgggg gactcctggt gtacgacagc gcccccaacc tgacggaccc ccacgtgctc 


96841
tgggcggagg gggccggccc gggcgccgac cctccgttgt attctgtcac cgggccgctg 


96901
ccgacccagc ggctgattat cggcgaggtg acgcccgcga cccagggaat gtattacttg 


96961
gcctggggcc ggatggacag cccgcacgag tacgggacgt gggtgcgcgt ccgcatgttc 


97021
cgccccccgt ctctgaccct ccagccccac gcggtgatgg agggtcagcc gttcaaggcg 


97081
acgtgcacgg ccgccgccta ctacccgcgt aaccccgtgg agtttgtctg gttcgaggac 


97141
gaccgccagg tgtttaaccc gggccagatc gacacgcaga cgcacgagca ccccgacggg 


97201
ttcaccacag tctctaccgt gacctccgag gctgtcggcg gccaggtccc cccgcggacc 


97261
ttcacctgcc agatgacgtg gcaccgcgac tccgtgatgt tctcgcgacg caatgccacc 


97321
gggctggccc tggtgctgcc gcggccaacc atcaccatgg aatttggggt ccggcatgtg 


97381
gtctgcacgg ccggctgcgt ccccgagggc gtgacgtttg cctggttcct gggggacgac 


97441
ccctcaccgg cggctaagtc ggccgttacg gcccaggagt cgtgcgacca ccccgggctg 


97501
gctacggtcc ggtccaccct gcccatttcg tacgactaca gcgagtacat ctgtcggttg 


97561
accggatatc cggccgggat tcccgttcta gagcaccacg gcagtcacca gcccccaccc 


97621
agggacccca ccgagcggca ggtgatcgag gcgatcgagt gggtggggat tggaatcggg 


97681
gttctcgcgg cgggggtcct ggtcgtaacg gcaatcgtgt acgtcgtccg cacatcacag 


97741
tcgcggcagc gtcatcggcg gtaacgcgag acccccccgt taccttttta atatctatat 


97801
agtttggtcc ccctctatcc cgcccaccgc tgggcgctat aaagccgcca ccctctcttc 


97861
cctcaggtca tccttggtcg atcccgaacg acacacggcg tggagcaaaa cgcctccccc 


97921
tgagccgctt tcctaccaac acaacggcat gcctctgcgg gcatcggaac acgcctaccg 


97981
gcccctgggc cccgggacac cccccatgcg ggctcggctc cccgccgcgg cctgggttgg 


98041
cgtcgggacc atcatcgggg gagttgtgat cattgccgcg ttggtcctcg tgccctcgcg 


98101
ggcctcgtgg gcactttccc catgcgacag cggatggcac gagttcaacc tcgggtgcat 


98161
atcctgggat ccgaccccca tggagcacga gcaggcggtc ggcggctgta gcgccccggc 


98221
gaccctgatc ccccgcgcgg ctgccaaaca gctggccgcc gtcgcacgcg tccagtcggc 


98281
aagatcctcg ggctactggt gggtgagcgg agacggcatt cgggcctgcc tgcggctcgt 


98341
cgacggcgtc ggcggtattg accagttttg cgaggagccc gcccttcgca tatgctacta 


98401
tccccgcagt cccgggggct ttgttcagtt tgtaacttcg acccgcaacg cgctggggct 


98461
gccgtgaggc gcgtgtactg cggtctgtct cgtctcctct tctccccttc cctccccctc 


98521
cgcatcccag gatcacaccg gccaacgagg gttggggggg tccggcacgg acccaaaata 


98581
ataaacacac aatcacgtgc gataaaaaga acacgcggtc ccctgtggtg tttttggtta 


98641
tttttattaa atctcgtcga caaacagggg gaaaggggcg tggtctagcg acggcagcac 


98701
gggcggaggc gttcaccggc tccggcgtcc ttcgcgttta agcttggtca ggagggcgct 


98761
cagggcggcg acgttggtcg ggccgtcgtt ggtcagggcg ttggctcgat ggcgggcgag 


98821
gacgggcgag gggctcaacg gcgggggcgg gggtccggtg cggcccgggg gggaaaatag 


98881
ggcggatccc ccccagtcgt acaggggatt ttccgcctca atgtacgggg aggccggcgc 


98941
tgcattcgcc gtgttcacgc agacgttttc gtagacccgc atccatggta tttcctcgta 


99001
gacacgcccc ccgtcctcgc tcacggtctc gtatattgac tcgtcgtcct cgtagggggc 


99061
gtgccgttcg cgggccgagg cggcgtgggt ggctttgcgg cgggcgtcgt cgtcgtcgtc 


99121
gtcggccgtc agatacgtgg cttccatctg gtcgggttct ccctccgggg cgggtcccca 


99181
cacccgtggc cgatcgaggc tccccagaga cgcgcgccgg acaagaaggg ggcacgtcgc 


99241
cgccggcggt cgcctgtcgg gtcccgcgac gttacgggcc gggaggcgcg ggggcacctc 


99301
ccccatgtgc gtgtaatacg tggccggctg tgcggccgca gcggggggct cggcgaccgg 


99361
gtcgtccgca tccggaagcg ggggccccgc gccgtccgca cggcgcctcc ggaaccgccg 


99421
ggtggacggc gcgggggtcg agtgtaggcg aggtcggggg aggggcgggg gctcgttgtc 


99481
gcgccgcgcc cgctgaatct tttcccgaca ggtcccaccc cccgcgcgat gcccccccgg 


99541
gccgcgggcc atgtcgtccg ggggaggccc cgcggaccac gtcgtccggc gagacgccac 


99601
gagccgcagg atggactcgt agtggagcga cggcgccccg ctgcggagca gatccgcggc 


99661
cagggcggcc ccgaaccaag ccttgatgct caactccatc cgggcccagc tgggggcggt 


99721
catcgtgggg aacagggggg cggtggtccg acagaaacgc tcctggctgt ccaccgcggc 


99781
ccgcagatac tcgttgttca ggctgtcggt ggcccagacg ccgtacccgg tgagggtcgc 


99841
gttgatgata tactgggcgt ggtgatggac gatcgacaga acctccaccg tggataccac 


99901
ggtatccacg gtcccgtacg taccgccgct ccgcttgccg gtctgccaca ggttggctag 


99961
gcacgtcagg tggcccagga cgtcgctgac cgccgccctg agcgccatgc actgcatgga 


100021
gccggtcgtg ccgctgggac cccggtccag atggcgcgcg aacgtttccg cgggcgcctc 


100081
cgggctgccg ccgagcggga ggaaccggcg attggaggga ctcagccggt gacatacgtg 


100141
cttgtccgtc gtccacagca tccaggacgc ccaccggtac agcacggaga cgtaggccag 


100201
gagctcgttg agccgcagtg cggtgtcggt gctggggcgg cttgggtccg ccgggcgcat 


100261
aaagaacatg tactgctgaa tccgatggag ggcgtcgcgc aggccggcca cggtggcggc 


100321
gtacttggcc gccgcggccc cgctcttgaa cggggtgcgc gccagcagct ttggcgccag 


100381
ggtgggccgc agcagcacgt gaaggctggg gtcgcagtcg cccacggggt cctcggggac 


100441
gtccaggccg ctgggcacca ccgtctgcag gtacttccag tactgcgtga ggatggcgcg 


100501
gctcaactgg ccgccgggca gctccacctc gcccagcgcc tgggtggcgg ccgaagcgta 


100561
gtgccggatg tactcgtagt gcgggtcgct ggcgagcccg tccacgatca aactctcggg 


100621
aaccgtgttg tgttgccgcg cggccaaccg gacgctgcga tcggtgcagg tcagaaacgc 


100681
cggctgcgcg tcgtcggagc gctgccgcaa ggcgcccacg gccgcgctaa ggagcccctc 


100741
cggggtgggg agcagacacc cgccgaagat gcgccgctcg ggaacgcccg cgttgtcgcc 


100801
gcggatcagg ttggcaggcg tcaggcaccg cgccagccgc agggagctcg cgccgcgcgt 


100861
ccggcgctgc atggtgacgc ccgttcggtc gggacccgcc ggtcggagtt atgccgcgtc 


100921
cagggccatc ggggcgcttt ttatcgggag gagcttatgg gcgtggcggg cctcccagcc 


100981
cggtcgcgcg cctccccgac acgtgcgccc gcagggcggc ggccccctcg tctcccatca 


101041
gcagtttcct aaactgggac atgatgtcca ccacgcggac ccgcgggccc aacacggacc 


101101
cgccgcttac gggggcgggg gggaagggct ccaggtcctt gagcagaaag gcggggtctg 


101161
ccgtcccgga cacgggggcc cggggcgcgg aggaggcggg gcgcagatcc acgtgctccg 


101221
cggccgcgcg gacgtccgcc cagaacttgg cgggggtggt gcgcgcgtac aggggctggg 


101281
tcgctcggag gacacacgcg tagcgcaggg gggtgtacgt gcccacctcg ggggccgtga 


101341
atcccccgtc aaacgcggcc agtgtcacgc acgccaccac ggtgtcggca aagcccagca 


101401
gccgctgcag gacgagcccg gcggccagaa tggcgcgcgt ggtcgcagcg tcgtcccggc 


101461
gccggtgcgc gtccccgcac gcccgggcgt actttaaggt cactgtcgcc agggccgtgt 


101521
gcagcgcgta caccgcagcg cccagcacgg cgttgagccc gctgttggcg agcagccggc 


101581
gcgctgcggt gtcgcccagc gcctcgtgct cggcccccac gaccgcgggg cttcccaggg 


101641
gcagggcgcg aaacagctcc tcccgcgcca cgtccgcaaa ggcggggtgg tgcacgtgcg 


101701
ggtgcaggcg cgcccccacg accaccgaga gccactggac cgtctgctcc gccatcaccg 


101761
ccaacacatc cagcacgcgc cccaggaagg cggcctcccg cgtcaaaacg caccggacgg 


101821
cgtcgggatt gaagcgggcg agcagggccc cggtggccag gtacgtcatg cggccggcat 


101881
agcgggcggc cacgcgacag tcgcggtcca gcagcgcgcg caccccgggc cagtacagca 


101941
gggaccccag cgagctgcga aacaccgcgg cgtcggggcc ggattggggg gacactaacc 


102001
cccccgcgct cagtaacggc acggccgcgg ccccgacggg acgcaacgcc gtgaggctcg 


102061
cgaactgccg cctcagctcg gcagccctgt cgtccaggtc cgacccgcgc gcctctgcgt 


102121
gaaggcgcgt cccgcacacc cacccgttga tggccagccg cacgacggca tccgccaaaa 


102181
agctcatcgc ctgggcgggg ctggtttttg ttcgacgatc cgtcaggtca agaatcccat 


102241
cgcccgtgat ataccaggcc aacgcctcgc cctgctgcag ggtttggcgg aaaaacaccg 


102301
cggggttgtc gggggaggcg aagtgcatga cccccacgcg cgataacccg aacgcgctat 


102361
ccggacacgg gtaaaacccg gccggatgcc ccagggctag ggcggagcgc acggactcgt 


102421
cccacacggc aacctgaggg gccagtcgat ccaacgggaa tgccgcccgg agctccgggc 


102481
ccggcacgcg tccctccaga acctccacct tgggcgggga acgggccccg ccgccgtcct 


102541
ccggcccgac gtcttccggg tagtcgtcct cctcgtactg cagttcctct aggaacagcg 


102601
gcgacggcgc cacccgcgaa ccgccgaccc gccccaaaat agcccgcgcg tcgacgggac 


102661
ccaggtatcc cccctgccgg gcctgcggag gaccgcgggg aacctcatca tcatcgtcca 


102721
ggcgaccgcg caccgactgg ctacgggccg catcgggccc ggggcgctgc cgggacgctc 


102781
ggcgatggga tgagggcggg gcttccgacg cgcgccgtcg tcgggctcgc gggccttccc 


102841
gtcgacggcg cacgggcggc tcgtcgcccg ccatctcctc cagagcctct agctcgctgt 


102901
cgtcatcccc gcggaacacc gcacgcaggt accccatgaa ccccacccca tcgcccgctg 


102961
gctcgtccgc cacgggcgag gcgcgggggc gggtggatgc gcgcctcctg cgccccgcgg 


103021
gttcgcgagc cgacatggtg gcgatagacg cgggttatcg gatgtccgct accccccaaa 


103081
aaagaaaaag accccacagc gcggatggag gtcggggtag gtgccgccgg accccctcgc 


103141
gatgggaatg gacgggagcg acggggccgg cgcaaaaaac gcagtatctc ccgcgaaggc 


103201
tacccgccgc cccagccccc ggccaaatgc ggaaacggtc ccgcgctctc gcctttatac 


103261
gcgggccgcc ctgcgacaca atcacccgtc cgtggtttcg aatctacacg acaggcccgc 


103321
agacgcggct aacacacacg ccggcaaccc agaccccagt gggttggttg cgcggtcccg 


103381
tctcctggct agttctttcc cccaccacca aataatcaga cgacaaccgc aggtttttgt 


103441
aatgtatgtg ctcgtgttta ttgtggatac gaaccgggga cgggagggga aaacccagac 


103501
gggggatgcg ggtccggtcg cgccccctac ccaccgtact cgtcaattcc aagggcatcg 


103561
gtaaacatct gctcaaactc gaagtcggcc atatccagag cgccgtaggg ggcggagtcg 


103621
tggggggtaa atcccggacc cggggaatcc ccgtccccca acatgtccag atcgaaatcg 


103681
tctagcgcgt cggcatgcgc catcgccacg tcctcgccgt ctaagtggag ctcgtccccc 


103741
aggctgacat cggtcggggg ggccgtcgac agtctgcgcg tgtgtcccgc ggggagaaag 


103801
gacaggcgcg gagccgccag ccccgcctct tcgggggcgt cgtcgtccgg gagatcgagc 


103861
aggccctcga tggtagaccc gtaattgttt ttcgtacgcg cgcggctgta cgcgtgttcc 


103921
cgcatgaccg cctcggaggg cgaggtcgtg aagctggaat acgagtccaa cttcgcccga 


103981
atcaacacca taaagtaccc agaggcgcgg gcctggttgc catgcagggt gggaggggtc 


104041
gtcaacggcg cccctggctc ctccgtagcc gcgctgcgca ccagcgggag gttaaggtgc 


104101
tcgcgaatgt ggtttagctc ccgcagccgg cgggcctcga ttggcactcc ccggacggtg 


104161
agcgctccgt tgacgaacat gaagggctgg aacagacccg ccaactgacg ccagctctcc 


104221
aggtcgcaac agaggcagtc aaacaggtcg ggccgcatca tctgctcggc gtacgcggcc 


104281
cataggatct cgcgggtcaa aaatagatac aaatgcaaaa acagaacacg cgccagacga 


104341
gcggtctctc ggtagtacct gtccgcgatc gtggcgcgca gcatttctcc caggtcgcga 


104401
tcgcgtccgc gcatgtgcgc ctggcggtgc agctgccgga cgctggcgcg caggtaccgg 


104461
tacagggccg agcagaagtt ggccaacacg gttcgatagc tctcctcccg cgcccgtagc 


104521
tcggcgtgga agaaacgaga gagcgcttcg tagtagagcc cgaggccgtc gcgggtggcc 


104581
ggaagcgtcg ggaaggccac gtcgccgtgg gcgcgaatgt cgatttgggc gcgttcgggg 


104641
acgtacgcgt ccccccattc caccacatcg ctgggcagcg ttgataggaa tttacactcc 


104701
cggtacaggt cggcgttggt cggtaacgcc gaaaacaaat cctcgttcca ggtatcgagc 


104761
atggtacata gcgcggggcc cgcgctaaag cccaagtcgt cgaggagacg gttaaagagg 


104821
gcggcggggg ggacgggcat gggcggggag ggcatgagct gggcctggct caggcgcccc 


104881
gttgcgtaca gcggaggggc cgccggggtg tttttgggac ccccggccgg gcgggggggt 


104941
ggtggcgaag cgccgtccgc gtccatgtcg gcaaacagct cgtcgaccaa gaggtccatt 


105001
gggtggggtt gatacgggaa agacgatatc gggcttttga tgcgatcgtc cccgcccgcc 


105061
cagagagtgt gggacgcccg acggcgcggg aagagaaaaa cccccaaacg cgttagagga 


105121
ccggacggac cttatggggg gaagtgggca gcgggaaccc cgtccgttcc cgaggaatga 


105181
cagcccgtgg tcgccacccc gcatttaagc aacccgcacg ggccgccccg tacctcgtga 


105241
cttcccccca cattggctcc tgtcacgtga aggcaaaccg agggcggctg tccaacccac 


105301
cccccgccac ccagtcacgg tccccgtcgg attgggaaac aaaggcacgc aacgccaaca 


105361
ccgaatgaac ccctgttggt gctttattgt ctgggtacgg aagtttttca ctcgacgggc 


105421
cgtctggggc gagaagcgga gcgggctggg gctcgaggtc gctcggtggg gcgcgacgcc 


105481
gcagaacgcc ctcgagtcgc cgtggccgcg tcgacgtcct gcaccacgtc tggattcacc 


105541
aactcgttgg cgcgctgaat caggtttttg ccctcgcaga ccgtcacgcg gatggtggtg 


105601
atgccaagga gttcgttgag gtcttcgtct gtgcgcggac gcgacatgtc ccagagctgg 


105661
accgccgcca tccgggcatg catggccgcc aggcgcccaa ccgcggcgca gaagacgcgc 


105721
ttgttaaagc cggccacccg gggggtccat ggcgcgtcgg ggtttggggg ggcggtgcta 


105781
aagtgcagct ttctggccag cccctgcgcg ggtgtcttgg atcgggttgg cgccgtcgac 


105841
gcgggggcgt ctgggagtgc ggcggattct ggctgggccg atttcctgcc gcgggtggtc 


105901
tccgccgccg gggccgcggg ggccttagtc gccacccgct gggttcgggg ggcccggggg 


105961
gcggtggtgg gtgtgcgtcc ggcccctccg gacccagcgg gcggcggagg cgcccgcgca 


106021
ggccccgggg cggacaaaac cgccccggaa acgggacgcc gcgtccgggg gacctccggg 


106081
tgttcgtcgt cttcggatga cgagcccccg tagagggcat aatccgactc gtcgtactgg 


106141
acgaaacgga cctcgcccct tgggcgcgcg cgtgtctgta gggcgccacg gcgggaggtg 


106201
tcaggcggac tatcgggact cgccatacat gaagacgggg tgtagtacag atcctcgtac 


106261
tcatcgcgcg gaacctcccg cggacccgac ttcacggagc ggcgagaggt catggttcca 


106321
cgaacacgct agggtcggat gcgcggacaa ttaggcctgg gttcggacgg cgggggtggt 


106381
gcaggtgtgg agaggtcgag cgataggggc ggcccgggag agaagagagg gtccgcaaaa 


106441
cccactgggg atgcgtgagt ggccctctgt gggcggtggg ggagagtctt ataggaagtg 


106501
catataacca caacccatgg gtctaaccaa tccccagggg ccaagaaaca gacacgcccc 


106561
aaacggtctc ggtttccgcg aggaagggga agtcctggga caccctccac ccccacccct 


106621
caccccacac agggcgggtt caggcgtgcc cggcagccag tagcctctgg cagatctgac 


106681
agacgtgtgc gataatacac acgcccatcg aggccatgcc tacataaaag ggcaccaggg 


106741
cccccggggc agacatttgg ccagcgtttt gggtctcgca ccgcgcgccc ccgatcccat 


106801
cgcgcccgcc ctcctcgccg ggcggctccc cgtgcgggcc cgcgtctccc gccgctaagg 


106861
cgacgagcaa gacaaacaac aggcccgccc gacagaccct tctggggggg cccatcgtcc 


106921
ctaacaggaa gatgagtcag tggggatccg gggcgatcct tgtccagccg gacagcttgg 


106981
gtcgggggta cgatggcgac tggcacacgg ccgtcgctac tcgcgggggc ggagtcgtgc 


107041
aactgaacct ggtcaacagg cgcgcggtgg cttttatgcc gaaggtcagc ggggactccg 


107101
gatgggccgt cgggcgcgtc tctctggacc tgcgaatggc tatgccggct gacttttgtg 


107161
cgattattca cgcccccgcg ctatccagcc cagggcacca cgtaatactg ggtcttatcg 


107221
actcggggta ccgcggaacc gttatggccg tggtcgtagc gcctaaaagg acgcgggaat 


107281
ttgcccccgg gaccctgcgg gtcgacgtga cgttcctgga catcctggcg acccccccgg 


107341
ccctcaccga gccgatttcc ctgcggcagt tcccgcaact ggcgcccccc cctccaaccg 


107401
gggccgggat acgcgcagat ccttggttgg agggggcgct cggggaccca agcgtgactc 


107461
ctgccctacc ggcgcgacgc cgagggcggt ccctcgtcta tgccggcgag ctgacgccgg 


107521
ttcagacgga acacggggac ggcgtacgag aagccatcgc cttccttcca aaacgcgagg 


107581
aggatgccgg tttcgacatt gtcgtccgtc gcccggtcac cgtcccggca aacggcacca 


107641
cggtcgtgca gccatccctc cgcatgctcc acgcggacgc cgggcccgcg gcctgctatg 


107701
tgctggggcg gtcgtcgctc aacgcccgcg gcctcctggt cgttcctacg cgctggctcc 


107761
ccgggcacgt atgtgcgttt gttgtttaca accttacggg ggttcctgtg accctcgagg 


107821
ccggcgccaa ggtcgcccag ctcctggttg cgggggcgga cgctcttcct tggatccccc 


107881
cggacaactt tcacgggacc aaagcgcttc gaaactaccc caggggtgtt ccggactcaa 


107941
ccgccgaacc caggaacccg ccgctcctgg tgtttacgaa cgagtttgac gcggaggccc 


108001
ccccgagcga gcgcgggacc gggggttttg gctctaccgg tatttagccc atagcttggg 


108061
gttcgttccg ggcaataaaa aacgtttgta tctcatcttt cctgtgtgta gttgtttctg 


108121
ttggatgcct gtgggtctat cacacccgcc cctccatccc acaaacacag aacacacggg 


108181
ttggatgaaa acacgcattt attgacccaa aacacacgga gctgctcgag atgggccagg 


108241
gcgaggtgcg gttggggagg ctgtaggtct gggaacggac acgcggggac acgattccgg 


108301
tttggggtcc gggagggcgt cgccgtttcg ggcggcaggc gccagcgtaa cctccggggg 


108361
cggcgtgtgg gggtgcccca aggagggcgc ctcggtcacc ccaagccccc ccgagcgggt 


108421
tcccccggca accccgaagg cggagaggcc aagggcccgt tcggcgatgg ccacatcctc 


108481
catgaccacg tcgctctcgg ccatgctccg aatagcctgg gagacgagca catccgcgga 


108541
cttgtcagcc gcccccacgg acatgtacat ctgcaggatg gtggccatac acgtgtccgc 


108601
caggcgccgc atcttgtcct gatgggccgc cacggccccg tcgatcgtgg gggcctcgag 


108661
cccggggtgg tggcgcgcca gtcgttctag gttcaccatg caggcgtggt acgtgcgggc 


108721
caaggcgcgg gccttcacga ggcgtcgggt gtcgtccagg gaccccaggg cgtcatcgag 


108781
cgtgatgggg gcgggaagta gcgcgttaac gaccaccagg gcctcctgca gccgcggctc 


108841
cgcctccgag ggcggaacgg ccgcgcggat catctcatat tgttcctcgg ggcgcgctcc 


108901
ccagccacat atagccccga gaagagaagc catcgcgggc gggtactggc ccttgggcgc 


108961
gcggacgcaa tggggcagga agacgggaac cgcggggaga ggcgggcggc cgggactccc 


109021
gtggaggtga ccgcgcttta tgcgaccgac gggtgcgtta ttacctcttc gatcgccctc 


109081
ctcacaaact ctctactggg ggccgagccg gtttatatat tcagctacga cgcatacacg 


109141
cacgatggcc gtgccgacgg gcccacggag caagacaggt tcgaagagag tcgggcgctc 


109201
taccaagcgt cgggcgggct aaatggcgac tccttccgag taaccttttg tttattgggg 


109261
acggaagtgg gtgggaccca ccaggcccgc gggcgaaccc gacccatgtt cgtctgtcgc 


109321
ttcgagcgag cggacgacgt cgccgcgcta caggacgccc tggcgcacgg gaccccgcta 


109381
caaccggacc acatcgccgc caccctggac gcggaggcca cgttcgcgct gcatgcgaac 


109441
atgatcctgg ctctcaccgt ggccgtcaac aacgccagcc cccgcaccgg acgcgacgcc 


109501
gccgcggcgc agtatgatca gggcgcgtcc ctacgctcgc tcgtggggcg cacgtccctg 


109561
ggacaacgcg gccttaccac gctatacgtc caccacgagg cgcgcgtgct ggccgcgtac 


109621
cgcagggcgt attatggaag cgcgcagagt cccttctggt ttcttagcaa attcgggcct 


109681
gacgaaaaaa gcctggtgct caccactcgg tactacctgc ttcaggccca gcgtctgggg 


109741
ggcgcggggg ccacgtacga cctgcaggcc atcaaggaca tctgcgccac ctacgcgatt 


109801
ccccacgccc cccgccccga caccgtcagc gccgcgtccc tgacctcgtt tgccgccatc 


109861
acgcggttct gttgcacgag ccagtacgcc cgcggggccg cggcggccgg gtttccgctt 


109921
tacgtggagc gccgtattgc ggccgacgtc cgcgagacca gtgcgctgga gaagttcata 


109981
acccacgatc gcagttgcct gcgcgtgtcc gaccgtgaat tcattacgta catttacctg 


110041
gcccattttg agtgtttcag ccccccgcgc ctagccacgc atcttcgggc cgtgacgacc 


110101
caggacccca accccgcggc caacacggag cagccctcgc ccctgggcag ggaggccgtg 


110161
gaacaatttt tttgccacgt gcgcgcccaa ctgaatatcg gggagtacgt caaacacaac 


110221
gtgacccccc gggagaccgt cctggatggc gatacggcca aggcctacct gcgcgctcgc 


110281
acgtacgcgc ccggggccct gacgcccgcc cccgcgtatt gcggggccgt ggactccgcc 


110341
accaaaatga tggggcgttt ggcggacgcc gaaaagctcc tggtcccccg cgggtggccc 


110401
gcgtttgcgc ccgccagtcc cggggaggat acggcgggcg gcacgccgcc cccacagacc 


110461
tgcggaatcg tcaagcgcct cctgagactg gccgccacgg aacaacagga caccacgccc 


110521
ccggcgatcg cggcgcttat ccgtaatgcg gcggtgcaga ctcccctgcc cgtctaccgg 


110581
atatccatgg tccccacggg acaggcattt gccgcgctgg cctgggacga ctgggcccgc 


110641
ataacgcggg acgctcgcct ggccgaagcg gtcgtgtccg ccgaagcggc ggcgcacccc 


110701
gaccacggcg cgctgggcag gcggctcacg gatcgcatcc gcgcccaggg ccccgtgatg 


110761
ccccctggcg gcctggatgc cggggggcag atgtacgtga atcgcaacga gatatttaac 


110821
ggcgcgctgg caatcacaaa catcatcctg gatctcgaca tcgccctgaa ggagcccgtc 


110881
ccctttcgcc ggctccacga ggccctgggc cactttaggc gcggggctct ggcggcggtt 


110941
cagctcctgt ttcccgcggc ccgcgtggac cccgacgcat atccctgtta ttttttcaaa 


111001
agcgcatgtc ggcccggccc ggcgtccgtg ggttccggca gcggactcgg caacgacgac 


111061
gacggggact ggtttccctg ctacgacgac gccggtgatg aggagtgggc ggaggacccg 


111121
ggcgccatgg acacatccca cgatcccccg gacgacgagg ttgcctactt tgacctgtgc 


111181
cacgaagtcg gccccacggc ggaacctcgc gaaacggatt cgcccgtgtg ttcctgcacc 


111241
gacaagatcg gactgcgggt gtgcatgccc gtccccgccc cgtacgtcgt ccacggttct 


111301
ctaacgatgc ggggggtggc acgggtcatc cagcaggcgg tgctgttgga ccgagatttt 


111361
gtggaggcca tcgggagcta cgtaaaaaac ttcctgttga tcgatacggg ggtgtacgcc 


111421
cacggccaca gcctgcgttt gccgtatttt gccaaaatcg cccccgacgg gcctgcgtgc 


111481
ggaaggctgc tgccagtgtt tgtgatcccc cccgcctgca aagacgttcc ggcgtttgtc 


111541
gccgcgcacg ccgacccgcg gcgcttccat tttcacgccc cgcccaccta tctcgcttcc 


111601
ccccgggaga tccgtgtcct gcacagcctg ggtggggact atgtgagctt ctttgaaagg 


111661
aaggcgtccc gcaacgcgct ggaacacttt gggcgacgcg agaccctgac ggaggtcctg 


111721
ggtcggtaca acgtacagcc ggatgcgggg gggaccgtcg aggggttcgc atcggaactg 


111781
ctggggcgga tagtcgcgtg catcgaaacc cactttcccg aacacgccgg cgaatatcag 


111841
gccgtatccg tccggcgggc cgtcagtaag gacgactggg tcctcctaca gctagtcccc 


111901
gttcgcggta ccctgcagca aagcctgtcg tgtctgcgct ttaagcacgg ccgggcgagt 


111961
cgcgccacgg cgcggacatt cgtcgcgctg agcgtcgggg ccaacaaccg cctgtgcgtg 


112021
tccttgtgtc agcagtgctt tgccgccaaa tgcgacagca accgcctgca cacgctgttt 


112081
accattgacg ccggcacgcc atgctcgccg tccgttccct gcagcacctc tcaaccgtcg 


112141
tcttgataac ggcgtacggc ctcgtgctcg tgtggtacac cgtcttcggt gccagtccgc 


112201
tgcaccgatg tatttacgcg gtacgcccca ccggcaccaa caacgacacc gccctcgtgt 


112261
ggatgaaaat gaaccagacc ctattgtttc tgggggcccc gacgcacccc cccaacgggg 


112321
gctggcgcaa ccacgcccat atctgctacg ccaatcttat cgcgggtagg gtcgtgccct 


112381
tccaggtccc acccgacgcc acgaatcgtc ggatcatgaa cgtccacgag gcagttaact 


112441
gtctggagac cctatggtac acacgggtgc gtctggtggt cgtagggtgg ttcctgtatc 


112501
tggcgttcgt cgccctccac caacgccgat gtatgtttgg tgtcgtgagt cccgcccaca 


112561
agatggtggc cccggccacc tacctcttga actacgcagg ccgcatcgta tcgagcgtgt 


112621
tcctgcagta cccctacacg aaaattaccc gcctgctctg cgagctgtcg gtccagcggc 


112681
aaaacctggt tcagttgttt gagacggacc cggtcacctt cttgtaccac cgccccgcca 


112741
tcggggtcat cgtaggctgc gagttgatgc tacgctttgt ggccgtgggt ctcatcgtcg 


112801
gcaccgcttt catatcccgg ggggcatgtg cgatcacata ccccctgttt ctgaccatca 


112861
ccacctggtg ttttgtctcc accatcggcc tgacagagct gtattgtatt ctgcggcggg 


112921
gcccggcccc caagaacgca gacaaggccg ccgccccggg gcgatccaag gggctgtcgg 


112981
gcgtctgcgg gcgctgttgt tccatcatcc tgtcgggcat cgcaatgcga ttgtgttata 


113041
tcgccgtggt ggccggggtg gtgctcgtgg cgcttcacta cgagcaggag atccagaggc 


113101
gcctgtttga tgtatgacgt cacatccagg ccggcggaaa ccggaacggc atatgcaaac 


113161
tggaaactgt cctgtcttgg ggcccaccca cccgacgcgt catatgtaaa tgaaaatcgt 


113221
tcccccgagg ccatgtgtag cctggatccc aacgaccccg cccatgggtc ccaattggcc 


113281
gtcccgttac caagaccaac ccagccagcg tatccacccc cgcccgggtc cccgcggaag 


113341
cggaacggtg tatgtgatat gctaattaaa tacatgccac gtacttatgg tgtctgattg 


113401
gtccttgtct gtgccggagg tggggcgggg gccccgcccg gggggcggaa ctaggagggg 


113461
tttgggagag ccggccccgg caccacgggt ataaggacat ccaccacccg gccggtggtg 


113521
gtgtgcagcc gtgttccaac cacggtcacg cttcggtgcc tctccccgat tcgggcccgg 


113581
tcgcttgcta ccggtgcgcc accaccagag gccatatccg acaccccagc cccgacggca 


113641
gccgacagcc cggtcatggc gactgacatt gatatgctaa ttgacctcgg cctggacctc 


113701
tccgacagcg atctggacga ggaccccccc gagccggcgg agagccgccg cgacgacctg 


113761
gaatcggaca gcaacgggga gtgttcctcg tcggacgagg acatggaaga cccccacgga 


113821
gaggacggac cggagccgat actcgacgcc gctcgcccgg cggtccgccc gtctcgtcca 


113881
gaagaccccg gcgtacccag cacccagacg cctcgtccga cggagcggca gggccccaac 


113941
gatcctcaac cagcgcccca cagtgtgtgg tcgcgcctcg gggcccggcg accgtcttgc 


114001
tcccccgagc ggcacggggg caaggtggcc cgcctccaac ccccaccgac caaagcccag 


114061
cctgcccgcg gcggacgccg tgggcgtcgc aggggtcggg gtcgcggtgg tcccggggcc 


114121
gccgatggtt tgtcggaccc ccgccggcgt gcccccagaa ccaatcgcaa cccgggggga 


114181
ccccgccccg gggcggggtg gacggacggc cccggcgccc cccatggcga ggcgtggcgc 


114241
ggaagtgagc agcccgaccc acccggaggc ccgcggacac ggagcgtgcg ccaagcaccc 


114301
cccccgctaa tgacgctggc gattgccccc ccgcccgcgg acccccgcgc cccggccccg 


114361
gagcgaaagg cgcccgccgc cgacaccatc gacgccacca cgcggttggt cctgcgctcc 


114421
atctccgagc gcgcggcggt cgaccgcatc agcgagagct tcggccgcag cgcacaggtc 


114481
atgcacgacc cctttggggg gcagccgttt cccgccgcga atagcccctg ggccccggtg 


114541
ctggcgggcc aaggagggcc ctttgacgcc gagaccagac gggtctcctg ggaaaccttg 


114601
gtcgcccacg gcccgagcct ctatcgcact tttgccggca atcctcgggc cgcatcgacc 


114661
gccaaggcca tgcgcgactg cgtgctgcgc caagaaaatt tcatcgaggc gctggcctcc 


114721
gccgacgaga cgctggcgtg gtgcaagatg tgcatccacc acaacctgcc gctgcgcccc 


114781
caggacccca ttatcgggac ggccgcggcg gtgctggata acctcgccac gcgcctgcgg 


114841
ccctttctcc agtgctacct gaaggcgcga ggcctgtgcg gcctggacga actgtgttcg 


114901
cggcggcgtc tggcggacat taaggacatt gcatccttcg tgtttgtcat tctggccagg 


114961
ctcgccaacc gcgtcgagcg tggcgtcgcg gagatcgact acgcgaccct tggtgtcggg 


115021
gtcggagaga agatgcattt ctacctcccc ggggcctgca tggcgggcct gatcgaaatc 


115081
ctagacacgc accgccagga gtgttcgagt cgtgtctgcg agttgacggc cagtcacatc 


115141
gtcgcccccc cgtacgtgca cggcaaatat ttttattgca actccctgtt ttaggtacaa 


115201
taaaaacaaa acatttcaaa caaatcgccc cacgtgttgt ccttctttgc tcatggccgg 


115261
cggggcgtgg gtcacggcag atggcggggg tgggcccggc gtacggcctg ggtgggcgga 


115321
gggaactaac ccaacgtata aatccgtccc cgctccaagg ccggtgtcat agtgccctta 


115381
ggagcttccc gcccgggcgc atcccccctt ttgcactatg acagcgaccc ccctcaccaa 


115441
cctgttctta cgggccccgg acataaccca cgtggccccc ccttactgcc tcaacgccac 


115501
ctggcaggcc gaaacggcca tgcacaccag caaaacggac tccgcttgcg tggccgtgcg 


115561
gagttacctg gtccgcgcct cctgtgagac cagcggcaca atccactgct ttttctttgc 


115621
ggtatacaag gacacccacc atacccctcc gctgattacc gagctccgca actttgcgga 


115681
cctggttaac cacccgccgg tcctacgcga actggaggat aagcgcgggg tgcggctgcg 


115741
gtgtgcgcgg ccgtttagcg tcgggacgat taaggacgtc tctgggtccg gcgcgtcctc 


115801
ggcgggagag tacacgataa acgggatcgt gtaccactgc cactgtcggt atccgttctc 


115861
aaaaacatgc tggatggggg cctccgcggc cctacagcac ctgcgctcca tcagctccag 


115921
cggcatggcc gcccgcgcgg cagagcatcg acgcgtcaag attaaaatta aggcgtgatc 


115981
tccaaccccc catgaatgtg tgtaaccccc cccaaaaaaa taaagagccg taacccaacc 


116041
aaaccaggcg tggtgtgagt ttgtggaccc aaagccctca gagacaatgc gacaggccag 


116101
tatggaccgt gatactttta tttattaact cacaggggcg cttaccgcca caggaatacc 


116161
agaataatga ccaccacaat cgcgaccacc ccaaatacag catggcgcca caccacgcca 


116221
caacagccct gtcgccggta tggggcatga tcagacgagc cgcgcgccgc gcgttgggcc 


116281
ctgtacagct cgcgcgaatt gaccctagga ggccgccacg cgcccgagtt ttgcgttcgt 


116341
cgctggtcgt cgggcgccaa agccccggac ggctgttcgg tcgaacgaac ggccacgaca 


116401
gtggcatagg ttggggggtg gtccgacata gcctcggcgt acgtcgggag gcccgacaag 


116461
aggtcccttg tgatgtcggg tggggccaca agcctggttt ccggaagaaa caggggggtt 


116521
gccaataacc cgccagggcc aaaactccgg cgctgcgcac gtcgttcggc gcggcgccgg 


116581
gcgcgccgag cggctcgctg ggcggcttgg cgtgagcggc cccgctccga cgcctcgccc 


116641
tctccggagg aggttggcgg aattggcacg gacgacaggg gcccagcaga gtacggtgga 


116701
ggtgggtccg tgggggtgtc cagatcaata acgacaaacg gcccctcgtt cctaccagac 


116761
aagctatcgt aggggggcgg gggatcagca aacgcgttcc ccgcgctcca tagacccgcg 


116821
tcgggttgcg ccgcctccga agccatggat gcgccccaaa gccacgactc ccgcgcgcta 


116881
ggtccttggg gtaagggaaa aggccctact ccccatccaa gccagccaag ttaacgggct 


116941
acgccttcgg ggatgggact ggcaccccgg cggattttgt tgggctggta cgcgtcgccc 


117001
aaccgagggc cgcgtccacg ggacgcgcct tttataaccc cggggtcatt cccaacgatc 


117061
acatgcaatc taactggctc ccctctcccc ccctctcccc tctccccccc tctcccctct 


117121
ccccccctct cccctctccc cccctctccc ctctcccccc ctctcccctc tccccccctc 


117181
tcccctctcc ccccctctcc cctctccccc cctctcccct ctccccccct ctcccctctc 


117241
cccccctctc ccctgctctt tccccgtgac acccgacgct ggggggcgtg gctgccggga 


117301
ggggccgcgt atgggcgggc ctactcggtc tcccgccccc ccgaaccgcc ccgccggctt 


117361
tgcccccctt tgatcccctg ctacccccac cccgtgctcg tggtgcgggg tggggggatg 


117421
tgggcggggg tgcgcgggag gtgtcggtgg tgggggtggt ggtggtggtg gtagtaggaa 


117481
tggtggtggg ggggagggcg ctggttggtc aaaaaaggga gggacggggg ccggcagacc 


117541
gacggcgaca acgctccccg gcggccgggt cgcggctctt acgagcggcc cggcccgcgc 


117601
tcccaccccc cgggccgtgt ccttgctttc cccccgtctc cccccccgcc ttctcctcct 


117661
cctcctcgtt tttccaaacc ccgcccaccc ggcccggccc ggcccggccc ggccaccgcc 


117721
gcccacccac ccacctcggg atacccagcc ccggtccccc gttccccggg ggccgttatc 


117781
tccagcgccc cgtccggcgc gccgcccccc gccgctaaac cccatcccgc ccccgggacc 


117841
ccacatataa gcccccagcc acacgcaaga acagacacgc agaacggctg tgtttatttt 


117901
aaataaaccg atgtcggaat aaacaaacac aaacacccgc gacgggggga cggaggggac 


117961
ggagggaggg gggtgacggg ggacggaaac agacacaaaa aacaaccaca aaaaacaacc 


118021
acccaccgac acccccaccc cagtctcctc gccttctccc acccacccca cgcccccact 


118081
gagcccggtc gatcgacgag cacccccgcc cacgcccccg cccctgcccc ggcgaccccc 


118141
ggcccgcacg atcccgacaa caataacaac cccaacggaa agcggcgggg tgttggggga 


118201
ggcgaggaac aaccgagggg aacgggggat ggaaggacgg gaagtggaag tcctgatacc 


118261
catcctacac ccccctgcct tccaccctcc ggccccccgc gagtccaccc gccggccggc 


118321
taccgagacc gaacacggcg gccaccgccg ccgccgccgc cgacaccgca gagccggcgc 


118381
gcgcacacac aagcggcaga ggcagaaagg ccccgagtca ttgtttatgt ggccgcgggc 


118441
cagcagacgg cccgcgacac ccccccccgc ccgtgtgggt atccggcccc ccgccccgcg 


118501
ccggtccatt aagggcgcgc gtgcccgcga gatatcaatc cgttaagtgc tctgcagaca 


118561
ggggcaccgc gcccggaaat ccattaggcc gcagacgagg aaaataaaat tacatcacct 


118621
acccatgtgg tgctgtggcc tgtttttgct gcgtcatctg agcctttata aaagcggggg 


118681
cgcggccgtg ccgatcgccg gtggtgcgaa agactttccg ggcgcgtccg ggtgccgcgg 


118741
ctctccgggc ccccctgcag ccggggcggc caaggggcgt cggcgacatc ctccccctaa 


118801
gcgccggccg gccgctggtc tgttttttcg ttttccccgt ttcgggggtg gtgggggttg 


118861
cggtttctgt ttctttaacc cgtctggggt gtttttcgtt ccgtcgccgg aatgtttcgt 


118921
tcgtctgtcc cctcacgggg cgaaggccgc gtacggcccg ggacgagggg cccccgaccg 


118981
cggcggtccg ggccccgtcc ggacccgctc gccggcacgc gacgcgaaaa aggccccccg 


119041
gaggcttttc cgggttcccg gcccggggcc tgagatgaac actcggggtt accgccaacg 


119101
gccggccccc gtggcggccc ggcccggggc cccggcggac ccaaggggcc ccggcccggg 


119161
gccccacaac ggcccggcgc atgcgctgtg gttttttttt cctcggtgtt ctgccgggct 


119221
ccgtcgcctt tcctgttctc gcttctcccc cccccccttc acccccagta ccctcctccc 


119281
tcccttcctc ccccgttatc ccactcgtcg agggcgcccc ggtgtcgttc aacaaagacg 


119341
ccgcgtttcc aggtaggtta gacacctgct tctccccaat agaggggggg ggacccaaac 


119401
gacagggggc gccccagagg ctaaggtcgg ccacgccact cgcgggtggg ctcgtgttac 


119461
agcacaccag cccgttattt tccccccctc ccacccttag ttagactctg ttacttaccc 


119521
gtccgaccac caactgcccc cttatctaag ggccggctgg aagaccgcca gggggtcggc 


119581
cggtgtcgct gtaacccccc acgccaatga cccacgtact ccaagaaggc atgtgtccca 


119641
ccccgcctgt gtttttgtgc ctggctctct atgcttgggt cttactgcct gggggggggg 


119701
agtgcggggg aggggggggg tgtggaagga aatgcacggc gcgtgtgtac ccccccctaa 


119761
agttgttcct aaagcgagga tatggaggag tggcgggtgc cgggggaccg gggtgatctc 


119821
tggcacgcgg gggtgggaag ggtcggggga gggggggatg gggtaccggc ccacctggcc 


119881
ggcgcgggtg cgcgtgcctt tgcacaccaa ccccacgtcc cccggcggtc tctaagaaac 


119941
accgcccccc ctccttcata ccaccgagca tgcctgggtg tgggttggta accaacacgc 


120001
ccatcccctc gtctcctgtg attctctggc tgcaccgcat tcttgttttc taactatgtt 


120061
cctgtttctg tctccccccc cacccctccg ccccaccccc caacacccac gtctgtggtg 


120121
tggccgaccc ccttttgggc gccccgtccc gccccgctac ccctcccatc ctttgttgcc 


120181
ctatagtgta gttaaccccc ccccccgccc tttgtggcgg ccagaggcca ggtcagtccg 


120241
ggcgggcagg cgctcgcgga aacttaacac ccacacccaa cccactgtgg ttctggctcc 


120301
atgccagtgg caggatgctt tcggggatcg gtggtcaggc agcccgggcc gcggctctgt 


120361
ggttaacacc agagcctgcc caacatggca cccccactcc cacgcacccc cactcccacg 


120421
cacccccact cccacgcacc cccactccca cgcaccccca ctcccacgca cccccactcc 


120481
cacgcacccc cactcccacg cacccccact cccacgcacc cccactccca cgcacccccg 


120541
cgatacatcc aacacagaca gggaaaagat acaaaagtaa acctttattt cccaacagac 


120601
agcaaaaatc ccctgagttt tttttattag ggccaacaca aaagacccgc tggtgtgtgg 


120661
tgcccgtgtc tttcactttc cacctccccg acacggattg gctggtgtag tgggcgcggc 


120721
cagagaccac ccagcgcccg acccccccct ccccacaaac acgggggcgt cccttattgt 


120781
tttccctcgt cccgggtcga cgccccctgc tccccggacc acgggtgccg agaccgcagg 


120841
ctgcggaagt ccagggcgcc cactagggtg ccctggtcga acagcatgtt ccccacgggg 


120901
gtcatccaga ggctgttcca ctccgacgcg ggggccgtcg ggtactcggg gggcgtcacg 


120961
tggttacccg cggtctcggg gagcagggtg cggcggctcc agccggggac cgcggcccgc 


121021
agccgggtcg ccatgtttcc cgtctggtcc accaggacca cgtacgcccc gatgttcccc 


121081
gtctccatgt ccaggatggg caggcagtcc cccgtgatcg tcttgttcac gtaaggcgac 


121141
agggcgacca cgctagagac ccccgagatg ggcaggtagc gcgtgaggcc gcccgcgggg 


121201
acggccccgg aagtctccgc gtggcgcgtc ttccgggcac acttcctcgg cccccgcggc 


121261
ccagaagcag cgcgggggcc gagggaggtt tcctcttgtc tccctcccag ggcaccgacg 


121321
gccccgcccg aggaggcgga agcggaggag gacgcggccc cggtggcgga agaggtggcc 


121381
cccgcgggag tcggggccga ggaggaagag gcggaggagg aagaggcgga ggccgccgag 


121441
gacgtcaggg gggtcccggg ccctccctgg ccgcgccccc ccggccctga gtcggagggg 


121501
gggtgcgtcg ccgccctctt ggcccctgcc ggcgcgaggg ggggacgcgt ggactggggg 


121561
gaggggtttt cctggcccga cccgcgcctc ttcctcggac gcaccgccgc ctcctgctcg 


121621
acagaggcgg cggaggggag cgggggggcg ccggaggggg cggcgccgga gggggcggcg 


121681
ccgcgggagg gcccgtgtcc accctccacg cccggccccc ccgagccgcg cgccaccgtc 


121741
gcacgcgccc ggcacagact ctgttcttgg ttcgcggcct gagccaggga cgagtgcgac 


121801
tggggcacac ggcgcgcgtc cgcggggcgg gcggccggct ccgccccggg ggccggggcg 


121861
cgggggccgg gccccggagg cggcgcccgc acacacgggg ccacggccgc gcgggggcgc 


121921
gcggggcccg acgcggccgc ggacgcgggg ggaccggggc ggggggcgga gcctggcatg 


121981
ggcgccgcgg ggggcctgtg gggagaggcc gggggggagt cgctgatcac tatggggtct 


122041
ctgttgtttg caaggggggc gggtctgttg acaagggggc ccgtccggcc cctcggccgc 


122101
cccgcctccg cttcaacaac cccaaccccc ccggaggggc cagacgcccc ccgcggcacc 


122161
gcggctcgcg actggcggga gccgccgccg ccgctgctgt tggtggtggt gttagtgtta 


122221
ctgctgccgt gtggcccgat gggcgccgag gggggcgctg tccgagccgc ggccggctgg 


122281
ggggctgcgt gagacgcccc gcccgtcacg gggggcgcgg cggcgcctct gcgtgggggg 


122341
gcgcggggcg tccggcgggg ggcgggcggt acgtagtctg ctgcaagaga caacgggggg 


122401
cgcgatcagg ttacgccccc tccccggccc gccctttcct cgcccgcccg cccgcctatt 


122461
cctccctccc ccccccctcc tcctcctccc ccagggtcct cgccgccccc ccgcctcacc 


122521
gtcgtccagg tcgtcgtcat cctcgtccgt ggtgggctca gggtgggtgg gcgacagggc 


122581
cctcaccgtg tgccccccca gggtcaggta ccgcggggcg aaccgctgat tgcccgtcca 


122641
gataaagtcc acggccgtgc ccgccctgac ggcctcctcg gcctccatgc gggtctgggg 


122701
gtcgttcacg atcgggatgg tgctgaacga cccgctgggc gtcacgccca ctatcaggta 


122761
caccagcttg gcgttgcaca gcgggcaggt gttgcgcaat tgcatccagg ttttcatgca 


122821
cgggatgcag aagcggtgca tgcacgggaa ggtgtcgcag cgcaggtggg gcgcgatctc 


122881
atccgtgcac acggcgcaca cgtcgccctc gtcgctcccc ccgtcctctc gagggggggc 


122941
gcccccgcaa ctgccggggt cttcctcgcg gggggggctc ccccccgaga ccgccccccc 


123001
atccacgccc tgcggcccca gcagccccgt ctcgaacagt tccgtgtccg tgctgtccgc 


123061
ctcggaggcg gagtcgtcgt catggtggtc ggcgtccccc cgccccccca cttcggtctc 


123121
cgcctccgag tcgctgctgt ccggcaggtc tcggtcgcag ggaaacaccc agacatccgg 


123181
ggcgggctga ggggaaaaaa gggggggcgg gtaagaatgg ggggatttcc cgcgtcaatc 


123241
agcgcccacg agttccccct ccccccccgc ctcacaaagt cctgcccccc tgctggcctc 


123301
ggaagagggg ggagaaaggg gtctgcaacc aaaggtggtc tgggtccgtc ctttggatcc 


123361
cgacccctct tcttccctct tctcccgccc tccagacgca ccggagtcgg gggtcccacg 


123421
gcgtccccca aatatggcgg gcggctcctc cccacccccc tagatgcgtg tgagtaaggg 


123481
ggccctgcgt atgagtcagt ggggaccacg ccccctaaca cggcgacccc ggtccctgtg 


123541
tgtttgttgt gggggcgtgt ctctgtgtat gagtcagggg ggtcccacgg cgaccccggg 


123601
ccctgcgtct gagtcaaagg ggccatgtgt aggtgttggg ggtctgtata tataaagtca 


123661
gggggtcaca tggcgacccc taacagggcg accccggtcc ctgtatatat agggtcaggg 


123721
ggttccgcgc cccctaacat ggcgcccccg gtccctgtat atatagtgtc acggggttcc 


123781
acgcccccta acatggcgcc ccaacatggc gcccggctcc cgtgtatgag tgggggtccc 


123841
ccaacatggc ggccggttcc agtgtaaggg tcgggggtcc cccaacatgg cgccccccaa 


123901
catggcgccc cccaacatgg cgccccagac atggcgcccg gcccctcacc tcgcgctggg 


123961
ggcggccctc aggccggcgg gtactcgctc cggggcgggg ctccatgggg gtcgtatgcg 


124021
gctggagggt cgctgacgga gggtccctgg gggtcgcaac gtaggcgggg cttctgtggt 


124081
gatgcggaga gggggcggcc cgagtctgcc tggctgctgc gtctcgctcc gagtgccgag 


124141
gtgcaaatgc gaccagaccg tcgggccagg gctaacttat accccacgcc tttcccctcc 


124201
ccaaaggggc ggcagtgacg attcccccaa tggccgcgcg tcccagggga ggcaggccca 


124261
ccgcggggcg gccccgtccc cggggaccaa cccggcgccc ccaaagaata tcattagcat 


124321
gcacggcccg gcccccgatt tgggggacca acccggtgtc ccccaaagaa ccccattagc 


124381
atgcccctcc cgccgacgca acaggggctt ggcctgcgtc ggtgccccgg ggcttcccgc 


124441
cttcccgaag aaactcatta ccatacccgg aaccccaggg gaccaatgcg ggttcattga 


124501
gcgacccgcg ggccaatgcg cgaggggccg tgtgttccgc caaaaaagca attaacataa 


124561
cccggaaccc caggggagtg gttacgcgcg gcgcgggagg cggggaatac cggggttgcc 


124621
cattaagggc cgcgggaatt gccggaagcg ggaagggcgg ccggggccgc ccattaatga 


124681
gtttctaatt accataccgg gaagcggaac aaggcctctg caagttttta attaccatac 


124741
cgggaagtgg gcggcccggc ccactgggcg ggagttaccg cccagtgggc cgggccccga 


124801
cgactcggcg gacgctggtt ggccgggccc cgccgcgctg gcggccgccg attggccagt 


124861
cccgccctcc gagggcgggc ccgcctcggg ggcgggccgg ctccaagcgt atatatgcgc 


124921
ggctcctgcc atcgtctctc cggagagcgg cttggtgcgg agctcccggg agctccgcgg 


124981
aagacccagg ccgcctcggg tgtaacgtta gaccgagttc gccgggccgg ctccgcgggc 


125041
cagggcccgg gcacgggcct cgggccccag gcacggcccg atgaccgcct cggcctccgc 


125101
cacccggcgc cggaaccgag cccggtcggc ccgctcgcgg gcccacgagc cgcggcgcgc 


125161
caggcgggcg gccgaggccc agaccaccag gtggcgcacc cggacgtggg gcgagaagcg 


125221
cacccgcgtg ggggtcgcgg gggtcgcggg ggtcgcgggg ggcttcggcg ccccctcccc 


125281
gcccgcgcgt cgcaggcgca ggcgcgccag gtgctctgcg gtgacgcgca ggcggagggc 


125341
gaggcgcggc ggaaggcgga aggggcgtga gggggggtgg gaggggttag ccccgccccc 


125401
cgggcccgcg ccgggcggtg gggaccgggg gcggggggcg gcggcggtgg gccgggcctc 


125461
tggcgccggc tcgggcgggg ggctgtccgg ccagtcgtcg tcgtcgtcgt cggacgcgga 


125521
ctcgggaacg tggagccact ggcgcagcag cagcgaacaa gaaggcgggg gcccactggc 


125581
ggggggcggc ggcggggcgg ccgcgggcgc gctcctgacc acgggttccg agttgggcgt 


125641
ggaggttacc tgggactgtg cggttgggac cgcgcccgtg ggcccgggcg gccgggggcg 


125701
gcgggggccg cgatggcggc ggcgggccat ggagacagag agcgtgccgg ggtggtagag 


125761
tttgacaggc aagcatgtgc gtgcagaggc gagtagtgct tgcctgtcta actcgctagt 


125821
ctcggccgcg gggggcccgg gctgcccgcc gcccgccttt aaagggccgc gcgcgacccc 


125881
cggggggtgt gttttggggg gggcccgttt ccggggtctg gccgctcctc ccccgctcct 


125941
ccccccgctc ctccccccgc tcctcccccc gctcctcccc ccgctcctcc ccccgctcct 


126001
ccccccgctc ctccccccgc tcctcccccc gctcctcccc ccgctcctcc ccccgctcct 


126061
ccccccgctc ctccccccgc tcctcccccc gctcctcccc ccgctcctcc ccccgctcct 


126121
cccccgctcc tcccccgctc ccgcggcccc gccccccacg cccgccgcgc gcgcgcacgc 


126181
cgcccggacc gccgcccgcc ttttttgcgc gcgcgcgcgc ccgcgggggg cccgggctgc 


126241
cacaggtaaa acaacaccaa caaagcacgg cgcaatccgc acgtcacacg tcacgtcatc 


126301
caccacacct gcccaacaac acaactcaca gcgacaactc accgcgcaac aactcctgtt 


126361
cctcatccac acgtcaccgc gcacctcccg ctcctccaga cgtaccccgg cgcaacacac 


126421
cgctcctgct acacaccacc gcccctcccc agccccagcc ctccccagcc ccagccctcc 


126481
ccggccccag ccctccccgg ccccagccct ccccggcccc agccctcccc ggccccagcc 


126541
ctccccggcc ccagccctcc ccagccccag ccctccccag ccgcgtcccg cgctccctcg 


126601
ggggggttcg ggcatctcta cctcagtgcc gccaatctca ggtcagagat ccaaaccctc 


126661
cgggggcgcc cgcgcaccac caccgcccct cgccccctcc cgcccctcgc cccctcccgc 


126721
ccctcgcccc ctcccgcccc tcgccccctc ccgcccctcg ccccctcccg cccctcgccc 


126781
cctcccgccc ctcgccccct cccgcccctc gccccctccc gcccctcgcc ccctcccgcc 


126841
cctcgccccc tcccgcccct cgccccctcc cgcccctcgc cccctcccgc ccctcgcccc 


126901
ctcccgcccc tcgccccctc ccgcccctcg ccccctcccg cccctcgccc cctcccgccc 


126961
ctcgccccct cccgcccctc gccccctccc gcccctcgcc ccctcccgcc cctcgaataa 


127021
acaacgctac tgcaaaactt aatcaggtcg ttgccgttta ttgcgtcttc gggtttcaca 


127081
agcgccccgc cccgtcccgg cccgttacag caccccgtcc ccctcgaacg cgccgccgtc 


127141
gtcttcgtcc caggcgcctt cccagtccac aacgtcccgt cgcgggggcg tggccaagcc 


127201
cgcctccgcc cccagcacct ccacggcccc cgccgccgcc agcacggtgc cgctgcggcc 


127261
cgtggccgag gcccagcgaa tcccgggcgg cgccggcggc agggcccccg ggccgtcgtc 


127321
gtcgtcgccg cgcagcacca gcgggggggc gtcgtcgtcg ggctccagca gggcgcgggc 


127381
gcaaaagtcc ctccgcggcc cgcgccaccg ggccgggccg gcgcgcaccg cctcgcgccc 


127441
cagcgccacg tacacgggcc gcagcggcgc gcccaggccc cagcgcgcgc aggcgcggtg 


127501
cgagtgggcc tcctcctcgc agaagtccgg cgcgccgggc gccatggcgt cggtggtccc 


127561
cgaggccgcc gcccggccgt ccagcgccgg cagcacggcc cggcggtact cgcgcgggga 


127621
catgggcacc ggcgtgtccg ggccgaagcg cgtgcgcacg cggtagcgca cgttgccgcc 


127681
gcggcacagg cgcagcggcg gcgcgtcggg gtacaggcgc gcgtgcgcgg cctccacgcg 


127741
cgcgaagacc cccgggccga acacgcggcc cgaggccagc accgtgcggc gcaggtcccg 


127801
cgccgccggc cagcgcacgg cgcactgcac ggcgggcagc aggtcgcacg ccaggtaggc 


127861
gtgctgccgc gacaccgcgg gcccgtcggc gggccagtcg caggcgcgca cggtgttgac 


127921
cacgatgagc cgccggtcgc cggcgctggc gagcagcccc agaaactcca cggccccggc 


127981
gaaggccagg tcccgcgtgg acagcagcag cacgccctgc gcgcccagcg ccgacacgtc 


128041
gggggcgccg gtccagttgc ccgcccaggc ggccgtgtcc ggcccgcaca gccggttggc 


128101
cagggccgcc agcaggcagg acagcccgcc gcgctcggcg gaccactccg gcggcccccc 


128161
cgaggccccg ccgccggcca ggtcctcgcc cggcagcggc gagtacagca ccaccacgcg 


128221
cacgtcctcg gggtcgggga tctggcgcat ccaggccgcc atgcggcgca gcgggcccga 


128281
ggcgcgcagg gggccaaaga ggcggccccc ggcggccccg tgggggtggg ggttctcgtc 


128341
gtcgtcgccg ccgcacgcgg cctgggcggc gggggcgggc ccggcgcacc gcgcggcgat 


128401
cgaggccagg gcccgcgggt caaacatgag ggccggtcgc caggggacgg ggaacagcgg 


128461
gtggtccgtg agctcggcca cggcgcgcgg ggagcagtag gcctccaggg cggcggccgc 


128521
gggcgccgcc gtgtggctgg gcccccgggg ctgccgccgc cagccgccca gggggtcggg 


128581
gccctcggcg ggccggcgcg acagcgccac ggggcgcggg cgggcctgcg ccgcggcgcc 


128641
ccgggccgcc gcgggctggg cgggggtggg ctcgggcccc gggggcgtgg aggggggcgc 


128701
ggggaggggg gcgcgggcgt ccgagccggg ggcgtccgcg ccgctcttct tcgtcttcgg 


128761
gggtcgcggg ccgccgcctc cgggcggccg ggccgggccg ggactcttgc gcttgcgccc 


128821
ctcccgcggc gcggcggagg cggcggcggc cgccagcgcg tcggcggcgt ccggtgcgct 


128881
ggccgccgcc gccagcaggg ggcggaggct ctggttctca aacagcaggt ccgcggcggc 


128941
ggcggccgcg gagctcggca ggcgcgggtc ccgcggcagc gcggggccca gggccccggc 


129001
gaccaggctc acggcgcgca cggcggccac ggcggcctcg ctgccgccgg ccacgcgcag 


129061
gtccccgcgc aggcgcatga gcaccagcgc gtcgcgcacg aaccgcagct cgcgcagcca 


129121
cgcgcgcagg cggggcgcgt cggcgtgcgg cggcggcggg gaagcggggc ccgcgggtcc 


129181
ctccggccgc ggggggctgg cgggccgggc cccggccagc cccgggacgg ccgccaggtc 


129241
gccgtcgaag ccctcggcca gcgcctccag gatcccgcgg caggcggcca ggcactcgac 


129301
ggccacgcgg ccggcctggg cgcggcgccc ggcgtcggcg tcggcgtggc gggcggcgtc 


129361
ggggtcgtcg ccccccacgg gggaggcggg cgcggcggac agccgcccca gggcggcgag 


129421
gatccccgcg gcgccgtacc cggcgggcac cgcgcgctcg cccggtgcgg cggcggcgac 


129481
ggcggcgacc ccctcgtcat ctgcgccggc gccggggctc cccgcggccc ccgtcagcgc 


129541
cgcgttctcg cgcgccaaca ggggcgcgta ggcgcggcgc aggctggtca gcaggaagcc 


129601
cttctgcgcg cggtcgtatc ggcggctcat ggccacggcg gccgccgcgt gcgccaggcc 


129661
ccagccgaag cggccggccg ccatggcgta gcccaggtgg ggcacggccc gcgccacgct 


129721
gccggtgatg aaggagctgc tgttgcgcgc ggcgcccgag atccggaagc aggcctggtc 


129781
cagcgccacg tccccgggga ccacgcgcgg gttctggagc caccccatgg cctccgcgtc 


129841
cggggtgtac agcagccgcg tgatcagggc gtactgctgc gcggcgtcgc ccagctcggg 


129901
cgcccacacg gccgccgggg cgcccgaggc ctcgaaccgg cgtcgcgcct cctccgcctc 


129961
gggcgccccc cagaggcccg ggcggctgtc gcccaggccg ccgtacagca cccgccccgg 


130021
gggcgggggc ccggcgccgg gccacggctc cccgctgacg tacccgtcgc gatagcgcgc 


130081
gtagaaggcg ccggaggccg cgtcggcgtc cagctcgacc cgccggggct gcccggccgt 


130141
gaagcggccc gtggcgtcgc ggccggccac cgccgcgcgg gcccggcggc gctcgatgcg 


130201
gcccgcggag gccgcggggg tcctcgccgc cgcccggggc ttgggcgcgg cctcggagag 


130261
ggggggtggc ccgggcgggg gcggcgtccg cccgggggct tccggcgccg cgctcgacgg 


130321
accccgcccg acggcccgcg cctcgcgtgc gtggtcggcc gcgtcgttgc cgtcgtcgtc 


130381
ctcgtcctcg tcggacgacg aggacgaaga ggatgcggac gacgaggacg aggacccgga 


130441
gtccgacgag gtcgatgacg ccgatggccg ccgccggccg tgacgacgtc tccgcggcgg 


130501
ctgggccggc gggcgcggcg acaggcggtc cgtggggtcc ggatacgcgc cgcgtagcgg 


130561
ggcctcccgt tcgcggcccc gggccggggc ccggtcgccg gcggcgtcgg ctgcgtcgtc 


130621
gtactcgtcc ccgtcatcgt cgtcggctag aaaggcgggg gtccggggcg gcgaggccgc 


130681
ggggtcgggc gtcgggatcg tccggacggc ctcctctacc atggaggcca gcagagccag 


130741
ctgtcgcgac gagacggcgt ccccggcgtc ctcgccggcg tcggtgcccg ccgcgggggc 


130801
cctcccgtcc cgccgggcgt cgtcgaggtc gtgggggtgg tcggggtcgt ggtcggggtc 


130861
gtccccgccc tcctccgtct ccgcgcccca cccgagggcc ccccgctcgt cgcggtctgg 


130921
gctcggggtg ggcggcggcc cgtcggtggg gcccggggag ccggggcgct gcttgttctc 


130981
cgacgccatc gccgatgcgg ggcgatcctc cggggatacg gctgcgacgg cggacgtagc 


131041
acggtaggtc acctacggac tctcgatggg gggagggggc gagacccacg gaccccgacg 


131101
acccccgccg tcgacgcgga actagcgcgg accggtcgat gcttgggtgg ggaaaaagga 


131161
cagggacggc cgatccccct cccgcgcttc gtccgcgtat cggcgtcccg gcgcggcgag 


131221
cgtctgacgg tctgtctctg gcggtcccgc gtcgggtcgt ggatccgtgt cggcagccgc 


131281
gctccgtgtg gacgatcggg gcgtcctcgg gctcatatag tcccaggggc cggcgggaag 


131341
gaggagcagc ggaggccgcc ggccccccgc cccccggcgg gcccaccccg aacggaattc 


131401
cattatgcac gaccccgccc cgacgccggc acgccggggg cccgtggccg cggcccgttg 


131461
gtcgaacccc cggccccgcc catccgcgcc atctgccatg gacggggcgc gagggcgggt 


131521
gggtccgcgc cccgccccgc atggcatctc attaccgccc gatccggcgg tttccgcttc 


131581
cgttccgcat gctaacgagg aacgggcagg gggcggggcc cgggccccga cttcccggtt 


131641
cggcggtaat gagatacgag ccccgcgcgc ccgttggccg tccccgggcc cccggtcccg 


131701
cccgccggac gccgggacca acgggacggc gggcggccct tgggccgccc gccttgccgc 


131761
ccccccattg gccggcgggc gggaccgccc caagggggcg gggccgccgg gtaaaagaag 


131821
tgagaacgcg aagcgttcgc acttcgtccc aatatatata tattattagg gcgaagtgcg 


131881
agcactggcg ccgtgcccga ctccgcgccg gccccggggg cggacccggg cggcgggggg 


131941
cgggtctctc cggcgcacat aaaggcccgg cgcgaccgac gcccgcagac ggcgccagcc 


132001
acgaacgacg ggagcggctg cggagcacgc ggaccgggag cgggagtcgc agagggccgt 


132061
cggagcggac ggcgtcggca tcgcgacgcc ccggctcggg atcgggatcg catcggaaag 


132121
ggacacgcgg acgcgggggg gaaagacccg cccaccccac ccacgaaaca caggggacgc 


132181
accccggggg cctccgacga cagaaaccca ccggtccgcc ttttttgcac gggtaagcac 


132241
cttgggtggg cagaggaggg gggacgcggg ggcggaggag gggggacgcg ggggcggagg 


132301
aggggggacg cgggggcgga ggagggggga cgcgggggcg gaggaggggg gacgcggggg 


132361
cggaggaggg ggctcacccg cgttcgtgcc ttcccgcagg aggaacgccc tcgtcgaggc 


132421
gaccggcggc gaccgttgcg tggaccgctt cctgctcgtc gggggggggg gagccactgt 


132481
ggtcctccgg gacgttttct ggatggccga catttcccca ggcgcttttg tgccttgtgt 


132541
aaaagcgcgg cgtcccgctc tccgatcccc gcccctgggc acgcgcaagc gcaagcgccc 


132601
tgcccgcccc ctctcatcgg agtctgaggt cgaatccgag acagccttgg agtctgaggt 


132661
cgaatccgag acagcatcgg attcgaccga gtctggggac caggaggaag ccccccgcat 


132721
cggtggccgt agggcccccc ggaggcttgg ggggcggttt tttctggaca tgtcggcgga 


132781
atccaccacg gggacggaaa cggatgcgtc ggtgtcggac gaccccgacg acacgtccga 


132841
ctggtcttgt gacgacattc ccccacgacc caagcgggcc cgggtaaacc tgcggctcac 


132901
tagctctccc gatcggcggg atggggttat ttttcctaag atggggcggg tccggtctac 


132961
ccgggaaacg cagccccggg cccccacccc gtcggcccca agcccaaatg caatgctccg 


133021
gcgctcggtg cgccaggccc agaggcggag cagcgcacga tggacccccg acctgggcta 


133081
catgcgccag tgtatcaatc agctgtttcg ggtcctgcgg gtcgcccggg acccccacgg 


133141
cagtgccaac cgcctgcgcc acctgatacg cgactgttac ctgatgggat actgccgagc 


133201
ccgtctggcc ccgcgcacgt ggtgccgctt gctgcaggtg tccggcggaa cctggggcat 


133261
gcacctgcgc aacaccatac gggaggtgga ggctcgattc gacgccaccg cagaacccgt 


133321
gtgcaagctt ccttgtttgg aggccagacg gtacggcccg gagtgtgatc ttagtaatct 


133381
cgagattcat ctcagcgcga caagcgatga tgaaatctcc gatgccaccg atctggaggc 


133441
cgccggttcg gaccacacgc tcgcgtccca gtccgacacg gaggatgccc cctcccccgt 


133501
tacgctggaa accccagaac cccgcgggtc cctcgctgtg cgtctggagg atgagtttgg 


133561
ggagtttgac tggacccccc aggagggctc ccagccctgg ctgtctgcgg tcgtggccga 


133621
taccagctcc gtggaacgcc cgggcccatc cgattctggg gcgggtcgcg cagcagaaga 


133681
ccgcaagtgt ctggacggct gccggaaaat gcgcttctcc accgcctgcc cctatccgtg 


133741
cagcgacacg tttctccggc cgtgagtccg gtcgccccga cccccttgta tgtccccaaa 


133801
ataaaagacc aaaatcaaag cgtttgtccc agcgtcttaa tggcgggaag ggcggagaga 


133861
aacagaccac gcgtacatgg ggggtgtttg ggggtttatt gacatcgggg ctacagggtg 


133921
gtaaccggat agcagatgtg aggaagtctg ggccgttcgc cgcgaacggc gatcagaggg 


133981
tccgtttctt gcggaccacg gcccggtgat gtgggttgct cgtctaaaat ctcgggcata 


134041
cccatacacg cacaacacgg acgccgcacc gaatgggacg tcgtaagggg gtgggaggta 


134101
gctgggtggg gtttgtgcag agcaatcagg gaccgcagcc agcgcataca atcgcgctcc 


134161
cgtccgttgg tcccgggcag gaccacgccg tactggtatt cgtaccggct gagcagggtc 


134221
tccagggggt ggttgggtgc cgcggggaac ggggtccacg ccacggtcca ctcgggcaaa 


134281
aaccgagtcg gcacggccca cggttctccc acccacgcgt ctggggtctt gatggcgata 


134341
aatcttaccc cgagccggat tttttgggcg tattcgagaa acggcacaca cagatccgcc 


134401
gcgcctacca cccacaagtg gtagaggcga ggggggctgg gttggtctcg gtgcaacagt 


134461
cggaagcacg ccacggcgtc cacgacctcg gtgctctcca aggggctgtc ctccgcaaac 


134521
aggcccgtgg tggtgtttgg ggggcagcga caggacctag tgcgcacgat cgggcgggtg 


134581
ggtttgggta agtccatcag cggctcggcc aaccgtcgaa ggttggccgg gcgaacgacg 


134641
accggggtac ccaggggttc tgatgccaaa atgcggcact gcctaagcag gaagctccac 


134701
agggccgggc ttgcgtcgac ggaagtccgg ggcagggcgt tgttctggtc aaggagggtc 


134761
attacgttga cgacaacaac gcccatgttg gtatattaca ggcccgtgtc cggtttgggg 


134821
cacttgcaga tttgtaaggc cacgcacggc ggggagacag gccgacgcgg gggctgctct 


134881
aaaaatttaa gggccctacg gtccacagac ccgccttccc gggggggccc ttggagcgac 


134941
cggcagcgga ggcgtccggg ggaggggagg gttatttacg ggggggtagg tcagggggtg 


135001
ggtcgtcaaa ctgccgctcc ttaaaacccc ggggcccgtc gttcggggtg ctcgttggtt 


135061
ggcactcacg gtgcggcgaa tggcctgtcg taagttttgt cgcgtttacg ggggacaggg 


135121
caggaggaag gaggaggccg tcccgccgga gacaaagccg tcccgggtgt ttcctcatgg 


135181
ccccttttat accccagccg aggacgcgtg cctggactcc ccgcccccgg agacccccaa 


135241
accttcccac accacaccac ccggcgatgc cgagcgcctg tgtcatctgc aggagatcct 


135301
ggcccagatg tacggaaacc aggactaccc catagaggac gaccccagcg cggatgccgc 


135361
ggacgatgtc gacgaggacg ccccggacga cgtggcctat ccggaggaat acgcagagga 


135421
gctttttctg cccggggacg cgaccggtcc ccttatcggg gccaacgacc acatccctcc 


135481
cccgcgtggc gcatctcccc ccggtatacg acgacgcagc cgggatgaga ttggggccac 


135541
gggatttacc gcagaagagc tggacgccat ggacaggcag gcggctcgag ccatcagccg 


135601
cggcggcaag cccccctcga ccatggccaa gctggtgact ggcatgggct ttacgatcca 


135661
cggagcgctc accccaggat cggaggggtg tgtctttgac agcagccacc cagattaccc 


135721
ccaacgggta atcgtgaagg cggggtggta cacgagcacg agccacgagg cgcgactgct 


135781
gaggcgactg gaccacccgg cgatcctgcc cctcctggac ctgcatgtcg tctccggggt 


135841
cacgtgtctg gtcctcccca agtaccaggc cgacctgtat acctatctga gtaggcgcct 


135901
gaacccactg ggacgcccgc agatcgcagc ggtctcccgg cagctcctaa gcgccgttga 


135961
ctacattcac cgccagggca ttatccaccg cgacattaag accgaaaata tttttattaa 


136021
cacccccgag gacatttgcc tgggggactt tggtgccgcg tgcttcgtgc agggttcccg 


136081
atcaagcccc ttcccctacg gaatcgccgg aaccatcgac accaacgccc ccgaggtcct 


136141
ggccggggat ccgtatacca cgaccgtcga catttggagc gccggtctgg tgatcttcga 


136201
gactgccgtc cacaacgcgt ccttgttctc ggccccccgc ggccccaaaa ggggcccgtg 


136261
cgacagtcag atcacccgca tcatccgaca ggcccaggtc cacgttgacg agttttcccc 


136321
gcatccagaa tcgcgcctca cctcgcgcta ccgctcccgc gcggccggga acaatcgccc 


136381
gccttacacc cgaccggcct ggacccgcta ctacaagatg gacatagacg tcgaatatct 


136441
ggtttgcaaa gccctcacct tcgacggcgc gcttcgcccc agcgccgcag agctgctttg 


136501
tttgccgctg tttcaacaga aatgaccgcc cccggggggc ggtgctgttt gcgggttggc 


136561
acaaaaagac cccgacccgc gtctgtggtg tttttggcat catgtcgccg ggcgccatgc 


136621
gtgccgttgt tcccattatc ccattccttt tggttcttgt cggtgtatcg ggggttccca 


136681
ccaacgtctc ctccaccacc caaccccaac tccagaccac cggtcgtccc tcgcatgaag 


136741
cccccaacat gacccagacc ggcaccaccg actctcccac cgccatcagc cttaccacgc 


136801
ccgaccacac accccccatg ccaagtatcg gactggagga ggaggaggaa gaggaggagg 


136861
gggccgggga tggcgaacat cttaaggggg gagatgggac ccgtgacacc ctaccccagt 


136921
ccccgggtcc agccgtcccg ttggccgggg atgacgagaa ggacaaaccc aaccgtcccg 


136981
tagtcccacc ccccggtccc aacaactccc ccgcgcgccc cgagaccagt cgaccgaaga 


137041
caccccccac cagtatcggg ccgctggcaa ctcgacccac gacccaactc ccctcaaagg 


137101
ggcgaccctt ggttccgacg cctcaacata ccccgctgtt ctcgttcctc actgcctccc 


137161
ccgccctgga caccctcttc gtcgtcagca ccgtcatcca caccttatcg tttgtgtgta 


137221
ttgttgctat ggcgacacac ctgtgtggtg gttggtccag acgcgggcga cgcacacacc 


137281
ctagcgtgcg ttacgtgtgc ctgccgcccg aacgcgggta gggtatgggg cggggatggg 


137341
gagagcccac acgcggaaag caagaacaat aaaggcggcg ggatctagtt gatatgcgtc 


137401
tctgggtgtt tttggggtgt ggtgggcgcg gggcggtcat tggacggggg tgcagttaaa 


137461
tacatgcccg ggacccatga agcatgcgcg acttccgggc ctcggaaccc acccgaaacg 


137521
gccaacggac gtctgagcca ggcctggcta tccggagaaa cagcacacga cttggcgttc 


137581
tgtgtgtcgc gatgtctctg cgcgcagtct ggcatctggg gcttttggga agcctcgtgg 


137641
gggctgttct tgccgccacc catctgggac ctgcggccaa cacaacggac cccttaacgc 


137701
acgccccagt gtcccctcac cccagccccc tggggggctt tgccgtcccc ctcgtagtcg 


137761
gtgggctgtg tgccgtagtc ctgggggcgg cgtgtctgct tgagctcctg cgtcgtacgt 


137821
gccgcgggtg ggggcgttac catccctaca tggacccagt tgtcgtataa ttttttcccc 


137881
cccccccttc tccgcatggg tgatgtcggg tccaaactcc cgacaccacc agctggcatg 


137941
gtataaatca ccggtgcgcc ccccaaacca tgtccggcag ggggatgggg ggcgaatgcg 


138001
gagggcaccc aacaacaccg ggctaaccag gaaatccgtg gccccggccc ccaacaaaga 


138061
tcgcggtagc ccggccgtgt gacattatcg tccataccga ccacaccgac gaatccccta 


138121
agggggaggg gccattttac gaggaggagg ggtataacaa agtctgtctt taaaaagcag 


138181
gggttaggga gttgttcggt cataagcttc agtgcgaacg accaactacc ccgatcatca 


138241
gttatcctta aggtctcttt tgtgtggtgc gttccggtat ggggggggct gccgccaggt 


138301
tgggggccgt gattttgttt gtcgtcatag tgggcctcca tggggtccgc ggcaaatatg 


138361
ccttggcgga tgcctctctc aagatggccg accccaatcg ctttcgcggc aaagaccttc 


138421
cggtcctgga ccagctgacc gaccctccgg gggtccggcg cgtgtaccac atccaggcgg 


138481
gcctaccgga cccgttccag ccccccagcc tcccgatcac ggtttactac gccgtgttgg 


138541
agcgcgcctg ccgcagcgtg ctcctaaacg caccgtcgga ggccccccag attgtccgcg 


138601
gggcctccga agacgtccgg aaacaaccct acaacctgac catcgcttgg tttcggatgg 


138661
gaggcaactg tgctatcccc atcacggtca tggagtacac cgaatgctcc tacaacaagt 


138721
ctctgggggc ctgtcccatc cgaacgcagc cccgctggaa ctactatgac agcttcagcg 


138781
ccgtcagcga ggataacctg gggttcctga tgcacgcccc cgcgtttgag accgccggca 


138841
cgtacctgcg gctcgtgaag ataaacgact ggacggagat tacacagttt atcctggagc 


138901
accgagccaa gggctcctgt aagtacgccc tcccgctgcg catccccccg tcagcctgcc 


138961
tctcccccca ggcctaccag cagggggtga cggtggacag catcgggatg ctgccccgct 


139021
tcatccccga gaaccagcgc accgtcgccg tatacagctt gaagatcgcc gggtggcacg 


139081
ggcccaaggc cccatacacg agcaccctgc tgcccccgga gctgtccgag acccccaacg 


139141
ccacgcagcc agaactcgcc ccggaagacc ccgaggattc ggccctcttg gaggaccccg 


139201
tggggacggt ggcgccgcaa atcccaccaa actggcacat cccgtcgatc caggacgccg 


139261
cgacgcctta ccatcccccg gccaccccga acaacatggg cctgatcgcc ggcgcggtgg 


139321
gcggcagtct cctggcagcc ctggtcattt gcggaattgt gtactggatg caccgccgca 


139381
ctcggaaagc cccaaagcgc atacgcctcc cccacatccg ggaagacgac cagccgtcct 


139441
cgcaccagcc cttgttttac tagatacccc cccttaatgg gtgcgggggg gtcaggtctg 


139501
cggggttggg atgggacctt aactccatat aaagcgagtc tggaaggggg gaaaggcgga 


139561
cagtcgataa gtcggtagcg ggggacgcgc acctgttccg cctgtcgcac ccacagcttt 


139621
ttcgcgaacc gtcccgtttc gggatgccgt gccgcccgtt gcagggcctg gtgctcgtgg 


139681
gcctctgggt ctgtgccacc agcctggttg tccgtggccc cacggtcagt ctggtatcaa 


139741
actcatttgt ggacgccggg gccttggggc ccgacggcgt agtggaggaa gacctgctta 


139801
ttctcgggga gcttcgcttt gtgggggacc aggtccccca caccacctac tacgatgggg 


139861
tcgtagagct gtggcactac cccatgggac acaaatgccc acgggtcgtg catgtcgtca 


139921
cggtgaccgc gtgcccacgt cgccccgccg tggcatttgc cctgtgtcgc gcgaccgaca 


139981
gcactcacag ccccgcatat cccaccctgg agctgaatct ggcccaacag ccgcttttgc 


140041
gggtccggag ggcgacgcgt gactatgccg gggtgtacgt gttacgcgta tgggtcgggg 


140101
acgcaccaaa cgccagcctg tttgtcctgg ggatggccat agccgccgaa ggtactctgg 


140161
cgtacaacgg ctcggcccat ggctcctgcg acccgaaact gcttccgtct tcggccccgc 


140221
gtctggcccc ggcgagcgta taccaacccg cccctaaccc ggcctccacc ccctcgacca 


140281
ccacctccac cccctcgacc accatccccg ctccccaagc atcgaccaca cccttcccca 


140341
cgggagaccc aaaaccccaa cctcacgggg tcaaccacga acccccatcg aatgccacgc 


140401
gagcgacccg cgactcgcga tatgcgctaa cggtgaccca gataatccag atagccatcc 


140461
ccgcgtccat tatagccctg gtgtttctgg ggagctgtat ttgctttata cacagatgtc 


140521
aacgccgcta ccgacgctcc cgccgcccga tttacagccc ccagataccc acgggcatct 


140581
catgcgcggt gaacgaagcg gccatggccc gcctcggagc cgagctcaaa tcgcatccga 


140641
gcaccccccc caaatcccgg cgccggtcgt cacgcacgcc aatgccctcc ctgacggcca 


140701
tcgccgaaga gtcggagccc gcgggggcgg ctgggcttcc gacgcccccc gtggacccca 


140761
cgacatccac cccaacgcct cccctgttgg tataggtcca cggccactgg ccgggggcac 


140821
cacataaccg accgcagtca ctgagttggg aataaaccgg tattatttac ctatatccgt 


140881
gtatgtccat ttctttcttc cccccccccc ccggaaacca aagaaggaag caaagaatgg 


140941
atgggaggag ttcaggaagc cggggagagg gcccgcggcg catttaaggc gttgttgtgt 


141001
tgactttggc tcttctggcg ggttggtgcg gtgctgtttg ttgggctccc attttacccg 


141061
aagatcggct gctatccccg ggacatggat cgcggggcgg tggtggggtt tcttctcggt 


141121
gtttgtgttg tatcgtgctt ggcgggaacg cccaaaacgt cctggagacg ggtgagtgtc 


141181
ggcgaggacg tttcgttgct tccagctccg gggcctacgg ggcgcggccc gacccagaaa 


141241
ctactatggg ccgtggaacc cctggatggg tgcggcccct tacacccgtc gtgggtctcg 


141301
ctgatgcccc ccaagcaggt gcccgagacg gtcgtggatg cggcgtgcat gcgcgctccg 


141361
gtcccgctgg cgatggcgta cgcccccccg gccccatctg cgaccggggg tctacgaacg 


141421
gacttcgtgt ggcaggagcg cgcggccgtg gttaaccgga gtctggttat tcacggggtc 


141481
cgagagacgg acagcggcct gtataccctg tccgtgggcg acataaagga cccggctcgc 


141541
caagtggcct cggtggtcct ggtggtgcaa ccggccccag ttccgacccc acccccgacc 


141601
ccagccgatt acgacgagga tgacaatgac gagggcgagg acgaaagtct cgccggcact 


141661
cccgccagcg ggaccccccg gctcccgcct ccccccgccc ccccgaggtc ttggcccagc 


141721
gcccccgaag tctcacatgt gcgtggggtg accgtgcgta tggagactcc ggaagctatc 


141781
ctgttttccc ccggggagac gttcagcacg aacgtctcca tccatgccat cgcccacgac 


141841
gaccagacct actccatgga cgtcgtctgg ttgaggttcg acgtgccgac ctcgtgtgcc 


141901
gagatgcgaa tatacgaatc gtgtctgtat cacccgcagc tcccagaatg tctgtccccg 


141961
gccgacgcgc cgtgcgccgc gagtacgtgg acgtctcgcc tggccgtccg cagctacgcg 


142021
gggtgttcca gaacaaaccc cccaccgcgc tgttcggccg aggctcacat ggagcccgtc 


142081
ccggggctgg cgtggcaggc ggcctccgtc aatctggagt tccgggacgc gtccccacaa 


142141
cactccggcc tgtatctgtg tgtggtgtac gtcaacgacc atattcacgc ctggggccac 


142201
attaccatca gcaccgcggc gcagtaccgg aacgcggtgg tggaacagcc cctcccacag 


142261
cgcggcgcgg atttggccga gcccacccac ccgcacgtcg gggcccctcc ccacgcgccc 


142321
ccaacccacg gcgccctgcg gttaggggcg gtgatggggg ccgccctgct gctgtctgcg 


142381
ctggggttgt cggtgtgggc gtgtatgacc tgttggcgca ggcgtgcctg gcgggcggtt 


142441
aaaagcaggg cctcgggtaa ggggcccacg tacattcgcg tggccgacag cgagctgtac 


142501
gcggactgga gctcggacag cgagggagaa cgcgaccagg tcccgtggct ggcccccccg 


142561
gagagacccg actctccctc caccaatgga tccggctttg agatcttatc accaacggct 


142621
ccgtctgtat acccccgtag cgacgggcat caatctcgcc gccagctcac aacctttgga 


142681
tccggaaggc ccgatcgccg ttactcccag gcctccgatt cgtccgtctt ctggtaaggc 


142741
gccccatccc gaggccccac gtcggtcgcc gaactgggcg accgccggcg aggtggacgt 


142801
cggagacgag ctaatcgcga tttccgacga acgcggaccc ccccgacatg accgcccgcc 


142861
cctcgccacg tcgaccgcgc cctcgccaca cccgcgaccc ccgggctaca cggccgttgt 


142921
ctccccgatg gccctccagg ctgtcgacgc cccctccctg tttgtcgcct ggctggccgc 


142981
tcggtggctc cggggggctt ccggcctggg ggccgtcctg tgtgggattg cgtggtatgt 


143041
gacgtcaatt gcccgaggcg cacaaagggc cggtggtccg cctagccgca gcaaattaaa 


143101
aatcgtgagt cacagcgacc gcaacttccc acccggagct ttcttccggc ctcgatgacg 


143161
tcccggctct ccgatcccaa ctcctcagcg cgatccgaca tgtccgtgcc gctttatccc 


143221
acggcctcgc cagtttcggt cgaagcctac tactcggaaa gcgaagacga ggcggccaac 


143281
gacttcctcg tacgcatggg ccgccaacag tcggtattaa ggcgttgacg cagacgcacc 


143341
cgctgcgtcg gcatggtgat cgcctgtctc ctcgtggccg ttctgtcggg cggatttggg 


143401
gcgctcctga tgtggctgct ccgctaaaag accgcatcga cacgcgcgtc cttcttgtcg 


143461
tctctcttcc cccccatcac cccgcaattt gcacccagcc tttaactaca ttaaattggg 


143521
ttcgattggc aatgttgtct cccggttgat ttttgggtgg gtggggagtg ggtgggtggg 


143581
gagtgggtgg gtggggagtg ggtgggtggg gagtgggtgg gtggggagtg ggtgggtggg 


143641
gagtgggtgg gtggggagtg ggtgggtggg gagtgggtgg gtggggagtg ggtgggtggg 


143701
gagtggcaag gaagaaacaa gcccgaccac cagacagaaa atgtaaccat acccaaaccg 


143761
actctggggg ctgtttgtgg ggtcggaacc ataggatgaa caaaccaccc cgtacctccc 


143821
gcacccaagg gtgcgggtgg ctcatcggca tctgtccggt atgggttgtt ccccacccac 


143881
tcgcgttcgg acgtcttaga atcatggcgg ttttctatgc cgacatcggt tttctccccc 


143941
gcaataagac acgatgcgat aaaatctgtt tgtaaaattt attaagggta caaattgccc 


144001
tagcacaggg gtggggttag ggccgggtcc ccacacccaa acgcaccaaa cagatgcagg 


144061
cagtgggtcg agtacagccc cgcgtacgaa cacgtcgatg cgtgtgtcag acagcaccag 


144121
aaagcacagg ccatcaacag gtcgtgcatg tgtcggtggg tttggacgcg gggggccatg 


144181
gtggtgataa agttaatggc cgccgtccgc cagggccaca ggggcgacgt ctcttggttg 


144241
gcccggagcc actgggtgtg gaccagccgc gcgtggcggc ccaacatggc ccctgtagcc 


144301
gggggcgggg gatcgcgcac gtttgcagcg cacatgcgag acacctcgac cacggttcga 


144361
aagaaggccc ggtggtccgc gggcaacatc accaggtgcg caagcgcccg ggcgtccaga 


144421
gggtagagcc ctgagtcatc cgaggttggc tcatcgcccg ggtcttgccg caagtgcgtg 


144481
tgggttgggc ttccggtggg cgggacgcga accgcggtgt ggatcccgac gcgggcccga 


144541
gcgtatgctc catcttgtgg ggagaagggg tctgggctcg ccaggggggc atacttgccc 


144601
gggctataca gacccgcgag ccgtacgtgg ttcgcggggg gtgcgtgggg tccggggctc 


144661
cctgggagac cggggttgtc gtggatccct ggggtcacgc ggtaccctgg ggtctctggg 


144721
agctcgcggt actctgggtt ccctaggttc tcggggtggt cgcggaaccc ggggctcccg 


144781
gggaacacgc ggtgtcctgg ggattgttgg cggtcggacg gcttcagatg gcttcgagat 


144841
cgtagtgtcc gcaccgactc gtagtagacc cgaatctcca cattgccccg ccgcttgatc 


144901
attatcaccc cgttgcgggg gtccggagat catgcgcggg tgtcctcgag gtgcgtgaac 


144961
acctctgggg tgcatgccgg cggacggcac gccttttaag taaacatctg ggtcgcccgg 


145021
cccaactggg gccgggggtt gggtctggct catctcgaga gacacggggg ggaaccaccc 


145081
tccgcccaga gactcgggtg atggtcgtac ccgggactca acgggttacc ggattacggg 


145141
gactgtcggt cacggtcccg ccggttcttc gatgtgccac acccaaggat gcgttggggg 


145201
cgatttcggg cagcagcccg ggagagcgca gcaggggacg ctccgggtcg tgcacggcgg 


145261
ttctggccgc ctcccggtcc tcacgccccc ttttattgat ctcatcgcgt acgtcggcgt 


145321
acgtcctggg cccaacccgc atgttgtcca ggaaggtgtc cgccatttcc agggcccacg 


145381
acatgctttt ccccccgacg agcaggaagc ggtccacgca acggtcgccg ccggtcgcct 


145441
cgacgagggc gttcctcctg cgggaaggca cgaacgcggg tgagccccct cctccgcccc 


145501
cgcgtccccc ctcctccgcc cccgcgtccc ccctcctccg cccccgcgtc ccccctcctc 


145561
cgcccccgcg tcccccctcc tccgcccccg cgtcccccct cctctgccca cccaaggtgc 


145621
ttacccgtgc aaaaaaggcg gaccggtggg tttctgtcgt cggaggcccc cggggtgcgt 


145681
cccctgtgtt tcgtgggtgg ggtgggcggg tctttccccc ccgcgtccgc gtgtcccttt 


145741
ccgatgcgat cccgatcccg agccggggcg tcgcgatgcc gacgccgtcc gctccgacgg 


145801
ccctctgcga ctcccgctcc cggtccgcgt gctccgcagc cgctcccgtc gttcgtggct 


145861
ggcgccgtct gcgggcgtcg gtcgcgccgg gcctttatgt gcgccggaga gacccgcccc 


145921
ccgccgcccg ggtccgcccc cggggccggc gcggagtcgg gcacggcgcc agtgctcgca 


145981
cttcgcccta ataatatata tatattggga cgaagtgcga acgcttcgcg ttctcacttc 


146041
ttttacccgg cggccccgcc cccttggggc ggtcccgccc gccggccaat gggggggcgg 


146101
caaggcgggc ggcccaaggg ccgcccgccg tcccgttggt cccggcgtcc ggcgggcggg 


146161
accgggggcc cggggacggc caacgggcgc gcggggctcg tatctcatta ccgccgaacc 


146221
gggaagtcgg ggcccgggcc ccgccccctg cccgttcctc gttagcatgc ggaacggaag 


146281
cggaaaccgc cggatcgggc ggtaatgaga tgccatgcgg ggcggggcgc ggacccaccc 


146341
gccctcgcgc cccgtccatg gcagatggcg cggatgggcg gggccggggg ttcgaccaac 


146401
gggccgcggc cacgggcccc cggcgtgccg gcgtcggggc ggggtcgtgc ataatggaat 


146461
tccgttcggg gtgggcccgc cggggggcgg ggggccggcg gcctccgctg ctcctccttc 


146521
ccgccggccc ctgggactat atgagcccga ggacgccccg atcgtccaca cggagcgcgg 


146581
ctgccgacac ggatccacga cccgacgcgg gaccgccaga gacagaccgt cagacgctcg 


146641
ccgcgccggg acgccgatac gcggacgaag cgcgggaggg ggatcggccg tccctgtcct 


146701
ttttccccac ccaagcatcg accggtccgc gctagttccg cgtcgacggc gggggtcgtc 


146761
ggggtccgtg ggtctcgccc cctcccccca tcgagagtcc gtaggtgacc taccgtgcta 


146821
cgtccgccgt cgcagccgta tccccggagg atcgccccgc atcggcgatg gcgtcggaga 


146881
acaagcagcg ccccggctcc ccgggcccca ccgacgggcc gccgcccacc ccgagcccag 


146941
accgcgacga gcggggggcc ctcgggtggg gcgcggagac ggaggagggc ggggacgacc 


147001
ccgaccacga ccccgaccac ccccacgacc tcgacgacgc ccggcgggac gggagggccc 


147061
ccgcggcggg caccgacgcc ggcgaggacg ccggggacgc cgtctcgtcg cgacagctgg 


147121
ctctgctggc ctccatggta gaggaggccg tccggacgat cccgacgccc gaccccgcgg 


147181
cctcgccgcc ccggaccccc gcctttctag ccgacgacga tgacggggac gagtacgacg 


147241
acgcagccga cgccgccggc gaccgggccc cggcccgggg ccgcgaacgg gaggccccgc 


147301
tacgcggcgc gtatccggac cccacggacc gcctgtcgcc gcgcccgccg gcccagccgc 


147361
cgcggagacg tcgtcacggc cggcggcggc catcggcgtc atcgacctcg tcggactccg 


147421
ggtcctcgtc ctcgtcgtcc gcatcctctt cgtcctcgtc gtccgacgag gacgaggacg 


147481
acgacggcaa cgacgcggcc gaccacgcac gcgaggcgcg ggccgtcggg cggggtccgt 


147541
cgagcgcggc gccggaagcc cccgggcgga cgccgccccc gcccgggcca ccccccctct 


147601
ccgaggccgc gcccaagccc cgggcggcgg cgaggacccc cgcggcctcc gcgggccgca 


147661
tcgagcgccg ccgggcccgc gcggcggtgg ccggccgcga cgccacgggc cgcttcacgg 


147721
ccgggcagcc ccggcgggtc gagctggacg ccgacgcggc ctccggcgcc ttctacgcgc 


147781
gctatcgcga cgggtacgtc agcggggagc cgtggcccgg cgccgggccc ccgcccccgg 


147841
ggcgggtgct gtacggcggc ctgggcgaca gccgcccggg cctctggggg gcgcccgagg 


147901
cggaggaggc gcgacgccgg ttcgaggcct cgggcgcccc ggcggccgtg tgggcgcccg 


147961
agctgggcga cgccgcgcag cagtacgccc tgatcacgcg gctgctgtac accccggacg 


148021
cggaggccat ggggtggctc cagaacccgc gcgtggtccc cggggacgtg gcgctggacc 


148081
aggcctgctt ccggatctcg ggcgccgcgc gcaacagcag ctccttcatc accggcagcg 


148141
tggcgcgggc cgtgccccac ctgggctacg ccatggcggc cggccgcttc ggctggggcc 


148201
tggcgcacgc ggcggccgcc gtggccatga gccgccgata cgaccgcgcg cagaagggct 


148261
tcctgctgac cagcctgcgc cgcgcctacg cgcccctgtt ggcgcgcgag aacgcggcgc 


148321
tgacgggggc cgcggggagc cccggcgccg gcgcagatga cgagggggtc gccgccgtcg 


148381
ccgccgccgc accgggcgag cgcgcggtgc ccgccgggta cggcgccgcg gggatcctcg 


148441
ccgccctggg gcggctgtcc gccgcgcccg cctcccccgt ggggggcgac gaccccgacg 


148501
ccgcccgcca cgccgacgcc gacgccgggc gccgcgccca ggccggccgc gtggccgtcg 


148561
agtgcctggc cgcctgccgc gggatcctgg aggcgctggc cgagggcttc gacggcgacc 


148621
tggcggccgt cccggggctg gccggggccc ggcccgccag ccccccgcgg ccggagggac 


148681
ccgcgggccc cgcttccccg ccgccgccgc acgccgacgc gccccgcctg cgcgcgtggc 


148741
tgcgcgagct gcggttcgtg cgcgacgcgc tggtgctcat gcgcctgcgc ggggacctgc 


148801
gcgtggccgg cggcagcgag gccgccgtgg ccgccgtgcg cgccgtgagc ctggtcgccg 


148861
gggccctggg ccccgcgctg ccgcgggacc cgcgcctgcc gagctccgcg gccgccgccg 


148921
ccgcggacct gctgtttgag aaccagagcc tccgccccct gctggcggcg gcggccagcg 


148981
caccggacgc cgccgacgcg ctggcggccg ccgccgcctc cgccgcgccg cgggaggggc 


149041
gcaagcgcaa gagtcccggc ccggcccggc cgcccggagg cggcggcccg cgacccccga 


149101
agacgaagaa gagcggcgcg gacgcccccg gctcggacgc ccgcgccccc ctccccgcgc 


149161
ccccctccac gcccccgggg cccgagccca cccccgccca gcccgcggcg gcccggggcg 


149221
ccgcggcgca ggcccgcccg cgccccgtgg cgctgtcgcg ccggcccgcc gagggccccg 


149281
accccctggg cggctggcgg cggcagcccc gggggcccag ccacacggcg gcgcccgcgg 


149341
ccgccgccct ggaggcctac tgctccccgc gcgccgtggc cgagctcacg gaccacccgc 


149401
tgttccccgt cccctggcga ccggccctca tgtttgaccc gcgggccctg gcctcgatcg 


149461
ccgcgcggtg cgccgggccc gcccccgccg cccaggccgc gtgcggcggc gacgacgacg 


149521
agaaccccca cccccacggg gccgccgggg gccgcctctt tggccccctg cgcgcctcgg 


149581
gcccgctgcg ccgcatggcg gcctggatgc gccagatccc cgaccccgag gacgtgcgcg 


149641
tggtggtgct gtactcgccg ctgccgggcg aggacctggc cggcggcggg gcctcggggg 


149701
ggccgccgga gtggtccgcc gagcgcggcg ggctgtcctg cctgctggcg gccctggcca 


149761
accggctgtg cgggccggac acggccgcct gggcgggcaa ctggaccggc gcccccgacg 


149821
tgtcggcgct gggcgcgcag ggcgtgctgc tgctgtccac gcgggacctg gccttcgccg 


149881
gggccgtgga gtttctgggg ctgctcgcca gcgccggcga ccggcggctc atcgtggtca 


149941
acaccgtgcg cgcctgcgac tggcccgccg acgggcccgc ggtgtcgcgg cagcacgcct 


150001
acctggcgtg cgacctgctg cccgccgtgc agtgcgccgt gcgctggccg gcggcgcggg 


150061
acctgcgccg cacggtgctg gcctcgggcc gcgtgttcgg cccgggggtc ttcgcgcgcg 


150121
tggaggccgc gcacgcgcgc ctgtaccccg acgcgccgcc gctgcgcctg tgccgcggcg 


150181
gcaacgtgcg ctaccgcgtg cgcacgcgct tcggcccgga cacgccggtg cccatgtccc 


150241
cgcgcgagta ccgccgggcc gtgctgccgg cgctggacgg ccgggcggcg gcctcgggga 


150301
ccaccgacgc catggcgccc ggcgcgccgg acttctgcga ggaggaggcc cactcgcacc 


150361
gcgcctgcgc gcgctggggc ctgggcgcgc cgctgcggcc cgtgtacgtg gcgctggggc 


150421
gcgaggcggt gcgcgccggc ccggcccggt ggcgcgggcc gcggagggac ttttgcgccc 


150481
gcgccctgct ggagcccgac gacgacgccc ccccgctggt gctgcgcggc gacgacgacg 


150541
acggcccggg ggccctgccg ccggcgccgc ccgggattcg ctgggcctcg gccacgggcc 


150601
gcagcggcac cgtgctggcg gcggcggggg ccgtggaggt gctgggggcg gaggcgggct 


150661
tggccacgcc cccgcgacgg gacgttgtgg actgggaagg cgcctgggac gaagacgacg 


150721
gcggcgcgtt cgagggggac ggggtgctgt aacgggccgg gacggggcgg ggcgcttgtg 


150781
aaacccgaag acgcaataaa cggcaacgac ctgattaagt tttgcagtag cgttgtttat 


150841
tcgaggggcg ggagggggcg aggggcggga gggggcgagg ggcgggaggg ggcgaggggc 


150901
gggagggggc gaggggcggg agggggcgag gggcgggagg gggcgagggg cgggaggggg 


150961
cgaggggcgg gagggggcga ggggcgggag ggggcgaggg gcgggagggg gcgaggggcg 


151021
ggagggggcg aggggcggga gggggcgagg ggcgggaggg ggcgaggggc gggagggggc 


151081
gaggggcggg agggggcgag gggcgggagg gggcgagggg cgggaggggg cgaggggcgg 


151141
gagggggcga ggggcgggag ggggcgaggg gcggtggtgg tgcgcgggcg cccccggagg 


151201
gtttggatct ctgacctgag attggcggca ctgaggtaga gatgcccgaa cccccccgag 


151261
ggagcgcggg acgcggctgg ggagggctgg ggctggggag ggctggggcc ggggagggct 


151321
ggggccgggg agggctgggg ccggggaggg ctggggccgg ggagggctgg ggccggggag 


151381
ggctggggct ggggagggct ggggctgggg aggggcggtg gtgtgtagca ggagcggtgt 


151441
gttgcgccgg ggtacgtctg gaggagcggg aggtgcgcgg tgacgtgtgg atgaggaaca 


151501
ggagttgttg cgcggtgagt tgtcgctgtg agttgtgttg ttgggcaggt gtggtggatg 


151561
acgtgacgtg tgacgtgcgg attgcgccgt gctttgttgg tgttgtttta cctgtggcag 


151621
cccgggcccc ccgcgggcgc gcgcgcgcgc aaaaaaggcg ggcggcggtc cgggcggcgt 


151681
gcgcgcgcgc ggcgggcgtg gggggcgggg ccgcgggagc gggggaggag cgggggagga 


151741
gcggggggag gagcgggggg aggagcgggg ggaggagcgg ggggaggagc ggggggagga 


151801
gcggggggag gagcgggggg aggagcgggg ggaggagcgg ggggaggagc ggggggagga 


151861
gcggggggag gagcgggggg aggagcgggg ggaggagcgg ggggaggagc ggggggagga 


151921
gcgggggagg agcggccaga ccccggaaac gggccccccc caaaacacac cccccggggg 


151981
tcgcgcgcgg ccctttaaag gcgggcggcg g 









REFERENCES



  • 1. Anchisi, S., Guerra, J., and Garcin, D. (2015). RIG-I ATPase activity and discrimination of self-RNA versus non-self-RNA. MBio 6, e02349.

  • 2. Bhatt, D., and Ghosh, S. (2014). Regulation of the NF-kappaB-Mediated Transcription of Inflammatory Genes. Front Immunol 5, 71.

  • 3. Bidinosti, M., Ran, I., Sanchez-Carbente, M. R., Martineau, Y., Gingras, A. C., Gkogkas, C., Raught, B., Bramham, C. R., Sossin, W. S., Costa-Mattioli, M., et al. (2010). Postnatal Deamidation of 4E-BP2 in Brain Enhances Its Association with Raptor and Alters Kinetics of Excitatory Synaptic Transmission. Molecular cell 37, 797-808.

  • 4. Chan, Y. K., and Gack, M. U. (2015). RIG-I-like receptor regulation in virus infection and immunity. Curr Opin Virol 12, 7-14.

  • 5. Chen, Z. J., Parent, L., and Maniatis, T. (1996). Site-specific phosphorylation of IkappaBalpha by a novel ubiquitination-dependent protein kinase activity. Cell 84, 853-862.

  • 6. Cui, J., Yao, Q., Li, S., Ding, X., Lu, Q., Mao, H., Liu, L., Zheng, N., Chen, S., and Shao, F. (2010). Glutamine deamidation and dysfunction of ubiquitin/NEDD8 induced by a bacterial effector family. Science 329, 1215-1218.

  • 7. da Silva, L. F., and Jones, C. (2013). Small non-coding RNAs encoded within the herpes simplex virus type 1 latency associated transcript (LAT) cooperate with the retinoic acid inducible gene I (RIG-I) to induce beta-interferon promoter activity and promote cell survival. Virus research 175, 101-109.

  • 8. Desai, P., Sexton, G. L., McCaffery, J. M., and Person, S. (2001). A null mutation in the gene encoding the herpes simplex virus type 1 UL37 polypeptide abrogates virus maturation. J Virol 75, 10259-10271.

  • 9. Deverman, B. E., Cook, B. L., Manson, S. R., Niederhoff, R. A., Langer, E. M., Rosova, I., Kulans, L. A., Fu, X., Weinberg, J. S., Heinecke, J. W., et al. (2002). Bcl-xL deamidation is a critical switch in the regulation of the response to DNA damage. Cell 111, 51-62.

  • 10. Dho, S. H., Deverman, B. E., Lapid, C., Manson, S. R., Gan, L., Riehm, J. J., Aurora, R., Kwon, K. S., and Weintraub, S. J. (2013). Control of cellular Bcl-xL levels by deamidation-regulated degradation. PLoS Biol 11, e1001588.

  • 11. Dong, X., Feng, H., Sun, Q., Li, H., Wu, T. T., Sun, R., Tibbetts, S. A., Chen, Z. J., and Feng, P. (2010). Murine gamma-herpesvirus 68 hijacks MAVS and IKKbeta to initiate lytic replication. PLoS pathogens 6, e1001001.

  • 12. Dong, X., and Feng, P. (2011). Murine gamma herpesvirus 68 hijacks MAVS and IKKbeta to abrogate NFkappaB activation and antiviral cytokine production. PLoS pathogens 7, e1002336.

  • 13. Dong, X., He, Z., Durakoglugil, D., Arneson, L., Shen, Y., and Feng, P. (2012). Murine gammaherpesvirus 68 evades host cytokine production via replication transactivator-induced RelA degradation. Journal of virology 86, 1930-1941.

  • 14. Feng, P., Moses, A., and Fruh, K. (2013). Evasion of adaptive and innate immune response mechanisms by gamma-herpesviruses. Curr Opin Virol 3, 285-295.

  • 15. Fitzgerald, K. A., McWhirter, S. M., Faia, K. L., Rowe, D. C., Latz, E., Golenbock, D. T., Coyle, A. J., Liao, S. M., and Maniatis, T. (2003). IKKepsilon and TBK1 are essential components of the IRF3 signaling pathway. Nature immunology 4, 491-496.

  • 16. Flatau, G., Lemichez, E., Gauthier, M., Chardin, P., Paris, S., Fiorentini, C., and Boquet, P. (1997). Toxin-induced activation of the G protein p21 Rho by deamidation of glutamine. Nature 387, 729-733.

  • 17. Full, F., Jungnickl, D., Reuter, N., Bogner, E., Brulois, K., Scholz, B., Sturzl, M., Myoung, J., Jung, J. U., Stamminger, T., et al. (2014). Kaposi's sarcoma associated herpesvirus tegument protein ORF75 is essential for viral lytic replication and plays a critical role in the antagonization of ND10-instituted intrinsic immunity. PLoS pathogens 10, e1003863.

  • 18. Gaspar, M., Gill, M. B., Losing, J. B., May, J. S., and Stevenson, P. G. (2008). Multiple functions for ORF75c in murid herpesvirus-4 infection. PLoS One 3, e2781.

  • 19. He, S., Zhao, J., Song, S., He, X., Minassian, A., Zhou, Y., Zhang, J., Brulois, K., Wang, Y., Cabo, J., et al. (2015). Viral pseudo-enzymes activate RIG-I via deamidation to evade cytokine production. Mol Cell 58, 134-146.

  • 20. Jacquemont, B., and Roizman, B. (1975). Rna-Synthesis in Cells Infected with Herpes-Simplex Virus 0.10. Properties of Viral Symmetric Transcripts and of Double-Stranded-Rna Prepared from Them. Journal of virology 15, 707-713.

  • 21. Kato, H., Takeuchi, O., Sato, S., Yoneyama, M., Yamamoto, M., Matsui, K., Uematsu, S., Jung, A., Kawai, T., Ishii, K. J., et al. (2006). Differential roles of MDA5 and RIG-I helicases in the recognition of RNA viruses. Nature 441, 101-105.

  • 22. Kato, H., Sato, S., Yoneyama, M., Yamamoto, M., Uematsu, S., Matsui, K., Tsujimura, T., Takeda, K., Fujita, T., Takeuchi, O., et al. (2005). Cell type-specific involvement of RIG-I in antiviral response. Immunity 23, 19-28.

  • 23. Kawai, T., Takahashi, K., Sato, S., Coban, C., Kumar, H., Kato, H., Ishii, K. J., Takeuchi, O., and Akira, S. (2005). IPS-1, an adaptor triggering RIG-I- and Mda5-mediated type I interferon induction. Nature immunology 6, 981-988.

  • 24. Kerur, N., Veettil, M. V., Sharma-Walia, N., Bottero, V., Sadagopan, S., Otageri, P., and Chandran, B. (2011). IFI16 acts as a nuclear pathogen sensor to induce the inflammasome in response to Kaposi Sarcoma-associated herpesvirus infection. Cell Host Microbe 9, 363-375.

  • 25. Kohlway, A., Luo, D., Rawling, D. C., Ding, S. C., and Pyle, A. M. (2013). Defining the functional determinants for RNA surveillance by RIG-I. EMBO Rep 14, 772-779.

  • 26. Kolakofsky, D., and Garcin, D. (2015). gammaHV68 vGAT: a viral pseudoenzyme pimping for PAMPs. Mol Cell 58, 3-4.

  • 27. Kowalinski, E., Lunardi, T., McCarthy, A. A., Louber, J., Brunel, J., Grigorov, B., Gerlier, D., and Cusack, S. (2011). Structural basis for the activation of innate immune pattern-recognition receptor RIG-I by viral RNA. Cell 147, 423-435.

  • 28. Lassig, C., Matheisl, S., Sparrer, K. M., de Oliveira Mann, C. C., Moldt, M., Patel, J. R., Goldeck, M., Hartmann, G., Garcia-Sastre, A., Hornung, V., et al. (2015). ATP hydrolysis by the viral RNA sensor RIG-I prevents unintentional recognition of self-RNA. Elife 4.

  • 29. Lieber, D., and Bailer, S. M. (2013). Determination of HSV-1 infectivity by plaque assay and a luciferase reporter cell line. Methods Mol Biol 1064, 171-181.

  • 30. Liu, X., Fitzgerald, K., Kurt-Jones, E., Finberg, R., and Knipe, D. M. (2008). Herpesvirus tegument protein activates NF-kappaB signaling through the TRAF6 adaptor protein. Proc Natl Acad Sci USA 105, 11335-11339.

  • 31. Luo, D., Ding, S. C., Vela, A., Kohlway, A., Lindenbach, B. D., and Pyle, A. M. (2011). Structural insights into RNA recognition by RIG-I. Cell 147, 409-422.

  • 32. Luo, D., Kohlway, A., and Pyle, A. M. (2013). Duplex RNA activated ATPases (DRAs): platforms for RNA sensing, signaling and processing. RNA Biol 10, 111-120.

  • 33. Medzhitov, R. (2007). Recognition of microorganisms and activation of the immune response. Nature 449, 819-826.

  • 34. Meylan, E., Curran, J., Hofinann, K., Moradpour, D., Binder, M., Bartenschlager, R., and Tschopp, J. (2005). Cardif is an adaptor protein in the RIG-I antiviral pathway and is targeted by hepatitis C virus. Nature 437, 1167-1172.

  • 35. Mycek, M. J., and Waelsch, H. (1960). The enzymatic deamidation of proteins. J Biol Chem 235, 3513-3517.

  • 36. Pitts, J. D., Klabis, J., Richards, A. L., Smith, G. A., and Heldwein, E. E. (2014). Crystal structure of the herpesvirus inner tegument protein UL37 supports its essential role in control of viral trafficking. J Virol 88, 5462-5473.

  • 37. Rasmussen, S. B., Jensen, S. B., Nielsen, C., Quartin, E., Kato, H., Chen, Z. J., Silverman, R. H., Akira, S., and Paludan, S. R. (2009). Herpes simplex virus infection is sensed by both Toll-like receptors and retinoic acid-inducible gene-like receptors, which synergize to induce type I interferon production. J Gen Virol 90, 74-78.

  • 38. Robinson, N. E., and Robinson, A. B. (2001). Molecular clocks. Proceedings of the National Academy of Sciences of the United States of America 98, 944-949.

  • 39. Sanada, T., Kim, M., Mimuro, H., Suzuki, M., Ogawa, M., Oyama, A., Ashida, H., Kobayashi, T., Koyama, T., Nagai, S., et al. (2012). The Shigella flexneri effector OspI deamidates UBC13 to dampen the inflammatory response. Nature 483, 623-U149.

  • 40. Schmidt, G., Sehr, P., Wilm, M., Seizer, J., Mann, M., and Aktories, K. (1997). Gln 63 of Rho is deamidated by Escherichia coli cytotoxic necrotizing factor-1. Nature 387, 725-729.

  • 41. Seth, R. B., Sun, L., Ea, C. K., and Chen, Z. J. (2005). Identification and characterization of MAVS, a mitochondrial antiviral signaling protein that activates NF-kappaB and IRF 3. Cell 122, 669-682.

  • 42. Sharma, S., tenOever, B. R., Grandvaux, N., Zhou, G. P., Lin, R., and Hiscott, J. (2003). Triggering the interferon antiviral response through an IKK-related pathway. Science 300, 1148-1151.

  • 43. Sen, J., Liu, X., Roller, R., and Knipe, D. M. (2013). Herpes simplex virus US3 tegument protein inhibits Toll-like receptor 2 signaling at or before TRAF6 ubiquitination. Virology 439, 65-73.

  • 44. Sun, Q., Sun, L., Liu, H. H., Chen, X., Seth, R. B., Forman, J., and Chen, Z. J. (2006). The specific and essential role of MAVS in antiviral innate immune responses. Immunity 24, 633-642.

  • 45. Takahasi, K., Yoneyama, M., Nishihori, T., Hirai, R., Kumeta, H., Narita, R., Gale, M., Jr., Inagaki, F., and Fujita, T. (2008). Nonself RNA-sensing mechanism of RIG-I helicase and activation of antiviral immune responses. Molecular cell 29, 428-440.

  • 46. Takeuchi, O., and Akira, S. (2010). Pattern recognition receptors and inflammation. Cell 140, 805-820.

  • 47. Ting, J. P., Duncan, J. A., and Lei, Y. (2010). How the noninflammasome NLRs function in the innate immune system. Science 327, 286-290.

  • 48. Unterholzner, L., Keating, S. E., Baran, M., Horan, K. A., Jensen, S. B., Sharma, S., Sirois, C. M., Jin, T., Latz, E., Xiao, T. S., et al. (2010). IFI16 is an innate immune sensor for intracellular DNA. Nature immunology 11, 997-1004.

  • 49. Wang, H., Piatkov, K. I., Brower, C. S., and Varshavsky, A. (2009). Glutamine-specific N-terminal amidase, a component of the N-end rule pathway. Mol Cell 34, 686-695.

  • 50. Weber, F., Wagner, V., Rasmussen, S. B., Hartmann, R., and Paludan, S. R. (2006). Double-stranded RNA is produced by positive-strand RNA viruses and DNA viruses but not in detectable amounts by negative-strand RNA viruses. Journal of virology 80, 5059-5064.

  • 51. Weerapana, E., Wang, C., Simon, G. M., Richter, F., Khare, S., Dillon, M. B., Bachovchin, D. A., Mowen, K., Baker, D., and Cravatt, B. F. (2010). Quantitative reactivity profiling predicts functional cysteines in proteomes. Nature 468, 790-795.

  • 52. Weintraub, S. J., and Deverman, B. E. (2007). Chronoregulation by asparagine deamidation. Science's STKE: signal transduction knowledge environment 2007, re7.

  • 53. Xu, L. G., Wang, Y. Y., Han, K. J., Li, L. Y., Zhai, Z., and Shu, H. B. (2005). VISA is an adapter protein required for virus-triggered IFN-beta signaling. Molecular cell 19, 727-740.

  • 54. Zandi, E., Rothwarf, D. M., Delhase, M., Hayakawa, M., and Karin, M. (1997). The IkappaB kinase complex (IKK) contains two kinase subunits, IKKalpha and IKKbeta, necessary for IkappaB phosphorylation and NF-kappaB activation. Cell 91, 243-252.

  • 55. Zhao, J., Li, J., Xu, S., and Feng, P. (2016). Emerging roles of regulated protein deamidation in innate immune signaling. J Virol.


Claims
  • 1. An isolated polynucleotide encoding a RIG-I mutant, complementary polynucleotides and equivalents of each thereof.
  • 2. The polynucleotide of claim 1, wherein the RIG-I mutant is RIG-I-QQ.
  • 3. The polynucleotide of claim 1, further comprising a vector or gene delivery vehicle.
  • 4. An isolated host cell comprising the isolated polynucleotide of any of claims 1-3.
  • 5. An isolated RIG-I mutant polypeptide and equivalents thereof.
  • 6. The isolated polypeptide of claim 5, wherein the RIG-I mutant is RIG-I-QQ.
  • 7. An isolated host cell comprising the polypetide of claim 5.
  • 8. An isolated host cell comprising the polynucleotide of claim 1.
  • 9. A composition comprising one or more of the polynucleotide of claim 1 and a carrier.
  • 10. A composition comprising the host cell of claim 7 or 8 and a carrier.
  • 11. An isolated polypetide encoded by the isolated polynucleotide of claim 1 or 2.
  • 12. A vaccine or therapeutic composition comprising an effective amount of the isolated polypeptide of claim 5 or 6 and a pharmaceutically acceptable carrier.
  • 13. The composition of claim 12, further comprising an adjuvant.
  • 14. An isolated mutated UL 37 polynucleotide or polypeptide encoded by the UL 37 polynucleotide or an equivalent of each thereof.
  • 15. A polynucleotide or vaccine composition comprising an effective amount of the isolated polynucleotide of claim 1 or 2.
  • 16. A composition comprising the polynucleotide or vaccine composition of claim 15 and a carrier.
  • 17. The composition of claim 16, wherein the carrier is a pharmaceutically acceptable carrier and optionally an adjuvant.
  • 18. A method for one or more of: a. inhibiting viral replication in a cell or tissue;b. abolishing 5′-ppp-RNA-binding and ATP hydrolysis is a cell or tissue;c. switching off RIG-1 in a cell or tissue;d. blocking RNA-induced activation in a cell or tissue;e. inhibiting the deamidation activity of UL37 in a cell or tissue; orf. inducing an anti-viral immune in a tissue;g. inducing expression of anti-viral cytokine genes; orh. enhancing adaptive immunity,by contacting the cell or tissue with an effective amount of the composition of claim 12.
  • 19. A method for one or more of: a. inhibiting viral replication in a cell or tissue;b. abolishing 5′-ppp-RNA-binding and ATP hydrolysis is a cell or tissue;c. switching off RIG-1 in a cell or tissue;d. blocking RNA-induced activation in a cell or tissue;e. inhibiting the deamidation activity of UL37 in a cell or tissue;f. inducing an anti-viral immune in a tissue;g. inducing expression of anti-viral cytokine genes; orh. enhancing adaptive immunity,by contacting the cell or tissue with an effective amount of the composition of claim 15.
  • 20. The method of claim 18, wherein the contacting is in vitro or in vivo.
  • 21. The method of claim 19, wherein the contacting is in vitro or in vivo.
  • 22. A method for one or more of: a. inhibiting viral replication in subject infected with a virus;b. abolishing 5′-ppp-RNA-binding and ATP hydrolysis in a subject;c. switching off RIG-1 in a subject;d. blocking RNA-induced activation in a subject;e. inhibiting the deamidation activity of UL37 in a subject;f. inducing an anti-viral immunity in a subject;g. inducing expression of anti-viral cytokine genes; orh. enhancing adaptive immunity,comprising administering to the subject an effective amount of the composition of claim 12.
  • 23. A method for one or more of: a. inhibiting viral replication in subject infected with a virus;b. abolishing 5′-ppp-RNA-binding and ATP hydrolysis in a subject;c. switching off RIG-1 in a subject;d. blocking RNA-induced activation in a subject;e. inhibiting the deamidation activity of UL37 in a subject;f. inducing an anti-viral immunity in a subject;g. inducing expression of anti-viral cytokine genes; orh. enhancing adaptive immunity,comprising administering to the subject an effective amount of the composition of claim 15.
  • 24. The method of claim 22, wherein the subject is a mammal or a human patient.
  • 25. The method of claim 23, wherein the subject is a mammal or a human patient.
  • 26. The method of claim 22, wherein the administration is one or more doses.
  • 27. The method of claim 23, wherein the administration is one or more doses.
CROSS-REFERENCE TO RELATED APPLICATION

This application claims the benefit under 35 U.S.C. 119(e) of U.S. Provisional Ser. No. 62/414,592, filed Oct. 28, 2016, the contents of which is hereby incorporated by reference into the present disclosure.

STATEMENT OF GOVERNMENT SUPPORT

This invention was made with government support under the Grant No. DE021445, awarded by the National Institute for Health. Accordingly, the U.S. Government has certain rights to the invention.

Provisional Applications (1)
Number Date Country
62414592 Oct 2016 US