Long non-coding RNA expressed in aggressive cancer

Information

  • Patent Grant
  • 10364470
  • Patent Number
    10,364,470
  • Date Filed
    Wednesday, November 16, 2016
  • Date Issued
    Tuesday, July 30, 2019
Abstract
The invention described in the application relates to a long non-coding RNA expressed in cancer. The invention thus provides methods and compositions for evaluating levels of the long non-coding RNA to assess the aggressiveness of a cancer and for modulating levels of the long non-coding RNA.
Description
REFERENCE TO SUBMISSION OF A SEQUENCE LISTING AS AN ASCII TEXT FILE

This application includes a Sequence Listing as a text file named “077429-1029648_SEQ_ST25.TXT” created on Nov. 16, 2016 and containing 119,057 bytes. The material contained in this text file is hereby incorporated by reference in its entirety for all purposes.


BACKGROUND OF THE INVENTION

For decades, only protein-encoding RNAs were thought to be involved in mediating cell functions. However, researchers have recently discovered that RNAs that do not encode proteins, such as micro ribonucleic acids (microRNAs) and long noncoding RNAs (lncRNAs), are also involved in the regulation of cellular processes. It is particularly striking to note that, although approximately 70% of genomic DNA is transcribed, only ~1.2% of genomic transcripts have been found to encode proteins (Gutschner et al., RNA Biol, 9: 703-19, 2012).


To date, thousands of lncRNA have been discovered in cells, but the function of only a very small percentage (around 1%) of these lncRNAs has been characterized (Amaral et al., Nucleic Acids Res, 39:D146-51, 2011; Wilusz et al., Genes Develop. 23:1494-1504, 2009). Interestingly, of those lncRNA that have been studied, many have been shown to play important roles in the regulation of gene expression, and roles for lncRNA as mediators of oncogenic and tumour suppressive functions have been identified in pervasive cancer types, including breast and prostate cancers.


The mechanisms by which lncRNA are thought to modulate the expression of genes are multiple and diverse. One way in which nuclear-localized lncRNAs have been shown to act is in the control of gene expression. To do this, these lncRNAs guide epigenetic factors to gene regulatory sites in the genome. The lncRNA HOTAIR has been shown to work in cooperation with and recruit epigenetic factors to regulate programs of gene expression, and HOTAIR is involved in processes that promote malignancy in cancers, including colorectal and breast cancers.


SATB1 (Dickinson et al., Cell, 70(4):631-45, 1992) is a nuclear protein that acts to regulate programs of gene expression through remodeling of the three-dimensional organization of the genome and epigenetic status of chromatin at a large number of target genomic loci. SATB1 is further described in U.S. Pat. No. 5,652,340 and antibodies made thereto are described in U.S. Pat. No. 5,869,621, which are hereby incorporated by reference.


SATB1 exhibits unique patterns of activity in the nuclei of aggressive breast cancer cells, and high levels of SATB1 expression directly correlate with a poor prognosis among breast cancer patients (Han et al., Nature, 452:187-193, 2008; Kohwi-Shigematsu et al., Seminars in Cancer Biology, 23:72-79, 2013; WO 2007/075206). In aggressive breast cancer cells, SATB1 reprograms the gene expression profile in order to promote cancer metastasis, and the knockdown of SATB1 expression in these cells blocks their tumorigenic and metastatic potential. Conversely, the ectopic expression of SATB1 in non-aggressive breast cancer cells increases their tumorigenic and metastatic potential. However, the mechanisms that operate in aggressive breast cancer to drive SATB1 expression or to guide SATB1 to its target gene loci are not yet known. Also, because SATB1 is a protein with other functions in multiple cell and tissue types, it is important to understand the mechanisms that are involved in directing the activities of SATB1 that increase the tumorigenic and, most importantly, the metastatic potential of breast cancers.


Cancers, including breast tumors, can be highly metastatic; when patients are diagnosed, there is often no way to tell if their tumor has the capacity to metastasize. This invention addresses the need to identify patients with potentially metastatic tumors vs. non-metastatic tumors.


BRIEF SUMMARY OF SOME EMBODIMENTS OF THE INVENTION

The present invention pertains to methods of screening for and determining the aggressiveness of cancer cells, e.g., breast cancer cells, based on the detection of a long non-coding RNA described herein, which is termed SAILOR. The invention further relates to methods and assay systems for rapid screening of therapeutics that target SAILOR; and use of SAILOR-targeted therapeutics for the treatment of breast cancer.


Thus, in some aspects, the invention is based on the discovery that a long intergenic noncoding RNA (lncRNA; chr3:18486850-18572715; Assembly hg19) is co-expressed with a SATB1 exon 1d variant from a common CpG island region specifically in human aggressive cancer cells, but not non-aggressive cancer cells. SATB1 has multiple splice variants that contain unique exon 1 sequences (a through d), but all encode the same SATB1 protein. The exon 1d variant, which is transcribed from the distal CpG island region, is specifically expressed in aggressive cancer cells. The lncRNA, which is referred to herein as SATB1 Adjacent intergenic lncRNA (SAILOR), is transcribed in the opposite direction from SATB1 from the same CpG island region in aggressive cancer cells. SAILOR is a nuclear lncRNA, strongly suggesting that it has a role in gene regulation. Reduction in SAILOR can reduce the growth rate of aggressive cells and SAILOR loss alters gene expression programs that typically promote tumor aggressiveness.


In some embodiments, the invention provides a method of determining the aggressiveness of a cancer, the method comprising: detecting the level of SAILOR transcripts in a sample from primary tumor tissue of a subject that has the cancer, and correlating the levels of expression with the aggressiveness of the cancer when a high level of expression is detected. The SAILOR transcript can have the sequence of SEQ ID NO:2 or 3; or the sequence of SEQ ID NO:4. In some embodiments, the detecting step comprises performing a quantitative PCR reaction or an in situ hybridization. In some embodiments, the method further comprises detecting the level of a SATB1 exon 1d transcript, e.g., SEQ ID NO:5, in the primary tumor sample. In some embodiments, detecting the level of the SATB1 exon 1d transcript comprises performing a quantitative PCR reaction or an in situ hybridization. In some embodiments, the cancer is breast cancer, pancreatic cancer, colon cancer, or prostate cancer.


In some embodiments, the invention provides a method of determining the aggressiveness of a cancer, the method comprising: detecting the level of SAILOR transcripts in an exosome sample from a subject that has the cancer, and correlating the presence of the SAILOR transcript with the aggressiveness of the cancer. The SAILOR transcript can have the sequence of SEQ ID NO:2 or 3; or the sequence of SEQ ID NO:4. The method can further comprise detecting the presence of a SATB1 transcript in the exosome sample. In some embodiments, the cancer is breast cancer, pancreatic cancer, colon cancer, or prostate cancer. In some embodiments, the method further comprises recommending that the patient receive aggressive treatment for cancer.


In a further aspect, the invention provides a method of detecting the level of expression of SAILOR in a tissue sample, the method comprising contacting a nucleic acid obtained from the tissue sample with a probe that selectively hybridizes to SEQ ID NO:2 or SEQ ID NO:3; and detecting the amount of probe hybridized to the nucleic acid, thereby determining the level of expression. In some embodiments, the probe selectively hybridizes to SEQ ID NO:4.


In a further aspect, the invention provides an isolated nucleic acid comprising the sequence of SEQ ID NO:2, SEQ ID NO:3, or SEQ ID NO:4, or a fragment of SEQ ID NO:2, SEQ ID NO:3, or SEQ ID NO:4 that comprises at least 20, 25, 30, 35, 40, 45, 50, or 75 contiguous nucleotides of SEQ ID NO:2, SEQ ID NO:3, or SEQ ID NO:4, or a complement thereof. In some embodiments, the fragment comprises at least 100, 200, 300, 400, 500, 600, or 700 contiguous nucleotides of SEQ ID NO:2 or SEQ ID NO:3, or a complement thereof; or at least 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 2000, 3000, or more contiguous nucleotides of SEQ ID NO:4, or the complement thereof. In some embodiments, the nucleic acid is attached to a fluorescent label.


The invention further provides a vector comprising a nucleic acid as set forth in the preceding paragraph.


In a further aspect, the invention provides a method of inhibiting proliferation of cancer cells, the method comprising contacting the cancer cells with an inhibitor of SAILOR that decreases the level of SAILOR. In some embodiments, the inhibitor is a nucleic acid inhibitor of SAILOR. In some embodiments, the cancer cells are breast cancer, pancreatic cancer, colon cancer, or prostate cancer cells.


Other objects, features, and advantages of the present invention will be apparent to one of skill in the art from the following detailed description and figures.





BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 is a map of the SATB1 gene locus, which is transcribed from multiple promoters to generate exon 1 variants (Exon 1a, 1b, 1c and 1d). SATB1 exon 1 is not translated and exon 2 has the ATG start site. Therefore, the SATB1 protein translated from each of these variants is identical. The far distal CpG island (>20 kb from exon 2) contains a promoter for SATB1 (exon 1d variant) and the nearby lncRNA. We refer to the lncRNA as SATB1 Adjacent Intergenic Long ncRNA, or SAILOR. The SAILOR gene spans 85.8 kb and encodes a 1.8 kb lncRNA (see zoom-out view). The distance between the transcriptional start sites of SATB1 exon 1d and SAILOR is 203 bp.
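As a quick arithmetic check on the size of the locus, the following minimal sketch computes the span of the interval chr3:18486850-18572715 (Assembly hg19) given in the summary. The helper name interval_span_bp and the treatment of the coordinates as 1-based and inclusive are assumptions made for illustration only; they are not part of the described methods.

```python
def interval_span_bp(region: str, inclusive: bool = True) -> int:
    """Return the length in bp of a 'chrom:start-end' interval string.

    Hypothetical helper: assumes 1-based inclusive coordinates by default.
    """
    _chrom, coords = region.split(":")
    start_text, end_text = coords.split("-")
    start = int(start_text.replace(",", ""))
    end = int(end_text.replace(",", ""))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    # Inclusive intervals count both endpoints, hence the +1.
    return end - start + (1 if inclusive else 0)


span = interval_span_bp("chr3:18486850-18572715")
print(f"{span} bp ({span / 1000:.2f} kb)")
```

Under these assumptions the interval covers 85,866 bp, in line with the span of roughly 85.8 kb stated for the SAILOR gene.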



FIG. 2, panels A and B, show transcription levels of SATB1 exon 1d variant and SAILOR. (A) SATB1 exon 1d variant and SAILOR are specifically expressed in aggressive breast cancer cell lines (MDA-MB-231 and BT549). This was determined by quantitative reverse transcription (RT-) PCR with specific primers for SATB1 (one in exon 1d and the second in exon 2) and primers spanning SAILOR exon 2 and 3 to identify the transcripts. (B) Non-aggressive breast cancer cell line SKBR3 and non-tumorigenic MCF10A-1 (a variant of MCF10A, which has a low ATM level; Ordinario et al., PLoS ONE, 7(12):e51786, 2012) were either untreated (−) or forced to express SATB1 protein via the transgenic SATB1 proximal exon 1a variant (+). The data show that transcription of both the SATB1 exon 1d variant and SAILOR was activated by forced SATB1 protein expression. Thus, SATB1 protein can activate SAILOR transcription.



FIG. 3 shows that SAILOR is located in nuclei of aggressive breast cancer cells. The nuclei of BT549 cells were isolated from cytoplasm using a hypotonic swelling procedure followed by mild homogenization. Total RNA was harvested from each isolated fraction (i.e., nuclear and cytoplasmic) and subjected to RT-PCR analysis using primers for SAILOR. Additional primers were used as controls: HOTAIR is a known nuclear lncRNA, GAPDH is reported to exist predominately in cytoplasmic fractions, and pre-spliced SAILOR transcript will only be found in nuclear fractions.



FIG. 4 shows that SAILOR is associated with SATB1 protein in BT549 cells. An RNA immunoprecipitation assay (RIP) in BT549 cells indicated that SAILOR is greatly enriched for association with SATB1 (11-fold) relative to coding mRNA AXL that is abundant in these cells, and compared to IgG and FoxP2 antibody controls that did not precipitate any RNA. This putative interaction suggests SAILOR and SATB1 share a common function.



FIG. 5 shows that knockdown of SAILOR slowed the growth rate of BT549 cells. We have successfully knocked down SAILOR by ~80% in BT549 cell lines with a short-hairpin RNA (shRNA) approach. Reduced levels of SAILOR did not affect SATB1 transcription or protein level. However, SAILOR knockdown cells show a slowed growth rate, suggesting that reduction of the SAILOR transcript level overrides the effect of high SATB1, which promotes cancer progression.



FIG. 6 shows that knockdown of SAILOR leads to changes in transcription levels of some specific genes known to be SATB1 targets, such as Claudin1 (CLDN1; repressed by SATB1) and fibronectin 1 (FN1; activated by SATB1). Expression of these genes was altered in the same direction when either SATB1 or SAILOR was knocked down. Therefore, at least with a small number of genes found to be SAILOR-dependent (~10% of 85 genes), SAILOR seems to affect gene expression toward cancer progression.



FIG. 7 shows that the expression level of SATB1 and SAILOR transcripts increased in aggressive types of cancer cells, as determined by qRT-PCR performed in non-aggressive cancer cells (MDA-MB-468, LNCaP, and CaCo2) versus aggressive cancer cells (MDA-MB-231, PC-3, HT29, and MiaPaca) from breast, prostate, colon, and pancreatic cancers.



FIG. 8, panels A and B, shows that cancer cells that attain resistance to drug treatments have an increased expression level of SATB1. (A) qRT-PCR analysis of MCF10A-1 cells treated individually with various drugs for 48 hrs that survived the treatment. (B) Vector control MCF10A-1 and SATB1-overexpressing MCF10A-1 cells were treated with a series of drug concentrations for 72 hrs and cell viability was assessed by MTT assay to determine LD50 of each drug.



FIG. 9 shows that SATB1 and SAILOR transcripts were present in exosomes (circulating extracellular vesicles), and the abundance of transcripts was proportional to aggressiveness of the cancer cell type. Endogenous SATB1 and SAILOR transcripts within a cell population (control breast cancer cell lines; MDA-MB-468 and MDA-MB-231), and those transcripts in exosomes isolated from conditioned medium of each breast cancer cell line and prostate cancer cell line, were detected by qRT-PCR.





DETAILED DESCRIPTION OF ASPECTS OF THE INVENTION
I. Introduction

Described herein are methods for predicting the risk of disease recurrence, disease relapse, disease progression, and/or metastatic potential of a cancer in a subject. In one aspect, the invention thus provides methods of determining the expression level, such as the RNA expression level, of SAILOR in a tumor sample from a patient, and determining whether the subject has a likelihood of a poor prognosis based on the level of expression of SAILOR compared to a reference value of expression.
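The comparison of an expression level to a reference value can be sketched as follows. This is a minimal illustration only. It assumes that SAILOR levels are measured by qRT-PCR and summarized with the widely used 2^(-ΔΔCt) relative-quantification approach, which the text does not specify. The function names, the normalization to a housekeeping gene, the Ct values, and the reference value of 2.0 are all hypothetical.

```python
def relative_expression(ct_target_sample: float, ct_norm_sample: float,
                        ct_target_control: float, ct_norm_control: float) -> float:
    """Fold change of a target transcript by the 2^(-ddCt) method (assumed here).

    Each Ct of the target is first normalized to a housekeeping gene measured
    in the same specimen, then the tumor sample is compared to the control.
    """
    delta_sample = ct_target_sample - ct_norm_sample
    delta_control = ct_target_control - ct_norm_control
    return 2.0 ** -(delta_sample - delta_control)


def exceeds_reference(level: float, reference_value: float) -> bool:
    """True when the measured level is above the chosen reference value."""
    return level > reference_value


# Illustrative Ct values only; these are not data from the source.
fold = relative_expression(24.0, 18.0, 28.0, 18.0)
print(fold, exceeds_reference(fold, reference_value=2.0))
```

In practice the reference value would be established from suitable control samples rather than fixed in advance as in this sketch.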


Evaluation of levels of SAILOR in tumor tissue will aid in selecting the treatment strategy for the patient. Importantly, an assay to detect SAILOR is easier to perform than the currently used immunohistochemical assays, which require analysis by trained pathologists. Furthermore, assays for coding mRNAs can be misleading because levels of mRNA do not always correlate with those of the proteins they encode. SAILOR therefore provides a superior marker for prognosis because it is the final, active product.


II. Definitions

As used herein, the following terms have the meanings ascribed to them unless specified otherwise.


The terms “a,” “an,” or “the” as used herein not only include aspects with one member, but also include aspects with more than one member. For instance, the singular forms “a,” “an,” and “the” include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to “a cell” includes a plurality of such cells and reference to “the agent” includes reference to one or more agents known to those skilled in the art, and so forth.


The term “cancer” refers to any disease involving an abnormal growth of cells and includes all stages and all forms of the disease that affects any tissue, organ or cell in the body. The term includes all known cancers and neoplastic conditions, whether characterized as malignant, benign, soft tissue, or solid, and cancers of all stages and grades including pre- and post-metastatic cancers. Examples of different types of cancer include, but are not limited to, breast cancer; lung cancer (e.g., non-small cell lung cancer); pancreatic cancer; digestive and gastrointestinal cancers such as colorectal cancer, gastrointestinal stromal tumors, gastrointestinal carcinoid tumors, colon cancer, rectal cancer, anal cancer, bile duct cancer, small intestine cancer, and stomach (gastric) cancer; esophageal cancer; gallbladder cancer; liver cancer; appendix cancer; ovarian cancer; renal cancer (e.g., renal cell carcinoma); cancer of the central nervous system; skin cancer; lymphomas; choriocarcinomas; head and neck cancers; osteogenic sarcomas; and blood cancers. As used herein, a “tumor” comprises one or more cancerous cells. The term “solid tumor” is used herein to refer to a mass of cancerous cells.


The terms “determining,” “assessing,” “assaying,” “measuring” and “detecting” can be used interchangeably and refer to both quantitative and semi-quantitative determinations.


The term “amount” or “level” refers to the quantity of a polynucleotide of interest, e.g., in the present invention, typically a SAILOR RNA in a sample. Such quantity may be expressed in the absolute terms, i.e., the total quantity of the polynucleotide in the sample, or in the relative terms, i.e., the concentration of the polynucleotide in the sample.


The term “nucleic acid” or “polynucleotide” or “oligonucleotide” as used herein refers to a deoxyribonucleotide or ribonucleotide. This term refers only to the primary structure of the molecule. Thus, the term includes triple-, double- and single-stranded DNA, as well as triple-, double- and single-stranded RNA. It also includes modifications, such as by methylation and/or by capping, and unmodified forms of the polynucleotide. The term encompasses nucleic acids containing known analogues of natural nucleotides which have similar or improved binding properties, for the purposes desired, as the reference nucleic acid. The term also includes nucleic acids which are metabolized in a manner similar to naturally occurring nucleotides or at rates that are improved for the purposes desired. The term also encompasses nucleic-acid-like structures with synthetic backbones. DNA backbone analogues provided by the invention include phosphodiester, phosphorothioate, phosphorodithioate, methylphosphonate, phosphoramidate, alkyl phosphotriester, sulfamate, 3′-thioacetal, methylene(methylimino), 3′-N-carbamate, morpholino carbamate, and peptide nucleic acids (PNAs); see Oligonucleotides and Analogues, a Practical Approach, edited by F. Eckstein, IRL Press at Oxford University Press (1991); Antisense Strategies, Annals of the New York Academy of Sciences, Volume 600, Eds. Baserga and Denhardt (NYAS 1992); Milligan (1993) J. Med. Chem. 36:1923-1937; Antisense Research and Applications (1993, CRC Press). PNAs contain non-ionic backbones, such as N-(2-aminoethyl) glycine units. Phosphorothioate linkages are described in WO 97/03211; WO 96/39154; Mata (1997) Toxicol. Appl. Pharmacol. 144:189-197. Other synthetic backbones encompassed by the term include methyl-phosphonate linkages or alternating methylphosphonate and phosphodiester linkages (Strauss-Soukup (1997) Biochemistry 36: 8692-8698), and benzylphosphonate linkages (Samstag (1996) Antisense Nucleic Acid Drug Dev 6: 153-156). 
The terms all encompass double- and single-stranded DNA, as well as double- and single-stranded RNA, microRNA, DNA:RNA hybrids, lncRNA, and hybrids between PNAs and DNA or RNA. As used herein, reference to a sequence by sequence identifier also explicitly encompasses reference to the complement of the sequence.


The term “hybridizing” refers to the binding, duplexing, or hybridizing of a nucleic acid molecule preferentially to a particular nucleotide sequence under stringent conditions. The term “stringent conditions” refers to conditions under which a probe will hybridize preferentially to its target subsequence, and to a lesser extent to, or not at all to, other sequences in a mixed population (e.g., a cell lysate or DNA preparation from a tissue biopsy). A “stringent hybridization” and “stringent hybridization wash conditions” in the context of nucleic acid hybridization (e.g., as in array, Southern or northern hybridizations) are sequence dependent, and are different under different environmental parameters. An extensive guide to the hybridization of nucleic acids is found in, e.g., Tijssen (1993) Laboratory Techniques in Biochemistry and Molecular Biology—Hybridization with Nucleic Acid Probes part I, Ch. 2, “Overview of principles of hybridization and the strategy of nucleic acid probe assays,” Elsevier, N.Y. Generally, highly stringent hybridization and wash conditions are selected to be about 5° C. lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched probe. Very stringent conditions are selected to be equal to the Tm for a particular probe. An example of stringent hybridization conditions for hybridization of complementary nucleic acids which have more than 100 complementary residues on an array or on a filter in a Southern or northern blot is 42° C. using standard hybridization solutions (see, e.g., Sambrook and Russell (2001) Molecular Cloning: A Laboratory Manual (3rd ed.) Vol. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor Press, NY). An example of highly stringent wash conditions is 0.15 M NaCl at 72° C. for about 15 minutes. 
An example of stringent wash conditions is a 0.2×SSC wash at 65° C. for 15 minutes. Often, a high stringency wash is preceded by a low stringency wash to remove background probe signal. An example medium stringency wash for a duplex of, e.g., more than 100 nucleotides, is 1×SSC at 45° C. for 15 minutes. An example of a low stringency wash for a duplex of, e.g., more than 100 nucleotides, is 4×SSC to 6×SSC at 40° C. for 15 minutes.
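The relationship between the Tm and a highly stringent temperature can be illustrated with a short sketch. The rule that highly stringent conditions are about 5° C. below the Tm comes from the definition above. The Tm estimate itself uses the Wallace rule (2° C. per A or T, 4° C. per G or C), a rough approximation commonly applied to short oligonucleotides; it is an assumption of this example, not a method stated in the text. The function names and the example oligonucleotide are hypothetical.

```python
def wallace_tm(oligo: str) -> float:
    """Rough Tm estimate in degrees C for a short DNA oligonucleotide.

    Uses the Wallace rule (assumed here): 2 C per A/T plus 4 C per G/C.
    """
    seq = oligo.upper()
    if not seq or any(base not in "ACGT" for base in seq):
        raise ValueError("expected a non-empty sequence of A, C, G, T")
    at_count = seq.count("A") + seq.count("T")
    gc_count = seq.count("G") + seq.count("C")
    return 2.0 * at_count + 4.0 * gc_count


def highly_stringent_temperature(tm_celsius: float, offset: float = 5.0) -> float:
    """Temperature about 5 C below the Tm, per the definition above."""
    return tm_celsius - offset


probe = "ATGCATGCATGCATGC"  # hypothetical 16-mer, not a sequence from the listing
tm = wallace_tm(probe)
print(tm, highly_stringent_temperature(tm))
```

For probes longer than about 20 nucleotides, or for non-standard salt conditions, a nearest-neighbor calculation or empirical determination would normally replace this approximation.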


The term “complementarity” refers to the ability of a nucleic acid to form hydrogen bond(s) with another nucleic acid sequence by either traditional Watson-Crick or other non-traditional types. A percent complementarity indicates the percentage of residues in a nucleic acid molecule which can form hydrogen bonds (e.g., Watson-Crick base pairing) with a second nucleic acid sequence (e.g., 5, 6, 7, 8, 9, 10 out of 10 being 50%, 60%, 70%, 80%, 90%, and 100% complementary). “Perfectly complementary” means that all the contiguous residues of a nucleic acid sequence will hydrogen bond with the same number of contiguous residues in a second nucleic acid sequence. “Substantially complementary” as used herein refers to a degree of complementarity that is at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, 99%, or 100% over a region of 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50, or more nucleotides, or refers to two nucleic acids that hybridize under stringent conditions. Typically, the total number of mismatched nucleotides over a hybridizing region is not more than 3 nucleotides for sequences about 15 nucleotides in length. Conditions under which only exactly complementary nucleic acid strands will hybridize are referred to as “stringent” or “sequence-specific” hybridization conditions. Stable duplexes of substantially complementary nucleic acids can be achieved under less stringent hybridization conditions. Those skilled in the art of nucleic acid technology can determine duplex stability empirically considering a number of variables including, for example, the length and base pair concentration of the oligonucleotides, ionic strength, and incidence of mismatched base pairs. For example, computer software for calculating duplex stability is commercially available from National Biosciences, Inc. (Plymouth, Minn.); e.g., OLIGO version 5, or from DNA Software (Ann Arbor, Mich.), e.g., Visual OMP 6. 
Stringent conditions are sequence-dependent and will be different in different circumstances. Generally, stringent conditions are selected to be about 5° C. lower to 5° C. higher than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of the duplex strands have dissociated. Relaxing the stringency of the hybridizing conditions will allow sequence mismatches to be tolerated; the degree of mismatch tolerated can be controlled by suitable adjustment of the hybridization conditions.
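The percent complementarity described above (e.g., 8 of 10 residues paired being 80%) can be computed as in the following minimal sketch. It assumes two strands of equal length, both written 5′ to 3′ and aligned antiparallel with no gaps, and it counts only Watson-Crick pairs; the function name and example sequences are hypothetical.

```python
# Watson-Crick pairs for DNA and RNA; non-traditional pairs are not counted here.
WATSON_CRICK = {("A", "T"), ("T", "A"), ("G", "C"), ("C", "G"),
                ("A", "U"), ("U", "A")}


def percent_complementarity(first: str, second: str) -> float:
    """Percentage of positions in `first` that pair with `second`.

    Both strands are given 5'->3'; `second` is reversed so that the two
    strands are compared in antiparallel orientation.
    """
    first, second = first.upper(), second.upper()
    if not first or len(first) != len(second):
        raise ValueError("strands must be non-empty and of equal length")
    paired = sum((a, b) in WATSON_CRICK
                 for a, b in zip(first, reversed(second)))
    return 100.0 * paired / len(first)


target = "ACGTACGTAC"
print(percent_complementarity(target, "GTACGTACGT"))  # exact reverse complement
print(percent_complementarity(target, "AAACGTACGT"))  # two mismatched positions
```

With these inputs the first comparison is perfectly complementary (10 of 10 positions) and the second pairs at 8 of 10 positions.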


The term “probe” refers to an oligonucleotide that selectively hybridizes to a target nucleic acid under suitable conditions. A probe for detection of the biomarker sequences described herein can be any length, e.g., from 15-500 bp in length. Typically, in probe-based assays, hybridization probes that are less than 50 bp are preferred.


The term “target sequence” or “target region” refers to a region of a nucleic acid that is to be analyzed and comprises the sequence of interest.


The term “standard control” as used herein in the context of determining the level of a SAILOR polynucleotide, refers to an amount or concentration of a polynucleotide sequence that is present in an established tissue sample, e.g., a healthy, non-cancer tissue sample. The standard control value is suitable for use in a method of the present invention, to serve as a basis for comparing the amount of a specific mRNA or protein that is present in a test sample. An established sample serving as a standard control provides an average amount of a specific mRNA or protein that is typical in a normal tissue sample. A standard control value may vary depending on the nature of the sample as well as other factors such as the gender, age, and ethnicity of the subjects based on whom such a control value is established.


The term “treatment,” “treat,” or “treating” refers to a method of reducing the effects of a disease or condition (e.g., cancer) or symptom of the disease or condition. Thus, in the disclosed method, treatment can refer to a 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, or 100% reduction in the severity of an established disease or condition or symptom of the disease or condition. For example, a method of treating a disease is considered to be a treatment if there is a 10% reduction in one or more symptoms of the disease in a subject as compared to a control. Thus, the reduction can be a 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100% or any percent reduction between 10 and 100% as compared to native or control levels. It is understood that treatment does not necessarily refer to a cure or complete ablation of the disease, condition, or symptoms of the disease or condition.


The term “sample” includes a biological sample or a sample from a biological source. Biological samples include samples from body fluids, e.g., blood, plasma, serum, or urine, or samples derived, e.g., by biopsy, from cells, tissues or organs, preferably tumor tissue suspected to include or essentially consist of cancer cells.


The term “recommending” or “suggesting,” as used herein in the context of a treatment of cancer, refers to making a suggestion or a recommendation for therapeutic intervention (e.g., drug therapy, adjunctive therapy, etc.) and/or disease management which are specifically applicable to the patient.


The terms “responsive,” “clinical response,” “positive clinical response,” and the like, as used in the context of a patient's response to an anticancer therapy, are used interchangeably and refer to a favorable patient response to a drug as opposed to unfavorable responses, i.e. adverse events. In a patient, beneficial response can be expressed in terms of a number of clinical parameters, including loss of detectable tumor (complete response, CR), decrease in tumor size and/or cancer cell number (partial response, PR), tumor growth arrest (stable disease, SD), enhancement of anti-tumor immune response, possibly resulting in regression or rejection of the tumor; relief, to some extent, of one or more symptoms associated with the tumor; increase in the length of survival following treatment; and/or decreased mortality at a given point of time following treatment. Continued increase in tumor size and/or cancer cell number and/or tumor metastasis is indicative of lack of beneficial response to treatment. In a population the clinical benefit of a drug, i.e., its efficacy can be evaluated on the basis of one or more endpoints. For example, analysis of overall response rate (ORR) classifies as responders those patients who experience CR or PR after treatment with drug. Analysis of disease control (DC) classifies as responders those patients who experience CR, PR or SD after treatment with drug.
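The two population-level endpoints defined above can be illustrated with a minimal sketch: overall response rate (ORR) counts patients with CR or PR as responders, and disease control (DC) counts patients with CR, PR, or SD. The function name, the use of the label "PD" for patients without a beneficial response, and the example cohort are assumptions made for illustration; they are not data from the source.

```python
from collections import Counter


def response_rates(best_responses):
    """Return (ORR, DC rate) as fractions of all evaluated patients.

    ORR responders: CR or PR. DC responders: CR, PR, or SD.
    Any other label (e.g., the hypothetical "PD") counts as a non-responder.
    """
    counts = Counter(label.upper() for label in best_responses)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no patients to evaluate")
    orr = (counts["CR"] + counts["PR"]) / total
    dc = (counts["CR"] + counts["PR"] + counts["SD"]) / total
    return orr, dc


# Hypothetical cohort of eight patients.
cohort = ["CR", "PR", "PR", "SD", "PD", "PD", "SD", "PD"]
orr, dc = response_rates(cohort)
print(f"ORR = {orr:.1%}, DC = {dc:.1%}")
```

In this hypothetical cohort, three of eight patients are ORR responders and five of eight are DC responders.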


A positive clinical response can be assessed using any endpoint indicating a benefit to the patient, including, without limitation, (1) inhibition, to some extent, of tumor growth, including slowing down and complete growth arrest; (2) reduction in the number of tumor cells; (3) reduction in tumor size; (4) inhibition (i.e., reduction, slowing down or complete stopping) of tumor cell infiltration into adjacent peripheral organs and/or tissues; (5) inhibition of metastasis; (6) enhancement of anti-tumor immune response, possibly resulting in regression or rejection of the tumor; (7) relief, to some extent, of one or more symptoms associated with the tumor; (8) increase in the length of survival following treatment; and/or (9) decreased mortality at a given point of time following treatment. Positive clinical response may also be expressed in terms of various measures of clinical outcome. Positive clinical outcome can also be considered in the context of an individual's outcome relative to an outcome of a population of patients having a comparable clinical diagnosis, and can be assessed using various endpoints such as an increase in the duration of recurrence-free interval (RFI), an increase in the time of survival as compared to overall survival (OS) in a population, an increase in the time of disease-free survival (DFS), an increase in the duration of distant recurrence-free interval (DRFI), and the like. Additional endpoints include a likelihood of any event (AE)-free survival, a likelihood of metastatic relapse (MR)-free survival (MRFS), a likelihood of disease-free survival (DFS), and a likelihood of distant metastasis-free survival (DMFS). An increase in the likelihood of positive clinical response corresponds to a decrease in the likelihood of cancer recurrence or relapse.


The term “good prognosis” refers to the prediction of the likelihood of disease-specific survival, overall survival or disease free survival, including partial remission, complete remission, and suppression of cancer cell proliferation and/or metastasis. A good prognosis for a patient with a solid tumor cancer includes a positive response rate in terms of disease remission or tumor shrinkage, or any other form of evaluating reduced tumor burden or growth. A good prognosis can be measured as the length (time) of survival.


The term “poor prognosis” refers to the prediction of the likelihood of cancer-attributable death or progression, including recurrence, relapse and/or metastatic spread.


The term “overall survival” refers to the time interval from either the time of diagnosis or the start of treatment that the patient is still alive.


The term “progression-free survival” refers to the time interval from treatment of the patient until the progression of cancer or death of the patient, whichever occurs first.


The term “responder” or “responsive” refers to a patient who has cancer, and who exhibits a beneficial clinical response following treatment with a cancer therapy.


The term “non-responder” or “non-responsive” refers to a patient who has a cancer, and who does not exhibit a beneficial clinical response following treatment with a cancer therapy.


The term “subject” as used herein is intended to cover an animal, preferably a mammal. Examples of subjects include humans, non-human primates, dogs, cows, horses, pigs, sheep, goats, cats, mice, rabbits, rats, and transgenic non-human animals. In some embodiments, the subject is a human, e.g., a human suffering from, at risk of suffering from, or potentially capable of suffering from cancers.


The term “score” refers to a statistically derived value that can provide physicians and caregivers valuable diagnostic and prognostic insight. In some instances, the score provides a projected risk of disease recurrence/relapse or disease progression, a projected rate of disease progression, and/or a projected response to a particular therapy. An individual's score can be compared to a reference score or a reference score scale to determine risk of disease recurrence/relapse or to assist in the selection of therapeutic intervention or disease management approaches.


The term “SAILOR score” or “SAILOR value” refers to an expression score, i.e., a score based on the level of SAILOR RNA.


III. Detailed Descriptions of Embodiments

The methods described herein are based, in part, on the discovery that SAILOR is expressed at high levels in aggressive cancer cells. The methods described herein can be used to evaluate a primary tumor to determine the likelihood of tumor progression. Non-limiting examples of cancer include breast cancer, pancreatic cancer, lung cancer, colorectal cancer, digestive and gastrointestinal cancers such as colorectal cancer, gastrointestinal stromal tumors, gastrointestinal carcinoid tumors, colon cancer, rectal cancer, anal cancer, bile duct cancer, small intestine cancer, and stomach (gastric) cancer; esophageal cancer; gallbladder cancer; liver cancer; pancreatic cancer; appendix cancer; bladder cancer; prostate cancer; cervical cancer, uterine cancer, ovarian cancer; renal cancer (e.g., renal cell carcinoma); cancer of the central nervous system, e.g., lower grade brain cancer; skin cancer; lymphomas; head and neck cancers; adrenocortical cancer; and thyroid cancer. In some embodiments, primary breast cancer tissue is evaluated for SAILOR transcript levels. In some embodiments, primary pancreatic cancer tissue is evaluated for SAILOR transcript levels.


In some embodiments, a subject may have an early stage cancer, e.g., Stage I or Stage II cancer, such as an early stage breast cancer, pancreatic cancer, lung cancer, gastric cancer, or ovarian cancer.


The tumor sample can be any biological sample comprising cancer cells. In some embodiments, the tumor sample is a fresh or archived sample obtained from a primary tumor, e.g., by a tumor biopsy or fine needle aspirate. The sample also can be any biological fluid containing cancer cells. The tumor sample can be isolated or obtained from any number of primary tumors, including, but not limited to, tumors of the breast, pancreas, lung, prostate, brain, liver, kidney, intestines, colon, spleen, thymus, testis, ovary, uterus, and the like. In some embodiments, the tumor sample is from a tumor cell line. The collection of a tumor sample from a subject is performed in accordance with the standard protocol generally followed by hospitals or clinics, such as during a biopsy.


In some embodiments, SAILOR transcripts are detected in exosomes. Detection of SAILOR in exosomes is indicative of cancer that is progressing. Exosomes (reported diameter of between 30 and 100 nm) are vesicles released by all cells and are present in many biological fluids, including blood, urine, cerebrospinal fluid and saliva. Exosomes contain RNA, including mRNA, microRNA, lncRNA and other RNA species, as well as DNA and proteins, from their cell of origin (see, e.g., Raposo & Stoorvogel, J. Cell. Biol 200:373-383, 2013). In some embodiments, SAILOR or SATB1 transcripts are identified in exosomes present in plasma or serum samples from a patient. In some embodiments, exosomes for evaluation are at least partially purified.


The transcript level of SAILOR in a sample can be determined by any suitable method known in the art. Measurement of the level can be direct or indirect. For example, the abundance levels of SAILOR can be directly quantitated. Alternatively, the amount of SAILOR can be determined indirectly by measuring abundance levels of cDNAs, amplified RNAs or DNAs, or by measuring quantities or activities of RNAs, or other molecules that are indicative of the expression level of SAILOR. Although the following section describes detection of SAILOR lncRNA, one of skill understands that similar methodology can be employed to detect other transcripts, such as SATB1 exon 1d transcripts.


The invention employs routine techniques in the field of recombinant genetics. Basic texts disclosing the general methods of use in this invention include Green and Sambrook (2012) Molecular Cloning: A laboratory manual 4th ed. Cold Spring Harbor Laboratory Press; and Current Protocols in Molecular Biology (and supplements through supplement 110, 2015) John Wiley and Sons.


The level of SAILOR expression can be detected or measured by a variety of methods, including an amplification assay, a hybridization assay, a sequencing assay, or an array. Non-limiting examples of such methods include reverse-transcription polymerase chain reaction (RT-PCR); quantitative real-time PCR (qRT-PCR); quantitative PCR, such as TaqMan®; Northern blotting; in situ hybridization assays; microarray analysis, e.g., microarrays from NanoString Technologies; multiplexed hybridization-based assays, e.g., QuantiGene 2.0 Multiplex Assay from Panomics; serial analysis of gene expression (SAGE); cDNA-mediated annealing, selection, extension, and ligation; nucleic acid immunoassay, direct sequencing or pyrosequencing; massively parallel sequencing; next generation sequencing; high performance liquid chromatography (HPLC) fragment analysis; capillary electrophoresis; mass spectrometry, including SELDI, MALDI; and other known methods.


Various methods involving amplification reactions and/or reactions in which probes are linked to a solid support and used to quantify RNA may be used. Alternatively, the RNA, or DNA copy of the RNA, may be linked to a solid support and quantified using a probe to the sequence of interest.


In some embodiments, the target RNA is first reverse transcribed and the resulting cDNA is quantified. In some embodiments, RT-PCR or other quantitative amplification techniques are used to quantify the target RNA. Amplification of cDNA using PCR is well known (see U.S. Pat. Nos. 4,683,195 and 4,683,202; PCR PROTOCOLS: A GUIDE TO METHODS AND APPLICATIONS (Innis et al., eds, 1990)). Methods of quantitative amplification are disclosed in, e.g., U.S. Pat. Nos. 6,180,349; 6,033,854; and 5,972,602, as well as in, e.g., Gibson et al., Genome Research 6:995-1001 (1996); DeGraves, et al., Biotechniques 34(1):106-10, 112-5 (2003); Deiman B, et al., Mol Biotechnol. 20(2):163-79 (2002). Alternative methods for determining the level of SAILOR in a sample may involve other nucleic acid amplification methods such as ligase chain reaction (Barany (1991) Proc. Natl. Acad. Sci. USA 88:189-193), self-sustained sequence replication (Guatelli et al. (1990) Proc. Natl. Acad. Sci. USA 87:1874-1878), transcriptional amplification system (Kwoh et al. (1989) Proc. Natl. Acad. Sci. USA 86:1173-1177), Q-Beta Replicase (Lizardi et al. (1988) Bio/Technology 6:1197), rolling circle replication (U.S. Pat. No. 5,854,033) or any other nucleic acid amplification method, followed by the detection of the amplified molecules using techniques well known to those of skill in the art.


In some embodiments, RNA (or a copy) is immobilized on a solid surface and contacted with a probe, e.g., in a microarray, dot blot or Northern format. A skilled artisan can readily adapt known RNA detection methods for use in detecting the level of SAILOR.


In some embodiments, microarrays are employed. DNA microarrays provide one method for the simultaneous measurement of the expression levels of large numbers of genes. Each array consists of a reproducible pattern of capture probes attached to a solid support. Labeled RNA or DNA is hybridized to complementary probes on the array and then detected by laser scanning. Hybridization intensities for each probe on the array are determined and converted to a quantitative value representing relative gene expression levels. See U.S. Pat. Nos. 6,040,138, 5,800,992, 6,020,135, 6,033,860, and 6,344,316. High-density oligonucleotide arrays are particularly useful for determining the gene expression profile for a large number of RNAs in a sample.


Techniques for the synthesis of these arrays using mechanical synthesis methods are described in, e.g., U.S. Pat. No. 5,384,261. Although a planar array surface is often employed, the array may be fabricated on a surface of virtually any shape or even a multiplicity of surfaces. Arrays may be peptides or nucleic acids on beads, gels, polymeric surfaces, fibers such as fiber optics, glass or any other appropriate substrate; see U.S. Pat. Nos. 5,770,358, 5,789,162, 5,708,153, 6,040,193 and 5,800,992. Arrays may be packaged in such a manner as to allow for diagnostics or other manipulation of an all-inclusive device.


In some embodiments, gene-specific probes and/or primers are used in hybridization assays to detect RNA expression. The probes and/or primers may be labeled with any detectable moiety or compound, such as a radioisotope, fluorophore, chemiluminescent agent, and enzyme.


Probes and primers can be selected using known algorithms that take into account binding energies, base composition, sequence complexity, cross-hybridization binding energies, and secondary structure. See, e.g., PCT Patent Publication WO 01/05935, published Jan. 25, 2001; Hughes et al., Nat. Biotech. 19:342-7 (2001).


The probes and primers necessary for practicing the present invention can be synthesized and labeled using well known techniques. Oligonucleotides used as probes and primers may be chemically synthesized according to the solid phase phosphoramidite triester method first described by Beaucage and Caruthers, Tetrahedron Letts., 22:1859-1862, 1981, using an automated synthesizer, as described in Needham-Van Devanter et al., Nucleic Acids Res. 12:6159-6168, 1984.


In some embodiments, probes can be obtained, e.g., by polymerase chain reaction (PCR) amplification of genomic DNA or RNA or cloned sequences. PCR primers are selected based on a known sequence of the genome that will result in amplification of specific fragments of genomic DNA. Computer programs that are well known in the art are useful in the design of primers with the required specificity and optimal amplification properties, such as Oligo version 5.0 (National Biosciences). Typically each probe is between 10 bases and 50,000 bases, usually between 300 bases and 1,000 bases in length. It will be apparent to one skilled in the art that controlled robotic systems are useful for isolating and amplifying nucleic acids.


In some embodiments, in situ hybridization is employed to assess SAILOR transcript levels.


A probe for measuring SAILOR transcript level may hybridize anywhere within SAILOR (SEQ ID NO:2, SEQ ID NO:3, or SEQ ID NO:4), provided that it allows selective measurement of SAILOR.


The level of SAILOR can be normalized to a reference level for a control gene. The control value can be predetermined, determined concurrently, or determined after a sample is obtained from the subject. The standard can be run in the same assay or can be a known standard from a previous assay. In some embodiments, a normalized level of SAILOR can be transformed into a score for likelihood of progression.


Determining the Likelihood of Progression


After determining the level of SAILOR transcript, one of skill can correlate the level with risk of having an aggressive cancer, where a high level of expression indicates an aggressive cancer. For example, in some embodiments, a high level of expression is determined relative to a reference scale, e.g., SAILOR expression levels obtained from corresponding tissue from a population of normal reference subjects that do not have cancer or that have a non-aggressive form of the same cancer as the patient. The subjects in the reference population can be within the appropriate parameters, if applicable, for the purpose of screening for and/or monitoring cancer using the methods provided herein. Optionally, the reference subjects are of the same gender, similar age, or similar ethnic background as the test subject. In some embodiments, the reference subjects may have a benign or non-malignant form of the cancer of interest. Thus, in some embodiments, determination of “high” level of expression is relative to levels in benign tumors that do not progress. In some embodiments, a “high” level of expression is at least 10-fold, at least 20-fold, 25-fold, 30-fold, or 50-fold greater, when normalized, than that observed in the corresponding normal tissue. In some embodiments, “high” expression is at least 40 or 50-fold greater than in normal tissue.
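The fold-change comparison described above can be sketched as follows. This is a minimal illustration only: the function names and the default 10-fold cutoff are assumptions chosen for the example (the cutoff is one of several values recited above), and the input levels are assumed to have already been normalized to a control gene.

```python
def fold_change(tumor_level: float, normal_level: float) -> float:
    """Ratio of the normalized SAILOR level in tumor tissue to the
    normalized level in corresponding normal (or benign) tissue."""
    if normal_level <= 0:
        raise ValueError("normal_level must be positive")
    return tumor_level / normal_level


def is_high_expression(tumor_level: float, normal_level: float,
                       min_fold: float = 10.0) -> bool:
    """Return True when the tumor level is at least `min_fold` times
    the reference level. `min_fold` is a hypothetical parameter; the
    text recites cutoffs such as 10-, 20-, 25-, 30-, 40- or 50-fold."""
    return fold_change(tumor_level, normal_level) >= min_fold
```

A stricter embodiment (e.g., at least 40-fold) would simply pass `min_fold=40.0`.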


The status of the reference subjects can be confirmed by well established, routinely employed methods including but not limited to general physical examination of the individuals and general review of their medical history. As understood by one of skill in the art, a reference population is of a size sufficient to establish that the samples obtained from the group can be reasonably regarded as representative of the normal or average level among this population of subjects.


In some embodiments, a patient is considered to have a tumor with a high risk of progression when the level of SAILOR exceeds a threshold level. To establish a threshold value, an average value is determined based on the individual values found in each subject of the selected reference group. For example, a risk score over the threshold value can indicate a more than average likelihood of cancer progression whereas a risk score below the threshold value can indicate an average or below-average likelihood of cancer progression. In some embodiments, a standard deviation is also determined during the same process. In some cases, separate threshold values may be established for separately defined groups having distinct characteristics such as age, gender, or ethnic background. The patient's risk score is compared to one or more threshold values. In some embodiments, the risk score is deemed “high” if it is at least 2, 3, 4, 5, 10, 15, 20 or more standard deviations greater than the threshold value. In other embodiments, the risk score is “low” or below the threshold if it is at least 2, 3, 4, 5, 10, 15, 20 or more standard deviations lower than the threshold value.
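By way of illustration, the threshold comparison described above might be computed as in the following sketch. The function names, the use of the sample standard deviation, and the “indeterminate” label for scores falling between the two cutoffs are assumptions made for the example and are not specified in the text.

```python
from statistics import mean, stdev


def reference_threshold(reference_levels):
    """Return (threshold, standard deviation) for a reference group.
    The threshold is the average of the individual reference values;
    the sample standard deviation is used here as an assumption."""
    if len(reference_levels) < 2:
        raise ValueError("at least two reference values are required")
    return mean(reference_levels), stdev(reference_levels)


def classify_risk(score, threshold, sd, n_sd=2.0):
    """Label a risk score relative to the threshold.
    `n_sd` is the number of standard deviations required (2 by
    default; the text also recites 3, 4, 5, 10, 15, 20 or more)."""
    if sd <= 0:
        raise ValueError("sd must be positive")
    z = (score - threshold) / sd
    if z >= n_sd:
        return "high"
    if z <= -n_sd:
        return "low"
    return "indeterminate"  # hypothetical label, not from the text
```

Separate thresholds for groups defined by age, gender, or ethnic background could be obtained by calling `reference_threshold` once per group.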


In some embodiments, e.g., where SAILOR levels are evaluated using quantitative PCR, the test level and the control level may be expressed as a mean comparative quantification (Cq) test value and a mean comparative quantification (Cq) control value (delta Cq method). In such a case, the mean Cq test value and a mean Cq control value are normalized by an internal control. For example, in tumor tissue samples, the difference of threshold cycle (Cq) values obtained for SAILOR and internal control in a cancer specimen is compared to the difference of the Cq values obtained in adjacent normal tissue. The delta-delta Cq method may then be used to calculate the relative expression values between tissue samples.
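The delta-delta Cq calculation described above can be sketched as follows. The sketch assumes that amplification efficiency is approximately 100% (i.e., product doubles each cycle), which is the assumption underlying the commonly used 2^(-ΔΔCq) formula; the function and parameter names are hypothetical.

```python
def delta_cq(cq_target: float, cq_internal_control: float) -> float:
    """Cq of the target (e.g., SAILOR) minus Cq of the internal control."""
    return cq_target - cq_internal_control


def relative_expression(cq_sailor_tumor: float, cq_control_tumor: float,
                        cq_sailor_normal: float, cq_control_normal: float) -> float:
    """Relative SAILOR expression in tumor versus adjacent normal tissue.
    Assumes roughly 100% amplification efficiency, so that one cycle
    corresponds to a two-fold difference in template."""
    ddcq = (delta_cq(cq_sailor_tumor, cq_control_tumor)
            - delta_cq(cq_sailor_normal, cq_control_normal))
    return 2.0 ** (-ddcq)
```

For example, with mean Cq values of 22 (SAILOR) and 18 (internal control) in the cancer specimen and 27 and 18 in adjacent normal tissue, the delta Cq values are 4 and 9, the delta-delta Cq is -5, and the relative expression is 2^5 = 32-fold.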


The level of expression of SAILOR can be evaluated alone or in combination with other markers, such as the level of SATB1 exon 1d or other clinical parameters.


In some embodiments, a computer-based analysis program is used to translate the raw data generated by the detection methods described herein (e.g., the presence, absence, or amount of a given marker or markers) into a risk score of predictive value to a clinician.


A high level of expression of SAILOR is indicative of an aggressive cancer. Aggressive cancer can often be associated with a poor prognosis, e.g., cancer relapse, cancer recurrence, cancer progression, and/or local or distant metastasis. In some cases, high levels of SAILOR can predict that the subject will have poor metastatic relapse-free survival (MRFS), such as a short term MRFS, no MRFS, or a below-average probability of MRFS. In other cases, high levels of SAILOR are indicative that the subject will have poor any event (AE)-free survival, e.g., a short term AE-free survival, no AE-free survival, or a below-average probability of AE-free survival.


As explained above, SAILOR transcript level is often used in conjunction with other diagnostic criteria, including both clinical and pathology evaluations. For example, in breast cancer, the estrogen receptor status is also typically determined.


In some embodiments, the presence of an aggressive cancer can be detected by detecting the presence of SAILOR and/or SATB1 transcripts in exosomes from a patient. In this context, detecting the “presence” of the transcripts in exosomes means that the level is statistically significantly higher, e.g., 1.5-fold, 1.6-fold, 1.7-fold, 1.8-fold, 1.9-fold, or preferably 2.0-fold or greater, than a control value. The control value may represent the copy number or concentration of the transcript from a corresponding exosome sample from a normal patient, or a patient that has a non-aggressive form of the cancer.


Any of the methods described herein for determining risk of having an aggressive cancer may be totally or partially performed with a computer system including one or more processors, which can be configured to perform the steps. Thus, embodiments are directed to computer systems configured to perform the steps of any of the methods described herein, potentially with different components performing a respective step or a respective group of steps.


It should be understood that any of the embodiments of the present disclosure can be implemented in the form of control logic using hardware (e.g., an application specific integrated circuit or field programmable gate array) and/or using computer software with a generally programmable processor in a modular or integrated manner. As used herein, a processor includes a multi-core processor on a same integrated chip, or multiple processing units on a single circuit board or networked. Based on the disclosure and teachings provided herein, a person of ordinary skill in the art will know and appreciate other ways and/or methods to implement embodiments of the present disclosure using hardware and a combination of hardware and software.


Any of the software components or functions described in this application may be implemented as software code to be executed by a processor using any suitable computer language such as, for example, Java, C++ or Perl using, for example, conventional or object-oriented techniques. The software code may be stored as a series of instructions or commands on a computer readable medium for storage and/or transmission. Suitable media include random access memory (RAM), a read only memory (ROM), a magnetic medium such as a hard-drive or a floppy disk, or an optical medium such as a compact disk (CD) or DVD (digital versatile disk), flash memory, and the like. The computer readable medium may be any combination of such storage or transmission devices.


Such programs may also be encoded and transmitted using carrier signals adapted for transmission via wired, optical, and/or wireless networks conforming to a variety of protocols, including the Internet. As such, a computer readable medium according to an embodiment of the present invention may be created using a data signal encoded with such programs. Computer readable media encoded with the program code may be packaged with a compatible device or provided separately from other devices (e.g., via Internet download). Any such computer readable medium may reside on or within a single computer product (e.g. a hard drive, a CD, or an entire computer system), and may be present on or within different computer products within a system or network. A computer system may include a monitor, printer, or other suitable display for providing any of the results mentioned herein to a user.


Selecting Cancer Therapies


In some embodiments, the methods of the present disclosure also include selecting a therapeutic intervention for a subject with cancer having a high SAILOR transcript level score. For example, a more aggressive therapy may be selected for patients exhibiting high levels of SAILOR. For example, multiple therapies, including chemotherapy, may be employed. Radiation therapy, chemotherapy, drug therapy (e.g., hormone therapy), immunotherapy, surgery, or any combination thereof can be selected. In some instances, surgery and an adjuvant therapy, such as chemotherapy, drug therapy (e.g., hormone therapy), immunotherapy, or any combination thereof, are suggested to the subject.


Non-limiting examples of useful chemotherapy agents include alkylating agents, e.g., cyclophosphamide, mechlorethamine, chlorambucil, ifosfamide, melphalan, streptozocin, carmustine, lomustine, busulfan, dacarbazine, temozolomide, thiotepa, and altretamine; antimetabolites, e.g., 5-fluorouracil (5-FU), 6-mercaptopurine (6-MP), capecitabine, cytarabine, floxuridine, gemcitabine, hydroxyurea, methotrexate, and pemetrexed; anthracyclines, e.g., daunorubicin, doxorubicin, epirubicin, and idarubicin; other anti-tumor antibiotics, e.g., actinomycin-D, bleomycin, mitomycin-c and mitoxantrone; topoisomerase inhibitors, e.g., topotecan, irinotecan, etoposide, teniposide, and mitoxantrone; mitotic inhibitors, e.g., paclitaxel, docetaxel, ixabepilone, vinblastine, vincristine, vinorelbine, and estramustine; corticosteroids, e.g., prednisone, methylprednisone, and dexamethasone; L-asparaginase; and the proteasome inhibitor bortezomib. Without limitation, examples of radiation therapy include external beam radiation therapy, internal radiation therapy, or systemic radiation therapy.


The therapeutic intervention can include one or more DNA damaging compounds or agents, such as cisplatin, carboplatin, oxaliplatin, picoplatin, other platinum-based compounds, doxorubicin, daunorubicin, other anthracyclines, variants thereof and derivatives thereof.


In some instances, administration of one or more anthracyclines; cyclophosphamide; one or more taxanes; methotrexate; 5-fluorouracil; one or more anthracyclines and cyclophosphamide; one or more anthracyclines, cyclophosphamide, and one or more taxanes; cyclophosphamide, methotrexate and 5-fluorouracil, or any combination thereof is recommended.


SAILOR Inhibitors


In a further aspect, the invention provides SAILOR inhibitors to decrease SAILOR levels. Such inhibitors can be used, e.g., therapeutically. SAILOR inhibitors include, but are not limited to, antisense oligonucleotides, inhibitory RNA molecules, such as miRNAs, siRNAs, piRNAs, and snRNAs, ribozymes, and small molecule inhibitors. Various types of inhibitors for inhibiting nucleic acid function are well known in the art.


The terms “microRNA,” “miRNA,” and “miR” are interchangeable and refer to endogenous or artificial non-coding RNAs that are capable of regulating gene expression. It is believed that miRNAs function via RNA interference.


In the context of this invention, the terms “siRNA” and “short interfering RNA” are interchangeable and refer to single-stranded or double-stranded RNA molecules that are capable of inducing RNA interference. siRNA molecules typically have a duplex region that is between 18 and 30 base pairs in length.


The terms “snRNA” and “small nuclear RNA” are interchangeable and refer to a class of small RNAs involved in a variety of processes including RNA splicing and regulation of transcription factors. The subclass of small nucleolar RNAs (snoRNAs) is also included. The term is also intended to include artificial snRNAs, such as antisense derivatives of snRNAs comprising antisense sequences directed against SAILOR.


The term “hairpin” and “stem-loop” can be used interchangeably and refer to stem-loop structures. The stem results from two sequences of nucleic acid or modified nucleic acid annealing together to generate a duplex. The loop lies between the two strands comprising the stem.


Inhibitors can be single stranded or double stranded polynucleotides and may contain one or more chemical modifications, such as, but not limited to, locked nucleic acids, peptide nucleic acids, sugar modifications, such as 2′-O-alkyl (e.g., 2′-O-methyl, 2′-O-methoxyethyl), 2′-fluoro, and 4′-thio modifications, and backbone modifications, such as one or more phosphorothioate, morpholino, or phosphonocarboxylate linkages. In addition, inhibitory RNA molecules may have a “tail” covalently attached to their 3′- and/or 5′-end, which may be used to stabilize the RNA inhibitory molecule or enhance cellular uptake. Such tails include, but are not limited to, intercalating groups, various kinds of reporter groups, and lipophilic groups attached to the 3′ or 5′ ends of the RNA molecules. In certain embodiments, the RNA inhibitory molecule is conjugated to cholesterol or acridine. See, for example, the following for descriptions of syntheses of 3′-cholesterol or 3′-acridine modified oligonucleotides: Gamper, H. B., Reed, M. W., Cox, T., Virosco, J. S., Adams, A. D., Gall, A., Scholler, J. K., and Meyer, R. B. (1993) Facile Preparation and Exonuclease Stability of 3′-Modified Oligodeoxynucleotides. Nucleic Acids Res. 21:145-150; and Reed, M. W., Adams, A. D., Nelson, J. S., and Meyer, R. B., Jr. (1991) Acridine and Cholesterol-Derivatized Solid Supports for Improved Synthesis of 3′-Modified Oligonucleotides. Bioconjugate Chem. 2:217-225; herein incorporated by reference in their entireties. Additional lipophilic moieties that can be used include, but are not limited to, oleyl, retinyl, and cholesteryl residues, cholic acid, adamantane acetic acid, 1-pyrene butyric acid, dihydrotestosterone, 1,3-Bis-O(hexadecyl)glycerol, geranyloxyhexyl group, hexadecylglycerol, borneol, menthol, 1,3-propanediol, heptadecyl group, palmitic acid, myristic acid, O3-(oleoyl)lithocholic acid, O3-(oleoyl)cholenic acid, dimethoxytrityl, or phenoxazine.
Additional compounds, and methods of use, are set out in US Patent Publication Nos. 2010/0076056, 2009/0247608 and 2009/0131360; herein incorporated by reference in their entireties.


In one embodiment, a SAILOR inhibitor for use in this invention is an antisense oligonucleotide. An antisense oligonucleotide may comprise ribonucleotides or deoxyribonucleotides. Typically antisense oligonucleotides have at least one chemical modification, e.g., as described in the preceding paragraph. In some embodiments, suitable antisense oligonucleotides are 2′-O-methoxyethyl “gapmers” containing 2′-O-methoxyethyl-modified ribonucleotides on both 5′ and 3′ ends with at least ten deoxyribonucleotides in the center. Such “gapmers” trigger RNase H-dependent degradation mechanisms of RNA targets.


Other modifications of antisense oligonucleotides to enhance stability and improve efficacy, such as those described in U.S. Pat. No. 6,838,283, are known in the art and are suitable for use in the methods of the invention. Antisense oligonucleotides comprise a sequence that is at least partially complementary to a SAILOR target sequence, e.g., at least about 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% complementary to the SAILOR target sequence. In some embodiments, an antisense oligonucleotide is at least about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% complementary to a target SAILOR polynucleotide sequence present in SEQ ID NO:2 or SEQ ID NO:3 that is selective for SAILOR. In some embodiments, an antisense oligonucleotide is at least about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% complementary to a target SAILOR polynucleotide sequence present in SEQ ID NO:4 that is selective for SAILOR.
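One way to compute percent complementarity between an antisense oligonucleotide and a target region of equal length is sketched below. This is an illustrative assumption rather than a prescribed method: only Watson-Crick pairs are counted (G-U wobble pairs are scored as mismatches), no gaps are allowed, and the function name is hypothetical. The sequences used with it should be supplied by the user; none are taken from the Sequence Listing.

```python
# Watson-Crick partner of each base, reported in the RNA alphabet.
# "T" is accepted so that DNA antisense oligonucleotides can be scored.
_PARTNER = {"A": "U", "U": "A", "T": "A", "G": "C", "C": "G"}


def percent_complementarity(antisense: str, target: str) -> float:
    """Percent of positions at which `antisense` (5'->3') pairs with
    `target` (5'->3') when the two strands are aligned antiparallel.
    Both sequences must have the same length (ungapped comparison)."""
    antisense = antisense.upper()
    target = target.upper().replace("T", "U")
    if not antisense or len(antisense) != len(target):
        raise ValueError("sequences must be non-empty and of equal length")
    # The strand perfectly complementary to the antisense oligo is its
    # reverse complement; compare that position by position with the target.
    ideal_target = "".join(_PARTNER[base] for base in reversed(antisense))
    matches = sum(a == b for a, b in zip(ideal_target, target))
    return 100.0 * matches / len(target)
```

Under these assumptions, a 10-nucleotide oligonucleotide with a single mismatch scores 90%, and a 20-nucleotide oligonucleotide with a single mismatch scores 95%.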


In some embodiments, a SAILOR inhibitor is an inhibitory RNA molecule such as a miRNA, a siRNA, a piRNA, or a snRNA that has a single-stranded or double-stranded region that is at least partially complementary to the SAILOR target sequence, e.g., about 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% complementary to the SAILOR target sequence. In some embodiments, the inhibitory RNA comprises a sequence that is at least about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% complementary to a SAILOR target sequence, e.g., any sequence present in SEQ ID NO:2 that is selective for SAILOR. In certain embodiments, the inhibitory RNA molecule may be a double-stranded, small interfering RNA or a short hairpin RNA molecule (shRNA) comprising a stem-loop structure.


In some embodiments, an shRNA or other inhibitory RNA targets the mature SAILOR transcript, e.g., SEQ ID NO:2 or SEQ ID NO:3.


A SAILOR inhibitor that decreases SAILOR transcript levels (e.g., microRNA, siRNA, piRNA, snRNA, antisense oligonucleotide, ribozyme, or small molecule inhibitor) often reduces the amount and/or activity of SAILOR by at least about 10% to about 100%, 20% to about 100%, 30% to about 100%, 40% to about 100%, 50% to about 100%, 60% to about 100%, 70% to about 100%, 10% to about 90%, 20% to about 85%, 40% to about 84%, 60% to about 90%, including any percent within these ranges, such as but not limited to 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, and 99%.
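The percent reduction referred to above can be expressed relative to an untreated control, as in the following sketch. The function name and the choice of an untreated (or vehicle-treated) control as the baseline are assumptions made for illustration.

```python
def percent_reduction(treated_level: float, control_level: float) -> float:
    """Percent decrease in SAILOR level in inhibitor-treated cells
    relative to control cells. Both levels are assumed to be
    normalized in the same way (e.g., to the same internal control)."""
    if control_level <= 0:
        raise ValueError("control_level must be positive")
    return 100.0 * (1.0 - treated_level / control_level)
```

For example, a treated level that is one quarter of the control level corresponds to a 75% reduction.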


In some embodiments, a SAILOR inhibitor is administered to a subject as a nucleic acid construct, for example using a plasmid-based delivery system or a viral delivery system. Numerous vectors are known in the art including, but not limited to linear polynucleotides, polynucleotides associated with ionic or amphiphilic compounds, plasmids, and viruses. Viral delivery systems include adenovirus vectors, adeno-associated viral vectors, herpes simplex viral vectors, retroviral vectors, pox viral vectors, lentiviral vectors, alphavirus vectors, poliovirus vectors, and other positive and negative stranded RNA viruses, viroids, and virusoids, or portions thereof. Methods of constructing and using such vectors are well known in the art.


Any of the conventional vectors used for expression in eukaryotic cells may be used for directly introducing nucleic acids into tissue. Expression vectors containing regulatory elements from eukaryotic viruses are often used in eukaryotic expression vectors. Such regulatory elements include, e.g., human CMV, simian CMV, viral LTRs, and the like. Typical vectors may comprise, e.g., termination sequences and other sequences for providing the desired expression activity of the vector, as well as other elements such as an antibiotic resistance gene for selective growth in bacteria.


In certain embodiments, gene transfer may be performed under ex vivo conditions. Ex vivo gene therapy refers to the isolation of cells from an animal, the delivery of a nucleic acid into the cells in vitro, and then the return of the modified cells back into the subject.


Administering a nucleic acid, such as a microRNA, siRNA, piRNA, snRNA, or antisense nucleic acid inhibitor of SAILOR to cells can be performed using any known technique. For example, in some embodiments, a composition comprising one or more nucleic acid molecules (within or without vectors) can comprise a delivery vehicle, including liposomes, for administration to a subject, carriers and diluents and their salts, and/or can be present in pharmaceutically acceptable formulations. Methods for the delivery of nucleic acid molecules are described, for example, in Gilmore, et al., Curr Drug Delivery (2006) 3:147-5 and Patil, et al., AAPS Journal (2005) 7:E61-E77. Delivery of siRNA molecules is also described in several U.S. Patent Publications, including for example, 2006/0019912; 2006/0014289; 2005/0239687; 2005/0222064; and 2004/0204377, the disclosures of each of which are hereby incorporated herein by reference. Nucleic acid molecules can be administered to cells by a variety of methods known to those of skill in the art, including, but not restricted to, encapsulation in liposomes, by iontophoresis, by electroporation, or by incorporation into other vehicles, including biodegradable polymers, hydrogels, cyclodextrins (see, for example Gonzalez et al., 1999, Bioconjugate Chem., 10, 1068-1074; Wang et al., International PCT publication Nos. WO 03/47518 and WO 03/46185), poly(lactic-co-glycolic)acid (PLGA) and PLCA microspheres (see for example U.S. Pat. No. 6,447,796 and US Patent Application Publication No. 2002/130430), biodegradable nanocapsules, and bioadhesive microspheres, or by proteinaceous vectors (O'Hare and Normand, International PCT Publication No. WO 00/53722). 
In another embodiment, the nucleic acid molecules of the invention can also be formulated or complexed with polyethyleneimine and derivatives thereof, such as polyethyleneimine-polyethyleneglycol-N-acetylgalactosamine (PEI-PEG-GAL) or polyethyleneimine-polyethyleneglycol-tri-N-acetylgalactosamine (PEI-PEG-triGAL) derivatives.


In some embodiments, a SAILOR inhibitor is targeted to cancer cells. Suitable cancer cell-specific targeting approaches include a lentivirus-mediated Tet-On inducible system under the control of the matrix metalloproteinase-2 promoter, and a dual promoter system that combines the human telomerase reverse transcriptase promoter (hTERT) and a tissue specific promoter to target expression to cancer cells. Other cancer cell-specific targeting approaches utilizing the hTERT tumor-specific promoter are also suitable for use in the present invention.


Administration may be systemic or via direct or local administration to a tumor site. By way of example, suitable modes of systemic administration include, without limitation, administration orally, topically, transdermally, parenterally, intradermally, intramuscularly, intraperitoneally, intravenously, subcutaneously, by intranasal instillation, by intracavitary or intravesical instillation, intraocularly, intraarterially, intralesionally, or by application to mucous membranes. Suitable modes of local administration include, without limitation, catheterization, implantation, direct injection, dermal/transdermal application, or portal vein administration to relevant tissues, or any other local administration technique, method or procedure generally known in the art.


Screening for SAILOR Inhibitors


In a further aspect, the invention provides a method of screening for SAILOR inhibitors. A regulatory element, used either individually for SATB1 or SAILOR expression or simultaneously for expression of both genes within the CpG island region (see SEQ ID NO:1), can be employed in a small chemical inhibitor-screening assay to identify compounds that abrogate expression of both molecules.


In one embodiment, an illustrative screening is as follows. The entire SATB1 coding region is replaced with a reporter Green Fluorescent Protein (GFP) gene fused with the human SATB1 nuclear localization signal (NLS, amino acids 20-40) (Nakayama et al., Cell Cycle 4:1099-1106, 2005) or any other established NLS, and the entire target SAILOR lncRNA is replaced with a red fluorescent protein reporter fused with the coding sequence for an NLS. Based on this design, these gene features are cloned into an expression cassette in which the two genes are transcribed from the promoters of the CpG island in opposite directions. This expression cassette is transduced into a human aggressive breast cancer cell line in which the expression levels of SATB1 and SAILOR lncRNA are known to be high. Dual expression in nuclei of both fluorescent-labeled reporter genes from the expression cassette makes it possible to directly visualize and compare the expression levels of each transcript under various experimental conditions. For example, one assay screens anti-cancer compounds to determine to what extent a small chemical can ablate or vastly diminish one or both colors in transduced cells.
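The two-color readout described above can be reduced to a per-compound label. The following Python sketch is illustrative only: the names `ReporterReading` and `classify_compound`, the normalization to a vehicle-treated control, and the 0.2 cutoff are assumptions made for this example and are not taken from the specification.

```python
from dataclasses import dataclass


@dataclass
class ReporterReading:
    """Mean nuclear fluorescence for one well (arbitrary units)."""
    green: float  # GFP reporter standing in for the SATB1 coding region
    red: float    # red fluorescent reporter standing in for SAILOR


def classify_compound(treated: ReporterReading,
                      vehicle: ReporterReading,
                      threshold: float = 0.2) -> str:
    """Label a compound by which reporter signal it diminishes.

    A reporter is counted as diminished when its signal is at or below
    `threshold` (a hypothetical cutoff) times the vehicle-control signal.
    """
    if vehicle.green <= 0 or vehicle.red <= 0:
        raise ValueError("vehicle control signals must be positive")
    green_down = treated.green / vehicle.green <= threshold
    red_down = treated.red / vehicle.red <= threshold
    if green_down and red_down:
        return "both"
    if green_down:
        return "SATB1 reporter only"
    if red_down:
        return "SAILOR reporter only"
    return "neither"
```

In practice the cutoff would be chosen from control wells on each plate rather than fixed in advance.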


In another embodiment, an expression construct contains the regulatory elements (i.e. promoters) within the CpG island that direct the transcription of both SATB1 and the adjacent lncRNA. In some embodiments, the expression construct comprises a vector, a reporter gene, and a gene, cDNA or nucleotide sequence that expresses SATB1.


The expression vectors usable for screening include pUC vectors (for example pUC118, pUC119), pBR vectors (for example pBR322), pBI vectors (for example pBI112, pBI221), pGA vectors (for example pGA492, pGAH), and pNC (manufactured by Nissan Chemical Industries, Ltd.). In addition, virus vectors including but not limited to lentiviral, adenoviral, retroviral or Sendai viral vectors can also be used.


The expression system usable in the screening methods described herein includes any system utilizing RNA or DNA sequences. It can be used to transiently or stably transform the selected host. It includes any plasmid vectors, such as pUC, pBR, pBI, pGA, and pNC derived vectors (for example pUC118, pBR322, pBI221 and pGAH). It also includes any viral DNA or RNA fragments derived from viruses such as phage and retrovirus (TRBO, pEYK, LSNLsrc). Genes or nucleic acid sequences presented in the invention can be expressed by direct translation in the case of an RNA viral expression system, or transcribed after in vivo recombination downstream of a promoter recognized by the host expression system (such as pLac, pVGB, pBAD, pPMA1, pGa14, pHXT7, pMet26, pCaMV-35S, pCMV, pSV40, pEM-7, pNos, pUBQ10, pDET3, or pRBCS) or downstream of a promoter present in the expression system (vector or linear DNA). Promoters can be of synthetic, viral, prokaryotic or eukaryotic origin.


The expression cassette may include 5′ and 3′ regulatory sequences operably linked, for example, to the reporter gene, the lncRNA or the SATB1 gene. “Operably linked” is intended to mean a functional linkage between two or more elements. For example, an operable linkage between a gene and a regulatory sequence (i.e. a promoter) is a functional link that allows for expression of the gene. Operably linked elements may be contiguous or non-contiguous. When used to refer to the joining of two protein coding regions, “operably linked” means that the coding regions are in the same reading frame. The cassette may additionally contain at least one additional gene to be co-transfected into the organism. Alternatively, the additional gene(s) can be provided on multiple expression cassettes. Such an expression cassette is provided with a plurality of restriction sites and/or recombination sites for insertion of the gene sequence. The expression cassette may additionally contain selectable marker genes or a reporter gene to be under the transcriptional regulation of the regulatory regions.


The expression cassette may include, in either direction of transcription, a transcriptional initiation region (i.e. a promoter), a translational initiation region, a polynucleotide of the invention, a translational termination region, and optionally, a transcriptional termination region functional in the host organism. The regulatory regions (e.g. promoters, transcriptional regulatory regions, and translational termination regions) and/or the polynucleotide of the invention may be native/analogous to the host cell or to each other. Alternatively, the regulatory regions and/or the gene may be heterologous to the host cell or to each other. As used herein, “heterologous” in reference to a sequence is a sequence that originates from a foreign species, or, if from the same species, is substantially modified from its native form in composition and/or genomic locus by deliberate human intervention. For example, a promoter operably linked to a heterologous polynucleotide is from a species different from the species from which the polynucleotide was derived, or if from the same/analogous species, one or both are substantially modified from their original form and/or genomic locus, or the promoter is not the native promoter for the operably linked polynucleotide.


Where appropriate, polynucleotides may be optimized for increased expression in the transformed organism. For example, polynucleotides can be synthesized using preferred codons for improved expression.


Additional sequence modifications are known to enhance gene expression in a cellular host. These include elimination of sequences encoding spurious polyadenylation signals, exon-intron splice site signals, transposon-like repeats, and other such well-characterized sequences that may be deleterious to gene expression. The G-C content of the sequence may be adjusted to levels average for a given cellular host, as calculated by reference to known genes expressed in the host cell. When possible, the sequence is modified to avoid predicted hairpin secondary mRNA structures.
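As a rough illustration of the G-C content adjustment mentioned above, the G-C fraction of a candidate sequence can be computed and compared with an average for the host. This is a minimal sketch: the function names and the idea of reporting a signed deviation are assumptions for this example, and any host average passed in would have to come from known genes expressed in the host cell.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a DNA or RNA sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


def gc_deviation(seq: str, host_average: float) -> float:
    """Signed difference between a sequence's G-C fraction and a host average.

    A positive value means the sequence is more G-C rich than the host
    average supplied by the caller (a hypothetical input here).
    """
    if not 0.0 <= host_average <= 1.0:
        raise ValueError("host_average must be between 0 and 1")
    return gc_fraction(seq) - host_average
```

A sequence with a large deviation in either direction would be a candidate for synonymous changes that move its composition toward the host average.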


The expression cassette can also comprise a selectable marker gene for the selection of transformed or modulated cells. Selectable marker genes are utilized for the selection of transformed or differentiated cells or tissues. Marker genes include genes encoding antibiotic resistance, such as those encoding neomycin phosphotransferase II (NEO) and hygromycin phosphotransferase (HPT). Additional selectable markers include phenotypic markers such as β-galactosidase and fluorescent proteins such as green fluorescent protein (GFP) (Su et al. (2004) Biotechnol Bioeng 85:610-9 and Fetter et al. (2004) Plant Cell 16:215-28), cyan fluorescent protein (CYP) (Bolte et al. (2004) J. Cell Science 117:943-54 and Kato et al. (2002) Plant Physiol 129:913-42), yellow fluorescent protein (PhiYFP from Evrogen, see, Bolte et al. (2004) J. Cell Science 117:943-54), and m-Cherry (Shaner et al., Nature Biotechnology 22: 1567-72). The above list of selectable marker genes is not meant to be limiting. Any selectable marker gene can be used in the present embodiments.


In one embodiment, an expression cassette comprises the nucleotide sequence operably linked to a promoter that drives expression of a selective agent, signal peptide or label in the host organism, and the expression cassette further comprises an operably linked polynucleotide encoding a selective agent, signal peptide or reporter. In other embodiments, the construct used herein includes an inducible reporter gene, such as mCherry, GFP, YFP, and the like.


Kits


For use in diagnostic applications, prognostic applications, and therapy selection applications described above, kits are also disclosed herein. The kits of the invention may comprise any or all of the reagents to perform the methods described herein. In such applications the kits may include any or all of the following: assay reagents, buffers, nucleic acids that bind to at least one of the genes described herein, hybridization probes and/or primers, that specifically bind to SAILOR. In addition, the kit may include reagents such as nucleic acids, hybridization probes, or primers, that specifically bind to SATB1 exon 1d variant transcripts.


The term “kit” as used herein in the context of detection reagents is intended to refer to such things as combinations of multiple gene transcript product detection reagents, or one or more gene transcript product detection reagents in combination with one or more other types of elements or components (e.g., other types of biochemical reagents, containers, packages such as packaging intended for commercial sale, substrates to which gene expression detection product reagents are attached, electronic hardware components, etc.).


EXAMPLES

The following examples are offered to illustrate, but not to limit, the claimed invention.


Example 1. Identification of SAILOR Expression in Breast Cancer

SATB1 is transcribed from multiple gene promoters, which generate multiple SATB1 exon 1 variants. The ATG start codon for SATB1 translation is located in exon 2, and therefore any sequence within exon 1 from any variant will not be translated.


The distal CpG island upstream relative to SATB1 exon 2 (˜20 kb from exon 2, chr3:18,485,113-18,487,056; Assembly hg19; SEQ ID NO:1) contains the far distal promoter for SATB1 (exon 1d variant). The adjacent 85.8 kb lncRNA gene encodes SAILOR, which is a lncRNA that spans 1.8 kb (SEQ ID NO:2), and is transcribed in an opposite direction from the same CpG island region, either from the same far distal (1d) promoter, or from a distinct promoter that is adjacent to the 1d promoter. FIG. 1 shows a map of the SATB1 gene locus.
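The size of the CpG island region follows directly from the coordinates given above. The short Python sketch below parses a region string and computes its length. It assumes the coordinates are 1-based with both ends included, as genome browsers commonly display them; the function names are hypothetical.

```python
import re
from typing import Tuple


def parse_region(region: str) -> Tuple[str, int, int]:
    """Parse a string such as 'chr3:18,485,113-18,487,056'."""
    match = re.fullmatch(r"\s*(\w+)\s*:\s*([\d,]+)\s*-\s*([\d,]+)\s*", region)
    if match is None:
        raise ValueError(f"unrecognized region: {region!r}")
    chrom, start_text, end_text = match.groups()
    start = int(start_text.replace(",", ""))
    end = int(end_text.replace(",", ""))
    if end < start:
        raise ValueError("region end precedes region start")
    return chrom, start, end


def region_length(region: str) -> int:
    """Length in bp, assuming 1-based coordinates with both ends included."""
    _, start, end = parse_region(region)
    return end - start + 1


print(region_length("chr3:18,485,113-18,487,056"))  # 1944
```

Under a 0-based, half-open convention the same numbers would describe a region one base shorter, so the convention should be confirmed before coordinates are compared across tools.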


Quantitative reverse transcription (RT-) PCR based analyses of SAILOR and SATB1 transcript levels in both aggressive and non-aggressive human breast cancer cell lines have shown that SATB1 and SAILOR are each transcribed specifically in aggressive breast cancer cell lines MDA-MB-231 and BT549 (FIG. 2). The expression of transgenic SATB1 exon variant 1a in the non-aggressive SK-BR-3 and non-tumorigenic immortalized breast epithelial line MCF10A-1 was found by RT-PCR analyses to result in the activation of endogenous SATB1 exon 1d variant and SAILOR expression. The SATB1 protein therefore appears to activate the concerted transcription of SATB1 exon 1d and SAILOR from the aforementioned CpG island promoter(s). In support of this finding, SAILOR has been shown to contain long introns highly enriched in “base-unpairing regions” (BURs). BURs represent specialized genomic sequences of 200-300 bp that have a distinctive phosphate-backbone structure that is recognized by SATB1 (Bode et al., Science, 255:195-197, 1992; Dickson et al. Cell, 70:631-645, 1992). SATB1 acts to remodel the 3D organization of chromatin and regulate gene expression by binding at BURs located at target genes and tethering bound BURs to the scaffold-like matrix SATB1 forms within cell nuclei.


Furthermore, RT-PCR analysis of pre-spliced and mature SAILOR transcript levels has shown that SAILOR is predominantly localized within the cell nucleus as opposed to the cell cytoplasm (FIG. 3). RT-PCR analyses were performed using specific primers spanning mature, spliced SAILOR exon 2 and exon 3 (5′-GAGACTTCAGGTCAGGAAAGC-3′ (SEQ ID NO:6) and 5′-CTGGGCTGTGAAATTGATACC-3′ (SEQ ID NO:7)), and primers designed specifically for SATB1 exon 1d and exon 2 (5′-GGAGCCGTTCTTGGTTTCA-3′ (SEQ ID NO:8) and 5′-TTAGACATTTCTGAATGTTC-3′ (SEQ ID NO:9)). A second primer set for SATB1 exon 1d and exon 2 was also used (5′-GAGACTTCAGGTCAGGAAAGC-3′ (SEQ ID NO:10) and 5′-CTGGGCTGTGAAATTGATACC-3′ (SEQ ID NO:11)), as was a pre-spliced SAILOR primer set for RT-PCR (5′-GAAGCCGCACTTTCTTGAAT-3′ (SEQ ID NO:12) and 5′-AATCTCCCTCCTGCTTCCAT-3′ (SEQ ID NO:13)).
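One way to check a primer pair against a transcript is to locate the forward primer and the reverse complement of the reverse primer in the sequence and measure the span between them. The Python sketch below does this for the mature SAILOR primers (SEQ ID NO:6 and SEQ ID NO:7) against a 150-nucleotide excerpt copied from the SEQ ID NO:2 listing. The function names are hypothetical, and exact string matching is a simplification: it ignores mismatches, melting temperature, and off-target sites.

```python
from typing import Optional


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(str.maketrans("ACGTacgt", "TGCAtgca"))[::-1]


def amplicon_length(template: str, forward: str, reverse: str) -> Optional[int]:
    """Predicted product size, or None if either primer lacks an exact match."""
    template = template.upper()
    fwd_start = template.find(forward.upper())
    if fwd_start < 0:
        return None
    # The reverse primer anneals to the opposite strand, so search for its
    # reverse complement downstream of the forward primer site.
    rev_site = reverse_complement(reverse.upper())
    rev_start = template.find(rev_site, fwd_start)
    if rev_start < 0:
        return None
    return rev_start + len(rev_site) - fwd_start


# Three consecutive 50-nt lines from the SEQ ID NO:2 listing.
SAILOR_EXCERPT = (
    "cagaacaaaaataaataaataaaagaatcttccagaaattcatgaagaga"
    "cttcaggtcaggaaagcctgaactttctcatccaacacccgttcaagtcg"
    "gaggatgctgattaattcatgatggaaaggtatcaatttcacagcccaga"
)

print(amplicon_length(SAILOR_EXCERPT,
                      "GAGACTTCAGGTCAGGAAAGC",    # SEQ ID NO:6
                      "CTGGGCTGTGAAATTGATACC"))   # SEQ ID NO:7
```

Running the same search against unspliced sequence would be expected to give a longer product, or none, whenever an intron lies between the two primer sites.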


Co-expression of SATB1 and SAILOR in nuclei of aggressive breast cancer cells suggests that these factors might physically associate. An RNA immunoprecipitation assay (RIP) in BT549 cells indicated that SAILOR is greatly enriched for association with SATB1 (11-fold) relative to coding mRNA Axl that is abundant in these cells, and compared to the IgG and FoxP2 antibody controls that did not precipitate any RNA (FIG. 4). This suggests SAILOR and SATB1 share a common function.


The effect of SAILOR on the growth rate of BT549 cells was studied. SAILOR expression was reduced by ˜80% in BT549 cell lines with a short-hairpin RNA (shRNA), 5′-GGTGGAAGAGTAAACTGTTCCCGAAGGAACAGTTTACTCTTCCACC-3′ (SEQ ID NO:14), which targets SAILOR exon 4 at sequence GGTGGAAGAGTAAACTGTTCC (SEQ ID NO:15) (FIG. 5). Reduced levels of SAILOR did not affect SATB1 transcription or protein level. However, SAILOR knockdown cells show a slowed growth rate, suggesting that the aggressive cancer properties of BT549 cells are attenuated when SAILOR is reduced.
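The shRNA above can be read as a stem-loop: the 21-nucleotide target sequence, a short loop, and the reverse complement of the target. The following Python sketch splits a hairpin sequence into those three parts. The function names are hypothetical, and the sense-loop-antisense layout is an interpretation of the two sequences given in the text rather than a stated design rule.

```python
from typing import Optional, Tuple


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(str.maketrans("ACGTacgt", "TGCAtgca"))[::-1]


def split_hairpin(shrna: str, target: str) -> Optional[Tuple[str, str, str]]:
    """Split an shRNA into (sense, loop, antisense).

    Returns None unless the shRNA has the form
    target + loop + reverse_complement(target).
    """
    shrna, target = shrna.upper(), target.upper()
    antisense = reverse_complement(target)
    if len(shrna) < len(target) + len(antisense):
        return None
    if not (shrna.startswith(target) and shrna.endswith(antisense)):
        return None
    loop = shrna[len(target):len(shrna) - len(antisense)]
    return target, loop, antisense


sense, loop, antisense = split_hairpin(
    "GGTGGAAGAGTAAACTGTTCCCGAAGGAACAGTTTACTCTTCCACC",  # SEQ ID NO:14
    "GGTGGAAGAGTAAACTGTTCC",                           # SEQ ID NO:15
)
print(sense, loop, antisense)
```

For SEQ ID NO:14 this yields a four-nucleotide loop (CGAA) between two fully complementary 21-nucleotide arms.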


The role of SAILOR in transcription of some genes known as SATB1 targets in breast cancer was also evaluated. Transcript levels of Claudin1 (CLDN1; repressed by SATB1) and fibronectin 1 (FN1; activated by SATB1) were altered in the same direction when SAILOR was knocked down (FIG. 6). Therefore, based on the small but important number of genes found to be SAILOR-dependent (˜10% of 85 genes in a Cancer Pathway Superarray RT-PCR assay), SAILOR seems to affect gene expression toward cancer progression.


High transcript expression levels of SATB1 and SAILOR were not restricted to breast cancer cells. High expression of these factors also occurs in pancreatic cancer, prostate cancer, and colon cancer cell lines (FIG. 7). Importantly, within each of these cancer types, expression was found to be higher in the aggressive cancer form (e.g., higher in HT29 cells from colon) relative to the non-aggressive cancer form. These data indicate that the link between aggressive cancer phenotypes and elevated SAILOR lncRNA expression is not limited to breast cancer.


When the non-aggressive breast cancer line MCF10A was exposed to chemotherapeutic drugs, surviving cells showed a greatly up-regulated expression of SATB1 relative to the initial cancer cell line population (FIG. 8A). Furthermore, the forced over-expression of SATB1 resulted in an increased resistance to a wide range of cancer drugs, indicated by the need for a higher dose to achieve 50% cell death in culture (FIG. 8B). These data indicate that cells that evade typical treatments are those that already express SATB1 at high levels or that up-regulate SATB1, and in doing so evade cancer cell death. In either case, these findings indicate that detection of elevated SATB1, and by association SAILOR, in a patient during treatment may faithfully identify a cancer that has acquired resistance to the treatment.


Exosomes from cancer cells were also evaluated. SATB1 and SAILOR transcripts are quite abundant in exosomes from cancer cells when purified either using a commercial kit or centrifugation. The level of SATB1 and SAILOR transcripts detected in exosomes was proportional to aggressiveness of the cancer cells (FIG. 9).


These data suggest that SAILOR is an important factor for the metastatic potential of tumor cells and that SAILOR and SATB1 are co-expressed specifically in aggressive tumor cells in human primary tumor specimens. High levels of SAILOR and SATB1 exon 1d variant transcripts will predict metastasis and correlate with shorter survival times. SAILOR transcript levels alone can serve as a prognostic marker. Determination of RNA levels for both SAILOR and the SATB1 exon 1d variant can also be used.


The present examples, methods, procedures, specific compounds, and molecules are meant to exemplify and illustrate the invention, and should in no way be seen as limiting the scope of the invention. Any patents, publications, and publicly available sequences mentioned in this specification are indicative of levels of those skilled in the art to which the invention pertains, and are hereby incorporated by reference to the same extent as if each was specifically and individually incorporated by reference in the context in the application for which they are cited.












Illustrative Sequences - All sequences are shown 5′ to 3′















SEQ ID NO: 1


CpG island


>hg19_dna range = chr3: 18485113-18487056


CGTTTCCCCAGTAAGCACGTGGCACTCCCCGGACCTGCCACCTGCCTGCT


TCGTCCTTCTCGTCGTGGTTTCCCAAACCCCGGTTCTGCCGGCCCGGGAG


CCTTAGCACTGGAGCAATAGGAAAAGGCCACCGCGCTCGGGTCTGGACAG


CAGGAGGGAAACACGGTGTGGACTGCGAGGCTGCACCTGTGATGTCCCGG


CCCCTGCTAAGAGGACGGCCCTTTCTTCTGCCTCTTGCCCAACTCCAAAC


CCACATTCACGCCAGCAGCCTCTCCAGGACCGGCCTCGCTACAGCCAGCG


AGGGCTCGAAATGAGGAGTGCCGCGGCTTTCAAACTCCGGGCTCCAACTT


GAGCGCCCCGGCGCCCGAGTAGCTCCCGGGATGCAGAAGTTGCCACAAAC


TTCCCAGGCCCCTCTTCGCCGATGCTTACAATCAGCCGCGCAGGCAGGGA


GCGGAGGGAGGCGGAGATGGACCGGGAAAGGATGCTGAGCAGACTCGCGA


TCCGGTGGGGGAACATTACCACTCCCGCAGCCCACTCCTCCAGGCACCTT


ACTGCCCGCCCGGCTCCAGAACGCACCGAGAGGCTCCCCTTTTCCCCATT


TGCTTCCTTCGGTCTTTTCCACTCCCCTTTCCTTTTCTAAAAGGGGCCAT


ACCGGTGACCTGAAGGAGTTTGTTCAGCCAGGGTCTATTGGGCAGGTGTG


GTGGTGTGTCCACACCCAGACAGAAAACGAATGGCATCTTCAAATCCCCC


ATCCCGACCGCTCTCCCCTACTCTACCAGCCCACCCCTCCAAGGTCCGTC


TGCGTGAGAAAAGGGGCTCGGAAGACCGTTGAAGCCCTGCGCCCACGAGA


GGGGAGCCCAGCCGCCCCAATAGGGGACGAGGAGTGGGTGCTACGGAGAA


GTTTGGATTGATTCCGGAAAAAGAGGGACAGAGATAAAACAGCAAGAGTA


GCAAGGGGAAAAGGGAGGCAAAAGAGCAGAACTCACTCAGGCATGGACGT


TGGGGGCGGCGGTGGCTGTCGAGTGCGGGCCTGAAACCAAGAACGGCTCC


CCGGGCGGGCGCGCCGGCGTCGGACTTCCGAGGCGGCGGCTTCTGCCTCT


CCTGCCGCCGCCGCCGCCGCCGGAGCTGCGGCTGCCGCGGAAGTTAATTG


CAACTTGACTTCAAGTTGTCCTCTTTCCCCATACGAAGTGGGCGTTTAAA


GGGGAGAGCGAGGCGAGGAGCGAGCGAGCGAGCGCGCGGGGCCAAGGGAA


GGAAGAGAAGGAGGGGGAGGGAGGAGATGTTAACGGGCGGGGGGGGGAGA


AGGGGGAGGGGGCGGCGGCGGGGGCGGGAGGGGGAAGGGGCCGGCGGGAG


CTGCTCTCGTCTCGTCGGTCGCGGCGCCTGCAGTCTGGAGGCGCACCGGA


GCGGCCGGGGCGTCCCCCGCGGGAGCCCGCAGCCACCCGGGACGCGCATC


CAGACGTGGCGCTTCGGACCGGGCACGCTGCGCCCGGGGGCTCGGCGGAC


CCCGCGTAGCCGCCGCTTCGGAGCTTGTGCGGCGCGGGCTGGCCAGCGGG


GCGGCCAGGGCCCGGCCCGCCTCCCCAGCGCCCGCCCGGCTTCTCCCCCT


GGCGGTGGGAGCCTCGGCGGCCGCTGGCGACACTAGGCGCACTGAAGCCC


GAGCCGAGCCGAGCCCGAGCCGCCGCCGCCGCCGCCGCTGCTGCGCACCG


CTCCCGGGCTCCCTCCCAGCGCGCCGGCCGGGGTGTGGGGGGCGGCGGGC


CGGAGGGGCGAGGGCGGGCCAGGGGGCGCACACGGGGGTTGGCGCGGAAG


ACAGGACCCTCAGCCTCGAGGGGTAAGTGTGGGCGCTTGGGGGTGCGCTT


GGGGTGCGCGGCGCGGTTCTCGTCGCCCGCCAACCCTGCCCCCTCACCTC


TCCGGGGGCCCCCAACACGCGCACTCCTCCTCTTGTCGCCTGCG





SEQ ID NO: 2


Canonical SAILOR lncRNA sequence after transcript splicing


>NR_125803.1 (SATB1-AS1)


gcggccgctggcgacactaggcgcactgaagcccgagccgagccgagccc


gagccgccgccgccgccgccgctgctgcgcaccgctcccgggctccctcc


cagcgcgccggccggggtgtggggggcggcgggccggaggggcgagggcg


ggccagggggcgcacacgggggttggcgcggaagacaggaccctcagcct


cgagggagcccctaggtgaccaggcaaaatggcagttccttccagctggt


cctcagatgggcacatctattagcctctgctcttgtaagaagttagctgc


agaacccacatgtgaatccttgtaggactctggagaagatcaaatgggag


cttagatgtggaagctctttggaaaccaagaaacactctgaaaatgaaaa


gggtggaagagtaaactgttcctgtttctccatcttgacaggaagcagaa


gttctccttagctgatttttcttcttgccttatactgggttctttaacac


cagaacaaaaataaataaataaaagaatcttccagaaattcatgaagaga


cttcaggtcaggaaagcctgaactttctcatccaacacccgttcaagtcg


gaggatgctgattaattcatgatggaaaggtatcaatttcacagcccaga


cctctacattcttgaaatgctcttctcatgggagacattaaataagcttt


gaattaactgagacctctgtggattctataactcaaagacccaggctgac


ggctggtggaatagctaccatctcaaacatggcgtgatgctgcatcagag


agaaaaataagctcttgcattgacaatgaaatgtctaccctgt





SEQ ID NO: 3


A transcript variant with an extended 5′ end and a truncated 3′ end *** Underlined sequence is the target of shRNA to reduce SAILOR levels


>TCONS_12_00019762


gccggggcgtcccccgcgggagcccgcagccacccgggacgcgcatccag


acgtggcgcttcggaccgggcacgctgcgcccgggggctcggcggacccc


gcgtagccgccgcttcggagcttgtgcggcgcgggctggccagcggggcg


gccagggcccggcccgcctccccagcgcccgcccggcttctccccctggc


ggtgggagcctcggcggccgctggcgacactaggcgcactgaagcccgag


ccgagccgagcccgagccgccgccgccgccgccgctgctgcgcaccgctc


ccgggctccctcccagcgcgccggccggggtgtggggggcggcgggccgg


aggggcgagggcgggccagggggcgcacacgggggttggcgcggaagaca


ggaccctcagcctcgagggagcccctaggtgaccaggcaaaatggcagtt


ccttccagctggtcctcagatgggcacatctattagcctctgctcttgta


agaagttagctgcagaacccacatgtgaatccttgtaggactctggagaa


gatcaaatgggagcttagatgtggaagctctttggaaaccaagaaacact


ctgaaaatgaaaagggtggaagagtaaactgttcctgtttctccatcttg


acaggaagcagaagttctccttagctgatttttcttcttgccttatactg


ggttctttaacaccagaacaaaaataaataaataaaa





SEQ ID NO: 4


Long non-coding RNA (unspliced) (identification TCONS_12_00018460, TCONS_12_00018461, SATB1-AS1)


>hg19_dna range = ch3: 1846850-18572715


GCGGCCGCTGGCGACACTAGGCGCACTGAAGCCCGAGCCGAGCCGAGCCC


GAGCCGCCGCCGCCGCCGCCGCTGCTGCGCACCGCTCCCGGGCTCCCTCC


CAGCGCGCCGGCCGGGGTGTGGGGGGCGGCGGGCCGGAGGGGCGAGGGCG


GGCCAGGGGGCGCACACGGGGGTTGGCGCGGAAGACAGGACCCTCAGCCT


CGAGGGgtaagtgtgggcgcttgggggtgcgcttggggtgcgcggcgcgg


ttctcgtcgcccgccaaccctgccccctcacctctccgggggcccccaac


acgcgcactcctcctcttgtcgcctgcggcttcctcttgttgcttgttgt


ttggctgggttttggggggtgagtagagggggttactgtagtgtgtgtgc


aggcggaggaggaagtcaggtgagaggccgcgaagcacccccaccccaac


ccagcctgcgtggggtgtatgcttccccactaggggcatttgggccattt


tttttttctccgtcaatgttcggtcgagacgatgtttcctagagggcctc


cttcacttacagtctggtctgtccatctcccttcatcccaccctcgtccg


tcttagccccttgtccaggagccctgcaccccaagtgaggagcacgtggc


ggaaggaggaggaggccctctcttgaagacccccacgctgtgacccagcc


cattagtttaaatatttatcctcacatcaccagctgtacttttcaaccca


ctggaacaagacgcctaggcaaggtccaatttccccaaaaagctgggggc


aaagcgggagagatgagggttttattgacattgggagagaaggggccaag


gaaccttcccaagtggagactgaaactcaaaaaatctctacaaaaagact


agggtgactggaggctcaggactgcaggcttaggagaaactggagctcca


tcaactttatttctctccccctgcctctcctccccccagctctgaggaag


ctaacaggtcttttgttttgtattgtgtagcctttgggaaatttgcattg


agtatgtcaacagaaaattactactgtttactcaaagggattgcagaaag


ataaaacatagtaacatagtttttacttttcctcttctttgtacgcttgt


gtgtatttggcaagaagtgtagttaggttcatggcatgaactgcaaaacg


tcaggttgtaacaataaggactgtagagctgcctaggtgtatttttagaa


gccgcactttcttgaattcctttcacacaaggcttgctctctcattaagt


ccccataaataattttatttagagaggaggaaactatggaagcaggaggg


agattgctaagatttccatcctgccagcgcacagcactgtttgcagggaa


gttcctggaaagtgattgtgggctgagaattcagtttttcctaaaagctc


tgttaggtcctgttgtgtcagatggctatgaataaagcacgctgggaaaa


tcctgatatccgtgcagtcttattttatcttttacacagtggttgattcg


actttcaagccctggtgacaaacacatcaactctgctgtatttttgtgtg


ctttttctaaataaaggagaatacagcatgtattcatgctgcttttttcc


cctcatccatggttattgaattcactggaagttattcatcttttttcagt


tgatcatcattttatgccgagaccacaattccaaacatataggaattttt


taaaattatgaattagggttcttttagggaaccaaccctcctttttaaag


atacaagtgcccaccacaactcctagttttgtaaagcaaagcaaaaaggt


ataaatgattataatgagcaaaacatagtaaaacgttttccttcttggat


cactttctgtccccacaggtcccaggagacattaaatcctcctgccttaa


tttaccaaaatgctgcatttcctaaccagtaacaatatctgtgtctgaga


aagacatcaggtttggaatggaaaaaaatacattctcggtatttaattac


tgcttaacagtaatttattgtagcaggcagttgattttccaattctgaaa


gcctgaaatgaagctgaacagaaatttgaaatttttaaaaaaggtttttt


tttctttcttacactcttatatactcatctgcttccccaagttatcagcc


aagcccctataactgaaagtatgatatgctcctctgagttgaaccgagac


cattttgcccctcattttttgaatttgctttttcctcccaatagtcttgt


atagaaagcacatgggctctgttagaatacagaattatgcaattctgggt


gcagtgcattaaatgacatagtttgataaactcttaagccttacagctca


gatgttaataaaatattgtagttggccacaaatttcatagcacaaggcta


agatcatttttcagaagaaaggggcatttaatctggattatttggttagt


tgctatggtctgaatgttagtgttcccccaatattggggttcatgtgtta


gaacctaatatcaaaagtgatggtattaagaagtagggcctttgggaagt


gattaagtcatgaaggcttcaaccctcatgagtgacaggagtgcctttgt


aagacaggctgaatctagctaccttgccccttccactgtgtgaggacaca


gcaacaaaagtgccatctgtgaagcaaggagtgagccttcaccagatact


gaatctgctggtgctgtaatcttggactcctcagcttccagaactgtgag


caataaatttctgttgcttataaattacccagtcaaagataatttgttgt


agcacctcaaatgaactaacacaataatacatttgtgctactagattttt


ttaacaagtccaaatgtgacttaaatgactgaaagacaaatctggaacat


atcaaatggccaataatagtagctggtatcaaaagctgcactggttattt


ccttaaaagaatagacaataaaatattgcctactaaatattagaagaaag


atcaagggaaaggataaaacagcattggccttgaaggaagttatttgctg


gaaaaaagacatcaaaatgtgaaaagacaatggatcagttagtggtttga


atcaaagaaatacttgaggaagaaaaaatatatatatatatgtacatatt


tttctcttaagaccagaggaatcagagatcctaaagctctaggttggaag


gtatttttaaagttaatttgttctgtcaaatctttagaggccattcaatg


cagatattttgggtgtgaaggggaacttactacttaccgccattaaggca


aggatttttacctataccgtacttaccagagaatagagttcactatatat


tgtgcaagtgaatggataatttcaaaagtatttcatctttatatacttta


ttatgtttttaccttatatatatttttttaccttatattgcacagaatta


gtcttcctgaaacttccaaccgttagtcttcgtcctgccagttgcagcca


aacagaacaaatttgtttatcagtcaagataggcaatgtgctggagtaac


aattccaattttttagtggcctgtaacagcaaacgttttcttggttatgg


tttgaatccattgcagttcagctggtctgttccatgtctgtcactccagg


tccctggctaatggaacagttgccatcttgaatgtttcatgttgttatgg


caaaaataaagatcactctggagggtttcatgcagacatttaaatgctat


ggcccagaaatgccatacatcatatggcctcacaaagaattaaccaggat


tagttgtgtggctccaaagaatcatgacaagaaggtcggatagtgcagcc


tatcttgtacccaacagtagagagctgtggtacacaattctgcagactac


tgctactggtagtaatccttgtccatatgataacctgccagctaaacagt


ctcttcctgtagctttatggcccaagtcttccattttttagggcaaactg


ccagagtagccaccagcttagagggaaacattgttatcaggggaacccgc


ccccaatatttcaacatagattctttatattttccctaagtgtcggccag


tctgagcaataaagagaaagaatacaaagagaagaattttacagctgggc


cgctggggatgacatcacatatcagtagatccgtgatgcccacctgagct


gcaaaacctgcaagtttttattagggatttcaaaaggggaaggtgtgtat


gaacagggagtaggtcacaaagatcacctgcttcaaagggcaaaaggcag


agcaaagatcacatgcttctgaggaaacaggaccagagcaaaatcagaaa


ctcctgataagggtctatattcagcggtgcacgtattgtcttgataaaca


tcttaacagaaaacagggttcgagagcagagaaccggtctgacctcaaat


ttaccagggctggtgtttcccaatcctagtaagcctgagggtactgcagg


agaccagggcatatctcagtccttatctcaaccacataggacagacactc


ccggagtggccatttgtagacctccccccaggaatgcaattcttttccta


gggtcttaatattatattccttgctaggaaaagaatttagcgatatctct


cctacttgcacatctgtttataggctctctgcaagaagaaaaatatggct


cttttttcccaaccccacaggcagtcagaccttacagttgtcttcccttg


ttccctgaaaattgctgttactccgttctttttcaaggtgcagtgatttc


acattgttcaagcacacatgttttacaatcaatttgtacagtttaacaca


atagtggtcctgaggtgatgtacattctcagcttacgaagataacaggat


taagagattaaagacaggcataagaagttataaaagtattaattttggga


actgataaatgtccatgaaatcttcacaatttatgttcagagattgcagt


aaagacaggtgtaagaaattataaaactattaattttgggaactgatatg


tgtccatattaaagtgaaatcttcacaatttatgttcctctgctgtgact


ccagccagtccctctgtttggggtccctgacttcctgcaacacattgtgt


gacttcttttactctctactgtctctttgaatgcaacactacaacctcct


gtacccttagaacccctaacaaagtgtggcctcttatcaattttatgtca


ctcttttatatttactaatattatatacattgtattttctcaggacttat


tttcctggcaagggtataataaacaggcatttttaacaactgttttctga


aaatgtcatggtcattatcaaaatttgaaatttggtctgagtagctactt


cccttgttgtgaacatcaaatgtgttatatgattttgtaacacaaatttt


aaaaatccttcattcattcatttatcctacttttgttcattcaataaata


ttcattttctacctagtagaaggcaggcactcttttaaggattggggata


tggagatgattaagagttctcctcttcaagaaacacattttctgattggg


cagacagaattaaaagtcattataatgctaatagaagcacaaagtgttat


gtgaacatttagaaagaagcaaatagctagctggcaggatttagaaaagt


tccatagaggaattgacacttaaactaagtcttgaggataagtgagactt


ctccaggcagaaagaggagggagagaggggatatttcaaccaaggagacc


agcatggacaaaggcactgaacatgacaaagcttattgtttatggaactg


taaggaattcagtgaggctggcgcatggggtctttgtaaaggctgaaaat


tctgaagggccttgcagacattctctggagtttggacattatcccataca


tcaggaactgcaaactcttatgtctaccttgggccatgcatggatgtaca


tgactaaagcagaccaagttaagactataagcagtcgtggggctgtggca


atgtgaagaatatatcctctgtccaaatgggatagtaaatgccattggtg


cctccattctccaaagaaaagcaggaaatttagatgtatatgaaaaattg


aatattaaatgttgtcaacttatccatattttgaagtacactgttagtcc


ttaagctaattttgacctacagacaaattttgtatagcttttgcatgtat


atttctactgaaggtcaaagtgtttatacccattggagaatagtagccac


tctttatctgcaagttatatgttccaagatccccagtggatacctgaaac


tgtggatagtactgaactgtatgtatactgttttgtcctatacatgcata


cctgtgatgaagtttaatttataaattaggcaaagtaagatattactaac


aatagctaataataaaattgaatagttatgatacactgtaataaaagaga


ctatgatctctctctttcaaagttatcttattacactgtactcacctatt


ttcagactgcaattgacctcaggtaactgaaactccagaagagaaacctc


aaataaaggaagactattgcctgttttatctcaacctatcccatagccat


gtgcctaaggactgtcttgaggatatttaatttttgtcatttttttaatt


gagtccactaattacttactcttcctagctgctctcttgctttcatgcca


gacttgctgccactcaatagtcagaaatatgcagcaacaccccacacttt


cctatagcaattataatgttagaaatactaggtatcctttcataacgcat


gctgaccattttcttgtagatttggtcccatgggtacatccagattctta


aagtcattcaattcactgggccatcaggtctttaagttcttgacaacttg


gctgtacagattacaaattttattcgatcttcttgactgaaagaaagtca


acgttcctgggttgttttgtctaacttataaacaataacaatcatccatt


ttatgtgttctgtgagttttgaatatgaaggggcagcatgagcaaattat


attttatgtgaactttgaagtaaataaatatatctaggcaaagccacttt


aggtgtccaactgtctggattctaatcctggttcaattagttgctaccta


tatgacctagagcaaatctcctaatttctttatgcctcaatttcctcatc


catgttacagggatgctaatagttagtagatggggttagtaggtgcagag


tgcttcagaatagaaaaatgaatttactaggatgtaagtgggatataagt


acttaactcctggatgacatgatgttgtgctgggggtacacaagccaaag


aagaattatggcttctatttctggaatatttattttttacttgaggattt


tagaaattattttcttatgaaaacagtgacacatataatggaattaactg


tagcattttgcatgaattttaaagaaaaacaagagtctttaaagatcttt


cagctgcttcggattttggggggaagaatgggaactgcattttcttcctt


ttgctgcactctctcctttatcaggttggtggaaatgcttatgattagca


gaccatatacactagttacattagttacatttgcctcaaaactgtgaaag


tacctcctgtatttatgtgctatgaatgctgcccaagtgcagaatctttg


ccatttaaatgttcagattttaaaccagaaatgtgttggggtaaagaaga


aaattattagagcatgaccttagggaaatcaagtcacaaaaaagaaaaga


gctgaactgaattctatttcccattctgttactaaccagctggcttatta


atgtccataaagagagagatttttgtccttttcactgataatcctccaca


ccttgaataaatagataatgttgaatgatcttgggaaagacactttgtta


aagtctcagtttcctcagctgcaaaacaaagatgtgatcttaaacttaaa


ctttattgtgtatggaattaaatagagtttttgtttaaaatgcaggttct


ttgagtccatgcccagaaattcagatttggtagaaccagatgctattcag


caatccacatggatttggggcttgggttgtaaggagcaaaaatccaactt


aaaaagaaaggttttattttttattttgcttcattttttgtcccccctct


cttcccctccccgcacataaattggaaaatacaatgcaaggactgcttaa


ggcataattagatctagctgttcaaataatgtcaccaaccatcttttggt


tatgctttcttctcttttaactttatacttagtctggtttttcctttaag


gagttcatttttaagaacttcattcttaatctgccttttctcctagcttc


cattcttctgacatagaaaccccagcacaaagagtattcctttttctcaa


attttccagctaaagaaactacacccaacttgactgtattagattatgtc


aggtacccgtacctgcactcatcaccatggtcagcacctgtgattcacac


ctcagcagccttgagtcaaatgcccaccttggatttgggagtacagtcac


tcaactcaaaccatgtgggcttggacgtgaaggtgggatggttctccaag


acaaattggaattaatttttttagaaataaggaaaatgggtgctttttag


ccaaaactatagatctccttccactactgagcatttttaagacgtattta


gagatgaatgatccaaaaatgttacaaaaatataaccagtattccaagac


attgaggcattcagccctcaatgtgacaaagcaggggctgagaagtacct


aagatggtcacagtcccagtgccaggtgcctcacctatttacctcagtgt


ttggggaaaatattaagtattttccacgtgttattgtattagttttcaca


ctgccataaaaactacctgagactgggtaatttataaagaacagaggttt


aattgactcacagttccacatggctggagaggcctcaggaaacttacagt


catggcagaaggcaaagggaaagcaagacatgtcttacgtggcagcacga


gagagagagagagagcaagcgaggaactgccatacacttttaaaccatca


gatatcatcagatcttgagacagcactagggggatggtgctaaaccatta


aaaccacccccatgatccagtcatctcccaccaggtcccacgttcaactt


gtggggattacaatttgacatgagatttgggtggggacacagagcaaaac


catatcagctatgaaggtgatctatgtgcaggtgaactgtgacttgtgct


atgggaatcagtggattttgagaatcagtcaaaatcattattgttatgtt


tgacatgacattttgaagtcatctcaggcatacaataatagtactactct


aggatcttcatgacataacctggaagttggttgatgcaattgaaaagaca


aagccctagtacaaaaaaaaatgggacaaagaaacaggcaggaccagcaa


ttaaatcttgctttgatttggttatcagtttgtgtttatttaataatctt


ggacaaatgtattcatctaatgaatacagccccagcctggatcccagcat


gcctggttccaattaatgtttgaaacataatggatacatgctatacagat


aatgaaaaggggtaagcaggcaacagtgtggaagacaaaactgagttgac


ttagttatcagataggtcttaggtgtattacatgcgagttcatggaagta


aattcggatgaatgaattattaacccctatttttgttaagtgaagaagct


gctatcatttccttaagctcattaatacaattaataaaaataaaatgtct


acctcttcattttagtattaagtcattagtttagtttcagatgcttttcc


atacccttgccccactgttaccttcaccaaagtaatccaatgtgcagaga


aaagtgttttcaacaaataatgctggaataaatagatatccacgtgggaa


aacaaaacaacatttatccatatcataccatacacaaaattcatttgcaa


tggattatagatccaaatgtagaaggtacaagtataaagcttttataggg


aaacataatgtaaattgtagatagacaaagatttcttagatgagccatag


aaaacaataaccataaaataaaaaataggtaaattacatgttaaaatgta


aaacttagcataatggcatgtgcctgtagtcccagctacccaggaggctg


aggcaggaggatccctttagctcaggagttcaagtcaagccggggcatca


tagcaagactgcatttctatgattaattaattaaaaagttatgtttaaag


aaaagcagagtgaggagtgatatctggaaattccttattaaacaaacatt


actactagaaggatttcctctcttgtgcatttagttcaagttttttgttt


gttttgttttgttgtttttgagacaggggctccactctatcacccaggct


gcagtgcagtgatagaagctgactctgtgagcttaagcaaagcacatggg


ggattttcatgcaatccaaaatcagggatgcaatggagccttaagaatag


aagaactggctcggcacagtggctcacgcctgtaatcccagcactttggg


aggccgaggtgggcggatcatgaggccaggagattgagaccatcctggcc


aacatggtgaaaacctgtctctactaaaaatacaaaaatcagctgggcgt


ggtggcacacacctgtagtcccagctacttgggaggctggggcaggagaa


tcacttgaacctgggaggcggaggttgcagtaagccaagattgcaccact


gcactccagcctgggtgagagagagagactctgtctcaaaaaaaaaaaaa


aaaaaaaaaagaatagaagaactattgaagtgctatagggaaccaagtca


attctttctctccatatctcatctttgtctttcttcctcagagtagtcca


gctccctgcctcctcagttcacacagtgaatcatggtttctaattgttct


tgagccattttcagtcagtccaccctgagaaaggaccatcttcctgtctt


tcacaattctcagggaagagtctgattggcctgcttgggtccagtgacca


cccttattccaataagctgtgaccagaggcaggatgatgtttaaagaaac


atagcattttctttcatctcttgtaatgtggagtgaaaagaaattgcatc


acagttgagatccactgtacatttactcttcattagacatgagacagagt


tcacaaaaattttgcatttggtcacagtttttaatggcaacaattattgt


tgtgttggttttttacttttttgattatctaatatcggcatctcctctct


gcatatctatgtattgactaatacatattaagggccataaacaaagatgg


ttcattttagaaatgggcacatttgtttataaattattagccaagatcta


ttcaaatccagacatgatttgttctagtctggatatacaataaaattaat


ttaaataattttaatcaacttcaaaaacagcaaattccaagacaattagg


ataactttaagaactcatatttattgagttaacagatggtaaattaaaaa


acaatgaagctaatttttaaaatcattcattgaattaaaaaatagactta


tttaacaaaactgtttttaagctgcttttttatcctaatatgcatgtttt


aagattccaatttgggggactctagaaagacgcagagtgagtttcagtat


atgaacactcttctactgagttactttttaaaagcttcagaatcatctga


aaaaaaaactatcaacaataaaaatataagaaataagaatatttcagtta


ttggaagggatggagggtttgcttcttcaattccagtttgcttgctaaca


tggtcctatggtgtattagtttgttttcacactgctgatgaagacatacc


tgagactgggcaatttacaaaagaaagaggtttaattgaacttacagttc


catgtggctagggaagcctcacgattatggtagaaggcaaggaagagcaa


gtcatgtcttgcatggatggcagcaggcaaagagagagaacttgtgcagg


ggaactcctcttttttgtgagacttattcactatcaggagaacagcatgg


gaaagacttgccccagtgattcaattccctcctaccaggttcctcccaca


acatgtgaaaattcaagatgagatttgggtggagacacagccaaacccta


tcatatggtcttctgtatttctgaccttgaaatatcttgagagtttcctc


tgctaacagagtgaaagttgctcctggccagaactttatttctgcatttt


aatggacaaacataaaaaacttcccatctgatggcttcttgagcctagtg


catgggtgacttgaaatatgctagcaccacacagatattttttctttgat


cttttctgttttctcctttgttttctagttgtaagtgccagaagctgact


gtgagcttataggaaaaggagaatttcatggaagcacactgaaagcatgg


gttcttttccttttgggttctcacttggtggggtctccttgcagcttttg


gaggtggggcaattaacatcccccagtacatgaagtgtaccagaaactgt


ggttgttttcaaatgagagttcaatcacttctacttcctgaggattttat


agatgaagttggagataacaccattgatacattaggatacatcatttgca


ttttaccagctattattagttttaacatgatggcatatttagaactagag


ttatccatgtagaaatgaagttgttccataaaaaaaatgggaatacattt


gtttgaatggctagttgctcaattactgttatagtctttaccatgggcag


tcacaaactgttgcaatttattttatggtcctggaaacttcttgtctaag


atctgatcttattttacgttgtaggtattccattacttgatcaataatgt


ttagaattgtatatatttgtttttgtagaccaattcattagttgtaattt


gagttagaaaacatttagagatatagaaacacatttattgttgagattgt


agtaaaaattattttattgttatgcatttgaaactctttggcccctagcc


ccctttcctctggaaataattgtttttataataccaatttgataatatga


atgttcttaggaatgtaattataacagtatagtaggagagactatattgt


ttttaattaaacaagaacaggtgaaaataagaatagatcatgaccaaatg


aaaaaatgtactagagagatgtaacttctgttttgcagaaatagaaatca


aagatatagaggaaaaacttcataacttcataaacatccaaggaaattaa


taagagttaaatggataagaggtaattagatgacaaattatggagctcta


atttagaatgtaagctacttaatggcaataaacatcttgttgttacccaa


cccctaagacatagcttaacatatagcatagcttaacatatagtatatac


tattattgttaaataaacattattttcttgaatttataaataaaataatg


agtgttttaaaaatagagcagatggatcagaagtaaaaacttgttataat


gttataataaaataaaactttccatatgtaacaactacactgacattgta


ggatttcttttgagcaaaattaaggaatgcaaggcataagtaaataaatt


tttaaatttctgggacgaagaaaatttcttaaagtacatgtgaaggaaaa


acatacacgcttttgtagattgccttcatctttcttatatagaacaagaa


ctttttaaagaacacaggtaaatgtccacataattttaagagaaacaagt


ttgaagtatactagaaactctggttgtttccgtatgagagttcaatcact


tctacttcctgatattttacaaatgcaattggagataacaccaatgatgc


actaagctacaacatttgcattgtaccagctagtattagttttaacacga


tggtatatttggaactacagctatttatgtagaaatggtgttgttctata


aaacgttttatgcctacccaattatcagcttatacataaaagcaacagaa


taacaaaatcttccaggtatcagagaacataacaaccgaatgtcttttcc


ttaaaaattgcaggaaatattgtccagatcactgagaggtgaggtagtat


aaaaactcaagaataagaaattccaataaaaaagattaatactaagcatt


gaaggcaaagacaaaataagtgaaatatagtgaaatcaaagtgaattggg


aattgatgctgatagctagatattttttactaagggagcaatgttttcaa


taaaataatcctggagggttttacatttttaaataaagcatctataccta


gtgatcacaatatggaaagaggacaggaaggaaataaaagcattataaga


agcagcatttttgctttattttcaagtcattaaataagaatgcatataaa


agtatagttgtagacaatttaatgtcggcactagtagaattgaaagcagg


atatctattttctaaataattatcaaagataaaaaagcaaacatagtcca


ttcaaaagacaaaatgaaaggcaacaacaaggaattattgaagagaagaa


agtttaagtccaatcaagcttaaaaaatatgacaagtataaatggcttaa


attttcctattcaaatggaaagatccttatattaaattaaaatcaaagtc


taatggtttggtggtgcttaaaaaagtcatacgtaaaggaaaagaacaca


gaattattttttaatggcaaagttttttccagcctaacatgttcacatcc


aaatcaaaaatacaatatgaatatcaaacagggcagaatttttggcagaa


aaactccaaaacagtaatcatgacaaaagggactattttatacttgttaa


taatgtcgttctactgaaggaataatagtactgaaagagtatgcattaaa


taactacaaataaaaaataatttatacattctgttttaaaaatacagtat


tacttaccacagcaaaataatatatatacaagcagtacaaaagtttacaa


atttaggagctatacaagtgaaatactttgtatcagtaggttaaataaga


aataaattattaatttaatacattttttgaaagcagaagagaaaatttac


tatctgttcctgaaaaaatgttgaaaatagaataaaatattttatcttaa


ttattaatatcaaaattaattaagaattctaaaaatgtacatatttaaaa


caggatgggatgaaagttgctataatggttttgccataataatatcattt


aatagtatttttaaaactctgatcaatgtaataaaacataccaaggaaag


aaaagctatatagaatgaaaaaagagatacaacattatgtattgaaaaaa


tatttttatagtttaaaaacccaaagtagtcaatgaaaacctaaataaaa


ataattggtaaaatattaaaacatataaacattaacataaatcagcagta


taaatgtaagcaatcattaactcaaacataaacttttaaaagagagaact


ataaaaaacaaattcacagactggtttcaagtgtgaacgataacattttt


atattgttatttcccaagtttaatggaaatgtaataacaatctccctttc


attttttactttttgaggtagtggtgagaggaatttgattttagtgtaaa


tctagaagaaaaagtagtcaagaatagcaaagttcaatttttagaaagaa


ggagtgactagtcttatcaaatattaaaatatataataaagttataataa


ttaaaactatactggcaccaaaataaatatattaatgaggcaaaacatgt


gggcaaataaattatatataaaaatgaaatatttgataatggctggacca


cagattaaaggagagggaaaatgttttgcaatatatgggctaatggaaaa


aaaaaatcagcttttacactcattatgtacataaaatacattatgcaaac


taaaaccttaggaagtctaacgagactttattttataggcttaaaacatt


gtaacaaaattgcaaagtgaacagtaaaatagatttgattatattaaaca


cttgattagttaaaacaacaatgtaacaagcaaagggtggcctatgtgca


attgcaattatttgaacataataacaaagagttgatatgatttcacaaga


gctcatacaaactcatacttaatcaatatatgagataagaatgaaatcac


atttcataaaatatgaaatataacacatttgcaagcacataaaaatccag


ccttaataacaatcaaatgaatatacactaaaacaaaaaaaaatgagtta


tgtttaggtttaatcaactatttaaaattgatgtcaaggacttacacttt


cagctgttacatttagatcagctcatcctataaggacaagtagaacaaat


agaaaagttgaataaaatctaccaaacgtttgtttaaaagcactggagag


ctccacagtggagagagcttgagagcagagaatttaagaaagatcctaac


tgcctgtcagaaactaaagtaaaacatcacttaaagaagagatgatcttc


cagaggtctaaattatctctacaatttttcatacacaatgtctggcattc


aataaaattaccaggcataccagaagacaagataaaatgactgaaaacag


agctagaggagatccagatataggagtaacaaattttaaaatatctatgc


tatcttttaaaaattatatgagtagatagaaaatttcaacagactactat


taactgtgataaagaatcaaatggaaattccagaactgaaaaattaactg


aaattaaaaactcaatagttggatataagagcagattagaaattcttgaa


gaacaaacttatgagcaatagaaattttgatgttaatacctaacaaagtc


agaaaaggtgtattggaaaatggtttctctcatgtaactggtgacatgta


aacctatataccactttttggaaatactttgataatgtgaattcactatt


tgaacaaaaaaatttatatcttttaacccagtaatcttatctctgggaat


gtgccataggaaataagtttttaaatccacagtgcctcatgtatgaagat


attcactacagacttatgatagtaataactatctacagacaatgtttaaa


tactttatttcaattaatgcaggttaatcctcacaacaatctatggtgct


agtgaacattactatcctcattgtacagatgagaaaactgaaacattgag


aagtcaagtaacttgctcaagattatatagctagtgcatagaggggcaaa


gatttgaaccagattgcctggttccacatgcactttaacctgctttattt


aggcttttgaaagcaacctaaatagctaacattgactaaataagtgcagt


atacctggtctgtgtgacattttgcaggtattaaaaagtgaacattctga


acccttttagcaatatggcaaatacttctgatgtgatgtatgcatataaa


tatgatcgaaagcaaatatcccaaaaggttgatggcagttatgtcaggtt


agtagcacaatagtttcttttctcattttctcacatttttagtaatgtgt


gttgcctatattaaaaaacaaatattaaaaagtagtagcatgtattagat


cagagcatgggctcagactgcctgtgtcatctcttattagatgtatgacc


agggacaagttattgaccttttgtttctaagtctctaagtttatcctgta


aaatgaagataattgtagtaactacctaataacatgattgtaaggattaa


attatttagaacactatttgttacttggaaaatatgcaatcataagctat


tttgttgttattgttattttactaccaaagcttattgtttcagAGCCCCT


AGGTGACCAGGCAAAATGGCAGTTCCTTCCAGCTGGTCCTCAGATGGGCA


CATCTATTAgtaagtttcatttctgtatttggatctttattcatccttat


tatgaattccccaatactgaggtttctaggaacctacctgccattggctg


aagggttgcttctgaattctttcccagaggcagtgacaaaaatcaaacat


ccgatgtgttctttacctttcagtaagctcagcctccctgcttgtgtcac


atctctaaacatacgtacatgcgtgattgaaactgtctattgaaagcaat


attacattaacattatcaatactgaattagagaaaatatctatcaatgtc


accaattccacctgtctcttgatgtatgactttttacctgacaggctaaa


taacaatagaaagttctttttaaaaaatagaagagttgctatgcagggac


ttttttggaattaagtgccccagagaacctagaaatgtgctttaaaactt


tttgtttcacctggcagatccaaaaaaaatttggaagcttttggttgaat


tccctcattacttgaagttgttttatgaaaattgaatatatatatctgtt


aggttaccaagacttatggttagctatcttaatggctagtgatataagac


cttgtaacaaaccctaccaatggaactgacaggattttaccagggaggat


attgcagtacctgtaagaggaagggatacaaattggcacttcatacattt


gtgagaacaattgcctttaatgtgtctatttggttttcataatattgcag


GCCTCTGCTCTTGTAAGAAGTTAGCTGCAGAACCCACATGTGAATCCTTG


TAGgtgagtccaccatttcactaatactgtgttttaattgcctcagttac


gtccatctgacattcattggcaaagtcctttggttaaacttcctaataat


tctcagctctatcatatggtaaatgtttaacacattgctttaatgtttga


gtttttcatttttgttgaagttattattcctctcaggcttattcatgaag


gcatttctggatttatgcctcccctgacccattccaggatttaccccaaa


ccttccacactctcttctaacaggaaagttctgttatgacacaatagtac


ttattaagacagatttaccttctaagtctcaggacagcatttcacaacca


gaaataaccggtcacatgaagaaccagagtctggtagtagtgaaattcat


tttccttcttgaaaaagtggatcaaaggattcaaacagcaagtggtgaat


caatgaaaagtggtaaaatggtgaggaaaaaatgttactaaaagatgacc


tcaagattactggtgcatatgaattgcttttttatataggaaaatactgg


ataatttcttattgtcatagtataattagaagcaatttcatgtgttcatt


ttgccacatgagtttaaatggaatagatttggttccctctctaacatgag


ttcagtgtctgaacttgggcaaatttctaaacaattctgagcttcactac


ctctgcttgaaagtgagaacaattgtatttatctattatttgtctattag


gttatgagagcaaaaatgtcataacataaaacacctggcacccagcaagc


aattaatgctagtccttcccacccctatttatggaggtagaaagaaaaaa


gataacagacagctctacttttatttttacatatatccttcattgattac


cttatgagtaaacctaaaaacagcaaaattctcatctcttcatccttcat


ttctcagtgctttatcaaatttctactatgaactaagaactgagtgctat


gggagacacaaaggagagtcaggtattgttctgttctcaaggacattggc


atctagaaagggagatttttttttaatgccattagagagacaaaataaag


cactgtcctggttttaagcaggaaaaatcattttaactggggttaccagg


gaaggtttcatggaataggtagtgtttgggctttaaagaatgggtatagt


ttggaaagcaagaagagaggaaatacaatgagtcatgactgaagcacagt


tgacccttgaacaacacaagtctgaactgtatgggtccacttatagtgga


ttcttgtcatccaaaagtggatcaaaaatatggtatttgctggatgcaaa


acccatgtatagggaggactgacttttcttctatgcggattcagcagggt


ccacttgagtataccaggattttggtattctgtggggtgtcctggaacca


atcccctgcgcatgccgaggggagggacaactgtagtataattgagagaa


aaaccattgtcccaggattggaaggacaggttggcattaccttggatgaa


ccacttctcccctgtagccttcaacttcttcacttgtgagagatgtagaa


taatccctgccctgttggtctttcagGACTCTGGAGAAGATCAAATGGGA


GCTTAGATGTGGAAGCTCTTTGGAAACCAAGAAACACTCTGAAAATGAAA


AGGgtatctttttttccttttctccctttaccataaatttcatgatggca


cttaacagagccagtcgttttggtatttaaaaaatgtttggtgaaataat


tagttattggttgaatgaatgaatttataagtgaatacatgaattaaaca


aaggtatggagtaggaaagcacataatatgagcagagaataagggattct


atttggaagcatgcactagaactggatgttagggatgtgaatgtggctag


ctggaatgggtctgtggttctcacaatgtgatctgttgaccagcagtgtc


aataccacctgttaacttgttagaaatgcaaaaagaattatgaatttgaa


actcagggtgggacccagcaatctgtttctctctaagtcattctgatgca


tagcaaagttttgagaaccatgactttgtattaaggtggggagtttggaa


attatttttctgtaagttcggtgctaatgaagattttcaactgtgggtgg


acataatcacagccgtatttctaggatgatgaggcagaaggttgtaccta


caatatctatgacttcagcattaaaaggacaattagaagtactggaaaac


acaccgtatttgccaagctccagaaatgaagattttgtttttacctattt


ccgaataatatccccatacttacctaagaggcaaagcagagcaagcattg


ctcccaggcactggggaaacccacttggtctgcagagcccaggcaggatg


agaatgtttatgtgaaaaaagtaaatgagagcggactgtcaaatacggcc


ccaacctttctgcagagagaggagtaagtaagcctttagtcctgaccaac


tgtggggaaacattaaatggaaccttccaaattggtttaaagtgggcagc


taagcctatgccaaggactaagccaactgcactggctacaaacacacctg


tgtctttagggctggtggatacaaatttgccaaaggagagcacactacat


ggaggaaaaatgaagagccgcagggaaacatttatcctaaaggggaaaga


gtcaagacagagatatcatgggtcagttgggaaggagatggaggggagag


gcaggaggtgagctgttcagattgctagaatgaccaatccagatggctag


gatgaccagtcatcctggtttgcccaagagtgagggatagcccacatcat


aggatttccaggacaatccttgtcaaagcgggacaattagtcaccttaag


actctgccctttcacctgtgttcaagacctggggagaaggcataataaaa


atagaaaaaatcactgctgtcaacctccgcttctcaaaatattgtccaga


aactagcagcataggcttcacctgggagcttgttagaaatgcagatactc


aggcatcctagacctactgaatcagaatctgcatattaacaagagctctg


agtgatttgtatgcacattagagtttgtctcaaaagatcttgtggcatcc


cacagtaaataatggagcacaagtggcagaaataatttcctctgggctgt


aatatggctaatatccaggttgatcagatactaagctaaaaccagatcaa


ccttgcctgggaaggacgcaacctagggatggagagcaacccaggcagag


agggagaaagaggctgacacaaattagccaagaggggaccccctgaacta


ttaatagtagtagtatttgtagggaaagtgagaaaagccagcaagccagc


aattgtatggtaagaataaagctagagaaattaatgttgccagtactgtg


accctatttttaccacttctaggctgatgtgggctgggaaatgggttatg


cttagaaatgtggagcctgccaagtgtggtggctcacacctgtaatccta


gtactttgggaggccgaggagggcagataacctgaggtcaggagttcaag


accagcctgaccaacagggagaaaccctgtctctactaaaaatacaaaat


tagtctggtggtggatgcctgtaatcccagctgctcgggaggctgaggca


ggagaatcgcttgaacccaggaggcggaggttgtggtgagccgagatcac


accattgcactccagcttgggcaacaagagtgaaactccatctcaaaaaa


aaaaaaaaaaaaaaaggaaagaaagaaatgtggagccacaagtggtaagc


ttgtagcggtaagcttgtagagaaaaatccaaaatgatcatccactaaaa


gtgttcaactccagatcttggcctcatccacttgcatatcattcaaatac


agtgtgcaaagaatagttttctttcttttttctacctggtctgatcttca


tgggcttcagctctgcagtctagcagggataattgacacttaattaatag


tgtttcattctcctctagcttgaacatatttctttctctttcaacattga


agccagtagttctaaaaatcaaacatgcaaacatgcatcagtcacctgga


gggcttgttaaaacacaggttactgggtctgccctcaaggtttctgattc


agcacatctgaggtggagccctagaatttgcatttctgagttcccaggtg


atgccgacattttggtccacaggccacactttaagaacctctgattcaaa


ctattcagagtttatttcatatccaaaagtgattatttaaaaagtatctt


agattaatgctcctcaaactgattgatttttccaatcatgtaccaaatac


atagtcctatttcacatgaccagtattcagcttttgctgcaagcaactca


ccatgccagttccacacacctgaacaggttatatcctgtttaacaagatc


agcccactgatcacatactaggatgtcatggcaatgtcaatttatgatag


aagtttctaaacatgaactttcatctgtatttatctcacccccgaacagg


aacagtttgtggacttgcgctggcccttggaccacacttcgagtgacatt


gttatagatgacacggtctcttataggaaaatgcacagtctttcttagat


tctctacctccctcttccatctcattcccaacatagatctgggtacatga


gtggggtcttattataatcttgtgacacccttggatcatgcactaccctc


taaataatcaatagtttctactatagagaggctcaatttatcttctccct


ggaattgggaccactgaaatataactagaacccaactgatcttttgagat


gttgtgtgcttgcccttactgctactgccgtagttctgaacatttcccca


agcatcaaaaaggccccatggcctcttttcccctagacctctgccagtcc


accaacactctcagtggggaagtaagaagcctggtggatcctgcttccca


cacaaaggccatgatgagacagatgctttctaagtccgatatatctaccc


actttctgctgtcacctctgtacctcctgagtcacatggaatgtggtgct


gagccaaatggtcagtcttctagaagcagtggaatgttggaggcagctta


taccttagctgattcttaaattttcaggaattttgtgagccagtttttaa


acacagccattgttgaaaattaaacaatataaacttataatcaaatatat


taaaaataaagataatgccctcaactcatcacttcctatttattgtttta


cactttagtactatctatgcttattgtatgtgtatggtggaaatactaca


taatgatatttaatggtgtactgccacacagctcttcacaactctgcatt


cagtgacatcactttggtagcttgaaatcagccatgataggagtatttat


atcttggaaattggcaaatattacaaatcagcattccaccctcccttacc


tccccaccccccagccagttgttaaacatttaccagcatgcaaccaccca


gagcctgcatctgggaaagtgagccactaacccaatgacctgtgataacc


ctatacatatctagctgttactaagctgcccttcccataacggtctgccc


caaaaaggtgtgtgtgcagaggagaataaaaactaaaaccctaagatcat


cttatttacttgccctatttctgcctcttttctctccctgccatctctgg


ggtccagaagtagaagctttttattgcctccggagtttattcttatacat


caaggataaacattcatgacctaacatcatctctcttgttttctcccgcc


aaagctataaggatagtctaatcatgaaaaaacattaaacaaactcaaaa


tggggacagtctataaaatacctaaccagttctcttcagaacagttaagg


tcctgaaaggcgaggagagaagaagaaacaatcacaacttggaggacatt


aatgcactaaatttaatatggtttcataagttgggctgaaaagaaaagga


cagtagaaaagctgggatatgagaatacagtctaagtgtagtactaatgt


taagtttttaatctctaacttgatggcatggttatagaaaatagaaacat


taggcaaagctgggtgaaaggtatacaggaactctactatctatacaact


tttctgtaaacataaaattatttcaaaataaaactaatataagaaaaaaa


catggccgacataacagatagaatggatatcttctctagtctatgaaaaa


ccctgtgtcctacattcacttgctttgtgatataataaaagggggaaagg


aagggatgatagaaacattattcctcttaataaacttggattttaaaatc


ttttatcttttcacagcataaaacattccacttaggatgctatatgttga


gcattaactttctctttttctttatagtctttctgtaatagttcaaatcc


tgcccaccctgagggtggatgtttctagatgatgaaggaggtcataaaca


gagagtagattaatataattatctttatttcatgtctatttgcaaatggg


ccattgttcacatagttgtctctctttctaaatggaagaatagttggagt


tgggggtggggaaataccagaactgagaggagtaaaggtgcttcaagaca


atgcttcttaaactttggttttgtgtttgcatctcttggggatcttgttt


aaagtaggttgagattcagtatgtttggggaggggcccgttaatctgatt


ttataaaaagttcctagatgatgccatgctgctggtctctgagaagcaag


agtctggggtcctttctaaagtgtttcctctctcctctatacagtcagaa


aaggaaagtgcttactggtcagagatttaaaagacataggagatatacca


cacttctataagaccagaaacaaaaacaaagaaagaagaaacataataac


caaaatatgacattattgcacacaatgagatgcagtaaaagatggactac


ttaggcattcactggacataagtgaagctgcctttggagtttaattttaa


gttctcagtaactcactgtatttagtcatatcgtttcacatacagacaaa


attccttctctactaagaaggaactaaagctctcagaattactgagttgc


acagtaccaatttacatgaatatttttcttcctctaaaagtttgtaatga


ctatggttcctggaaaataatatttcattttttcaattcatactcttaat


ctaccaaaaaaacactcttccaaaaatataagctagaaaaacaggtaatt


tgctcatatcatagagacaactcataccaataaaaaaatacattttaaaa


tttggaaaattacaatgagagaataaagcattttgcatgcagatgtgtta


tgtttttactccacaatattgggagcttccaaagtgtaacatgcatttac


agagccactctcttagcatttaaactgggaggcatttttcgagaagcatg


agtcttgctcagcggtcctgtagccactctagcaatctcacacagtacat


gacttataaatattgtatgtgcttgatggaagtagacctggctttgctaa


tcacttaccagttcttttgcccagtatttctttgttttagaagtaacttt


gtttagcttaccaaatgtatattgtccaatttgtaggaaatataagtaat


attgcaaacagcaaaatgggattgtatgtgcagatctctgcatcctggtg


ggccctgtcatgttagtaatagcttttccttgtgctgttgccactgccaa


acagactgcttatgcgatctttggctttcttaactcctctttggggatca


ataaagtgtttcttttttttcttagataccaacttcatctctctttaaga


atacaatattagtttaaaagattgttcaacttaaagcaatcttgtagaga


ttgtcacccatgattgatgctgggtgagtagaaaggaaaatgttgatttt


atcctctgcctttcagatctttgaaggggaaagtgggctagttttttaaa


ctgcagcctgatcaaatgtttcaaaagttgttcaaaagtaatcttagtcc


aacactgctcctctgacttttatctagagaaatagaaaagtttatatatt


aacgatttgtttcttttaagtacttgtgtacatttttctcttgcaataag


tattaataattgaaaattattaaattgtcattttaatgttttttattaag


taagtttgatatgtttcttagcaacgaagacctgtggctcaaataggagc


agacaattcatggaatcaaattttaaattcaagcagagctatcatactga


catatttgtatttccctttgtcttggaattattggtttgtttaaaataat


aagtttagccaattattttgttattctgttatttaaaatataaatttgca


gtctaaaattttttgtgtaaaggtgagtgtataaatgaatataaatataa


ataaataactaatgatcacatttctactgggaaacatcattatttcttac


tttaaatgtgtcaattatctttccatgatgtgtttgtgcacatgcaagtg


ttggctaagcctgattctcttttataaaaaaaatggtaaatctcaaataa


aagtgcaaaatattatctttcatgcaaatacactaataataccaaagaaa


gcagaaaaacttacttaaaaataatttttcttgacaagagatttataaat


agcagcttgactaatcaatttcattactgttccttagctgcagccatgtt


ctcttggactatattgctcaatgatcttgtgctttctttgctactcatac


taaataatttgtgttgttttgcattttttatatcaggaagcaaaatactt


aaaaatttataaaagttgaagggaaaaagggcactcccatagtagataaa


agaaaatgtagcattacttcatcatctttatggagactcttcagaaagat


gcatggtttttttagaaatatgaaatgatatagtatctgaaagttgcttt


aactactaatgattgggatggtaaattggcacaacctttctgatgggcag


gtgggcactggtcattaaaaaacaaaacaagaaaacccttaagctgtata


aagctattcatacagagagatttacttttataaatttacgctaatgaaat


aattatggttgtgttcccaatttagcataagaatgtttatcacagcatgg


catatattggaagtttagtaaatgaagatacatccaatctgacattctat


tcacaaccacaaagagataatgtagaaacatatttagtaatgtgagaaaa


tactaaatattaagtgaaaaaggtaagttacaaaataatttgcataaaat


catctcatacaccacccccccatatatatacttatagtatgttatataat


gaaatgttaacagtctctggtttggggtgttggattatgaaatacatttt


tttctcatatgtatattcttgacccttcaactaacaagcattatgttttt


aataaggaaaataatcattcacctttaaaaaatgaattccattaatatgc


ttccatttgtcctttgaaaatttaataactttatatattagttattccat


aataagtagttgattttgaaaatagaaaattaggtagttgttctaaaaca


aggcaagaaagggttgtgtaaaataccataatgacttattttattaatag


ataagccataaatatcctgatgaattattttctgatgagaaataagaatt


gctgtggctcagagttacattttcttgtaatgtgaccaaggccaaaagga


atttgatagctattccaactttgtatcgtaaggagtagtttttaacattc


ctgaatagataagatagctcctggaatcttcccaattttcaactatttct


ttgatgactatctatagtgattgatttgttgatataaatgttaagctgac


atatatttactaaacgtctattattagttagatattactaactgctatgc


ataccagagagcatgaagtataggctctcacctaaaggaactacagtatt


gttgggtaataaattgtatgaaacagaacatttttagagaatataatgca


agtactaaaatgactgaagagataagaaccatgatatttataaagcagtg


gataaatgtaggctgagagtttgggaaaaacttcaagaaaagttgtgact


tgcctggctttggaaagataagtgatacttggttatgagagaggagggaa


aagcatttcagcttgtgagatgaaggtctaaagtccttagggctgaattt


gctggttgtggctatgtttactgcagttccagcagagttagagatgtggt


taggccaggacagtaaggatatctgcctggaagattcctgttttcccttt


agagggaatagattaaggtggcagggatcaggtaggaaatgatgggggac


cctgaagttagagctgagaatgtcttctcagcttctctgataccatgctc


taagtttgctcccatttctttcgtggttccttctcttttcatttgttggc


tcccaacctcttctctactcagtctctgaatgttggactcaactgaagtt


ccttagctgctcttctgcctctgttctcattccctgaatgaaaggagagg


gttcaaactatccaccactatggtttaatacaattacagagctccttgga


aaagtgatttgtctagggctggggagggataaaagaagatgagcctggag


gatgtcatggtaccaaaaagtaagtactcaagaaaagcgggggggcacat


tgagagaacacaggagccaacctaaaagagctaccaatggccaaagctag


accaatttcagacaaaataaaatattgaattatgaagcagaataaaataa


acacaatttagtccatactgatataaataaatgattgaataaaagtaaat


ggtagagaaaggacaacttcttacagaataattccaaataatagatgcaa


aaggaaaggaggaaatagaaaatcgttgtgagaacactagaacactggtg


aaacgttgctgcagggaggatctgcctatgcatgcataaattaatggaca


atgttttgagagaaaatggctatttggggccaggtgcggtggcttacacc


tgtaatcctagcactttgggaggctgaggcgggcggatcaccaggtcagg


agttcaagaccagcttggccaacatggtgaaaccccgtctctactccaaa


tacaaaaaattagctgggcgtggtggcaggtgcctgtaatcccagctact


caggaggctgaggtgggagaattgcttgaacccaggaggcagaggctgca


gtgagccaagattgtgccactgcactccagcctgggcaacggagcaagac


tctgtctcaaaaaacaacaacaacaacagcaacaaaaaacacaaaaacaa


acaaaaaacctttacaataattaaaaagcatggtaaacaccacattagcc


aaatgatcaaggtcaacatcaccagtaataagacatactgacaacaggta


ctgtgaatatgatgcactgagaagagcacctcacctctgtggtatctttc


ctgtgaatacatagcctcaatttgatcatgagaaagcattaggagatata


tctaatgttaaatggcgagttaatgggtgcagcacaccaacatggcacat


gtacacatatgtaactaacctgcacattgtgcacatgtaccctaaaactt


aaaagtatagaaaaaaaaagaattactcagactaggagacccaatatttg


aaatacaaagtaaacctctaaaaataaataaataaataaatagctattac


tgcaaaaaaaaaaaaatgcaaattgaggagtattttacaaaatacctgac


agatatttctcaagagtgtcatgaaagacaatgataaacgaggaattgtc


tcagattggaggagactaaggtgatgaagaattaaatacaatgtggaatt


ctggattggacccaggaacagaaaaaaggcattagtggaaaactggcaaa


atctgaataaagtctgtaattcaggtaatgatattgtaccaatgttaatt


tcttacttttgatcattgtaccccggtcatgtaagatgctaacattagaa


gaacctgagtgatgggtatatgggagctcagtgctatctttacaaatctg


taaatgtacaattatttcaaaataaaaagttaaacatacaaaacatctac


ctacagagacctctcagtcctctgtttctctgataggatcttatttttct


gcctttttaaaatctttgccttcttcctgtccacatttaaaatcttcacc


ccctttgtcctcacatgtgtttatcttgtcaccttgaaaacaaaagccac


tgggtcttctggataatttctcagatggaagtgaatggcaagtctccttt


tgccctccaagtacagaaatcctagataaaatattgtttttaaaatgtat


aggtaaactcaaacagggaaaattcccaggtgccagcacagaaagcgatg


gtctgtagacattagaaaccctggacctgagggcaggactgaagttgtaa


tgccattgctgggacaggaaagaaagcttcaggcatggataaggtaagga


gctagaactggaccctgcatcctcaaaaacacagggtgtaggaaagaccg


atcattggccggggaggtgtcaaagatgtgcgccttatgctaggggccac


aattgggaaaataatcactcataaattatcaaactcaaacctgcactctt


tgtagttgtgggatccaaattcatactatctatatggcacagaagctaat


aaatgaacgtagaaatggttatgaaatgggtaacacttctatgggcttgg


cagggacaaatgcaaaaccactttgcagggatgttttcataagcctgagc


aagagagtctctcaggtaaaaagctactgaatgtcagtttactatcaaaa


attataaagcaaagaaacaaatcctcatcagcaatgaacccagaaaatta


acactctaaaaactaaagataatttaaaaaatctaaaatatagtttgaaa


taagtatgtttaatttgtttaagaaaaaaatgactatgaaactagaaaat


gaagattttggaaaacatggaattttagaaataatccgtgtaagccgttg


atattaaaacctcaatggaagtgttacacacgagattagacacaatgaag


agcaagaattagatactagaaaacagatctgaggaaatcatccagaataa


agatcaaagaagtacagagaagaaaattttggaagaaaatttgattcatg


aagggtagaaggtgaaggttcaatatttttctaaaagcaattctacaaga


atagaagagagatggagaagcaatatttacattttatctactttatggtc


acacacatatatgtatgtgtgtgtgtatatacgcacactatatttgtgtg


tgtatatatatatacatatatatacacctatatatacaggtatatatata


cacatatatatacctatttatatatacctatatatacctatatatacacc


tatatatacctatatatacacctatatatacctatatatatctatatata


cacctatatatacacctatatatgcctatatacacacctatatataccta


tatatacacctatatatacacctatatatacctatatatacctatataca


cacatatacatatatatacctatatatatacatatacctatatatacata


tacctatatatatacatatacctatatatatacacctatatatatatata


tatataggtttcaccacatcccctttagtaaatttccatatggtctagca


aatcctttatcacatggcatcactcttatgtggtaactcaactgaaacta


acaacttaatatttcgataggactattacaaaaaaattgtgctggaaata


aaatgaaatagacttggctaacatggaccttttataattggagctcaaca


atgaaaaaaatagataattagtaggaaaagtaaaagtttagttttagggt


tctgtaatatatagggtcgtgtgtgtgtgtatatatatatatatacacac


acacacacacacaaatatagtgtgcgtatatacacacacagatacccaat


agctggtggataataaggactgtctgctgtactcagcataatctcactat


ttattaaataccacactcaactcatcatgctattgctgactggaaacatc


cctttttcctttcatccaaattattttactatcttccagatccaatctaa


gttttgacttttctatgaagggatctcaatgatgcatctcttttccttac


acttctgttattgttgcatctgaaccactcagtactaaattgtgttctat


ctacaccaagcttgtccaaccgacggccctgtggctgcatgttgcccagg


acagctttaaatgcagctcaatacaaatttggaaagtttcttaaaacatt


atgagttttttttagcgattttttttaagctcatcagctgtcggtagtgt


tagtgtattttatgtgtggcccaaggcaattctttttccagtgtggccca


gggaagccaaaagattggacgcccctattctacaccaagagttaggaaac


tatgaaagccaagcaaatcctgtccactgactgtttttgtatggtccaga


agctaagaatgatgtttacatttttaagtggttgaaaaaaaaaagagaag


aagaatgtttcatggcatattagaaattatatgaaattcaaattttggtg


tcctctcagtgtccaataaagtttattgaaacacagtcatgtccacttac


gtatgtattgtctatagctgcttccacgctgtaacagagttcagtagctc


caatagaaaatctgtggtgcacaatggctaaaatatgcactttctggcca


tttccagttgaagtttaccagtctttgttctacactgcaattttgtcatg


tcaagtacaggattttaagaaatgaatctcacttcatagtgcaggaggta


gcagtgtcccctctccctgttgggaactggactcaagagcagttcttttc


aaagtggtcctcctcagaaattcctctttcatctctcaaacctgacactt


ttatatccttgaggtgggtgagggcttccaggaaatttgtaactaggttt


cagcacatcccctttagtaaatttccatatggtctagcacctcctttatc


acatggcatcacgcttatgtggtaactcaactgagaataacaacttaata


ttttgatagggctattacaaaaaaattgtgctggaaataaaatgaaatag


acgtggctaacatggaccttttacaattggagctcaacaatgaaaaaaat


agataattaggaaaagtaaaagtttagttgtagggttctgtaataatttt


ttaaataagataagcaggtctttacacatttgaaaagatcctgggtgaaa


ctacggagcaaaaatagcgttttactgtggtatcatttttcttttcttgg


cttcagctgtgtgtttttagtgaaataaataagtattgaatgtcttggga


gatttgtgattttctttgataactgctaagacaccagaggtttcaattgt


ttttgatcctctttgatcttcctcagctactcttttttcagcattgtagt


atttttgaaatttttacatgagaatgaacaataaaagttaacactgtcat


aattaatttgaagagtatggctgtgtttacacaataaaaccatttgcaac


catcatcttcatctttttctcagtagattaatttttcatttgcatatata


cagcataactgcagcaatagcaagcaaatatctttttgtctttccaaaag


aatacttttttaaaaattagaccatttcatagtcttgaagtttagtgcta


aagcatttcaaacagcattcgtatctgaaaacctaccactgtcttttaca


ggaataacagaatgtttaataaccaaactttagccataccaaactttgaa


tttccccagatgcctaagatggcaatatttcatgggtcctttctagtagc


tctgtttagaaaacaaaacagctaggctttgtttagatggataattcaac


aaggatttccactaataatcatgtccagttcgttcagggggagcacactc


aagtttgatgctgcatcaaccgagtataaaccaggactgttttctgcaat


tttagctttcagtctgtattagctgtgttttaaaaagaacaagaagaaaa


gaaaaaggaaaaatgtaatgttaataggcttccttttgtgatctcttagg


gagaggtctttttaaataaggggcttgaacttgaccttcttcaatggcac


aaaggtccaatgcgcagtttttaaagctatatttaaattttaataattat


actcattggaggctagaatgagactttcagaacttaatcctcaatctttt


agtgaaactgcttggcaaagaacagaagctcaggaaaacgtctgcagtag


gtacacaaatatttccctgatttcttcatcccacagaatcaaactgactg


ctttgaggtcatcagtatagtatttgagtttgcaaatgtaatttaatata


gagttataatttaaaaaatgcttatcttcaagataaatctagttttagcg


acctgatactgaaactagattttcaacattttaaagaaatcacactctca


gtgttggcaaaatggcatggaaagagccactgttgtatgctgttgttgaa


agcataattttttaaaatctttctggaaggcagtttggccatatctgttg


aatgcctccaaattttgcataaaacctttaaaccagcatttttactttag


gagattcatgctaagaaaatagtcgtagatgtttgcaaaattatagctac


aagatttttatgtaagtgtttttggtagaaacaagaactggataaagcat


aaatatttaacaaccaggctctattcagccattaaaaatgatactagaga


aatatttaatgccatggaaatggttaaaaaagctgattaaaaacagtatg


cacaatgtgattcccttttgtaagaaaaaaaatttgcatcaaaaatactg


gataaaaacaaatagaatgttaacagtagaatctgtatataaaggaattg


tgaataccatttttccaacatgaaaatagcttcttaaaatattcttaaaa


ttaaaatgcatagtaggtgtaaaaatctcaaactataggaaagtaaaaat


tgtaagcccttttccttttaattgcataatttggtaggggacacacacac


acacgcacacacacacacacacacacacatacacatgctaccttctgttt


tggttttttattcaacagtttatcttggagaattttacgtgttagtgcat


gtagatctacttcattctttaaatttctatgtggtattccataatatgat


tgtgtaataccctttcttatcaactactctgattattcttatcaactact


taccacacttatcagctactctcaccattctgttccaacagcaggattta


tagatttgacctttttcttcatttcctctctgcttttcacttctctcctt


ttccagcttatatagttattatttcatttgacgctctctttttttgcccc


atctgcaaaaccacaactctcaattccctgtgtaaccaagcagacaagca


ctgttgaagaaaatcctgccatagggcagttatgtcccaccttttaagac


actcaatactacccggcaatcctgctagtttctctggtgaggtcaccctc


tcactgtctgccactattatttcaaacggtctccattcccctcaaaagtc


agatttccctggatctcctccaactttccctctcagagactgatgatgtc


tcataatatagaaaaaattgaagccttgacttccatgtaccaatttataa


accttctcaattgtgtatcttttctctctcttccttctttttacatggaa


gaaatgtatttgctttgctgttgtttcatttaggaattttgtacatattt


ttatgtaatattattttaacagttttcctatgtagactgcttttcttagt


tttgatatcagggtttattctgcttgacttgtagaatgacttgggaaact


ttcttttgtttctttgctttttccttttactcggtgctctaagaaatttg


ttccttgaaagtttggtagcagcattttggtacaagtctttgagtagtgg


taagggtgatattagtgtatgtgtgtgtgtgcctgtgtgtatgccaatct


ttgattaaccttttctttagtttctcctataattattggtctatctcaat


tttcttctacttcctaagttcattttggtaatttttatcttcagaaaaca


tctactttgtccttcatctagattagggatttgtttcttccctgtggaac


tctttggcagttgatcaagttgatgaagttgataaagacctcttcttaga


atcatttaaaaaaatcctacttgggagtggagtcacatgactttctgtgc


cttcctacaaaaatgtgtgtgtgtgtgtgtgttttctgtttacatacatg


gttttctcacaatgctatcttaatatgctcagtaattttatggaataaaa


ttgtataaaccctgcttgaataatatattctgtgtgcttgcctcctttac


ctcataacacagtgtacaaaaagggaattgaagaggaagtcaaataaaga


aaagaaaatactacccgtgcaacttgcaacatgtacggcctgtgtcacca


cacaatctgtgaataataagaagttactttaaataatacttcagcagtta


tctcaatcatgtgatttgtttgaacttccaaggctaatactttcagctca


acagttatctatattataacataccgtaatgccttatggagagttaaaaa


ttctcatttcaactgtcctcttctgcaaagctttaataagtaatgcaaaa


aactcaccccctttcatgtattgatgaatataggtagataaaaattttag


agggaaaaataggggctgggtcaccaatatgagaacatagataagtaggc


acatttgcctggtgggtggtcagctcatttgcaggggcatctgtagacag


aaacttggagattatgtgcagtctataggctgatctaatgactgccatga


atgatagtttcacatagatgtaccaactgaaaaatgacatacagatggtc


cccgatttatgatggctcaacttaagattttttgactttacaatggtgtg


aaaacaatatacattcagtagaaactgtacttagagtacccataaaatca


ttccgtttttcactttcattacagtattcaataaactccatgatatattc


aacattttgttataaaataagccttatattagaggattttgatcaaccgt


aggctaatgtaagtgttcagagcacatttaagacagtttagactaagcta


tgatgttcaatcgcttgggtttattaagtgcattttcgactttaagatat


ttttaacttatgatgggtttatcaggatgtaaccccaacataagtcgagg


agcctctgtcaacatttctgtgacttttattgatgaagttcgcaaatatt


gctaatacctctgtgtttgttacctaaattcataattgaaggccacgcta


cacttgagttagagatgagtggaaataaagacataatttattttaacatc


taagattatggacccattgaaacctagaatccaggattctggagtcttga


tttctagatttactagtgtattggtataaaggagtatgtaatataatatt


ctccttttttatgggtatatatttgctatgttaaaaaattatttctagag


aataagaattatacaattgccttaaatgtcttacttaataatgataattt


aaattgaataaagattttaatggctttcttttttctacttttttaatctt


atgcttttttctttattgcatgtgtacgtatatgtatatatatatatata


tatatgtatatatattgcttcatttgagtactgaattgagtaattcttgc


ataaaaagtttgtggattttaagctgtcagagtctttttatgcccatgag


tgttttcctttactcttacacatgattgataattttgctggacatagaat


tcttggttcaaaataatgtaatctcagaacccaagaagacattttttcat


tgtcttctaggatccagattgcagttgatacttttgtattctcattctaa


tttcatgcttttgaagctgaaaataactcttttcataattttgtgaaggt


tttgtgattttttctttatttttgaagttctacattttcaccaggttatg


aacatttctgcttgggagttgatgttcctgtcagattaaaggtttgcatc


tgttttttccagctcagggaattttttattattattctttcattattcca


aacttattccttattaattctgtctttattgaatctctgttaaatagaaa


ttgaaacttttaaatctatactttattttttctgtttcttcacactttct


acacctttacccttctcaatgcattctgacagaatggtttgcttgatctt


tgaggttgctaatctgttcattattgtgttcatgcaaaaaaaaaaacaaa


cagatttttcttaaaaactgaaaatcagcaaaatttacaagagcgttctc


ttgagtatatattgggtcacagattcctaagtgttgctgtgaacatattt


ttaccttcatttttttttcctgtgatttggaagggacagtactggttaaa


tgttttaaccaaccatcttgaatctggaatctgctgattgctatctagta


tgtgtcaggttccatgttaatcactggataaacaatgcaggcgtagagac


ctcaatcaaatagaacatcatgaacattgcaaacactaaagcattttctt


ctgaggcacacagaaaaaggtgagagaaagtggagagagaggtgatctga


acagcaaattggggtctctcattctgtatactttcttaaggtgtttggat


tttattatgtacacagtaggtagctgaaaaggggttaaattttgtgttta


tctgttttagatagagatggcatcttgctatgttaaccgggctggtcttg


aattccaggcgtcaagcaatccttctgactcagcctcccaagtagctgag


actacaagtgcatcccaccacaccccactggtttattatttttcatttta


tttttgtagatatggggtcttgctatgtttctcaggctgatctcaaactc


ctggcctcaagtaatcctcccatcttggcctcccagagtactgggattaa


gggtgtgagctaccctgtctggccatgatgaaaagtttttaggaaatttt


tttttaaaaaaaataccatgaaggatgaaaagaaagaaggcaggactgga


aagggaaaatcttttaggaggtaattcctgtagtctagaaggatcattgt


catctaaccttttgcaatggcactggagactggaaaaggagcataaaata


cttgacccaataccttccctaaatgcgcttttattagaaactggcattac


aactcatgtaaagagctacatcattaaccaacaaccacaatggtcacctt


tttgtaatttattttgcttgttactttttccttatccacactatagtcag


gaggagtagtgtcagtaaagttttataattttagtactgttatatgttgt


gcaaattttacatcatgtattttatgcataaagggtagatcatgggctgc


ttcaaaatgaaattgtattagaattgtgtgggtacttgagtattgatagt


ggggaatcacatgtaattctaaattgtgagcccaagtagccagatgtgat


attgagaggatgaggcaattcatccctaaactccaaggcagatactaaat


tttctttccaaatgtcattgcatggttgaagattacaatgctccctggaa


gagcagaggcattcttaggtcaaggataactctcttactagtagaattta


caataaggtcacatttctctgactagaaaagtgggcgttgggaaaattta


atagaagtgtagtaataaaagttttttttaaaaagtacttgttttacgca


tagtcctatgtataacaaagcattgaggttattccagcaagacagatgtg


gagcaggtgttgaaatttaatggataagtaagtaatacagaggcagtaga


tcagtaatggagatggattagatcatcaaggaaatgggtataaaacaatg


tattatgaaaatgcctttcgattgttttctttcaaaagttttcttatgtc


agatttaaggcagagtttcaggttcaaaattttgggtataaagtgaagcc


gtgtcttcagttttttcagaattctgtgttttgtgaagctactcttgaaa


gagtcatttccccatagaggcatatctttaagtctatacattggcaaata


aaatagttttatagtctaatgtaccgaagtttcaggcatatttattatat


taacaaacgtatatatgaaaatattttgattatgtaaaatttaaaaattg


cttgaaatttgtaccatgaattgatgtcaaataatacccaaatacaattt


ctatatattgataaagtggcttttactaaaacccatatatttttgatttt


atgaaacagtataaaaattcactgaaagtcacctaatttactgtgtgtcc


ctcaagcaccagccgctactctagatacttggatacaccactgaacaaat


cgagtggaacatccctttcttgaggaacttattttcttgtgtggggagaa


agactgagcaataaacatattataaaagttaaatacttagtaaaaagtat


aatgaaaaaagtaaagcagaaaaagaaaaagaaaccaaggtaaaggggat


cagaaatgtgggtgagatattttaattttaaattttaaatagggtcttca


aggtgggactcgttgagcaagaaacatttctgtgaagacatgaaggaggt


gaaacagtagaaaagcattatgaagtgctgccttaagcactcttaaaaaa


agaaatttgttttcatagggaaatttgcatttatcttttacttaaattgc


tgaattattttgattagtcaaaattccaaattcacattaccccagcattg


ttccctaaaatatactgaagttattttacatcaatgtctacttcttttgg


aataatttatatcaatatatttgttcatgcctatctcagaatccaaaatg


taggtgcccctcttttaaaaagaaattatatatatatatatatatatata


tatatatatatatatatatgtatgtatgtatgtatataaacaaaccaggt


atcatacttctaaaaagataaatgcatcttcactcacatctcaacctgca


ctgtatttgaaatcataaagctttttgatttctaagatgatacaaagtga


tatagggatttttgagtgaatgtagggcaaaactatttgaggtatttatt


ctttttgagttcaataaatagttttccatttgaaatattcatgttttgcc


cttccatatgtgttcattttcctacctgaacaataattgtttgctgagga


tataaatgcctgtgcaatgttatagggagacagtaggttcaatctgattg


cactgaataaagtgcagtgatttgcactcactctttcataagatattaat


tttggagattattggagtaaaccctatacattatatagatttgtcacttt


tatctatttaaaaatgtctgtttttattaccttctgacgattgtctcact


gtgctaaatattaaactacctcttttttatttagtcattcttttaaggaa


ctatgagtttcatgtgccaactttttaatttaattgattaaaagcaaaac


taattttcttaatcacagtttccttggcattgcacagctcaacttgcagt


tagagaaaagaatacaaaacaaataagcaatctgtacttaaaaaaagttt


tggacagtaaaaacaaatgtaaaattgtactcaagtaagcaatgctaatt


aagtggtagagttgtattttaaatgattaaatcaatgaaagaaaatatct


tatgccttttctttgaaaattaagcacttttttgtgttcttatttttggt


taggataatttggtgaggataactcaaagaatagttctggattattccta


ttataagaccaaagaaaatagcaagatcagtatttgcagagcttggtggt


atatgtggagcaaacaaaaagggtcagaaaactttattgcttttgagtgt


gtgtgtatgtgtgtgtatttttcagtgtgatgaggaaaacacctactaat


tatgctaactttgcagcagttaaacataacatatgatgctactttcttag


ttatgcgataaggtatgtgtttgctgctgtggttgtgtgatataatgttt


gcttttcttgatgttgttacatatcttacaactaatgacctgttttttaa


aatgtacacatatgacactaatgctgaaacactaatgttatgctttatta


ggatttaatgtatggcaaataatgttcaatttttagcctccacttattat


gcagatttaaaaaaaactcgtactatttgctcagccccagaaatgcacag


aaacaaataggggaaggcttttgacctcagtgaattcccaggttattggc


aaagtcaaaggggctacagaagtcagtatcatagtgcactgagggaagct


atagccactatgaggccatgtaaaaagaaatgtctaattcagggggtgaa


gcagttgggagaggggcatttcagacaaaggcccctggaggagatggatt


ctgagctgagtgtctagggaaagcaggatttagttaggtgaagaaggtat


agggaagacctttcagtcagggaaaacacagtctacaaagcagggatgca


agagagaacatttgaaacacttgtcgttgttaaatgcaatgacatttcag


atatgccattttcactcagaggaaaataagcaaaattgagatgccaagtt


tggaaaattgatatcccatttcacatctgaattattgagaaatccttaaa


cacaaaatattcgttcctctgctgtttttgaaaactgaatgtttgaacat


tacctcttctgctgaaagtaaacactttatagtgacccaaagttttcctt


gattttcctaaatatactctattagaaagaaaatatagattttcattgca


tcaggaaattcatatgatcgatttgttgcattttttggaatgagcaatac


acaatgaaagaagattttgcttttgtctggagccaccttcctgggcctgc


tagcaaacttgagaccctcacgctttcacacctgaggcacagctgcaacc


acatcaaggtcatcagtacaggatgttgatgcacacactcacacacgcac


aaataataaacttacatgcattttccttggggcaaaagcaaatggtatgg


ttgagttttcttttcttcgtcttcagcaaagtggcatggtggacattatg


ttttattttaaaacttttaaatgacatttatgtattttcatacataagtg


gaattcaattctgttgggcccggataactgggtgtgaccttggactgtct


aggtcagaataacctagaacattttgagaaggtacaaattccctggttct


gctcaagatctattgaatcagaatctctggctataagtacatatgtttta


tttttagcaagtccccaggtgatttcttgtcactctagttagagaaccac


tggtatacattttgcaactgctttggccctgggttaagtatttagggccc


caaataagtgaaaaccaatagcatatgtttgacacgatccatcaaactta


aaaatatttatatatacatttcagaattatagtcttgtagccttttttat


aatttcaaaactccttctagccactatggaaaacaccacaaacataactt


ttttgtgcatctaatactttcagaagtccaaagctcaattgtatgaagca


tggactgcctagcatttatttcttcctgacacatttgattttgtcttaca


gaaaattttttctttactaatttatacctctcaaggttacatttgcctac


agactcttctttcagtattttcgtatttatgatgggcactgaaacctatt


taaattatttctgacatacattttccttcaaatgccataaacattttctt


gtctcactaaaatgtctagtgttagttttctgtatttctggttttaaaat


atgtccacaattactattcctcaatacaatttttgcataggagatcttta


tcttttcagatagtatcataatcccctatgctattctgtaaaaatacctt


ctgagtcctcttggatgtataactggagcaagtgaagggcaggaatgatt


ttaatgttttttcttaccttctgtttctccatgccaggtgaagaatgagt


gagctgtgtgtcccttataatgtaaggacattttccagcatcactcatgc


tagaaaagcaaactaatacgggatcctcaaaataaaactgtgaacacata


gaatgccagagcaaaatctcatccttagcatctcatttctgaacagttgt


gttactctgtcagagttcaattggggaacaaaactattatgatatacaga


atccataatagaaagaccttatacaaatgtgagggtaaagttgaagaaat


gtccaaaaggaggaggtaaagattcaaataaaaggaccattcagtggtcc


ttttgacacactggtatgagagaaccagtagcttacaggaaaatctggga


agcaaagcatgtccagatgctgaagttgtactgtgaaggggtgagagtag


agaagtatatggaaagctgttgcctctgtgtcaggtggtgtgtctggggt


agcatttgatcagcagtgccacagatgagaagcagaactggacatggaag


agaagtgaagtaaggacaagctggaatctataggcatctctgcatctatc


tttcactgcatctagccatgacaaacttcagagtataatgactacagctt


tatctccaactttttttttttttttgagatggagttttcctggctggagt


gcagtggcgtgatcttggctcactgcaacctctgcctcgcgggttcaagt


gactctcctgcctcagcttcctgagtagctgggattacaggcatgtgcca


ccatacctggctaactttttttttttttaagtagagactgggtttcacca


tgttggccaggctggtctcgaactcctgacctcaggtgatccacccttct


cggcctcccaaagtgctgggattacaggtatgagccaccgcacctggcct


acctccaactcaaaaacctcatgtgaatttcgattttgatcaactgtaat


ctggaactgtaagtgaaaggaaactttgggaaaattcttccagcatagcc


aatttgataaattatcaataggaaacctttagagtctttatgagtttaat


gcatatacttctatatttttctagagcagcatgttttatttctttttcag


ttatagccaagattttgttttaaactgctttaaaacaggcagaaaactat


agcctccccctccttcttttatacacttcctacattattgatatcctatt


tgataaatgtattttttccttattgacaaatatgctattgagaagttgta


tgaaatctctgaagacattaggacctgtgtcttcatgacttgagccataa


gtcattcagctctctggatctgatccaccacattacatagcacatataag


tcttttgatagccgcatactttgagcccaagctaaggaatcatttgtcct


tgctgtatagcaaaccaccctaaaacttagaggcttaaaacaacaacaac


tccttatttatcctgattctgtgggttggctgagtgcttcctctgctggt


ttcatccaggttcactcatgagggtgagttggctggaggataggctaatc


tgcaaagtccaagatggcctcattcatatgtctggcagttggtgcttgct


gttcacaggggcatcttgttttgcttttgtgtgctctctcatcttctagg


aggctagactggcttccttttatactggtctcaggacagcaccccaatac


aacaaactggaagcttggagaccttttaaggcctagcttcaggtgtcaca


caatgttacttctgcacattctttttgtcaaagttcttcacaaggcaagg


ccaaatttgaggagagggggaaatacatttcacatcttatgagaggagat


gccaaatgctgtggccatgtctttcaatgtacagttgcatcttaaacgag


aaattatcttttagttaggtttctttagtaagaaattccaccttctgcct


ccctcatcttccctcccaaaatacacagacactactagtaattttgtata


tgagtggttctcagttgagtgcagttttgtccctgtcttcctcaagggac


atttggcaatgactggaggcatttttgattctcagcgagacttgggagtt


actactggcgtctagtggctagagaccaggaatagtgctacacatcctac


aatttatagaacagctccctgttacaatcagttatctggtccaaattgtg


ctgaggttgagtaaccctgccttatatggaatataatgctagcaagttgc


ctgagtaaaactttggtcaagagtaatacctaaaggtattctttttacac


agtctgaaatgatttaaaattttgaaggaatcataaatttctttttgggt


ggaaaggggctagttatggaagctgcctttctgtcattcattatttatga


gcaaaccattttagatagaatatgaccactcaaacccttctttaagattc


aaactgaacacagtacttagactttcgagactctgcgtgtttaattttca


cctctcattctggcttttacctttgtgtgtgcacacatgcacattgtttt


agctacttttgtcagaaatcttttaaggacttttccacattgttttaaag


gttgaaaagtatgtattaattggtcaaagccattgtctctcattaaatca


ggaataacaatttacatagaaccctgtgatttttgtacttgtatctttag


atatgggctgataaaagttaaacattctctaccatattgttgttatgaga


ggctgaaattttcaagatttcccctccagaataaaacatggtgcaattta


aaacaaaatgggacaaattggtgataacgcaaaacaagaattgtgtacaa


taaattagatttccttggggcagagaaaaataaaaatgctgaaaggggat


actgggtacagttttttcattgtttttaaaataagaattttaaaaataga


aatgggcagaaactagttatgcattgtagcccttttccatattccaatta


caattcttgggttactcctggcttgcatggtcaccaagtttggttattca


tgtttctcaccctaacacaccccatctgctccctcctgaaggtgtgtgat


cacaggtatcataccttatcaagcaatatgtagtgacatcccctctcaac


tgacagaggctgtttgttgttgtgcccgttccaaagcagtgttgactgat


aatggtagagtggtacataatttgaattcctcagtagaagtttattacat


tagctcaggcctgaggattcttgcacaattttgagagaaagagagcacaa


gaaagaaagacagagaaagataattcttgaaaattcaagaattcattgag


atcaataaaacagttattctaatctgtggatgagaccaaacccatagctg


taattaagatcactttggttgtctcaacaacagattaattcctcttttca


gttaaaaggttgatgtcatgaaagcaatattttactggagagaaattgct


aaaatctgtttgcttaacaaagcaattattgataacctgcaattgcttat


tgcttcccctgtcatcgacttgttggcaaaaacagtctcacgtctcgata


gtgatgcctacttttacttcagggaagactgataaataatatagttttca


cagaaacatgtaagacaacaataatccttaaactatgaggcatcatatcc


acaaaaattgaggaagcgttgctgaagtactaaacatattgtttgagttc


tctgcttcaggagtattgacctttgactttcaattgatatctctcttcag


aataatagtgatgaataatggaaataaacgtctttaacttgtggtgtcag


caacaagatcttttccttagggaacttaacatatgggtatcttccaggga


gcggataaatgaactgattttgaagtgctctcctaatgcaaacattttct


ttgtaactgcctcagaaaaatgcaggcatccaaggtaggccttcatcagc


cttgaagtagttgttttgtttgcttttttattttttgttttttatcaaga


ctcttccttctctcactcagaaataaatagcaacagcttcttgtgacatt


ggcctatatcttgagttcttaaaagtgtaaaacactggttttgaagtatt


actgcacccaattcccaaggtataactacactcacttaaatggctcacaa


ggatagctgtaccattcttaccactttagggtatcacttcctttagatgg


tagcaataagaatagtgcaagattaagactcttttctagtagccatggta


agagtaccacctgacatttgtagagattgttatacattttcaaagtatac


ttaaatgcattagagaaagaggcagtgtagtgtaaaggtcaaaaacttag


actttggagttaaggcaacacttttttagcaatgtaaaggcttagtttct


ttatctgtaaaaatatagataatagtacctaagttgtagcattattataa


agattagatgtatgcgaatagcttagtcttataactgtgccctagttagc


acagcataaacggtagctatcattgttactaccctaataatgcaggtatc


cttttcacttgttttatatatttagatcttaagatagttattataaattt


attgctacagtacagaatttttttttaaaaatctacatcggaactgagat


ccctggaaaatttgcaccacattccagcaaagaggcacaagagaacaaga


taggcagttgaattctccgtgaccaaataaaccacattctcatacaaagc


gcctccaagatcacagattccagctgacatcttctaaatagtatcttcaa


atactgtaaacatggctaaaaacatcttctaggttatcaattttaaacca


gagactaaaatttggatagtgctgacggagcaccaaatagtgctgaaaca


gattgcatcagcctcctatttttgcactccatgccattaggaacaccttc


ttttacgatctttttctgaggtgctaattcactaaacaaacagtacatgt


taccccaggcaccaaatccttcaggatgctcccaagttgcccattttcct


tggaatggtttgggggccacagaaagctcaagaactgtttcagagcctgc


tgcctggggtgccaaagaattatgcacgaataaaaccctgcggggaaaat


gtaagatgctaacttagaatggatagaaagaaaaaagaatggatagaaag


aaaaaagctttaaaccatcagcaaaattcaaattagaaagggataggtag


tcacaaaaatgatctgaattgtgatgtaaaacaaaatacaaaaataacat


gctgaagtctgatgttctggtgtagcaaacccattcactaattctgaaat


ggaatactattcttatatgatgcacattatagtcccaaatgccaaaatta


gaacttagacatatataattttggggagacctttaaatatgagaccctct


ttgagcatttaaaaaatattaaacatctgaaaaatgatgtacatacatct


tttttaggatgtatatatattctatatcagattttttattggaaaatcta


agccatcattaccctgagagagggcagtcaaggagagcagaagagcccca


agcatgtattctcagagtccgtttactataaattggaaaagacaaagaaa


atgataactgtaatcaatttgcctttaccttgatgtcattctagaatttt


gaaccatttaattttgttttctcccaatcctggccacagcacttttttct


ccttttagctattcattgagaaaacatgagaaataagggaggcttcaaga


atggtttgaggtagtatgtccatcccccaggttgaagagtactctgccat


ttggttttggtggcttctccttgccttatgtttatctaggtctctggctt


cagatctggcatacacacaacatctgtgtgtatgttgtatgtgtatttta


gatggttgtcttcaattagagaataatacttctgaggtggcatctcaggc


atagtcattttttaaaggagttcttactttctgggtatgttttgcttaga


gataaatatgtttatcctccgttttaacaccaaagagatggctgtaaaaa


cagttggatccaaatgttaaaagtaaaatgggaaattctctttgagcaga


gtgtgcaaggaaagcaatatttttgtacagtttgctattgttttatgaat


ctttagtataataacaccctgatgttttactcagctcatttatgtgtctg


catattcactccgaaatccaaaatgtcatttcaaacattcccagacatgt


taattcctttaagttggtgtcaactatttcatatttcccaactttatcaa


ggtataatggaatgaattacaatttctagttcttagaaacagctagaaaa


tttgttcagcttggtgtttaaaaaaagcaaatggccagtccttagtttct


gcttcctcaacagtttatgactttgcaggagttatttcaatgacttctct


tggtaactgccaagccatctcctgcatttgatttagtttctaggagacat


tttctagaaaaagaagagaatgcttgcaacaaagatggcaaatagaatta


ctgaatgtaaccctttccctgtaggaatcttgatgggagtaagcaaatgt


ttgattggtttcaccaacctgggtcaagttgctatgtgggtgactctcac


ctggagatctggtgtgagagactgaatactgaagtcttgagagcatggtg


ggaatgaagggctgagacaagggcaccaaactcagaagaacttaaagcaa


tgtcaggagctgagatgtggaacctggaagctaaggagagcatgacagtt


tgggctgaagtccaggtggtcagacagaaattcagtgtgcatgtagttat


gggcattctaaggttaggttggattcatttataggacttttgggtgagcc


taagacacaagattcacattggctaaacccacttactatatcttaggata


acaaattttatcctttacataaagatagtatctttaatttgtttcaaaac


tatgtcttgagcttcataatgcattgaagaaggggatgagattaagttta


gaatatgctctatttcctgttgcaggatttaaattatgaatgaatgagtg


tgtgaattagatctatttataagttaactgtctccatctcctttgtttat


aaaatagctttatgtgttgatttcatgacagaaaaactcagccttgaagg


ttttcccaacaaaactataactttcacttaagtttttcattttctgactc


ccatacacccccaacccaccctcacaaacacacacacacacacacacaag


cacacatattttacagttaagtataagatatccctctggaatttcatttt


ttagggttctctaaattttttcaagcttctctctttttttaaacttctgt


ttttggttctattcccagaacccatatacacctcacccagagtttacctc


gaatccactaatttactgccctagtcatgaaatgtaacttattcaatagg


tgaggttttttttcctcctgagaatcccaaaaagaaaaatcatgaaaatt


gcatttattggtcagaatttccacgtttcaaagttctgatattttagtat


ctgatcaggagagaaaaaagtctttcacatggaaattaaatgatttatac


aaattagtttgaaatttgagatagaaataagatcattgaaatatattcag


agatttagcctgaaaataactgagtcttgaggtcatcatctgaaacaatg


tgccttattattttgaaataattataaagatggttcgagtaagcctatag


ttctagaaaattccttatagagttactataaatactgaacagtagtcttt


gaatcatggggatgaatgggtttcatataagtcattcacaaatggctcag


tggacaaattttatctcagaaagagaatgccccataggagttaaaattat


catggctgagaattcctagatgatgtcaccagaatctgaaaacactactg


tggatgcaataatgatgctatctgcattttctgggtctaacttctgacaa


ggaatctgagttttctggcaagcagaatgcacatcagagtacacaagggg


catgtcttggcaacttccacctgagcacacatgcacggcttcctgttttg


gtctgatagcttcccctgggcttttgttaatgcaaaattacccacacaga


gatgggttgcccatgcaaaagagtaggtgtcttattttgggaaagtttgc


ttcacattatctttgttgttttattctactaaccctatttcctcttttat


tttacactagcgttttaaattcacttcatttgtctgtgagaaatacttcc


ccccgccccctcacttcttaggcttatttctcttgttatctgttttcttt


ttcttctctaggcactgccttattactgcttcatttgtttgcaatattaa


tgtctgctaaaatgtcttaagatatacaatgttaattcatttgctcaatg


atttaaaaatatttgtgagtctctcactcctccacatgtacatatacctt


ggaggtaggacaaaggtgccttaaatattcattgtactttcaggtttata


cagtttgggttgggagttatactttcacaaactgtggtacaaaaataaaa


agtgatggcttctataaaaggtaatgatgaaatgctatgaggcattttct


catgtgctttttggtcattttgtgtgtgcgtatatatatattcctttgtg


aagtgcctgttcaaatctcttgcccataattttccactggattaaaaaaa


ttgcattatagaaattgttacatagtctaggtacaagtcctttgcatgta


tatattatatgtacatacatgtggcttttaaacctattttcttacgattt


taatttctgtgaagtctactatatctttatgcttttcctttaatagtgag


tgctttctgtgtcttgtaaaaggaattttttgttattcttaagtcactaa


gatatatgcctgtgttttctttgatgtctaggtctacataataaatcatt


agagaaatgtgaattaaaaccaatgagttattactacagagtcactagaa


tggctacaactaaaagactgacaatactaagtgttgttgaggatgtgaaa


caagtggaatcaccatacactgataatgagagtataaaatgacaaaacga


cctaggaattagtctggcaatttctcatgtaaacatacacctgtcctacg


actcaccatttatgcttttaggttttactaaaaagaagtgaagacatgtg


tcaacaaaaagatttgtacaagaatgtttaaagcatccttattcataata


gccagtcactccaatgtccatcaataaaacagtagataaacaaactttgg


tttattcatataatagaatattacttggcaataaatatatacaaattact


gatatacataaataatataaattattctcaaaatgctgagctaaagaagt


tttccacaaaagagtatatgctatatgatttcatttatatgaagggccag


aataggtaaaactcacctgtgatgaaataaagcagtggttctgtggtctg


aatgtttgtgtctctctcaaattcatatgttgaatctacttaccagtgtg


atgacattaggaggtggggactttgggaggtgattaggtcagcaggacag


agccctcataattgggatttactgcccttaatcccaattaaaagtttcca


gaaatctctcttgctccttccaccaagtgaaaaaaacagcaagaaggcac


tatctataaatcagaaaacaggccctcaccagacattgaatgtgctagtg


ccttgatcttggactttccaccctccaaagctgtgagaaataaatgttgt


ttataagtcattcaatttgtaatattttgttagggcagtgttatgctgac


tgtaacaagcggggattgactggaacaaacaaggaggaaatgtgcttgag


caatggaaatgttatatatatagataggagtgtgagttatacaacagttt


atcttcatcaacactgggggaatgcacatttaagatttgagcatttcatt


gtatgcaaataatacctccttaaaacaatgaaaaagttctagagggcaga


ttttgcaattaattgattagaaatatggcatattggtttacatattgggt


tatactcagagtctcaaaccatagttctgaaacttattagctggcaaagt


ctggtaagtaacataacctccctgaacttgttttccctttgcacagctag


aataataccttctttaagagatgttataggaatgtatgtaaaaggcttat


cacaagggctgacataagaaattactttattgactaattctcctcctttt


tgtccttcttcttgtttattacttagtaaagtgtcaaatatggctgtcag


gttttggttgctgctgtttgtgaagatgataaattgttaacagaaataca


gagcacactagtaagatcatacctgggagagaataatgagtcatctaatg


aatttgagacaaatagggatgtcctgaatgtagtgtcaataatgggtctt


gaatttgggagagaaatcagaaaaggagctttggatctgaggtgtgctga


acagatacaccagatctctggggatgggaagggtctggaaaggtgtgact


tgtagtatttgctggtttccatgctgtaaatactgtgtcatgaccaattt


aaagttaccaaaccagattgcaaaattcctgaatgtttatcagtcagctc


ttttcagctggtgactgccacctccagcaaactgctagatctgagtcatc


tgctgagacagagaatgagaaagatacaattattttctcaccttacacca


ggtcttcttgcttgcaggatacattgagtttaattaaattattttgctat


gaatccccatttctgaatgtctctctacagctcacgctattttcagttat


gtatgaaaaacactttatcttcttgtgcccactaccttcaaagagggcct


cctttttataatacagaggccacccaggggccactaggcctcagtctggg


agagattcaagcacagcctcacactgtgaaaaagagcaactcttcccttc


ttttcctcagtgctaggcttctgcctcacttaagaaaatatttcattctc


tccttgctgggaaagacagggacactttcaaatctagttcattatctcct


acctcccattcctccccctgtccagttcataagtctgtctgcatgaaccg


caagtgtagacagagttgttttcatgagctggaggaatactgacaaacca


tggaatggggtttaaaactttatgtaatgtcatagagctgtctatagcaa


taagagtacttttgttaagagtacaagaagactcttcttaacttacaggc


aaactaaacactctgttaaaattacatttccagaagttctttgtcctaaa


aataacaggaatgtaaagaattttttgcccagggatgtttcttttcaata


ttactttttatagcaaaataaaatgaaaaacaaacaatcaacagaactat


acgcccatgtttccatggtaaaatgtgacatactcattaaaaataacatt


atgtagtccttaagatgattatgattgttattattatttttaagacagaa


tctcactccgttgcccaggctggagtgcagtggcatgatcttggctctct


gcagcctctgcctcccatgttcaagtaattctcctgcctcagcctcctga


gtggctgggactacaggcacgtgccaccacgcccggctaattttttctat


ttttagtagagacggggtttcaccgtgttagccaggttggcctcgatctc


ctgacctcgtgatccgcctgcctcggcctccgaaagtgctgaaattacag


gcatgagccactgcgcctggccttccttaagattatcaatgaaaaaagtt


ttattctcatgaaacatgtttatactatactttagtatggcatttaccat


tttaactgttaaaaatatgaaaagaggaatgattggaaatgagtttttca


gactgttaagtcactattattaccgtgaatttttcttttggtagtgggat


tttgggtggtttcttcctccttctctgtgcacttaagtgcttttccaaat


tttctatacaacaagcaaaaaatatctttcaataaattcccgatgccccc


agagaccacatttatattccaccatcttttgtttatctattccggttttt


ttttaaattcaaaacccatttgaagtctatagtcaaattatttttataac


acatataggacaaatgtctagtagcataaaaatgggctcctttagctatg


atgtacagtggtgtcacctctctggttattttctcttgtgcttcaataaa


gaaggctagtttaaaaaataagaacctaagctttcctcactgcaaagtac


ctttaacttgttggcaattcaatcttcaaacccctttaaatatgtatgta


tatctatatctgagttatacgttttcaacattttattatgaaaaatttca


aacacacagaaaagctaaaaaatggtacagtttatactactaatagacct


atcagctagattctataattaacattttagtatctttgctttctcacata


tctatttacccttccatcaatccatcttattttttgatacatttcaaagt


aagtagcagacacgagtatgctttgctcctaaatactcttccacgcatgt


ctttaactgaaagagttcgatgtttttcgcttttttttcttgctttgggg


gtaaaatttatgtgcaatggaatgttgaaatctttagttaccagtcagtg


agttttgacaaatgtgtgtgtctgtgtcacctaaactcttgtcaaaatat


agaacatcaccataatacgggataatttcttcatgtttcttctaagtccc


tgaactgcccatcccagagccaaccattgtcctgatttgattctacaatg


taatgtttttccaaggctagaactgcatatcaatggaatcatacagcatg


cagtcttttgtgctattttcattcagcataatgtctctgctgttcatccg


tgttgttttgttttttagtcattttctttttgttgccgagttatctccca


ctgtataagagattgtttaggtatccattttcctgttgatggatacctag


attattccaccttttagtactatgaacaaccttgtacgactctttttgta


ggcatctgttttcattgctcttgggaaaaaacctaggagtagaattgctg


gatcagaaggtagatgtgtatctgggtgtttttttaaaagaaacttctca


aactttttccagagaggcttatcatcttacactccaatccacaattcata


ggagtcctatttttctctatattctcggcaatacttggcattgtcaatcc


tttttttttttaatttagctttattatgtgtgtgtagtacctgaattgta


tcttttactgtatatatttaaagtatatgaattgtatttttaaaatgttg


aaatgtttggtcaatttctttgctgtctaaaggaggaaaaggcaagagag


acaagttttgctctaacaaacctagacttcctgagtccacagcacattga


aattggcacaatcctagtattttgtaagaatccatttttctcccatgaag


aactttgttttcccaagatttaaatggtaattactataggctcacttcac


cattatcataatcatgattgctggtccctcctcctaagtccttctctttt


gtttcttcacttctcaattcattaagactacttgtaaggattatggagtt


gcttcatacattcctggcatctttgtcaggtaatttcctctgtctcatgt


gttcttaggctgtaggcatttgtgtccatgtctgattcttttcttaatct


ataaaaattttgaagaaaggaaacatctaatttaccttagcacagccact


tgcacctagtaggtgctgactaaacatttattgaagagatgatggagtag


tgaagaacacaaaccctagggctagactcttccacctaccagctgtgtga


tatcaagcatgttacttaccctctctaagcctcagtttccatctgaataa


tgaagataataagagtacctaccttactgaagttttgtggggattaatga


gctaatttttaaaaagcctttagaagaatgcctaacatactatgaatata


tacaaataaaatgaaaccaaatcatctttggatcatatatcaattttata


gaaaactctggctttctaaatcaatgtctaccctatttggaaatttttct


taaaaacaacaacaagcaaaaagcaataattcaatctaaagctatttctt


tcaagtattctagttctgttattccagaatttggtgaatgaatctattta


tccattatgattttgtagcctttgatcatatgcctctcattctccagatt


ttcagaggaagcatgtctgtaataccttttctttttttcttatttagggt


cttttattatgtcttccttgaagtgctctgatcagatctgcaatactgga


gatgtaaattcaactctactttttttttcttttaaaaagaagagcaacat


aatccttagtgatcctaaatattattcaacttatcttaacattcaatttc


tttatatagcaaaggaatgcatttttagcaatgaaataaacagcaaacat


tgacattgcttattaacagtatttacggtcgaagtgccccacatattctg


aaaggttatgttgtacattcttttaagaagacttcagggttttctactga


aagaatttgaaaatcaagacttggaagggtcctgagagcaacattctata


gaggaaaaggctaaaaagtaccccaggatcttcagtagcagtgatagtat


ggctttgttttccttcactggactaggctccataactaagcgtcttatat


ttggaagtcaaaatgcatttgtttgtcttcttacaacatagagaacacat


gcttacaaaatcaaaggagagcattgactattgtgacctgtttaacatat


ttctccgctttgcttcggatgagagatttcaaatctcatcattaatgtgg


tgaaaaatattgaattggaaaacactaacattgtcttcccatattgagtc


attgtctgtttgtaacttccaaacaagaaaaagatattaatggtcacctt


ggctgtttccttgtgacctccaacatggctcgcatttaaaattgaatctg


gactggggtgtctgggggtagggagtggaaatctacagggtgcagaataa


aggctagatgtgtccacttggtattaaggaaatgcttctgcaagcacaaa


aagaaacaatatttggtaaaaatgaatgttgttttcccctgctccggcat


ctgcttgcgtttacaatggccgcagtgtttttctttacctggtaggtggc


ctctccatgtatcgatgaggcactgcgtgaaaatgtgcctccttatgtcg


gagggaaatttgctttcacttcttttccctgctctcctgtgcctcccttt


gttctaatcatacatgtgaggagaaaagccagtgatatcagcaatcattt


tcttctccagccactttttggagttaataattcaaacagatcttttacaa


actcattttattcttagaaatggccttattgctaacagtctgctctccag


ttgtactttatcaagcagcagatctgttgtaaaatgtgatgactggattt


catctgttttgccaccccctaagaattcattgcccacagtgcttcaactg


ctgttataagagacagtttaatttttgtggtacattgattgataactggg


ctccctggtaacatggtcttccttgaataggactgctgttttctaggtct


cagatcccctgaagatgctacataactccctgtatgaaacaaatcttcca


caccctaatagtaatataccaaatccccaaatgaacagaaaatacacaca


aatgcctataggtatcttcatgcctgtaggtgatataatatgggaaggga


agttcccctcggccttaggtgtgatgaaatcagcgaagcagaaaaaacag


agaaacaaccaagcaaccaaccaaacagaactggaagacttgtcacaatt


ctgaaaagttgtctaaactggtctatactgagtaataaagttgcaggtac


ttctttactccttgactaattttctgtagctctcaggtgaaccatgtagg


atacctgctatgtatcttcaaattaaaaaataatagtaataataacaaca


aaagagagggcaaaaatagttggcaaaaaggagaattgtaaaagatcata


aaaacaaggggtagaaggattgggtaaagagatttgaatctccaagttat


gggagcaatcttatcatcagtatatgaaaaaacaatttttaaacaaaact


ctcattattcttttcactatatcgaggccaaaaacttcatgtttacacac


acttcaaagaagaggacaatgatgtaggtcctaataaaaccattactact


ggttagtaaatttgctaagcctactgcatggaagaaagttgagggcttgg


ttaaaaccttagcagataatttagcacaaaataagttcttaggagcctaa


catgcctgatgattttggtgaaatctgtttcttcccaaccatgtacatcc


aatgactgatgcatttggtggcaatgttaagttggatggtcacaagttct


aatctgatatcccatgttgcttcattaatgattggtccggtcccctggtg


accactcctgtatccatttccagagcaactcttactctgcctactttctt


tttgcttgagttaaatgcgatggttagataaaactatctctgtattgaat


gttattcattcaacaaatatttattgagatactataatgcttcaggcact


aggagctggaattcagtaatgaacaagacagaaaaaaatttctgccttcc


tggaccttggactctagtgagtgagaggaataataaacaaaatacgtaaa


caaagtaaatgaaattagatgaaatgaattagtggctcacgcttgtaacc


ccagcactttgggaggctgaggtgggtgaatcgcttgagtgcaggagttc


cagactagcctgggtaacatagtgaaacactatctctatgaaaaataaaa


aaattagccgggcacggtggcgcatgcctgtagacccagcaacttgggcg


actgaggtgggagaattgcttgctgctgggaggcagaagttgcagtgagc


taagatcctgccactgcacttcagcctgggtgacagagccagatcccgtc


tcaaaaaaaaaaaaaaaaaaaagataagggctaatagtccagaaaggagg


gtaggaaggaaagttttagacagagtagccaagggtggtctcattgataa


ggtgattttgatgaaagacctaaagagagttggggaagtggccatgcaga


tatctggtggattagcaatccagacagaatagcaagtgcaaaggctgtgg


ggctgggacatgcccgacatttttgtggggaaagctaaagaggctgggat


tactgaagtagagtaaatgagggagagagttataaatgatgtcacggagg


taatgaagaccaggccacgtaggatcttacaagccatgaaaggaacattg


gcttttgctctgagtcccataggaagccattggagctttttgtgcacaaa


acactggagggttttgtgcagaataatgaatgttatgttgactgagaata


gatgaaaagagtgcaagggtgaatggtgggagatcagttagaaggctaca


gaagttattcagagaggataggatcagggtgacggtagcaggggtggtag


gtagtcatgttctggatacatctttgtggtagaaacgatagcatttgatg


atgcctgttagagtaagagtagagttatggctacaaccaaaatatgaatc


taagtaactggtagaattgagattcaatgaattgaggtggggaagactgg


aggaggtttttttgtggcatgtgtatgtaaagacactgattttggacatg


ctaagagtaagatgtctattagatattcaagtggaggtagcaagtaagca


cttgaatatataagtctgtagttgaggaaagaggcaaagttggagataga


aatttgagtatcatcagcatatagatggtatttaaagccatgagattaga


tgagatcccttagaacacctctgagagaaggcatctatgaactgaacctt


ggagttcagggagatgaggagaaactagccaatgaagctgagaaggagtg


gccaaaaagagtactggctgggcatggtggctcacatctgtaatcctagc


actttgggaggccgaggcaggcggatcacgaggtcaggagtttgagacca


gcctggccaatatggtgacaccttgtctctattaaaaatacaaaaattag


ccgggagtggtggcgtgtgcctgtagtgccagctactctggaggcgaagg


cagaagaatcgcttgaacccaggaggcagaggttgcaagtgagccgagat


tgcaccactgcactccagcctgggcgacaatgcgagactctatctcaaaa


ataaataaataaataaaaagagtaccatgtcctagaagaaaagtgaaaaa


catgtttcaaggcagatacagtgattgattgctgaaaggtcacatgtcgc


tgaaaggacaaataagatgggaatcattgaatggaatttagtgaccacat


ttaacaatatagagttgatcagtgaccttcagaataacttcagaatatct


gttgagcacaagtctgaagtaacccaaaacagcttatcaacctctatgga


ttaaataaaacaatgtgctcctgtgtaagtaacttgtttgaaagcaagct


ggagtgagggagctagatattgagagttctaagttccttaccatcacctt


cttgcacactaaagtattgaaagaatgtaatattcaaaacaggcaagcaa


acaaacaaatagaacatcatctgaatagattatatgtaacagtcttcaat


ggaaaccatgacaacagggaaaagaggccttgtaaaaaaattgtgtgtgt


gcttctgctctgctttctcaatgtatttatcatgtcatggccttggcaag


aacaagaggtagagataaataaataactgatttctcatcttttggaaaaa


aacaaaactggcctaatgtaaatggataaaagtcctctctgtagttgtca


atttaaacgtaaagccccaatttaaaaaaaaagtgaatggtgttgcagtt


gtgtgggtggatttggtagggagtttaattttgctacagagtggagttta


atctttgctaaagaaaagtatatccctttgatttttttagaatgccccaa


ggtccgtctgatttaaactagtattggaaaaagggaagagagatggaagg


gagtgatcaaagttggaaaacgatagaacttccccccatgaaacacaaga


aaggtgaatacaggtgaaaaataaagatagtgctgctccacatgtttgga


attgcatttttggtagtctcaactaaccgaaatgcagccacttagagaag


gctgtcacgaagcttatgtagtataggataggtgtgggactgcccctcag


ggcagaggggaacattaaacatactgattcctgttggttatttgatttcc


tctttgaggttagtgacagaaaacttactttttaatttgtgacctatcta


ggagtggggtacaaaggagagaatagaaacaaagagataatccatgagaa


gaacaatatacttaagtcccttaatgaccccattgggagtgatgtggttg


gcttgatactgatgttcaatagacacctccccctcccatcttcagtcgtc


tttagctctaaaatctctgtggtcgaacttttgagataggctgatgtgcc


accacatatgccatgacccaagggactggctggcaattactctggatgct


tcttgaaggttcttcattgttcttgcccttaacctgcctctgtaggatct


caacttttctcttcccgctccagctgcactatgtatccgtgcttggctct


cagttgttggattctcatgtctcctttttcacagtggctcaggtaagggg


acttgttgcaacatctcgagatgctgtctttagtcctagctctcttggaa


gagaggaaaggagatgagacatactgaaaccacattttcttgatgttgag


cagaatccagttcacagactagaaaatgttttattccttgtaggctcagc


agtacagaaacaaggtggggaaaggaaaacaatttctaaagagtgaatta


ttatgaattatcttataaatgcccatgccacctgctttacttctcataat


tcactaaacctcctttacttctcataattcactaaattcactatgctctg


catttcatctgtggtgtgatgagttaaaattgccttgttgggaatttttt


cagaacatttgtccacttgaacaaaaatcaatattctgtctcttaaacag


aagcatttgaacaagttgcactatgcggaggtgagaacaataattatatt


ggtacatgtcttaaaaattacccagaagcagatgagaatttgtgaataaa


tgactcattaagaaagtgtccccagaagaagtcagtaagtgagtgggggc


atgaggagaggaaaagggaagtcaagcaaagtgtgctatttttggccaag


tcttatagtagggagctttggcctaaatctacaagggaaacttagagcat


aaattgagttgtctgccataaatcaactgggaggaacttaggttttcata


cacttgcattatcagtcatagctaagaacacccaagggccatgtaaactc


ccaattacttctctcacagccatagggtaatcctctaaaaacagacaaat


ggacaggccattggaaataagagtcctggacctaggaagaaattcaggaa


acagtaaaggaattcagggaaacctaggtgggtcactgcagtgactacta


catcattttaagcatttattgagcgtttggtggtgcctagaaagctgtgt


gtgtgtgagtgtgtgtagaaagctatttagttctcacaatacccttagaa


ttatctctgttttacatactaaggaactaaagtgttcagtaatttggaag


agtgtttaaaaaaaggtctgcaatagagccaagattcaaatcctggtctg


tctgaaggccatgtccacagtcacgttattctctccttattatactacct


cctggtcagcctcctaaccaaagtaagaggtgaagtgcagaaaggatagg


agaacttgggatagggtaaacagaagaaaaaaaaaagcagagatgataaa


atagtttgtaactaagaaatgttatagcaaatgagtggggattattctat


ggccaagagaagggaagactgaggaaacttactaatgatcttcaagttca


tgaaagggtttaacacagaatagattggccagctgtgctcaaatttctac


atagtgaatttaaaaagaaaactatttgctgttatagcacgaggatttta


ggctggattgaagaaagtgttctctgggcaaaagggctattgagcactag


agtgaaatcctataaactcaaattagaaagaaaagaaaacagaaaagaaa


agagaaaaataaaagaaaagaaaagaaaagatgtgagcacagtctgtcat


gctgtgcaaatgtatgagatgttaccttgaacacagctgcttcacacctg


caaccttccacagagggagagccgtgaattaaaaaaataaagcagcgaac


ttcattgtagctgaggatgccaggaaacctgatggccttctagaaactat


catattaattgttgtattagggcagaaacatgtactggtctggtttagta


gtttgcttcttaatgcgaactctctttttcacttttgtcctttctccccc


aaagacaaacatttccaagacattatgtatatgtaaaagagcattttatg


tagaagtacaagaaataacatttggtgatgcaacaatgtggcacagagtc


aatgactgtttagatagcaaacggaatggactttgacggtaccttgtgaa


gtggcacatatcattgaataatctattgttttgatgtcctctatgcaccc


atagaccctgcccctgaaaattttctgcataaccaatttagatttatgta


tatatatatacacacacacatatatacacacatatctctatatatataca


cacatacatatagatatatatacataaacactctcaatgatgtgttgaga


tggtatgtactgcagccagactttaaactagctcattattttataatggg


gtgtatactaaaaatttattttgaagtcacttacaaatgtgttttcccca


tcaacattctgctgtagatgtcattatttgggggcagaatatcactgtaa


tcacaccaaatttacctgtgaagttactattttccagctacactgtagac


taaattaatagttctgagttactggagaaactcgatcaccagatcactat


ttctctggaaaaaaatgaattccattttgaatttggaactccagaaatat


ttccatcctaatgtgcctctcagtagaggtttgtcagagggttttccaca


cattgggtaaagccaattgtcaaaagtcaaatgttccctgaaattgcttt


tcctttaaccagagtggtgacttgtgctctgtgaaagagaattttccttc


cttccattttaatagcaggctttcataggattgaagaatttgtacaagag


caacaattattattagcacctactatgtattggacactatattattcatt


gtatatatattatcttatgtagcctccataacaaccttgggatgttatat


tattatcaccattctacagaagaggaaatagatgtaaagaagtcagcttg


ctggaagtcctataaataataaatgatagaattgggttgtgctgtggttt


gaatgtctcctcccaaattaatgttaaaatttaattgccattgtaacagt


attaagaggtaggactggtaagaggtagattaggccatgagagcttcacc


ctcatgaatggattaatccagctttcacaggagtgagatcattataaaag


gccaaatttggtcctctttgtcgcttgccctctcttgcccttctgccttc


caccatgggataatgccacaagaaggccctaatcagatgccagctcctca


atcttggacttagcctatggaactgtgataaatattttttctctttagaa


attactcagatactggtattttgttatagcagcacaaaacaatctaagac


aggttgtgaactctggactgactgaccctaaattcaagagccttttcatg


gatgttgtggtggacgtttgttggttgtgtgcctgacatccattctccct


taccattattttcctttggaaaattttaccttcactacctgtcccccata


ccacaccacatcacagacatttagtttgtatggaattgaccccactgcct


gtcttaggagtgagcccagattgacttaggccaaccactatattatattc


ctctccctcatttccccctcccatactgcctggctcagagataagtaggt


agctcaatcagagccaatgagatagaaggagatatttccttgtcaccttg


gaaggagaagctcccttctttaactgtgttgcaaaagaatgtgagttctg


aagtggtggaaacatttttttacctataggagacagcctgtttctggtgt


ggcaaccactgtgtagagccttagtttgaagccactggggcagaagacag


agggaatccagggtcaaaacaacaacttttggactcctgggtcaaccctt


atctgaagtaagagcggcttctgggctcaattgtaaacatatttgctcaa


acaacaacttgtagtttgagccacgttatggggttcgtgtcaatttcaac


ataaagagttgtaagtaacacaggcactccccaggactattccttcatta


aaaacatttacatcttccaaatctctggtatgatgagacttgacttccac


aatcacactcctgaaacaattcagcaacctaattaatcaaataagattac


attcaggctattctccttgctaagtgaaaaaacttgcccacttaaaattt


acgaagattctgagcaatacagaataaaaacaaagaatgttttggttaat


gattttaaggtaggcagaaaaaaactaattaaacgatgcctttttctgat


aattctagttaattggagttttaattccagtaggaataataaaaactggg


gtctaagtttggtaagtaaagttttcaaattatttatatttgcttaattt


agaaaaatgtatgtacacattcatttccctgtgagacattaaaatatatg


aacatagattaagcaaaatatattttcctttattgtgataattccttgtt


atctccagtctttcccccaaatgtgataagaatacatagctacagaggga


ggagccaagatggccgaataggaacagctccagtctacagctcccagcct


gagggacgcagaagatgggtgatttctgcatttccatctgaggtaccggg


ttcatctcactagggagtgacagacagtgggcgcaggtcagtgggtgcgc


acactgtgctcgagccgaagcagggtgaggcattgcctcactcgagaagc


gcaaggggtcagggagttccctttcctaatcaaagaaaggggtgacggat


ggcacctggaaaatcgggtcactcccaccggaatactgcgcttttccgat


gggcttaaaaaacggcgcatcacaagattatatccctcacctggcttgga


gggtcctaccccacggagtctcgctgattgctagcacagcagtctgagat


caaactgcaaggtggcagcgaggctgggggaggggcgcctgccattgccc


aggcttgcttaggtaaacaaagcagcggggaagctcgaactgggtggagc


ccaccacagctcaaggaggcctgcctgcctctgtaggctccacctctggg


ggcagggcacagacaaacaaaaagacagcagtaacctctgcagacttaaa


tgtccctgtctgacagctttcaaaagagcaggggttctcccagtaggcag


ctggagatctgagaatgggcagactgcctcctcaagtgggtccctgaccc


ctgacccccgagcagcctaactgggaggcaccctccagcaggggcacact


gacatctcacactgcagggtactccaacagacctgcagctgagggtcctg


tctgttagaaggaaaactaacaaacagaaaggacatccacaccaaaaacc


catctgtacatcaccatcatcaaagaccaaaagtagataaaaccacaaag


atggggaaaaaacagaacagaaaaactggaaactctaaaaagcagagcac


ctctcctcctccaaaggaacacagctcctcaccagcaacagaacaaagct


ggacagagaatgactttgacgagctgagagaagaaggcttcagacgatca


aattactctgagctacaggaggacattcaaaccaaaggcaaagaagttga


aaactttgaaaaaaatttagaagaatgtataactagaataaccaatacag


agaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta


cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaaa


ggtatcagcgatggaagatgaaatgaatgaaatgaaacgagaaggaaagt


ttagagaaaaaagaataaaaagaaacgagcaaagcctccaagaaatatgg


gactatgtgaaaagaccaaatctacgtctgattggtgtacctgaaagtga


tggggagaatggaaccaagttggaaaacactctgcaggatattatccagg


agaatttccccaatctagcaaggcaggccaacgttcagattcaggaaata


cagagaacgccacaaagatactcctcgagaagagcaactccaagacacat


aatcgtcagattcaccaaagttgaaatgaaggaaaaaatgttaagggcag


ccagagagaaaggtcgggttaccctcaaagggaagcccatcagactaaca


gcggatctctcggcagaaaccctacaagccagaagagagtggggaccaat


attcaacattcttaaagaaaagaattttcaacccagaatttcatatccag


ccaaactaagcttcataagtgaaggagaaataaaatactttacagacaag


caaatgctgagagattttgtcaccaccaggcctgccctagaagagctcct


gaaggaagcgctaaacatggaaaggaacaaccggtacgagccgctgcaaa


atcatgccaaaatgtaaagaccatcgagactaggaagaaactgcatcaac


taacgagcaaaatcaccagctaacatcataatgacaggatcaaattcaca


cataacactattaactttaaatgtaaatggactaaatgctccaattaaaa


gacacagactggcaaattggataaagagtcaagacccatcagtgtgctgt


attcaggaaacccatctcacgtgcagagacacacataggctcaaaataaa


gggatggaggaaggtctaccaagcaaatggaaaacaaaaaaaggcagggg


ttgcaatcctagtctctgataaaacagactttaaaccaacaaagatcaaa


agagacaaagaaggccattacataatggtaaagggatcaattcaacaaga


agagctaactatcctaaatatatatgcacccaatacaggagcacccagat


tcataaagcaagtcctgagtgacctacaaagagacttagactcccacaca


ttaataatgggagactttaacaccccactgtcaacattagacagatcaac


gagacagaaagtcaacaaggatacccaggaattgaactcagctctgcacc


aagcggacctaatagacatctacagaactctccaccccaaatcaacagaa


tatacatttttttcagcaccacaccacacctattccaaaattgaccacat


acttggaagtaaagctctcctcagcaaatgtaaaagaacagaaattataa


aaaactatctctcagaccacagtgcagtcaaactagaactcaggattaag


aatctcactcaaaaccgctcaactacatggaaactgaaaaacctgctcct


gaatgactactgggtacataacgaaatgaaggcagaaatgaagatgttct


ttgaaaccaacgagaacaaagacacaacataccagaatctctgggacgca


ttcaaagcagtgtgtagagggaaatttatagcactaaatgcccacaagag


aaagcaggaaagatccaaaattgacaccctaacatcacaattaaaagaac


tagagaagcaagagcaaacacattcaaaagctagcagaaggcaagaaata


actaaaatcagagcagaactgaaggaaatagagacacagaaaacccttca


aaaaattaatgaatccaggagctggtttttttgaaaggatcaacaaaatt


gatagaccgccagcaagactaataaagaaaaaaagagagaagaatcaaat


agactcaataaaaaatgataaaggggatatcaccaccgatcccacagaaa


tacaaactaccatcagagaatactacaaacacctctacgcaaataaacta


gaaaatgtagaagaaatggataaattccttgacacatacactctcccaag


actaaaccaggaagaagttgaatctctgaatagaccaataacaggatctg


aaactgtggcaataatcaatagcttaccaaccaaaaagagtccaggacca


gatggattcacagccgaattctaccagaggtacaaggaggagctggtacc


attccttctgaaactattccaatcaatagaaaaagagggaatcctcccta


actcattttatgaggccagcatcattctgataccaaagccgggcagagac


acaaccaaaaaagagaattttagaccaatatccttgatgaacattgatgc


aaaaatcctcaataaaatactggcaaaccgaatccagcaccatatcaaaa


agcttatccaccatgatcaagtgggcttcatccctgggatgcaaggctgg


ttcaatatacacaaatcaataaacgtaatccagcatataaacagagccaa


agacaaaaaccatatgattatctcaatagacgcagaaaaggcctttgaca


aaattcaacaactcttcatgctaaaaactctcaataaattaggtattgat


gggacatatttcaaaataataagagctatctatgacaaacccacagccag


tatcatactgaatgggcaaaaactggaagcattccctttgaaaacgggca


gaaggcagggatgccctctctcaccactcctattcaacatagtgttggaa


gttctggccagggcaattagggaggagaaggaaataaagggtattcaatt


aggaaaagaggaagtcaaattgtccctgtttttagatgacatgattgtat


atctagaaaaccccattgtctcagcccaaaatctccttaagctgataagc


aacttcaggggtgatcagccagccacctggtggcaggttgattatattgg


acttcttccattatggaaagtgcagaagtttgtccttactggaatataca


cttactccagatataagtttgcctatcctgcatgcagtgcttctgccaag


actaccatctgtggactcacgtaatgccttatcaagtgtcatggtattcc


acacagcgttgcctctgaccaaggcactcactttacggctaaagaagtgt


gtcagtaggctcatgctcatggaattcgctggtcttcccatgttccacat


catcctgaagaagacggattgatagaatgttagaatagctttttgaagtc


acagttacaatgccaactaggcgacgatactttgcagggctggcgcaaag


ttctccagaaggctgtatatgctctgaatcagcgtccagtgtatggtact


ctttctcccatagccaggattcgcaagtccaggaatcgaggggtggaagt


ggaagtggcacctcttgatcatcaccattcactgtcacccctagggatcc


actagcaacagttttgcttcctgtccccacaacattgcattctgctggcc


tagaggtcttagttccagagggaggaacactgccaccaggagacacaaca


gttccattaaactggaagttagtattgccacctggagactttgggttcct


cctacctttaagtcaacaggctaagaaaggagttacagtgttggctgggg


tgatttacctggactatcaagatgaaatcattctactattccacaatgga


ggcaaggaagagtatgcatggaacacaggagacccattagggcgtctctt


agtattaccatgcctgtgattaaagtcaatgggaaactacaacagcccaa


tccaggcaggactacagatggcccagactcttcaggaatgaagatttggg


tcacaccaccaggaaaaaaaccatgaactattgaggtgcttgctgaaggc


aaagagaatacagaatgggtagtagaagaaggtagtcatcaacaccagct


acgactacgtgaccagttgcagaaatgaggagtgtaattgtcatgaatat


ttcctcttgattttgttaaaatgatgtttatgcatgtacacacttgtact


aagaaaaatcttcattttctttttcctttattatgtgacataagatttat


tgacttcctatcagcatttaagtattgttaactttaggtaatagtatctg


ggttgaggattggtgcatttctggttttatgaaggatagttctatgaagg


atagttgtattatcttaagcataattatgacctattaatgtgtttatttg


aagagtatgtatgatctcaggagatgtgtgtgggtacaagctgacaaaag


gtggacttgtgatggttaatactgagtgtcaacttgattggattgaagga


tgcaaagtattgatcctgggtgtctgtgagggtgttgtcaaaggagatta


acatttgagtcagtgaactgggaaaggcagacccactcttaacctgggtg


ggcaccatctaatcagcttccagcgaatataaagcaggcagaaaaacgtg


aaaaggctagatggcccagcctcccagcctttctcctgtgctggatgctt


cctgacctcgaacgtcggactccaggttcttcagttttggaactcagact


ggcttttcttgttcctcagcctgcaaatggcctattgtggtaccctgtga


ttgtgtgagttaatacctaataaactcccctttatatgtatccaatgagt


tctgtccctctagagaaccctgggtaacacaggatgttacagataaattt


gctatgaatatttgtgtacaaatctttatattgacagatactttaatttt


cttgggggaaatacttggaagtaggttgcatggattatatgcatgtgtgt


gttttgctttttaagaaactatcaaatggttttccaaagtagttgaatca


ttttacatttctatcatcagtgtatgagagtttcagttgctcaaaattct


tgctaaaaccagatgcggtgtatttttttttaaattagccattctaatag


gtctataatgttatctcattgtgattttaacttaatggtgttgagaatct


tttcaggtttatttgccatctatatgttttctttggtgaagtgtctgttc


aaatcttttgtccattgttttcttattattgaattttaagaattttaaat


atatatatataaatatattttggatataagtcttaaatcagatatgtggt


ttgcaaatattttctcccaagtatctggtttgtcttttcattttgctagc


agtgtgtttcaaaaaatagaagttttaaaattatgataatgtccaattta


tctatattttattttacagattgtgattttggtatcatatctaaggaatc


tttacctaatcaaagttgcagaagttttctcctaaaagttttataggttt


aagttttacatttaggtctatgattcatgttgagttaattttatggtgca


agatatagatcaaagtttatttttttttcttgcatatatatatatatcca


attattccagcaccatttgttgaaaagactaatctttctctactgaattg


cctttgaaactttgtcaaaatcatttgtctgtatatgtgtgggtctattt


ctggactctcttctgttccatttgtctatttgtctatctttacaccagta


ccaaactgtcttgattattgtagcttcataataagtgttagttctctaaa


tttgttcatctttttttttcagagttgttttggctattctgggtcctttg


aatttttatatcaattttagaatcagttaatttctacaaaaaaccctgct


agaattttaactgggattggtttaaatgtatggattggtttgggaagagt


ggctttcttagcattattgagtcttttgactaatgaacacaatataggta


gaacatcactaatctgaaaatctgaaatttgaaatgctctaaaatctgaa


actttttcggtaatgacatgatgccacaagtggaaaattacacatctgac


acctttgctttcttatagttcaatatatacaaactttgttttctgaacaa


aagtataaaaaattttgtgtaaaattacctttaggctgtgtacataaggt


gtatataaaacaaatgcattttgtctttagacttgggtcccattcccaag


atatctcattatgcatatgcaaatattccaaatccaaaaaaatttacaat


ccaaaacacttcttgtcccaagcattttggatgaggaatacacaacctgt


atatctcaccatttatttcggtgatctttaattactttcagcaatgtttt


atagttattcagtgcacaggtcttttacatcttttgacacttttattttt


aattatttcatatttttgaggccatttttaatggcattgttttcaaaata


ttaacttctgatggttcattgctaatatataaaataaaatggatttttat


atattgatcttttatcctgcaacattgcttcaactcatttaataattcta


gtagcttatttcaaaaatatagattccagtggattttctacatgaataat


tatgtcatttgtgaataaagagttttgtttatttcttttcaatctggata


ccttttatttctttttcttgctttattgcattgacaagaaattccaatat


aatgtgaaagagaagtggtgagagtggatatcttgctttgttcctgatct


tagctattccttattttttttaagctatattttcatagaagcccttcatc


aagttgagaaagttccttttttaaggcagtaaaattccttttttaaagca


aataaagaatgtatattgatatttgtcaaatactttcttctacatctatt


gagatgatcatatagttaatacatttcttttgtttctttattattatgga


gtacattaattgattttttgttgttaaaccaaccttgcacacttggtcat


gatgtattatcctttaacctggtgttggatttaatttgctcaaatttcag


taacaactttttcaacaatgatcgtgagggatgttggcctgcagttttct


tttcttgtaatatctctatctgttttggacatttaattctgggcttatag


aatgagttgaagaatgtcctctccttttcaattttgtgcaataatttgtg


tagaatggacactgtttcttccttgaatctttggtaggattatcaagtga


agccacctggacctccgcatttctttgtgggaaattgtataactattgat


ttaatttctttaatagtggagggctatttatattacctaatttttttctt


gagttagctttggtagtttgtatctttcaaagaatttgtccattgtatct


aagttgtagagtttattcaaataaggttgtatataatattcccttagtgt


cattttgatatttgtaaaatttgccctggtttcacttctttcatgcctac


tactgacaatttgtatcttccttctttttctctgaccagtctggctaggg


atttattaatttcatcaattttctcaattttctcaagactgacatattat


tttgcttttgtatgtctcaaaagtctttattttgctatcatttttgaaat


acttttttttcagtgggtatagaattctagaatagtttttccctcccaat


actttaaagatattgccctactgttttttacttttgcattgtttgtaaca


gaaatttactgttacccttatttctgtttctgtatacatatcttttcctt


ctactgcttataagatttcctatttatcacccattttgatacattttatt


attctgtgccttagtattctttcttttatgtttcttttgcttagggtttg


ctaagcttcttatatatgtgggtttgtcatttttatcaagtttggaaaat


tttcatccataatatcttcaaatatttcccctccattatgacttcaatta


ctcatgtattaggctgtttgagttgtcccacacctaacactctgtccagt


ttttaacagttatgttttctatgtttcattttgggttgcttctatttcca


tgccttcaagttcactaatcttttcttcctcaatgtctaatctgctgtta


attctagaaagtatatttttcatcttatacattttagttttaatcattaa


aagtttggtttggatctttttatgcttttcatgtttttacttaacttttt


gaacatatggaatacaattataataaccattttaatatccttctctgcta


actctaacatctgtggttagttcttaatcagattcaattgattgatttat


ctcctcattatatatcatattttcctgtttcttttgtatgtctggaaatt


tttgtttgaatgccagacattgtaaattatgccttgtgggatgctatata


tttttgtgttcctataaatattcttgagctttattataggacacagttaa


gttacttggaaacagtttgatcttttctacccttgtttttaggcattgct


aggtagaccagagtagtgttagtcatgggttaattattcattactactga


atcaagacccttcagaataccatactcaatgccacataaatcttgaggtt


ttccagtcttcctggtggatggcaggcactattctccccagtgagtgctc


actaggtacattttgggtggtcctttctctggtcgtaggtagtatcttta


catgaatgcatttacccatactcagaatgagggacattttgtagaaatct


gaagttctctctctgcacagctctttcttttctggtaatctgttctgtga


actcaagctgttttggtctcctcagaccctccatttcatttcctcagctc


atagagtcttctacactccacctgggttcccctccctgttctctggcctg


gaaactcccttaaggtagtaaggtagaggaatcacagagtccacatcact


tgtttcacatatctgagggctcagtatcctgcattgcctggtgactcagt


gtcttaaaaactattgttcatatattttgtttatttttttgttgtttcag


GTGGAAGAGTAAACTGTTCCTGTTTCTCCATCTTGACAGGAAGCAGAAGT


TCTCCTTAGCTGATTTTTCTTCTTGCCTTATACTGGGTTCTTTAACACCA


GAACAAAAATAAATAAATAAAAGAATCTTCCAGAAATTCATGAAGAGACT


TCAGgtatgcagcataggtgtttgacaactacgtagagcagtgggccagt


cttaatttgtttcattaatctggtaaaaaaagaaggatttgcatgaaatt


cgcaattcagattgccatattagccactaagtcagaagacttgagagctg


agccttggagaaacaaaatccttgacagttgttgatgtttttgtaatagg


agacttctattattttagcaaacagaaagactacataaaaatgtcagaaa


gaaactacttttggagaaaaaaataggaactgactcactaaagggaccta


ttggggataattacccagagcctgtggtagatcaggatggctgatttaaa


gttattgtttttgcaaattctgcaaaaaacaaaaaaacaaacaaagattt


ttttttcttggctttagcaatagtggaaaaatttcttccacagctgtaat


ctcatgtaaactgccaaagtggattttattctctgagtttctcaagcctc


cttttctctaatacagagtaatattggctaatgataaaacagtacaaact


ataatttgaaatagtgtttcaattttcaatttgcagtctgggaatagatc


acaagaattttagtttgttggtttcctgctgccgtagGTCAGGAAAGCCT


GAACTTTCTCATCCAACACCCGTTCAAGTCGGAGGATGCTGATTAATTCA


TGATGGAAAGGTATCAATTTCACAGCCCAGACCTCTACATTCTTGAAATG


CTCTTCTCATGGGAGACATTAAATAAGCTTTGAATTAACTgtaagtaaat


aaatattcaattatttgaatagcactatattctgtttcattaaaaatatt


tcttacttgatttttctcaccaataaaagtattcaaaaatgtttaattca


atcatcatcacatcaagaattacgatattatctccgaattatataagtga


ttttctggtcatgcacataccagttgttcttactctatataaatgtactt


ttttgttaaaatataatcatttagtttaaaattgagatatccttgttact


ggagaatttcaaatatatttaacaacaacaataaaaagcttaaatataaa


tagcaaatgtgagtaagcatgggataattgaatggctttttatatttttg


gaagtcaattccatcttcatagattaattcttccaggttttaattcacat


tgagttctaatggctgacaccatttttcctcctctgtatcttcttcagaa


tatggacatcttgggttgttgtgagaatggagttaccaaagttataatca


gttaaacactaagcaaagtgttatggcaaagtgtagtcaagtgcatttcc


tgtcagatggtcaacaccttaagtgcttgaggttgaagagaattgttttc


atctctgttgataggaagaaccaatgatgtgatccctggatactggcaat


tgttgcccaagaaaacctctccccttgctgtgggactgctgggcacccaa


ataatgcacccaaataatttgatagctcttgttacacagctcttcttgga


ctaaaaggccctcagcattcctgtcaatttctacactcctgaagcttcat


ctggtaattccctgacttatgcagtaccaccccactcctagagtgtcttt


agaggtgtttgctttcttcttaaagctattatcctcttgttcttcgaggg


cttattcttaggggacttggagggcttattaagccctaagactcaatcat


tagcaaccaaaaaattaacttctactctcccaaggtcagggacagcataa


tcttaaaatacagtggttagggaaaatatttgcataacgtattttaaaga


atgcacaggaagggaaggcaaatatagaggaggaagatgatttaagagag


tagagcagaaaatatacagctgccacaaaaatttagaattgcagaaccag


ctaaaggtaggtgtttgttctgcttactaataaagcatacaccaaataca


attatttggaagcctataattttttagggtaaaattttttaggtgttctt


ctagtcttaccccattattttaaagatgtaaaaactggatcccaaagagg


agaaatgacttggttaagctcacacagaaactagaatcctacatcctttg


tattgtatcaaatgcttttatgattaccatatgctgcctttttaaggcct


gtgatctccagagagagatttcagctttttgatatgccaaactaaggaat


tcttagttggccttttttttcttcttgagtttgtgcttgcttaacctaat


gagtaggtaataaccctgtgtttccgaatttcttaaatcagactttttgg


gggatgaaattttgttaatgattctgttaagaaataatgtcttttacact


ttctcagctgaaaggcaaggggatgcaggatggaggttagaaatacacag


ctgccatagtctaattcttatatagatattattgagtgattaaatgatca


agaaaactgcctactgcttctctttacttttctgtatttagcaggaacat


gaccttttctttttgatccataatgagagagatgaagagaatatgagaga


atatatgagaaatgaaagtgtgctaagaatccacataattaaacacccaa


acacttagtttatcattcaagatccttcatgatcttgccaatctattttc


ccaactatatcccatattatctccctttgcatacagtttaatccatacac


taaataacttgctattctttaaatgctttgtttttctacctctgcctcta


ttttcatactgctttctctacctgacattctctgtccccaaactgttgct


attcaaatttgacttagcttgggttacctgagatatagacactgagacaa


gcttttaaggttgagtaatttgttagagagtaaaggtaggagaatgatta


agtgaaagaatgaaagaaagtttattttggaagagcatgttatcaagaca


gctttcagttccaggaagcaactgaagcataatcccacagggaaagctct


tgaagcccagacaagctattctgccttaaagttatcctacccaaggggta


agggagctaagtatttatacaccaacactcttcagatattgcttgagagc


tgctcctagagacattaattccagaagcttccaacctgttgaacaggtgg


cagagtgagtttcagcagtgagaggacacccttaacaaagaactactggt


gctaggagttggaagttgggccaaagggaaacagatgctaacagggaggt


aatatgagaggcactgacagcatctgctacaaaactgtccccaaaagaaa


agatgggatacagctggttatgctgcagttaaaaagcaaccaagtattaa


tggcttaataccacaaaagtttatttttcacttcttgttatcagGAGACC


TCTGTGGATTCTATAACTCAAAGACCCAGGCTGACGGCTGGTGGAATAGC


TACCATCTCAAACATGGCGTGATGCTGCATCAGAGAGAAAAATAAGCTCT


TGCATTGACAATGAAATGTCTACCCTGT





SEQ ID NO: 5 SATB1 1d variant >DB126439 (Human EST)


Normal text is exon 1


Bold text is exon 2


Underlined sequences are primer sites for detection of this transcript


ATG in italics indicates the translation start site in exon 2


CTCCTTCTCTTCCTTCCCTTGGCCCCGCGCGCTCGCTCGCTCGCTCCTCG


CCTCGCTCTCCCCTTTAAACGCCCACTTCGTATGGGGAAAGAGGACAACT


TGAAGTCAAGTTGCAATTAACTTCCGCGGCAGCCGCAGCTCCGGCGGCGG


CGGCGGCGGCAGGAGAGGCAGAAGCCGCCGCCTCGGAAGTCCGACGCCGG


CGCGCCCGCCCGGGGAGCCGTTCTTGGTTTCAGGCCCGCACTCGACAGCC


ACCGCCGCCCCCAACGTCCATGCCTGAGTGATCTTTAGACAGTGACTGAG



TA
custom character
ATCATTTGAACGAGGCAACTCAGGGGAAAGAACATTCAGAAAT

GT






CTAACAATGTGAGTGATCC

GAAGGGTCCACCAGCCAAGATTGCCCGCCTG




GAGCAGAACGGGAGCCCGCTAGGAAGAGGAAGGCTTGGGAGTACAGGTGC




AAAAATGCAGGGAGTGCCTTTAAAACACTCGGGCCATCTGATGAAAACCA




ACCTTAGGAAAGGAACCATGCTGCCAGTTTTCTGTGTGGTGGAACATTAT




GAAAACGCCATTGAATATGATTG









Claims
  • 1. A method for treating a cancer in a human patient, the method comprising: determining, in a sample from primary tumor tissue from the human patient, a level of SEQ ID NO:2 transcripts that is at least 10-fold greater, when normalized, compared to the level in control normal tissue, by: obtaining or having obtained a nucleic acid sample comprising RNA from a primary tumor; detecting or having detected a level of SEQ ID NO:2 transcripts that is at least 10-fold greater in comparison to the level in control normal tissue, wherein the transcripts are the RNA version of SEQ ID NO:2; and administering a chemotherapy or immunotherapy agent to the human patient determined to have a level of SEQ ID NO: 2 transcripts that is at least 10-fold greater, when normalized, compared to the level in control normal tissue.
  • 2. The method of claim 1, wherein the detecting step comprises performing a quantitative PCR reaction.
  • 3. The method of claim 1, wherein the detecting step comprises performing an in situ hybridization reaction.
  • 4. The method of claim 1, further comprising detecting the level of a SATB1 exon 1d transcript in the primary tumor sample.
  • 5. The method of claim 4, wherein the step of detecting the level of the SATB1 exon 1d transcript comprises performing a quantitative PCR reaction or performing an in situ hybridization reaction.
  • 6. The method of claim 1, wherein the cancer is breast cancer, pancreatic cancer, colon cancer, or prostate cancer.
  • 7. A method of detecting the level of expression of an RNA version of the SAILOR sequence SEQ ID NO: 2 in a human tissue sample, wherein the method comprises performing a quantitative reverse transcriptase-PCR (RT-PCR) on RNA obtained from the human tissue sample using amplification oligonucleotides that amplify a fragment comprising at least 700 contiguous nucleotides of SEQ ID NO: 2.
  • 8. A method of detecting the level of expression of an RNA version of the SAILOR sequence SEQ ID NO: 2 in a human tissue sample, wherein the method comprises performing a quantitative reverse transcriptase-PCR (RT-PCR) on RNA obtained from the human tissue sample using amplification oligonucleotides that amplify a fragment comprising at least 600 contiguous nucleotides of SEQ ID NO: 2, wherein the human tissue sample is from a breast cancer, pancreatic cancer, colon cancer, or prostate cancer.
  • 9. The method of claim 7, wherein the human tissue sample is a blood sample.
  • 10. A method of detecting the level of expression of an RNA version of the SAILOR sequence SEQ ID NO:2 in exosomes from a blood sample from a human patient that has cancer, wherein the method comprises (i) performing reverse transcription on RNA from exosomes obtained from the blood sample and quantifying the amount of the RNA version of SAILOR sequence SEQ ID NO:2 in an amplification reaction using amplification oligonucleotides that amplify a fragment comprising at least 35 contiguous nucleotides of SEQ ID NO:2; or (ii) sequencing cDNA transcribed from RNA from exosomes from the blood sample in a sequencing reaction or directly sequencing RNA from exosomes from the blood sample in a sequencing reaction; and quantifying the level of RNA fragments that comprise at least 35 contiguous nucleotides of SEQ ID NO:2.
  • 11. The method of claim 10, wherein the method of (i) comprises performing quantitative RT-PCR.
  • 12. The method of claim 10, further comprising detecting the presence of a SATB1 transcript in the exosome sample.
  • 13. The method of claim 10, wherein the cancer is breast cancer, pancreatic cancer, colon cancer, or prostate cancer.
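
The quantitative criteria recited in the claims above (a normalized level at least 10-fold greater than control normal tissue, and an amplified fragment comprising a minimum number of contiguous nucleotides of SEQ ID NO: 2) can be illustrated with a minimal sketch. The claims do not state how the qPCR signal is normalized; the sketch below assumes the common 2^-ddCt calculation against a reference gene, with roughly 100% amplification efficiency. The function names, Ct values, and toy sequences are hypothetical and are not taken from the patent.

```python
def fold_change_ddct(ct_target_tumor, ct_ref_tumor,
                     ct_target_normal, ct_ref_normal):
    """Relative expression (tumor vs. normal) by the 2^-ddCt method.

    Assumption: both amplicons amplify at ~100% efficiency, and the
    reference gene is stably expressed in tumor and normal tissue.
    """
    delta_tumor = ct_target_tumor - ct_ref_tumor      # normalize tumor sample
    delta_normal = ct_target_normal - ct_ref_normal   # normalize control sample
    return 2.0 ** -(delta_tumor - delta_normal)


def meets_threshold(fold_change, threshold=10.0):
    """True when the normalized level is at least `threshold`-fold greater."""
    return fold_change >= threshold


def longest_shared_run(fragment, reference):
    """Length of the longest contiguous run shared by fragment and reference.

    Comparison is case-insensitive and treats RNA 'U' as DNA 'T', so an
    RNA fragment can be checked against a DNA reference sequence.
    """
    a = fragment.upper().replace("U", "T")
    b = reference.upper().replace("U", "T")
    best = 0
    prev = [0] * (len(b) + 1)  # run lengths ending at the previous base of a
    for base_a in a:
        cur = [0] * (len(b) + 1)
        for j, base_b in enumerate(b, start=1):
            if base_a == base_b:
                cur[j] = prev[j - 1] + 1  # extend the run ending one base back
                if cur[j] > best:
                    best = cur[j]
        prev = cur
    return best


if __name__ == "__main__":
    # Hypothetical Ct values: target and reference gene, tumor vs. normal.
    fold = fold_change_ddct(22.0, 18.0, 26.0, 18.0)
    print(fold)                   # 16.0
    print(meets_threshold(fold))  # True

    # Toy sequences only; a real check would use SEQ ID NO: 2 and N = 35, 600 or 700.
    print(longest_shared_run("GGAUCC", "ttggatccaa") >= 6)  # True
```

The longest-shared-run check uses a simple dynamic-programming table, which is quadratic in sequence length; that is adequate for amplicon-sized inputs, though an aligner would be the more usual choice for long references.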
CROSS-REFERENCES TO RELATED APPLICATIONS

This application is a continuation of PCT application PCT/US2015/031431, which claims priority benefit of U.S. Provisional Application No. 61/994,732, filed May 16, 2014 and U.S. Provisional Application No. 62/002,125, filed May 22, 2014. Each application is incorporated by reference for all purposes.

STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER FEDERALLY SPONSORED RESEARCH AND DEVELOPMENT

This invention was made during work supported under Grant No. CA39681 awarded by the National Cancer Institute of the National Institutes of Health and under Contract No. DE-AC02-05CH11231 awarded by the U.S. Department of Energy. The government has certain rights in this invention.

US Referenced Citations (6)
Number Name Date Kind
5990299 Ruzdijic Nov 1999 A
7745391 Mintz Jun 2010 B2
20030082742 Astolfi et al. May 2003 A1
20030198961 Spelsberg et al. Oct 2003 A1
20040038240 Sinha et al. Feb 2004 A1
20040053962 Adrian Mar 2004 A1
Foreign Referenced Citations (4)
Number Date Country
2 524 968 Nov 2012 EP
2524968 Nov 2012 EP
2007075206 Jul 2007 WO
2012122101 Sep 2012 WO
Non-Patent Literature Citations (3)
Entry
Liu et al. Clinical Immunology. 2004. 112: 225-230.
Coleman, R. Drug Discovery Today. 2003. 8: 233-235.
International Search Report and Written Opinion from PCT/US2015/031431 dated Mar. 2, 2016, 11 pages.
Related Publications (1)
Number Date Country
20170067125 A1 Mar 2017 US
Provisional Applications (2)
Number Date Country
61994732 May 2014 US
62002125 May 2014 US
Continuations (1)
Number Date Country
Parent PCT/US2015/031431 May 2015 US
Child 15353710 US