Gene fusions and alternatively spliced junctions associated with breast cancer

Information

  • Patent Grant
  • 10169530
  • Patent Number
    10,169,530
  • Date Filed
    Monday, November 4, 2013
    10 years ago
  • Date Issued
    Tuesday, January 1, 2019
    5 years ago
Abstract
The present invention relates to gene fusions and alternative spliced junctions associated with breast cancer. The present invention also relates to novel methods of identifying gene fusions and alternative spliced junctions in RNA sequencing data. The present invention further relates to predicting prognosis of a breast cancer patient based on the number of gene fusion events.
Description
SEQUENCE LISTING

The instant application contains a Sequence Listing which has been submitted electronically in ASCII format and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Oct. 24, 2013, is named GHI-0056-PCT_SL.txt and is 1,261,876 bytes in size.


FIELD OF THE INVENTION

The present invention relates to gene fusions and genes comprising alternative spliced junctions associated with breast cancer. The present invention also relates to methods of identifying gene fusions and genes comprising alternative spliced junctions in samples obtained from a patient with cancer. Furthermore, the present invention relates to method of predicting the prognosis of a patient with breast cancer based on the number of gene fusion events.


INTRODUCTION

Genomic aberrations resulting in gene fusions and alternatively spliced genes play an important role in cancer. Gene fusions, for example, have been estimated to account for about 20% of human cancer morbidity. Mitelman et al., Nature Reviews Cancer 7:233-245 (2007). Gene fusions are hybrids created by joining two previously separate genes via genomic aberrations such as translocations, deletions, and inversions, or trans-splicing between precursor mRNAs. Gene fusions may up-regulate expression of oncogenic genes by fusing a strong promoter to an oncogene. The first gene fusion identified in human neoplasia was BCR-ABL1 in chronic myelogenous leukemia (CML). The protein resulting from this fusion exhibits constitutive tyrosine kinase activity. Discovery of BCR-ABL1 led to development of a targeted treatment for CML using the tyrosine kinase inhibitor imatinib, which was approved in 2001. Druker et al., New England Journal of Medicine 344:1038-1042 (2001). Most of the known gene fusions have been found in hematological disorders; however, with the advent of next-generation sequencing technology, rare recurrent gene fusion events have been identified in common solid tumors. See Kohno et al., (2012) Nature Medicine 18: 375-377 (2012); Takeuchi et al., Nature Medicine 18: 378-381 (2012); Lipson et al., Nature Medicine, 18: 382-384 (2012); and Ju, et al., Genome Res., 22: 436-445 (2012).


In cancer, aberrantly spliced pre-mRNAs escape the quality control mechanisms within cells (e.g., the nonsense mediated mRNA decay pathway) and are, therefore, translated into aberrant proteins. He et al., PLoS ONE 4(3):e4732 (2009). For example, alternative splicing is known to be related to the pathogenesis of colon cancer and has been described to occur in lung adenocarcinoma. Seo et al., Genome Research 1-11 (October 2012).


Transcriptome sequencing enables detection of transcriptional variants such as gene fusions and alternative splicing events. Current methods, such as ChimeranScan (Robinson, et al., Nature Medicine 17: 1646-1651 (2011)), SnowShoes-FTD (Asmann et al., Cancer Res, 72: 1921-1928 (2012)), GSTRUCT-fusions (Seshagiri, S. et al., Nature 488: 660-664 (2012)), and GFP (Ju et al., Genome Res., 22: 436-445 (2012)), use paired-end data obtained from fresh frozen tissue samples to detect gene fusions. Other methods, such as TopHat-Fusion (Kim and Salzberg, Genome Biol 12: R72 (2011)), FusionMap (Ge et al. Bioinformatics, 27: 1922-1928 (2011)), and FusionFinder (Francis et al. PLoS One, 7(6):e39987 (2012)) can use single-end data from cell lines or fresh frozen tissue samples to detect gene fusions.


Because standard clinical practices include generating formalin-fixed, paraffin-embedded (FFPE) tissue samples from biopsies and surgical resections, FFPE samples provide an enormous repository of information for cancer research. Nonetheless, current methods are not well suited for investigating RNA from FFPE samples as the RNA from such samples is often degraded and libraries generated from those samples have low complexity and small insert sizes.


The present bioinformatics approaches identify gene fusions and alternative spliced junctions from FFPE RNA-sequencing datasets at base-pair resolution.


SUMMARY

A bioinformatics approach was developed to identify gene fusion junctions using FFPE RNA-sequencing datasets. The present invention provides gene fusion junctions that are present in breast cancer tissue samples. These gene fusions are provided in Tables A and B. The present invention also provides a bioinformatics approach to identify alternative spliced junctions. The present invention provides alternative spliced junctions that are present in breast cancer tissue samples. These alternative spliced junctions are present in Table 5.


The present invention accommodates the use of archived paraffin-embedded biopsy material for assay of gene fusion transcripts, and therefore is compatible with the most widely available type of biopsy material. It is also compatible with other different methods of tumor tissue harvest, for example, via core biopsy or fine needle aspiration.


A multiplexed, whole genome sequencing methodology was used to enable whole transcriptome-wide gene fusion and alternative spliced junction discovery using low amounts of FFPE tissue. The methods described herein support the use of single end or paired end sequence reads.


In one aspect, the invention provides a method for identifying a gene fusion in a biological sample obtained from a patient with cancer. The method comprises obtaining a plurality of reads from RNA sequencing of the biological sample. The read is then mapped to the human genome. Next, the method comprises determining whether the read comprises a distant spliced junction and selecting the read comprising a distant spliced junction. A candidate gene fusion comprising the distant spliced junction is then identified. The method also comprises creating a first set of templates for the candidate gene fusion. The first set of templates comprises: (1) a fusion template comprising 50 base pairs (bp) of exonic sequence of a preserved region of a donor gene and 50 bp of exonic sequence of a preserved region of an acceptor gene, (2) a donor template comprising 50 bp of exonic sequence of a preserved region of a donor gene and 50 bp of exonic sequence of a discarded region of an donor gene, (3) an acceptor template comprising 50 bp of exonic sequence of a discarded region of a acceptor gene and 50 bp of exonic sequence of a preserved region of an acceptor gene, (4) a donor genomic template comprising 50 bp upstream genomic sequence of a donor splicing site and 50 bp downstream genomic sequence of a donor splicing site, and (5) an acceptor genomic template comprising 50 bp upstream genomic sequence of an acceptor splicing site and 50 bp downstream genomic sequence of an acceptor splicing site. The first set of templates is used to filter false positives and provide accurate read alignment information. A candidate gene fusion is removed if any of the first template set sequences are identical, but map to different genes in the human genome. Next, a second set of templates is created. The second set of templates comprises (a) a fusion template comprising 150 bp of exonic sequence of a preserved region of a donor gene and 150 bp of exonic sequence of a preserved region of an acceptor gene, (b) a donor template comprising 150 bp of exonic sequence of a preserved region of a donor gene and 150 bp of exonic sequence of a discarded region of an donor gene, (c) an acceptor template comprising 150 bp of exonic sequence of a discarded region of a acceptor gene and 150 bp of exonic sequence of a preserved region of an acceptor gene, (d) a donor genomic template comprising 150 bp upstream genomic sequence of a donor splicing site and 150 bp downstream genomic sequence of a donor splicing site, and (e) an acceptor genomic template comprising 150 bp upstream genomic sequence of an acceptor splicing site and 150 bp downstream genomic sequence of an acceptor splicing site. The second set of templates is also used to filter false positives by determining the homology between templates (b) and (c) and between templates (d) and (e) and removing the candidate gene fusion if templates (b) and (c) are homologous or if templates (d) and (e) are homologous. Next, a read obtained from RNA sequencing of the biological sample is aligned to the first set of templates and the read that maps to the fusion template of the first set of templates is selected.


In some embodiments, a gene fusion is identified by a candidate gene fusion having at least two non-duplicate reads that map to the fusion template of the first set of templates.


In other embodiments, a gene fusion is identified by a candidate gene fusion having one non-duplicate read that maps to the fusion template of the first set of templates. The method then comprises determining and comparing the expression levels of the exons and introns of the preserved regions of the donor gene and the acceptor gene to the expression levels of the exons and introns of the discarded regions of the donor gene and the acceptor gene. A gene fusion is then identified as having increased expression levels of exons and introns of the preserved regions of the donor gene and the acceptor gene compared to the expression levels of the exons and introns of the discarded regions of the donor gene and the acceptor gene.


In another aspect, the present invention provides a method for predicting the presence of a gene fusion in a biological sample obtained from a patient with cancer. The method comprises identifying a gene fusion according to any of claims 1-3 in a first biological sample. Next, a second biological sample that does not have reads that map to the gene fusion is obtained. Then, the method comprises determining in the second biological sample the expression levels of exons and introns of preserved and discarded regions of a donor gene and an acceptor gene of the gene fusion identified in any of claims 1-3. The expression level of the second biological sample is compared to the expression levels of the first biological sample. The presence of the gene fusion in the second biological sample is predicted based on having a similar expression profile compared to the first biological sample.


In yet another aspect, the invention provides a method of predicting a likelihood of poor prognosis in a breast cancer patient. Gene fusion events in a breast tumor sample from the patient are identified and the number of gene fusion events in the breast tumor sample is determined. The presence of three or more gene fusion events is positively correlated with an increased likelihood of poor prognosis.


In a further aspect, the present invention provides a method of identifying an alternatively spliced junction in a biological sample obtained from a patient with cancer. The method comprises obtaining a read from RNA sequencing of the biological sample. Next, the read is mapped to the human genome. It is then determined whether the read comprises a distant spliced junction and the read that comprises the distant spliced junction is selected. It is next determined whether the distant spliced junction is present in a single gene. The distant spliced junction that is present in a single gene is selected. In some embodiments, the method further comprises preparing a report based on the identification of an alternative spliced junction.


In still a further aspect, the present invention provides a method for predicting a risk of recurrence of breast cancer. The method comprises determining the presence of an alternative spliced junction in a breast cancer tumor sample obtained from said patient. The alternative spliced junction is selected from Table 5. The presence of junction −chr3:196118684_−chr3:196129890 in UBXN, junction −chr12:24366277_−chr12:24048958 in SOX5, junction −chr9:114148657_−chr9:114154104 in KIAA0368, junction +chr18:39629569_+chr18:39623697 in PIK3C3, or junction+chr1:155695810_chr1:155695173 is correlated with an decreased risk of recurrence, and wherein the presence of junction −chr2:99786013_−chr2:99787892 in MITD1 is correlated with an increased risk of recurrence. The presence of the alternative spliced junction can be determined by whole transcriptome sequencing or reverse transcriptase polymerase chain reaction (RT-PCR).


In some embodiments, an isolated polynucleotide comprises a gene fusion, wherein the isolated polynucleotide comprises a sequence selected from SEQ ID NO:1 to SEQ ID NO:100 is provided


In other embodiments, an isolated polynucleotide comprises an alternative spliced junction selected from −chr12:24366277_−chr12:24048958; −chr9:114148657_−chr9:114154104; +chr18:39629569_+chr18:39623697; +chr1:155695810_chr1:155695173; and −chr2:99786013_−chr2:99787892 is provided.





BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1A shows an overview of the bioinformatics method for identifying a gene fusion and the bioinformatics method for identifying alternatively spliced genes from RNA sequencing data. FIG. 1B shows classification of candidate fusions into 3 tiers based on the levels of supporting evidence.



FIG. 2 depicts the individual templates present in the five template set described in Example 3: (1) fusion template; (2) donor template; (3) acceptor template; (4) donor genomic template; and (5) donor acceptor template. FIG. 2 also provides the equation used to calculate the interrupt ratio (IR).



FIG. 3 (FIG. 3.1-FIG. 3.105) shows the sequences of the 100 bp five template set (fusion template, donor template, acceptor template, donor genomic template, and acceptor genomic template) and any supporting reads for the gene fusion candidates identified using the bioinformatics approach. Sample information appears below the corresponding sequence information. FIG. 3 discloses SEQ ID NOS 358-3278, respectively, in order of appearance.



FIG. 4 (FIG. 4.1-FIG. 4.105) shows exon and intron expression data from the candidate gene fusions. The identifiers beginning with “X” represent patient samples identified to be positive for the indicated fusion. For example, in FIG. 4.1, patient sample X111TTAGGC_7 was identified as positive for the ACACA_MSI2 fusion.


The top panel shows a scatter plot of the expression data. The x-axis shows the number of normalized counts for the indicated fusion donor and the y-axis shows the number of normalized counts for the indicated fusion acceptor. The closed circles represent samples that are positive for the indicated fusion and the open circles shows the remaining samples in the cohort that are negative for the fusion.


The middle panel shows a bar plot of the expression of the exons and introns of the indicated fusion donor. The x-axis shows the relative genomic location of the donor's exons and introns and the y-axis shows the number of normalized counts by length. The vertical line separates the donor exons and introns into those that preserved in the fusion (black bars) and those that are discarded from the fusion (gray bars). As shown on the x-axis, the symbol “d” indicates that the exon or intron is a donor exon or intron; the symbol “l” indicates that the exon or intron is located to the left of the vertical line separating discarded and preserved exons and introns; the symbol “r” indicates that the exon or intron is located to the right of the vertical line separating discarded and preserved exons and introns.


The bottom panel shows a bar plot of the expression of the exons and introns of the indicated fusion acceptor. The x-axis shows the relative genomic location of the acceptor's exons and introns and the y-axis shows the number of normalized counts by length. The vertical line separates the acceptor exons and introns into those that preserved in the fusion (black bars) and those that are discarded from the fusion (gray bars). As shown on the x-axis, the symbol “a” indicates that the exon or intron is an acceptor exon or intron; the symbol “l” indicates that the exon or intron is located to the left of the vertical line separating discarded and preserved exons and introns; the symbol “r” indicates that the exon or intron is located to the right of the vertical line separating discarded and preserved exons and introns.



FIG. 5A shows the distributions of block age, cancer relapse and ER status according to fusion number categories in Providence and Rush cohorts. The archived block age was plotted as mean and standard deviation for each category. ER status was assessed by immunohistochemistry. The patient number for each category was labeled accordingly.



FIG. 5B shows Kaplan-Meier plots of each fusion number category demonstrating patients with multiple fusions had poor prognosis in Providence, and a similar trend existed in Rush. The log-rank p-values were indicated in Kaplan-Meier plots.



FIG. 5C shows that the differentially expressed genes between multiple fusion samples versus no fusion samples in Providence were mapped to the Reactome FI database and clustered into five core sub-modules via linker proteins (in grey shaded rectangles) by the Reactome FI Cytoscape Plugin. The connected 84 genes are referred as the fusion gene signature. Nodes were manually arranged to display the sub-modules properly. Edges displayed FI direction attribute values as the following, “→” for activating/catalyzing, “-|” for inhibition, “-” for FIs extracted from complexes or inputs, and “---” for predicted FIs.



FIG. 5D shows fusion signature indexes plotted for each of the fusion number categories in Providence and Rush. The fusion signature index is the average expression levels of 84 fusion gene signatures as shown in FIG. 5C. The base counts of each signature gene were normalized by library size then scaled across the patient cohort before averaged in the signature index. The p-values were derived from Wilcoxon tests.



FIG. 6 depicts Kaplan-Meier plots of patient subsets of Providence or Rush patients as a function of fusion numbers, segregated by block age. Either upper three quantiles or lower three quantiles based on block age were selected to examine the effect of the block ages on the disease outcome. The log-rank p-values are displayed.



FIG. 7 depicts Venn diagrams between differentially expressed genes from samples segregated by fusion numbers and ER status in Providence. The additive model of edgeR was used only for DE analysis between multiple fusion samples versus no fusion samples. The up-regulated and down-regulated gene numbers were labeled with each differential expression analysis. The overlapped gene numbers combined from both up-regulated and down-regulated genes were labeled according to each comparison. A. ER+ samples contribute to fusion gene signatures more than ERsamples in Providence, consistent with the expression evidence from heatmaps. The overlapping genes between fusion gene signatures and differentially expressed genes due to ER status difference in both multiple fusion and no fusion groups are small.



FIG. 8 shows the expression level of AKAP12 in patients of the Rush cohort.



FIG. 9 shows Protein domains of fusion ESR1→AKAP12 are illustrated based on UniProt database (www.uniprot.org). The red vertical line indicates the fusion position on the corresponding protein. The amino acid length and amino acid positions of each fusion position are labeled on the top of each protein. A. The protein domains of ESR1 protein P03372. B. The protein domains of AKAP12 protein Q02952. C. The protein domains of two predicted fusion proteins ESR1→AKAP12. The one amino acid insertion generated from the fusion event is labeled on each fusion protein.





DETAILED DESCRIPTION

Before the present invention and specific exemplary embodiments of the invention are described, it is to be understood that this invention is not limited to particular embodiments described, as such may, of course, vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting, since the scope of the present invention will be limited only by the appended claims.


Where a range of values is provided, it is understood that each intervening value, to the tenth of the unit of the lower limit unless the context clearly dictates otherwise, between the upper and lower limit of that range and any other stated or intervening value in that stated range is encompassed within the invention. The upper and lower limits of these smaller ranges may independently be included in the smaller ranges is also encompassed within the invention, subject to any specifically excluded limit in the stated range. Where the stated range includes one or both of the limits, ranges excluding either both of those included limits are also included in the invention.


As used herein and in the appended claims, the singular forms “a,” “an,” and “the” include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to “an RNA transcript” includes a plurality of such RNA transcripts.


Unless defined otherwise, technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. For example, Singleton et al., Dictionary of Microbiology and Molecular Biology 2nd ed., J. Wiley & Sons (New York, N.Y. 1994), provide one skilled in the art with a general guide to many of the terms used in the present application.


Additionally, the practice of the present invention will employ, unless otherwise indicated, conventional techniques of molecular biology (including recombinant techniques), microbiology, cell biology, and biochemistry, which are within the skill of the art. Such techniques are explained fully in the literature, such as, “Molecular Cloning: A Laboratory Manual”, 2nd edition (Sambrook et al., 1989); “Oligonucleotide Synthesis” (M. J. Gait, ed., 1984); “Animal Cell Culture” (R. I. Freshney, ed., 1987); “Methods in Enzymology” (Academic Press, Inc.); “Handbook of Experimental Immunology”, 4th edition (D. M. Weir & C. C. Blackwell, eds., Blackwell Science Inc., 1987); “Gene Transfer Vectors for Mammalian Cells” (J. M. Miller & M. P. Calos, eds., 1987); “Current Protocols in Molecular Biology” (F. M. Ausubel et al., eds., 1987); and “PCR: The Polymerase Chain Reaction”, (Mullis et al., eds., 1994).


One skilled in the art will recognize many methods and materials similar or equivalent to those described herein, which could be used in the practice of the present invention. Indeed, the present invention is in no way limited to the methods and materials described. For purposes of the present invention, the following terms are defined below.


The term “annotate” refers to adding biological information to a genome sequence.


The terms “cancer” and “cancerous” refer to or describe the physiological condition in mammals that is typically characterized by unregulated cell growth. Examples of cancer include but are not limited to, breast cancer, colon cancer, lung cancer, prostate cancer, hepatocellular cancer, gastric cancer, pancreatic cancer, cervical cancer, ovarian cancer, liver cancer, bladder cancer, cancer of the urinary tract, thyroid cancer, renal cancer, carcinoma, melanoma, and brain cancer.


The term “correlates” or “correlating” as used herein refers to a statistical association between instances of two events, where events may include numbers, data sets, and the like. For example, when the events involve numbers, a positive correlation (also referred to herein as a “direct correlation”) means that as one increases, the other increases as well. A negative correlation (also referred to herein as an “inverse correlation”) means that as one increases, the other decreases. The present invention provides gene fusions and alternative spliced junctions which may be correlated with a particular outcome measure. For example, the presence of a gene fusion or an alternative spliced junction may be positively correlated with a likelihood of a good clinical outcome for the patient, such as an increased likelihood of long-term survival without recurrence and/or a positive response to a chemotherapy, and the like. Such a positive correlation may be demonstrated statistically in various ways, e.g. by a low hazard ratio. In another example, the presence of a gene fusion or an alternative spliced junction may be negatively correlated with a likelihood of good clinical outcome for the patient. In this case, for example, the patient may have a decreased likelihood of long-term survival without recurrence of the cancer and/or a positive response to a chemotherapy, and the like. Such a negative correlation indicates that the patient likely has a poor prognosis or will respond poorly to a chemotherapy, and this may be demonstrated statistically in various ways, e.g., by a high hazard ratio.


As used herein, the term “exon” refers to any segment of an interrupted gene that is represented in the mature RNA product (B. Lewin. Genes IV Cell Press, Cambridge Mass. 1990). As used herein, the terms “intron” and “intronic sequence” refer to any non-coding region found within genes.


The term “expression product” as used herein refers to an expression product of a coding RNA transcript. Thus, the term refers to a polypeptide or protein.


As used herein, the term “intergenic region” refers to a stretch of DNA or RNA sequences located between clusters of genes that contain few or no genes. Intergenic regions are different from intragenic regions (or “introns”), which are non-coding regions that are found between exons within genes. An intergenic region may be comprised of one or more “intergenic sequences.”


As used herein, the term “gene fusion” refers to a chimeric molecule derived from two separate genes—a donor gene and an acceptor gene. The donor gene is generally located upstream of the acceptor gene. The regions of the donor gene and the acceptor gene that are present in the gene fusion are referred to herein as a “preserved region” of the donor gene and a “preserved region” of the acceptor gene, respectively. The regions of the donor gene and the acceptor gene that are not present in the gene fusion are referred to herein as a “discarded region” of the donor gene and a “discarded region” of the acceptor gene, respectively. A gene fusion may arise from a chromosomal aberration, such as a translocation, deletion, or inversion, within a chromosome or between chromosomes. A gene fusion may result in an expression product with a new or different function compared to the fusion partners. Alternatively, a proto-oncogene may be fused to a strong promoter, resulting in expression of an oncogene. A gene fusion is recurrent when it is present in samples from two or more patients with the same type of cancer, for example, breast cancer.


As used herein, the term “homology” with regard to template sequences, refers to the degree of similarity between two sequences. In some embodiments, a 300 bp donor template and a 300 bp acceptor template are homologous if they share sequence identity of more than 14 bp. In other embodiments, a 300 bp donor genomic template and a 300 bp acceptor genomic template are homologous if they share sequence identity of more than 14 bp.


As used herein, the term “isolated” refers to a molecule that is separated from other constituents. For example, an isolated DNA molecule may be cleaved from genomic DNA or synthesized to include a portion of a naturally occurring DNA molecule. Isolated DNA is a free-standing portion of the larger, natural DNA molecule. Isolated DNA molecules, therefore, are not naturally occurring DNA molecules or native DNA molecules.


As used herein, the term “level” refers to qualitative or quantitative determination of the number of reads of exons and introns in the genes that comprise a gene fusion. An exon or an intron exhibits an “increased level” when the level of the exon or intron is higher in a first sample, such as in a preserved acceptor or donor region of a gene fusion, than in a second sample, such as in a discarded acceptor or donor region of a gene fusion.


The term “long-term” survival as used herein refers to survival for at least 3 years. In other embodiments, it may refer to survival for at least 5 years, or for at least 10 years following surgery or other treatment.


As used herein, the term “pathology” of cancer includes all phenomena that comprise the well-being of the patient. This includes, without limitation, abnormal or uncontrollable cell growth, metastasis, interference with the normal functioning of neighboring cells, release of cytokines or other secretory products at abnormal levels, suppression or aggravation of inflammatory or immunological response, neoplasia, premalignancy, malignancy, invasion of surrounding or distant tissues or organs, such as lymph nodes.


A “patient response” may be assessed using any endpoint indicating a benefit to the patient, including, without limitation, (1) inhibition, to some extent, of tumor growth, including slowing down and complete growth arrest; (2) reduction in the number of tumor cells; (3) reduction in tumor size; (4) inhibition (i.e., reduction, slowing down or complete stopping) of tumor cell infiltration into adjacent peripheral organs and/or tissues; (5) inhibition (i.e. reduction, slowing down or complete stopping) of metastasis; (6) enhancement of anti-tumor immune response, which may, but does not have to, result in the regression or rejection of the tumor; (7) relief, to some extent, of one or more symptoms associated with the cancer; (8) increase in the length of survival following treatment; and/or (9) decreased mortality at a given point of time following treatment.


The term “polynucleotide” when used in singular or plural, generally refers to any polyribonucleotide or polydeoxribonucleotide, which may be unmodified RNA or DNA or modified RNA or DNA. Thus, for instance, polynucleotides as defined herein include, without limitation, single- and double-stranded DNA, DNA including single- and double-stranded regions, single- and double-stranded RNA, and RNA including single- and double-stranded regions, hybrid molecules comprising DNA and RNA that may be single-stranded or, more typically, double-stranded or include single- and double-stranded regions. In addition, the term “polynucleotide” as used herein refers to triple-stranded regions comprising RNA or DNA or both RNA and DNA. The strands in such regions may be from the same molecule or from different molecules. The regions may include all of one or more of the molecules, but more typically involve only a region of some of the molecules. One of the molecules of a triple-helical region often is an oligonucleotide. The term “polynucleotide” specifically includes cDNAs. The term includes DNAs (including cDNAs) and RNAs that contain one or more modified bases. Thus, DNAs or RNAs with backbones modified for stability or for other reasons are “polynucleotides” as that term is intended herein. Moreover, DNAs or RNAs comprising unusual bases, such as inosine, or modified bases, such as tritiated bases, are included within the term “polynucleotides” as defined herein. In general, the term “polynucleotide” embraces all chemically, enzymatically and/or metabolically modified forms of unmodified polynucleotides, as well as the chemical forms of DNA and RNA characteristic of viruses and cells, including simple and complex cells.


The term “prognosis” as used herein, refers to the prediction of the likelihood of cancer-attributable death or progression, including recurrence, metastatic spread, and drug resistance, of neoplastic disease, such as breast cancer. The term “prediction” is used herein to refer to the likelihood that a patient will respond either favorably or unfavorably to a drug or set of drugs, and also the extent of those responses, or that a patient will survive, following surgical removal of the primary tumor and/or chemotherapy for a certain period of time without cancer recurrence. The methods of the present invention can be used clinically to make treatment decisions by choosing the most appropriate treatment modalities for any particular patient. The methods of the present invention are tools in predicting if a patient is likely to respond favorably to a treatment regimen, such as surgical intervention, chemotherapy with a given drug or drug combination, and/or radiation therapy, or whether long-term survival of the patient without cancer recurrence is likely, following surgery and/or termination of chemotherapy or other treatment modalities.


The term “recurrence,” as used herein, refers to local or distant (metastasis) recurrence of cancer. For example, breast cancer can come back as a local recurrence (in the treated breast or near the tumor surgical site) or as a distant recurrence in the body. The most common sites of breast cancer recurrence include the lymph nodes, bones, liver, or lungs.


As used herein, the term “RefSeq RNA” refers to an RNA that can be found in the Reference Sequence (RefSeq) database, a collection of publicly available nucleotide sequences and their protein products built by the National Center for Biotechnology Information (NCBI). The RefSeq database provides an annotated, non-redundant record for each natural biological molecule (i.e. DNA, RNA or protein) included in the database. Thus, a sequence of a RefSeq RNA is well-known and can be found in the RefSeq database at http://www.ncbi.nlm.nih.gov/RefSeq/. See also Pruitt et al., Nucl. Acids Res. 33(Supp 1):D501-D504 (2005). Accession numbers for donor and acceptor RefSeq are provided in Table A.


As used herein, the term “RNA transcript” refers to the RNA transcription product of DNA and includes coding and non-coding RNA transcripts. RNA transcripts include, for example, mRNA, an unspliced RNA, a splice variant mRNA, a microRNA, fragmented RNA, long intergenic non-coding RNAs (lincRNAs), intergenic RNA sequences or regions, and intronic RNAs.


The terms “read” and “sequence read” are used interchangeably herein to refer to sequence information obtained from an RNA sequencing experiment. A read may comprise, for example, 50 bases to 150 bases, 50 bases to 100 bases, 50 bases to 55 bases, 55 bases to 60 bases, 60 bases to 65 bases, 65 bases to 70 bases, 70 bases to 75 bases, 75 bases to 80 bases, 80 bases to 85 bases, 85 bases to 90 bases, 90 bases to 95 bases, 95 bases to 100 bases, 100 bases to 105, 105 bases to 110, 110 bases to 115, 115 bases to 120, 120 bases to 125, 125 bases to 130, 130 bases to 135, 135 bases to 140, 140 bases to 145, or 145 bases to 150 bases. A read may be a single read or a paired-end read. A single read refers to a read that is sequenced from one end. A paired-end read refers to a read that is sequenced from both ends.


As used herein, the term “splicing” refers to the process of removing introns and joining exons from pre-mRNA to generate mRNA. The terms “splice site” and “splice junction” are used interchangeably to refer to a region where a splicing event takes place, for example, at an exon-intron junction in a pre-mRNA molecule. For example, a slice donor site may be present at the 5′ end of an intron and a splice acceptor site may be present at the 3′ end of an intron. As used herein, a “distant spliced site” includes sites used in splicing events that occur between different genes or chromosomes. Distant splicing events may also include splicing events occurring within the same gene, but in the opposite transcription direction. Distant splicing events may include translocations, inversions, and the like. As used herein, “alternative splicing” refers to a process whereby identical pre-mRNA molecules are spliced in various ways to yield different mRNA molecules. The different mRNA molecules may be translated into different protein isoforms.


In some embodiments, the alternative spliced junction is present UBXN7, SOX5, KIAA0368, PIKC3C, DAP3, or MITD1. In other embodiments, the alternative spliced junction within UBXN7 comprises the junction −chr3:196118684_−chr3:196129890; the alternative spliced junction within SOX5 comprises the junction −chr12:24366277_−chr12:24048958; the alternative spliced junction within KIAA0368 comprises the junction −chr9:114148657_−chr9:114154104; the alternative spliced junction within PIK3C3 comprises the junction+chr18:39629569_+chr18:39623697; the alternative spliced junction within DAP3 comprises the junction+chr1:155695810_chr1:155695173; and the alternative spliced junction within MITD1 comprises the junction −chr2:99786013_−chr2:99787892.


The terms “subject,” “individual,” and “patient” are used interchangeably herein to refer to a mammal being assessed for treatment and/or being treated. In an embodiment, the mammal is a human. The terms “subject,” “individual,” and “patient” thus encompass individuals having cancer (e.g., breast cancer), including those who have undergone or are candidates for resection (surgery) to remove cancerous tissue.


As used herein, the term “surgery” applies to surgical methods undertaken for removal of cancerous tissue, including mastectomy, lumpectomy, lymph node removal, sentinel lymph node dissection, prophylactic mastectomy, prophylactic ovary removal, cryotherapy, and tumor biopsy. The tumor samples used for the methods of the present invention may have been obtained from any of these methods.


As used herein, the term “template” refers to a nucleotide sequence against which another nucleotide sequence may be compared. The templates used in the methods of the present invention include (1) a fusion template comprising 50 or 150 bp of exonic sequence of a preserved region of a donor gene and 50 or 150 bp of exonic sequence of a preserved region of an acceptor gene, (2) a donor template comprising 50 or 150 bp of exonic sequence of a preserved region of a donor gene and 50 or 150 bp of exonic sequence of a discarded region of an donor gene, (3) an acceptor template comprising 50 or 150 bp of exonic sequence of a discarded region of a acceptor gene and 50 or 150 bp of exonic sequence of a preserved region of an acceptor gene, (4) a donor genomic template comprising 50 or 150 bp upstream genomic sequence of a donor splicing site and 50 or 150 bp downstream genomic sequence of a donor splicing site, and (5) an acceptor genomic template comprising 50 or 150 bp upstream genomic sequence of an acceptor splicing site and 50 or 150 bp downstream genomic sequence of an acceptor splicing site. In some embodiments, the method comprises determining the homology between various templates. In other embodiments, the method comprises aligning a read obtained from RNA sequencing of a biological sample to the templates and selecting the read that maps to the fusion template.


The term “tumor” as used herein, refers to all neoplastic cell growth and proliferation, whether malignant or benign, and all pre-cancerous and cancerous cells and tissues.


The term “tumor sample” as used herein refers to a sample comprising tumor material obtained from a cancer patient. The term encompasses tumor tissue samples, for example, tissue obtained by surgical resection and tissue obtained by biopsy, such as for example, a core biopsy or a fine needle biopsy. In a particular embodiment, the tumor sample is a fixed, wax-embedded tissue sample, such as a formalin-fixed, paraffin-embedded tissue sample. Additionally, the term “tumor sample” encompasses a sample comprising tumor cells obtained from sites other than the primary tumor, e.g., circulating tumor cells. The term also encompasses cells that are the progeny of the patient's tumor cells, e.g. cell culture samples derived from primary tumor cells or circulating tumor cells. The term further encompasses samples that may comprise protein or nucleic acid material shed from tumor cells in vivo, e.g., bone marrow, blood, plasma, serum, and the like. The term also encompasses samples that have been enriched for tumor cells or otherwise manipulated after their procurement and samples comprising polynucleotides and/or polypeptides that are obtained from a patient's tumor material.


As used herein, “whole transcriptome sequencing” refers to the use of high throughput sequencing technologies to sequence the entire transcriptome in order to get information about a sample's RNA content. Whole transcriptome sequencing can be done with a variety of platforms for example, the Genome Analyzer or HiSeq 2000/2500 (Illumina, Inc., San Diego, Calif.) and the SOLiD™ Sequencing System (Life Technologies, Carlsbad, Calif.). However, any platform useful for whole transcriptome sequencing may be used.


The term “RNA-Seq” or “transcriptome sequencing” or “RNA sequencing” refers to sequencing performed on RNA (or cDNA) instead of DNA, where typically, the primary goal is to measure expression levels, detect fusion transcripts, alternative splicing, and other genomic alterations that can be better assessed from RNA. RNA-Seq includes whole transcriptome sequencing as well as target specific sequencing.


The term “computer-based system,” as used herein, refers to the hardware means, software means, and data storage means used to analyze information. The minimum hardware of a patient computer-based system comprises a central processing unit (CPU), input means, output means, and data storage means. A skilled artisan can readily appreciate that many of the currently available computer-based system are suitable for use in the present invention and may be programmed to perform the specific measurement and/or calculation functions of the present invention.


To “record” data, programming or other information on a computer readable medium refers to a process for storing information, using any such methods as known in the art. Any convenient data storage structure may be chosen, based on the means used to access the stored information. A variety of data processor programs and formats can be used for storage, e.g. word processing text file, database format, etc.


A “processor” or “computing means” references any hardware and/or software combination that will perform the functions required of it. For example, any processor herein may be a programmable digital microprocessor such as available in the form of an electronic controller, mainframe, server or personal computer (desktop or portable). Where the processor is programmable, suitable programming can be communicated from a remote location to the processor, or previously saved in a computer program product (such as a portable or fixed computer readable storage medium, whether magnetic, optical or solid state device based). For example, a magnetic medium or optical disk may carry the programming, and can be read by a suitable reader communicating with each processor at its corresponding station.


The present invention provides gene fusions and alternative spliced junctions that are associated with breast cancer. These gene fusions are listed in Tables A and B and the alternative spliced junctions are provided in Table 5. The present invention also provides a method for identifying gene fusions and a method for identifying alternative spliced junctions in a biological sample obtained from a patient with cancer. The present invention further provides a method for predicting a gene fusion in a biological sample obtained from a patient with cancer.


The gene fusions and alternative spliced junctions and associated information provided by the present invention also have utility in the development of therapies to treat cancers and screening patients for inclusion in clinical trials. The gene fusions and alternative spliced junctions and associated information may further be used to design or produce a reagent that modulates the level or activity of the gene fusion and alternative spliced junction. Such reagents may include, but are not limited to, a drug, an antisense RNA, a small inhibitory RNA (siRNA), a ribozyme, a small molecule, a monoclonal antibody, and a polyclonal antibody.


In various embodiments of the methods of the present invention, various technological approaches are available for determining the presence of gene fusions or alternative spliced junctions, including, without limitation, whole transcriptome sequencing, RT-PCR, microarrays, and serial analysis of gene expression (SAGE), which are described in more detail below.


Correlating the Presence of a Gene Fusion or an Alternative Spliced Junction to a Clinical Outcome


One skilled in the art will recognize that there are many statistical methods that may be used to determine whether there is a correlation between an outcome of interest (e.g., likelihood of survival) and the presence of a gene fusion or an alternative spliced junction. This relationship can be presented as a continuous recurrence score (RS), or patients may be stratified into risk groups (e.g., low, intermediate, high). For example, a Cox proportional hazards regression model may fit to a particular clinical endpoint (e.g., RFI, DFS, OS). One assumption of the Cox proportional hazards regression model is the proportional hazards assumption, i.e. the assumption that effect parameters multiply the underlying hazard. Assessments of model adequacy may be performed including, but not limited to, examination of the cumulative sum of martingale residuals. One skilled in the art would recognize that there are numerous statistical methods that may be used (e.g., Royston and Parmer (2002), smoothing spline, etc.) to fit a flexible parametric model using the hazard scale and the Weibull distribution with natural spline smoothing of the log cumulative hazards function, with effects for treatment (chemotherapy or observation) and RS allowed to be time-dependent. (See, e.g., P. Royston, M. Parmer, Statistics in Medicine 21(15:2175-2197 (2002).)


In an exemplary embodiment, power calculations are carried out for the Cox proportional hazards model with a single non-binary covariate using the method proposed by F. Hsieh and P. Lavori, Control Clin Trials 21:552-560 (2000) as implemented in PASS 2008.


Methods of Assaying Gene Fusions and Alternative Spliced Junctions


Methods of assaying gene fusions and alternative spliced junctions include methods based on sequencing of polynucleotides, methods based on hybridization analysis of polynucleotides, and proteomics-based methods. Representative methods for sequencing-based analysis include Massively Parallel Sequencing (see e.g., Tucker et al., The American J. Human Genetics 85:142-154, 2009) and Serial Analysis of Gene Expression (SAGE). Exemplary methods known in the art for the quantification of mRNA expression in a sample include northern blotting and in situ hybridization (Parker & Barnes, Methods in Molecular Biology 106:247-283 (1999)); RNAse protection assays (Hod, Biotechniques 13:852-854 (1992)); and PCR-based methods, such as reverse transcription polymerase chain reaction (RT-PCR) (Weis et al., Trends in Genetics 8:263-264 (1992)). Antibodies may be employed that can recognize sequence-specific duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA hybrid duplexes or DNA-protein duplexes.


Nucleic Acid Sequencing-Based Methods


Nucleic acid sequencing technologies are suitable methods for expression analysis. The principle underlying these methods is that the number of times a cDNA sequence is detected in a sample is directly related to the relative RNA levels corresponding to that sequence. These methods are sometimes referred to by the term Digital Gene Expression (DGE) to reflect the discrete numeric property of the resulting data. Early methods applying this principle were Serial Analysis of Gene Expression (SAGE) and Massively Parallel Signature Sequencing (MPSS). See, e.g., S. Brenner, et al., Nature Biotechnology 18(6):630-634 (2000).


More recently, the advent of “next-generation” sequencing technologies has made DGE simpler, higher throughput, and more affordable. As a result, more laboratories are able to utilize DGE to screen the expression of more nucleic acids in more individual patient samples than previously possible. See, e.g., J. Marioni, Genome Research 18(9):1509-1517 (2008); R. Morin, Genome Research 18(4):610-621 (2008); A. Mortazavi, Nature Methods 5(7):621-628 (2008); N. Cloonan, Nature Methods 5(7):613-619 (2008). Massively parallel sequencing methods have also enabled whole genome or transcriptome sequencing, allowing the analysis of not only coding but also non-coding sequencees. As reviewed in Tucker et al., The American J. Human Genetics 85:142-154 (2009), there are several commercially available massively parallel sequencing platforms, such as the Illumina Genome Analyzer or HiSeq 2000/2500 (Illumina, Inc., San Diego, Calif.), Applied Biosystems SOLiD™ Sequencer (Life Technologies, Carlsbad, Calif.), Roche GS-FLX 454 Genome Sequencer (Roche Applied Science, Germany), and the Helicos® Genetic Analysis Platform (Helicos Biosciences Corp., Cambridge, Mass.). Other developing technologies may be used.


Reverse Transcriptase PCR (RT-PCR)


The starting material is typically total RNA isolated from a human tumor, usually from a primary tumor. Optionally, normal tissues from the same patient can be used as an internal control. RNA can be extracted from a tissue sample, e.g., from a sample that is fresh, frozen (e.g. fresh frozen), or fixed and paraffin-embedded (e.g. formalin-fixed).


General methods for RNA extraction are well known in the art and are disclosed in standard textbooks of molecular biology, including Ausubel et al., Current Protocols of Molecular Biology, John Wiley and Sons (1997). Methods for RNA extraction from paraffin embedded tissues are disclosed, for example, in Rupp and Locker, Lab Invest. 56:A67 (1987), and De Andrés et al., BioTechniques 18:42044 (1995). In particular, RNA isolation can be performed using a purification kit, buffer set and protease from commercial manufacturers, such as Qiagen, according to the manufacturer's instructions. For example, total RNA from cells in culture can be isolated using Qiagen RNeasy mini-columns. Other commercially available RNA isolation kits include MasterPure™ Complete DNA and RNA Purification Kit (EPICENTRE®, Madison, Wis.), and Paraffin Block RNA Isolation Kit (Ambion, Inc.). Total RNA from fresh frozen tissue samples can be isolated using RNA Stat-60 (Tel-Test). RNA prepared from a tumor sample can be isolated, for example, by cesium chloride density gradient centrifugation. The isolated RNA may then be depleted of ribosomal RNA as described in U.S. Pub. No. 2011/0111409.


The sample containing the RNA is then subjected to reverse transcription to produce cDNA from the RNA template, followed by exponential amplification in a PCR reaction. The two most commonly used reverse transcriptases are avian myeloblastosis virus reverse transcriptase (AMV-RT) and Moloney murine leukemia virus reverse transcriptase (MMLV-RT). The reverse transcription step is typically primed using specific primers, random hexamers, or oligo-dT primers, depending on the circumstances and the goal of the assay. For example, extracted RNA can be reverse-transcribed using a GeneAmp RNA PCR kit (Perkin Elmer, CA, USA), following the manufacturer's instructions. The derived cDNA can then be used as a template in the subsequent PCR reaction.


PCR-based methods use a thermostable DNA-dependent DNA polymerase, such as a Taq DNA polymerase. For example, TaqMan® PCR typically utilizes the 5′-nuclease activity of Taq or Tth polymerase to hydrolyze a hybridization probe bound to its target amplicon, but any enzyme with equivalent 5′ nuclease activity can be used. Two oligonucleotide primers are used to generate an amplicon typical of a PCR reaction product. A third oligonucleotide, or probe, can be designed to facilitate detection of a nucleotide sequence of the amplicon located between the hybridization sites of the two PCR primers. The probe can be detectably labeled, e.g., with a reporter dye, and can further be provided with both a fluorescent dye, and a quencher fluorescent dye, as in a TaqMan® probe configuration. Where a TaqMan® probe is used, during the amplification reaction, the Taq DNA polymerase enzyme cleaves the probe in a template-dependent manner. The resultant probe fragments disassociate in solution, and signal from the released reporter dye is free from the quenching effect of the second fluorophore. One molecule of reporter dye is liberated for each new molecule synthesized, and detection of the unquenched reporter dye provides the basis for quantitative interpretation of the data.


TaqMan® RT-PCR can be performed using commercially available equipment, such as, for example, ABI PRISM 7900™ Sequence Detection System™ (Perkin-Elmer-Applied Biosystems, Foster City, Calif., USA), or Lightcycler (Roche Molecular Biochemicals, Mannheim, Germany). In a preferred embodiment, the 5′ nuclease procedure is run on a real-time quantitative PCR device such as the ABI PRISM 7900™ Sequence Detection System™. The system consists of a thermocycler, laser, charge-coupled device (CCD), camera and computer. The system amplifies samples in a 384-well format on a thermocycler. The RT-PCR may be performed in triplicate wells with an equivalent of 2 ng RNA input per 10 μL-reaction volume. During amplification, laser-induced fluorescent signal is collected in real-time through fiber optics cables for all wells, and detected at the CCD. The system includes software for running the instrument and for analyzing the data.


5′-Nuclease assay data are generally initially expressed as a threshold cycle (“Ct”). Fluorescence values are recorded during every cycle and represent the amount of product amplified to that point in the amplification reaction. The threshold cycle (Ct) is generally described as the point when the fluorescent signal is first recorded as statistically significant.


To minimize errors and the effect of sample-to-sample variation, RT-PCR is usually performed using an internal standard. The ideal internal standard gene (also referred to as a reference gene) is expressed at a constant level among cancerous and non-cancerous tissue of the same origin (i.e., a level that is not significantly different among normal and cancerous tissues), and is not significantly affected by the experimental treatment (i.e., does not exhibit a significant difference in expression level in the relevant tissue as a result of exposure to chemotherapy). RNAs most frequently used to normalize patterns of gene expression are mRNAs for the housekeeping genes glyceraldehyde-3-phosphate-dehydrogenase (GAPDH) and β-actin. Gene expression measurements can be normalized relative to the mean of one or more (e.g., 2, 3, 4, 5, or more) reference genes. Reference-normalized expression measurements can range from 0 to 15, where a one unit increase generally reflects a 2-fold increase in RNA quantity.


Real time PCR is compatible both with quantitative competitive PCR, where an internal competitor for each target sequence is used for normalization, and with quantitative comparative PCR using a normalization gene contained within the sample, or a housekeeping gene for RT-PCR. For further details see, e.g. Held et al., Genome Research 6:986-994 (1996).


Design of PCR Primers and Probes


PCR primers and probes can be designed based upon exon, intron, or intergenic sequences present in the RNA transcript of interest. Primer/probe design can be performed using publicly available software, such as the DNA BLAT software developed by Kent, W. J., Genome Res. 12(4):656-64 (2002), or by the BLAST software including its variations.


Where necessary or desired, repetitive sequences of the target sequence can be masked to mitigate non-specific signals. Exemplary tools to accomplish this include the Repeat Masker program available on-line through the Baylor College of Medicine, which screens DNA sequences against a library of repetitive elements and returns a query sequence in which the repetitive elements are masked. The masked sequences can then be used to design primer and probe sequences using any commercially or otherwise publicly available primer/probe design packages, such as Primer Express (Applied Biosystems); MGB assay-by-design (Applied Biosystems); Primer3 (Steve Rozen and Helen J. Skaletsky (2000) Primer3 on the WWW for general users and for biologist programmers. In: Rrawetz S, Misener S (eds) Bioinformatics Methods and Protocols: Methods in Molecular Biology. Humana Press, Totowa, N.J., pp 365-386).


Other factors that can influence PCR primer design include primer length, melting temperature (Tm), and G/C content, specificity, complementary primer sequences, and 3′-end sequence. In general, optimal PCR primers are generally 17-30 bases in length, and contain about 20-80%, such as, for example, about 50-60% G+C bases, and exhibit Tm's between 50 and 80° C., e.g. about 50 to 70° C.


For further guidelines for PCR primer and probe design see, e.g. Dieffenbach, C W. et al, “General Concepts for PCR Primer Design” in: PCR Primer, A Laboratory Manual, Cold Spring Harbor Laboratory Press, New York, 1995, pp. 133-155; Innis and Gelfand, “Optimization of PCRs” in: PCR Protocols, A Guide to Methods and Applications, CRC Press, London, 1994, pp. 5-11; and Plasterer, T. N. Primerselect: Primer and probe design. Methods MoI. Biol. 70:520-527 (1997), the entire disclosures of which are hereby expressly incorporated by reference.


MassARRAY® System


In MassARRAY-based methods, such as the exemplary method developed by Sequenom, Inc. (San Diego, Calif.) following the isolation of RNA and reverse transcription, the obtained cDNA is spiked with a synthetic DNA molecule (competitor), which matches the targeted cDNA region in all positions, except a single base, and serves as an internal standard. The cDNA/competitor mixture is PCR amplified and is subjected to a post-PCR shrimp alkaline phosphatase (SAP) enzyme treatment, which results in the dephosphorylation of the remaining nucleotides. After inactivation of the alkaline phosphatase, the PCR products from the competitor and cDNA are subjected to primer extension, which generates distinct mass signals for the competitor- and cDNA-derived PCR products. After purification, these products are dispensed on a chip array, which is pre-loaded with components needed for analysis with matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOF MS) analysis. The cDNA present in the reaction is then quantified by analyzing the ratios of the peak areas in the mass spectrum generated. For further details see, e.g. Ding and Cantor, Proc. Natl. Acad. Sci. USA 100:3059-3064 (2003).


Other PCR-Based Methods


Further PCR-based techniques that can find use in the methods disclosed herein include, for example, BeadArray® technology (Illumina, San Diego, Calif.; Oliphant et al., Discovery of Markers for Disease (Supplement to Biotechniques), June 2002; Ferguson et al., Analytical Chemistry 72:5618 (2000)); BeadsArray for Detection of Gene Expression® (BADGE), using the commercially available LuminexlOO LabMAP® system and multiple color-coded microspheres (Luminex Corp., Austin, Tex.) in a rapid assay for gene expression (Yang et al., Genome Res. 11:1888-1898 (2001)); and high coverage expression profiling (HiCEP) analysis (Fukumura et al., Nucl. Acids. Res. 31(16) e94 (2003).


Microarrays


In this method, polynucleotide sequences of interest (including cDNAs and oligonucleotides) are arrayed on a substrate. The arrayed sequences are then contacted under conditions suitable for specific hybridization with detectably labeled cDNA generated from RNA of a sample. The source of RNA typically is total RNA isolated from a tumor sample, and optionally from normal tissue of the same patient as an internal control or cell lines. RNA can be extracted, for example, from frozen or archived paraffin-embedded and fixed (e.g. formalin-fixed) tissue samples.


For example, PCR amplified inserts of cDNA clones of a gene to be assayed are applied to a substrate in a dense array. Usually at least 10,000 nucleotide sequences are applied to the substrate. For example, the microarrayed genes, immobilized on the microchip at 10,000 elements each, are suitable for hybridization under stringent conditions. Fluorescently labeled cDNA probes may be generated through incorporation of fluorescent nucleotides by reverse transcription of RNA extracted from tissues of interest. Labeled cDNA probes applied to the chip hybridize with specificity to each spot of DNA on the array. After washing under stringent conditions to remove non-specifically bound probes, the chip is scanned by confocal laser microscopy or by another detection method, such as a CCD camera. Quantitation of hybridization of each arrayed element allows for assessment of corresponding mRNA abundance.


With dual color fluorescence, separately labeled cDNA probes generated from two sources of RNA are hybridized pair wise to the array. The relative abundance of the transcripts from the two sources corresponding to each specified gene is thus determined simultaneously. The miniaturized scale of the hybridization affords a convenient and rapid evaluation of the expression pattern for large numbers of genes. Such methods have been shown to have the sensitivity required to detect rare transcripts, which are expressed at a few copies per cell, and to reproducibly detect at least approximately two-fold differences in the expression levels (Schena et at, Proc. Natl. Acad. Sci. USA 93(2):106-149 (1996)). Microarray analysis can be performed on commercially available equipment, following the manufacturer's protocols, such as by using the Affymetrix GenChip® technology, or Incyte's microarray technology.


Isolating RNA from Body Fluids


Methods of isolating RNA for expression analysis from blood, plasma and serum (see for example, Tsui N B et al. (2002) Clin. Chem. 48, 1647-53 and references cited therein) and from urine (see for example, Boom R et al. (1990) J Clin Microbiol. 28, 495-503 and reference cited therein) have been described.


Immunohistochemistry


Immunohistochemistry methods are also suitable for detecting the presence of gene fusions and alternative spliced junctions and applied to the method disclosed herein. Antibodies (e.g., monoclonal antibodies) that specifically bind a gene product of a gene of interest can be used in such methods. The antibodies can be detected by direct labeling of the antibodies themselves, for example, with radioactive labels, fluorescent labels, hapten labels such as biotin, or an enzyme such as horse radish peroxidase or alkaline phosphatase. Alternatively, unlabeled primary antibody can be used in conjunction with a labeled secondary antibody specific for the primary antibody. Immunohistochemistry protocols and kits are well known in the art and are commercially available.


Proteomics


The term “proteome” is defined as the totality of the proteins present in a sample (e.g. tissue, organism, or cell culture) at a certain point of time. Proteomics includes, among other things, study of the global changes of protein expression in a sample (also referred to as “expression proteomics”). Proteomics typically includes the following steps: (1) separation of individual proteins in a sample by 2-D gel electrophoresis (2-D PAGE); (2) identification of the individual proteins recovered from the gel, e.g. my mass spectrometry or N-terminal sequencing, and (3) analysis of the data using bioinformatics.


General Description of the RNA Isolation and Preparation from Fixed, Paraffin-Embedded Samples for Whole Transcriptome Sequencing


The steps of a representative protocol for profiling gene expression levels using fixed, paraffin-embedded tissues as the RNA source are provided in various published journal articles. (See, e.g., T. E. Godfrey et al., J. Molec. Diagnostics 2: 84-91 (2000); K. Specht et al., Am. J. Pathol. 158: 419-29 (2001), M. Cronin, et al., Am J Pathol 164:35-42 (2004)). Modified methods can used for whole transcriptome sequencing as described in the Examples section. Briefly, a representative process starts with cutting a tissue sample section (e.g. about 10 μm thick sections of a paraffin-embedded tumor tissue sample). The RNA is then extracted, and ribosomal RNA may be deleted as described in U.S. Pub. No. 2011/0111409. cDNA sequencing libraries may be prepared that are directional and allowed for single or paired-end sequencing using commercially available kits such as the ScriptSeg™ mRNA-Seq Library Preparation Kit (Illumina). The libraries may also be barcoded for multiplex sequencing using commercially available barcode primers such as the RNA-Seq Barcode Primers from Illumina. PCR is then carried out to generate the second strand of cDNA to incorporate the barcodes and to amplify the libraries. After the libraries are quantified, the sequencing libraries may be sequenced as described herein.


Kits of the Invention


The materials for use in the methods of the present invention are suited for preparation of kits produced in accordance with well known procedures. The present invention thus provides kits comprising agents, which may include primers and/or probes, for quantitating the level of the disclosed gene fusions or alternative spliced junctions or their expression products via methods such as whole transcriptome sequencing or RT-PCR for predicting prognostic outcome. Such kits may optionally contain reagents for the extraction of RNA from tumor samples, in particular, fixed paraffin-embedded tissue samples and/or reagents for whole transcriptome sequencing. In addition, the kits may optionally comprise the reagent(s) with an identifying description or label or instructions relating to their use in the methods of the present invention. The kits may comprise containers (including microliter plates suitable for use in an automated implementation of the method), each with one or more of the various reagents (typically in concentrated form) utilized in the methods, including, for example, pre-fabricated microarrays, buffers, the appropriate nucleotide triphosphates (e.g., dATP, dCTP, dGTP and dTTP; or rATP, rCTP, rGTP and UTP), reverse transcriptase, DNA polymerase, RNA polymerase, and one or more probes and primers of the present invention (e.g., appropriate length poly(T) or random primers linked to a promoter reactive with the RNA polymerase). Mathematical algorithms used to estimate or quantify prognostic information are also potential components of kits.


Reports


The methods of this invention are suited for the preparation of reports summarizing the findings of the methods of the present invention. A “report” as described herein, is an electronic or tangible document that includes elements that provide information of interest relating the presence of gene fusions in a sample, the presence of alternative spliced junctions in a sample, or a likelihood assessment and its results. A subject report can be completely or partially electronically generated, e.g., presented on an electronic display (e.g., computer monitor). A report can further include one or more of: 1) information regarding the testing facility; 2) service provider information; 3) patient data; 4) sample data; 5) an interpretive report, which can include various information including: a) indication; b) test data, where test data can include information regarding the presence of a gene fusion or alternative spliced junction of interest, and 6) other features.


The present invention therefore provides methods of creating reports and the reports resulting therefrom. The report may include a summary of the gene fusions or alternative spliced junctions, in the cells obtained from the patient's tumor sample. The report may include a prediction that the patient has an increased likelihood of breast cancer recurrence or the report may include a prediction that the subject has a decreased likelihood of breast cancer recurrence. The report may include a recommendation for a treatment modality such as surgery alone or surgery in combination with chemotherapy. The report may be presented in electronic format or on paper.


Thus, in some embodiments, the methods of the present invention further include generating a report that includes information regarding the patient's likelihood of long-term survival without breast cancer recurrence. For example, the methods of the present invention can further include a step of generating or outputting a report providing the results of a patient response likelihood assessment, which can be provided in the form of an electronic medium (e.g., an electronic display on a computer monitor), or in the form of a tangible medium (e.g., a report printed on paper or other tangible medium).


A report that includes information regarding the likelihood that a patient will exhibit breast cancer recurrence is provided to a user. An assessment as to the likelihood that a cancer patient will exhibit breast cancer recurrence is referred to as a “likelihood assessment.” A person or entity who prepares a report (“report generator”) may also perform the likelihood assessment. The report generator may also perform one or more of sample gathering, sample processing, and data generation, e.g., the report generator may also perform one or more of: a) sample gathering; b) sample processing; and c) determining the presence of a gene fusion or an alternative spliced junction. Alternatively, an entity other than the report generator can perform one or more sample gathering, sample processing, and data generation.


The term “user” or “client” refers to a person or entity to whom a report is transmitted, and may be the same person or entity who does one or more of the following: a) collects a sample; b) processes a sample; c) provides a sample or a processed sample; and d) generates data for use in the likelihood assessment. In some cases, the person or entity who provides sample collection and/or sample processing and/or data generation, and the person who receives the results and/or report may be different persons, but are both referred to as “users” or “clients.” In certain embodiments, e.g., where the methods are completely executed on a single computer, the user or client provides for data input and review of data output. A “user” can be a health professional (e.g., a clinician, a laboratory technician, a physician (e.g., an oncologist, surgeon, pathologist), etc.).


In embodiments where the user only executes a portion of the method, the individual who, after computerized data processing according to the methods of the invention, reviews data output (e.g., results prior to release to provide a complete report, a complete, or reviews an “incomplete” report and provides for manual intervention and completion of an interpretive report) is referred to herein as a “reviewer.” The reviewer may be located at a location remote to the user (e.g., at a service provided separate from a healthcare facility where a user may be located).


Where government regulations or other restrictions apply (e.g., requirements by health, malpractice, or liability insurance), all results, whether generated wholly or partially electronically, are subjected to a quality control routine prior to release to the user.


Computer-Based Systems and Methods


The methods and systems described herein can be implemented in numerous ways. In one embodiment of the invention, the methods involve use of a communications infrastructure, for example, the internet. Several embodiments of the invention are discussed below. The present invention may also be implemented in various forms of hardware, software, firmware, processors, or a combination thereof. The methods and systems described herein can be implemented as a combination of hardware and software. The software can be implemented as an application program tangibly embodied on a program storage device, or different portions of the software implemented in the user's computing environment (e.g., as an applet) and on the reviewer's computing environment, where the reviewer may be located at a remote site (e.g., at a service provider's facility).


In an embodiment of the invention, during or after data input by the user, portions of the data processing can be performed in the user-side computing environment. For example, the user-side computing environment can be programmed to provide for defined test codes to denote a likelihood “score,” where the score is transmitted as processed or partially processed responses to the reviewer's computing environment in the form of test code for subsequent execution of one or more algorithms to provide a result and/or generate a report in the reviewer's computing environment. The score can be a numerical score (representative of a numerical value) or a non-numerical score representative of a numerical value or range of numerical values (e.g., “A”: representative of a 90-95% likelihood of a positive response; “High”: representative of a greater than 50% chance of a positive response (or some other selected threshold of likelihood); “Low”: representative of a less than 50% chance of a positive response (or some other selected threshold of likelihood), and the like.


As a computer system, the system generally includes a processor unit. The processor unit operates to receive information, which can include test data (e.g., the presence of a gene fusion or an alternative spliced junction) and may also include other data such as patient data. This information received can be stored at least temporarily in a database, and data analyzed to generate a report as described above.


Part or all of the input and output data can also be sent electronically. Certain output data (e.g., reports) can be sent electronically or telephonically (e.g., by facsimile, using devices such as fax back). Exemplary output receiving devices can include a display element, a printer, a facsimile device and the like. Electronic forms of transmission and/or display can include email, interactive television, and the like. In an embodiment of the invention, all or a portion of the input data and/or output data (e.g., usually at least the final report) are maintained on a web server for access, preferably confidential access, with typical browsers. The data may be accessed or sent to health professionals as desired. The input and output data, including all or a portion of the final report, can be used to populate a patient's medical record that may exist in a confidential database as the healthcare facility.


The present invention also contemplates a computer-readable storage medium (e.g., CD-ROM, memory key, flash memory card, diskette, etc.) having stored thereon a program which, when executed in a computing environment, provides for implementation of algorithms to carry out all or a portion of the results of a likelihood assessment as described herein. Where the computer-readable medium contains a complete program for carrying out the methods described herein, the program includes program instructions for collecting, analyzing and generating output, and generally includes computer readable code devices for interacting with a user as described herein, processing that data in conjunction with analytical information, and generating unique printed or electronic media for that user.


Where the storage medium includes a program that provides for implementation of a portion of the methods described herein (e.g., the user-side aspect of the methods (e.g., data input, report receipt capabilities, etc.)), the program provides for transmission of data input by the user (e.g., via the internet, via an intranet, etc.) to a computing environment at a remote site. Processing or completion of processing of the data is carried out at the remote site to generate a report. After review of the report, and completion of any needed manual intervention, to provide a complete report, the complete report is then transmitted back to the user as an electronic document or printed document (e.g., fax or mailed paper report). The storage medium containing a program according to the invention can be packaged with instructions (e.g., for program installation, use, etc.) recorded on a suitable substrate or a web address where such instructions may be obtained. The computer-readable storage medium can also be provided in combination with one or more reagents for carrying out a likelihood assessment (e.g., primers, probes, arrays, or such other kit components).


Having described the invention, the same will be more readily understood through reference to the following Examples, which are provided by way of illustration, and are not intended to limit the invention in any way. All citations through the disclosure are hereby expressly incorporated by reference.


Example 1
Materials and Methods—Providence Cohort

Patients


One hundred and thirty-six primary breast cancer FFPE tumor specimens with clinical outcomes were provided by Providence St. Joseph Medical Center (Burbank, Calif.), with institutional review board approval. The time to first recurrence of breast cancer or death due to breast cancer (including death due to unknown cause) was determined from these records. Patients who were still alive without breast cancer recurrence or who died due to known other causes were considered censored at the time of last follow-up or death. These tumor specimens were used for biomarker discovery in the development of the Oncotype DX® assay. See e.g., U.S. Pat. No. 7,081,340; S. Paik et al., The New England Journal of Medicine 351, 2817 (2004). For the present study, 136 specimens had adequate RNA remaining. Among the 136 patients, 26 experienced breast cancer recurrence or death due to breast cancer. Clinical characteristics of the patients in the Providence cohort are described in Sinicropi et al., PLoS ONE 7(7):e40092 (2012) which is incorporated by reference in its entirety.


RNA-Seq Sample Preparation and Sequencing


Transcriptome RNA-Seq analysis of the Providence cohort is described in Sinicropi et al., PLoS ONE 7(7):e40092 (2012). Total RNA was prepared from three 10-μm-thick sections of FFPE tumor tissue as previously described using the MasterPure™ Purification Kit (Epicentre® Biotechnologies, Madison, Wis.). M. Cronin et al., The American Journal of Pathology 164, 35 (January 2004). One hundred nanograms of the isolated RNA were depleted of ribosomal RNA as described. See U.S. Pub. No. 2011/0111409. Sequencing libraries for whole transcriptome analysis were prepared using ScriptSeg™ mRNA-Seq Library Preparation Kits (Epicentre® Biotechnologies, Madison, Wis.). During the cDNA synthesis step, additional incubation for 90 minutes at 37° C. was implemented in the reverse transcription step to increase library yield. After 3′-terminal tagging, the di-tagged cDNA was purified using MinElute® PCR Purification Kits (Qiagen, Valencia, Calif.). Two 6 base index sequences were used to prepare barcoded libraries for duplex sequencing (RNA-Seq Barcode Primers; Epicentre® Biotechnologies, Madison, Wis.). PCR was carried out through 16 cycles to generate the second strand of cDNA, incorporate barcodes, and amplify libraries. The amplified libraries were size-selected by a solid phase reversible immobilization, paramagnetic bead-based process (Agencourt® AMPure® XP System; Beckman Coulter Genomics, Danvers, Mass.). Libraries were quantified by PicoGreen® assay (Life Technologies, Carlsbad, Calif.) and visualized with an Agilent Bioanalyzer using a DNA 1000 kit (Agilent Technologies, Waldbronn, Germany).


Two RNA-Seq libraries with different index barcodes were loaded into each lane of flow cells. The cluster generation in flow cells was carried out in an Illumina cBOT™ instrument using TruSeq™ SR Cluster Kits v2 following the manufacturer's protocol (Illumina Inc.; San Diego, Calif.). The flow cells were subsequently transferred to an Illumina HiSeq®2000 instrument (Illumina, Inc.) for sequence analysis using TruSeq SBS Kit v3-HS (50 cycles) following the manufacturer's protocol. The single-read runs were carried out for a total of 57 cycles including 7 cycles for the index sequences and 50 cycles (i.e. 50 bases) for the insert sequences.


Data Quality Assessment


Each sequencing lane was duplexed with two patient sample libraries using a 6 base barcode to differentiate between them. The mean read ratio+/−SD between the two samples in each lane was 1.05±0.38 and the mean+/−SD percentage of un-discerned barcodes was 2.08%±1.63%. Using principal components analysis and other exploratory data analysis methods, no systematic differences were found among samples associated with flow cell or barcode.


In a run-in phase of the study, duplicate libraries were prepared for 8 samples selected at random from the study set of 136. RefSeq RNA coverage for these libraries ranged between 3.1M and 6.7M uniquely mapped reads. Log count Pearson correlations among duplicate libraries ranged between 0.947 and 0.985. Single libraries were prepared for the remaining 128 samples and distributed in duplex mode among the lanes of 8 flow-cells. Sequencing in 3 lanes failed. Two libraries had low yield, resulting in low coverage. Three lanes were flagged by various Illumina process monitoring indices: low Q30 (coverage=2.8M and 4.2M), high cluster density (coverage=1.6M and 1.8M), or inadequate imaging (coverage=3.3M and 3.1M). For the remaining lanes, sample coverage ranged between 2.5M and 7.3M reads. New libraries for the samples that had low yield were prepared and sequenced. Libraries in the failed and flagged lanes, as well as some of the low coverage samples, were re-sequenced. Replicate correlations among all sequenced samples were very high, 0.985 for the samples with the high cluster density in the original run, and over 0.990 for all others. For the analysis data set, data for one of each of the duplicate libraries from the run-in experiment were kept. For the samples for which new libraries were prepared and for the samples in the failed and flagged lanes, the reads from the subsequent run were used. For the samples with low coverage for which the library was reprocessed, reads from the two runs were pooled. For the rest of the samples, the reads from the single lane were used. Results differed little when other data analysis procedures were used, for example, using only the second run when libraries were reprocessed.


Example 2
Rush Cohort

78 patient samples as described in Cobleigh et al., Clin. Cancer Res. 11:8623-8631 (2005) and in U.S. Pat. No. 7,569,345 were obtained from women with invasive breast cancer and ≥10 positive nodes with no evidence of metastatic disease who had surgery at Rush University Medical Center from 1979 to 1999. Clinical outcome data were available for all patients. Patients who were still alive without breast cancer recurrence or who died due to known other causes were considered censored at the time of last follow-up or death. For the present study, 76 specimens had adequate RNA remaining for RNA-Seq.


Example 3
Bioinformatics Approach to Identifying Gene Fusions

An overview of the bioinformatics approach used to identify gene fusions in samples from the Providence and Rush cohorts is depicted in FIG. 1A. The pipeline was developed in LINUX SHELL, PERL or R languages, and the data processing was on LINUX clusters. Data from the Providence and Rush cohorts was assessed separately in Steps 1-3, which are described herein. In Step 4, data from the Providence and Rush were merged together. However the expression profiling step was carried out separately within each cohort considering inter-cohort differences in block archive ages and library quality (FIG. 1B).


The underlying gene fusion method is based on the detection of distant splicing within a single read feature of a RNA-seq aligner GSNAP (Wu, T. D. and Nacu, S. (2010) Fast and SNP-tolerant detection of complex variants and splicing in short reads. Bioinformatics, 26, 873-881). The utility of GSNAP for gene fusion detection has been demonstrated in gene fusion detection methods such as GSTRUCT-fusions and GFP (Seshagiri, S. et al. (2012) Recurrent R-spondin fusions in colon cancer. Nature, 488, 660-664; Ju, Y. S. et al. (2012) A transforming KIF5B and RET gene fusion in lung adenocarcinoma revealed from whole-genome and transcriptome sequencing. Genome Res., 22, 436-445). Both methods depend on GSNAP to provide fusion read candidates, and apply a set of filtering modules to remove false positives in paired-end RNA-seq datasets. In RNA-seq paired end libraries prepared from fresh frozen tissue, bridging reads mapped to each side of fusion junction sites provide a very powerful filter in both GSTRUCT and GFP approaches, therefore single end read datasets are disadvantaged. To compensate for the short FFPE RNA length with median library size around 100 bp in Providence, we leverage data from the two patient cohorts as shown in FIG. 1A. The sample based strategy analyzes each RNA-seq sample individually and nominates candidate fusion junctions for the following cohort based strategy, which confirms the presence of each of fusion candidates in each individual sample across the whole cohort by examining read alignment and expression profiling evidence. To increase the possibilities of identifying recurrent gene fusions across the two cohorts studied here, fusion candidate templates provided by the sample based strategy from both Providence and Rush were merged together in the beginning step of the cohort based strategy (FIG. 1B). However the expression profiling step was carried out separately within each cohort considering inter-cohort differences in block archive ages and library quality (FIG. 1B). The insert size and complexity of the Providence cohort libraries is higher than that of the Rush cohort libraries, so here we describe results from the Providence RNA-seq dataset (Sinicropi, D. et al. (2012) Whole Transcriptome RNA-Seq Analysis of Breast Cancer Recurrence Risk Using Formalin-Fixed Paraffin-Embedded Tumor Tissue. PLoS ONE, 7, e40092) to illustrate the performance of the cohort based computational approach.


Step 1: Mapping FASTQ Files to the Human Genome Using GSNAP


Raw sequencing data from the Providence and Rush cohorts were converted to FASTQ files using CASAVA software. The FASTQ files were mapped to the human genome (version GHCh37/hg19) along with RefSeq splicing sites using the RNA-Seq aligner GSNAP. An important feature of GSNAP is its ability to detect a distant spliced junction within a single read. Local spliced junctions derive from splicing events within a single gene in a consistent transcription direction, whereas distant sliced junctions derive from splicing events between different genes or chromosomes. Distant splicing events can also include splicing events occurring within the same gene, but in the opposite transcription direction. Distant splicing events, therefore, include translocations, inversions, and the like.


Two filters were installed to remove low quality and unwanted reads. Good quality reads were identified as reads in which at least 30% of the bases have Sanger quality score 20 or above. Reads failing this threshold were flagged as low-quality and removed from alignment (BAM) files. Simultaneously, a number of abundant sequences including biological sequences (e.g., ribosomal RNA and mitochondrial sequences), and sequences introduced during library prep (e.g., phiX), were considered undesirable for gene fusion detection and were removed from alignment (BAM) files. Only reads passing both filtering thresholds and uniquely mapped to human genome were retained for the further analyses. Such reads were considered distant spliced reads.


The uniquely mapped reads in the cleaned BAM files were converted to (non-normalized) gene feature counts that provide expression values for exonic, intronic, and intergenic regions. The gene feature count is the number of aligned bases from reads mapped within the feature region. The gene level expression values were calculated by aggregating counts for exonic or intronic features. These gene feature counts are referred to as “gene tables” in FIG. 1.


Step 2: Retesting Reads Using GSNAP


In order to remove false positives, reads that mapped to the human genome in Step 1 were retested using GSNAP parameters that favor local alignment. Each alignment from the GSNAP rerun was examined, and any reads meeting all following criteria were considered as having false positive distant splicing reads in the original GSNAP output, and thus removed for the further analyses: (1) the total matched length was 44 bp or more; (2) the insertion length was 1 bp or 0; and (3) the deletion length was 1 bp or 0. For the Providence cohort, Step 2 filtered out 18% of the distant spliced junctions. Reads that successfully passed through this step were considered to include a distant spliced junction.


Step 3: Extracting Gene Fusions


In Step 3, the resulting distant splicing junctions were then annotated and candidate gene fusions were selected. Specifically, the alignments of reads that passed GSNAP re-testing step were examined, and reads with any mismatches within 5 bp of the distant splicing junction site or mapped to the anti-sense strand of annotated genes were removed from further analyses. The remaining reads were grouped according to the distant splicing junction sites, and each junction site was annotated based on UCSC refseq sequence annotation. Junctions mapped to a pseudogene or multiple mapped refseq genes were removed. At this stage, candidate gene fusions met one of the following criteria: (1) they mapped to different chromosomes; (2) they mapped to different refseq genes; (3) they were in opposite directions on same chromosome; or (4) they were at least 1 MB apart if on the same chromosome. For the Providence cohort, Step 3 filtered out 20% of the distant spliced junctions. Steps 1-3 make up the sample based approach identified in FIG. 1.


Step 4: Building Templates


A five template set was created to remove false positives introduced by homologous template sequences and to enable accurate mapping of supporting reads. At this stage, information from both the Providence and Rush cohorts was combined. The features of the five template set are depicted in FIG. 2. The five template set included the following individual templates, each of which included 100 bp:

    • 1. Fusion template: The 50 bp exonic sequence of the preserved region of donor gene plus 50 bp exonic sequence of the preserved region of acceptor gene,
    • 2. Donor template: The 50 bp exonic sequence of the preserved region of donor gene plus 50 bp exonic sequence of the discarded region of donor gene,
    • 3. Acceptor template: The 50 bp exonic sequence of the discarded region of acceptor gene plus 50 bp exonic sequence of the preserved region of acceptor gene,
    • 4. Donor genomic template: The 50 bp upstream genomic sequence of donor splicing site plus 50 bp downstream genomic sequence of donor splicing site, and
    • 5. Acceptor genomic template: The 50 bp upstream genomic sequence of acceptor splicing site plus 50 bp downstream genomic sequence of acceptor splicing site.


Donor and acceptor exon only or genomic containing template sequences were used as controls. The sequence of each template in the 5 template set was retrieved and annotated for each candidate gene fusion. Candidate gene fusions were considered to be false positives and were removed if any of its templates had the identical sequence, but were mapped to different locations on the human genome.


BLAST was used to investigate the homology of the remaining candidate gene fusions. A second five template set for each of the remaining candidate gene fusions was created. This template included the following individual templates, each of which included 300 bp:

    • a. Fusion template: The 150 bp exonic sequence of the preserved region of donor gene plus 150 bp exonic sequence of the preserved region of acceptor gene,
    • b. Donor template: The 150 bp exonic sequence of the preserved region of donor gene plus 150 bp exonic sequence of the discarded region of donor gene,
    • c. Acceptor template: The 150 bp exonic sequence of the discarded region of acceptor gene plus 150 bp exonic sequence of the preserved region of acceptor gene,
    • d. Donor genomic template: The 150 bp upstream genomic sequence of donor splicing site plus 150 bp downstream genomic sequence of donor splicing site, and
    • e. Acceptor genomic template: The 150 bp upstream genomic sequence of acceptor splicing site plus 150 bp downstream genomic sequence of acceptor splicing site.


Homology between (b) the donor template and (c) the acceptor template and between (d) the donor genomic template and (e) the acceptor genomic template was evaluated. The fusion template (a) was used to provide sequence information for RT-PCR experiments. Any candidate gene fusion meeting the following criteria was removed from further analysis: (1) sequence identity of more than 14 bp of 300 bp of the donor template and acceptor template; (2) sequence identity of more than 14 bp of 300 bp of the donor genomic template and acceptor genomic template; and (3) less than 50 bp exonic sequence on either side of fusion, donor, or acceptor template sequences. This step removed 27% of the candidate gene fusions from the Providence cohort.


Step 5: Retrieving Reads


The templates from the remaining candidate gene fusions from both the Providence and the Rush cohorts were used to create a genomic index using a tool from the GSNAP package. Based on the genomic location of all candidate fusion template sets, all short reads mapped near any junction sites and reads not mapped in the original GSNAP BAM file per RNA-seq library were selected. The selected reads were re-mapped into the built template genomic index by GSNAP with the splicing detection parameter turn off. The alignments between supporting reads and fusion templates were screened to allow minimum of 5 bp exact match sequence as overhang across the junction site.


Only reads uniquely mapped to the fusion template were kept. Reads were examined according to the below parameters to determine if they mapped to the fusion template with good quality: (1) no mismatch around 5 bp of junctions of any template; (2) number of reads with indels is no more than 75% of all reads mapped to a given template; (3) reads with splicing evidence in the original BAM files were not allowed to be mapped to fusion template; (4) no splicing or distant splicing allowed on these 100 bp template sequences; (5) no soft clipping of more than 3 bp on each read side; (6) no deletion more than 1 bp at a given indel location; and (7) no insertion more than 1 bp at a given indel location. Step 5 filtered out 5% of the candidate gene fusions from the Providence cohort.



FIG. 3 shows output reads for the gene fusions listed in Tables A and B. Because there were several gene fusion junctions having the same sequence, FIG. 3 shows information regarding 105 gene fusion junctions while Tables A and B list 100 gene fusion candidates.


Step 6: Expression Profiling


Expression profiles provide additional evidence for gene fusions. The utilization of expression data for gene fusion detection is a feature of the COPA (Cancer Outlier Profiling Analysis) method that was devised for analysis of microarray databases (Tomlins et al., 2005). Cancer related genes identified as expression outliers in microarray experiments led to the discovery of TMPRSS2 to ETS transcription factors, the first known recurrent gene fusions in common solid carcinomas. Gene fusion RNAs are expected to exhibit a marked expression discontinuity between the preserved side and discarded side of a given fusion junction, compared to samples without that gene fusion. Recently published gene fusions detected using RNA-seq have displayed this discrete expression pattern at acceptor fusion junction sites under RNA-seq platform (Lipson, D. et al. (2012) Identification of new ALK and RET gene fusions from colorectal and lung cancer biopsies. Nature Medicine, 18, 382-384; Ju, Y. S. et al. (2012) A transforming KIF5B and RET gene fusion in lung adenocarcinoma revealed from whole-genome and transcriptome sequencing. Genome Res., 22, 436-445). Multiple bioinformatics approaches including FusionSeq (Sboner, A. et al. (2010) FusionSeq: a modular framework for finding gene fusions by analyzing paired-end RNA-sequencing data. Genome Biol, 11, R104), deFuse (McPherson, A. et al. (2011) deFuse: An Algorithm for Gene Fusion Discovery in Tumor RNA-Seq Data. PLoS Comput Biol, 7) and TopHat-Fusion (Kim, D. and Salzberg, S. L. (2011) TopHat-Fusion: an algorithm for discovery of novel fusion transcripts. Genome Biol, 12, R72) have used expression data in their pipelines. However all these methods are based on an analysis of subjects, one by one. The cohort-based approach described here compares expression levels across the cohort of subjects to find expression outliers as well as exon/intron level expression interruption corresponding to matching fusion junctions. Due to the large proportion of sequences that map to introns in FFPE RNA-seq data (Sinicropi, D. et al. (2012) Whole Transcriptome RNA-Seq Analysis of Breast Cancer Recurrence Risk Using Formalin-Fixed Paraffin-Embedded Tumor Tissue. PLoS ONE, 7, e40092), introns were used along with exons to enhance expression measurements. The expression profiling step can nominate candidate fusions despite the existence of very limited reads. In fact, here we used the expression profile data to predict known fusions in samples having no detected fusion sequences


Methods for expression profiling included the following: The gene table described in Step 1 was normalized by scaling factors as described by R package DEseq, which is available at http://bioconductor.org/packages/release/bioc/html/DESeq.html). Anders and Huber, Genome Biology 11:R106 (2010). The intron immediately before the splicing site on the acceptor gene and the intron immediately after the splicing site on the donor gene were identified and excluded from expression analyses. The remaining exons and introns were analyzed and classified as either discarded or preserved based on their expression level. Discarded exons and introns having counts below 250 were padded to 250. Such counts were equivalent to 5 reads prior to normalization. The gene feature (exon or exon/intron) counts were normalized by the sequence length. The expression interrupt ratios of normalized counts between preserved and discarded sides were calculated for donor and acceptor genes for each sample according to the following formula:






IR
=



(

counts
length

)


preserved



(

counts
length

)


discarded






As an exemplary case, the fusion RABEP1→DNAH9 (a tier 1 fusion) was initially found in a single Providence sample and was supported by two non-redundant reads across the fusion template (FIG. 3.7). There are multiple reads across the donor RABEP1 mRNA template junction and no reads across the acceptor DNAH9 mRNA template junction. The same trend exists in pre-mRNA templates, therefore it suggests the strong donor promoter drives the expression of fusion transcripts which appear as one of two expression outliers in the Providence cohort. When the exon/intron expression levels of acceptor DNAH9 were examined across Providence, the other expression outlier appears to have the same discrete expression pattern which exists in the sample but no other samples. The individual exon/intron expression levels of DNAH9 also agree with the observation from the heatmap (data not shown). Therefore we predicted the sample has a tier 3 fusion RABEP1→DNAH9 although there are no reads across fusion junction in that sample. Both fusion events were validated by TaqMan with an average CT of 30.11 and 34.86 respectively (Table 1). This prediction succeeded due to the fact that the fusion transcript is the prevalent form over the non-fused acceptor transcript (FIG. 3.7).


Expression profiling results for the candidate gene fusions are shown in FIG. 4. FIG. 4 shows a scatter plot of normalized gene level counts for the indicated donor and acceptor genes. FIG. 4 also shows bar plots of normalized counts of each exon and intron of acceptor gene of the identified samples in the order of their genomic location. The vertical line separates the acceptor gene into discarded and preserved sides. Because there were several gene fusion junctions having the same sequence, FIG. 4 shows information regarding 105 gene fusion junctions.


Step 7: Review Evidence


Data were manually reviewed to identify candidate gene fusions. The following rules were used to select the final candidate gene fusions: (1) multiple samples sharing the same hits, but without good expression evidence were removed; (2) fusions with a minimum of two non-duplicate reads and a minimum of 15 bp overhang were kept; and (3) expression profiling evidence was reviewed to select candidates with favorable expression evidence. Steps 4-7 make up the cohort based approached identified in FIG. 1.


Gene Fusion List


Overall, 108 fusion events consisting of 100 unique fusion junctions were identified in the two cohorts (Tables A and B). Table A provides information regarding the cohort, the fusion junction, the fusion genes, COSMIC gene, donor Entrez gene ID, donor gene type, donor HUGO gene symbol, donor gene description, acceptor Entrez gene ID, acceptor gene type, acceptor HUGO gene symbol, and acceptor gene description. The symbol “→” indicates the splicing direction of the gene fusion. The symbol “_” is used interchangeably with the symbol “→” to denote junctions in the figures and tables. Splice donors are located to the left of the arrow and splice acceptors are located to the right of the arrow. The “+” symbol denoted in the junctions, refers to the plus-strand of the chromosome, whereas the “−” symbol denotes the minus-strand of the chromosome. Table B provides the gene fusion and the nucleotide sequence of the 100 unique gene fusions.


The majority of gene fusions are intra-chromosomal genomic rearrangements (69 out of total 100 fusion junctions), and this is consistent with findings of others (Robinson, D. R. et al. (2011) Functionally recurrent rearrangements of the MAST kinase and Notch gene families in breast cancer. Nature Medicine, 17, 1646-1651; Edgren, H. et al. (2011) Identification of fusion genes in breast cancer by paired-end RNA-sequencing. Genome Biol, 12, R6. Of the 100 unique fusions, only TFG→GPR128 has been discovered previously (Mitelman, F. et al. (2012) Mitelman Database of Chromosome Aberrations and Gene Fusions in Cancer; Asmann, Y. W. et al. (2012) Detection of Redundant Fusion Transcripts as Biomarkers or Disease-Specific Therapeutic Targets in Breast Cancer. Cancer Res, 72, 1921-1928). It is noteworthy that some of these rare fusions are detected in both of the examined patient cohorts. Here, we validated 3 recurrent gene fusions including TFG→GPR128, ESR1→AKAP12 and RABEP1→DNAH9 by TaqMan assay using amplified RNA from 6, 3 and 2 patients respectively in the two cohorts of 212 total patients. Interestingly, among three ESR1→AKAP12 fusion events in three different patients, there are two unique fusion junctions sharing the same acceptor junction site but differing at the donor junction sites by one exon. Since both ESR1→AKAP12 fusion junctions are in frame and the differed ESR1 exon doesn't harbor any known functional domains, these two fusion transcripts can be assumed possessing the same biological function. Further protein domain analysis showed both fusion proteins replace ESR1 ligand binding site with functional domains of AKAP12 (FIG. 9). Interestingly, the lost ligand binding site interacts with another AKAP family member AKAP13. (Rubino, D. et al. (1998) Characterization of Brx, a novel Dbl family member that modulates estrogen receptor action. Oncogene, 16, 2513-2526). Both fusion protein isoforms could potentially cause constitutive ligand-independent signaling, therefore disregulate protein kinase A pathway (FIG. 9). On the other hands, we also identified some different junctions between two identical fused partners within a single patient. One Providence patient has three different ERBB2→IKZF3 junctions, which only differ at the donor junction site, and one Rush patient has two different TRIM37→BCAS3 junctions which only differ at the donor junction site (see Table 3, below).


Also, multiple recurrent partners fused to different partners were found within the two cohorts. In the Providence sample harboring ESR1→AKAP12, another fusion ESR1→C6orf211 was found and validated, it suggests multiple copies of ESR1 existed and they were fused to different acceptors. LRP5 was also found and validated to be fused to different acceptors KAT6A and SLC22A24 in the same patient. However ADK was found and validated to be an acceptor in the fusion DLG5→ADK in one patient, and a donor in the fusion ADK→C10orf11 in another patient. Similarly, the gene ACACA was also found and validated as the donor of ACACA→M512 in one patient, and the acceptor of UTP18→ACACA in another patient. We further searched the Mitelman fusion database with all 184 unique fusion partners including donors and acceptors from the final 108 fusion list, and 29 partners were found fused to various different partners in the database (Mitelman, F. et al. (2012) Mitelman Database of Chromosome Aberrations and Gene Fusions in Cancer). Among them, ACACA, BCAS3, DDX5, FBXL20, IKZF3, RAF1, TFG and TRPS1 were fused to more than one partner in the database. These observations suggest fusion events are unlikely random although they appear to be rare in solid tumors.


The identified fusion partners also tend to be cancer related, and 82% of the total 83 gene fusions identified from the Providence cohort have at least one partner belonging to COSMIC database which contains many frequently altered cancer specific genes. This is consistent with other evidence for frequently mutated genes prone to genomic rearrangements in the cancer genomes (Ju, Y. S. et al. (2012) A transforming KIF5B and RET gene fusion in lung adenocarcinoma revealed from whole-genome and transcriptome sequencing. Genome Res., 22, 436-445). The discovery of gene fusions containing partners that regulate repair of DNA double-strand breaks and homologous recombination such as RAD21, RDM1, BRCA2 and SHFM1 is consistent with abundant evidence for DNA replication infidelity in cancer.


Example 4
Validation of the Candidate Gene Fusions

60 of the 100 fusion junctions were selected based on cancer relevance of fused partner genes, and tested by quantitative RT-PCR assay (TaqMan®) using amplified RNA samples from selected patients harboring corresponding candidate fusions. Reverse transcription was carried out using the Omniscript™ RT Kit (Qiagen) by incubating amplified RNA with random hexamers and gene-specific primers at 37° C. for 1 hour. Primer, probe, and amplicon sequences are shown in Supplementary Table 2. Fluorogenic probes were dual-labeled with 5′-FAM as a reporter and 3′-BHQ-2 as a quencher. Primers and probes were designed using the Primer3 program restricting amplicon sizes to 65-85 bps (http://frodo.wi.mit.edu/). When Primer3 failed, primer and probe sequences were optimized manually to ensure optimal performance of the TaqMan® assay design for the chimeric transcripts. Reverse transcription reaction in the absence of RNA template (i.e., water) was always used as a negative control in all assays. The samples that were previously validated as positive or negative for a particular gene fusion junction were served as controls when needed. Since the RT reaction was multiplexed by using a pooled gene specific primer set, the cDNA derived from a RNA sample was tested with all fusion gene qPCR assays within a validation gene set. All RNA samples were assayed in triplicate qPCR reactions with 10 ul per well. Thermalcycling conditions were standard for all assays (A heat activation step of 95° C. for 10 minutes followed by 40 cycles of 95° C. for 20 seconds and 60° C. for 45 seconds). Table C indicates the fusion genes, the fusion junction, primer design method, amplicon length, and primer, probe, and amplicon sequences.


At the end of this process, 83 fusion junctions representing only 0.56% of candidate fusion junctions from Step 1 were selected in the Providence dataset. Overall, 108 fusion events consisting of 100 unique fusion junctions were identified in the two cohorts (Tables A and B). Candidate fusions were classified into 3 tiers based on the levels of supporting evidence (FIG. 1B). A total of 60 fusion junctions were selected based on cancer relevance of fused partner genes, and tested by quantitative RT-PCR assay using amplified RNA samples from selected patients harboring corresponding candidate fusions. Tier 1 fusions have the strongest sequence evidence and highest validation rate. The second tier was selected based on the combination of sequence and expression profiling. The third tier has the least evidence since they are purely predicted from gene expression patterns, thus with the lowest validation rate (FIG. 1B).


An important feature of the gene fusion detection pipeline described here is using expression profiling to select tier 2 and tier 3 candidate gene fusions with minimal sequencing data at fusion junctions. Generally, functionally important gene fusions in cancer are characterized by donor genes that are expressed at relatively high levels in non-fused state, by acceptor genes that are expressed at relatively low levels in non-fused state. The strong promoter of a donor gene may up-regulate expression of an oncogenic acceptor gene to contribute to the disease pathology (Mitelman, F. et al. (2007) The impact of translocations and gene fusions on cancer causation. Nature Reviews Cancer, 7, 233-245). Among 31 validated tier 1 fusions, only 7 (23%) fail to show an interrupted expression patterns at either donor or acceptor fusion junctions. Therefore, the filtered false negative gene fusions by expression profiling are probably low and also less likely to be less pathologically relevant. As an internal control, we performed TaqMan assays on 4 fusion candidates that had single non-duplicate reads but without interrupted expression patterns, and only one, and only ESR1→C6orf211, was validated. These two assessments suggest the false negative rate of our pipeline at Step 6 at around 25%. It has been observed fused genes tend to have high copy number variation (Supper, J. et al. (2012) Detecting and visualizing gene fusions. Methods; Kangaspeska, S. et al. (2012) Reanalysis of RNA-Sequencing Data Reveals Several Additional Fusion Genes with Multiple Isoforms. PLoS ONE, 7, e48745). Multiple normal copies of candidates with a single copy of a fused gene can mask the expression profiling of the fused genes, which can lead to the false negative fusions undetected by expression profiling approach.


Fusion transcripts may result from genomic rearrangements or transcript level rearrangements such as trans-splicing which is also biologically relevant. Another type of trans-splicing is known reverse transcriptase artifacts derived from sequence homology (Houseley, J. and Tollervey, D. (2010) Apparent Non-Canonical Trans-Splicing Is Generated by Reverse Transcriptase In Vitro. PLoS ONE, 5, e12271). Although our method cannot distinguish genomic rearrangement derived gene fusions from trans-splicing derived, we used homology sequence search between templates to remove false positive fusions potentially caused by homologous sequences introduced by mapping algorithm or RT errors. This strategy should sufficiently reduce these errors. The existence of vast amount of true negative gene fusions validated by TaqMan (Table 1) also supports the very limited RT based trans-splicing artifact in this study.


Although the gene fusion event is relatively low per patient, which could be attributed to the low quality of FFPE RNA-seq libraries, patient stratification across a breast cancer cohort based on fusion frequencies demonstrates the clinical prognostic power of fusion detection. This was further validated in the biological pathway and network analysis as fusion signature genes were highlighted in the known cancer related network (FIG. 5C). However, we observed some relapsed patients with low fusion signature index in both cohorts (FIG. 5D). It is probably due to the lack of genes from another fundamental underlying cancer mechanism which is inflammation in this fusion signature. Therefore the high genome instability level is a sufficient but not necessary condition for tumor progression, which is consistent with complexities of hallmarks of cancer (Hanahan, D. and Weinberg, R. A. (2011) Hallmarks of Cancer: The Next Generation. Cell, 144, 646-674).


Table 1 shows a summary of the results from the RT-PCR experiments of the 108 fusion events. Shown is the average CT from triplicate 10 ul TaqMan assays.











TABLE 1







Avg


Fusion junction
Fusion genes
CT







+chr6:152265643->+chr6:151669846
ESR1->AKAP12
30.96


+chr11:68080273->−chr8:41907225
LRP5->KAT6A
33.67


−chr20:47324798->+chr20:48431545
PREX1->SLC9A8
29.78


−chr10:79613112->+chr10:76153899
DLG5->ADK
30.26


−chr8:116680772->−chr8:117671219
TRPS1->EIF3H
31.64


+chr17:37868701->−chr17:37949186
ERBB2->IKZF3
31.89


−chr17:62496667->−chr3:197640913
DDX5->IQCG
30.7


+chr16:11154879->−chr16:11914154
CLEC16A->BCAR4
30.56


−chr2:97527316->−chr2:161131275
SEMA4C->RBMS1
32.28


+chr11:63449250->−chr8:41591587
RTN3->ANK1
32.29


+chr1:165797169->−chr1:165697358
UCK2->TMCO1
35.04


−chr17:35536201->+chr17:55478740
ACACA->MSI2
32.47


+chr10:75984349->+chr10:77795766
ADK->C10orf11
31.34


+chr17:61086987->−chr17:34247276
TANC2->RDM1
34.47


−chr20:47790732->+chr20:39690034
STAU1->TOP1
35.93


+chr8:42256382->+chrX:29301055
VDAC3->IL1RAPL1
NA


−chr3:12705312->−chr3:23942540
RAF1->NKIRAS1
34.25


+chr10:127411703->
C10orf137->
NA


+chr10:127266780
LOC100169752



+chr17:49354665->−chr17:35487144
UTP18->ACACA
32.44


+chr1:36492899->−chr16:21212879
EIF2C3->ZP2
35.21


+chr19:52709316->+chr19:56473433
PPP2R1A->NLRP8
35.17


+chr13:103249553->+chr13:32890559
TPP2->BRCA2
33.84


+chr1:32650217->+chr5:10433706
TXLNA->MARCH6
NA


+chr8:104709524->−chr8:105436617
RIMS2->DPYS
34.01


−chr1:235628953->−chr1:235277225
B3GALNT2->TOMM20
NA


−chr1:38155278->+chr1:39792890
C1orf109->MACF1
NA


−chr17:37840850->−chr17:37333788
PGAP3->CACNB1
35.43


−chr4:153332455->−chr7:152055760
FBXW7->MLL3
40


+chr6:41040823->+chr6:40347021
NFYA->TDRG1
40


+chr18:39629569->−chr18:33613800
PIK3C3->RPRD1A
36.52


+chr19:8386587->−chr16:52118478
RPS28->LOC100505619
NA


−chr5:175837258->+chr5:175995679
CLTB->CDHR2
36.65


−chr5:58284320->+chr5:52218607
PDE4D->ITGA1
40


+chr17:5250220->+chr17:11532734
RABEP1->DNAH9
30.11


+chr17:5250220->+chr17:11532734
RABEP1->DNAH9
34.86


+chr11:36057799->−chr10:62039397
LDLRAD3->ANK3
NA


+chr12:51034635->−chr13:45379166
DIP2B->LINC00330
NA


−chr18:77710724->−chr13:45379166
PQLC1->LINC00330
NA


−chr18:77710724->−chr13:45379166
PQLC1->LINC00330
NA


+chr3:100438902->+chr3:100348442
TFG->GPR128
32.84


+chr3:100438902->+chr3:100348442
TFG->GPR128
35.15


+chr11:68133170->−chr11:62863578
LRP5->SLC22A24
31.77


+chr10:133761295->−chr10:91344222
PPP2R2D->PANK1
34.57


+chr15:99442850->+chr18:50278424
IGF1R->DCC
33.36


−chr20:16553874->+chr20:17240885
KIF16B->PCSK2
34.6


+chr2:223725976->+chr2:223553063
ACSL3->MOGAT1
NA


+chr17:37866134->−chr17:37949186
ERBB2->IKZF3
NA


+chr17:37868300->−chr17:37949186
ERBB2->IKZF3
NA


−chr17:78120592->−chr21:45953806
EIF4A3->TSPEAR
35.02


+chr12:122473333->
BCL7A->C12orf42
37.82


−chr12:103872225




+chr6:71123405->+chr6:123038932
FAM135A->PKIB
37.87


−chr14:103523336->−chr4:152594048
CDC42BPB->PET112
40


−chr14:51131897->−chr14:51245522
SAV1->NIN
NA


−chr12:15370363->+chr19:547280
RERG->GZMM
40


−chr19:35989618->−chr19:35617921
DMKN->LGI4
NA


−chr21:27326904->+chr21:30547033
APP->C21orf7
NA


+chr2:11680234->−chr2:9098771
GREB1->MBOAT2
40


−chr12:116450602->−chr12:39764063
MED13L->KIF21A
NA


−chr15:68695257->+chr17:80417868
ITGA11->NARF
NA


−chrX:122799493->+chrX:117676688
THOC2->DOCK11
NA


+chr10:31608221->+chr8:96166259
ZEB1->PLEKHF2
NA


−chr19:55610152->−chr11:1769349
PPP1R12C->IFITM10
40


−chr15:49059257->+chr15:90976951
CEP152->IQGAP1
NA


−chr1:169454801->−chr3:113442942
SLC19A2->NAA50
NA


+chr5:174905642->+chr5:110782384
SFXN1->CAMK4
40


+chr2:208435045->−chr2:98543950
CREB1->TMEM131
40


+chr6:152129499->+chr6:151785588
ESR1->C6orf211
36.19


+chr8:117779030->−chr8:117879000
UTP23->RAD21
40


−chr16:87760371->−chr4:3526778
KLHDC4->LRPAP1
NA


+chr7:7841374->+chr7:8043538
LOC729852->GLCCI1
NA


+chr15:80750317->−chr15:81274523
ARNT2->MESDC2
40


+chr8:18067689->+chr8:38099768
NAT1->DDHD2
NA


+chr9:129623018->−chr9:127818286
ZBTB34->SCAI
40


−chrX:76907604->−chrX:83419395
ATRX->RPS6KA6
40


−chr9:103115054->−chr11:85742653
TEX10->PICALM
40


+chr3:14960340->−chr4:75673359
FGD5->BTC
NA


−chr3:42744071->−chr17:73328878
HHATL->GRB2
40


−chr6:117923167->+chr6:126359851
GOPC->TRMT11
NA


−chr9:14693227->−chr13:31037831
ZDHHC21->HMGB1
NA


+chr9:95821112->−chr8:95511734
SUSD3->KIAA1429
NA


+chr7:56032394->−chr7:82595803
GBAS->PCLO
40


+chr6:158244478->+chr8:61531139
SNX9->RAB2A
40


−chr10:101769595->
DNMBP->TACC2
40


+chr10:123954555




+chr6:7108001->+chr6:7555951
RREB1->DSP
40


+chr1:111833572->+chr7:64291829
CHIA->ZNF138
NA


−chr19:37956215->−chr7:96324203
ZNF569->SHFM1
NA


−chr17:57092971->+chr17:58786580
TRIM37->BCAS3
34.66


+chr22:22020420->+chr22:30064322
PPIL2->NF2
NA


−chr1:53746259->−chr1:54275419
LRP8->TMEM48
40


+chr17:48797192->−chr17:36047395
LUC7L3->HNF1B
32.13


−chr8:117878825->+chr8:124968232
RAD21->FER1L6
34.78


+chr3:100438902->+chr3:100348442
TFG->GPR128
31.03


+chr17:33968994->−chr7:96115729
AP2B1->FLJ42280
NA


+chr3:100438902->+chr3:100348442
TFG->GPR128
32.83


+chr4:71670133->+chr4:71337932
RUFY3->MUC7
NA


+chr3:100438902->+chr3:100348442
TFG->GPR128
34.68


+chr3:100438902->+chr3:100348442
TFG->GPR128
35.36


+chr17:73521906->−chr3:131442469
LLGL2->CPNE4
NA


−chr2:97527316->+chr2:28561317
SEMA4C->BRE
31.08


+chr6:152201906->+chr6:151669846
ESR1->AKAP12
31.78


−chr17:58577760->+chr17:72345323
APPBP2->KIF19
NA


+chr8:38883403->−chr8:41585524
ADAM9->ANK1
NA


−chr17:27492960->−chr17:28120955
MYO18A->SSH2
NA


+chr7:30113748->−chr9:80537261
PLEKHA8->GNAQ
NA


−chr17:37453380->+chr17:44751780
FBXL20->NSF
31.16


−chr17:57094657->+chr17:58786580
TRIM37->BCAS3
NA


−chr20:62421174->+chr20:62559688
ZBTB46->DNAJC5
NA


+chr6:152201906->+chr6:151669846
ESR1->AKAP12
36.09









Example 5
Identification of Recurrent Gene Fusions

Heatmaps and bar plots generated in Example 3 (Step 6: Expression Profiling) were analyzed to identify gene fusions present in multiple patients from the Providence and Rush cohorts. Results are shown in Table 2, which lists the gene fusion, number of Providence patient samples positive for the gene fusion via heatmap inspection and the number of reads spanning the gene fusion junction (split reads) per Providence patient sample, number of Rush patient samples positive for the gene fusion via heatmap inspection and the number of reads spanning the gene fusion junction (split reads) per Rush sample. The symbol “” indicates that multiple junctions were observed for the ESR1→AKAP12 gene fusion. The asterisks “*” in Table 2 indicate that the identified fusion was predicted to be recurrent because split reads were not identified in all patient samples that were positive for the gene fusion via expression profiling evaluation. For example, the data for the TFG→GPR12 gene fusion indicates that 2 patients positive for the gene fusion via heatmap inspection in Providence cohort, 1 patient in the Providence cohort that was positive for that gene fusion via expression profiling evaluation had 3 supporting split reads, and 1 patient has 0 split read. However, in the Rush cohort, of the 4 patients positive for the gene fusion via heatmap inspection, 2 patient samples had 3 split reads, 1 patient sample had 1 split read, and 1 patient sample had 0 split reads. The symbol “‡” indicates that TFG_GPR128 TFG→GPR128 has been discovered previously Mitelman, F. et al. (2012) Mitelman Database of Chromosome Aberrations and Gene Fusions in Cancer; Asmann, Y. W. et al. (2012) Detection of Redundant Fusion Transcripts as Biomarkers or Disease-Specific Therapeutic Targets in Breast Cancer. Cancer Res, 72, 1921-1928. The symbol “--” in various samples from the Rush cohort indicates that there were no samples positive for the indicated gene fusion and there were no split reads supporting the gene fusion. Accordingly, for those gene fusions, recurrence was observed among the Providence patient samples only.











TABLE 2






No. of Patient Samples
No. of Patient



(Providence) and No.
Samples (Rush) and



of Split Reads Per
No. of Split Reads Per


Fusion Gene
Patient Sample
Patient Sample







ESR1_AKAP12
1 sample total:
2 samples total



41 split reads
1 patient: 5 split reads



(positive via RT-PCR)
(positive via RT-PCR)




1 patient: 1 split read




(positive via RT-PCR)


TFG_GPR128*
2 samples total
4 samples total



1 sample: 3 split reads
2 samples:



(positive via RT-PCR)
3 split reads



1 sample: 0 split reads
(positive via RT-PCR)



(positive by RT-PCR)
1 sample: 1 split read




(positive via RT-PCR)




1 sample: 0 split reads




(positive via RT-PCR)


RABEP1_DNAH9*
2 samples total




1 sample: 2 reads




(positive via RT-PCR)




1 sample: 0 reads




(positive via RT-PCR)



EIF2C3_ZP2*
3 samples total




1 sample: 4 split reads




(positive via RT-PCR)




2 samples: 0 split reads




(negative via RT-PCT)



NFYA_TDRG1*
10 samples total




1 sample: 2 split reads




(negative via RT-PCT)




9 samples: 0 split reads




(negative via RT-PCT)



KIF16B_PCSK2*
3 samples total




1 sample: 1 split read




(positive via RT-PCT)




2 samples: 0 split reads




(negative via RT-PCT)



BCL7A_C12orf42
3 samples total




1 sample: 1 split read




(positive via RT-PCT)




2 samples: 0 split reads




(negative via RT-PCT)



RERG_GZMM*
4 samples total




1 sample: 1 split read




(negative via RT-PCT)




3 samples: 0 split reads




(negative via RT-PCT)



RAF1_NKIRAS1*
3 samples total




1 sample: 6 split reads




(positive via RT-PCR)




2 samples: 0 split reads




(negative via RT-PCT)



UTP23_RAD21*
4 samples total




1 sample: 2 split reads




(negative via RT-PCT)




3 samples 0 split reads




(negative via RT-PCT)



LRP5_SLC22A24*
2 samples total




1 sample: 2 split reads




(RT-PCR results




unavailable)




1 sample: 0 split reads




(RT-PCR results




unavailable)



LRP5_KAT6A*
3 samples total




1 sample: 26 split reads




(positive via RT-PCT)




2 samples: 0 split reads




(negative via RT-PCT)



PREX1_SLC9A8*
2 samples total




1 sample: 22 split reads




(RT-PCR results




unavailable)




1 sample: 0 split reads




(RT-PCR results




unavailable)



PPP2R1A_NLRP8*
2 samples total




1 sample: 2 split reads




(positive via RT-PCR)




1 sample: 0 split reads




(negative via RT-PCR)



PQLC1_LINC00330
2 samples total




each sample: 1 split read




(RT-PCR results




unavailable)









The list of candidate gene fusions was also analyzed to identify recurrent gene fusions within a given sample in order to identify genes that may be hotspots for chromosomal aberrations that cause gene fusions. Table 3 shows gene fusions that were recurrent in a single sample.











TABLE 3






No. of Gene Fusion Junctions




Identified in a Single Patient
No. of Patient Samples (Rush) and



Sample (Providence) and No. of
No. of Split Reads Per Patient


Gene Fusion
Split Reads Per Junction
Sample







ERBB2_IKZF3
1 sample total (3 fusion junctions




within that sample)




1 fusion junction: 79 split




reads




1 fusion junction: 6 split




reads




1 fusion junction: 4 split




reads



TRIM37_BCAS3

1 sample total (2 fusion junctions




within that sample)




1 fusion junction: 19 split




reads




1 fusion junction: 1 split read


ESR1 (involved in two
1 sample total (two fusion junctions



fusion junctions)
involving ESR1)



ESR1_AKAP12
ESR1_AKAP12: 41 split




reads



ESR1_C6orf211
ESR1_C6orf211: 4 split




reads









The list of candidate gene fusions was further analyzed to identify recurrent gene fusions present in different samples (either within a single or cohort or between cohorts) in order to identify genes that may be hotspots for chromosomal aberrations that cause gene fusions. Results are shown in Table 4.











TABLE 4






No. Patient Samples Positive for the
No. Patient Samples Positive for the


Gene involved
Gene Fusion (Providence) and No. Split
Gene Fusion (Rush) and No. Split


in Gene Fusion
Reads per Junction
Reads per Junction







ACACA
1 sample: UTP18_ACACA




1 split read




1 sample: ACACA_MSI2




9 split reads



ADK
1 sample: DLG5_ADK




17 split reads




1 sample: ADK_C10orf11




6 split reads



ANK1
1 sample: RTN3_ANK1
1 sample: ADAM9_ANK1



7 split reads
7 split reads


SEMA4C
1 sample SEMA4C_RBMS1
1 sample SEMA4C_BRE



9 split reads
26 split reads


RAD21
1 sample: UTP23_RAD21
1 sample: RAD21_FER1L6



2 split reads
2 split reads


LINC00330
1 sample: DIP2B_LINC00330




1 split read




2 samples: PQLC1_LINC00330




Each sample: 1 split read









Also, multiple recurrent partners fused to different partners were found within the two cohorts. In the Providence sample harboring ESR1→AKAP12, another fusion ESR1→C6orf211 was found and validated, it suggests multiple copies of ESR1 existed and they were fused to different acceptors. LRP5 was also found and validated to be fused to different acceptors KAT6A and SLC22A24 in the same patient. However ADK was found and validated to be an acceptor in the fusion DLG5→ADK in one patient, and a donor in the fusion ADK→C10orf11 in another patient. Similarly, the gene ACACA was also found and validated as the donor of ACACA→MSI2 in one patient, and the acceptor of UTP18→ACACA in another patient. We further searched the Mitelman fusion database with all 184 unique fusion partners including donors and acceptors from the final 108 fusion list, and 29 partners were found fused to various different partners in the database (Mitelman, F. et al. (2012) Mitelman Database of Chromosome Aberrations and Gene Fusions in Cancer). Among them, ACACA, BCAS3, DDX5, FBXL20, IKZF3, RAF1, TFG and TRPS1 were fused to more than one partner in the database. These observations suggest fusion events are unlikely random although they appear to be rare in solid tumors.


Example 7
Identifying Alternative Spliced Junctions

An overview of the method for identifying alternative spliced junctions is shown in FIG. 1. To identify alternative spliced junctions, Steps 1-3 of Example 3 were performed using the Providence cohort samples. Novel distant splice junctions from the candidate gene fusions resulting from Step 3 of Example 3 were annotated using the Ensembl Reference (Biomart). First, Ensemble was used to create a canonical splice junction dictionary to enable identification of novel distant spliced junctions. 98% of the distant spliced junctions were considered novel. Next, the type of alternative spliced junction was classified as involving a novel donor (0.3%), a novel acceptor (0.42%), a novel donor and acceptor (1.27%), exon shuffling (96.37%), exon skipping (0.92%), or an unknown case (0.71%). 74.7% of the distant spliced junctions remained after application of a same gene filter and 19.6% of the distant spliced junctions remained after application of a minimum non-duplicate read filter. Finally, clinical information regarding breast cancer recurrence in the Providence cohort was applied to identify alternative spliced junctions that correlated with cancer recurrence. 1.8% of the junctions passed Fisher's Test of a p-value less than 0.05.


Six candidate alternative splicing junctions were identified using the above method. The candidate alternative splicing junctions are shown in Table 5. Table 5 shows the gene symbol, the alternative spliced junction within the gene, Fisher's p value, the non-recurrent ratio, the recurrent ratio, the non-recurrence sample count, the recurrence sample count, and the splice type identified by the method described herein. Alternative spliced junctions in UBXN7, SOX5, KIAA0368, PIKC3C, and DAP3 correlated with non-recurrence of breast cancer whereas an alternative spliced junction MITD1 correlated with recurrence of breast cancer. Furthermore, investigation of the alternative spliced junction in PIKC3 predicts a fusion protein with a junction at amino acids 701 and 887 (data not shown).
















TABLE 5










Non-






Fisher
Non-
Recur-
Recurrence
Recurrence



Gene

P
Recurrent
rent
Sample
Sample
Splice


Symbol
Junction
Value
Ratio
Ratio
Count
Count
Type






















UBXN7
−chr3:196118684_−chr3:196129890
0.0091
26.5%
3.7%
31 of 117
1 of 27
Exon Shuffle


SOX5
−chr12:24366277_−chr12:24048958
0.0095
25.6%
3.7%
30 of 117
1 of 27
Unknown


MITD1
−chr2:99786013_−chr2:99787892
0.0123
14.5%
37.0%
17 of 117
10 of 27 
Exon Shuffle


KIAA0368
−chr9:114148657_−chr9:114154104
0.0134
41.0%
14.8%
48 of 117
4 of 27
Exon Shuffle


PIK3C3
+chr18:39629569_+chr18:39623697
0.0276
41.9%
18.5%
49 of 117
5 of 27
Exon Shuffle


DAP3
+chr1:155695810_+chr1:155695173
0.0464
20.5%
3.7%
24 of 117
1 of 27
Exon Shuffle









Example 8
Validating Alternative Splice Junctions

Validation of the alternative splice junctions is determined using quantitative RT-PCR. Quantitative RT-PCR analysis using TaqMan® RT PCR is used to investigate the six alternative spliced junctions identified in Example 7. Reverse transcription is carried out using the OmniPure RT kit (Qiagen). Reverse transcription is performed by combining random hexamers and gene-specific primers at 37° C. for 1 hour.


Fluorogenic probes are dual-labeled with 5′-FAM as a reporter and 3′-BHQ-2 as a quencher. Primers and probes are designed using the Primer3 program (http://frodo.wi.mit.edu/). In some cases, primer and probe sequences are optimized manually to ensure optimal performance of the TaqMan® assay design for FFPE samples. The TaqMan® assay designs are manually optimized to select an amplicon size less than 100 bases in length and to enable the probe to approximately span the alternative spliced junction. No template (i.e., water) is used as a negative control.


Results from quantitative RT-PCR are obtained and analyzed to investigate the validity of the alternative spliced junctions identified using the bioinformatics approach.


Example 9
Increased Fusion Events are Associated with Poor Tumor Prognosis

The average fusion events per patient across Providence and Rush cohorts are 0.63 and 0.29 respectively, far less than the average of 4.2 fusions in fresh frozen breast cancer biopsies (Robinson, D. R. et al. (2011) Functionally recurrent rearrangements of the MAST kinase and Notch gene families in breast cancer. Nature Medicine, 17, 1646-1651; Asmann, Y. W. et al. (2012) Detection of Redundant Fusion Transcripts as Biomarkers or Disease-Specific Therapeutic Targets in Breast Cancer. Cancer Res, 72, 1921-1928). This difference can reasonably be attributed to the poor quality of FFPE RNAs. This is born out in the Rush and Providence data sets, the former having older archival ages therefore poorer quality RNA and yielding a far fewer identified gene fusions (FIG. 5A).


Within each patient cohort we investigated whether the number of fusion events detected within individual tumors related to the likelihood of disease recurrence. We thus stratified patients according to the numbers of fusion events within each cohort (FIG. 5A). Since not all fusions have tested by TaqMan assay, the final fusion list (Tables A and B) was used in stratification regardless of validation results. In view of the limited number of fusions detected in the Rush dataset we evaluated just 2 categories: fusion detected or not detected, whereas in the Providence dataset we evaluated four abundance categories. The 8 patients with greater than two fusions (subsequently referred to as multiple fusions) in Providence have statistically significant increased recurrence risk compared to patients from three other groups (FIG. 5B). In the Rush dataset disease recurred at an increased rate among patients with detected fusions, although this relationship doesn't not quite achieve statistical significance cutoff. To check whether minimizing the block age effect improves this relationship, we resorted to subset patients into either upper three or lower three quantiles of the block age since sizable patients with comparable block age were difficult to obtain considering the small cohort size. The separations of patients with fusions from patients without fusions were significantly improved in both subsets of Rush (FIG. 6). The similar results were observed with Providence block age subsets. Interestingly, we also observed enrichment of estrogen receptor negative (ER−) patients in Providence multiple fusion group and Rush fusion group (FIG. 3A), consistent with the well-known poor prognosis of ER− patients (DeSombre and Jensen, 1980) and the published evidence of increased chromosome instability levels in ER− tumors (Endesfelder et al., 2011).


In order to assess the biological significance of the fusion frequency, we identified genes differentially expressed between the multiple fusion group versus no fusion group in the Providence cohort. Since ER status can effect gene expression greatly, an additive model of edgeR using ER status as background was applied to remove ER effect in identifying genes related to fusion frequency. We took extra cautions to examine the pairwise comparisons between any Providence fusion frequency groups, and confirmed the differentially expressed genes between multiple fusion samples versus no fusion samples are specific to this comparison, which achieved maximum differentially expression among all comparisons tested (Table 6). Accordingly, Table 6 shows pairwise differentially expressed (DE) gene analysis between sample categories by fusion number in Providence show differentially expressed fusion gene signatures are specific to the comparison of multiple fusion samples to no fusion samples.











TABLE 6









Sample category by



Differentially expressed
fusion number












gene # up/down
2 fusions
1 fusion
0 fusion






Multiple fusions
4/2
49/2
128/6 




(2)
(42)




2 fusions
NA
 9/0
31/0





 (0)
(0)



1 fusion
NA
NA
29/7






(0)





Note:


The additive model of edgeR was used with ER status as background with FDR <0.05 for each pairwise DE analysis. The up-regulated and down-regulated gene numbers are shown. The numbers in parenthesis are the overlapping genes with DE genes of multiple fusion samples versus no fusion samples.






Both multiple fusion samples and no fusion samples were also segregated according to ER status, and the overlapped genes between each category were compared (FIG. 7). Although ER+ samples contribute more to the differentially expressed genes between multiple fusion samples versus no fusion samples due to the majority of Providence samples are ER+ (FIG. 7A), the overlapped fusion differentially expressed genes with genes differentially expressed solely due to ER status are reasonably small (FIG. 7B). These investigations showed evidence that the 134 differentially expressed genes obtained by the edgeR additive model reflect the difference by fusion frequency rather than by ER status. Then the total 134 genes were uploaded to Reactome FI (functional interaction) database via Cytoscape Plugin (G. Wu et al., 2010) to perform network clustering. As shown in FIG. 5C, 84 up-regulated genes in multiple fusion group versus no fusion group formed a network, which we termed as the fusion gene signature (Table 7). We show that expression of the fusion gene signature in Providence tumors was also significantly greater in patients with multiple fusions than in patients with 1 or 2 detected fusions (FIG. 5D). Further, in the Rush cohort the expression of this signature is significantly greater in tumors with identified fusions (FIG. 5D).


These fusion signature genes were clustered into five related functional steps (FIG. 5C). Strikingly, these functions are all cancer epithelial cells related, and are the characteristics of prominent pathological hallmarks of cancers (Hanahan, D. and Weinberg, R. A. (2011) Hallmarks of Cancer: The Next Generation. Cell, 144, 646-674). The underlying mechanism of these cancer hallmarks is the cancer enabling characteristic, genome instability, which can lead to chromosome rearrangement and therefore gene fusions. Thus the relationship between gene fusion frequency reflecting genome stability level and tumor prognosis is supported by the current understanding of tumor progression.













TABLE 7





Gene
Entrez

Step



symbol
gene ID
Gene description
#
Function



















AP4E1
23431
adaptor-related protein complex
0
Mitogenic signaling




4, epsilon 1 subunit




ATG5
9474
autophagy related 5
0
Mitogenic signaling


BSG
682
basigin (Ok blood group)
0
Mitogenic signaling


CAND1
55832
cullin-associated and
0
Mitogenic signaling




neddylation-dissociated 1




CCT2
10576
chaperonin containing TCP1,
0
Mitogenic signaling




subunit 2 (beta)




CD24
100133941
CD24 molecule
0
Mitogenic signaling


CIB1
10519
calcium and integrin binding 1
0
Mitogenic signaling




(calmyrin)




CLTC
1213
clathrin, heavy chain (Hc)
0
Mitogenic signaling


CPT1A
1374
carnitine palmitoyltransferase
0
Mitogenic signaling




1A (liver)




DCC
1630
deleted in colorectal carcinoma
0
Mitogenic signaling


FHL2
2274
four and a half LIM domains 2
0
Mitogenic signaling


GALNT1
2589
UDP-N-acetyl-alpha-D-
0
Mitogenic signaling




galactosamine:polypeptide N-






acetylgalactosaminyltransferase






1 (GalNAc-T1)




GNB5
10681
guanine nucleotide binding
0
Mitogenic signaling




protein (G protein), beta 5




HPGD
3248
hydroxyprostaglandin
0
Mitogenic signaling




dehydrogenase 15-(NAD)




IGF1R
3480
insulin-like growth factor 1
0
Mitogenic signaling




receptor




MAPK6
5597
mitogen-activated protein kinase 6
0
Mitogenic signaling


MST4
51765
serine/threonine protein kinase
0
Mitogenic signaling




MST4




PEX13
5194
peroxisomal biogenesis factor 13
0
Mitogenic signaling


PIP4K2C
79837
phosphatidylinositol-5-
0
Mitogenic signaling




phosphate 4-kinase, type II,






gamma




PRDX1
5052
peroxiredoxin 1
0
Mitogenic signaling


PTPRF
5792
protein tyrosine phosphatase,
0
Mitogenic signaling




receptor type, F




TMOD2
29767
tropomodulin 2 (neuronal)
0
Mitogenic signaling


TUBB3
10381
tubulin, beta 3 class III
0
Mitogenic signaling


USP8
9101
ubiquitin specific peptidase 8
0
Mitogenic signaling


ATP5B
506
ATP synthase, H+ transporting,
1
Energy metabolism




mitochondrial F1 complex, beta






polypeptide




BRF2
55290
BRF2, subunit of RNA
1
Energy metabolism




polymerase III transcription






initiation factor, BRF1-like




CDC20
991
cell division cycle 20 homolog
1
Energy metabolism




(S.cerevisiae)




CDCA8
55143
cell division cycle associated 8
1
Energy metabolism


COX6B1
1340
cytochrome c oxidase subunit
1
Energy metabolism




VIb polypeptide 1 (ubiquitous)




G6PD
2539
glucose-6-phosphate
1
Energy metabolism




dehydrogenase




IDH2
3418
isocitrate dehydrogenase 2
1
Energy metabolism




(NADP+), mitochondrial




LEO1
123169
Leo1, Paf1/RNA polymerase II
1
Energy metabolism




complex component, homolog






(S.cerevisiae)




MAT1A
4143
methionine adenosyltransferase
1
Energy metabolism




I, alpha




MED13
9969
mediator complex subunit 13
1
Energy metabolism


PGK1
5230
phosphoglycerate kinase 1
1
Energy metabolism


PSMB2
5690
proteasome (prosome,
1
Energy metabolism




macropain) subunit, beta type, 2




PSMC4
5704
proteasome (prosome,
1
Energy metabolism




macropain) 26S subunit,






ATPase, 4




SMARCD2
6603
SWI/SNF related, matrix
1
Energy metabolism




associated, actin dependent






regulator of chromatin,






subfamily d, member 2




SMG8
55181
smg-8 homolog, nonsense
1
Energy metabolism




mediated mRNA decay factor






(C.elegans)




SNRPB2
6629
small nuclear ribonucleoprotein
1
Energy metabolism




polypeptide B




TFCP2L1
29842
transcription factor CP2-like 1
1
Energy metabolism


UBB
7314
ubiquitin B
1
Energy metabolism


VPS4B
9525
vacuolar protein sorting 4
1
Energy metabolism




homolog B (S.cerevisiae)




CFL1
1072
cofilin 1 (non-muscle)
2
Cell motility


HMGCR
3156
3-hydroxy-3-methylglutaryl-
2
Cell motility




CoA reductase




IPP
3652
intracisternal A particle-
2
Cell motility




promoted polypeptide




MYO5A
4644
myosin VA (heavy chain 12,
2
Cell motility




myoxin)




MYO5C
55930
myosin VC
2
Cell motility


PAK4
10298
p21 protein (Cdc42/Rac)-
2
Cell motility




activated kinase 4




PPP1R1B
84152
protein phosphatase 1,
2
Cell motility




regulatory (inhibitor) subunit 1B




RAB11FIP1
80223
RAB11 family interacting
2
Cell motility




protein 1 (class I)




SEC23B
10483
Sec23 homolog B (S.cerevisiae)
2
Cell motility


SLC6A9
6536
solute carrier family 6
2
Cell motility




(neurotransmitter transporter,






glycine), member 9




TMED2
10959
transmembrane emp24 domain
2
Cell motility




trafficking protein 2




TMED9
54732
transmembrane emp24 protein
2
Cell motility




transport domain containing 9




TMOD3
29766
tropomodulin 3 (ubiquitous)
2
Cell motility


TRPM7
54822
transient receptor potential
2
Cell motility




cation channel, subfamily M,






member 7




ARPP19
10776
cAMP-regulated
3
Cell cycle




phosphoprotein, 19 kDa




BRIP1
83990
BRCA1 interacting protein C-
3
Cell cycle




terminal helicase 1




EIF3I
8668
eukaryotic translation initiation
3
Cell cycle




factor 3, subunit I




GABPB1
2553
GA binding protein transcription
3
Cell cycle




factor, beta subunit 1




GINS2
51659
GINS complex subunit 2 (Psf2
3
Cell cycle




homolog)




H2AFZ
3015
H2A histone family, member Z
3
Cell cycle


KPNA2
3838
karyopherin alpha 2 (RAG
3
Cell cycle




cohort 1, importin alpha 1)




MBD2
8932
methyl-CpG binding domain
3
Cell cycle




protein 2




OCLN
100506658
occludin
3
Cell cycle


POLI
11201
polymerase (DNA directed) iota
3
Cell cycle


QRSL1
55278
glutaminyl-tRNA synthase
3
Cell cycle




(glutamine-hydrolyzing)-like 1




RRM2
6241
ribonucleotide reductase M2
3
Cell cycle


ATP6V0B
533
ATPase, H+ transporting,
4
DNA damage response




lysosomal 21 kDa, V0 subunit b




DCAF7
10238
DDB1 and CUL4 associated
4
DNA damage response




factor 7




HSP90B1
7184
heat shock protein 90 kDa beta
4
DNA damage response




(Grp94), member 1




LDHA
3939
lactate dehydrogenase A
4
DNA damage response


LOXL4
84171
lysyl oxidase-like 4
4
DNA damage response


MDM2
4193
Mdm2, p53 E3 ubiquitin protein
4
DNA damage response




ligase homolog (mouse)




NQO1
1728
NAD(P)H dehydrogenase,
4
DNA damage response




quinone 1




P4HA1
5033
prolyl 4-hydroxylase, alpha
4
DNA damage response




polypeptide I




PDIA6
10130
protein disulfide isomerase
4
DNA damage response




family A, member 6




SLC2A1
6513
solute carrier family 2
4
DNA damage response




(facilitated glucose transporter),






member 1




TK1
7083
thymidine kinase 1, soluble
4
DNA damage response


AKR1A1
10327
aldo-keto reductase family 1,
5





member A1 (aldehyde






reductase)




SHMT2
6472
serine hydroxymethyltransferase
5





2 (mitochondrial)




BCAP31
10134
B-cell receptor-associated
6





protein 31




CANX
821
calnexin
6









Methods Regarding RNA-Seq Expression Analysis


The differentially expressed genes were analyzed by edgeR (Robinson, M. D. et al. (2010) edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics, 26, 139-140) based on base count tables tallied from GSNAP mapping results. For the comparison of Providence multiple fusions versus no fusion samples, the low expression filter requires minimum 8 samples with at least 250 base counts which equivalent to 5 reads, and the Cox-Reid profile-adjusted likelihood method was used to estimate mean-variance relationships. Due to the strong influence of ER status on gene expression profiling, the additive model of edgeR was used to get differentially expressed genes between multiple fusion samples versus no fusion samples adjusting for differences between ER positive and negative status. The false discovery rate of differentially expressed genes was set at 0.05 of Bonferroni-Holm method adjusted p values.


The differentially expressed genes were mapped to a cancer focused protein-protein interaction database, Reactome FI database as instructed by the Reactome FIs Cytoscape plugin. The network was clustered by a built-in spectral partition based clustering algorithm, and nodes in different network modules (FIG. 5C).


Example 10
Alternative Methods of Expression Profiling to Identify Gene Fusions

In other experiments, based on the expression profiling, the acceptor exon boundary is identified precisely. A chimera database is built with that fixed on the right hand of the sequence; the left part of the sequence is one of the 300,000 exons from RefSeq (all 20,000+ genes). Mapping each read against the chimera database potentially identifies the donor exon/gene.


Example 11
Identification of ESR1-AKAP12 Fusion Using Outlier Expression

In a separate set of experiments, outlier expression analysis was used to identify an ESR1-AKAP12 fusion in a patient sample from the Rush cohort. In summary, the fusion between ESR1 and AKAP12 in the Rush cohort was identified by a combination of identifying the expression outlier of AKAP12 from the entire cohort followed by the sequence comparison for AKAP12 and ESR1.


First, expression analysis was used to identify the original read counts for the entire Rush cohort. The original read counts were normalized by Q3 then log 2 transformed to assess the expression level for each gene. AKAP12 gene expression was plotted in a histogram to identify the expression outlier(s), defined by the 3 standard deviation of the mean level. FIG. 8 shows the histogram of AKAP12 counts for the patients in the Rush cohort. The expression outlier is circled.


Second, reads were investigated to identify those spanning the fusion between ESR1 and AKAP12. All of the reads from the above patient were mapped against the human genome by Bowtie. The fusion reads were in the unmapped category. Using ESR1 and AKAP12 as the two separate targets, the unmapped reads were aligned against them. The reads which could be aligned both to ESR1 and AKAP12 were then identified. This process identified the following read maps to the junction between ESR1 and AKAP12 at +chr6:152201906(ESR1)→+chr6:151669846(AKAP12).


All references cited throughout the disclosure, including the examples, are hereby expressly incorporated by reference for their entire disclosure.


While the present invention has been described with reference to what is considered to be specific embodiments, it is to be understood that the invention is not so limited. To the contrary, the invention is intended to cover various modifications and equivalents included within the spirit and scope of the appended claims.




















TABLE A









Donor
Donor
Donor
Donor
Accept or
Accept
Accept or
Accept or



Fusion
Fusion
COSMIC
Entrez
gene
HUGO
gene
Entrez
or gene
HUGO
gene


Cohort
junction
genes
gene
gene ID
type
gene symbol
description
gene ID
type
gene symbol
description


























Provi-
+chr6:152265643->+chr6:151669846
ESR1->
ESR1
2099
protein-
ESR1
estrogen
9590
protein-
AKAP12
A kinase (PRKA)


dence

AKAP12


coding

receptor 1

coding

anchor protein 12


Provi-
+chr11:68080273->−chr8:41907225
LRP5->
LRP5
4041
protein-
LRP5
low density
7994
protein-
KAT6A
K(lysine) acetyl-


dence

KAT6A


coding

lipoprotein

coding

transferase 6A









receptor-related













protein 5






Provi-
−chr20:47324798->+chr20:48431545
PREX1->
PREX1
57580
protein-
PREX1
phosphatidylinositol-
23315
protein-
SLC9A8
solute carrier


dence

SLC9A8


coding

3,4,5-trisphosphate-

coding

family 9, subfamily









dependent Rac



A (NHE8, cation









exchange factor 1



proton antiporter













8), member 8


Provi-
−chr10:79613112->+chr10:76153899
DLG5->
DLG5
9231
protein-
DLG5
discs, large
132
protein-
ADK
adenosine kinase


dence

ADK


coding

homolog 5

coding











(Drosophila)






Provi-
−chr8:116680772->−chr8:117671219
TRPS1->
TRPS1
7227
protein-
TRPS1
trichorhino-
8667
protein-
EIF3H
eukaryotic translation


dence

EIF3H


coding

phalangeal

coding

initiation factor 3,









syndrome I



subunit H


Provi-
+chr17:37868701->−chr17:37949186
ERBB2->
ERBB2
2064
protein-
ERBB2
v-erb-b2
22806
protein-
IKZF3
IKAROS family zinc


dence

IKZF3


coding

erythroblastic

coding

finger 3 (Aiolos)









leukemia viral













oncogene homolog













2, neuro/glioblas-













toma derived













oncogene homolog













(avian)






Provi-
−chr17:62496667->−chr3:197640913
DDX5->
DDX5
1655
protein-
DDX5
DEAD (Asp-Glu-
84223
protein-
IQCG
IQ motif


dence

IQCG


coding

Ala-Asp) (SEQ

coding

containing G









ID NO: 357)













box helicase 5






Provi-
+chr16:11154879->−chr16:11914154
CLEC16A->
CLEC16A
23274
protein-
CLEC16A
C-type lectin
400500
miscRNA
BCAR4
breast cancer anti-


dence

BCAR4


coding

domain family



estrogen resistance









16, member A



4 (non-protein coding)


Provi-
−chr2:97527316->−chr2:161131275
SEMA4C->
SEMA4C
54910
protein-
SEMA4C
sema domain,
5937
protein-
RBMS1
RNA binding motif,


dence

RBMS1


coding

immunoglobulin

coding

single stranded









domain (Ig),



interacting protein 1









transmembrane













domain (TM) and













short cytoplasmic













domain,













(semaphorin) 4C






Provi-
+chr11:63449250->−chr8:41591587
RTN3->
RTN3
10313
protein-
RTN3
reticulon 3
286
protein-
ANK1
ankyrin 1, erythrocytic


dence

ANK1


coding



coding




Provi-
+chr1:165797169->−chr1:165697358
UCK2->
UCK2
7371
protein-
UCK2
uridine-cytidine
54499
protein-
TMCO1
transmembrane and


dence

TMCO1


coding

kinase 2

coding

coiled-coil domains 1


Provi-
−chr17:35536201->+chr17:55478740
ACACA->
ACACA
31
protein-
ACACA
acetyl-CoA
124540
protein-
MSI2
musashi homolog 2


dence

MSI2


coding

carboxylase alpha

coding

(Drosophila)


Provi-
+chr10:75984349->+chr10:77795766
ADK->

132
protein-
ADK
adenosine kinase
83938
protein-
C10orf11
chromosome 10 open


dence

C10orf11


coding



coding

reading frame 11


Provi-
+chr17:61086987->−chr17:34247276
TANC2->
TANC2
26115
protein-
TANC2
tetratricopeptide
201299
protein-
RDM1
RAD52 motif 1


dence

RDM1


coding

repeat, ankyrin

coding











repeat and













coiled-coil













containing 2






Provi-
−chr20:47790732->+chr20:39690034
STAU1->
STAU1
6780
protein-
STAU1
staufen, RNA
7150
protein-
TOP1
topoisomerase


dence

TOP1


coding

binding protein,

coding

(DNA) I









homolog 1













(Drosophila)






Provi-
+chr8:42256382->+chrX:29301055
VDAC3->
VDAC3
7419
protein-
VDAC3
voltage-
11141
protein-
IL1RAPL1
interleukin 1


dence

IL1RAPL1


coding

dependent

coding

receptor accessory









anion channel 3



protein-like 1


Provi-
−chr3:12705312->−chr3:23942540
RAF1->
RAF1
5894
protein-
RAF1
v-raf-1
28512
protein-
NKIRAS1
NFKB inhibitor


dence

NKIRAS1


coding

murine leukemia

coding

interacting Ras-









viral oncogene



like 1









homolog 1






Provi-
+chr10:127411703->+chr10:127266780
C10orf137->
C10orf137
26098
protein-
C10orf137
chromosome 10
100169752
miscRNA
LOC100169752
uncharacterized


dence

LOC100169752


coding

open reading



LOC100169752









frame 137






Provi-
+chr17:49354665->−chr17:35487144
UTP18->
UTP18_ACACA
51096
protein-
UTP18
UTP18 small
31
protein-
ACACA
acetyl-CoA


dence

ACACA


coding

subunit (SSU)

coding

carboxylase alpha









processome













component













homolog (yeast)






Provi-
+chr1:36492899->−chr16:21212879
EIF2C3->
EIF2C3_ZP2
192669
protein-
EIF2C3
eukaryotic
7783
protein-
ZP2
zona pellucida


dence

ZP2


coding

translation

coding

glycoprotein









initiation



2 (sperm receptor)









factor 2C, 3






Provi-
+chr19:52709316->+chr19:56473433
PPP2R1A->
PPP2R1A
5518
protein-
PPP2R1A
protein
126205
protein-
NLRP8
NLR family,


dence

NLRP8


coding

phosphatase 2,

coding

pyrin domain









regulatory



containing 8









subunit A,













alpha






Provi-
+chr13:103249553->+chr13:32890559
TPP2->
TPP2
7174
protein-
TPP2
tripeptidyl
675
protein-
BRCA2
breast cancer 2,


dence

BRCA2


coding

peptidase II

coding

early onset


Provi-
+chr1:32650217->+chr5:10433706
TXLNA->
TXLNA
200081
protein-
TXLNA
taxilin alpha
10299
protein-
MARCH6
membrane-associated


dence

MARCH6


coding



coding

ring finger (C3HC4)













6, E3 ubiquitin













protein ligase


Provi-
+chr8:104709524->−chr8:105436617
RIMS2->
RIMS2
9699
protein-
RIMS2
regulating
1807
protein-
DPYS
dihydro


dence

DPYS


coding

synaptic

coding

pyrimidinase









membrane













exocytosis 2






Provi-
−chr1:235628953->−chr1:235277225
B3GALNT2->
B3GALNT2
148789
protein-
B3GALNT2
beta-1,3-N-
9804
protein-
TOMM20
translocase of


dence

TOMM20


coding

acetylgalac-

coding

outer mitochondrial









tosaminyl-



membrane 20









transferase 2



homolog (yeast)


Provi-
−chr1:38155278->+chr1:39792890
C1orf109->

54955
protein-
C1orf109
chromosome 1
23499
protein-
MACF1
microtubule-


dence

MACF1


coding

open reading

coding

actin crosslinking









frame 109



factor 1


Provi-
−chr17:37840850->−chr17:37333788
PGAP3->
PGAP3
93210
protein-
PGAP3
post-GPI
782
protein-
CACNB1
calcium channel,


dence

CACNBl


coding

attachment to

coding

voltage-dependent,









proteins 3



beta 1 subunit


Provi-
−chr4:153332455->−chr7:152055760
FBXW7->
FBXW7
55294
protein-
FBXW7
F-box and WD
58508
protein-
MLL3
myeloid/lymphoid


dence

MLL3


coding

repeat domain

coding

or mixed-lineage









containing 7,



leukemia 3









E3 ubiquitin













protein ligase






Provi-
+chr6:41040823->+chr6:40347021
NFYA->

4800
protein-
NFYA
nuclear
732253
miscRNA
TDRG1
testis development


dence

TDRG1


coding

transcription



related 1 (non-









factor Y, alpha



protein coding)


Provi-
+chr18:39629569->−chr18:33613800
PIK3C3->
PIK3C3
5289
protein-
PIK3C3
phosphoinosi-
55197
protein-
RPRD1A
regulation of


dence

RPRD1A


coding

tide-3-kinase,

coding

nuclear pre-mRNA









class 3



domain containing 1A


Provi-
+chr19:8386587->−chr16:52118478
RPS28->

6234
protein-
RPS28
ribosomal
100505619
miscRNA
LOC100505619
uncharacterized


dence

LOC100505619


coding

protein S28



LOC100505619


Provi-
−chr5:175837258->+chr5:175995679
CLTB->

1212
protein-
CLTB
clathrin, light
54825
protein-
CDHR2
cadherin-related


dence

CDHR2


coding

chain B

coding

family member 2


Provi-
−chr5:58284320->+chr5:52218607
PDE4D->
PDE4D
5144
protein-
PDE4D
phosphodiester-
3672
protein-
ITGA1
integrin, alpha 1


dence

ITGA1


coding

ase 4D, cAMP-

coding











specific






Provi-
+chr17:5250220->+chr17:11532734
RABEP1->
RABEP1
9135
protein-
RABEP1
rabaptin, RAB
1770
protein-
DNAH9
dynein, axonemal,


dence

DNAH9


coding

GTPase binding

coding

heavy chain 9









effector protein 1






Provi-
+chr17:5250220->+chr17:11532734
RABEP1->
RABEP1
9135
protein-
RABEP1
rabaptin, RAB
1770
protein-
DNAH9
dynein, axonemal,


dence

DNAH9


coding

GTPase binding

coding

heavy chain 9









effector protein 1






Provi-
+chr11:36057799->−chr10:62039397
LDLRAD3->
LDLRAD3
143458
protein-
LDLRAD3
low density
288
protein-
ANK3
ankyrin 3, node


dence

ANK3


coding

lipoprotein

coding

of Ranvier









receptor class



(ankyrin G)









A domain













containing 3






Provi-
+chr12:51034635->−chr13:45379166
DIP2B->
DIP2B
57609
protein-
DIP2B
DIP2 disco-
144817
miscRNA
LINC00330
long intergenic non-


dence

LINC00330


coding

interacting



protein coding









protein 2



RNA 330









homolog B













(Drosophila)






Provi-
−chr18:77710724->−chr13:45379166
PQLC1->
PQLC1
80148
protein-
PQLC1
PQ loop repeat
144817
miscRNA
LINC00330
long intergenic non-


dence

LINC00330


coding

containing 1



protein coding













RNA 330


Provi-
−chr18:77710724->−chr13:45379166
PQLC1->
PQLC1
80148
protein-
PQLC1
PQ loop repeat
144817
miscRNA
LINC00330
long intergenic non-


dence

LINC00330


coding

containing 1



protein coding













RNA 330


Provi-
+chr3:100438902->+chr3:100348442
TFG->
TFG
10342
protein-
TFG
TRK-fused gene
84873
protein-
GPR128
G protein-coupled


dence

GPR128


coding



coding

receptor 128


Provi-
+chr3:100438902->+chr3:100348442
TFG->
TFG
10342
protein-
TFG
TRK-fused gene
84873
protein-
GPR128
G protein-coupled


dence

GPR128


coding



coding

receptor 128


Provi-
+chr11:68133170->−chr11:62863578
LRP5->
LRP5
4041
protein-
LRP5
low density
283238
protein-
SLC22A24
solute carrier family


dence

SLC22A24


coding

lipoprotein

coding

22, member 24









receptor-related













protein 5






Provi-
+chr10:133761295->−chr10:91344222
PPP2R2D->
PPP2R2D
55844
protein-
PPP2R2D
protein
53354
protein-
PANK1
pantothenate


dence

PANK1


coding

phosphatase 2,

coding

kinase 1









regulatory













subunit B, delta






Provi-
+chr15:99442850->+chr18:50278424
IGF1R->
IGF1R
3480
protein-
IGF1R
insulin-like
1630
protein-
DCC
deleted in


dence

DCC


coding

growth factor

coding

coloretal carcinoma









1 receptor






Provi-
−chr20:16553874->+chr20:17240885
KIF16B->
KIF16B
55614
protein-
KIF16B
kinesin family
5126
protein-
PCSK2
proprotein convertase


dence

PCSK2


coding

member 16B

coding

subtilisin/kexin type 2


Provi-
+chr2:223725976->+chr2:223553063
ACSL3->
ACSL3
2181
protein-
ACSL3
acyl-CoA
116255
protein-
MOGAT1
monoacylglycerol


dence

MOGAT1


coding

synthetase

coding

O-acyltransferase 1









long-chain













family













member 3






Provi-
+chr17:37866134->−chr17:37949186
ERBB2->

2064
protein-
ERBB2
v-erb-b2
22806
protein-
IKZF3
IKAROS family


dence

IKZF3


coding

erythroblastic

coding

zinc finger 3









leukemia viral



(Aiolos)









oncogene homolog













2, neuro/glio-













blastoma













derived oncogene













homolog (avian)






Provi-
+chr17:37868300->−chr17:37949186
ERBB2->

2064
protein-
ERBB2
v-erb-b2
22806
protein-
IKZF3
IKAROS family


dence

IKZF3


coding

erythroblastic

coding

zinc finger 3









leukemia viral



(Aiolos)









oncogene homolog













2, neuro/glio-













blastoma













derived oncogene













homolog (avian)






Provi-
−chr17:78120592->−chr21:45953806
EIF4A3->
EIF4A3
9775
protein-
EIF4A3
eukaryotic
54084
protein-
TSPEAR
thrombospondin-


dence

TSPEAR


coding

translation

coding

type laminin G









initiation



domain and EAR









factor 4A3



repeats


Provi-
+chr12:122473333->−chr12:103872225
BCL7A->
BCL7A
605
protein-
BCL7A
B-cell CLL/
374470
protein-
C12orf42
chromosome 12


dence

C12orf42


coding

lymphoma 7A

coding

open reading













frame 42


Provi-
+chr6:71123405->+chr6:123038932
FAM135A->
FAM135A
57579
protein-
FAM135A
family with
5570
protein-
PKIB
protein kinase


dence

PKIB


coding

sequence

coding

(cAMP-dependent,









similarity



catalytic) inhibitor









135, member A



beta


Provi-
−chr14:103523336->−chr4:152594048
CDC42BPB->
CDC42BPB
9578
protein-
CDC42BPB
CDC42 binding
5188
protein-
PET112
PET112 homolog


dence

PET112


coding

protein kinase

coding

(yeast)









beta (DMPK-like)






Provi-
−chr14:51131897->−chr14:51245522
SAV1->
SAV1
60485
protein-
SAV1
salvador homolog
51199
protein-
NIN
ninein (GSK3B


dence

NIN


coding

1 (Drosophila)

coding

interacting protein)


Provi-
−chr12:15370363->+chr19:547280
RERG->
RERG
85004
protein-
RERG
RAS-like,
3004
protein-
GZMM
granzyme M


dence

GZMM


coding

estrogen-regulated,

coding

(lymphocytemet-ase 1)









growth inhibitor






Provi-
−chr19:35989618->−chr19:35617921
DMKN->
DMKN
93099
protein-
DMKN
dermokine
163175
protein-
LGI4
leucine-rich repeat


dence

LGI4


coding



coding

LGI family, member 4


Provi-
−chr21:27326904->+chr21:30547033
APP->

351
protein-
APP
amyloid beta (A4)
56911
protein-
C21orf7
chromosome 21


dence

C21orf7


coding

precursor protein

coding

open reading frame 7


Provi-
+chr2:11680234->−chr2:9098771
GREB1->
GREB1
9687
protein-
GREB1
growth regulation
129642
protein-
MBOAT2
membrane bound O-


dence

MBOAT2


coding

by estrogen in

coding

acyltransferase









breast cancer 1



domain containing 2


Provi-
−chr12:116450602->−chr12:39764063
MED13L->
MED13L
23389
protein-
MED13L
mediator complex
55605
protein-
KIF21A
kinesin family


dence

KIF21A


coding

subunit 13-like

coding

member 21A


Provi-
−chr15:68695257->+chr17:80417868
ITGA11->
ITGA11
22801
protein-
ITGA11
integrin, alpha 11
26502
protein-
NARF
nuclear prelamin A


dence

NARF


coding



coding

recognition factor


Provi-
−chrX:122799493->+chrX:117676688
THOC2->
THOC2
57187
protein-
THOC2
THO complex 2
139818
protein-
DOCK11
dedicator of


dence

DOCK11


coding



coding

cytokinesis 11


Provi-
+chr10:31608221->+chr8:96166259
ZEB1->
ZEB1
6935
protein-
ZEB1
zinc finger
79666
protein-
PLEKHF2
pleckstrin homology


dence

PLEKHF2


coding

E-box binding

coding

domain containing,









homeobox 1



family F (with FYVE













domain) member 2


Provi-
−chr19:55610152->−chr11:1769349
PPP1R12C->
PPP1R12C
54776
protein-
PPP1R12C
protein
402778
protein-
IFITM10
interferon induced


dence

IFITM10


coding

phosphatase 1,

coding

transmembrane









regulatory



protein 10









subunit 12C






Provi-
−chr15:49059257->+chr15:90976951
CEP152->
CEP152
22995
protein-
CEP152
centrosomal
8826
protein-
IQGAP1
IQ motif containing


dence

IQGAP1


coding

protein 152 kDa

coding

GTPase activating













protein 1


Provi-
−chr1:169454801->−chr3:113442942
SLC19A2->
SLC19A2
10560
protein-
SLC19A2
solute carrier
80218
protein-
NAA50
N(alpha)-acetyl-


dence

NAA50


coding

family 19

coding

transferase 50,









(thiamine



NatE catalytic









transporter),



subunit









member 2






Provi-
+chr5:174905642->+chr5:110782384
SFXN1->
SFXN1
94081
protein-
SFXN1
sideroflexin 1
814
protein-
CAMK4
calcium/calmodulin-


dence

CAMK4


coding



coding

dependent protein













kinase IV


Provi-
+chr2:208435045->−chr2:98543950
CREB1->
CREB1
1385
protein-
CREB1
cAMP responsive
23505
protein-
TMEM131
transmembrane


dence

TMEM131


coding

element binding

coding

protein 131









protein 1






Provi-
+chr6:152129499->+chr6:151785588
ESR1->

2099
protein-
ESR1
estrogen
79624
protein-
C6orf211
chromosome 6 open


dence

C6orf211


coding

receptor 1

coding

reading frame 211


Provi-
+chr8:117779030->−chr8:117879000
UTP23->

84294
protein-
UTP23
UTP23, small
5885
protein-
RAD21
RAD21 homolog


dence

RAD21


coding

subunit (SSU)

coding

(S. pombe)









processome













component,













homolog (yeast)






Provi-
−chr16:87760371->−chr4:3526778
KLHDC4->
KLHDC4
54758
protein-
KLHDC4
kelch domain
4043
protein-
LRPAP1
low density lipoprotein


dence

LRPAP1


coding

containing 4

coding

receptor-related













protein associated













protein 1


Provi-
+chr7:7841374->+chr7:8043538
LOC729852->

729852
miscRNA
LOC729852
uncharacterized
113263
protein-
GLCCI1
glucocorticoid


dence

GLCCI1




LOC729852

coding

induced transcript 1


Provi-
+chr15:80750317->−chr15:81274523
ARNT2->
ARNT2
9915
protein-
ARNT2
aryl-hydrocarbon
23184
protein-
MESDC2
mesoderm


dence

MESDC2


coding

receptor nuclear

coding

development









translocator 2



candidate 2


Provi-
+chr8:18067689->+chr8:38099768
NAT1->

9
protein-
NAT1
N-acetyltrans-
23259
protein-
DDHD2
DDHD domain


dence

DDHD2


coding

ferase 1

coding

containing 2









(arylamine N-













acetyltrans-













ferase)






Provi-
+chr9:129623018->−chr9:127818286
ZBTB34->
ZBTB34
403341
protein-
ZBTB34
zinc finger and BTB
286205
protein-
SCAI
suppressor of


dence

SCAI


coding

domain containing 34

coding

cancer cell invasion


Provi-
−chrX:76907604->−chrX:83419395
ATRX->
ATRX
546
protein-
ATRX
alphathalassemia/
27330
protein-
RPS6KA6
ribosomal protein


dence

RPS6KA6


coding

mental retardation

coding

S6 kinase, 90 kDa,









syndrome X-linked



polypeptide 6


Provi-
−chr9:103115054->−chr11:85742653
TEX10->
TEX10
54881
protein-
TEX10
testis
8301
protein-
PICALM
phosphatidylinositol


dence

PICALM


coding

expressed 10

coding

binding clathrin













assembly protein


Provi-
+chr3:14960340->−chr4:75673359
FGD5->
FGD5
152273
protein-
FGD5
FYVE, RhoGEF
685
protein-
BTC
betacellulin


dence

BTC


coding

and PH domain

coding











containing 5






Provi-
−chr3:42744071->−chr17:73328878
HHATL->
HHATL
57467
protein-
HHATL
hedgehog
2885
protein-
GRB2
growth factor


dence

GRB2


coding

acyltransferase-

coding

receptor-









like



bound protein 2


Provi-
−chr6:117923167->+chr6:126359851
GOPC->
GOPC
57120
protein-
GOPC
golgi-associated
60487
protein-
TRMT11
tRNA


dence

TRMT11


coding

PDZ and coiled-

coding

methyltransferase









coil motif



11 homolog









containing



(S. cerevisiae)


Provi-
−chr9:14693227->−chr13:31037831
ZDHHC21->

340481
protein-
ZDHHC21
zinc finger,
3146
protein-
HMGB1
high mobility group


dence

HMGB1


coding

DHHC-type

coding

box 1









containing 21






Provi-
+chr9:95821112->−chr8:95511734
SUSD3->
SUSD3
203328
protein-
SUSD3
sushi domain
25962
protein-
KIAA1429
KIAA1429


dence

KIAA1429


coding

containing 3

coding




Provi-
+chr7:56032394->−chr7:82595803
GBAS->
GBAS
2631
protein-
GBAS
glioblastoma
27445
protein-
PCLO
piccolo (presynaptic


dence

PCLO


coding

amplified sequence

coding

cytomatrix protein)


Provi-
+chr6:158244478->+chr8:61531139
SNX9->

51429
protein-
SNX9
sorting nexin 9
5862
protein-
RAB2A
RAB2A, member RAS


dence

RAB2A


coding



coding

oncogene family


Provi-
−chr10:101769595->+chr10:123954555
DNMBP->
DNMBP
23268
protein-
DNMBP
dynamin binding
10579
protein-
TACC2
transforming, acidic


dence

TACC2


coding

protein

coding

coiled-coil













containing protein 2


Provi-
+chr6:7108001->+chr6:7555951
RREB1->
RREB1
6239
protein-
RREB1
ras responsive
1832
protein-
DSP
desmoplakin


dence

DSP


coding

element binding

coding











protein 1






Provi-
+chr1:111833572->+chr7:64291829
CHIA->

27159
protein-
CHIA
chitinase, acidic
7697
protein-
ZNF138
zinc finger


dence

ZNF138


coding



coding

protein 138


Provi-
−chr19:37956215->−chr7:96324203
ZNF569->
ZNF569
148266
protein-
ZNF569
zinc finger
7979
protein-
SHFM1
split hand/foot


dence

SHFM1


coding

protein 569

coding

malformation













(ectrodactyly) type 1


Rush
−chr17:57092971->+chr17:58786580
TRIM37->

4591
protein-
TRIM37
tripartite motif
54828
protein-
BCAS3
breast carcinoma




BCAS3


coding

containing 37

coding

amplified sequence 3


Rush
+chr22:22020420->+chr22:30064322
PPIL2->

23759
protein-
PPIL2
peptidyl
4771
protein-
NF2
neurofibromin 2




NF2


coding

prolylisomerase

coding

(merlin)









(cyclophilin)-













like 2






Rush
−chr1:53746259->−chr1:54275419
LRP8->

7804
protein-
LRP8
low density lipo-
55706
protein-
TMEM48
transmembrane




TMEM48


coding

protein receptor-

coding

protein 48









related protein 8,













apolipoproteine













receptor






Rush
+chr17:48797192->−chr17:36047395
LUC7L3->

51747
protein-
LUC7L3
LUC7-like 3
6928
protein-
HNF1B
HNF1 homeobox B




HNF1B


coding

(S. cerevisiae)

coding




Rush
−chr8:117878825->+chr8:124968232
RAD21->

5885
protein-
RAD21
RAD21 homolog
654463
protein-
FER1L6
fer-1-like 6




FER1L6


coding

(S. pombe)

coding

(C. elegans)


Rush
+chr3:100438902->+chr3:100348442
TFG->
TFG
10342
protein-
TFG
TRK-fused gene
84873
protein-
GPR128
G protein-coupled




GPR128


coding



coding

receptor 128


Rush
+chr17:33968994->−chr7:96115729
AP2B1->

163
protein-
AP2B1
adaptor-related
401388
protein-
FLJ42280
putative




FLJ42280


coding

protein complex

coding

uncharacterized









2, beta 1 subunit



protein FLJ42280


Rush
+chr3:100438902->+chr3:100348442
TFG->
TFG
10342
protein-
TFG
TRK-fused gene
84873
protein-
GPR128
G protein-coupled




GPR128


coding



coding

receptor 128


Rush
+chr4:71670133->+chr4:71337932
RUFY3->

22902
protein-
RUFY3
RUN and FYVE
4589
protein-
MUC7
mucin 7, secreted




MUC7


coding

domain containing 3

coding




Rush
+chr3:100438902->+chr3:100348442
TFG->
TFG
10342
protein-
TFG
TRK-fused gene
84873
protein-
GPR128
G protein-coupled




GPR128


coding



coding

receptor 128


Rush
+chr3:100438902->+chr3:100348442
TFG->
TFG
10342
protein-
TFG
TRK-fused gene
84873
protein-
GPR128
G protein-coupled




GPR128


coding



coding

receptor 128


Rush
+chr17:73521906->−chr3:131442469
LLGL2->

3993
protein-
LLGL2
lethal giant
131034
protein-
CPNE4
copine IV




CPNE4


coding

larvae homolog 2

coding











(Drosophila)






Rush
−chr2:97527316->+chr2:28561317
SEMA4C->
SEMA4C
54910
protein-
SEMA4C
sema domain,
9577
protein-
BRE
brain and




BRE


coding

immunoglobulin

coding

reproductive









domain (Ig),



organ-expressed









transmembrane



(TNFRSF1A









domain (TM) and



modulator)









short cytoplasmic













domain,













(semaphorin) 4C






Rush
+chr6:152201906->+chr6:151669846
ESR1->

2099
protein-
ESR1
estrogen
9590
protein-
AKAP12
A kinase (PRKA)




AKAP12


coding

receptor 1

coding

anchor protein 12


Rush
−chr17:58577760->+chr17:72345323
APPBP2->

10513
protein-
APPBP2
amyloid beta
124602
protein-
KIF19
kinesin family




KIF19


coding

precursor protein

coding

member 19









(cytoplasmic













tail) binding













protein 2






Rush
+chr8:38883403->−chr8:41585524
ADAM9->

8754
protein-
ADAM9
ADAM metallo-
286
protein-
ANK1
ankyrin 1,




ANK1


coding

peptidase domain 9

coding

erythrocytic


Rush
−chr17:27492960->−chr17:28120955
MYO18A->

399687
protein-
MYO18A
myosin
85464
protein-
SSH2
slingshot homolog 2




SSH2


coding

XVIIIA

coding

(Drosophila)


Rush
+chr7:30113748->−chr9:80537261
PLEKHA8->

84725
protein-
PLEKHA8
pleckstrin homology
2776
protein-
GNAQ
guanine nucleotide




GNAQ


coding

domain containing,

coding

binding protein









family A



(G protein),









(phosphoinositide



q polypeptide









binding specific)













member 8






Rush
−chr17:37453380->+chr17:44751780
FBXL20->

84961
protein-
FBXL20
F-box and
4905
protein-
NSF
N-ethylmaleimide-




NSF


coding

leucine-rich

coding

sensitive factor









repeat protein 20






Rush
−chr17:57094657->+chr17:58786580
TRIM37->

4591
protein-
TRIM37
tripartite motif
54828
protein-
BCAS3
breast carcinoma




BCAS3


coding

containing 37

coding

amplified sequence 3


Rush
−chr20:62421174->+chr20:62559688
ZBTB46->

140685
protein-
ZBTB46
zinc finger and
80331
protein-
DNAJC5
DnaJ (Hsp40)




DNAJC5


coding

BTB domain

coding

homolog, subfamily









containing 46



C, member 5


Rush
+chr6:152201906->+chr6:151669846
ESR1->
ESR1
2099
protein-
ESR1
estrogen receptor 1
9590
protein-
AKAP12
A kinase (PRKA)




AKAP12


coding



coding

anchor protein 12


















TABLE B





Gene

SEQ


Fusion
Chimera Sequence
ID NO.







LRP5->
ATGGAGGCAGCGCCGCCCGGGCCGCCGTGGCCGCTGCTGCTGCTGCTGCT
SEQ ID


KAT6A
GCTGCTGCTGGCGCTGTGCGGCTGCCCGGCCCCCGCCGCGGATGGTAAAA
NO: 1



CTCGCAAACCCGCTTTATACTGAGTGGATTTTGGAGGCCATCAAAAAAGT



GAAAAAGCAGAAACAGCGTCCTTCAGAAGAAAGGATATGCAATGCTGTG



TCTTCATCCCATGGCTTGGATCGTAAAACTGTTTTAGAACAATTGGAGTTG



AGTGTTAAAGATGGAACAATTTTAAAAGTCTCAAATAAAGGACTCAATTC



CTATAAAGATCCTGATAATCCTGGGCGAATAGCACTTCCTAAGCCTCGGA



ACCATGGAAAATTGGATAATAAACAAAATGTGGATTGGAATAAACTGAT



AAAGCGGGCAGTTGAGGGCTTGGCAGAGTCTGGTGGCTCAACTTTGAAA



AGCATTGAACGTTTTTTGAAAGGTCAGAAGGATGTGTCTGCATTATTCGG



AGGCAGTGCTGCCTCTGGCTTTCACCAGCAGTTACGATTGGCTATCAAAC



GTGCCATTGGCCACGGCAGACTCCTTAAAGATGGACCTCTTTATCGGCTC



AACACTAAAGCAACCAACGTGGATGGGAAAGAGAGTTGTGAGTCTCTTT



CCTGTTTACCTCCAGTGTCCCTTCTTCCACATGAAAAGGATAAGCCGGTTG



CTGAACCAATCCCCATCTGTAGTTTCTGTCTTGGTACAAAAGAACAAAAC



CGAGAAAAGAAGCCAGAGGAACTCATCTCCTGTGCCGACTGTGGCAACA



GTGGCCATCCATCCTGTTTAAAGTTTTCCCCTGAACTAACGGTTCGAGTGA



AGGCCTTACGGTGGCAGTGCATCGAGTGTAAAACATGCAGCTCCTGTCGA



GATCAAGGCAAAAATGCGGATAACATGCTCTTTTGTGATTCATGTGACCG



AGGTTTTCACATGGAGTGTTGTGATCCGCCACTCACCCGTATGCCAAAAG



GCATGTGGATATGTCAAATATGTCGACCTAGGAAAAAAGGACGAAAACT



TCTACAAAAGAAGGCAGCACAGATAAAACGGCGCTATACTAATCCAATA



GGACGTCCAAAAAACAGGTTAAAGAAACAAAACACGGTATCAAAAGGTC



CCTTCAGCAAAGTTCGAACTGGCCCTGGAAGGGGTAGGAAACGAAAAAT



CACTCTTTCCAGCCAATCAGCATCATCATCATCAGAAGAAGGATATTTAG



AGCGGATAGATGGCTTGGACTTCTGCAGAGATAGCAATGTCTCCTTGAAG



TTCAACAAGAAAACCAAAGGGCTCATTGATGGCCTTACCAAATTTTTTAC



CCCTTCCCCTGATGGGCGGAAAGCTCGGGGGGAAGTGGTGGACTACTCTG



AGCAATATCGAATCAGAAAGAGGGGCAACAGGAAATCAAGCACTTCAGA



TTGGCCCACAGACAATCAGGATGGCTGGGATGGCAAACAAGAAAATGAG



GAGCGACTTTTTGGGAGCCAGGAAATCATGACTGAGAAAGATATGGAAT



TATTTCGTGATATCCAAGAACAAGCACTGCAGAAAGTTGGAGTGACTGGT



CCCCCTGATCCACAAGTCCGCTGTCCCTCTGTCATTGAGTTTGGGAAGTAT



GAAATTCACACCTGGTACTCCTCCCCATATCCTCAAGAATACTCAAGGCT



GCCCAAATTGTATCTTTGTGAATTTTGTCTAAAATATATGAAAAGTAGAA



CTATTCTGCAGCAGCACATGAAGAAATGTGGTTGGTTCCATCCTCCTGCC



AATGAGATTTACAGAAAGAATAATATTTCTGTCTTTGAGGTTGATGGGAA



TGTGAGTACCATTTATTGTCAAAACCTGTGTCTTTTGGCAAAGTTGTTTCT



TGACCACAAAACCCTCTATTACGATGTGGAGCCATTTCTTTTTTATGTACT



AACACAGAATGATGTCAAGGGCTGCCACCTTGTTGGCTACTTTTCTAAGG



AAAAGCACTGCCAACAGAAGTACAATGTTTCCTGTATAATGATTCTTCCT



CAATACCAGCGTAAGGGCTATGGCAGGTTTCTCATCGATTTCAGTTATTT



GTTATCAAAGCGTGAAGGCCAAGCAGGGTCTCCAGAGAAACCGTTATCT



GATCTGGGTCGTCTTTCCTACATGGCATATTGGAAAAGTGTAATATTGGA



GTGCCTTTATCACCAAAATGACAAGCAGATCAGCATTAAGAAGTTAAGCA



AGTTGACTGGAATCTGCCCTCAAGACATCACTTCCACACTCCACCACCTA



CGAATGCTGGACTTCCGTAGTGACCAATTTGTGATTATCCGCCGGGAAAA



ACTTATCCAGGATCACATGGCAAAGCTTCAGCTGAATTTGCGACCTGTAG



ATGTAGATCCAGAATGTTTGCGCTGGACTCCAGTCATAGTGTCCAACTCT



GTGGTCTCAGAGGAGGAAGAAGAGGAGGCTGAGGAAGGAGAAAACGAA



GAGCCACAGTGCCAGGAAAGAGAATTAGAGATCAGTGTGGGAAAGTCTG



TGTCTCATGAGAACAAAGAACAAGATTCTTATTCAGTAGAAAGTGAAAA



GAAACCAGAAGTTATGGCTCCAGTCAGTTCTACACGTTTGAGCAAACAAG



TCCTTCCTCATGATAGTCTTCCTGCAAATAGCCAGCCATCTCGGAGGGGC



CGCTGGGGGAGGAAGAACAGAAAAACCCAGGAACGTTTTGGTGATAAAG



ATTCTAAACTGCTCTTGGAAGAGACGTCTTCAGCTCCTCAGGAACAATAT



GGAGAATGTGGGGAGAAATCAGAAGCCACCCAGGAACAATACACTGAAA



GTGAAGAACAGCTGGTGGCTTCTGAGGAGCAGCCAAGCCAGGACGGGAA



ACCTGACCTTCCCAAGAGAAGACTCAGTGAGGGGGTTGAGCCCTGGCGA



GGACAGCTCAAGAAAAGCCCTGAGGCTCTGAAGTGCAGATTAACAGAAG



GAAGTGAGAGGCTGCCCCGTCGCTACAGTGAGGGTGACAGGGCTGTCCT



CAGGGGCTTCAGTGAGAGCAGCGAGGAGGAGGAGGAGCCGGAAAGCCC



TCGGTCAAGCTCGCCACCAATTCTCACAAAGCCCACGCTGAAGCGAAAG



AAACCATTTCTCCACCGAAGGAGGAGAGTCCGAAAGCGCAAACACCACA



ATAGCAGTGTAGTCACAGAAACTATTTCTGAGACCACTGAAGTGTTAGAT



GAACCTTTTGAAGATTCTGACTCCGAGAGGCCAATGCCAAGATTAGAACC



CACGTTTGAGATCGATGAAGAAGAGGAGGAAGAGGATGAAAATGAACTT



TTCCCTAGAGAATACTTCCGTCGTTTGTCTTCGCAGGATGTACTCAGGTGT



CAGTCCTCTTCTAAGAGGAAGTCTAAAGATGAAGAAGAAGATGAAGAGT



CAGATGATGCTGATGACACTCCTATCTTAAAGCCAGTATCTCTTTTGCGA



AAACGTGATGTGAAGAATTCTCCTCTTGAGCCAGATACATCCACACCTTT



GAAAAAGAAAAAGGGATGGCCCAAAGGCAAGAGCCGCAAACCAATCCA



CTGGAAGAAAAGACCTGGTCGAAAACCAGGATTTAAGTTGAGTCGGGAA



ATCATGCCAGTTTCTACTCAAGCATGCGTCATTGAGCCCATCGTTTCCATT



CCTAAAGCTGGACGTAAACCCAAGATCCAGGAGAGTGAAGAAACTGTTG



AGCCAAAAGAAGACATGCCCCTACCCGAGGAGAGGAAGGAGGAGGAGG



AGATGCAAGCAGAGGCAGAAGAGGCTGAAGAGGGTGAGGAAGAGGATG



CAGCCAGCAGTGAAGTCCCAGCAGCCTCTCCAGCAGACAGCAGCAATAG



TCCTGAGACCGAAACCAAGGAGCCTGAGGTGGAGGAGGAAGAAGAGAA



GCCCCGTGTCTCAGAGGAGCAGAGGCAGTCAGAGGAGGAGCAGCAGGAA



TTAGAGGAGCCAGAGCCAGAGGAGGAGGAAGATGCAGCTGCAGAGACT



GCCCAGAATGACGACCACGACGCTGATGATGAGGATGATGGCCACCTGG



AGTCCACAAAGAAAAAGGAGCTAGAGGAACAGCCCACGAGGGAAGATG



TCAAGGAGGAGCCTGGTGTTCAAGAGTCTTTTTTAGATGCTAATATGCAG



AAGAGTAGGGAAAAGATAAAGGATAAAGAGGAAACCGAGCTGGATTCC



GAAGAGGAGCAGCCTTCCCATGACACGTCCGTGGTGTCAGAGCAGATGG



CTGGGTCTGAGGACGACCACGAAGAAGACTCCCACACTAAGGAAGAGTT



AATCGAATTAAAAGAGGAGGAAGAGATTCCTCATAGTGAGCTGGATCTG



GAAACTGTACAGGCAGTGCAGTCTTTGACTCAAGAAGAAAGCAGTGAGC



ATGAGGGCGCCTACCAGGACTGTGAGGAAACTCTTGCGGCGTGTCAGAC



CCTGCAGAGTTACACCCAGGCTGACGAGGACCCTCAGATGTCCATGGTTG



AAGACTGTCATGCGTCAGAACATAATAGCCCTATCTCCTCCGTTCAGTCT



CACCCCAGCCAGTCAGTCCGTTCGGTCAGCAGTCCCAACGTGCCTGCCCT



TGAGAGTGGCTACACCCAGATCAGCCCAGAACAAGGATCCCTGTCCGCA



CCCTCTATGCAGAACATGGAGACCAGCCCCATGATGGATGTGCCTTCCGT



ATCAGACCACTCTCAGCAGGTGGTGGACAGCGGCTTCAGTGACCTGGGCA



GCATTGAGAGCACCACTGAAAACTATGAGAACCCAAGCAGTTACGACTC



CACGATGGGCGGCAGCATCTGTGGGAACAGCTCTTCCCAGAGCAGCTGCT



CCTACGGTGGGCTGTCGTCCTCCAGCAGCCTCACCCAGAGCAGCTGTGTG



GTCACTCAGCAGATGGCCAGCATGGGCAGCAGCTGCAGCATGATGCAGC



AGAGCAGCGTCCAGCCTGCTGCCAACTGCAGCATCAAGTCACCTCAGAGC



TGCGTGGTGGAGAGGCCTCCCAGTAACCAGCAGCAGCAGCCGCCACCAC



CGCCTCCACAGCAGCCACAGCCGCCGCCGCCACAACCACAACCAGCACC



ACAGCCTCCACCACCCCAGCAGCAGCCGCAACAGCAGCCGCAGCCTCAG



CCCCAGCAGCCTCCACCCCCACCCCCTCCCCAGCAGCAGCCCCCGCTGTC



ACAGTGTAGTATGAATAACAGTTTCACCCCAGCTCCTATGATCATGGAGA



TACCAGAATCTGGAAGCACTGGGAACATAAGTATCTATGAGAGGATTCC



AGGGGATTTTGGTGCCGGCAGCTACTCTCAACCATCAGCCACCTTCAGCC



TAGCCAAGCTGCAGCAGCTGACCAACACCATTATGGACCCTCATGCCATG



CCTTATAGCCATTCTCCTGCTGTGACTTCCTATGCAACCAGTGTTTCTCTG



TCCAATACAGGACTGGCTCAGCTGGCTCCATCTCATCCCTTAGCTGGGAC



TCCTCAAGCACAAGCCACCATGACGCCACCCCCAAACTTGGCATCCACTA



CCATGAACCTCACATCTCCTCTGCTTCAGTGCAACATGTCTGCCACCAAC



ATTGGCATTCCTCACACGCAGAGATTGCAAGGGCAAATGCCAGTGAAGG



GGCACATTTCCATCCGCTCCAAGTCTGCGCCACTGCCCTCTGCGGCTGCTC



ACCAGCAGCAGCTGTATGGCCGTAGCCCATCGGCAGTTGCCATGCAGGCT



GGCCCTCGCGCACTGGCTGTTCAGCGTGGCATGAACATGGGGGTTAATCT



GATGCCTACTCCCGCCTATAATGTCAATTCCATGAATATGAACACCTTGA



ATGCCATGAACAGCTATCGAATGACACAGCCCATGATGAACAGCAGTTA



CCATAGTAACCCTGCCTACATGAACCAGACAGCACAGTATCCTATGCAGA



TGCAGATGGGAATGATGGGGAGCCAGGCCTATACCCAGCAGCCTATGCA



GCCTAACCCTCATGGGAACATGATGTACACAGGCCCCTCCCATCACAGCT



ACATGAACGCTGCTGGCGTGCCCAAGCAGTCACTCAACGGACCTTACATG



AGAAGATGA





ESR1->
ATGACCATGACCCTCCACACCAAAGCATCTGGGATGGCCCTACTGCATCA
SEQ ID


AKAP12
GATCCAAGGGAACGAGCTGGAGCCCCTGAACCGTCCGCAGCTCAAGATC
NO: 2



CCCCTGGAGCGGCCCCTGGGCGAGGTGTACCTGGACAGCAGCAAGCCCG



CCGTGTACAACTACCCCGAGGGCGCCGCCTACGAGTTCAACGCCGCGGCC



GCCGCCAACGCGCAGGTCTACGGTCAGACCGGCCTCCCCTACGGCCCCGG



GTCTGAGGCTGCGGCGTTCGGCTCCAACGGCCTGGGGGGTTTCCCCCCAC



TCAACAGCGTGTCTCCGAGCCCGCTGATGCTACTGCACCCGCCGCCGCAG



CTGTCGCCTTTCCTGCAGCCCCACGGCCAGCAGGTGCCCTACTACCTGGA



GAACGAGCCCAGCGGCTACACGGTGCGCGAGGCCGGCCCGCCGGCATTC



TACAGGCCAAATTCAGATAATCGACGCCAGGGTGGCAGAGAAAGATTGG



CCAGTACCAATGACAAGGGAAGTATGGCTATGGAATCTGCCAAGGAGAC



TCGCTACTGTGCAGTGTGCAATGACTATGCTTCAGGCTACCATTATGGAG



TCTGGTCCTGTGAGGGCTGCAAGGCCTTCTTCAAGAGAAGTATTCAAGGA



CATAACGACTATATGTGTCCAGCCACCAACCAGTGCACCATTGATAAAAA



CAGGAGGAAGAGCTGCCAGGCCTGCCGGCTCCGTAAATGCTACGAAGTG



GGAATGATGAAAGGTGGGATACGAAAAGACCGAAGAGGAGGGAGAATG



TTGAAACACAAGCGCCAGAGAGATGATGGGGAGGGCAGGGGTGAAGTG



GGGTCTGCTGGAGACATGAGAGCTGCCAACCTTTGGCCAAGCCCGCTCAT



GATCAAACGCTCTAAGAAGAACAGCCTGGCCTTGTCCCTGACGGCCGACC



AGATGGTCAGTGCCTTGTTGGATGCTGAGCCCCCGATACTCTATTCCGAG



TATGATCCTACCAGACCCTTCAGTGAAGCTTCGATGATGGGCTTACTGAC



CAACCTGGCAGACAGGGAGCTGGTTCACATGATCAACTGGGCGAAGAGG



GTGCCAGTTGGACAGAGAGACTCTGAAGATGTGAGCAAAAGAGACTCCG



ATAAAGAGATGGCTACTAAGTCAGCGGTTGTTCACGACATCACAGATGAT



GGGCAGGAGGAGACACCCGAAATAATCGAACAGATTCCTTCTTCAGAAA



GCAATTTAGAAGAGCTAACACAACCCACTGAGTCCCAGGCTAATGATATT



GGATTTAAGAAGGTGTTTAAGTTTGTTGGCTTTAAATTCACTGTGAAAAA



GGATAAGACAGAGAAGCCTGACACTGTCCAGCTACTCACTGTGAAGAAA



GATGAAGGGGAGGGAGCAGCAGGGGCTGGCGACCACAAGGACCCCAGC



CTTGGGGCTGGAGAAGCAGCATCCAAAGAAAGCGAACCCAAACAATCTA



CAGAGAAACCCGAAGAGACCCTGAAGCGTGAGCAAAGCCACGCAGAAAT



TTCTCCCCCAGCCGAATCTGGCCAAGCAGTGGAGGAATGCAAAGAGGAA



GGAGAAGAGAAACAAGAAAAAGAACCTAGCAAGTCTGCAGAATCTCCGA



CTAGTCCCGTGACCAGTGAAACAGGATCAACCTTCAAAAAATTCTTCACT



CAAGGTTGGGCCGGCTGGCGCAAAAAGACCAGTTTCAGGAAGCCGAAGG



AGGATGAAGTGGAAGCTTCAGAGAAGAAAAAGGAACAAGAGCCAGAAA



AAGTAGACACAGAAGAAGACGGAAAGGCAGAGGTTGCCTCCGAGAAACT



GACCGCCTCCGAGCAAGCCCACCCACAGGAGCCGGCAGAAAGTGCCCAC



GAGCCCCGGTTATCAGCTGAATATGAGAAAGTTGAGCTGCCCTCAGAGG



AGCAAGTCAGTGGCTCGCAGGGACCTTCTGAAGAGAAACCTGCTCCGTTG



GCGACAGAAGTGTTTGATGAGAAAATAGAAGTCCACCAAGAAGAGGTTG



TGGCCGAAGTCCACGTCAGCACCGTGGAGGAGAGAACCGAAGAGCAGAA



AACGGAGGTGGAAGAAACAGCAGGGTCTGTGCCAGCTGAAGAATTGGTT



GAAATGGATGCAGAACCTCAGGAAGCTGAACCTGCCAAGGAGCTGGTGA



AGCTCAAAGAAACGTGTGTTTCCGGAGAGGACCCTACACAGGGAGCTGA



CCTCAGTCCTGATGAGAAGGTGCTGTCCAAACCCCCCGAAGGCGTTGTGA



GTGAGGTGGAAATGCTGTCATCACAGGAGAGAATGAAGGTGCAGGGAAG



TCCACTAAAGAAGCTTTTTACCAGCACTGGCTTAAAAAAGCTTTCTGGAA



AGAAACAGAAAGGGAAAAGAGGAGGAGGAGACGAGGAATCAGGGGAGC



ACACTCAGGTTCCAGCCGATTCTCCGGACAGCCAGGAGGAGCAAAAGGG



CGAGAGCTCTGCCTCATCCCCTGAGGAGCCCGAGGAGATCACGTGTCTGG



AAAAGGGCTTAGCCGAGGTGCAGCAGGATGGGGAAGCTGAAGAAGGAG



CTACTTCCGATGGAGAGAAAAAAAGAGAAGGTGTCACTCCCTGGGCATC



ATTCAAAAAGATGGTGACGCCCAAGAAGCGTGTTAGACGGCCTTCGGAA



AGTGATAAAGAAGATGAGCTGGACAAGGTCAAGAGCGCTACCTTGTCTT



CCACCGAGAGCACAGCCTCTGAAATGCAAGAAGAAATGAAAGGGAGCGT



GGAAGAGCCAAAGCCGGAAGAACCAAAGCGCAAGGTGGATACCTCAGTA



TCTTGGGAAGCTTTAATTTGTGTGGGATCATCCAAGAAAAGAGCAAGGAG



AGGGTCCTCTTCTGATGAGGAAGGGGGACCAAAAGCAATGGGAGGAGAC



CACCAGAAAGCTGATGAGGCCGGAAAAGACAAAGAGACGGGGACAGAC



GGGATCCTTGCTGGTTCCCAAGAACATGATCCAGGGCAGGGAAGTTCCTC



CCCGGAGCAAGCTGGAAGCCCTACCGAAGGGGAGGGCGTTTCCACCTGG



GAGTCATTTAAAAGGTTAGTCACGCCAAGAAAAAAATCAAAGTCCAAGC



TGGAAGAGAAAAGCGAAGACTCCATAGCTGGGTCTGGTGTAGAACATTC



CACTCCAGACACTGAACCCGGTAAAGAAGAATCCTGGGTCTCAATCAAG



AAGTTTATTCCTGGACGAAGGAAGAAAAGGCCAGATGGGAAACAAGAAC



AAGCCCCTGTTGAAGACGCAGGGCCAACAGGGGCCAACGAAGATGACTC



TGATGTCCCGGCCGTGGTCCCTCTGTCTGAGTATGATGCTGTAGAAAGGG



AGAAAATGGAGGCACAGCAAGCCCAAAAAAGCGCAGAGCAGCCCGAGC



AGAAGGCAGCCACTGAGGTGTCCAAGGAGCTCAGCGAGAGTCAGGTTCA



TATGATGGCAGCAGCTGTCGCTGACGGGACGAGGGCAGCTACCATTATTG



AAGAAAGGTCTCCTTCTTGGATATCTGCTTCAGTGACAGAACCTCTTGAA



CAAGTAGAAGCTGAAGCCGCACTGTTAACTGAGGAGGTATTGGAAAGAG



AAGTAATTGCAGAAGAAGAACCCCCCACGGTTACTGAACCTCTGCCAGA



GAACAGAGAGGCCCGGGGCGACACGGTCGTTAGTGAGGCGGAATTGACC



CCCGAAGCTGTGACAGCTGCAGAAACTGCAGGGCCATTGGGTGCCGAAG



AAGGAACCGAAGCATCTGCTGCTGAAGAGACCACAGAAATGGTGTCAGC



AGTCTCCCAGTTAACCGACTCCCCAGACACCACAGAGGAGGCCACTCCGG



TGCAGGAGGTGGAAGGTGGCGTACCTGACATAGAAGAGCAAGAGAGGCG



GACTCAAGAGGTCCTCCAGGCAGTGGCAGAAAAAGTGAAAGAGGAATCC



CAGCTGCCTGGCACCGGTGGGCCAGAAGATGTGCTTCAGCCTGTGCAGAG



AGCAGAGGCAGAAAGACCAGAAGAGCAGGCTGAAGCGTCGGGTCTGAA



GAAAGAGACGGATGTAGTGTTGAAAGTAGATGCTCAGGAGGCAAAAACT



GAGCCTTTTACACAAGGGAAGGTGGTGGGGCAGACCACCCCAGAAAGCT



TTGAAAAAGCTCCTCAAGTCACAGAGAGCATAGAGTCCAGTGAGCTTGTA



ACCACTTGTCAAGCCGAAACCTTAGCTGGGGTAAAATCACAGGAGATGG



TGATGGAACAGGCTATCCCCCCTGACTCGGTGGAAACCCCTACAGACAGT



GAGACTGATGGAAGCACCCCCGTAGCCGACTTTGACGCACCAGGCACAA



CCCAGAAAGACGAGATTGTGGAAATCCATGAGGAGAATGAGGTCGCATC



TGGTACCCAGTCAGGGGGCACAGAAGCAGAGGCAGTTCCTGCACAGAAA



GAGAGGCCTCCAGCACCTTCCAGTTTTGTGTTCCAGGAAGAAACTAAAGA



ACAATCAAAGATGGAAGACACTCTAGAGCATACAGATAAAGAGGTGTCA



GTGGAAACTGTATCCATTCTGTCAAAGACTGAGGGGACTCAAGAGGCTG



ACCAGTATGCTGATGAGAAAACCAAAGACGTACCATTTTTCGAAGGACTT



GAGGGGTCTATAGACACAGGCATAACAGTCAGTCGGGAAAAGGTCACTG



AAGTTGCCCTTAAAGGTGAAGGGACAGAAGAAGCTGAATGTAAAAAGGA



TGATGCTCTTGAACTGCAGAGTCACGCTAAGTCTCCTCCATCCCCCGTGG



AGAGAGAGATGGTAGTTCAAGTCGAAAGGGAGAAAACAGAAGCAGAGC



CAACCCATGTGAATGAAGAGAAGCTTGAGCACGAAACAGCTGTTACCGT



ATCTGAAGAGGTCAGTAAGCAGCTCCTCCAGACAGTGAATGTGCCCATCA



TAGATGGGGCAAAGGAAGTCAGCAGTTTGGAAGGAAGCCCTCCTCCCTG



CCTAGGTCAAGAGGAGGCAGTATGCACCAAAATTCAAGTTCAGAGCTCT



GAGGCATCATTCACTCTAACAGCGGCTGCAGAGGAGGAAAAGGTCTTAG



GAGAAACTGCCAACATTTTAGAAACAGGTGAAACGTTGGAGCCTGCAGG



TGCACATTTAGTTCTGGAAGAGAAATCCTCTGAAAAAAATGAAGACTTTG



CCGCTCATCCAGGGGAAGATGCTGTGCCCACAGGGCCCGACTGTCAGGC



AAAATCGACACCAGTGATAGTATCTGCTACTACCAAGAAAGGCTTAAGTT



CCGACCTGGAAGGAGAGAAAACCACATCACTGAAGTGGAAGTCAGATGA



AGTCGATGAGCAGGTTGCTTGCCAGGAGGTCAAAGTGAGTGTAGCAATT



GAGGATTTAGAGCCTGAAAATGGGATTTTGGAACTTGAGACCAAAAGCA



GTAAACTTGTCCAAAACATCATCCAGACAGCCGTTGACCAGTTTGTACGT



ACAGAAGAAACAGCCACCGAAATGTTGACGTCTGAGTTACAGACACAAG



CTCACGTGATAAAAGCTGACAGCCAGGACGCTGGACAGGAAACGGAGAA



AGAAGGAGAGGAACCTCAGGCCTCTGCACAGGATGAAACACCAATTACT



TCAGCCAAAGAGGAGTCAGAGTCAACCGCAGTGGGACAAGCACATTCTG



ATATTTCCAAAGACATGAGTGAAGCCTCAGAAAAGACCATGACTGTTGA



GGTAGAAGGTTCCACTGTAAATGATCAGCAGCTGGAAGAGGTCGTCCTCC



CATCTGAGGAAGAGGGAGGTGGAGCTGGAACAAAGTCTGTGCCAGAAGA



TGATGGTCATGCCTTGTTAGCAGAAAGAATAGAGAAGTCACTAGTTGAAC



CGAAAGAAGATGAAAAAGGTGATGATGTTGATGACCCTGAAAACCAGAA



CTCAGCCCTGGCTGATACTGATGCCTCAGGAGGCTTAACCAAAGAGTCCC



CAGATACAAATGGACCAAAACAAAAAGAGAAGGAGGATGCCCAGGAAG



TAGAATTGCAGGAAGGAAAAGTGCACAGTGAATCAGATAAAGCGATCAC



ACCCCAAGCACAGGAGGAGTTACAGAAACAAGAGAGAGAATCTGCAAA



GTCAGAACTTACAGAATCTTAA





SEMA4C
ATGGCCCCACACTGGGCTGTCTGGCTGCTGGCAGCAAGGCTGTGGGGCCT
SEQ ID


->
GGGCATTGGGGCTGAGGTGTGGTGGAACCTTGTGCCGCGTAAGACAGTGT
NO: 3


RBMS1
CTTCTGGGGAGCTGGCCACGGTAGTACGGCGGTTCTCCCAGACCGGCATC



CAGGACTTCCTGACACTGACGCTGACGGAGCCCACTGGGCTTCTGTACGT



GGGCGCCCGAGAGGCCCTGTTTGCCTTCAGCATGGAGGCCCTGGAGCTGC



AAGGAGCGATCTCCTGGGAGGCCCCCGTGGAGAAGAAGACTGAGTGTAT



CCAGAAAGGGAAGAACAACCAGACCGAGTGCTTCAACTTCATCCGCTTCC



TGCAGCCCTACAATGCCTCCCACCTGTACGTCTGTGGCACCTACGCCTTCC



AGCCCAAGTGCACCTACGTCAACATGCTCACCTTCACTTTGGAGCATGGA



GAGTTTGAAGATGGGAAGGGCAAGTGTCCCTATGACCCAGCTAAGGGCC



ATGCTGGCCTTCTTGTGGATGGTGAGCTGTACTCGGCCACACTCAACAAC



TTCCTGGGCACGGAACCCATTATCCTGCGTAACATGGGGCCCCACCACTC



CATGAAGACAGAGTACCTGGCCTTTTGGCTCAACGAACCTCACTTTGTAG



GCTCTGCCTATGTACCTGAGAGTGTGGGCAGCTTCACGGGGGACGACGAC



AAGGTCTACTTCTTCTTCAGGGAGCGGGCAGTGGAGTCCGACTGCTATGC



CGAGCAGGTGGTGGCTCGTGTGGCCCGTGTCTGCAAGGGCGATATGGGG



GGCGCACGGACCCTGCAGAGGAAGTGGACCACGTTCCTGAAGGCGCGGC



TGGCATGCTCTGCCCCGAACTGGCAGCTCTACTTCAACCAGCTGCAGGCG



ATGCACACCCTGCAGGACACCTCCTGGCACAACACCACCTTCTTTGGGGT



TTTTCAAGCACAGTGGGGTGACATGTACCTGTCGGCCATCTGTGAGTACC



AGTTGGAAGAGATCCAGCGGGTGTTTGAGGGCCCCTATAAGGAGTACCA



TGAGGAAGCCCAGAAGTGGGACCGCTACACTGACCCTGTACCCAGCCCTC



GGCCTGGCTCGTGCATTAACAACTGGCATCGGCGCCACGGCTACACCAGC



TCCCTGGAGCTACCCGACAACATCCTCAACTTCGTCAAGAAGCACCCGCT



GATGGAGGAGCAGGTGGGGCCTCGGTGGAGCCGCCCCCTGCTCGTGAAG



AAGGGCACCAACTTCACCCACCTGGTGGCCGACCGGGTTACAGGACTTGA



TGGAGCCACCTATACAGTGCTGTTCATTGGCACAGGAGACGGCTGGCTGC



TCAAGGCTGTGAGCCTGGGGCCCTGGGTTCACCTGATTGAGGAGCTGCAG



CTGTTTGACCAGGAGCCCATGAGAAGCCTGGTGCTATCTCAGAGCAAGAA



GCTGCTCTTTGCCGGCTCCCGCTCTCAGCTGGTGCAGCTGCCCGTGGCCG



ACTGCATGAAGTATCGCTCCTGTGCAGACTGTGTCCTCGCCCGGGACCCC



TATTGCGCCTGGAGCGTCAACACCAGCCGCTGTGTGGCCGTGGGTGGCCA



CTCTGGATCTCTACTGATCCAGCATGTGATGACCTCGGACACTTCAGGCA



TCTGCAACCTCCGTGGCAGTAAGAAAGATGTACAGAAAGGTGTTCTTACA



TGAAGAAGGGTGTGAAGGCTGAACAATCATGGATTTTTCTGATCAATTGT



GCTTTAGGAAATTATTGACAGTTTTGCACAGGTTCTTGAAAACGTTATTTA



TAATGAAATCAACTAAAACTATTTTTGCTATAAGTTCTATAAGGTGCATA



AAACCCTTAAATTCATCTAGTAGCTGTTCCCCCGAACAGGTTTATTTTAGT



AAAAAAAAAAAAACAAAAAACAAAAACAAAAGATTTTTATCAAATGTTA



TGATGCAAAAAAAGAAAAAGAAAAAAAAAAAGAAAAGAAAACTTCAAT



TTTCTGGGTATGCACAAAGACCATGAAGACTTATCCAAGTGCATGACCGG



ATTTTTGTGGTTTTGTTCATTTTGTGTTTAATTTGTGTTTTTTTTTTCCAGCT



GTATGAAATGGGCTTTCTGAAGTTTAAATAGTCCGACTTCACCCATGGTG



TTCTGTGCTTGCAGTGCGAGTGTTGCTGTAATTCAGTGTTGCCGTCAGTGT



CTCTTTTCTTAGCTTTCTGTCTTTCTTTCAACGTAGTGTGAAGTGTCTTATC



CTTTTCTATGAATTCCAATTTGCCTTAACTCTTTTGATGCTGTAGCTGTTTC



AGTAAAAGTTAGTTCAAACTAATGATGTAGAATGCTTTGACCAAATGAGC



TGGTCTATTATGCCTTGTAAAACAGCAGCATAGGGCTTTTAAAAGGTAGT



CAATAAAAGTTGCTGAAATTTTGGCTTTTTTAAATATGTAGTAGGTGTTTT



TAATGATTTTTCACATAATGTGTAAGGTAGTGAAATGCAAGAAGGGAAA



AATGTTTTGTGTGAAACACATTTTCTGACTGGGGAACTTTTATTAGGGTAA



ATTGTTTGTAAGGCTGTACGCCAACAGTTTCCTCTGATAGTTTGACTGATT



TAGGATATCTGCTGTATGATGCAATGTAAAGTCTTTTTTGCCTTTTTTCAG



GAAAAAAAAAAAGCTAACTTGATGTACTAGATTTAGTGTAGGTAGTGTTG



GGGTTGGGGATGGGGGTGGGGGAGGGGAGTCACTGAATGTTTTGTCCTTC



CTTTATACTAATGATAGTGCTTTAGAATGAGAATTATGCCTGAAATCTGG



CAAACCGAAAAATGTTGCTATTGCAACAAAGTGGCAAAAGCTAAAAGTA



AGGATTTATCTTCAAACATAAGCTGAGATAACGAATAGAAGCAAAACGA



TTGGCTACTAGCTCTCTCTCTCTCTCTCTATTAGGTAAATTTGAAAAATAA



AAATGACTTGGCACTTTTAAAGGTAACTTCACCAAAGACCGAAGAGCCA



GTAACCAGTAGCTCCAACTTGTCTCAGCATCACATCTTCTGTGCTCTTTAT



TTTTGCCGGACCAGTTTGCGGTTAGGAGAATGTGCCTTTTTTGTACCTTTG



CATTTAGGTTTTATAATTTTAATTGATGTATGGACACACACAAACAAAAA



AGCATGAAGGAAGATTTGGATCCAAGCAGTGCCACACTTTACATCATCAC



TACAAGTGTTCAAGTGTAAAGAAAACCAATTTTGAAACTATGAAATTCCT



GATTCATAAATACACAGTTATTTCTACTTTAGTACATATAAGATAATTCAC



TGTTATTAAAGCTCTTTTATTAAGGCAATTGCATATGTTTTAAAAGCAATG



GTAAATTAAGTTGTCTTCCAAAACTGTGTACTTGTCTGGTCAGCTGTGTAT



GATCAGTTATCTACCTCAGAGTCTATTTTCTTTTGTGCTGGGACAGGTTGC



TGGCCCTCCCTGTTTCCACAGACCAAATCCTCCTAGCTCAGGAGCTAGGG



CTAAGCAGTTATTTCTTTCAAGTATTTTTTAGTTCTTAAATTTTATGCTTGT



ATTTGATGATAGATGTCAGTGACATTTCATAGTTTCAAAAGTCCTTGCTGC



TCTGAGAAGTGTAGATTCTAGTGAAAATTACATAGTCATAAGAGAAATGT



GTTTTTGTTTTTGTTTTTGTTTCATTTTTTTAAAGTTGTGGTATTATTGGTTC



TATGCTCCCTGGAATATTACTGCTTTGTGAAAGTCCAGACTGAACGCAGC



ACCCTCTGTGTACCTAGTACAGTTATAAACCTGGGTCTCTCACTACTTGAT



ATTTTTGCATTAGTTAAGACAGAAATTTGATAGCTCGGTTAGAGGGGAGG



GGAAATCTGCTGCTAGAAATGTCTGAACTAAGTGCCATACTCGTCTGGGT



AAGATTTGGGAAACATAACCTCTGTACATAAAAAAAAAAAAATCAGTTA



AACATCACATAGTAGACAGCCATTAAATTATAAAAAAATTAATTTATGAA



GAAAGACCTTTTGTACAGATTGAAAAAAAAAGATTTTCATAGAGATATCT



ATATGATCAAGAGAGTTAATTTTTTATTTTTGTTTTACTAGTGCCACAGAC



TTGCCAGTGGTAACTTATTTGTCCGGTTCAAGATAACTCTGTAGTTTTCTT



TCCTAGGACTTGTTGTTAAACGCCAAAAGACATTTTTGAACTGTACATTTG



ATCAGATTGTTAGCTTTTCTGTTTTATTTCTTTTGAGAACCTTTGAATAAA



AAACATCTGAAATTTTA





TRPS1->
TAATAGTGTTGGTGTCTTGAAACTGACGTAATGCGCGGAGACTGAGGTCC
SEQ ID


EIF3H
TGACAAGCGATAACATTTCTGATAAAGACCCGATCTTACTGCAATCTCTA
NO: 4



GCGTCCTCTTTTTTGGTGCTGCTGGTTTCTCCAGACCTCGCGTCCTCTCGA



TTGCTCTCTCGCCTTCCTATTTCTTTTTTTTTTTTTTAAACAAAAAACAACA



CCCCCTCCCCTCTCCCACCCGGCACCGGGCACATCCTTGCTCTATTTCCTT



TCTCTTTCTCTCTCTCTCTCTCTCTCTCTCTTTTTTAATAAGGGTGGGGGAG



GGAAAGGGGGGGGAGGCAGGAAAGACCTTTTTCTCTCCCCCCCGCAATA



ATCCAAGATCAACTCTGCAAACAACAGAAGACGGTTCATGGCTTTGGCCG



CCGCGCCACCATCTTTCGGGCTGCCGAGGGTGTTCTTGACGATTAATCAA



CAGTCCAATATCAGATGGAAATGATGCGGAGCCTTCGCCATGTAAACATT



GATCATCTTCACGTGGGCTGGTATCAGTCCACATACTATGGCTCATTCGTT



ACCCGGGCACTCCTGGACTCTCAGTTTAGTTACCAGCATGCCATTGAAGA



ATCTGTCGTTCTCATTTATGATCCCATAAAAACTGCCCAAGGATCTCTCTC



ACTAAAGGCATACAGACTGACTCCTAAACTGATGGAAGTTTGTAAAGAA



AAGGATTTTTCCCCTGAAGCATTGAAAAAAGCAAATATCACCTTTGAGTA



CATGTTTGAAGAAGTGCCGATTGTAATTAAAAATTCACATCTGATCAATG



TCCTAATGTGGGAACTTGAAAAGAAGTCAGCTGTTGCAGATAAACATGA



ATTGCTCAGCCTTGCCAGCAGCAATCATTTGGGGAAGAATCTACAGTTGC



TGATGGACAGAGTGGATGAAATGAGCCAAGATATAGTTAAATACAACAC



ATACATGAGGAATACTAGTAAACAACAGCAGCAGAAACATCAGTATCAG



CAGCGTCGCCAGCAGGAGAATATGCAGCGCCAGAGCCGAGGAGAACCCC



CGCTCCCTGAGGAGGACCTGTCCAAACTCTTCAAACCACCACAGCCGCCT



GCCAGGATGGACTCGCTGCTCATTGCAGGCCAGATAAACACTTACTGCCA



GAACATCAAGGAGTTCACTGCCCAAAACTTAGGCAAGCTCTTCATGGCCC



AGGCTCTTCAAGAATACAACAACTAAGAAAAGGAAGTTTCCAGAAAAGA



AGTTAACATGAACTCTTGAAGTCACACCAGGGCAACTCTTGGAAGAAATA



TATTTGCATATTGAAAAGCACAGAGGATTTCTTTAGTGTCATTGCCGATTT



TGGCTATAACAGTGTCTTTCTAGCCATAATAAAATAAAACAAAATCTTGA



CTGCTTGCTCATTTGA





UCK2->
ATGGCCGGGGACAGCGAGCAGACCCTGCAGAACCACCAGCAGCCCAACG
SEQ ID


TMCO1
GCGGCGAGCCCTTCCTTATAGGCGTCAGCGGGGGAACAGCTAGCGGCAA
NO: 5



GAACATTCAGAAGATTCTCGGCCTTGCCCCTTCACGAGCCGCCACCAAGC



AGGCAGGTGGATTTCTTGGCCCACCACCTCCTTCTGGGAAGTTCTCTTGA





RTN3->
ATGGCGGAGCCGTCGGCGGCCACTCAGTCCCATTCCATCTCCTCGTCGTC
SEQ ID


ANK1
CTTCGGAGCCGAGCCGTCCGCGCCCGGCGGCGGCGGGAGCCCAGGAGCC
NO: 6



TGCCCCGCCCTGGGGACGAAGAGCTGCAGCTCCTCCTGTGCGGAATGGGT



TGAATGGCTTGCATCTGGCTTCTAAGGAAGGCCATGTGAAAATGGTGGTT



GAACTTCTGCACAAAGAAATCATTCTAGAAACGACAACCAAGAAGGGGA



ACACGGCCCTGCACATCGCTGCTCTAGCCGGGCAGGATGAGGTGGTCCGG



GAGCTTGTCAACTATGGAGCCAACGTCAACGCCCAGTCACAGAAAGGTTT



TACACCCCTGTACATGGCAGCACAAGAGAACCACTTGGAAGTGGTTAAGT



TTTTACTGGAAAATGGAGCTAACCAGAATGTAGCCACAGAAGACGGCTTC



ACGCCTCTGGCGGTAGCCCTGCAGCAGGGCCATGAGAACGTCGTCGCGC



ACCTCATCAACTACGGCACCAAGGGGAAGGTGCGCCTCCCGGCCCTGCAC



ATCGCGGCCCGCAACGACGACACGCGCACGGCTGCGGTGCTGCTGCAGA



ACGACCCCAACCCGGACGTGCTTTCCAAGACGGGATTCACGCCCCTGCAC



ATTGCGGCTCACTACGAGAACCTCAACGTGGCCCAGTTGCTCCTCAACAG



AGGAGCCAGCGTCAATTTCACACCACAGAACGGCATCACGCCACTGCAC



ATCGCCTCCCGCAGGGGCAACGTGATCATGGTGCGGCTGCTGCTGGATCG



GGGAGCCCAGATAGAAACCAAGACCAAGGACGAATTGACACCTCTCCAC



TGTGCAGCTCGAAATGGGCACGTGCGAATCTCAGAGATCCTGCTGGACCA



CGGGGCACCAATCCAAGCCAAAACCAAGAACGGCCTGTCCCCAATTCAC



ATGGCGGCTCAGGGAGACCACCTCGACTGTGTCCGGCTCCTGTTGCAATA



CGACGCAGAGATAGACGACATCACCCTGGACCACCTGACCCCACTCCAC



GTGGCTGCCCACTGTGGACACCACAGGGTGGCTAAGGTCCTTCTGGATAA



AGGGGCCAAACCCAACTCCAGAGCCCTGAATGGCTTTACCCCCTTACACA



TCGCCTGCAAAAAGAACCACGTCCGTGTCATGGAGCTGCTGCTGAAGACG



GGAGCCTCGATCGACGCGGTCACCGAGTCTGGCCTGACACCTCTCCACGT



GGCCTCCTTCATGGGGCACCTTCCCATCGTGAAGAACCTCCTGCAGCGGG



GGGCGTCGCCCAACGTCTCCAACGTGAAAGTGGAGACCCCGCTACACAT



GGCAGCCAGAGCCGGGCACACGGAAGTGGCCAAATATTTACTCCAGAAC



AAAGCCAAAGTCAATGCCAAGGCCAAGGATGACCAGACCCCACTTCACT



GTGCAGCTCGCATCGGCCACACAAACATGGTGAAGCTCCTGCTGGAAAAT



AACGCCAACCCCAACCTGGCCACCACCGCCGGGCACACCCCCCTGCACAT



TGCAGCCCGTGAGGGCCATGTGGAAACAGTCCTGGCCCTTCTGGAAAAG



GAAGCATCCCAGGCCTGCATGACCAAGAAAGGATTTACCCCTCTGCACGT



GGCGGCCAAGTACGGGAAGGTGCGGGTGGCAGAGCTGCTGCTGGAGCGG



GACGCACACCCGAATGCTGCCGGAAAAAATGGCCTGACCCCCCTGCACG



TGGCCGTCCATCACAACAACCTGGACATCGTCAAGCTGCTGCTTCCCCGG



GGCGGCTCCCCGCACAGCCCTGCCTGGAATGGCTACACCCCTTTGCACAT



CGCTGCCAAGCAGAACCAGGTGGAGGTGGCCCGTAGTCTGCTGCAGTAT



GGGGGCTCAGCAAACGCCGAGTCGGTGCAAGGTGTGACGCCCCTTCACCT



GGCCGCCCAGGAGGGCCACGCAGAGATGGTGGCTCTGCTGCTCTCGAAA



CAAGCCAATGGCAACCTGGGGAACAAGAGCGGACTCACTCCCCTCCATCT



GGTAGCACAAGAAGGCCACGTTCCAGTGGCAGATGTGCTGATCAAACAC



GGCGTCATGGTGGACGCCACCACCCGGATGGGCTACACTCCCCTCCATGT



GGCCAGTCACTATGGAAACATCAAGCTGGTGAAGTTTCTGCTGCAGCACC



AGGCAGATGTCAATGCCAAGACCAAGCTAGGATACAGCCCCCTGCACCA



GGCAGCCCAGCAGGGACACACAGACATCGTGACTCTGCTTCTGAAAAAC



GGTGCTTCCCCAAACGAGGTCAGCTCGGATGGAACCACACCTCTGGCCAT



AGCCAAGCGCTTGGGCTACATTTCTGTCACCGACGTGCTCAAGGTCGTCA



CGGATGAAACCAGTTTCGTGTTAGTCAGTGATAAGCATCGAATGAGTTTC



CCTGAGACAGTTGATGAGATCCTGGATGTCTCGGAAGATGAAGGGGAAG



AACTCATCAGCTTCAAGGCTGAGAGGCGGGATTCCAGGGATGTTGATGA



AGAGAAGGAGCTGCTGGATTTTGTGCCGAAGCTAGACCAAGTGGTGGAA



TCTCCAGCCATCCCCAGGATTCCCTGTGCCATGCCTGAGACAGTGGTGAT



CAGGTCAGAAGAGCAGGAGCAGGCATCTAAAGAGTATGATGAGGACTCC



CTCATCCCCAGCAGCCCGGCCACCGAGACCTCAGACAACATCAGCCCGGT



GGCCAGCCCGGTGCATACAGGGTTTCTGGTGAGCTTCATGGTTGACGCCC



GGGGTGGTTCCATGAGAGGAAGTCGCCACAACGGCCTGCGAGTGGTGAT



CCCGCCACGGACGTGCGCAGCGCCCACCCGCATCACCTGCCGCCTGGTCA



AGCCCCAGAAGCTCAGCACGCCGCCCCCACTGGCCGAGGAGGAGGGCCT



GGCCAGCAGGATCATAGCACTGGGGCCCACGGGGGCACAGTTCCTGAGC



CCTGTAATCGTGGAGATCCCGCACTTTGCCTCCCATGGCCGTGGAGACCG



CGAGCTCGTGGTTCTGAGGAGCGAAAACGGCTCCGTGTGGAAGGAGCAC



AGGAGCCGCTATGGAGAGAGCTACCTGGATCAGATCCTCAACGGGATGG



ACGAAGAGCTGGGGAGCCTGGAGGAGCTAGAGAAGAAGAGGGTGTGCC



GAATCATCACCACCGACTTCCCGCTGTACTTCGTGATCATGTCACGGCTCT



GCCAGGACTACGACACCATCGGTCCCGAAGGGGGCTCCCTGAAGAGCAA



GCTGGTGCCCCTGGTACAGGCAACGTTCCCGGAGAATGCCGTCACCAAGA



GAGTGAAGCTGGCTCTGCAGGCCCAGCCTGTCCCGGATGAGCTTGTCACT



AAGCTCCTGGGCAACCAGGCCACATTCAGCCCCATTGTCACCGTGGAGCC



CCGGCGCCGGAAGTTCCACCGCCCCATTGGGCTTCGGATCCCACTACCTC



CTTCCTGGACCGACAACCCGAGGGACAGCGGGGAGGGAGACACCACCAG



CCTGCGCCTGCTTTGCAGCGTCATTGGAGGAACAGACCAAGCCCAGTGGG



AAGACATAACAGGAACCACCAAACTTGTATATGCCAACGAGTGCGCCAA



CTTCACCACCAATGTCTCTGCCAGGTTTTGGCTGTCGGACTGTCCTCGGAC



TGCTGAGGCTGTGAACTTTGCCACCCTGCTGTACAAAGAGCTCACTGCAG



TGCCCTACATGGCCAAATTCGTCATCTTTGCCAAGATGAATGACCCCCGA



GAGGGGCGCCTGCGCTGCTACTGCATGACAGATGATAAAGTGGACAAGA



CCCTGGAGCAGCATGAGAACTTCGTGGAGGTGGCCCGGAGCAGGGACAT



AGAGGTGTTGGAAGGAATGTCCCTGTTTGCAGAACTCTCTGGGAACCTGG



TGCCTGTGAAGAAAGCTGCCCAGCAGCGGAGCTTCCACTTCCAGTCATTT



CGGGAGAACCGTCTGGCCATGCCTGTAAAGGTGAGGGACAGCAGTCGAG



AGCCGGGAGGGTCCCTGTCGTTTCTGCGCAAGGCGATGAAGTACGAGGA



CACCCAGCACATTCTCTGCCACCTGAACATCACCATGCCCCCCTGCGCCA



AGGGAAGTGGAGCCGAAGATAGGAGAAGGACCCCGACGCCCCTGGCCCT



GCGATACAGCATTCTCAGTGAGTCCACACCAGGTTCTCTCAGTGGGACAG



AGCAGGCAGAGATGAAGATGGCTGTTATCTCAGAGCACCTCGGTCTCAGC



TGGGCAGAGTTGGCCCGGGAGCTGCAGTTCAGTGTGGAAGACATCAACA



GGATCCGAGTGGAAAATCCCAACTCCCTGTTGGAGCAGAGTGTGGCCTTG



CTGAACCTCTGGGTCATCCGTGAAGGCCAAAACGCAAACATGGAGAATC



TGTACACAGCCCTGCAGAGCATTGACCGTGGCGAGATCGTGAACATGCTG



GAGGGTTCCGGCCGACAGAGCCGCAACTTGAAGCCAGACAGGCGGCACA



CCGACCGCGACTACTCGCTGTCACCCTCCCAGATGAATGGTTACTCCTCA



CTGCAGGACGAGCTGCTGTCCCCTGCCTCCCTGGGCTGTGCACTTTCCTCT



CCGCTACGTGCAGACCAGTACTGGAATGAGGTGGCCGTCCTAGACGCCAT



CCCCTTGGCGGCCACGGAGCATGACACCATGCTGGAGATGTCTGACATGC



AGGTGTGGTCTGCGGGCCTCACGCCTTCTCTGGTCACTGCTGAGGACTCC



TCTCTGGAGTGTAGCAAGGCTGAGGACTCTGATGCCACAGGTCACGAGTG



GAAGTTGGAGGGGGCACTCTCAGAGGAACCGCGGGGCCCCGAGTTGGGC



TCTCTGGAACTTGTGGAGGACGACACAGTGGATTCAGATGCCACAAATGG



CCTTATCGATTTGCTTGAACAGGAGGAAGGTCAGAGGTCAGAAGAGAAG



CTGCCAGGTTCTAAGAGGCAGGATGACGCGACAGGTGCAGGGCAGGACT



CAGAGAATGAAGTGTCTCTTGTTTCAGGCCATCAGAGGGGGCAAGCCCG



AATCACACATTCCCCCACCGTGAGTCAGGTGACGGAGAGGAGTCAGGAC



AGACTGCAGGACTGGGATGCAGACGGCTCGATTGTCTCATACCTGCAAGA



TGCTGCACAAGGTTCCTGGCAAGAGGAGGTCACGCAAGGTCCACACTCAT



TCCAGGGAACAAGTACCATGACTGAAGGGCTAGAGCCCGGTGGATCTCA



GGAGTACGAGAAGGTCCTGGTGTCTGTAAGTGAGCACACGTGGACAGAA



CAGCCCGAGGCTGAGAGCTCCCAGGCCGACAGGGACCGGAGGCAGCAAG



GCCAAGAAGAGCAGGTGCAGGAGGCCAAGAACACCTTCACCCAAGTGGT



GCAGGGGAATGAGTTTCAGAATATTCCAGGGGAGCAGGTGACAGAGGAG



CAATTCACGGATGAGCAGGGCAACATTGTCACCAAGAAGATCATTCGCA



AGGTGGTTCGACAGATAGACTTGTCCAGCGCCGATGCCGCCCAGGAGCA



CGAGGAGGTGGAGCTGAGAGGGAGTGGCCTACAGCCGGACCTGATAGAG



GGCAGGAAGGGGGCGCAGATAGTGAAGCGGGCCAGCCTGAAAAGGGGG



AAACAGTGA





ACACA->
ATGTGGTGGTCTACTCTGATGTCAATCTTGAGGGCTAGGTCTTTCTGGAA
SEQ ID


MSI2
GTGGATATCTACTCAGACAGTAAGAATTATAAGAGCTGTAAGAGCTCATT
NO: 7



TTGGAGGAATAATGGATGAACCATCTCCCTTGGCCCAACCTCTGGAGCTG



AACCAGCACTCTCGATTCATAATAGGTTCTGTGTCTGAAGATAACTCAGA



GGATGAGATCAGCAACCTGGTGAAGTTGGACCTACTGGAGGAGAAGGAG



GGCTCCTTGTCACCTGCTTCTGTTGGCTCAGATACACTCTCTGATTTGGGG



ATCTCTAGCCTACAGGATGGCTTGGCCTTGCACATAAGGTCCAGCATGTC



TGGCTTGCACCTAGTAAAGCAGGGCCGAGACAGAAAGAAAATAGATTCT



CAACGAGATTTCACTGTGGCTTCTCCAGCAGAATTTGTTACTCGCTTTGGG



GGAAATAAAGTGATTGAGAAGGTTCTTATTGCTAACAATGGCATTGCAGC



AGTGAAATGCATGCGGTCTATCCGTAGGTGGTCTTATGAAATGTTTCGAA



ATGAACGTGCAATTAGATTCGTTGTCATGGTCACACCTGAAGACCTTAAA



GCCAATGCAGAATACATTAAGATGGCAGATCACTATGTGCCAGTGCCTGG



AGGACCAAACAACAACAACTATGCAAATGTGGAATTAATTCTTGATATTG



CTAAAAGGATCCCAGTACAAGCAGTGTGGGCTGGCTGGGGTCATGCTTCT



GAGAATCCCAAACTACCGGAACTTCTCTTGAAAAATGGCATTGCCTTCAT



GGGTCCTCCAAGCCAGGCCATGTGGGCTTTAGGGGATAAGATTGCATCTT



CCATAGTGGCTCAAACTGCAGGTATCCCAACTCTTCCCTGGAGCGGCAGT



GGTCTTCGTGTGGACTGGCAGGAAAATGATTTTTCAAAACGTATCTTAAA



TGTTCCCCAGGAGCTATATGAAAAAGGTTATGTGAAAGATGTGGATGATG



GGCTACAGGCAGCTGAGGAAGTTGGATATCCAGTAATGATCAAGGCCTC



AGAGGGAGGAGGAGGGAAGGGAATTAGAAAAGTCAACAATGCAGATGA



CTTCCCTAATCTCTTCAGACAGGTTCAAGCTGAAGTTCCTGGATCTCCCAT



ATTTGTGATGAGACTAGCCAAACAATCTCGTCATCTGGAGGTGCAGATCT



TAGCGGACCAATATGGCAATGCTATCTCTTTGTTTGGTCGTGATTGCTCTG



TACAACGCAGGCATCAGAAGATTATTGAAGAAGCACCTGCTACTATTGCT



ACTCCAGCAGTATTTGAACACATGGAACAGTGTGCGGTGAAACTTGCCAA



AATGGTGGGTTATGTGAGTGCTGGGACTGTGGAATACCTGTACAGCCAGG



ATGGCAGCTTCTACTTTCTGGAATTGAATCCTCGGCTGCAGGTAGAGCAC



CCTTGTACAGAGATGGTGGCTGATGTCAATCTCCCTGCAGCACAGCTCCA



GATTGCCATGGGGATTCCTCTATATAGAATCAAGGATATCCGTATGATGT



ATGGGGTATCTCCCTGGGGTGATTCTCCCATTGATTTTGAAGATTCTGCAC



ACGTTCCTTGTCCAAGGGGCCATGTTATTGCTGCTCGGATCACTAGTGAA



AATCCAGATGAGGGTTTTAAGCCCAGCTCAGGAACAGTTCAGGAGCTAA



ATTTCCGCAGCAATAAGAATGTTTGGGGATATTTCAGTGTTGCTGCTGCA



GGGGGACTTCATGAATTTGCTGATTCTCAGTTTGGTCACTGCTTTTCTTGG



GGAGAAAACAGAGAAGAGGCAATTTCAAACATGGTGGTGGCTTTGAAGG



AGCTGTCTATTCGGGGTGACTTTCGAACTACAGTTGAATACCTGATCAAA



TTGTTAGAGACTGAAAGCTTTCAGATGAACAGAATTGATACTGGCTGGCT



GGACAGACTGATAGCAGAAAAAGTACAGGCTGAGCGACCTGACACCATG



TTGGGGGTTGTGTGTGGTGCCCTCCACGTGGCAGATGTGAGCCTGCGGAA



TAGCGTCTCTAACTTCCTTCACTCCTTAGAAAGGGGTCAAGTCCTTCCTGC



TCATACACTTCTGAATACAGTAGATGTTGAACTTATCTATGAGGGAGTCA



AGTATGTACTTAAGGTGACTCGACAGTCCCCCAACTCCTATGTGGTGATC



ATGAATGGCTCATGTGTAGAAGTAGATGTACATCGGCTGAGTGACGGTGG



ACTGCTCTTGTCCTATGATGGCAGCAGTTATACTACGTATATGAAAGAGG



AAGTGGATAGATATCGCATCACAATTGGCAATAAAACCTGTGTGTTTGAG



AAGGAAAATGACCCATCGGTGATGCGCTCACCTTCTGCTGGGAAGTTAAT



CCAGTACATTGTAGAAGATGGAGGTCATGTGTTTGCCGGCCAGTGCTATG



CTGAGATTGAGGTAATGAAGATGGTAATGACCTTAACAGCTGTGGAGTCT



GGCTGTATCCATTACGTCAAGCGACCTGGAGCAGCTCTTGACCCTGGCTG



TGTACTAGCCAAAATGCAACTGGACAACCCCAGCAAGGTTCAGCAGGCT



GAACTTCACACAGGTAGTCTGCCACGGATCCAGAGCACGGCACTCAGAG



GCGAGAAACTCCATCGAGTGTTCCATTATGTCCTGGATAATCTGGTCAAT



GTAATGAATGGATACTGCCTTCCAGATCCTTTCTTTAGCAGCAAGGTAAA



AGACTGGGTAGAGCGATTGATGAAAACCCTCAGAGATCCCTCCCTGCCTC



TCCTAGAATTGCAAGATATTATGACCAGTGTGTCTGGCCGCATTCCCCCC



AATGTGGAGAAGTCTATCAAGAAGGAAATGGCTCAGTATGCTAGCAACA



TCACATCAGTCCTCTGTCAGTTTCCCAGCCAGCAGATTGCAAACATCCTA



GATAGCCATGCAGCTACATTGAACCGGAAATCTGAACGGGAAGTCTTCTT



TATGAATACTCAGAGCATTGTTCAGCTGGTACAGAGGTACCGAAGTGGCA



TCCGAGGCCACATGAAGGCTGTGGTGATGGATCTGCTCCGGCAGTACCTG



CGAGTAGAGACACAATTCCAGAATGGTCACTATGACAAATGTGTATTCGC



CCTCCGAGAAGAGAATAAAAGTGACATGAACACTGTACTGAACTACATC



TTCTCTCACGCTCAAGTCACCAAGAAGAATCTTCTGGTCACAATGCTTATT



GATCAGTTGTGTGGCCGGGACCCTACTCTCACTGATGAGCTGCTGAATAT



TCTCACAGAGCTAACTCAACTCAGTAAGACCACCAATGCCAAAGTAGCAC



TTCGAGCACGCCAGGTTCTTATTGCCTCCCATTTGCCATCATATGAGCTTC



GCCATAACCAAGTAGAGTCTATCTTCCTATCAGCTATTGACATGTATGGA



CATCAATTTTGCATTGAGAACCTGCAGAAACTCATCCTATCAGAAACATC



TATTTTTGATGTCCTACCAAACTTCTTCTATCACAGCAACCAAGTAGTGAG



GATGGCAGCTCTGGAGGTGTATGTTCGAAGGGCTTATATTGCCTATGAAC



TTAACAGCGTACAACACCGCCAGCTTAAGGACAACACCTGTGTGGTGGA



ATTCCAGTTCATGCTGCCCACATCTCATCCAAACAGAGGGAACATCCCTA



CGCTAAACAGAATGTCCTTCTCCTCCAACCTCAACCACTATGGCATGACC



CATGTAGCTAGTGTCAGCGATGTACTGTTGGACAACTCATTCACTCCACC



TTGTCAGCGGATGGGCGGAATGGTCTCTTTTCGGACTTTTGAAGATTTTGT



CAGGATCTTTGATGAAGTGATGGGCTGCTTCTCTGACTCCCCACCCCAGA



GTCCCACATTCCCTGAGGCAGGTCACACGTCTCTTTATGATGAGGATAAG



GTTCCCAGGGATGAACCAATTCACATTCTCAATGTGGCTATCAAGACTGA



CTGTGATATTGAGGATGACAGGCTGGCAGCTATGTTCAGAGAATTTACCC



AGCAAAATAAAGCTACCCTGGTTGACCATGGGATCCGGCGCCTTACTTTC



CTGGTTGCACAAAAGGATTTCAGAAAGCAGGTCAACTATGAGGTGGATC



GGAGATTTCATAGAGAATTCCCTAAATTTTTTACATTCCGAGCAAGGGAT



AAGTTTGAGGAGGATCGTATCTATCGTCATCTGGAGCCTGCTCTGGCTTTC



CAGTTAGAGCTGAACCGGATGAGAAATTTTGACCTCACTGCCATTCCATG



TGCTAATCACAAGATGCACCTGTATCTCGGGGCAGCCAAGGTGGAAGTG



GGCACAGAAGTGACAGACTACAGGTTCTTTGTTCGTGCAATCATCAGGCA



TTCTGATCTGGTCACCAAGGAAGCTTCTTTTGAATATCTGCAAAATGAAG



GGGAGCGGCTACTCCTGGAAGCCATGGATGAGTTGGAAGTTGCTTTTAAC



AATACAAATGTCCGCACTGACTGTAACCACATCTTCCTCAACTTTGTGCCC



ACGGTTATCATGGACCCATCAAAGATTGAGGAATCCGTGCGGAGCATGGT



AATGCGGTATGGAAGTCGCCTGTGGAAATTGCGCGTCCTCCAGGCAGAAC



TGAAAATCAACATTCGCCTGACGCCAACTGGAAAAGCAATTCCCATCCGC



CTCTTCCTGACAAACGAGTCTGGCTATTACTTGGATATCAGCCTATACAA



GGAAGTGACTGACTCCAGGACAGCACAGATCATGTTTCAGGCATATGGA



GACAAACAGGGACCACTGCATGGAATGTTAATCAATACTCCATATGTGAC



CAAAGACCTGCTGCAATCAAAGAGGTTCCAGGCACAATCCTTAGGGACA



ACATACATATATGATATCCCAGAGATGTTTCGGCAGTCCCTGATCAAACT



CTGGGAGTCTATGTCCACTCAAGCATTTCTTCCATCTCCCCCTCTGCCTTC



TGACATGCTGACTTACACTGAACTGGTACTGGATGATCAAGGTCAGCTGG



TCCACATGAACAGGCTTCCAGGAGGAAATGAGATGGTCACAAGAACAAA



GAAAATATTTGTAGGCGGGTTATCTGCGAACACAGTAGTGGAAGATGTA



AAGCAATATTTCGAGCAGTTTGGCAAGGTGGAAGATGCAATGCTGATGTT



TGATAAAACTACCAACAGGCACAGAGGGTTTGGCTTTGTCACTTTTGAGA



ATGAAGATGTTGTGGAGAAAGTCTGTGAGATTCATTTCCATGAAATCAAT



AATAAAATGGTAGAATGTAAGAAAGCTCAGCCGAAAGAAGTCATGTTCC



CACCTGGGACAAGAGGCCGGGCCCGGGGACTGCCTTACACCATGGACGC



GTTCATGCTTGGCATGGGGATGCTGGGATATCCCAACTTCGTGGCGACCT



ATGGCCGTGGCTACCCCGGATTTGCTCCAAGCTATGGCTATCAGTTCCCA



GGCTTCCCAGCAGCGGCTTATGGACCAGTGGCAGCAGCGGCGGTGGCGG



CAGCAAGAGGATCAGGCTCCAACCCGGCGCGGCCCGGAGGCTTCCCGGG



GGCCAACAGCCCAGGACCTGTCGCCGATCTCTACGGCCCTGCCAGCCAGG



ACTCCGGAGTGGGGAATTACATAAGTGCGGCCAGCCCACAGCCGGGCTC



GGGCTTCGGCCACGGCATAGCTGGACCTTTGATTGCAACGGCCTTTACAA



ATGGATACCATTGA





DDX5->
ATGTCGGGTTATTCGAGTGACCGAGACCGCGGCCGGGACCGAGGGTTTG
SEQ ID


IQCG
GTGCACCTCGATTTGGAGGAAGTAGGGCAGGGCCCTTATCTGGAAAGAA
NO: 8



GTTTGGAAACCCTGGGGAGAAATTAGTTAAAAAGAAGTGGAATCTTGAT



GAGCTGCCTAAATTTGAGAAGAATTTTTATCAAGAGCACCCTGATTTGGC



TAGGCGCACAGCACAAGAGGTGGAAACATACAGAAGAAGCAAGGAAAT



TACAGTTAGAGGTCACAACTGCCCGAAGCCAGTTCTAAATTTTTATGAAG



CCAATTTCCCTGCAAATGTCATGGATGTTATTGCAAGACAGAATTTCACT



GAACCCACTGCTATTCAAGCTCAGGGATGGCCAGTTGCTCTAAGTGGATT



GGATATGGTTGGAGTGGCACAGACTGGATCTGGGAAAACATTGTCTTATT



TGCTTCCTGCCATTGTCCACATCAATCATCAGCCATTCCTAGAGAGAGGC



GATGGGCCTATTTGTTTGGTGCTGGCACCAACTCGGGAACTGGCCCAACA



GGTGCAGCAAGTAGCTGCTGAATATTGTAGAGCATGTCGCTTGAAGTCTA



CTTGTATCTACGGTGGTGCTCCTAAGGGACCACAAATACGTGATTTGGAG



AGAGGTGTGGAAATCTGTATTGCAACACCTGGAAGACTGATTGACTTTTT



AGAGTGTGGAAAAACCAATCTGAGAAGAACAACCTACCTTGTCCTTGATG



AAGCAGATAGAATGCTTGATATGGGCTTTGAACCCCAAATAAGGAAGAT



TGTGGATCAAATAAGACCTGATAGGCAAACTCTAATGTGGAGTGCGACTT



GGCCAAAAGAAGTAAGACAGCTTGCTGAAGATTTCCTGAAAGACTATATT



CATATAAACATTGGTGCACTTGAACTGAGTGCAAACCACAACATTCTTCA



GATTGTGGATGTGTGTCATGACGTAGAAAAGGATGAAAAACTTATTCGTC



TAATGGAAGAGATCATGAGTGAGAAGGAGAATAAAACCATTGTTTTTGT



GGAAACCAAAAGAAGATGTGATGAGCTTACCAGAAAAATGAGGAGAGAT



GGGTGGCCTGCCATGGGTATCCATGGTGACAAGAGTCAACAAGAGCGTG



ACTGGGTTCTAAATGAATTCAAACATGGAAAAGCTCCTATTCTGATTGCT



ACAGATGTGGCCTCCAGAGGGCTAGATGTGGAAGATGTGAAATTTGTCAT



CAATTATGACTACCCTAACTCCTCAGAGGATTATATTCATCGAATTGGAA



GAACTGCTCGCAGTACCAAAACAGGCACAGCATACACTTTCTTTACACCT



AATAACATAAAGCAAGTGAGCGACCTTATCTCTGTGCTTCGTGAAGCTAA



TCAAGCAATTAATCCCAAGTTGCTTCAGTTGGTCGAAGACAGAGGTTCAG



AGTCAGAATGAGTATATTGCTAACCTCAAGGACCAACTGCAAGAGATGA



AGGCAAAATCCAACTTGGAGAATCGCTACATGAAAACCAATACCGAGCT



GCAGATTGCCCAGACCCAGAAAAAGTGTAACAGAACAGAGGAACTCTTG



GTGGAAGAGATTGAGAAACTCAGGATGAAAACCGAAGAAGAGGCCCGG



ACTCATACAGAGATTGAAATGTTCCTTAGAAAGGAGCAGCAGAAACTTG



AGGAGAGGCTGGAGTTCTGGATGGAGAAATACGATAAGGACACAGAAAT



GAAACAGAATGAACTAAATGCTCTCAAAGCCACAAAGGCCAGTGACTTA



GCACACCTTCAAGACCTGGCAAAGATGATAAGAGAGTATGAACAGGTCA



TCATTGAAGATCGTATAGAAAAGGAGAGGAGCAAGAAGAAGGTAAAAC



AGGATCTCTTGGAATTAAAGAGCGTTATAAAGCTCCAGGCCTGGTGGCGA



GGCACTATGATACGGAGAGAAATTGGTGGTTTCAAGATGCCTAAAGACA



AAGTTGATAGCAAGGATTCAAAAGGCAAAGGTAAAGGCAAGGATAAGA



GGAGAGGCAAGAAGAAGTGA





PREX1->
ATGGAGGCGCCCAGCGGCAGCGAGCCCGGCGGCGACGGGGCCGGGGACT
SEQ ID


SLC9A8
GCGCCCACCCGGACCCCCGGGCCCCTGGCGCCGCGGCGCCCAGCTCCGG
NO: 9



CCCCGGCCCGTGCGCGGCCGCCCGGGAGTCCGAGCGCCAGCTGCGCCTCC



GCCTCTGCGTCCTCAACGAGATCTTGGGCACCGAGAGGGACTACGTGGGC



ACCTTGCGCTTCTTGCAGTCGGCATTCCTGCATCGCATCCGGCAGAACGT



GGCCGACTCAGTGGAGAAGGGCCTCACGGAGGAGAATGTCAAGGTCCTG



TTCTCGAACATCGAAGACATCCTGGAAGTTCATAAGGATTTCTTGGCCGC



CTTGGAGTATTGTTTACACCCGGAGCCGCAGTCTCAGCATGAACTTGGGA



ATGTTTTCTTAAAATTCAAGGACAAGTTCTGCGTGTACGAGGAGTATTGC



AGCAACCATGAGAAAGCCCTGAGGCTGCTGGTGGAGCTGAACAAGATCC



CTACCGTGCGCGCCTTCCTTTTGAGCTGCATGCTTCTGGGAGGCCGGAAG



ACCACGGACATCCCTTTGGAAGGCTACCTGTTGTCTCCGATCCAGAGGAT



CTGCAAGTACCCGCTCCTCCTTAAGGAGCTGGCCAAGAGGACTCCCGGCA



AGCACCCAGACCACCCCGCGGTCCAGAGTGCCCTGCAGGCCATGAAGAC



CGTTTGCTCCAACATCAATGAGACCAAGCGGCAGATGGAGAAGCTGGAA



GCCCTGGAGCAGCTGCAGTCCCACATCGAAGGCTGGGAGGAGGTTCCCC



AATACAACTCATGAGGGTTTCAATGTCACCCTCCACACCACCCTGGTTGT



CACGACGAAACTGGTGCTCCCGACCCCTGGCAAGCCCATCCTCCCCGTGC



AGACAGGGGAGCAGGCCCAGCAAGAGGAGCAGTCCAGCGGCATGACCAT



TTTCTTCAGCCTCCTTGTCCTAGCTATCTGCATCATATTGGTGCATTTACTG



ATCCGATACAGATTACATTTCTTGCCAGAGAGTGTTGCTGTTGTTTCTTTA



GGTATTCTCATGGGAGCAGTTATAAAAATTATAGAGTTTAAAAAACTGGC



GAATTGGAAGGAAGAAGAAATGTTTCGTCCAAACATGTTTTTCCTCCTCC



TGCTTCCCCCTATTATCTTTGAGTCTGGATATTCATTACACAAGGGTAACT



TCTTTCAAAATATTGGTTCCATCACCCTGTTTGCTGTTTTTGGGACGGCAA



TCTCCGCTTTTGTAGTAGGTGGAGGAATTTATTTTCTGGGTCAGGCTGATG



TAATCTCTAAACTCAACATGACAGACAGTTTTGCGTTTGGCTCCCTAATAT



CTGCTGTCGATCCAGTGGCCACTATTGCCATTTTCAATGCACTTCATGTGG



ACCCCGTGCTCAACATGCTGGTCTTTGGAGAAAGTATTCTCAACGATGCA



GTCTCCATTGTTCTGACCAACACAGCTGAAGGTTTAACAAGAAAAAATAT



GTCAGATGTCAGTGGGTGGCAAACATTTTTACAAGCCCTTGACTACTTCC



TCAAAATGTTCTTTGGCTCTGCAGCGCTCGGCACTCTCACTGGCTTAATTT



CTGCATTAGTGCTGAAGCATATTGACTTGAGGAAAACGCCTTCCTTGGAG



TTTGGCATGATGATCATTTTTGCTTATCTGCCTTATGGGCTTGCAGAAGGA



ATCTCACTCTCAGGCATCATGGCCATCCTTTTCTCAGGCATCGTGATGTCC



CACTACACGCACCATAACCTCTCCCCAGTCACCCAGATCCTCATGCAGCA



GACCCTCCGCACCGTGGCCTTCTTATGTGAAACATGTGTGTTTGCATTTCT



TGGCCTGTCCATTTTTAGTTTTCCTCACAAGTTTGAAATTTCCTTTGTCATC



TGGTGCATAGTGCTTGTACTATTTGGCAGAGCGGTAAACATTTTCCCTCTT



TCCTACCTCCTGAATTTCTTCCGGGATCATAAAATCACACCGAAGATGAT



GTTCATCATGTGGTTTAGTGGCCTGCGGGGAGCCATCCCCTATGCCCTGA



GCCTACACCTGGACCTGGAGCCCATGGAGAAGCGGCAGCTCATCGGCAC



CACCACCATCGTCATCGTGCTCTTCACCATCCTGCTGCTGGGCGGCAGCA



CCATGCCCCTCATTCGCCTCATGGACATCGAGGACGCCAAGGCACACCGC



AGGAACAAGAAGGACGTCAACCTCAGCAAGACTGAGAAGATGGGCAAC



ACTGTGGAGTCGGAGCACCTGTCGGAGCTCACGGAGGAGGAGTACGAGG



CCCACTACATCAGGCGGCAGGACCTTAAGGGCTTCGTGTGGCTGGACGCC



AAGTACCTGAACCCCTTCTTCACTCGGAGGCTGACGCAGGAGGACCTGCA



CCACGGGCGCATCCAGATGAAAACTCTCACCAACAAGTGGTACGAGGAG



GTACGCCAGGGCCCCTCCGGCTCCGAGGACGACGAGCAGGAGCTGCTCT



GA





TXLNA->
ATGAAGAACCAAGACAAAAAGAACGGGGCTGCCAAACAATCCAATCCAA
SEQ ID


MARCH6
AAAGCAGCCCAGGACAACCGGAAGCAGGACCCGAGGGAGCCCAGGAGC
NO: 10



GGCCCAGCCAGGCGGCTCCTGCAGTAGAAGCAGAAGGTCCCGGCAGCAG



CCAGGCTCCTCGGAAGCCGGAGGGGGCTCAAGCCAGAACGGCTCAGTCT



GGGGCCCTTCGTGATGTCTCTGAGGAGCTGAGCCGCCAACTGGAAGACAT



ACTGAGCACATACTGTGTGGACAATAACCAGGGGGGCCCCGGCGAGGAT



GGGGCACAGGGTGAGCCGGCTGAACCCGAAGATGCAGAGAAGTCCCGGA



CCTATGTGGCAAGGAATGGGGAGCCTGAACCAACTCCAGTAGTCAATGG



AGAGAAGGAACCCTCCAAGGGGGATCCAAACACAGAAGAGATCCGGCA



GAGTGACGAGGTCGGAGACCGAGACCATCGAAGGCCACAGGAGAAGAA



AAAAGCCAAGGGTTTGGGGAAGGAGATCACGTTGCTGATGCAGACATTG



AATACTCTGAGTACCCCAGAGGAGAAGCTGGCTGCTCTGTGCAAGAAGT



ATGCTGAACTGGTACCTTGTGGGTCAACGACTCGTGAACTACGAACGGAA



ATCTGGCAAACAAGGCTCATCTCCACCACCTCCACAGTCATCCCAAGAAT



AA





EIF2C3->
ATGGAAATCGGCTCCGCAGGACCCGCTGGGGCCCAGCCCCTACTCATGGT
SEQ ID


ZP2
GCCCAGAAGACCTGGCTATGGCACCATGGGCAAACCCATTAAACTGCTG
NO: 11



GCTAACTGTTTTCAAGTTGAAATCCCAAAGATTGATGTCTACCTCTATGA



GGTAGATATTAAACCAGACAAGTGTCCTAGGAGAGTGAACAGGGAGGTG



GTTGACTCAATGGTTCAGCATTTTAAAGTAACTATATTTGGAGACCGTAG



ACCAGTTTATGATGGAAAAAGAAGTCTTTACACCGCCAATCCACTTCCTG



TGGCAACTACAGGGGTAGATTTAGACGTTACTTTACCTGGGGAAGGTGGA



AAAGATCGACCTTTCAAGGTGTCAATCAAATTTGTCTCTCGGGTGAGTTG



GCACCTACTGCATGAAGTACTGACAGGACGGACCTTGCCTGAGCCACTGG



AATTAGACAAGCCAATCAGCACTAACCCTGTCCATGCCGTTGATGTGGTG



CTACGACATCTGCCCTCCATGAAATACACACCTGTGGGGCGTTCATTTTTC



TCCGCTCCAGAAGGATATGACCACCCTCTGGGAGGGGGCAGGGAAGTGT



GGTTTGGATTCCATCAGTCTGTTCGGCCTGCCATGTGGAAAATGATGCTT



AATATCGATGTTTCTGCCACTGCCTTCTACAAAGCACAACCTGTAATTCA



GTTCATGTGTGAAGTTCTTGATATTCATAATATTGATGAGCAACCAAGAC



CTCTGACTGATTCTCATCGGGTAAAATTCACCAAAGAGATAAAAGGTTTG



AAGGTTGAAGTGACTCATTGTGGAACAATGAGACGGAAATACCGTGTTTG



TAATGTAACAAGGAGGCCTGCCAGTCATCAAACCTTTCCTTTACAGTTAG



AAAACGGCCAAACTGTGGAGAGAACAGTAGCGCAGTATTTCAGAGAAAA



GTATACTCTTCAGCTGAAGTACCCGCACCTTCCCTGTCTGCAAGTCGGGC



AGGAACAGAAACACACCTACCTGCCACTAGAAGTCTGTAATATTGTGGCA



GGGCAACGATGTATCAAGAAGCTAACAGACAATCAGACTTCCACTATGA



TCAAGGCAACAGCAAGATCTGCACCAGATAGACAAGAGGAAATTAGCAG



ATTGGTAAGAAGTGCAAATTATGAAACAGATCCATTTGTTCAGGAGTTTC



AATTTAAAGTTCGGGATGAAATGGCTCATGTAACTGGACGCGTACTTCCA



GCACCTATGCTCCAGTATGGAGGACGGAATCGGACAGTAGCAACACCGA



GCCATGGAGTATGGGACATGCGAGGGAAACAATTCCACACAGGAGTTGA



AATCAAAATGTGGGCTATCGCTTGTTTTGCCACACAGAGGCAGTGCAGAG



AAGAAATATTGAAGGGTTTCACAGACCAGCTGCGTAAGATTTCTAAGGAT



GCAGGGATGCCCATCCAGGGCCAGCCATGCTTCTGCAAATATGCACAGG



GGGCAGACAGCGTAGAGCCCATGTTCCGGCATCTCAAGAACACATATTCT



GGCCTACAGCTTATTATCGTCATCCTGCCGGGGAAGACACCAGTGTATGA



TAATTCCTACCAACAACCTTATGGGGAAAACGAGTACCCTCTAGTGAGAT



TCCTCCGCCAACCAATTTACATGGAAGTGAGAGTCCTAAACAGGGATGAC



CCCAACATCAAGCTGGTCTTAGATGACTGCTGGGCGACGTCCACCATGGA



TCCAGACTCTTTCCCCCAGTGGAACGTTGTCGTGGATGGCTGTGCATATG



ACCTGGACAACTACCAGACCACCTTCCATCCAGTCGGCTCCTCTGTGACC



CATCCTGATCACTATCAGAGGTTTGACATGAAGGCTTTTGCCTTTGTATCA



GAAGCCCACGTGCTCTCTAGCCTGGTCTACTTCCACTGCAGTGCCTTAATC



TGTAATCGACTCTCCCCTGACTCCCCACTGTGTTCTGTGACCTGCCCTGTG



TCCTCTAGGCACAGGCGAGCCACAGGGGCCACTGAAGCAGAGAAAATGA



CAGTCAGCCTCCCAGGACCCATTCTCCTGTTGTCAGATGACTCCTCATTCA



GAGGTGTCGGCTCATCTGATCTAAAAGCAAGTGGGAGCAGTGGGGAGAA



GAGTAGGAGTGAAACAGGGGAGGAGGTTGGCTCACGAGGTGCTATGGAC



ACCAAAGGGCACAAGACTGCTGGAGATGTTGGTTCCAAAGCTGTGGCTG



CTGTGGCTGCCTTTGCAGGTGTGGTGGCAACTCTAGGCTTCATCTACTACC



TGTACGAGAAAAGGACTGTGTCAAATCACTAA





TPP2->
ATGGCCACCGCTGCGACTGAGGAGCCCTTCCCTTTTCACGGTCTCCTGCC
SEQ ID


BRCA2
GAAGAAGGAGACCGGAGCCGCCTCCTTCCTCTGCCGCTACCCGGAGTATG
NO: 12



ATGGGCGGGGGGTGCTCATCGCAGTCCTGGACACGGGGGTCGACCCGGG



GGCTCCGGGCATGCAGATGCCTATTGGATCCAAAGAGAGGCCAACATTTT



TTGAAATTTTTAAGACACGCTGCAACAAAGCAGATTTAGGACCAATAAGT



CTTAATTGGTTTGAAGAACTTTCTTCAGAAGCTCCACCCTATAATTCTGAA



CCTGCAGAAGAATCTGAACATAAAAACAACAATTACGAACCAAACCTAT



TTAAAACTCCACAAAGGAAACCATCTTATAATCAGCTGGCTTCAACTCCA



ATAATATTCAAAGAGCAAGGGCTGACTCTGCCGCTGTACCAATCTCCTGT



AAAAGAATTAGATAAATTCAAATTAGACTTAGGAAGGAATGTTCCCAAT



AGTAGACATAAAAGTCTTCGCACAGTGAAAACTAAAATGGATCAAGCAG



ATGATGTTTCCTGTCCACTTCTAAATTCTTGTCTTAGTGAAAGTCCTGTTG



TTCTACAATGTACACATGTAACACCACAAAGAGATAAGTCAGTGGTATGT



GGGAGTTTGTTTCATACACCAAAGTTTGTGAAGGGTCGTCAGACACCAAA



ACATATTTCTGAAAGTCTAGGAGCTGAGGTGGATCCTGATATGTCTTGGT



CAAGTTCTTTAGCTACACCACCCACCCTTAGTTCTACTGTGCTCATAGTCA



GAAATGAAGAAGCATCTGAAACTGTATTTCCTCATGATACTACTGCTAAT



GTGAAAAGCTATTTTTCCAATCATGATGAAAGTCTGAAGAAAAATGATAG



ATTTATCGCTTCTGTGACAGACAGTGAAAACACAAATCAAAGAGAAGCT



GCAAGTCATGGATTTGGAAAAACATCAGGGAATTCATTTAAAGTAAATA



GCTGCAAAGACCACATTGGAAAGTCAATGCCAAATGTCCTAGAAGATGA



AGTATATGAAACAGTTGTAGATACCTCTGAAGAAGATAGTTTTTCATTAT



GTTTTTCTAAATGTAGAACAAAAAATCTACAAAAAGTAAGAACTAGCAA



GACTAGGAAAAAAATTTTCCATGAAGCAAACGCTGATGAATGTGAAAAA



TCTAAAAACCAAGTGAAAGAAAAATACTCATTTGTATCTGAAGTGGAACC



AAATGATACTGATCCATTAGATTCAAATGTAGCAAATCAGAAGCCCTTTG



AGAGTGGAAGTGACAAAATCTCCAAGGAAGTTGTACCGTCTTTGGCCTGT



GAATGGTCTCAACTAACCCTTTCAGGTCTAAATGGAGCCCAGATGGAGAA



AATACCCCTATTGCATATTTCTTCATGTGACCAAAATATTTCAGAAAAAG



ACCTATTAGACACAGAGAACAAAAGAAAGAAAGATTTTCTTACTTCAGA



GAATTCTTTGCCACGTATTTCTAGCCTACCAAAATCAGAGAAGCCATTAA



ATGAGGAAACAGTGGTAAATAAGAGAGATGAAGAGCAGCATCTTGAATC



TCATACAGACTGCATTCTTGCAGTAAAGCAGGCAATATCTGGAACTTCTC



CAGTGGCTTCTTCATTTCAGGGTATCAAAAAGTCTATATTCAGAATAAGA



GAATCACCTAAAGAGACTTTCAATGCAAGTTTTTCAGGTCATATGACTGA



TCCAAACTTTAAAAAAGAAACTGAAGCCTCTGAAAGTGGACTGGAAATA



CATACTGTTTGCTCACAGAAGGAGGACTCCTTATGTCCAAATTTAATTGA



TAATGGAAGCTGGCCAGCCACCACCACACAGAATTCTGTAGCTTTGAAGA



ATGCAGGTTTAATATCCACTTTGAAAAAGAAAACAAATAAGTTTATTTAT



GCTATACATGATGAAACATCTTATAAAGGAAAAAAAATACCGAAAGACC



AAAAATCAGAACTAATTAACTGTTCAGCCCAGTTTGAAGCAAATGCTTTT



GAAGCACCACTTACATTTGCAAATGCTGATTCAGGTTTATTGCATTCTTCT



GTGAAAAGAAGCTGTTCACAGAATGATTCTGAAGAACCAACTTTGTCCTT



AACTAGCTCTTTTGGGACAATTCTGAGGAAATGTTCTAGAAATGAAACAT



GTTCTAATAATACAGTAATCTCTCAGGATCTTGATTATAAAGAAGCAAAA



TGTAATAAGGAAAAACTACAGTTATTTATTACCCCAGAAGCTGATTCTCT



GTCATGCCTGCAGGAAGGACAGTGTGAAAATGATCCAAAAAGCAAAAAA



GTTTCAGATATAAAAGAAGAGGTCTTGGCTGCAGCATGTCACCCAGTACA



ACATTCAAAAGTGGAATACAGTGATACTGACTTTCAATCCCAGAAAAGTC



TTTTATATGATCATGAAAATGCCAGCACTCTTATTTTAACTCCTACTTCCA



AGGATGTTCTGTCAAACCTAGTCATGATTTCTAGAGGCAAAGAATCATAC



AAAATGTCAGACAAGCTCAAAGGTAACAATTATGAATCTGATGTTGAATT



AACCAAAAATATTCCCATGGAAAAGAATCAAGATGTATGTGCTTTAAATG



AAAATTATAAAAACGTTGAGCTGTTGCCACCTGAAAAATACATGAGAGT



AGCATCACCTTCAAGAAAGGTACAATTCAACCAAAACACAAATCTAAGA



GTAATCCAAAAAAATCAAGAAGAAACTACTTCAATTTCAAAAATAACTGT



CAATCCAGACTCTGAAGAACTTTTCTCAGACAATGAGAATAATTTTGTCT



TCCAAGTAGCTAATGAAAGGAATAATCTTGCTTTAGGAAATACTAAGGAA



CTTCATGAAACAGACTTGACTTGTGTAAACGAACCCATTTTCAAGAACTC



TACCATGGTTTTATATGGAGACACAGGTGATAAACAAGCAACCCAAGTGT



CAATTAAAAAAGATTTGGTTTATGTTCTTGCAGAGGAGAACAAAAATAGT



GTAAAGCAGCATATAAAAATGACTCTAGGTCAAGATTTAAAATCGGACA



TCTCCTTGAATATAGATAAAATACCAGAAAAAAATAATGATTACATGAAC



AAATGGGCAGGACTCTTAGGTCCAATTTCAAATCACAGTTTTGGAGGTAG



CTTCAGAACAGCTTCAAATAAGGAAATCAAGCTCTCTGAACATAACATTA



AGAAGAGCAAAATGTTCTTCAAAGATATTGAAGAACAATATCCTACTAGT



TTAGCTTGTGTTGAAATTGTAAATACCTTGGCATTAGATAATCAAAAGAA



ACTGAGCAAGCCTCAGTCAATTAATACTGTATCTGCACATTTACAGAGTA



GTGTAGTTGTTTCTGATTGTAAAAATAGTCATATAACCCCTCAGATGTTAT



TTTCCAAGCAGGATTTTAATTCAAACCATAATTTAACACCTAGCCAAAAG



GCAGAAATTACAGAACTTTCTACTATATTAGAAGAATCAGGAAGTCAGTT



TGAATTTACTCAGTTTAGAAAACCAAGCTACATATTGCAGAAGAGTACAT



TTGAAGTGCCTGAAAACCAGATGACTATCTTAAAGACCACTTCTGAGGAA



TGCAGAGATGCTGATCTTCATGTCATAATGAATGCCCCATCGATTGGTCA



GGTAGACAGCAGCAAGCAATTTGAAGGTACAGTTGAAATTAAACGGAAG



TTTGCTGGCCTGTTGAAAAATGACTGTAACAAAAGTGCTTCTGGTTATTTA



ACAGATGAAAATGAAGTGGGGTTTAGGGGCTTTTATTCTGCTCATGGCAC



AAAACTGAATGTTTCTACTGAAGCTCTGCAAAAAGCTGTGAAACTGTTTA



GTGATATTGAGAATATTAGTGAGGAAACTTCTGCAGAGGTACATCCAATA



AGTTTATCTTCAAGTAAATGTCATGATTCTGTTGTTTCAATGTTTAAGATA



GAAAATCATAATGATAAAACTGTAAGTGAAAAAAATAATAAATGCCAAC



TGATATTACAAAATAATATTGAAATGACTACTGGCACTTTTGTTGAAGAA



ATTACTGAAAATTACAAGAGAAATACTGAAAATGAAGATAACAAATATA



CTGCTGCCAGTAGAAATTCTCATAACTTAGAATTTGATGGCAGTGATTCA



AGTAAAAATGATACTGTTTGTATTCATAAAGATGAAACGGACTTGCTATT



TACTGATCAGCACAACATATGTCTTAAATTATCTGGCCAGTTTATGAAGG



AGGGAAACACTCAGATTAAAGAAGATTTGTCAGATTTAACTTTTTTGGAA



GTTGCGAAAGCTCAAGAAGCATGTCATGGTAATACTTCAAATAAAGAAC



AGTTAACTGCTACTAAAACGGAGCAAAATATAAAAGATTTTGAGACTTCT



GATACATTTTTTCAGACTGCAAGTGGGAAAAATATTAGTGTCGCCAAAGA



GTCATTTAATAAAATTGTAAATTTCTTTGATCAGAAACCAGAAGAATTGC



ATAACTTTTCCTTAAATTCTGAATTACATTCTGACATAAGAAAGAACAAA



ATGGACATTCTAAGTTATGAGGAAACAGACATAGTTAAACACAAAATAC



TGAAAGAAAGTGTCCCAGTTGGTACTGGAAATCAACTAGTGACCTTCCAG



GGACAACCCGAACGTGATGAAAAGATCAAAGAACCTACTCTATTGGGTTT



TCATACAGCTAGCGGGAAAAAAGTTAAAATTGCAAAGGAATCTTTGGAC



AAAGTGAAAAACCTTTTTGATGAAAAAGAGCAAGGTACTAGTGAAATCA



CCAGTTTTAGCCATCAATGGGCAAAGACCCTAAAGTACAGAGAGGCCTGT



AAAGACCTTGAATTAGCATGTGAGACCATTGAGATCACAGCTGCCCCAAA



GTGTAAAGAAATGCAGAATTCTCTCAATAATGATAAAAACCTTGTTTCTA



TTGAGACTGTGGTGCCACCTAAGCTCTTAAGTGATAATTTATGTAGACAA



ACTGAAAATCTCAAAACATCAAAAAGTATCTTTTTGAAAGTTAAAGTACA



TGAAAATGTAGAAAAAGAAACAGCAAAAAGTCCTGCAACTTGTTACACA



AATCAGTCCCCTTATTCAGTCATTGAAAATTCAGCCTTAGCTTTTTACACA



AGTTGTAGTAGAAAAACTTCTGTGAGTCAGACTTCATTACTTGAAGCAAA



AAAATGGCTTAGAGAAGGAATATTTGATGGTCAACCAGAAAGAATAAAT



ACTGCAGATTATGTAGGAAATTATTTGTATGAAAATAATTCAAACAGTAC



TATAGCTGAAAATGACAAAAATCATCTCTCCGAAAAACAAGATACTTATT



TAAGTAACAGTAGCATGTCTAACAGCTATTCCTACCATTCTGATGAGGTA



TATAATGATTCAGGATATCTCTCAAAAAATAAACTTGATTCTGGTATTGA



GCCAGTATTGAAGAATGTTGAAGATCAAAAAAACACTAGTTTTTCCAAAG



TAATATCCAATGTAAAAGATGCAAATGCATACCCACAAACTGTAAATGA



AGATATTTGCGTTGAGGAACTTGTGACTAGCTCTTCACCCTGCAAAAATA



AAAATGCAGCCATTAAATTGTCCATATCTAATAGTAATAATTTTGAGGTA



GGGCCACCTGCATTTAGGATAGCCAGTGGTAAAATCGTTTGTGTTTCACA



TGAAACAATTAAAAAAGTGAAAGACATATTTACAGACAGTTTCAGTAAA



GTAATTAAGGAAAACAACGAGAATAAATCAAAAATTTGCCAAACGAAAA



TTATGGCAGGTTGTTACGAGGCATTGGATGATTCAGAGGATATTCTTCAT



AACTCTCTAGATAATGATGAATGTAGCACGCATTCACATAAGGTTTTTGC



TGACATTCAGAGTGAAGAAATTTTACAACATAACCAAAATATGTCTGGAT



TGGAGAAAGTTTCTAAAATATCACCTTGTGATGTTAGTTTGGAAACTTCA



GATATATGTAAATGTAGTATAGGGAAGCTTCATAAGTCAGTCTCATCTGC



AAATACTTGTGGGATTTTTAGCACAGCAAGTGGAAAATCTGTCCAGGTAT



CAGATGCTTCATTACAAAACGCAAGACAAGTGTTTTCTGAAATAGAAGAT



AGTACCAAGCAAGTCTTTTCCAAAGTATTGTTTAAAAGTAACGAACATTC



AGACCAGCTCACAAGAGAAGAAAATACTGCTATACGTACTCCAGAACAT



TTAATATCCCAAAAAGGCTTTTCATATAATGTGGTAAATTCATCTGCTTTC



TCTGGATTTAGTACAGCAAGTGGAAAGCAAGTTTCCATTTTAGAAAGTTC



CTTACACAAAGTTAAGGGAGTGTTAGAGGAATTTGATTTAATCAGAACTG



AGCATAGTCTTCACTATTCACCTACGTCTAGACAAAATGTATCAAAAATA



CTTCCTCGTGTTGATAAGAGAAACCCAGAGCACTGTGTAAACTCAGAAAT



GGAAAAAACCTGCAGTAAAGAATTTAAATTATCAAATAACTTAAATGTTG



AAGGTGGTTCTTCAGAAAATAATCACTCTATTAAAGTTTCTCCATATCTCT



CTCAATTTCAACAAGACAAACAACAGTTGGTATTAGGAACCAAAGTGTCA



CTTGTTGAGAACATTCATGTTTTGGGAAAAGAACAGGCTTCACCTAAAAA



CGTAAAAATGGAAATTGGTAAAACTGAAACTTTTTCTGATGTTCCTGTGA



AAACAAATATAGAAGTTTGTTCTACTTACTCCAAAGATTCAGAAAACTAC



TTTGAAACAGAAGCAGTAGAAATTGCTAAAGCTTTTATGGAAGATGATGA



ACTGACAGATTCTAAACTGCCAAGTCATGCCACACATTCTCTTTTTACATG



TCCCGAAAATGAGGAAATGGTTTTGTCAAATTCAAGAATTGGAAAAAGA



AGAGGAGAGCCCCTTATCTTAGTGGGAGAACCCTCAATCAAAAGAAACT



TATTAAATGAATTTGACAGGATAATAGAAAATCAAGAAAAATCCTTAAA



GGCTTCAAAAAGCACTCCAGATGGCACAATAAAAGATCGAAGATTGTTT



ATGCATCATGTTTCTTTAGAGCCGATTACCTGTGTACCCTTTCGCACAACT



AAGGAACGTCAAGAGATACAGAATCCAAATTTTACCGCACCTGGTCAAG



AATTTCTGTCTAAATCTCATTTGTATGAACATCTGACTTTGGAAAAATCTT



CAAGCAATTTAGCAGTTTCAGGACATCCATTTTATCAAGTTTCTGCTACAA



GAAATGAAAAAATGAGACACTTGATTACTACAGGCAGACCAACCAAAGT



CTTTGTTCCACCTTTTAAAACTAAATCACATTTTCACAGAGTTGAACAGTG



TGTTAGGAATATTAACTTGGAGGAAAACAGACAAAAGCAAAACATTGAT



GGACATGGCTCTGATGATAGTAAAAATAAGATTAATGACAATGAGATTC



ATCAGTTTAACAAAAACAACTCCAATCAAGCAGTAGCTGTAACTTTCACA



AAGTGTGAAGAAGAACCTTTAGATTTAATTACAAGTCTTCAGAATGCCAG



AGATATACAGGATATGCGAATTAAGAAGAAACAAAGGCAACGCGTCTTT



CCACAGCCAGGCAGTCTGTATCTTGCAAAAACATCCACTCTGCCTCGAAT



CTCTCTGAAAGCAGCAGTAGGAGGCCAAGTTCCCTCTGCGTGTTCTCATA



AACAGCTGTATACGTATGGCGTTTCTAAACATTGCATAAAAATTAACAGC



AAAAATGCAGAGTCTTTTCAGTTTCACACTGAAGATTATTTTGGTAAGGA



AAGTTTATGGACTGGAAAAGGAATACAGTTGGCTGATGGTGGATGGCTC



ATACCCTCCAATGATGGAAAGGCTGGAAAAGAAGAATTTTATAGGGCTCT



GTGTGACACTCCAGGTGTGGATCCAAAGCTTATTTCTAGAATTTGGGTTT



ATAATCACTATAGATGGATCATATGGAAACTGGCAGCTATGGAATGTGCC



TTTCCTAAGGAATTTGCTAATAGATGCCTAAGCCCAGAAAGGGTGCTTCT



TCAACTAAAATACAGATATGATACGGAAATTGATAGAAGCAGAAGATCG



GCTATAAAAAAGATAATGGAAAGGGATGACACAGCTGCAAAAACACTTG



TTCTCTGTGTTTCTGACATAATTTCATTGAGCGCAAATATATCTGAAACTT



CTAGCAATAAAACTAGTAGTGCAGATACCCAAAAAGTGGCCATTATTGA



ACTTACAGATGGGTGGTATGCTGTTAAGGCCCAGTTAGATCCTCCCCTCTT



AGCTGTCTTAAAGAATGGCAGACTGACAGTTGGTCAGAAGATTATTCTTC



ATGGAGCAGAACTGGTGGGCTCTCCTGATGCCTGTACACCTCTTGAAGCC



CCAGAATCTCTTATGTTAAAGATTTCTGCTAACAGTACTCGGCCTGCTCGC



TGGTATACCAAACTTGGATTCTTTCCTGACCCTAGACCTTTTCCTCTGCCC



TTATCATCGCTTTTCAGTGATGGAGGAAATGTTGGTTGTGTTGATGTAATT



ATTCAAAGAGCATACCCTATACAGTGGATGGAGAAGACATCATCTGGATT



ATACATATTTCGCAATGAAAGAGAGGAAGAAAAGGAAGCAGCAAAATAT



GTGGAGGCCCAACAAAAGAGACTAGAAGCCTTATTCACTAAAATTCAGG



AGGAATTTGAAGAACATGAAGAAAACACAACAAAACCATATTTACCATC



ACGTGCACTAACAAGACAGCAAGTTCGTGCTTTGCAAGATGGTGCAGAG



CTTTATGAAGCAGTGAAGAATGCAGCAGACCCAGCTTACCTTGAGGGTTA



TTTCAGTGAAGAGCAGTTAAGAGCCTTGAATAATCACAGGCAAATGTTGA



ATGATAAGAAACAAGCTCAGATCCAGTTGGAAATTAGGAAGGCCATGGA



ATCTGCTGAACAAAAGGAACAAGGTTTATCAAGGGATGTCACAACCGTG



TGGAAGTTGCGTATTGTAAGCTATTCAAAAAAAGAAAAAGATTCAGTTAT



ACTGAGTATTTGGCGTCCATCATCAGATTTATATTCTCTGTTAACAGAAGG



AAAGAGATACAGAATTTATCATCTTGCAACTTCAAAATCTAAAAGTAAAT



CTGAAAGAGCTAACATACAGTTAGCAGCGACAAAAAAAACTCAGTATCA



ACAACTACCGGTTTCAGATGAAATTTTATTTCAGATTTACCAGCCACGGG



AGCCCCTTCACTTCAGCAAATTTTTAGATCCAGACTTTCAGCCATCTTGTT



CTGAGGTGGACCTAATAGGATTTGTCGTTTCTGTTGTGAAAAAAACAGGA



CTTGCCCCTTTCGTCTATTTGTCAGACGAATGTTACAATTTACTGGCAATA



AAGTTTTGGATAGACCTTAATGAGGACATTATTAAGCCTCATATGTTAAT



TGCTGCAAGCAACCTCCAGTGGCGACCAGAATCCAAATCAGGCCTTCTTA



CTTTATTTGCTGGAGATTTTTCTGTGTTTTCTGCTAGTCCAAAAGAGGGCC



ACTTTCAAGAGACATTCAACAAAATGAAAAATACTGTTGAGAATATTGAC



ATACTTTGCAATGAAGCAGAAAACAAGCTTATGCATATACTGCATGCAAA



TGATCCCAAGTGGTCCACCCCAACTAAAGACTGTACTTCAGGGCCGTACA



CTGCTCAAATCATTCCTGGTACAGGAAACAAGCTTCTGATGTCTTCTCCTA



ATTGTGAGATATATTATCAAAGTCCTTTATCACTTTGTATGGCCAAAAGG



AAGTCTGTTTCCACACCTGTCTCAGCCCAGATGACTTCAAAGTCTTGTAA



AGGGGAGAAAGAGATTGATGACCAAAAGAACTGCAAAAAGAGAAGAGC



CTTGGATTTCTTGAGTAGACTGCCTTTACCTCCACCTGTTAGTCCCATTTG



TACATTTGTTTCTCCGGCTGCACAGAAGGCATTTCAGCCACCAAGGAGTT



GTGGCACCAAATACGAAACACCCATAAAGAAAAAAGAACTGAATTCTCC



TCAGATGACTCCATTTAAAAAATTCAATGAAATTTCTCTTTTGGAAAGTA



ATTCAATAGCTGACGAAGAACTTGCATTGATAAATACCCAAGCTCTTTTG



TCTGGTTCAACAGGAGAAAAACAATTTATATCTGTCAGTGAATCCACTAG



GACTGCTCCCACCAGTTCAGAAGATTATCTCAGACTGAAACGACGTTGTA



CTACATCTCTGATCAAAGAACAGGAGAGTTCCCAGGCCAGTACGGAAGA



ATGTGAGAAAAATAAGCAGGACACAATTACAACTAAAAAATATATCTAA





CLEC16A
ATGTTTGGCCGCTCGCGGAGCTGGGTGGGCGGGGGCCATGGCAAGACTTC
SEQ ID


->
CCGCAACATCCACTCCTTGGACCACCTCAAGTATCTGTACCACGTTTTGAC
NO: 13


BCAR4
CAAAAACACCACAGTCACAGAACAGAACCGGAACCTGCTAGTGGAGACC



ATCCGTTCCATCACTGAGATCCTGATCTGGGGAGATCAAAATGACAGCTC



TGTATTTGACTTCTTCCTGGAGAAGAATATGTTTGTTTTCTTCTTGAACAT



CTTGCGGCAAAAGTCGGGCCGTTACGTGTGCGTTCAGCTGCTGCAGACCT



TGAACATCCTCTTTGAGAACATCAGTCACGAGACCTCACTTTATTATTTGC



TCTCAAATAACTACGTAAATTCTATCATCGTTCATAAATTTGACTTTTCTG



ATGAGGAGATTATGGCCTATTATATATCGTTCCTGAAAACACTTTCGTTA



AAACTCAACAACCACACTGTCCATTTCTTTTATAATGAGCACACCAATGA



CTTTGCCCTGTACACAGAAGCCATCAAGTTTTTCAACCACCCTGAAAGCA



TGGTTAGAATTGCTGTAAGAACCATAACTTTGAATGTCTATAAAGTCTCA



GTGGATAACCAGGCCATGCTGCACTACATCCGAGATAAAACTGCTGTTCC



TTACTTCTCCAATTTGGTCTGGTTCATTGGGAGCCATGTGATCGAACTCGA



TGACTGCGTGCAGACTGATGAGGAGCATCGGAATCGGGGTAAACTGAGT



GATCTGGTGGCAGAGCACCTAGACCACCTGCACTATCTCAATGACATCCT



GATCATCAACTGTGAGTTCCTCAACGATGTGCTCACTGACCACCTGCTCA



ACAGGCTCTTCCTGCCCCTCTACGTGTACTCACTGGAGAACCAGGACAAG



GGAGGAGAACGGCCGAAAATTAGCCTGCCGGTGTCTCTTTATCTTCTGTC



ACAGGTCTTCTTAATTATACATCATGCACCGCTGGTGAACTCGTTAGCTG



AAGTCATTCTGAATGGTGATCTGTCTGAGATGTACGCTAAGACTGAACAG



GATATTCAGAGAAGTTCTGCCAAGCCCAGCATTCGGTGCTTCATTAAACC



CACCGAGACACTCGAGCGGTCCCTTGAGATGAACAAGCACAAGGGCAAG



AGGCGGGTGCAAAAGAGACCCAACTACAAAAACGTTGGGGAAGAAGAA



GATGAGGAGAAAGGGCCCACCGAGGATGCCCAAGAAGACGCCGAGAAG



GCTAAAGGTACAGAGGGTGGTTCAAAAGGCATCAAGACGAGTGGGGAGA



GTGAAGAGATCGAGATGGTGATCATGGAGCGTAGCAAGCTCTCAGAGCT



GGCCGCCAGCACCTCCGTGCAGGAGCAGAACACCACGGACGAGGAGAAA



AGCGCCGCCGCCACCTGCTCTGAGAGCACGCAATGGAGCAGACCCTTCCT



GGATATGGTGTACCACGCGCTGGACAGCCCGGATGATGATTACCATGCCC



TGTTCGTGCTCTGCCTCCTCTATGCCATGTCTCATAATAAAGGCATGGATC



CTGAAAAATTAGAGCGAATCCAGCTCCCCGTGCCAAATGCGGCCGAGAA



GACCACCTACAACCACCCGCTAGCTGAAAGACTCATCAGGATCATGAAC



AACGCTGCCCAGCCAGATGGGAAGATCCGGCTGGCGACGCTGGAGCTGA



GCTGCCTGCTTCTGAAGCAGCAAGTCCTGATGAGTGCTGGCTGCATCATG



AAGGACGTGCACCTGGCCTGCCTGGAGGGTGCGAGAGAAGAAAGTGTTC



ACCTTGTACGACATTTTTATAAGGGAGAAGACATTTTTTTGGACATGTTTG



AAGATGAGTATAGGAGCATGACAATGAAGCCCATGAACGTGGAATATCT



CATGATGGACGCCTCCATCCTGCTGCCCCCAACAGGCACGCCACTGACGG



GCATTGACTTCGTGAAGCGGCTGCCGTGTGGCGATGTGGAGAAGACCCG



GCGGGCCATCCGGGTGTTCTTCATGCTGCGTTCCCTGTCACTGCAATTGCG



AGGGGAGCCTGAGACACAGTTGCCGCTGACTCGGGAGGAGGACCTGATC



AAGACTGATGATGTCCTGGATCTGACAAAAAATCACCATGTACCAACCTA



TCCAAACTTATCCATGGATGAATCTATCCAGAAGACGGGAGTTCCGATGC



TTGTCTTGCTCTGAATGTCTGCTTGTCACCTGCTTAGGGTTATCGACTGTG



ATTCTGGGACTCATTGTTGTTCTACAGGACCCCTCTGACTCTGTGGTTTTC



TCTACTGGATTAACAATGATAGCCATAGGTGCTTTTTTTGTTGTCCTCACT



GGAGTGACAGCCCTGTGTACGGTTACAGTCGACGAGAACTTGCAGAAAA



CCACGAGGCTAAGACTAGGAGTGATACGAAAAAGCGGAAGTCTCCAAGG



AACTACAGAGCCTTCCATGACTCACTCAATAATCGCTAGCACCTCGCTGT



AGTTGTACATTGAACCCTGGCATCTTCGTCTTTGGAACTAAGTCTCCTGAG



CATTGTTTTTAAATAGAAATAAAATCTGGCTTTTAA





ERBB2->
ATGGAGCTGGCGGCCTTGTGCCGCTGGGGGCTCCTCCTCGCCCTCTTGCC
SEQ ID


IKZF3
CCCCGGAGCCGCGAGCACCCAAGTGTGCACCGGCACAGACATGAAGCTG
NO: 14



CGGCTCCCTGCCAGTCCCGAGACCCACCTGGACATGCTCCGCCACCTCTA



CCAGGGCTGCCAGGTGGTGCAGGGAAACCTGGAACTCACCTACCTGCCC



ACCAATGCCAGCCTGTCCTTCCTGCAGGATATCCAGGAGGTGCAGGGCTA



CGTGCTCATCGCTCACAACCAAGTGAGGCAGGTCCCACTGCAGAGGCTGC



GGATTGTGCGAGGCACCCAGCTCTTTGAGGACAACTATGCCCTGGCCGTG



CTAGACAATGGAGACCCGCTGAACAATACCACCCCTGTCACAGGGGCCTC



CCCAGGAGGCCTGCGGGAGCTGCAGCTTCGAAGCCTCACAGAGATCTTG



AAAGGAGGGGTCTTGATCCAGCGGAACCCCCAGCTCTGCTACCAGGACA



CGATTTTGTGGAAGGACATCTTCCACAAGAACAACCAGCTGGCTCTCACA



CTGATAGACACCAACCGCTCTCGGGCCTGCCACCCCTGTTCTCCGATGTG



TAAGGGCTCCCGCTGCTGGGGAGAGAGTTCTGAGGATTGTCAGAGCCTGA



CGCGCACTGTCTGTGCCGGTGGCTGTGCCCGCTGCAAGGGGCCACTGCCC



ACTGACTGCTGCCATGAGCAGTGTGCTGCCGGCTGCACGGGCCCCAAGCA



CTCTGACTGCCTGGCCTGCCTCCACTTCAACCACAGTGGCATCTGTGAGCT



GCACTGCCCAGCCCTGGTCACCTACAACACAGACACGTTTGAGTCCATGC



CCAATCCCGAGGGCCGGTATACATTCGGCGCCAGCTGTGTGACTGCCTGT



CCCTACAACTACCTTTCTACGGACGTGGGATCCTGCACCCTCGTCTGCCCC



CTGCACAACCAAGAGGTGACAGCAGAGGATGGAACACAGCGGTGTGAGA



AGTGCAGCAAGCCCTGTGCCCGAGTGTGCTATGGTCTGGGCATGGAGCAC



TTGCGAGAGGTGAGGGCAGTTACCAGTGCCAATATCCAGGAGTTTGCTGG



CTGCAAGAAGATCTTTGGGAGCCTGGCATTTCTGCCGGAGAGCTTTGATG



GATGATTCAATGAAAGTGAAAGATGAATACAGTGAAAGAGATGAGAATG



TTTTAAAGTCAGAACCCATGGGAAATGCAGAAGAGCCTGAAATCCCTTAC



AGCTATTCAAGAGAATATAATGAATATGAAAACATTAAGTTGGAGAGAC



ATGTTGTCTCATTCGATAGTAGCAGGCCAACCAGTGGAAAGATGAACTGC



GATGTGTGTGGATTATCCTGCATCAGCTTCAATGTCTTAATGGTTCATAAG



CGAAGCCATACTGGTGAACGCCCATTCCAGTGTAATCAGTGTGGGGCATC



TTTTACTCAGAAAGGTAACCTCCTCCGCCACATTAAACTGCACACAGGGG



AAAAACCTTTTAAGTGTCACCTCTGCAACTATGCATGCCAAAGAAGAGAT



GCGCTCACGGGGCATCTTAGGACACATTCTGTGGAGAAACCCTACAAATG



TGAGTTTTGTGGAAGGAGTTACAAGCAGAGAAGTTCCCTTGAGGAGCAC



AAGGAGCGCTGCCGTACATTTCTTCAGAGCACTGACCCAGGGGACACTGC



AAGTGCGGAGGCAAGACACATCAAAGCAGAGATGGGAAGTGAAAGAGC



TCTCGTACTGGACAGATTAGCAAGCAATGTGGCAAAACGAAAAAGCTCA



ATGCCTCAGAAATTCATTGGTGAGAAGCGCCACTGCTTTGATGTCAACTA



TAATTCAAGTTACATGTATGAGAAAGAGAGTGAGCTCATACAGACCCGC



ATGATGGACCAAGCCATCAATAACGCCATCAGCTATCTTGGCGCCGAAGC



CCTGCGCCCCTTGGTCCAGACACCGCCTGCTCCCACCTCGGAGATGGTTC



CAGTTATCAGCAGCATGTATCCCATAGCCCTCACCCGGGCTGAGATGTCA



AACGGTGCCCCTCAAGAGCTGGAAAAGAAAAGCATCCACCTTCCAGAGA



AGAGCGTGCCTTCTGAGAGAGGCCTCTCTCCCAACAATAGTGGCCACGAC



TCCACGGACACTGACAGCAACCATGAAGAACGCCAGAATCACATCTATC



AGCAAAATCACATGGTCCTGTCTCGGGCCCGCAATGGGATGCCACTTCTG



AAGGAGGTTCCCCGCTCTTACGAACTCCTCAAGCCCCCGCCCATCTGCCC



AAGAGACTCCGTCAAAGTGATCAACAAGGAAGGGGAGGTGATGGATGTG



TATCGGTGTGACCACTGCCGCGTCCTCTTCCTGGACTATGTGATGTTCACG



ATTCACATGGGCTGCCACGGCTTCCGTGACCCTTTCGAGTGTAACATGTG



TGGATATCGAAGCCATGATCGGTATGAGTTCTCGTCTCACATAGCCAGAG



GAGAACACAGAGCCCTGCTGAAGTGA





TANC2->
ATGTTTCGGAATAGTCTCAAGATGCTGCTTACTGGTGGGAAATCAAGTCG
SEQ ID


RDM1
TAAAAACAGGTCAAGTGAAAGTGGTAAAATAGCTGTGGAGTACAGACCC
NO: 15



AGTGAAGACATCGTAGGTGTCAGATGCGAAGAAGAACTACACGGTTTAA



TTCAAGTCCCTTGCTCTCCCTGGAAGCAGTATGGCCAAGAGGAGGAAGGG



TATCTCTCGGATTTCAGCTTGGAGGAGGAAGAGTTCAGGCTGCCAGAACT



TGACTAG





PIK3C3->
ATGGGGGAAGCAGAGAAGTTTCACTACATCTATAGTTGTGACCTGGATAT
SEQ ID


RPRD1A
CAACGTCCAGCTTAAGATAGGAAGCTTGGAAGGGAAGAGAGAACAAAAG
NO: 16



AGTTATAAAGCTGTCCTGGAAGACCCAATGTTGAAGTTCTCAGGACTATA



TCAAGAGACATGCTCTGATCTTTATGTTACTTGTCAAGTTTTTGCAGAAGG



GAAGCCTTTGGCCTTGCCAGTGAGAACATCCTACAAAGCATTTAGTACAA



GATGGAACTGGAATGAATGGCTGAAACTACCAGTAAAATACCCTGACCT



GCCCAGGAATGCCCAAGTGGCCCTCACCATATGGGATGTGTATGGTCCCG



GAAAAGCAGTGCCTGTAGGAGGAACAACGGTTTCGCTCTTTGGAAAATA



CGGCATGTTTCGCCAAGGGATGCATGACTTGAAAGTCTGGCCTAATGTAG



AAGCAGATGGATCAGAACCCACAAAAACTCCTGGCAGAACAAGTAGCAC



TCTCTCAGAAGATCAGATGAGCCGTCTTGCCAAGCTCACCAAAGCTCATC



GACAAGGACACATGGTGAAAGTAGATTGGCTGGATAGATTGACATTTAG



AGAAATAGAAATGATAAATGAGAGTGAAAAACGAAGTTCTAATTTCATG



TACCTGATGGTTGAATTTCGATGTGTCAAGTGTGATGATAAGGAATATGG



TATTGTTTATTATGAAAAGGACGGTGATGAATCATCTCCAATTTTAACAA



GTTTTGAATTAGTGAAAGTTCCTGACCCCCAGATGTCTATGGAGAATTTA



GTTGAGAGCAAACACCACAAGCTTGCCCGGAGTTTAAGAAGTGGACCTTC



TGACCACGATCTGAAACCCAATGCTGCCACGAGAGATCAGTTAAATATTA



TTGTGAGTTATCCACCAACCAAGCAACTTACATATGAAGAACAAGATCTT



GTTTGGAAGTTTAGATATTATCTTACGAATCAAGAAAAAGCCTTGACAAA



ATTCTTGAAATGTGTTAATTGGGATCTACCTCAAGAGGCCAAACAGGCCT



TGGAACTTCTGGGAAAATGGAAGCCGATGGATGTAGAGGACTCCTTGGA



GCTGTTATCCTCTCATTACACCAACCCAACTGTGAGGCGTTATGCTGTTGC



CCGGTTGCGACAGGCCGATGATGAGGATTTGTTGATGTACCTATTACAAT



TGGTCCAGGCTCTCAAATATGAAAATTTTGATGATATAAAGAATGGATTG



GAACCTACCAAGAAGGATAGTCAGAGTTCAGTGTCAGAAAATGTGTCAA



ATTCTGGAATAAATTCTGCAGAAATAGATAGCTCCCAAATTATAACCAGC



CCCCTTCCTTCAGTCTCTTCACCTCCTCCTGCATCAAAAACAAAAGAAGTT



CCAGATGGCGAAAATCTGGAACAAGATCTCTGTACCTTCTTGATATCGAG



AGCCTGCAAAAACTCAACACTGGCTAATTATTTATACTGGTATGTGATAG



TGGAATGTGAAGATCAAGATACTCAGCAGAGAGATCCAAAGACCCATGA



GATGTACTTGAACGTAATGAGAAGATTCAGCCAAGCATTGTTGAAGGGTG



ATAAGTCTGTCAGAGTTATGCGTTCTTTGCTGGCTGCACAACAGACATTT



GTAGATCGGTTGGTGCATCTAATGAAGGCAGTACAACGCGAAAGTGGAA



ATCGTAAGAAAAAGAATGAGAGACTACAGGCATTGCTTGGAGATAATGA



AAAGATGAATTTGTCAGATGTGGAACTTATCCCGTTGCCTTTAGAACCCC



AAGTGAAAATTAGAGGAATAATTCCGGAAACAGCTACACTGTTTAAAAG



TGCCCTTATGCCTGCACAGTTGTTTTTTAAGACGGAAGATGGAGGCAAAT



ATCCAGTTATATTTAAGCATGGAGATGATTTACGTCAAGATCAACTTATT



CTTCAAATCATTTCACTCATGGACAAGCTGTTACGGAAAGAAAATCTGGA



CTTGAAATTGACACCTTATAAGGTGTTAGCCACCAGTACAAAACATGGCT



TCATGCAGTTTATCCAGTCAGTTCCTGTGGCTGAAGTTCTTGATACAGAG



GGAAGCATTCAGAACTTTTTTAGAAAATATGCACCAAGTGAGAATGGGCC



AAATGGGATTAGTGCTGAGGTCATGGACACTTACGTTAAAAGCTGTGCTG



GATATTGCGTGATCACCTATATACTTGGAGTTGGAGACAGGCACCTGGAT



AACCTTTTGCTAACAAAAACAGCCAAACCAAACAGGAAGCTTACTTTTCT



CTACCTAGCCAATGATGTCATACAGAACAGCAAGAGGAAGGGGCCAGAG



TTTACAAAAGATTTTGCACCAGTTATAGTGGAGGCTTTTAAGCATGTTTCA



AGTGAAACTGATGAAAGTTGTAAGAAGCACCTTGGAAGAGTGTTATCTAT



TTGGGAAGAAAGGTCTGTTTATGAAAATGATGTATTAGAACAACTTAAAC



AAGCTCTGTATGGTGATAAGAAGCCTAGGAAGCGAACTTATGAACAGAT



AAAGGTGGATGAAAATGAAAACTGTTCCTCTCTGGGATCTCCAAGTGAAC



CACCACAGACTCTAGATCTCGTTAGAGCATTACAAGATCTGGAAAATGCA



GCCTCAGGTGATGCAGCAGTTCATCAGAGGATAGCTTCTTTACCTGTTGA



AGTCCAAGAAGTATCTCTATTAGATAAAATAACAGATAAAGAATCTGGA



GAAAGGCTTTCCAAAATGGTAGAGGATGCGTGTATGTTGCTGGCAGATTA



CAATGGCAGATTGGCGGCAGAAATAGATGATAGAAAGCAACTCACTCGA



ATGTTAGCAGATTTTCTTCGTTGTCAAAAGGAAGCCCTTGCAGAGAAAGA



GCATAAATTGGAAGAGTACAAGCGCAAGCTAGCCAGAGTTTCCCTGGTG



CGCAAAGAACTCAGGTCCCGGATCCAGAGCCTGCCAGACTTATCTCGATT



GCCCAATGTCACTGGCAGCCACATGCACCTGCCCTTTGCGGGAGACATCT



ACAGTGAAGATTGA





PPP2R1A
ATGGCGGCGGCCGACGGCGACGACTCGCTGTACCCCATCGCGGTGCTCAT
SEQ ID


->
AGACGAACTCCGCAATGAGGACGTTCAGCTTCGCCTCAACAGCATCAAG
NO: 17


NLRP8
AAGCTGTCCACCATCGCCTTGGCCCTTGGGGTTGAAAGGACCCGAAGTGA



GCTTCTGCCTTTCCTTACAGATACCATCTATGATGAAGATGAGGTCCTCCT



GGCCCTGGCAGAACAGCTGGGAACCTTCACTACCCTGGTGGGAGGCCCA



GAGTACGTGCACTGCCTGCTGAGCGCCAGAGAGCAATGGGCTGCATCGTT



GGTGGCAAGACTTATGCTCTGTGTTTGCAACGAATGATAAGCTGGAAGTC



CTGACTATGACCAACAGTGTTTTGGGGCCTCCTTTTTTGAAGGCTCTCGCG



GCCGCACTGAGGCACCCTCAGTGCAAACTGCAAAAGCTACTCCTAAGGC



GTGTGAATAGCACCATGTTGAACCAGGACTTAATCGGTGTTTTGACGGGG



AACCAGCATCTGAGATACTTGGAAATACAACATGTGGAAGTGGAGTCCA



AAGCTGTGAAGCTTCTATGCAGGGTGCTGAGATCCCCCCGGTGCCGTCTG



CAGTGTCTCAGGTTGGAAGACTGCTTGGCCACCCCTAGAATTTGGACTGA



TCTTGGCAATAATCTTCAAGGTAACGGGCATCTAAAGACTCTCATACTAA



GAAAAAACTCCCTGGAGAACTGTGGGGCGTATTACCTGTCTGTGGCCCAG



CTGGAGAGGCTGTCGATAGAGAACTGCAACCTTACACAGCTTACTTGTGA



AAGCCTTGCCTCCTGTCTCAGGCAGAGTAAGATGCTGACCCACCTGAGCT



TGGCAGAAAACGCCTTGAAAGATGAAGGGGCCAAGCATATTTGGAATGC



CCTGCCACACCTGAGATGTCCTCTGCAGAGGCTGGTACTGAGAAAGTGTG



ACTTGACCTTTAATTGCTGTCAGGATATGATCTCTGCGCTCTGTAAAAATA



AAACCCTGAAAAGTCTTGACCTAAGTTTTAATAGCCTGAAGGATGATGGG



GTGATCCTGCTGTGTGAGGCCCTGAAGAACCCTGACTGTACATTACAGAT



CCTGGAGCTGGAAAACTGCCTGTTCACCTCCATCTGCTGCCAGGCCATGG



CTTCCATGCTCCGCAAAAACCAACATCTGAGACATCTGGACTTGAGCAAG



AATGCGATTGGAGTCTATGGTATTCTGACCTTGTGCGAGGCCTTCTCAAG



CCAAAAGAAGAGAGAAGAGGTCATTTTCTGTATTCCTGCCTGGACTCGAA



TAACTAGCTTCTCCCCAACTCCTCACCCACCCGACTTCACGGGAAAAAGT



GACTGCCTATCCCAGATTAATCCTTAG





RPS28->
ATGGACACCAGCCGTGTGCAGCCTATCAAGCTGGCCAGGGTCACCAAGG
SEQ ID


LOC100505619
TCCTGGGCAGGACCGGTTCTCAGGGACAGTGCACGCAGGGTTCTTCTGCT
NO: 18



CACTGGGTCAGAAGCATCCGATTTCCTGTCATCGGTTGCGGTTCAGCCAT



GTTAATACTGAAAGTATAAGAATATCATAATCCCAGAGTGATATAAAGAC



CTTTCCAAGGAAGATGAGATACCCGGAGGCTGTGAAGGAGAAGTCTGAA



GGATTTGAACGCGTTCAAATGAAAACCTTTGGTTCAGCAAAACACCAGAA



ACAAATTCAAAAGATAAAATACACTCCGGAGAAGCCCATTCTCTCAACGC



ATCTGACAGTTCACAAAGAGCAACTGCAGTCCAACTTAAAAAATAAAGA



AAATGGTAGGAAAATGGGCAACATGCACTGAGGCAATATATAAGAAGAA



GGAATGCGCAATTAATCAACAAAAGCCTGCTCTTTCCTAACGAGTGAAGA



TACAAGCTTAGTAAATATTAAAATTTATTGAAGATAAAAATGGACACTTC



CGGCCATATATAAAAGCATTGTTGGTAAGAGAGTGGGGTAATTGACATGC



TCCTGGGAAATAATACGCATTCAGACAACCTCTAATGAGTGAATCCACTG



GGGGTATCCACTAGGAATAGAAAACGACCACCAATGCCCAAGTTTGTGG



TCCAGAATTTGAGCAGCGATCTAACAAAGTAGAGTCTCCCCCTGTGTCAT



GGGCCCCACTTCTTAAGCAAAGAAGAATGCCTCATTTTGGTAACTCACCC



AAATAAACAGAAATTTTTTTTGATAATCTTATCGCATTTATTTGGTTAAGT



CTGAGGCAGAATATTTTTCTCCTTTGTATCTTTGTTTTGTTTTGTTTTAATC



CTTGAAGTTTATTACCAAACTACATCTTGGTGTTTCTTCAGGATGTGAACA



GGCCTTTCTATTCTCAATTAAGTGTTTCTTTCCTTTAATTCAGGAATCTTCT



AATCATGTATTTTAATTTCTTTTTCTGTCATCCCTCCCCACGCCATAAGCT



ACCAGTCACTATACATGTTGGAGCTCTCTTTATTGTGCCATAGATCTCTGA



TGTCATGTCTTTAGTCATTATTTTGTGTCAGAAATTTTTGTTCTTCCGGGG



ACATCTGGCAATATCTAGAGACATGCTACTGGCACCTATAGAAATGCTAG



TGACTTATTATGGGTAGAGGCTAAAGATGCTGCTAAATATCCTACAATGC



CCAGGACAGCCACTGCCACAACAAAGAATTATGCAGTCCACAATGTCCAT



AGTGCTAAGGAGAAGAAACCCTGTCCTAGGGTCAAGGTTGGATGCTGAC



AAGACTAATTTCAAGTGATTTCAAACGATTGTGGGGAAAATACCCAGAA



GTTGACAGGAGCAATATGTAGATATTTCCTTCTGTAAATTATATTTCCCTA



ATGCTAATGCTGGAGCATTTGGAAACACGTGTATAAATATTTGAAGCACA



AAACCAGAGAGATGCTCTTATGTATTTAGCTTCTAGTTTTAAATAATAGA



TACTACAATAAACCTTGATTAAACCTTGATTATCTG





NFYA->
GAACGGTCACTGCGCAGGATCAAGCTACAATGAAGAGGAGGGAGGCAGT
SEQ ID


TDRG1
CTGCGCGCACCGCCATTTTCTAGGAACTGGGAAGCCCCCCCACCCCTTAG
NO: 19



GAAGATCCATCCCTGTGGAACCTTGCCCAGGCTTACCAGCCTTTGCTGAG



GTTGATCTATTGTCCCTCCTTGTCCCCATCAAAATATCCAGCACTCCACCT



TCAGGGAGTAGACTTGACCCTCAAATAGCAAGTTCAGCCTTCCCAGGTCT



AGGTTCCCTGGGAGGTCAAGATTCGTCTGGTTCCTTAGTACAGAGGGCTA



GCTGTGAGTTGGAATCCCCCTATGAGCTTTAGAATCAGTCAAGAGGAATT



GGGCCCCTTCCCTTCATCCCTCTTCTTTTCCCTTTTTGTCCCAGAGCTCAGC



TCTGACTCAAAAGTTTTTCCATTTACCATCAACATGGAAACTTGGCTCCTC



ACGTAGGTATATTATCCCCCTTTTGTACATGGTCTTGTTGATCCAAACTCC



CTTTCTGTGAAAGAGGCCTGTGGGGCTCAAGAAGCCTGGTCAGCCAGCCA



GGCTAGTCCCACATACCTCAGAACCAGTTTAATAAAGGCTCTATGTCATT



CTTTTTTG





RIMS2->
ATGTCGGCTCCTGTCGGGCCCCGGGGCCGCCTGGCTCCCATCCCGGCGGC
SEQ ID


DPYS
CTCTCAGCCGCCTCTGCAGCCCGAGATGCCTGACCTCAGCCACCTCACGG
NO: 20



AGGAGGAGAGGAAAATCATCCTGGCCGTCATGGATAGGCAGAAGAAAGA



AGAGGAGAAGGAGCAGTCCGTGCTCAAAAAACTGCATCAGCAGTTTGAA



ATGTATAAAGAGCAGGTAAAGAAGATGGGAGAAGAATCACAGCAACAG



CAAGAACAGAAGGGTGATGCGCCAACCTGTGGTATCTGCCACAAAACAA



AGTTTGCTGATGGATGTGGCCATAACTGTTCATATTGCCAAACAAAGTTC



TGTGCTCGTTGTGGAGGTCGAGTGTCATTACGCTCAAACAAGCATAGTGG



TAAAATGGATGAAAACAGATTTGTGGCAGTTACCAGCACAAATGCAGCC



AAAATTTTTAATCTCTATCCAAGAAAAGGAAGAATAGCTGTAGGATCAGA



TGCTGACATTGTTATTTGGGACCCAAAAGGCACAAGGACTATCTCAGCAA



AAACTCATCATCAGGCTGTTAACTTCAACATTTTCGAGGGCATGGTTTGC



CACGGGGTGCCCCTTGTGACTATTTCAAGAGGCAAAGTGGTATATGAAGC



CGGAGTGTTCAGTGTCACGGCAGGAGATGGGAAGTTTATTCCTCGAAAAC



CATTTGCTGAATATATTTACAAACGAATAAAGCAGCGAGACCGGACTTGC



ACACCTACCCCTGTGGAGCGTGCACCCTATAAGGGAGAAGTCGCCACACT



GAAATCCAGAGTGACAAAAGAAGATGCCACAGCAGGGACCAGGAAACA



GGCCCACCCCTGA





B3GALNT2
ATGCGAAACTGGCTGGTGCTGCTGTGCCCGTGTGTGCTCGGGGCCGCGCT
SEQ ID


->
GCACCTCTGGCTGCGGCTGCGCTCCCCGCCGCCCGCCTGCGCCTCCGGGG
NO: 21


TOMM20
CCGGCCCTGCAGATCAGTTGGCCTTATTTCCTCAGTGGAAATCTACTCACT



ATGATGTGGTAGTTGGCGTGTTGTCAGCTCGCAATAACCATGAACTTCGA



AACGTGATAAGAAGCACCTGGATGAGACATTTGCTACAGCATCCCACATT



AAGTCAACGTGTGCTTGTGAAGTTCATAATAGGTGCTCATGGCTGTGAAG



TGCCTGTGGAAGACAGGGAAGATCCTTATTCCTGTAAACTACTCAACATC



ACAAATCCAGTTTTGAATCAGGAAATTGAAGCGTTCAGTCTGTCCGAAGA



CACTTCATCGGGGCTGCCTGAGGATCGAGTTGTCAGCGTGAGTTTCCGAG



TTCTCTACCCCATCGTTATTACCAGTCTTGGAGTGTTCTACGATGCCAATG



ATGTGGGTTTCCAGAGGAACATCACTGTCAAACTTTATCAGGCAGAACAA



GAGGAGGCCCTCTTCATTGCTCGCTTCAGTCCTCCAAGCTGTGGTGTGCA



GGTGAACAAGCTGTGGTACAAGCCCGTGGAACAATTCATCTTACCAGAG



AGCTTTGAAGGTACAATCGTGTGGGAGAGCCAAGACCTCCACGGCCTTGT



GTCAAGAAATCTCCACAAAGTGACAGTGAATGATGGAGGGGGAGTTCTC



AGAGTCATTACAGCTGGGGAGGGTGCATTGCCTCATGAATTCTTGGAAGG



TGTGGAGGGAGTTGCAGGTGGTTTTATATATACTATTCAGGGTGAATATG



AGAAGGGCGTAGACCATCTGACAAATGCAATTGCTGTGTGTGGACAGCC



ACAGCAGTTACTGCAGGTCTTACAGCAAACTCTTCCACCACCAGTGTTCC



AGATGCTTCTGACTAAGCTCCCAACAATTAGTCAGAGAATTGTAAGTGCT



CAGAGCTTGGCTGAAGATGATGTGGAATGA





C1orf109->
ATGACTCAAGACCGGCCTCTGCTTGCCGTGCAGGAGGCGCTGAAGAAGT
SEQ ID


MACF1
GCTTCCCCGTGGTGGAGGAGCAGCAGGGCCTGTGGCAGAGTGCCCTGCG
NO: 22



GGACTGCCAGCCCCTCCTGTCCTCCCTCAGCAACCTGGCGGAACAGCTGC



AGGCCGCACAGAACCTGCGGTTTGAGGATGTGCCGGCGCTTCGGGCCTTC



CCAGATTTAAAAGAGCGGCTGAGGCGTAAGCAGCTGGTGGCTGGTGACA



TCGTCCTGGACAAGCTAGGGGAAAGGCTGCTCTCAGAAAAAGAGAAGAA



ACAAATATCTGAGCAATTGAATGCCCTAAACAAGGCTTACCATGACCTTT



GTGATGGTTCTGCAAATCAGCTTCAGCAGCTTCAGAGCCAGTTGGCTCAC



CAGACAGAACAAAAGACCCTGCAGAAACAACAAAATACCTGTCACCAGC



AACTGGAGGATCTTTGCAGTTGGGTAGGACAGGCAGAAAGAGCACTGGC



AGGCCACCAAGGCAGAACCACCCAGCAGGATCTCTCTGCTTTGCAGAAG



AACCAAAGTGACTTGAAGGATTTACAGGATGACATTCAGAATCGTGCCAC



CTCATTTGCCACTGTTGTCAAGGACATTGAGGGGTTCATGGAAGAGAATC



AGACCAAGCTGAGCCCACGTGAGTTGACAGCTCTTCGGGAAAAGCTTCAT



CAGGCTAAGGAGCAATATGAGGCGCTCCAGGAAGAGACACGTGTGGCCC



AGAAGGAACTGGAGGAAGCAGTGACCTCCGCCTTACAGCAGGAGACTGA



AAAGAGTAAAGCAGCAAAGGAACTGGCAGAGAACAAGAAGAAGATCGA



TGCTCTCCTGGATTGGGTAACTTCAGTAGGATCATCTGGTGGACAGCTGC



TGACCAACCTTCCAGGAATGGAGCAGCTCTCGGGAGCTAGCTTGGAGAA



AGGAGCCTTGGACACCACTGATGGTTACATGGGGGTGAATCAAGCCCCA



GAGAAACTGGACAAGCAATGTGAGATGATGAAGGCCCGTCACCAAGAAT



TGCTGTCCCAGCAGCAAAATTTCATTCTGGCCACCCAGTCAGCTCAGGCC



TTCTTGGATCAGCATGGCCACAATCTCACACCTGAGGAGCAACAGATGCT



GCAACAGAAGCTGGGAGAGCTAAAGGAACAATACTCTACTTCCCTGGCC



CAATCAGAGGCAGAACTGAAGCAGGTGCAGACACTTCAGGATGAGTTGC



AGAAATTTCTGCAGGATCATAAAGAGTTTGAAAGCTGGTTGGAACGATCC



GAGAAAGAGCTGGAGAACATGCATAAGGGAGGCAGCAGCCCCGAGACC



CTTCCCTCCCTGCTAAAGCGGCAAGGAAGCTTCTCAGAGGATGTCATTTC



CCACAAAGGAGACTTGAGATTTGTGACTATCTCAGGACAGAAAGTCTTGG



ACATGGAAAACAGTTTTAAGGAAGGCAAAGAACCATCAGAAATTGGAAA



CTTAGTAAAGGACAAGTTGAAGGATGCAACAGAAAGATACACTGCTCTC



CACTCAAAGTGTACACGATTAGGATCTCACCTGAATATGCTGTTAGGCCA



GTATCATCAATTCCAAAACAGTGCTGACAGCCTGCAGGCCTGGATGCAGG



CTTGTGAGGCCAACGTGGAGAAGCTCCTCTCAGATACTGTTGCCTCTGAC



CCTGGAGTTCTCCAGGAGCAGCTTGCAACAACAAAGCAGTTGCAGGAGG



AATTGGCTGAGCACCAAGTACCTGTGGAAAAACTCCAAAAAGTAGCTCG



TGACATAATGGAAATTGAAGGGGAGCCAGCCCCAGACCACAGGCATGTT



CAAGAAACTACAGATTCCATACTCAGCCACTTCCAAAGCCTCTCCTATAG



CCTGGCTGAGCGATCTTCTCTGCTGCAGAAAGCAATTGCCCAATCTCAGA



GTGTCCAGGAAAGCCTGGAGAGCCTGTTGCAGTCTATTGGGGAAGTTGAA



CAAAACCTGGAAGGGAAGCAGGTGTCATCACTCTCATCAGGAGTCATCC



AGGAAGCCTTAGCCACAAATATGAAATTGAAGCAGGACATTGCTCGGCA



AAAGAGCAGCTTGGAGGCCACCCGTGAGATGGTGACCCGATTCATGGAG



ACAGCAGACAGTACTACAGCAGCAGTGCTGCAGGGCAAACTGGCAGAGG



TGAGCCAGCGGTTCGAACAGCTCTGTCTACAGCAGCAAGAAAAGGAGAG



CTCCCTAAAGAAGCTTCTACCCCAGGCAGAGATGTTTGAACACCTCTCTG



GTAAGCTGCAGCAGTTCATGGAAAACAAAAGTCGGATGCTGGCCTCTGG



AAATCAGCCAGATCAAGATATTACACATTTCTTCCAACAGATCCAGGAGC



TCAATTTGGAAATGGAAGACCAACAGGAGAACCTAGATACTCTTGAGCA



CCTGGTCACTGAACTGAGCTCTTGTGGCTTTGCGCTGGACTTGTGCCAGC



ATCAGGACAGGGTACAGAATCTAAGAAAAGACTTCACAGAGCTACAGAA



GACAGTTAAAGAGAGAGAGAAAGATGCATCATCTTGCCAGGAACAGTTG



GATGAATTCCGGAAGCTGGTCAGGACCTTCCAGAAATGGTTGAAAGAAA



CTGAAGGGAGTATTCCACCTACGGAAACTTCTATGAGTGCTAAAGAGTTA



GAAAAGCAGATTGAACACCTGAAGAGTCTACTAGATGACTGGGCAAGTA



AGGGAACTCTGGTGGAAGAAATCAATTGCAAAGGTACTTCTTTAGAAAAT



CTCATCATGGAAATCACAGCACCTGATTCCCAAGGCAAGACAGGTTCCAT



ACTGCCCTCTGTAGGAAGCTCTGTAGGCAGTGTAAACGGATACCACACCT



GCAAAGATCTGACGGAGATCCAGTGTGACATGTCAGATGTAAACTTGAA



GTATGAGAAACTAGGGGGAGTACTTCATGAACGCCAGGAAAGCCTTCAG



GCTATCCTCAACAGAATGGAGGAGGTTCACAAGGAGGCAAACTCTGTGC



TGCAGTGGCTGGAATCAAAAGAGGAAGTCCTGAAATCCATGGATGCCAT



GTCATCTCCAACCAAGACAGAAACAGTGAAAGCCCAAGCTGAATCTAAC



AAGGCCTTCCTGGCTGAGTTGGAACAGAATTCTCCAAAAATTCAAAAAGT



AAAGGAAGCCCTGGCTGGATTACTGGTGACATATCCCAACTCACAGGAA



GCAGAAAATTGGAAGAAAATTCAGGAAGAACTCAATTCCCGATGGGAAA



GGGCCACTGAGGTTACTGTGGCTCGGCAAAGGCAGCTAGAGGAATCTGC



AAGTCATCTGGCCTGCTTCCAGGCTGCAGAATCCCAGCTCCGGCCGTGGC



TGATGGAGAAAGAACTGATGATGGGAGTGCTGGGGCCCCTGTCTATTGAC



CCCAACATGTTGAATGCACAAAAGCAACAGGTCCAGTTTATGCTAAAGG



AATTTGAAGCACGCAGGCAACAGCATGAGCAACTGAATGAGGCAGCTCA



GGGCATCCTAACAGGCCCTGGAGATGTCTCTCTGTCCACCAGCCAAGTAC



AGAAAGAACTCCAGAGCATCAATCAGAAATGGGTTGAGCTGACTGACAA



ACTCAACTCCCGTTCCAGCCAAATTGACCAAGCTATTGTTAAGAGCACCC



AGTACCAGGAACTGCTCCAGGACTTATCAGAGAAGGTGAGGGCAGTTGG



ACAACGGCTGAGTGTCCAGTCAGCTATCAGCACCCAACCAGAGGCTGTA



AAGCAGCAATTGGAAGAGACCAGTGAAATTCGATCTGACTTGGAGCAGT



TAGACCACGAGGTTAAGGAGGCTCAGACACTGTGCGATGAACTCTCAGT



GCTCATTGGTGAGCAGTACCTCAAGGATGAACTGAAGAAGCGTTTGGAG



ACAGTTGCCCTGCCTCTCCAAGGTTTAGAAGACCTTGCAGCCGATCGCAT



TAACAGACTCCAGGCAGCTCTTGCCAGCACCCAGCAGTTCCAGCAAATGT



TTGATGAGTTGAGGACCTGGTTGGATGATAAACAAAGCCAGCAAGCAAA



AAACTGCCCAATTTCTGCAAAATTGGAGCGGCTACAGTCTCAGCTACAGG



AGAATGAAGAGTTTCAGAAAAGTCTTAATCAACACAGTGGCTCCTATGAG



GTGATTGTGGCTGAAGGGGAATCTCTACTTCTTTCTGTACCTCCTGGAGA



AGAGAAAAGGACTCTACAAAACCAGTTGGTTGAGCTCAAAAACCATTGG



GAAGAGCTTAGTAAAAAAACTGCAGACAGACAATCCAGGCTCAAGGATT



GTATGCAGAAAGCTCAGAAATATCAGTGGCATGTGGAAGACCTTGTGCC



ATGGATAGAAGATTGTAAAGCTAAGATGTCTGAGTTGCGAGTCACTCTGG



ATCCAGTGCAGCTAGAGTCCAGTCTCCTAAGATCAAAGGCTATGCTGAAT



GAGGTGGAGAAGCGCCGCTCCCTGCTGGAAATATTGAATAGTGCTGCTGA



CATTCTGATCAATTCTTCAGAAGCAGATGAGGATGGAATCCGGGATGAGA



AGGCTGGGATCAACCAGAACATGGATGCTGTTACAGAAGAGCTGCAGGC



CAAAACAGGGTCACTCGAAGAAATGACTCAGAGGCTCAGGGAGTTCCAG



GAAAGCTTTAAGAATATTGAAAAGAAGGTTGAAGGAGCCAAACACCAAC



TTGAGATCTTTGATGCTCTGGGTTCTCAAGCCTGTAGCAACAAGAACCTG



GAGAAGCTAAGAGCTCAACAGGAAGTGCTGCAGGCCCTAGAGCCTCAGG



TAGACTATCTGAGGAACTTTACTCAGGGTCTGGTAGAAGATGCCCCAGAT



GGATCTGATGCTTCTCAACTTCTCCACCAAGCTGAGGTCGCCCAGCAAGA



GTTCCTCGAAGTTAAGCAAAGAGTGAACAGTGGTTGTGTGATGATGGAA



AACAAGCTGGAGGGGATTGGCCAGTTTCACTGCCGGGTCCGAGAGATGTT



CTCTCAATTGGCAGACCTGGATGATGAGCTAGATGGCATGGGTGCTATTG



GCAGAGACACTGATAGCCTCCAGTCCCAAATCGAGGATGTCCGGCTATTC



CTTAACAAAATTCACGTCCTCAAATTAGACATAGAGGCCTCTGAAGCAGA



GTGTCGACATATGCTAGAAGAAGAGGGGACTCTGGATTTGTTAGGTCTCA



AAAGGGAGCTAGAAGCCCTGAACAAACAGTGTGGCAAACTGACAGAGAG



GGGGAAAGCTCGTCAGGAACAGCTGGAACTGACACTAGGCCGTGTAGAG



GACTTCTACAGGAAATTGAAAGGACTCAATGACGCGACCACAGCAGCAG



AGGAGGCAGAGGCCCTCCAGTGGGTAGTGGGGACCGAAGTGGAAATCAT



CAACCAACAATTAGCAGATTTTAAAATGTTTCAGAAAGAACAAGTGGATC



CTCTTCAGATGAAATTGCAGCAGGTGAATGGACTTGGCCAGGGATTAATT



CAGAGTGCAGGAAAAGACTGTGATGTACAGGGTTTAGAACATGACATGG



AAGAGATCAATGCTCGATGGAATACATTGAATAAAAAGGTCGCACAAAG



AATTGCACAGCTACAGGAAGCTTTGTTGCATTGTGGGAAGTTTCAAGATG



CCTTGGAGCCATTGCTCAGCTGGTTGGCAGATACCGAGGAGCTCATAGCC



AATCAGAAACCTCCATCTGCTGAGTATAAAGTGGTGAAAGCACAGATCC



AAGAACAGAAGTTGCTCCAGCGGCTCCTAGATGATCGAAAGGCCACAGT



AGACATGCTTCAAGCAGAAGGAGGCAGAATAGCCCAGTCAGCAGAGCTG



GCTGATAGAGAGAAAATCACTGGACAGCTGGAGAGTCTTGAAAGTAGAT



GGACTGAACTACTCAGTAAGGCAGCAGCCAGGCAAAAACAGCTGGAAGA



CATCCTGGTTCTGGCCAAACAGTTCCATGAGACAGCTGAGCCTATTTCTG



ACTTCTTATCTGTCACAGAGAAAAAGCTTGCTAACTCAGAACCTGTTGGC



ACTCAGACTGCCAAAATACAGCAGCAGATCATTCGGCACAAGGCTCTGG



AAGAAGACATAGAAAACCATGCAACAGATGTGCACCAGGCAGTCAAAAT



TGGGCAGTCCCTCTCCTCCCTGACATCTCCTGCAGAACAGGGTGTGCTGT



CAGAAAAGATAGACTCATTGCAGGCCCGATACAGTGAAATTCAAGACCG



CTGTTGTCGGAAGGCAGCCCTACTTGACCAAGCTCTGTCTAATGCTAGGC



TGTTTGGGGAGGATGAGGTGGAGGTGCTCAACTGGCTGGCTGAGGTTGA



GGACAAGCTCAGTTCAGTGTTCGTAAAGGATTTCAAACAGGATGTCCTGC



ACAGGCAGCATGCTGACCACCTGGCTTTAAATGAAGAAATTGTTAATAGA



AAGAAGAATGTAGATCAAGCTATTAAAAATGGTCAGGCTCTTCTAAAAC



AAACCACAGGTGAGGAGGTGTTACTTATCCAGGAAAAACTAGATGGTAT



AAAGACTCGTTACGCAGACATCACAGTTACTAGCTCCAAGGCCCTCAGAA



CTTTAGAGCAAGCCCGGCAGCTGGCCACCAAGTTCCAGTCTACTTATGAG



GAACTGACCGGGTGGCTGAGGGAGGTGGAGGAGGAGCTGGCAACCAGTG



GAGGACAGTCTCCCACAGGGGAACAGATACCCCAGTTTCAGCAGAGACA



GAAGGAATTAAAGAAGGAGGTCATGGAGCACAGGCTGGTGTTGGACACA



GTGAATGAGGTGAGCCGTGCTCTCTTAGAGCTGGTGCCCTGGAGAGCCAG



AGAAGGGCTGGATAAACTTGTGTCCGATGCTAACGAGCAGTACAAACTA



GTCAGTGACACTATTGGACAAAGGGTGGATGAAATTGATGCTGCTATTCA



GAGATCACAACAGTATGAGCAAGCTGCCGATGCAGAACTAGCTTGGGTT



GCTGAAACAAAACGGAAACTGATGGCTCTGGGTCCAATTCGCCTGGAAC



AGGACCAGACCACAGCTCAGCTTCAGGTACAGAAGGCTTTCTCCATTGAC



ATTATTCGACACAAAGATTCAATGGATGAACTCTTCAGTCACCGTAGTGA



AATCTTTGGCACATGTGGGGAGGAGCAAAAAACTGTATTACAGGAAAAG



ACAGAGTCTCTAATACAGCAATATGAAGCCATTAGCCTACTCAATTCAGA



GCGTTATGCCCGCCTAGAGCGGGCCCAGGTCTTAGTAAACCAGTTTTGGG



AAACTTATGAAGAGCTCAGCCCCTGGATTGAGGAAACTCGGGCACTAAT



AGCACAGTTACCCTCTCCAGCCATTGATCATGAGCAGCTCAGGCAGCAAC



AAGAGGAAATGAGGCAATTAAGGGAATCTATTGCTGAACACAAACCTCA



TATTGACAAACTACTAAAGATAGGCCCACAACTAAAGGAATTAAACCCT



GAGGAAGGGGAAATGGTGGAAGAAAAATACCAGAAAGCAGAAAACATG



TATGCCCAAATAAAGGAGGAGGTGCGCCAGCGAGCCCTGGCTCTGGATG



AAGCCGTGTCCCAGTCCACACAGATTACAGAGTTTCATGATAAAATTGAG



CCTATGTTGGAGACACTGGAGAATCTTTCCTCTCGCCTGCGTATGCCACC



ACTGATCCCTGCTGAAGTAGACAAGATCAGAGAGTGCATCAGTGACAAT



AAGAGTGCCACCGTGGAGCTAGAAAAACTGCAGCCATCCTTTGAGGCCTT



GAAGCGCCGTGGAGAGGAGCTTATTGGACGATCTCAGGGAGCAGACAAG



GATCTGGCTGCAAAAGAAATCCAGGATAAATTGGATCAAATGGTATTCTT



CTGGGAGGACATCAAAGCTCGGGCTGAAGAACGAGAAATCAAATTTCTT



GATGTCCTTGAATTAGCAGAGAAGTTCTGGTATGACATGGCAGCTCTCCT



GACCACCATCAAAGACACCCAGGATATTGTCCATGACTTGGAAAGCCCA



GGCATTGATCCTTCCATCATCAAACAACAGGTTGAAGCTGCTGAGACTAT



TAAGGAAGAGACAGATGGTCTGCATGAAGAGCTGGAGTTTATTCGGATC



CTTGGAGCAGATTTGATTTTTGCCTGTGGAGAAACTGAGAAGCCTGAAGT



GAGGAAGAGCATTGATGAGATGAATAATGCTTGGGAGAACTTAAACAAA



ACATGGAAAGAGAGGCTAGAAAAACTTGAGGATGCTATGCAAGCTGCTG



TGCAGTATCAGGACACTCTTCAGGCTATGTTTGACTGGCTAGATAACACT



GTGATTAAACTCTGCACCATGCCCCCTGTTGGCACTGACCTCAATACTGTT



AAAGATCAGTTAAATGAAATGAAGGAGTTCAAAGTAGAAGTTTACCAAC



AGCAAATTGAGATGGAGAAGCTTAATCACCAGGGTGAACTGATGTTAAA



GAAAGCTACTGATGAGACGGACAGAGACATTATACGAGAACCACTGACA



GAACTCAAACACCTCTGGGAGAACCTGGGTGAGAAAATTGCCCACCGAC



AGCACAAACTAGAAGGGGCTCTGTTGGCCCTTGGTCAGTTCCAGCATGCC



TTAGAGGAACTAATGAGTTGGCTGACTCATACCGAAGAGTTGTTAGATGC



TCAGAGACCAATAAGTGGAGACCCAAAAGTCATTGAAGTTGAGCTCGCA



AAGCACCATGTCCTAAAAAATGATGTTTTGGCTCATCAAGCCACAGTGGA



AACAGTCAACAAAGCTGGCAATGAGCTTCTTGAATCCAGTGCTGGAGATG



ATGCCAGCAGCTTAAGGAGCCGTTTGGAAGCCATGAACCAATGCTGGGA



GTCAGTGTTACAGAAAACAGAGGAGAGGGAGCAGCAGCTTCAGTCAACT



CTGCAGCAGGCCCAGGGCTTCCACAGTGAAATTGAAGATTTCCTCTTGGA



ACTTACTAGAATGGAGAGCCAGCTTTCTGCATCTAAGCCCACAGGAGGAC



TTCCTGAAACTGCTAGGGAACAGCTTGATACACATATGGAACTCTATTCC



CAGCTGAAAGCCAAGGAAGAGACTTATAATCAACTACTTGACAAGGGCA



GACTCATGCTTCTAAGCCGTGACGACTCTGGGTCTGGCTCCAAGACAGAA



CAGAGTGTAGCACTTTTGGAGCAGAAGTGGCATGTGGTCAGCAGTAAGA



TGGAAGAAAGAAAGTCAAAGCTGGAAGAGGCCCTCAACTTGGCAACAGA



ATTCCAGAATTCCCTACAAGAATTTATCAACTGGCTCACTCTAGCAGAGC



AGAGTTTAAACATCGCTTCTCCACCAAGCCTGATTCTAAATACTGTCCTTT



CCCAGATAGAAGAGCACAAGGTTTTTGCTAATGAAGTAAATGCTCATCGA



GACCAGATCATTGAGCTGGATCAAACTGGGAATCAATTAAAGTTCCTTAG



CCAAAAGCAGGATGTTGTTCTGATCAAGAATTTGTTGGTGAGCGTGCAGT



CTCGATGGGAGAAGGTTGTCCAGCGATCTATTGAAAGAGGGCGATCACT



AGATGATGCCAGGAAGCGGGCAAAACAATTCCATGAAGCTTGGAAAAAA



CTGATTGACTGGCTAGAAGATGCAGAGAGTCACCTGGACTCAGAACTAG



AGATATCCAATGACCCAGACAAAATTAAACTTCAGCTTTCTAAGCATAAG



GAGTTTCAGAAGACTCTTGGTGGCAAGCAGCCTGTGTATGATACCACAAT



TAGAACTGGCAGAGCACTGAAAGAAAAGACTTTGCTTCCCGAAGATAGT



CAGAAACTTGACAATTTCCTAGGAGAAGTCAGAGACAAATGGGATACTG



TTTGTGGCAAGTCTGTGGAGCGGCAGCACAAGTTGGAGGAAGCCCTGCTC



TTTTCGGGTCAGTTCATGGATGCTTTGCAGGCATTGGTTGACTGGTTATAC



AAGGTGGAGCCACAGCTGGCTGAGGACCAGCCCGTGCACGGGGACCTTG



ACCTCGTCATGAACCTCATGGATGCACACAAGGTTTTCCAGAAGGAACTG



GGAAAGCGAACAGGAACCGTTCAGGTCCTGAAGCGGTCAGGCCGAGAGC



TGATTGAGAATAGTCGAGATGACACCACTTGGGTAAAAGGACAGCTCCA



GGAACTGAGCACTCGCTGGGACACTGTCTGTAAACTCTCTGTTTCCAAAC



AAAGCCGGCTTGAGCAGGCCTTAAAACAAGCGGAAGTGTTTCGAGACAC



AGTCCACATGCTGTTGGAGTGGCTTTCTGAAGCAGAGCAAACGCTTCGCT



TTCGGGGAGCACTTCCTGATGACACAGAGGCCCTGCAGTCTCTCATTGAC



ACCCATAAGGAATTCATGAAGAAAGTAGAAGAAAAGCGAGTGGACGTTA



ACTCAGCAGTAGCCATGGGAGAAGTCATCCTGGCTGTCTGCCACCCCGAT



TGCATCACAACCATCAAACACTGGATCACCATCATCCGAGCTCGCTTCGA



GGAGGTCCTGACATGGGCTAAGCAGCACCAGCAGCGTCTTGAAACGGCC



TTGTCAGAACTGGTGGCTAATGCTGAGCTCCTGGAAGAACTTCTGGCATG



GATCCAGTGGGCTGAGACCACCCTCATTCAGCGGGATCAGGAGCCAATCC



CGCAGAACATTGACCGAGTTAAAGCCCTTATCGCTGAGCATCAGACATTT



ATGGAGGAGATGACTCGCAAACAGCCTGACGTGGACCGGGTCACCAAGA



CATACAAAAGGAAAAACATAGAGCCTACTCACGCGCCTTTCATAGAGAA



ATCCCGCAGCGGAGGCAGGAAATCCCTAAGTCAGCCAACCCCTCCTCCCA



TGCCAATCCTTTCACAGTCTGAAGCAAAAAACCCACGGATCAACCAGCTT



TCTGCCCGCTGGCAGCAGGTGTGGCTGTTAGCACTGGAGCGGCAAAGGA



AACTGAATGATGCCTTGGATCGGCTGGAGGAGTTGAAAGAATTTGCCAAC



TTTGACTTTGATGTCTGGAGGAAAAAGTATATGCGTTGGATGAATCACAA



AAAGTCTCGAGTGATGGATTTCTTCCGGCGCATTGATAAGGACCAGGATG



GGAAGATAACACGTCAGGAGTTTATCGATGGCATTTTAGCATCCAAGTTC



CCCACCACCAAGTTAGAGATGACTGCTGTGGCTGACATTTTCGACCGAGA



TGGGGATGGTTACATTGATTATTATGAATTTGTGGCTGCTCTTCATCCCAA



CAAGGATGCGTATCGACCAACAACCGATGCAGATAAAATCGAAGATGAG



GTTACAAGACAAGTGGCTCAGTGCAAATGTGCAAAAAGGTTTCAGGTGG



AGCAGATCGGAGAGAATAAATACCGGTTTGGGGATTCTCAGCAGTTGCG



GCTGGTCCGTATTCTGCGCAGCACCGTGATGGTTCGCGTTGGTGGAGGAT



GGATGGCCTTGGATGAATTTTTAGTGAAAAATGATCCCTGCCGAGCACGA



GGTAGAACTAACATTGAACTTAGAGAGAAATTCATCCTACCAGAGGGAG



CATCCCAGGGAATGACCCCCTTCCGCTCACGGGGTCGAAGGTCCAAACCA



TCTTCCCGGGCAGCTTCCCCTACTCGTTCCAGCTCCAGTGCTAGTCAGAGT



AACCACAGCTGTACATCCATGCCATCTTCTCCAGCCACCCCAGCCAGTGG



AACCAAGGTTATCCCATCATCAGGTAGCAAGTTGAAACGACCAACACCA



ACTTTTCATTCTAGTCGGACATCCCTTGCTGGTGATACCAGCAATAGTTCT



TCCCCGGCCTCCACAGGTGCCAAAACTAATCGGGCAGACCCTAAAAAGTC



TGCCAGTCGCCCTGGGAGTCGGGCTGGGAGTCGAGCCGGGAGTCGAGCC



AGCAGCCGGCGAGGAAGTGACGCTTCTGACTTTGACCTCTTAGAGACGCA



GTCTGCTTGTTCCGACACTTCAGAAAGCAGCGCTGCAGGGGGCCAAGGCA



ACTCCAGGAGAGGGCTAAACAAACCTTCCAAAATCCCAACCATGTCTAA



GAAGACCACCACTGCCTCCCCCAGGACTCCAGGTCCCAAGCGATAA





PGAP3->
ATGGCCGGCCTGGCGGCGCGGTTGGTCCTGCTAGCTGGGGCAGCGGCGCT
SEQ ID


CACNB1
GGCGAGCGGCTCCCAGGGCGACCGTGAGCCGGTGTACCGCGACTGCGTA
NO: 23



CTGCAGTGCGAAGAGCAGAACTGCTCTGGGGGCGCTCTGAATCACTTCCG



CTCCCGCCAGCCAATCTACATGAGTCTAGCAGGCTGGACCTGTCGGGACG



ACTGTAAGTATGAGTGTATGTGGGTCACCGTTGGGCTCTACCTCCAGGAA



GGTCACAAAGTGCCTCAGTTCCATGGCAAGTGGCCCTTCTCCCGGTTCCT



GTTCTTTCAAGAGCCGGCATCGGCCGTGGCCTCGTTTCTCAATGGCCTGG



CCAGCCTGGTGATGCTCTGCCGCTACCGCACCTTCGTGCCAGCCTCCTCCC



CCATGTACCACACCTGTGTGGCCTTCGCCTGGGAAATGTTTGACATCATC



CTGGATGAGAACCAATTGGAGGATGCCTGCGAGCATCTGGCGGAGTACTT



GGAAGCCTATTGGAAGGCCACACACCCGCCCAGCAGCACGCCACCCAAT



CCGCTGCTGAACCGCACCATGGCTACCGCAGCCCTGGCTGCCAGCCCTGC



CCCTGTCTCCAACCTCCAGGGACCCTACCTTGCTTCCGGGGACCAGCCAC



TGGAACGGGCCACCGGGGAGCACGCCAGCATGCACGAGTACCCAGGGGA



GCTGGGCCAGCCCCCAGGCCTTTACCCCAGCAGCCACCCACCAGGCCGGG



CAGGCACGCTACGGGCACTGTCCCGCCAAGACACTTTTGATGCCGACACC



CCCGGCAGCCGAAACTCTGCCTACACGGAGCTGGGAGACTCATGTGTGG



ACATGGAGACTGACCCCTCAGAGGGGCCAGGGCTTGGAGACCCTGCAGG



GGGCGGCACGCCCCCAGCCCGACAGGGATCCTGGGAGGACGAGGAAGAA



GACTATGAGGAAGAGCTGACCGACAACCGGAACCGGGGCCGGAATAAGG



CCCGCTACTGCGCTGAGGGTGGGGGTCCAGTTTTGGGGCGCAACAAGAAT



GAGCTGGAGGGCTGGGGACGAGGCGTCTACATTCGCTGA





STAU1->
CACTTCCTGCCGGGCTGCGGGCGCCTGAGCGGCTCTTCAGCGTTTGCGCC
SEQ ID


TOP1
GGCGGCTGCCGCGTCTCTCTCGGCTCCCGCTTCCTTTGACCGCCTCCCCCC
NO: 24



CCCGGCCCGGCGGCGCCCGCCTCCTCCACGGCCACTCCGCCTCTTCCCTC



CCTTCGTCCCTTCTTCCTCTCCCTTTTTTCCTTCTTCCTTCCCCTCCTCGCCG



CCACCGCCCAGGACCGCCGGCCGGGGGACGAGCTCGGAGCAGCAGCCAG



GTGGAGTTTTGCTCTTGTCGCCCAGGCTGGAGTGCAGTGGCGTGATCTCG



GCTCACTGCAACCTCCACCTCCCAGGTTCAAGCGATTTTCCCACTTCAGCC



TCCCGATAAGCTGAGATTACAGAGTTTATTAACCACTTAACCTCTCAGAA



CTGAACAAAGACAACATTGTTCCTGGAACGCCCTCTTTTTAAAAAAGATT



CTCATAAACACAAAGATAAACACAAAGATCGAGAACACCGGCACAAAGA



ACACAAGAAGGAGAAGGACCGGGAAAAGTCCAAGCATAGCAACAGTGA



ACATAAAGATTCTGAAAAGAAACACAAAGAGAAGGAGAAGACCAAACA



CAAAGATGGAAGCTCAGAAAAGCATAAAGACAAACATAAAGACAGAGA



CAAGGAAAAACGAAAAGAGGAAAAGGTTCGAGCCTCTGGGGATGCAAA



AATAAAGAAGGAGAAGGAAAATGGCTTCTCTAGTCCACCACAAATTAAA



GATGAACCTGAAGATGATGGCTATTTTGTTCCTCCTAAAGAGGATATAAA



GCCATTAAAGAGACCTCGAGATGAGGATGATGCTGATTATAAACCTAAG



AAAATTAAAACAGAAGATACCAAGAAGGAGAAGAAAAGAAAACTAGAA



GAAGAAGAGGATGGTAAATTGAAAAAACCCAAGAATAAAGATAAAGAT



AAAAAAGTTCCTGAGCCAGATAACAAGAAAAAGAAGCCGAAGAAAGAA



GAGGAACAGAAGTGGAAATGGTGGGAAGAAGAGCGCTATCCTGAAGGC



ATCAAGTGGAAATTCCTAGAACATAAAGGTCCAGTATTTGCCCCACCATA



TGAGCCTCTTCCAGAGAATGTCAAGTTTTATTATGATGGTAAAGTCATGA



AGCTGAGCCCCAAAGCAGAGGAAGTAGCTACGTTCTTTGCAAAAATGCTC



GACCATGAATATACTACCAAGGAAATATTTAGGAAAAATTTCTTTAAAGA



CTGGAGAAAGGAAATGACTAATGAAGAGAAGAATATTATCACCAACCTA



AGCAAATGTGATTTTACCCAGATGAGCCAGTATTTCAAAGCCCAGACGGA



AGCTCGGAAACAGATGAGCAAGGAAGAGAAACTGAAAATCAAAGAGGA



GAATGAAAAATTACTGAAAGAATATGGATTCTGTATTATGGATAACCACA



AAGAGAGGATTGCTAACTTCAAGATAGAGCCTCCTGGACTTTTCCGTGGC



CGCGGCAACCACCCCAAGATGGGCATGCTGAAGAGACGAATCATGCCCG



AGGATATAATCATCAACTGTAGCAAAGATGCCAAGGTTCCTTCTCCTCCT



CCAGGACATAAGTGGAAAGAAGTCCGGCATGATAACAAGGTTACTTGGC



TGGTTTCCTGGACAGAGAACATCCAAGGTTCCATTAAATACATCATGCTT



AACCCTAGTTCACGAATCAAGGGTGAGAAGGACTGGCAGAAATACGAGA



CTGCTCGGCGGCTGAAAAAATGTGTGGACAAGATCCGGAACCAGTATCG



AGAAGACTGGAAGTCCAAAGAGATGAAAGTCCGGCAGAGAGCTGTAGCC



CTGTACTTCATCGACAAGCTTGCTCTGAGAGCAGGCAATGAAAAGGAGG



AAGGAGAAACAGCGGACACTGTGGGCTGCTGCTCACTTCGTGTGGAGCA



CATCAATCTACACCCAGAGTTGGATGGTCAGGAATATGTGGTAGAGTTTG



ACTTCCTCGGGAAGGACTCCATCAGATACTATAACAAGGTCCCTGTTGAG



AAACGAGTTTTTAAGAACCTACAACTATTTATGGAGAACAAGCAGCCCGA



GGATGATCTTTTTGATAGACTCAATACTGGTATTCTGAATAAGCATCTTCA



GGATCTCATGGAGGGCTTGACAGCCAAGGTATTCCGTACATACAATGCCT



CCATCACGCTACAGCAGCAGCTAAAAGAACTGACAGCCCCGGATGAGAA



CATCCCAGCGAAGATCCTTTCTTATAACCGTGCCAATCGAGCTGTTGCAA



TTCTTTGTAACCATCAGAGGGCACCACCAAAAACTTTTGAGAAGTCTATG



ATGAACTTGCAAACTAAGATTGATGCCAAGAAGGAACAGCTAGCAGATG



CCCGGAGAGACCTGAAAAGTGCTAAGGCTGATGCCAAGGTCATGAAGGA



TGCAAAGACGAAGAAGGTAGTAGAGTCAAAGAAGAAGGCTGTTCAGAGA



CTGGAGGAACAGTTGATGAAGCTGGAAGTTCAAGCCACAGACCGAGAGG



AAAATAAACAGATTGCCCTGGGAACCTCCAAACTCAATTATCTGGACCCT



AGGATCACAGTGGCTTGGTGCAAGAAGTGGGGTGTCCCAATTGAGAAGA



TTTACAACAAAACCCAGCGGGAGAAGTTTGCCTGGGCCATTGACATGGCT



GATGAAGACTATGAGTTTTAGCCAGTCTCAAGAGGCAGAGTTCTGTGAAG



AGGAACAGTGTGGTTTGGGAAAGATGGATAAACTGAGCCTCACTTGCCCT



CGTGCCTGGGGGAGAGAGGCAGCAAGTCTTAACAAACCAACATCTTTGC



GAAAAGATAAACCTGGAGATATTATAAGGGAGAGCTGAGCCAGTTGTCC



TATGGACAACTTATTTAAAAATATTTCAGATATCAAAATTCTAGCTGTAT



GATTTGTTTTGAATTTTGTTTTTATTTTCAAGAGGGCAAGTGGATGGGAAT



TTGTCAGCGTTCTACCAGGCAAATTCACTGTTTCACTGAAATGTTTGGATT



CTCTTAGCTACTGTATGCAAAGTCCGATTATATTGGTGCGTTTTTACAGTT



AGGGTTTTGCAATAACTTCTATATTTTAATAGAAATAAATTCCTAAACTCC



CTTCCCTCTCTCCCATTTCAGGAATTTAAAATTAAGTAGAACAAAAAACC



CAGCGCACCTGTTAGAGTCGTCACTCTCTATTGTCATGGGGATCAATTTTC



ATTAAACTTGAAGCAGTCGTGGCTTTGGCAGTGTTTTGGTTCAGACACCT



GTTCACAGAAAAAGCATGATGGGAAAATATTTCCTGACTTGAGTGTTCCT



TTTTAAATGTGAATTTTTATTTCTTTTTAATTATTTTAAAATATTTAAACCT



TTTTCTTGATCTTAAAGATCGTGTAGATTGGGGTTGGGGAGGGATGAAGG



GCGAGTGAATCTAAGGATAATGAAATAATCAGTGACTGAAACCATTTTCC



CATCATCCTTTGTTCTGAGCATTCGCTGTACCCTTTAAGATATCCATCTTTT



TCTTTTTAACCCTAATCTTTCACTTGAAAGATTTTATTGTATAAAAAGTTT



CACAGGTCAATAAACTTAGAGGAAAATGAGTATTTGGTCCAAAAAAAGG



AAAAATAATCAAGATTTTAGGGCTTTTATTTTTTCTTTTGTAATTGTGTAA



AAAATGGAAAAAAACATAAAAAGCAGAATTTTAATGTGAAGACATTTTT



TGCTATAATCATTAGTTTTAGAGGCATTGTTAGTTTAGTGTGTGTGCAGAG



TCCATTTCCCACATCTTTCCTCAAGTATCTTCTATTTTTATCATGAATTCCC



TTTTAATCAACTGTAGGTTATTTAAAATAAATTCCTACAACTTAATGGAAA





FBXW7->
ATGAATCAGGAACTGCTCTCTGTGGGCAGCAAAAGACGACGAACTGGAG
SEQ ID


MLL3
GCTCTCTGAGAGGTAACCCTTCCTCAAGCCAGGTAGATGAAGAACAGATG
NO: 25



AATCGTGTGGTAGAGGAGGAACAGCAACAGCAACTCAGACAACAAGAGG



AGGAGCACACTGCAAGGAATGGTGAAGTTGTTGGAGTAGAACCTAGACC



TGGAGGCCAAAATGATTCCCAGCAAGGACAGTTGGAAGAAAACAATAAT



AGATTTATTTCGGTAGATGAGGACTCCTCAGGAAACCAAGAAGAACAAG



AGGAAGATGAAGAACATGCTGGTGAACAAGATGAGGAGGATGAGGAGG



AGGAGGAGATGGACCAGGAGAGTGACGATTTTGATCAGTCTGATGATAG



TAGCAGAGAAGATGAACATACACATACTAACAGTGTCACGAACTCCAGT



AGTATTGTGGACCTGCCCGTTCACCAACTCTCCTCCCCATTCTATACAAAA



ACAACAAAAACCTCGAAGTAGGGGGAAAACTGCAGTGGAAGATGAGGA



CAGCATGGATGGGCTGGAGACAACAGAAACAGAAACGATTGTGGAAACA



GAAATCAAAGAACAATCTGCAGAAGAGGATGCTGAAGCAGAAGTGGATA



ACAGCAAACAGCTAATTCCAACTCTTCAGCGATCTGTGTCTGAGGAATCG



GCAAACTCCCTGGTCTCTGTTGGTGTAGAAGCCAAAATCAGTGAACAGCT



CTGCGCTTTTTGTTACTGTGGGGAAAAAAGTTCCTTAGGACAAGGAGACT



TAAAACAATTCAGAATAACGCCTGGATTTATCTTGCCATGGAGAAACCAA



CCTTCTAACAAGAAGGACATTGATGACAACAGCAATGGAACCTATGAGA



AAATGCAAAACTCAGCACCACGAAAACAAAGAGGACAGAGAAAAGAAC



GATCTCCTCAGCAGAATATAGTATCTTGTGTAAGTGTAAGCACCCAGACA



GCTTCAGATGATCAAGCTGGTAAACTGTGGGATGAACTCAGTCTGGTTGG



GCTTCCAGATGCCATTGATATCCAAGCCTTATTTGATTCTACAGGCACTTG



TTGGGCTCATCACCGTTGTGTGGAGTGGTCACTAGGAGTATGCCAGATGG



AAGAACCATTGTTAGTGAACGTGGACAAAGCTGTTGTCTCAGGGAGCAC



AGAACGATGTGCATTTTGTAAGCACCTTGGAGCCACTATCAAATGCTGTG



AAGAGAAATGTACCCAGATGTATCATTATCCTTGTGCTGCAGGAGCCGGC



ACCTTTCAGGATTTCAGTCACATCTTCCTGCTTTGTCCAGAACACATTGAC



CAAGCTCCTGAAAGATCGAAGGAAGATGCAAACTGTGCAGTGTGCGACA



GCCCGGGAGACCTCTTAGATCAGTTCTTTTGTACTACTTGTGGTCAGCACT



ATCATGGAATGTGCCTGGATATAGCGGTTACTCCATTAAAACGTGCAGGT



TGGCAATGTCCTGAGTGCAAAGTGTGCCAGAACTGCAAACAATCGGGAG



AAGATAGCAAGATGCTAGTGTGTGATACGTGTGACAAAGGGTATCATACT



TTTTGTCTTCAACCAGTTATGAAATCAGTACCAACCAATGGCTGGAAATG



CAAAAATTGCAGAATATGTATAGAGTGTGGCACACGGTCTAGTTCTCAGT



GGCACCACAATTGCCTGATATGTGACAATTGTTACCAACAGCAGGATAAC



TTATGTCCCTTCTGTGGGAAGTGTTATCATCCAGAATTGCAGAAAGACAT



GCTTCATTGTAATATGTGCAAAAGGTGGGTTCACCTAGAGTGTGACAAAC



CAACAGATCATGAACTGGATACTCAGCTCAAAGAAGAGTATATCTGCATG



TATTGTAAACACCTGGGAGCTGAGATGGATCGTTTACAGCCAGGTGAGGA



AGTGGAGATAGCTGAGCTCACTACAGATTATAACAATGAAATGGAAGTT



GAAGGCCCTGAAGATCAAATGGTATTCTCAGAGCAGGCAGCTAATAAAG



ATGTCAACGGTCAGGAGTCCACTCCTGGAATTGTTCCAGATGCGGTTCAA



GTCCACACTGAAGAGCAACAGAAGAGTCATCCCTCAGAAAGTCTTGACA



CAGATAGTCTTCTTATTGCTGTATCATCCCAACATACAGTGAATACTGAAT



TGGAAAAACAGATTTCTAATGAAGTTGATAGTGAAGACCTGAAAATGTCT



TCTGAAGTGAAGCATATTTGTGGCGAAGATCAAATTGAAGATAAAATGG



AAGTGACAGAAAACATTGAAGTCGTTACACACCAGATCACTGTGCAGCA



AGAACAACTGCAGTTGTTAGAGGAACCTGAAACAGTGGTATCCAGAGAA



GAATCAAGGCCTCCAAAATTAGTCATGGAATCTGTCACTCTTCCACTAGA



AACCTTAGTGTCCCCACATGAGGAAAGTATTTCATTATGTCCTGAGGAAC



AGTTGGTTATAGAAAGGCTACAAGGAGAAAAGGAACAGAAAGAAAATTC



TGAACTTTCTACTGGATTGATGGACTCTGAAATGACTCCTACAATTGAGG



GTTGTGTGAAAGATGTTTCATACCAAGGAGGCAAATCTATAAAGTTATCA



TCTGAGACAGAGTCATCATTTTCATCATCAGCAGACATAAGCAAGGCAGA



TGTGTCTTCCTCCCCAACACCTTCTTCAGACTTGCCTTCGCATGACATGCT



GCATAATTACCCTTCAGCTCTTAGTTCCTCTGCTGGAAACATCATGCCAAC



AACTTACATCTCAGTCACTCCAAAAATTGGCATGGGTAAACCAGCTATTA



CTAAGAGAAAATTTTCTCCTGGTAGACCTCGGTCCAAACAGGGGGCTTGG



AGTACCCATAATACAGTGAGCCCACCTTCCTGGTCCCCAGACATTTCAGA



AGGTCGGGAAATTTTTAAACCCAGGCAGCTTCCTGGCAGTGCCATTTGGA



GCATCAAAGTGGGCCGTGGGTCTGGATTTCCAGGAAAGCGGAGACCTCG



AGGTGCAGGACTGTCGGGGCGAGGTGGCCGAGGCAGGTCAAAGCTGAAA



AGTGGAATCGGAGCTGTTGTATTACCTGGGGTGTCTACTGCAGATATTTC



ATCAAATAAGGATGATGAAGAAAACTCTATGCACAATACAGTTGTGTTGT



TTTCTAGCAGTGACAAGTTCACTTTGAATCAGGATATGTGTGTAGTTTGTG



GCAGTTTTGGCCAAGGAGCAGAAGGAAGATTACTTGCCTGTTCTCAGTGT



GGTCAGTGTTACCATCCATACTGTGTCAGTATTAAGATCACTAAAGTGGT



TCTTAGCAAAGGTTGGAGGTGTCTTGAGTGCACTGTGTGTGAGGCCTGTG



GGAAGGCAACTGACCCAGGAAGACTCCTGCTGTGTGATGACTGTGACAT



AAGTTATCACACCTACTGCCTAGACCCTCCATTGCAGACAGTTCCCAAAG



GAGGCTGGAAGTGCAAATGGTGTGTTTGGTGCAGACACTGTGGAGCAAC



ATCTGCAGGTCTAAGATGTGAATGGCAGAACAATTACACACAGTGCGCTC



CTTGTGCAAGCTTATCTTCCTGTCCAGTCTGCTATCGAAACTATAGAGAA



GAAGATCTTATTCTGCAATGTAGACAATGTGATAGATGGATGCATGCAGT



TTGTCAGAACTTAAATACTGAGGAAGAAGTGGAAAATGTAGCAGACATT



GGTTTTGATTGTAGCATGTGCAGACCCTATATGCCTGCGTCTAATGTGCCT



TCCTCAGACTGCTGTGAATCTTCACTTGTAGCACAAATTGTCACAAAAGT



AAAAGAGCTAGACCCACCCAAGACTTATACCCAGGATGGTGTGTGTTTGA



CTGAATCAGGGATGACTCAGTTACAGAGCCTCACAGTTACAGTTCCAAGA



AGAAAACGGTCAAAACCAAAATTGAAATTGAAGATTATAAATCAGAATA



GCGTGGCCGTCCTTCAGACCCCTCCAGACATCCAATCAGAGCATTCAAGG



GATGGTGAAATGGATGATAGTCGAGAAGGAGAACTTATGGATTGTGATG



GAAAATCAGAATCTAGTCCTGAGCGGGAAGCTGTGGATGATGAAACTAA



GGGAGTGGAAGGAACAGATGGTGTCAAAAAGAGAAAAAGGAAACCATA



CAGACCAGGTATTGGTGGATTTATGGTGCGGCAAAGAAGTCGAACTGGG



CAAGGGAAAACCAAAAGATCTGTGATCAGAAAAGATTCCTCAGGCTCTA



TTTCCGAGCAGTTACCTTGCAGAGATGATGGCTGGAGTGAGCAGTTACCA



GATACTTTAGTTGATGAATCTGTTTCTGTTACTGAAAGCACTGAAAAAAT



AAAGAAGAGATACCGAAAAAGGAAAAATAAGCTTGAAGAAACTTTCCCT



GCCTATTTACAAGAAGCTTTCTTTGGAAAAGATCTTCTAGATACAAGTAG



ACAAAGCAAGATAAGTTTAGATAATCTGTCAGAAGATGGAGCTCAGCTTT



TATATAAAACAAACATGAACACAGGTTTCTTGGATCCTTCCTTAGATCCA



CTACTTAGTTCATCCTCGGCTCCAACAAAATCTGGAACTCACGGTCCTGCT



GATGACCCATTAGCTGATATTTCTGAAGTTTTAAACACAGATGATGACAT



TCTTGGAATAATTTCAGATGATCTAGCAAAATCAGTTGATCATTCAGATA



TTGGTCCTGTCACTGATGATCCTTCCTCTTTGCCTCAGCCAAATGTCAATC



AGAGTTCACGACCATTAAGTGAAGAACAGCTAGATGGGATCCTCAGTCCT



GAACTAGACAAAATGGTCACAGATGGAGCAATTCTTGGAAAATTATATA



AAATTCCAGAGCTTGGCGGAAAAGATGTTGAAGACTTATTTACAGCTGTA



CTTAGTCCTGCGAACACTCAGCCAACTCCATTGCCACAGCCTCCCCCACC



AACACAGCTGTTGCCAATACACAATCAGGATGCTTTTTCACGGATGCCTC



TCATGAATGGCCTTATTGGATCCAGTCCTCATCTCCCACATAATTCTTTGC



CACCTGGAAGCGGACTGGGAACTTTCTCTGCAATTGCACAATCCTCTTAT



CCTGATGCCAGGGATAAAAATTCAGCCTTTAATCCAATGGCAAGTGATCC



TAACAACTCTTGGACATCATCAGCTCCCACTGTGGAAGGAGAAAATGACA



CAATGTCGAATGCCCAGAGAAGCACGCTTAAGTGGGAGAAAGAGGAGGC



TCTGGGTGAAATGGCAACTGTTGCCCCAGTTCTCTACACCAATATTAATTT



CCCCAACTTAAAGGAAGAATTCCCTGATTGGACTACTAGAGTGAAGCAA



ATTGCCAAATTGTGGAGAAAAGCAAGCTCACAAGAAAGAGCACCATATG



TGCAAAAAGCCAGAGATAACAGAGCTGCTTTACGCATTAATAAAGTACA



GATGTCAAATGATTCCATGAAAAGGCAGCAACAGCAAGATAGCATTGAT



CCCAGCTCTCGTATTGATTCGGAGCTTTTTAAAGATCCTTTAAAGCAAAG



AGAATCAGAACATGAACAGGAATGGAAATTTAGACAGCAAATGCGTCAG



AAAAGTAAGCAGCAAGCTAAAATTGAAGCCACACAGAAACTTGAACAGG



TGAAAAATGAGCAGCAGCAGCAGCAACAACAGCAATTTGGTTCTCAGCA



TCTTCTGGTGCAGTCTGGTTCAGATACACCAAGTAGTGGGATACAGAGTC



CCTTGACACCTCAGCCTGGCAATGGAAATATGTCTCCTGCACAGTCATTC



CATAAAGAACTGTTTACAAAACAGCCACCCAGTACCCCTACGTCTACATC



TTCAGATGATGTGTTTGTAAAGCCACAAGCTCCACCTCCTCCTCCAGCCCC



ATCCCGGATTCCCATCCAGGATAGTCTTTCTCAGGCTCAGACTTCTCAGCC



ACCCTCACCGCAAGTGTTTTCACCTGGGTCCTCTAACTCACGACCACCATC



TCCAATGGATCCATATGCAAAAATGGTTGGTACCCCTCGACCACCTCCTG



TGGGCCATAGTTTTTCCAGAAGAAATTCTGCTGCACCAGTGGAAAACTGT



ACACCTTTATCATCGGTATCTAGGCCCCTTCAAATGAATGAGACAACAGC



AAATAGGCCATCCCCTGTCAGAGATTTATGTTCTTCTTCCACGACAAATA



ATGACCCCTATGCAAAACCTCCAGACACACCTAGGCCTGTGATGACAGAT



CAATTTCCCAAATCCTTGGGCCTATCCCGGTCTCCTGTAGTTTCAGAACAA



ACTGCAAAAGGCCCTATAGCAGCTGGAACCAGTGATCACTTTACTAAACC



ATCTCCTAGGGCAGATGTGTTTCAAAGACAAAGGATACCTGACTCATATG



CACGACCCTTGTTGACACCTGCACCTCTTGATAGTGGTCCTGGACCTTTTA



AGACTCCAATGCAACCTCCTCCATCCTCTCAGGATCCTTATGGATCAGTGT



CACAGGCATCAAGGCGATTGTCTGTTGACCCTTATGAAAGGCCTGCTTTG



ACACCAAGACCTATAGATAATTTTTCTCATAATCAGTCAAATGATCCATA



TAGTCAGCCTCCCCTTACCCCACATCCAGCAGTGAATGAATCTTTTGCCCA



TCCTTCAAGGGCTTTTTCCCAGCCTGGAACCATATCAAGGCCAACATCTC



AGGACCCATACTCCCAACCCCCAGGAACTCCACGACCTGTTGTAGATTCT



TATTCCCAATCTTCAGGAACAGCTAGGTCCAATACAGACCCTTACTCTCA



ACCTCCTGGAACTCCCCGGCCTACTACTGTTGACCCATATAGTCAGCAGC



CCCAAACCCCAAGACCATCTACACAAACTGACTTGTTTGTTACACCTGTA



ACAAATCAGAGGCATTCTGATCCATATGCTCATCCTCCTGGAACACCAAG



ACCTGGAATTTCTGTCCCTTACTCTCAGCCACCAGCAACACCAAGGCCAA



GGATTTCAGAGGGTTTTACTAGGTCCTCAATGACAAGACCAGTCCTCATG



CCAAATCAGGATCCTTTCCTGCAAGCAGCACAAAACCGAGGACCAGCTTT



ACCTGGCCCGTTGGTAAGGCCACCTGATACATGTTCCCAGACACCTAGGC



CCCCTGGACCTGGTCTTTCAGACACATTTAGCCGTGTTTCCCCATCTGCTG



CCCGTGATCCCTATGATCAGTCTCCAATGACTCCAAGATCTCAGTCTGACT



CTTTTGGAACAAGTCAAACTGCCCATGATGTTGCTGATCAGCCAAGGCCT



GGATCAGAGGGGAGCTTCTGTGCATCTTCAAACTCTCCAATGCACTCCCA



AGGCCAGCAGTTCTCTGGTGTCTCCCAACTTCCTGGACCTGTGCCAACTTC



AGGAGTAACTGATACACAGAATACTGTAAATATGGCCCAAGCAGATACA



GAGAAATTGAGACAGCGGCAGAAGTTACGTGAAATCATTCTCCAGCAGC



AACAGCAGAAGAAGATTGCAGGTCGACAGGAGAAGGGGTCACAGGACTC



ACCCGCAGTGCCTCATCCAGGGCCTCTTCAACACTGGCAACCAGAGAATG



TTAACCAGGCTTTCACCAGACCCCCACCTCCCTATCCTGGGAACATTAGG



TCTCCTGTTGCCCCTCCTTTAGGACCTAGATATGCTGTTTTCCCAAAAGAT



CAGCGTGGACCCTATCCTCCTGATGTTGCTAGTATGGGGATGAGACCTCA



TGGATTTAGATTTGGATTTCCAGGAGGTAGTCATGGTACCATGCCGAGTC



AAGAGCGCTTCCTTGTGCCTCCTCAGCAAATACAGGGATCTGGAGTTTCT



CCACAGCTAAGAAGATCAGTATCTGTAGATATGCCTAGGCCTTTAAATAA



CTCACAAATGAATAATCCAGTTGGACTTCCTCAGCATTTTTCACCACAGA



GCTTGCCAGTTCAGCAGCACAACATACTGGGCCAAGCATATATTGAACTG



AGACATAGGGCTCCTGACGGAAGGCAACGGCTGCCTTTCAGTGCTCCACC



TGGCAGCGTTGTAGAGGCATCTTCTAATCTGAGACATGGAAACTTCATTC



CCCGGCCAGACTTTCCGGGCCCTAGACACACAGACCCCATGCGACGACCT



CCCCAGGGTCTACCTAATCAGCTACCTGTGCACCCAGATTTGGAACAAGT



GCCACCATCTCAACAAGAGCAAGGTCATTCTGTCCATTCATCTTCTATGGT



CATGAGGACTCTGAACCATCCACTAGGTGGTGAATTTTCAGAAGCTCCTT



TGTCAACATCTGTACCGTCTGAAACAACGTCTGATAATTTACAGATAACC



ACCCAGCCTTCTGATGGTCTAGAGGAAAAACTTGATTCTGATGACCCTTC



TGTGAAGGAACTGGATGTTAAAGACCTTGAGGGGGTTGAAGTCAAAGAC



TTAGATGATGAAGATCTTGAAAACTTAAATTTAGATACAGAGGATGGCAA



GGTAGTTGAATTGGATACTTTAGATAATTTGGAAACTAATGATCCCAACC



TGGATGACCTCTTAAGGTCAGGAGAGTTTGATATCATTGCATATACAGAT



CCAGAACTTGACATGGGAGATAAGAAAAGCATGTTTAATGAGGAACTAG



ACCTTCCAATTGATGATAAGTTAGATAATCAGTGTGTATCTGTTGAACCA



AAAAAAAAGGAACAAGAAAACAAAACTCTGGTTCTCTCTGATAAACATT



CACCACAGAAAAAATCCACTGTTACCAATGAGGTAAAAACGGAAGTACT



GTCTCCAAATTCTAAGGTGGAATCCAAATGTGAAACTGAAAAAAATGAT



GAGAATAAAGATAATGTTGACACTCCTTGCTCACAGGCTTCTGCTCACTC



AGACCTAAATGATGGAGAAAAGACTTCTTTGCATCCTTGTGATCCAGATC



TATTTGAGAAAAGAACCAATCGAGAAACTGCTGGCCCCAGTGCAAATGT



CATTCAGGCATCCACTCAACTACCTGCTCAAGATGTAATAAACTCTTGTG



GCATAACTGGATCAACTCCAGTTCTCTCAAGTTTACTTGCTAATGAGAAA



TCTGATAATTCAGACATTAGGCCATCGGGGTCTCCACCACCACCAACTCT



GCCGGCCTCCCCATCCAATCATGTGTCAAGTTTGCCTCCTTTCATAGCACC



GCCTGGCCGTGTTTTGGATAATGCCATGAATTCTAATGTGACAGTAGTCT



CTAGGGTAAACCATGTTTTTTCTCAGGGTGTGCAGGTAAACCCAGGGCTC



ATTCCAGGTCAATCAACAGTTAACCACAGTCTGGGGACAGGAAAACCTG



CAACTCAAACTGGGCCTCAAACAAGTCAGTCTGGTACCAGTAGCATGTCT



GGACCCCAACAGCTAATGATTCCTCAAACATTAGCACAGCAGAATAGAG



AGAGGCCCCTTCTTCTAGAAGAACAGCCTCTACTTCTACAGGATCTTTTG



GATCAAGAAAGGCAAGAACAGCAGCAGCAAAGACAGATGCAAGCCATG



ATTCGTCAGCGATCAGAACCGTTCTTCCCTAATATTGATTTTGATGCAATT



ACAGATCCTATAATGAAAGCCAAAATGGTGGCCCTTAAAGGTATAAATA



AAGTGATGGCACAAAACAATCTGGGCATGCCACCAATGGTGATGAGCAG



GTTCCCTTTTATGGGCCAGGTGGTAACTGGAACACAGAACAGTGAAGGAC



AGAACCTTGGACCACAGGCCATTCCTCAGGATGGCAGTATAACACATCAG



ATTTCTAGGCCTAATCCTCCAAATTTTGGTCCAGGCTTTGTCAATGATTCA



CAGCGTAAGCAGTATGAAGAGTGGCTCCAGGAGACCCAACAGCTGCTTC



AAATGCAGCAGAAGTATCTTGAAGAACAAATTGGTGCTCACAGAAAATC



TAAGAAGGCCCTTTCAGCTAAACAACGTACTGCCAAGAAAGCTGGGCGT



GAATTTCCAGAGGAAGATGCAGAACAACTCAAGCATGTTACTGAACAGC



AAAGCATGGTTCAGAAACAGCTAGAACAGATTCGTAAACAACAGAAAGA



ACATGCTGAATTGATTGAAGATTATCGGATCAAACAGCAGCAGCAATGTG



CAATGGCCCCACCTACCATGATGCCCAGTGTCCAGCCCCAGCCACCCCTA



ATTCCAGGTGCCACTCCACCCACCATGAGCCAACCCACCTTTCCCATGGT



GCCACAGCAGCTTCAGCACCAGCAGCACACAACAGTTATTTCTGGCCATA



CTAGCCCTGTTAGAATGCCCAGTTTACCTGGATGGCAACCCAACAGTGCT



CCTGCCCACCTGCCCCTCAATCCTCCTAGAATTCAGCCCCCAATTGCCCAG



TTACCAATAAAAACTTGTACACCAGCCCCAGGGACAGTCTCAAATGCAAA



TCCACAGAGTGGACCACCACCTCGGGTAGAATTTGATGACAACAATCCCT



TTAGTGAAAGTTTTCAAGAACGGGAACGTAAGGAACGTTTACGAGAACA



GCAAGAGAGACAACGGATCCAACTCATGCAGGAGGTAGATAGACAAAGA



GCTTTGCAGCAGAGGATGGAAATGGAGCAGCATGGTATGGTGGGCTCTG



AGATAAGTAGTAGTAGGACATCTGTGTCCCAGATTCCCTTCTACAGTTCC



GACTTACCTTGTGATTTTATGCAACCTCTAGGACCCCTTCAGCAGTCTCCA



CAACACCAACAGCAAATGGGGCAGGTTTTACAGCAGCAGAATATACAAC



AAGGATCAATTAATTCACCCTCCACCCAAACTTTCATGCAGACTAATGAG



CGAAGGCAGGTAGGCCCTCCTTCATTTGTTCCTGATTCACCATCAATCCCT



GTTGGAAGCCCAAATTTTTCTTCTGTGAAGCAGGGACATGGAAATCTTTC



TGGGACCAGCTTCCAGCAGTCCCCAGTGAGGCCTTCTTTTACACCTGCTTT



ACCAGCAGCACCTCCAGTAGCTAATAGCAGTCTCCCATGTGGCCAAGATT



CTACTATAACCCATGGACACAGTTATCCGGGATCAACCCAATCGCTCATT



CAGTTGTATTCTGATATAATCCCAGAGGAAAAAGGGAAAAAGAAAAGAA



CAAGAAAGAAGAAAAGAGATGATGATGCAGAATCCACCAAGGCTCCATC



AACTCCCCATTCAGATATAACTGCCCCACCGACTCCAGGCATCTCAGAAA



CTACCTCTACTCCTGCAGTGAGCACACCCAGTGAGCTTCCTCAACAAGCC



GACCAAGAGTCGGTGGAACCAGTCGGCCCATCCACTCCCAATATGGCAG



CAGGCCAGCTATGTACAGAATTAGAGAACAAACTGCCCAATAGTGATTTC



TCACAAGCAACTCCAAATCAACAGACGTATGCAAATTCAGAAGTAGACA



AGCTCTCCATGGAAACCCCTGCCAAAACAGAAGAGATAAAACTGGAAAA



GGCTGAGACAGAGTCCTGCCCAGGCCAAGAGGAGCCTAAATTGGAGGAA



CAGAATGGTAGTAAGGTAGAAGGAAACGCTGTAGCCTGTCCTGTCTCCTC



AGCACAGAGTCCTCCCCATTCTGCTGGGGCCCCTGCTGCCAAAGGAGACT



CAGGGAATGAACTTCTGAAACACTTGTTGAAAAATAAAAAGTCATCTTCT



CTTTTGAATCAAAAACCTGAGGGCAGTATTTGTTCAGAAGATGACTGTAC



AAAGGATAATAAACTAGTTGAGAAGCAGAACCCAGCTGAAGGACTGCAA



ACTTTGGGGGCTCAAATGCAAGGTGGTTTTGGATGTGGCAACCAGTTGCC



AAAAACAGATGGAGGAAGTGAAACCAAGAAACAGCGAAGCAAACGGAC



TCAGAGGACGGGTGAGAAAGCAGCACCTCGCTCAAAGAAAAGGAAAAA



GGACGAAGAGGAGAAACAAGCTATGTACTCTAGCACTGACACGTTTACC



CACTTGAAACAGCAGAATAATTTAAGTAATCCTCCAACACCCCCTGCCTC



TCTTCCTCCTACACCACCTCCTATGGCTTGTCAGAAGATGGCCAATGGTTT



TGCAACAACTGAAGAACTTGCTGGAAAAGCCGGAGTGTTAGTGAGCCAT



GAAGTTACCAAAACTCTAGGACCTAAACCATTTCAGCTGCCCTTCAGACC



CCAGGACGACTTGTTGGCCCGAGCTCTTGCTCAGGGCCCCAAGACAGTTG



ATGTGCCAGCCTCCCTCCCAACACCACCTCATAACAATCAGGAAGAATTA



AGGATACAGGATCACTGTGGTGATCGAGATACTCCTGACAGTTTTGTTCC



CTCATCCTCTCCTGAGAGTGTGGTTGGGGTAGAAGTGAGCAGGTATCCAG



ATCTGTCATTGGTCAAGGAGGAGCCTCCAGAACCGGTGCCGTCCCCCATC



ATTCCAATTCTTCCTAGCACTGCTGGGAAAAGTTCAGAATCAAGAAGGAA



TGACATCAAAACTGAGCCAGGCACTTTATATTTTGCGTCACCTTTTGGTCC



TTCCCCAAATGGTCCCAGATCAGGTCTTATATCTGTAGCAATTACTCTGCA



TCCTACAGCTGCTGAGAACATTAGCAGTGTTGTGGCTGCATTTTCCGACCT



TCTTCACGTCCGAATCCCTAACAGCTATGAGGTTAGCAGTGCTCCAGATG



TCCCATCCATGGGTTTGGTCAGTAGCCACAGAATCAACCCGGGTTTGGAG



TATCGACAGCATTTACTTCTCCGTGGGCCTCCGCCAGGATCTGCAAACCC



TCCCAGATTAGTGAGCTCTTACCGGCTGAAGCAGCCTAATGTACCATTTC



CTCCAACAAGCAATGGTCTTTCTGGATATAAGGATTCTAGTCATGGTATT



GCAGAAAGCGCAGCACTCAGACCACAGTGGTGTTGTCATTGTAAAGTGGT



TATTCTTGGAAGTGGTGTGCGGAAATCTTTCAAAGATCTGACCCTTTTGA



ACAAGGATTCCCGAGAAAGCACCAAGAGGGTAGAGAAGGACATTGTCTT



CTGTAGTAATAACTGCTTTATTCTTTATTCATCAACTGCACAAGCGAAAA



ACTCAGAAAACAAGGAATCCATTCCTTCATTGCCACAATCACCTATGAGA



GAAACGCCTTCCAAAGCATTTCATCAGTACAGCAACAACATCTCCACTTT



GGATGTGCACTGTCTCCCCCAGCTCCCAGAGAAAGCTTCTCCCCCTGCCT



CACCACCCATCGCCTTCCCTCCTGCTTTTGAAGCAGCCCAAGTCGAGGCC



AAGCCAGATGAGCTGAAGGTGACAGTCAAGCTGAAGCCTCGGCTAAGAG



CTGTCCATGGTGGGTTTGAAGATTGCAGGCCGCTCAATAAAAAATGGAGA



GGAATGAAATGGAAGAAGTGGAGCATTCATATTGTAATCCCTAAGGGGA



CATTTAAACCACCTTGTGAGGATGAAATAGATGAATTTCTAAAGAAATTG



GGCACTTCCCTTAAACCTGATCCTGTGCCCAAAGACTATCGGAAATGTTG



CTTTTGTCATGAAGAAGGTGATGGATTGACAGATGGACCAGCAAGGCTAC



TCAACCTTGACTTGGATCTGTGGGTCCACTTGAACTGCGCTCTGTGGTCCA



CGGAGGTCTATGAGACTCAGGCTGGTGCCTTAATAAATGTGGAGCTAGCT



CTGAGGAGAGGCCTACAAATGAAATGTGTCTTCTGTCACAAGACGGGTGC



CACTAGTGGATGCCACAGATTTCGATGCACCAACATTTATCACTTCACTT



GCGCCATTAAAGCACAATGCATGTTTTTTAAGGACAAAACTATGCTTTGC



CCCATGCACAAACCAAAGGGAATTCATGAGCAAGAATTAAGTTACTTTGC



AGTCTTCAGGAGGGTCTATGTTCAGCGTGATGAGGTGCGACAGATTGCTA



GCATCGTGCAACGAGGAGAACGGGACCATACCTTTCGCGTGGGTAGCCTC



ATCTTCCACACAATTGGTCAGCTGCTTCCACAGCAGATGCAAGCATTCCA



TTCTCCTAAAGCACTCTTCCCTGTGGGCTATGAAGCCAGCCGGCTGTACT



GGAGCACTCGCTATGCCAATAGGCGCTGCCGCTACCTGTGCTCCATTGAG



GAGAAGGATGGGCGCCCAGTGTTTGTCATCAGGATTGTGGAACAAGGCC



ATGAAGACCTGGTTCTAAGTGACATCTCACCTAAAGGTGTCTGGGATAAG



ATTTTGGAGCCTGTGGCATGTGTGAGAAAAAAGTCTGAAATGCTCCAGCT



TTTCCCAGCGTATTTAAAAGGAGAGGATCTGTTTGGCCTGACCGTCTCTG



CAGTGGCACGCATAGCGGAATCACTTCCTGGGGTTGAGGCATGTGAAAAT



TATACCTTCCGATACGGCCGAAATCCTCTCATGGAACTTCCTCTTGCCGTT



AACCCCACAGGTTGTGCCCGTTCTGAACCTAAAATGAGTGCCCATGTCAA



GAGGTTTGTGTTAAGGCCTCACACCTTAAACAGCACCAGCACCTCAAAGT



CATTTCAGAGCACAGTCACTGGAGAACTGAACGCACCTTATAGTAAACAG



TTTGTTCACTCCAAGTCATCGCAGTACCGGAAGATGAAAACTGAATGGAA



ATCCAATGTGTATCTGGCACGGTCTCGGATTCAGGGGCTGGGCCTGTATG



CTGCTCGAGACATTGAGAAACACACCATGGTCATTGAGTACATCGGGACT



ATCATTCGAAACGAAGTAGCCAACAGGAAAGAGAAGCTTTATGAGTCTC



AGAACCGTGGTGTGTACATGTTCCGCATGGATAACGACCATGTGATTGAC



GCGACGCTCACAGGAGGGCCCGCAAGGTATATCAACCATTCGTGTGCACC



TAATTGTGTGGCTGAAGTGGTGACTTTTGAGAGAGGACACAAAATTATCA



TCAGCTCCAGTCGGAGAATCCAGAAAGGAGAAGAGCTCTGCTATGACTA



TAAGTTTGACTTTGAAGATGACCAGCACAAGATTCCGTGTCACTGTGGAG



CTGTGAACTGCCGGAAGTGGATGAACTGA





CLTB->
ATGGCTGATGACTTTGGCTTCTTCTCGTCGTCGGAGAGCGGTGCCCCGGA
SEQ ID


CDHR2
GGCGGCGGAGGAGGACCCGGCGGCCGCCTTCCTGGCCCAGCAGGAGAGC
NO: 26



GAGATTGCAGGCATAGAGAACGACGAGGGCTTCGGGGCACCTGCCGGCA



GCCATGCGGCCCCCGCGCAGCCGGGCCCCACGAGTGGGGCTGGTTCTGA



GGACATGGGGACCACAGTCAATGGAGATGTGTTTCAGGTGCCCAGGCCTT



CTGGTTGGTAGCGGAAGACCAGGACAATGACCCTCTGACCTATGGGATG



AGCGGCCCCAATGCCTACTTCTTCGCTGTCACTCCGAAAACTGGGGAAGT



GAAGCTGGCCAGCGCTCTGGACTACGAGACACTCTACACATTCAAAGTCA



CCATCTCCGTGAGCGACCCCTACATCCAGGTGCAGAGGGAGATGCTGGTG



ATTGTGGAAGATAGAAACGACAACGCACCCGTTTTCCAGAACACCGCTTT



CTCCACCAGCATCAACGAGACCCTGCCCGTGGGCAGTGTGGTGTTCTCCG



TGCTGGCCGTGGATAAAGACATGGGGTCTGCAGGCATGGTCGTGTACTCC



ATAGAGAAGGTCATCCCTAGCACTGGGGACAGCGAGCATCTCTTCCGGAT



CCTGGCCAATGGCTCCATAGTCCTCAATGGCAGCCTCAGCTACAACAACA



AGAGCGCTTTCTACCAGCTGGAGCTGAAGGCCTGTGACTTGGGCGGCATG



TACCACAACACCTTCACCATCCAGTGCTCCCTGCCTGTCTTCCTGTCCATC



TCCGTGGTGGACCAGCCTGACCTTGACCCCCAGTTTGTCAGGGAGTTTTA



CTCGGCCTCTGTGGCTGAGGATGCAGCCAAGGGAACCTCGGTGCTGACGG



TGGAGGCTGTGGATGGCGACAAAGGCATCAATGACCCTGTGATCTACAG



CATCTCCTACTCCACGCGGCCCGGCTGGTTTGACATCGGGGCAGATGGGG



TGATCAGGGTCAACGGCTCCCTGGACCGTGAGCAGCTGCTGGAGGCGGA



TGAGGAGGTGCAGCTGCAGGTCACGGCCACCGAGACACACCTCAACATC



TACGGGCAGGAGGCCAAGGTGAGCATCTGGGTGACAGTGAGAGTGATGG



ACGTCAATGACCACAAACCTGAGTTTTACAACTGCAGCCTCCCAGCCTGC



ACCTTCACCCCCGAAGAGGCCCAAGTGAACTTCACTGGCTACGTGGACGA



GCATGCCTCCCCCCGCATCCCCATCGATGACCTCACCATGGTGGTCTACG



ACCCGGACAAGGGCAGCAATGGCACCTTCCTGTTGTCGCTGGGGGGCCCC



GATGCAGAAGCCTTCAGCGTCTCCCCGGAGCGGGCAGTGGGCTCAGCCTC



CGTTCAGGTGCTGGTGAGAGTATCCGCGCTGGTGGACTACGAGAGGCAG



ACGGCGATGGCGGTGCAGGTTGTGGCCACAGACTCCGTCAGCCAGAACTT



CTCCGTCGCCATGGTGACCATCCACCTTAGAGACATTAATGACCACAGGC



CCACGTTTCCCCAGAGCTTGTACGTCCTCACGGTGCCAGAGCACAGCGCC



ACCGGCTCTGTGGTCACCGACAGCATCCACGCCACGGACCCAGACACGG



GCGCGTGGGGCCAAATTACCTACAGCCTGCTCCCAGGAAATGGGGCAGA



CCTCTTCCAAGTGGATCCCGTCTCAGGGACGGTGACGGTGAGGAACGGTG



AGCTGCTGGACCGGGAGAGCCAGGCCGTGTACTACCTGACGCTGCAGGC



CACAGACGGCGGGAACCTGTCCTCCTCCACCACACTGCAGATCCACCTGC



TGGACATCAACGACAATGCACCCGTGGTTAGCGGCTCCTACAACATCTTC



GTCCAGGAGGAGGAGGGCAATGTCTCCGTGACCATCCAGGCCCACGACA



ATGATGAGCCGGGCACCAACAACAGCCGTCTGCTCTTCAACCTGCTGCCT



GGCCCCTACAGCCACAACTTCTCCTTGGACCCTGACACAGGGCTCCTCAG



AAACCTGGGGCCCCTGGACAGAGAGGCCATCGACCCCGCCCTGGAGGGC



CGCATTGTGCTGACAGTGCTTGTGTCTGACTGCGGCGAGCCTGTCCTCGG



CACCAAAGTCAATGTCACCATCACTGTGGAGGACATCAATGATAACCTGC



CCATCTTCAATCAGTCCAGCTACAACTTTACGGTGAAGGAGGAGGATCCA



GGAGTGCTAGTGGGCGTGGTGAAGGCCTGGGACGCGGACCAGACGGAAG



CCAACAACCGCATCAGCTTCAGCCTGTCGGGGAGTGGTGCCAACTACTTC



ATGATCCGAGGCTTGGTGCTGGGGGCTGGGTGGGCTGAGGGCTACCTCCG



GCTGCCCCCGGACGTGAGCCTGGATTACGAGACACAGCCCGTCTTCAACT



TGACAGTGAGTGCTGAGAACCCAGACCCCCAGGGGGGTGAGACCATAGT



AGACGTCTGCGTGAATGTGAAAGACGTGAACGACAATCCCCCCACCCTG



GATGTAGCCTCACTCCGGGGCATCCGTGTGGCTGAGAATGGCTCACAGCA



CGGCCAGGTGGCTGTGGTGGTTGCCTCGGATGTGGACACCAGTGCCCAGC



TGGAGATACAGCTTGTGAACATTCTCTGCACCAAGGCCGGGGTCGATGTG



GGCAGCCTATGCTGGGGCTGGTTCTCAGTGGCGGCCAACGGCTCTGTGTA



CATCAACCAGAGCAAAGCCATCGACTACGAGGCCTGTGACCTGGTCACG



CTGGTTGTGCGGGCCTGTGACCTAGCCACGGACCCCGGCTTCCAGGCCTA



CAGCAACAATGGAAGCCTCCTCATTACCATTGAGGACGTGAATGACAATG



CACCCTATTTTCTGCCTGAGAATAAGACTTTTGTGATCATCCCTGAACTCG



TGCTGCCCAACCGGGAGGTGGCTTCTGTCCGGGCCAGAGACGATGATTCA



GGGAACAATGGCGTCATCCTGTTCTCCATCCTCCGAGTAGACTTCATCTCT



AAGGACGGGGCCACCATCCCTTTCCAGGGTGTCTTCTCGATCTTCACCTCC



TCCGAGGCCGACGTGTTCGCTGGGAGCATTCAGCCGGTGACCAGCCTCGA



CTCCACTCTCCAAGGCACCTACCAAGTGACAGTCCAGGCCAGGGACAGA



CCTTCCTTGGGTCCTTTCCTGGAAGCCACCACCACCCTGAATCTCTTCACC



GTGGACCAGAGTTACCGCTCGCGGCTGCAGTTCTCCACACCGAAGGAGG



AGGTGGGCGCCAACAGACAGGCGATTAATGCGGCTCTTACCCAGGCAAC



CAGGACTACAGTATACATTGTGGACATTCAGGACATAGATTCTGCAGCTC



GGGCCCGACCTCACTCCTACCTCGATGCCTACTTTGTCTTCCCCAATGGGT



CAGCCCTGACCCTTGATGAGCTGAGTGTGATGATCCGGAATGATCAGGAC



TCGCTGACGCAGCTGCTGCAGCTGGGGCTGGTGGTGCTGGGCTCCCAGGA



GAGCCAGGAGTCAGACCTGTCGAAACAGCTCATCAGTGTCATCATAGGAT



TGGGAGTGGCTTTGCTGCTGGTCCTTGTGATCATGACCATGGCCTTCGTGT



GTGTGCGGAAGAGCTACAACCGGAAGCTTCAAGCTATGAAGGCTGCCAA



GGAGGCCAGGAAGACAGCAGCAGGGGTGATGCCCTCAGCCCCTGCCATC



CCAGGGACTAACATGTACAACACTGAGCGAGCCAACCCCATGCTGAACC



TCCCCAACAAAGACCTGGGCTTGGAGTACCTCTCTCCCTCCAATGACCTG



GACTCTGTCAGCGTCAACTCCCTGGACGACAACTCTGTGGATGTGGACAA



GAACAGTCAGGAAATCAAGGAGCACAGGCCACCACACACACCACCAGAG



CCAGATCCAGAGCCCCTGAGCGTGGTCCTGTTAGGACGGCAGGCAGGCG



CAAGTGGACAGCTGGAGGGGCCATCCTACACCAACGCTGGCCTGGACAC



CACGGACCTGTGA





PDE4D->
ATGATGCACGTGAATAATTTTCCCTTTAGAAGGCATTCCTGGATATGTTTT
SEQ ID


ITGA1
GATGTGGACAATGGCACATCTGCGGGACGGAGTCCCTTGGATCCCATGAC
NO: 27



CAGCCCAGGATCCGGGCTAATTCTCCAAGCAAATTTTGTCCACAGTCAAC



GACGGGAGTCCTTCCTGTATCGATCCGACAGCGATTATGACCTCTCTCCA



AAGTCTATGTCCCGGAACTCCTCCATTGCCAGTGATATACACGGAGATGA



CTTGATTGTGACTCCATTTGCTCAGGTCTTGGCCAGTCTGCGAACTGTACG



AAACAACTTTGCTGCATTAACTAATTTGCAAGATCGAGCACCTAGCAAAA



GATCACCCATGTGCAACCAACCATCCATCAACAAAGCCACCATAACAGA



GGAGGCCTACCAGAAACTGGCCAGCGAGACCCTGGAGGAGCTGGACTGG



TGTCTGGACCAGCTAGAGACCCTACAGACCAGGCACTCCGTCAGTGAGAT



GGCCTCCAACAAGTTTAAAAGGATGCTTAATCGGGAGCTCACCCATCTCT



CTGAAATGAGTCGGTCTGGAAATCAAGTGTCAGAGTTTATATCAAACACA



TTCTTAGATAAGCAACATGAAGTGGAAATTCCTTCTCCAACTCAGAAGGA



AAAGGAGAAAAAGAAAAGACCAATGTCTCAGATCAGTGGAGTCAAGAA



ATTGATGCACAGCTCTAGTCTGACTAATTCAAGTATCCCAAGGTTTGGAG



TTAAAACTGAACAAGAAGATGTCCTTGCCAAGGAACTAGAAGATGTGAA



CAAATGGGGTCTTCATGTTTTCAGAATAGCAGAGTTGTCTGGTAACCGGC



CCTTGACTGTTATCATGCACACCATTTTTCAGGAACGGGATTTATTAAAA



ACATTTAAAATTCCAGTAGATACTTTAATTACATATCTTATGACTCTCGAA



GACCATTACCATGCTGATGTGGCCTATCACAACAATATCCATGCTGCAGA



TGTTGTCCAGTCTACTCATGTGCTATTATCTACACCTGCTTTGGAGGCTGT



GTTTACAGATTTGGAGATTCTTGCAGCAATTTTTGCCAGTGCAATACATG



ATGTAGATCATCCTGGTGTGTCCAATCAATTTCTGATCAATACAAGACAA



GCATGACTTTCAGGACTCTGTGAGAATAACGTTGGACTTTAATCTTACCG



ATCCAGAAAATGGGCCTGTTCTTGATGATTCTCTACCAAACTCAGTACAT



GAATATATTCCCTTTGCCAAAGATTGTGGAAATAAGGAAAAATGTATCTC



AGACCTCAGCCTGCATGTCGCCACCACTGAAAAGGACCTGCTGATTGTCC



GATCCCAGAATGATAAGTTCAACGTTAGCCTCACAGTCAAAAATACAAA



GGACAGTGCCTATAACACCAGGACAATAGTGCATTATTCTCCAAATCTAG



TTTTTTCAGGAATTGAGGCTATCCAAAAAGACAGTTGTGAATCTAATCAT



AATATCACATGTAAAGTTGGATATCCCTTCCTGAGAAGAGGAGAGATGGT



AACTTTCAAAATATTGTTTCAGTTTAACACATCCTATCTCATGGAAAATGT



GACCATTTATTTAAGTGCAACAAGTGACAGCGAAGAACCTCCTGAAACCC



TTTCTGATAATGTAGTAAACATTTCTATCCCGGTAAAATATGAAGTTGGA



CTACAGTTTTACAGCTCTGCAAGTGAATACCACATTTCAATTGCTGCCAAT



GAGACAGTCCCTGAAGTTATTAATTCTACTGAGGACATTGGAAATGAAAT



TAATATCTTCTACTTGATTAGAAAAAGTGGATCTTTTCCAATGCCAGAGCT



TAAGCTGTCAATTTCATTCCCCAATATGACATCAAATGGTTACCCTGTGCT



GTACCCAACTGGATTGTCATCTTCTGAGAATGCAAACTGCAGACCCCATA



TCTTTGAGGATCCTTTCAGTATCAACTCTGGAAAGAAAATGACTACATCA



ACTGACCATCTCAAACGAGGCACAATTCTGGACTGCAATACATGTAAATT



TGCTACCATCACATGTAATCTCACTTCTTCTGACATCAGCCAAGTCAATGT



TTCGCTTATCTTGTGGAAACCAACTTTTATAAAATCATATTTTTCCAGCTT



AAATCTTACTATAAGGGGAGAACTTCGGAGTGAAAATGCATCTCTGGTTT



TAAGTAGCAGCAATCAAAAAAGAGAGCTTGCTATTCAAATATCCAAAGA



TGGGCTACCGGGCAGAGTGCCATTATGGGTCATCCTGCTGAGTGCTTTTG



CCGGATTGTTGCTGTTAATGCTGCTCATTTTAGCACTGTGGAAGATTGGAT



TCTTCAAAAGACCACTGAAAAAGAAAATGGAGAAATGA





DIP2B->
ATGGCGGAACGAGGCCTGGAGCCGTCGCCGGCCGCGGTGGCGGCGCTGC
SEQ ID


LINC00330
CGCCTGAAGTGCGGGCGCAGCTGGCGGAGCTGGAGCTGGAGCTCTCGGA
NO: 28



GGGGGACATCACCCAGAAGGGCTATGAAAAGAAAAGGTCCAAACTCCTA



TCTCCTTACAGCCCGCAGACACAAGAAACTGATTCAGCAGTACAGAAAG



AACTTAGAAACCAGACACCTGCTCCATCTGCAGCTCAAACTTCTGCTCCC



TCTAAGTACCACCGAACTCGATCTGGGGGAGCCAGGGATGAACGATATC



GATCAGAGTCTTGCTCTGTCGCCCAGGCTGGAGTGCAGTGGTGCGACCTT



GGCTCACTGGTTTGGGGCATGTTCCTGGGTGTGTCTGTGAGGAGCAAGAA



AGCCAGTTCAAGTCCCAAAACCTCAAAAGTAGGGAAGCCGACAGGGCAG



CCTTAAGTCTGTGGCTGAAGGCCTGAGAGACCCTGGCAAATCACTGATAT



AAGTCCAAAAGCTGAAGAACTTGGAATCTGATGTTGGAATCTGATCTTGG



CCCATCCTTGAAGAATCTGGGGGTAATAAATACCCTGAACATTGATGAAC



CCAAAGGGAAGCCAGAAGGTCAGATGTCAGCTGTCGGGAAGAAAGGCTG



GATAGGAAATAGGAAAAAGCAGGGCAAACAAGGAGCTTCAAGGACTGA



AACCCTTAAGAACCACTTGACTCTGTCTCTCACCACCTCCAGCCTTGATGA



CACGGTGACATATAGAAGAACTGATGCCCTTTGCTTTGAGCTGCACAAAA



ACCTGGCGCTGGACTTGGAGAAGCTGAAAGACAGGACCCGGCAGGAGCC



TGAGGAGCTGCAGGCTGCTAGCCTGTAGACCACCGTGCTGTGTTAGCCAC



TGCAGCTGGGACCATTCTGACACTCAGAGCCTGGAAATGGCTGCTCCCTC



CTGCCTGCTATCCAAGTATCCTGTGGATACTTGGCTTTTCTCTTGTGGGCA



ACACTAACCTGAAACTCTTTGGGGAAGGGAATTGAGGAAATGTAGTTCCA



AGATTAGCCAATTTAACTCAGCATAAACCCATCACAGGGTTTGACTACAG



AGTTATAACGTTATTCTGTAGGCAATGAAAACAGAAGTTTTTGAAAAAGG



CATCAATTTGATGAAATTGATATTTTGGGGAAGATTAATCTTCTGCTGAGT



AAAATATAACAGGTTGGAGGAAGATGAAAATGGAGAGAGTAGAGTGTAA



TTAAAAGTTTGTTGGAGGCCGGGCACGGGATTACGGCTCACTCCTGTAAT



CCCAGCACTTTGGGAGGCTGAGGTGGGCGGATCACCTGAGGTCAGGAGC



TTGAGATCATCCCGGCCAACATAGTGAAACCCTGTCTCTAATAAAAATAC



GAAAAATTAGCTGGGCATGGTGGCGCACACCTGTAATCTCAGCTACTCGG



GAGGCTGAGGCAGGAGAATCCCTTGAACCCAGGAGGCAGAGGTTGCAAT



GAGCCAAGATGGTGTGATTGCACTCCAGCCCAGGGGACAATGTGAGACT



CTGTCTCAAAAAAAAAAAAAAATTTGTTGTAGTAGTCCAGGTGTCCGGGT



GTTGTTACCAGGGCACCCATGATGAAATAACTGGGAGAGTCCTTTTTTCT



GACTGGTTTTCTTCAGTCCCTTTATCTACAGGCACAACTGCTGAAGAAAC



CAGATGGCCTGGGATGGCACCAGAGCTTTTTTACCCTTGACCAGATACTA



GAGGGAATTAAGACCCCACAAGTGGGCACGAACTGGAACTGGCGACCCC



TAGTTGCCTTCAGATCATTAACACATCATTATAATGCTAAAATTCCCTGCC



ATTTTGTGAACATGGGTTGCATGAAGACGTAAGTTTATGAATTGCCTCTG



CACACCTAGAGTCCCACCCTGTACAGGCTAACATTCCTCCCTACATGCAC



CCCCCCACAAACACACCCTGCCTCCCCCAGTCCTTAGAAACCCTATGCCT



GGCCGGGCGCAGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCTGA



GGCGGGCGGATCACTAGGTCAGGAGATCGAGACCATCCTGGCTAACACG



GTGAAACCCTGTCTTTACTAAAAATC





PQLC1->
ATGGAGGCCGAGGGCCTGGACTGGCTCCTGGTGCCACTGCACCAGCTGGT
SEQ ID


LINC00330
GTCCTGGGGCGCGGCCGCGGCCATGGTCTTCGGAGGGGTGGTGCCCTACG
NO: 29



TCCCGCAGTATCGGGACATTCGCAGGACGCAGAACGCCGACGGCTTCTCC



ACCTACGTGTGCCTGGTGCTGCTGGTGGCCAACATTTTGCGGATACTCTTC



TGAGTCTTGCTCTGTCGCCCAGGCTGGAGTGCAGTGGTGCGACCTTGGCT



CACTGGTTTGGGGCATGTTCCTGGGTGTGTCTGTGAGGAGCAAGAAAGCC



AGTTCAAGTCCCAAAACCTCAAAAGTAGGGAAGCCGACAGGGCAGCCTT



AAGTCTGTGGCTGAAGGCCTGAGAGACCCTGGCAAATCACTGATATAAGT



CCAAAAGCTGAAGAACTTGGAATCTGATGTTGGAATCTGATCTTGGCCCA



TCCTTGAAGAATCTGGGGGTAATAAATACCCTGAACATTGATGAACCCAA



AGGGAAGCCAGAAGGTCAGATGTCAGCTGTCGGGAAGAAAGGCTGGATA



GGAAATAGGAAAAAGCAGGGCAAACAAGGAGCTTCAAGGACTGAAACC



CTTAAGAACCACTTGACTCTGTCTCTCACCACCTCCAGCCTTGATGACACG



GTGACATATAGAAGAACTGATGCCCTTTGCTTTGAGCTGCACAAAAACCT



GGCGCTGGACTTGGAGAAGCTGAAAGACAGGACCCGGCAGGAGCCTGAG



GAGCTGCAGGCTGCTAGCCTGTAGACCACCGTGCTGTGTTAGCCACTGCA



GCTGGGACCATTCTGACACTCAGAGCCTGGAAATGGCTGCTCCCTCCTGC



CTGCTATCCAAGTATCCTGTGGATACTTGGCTTTTCTCTTGTGGGCAACAC



TAACCTGAAACTCTTTGGGGAAGGGAATTGAGGAAATGTAGTTCCAAGAT



TAGCCAATTTAACTCAGCATAAACCCATCACAGGGTTTGACTACAGAGTT



ATAACGTTATTCTGTAGGCAATGAAAACAGAAGTTTTTGAAAAAGGCATC



AATTTGATGAAATTGATATTTTGGGGAAGATTAATCTTCTGCTGAGTAAA



ATATAACAGGTTGGAGGAAGATGAAAATGGAGAGAGTAGAGTGTAATTA



AAAGTTTGTTGGAGGCCGGGCACGGGATTACGGCTCACTCCTGTAATCCC



AGCACTTTGGGAGGCTGAGGTGGGCGGATCACCTGAGGTCAGGAGCTTG



AGATCATCCCGGCCAACATAGTGAAACCCTGTCTCTAATAAAAATACGAA



AAATTAGCTGGGCATGGTGGCGCACACCTGTAATCTCAGCTACTCGGGAG



GCTGAGGCAGGAGAATCCCTTGAACCCAGGAGGCAGAGGTTGCAATGAG



CCAAGATGGTGTGATTGCACTCCAGCCCAGGGGACAATGTGAGACTCTGT



CTCAAAAAAAAAAAAAAATTTGTTGTAGTAGTCCAGGTGTCCGGGTGTTG



TTACCAGGGCACCCATGATGAAATAACTGGGAGAGTCCTTTTTTCTGACT



GGTTTTCTTCAGTCCCTTTATCTACAGGCACAACTGCTGAAGAAACCAGA



TGGCCTGGGATGGCACCAGAGCTTTTTTACCCTTGACCAGATACTAGAGG



GAATTAAGACCCCACAAGTGGGCACGAACTGGAACTGGCGACCCCTAGT



TGCCTTCAGATCATTAACACATCATTATAATGCTAAAATTCCCTGCCATTT



TGTGAACATGGGTTGCATGAAGACGTAAGTTTATGAATTGCCTCTGCACA



CCTAGAGTCCCACCCTGTACAGGCTAACATTCCTCCCTACATGCACCCCC



CCACAAACACACCCTGCCTCCCCCAGTCCTTAGAAACCCTATGCCTGGCC



GGGCGCAGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCG



GGCGGATCACTAGGTCAGGAGATCGAGACCATCCTGGCTAACACGGTGA



AACCCTGTCTTTACTAAAAATC





RABEP1->
ATGGCGCAGCCGGGCCCGGCTTCCCAGCCTGACGTTTCTCTTCAGCAACG
SEQ ID


DNAH9
GGTAGCAGAATTGGAAAAAATTAATGCAGAATTTTTACGTGCACAACAG
NO: 30



CAGCTTGAACAAGAATTTAATCAAAAGAGAGCAAAATTTAAGGAGTTAT



ATTTGGCTAAAGAGGAGGATCTGAAGAGGCAAAATGCAGTATTACAAGC



TGCACAAGATGATTTGGGACACCTTCGAACCCAGCTGTGGGAAGCTCAAG



CAGAGATGGAGAATATTAAGGCGATTGCCACAGTCTCTGAGAACACCAA



GCAAGAAGCTATAGATGAAGTGAAAAGACAGTGGAGAGAAGAAGTTGCT



TCACTTCAGGCTGTTATGAAAGAAACAGTTCGTGACTATGAGCACCAGTT



CCACCTTAGGCTGGAGCAGGAGCGAACACAGTGGGCACAGTATAGAGAA



TCCGCAGAGAGGGAAATAGCTGATTTAAGAAGAAGGCTGTCTGAAGGTC



AAGAGGAGGAAAATTTAGAAAATGAAATGAAAAAGGCCCAAGAGGATG



CTGAGAAACTTCGGTCCGTTGTGATGCCAATGGAAAAGGAAATTGCAGCT



TTGAAGGATAAACTGACAGAGGCTGAAGACAAAATTAAAGAGCTGGAGG



CCTCAAAGGTTAAAGAACTGAATCATTATCTGGAAGCTGAGAAATCTTGT



AGGACTGATCTAGAGATGTATGTAGCTGTTTTGAATACTCAGAAATCTGT



TCTACAGGAAGATGCTGAGAAACTGCGGAAAGAATTGCATGAAGGGTCT



TCTGAAGACGGCCCTGGATTTCCACAAACTGGGAAAGGTGGAGTTCAGC



GGCGTCAGAGGGAATGCTCTGAGTCAGCAGGTCCAGCAAATGCATGAAG



AATTTCAAGAGATGTACAGGCTTCTCTCAGGATCCTCCTCCGACTGCCTGT



ACCTCCAAAGCACGGACTTTGAAAATGACGTCTCTGAATTTAACCAGAAA



GTAGAAGATCTTGACCGAAGATTGGGGACTATCTTTATTCAAGCTTTTGA



TGATGCACCTGGCTTGGAGCATGCCTTTAAGCTGCTAGACATAGCAGGAA



ACCTCCTTGAAAGACCGCTGGTAGCGAGGGATACATCTGATAAATACCTG



GTCCTCATCCAAATGTTCAACAAAGATCTGGATGCAGTGAGGATGATCTA



CAGTCAGCACGTCCAGGAGGAAGCAGAACTTGGGTTCTCCCCGGTGCAC



AAGAACATGCCCACCGTGGCTGGCGGCCTCCGCTGGGCACAGGAGCTGA



GGCAGCGCATCCAGGGTCCTTTCAGCAACTTTGGACGCATCACACACCCT



TGCATGGAATCTGCAGAAGGAAAGCGAATGCAACAAAAATATGAAGATA



TGCTGTCATTGCTAGAAAAGTATGAGACAAGACTTTATGAGGATTGGTGC



CGGACAGTATCAGAGAAGTCACAGTACAATCTTTCCCAACCACTTCTAAA



ACGTGACCCAGAGACGAAGGAGATCACTATCAACTTTAACCCACAGCTG



ATTTCAGTGCTGAAAGAAATGAGCTATCTTGAACCCAGAGAGATGAAAC



ACATGCCTGAGACAGCAGCAGCCATGTTCTCCTCCAGGGATTTCTATCGG



CAGCTTGTGGCTAATTTAGAGTTGATGGCAAATTGGTACAACAAGGTTAT



GAAAACTCTGCTGGAGGTGGAATTTCCATTAGTGGAGGAAGAGCTGCAA



AATATTGATCTCCGCCTCAGAGCAGCAGAGGAGACTTTGAACTGGAAAA



CAGAAGGCATTTGCGATTATGTCACTGAAATCACCAGTAGTATTCATGAT



CTTGAACAAAGAATTCAGAAAACTAAAGACAATGTGGAAGAGATCCAAA



ACATCATGAAAACATGGGTGACTCCAATATTTAAGACAAAAGATGGAAA



AAGGGAATCCCTTCTTTCTCTGGATGATCGGCATGATCGAATGGAAAAAT



ATTACAATCTCATCAAGGAATCTGGCCTTAAGATCCACGCCCTTGTTCAG



GAAAACCTGGGTCTATTTTCAGCAGACCCAACCTCCAATATCTGGAAGAC



TTATGTTAACTCTATTGACAATTTGTTGCTGAATGGATTCTTTCTTGCCATT



GAGTGCTCCCTCAAGTATCTTCTGGAAAATACTGAGTGTAAGGCAGGACT



TACCCCAATATTTGAAGCACAACTGAGTCTAGCCATCCCAGAGCTAGTTT



TCTATCCGTCTCTGGAGTCTGGAGTGAAGGGGGGTTTCTGTGACATTGTT



GAGGGTCTCATCACCAGCATTTTTAGGATACCATCTCTGGTGCCACGGCT



TTCCCCACAAAATGGCTCTCCTCACTATCAGGTCGACCTGGACGGTATAC



CAGATTTGGCAAACATGCGGCGCACACTCATGGAGAGAGTCCAGAGAAT



GATGGGCCTCTGCTGTGGCTATCAGAGCACCTTCAGCCAGTATTCGTACC



TCTATGTGGAGGACCGGAAGGAGGTTCTGGGTCAGTTTCTGCTGTACGGG



CACATCCTCACTCCGGAAGAAATTGAAGACCATGTGGAAGATGGCATCCC



AGAGAACCCTCCCCTCCTTTCTCAGTTTAAAGTGCAAATCGACTCCTATG



AAACGCTCTATGAAGAGGTGTGCAGGCTGGAACCCATCAAGGTGTTTGAC



GGCTGGATGAAAATTGATATTCGACCCTTTAAGGCATCTCTGCTGAATAT



TATTAAGAGGTGGAGCCTCCTGTTCAAACAGCATCTTGTGGACCACGTCA



CTCACAGCTTGGCCAACCTGGATGCGTTTATAAAGAAGAGTGAGAGCGG



CTTACTCAAGAAAGTTGAAAAAGGAGATTTCCAAGGCTTGGTTGAGATCA



TGGGACACCTTATGGCTGTTAAAGAACGGCAGAGTAACACTGATGAGAT



GTTTGAGCCCTTAAAGCAGACTATTGAATTGCTGAAGACCTATGAACAAG



AATTGCCAGAAACAGTGTTTAAGCAGCTGGAGGAGCTGCCTGAGAAATG



GAACAACATAAAAAAGGTGGCCATTACTGTGAAGCAGCAGGTGGCCCCA



CTGCAGGCAAATGAAGTGACACTCCTCCGCCAGAGGTGCACAGCCTTCGA



TGCAGAACAGCAGCAATTCTGGGAGCAATTCCACAAAGAAGCCCCGTTC



AGGTTTGATAGCATCCACCCTCATCAAATGCTGGATGCCAGGCACATCGA



GATCCAGCAGATGGAATCCACTATGGCCTCCATTTCTGAGTCTGCCAGCT



TATTTGAAGTCAATGTCCCTGACTATAAGCAGCTGAGGCAGTGCAGGAAG



GAGGTCTGCCAGCTGAAGGAGCTCTGGGACACCATTGGAATGGTGACCTC



CAGCATCCATGCCTGGGAGACCACACCCTGGAGGAATATCAACGTGGAA



GCCATGGAGTTGGAGTGCAAACAGTTTGCCCGGCATATCCGAAACCTGGA



CAAGGAGGTCAGGGCCTGGGATGCATTCACAGGCCTGGAAAGCACTGTG



TGGAACACGCTGAGCTCCCTGAGGGCAGTAGCTGAGCTGCAGAATCCAG



CCATCCGGGAGCGGCACTGGAGGCAGCTGATGCAGGCCACCGGTGTGAG



CTTCACTATGGACCAGGACACCACCCTAGCGCACCTGCTGCAGCTCCAGC



TGCACCACTATGAGGATGAGGTCCGGGGCATTGTGGACAAAGCTGCAAA



AGAGATGGGTATGGAGAAAACCTTAAAGGAGCTGCAGACTACCTGGGCT



GGCATGGAATTCCAGTATGAGCCCCACCCACGGACCAATGTCCCCCTCCT



GTGCTCTGATGAGGACCTCATAGAGGTTCTGGAGGATAATCAAGTTCAAC



TTCAGAACCTGGTGATGTCCAAGTATGTTGCTTTCTTCTTGGAGGAGGTGT



CGGGCTGGCAGAAGAAGCTGTCCACAGTGGACGCTGTCATCTCTATCTGG



TTTGAAGTGCAGCGAACATGGACTCACCTGGAAAGCATATTCACTGGATC



TGAAGATATTCGGGCACAGCTACCCCAGGATTCTAAAAGGTTTGAAGGCA



TCGACATTGACTTTAAAGAGCTAGCTTATGATGCCCAGAAAATTCCAAAT



GTAGTGCAAACCACCAACAAGCCAGGCCTGTATGAAAAGCTGGAGGATA



TTCAGGGCAGATTGTGCCTGTGTGAGAAGGCCCTGGCAGAGTACCTCGAC



ACCAAGAGGCTTGCCTTCCCGCGGTTTTACTTTCTCTCCTCCTCCGATCTG



TTAGACATCCTTTCCAACGGCACAGCTCCACAACAGGTTCAACGTCACCT



TTCCAAACTCTTTGACAACATGGCCAAGATGCGATTCCAGCTAGATGCCA



GTGGGGAACCAACCAAGACAAGCCTCGGCATGTACAGCAAAGAAGAGGA



GTATGTGGCTTTCAGTGAGCCCTGTGACTGCAGCGGGCAGGTAGAAATAT



GGCTGAACCATGTCCTTGGTCACATGAAGGCCACTGTGAGGCATGAGATG



ACAGAAGGTGTAACTGCCTATGAAGAAAAGCCGAGGGAGCAGTGGCTTT



TTGACCACCCAGCTCAGGTGGCCCTGACCTGTACTCAGATCTGGTGGACA



ACAGAAGTGGGCATGGCATTTGCCAGGCTGGAGGAAGGCTATGAGAGTG



CCATGAAGGACTATTATAAGAAGCAAGTGGCCCAGCTCAAAACCCTTATC



ACCATGCTGATTGGCCAGCTCTCCAAGGGAGACCGGCAGAAGATTATGA



CTATATGCACCATCGATGTGCATGCCCGGGATGTGGTAGCCAAGATGATT



GCTCAGAAGGTAGACAATGCCCAGGCTTTCCTCTGGCTGTCTCAGCTGCG



CCATCGTTGGGATGACGAGGTCAAACACTGCTTTGCCAACATCTGTGATG



CCCAGTTTTTGTATTCCTATGAGTACCTGGGAAACACACCTCGCTTGGTGA



TCACACCTTTGACTGACAGGTGCTACATCACCCTCACCCAGTCCCTGCAC



CTGACCATGAGTGGGGCTCCCGCAGGACCTGCAGGCACAGGCAAGACCG



AGACCACCAAGGACCTGGGCCGCGCACTGGGCATCCTGGTCTATGTGTTC



AACTGCTCGGAGCAGATGGATTACAAGTCTTGTGGCAACATCTACAAAGG



CCTTGCTCAGACTGGTGCCTGGGGCTGCTTTGATGAGTTTAATCGAATCTC



CGTGGAGGTCTTGTCAGTGGTGGCAGTGCAGGTAAAAAGCATTCAAGAT



GCGATTAGAGATAAGAAGCAGTGGTTCAGCTTCCTTGGGGAGGAGATCA



GCCTGAATCCTTCTGTCGGTATCTTCATCACCATGAACCCAGGCTATGCTG



GCCGCACAGAGCTGCCAGAGAATCTCAAGTCTCTCTTCAGGCCTTGTGCA



ATGGTGGTTCCAGACTTTGAATTGATCTGTGAAATCATGCTGGTGGCAGA



AGGATTCATTGAAGCCCAGTCATTAGCCAGAAAGTTCATCACTCTTTACC



AGTTGTGCAAAGAGCTTCTCTCCAAACAGGATCACTACGACTGGGGCCTA



CGGGCCATCAAGTCCGTGCTGGTGGTGGCAGGATCCCTGAAGAGAGGAG



ACCCTGACCGGCCTGAGGACCAGGTCCTGATGCGCTCCTTGCGGGATTTC



AACATCCCCAAGATTGTGACTGATGACATGCCCATCTTCATGGGCCTGAT



CGGGGACCTCTTTCCCGCCCTGGATGTCCCCCGGAGGAGAGACCCCAACT



TCGAAGCTTTGGTTAGGAAGGCGATAGTGGATCTGAAGCTCCAGGCTGAG



GACAACTTTGTGCTCAAGGTGGTCCAGCTGGAGGAGCTCCTGGCTGTGCG



GCACTCTGTATTTGTGGTGGGTGGCGCTGGTACCGGCAAGTCACAGGTGC



TGAGGTCCTTGCACAAGACCTATCAGATCATGAAACGGCGCCCCGTCTGG



ACTGACCTCAATCCCAAAGCAGTCACAAATGATGAGCTCTTTGGCATCAT



CAATCCAGCCACAGGAGAATGGAAGGATGGATTGTTCTCTTCCATCATGC



GGGAGCTTGCCAACATCACCCATGATGGGCCCAAGTGGATTTTACTGGAT



GGCGACATAGATCCAATGTGGATTGAATCCCTGAATACTGTCATGGATGA



TAACAAGGTGCTGACATTGGCCAGCAATGAGAGGATTCCTCTGAACCCCA



CCATGAAGCTCCTCTTTGAGATCAGCCACCTGCGCACAGCCACTCCAGCA



ACTGTCTCTAGAGCAGGGATCTTGTACATCAACCCGGCAGACTTGGGATG



GAACCCTCCAGTGAGCAGCTGGATTGAGAAGAGGGAAATCCAGACAGAG



AGAGCCAACTTAACCATTTTGTTCGACAAGTATCTTCCAACCTGCCTAGA



CACACTCAGAACCAGGTTTAAGAAGATCATTCCCATCCCAGAGCAGAGC



ATGGTTCAGATGGTGTGTCACCTTCTGGAATGTCTCCTGACCACGGAGGA



CATCCCTGCAGACTGCCCTAAGGAAATTTATGAGCATTATTTTGTGTTTGC



TGCCATCTGGGCTTTCGGCGGAGCAATGGTCCAAGATCAGCTTGTGGACT



ACCGGGCAGAGTTCAGCAAATGGTGGCTGACTGAGTTCAAAACAGTCAA



GTTTCCTTCCCAAGGAACCATCTTTGACTATTACATCGACCCAGAGACCA



AGAAATTCGAGCCTTGGTCCAAGCTCGTCCCCCAGTTCGAATTTGACCCC



GAGATGCCCTTGCAGGCGTGTTTGGTGCACACGAGTGAGACCATCCGTGT



GTGCTACTTCATGGAGCGGTTGATGGCGCGGCAGCGGCCTGTCATGCTGG



TGGGCACGGCTGGCACTGGCAAGTCGGTGCTGGTGGGAGCTAAGCTGGC



CAGCCTTGACCCCGAGGCATACCTGGTGAAAAACGTGCCATTCAACTACT



ACACCACGTCAGCAATGCTGCAGGCTGTCCTGGAGAAGCCTCTGGAAAA



GAAGGCTGGCAGAAACTATGGCCCTCCAGGGAACAAGAAACTCATCTAT



TTCATTGATGACATGAACATGCCTGAGGTGGATGCCTACGGGACGGTGCA



GCCCCACACCATCATCCGGCAGCATCTGGACTATGGCCACTGGTATGATC



GGAGCAAGCTGTCCCTAAAGGAGATCACAAATGTACAGTATGTTTCCTGT



ATGAACCCCACGGCAGGCAGCTTCACCATCAACCCCCGGCTTCAGCGTCA



CTTCAGCGTGTTTGTCCTCTCCTTCCCGGGGGCAGATGCCCTGTCCTCTAT



CTACAGCATCATCCTCACTCAGCATCTGAAGCTCGGAAACTTCCCGGCGT



CCCTGCAGAAATCCATCCCCCCACTGATCGATCTGGCCCTCGCCTTCCACC



AGAAAATTGCTACCACCTTCCTACCCACAGGAATCAAATTCCACTACATC



TTCAACCTCAGAGATTTTGCCAACATTTTCCAGGGCATTCTCTTCTCCTCA



GTGGAATGTGTGAAATCCACATGGGATCTTATAAGGCTCTATCTGCATGA



ATCAAATCGAGTTTATCGGGATAAGATGGTAGAAGAAAAGGACTTTGAT



CTTTTTGATAAAATCCAGACAGAAGTGCTCAAGAAAACTTTTGATGATAT



TGAAGACCCTGTGGAGCAGACCCAAAGCCCGAACCTGTATTGTCACTTTG



CAAATGGTATTGGGGAGCCCAAATACATGCCTGTACAGTCTTGGGAACTT



TTGACCCAGACTCTGGTGGAGGCCTTGGAGAACCACAATGAAGTCAACA



CAGTGATGGACCTAGTTCTCTTTGAGGATGCCATGCGCCATGTCTGCCAT



ATCAATCGCATCTTGGAGTCCCCGCGGGGAAATGCTCTGCTGGTTGGTGT



AGGTGGGAGCGGCAAGCAGAGCCTGACAAGGCTGGCAGCTTTCATCAGC



TCCATGGATGTCTTCCAGATCACACTGCGCAAAGGCTACCAGATCCAGGA



CTTCAAGATGGACCTGGCCAGCCTGTGTCTGAAAGCTGGAGTGAAGAATC



TCAACACAGTGTTTCTCATGACTGATGCCCAAGTGGCTGATGAGAGGTTC



CTTGTGCTCATCAATGATCTTTTGGCATCTGGGGAGATCCCAGATCTCTAC



TCTGATGATGAAGTTGAAAACATCATAAGCAATGTGAGGAATGAAGTCA



AGAGCCAGGGTCTGGTTGACAACAGAGAGAACTGTTGGAAGTTCTTTATA



GATCGGATCCGGCGACAGCTGAAGGTGACTCTCTGTTTCTCCCCTGTGGG



AAACAAGCTAAGAGTCCGCAGCAGGAAGTTCCCAGCCATTGTGAACTGC



ACAGCCATCCACTGGTTCCACGAGTGGCCTCAGCAAGCATTGGAGTCTGT



CAGCCTCCGCTTCTTGCAGAACACAGAGGGCATTGAGCCCACAGTAAAGC



AGTCGATTAGCAAATTCATGGCCTTTGTCCACACAAGTGTCAACCAAACA



TCCCAGTCTTATCTGAGCAATGAACAGCGCTACAACTATACAACTCCCAA



GTCCTTTCTGGAGTTCATCAGACTCTACCAGAGCTTGTTGCACAGGCACA



GAAAAGAGCTCAAGTGCAAGACAGAGCGGTTGGAGAACGGGCTGCTGAA



GCTGCATAGCACCTCTGCCCAGGTGGATGATCTGAAAGCAAAGCTGGCTG



CCCAGGAAGTAGAGCTGAAGCAGAAAAATGAAGATGCAGACAAACTGAT



TCAGGTCGTGGGTGTGGAGACTGACAAAGTGAGCAGAGAGAAAGCCATG



GCAGATGAAGAGGAGCAGAAGGTGGCCGTCATCATGCTAGAGGTGAAAC



AGAAGCAGAAGGACTGTGAGGAGGACCTGGCAAAGGCTGAGCCAGCACT



CACAGCAGCGCAGGCAGCTCTCAACACCCTGAACAAGACCAACCTGACA



GAGCTGAAGTCATTTGGCTCTCCGCCTCTGGCCGTCAGCAATGTCAGCGC



TGCGGTGATGGTACTGATGGCTCCCAGGGGTAGGGTGCCCAAGGACCGG



AGCTGGAAGGCTGCTAAGGTCACCATGGCCAAAGTGGATGGCTTCCTGG



ACTCGCTAATAAACTTCAACAAAGAGAACATTCACGAGAACTGCCTCAA



AGCCATCAGGCCGTATCTGCAAGACCCCGAGTTCAATCCTGAGTTTGTGG



CCACCAAATCCTATGCGGCTGCAGGCCTCTGCTCCTGGGTCATCAATATT



GTGAGATTTTATGAGGTGTTCTGTGATGTGGAACCCAAGCGCCAGGCACT



GAACAAAGCCACCGCGGACCTCACAGCTGCCCAGGAGAAGCTGGCTGCC



ATCAAAGCCAAGATCGCTCACCTTAATGAAAACCTGGCAAAGCTCACAG



CCAGGTTTGAGAAAGCAACAGCAGACAAACTCAAATGTCAGCAAGAAGC



CGAAGTGACCGCAGTCACCATCTCCCTTGCCAACCGCCTGGTTGGAGGAC



TCGCTTCTGAAAACGTGAGGTGGGCAGATGCCGTGCAGAACTTCAAACA



GCAGGAAAGGACGTTATGTGGAGACATTTTACTTATAACGGCTTTCATTT



CCTACCTTGGCTTCTTCACAAAGAAATACCGGCAGAGCCTCCTGGACAGA



ACTTGGAGGCCCTACCTGAGCCAGCTGAAAACTCCCATTCCAGTCACCCC



AGCCCTGGATCCCCTGAGGATGCTGATGGATGATGCTGACGTGGCTGCCT



GGCAGAACGAGGGCCTCCCAGCCGACCGCATGTCCGTGGAGAATGCCAC



CATTCTCATCAACTGTGAGCGCTGGCCACTCATGGTTGACCCTCAGCTAC



AAGGCATCAAATGGATCAAGAATAAATATGGTGAAGATCTCCGGGTCAC



GCAGATTGGTCAGAAAGGCTACCTTCAAATCATAGAGCAGGCCCTGGAA



GCTGGAGCTGTGGTGCTGATTGAAAATCTAGAGGAGTCCATTGATCCTGT



TCTGGGACCCCTGCTTGGGAGAGAAGTCATTAAAAAAGGACGATTCATTA



AAATTGGAGACAAAGAATGTGAATACAATCCCAAGTTCCGGCTCATCCTC



CACACCAAGCTGGCTAATCCTCACTACCAGCCTGAGCTGCAGGCTCAGGC



CACCCTGATCAACTTCACCGTGACCAGGGATGGCCTGGAGGACCAGTTGC



TGGCCGCTGTGGTCAGCATGGAGAGGCCAGACTTGGAGCAGCTGAAGTC



CGATCTCACAAAGCAGCAGAATGGATTCAAAATTACCCTGAAAACGTTG



GAAGACAGTCTTCTCTCTCGCCTCTCCTCCGCCTCTGGGAACTTCCTGGGA



GAAACAGTGCTGGTGGAAAACCTAGAGATCACCAAGCAGACTGCTGCCG



AAGTTGAGAAAAAGGTCCAGGAGGCCAAGGTGACTGAAGTGAAAATCAA



CGAGGCCCGAGAGCACTACCGGCCAGCAGCTGCCAGGGCCTCACTGCTCT



ACTTCATCATGAACGACCTCAGCAAGATCCATCCAATGTACCAGTTTTCT



CTCAAGGCCTTCAGTATCGTCTTCCAGAAGGCTGTGGAGAGGGCTGCTCC



TGACGAAAGCCTCAGGGAGCGGGTGGCCAACCTAATAGACAGCATAACC



TTCTCTGTGTACCAGTACACCATCCGCGGGCTCTTTGAGTGTGATAAGCTG



ACCTACCTTGCCCAGCTCACCTTTCAGATTCTCCTCATGAACCGAGAAGTC



AATGCAGTGGAGTTGGATTTCCTGCTTCGATCTCCAGTGCAGACGGGCAC



CGCCAGCCCCGTGGAGTTCCTCTCCCATCAGGCGTGGGGAGCTGTCAAGG



TACTTTCATCAATGGAAGAATTCTCTAATCTGGATCGGGACATAGAGGGA



TCTGCTAAGAGCTGGAAAAAGTTTGTGGAGTCCGAATGTCCTGAGAAAG



AGAAGCTCCCACAGGAGTGGAAGAACAAGACAGCCCTGCAGCGCCTCTG



CATGCTGAGAGCCATGCGGCCCGACCGGATGACCTATGCTTTGCGAGATT



TTGTTGAAGAGAAGTTAGGAAGCAAATACGTGGTGGGAAGAGCCCTAGA



TTTTGCAACCTCATTTGAAGAATCGGGACCAGCCACTCCTATGTTTTTCAT



CCTGTCTCCAGGGGTGGACCCACTGAAGGATGTAGAAAGTCAAGGAAGA



AAACTTGGATACACCTTCAACAATCAGAACTTTCACAACGTGTCTTTGGG



GCAAGGACAGGAAGTGGTGGCTGAGGCTGCGCTGGACCTCGCTGCCAAG



AAAGGTCACTGGGTTATTTTGCAGAACATTCACCTGGTGGCCAAGTGGCT



CAGCACCCTGGAGAAGAAGCTGGAGGAGCACAGTGAGAACAGCCACCCA



GAGTTCAGGGTCTTCATGAGTGCAGAGCCAGCACCCTCCCCTGAGGGCCA



CATCATCCCCCAGGGCATCCTGGAGAACTCCATTAAGATCACCAATGAGC



CCCCCACGGGCATGCATGCCAACCTGCACAAGGCCCTGGACAACTTCACT



CAGGACACTCTGGAGATGTGTTCTCGGGAGACGGAGTTTAAGAGCATCCT



CTTTGCTCTTTGTTACTTCCATGCGGTGGTGGCAGAAAGACGAAAATTTG



GGCCCCAGGGATGGAATCGCTCATACCCCTTTAACACTGGAGACCTCACT



ATCTCTGTGAATGTCCTCTACAACTTCCTGGAGGCCAACGCAAAGGTCCC



CTATGATGATTTGCGCTACCTGTTTGGAGAGATCATGTATGGAGGCCATA



TCACAGATGACTGGGACAGAAGACTCTGCAGAACCTACCTGGGGGAATT



CATTCGACCAGAAATGTTAGAAGGAGAACTGTCTTTGGCCCCAGGGTTCC



CACTCCCAGGCAACATGGACTACAATGGTTATCATCAGTACATCGATGCT



GAGCTGCCCCCAGAATCCCCCTACCTCTATGGCCTCCACCCGAACGCAGA



GATTGGCTTCCTGACCCAAACCTCAGAAAAGCTCTTCCGCACTGTGCTGG



AGCTGCAGCCTCGGGACAGCCAGGCCAGAGACGGAGCGGGCGCCACAAG



AGAAGAAAAGGTCAAGGCACTTCTGGAAGAAATATTGGAGCGGGTGACA



GACGAGTTTAACATCCCAGAACTGATGGCCAAAGTGGAGGAGCGCACCC



CTTACATTGTAGTTGCCTTCCAGGAGTGTGGCCGGATGAATATCCTCACC



AGAGAGATTCAGCGCTCACTGAGGGAGCTGGAGCTCGGCTTAAAGGGGG



AGCTGACTATGACCAGCCACATGGAGAACTTACAGAATGCCCTGTACTTC



GATATGGTGCCAGAGTCCTGGGCTAGACGAGCCTACCCTTCCACAGCAGG



CCTGGCAGCCTGGTTTCCAGACCTCCTCAACAGAATCAAGGAGCTAGAGG



CTTGGACGGGTGACTTTACAATGCCCTCCACTGTGTGGCTGACAGGCTTC



TTCAACCCCCAGTCGTTCCTGACTGCCATCATGCAGTCCACGGCTCGCAA



GAATGAGTGGCCACTGGACCAGATGGCCCTGCAATGTGACATGACGAAG



AAGAACAGAGAAGAGTTTAGGAGTCCTCCTCGGGAAGGGGCCTACATCC



ATGGCCTCTTCATGGAAGGTGCCTGCTGGGACACACAGGCTGGGATCATT



ACAGAGGCAAAGCTGAAGGATCTGACACCCCCTATGCCTGTGATGTTCAT



CAAGGCCATTCCTGCAGATAAGCAGGACTGCCGCAGTGTCTATTCCTGTC



CTGTGTACAAGACTAGTCAGCGGGGACCCACCTACGTGTGGACTTTCAAC



CTGAAGACTAAGGAAAACCCATCCAAGTGGGTTCTGGCTGGAGTAGCCTT



GCTTCTCCAGATTTAG





C10orf137
ATGGGGGATGCCAAGGAGGCCGGAGCCGAGGGTCCGCCGGCCGGGGCCG
SEQ ID


->
CCGCTCGAGGAGGGCTCAGCCTCCTGTCCCAGGGAGAATCCGAGGAATCT
NO: 31


LOC100169752
TCTGCACAGGGATCAGCTTTATTTCTTGGAGGCAATGAAGTGAAGAGCCG



AGCTGTGGTGAAATACTCTTCTGCCCCTCCTCGAACAGCATTTGCACGCCT



TGAAGAGAAAACAGACTTGAAACTCCCACCTGCCAACTGGTTACGAGAG



AGTGCCAAACTAGGGCCAGCAGGAACTACCATTCTTGGCAACAGCAAGA



AAAGCAAGCCATTTTCAAGCTTTGGCATGGCATATGACTTTATTGATTCA



GTGGGAAATGATGTGGATGTTGTCTCTGACTCTGAAGTGAGGAGAAGATG



CTGTTCTGAGAGCTGCTGATAATACGTGGATCCAAACCCACAGGCTTGAG



CTGCTTAATTGAAATTCACGTCAAAATGAAATTCTGGTTTAGCTCAACTTG



ACTACTATGGATGATGAAAATAAATCCCTGGGTCACACAAGGATAGTGAT



TGAGTAGAGCTTGTTTGCTGGCACAGGAATGACATTTCAACTGGAATTAA



ATTGTCGTCAGTTAACAGAAC





TFG->
ATGAACGGACAGTTGGATCTAAGTGGGAAGCTAATCATCAAAGCTCAAC
SEQ ID


GPR128
TTGGGGAGGATATTCGGCGAATTCCTATTCATAATGAAGATATTACTTAT
NO: 32



GATGAATTAGTGCTAATGATGCAACGAGTTTTCAGAGGAAAACTTCTGAG



TAATGATGAAGTAACAATAAAGTATAAAGATGAAGATGGAGATCTTATA



ACAATTTTTGATAGTTCTGACCTTTCCTTTGCAATTCAGTGCAGTAGGATA



CTGAAACTGACATTATTTGGAAAATCTACTTCCTCATCAAGCACCCCTAC



AGAGTTCTGCAGGAATGGTGGAACCTGGGAAAATGGCAGATGTATTTGT



ACAGAAGAGTGGAAAGGACTGAGATGTACAATTGCTAATTTTTGTGAAA



ATAGTACCTATATGGGTTTTACTTTTGCCAGAATCCCAGTGGGCAGATAT



GGACCATCCTTGCAAACATGTGGCAAGGATACTCCAAATGCGGGCAATCC



AATGGCAGTCCGGTTGTGCAGTCTCTCTCTATATGGAGAGATAGAATTAC



AAAAAGTGACAATAGGAAATTGCAATGAAAATCTGGAAACCCTGGAAAA



GCAGGTAAAGGATGTCACAGCACCACTTAATAACATTTCTTCTGAAGTCC



AGATTTTAACATCTGATGCCAATAAATTAACTGCTGAGAACATCACTAGT



GCTACGCGAGTGGTTGGACAGATATTCAACACTTCCAGAAATGCTTCACC



TGAGGCAAAGAAAGTTGCCATAGTAACAGTGAGTCAACTCCTAGATGCC



AGTGAAGATGCTTTTCAAAGAGTTGCTGCTACTGCTAATGATGATGCCCT



TACAACGCTTATTGAGCAAATGGAGACTTATTCCTTGTCTTTGGGTAATCA



ATCAGTGGTGGAACCTAACATAGCAATACAGTCAGCAAATTTCTCTTCAG



AAAATGCGGTGGGGCCTTCAAATGTTCGCTTCTCTGTGCAGAAAGGAGCT



AGCAGTTCTCTAGTTTCTAGTTCAACATTTATACATACAAATGTGGATGGC



CTTAACCCAGATGCACAGACTGAGCTTCAGGTCTTGCTTAATATGACGAA



AAATTACACCAAGACATGCGGCTTTGTAGTTTATCAAAATGACAAGCTTT



TCCAATCAAAAACTTTTACAGCTAAATCGGATTTTAGTCAAAAAATTATC



TCAAGCAAAACTGATGAAAATGAGCAAGATCAGAGTGCTTCTGTTGACAT



GGTCTTTAGTCCAAAGTACAACCAAAAAGAATTTCAACTCTATTCCTATG



CCTGTGTCTATTGGAATTTGTCAGCGAAGGACTGGGACACATATGGCTGT



CAAAAAGACAAGGGCACTGATGGATTCCTGCGCTGCCGCTGCAACCATA



CTACTAATTTTGCTGTATTAATGACTTTCAAAAAGGATTATCAATATCCCA



AATCACTTGACATATTATCCAACGTTGGATGTGCACTGTCTGTTACTGGTC



TGGCTCTCACAGTTATATTTCAGATTGTCACCAGGAAAGTCAGAAAAACC



TCAGTAACCTGGGTTTTGGTCAATCTGTGCATATCAATGTTGATTTTCAAC



CTCCTCTTTGTGTTTGGAATTGAAAACTCCAATAAGAACTTGCAGACAAG



TGATGGTGACATCAATAATATTGACTTTGACAATAATGACATACCCAGGA



CAGACACCATTAACATCCCGAATCCCATGTGCACTGCGATTGCCGCCTTA



CTGCACTATTTTCTGTTAGTGACATTTACCTGGAACGCACTCAGCGCTGCA



CAGCTCTATTACCTTCTAATAAGGACCATGAAGCCTCTTCCTCGGCATTTC



ATTCTTTTCATCTCATTAATTGGATGGGGAGTCCCAGCTATAGTAGTGGCT



ATAACAGTGGGAGTTATTTATTCTCAGAATGGAAATAATCCACAGTGGGA



ATTAGACTACCGGCAAGAGAAAATCTGCTGGCTGGCAATTCCAGAACCC



AATGGTGTTATAAAAAGTCCGCTGTTGTGGTCATTCATCGTACCTGTAAC



CATTATCCTCATCAGCAATGTTGTTATGTTTATTACAATCTCGATCAAAGT



GCTGTGGAAGAATAACCAGAACCTGACAAGCACAAAAAAAGTTTCATCC



ATGAAGAAGATTGTTAGCACATTATCTGTTGCAGTTGTTTTTGGAATTACC



TGGATTCTAGCATACCTGATGCTAGTTAATGATGATAGCATCAGGATCGT



CTTCAGCTACATATTCTGCCTTTTCAACACTACACAGGGATTGCAAATTTT



TATCCTGTACACTGTTAGAACAAAAGTCTTCCAGAGTGAAGCTTCCAAAG



TGTTGATGTTGCTATCGTCTATTGGGAGAAGGAAGTCATTGCCTTCAGTG



ACGCGGCCGAGGCTGCGTGTAAAGATGTATAATTTCCTCAGGTCATTGCC



AACCTTACATGAACGCTTTAGGCTACTGGAAACCTCTCCGAGTACTGAGG



AAATCACACTCTCTGAAAGTGACAATGCAAAGGAAAGCATCTAG





LRP5->
ATGGAGGCAGCGCCGCCCGGGCCGCCGTGGCCGCTGCTGCTGCTGCTGCT
SEQ ID


SLC22A24
GCTGCTGCTGGCGCTGTGCGGCTGCCCGGCCCCCGCCGCGGCCTCGCCGC
NO: 33



TCCTGCTATTTGCCAACCGCCGGGACGTACGGCTGGTGGACGCCGGCGGA



GTCAAGCTGGAGTCCACCATCGTGGTCAGCGGCCTGGAGGATGCGGCCG



CAGTGGACTTCCAGTTTTCCAAGGGAGCCGTGTACTGGACAGACGTGAGC



GAGGAGGCCATCAAGCAGACCTACCTGAACCAGACGGGGGCCGCCGTGC



AGAACGTGGTCATCTCCGGCCTGGTCTCTCCCGACGGCCTCGCCTGCGAC



TGGGTGGGCAAGAAGCTGTACTGGACGGACTCAGAGACCAACCGCATCG



AGGTGGCCAACCTCAATGGCACATCCCGGAAGGTGCTCTTCTGGCAGGAC



CTTGACCAGCCGAGGGCCATCGCCTTGGACCCCGCTCACGGGTACATGTA



CTGGACAGACTGGGGTGAGACGCCCCGGATTGAGCGGGCAGGGATGGAT



GGCAGCACCCGGAAGATCATTGTGGACTCGGACATTTACTGGCCCAATGG



ACTGACCATCGACCTGGAGGAGCAGAAGCTCTACTGGGCTGACGCCAAG



CTCAGCTTCATCCACCGTGCCAACCTGGACGGCTCGTTCCGGCAGAAGGT



GGTGGAGGGCAGCCTGACGCACCCCTTCGCCCTGACGCTCTCCGGGGACA



CTCTGTACTGGACAGACTGGCAGACCCGCTCCATCCATGCCTGCAACAAG



CGCACTGGGGGGAAGAGGAAGGAGATCCTGAGTGCCCTCTACTCACCCA



TGGACATCCAGGTGCTGAGCCAGGAGCGGCAGCCTTTCTTCCACACTCGC



TGTGAGGAGGACAATGGCGGCTGCTCCCACCTGTGCCTGCTGTCCCCAAG



CGAGCCTTTCTACACATGCGCCTGCCCCACGGGTGTGCAGCTGCAGGACA



ACGGCAGGACGTGTAAGGCAGCTTGTGAGATCCACCATGAAGAAGGAGT



TGGATGCAGTCCGAATTAAAACATCCATTTTTTCCCTGTTCCGTGCACCCA



AATTGCGAATGAGAGTCTTCGGCCTGTGCTTTGTGAGATTCGCAATCACT



GTACCCTTTTATGGCCTGATACTCAACTTGCAGCACTTAGGGAGCAATGT



CTCCCTGTTCCAGATTCTCTGTGGAGCTGTCACATTCACAGCCAGATGTGT



TTCCCTTTTGACACTGAATCATATGGGTCGTCGAATAAGCCAGATATTGTT



CACGTTCCCGGTGGGACTTTTCATTCTGGTCAACACCTTTTTGCCCCAAGA



AATGCAGATCCTGCGTGTGGTTTTAGCAACTTTGGGAATTGGTAGTGTTTC



TGCTGCTAGCAACAGTGCTTCTGTCCACCACAACGAGCTCGTCCCCACCA



TATTGAGGTCAACAGTTGCAGGAATCAATGCAGTGTCCGGTAGGACTGGG



GCAGCACTGGCTCCTCTGTTGATGACCTTAATGGCGTATTCTCCCCACCTA



CCCTGGATTTCCTATGGAGTCTTCCCCATCCTTGCTGTCCCTGTTATCCTCC



TCCTTCCAGAAACCAGGGATCTACCTCTTCCTAACACCATCCAGGATGTG



GAAAATGACAGAAAAGATTCAAGAAACATAAAGCAGGAAGATACTTGCA



TGAAAGTAACACAGTTTTAA





PPP2R2D
AGCGGACATCATTTCCACCGTTGAGTTTAATTACTCTGGAGATCTTCTTGC
SEQ ID


->
AACAGGAGACAAGGGCGGCAGAGTTGTTATTTTTCAGCGTGAACAAGAG
NO: 34


PANK1
AATAAAAGCCGCCCTCATTCTAGGGGAGAATATAATGTTTACAGCACCTT



TCAAAGTCATGAACCGGAGTTTGACTATTTGAAAAGTCTAGAAATTGAGG



AAAAAATTAATAAAATTAGGTGGTTACCACAACAGAATGCTGCTCATTTT



CTACTGTCTACAAATGATAAAACTATAAAATTATGGAAAATAAGTGAACG



GGATAAAAGAGCAGAAGGTTATAACCTGAAAGACGAAGATGGAAGACTT



CGAGACCCATTTAGGATCACGGCGCTACGGGTCCCAATATTGAAGCCCAT



GGATCTTATGGTAGAAGCGAGTCCACGGCGAATTTTTGCAAATGCTCACA



CATATCATATAAATTCCATTTCAGTAAATAGTGATCATGAAACATATCTTT



CTGCAGATGACCTGAGAATTAATTTATGGCACTTAGAAATCACAGATAGA



AGCTTTAACATCGTGGACATCAAGCCTGCTAACATGGAGGAGCTGACCGA



AGTCATCACTGCAGCCGAGTTCCACCCGCACCAGTGCAACGTGTTCGTCT



ACAGCAGTAGCAAAGGGACCATCCGCCTGTGTGACATGCGCTCCTCGGCC



CTGTGCGACAGACACTCCAAGTTTTTTGAAGAGCCTGAAGATCCCAGCAG



TAGGTCCTTCTTCTCAGAAATAATTTCATCCATATCCGATGTAAAATTCAG



TCATAGTGGGCGGTACATGATGACCAGAGACTACCTGTCGGTGAAGGTGT



GGGACCTCAACATGGAGAGCAGGCCGGTGGAGACCCACCAGGTCCACGA



GTACCTGCGCAGCAAGCTCTGCTCTCTCTATGAGAACGACTGCATCTTTG



ACAAGTTTGAGTGTTGCTGGAACGGTTCGGATAGGGTTATTTTGGAGCCG



TTGGGGCACTGTTGGAACTGTTCAAAATGACTGATGACAAGTAG





IGF1R->
ATGAAGTCTGGCTCCGGAGGAGGGTCCCCGACCTCGCTGTGGGGGCTCCT
SEQ ID


DCC
GTTTCTCTCCGCCGCGCTCTCGCTCTGGCCGACGAGTGGAGAAATCTGCG
NO: 35



GGCCAGGCATCGACATCCGCAACGACTATCAGCAGCTGAAGCGCCTGGA



GAACTGCACGGTGATCGAGGGCTACCTCCACATCCTGCTCATCTCCAAGG



CCGAGGACTACCGCAGCTACCGCTTCCCCAAGCTCACGGTCATTACCGAG



TACTTGCTGCTGTTCCGAGTGGCTGGCCTCGAGAGCCTCGGAGACCTCTT



CCCCAACCTCACGGTCATCCGCGGCTGGAAACTCTTCTACAACTACGCCC



TGGTCATCTTCGAGATGACCAATCTCAAGGATATTGGGCTTTACAACCTG



AGGAACATTACTCGGGGGGCCATCAGGATTGAGAAAAATGCTGACCTCT



GTTACCTCTCCACTGTGGACTGGTCCCTGATCCTGGATGCGGTGTCCAATA



ACTACATTGTGGGGAATAAGCCCCCAAAGGAATGTGGGGACCTGTGTCC



AGGGACCATGGAGGAGAAGCCGATGTGTGAGAAGACCACCATCAACAAT



GAGTACAACTACCGCTGCTGGACCACAAACCGCTGCCAGAAAATGTGCC



CAAGCACGTGTGGGAAGCGGGCGTGCACCGAGAACAATGAGTGCTGCCA



CCCCGAGTGCCTGGGCAGCTGCAGCGCGCCTGACAACGACACGGCCTGT



GTAGCTTGCCGCCACTACTACTATGCCGGTGTCTGTGTGCCTGCCTGCCCG



CCCAACACCTACAGGTTTGAGGGCTGGCGCTGTGTGGACCGTGACTTCTG



CGCCAACATCCTCAGCGCCGAGAGCAGCGACTCCGAGGGGTTTGTGATCC



ACGACGGCGAGTGCATGCAGGAGTGCCCCTCGGGCTTCATCCGCAACGG



CAGCCAGAGCATGTACTGCATCCCTTGTGAAGGTCCTTGCCCGAAGGTCT



GTGAGGAAGAAAAGAAAACAAAGACCATTGATTCTGTTACTTCTGCTCAG



ATGCTCCAAGGATGCACCATCTTCAAGGGCAATTTGCTCATTAACATCCG



ACGGGGGAATAACATTGCTTCAGAGCTGGAGAACTTCATGGGGCTCATCG



AGGTGGTGACGGGCTACGTGAAGATCCGCCATTCTCATGCCTTGGTCTCC



TTGTCCTTCCTAAAAAACCTTCGCCTCATCCTAGGAGAGGAGCAGCTAGA



AGGGTTTTCAAATTAAAGCTTTCACAGCACTGCGCTTCCTCTCAGAACCTT



CTGATGCCGTCACAATGCGGGGAGGAAATGTCCTCCTCGACTGCTCCGCG



GAGTCCGACCGAGGAGTTCCAGTGATCAAGTGGAAGAAAGATGGCATTC



ATCTGGCCTTGGGAATGGATGAAAGGAAGCAGCAACTTTCAAATGGGTCT



CTGCTGATACAAAACATACTTCATTCCAGACACCACAAGCCAGATGAGGG



ACTTTACCAATGTGAGGCATCTTTAGGAGATTCTGGCTCAATTATTAGTCG



GACAGCAAAAGTTGCAGTAGCAGGACCACTGAGGTTCCTTTCACAGACA



GAATCTGTCACAGCCTTCATGGGAGACACAGTGCTACTCAAGTGTGAAGT



CATTGGGGAGCCCATGCCAACAATCCACTGGCAGAAGAACCAACAAGAC



CTGACTCCAATCCCAGGTGACTCCCGAGTGGTGGTCTTGCCCTCTGGAGC



ATTGCAGATCAGCCGACTCCAACCGGGGGACATTGGAATTTACCGATGCT



CAGCTCGAAATCCAGCCAGCTCAAGAACAGGAAATGAAGCAGAAGTCAG



AATTTTATCAGATCCAGGACTGCATAGACAGCTGTATTTTCTGCAAAGAC



CATCCAATGTAGTAGCCATTGAAGGAAAAGATGCTGTCCTGGAATGTTGT



GTTTCTGGCTATCCTCCACCAAGTTTTACCTGGTTACGAGGCGAGGAAGT



CATCCAACTCAGGTCTAAAAAGTATTCTTTATTGGGTGGAAGCAACTTGC



TTATCTCCAATGTGACAGATGATGACAGTGGAATGTATACCTGTGTTGTC



ACATATAAAAATGAGAATATTAGTGCCTCTGCAGAGCTCACAGTCTTGGT



TCCGCCATGGTTTTTAAATCATCCTTCCAACCTGTATGCCTATGAAAGCAT



GGATATTGAGTTTGAATGTACAGTCTCTGGAAAGCCTGTGCCCACTGTGA



ATTGGATGAAGAATGGAGATGTGGTCATTCCTAGTGATTATTTTCAGATA



GTGGGAGGAAGCAACTTACGGATACTTGGGGTGGTGAAGTCAGATGAAG



GCTTTTATCAATGTGTGGCTGAAAATGAGGCTGGAAATGCCCAGACCAGT



GCACAGCTCATTGTCCCTAAGCCTGCTATCCCAAGCTCCAGTGTCCTCCCT



TCGGCTCCCAGAGATGTGGTCCCTGTCTTGGTTTCCAGCCGATTTGTCCGT



CTCAGCTGGCGCCCACCTGCAGAAGCGAAAGGGAACATTCAAACTTTCAC



GGTCTTTTTCTCCAGAGAAGGTGACAACAGGGAACGAGCATTGAATACA



ACACAGCCTGGGTCCCTTCAGCTCACTGTGGGAAACCTGAAGCCAGAAGC



CATGTACACCTTTCGAGTTGTGGCTTACAATGAATGGGGACCGGGAGAGA



GTTCTCAACCCATCAAGGTGGCCACACAGCCTGAGTTGCAAGTTCCAGGG



CCAGTAGAAAACCTGCAAGCTGTATCTACCTCACCTACCTCAATTCTTATT



ACCTGGGAACCCCCTGCCTATGCAAACGGTCCAGTCCAAGGTTACAGATT



GTTCTGCACTGAGGTGTCCACAGGAAAAGAACAGAATATAGAGGTTGAT



GGACTATCTTATAAACTGGAAGGCCTGAAAAAATTCACCGAATATAGTCT



TCGATTCTTAGCTTATAATCGCTATGGTCCGGGCGTCTCTACTGATGATAT



AACAGTGGTTACACTTTCTGACGTGCCAAGTGCCCCGCCTCAGAACGTCT



CCCTGGAAGTGGTCAATTCAAGAAGTATCAAAGTTAGCTGGCTGCCTCCT



CCATCAGGAACACAAAATGGATTTATTACCGGCTATAAAATTCGACACAG



AAAGACGACCCGCAGGGGTGAGATGGAAACACTGGAGCCAAACAACCTC



TGGTACCTATTCACAGGACTGGAGAAAGGAAGTCAGTACAGTTTCCAGGT



GTCAGCCATGACAGTCAATGGTACTGGACCACCTTCCAACTGGTATACTG



CAGAGACTCCAGAGAATGATCTAGATGAATCTCAAGTTCCTGATCAACCA



AGCTCTCTTCATGTGAGGCCCCAGACTAACTGCATCATCATGAGTTGGAC



TCCTCCCTTGAACCCAAACATCGTGGTGCGAGGTTATATTATCGGTTATG



GCGTTGGGAGCCCTTACGCTGAGACAGTGCGTGTGGACAGCAAGCAGCG



ATATTATTCCATTGAGAGGTTAGAGTCAAGTTCCCATTATGTAATCTCCCT



AAAAGCTTTTAACAATGCCGGAGAAGGAGTTCCTCTTTATGAAAGTGCCA



CCACCAGGTCTATAACCGATCCCACTGACCCAGTTGATTATTATCCTTTGC



TTGATGATTTCCCCACCTCGGTCCCAGATCTCTCCACCCCCATGCTCCCAC



CAGTAGGTGTACAGGCTGTGGCTCTTACCCATGATGCTGTGAGGGTCAGC



TGGGCAGACAACTCTGTCCCTAAGAACCAAAAGACGTCTGAGGTGCGAC



TTTACACCGTCCGGTGGAGAACCAGCTTTTCTGCAAGTGCAAAATACAAG



TCAGAAGACACAACATCTCTAAGTTACACAGCAACAGGCCTCAAACCAA



ACACAATGTATGAATTCTCGGTCATGGTAACAAAAAACAGAAGGTCCAG



TACTTGGAGCATGACTGCACATGCCACCACGTATGAAGCAGCCCCCACCT



CTGCTCCCAAGGACTTGACAGTCATTACTAGGGAAGGGAAGCCTCGTGCC



GTCATTGTGAGTTGGCAGCCTCCCTTGGAAGCCAATGGGAAAATTACTGC



TTACATCTTATTTTATACCTTGGACAAGAACATCCCAATTGATGACTGGAT



TATGGAAACAATCAGTGGTGATAGGCTTACTCATCAAATCATGGATCTCA



ACCTTGATACTATGTATTACTTTCGAATTCAAGCACGAAATTCAAAAGGA



GTGGGGCCACTCTCTGATCCTATCCTCTTCAGGACTCTGAAAGTGGAACA



CCCTGACAAAATGGCTAATGACCAAGGTCGTCATGGAGATGGAGGTTATT



GGCCAGTTGATACTAATTTGATTGATAGAAGCACCCTAAATGAGCCGCCA



ATTGGACAAATGCACCCCCCGCATGGCAGTGTCACTCCTCAGAAGAACAG



CAACCTGCTTGTGATCATTGTGGTCACCGTTGGTGTCATCACAGTGCTGGT



AGTGGTCATCGTGGCTGTGATTTGCACCCGACGCTCTTCAGCCCAGCAGA



GAAAGAAACGGGCCACCCACAGTGCTGGCAAAAGGAAGGGCAGCCAGA



AGGACCTCCGACCCCCTGATCTTTGGATCCATCATGAAGAAATGGAGATG



AAAAATATTGAAAAGCCATCTGGCACTGACCCTGCAGGAAGGGACTCTC



CCATCCAAAGTTGCCAAGACCTCACACCAGTCAGCCACAGCCAGTCAGA



AACCCAACTGGGAAGCAAAAGCACCTCTCATTCAGGTCAAGACACTGAG



GAAGCAGGGAGCTCTATGTCCACTCTGGAGAGGTCGCTGGCTGCACGCCG



AGCCCCCCGGGCCAAGCTCATGATTCCCATGGATGCCCAGTCCAACAATC



CTGCTGTCGTGAGCGCCATCCCGGTGCCAACGCTAGAAAGTGCCCAGTAC



CCAGGAATCCTCCCGTCTCCCACCTGTGGATATCCCCACCCGCAGTTCACT



CTCCGGCCTGTGCCATTCCCAACACTCTCAGTGGACCGAGGTTTCGGAGC



AGGAAGAAGTCAGTCAGTGAGTGAAGGACCAACTACCCAACAACCACCT



ATGCTGCCCCCATCTCAGCCTGAGCATTCTAGCAGCGAGGAGGCACCAAG



CAGAACCATCCCCACAGCTTGTGTTCGACCAACTCACCCACTCCGCAGCT



TTGCTAATCCTTTGCTACCTCCACCAATGAGTGCAATAGAACCGAAAGTC



CCTTACACACCACTTTTGTCTCAGCCAGGGCCCACTCTTCCTAAGACCCAT



GTGAAAACAGCCTCCCTTGGGTTGGCTGGAAAAGCAAGATCCCCTTTGCT



TCCTGTGTCTGTGCCAACAGCCCCTGAAGTGTCTGAGGAGAGCCACAAAC



CAACAGAGGATTCAGCCAATGTGTATGAACAGGATGATCTGAGTGAACA



AATGGCAAGTTTGGAAGGACTCATGAAGCAGCTTAATGCCATCACAGGCT



CAGCCTTTTAA





KIF16B->
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGCTT
SEQ ID


PCSK2
CCCTTTGCTGAAGGTCTGTACCACTTTTATCACAATGGCCTTGCAAAGGCC
NO: 36



AAGAGAAGACGCAGCCTACACCACAAGCAGCAGCTGGAGAGAGACCCCA



GGGTAAAGATGGCTTTGCAGCAGGAAGGATTTGACCGAAAAAAGCGAGG



TTACAGAGACATCAATGAGATCGACATCAACATGAACGATCCTCTTTTTA



CAAAGCAGTGGTATCTGATCAATACTGGGCAAGCTGATGGCACTCCTGGC



CTTGATTTGAATGTGGCTGAAGCCTGGGAGCTGGGATACACAGGGAAAG



GTGTTACCATTGGAATTATGGATGATGGGATTGACTATCTCCACCCGGAC



CTGGCCTCCAACTATAATGCCGAAGCAAGTTACGACTTCAGCAGCAACGA



CCCCTATCCTTACCCTCGGTACACAGATGACTGGTTTAACAGCCACGGGA



CCCGATGTGCAGGAGAAGTTTCTGCTGCCGCCAACAACAATATCTGTGGA



GTTGGAGTAGCATACAACTCCAAGGTTGCAGGCATCCGGATGCTGGACCA



GCCATTCATGACAGACATCATCGAGGCCTCCTCCATCAGTCATATGCCAC



AGCTGATTGACATCTACAGCGCCAGCTGGGGCCCCACAGACAACGGCAA



GACAGTGGATGGGCCCCGGGAGCTCACGCTGCAGGCCATGGCCGATGGC



GTGAACAAGGGCCGCGGCGGCAAAGGCAGCATCTACGTGTGGGCCTCCG



GGGACGGCGGCAGCTATGACGACTGCAACTGCGACGGCTACGCCTCCAG



CATGTGGACCATCTCCATCAACTCAGCCATCAACGACGGCAGGACTGCCC



TGTACGACGAGAGCTGCTCTTCCACCTTGGCTTCCACCTTCAGCAACGGG



AGGAAAAGGAACCCCGAGGCCGGTGTGGCAACCACAGATTTGTACGGCA



ACTGCACTCTGAGGCATTCTGGGACATCTGCAGCTGCCCCCGAGGCAGCT



GGTGTGTTTGCACTGGCTCTGGAGGCTAACCTGGGTCTGACCTGGCGGGA



CATGCAGCATCTGACTGTGCTCACCTCCAAACGGAACCAGCTTCACGACG



AGGTCCATCAGTGGCGGCGCAATGGGGTCGGCCTGGAATTTAATCACCTC



TTTGGCTACGGGGTCCTTGATGCAGGTGCCATGGTGAAAATGGCTAAAGA



CTGGAAAACCGTGCCTGAGAGATTCCACTGTGTGGGAGGCTCCGTGCAGG



ACCCTGAGAAAATACCATCCACTGGCAAGTTGGTGCTGACACTCACAACC



GACGCCTGTGAGGGGAAGGAAAATTTTGTCCGCTACCTGGAGCATGTCCA



GGCTGTCATCACGGTCAACGCAACCAGAAGAGGAGACCTGAACATCAAC



ATGACTTCCCCTATGGGCACCAAGTCCATTTTGCTGAGCCGGCGTCCAAG



GGATGACGACTCCAAGGTGGGCTTTGACAAGTGGCCTTTCATGACCACTC



ACACGTGGGGGGAAGACGCCCGAGGCACCTGGACCCTGGAGCTGGGATT



TGTCGGCAGCGCCCCGCAGAAGGGGGTGCTGAAGGAGTGGACCCTGATG



CTGCATGGCACTCAGAGTGCCCCGTACATCGACCAGGTGGTGCGGGATTA



CCAGTCCAAGTTGGCCATGTCCAAGAAAGAGGAGCTGGAGGAAGAGCTG



GACGAAGCCGTGGAGAGAAGCCTGAAAAGCATCCTTAACAAGAACTAG





ACSL3->
GTCCCAGGCGGTTCCGCTCAACAGACGCTGCTGTGGCTGCGCCGGGCTGC
SEQ ID


MOGAT1
GACACTGCAGTTGTCTACGCGGCCGGGGCCGGGACGAGGAGGCGTTGGA
NO: 37



CGGGGTCGCATACGTTCGTCCCCTCGCATTGCGGCCCCGACAGCTGCGCC



AGGATCCCCGGGCGGCGGCGCGGGGCGTGAACGCTCTGGGGCTCAGCCA



GGCCTGCGCGGGCCCGAGGCCGGAGGAACCCGGACTCCGGCGTAGCGGG



CCGATGTCCATTGGAATCACTGTGATGCTGATCATACACAACTATTTGTTC



CTTTACATCCCTTATTTGATGTGGCTTTACTTTGACTGGCATACCCCAGAG



CGAGGAGGCAGGAGATCCAGCTGGATCAAAAATTGGACTCTTTGGAAAC



ACTTTAAGGACTATTTTCCAATTCATCTTATCAAAACTCAAGATTTGGATC



CAAGTCACAACTATATATTTGGGTTTCACCCCCATGGAATAATGGCAGTT



GGAGCCTTTGGGAATTTTTCTGTAAATTATTCTGACTTCAAGGACCTGTTT



CCTGGCTTTACTTCATATCTTCACGTGCTGCCACTTTGGTTCTGGTGTCCT



GTCTTTCGAGAATATGTGATGAGTGTTGGGCTGGTTTCAGTTTCCAAGAA



AAGTGTGTCCTACATGGTAAGCAAGGAGGGAGGTGGAAACATCTCTGTC



ATTGTCCTTGGGGGTGCAAAAGAATCACTGGATGCTCATCCTGGAAAGTT



CACTCTGTTCATCCGCCAGCGGAAAGGATTTGTTAAAATTGCTTTGACCC



ATGGCGCCTCTCTGGTCCCAGTGGTTTCTTTTGGTGAAAATGAACTGTTTA



AACAAACTGACAACCCTGAAGGATCATGGATTAGAACTGTTCAGAATAA



ACTGCAGAAGATCATGGGGTTTGCTTTGCCCCTGTTTCATGCCAGGGGAG



TTTTTCAGTACAATTTTGGCCTAATGACCTATAGGAAAGCCATCCACACT



GTTGTTGGCCGCCCGATCCCTGTTCGTCAGACTCTGAACCCGACCCAGGA



GCAGATTGAGGAGTTACATCAGACCTATATGGAGGAACTTAGGAAATTGT



TTGAGGAACACAAAGGAAAGTATGGCATTCCAGAGCACGAGACTCTTGT



TTTAAAATGACTTGACTATAAAAAAAAATTAAAAAATAAAAATAAATGA



CT





DLG5->
ATGGAGCCCCAGCGCCGGGAGCTGCTCGCCCAGTGTCAGCAGAGCCTGG
SEQ ID


ADK
CCCAGGCCATGACGGAGGTGGAAGCCGTGCTCGGGCTGCTCGAGGCCGC
NO: 38



GGGAGCGCTCAGTCCCGGCGAGCGGCGGCAGCTGGACGAGGAGGCGGGA



GGCGCCAAGGCGGAGCTGCTGCTCAAGCTGCTCTTGGCCAAGGAGCGGG



ACCACTTCCAGGACCTGCGGGCGGCGCTGGAGAAGACGCAGCCTCACCT



GCTGCCCATTCTCTACCTGAACGGCGTCGTCGGGCCGCCGCAGCCCGCCG



AAGGCGCGGGTTCTACCTACAGCGTCCTGTCCACCATGCCCTCAGACTCA



GAAAGCAGCAGCTCCCTCAGCAGTGTGGGCACTACCGGGAAGGCGCCGT



CCCCACCACCCCTCCTCACTGACCAGCAAGTGAATGAGAAGGTGGAGAA



CCTCTCCATTCAGCTGCGGCTGATGACCCGGGAGAGAAACGAGCTCCGCA



AGCGCCTGGCCTTTGCTACGCATGGCACGGCCTTTGACAAGAGGCCCTAC



CACAGGCTGAATCCTGACTATGAGAGGCTGAAGATCCAGTGCGTGCGAG



CCATGTCGGACCTGCAGAGCCTGCAGAACCAGCACACCAACGCCTTGAA



GAGGTGTGAGGAGGTGGCCAAGGAGACTGACTTCTACCACACACTCCAC



AGCCGGCTCCTGAGTGACCAGACTCGGCTGAAGGATGACGTGGACATGC



TGAGGCGGGAGAATGGGCAGCTGCTGCGGGAGCGAAACCTGCTGCAGCA



GTCATGGGAGGACATGAAGCGGCTCCACGAGGAGGACCAGAAGGAGATC



GGTGACCTCCGTGCCCAGCAGCAGCAGTGGATGATTCAACAGCCACACA



AAGCAGCAACATTTTTTGGATGCATTGGGATAGATAAATTTGGGGAGATC



CTGAAGAGAAAAGCTGCTGAAGCCCATGTGGATGCTCATTACTACGAGC



AGAATGAGCAGCCAACAGGAACTTGTGCTGCATGCATCACTGGTGACAA



CAGGTCCCTCATAGCTAATCTTGCTGCTGCCAATTGTTATAAAAAGGAAA



AACATCTTGATCTGGAGAAAAACTGGATGTTGGTAGAAAAAGCAAGAGT



TTGTTATATAGCAGGCTTTTTTCTTACAGTTTCCCCAGAGTCAGTATTAAA



GGTGGCTCACCATGCTTCTGAAAACAACAGGATTTTCACTTTGAATCTAT



CTGCACCGTTTATTAGCCAGTTCTACAAGGAATCATTGATGAAAGTTATG



CCTTATGTTGATATACTTTTTGGAAATGAGACAGAAGCTGCCACTTTTGCT



AGAGAGCAAGGCTTTGAGACTAAAGACATTAAAGAGATAGCCAAAAAGA



CACAAGCCCTGCCAAAGATGAACTCAAAGAGGCAGCGAATCGTGATCTT



CACCCAAGGGAGAGATGACACTATAATGGCTACAGAAAGTGAAGTCACT



GCTTTTGCTGTCTTGGATCAAGACCAGAAAGAAATTATTGATACCAATGG



AGCTGGAGATGCATTTGTTGGAGGTTTTCTGTCTCAACTGGTCTCTGACAA



GCCTCTGACTGAATGTATCCGTGCTGGCCACTATGCAGCAAGCATCATAA



TTAGACGGACTGGCTGCACCTTTCCTGAGAAGCCAGACTTCCACTGA





VDAC3->
ATGTGTAACACACCAACGTACTGTGACCTAGGAAAGGCTGCTAAGGATGT
SEQ ID


IL1RAPL1
CTTCAACAAAGGATATGGCTTTGGCATGGTCAAGATAGACCTGAAAACCA
NO: 39



AGTCTTGTAGTGGAGTGGAATTTTCTACTTCTGGTCATGCTTACACTGATA



CAGGGAAAGCATCAGGCAACCTAGAAACCAAATATAAGGTCTGTAACTA



TGGACTTACCTTCACCCAGAAATGGAACACAGACAATACTCTAGGGACA



GAAATCTCTTGGGAGAATAAGCCGATGGATGCACTGACTGGTCTATCGAT



ATCAAGAAATATCAAGTTTTGGTGGGAGAGCCTGTTCGAATCAAATGTGC



ACTCTTTTATGGTTATATCAGAACAAATTACTCCCTTGCCCAAAGTGCTGG



ACTCAGTTTGATGTGGTACAAAAGTTCTGGTCCTGGAGACTTTGAAGAGC



CAATAGCCTTTGACGGAAGTAGAATGAGCAAAGAAGAAGACTCCATTTG



GTTCCGGCCAACATTGCTACAGGACAGTGGTCTCTACGCCTGTGTCATCA



GAAACTCCACTTACTGTATGAAAGTATCCATCTCACTGACAGTGGGTGAA



AATGACACTGGACTCTGCTATAATTCCAAGATGAAGTATTTTGAAAAAGC



TGAACTTAGCAAAAGCAAGGAAATTTCATGCCGTGACATAGAGGATTTTC



TACTGCCAACCAGAGAACCTGAAATCCTTTGGTACAAGGAATGCAGGAC



AAAAACATGGAGGCCAAGTATTGTATTCAAAAGAGATACTCTGCTTATAA



GAGAAGTCAGAGAAGATGACATTGGAAATTATACCTGTGAATTAAAATA



TGGAGGCTTTGTTGTGAGAAGAACTACTGAATTAACTGTTACAGCCCCTC



TGACTGATAAGCCACCCAAGCTTTTGTATCCTATGGAAAGTAAACTGACA



ATTCAGGAGACCCAGCTGGGTGACTCTGCTAATCTAACCTGCAGAGCTTT



CTTTGGGTACAGCGGAGATGTCAGTCCTTTAATTTACTGGATGAAAGGAG



AAAAATTTATTGAAGATCTGGATGAAAATCGAGTTTGGGAAAGTGACATT



AGAATTCTTAAGGAGCATCTTGGGGAACAGGAAGTTTCCATCTCATTAAT



TGTGGACTCTGTGGAAGAAGGTGACTTGGGAAATTACTCCTGTTATGTTG



AAAATGGAAATGGACGTCGACACGCCAGCGTTCTCCTTCATAAACGAGA



GCTAATGTACACAGTGGAACTTGCTGGAGGCCTTGGTGCTATACTCTTGC



TGCTTGTATGTTTGGTGACCATCTACAAGTGTTACAAGATAGAAATCATG



CTCTTCTACAGGAATCATTTTGGAGCTGAAGAGCTCGATGGAGACAATAA



AGATTATGATGCATACTTATCATACACCAAAGTGGATCCTGACCAGTGGA



ATCAAGAGACTGGGGAAGAAGAACGTTTTGCCCTTGAAATCCTACCTGAT



ATGCTTGAAAAGCATTATGGATATAAGTTGTTTATACCAGATAGAGATTT



AATCCCAACTGGAACATACATTGAAGATGTGGCAAGATGTGTAGATCAA



AGCAAGCGGCTGATTATTGTCATGACCCCAAATTACGTAGTTAGAAGGGG



CTGGAGCATCTTTGAGCTGGAAACCAGACTTCGAAATATGCTTGTGACTG



GAGAAATTAAAGTGATTCTAATTGAATGCAGTGAACTGAGAGGAATTAT



GAACTACCAGGAGGTGGAGGCCCTGAAGCACACCATCAAGCTCCTGACG



GTCATTAAATGGCATGGACCAAAATGCAACAAGTTGAACTCCAAGTTCTG



GAAACGTTTACAGTATGAAATGCCTTTTAAGAGGATAGAACCCATTACAC



ATGAGCAGGCTTTAGATGTCAGTGAGCAAGGGCCTTTTGGGGAGCTGCAG



ACTGTCTCGGCCATTTCCATGGCCGCGGCCACCTCCACAGCTCTAGCCAC



TGCCCATCCAGATCTCCGTTCTACCTTTCACAACACGTACCATTCACAAAT



GCGTCAGAAACACTACTACCGAAGCTATGAGTACGACGTACCTCCTACCG



GCACCCTGCCTCTTACCTCCATAGGCAATCAGCATACCTACTGTAACATC



CCTATGACACTCATCAACGGGCAGCGGCCACAGACAAAATCGAGCAGGG



AGCAGAATCCAGATGAGGCCCACACAAACAGTGCCATCCTGCCGCTGTTG



CCAAGGGAGACCAGTATATCCAGTGTGATATGGTGA





ERBB2->
ATGGAGCTGGCGGCCTTGTGCCGCTGGGGGCTCCTCCTCGCCCTCTTGCC
SEQ ID


IKZF3
CCCCGGAGCCGCGAGCACCCAAGTGTGCACCGGCACAGACATGAAGCTG
NO: 40



CGGCTCCCTGCCAGTCCCGAGACCCACCTGGACATGCTCCGCCACCTCTA



CCAGGGCTGCCAGGTGGTGCAGGGAAACCTGGAACTCACCTACCTGCCC



ACCAATGCCAGCCTGTCCTTCCTGCAGGATATCCAGGAGGTGCAGGGCTA



CGTGCTCATCGCTCACAACCAAGTGAGGCAGGTCCCACTGCAGAGGCTGC



GGATTGTGCGAGGCACCCAGCTCTTTGAGGACAACTATGCCCTGGCCGTG



CTAGACAATGGAGACCCGCTGAACAATACCACCCCTGTCACAGGGGCCTC



CCCAGGAGGCCTGCGGGAGCTGCAGCTTCGAAGCCTCACAGAGATCTTG



AAAGGAGGGGTCTTGATCCAGCGGAACCCCCAGCTCTGCTACCAGGACA



CGATTTTGTGGAAGGACATCTTCCACAAGAACAACCAGCTGGCTCTCACA



CTGATAGACACCAACCGCTCTCGGGCCTGCCACCCCTGTTCTCCGATGTG



TAAGGGCTCCCGCTGCTGGGGAGAGAGTTCTGAGGATTGTCAGAGCCATG



ATTCAATGAAAGTGAAAGATGAATACAGTGAAAGAGATGAGAATGTTTT



AAAGTCAGAACCCATGGGAAATGCAGAAGAGCCTGAAATCCCTTACAGC



TATTCAAGAGAATATAATGAATATGAAAACATTAAGTTGGAGAGACATG



TTGTCTCATTCGATAGTAGCAGGCCAACCAGTGGAAAGATGAACTGCGAT



GTGTGTGGATTATCCTGCATCAGCTTCAATGTCTTAATGGTTCATAAGCGA



AGCCATACTGGTGAACGCCCATTCCAGTGTAATCAGTGTGGGGCATCTTT



TACTCAGAAAGGTAACCTCCTCCGCCACATTAAACTGCACACAGGGGAA



AAACCTTTTAAGTGTCACCTCTGCAACTATGCATGCCAAAGAAGAGATGC



GCTCACGGGGCATCTTAGGACACATTCTGTGGAGAAACCCTACAAATGTG



AGTTTTGTGGAAGGAGTTACAAGCAGAGAAGTTCCCTTGAGGAGCACAA



GGAGCGCTGCCGTACATTTCTTCAGAGCACTGACCCAGGGGACACTGCAA



GTGCGGAGGCAAGACACATCAAAGCAGAGATGGGAAGTGAAAGAGCTCT



CGTACTGGACAGATTAGCAAGCAATGTGGCAAAACGAAAAAGCTCAATG



CCTCAGAAATTCATTGGTGAGAAGCGCCACTGCTTTGATGTCAACTATAA



TTCAAGTTACATGTATGAGAAAGAGAGTGAGCTCATACAGACCCGCATG



ATGGACCAAGCCATCAATAACGCCATCAGCTATCTTGGCGCCGAAGCCCT



GCGCCCCTTGGTCCAGACACCGCCTGCTCCCACCTCGGAGATGGTTCCAG



TTATCAGCAGCATGTATCCCATAGCCCTCACCCGGGCTGAGATGTCAAAC



GGTGCCCCTCAAGAGCTGGAAAAGAAAAGCATCCACCTTCCAGAGAAGA



GCGTGCCTTCTGAGAGAGGCCTCTCTCCCAACAATAGTGGCCACGACTCC



ACGGACACTGACAGCAACCATGAAGAACGCCAGAATCACATCTATCAGC



AAAATCACATGGTCCTGTCTCGGGCCCGCAATGGGATGCCACTTCTGAAG



GAGGTTCCCCGCTCTTACGAACTCCTCAAGCCCCCGCCCATCTGCCCAAG



AGACTCCGTCAAAGTGATCAACAAGGAAGGGGAGGTGATGGATGTGTAT



CGGTGTGACCACTGCCGCGTCCTCTTCCTGGACTATGTGATGTTCACGATT



CACATGGGCTGCCACGGCTTCCGTGACCCTTTCGAGTGTAACATGTGTGG



ATATCGAAGCCATGATCGGTATGAGTTCTCGTCTCACATAGCCAGAGGAG



AACACAGAGCCCTGCTGAAGTGA





ERBB2->
ATGGAGCTGGCGGCCTTGTGCCGCTGGGGGCTCCTCCTCGCCCTCTTGCC
SEQ ID


IKZF3
CCCCGGAGCCGCGAGCACCCAAGTGTGCACCGGCACAGACATGAAGCTG
NO: 41



CGGCTCCCTGCCAGTCCCGAGACCCACCTGGACATGCTCCGCCACCTCTA



CCAGGGCTGCCAGGTGGTGCAGGGAAACCTGGAACTCACCTACCTGCCC



ACCAATGCCAGCCTGTCCTTCCTGCAGGATATCCAGGAGGTGCAGGGCTA



CGTGCTCATCGCTCACAACCAAGTGAGGCAGGTCCCACTGCAGAGGCTGC



GGATTGTGCGAGGCACCCAGCTCTTTGAGGACAACTATGCCCTGGCCGTG



CTAGACAATGGAGACCCGCTGAACAATACCACCCCTGTCACAGGGGCCTC



CCCAGGAGGCCTGCGGGAGCTGCAGCTTCGAAGCCTCACAGAGATCTTG



AAAGGAGGGGTCTTGATCCAGCGGAACCCCCAGCTCTGCTACCAGGACA



CGATTTTGTGGAAGGACATCTTCCACAAGAACAACCAGCTGGCTCTCACA



CTGATAGACACCAACCGCTCTCGGGCCTGCCACCCCTGTTCTCCGATGTG



TAAGGGCTCCCGCTGCTGGGGAGAGAGTTCTGAGGATTGTCAGAGCCTGA



CGCGCACTGTCTGTGCCGGTGGCTGTGCCCGCTGCAAGGGGCCACTGCCC



ACTGACTGCTGCCATGAGCAGTGTGCTGCCGGCTGCACGGGCCCCAAGCA



CTCTGACTGCCTGGCCTGCCTCCACTTCAACCACAGTGGCATCTGTGAGCT



GCACTGCCCAGCCCTGGTCACCTACAACACAGACACGTTTGAGTCCATGC



CCAATCCCGAGGGCCGGTATACATTCGGCGCCAGCTGTGTGACTGCCTGT



CCCTACAACTACCTTTCTACGGACGTGGGATCCTGCACCCTCGTCTGCCCC



CTGCACAACCAAGAGGTGACAGCAGAGGATGGAACACAGCGGTGTGAGA



AGTGCAGCAAGCCCTGTGCCCGAGATGATTCAATGAAAGTGAAAGATGA



ATACAGTGAAAGAGATGAGAATGTTTTAAAGTCAGAACCCATGGGAAAT



GCAGAAGAGCCTGAAATCCCTTACAGCTATTCAAGAGAATATAATGAAT



ATGAAAACATTAAGTTGGAGAGACATGTTGTCTCATTCGATAGTAGCAGG



CCAACCAGTGGAAAGATGAACTGCGATGTGTGTGGATTATCCTGCATCAG



CTTCAATGTCTTAATGGTTCATAAGCGAAGCCATACTGGTGAACGCCCAT



TCCAGTGTAATCAGTGTGGGGCATCTTTTACTCAGAAAGGTAACCTCCTC



CGCCACATTAAACTGCACACAGGGGAAAAACCTTTTAAGTGTCACCTCTG



CAACTATGCATGCCAAAGAAGAGATGCGCTCACGGGGCATCTTAGGACA



CATTCTGTGGAGAAACCCTACAAATGTGAGTTTTGTGGAAGGAGTTACAA



GCAGAGAAGTTCCCTTGAGGAGCACAAGGAGCGCTGCCGTACATTTCTTC



AGAGCACTGACCCAGGGGACACTGCAAGTGCGGAGGCAAGACACATCAA



AGCAGAGATGGGAAGTGAAAGAGCTCTCGTACTGGACAGATTAGCAAGC



AATGTGGCAAAACGAAAAAGCTCAATGCCTCAGAAATTCATTGGTGAGA



AGCGCCACTGCTTTGATGTCAACTATAATTCAAGTTACATGTATGAGAAA



GAGAGTGAGCTCATACAGACCCGCATGATGGACCAAGCCATCAATAACG



CCATCAGCTATCTTGGCGCCGAAGCCCTGCGCCCCTTGGTCCAGACACCG



CCTGCTCCCACCTCGGAGATGGTTCCAGTTATCAGCAGCATGTATCCCAT



AGCCCTCACCCGGGCTGAGATGTCAAACGGTGCCCCTCAAGAGCTGGAA



AAGAAAAGCATCCACCTTCCAGAGAAGAGCGTGCCTTCTGAGAGAGGCC



TCTCTCCCAACAATAGTGGCCACGACTCCACGGACACTGACAGCAACCAT



GAAGAACGCCAGAATCACATCTATCAGCAAAATCACATGGTCCTGTCTCG



GGCCCGCAATGGGATGCCACTTCTGAAGGAGGTTCCCCGCTCTTACGAAC



TCCTCAAGCCCCCGCCCATCTGCCCAAGAGACTCCGTCAAAGTGATCAAC



AAGGAAGGGGAGGTGATGGATGTGTATCGGTGTGACCACTGCCGCGTCC



TCTTCCTGGACTATGTGATGTTCACGATTCACATGGGCTGCCACGGCTTCC



GTGACCCTTTCGAGTGTAACATGTGTGGATATCGAAGCCATGATCGGTAT



GAGTTCTCGTCTCACATAGCCAGAGGAGAACACAGAGCCCTGCTGAAGT



GA





EIF4A3->
ATGGCGACCACGGCCACGATGGCGACCTCGGGCTCGGCGCGAAAGCGGC
SEQ ID


TSPEAR
TGCTCAAAGAGGAAGACATGACTAAAGTGGAATTCGAGACCAGCGAGGA
NO: 42



GGTGGATGTGACCCCCACGTTCGACACCATGGGCCTGCGGGAGGACCTGC



TGCGGGGCATCTACGCTTACGAGGAACGAGTACCTGCTGACGGTGGTGGC



AGAGGAGAGCGACCTGCTGCTGCTCGGCCTGCGGTTGTCACCTGCCCAGC



TGCACTTCCTGTTCCTTCGCGAGGACACGGCCGGCGCCTGGCAGACCCGA



GTGTCCTTCCGCAGCCCGGCCCTGGTGGATGGCCGCTGGCACACACTGGT



CCTGGCTGTGTCCGCAGGCGTCTTCTCCCTCACCACGGACTGCGGCCTCCC



GGTGGACATAATGGCCGATGTGCCCTTCCCAGCCACCCTGTCAGTGAAAG



GAGCTCGATTCTTCGTCGGCAGCCGGAGGAGAGCCAAAGGCCTGTTCATG



GGACTGGTGAGGCAACTGGTCCTGCTGCCGGGCTCAGACGCCACCCCAA



GGCTGTGTCCCAGCAGGAACGCCCCGCTGGCGGTGCTGTCCATCCCACGG



GTCCTGCAGGCTCTCACGGGGAAGCCAGAAGATAACGAGGTGCTAAAAT



ATCCCTATGAAACCAACATTCGAGTGACGCTGGGACCCCAGCCACCGTGT



ACCGAGGTGGAAGACGCCCAGTTCTGGTTTGATGCCAGCCGGAAGGGCC



TGTATCTGTGTGTTGGCAACGAGTGGGTCTCCGTGTTAGCAGCCAAAGAA



AGACTGGACTACGTGGAGGAGCATCAGAACTTGTCCACCAACTCAGAGA



CCCTGGGCATTGAGGTGTTCCGCATCCCTCAGGTGGGGCTCTTTGTGGCC



ACAGCCAATCGCAAAGCCACATCCGCCGTCTACAAGTGGACCGAAGAGA



AGTTCGTCTCATATCAGAACATCCCCACGCACCAAGCACAGGCCTGGAGG



CATTTCACCATCGGGAAAAAGATCTTCCTGGCAGTGGCTAATTTTGAACC



AGATGAGAAGGGTCAGGAGTTCTCTGTCATTTACAAATGGAGCCACAGA



AAGCTGAAGTTTACCCCATATCAGAGCATTGCCACACACAGCGCCCGAGA



CTGGGAGGCCTTCGAGGTGGATGGGGAGCACTTCCTGGCGGTGGCCAAC



CACCGGGAAGGCGACAACCACAACATCGACAGTGTCATCTACAAGTGGA



ACCCGGCAACCCGGCTCTTCGAGGCCAACCAGACCATCGCCACCTCCGGC



GCCTACGACTGGGAGTTCTTCAGTGTGGGGCCCTACTCGTTCCTGGTGGT



GGCCAACACCTTCAACGGCACCTCCACCAAGGTGCACTCGCACCTCTACA



TCCGACTCCTGGGCTCCTTCCAGCTCTTCCAGTCCTTCCCGACGTTCGGTG



CTGCAGACTGGGAGGTCTTCCAGATCGGGGAGAGGATCTTCCTCGCTGTG



GCAAACAGTCACAGCTACGATGTGGAGATGCAAGTCCAGAATGATTCCT



ATGTCATCAACTCCGTCATCTACGAGCTGAACGTGACCGCGCAGGCCTTT



GTCAAGTTCCAGGACATTCTCACCTGCAGTGCTCTGGACTGGGAGTTTTTC



TCGGTGGGAGAAGATTATTTCCTGGTGGTGGCCAACTCCTTCGATGGGCG



TACCTTCTCGGTGAACAGTATTATTTACAGGTGGCAGGGCTACGAGGGCT



TCGTGGCGGTGCACAGCCTCCCCACCGTCGGCTGCAGGGACTGGGAGGCC



TTCAGCACCACGGCTGGTGCCTACCTCATCTACTCCAGCGCCAAGGAGCC



CCTCTCCAGGGTCCTGCGGCTGAGGACACGCTGA





BCL7A->
ATGTCGGGCAGGTCGGTTCGAGCCGAGACGAGGAGCCGGGCCAAAGATG
SEQ ID


C12orf42
ATATCAAGAGGGTCATGGCGGCGATCGAGAAAGTGCGCAAATGGGAGAA
NO: 43



GAAATGGGTGACCGTTGGTGACACATCCCTACGAATCTACAAATGGGTCC



CTGTGACGGAGCCCAAGGTTGATGACAAAAACAAGAATAAGAAAAAAGG



CAAGGACGAGAAGTGTGGCTCAGAGGTGACCACTCCGGAGAACAGTTCC



TCCCCAGGGATGATGGACATGCATGATGTCTACAGTGATATGTATGAAAC



AAAGGGAAGAAGAATTCTTGCTAACCATCAGACCTTTTGCAAACAGGAT



GCAGAAATCCCCTTGCTATATTCCCATTGTGAGCAGTGCCACCCTGTGGG



ATAGAAGCACACCCAGTGCAAAGCACATCCCTTGTTATGAAAGAACTTCA



GTACCCTGCTCCAGATTCATTAATCACATGAAGAATTTCTCTGAATCTCCT



AAATTTCGTAGTCTACACTTTCTGAATTTTCCAGTATTTCCAGAAAGGACT



CAAAATTCAATGGCGTGTAAAAGACTACTTCATACTTGCCAGTACATAGT



CCCCAGGTGTTCTGTAAGCACAGTTTCTTTTGATGAAGAAAGCTATGAAG



AATTCCGTTCCTCTCCAGCACCATCCAGTGAAACTGATGAGGCCCCATTG



ATTTTTACTGCCAGAGGAGAAACTGAGGAGAGAGCCAGAGGAGCACCCA



AGCAGGCTTGGAACAGTTCATTTTTGGAACAACTGGTTAAAAAGCCTAAC



TGGGCACACTCAGTAAATCCTGTTCACCTGGAGGCTCAGGGCATACACAT



CAGTAGACACACAAGACCTAAGGGCCAGCCCTTGAGCAGTCCCAAGAAA



AATTCTGGTTCTGCCGCCAGACCTTCCACTGCCATCGGCCTCTGCAGGAG



GAGCCAGACGCCCGGCGCTCTGCAGAGCACCGGCCCGAGTAACACAGAG



CTCGAGCCGGAGGAGAGGATGGCAGTCCCAGCAGGCGCTCAGGCACACC



CCGACGACATCCAAAGCAGACTCCTGGGCGCGTCCGGAAATCCCGTCGG



AAAAGGCGCGGTTGCCATGGCGCCGGAGATGCTCCCCAAGCATCCTCATA



CCCCGCGGGACAGGAGGCCTCAGGCGGACACCTCCCTCCATGGCAATCTG



GCAGGAGCGCCCCTTCCTCTGCTGGCCGGTGCTTCCACCCATTTCCCCTCC



AAGAGGTTAATAAAGGTTTGCTCCTCAGCACCCCCCCGCCCAACCCGGCG



TTTCCATACGGTTTGTTCACAGGCCCTTTCTAGGCCGGTGGTGAATGCTCA



CTTACATTGA





ADK->
ATGACGTCAGTCAGAGAAAATATTCTCTTTGGAATGGGAAATCCTCTGCT
SEQ ID


C10orf11
TGACATCTCTGCTGTAGTGGACAAAGATTTCCTTGATAAGTATTCTCTGAA
NO: 44



ACCAAATGACCAAATCTTGGCTGAAGACAAACACAAGGAACTGTCACTG



GAAGGACTGAGCGCATTCAGGAGCCTGGAGGAACTCATCTTGGACAACA



ATCAGCTGGGGGACGACCTTGTGTTGCCAGGGTTACCCAGACTGCATACC



TTAACCCTCAACAAGAACCGAATCACTGATTTGGAGAACCTGCTGGATCA



CTTGGCAGAAGTGACACCAGCTCTGGAGTACCTCAGTCTGCTGGGCAACG



TGGCCTGTCCCAACGAGCTGGTCAGCTTGGAAAAGGATGAGGAAGACTA



CAAGAGATACAGATGCTTTGTTCTGTACAAGCTGCCCAACTTGAAATTTC



TGGATGCCCAGAAAGTAACCAGACAAGAACGAGAGGAGGCGTTGGTCAG



AGGAGTCTTCATGAAGGTGGTGAAGCCCAAGGCTTCTAGTGAGGACGTTG



CCAGCTCCCCGGAGCGCCACTACACGCCCTTGCCTTCTGCTTCCAGGGAA



CTCACCAGTCACCAAGGTGTCCTGGGGAAGTGTCGCTACGTTTACTATGG



GAAAAACTCAGAGGGCAACAGGTTTATCCGAGATGACCAGCTCTGA





FAM135A
GGTGAGAGAGGCCGGGGGCGGGGCCGTGCGGCCGGGGACCTGTTGATCG
SEQ ID


->
CAGGTATAGCCGGCTGGCCCGGGCTCCCTCGGGACTGGGGCGACTGCGC
NO: 45


PKIB
ATGCTCGCTGGCCGCGCTGGGCCAGTAGCCGAGCCGCGGTGACGAACCG



GCTCCGCGGTTGCGGTGTTTGCGGTTGCTGTGATGGCGATGTGAGGGGGC



CCGGGGCGGGATGGTGCTGACCCGGGTCGGGCCGTCTTCTTGCAGCTGGA



CAACGAGCTCCTCCGTTCGACAGGCGGGGGAAGAGGCCGAGCCGGGCGA



GAGATGTTGCTATGAGGACAGATTCATCAAAAATGACTGACGTGGAGTCT



GGGGTCGCCAATTTTGCATCTTCAGCAAGGGCAGGCCGCCGGAATGCCTT



ACCAGACATCCAGAGTTCAGCTGCCACAGACGGAACCTCAGATTTGCCCC



TCAAACTGGAGGCTCTCTCCGTGAAGGAAGATGCAAAAGAGAAAGATGA



AAAAACAACACAAGACCAATTGGAAAAGCCTCAAAATGAAGAAAAATG



AAGGCTCATAATCTATCAAGAGTGCTGAATTTCTGCATGTTGAAAGACTT



AGTGGTTCTGTTTTCTTGAGACATTTAATCTGGTGGTAACTGTGGTAACAT



TGCAGCCCTAAGCAGCATGTGTATATTAGATAATTGTGTTGTGATGCTAC



TCACTTTGATTGCAATGATGATGTCCAAGGTAAGCTATTAAAAGGCAGGT



TACTTCCAAATCGCACTGAAGGAAAAGGTTAAGAATAATACATGATCAC



AGAAATGCATACCACTGTCTGTAAACCCAACAAAATTCACTGTTCTCTTTT



GGATTTATTTAGCCTGATGTATTTTTAATTCAATTTTTATGGTGATGGGCA



AATCATTCTTGGTAAATGTAAATCAAACATGATTGATTTAAAACTTCATG



GAATTTGTAGAAAATTATGGACATTTTTGGTGAGAAAGAACAATAGTCAA



AACTCACATGGATAGAGTGTGTTTGTTTTTTGCCAAAAATGCCCCAGACT



TTTTCCCAAACCTCAAAAACGTCTTGGAAAAATTGTAAAAGTTTGATAAC



AGAAACATCTTTAGGATATTTTTGTCTGACATATTTTGCTTCTAGTATGTG



CCTACTGTGATTTTTTTCATGTGGAAAATGCAAAATTTGTAACAAAATGG



TTATATGGAACATGCCTATTAAATGAATTTTACTATCTTCCCTAACTTTGG



TCTGTGTATGTGTGTGTGTTTTACTTTAATATGAATTATACAAAATACTAG



TTGTTTTACACTCTCTTTTCTTATTCTTAGGGCTTTTGTGTATGTCTGACTT



GTTTTTAAATAACTTCCTCAGCAATGCAGACCTTAATTTTTATATTTTTTTA



AAGTAGCTAACATAGCAGTAGGCACTTAAGCATTTAGTCAATGATATTGG



TAGAAATAGTAAAATACATCCTTTAAATATATATCTAAGCATATATTTTA



AAAGGAGCAAAAATAAAACCAAAGTGTTAGTAAATTTTGATTTATTAGAT



ATTTTAGAAAAATAATAGAATTCTGAAGTTTTAAAAATGTCAGTAATTAA



TTTATTTTCATTTTCAGAAATATATGCATGCAGTTATGTTTTATTTGATTGT



TGACTTAGGCTATGTCTGTATACAGTAACCAAATAAACTCTTTCACTATTA



AAGAGATTTCTTACTGAC





CDC42BPB
ATGTCGGCCAAGGTGCGGCTCAAGAAGCTGGAGCAGCTGCTCCTGGACG
SEQ ID


->
GGCCCTGGCGCAACGAGAGCGCCCTGAGCGTGGAAACGCTGCTCGACGT
NO: 46


PET112
GCTCGTCTGCCTGTACACCGAGTGCAGCCACTCGGCCCTGCGCCGCGACA



AGTACGTGGCCGAGTTCCTCGAGTGGGGTGTTTGAGGAACTGTGGAAGA



GGGAAGGCAAGACTCCAGGGCAGATTGTTTCAGAAAAGCAGCTTGAACT



GATGCAGGACCAGGGGGCACTGGAGCAGCTCTGCCACTCTGTGATGGAG



GCCCATCCTCAAGTGGTAATGGATGTGAAGAACAGAAACCCCAGAGCTA



TAAATAAACTGATTGGGTTGGTCCGGAAAGCGACTCAAAGCCGAGCAGA



TCCAGTCATGATAAAGGAGATCCTGGAGAAGAAGCTGTCATTGTGA





SAV1->
ATGCTGTCCCGAAAGAAAACCAAAAACGAAGTGTCCAAGCCGGCCGAGG
SEQ ID


NIN
TGCAGGGGAAGTACGTGAAGAAGGAGACGTCGCCTCTGCTTCGGAATCTT
NO: 47



ATGCCTTCATTCATCCGGCATGGTCCAACAATTCCAAGACGAACTGATAT



CTGTCTTCCAGATTCAAGCCCTAATGCCTTTTCAACTTCTGGAGATGTAGT



TTCAAGAAACCAGAGTTTCCTTAGAACTCCAATTCAAAGAACACCTCATG



AAATAATGAGAAGAGAAAGCAACAGATTATCTGCACCTTCTTATCTTGCC



AGAAGTCTAGCAGATGTCCCTAGAGAGTATGGTTCTTCTCAGTCATTTGT



AACGGAAGTTAGTTTTGCTGTTGAAAATGGAGACTCTGGTTCCCGATATT



ATTATTCAGACAATTTTTTTGATGGTCAGAGAAAGCGGCCACTTGGAGAT



CGTGCACATGAAGACTACAGATATTATGAATACAACCATGATCTCTTCCA



AAGAATGCCACAGAATCAGGGGAGGCATGCTTCAGCACTGGAAGACGCA



ACGCAGTGAGGAGTATGAAGCGGAAGGCCAGTTAAGGTTTTGGAACCCA



GATGACTTGAATGCTTCACAGAGTGGATCTTCCCCTCCCCAAGACTGGAT



AGAAGAGAAACTGCAAGAAGTTTGTGAAGATTTGGGGATCACCCGTGAT



GGTCACCTGAACCGGAAGAAGCTGGTCTCCATCTGTGAGCAGTATGGTTT



ACAGAATGTGGATGGAGAGATGCTCGAGGAAGTATTCCATAATCTTGATC



CTGACGGTACAATGAGTGTAGAAGATTTTTTCTATGGTTTGTTTAAAAAT



GGAAAATCTCTTACACCATCAGCATCTACTCCATATAGACAACTAAAAAG



GCACCTTTCCATGCAGTCTTTCGATGAGAGTGGACGACGTACCACAACCT



CATCAGCAATGACAAGTACCATTGGCTTTCGGGTCTTCTCCTGCCTGGAT



GATGGGATGGGCCATGCATCTGTGGAGAGAATACTGGACACCTGGCAGG



AAGAGGGCATTGAGAACAGCCAGGAGATCCTGAAGGCCTTGGATTTCAG



CCTCGATGGAAACATCAATTTGACAGAATTAACACTGGCCCTTGAAAATG



AACTTTTGGTTACCAAGAACAGCATTCACCAGGCGGCTCTGGCCAGCTTT



AAGGCTGAAATCCGGCATTTGTTGGAACGAGTTGATCAGGTGGTCAGAG



AAAAAGAGAAGCTACGGTCAGATCTGGACAAGGCCGAGAAGCTCAAGTC



TTTAATGGCCTCGGAGGTGGATGATCACCATGCGGCCATAGAGCGGCGG



AATGAGTACAACCTCAGGAAACTGGATGAAGAGTACAAGGAGCGAATAG



CAGCCTTAAAAAATGAACTCCGAAAAGAGAGAGAGCAGATCCTGCAGCA



GGCAGGCAAGCAGCGTTTAGAACTTGAACAGGAAATTGAAAAGGCAAAA



ACAGAAGAGAACTATATCCGGGACCGCCTTGCCCTCTCTTTAAAGGAAAA



CAGTCGTCTGGAAAATGAGCTTCTAGAAAATGCAGAGAAGTTGGCAGAA



TATGAGAATCTGACAAACAAACTTCAGAGAAATTTGGAAAATGTGTTAGC



AGAAAAGTTTGGTGACCTCGATCCTAGCAGTGCTGAGTTCTTCCTGCAAG



AAGAGAGACTGACACAGATGAGAAATGAATATGAGCGGCAGTGCAGGGT



ACTACAAGACCAAGTAGATGAACTCCAGTCTGAGCTGGAAGAATATCGT



GCACAAGGCAGAGTGCTCAGGCTTCCGTTGAAGAACTCACCGTCAGAAG



AAGTTGAGGCTAACAGCGGTGGCATTGAGCCCGAACACGGGCTCGGTTCT



GAAGAATGCAATCCATTGAATATGAGCATTGAGGCAGAGCTGGTCATTG



AACAGATGAAAGAACAACATCACAGGGACATATGTTGCCTCAGACTGGA



GCTCGAAGATAAAGTGCGCCATTATGAAAAGCAGCTGGACGAAACCGTG



GTCAGCTGCAAGAAGGCACAGGAGAACATGAAGCAAAGGCATGAGAAC



GAAACGCACACCTTAGAAAAACAAATAAGTGACCTTAAAAATGAAATTG



CTGAACTTCAGGGGCAAGCAGCAGTGCTCAAGGAGGCACATCATGAGGC



CACTTGCAGGCATGAGGAGGAGAAAAAACAACTGCAAGTGAAGCTTGAG



GAGGAAAAGACTCACCTGCAGGAGAAGCTGAGGCTGCAACATGAGATGG



AGCTCAAGGCTAGACTGACACAGGCTCAAGCAAGCTTTGAGCGGGAGAG



GGAAGGCCTTCAGAGTAGCGCCTGGACAGAAGAGAAGGTGAGAGGCTTG



ACTCAGGAACTAGAGCAGTTTCACCAGGAGCAGCTGACAAGCCTGGTGG



AGAAACACACTCTTGAGAAAGAGGAGTTAAGAAAAGAGCTCTTGGAAAA



GCACCAAAGGGAGCTTCAGGAGGGAAGATATGAATCTGAAAAGCTTCAA



CAGGAAAATTCTATTTTGAGAAATGAAATTACTACTTTAAATGAAGAAGA



TAGCATTTCTAACCTGAAATTAGGGACATTAAATGGATCTCAGGAAGAAA



TGTGGCAAAAAACGGAAACTGTAAAACAAGAAAATGCTGCAGTTCAGAA



GATGGTTGAAAATTTAAAGAAACAGATTTCAGAATTAAAAATCAAAAAC



CAACAATTGGATTTGGAAAATACAGAACTTAGCCAAAAGAACTCTCAAA



ACCAGGAAAAACTGCAAGAACTTAATCAACGTCTAACAGAAATGCTATG



CCAGAAGGAAAAAGAGCCAGGAAACAGTGCATTGGAGGAACGGGAACA



AGAGAAGTTTAATCTGAAAGAAGAACTGGAACGTTGTAAAGTGCAGTCC



TCCACTTTAGTGTCTTCTCTGGAGGCGGAGCTCTCTGAAGTTAAAATACA



GACCCATATTGTGCAACAGGAAAACCACCTTCTCAAAGATGAACTGGAG



AAAATGAAACAGCTGCACAGATGTCCCGATCTCTCTGACTTCCAGCAAAA



AATCTCTAGTGTTCTAAGCTACAACGAAAAACTGCTGAAAGAAAAGGAA



GCTCTGAGTGAGGAATTAAATAGCTGTGTCGATAAGTTGGCAAAATCAAG



TCTTTTAGAGCATAGAATTGCGACGATGAAGCAGGAACAGAAATCCTGG



GAACATCAGAGTGCGAGCTTAAAGTCACAGCTGGTGGCTTCTCAGGAAA



AGGTTCAGAATTTAGAAGACACCGTGCAGAATGTAAACCTGCAAATGTCC



CGGATGAAATCTGACCTACGAGTGACTCAGCAGGAAAAGGAGGCTTTAA



AACAAGAAGTGATGTCTTTACATAAGCAACTTCAGAATGCTGGTGGCAAG



AGCTGGGCCCCAGAGATAGCTACTCATCCATCAGGGCTCCATAACCAGCA



GAAAAGGCTGTCCTGGGACAAGTTGGATCATCTGATGAATGAGGAACAG



CAGCTGCTTTGGCAAGAGAATGAGAGGCTCCAGACCATGGTACAGAACA



CCAAAGCCGAACTCACGCACTCCCGGGAGAAGGTCCGTCAGTTGGAATC



CAATCTTCTTCCCAAGCACCAAAAACATCTAAACCCATCAGGTACCATGA



ATCCCACAGAGCAAGAAAAATTGAGCTTAAAGAGAGAGTGTGATCAGTT



TCAGAAAGAACAATCTCCTGCTAACAGGAAGGTCAGTCAGATGAATTCCC



TTGAACAAGAATTAGAAACAATTCATTTGGAAAATGAAGGCCTGAAAAA



GAAACAAGTAAAACTGGATGAGCAGCTCATGGAGATGCAGCACCTGAGG



TCCACTGCGACGCCTAGCCCGTCCCCTCATGCTTGGGATTTGCAGCTGCTC



CAGCAGCAAGCCTGTCCGATGGTGCCCAGGGAGCAGTTTCTGCAGCTTCA



ACGCCAGCTGCTGCAGGCAGAAAGGATAAACCAGCACCTGCAGGAGGAA



CTTGAAAACAGGACCTCCGAAACCAACACACCACAGGGAAACCAGGAAC



AACTGGTAACTGTCATGGAGGAACGAATGATAGAAGTTGAACAGAAACT



GAAACTAGTGAAAAGGCTTCTTCAAGAGAAAGTGAATCAGCTCAAAGAA



CAAGTGAGCCTACCCGGTCATCTCTGTTCACCCACCTCACATTCCAGCTTT



AACTCCAGTTTTACATCCCTTTATTGCCATTAA





RERG->
ATGGCTAAAAGTGCGGAGGTCAAACTGGCAATATTTGGGAGAGCAGGCG
SEQ ID


GZMM
TGGGCAAGTCAGGCAGCTCCTTTGGGACCCAGATCATCGGGGGCCGGGA
NO: 48



GGTGATCCCCCACTCGCGCCCGTACATGGCCTCACTGCAGAGAAATGGCT



CCCACCTGTGCGGGGGTGTCCTGGTGCACCCAAAGTGGGTGCTGACGGCT



GCCCACTGCCTGGCCCAGCGGATGGCCCAGCTGAGGCTGGTGCTGGGGCT



CCACACCCTGGACAGCCCCGGTCTCACCTTCCACATCAAGGCAGCCATCC



AGCACCCTCGCTACAAGCCCGTCCCTGCCCTGGAGAACGACCTCGCGCTG



CTTCAGCTGGACGGGAAAGTGAAGCCCAGCCGGACCATCCGGCCGTTGG



CCCTGCCCAGTAAGCGCCAGGTGGTGGCAGCAGGGACTCGGTGCAGCAT



GGCCGGCTGGGGGCTGACCCACCAGGGCGGGCGCCTGTCCCGGGTGCTG



CGGGAGCTGGACCTCCAAGTGCTGGACACCCGCATGTGTAACAACAGCC



GCTTCTGGAACGGCAGCCTCTCCCCCAGCATGGTCTGCCTGGCGGCCGAC



TCCAAGGACCAGGCTCCCTGCAAGGGTGACTCGGGCGGGCCCCTGGTGTG



TGGCAAAGGCCGGGTGTTGGCCAGAGTCCTGTCCTTCAGCTCCAGGGTCT



GCACTGACATCTTCAAGCCTCCCGTGGCCACCGCTGTGGCGCCTTACGTG



TCCTGGATCAGGAAGGTCACCGGCCGATCGGCCTGA





DMKN->
ATGAAGTTCCAGGGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGG
SEQ ID


LGI4
CAGTGGGGAGGCTGGCCCCCTGCAGAGCGGAGAGGAAAGCACTGGGACA
NO: 49



AATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGAGCGAAG



GGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAA



AGTCAGTGAGGCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGA



GTCAGGCAGGTTCCAGGCTTTGGCGTAGCAGATGCTTTGGGCAACAGGGT



CGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGATTGGCAGA



CAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTG



GCAGGGGGTGCCTGGCCACAATGGTGCTTGGGAAACTTCTGGAGGCCAT



GGCATCTTTGGCTCTCAAGGTGGCCTTGGAGGCCAGGGCCAGGGCAATCC



TGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCGGAAACTCAGCA



GGCAGCTTTGGAATGAATCCTCAGGGAGCTCCCTGGGGTCAAGGAGGCA



ATGGAGGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCA



GCCTGGCTATGGTTCAGTGAGAGCCAGCAACCAGAATGAAGGGTGCACG



AATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCAACTCTGGGGGAGG



CAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAAC



AACAATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCG



GCAGCAGTGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAG



CAGAGGTGACAGCGGCAGTGAGTCCTCCTGGGGATCCAGCACCGGCTCCT



CCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGACATAAACCCGG



GTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATT



CAGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA



GAATTTTAAATCCAAGCTGGGTTTCATCAACTGGGATGCCATAAACAAGA



ACCAGGTCCCGCCCCCCAGCACCCGAGCCCTCCTCTACTTCAGCCGACTC



TGGGAGGATTTCAAACAGAACACTCCTTTCCTCAACTGGAAAGCAATTAT



TGAGGGTGCGGACGCGTCATCACTGCAGAAACGTGCAGGCAGAGACGAT



CAGAACTACAATTACAACCAGCATGCGTATCCCACTGCCTATGGTGGGAA



GTACTCAGTCAAGACCCCTGCAAAGGGGGGAGTCTCACCTTCTTCCTCGG



CTTCCCGGGTGCAACCTGGCCTGCTGCAGTGGGTGAAGTTTTGGTAGAGC



TGTCCTGGTTCCAGACGGTGGGGGAGTCGGCACTGAGCGTAGAGCCCTTC



TCCTACCAAGGGGAGCCTCACATTGTGCTGGCACAGCCCTTCGCCGGCCG



CTGCCTGATTCTCTCCTGGGACTACAGCCTGCAGCGCTTCCGGCCCGAGG



AAGAGCTGCCCGCGGCCTCCGTGGTGTCCTGCAAGCCACTGGTGCTGGGC



CCGAGCCTCTTCGTGCTGGCTGCCCGCCTGTGGGGGGGCTCACAGCTGTG



GGCCCGGCCCAGTCCCGGCCTGCGCCTGGCCCCAACGCAGACCCTGGCCC



CGCGGCGGCTGCTGCGGCCCAATGACGCCGAGCTCCTGTGGCTGGAAGG



GCAACCCTGCTTCGTGGTGGCCGATGCCTCCAAGGCGGGCAGCACCACGC



TGCTGTGCCGCGACGGGCCCGGCTTTTACCCGCACCAGAGCCTGCACGCC



TGGCACCGGGACACGGACGCTGAGGCCCTGGAGCTGGACGGCCGGCCCC



ACCTGCTGCTGGCCTCGGCTTCCCAGCGGCCCGTGCTCTTCCACTGGACC



GGTGGCCGCTTCGAGAGACGCACAGACATCCCCGAGGCCGAGGATGTCT



ATGCCACACGCCACTTCCAGGCTGGTGGGGACGTGTTCCTGTGCCTCACA



CGCTACATTGGGGACTCCATGGTCATGCGCTGGGACGGCTCCATGTTTCG



TCTGCTGCAGCAACTTCCCTCGCGCGGTGCCCACGTCTTCCAGCCACTGCT



CATCGCCAGGGACCAGCTGGCCATCCTAGGCAGCGACTTCGCCTTCAGCC



AGGTCCTCCGCCTTGAGCCTGACAAGGGGCTCCTGGAGCCACTGCAGGAG



CTGGGGCCTCCGGCCCTGGTGGCCCCCCGTGCCTTTGCCCACATCACTAT



GGCCGGCAGACGCTTCCTCTTTGCTGCTTGCTTTAAGGGCCCCACACAGA



TCTACCAGCATCACGAGATCGACCTCAGTGCCTGA





UTP18->
ATGCCGCCGGAGCGGAGGAGACGAATGAAACTGGACCGGAGAACCGGA
SEQ ID


ACACA
GCGAAGCCGAAGCGGAAGCCCGGAATGAGGCCGGACTGGAAAGCCGGA
NO: 50



GCGGGGCCAGGCGGGCCTCCCCAAAAGCCTGCCCCTTCATCCCAGCGGA



AACCGCCGGCCCGGCCGAGCGCGGCGGCCGCTGCGATTGCAGTCGCGGC



GGCGGAGGAAGAGAGACGGCTCCGGCAGCGGAACCGCCTGAGGCTGGA



GGAGGACAAACCGGCCGTGGAGCGGTGCTTGGAGGAGCTGGTCTTCGGC



GACGTCGAGAACGACGAGGACGCGTTGCTGCGGCGTCTGCGAGGCCCGA



GGGTTCAAGAACATGAAGACTCGGGTGACTCAGAAGTGGAGAATGAAGC



AAAAGGTAATTTTCCACCTCAAAAGAAGCCAGTTTGGGTGGATGAAGAA



GATGAAGATGAGGAAATGGTTGACATGATGAACAATCGGTTTCGGAAGG



ATATGATGAAAAATGCTAGTGAAAGTAAACTTTCGAAAGACAACCTTAA



AAAGAGACTTAAAGAAGAATTCCAACATGCCATGGGAGGAGTACCTGCC



TGGGCAGAGACTACTAAGCGGAAAACATCTTCAGATGATGAAAGTGAAG



AGGATGAAGATGATTTGTTGCAAAGGACTGGGAATTTCATATCCACATCA



ACTTCTCTTCCAAGAGGAATCTTGAAGATGAAGAACTGCCAGCATGCGAA



TGCTGAACGTCCTACTGTTGCTCGGATCTCATCTGTGCAGTTCCATCCCGG



TGCACAGATTGTGATGGTTGCTGGATTAGATAATGCTGTATCACTATTTCA



GGTTGATGGGAAAACAAATCCTAAAATTCAGAGCATCTATTTGGAAAGGT



TTCCAATCTTTAAGGCTTGTTTTAGTGCTAATGGGGAAGAAGTTTTAGCCA



CGAGTACCCACAGCAAGGTTCTTTATGTCTATGACATGCTGGCTGGAAAG



TTAATTCCTGTGCATCAAGTGAGAGGTCCTCGGGCGGGAAGTGTACACCT



CCAATAACCAGCTGGGGGGCATCCAGATTATGCACAACAATGGGGTGAC



CCACTGCACTGTGTGTGATGACTTTGAAGGGGTTTTCACTGTCCTGCACTG



GCTGTCTTACATGCCCAAGAGCGTGCACAGTTCAGTTCCTCTTCTGAACTC



AAAGGATCCTATAGACAGAATCATCGAGTTTGTTCCCACAAAGACCCCAT



ACGATCCTCGATGGATGCTAGCAGGCCGTCCTCACCCAACCCAAAAAGGT



CAGTGGTTGAGTGGCTTTTTTGACTATGGATCTTTCTCAGAGATTATGCAG



CCCTGGGCACAGACTGTGGTGGTTGGTAGAGCCAGGCTAGGAGGAATAC



CTGTGGGAGTTGTTGCTGTAGAAACCCGAACAGTAGAACTAAGTATCCCA



GCTGATCCAGCAAACCTGGATTCTGAAGCCAAGATAATCCAGCAGGCTG



GCCAGGTTTGGTTCCCAGATTCTGCGTTTAAGACATATCAGGCCATCAAG



GACTTCAACCGGGAAGGGCTGCCTCTGATGGTCTTTGCCAACTGGAGAGG



CTTCTCTGGTGGAATGAAAGATATGTACGACCAAGTGCTGAAGTTTGGTG



CTTACATTGTGGATGGCTTGAGGGAGTGCTGCCAGCCTGTGCTGGTTTAC



ATTCCTCCCCAGGCTGAGCTGCGGGGTGGCTCCTGGGTGGTGATTGACTC



CTCCATCAACCCCCGGCACATGGAGATGTATGCTGACCGAGAAAGCAGG



GGATCTGTTCTGGAGCCAGAAGGGACAGTAGAAATCAAATTCCGCAGAA



AGGATCTGGTGAAAACCATGCGTCGGGTGGACCCAGTCTACATCCACTTG



GCTGAGCGATTGGGGACCCCAGAGCTAAGCACAGCTGAGCGGAAGGAGT



TGGAGAACAAGTTGAAGGAGCGGGAGGAATTCCTAATTCCCATTTACCAT



CAGGTAGCCGTGCAGTTTGCTGACTTGCACGACACACCAGGCCGGATGCA



GGAGAAGGGTGTTATTAGCGATATCCTGGATTGGAAAACATCCCGTACCT



TCTTCTACTGGCGGCTGAGGCGTCTTCTGCTGGAGGACCTGGTCAAGAAG



AAAATCCACAATGCCAACCCTGAGCTGACTGATGGCCAGATTCAAGCCAT



GTTAAGGCGCTGGTTTGTGGAAGTGGAAGGAACAGTGAAGGCTTATGTTT



GGGACAATAATAAGGATCTGGCGGAGTGGCTAGAGAAACAGCTGACAGA



GGAGGATGGTGTTCACTCGGTAATAGAGGAAAACATCAAATGCATCAGC



AGAGACTACGTCCTCAAGCAAATCCGCAGCTTGGTCCAGGCCAATCCAGA



GGTTGCCATGGATTCCATCATCCATATGACGCAGCACATATCACCCACTC



AGCGAGCAGAAGTCATACGGATCCTCTCCACAATGGATTCCCCTTCCACG



TAG





APP->
ATGCTGCCCGGTTTGGCACTGCTCCTGCTGGCCGCCTGGACGGCTCGGGC
SEQ ID


C21orf7
GCTGGAGGTACCCACTGATGGTAATGCTGGCCTGCTGGCTGAACCCCAGA
NO: 51



TTGCCATGTTCTGTGGCAGACTGAACATGCACATGAATGTCCAGAATGGG



AAGTGGGATTCAGATCCATCAGGGACCAAAACCTGCATTGATACCAAGG



AAGGCATCCTGCAGTATTGCCAAGAAGTCTACCCTGAACTGCAGATCACC



AATGTGGTAGAAGCCAACCAACCAGTGACCATCCAGAACTGGTGCAAGC



GGGGCCGCAAGCAGTGCAAGACCCATCCCCACTTTGTGATTCCCTACCGC



TGCTTAGTTGGTGAGTTTGTAAGTGATGCCCTTCTCGTTCCTGACAAGTGC



AAATTCTTACACCAGGAGAGGATGGATGTTTGCGAAACTCATCTTCACTG



GCACACCGTCGCCAAAGAGACATGCAGTGAGAAGAGTACCAACTTGCAT



GACTACGGCATGTTGCTGCCCTGCGGAATTGACAAGTTCCGAGGGGTAGA



GTTTGTGTGTTGCCCACTGGCTGAAGAAAGTGACAATGTGGATTCTGCTG



ATGCGGAGGAGGATGACTCGGATGTCTGGTGGGGCGGAGCAGACACAGA



CTATGCAGATGGGAGTGAAGACAAAGTAGTAGAAGTAGCAGAGGAGGA



AGAAGTGGCTGAGGTGGAAGAAGAAGAAGCCGATGATGACGAGGACGA



TGAGGATGGTGATGAGGTAGAGGAAGAGGCTGAGGAACCCTACGAAGAA



GCCACAGAGAGAACCACCAGCATTGCCACCACCACCACCACCACCACAG



AGTCTGTGGAAGAGGTGGTTCGAGAGGTGTGCTCTGAACAAGCCGAGAC



GGGGCCGTGCCGAGCAATGATCTCCCGCTGGTACTTTGATGTGACTGAAG



GGAAGTGTGCCCCATTCTTTTACGGCGGATGTGGCGGCAACCGGAACAAC



TTTGACACAGAAGAGTACTGCATGGCCGTGTGTGGCAGCGCCATGTCCCA



AAGTTTACTCAAGACTACCCAGGAACCTCTTGCCCGAGATCCTGTTAAAC



TTCCTACAACAGCAGCCAGTACCCCTGATGCCGTTGACAAGTATCTCGAG



ACACCTGGGGATGAGAATGAACATGCCCATTTCCAGAAAGCCAAAGAGA



GGCTTGAGGCCAAGCACCGAGAGAGAATGTCCCAGGTCATGAGAGAATG



GGAAGAGGCAGAACGTCAAGCAAAGAACTTGCCTAAAGCTGATAAGAAG



GCAGTTATCCAGCATTTCCAGGAGAAAGTGGAATCTTTGGAACAGGAAG



CAGCCAACGAGAGACAGCAGCTGGTGGAGACACACATGGCCAGAGTGGA



AGCCATGCTCAATGACCGCCGCCGCCTGGCCCTGGAGAACTACATCACCG



CTCTGCAGGCTGTTCCTCCTCGGCCTCGTCACGTGTTCAATATGCTAAAGA



AGTATGTCCGCGCAGAACAGAAGGACAGACAGCACACCCTAAAGCATTT



CGAGCATGTGCGCATGGTGGATCCCAAGAAAGCCGCTCAGATCCGGTCCC



AGGTTATGACACACCTCCGTGTGATTTATGAGCGCATGAATCAGTCTCTC



TCCCTGCTCTACAACGTGCCTGCAGTGGCCGAGGAGATTCAGGATGAAGT



TGGAAGGAGCTCATTGCCAAGTTAGATCAGGCAGAAAAGGAGAAGGTGG



ATGCTGCTGAGCTGGTTCGGGAATTCGAGGCTCTGACGGAGGAGAATCG



GACGTTGAGGTTGGCCCAGTCTCAATGTGTGGAACAACTGGAGAAACTTC



GAATACAGTATCAGAAGAGGCAGGGCTCGTCCTAA





GREB1->
GCATTACGCGCCCCACGCATCCTCTTCCATCCCCAGGCACAGATCAAAGG
SEQ ID


MBOAT2
CGCAGCCCAGGAGGCGGGAGCCCCTGCACACTTTCCACCTCTGCTGGGCT
NO: 52



TAGCCTCTTGGCTGGTTGGTCTGTGGAGTGCCTGAAGTGACCAGCTTTTTG



TAAGGTCAACTTTGTAGTGTGCCAACTCTTTGCCTTGCTAGCAGCCATTTG



GTTTCGAACTTATCTACATTCAAGCAAAACTAGCTCTTTTATAAGACATGT



AGTTGCTACCCTTTTGGGCCTTTATCTTGCACTTTTTTGCTTTGGATGGTAT



GCCTTACACTTTCTTGTACAAAGTGGAATTTCCTACTGTATCATGATCATC



ATAGGAGTGGAGAACATGCACAATTACTGCTTTGTGTTTGCTCTGGGATA



CCTCACAGTGTGCCAAGTTACTCGAGTCTATATCTTTGACTATGGACAAT



ATTCTGCTGATTTTTCAGGCCCAATGATGATCATTACTCAGAAGATCACTA



GTTTGGCTTGCGAAATTCATGATGGGATGTTTCGGAAGGATGAAGAACTG



ACTTCCTCACAGAGGGATTTAGCTGTAAGGCGCATGCCAAGCTTACTGGA



GTATTTGAGTTACAACTGTAACTTCATGGGGATCCTGGCAGGCCCACTTT



GCTCTTACAAAGACTACATTACTTTCATTGAAGGCAGATCATACCATATC



ACACAATCTGGTGAAAATGGAAAAGAAGAGACACAGTATGAAAGAACA



GAGCCATCTCCAAATACTGCGGTTGTTCAGAAGCTCTTAGTTTGTGGGCT



GTCCTTGTTATTTCACTTGACCATCTGTACAACATTACCTGTGGAGTACAA



CATTGATGAGCATTTTCAAGCTACAGCTTCGTGGCCAACAAAGATTATCT



ATCTGTATATCTCTCTTTTGGCTGCCAGACCCAAATACTATTTTGCATGGA



CGCTAGCTGATGCCATTAATAATGCTGCAGGCTTTGGTTTCAGAGGGTAT



GACGAAAATGGAGCAGCTCGCTGGGACTTAATTTCCAATTTGAGAATTCA



ACAAATAGAGATGTCAACAAGTTTCAAGATGTTTCTTGATAATTGGAATA



TTCAGACAGCTCTTTGGCTCAAAAGGGTGTGTTATGAACGAACCTCCTTC



AGTCCAACTATCCAGACGTTCATTCTCTCTGCCATTTGGCACGGGGTATAC



CCAGGATATTATCTAACGTTTCTAACAGGGGTGTTAATGACATTAGCAGC



AAGAGCTATGAGAAATAACTTTAGACATTATTTCATTGAACCTTCCCAAC



TGAAATTATTTTATGATGTTATAACATGGATAGTAACTCAAGTAGCAATA



AGTTACACAGTTGTGCCATTTGTGCTTCTTTCTATAAAACCATCACTCACG



TTTTACAGCTCCTGGTATTATTGCCTGCACATTCTTGGTATCTTAGTATTAT



TGTTGTTGCCAGTGAAAAAAACTCAAAGAAGAAAGAATACACATGAAAA



CATTCAGCTCTCACAATCCAAAAAGTTTGATGAAGGAGAAAATTCTTTGG



GACAGAACAGTTTTTCTACAACAAACAATGTTTGCAATCAGAATCAAGAA



ATAGCCTCGAGACATTCATCACTAAAGCAGTGATCGGGAAGGCTCTGAG



GGCTGTTTTTTTTTTTTGATGTTAACAGAAACCAATCTTAGCACCTTTTCA



AGGGGTTTGAGTTTGTTGGAAAAGCAGTTAACTGGGGGGAAATGGACAG



TTATAGATAAGGAATTTCCTGTACACCAGATTGGAAATGGAGTGAAACAA



GCCCTCCCATGCCATGTCTCCGTGGGCCACGCCTTATGTAAGAATATTTCC



ATATTTCAGTGGGCACTCCCAACCTCAGCACTTGTCCGTAGGGTCACACG



CGTGCCCTGTTGCTGAATGTATGTTGCGTATCCCAAGGCACTGAAGAGGT



GGAAAAATAATCGTGTCAATCTGGATGATAGAGAGAAATTAACTTTTCCA



AATGAATGTCTTGCCTTAAACCCTCTATTTCCTAAAATATTGTTCCTAAAT



GGTATTTTCAAGTGTAATATTGTGAGAACGCTACTGCAGTAGTTGATGTT



GTGTGCTGTAAAGGATTTTAGGAGGAATTTGAAACAGGATATTTAAGAGT



GTGGATATTTTTAAAATGCAATAAACATCTCAGTATTTGAAGGGTTTTCTT



AAAGTATGTCAAATGACTACAATCCATAGTGAAACTGTAAACAGTAATG



GACGCCAAATTATAGGTAGCTGATTTTGCTGGAGAGTTTAATTACCTTGT



GCAGTCAAAGAGCGCTTCCAGAAGGAATCTCTTAAAACATAATGAGAGG



TTTGGTAATGTGATATTTTAAGCTTATTCTTTTTCTTAAAAGAGAGAGGTG



ACGAAGGAAGGCAGGAATGAAGAAGCACTGCGTGGCCTCCGGTGGAATG



CACGGGGCACAGCCGCGACTCTGCAGGCAGCTTCCCCCCCATGCCAGGGC



TCTGCGCCGTCATGTGAGACTTAAAAAAAAAGTTGAATGACTTCGTGATA



CTTTGGACTTCTAAATTAAATTTATCAGGCATAAATTATGTAGAATTAGA



GGCTTTGAAAATAATACTGGTAGGTTGCTCAAAGGTTTTGAAAGAGAAAT



CGCTAGGTAGGTTACTATCTGGCTAATCCATTTCTTATCCTTGACAATTTA



ATTCATATTTGGGAAACTTTTAGGGAAATGAAAAATAAAAGTCACTGAGT



CTGGGTGACATTTTTTAAGAATAATATAAATTCAGTTTCAAACTCTTCTCA



CATTAAAATTTTGCTGTGAACTCTTACTAAAATGAGTTTTAGGTTCTGTAA



GTGGAAAAATGTGCTTTTATTTTATGGGCCATTTTTACCACAACTAATCTT



GCCTTGGATTACTAAGCATCTCCTGCGATCCCACAGAGGACTGTGGTGGC



CACAGGAGCTGAAAGCAGAAGAGTGGGATTTGATGCCAGGCAGTGGAGT



GGCCTCAGCCCCAGATTGTACCTCCTGCCCTGTAGGAGGGGAGGGGGCA



AAGCCTTCTGACTTCACCTTTGTTTGACCTATGTATGGAACTTACTTTTAC



TTTTTGCCTTAAATTTTTAATGAAATGCAAATTTTCTGTGATGGGGTTCTC



TCTCTCTTTTTTTCGGGGGGTGGAGTCACTAATAAATTTGCAAATGAAGTT



AAAGACAAGGCAACCATCTGGCTTATGCTATATAATACTTCATTTAAAGA



AGAAAGGAAAAGCAAATGCACTTGCAGCTTTTGAGGTCTCAGCAAAAAT



GGGCATGTGTCTTTTTTGAAGTTTAGAAATATCCTAATCTATTTTTATTTA



TCTAAAAGTAAGTGTTTTCCGGCTGATAAGGCTAACCCTACCCAGGAAAG



GATTGATAACTAAATAAATTTCCTCTGTTTTCCCATGCATTGAAATTATGT



TGGCTGAGCATGGTGGCTCACACCTGTAATCCTAGCACTTTGGGAGGCCG



AGGTGGGCGGATCACTTGAGGTCAGGAGTTGGAGACCAGCCTGGCCAAC



GTGGTGAATCCCCGTCTCTACTGAAAACACAAAAATTAGACGGGCATGGT



GGCGCACACCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATT



GCTTGAACCTGGGAGGTGGAGGTTGCAGTGAGCTAAAATTGTGCCACTGC



ACTCCAGCCTGGGTGACAGAGGAAGACTCCGTCTCAC





MED13L->
ATGACTGCGGCAGCGAACTGGGTGGCGAACGGGGCGAGCCTGGAGGATT
SEQ ID


KIF21A
GTCACTCCAACCTCTTTTCGCTGGCTGAACTCACGGGAATCAAATGGCGT
NO: 53



AGGTACAATTTTGGAGGGCATGGGGACTGTGGACCCATAATTTCAGCCCC



AGCCCAAGATGATCCAATTCTGTTAAGTTTCATCCGCTGTCTGCAAGCTA



ACCTGCTTTGTGTATGGCGTCGTGATGTCAAACCAGATTGCAAAGAGTTA



TGGATATTCTGGTGGGGAGATGAACCCAACCTAGTGGGTGTAATACATCA



TGAACTGCAGGTTGTGGAAGAAGGACTCTGGGAAAATGGCCTTTCCTATG



AATGTAGGACGCTGCTCTTCAAAGCGATCCACAATCTGTTAGAAAGGTGC



CTAATGGATAAGAACTTCGTTAGGATTGGGAAATGGTTTGTCCGACCCTA



CGAAAAGGATGAAAAGCCAGTCAACAAAAGTGAGCATTTGTCCTGTGCT



TTCACATTCTTTCTGCATGGAGAAAGTAATGTATGCACAAGTGTGGAGAT



TGCCCAGCACCAGCCAATTTATTTGATCAATGAGGAGCATATACACATGG



CTCAGTCTTCACCTGCACCATTTCAAGTACTGGTAAGTCCTTATGGCTTAA



ATGGGACGCTAACAGGCCAAGCATACAAGATGTCAGACCCAGCCACTCG



TAAGTTGATTGAGGAATGGCAGTATTTCTACCCGATGGTGCTAAAAAAGA



AAGAAGAATCGAAAGAGGAAGACGAGTTGGGATATGATGATGATTTCCC



TGTGGCAGTTGAAGTAATTGTTGGTGGTGTTCGGATGGTTTACCCTTCAGC



ATTTGTTTTGATCTCTCAGAATGACATCCCGGTTCCTCAGAGTGTTGCCAG



TGCTGGAGGCCACATTGCAGTTGGGCAGCAAGGGCTTGGTAGTGTGAAG



GACCCAAGTAACTGTGGGATGCCTCTGACCCCTCCCACCTCTCCAGAACA



GGCTATCCTAGGTGAGAGTGGAGGTATGCAGAGTGCTGCCAGTCACCTGG



TTTCCCAAGATGGAGGGATGATAACGATGCACAGTCCAAAGAGATCGGG



GAAGATTCCTCCAAAACTCCACAATCATATGGTCCATCGAGTCTGGAAGG



AATGCATCCTCAACAGAACCCAGTCCAAGAGGAGCCAAATGTCAACTCC



AACTCTTGAAGAAGAGCCTGCTAGCAATCCTGCTACTTGGGATTTTGTGG



ATCCAACCCAAAGAGTCAGCTGTTCTTGTTCCAGAATAAGACCACAGCTT



GCCAAAGAGAAGATTGAAGGATGCCATATTTGTACATCTGTCACACCAGG



AGAGCCTCAGGTCTTCCTAGGGAAAGATAAGGCTTTTACTTTTGACTATG



TATTTGACATTGACTCCCAGCAAGAGCAGATCTACATTCAATGTATAGAA



AAACTAATTGAAGGTTGCTTTGAAGGATACAATGCTACAGTTTTTGCTTA



TGGACAAACTGGAGCTGGTAAAACATACACAATGGGAACAGGATTTGAT



GTTAACATTGTTGAGGAAGAACTGGGTATTATTTCTCGAGCTGTTAAACA



CCTTTTTAAGAGTATTGAAGAAAAAAAACACATAGCAATTAAAAATGGG



CTTCCTGCTCCAGATTTTAAAGTGAATGCCCAATTCTTAGAGCTCTATAAT



GAAGAGGTCCTTGACTTATTTGATACCACTCGTGATATTGATGCAAAAAG



TAAAAAATCAAATATAAGAATTCATGAAGATTCAACTGGAGGAATTTATA



CTGTGGGCGTTACAACACGTACTGTGAATACAGAATCAGAGATGATGCA



GTGTTTGAAGTTGGGTGCTTTATCCCGGACAACTGCCAGTACCCAGATGA



ATGTTCAGAGCTCTCGTTCACATGCCATTTTTACCATTCATGTGTGTCAAA



CCAGAGTGTGTCCCCAAATAGATGCTGACAATGCAACTGATAATAAAATT



ATTTCTGAATCAGCACAGATGAATGAATTTGAAACCCTGACTGCAAAGTT



CCATTTTGTTGATCTCGCAGGATCTGAAAGACTGAAGCGTACTGGAGCTA



CAGGCGAGAGGGCAAAAGAAGGCATTTCTATCAACTGTGGACTTTTGGC



ACTTGGCAATGTAATAAGTGCCTTGGGAGACAAGAGCAAGAGGGCCACA



CATGTCCCCTATAGAGATTCCAAGCTAACAAGACTACTACAGGATTCCCT



CGGGGGTAATAGCCAAACAATCATGATAGCATGTGTCAGCCCTTCAGACA



GAGACTTTATGGAAACGTTAAACACCCTGAAATACGCCAATCGAGCTAG



AAATATCAAGAATAAGGTGATGGTCAATCAGGACAGAGCTAGTCAGCAA



ATCAATGCACTTCGTAGTGAAATCACACGACTTCAGATGGAGCTCATGGA



GTACAAAACAGGTAAAAGAATAATTGACGAAGAGGGTGTGGAAAGCATC



AATGACATGTTTCATGAGAATGCTATGCTACAGACTGAAAATAATAACCT



GCGTGTAAGAATTAAAGCCATGCAAGAGACGGTTGATGCATTGAGGTCC



AGAATTACACAGCTTGTTAGTGATCAGGCCAACCATGTTCTTGCCAGAGC



AGGTGAAGGAAATGAGGAGATTAGTAATATGATTCATAGTTATATAAAA



GAAATCGAAGATCTCAGGGCAAAATTATTAGAAAGTGAAGCAGTGAATG



AGAACCTTCGAAAAAACTTGACAAGAGCCACAGCAAGAGCGCCATATTT



CAGCGGATCATCAACTTTTTCTCCTACCATACTATCCTCAGACAAAGAAA



CCATTGAAATTATAGACCTAGCAAAAAAAGATTTAGAGAAGTTGAAAAG



AAAAGAAAAGAGGAAGAAAAAAAGTGTGGCTGGTAAAGAGGATAATAC



AGACACTGACCAAGAGAAGAAAGAAGAAAAGGGTGTTTCGGAAAGAGA



AAACAATGAATTAGAAGTGGAAGAAAGTCAAGAAGTGAGTGATCATGAG



GATGAAGAAGAGGAGGAGGAGGAGGAGGAAGATGACATTGATGGGGGT



GAAAGTTCTGATGAATCAGATTCTGAATCAGATGAAAAAGCCAATTATCA



AGCAGACTTGGCAAACATTACTTGTGAAATTGCAATTAAGCAAAAGCTGA



TTGATGAACTAGAAAACAGCCAGAAAAGACTGCAGACTCTGAAAAAGCA



GTATGAAGAGAAGCTAATGATGCTGCAACATAAAATTCGGGATACTCAG



CTTGAAAGAGACCAGGTGCTTCAAAACTTAGGCTCGGTAGAATCTTACTC



AGAAGAAAAAGCAAAAAAAGTTAGGTCTGAATATGAAAAGAAACTCCAA



GCCATGAACAAAGAACTGCAGAGACTTCAAGCAGCTCAAAAAGAACATG



CAAGGTTGCTTAAAAATCAGTCTCAGTATGAAAAGCAATTGAAGAAATTG



CAGCAGGATGTGATGGAAATGAAAAAAACAAAGGTTCGCCTAATGAAAC



AAATGAAAGAAGAACAAGAGAAAGCCAGACTGACTGAGTCTAGAAGAA



ACAGAGAGATTGCTCAGTTGAAAAAGGATCAACGTAAAAGAGATCATCA



ACTTAGACTTCTGGAAGCCCAAAAAAGAAACCAAGAAGTGGTTCTACGT



CGCAAAACTGAAGAGGTTACGGCTCTTCGTCGGCAAGTAAGACCCATGTC



AGATAAAGTGGCTGGGAAAGTTACTCGGAAGCTGAGTTCATCTGATGCAC



CTGCTCAGGACACAGGTTCCAGTGCAGCTGCTGTCGAAACAGATGCATCA



AGGACAGGAGCCCAGCAGAAAATGAGAATTCCTGTGGCGAGAGTCCAGG



CCTTACCAACGCCGGCAACAAATGGAAACAGGAAAAAATATCAGAGGAA



AGGATTGACTGGCCGAGTGTTTATTTCCAAGACAGCTCGCATGAAGTGGC



AGCTCCTTGAGCGCAGGGTCACAGACATCATCATGCAGAAGATGACCATT



TCCAACATGGAGGCAGATATGAATAGACTCCTCAAGCAACGGGAGGAAC



TCACAAAAAGACGAGAGAAACTTTCAAAAAGAAGGGAGAAGATAGTCA



AGGAGAATGGAGAGGGAGATAAAAATGTGGCTAATATCAATGAAGAGAT



GGAGTCACTGACTGCTAATATCGATTACATCAATGACAGTATTTCTGATT



GTCAGGCCAACATAATGCAGATGGAAGAAGCAAAGGAAGAAGGTGAGA



CATTGGATGTTACTGCAGTCATTAATGCCTGCACCCTTACAGAAGCCCGA



TACCTGCTAGATCACTTCCTGTCAATGGGCATCAATAAGGGTCTTCAGGC



TGCCCAGAAAGAGGCTCAAATTAAAGTACTGGAAGGTCGACTCAAACAA



ACAGAAATAACCAGTGCTACCCAAAACCAGCTCTTATTCCATATGTTGAA



AGAGAAGGCAGAATTAAATCCTGAGCTAGATGCTTTACTAGGCCATGCTT



TACAAGATCTAGATAGCGTACCATTAGAAAATGTAGAGGATAGTACTGAT



GAGGATGCTCCTTTAAACAGCCCAGGATCAGAAGGAAGCACGCTGTCTTC



AGATCTCATGAAGCTTTGTGGTGAAGTGAAACCTAAGAACAAGGCCCGA



AGGAGAACCACCACTCAGATGGAATTGCTGTATGCAGATAGCAGTGAAC



TAGCTTCAGACACTAGTACAGGAGATGCCTCCTTGCCTGGCCCTCTCACA



CCTGTTGCAGAAGGGCAAGAGATTGGAATGAATACAGAGACAAGTGGTA



CTTCTGCTAGGGAAAAAGAGCTCTCTCCCCCACCTGGCTTACCTTCTAAG



ATAGGCAGCATTTCCAGGCAGTCATCTCTATCAGAAAAAAAAATTCCAGA



GCCTTCTCCTGTAACAAGGAGAAAGGCATATGAGAAAGCAGAAAAATCA



AAGGCCAAGGAACAAAAGCACTCAGATTCTGGAACTTCAGAGGCTAGTC



TTTCACCTCCTTCTTCCCCACCAAGCCGGCCCCGTAATGAACTGAATGTTT



TTAATCGTCTTACTGTTTCTCAGGGAAACACATCAGTTCAGCAGGATAAG



TCTGATGAAAGTGACTCCTCTCTCTCGGAGGTACACAGATCCTCCAGAAG



GGGCATAATCAACCCATTTCCTGCTTCAAAAGGAATCAGAGCTTTTCCAC



TTCAGTGTATTCACATAGCTGAAGGGCATACAAAAGCTGTGCTCTGTGTG



GATTCTACTGATGATCTCCTCTTCACTGGATCAAAAGATCGTACTTGTAAA



GTATGGAATCTGGTGACTGGGCAGGAAATAATGTCACTGGGGGGTCATCC



CAACAATGTCGTGTCTGTAAAATACTGTAATTATACCAGTTTGGTCTTCAC



TGTATCAACATCTTATATTAAGGTGTGGGATATCAGAGATTCAGCAAAGT



GCATTCGAACACTAACGTCTTCAGGTCAAGTTACTCTTGGAGATGCTTGTT



CTGCAAGTACCAGTCGAACAGTAGCTATTCCTTCTGGAGAGAACCAGATC



AATCAAATTGCCCTAAACCCAACTGGCACCTTCCTCTATGCTGCTTCTGGA



AATGCTGTCAGGATGTGGGATCTTAAAAGGTTTCAGTCTACAGGAAAGTT



AACAGGACACCTAGGCCCTGTTATGTGCCTTACTGTGGATCAGATTTCCA



GTGGACAAGATCTAATCATCACTGGCTCCAAGGATCATTACATCAAAATG



TTTGATGTTACAGAAGGAGCTCTTGGGACTGTGAGTCCCACCCACAATTT



TGAACCCCCTCATTATGATGGCATAGAAGCACTAACCATTCAAGGGGATA



ACCTATTTAGTGGGTCTAGAGATAATGGAATCAAGAAATGGGACTTAACT



CAAAAAGACCTTCTTCAGCAAGTTCCAAATGCACATAAGGATTGGGTCTG



TGCCCTGGGAGTGGTGCCAGACCACCCAGTTTTGCTCAGTGGCTGCAGAG



GGGGCATTTTGAAAGTCTGGAACATGGATACTTTTATGCCAGTGGGAGAG



ATGAAGGGTCATGATAGTCCTATCAATGCCATATGTGTTAATTCCACCCA



CATTTTTACTGCAGCTGATGATCGAACTGTGAGAATTTGGAAGGCTCGCA



ATTTGCAAGATGGTCAGATCTCTGACACAGGAGATCTGGGGGAAGATATT



GCCAGTAATTAA





ITGA11->
ATGGACCTGCCCAGGGGCCTGGTGGTGGCCTGGGCGCTCAGCCTGTGGCC
SEQ ID


NARF
AGGGTTCACGGACACCTTCAACATGGACACCAGGAAGCCCCGGGTCATC
NO: 54



CCTGGCTCCAGGACCGCCTTCTTTGGCTACACAGTGCAGCAGCACGACAT



CAGTGGCAATAAGTGGAATGTAGTAAGAAAACAAAAACTGATGACCAAG



AGAATGTGTCAGCCGATGCACCGAGTCCAGCCCAGGAAAATGGAGAGAA



GGGAGAATTCCACAAGTTGGCTGATGCCAAGATATTTTTGAGCGACTGCC



TGGCATGTGACAGCTGTATGACTGCAGAGGAAGGAGTCCAACTTTCCCAG



CAAAATGCCAAGGACTTCTTCCGCGTTCTGAACCTTAACAAGAAATGTGA



TACCTCAAAGCACAAAGTGCTGGTAGTGTCTGTGTGTCCTCAATCTTTGCC



TTATTTTGCTGCTAAATTCAACCTCAGTGTAACTGATGCATCCAGAAGACT



CTGTGGTTTCCTCAAAAGTCTTGGGGTGCACTATGTATTTGATACGACGAT



AGCTGCGGATTTTAGTATCCTGGAGAGTCAAAAAGAATTCGTGCGTCGCT



ATCGCCAGCACAGTGAGGAGGAACGCACCCTGCCCATGCTGACCTCTGCC



TGTCCTGGCTGGGTCCGATACGCCGAGCGGGTGCTGGGTCGCCCCATCAC



TGCCCACCTCTGCACCGCCAAGTCCCCCCAGCAGGTCATGGGCTCTTTGG



TGAAGGATTATTTCGCCAGACAGCAGAACCTGTCTCCAGAGAAGATTTTC



CACGTCATTGTGGCCCCTTGTTATGACAAGAAGCTGGAGGCTCTTCAGGA



AAGCCTTCCCCCTGCTTTGCATGGCTCCCGGGGCGCTGACTGCGTGTTAA



CATCAGGTGAAATTGCTCAAATAATGGAGCAAGGTGACCTCTCAGTGAG



AGATGCTGCCGTCGACACTCTGTTTGGAGACTTGAAGGAGGACAAAGTG



ACGCGTCATGATGGAGCCAGCTCAGACGGGCACCTGGCACACATCTTCAG



ACATGCGGCCAAGGAGCTGTTCAACGAGGATGTGGAGGAGGTCACTTAC



CGAGCCCTGAGAAACAAAGACTTCCAAGAGGTCACCCTTGAGAAGAACG



GAGAGGTGGTGTTACGCTTTGCTGCAGCCTATGGCTTTCGAAACATCCAG



AACATGATCCTGAAGCTTAAGAAGGGCAAGTTCCCATTCCACTTTGTGGA



GGTCCTCGCCTGTGCTGGAGGATGCTTAAATGGCAGAGGCCAAGCCCAG



ACTCCAGACGGACATGCGGATAAGGCCCTGCTGCGGCAGATGGAAGGCA



TTTACGCTGACATCCCTGTGCGGCGTCCGGAGTCCAGTGCACACGTGCAG



GAGCTGTACCAGGAGTGGCTGGAGGGGATCAACTCCCCCAAGGCCCGAG



AGGTGCTGCATACCACGTACCAGAGCCAGGAGCGTGGCACACACAGCCT



GGACATCAAGTGGTGA





LDLRAD3
ATGTGGCTGCTGGGGCCGCTGTGCCTGCTGCTGAGCAGCGCCGCGGAGAG
SEQ ID


->
CCAGCTGCTCCCCGGGAACAACTTCACCAATGAGTGCAACATACCAGGCA
NO: 55


ANK3
ACTTCATGTGCAGCAATGGACGGTGCATCCCGGGCGCCTGGCAGTGTGAC



GGGCTGCCTGACTGCTTCGACAAGAGTGATGAGAAGGAGTGCCTCTGATG



CCAATGCAAGTTACTTAAGAGCAGCTCGAGCTGGACACCTTGAAAAGGC



CCTCGACTACATAAAAAATGGAGTTGACATCAACATTTGCAATCAGAATG



GGTTGAACGCTCTCCACCTTGCTTCCAAAGAAGGCCATGTAGAGGTTGTT



TCTGAGCTGCTGCAGAGAGAAGCCAATGTGGATGCAGCTACAAAGAAAG



GAAACACAGCATTGCACATCGCATCTTTGGCTGGGCAAGCAGAGGTGGT



AAAAGTCTTGGTTACAAATGGAGCCAATGTCAATGCACAATCTCAGAATG



GTTTCACGCCATTGTATATGGCAGCCCAGGAAAATCACCTGGAAGTTGTC



AAGTTTCTTCTTGACAATGGTGCAAGCCAGAGCCTAGCCACAGAGGATGG



CTTCACACCATTGGCAGTGGCTTTGCAACAAGGTCACGACCAAGTCGTTT



CGCTCCTGCTAGAGAATGACACCAAAGGAAAAGTGCGTCTCCCAGCTCTT



CATATCGCGGCCCGAAAAGACGACACGAAAGCCGCCGCCCTGCTGCTGC



AGAATGACAACAATGCAGATGTGGAATCAAAGAGTGGCTTCACTCCGCT



CCACATAGCTGCTCACTATGGAAATATCAATGTAGCCACGTTGCTGTTAA



ACCGAGCGGCTGCTGTGGATTTCACCGCAAGGAATGACATCACTCCTTTA



CATGTTGCATCAAAAAGAGGAAATGCAAATATGGTAAAACTATTGCTCG



ATCGAGGAGCTAAAATCGATGCCAAAACCAGGGATGGTCTGACACCACT



GCACTGTGGAGCAAGGAGTGGCCACGAGCAGGTGGTAGAAATGTTGCTT



GATCGAGCTGCCCCCATTCTTTCAAAAACCAAGAATGGATTATCTCCATT



GCACATGGCCACACAAGGGGATCATTTAAACTGCGTCCAGCTTCTCCTCC



AGCATAATGTACCCGTGGATGATGTCACCAATGACTACCTGACTGCCCTA



CACGTGGCTGCCCACTGTGGCCATTACAAAGTTGCCAAGGTTCTCTTGGA



TAAGAAAGCTAACCCCAATGCCAAAGCCCTGAATGGCTTTACCCCTCTTC



ATATTGCCTGCAAGAAGAATCGAATTAAAGTAATGGAACTCCTTCTGAAA



CACGGTGCATCCATCCAAGCTGTAACCGAGTCGGGCCTTACCCCAATCCA



TGTTGCTGCCTTCATGGGGCATGTAAATATTGTATCACAACTAATGCATC



ATGGAGCCTCACCAAACACCACCAATGTGAGAGGAGAAACAGCACTGCA



CATGGCAGCTCGCTCCGGCCAAGCTGAAGTTGTGCGGTATCTGGTACAAG



ACGGAGCTCAGGTAGAAGCTAAAGCTAAGGATGACCAAACACCACTCCA



CATTTCAGCCCGACTGGGGAAAGCAGACATAGTACAACAGCTGTTGCAG



CAAGGGGCATCTCCAAATGCAGCCACAACTTCTGGGTACACCCCACTTCA



CCTTTCCGCCCGAGAGGGGCATGAGGATGTGGCCGCGTTCCTTTTGGATC



ATGGAGCGTCTTTATCTATAACAACAAAGAAAGGATTTACTCCTCTTCAT



GTGGCAGCAAAATATGGAAAGCTTGAAGTCGCCAATCTCCTGCTACAGA



AAAGTGCATCTCCAGATGCTGCTGGGAAGAGCGGGCTAACACCACTGCA



TGTAGCTGCACATTACGATAATCAGAAAGTGGCCCTTCTGCTTTTGGACC



AAGGAGCCTCACCTCACGCAGCCGCAAAGAATGGTTATACGCCACTGCA



CATCGCTGCCAAAAAGAACCAGATGGACATAGCGACAACTCTGCTGGAA



TATGGTGCTGATGCCAACGCAGTTACCCGGCAAGGAATTGCTTCCGTCCA



TCTCGCAGCTCAGGAAGGGCACGTGGACATGGTGTCGCTGCTCCTCGGTA



GAAATGCGAATGTGAACCTGAGCAATAAGAGCGGCCTGACCCCACTCCA



TTTGGCTGCTCAAGAAGATCGAGTGAATGTGGCAGAAGTCCTCGTAAACC



AAGGGGCTCATGTGGACGCCCAGACAAAGATGGGATACACACCACTGCA



TGTGGGCTGCCACTATGGAAATATCAAGATTGTTAATTTCCTGCTCCAGC



ATTCTGCAAAAGTTAATGCCAAAACAAAGAATGGGTATACGCCATTACAT



CAAGCAGCACAGCAGGGGCATACGCATATAATAAATGTCTTACTTCAGA



ACAACGCCTCCCCCAATGAACTCACTGTGAATGGGAATACTGCCCTTGGC



ATTGCCCGGCGCCTCGGCTACATCTCAGTAGTGGACACCCTGAAGATAGT



GACCGAAGAGACCATGACCACAACTACTGTCACAGAGAAGCACAAAATG



AATGTTCCAGAAACGATGAATGAAGTTCTTGATATGTCTGATGATGAAGT



TCGTAAAGCCAATGCCCCTGAAATGCTCAGTGATGGCGAATATATCTCAG



ATGTTGAAGAAGGTGAAGATGCAATGACCGGGGACACAGACAAATATCT



TGGGCCACAGGACCTTAAGGAATTGGGTGATGATTCCCTGCCTGCAGAGG



GTTACATGGGCTTTAGTCTCGGAGCGCGTTCTGCCAGCCTCCGCTCCTTCA



GTTCGGATAGGTCTTACACCTTGAACAGAAGCTCCTATGCACGGGACAGC



ATGATGATTGAAGAACTCCTTGTGCCATCCAAAGAGCAGCATCTAACATT



CACAAGGGAATTTGATTCAGATTCTCTTAGACATTACAGCTGGGCTGCAG



ACACCTTAGACAATGTCAATCTTGTTTCAAGCCCCATTCATTCTGGGTTTC



TGGTTAGCTTTATGGTGGACGCGAGAGGGGGCTCCATGAGAGGAAGCCG



TCATCACGGGATGAGAATCATCATTCCTCCACGCAAGTGTACGGCCCCCA



CTCGAATCACCTGCCGTTTGGTAAAGAGACATAAACTGGCCAACCCACCC



CCCATGGTGGAAGGAGAGGGATTAGCCAGTAGGCTGGTAGAAATGGGTC



CTGCAGGGGCACAATTTTTAGGCCCTGTCATAGTGGAAATCCCTCACTTT



GGGTCCATGAGAGGAAAAGAGAGAGAACTCATTGTTCTTCGAAGTGAAA



ATGGTGAAACTTGGAAGGAGCATCAGTTTGACAGCAAAAATGAAGATTT



AACCGAGTTACTTAATGGCATGGATGAAGAACTTGATAGCCCAGAAGAG



TTAGGGAAAAAGCGTATCTGCAGGATTATCACGAAAGATTTCCCCCAGTA



TTTTGCAGTGGTTTCCCGGATTAAGCAGGAAAGCAACCAGATTGGTCCTG



AAGGTGGAATTCTGAGCAGCACCACAGTGCCCCTTGTTCAAGCATCTTTC



CCAGAGGGTGCCCTAACTAAAAGAATTCGAGTGGGCCTCCAGGCCCAGC



CTGTTCCAGATGAAATTGTGAAAAAGATCCTTGGAAACAAAGCAACTTTT



AGCCCAATTGTCACTGTGGAACCAAGAAGACGGAAATTCCATAAACCAA



TCACAATGACCATTCCGGTGCCCCCGCCCTCAGGAGAAGGTGTATCCAAT



GGATACAAAGGGGACACTACACCCAATCTGCGTCTTCTCTGTAGCATTAC



AGGGGGCACTTCGCCTGCTCAGTGGGAAGACATCACAGGAACAACTCCTT



TGACGTTTATAAAAGATTGTGTCTCCTTTACAACCAATGTTTCAGCCAGAT



TTTGGCTTGCAGACTGCCATCAAGTTTTAGAAACTGTGGGGTTAGCCACG



CAACTGTACAGAGAATTGATATGTGTTCCATATATGGCCAAGTTTGTTGTT



TTTGCCAAAATGAATGATCCCGTAGAATCTTCCTTGCGATGTTTCTGCATG



ACAGATGACAAAGTGGACAAAACTTTAGAGCAACAAGAGAATTTTGAGG



AAGTCGCAAGAAGCAAAGATATTGAGGTTCTGGAAGGAAAACCTATTTA



TGTTGATTGTTATGGAAATTTGGCCCCACTTACCAAAGGAGGACAGCAAC



TTGTTTTTAACTTTTATTCTTTCAAAGAAAATAGACTGCCATTTTCCATCA



AGATTAGAGACACCAGCCAAGAGCCCTGTGGTCGTCTGTCTTTTCTGAAA



GAACCAAAGACAACAAAAGGACTGCCTCAAACAGCGGTTTGCAACTTAA



ATATCACTCTGCCAGCACATAAAAAGGAGACAGAGTCAGATCAAGATGA



TGAGATTGAGAAAACAGATAGACGACAGAGCTTCGCATCCTTAGCTTTAC



GTAAGCGCTACAGCTACTTGACTGAGCCTGGAATGATTGAACGGAGTACA



GGAGCAACAAGATCCCTCCCCACCACTTACTCATACAAGCCATTCTTTTCT



ACAAGACCATACCAGTCCTGGACAACAGCTCCGATTACAGTGCCTGGGCC



AGCCAAGTCAGGCTTCACTTCCTTATCAAGTTCTTCCTCTAATACGCCATC



AGCTTCTCCGTTAAAATCAATATGGTCTGTTTCGACACCTTCTCCAATCAA



ATCCACATTAGGCGCGTCAACTACATCTTCAGTTAAATCCATTAGTGACG



TGGCATCTCCAATTAGATCCTTTCGGACAATGTCTTCGCCGATAAAAACT



GTGGTGTCACAATCTCCATACAATATCCAAGTTTCCTCTGGTACCCTGGCT



AGAGCTCCAGCAGTCACGGAAGCTACGCCCTTAAAAGGGCTGGCATCCA



ATTCTACGTTTTCCTCTCGAACCTCTCCAGTGACTACAGCAGGGTCTCTTT



TGGAGAGGTCATCAATTACTATGACACCCCCTGCCTCCCCCAAATCAAAC



ATTAATATGTATTCCTCAAGTTTGCCATTTAAGTCAATTATTACATCAGCA



GCACCGCTAATATCTTCACCTTTAAAGTCAGTGGTGTCTCCAGTTAAATCA



GCAGTTGATGTCATTTCATCAGCCAAAATTACAATGGCATCTTCTCTCTCA



TCACCTGTGAAGCAGATGCCTGGACATGCAGAGGTAGCATTAGTCAATGG



ATCTATTTCCCCTCTAAAATATCCATCATCCTCAACTTTAATTAATGGATG



CAAAGCCACTGCCACGTTACAGGAAAAAATTTCTTCTGCTACAAACTCTG



TGAGCTCTGTGGTCAGTGCAGCCACTGACACAGTTGAGAAAGTGTTTTCT



ACCACGACTGCAATGCCATTTTCCCCACTCAGGTCATATGTTTCTGCAGCA



CCATCAGCTTTTCAGTCTCTAAGAACTCCTTCCGCAAGTGCACTCTATACA



TCCCTTGGGTCGTCAATATCTGCAACTACCTCATCTGTAACTTCATCAATT



ATAACAGTGCCAGTATACTCTGTAGTCAATGTTTTGCCAGAACCAGCATT



AAAGAAACTTCCAGACTCTAATTCATTTACAAAATCAGCAGCAGCCTTGC



TGTCACCCATTAAAACATTGACTACGGAGACACATCCTCAGCCTCACTTC



AGTCGAACTTCATCTCCAGTTAAGTCATCTTTGTTCCTTGCACCCTCTGCC



CTTAAGTTGTCTACACCATCTTCTTTATCTTCCAGTCAGGAGATACTAAAA



GATGTAGCTGAAATGAAAGAGGACCTAATGCGGATGACCGCAATACTAC



AGACAGATGTGCCTGAGGAGAAGCCATTCCAACCTGAACTCCCAAAGGA



AGGGAGAATAGATGATGAAGAACCTTTCAAAATTGTAGAGAAAGTAAAG



GAAGACTTAGTGAAAGTTAGTGAAATCCTTAAAAAGGATGTATGTGTAG



ATAATAAAGGATCACCCAAATCACCAAAGAGTGACAAAGGACACTCTCC



TGAAGATGACTGGATAGAATTTAGTTCGGAAGAAATCCGGGAAGCCAGA



CAACAAGCTGCTGCGAGCCAGTCTCCATCTCTGCCAGAGAGAGTGCAAGT



AAAAGCAAAAGCCGCCTCCGAAAAGGATTATAACTTGACCAAAGTTATT



GATTACCTAACAAATGATATTGGGAGTAGTTCACTGACAAACTTAAAATA



CAAGTTTGAGGATGCAAAGAAGGATGGTGAGGAGAGACAGAAAAGAGTT



TTAAAACCAGCAATTGCTTTGCAGGAACACAAACTCAAAATGCCTCCAGC



CTCCATGAGGACTTCCACCTCTGAGAAAGAATTGTGTAAAATGGCTGATT



CCTTTTTTGGAACAGATACTATTTTAGAGTCTCCTGATGACTTTTCTCAAC



ACGACCAAGATAAAAGTCCCTTGTCTGACAGTGGCTTTGAAACAAGAAGT



GAAAAGACACCTTCAGCCCCACAAAGCGCTGAAAGCACTGGTCCTAAAC



CACTTTTTCATGAAGTTCCCATCCCTCCTGTCATTACAGAAACAAGAACTG



AAGTGGTTCATGTTATCAGGAGCTATGATCCCTCAGCTGGGGATGTTCCC



CAGACCCAACCAGAGGAGCCTGTGTCACCTAAACCTTCACCTACTTTTAT



GGAATTGGAACCAAAGCCCACCACCTCTAGTATTAAAGAAAAGGTTAAA



GCATTTCAAATGAAAGCCAGTAGTGAAGAAGATGACCACAATCGGGTTTT



AAGCAAAGGCATGCGTGTTAAAGAAGAGACTCACATAACCACAACCACC



AGAATGGTTTATCATTCTCCACCAGGCGGTGAAGGTGCATCTGAAAGAAT



TGAAGAAACCATGTCAGTCCATGACATCATGAAGGCCTTTCAGTCCGGGC



GGGATCCTTCCAAAGAACTGGCAGGTCTGTTTGAACATAAGTCGGCAGTG



TCTCCAGATGTTCACAAGTCTGCTGCTGAAACCTCAGCCCAGCATGCAGA



GAAGGACAACCAAATGAAACCCAAACTGGAGCGTATAATAGAAGTCCAC



ATCGAAAAAGGTAACCAAGCTGAGCCCACTGAAGTCATTATTAGAGAAA



CCAAAAAGCATCCAGAAAAAGAAATGTATGTATATCAGAAAGACTTATC



CCGGGGAGATATTAACCTAAAAGATTTTCTGCCAGAAAAACACGATGCTT



TTCCTTGTTCAGAGGAACAGGGTCAGCAAGAAGAAGAAGAACTTACTGC



TGAAGAGTCATTGCCTTCTTATCTGGAGTCTTCCAGAGTAAACACTCCTGT



GTCCCAAGAAGAAGATAGCCGCCCTAGTTCTGCTCAACTCATATCTGATG



ACTCTTATAAAACATTGAAGCTTTTGAGTCAACACTCAATAGAATACCAT



GACGATGAGTTGTCAGAACTAAGAGGGGAGTCTTACAGGTTTGCTGAGA



AAATGCTTCTGTCAGAAAAGCTAGATGTGTCTCATTCTGATACTGAGGAA



TCGGTTACAGACCATGCAGGACCCCCTAGCTCAGAGTTACAGGGGTCTGA



TAAGCGGTCCAGAGAAAAAATAGCCACTGCCCCCAAAAAAGAAATTCTC



TCCAAAATCTATAAAGATGTTTCTGAAAATGGTGTAGGTAAAGTGTCTAA



AGATGAGCATTTTGATAAAGTGACAGTGTTGCACTATTCTGGCAATGTTA



GTAGTCCAAAACATGCCATGTGGATGCGCTTTACTGAGGACAGATTAGAC



AGAGGTAGAGAGAAGTTGATATATGAAGATAGGGTGGACAGGACTGTGA



AGGAGGCTGAAGAAAAACTGACTGAAGTGTCACAGTTTTTTCGTGACAA



AACTGAAAAGCTAAATGATGAACTGCAGTCCCCAGAGAAAAAGGCACGC



CCTAAAAATGGCAAAGAATATTCTTCTCAAAGCCCTACCAGTAGCAGCCC



TGAGAAAGTGCTACTGACAGAACTGCTGGCATCCAATGATGAGTGGGTTA



AGGCAAGACAGCATGGCCCTGATGGACAAGGCTTCCCCAAGGCCGAGGA



GAAGGCACCCAGTCTGCCCAGCAGCCCAGAGAAGATGGTTCTCTCCCAAC



AGACTGAGGACAGCAAGTCCACAGTGGAAGCCAAAGGAAGTATTTCACA



GAGCAAAGCACCAGATGGGCCCCAGTCTGGATTCCAGCTCAAACAATCT



AAACTCAGTTCCATTAGATTAAAATTTGAACAAGGCACACACGCAAAAA



GTAAGGACATGTCTCAAGAAGACAGAAAGTCAGATGGCCAGTCCAGAAT



CCCAGTTAAAAAAATACAGGAGAGCAAGCTACCCGTCTACCAAGTTTTTG



CTAGAGAAAAACAGCAGAAGGCCATAGACCTCCCAGATGAAAGTGTATC



TGTGCAAAAAGATTTTATGGTATTAAAAACCAAAGATGAGCATGCCCAA



AGCAACGAAATTGTTGTAAATGATTCTGGCTCTGATAATGTGAAAAAACA



GAGAACTGAAATGTCAAGTAAAGCAATGCCTGACTCTTTTTCTGAGCAGC



AGGCTAAAGACTTGGCATGTCATATAACCTCAGATTTAGCAACTAGGGGA



CCATGGGACAAAAAGGTCTTTAGAACATGGGAGAGTTCGGGAGCCACTA



ACAATAAGTCTCAGAAAGAAAAACTTTCGCATGTACTTGTTCATGATGTA



AGAGAGAATCACATTGGTCACCCTGAGAGTAAAAGTGTTGATCAAAAGA



ATGAATTTATGTCTGTGACTGAGAGAGAACGCAAATTGTTAACAAACGGC



TCTCTCTCAGAAATTAAAGAAATGACTGTAAAATCTCCCTCCAAAAAAGT



CTTATATAGGGAATATGTTGTGAAAGAAGGGGACCATCCAGGCGGATTG



CTTGATCAGCCTTCCAGGAGGAGCGAGAGCTCAGCAGTGTCACACATTCC



CGTCAGAGTTGCTGATGAGAGGAGAATGCTGTCTTCTAATATTCCCGATG



GTTTTTGTGAACAGTCGGCATTTCCAAAACATGAACTATCACAAAAATTG



TCCCAGTCAAGCATGAGTAAAGAGACAGTTGAGACACAGCACTTTAATTC



TATAGAAGATGAAAAAGTTACCTATTCAGAAATCAGCAAAGTTTCCAAAC



ACCAGAGTTATGTAGGTTTATGCCCACCTCTCGAGGAAACCGAAACCTCC



CCCACCAAATCTCCTGATTCTTTAGAGTTTAGCCCAGGAAAGGAATCTCC



CTCTAGTGATGTATTCGACCACAGTCCCATTGATGGATTGGAAAAACTCG



CACCACTAGCCCAGACAGAGGGAGGGAAAGAGATAAAAACTTTACCCGT



TTATGTCAGTTTTGTACAAGTGGGGAAGCAATATGAAAAGGAGATACAA



CAAGGAGGTGTAAAAAAAATCATAAGTCAGGAATGTAAGACAGTACAAG



AAACCAGGGGGACCTTTTATACAACTAGACAGCAAAAGCAACCTCCTTCT



CCCCAAGGTAGTCCAGAAGATGATACTCTAGAGCAAGTATCCTTTCTAGA



CAGCTCTGGGAAAAGCCCTTTAACCCCAGAAACACCCAGTTCAGAGGAA



GTGAGTTATGAATTTACATCTAAGACACCTGACTCGCTCATAGCTTATAT



ACCAGGCAAACCCAGCCCAATTCCCGAGGTTTCTGAGGAGTCAGAGGAG



GAGGAACAGGCCAAGTCAACCTCCCTTAAGCAGACTACAGTGGAGGAAA



CAGCAGTTGAGCGTGAAATGCCTAATGACGTGAGCAAAGACTCTAACCA



AAGACCCAAAAATAACAGAGTTGCCTATATTGAATTTCCCCCTCCTCCAC



CACTGGATGCGGACCAGATTGAGTCAGATAAGAAGCATCATTATCTCCCA



GAAAAAGAGGTTGACATGATTGAAGTCAATCTGCAAGATGAGCATGACA



AGTACCAGCTGGCTGAACCTGTCATTAGAGTGCAGCCACCTTCACCAGTT



CCTCCCGGGGCAGACGTCAGTGATTCAAGCGATGACGAATCTATTTATCA



GCCAGTCCCAGTTAAAAAATATACCTTCAAATTAAAGGAAGTGGACGAT



GAACAAAAAGAAAAACCCAAAGCTTCTGCTGAAAAGGCTTCCAACCAGA



AAGAACTGGAAAGTAATGGATCTGGAAAAGATAATGAATTTGGCCTTGG



CCTTGATTCACCTCAGAATGAAATTGCCCAGAATGGGAACAACGACCAGT



CCATCACAGAGTGTTCCATTGCCACCACAGCAGAGTTTTCTCATGACACG



GATGCCACAGAGATCGACTCTCTGGATGGCTATGACCTGCAAGATGAAG



ATGATGGCTTGACAGAGAGTGATTCTAAACTCCCAATTCAAGCCATGGAA



ATTAAGAAAGATATCTGGAACACAGAGGGCATTCTGAAGCCAGCTGACC



GCTCTTTTAGCCAAAGTAAACTTGAAGTTATCGAGGAGGAGGGAAAGGT



GGGACCAGATGAGGACAAGCCACCTTCTAAAAGTTCTTCATCTGAAAAG



ACTCCTGATAAGACTGATCAGAAGTCAGGGGCCCAGTTCTTCACACTGGA



AGGCAGACATCCTGACAGATCAGTGTTTCCTGATACTTACTTCAGTTACA



AAGTAGATGAAGAATTTGCCACTCCTTTTAAAACAGTAGCTACCAAAGGT



CTAGATTTTGACCCTTGGTCTAATAACCGAGGGGATGATGAAGTTTTTGA



CAGTAAATCACGGGAAGATGAAACTAAGCCATTTGGGCTGGCGGTAGAA



GACCGCTCTCCAGCAACAACCCCTGATACAACGCCAGCCAGAACGCCAA



CTGATGAAAGTACCCCAACTAGTGAGCCTAACCCCTTCCCATTTCATGAA



GGAAAAATGTTTGAGATGACTCGCAGTGGTGCAATTGACATGAGCAAGA



GGGATTTTGTTGAAGAGAGGCTCCAATTTTTCCAGATTGGTGAGCATACT



TCTGAAGGGAAGTCAGGGGACCAGGGGGAAGGGGATAAAAGTATGGTCA



CTGCCACACCACAGCCACAGTCAGGGGACACCACTGTAGAAACCAATCT



AGAGAGAAATGTAGAGACACCTACAGTGGAACCTAACCCCAGCATCCCG



ACCAGCGGAGAGTGTCAGGAAGGCACATCCAGTAGTGGCTCCCTGGAGA



AATCAGCAGCAGCCACTAACACCTCTAAAGTTGACCCCAAGTTGCGCACG



CCTATAAAAATGGGAATTTCTGCATCCACCATGACCATGAAGAAAGAAG



GCCCTGGAGAAATAACAGATAAGATAGAAGCGGTGATGACCAGTTGTCA



GGGATTAGAAAATGAAACTATAACAATGATTTCAAATACAGCCAATAGC



CAGATGGGCGTTAGGCCCCATGAAAAACATGATTTTCAAAAAGATAACTT



TAATAACAACAACAATTTGGATTCTTCCACTATACAGACAGATAACATTA



TGAGTAATATAGTTCTGACAGAACATTCTGCACCCACTTGTACCACAGAG



AAAGATAACCCAGTGAAAGTCTCATCAGGAAAAAAGACAGGGGTACTAC



AAGGACACTGTGTAAGAGATAAGCAGAAAGTTCTTGGAGAACAGCAAAA



AACAAAGGAATTGATAGGGATTAGGCAAAAATCCAAACTTCCCATAAAG



GCCACTTCACCAAAAGATACCTTCCCACCGAACCATATGTCAAACACTAA



AGCAAGTAAAATGAAGCAGGTTAGTCAATCCGAGAAAACCAAAGCCCTT



ACTACTTCTTCATGTGTAGATGTAAAGTCCAGAATTCCAGTGAAAAACAC



ACACAGGGATAACATAATTGCAGTTAGAAAAGCATGTGCCACACAAAAG



CAAGGGCAGCCAGAGAAAGGCAAGGCCAAACAGCTTCCATCCAAGTTGC



CAGTAAAGGTAAGATCCACCTGTGTCACTACCACCACCACCACTGCCACC



ACCACCACCACTACCACCACTACCACCACCACCAGCTGCACAGTTAAAGT



TAGGAAAAGTCAGCTCAAGGAAGTATGTAAACATTCCATTGAATATTTTA



AGGGAATTAGTGGTGAGACCTTAAAGCTTGTGGACCGCCTCTCTGAAGAA



GAAAAAAAGATGCAGTCCGAGTTGTCCGATGAGGAAGAAAGTACCTCAA



GAAACACGTCGTTGTCCGAGACTTCCCGGGGTGGCCAGCCTTCGGTTACA



ACGAAGTCTGCTAGAGATAAGAAAACAGAGGCAGCACCTTTAAAATCAA



AGAGTGAAAAGGCCGGCAGTGAGAAAAGGAGCAGTAGAAGGACTGGTC



CACAGAGTCCATGTGAACGGACAGATATCAGGATGGCAATAGTAGCCGA



TCACCTGGGACTTAGTTGGACAGAACTGGCAAGGGAACTGAATTTTTCAG



TGGATGAAATCAATCAAATACGTGTGGAAAATCCAAATTCTTTAATTTCT



CAGAGCTTCATGTTATTAAAAAAATGGGTTACCAGAGACGGAAAAAATG



CCACAACTGATGCCTTAACTTCGGTCTTGACAAAAATTAATCGAATAGAT



ATAGTGACACTGCTAGAAGGACCAATATTTGATTATGGAAATATTTCAGG



CACCAGAAGTTTTGCAGATGAGAACAATGTTTTCCATGACCCTGTTGATG



GTTGGCAGAATGAGACATCAAGTGGAAACCTAGAGTCCTGCGCTCAAGC



TCGAAGAGTAACTGGTGGGTTACTAGATCGACTGGATGACAGCCCTGACC



AGTGTAGAGATTCCATTACCTCATATCTCAAAGGAGAAGCTGGCAAATTT



GAAGCAAATGGAAGCCATACAGAAATCACTCCAGAAGCAAAGACAAAAT



CTTACTTTCCAGAATCCCAAAATGATGTAGGAAAACAGAGTACCAAGGA



AACTCTGAAACCAAAAATACATGGATCTGGTCATGTTGAAGAACCAGCAT



CACCACTAGCAGCATATCAGAAATCTCTAGAAGAAACCAGCAAGCTTAT



AATAGAAGAGACTAAACCCTGTGTGCCTGTCAGTATGAAAAAGATGAGT



AGGACTTCTCCAGCAGATGGCAAGCCAAGGCTTAGCCTCCATGAAGAAG



AGGGGTCCAGTGGGTCTGAGCAAAAGCAGGGAGAAGGTTTTAAGGTGAA



AACGAAGAAAGAAATCCGGCATGTGGAAAAGAAGAGCCACTCGTAA





THOC2->
ATGGCGGCCGCGGCTGTGGTGGTTCCCGCAGAGTGGATAAAGAACTGGG
SEQ ID


DOCK11
AGAAATCAGGGAGAGGCGAATTTTTGCATTTATGTCGGATCCTCAGTGAA
NO: 56



AATAAAAGCCATGATAGTTCAACATACAGAGATTTCCAGCAAGCTCTCTA



TGAGTTGTCATATCATGTAATTAAAGGAAATCTAAAGCATGAACAGGCAT



CTAATGTTCTTAGTGACATTAGTGAATTTCGTGAGGATATGCCCTCCATTC



TTGCTGATGTATTCTGCATATTAGACATTGAGACAAATTGTTTAGAAGAA



AAAAGCAAGAGAGACTATTTTACACAGTTGGTATTAGCATGTTTGTATTT



AGTTTCAGACACAGTTCTAAAGGAACGCCTGGATCCAGAAACACTGGAA



TCATTAGGGCTTATCAAACAATCACAGCAATTCAATCAAAAGTCAGTTAA



AATCAAGACAAAACTCTTTTATAAGCAGCAAAAATTCAATTTGTTAAGAG



AAGAGAATGAAGGTTATGCCAAGCTGATTGCTGAATTGGGGCAAGATTT



ATCTGGAAGTATTACTAGTGATTTAATCTTAGAAAATATCAAATCTTTAAT



AGGATGCTTTAATCTGGATCCCAATAGAGTTTTGGATGTCATTTTAGAAG



TGTTTGAATGCAGGCCAGAACACGATGACTTCTTTATATCTTTGTTAGAAT



CTTACATGAGTATGTGTGAACCGCAAACACTGTGTCATATTCTTGGGTTC



AAATTCAAGTTTTACCAGGAACCAAATGGCGAGACACCATCATCTTTATA



CAGAGTTGCAGCAGTACTTCTACAATTTAATCTTATTGATTTAGATGATCT



TTATGTACATCTTCTTCCGGCTGATAATTGCATTATGGATGAACACAAAC



GAGAAATTGCGGAAGCTAAGCAAATTGTTAGAAAGCTTACGATGGTTGT



GTTGTCTTCTGAAAAAATGGATGAGCGAGAGAAAGAAAAGGAAAAAGAA



GAGGAGAAAGTAGAGAAACCACCTGATAACCAAAAACTTGGCTTGTTGG



AAGCCTTATTAAAGATTGGTGATTGGCAACATGCACAGAACATTATGGAT



CAGATGCCTCCATACTATGCAGCTTCACACAAGCTAATAGCCCTTGCTAT



TTGCAAGCTCATTCATATAACTATTGAGCCTCTCTACCGAAGAGTTGGAG



TTCCTAAAGGTGCTAAAGGCTCACCTGTTAATGCTTTGCAAAACAAGAGA



GCACCAAAACAAGCTGAGAGCTTTGAAGATTTGAGGAGAGACGTGTTCA



ATATGTTCTGTTACCTTGGTCCTCACCTTTCTCACGATCCCATTTTATTTGC



AAAAGTGGTGCGCATAGGCAAGTCATTTATGAAGGAGGAAAAGGCCAAA



GTTGTTGAGCCCCTGGACTATGAGAATGTTATTGCCCAAAGAAAAACCCA



GATTTACAGCGACCCCCTCCGAGATCTGCTTATGTTCCCAATGGAAGATA



TATCTATCTCGGTGATAGGTCGTCAACGCAGAACGGTGCAGTCTACTGTA



CCAGAAGATGCTGAAAAGAGGGCCCAGAGTTTATTTGTTAAAGAGTGTAT



TAAAACCTATAGCACAGATTGGCACGTGGTAAACTACAAGTATGAGGAC



TTCTCTGGGGACTTTCGAATGTTGCCATGTAAATCTTTGAGACCAGAAAA



GATTCCTAATCATGTATTTGAGATAGATGAAGACTGTGAGAAAGATGAGG



ACTCATCTTCTTTATGTTCTCAGAAGGGTGGTGTGATAAAACAAGGCTGG



TTGCATAAAGCAAATGTAAATAGTACCATCACAGTAACCATGAAGGTATT



CAAGAGACGATATTTTTACTTGACCCAACTTCCTGACGGTTCATATATTCT



CAATTCCTATAAAGATGAGAAAAATTCAAAAGAATCGAAAGGTTGCATC



TACTTGGACGCCTGCATTGATGTTGTTCAGTGCCCCAAAATGCGCCGTCA



TGCTTTTGAACTCAAGATGTTAGATAAATATAGCCATTATCTGGCTGCTG



AAACTGAGCAGGAAATGGAGGAATGGTTGATAACTTTGAAAAAGATTAT



TCAGATCAACACCGACAGTTTAGTTCAAGAAAAAAAGGAGACGGTAGAA



ACAGCACAAGATGATGAAACTAGCAGCCAAGGAAAAGCCGAGAACATCA



TGGCAAGTTTGGAAAGGAGCATGCATCCGGAACTGATGAAGTATGGAAG



AGAAACTGAACAACTAAACAAACTCAGTAGAGGAGATGGAAGACAGAAT



CTCTTTTCTTTTGATTCAGAAGTTCAGAGGTTGGACTTTTCAGGAATTGAA



CCTGATATAAAGCCATTTGAAGAAAAATGCAATAAACGTTTCCTGGTGAA



TTGCCATGATTTAACTTTCAATATCTTGGGCCAAATTGGAGACAATGCAA



AAGGACCACCCACAAATGTTGAGCCCTTTTTTATCAATCTTGCCTTATTTG



ATGTAAAGAACAATTGTAAGATTTCAGCAGACTTTCATGTAGACCTGAAT



CCCCCATCTGTCCGTGAAATGCTGTGGGGCTCTTCAACCCAACTGGCCAG



TGACGGTAGCCCAAAGGGCTCTTCACCCGAATCTTACATTCATGGAATTG



CCGAATCTCAGTTACGCTACATACAACAGGGAATTTTCTCAGTGACGAAT



CCACATCCTGAAATTTTTCTAGTTGCCAGAATTGAAAAGGTACTACAGGG



AAACATTACACACTGTGCAGAACCCTATATCAAAAATTCTGATCCAGTAA



AGACGGCCCAGAAGGTGCACAGGACAGCTAAACAAGTGTGTAGCCGCCT



TGGACAATACAGAATGCCCTTCGCTTGGGCTGCCAGACCCATTTTCAAAG



ATACTCAAGGCTCTCTTGATCTGGATGGGAGATTTTCTCCTCTGTATAAAC



AAGACAGTAGCAAGCTTTCAAGTGAAGACATTCTCAAGTTGCTCTCAGAA



TATAAGAAGCCAGAAAAGACCAAACTGCAGATTATTCCTGGGCAGCTAA



ACATCACAGTAGAATGTGTTCCTGTGGATTTATCAAATTGTATTACTTCTT



CATATGTGCCCTTGAAGCCTTTTGAAAAGAATTGCCAAAATATTACTGTG



GAGGTTGAAGAGTTTGTTCCAGAAATGACAAAATATTGTTATCCATTTAC



TATTTACAAAAACCATCTGTATGTATATCCCCTGCAATTAAAATACGATA



GCCAGAAAACATTTGCCAAGGCAAGGAACATTGCAGTCTGTGTGGAATTC



CGGGATTCAGATGAAAGTGACGCTAGTGCCCTAAAGTGTATTTATGGAAA



ACCTGCAGGGTCTGTTTTTACCACAAATGCTTATGCTGTTGTCTCGCATCA



CAACCAAAATCCAGAGTTCTATGATGAGATTAAAATTGAGCTTCCCATTC



ACCTACATCAAAAACATCATTTGCTTTTCACTTTTTATCATGTAAGTTGTG



AAATTAACACAAAGGGAACAACCAAAAAGCAAGACACAGTTGAAACTCC



AGTTGGGTTTGCCTGGGTACCTTTGCTGAAAGATGGTAGAATCATCACAT



TTGAGCAGCAGCTGCCAGTTTCCGCCAATCTTCCCCCAGGCTACTTGAAT



CTGAATGATGCAGAATCAAGAAGGCAATGTAACGTGGATATTAAATGGG



TAGATGGTGCAAAGCCTTTGTTGAAGATTAAAAGCCACTTAGAATCTACC



ATTTACACTCAAGATCTGCATGTGCACAAATTCTTCCATCATTGCCAGCTG



ATTCAGTCAGGCTCGAAAGAAGTTCCAGGGGAGCTCATTAAATATTTAAA



GTGTTTGCATGCCATGGAGATCCAAGTCATGATACAGTTTCTACCTGTAA



TTCTTATGCAACTCTTCCGAGTTCTCACAAATATGACCCATGAAGATGAC



GTTCCTATCAACTGCACCATGGTTCTCTTACATATTGTATCAAAGTGCCAT



GAAGAAGGCTTGGATAGTTATCTAAGATCATTCATAAAGTATAGCTTCCG



ACCTGAAAAACCGAGTGCTCCTCAGGCCCAGCTGATACATGAAACCCTGG



CTACTACGATGATAGCAATATTGAAACAGTCTGCAGATTTTTTATCAATA



AACAAATTGCTAAAGTACTCATGGTTTTTCTTTGAAATAATTGCAAAGTC



AATGGCCACATACTTGTTGGAAGAGAATAAGATTAAGCTTCCCCGAGGCC



AGAGATTTCCCGAGACATATCATCATGTCTTACATTCACTGCTTCTTGCAA



TAATTCCCCATGTGACTATTCGGTATGCGGAGATTCCCGATGAGTCCAGA



AATGTGAACTATAGTTTGGCTAGCTTCCTGAAGCGCTGTTTGACACTAAT



GGATAGAGGATTTATTTTCAATTTAATAAATGACTATATATCTGGATTCA



GCCCCAAAGATCCTAAGGTTCTGGCTGAATACAAGTTTGAATTTCTGCAA



ACAATTTGCAATCACGAACATTACATTCCTCTGAACTTGCCAATGGCATTT



GCAAAACCTAAACTGCAGCGGGTTCAAGATTCAAATCTTGAATACAGTTT



ATCAGATGAGTATTGCAAGCATCACTTCTTGGTTGGTCTACTTCTGAGGG



AAACTTCCATTGCTCTTCAGGACAATTATGAGATCAGATATACAGCTATC



TCTGTTATAAAGAATCTTTTGATAAAACATGCATTTGACACAAGATACCA



GCACAAGAACCAACAAGCCAAAATAGCACAATTGTACCTCCCCTTTGTTG



GACTACTTTTGGAAAATATACAGCGATTAGCAGGTCGAGATACCTTGTAT



TCTTGTGCAGCCATGCCTAATTCTGCATCCAGAGATGAGTTTCCATGTGGC



TTTACTTCACCTGCCAATAGAGGGAGTCTGAGCACTGACAAAGACACCGC



TTATGGGTCTTTTCAAAATGGACATGGAATTAAGAGAGAAGATTCAAGAG



GTTCCCTCATCCCAGAAGGAGCAACAGGATTTCCAGATCAGGGCAACACT



GGTGAAAATACCCGACAGAGTTCTACAAGGAGTAGTGTATCCCAGTATA



ACCGCCTGGATCAGTATGAAATCAGAAGCCTCCTGATGTGCTACCTGTAT



ATAGTAAAAATGATTTCAGAAGATACTCTCTTAACTTACTGGAATAAAGT



ATCACCTCAGGAGCTCATAAACATTCTTATACTTTTAGAAGTATGCTTGTT



TCACTTTAGATATATGGGGAAAAGAAACATAGCAAGGGTGCATGATGCC



TGGCTGTCAAAACACTTCGGAATAGACCGAAAATCGCAAACCATGCCTGC



TCTTCGAAACAGATCAGGAGTAATGCAGGCCCGGCTTCAGCATCTTAGTA



GCCTAGAAAGTTCATTTACACTTAATCACAGTTCTACAACAACTGAAGCA



GACATTTTCCACCAGGCACTTCTTGAAGGCAATACAGCTACTGAAGTTTC



CCTAACAGTACTAGACACCATATCATTTTTCACTCAGTGCTTCAAGACCC



AACTTTTAAATAATGATGGCCATAACCCATTAATGAAAAAAGTGTTTGAT



ATACATCTTGCTTTTCTTAAAAATGGACAATCTGAAGTGTCGCTGAAACA



TGTATTTGCCTCACTGAGAGCTTTCATCAGTAAGTTTCCTTCAGCATTTTT



CAAAGGAAGAGTAAACATGTGTGCTGCATTTTGCTATGAGGTTTTAAAGT



GCTGCACATCGAAGATTAGCTCAACCAGGAATGAAGCATCTGCACTTTTG



TATCTTTTGATGAGAAACAACTTTGAGTATACCAAAAGGAAAACCTTTTT



GAGGACACATCTACAGATAATAATTGCTGTAAGCCAACTGATAGCTGATG



TAGCACTAAGCGGAGGATCAAGATTTCAGGAGTCTTTATTCATTATCAAT



AATTTTGCAAATAGTGACAGACCTATGAAGGCAACTGCCTTTCCCGCAGA



AGTCAAAGACTTGACCAAGAGAATCCGCACTGTTCTTATGGCCACTGCCC



AAATGAAGGAGCATGAGAAAGACCCTGAAATGCTAATTGATCTCCAGTA



TAGCTTAGCCAAGTCCTATGCAAGCACCCCAGAGCTCAGGAAAACCTGGC



TTGATAGCATGGCCAAGATTCATGTAAAAAATGGAGATTTTTCAGAGGCT



GCGATGTGTTATGTCCATGTAGCAGCTCTAGTTGCAGAGTTTCTTCATCGA



AAAAAATTATTTCCTAACGGATGTTCAGCGTTCAAGAAAATTACTCCCAA



TATAGATGAAGAAGGAGCAATGAAAGAAGATGCTGGGATGATGGATGTC



CATTATAGTGAAGAGGTCTTGCTGGAGTTGCTAGAACAATGTGTGGATGG



CTTATGGAAGGCAGAACGTTATGAAATAATTTCTGAGATTTCCAAGTTGA



TCGTTCCAATTTATGAGAAACGTCGTGAGTTTGAGAAACTTACTCAAGTT



TATAGAACTCTTCATGGAGCTTACACAAAAATTCTGGAAGTTATGCATAC



AAAAAAGAGACTTTTAGGCACTTTCTTCAGAGTTGCCTTTTATGGCCAAT



CTTTTTTTGAAGAAGAAGATGGAAAGGAGTACATCTATAAAGAACCAAA



GCTCACTGGCCTCTCAGAAATTTCCTTGAGACTTGTTAAACTTTATGGTGA



AAAGTTTGGTACGGAGAATGTCAAAATAATTCAGGATTCAGACAAGGTA



AATGCCAAAGAGCTTGATCCAAAATATGCTCATATACAAGTTACTTATGT



GAAGCCTTACTTTGATGACAAAGAACTCACAGAAAGGAAGACCGAGTTT



GAAAGAAATCATAATATCAGCAGATTTGTTTTTGAGGCCCCTTACACTTT



ATCAGGCAAAAAACAGGGCTGTATAGAAGAACAGTGCAAACGCCGTACA



ATCTTGACAACTTCAAACTCGTTTCCTTACGTGAAGAAGAGGATTCCTATT



AACTGTGAACAGCAGATTAATTTAAAACCAATTGATGTTGCCACTGATGA



AATAAAAGATAAAACTGCAGAGCTGCAAAAGCTTTGCTCCTCTACTGACG



TGGACATGATTCAGCTCCAACTTAAATTGCAGGGCTGTGTTTCTGTGCAG



GTCAATGCTGGTCCATTAGCATATGCAAGAGCTTTCTTAAATGACAGCCA



AGCTAGCAAGTATCCACCTAAGAAAGTGAGTGAGTTGAAAGACATGTTT



AGGAAATTTATACAAGCATGCAGCATTGCACTTGAACTAAATGAGCGGCT



AATTAAAGAAGATCAAGTTGAGTACCATGAAGGGCTAAAGTCAAATTTC



AGAGACATGGTAAAAGAATTATCTGACATTATCCATGAGCAGATATTACA



AGAAGACACAATGCATTCTCCCTGGATGAGCAACACATTACATGTATTTT



GTGCAATTAGTGGTACATCAAGTGACCGAGGTTATGGTTCCCCAAGATAC



GCTGAAGTGTGA





RAF1->
CAGAATCGGAGAGCCGGTGGCGTCGCAGGTCGGGAGGACGAGCACCGAG
SEQ ID


NKIRAS1
TCGAGGGCTCGCTCGTCTGGGCCGCCCGAGAGTCTTAATCGCGGGCGCTT
NO: 57



GGGCCGCCATCTTAGATGGCGGGAGTAAGAGGAAAACGATTGTGAGGCG



GGAACGGCTTTCTGCTGCCTTTTTTGGGCCCCGAAAAGGGTCAGCTGGCC



GGGCTTTGGGGCGCGTGCCCTGAGGCGCGGAGCGCGTTTGCTACGATGCG



GGGGCTGCTCGGGGCTCCGTCCCCTGGGCTGGGGACGCGCCGAATGTGAC



CGCCTCCCGCTCCCTCACCCGCCGCGGGGAGGAGGAGCGGGCGAGAAGC



TGCCGCCGAACGACAGGACGTTGGGGCGGCCTGGCTCCCTCAGGAATGG



AAGATTGCGAAACAATGGAAGATGTATACATGGCTTCAGTAGAAACAGA



CCGAGGAGTAAAAGAACAGTTACATCTTTATGACACCAGAGGTCTACAG



GAAGGCGTGGAGCTGCCAAAGCATTATTTTTCATTTGCTGATGGCTTCGTT



CTTGTGTACAGTGTGAATAACCTTGAATCCTTTCAAAGAGTGGAGCTTCT



GAAGAAAGAAATCGATAAGTTCAAAGACAAAAAAGAGGTAGCAATTGTG



GTATTAGGAAACAAAATCGACCTTTCTGAGCAGAGACAAGTGGACGCTG



AAGTGGCACAGCAGTGGGCAAAAAGTGAGAAAGTAAGACTGTGGGAGGT



GACTGTTACAGATCGGAAAACTCTGATTGAACCATTCACTTTATTAGCCA



GTAAACTTTCTCAACCCCAGAGCAAATCAAGCTTTCCTTTGCCTGGGAGG



AAAAACAAAGGGAACTCTAATTCTGAGAACTAAAAATCAGTAATTTCCA



CAATTGTATGTTGAATAGTGATTGCCTTTAAGTGTCTGTGAACATGGAGT



AATATTACTATTTAAAATAGGCCATTTGTATCTACCTTTGGTCCTTAGGAA



AATTCCTAAGGAAGTCAATTAATGCACTTTAGATGTTAAAAGTATTTGGG



CTAAGGTTATTATTGCCTGATATGAAATAATATATTCTTATTCTCATTGTT



TGAAACCTGTCTTTGAAATTAGCACCTTTGTTATTTATGTTGTACTTGTGA



AAACAGTAAAATAGTTTGGATAGTTATGCAAATGCACCTATGTGTAACTT



CCCCCCAACCCCAAGCTGTTTCGGAAGATATCATAATCATTCTGTGTAAC



ATTATGCAAACTTCTAAGCCCAAACATGACTTTGTTTTTAAAAAGTTCATT



AATCTAATGTCTAGGATTATAAAACATTTTTTTGTGTCTAAATTGGACCCA



AAACATTGAACAGTTTGGGGTAGTAAGCTAAATTTCATCTTGTGGAGATT



TTGCTAAACAGACTAAGACCCATGATTTAGCTTTGCTCAAATTAGAATGT



TTAGCATGAGTTGAGGTACCAGGTAGTGTTAAGTAGGTTCATCACGCTCT



AAGGCCGTTTTTTCCTTAGCCAGACCCCTGTTGATAGACCAGATACTTGA



GGGCAAACTGTTTGCTCCTCCTCTTGAAAATGATTAGGCACTTAAGGACA



GTAAAGCTGTATTTTCTGGAAGGAAGACTGTATCTTCTGGAATAGTTTTCT



AGAAAACTAGTCATATACAATAAAAGTATCAAAAATATTGGGCTCTAATT



TGATCTGACTTAGATGTCTGAGTTTGTGTTGTTTCTCTAAAGATTTTGGCA



AGACTCAAGCAATGTGGCTGACTGTAACTTTATTAATTTAAAAGGTAGGA



AGTAAGCTACTTAGTGGTTTCACCTGTGAAATAACTATTTTGACTGAAAT



GTAAAATAAGCTATTCAACAAAGAACATATTAAAACATCAA





ZEB1->
ATGGCGGATGGCCCCAGGTGTAAGCGCAGAAAGCAGGCGAACCCGCGGC
SEQ ID


PLEKHF2
GCAATAACGATGGTGGATCGCTTGGCAAACAGTGAAGCAAATACTAGAC
NO: 58



GTATAAGTATAGTGGAAAACTGTTTTGGAGCAGCTGGTCAACCTTTAACT



ATACCTGGACGAGTTCTTATTGGAGAAGGAGTATTGACTAAGTTGTGCAG



GAAAAAGCCCAAAGCAAGGCAGTTTTTCTTGTTTAATGATATTCTTGTAT



ATGGCAATATTGTCATCCAGAAGAAAAAATATAACAAACAACATATTATT



CCCCTGGAAAATGTCACTATTGATTCCATCAAAGATGAGGGAGACTTAAG



GAATGGATGGCTAATCAAGACACCAACTAAATCTTTTGCAGTTTATGCTG



CCACTGCTACGGAGAAATCAGAATGGATGAATCATATAAATAAATGTGTT



ACTGATTTACTCTCCAAAAGTGGGAAGACACCCAGTAATGAACATGCTGC



TGTCTGGGTTCCTGACTCTGAGGCAACTGTATGTATGCGTTGTCAGAAAG



CAAAATTCACACCTGTTAATCGTCGCCACCATTGCCGCAAATGTGGTTTT



GTTGTCTGTGGGCCCTGCTCTGAAAAGAGATTTCTTCTTCCCAGCCAGTCC



TCTAAGCCTGTGCGGATTTGTGACTTCTGCTATGACCTGCTTTCTGCTGGG



GACATGGCCACATGCCAGCCTGCTAGATCAGACTCTTACAGCCAGTCATT



GAAGTCTCCTTTAAATGATATGTCTGATGATGATGACGATGATGATAGCA



GTGACTAA





PPP1R12C
ATGTCCGGAGAGGATGGCCCGGCGGCTGGCCCGGGGGCGGCGGCGGCGG
SEQ ID


->
CTGCCCGGGAGCGGCGACGGGAGCAGCTGCGGCAGTGGGGGGCGCGGGC
NO: 59


IFITM10
GGGCGCCGAGCCTGGCCCCGGAGAGCGCCGCGCCCGCACCGTCCGCTTC



GAGCGCGCCGCCGAGTTCCTGGCGGCCTGTGCGGGCGGCGACCTGGACG



AGGCGCGTCTGATGCTGCGCGCCGCCGACCCTGGCCCCGGCGCCGAGCTC



GACCCCGCCGCGCCGCCGCCCGCCCGCGCCGTGCTGGACTCCACCAACGC



CGACGGTATCAGCGCCCTGCACCAGGCCTGCATTGATGAGAACCTGGAG



GTGGTGCGCTTCTTGGTGGAGCAGGGCGCCACTGTGAACCAGGCAGACA



ACGAGGGCTGGACGCCACTGCACGTGGCCGCCTCCTGTGGCTACCTAGAT



ATCGCCAGGTACCTCCTGAGCCACGGGGCCAACATCGCCGCCGTCAACAG



TGACGGGGACCTGCCCCTGGACCTGGCCGAGTCGGACGCCATGGAGGGG



CTGCTGAAGGCGGAGATCGCCCGCCGAGGTGTGGATGTGGAAGCAGCCA



AGCGGGCAGAAGAGGAATTGCTCCTTCATGACACGAGGTGCTGGCTGAA



TGGGGGCGCCATGCCAGAGGCCCGGCACCCCCGCACAGGCGCCTCTGCC



CTGCACGTGGCTGCTGCCAAGGGCTACATTGAGGTGATGAGGTTGCTCCT



TCAGGCTGGCTACGACCCAGAGCTCCGGGACGGGGACGGCTGGACTCCC



CTGCACGCAGCGGCACACTGGGGCGTGGAGGATGCCTGCCGCCTGCTGG



CCGAGCATGGCGGGGGCATGGACTCACTGACCCATGCGGGGCAGCGTCC



CTGTGACCTGGCCGATGAGGAAGTACTGAGCCTGTTGGAGGAACTGGCCC



GGAAACAGGAGGACGCCCAGGGCCCCGGCCAGTGCCCAGCCCCGCTGGG



AGACCCGGCCAGCACCACGGACGGCGCCCAGGAAGCCCGAGTCCCCCTG



GACGGGGCCTTCTGGATTCCGAGGCCCCCGGCAGGTTCGCCCAAGGGCTG



CTTCGCTTGCGTGTCCAAGCCCCCTGCCCTGCAGGCTCCGGCGGCCCCTG



CCCCTGAGCCCTCGGCCTCTCCCCCGATGGCGCCCACACTGTTCCCCATG



GAGTCCAAGAGCAGCAAGACCGACAGCGTGCGGGCTGCCGGCGCGCCCC



CTGCCTGCAAGCACCTAGCCGAGAAGAAGACGATGACCAACCCCACGAC



CGTCATCGAGGTCTACCCGGACACCACCGAGGTGAACGACTATTACCTGT



GGTCCATCTTCAACTTCGTCTACCTCAACTTCTGCTGCCTGGGCTTCATCG



CCTTGGCCTACTCCCTCAAAGTGCGAGACAAGAAGCTTCTCAATGACCTG



AATGGAGCCGTGGAGGATGCAAAGACGGCCCGGCTGTTCAACATCACCA



GTTCTGCCCTGGCAGCCTCCTGCATCATCCTCGTCTTCATCTTCCTGCGGT



ACCCCCTCACCGACTACTAA





CEP152->
ATGTCATTAGACTTTGGCAGTGTGGCACTACCAGTGCAAAATGAAGATGA
SEQ ID


IQGAP1
AGAGTATGACGAAGAGGACTATGAAAGAGAGAAAGAGTTGCAGCAGTTA
NO: 60



CTCACAGACCTTCCCCATGACATGCTGGATGACGACCTCTCCTCTCCAGA



GCTCCAGTATTCGGACTGCAGCGAGGATGGCACAGACGGACAACCACAT



CATCCTGAGCAATTGGAGATGAGCTGGAATGAGCAAATGCTGCCCAAAT



CTCAAAGTGTAAATGGCTATAATGAAATTCAGAGTTTATATGCTGGAGAA



AAATGTGGTAATGTCTGGGAAGAAAATAGAAGTAAAACTGAAGACCGAC



ATCCTGTGTACCATCCTGAAGAAGGTGGAGATGAAGGTGGAAGTGGTTAT



AGTCCTCCAAGTAAATGTGAACAGACTGATTTATATCACCTTCCTGAAAA



CTTTAGGCCATATACCAATGGTCAGAAGCAGGAATTTAATAACCAAGCAA



CCAATGTAATTAAATTTTCAGATCCTCAATGGAACCATTTTCAGGGTCCC



AGTTGTCAAGGTTTGGAACCGTATAATAAAGTGACATATAAACCTTATCA



GTCTTCTGCCCAGAATAATGGCTCACCAGCCCAGGAGATAACAGGAAGT



GACACATTCGAAGGCCTGCAACAACAATTTTTAGGAGCTAATGAGAACTC



TGCAGAAAATATGCAGATTATTCAACTTCAGGTTCTTAACAAAGCAAAAG



AGAGACAACTGGAGAACTTAATTGAAAAGTTAAATGAAAGTGAACGTCA



AATTCGATATCTGAATCACCAGCTTGTAATAATAAAAGATGAAAAGGATG



GTTTGACTCTCAGCCTTCGAGAATCACAGAAACTCTTTCAGAATGGAAAA



GAAAGAGAGATACAGCTTGAAGCTCAAATAAAAGCACTGGAGACTCAGA



TACAAGCATTAAAAGTCAATGAAGAACAGATGATCAAGAAGTCCAGAAC



AACTGAAATGGCTCTGGAAAGCTTGAAGCAGCAGCTGGTGGACCTTCATC



ATTCTGAATCACTTCAACGAGCTAGAGAACAGCATGAGAGCATTGTTATG



GGCCTCACAAAGAAGTACGAAGAGCAAGTATTGTCCTTACAAAAGAATT



TGGATGCCACAGTCACCGCACTTAAAGAACAGGAAGACATTTGCTCTCGT



CTGAAAGATCACGTGAAACAACTGGAAAGGAATCAAGAAGCAATCAAGT



TAGAAAAGACTGAGATCATTAATAAGTTGACAAGAAGTCTAGAGGAGAG



TCAAAAGCAGTGTGCCCACTTGTTGCAGTCCGGGTCAGTACAAGAGGTGG



CTCAGCTACAGTTCCAGCTGCAGCAAGCACAGAAGGCACATGCTATGAGT



GCAAACATGAACAAGGCTTTGCAAGAAGAATTAACAGAACTAAAAGATG



AAATTTCTCTCTATGAATCTGCTGCAAAACTAGGAATACATCCAAGTGAC



TCAGAAGGAGAATTAAATATAGAACTCACTGAATCGTATGTGGATTTGGG



TATTAAAAAGGTCAACTGGAAAAAATCCAAAGTTACCAGCATTGTACAA



GAAGAAGACCCAAATGAAGAGCTTTCAAAAGATGAGTTCATTCTGAAGT



TAAAGGCAGAAGTACAGCGTTTGCTGGGTAGCAACTCAATGAAGCGTCA



TCTGGTGTCTCAGTTACAAAATGACCTCAAAGACTGTCATAAGAAAATTG



AAGATCTCCACCAAGTGAAGAAGGATGAAAAAAGCATTGAGGTTGAGAC



TAAAACAGATACCTCAGAAAAACCAAAGAATCAATTATGGCCTGAGTCTT



CTACTTCTGATGTTGTCAGAGATGATATTCTGCTGCTTAAAAATGAAATTC



AAGTTTTACAACAACAAAATCAGGAACTTAAAGAAACTGAAGGAAAACT



GAGAAATACAAATCAAGACTTATGTAATCAAATGAGACAAATGGTACAA



GATTTTGACCATGACAAACAAGAAGCTGTGGATAGGTGTGAAAGGACTT



ATCAGCAGCACCATGAAGCCATGAAAACTCAAATACGTGAAAGCCTATT



AGCAAAGCATGCTTTGGAGAAGCAGCAGCTCTTTGAGGCTTATGAGAGA



ACTCATTTGCAACTGAGGTCTGAGTTGGATAAGTTGAATAAGGAGGTGAC



TGCTGTGCAGGAATGTTACCTAGAAGTGTGCAGAGAGAAGGATAATCTA



GAATTGACTCTCAGGAAGACCACTGAAAAGGAGCAACAGACTCAGGAGA



AGATTTTTTACCCAGAAACTACAGATATCTATGATCGAAAGAACATGCCA



AGATGTATCTACTGTATCCATGCACTCAGTTTGTACCTGTTCAAGCTAGGC



CTGGCCCCTCAGATTCAAGACCTATATGGAAAGGTTGACTTCACAGAAGA



AGAAATCAACAACATGAAGACTGAGTTGGAGAAGTATGGCATCCAGATG



CCTGCCTTTAGCAAGATTGGGGGCATCTTGGCTAATGAACTGTCAGTGGA



TGAAGCCGCATTACATGCTGCTGTTATTGCTATTAATGAAGCTATTGACC



GTAGAATTCCAGCCGACACATTTGCAGCTTTGAAAAATCCGAATGCCATG



CTTGTAAATCTTGAAGAGCCCTTGGCATCCACTTACCAGGATATACTTTAC



CAGGCTAAGCAGGACAAAATGACAAATGCTAAAAACAGGACAGAAAACT



CAGAGAGAGAAAGAGATGTTTATGAGGAGCTGCTCACGCAAGCTGAAAT



TCAAGGCAATATAAACAAAGTCAATACATTTTCTGCATTAGCAAATATCG



ACCTGGCTTTAGAACAAGGAGATGCACTGGCCTTGTTCAGGGCTCTGCAG



TCACCAGCCCTGGGGCTTCGAGGACTGCAGCAACAGAATAGCGACTGGT



ACTTGAAGCAGCTCCTGAGTGATAAACAGCAGAAGAGACAGAGTGGTCA



GACTGACCCCCTGCAGAAGGAGGAGCTGCAGTCTGGAGTGGATGCTGCA



AACAGTGCTGCCCAGCAATATCAGAGAAGATTGGCAGCAGTAGCACTGA



TTAATGCTGCAATCCAGAAGGGTGTTGCTGAGAAGACTGTTTTGGAACTG



ATGAATCCCGAAGCCCAGCTGCCCCAGGTGTATCCATTTGCCGCCGATCT



CTATCAGAAGGAGCTGGCTACCCTGCAGCGACAAAGTCCTGAACATAATC



TCACCCACCCAGAGCTCTCTGTCGCAGTGGAGATGTTGTCATCGGTGGCC



CTGATCAACAGGGCATTGGAATCAGGAGATGTGAATACAGTGTGGAAGC



AATTGAGCAGTTCAGTTACTGGTCTTACCAATATTGAGGAAGAAAACTGT



CAGAGGTATCTCGATGAGTTGATGAAACTGAAGGCTCAGGCACATGCAG



AGAATAATGAATTCATTACATGGAATGATATCCAAGCTTGCGTGGACCAT



GTGAACCTGGTGGTGCAAGAGGAACATGAGAGGATTTTAGCCATTGGTTT



AATTAATGAAGCCCTGGATGAAGGTGATGCCCAAAAGACTCTGCAGGCC



CTACAGATTCCTGCAGCTAAACTTGAGGGAGTCCTTGCAGAAGTGGCCCA



GCATTACCAAGACACGCTGATTAGAGCGAAGAGAGAGAAAGCCCAGGAA



ATCCAGGATGAGTCAGCTGTGTTATGGTTGGATGAAATTCAAGGTGGAAT



CTGGCAGTCCAACAAAGACACCCAAGAAGCACAGAAGTTTGCCTTAGGA



ATCTTTGCCATTAATGAGGCAGTAGAAAGTGGTGATGTTGGCAAAACACT



GAGTGCCCTTCGCTCCCCTGATGTTGGCTTGTATGGAGTCATCCCTGAGTG



TGGTGAAACTTACCACAGTGATCTTGCTGAAGCCAAGAAGAAAAAACTG



GCAGTAGGAGATAATAACAGCAAGTGGGTGAAGCACTGGGTAAAAGGTG



GATATTATTATTACCACAATCTGGAGACCCAGGAAGGAGGATGGGATGA



ACCTCCAAATTTTGTGCAAAATTCTATGCAGCTTTCTCGGGAGGAGATCC



AGAGTTCTATCTCTGGGGTGACTGCCGCATATAACCGAGAACAGCTGTGG



CTGGCCAATGAAGGCCTGATCACCAGGCTGCAGGCTCGCTGCCGTGGATA



CTTAGTTCGACAGGAATTCCGATCCAGGATGAATTTCCTGAAGAAACAAA



TCCCTGCCATCACCTGCATTCAGTCACAGTGGAGAGGATACAAGCAGAAG



AAGGCATATCAAGATCGGTTAGCTTACCTGCGCTCCCACAAAGATGAAGT



TGTAAAGATTCAGTCCCTGGCAAGGATGCACCAAGCTCGAAAGCGCTATC



GAGATCGCCTGCAGTACTTCCGGGACCATATAAATGACATTATCAAAATC



CAGGCTTTTATTCGGGCAAACAAAGCTCGGGATGACTACAAGACTCTCAT



CAATGCTGAGGATCCTCCTATGGTTGTGGTCCGAAAATTTGTCCACCTGCT



GGACCAAAGTGACCAGGATTTTCAGGAGGAGCTTGACCTTATGAAGATG



CGGGAAGAGGTTATCACCCTCATTCGTTCTAACCAGCAGCTGGAGAATGA



CCTCAATCTCATGGATATCAAAATTGGACTGCTAGTGAAAAATAAGATTA



CGTTGCAGGATGTGGTTTCCCACAGTAAAAAACTTACCAAAAAAAATAA



GGAACAGTTGTCTGATATGATGATGATAAATAAACAGAAGGGAGGTCTC



AAGGCTTTGAGCAAGGAGAAGAGAGAGAAGTTGGAAGCTTACCAGCACC



TGTTTTATTTATTGCAAACCAATCCCACCTATCTGGCCAAGCTCATTTTTC



AGATGCCCCAGAACAAGTCCACCAAGTTCATGGACTCTGTAATCTTCACA



CTCTACAACTACGCGTCCAACCAGCGAGAGGAGTACCTGCTCCTGCGGCT



CTTTAAGACAGCACTCCAAGAGGAAATCAAGTCGAAGGTAGATCAGATT



CAAGAGATTGTGACAGGAAATCCTACGGTTATTAAAATGGTTGTAAGTTT



CAACCGTGGTGCCCGTGGCCAGAATGCCCTGAGACAGATCTTGGCCCCAG



TCGTGAAGGAAATTATGGATGACAAATCTCTCAACATCAAAACTGACCCT



GTGGATATTTACAAATCTTGGGTTAATCAGATGGAGTCTCAGACAGGAGA



GGCAAGCAAACTGCCCTATGATGTGACCCCTGAGCAGGCGCTAGCTCATG



AAGAAGTGAAGACACGGCTAGACAGCTCCATCAGGAACATGCGGGCTGT



GACAGACAAGTTTCTCTCAGCCATTGTCAGCTCTGTGGACAAAATCCCTT



ATGGGATGCGCTTCATTGCCAAAGTGCTGAAGGACTCGTTGCATGAGAAG



TTCCCTGATGCTGGTGAGGATGAGCTGCTGAAGATTATTGGTAACTTGCT



TTATTATCGATACATGAATCCAGCCATTGTTGCTCCTGATGCCTTTGACAT



CATTGACCTGTCAGCAGGAGGCCAGCTTACCACAGACCAACGCCGAAAT



CTGGGCTCCATTGCAAAAATGCTTCAGCATGCTGCTTCCAATAAGATGTT



TCTGGGAGATAATGCCCACTTAAGCATCATTAATGAATATCTTTCCCAGT



CCTACCAGAAATTCAGACGGTTTTTCCAAACTGCTTGTGATGTCCCAGAG



CTTCAGGATAAATTTAATGTGGATGAGTACTCTGATTTAGTAACCCTCAC



CAAACCAGTAATCTACATTTCCATTGGTGAAATCATCAACACCCACACTC



TCCTGTTGGATCACCAGGATGCCATTGCTCCGGAGCACAATGATCCAATC



CACGAACTGCTGGACGACCTCGGCGAGGTGCCCACCATCGAGTCCCTGAT



AGGGGAAAGCTCTGGCAATTTAAATGACCCAAATAAGGAGGCACTGGCT



AAGACGGAAGTGTCTCTCACCCTGACCAACAAGTTCGACGTGCCTGGAGA



TGAGAATGCAGAAATGGATGCTCGAACCATCTTACTGAATACAAAACGTT



TAATTGTGGATGTCATCCGGTTCCAGCCAGGAGAGACCTTGACTGAAATC



CTAGAAACACCAGCCACCAGTGAACAGGAAGCAGAACATCAGAGAGCCA



TGCAGAGACGTGCTATCCGTGATGCCAAAACACCTGACAAGATGAAAAA



GTCAAAATCTGTAAAGGAAGACAGCAACCTCACTCTTCAAGAGAAGAAA



GAGAAGATCCAGACAGGTTTAAAGAAGCTAACAGAGCTTGGAACCGTGG



ACCCAAAGAACAAATACCAGGAACTGATCAACGACATTGCCAGGGATAT



TCGGAATCAGCGGAGGTACCGACAGAGGAGAAAGGCCGAACTAGTGAAA



CTGCAACAGACATACGCTGCTCTGAACTCTAAGGCCACCTTTTATGGGGA



GCAGGTGGATTACTATAAAAGCTATATCAAAACCTGCTTGGATAACTTAG



CCAGCAAGGGCAAAGTCTCCAAAAAGCCTAGGGAAATGAAAGGAAAGA



AAAGCAAAAAGATTTCTCTGAAATATACAGCAGCAAGACTACATGAAAA



AGGAGTTCTTCTGGAAATTGAGGACCTGCAAGTGAATCAGTTTAAAAATG



TTATATTTGAAATCAGTCCAACAGAAGAAGTTGGAGACTTCGAAGTGAAA



GCCAAATTCATGGGAGTTCAAATGGAGACTTTTATGTTACATTATCAGGA



CCTGCTGCAGCTACAGTATGAAGGAGTTGCAGTCATGAAATTATTTGATA



GAGCTAAAGTAAATGTCAACCTCCTGATCTTCCTTCTCAACAAAAAGTTC



TACGGGAAGTAA





SLC19A2
ATGGATGTGCCCGGCCCGGTGTCTCGGCGGGCGGCGGCGGCGGCGGCCA
SEQ ID


->
CTGTGCTCCTGCGGACCGCTCGGGTCCGTCGCGAATGCTGGTTCTTGCCG
NO: 61


NAA50
ACCGCGCTGCTCTGCGCCTACGGCTTCTTCGCCAGCCTCAGGCCGTCCGA



GCCCTTCCTGACCCCGTACCTGCTGGGGCCGGACAAGAACCTGACCGAGA



GGGAGTAGCCGGATCGAGCTGGGAGATGTGACACCACACAATATTAAAC



AGTTGAAAAGATTGAATCAGGTCATCTTTCCAGTCAGCTACAATGACAAG



TTCTACAAGGATGTGCTGGAGGTTGGCGAGCTAGCAAAACTTGCCTATTT



CAATGATATTGCTGTAGGTGCAGTATGCTGTAGGGTGGATCATTCACAGA



ATCAGAAGAGACTTTACATCATGACACTAGGATGTCTGGCACCTTACCGA



AGGCTAGGAATAGGAACTAAAATGTTAAATCATGTCTTAAACATCTGTGA



AAAAGATGGTACTTTTGACAACATTTATCTGCATGTCCAGATCAGCAATG



AGTCGGCAATTGACTTCTACAGGAAGTTTGGCTTTGAGATTATTGAGACA



AAGAAGAACTACTATAAGAGGATAGAGCCCGCAGATGCTCATGTGCTGC



AGAAAAACCTCAAAGTTCCTTCTGGTCAGAATGCAGATGTGCAAAAGAC



AGACAACTGA





SFXN1->
ACAGGCGCGCGCGAGGACGCGCTCCGGGGACGCGCGAGGACGCCGTGGC
SEQ ID


CAMK4
GGGAGAAGCGTTTCCGGTGGCGGCGGAGGCTGCACTGAGCGGGACCTGC
NO: 62



GAGCAGCGCGGGCGGCAGCCCGGGGGAAGCGTATCTACATGAAAATGGG



ATTGTCCATCGTGATCTCAAACCAGAGAATCTTCTTTATGCAACTCCAGCC



CCAGATGCACCACTCAAAATCGCTGATTTTGGACTCTCTAAAATTGTGGA



ACATCAAGTGCTCATGAAGACAGTATGTGGAACCCCAGGGTACTGCGCA



CCTGAAATTCTTAGAGGTTGTGCCTATGGACCTGAGGTGGACATGTGGTC



TGTAGGAATAATCACCTACATCTTACTTTGTGGATTTGAACCATTCTATGA



TGAAAGAGGCGATCAGTTCATGTTCAGGAGAATTCTGAATTGTGAATATT



ACTTTATCTCCCCCTGGTGGGATGAAGTATCTCTAAATGCCAAGGACTTG



GTCAGAAAATTAATTGTTTTGGATCCAAAGAAACGGCTGACTACATTTCA



AGCTCTCCAGCATCCGTGGGTCACAGGTAAAGCAGCCAATTTTGTACACA



TGGATACCGCTCAAAAGAAGCTCCAAGAATTCAATGCCCGGCGTAAGCTT



AAGGCAGCGGTGAAGGCTGTGGTGGCCTCTTCGCGCCTGGGAAGTGCCA



GCAGCAGCCATGGCAGCATCCAGGAGAGCCACAAGGCTAGCCGAGACCC



TTCTCCAATCCAAGATGGCAACGAGGACATGAAAGCTATTCCAGAAGGA



GAGAAAATTCAAGGCGATGGGGCCCAAGCCGCAGTTAAGGGGGCACAGG



CTGAGCTGATGAAGGTGCAAGCCTTAGAGAAAGTTAAAGGTGCAGATAT



AAATGCTGAAGAGGCCCCCAAAATGGTGCCCAAGGCAGTGGAGGATGGG



ATAAAGGTGGCTGACCTGGAACTAGAGGAGGGCCTAGCAGAGGAGAAGC



TGAAGACTGTGGAGGAGGCAGCAGCTCCCAGAGAAGGGCAAGGAAGCTC



TGCTGTGGGTTTTGAAGTTCCACAGCAAGATGTGATCCTGCCAGAGTACT



AAACAGCTTCCTTCAGATCTGGAAGCCAAACACCGGCATTTTATGTACTT



TGTCCTTCAGCAAGAAAGGTGTGGAAGCATGATATGTACTATAGTGATTC



TGTTTTTGAGGTGCAAAAAACATACATATATACCAGTTGGTAATTCTAAC



TTCAATGCATGTGACTGCTTTATGAAAATAATAGTGTCTTCTATGGCATGT



AATGGATACCTAATACCGATGAGTTAAATCTTGCAAGTTAACACAACGTA



ACACTTAAAAGCATACATTTTCAGCAACCAGTGGCACATATTTGAAGTGA



ATAGTAGCAAATTGTTTTTGCTTTGAAAATCTAGCCATCCTACATCCTTTG



GATTTCTTCACAAGGCAGTAATTCCTTTGAACTACTGCTTAGCTAATACTA



GGTAGTGCTAAAAGACATGTTCCCATAACTTTTACAACATTTTACTTTTTA



TCATTGATGTGTTCAAACTGTTTACAAGGAGATGCTTATAGATGATAGTT



GTACATATGTGCAAAAAAAAATCCACTTGCAATGGTAAGAAATTGAAGT



ATCCTTAAAGGCCATGAAGCCATATGTCCCTAAA





CREB1->
ATGACCATGGAATCTGGAGCCGAGAACCAGCAGAGTGGAGATGCAGCTG
SEQ ID


TMEM131
TAACAGAAGCTGAAAACCAACAAATGACAGTTCAAGCCCAGCCACAGAT
NO: 63



TGCCACATTAGCCCAGGTATCTATGCCAGCAGCTCATGCAACATCATCTG



CTCCCACCGTAACTCTAGTACAGCTGCCCAATGGGCAGACAGTTCAAGTC



CATGGAGTCATTCAGGCGGCCCAGCCATCAGTTATTCAGTCTCCACAAGT



CCAAACAGTTCAGATTTCAACTATTGCAGAAAGTGAAGATTCACAGGAGT



CAGTGGATAGTGTAACTGATTCCCAAAAGCGAAGGGAAATTCTTTCAAGG



AGGCCTTCCTACAGGAAAATTTTGAATGACTTATCTTCTGATGCACCAGG



AGTGCCAAGGATTGAAGAAGAGAAGTCTGAAGAGGAGACTTCAGCACCT



GCCATCACCACTGTAACGGTGCCAACTCCAATTTACCAAACTAGCAGTGG



ACAGTATACATTCGTTCAGTCAGAGAGCATAATAGAAGTACTGCGTTTTG



ATGATGGAGGGCTACTACAGACCGAGACAACACTTGGACTCAGTTCATAT



CAGCAGAAAAGTATATCTCTCTACCGGGGGAATTGCAGGCCCATACGATT



TGAGCCACCAATGCTGGATTTCCATGAACAACCAGTTGGAATGCCAAAAA



TGGAAAAAGTCTACTTACATAATCCTAGTTCTGAAGAAACGATTACTTTA



GTATCAATATCTGCTACAACATCACATTTTCATGCATCATTTTTTCAAAAT



AGGAAAATTCTTCCAGGAGGAAATACATCATTTGATGTAGTTTTTCTTGC



AAGAGTAGTAGGAAATGTAGAAAATACTTTATTTATTAATACATCTAATC



ATGGGGTATTTACTTACCAGGTATTTGGTGTTGGAGTTCCAAATCCATATC



GATTGAGGCCGTTCCTTGGGGCCAGAGTCCCTGTGAATAGCAGTTTCTCA



CCTATAATAAACATCCACAATCCTCACAGTGAGCCTTTACAGGTTGTAGA



AATGTACTCTAGTGGAGGAGACCTTCACCTAGAACTCCCAACGGGTCAAC



AAGGAGGTACCAGAAAACTGTGGGAAATTCCTCCTTATGAAACCAAGGG



AGTGATGAGAGCCAGTTTTTCATCTAGAGAAGCAGATAATCACACAGCCT



TCATAAGAATAAAGACTAATGCTTCAGACAGCACAGAGTTTATCATTCTT



CCTGTTGAGGTTGAAGTTACAACAGCTCCTGGAATTTATTCCTCAACTGA



AATGTTAGATTTTGGTACACTAAGAACACAAGATCTACCAAAAGTTTTAA



ACCTTCATTTATTAAATTCAGGAACAAAAGATGTACCAATAACAAGTGTT



CGACCTACACCACAAAATGATGCTATAACGGTACACTTTAAACCAATTAC



ATTAAAAGCATCAGAAAGTAAATACACCAAGGTTGCAAGCATTAGTTTTG



ATGCATCGAAGGCAAAAAAGCCATCTCAGTTTTCTGGGAAAATAACAGTT



AAAGCAAAGGAAAAGAGTTATTCTAAACTTGAAATACCATATCAAGCAG



AAGTTTTAGATGGTTATTTGGGATTTGATCATGCTGCAACATTATTTCACA



TCCGAGACAGCCCTGCTGATCCTGTGGAAAGGCCAATTTACCTTACTAAC



ACTTTCAGTTTTGCGATCCTCATTCACGATGTGTTGCTACCAGAAGAAGCC



AAAACAATGTTTAAAGTTCACAACTTCAGCAAACCAGTCTTAATTCTTCC



TAATGAATCAGGATACATTTTTACCCTGCTTTTTATGCCTTCCACATCATC



CATGCACATTGATAACAACATTTTACTTATTACCAATGCTTCTAAATTTCA



TTTACCCGTGCGGGTATACACAGGCTTTTTAGATTACTTTGTATTGCCCCC



CAAAATAGAGGAACGTTTCATAGATTTTGGAGTACTGAGTGCTACAGAAG



CAAGTAATATTTTATTTGCAATTATAAACAGCAATCCAATTGAGTTGGCT



ATAAAAAGTTGGCATATCATAGGAGACGGTTTATCAATAGAACTTGTAGC



TGTGGAAAGAGGCAATAGAACTACAATAATTTCAAGCCTGCCAGAGTTTG



AAAAATCCTCTTTATCAGATCAATCATCGGTAACATTAGCTTCAGGCTATT



TTGCAGTCTTCAGAGTCAAACTTACTGCAAAAAAATTAGAGGGGATTCAT



GATGGAGCCATCCAGATCACAACAGACTATGAGATCCTGACAATCCCTGT



GAAGGCTGTGATTGCAGTAGGCTCACTGACCTGCTTCCCTAAGCACGTGG



TTCTTCCACCTTCCTTTCCAGGGAAAATAGTTCATCAAAGTTTAAATATTA



TGAATTCCTTCTCACAGAAGGTAAAAATACAGCAAATACGATCTTTGTCA



GAAGATGTGCGATTTTACTATAAACGATTACGGGGCAATAAGGAAGACTT



GGAGCCAGGAAAAAAATCAAAGATTGCAAACATTTATTTTGATCCTGGAC



TACAGTGTGGGGATCATTGCTATGTTGGCTTGCCTTTTCTATCCAAATCTG



AACCCAAAGTGCAGCCTGGTGTAGCCATGCAGGAAGATATGTGGGATGC



TGACTGGGATTTGCATCAAAGCCTGTTCAAGGGATGGACAGGAATAAAG



GAAAATTCAGGTCATAGATTGAGTGCTATATTTGAAGTAAATACAGACCT



TCAAAAAAATATAATATCAAAAATCACTGCTGAGCTCTCCTGGCCTTCCA



TACTTAGCTCACCCCGGCACTTGAAATTTCCACTTACTAATACAAACTGCT



CCTCAGAAGAAGAGATTACTTTAGAAAATCCTGCAGATGTTCCTGTCTAT



GTTCAGTTTATTCCTCTGGCTTTATATTCCAACCCTTCAGTGTTTGTAGAT



AAGTTAGTATCAAGGTTTAACTTGAGTAAGGTGGCAAAGATAGATTTGAG



AACACTAGAATTTCAAGTCTTCAGAAACAGTGCTCATCCACTGCAGAGTT



CAACAGGATTTATGGAGGGCCTCTCTCGACATTTAATTTTAAACCTAATTT



TAAAACCTGGAGAAAAGAAATCTGTCAAAGTAAAGTTTACTCCAGTTCAC



AACAGAACTGTTTCTTCACTTATCATAGTCAGAAATAACCTGACTGTGAT



GGATGCTGTGATGGTCCAAGGACAAGGAACAACTGAGAACTTGAGGGTG



GCAGGCAAGCTTCCAGGTCCAGGAAGCTCCTTACGCTTTAAAATCACGGA



AGCATTGTTAAAAGATTGTACAGATAGTTTAAAACTAAGAGAACCAAATT



TCACATTGAAAAGAACATTTAAGGTAGAGAATACAGGACAACTTCAAAT



TCACATAGAAACCATTGAAATCAGTGGATACTCATGTGAAGGATATGGCT



TTAAAGTTGTTAATTGTCAAGAGTTTACTCTAAGTGCCAATGCTTCTAGAG



ATATAATCATATTGTTTACTCCTGATTTTACAGCTTCTAGAGTTATTCGGG



AACTGAAGTTTATAACAACCAGTGGCTCTGAGTTTGTATTTATATTGAAT



GCATCCCTTCCTTACCATATGTTAGCAACCTGTGCAGAAGCCCTACCCAG



ACCTAACTGGGAACTGGCTCTGTATATCATCATCTCAGGAATAATGAGTG



CACTGTTTCTTTTGGTCATTGGAACAGCCTATTTGGAAGCTCAAGGAATAT



GGGAGCCATTTCGAAGGCGGCTATCCTTTGAGGCCTCGAACCCGCCCTTC



GATGTGGGAAGGCCATTTGATCTCAGGAGAATCGTTGGTATTTCATCTGA



AGGAAACTTGAACACACTCAGCTGTGACCCCGGTCACAGTAGGGGGTTCT



GTGGAGCAGGCGGTTCATCATCCCGACCCAGTGCCGGGAGTCATAAGCA



GTGTGGCCCATCGGTCCACCCACACAGCAGTCACAGCAATAGAAACTCA



GCTGACGTGGAAAACGTCAGAGCCAAAAACAGTTCAAGTACCTCTAGTA



GGACTTCTGCTCAAGCAGCTTCTTCACAGTCTGCTAACAAAACAAGCCCC



CTTGTCTTAGATTCGAACACAGTGACTCAAGGTCATACAGCGGGCAGAAA



GTCCAAAGGGGCAAAGCAGAGCCAGCACGGCAGCCAGCACCATGCCCAC



AGCCCGCTGGAGCAGCACCCTCAGCCTCCTCTGCCACCGCCAGTGCCTCA



GCCCCAGGAGCCGCAGCCTGAAAGGCTGTCTCCCGCCCCCCTCGCACACC



CTTCCCACCCAGAACGTGCCAGCAGCGCGAGGCACAGTTCCGAGGACTC



GGACATCACCAGTCTCATAGAAGCCATGGACAAAGACTTCGACCACCAT



GACTCCCCAGCCCTAGAAGTGTTTACAGAGCAGCCTCCATCGCCATTGCC



AAAAAGCAAAGGGAAAGGAAAACCTCTTCAGCGCAAGGTGAAACCACCT



AAGAAGCAAGAGGAAAAGGAGAAGAAGGGAAAGGGAAAGCCACAGGA



AGATGAGCTGAAGGACTCTTTGGCTGATGATGATAGCTCCTCCACCACCA



CAGAGACCTCCAACCCTGACACAGAACCGCTCCTCAAGGAGGATACAGA



AAAGCAAAAGGGAAAACAAGCCATGCCTGAAAAACATGAAAGTGAAAT



GTCTCAAGTGAAGCAAAAAAGCAAAAAACTCTTAAATATTAAGAAAGAA



ATCCCAACAGATGTGAAACCCAGTTCATTAGAACTACCATATACTCCCCC



TTTGGAAAGTAAGCAACGTAGAAATCTCCCAAGCAAGATTCCTCTTCCAA



CTGCAATGACAAGTGGATCCAAATCACGAAATGCCCAGAAAACAAAAGG



TACAAGTAAGTTAGTGGATAACAGACCACCTGCCCTAGCAAAATTCCTCC



CGAATAGTCAAGAATTAGGCAACACCAGTAGCTCAGAGGGTGAAAAAGA



CTCTCCTCCACCGGAGTGGGATTCCGTTCCAGTTCACAAACCTGGCAGCT



CTACTGATAGTCTTTATAAACTTTCTCTGCAAACCCTCAACGCAGACATTT



TCTTAAAACAACGCCAGACCTCACCGACACCTGCTTCCCCGTCTCCCCCA



GCTGCCCCCTGCCCCTTTGTGGCCCGGGGCAGCTACAGCAGCATCGTCAA



CAGCAGCTCCAGCAGTGACCCTAAAATAAAACAGCCAAATGGAAGCAAA



CACAAGTTGACAAAGGCAGCCTCGCTCCCGGGCAAGAACGGCAACCCCA



CTTTTGCTGCAGTCACGGCTGGCTACGACAAGAGCCCAGGTGGGAATGGC



TTTGCTAAAGTTTCTTCAAACAAAACAGGTTTCTCCAGCAGCCTTGGCATT



TCACACGCTCCTGTTGACAGCGATGGCTCAGACAGCTCGGGTTTGTGGAG



TCCCGTCAGCAACCCAAGCAGCCCTGACTTCACTCCCCTCAATTCGTTCTC



CGCCTTTGGAAACTCTTTTAATCTAACTGGTGAAGTTTTCAGCAAACTCGG



ATTATCTCGATCGTGCAATCAGGCCTCACAGAGGAGCTGGAACGAGTTTA



ATAGTGGCCCTTCATACCTTTGGGAGTCGCCAGCGACAGATCCCAGTCCT



TCCTGGCCAGCCAGTTCCGGCTCCCCGACCCACACAGCCACATCGGTCCT



CGGTAACACCAGCGGCCTGTGGTCCACCACTCCATTCAGCAGCTCCATTT



GGTCCAGCAACCTTAGCAGCGCCCTTCCCTTCACCACTCCAGCAAACACG



CTGGCAAGCATCGGCCTCATGGGCACAGAAAACTCCCCTGCTCCTCACGC



TCCCTCCACCTCCAGTCCAGCTGACGACTTGGGACAGACCTACAACCCGT



GGCGGATATGGAGCCCCACGATTGGAAGAAGAAGCTCGGACCCTTGGTC



TAATTCGCACTTTCCTCACGAGAATTAA





ESR1->
ATGACCATGACCCTCCACACCAAAGCATCTGGGATGGCCCTACTGCATCA
SEQ ID


C6orf211
GATCCAAGGGAACGAGCTGGAGCCCCTGAACCGTCCGCAGCTCAAGATC
NO: 64



CCCCTGGAGCGGCCCCTGGGCGAGGTGTACCTGGACAGCAGCAAGCCCG



CCGTGTACAACTACCCCGAGGGCGCCGCCTACGAGTTCAACGCCGCGGCC



GCCGCCAACGCGCAGGTCTACGGTCAGACCGGCCTCCCCTACGGCCCCGG



GTCTGAGGCTGCGGCGTTCGGCTCCAACGGCCTGGGGGGTTTCCCCCCAC



TCAACAGCGTGTCTCCGAGCCCGCTGATGCTACTGCACCCGCCGCCGCAG



CTGTCGCCTTTCCTGCAGCCCCACGGCCAGCAGGTGCCCTACTACCTGGA



GAACGAGCCCAGCGGCTACACGGTGCGCGAGGCCGGCCCGCCGGCATTC



TACAGTCCACCAATCGATTACTTTGATGTATTTAAAGAATCAAAAGAGCA



AAATTTCTATGGGTCACAGGAATCCATCATTGCTTTATGTACTCACCTGCA



ACAATTGATAAGAACTATTGAAGACCTAGATGAAAATCAGCTGAAAGAT



GAGTTTTTTAAACTTCTGCAGATTTCACTGTGGGGAAATAAGTGTGATCT



GTCTCTCTCAGGTGGAGAAAGTAGTTCTCAGAATACCAATGTACTAAATT



CATTGGAAGACCTAAAACCTTTCATTTTATTGAATGATATGGAACATCTTT



GGTCATTGCTTAGCAATTGCAAGAAAACAAGAGAAAAAGCTTCTGCTACT



AGAGTGTATATTGTTCTCGATAATTCTGGATTTGAGCTTGTTACAGATTTA



ATATTAGCCGACTTCTTGTTGTCCTCTGAACTGGCTACTGAGGTTCATTTT



TATGGAAAAACAATTCCATGGTTTGTTTCTGATACTACTATACATGATTTT



AATTGGTTAATTGAACAGGTAAAACACAGTAATCATAAGTGGATGTCCAA



GTGTGGGGCTGACTGGGAAGAGTATATTAAAATGGGTAAATGGGTTTACC



ACAATCATATATTTTGGACTCTGCCTCATGAGTACTGTGCAATGCCTCAG



GTTGCACCTGACTTATATGCTGAACTACAGAAGGCACATTTAATTTTATTC



AAGGGTGATTTGAATTACAGGAAGTTGACAGGTGACAGAAAATGGGAGT



TTTCTGTTCCATTTCATCAGGCTCTGAATGGCTTCCATCCTGCACCACTCT



GTACCATAAGAACATTAAAAGCTGAAATTCAGGTTGGTCTGCAGCCTGGG



CAAGGGGAACAGCTCCTGGCCTCTGAGCCCAGCTGGTGGACCACTGGAA



AATATGGAATATTTCAGTACGATGGTCCCCTTTGA





UTP23->
ATGAAGATCACAAGGCAGAAACATGCCAAGAAGCATCTTGGCTTCTTCCG
SEQ ID


RAD21
CAACAACTTCGGAGTCCGCGAGCCGTACCAGATCCTGCTGGACGGCACCT
NO: 65



TCTGTCAGGCGGCGCTGCGGGGCCGCATCCAGCTGCGGGAGCAGCTGCCC



CGCTACCTCATGGGGGAGACGCAGCTGTGCACCACAAGATGTTCTACGCA



CATTTTGTTCTCAGTAAAAGAGGGCCTCTGGCCAAAATTTGGCTAGCGGC



CCATTGGGATAAGAAGCTAACCAAAGCCCATGTGTTCGAGTGTAATTTAG



AGAGCAGCGTGGAGAGTATCATCTCACCAAAGGTGAAAATGGCATTACG



GACATCAGGACATCTCTTACTGGGAGTAGTTCGAATCTATCACAGGAAAG



CCAAATACCTTCTTGCAGACTGTAATGAAGCATTCATTAAGATAAAGATG



GCTTTTCGGCCAGGTGTGGTTGACCTGCCTGAGGAAAATCGGGAAGCAGC



TTATAATGCCATTACTTTACCTGAAGAATTTCATGACTTTGATCAGCCACT



GCCTGACTTAGATGACATCGATGTGGCCCAGCAGTTCAGCTTGAATCAGA



GTAGAGTGGAAGAGATAACCATGAGAGAAGAAGTTGGGAACATCAGTAT



TTTACAAGAAAATGATTTTGGTGATTTTGGAATGGATGATCGTGAGATAA



TGAGAGAAGGCAGTGCTTTTGAGGATGACGACATGTTAGTAAGCACTACT



ACTTCTAACCTCCTATTAGAGTCTGAACAGAGCACCAGCAATCTGAATGA



GAAAATTAACCATTTAGAATATGAAGATCAATATAAGGATGATAATTTTG



GAGAAGGAAATGATGGTGGAATATTAGATGACAAACTTATTAGTAATAA



TGATGGCGGTATCTTTGATGATCCCCCTGCCCTCTCTGAGGCAGGGGTGA



TGTTGCCAGAGCAGCCTGCACATGACGATATGGATGAGGATGATAATGTA



TCAATGGGTGGGCCTGATAGTCCTGATTCAGTGGATCCCGTTGAACCAAT



GCCAACCATGACTGATCAAACAACACTTGTTCCAAATGAGGAAGAAGCA



TTTGCATTGGAGCCTATTGATATAACTGTTAAAGAAACAAAAGCCAAGAG



GAAGAGGAAGCTAATTGTTGACAGTGTCAAAGAGTTGGATAGCAAGACA



ATTAGAGCCCAACTTAGTGATTATTCAGATATTGTTACTACTTTGGATCTG



GCACCGCCCACCAAGAAATTGATGATGTGGAAAGAGACAGGAGGAGTAG



AAAAACTGTTTTCTTTACCTGCTCAGCCTTTGTGGAATAACAGACTACTGA



AGCTCTTTACACGCTGTCTTACACCGCTTGTACCAGAAGACCTTAGAAAA



AGGAGGAAAGGAGGAGAGGCAGATAATTTGGATGAATTCCTCAAAGAAT



TTGAAAATCCAGAGGTTCCTAGAGAGGACCAGCAACAGCAGCATCAGCA



GCGTGATGTTATCGATGAGCCCATTATTGAAGAGCCAAGCCGCCTCCAGG



AGTCAGTGATGGAGGCCAGCAGAACAAACATAGATGAGTCAGCTATGCC



TCCACCACCACCTCAGGGAGTTAAGCGAAAAGCTGGACAAATTGACCCA



GAGCCTGTGATGCCTCCTCAGCAGGTAGAGCAGATGGAAATACCACCTGT



AGAGCTTCCCCCAGAAGAACCTCCAAATATCTGTCAGCTAATACCAGAGT



TAGAACTTCTGCCAGAAAAAGAGAAGGAGAAAGAGAAGGAAAAAGAAG



ATGATGAAGAGGAAGAGGATGAAGATGCATCAGGGGGCGATCAAGATCA



GGAAGAAAGAAGATGGAACAAAAGGACTCAGCAGATGCTTCATGGTCTT



CAGCGTGCTCTTGCTAAAACTGGAGCTGAATCTATCAGTTTGCTTGAGTT



ATGTCGAAATACGAACAGAAAACAAGCTGCCGCAAAGTTCTACAGCTTCT



TGGTTCTTAAAAAGCAGCAAGCTATTGAGCTGACACAGGAAGAACCGTA



CAGTGACATCATCGCAACACCTGGACCAAGGTTCCATATTATATAA





KLHDC4
ATGGGCAAGAAGGGCAAGAAGGAGAAGAAGGGCCGCGGCGCGGAGAAG
SEQ ID


->
ACGGCCGCCAAGATGGAGAAGAAGGTGTCTAAGCGCTCGCGGAAGGAGG
NO: 66


LRPAP1
AGGAAGACCTGGAAGCGCTCATAGCCCATTTCCAGACACTCGATGCCAA



GAGGACTCAGACTGTGGAACTTCCGTGCCCCCCACCCTCACCAAGGTTAA



ATGCCTCCCTCTCGGTTCATCCTGAGAAAGATGAGTTAATCCTTTTTGGAG



GTGAATATTTCAACGGCCAAAAAACTTTTTTGTATAACGAGCTCTATGTCT



ACAATACCAGAAAGGACACCTGGACCAAAGTTGACATCCCCAGTCCACC



TCCGAGGCGCTGTGCTCACCAGGCGGTGGTAGTGCCTCAAGGTGGCGGAC



AGCTGTGGGTCTTTGGAGGGGAGTTTGCCTCTCCCAACGGAGAGCAGTTC



TACCACTACAAGGATCTCTGGGTCCTGCATTTGGCCACCAAGACCTGGGA



ACAAGTCAAATCAACAGGCGGTCCTTCGGGTCGGAGTGGACATCGGATG



GTGGCCTGGAAGAGACAATTGATCCTGTTTGGTGGCTTCCATGAAAGTAC



ACGGGATTACATCTACTACAACGACGTGTATGCCTTTAATCTGGACACCT



TCACATGGAGCAAGCTGTCCCCGTCAGGGACGGGGCCCACACCCAGATC



AGGCTGCCAGATGTCCGTCACTCCCCAGGGCGGCATCGTCGTCTATGGGG



GCTACTCGAAACAGCTGCATCTTCCTCCCGTGAGGCTGGCCGAGCTCCAC



GCTGATCTGAAGATACAGGAGAGGGACGAACTCGCCTGGAAGAAACTAA



AGCTTGACGGCTTGGACGAAGATGGGGAGAAGGAAGCGAGACTCATACG



CAACCTCAATGTCATCTTGGCCAAGTATGGTCTGGACGGAAAGAAGGAC



GCTCGGCAGGTGACCAGCAACTCCCTCAGTGGCACCCAGGAAGACGGGC



TGGATGACCCCAGGCTGGAAAAGCTGTGGCACAAGGCGAAGACCTCTGG



GAAATTCTCCGGCGAAGAACTGGACAAGCTCTGGCGGGAGTTCCTGCATC



ACAAAGAGAAAGTTCACGAGTACAACGTCCTGCTGGAGACCCTGAGCAG



GACCGAAGAAATCCACGAGAACGTCATTAGCCCCTCGGACCTGAGCGAC



ATCAAGGGCAGCGTCCTGCACAGCAGGCACACGGAGCTGAAGGAGAAGC



TGCGCAGCATCAACCAGGGCCTGGACCGCCTGCGCAGGGTCAGCCACCA



GGGCTACAGCACTGAGGCTGAGTTCGAGGAGCCCAGGGTGATTGACCTG



TGGGACCTGGCGCAGTCCGCCAACCTCACGGACAAGGAGCTGGAGGCGT



TCCGGGAGGAGCTCAAGCACTTCGAAGCCAAAATCGAGAAGCACAACCA



CTACCAGAAGCAGCTGGAGATTGCGCACGAGAAGCTGAGGCACGCAGAG



AGCGTGGGCGACGGCGAGCGTGTGAGCCGCAGCCGCGAGAAGCACGCCC



TGCTGGAGGGGCGGACCAAGGAGCTGGGCTACACGGTGAAGAAGCATCT



GCAGGACCTGTCCGGCAGGATCTCCAGAGCTCGGCACAACGAACTCTGA





LOC729852
ACCTGAGACTACCTTTCTGCGATCACAGGATTCCCGGCGGTGACTTGACC
SEQ ID


->
CCGGAAGTGGGGTGTGAAGCTCCGGTGCTGGTGCGGCGGGGGACTGCGG
NO: 67


GLCCI1
GGCCAGCCTCAGGTAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAG



CAGCAGCAGCAGCAGCAGCAGCAGCAATGTTTCACTTCTTCAGAAAGCCT



CCGGAATCTAAAAAGCCCTCAGTACCAGAGACAGAAGCAGATGGATTCG



TCCTTTTAGGAGATACAACAGATGAGCAAAGAATGACAGCAAGAGGCAA



AACTTCGGACATAGAGGCCAACCAACCTTTGGAGCGGACAAGGCAAAAT



CTCAGCAAGTTCGGACCTCTAGTACAATAAGGCGAACCTCCTCTTTGGAT



ACAATAACAGGACCTTACCTCACAGGACAGTGGCCACGGGATCCTCATGT



TCACTACCCTTCATGCATGAAAGACAAAGCTACTCAGACACCTAGCTGTT



GGGCAGAAGAGGGTGCAGAAAAGAGGTCACATCAGCGTTCTGCGTCATG



GGGGAGTGCTGATCAACTAAAAGAGCAGATCGCCAAACTGAGGCAGCAA



CTACAACGCAGTAAACAGAGTAGTCGTCACAGTAAGGAGAAAGATCGCC



AGTCACCTCTTCATGGCAACCATATAACAATCAGTCACACTCAGGCTACT



GGATCAAGGTCAGTTCCTATGCCACTGTCAAATATATCAGTGCCAAAATC



ATCTGTTTCGCGTGTGCCCTGCAATGTAGAAGGAATAAGTCCTGAATTAG



AAAAGGTATTCATTAAAGAAAATAATGGGAAGGAAGAAGTATCCAAGCC



GTTGGACATACCAGATGGTCGAAGAGCTCCACTTCCTGCTCATTACCGGA



GCAGTAGTACTCGCAGCATTGACACTCAGACTCCTTCTGTCCAGGAGCGC



AGCAGTAGCTGCAGCAGTCATTCACCCTGTGTCTCCCCTTTTTGTCCCCCG



GAATCCCAGGATGGTAGCCCTTGCTCAACAGAAGATTTGCTCTATGATCG



TGATAAAGACAGTGGGAGTAGCTCACCGTTACCCAAGTATGCTTCATCTC



CCAAACCAAACAACAGCTACATGTTCAAACGGGAGCCCCCAGAGGGATG



TGAGCGAGTGAAGGTCTTTGAGGAAATGGCGTCTCGTCAGCCTATCTCGG



CCCCTCTCTTTTCATGTCCTGACAAAAACAAGGTTAATTTCATCCCAACCG



GATCAGCTTTCTGTCCTGTAAAACTTCTAGGCCCCCTCTTACCTGCTTCTG



ACCTTATGCTCAAGAACTCCCCTAACTCTGGCCAGAGCTCAGCTTTGGCA



ACTCTGACCGTTGAGCAGCTCTCATCCCGGGTTTCCTTTACGTCTCTTTCT



GATGACACCAGCACAGCGGGCTCCATGGAGGCCTCTGTCCAGCAGCCATC



CCAGCAGCAGCAGCTCCTGCAGGAACTGCAGGGTGAGGACCACATCTCT



GCTCAGAACTATGTGATCATCTAAAAAAGGGGGAGCTGGCCTCCACCCTA



TGTTCCATGGATTCGGAACAAGATTTCAGACATCTGCATGAGTGACAAAC



TTTCTGAACACCACCACCACCAATAATACTTATCAGCATCATAAAGTATC



TCTTAAACACTGATCTTGGCAGGGACGGAACTCCTATTCAGCAGTTTTTGT



GGAAAGCAGTAATGCTTGCAAAACGTGTGTGTCATTCAGCATTTTAAGTG



GAGACTATGCATTTCATAGTATATTTGACAGATTAGTACTGTGTCCTGTGT



TTTGTTCCAGATTCTTCAGTATAAATAAGCTCTATATCAAAAAGTTGCCTG



TCTAAATAGAAAATGTCTTGCTGTGTTTTGTCCTATGGAAAATACTGTAAT



TCAGGATTATGTTTACAATTGATCCAGGTGTTTGTTTCTAACTTCTGTAAT



ACATACAATGCAAAAAAAAAAAAAAAAAAATGGCCACAACAGTTGCACA



GTGCCCACCCTATGGCCTAGCTTCAGGTACTTCAGTTGAAGTCTAAACTC



AGGTAACTTGGAATGTATATCATATTGGGATATTAAATATTTCACAGCTA



AAAAGCTAAAGAGGGAACATCACTCTTTTGCCTTTCCTTATTTTATGCATT



TCCCTTTCCTCATTACATTCCACATTCTTAGAATAAGAAGTGCATTCAATC



CTAGGAGAATGATAATCCTGGACATGGGTGAACATGAGGAGAACCAGCA



AAATCTGTGGTGTTTGACATCACTTTGTCATGTGGTTACAAGTAAAACAA



CTGTTGCATTCACTGTTTCAACATGTGTACATGTGGCTTTTTTAAAAGTTC



AGGTGTTGCTCAGTAAAGGACTGTGACAATGTTGCAAATAAAGTGTTCAG



TACTGGACTGTACATAAACATTCCACATTGTGTGTGATGAAATTTAAAGA



CAAGAATGTCTAGAGTTAATTTCAAAATAAGTGAAGTGTTTGACGGAATG



GTTGAGATTTTTTTGTTTATGTTAGCCATCAGGGTCATAACTGTTACCATT



TTATCTAAAGACATATTTATATTTAGTTTCTCCCTTGGAAATTCTTTATTTT



GCAGGTGAAAAAGTGACATACTTTTTGTTATTGTCTTCCTCAAGCAGTTTA



GGTGCATGATCTTCATTTACATAGAATACTTGGGTCTCAGAATTGATGCA



ACATAAGCAGGTTTTTTTGGTGACTTACAAGAGCAATAGTTTGAAGCTAT



CTCATTTAAGCCTCTCATAATGCATAATCATGAGTAGTTTTGAAATTTGCA



ACCTGTGAGGTAGAGCATAAACTCAAGAAAATAGCCTTGAACTTGCAGA



CTTTTGACACAAGTTCTCCACAAAGTGTGAAGAGAGCCCCAGGCATTCCT



GATTGGTCAATGGGAGAGCCTAACTTTCATTGTTTTCTTCAGTACAAAGA



GTATCCAAAAGCTAAGTTTTTGTATTCCACTACTTTCAGTTCAATAAAACC



TAGAGTTGTTTCATCTGCGCCTAAAGTGTATGGCACAATTTTCTTAAGAAT



TAGGGGAACCAGGTGCCTACAGTTAAAGGAACGTTTCAGTTCCTTTCATT



CATTCCTGGGTTTTTCTTTTATTTTCTAAGAAGGTTGAAGAAGGATGAGTG



ATAGAGAAGAAAGCAACACCATTGATTTTTTTTTTTAAGAAATGATATAT



ATATGTATATGTTTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATTCTGT



GCATTATTTTGTCATGATCTCAATTCTCTTCTTTCCACCAAAGTTTGTCGTA



ATATTTTCTCCTGAAGGTGCATTCTGGCTCCTTTAAATTAGTCAGTGTTAT



ATTGTAGGAGACTGTCATGGAAAAAAGGACTCAGTTTACTTTCGTCATTT



TCACAGGGGAACCTTTTAAAACAATCTTTTCAGCAGCAGATACCTTTAAC



CCTAATAATCTCAGGCCTTGATGAAAATACTATATTTTGTAGATTATGGTT



AAAGGGGGAAAATTACTAGTTCCGTAAGATAAATATGAGCTCCATTTGAC



TTCTGATGTCTGGTTTAGCATTACATAATATGTTGATCTTACACTCTGCTT



TTGTCCAAATAAAATGCAATAGTATCAATATCAATTTCAGAAAAATGGAC



TGAATATGCTTTTTTGGTGATGAAATCTCATGTACGATATTTATAGTGATG



TGCTTTTATTTTCTCATGAGATACTAAATATTAATTGTGTTGTACATTTGTT



CTTAGCATATATTAAAGTTTTGAACCAAATGTGTTAAAGCTTACGCTTTGC



CATGTAAATTTCCCAGAAGTTGTTGAGCTCAAATGTATCCTACATCCAGC



TGTAGAAATTTGTCAGAAATTGTTTAAATTTTGTATATAATTGTACTGTTT



AATTCTAGCCATTGCGCTGAACAGTATTTGAGTTACCATATAATATGGCTT



TACACAAGGAAATGTGTGGCTTTTGTTTTGTATTTTTTCAGTATAGAAGTT



CCTGTGTCTTATTTAAATAAAGTTATTAGTAAAACTGAAA





ARNT2->
ATGGCAACCCCGGCGGCGGTCAACCCTCCGGAAATGGCTTCAGACATACC
SEQ ID


MESDC2
TGGATCTGTGACGTTGCCCGTTGCCCCCATGGCGGCCACCGGACAGGTGA
NO: 68



GGATGGCGGGGGCCATGCCTGCCCGTGGAGGAAAGCGGCGTTCCGGAAT



GGACTTCGATGATGAAGATGGTGAAGGCCCCAGTAAATTTTCAAGAAAG



ATGATGACATTGAAGAAGGAGATCTTCCAGAGCACAAGAGACCTTCAGC



ACCTGTCGACTTCTCAAAGATAGACCCAAGCAAGCCTGAAAGCATATTGA



AAATGACGAAAAAAGGGAAGACTCTCATGATGTTTGTCACTGTATCAGG



AAGCCCTACTGAGAAGGAGACAGAGGAAATTACGAGCCTCTGGCAGGGC



AGCCTTTTCAATGCCAACTATGACGTCCAGAGGTTCATTGTGGGATCAGA



CCGTGCTATCTTCATGCTTCGCGATGGGAGCTACGCCTGGGAGATCAAGG



ACTTTTTGGTCGGTCAAGACAGGTGTGCTGATGTAACTCTGGAGGGCCAG



GTGTACCCCGGCAAAGGAGGAGGAAGCAAAGAGAAAAATAAAACAAAG



CAAGACAAGGGCAAAAAAAAGAAGGAAGGAGATCTGAAATCTCGGTCTT



CCAAGGAAGAAAATCGAGCTGGGAATAAAAGAGAAGACCTGTGA





NAT1->
ACTTCCTCATAGACCTTGGATGTGGGAGGATTGCATTCAGTCTAGTTCCTG
SEQ ID


DDHD2
GTTGCCGGCTGAAATAACCTGTTAATGATTTTCGCAGTGTTTCCTTGAACT
NO: 69



TGCTACAGACACATTTTAAGAAAGCCCAAGAAAATCAGCAGATTGGGAG



GGTAGAATTTCTTCCAGTCAACTGGCACAGTCCTTTGCATTCTACTGGTGT



GGATGTAGATCTGCAGCGAATAACCCTGCCCAGCATTAACCGCCTCAGGC



ACTTCACCAATGACACAATTCTGGATGTCTTCTTCTACAATAGTCCCACCT



ACTGTCAGACTATTGTGGACACAGTTGCTTCTGAAATGAACCGAATATAC



ACACTTTTTCTACAGAGGAACCCTGATTTCAAAGGGGGTGTATCCATTGC



TGGTCATAGTTTAGGTTCGCTTATATTGTTTGATATCCTAACAAATCAGAA



AGATTCTTTGGGGGATATTGACAGTGAAAAGGATTCGCTAAATATTGTAA



TGGATCAAGGAGATACACCTACACTAGAGGAAGATTTGAAGAAACTTCA



GCTCTCTGAATTCTTTGATATCTTTGAGAAGGAGAAAGTAGATAAGGAAG



CTCTGGCTTTATGTACAGACCGAGATCTTCAGGAAATAGGAATTCCTTTA



GGACCAAGAAAGAAGATATTAAACTATTTCAGCACCAGAAAAAACTCAA



TGGGTATTAAGAGACCAGCCCCGCAGCCTGCTTCAGGGGCAAACATCCCC



AAAGAATCTGAGTTCTGCAGTAGCAGTAATACTAGAAATGGTGACTATCT



GGATGTTGGCATTGGGCAGGTGTCTGTGAAATACCCCCGGCTCATCTATA



AACCAGAGATATTCTTTGCCTTTGGATCTCCCATTGGAATGTTCCTTACTG



TCCGAGGACTAAAAAGAATTGATCCCAACTACAGATTTCCAACGTGCAAA



GGTTTCTTCAATATTTATCACCCTTTTGATCCTGTGGCCTATAGGATTGAA



CCAATGGTGGTCCCAGGAGTGGAATTTGAGCCAATGCTGATCCCACATCA



TAAAGGCAGGAAGCGGATGCACTTAGAACTGAGAGAGGGCTTGACCAGG



ATGAGTATGGACCTTAAGAACAACTTGCTAGGTTCGCTGCGGATGGCCTG



GAAGTCTTTTACCAGAGCTCCATACCCTGCCTTACAAGCTTCAGAAACAC



CAGAAGAAACTGAAGCAGAACCTGAATCAACTTCAGAGAAGCCTAGTGA



TGTTAACACAGAAGAGACCTCTGTGGCAGTTAAAGAAGAAGTCCTGCCTA



TCAATGTGGGGATGCTGAATGGAGGCCAACGCATTGACTATGTGCTACAG



GAGAAGCCTATTGAAAGTTTTAATGAGTATTTATTTGCTTTACAAAGCCA



TCTATGCTACTGGGAGTCTGAAGATACAGTATTGCTCGTCCTCAAAGAGA



TCTACCAAACCCAGGGTATCTTCCTTGATCAGCCTTTACAGTAAAAATGA



CCCATCTATGGCTGCTTAATACGGACATTGAGGGATCCTTCCCCAGAAAA



TCCACCTGTTTGTTGCTGCAATTTTCCTCTCCTCAGCTGCGTCATTTCCTGC



ATGTTGCCTGCCACTTACTCACCACTGGGGTCTTTGGAAGATAATCTTCCT



CTTTGGAAATGAATGGAAAAGCAAAAGGCCCTATTACTTTTAACCACTGG



CTTCATATAAACACTTGCCATTTTTTTCTGCATAGCTGGGGGTGGTTTGTG



TCTTTAATTCTTTGATGATAGTTTATAGTTGCCACACTTTATTGATTAGTA



CTTGACAGGGTGTAAAGCCTATTTTGGGTTTGATTTGTTTTGGGTGGGGTA



GACATGTTTTTAAGGAACTTATTGCTTATCTTTAGAAAATGTTCTAGTTTG



GAAACAGATTCTTGAGATTCAGAAGGCATTTTGGAGTACACTTATCTCTT



GTTTGTGTTGAACTGAAGGCTAAGTCTCAGTGGACATGGAAAAGACTTTT



GGGTGATTTATTTTTGAACCTGCATTTCTTTCTTATGTGTAGTGTATGAAG



AAAGACTAGAATGTAGCTTTAAAAAAGTGTTGTTTACTCTCTTAGAACTG



ACAGACTTATTGCCAGAAATCACTGATGTTCATTGTTTTTGCAACTGTTTG



AGCTGCTGTAAGAGTCTAAAGTTGACAAGTTAGTTCATGTTAGGTGCATC



TTTATAAAGCAAAGATGTTGTATATCCTAGGCCTCCCTTTTATATTTGATA



GAAGTTATTTGCTAATAGCTTCTATTCTTACGTTGAAAATAGTTGTAAAAG



CTGATGAACCTGAAATTGTGTAGCCTCTACAGGCTGCTGAGGTTCTAAAT



AAAACCTTTTAGTGGTGCCTTTATGGTGAAACAGAATTTGTCACCTGCCA



TTTCTACTTGAGCTAAGGTAGTATTGTGTATCCTCTTTCCTTCTTAGGTATC



CATAATCCACAAAGCATATTTAAAAGGCTCTTGGCACGGGCAGCATTGGT



TGAGCAGGTAGGTTTGGCTAGGGGGAAATGTTTAACTTGTTCTGAAAGAA



AAACTTATGTCTGTAGGGTCCAAGAAACAGCTATTCCAGAGTCAGTGTCA



GCTGAGTCTGGAACATATGAAGTGAGGTTTACTTCTAAGAACACAAGTGA



CTGCACACTAATTTTGTCAAGGCATCTTTTCACTACTTTGCTGTAGATTTT



TCTTCTTCATTGGTCAGTTTGTCATTGTCTTTGTAGTTCTCTTTATGATAAT



CCTTTATACTTGCTCTCAGATTCCACAGGCCTCTGTTTATAGAGTGGCAAA



GGCAGGCGAGCTGTGGTTTATTGTTTATAAATTTTTTTATAAATGTTATGG



TATTCAAAGCCACTGACATTTAATATTTACTGAAGCCATTCCTTAGACAG



CAGTGGTCTTTATCCCTTTCTGGAAAGAAAAGGAAAATGAAGGGTAATTA



CTGTCACCATGGAGATTGTAGAGGTAAGGTTGGGGTATAGGTCAGGCCTG



GCCTTTCTTTGTCATCTGCTTATAGTCTAGTGCTAAGTATGCCACTAAGTT



TCAGATATATGGAATACTTTATTTTTTTAAAGGTATATAAACTCTGAGTTA



TTGAGAATTAAGTATTCACTGTATATTAAGGGGAAGCTTTTGCCAAGTTG



TGGTCTTCAAATTTATGTTTACTCTTCCTATTGGCAGAATAGGTGCTATTT



AAGAGTAAACCAAAGGATAAGCAGAGGGAGTCCCTATAACCAAAGATGG



ACAGCATAGCCCTGGATAGCCAGATAAACCACTCTTTGTATTAAGAAATG



TTTCTTTCCTAGTGGTGAGGGGTGGGTAACTGTGAAAGAGCTTTATATCTT



GTCTATTCATGGTATTATAGCTGTATATTCCCAGGATGATAAGCTTGATTG



AAATCCTGTATTTAGTCATATATTATTTGCGCTGCTTCATTTGTATCATGT



GCAATCTCTAGACCAACCCTATTTTTAAACTCTGGTACAGCATCATTTTGT



ACATATTCCCAGCTGCAGAACTAGTATCACTTATCTCAGCAAAAGAGATT



GTTTGCATGGAAAGATTAATAGCACTGATTAGATTTCTAATATTTTGCATT



TTTGAAATGTTTGTTTTCTACGTGATTATATTTAAAACTTTAGTAAATACT



AACATGAAACCATAT





ZBTB34->
CGGGGACTGGCCTGGCGCCGGCGGCGGCGGAGGGGGCGCCGCGGGCGGG
SEQ ID


SCAI
CGATGTGAGCGCGGCGCTCTGGACAGGACTGAATTTGCTCTTAAAGAAAT
NO: 70



CATGTCCTCTGGAGGTGCTGAAGATGATATCCCACAGGGAGAGAGGAAA



ACAGTTACAGATTTTTGTTATCTTCTGGATAAATCTAAGCAACTGTTCAAT



GGGTTAAGAGATTTGCCACAATATGGACAGAAGCAGTGGCAGTCCTATTT



TGGAAGAACTTTTGATGTTTACACCAAACTCTGGAAGTTCCAGCAGCAGC



ATCGACAAGTCTTGGATAATCGGTATGGCTTGAAGCGCTGGCAAATAGGA



GAAATTGCTTCCAAGATTGGGCAGCTATACTATCATTATTACTTACGCAC



ATCGGAAACCAGCTATCTGAATGAGGCTTTTTCCTTCTATTCTGCAATCAG



ACAGAGATCATATTATTCTCAAGTCAATAAAGAGGACAGACCTGAATTGG



TAGTTAAGAAGTTACGATATTATGCAAGATTTATAGTAGTTTGTCTTCTTC



TCAACAAAATGGATGTTGTAAAGGATCTGGTAAAGGAATTGTCAGATGA



AATTGAAGATTATACTCACCGATTTAATACTGAAGATCAAGTGGAATGGA



ACTTGGTGCTTCAAGAAGTAGCAGCTTTCATTGAGGCGGATCCTGTAATG



GTATTAAATGATGATAATACCATTGTTATCACATCGAATCGCCTTGCTGA



AACAGGAGCCCCATTGCTGGAACAGGGCATGATTGTGGGACAGTTGTCTC



TGGCTGACGCACTCATTATTGGTAATTGTAATAATCAGGTTAAGTTCAGT



GAACTAACTGTTGACATGTTCCGGATGTTACAAGCTCTGGAAAGGGAGCC



AATGAATTTAGCTTCCCAGATGAATAAACCAGGAATGCAGGAATCAGCT



GACAAGCCTACTAGACGAGAAAACCCCCACAAGTATCTGCTCTACAAAC



CAACCTTCAGCCAGCTATATACCTTCTTAGCAGCGTCTTTTAAGGAGCTGC



CTGCCAATAGCGTGCTTCTGATTTACCTGTCGGCCACTGGCGTTTTCCCCA



CAGGTCGTTCTGATAGTGAAGGTCCTTATGATTTTGGAGGTGTACTTACTA



ATAGTAACCGGGATATTATTAATGGAGATGCCATCCACAAACGAAATCA



GTCCCACAAGGAAATGCACTGCCTTCATCCCGGGGATCTCTATCCTTTCA



CCAGGAAGCCACTGTTCATCATTGTGGATTCGTCTAATAGTGTTGCGTAT



AAGAATTTCACAAACTTGTTTGGACAGCCACTAGTCTGCTTGCTTTCTCCT



ACAGCATATCCAAAAGCTTTACAAGATCAATCTCAGCGAGGTAGCCTCTT



CACTCTCTTTTTGAACAATCCTCTAATGGCCTTCCTATTTGTCTCTGGATTG



TCAAGCATGCGCAGAGGCCTATGGGAAAAGTGTCAAGAATATCTTCGAA



AAATCAACCGTGATATTGCCCAGCTACTGACTCATTCACGTTCAATAGAT



CAGGCATTTCTCCAGTTTTTTGGAGATGAATTTCTTCGCTTGCTCCTCACA



AGATTTATCTTTTGTTCAGCCACCATGAGGATGCACAAGATTTTTCGGGA



AACACGAAATTATCCAGAATCATATCCACAACTGCCAAGGGATGAAACA



GTGGAGAATCCTCATCTCCAGAAGCACATTTTGGAATTAGCATCCATTCT



GGATGTTCGAAACGTGTTCTTTGAGAATACCATTGATGACTATTAAAACA



AAAACCCTGTTGTCGAAACAAGTTTTCATTTTCCACAAATTTTAAATGGTG



CAGTTTTCTAACGTGATAAGACACATAGTGGTGTTACTTAGTTTTTATTTT



TTAATTTAGGGCCACCATTTTAAAAACAAACAAAAAAATGTTCACACTTT



TAGGGTAACTGTTTTAAAATGCAACCTTTCAGGTCTTTTAAAATCTTAACC



TTGGAATTTTTATTTTTTGATTTTGAGGGATGGATATTTACCTCCAACTTCT



AATCCTACACTCAAATAGTCACTATTCTCACCCTGAGAAGAGTAAATCAT



TTATTTTTGTATAATGAGGTAAATCCAACTCTTATACTTGGACCTAAGTTA



AATGTCTGGATTTGGAAATATGTAATGGTTCATAATGATGAAGCTAGCCA



CCATGGACTACTGAAAATCAAGAACAGAGTCCCTCCATAATATATTTTTT



CTCATTCCAACTTAGCTGGTAGAAAAATGTTTGATCCTTTGAAACATGAT



CAAGCCAGTTTTTTGAAATCATTTAATTTCTTTCAATACTGTCATAATTTC



AGAAATTGGATTGAATTGCATCTGAAAGCTACATCTTGATTGAGGACTTG



AGGTGGTAATATTACTTGGAATGTATGAGTATTATGAGATTTATTTGCAT



ATTTTCTTTGTAGTCTGTGAATGCTGGAAATGAAAAAGGAAACAACTTTG



AAATATTTTAGTCAAAAAACCATGTCATTGGTTTCATAATACAATGTCTCC



AATAGGAATTCTATTTAAACTCTTGGTTTATGAGATCACATTTAAAGACT



GGATTGATGTCTGTGACATAAAGTTTTAATTTTTTGCCTCATTCAAATTTA



TGAGACTTCAAAGTCATAGATGTTTTAAATTTTGGTAAGGGGTAACTGGG



TGTTATGAAACCTAAAATGTAAGTTGATTTTGCATTGGTAATTTGGAAAT



AAGATTTATTAACATTCCCTGTATTTAGTATTTTATATTCATGCACAAAAT



TTTTATTATCTTAAATAGGACTCAGTGTATATTATAAAGATTTTTCTTTTTT



GTGTTTTCCTTCCCTGTCATTCCCTTTATAATTTCACAGTCACTTGCTATGA



AGTCCTCCCTCCTTTTTGATTATGCAGCATTCAATTTCAGAAATCTATGAA



TGAGAAGGCCTTATACCTAATTTTCTGTACAGGATCACATCTGGCAGTCC



CAGGAGAAGTGAGCTGATTGCTTTTGATATCTCTTTCCCATGGCCCTAGGT



AAATTATACTCTAAATTTATTTATTTTTTTTGAAACAGAGTCTTGCTTTTGT



TGCCCAGGTTGGAGTGCAGTGGGGCTATCTCGGCTCACTGCAACCTCCGC



CTCCCAGGTTCAAGCAATTTTCCTGCCTCAGCCTCCCGAGTAGCTGGGATT



ACAGGTGCCCGCCACCATGCCTGGCTAATTTTTGTATTTTTAGTAGAGAC



GGGGTTTCACCATGTTGGTCAGGCTGGTCTCAAACTCCTGACCTTGTGATC



TGCCTGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGC



ACCTGGCTATTATTTTTAACTTGGTTTTCATTTCTGGCATAGAGACTAATT



TGGGTCTAAGTTATTCCCATTCCTGCTTTGATTTGCTGATTTTTCAAACCT



GAAGTCCAAAGTTACTGTTTTTGGGGTTTGGGGACTTGGGGAGTATTTTA



CTTTTTTTTTTTTTTTTTTTTAAGAGACAGAGTCTTGCTTTGTTGCCCAGGT



TGGGGTGCAGCGGTACATTCATGGCTCACTGCAGCCTCAAATTCCTGGAT



TCAGGCAATCTTCTCACCTCAGCCTCCCAAGTAGCTGGGACTACAGGCAT



GCGCCACCATGCCTGGCTAATTTTCTTTATTTTTTAATTTTTTGTAGAGAC



ATGGTCTCACTTTGTTGAGCAGGCTGGCCTCAAACTCCTGGGCTCAAGTG



ATCCTCCTGCCTCAACCTCCCAAAGTGCTAGGATTACAGGTGTGAGCCAC



TGTACCTGGCCTGTTTACTTTTTAAAAATTTTTTTGTTTGTTCTTTCAAACA



GTATTTCAGAAAGCTTGGGAAAAGTTACCCTGGAAAGATTTCCTTTGCTA



GTGGGTTTTCATTTTCAGGTAAAGACTGCTTTCCTTTTTTTTCAGTATCTTA



ATAAACTTCATAGCTTTCCTTTAGTAACAGTGTTCATTGAACTACAGCAAT



AGACTATCTTCGGTTTTCTGACCTCTTGTTTCAATTCCCCTTAAGTCCTGTG



CCAGTTATGCTCCTACCAACACAGAGTTTCCTGTCCTCTAATGTTTAAAAT



AGTATTTACTGTTCTGAGCTTTATCCTCCCCCGTTAAGCATAAAGAACACT



GGTCTGAGAACTAGGAGACCTAAGTTCTAGTCCTGGCTTTCCTGGTAACT



TAAGCGTTATGCCCTTTGACAACTGTCTTCATGTCTCTGCTTGTTAGCACT



CTCATCTGTAAAATGAAGAGGTTGGCCCAGGTCTTCTCCAACTCTGATTCT



GTAATTTGGACTCTTGGTTCCAAAAATATTGATTTCCCCACTCCAACCACC



AAGAGAACTATTTACCCTGTTTGTAGTGTACACAACCTTTTCTTTTGTAAG



TCATATTTACCTAGATTTTGTTCAAGAAAATCTGGGTCCCACTTAGCTGTT



TTAGAAACTAGTACAGACAGAGACTCTCCTGAGGAAATTAGAGCTTTTAT



GATTAGAAACATGCTTGTCTAAAAATGAGGGTCTTAGAAATCACAACATT



GACCCTTATGATGTTGCCCCTTAAGCTAATAGTGTAATTCCTTACTGGTAG



TTAAAAATCTAAAGTGGACTGAAGTGATCTTGAATCTTCAAAGAGAGGA



AAACTGTGCTGGAAAATGTTATTGTTTCATTGATGCCTTCAAAAAAATGC



GTATTAAACAAAGACTATACTCCAGATTTTCTTCTGGGCATTGGGGAAGC



AAACAAAATAGACTTAAAAATCCTCATGTTCATGAAGCTTACATTCTAGT



AGAAGGAGTAAGGCAATAAACATAAGTAATTTATGTTAGAAAATGTTAA



GTAGGATGGAAAAGTGTAGATTATGGTAAGGTAAATCTTTTGTTTGTTTT



CTTAAGAGAAGAACAATGTTTAAAAGAGTGATCAAGCAGGGCATGGTGG



CATGACCCTATTAGTCCCAGCTACCTGGGAGGCCGAGGTGGGAGGGTCAC



TTGAGCCCAGGAGTTTGTGGCTGCAGTGAGCTGTGATTGCACCACTGCAC



TCCAGCCTGGATGGCAAAGCAAGACCTCGTCTCTGGAAAAAAAGAAAAG



AAAAAGTGATGATCAGGGTAGGCTTCATTAGGAAGGTCACATTTGCATGA



ACACTTTGCCTTTTCTTTCTTTTTATGAAAGTTTTAAACATAGTTTTTTTGT



TTTTTGTTTTTGAGACGGAGTTTCACTCTTGTTACTCAGGCTGCAGTGTGG



TGGCAGGATCTCAGCTCACTGCAACCTCCACCTTCCAGTTTCAAGTGATTC



TCCTGCCTCAGCCTCCCCAGTAGCTGGGATTACAGGCATGTGCCACCACA



CCCAGCTAATTTTGTGTTTTGAGATGGGGTTTCACCATGTTGGTCAGGCTG



GTCTCGAACTCCTGATTTGAGGTGATCCGTTGGCCTCGGCCTCCCAAAGT



GCTGGGATTACAGGCATGAGCCACCACGCCCAGCCTAAACATACAGTTTA



ATAAATCCTCCTTGTACCTTACCCAGCTTCTGTATTTCTTTAATCAAGGGA



GAAGAATTATCTAAAAGGACAGTTGCCCAATGGGCTTGAGCTGATGATA



GTGGCAAAGAATTTCATTTTTTAAAAATGCATACCACCTGCCCTTTATAA



AAAAATTTAAACCCTAATGTATTTAATTTTATAATTCATAAAAATTTAAAT



ATTGGAACTAAGAACAGAGCAAAGCAAAGGGAAAAAGCACTCCTTTGTT



TTCAAATTGCACAAACTGTCCCTGGAGATTCTGATAGTCTCTCATCTCTCT



TAACTATCACTGTAAACAACATAGAAAGCATAAGACTGCAAGAAGTCAG



TTATACACAAAACAAAATGAAGCACAGTGGAATTTTCTCTTTCTTTATCTG



TATATTTCTGAAGTTGGTAAAGCAACTATCTTCTCTATTTTACAGAAGAGA



AAGCTGAGACCCAAAAAGCTATTCATAGTGATAGAGCTAGAACTAAAAC



TCAGATCTCCTCATCTCAGCCCAGTCTTCTCAGTAGCTCAGACTGCTTTCT



ATAATGAACAAGAAAGAATAATTAGGGATTTGTATTTTGGCACAAAACA



GTATCAAATTTTTATTGTAAAAATCCACAAATCTGACATTTTATCTTTTAT



TCTAGCCAGGATGTATTCTTCATTTGCATGTGTGTGTTTCTCTTAGAAAGT



GGTGCTCTCAAACCTTTCAAAGCACAGAATTTATTGTACCCTCTTTATTTT



TACCAAATTGAAACGTTTCAGGATTTCAGAAGCAAAGTGCAATCCAATTG



TGGCTGGAGTTTTCCTGATAGAGGATCCGGGTGGTTCCCTCCCCTCCAACT



TCTGGCTACTGCCCTTTATTTCCTTTGTGCAATAGAATTCCCTGAAACAGA



TTATGTCGTAATTATACTGATTTAGTCAGGATAATGGGAAAAGCGACAAA



TCGGATTTATGTATTAGGTATTATGGAGCTTTCTTTAGCACTCTCCTCCAA



GTTTAATCTCTGACATGTGTGATATAGTAACTTTCTGTCTCTGAGAAGCAG



TTTGGACCCAGCTGAAAAATTTTATTTATGGAGACTTGTAATCAGCCTGA



TATACAGAGAACACTCTGGACTGGAAATATAGTCAGGTACCAAGGTCTGC



TAATTTGGCCATTGCCAGTGATTCAAAACCACCTCCCCACCCTGAGTAAA



AGATTTGATATGAATATGAATGCTATGAGTCTCCTGGTGATGTTTTGGGT



GTATATGTTCCTGTGTGTGTTTGGAAATCATAGAGGGTATTTACCTTGAGG



GCTCTTTTGTTGACCTTATTTATTTGAAATTCTTATTTTATTTGTAATGGCA



TATGTATATTTTTGTGATGCTACATGCAATCTCTGTGTATACTTTTTGTATA



TATGTATACATATTTGTAAATATATTGGTATGCAGTGATACAGTATGTGTA



TATTATGTACCCATACCCATTGTGGGCTGAATTTATTTAAGTTCTGTGAGG



ACATCCTAGTGGCATTGTTCATGCTGCTGTAATTAGGTTGTGGCTGTATCT



CATTCATAAGCCTCTATTTTGAGAGGCTTGTGTTTGGATAAAATGGGCTA



ATTTGGAGTTGAAATTAGGATATGAAGATCTTAACAGAAAGCTAGATATT



TATTCTTAGCTTTAAAATCACCTGCAAATTTTTATTTAAACTGATGATAAA



TTAGTGATATAACCTGAAAAACCTAAGTATGTGAAACTGAGACACCAGC



AGATTCTTAGGCTGTAGTAGTCCTTTATTTTCTTGATCCATATTTAAGTAT



TATTACTGAGTAGTAAATTATGTATATTGGGGCTTAATAGAGAGGCTGTG



GAAAAAAATTGCTTTAATCACATTTAAAAACATAGGAAAATGTCCATTAA



ATAGGGGAAATATATTCCCAGTTGTCATTTTTATGCCTCATACTGAATTTT



GATTTTGGAACAATTCTTTTTTCATCAGTGTTTTATGTGTCTCTATGAGGC



AGAAAAGGGGGCATTAGCTAAGGGATGGCCTAACCTAAGTAGGATAGGA



TTTAGTGAGGAACGGGAGGTTGTTAATTTGTATATTTGCAGCAGCTATGA



TTAATTTAGTCCTGTCTTGGCATTTCCGGCTACAGATTTTTTCTTACTGGA



CTTGCATATCAAAGGTGAGTAGGCTCATCTGAGAAAATGCTTCCACTGAA



TAAAATGCCTTCACTCGTATTGCTGTGTCTCTATGAGACCACTTGTGTTCT



GCAAGTTGATGATATATTCGCACATGAAGTAGAGTTTTTAATATTAGATG



CATGGTGTGCTATAAAAATCTGGGCCTTCAAACCTCTCCCAAGGATGATA



GGGTTGGGGAGGATAAGAGGATAGTAAATACGTGCAATGCCAATTTGTG



AACCTATAGAATGTTGCCAAGTATCAATGTTTGGTTACTTCAGTTGACAT



ATTTATTTGCATGAAAGAGTGTAGGAGATGTTTAATATACACTTAAATAC



ATTCACTGCAGAAATTAAGTAGAGGTTATTTTTGCATGATAAAATATAAC



TGCAAAGCAACTTTTTACACCGTATTCTTTGGAAGATTAAATCTATCTGAT



CCAATATTATCTTAAAATAGAGAAATTATCTCAGGAGTCTGTTTTACCAC



CCAAAAAAATGTTACATTTGCTGATTCATTATATTTTAAATAAAACAGCC



AACTATTCATAAGACTTTTCAATTATAGTATTAAATTCAGAAATAATTATA



ACAGCATTTTTCTTGAATAAATATTAAAATTTGGTAATTTATTATTTCTGA



AAAATCAGTGAAAACATCCATAGGATCTGATGGTTTGTTGTTTGAATAAC



CAAATCATTGTTTTTGCACTGGAGAGTAGTAGAGCAGATATTATACCATT



TAAAATTCATGGAACTGACTCAGGCTCCTTGTGGATATTAATTTGAGGAC



AGTAGTGGTAATCAAAAGTACAGTAGAAAACACTTGACATTGTGAATGT



GACCTCTGACCCCAACCGTTGTAAAGCCAATAACTTTGAGGGTGAATAAG



GTAACACTTGCTTAGAAAACCCAGCCCTTTCCCCCAGTCTCCTGCGCTCTT



ATACATTGATACTTTACAACTATTTTTGCTGTACCTCTCAGTCTTTCCTGG



ATAATGGGACCACTTAAAAAAAAAAGTGATTCATCTTCCTCAAACTAGAA



TTTCAAAGCCTAATACTACACTATGTTCCAGCTTCACTGAGTTTTGAACAA



AAAGACTTAAGCATGCCTTGTGTTTAAGGAAAAATAATTGATGTGTGAAT



GGAATCAATCACCTATTGACGGGAAGGGTGTCAGGATGATGTAGGGTGA



TATGCCTTTAATTGGATAAAGAGAACTGTATTAAGTATAACTAATTTAAT



AACTGATTGAAGCACCATTCTTAATTTTAAAATATTTTTGTTAATTTCTTTT



GTGGTTTTAAAAATTAATAATTTTTTTTTTTTTGAGATGGAGTTTTGCTCGT



TGCCCAGGCTGGAATGCAATGGCGCGATCTCAGCTCACTGCAACCTCCGC



CTCCTGGGTTCAAGAGATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGAT



TACAGGCGTGTGCCACCACGACCAGCTAATTTTGTATTTTTAGTAGAGAT



GGGGTTTCACCATGTTGGCCAGGTTGGTCTCGAACTCCTGACCTCAAGTG



ATCCACCCGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCAC



TGTGGCCGGCCTAAAAAAAATTCTTTTTAATAGTTATCAGACTACTTTAAC



CAGTGCCATGTTTAAACATACTTGGACAATAGACTCAGTTTTTTTAATAGT



TTGAAGTACATTATATTAAAACTACAAACTCACTTATTCAAGCCTAAATTT



CATCATAGACTATAAATTGGGTATCAATTTCCTATTTTGGTGGACAATTCC



AGTCTGTCTACTACGACAATCATCTTTTTAGGTGTAATATTGTTTAAGACA



TAGGTAATTTTAGGTTTTATACTTTTCTCCTAGGCTTATGGAATGAAAGCT



AATAGGATATCACTTTCAGAATGAAATAGGGAATTATGGCTAGCCTTCTT



ATAGGGCTGCCGTAAATGATACTTAAATGTTTGTTATATCACTGATTTTCT



TTTCTCTTCTGAAGCTGTTGTGAATGTCCACTTTGCCATGATTAAAAGTTG



AGATTTGTAAACATTACAATTTGAACCATAAATTTTCTCAAACATGAAAT



CAGTATGAATAAGTATAAAATTTGAAATGATCATACATGAGAATCCCAAA



AGTAAATACTTCCAAACCATAGAGGGATAACCACTGTATTTTTAAAAAGC



TTATTTCCACCCTTAAAGAAGCATGCTACTAATTGCTTTTTTTCTAAATTA



AAACTATGTACTGTGGTAAATTAAAAATCTGCAGGAAAAATTGCTTAGTG



TTTTAGCACAATAAGAATTTTTATTACTCAGGAGTTGGAGACCAGTCTGG



GCAATACAGTGAAACCCTGTCGCTACTAAAATACAAAAAAAAAAAAAAA



AAAAAAAAAATTAGCCAGGCATGGTGGTGTGCGCCTGTAATCCCAGCTA



CTTGGGAGGCTGAGGCAGGAGAACTGCTTGTAACCTGGGAGGCGGAGGT



TGCAGTGAGCCGAGATCGTGCCATTGCACTCCAGTCTGGGTGACAGAGCA



AGACTCATTCTCAAAAAAAAAAAAAAAAGGAATTTTTATTACTATTTCCT



GAAGAATGGTTTTTGTTAACTTGTTACTGTATCATTAAAAAGACCTTCTAA



TGGTTCAGTACAATAATCTAGAACTTGATTTATGTGGCTTTTTATAGTTAT



CTGAATGCATTCCTTTTGCCACATAGACCATATGGCTAGTTCTCCAACTTT



TTTGCTTATTTTTAATAAACCTTGCTGTTCAACAATCAGAGAAACCTTTAG



ATTTTGGATGATTCTTCCAGTTGAGGTAGAAACATCTTAGATAATAGGAA



AGGCAAATACAAAGTCCTAACATTTTCATAGTAGAGTTTACAAGTAAAAT



AACTTATCCATATAGGTTATCTTCGTTGTGTAGCACCAGTATAAATAGTG



ATTTCATTAATCATTGAATCAGATGAAGCAGTTATAAATCACTTTTTACTT



TGTGCTAAGAATTATTGTAATTTCAGGACACTTTATTATTTCCTCTGAGCA



GTTTCCATTGGAAGGTTGAGTTTCCCTTTTTTAAGTTCTAATCATCACTAA



AGGTTAAGATAATCAAATAGGAGTTAAAATAAGTTATGTTTGATCTTTTT



CCCTTGAAAATAATGCTGAACTTATTGTCTACATTCTGATTATTAGGCAGA



AATGCACTTGTTTAAATCATAGAAGTAATTCATTTGGAGGATATAATTAC



TCGATTTTCTAGTGGTGTGAAATACTTTTTAACAATTGTGCTTGTCTGTAA



CTGAAATGTTATAGAATTTTAACACTATAGGGATTATAGAGTTATATTAG



CTCTCCTCAAGAGACTGAAGCACAATATTTTTCATGTAACAATTCTTATCC



AAGTGCTGCTAATCTGTCGTGCAAATAATGAAGCTATTTGGTTGCCTATTT



AGCTATTCACAAATCACTGTAATCTTTGAAACAATCTTGTCGTTCATTTGT



ATTAATATTTGGATATTGTGAGTTAATACTTTAGAAAAAAATCCATCAAC



TCAGCCCCGTTAGCAAAACTGTTTGGATTCATAGTTTTTATATGTGTTAAC



AGTAGAATAAATTTTGAAGGGGCTATTTACTACCAATGACTAAGGGGAA



AATTATACTGTCACTATCATTTGACTTGAACATTTGTGGTATTGTAAAAGT



CTTGTCAGTTGTGTTCTAAATTGCTTAAGCCATACGTTCTCTTAAACAGGA



TGTTTTTTTCTTCCTTTCCAGCAGCCTTTTTCTTCTTTGTCTGTTATGGTTAA



TACTCCATAGATTTTAGAAATTGAGAAGTTCTTGAAACATTTTATTTTCTT



GAGTTCATCACTTTTGACTCTTGTATGAGATGTGATTTGTCATAAAAGATA



GCCTTCCACTACTTCACTAAATGAATTTCAGAGTAAACACTGTGATTCTGC



AGAGCGGATTCAGTAGGCTTTCCAATGTTTTCTCCTGCTATACAGTGCCTA



CCACCTTGAGGGCACTTCAGTACTAGAGGATGAAAACTGAAACGTTGTTT



TGATGTTTATTGAATAACGAGATTAGAGAATATTTGATTTTTGTTGTCAGT



GTATTAAAGAAATTTTCACATTGATAAATGTTCTCTAGGAATGTGTCTAC



ATTCATCAGGTGTGAACTCTTGTACATGAATTTTGTACCTTGAATCCACAT



ATATATTAAGTGTATCATCAATATAAAAATAAACATTATTTGCTTAA





ATRX->
ATGACCGCTGAGCCCATGAGTGAAAGCAAGTTGAATACATTGGTGCAGA
SEQ ID


RPS6KA6
AGCTTCATGACTTCCTTGCACACTCATCAGAAGAATCTGAAGAAACAAGT
NO: 71



TCTCCTCCACGACTTGCAATGAATCAAAACACAGATAAAATCAGTGGTTC



TGGAAGTAACTCTGATATGATGGAAAACAGCAAGGAAGAGGGAACTAGC



TCTTCAGAAAAATCCAAGTCTTCAGGATCGTCACGATCAAAGAGGAAACC



TTCAATTGTAACAAAGTATGTAGAATCAGATGATGAAAAACCTTTGGATG



ATGAAACTGTAAATGAAGATGCGTCTAATGAAAATTCAGAAAATGATATT



ACTATGCAGAGCTTGCCAAAAGGTACAGTGATTGTACAGCCAGAGCCAG



TGCTGAATGAAGACAAAGATGATTTTAAAGGGCCTGAATTTAGAAGCAG



AAGTAAAATGAAAACTGAAAATCTCAAAAAACGCGGAGAAGATGGGCTT



CATGGGATTGTGAGCTGCACTGCTTGTGGACAACAGGTCAATCATTTTCA



AAAAGATTCCATTTATAGACACCCTTCATTGCAAGTTCTTATTTGTAAGAA



TTGCTTTAAGTATTACATGAGTGATGATATTAGCCGTGACTCAGATGGAA



TGGATGAACAATGTAGGTGGTGTGCGGAAGGTGGAAACTTGATTTGTTGT



GACTTTTGCCATAATGCTTTCTGCAAGAAATGCATTCTACGCAACCTTGGT



CGAAAGGAGTTGTCCACAATAATGGATGAAAACAACCAATGGTATTGCT



ACATTTGTCACCCAGAGCCTTTGTTGGACTTGGTCACTGCATGTAACAGC



GTATTTGAGAATTTAGAACAGTTGTTGCAGCAAAATAAGAAGAAGATAA



AAGTTGACAGTGAAAAGAGTAATAAAGTATATGAACATACATCCAGATT



TTCTCCAAAGAAGACTAGTTCAAATTGTAATGGAGAAGAAAAGAAATTA



GATGATTCCTGTTCTGGCTCTGTAACCTACTCTTATTCCGCACTAATTGTG



CCCAAAGAGATGATTAAGAAGGCAAAAAAACTGATTGAGACCACAGCCA



ACATGAACTCCAGTTATGTTAAATTTTTAAAGCAGGCAACAGATAATTCA



GAAATCAGTTCTGCTACAAAATTACGTCAGCTTAAGGCTTTTAAGTCTGT



GTTGGCTGATATTAAGAAGGCTCATCTTGCATTGGAAGAAGACTTAAATT



CCGAGTTTCGAGCGATGGATGCTGTAAACAAAGAGAAAAATACCAAAGA



GCATAAAGTCATAGATGCTAAGTTTGAAACAAAAGCACGAAAAGGAGAA



AAACCTTGTGCTTTGGAAAAGAAGGATATTTCAAAGTCAGAAGCTAAACT



TTCAAGAAAACAGGTAGATAGTGAGCACATGCATCAGAATGTTCCAACA



GAGGAACAAAGAACAAATAAAAGTACCGGTGGTGAACATAAGAAATCTG



ATAGAAAAGAAGAACCTCAATATGAACCTGCCAACACTTCTGAAGATTTA



GACATGGATATTGTGTCTGTTCCTTCCTCAGTTCCAGAAGACATTTTTGAG



AATCTTGAGACTGCTATGGAAGTTCAGAGTTCAGTTGATCATCAAGGGGA



TGGCAGCAGTGGAACTGAACAAGAAGTGGAGAGTTCATCTGTAAAATTA



AATATTTCTTCAAAAGACAACAGAGGAGGTATTAAATCAAAAACTACAG



CTAAAGTAACAAAAGAATTATATGTTAAACTCACTCCTGTTTCCCTTTCTA



ATTCCCCAATTAAAGGTGCTGATTGTCAGGAAGTTCCACAAGATAAAGAT



GGCTATAAAAGTTGTGGTCTGAACCCCAAGTTAGAGAAATGTGGACTTGG



ACAGGAAAACAGTGATAATGAGCATTTGGTTGAAAATGAAGTTTCATTAC



TTTTAGAGGAATCTGATCTTCGAAGATCCCCACGTGTAAAGACTACACCC



TTGAGGCGACCGACAGAAACTAACCCTGTAACATCTAATTCAGATGAAG



AATGTAATGAAACAGTTAAGGAGAAACAAAAACTATCAGTTCCAGTGAG



AAAAAAGGATAAGCGTAATTCTTCTGACAGTGCTATAGATAATCCTAAGC



CTAATAAATTGCCAAAATCTAAGCAATCAGAGACTGTGGATCAAAATTCA



GATTCTGATGAAATGCTAGCAATCCTCAAAGAGGTGAGCAGGATGAGTC



ACAGTTCTTCTTCAGATACTGATATTAATGAAATTCATACAAACCATAAG



ACTTTGTATGATTTAAAGACTCAGGCGGGGAAAGATGATAAAGGAAAAA



GGAAACGAAAAAGTTCTACATCTGGCTCAGATTTTGATACTAAAAAGGGC



AAATCAGCTAAGAGCTCTATAATTTCTAAAAAGAAACGACAAACCCAGT



CTGAGTCTTCTAATTATGACTCAGAATTAGAAAAAGAGATAAAGAGCATG



AGTAAAATTGGTGCTGCCAGAACCACCAAAAAAAGAATTCCAAATACAA



AAGATTTTGACTCTTCTGAAGATGAGAAACACAGCAAAAAAGGAATGGA



TAATCAAGGGCACAAAAATTTGAAGACCTCACAAGAAGGATCATCTGAT



GATGCTGAAAGAAAACAAGAGAGAGAGACTTTCTCTTCAGCAGAAGGCA



CAGTTGATAAAGACACGACCATCATGGAATTAAGAGATCGACTTCCTAAG



AAGCAGCAAGCAAGTGCTTCCACTGATGGTGTCGATAAGCTTTCTGGGAA



AGAGCAGAGTTTTACTTCTTTGGAAGTTAGAAAAGTTGCTGAAACTAAAG



AAAAGAGCAAGCATCTCAAAACCAAAACATGTAAAAAAGTACAGGATGG



CTTATCTGATATTGCAGAGAAATTCCTAAAGAAAGACCAGAGCGATGAA



ACTTCTGAAGATGATAAAAAGCAGAGCAAAAAGGGAACTGAAGAAAAA



AAGAAACCTTCAGACTTTAAGAAAAAAGTAATTAAAATGGAACAACAGT



ATGAATCTTCATCTGATGGCACTGAAAAGTTACCTGAGCGAGAAGAAATT



TGTCATTTTCCTAAGGGCATAAAACAAATTAAGAATGGAACAACTGATGG



AGAAAAGAAAAGTAAAAAAATAAGAGATAAAACTTCTAAAAAGAAGGA



TGAATTATCTGATTATGCTGAGAAGTCAACAGGGAAAGGAGATAGTTGTG



ACTCTTCAGAGGATAAAAAGAGTAAGAATGGAGCATATGGTAGAGAGAA



GAAAAGGTGCAAGTTGCTTGGAAAGAGTTCAAGGAAGAGACAAGATTGT



TCATCATCTGATACTGAGAAATATTCCATGAAAGAAGATGGTTGTAACTC



TTCTGATAAGAGACTGAAAAGAATAGAATTGAGGGAAAGAAGAAATTTA



AGTTCAAAGAGAAATACTAAGGAAATACAAAGTGGCTCATCATCATCTG



ATGCTGAGGAAAGTTCTGAAGATAATAAAAAGAAGAAGCAAAGAACTTC



ATCTAAAAAGAAGGCAGTCATTGTCAAGGAGAAAAAGAGAAACTCCCTA



AGAACAAGCACTAAAAGGAAGCAAGCTGACATTACATCCTCATCTTCTTC



TGATATAGAAGATGATGATCAGAATTCTATAGGTGAGGGAAGCAGCGAT



GAACAGAAAATTAAGCCTGTGACTGAAAATTTAGTGCTGTCTTCACATAC



TGGATTTTGCCAATCTTCAGGAGATGAAGCCTTATCTAAATCAGTGCCTG



TCACAGTGGATGATGATGATGACGACAATGATCCTGAGAATAGAATTGCC



AAGAAGATGCTTTTAGAAGAAATTAAAGCCAATCTTTCCTCTGATGAGGA



TGGATCTTCAGATGATGAGCCAGAAGAAGGGAAAAAAAGAACTGGAAAA



CAAAATGAAGAAAACCCAGGAGATGAGGAAGCAAAAAATCAAGTCAATT



CTGAATCAGATTCAGATTCTGAAGAATCTAAGAAGCCAAGATACAGACA



TAGGCTTTTGCGGCACAAATTGACTGTGAGTGACGGAGAATCTGGAGAA



GAAAAAAAGACAAAGCCTAAAGAGCATAAAGAAGTCAAAGGCAGAAAC



AGAAGAAAGGTGAGCAGTGAAGATTCAGAAGATTCTGATTTTCAGGAAT



CAGGAGTTAGTGAAGAAGTTAGTGAATCCGAAGATGAACAGCGGCCCAG



AACAAGGTCTGCAAAGAAAGCAGAGTTGGAAGAAAATCAGCGGAGCTAT



AAACAGAAAAAGAAAAGGCGACGTATTAAGGTTCAAGAAGATTCATCCA



GTGAAAACAAGAGTAATTCTGAGGAAGAAGAGGAGGAAAAAGAAGAGG



AGGAGGAAGAGGAGGAGGAGGAGGAAGAGGAGGAGGAAGATGAAAAT



GATGATTCCAAGTCTCCTGGAAAAGGCAGAAAGAAAATTCGGAAGATTC



TTAAAGATGATAAACTGAGAACAGAAACACAAAATGCTCTTAAGGAAGA



GGAAGAGAGACGAAAACGTATTGCTGAGAGGGAGCGTGAGCGAGAAAA



ATTGAGAGAGGTAAATGGTCTTAAAATGGTTGATGAGCCAATGGAAGAG



GGAGAAGCAGATTCTTGTCATGATGAAGGAGTTGTTAAAGAAATCCCTAT



TACTCATCATGTTAAGGAAGGCTATGAGAAAGCAGATCCTGCACAGTTTG



AGTTGCTCAAGGTTCTTGGTCAGGGGTCATTTGGAAAGGTTTTTCTTGTTA



GAAAGAAGACCGGTCCTGATGCTGGGCAGCTCTATGCAATGAAGGTGTT



AAAAAAAGCCTCTTTAAAAGTTCGAGACAGAGTTCGGACAAAGATGGAG



AGGGATATACTGGTGGAAGTAAATCATCCATTTATTGTCAAATTGCACTA



TGCCTTTCAGACTGAAGGGAAACTGTACTTAATACTGGATTTTCTCAGGG



GAGGAGATGTTTTCACAAGATTATCCAAAGAGGTTCTGTTTACAGAGGAA



GATGTGAAATTCTACCTCGCAGAACTGGCCCTTGCTTTGGATCATCTGCA



CCAATTAGGAATTGTTTATAGAGACCTGAAGCCAGAAAACATTTTGCTTG



ATGAAATAGGACATATCAAATTAACAGATTTTGGACTCAGCAAGGAGTC



AGTAGATCAAGAAAAGAAGGCTTACTCATTTTGTGGTACAGTAGAGTATA



TGGCTCCTGAAGTAGTAAATAGGAGAGGCCATTCCCAGAGTGCTGATTGG



TGGTCATATGGTGTTCTTATGTTTGAAATGCTTACTGGTACTCTGCCATTT



CAAGGTAAAGACAGAAATGAGACCATGAATATGATATTAAAAGCAAAAC



TTGGAATGCCTCAATTTCTTAGTGCTGAAGCACAAAGTCTTCTAAGGATG



TTATTCAAAAGGAATCCAGCAAATAGATTGGGATCAGAAGGAGTTGAAG



AAATCAAAAGACATCTGTTTTTTGCAAATATTGACTGGGATAAATTATAT



AAAAGAGAAGTTCAACCTCCTTTCAAACCTGCTTCTGGAAAACCAGATGA



TACTTTTTGTTTTGATCCTGAATTTACTGCAAAAACACCTAAAGATTCTCC



CGGTTTGCCAGCCAGTGCAAATGCTCATCAGCTCTTCAAAGGATTCAGCT



TTGTTGCAACTTCTATTGCAGAAGAATATAAAATCACTCCTATCACAAGT



GCAAATGTATTACCAATTGTTCAGATAAATGGAAATGCTGCACAATTTGG



TGAAGTATATGAATTGAAGGAGGATATTGGTGTTGGCTCCTACTCTGTTT



GCAAGCGATGCATACATGCAACTACCAACATGGAATTTGCAGTGAAGAT



CATTGACAAAAGTAAGCGAGACCCTTCAGAAGAGATTGAAATATTGATG



CGCTATGGACAACATCCCAACATTATTACTTTGAAGGATGTCTTTGATGA



TGGTAGATATGTTTACCTTGTTACGGATTTAATGAAAGGAGGAGAGTTAC



TTGACCGTATTCTCAAACAAAAATGTTTCTCGGAACGGGAGGCTAGTGAT



ATACTATATGTAATAAGTAAGACAGTTGACTATCTTCATTGTCAAGGAGT



TGTTCATCGTGATCTTAAACCTAGTAATATTTTATACATGGATGAATCAGC



CAGTGCAGATTCAATCAGGATATGTGATTTTGGGTTTGCAAAACAACTTC



GAGGAGAAAATGGACTTCTCTTAACTCCATGCTACACTGCAAACTTTGTT



GCACCTGAGGTTCTTATGCAACAGGGATATGATGCTGCTTGTGATATCTG



GAGTTTAGGAGTCCTTTTTTACACAATGTTGGCTGGCTACACTCCATTTGC



TAATGGCCCCAATGATACTCCTGAAGAGATACTGCTGCGTATAGGCAATG



GAAAATTCTCTTTGAGTGGTGGAAACTGGGACAATATTTCAGACGGAGCA



AAGGATTTGCTTTCCCATATGCTTCATATGGACCCACATCAGCGGTATACT



GCTGAACAAATATTAAAGCACTCATGGATAACTCACAGAGACCAGTTGCC



AAATGATCAGCCAAAGAGAAATGATGTGTCACATGTTGTTAAGGGAGCA



ATGGTTGCAACATACTCTGCCCTGACTCACAAGACCTTTCAACCAGTCCT



AGAGCCTGTAGCTGCTTCAAGCTTAGCCCAGCGACGGAGCATGAAAAAG



CGAACATCAACTGGCCTGTAA





TEX10->
TGCGATCACGTGAGCACAGCAGGGAGGGGGAGGGGCCCTGATTTCCGGG
SEQ ID


PICALM
CGGCGGAAGGAGACGCGGCCGCGTGAGGACGAGGCTATTTGAAAACACG
NO: 72



CTCCGGGAGCTAGAGCCTGAGGTCGGCGGCGCACGCTGTTGCCCCGTGGG



CTTCTGCTCCCTCGCTTGTCTTCTCGGGCTTCTCGCCCCGGCCGCGGCCGG



GTCCTCAGACTTAATTCAGTGCACAAATGAGATGAATGTGAACATCCCAC



AGTTGGCAGACAGTTTATTTGAAAGAACTACTAATAGTAGTTGGGTGGTG



GTCTTCAAATCTCTCATTACAACTCATCATTTGATGGTGTATGGAAATGAG



CGTTTTATTCAGTATTTGGCTTCAAGAAACACGTTGTTTAACTTAAGCAAT



TTTTTGGATAAAAGTGGATTGCAAGGATATGACATGTCTACATTTATTAG



GCGGTATAGTAGATATTTAAATGAGAAAGCAGTTTCATACAGACAAGTTG



CATTTGATTTCACAAAAGTGAAGAGAGGGGCTGATGGAGTTATGAGAAC



AATGAACACAGAAAAACTCCTAAAAACTGTACCAATTATTCAGAATCAG



ATGGATGCACTTCTTGATTTTAATGTTAATAGCAATGAACTTACAAATGG



GGTAATAAATGCTGCCTTCATGCTCCTGTTCAAAGATGCCATTAGACTGTT



TGCAGCATACAATGAAGGAATTATTAATTTGTTGGAAAAATATTTTGATA



TGAAAAAGAACCAATGCAAAGAAGGTCTTGACATCTATAAGAAGTTCCT



AACTAGGATGACAAGAATCTCAGAGTTCCTCAAAGTTGCAGAGCAAGTT



GGAATTGACAGAGGTGATATACCAGACCTTTCACAGGCCCCTAGCAGTCT



TCTTGATGCTTTGGAACAACATTTAGCTTCCTTGGAAGGAAAGAAAATCA



AAGATTCTACAGCTGCAAGCAGGGCAACTACACTTTCCAATGCAGTGTCT



TCCCTGGCAAGCACTGGTCTATCTCTGACCAAAGTGGATGAAAGGGAAA



AGCAGGCAGCATTAGAGGAAGAACAGGCACGTTTGAAAGCTTTAAAGGA



ACAGCGCCTAAAAGAACTTGCAAAGAAACCTCATACCTCTTTAACAACTG



CAGCCTCTCCTGTATCCACCTCAGCAGGAGGGATAATGACTGCACCAGCC



ATTGACATATTTTCTACCCCTAGTTCTTCTAACAGCACATCAAAGCTGCCC



AATGATCTGCTTGATTTGCAGCAGCCAACTTTTCACCCATCTGTACATCCT



ATGTCAACTGCTTCTCAGGTAGCAAGTACATGGGGAGATCCTTTCTCTGC



TACTGTAGATGCTGTTGATGATGCCATTCCAAGCTTAAATCCTTTCCTCAC



AAAAAGTAGTGGTGATGTTCACCTTTCCATTTCTTCAGATGTATCTACTTT



TACTACTAGGACACCTACTCATGAAATGTTTGTTGGATTCACTCCTTCTCC



AGTTGCACAGCCACACCCTTCAGCTGGCCTTAATGTTGACTTTGAATCTGT



GTTTGGAAATAAATCTACAAATGTTATTGTAGATTCTGGGGGCTTTGATG



AACTAGGTGGACTTCTCAAACCAACAGTGGCCTCTCAGAACCAGAACCTT



CCTGTTGCCAAACTCCCACCTAGCAAGTTAGTATCTGATGACTTGGATTC



ATCTTTAGCCAACCTTGTGGGCAATCTTGGCATCGGAAATGGAACCACTA



AGAATGATGTAAATTGGAGTCAACCAGGTGAAAAGAAGTTAACTGGGGG



ATCTAACTGGCAACCAAAGGTTGCACCAACAACCGCTTGGAATGCTGCAA



CAATGGCACCCCCTGTAATGGCCTATCCTGCTACTACACCAACAGGCATG



ATAGGATATGGAATTCCTCCACAAATGGGAAGTGTTCCTGTAATGACGCA



ACCAACCTTAATATACAGCCAGCCTGTCATGAGACCTCCAAACCCCTTTG



GCCCTGTATCAGGAGCACAGATACAGTTTATGTAACTTGATGGAAGAAAA



TGGAATTACTCCAAAAAGACAAGTGCTCAAGCAGCAAAATCCTTACTTCC



AGCAAAATCCAAACTGCTGTCTCTTAAATCTCTTAAACTCTCTTCTTCCAT



TAGAATGCTACAAGTAACTCAGTGAAGGCCCATGAAGGAAATTGGGACT



AGTTTATAGGAGAACGTATCAATACAGTTTATAAAGCCAAGAATTGCTAT



GATTTAAGACTAAGATCTGTCTTTTTGGTGACTAACCCTTCAATTCTTTCA



ACTCCTGTTAATACCCATAATCAGTAACCTATCAAGAAAAGCCCTTATTT



GGAAAGTGTGAAATTTGTATTTGGAAAAGCTGCCTGGAGAGAAGAACTG



TGTCCTTTACTGTATTTCAACAGGACTCTTTTGGGGGATCAAAATTAAAAT



TCCTAATTATGCATTATCTTTCTTTTCTCCAGTCCTCACAAATACAGAAAC



AATAACTGAAATTAACTTTTCTTTTTTTAAAAAAAATTATATTCAGTTTGC



AGTAGACATTCCTTAAGTATTTGTATTTATTTATGATTATCAATTTTACAT



AACATTAATATTGTATCAGACCTCCTTATGAAAATGAGTATGGATGTGCA



CAGTATGTTTGATTTTTATCCACAAGAATGAATCTGATTCAGAATGCTTTT



CTCAGCTGACATACAGAGCACTAAATATTTTAAGGCAAGTCCATAGGTCT



GAATCTCTTAAGAATTCTCGGCCTCTGTGGGATTTAGGGAAGCATTATAA



ATGCATTAATCCTTATAGTCAATTCTGTGCCTAGGATTTTGCCAGGGAAC



AGTTCACTGACTAGGAAAAGCACTACATTTTAAATTCAGCATTAGTGCAT



TGGGAAGGATCTTTACTGCTTTGTGCTTGGCATGTCATTATTTTCCATTTG



ACATTAGGGCCTTTCCAAAATGAATGTGAGGAATTGCTTTCACTTCAAGA



CTTTCCTTCTTTTCACTAAAACTCTAGAAGGTGTTACAAGGGGGAGGGAA



GGGGGGCAAAGTCCTTGAACATTTTCTTTGGCTCGTGCCATGTTATGATC



ATATACCTTTTAAATAAGGGGAAATAGTATCTTTAAAGTTAATGTCTAGC



CAAGAGTTTAGTAAACGAAGAATTAAACTGCACTGTTGATCGGTGCTTTG



TGTAAATACATCTTTAACATTTGGGTGGAGAGGGGCCTTAAGAAGGACAG



TTCATTGTAGGAAAGCAATTCTGTACATGAGTTTAAGCATTCTTGTTGCAT



TGTCTCTGCAGATTCTATTTTTGTTTACAATATTAAAATGTATGTTAGCAA



AATGGGTGGATTTTCAAATAAAATGCAGCTTCCACAAAAGTTTTGTTATG



GTATTCTGGTCTGAGATGCATTTTCATTTTTCCTTTCTCTTTTTATTATCAA



TATTGTCATTTTTCCCTAATAAAATATACCCAGGTGATTATATTTGTTGAT



CTAATAACATGGAAGGTTTGTTTTATATGAATTTTCAAAAAGATGTCTCTT



TACACTTTTTGTTACCTTGTAGACTCTTATTGATAAATGCAACTACTTATT



AAAATTGTTCACTTTTTGTCTTTTGATCAGATGCCTTTAGTCAGGTAAGTT



TAAGGGAAAATACGCAGTTTAATGTTTTGGTACATATAATTATGTCTGCC



AAAGAAACCTTTGATTGTATCATATTGCCTATTTAGTAGTGCATAGGGTTC



AGAGTACATGATAAAGGATCAAAAGCTTTGCATTGATAAGTGTCTCATAA



TATTTGCTGTGATTGGAGAAAAAATGTAGTCGTAGCCAATAAATTTTATC



AGCTTTTAAGTTTCAGTATTATTAAACCATTTTCATATAAACTGG





FGD5->
ATGTTCAGGGGTCCGAAGCCCCCCATTGCCCCCAAGCCCAGGCTGACTGC
SEQ ID


BTC
CCCAAACGAGTGGAGAGCCAGTGTGTACCTGAATGACAGCTTGAACAAA
NO: 73



TGCAGCAACGGGCGGCTGCCCTGTGTAGACAGGGGGCTTGATGAGGGGC



CCCGGTCCATCCCAAAGTGCTCTGAGTCGGAGACCGACGAGGATTACATC



GTGGTCCCCAGGGTTCCGCTGAGGGAGGATGAACCCAAGGACGAGGGCA



GTGTGGGGAACAAAGCCCTGGTGTCTCCCGAGTCCTCTGCGGAAGAGGA



AGAGGAGCGTGAAGAGGGAGGCGAGGCATGTGGCCTGGAGGGTACAGG



AGCTGGTGAGGATTCAGTGGCCCCTGCTGCTCCGGGTGCAGGAGCGCTGA



GCAGGGAGGGTGAGGAAGGCACAGACCTTGCTCTTGAGGATGAAGGGGA



GGGCTGCGCTGATGAGCCAGGGACACTGGAGCAGGTGTCCAGAAGTGAG



GAGGAAGAGAAGCTAGTGCAGCCACACAGGGAGTGCAGCCTGGAGGAC



AGTGGGCCTTGGGCTGGAGAGGGGGTCTTCCAGAGCGACCTCCTCCTGCC



TCACATCCATGGAGAGGACCAGGAGCCCCCCGACACCCCCGGGGAGGCA



GAGGAGGATGATGAGGAAGGCTGTGCCAGCACAGACCCAGCAGGGGCA



GATGAGGGTTCGGGTCCTGACAGGCCCACGGAGGACATGGGACAGGATG



CTGAGGACACCAGTGAGGAGCCCCCTGAGAAGGAGGAGCTGGCCGGGGT



CCAGGAGGCAGAGACAGCCACAGACTGCCCTGAAGTTCTTGAGGAGGGA



TGTGAAGAGGCCACGGGTGTCACAGGTGGGGAACAGGTTGACCTCAGTG



AACCACCTGACCACGAGAAGAAAACCAACCAAGAAGTGGCAGCCGCCAC



CCTGGAGGACCATGCACAGGATGAGTCCGCCGAGGAGAGCTGCCAGATT



GTCCCTTTTGAGAATGACTGCATGGAGGACTTCGTGACTTCCCTCACAGG



AAGCCCCTATGAGTTCTTCCCAACTGAGAGCACCTCTTTTTGCAGCGAGA



GCTGTTCTCCTCTTTCTGAATCAGCGAAAGGTTTAGAATCAGAGCAGGCA



CCAAAGCTGGGGCTGCGTGCGGAGGAGAACCCCATGGTGGGGGCTTTGT



GTGGCCAGTGTGGCTCCCTACAGGGTGGAGCGGCCGAGGGTCCCGCAGC



CCCTGATGTGGTGGTCGTGCTGGAGGAGGAGGCCTTGGATGATGCACTGG



CCAACCCCTATGTGATGGGAGTGGGCCTGCCCGGTCAGGCGGCCCCTGGA



GAAGGAGGGCAGGCTGCATCGGACGCCCTGGGTGGTTATGGCTCGAAAG



AAGAATTGAACTGTGAGGCAGAGGGTGGCCTGGTTCCCGCGGACAGGAA



GAACACCAGCACGAGGGTCCGGCCCCACTCTGGGAAGGTGGCCGGCTAT



GTCCCAGAAACCGTCCCTGAAGAAACCGGACCTGAGGCGGGCTCGTCAG



CCCCTGGCATTGGAGGTGCCGCAGAGGAGGTGGGAAAGACGCTTTTGTC



ATTGGAGGGGAAGCCCTTGGAAGCCAGCAGGGCCTTGCCAGCAAAGCCC



AGGGCCTTTACTTTATACCCTCGGTCGTTCTCCGTGGAAGGCCGAGAGAT



TCCAGTGTCCGTGTACCAGGAGCCTGAGGGGTCAGGGTTGGATGACCACA



GGATAAAGAGGAAAGAGGACAATCTCTCTCTGTCGTGTGTAATTGGCTCC



TCTGGGAGTTTCTCCCAGAGAAACCACCTTCCGTCCAGCGGCACCTCCAC



GCCTTCTTCCATGGTCGACATCCCACCTCCTTTCGACCTGGCCTGCATCAC



CAAGAAGCCCATCACAAAGAGCTCTCCCTCACTCCTGATCGAGAGCGACT



CCCCGGACAAGTACAAGAAGAAGAAGTCATCCTTTAAGCGCTTCCTGGCA



CTGACGTTTAAGAAGAAGACGGAGAACAAATTGCATGTGGATGTGAACG



TGTCTTCCTCTAGGTCCTCTTCAGAGTCCAGCTACCACGGGCCTTCCAGGA



TTCTGGAAGTTGACCGGAGAAGCCTCAGCAACTCCCCTCAGCTTAAGTCT



CGGACTGGGAAGCTCCGGGCTTCTGAATCCCCCTCCTCCCTCATCTTTTAT



AGAGATGGCAAGAGGAAAGGTGTCCCCTTCAGCAGGACGGTGTCCAGAG



TGGAGTCCTTTGAAGACCGCTCCCGGCCGCCCTTCCTGCCCTTGCCACTGA



CCAAGCCACGGTCCATCTCCTTCCCCAGCGCTGACACTTCAGACTATGAG



AACATTCCAGCCATGAACTCGGACTATGAGAATATCCAGATTCCACCCCG



GAGACCTGCCAGGGCTGGCGCGTTCACGAAGCTGTTTGAAGATCAGAGC



AGAGCCCTGTCCACAGCAAACGAAAATGATGGCTACGTGGACATGAGCA



GCTTCAACGCCTTTGAGAGCAAACAGCAGAGTGCAGACCAGGACGCAGA



AAGCGCCTACACAGAGCCCTACAAAGTCTGTCCCATCTCGTCGGCAGCCC



CCAAAGAGGACCTTACGTCGGATGAAGAGCAGAGAAGCTCGGAGGAGGA



GGACAGTGCTTCAAGAGACCCCAGTGTCACCCACAAGGTGGAAGGACAG



TCCAGAGCCCTTGTCATCGCACAGGAACTGCTATCTTCAGAGAAAGCATA



CGTGGAGATGCTCCAGCACTTAAATCTGGATTTCCATGGAGCTGTCATGA



GGGCCTTGGATGACATGGACCATGAAGGCAGAGACACATTGGCCCGGGA



GGAGCTGAGGCAGGGCCTGAGTGAACTCCCAGCCATCCACGACCTTCATC



AAGGCATCCTGGAGGAGCTGGAGGAAAGGCTGTCAAATTGGGAGAGCCA



GCAGAAGGTAGCTGACGTCTTCCTGGCCCGGGAGCAGGGGTTTGATCACC



ACGCCACTCACATCCTGCAGTTCGACAGGTACCTAGGTCTGCTCAGTGAG



AATTGCCTCCACTCTCCCCGGCTGGCAGCTGCTGTCCGTGAATTTGAGCA



GAGTGTACAAGGAGGCAGCCAGACTGCGAAGCATCGGCTGCTGCGGGTG



GTTCAACGCCTCTTCCAGTACCAAGTGCTCCTCACAGACTATTTAAACAA



CCTTTGTCCGGACTCCGCCGAGTACGACAACACACAGGGTGCACTGAGCC



TCATCTCCAAAGTCACAGACCGTGCCAACGACAGCATGGAGCAAGGGGA



AAACCTGCAGAAGCTGGTCCACATTGAGCACAGCGTCCGGGGCCAAGGG



GATCTCCTCCAGCCAGGAAGGGAGTTTCTGAAGGAAGGGACGCTGATGA



AAGTAACAGGGAAAAACAGACGGCCCCGGCACCTATTTCTGATGAACGA



TGTGCTCCTGTACACCTATCCCCAGAAGGATGGGAAGTACCGGCTGAAGA



ACACATTGGCTGTGGCCAACATGAAGGTCAGCCGCCCTGTGATGGAGAA



AGTGCCCTACGCTCTAAAGATTGAGACTTCCGAGTCCTGCCTGATGCTGT



CTGCGAGCCCTCTTCGGAAACGTCGTAAAAGAAAGAAGAAAGAAGAAGA



AATGGAAACTCTGGGTAAAGATATAACTCCTATCAATGAAGATATTGAAG



AGACAAATATTGCTTAA





HHATL->
CAGCAGCTCTGCAGCACTCGGCTCTGCTCCACTCTGCTCAGCTCCGCTCCA
SEQ ID


GRB2
GGAAGGCCACCTCCTCCTCCCCCTCCTCCTCCCGCTGTCACCACTCACCGC
NO: 74



TCATAACCTCAAGGGGGTGGGGACCCCAGGGCTGGACACACCCCACCGT



GGCCCCAGAGCTCAGCCGGTCGCACGGACGGACAGTTGGAAGCCGGACC



CCAGAGCCTGAGGTGGGCAGTGTGCCAGGGTCCCTTGCGGCCTCCTCAAG



GTTTTGAACGAAGAATGTGATCAGAACTGGTACAAGGCAGAGCTTAATG



GAAAAGACGGCTTCATTCCCAAGAACTACATAGAAATGAAACCACATCC



GTGGTTTTTTGGCAAAATCCCCAGAGCCAAGGCAGAAGAAATGCTTAGCA



AACAGCGGCACGATGGGGCCTTTCTTATCCGAGAGAGTGAGAGCGCTCCT



GGGGACTTCTCCCTCTCTGTCAAGTTTGGAAACGATGTGCAGCACTTCAA



GGTGCTCCGAGATGGAGCCGGGAAGTACTTCCTCTGGGTGGTGAAGTTCA



ATTCTTTGAATGAGCTGGTGGATTATCACAGATCTACATCTGTCTCCAGA



AACCAGCAGATATTCCTGCGGGACATAGAACAGGTGCCACAGCAGCCGA



CATACGTCCAGGCCCTCTTTGACTTTGATCCCCAGGAGGATGGAGAGCTG



GGCTTCCGCCGGGGAGATTTTATCCATGTCATGGATAACTCAGACCCCAA



CTGGTGGAAAGGAGCTTGCCACGGGCAGACCGGCATGTTTCCCCGCAATT



ATGTCACCCCCGTGAACCGGAACGTCTAAGAGTCAAGAAGCAATTATTTA



AAGAAAGTGAAAAATGTAAAACACATACAAAAGAATTAAACCCACAAGC



TGCCTCTGACAGCAGCCTGTGAGGGAGTGCAGAACACCTGGCCGGGTCA



CCCTGTGACCCTCTCACTTTGGTTGGAACTTTAGGGGGTGGGAGGGGGCG



TTGGATTTAAAAATGCCAAAACTTACCTATAAATTAAGAAGAGTTTTTAT



TACAAATTTTCACTGCTGCTCCTCTTTCCCCTCCTTTGTCTTTTTTTTCATC



CTTTTTTCTCTTCTGTCCATCAGTGCATGACGTTTAAGGCCACGTATAGTC



CTAGCTGACGCCAATAATAAAAAACAAGAAACCAAGTGGGCTGGTATTC



TCTCTATGCAAAATGTCTGTTTTAGTTGGAATGACTGAAAGAAGAACAGC



TGTTCCTGTGTTCTTCGTATATACACACAAAAAGGAGCGGGCAGGGCCGC



TCGATGCCTTTGCTGTTTAGCTTCCTCCAGAGGAGGGGACTTGTAGGAAT



CTGCCTTCCAGCCCAGACCCCCAGTGTATTTTGTCCAAGTTCACAGTAGA



GTAGGGTAGAAGGAAAGCATGTCTCTGCTTCCATGGCTTCCTGAGAAAGC



CCACCTGGGCTGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGG



AGGCCAAGGTGGGCGGATCACAAGGTCAGGAGTTCGAGACCAACCTAGC



CAACATGGTGAAACCCCGTCTCTACTAAAAATAAGAAATTAGCCGGGTGT



GGCACGCACCTGTAGTCCCAGCTACTTGGGAGCCTGAGGCAGGAGAATC



GCTTGAACCTGGGAAGTGGAGGTTGAGTGAGCCGGGACCGTGCCATTGT



ACTCCAGCCTGGGTGACAGAGCGAGATTCCGTCTCAAAAAAAAAAAAAA



AAAGCCCACCTGAAAGCCTGTCTCTTTCCACTTTGTTGGCCCTTCCAGTGG



GATTATCGAGCATGTTGTTTTTTCATAGTGCCTTTTTCCTTATTTCAAGGGT



TGCTTCTGAGTGGTGTTTTTTTTTTTTTTTTAATTTGTTTTGTTTTAAAATA



AGTTAAAGGCAGTCCAGAGCTTTTCAGCCAATTTGTCTCCTACTCTGTGTA



AATATTTTTCCCTCCGGGCAGGGGAGCCAGGGTAGAGCAAAGGAGACAA



AGCAGGAGTGGAAGGTGAGGCGTTCTCCTGCTTGTACTAAGCCAGGAGG



CTTTAAGCTCCAGCTTTAAGGGTTGTGAGCCCCTTGGGGGTTCAGGGAAC



TGCTTGCCCAGGGTGCAGTGTGAGTGTGATGGGCCACCGGGGCAAGAGG



GAAGGTGACCGCCCAGCTCTCCCACATCCCACTGGATCTGGCTTACAGGG



GGGTCGGAAGCCTGTCCTCACCGTCTCGGGGGTTGTGGCCCCCGCCCCCT



CCCTATATGCACCCCTGGAACCAGCAAGTCCCAGACAAGGAGAGCGGAG



GAGGAAGTCATGGGAACGCAGCCTCCAGTTGTAGCAGGTTTCACTATTCC



TATGCTGGGGTACACAGTGAGAGTACTCACTTTTCACTTGTCTTGCTCTTA



GATTGGGCCATGGCTTTCATCCTGTGTCCCCTGACCTGTCCAGGTGAGTGT



GAGGGCAGCACTGGGAAGCTGGAGTGCTGCTTGTGCCTCCCTTCCCAGTG



GGCTGTGTTGACTGCTGCTCCCCACCCCTACCGATGGTCCCAGGAAGCAG



GGAGAGTTGGGGAAGGCAAGATTGGAAAGACAGGAAGACCAAGGCCTC



GGCAGAACTCTCTGTCTTCTCTCCACTTCTGGTCCCCTGTGGTGATGTGCC



TGTAATCTTTTTCTCCACCCAAACCCCTTCCCACGACAAAAACAAGACTG



CCTCCCTCTCTTCCGGGAGCTGGTGACAGCCTTGGGCCTTTCAGTCCCAAA



GCGGCCGATGGGAGTCTCCCTCCGACTCCAGATATGAACAGGGCCCAGG



CCTGGAGCGTTTGCTGTGCCAGGAGGCGGCAGCTCTTCTGGGCAGAGCCT



GTCCCCGCCTTCCCTCACTCTTCCTCATCCTGCTTCTCTTTTCCTCGCAGAT



GATAAAAGGAATCTGGCATTCTACACCTGGACCATTTGATTGTTTTATTTT



GGAATTGGTGTATATCATGAAGCCTTGCTGAACTAAGTTTTGTGTGTATAT



ATTTAAAAAAAAAATCAGTGTTTAAATAAAGACCTATGTACTTAATCCTT



TAACTCTGCGGATAGCATTTGGTAGGTAGTGATTAACTGTGAATAATAAA



TACACAATGAATTCTTCA





GOPC->
ATGTCGGCGGGCGGTCCATGCCCAGCAGCAGCCGGAGGGGGCCCAGGGG
SEQ ID


TRMT11
GCGCCTCCTGCTCCGTGGGGGCCCCTGGCGGGGTATCCATGTTCCGGTGG
NO: 75



CTGGAGGTGCTGGAGAAGGAGTTCGACAAAGCTTTTGTGGATGTGGATCT



GCTCCTGGGAGAGATCGATCCAGACCAAGCGGACATCACTTATGAGGGG



CGACAGAAGATGACCAGCCTGAGCTCCTGCTTTGCACAGCTTTGCCACAA



AGCCCAGTCTGTGTCTCAAATCAACCACAAGCTGGAGAATCGGGACCAGT



ATTCACATCTGCTAAGTGATCATTTTCTGCCATACCAAGGTCATAATTCCT



TCCGTGAGAAATATTTTAGTGGGGTAACAAAAAGAATTGCCAAGGAAGA



AAAATCCACCCAGGAATGA





ZDHHC21
CAGCGGCTCCCATCGCGGCTCCCGGGAGCTAAGCGAGACGGCGACGGCG
SEQ ID


->
GCAGTCGTCCCTCCCCACGCGGGCGCGCGGGCATGCGGACACCCACTCGG
NO: 76


HMGB1
CCGGTCCAGGCCCTCAGGCTCCCGGAAGCGGAAGGGGAGAGCGGCCCGG



CCTGGGCGGCGGCGCCGGAGGAGGCGGAGGTGGCGCGGCAGGAGGAGG



GGAAAGAGCTGCTGGCGGTCGGGAGAGCGGCGGCAGCGAGAGGCGAGC



CAGCGGCGACGAAAAATAACTAAACATGGGCAAAGGAGATCCTAAGAAG



CCGAGAGGCAAAATGTCATCATATGCATTTTTTGTGCAAACTTGTCGGGA



GGAGCATAAGAAGAAGCACCCAGATGCTTCAGTCAACTTCTCAGAGTTTT



CTAAGAAGTGCTCAGAGAGGTGGAAGACCATGTCTGCTAAAGAGAAAGG



AAAATTTGAAGATATGGCAAAAGCGGACAAGGCCCGTTATGAAAGAGAA



ATGAAAACCTATATCCCTCCCAAAGGGGAGACAAAAAAGAAGTTCAAGG



ATCCCAATGCACCCAAGAGGCCTCCTTCGGCCTTCTTCCTCTTCTGCTCTG



AGTATCGCCCAAAAATCAAAGGAGAACATCCTGGCCTGTCCATTGGTGAT



GTTGCGAAGAAACTGGGAGAGATGTGGAATAACACTGCTGCAGATGACA



AGCAGCCTTATGAAAAGAAGGCTGCGAAGCTGAAGGAAAAATATGAAAA



GGATATTGCTGCATATCGAGCTAAAGGAAAGCCTGATGCAGCAAAAAAG



GGAGTTGTCAAGGCTGAAAAAAGCAAGAAAAAGAAGGAAGAGGAGGAA



GATGAGGAAGATGAAGAGGATGAGGAGGAGGAGGAAGATGAAGAAGAT



GAAGATGAAGAAGAAGATGATGATGATGAATAAGTTGGTTCTAGCGCAG



TTTTTTTTTTCTTGTCTATAAAGCATTTAACCCCCCTGTACACAACTCACTC



CTTTTAAAGAAAAAAATTGAAATGTAAGGCTGTGTAAGATTTGTTTTTAA



ACTGTACAGTGTCTTTTTTTGTATAGTTAACACACTACCGAATGTGTCTTT



AGATAGCCCTGTCCTGGTGGTATTTTCAATAGCCACTAACCTTGCCTGGTA



CAGTATGGGGGTTGTAAATTGGCATGGAAATTTAAAGCAGGTTCTTGTTG



GTGCACAGCACAAATTAGTTATATATGGGGATGGTAGTTTTTTCATCTTCA



GTTGTCTCTGATGCAGCTTATACGAAATAATTGTTGTTCTGTTAACTGAAT



ACCACTCTGTAATTGCAAAAAAAAAAAAAAAGTTGCAGCTGTTTTGTTGA



CATTCTGAATGCTTCTAAGTAAATACAATTTTTTTTATTAGTATTGTTGTC



CTTTTCATAGGTCTGAAATTTTTCTTCTTGAGGGGAAGCTAGTCTTTTGCT



TTTGCCCATTTTGAATCACATGAATTATTACAGTGTTTATCCTTTCATATA



GTTAGCTAATAAAAAGCTTTTGTCTACACACCCTGCATATCATAATGGGG



GTAAAGTTAAGTTGAGATAGTTTTCATCCATAACTGAACATCCAAAATCT



TGATCAGTTAAGAAATTTCACATAGCCCACTTACATTTACAAACTGAAGA



GTAATCAATCTACTCAAAGCATGGGATTATTAGAATCAAACATTTTGAAA



GTCTGTCCTTGAAGGACTAATAGAAAAGTATGTTCTAACCTTTACATGAG



GACTCTATTCTTTAACTCCCATTACCATGTAATGGCAGTTATATTTTGCAG



TTCCCACATTAAAGAAGACCTGAGAATGTATCCCCAAAAGCGTGAGCTTA



AAATACAAGACTGCCATATTAAATTTTTTGTTGACATTAGTCTCAGTGAA



GACTATGAAAATGCTGGCTATAGATGTCTTTTCCCATTTATCTAAATATGG



ACTGCTCAGGAAACGAGACTTTCCATTACAAGTATTTTTAATTAATTGGG



CCAGCTTTTCAAACAAAGATGCCACATTCAAAATAGGGTATATTTTCCTA



TATTACGGTTTGCCCCTTTATAAATCCAAGTAGATAGGAAGAAAGAAGAC



AAACTTTGCATCTCAGTATGAATTATTCAATTTATTTGAATGATTTTTCTTT



ACAAAACAAACTCATTCATTAGTCATGTTTATCTGCTTAGGAGTTTAGGG



AACAATTTGGCAATTTTGTGGTTTTCGAGATTATCGTTTTCTTAAAGTGCC



AGTATTTTAAAATAGCGTTCTTGTAATTTTACACGCTTTTGTGATGGAGTG



CTGTTTTGTTATATAATTTAGACTTGGATTCTTTCCATTTGCATTTGTTTAT



GTAATTTCAGGAGGAATACTGAACATCTGAGTCCTGGATGATACTAATAA



ACTAATAATTGCAGAGGTTTTAAATACTAGTTAAATGGCTTTCACTTAAG



AACTTAAGATTTTGTTACATATTTTTAAATCTTGTTTCTAATAATACCTCTT



AGCAGTACCTTTTAAATAAGTATAAGGGATGGCAAAGTTTTTCCCTTTAA



AAATACTCACTTTATGCTTATAAATAGGTTAATGGGCTGATAAAAGGTTT



TGTCAAACATTGCAAGTATTCGGTGCTATATATAAAGGAGGAAAAACTAG



TTTTACTTTCAGAATGATTTAAACAAGATTTTTAAAAACAAGATACATGC



AAGCGAACAGCAGGGTTAGTGATAGGCTGCAATTGTGTCGAACATCAGA



TTTTTTGTTAAGAGGAGCAAATGACTCAATCTGATTTAGATGGAAGTTTCT



ACTGTATAGAAATCACCATTAATCACCAACATTAATAATTCTGATCCATTT



AAAATGAATTCTGGCTCAAGGAGAATTTGTAACTTTAGTAGGTACGTCAT



GACAACTACCATTTTTTTAAGATGTTGAGAATGGGAACAGTTTTTTTAGG



GTTTATTCTTGACCACAGATCTTAAGAAAATGGACAAAACCCCTCTTCAA



TCTGAAGATTAGTATGGTTTGGTGTTCTAACAGTATCCCCTAGAAGTTGG



ATGTCTAAAACTCAAGTAAATGGAAGTGGGAGGCAATTTAGATAAGTGT



AAAGCCTTGTAACTGAAGATGATTTTTTTTAGAAAGTGTATAGAAACTAT



TTTAATGCCAAGATAGTTACAGTGCTGTGGGGTTTAAAGACTTTGTTGAC



ATCAAGAAAAGACTAAATCTATAATTAATTGGGCCAACTTTTAAAATGAA



GATGCTTTTTAAAACTAATGAACTAAGATGTATAAATCTTAGTTTTTTTGT



ATTTTAAAGATAGGCATATGGCATATTGATTAACGAGTCAAATTTCCTAA



CTTTGCTGTGCAAAGGTTGAGAGCTATTGCTGATTAGTTACCACAGTTCTG



ATGATCGTCCCATCACAGTGTTGTTAATGTTTGCTGTATTTATTAATTTTCT



TAAAGTGAAATCTGAAAAATGAAATTTGTGTGTCCTGTGTACCCGAGGGG



TAATGATTAAATGATAAAGATAAGAA





SUSD3->
ATGCGCTGGGCGGCCGCCACCCTCCGTGGCAAGGCGAGGCCCCGGGGGC
SEQ ID


KIAA1429
GGGCCGGGGTCACCACGCCTGCCCCAGGGAACCGCACAGTTCTTTAAGG
NO: 77



AAAAACAGTAGTGCTCTGCATAGTTTACTGAAACGAGTGGTCAGCACATT



TAGTAAGGACACAGGAGAGCTTGCATCTTCATTTTTAGAATTTATGAGAC



AAATTCTTAACTCTGACACAATTGGATGCTGTGGAGATGATAATGGTCTC



ATGGAAGTAGAGGGAGCTCATACATCACGGACGATGAGTATTAATGCTG



CAGAGTTAAAACAGCTTCTACAAAGCAAAGAAGAAAGTCCAGAAAATTT



GTTCCTTGAACTAGAGAAGCTTGTTTTGGAACATTCAAAAGATGATGACA



ATCTGGATTCTTTGTTGGACAGTGTAGTTGGACTTAAGCAGATGCTGGAG



TCATCAGGTGACCCTTTACCTCTCAGTGACCAGGATGTAGAACCAGTACT



TTCAGCTCCAGAATCTCTTCAGAATCTGTTTAACAATAGGACTGCCTATGT



GCTTGCTGATGTCATGGATGATCAGTTGAAATCTATGTGGTTCACTCCATT



TCAGGCTGAAGAGATAGATACAGATCTGGATTTGGTAAAGGTTGACTTAA



TTGAACTCTCTGAAAAATGCTGTAGTGACTTTGATTTGCACTCAGAATTA



GAGCGCTCATTTTTGTCAGAACCATCATCTCCAGGAAGAACCAAGACTAC



TAAAGGATTCAAACTTGGGAAGCACAAGCATGAGACCTTTATAACGTCA



AGTGGAAAATCTGAATACATTGAACCTGCCAAAAGAGCTCATGTTGTGCC



ACCACCAAGAGGAAGGGGCAGGGGAGGATTTGGACAGGGTATACGACCT



CATGATATTTTTCGTCAGAGAAAACAGAACACAAGTAGACCACCATCTAT



GCATGTGGATGACTTTGTTGCTGCTGAAAGTAAAGAAGTGGTTCCTCAAG



ATGGAATACCTCCACCAAAACGGCCACTCAAAGTATCACAGAAGATTTCT



TCCCGTGGTGGGTTTTCAGGCAATAGAGGAGGACGGGGTGCTTTCCACAG



TCAGAATAGGTTTTTCACACCACCTGCTTCAAAAGGAAACTACAGTCGTC



GGGAAGGAACAAGAGGCTCCAGTTGGAGTGCTCAGAATACTCCTCGAGG



AAATTACAATGAAAGTCGTGGAGGCCAGAGCAATTTTAACAGAGGCCCT



CTTCCACCATTACGACCCCTTAGTTCTACAGGTTACCGCCCAAGTCCTCGG



GACCGTGCTTCTAGAGGTCGTGGGGGACTTGGACCTTCCTGGGCTAGTGC



AAATAGCGGCAGTGGAGGCTCAAGAGGAAAGTTTGTTAGTGGAGGCAGT



GGTAGAGGTCGTCATGTACGCTCCTTTACACGATAA





GBAS->
ATGGCGGCGCGAGTGCTGCGCGCCCGCGGAGCGGCCTGGGCCGGCGGCC
SEQ ID


PCLO
TCCTGCAGCGGGCGGCCCCCTGCAGCCTCCTGCCCAGGCTCCGATTCAAG
NO: 78



AATGGCTTTGTTTAAATTGCCAAACCCAGAGAGCAATATCAGGACAGCTT



GGAGACATACGCAAAATGCCACCTGCACCATCAGGACCCAAAGCATCTC



CTATGCCTGTTCCTACAGAATCATCATCTCAGAAAACAGCAGTGCCTCCC



CAAGTAAAATTAGTGAAAAAGCAAGAACAAGAAGTAAAAACGGAAGCT



GAAAAAGTCATTCTGGAAAAAGTAAAGGAAACACTATCAATGGAAAAAA



TTCCTCCTATGGTAACCACAGATCAAAAACAAGAAGAGAGTAAACTAGA



GAAAGACAAAGCTTCAGCTCTTCAAGAAAAAAAGCCACTCCCTGAAGAA



AAAAAACTAATCCCTGAAGAAGAAAAGATACGTTCTGAAGAAAAAAAGC



CACTCCTAGAAGAAAAAAAGCCAACCCCTGAAGACAAAAAGCTACTCCC



AGAGGCAAAAACATCAGCCCCAGAAGAACAGAAACATGACTTACTTAAA



TCTCAAGTACAAATTGCTGAAGAAAAGCTTGAAGGCAGAGTGGCTCCAA



AGACAGTGCAAGAAGGGAAACAACCACAGACCAAGATGGAAGGTTTACC



ATCTGGCACACCTCAGAGTTTACCTAAAGAAGATGATAAGACAACCAAA



ACAATAAAAGAACAGCCACAGCCACCATGCACAGCAAAACCTGATCAGG



TGGAACCTGGGAAAGAAAAAACAGAAAAGGAAGATGACAAATCAGACA



CCTCAAGTTCTCAGCAGCCTAAAAGCCCCCAAGGTCTGAGCGACACGGG



ATATTCTTCCGATGGAATATCAAGCTCACTTGGTGAAATTCCAAGTCTTAT



TCCAACTGATGAAAAGGATATTCTCAAGGGACTCAAAAAGGACTCTTTTT



CACAAGAAAGCAGCCCTTCCAGCCCCTCAGATTTGGCTAAGTTAGAAAGT



ACAGTCCTATCTATTTTGGAAGCTCAAGCAAGTACACTTGCTGATGAAAA



GTCAGAAAAGAAAACACAACCCCATGAAGTTTCTCCTGAACAGCCTAAA



GACCAAGAGAAAACTCAGAGTTTATCTGAAACCTTGGAAATTACTATTTC



AGAAGAGGAGATCAAAGAGAGTCAAGAAGAAAGGAAAGACACTTTTAA



AAAAGATAGCCAACAAGATATTCCTTCCAGCAAGGACCATAAAGAGAAG



TCTGAGTTTGTTGATGACATAACTACTAGAAGAGAGCCTTATGATTCAGT



TGAAGAGAGTAGTGAAAGTGAAAACTCACCTGTTCCACAAAGAAAACGA



AGAACTAGTGTTGGCTCATCAAGCAGTGATGAGTATAAACAGGAAGACA



GCCAAGGATCAGGGGAAGAGGAGGACTTCATTCGAAAACAAATCATAGA



AATGAGTGCTGATGAAGATGCTTCAGGTTCTGAAGATGATGAGTTCATCA



GAAACCAGCTCAAAGAGATTAGTAGCAGTACTGAGAGCCAGAAGAAGGA



AGAAACAAAGGGAAAAGGCAAAATAACAGCAGGGAAACACAGACGACT



GACTCGAAAAAGTAGCACAAGCATTGATGAAGATGCAGGAAGACGTCAC



TCATGGCATGATGAAGACGATGAAGCATTTGATGAAAGTCCTGAACTTAA



ATACAGAGAAACTAAAAGTCAGGAAAGTGAAGAACTTGTAGTTACTGGA



GGAGGAGGGCTACGCCGATTTAAAACAATTGAGCTCAACAGTACAATAG



CAGATAAATATTCTGCAGAGTCATCACAGAAAAAAACAAGTTTGTATTTT



GACGAAGAGCCAGAATTGGAAATGGAAAGCCTGACAGACTCACCTGAAG



ATAGGTCAAGGGGAGAGGGATCTTCGAGTCTGCATGCTTCCAGCTTCACT



CCTGGTACATCCCCTACATCAGTATCATCACTTGATGAGGACAGTGACAG



TAGCCCGAGTCACAAAAAAGGAGAGAGCAAACAGCAACGCAAAGCTCG



GCACAGACCACATGGCCCTCTTTTGCCTACTATTGAAGATTCTTCAGAGG



AAGAAGAATTGAGAGAGGAAGAAGAATTATTAAAGGAGCAAGAAAAGC



AGAGGGAAATAGAACAGCAACAAAGAAAGAGTTCTAGTAAAAAATCAA



AGAAAGACAAAGATGAACTTCGAGCTCAGAGAAGAAGGGAAAGGCCAA



AGACACCACCTAGTAATCTCTCTCCCATTGAAGATGCATCTCCGACAGAA



GAGTTACGTCAGGCTGCAGAAATGGAGGAGCTCCATAGATCTTCTTGTTC



TGAATATTCACCTAGCATAGAGTCAGACCCAGAAGGTTTTGAAATAAGCC



CGGAAAAAATAATAGAAGTACAAAAAGTTTATAAATTGCCCACAGCTGT



TTCATTATACTCACCAACAGATGAGCAATCTATTATGCAGAAAGAAGGTA



GCCAAAAGGCGTTAAAAAGTGCTGAGGAGATGTATGAAGAAATGATGCA



TAAAACACACAAATACAAAGCTTTTCCAGCTGCAAATGAACGAGATGAA



GTGTTTGAAAAAGAGCCTTTGTATGGTGGGATGCTAATAGAGGATTATAT



TTATGAATCTTTAGTAGAAGACACGTACAATGGATCGGTAGATGGCAGTC



TGCTAACAAGGCAAGAAGAAGAAAATGGATTTATGCAGCAGAAAGGAAG



AGAGCAAAAGATAAGACTTTCAGAACAGATTTATGAAGATCCTATGCAG



AAAATTACAGACCTCCAGAAAGAGTTTTATGAGTTAGAAAGCTTACATTC



TGTTGTGCCTCAGGAAGATATTGTTTCAAGCTCTTTTATCATCCCAGAAAG



CCATGAGATAGTGGACCTGGGTACTATGGTAACTTCTACAGAAGAAGAA



AGGAAACTACTAGATGCTGATGCTGCCTATGAAGAACTTATGAAGAGGC



AACAGATGCAATTAACACCTGGATCTAGCCCAACCCAGGCCCCCATTGGT



GAGGATATGACAGAGTCCACCATGGACTTTGACAGAATGCCAGATGCCTC



TTTGACATCAAGTGTTCTCTCAGGAGCGTCTCTTACAGATTCGACCAGCA



GTGCAACACTCTCTATCCCAGATGTTAAAATAACCCAACATTTTTCAACA



GAAGAAATTGAGGATGAATATGTAACCGATTATACAAGAGAAATTCAAG



AGATAATTGCCCATGAATCGCTGATTTTGACCTACTCGGAGCCTTCAGAA



AGTGCTACATCTGTCCCACCCTCTGACACACCTTCTCTCACATCATCTGTT



TCTTCGGTCTGTACCACAGATAGCTCTTCACCCATTACTACCCTGGATAGC



ATAACCACAGTTTATACAGAGCCAGTGGACATGATAACTAAATTTGAAGA



TTCTGAGGAAATTTCTTCATCAACTTATTTTCCAGGCAGCATTATAGACTA



TCCAGAAGAAATAAGTGTATCTTTAGATCGGACTGCCCCACCAGATGGTA



GAGCTAGTGCTGATCATATTGTTATTTCCTTATCTGATATGGCATCTTCTA



TCATAGAATCTGTAGTACCTAAACCTGAAGGGCCAGTTGCTGACACTGTT



TCTACTGACTTACTTATATCTGAAAAGGACCCAGTGAAGAAAGCCAAGAA



GGAAACTGGGAATGGAATCATTCTGGAAGTTTTGGAAGCTTACAGAGAT



AAAAAGGAGTTGGAGGCCGAACGAACAAAAAGTAGCTTATCCGAAACCG



TGTTTGATCACCCACCTTCTTCTGTAATAGCCCTTCCAATGAAAGAGCAGC



TTTCAACTACATACTTTACATCTGGAGAGACCTTTGGTCAGGAAAAACCT



GCATCTCAGTTACCATCTGGCAGTCCTTCTGTTTCCTCTCTTCCAGCTAAA



CCTCGCCCATTCTTTAGAAGTTCTTCTTTGGATATATCAGCTCAACCTCCT



CCCCCTCCTCCCCCTCCCCCTCCTCCTCCTCCTCCACCACCACCCCCTCCTC



CCCCACCACTTCCTCCACCAACTTCACCTAAACCAACTATTCTTCCTAAAA



AAAAGTTAACAGTTGCATCTCCAGTGACTACAGCTACACCTCTGTTTGAT



GCTGTTACTACTCTAGAGACCACAGCTGTTCTGAGAAGTAATGGATTACC



TGTTACAAGAATATGTACTACTGCACCTCCTCCTGTTCCTCCTAAGCCATC



TTCAATTCCATCTGGACTTGTATTTACCCACAGGCCTGAGCCAAGCAAAC



CTCCAATCGCCCCCAAACCAGTGATTCCTCAGCTTCCAACAACTACACAA



AAACCAACAGATATACACCCCAAACCAACAGGCCTATCTTTAACTTCAAG



TATGACCTTAAATTTAGTGACTTCAGCAGATTATAAATTGCCTTCCCCTAC



CTCCCCACTTTCCCCACACTCCAACAAGTCTTCACCAAGATTTTCCAAATC



CCTCACAGAAACTTATGTAGTTATTACATTGCCATCTGAACCAGGGACTC



CAACAGATTCTTCTGCTAGTCAAGCAATTACCAGTTGGCCCTTGGGATCA



CCCTCCAAAGATCTGGTTTCTGTTGAACCTGTGTTTTCTGTAGTTCCTCCT



GTGACAGCTGTAGAAATTCCAATTTCTTCAGAACAGACCTTCTACATCTCT



GGAGCTTTACAGACATTTTCTGCTACCCCTGTCACAGCACCCTCTTCATTT



CAAGCAGCTCCCACATCAGTTACACAGTTTCTCACTACTGAAGTTTCCAA



GACTGAGGTTTCAGCAACCAGAAGTACAGCTCCTAGTGTTGGTCTCAGCA



GCATTTCCATAACAATTCCTCCAGAGCCTCTTGCTCTAGATAACATACATT



TAGAGAAGCCTCAGTATAAAGAAGATGGAAAATTGCAACTTGTTGGTGA



TGTAATTGATTTGCGTACAGTACCAAAGGTAGAAGTTAAAACAACTGATA



AATGTATTGATCTTTCTGCTTCTACAATGGATGTGAAAAGGCAGATCACA



GCAAATGAAGTTTATGGGAAACAAATTAGTGCTGTCCAACCCTCTATTAT



AAATCTTAGTGTGACATCATCAATAGTGACTCCTGTATCTCTGGCCACTG



AGACAGTGACCTTTGTCACATGCACAGCTAGTGCAAGTTACACTACAGGC



ACAGAAAGCCTAGTGGGTGCAGAACATGCAATGACAACACCACTCCAAC



TTACAACATCAAAGCATGCTGAGCCCCCATACAGGATACCAAGTGACCA



GGTCTTTCCTATAGCTAGGGAAGAAGCACCAATAAACTTATCTCTAGGTA



CTCCAGCACATGCAGTGACATTGGCTATTACAAAACCTGTCACTGTGCCT



CCTGTTGGTGTCACAAATGGATGGACTGATAGCACCGTATCCCAGGGAAT



CACTGATGGGGAAGTAGTGGATCTCAGTACAACCAAGTCTCACAGAACA



GTCGTAACAATGGATGAGTCTACTTCAAGTGTGATGACCAAAATAATAGA



AGATGAAAAACCCGTTGATTTAACCGCAGGGAGAAGAGCTGTGTGCTGT



GATGTGGTTTATAAATTACCATTTGGAAGGAGCTGCACAGCACAGCAGCC



TGCAACTACTCTTCCTGAGGATCGTTTTGGTTATAGGGATGACCACTATCA



GTATGATCGATCAGGGCCATATGGTTATAGAGGGATTGGGGGAATGAAG



CCTTCCATGTCTGACACAAATTTAGCAGAAGCTGGACATTTTTTCTATAAA



AGTAAGAATGCTTTTGATTATTCTGAAGGAACTGACACAGCAGTAGATCT



GACTTCAGGGAGAGTTACTACAGGTGAGGTAATGGATTATTCAAGCAAG



ACTACAGGTCCATATCCAGAAACACGACAAGTCATTTCAGGAGCTGGGAT



TAGTACCCCACAGTATTCCACAGCAAGAATGACACCACCACCAGGACCCC



AGTATTGTGTGGGGAGTGTTTTGAGGTCATCTAATGGTGTTGTCTATTCTT



CAGTAGCAACTCCAACACCCTCTACATTTGCTATCACCACACAACCTGGC



TCCATTTTCAGCACCACAGTGAGGGATTTGTCTGGTATTCATACGGCTGAT



GCAGTGACTTCATTACCTGCCATGCACCATAGCCAGCCAATGCCTAGATC



ATATTTTATAACAACAGGTGCATCTGAAACGGACATTGCAGTAACTGGTA



TTGATATCAGTGCCAGTTTGCAAACTATTACTATGGAGTCTCTTACTGCTG



AGACGATAGACTCTGTTCCCACTTTAACCACAGCATCCGAAGTGTTTCCT



GAAGTGGTGGGAGATGAAAGTGCTCTTTTAATTGTCCCTGAAGAAGATAA



ACAACAGCAGCAGCTAGACTTGGAGCGTGAGCTCCTGGAACTGGAGAAA



ATTAAGCAACAGCGCTTTGCTGAGGAATTGGAGTGGGAACGTCAGGAAA



TTCAAAGGTTCCGAGAACAAGAAAAGATCATGGTTCAGAAAAAGTTGGA



GGAGCTGCAGTCTATGAAGCAACACCTTCTCTTTCAGCAAGAAGAAGAGC



GGCAAGCCCAGTTCATGATGAGGCAGGAGACGTTAGCTCAGCAACAGTT



ACAGCTTGAGCAGATCCAACAGCTGCAACAACAGCTTCACCAGCAGCTG



GAGGAGCAAAAGATTCGGCAGATCTACCAGTATAACTATGACCCTTCTGG



AACTGCTTCTCCACAAACCACTACAGAGCAGGCAATTTTGGAAGGTCAGT



ATGCTGCTCTGGAAGGCAGTCAATTTTGGGCAACTGAAGATGCAACCACC



ACAGCTTCAGCTGTTGTGGCAATTGAAATACCACAAAGCCAAGGATGGTA



CACCGTTCAGTCTGATGGTGTTACTCAGTACATTGCCCCACCTGGTATCCT



GAGCACTGTTTCAGAAATACCTCTAACAGATGTTGTTGTGAAAGAGGAAA



AACAACCCAAAAAGAGAAGTTCTGGAGCTAAAGTCCGAGGACAGTATGA



TGACATGGGAGAAAATATGACAGATGATCCCCGAAGTTTTAAAAAGATA



GTGGACAGTGGTGTACAAACGGATGACGAAGATGCCACAGATCGGAGCT



ATGTGAGTAGGAGAAGGAGAACTAAAAAGAGTGTGGATACAAGCGTCCA



AACTGATGATGAAGATCAGGATGAGTGGGATATGCCTACTAGATCAAGG



AGGAAAGCTCGTGTAGGGAAATATGGTGACAGCATGACAGAGGCTGACA



AGACCAAACCCCTTTCCAAAGTCTCCAGCATAGCAGTTCAAACGGTAGCA



GAGATATCTGTGCAAACTGAACCAGTTGGAACCATAAGAACACCCTCCAT



ACGGGCACGAGTGGATGCCAAGGTAGAAATAATTAAACACATTTCAGCA



CCTGAAAAGACTTACAAAGGGGGCAGTTTAGGATGTCAAACAGAAGCAG



ATTCAGACACACAAAGTCCTCAATATCTGAGTGCCACATCTCCACCCAAA



GACAAGAAACGCCCAACACCTTTAGAGATTGGTTATTCATCTCACCTCCG



GGCAGATTCCACAGTACAGCTGGCTCCTTCCCCACCCAAATCCCCCAAAG



TCCTTTACTCACCCATCTCACCACTTTCACCAGGCAAAGCCTTAGAATCAG



CCTTTGTACCTTATGAAAAACCCCTCCCTGATGATATAAGTCCACAGAAA



GTACTGCATCCAGATATGGCTAAAGTTCCCCCAGCAAGTCCTAAGACAGC



CAAGATGATGCAGCGTTCTATGTCTGACCCCAAGCCTCTGAGTCCAACAG



CAGACGAAAGTTCCAGGGCTCCTTTTCAGTATACCGAGGGCTATACGACT



AAAGGTTCTCAAACCATGACATCCTCTGGAGCCCAGAAAAAAGTTAAAA



GAACTCTGCCAAATCCACCTCCTGAGGAGATTTCCACAGGAACTCAATCC



ACATTCAGCACAATGGGCACAGTTTCCAGGAGAAGGATCTGCAGAACCA



ACACAATGGCACGAGCCAAGATTCTCCAGGACATAGACAGAGAGCTTGA



TCTTGTGGAAAGGGAGTCTGCAAAACTTCGAAAGAAACAAGCAGAGCTT



GATGAAGAAGAAAAGGAGATTGATGCTAAGCTACGATACCTGGAAATGG



GAATTAACAGGAGGAAAGAGGCCCTATTAAAGGAGAGAGAAAAGAGAG



AACGAGCCTACCTCCAGGGAGTAGCTGAGGATCGTGATTACATGTCTGAC



AGTGAAGTGAGTAGCACAAGACCAACCCGAATAGAAAGTCAGCATGGCA



TTGAGCGACCAAGAACTGCTCCCCAAACTGAATTCAGCCAGTTTATACCA



CCACAAACCCAAACAGAATCTCAACTAGTTCCTCCGACAAGTCCTTACAC



ACAATACCAGTACTCTTCCCCTGCTCTTCCTACCCAAGCACCCACCTCATA



CACTCAACAGTCTCATTTTGAGCAACAAACTTTGTACCATCAGCAAGTTT



CACCTTATCAGACTCAGCCAACATTCCAAGCTGTGGCAACAATGTCCTTC



ACACCTCAAGTTCAACCTACACCAACCCCACAGCCTTCTTATCAGTTACCT



TCACAGATGATGGTGATACAACAGAAGCCACGGCAAACTACATTATATTT



GGAGCCCAAGATAACCTCAAACTATGAAGTGATTCGCAACCAACCCCTTA



TGATAGCACCTGTTTCTACGGATAACACATTTGCTGTTTCCCATCTTGGTA



GTAAGTACAATAGTTTAGACTTGAGAATAGGTTTGGAGGAAAGAAGTAG



CATGGCAAGCAGTCCAATATCAAGCATATCTGCAGATTCTTTCTATGCAG



ATATTGATCACCATACTCCACGAAATTATGTCCTAATTGACGACATTGGA



GAGATCACCAAAGGAACAGCGGCATTAAGCACCGCATTTAGCCTTCATG



AAAAGGATCTGTCAAAAACAGACCGTCTCCTTCGAACCACTGAGACACG



CCGGTCTCAAGAAGTGACAGATTTCCTAGCACCTTTACAGTCTTCCTCTAG



ATTGCATAGTTATGTGAAGGCGGAGGAAGACCCAATGGAGGATCCTTAC



GAGTTAAAGCTTCTGAAACATCAGATTAAACAGGAATTTCGTAGAGGGA



CAGAGAGCTTAGATCACCTTGCTGGTCTTTCTCATTATTACCATGCTGATA



CTAGCTACAGACATTTTCCAAAATCTGAGAAGTATAGCATCAGTAGACTC



ACACTTGAAAAACAAGCAGCAAAACAACTGCCAGCAGCCATACTTTATC



AAAAGCAGTCAAAGCATAAGAAATCACTAATTGACCCTAAAATGTCAAA



ATTTTCACCTATTCAAGAAAGTAGAGACCTTGAACCTGATTATTCAAGCT



ATATGACTTCTAGCACTTCATCTATTGGTGGCATTTCCTCCAGGGCAAGGC



TCCTTCAAGATGACATCACTTTTGGCCTCAGAAAAAATATTACAGACCAA



CAAAAATTTATGGGATCTTCTCTTGGCACAGGACTGGGCACATTAGGAAA



TACCATACGCTCAGCTCTGCAGGATGAAGCGGATAAGCCATACAGTAGTG



GCAGCAGGTCCAGACCTTCCTCCAGACCTTCCTCTGTCTATGGGCTTGATT



TATCAATTAAAAGGGATTCTTCTAGCTCTTCCCTAAGACTGAAAGCTCAA



GAGGCTGAAGCTCTAGATGTTTCCTTTAGTCATGCATCATCCTCTGCCAGA



ACTAAGCCGACCAGTTTGCCAATTAGTCAAAGTAGAGGAAGAATACCAA



TTGTGGCCCAGAATTCTGAAGAAGAAAGCCCACTCAGTCCTGTTGGCCAG



CCAATGGGAATGGCCAGGGCTGCAGCTGGACCCCTGCCACCAATATCTGC



AGACACCAGGGATCAGTTTGGATCAAGCCACTCATTGCCTGAAGTTCAGC



AACACATGAGGGAAGAATCACGGACTCGAGGCTATGACCGTGACATAGC



ATTCATCATGGATGACTTCCAACATGCCATGTCAGACAGTGAAGCCTATC



ATCTGCGTCGTGAGGAAACAGATTGGTTTGATAAACCCAGGGAGTCTCGT



TTGGAAAATGGACATGGTCTGGACCGAAAACTGCCGGAAAGATTGGTCC



ACTCTAGACCACTCAGTCAACATCAAGAGCAAATTATACAGATGAACGG



GAAAACTATGCACTACATCTTTCCTCACGCAAGGATAAAAATAACAAGA



GACTCAAAGGATCACACAGTTTCAGGTAATGGATTAGGAATTAGAATTGT



GGGTGGTAAAGAAATCCCGGGACATAGTGGAGAAATTGGAGCCTATATT



GCCAAGATTCTTCCTGGGGGAAGTGCGGAACAGACGGGGAAGCTTATGG



AAGGGATGCAAGTATTGGAATGGAATGGAATTCCCTTGACTTCTAAAACA



TATGAAGAAGTTCAGAGTATCATTAGTCAGCAAAGTGGGGAAGCAGAAA



TATGTGTAAGACTGGACCTCAATATGCTATCAGATTCTGAAAATTCCCAG



CATCTGGAACTTCATGAGCCACCAAAAGCTGTGGATAAGGCGAAATCCCC



AGGGGTTGATCCTAAGCAGTTGGCAGCAGAACTCCAGAAGGTTTCACTAC



AGCAGTCACCGCTGGTTCTGTCATCAGTTGTTGAAAAAGGATCTCATGTT



CATTCAGGTCCTACATCAGCAGGATCCAGTTCCGTTCCCAGCCCTGGGCA



ACCAGGGTCCCCCTCAGTGAGCAAAAAGAAGCACGGCAGCAGCAAGCCT



ACCGATGGAACAAAGGTTGTCTCTCATCCAATTACAGGAGAAATTCAGCT



TCAAATTAACTATGATCTTGGAAATCTCATAATACATATTCTCCAAGCAA



GAAATCTTGTTCCTCGAGACAACAATGGTTATTCTGACCCTTTTGTGAAA



GTGTACCTTCTTCCAGGGAGAGGTCAAGTCATGGTTGTCCAGAATGCAAG



TGCTGAGTACAAGAGAAGGACTAAACATGTCCAGAAAAGTCTTAATCCT



GAGTGGAATCAAACAGTAATTTATAAAAGTATTTCCATGGAACAGCTCAA



GAAGAAAACACTGGAGGTGACAGTTTGGGATTATGATAGATTTTCATCCA



ACGACTTCCTTGGGGAGGTATTGATTGATTTATCTAGCACATCTCACCTCG



ATAACACTCCAAGGTGGTATCCTCTCAAAGAACAGACTGAAAGCATTGAT



CATGGCAAGTCTCATTCCAGTCAGAGCAGCCAGCAGTCCCCAAAGCCATC



TGTTATCAAAAGCAGAAGCCATGGTATCTTCCCTGACCCATCAAAGGACA



TGCAGGTTCCCACCATTGAGAAATCCCATAGTAGTCCTGGTAGCTCAAAA



TCATCATCAGAAGGCCATCTCCGTTCTCATGGACCATCTCGCAGTCAAAG



CAAAACCAGCGTCACTCAGACCCACCTGGAAGATGCAGGGGCTGCCATA



GCTGCTGCCGAAGCTGCCGTGCAACAACTCCGCATTCAACCAAGTAAAAG



ACGCAAATAA





SNX9->
ATGGCCACCAAGGCATTTATTAATACAGCAAAAGAAATTTATGAAAAAA
SEQ ID


RAB2A
TTCAAGAAGGAGTCTTTGACATTAATAATGAGGCAAATGGCATTAAAATT
NO: 79



GGCCCTCAGCATGCTGCTACCAATGCAACACATGCAGGCAATCAGGGAG



GACAGCAGGCTGGGGGCGGCTGCTGTTGA





DNMBP->
CAGGGGCGGAGGTGACAGCGGGCTGGGGACTGGCGGCTGCAACTGCCTG
SEQ ID


TACC2
CCGCGCCGAGGGACCGCCGGGCGGCGGAAAGCAGGAGTTCCGATTCTGA
NO: 80



AGAGGCATTTGAGACCCCGGAGTCAACGACCCCTGTCAAAGCTCCGCCA



GCTCCACCCCCACCACCCCCCGAAGTCATCCCAGAACCCGAGGTCAGCAC



ACAGCCACCCCCGGAAGAACCAGGATGTGGTTCTGAGACAGTCCCTGTCC



CTGATGGCCCACGGAGCGACTCGGTGGAAGGAAGTCCCTTCCGTCCCCCG



TCACACTCCTTCTCTGCCGTCTTCGATGAAGACAAGCCGATAGCCAGCAG



TGGGACTTACAACTTGGACTTTGACAACATTGAGCTTGTGGATACCTTTC



AGACCTTGGAGCCTCGTGCCTCAGACGCTAAGAATCAGGAGGGCAAAGT



GAACACACGGAGGAAGTCCACGGATTCCGTCCCCATCTCTAAGTCTACAC



TGTCCCGGTCGCTCAGCCTGCAAGCCAGTGACTTTGATGGTGCTTCTTCCT



CAGGCAATCCCGAGGCCGTGGCCCTTGCCCCAGATGCATATAGCACGGGT



TCCAGCAGTGCTTCTAGTACCCTTAAGCGAACTAAAAAACCGAGGCCGCC



TTCCTTAAAAAAGAAACAGACCACCAAGAAACCCACAGAGACCCCCCCA



GTGAAGGAGACGCAACAGGAGCCAGATGAAGAGAGCCTTGTCCCCAGTG



GGGAGAATCTAGCATCTGAGACGAAAACGGAATCTGCCAAGACGGAAGG



TCCTAGCCCAGCCTTATTGGAGGAGACGCCCCTTGAGCCCGCTGTGGGGC



CCAAAGCTGCCTGCCCTCTGGACTCAGAGAGTGCAGAAGGGGTTGTCCCC



CCGGCTTCTGGAGGTGGCAGAGTGCAGAACTCACCCCCTGTCGGGAGGA



AAACGCTGCCTCTTACCACGGCCCCGGAGGCAGGGGAGGTAACCCCATC



GGATAGCGGGGGGCAAGAGGACTCTCCAGCCAAAGGGCTCTCCGTAAGG



CTGGAGTTTGACTATTCTGAGGACAAGAGTAGTTGGGACAACCAGCAGG



AAAACCCCCCTCCTACCAAAAAGATAGGCAAAAAGCCAGTTGCCAAAAT



GCCCCTGAGGAGGCCAAAGATGAAAAAGACACCCGAGAAACTTGACAAC



ACTCCTGCCTCACCTCCCAGATCCCCTGCTGAACCCAATGACATCCCCATT



GCTAAAGGTACTTACACCTTTGATATTGACAAGTGGGATGACCCCAATTT



TAACCCTTTTTCTTCCACCTCAAAAATGCAGGAGTCTCCCAAACTGCCCCA



ACAATCATACAACTTTGACCCAGACACCTGTGATGAGTCCGTTGACCCCT



TTAAGACATCCTCTAAGACCCCCAGCTCACCTTCTAAATCCCCAGCCTCCT



TTGAGATCCCAGCCAGTGCTATGGAAGCCAATGGAGTGGACGGGGATGG



GCTAAACAAGCCCGCCAAGAAGAAGAAGACGCCCCTAAAGACTGACACA



TTTAGGGTGAAAAAGTCGCCAAAACGGTCTCCTCTCTCTGATCCACCTTC



CCAGGACCCCACCCCAGCTGCTACACCAGAAACACCACCAGTGATCTCTG



CGGTGGTCCACGCCACAGATGAGGAAAAGCTGGCGGTCACCAACCAGAA



GTGGACGTGCATGACAGTGGACCTAGAGGCTGACAAACAGGACTACCCG



CAGCCCTCGGACCTGTCCACCTTTGTAAACGAGACCAAATTCAGTTCACC



CACTGAGGAGTTGGATTACAGAAACTCCTATGAAATTGAATATATGGAGA



AAATTGGCTCCTCCTTACCTCAGGACGACGATGCCCCGAAGAAGCAGGCC



TTGTACCTTATGTTTGACACTTCTCAGGAGAGCCCTGTCAAGTCATCTCCC



GTCCGCATGTCAGAGTCCCCGACGCCGTGTTCAGGGTCAAGTTTTGAAGA



GACTGAAGCCCTTGTGAACACTGCTGCGAAAAACCAGCATCCTGTCCCAC



GAGGACTGGCCCCTAACCAAGAGTCACACTTGCAGGTGCCAGAGAAATC



CTCCCAGAAGGAGCTGGAGGCCATGGGCTTGGGCACCCCTTCAGAAGCG



ATTGAAATTACAGCTCCCGAGGGCTCCTTTGCCTCTGCTGACGCCCTCCTC



AGCAGGCTAGCTCACCCCGTCTCTCTCTGTGGTGCACTTGACTATCTGGA



GCCCGACTTAGCAGAAAAGAACCCCCCACTATTCGCTCAGAAACTCCAGG



AGGAGTTAGAGTTTGCCATCATGCGGATAGAAGCCCTGAAGCTGGCCAG



GCAGATTGCTTTGGCTTCCCGCAGCCACCAGGATGCCAAGAGAGAGGCTG



CTCACCCAACAGACGTCTCCATCTCCAAAACAGCCTTGTACTCCCGCATC



GGGACCGCTGAGGTGGAGAAACCTGCAGGCCTTCTGTTCCAGCAGCCCG



ACCTGGACTCTGCCCTCCAGATCGCCAGAGCAGAGATCATAACCAAGGA



GAGAGAGGTCTCAGAATGGAAAGATAAATATGAAGAAAGCAGGCGGGA



AGTGATGGAAATGAGGAAAATAGTGGCCGAGTATGAGAAGACCATCGCT



CAGATGATAGAGGACGAACAGAGAGAGAAGTCAGTCTCCCACCAGACGG



TGCAGCAGCTGGTTCTGGAGAAGGAGCAAGCCCTGGCCGACCTGAACTC



CGTGGAGAAGTCTCTGGCCGACCTCTTCAGAAGATATGAGAAGATGAAG



GAGGTCCTAGAAGGCTTCCGCAAGAATGAAGAGGTGTTGAAGAGATGTG



CGCAGGAGTACCTGTCCCGGGTGAAGAAGGAGGAGCAGAGGTACCAGGC



CCTGAAGGTGCACGCGGAGGAGAAACTGGACAGGGCCAATGCTGAGATT



GCTCAGGTTCGAGGCAAGGCCCAGCAGGAGCAAGCCGCCCACCAGGCCA



GCCTGCGGAAGGAGCAGCTGCGAGTGGACGCCCTGGAAAGGACGCTGGA



GCAGAAGAATAAAGAAATAGAAGAACTCACCAAGATTTGTGACGAACTG



ATTGCCAAAATGGGGAAAAGCTAACTCTGAACCGAATGTTTTGGACTTAA



CTGTTGCGTGCAATATGACCGTCGGCACACTGCTGTTCCTCCAGTTCCATG



GACAGGTTCTGTTTTCACTTTTTCGTATGCACTACTGTATTTCCTTTCTAAA



TAAAATTGATTTGATTGTATGCAGTACTAAGGAGACTATCAGAATTTCTT



GCTATTGGTTTGCATTTTCCTAGTATAATTCATAGCAAGTTGACCTCAGAG



TTCCTGTATCAGGGAGATTGTCTGATTCTCTAATAAAAGACACATTGCTG



ACCTTGGCCTTGCCCTTTGTACACAAGTTCCCAGGGTGAGCAGCTTTTGG



ATTTAATATGAACATGTACAGCGTGCATAGGGACTCTTGCCTTAAGGAGT



GTAAACTTGATCTGCATTTGCTGATTTGTTTTTAAAAAAACAAGAAATGC



ATGTTTCAAATAAAATTCTCTATTGTAAATAAAATTTTTTCTTTGGATCTT



GGCAAT





RREB1->
CGGGATGGCAACTGCGGTCACCCTGCTAAAGTCGGGGCGGGGGCGGCGG
SEQ ID


DSP
TCCTCCCCCTCACCCCCCCCAGTCCGAGCGCCGCCGCCGCCGCCGCCGCC
NO: 81



GCCGCCGCGGCCGCTCAGTAACACGTCCCCAGGAGACTCGCAGGAGCAA



CACGTGATGTGTCTACTTATCAGGTCAAACCGGCACGATGTCCAGGCACC



AGAACCAGAACACCATCCAGGAGCTGCTGCAGAACTGCTCCGACTGCTTG



ATGCGAGCAGAGCTCATCGTGCAGCCTGAATTGAAGTATGGAGATGGAA



TACAACTGACTCGGAGTCGAGAATTGGATGAGTGTTTTGCCCAGGCCAAT



GACCAAATGGAAATCCTCGACAGCTTGATCAGAGAGATGCGGCAGATGG



GCCAGCCCTGTGATGCTTACCAGAAAAGGCTTCTTCAGCTCCAAGAGCAA



ATGCGAGCCCTTTATAAAGCCATCAGTGTCCCTCGAGTCCGCAGGGCCAG



CTCCAAGGGTGGTGGAGGCTACACTTGTCAGAGTGGCTCTGGCTGGGATG



AGTTCACCAAACATGTCACCAGTGAATGTTTGGGGTGGATGAGGCAGCA



AAGGGCGGAGATGGACATGGTGGCCTGGGGTGTGGACCTGGCCTCAGTG



GAGCAGCACATTAACAGCCACCGGGGCATCCACAACTCCATCGGCGACT



ATCGCTGGCAGCTGGACAAAATCAAAGCCGACCTGCGCGAGAAATCTGC



TATCTACCAGTTGGAGGAGGAGTATGAAAACCTGCTGAAAGCGTCCTTTG



AGAGGATGGATCACCTGCGACAGCTGCAGAACATCATTCAGGCCACGTC



CAGGGAGATCATGTGGATCAATGACTGCGAGGAGGAGGAGCTGCTGTAC



GACTGGAGCGACAAGAACACCAACATCGCTCAGAAACAGGAGGCCTTCT



CCATACGCATGAGTCAACTGGAAGTTAAAGAAAAAGAGCTCAATAAGCT



GAAACAAGAAAGTGACCAACTTGTCCTCAATCAGCATCCAGCTTCAGACA



AAATTGAGGCCTATATGGACACTCTGCAGACGCAGTGGAGTTGGATTCTT



CAGATCACCAAGTGCATTGATGTTCATCTGAAAGAAAATGCTGCCTACTT



TCAGTTTTTTGAAGAGGCGCAGTCTACTGAAGCATACCTGAAGGGGCTCC



AGGACTCCATCAGGAAGAAGTACCCCTGCGACAAGAACATGCCCCTGCA



GCACCTGCTGGAACAGATCAAGGAGCTGGAGAAAGAACGAGAGAAAATC



CTTGAATACAAGCGTCAGGTGCAGAACTTGGTAAACAAGTCTAAGAAGA



TTGTACAGCTGAAGCCTCGTAACCCAGACTACAGAAGCAATAAACCCATT



ATTCTCAGAGCTCTCTGTGACTACAAACAAGATCAGAAAATCGTGCATAA



GGGGGATGAGTGTATCCTGAAGGACAACAACGAGCGCAGCAAGTGGTAC



GTGACGGGCCCGGGAGGCGTTGACATGCTTGTTCCCTCTGTGGGGCTGAT



CATCCCTCCTCCGAACCCACTGGCCGTGGACCTCTCTTGCAAGATTGAGC



AGTACTACGAAGCCATCTTGGCTCTGTGGAACCAGCTCTACATCAACATG



AAGAGCCTGGTGTCCTGGCACTACTGCATGATTGACATAGAGAAGATCAG



GGCCATGACAATCGCCAAGCTGAAAACAATGCGGCAGGAAGATTACATG



AAGACGATAGCCGACCTTGAGTTACATTACCAAGAGTTCATCAGAAATAG



CCAAGGCTCAGAGATGTTTGGAGATGATGACAAGCGGAAAATACAGTCT



CAGTTCACCGATGCCCAGAAGCATTACCAGACCCTGGTCATTCAGCTCCC



TGGCTATCCCCAGCACCAGACAGTGACCACAACTGAAATCACTCATCATG



GAACCTGCCAAGATGTCAACCATAATAAAGTAATTGAAACCAACAGAGA



AAATGACAAGCAAGAAACATGGATGCTGATGGAGCTGCAGAAGATTCGC



AGGCAGATAGAGCACTGCGAGGGCAGGATGACTCTCAAAAACCTCCCTC



TAGCAGACCAGGGATCTTCTCACCACATCACAGTGAAAATTAACGAGCTT



AAGAGTGTGCAGAATGATTCACAAGCAATTGCTGAGGTTCTCAACCAGCT



TAAAGATATGCTTGCCAACTTCAGAGGTTCTGAAAAGTACTGCTATTTAC



AGAATGAAGTATTTGGACTATTTCAGAAACTGGAAAATATCAATGGTGTT



ACAGATGGCTACTTAAATAGCTTATGCACAGTAAGGGCACTGCTCCAGGC



TATTCTCCAAACAGAAGACATGTTAAAGGTTTATGAAGCCAGGCTCACTG



AGGAGGAAACTGTCTGCCTGGACCTGGATAAAGTGGAAGCTTACCGCTGT



GGACTGAAGAAAATAAAAAATGACTTGAACTTGAAGAAGTCGTTGTTGG



CCACTATGAAGACAGAACTACAGAAAGCCCAGCAGATCCACTCTCAGAC



TTCACAGCAGTATCCACTTTATGATCTGGACTTGGGCAAGTTCGGTGAAA



AAGTCACACAGCTGACAGACCGCTGGCAAAGGATAGATAAACAGATCGA



CTTTAGGTTATGGGACCTGGAGAAACAAATCAAGCAATTGAGGAATTATC



GTGATAACTATCAGGCTTTCTGCAAGTGGCTCTATGATGCTAAACGCCGC



CAGGATTCCTTAGAATCCATGAAATTTGGAGATTCCAACACAGTCATGCG



GTTTTTGAATGAGCAGAAGAACTTGCACAGTGAAATATCTGGCAAACGA



GACAAATCAGAGGAAGTACAAAAAATTGCTGAACTTTGCGCCAATTCAA



TTAAGGATTATGAGCTCCAGCTGGCCTCATACACCTCAGGACTGGAAACT



CTGCTGAACATACCTATCAAGAGGACCATGATTCAGTCCCCTTCTGGGGT



GATTCTGCAAGAGGCTGCAGATGTTCATGCTCGGTACATTGAACTACTTA



CAAGATCTGGAGACTATTACAGGTTCTTAAGTGAGATGCTGAAGAGTTTG



GAAGATCTGAAGCTGAAAAATACCAAGATCGAAGTTTTGGAAGAGGAGC



TCAGACTGGCCCGAGATGCCAACTCGGAAAACTGTAATAAGAACAAATT



CCTGGATCAGAACCTGCAGAAATACCAGGCAGAGTGTTCCCAGTTCAAA



GCGAAGCTTGCGAGCCTGGAGGAGCTGAAGAGACAGGCTGAGCTGGATG



GGAAGTCGGCTAAGCAAAATCTAGACAAGTGCTACGGCCAAATAAAAGA



ACTCAATGAGAAGATCACCCGACTGACTTATGAGATTGAAGATGAAAAG



AGAAGAAGAAAATCTGTGGAAGACAGATTTGACCAACAGAAGAATGACT



ATGACCAACTGCAGAAAGCAAGGCAATGTGAAAAGGAGAACCTTGGTTG



GCAGAAATTAGAGTCTGAGAAAGCCATCAAGGAGAAGGAGTACGAGATT



GAAAGGTTGAGGGTTCTACTGCAGGAAGAAGGCACCCGGAAGAGAGAAT



ATGAAAATGAGCTGGCAAAGGTAAGAAACCACTATAATGAGGAGATGAG



TAATTTAAGGAACAAGTATGAAACAGAGATTAACATTACGAAGACCACC



ATCAAGGAGATATCCATGCAAAAAGAGGATGATTCCAAAAATCTTAGAA



ACCAGCTTGATAGACTTTCAAGGGAAAATCGAGATCTGAAGGATGAAAT



TGTCAGGCTCAATGACAGCATCTTGCAGGCCACTGAGCAGCGAAGGCGA



GCTGAAGAAAACGCCCTTCAGCAAAAGGCCTGTGGCTCTGAGATAATGC



AGAAGAAGCAGCATCTGGAGATAGAACTGAAGCAGGTCATGCAGCAGCG



CTCTGAGGACAATGCCCGGCACAAGCAGTCCCTGGAGGAGGCTGCCAAG



ACCATTCAGGACAAAAATAAGGAGATCGAGAGACTCAAAGCTGAGTTTC



AGGAGGAGGCCAAGCGCCGCTGGGAATATGAAAATGAACTGAGTAAGGT



AAGAAACAATTATGATGAGGAGATCATTAGCTTAAAAAATCAGTTTGAG



ACCGAGATCAACATCACCAAGACCACCATCCACCAGCTCACCATGCAGA



AGGAAGAGGATACCAGTGGCTACCGGGCTCAGATAGACAATCTCACCCG



AGAAAACAGGAGCTTATCTGAAGAAATAAAGAGGCTGAAGAACACTCTA



ACCCAGACCACAGAGAATCTCAGGAGGGTGGAAGAAGACATCCAACAGC



AAAAGGCCACTGGCTCTGAGGTGTCTCAGAGGAAACAGCAGCTGGAGGT



TGAGCTGAGACAAGTCACTCAGATGCGAACAGAGGAGAGCGTAAGATAT



AAGCAATCTCTTGATGATGCTGCCAAAACCATCCAGGATAAAAACAAGG



AGATAGAAAGGTTAAAACAACTGATCGACAAAGAAACAAATGACCGGAA



ATGCCTGGAAGATGAAAACGCGAGATTACAAAGGGTCCAGTATGACCTG



CAGAAAGCAAACAGTAGTGCGACGGAGACAATAAACAAACTGAAGGTTC



AGGAGCAAGAACTGACACGCCTGAGGATCGACTATGAAAGGGTTTCCCA



GGAGAGGACTGTGAAGGACCAGGATATCACGCGGTTCCAGAACTCTCTG



AAAGAGCTGCAGCTGCAGAAGCAGAAGGTGGAAGAGGAGCTGAATCGG



CTGAAGAGGACCGCGTCAGAAGACTCCTGCAAGAGGAAGAAGCTGGAGG



AAGAGCTGGAAGGCATGAGGAGGTCGCTGAAGGAGCAAGCCATCAAAAT



CACCAACCTGACCCAGCAGCTGGAGCAGGCATCCATTGTTAAGAAGAGG



AGTGAGGATGACCTCCGGCAGCAGAGGGACGTGCTGGATGGCCACCTGA



GGGAAAAGCAGAGGACCCAGGAAGAGCTGAGGAGGCTCTCTTCTGAGGT



CGAGGCCCTGAGGCGGCAGTTACTCCAGGAACAGGAAAGTGTCAAACAA



GCTCACTTGAGGAATGAGCATTTCCAGAAGGCGATAGAAGATAAAAGCA



GAAGCTTAAATGAAAGCAAAATAGAAATTGAGAGGCTGCAGTCTCTCAC



AGAGAACCTGACCAAGGAGCACTTGATGTTAGAAGAAGAACTGCGGAAC



CTGAGGCTGGAGTACGATGACCTGAGGAGAGGACGAAGCGAAGCGGACA



GTGATAAAAATGCAACCATCTTGGAACTAAGGAGCCAGCTGCAGATCAG



CAACAACCGGACCCTGGAACTGCAGGGGCTGATTAATGATTTACAGAGA



GAGAGGGAAAATTTGAGACAGGAAATTGAGAAATTCCAAAAGCAGGCTT



TAGAGGCATCTAATAGGATTCAGGAATCAAAGAATCAGTGTACTCAGGT



GGTACAGGAAAGAGAGAGCCTTCTGGTGAAAATCAAAGTCCTGGAGCAA



GACAAGGCAAGGCTGCAGAGGCTGGAGGATGAGCTGAATCGTGCAAAAT



CAACTCTAGAGGCAGAAACCAGGGTGAAACAGCGCCTGGAGTGTGAGAA



ACAGCAAATTCAGAATGACCTGAATCAGTGGAAGACTCAATATTCCCGCA



AGGAGGAGGCTATTAGGAAGATAGAATCGGAAAGAGAAAAGAGTGAGA



GAGAGAAGAACAGTCTTAGGAGTGAGATCGAAAGACTCCAAGCAGAGAT



CAAGAGAATTGAAGAGAGGTGCAGGCGTAAGCTGGAGGATTCTACCAGG



GAGACACAGTCACAGTTAGAAACAGAACGCTCCCGATATCAGAGGGAGA



TTGATAAACTCAGACAGCGCCCATATGGGTCCCATCGAGAGACCCAGACT



GAGTGTGAGTGGACCGTTGACACCTCCAAGCTGGTGTTTGATGGGCTGAG



GAAGAAGGTGACAGCAATGCAGCTCTATGAGTGTCAGCTGATCGACAAA



ACAACCTTGGACAAACTATTGAAGGGGAAGAAGTCAGTGGAAGAAGTTG



CTTCTGAAATCCAGCCATTCCTTCGGGGTGCAGGATCTATCGCTGGAGCA



TCTGCTTCTCCTAAGGAAAAATACTCTTTGGTAGAGGCCAAGAGAAAGAA



ATTAATCAGCCCAGAATCCACAGTCATGCTTCTGGAGGCCCAGGCAGCTA



CAGGTGGTATAATTGATCCCCATCGGAATGAGAAGCTGACTGTCGACAGT



GCCATAGCTCGGGACCTCATTGACTTCGATGACCGTCAGCAGATATATGC



AGCAGAAAAAGCTATCACTGGTTTTGATGATCCATTTTCAGGCAAGACAG



TATCTGTTTCAGAAGCCATCAAGAAAAATTTGATTGATAGAGAAACCGGA



ATGCGCCTGCTGGAAGCCCAGATTGCTTCAGGGGGTGTAGTAGACCCTGT



GAACAGTGTCTTTTTGCCAAAAGATGTCGCCTTGGCCCGGGGGCTGATTG



ATAGAGATTTGTATCGATCCCTGAATGATCCCCGAGATAGTCAGAAAAAC



TTTGTGGATCCAGTCACCAAAAAGAAGGTCAGTTACGTGCAGCTGAAGG



AACGGTGCAGAATCGAACCACATACTGGTCTGCTCTTGCTTTCAGTACAG



AAGAGAAGCATGTCCTTCCAAGGAATCAGACAACCTGTGACCGTCACTG



AGCTAGTAGATTCTGGTATATTGAGACCGTCCACTGTCAATGAACTGGAA



TCTGGTCAGATTTCTTATGACGAGGTTGGTGAGAGAATTAAGGACTTCCT



CCAGGGTTCAAGCTGCATAGCAGGCATATACAATGAGACCACAAAACAG



AAGCTTGGCATTTATGAGGCCATGAAAATTGGCTTAGTCCGACCTGGTAC



TGCTCTGGAGTTGCTGGAAGCCCAAGCAGCTACTGGCTTTATAGTGGATC



CTGTTAGCAACTTGAGGTTACCAGTGGAGGAAGCCTACAAGAGAGGTCT



GGTGGGCATTGAGTTCAAAGAGAAGCTCCTGTCTGCAGAACGAGCTGTCA



CTGGGTATAATGATCCTGAAACAGGAAACATCATCTCTTTGTTCCAAGCC



ATGAATAAGGAACTCATCGAAAAGGGCCACGGTATTCGCTTATTAGAAG



CACAGATCGCAACCGGGGGGATCATTGACCCAAAGGAGAGCCATCGTTT



ACCAGTTGACATAGCATATAAGAGGGGCTATTTCAATGAGGAACTCAGTG



AGATTCTCTCAGATCCAAGTGATGATACCAAAGGATTTTTTGACCCCAAC



ACTGAAGAAAATCTTACCTATCTGCAACTAAAAGAAAGATGCATTAAGG



ATGAGGAAACAGGGCTCTGTCTTCTGCCTCTGAAAGAAAAGAAGAAACA



GGTGCAGACATCACAAAAGAATACCCTCAGGAAGCGTAGAGTGGTCATA



GTTGACCCAGAAACCAATAAAGAAATGTCTGTTCAGGAGGCCTACAAGA



AGGGCCTAATTGATTATGAAACCTTCAAAGAACTGTGTGAGCAGGAATGT



GAATGGGAAGAAATAACCATCACGGGATCAGATGGCTCCACCAGGGTGG



TCCTGGTAGATAGAAAGACAGGCAGTCAGTATGATATTCAAGATGCTATT



GACAAGGGCCTTGTTGACAGGAAGTTCTTTGATCAGTACCGATCCGGCAG



CCTCAGCCTCACTCAATTTGCTGACATGATCTCCTTGAAAAATGGTGTCG



GCACCAGCAGCAGCATGGGCAGTGGTGTCAGCGATGATGTTTTTAGCAGC



TCCCGACATGAATCAGTAAGTAAGATTTCCACCATATCCAGCGTCAGGAA



TTTAACCATAAGGAGCAGCTCTTTTTCAGACACCCTGGAAGAATCGAGCC



CCATTGCAGCCATCTTTGACACAGAAAACCTGGAGAAAATCTCCATTACA



GAAGGTATAGAGCGGGGCATCGTTGACAGCATCACGGGTCAGAGGCTTC



TGGAGGCTCAGGCCTGCACAGGTGGCATCATCCACCCAACCACGGGCCA



GAAGCTGTCACTTCAGGACGCAGTCTCCCAGGGTGTGATTGACCAAGACA



TGGCCACCAGGCTGAAGCCTGCTCAGAAAGCCTTCATAGGCTTCGAGGGT



GTGAAGGGAAAGAAGAAGATGTCAGCAGCAGAGGCAGTGAAAGAAAAA



TGGCTCCCGTATGAGGCTGGCCAGCGCTTCCTGGAGTTCCAGTACCTCAC



GGGAGGTCTTGTTGACCCGGAAGTGCATGGGAGGATAAGCACCGAAGAA



GCCATCCGGAAGGGGTTCATAGATGGCCGCGCCGCACAGAGGCTGCAAG



ACACCAGCAGCTATGCCAAAATCCTGACCTGCCCCAAAACCAAATTAAA



AATATCCTATAAGGATGCCATAAATCGCTCCATGGTAGAAGATATCACTG



GGCTGCGCCTTCTGGAAGCCGCCTCCGTGTCGTCCAAGGGCTTACCCAGC



CCTTACAACATGTCTTCGGCTCCGGGGTCCCGCTCCGGCTCCCGCTCGGG



ATCTCGCTCCGGATCTCGCTCCGGGTCCCGCAGTGGGTCCCGGAGAGGAA



GCTTTGACGCCACAGGGAATTCTTCCTACTCTTATTCCTACTCATTTAGCA



GTAGTTCTATTGGGCACTAGTAGTCAGTTGGGAGTGGTTGCTATACCTTG



ACTTCATTTATATGAATTTCCACTTTATTAAATAATAGAAAAGAAAATCC



CGGTGCTTGCAGTAGAGTGATAGGACATTCTATGCTTACAGAAAATATAG



CCATGATTGAAATCAAATAGTAAAGGCTGTTCTGGCTTTTTATCTTCTTAG



CTCATCTTAAATAAGCAGTACACTTGGATGCAGTGCGTCTGAAGTGCTAA



TCAGTTGTAACAATAGCACAAATCGAACTTAGGATTTGTTTCTTCTCTTCT



GTGTTTCGATTTTTGATCAATTCTTTAATTTTGGAAGCCTATAATACAGTT



TTCTATTCTTGGAGATAAAAATTAAATGGATCACTGATATTTTAGTCATTC



TGCTTCTCATCTAAATATTTCCATATTCTGTATTAGGAGAAAATTACCCTC



CCAGCACCAGCCCCCCTCTCAAACCCCCAACCCAAAACCAAGCATTTTGG



AATGAGTCTCCTTTAGTTTCAGAGTGTGGATTGTATAACCCATATACTCTT



CGATGTACTTGTTTGGTTTGGTATTAATTTGACTGTGCATGACAGCGGCAA



TCTTTTCTTTGGTCAAAGTTTTCTGTTTATTTTGCTTGTCATATTCGATGTA



CTTTAAGGTGTCTTTATGAAGTTTGCTATTCTGGCAATAAACTTTTAGACT



TTTGAAGTGTTTGTGTTTTAATTTAATATGTTTATAAGCATGTATAAACAT



TTAGCATATTTTTATCATAGGTCTAAAAATATTTGTTTACTAAATACCTGT



GAAGAAATACCATTAAAAAACTATTTGGTTCTGAATTCTTACTA





CHIA->
AAAGCTTCATGAAACCTCCTCGTCTGTGCACGAACAGGTGGCCGACTCTG
SEQ ID


ZNF138
GAGCCCAGGCTGTTGCTTTCCAGTCTGGTGGTGAATCCTCCATAGTCTGCT
NO: 82



CTGTGTTCTCGTTTTGCCCAAGACCTTTGGCTAGAGCAGAACATAAAAGA



TTCTTTCCAAAAAGTGACACTGAGCAGATATGGAAAATATGGACATAAG



AATTTACAGTTAAGAAAAGGCTGTAAAAGTGTGGATGAGTGTAAGGGAC



ACCAAGGAGGTTTTAATGGACTTAACCAATGTTTGAAAATTACCACAAGC



AAAATATTTCAATGTAATAAATATGTAAAAGTCATGCATAAATTTTCAAA



TTCAAATAGACACAAGATAAGACATACTGAAAATAAACATTTCAGATGT



AAAGAATGTGACAAATCACTTTGCATGCTTTCACGCCTAACTCAACATAA



AAAAATTCATACTAGAGAGAATTTCTACAAATGTGAAGAGTGTGGAAAA



ACCTTTAACTGGTCCACAAACCTTTCTAAACCTAAGAAAATTCATACTGG



AGAAAAACCCTACAAATGTGAAGTATGTGGAAAAGCCTTTCACCAATCCT



CAATCCTTACTAAACATAAGATAATTCGTACTGGAGAAAAACCCTATAAA



TGTGCACACTGTGGCAAAGCCTTTAAACAGTCCTCACACCTTACTAGACA



TAAGATAATTCATACTGAAGAGAAACCCTACAAATGTGAACAATGTGGC



AAGGTCTTTAAGCAGTCCCCAACCCTTACTAAACATCAGATAATTTATAC



TGGAGAGGAACCATACAAATGTGAGGAATGTGGCAAAGCTTTTAACCTA



TCTTAACAACTTACTGAACATAAGAAAATTTACACTAGAGAGAAAGCCTA



CAAATGTGAAGAATGTGGCAAAGCCTTTAACCAGTTTTCAACCCTTATTA



CACATAAGATAATTCATAGCGGAGAGAAACCCCACAAATGTGAAGAATG



TGGCAGAGCTTTTAACCAGTCCGCAAAGCTCACTGAACATAAGTTAATTC



ATACTGGAGAAAAACCCTACAAATGTAAAGAATGTGGAAAAGCTTTTCA



CCGATACTCAATCCTTAGTACACATAAGAAAATTCATACTGGGGAGAAAC



CCCACAAATGTGGAGAATGCGGAAAAGCCTTTAACTGGTCCTCAACTCTT



ATTACACATAAGATAATTCACAGTGGAGAAAAACCCTACAAATGTGAAG



AATGTGGCAAAGCTTTTAACCAGTCCTCACACCTTATGAGACATAAGAAA



ATTCATAGTAAAGAGAAACCCTACAAATGTGAACAGTGTGGCAAGGTCTT



TAAGAAGTCCTCAACTCTTACTGCACATAAGATCATTCATACTGGAGAGA



AACCTTACAAATGTGAGGAATGTGGCAAAGGTTTTAGCCAACTCTCAAAC



CTTACTAAACACAAGAAGATTCATACTAGAGAGAAACCCTACAAATGTG



AAGAATGTGGCATATCTTTTAACCAGTTCTCACAACTTGCTATACATAAG



ATGATTCACACTTGAATGAAACCCTACAAATGTGAACGATGTGGCAGTTG



TTTTAACTAGTTCTCGAACTTTACTATGCATAAGAAAATTCAAACTGGAG



AGAAACTCTACAAATGTGAAGAATGTGGCAAAGCTTTTAACCAAGTCTCA



ACACTTACTATACATAAGATAATTTATACTGGAGCAAAACCTTGGAAATT



CAAAGAATGTGGTAAAACTTACAATCCTCAAAACTTCTTACACCTAAAAT



TCATGCAGGAGAGAAACACCACAAATGTGAAAAATTTGGTAAATTCTTTA



ACAAGTCTTCAACCCTTTCTGCACATAATATAATTCATACTGGAGAGAAA



CCCCACAAATATGAAGAATGTGGTAATGCTTTTAACCAATTCTCAAATCT



TACTAAACAAAATTAATACTGAAAATGTTACAAACCAGAAAAATGTGAA



AATGATTTTAACAAAACCTTCAAATTTTTCTAAACATAAAGGAAATCATA



CTGGTAAGAAATTATAAAAATGTGAAGAATGTGACAAAGCCTTTAAATG



GTTGTCACACTTGATTGTAGGTAAGATAATTCATACTGGCAGAAACTCCC



AGAAGTGTGAAGAATATGGCAAAACTTTAATTCCTATACCTTATTGCACA



GGAAAGCATTTATACTTCAGAAAATGTTGTACTGATATAAAGAATGTAGA



AAAGCCATTAATATGTGCTTACATCTTATTCAACATTAGAGAGTTAGTAC



TTAATAAA





ZNF569->
AGCAGCGGAACGATTCGATTCTTCTCAGCACCAAGTTGCGCTCCCAATCT
SEQ ID


SHFM1
CTCAGAGCTGGGCTCGCGGGAGGCCGCTCGTGCAAAACCTAGGCTGAGC
NO: 83



TCCCCTGCGCGGAGCTGTGAGCCCTGGAACACCGTGGTCTGCTTCTCAGG



ACGCGCAAACAGTGAAGCCAGTCCCGCCCGGGTGAGCCGCGGGGGCCTC



TGGGAAGCGTCGCCCCTGGTGTAACGGACCGAGACTTGTGGCGCTCTCAG



CCACCGACAGCGCCGGCCTCAGTGCCGCCTCTGTCCCAGCCCGCGCCGGC



TCTGCCACTTTGGCAGCGTTAAGTGTGGAATCGGGGCCTGTGTCCGCGGG



CTTGGTGAGTTCTTCATATATTAAGGATTCATTCATTCATAGACTCATTTA



TTGAAGGCTGTCTGTGTAACAGGCACAATCCTAGGTGCTTGGGATATAGC



AGTGAACAAGAGACAAACCCCCTACTATCATGGTACTTACATTTTTGTGG



GCTGGATAATAAACAAGACTGGGCTGGCTTAGATGAAGATGAAGATGCA



CATGTCTGGGAGGATAATTGGGATGATGACAATGTAGAGGATGACTTCTC



TAATCAGTTACGAGCTGAACTAGAGAAACATGGTTATAAGATGGAGACTT



CATAGCATCCAGAAGAAGTGTTGAAGTAACCTAAACTTGACCTGCTTAAT



ACATTCTAGGGCAGAGAACCCAGGATGGGACACTAAAAAAATGTGTTTA



TTTCATTATCTGCTTGGATTTATTTGTGTTTTTGTAACACAAAAAATAAAT



GTTTTGATATAA





TRIM37->
ATGGATGAACAGAGCGTGGAGAGCATTGCTGAGGTTTTCCGATGTTTCAT
SEQ ID


BCAS3
TTGTATGGAGAAATTGCGGGATGCACGCCTGTGTCCTCATTGCTCCAAAC
NO: 84



TGTGTTGTTTCAGCTGTATTAGGCGCTGGCTGACAGAGCAGAGAGCTCAA



TGTCCTCATTGCCGTGCTCCACTCCAGCTACGAGAACTAGTAAATTGTCGT



TGGGCAGAAGAAGTAACACAACAGCTTGATACTCTTCAACTCTGCAGTCT



CACCAAACATGAAGAAAATGAAAAGGACAAATGTGAAAATCACCATGAA



AAACTTAGTGTATTTTGCTGGACTTGTAAGAAGTGTATCTGCCATCAGTGT



GCACTTTGGGGAGGAATGCATGGCGGACATACCTTTAAACCTTTGGCAGA



AATTTATGAGCAACACGTCACTAAAGTGAATGAAGAGGTAGCCAAACTT



CGTCGGCGTCTCATGGAACTGATCAGCTTAGTTCAAGAAGTGGAAAGGA



ATGTAGAAGCTGTAAGAAATGCAAAAGATGAGCGTGTTCGGGAAATTAG



GAATGCAGTGGAGATGATGATTGCACGGTTAGACACACAGCTGAAGAAT



AAGCTTATAACACTGATGGGTCAGAAGACATCTCTAACCCAAGAAACAG



AGCTTTTGGAATCCTTACTTCAGGAGGTGGAGCACCAGTTGCGGTCTTGT



AGTAAGAGTGAGTTGATATCTAAGAGCTCAGAGATCCTTATGATGTTTCA



GCAAGTTCATCGGAAGCCCATGGCATCTTTTGTTACCACTCCTGTTCCACC



AGACTTTACCAGTGAATTAGTGCCATCTTACGATTCAGCTACTTTTGTTTT



AGAGAATTTCAGCACTTTGCGTCAGAGAGCAGATCCTGTTTACAGTCCAC



CTCTTCAAGTTTCAGGACTTTGCTGGAGGTTAAAAGTTTACCCAGATGGA



AATGGAGTTGTGCGAGGTTACTACTTATCTGTGTTTCTGGAGCTCTCAGCT



GGCTTGCCTGAAACTTCTAAATATGAATATCGTGTAGAGATGGTTCACCA



GTCCTGTAATGATCCTACAAAAAATATCATTCGAGAATTTGCATCTGACT



TTGAAGTTGGAGAATGCTGGGGCTATAATAGATTTTTCCGTTTGGACTTA



CTCGCAAATGAAGGATACTTGAATCCACAAAATGATACAGTGATTTTAAG



GTTTCAGGTACGTTCACCAACTTTCTTTCAAAAATCCCGGGACCAGCATT



GGTACATTACTCAGTTGGAAGCTGCACAGACTAGTTATATCCAACAAATA



AACAACCTTAAAGAGAGACTTACTATTGAGCTGTCTCGAACTCAGAAGTC



AAGAGATTTGTCACCACCAGATAACCATCTTAGCCCCCAAAATGATGATG



CTCTGGAGACACGAGCTAAGAAGTCTGCATGCTCTGACATGCTTCTCGAA



GGTGGTCCTACTACAGCTTCTGTAAGAGAGGCCAAAGAGGATGAAGAAG



ATGAGGAGAAGATTCAGAATGAAGATTATCATCACGAGCTTTCAGATGG



AGATCTGGATCTGGATCTTGTTTATGAGGATGAAGTAAATCAGCTCGATG



GCAGCAGTTCCTCTGCTAGTTCCACAGCAACAAGTAATACAGAAGAAAAT



GATATTGATGAAGAAACTATGTCTGGAGAAAATGATGTGGAATATAACA



ACATGGAATTAGAAGAGGGAGAACTCATGGAAGATGCAGCTGCTGCAGG



ACCCGCAGGTAGTAGCCATGGTTATGTGGGTTCCAGTAGTAGAATATCAA



GAAGAACACATTTATGCTCCGCTGCTACCAGTAGTTTACTAGACATTGAT



CCATTAATTTTAATACATTTGTTGGACCTTAAGGACCGGAGCAGTATAGA



AAATTTGTGGGGCTTACAGCCTCGCCCACCTGCTTCACTTCTGCAGCCCAC



AGCATCATATTCTCGAAAAGATAAAGACCAAAGGAAGCAACAGGCAATG



TGGCGAGTGCCCTCTGATTTAAAGATGCTAAAAAGACTCAAAACTCAAAT



GGCCGAAGTTCGATGTATGAAAACTGATGTAAAGAATACACTTTCAGAA



ATAAAAAGCAGCAGTGCTGCTTCTGGAGACATGCAGACAAGCCTTTTTTC



TGCTGACCAGGCAGCTCTGGCTGCATGTGGAACTGAAAACTCTGGCAGAT



TGCAGGATTTGGGAATGGAACTCCTGGCAAAGTCATCAGTTGCCAATTGT



TACATACGAAACTCCACAAATAAGAAGAGTAATTCGCCCAAGCCAGCTC



GATCCAGTGTAGCAGGTAGTCTATCACTTCGAAGAGCAGTGGACCCTGGA



GAAAATAGTCGTTCAAAGGGAGACTGTCAGACTCTGTCTGAAGGCTCCCC



AGGAAGCTCTCAGTCTGGGAGCAGGCACAGTTCTCCCCGAGCCTTGATAC



ATGGCAGTATCGGTGATATTCTGCCAAAAACTGAAGACCGGCAGTGTAA



AGCTTTGGATTCAGATGCTGTTGTGGTTGCAGTTTTCAGTGGCTTGCCTGC



GGTTGAGAAAAGGAGGAAAATGGTCACCTTGGGATACATCAAGAAATCT



GGAATTTCATGAAATACATAGTACTGGGAATGAACCGCCTTTGTTGATTA



TGATTGGCTACAGTGATGGAATGCAGGTCTGGAGCATCCCTATCAGTGGT



GAAGCACAAGAGCTCTTCTCTGTTCGACATGGCCCAATTCGAGCGGCTAG



AATCTTGCCTGCTCCACAGTTTGGTGCTCAAAAATGTGATAACTTTGCTGA



AAAAAGACCCCTCCTTGGTGTTTGTAAGAGCATTGGATCTTCTGGCACAA



GCCCACCGTACTGTTGTGTGGATCTGTATTCACTTCGTACTGGGGAGATG



GTCAAGTCCATTCAATTTAAGACACCTATTTATGATCTCCATTGCAATAAA



CGGATCCTTGTCGTAGTCTTGCAGGAGAAAATTGCTGCCTTTGATAGCTG



TACTTTCACGAAGAAATTCTTTGTTACAAGCTGCTATCCATGTCCAGGGCC



AAACATGAATCCTATTGCTCTTGGGAGCCGCTGGCTTGCTTATGCAGAAA



ACAAGTTGATTCGATGTCATCAGTCCCGTGGTGGAGCCTGTGGAGACAAC



ATTCAGTCTTATACTGCCACAGTCATTAGTGCTGCTAAAACATTGAAAAG



TGGCCTGACAATGGTAGGGAAAGTGGTGACTCAGCTGACAGGCACACTG



CCTTCAGGTGTGACAGAAGATGATGTTGCCATCCACAGTAATTCACGGCG



GAGTCCTTTGGTCCCAGGCATCATCACAGTTATTGACACCGAAACCGTTG



GAGAGGGCCAGGTGCTTGTGAGTGAGGATTCTGACAGTGATGGCATTGTG



GCCCACTTCCCTGCCCATGAGAAGCCAGTGTGCTGCATGGCTTTTAATAC



AAGTGGAATGCTTCTAGTCACAACAGACACCCTTGGCCATGACTTTCATG



TCTTCCAAATTCTGACTCATCCTTGGTCCTCATCACAATGTGCTGTCCACC



ATCTGTATACTCTTCACAGGGGAGAAACTGAAGCCAAAGTACAGGACAT



CTGCTTCAGCCATGACTGTCGCTGGGTTGTGGTCAGTACTCTCCGGGGTA



CTTCCCACGTTTTCCCCATCAACCCTTATGGTGGCCAGCCTTGTGTTCGTA



CACATATGTCACCACGAGTAGTGAATCGCATGAGCCGTTTCCAGAAAAGT



GCTGGACTGGAAGAGATTGAACAAGAACTGACGTCTAAGCAAGGAGGTC



GCTGTAGCCCTGTTCCAGGTCTATCAAGCAGCCCTTCTGGGTCACCCTTGC



ATGGGAAACTGAACAGCCAAGACTCCTATAACAATTTTACCAACAACAA



CCCTGGCAACCCTCGGCTCTCTCCTCTTCCCAGCTTGATGGTAGTGATGCC



TCTTGCACAAATCAAGCAGCCAATGACATTGGGGACCATCACCAAACGA



ACCGGCAAAGTTAAACCTCCTCCACAAATTTCACCCAGCAAATCGATGGG



CGGAGAATTTTGTGTGGCTGCTATCTTCGGAACATCCAGGTCATGGTTTG



CAAATAATGCAGGTCTGAAAAGAGAAAAAGATCAGTCCAAACAAGTTGT



AGTTGAGTCCCTGTACATTATCAGTTGCTATGGCACCTTAGTGGAACACA



TGATGGAGCCGCGACCCCTCAGCACTGCACCCAAGATTAGTGACGACAC



ACCACTGGAAATGATGACATCGCCTCGAGCCAGCTGGACTCTGGTTAGAA



CCCCTCAATGGAATGAATTGCAGCCACCGTTTAATGCAAACCACCCTCTG



CTCCTCGCTGCAGATGCAGTACAGTATTATCAGTTCCTGCTTGCTGGCCTG



GTTCCCCCTGGAAGTCCTGGGCCCATTACTCGACATGGGTCTTACGACAG



TTTAGCTTCTGACCATAGTGGACAGGAAGATGAAGAATGGCTTTCCCAGG



TTGAAATTGTAACACACACTGGACCCCATAGACGTCTGTGGATGGGTCCA



CAGTTCCAGTTCAAAACCATCCATCCCTCAGGCCAAACCACAGTTATCTC



ATCCAGTTCATCTGTGTTGCAGTCTCATGGTCCGAGTGACACGCCACAGC



CTCTTTTGGATTTTGATACAGATGATCTTGATCTCAACAGTCTCAGGATCC



AGCCAGTCCGCTCTGACCCCGTCAGCATGCCAGGGTCATCCCGTCCAGTC



TCTGATCGAAGGGGAGTTTCCACAGTGATTGATGCTGCCTCAGGTACCTT



TGACAGGAGCGTGACCCTGCTGGAGGTGTGCGGGAGCTGGCCTGAGGGC



TTCGGGCTGCGGCACATGTCCTCCATGGAGCACACGGAGGAGGGCCTCCG



GGAGCGACTTGCCGACGCCATGGCCGAGTCACCTAGCCGGGACGTCGTG



GGATCCGGAACAGAACTTCAGCGAGAGGGAAGCATCGAGACTCTGAGTA



ACAGCTCAGGCTCCACCAGCGGCAGCATACCAAGAAACTTTGATGGCTAC



CGATCTCCGCTGCCCACCAATGAGAGCCAGCCCCTCAGCCTCTTCCCGAC



TGGCTTCCCGTAG





PPIL2->
ATGGGGAAGCGACAGCACCAAAAGGACAAAATATTCTCCAGCTATGTAT
SEQ ID


NF2
CGGGAACCATGATCTATTTATGAGGAGAAGGAAAGCCGATTCTTTGGAA
NO: 85



GTTCAGCAGATGAAAGCCCAGGCCAGGGAGGAGAAGGCTAGAAAGCAG



ATGGAGCGGCAGCGCCTCGCTCGAGAGAAGCAGATGAGGGAGGAGGCTG



AACGCACGAGGGATGAGTTGGAGAGGAGGCTGCTGCAGATGAAAGAAG



AAGCAACAATGGCCAACGAAGCACTGATGCGGTCTGAGGAGACAGCTGA



CCTGTTGGCTGAAAAGGCCCAGATCACCGAGGAGGAGGCAAAACTTCTG



GCCCAGAAGGCCGCAGAGGCTGAGCAGGAAATGCAGCGCATCAAGGCCA



CAGCGATTCGCACGGAGGAGGAGAAGCGCCTGATGGAGCAGAAGGTGCT



GGAAGCCGAGGTGCTGGCACTGAAGATGGCTGAGGAGTCAGAGAGGAGG



GCCAAAGAGGCAGATCAGCTGAAGCAGGACCTGCAGGAAGCACGCGAG



GCGGAGCGAAGAGCCAAGCAGAAGCTCCTGGAGATTGCCACCAAGCCCA



CGTACCCGCCCATGAACCCAATTCCAGCACCGTTGCCTCCTGACATACCA



AGCTTCAACCTCATTGGTGACAGCCTGTCTTTCGACTTCAAAGATACTGA



CATGAAGCGGCTTTCCATGGAGATAGAGAAAGAAAAAGTGGAATACATG



GAAAAGAGCAAGCATCTGCAGGAGCAGCTCAATGAACTCAAGACAGAAA



TCGAGGCCTTGAAACTGAAAGAGAGGGAGACAGCTCTGGATATTCTGCA



CAATGAGAACTCCGACAGGGGTGGCAGCAGCAAGCACAATACCATTAAA



AAGCTCACCTTGCAGAGCGCCAAGTCCCGAGTGGCCTTCTTTGAAGAGCT



CTAG





LRP8->
ATGGGCCTCCCCGAGCCGGGCCCTCTCCGGCTTCTGGCGCTGCTGCTGCT
SEQ ID


TMEM48
GCTGCTGCTGCTGCTGCTGCTGCAGCTCCAGCATCTTGCGGCGGCAGCGG
NO: 86



CTGATCCGCTGCTCGGCGGCCAAGGGCCGGCCAAGGATTGCGAAAAGGA



CCAATTCCAGTGCCGGAACGAGCGCTGCATCCCCTCTGTGTGGAGATGCG



ACGAGGACGATGACTGCTTAGACCACAGCGACGAGGACGACTGCCCCAA



GAAGACCTGTGCAGACAGTGACTTCACCTGTGACAACGGCCACTGCATCC



ACGAACGGTGGAAGTGTGACGGCGAGGAGGAGTGTCCTGATGGCTCCGA



TGAGTCCGAGGCCACTTGCACCAAGCAGGTGTGTCCTGCAGAGAAGCTG



AGCTGTGGACCCACCAGCCACAAGTGTGTACCTGCCTCGTGGCGCTGCGA



CGGGGAGAAGGACTGCGAGGGTGGAGCGGATGAGGCCGGCTGTGCTACC



TGCTATATTCCCAAAGCTTGGATTAGCACTGCTATGAACCTTCACATAGA



TGAGCAGGTTCATAGGCCACTTGACACAGTGAGTGGCCTCTTAAATCTCT



CGTTACTCTACCATGTCTGGCTGTGTGGTGTCTTTCTCCTGACGACTTGGT



ATGTCTCATGGATACTCTTCAAAATCTATGCCACAGAGGCTCATGTGTTTC



CTGTTCAACCACCATTTGCAGAAGGGTCAGATGAGTGCCTTCCAAAAGTG



TTAAATAGCAATCCTCCCCCCATCATAAAGTATTTAGCCTTGCAGGACCT



GATGTTGCTTTCTCAATATTCTCCTTCACGAAGACAAGAAGTTTTCAGCCT



CAGCCAACCAGGTGGACATCCCCACAATTGGACAGCCATTTCAAGGGAG



TGTTTGAATCTTTTAAATGGTATGACTCAGAAACTGATTCTCTATCAAGAA



GCTGCTGCTACGAATGGGAGAGTGTCTTCATCTTACCCAGTGGAACCTAA



GAAATTAAATTCTCCAGAAGAAACTGCTTTTCAGACACCAAAATCTAGCC



AGATGCCTCGGCCTTCAGTGCCACCATTAGTTAAAACATCACTGTTTTCTT



CAAAATTATCTACACCTGATGTTGTGAGCCCATTTGGGACCCCATTTGGCT



CTAGTGTAATGAATCGGATGGCTGGAATTTTTGATGTAAACACCTGCTAT



GGGTCACCGCAAAGTCCTCAGCTAATAAGAAGGGGGCCAAGATTGTGGA



CATCAGCTTCTGATCAGCAAATGACTGAATTTTCTAATCCTTCTCCATCTA



CCTCTATTAGTGCTGAGGGTAAGACAATGAGACAACCCAGTGTGATTTAT



TCATGGATTCAGAATAAACGTGAACAGATTAAGAATTTCTTGTCAAAACG



GGTGCTGATAATGTATTTTTTCAGTAAGCACCCAGAGGCCTCCATTCAGG



CTGTTTTTTCAGATGCCCAAATGCATATTTGGGCATTAGAAGGTCTGTCGC



ACTTAGTAGCAGCATCATTTACAGAGGATAGATTTGGAGTTGTCCAGACG



ACACTACCAGCTATCCTTAATACTTTGTTGACACTGCAAGAGGCAGTCGA



CAAGTACTTTAAGCTTCCTCATGCTTCCAGTAAACCACCCCGGATTTCAG



GAAGCCTTGTGGACACTTCATATAAAACATTAAGATTTGCATTCAGAGCA



TCACTGAAAACTGCCATCTATCGAATAACTACTACATTTGGTGAACATCT



GAATGCTGTGCAAGCATCTGCAGAACATCAGAAAAGACTTCAACAGTTCT



TGGAGTTCAAAGAATAG





LUC7L3->
ATGATTTCGGCCGCGCAGTTGTTGGATGAGTTAATGGGCCGGGACCGAAA
SEQ ID


HNF1B
CCTAGCCCCGGACGAGAAGCGCAGCAACGTGCGGTGGGACCACGAGAGC
NO: 87



TGTCCTCTACAAGCCTGGTGA





RAD21->
ATGTTCTACGCACATTTTGTTCTCAGTAAAAGAGGGCCTCTGGCCAAAAT
SEQ ID


FER1L6
TTGGCTAGCGGCCCATTGGGATAAGAAGCTAACCAAAGCCCATGTGTTCG
NO: 88



AGTGTAATTTAGAGAGCAGCGTGGAGAGTATCATCTCACCAAAGATGTTT



GGGCTGAAGGTGAAGAAGAAGAGAAATAAGGCAGAGAAGGGGTTAATC



CTAGCCAACAAGGCTGCGAAAGATAGTCAAGGTGACACTGAAGCACTGC



AGGAGGAGCCTTCTCACCAGGAAGGACCGAGAGGAGATTTGGTCCATGA



TGATGCTTCTATCTTTCCTGTCCCCTCAGCTTCTCCAAAGAGAAGATCAAA



ACTGTTGACTAAGATCCATGATGGGGAGGTCAGATCCCAAAATTATCAAA



TTGCCATAACCATCACCGAGGCTCGCCAGCTGGTGGGTGAGAACATTGAC



CCAGTTGTGACCATTGAGATTGGGGATGAGAAGAAGCAAAGCACAGTGA



AGGAAGGAACCAACAGCCCATTTTATAATGAATACTTTGTCTTCGACTTC



ATTGGGCCCCAAGTGCATCTTTTTGACAAGATCATCAAAATCTCCGTCTTT



CACCACAAGCTGATAGGAAGTGTACTGATTGGCTCTTTCAAAGTAGACCT



GGGGACCGTGTACAACCAACCTGGTCATCAGTTCTGCAACAAGTGGGCCC



TGCTCACAGACCCTGGTGACATCAGGACTGGCACCAAGGGGTACCTGAA



ATGTGACATCAGTGTCATGGGAAAAGGTGATGTCTTGAAGACCAGCCCTA



AAACTTCTGACACCGAGGAGCCAATAGAAAAGAACCTTTTGATCCCCAAT



GGGTTTCCACTGGAGAGACCGTGGGCCAGATTCTATGTGAGACTCTACAA



AGCAGAAGGGTTGCCCAAAATGAATTCAAGCATCATGGCGAACGTCACC



AAGGCATTTGTGGGTGACAGTAAGGACCTGGTGGATCCCTTTGTGGAGGT



CTCCTTTGCTGGGCAGATGGGGCGAACCACAGTGCAGAAGAACTGTGCTG



ATCCTGTGTGGCATGAACAGGTGATCTTCAAGGAAATGTTCCCTCCCTTG



TGTCGGAGGGTGAAAATCCAGGTGTGGGATGAAGGCAGCATGAATGACG



TAGCCCTGGCAACCCATTTCATTGACCTGAAGAAAATCTCCAACGAACAG



GATGGAGACAAAGGCTTTCTGCCCACCTTTGGGCCTGCCTGGATTAACCT



GTATGGCTCGCCCAGGAACCACAGTCTGATGGATGACTACCAGGAAATG



AACGAAGGCTTTGGGGAAGGTGTGTCATTCAGGGGCAGAATCTTGGTAG



AAATTGCTGTGGAAATCCTCTCAGGACGGGCACAGGAATCTAAATTTTCC



AAGGCCCTGAAGGAGCTCAAGTTGCCTTCCAAGGACAAAGACTCCAAAT



CTTCCAAAGGTAAAGACAAGGCTGACAAAACTGAAGATGGAAAATCCCA



ACAGGCTTCAAACAAAACTAACTCAACCGAGGTGGAGGTGGAATCGTTC



GATGTCCCCCCGGAGATTGTACCAGAAAAAAATGAGGAATTTTTACTCTT



TGGAGCATTTTTTGAAGCTACCATGATTGACCGGAAGATTGGAGATAAAC



CCATCAGCTTTGAAGTTTCTATTGGTAATTTTGGAAACCTGATTGATGGAG



GATCCCATCATGGGAGTAAGAAGTCAGCTGAATCAGCTGAAGAAGACCT



CCTTCCACTGCTTCACGAAGGGCAAGGGGATGTGGCCCATGATGTTCCCA



TTCCTATGGCCTCCACCACTCACCCGGAGAAGCCACTGGTGACAGAAGGG



AACAGGAATTACAACTATTTGCCATTTGAGGCTAAGAAGCCCTGTGTCTA



TTTCATCAGCTCTTGGGGAGACCAGACCTTCAGGCTGCACTGGTCCAACA



TGCTGGAGAAAATGGCAGACTTCCTGGAAGAAAGTATAGAAGAAGTGAG



AGAATTGATCAAGATTTCACAGGAGGCACCTGAAGAGAAAATGAAAACA



GTGCTCAGTGACTTCATCAGTCGGAGCAGTGCCTTTATCTCTGAAGCAGA



AAAAAAGCCCAAGATGTTGAACCAAACCACTTTAGATAAGAAGCGACTT



ACGCTCTGCTGGCAGGAGCTGGAAGCAATGTGCAAGGAGGCCAAGGGGA



TCATTCAGCAGCAGAAGAAAAAGTTATCTGTTGATGAAATGATTCACGAA



GCCCAAAACTTTGTGGAAAAAATCCGCTTTCTTGTTGATGAGCCCCAGCA



CACTATCCCTGACGTTTTCATCTGGATGCTCAGCAACAACAGGAGAGTGG



CCTATGCCCGCATCGCCTCCAAAGACCTCCTCTATTCCCCTGTCGCGGGGC



AGATGGGCAAACACTGCGGCAAGATCAAAACTCACTTCCTCAAACCTCCT



GGGAAACGACCGGCTGGTTGGTCTGTGCAAGCAAAAGTCGACGTGTACC



TGTGGCTGGGCTCCATCAAGCATGCCAGTGCCATTTTGGACAACTTGCCA



GTAGGCTATGAAGCAGAAATGTCCTCCAAAGGGGCTGGCACCAATCACC



CCCCATCTAACCTGCTCTACCAAGAACAGCATGTTTTTCAGCTGAGGGCT



CACATGTACCAAGCCCGGGGCCTCATCGCAGCTGACAGCAATGGACTTTC



AGACCCTTTTGCCAAAGTCACGTTCCTTTCTCACTGCCAGACAACAAAGA



TAATCTCCCAGACCCTCTCTCCGACCTGGAACCAGATGCTGCTGTTCAAT



GATTTGGTGCTGCATGGAGATGTGAAGGAGCTGGCAGAGTCCCCGCCCTT



AGTGGTGGTGGAGCTGTATGACAGCGACGCTGTGGGGAAGCCAGAATAT



TTGGGTGCCACAGTGGCTGCTCCTGTTGTGAAGCTGGCTGACCAGGACTA



TGAGCCCCCCAGGTTATGCTATCACCCCATCTTTTGTGGGAATCTCTCTGG



AGGGGATCTCCTTGCTGTATTTGAACTGCTGCAGGTTCCTCCTTCTGGGCT



GCAAGGCCTCCCACCCGTTGAGCCACCAGACATCACCCAGATCTACCCGG



TTCCTGCCAACATTCGGCCGGTGCTGAGCAAATACCGAGTGGAGGTTCTC



TTCTGGGGAGTTCGGGAAATGAAGAAGGTGCAGCTCCTCTCTGTGGATCG



GCCTCAGGCTCTCATTGAGTGCGGAGGACAAGGTGTGAAGTCCTGCGTGA



TCCAGAGCTACAAGAACAACCCGAACTTCAGCATCCAGGCAGACGCTTTC



GAAGTGGAACTGCCTGAGAACGAGCTTCTGCACCCGCCACTGAGCATCTG



CGTGGTGGACTGGAGAGCTTTTGGGAGGAGTACCCTTGTGGGCACCTACA



CCATCAACTACTTGAAGCAGTTTTTGTGTAAACTCAGAGAGCCCCTTGCC



CCCATCACACAGGTGGATGGAACCCAGCCTGGGCACGATATTTCAGATTC



GCTAACAGCCACTGAGTCCTCTGGAGCCCACAGCTCCTCCCAGGATCCCC



CAGCAGATCACATTTATGTGGATGTTGAGCCACCTCCCACAGTGGTGCCC



GACTCTGCCCAGGCCCAGCCGGCCATCCTGGTTGACGTCCCTGACTCATC



CCCGATGCTGGAGCCTGAACACACACCTGTAGCCCAGGAGCCACCAAAA



GATGGAAAACCTAAGGATCCCAGGAAGCCTTCCCGGAGGTCCACTAAGA



GGAGAAAGAGGACCATAGCAGATGAATCTGCTGAAAACGTGATTGACTG



GTGGTCTAAGTATTATGCCTCCCTGAAGAAAGCCCAGAAGGCAAAGGAG



AGAAATCCCAAGGGAAAAAAAGGCAATACAGAGGCAAAGCCAGATGAG



GTAGTGGTAGATATAGAAGATGGGCCAAAGAAGAAGAAAGACAAAATG



CTCAAGAAGAAACCCAAAGATGATGGAATCCCCAACCTGGCCATCTTGC



AGATATATGACGGTGATCTCGAGAGTGAATTCAACAATTTTGAAGACTGG



GTGAAAACTTTTGAGCTCTTCAGAGGCAAGTCTACGGAAGATGACCATGG



TCTTGATGGAGACCGAGTCATAGGAAAATTTAAGGGCTCCTTCTGCATCT



ACAAAAGCCCCCAGGATTCTAGCTCTGAGGACAGCGGGCAGCTGAGAAT



CCAGCAAGGGATTCCGCCCAATCACCCTGTCACAGTGCTGATCAGAGTAT



ACATTGTCGCGGCATTTAATCTTAGTCCAGCTGATCCAGATGGCAAATCA



GATCCCTACATTGTGATCAAGCTTGGCAAGACAGAAATCAAAGACCGGG



ATAAATACATCCCTAAACAACTGAACCCAGTATTTGGAAGGTCATTTGAG



ATCCAAGCCACATTCCCAAAAGAGTCCCTGCTCTCCATCCTGATCTATGA



CCATGACATGATTGGCACAGATGACCTTATTGGTGAGACCAAGATCGACC



TGGAGAACCGCTTCTACAGCAAACACCGAGCCATCTGTGGCTTGCAGAGC



CAGTATGAGATAGAAGGATACAATGCCTGGAGAGACACGTCCAAACCCA



CCGAAATCCTCACTAAGCTCTGCAAAGACAACAAGCTGGATGGACCCTAC



TTTCACCCTGGGAAAATACAGATAGGAAACCAAGTCTTTTCTGGAAAAAC



TATCTTCACTGAAGAGGACACTGATGAGACAGTGGAGTCTTATGAACACC



TGGCCCTCAAGGTTTTACACTCTTGGGAGGATATCCCGGAAGTCGGGTGT



AGGCTGGTTCCTGAACACATAGAAACTCGGCCACTGTACCACAAGGATA



AGCCAGGAATGGAGCAGGGCCGCCTGCAGATGTGGGTGGACATGTTTCC



CAAGGATATGCCTCAACCTGGACCTCCTGTTGACATCTCTCCAAGGCGAC



CCAAAGGATACGAATTGAGAGTGACCATCTGGAACACTGAAGATGTCAT



TTTAGAGGATGAGAATATCTTCACAGGCCAAAAATCAAGTGATATTTATG



TGAAAGGGTGGTTAAAGGGCTTGGAGGATGACAAGCAGGAGACAGATGT



GCATTACAACTCCCTGACTGGAGAGGGCAACTTCAACTGGCGCTTCCTGT



TTCCCTTTCAGTATCTCCCAGCTGAGAAGCAAATGGTCATTACCAAGAGG



GAGAACATCTTCTCTTTAGAGAAGATGGAGTGTAAGACTCCTGCTGTGTT



GGTGCTGCAGGTTTGGGATTTTGAAAGGCTGTCCTCAGATGACTTCCTGG



GCACCCTGGAAATGAACCTCAACAGTTTCCCTCGAGCAGCTAAGTCTGCC



AAAGCCTGTGATCTTGCCAAGTTTGAAAATGCAAGTGAGGAGACCAAGA



TCTCTATATTCCAGCAAAAACGTGTGCGTGGCTGGTGGCCTTTTTCTAAAA



GCAAAGAACTCACAGGCAAGGTTGAAGCTGAGTTCCACCTAGTTACAGC



AGAAGAAGCTGAGAAAAATCCTGTTGGAAAAGCCCGAAAGGAGCCAGA



GCCCCTGGCCAAGCCCAACCGCCCAGACACCTCCTTTTCGTGGTTCATGA



GCCCCTTTAAGTGCCTGTACTACCTCATCTGGAAGAATTACAAAAAGTAC



ATCATCATTGCTTTCATTCTCATCATCCTCATCATCTTCCTCGTCCTTTTCA



TCTACACCTTGCCAGGAGCCATCAGCCGAAGGATCGTTGTGGGCTCATAG





AP2B1->
ATGACTGACTCCAAGTATTTCACAACCAATAAAAAAGGAGAAATATTTGA
SEQ ID


FLJ42280
ACTAAAAGCTGAACTCAACAATGAAAAGAAAGAAAAGAGAAAGGAGGC
NO: 89



TGTGAAGAAAGTGATTGCTGCTATGACCGTGGGGAAGGATGTTAGTTCTC



TCTTTCCAGACGTAGTGAACTGTATGCAGACTGACAATCTGGAACTAAAG



AAGCTTGTGTATCTCTACTTGATGAACTACGCCAAGAGTCAGCCAGACAT



GGCCATCATGGCTGTAAACAGCTTTGTGAAGGACTGTGAAGATCCTAATC



CTTTGATTCGAGCCTTGGCAGTCAGAACCATGGGGTGCATCCGGGTAGAC



AAAATTACAGAATATCTCTGTGAGCCGCTCCGCAAGTGCTTGAAGGATGA



GGATCCCTATGTTCGGAAAACAGCAGCAGTCTGCGTGGCAAAACTCCATG



ATATCAATGCCCAAATGGTGGAAGATCAGGGATTTCTGGATTCTCTACGG



GATCTCATAGCAGATTCAAATCCAATGGTGGTGGCTAATGCCGTAGCGGC



ATTATCTGAAATCAGTGAGTCTCACCCAAACAGCAACTTACTTGATCTGA



ACCCACAGAACATTAATAAGCTGCTGACAGCCCTGAATGAATGCACTGA



ATGGGGCCAGATTTTCATCCTGGACTGCCTGTCTAATTACAACCCTAAAG



ATGATCGGGAGGCTCAGAGCATCTGTGAGCGGGTAACTCCCCGGCTATCC



CATGCCAACTCAGCAGTGGTGCTTTCAGCGGTAAAAGTCCTAATGAAGTT



TCTAGAATTGTTACCTAAGGATTCTGACTACTACAATATGCTGCTGAAGA



AGTTAGCCCCTCCACTTGTCACTTTGCTGTCTGGGGAGCCAGAAGTGCAG



TATGTCGCCCTGAGGAACATCAACTTAATTGTCCAGAAAAGGCCTGAAAT



CTTGAAGCAGGAAATCAAAGTCTTCTTTGTGAAGTACAATGATCCCATCT



ATGTTAAACTAGAGAAGTTGGACATCATGATTCGTTTGGCATCTCAAGCC



AACATTGCTCAGGTTCTGGCAGAACTGAAAGAATATGCTACAGAGGTGG



ATGTTGACTTTGTTCGAAAAGCTGTGCGGGCCATTGGACGGTGTGCCATC



AAGGTGGAGCAATCTGCAGAGCGCTGTGTAAGCACATTGCTTGATCTAAT



CCAGACCAAAGTGAATTATGTGGTCCAAGAAGCAATTGTTGTCATCAGGG



ACATCTTCCGCAAATACCCCAACAAGTATGAAAGTATCATCGCCACTCTG



TGTGAGAACTTAGACTCGCTGGATGAGCCAGATGCTCGAGCAGCTATGAT



TTGGATTGTGGGAGAATATGCTGAAAGAATTGACAATGCAGATGAGTTAC



TAGAAAGCTTCCTGGAGGGTTTTCACGATGAAAGCACCCAGGTGCAGCTC



ACTCTGCTTACTGCCATAGTGAAGCTGTTTCTCAAGAAACCATCAGAAAC



ACAGGAGCTAGTCCAGCAGGTCTTGAGTTTGGCAACACAGGACTCCAAC



ATTTGTGCTGTGTTTGCTGTACAAGGAGGAAAAGTGGGAAGAAAGCATG



GCATAAAAAGGGGGAGGAGACCCAGCATAAGAAGCCCAGCTCAGCGGG



CCAGAGGACCCTGGATCCATGAGAGTAAGCATCCGGCCTTTGCAAAGCA



ACAGATAAACTTGGAGATGCCCAACTCCAGAGCGACAACAGAGTTAGCC



TGGGTCTGCAGCTCCACCTCAAGAAAAAAGAAGTGGGCAAGGTCCCTGA



CTCTTTCCACTGCTCCACTGAGCCCCCCACCATCCTTGGTGCACTGTGAAG



ATTGTTCTTGCCTGCCTGGCTGCCATTCGGGTGACCTCTACAATCTGGCCC



CAGCAGAAAGAACTTGCTAG





RUFY3->
ATGTCTGCTCTGACGCCTCCGACCGATATGCCAACCCCCACCACTGACAA
SEQ ID


MUC7
GATCACACAGGCTGCCATGGAGACCATCTACCTTTGCAAATTCCGAGTGT
NO: 90



CCATGGATGGAGAATGGCTCTGCCTGCGAGAGCTGGATGACATCTCACTT



ACACCTGACCCAGAGCCTACCCATGAAGATCCTAATTATCTCATGGCTAA



TGAACGCATGAACCTCATGAACATGGCCAAGCTGAGTATCAAGGGCTTG



ATTGAATCAGCTCTGAACCTGGGGAGGACTCTTGACTCTGACTATGCACC



TCTCCAGCAATTCTTTGTGGTGATGGAGCACTGTCTGAAACATGGCTTGA



AAGCTAAAAAAACTTTTCTCGGACAAAATAAATCCTTCTGGGGGCCTCTA



GAACTGGTAGAAAAGCTTGTTCCAGAAGCCGCAGAGATAACAGCAAGTG



TTAAAGATCTTCCAGGACTTAAGACACCAGTAGGTAGAGGAAGAGCCTG



GCTTCGTTTGGCATTAATGCAAAAGAAACTTTCAGAATATATGAAAGCTT



TGATCAATAAGAAAGAACTTCTCAGTGAATTCTACGAACCCAATGCCCTC



ATGATGGAAGAAGAAGGAGCCATAATTGCTGGTCTGTTGGTGGGTCTGA



ATGTCATTGATGCCAATTTCTGTATGAAAGGAGAAGACTTGGACTCTCAG



GTTGGAGTTATAGATTTTTCAATGTATCTCAAGGACGGGAACAGCAGTAA



AGGTACTGAAGGAGACGGTCAGATTACTGCAATTCTGGACCAGAAGAAC



TATGTAGAAGAACTGAACAGACATTTGAATGCTACTGTAAACAACCTTCA



GGCAAAAGTAGATGCATTAGAAAAATCCAACACTAAACTGACAGAGGAG



CTTGCAGTTGCAAACAACAGGATCATTACCTTACAAGAAGAAATGGAAC



GAGTTAAAGAGGAAAGTTCCTACATACTGGAATCCAATCGGAAGGGTCC



CAAGCAAGACAGAACTGCAGAAGGGCAAGCACTAAGTGAAGCAAGAAA



GCATTTAAAAGAAGAGACACAATTACGATTGGATGTTGAGAAAGAACTG



GAGATGCAGATCAGCATGAGGCAGGAGATGGAATTGGCTATGAAGATGC



TGGAGAAGGATGTCTGTGAGAAGCAGGATGCCCTGGTATCTCTTCGGCAG



CAGCTGGATGACCTCAGAGCTCTCAAGCATGAACTTGCCTTTAAGCTGCA



GAGTTCAGACTTAGGAGTAAAACAGAAAAGTGAACTAAACAGTCGCTTG



GAAGAGAAGACTAATCAGATGGCTGCTACCATTAAACAACTTGAACAAA



GATTGCGCCAGGCTGAGCGAAGCCGCCAATCTGCTGAGTTGGACAACCG



GCTCTTCAAACAGGACTTTGGAGACAAGATCAACAGTCTGCAGCTGGAA



GTCGAGGAGCTCACCAGGCAGCGGAACCAGCTTGAGTTAGAACTAAAAC



AGGAAAAAGAAAGAAGATTACAAAACGACAGGAGCATCCCAGGAAGGG



GTTCCCAGAAGTCAGAATCCAAGATGGATGGGAAGCACAAAATGCAAGA



GGAAAATGTTAAACTAAAAAAGCCCCTGGAAGAAAGCCACAGGCTGCAA



CCCCACCCTATGGATGAACAGGATCAGCTGCTGCTCTCTGAAAAGCCACA



GTTGTGTCAGCTATGCCAGGAAGACGGCAGCCTAACAAAGATGAAAACT



CTGCCGCTGTTTGTGTGCATCTGTGCACTGAGTGCTTGCTTCTCGTTCAGT



GAAGGTCGAGAAAGGGATCATGAACTACGTCACAGAAGGCATCATCACC



AATCACCCAAATCTCACTTTGAATTACCACATTATCCTGGACTGCTAGCTC



ACCAGAAGCCGTTCATTAGAAAGTCCTATAAATGTCTGCACAAACGCTGT



AGGCCTAAGCTTCCACCTTCACCTAATAACCCCCCCAAATTCCCAAATCC



TCACCAGCCACCTAAACATCCAGATAAAAATAGCAGTGTGGTCAACCCTA



CCTTAGTGGCTACAACCCAAATTCCATCTGTGACTTTCCCATCAGCTTCCA



CCAAAATTACTACCCTTCCAAATGTGACTTTTCTTCCCCAGAATGCCACCA



CCATATCTTCAAGAGAAAATGTTAACACAAGCTCTTCTGTAGCTACATTA



GCACCAGTGAATTCCCCAGCTCCACAAGACACCACAGCTGCCCCACCCAC



ACCTTCTGCAACTACACCAGCTCCACCATCTTCCTCAGCTCCACCAGAGA



CCACAGCTGCCCCACCCACACCTTCTGCAACTACACAAGCTCCACCATCT



TCCTCAGCTCCACCAGAGACCACAGCTGCCCCACCCACACCTCCTGCAAC



TACACCAGCTCCACCATCTTCCTCAGCTCCACCAGAGACCACAGCTGCCC



CACCCACACCTTCTGCAACTACACCAGCTCCACTATCTTCCTCAGCTCCAC



CAGAGACCACAGCTGTCCCACCCACACCTTCTGCAACTACCCTAGACCCA



TCATCCGCCTCAGCTCCACCAGAGACCACAGCTGCCCCACCCACACCTTC



TGCAACTACACCAGCTCCACCGTCTTCCCCAGCTCCACAAGAGACCACAG



CTGCCCCAATTACCACACCTAATTCTTCCCCAACTACTCTTGCACCTGACA



CTTCTGAAACTTCAGCTGCACCCACACACCAGACTACTACTTCGGTCACT



ACTCAAACTACTACTACTAAACAACCAACTTCAGCTCCTGGCCAAAATAA



AATTTCTCGATTTCTTTTATATATGAAGAATCTACTAAACAGAATTATTGA



CGACATGGTGGAGCAATAG





LLGL2->
GGAGGTGAGCAGGAAGGAGACGGCCGCCCAGCAGCCCGTGGGCAGGCG
SEQ ID


CPNE4
CGGCGGAGCGAGCGGGGCCGGCGGCGGGCGCCGAGGGACGCCGAGGCC
NO: 91



TCGGGCGGGGGCTGGCCCGGGGTTCCAGGTTGACAGGACTGAGGTGATT



CGCACCTGCATAAACCCAGTGTACTCAAAACTGTTTACTGTGGACTTTTA



CTTTGAGGAGGTGCAGCGCCTGCGGTTTGAAGTCCATGACATCAGCAGCA



ACCACAATGGGCTGAAGGAGGCCGACTTCCTTGGTGGCATGGAGTGCAC



ACTTGGCCAGATTGTTTCCCAGAGAAAGCTGTCCAAATCCTTGCTGAAGC



ATGGGAACACAGCAGGGAAATCTTCCATCACGGTGATTGCTGAAGAATT



ATCTGGCAATGACGACTATGTTGAGCTTGCATTCAATGCACGGAAATTGG



ATGACAAGGATTTCTTCAGTAAATCTGACCCATTTCTGGAAATTTTTCGTA



TGAATGATGATGCAACTCAGCAGCTGGTGCACCGAACTGAGGTTGTGATG



AATAACTTAAGCCCAGCCTGGAAATCATTCAAAGTATCTGTAAATTCTCT



ATGCAGCGGAGACCCAGACCGCCGGCTAAAGTGCATAGTATGGGACTGG



GACTCCAATGGCAAGCATGACTTCATTGGAGAATTCACCTCGACATTCAA



GGAGATGAGAGGAGCAATGGAAGGGAAACAGGTGCAGTGGGAGTGCAT



CAATCCCAAGTACAAAGCCAAGAAGAAGAATTACAAGAACTCAGGCACT



GTGATTCTGAATCTGTGCAAGATTCACAAGATGCATTCTTTCTTGGACTAC



ATCATGGGTGGCTGCCAAATCCAGTTTACAGTAGCTATAGATTTCACTGC



CTCAAACGGGGACCCCAGGAACAGCTGTTCCTTGCACTACATCCACCCTT



ACCAACCCAATGAGTATCTGAAAGCTTTGGTAGCTGTGGGGGAGATTTGC



CAAGACTATGACAGTGACAAAATGTTCCCTGCCTTTGGGTTTGGCGCCAG



GATACCTCCAGAGTACACGGTCTCTCATGACTTTGCAATCAACTTTAATG



AAGACAACCCAGAATGTGCAGGAATTCAAGGAGTTGTGGAAGCCTATCA



GAGCTGTCTTCCTAAGCTCCAACTCTACGGTCCCACCAACATTGCCCCCAT



CATCCAGAAGGTTGCCAAGTCAGCGTCAGAGGAAACTAACACCAAGGAG



GCATCGCAATACTTCATCCTGCTGATCCTGACAGATGGTGTTATCACAGA



CATGGCCGACACCCGGGAGGCCATTGTCCATGCCTCCCACCTCCCCATGT



CAGTCATCATCGTGGGAGTAGGGAACGCTGACTTCAGTGACATGCAGATG



CTGGACGGTGATGATGGGATTCTGAGGTCACCCAAGGGAGAGCCTGTTCT



TCGAGACATCGTCCAGTTCGTGCCCTTCAGGAACTTCAAACACGCATCTC



CAGCTGCCCTGGCAAAGAGCGTGCTGGCTGAAGTCCCAAACCAAGTTGTG



GACTATTACAATGGCAAAGGAATTAAACCAAAATGTTCATCAGAAATGT



ATGAATCTTCCAGAACACTAGCACCATGAACTCCCCACACAGTTTTACAG



AGTTCTGAAATACTATTCCTGCTAATATTTCATATTTAATACTTCTACTAC



TCCTGTACTTTAAAAAACCAACAACATATACACATTTAAAAATAGCACGT



TTTGGTGATTTTTAACTATCTGACAATTTTTTTTGCATGTGTAGCCCTGAG



GCCTGGATCTGTTAAGCCCTTGTATTGTTAACTTTTTACAAAGAAACACA



GATAACAATAACTTACTATTTACATTACAGCATGTCGCCTTGAAATAAAA



TGGTATCTGTATCCATTTTTTATACAGGTTTGTTGAAATTTTGCTAAATTTC



TTATCTTTACACTCTAAAGCATTTTGAAACATTTACTGAATGTTGATAGAC



GAAATATACTTGGTTTTATCTGCTATAGGATGAGAGACTTTTTAAAATGG



CAGATGCATGGACTGTATTTTGCATGTTTAAAATAA





SEMA4C
ATGGCCCCACACTGGGCTGTCTGGCTGCTGGCAGCAAGGCTGTGGGGCCT
SEQ ID


->
GGGCATTGGGGCTGAGGTGTGGTGGAACCTTGTGCCGCGTAAGACAGTGT
NO: 92


BRE
CTTCTGGGGAGCTGGCCACGGTAGTACGGCGGTTCTCCCAGACCGGCATC



CAGGACTTCCTGACACTGACGCTGACGGAGCCCACTGGGCTTCTGTACGT



GGGCGCCCGAGAGGCCCTGTTTGCCTTCAGCATGGAGGCCCTGGAGCTGC



AAGGAGCGATCTCCTGGGAGGCCCCCGTGGAGAAGAAGACTGAGTGTAT



CCAGAAAGGGAAGAACAACCAGACCGAGTGCTTCAACTTCATCCGCTTCC



TGCAGCCCTACAATGCCTCCCACCTGTACGTCTGTGGCACCTACGCCTTCC



AGCCCAAGTGCACCTACGTCAACATGCTCACCTTCACTTTGGAGCATGGA



GAGTTTGAAGATGGGAAGGGCAAGTGTCCCTATGACCCAGCTAAGGGCC



ATGCTGGCCTTCTTGTGGATGGTGAGCTGTACTCGGCCACACTCAACAAC



TTCCTGGGCACGGAACCCATTATCCTGCGTAACATGGGGCCCCACCACTC



CATGAAGACAGAGTACCTGGCCTTTTGGCTCAACGAACCTCACTTTGTAG



GCTCTGCCTATGTACCTGAGAGTGTGGGCAGCTTCACGGGGGACGACGAC



AAGGTCTACTTCTTCTTCAGGGAGCGGGCAGTGGAGTCCGACTGCTATGC



CGAGCAGGTGGTGGCTCGTGTGGCCCGTGTCTGCAAGGGCGATATGGGG



GGCGCACGGACCCTGCAGAGGAAGTGGACCACGTTCCTGAAGGCGCGGC



TGGCATGCTCTGCCCCGAACTGGCAGCTCTACTTCAACCAGCTGCAGGCG



ATGCACACCCTGCAGGACACCTCCTGGCACAACACCACCTTCTTTGGGGT



TTTTCAAGCACAGTGGGGTGACATGTACCTGTCGGCCATCTGTGAGTACC



AGTTGGAAGAGATCCAGCGGGTGTTTGAGGGCCCCTATAAGGAGTACCA



TGAGGAAGCCCAGAAGTGGGACCGCTACACTGACCCTGTACCCAGCCCTC



GGCCTGGCTCGTGCATTAACAACTGGCATCGGCGCCACGGCTACACCAGC



TCCCTGGAGCTACCCGACAACATCCTCAACTTCGTCAAGAAGCACCCGCT



GATGGAGGAGCAGGTGGGGCCTCGGTGGAGCCGCCCCCTGCTCGTGAAG



AAGGGCACCAACTTCACCCACCTGGTGGCCGACCGGGTTACAGGACTTGA



TGGAGCCACCTATACAGTGCTGTTCATTGGCACAGGAGACGGCTGGCTGC



TCAAGGCTGTGAGCCTGGGGCCCTGGGTTCACCTGATTGAGGAGCTGCAG



CTGTTTGACCAGGAGCCCATGAGAAGCCTGGTGCTATCTCAGAGCAAGAA



GCTGCTCTTTGCCGGCTCCCGCTCTCAGCTGGTGCAGCTGCCCGTGGCCG



ACTGCATGAAGTATCGCTCCTGTGCAGACTGTGTCCTCGCCCGGGACCCC



TATTGCGCCTGGAGCGTCAACACCAGCCGCTGTGTGGCCGTGGGTGGCCA



CTCTGGATCTCTACTGATCCAGCATGTGATGACCTCGGACACTTCAGGCA



TCTGCAACCTCCGTGGCAGTAAGAAAGGGCTTATTTCAAAACCTTTGTCC



CTCAGTTCCAGGAGGCAGCATTTGCCAATGGAAAGCTCTAGGAAACACC



AGTCTTGAGAGGTGGCCAGCCAGACTGCCTGTCCACATGCGTGTCAGCAC



ATACAGCCGCTTCCTGGAAGCCGCCTGGAATGTCTTCACGGCAGCGTTTT



GCTCACACAGCAGCTTTTGCACGCCCCAGGCAGCCCCGACTGCTGAAATC



CAACTTGAGCTGGCTGGTGGTCCCTGGATCCTAGAGCCCTTCACTTCGGG



TTACTCCCTCTTTCTTGCCTCTATTTCTTAGTTGGAAGAAATAAACTCACA



AATTATGGTGCAGTAATTTTCCGGGGAAAGTAAAGCCTCAGGAATGCCCA



CGCCTTTCTTCCAAAGCCTTTGTCTCTGAGACCTCTTAAGTTCTAAGATTA



AATGCCCCTCGCTGTTCTTCCTCTGAAA





ESR1->
ATGACCATGACCCTCCACACCAAAGCATCTGGGATGGCCCTACTGCATCA
SEQ ID


AKAP12
GATCCAAGGGAACGAGCTGGAGCCCCTGAACCGTCCGCAGCTCAAGATC
NO: 93



CCCCTGGAGCGGCCCCTGGGCGAGGTGTACCTGGACAGCAGCAAGCCCG



CCGTGTACAACTACCCCGAGGGCGCCGCCTACGAGTTCAACGCCGCGGCC



GCCGCCAACGCGCAGGTCTACGGTCAGACCGGCCTCCCCTACGGCCCCGG



GTCTGAGGCTGCGGCGTTCGGCTCCAACGGCCTGGGGGGTTTCCCCCCAC



TCAACAGCGTGTCTCCGAGCCCGCTGATGCTACTGCACCCGCCGCCGCAG



CTGTCGCCTTTCCTGCAGCCCCACGGCCAGCAGGTGCCCTACTACCTGGA



GAACGAGCCCAGCGGCTACACGGTGCGCGAGGCCGGCCCGCCGGCATTC



TACAGGCCAAATTCAGATAATCGACGCCAGGGTGGCAGAGAAAGATTGG



CCAGTACCAATGACAAGGGAAGTATGGCTATGGAATCTGCCAAGGAGAC



TCGCTACTGTGCAGTGTGCAATGACTATGCTTCAGGCTACCATTATGGAG



TCTGGTCCTGTGAGGGCTGCAAGGCCTTCTTCAAGAGAAGTATTCAAGGA



CATAACGACTATATGTGTCCAGCCACCAACCAGTGCACCATTGATAAAAA



CAGGAGGAAGAGCTGCCAGGCCTGCCGGCTCCGTAAATGCTACGAAGTG



GGAATGATGAAAGGTGTTGGACAGAGAGACTCTGAAGATGTGAGCAAAA



GAGACTCCGATAAAGAGATGGCTACTAAGTCAGCGGTTGTTCACGACATC



ACAGATGATGGGCAGGAGGAGACACCCGAAATAATCGAACAGATTCCTT



CTTCAGAAAGCAATTTAGAAGAGCTAACACAACCCACTGAGTCCCAGGCT



AATGATATTGGATTTAAGAAGGTGTTTAAGTTTGTTGGCTTTAAATTCACT



GTGAAAAAGGATAAGACAGAGAAGCCTGACACTGTCCAGCTACTCACTG



TGAAGAAAGATGAAGGGGAGGGAGCAGCAGGGGCTGGCGACCACAAGG



ACCCCAGCCTTGGGGCTGGAGAAGCAGCATCCAAAGAAAGCGAACCCAA



ACAATCTACAGAGAAACCCGAAGAGACCCTGAAGCGTGAGCAAAGCCAC



GCAGAAATTTCTCCCCCAGCCGAATCTGGCCAAGCAGTGGAGGAATGCA



AAGAGGAAGGAGAAGAGAAACAAGAAAAAGAACCTAGCAAGTCTGCAG



AATCTCCGACTAGTCCCGTGACCAGTGAAACAGGATCAACCTTCAAAAAA



TTCTTCACTCAAGGTTGGGCCGGCTGGCGCAAAAAGACCAGTTTCAGGAA



GCCGAAGGAGGATGAAGTGGAAGCTTCAGAGAAGAAAAAGGAACAAGA



GCCAGAAAAAGTAGACACAGAAGAAGACGGAAAGGCAGAGGTTGCCTC



CGAGAAACTGACCGCCTCCGAGCAAGCCCACCCACAGGAGCCGGCAGAA



AGTGCCCACGAGCCCCGGTTATCAGCTGAATATGAGAAAGTTGAGCTGCC



CTCAGAGGAGCAAGTCAGTGGCTCGCAGGGACCTTCTGAAGAGAAACCT



GCTCCGTTGGCGACAGAAGTGTTTGATGAGAAAATAGAAGTCCACCAAG



AAGAGGTTGTGGCCGAAGTCCACGTCAGCACCGTGGAGGAGAGAACCGA



AGAGCAGAAAACGGAGGTGGAAGAAACAGCAGGGTCTGTGCCAGCTGA



AGAATTGGTTGAAATGGATGCAGAACCTCAGGAAGCTGAACCTGCCAAG



GAGCTGGTGAAGCTCAAAGAAACGTGTGTTTCCGGAGAGGACCCTACAC



AGGGAGCTGACCTCAGTCCTGATGAGAAGGTGCTGTCCAAACCCCCCGA



AGGCGTTGTGAGTGAGGTGGAAATGCTGTCATCACAGGAGAGAATGAAG



GTGCAGGGAAGTCCACTAAAGAAGCTTTTTACCAGCACTGGCTTAAAAAA



GCTTTCTGGAAAGAAACAGAAAGGGAAAAGAGGAGGAGGAGACGAGGA



ATCAGGGGAGCACACTCAGGTTCCAGCCGATTCTCCGGACAGCCAGGAG



GAGCAAAAGGGCGAGAGCTCTGCCTCATCCCCTGAGGAGCCCGAGGAGA



TCACGTGTCTGGAAAAGGGCTTAGCCGAGGTGCAGCAGGATGGGGAAGC



TGAAGAAGGAGCTACTTCCGATGGAGAGAAAAAAAGAGAAGGTGTCACT



CCCTGGGCATCATTCAAAAAGATGGTGACGCCCAAGAAGCGTGTTAGAC



GGCCTTCGGAAAGTGATAAAGAAGATGAGCTGGACAAGGTCAAGAGCGC



TACCTTGTCTTCCACCGAGAGCACAGCCTCTGAAATGCAAGAAGAAATGA



AAGGGAGCGTGGAAGAGCCAAAGCCGGAAGAACCAAAGCGCAAGGTGG



ATACCTCAGTATCTTGGGAAGCTTTAATTTGTGTGGGATCATCCAAGAAA



AGAGCAAGGAGAGGGTCCTCTTCTGATGAGGAAGGGGGACCAAAAGCAA



TGGGAGGAGACCACCAGAAAGCTGATGAGGCCGGAAAAGACAAAGAGA



CGGGGACAGACGGGATCCTTGCTGGTTCCCAAGAACATGATCCAGGGCA



GGGAAGTTCCTCCCCGGAGCAAGCTGGAAGCCCTACCGAAGGGGAGGGC



GTTTCCACCTGGGAGTCATTTAAAAGGTTAGTCACGCCAAGAAAAAAATC



AAAGTCCAAGCTGGAAGAGAAAAGCGAAGACTCCATAGCTGGGTCTGGT



GTAGAACATTCCACTCCAGACACTGAACCCGGTAAAGAAGAATCCTGGG



TCTCAATCAAGAAGTTTATTCCTGGACGAAGGAAGAAAAGGCCAGATGG



GAAACAAGAACAAGCCCCTGTTGAAGACGCAGGGCCAACAGGGGCCAAC



GAAGATGACTCTGATGTCCCGGCCGTGGTCCCTCTGTCTGAGTATGATGC



TGTAGAAAGGGAGAAAATGGAGGCACAGCAAGCCCAAAAAAGCGCAGA



GCAGCCCGAGCAGAAGGCAGCCACTGAGGTGTCCAAGGAGCTCAGCGAG



AGTCAGGTTCATATGATGGCAGCAGCTGTCGCTGACGGGACGAGGGCAG



CTACCATTATTGAAGAAAGGTCTCCTTCTTGGATATCTGCTTCAGTGACAG



AACCTCTTGAACAAGTAGAAGCTGAAGCCGCACTGTTAACTGAGGAGGT



ATTGGAAAGAGAAGTAATTGCAGAAGAAGAACCCCCCACGGTTACTGAA



CCTCTGCCAGAGAACAGAGAGGCCCGGGGCGACACGGTCGTTAGTGAGG



CGGAATTGACCCCCGAAGCTGTGACAGCTGCAGAAACTGCAGGGCCATT



GGGTGCCGAAGAAGGAACCGAAGCATCTGCTGCTGAAGAGACCACAGAA



ATGGTGTCAGCAGTCTCCCAGTTAACCGACTCCCCAGACACCACAGAGGA



GGCCACTCCGGTGCAGGAGGTGGAAGGTGGCGTACCTGACATAGAAGAG



CAAGAGAGGCGGACTCAAGAGGTCCTCCAGGCAGTGGCAGAAAAAGTGA



AAGAGGAATCCCAGCTGCCTGGCACCGGTGGGCCAGAAGATGTGCTTCA



GCCTGTGCAGAGAGCAGAGGCAGAAAGACCAGAAGAGCAGGCTGAAGC



GTCGGGTCTGAAGAAAGAGACGGATGTAGTGTTGAAAGTAGATGCTCAG



GAGGCAAAAACTGAGCCTTTTACACAAGGGAAGGTGGTGGGGCAGACCA



CCCCAGAAAGCTTTGAAAAAGCTCCTCAAGTCACAGAGAGCATAGAGTC



CAGTGAGCTTGTAACCACTTGTCAAGCCGAAACCTTAGCTGGGGTAAAAT



CACAGGAGATGGTGATGGAACAGGCTATCCCCCCTGACTCGGTGGAAAC



CCCTACAGACAGTGAGACTGATGGAAGCACCCCCGTAGCCGACTTTGACG



CACCAGGCACAACCCAGAAAGACGAGATTGTGGAAATCCATGAGGAGAA



TGAGGTCGCATCTGGTACCCAGTCAGGGGGCACAGAAGCAGAGGCAGTT



CCTGCACAGAAAGAGAGGCCTCCAGCACCTTCCAGTTTTGTGTTCCAGGA



AGAAACTAAAGAACAATCAAAGATGGAAGACACTCTAGAGCATACAGAT



AAAGAGGTGTCAGTGGAAACTGTATCCATTCTGTCAAAGACTGAGGGGA



CTCAAGAGGCTGACCAGTATGCTGATGAGAAAACCAAAGACGTACCATT



TTTCGAAGGACTTGAGGGGTCTATAGACACAGGCATAACAGTCAGTCGG



GAAAAGGTCACTGAAGTTGCCCTTAAAGGTGAAGGGACAGAAGAAGCTG



AATGTAAAAAGGATGATGCTCTTGAACTGCAGAGTCACGCTAAGTCTCCT



CCATCCCCCGTGGAGAGAGAGATGGTAGTTCAAGTCGAAAGGGAGAAAA



CAGAAGCAGAGCCAACCCATGTGAATGAAGAGAAGCTTGAGCACGAAAC



AGCTGTTACCGTATCTGAAGAGGTCAGTAAGCAGCTCCTCCAGACAGTGA



ATGTGCCCATCATAGATGGGGCAAAGGAAGTCAGCAGTTTGGAAGGAAG



CCCTCCTCCCTGCCTAGGTCAAGAGGAGGCAGTATGCACCAAAATTCAAG



TTCAGAGCTCTGAGGCATCATTCACTCTAACAGCGGCTGCAGAGGAGGAA



AAGGTCTTAGGAGAAACTGCCAACATTTTAGAAACAGGTGAAACGTTGG



AGCCTGCAGGTGCACATTTAGTTCTGGAAGAGAAATCCTCTGAAAAAAAT



GAAGACTTTGCCGCTCATCCAGGGGAAGATGCTGTGCCCACAGGGCCCG



ACTGTCAGGCAAAATCGACACCAGTGATAGTATCTGCTACTACCAAGAAA



GGCTTAAGTTCCGACCTGGAAGGAGAGAAAACCACATCACTGAAGTGGA



AGTCAGATGAAGTCGATGAGCAGGTTGCTTGCCAGGAGGTCAAAGTGAG



TGTAGCAATTGAGGATTTAGAGCCTGAAAATGGGATTTTGGAACTTGAGA



CCAAAAGCAGTAAACTTGTCCAAAACATCATCCAGACAGCCGTTGACCA



GTTTGTACGTACAGAAGAAACAGCCACCGAAATGTTGACGTCTGAGTTAC



AGACACAAGCTCACGTGATAAAAGCTGACAGCCAGGACGCTGGACAGGA



AACGGAGAAAGAAGGAGAGGAACCTCAGGCCTCTGCACAGGATGAAAC



ACCAATTACTTCAGCCAAAGAGGAGTCAGAGTCAACCGCAGTGGGACAA



GCACATTCTGATATTTCCAAAGACATGAGTGAAGCCTCAGAAAAGACCAT



GACTGTTGAGGTAGAAGGTTCCACTGTAAATGATCAGCAGCTGGAAGAG



GTCGTCCTCCCATCTGAGGAAGAGGGAGGTGGAGCTGGAACAAAGTCTG



TGCCAGAAGATGATGGTCATGCCTTGTTAGCAGAAAGAATAGAGAAGTC



ACTAGTTGAACCGAAAGAAGATGAAAAAGGTGATGATGTTGATGACCCT



GAAAACCAGAACTCAGCCCTGGCTGATACTGATGCCTCAGGAGGCTTAAC



CAAAGAGTCCCCAGATACAAATGGACCAAAACAAAAAGAGAAGGAGGA



TGCCCAGGAAGTAGAATTGCAGGAAGGAAAAGTGCACAGTGAATCAGAT



AAAGCGATCACACCCCAAGCACAGGAGGAGTTACAGAAACAAGAGAGA



GAATCTGCAAAGTCAGAACTTACAGAATCTTAA





APPBP2->
ATGGCGGCCGTGGAACTAGAGTGGATCCCAGAGACTCTCTATAACACCGC
SEQ ID


KIF19
CATCTCCGCTGTCGTGGACAACTACATCCGCTCCCGCCGAGACATCCGCT
NO: 94



CCTTGCCCGAGAACATCCAGTTTGATGTTTACTACAAGCTTTACCAACAG



GGACGCTTATGTCAACTGGGCAGTGAATTTTGTGAATTGGAAGTTTTTGC



TAAAGTACTGAGAGCTTTGGATAAAAGGTGAAGCAGAACCTCCTGAACG



TCTCCTACCACATCGCCCAGTACACCAGCATCATCGCTGACCTGCGGGGC



GAGATCCAGCGACTCAAGCGCAAGATTGATGAGCAGACTGGGCGGGGCC



AGGCCCGGGGCCGGCAGGATCGGGGTGACATCCGCCACATCCAAGCTGA



GGTCCAGCTGCACAGCGGGCAGGGTGAGAAGGCTGGCATGGGACAGCTT



CGGGAGCAGCTCGCCAGCGCCTTCCAGGAGCAGATGGATGTGCGGAGGC



GCCTGCTGGAGCTGGAGAACCGCGCCATGGAGGTCCAGATTGACACCTCC



CGACACCTGCTCACCATCGCCGGCTGGAAGCATGAGAAGTCCCGCCGGG



CCCTCAAATGGCGGGAGGAGCAGCGAAAGGAGTGCTACGCTAAGGACGA



CAGCGAGAAGGACTCAGACACAGGTGATGACCAACCAGACATCCTGGAG



CCACCCGAGGTGGCCGCAGCCCGGGAGAGCATTGCAGCCCTGGTGGACG



AGCAGAAGCAACTGCGCAAGCAGAAGCTGGCGCTGGAGCAGCGCTGCCG



GGAGCTGCGCGCGCGGGGCCGGCGCCTGGAGGAGACGCTGCCGCGGCGC



ATCGGCTCCGAGGAGCAGCGCGAGGTGCTCAGCCTGCTGTGCCGCGTGCA



CGAGCTCGAGGTGGAGAACACCGAGATGCAGTCGCACGCGCTGCTCCGC



GACGGTGCGCTCCGCCACCGCCACGAGGCCGTGCGCCGCCTGGAGCAGC



ACCGCAGTCTCTGCGACGAGATTATCCAGGGCCAGCGGCAGATCATCGAC



GACTACAACCTGGCCGTCCCGCAGCGCCTGGAAGAGCTCTACGAAGTGTA



CCTGCGGGAGCTGGAGGAGGGCAGCCTGGAGCAGGCCACCATCATGGAC



CAAGTGGCCTCCAGGGCCCTGCAGGACAGCTCCTTGCCCAAAATTACCCC



AGCAGGAACCTCACTGACCCCAGATTCTGACCTGGAGAGTGTGAAGACA



TTGAGCTCTGATGCCCAGCACCTGCAGAACAGCGCCCTCCCTCCCCTCAG



CACAGAGAGTGAAGGCCACCACGTGTTCAAGGCTGGTACTGGGGCCTGG



CAGGCAAAAAGCTCCTCTGTGCCCACCCCACCTCCCATCCAGCTCGGCAG



CCTGGTGACGCAGGAGGCCCCGGCTCAGGACAGCCTGGGCAGCTGGATC



AACTCTTCCCCTGACAGCAGTGAGAACCTGTCGGAGATCCCCTTGTCCCA



CAAAGAGAGGAAGGAGATCCTGACTGGCACCAAGTGCATCTGGGTGAAG



GCCGCCCGGCGGCGCTCGCGGGCCCTGGGAACCGAGGGGCGACACCTGC



TGGCACCCGCGACAGAGCGCAGCAGCCTGTCCCTGCACTCACTGAGCGA



GGGCGACGATGCGCGGCCACCAGGCCCACTGGCCTGCAAGCGGCCGCCC



AGCCCCACACTACAGCATGCTGCCAGTGAGGACAACCTGTCCAGCAGCA



CGGGCGAGGCCCCGTCCCGGGCAGTCGGACATCATGGGGACGGCCCCAG



GCCCTGGCTGCGTGGCCAGAAGAAAAGCCTGGGCAAGAAAAGGGAGGA



GTCGCTGGAGGCAAAGAGAAGGAAGCGGAGGTCCCGATCCTTCGAGGTC



ACCGGGCAAGGGCTCTCCCACCCCAAGACACACCTCCTGGGGCCCCATCA



GGCGGAGCGCATCTCGGACCACAGGATGCCAGTGTGCAGGCACCCAGCC



CCTGGTATCCGGCATCTGGGAAAGGTCACGCTACCTTTGGCCAAAGTCAA



ACTCCCTCCAAGCCAGAACACGGGCCCGGGGGACTCCTCACCCCTGGCTG



TTCCCCCCAACCCAGGTGGTGGTTCTCGACGGGCTACCCGTGGGCCCCGC



CTGCCCCACGGCACAAGCACCCATGGCAAAGATGGATGCTCCCGGCATA



ACTGA





ADAM9->
ATGGGGTCTGGCGCGCGCTTTCCCTCGGGGACCCTTCGTGTCCGGTGGTT
SEQ ID


ANK1
GCTGTTGCTTGGCCTGGTGGGCCCAGTCCTCGGTGCGGCGCGGCCAGGCT
NO: 95



TTCAACAGACCTCACATCTTTCTTCTTATGAAATTATAACTCCTTGGAGAT



TAACTAGAGAAAGAAGAGAAGCCCCTAGGCCCTATTCAAAACAAGTATC



TTATGTTATTCAGGCTGAAGGAAAAGAGCATATTATTCACTTGGAAAGGA



ACAAAGACCTTTTGCCTGAAGATTTTGTGGTTTATACTTACAACAAGGAA



GGGACTTTAATCACTGACCATCCCAATATACAGAATCATTGTCATTATCG



GGGCTATGTGGAGGGAGTTCATAATTCATCCATTGCTCTTAGCGACTGTTT



TGGACTCAGAGGATTGCTGCATTTAGAGAATGCGAGTTATGGGATTGAAC



CCCTGCAGAACAGCTCTCATTTTGAGCACATCATTTATCGAATGGATGAT



GTCTACAAAGAGCCTCTGAAATGTGGAGTTTCCAACAAGGATATAGAGA



AAGAAACTGCAAAGGATGAAGAGGAAGAGCCTCCCAGCATGACTCAGCT



ACTTCGAAGAAGAAGAGCTGTCTTGCCACAGACCCGGTATGTGGAGCTGT



TCATTGTCGTAGACAAGGAAAGGTATGACATGATGGGAAGAAATCAGAC



TGCTGTGAGAGAAGAGATGATTCTCCTGGCAAACTACTTGGATAGTATGT



ATATTATGTTAAATATTCGAATTGTGCTAGTTGGACTGGAGATTTGGACC



AATGGAAACCTGATCAACATAGTTGGGGGTGCTGGTGATGTGCTGGGGA



ACTTCGTGCAGTGGCGGGAAAAGTTTCTTATCACACGTCGGAGACATGAC



AGTGCACAGCTAGTTCTAAAGAAAGGTTTTGGTGGAACTGCAGGAATGG



CATTTGTGGGAACAGTGTGTTCAAGGAGCCACGCAGGCGGGATTAATGTG



AAGGGGAACACGGCCCTGCACATCGCTGCTCTAGCCGGGCAGGATGAGG



TGGTCCGGGAGCTTGTCAACTATGGAGCCAACGTCAACGCCCAGTCACAG



AAAGGTTTTACACCCCTGTACATGGCAGCACAAGAGAACCACTTGGAAGT



GGTTAAGTTTTTACTGGAAAATGGAGCTAACCAGAATGTAGCCACAGAA



GACGGCTTCACGCCTCTGGCGGTAGCCCTGCAGCAGGGCCATGAGAACGT



CGTCGCGCACCTCATCAACTACGGCACCAAGGGGAAGGTGCGCCTCCCG



GCCCTGCACATCGCGGCCCGCAACGACGACACGCGCACGGCTGCGGTGC



TGCTGCAGAACGACCCCAACCCGGACGTGCTTTCCAAGACGGGATTCACG



CCCCTGCACATTGCGGCTCACTACGAGAACCTCAACGTGGCCCAGTTGCT



CCTCAACAGAGGAGCCAGCGTCAATTTCACACCACAGAACGGCATCACG



CCACTGCACATCGCCTCCCGCAGGGGCAACGTGATCATGGTGCGGCTGCT



GCTGGATCGGGGAGCCCAGATAGAAACCAAGACCAAGGACGAATTGACA



CCTCTCCACTGTGCAGCTCGAAATGGGCACGTGCGAATCTCAGAGATCCT



GCTGGACCACGGGGCACCAATCCAAGCCAAAACCAAGAACGGCCTGTCC



CCAATTCACATGGCGGCTCAGGGAGACCACCTCGACTGTGTCCGGCTCCT



GTTGCAATACGACGCAGAGATAGACGACATCACCCTGGACCACCTGACC



CCACTCCACGTGGCTGCCCACTGTGGACACCACAGGGTGGCTAAGGTCCT



TCTGGATAAAGGGGCCAAACCCAACTCCAGAGCCCTGAATGGCTTTACCC



CCTTACACATCGCCTGCAAAAAGAACCACGTCCGTGTCATGGAGCTGCTG



CTGAAGACGGGAGCCTCGATCGACGCGGTCACCGAGTCTGGCCTGACAC



CTCTCCACGTGGCCTCCTTCATGGGGCACCTTCCCATCGTGAAGAACCTCC



TGCAGCGGGGGGCGTCGCCCAACGTCTCCAACGTGAAAGTGGAGACCCC



GCTACACATGGCAGCCAGAGCCGGGCACACGGAAGTGGCCAAATATTTA



CTCCAGAACAAAGCCAAAGTCAATGCCAAGGCCAAGGATGACCAGACCC



CACTTCACTGTGCAGCTCGCATCGGCCACACAAACATGGTGAAGCTCCTG



CTGGAAAATAACGCCAACCCCAACCTGGCCACCACCGCCGGGCACACCC



CCCTGCACATTGCAGCCCGTGAGGGCCATGTGGAAACAGTCCTGGCCCTT



CTGGAAAAGGAAGCATCCCAGGCCTGCATGACCAAGAAAGGATTTACCC



CTCTGCACGTGGCGGCCAAGTACGGGAAGGTGCGGGTGGCAGAGCTGCT



GCTGGAGCGGGACGCACACCCGAATGCTGCCGGAAAAAATGGCCTGACC



CCCCTGCACGTGGCCGTCCATCACAACAACCTGGACATCGTCAAGCTGCT



GCTTCCCCGGGGCGGCTCCCCGCACAGCCCTGCCTGGAATGGCTACACCC



CTTTGCACATCGCTGCCAAGCAGAACCAGGTGGAGGTGGCCCGTAGTCTG



CTGCAGTATGGGGGCTCAGCAAACGCCGAGTCGGTGCAAGGTGTGACGC



CCCTTCACCTGGCCGCCCAGGAGGGCCACGCAGAGATGGTGGCTCTGCTG



CTCTCGAAACAAGCCAATGGCAACCTGGGGAACAAGAGCGGACTCACTC



CCCTCCATCTGGTAGCACAAGAAGGCCACGTTCCAGTGGCAGATGTGCTG



ATCAAACACGGCGTCATGGTGGACGCCACCACCCGGATGGGCTACACTCC



CCTCCATGTGGCCAGTCACTATGGAAACATCAAGCTGGTGAAGTTTCTGC



TGCAGCACCAGGCAGATGTCAATGCCAAGACCAAGCTAGGATACAGCCC



CCTGCACCAGGCAGCCCAGCAGGGACACACAGACATCGTGACTCTGCTTC



TGAAAAACGGTGCTTCCCCAAACGAGGTCAGCTCGGATGGAACCACACC



TCTGGCCATAGCCAAGCGCTTGGGCTACATTTCTGTCACCGACGTGCTCA



AGGTCGTCACGGATGAAACCAGTTTCGTGTTAGTCAGTGATAAGCATCGA



ATGAGTTTCCCTGAGACAGTTGATGAGATCCTGGATGTCTCGGAAGATGA



AGGGGAAGAACTCATCAGCTTCAAGGCTGAGAGGCGGGATTCCAGGGAT



GTTGATGAAGAGAAGGAGCTGCTGGATTTTGTGCCGAAGCTAGACCAAG



TGGTGGAATCTCCAGCCATCCCCAGGATTCCCTGTGCCATGCCTGAGACA



GTGGTGATCAGGTCAGAAGAGCAGGAGCAGGCATCTAAAGAGTATGATG



AGGACTCCCTCATCCCCAGCAGCCCGGCCACCGAGACCTCAGACAACATC



AGCCCGGTGGCCAGCCCGGTGCATACAGGGTTTCTGGTGAGCTTCATGGT



TGACGCCCGGGGTGGTTCCATGAGAGGAAGTCGCCACAACGGCCTGCGA



GTGGTGATCCCGCCACGGACGTGCGCAGCGCCCACCCGCATCACCTGCCG



CCTGGTCAAGCCCCAGAAGCTCAGCACGCCGCCCCCACTGGCCGAGGAG



GAGGGCCTGGCCAGCAGGATCATAGCACTGGGGCCCACGGGGGCACAGT



TCCTGAGCCCTGTAATCGTGGAGATCCCGCACTTTGCCTCCCATGGCCGT



GGAGACCGCGAGCTCGTGGTTCTGAGGAGCGAAAACGGCTCCGTGTGGA



AGGAGCACAGGAGCCGCTATGGAGAGAGCTACCTGGATCAGATCCTCAA



CGGGATGGACGAAGAGCTGGGGAGCCTGGAGGAGCTAGAGAAGAAGAG



GGTGTGCCGAATCATCACCACCGACTTCCCGCTGTACTTCGTGATCATGTC



ACGGCTCTGCCAGGACTACGACACCATCGGTCCCGAAGGGGGCTCCCTGA



AGAGCAAGCTGGTGCCCCTGGTACAGGCAACGTTCCCGGAGAATGCCGT



CACCAAGAGAGTGAAGCTGGCTCTGCAGGCCCAGCCTGTCCCGGATGAG



CTTGTCACTAAGCTCCTGGGCAACCAGGCCACATTCAGCCCCATTGTCAC



CGTGGAGCCCCGGCGCCGGAAGTTCCACCGCCCCATTGGGCTTCGGATCC



CACTACCTCCTTCCTGGACCGACAACCCGAGGGACAGCGGGGAGGGAGA



CACCACCAGCCTGCGCCTGCTTTGCAGCGTCATTGGAGGAACAGACCAAG



CCCAGTGGGAAGACATAACAGGAACCACCAAACTTGTATATGCCAACGA



GTGCGCCAACTTCACCACCAATGTCTCTGCCAGGTTTTGGCTGTCGGACT



GTCCTCGGACTGCTGAGGCTGTGAACTTTGCCACCCTGCTGTACAAAGAG



CTCACTGCAGTGCCCTACATGGCCAAATTCGTCATCTTTGCCAAGATGAA



TGACCCCCGAGAGGGGCGCCTGCGCTGCTACTGCATGACAGATGATAAA



GTGGACAAGACCCTGGAGCAGCATGAGAACTTCGTGGAGGTGGCCCGGA



GCAGGGACATAGAGGTGTTGGAAGGAATGTCCCTGTTTGCAGAACTCTCT



GGGAACCTGGTGCCTGTGAAGAAAGCTGCCCAGCAGCGGAGCTTCCACTT



CCAGTCATTTCGGGAGAACCGTCTGGCCATGCCTGTAAAGGTGAGGGACA



GCAGTCGAGAGCCGGGAGGGTCCCTGTCGTTTCTGCGCAAGGCGATGAA



GTACGAGGACACCCAGCACATTCTCTGCCACCTGAACATCACCATGCCCC



CCTGCGCCAAGGGAAGTGGAGCCGAAGATAGGAGAAGGACCCCGACGCC



CCTGGCCCTGCGATACAGCATTCTCAGTGAGTCCACACCAGGTTCTCTCA



GTGGGACAGAGCAGGCAGAGATGAAGATGGCTGTTATCTCAGAGCACCT



CGGTCTCAGCTGGGCAGAGTTGGCCCGGGAGCTGCAGTTCAGTGTGGAA



GACATCAACAGGATCCGAGTGGAAAATCCCAACTCCCTGTTGGAGCAGA



GTGTGGCCTTGCTGAACCTCTGGGTCATCCGTGAAGGCCAAAACGCAAAC



ATGGAGAATCTGTACACAGCCCTGCAGAGCATTGACCGTGGCGAGATCGT



GAACATGCTGGAGGGTTCCGGCCGACAGAGCCGCAACTTGAAGCCAGAC



AGGCGGCACACCGACCGCGACTACTCGCTGTCACCCTCCCAGATGAATGG



TTACTCCTCACTGCAGGACGAGCTGCTGTCCCCTGCCTCCCTGGGCTGTGC



ACTTTCCTCTCCGCTACGTGCAGACCAGTACTGGAATGAGGTGGCCGTCC



TAGACGCCATCCCCTTGGCGGCCACGGAGCATGACACCATGCTGGAGATG



TCTGACATGCAGGTGTGGTCTGCGGGCCTCACGCCTTCTCTGGTCACTGCT



GAGGACTCCTCTCTGGAGTGTAGCAAGGCTGAGGACTCTGATGCCACAGG



TCACGAGTGGAAGTTGGAGGGGGCACTCTCAGAGGAACCGCGGGGCCCC



GAGTTGGGCTCTCTGGAACTTGTGGAGGACGACACAGTGGATTCAGATGC



CACAAATGGCCTTATCGATTTGCTTGAACAGGAGGAAGGTCAGAGGTCA



GAAGAGAAGCTGCCAGGTTCTAAGAGGCAGGATGACGCGACAGGTGCAG



GGCAGGACTCAGAGAATGAAGTGTCTCTTGTTTCAGGCCATCAGAGGGG



GCAAGCCCGAATCACACATTCCCCCACCGTGAGTCAGGTGACGGAGAGG



AGTCAGGACAGACTGCAGGACTGGGATGCAGACGGCTCGATTGTCTCAT



ACCTGCAAGATGCTGCACAAGGTTCCTGGCAAGAGGAGGTCACGCAAGG



TCCACACTCATTCCAGGGAACAAGTACCATGACTGAAGGGCTAGAGCCC



GGTGGATCTCAGGAGTACGAGAAGGTCCTGGTGTCTGTAAGTGAGCACA



CGTGGACAGAACAGCCCGAGGCTGAGAGCTCCCAGGCCGACAGGGACCG



GAGGCAGCAAGGCCAAGAAGAGCAGGTGCAGGAGGCCAAGAACACCTT



CACCCAAGTGGTGCAGGGGAATGAGTTTCAGAATATTCCAGGGGAGCAG



GTGACAGAGGAGCAATTCACGGATGAGCAGGGCAACATTGTCACCAAGA



AGATCATTCGCAAGGTGGTTCGACAGATAGACTTGTCCAGCGCCGATGCC



GCCCAGGAGCACGAGGAGGTGGAGCTGAGAGGGAGTGGCCTACAGCCGG



ACCTGATAGAGGGCAGGAAGGGGGCGCAGATAGTGAAGCGGGCCAGCCT



GAAAAGGGGGAAACAGTGA





MYO18A
ATGTTTAACCTAATGAAGAAAGACAAGGACAAAGATGGCGGGCGGAAGG
SEQ ID


->
AGAAGAAGGAGAAAAAGGAGAAAAAGGAGCGGATGTCAGCGGCAGAGC
NO: 96


SSH2
TTCGGAGCCTGGAGGAGATGAGCCTGCGACGTGGCTTCTTCAACCTGAAC



CGCTCCTCCAAGCGTGAATCCAAGACGCGCCTGGAAATCTCCAACCCCAT



CCCCATCAAGGTGGCCAGCGGCTCTGACCTGCACCTGACTGACATTGACT



CCGATAGTAACCGGGGCAGCGTCATCCTGGACTCGGGCCACCTAAGTACA



GCCAGCTCCAGCGATGACCTCAAGGGTGAGGAGGGTAGCTTCCGTGGCTC



GGTGCTGCAGCGGGCAGCCAAGTTCGGCTCACTGGCCAAGCAGAACTCA



CAGATGATTGTCAAGCGCTTTTCCTTCTCCCAGCGTAGCCGGGATGAGAG



CGCCTCAGAAACCTCGACGCCCTCAGAGCACTCTGCCGCCCCCTCGCCAC



AGGTGGAGGTGAGGACTCTAGAGGGACAGCTGGTGCAGCATCCTGGCCC



AGGCATCCCTCGACCAGGGCACCGATCCCGAGCCCCTGAGCTAGTGACTA



AAAAGTTCCCAGTCGACCTGCGCCTGCCCCCCGTGGTGCCCCTGCCCCCA



CCTACCCTCCGGGAGCTGGAGCTGCAACGACGGCCCACTGGAGACTTTGG



CTTCTCCCTGCGGCGCACAACCATGCTGGATCGGGGCCCCGAGGGCCAGG



CCTGTCGGCGTGTGGTCCACTTTGCTGAGCCTGGTGCAGGCACCAAGGAC



CTGGCCCTGGGGCTGGTGCCAGGAGATCGACTGGTGGAGATTAATGGGC



ACAATGTGGAGAGCAAGTCCAGGGATGAGATTGTGGAGATGATCCGGCA



GTCAGGGGACAGCGTGCGGCTCAAGGTGCAGCCCATTCCAGAGCTCAGC



GAGCTCAGCAGGAGCTGGCTGCGGAGCGGCGAGGGACCTCGCAGGGAGC



CATCCGATGAGGCAGACAGTGGGGAGGAAGAATGCCGGTCACAGCCCAG



GAGCATCAGCGAGAGCTTTCTAACTGTCAAAGGTGCTGCCCTTTTTCTAC



CACGGGGAAATGGCTCATCCACACCAAGAATCAGCCACAGACGGAACAA



GCATGCAGGCGATCTCCAACAGCATCTCCAAGCAATGTTCATTTTACTCC



GCCCAGAAGACAACATCAGGCTGGCTGTAAGACTGGAAAGTACTTACCA



GAATCGAACACGCTATATGGTAGTGGTTTCAACTAATGGTAGACAAGACA



CTGAAGAAAGCATCGTCCTAGGAATGGATTTCTCCTCTAATGACAGTAGC



ACTTGTACCATGGGCTTAGTTTTGCCTCTCTGGAGCGACACGCTAATTCAT



TTGGATGGTGATGGTGGGTTCAGTGTATCGACGGATAACAGAGTTCACAT



ATTCAAACCTGTATCTGTGCAGGCAATGTGGTCTGCACTACAGAGCTTAC



ACAAGGCTTGTGAAGTCGCCAGAGCGCATAACTACTACCCAGGCAGCCT



ATTTCTCACTTGGGTGAGTTATTATGAGAGCCATATCAACTCAGATCAAT



CCTCAGTCAATGAATGGAATGCAATGCAAGATGTACAGTCCCACCGGCCC



GACTCTCCAGCTCTCTTCACCGACATACCTACTGAACGTGAACGAACAGA



AAGGCTAATTAAAACCAAATTAAGGGAGATCATGATGCAGAAGGATTTG



GAGAATATTACATCCAAAGAGATAAGAACAGAGTTGGAAATGCAAATGG



TGTGCAACTTGCGGGAATTCAAGGAATTTATAGACAATGAAATGATAGTG



ATCCTTGGTCAAATGGATAGCCCTACACAGATATTTGAGCATGTGTTCCT



GGGCTCAGAATGGAATGCCTCCAACTTAGAGGACTTACAGAACCGAGGG



GTACGGTATATCTTGAATGTCACTCGAGAGATAGATAACTTCTTCCCAGG



AGTCTTTGAGTATCATAACATTCGGGTATATGATGAAGAGGCAACGGATC



TCCTGGCGTACTGGAATGACACTTACAAATTCATCTCTAAAGCAAAGAAA



CATGGATCTAAATGCCTTGTGCACTGCAAAATGGGGGTGAGTCGCTCAGC



CTCCACCGTGATTGCCTATGCAATGAAGGAATATGGCTGGAATCTGGACC



GAGCCTATGACTATGTGAAAGAAAGACGAACGGTAACCAAGCCCAACCC



AAGCTTCATGAGACAACTGGAAGAGTATCAGGGGATCTTGCTGGCAAGC



AAACAGCGGCATAACAAACTATGGAGATCTCATTCAGATAGTGACCTCTC



AGACCACCACGAACCCATCTGCAAACCTGGGCTAGAACTCAACAAGAAG



GATATCACCACCTCAGCAGACCAGATTGCTGAGGTGAAGACCATGGAGA



GTCACCCACCCATACCTCCTGTCTTTGTGGAACATATGGTCCCACAAGAT



GCAAATCAGAAAGGCCTGTGTACCAAAGAAAGAATGATCTGCTTGGAGT



TTACTTCTAGGGAATTTCATGCTGGACAGATTGAGGATGAATTAAACTTA



AATGACATCAATGGATGCTCATCAGGGTGTTGTCTGAATGAATCAAAATT



TCCTCTTGACAATTGCCATGCATCCAAAGCCTTAATTCAGCCTGGACATGT



CCCAGAAATGGCCAACAAGTTTCCAGACTTAACAGTGGAAGATTTGGAG



ACAGATGCACTGAAAGCAGACATGAATGTCCACCTACTGCCTATGGAAG



AATTGACATCTCCACTGAAAGACCCCCCCATGTCCCCTGATCCTGAGTCA



CCAAGCCCCCAACCCAGTTGCCAGACTGAAATCTCAGATTTCAGTACAGA



TCGCATTGACTTTTTTAGTGCCCTAGAGAAGTTTGTGGAGCTCTCCCAAGA



AACCCGGTCACGATCTTTTTCCCATTCAAGGATGGAGGAACTGGGTGGAG



GAAGGAATGAGAGCTGTCGACTGTCAGTGGTAGAAGTAGCCCCTTCCAA



AGTGACAGCTGATGACCAGAGAAGCAGCTCTTTGAGTAATACTCCCCATG



CATCAGAAGAATCTTCAATGGATGAGGAACAGTCAAAGGCAATTTCAGA



ACTGGTCAGCCCAGACATCTTCATGCAGTCTCACTCGGAAAATGCAATTT



CAGTCAAAGAAATTGTCACTGAAATTGAGTCCATCAGTCAAGGAGTTGGG



CAGATTCAACTGAAAGGAGACATCTTACCCAACCCATGCCATACACCAAA



GAAGAACAGCATCCATGAGCTGCTCCTTGAGAGGGCCCAGACTCCAGAG



AACAAACCTGGACATATGGAGCAAGATGAGGACTCCTGCACAGCCCAGC



CTGAACTAGCCAAAGACTCAGGGATGTGCAACCCAGAAGGCTGCCTAAC



CACACACTCATCTATAGCAGACTTGGAAGAAGGGGAACCAGCTGAGGGG



GAACAAGAGCTCCAGGGCTCAGGGATGCACCCAGGTGCCAAGTGGTACC



CTGGGTCTGTGAGGCGAGCCACCTTGGAGTTCGAAGAGCGCTTACGGCAG



GAGCAAGAGCATCATGGTGCTGCCCCAACATGTACCTCATTGTCCACTCG



TAAGAATTCAAAGAATGATTCTTCTGTGGCAGACCTAGCACCAAAAGGG



AAAAGTGATGAAGCCCCCCCAGAACATTCATTTGTCCTCAAGGAACCAGA



AATGAGCAAAGGCAAAGGGAAATACAGTGGGTCTGAGGCTGGCTCACTG



TCCCATTCTGAGCAGAATGCCACTGTTCCAGCTCCCAGGGTGCTGGAGTT



TGACCACTTGCCAGATCCTCAGGAGGGCCCAGGGTCAGATACTGGAACA



CAGCAGGAAGGAGTCCTGAAGGATCTGAGGACTGTGATTCCATACCAGG



AGTCTGAAACACAAGCAGTCCCTCTTCCCCTTCCCAAGAGGGTAGAAATC



ATTGAATATACCCACATAGTTACATCACCCAATCACACTGGGCCAGGGAG



TGAAATAGCCACCAGTGAGAAGAGCGGAGAGCAAGGGCTGAGGAAAGT



GAACATGGAAAAATCTGTCACTGTGCTCTGCACACTGGATGAAAATCTAA



ACAGGACTCTGGACCCCAACCAGGTTTCTCTGCACCCCCAAGTGCTACCT



CTGCCTCATTCTTCCTCCCCTGAGCACAACAGACCCACTGACCATCCAAC



CTCCATCCTGAGTAGCCCTGAAGACAGAGGCAGCAGCCTGTCCACAGCCC



TGGAGACAGCAGCACCTTTTGTCAGTCATACAACCCATTTACTGTCTGCC



AGTTTGGATTACCTGCATCCCCAGACTATGGTTCACCTGGAGGGCTTCAC



AGAGCAGAGCAGCACTACAGATGAGCCCTCTGCAGAACAGGTTAGCTGG



GAAGAAAGTCAGGAGAGCCCTCTCTCCAGTGGCAGTGAGGTGCCATATA



AGGACTCCCAGCTAAGTAGCGCAGACCTAAGTTTAATTAGCAAACTTGGT



GACAACACTGGGGAGTTACAGGAGAAAATGGACCCATTGCCTGTAGCCT



GTCGACTCCCACATAGCTCTAGTAGTGAAAACATAAAGAGTCTCAGCCAC



AGCCCCGGTGTGGTGAAGGAGCGTGCTAAAGAAATCGAGTCTCGAGTGG



TTTTCCAGGCAGGGCTCACCAAACCATCCCAAATGAGGCGCTCAGCTTCT



CTCGCCAAATTAGGTTACTTGGACCTCTGTAAAGACTGCTTACCAGAGAG



GGAGCCTGCCTCCTGTGAATCCCCTCATCTCAAACTGCTTCAGCCTTTCCT



CAGAACAGACTCAGGCATGCACGCGATGGAGGACCAAGAGTCCCTAGAA



AACCCAGGTGCCCCCCACAACCCAGAGCCCACCAAGTCTTTTGTAGAACA



ACTCACAACAACAGAGTGTATTGTGCAGAGCAAGCCAGTGGAGAGGCCC



CTTGTGCAGTATGCCAAAGAATTTGGTTCTAGTCAGCAGTATTTGCTCCCC



AGGGCAGGACTTGAATTGACTAGTTCTGAAGGAGGCCTTCCCGTGCTACA



GACCCAGGGACTGCAGTGTGCATGCCCAGCTCCAGGGCTGGCCGTGGCA



CCCCGTCAGCAACACGGCAGAACTCACCCCCTTAGGAGACTGAAAAAGG



CAAATGACAAAAAACGGACAACCAACCCCTTCTATAATACCATGTGA





PLEKHA8
ATGGAGGGGGTGCTGTACAAGTGGACCAACTATCTGAGCGGTTGGCAGC
SEQ ID


->
CTCGATGGTTCCTTCTCTGTGGGGGAATATTGTCCTATTATGATTCTCCTG
NO: 97


GNAQ
AAGATGCCTGGAAAGGTTGCAAAGGGAGCATACAAATGGCAGTCTGTGA



AATTCAAGTTCATTCTGTAGATAATACACGCATGGACCTGATAATCCCTG



GGGAACAGTATTTCTACCTGAAGGCCAGAAGTGTGGCTGAAAGACAGCG



GTGGCTGGTGGCCCTGGGATCAGCCAAGGCTTGCCTGACTGACAGTAGGA



CCCAGAAGGAGAAAGAGTTTGCTGAAAACACTGAAAACTTGAAAACCAA



AATGTCAGAACTAAGACTCTACTGTGACCTCCTTGTTCAGCAAGTAGATA



AAACAAAAGAAGTGACCACAACTGGTGTGTCCAATTCTGAGGAGGGAAT



TGATGTGGGAACTTTGCTGAAATCAACCTGTAATACTTTTCTGAAGACCTT



GGAAGAATGCATGCAGATCGCAAATGCAGCCTTCACCTCTGAGCTGCTCT



ACCGCACTCCACCAGGATCACCTCAGCTGGCCATGCTCAAGTCCAGCAAG



ATGAAACATCCTATTATACCAATTCATAATTCATTGGAAAGGCAAATGGA



GTTGAGCACTTGTGAAAATGGATCTTTAAATATGGAAATAAATGGTGAGG



AAGAAATCCTAATGAAAAATAAGAATTCCTTATATTTGAAATCTGCAGAG



ATAGACTGCAGCATATCAAGTGAGGAAAATACAGATGATAATATAACAG



TCCAAGGTGAAATAAGGAAGGAAGATGGAATGGAAAACCTGAAAAATCA



TGACAATAACTTGACTCAGTCTGGATCAGACTCAAGTTGCTCTCCGGAAT



GCCTCTGGGAGGAAGGCAAAGAAGTTATCCCAACTTTCTTTAGTACCATG



AACACAAGCTTTAGTGACATTGAACTTCTGGAAGACAGTGGCATTCCCAC



AGAAGCATTCTTGGCATCATGTTATGCTGTGGTTCCAGTATTAGACAAAC



TTGGCCCTACAGTGTTTGCTCCTGTTAAGATGGATCTTGTTGGAAATATTA



AGAAAGTAAATCAGAAGTATATAACCAACAAAGAAGAGTTTACCACTCT



CCAGAAGATAGTGCTGCACGAAGTGGAGGCGGATGTAGCCCAGGTTAGG



AACTCAGCGACTGAAGCCCTCTTGTGGCTGAAGAGAGGTCTCAAATTTTT



GAAGGGATTTTTGACAGAAGTGAAAAATGGGGAGAAGGATATCCAGACA



GCCCTAAATAATGCATATGGTAAAACATTGCGGCAACACCATGGCTGGGT



AGTTCGAGGGGTTTTTGCGGGACAGGAGAGAGTGGCAAGAGTACGTTTA



TCAAGCAGATGAGAATCATCCATGGGTCAGGATACTCTGATGAAGATAA



AAGGGGCTTCACCAAGCTGGTGTATCAGAACATCTTCACGGCCATGCAGG



CCATGATCAGAGCCATGGACACACTCAAGATCCCATACAAGTATGAGCA



CAATAAGGCTCATGCACAATTAGTTCGAGAAGTTGATGTGGAGAAGGTGT



CTGCTTTTGAGAATCCATATGTAGATGCAATAAAGAGTTTATGGAATGAT



CCTGGAATCCAGGAATGCTATGATAGACGACGAGAATATCAATTATCTGA



CTCTACCAAATACTATCTTAATGACTTGGACCGCGTAGCTGACCCTGCCT



ACCTGCCTACGCAACAAGATGTGCTTAGAGTTCGAGTCCCCACCACAGGG



ATCATCGAATACCCCTTTGACTTACAAAGTGTCATTTTCAGAATGGTCGAT



GTAGGGGGCCAAAGGTCAGAGAGAAGAAAATGGATACACTGCTTTGAAA



ATGTCACCTCTATCATGTTTCTAGTAGCGCTTAGTGAATATGATCAAGTTC



TCGTGGAGTCAGACAATGAGAACCGAATGGAGGAAAGCAAGGCTCTCTT



TAGAACAATTATCACATACCCCTGGTTCCAGAACTCCTCGGTTATTCTGTT



CTTAAACAAGAAAGATCTTCTAGAGGAGAAAATCATGTATTCCCATCTAG



TCGACTACTTCCCAGAATATGATGGACCCCAGAGAGATGCCCAGGCAGCC



CGAGAATTCATTCTGAAGATGTTCGTGGACCTGAACCCAGACAGTGACAA



AATTATCTACTCCCACTTCACGTGCGCCACAGACACCGAGAATATCCGCT



TTGTCTTTGCTGCCGTCAAGGACACCATCCTCCAGTTGAACCTGAAGGAG



TACAATCTGGTCTAA





FBXL20->
ATGAGGAGGGACGTGAACGGAGTGACCAAGAGCAGGTTTGAGATGTTCT
SEQ ID


NSF
CAAATAGTGATGAAGCTGTAATCAATAAAAAACTTCCCAAAGAACTCCTG
NO: 98



TTACGGATATTTTCTTTTCTAGATGTTGTTACCCTGTGCCGCTGTGCTCAG



GTCTCCAGGGCCTGGAATGTTCTGGCTCTGGATGGCAGTAACTGGCAGCG



AATTGACCTATTTGATTTCCAGAGGGATATTGAGGGCCGAGTAGTGGAGA



ATATTTCAAAACGATGTGGGGGCTTTTTACGAAAGTTAAGTCTTCGTGGA



TGTCTTGGAGTGGGAGACAATGCATTAAGAACCTTTGCACAAAACTGCAG



GAACATTGAAGTACTGAATCTAAATGGGTGTACAAAGACAACAGACGCG



TTGTAAACATGTTAAAGGCATCCTGTTATATGGACCCCCAGGTTGTGGTA



AGACTCTCTTGGCTCGACAGATTGGCAAGATGTTGAATGCAAGAGAGCCC



AAAGTGGTCAATGGGCCAGAAATCCTTAACAAATATGTGGGAGAATCAG



AGGCTAACATTCGCAAACTTTTTGCTGATGCTGAAGAGGAGCAAAGGAG



GCTTGGTGCTAACAGTGGTTTGCACATCATCATCTTTGATGAAATTGATGC



CATCTGCAAGCAGAGAGGGAGCATGGCTGGTAGCACGGGAGTTCATGAC



ACTGTTGTCAACCAGTTGCTGTCCAAAATTGATGGCGTGGAGCAGCTAAA



CAACATCCTAGTCATTGGAATGACCAATAGACCAGATCTGATAGATGAGG



CTCTTCTTAGACCTGGAAGACTGGAAGTTAAAATGGAGATAGGCTTGCCA



GATGAGAAAGGCCGACTACAGATTCTTCACATCCACACAGCAAGAATGA



GAGGGCATCAGTTACTCTCTGCTGATGTAGACATTAAAGAACTGGCCGTG



GAGACCAAGAATTTCAGTGGTGCTGAATTGGAGGGTCTGGTGCGAGCAG



CCCAGTCCACTGCTATGAATAGACACATAAAGGCCAGTACTAAAGTGGA



AGTGGACATGGAGAAAGCAGAAAGCCTGCAAGTGACGAGAGGAGACTTC



CTTGCTTCTTTGGAGAATGATATCAAACCAGCCTTTGGCACAAACCAAGA



AGATTATGCAAGTTACATTATGAACGGTATCATCAAATGGGGTGACCCAG



TTACTCGAGTTCTAGATGATGGGGAGCTGCTGGTGCAGCAGACTAAGAAC



AGTGACCGCACACCATTGGTCAGCGTGCTTCTGGAAGGCCCTCCTCACAG



TGGGAAGACTGCTTTAGCTGCAAAAATTGCAGAGGAATCCAACTTCCCGT



TCATCAAGATCTGTTCTCCTGATAAAATGATTGGCTTTTCTGAAACAGCCA



AATGTCAGGCCATGAAGAAGATCTTTGATGATGCGTACAAATCCCAGCTC



AGTTGTGTGGTTGTGGATGACATTGAGAGATTGCTTGATTACGTCCCTATT



GGCCCTCGATTTTCAAATCTTGTATTACAGGCTCTTCTCGTTTTACTGAAA



AAGGCACCTCCTCAGGGCCGCAAGCTTCTTATCATTGGGACCACTAGCCG



CAAAGATGTCCTTCAGGAGATGGAAATGCTTAACGCTTTCAGCACCACCA



TCCACGTGCCCAACATTGCCACAGGAGAGCAGCTGTTGGAAGCTTTGGAG



CTTTTGGGCAACTTCAAGGATAAGGAACGCACCACAATTGCACAGCAAGT



CAAAGGGAAGAAGGTCTGGATAGGAATCAAGAAGTTACTAATGCTGATC



GAGATGTCCCTACAGATGGATCCTGAATACCGTGTGAGAAAATTCTTGGC



CCTCTTAAGAGAAGAAGGAGCTAGCCCCCTTGATTTTGATTGA





TRIM37->
ATGGATGAACAGAGCGTGGAGAGCATTGCTGAGGTTTTCCGATGTTTCAT
SEQ ID


BCAS3
TTGTATGGAGAAATTGCGGGATGCACGCCTGTGTCCTCATTGCTCCAAAC
NO: 99



TGTGTTGTTTCAGCTGTATTAGGCGCTGGCTGACAGAGCAGAGAGCTCAA



TGTCCTCATTGCCGTGCTCCACTCCAGCTACGAGAACTAGTAAATTGTCGT



TGGGCAGAAGAAGTAACACAACAGCTTGATACTCTTCAACTCTGCAGTCT



CACCAAACATGAAGAAAATGAAAAGGACAAATGTGAAAATCACCATGAA



AAACTTAGTGTATTTTGCTGGACTTGTAAGAAGTGTATCTGCCATCAGTGT



GCACTTTGGGGAGGAATGCATGGCGGACATACCTTTAAACCTTTGGCAGA



AATTTATGAGCAACACGTCACTAAAGTGAATGAAGAGGTAGCCAAACTT



CGTCGGCGTCTCATGGAACTGATCAGCTTAGTTCAAGAAGTGGAAAGGA



ATGTAGAAGCTGTAAGAAATGCAAAAGATGAGCGTGTTCGGGAAATTAG



GAATGCAGTGGAGATGATGATTGCACGGTTAGACACACAGCTGAAGAAT



AAGCTTATAACACTGATGGGTCAGAAGACATCTCTAACCCAAGAAACAG



AGCTTTTGGAATCCTTACTTCAGGAGGTGGAGCACCAGTTGCGGTCTTGT



AGTAAGAGTGAGTTGATATCTAAGAGCTCAGAGATCCTTATGATGTTTCA



GCAAGTTCATCGGAAGCCCATGGCATCTTTTGTTACCACTCCTGTTCCACC



AGACTTTACCAGTGAATTAGTGCCATCTTACGATTCAGCTACTTTTGTTTT



AGAGAATTTCAGCACTTTGCGTCAGAGAGCAGATCCTGTTTACAGTCCAC



CTCTTCAAGTTTCAGGACTTTGCTGGAGGTTAAAAGTTTACCCAGATGGA



AATGGAGTTGTGCGAGGTTACTACTTATCTGTGTTTCTGGAGCTCTCAGCT



GGCTTGCCTGAAACTTCTAAATATGAATATCGTGTAGAGATGGTTCACCA



GTCCTGTAATGATCCTACAAAAAATATCATTCGAGAATTTGCATCTGACT



TTGAAGTTGGAGAATGCTGGGGCTATAATAGATTTTTCCGTTTGGACTTA



CTCGCAAATGAAGGATACTTGAATCCACAAAATGATACAGTGATTTTAAG



GTTTCAGGTACGTTCACCAACTTTCTTTCAAAAATCCCGGGACCAGCATT



GGTACATTACTCAGTTGGAAGCTGCACAGACTAGTTATATCCAACAAATA



AACAACCTTAAAGAGAGACTTACTATTGAGCTGTCTCGAACTCAGAAGTC



AAGAGATTTGTCACCACCAGATAACCATCTTAGCCCCCAAAATGATGATG



CTCTGGAGACACGAGCTAAGAAGTCTGCATGCTCTGACATGCTTCTCGAA



GGTGGTCCTACTACAGCTTCTGTAAGAGAGGCCAAAGAGGATGAAGAAG



ATGAGGAGAAGATTCAGAATGAAGATTATCATCACGAGCTTTCAGATGG



AGATCTGGATCTGGATCTTGTTTATGAGGATGAAGTAAATCAGCTCGATG



GCAGCAGTTCCTCTGCTAGTTCCACAGCAACAAGTAATACAGAAGAAAAT



GATATTGATGAAGAAACTATGTCTGGAGAAAATGATGTGGAATATAACA



ACATGGAATTAGAAGAGGGAGAACTCATGGAAGATGCAGCTGCTGCAGG



ACCCGCAGGTAGTAGCCATGGTTATGTGGGTTCCAGTAGTAGAATATCAA



GAAGAACACATTTATGCTCCGCTGCTACCAGTAGTTTACTAGACATTGAT



CCATTAATTTTAATACATTTGTTGGACCTTAAGGACCGGAGCAGTATAGA



AAATTTGTGGGGCTTACAGCCTCGCCCACCTGCTTCACTTCTGCAGCCCAC



AGCATCATATTCTCGAAAAGATAAAGACCAAAGGAAGCAACAGGCAATG



TGGCGAGTGCCCTCTGATTTAAAGATGCTAAAAAGACTCAAAACTCAAAT



GGCCGAAGTTCGATGTATGAAAACTGATGTAAAGAATACACTTTCAGAA



ATAAAAAGCAGCAGTGCTGCTTCTGGAGACATGCAGACAAGCCTTTTTTC



TGCTGACCAGGCAGCTCTGGCTGCATGTGGAACTGAAAACTCTGGCAGAT



TGCAGGATTTGGGAATGGAACTCCTGGCAAAGTCATCAGTTGCCAATTGT



TACATACGAAACTCCACAAATAAGAAGAGTAATTCGCCCAAGCCAGCTC



GATCCAGTGTAGCAGGTAGTCTATCACTTCGAAGAGCAGTGGACCCTGGA



GAAAATAGTCGTTCAAAGGGAGACTGTCAGACTCTGTCTGAAGATACATC



AAGAAATCTGGAATTTCATGAAATACATAGTACTGGGAATGAACCGCCTT



TGTTGATTATGATTGGCTACAGTGATGGAATGCAGGTCTGGAGCATCCCT



ATCAGTGGTGAAGCACAAGAGCTCTTCTCTGTTCGACATGGCCCAATTCG



AGCGGCTAGAATCTTGCCTGCTCCACAGTTTGGTGCTCAAAAATGTGATA



ACTTTGCTGAAAAAAGACCCCTCCTTGGTGTTTGTAAGAGCATTGGATCT



TCTGGCACAAGCCCACCGTACTGTTGTGTGGATCTGTATTCACTTCGTACT



GGGGAGATGGTCAAGTCCATTCAATTTAAGACACCTATTTATGATCTCCA



TTGCAATAAACGGATCCTTGTCGTAGTCTTGCAGGAGAAAATTGCTGCCT



TTGATAGCTGTACTTTCACGAAGAAATTCTTTGTTACAAGCTGCTATCCAT



GTCCAGGGCCAAACATGAATCCTATTGCTCTTGGGAGCCGCTGGCTTGCT



TATGCAGAAAACAAGTTGATTCGATGTCATCAGTCCCGTGGTGGAGCCTG



TGGAGACAACATTCAGTCTTATACTGCCACAGTCATTAGTGCTGCTAAAA



CATTGAAAAGTGGCCTGACAATGGTAGGGAAAGTGGTGACTCAGCTGAC



AGGCACACTGCCTTCAGGTGTGACAGAAGATGATGTTGCCATCCACAGTA



ATTCACGGCGGAGTCCTTTGGTCCCAGGCATCATCACAGTTATTGACACC



GAAACCGTTGGAGAGGGCCAGGTGCTTGTGAGTGAGGATTCTGACAGTG



ATGGCATTGTGGCCCACTTCCCTGCCCATGAGAAGCCAGTGTGCTGCATG



GCTTTTAATACAAGTGGAATGCTTCTAGTCACAACAGACACCCTTGGCCA



TGACTTTCATGTCTTCCAAATTCTGACTCATCCTTGGTCCTCATCACAATG



TGCTGTCCACCATCTGTATACTCTTCACAGGGGAGAAACTGAAGCCAAAG



TACAGGACATCTGCTTCAGCCATGACTGTCGCTGGGTTGTGGTCAGTACT



CTCCGGGGTACTTCCCACGTTTTCCCCATCAACCCTTATGGTGGCCAGCCT



TGTGTTCGTACACATATGTCACCACGAGTAGTGAATCGCATGAGCCGTTT



CCAGAAAAGTGCTGGACTGGAAGAGATTGAACAAGAACTGACGTCTAAG



CAAGGAGGTCGCTGTAGCCCTGTTCCAGGTCTATCAAGCAGCCCTTCTGG



GTCACCCTTGCATGGGAAACTGAACAGCCAAGACTCCTATAACAATTTTA



CCAACAACAACCCTGGCAACCCTCGGCTCTCTCCTCTTCCCAGCTTGATG



GTAGTGATGCCTCTTGCACAAATCAAGCAGCCAATGACATTGGGGACCAT



CACCAAACGAACCGGCAAAGTTAAACCTCCTCCACAAATTTCACCCAGCA



AATCGATGGGCGGAGAATTTTGTGTGGCTGCTATCTTCGGAACATCCAGG



TCATGGTTTGCAAATAATGCAGGTCTGAAAAGAGAAAAAGATCAGTCCA



AACAAGTTGTAGTTGAGTCCCTGTACATTATCAGTTGCTATGGCACCTTA



GTGGAACACATGATGGAGCCGCGACCCCTCAGCACTGCACCCAAGATTA



GTGACGACACACCACTGGAAATGATGACATCGCCTCGAGCCAGCTGGAC



TCTGGTTAGAACCCCTCAATGGAATGAATTGCAGCCACCGTTTAATGCAA



ACCACCCTCTGCTCCTCGCTGCAGATGCAGTACAGTATTATCAGTTCCTGC



TTGCTGGCCTGGTTCCCCCTGGAAGTCCTGGGCCCATTACTCGACATGGG



TCTTACGACAGTTTAGCTTCTGACCATAGTGGACAGGAAGATGAAGAATG



GCTTTCCCAGGTTGAAATTGTAACACACACTGGACCCCATAGACGTCTGT



GGATGGGTCCACAGTTCCAGTTCAAAACCATCCATCCCTCAGGCCAAACC



ACAGTTATCTCATCCAGTTCATCTGTGTTGCAGTCTCATGGTCCGAGTGAC



ACGCCACAGCCTCTTTTGGATTTTGATACAGATGATCTTGATCTCAACAGT



CTCAGGATCCAGCCAGTCCGCTCTGACCCCGTCAGCATGCCAGGGTCATC



CCGTCCAGTCTCTGATCGAAGGGGAGTTTCCACAGTGATTGATGCTGCCT



CAGGTACCTTTGACAGGAGCGTGACCCTGCTGGAGGTGTGCGGGAGCTG



GCCTGAGGGCTTCGGGCTGCGGCACATGTCCTCCATGGAGCACACGGAG



GAGGGCCTCCGGGAGCGACTTGCCGACGCCATGGCCGAGTCACCTAGCC



GGGACGTCGTGGGATCCGGAACAGAACTTCAGCGAGAGGGAAGCATCGA



GACTCTGAGTAACAGCTCAGGCTCCACCAGCGGCAGCATACCAAGAAAC



TTTGATGGCTACCGATCTCCGCTGCCCACCAATGAGAGCCAGCCCCTCAG



CCTCTTCCCGACTGGCTTCCCGTAG





ZBTB46->
ATGAACAACCGAAAGGAAGATATGGAAATCACGTCCCACTACCGGCACC
SEQ ID


DNAJC5
TGCTGCGGGAGCTCAACGAGCAGAGGCAGCACGGCGTCCTGTGCGACGT
NO: 100



CTGCGTGGTCGTGGAGGGCAAGGTCTTCAAGGCGCACAAGAACGTCCTG



CTGGGCAGCAGCCGCTACTTCAAGACGCTCTACTGCCAGGTGCAGAAGAC



GTCGGAGCAGGCCACGGTCACGCACCTGGACATCGTCACGGCCCAGGGC



TTCAAGGCCATCATCGACTTCATGTACTCAGCGCACCTGGCGCTCACCAG



CAGGAACGTCATCGAGGTGATGTCAGCCGCCAGCTTCCTGCAGATGACGG



ACATCGTGCAAGCCTGCCACGACTTCATCAAGGCGGCGCTGGACATCAGC



ATCAAGTCGGACGCCTCAGATGAGCTTGCGGAGTTCGAGATCGGCGCCTC



GTCCAGCAGCAGCACGGAAGCTCTCATCTCGGCCGTGATGGCTGGGAGG



AGCATCTCCCCGTGGCTGGCACGGCGAACGAGTCCTGCCAATTCTTCCGG



AGACTCGGCCATCGCCAGCTGTCACGACGGAGGGAGCAGCTACGGGAAA



GAGGATCAGGAGCCCAAGGCCGATGGCCCTGATGATGTTTCTTCACAGCC



TCTATGGCCTGGAGACGTGGGCTACGGGCCTCTGCGCATCAAGGAAGAG



CAGGTTTCACCGTCTCAGTACGGAGGGAGCGAGCTGCCTTCTGCCAAGGA



CGGTGCAGTACAGAACTCTTTCTCAGAGCAGAGTGCTGGTGATGCCTGGC



AGCCCACGGGCCGAAGGAAGAATCGGAAAAACAAAGAGACCGTCCGGC



ACATCACACAGCAGGTGGAAGATGACAGCCGGGCCAGCTCCCCGGTGCC



GTCCTTCCTGCCGACGTCGGGGTGGCCGTTCAGCAGCCGAGACTCAAATG



GCAGACCAGAGACAGCGCTCACTGTCTACCTCTGGGGAGTCATTGTACCA



CGTCCTTGGGTTGGACAAGAACGCAACCTCAGATGACATTAAAAAGTCCT



ATCGGAAGCTTGCCTTGAAATATCACCCCGACAAGAACCCCGACAACCCG



GAGGCCGCGGACAAGTTTAAGGAGATCAACAACGCGCACGCCATCCTCA



CGGACGCCACAAAAAGGAACATCTACGACAAGTACGGCTCGCTGGGTCT



CTACGTGGCCGAGCAGTTTGGGGAAGAGAACGTGAACACCTACTTCGTGC



TGTCCAGCTGGTGGGCCAAGGCCCTGTTTGTCTTCTGCGGCCTCCTCACGT



GCTGCTACTGCTGCTGCTGTCTGTGCTGCTGCTTCAACTGCTGCTGCGGGA



AGTGTAAGCCCAAGGCGCCTGAAGGCGAGGAGACGGAGTTCTACGTGTC



CCCCGAGGATCTGGAGGCACAGCTGCAGTCTGACGAGAGGGAGGCCACA



GACACGCCGATCGTCATACAGCCGGCATCCGCCACCGAGACCACCCAGCT



CACAGCCGACTCCCACCCCAGCTACCACACTGACGGGTTCAACTAA



























TABLE C







Primer
Amplicon

SEQ

SEQ

SEQ

SEQ


Fusion
Fusion
design
length
Forward
ID

ID
Reverse
ID

ID


genes
junction
method
(bp)
primer
NO:
Probe
NO:
primer
NO:
Amplicon
NO:


























ABI1->PDSS1#
−chr10:
Manual
88
GCAGACT
101
ACATA
102
TTCCGT
103
GCAGACTACTGT
104



27149676->


ACTGTGA

CAGGT

ACTGA

GAAAACAACTAC



+chr10: 27024169


AAACAACT

CTCTGT

TAGGC

ATACAGGTCTCT








TCTAG

GAT

GTTCTAGGATGT








GATGT



CCCGACCCAGTG








CCCG



GTGCATGAGATC












GCCTATCAGTAC












GGAA





ACACA->MSI2
−chr17:
Primer3
76
TCCACAT
105
TTGTTC
106
GCAGA
107
TCCACATGAACA
108



35536201->


GAACAG

TTGTG

TAACC

GGCTTCCAGGAG



+chr17: 55478740


GCTTCC

ACCAT

CGCCT

GAAATGAGATGG








CTCATT

ACAA

TCACAAGAACAA








TCCTCC



AGAAAATATTTG












TAGGCGGGTTAT












CTGC





ADK->C10orf11
+chr10: 75984349->
Primer3
73
TGACCAA
109
CAGTC
110
CTCCA
111
TGACCAAATCTT
112



+chr10: 77795766


ATCTTGG

CTTCC

GGCTC

GGCTGAAGACAA






CTGAAG

AGTGA

CTGAA

ACACAAGGAACT








CAGTT

TGC

GTCACTGGAAGG








CCTTGTG



ACTGAGCGCATT












CAGGAGCCTGGAG





ARNT2->MESDC2
+chr15:
Primer3
85
GGACTTC
113
TCTTTC
114
TGTGC
115
GGACTTCGATGA
116



80750317->−chr15:


GATGATG

TTGAA

TCTGG

TGAAGATGGTGA



81274523


AAGATGG

AATTT

AAGAT

AGGCCCCAGTAA








ACTGG

CTCCTT

ATTTTCAAGAAA








GGCCT



GATGATGACATT








TCA



GAAGAAGGAGAT












CTTCCAGAGCACA





ATRX->RPS6KA6
−chrX:
Manual
79
AAACGTA
117
ACCAT
118
CCATT
119
AAACGTATTGCT
120



76907604


TTGCTGA

TTACCT

GGCTC

GAGAGGGAGCGT



->−chrX:


GAGGGA

CTCTC

ATCAA

GAGCGAGAAAAA



83419395




AATTTT

CCATTT

TTGAGAGAGGTA








TCTCG

TAA

AATGGTCTTAAA








CTCAC



ATGGTTGATGAG








GC



CCAATGG





BCL7A->C12orf42
+chr12:
Manual
85
AACAGTT
121
TTGAC
122
CCTTTG
123
AACAGTTCCTCC
124



122473333->−chr12:


CCTCCCC

AAGTT

TTTCAT

CCAGGGATGATG



103872225


AGGGA

CAACT

ACATA

GACATGCATGGG








CCCAT

TCACT

AGTTGAACTTGT








GCATG

GTAGAC

CAAATTAATGTC








TCCATC



TACAGTGATATG












TATGAAACAAAGG





CDC42BPB->PET112
−chr14:
Manual
75
CCTGCGC
125
CCTCA
126
TCTTGC
127
CCTGCGCCGCGA
128



103523336->−chr4:


CGCGACA

AACAC

CTTCCC

CAAGTACGTGGC



152594048


AGTA

CCCAC

TCTTCC

CGAGTTCCTCGA








TCGAG

ACAG

GTGGGGTGTTTG








GAACT



AGGAACTGTGGA








CGGCC



AGAGGGAAGGCA












AGA





CLEC16A->BCAR4
+chr16:
Primer3
76
CTGACTC
129
TGTCA
130
GATAG
131
CTGACTCGGGAG
132



1154879->−chr16:


GGGAGG

GATCC

GTTGG

GAGGACCTGATC



11914154


AGGAC

AGGAC

TACAT

AAGACTGATGAT








ATCAT

GGTGA

GTCCTGGATCTG








CAGTC

TTTT

ACAAAAAATCAC








TTG



CATGTACCAACC












TATC





CLTB->CDHR2
−chr5:
Primer3
82
ACATGGG
133
CTGGG
134
AGGGT
135
ACATGGGGACCA
136



175837258->


GACCACA

CACCT

CATTG

CAGTCAATGGAG



+chr5:


GTCAA

GAAAC

TCCTG

ATGTGTTTCAGGT



175995679




ACATC

GTCTT

GCCCAGGCCTTC








TCCA



TGGTTGGTAGCG












GAAGACCAGGAC












AATGACCCT





CREB1->TMEM131
+chr2:
Manual
77
GGTGCCA
137
ACTGA
138
GCAGT
139
GGTGCCAACTCC
140



208435045->−chr2:


ACTCCAA

ACGAA

ACTTCT

AATTTACCAAAC



98543950


TTTACCA

TGTAT

ATTAT

TAGCAGTGGACA






AA

ACTGT

GCTCT

GTATACATTCGTT








CCACT

CTG

CAGTCAGAGAGC








GCTAG



ATAATAGAAGTA












CTGC





DDX5->IQCG
−chr17:
Primer3
79
CCCAAGT
141
CGAAG
142
TTGCA
143
CCCAAGTTGCTTC
144



62496667->−chr3:


TGCTTCA

ACAGA

GTTGG

AGTTGGTCGAAG



197640913


GTTGGT

GGTTC

TCCTTG

ACAGAGGTTCAG








AGAGT

AGGT

AGTCAGAATGAG








CA



TATATTGCTAACC












TCAAGGACCAAC












TGCAA





DLG5->ADK
−chr10:
Primer3
69
ACCAGAA
145
AATCA
146
CTGCTT
147
ACCAGAAGGAGA
148



79613112->


GGAGATC

TCCAC

TGTGT

TCGGTGACCTCC



+chr10: 76153899


GGTGAC

TGCTG

GGCTG

GTGCCCAGCAGC








CTGCT

TTG

AGCAGTGGATGA








GGG



TTCAACAGCCAC












ACAAAGCAG





DNMBP->TACC2
−chr10:
Manual
74
TGCCTGC
149
AATCG
150
CGGGG
151
TGCCTGCCGCGC
152



101769595->


CGCGCCGA

GAACT

TCTCA

CGAGGGACCGCC



+chr10: 123954555




CCTGC

AATGC

GGGCGGCGGAAA








TTTCCG

CTCTTC

GCAGGAGTTCCG








CCGC



ATTCTGAAGAGG












CATTTGAGACCC












CG





EIF2C3->ZP2
+chr1:
Manual
79
CCTACAG
153
AGGAA
154
GTTTTC
155
CCTACAGCTTATT
156



36492899->−chr16:


CTTATTA

TTATC

CCCAT

ATCGTCATCCTGC



21212879


TCGTCAT

ATACA

AAGGT

CGGGGAAGACAC






CC

CTGGT

TGTTGG

CAGTGTATGATA








GTCTTC



ATTCCTACCAAC








CCCGGC



AACCTTATGGGG












AAAAC





EIF4A3->TSPEAR
−chr17:
Manual
82
ACGTTCG
157
TCGTTC
158
CACCA
159
ACGTTCGACACC
160



78120592->−chr21:


ACACCAT

CTCGT

CCGTC

ATGGGCCTGCGG



45953806


GGGC

AAGCG

AGCAG

GAGGACCTGCTG








TAGAT

GTA

CGGGGCATCTAC








GCCCC



GCTTACGAGGAA












CGAGTACCTGCT












GACGGTGGTG





ERBB2->IKZF3
+chr17:
Manual
76
TTTGGGA
161
CATTG
162
TCTTTC
163
TTTGGGAGCCTG
164



37868701->−chr17:


GCCTGGC

AATCA

ACTGT

GCATTTCTGCCG



37949186


ATTT

TCCAT

ATTCA

GAGAGCTTTGAT








CAAAG

TCTTTC

GGATGATTCAAT








CTCTCC

AC

GAAAGTGAAAGA








GGC



TGAATACAGTGA












AAGA





ESR1->AKAP12
+chr6:
Manual
78
GCTCCGC
165
AGTCT
166
GAGTC
167
GCTCCGCAAATG
168



152201906->


AAATGCT

CTCTGT

TCTTTT

CTACGAAGTGGG



+chr6:


ACGAAG

CCAAC

GCTCA

AATGATGAAAGG



151669846




ACCTTT

CATCTT

TGTTGGACAGAG








CATCA

CA

AGACTCTGAAGA








TTCCC



TGTGAGCAAAAG












AGACTC





ESR1->AKAP12
+chr6:
Primer3
70
ATGATCA
169
GGTGC
170
CGGAG
171
ATGATCAACTGG
172



152265643->


ACTGGGC

CAGTT

TCTCTT

GCGAAGAGGGTG



+chr6:


GAAGAG

GGACA

TTGCTC

CCAGTTGGACAG



151669846




GAGAG

ACA

AGAGACTCTGAA












GATGTGAGCAAA












AGAGACTCCG





ESR1->C6orf211*
+chr6:
Manual
126
ACCTGGA
173
CGCCG
174
CCTGT
175
ACCTGGAGAACG
176



152129499->


GAACGA

GCATT

GACCC

AGCCCAGCGGCT



+chr6:


GCCCA

CTACA

ATAGA

ACACGGTGCGCG



151785588




GTCCA

AATTTT

AGGCCGGCCCGC








CC

GC

CGGCATTCTACA












GTCCACCAATCG












ATTACTTTGATGT












ATTTAAAGAATC












AAAAGAGCAAAA












TTTCTATGGGTCA












CAGG





ESR1->C6orf211**
+chr6:
Manual
82
TACACGG
177
CCGGC
178
GCTCTT
179
TACACGGTGCGC
180



152129499->


TGCGCGAG

ATTCT

TTGATT

GAGGCCGGCCCG



+chr6:




ACAGT

CTTTA

CCGGCATTCTAC



151785588




CCACC

AATAC

AGTCCACCAATC








AATCGA

ATCAA

GATTACTTTGATG










AGTA

TATTTAAAGAAT












CAAAAGAGC





FAM135A->PKIB
+chr6:
Primer3
80
AGCTCCT
181
ATAGC
182
TCATTT
183
AGCTCCTCCGTTC
184



71123405->


CCGTTCG

AACAT

TTGAT

GACAGGCGGGGG



+chr6:


ACAG

CTCTC

GAATC

AAGAGGCCGAGC



123038932




GCCCG

TGTCCTC

CGGGCGAGAGAT








GCT



GTTGCTATGAGG












ACAGATTCATCA












AAAATGA





FBXL20->NSF
−chr17:
Manual
84
GGAACAT
185
TTTAC
186
TCCAT
187
GGAACATTGAAG
188



37453380->


TGAAGTA

AACGC

ATAAC

TACTGAATCTAA



+chr17: 44751780


CTGAATC

GTCTG

AGGAT

ATGGGTGTACAA






TAA

TTGTCT

GCCTTT

AGACAACAGACG








TTGTA

AAC

CGTTGTAAACAT








CACCC



GTTAAAGGCATC












CTGTTATATGGA





FBXW7->MLL3
−chr4:
Primer3
72
TCCTCCC
189
AGTTTT
190
CTGTC
191
TCCTCCCCATTCT
192



153332455


CATTCTA

CCCCC

CTCAT

ATACAAAAACAA



->−chr7:


TACAAAA

TACTTC

CTTCC

CAAAAACCTCGA



152055760


AC

GAGGT

ACTGC

AGTAGGGGGAAA








TTTTGTT



ACTGCAGTGGAA












GATGAGGACAG





GBAS->PCLO
+chr7:
Manual
92
TCCTGCA
193
ATTCTT
194
CTGAT
195
TCCTGCAGCGGG
196



56032394->−chr7:


GCGGGCG

GAATC

ATTGC

CGGCCCCCTGCA



82595803


GC

GGAGC

TCTCTG

GCCTCCTGCCCA








CTGGG

GGTTT

GGCTCCGATTCA








CAGGA

GGC

AGAATGGCTTTG








GGCTG



TTTAAATTGCCA












AACCCAGAGAGC












AATATCAG





GREB1->MBOAT2
+chr2:
Manual
80
TTGGTCT
197
TGGCA
198
CTGCT
199
TTGGTCTGTGGA
200



11680234->−chr2:


GTGGAGT

CACTA

AGCAA

GTGCCTGAAGTG



9098771


GCCTGA

CAAAG

GGCAA

ACCAGCTTTTTGT








TTGAC

AGAG

AAGGTCAACTTT








CTTAC



GTAGTGTGCCAA








AAAAA



CTCTTTGCCTTGC








GCTGG



TAGCAG





HHATL->GRB2
−chr3:
Manual
77
TGGGCAG
201
CCTTG
202
CTGCC
203
TGGGCAGTGTGC
204



42744071


TGTGCCA

CGGCC

TTGTA

CAGGGTCCCTTG



->−chr17:


GGGT

TCCTC

CCAGT

CGGCCTCCTCAA



73328878




AAGGT

TCTGA

GGTTTTGAACGA








TTTGA

TCAC

AGAATGTGATCA








ACGAA



GAACTGGTACAA












GGCAG





IGF1R->DCC
+chr15: 99442850->
Primer3
74
AAAACCT
205
TGAAA
206
AAGCG
207
AAAACCTTCGCC
208



+ch18: 50278424


TCGCCTC

ACCCT

CAGTG

TCATCCTAGGAG






ATCCTA

TCTAG

CTGTG

AGGAGCAGCTAG








CTGCT

AAA

AAGGGTTTTCAA








CCTCTCC



ATTAAAGCTTTC












ACAGCACTGCGC












TT





KIF16B->PCSK2
−chr20:
Manual
81
TGAGCGA
209
TCCGG
210
AGTGG
211
TGAGCGATGGCA
212



16553874->


TGGCATC

CCCAT

TACAG

TCGGTCAAGGTG



+chr20: 17240885


GGTC

GAATC

ACCTT

GCCGTGAGGGTC








GCAGC

CAGC

CGGCCCATGAAT








TTCCCT



CGCAGCTTCCCTT








TT



TGCTGAAGGTCT












GTACCACT





LRP5->KAT6A
+chr11:
Manual
91
GCTGCTG
213
CCCGG
214
CCCCA
215
GCTGCTGGCGCT
216



68080273->−chr8:


GCGCTGT

CCCCC

AGAAA

GTGCGGCTGCCC



41907225


GCGGCT

GCCGC

CTAGT

GGCCCCCGCCGC








GGGAT

CAGCA

GGGATTCTTTCTA








TCTTTC

CTTCA

CTAATCCAGATA








TAC

AC

CTTGTTGAAGTG












CTGACTAGTTTCT












TGGGG





LRP5->SLC22A24
+chr11:
Primer3
63
CAACGGC
217
TCATG
218
GGACT
219
CAACGGCAGGAC
220



68133170->−chr11:


AGGACGT

GTGGA

GCATC

GTGTAAGGCAGC



62863578


GTAA

TCTCA

CAACT

TTGTGAGATCCA








CAAGC

CCTTC

CCATGAAGAAGG








TGCC



AGTTGGATGCAG












TCC





LRP8->TMEM48
−chr1:
Manual
84
AGAAGG
221
CCGGC
222
GGTTC
223
AGAAGGACTGCG
224



53746259


ACTGCGA

TGTGC

ATAGC

AGGGTGGAGCGG



->−chr1:


GGGTG

TACCT

AGTGC

ATGAGGCCGGCT



54275419




GCTAT

TAATC

GTGCTACCTGCT








ATTCC

CAAG

ATATTCCCAAAG








CAA



CTTGGATTAGCA












CTGCTATGAACC





LUC7L3->HNF1B
+chr17:
Manual
70
AGAAGC
225
TGGGA
226
GTGGT
227
AGAAGCGCAGCA
228



48797192->−chr17:


GCAGCAA

CCACG

GTGTG

ACGTGCGGTGGG



36047395


CGTGC

AGAGC

GGCAT

ACCACGAGAGCT








TGTCCT

CAC

GTCCTCTACAAG








CTACA



CCTGGTGATGCC








AGCC



CACACACCAC





NFYA->TDRG1
+chr6:
Manual
82
CTGGAGC
229
TGACC
230
CTCTTC
231
CTGGAGCCAATC
232



41040823->


CAATCAG

GTTCC

ATTGT

AGCGCGGGCAGC



+chr6:


CGCGGG

GTGCC

AGCTT

GAACCGGGGGAG



40347021




TCGCT

GATCC

CGAGGCACGGAA








CCCCC

TGCGC

CGGTCACTGCGC








GGTTC



AGGATCAAGCTA












CAATGAAGAG





PDE4D->
−chr5:
Manual
78
CATGATG
233
TGTCC
234
CTCAC
235
CATGATGTAGAT
236


ITGA1
58284320->


TAGATCA

AATCA

AGAGT

CATCCTGGTGTGT



+chr5:


TCCTGGTG

ATTTCT

CCTGA

CCAATCAATTTCT



52218607




GATCA

AAGTCA

GATCAATACAAG








ATACA



ACAAGCATGACT








AGACA



TTCAGGACTCTGT








AGC



GAG





PGAP3->CACNB1
−chr17:
Primer3
75
CCCCCAT
237
TGATG
238
CCTCC
239
CCCCCATGTACC
240



37840850->−chr17:


GTACCAC

TCAAA

AATTG

ACACCTGTGTGG



37333788


ACCT

CATTTC

GTTCTC

CCTTCGCCTGGG








CCAGG

ATCC

AAATGTTTGACA








CGA



TCATCCTGGATG












AGAACCAATTGG












AGG





PIK3C3->RPRD1A
+chr18:
Primer3
83
CAGGCAC
241
AAAAA
242
TGACA
243
CAGGCACCTGGA
244



39629569->−chr18:


CTGGATA

CAGCC

TCATT

TAACCTTTTGCTA



33613800


ACCTTT

AAACC

GGCTA

ACAAAAACAGCC








AAACA

GGTAG

AAACCAAACAGG








GGAAGC

AGA

AAGCTTACTTTTC












TCTACCTAGCCA












ATGATGTCA





PPP1R12C->
−chr19:
Manual
74
GTACTGA
245
CCCTG
246
AGCGG
247
GTACTGAGCCTG
248


IFITM10
55610152->−chr11:


GCCTGTT

GGCGT

GGCTG

TTGGAGGAACTG



1769349


GGAGGA

CCTCCT

GGCAC

GCCCGGAAACAG






ACTGG

GTTTCC

TGG

GAGGACGCCCAG








GG



GGCCCCGGCCAG












TGCCCAGCCCCG












CT





PPP2R1A->NLRP8*
+chr19: 52709316->
Primer3
84
AACAGCT
249
CAGAG
250
ATGCA
251
AACAGCTGGGAA
252



+chr19: 56473433


GGGAACC

TACGT

GCCCA

CCTTCACTACCCT






TTCACT

GCACT

TTGCTC

GGTGGGAGGCCC








GCCTG

TC

AGAGTACGTGCA








CTGA



CTGCCTGCTGAG












CGCCAGAGAGCA












ATGGGCTGCAT





PPP2R1A->NLRP8**
+chr19: 52709316->
Manual
89
AACCTTC
253
ATTGC
254
GTCTT
255
AACCTTCACTAC
256



+chr19: 56473433


ACTACCC

TCTCTG

GCCAC

CCTGGTGGGAGG






TGGTGGG

GCGCT

CAACG

CCCAGAGTACGT






AG

CAGCA

ATGCA

GCACTGCCTGCT








GGCAG

GC

GAGCGCCAGAGA








TGCAC



GCAATGGGCTGC








GTAC



ATCGTTGGTGGC












AAGAC





PPP2R2D->PANK1
+chr10:
Manual
84
CTTTGAC
257
CGGCT
258
CATTTT
259
CTTTGACAAGTTT
260



133761295->


AAGTTTG

CCAAA

GAACA

GAGTGTTGCTGG



−chr10: 91344222


AGTGTTG

ATAAC

GTTCC

AACGGTTCGGAT






CTG

CCTAT

AACAG

AGGGTTATTTTG








CCGAA

TGC

GAGCCGTTGGGG








CCG



CACTGTTGGAAC












TGTTCAAAATG





PREX1->SLC9A8
−chr20:
Primer3
81
GCAGATG
261
CTGCA
262
CATGA
263
GCAGATGGAGAA
264



47324798->


GAGAAG

GTCCC

GTTGT

GCTGGAAGCCCT



+chr20: 48431545


CTGGAAG

ACATC

ATTGG

GGAGCAGCTGCA








GAAG

GGAACC

GTCCCACATCGA












AGGCTGGGAGGA












GGTTCCCCAATA












CAACTCATG





RABEP1->DNAH9
+chr17: 5250220->
Primer3
69
ACTGCGG
265
AGGGC
266
CCACC
267
ACTGCGGAAAGA
268



+chr17: 11532734


AAAGAAT

CGTCTT

TTTCCC

ATTGCATGAAGG






TGCAT

CAGAA

AGTTT

GTCTTCTGAAGA








GACCC

GTG

CGGCCCTGGATT








TTC



TCCACAAACTGG












GAAAGGTGG





RAD21->FER1L6
−chr8:
Primer3
85
GCAGCGT
269
AGCCC
270
ACCCC
271
GCAGCGTGGAGA
272



117878825->


GGAGAGT

AAACA

TTCTCT

GTATCATCTCACC



+chr8:


ATCATC

TCCCCT

GCCTT

AAAGAAAGGGGA



124968232




TTCTTT

ATTTC

TGTTTGGGCTGA








GGTGA



AGGTGAAGAAGA












AGAGAAATAAGG












CAGAGAAGGGGT





RAF1->NKIRAS1
−chr3:
Manual
77
GCCGAAC
273
TCTTCC
274
CATGT
275
GCCGAACGACAG
276



12705312


GACAGG

ATTCCT

ATACA

GACGTTGGGGCG



->−chr3:


ACGTT

GAGGG

TCTTCC

GCCTGGCTCCCTC



23942540




AGCCA

ATTGTT

AGGAATGGAAGA








GGCC

TCGC

TTGCGAAACAAT












GGAAGATGTATA












CATG





RERG->GZMM
−chr12:
Manual
78
GGAGGTC
277
AAAGG
278
GCCCC
279
GGAGGTCAAACT
280



15370363->


AAACTGG

AGCTG

CGATG

GGCAATATTTGG



+chr19: 547280


CAATATT

CCTGA

ATCTG

GAGAGCAGGCGT






TGGG

CTTGC

GGT

GGGCAAGTAGGC








CCACG



AGCTCCTTTGGG








CC



ACCCAGATCATC












GGGGGC





RIMS2->DPYS
+chr8:
Primer3
78
TGTGGAG
281
CCATTT
282
GCTGG
283
TGTGGAGGTCGA
284



104709524->−chr8:


GTCGAGT

TACCA

TAACT

GTGTCATTACGCT



105436617


GTCATT

CTATG

GCCAC

CAAACAAGCATA








CTTGTT

AAATC

GTGGTAAAATGG








TGAGCG



ATGAAAACAGAT












TTGTGGCAGTTA












CCAGC





RREB1->DSP
+chr6:
Primer3
85
AGACTCG
285
CTTATC
286
TGGAT
287
AGACTCGCAGGA
288



7108001->


CAGGAGC

AGGTC

GGTGT

GCAACACGTGAT



+chr6:


AACA

AAACC

TCTGG

GTGTCTACTTATC



7555951




GGCAC

TTCTG

AGGTCAAACCGG








GATG



CACGATGTCCAG












GCACCAGAACCA












GAACACCATCCA





RTN3->ANK1*
+chr11:
Manual
120
TCCTTCG
289
CCCAT
290
GCCAG
291
TCCTTCGGAGCC
292



63449250->−chr8:


GAGCCGA

TCCGC

ATGCA

GAGCCGTCCGCG



41591587


GCCG

ACAGG

AGCCA

CCCGGCGGCGGC








AGGAG

TTC

GGGAGCCCAGGA








CTGC



GCCTGCCCCGCC












CTGGGGACGAAG












AGCTGCAGCTCC












TCCTGTGCGGAA












TGGGTTGAATGG












CTTGCATCTGGC





RTN3->ANK1**
+chr11:
Manual
80
CCCCGCC
293
CAACC
294
GCCTT
295
CCCCGCCCTGGG
296



63449250->−chr8:


CTGGGGA

CATTC

CCTTA

GACGAAGAGCTG



41591587


CGAA

CGCAC

GAAGC

CAGCTCCTCCTGT








AGGAG

CAGAT

GCGGAATGGGTT








GAGCT

GCAAG

GAATGGCTTGCA








GCAGC

CC

TCTGGCTTCTAAG












GAAGGC





SEMA4C->BRE
−chr2:
Primer3
68
ACACTTC
297
AATAA
298
ACTGA
299
ACACTTCAGGCA
300



97527316->


AGGCATC

GCCCT

GGGAC

TCTGCAACCTCC



+chr2:


TGCAAC

TTCTTA

AAAGG

GTGGCAGTAAGA



28561317




CTGCC

TTTTGA

AAGGGCTTATTT








ACGGAG



CAAAACCTTTGT












CCCTCAGT





SEMA4C->RBMS1
−chr2:
Primer3
83
GACACTT
301
CCTCC
302
AGCCT
303
GACACTTCAGGC
304



97527316


CAGGCAT

GTGGC

TCACA

ATCTGCAACCTC



->−chr2:


CTGCAA

AGTAA

CCCTTC

CGTGGCAGTAAG



161131275




GAAAG

TTCA

AAAGATGTACAG












AAAGGTGTTCTT












ACATGAAGAAGG












GTGTGAAGGCT





SFXN1->CAMK4
+chr5:
Manual
85
AGCGGG
305
CATGT
306
TGGTTT
307
AGCGGGACCTGC
308



174905642->


ACCTGCG

AGATA

GAGAT

GAGCAGCGCGGG



+chr5:


AGCAG

CGCTT

CACGA

CGGCAGCCCGGG



110782384




CCCCC

TGGAC

GGAAGCGTATCT








GGGCT

AATCCC

ACATGAAAATGG








GCCGC



GATTGTCCATCGT








CC



GATCTCAAACCA





SNX9->RAB2A
+chr6:
Manual
86
TCCCGGG
309
CGCCC
310
CTCCTT
311
TCCCGGGCCGGG
312



158244478->


CCGGGG

GCCAT

CTTGA

GGACCCGCCCGC



+chr8:




GGCCA

ATTTTT

CATGGCCACCAA



61531139




CCAAG

TCATA

GGCATTTATTAAT








GCATTT

AATTT

ACAGCAAAAGAA










CTTTTG

ATTTATGAAAAA










CTG

ATTCAAGAAGGAG





STAU1->TOP1
−chr20:
Manual
83
GAACTGA
313
CCTGG
314
GATCT
315
GAACTGAACAAA
316



47790732->


ACAAAG

AACGC

TTGTGT

GACAACATTGTT



+chr20: 39690034


ACAACAT

CCTCTT

TTATCT

CCTGGAACGCCC






TGTT

TTTAA

TTGTGT

TCTTTTTAAAAAA








AAAAG

TTAT

GATTCTCATAAA








ATTCTC



CACAAAGATAAA












CACAAAGATC





TANC2->RDM1
+chr17:
Manual
81
ACTGGTG
317
TCCAC
318
GATGT
319
ACTGGTGGGAAA
320



61086987->−chr17:


GGAAATC

AGCTA

CTTCA

TCAAGTCGTAAA



34247276


AAGTCGT

TTTTAC

CTGGG

AACAGGTCAAGT






AA

CACTTT

TCTGTA

GAAAGTGGTAAA








CACTT



ATAGCTGTGGAG








GACCTG



TACAGACCCAGT












GAAGACATC





TEX10->
−chr9:
Primer3
85
CCCTCGC
321
CGGCC
322
TTCAC
323
CCCTCGCTTGTCT
324


PICALM
103115054->


TTGTCTT

GGGTC

ATTCA

TCTCGGGCTTCTC



−chr11:


CTCG

CTCAG

TCTCAT

GCCCCGGCCGCG



85742653




ACTTA

TTGTGC

GCCGGGTCCTCA








ATTCA



GACTTAATTCAG












TGCACAAATGAG












ATGAATGTGAA





TFG->GPR128
+chr3:
Manual
75
TGCAATT
325
CTGAA
326
TCTGT
327
TGCAATTCAGTG
328



100438902->


CAGTGCA

ACTGA

AGGGG

CAGTAGGATACT



+chr3:


GTAGGAT

CATTA

TGCTT

GAAACTGACATT



100348442




TTTGG

GAT

ATTTGGAAAATC








AAAAT



TACTTCCTCATCA








CTACTT



AGCACCCCTACA








CC



GA





TPP2->BRCA2
+chr13: 103249553->
Primer3
83
CATCGCA
329
TCCGG
330
CCTAC
331
CATCGCAGTCCT
332



+chr13: 32890559


GTCCTGG

GCATG

GATAT

GGACACGGGGGT






ACAC

CAGAC

TCCTCC

CGACCCGGGGGC








TTATTT

AATGC

TCCGGGCATGCA








ACCA



GACTTATTTACCA












AGCATTGGAGGA












ATATCGTAGG





TRIM37->BCAS3
−chr17:
Manual
84
TGCGGTT
333
CCAGA
334
CATTC
335
TGCGGTTGAGAA
336



57092971->


GAGAAA

TTTCTT

CCAGT

AAGGAGGAAAAT



+chr17: 58786580


AGGAGG

GATGT

ACTAT

GGTCACCTTGGG








ATCCC

GTATTT

ATACATCAAGAA








AAGGT

CATGA

ATCTGGAATTTC








GACC

AA

ATGAAATACATA












GTACTGGGAATG





TRPS1->
−chr8:
Manual
87
GTGTTCT
337
AGGCT
338
CCACG
339
GTGTTCTTGACG
340


EIF3H
116680772


TGACGAT

CCGCA

TGAAG

ATTAATCAACAG



->−chr8:


TAATCAA

TCATTT

ATGAT

TCCAATATCAGA



117671219


CAG

CCATC

CAATG

TGGAAATGATGC








TGATA

TTTA

GGAGCCTTCGCC








TTGG



ATGTAAACATTG












ATCATCTTCACGT












GG





UCK2->TMCO1
+chr1:
Primer3
63
CCTTCCT
341
GGAAC
342
GGCCG
343
CCTTCCTTATAGG
344



165797169->−chr1:


TATAGGC

AGCTA

AGAAT

CGTCAGCGGGGG



165697358


GTCAGC

GCGGC

CTTCTG

AACAGCTAGCGG








AAG

AATG

CAAGAACATTCA












GAAGATTCTCGG












CC





UTP18->ACACA
+chr17:
Manual
86
GTCTATG
345
CGCCC
346
GCTGG
347
GTCTATGACATG
348



49354665->−chr17:


ACATGCT

GAGGA

TTATTG

CTGGCTGGAAAG



35487144


GGCTGGA

CCTCTC

GAGGT

TTAATTCCTGTGC






AA

ACTTG

GTACA

ATCAAGTGAGAG








ATGC

CTT

GTCCTCGGGCGG












GAAGTGTACACC












TCCAATAACCAGC





UTP23->RAD21
+chr8:
Primer3
64
TACCTCA
349
CAGCT
350
TCTGG
351
TACCTCATGGGG
352



117779030->−chr8:


TGGGGGA

GTGCA

CTGGC

GAGACGCAGCTG



117879000


GACG

CCACA

TATGA

TGCACCACAAGG








AGGTT

AAACA

TTTTCTTCTGTTT








TTCTTC



TCATAGCCAGCC












AGA





ZBTB34->SCAI
+chr9:
Manual
86
CGCGGGC
353
CAAAT
354
CATCTT
355
CGCGGGCGGGCG
356



129623018->−chr9:


GGGCGAT

TCAGT

CAGCA

ATGTGAGCGCGG



127818286


GT

CCTGT

CCTCC

CGCTCTGGACAG








CCAGA

AGAGG

GACTGAATTTGC








GCGCC

ACATG

TCTTAAAGAAAT








GCGCTC



CATGTCCTCTGG












AGGTGCTGAAGA












TG





#Internal control.


*Old design, which was found to not be optimal


**New design





Claims
  • 1. A method for identifying a gene fusion in a biological sample obtained from a human subject, wherein the biological sample is a formalin-fixed paraffin embedded (FFPE) sample, comprising: obtaining a plurality of reads from RNA sequencing of the biological sample;mapping a read to the human genome;determining whether the read comprises a distant spliced junction;selecting the read comprising a distant spliced junction;identifying a candidate gene fusion comprising the distant spliced junction;creating a first set of templates for the candidate gene fusion, wherein the first set of templates comprises: (1) a fusion template comprising 50 base pairs (bp) of exonic sequence of a preserved region of a donor gene and 50 bp of exonic sequence of a preserved region of an acceptor gene,(2) a donor template comprising 50 bp of exonic sequence of a preserved region of a donor gene and 50 bp of exonic sequence of a discarded region of a donor gene,(3) an acceptor template comprising 50 bp of exonic sequence of a discarded region of an acceptor gene and 50 bp of exonic sequence of a preserved region of an acceptor gene,(4) a donor genomic template comprising 50 bp upstream genomic sequence of a donor splicing site and 50 bp downstream genomic sequence of a donor splicing site, and(5) an acceptor genomic template comprising 50 bp upstream genomic sequence of an acceptor splicing site and 50 bp downstream genomic sequence of an acceptor splicing site;removing the candidate gene fusion if any of the first template set sequences are identical, but map to different genes in the human genome;creating a second set of templates comprising: (a) a fusion template comprising 150 bp of exonic sequence of a preserved region of a donor gene and 150 bp of exonic sequence of a preserved region of an acceptor gene,(b) a donor template comprising 150 bp of exonic sequence of a preserved region of a donor gene and 150 bp of exonic sequence of a discarded region of a donor gene,(c) an acceptor template comprising 150 bp of exonic sequence of a discarded region of an acceptor gene and 150 bp of exonic sequence of a preserved region of an acceptor gene,(d) a donor genomic template comprising 150 bp upstream genomic sequence of a donor splicing site and 150 bp downstream genomic sequence of a donor splicing site, and(e) an acceptor genomic template comprising 150 bp upstream genomic sequence of an acceptor splicing site and 150 bp downstream genomic sequence of an acceptor splicing site;determining the homology between templates (b) and (c) and between templates (d) and (e);removing the candidate gene fusion if templates (b) and (c) are homologous or if templates (d) and (e) are homologous; andaligning a read obtained from RNA sequencing of the biological sample to the first set of templates;selecting the read that maps to the fusion template of the first set of templates; andidentifying a gene fusion, wherein the gene fusion is identified by at least two non-duplicate reads that map to the fusion template of the first set of templates.
  • 2. The method of claim 1, further comprising preparing a report based on the identification of a gene fusion.
  • 3. The method of claim 1, wherein the read is a single end read.
  • 4. The method of claim 1, wherein the read is a paired-end read.
  • 5. The method of claim 1, wherein the read is at least 50 bases.
  • 6. The method of claim 1, wherein human subject is a cancer patient, and the cancer is selected from breast cancer, colon cancer, lung cancer, prostate cancer, hepatocellular cancer, gastric cancer, pancreatic cancer, cervical cancer, ovarian cancer, liver cancer, bladder cancer, cancer of the urinary tract, thyroid cancer, renal cancer, carcinoma, melanoma, and brain cancer.
Parent Case Info

This application is a national phase application under 35 U.S.C. § 371 of International Application No. PCT/US2013/068236, filed Nov. 4, 2013, and claims the benefit under 35 U.S.C. § 119(e) of U.S. Provisional Application No. 61/722,634, filed Nov. 5, 2012, and U.S. Provisional Application No. 61/766,561, filed Feb. 19, 2013, the disclosures of which are hereby incorporated herein by reference in their entirety.

PCT Information
Filing Document Filing Date Country Kind
PCT/US2013/068236 11/4/2013 WO 00
Publishing Document Publishing Date Country Kind
WO2014/071279 5/8/2014 WO A
US Referenced Citations (3)
Number Name Date Kind
6849400 Harvey et al. Feb 2005 B1
20040115686 Dolginow et al. Jun 2004 A1
20050214823 Blume et al. Sep 2005 A1
Non-Patent Literature Citations (2)
Entry
Levin et al., “Targeted Next-Generation Sequencing of a Cancer Transcriptome Enhances Detection of Sequence Variants and Novel Fusion Transcripts”, Genome Biology, vol. 10, 2009, 8 pages.
International Search Report and Written Opinion dated Apr. 18, 2014, for International Patent Application No. PCT/US2013/068236, filed Nov. 4, 2013.
Related Publications (1)
Number Date Country
20150302143 A1 Oct 2015 US
Provisional Applications (2)
Number Date Country
61766561 Feb 2013 US
61722634 Nov 2012 US