Methods And Compositions Involving Intrinsic Genes

Information

  • Patent Application
  • 20140087959
  • Publication Number
    20140087959
  • Date Filed
    August 05, 2013
    11 years ago
  • Date Published
    March 27, 2014
    10 years ago
Abstract
Disclosed are compositions and methods related intrinsic gene sets and methods and compositions related to detecting and classifying cancer.
Description
INCORPORATION-BY-REFERENCE OF SEQUENCE LISTING

The contents of the text file named “40448-201C01US_ST25.txt”, which was created on Nov. 23, 2013 and is 121 KB, is hereby incorporated by reference in its entirety.


II. BACKGROUND

A major challenge for microarray studies, especially those with clinical implications, is validation (Ioannidis 2005; Jenssen and Hovig 2005; Michiels et al. 2005). Due to the practical considerations of cost and accessing large numbers of fresh samples with associated clinical information, very few microarray studies have analyzed enough samples to allow the findings to be extended to the general population. Furthermore, it has been difficult to combine and/or validate results from independent laboratories due to differences in sample preparation, patient demographics and the microarray platforms used. An accepted method for validation is to derive a prognostic gene set from a “training set” and then apply it to a “test set” that was not used in any way, to derive the prognostic gene set (Simon et al. 2003); the “purest” test sets have also been suggested to be comprised of samples not contained in the training set and not generated by the primary investigators (Ioannidis 2005). What is needed in the art is a new breast tumor intrinsic gene list that identifies new and important biological features of breast tumors and validates this predictor using a true test set.


III. SUMMARY

Described herein is a method of diagnosing cancer, the method comprising comparing expression levels of a combination of genes from Table 21 to test nucleic acids wherein specific expression patterns of the test nucleic acids indicates a cancerous state.


Also, disclosed is a method of quantitating level of expression of a test nucleic acid comprising: a) comparing gene expression levels of a combination of genes from Table 21 to test nucleic acids corresponding to the same combination of genes; and b) quantitating level of expression of the test nucleic acid.


Also disclosed is a method for determining prognosis based on the expression patterns in a subject diagnosed with cancer comprising: a) comparing expression levels of a combination of genes from Table 21 to test nucleic acids corresponding to the same combination of genes; and b) quantitating level of expression of the test nucleic acid.


Disclosed is a method of classifying cancer in a subject, comprising: a) identifying intrinsic genes of the subject to be used to classify the cancer; b) obtaining a sample from the subject; c) amplifying and detecting levels of intrinsic genes in the subject; and d) classifying cancer or subject based upon results of step c.


Also disclosed is a method of diagnosing cancer in a subject the method comprising: a) amplifying and detecting intrinsic genes; and b) diagnosing cancer based on expression levels of the gene within the subject.


Disclosed herein is a method of deriving a minimal intrinsic gene set for making biological classifications of cancer comprising: a) collecting data from multiple samples from the same individual to identify potential intrinsic classifier genes; b) weighting intrinsic classifier genes of multiple individuals identified using the method of step a relative to each other and forming classification clusters; c) estimating the number of clusters formed in step b) and assigning individual samples to classification clusters; d) identifying genes that optimally distinguish the samples in the assigned groups of step c); e) performing iterative cross-validation with a nearest centroid classifier and overlapping gene sets of various sizes using the genes identified in step d); and f) choosing a gene set which provides the highest class prediction accuracy when compared to the classifications made in step b).


Also disclosed is a method of assigning a sample to an intrinsic subtype, comprising a) creating an intrinsic subtype average profile (centroid) for each subtype; b) individually comparing a new sample to each centroid; and c) assigning the new sample to the centroid that is most similar to the expression profile of new sample.





IV. BRIEF DESCRIPTION OF THE DRAWINGS

The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate several embodiments and together with the description illustrate the disclosed compositions and methods.



FIG. 1 shows the expression levels for the five genes shown by tissue sample. Top: raw data. Bottom: log-scale.



FIG. 2 shows the expression levels of the 10 genes shown by sample and tissue type. Vandesompele data set in log-scale.



FIG. 3 shows the mean squared error (MSE) of each gene by tissue-type. The sign is determined by the direction of the bias. The MSE is broken down into the contributing components of the squared bias (Biaŝ2) and the variance (Sigmâ2). Vandesompele data set.



FIG. 4 shows two-way hierarchical clustering of microarray data for the same samples assayed by qRT-PCR. Samples were classified based on the expression of 402 “intrinsic” genes defined in Sorlie et al. 2003. The expression level for each gene is shown relative to the median expression of that gene across all the samples with high expression represented by red and low expression represented by green. Genes with median expression are black and missing values are gray. The sample-associated dendrogram shows the same classes seen by qRT-PCR (FIG. 5). Samples are grouped into Luminal, HER2+/ER−, Normal-like, and Basal-like subtypes. Overall, 114/123 (93%) primary breast samples classified the same between microarray and qRT-PCR.



FIG. 5 shows two-way hierarchical clustering of real-time qRT-PCR data from 126 unique samples. The sample-associated dendrogram (5A) shows the same classes seen by microarray. Samples are grouped into Luminal (blue), HER2+/ER− (pink), Normal-like (green), and Basal-like (red) subtypes. The expression level for each gene is shown relative to the median expression of that gene across all the samples with high expression represented by red and low expression represented by green. Genes with median expression are black and missing values are gray. A minimal set of 37 “intrinsic” genes (5B) was used to classify tumors into their primary “intrinsic” subtypes. The “intrinsic” gene set was supplemented using PgR and EGFR (5C), and proliferation genes (5D). The genes in 1C and ID were clustered separately in order to determine agreement between the minimal 37 qRT-PCR “intrinsic” set (5A) and the larger 402 microarray “intrinsic” set.



FIG. 6 shows Receiver Operator Curves. The agreement between immunohistochemistry (IHC) and gene expression is shown for ER (6A), PR (6B), and HER2 (6C) using ROC. A cut-off for relative gene copy number was selected by minimizing the sum of the observed false positive and false negative errors. The sensitivity and specificity of the resulting classification rule were estimated via bootstrap adjustment for optimism. Since many biomarkers having concordant expression and can serve as surrogates for one another, we tested the accuracy of using GATA3 and GRB7 as surrogates (dotted lines) for calling ER and HER2 protein status, respectively. There was overall good agreement between gene expression and IHC status for ER and PR, but poor agreement between gene expression and IHC status for HER2. The surrogate markers had similar accuracy to the actual markers for predicting HiC status.



FIG. 7 shows outcome for “intrinsic” subtypes. Kaplan-Meier plots showing relapse free survival (RFS) and overall survival (OS) for patients with Luminal tumors compared to those with HER2+/ER− or Basal-like tumors. Patients with Luminal tumors showed significantly better outcomes for RFS (3A) and OS (3B) compared to HER2+/ER− (RFS: ρ=0.023; OS: p=0.003) and Basal-like (RFS: ρ=0.065; OS: p=0.002) tumors. Classifications were made from real-time qRT-PCR data using the minimal 37 “intrinsic” gene list. Pairwise log-rank tests were used to test for equality of the hazard functions among the intrinsic classes. Tumors in the Normal Breast-like subtype were excluded from the analyses since this class maybe artificially created from having a sample comprised primarily of normal cells.



FIG. 8 shows grade and proliferation as predictors of relapse free survival. Kaplan-Meier plots are shown for grade (8A) and the proliferation genes (8B) using Cox regression analysis. The analysis for the proliferation genes was performed on continuous expression data, although the plots are shown in tertiles. The proliferation index (log average of the 14 proliferation genes) has significant predictive value for outcome, even after correcting for other clinical parameters important for survival. Furthermore, when we include both grade and the proliferation index (and stage) in a model for RFS, we find that the proliferation index is the superior predictor (Grade p=0.51; Proliferation index p=0.047).



FIG. 9 shows co-clustering of real-time qRT-PCR and microarray data using 50 genes and 252 samples. The relative copy number (qRT-PCR) and R/G ratio (microarray) for each gene was Iog2 transformed and combined into a single dataset using distance weighted discrimination. Two-way hierarchical clustering was performed on the combined dataset using Spearman correlation and average linkage. The sample associated dendrogram (5A) shows the same classes as seen in FIG. 1. Samples are classified as Basal-like (red), HER2+/ER−, Luminal, and Normal-like. The expression level for each gene is shown relative to the median expression of that gene across all the samples with overexpressed genes and underexpressed genes, as well as average expression. The gene associated dendrogram (5B) shows that the Luminal tumors and Basal-like tumors differentially express estrogen associated genes (cluster 1); as well as basal keratins (KRT 5 and 17), inflammatory response genes (CX3CL1 and SLPI), and genes in the Wnt pathway (FZD7) (cluster 3). The main distinguishers of the HER2+/ER− group are low expression of genes in cluster 1 and high expression of genes on the 1/q12 amplicon (ERBB2 and GRB7) (cluster 4). The proliferation genes (cluster 2) have high expression in the ER negative tumors (Basal-like and HER2+/ER−) and low expression in ER positive (Luminal) and Normal-like samples.



FIG. 10 shows a flow chart of the steps of deriving minimal intrinsic gene sets for making biological classifications of breast cancer.



FIG. 11 shows an overview and flow of the data sets used and analyses performed.



FIG. 12 shows a hierarchical cluster analysis of the training set using the Intrinsic/UNC gene set. 146 microarrays, representing 105 tumors and 9 normal breast samples were analyzed using the 1300 gene Intrinsic/UNC gene set. A) Overview of the complete cluster diagram (the full cluster diagram can be found as Supplemental FIG. 1). B) Experimental sample associated dendrogram. The 26 paired samples used for the intrinsic analysis are identified by the black bars. C) Luminal/ER+ gene expression cluster with GATA3-regulated genes shown in pink. D) HER2 and GRB7 containing expression cluster. E) Basal epithelial enriched expression cluster. F) Proliferation associated expression cluster. The genes in red are mentioned in the text. The Single Sample Predictor/SSP was applied back onto this training data set with the individual sample classifications identified using colored squares (Pink=HER2+/ER−, Red=Basal-like, Dark Blue=Luminal A, Light Blue=Luminal B, and Green=Normal Breast-like).



FIG. 13 shows Androgen Receptor (AR) immunohistochemistry on human breast tumors. A) AR staining on the HER2+/ER− subtype tumor BR00-0284. B) AR staining on the HER2+/ER− subtype tumor PB455 showing nuclear localization. C) AR staining on the Luminal subtype tumor BR01-0246. D) Lack of AR staining on the Basal-like subtype tumor BR97-0137. The magnification is approximately 200×.



FIG. 14 shows hierarchical cluster analysis the combined test set of 311 tumors and 4 normal breast samples analyzed using the Intrinsic/UNC gene set reduced to 306 genes. A) Overview of the complete cluster diagram. B) Experimental sample associated dendrogram. C) Luminal/ER+ gene expression cluster with GATA3-regulated genes in pink text. D) HER2 and GRB7 containing expression cluster. E) Interferon-regulated cluster containing STAT1. F) Basal epithelial enriched cluster. G) proliferation cluster.



FIG. 15 shows univariate Kaplan-Meier survival plots using RFS as the endpoint, for the common clinical parameters present within the combined test set of 311 tumors. Survival plots for A) ER status, B) node status, C) grade, and D) tumor size.



FIG. 16 shows univariate Kaplan-Meier survival plots for intrinsic subtype analyses. A) Relapse-free survival for the 105 patients/tumors training set classified using hierarchical clustering and complete 1300 gene the Intrinsic/UNC list. B) Relapse-free survival for the 315 sample combined test set analyzed using the Intrinsic/UNC list reduced to 306 genes. C) Survival analysis of the 60 adjuvant tamoxifen-treated patients from the Ma et al. 2004 study who were classified as either LumA, LumB or Normal Breast-like using the Single Sample Predictor. D) Survival analysis of the 96 local treatment only (i.e. surgery alone) test set patients taken from Chang et al. 2005, which were classified using the Single Sample Predictor. E) Survival analysis of a second pure test set of 45 patients treated with adjuvant tamoxifen and classified using the Single Sample Predictor. F) Relapse-free survival for the 105 patients/tumors training set, and classified using the Single Sample Predictor. All p-values were based on a log-rank test.



FIG. 17 shows grade and proliferation as predictors of relapse free survival. A Cox regression model was used to determine probability of relapse over time. Kaplan-Meier curves show time to event given different grades and levels of proliferation. Grade was scored as low (green), medium (red) or high (blue). The proliferation score was based on continuous expression data and is shown as textiles that correspond to low (green), medium (red), and high (blue) levels of expression. The proliferation meta-gene (log 2 average of the 14 proliferation genes) showed significant value in predicting relapse, even after correcting for other clinical parameters important for survival (Table 1). Furthermore, when both grade and proliferation were used in a model for RFS, it was found that the proliferation meta-gene is the better predictor (Grade p=0.51; Proliferation index p=0.047).





V. DETAILED DESCRIPTION

Before the present compounds, compositions, articles, devices, and/or methods are disclosed and described, it is to be understood that they are not limited to specific synthetic methods or specific recombinant biotechnology methods unless otherwise specified, or to particular reagents unless otherwise specified, as such may, of course, vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting.


A. DEFINITIONS

As used in the specification and the appended claims, the singular forms “a,” “an” and “the” include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to “a pharmaceutical carrier” includes mixtures of two or more such carriers, and the like.


Ranges can be expressed herein as from “about” one particular value, and/or to “about” another particular value. When such a range is expressed, another embodiment includes from the one particular value and/or to the other particular value. Similarly, when values are expressed as approximations, by use of the antecedent “about,” it will be understood that the particular value forms another embodiment. It will be further understood that the endpoints of each of the ranges are significant both in relation to the other endpoint, and independently of the other endpoint. It is also understood that there are a number of values disclosed herein, and that each value is also herein disclosed as “about” that particular value in addition to the value itself. For example, if the value “10” is disclosed, then “about 10” is also disclosed. It is also understood that when a value is disclosed that “less than or equal to” the value, “greater than or equal to the value” and possible ranges between values are also disclosed, as appropriately understood by the skilled artisan. For example, if the value “10” is disclosed the “less than or equal to 10” as well as “greater than or equal to 10” is also disclosed. It is also understood that the throughout the application, data is provided in a number of different formats, and that this data, represents endpoints and starting points, and ranges for any combination of the data points. For example, if a particular data point “10” and a particular data point 15 are disclosed, it is understood that greater than, greater than or equal to, less than, less than or equal to, and equal to 10 and 15 are considered disclosed as well as between 10 and 15. It is also understood that each unit between two particular units are also disclosed. For example, if 10 and 15 are disclosed, then 11, 12, 13, and 14 are also disclosed.


As used throughout, by a “subject” is meant an individual. Thus, the “subject” can include, for example, domesticated animals, such as cats, dogs, etc., livestock (e.g., cattle, horses, pigs, sheep, goats, etc.), laboratory animals (e.g., mouse, rabbit, rat, guinea pig, etc.) mammals, non-human mammals, primates, non-human primates, rodents, birds, reptiles, amphibians, fish, and any other animal. The subject can be a mammal such as a primate or a human.


“Treating” or “treatment” does not mean a complete cure. It means that the symptoms of the underlying disease are reduced, and/or that one or more of the underlying cellular, physiological, or biochemical causes or mechanisms causing the symptoms are reduced. It is understood that reduced, as used in this context, means relative to the state of the disease, including the molecular state of the disease, not just the physiological state of the disease.


By “reduce” or other forms of reduce means lowering of an event or characteristic. It is understood that this is typically in relation to some standard or expected value, in other words it is relative, but that it is not always necessary for the standard or relative value to be referred to. For example, “reduces phosphorylation” means lowering the amount of phosphorylation that takes place relative to a standard or a control.


By “inhibit” or other forms of inhibit means to hinder or restrain a particular characteristic. It is understood that this is typically in relation to some standard or expected value, in other words it is relative, but that it is not always necessary for the standard or relative value to be referred to. For example, “inhibits phosphorylation” means hindering or restraining the amount of phosphorylation that takes place relative to a standard or a control.


By “prevent” or other forms of prevent means to stop a particular characteristic or condition. Prevent does not require comparison to a control as it is typically more absolute than, for example, reduce or inhibit. As used herein, something could be reduced but not inhibited or prevented, but something that is reduced could also be inhibited or prevented. It is understood that where reduce, inhibit or prevent are used, unless specifically indicated otherwise, the use of the other two words is also expressly disclosed. Thus, if inhibits phosphorylation is disclosed, then reduces and prevents phosphorylation are also disclosed.


By “specific expression pattern” is meant an elevation or reduction of expression of given genes when compared with a control or a standard. One of ordinary skill in the art is capable of identifying and measuring the expression of gene patterns of genes related to the methods disclosed herein.


The term “therapeutically effective” means that the amount of the composition used is of sufficient quantity to ameliorate one or more causes or symptoms of a disease or disorder. Such amelioration only requires a reduction or alteration, not necessarily elimination. The term “carrier” means a compound, composition, substance, or structure that, when in combination with a compound or composition, aids or facilitates preparation, storage, administration, delivery, effectiveness, selectivity, or any other feature of the compound or composition for its intended use or purpose. For example, a carrier can be selected to minimize any degradation of the active ingredient and to minimize any adverse side effects in the subject.


Throughout the description and claims of this specification, the word “comprise” and variations of the word, such as “comprising” and “comprises,” means “including but not limited to,” and is not intended to exclude, for example, other additives, components, integers or steps.


The term “cell” as used herein also refers to individual cells, cell lines, or cultures derived from such cells. A “culture” refers to a composition comprising isolated cells of the same or a different type.


References in the specification and concluding claims to parts by weight, of a particular element or component in a composition or article, denotes the weight relationship between the element or component and any other elements or components in the composition or article for which a part by weight is expressed. Thus, in a compound containing 2 parts by weight of component X and 5 parts by weight component Y, an Y are present at a weight ratio of 2:5, and are present in such ratio regardless of whether additional components are contained in the compound.


A weight percent of a component, unless specifically stated to the contrary, is based on the total weight of the formulation or composition in which the component is included.


In this specification and in the claims which follow, reference will be made to a number of terms which shall be defined to have the following meanings:


“Optional” or “optionally” means that the subsequently described event or circumstance may or may not occur, and that the description includes instances where said event or circumstance occurs and instances where it does not.


“Primers” are a subset of probes which are capable of supporting some type of enzymatic manipulation and which can hybridize with a target nucleic acid such that the enzymatic manipulation can occur. A primer can be made from any combination of nucleotides or nucleotide derivatives or analogs available in the art which do not interfere with the enzymatic manipulation.


“Probes” are molecules capable of interacting with a target nucleic acid, typically in a sequence specific manner, for example through hybridization. The hybridization of nucleic acids is well understood in the art and discussed herein. Typically a probe can be made from any combination of nucleotides or nucleotide derivatives or analogs available in the art.


B. COMPOSITIONS AND METHODS

Disclosed herein are methods and compositions for deriving a minimal intrinsic gene set for making biological classifications of cancer. Also disclosed are methods of using intrinsic genes in a real-time qRT-PCR assay for cancer classification, prognosis and/or treatment. Described herein are several algorithms for use in combination in order to generate a statistically validated minimal gene set that makes biological classifications of cancers. While the methods disclosed herein are generally useful with any type of cancer, breast cancer is specifically used as an example herein. Below follows a list of specific cancers that are useful with the methods disclosed herein, and the example of breast cancer is not intended to be limiting, but rather exemplary. The samples disclosed herein can be obtained from a variety of sources, including fresh tissue, fresh-frozen samples, or formalin-fixed paraffin-embedded samples.


The methodology described herein can be used to make a classification that distinguishes 2 or more intrinsic subtypes of breast cancer. The intrinsic subtypes can be designated as Luminal (and classes therein), HER2/ER− (and classes therein), Basal (and classes therein), Normal-like (and classes therein). The steps for finding the minimal intrinsic gene set for making subtype (and class) distinctions are as follows.


The first step is to use microarray data from biological replicates from the same patient to find intrinsic classifier genes. For example, a data set of tumors and normal breast samples can be used. In one embodiment, these data sets can comprise paired biological replicates to identify the intrinsic gene set. This is described, for example, in Perou et al. (2000), which is herein incorporated by reference in its entirety for its teaching regarding finding intrinsic classifier genes. In Perou et al., the molecular portraits revealed in the patterns of gene expression not only uncovered similarities and differences among the tumors, but also point to a biological interpretation. Variation in growth rate, in the activity of specific signalling pathways, and in the cellular composition of the tumors were all reflected in the corresponding variation in the expression of specific subsets of genes.


In the second step of the method disclosed herein, hierarchical cluster microarray data was obtained using an intrinsic gene set. Here, data can be combined from different microarray platforms for clustering using methods described in Example 2. Specifically, the “intrinsic gene set” from the first step (above) is tested on new tumors and normal breast samples after combining different datasets (such as cross platform analyses) and common genes/elements are hierarchically clustered. For example, a two-way average linkage hierarchical cluster analysis can be performed using a centered Pearson correlation metric and the program “Cluster” (Eisen et al. 1998), with the data being displayed relative to the median expression for each gene (i.e. median centering of the rows/genes).


In the third step, the number of clusters formed in the microarray dataset is estimated, and samples/tumors are assigned to clusters based on the sample-associated dendrogram groupings. In other words, the “test set” is used as a training set to create subtype centroids based upon the expression of the common intrinsic genes. New samples are assigned to the subtype corresponding to the nearest centroid when using Spearman correlation values.


In the fourth step, genes are found that optimally distinguish the samples in the assigned groups using the ratio of between-group to within-group sums of squares (the entire microarray dataset is used in this analysis). An example of this can be found in Chung et al, Cancer Cell 2004, herein incorporated by reference in its entirety for its teaching concerning identification of genes that optimally distinguish samples.


In the fifth step, iterative cycles of 10-fold cross-validation are performed with a nearest centroid classifier and overlapping gene sets of varying sizes. In other words, each gene and gene set are ranked based upon the metric from step four above, and various overlapping and every increasing sized genes lists are used in a 10-fold cross validation.


In the sixth, and final step, the smallest gene set which provides the highest class prediction accuracy when compared to the classifications made by the complete microarray-based intrinsic gene set is chosen. Subtypes are assigned for each gene set and the minimal gene set with the highest agreement in sample assignment to the full intrinsic gene set is chosen, hi one example, using a 1410 intrinsic gene set as disclosed in Example 2, 100 genes were identified (see Table 12 (7p 100), after the “Examples” section) that are important for identifying 7 different biological classes of breast cancer. Specific steps and sample sets used to develop the 7-class predictor as shown in FIG. 11. Also disclosed in Table 13 is an extended list of genes for classification resulting from the 7p analyses. This list is ranked in terms of significance for separating the different classes of intrinsic classifier genes. Another set of intrinsic genes that can be used for classification is found in Table 21, along with the primers that can be used to amplify those genes. It should be noted that the primers are optional and exemplary only, as any primer that can amplify a given gene can be used.


The minimal intrinsic gene set (identified using the methods described above, and found in Tables 12 and 13) has prognostic and predictive significance in breast cancer. The complete assay for making these biological “intrinsic” classifications includes 3 “housekeeper” genes (MRPL1 9, PUM1, and PSMC4) for normalizing the quantitative data. In addition, it has been shown that proliferation genes can also be used in combination with the housekeeper genes for providing a quantitative measurement of grade and for assessing prognosis in breast cancer.


Also disclosed herein is the Single Sample Predictor (SSP). The Single Sample Predictor/SSP is based upon the Nearest Centroid method presented in (Hastie et al. 2001). The subtype centroids (either all intrinsic genes or the minimal gene lists) can be used to make subtype predictions on additional test sets (e.g., homogenously treated subjects from clinical trial groups). The resulting classifications are then analyzed using Kaplan-Meier survival plots to determine prognostic and therapeutic significance. An example of SSP can be found in Example 2.


1. Intrinsic Genes and Cancer


An intrinsic gene is a gene that shows little variance within repeated samplings of the same tumor, but which shows high variance across tumors. Disclosed herein are genes that can be used as intrinsic genes with the methods disclosed herein. The intrinsic genes disclosed herein can be genes that have less than or equal to 0.00001, 0.0001, 0.001, 0.01, 0.1, 0.2. 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 1,000, 10,000, or 100,000% variation between two samples from the same tissue. It is also understood that these levels of variation can also be applied across 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50 or more tissues, and the level of variation compared. It is also understood that variation can be determined as discussed in the examples using the algorithms as disclosed herein.


“Intrinsic gene set” is defined herein as comprising one or more intrinsic genes. “Minimal intrinsic gene set” is defined herein as being derived from an intrinsic gene set, and is considered the fewest number of intrinsic genes that can be used to classify a sample.


Disclosed herein is a set of 212 minimal intrinsic genes, as found in Table 21. These genes can be used alone, or in combination, as intrinsic genes for the purposes of classification, prognosis, and diagnosis of cancer, for example. 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154. 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199 of the genes can be used with the methods disclosed herein for analyzing samples.


Described herein is a method of diagnosing cancer, the method comprising comparing expression levels of a combination of genes from Table 21 to test nucleic acids corresponding to the same combination of genes, wherein specific expression patterns of the test nucleic acids indicates a cancerous state.


Also disclosed is a method of quantitating level of expression of a test nucleic acid comprising: a) comparing gene expression levels of a combination of genes from Table 21 to test nucleic acids corresponding to the same combination of genes; and b) quantitating level of expression of the test nucleic acid.


Also disclosed is a method of prognosing outcome in a subject diagnosed with cancer comprising: a) comparing expression levels of a combination of genes from Table 21 to test nucleic acids corresponding to the same combination of genes, b) identifying a subtype of cancer of the subject, and c) prognosing the outcome based on the subtype of cancer of the subject.


The intrinsic genes disclosed herein can be normalized to control housekeeper genes and used in a qRT-PCR diagnostic assay that uses relative copy number to assess risk or therapeutic response in cancer. For example, MRPL19 (SEQ ID NO:1), PSMC4 (SEQ ID NO:2), SF3A1 (SEQ IDNO:3), PUM1 (SEQ ID NO:4), ACTB (SEQ ID NO:5) and GAPD (SEQ ID NO:6). Other genes include GUSB, RPLPO, and TFRC, whose sequences can be found in Genbank. These are part of the 212 gene list. Other genes as disclosed herein can also be considered intrinsic genes.


The intrinsic genes can be used in any combination or singularly in any method described herein. It is also understood that any nucleic acid related to the expression control genes, such as the RNA, mRNA, exons, introns, or 5′ or 3′ upstream or downstream sequence, or DNA or gene can be used or identified in any of the methods or with any of the compositions disclosed herein.


2. Molecules for Detecting Genes, Gene Expression Products, Proteins Encoded by Genes


The disclosed methods involve using specific intrinsic genes or gene sets or expression control genes or gene sets such that they are detected in some way or their expression product is detected in some way. Typically the expression of a gene or its expression product will be detected by a primer or probe as disclosed herein. However, it is understood that they can also be detected by any means, such as in a microarray analysis or a specific monoclonal antibody or other visualization technique. Often, the expression of the genes of interest (control “housekeeper” genes or intrinsic classifier genes) can be detected after or during an amplification process, such as RT-PCR, including quantitative PCR.


3. Method of Diagnosing or Prognosing Cancer


Microarrays have shown that gene expression patterns can be used to molecularly classify various types of cancers into distinct and clinically significant groups. In order to translate these profiles into routine diagnostics, a microarray breast cancer classification system has been recapitulated using real-time quantitative (q)RT-PCR (Example 2). Statistical analyses were performed on multiple independent microarray datasets to select an “intrinsic” gene set that can classify breast tumors into four different subtypes designated as Luminal, Normal-like, HER2+/ER−, and Basal-like. Intrinsic genes, as described in Perou et al. (Nature (2000) 406:747-752), are statistically selected to have low variation in expression between biological sample replicates from the same individual and high variation in expression across samples from different individuals. Thus, intrinsic genes are the classifier genes for breast cancer classification and each classifier gene can be normalized to the housekeeper (or control) genes in order to make the classification. A minimal gene set from the microarray “intrinsic” list, and additional genes important for outcome (e.g., proliferation genes), were used to develop a real-time qRT-PCR assay comprised of 53 classifiers and 3 housekeepers. The expression data and classifications from microarray and real-time qRT-PCR were respectively compared using 123 unique breast samples (117 invasive carcinomas, 1 fibroadenoma and 5 normal tissues) and 3 cells lines. The overall correlation for the 50 genes in common between microarray and qRT-PCR was 0.76. There was 91% (114/126) concordance in the hierarchical clustering classification of the real-time qRT-PCR minimal “intrinsic” gene set (37 genes) and the larger (550 genes) microarray intrinsic gene set from which the PCR list was derived. As expected, the Luminal tumors (ER+) had a significantly better outcome than the HER2+/ER− (p=0.043) and Basal-like tumors (p=0.001). High expression of the proliferation genes GTBP4 (p=0.011), HSP A14 (p=0.023), and STK6 (ρ=0.027) were significant predictors of relapse free survival (RFS) independent of grade and stage. It has been shown that genomic microarray data can be translated into a qRT-PCR diagnostic assay that improves the standard of care in breast cancer.


The overlap in the minimized gene set discussed above and in Example 2 versus those in Example 3 is 14 out of 40. There are 108 genes in common between the larger intrinsic gene sets, which included 427 in Perreard et al versus 1300 used in Example 3. Example 2 illustrates how intrinsic gene sets can be minimized from microarray data and used on fresh tissue in a qRT-PCR assay to recapitulate the microarray classifications. It also shows the importance of the ‘proliferation’ genes in risk stratifying Luminal (ER+) breast tumors. Example 3 discusses a version of the intrinsic gene set from Hu et al and shows again how it can be minimized to provide intrinsic classifications on both fresh and FFPE tissue and using microarray or qRT-PCR data. Validated primer sequences from FFPE tissues for 212 genes important for breast cancer diagnostics are presented in Table 21.


A major challenge in the clinical care of cancer has been providing an accurate diagnosis for appropriate management of breast cancer. For over 50 years, medicine has relied on morphological features (histopathology) and anatomic staging (Tumor size/Node involvement/Metastasis) for classification of tumors (Greenough, R. B. J Cancer Res 9:452-463; Bloom et al. (1957) British Journal of Cancer 9:359-377). The TNM staging system provides information about the extent of disease and has been the “gold standard” for prognosis (Henson, et al. (1991) Cancer 68:2142-2149; Fitzgibbons, et al (2000) Arch Pathol Lab Med 124:966-978).


In addition to TNM, the grade of the tumor is also prognostic for relapse free survival (RFS) and overall survival (OS) (Elston et al. (1991) Histopathology 19:403-410). Tumor grade is determined from histological assessment of tubule formation, nuclear pleomorphism, and mitotic count. Due to the subjective nature of grading and difficulties standardizing methods, there has been less than optimal agreement between pathologists (Dalton et al. (1994) Cancer 73:2765-2770). Applying the Nottingham combined histological grade has made scoring more quantitative and improved agreement between observers (Frierson (1995) Am J Clin Pathol 103:195-198), however, more objective methods are still needed before grade is integrated into the TNM classification (Singletary (2003) Surg Clin North Am 83:803-819). For instance, most studies show significance in outcome between Grade 1 (low/least aggressive) and Grade 3 (high/most aggressive), but Grade 2 (intermediate) tumors show variability in outcome and are commonly not classified the same across institutions (Kollias et al. (1999) Eur J Cancer 35:908-912; Robbins et al. (1995) Hum Pathol 26:873-879; Genestie et al. (1998) Anticancer Res 18:571-576.). Alternatively, proliferation assays, such as S-phase fraction and mitotic index, have shown to be independent prognostic indicators and could be used in conjunction with, or instead of grade (Michels et al. (2004) Cancer 100:455-464; CaIy et al. (2004) Anticancer Res 24:3283-3288). It has been shown that proliferation genes can be used in a qRT-PCR assay and the genes can be averaged to produce a proliferation meta-gene that correlates with grade but is more prognostic (FIG. 17).


Women with the same stage of breast cancer can have widely different clinical outcomes due to differences in tumor biology (van't Veer et al. (2002) Nature 415:530-536; van van de Vijver et al. (2002) N Engl J Med 347:1999-2009. The use of gene expression markers in breast pathology can provide addition clinical information that complements the TNM system for prognosis and is important for making therapeutic decisions (van't Veer et al. (2002) Nature 415:530-536; van de Vijver et al. (2002) N Engl J Med 347:1999-2009; Paik et al. (2004) N Engl J Med 351:2817-2826; Sørlie et al. (2001) Proc Natl Acad Sci USA 98:10869-10874; Sorlie et al. (2003) Proc Natl Acad Sci USA 100:8418-8423). Undoubtedly, one of the greatest advancements in breast cancer medicine has been the identification and routine testing for the expression of the hormone receptors, namely the Estrogen Receptor (ER) and the Progesterone Receptor (PgR), which allows the clinician to offer endocrine blockade therapy that can significantly prolong survival in women with tumors expressing these proteins (Buzdar et al. (2003) J Clin Oncol 21:1007-1014; Fisher et al (1989) N Engl J Med 320:479-484).


Although ER expression is a predictive marker, it also serves as a surrogate marker for describing a tumor biology that is characteristically less aggressive (e.g. lower grade) than ER− negative tumors (Fisher et al. (1981) Breast Cancer Res Treat 1:37-41). Microarrays have elucidated the richness and diversity in the biology of breast cancer and have identified many genes that associate with ER-positive and ER-negative tumors (Perou et al. (2000) Nature 406:747-752; West et al. (2001) Proc Natl Acad Sci USA 98:11462-11467; Gruvberger et al. (2001) Cancer Res 61:5979-5984). When microarray data from invasive breast carcinomas are analyzed by hierarchical clustering, samples are separated primarily based on ER status (Sotiriou et al. (2003) Proc Natl Acad Sci USA 100:10393-10398).


Breast tumors of the “Luminal” subtype are ER positive and have a similar keratin expression profile as the epithelial cells lining the lumen of the breast ducts (Taylor-Papadimitriou et al. (1989) J Cell Sci 94:403-413; Perou et al. (2000) New Technologies for life sciences: A Trends Guide:67-76). Conversely, ER-negative tumors can be broken into two main subtypes, namely those that overexpress (and are DNA amplified for) HER2 and GRB7 (HER2+/ER−), and “Basal-like” tumors that have an expression profile similar to basal epithelium and express Keratin 5, 6B and 17. Both these tumor subtypes are aggressive and typically more deadly than Luminal tumors; however, there are subtypes of Luminal tumors that lead to poor outcome despite being ER− positive. For instance, Sorlie et al. identified a Luminal B subtype with similar outcomes to the HER2+/ER− and Basal-like subtypes, and Sotiriou et al. showed that there are 3 different types of Luminal tumors with different outcomes. The Luminal tumors with poor outcomes consistently share the histopathological feature of being higher grade and the molecular feature of highly expressing proliferation genes.


The so called “proliferation genes” show periodicity in expression through the cell cycle and have a variety of functions necessary for cell growth, DNA replication, and mitosis (Whitfield et al. (2002) MoI Biol Cell 13:1977-2000; Ishida et al. MoI Cell Biol 21:4684-4699). Despite their diverse functions, proliferation genes have similar gene expression profiles when analyzed by hierarchical clustering. As might be expected, proliferation genes correlate with grade, the mitotic index (Perou et al. (1999) Proc Natl Acad Sci USA 96:9212-9217), and outcome (Sørlie et al. (2001) Proc Natl Acad Sci USA 98:10869-10874). Proliferation genes are often selected when supervised analysis is used to find genes that correlate with patient outcome. For example, the SAM264 “survival” list presented in Sorlie et al., the 231 “prognosis classifier” list in van't Veer et al., and the “485 prognostic gene” list in Sotiriou et al., identified common proliferation genes (PCNA, TOP2A, CENPF). This suggests that all these studies are likely tracking a similar phenotype.


Gene expression profiling using DNA microarrays is a powerful tool to discover genes for molecular classifications of cancer but the platforms are labor intensive, expensive and currently not amenable to routine clinical diagnostics. Real-time qRT-PCR is well-suited for solid tumor diagnostics since it is rapid, homogenous (amplification and quantification in a single vessel), and can be performed from archived (FFPE tissue) samples. Example 3 shows that FFPE samples can perform as well as fresh samples. It has been shown that “intrinsic” breast cancer classifications from microarray can be recapitulated by qRT-PCR using a minimal “intrinsic” gene set. In addition, by supplementing the “intrinsic” gene set with proliferation genes, a more objective measurement of grade has been developed. The assay disclosed herein adds prognostic information to the standard of care for breast cancer.


Microarray used in conjunction with RT-PCR provides a powerful system for discovering and translating genomic markers into the clinical laboratory for molecular diagnostics. Although these platforms are fundamentally very different, the quantitative data across the methods have a high correlation. In fact, the data across the methods is no more disparate then across different microarray platforms. By hierarchical clustering, it has been shown that a biological classification of breast cancer derived from microarray data can be recapitulated using real-time qRT-PCR. Biological classification by real-time qRT-PCR makes the important clinical distinction between ER positive and ER negative tumors and identifies additional subtypes that have prognostic (ie, correlate to outcome) and predictive value (ie, correlate to treatment response).


The benefit of using real-time qRT-PCR for cancer diagnostics is that new informative markers can be readily validated and implemented, making tests expandable and/or tailored to the individual. For instance, it has been shown that including proliferation genes serves a similar purpose to grade but is more prognostic. Since grade has been shown to be universal as a prognostic factor in cancer, it is likely that the same markers correlate to grade and are important for survival in other tumor types. Real-time qRT-PCR is attractive for clinical use because it is fast, reproducible, tissue sparing, and able to be automated. Although genomic profiling should currently be used for ancillary testing, the fact that normal tissues can be distinguished from tumor tissue shows that these molecular assays may eventually be used for cancer diagnostics without histological corroboration.


Disclosed is a method of classifying cancer in a subject, comprising: a) identifying intrinsic genes of the subject to be used to classify the cancer; b) obtaining a sample from the subject; c) amplifying and detecting levels of intrinsic genes in the subject; and d) classifying cancer based upon results of step c. The sample can be fresh, or can be an FFPE sample.


Also disclosed is a method of diagnosing cancer in a subject the method comprising: a) amplifying and detecting intrinsic genes; and b) diagnosing cancer based on expression levels of the gene within the subject. The methods disclosed herein can be used with any of the types of cancer listed herein. The cancer can be breast cancer, for example. The breast cancer can be classified into one of four or more groups: luminal, normal-like, HER2+/ER− and basal-like, for example. Again, the sample can be fresh, or can be an FFPE sample.


Disclosed are methods of analyzing nucleic acid expression levels in a sample, the methods comprising comparing expression levels of an intrinsic gene set to a test nucleic acid, wherein specific expression patterns of the test gene relative to the intrinsic gene set indicates a diagnoses, poor prognosis, likelihood of obtaining, predisposition to obtaining, or presence of a cancer. Also disclosed are methods wherein the step of comparing comprises identifying the expression levels of an intrinsic gene set and a test nucleic acid by interaction with a primer or probe.


Disclosed are methods where a specific expression pattern of a test nucleic acid relative to an intrinsic gene set indicates the presence of a cancer, a poor (or good) prognosis for a patient having a cancer, a predisposition of getting a cancer, or a diagnoses of cancer or a cancerous state.


It is understood that any method of assaying any gene discussed herein can be performed. For example methods of assaying gene copy number or mRNA expression copy number can be performed. For example, RT-PCR, PCR, quantitative PCR, and any other forms of nucleic acid amplification can be performed. Furthermore, methods of hybridization, such as blotting, such as Northern or Southern techniques, such as chip and microarray techniques and any other techniques involving hybridizing of nucleic acids.


4. A Non-Limiting List of Cancers which can be Assayed with Disclosed Compositions and Methods


The disclosed compositions can be used to diagnose or prognose any disease where uncontrolled cellular proliferation occurs such as cancers. A non-limiting list of different types of cancers is as follows: lymphomas (Hodgkins and non-Hodgkins), leukemias, carcinomas, carcinomas of solid tissues, squamous cell carcinomas, adenocarcinomas, sarcomas, gliomas, high grade gliomas, blastomas, neuroblastomas, plasmacytomas, histiocytomas, melanomas, adenomas, hypoxic tumours, myelomas, AIDS-related lymphomas or sarcomas, metastatic cancers, or cancers in general.


A representative but non-limiting list of cancers that the disclosed compositions can be used to diagnose or prognose is the following: lymphoma, B cell lymphoma, T cell lymphoma, mycosis fungoides, Hodgkin's Disease, myeloid leukemia, bladder cancer, brain cancer, nervous system cancer, head and neck cancer, squamous cell carcinoma of head and neck, kidney cancer, lung cancers such as small cell lung cancer and non-small cell lung cancer, neuroblastoma/glioblastoma, ovarian cancer, pancreatic cancer, prostate cancer, skin cancer, liver cancer, melanoma, squamous cell carcinomas of the mouth, throat, larynx, and lung, colon cancer, cervical cancer, cervical carcinoma, breast cancer, and epithelial cancer, renal cancer, genitourinary cancer, pulmonary cancer, esophageal carcinoma, head and neck carcinoma, large bowel cancer, hematopoietic cancers; testicular cancer; colon and rectal cancers, prostatic cancer, or pancreatic cancer.


Compounds disclosed herein may also be used for the diagnosis or prognosis of precancer conditions such as cervical and anal dysplasias, other dysplasias, severe dysplasias, hyperplasias, atypical hyperplasias, and neoplasias.


5. Methods of Identifying a Minimal Intrinsic Gene Set


Disclosed are methods of identifying minimal intrinsic genes. These methods are described in detail above, and generally comprise the following: deriving a minimal intrinsic gene set for making biological classifications of cancer comprising: a) collecting data from multiple samples from the same or different individuals to identify potential intrinsic classifier genes (microarray data can be used in this step, for example); b) weighting intrinsic classifier genes of multiple individuals identified using the method of step a relative to each other and forming classification clusters (weighting can be done, for example, by forming hierarchical clusters); c) estimating the number of clusters formed in step b) and assigning individual samples to clusters; d) identifying genes that optimally distinguish the samples in the assigned groups of step c); e) performing iterative cross-validation with a nearest centroid classifier and overlapping gene sets of various sizes using the genes identified in step d); and f) choosing a gene set which provides the highest class prediction accuracy when compared to the classifications made in step b).


Also disclosed is a method of assigning a sample to an intrinsic subtype, comprising a) creating an intrinsic subtype average profile (centroid) for each subtype; b) individually comparing a new sample to each centroid; and c) assigning the new sample to the centroid that is most similar to the new sample. This is known as the Single Sample Predictor (SSP) method, and is described in further detail in Example 2.


Also disclosed are computerized implementing systems, as well as storage and retrieval systems, of biological information, comprising: a data entry means; a display means; a programmable central processing unit; and a data storage means having expression data for a gene electronically stored; wherein the stored sequences are used as input data for determining which sequence is the best intrinsic gene set for a specific tissue type.


C. COMPOSITIONS

Disclosed are the components to be used to prepare the disclosed compositions as well as the compositions themselves to be used within the methods disclosed herein. These and other materials are disclosed herein, and it is understood that when combinations, subsets, interactions, groups, etc. of these materials are disclosed that while specific reference of each various individual and collective combinations and permutation of these compounds may not be explicitly disclosed, each is specifically contemplated and described herein. For example, if a particular expression control gene is disclosed and discussed and a number of modifications that can be made to a number of molecules including the expression control gene are discussed, specifically contemplated is each and every combination and permutation of expression control gene and the modifications that are possible unless specifically indicated to the contrary. Thus, if a class of molecules A, B, and C are disclosed as well as a class of molecules D, E, and F and an example of a combination molecule, A-D is disclosed, then even if each is not individually recited each is individually and collectively contemplated meaning combinations, A-E, A-F, B-D, B-E, B-F, C-D, C-E, and C-F are considered disclosed. Likewise, any subset or combination of these is also disclosed. Thus, for example, the subgroup of A-E, B-F, and C-E would be considered disclosed. This concept applies to all aspects of this application including, but not limited to, steps in methods of making and using the disclosed compositions. Thus, if there are a variety of additional steps that can be performed it is understood that each of these additional steps can be performed with any specific embodiment or combination of embodiments of the disclosed methods.


1. Sequence Similarities


It is understood that as discussed herein the use of the terms homology and identity mean the same thing as similarity. Thus, for example, if the use of the word homology is used between two non-natural sequences it is understood that this is not necessarily indicating an evolutionary relationship between these two sequences, but rather is looking at the similarity or relatedness between their nucleic acid sequences. Many of the methods for determining homology between two evolutionarily related molecules are routinely applied to any two or more nucleic acids or proteins for the purpose of measuring sequence similarity regardless of whether they are evolutionarily related or not.


In general, it is understood that one way to define any known variants and derivatives or those that might arise, of the disclosed genes and proteins herein, is through defining the variants and derivatives in terms of homology to specific known sequences. This identity of particular sequences disclosed herein is also discussed elsewhere herein. In general, variants of genes and proteins herein disclosed typically have at least, about 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99 percent homology to the stated sequence or the native sequence. Those of skill in the art readily understand how to determine the homology of two proteins or nucleic acids, such as genes. For example, the homology can be calculated after aligning the two sequences so that the homology is at its highest level.


Another way of calculating homology can be performed by published algorithms. Optimal alignment of sequences for comparison may be conducted by the local homology algorithm of Smith and Waterman Adv. Appl. Math. 2: 482 (1981), by the homology alignment algorithm of Needleman and Wunsch, J. MoL Biol. 48: 443 (1970), by the search for similarity method of Pearson and Lipman, Proc. Natl. Acad. Sci. U.S.A. 85: 2444 (1988), by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, Wis.), or by inspection.


The same types of homology can be obtained for nucleic acids by for example the algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger et al. Proc. Natl. Acad. Sci. USA 86:7706-7710, 1989, Jaeger et al. Methods Enzymol 183:281-306, 1989 which are herein incorporated by reference for at least material related to nucleic acid alignment. It is understood that any of the methods typically can be used and that in certain instances the results of these various methods may differ, but the skilled artisan understands if identity is found with at least one of these methods, the sequences would be said to have the stated identity, and be disclosed herein.


For example, as used herein, a sequence recited as having a particular percent homology to another sequence refers to sequences that have the recited homology as calculated by any one or more of the calculation methods described above. For example, a first sequence has 80 percent homology, as defined herein, to a second sequence if the first sequence is calculated to have 80 percent homology to the second sequence using the Zuker calculation method even if the first sequence does not have 80 percent homology to the second sequence as calculated by any of the other calculation methods. As another example, a first sequence has 80 percent homology, as defined herein, to a second sequence if the first sequence is calculated to have 80 percent homology to the second sequence using both the Zuker calculation method and the Pearson and Lipman calculation method even if the first sequence does not have 80 percent homology to the second sequence as calculated by the Smith and Waterman calculation method, the Needleman and Wunsch calculation method, the Jaeger calculation methods, or any of the other calculation methods. As yet another example, a first sequence has 80 percent homology, as defined herein, to a second sequence if the first sequence is calculated to have 80 percent homology to the second sequence using each of calculation methods (although, in practice, the different calculation methods will often result in different calculated homology percentages).


2. Hybridization/Selective Hybridization


The term hybridization typically means a sequence driven interaction between at least two nucleic acid molecules, such as a primer or a probe and a gene. Sequence driven interaction means an interaction that occurs between two nucleotides or nucleotide analogs or nucleotide derivatives in a nucleotide specific manner. For example, G interacting with C or A interacting with T are sequence driven interactions. Typically sequence driven interactions occur on the Watson-Crick face or Hoogsteen face of the nucleotide. The hybridization of two nucleic acids is affected by a number of conditions and parameters known to those of skill in the art. For example, the salt concentrations, pH, and temperature of the reaction all affect whether two nucleic acid molecules will hybridize.


Parameters for selective hybridization between two nucleic acid molecules are well known to those of skill in the art. For example, in some embodiments selective hybridization conditions can be defined as stringent hybridization conditions. For example, stringency of hybridization is controlled by both temperature and salt concentration of either or both of the hybridization and washing steps. For example, the conditions of hybridization to achieve selective hybridization may involve hybridization in high ionic strength solution (6×SSC or 6×SSPE) at a temperature that is about 12-25° C. below the Tm (the melting temperature at which half of the molecules dissociate from their hybridization partners) followed by washing at a combination of temperature and salt concentration chosen so that the washing temperature is about 5° C. to 20° C. below the Tm. The temperature and salt conditions are readily determined empirically in preliminary experiments in which samples of reference DNA immobilized on filters are hybridized to a labeled nucleic acid of interest and then washed under conditions of different stringencies. Hybridization temperatures are typically higher for DNA-RNA and RNA-RNA hybridizations. The conditions can be used as described above to achieve stringency, or as is known in the art. (Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., 1989; Kunkel et al. Methods Enzymol. 1987:154:367, 1987 which is herein incorporated by reference for material at least related to hybridization of nucleic acids). A preferable stringent hybridization condition for a DNA:DNA hybridization can be at about 68° C. (in aqueous solution) in 6×SSC or 6×SSPE followed by washing at 68° C. Stringency of hybridization and washing, if desired, can be reduced accordingly as the degree of complementarity desired is decreased, and further, depending upon the G-C or A-T richness of any area wherein variability is searched for. Likewise, stringency of hybridization and washing, if desired, can be increased accordingly as homology desired is increased, and further, depending upon the G-C or A-T richness of any area wherein high homology is desired, all as known in the art.


Another way to define selective hybridization is by looking at the amount (percentage) of one of the nucleic acids bound to the other nucleic acid. For example, in some embodiments selective hybridization conditions would be when at least about, 60, 65, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 percent of the limiting nucleic acid is bound to the non-limiting nucleic acid. Typically, the non-limiting primer is in for example, 10 or 100 or 1000 fold excess. This type of assay can be performed at under conditions where both the limiting and non-limiting primer are for example, 10 fold or 100 fold or 1000 fold below their kd, or where only one of the nucleic acid molecules is 10 fold or 100 fold or 1000 fold or where one or both nucleic acid molecules are above their kd.


Another way to define selective hybridization is by looking at the percentage of primer that gets enzymatically manipulated under conditions where hybridization is required to promote the desired enzymatic manipulation. For example, in some embodiments selective hybridization conditions would be when at least about, 60, 65, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 percent of the primer is enzymatically manipulated under conditions which promote the enzymatic manipulation, for example if the enzymatic manipulation is DNA extension, then selective hybridization conditions would be when at least about 60, 65, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 percent of the primer molecules are extended. Preferred conditions also include those suggested by the manufacturer or indicated in the art as being appropriate for the enzyme performing the manipulation.


Just as with homology, it is understood that there are a variety of methods herein disclosed for determining the level of hybridization between two nucleic acid molecules. It is understood that these methods and conditions may provide different percentages of hybridization between two nucleic acid molecules, but unless otherwise indicated meeting the parameters of any of the methods would be sufficient. For example if 80% hybridization was required and as long as hybridization occurs within the required parameters in any one of these methods it is considered disclosed herein.


It is understood that those of skill in the art understand that if a composition or method meets any one of these criteria for determining hybridization either collectively or singly it is a composition or method that is disclosed herein.


3. Nucleic Acids


There are a variety of molecules disclosed herein that are nucleic acid based, including for example the nucleic acids that encode, for example, the intrinsic genes disclosed herein (Table 12), as well as various functional nucleic acids. The disclosed nucleic acids are made up of for example, nucleotides, nucleotide analogs, or nucleotide substitutes. Non-limiting examples of these and other molecules are discussed herein. It is understood that for example, when a vector is expressed in a cell, that the expressed mRNA will typically be made up of A, C, G, and U. Likewise, it is understood that if, for example, an antisense molecule is introduced into a cell or cell environment through for example exogenous delivery, it is advantageous that the antisense molecule be made up of nucleotide analogs that reduce the degradation of the antisense molecule in the cellular environment.


a) Nucleotides and Related Molecules


A nucleotide is a molecule that contains a base moiety, a sugar moiety and a phosphate moiety. Nucleotides can be linked together through their phosphate moieties and sugar moieties creating an internucleoside linkage. The base moiety of a nucleotide can be adenin-9-yl (A), cytosin-1-yl (C), guanin-9-yl (G), uracil-1-yl (U), and thymin-1-yl (T). The sugar moiety of a nucleotide is a ribose or a deoxyribose. The phosphate moiety of a nucleotide is pentavalent phosphate. An non-limiting example of a nucleotide would be 3′-AMP (3′-adenosine monophosphate) or 5′-GMP (5′-guanosine monophosphate).


b) Primers and Probes


It is understood that primers and probes can be produced for the actual gene (DNA) or expression product (mRNA) or intermediate expression products which are not fully processed into mRNA. Discussion of a particular gene is also a disclosure of the DNA, mRNA, and intermediate RNA products associated with that particular gene.


Disclosed are compositions including primers and probes, which are capable of interacting with the intrinsic genes disclosed herein, as well as the any other genes or nucleic acids discussed herein, hi certain embodiments the primers are used to support DNA amplification reactions. Typically the primers will be capable of being extended in a sequence specific manner. Extension of a primer in a sequence specific manner includes any methods wherein the sequence and/or composition of the nucleic acid molecule to which the primer is hybridized or otherwise associated directs or influences the composition or sequence of the product produced by the extension of the primer. Extension of the primer in a sequence specific manner therefore includes, but is not limited to, PCR, DNA sequencing, DNA extension, DNA polymerization, RNA transcription, or reverse transcription. Techniques and conditions that amplify the primer in a sequence specific manner are preferred. In certain embodiments the primers are used for the DNA amplification reactions, such as PCR or direct sequencing. It is understood that in certain embodiments the primers can also be extended using non-enzymatic techniques, where for example, the nucleotides or oligonucleotides used to extend the primer are modified such that they will chemically react to extend the primer in a sequence specific manner. Typically the disclosed primers hybridize with the disclosed genes or regions of the disclosed genes or they hybridize with the complement of the disclosed genes or complement of a region of the disclosed genes.


The size of the primers or probes for interaction with the disclosed genes in certain embodiments can be any size that supports the desired enzymatic manipulation of the primer, such as DNA amplification or the simple hybridization of the probe or primer. A typical disclosed primer or probe would be at least 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 61, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1250, 1500, 1750, 2000, 2250, 2500, 2750, 3000, 3500, or 4000 nucleotides long.


In other embodiments the disclosed primers or probes can be less than or equal to 6, 7, 8, 9, 10, 11, 12 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1250, 1500, 1750, 2000, 2250, 2500, 2750, 3000, 3500, or 4000 nucleotides long.


The primers for the disclosed genes in certain embodiments can be used to produce an amplified DNA product that contains the desired region of the disclosed genes. In general, typically the size of the product will be such that the size can be accurately determined to within 10, 5, 4, 3, or 2 or 1 nucleotides.


In certain embodiments this product is at least 20, 21, 22, 23, 24, 25, 27, 28 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 61, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1250, 1500, 1750, 2000, 2250, 2500, 2750, 3000, 3500, or 4000 nucleotides long.


In other embodiments the product is less than or equal to 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 61, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1250, 1500, 1750, 2000, 2250, 2500, 2750, 3000, 3500, or 4000 nucleotides long.


In certain embodiments the primers and probes are designed such that they are targeting as specific region in one of the genes disclosed herein. It is understood that primers and probes having an interaction with any region of any gene disclosed herein are contemplated: In other words, primers and probes of any size disclosed herein can be used to target any region specifically defined by the genes disclosed herein. Thus, primers and probes of any size can begin hybridizing with nucleotide 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or any specific nucleotide of the genes or gene expression products disclosed herein. Furthermore, it is understood that the primers and probes can be of a contiguous nature meaning that they have continuous base pairing with the target nucleic acid for which they are complementary. However, also disclosed are primers and probes which are not contiguous with their target complementary sequence. Disclosed are primers and probes which have at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 500, or more bases which are not contiguous across the length of the primer or probe. Also disclosed are primers and probes which have less than or equal to 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 500, or more bases which are not contiguous across the length of the primer or probe.


In certain embodiments the primers or probes are designed such that they are able to hybridize specifically with a target nucleic acid. Specific hybridization refers to the ability to bind a particular nucleic acid or set of nucleic acids preferentially over other nucleic acids. The level of specific hybridization of a particular probe or primer with a target nucleic acid can be affected by salt conditions, buffer conditions, temperature, length of time of hybridization, wash conditions, and visualization conditions. By increasing the specificity of hybridization means decreasing the number of nucleic acids that a given primer or probe hybridizes to typically under a given set of conditions. For example, at 20 degrees Celsius under a given set of conditions a given probe may hybridize with 10 nucleic acids in a sample. However, at 40 degrees Celsius with all other conditions being equal, the same probe may only hybridize with 2 nucleic acids in the same sample. This would be considered an increase in specificity of hybridization. A decrease in specificity of hybridization means an increase in the number of nucleic acids that a given primer or probe hybridizes to typically under a given set of conditions. For example, at 700 mM NaCl under a given set of conditions a particular probe or primer may hybridize with 2 nucleic acids in a sample, however when the salt concentration is increased to 1 Molar NaCl the primer or probe may hybridize with 6 nucleic acids in the same sample.


The salt can be any salt such as those made from the alkali metals: Lithium, Sodium, Potassium, Rubidium, Cesium, or Francium or the alkaline earth metals: Beryllium, Magnesium, Calcium, Strontium, Barium, or Radiumsodium, or the transition metals: Scandium, Titanium, Vanadium, Chromium, Manganese, Iron, Cobalt, Nickel, Copper, Zinc, Yttrium, Zirconium, Niobium, Molybdenum, Technetium, Ruthenium, Rhodium, Palladium, Silver, Cadmium, Hafnium, Tantalum, Tungsten, Rhenium, Osmium, Iridium, Platinum, Gold, Mercury, Rutherfordium, Dubniuni, Seaborgium, Bohrium, Hassium, Meitnerium, Ununnilium, Unununium or Unuribium at any molar strength to promoter the desired condition, such as 1, 0.7, 0.5, 0.3, 0.2, 0.1, 0.05, or 0.02 molar salt, ha general increasing salt concentration decreases the specificity of a given probe or primer for a given target nucleic acid and decreasing the salt concentration increases the specificity of a given probe or primer for a given target nucleic acid.


The buffer conditions can be any buffer such as TRIS at any pH, such as 5.0, 5.5, 6.0, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.1, 6.8, 6.9, 7.0, 7.1, 7.2, 7.3, 7.4, 7.5, 7.6, 7.7, 7.8, 7.9, 8.0, 8.5, or 9.0. In general pHs above or below 7.0 increase the specificity of hybridization.


The temperature of hybridization can be any temperature. For example, the temperature of hybridization can occur at 20°, 21°, 22°, 23°, 24°, 25°, 26°, 27°, 28°, 29°, 31°, 32°, 33°, 34°, 35°, 36°, 37°, 38°, 39°, 40°, 41°, 42°, 43°, 44°, 45°, 46°, 47°, 48°, 49°, 50°, 51°, 52°, 53°, 54°, 55°, 56°, 57°, 58°, 59°, 60°, 61°, 62°, 63°, 64°, 65°, 66°, 67°, 68°, 69°, 70°, 81°, 82°, 83°, 84°, 85°, 86°, 87°, 88°, 89°, 90°, 91°, 92°, 93°, 94°, 95°, 96°, 97°, 98°, or 99° Celsius.


The length of time of hybridization can be for any time. For example, the length of time can be for 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 120, 150, 180, 210, 240, 270, 300, 360, minutes or 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 30, 36, 48 or more hours.


It is understood that any wash conditions can be used including no wash step. Generally the wash conditions occur by a change in one or more of the other conditions designed to require more specific binding, by for example increasing temperature or decreasing the salt or changing the length of time of hybridization.


It is understood that there are a variety of visualization conditions which have different levels of detection capabilities. Li general any type of visualization or detection system can be used. For example, radiolabeling or fluorescence labeling can be used and in general fluorescence labeling would be more sensitive, meaning a fewer number of absolute molecules would have to be present to be detected.


c) Sequences


There are a variety of sequences related to the intrinsic genes as well as the others disclosed herein and others are herein incorporated by reference in their entireties as well as for individual subsequences contained therein. A specific intrinsic gene set can be found in Table 12.


4. Kits


Disclosed are kits comprising nucleic acids which can be used in the methods disclosed herein and, for example, buffers, salts, and other components to be used in the methods disclosed herein. Disclosed are kits for identifying minimal intrinsic gene sets comprising nucleic acids, such as in a microarray. Also disclosed are specific minimal intrinsic genes used for classifying cancer, such as those found in Table 21. As described above, these intrinsic genes can be used in any combination or permutation, and any combination of permutation of these genes can be used in a kit. Also disclosed are kits comprising instructions.


5. Chips and Micro Arrays


Disclosed are chips where at least one address is the sequences or part of the sequences set forth in any of the nucleic acid sequences disclosed herein.


Also disclosed are chips where at least one address is a variant of the sequences or part of the sequences set forth in any of the nucleic acid sequences disclosed herein.


6. Computer Readable Mediums


Those of skill in the art understand how to display and express any nucleic acid or protein sequence in any of the variety of ways that exist, each of which is considered herein disclosed. Specifically contemplated herein is the display of these sequences on computer readable mediums, such as, commercially available floppy disks, tapes, chips, hard drives, compact disks, and video disks, or other computer readable mediums. Also disclosed are the binary code representations of the disclosed sequences. Those of skill in the art understand what computer readable mediums. Thus, computer readable mediums on which the nucleic acids or protein sequences are recorded, stored, or saved.


Disclosed are computer readable mediums comprising the sequences and information regarding the sequences set forth herein.


D. METHODS OF MAKING THE COMPOSITIONS

The compositions disclosed herein and the compositions necessary to perform the disclosed methods can be made using any method known to those of skill in the art for that particular reagent or compound unless otherwise specifically noted.


1. Nucleic Acid Synthesis


For example, the nucleic acids, such as, the oligonucleotides to be used as primers can be made using standard chemical synthesis methods or can be produced using enzymatic methods or any other known method. Such methods can range from standard enzymatic digestion followed by nucleotide fragment isolation (see for example, Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Edition (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989) Chapters 5, 6) to purely synthetic methods, for example, by the cyanoethyl phosphoramidite method using a Milligen or Beckman System 1Plus DNA synthesizer (for example, Model 8700 automated synthesizer of Milligen-Biosearch,


Burlington, Mass. or ABI Model 380B). Synthetic methods useful for making oligonucleotides are also described by Dcuta et al., Ann. Rev. Biochem. 53:323-356 (1984), (phosphotriester and phosphite-triester methods), and Narang et al., Methods EnzymoL, 65:610-620 (1980), jfiosp otf es er metKo). Protein nucleic acid molecules can be made using known methods such as those described by Nielsen et ah, Bioconjug. Chem. 5:3-7 (1994).


E. METHODS OF USING THE COMPOSITIONS

1. Methods of Using the Compositions as Research Tools


The disclosed compositions can be used in a variety of ways as research tools. The compositions can be used for example as targets in combinatorial chemistry protocols or other screening protocols to isolate molecules that possess desired functional properties related to the disclosed genes.


The disclosed compositions can also be used diagnostic tools related to diseases, such as cancers, such as those listed herein.


The disclosed compositions can be used as discussed herein as either reagents in micro arrays or as reagents to probe or analyze existing microarrays. The disclosed compositions can be used in any known method for isolating or identifying single nucleotide polymorphisms. The compositions can also be used in any method for determining allelic analysis of for example, the genes disclosed herein. The compositions can also be used in any known method of screening assays, related to chip/micro arrays. The compositions can also be used in any known way of using the computer readable embodiments of the disclosed compositions, for example, to study relatedness or to perform molecular modeling analysis related to the disclosed compositions.


Throughout this application, various publications are referenced. The disclosures of these publications in their entireties are hereby incorporated by reference into this application in order to more fully describe the state of the art to which this pertains. The references disclosed are also individually and specifically incorporated by reference herein for the material contained in them that is discussed in the sentence in which the reference is relied upon.


F. EXAMPLES

The following examples are put forth so as to provide those of ordinary skill in the art with a complete disclosure and description of how the compounds, compositions, articles, devices and/or methods claimed herein are made and evaluated, and are intended to be purely exemplary and are not intended to limit the disclosure. Efforts have been made to ensure accuracy with respect to numbers (e.g., amounts, temperature, etc.), but some errors and deviations should be accounted for. Unless indicated otherwise, parts are parts by weight, temperature is in ° C. or is at ambient temperature, and pressure is at or near atmospheric.


1. Example 1
Biological Classification of Breast Cancer by Real-Time Quantitative RT-PCR: Comparisons to Microarray and Histopathology

a) Methods


Patient selection. An ethnically diverse cohort of patients were studied using samples collected from various locations throughout the United States. Tissues analyzed included 117 invasive breast cancers, 1 fibroadenoma, 5 “normal” samples (from reduction mammoplasty), and 3 cells lines. Patients were heterogeneously treated in accordance with the standard of care dictated by their disease stage, ER and HER2 status. Patients were censored for recurrence and/or death for up to 118 months (median 21.5 months). Clinical data presented in supplementary Table 7.


Sample preparation and first strand synthesis for qRT-PCR. Nucleic acids were extracted from fresh frozen tissue using RNeasy Midi Kit (Qiagen Inc., Valencia, Calif.). The quality of RNA was assessed using the Agilent 2100 Bioanalyzer with the RNA 6000 Nano LabChip Kit (Agilent Technologies, Palo Alto, Calif.). All samples used had discernable 18S and 28S ribosomal peaks. First strand cDNA was synthesized from approximately 1.5 mg total RNA using 500 ng Oligo(dT)12-18 and Superscript III reverse transcriptase (1st Strand Kit, Invitrogen, Carlsbad, Calif.). The reaction was held at 42° C. for 50 min followed by a 15-min step at 70° C. The cDNA was washed on a QIAquick PCR purification column and stored at −80° C. in TE' (25 mM Tris, 1 mM EDTA) at a concentration of 5 ng/ul (concentration estimated from the starting RNA concentration used in the reverse transcription).


Primer design. Genbank sequences were downloaded from Evidence viewer (NCBI website) into the Lightcycler Probe Design Software (Roche Applied Science, Indianapolis, Ind.). All primer sets were designed to have a Tm>>60° C., GC content>>50% and to generate a PCR amplicon <200 bps. Finally, BLAT and BLAST searches were performed on primer pair sequences using the UCSC Genome Bioinformatics (http://genome.ucsc.edu/) and NCBI (http://www.ncbi.nhn.nih.gov/BLAST/) to check for uniqueness. Primer sets and identifiers are provided in supplementary Table 8.


Real-time PCR. For PCR, each 20 μL reaction included IX PCR buffer with 3 mM MgCl2 (Idaho Technology Inc., Salt Lake City, Utah), 0.2 mM each of dATP, dCTP, and dGTP, 0.1 mM dTTP, 0.3 mM dUTP (Roche, Indianapolis, Ind.), 10 ng cDNA and IU Platinum Taq (Invitrogen, Carlsbad, Calif.). The dsDNA dye SYBR Green I (Molecular Probes, Eugene, Oreg.) was used for all quantification (1/50000 final). PCR amplifications were performed on the Lightcycler (Roche, Indianapolis, Ind.) using an initial denaturation step (94° C., 90 sec) followed by 5 cycles: denaturation (94° C., 3 sec), annealing (58° C., 5 sec with 20° C./s transition), and extension (72° C., 6 sec with 2° C./sec transition). Fluorescence (530 nm) from the dsDNA dye SYBR Green I was acquired each cycle after the extension step. Specificity of PCR was determined by post-amplification melting curve analysis. Reactions were automatically cooled to 60° C. at a rate of 3° C./s and slowly heated at 0.1° C./s to 95° C. while continuously monitoring fluorescence.


Relative quantification by RT-PCR. Quantification was performed using the LightCycler 4.0 software. The crossing threshold (Ct) for each reaction was determined using the 2nd derivative maximum method (Wittwer et al. (2004) Washington, D.C.: ASM Press; Rasmussen (2001) Heidelberg: Springer Verlag. 21-34). Relative copy number was calculated using an external calibration curve to correct for PCR efficiency and a within run calibrator to correct for the variability between run. The calibrator is made from 4 equal parts of RNA from 3 cell lines (MCF7, SKBR3, ME16C) and Universal Human Reference RNA (Stratagene, La Jolla, Calif., Cat #740000). Differences in cDNA input were corrected by dividing target copy number by the arithmetic mean of the copy number for 3 housekeeper genes (MRPL1 9, PSMC4, and PUM1) (Szabo et al. (2004) Genome Biol 5:R59). The normalized relative gene copy number was Iog2 transformed and analyzed by hierarchical clustering using Cluster (Eisen et al. (1998) Proc Natl Acad Sci USA 95:14863-14868). The clustering was visualized using Treeview software (Eisen Lab, http:/rana.lbl.gov/EisenSoftware.htm).


Microarray experiments. The same 126 samples used for qRT-PCR were analyzed by microarray (Agilent Human oligonucleotide). Total RNA was prepared and quality checked as described above. Labeling and hybridization of RNA for microarray was done using the Agilent low RNA input linear amplification kit (http://www.chem.agilent.com/Scripts/PDS.asp?1Page=10003), but with one-half the recommended reagent volumes and using a Qiagen PCR purification kit to clean up the cRNA. Each sample was assayed versus a common reference sample that was a mixture of Stratagene's Human Universal Reference total RNA (100 ug) enriched with equal amounts of RNA (0.3 μg each) from MCF7 and ME16C cell lines. Microarray hybridizations were carried out on Agilent Human oligonucleotide microarrays (1 A-v1, 1 A-v2 and custom designed 1 A-v1 based microarrays) using 2 μg each of Cy3-labeled “reference” and Cy5-labeled “experimental” sample. Hybridizations were done using the Agilent hybridization kit and a Robbins Scientific “22 k chamber” hybridization oven. The arrays were incubated overnight and then washed once in 2×SSC and 0.0005% triton X-102 (10 min), twice in 0.1×SSC (5 min), and then immersed into Agilent Stabilization and Drying solution for 20 seconds. All microarrays were scanned using an Axon Scanner 4000A. The image files were analyzed with GenePix Pro 4.1 and loaded into the UNC Microarray Database at the University of North Carolina at Chapel Hill (https://genome.unc.edu/) where a lowess normalization procedure was performed to adjust the Cy3 and Cy5 channels (Yang et al. (2002) Nucleic Acids Res 30:e15). All primary microarray data associated with this study are available at the UNC Microarray Database and have been deposited into the GEO (http://www.ncbi.nlm.nih.gov/geo/) under the accession number of GSE1992, series GSM34424-GSM34568.


Selecting genes for real-time qRT-PCR. A new “intrinsic” gene set for classifying breast tumors was derived using 45 before and after therapy samples from the combined data sets presented in Sorlie et al. (see Table 9 for the list of 45 pairs). The two-color DNA microarray data sets were downloaded from the internet and the R/G ratio (experimental/reference) for each spot was normalized and Iog2 transformed. Missing values were imputed using the k-NN imputation algorithm described by Troyanskaya et al. (Troyanskaya et al. (2001) Bioinformatics 17:520-525). The “intrinsic” analysis identified 550 gene elements.


Next, a completely independent data set was utilized (van't Veer et al. 2002) to derive an optimized version of the 550 intrinsic gene list. To allow across data set analyses, gene annotation from each dataset was translated to UniGene Cluster IDs (UCID) using the SOURCE database (Diehn et al. (2003) Nucleic Acids Res 31:219-223). Following the alogorithm outlined by Tibshirani and colleagues (Bair et al. (2004) PLoS Biol 2:E108; Bullinger et al. (2004) N Engl J Med 350:1605-1616), the 97 samples from the van't Veer et al. 2002 study were hierarchical clustered using a common set of 350 genes, and assigned an “intrinsic subtype of either Luminal, HER2+/ER−, Basal-like, or Normal-like to each sample. A feature/gene selection was then performed to identify genes that optimally distinguished these 4 classes using a version of the gene selection method first described by Dudoit et al. (Genome Biol 3:RESEARCH0036), where the best class distinguishers are identified according to the ratio of between-group to within-group sums of squares (a type of ANOVA). In addition to statistically selecting “intrinsic” classifiers proliferation genes (e.g., TOP2A, KI-67, PCNA) were also chosen, and other important prognostic markers (e.g., PgR) that have potential for diagnostics. In total, 53 differentially expressed biomarkers were used in the real-time qRT-PCR assay (Table 8).


Combining microarray and qRT-PCR datasets. Distance Weighted Discrimination (DWD) was used to identify and correct systematic biases across the microarray and qRT-PCR datasets (Benito et al. (2004) Bioinformatics 20:105-114). Prior to DWD, each dataset was normalized by setting the mean to zero and the variance to one. Normalization was done within each microarray experiment and for genes profiled across many experimental runs for real-time qRT-PCR. After DWD, genes in common between the datasets were clustered using Spearman correlation and average linkage association.


Receiver operator curves. In order to determine agreement between protein expression (immunohistochemistry) and gene expression (qRT-PCR), a cut-off for relative gene copy number was selected by minimizing the sum of the observed false positive and false negative errors. That is, minimizing the estimated overall error rate under equal priors for the presence/absence of the protein. The sensitivity and specificity of the resulting classification rule were estimated via bootstrap adjustment for optimism (Efron et al. (1998) CRC Press LLC. p 247 pp).


Survival analyses. Survival curves were estimated by the Kaplan-Meier method and compared via a log-rank or stratified log-rank test as appropriate. Standard clinical pathological parameters of age (in years), node status (positive vs. negative), tumor size (cm, as a continuous variable), grade (1-3, as a continuous covariate), and ER status (positive vs. negative) were tested for differences in RFS and OS using Cox proportional hazards regression model. Pairwise log-rank tests were used to test for equality of the hazard functions among the intrinsic classes. Only the classes Luminal, HER2+/ER−, and Basal-like classes were included in the analyses because it was believed the Normal Breast-like subtype is not a pure tumor class and may result from normal breast contamination. Cox regression was used to determine predictors of survival from continuous expression data. All statistical analyses were performed using the R statistical software package (R Foundation for Statistical Computing).


b) Results


Recapitulating microarray breast cancer classifications by qRT-PCR. 126 different breast tissue samples (117 invasive, 5 normal, 1 fibroadenoma, and 3 cell lines) were expression profiled using a real-time qRT-PCR assay comprised of 53 biological classifiers and 3 control/housekeepers genes. Genes were statistically selected to optimally identify the 4 main breast tumor intrinsic subtypes, and to create an objective gene expression predictor for cell proliferation and outcome (Ross et al. (2000) Nat Genet 24:227-235). There were 402 genes in common between this microarray dataset and the 550 “intrinsic” genes selected from the Sorlie et al. 2003 study. Two-way hierarchical clustering of the 402 genes in the microarray gave the same tumor subtypes as the minimal 37 “intrinsic” genes assayed by qRT-PCR (FIG. 4). The samples were grouped into Luminal, HER2+/ER−, Normal-like, and Basal-like subtypes. Out of 123 breast samples compared across the platforms, 114 (93%) were classified the same. The minimal “intrinsic” gene set identified expression signatures within the 3 different cell lines that were characteristic of each tumor subtype: Luminal (MCF7), HER2+/ER− (SKBR3), and Basal-like (ME1 6C). The genes EGFR and PgR, which were added for their predictive and prognostic value in breast cancer Nielsen et al. (2004) Clin Cancer Res 10:5367-5374; Makretsov et al. (2004) Clin Cancer Res 10:6143-6151), had opposite expression and were found to associate with either ER-positive tumors (high expression of PgR) or ER-negative tumors (high expression of EGFR) (FIG. 4C).


Proliferation and grade. Expression of the 14 “proliferation” genes (FIG. 4D) assayed by qRT-PCR showed that Luminal tumors have relatively low replication activity compared to HER2+/ER− and Basal-like tumors. As expected, the Normal-like samples showed the lowest expression of the “proliferation” genes. When correlating (Spearman correlation) the gene expression of all 53 genes with grade, it was found that the top 3 proliferation genes with a positive correlation (i.e., high expression correlates with high grade) were the proliferation genes CENPF (p=2.00E-07), BUB1 (p=6.84E-07), and STK6 (p=2.67E-06) (see supplementary Table 10). Interestingly, all the proliferation genes, except PCNA, were at the top of the list for having a positive correlation to grade. Conversely, the top markers with significant negative correlations with grade (i.e., low expression correlates with high grade) were GATA3 (p=3.53E-07), XBP1 (p=9.64E-06), and ESR1 (p=4.53E-05).


Agreement between immunohistochemistry, qRT-PCR “intrinsic” classifications, and gene expression. Fifty out of fifty-five (91%) Luminal tumors with IHC data were scored positive for ER. Conversely, 50 out of 56 (89%) tumors classified as HER2+/ER− or Basal-like were negative for ER by IHC. Cluster analysis showed that the Luminal tumors co-express ER and estrogen responsive genes such as LIV1/SLC39A6, X-box binding protein 1 (XBP1), and hepatocyte nuclear factor 3a (HNF3A/FOXA1). The gene with the highest correlation in expression to ESR1 was GATA3 (0.79, 95% CI: 0.71-0.85). It was found that the gene expression of ESR1 alone had 88% sensitivity and 85% specificity for calling ER status by IHC, and GATA3 alone showed 79% sensitivity and 88% specificity (FIG. 5A). In addition, gene expression of PgR correlated well with PR IHC status (sensitivity=89%, specificity=82%) (FIG. 5B). The data showed a very high correlation in expression between HER2/ERBB2 and GRB7 (0.91, 95% CI: 0.87-0.94), which are physically located near one another and are commonly overexpressed and DNA amplified together (Pollack et al. (1999) Nature Genetics 23:41-46; Pollack et al. (2002) Proc Natl Acad Sci USA 99:12963-12968). However, neither ERBB2 (sensitivity=91%, specificity=54%) nor GRB7 (sensitivity=52% specificity=78%) gene expression had both high sensitivity and specificity for predicting HER2 status by IHC (FIG. 5C).


Reproducibility of qRT-PCR. The run-to-run variation in Cp (cycle number determined from fluorescence crossing point) for all 56 genes (53 classifiers and 3 housekeepers) was determined from 8 runs. The median CV (standard deviation/mean) for all the genes was 1.15% (0.28%-6.55%) and 51/56 genes (91%) had a CV<2%. The reproducibility of the classification method is illustrated from the observation that replicates of the same sample (UB57A&B and UB60A&B), cluster directly adjacent to one another. Notably, the replicates were from separate RNA/cDNA preparations done on different pieces of the same tumor.


Survival Predictors. The clinical significance of individual markers and “intrinsic” subtypes were analyzed using qRT-PCR data. Patients with Luminal tumors showed significantly better outcomes for relapse-free survival (RPS) and overall survival (OS) compared to HER2+/ER− (RFS: p=0.023; OS: p=0.003) and Basal-like (RFS: ρ=0.065; OS: p=0.002) tumors (FIG. 6). This difference in outcome was significant for overall survival even after adjustment for stage (HER2+/ER−: p=0.043; Basal-like: p=0.001). There was no difference in outcome between patients with HER2+/ER− and Basal-like tumors. Analysis of the same cohort using standard clinical pathological information shows that stage, tumor size, node status, and ER status were prognostic for RFS and OS.


Using a Cox proportional hazards model to find biomarkers from the qRT-PCR data that predict survival, it was found that high expression of the proliferation genes GTBP4 (ρ=0.011), HSPA14 (p=0.023), and STK6 (ρ=0.027) were significant predictors of RFS independent of grade and stage (FIG. 7). The only proliferation gene significant for OS after correction for grade and stage was GTBP4 (p=0.011). Overall, the best predictor for both RFS (p=0.004) and OS (ρ=0.004) independent of grade and stage was SMA3 (Table 10).


Co-clustering qRT-PCR and Microarray Data. In order to determine if qRT-PCR and microarray data could be analyzed together in a single dataset, DWD was used to combine data for 50 genes and 126 samples profiled on both platforms (252 samples total). Hierarchical clustering of these data show that 98% (124/126) of the paired samples classified in the same group and 83/126 (66%) clustered directly adjacent to their corresponding partner (FIG. 10). Thus, DNA microarray and real-time qRT-PCR can be combined into a seamless dataset without sample segregation based on platform. Overall, the correlation between microarray and qRT-PCR expression data was 0.76 (95% CI: 0.75, 0.77) before DWD and 0.77 (95% CI: 0.76, 0.78) after DWD (FIG. 5). The DWD does not significantly effect the correlation but corrects for systematic biases between the platforms.


c) Discussion


Gene expression analyses can identify differences in breast cancer biology that are important for prognosis. However, a major challenge in using genomics for diagnostics is finding biomarkers that can be reproducibly measured across different platforms and that provide clinically significant classifications on different patient populations. Using microarray data, 402 “intrinsic” genes were identified that classify breast cancers based on vastly different expression patterns. This “intrinsic” gene set was shown to provide the same classifications when applied to a completely new and ethnically diverse population. Furthermore, the microarray dataset can be minimized to 37 “intrinsic” genes, translated into a real-time qRT-PCR assay, and provide the same classifications as the larger gene set. Molecular classifications using the “intrinsic” qRT-PCR assay agree with standard pathology and are clinically significant for prognosis. Thus, biological classifications based on “intrinsic” genes are robust, reproducible across different platforms, and can be used for breast cancer diagnostics.


The greatest contribution genomic assays have made towards clinical diagnostics in breast cancer has been in identifying risk of recurrence in women with early stage disease. For instance, MammaPrint™ is a microarray assay based on the 70 gene prognosis signature originally identified by van't Veer et al. On the test set validation, the 70 gene assay found that individuals with a poor prognostic signature had approximately a 50% chance of remaining free of distant metastasis at 10 years while those with a good-prognostic signature had a 85% chance of remaining free of disease. Another assay with similar utility is Oncotype Dx (Genomic Health Inc)—a real-time qRT-PCR assay that uses 16 classifiers to assess if patients with ER positive tumors are at low, intermediate, or high risk for relapse. While recurrence can be predicted with high and low risk tumors, patients in the intermediate risk group still have variable outcomes and need to be diagnosed more accurately.


In general, tumors that have a low risk of early recurrence are low grade and have low expression of proliferation genes. Due to the correlation of proliferation genes with grade and their significance in predicting outcome, a group of 14 proliferation genes were assayed. While the classic proliferation markers TOP2A and MKI67 significantly correlated with grade in the cohort, they were not near the top of the list. Furthermore, PCNA did not significantly correlate with grade (p=0.11) in the cohort. This could result from PCR primer design or differences between RNA and protein stability. Nevertheless, the proliferation gene that was found had the highest correlation to grade was CENPF (mitosin); another commonly used mitotic marker that has been shown to correlate with grade and outcome in breast cancer (Clark et al. (1997) Cancer Res 57:5505-5508). Since tumor grade and the mitotic index have been shown to be important in predicting risk of relapse (Chia et al. (2004) J Clin Oncol 22:1630-1637; Manders et al. (2003) Breast Cancer Res Treat 77:77-84), it is not surprising that 4 (GTBP4, HSPA14, STK6/15, BUB1) out the top 5 predictors for RFS (independent of stage) were proliferation genes. The proliferation gene that was the best predictor of RFS was GTBP4, a GTP-binding protein implicated in chronic renal disease and shown to be upregulated after serum administration (i.e., serum response gene) (Laping et al. (2001) J Am Soc Nephrol 12:883-890). Overall, the best predictor for both RFS (ρ=0.004) and OS (p=0.004) independent of grade and stage was SMA3. The role of SMA3 in the pathogenesis of breast cancer is still unclear, although it has also been associated with the BCL2 anti-apoptotic pathway (Iwahashi et al. (1997) Nature 390:413-417).


2. Example 2


A New Breast Tumor Intrinsic Gene List Identifies Novel Characteristics that are Conserved Across Microarray Platforms

A training set of 105 tumors were used to derive a new breast tumor “intrinsic” gene list and validated it using a combined test set of 315 tumors compiled from three independent microarray studies. An unchanging Single Sample Predictor was also used, and applied to three additional test sets. The Mrinsic/UNC gene set identified a number of findings not seen in previous analyses including 1) significance in multivariate testing, 2) that the proliferation signature is an intrinsic property of tumors, 3) the high expression of many Kallikrein genes in Basal-like tumors, and 4) the expression of the Androgen Receptor within the HER2+/ER− and Luminal tumor subtypes. The Single Sample Predictor that was based upon subtype average profiles, was able to identity groups of patients within a test set of local therapy only patients, and two independent tamoxifen-treated patient sets, which showed significant differences in outcomes. The analyses demonstrates that the “intrinsic” subtypes add value to the existing repertoire of clinical markers used for breast cancer patients. The computation approach also provides a means for quickly validating gene expression profiles using publicly available data.


Breast cancers represent a spectrum of diseases comprised of different tumor subtypes, each with a distinct biology and clinical behavior. Despite this heterogeneity, global analyses of primary breast tumors using microarrays have identified gene expression signatures that characterize many of the essential qualities important for biological and clinical classification. Using cDNA microarrays, five distinct subtypes of breast tumors arising from at least two distinct cell types (basal-like and luminal epithelial cells) were previously identified (Perou et al. 2000; Sorlie et al. 2001; Sorlie et al. 2003). This molecular taxonomy was based upon an “intrinsic” gene set, which was identified using a supervised analysis to select genes that showed little variance within repeated samplings of the same tumor, but which showed high variance across tumors (Perou et al. 2000). An intrinsic gene set reflects the stable biological properties of tumors and typically identifies distinct tumor subtypes that have prognostic significance, even though no knowledge of outcome was used to derive this gene set.


315 breast tumor samples compiled from publicly available microarray data were generated on different microarray platforms. These analyses show for the first time, that the breast tumor intrinsic subtypes are significant predictors of outcome when correcting for standard clinical parameters, and that common patterns of expression and outcome predictions can be identified when comparing data sets generated by independent labs.


a) Methods


Tissue samples, RNA preparations and microarray protocols. 105 fresh frozen breast tumor samples and 9 normal breast tissue samples were used as the training set and were obtained from 4 different sources using IRB approved protocols from each participating institution: the University of North Carolina at Chapel Hill, The University of Utah, Thomas Jefferson University and the University of Chicago. Thus, this sample set represents an ethnically diverse cohort from different geographic regions in the US with the clinical and microarray data for samples provided in Table 11. Patients were heterogeneously treated in accordance with the standard of care dictated by their disease stage, ER and HER2 status. The 105 patient training data set had a median follow up of 19.5 months, while the 315 sample combined test set had a median follow up of 74.5 months. Finally, another 16 tamoxifen-treated patient tumor samples were included that were used for the Single Sample Predictor additional test set analysis (tamoxifen-treated set #2).


Total RNA was purified from each sample using the Qiagen RNeasy Kit according to the manufacturer's protocol (Qiagen, Valencia Calif.) and using 10-50 milligram of tissue per sample. The integrity of the RNA was determined using the RNA 6000 Nano LabChip Kit and an Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, Calif.). The total RNA labeling and hybridization protocol used is described in the Agilent low RNA input linear amplification kit (http://www.chem.agilent.com/Scripts/PDS.asp?1 Page=10003) with the following modifications: 1) a Qiagen PCR purification kit was used to clean up the cRNA and 2) all reagent volumes were cut in half. Each sample was assayed versus a common reference sample that was a mixture of Stratagene's Human Universal Reference total RNA (Novoradovskaya et al. 2004) (100 ug) enriched with equal amounts of RNA (0.3 μg each) from MCF7 and ME16C cell lines. Microarray hybridizations were carried out on Agilent Human oligonucleotide microarrays (1A-v1, 1A-V2 and custom designed 1A-v1 based microarrays) using 2 μg of Cy3-labeled Reference and 2 μg of Cy5-labeled experimental sample. Hybridizations were done using the Agilent hybridization kit and a Robbins Scientific “22 k chamber” hybridization oven. The arrays were incubated overnight and then washed once in 2×SSC and 0.0005% triton X-105 (10 min), twice in 0.1×SSC (5 min), and then immersed into Agilent Stabilization and Drying solution for 20 seconds. AU microarrays were scanned using an Axon Scanner GenePix 4000B. The image files were analyzed with GenePix Pro 4.1 and loaded into the UNC Microarray Database at the University of North Carolina at Chapel Hill (https://genome.unc.edu/) where a Lowess normalization procedure was performed to adjust the Cy3 and Cy5 channels (Yang et al. 2002). All primary microarray data associated with this study are available at https://genome.unc.edu/pubsup/breastTumor/ and have been deposited into the GEO (http://www.ncbi.nlm.nih.gov/geo/) under the accession number of GSE1992, series GSM34424-GSM34568.


Intrinsic gene set analysis. A new breast tumor intrinsic gene set was derived, called the “Intrinsic/UNC” list using 105 patients (146 total arrays) and 15 repeated tumor samples that were different physical pieces (and RNA preparations) of the same tumor, 9 tumor-metastasis pairs and 2 normal sample pairs (26 paired samples in total, Table 11). This sample size was chosen based upon Basal-like, Luminal A, Luminal B, HER2+/ER−, and Normal-like samples, which occur at a frequency of 15%, 40%, 15%, 20%, and 10%, respectively; and it was estimated that most clinically relevant classes would constitute at least 10% of the affected population, and it was hoped to acquire at least 10 samples from each class in the new data set. Therefore, a sample size of 100 tumors was deemed adequate to identify most classes that might be present in breast cancer patients.


The background subtracted, Lowess normalized Iog2 ratio of Cy5 over Cy3 intensity values were first filtered to select genes that had a signal intensity of at least 30 units above background in both the Cy5 and Cy3 channels. Only genes that met these criteria in at least 70% of the 146 microarrays were included for subsequent analysis. Next, an “intrinsic” analysis was performed as described in Sorlie et al. 2003 (Sorlie et al. 2003) using the 26 paired samples and 86 additional microarrays. An intrinsic analysis identifies genes that have low variability in expression within paired samples and high variability in expression across different tumors; for an intrinsic analysis, each gene receives a score that is the average “within-pair variance” (the average square before/after difference), as well as the “between-subject variance” (the variance of the pair averages across subjects). The ratio D=(within-pair variance)/(between-subject variance) was then computed, and those genes with a small value of D (i.e. cut-off) declared to be “intrinsic”. The choice of a value of D was set at one standard deviation below the mean intrinsic score of all genes. This analysis resulted in the selection of 1410 microarray elements representing 1300 genes. In order to obtain an estimate of the number of false-positive intrinsic genes, the sample labels were permuted to generate 26 random pairs and 86 non-paired samples. This permutation was performed 100 times and the intrinsic scores were calculated for each. These permuted scores were used to determine a threshold on the intrinsic score corresponding to a false discovery rate less than 1%. The selected threshold resulted in 1410 microarray features being called significant with a FDR=0.3% and the 90th percentile FDR=0.5%. (See Tusher et al. for a complete description of this calculation (Tusher et al. 2001)).


These 1410 microarray elements were then used to perform a two-way average linkage hierarchical cluster analysis using a centered Pearson correlation metric and the program “Cluster” (Eisen et al. 1998), with the data being displayed relative to the median expression for each gene (i.e. median centering of the rows/genes). The cluster results were then visualized using “Treeview”.


Combined test set analysis. The two-color DNA microarray data sets of Sorlie et al. 2001 and 2003 van't Veer et al. and Sotiriou et al. (Sotiriou et al. 2003) were each downloaded from the internet and pre-processed similarly. Briefly, pre-processing included Iog2 transformation of the R/G ratio and then Lowess normalization of the data set (Yang et al. 2002 J. Next, missing values were imputed using the k-NN imputation algorithm described by Troyanskaya et al. (Troyanskaya et al. 2001). Gene annotation from each dataset was translated to UniGene Cluster IDs (UCTD) using the SOURCE database (Diehn et al. 2003), which gave a common gene set of approximately 2800 genes that were present across all four data sets. UniGene was chosen because a majority of the identifiers from each dataset could be easily mapped to a UniGene identifier (Build 161). Multiple occurrences of a UCDD were collapsed by taking the median value for that E) within each experiment and platform. Next, Distance Weighted Discrimination was performed in a pair-wise fashion by first combining the Sorlie et al. data set with the Sotiriou et al. data set, and then combining this with the van't Veer et al. data to make a single data set. In the final step of pre-processing, each individual experiment (microarray) was normalized by setting the mean to zero and its variance to one. The data for 306 of the 1300 Intrinsic/UNC genes was present in the combined test set and was used in a two-way average linkage hierarchical cluster analysis across the set of 315 microarrays as described above.


Single Sample Predictor. The Single Sample Predictor/SSP is based upon the Nearest Centroid method presented in (Hastie et al. 2001). More specifically, the combined test set was utilized, and 306 Intrinsic/UNC gene set hierarchical cluster presented in FIG. 14, as the starting point to create five Subtype Mean Centroids. A mean vector (centroid) for each of the five intrinsic subtypes (LumA, LumB, HER2+/ER−, Basal-like and Normal Breast-like) was created by averaging the gene expression profiles for the samples clearly assigned to each group (which limited the analysis to 249 samples total); the hierarchical clustering dendrogram in FIG. 14 were used as a guide for deciding those samples to group together. Next, using the 249 samples and 306 genes as a new training set (see FIG. 11), the SSP was applied back onto this data set (only the 249 samples) using Spearman correlation (which will calculate a training set error rate) and assigned a sample to the subtype to which it was most similar. This analysis showed 92% concordance with the clustering based subtype assignments.


Three additional test data sets were then analyzed: First the 60 sample data set of Ma et al. (Ma et al. 2004) was taken, which is an already pre-processed data set of Iog2 transformed ratios (GEO GSE1379), and performed a DWD correction using the 278 genes that were in common between the Ma et al. data set and the set of 306 Intrinsic/UNC genes used in the SSP. The SSP was applied to the 60 Ma et al. samples and, using Spearman correlation, each of the 60 samples were assigned to an intrinsic subtype based upon the highest correlation value to a centroid. Next, 220 samples from Chang et al. (Chang et al. 2005) were analyzed and 16 additional samples from UNC that were not used in the training set. The 220 samples represent an extension of the sample set presented in van't Veer et al. (van't Veer et al. 2002), and the combination of these two are the data used in van de Vijver et al. (van de Vijver et al. 2002). Each sample was column-standardized and then performed DWD to combine the 249 SSP samples (306 intrinsic genes) with the 220 samples from Chang et al. and the 16 UNC additional test set samples. Next, each sample's correlation to each centroid was calculated using a Spearman correlation and a sample was assigned to the centroid it was closest to, and the test set was then split into a local only therapy test set, and a tamoxifen-treated test set. Finally, the SSP was applied to the 105 sample original training set after DWD normalization.


Survival analyses. Univariate Kaplan-Meier analysis using a log-rank test was performed using WinSTAT for excel (R. Fitch Software). Standard clinical pathological parameters of age (in decades), node status (positive vs. negative), tumor size (categorical variable of T1-T4), grade (I vs. II and I vs. III), and ER status (positive vs. negative) were tested for differences in RFS, OS and DSS using a proportional hazards regression model. The likelihood ratio test was used to test for equality of the hazard functions among the intrinsic classes after adjusting for the covariates listed above. For the intrinsic subtype analyses, the coding was such that Lum A was the reference group to which the other classes were compared. SAS (SAS Institute Inc., SAS/STAT User's Guide, Version 8, 1999, Cary, N.C.) was used for proportional hazards modeling.


Immunohistochemistry. Five micron sections from formalin-fixed, paraffin-embedded tumors were cut and mounted onto Probe On Plus slides (Fisher Scientific). Following deparaffinization in xylene, slides were rehydrated through a graded series of alcohol and placed in running water. Endogenous peroxidase activity was blocked with 3% hydrogen peroxidase and methanol. Samples were steamed for antigen retrieval with 10 mM citrate buffer (pH 6.0) for 30 min. Following protein block, slides were incubated with biotinylated antibody for the Androgen Receptor (Zymed, 08-1292) and incubated with streptavidin conjugated HRP using Vectastain ABC kit protocol (Vector Laboratories). 3,3′-diaminobenzidine tetrahydrochloride (DAB) chromogen (the substrate) was used for the visualization of the antibody/enzyme complex. Slides were counterstained with hematoxylin (Biomedax-M1O) and examined by light microscopy.


b) Results


Overview. The goals were to create a new breast tumor intrinsic list and validate this list using multiple test data sets so that new biology could be identified, and the clinical significance of “intrinsic” classifications shown. A new intrinsic list was created using paired samples that were similarly treated (note that these were different “intrinsic” pairs than previously used since they were not before and after therapy pairs). In deriving the “new” list microarrays containing many more thousands of genes than was used before were used. A diagram representing the flow of data sets used here, and the different analysis methods, is presented in FIG. 11. First, a new 1300 gene “Intrinsic/UNC” list was created using 26 paired samples and a “training set” of 105 patients. Second, a large “combined test set” of 315 samples was created by combining three publicly available data sets. A reduced version of the Intrinsic/UNC gene set (reduced to an overlapping set of 306 genes) was applied onto this pure test set and show significance in a multivariate analysis. Finally, using the “combined test set”, a Single Sample Predictor (SSP) was created from the subtype average profiles (i.e. centroids) and assign subtype designation onto three “additional test sets”. Thus, the “combined test set” becomes the training set for the SSP, which is then used to predict subtype, and ultimately outcome, on the “additional test sets”.


Identification of the Intrinsic/UNC gene set. A new breast tumor intrinsic gene set was created, called the “Intrinsic/UNC” list, using 26 paired samples comprised of 15 paired primary tumors that were different physical pieces (and RNA preparations) of the same tumor, 9 primary tumor-metastasis pairs, and 2 normal breast sample pairs. In total, 105 biologically diverse breast tumor specimens and 9 normal breast samples (146 microarrays, see Table 11) were assayed on Agilent oligo DNA microarrays representing 17,000 genes (GEO accession number GSE 1992). This intrinsic analysis identified 1410 microarray elements that represented 1300 genes. When this new gene list was used in a two-way hierarchical clustering analysis on the training set (FIG. 12), the experimental sample dendrogram (FIG. 12B) showed four groups corresponding to the previously defined HER2+/ER−, Basal-like, Luminal and Normal Breast-like groups (Perou et al. 2000). AU 26 tumor pairs were paired in this clustering analysis, including the 5 primary tumor-local metastasis pairs and the 4 distant metastasis pairs (FIG. 12); thus, the individual portraits of tumors are maintained even in their metastasis samples (Weigelt et al. 2003).


The biology of the intrinsic subtypes is rich and extensive, and the current analysis identified new biologically important features. A HER2+ expression cluster was observed that contained genes from the 17ql 1 amplicon including HER2/ERBB2 and GRB7 (FIG. 12D). The HER2+ expression subtype (pink dendrogram branch in FIG. 12B) was predominantly ER− negative (i.e. HER2+/ER−), but showed expression of the Androgen Receptor (AR) gene. To determine if this finding extended to the protein level, immunohistochemistry for AR was performed, and it was confirmed that the HER2+/ER− and many Luminal tumors, expressed AR at moderate to high levels (FIG. 13); in some cases, high nuclear expression was observed (FIG. 13B).


A Basal-like expression cluster was also present and contained genes characteristic of basal epithelial cells such as SOX9, CK17, c-KIT, FOXC1 and P-Cadherin (FIG. 13E). These analyses extend the Basal-like expression profile to contain four Kallikrein genes (KLK5-8), which are a family of serine proteases that have diverse functions and proven utility as biomarkers (e.g. KLK3/PSA); however, it should be noted that KLK3/PSA was not part of the basal profile. Finally, a Luminal/ER+ cluster was present and contained ER, XBP1, FOXA1 and GAT A3 (FIG. 12C). GATA3 has recently been shown to be somatically mutated in some ER+ breast tumors (Usary et al. 2004), and some of the genes in FIG. 12C are GAT A3-regulated (FOXA1, TFF3 and AGR2). In addition, the Luminal/ER+ cluster contained many new biologically relevant genes such as AR (FIG. 12C), FBP1 (a key enzyme in gluconeogenesis pathway) and BCMP11.


The subtype defining genes from this analysis showed similarity to the previous breast tumor intrinsic lists (i.e. Intrinsic/Stanford) described in (Perou et al. 2000; Sorlie et al. 2003), except there was a significant increase in gene numbers likely due to the increased number of genes present on the current microarrays, and another significant difference was that the new Intrinsic/UNC list contained a large proliferation signature (FIG. 12F) (Perou et al. 1999; Chung et al. 2002; Whitfield et al. 2002). The inclusion of proliferation genes in the Intrinsic/UNC gene set, but not in the previous Intrinsic/Stanford lists, is likely due to the fact that the Intrinsic/Stanford lists were based upon before and after chemotherapy paired samples of the same tumor, while the Intrinsic/UNC list was based upon identically treated paired samples. This finding suggests that tumor cell proliferation rates did vary before and after chemotherapy, and that proliferation is a reproducible feature of a tumor's expression profile. Thus, the new Intrinsic/UNC list likely encompasses most features of the previous lists, adds new genes to each subtype's defining gene set and adds a biological and clinically relevant feature that is the proliferation signature.


Combined test set analysis. Another difference between the intrinsic subtypes found in the 105 sample training data set versus those presented in Sorlie et al. 2001 and 2003 (Sorlie et al. 2001; Sorlie et al. 2003), was that the training set did not have a clear Luminal B (LumB) group as determined by hierarchical clustering analysis. The lack of a LumB group in the training set cluster analysis could be due to few LumB tumors being present in this data set, an artifact of the clustering analysis, or the lack of LumB defining genes in the Intrinsic/UNC gene list. To address this question, a “combined test set” of 315 breast samples was made (311 tumors and 4 normal breast samples) that was a single data set created by combining together the data from Sorlie et al. 2001 and 2003 (cDNA microarrays), van't Veer et al. 2002 (custom Agilent oligo microarrays) and Sotiriou et al. 2003 (cDNA microarrays).


A single data table of these three sets was created by first identifying the common genes present across all four microarray data sets (2800 genes). Next, Distance Weighted Discrimination (DWD) was used to combine these three data sets together (Benito et al. 2004); DWD is a multivariate analysis tool that is able to identify systematic biases present in separate data sets and then make a global adjustment to compensate for these biases. Finally, it was determined that 306 of the 1300 unique Intrinsic/UNC genes were present in the combined test set. FIG. 14 shows the 315 sample combined test set and the 306 Intrinsic/UNC genes in a two-way hierarchical cluster analysis (see Supplementary FIG. 12 for the complete cluster diagram). As expected, this analysis identified the same expression patterns seen in FIG. 12 and more. For example, there was a Luminal/ER+ cluster containing ER, GATA3 and GAT A3-regulated genes (FIG. 14C), a HER2+ cluster (FIG. 15D), a Basal-like cluster (FIG. 14F) and a prominent proliferation signature (FIG. 14). The sample-associated dendrogram (FIG. 14B) showed the major subtypes seen in Sorlie et al. 2003 including a LumB group, and a potential new tumor group (Luminal T) characterized by the high expression of Interferon (IFN)-regulated genes (FIG. 14E). The IFN-regulated cluster contained STAT1, which is likely the transcription factor that regulates expression of these IFN-regulated genes (Bromberg et al. 1996; Matikainen et al. 1999). The IFN cluster was one of the first expression patterns to be identified in breast tumors (Perou et al. 1999), and since has been linked to positive lymph node metastasis status and a poor prognosis (Huang et al. 2003; Chung et al. 2004). The effectiveness of the DWD normalization is evident upon close examination of the sample associated dendrogram, which shows that every subtype is populated by samples from each data set (i.e. significant inter-data set mixing).


Even though there was limited overlap between the new Intrinsic/UNC list and the Intrinsic/Stanford list of Sorlie et al. 2003 (108 genes in common), there was high agreement in sample classification. For example, it was found 85% concordance in subtype assignments for the 416 tumor data set (combined samples from training and combined test set) that were analyzed independently using the Intrinsic/Stanford and Intrinsic/UNC lists, and both lists showed significance in univariate survival analyses (data not shown). This analysis suggests that, even though the exact constituent genes may vary, the different lists are tracking the same phenotypes and the same “portraits” are seen. However, since the Intrinsic/UNC list contained many more genes and a biologically relevant pattern of expression not seen in the Intrinsic/Stanford lists (i.e. proliferation signature), therefore, it can be more biologically representative of breast tumors. The Intrinsic/UNC list can also be more valuable because it provides a larger number of genes for performing across data set analyses and thus, classifications made across different platforms are less susceptible to artifactual groupings as a result of gene attrition.


Multivariate analyses. In the training set and combined test set, the standard clinical parameters of ER status, node status, grade, and tumor size were all significant predictors of Relapse-Free Survival (RFS, where an event is either a recurrence or death) using univariate Kaplan-Meier analysis (FIG. 15 for combined test set analysis). In addition, the Intrinsic/UNC gene set identified tumor groups/subtypes that were predictive of RFS on both the training (FIG. 16A) and combined test set (FIG. 16B). As before, the Luminal group had the best outcome and the HER2+/ER− and Basal-like groups had the worst. The Intrinsic/UNC gene list was also predictive of Overall Survival (OS) on the training and combined test set. As previously seen, patients of the LumB classification showed worse outcomes that LumA, despite being clinically ER+ tumors (FIG. 16B). Finally, the new class of Luml showed similar outcomes to LumB, and both showed elevated proliferation rates when compared to LumA tumors (FIG. 14G).


When the five standard clinical parameters were tested on the 315 sample combined test set using a proportional hazards regression model and RFS, OS or Disease-Specific Survival (DSS) as endpoints, tumor size, grade and ER status were the significant predictors with node status being close to significant (p=0.06-0.07); however, node status was still prognostic in a univariate analysis (FIG. 15B). The next objective was to test for differences in survival among the intrinsic subtypes on the combined test set after adjusting for the clinical covariates of age, ER, node status, grade and tumor size. The approach used was a proportional hazards regression model for RFS (or time to distant metastasis for the van't Veer et al. samples), OS and DSS (which was limited to the Sorlie et al. and Sotiriou et al. data sets). P-values of 0.05 (RFS), 0.009 (OS) and 0.04 (DSS) were obtained when the intrinsic subtypes were tested in a model that included the clinical covariates, which showed that the classifications have significantly different hazard functions, and thus, different survival curves after taking into account (or adjusting for) the effects of age, node status, size, grade, and ER status (Table 11, example for RFS). In this analysis, the Basal-like, LumB and HER2+/ER− subtypes were significantly different from the LumA group (the reference group), while Luml was not. Similar findings were also obtained for the other endpoints except for the LumB subtype, which was not significantly different from LumA in OS (p=0.36) or DSS (p=0.08).


Single Sample Predictions using three additional test sets. A major limitation of using hierarchical clustering as a classifications tool, is its' dependence upon the sample/gene set used for the analysis (Simon et al. 2003). That is, new samples cannot be analyzed prospectively by simply adding them to an existing dataset because it may alter the initial classification of a few previous samples. If an assay is going to be used in the clinical setting, it must be robust and unchanging. To address this concern, a Single Sample Predictor (SSP) was developed using the “combined test set” and its 306 Intrinsic/UNC genes (See FIG. 11); the SSP is based upon “Subtype Mean Centroids” and a nearest centroid predictor (Hastie et al. 2001) (see Methods). For the SSP, an intrinsic subtype average profile (centroid) was created for each subtype using the combined test set presented in FIG. 14, and then a new sample is individually compared to each centroid and assigned to the subtype/centroid that it is the most similar to using Spearman correlation. Using this method, an intrinsic subtype can be assigned to any sample, from any data set, one at a time.


Using the combined test set, five centroids representing the LumA, LumB, Basal-like, HER2+/ER− and Normal Breast-like groups were created). The SSP was tested on three “additional test sets”, the first of which was the Ma et al. data set of ER+ patients that were homogenously treated with tamoxifen (Ma et al. 2004). Using the 60 whole tissue samples of Ma et al., the SSP called 2 Basal-like, 2 HER2+/ER−, 12 Normal Breast-like, 34 LumA, and 9 LumB. Since this patient set had RFS data, the SSP classifications were tested in terms of outcomes (the 2 Basal-like and 2 HER2+/ER− samples by SSP analysis were excluded). The SSP assignments were a significant predictor for this group of adjuvant tamoxifen treated patients (p=0.04, FIG. 16C).


Next, the SSP was applied onto a 96 sample test set of local only (surgery) treated patients from Chang et al. (Chang et al. 2005), which showed highly significant results (FIG. 16D, p=0.0006). The final additional test set analyzed was a second adjuvant tamoxifen-treated patient set created by combining similarly treated patients from Chang et al. 2005 plus 16 patients from UNC (which were not included within the 105 patient training data set); for the 45 patient tamoxifen treated data set #2, the SSP called 3 Normal-like, 2 Basal-like and 2 HER2+/ER−, and these samples were excluded from the survival analyses. Again, the SSP-based assignments were a statistically significant predictor of outcomes (FIG. 16E for tamoxifen-treated set #2, p=0.02). Finally, if the SSP was applied back onto the original training data set of 105 samples, it was noted that 17 tumors were called LumB (FIG. 12) and that the survival analysis showed that these tumors did show a poor outcome (FIG. 16F, p=0.02). Thus, the SSP that was based upon hundreds of samples, was able to define clinically relevant distinctions that the hierarchical clustering analysis of 105 samples missed, which further demonstrates the usefulness and objectivity of the SSP.


c) Discussion


This study identified a number of new biologically relevant “intrinsic” features of breast tumors and methods that are important for the microarray community. These new biological features include the 1) demonstration that proliferation is a stable and intrinsic feature of breast tumors, 2) identification of the high expression of many Kallikrein genes in Basal-like tumors, and 3) demonstration that there are multiple types of “HER2-positive” tumors; the HER2-positive tumors falling into the “HER2+/ER−” intrinsic subtype were also shown to associate with the expression of the Androgen Receptor, while those not falling into this group were present in the LumB or LumI subtypes and usually showed better outcomes. relative to the HER2+/ER− tumors. Recent studies in prostate cancer have shown that HER2 signaling enhances AR signaling under low androgen levels (Mellinghoff et al. 2004). When this finding is coupled to the observation that some HER2+/ER− tumors showed nuclear AR expression (FIG. 13B), this suggests that active AR signaling maybe occurring and that anti-androgen therapy can be helpful in these HER2+ (i.e. amplified) and AR+ patients.


Microarray studies are often criticized for a lack of reproducibility and limited validation due to small sample sizes (Simon et al. 2003; Ioannidis 2005). By using DWD, multiple microarray data sets have been combined together to create a single and large combined test set, and it has been shown that the same “intrinsic” patterns can be identified in different data sets in a coordinated analysis, even though entirely different patient populations were investigated on different microarray platforms. The analysis of the 315 sample combined test set showed that the “intrinsic” subtypes based upon the Intrinsic/UNC list, were independent prognostic variables, and thus, were providing new clinical information.


To be of routine clinical use, a gene expression-based test must be based upon an unchanging assay that is capable of making a prediction on a single sample. Therefore, a Single Sample Predictor/SSP was created that was able to classify samples from three additional test sets of similarly treated patients. In particular, the new Intrinsic/UNC list and the SSP, recapitulated the finding that the intrinsic subtypes are truly prognostic on a test set of local only treated patients (FIG. 16D), and it was shown on two additional test sets that LumB patient fair worse than LumA patient in the presence of tamoxifen (FIGS. 16C and 16E). It should be noted that the distinction of LumA versus LumB closely minors the “Recurrence Score” predictor of Paik et al. (Paik et al. 2004), where outcome predictions for tamoxifen-treated ER+ tumors were stratified based mostly on the expression of genes in the HER2-amplicon (HER2 and GRB7), genes of proliferation (STK15 and MYBL2), and genes associated with positive ER status (ESR1 and BCL2). In essence, high expression of HER2-amplicon and/or proliferation genes gives a high Recurrence Score (and correlates with LumB because most HER2+ and ER+ tumors fall into this subtype), while low expression of these genes and high expression of ER status genes gives a low Recurrence Score (and correlates with LumA).


This data shows that the breast tumor intrinsic subtypes identified using the Intrinsic/UNC gene list can be generalized to many different patient sets, both treated and untreated. The intrinsic portraits of breast tumors are recognizable patterns of expression that are of biological and clinical value, and the SSP-based classification tool represents an unchanging predictor to be used for individualized medicine.


3. Example 3
Agreement in Breast Cancer Classification Between Microarray and qRT-PCR from Fresh-Frozen and Formalin-Fixed Paraffin-Embedded Tissues

Microarray analyses of breast cancers have identified different biological groups that are important for prognosis and treatment. In order to transition these classifications into the clinical laboratory, a real-time quantitative (q)RT-PCR assay has been developed for profiling breast tumors from formalin-fixed paraffin-embedded (FFPE) tissues and evaluate its performance relative to fresh-frozen (FF) RNA samples.


Micro array data from 124 breast samples were used as a training set for classifying tumors into four different previously defined molecular subtypes of Luminal, HER2+/ER−, Basal-like, and Normal-like. Sample class predictors were developed from hierarchical clustering of microarray data using two different centroid-based algorithms: Prediction Analysis of Microarray and a Single Sample Predictor. The training set data was applied to predicting sample class on an independent test set of 35 breast tumors procured as both fresh-frozen and formalin-fixed, paraffin embedded tissues (70 samples). Classification of the test set samples was determined from microarray data using a large 1300 gene set, and using a minimized version of this gene list (40 genes). The minimized gene set was also used in a real-time qRT-PCR assay to predict sample subtype from the fresh-frozen and formalin-fixed, paraffin embedded tissues. Agreement between primer set performance on fresh-frozen and formalin-fixed, paraffin embedded tissues was evaluated using diagonal bias, diagonal correlation, diagonal standard deviation, concordance correlation coefficient, and subtype assignment.


The centroid-based algorithms (Prediction Analysis of Microarray and Single Sample Predictor) had complete agreement in classification from formalin-fixed, paraffin-embedded tissues using qRT-PCR and the minimized ‘intrinsic’ gene set (40 classifiers). There was 94% (33/35) concordance between the diagnostic algorithms when comparing subtype classification from fresh-frozen tissue using microarray (large and minimized gene set) and qRT-PCR data. By qRT-PCR alone, there was 97% (34/35) concordance between fresh-frozen and formalin-fixed, paraffin embedded tissues using Prediction Analysis of Microarray and 91% (32/35) concordance using Single Sample Predictor. Finally, we used several analytical techniques to assess primer set performance between fresh-frozen and formalin-fixed, paraffin-embedded tissues and found that the ratio of the diagonal standard deviation to the dynamic range was the best method for assessing agreement on a gene-by-gene basis.


Determining agreement in classification between platforms and procurement methods requires a variety of methods. It has been shown that centroid-based algorithms are robust classifiers for breast cancer subtype assignment across platforms (microarray and qRT-PCR data) and procurement conditions (fresh-frozen and formalin-fixed, paraffin-embedded tissues). In addition, the standard deviation, dynamic range, and concordance correlation coefficient are important parameters to assess individual primer set performance across procurement methods. The strategy for primer set validation and classification have applications in routine clinical practice for stratifying breast cancers and other tumor types.


Expression-based classifications are important for determining risk of relapse and making treatment decisions in breast cancer (Fan et a. N Engl J Med 2006, 355:560-569; Paik et al. N Engl J Med 2004, 351:2817-2826; Perou et al. Nature 2000, 406:747-752; van't Veer et al. Nature 2002, 415:530-536). Classifications are often developed using microarray data and then further validated on the same or different platforms using minimized gene sets. For instance, van't Veer and van de Vijver used microarray data in training and test sets to validate a 70-gene signature that predicts relapse in early stage ER-positive and ER-negative tumors (van't Veer et al. Nature 2002, 415:530-536; van de Vijver et al. N Engl J Med 2002, 347:1999-2009). In addition, Paik et al developed a 16-gene classifier that predicts relapse in ER-positive tumors using qRT-PCR on formalin-fixed, paraffin embedded (FFPE) tissues. Furthermore, Perou and Sorlie showed that hierarchical clustering of microarray data separates breast tumors into different ‘biological’ subtypes (Luminal, HER2+/ER−, Basal-like, and Normal-like) and that these subtypes are prognostic (Sorlie et al. Proc Natl Acad Sci USA 2001, 98:10869-10874). The biological classification has been validated on multiple patient cohorts using cross-platform microarray analyses and qRT-PCR (Hu et al. BMC Genomics 2006, 7:96; Perreard et al. Breast Cancer Res 2006, 8:R23; Sorlie et al. Proc Natl Acad Sci USA 2003; 100:8418-8423).


Although there are few genes in common between those used to determine the biological subtypes and those used in other classifications for breast cancer prognosis, the different tests identify similar properties that predict tumor behavior (Fan et al. N Engl J Med 2006, 355:560-569). A major difference between the classification for biological subtypes and other classifications for breast cancer is that it is based on hierarchical clustering. The unsupervised nature of hierarchical clustering is effective for discovery (Eisen et al. Proc Natl Acad Sci USA 1998, 95:14863-14868), but it is not suitable for predicting a new sample's class since dendrogram associations can change when new data is introduced. However, it is possible to classify samples within the framework of hierarchical clustering using centroid-based methods (Tibshirani et al. Proc Natl Acad Sci USA 2002, 99:6567-6572; Bair et al. PLoS Biol 2004, 2:E108; Bullinger et al. N Engl J Med 2004, 350:1605-1616). For instance, Tibshirani et al has shown that the nearest shrunken centroid method, used in Prediction Analysis of Microarray (PAM), can classify samples as accurately as statistical approaches like artificial neural networks. In addition, Hu et al employed another simple centroid method called Single Sample Predictor (SSP) to classify subtypes of breast cancer (Hu et al. 2006).


a) Materials and Methods


(1) Tissue Procurement and Processing


All tissues and data used in this study were collected and handled in compliance with federal and institutional guidelines. Breast samples received in pathology were flash frozen in liquid nitrogen and stored at −80° C. Samples were procured at the University of North Carolina at Chapel Hill, Thomas Jefferson University, University of Chicago, and University of Utah. The 159 breast samples analyzed included a 124-sample microarray training set and a 35-sample test set profiled by microarray and real-time qRT-PCR (FF and FFPE). Total RNA from FF samples was isolated using the RNeasy Midi Kit (Qiagen, Valencia, Calif.) and treated on-column with DNase Ito eliminate contaminating DNA. The RNA was stored at −80° C. until used for cDNA synthesis.


Each FF sample in the test set was compared to the clinical FFPE tissue block. An H&E slide was used to confirm the presence of >50% tumor and 20 micron cuts were prepared using a microtome. Tissue blocks were 1-5 years in age (i.e. early age FFPE). The FFPE cut was de-paraffinized in Hemo-De (Scientific Safety Solvents) and washed with 100% ethanol. Total RNA was isolated using the High Pure RNA Paraffin Kit (Roche Molecular Biochemicals, Mannheim, Germany). Manufacturer's instructions were followed for RNA extraction except that the reagents were increased 2-fold for the first proteinase K digestion. Samples were treated with TURBO DNA-free (Ambion, #1906) and stored at −80° C. until cDNA synthesis.


(2) First-Strand cDNA Synthesis


cDNA synthesis for each sample was performed in 40 μl total volume reaction containing 600 ng total RNA. Total RNA was first mixed with 2 μl gene specific cocktail containing 55 primers (each anti-sense primer at 1 pmol/μl) and 2 μl 10 nM dNTP Mix (10 mM each dATP, dGTP, dCTP, dTTP at pH7). Reagents were heated at 65° C. for 5 minutes in a PTC-100 Thermal Cycler (MJ Research, Inc.) and briefly centrifuged. The following reagents were added to each tube: 8 μl 5× First-Strand Buffer, 2 μl 0.1M DTT, 2 μl RNase Out


(Invitrogen), and 2 μl Superscript DI polymerase (200 units/μl). The reaction was thoroughly mixed by pipetting and incubated at 55° C. for 45 minutes followed by 15 minutes at 70° C. for enzyme inactivation. Following cDNA synthesis, samples were purified with the QIAquick PCR Purification Kit (Qiagen, Valencia, Calif.). Samples were adjusted to a final concentration of 1.25 ng/μl cDNA with TE (10 mM Tris-HCl, pH 8.0, 0.1 mM EDTA).


(3) Primer Design and Optimization


Primers were designed using Roche LightCycler Probe Design Software 2.0. Reference gene sequences were obtained through NCBI LocusLink and optimal primer sites were found with the aid of Evidence Viewer (http://www.ncbi.nlm.nih.gov). Primers sets were selected to avoid known insertions/deletions and mismatches while including all isoforms possible. Amplicons were limited to 60-100 bp in length due to the degraded condition of the FFPE mRNA. When possible, RNA specific amplicons were localized between exons spanning large introns (>1 kb). Finally, NCBI BLAST was used to verify gene target specificity of each primer set. Primer sequences are presented in Table 1. Primers were synthesized by Operon, Inc. (Huntsville, Ala.), re-suspended in TE to a final concentration of 60 uM, and stored at −80° C. Each new FFPE primer set was assessed for performance through qRT-PCR runs with three serial 10-fold dilutions of reference cDNA in duplicate and two no template control reactions. Primers were verified for use when they fulfilled the following criteria: 1) target Cp<30 in 10 ng reference cDNA; 2) PCR efficiency>1.75; 3) no primer-dimers in presence of template as determined through post amplification melting curve analysis; and 4) no primer-dimers in negative template control before cycle 40.


(4) Real-Time Quantitative (q)RT-PCR


PCR amplification was carried out on the Roche LightCycler 2.0. Each reaction contained 2 μl cDNA (2.5 ng) and 18 μl of PCR master mix with the following final concentration of reagents: 1 U Platinum Taq, 5 OmM Tris-HCl (pH 9.1), 1.6 mM (NH4)2SO4, 0.4 mg/μl BSA, 4 mM MgCl2, 0.2 mM dATP, 0.2 mM dCTP, 0.2 mM dGTP, 0.6 mM dUTP, 1/40000 dilution of SYBR Green I dye (Molecular Probes, Eugene, Oreg., USA), and 0.4 μM of both forward and reverse primers for the selected target. The PCR was done with an initial denaturation step at 94° C. for 90 s and then 50 cycles of denaturation (94° C., 3 s), annealing (58° C., 6 s), and extension (72° C., 6 s). Fluorescence acquisition (530 nm) was taken once each cycle at the end of the extension phase. After PCR, a post-amplification melting curve program was initiated by heating to 94° C. for 15 s, cooling to 58° C. for 15 seconds, and slowly increasing the temperature (0.1° C./s) to 95° C. while continuously measuring fluorescence.


Each PCR run contained a no template control, a calibrator reference in triplicate, and each sample in duplicate. The calibrator reference sample was comprised of 3 breast cancer cell lines (MCF7, SKBR3, and ME16C2) and Stratagene Universal Human Reference RNA (Stratagene, La Jolla, Calif., USA) represented in equal parts. The crossing point (Cp) for each reaction was automatically calculated by the Roche LightCycler Software 4.0. Relative quantification was done by importing an external efficiency curve (Eff=1.89) and setting the calibrator at 10 ng for each gene. In order to correct for differences in sample quality and cDNA input, copy numbers were adjusted to the arithmetic mean of 5 ‘housekeeper’ genes (ACTB, PSMC4, PUM1, MRPL19, SF3A1). Values from replicate samples were averaged and data was Iog2 transformed.


(5) Microarray


AU samples were analyzed by DNA microarray (Agilent Human Al, Agilent Human A2, and Agilent custom oligonucleotide microarrays). Labeling and hybridization of RNA for microarray analysis were performed using the Agilent low RNA input linear amplification kit (http://www.chem.agilent.com/Scripts/PDS.asp?1Page:=10003) as described in Hu et al (Hu et al. Biotechniques 2005, 38:121-124). Each sample was assayed versus a common reference that was a mixture of Stratagene's Human Universal Reference total RNA (Stratagene, La Jolla, Calif., USA) enriched with equal amounts of RNA from the MCF7 and ME16C cell lines. Microarray hybridizations were carried out on Agilent Human oligonucleotide microarrays using 2 μg Cy3-labeled ‘reference’ sample and 2 μg Cy5-labeled ‘experimental’ sample.


All microarrays were scanned using an Axon Scanner 4000B (Axon Instruments Inc, Foster City, Calif., USA). The image files were analyzed with GenePix Pro 4.1 (Axon Instruments) and were uploaded into the UNC Microarray Database at the University of North Carolina at Chapel Hill (https://genome.unc.edu/), where a Lowess normalization procedure was performed to adjust the Cy3 and Cy5 channels (Yang et al. Nucleic Acids Res 2002′ 30:e15).


(6) Clinical Lnmunohistochemistry and PCR


Samples were scored for protein expression at the time of diagnosis using standard operating procedures established at each institution. Greater than 10% positive staining nuclei was considered positive for the ER and PR. Staining and scoring criteria for HER2 were carried out according to the Dako HercepTest™ (Dako, Carpinteria, Calif., USA). Quantitative PCR, used to determine DNA copy number of the ERBB2 gene, was done using a clinical assay from ARUP Laboratories Inc (cat #00049390, Salt Lake City, Utah, USA).


(7) Selecting Genes for Real-Time qRT-PCR


The real-time qRT-PCR assay consisted of 5 housekeeper genes (Szabo et al. Genome Biol 2004, 5:R59), 5 proliferation genes for risk stratification of the Luminal (ER-positive) tumors, and 40 ‘intrinsic’ genes important for distinguishing biological subtypes of breast cancer. The minimal 40 ‘intrinsic’ classifiers were statistically selected from a larger 1300 ‘intrinsic’ gene set previously reported in Hu et al (2006). The larger gene set was minimized as described in Perreard et al (2006). Briefly, a semi-supervised classification method was used in which samples are hierarchical clustered and assigned subtypes based on the sample-associated dendrogram. Samples were designated as Luminal, HER2+/ER−, Basal-like, or Normal-like. The best class distinguishers were identified according to the ratio of between-group to within-group sums of squares. A 10-fold cross-validation was performed using a nearest centroid classifier and testing overlapping gene sets of varying sizes. The smallest gene set which provided the highest class prediction accuracy when compared to the classifications made by the complete microarray-based intrinsic gene set was selected.


(8) Assessing qRT-PCR Agreement Between FF and FFPE Tissues


Thirty-five matched FF and FFPE samples (70 samples total) were analyzed by qRT-PCR using the same primer sets. Agreement in the quantitative data was determined using diagonal bias (m), diagonal spread (d), diagonal standard deviation (dsd), diagonal correlation (rd), and concordance correlation coefficient (CCC).


In diagonal bias, a best fitting line parallel to the diagonal (slope equals 1) is made from a plot of the qRT-PCR data (FF versus FFPE). Numerically, if (x, . . . , y{), i=1, . . . , n denote the measurement pairs then the best fitting line parallel to the diagonal is given by the expression:






y=x+ yx


where x and y denote the sample means of the x and y measurements, respectively.


Then diagonal bias is calculated as:






m
=



y
_

-

x
_



2






The diagonal standard deviation was calculated as follows:







s
d

=






i
=
1

n



d
i
2



n
-
1







Let d represent:






d
=





i
=
1

n



d
i


n





Diagonal correlation was used to determine the spread of points around the diagonal line:







r
d

=


2


Cov


(

X
,
Y

)





Var


(
X
)


+

Var


(
Y
)








This method does not provide information about the extent of deviation but allows measurements with different units to be compared. Further, if we let p denote the correlation coefficient and Oχ and Oγ the respective standard deviations, then







r
d

=

ρ


2



σ
X


σ
Y


+


σ
Y


σ
X









That is, the diagonal correlation penalizes the correction coefficient if there is a scale shift (σx≠σγ). The combined effect of the bias and scale shift was measured using the concordance correlation coefficient (CCC) proposed by Lin et al (Lin et al. Biometrics 1989, 45:255-268):







C





C





C

=


2


Cov


(

X
,
Y

)





Var


(
X
)


+

Var


(
Y
)


+


(


Y
_

-

X
_


)

2







(9) Assessing Agreement Between Microarray and qRT-PCR for Classification.


A breast cancer subtype predictor was developed in PAM (http://www-stat.Stanford.edu/˜tibs/P AMA and SSP using 124 breast samples and the ‘intrinsic’ gene set identified in Hu et al (2006). The training set contained representative samples of Luminal (64 samples), HER2+/ER− (23 samples), Basal-like (28 samples), and Normal-like (9 samples) subtypes. Classification of an independent test set (35 matched FF and FFPE samples) was done using a large (1300 genes) and minimized (40 genes) version of the ‘intrinsic’ set. Subtypes were assigned based on Spearman correlation to the centroid. The qRT-PCR data from the test set was merged with the microarray data of the training set prior to classification using distance weighted discrimination (Benito et al. Bioinformatics 2004, 20:105-1.14). The gold standard for classification of the training and test samples was based on FF tissue RNA and using the classifications obtained when performing hierarchical clustering analysis using the 1300 gene intrinsic gene set from microarray data,


b) Results


(1) Assessment of qRT-PCR Primer Set Performance by Comparing Agreement Between FF and FFPE Tissues.


The data set of 35 matched FF and FFPE tissues (70 samples) was evaluated for 50 genes using the same PCR conditions. Agreement between FF and FFPE tissues was assessed for diagonal bias (m), diagonal correlation (rd) diagonal standard deviation (dsd), and concordance correlation coefficient (ccc). An agreement plot between FF and FFPE for the estrogen receptor gene (ESR1) was produced after normalization to the 5 housekeepers. The large dynamic range of ESR1 expression provides clear separation of the tumors from both FF and FFPE. ESR1 alone measured from FF tissue has very high sensitivity and specificity using ER status by IHC as the gold standard (Perreard 2006).


For each gene, the agreement between FF and FFPE was analyzed using the raw data, housekeeper normalized data, and DWD adjusted normalized data. Scatter plots are provided in FIGS. 20-23 and values are presented in Table 14. The line graphs in 19 show the effects at each step of data processing. The raw (pre-normalized) data shows a negative bias for all genes likely due to lower RNA quality in the FFPE tissue (FIG. 19A). Much of the bias was corrected by normalization to the ‘housekeeper’ genes and using DWD adjustment. As expected, DWD had a significant effect on bias (m) but did not effect other measurements of agreement (FIG. 19B-D).


The median biases for the un-normalized, housekeeper normalized, and DWD adjusted normalized data were −1.5 (−3.1 to −0.033), 0.58 (−1.1 to 2) and 0.24 (−0.3 to 1.3), respectively. Normalization to the housekeeper genes had a relatively modest effect on the diagonal standard deviation with a change in the median from 1.1 (0.76-2) to 0.81 (0.38-1.8). While most genes had a similar standard deviation (e.g. ESR1) after applying the housekeepers, other genes such as C10orf7 and COX6C had nearly a 3-fold reduction in standard deviation after normalization.


In general, genes with the highest diagonal correlation between FF and FFPE also had the largest dynamic range in expression (e.g., ESR1, TFF3, COX6C, and FBP1). Housekeeper genes and other genes with low variability in expression (IGBP1) had the lowest diagonal correlation since they form more of a cloud than a line around the diagonal. The housekeeper genes all had high agreement in terms of having low variability in expression across samples in the FF and FFPE tissues.


The concordance correlation coefficient (CCC) considers both bias and scale shift when determining agreement. The median concordance correlation coefficient between FF and FFPE for the raw data of the 45 genes (housekeepers excluded) was 0.28. Normalization to housekeepers raised the CCC median to 0.48, and adjusting with DWD brought the median to 0.61. Only 27% of the genes had a CCC value greater than 0.5, whereas 47% of the genes were above that value in the normalized data, and 76% were above that when using DWD adjusted normalized data. A comparison of the CCC value to the ratio of the diagonal standard deviation over the dynamic range identified many of the same primer sets as good (or poor) performers from the FFPE derived samples.


(2) Breast Cancer Subtype Classification of Test Set Using PAM and SSP.


Hierarchical clustering of the 124 sample training set using the “intrinsic” gene set identified in Hu et al shows 4 distinct classes representing Luminal, HER2+/ER−, Basal-like, and Normal-like. Centroid classifiers were developed from the microarray expression data using PAM and SSP (Hu et al. 2006, Tibshirani et al. 2002). Class predictions were made on the test set using microarray (large and minimized ‘intrinsic’ sets) and qRT-PCR data (15). Each individual microarray (large and minimized) and PCR datasets were DWD merged with the training set prior to subtype class prediction.


Agreement in Classification Between Large and Minimized Microarray Gene Sets.


Thirty-three out of 35 (94%) samples classified the same between PAM and SSP when using the large ‘intrinsic’ microarray dataset for classification. In both discrepant cases, IHC data agreed with the PAM classification. There was the same agreement (94%) when performing the analysis with the minimized version of the microarray data. Interestingly, there was one sample that was called HER2+/ER− by both PAM and SSP when using the large microarray dataset, but called Basal-like by both methods when using the minimized microarray dataset. Additional analysis of this sample by quantitative PCR showed no DNA amplification of HER2/ERBB2 amplicon.


Agreement in Classification Between FF and FFPE.


By qRT-PCR, there was 97% (34/35) concordance between FF and FFPE using PAM, and 91% (32/35) concordance using SSP. There was 94% (33/35) concordance between the diagnostic algorithms from FF tissue and complete agreement in classification from FFPE tissue. Since the FFPE samples were obtained from the clinical block, it is likely that there was a higher tumor percentage in those samples than in the matched FF sample, which could affect the agreement. Indeed, 2 out of the 3 discrepancies in classification made by SSP were when the FF tissue sample was called Normal-like (microarray and PCR) and the FFPE sample was called Luminal (PCR). These samples were ER-positive by IHC and likely Luminal. The only discrepancy in PAM was in a sample classified as Normal-like from FF tissue and Luminal from FFPE.


Overall concordance across methods. Overall, PAM diagnosed 33 out of 35 samples (94%) the same across microarray and qRT-PCR, while SSP diagnosed 30 out of 35 samples (86%) the same across platforms and procurement methods. Discrepancies were of several types including Luminal tumors classified as Normal-like, HER2+/ER− tumors classified as Luminal, and Basal-like tumors classified as HER2+/ER−.


c) Discussion


The transition of large-scale microarray experiments into a clinical test requires identifying a minimum set of genes for classification, translating the assay from microarray to qRT-PCR for routine diagnostics, and validating the assay using both FF and FFPE specimen types.


A previous qRT-PCR assay for identifying biological subtypes was based on an intrinsic gene set derived from first generation microarrays that contained 8,100 genes. In comparison, the current intrinsic set was derived from a different microarray platform (cDNA versus Agilent), contained a larger number of genes (427 vs. 1300), and used pre-treatment samples only (Hu et al. 2006. The overlap in the minimized gene set developed here versus the list in Perreard et al. was 14 out of 40, which is not surprising since there were only 108 genes in common between the larger intrinsic gene sets. It has been shown that the new intrinsic gene set reproducibly identifies the same breast cancer subtypes within independent datasets (i.e. pure training and test sets), and that the biological classification adds significant clinical information when used in a multivariate Cox analysis.


It has been shown that the centroid-based method called Single Sample Predictor can use microarray data to classify breast cancers into biological subtypes that predict survival in treated and untreated patients (Hu et al. 2006). Here PAM is directly compared to SSP using the large microarray dataset applied in Hu et al, and also tested a minimized version using microarray and qRT-PCR data. Both methods performed well.


This method of classification is considered semi-supervised since data from hierarchical clustering is initially used to develop a centroid or shrunken centroid from a training set and new samples are then classified based on the distance to the centroid. In this way, the training set is not only necessary for initial discovery and validation but the data continues to be used as a reference base for future classification of new samples. Similarly, the Oncotype Dx assay established cut points for risk of relapse from a training set and this classifier rule is applied to new samples to derive a recurrence score.


Determining agreement between methods is a complex issue that requires consideration of several factors before reaching a conclusion. Cronin et al used Pearson correlation to show that the genes with the highest correlation in microarray maintained their association with qRT-PCR. They used short amplicons and control ‘housekeeper’ genes in the qRT-PCR assay to correct biases between FF and FFPE tissues. Although correlation provides information about the linearity and slope (positive or negative correlation) of the data, it does not indicate the amount of bias, scale shift, or data spread. These additional measurements are helpful in determining whether the discrepancies in the data can be compensated for experimentally (e.g., housekeeper genes) or by software algorithms. For example, when the qRT-PCR data from FF and FFPE were compared, it was found that a significant bias could be corrected by normalization to the housekeepers and applying Distance Weighted


Discrimination. Distance Weighted Discrimination corrected systematic biases but did not change other measurements of agreement. After correcting for systematic bias, it is then possible to evaluate variation due to noise that cannot be predicted or controlled.


It was found that the most useful analyses for assessing PCR primer set performance across FF and FFPE tissues were the concordance correlation coefficient, the diagonal standard deviation, and the dynamic range. Genes with a large dynamic range often had high correlation and were good classifiers across conditions, even with relatively large diagonal standard deviations. Although genes with a small dynamic range can be good classifiers, the measurement may not be as reproducible if there is a large amount of variation. Thus, it was found that the best assessment of a classifier was using a ratio of the diagonal standard deviation to the dynamic range. This allowed genes with smaller dynamic ranges to be considered as good classifiers, if they also had low diagonal standard deviations. The concordance correlation coefficient and the ratio of the diagonal standard deviation to the dynamic range selected many of the same genes as having similar performance from the FF and FFPE tissues.


Translating an assay from microarray to qRT-PCR provides a second level of gene validation and allows the test to be used on archived FFPE tissue blocks from clinical trials or on samples submitted for routine diagnostics (Paik et al. 2004; Cronin et al. Am J Pathol 2004, 164:35-42). qRT-PCR on formalin-fixed paraffin-embedded tissue can be effectively used for gene expression based diagnostics for translation into the clinical laboratory. The FFPE procured RNA provided accurate subtype classifications in breast cancer, and in some instances provided more tumor specific information than the FF derived samples. This study also developed methodologies that have wider application for developing qRT-PCR assays for subtype classification in a wide variety of cancer types. These gene expression based tests can provide powerful new prognostic clinical tools and aid in more appropriate individualized treatment decisions.









TABLE 11







Regression model using RFS and the intrinsic classes from the 315


tumor sample Combined Test Set.














Hazard

95%
95%

Std



Ratio
p-value
CI lower
CI upper
Param Est
Error
















age (decade)
1.079
0.2949
0.936133111
1.242912729
0.07573
0.07231


ER
0.692
0.1404
0.42483297 
1.128714303
−0.36749
0.24927


Node status
1.35 
0.1261
0.919128261
1.981847717
0.29985
0.19601


Grade 1 vs. 2
1.879
0.1376
0.817125651
4.322363677
0.63092
0.42494


Grade 1 vs. 3
2.576
0.0321
1.084274609
6.120897891
0.94631
0.44153


size
1.591
<0.0001
1.300348623
1.947657951
0.46463
0.10306


LumA vs. Basal-like
2.023
0.0358
1.047852886
3.904839964
0.70448
0.33558


LumA vs. HER2+/ER−
3.468
0.0003
1.780768834
6.75548522 
1.2437
0.34013


LumA vs. LumB
1.923
0.0284
1.071712675
3.449405854
0.65373
0.2982 


LumA vs. Luml
1.401
0.3669
0.673503019
2.914105175
0.33715
0.37368


LumA vs. Normal-like
1.556
0.3686
0.589038578
4.163947739
0.4486
0.49891
















TABLE 7







SAMPLES CLINICAL DATA
























ER


Size













(1 = positive;


(1 = <= 2 cm;













0 = negative);


2 => 2 cm













((fmol = 10 = +


to <= 5 cm;













(used fmol for


3 => 5 cm;





Overall







rosetta and


4 = any size





Survival







singapore) and


with direct

RFS event


number
Event







norway as


extension to

(0 = no relapse,

number
of nodes
(0 = alive,
Overall






detailed in PNAS


chest wall

1 = relapsed or
RFS
of nodes
positive
1 = DOD
survival



Sample Name
Age
Race
2003 Table
HER2
PGR
or skin)
Grade
died of disease)
months
examined
for tumor
or DOC)
months
Important Comments
























02573-BC-PRIMARY
41
AA
1


3
3
1
10
26
14
0
22
pimary for a patient wih an associated brain


A1-17-left-breast-T
84
C
0
0

4
3
1
2


1
2
Autopsy Patient Sample


A4-LUL_Lung-Mel
44
C
1


4
3
1
22


1
22
Autopsy Patient Sample


A5-Skin  text missing or illegible when filed _Right-Mel
85
AA
0


4
3
1
20
14
3
1
28
Autopsy Patient Sample


BC00010
47
C
0


3
2
1
18
21
18
1
18



BC00014T
88
AA
0


4
3
1
18
40
36
1
23



BC00024
88
AA
0


3
3
1
3
116
14
1
3
pt was diagnosed with MM at some time as  text missing or illegible when filed


BC00020
44
C
0

0

3
0
82
7
3
0
82
lymph node  text missing or illegible when filed  sample - no primary tumor  text missing or illegible when filed


BC00034
88
AA
0


1
2
0
81


0
81



BC00038
55
AA
0


2
2
0
10
23
1
0
10



BC0004
87
C
0


1
2
0
118
20
0
0
118



BC00041T
46
AA
0
0

2
3
1
13
18
0
1
28



BC00043T
43
C
0


2
3
0
78
24
0
0
76



BC00048
43
C
0


2
3
1
48
13
1
0
72
her2 was 1+ on recurrent tumor, not  text missing or illegible when filedtext missing or illegible when filed


BC00051
51
C
0


2
2
0
88
12
12
0
68



BC00052
47
AA
0
2

2
3
1
14
13
8
1
18
pt had LABC, had  text missing or illegible when filed  chemo, this specim  text missing or illegible when filed


post chemo
















BC80053
71
AA
0


2
3
1
27
21
7
1
28



BC00057
51
AA
0
3

4
3
1
8
8
8
1
12
pt had IBC, had  text missing or illegible when filed  chemo, this specimen


post chemo
















BC00064
44
C
0
2

1
3
1
10


1
47
pt had local recurrence (this is the sample  text missing or illegible when filed


RECUR
















BC00068
43
AA
0
3

3
3
1
18
38
4
1
18



BC00070
38
AA
0
0

2
2
1
22


1
25
contralateral breast cancer dx Nov. 15, 2000, dx  text missing or illegible when filed


BC00071
33
C
0


2
2
1
16
20
4
1
47



BC00078
88
AA
0
0

3
3
0
12
16
12
1
12
cirrhosis was cause of death


BC00082
84
AA
0
0

2
3
0
27
3
0
1
27
pt admitted wth CHF/NQWMI, prob died of  text missing or illegible when filed


BC00085
24
AA
0
1

1
2
0
18


0
10
extensive OCIS w/ multiple small foci of invasi  text missing or illegible when filed


BR00-03448
85
AA
1
0
0
2
3
1
7
15
2
1
30



BR00-03658
43
AA
1
2
1
4
3
0
22
8
6
0
22



BR00-03878
57
AA
1
0
1
4
2
1
8
17
10
0
51



BR00-05048
88
C
1
0
1
2
1
0
38
15
1
0
39



BR00-05728
45
AA
0
3
0
3
3
1
11
31
7
0
42



BR00-05878
88
C
1
2
1
2
3
0
37
14
0
0
37



BR00-2848
63
C
0
3
0
3
3
0
43
8
0
0
49



BR01-01258
40
C
1
0
1
3
2
0
33
17
1
0
33



BR01-02488
36
Other
1
0
1
2
2
0
31
16
8
0
37



BR01-03498
37
C
1
3
0
3
3
1
3
24
22
1
24



BR94-10838
48
C
0
3
1
1
3
1
23
19
1
1
47



BR85-00358
74
C
0
3
0
2
3
0
106
13
1
0
105



BR05-01528
72
C
1
3
0
4
3
1
26
15
0
0
101



BR85-01848
74
C
1
3
0
2
3
0
96
20
1
0
96



BR96-00148
47
AA
1
3
1
4
1
0
95


0
98



BR07-01378
53
Other
0

0
3
3
1
20
21
1
1
21
died of Unconfirmed mel ca (symptoms of  text missing or illegible when filed


BR08-01818
67
AA
0
3
0
2
2
1
36
24
0
1
80



BR88-02818
44
C
0
3
0
2
3
0
85
14
0
0
85



BR99-02078
84
C
1
0
0
2
2
0
57
5
1
0
57



BR99-03488
85
AA
1
2
1
2
2
0
32
33
0
0
32
died of other causes (deydration secondary  text missing or illegible when filed


HCI00-038
















HCI00-052
















HCI00-088L
















HCI00-182
















HCI00-283
















HCI01-041
















HCI01-155
















HCI02-235
57
C
0
0
0
2
3


12
0





HCI02-254
60
C
1
1
1
3
0
1
0
20
0


ER positive tumor (5 cml) but no positive node


M875
53

1
3
1
2
3
0
20
15
0
0
20



M876
57

1
0
1
2
2
0
22
11
0
0
22



M877
80

1
0
1
2
3
0
22
17
0
0
22
Had right breast radical mastectomy in 1979,  text missing or illegible when filed


M870
50

0
2
0
2
3


5
4





M870
63

1
3
1
2
3
0
19
7
0
0
18



M800
50

1
0
1
2
3


0
0





M881
84

1
0
1
2
2
1
13
1
1
0
16
Several recurrence (cutaneous gastric)


M883
31

0
0
0
2
3


17
0





M885
77

1
0
1
2
3


11
2





M886(LN)
72

0
0
0

3
1
15
17
7
0
41
Lymph node metastasis - Several recurrences


M887
73

0
0
0

3


1
1


metastasis is small intestine


PB120-MET-L
81
AA
0


2
3
1
1
15
14
1
13
lymph node metastasis sample this patient  text missing or illegible when filed


PB126
29
AA
0
0
0
4
3
1
1
7
7
1
18
This patient was never disease-free and died


PB126-MET-LN

AA
0
0
0
4
3
1
1
7
7
1
18



PB188
58
C
0
0
0
2
2
0
30
0
0
0
30



PB140
41
C
1
2
1
2
1
0
34
10
0
0
34



PB152-MET-LN

C
0
1
0








ER, Her2 and PQR are for PB152 but maybe


PB150T
88
AA
1


3
3
0
30
0
0
0
30



PB184
50
C
1
3
0
1
3
0
28
2
0
0
28



PB205T
38
C
0
1
0
4
2
0
5
7
1
0
5



PB244
38
AA
0
3
0
1
3
0
24
12
0
0
24



PB249
36
C
1
3
1
1
3
0
8
3
3
0
0



PB256
58
C
1
2
1
2
3
0
4
14
1
0
4



PB257
44
AA
0
2
0
2
3
0
20
32
1
0
20



PB271
45
AA
1
3
1
2
3
0
14
12
3
0
14



PB277
48
C
1
2
1
2
3
0
12
18
8
0
12



PB264
34
C
1
2
1
1
1


0
0





PB283
58
C
1
2
1
2
2
0
11
12
0
0
11



PB297
55
A
0
1
0
2
3
0
18
0
0
0
18



PB307
35

1
1
1
3
3
0
8
15
0
0
8



PB311
48
C
1
0
1
3
3
0
14
12
2
0
14



PB314
51
C
0
3
0
3
3
0
21
13
8
0
21



PB334
60
AA
0
0
0
1
3
0
19
0
0
0
19



PB370
67
C
1
0
1
2
3
0
20
11
2
0
20



PB376
50
AA
0
1
0
2
3
0
15
3
0
0
15



PB377
77
C
1
1
0
2
3
0
18
8
0
0
18
there are 2 different tumors within the same  text missing or illegible when filed


PB378
55
Other
1
1
1
2
3
0
17
12
4
0
17



PB388
90
C
1
1
1
2
3
0
18
5
0
0
18



PB407
56
C
1
0
1
3
3


11
6





PB413
63
AA
1
0
1
2
2
0
8
8
3
0
8



PB419
49
C
0
0
0
2
3
0
10
1
0
0
10



PB432
79

1
1
1
2
3


21
4





PB441
83
C
1
0
1
1
2
0
8
0
0
0
8
bilateral breast cancer and renal carcinoma


PB455
52
AA
0
3
0
3
2
0
8
8
3
0
9



PB475
60
C
1
0
1
2
2
0
2
5
0
0
2



PB479
52
Asian
1
0
1
2
3


19
1





PB516

AA
0
0
0
3
3


14
2


IDC and OCIS


UB21
77

1
0
1
1
1
0
30
1
0
0
30



UB22







0
25



25
no  text missing or illegible when filed  (fibroadenoma)


UB27
81
C
1
2
1
3
2
0
29
14
2
0
29



UB28
48
C
0
0
0
1
3
0
30
20
0
0
30



UB28A
58
C
0
0
0
2
3
0
25
19
0
0
25



UB37
42
C
0
2
1
1
3
0
26
14
3
0
25



UB38
50
C
1
0
1
1
1
0
20
13
0
0
20



UB39
48
C
1
0
0
1
2
0
25
10
0
0
25



UB43
48
C
1
1
1
1
3
0
18
14
14
0
19



UB44
50
C
1
0
1
2
3
0
24
3
1
0
24
Had the other breast removed (contained  text missing or illegible when filed


UB45
48
C
1
1
1
2
2
0
21
5
1
0
21
Had a second small tumor (6 mm - grade 1-H)


UB55
58
C
1
2
1
1
1
0
22
4
0
0
22



UB57
60
C
1
0
1
1
2
0
17
2
0
0
17



UB58
58
C
1
1
1
1
1
0
19
4
1
0
18
Graded 1 on the tissue received (then got


UB80
72
C
0
3
0
2
3
0
20
13
10
0
20



UB81
51
0
1
3
0
2
2
0
10
16
0
0
19



UB62
28
C



1
1
0
9
23
1
0
8
No evidence of malignancy (we had INC value  text missing or illegible when filed


UB84
87
C
1
3
1
2
2
0
7
15
0
8
7
No follow-up visit (person out of state)


UB86
88
Other
1
0
1
2
1
0
9
18
0
0
9
(From  text missing or illegible when filed ,  text missing or illegible when filed -chest X-ray visit used as  text missing or illegible when filed


UB87
90
C
0
0
0
1
3
0
15
15
1
0
10



UB88
40
C
1
0
8
1
1
0
13
3
0
0
13
(Can′t find IHC data the database to confirm


UB78
41
hisp
1
0
0
4
2
1
3
20
20
0
14
has bone metastasis, in abdomen and pelvis


UB79
48

1
1
0
2
2
0
2
9
2
0
2
Macro-metastasis in the lymp  text missing or illegible when filed  - Not in






text missing or illegible when filed indicates data missing or illegible when filed














TABLE B







Primer Sets and Gene ID













Gene ID




Gene symbol
Gene name
(NCBI) 
Forward primer
Reverse primer














Intrinsic






gene list






ACADSB
Acyl-Coenzyme A dehydrogenese, short/branched chain
35
CTA ACA TAC AAT GCT GCT AGG C
CAA TCT TTG CAT CTC GGA AGT





B3GNT5
UDP-GlcNacbetaGal beta-1,3-N-acetylglucosaminyltransferase 5
84002
AGA ACT AGG TGG TGT CTA C
GAT TTT CCC TAA CAG GTG C





BF
B factor, propardin
829
CAT GTG TTC AAA GTC AAG GAT A
TGC TTG TGG TAA TCG GT





C5ORF18(cDP1)
chromosome 5 open reading frame 18
7005
GTG TTC GGT TAT GGA GC
GGT ATC ATC TTC TTT GTT GGG A





COK2AP1
COK2-associated protein 1
8008
CGC AGG GAG CAA GAG T
CTT CAA AAC CAA CAA GGC AG





COX0C
cytochrome c oxidase subunit VIc
1345
AGC TTT GTA TAA GTT TCG TGT
CCA GCC TTC CTC ATC TC





CXSCL1
Chemokine (C-X3-C motif) text missing or illegible when filed and 1
6375
ATG ACA TCA AAG ATA CCT GTA G
GAC CCA TTG CTC CTT CG





CYB5
cytochrome b-5
1528
GCA CCA CAA GGT GTA CG
GCC CGA CAT CCT CAA AG





DSC2 (ESTs)
Desmocollin 2
1024
GAA TCT GGA GAC TGA AAG CAA
CAA ATG GAG GAT CAT TCT GAT AGG





EGFR
Epidermal growth factor receptor (arythroblastic leukemic viral 
1056
AGG ACA GCA TAG ACG ACA C
AGG ATT CTG CAC AGA GCC A



(v-arb-b) oncogene homolog, avian)








ERBB2
V-arb-b2 erythroblastic leukemia viral oncogene homolog,2,
2054
TCC TGT GTG GAC CTG GAT
TGC CGT CGC TTG ATG AG



neuro/gliblastoma derived text missing or illegible when filed homoltext missing or illegible when filed (ovian)








ESR1
Estrogen receptor 1
2089
CATGATCAGGTCCACCTTGT
AGCAGCATGTCGAAGATCTC





FLJ14525
Hypothetical protein FLJ14525
84805
CCT TTT CTC CTG GGA AAC
GCT TTG GAC AGT GGT CT





FOXA1
Forkhead box A1
3160
GTTAGGAACTGTGAAGATGG
GCCGCTCGTAGTCATG





FZD7
Frizzled homolog 7 (Drosophila)
0324
AGC CAT TTT GTC CTG TTT TC
CCT TCC TCT TCG TTC ACT





GARS
Glycyl-tRNA synthetase
2817
AGG GAC CGT GAC TCA A
AAA CAG AGG ATA CCT GGC





GATA3
GATAb binging protein 3
2825
AAC TGT CAG ACC ACC ACA A
GAA GTC CTC CAG TGA GTC AT





GRB7
Growth factor recaptor-bound protein 7
2886
TCG ATG CAC ACA CTG GTA T
TTC ACA TCT GCC ACG TAC T





GSTP1
Glutalhone S-transforms a p1
2050
GGG CTC TAT GGG AAG G
GTT CTG GGA CAG CAG G





HSD17B4
hydroxysteroid (17-beta) dehydrogeneso 4
3205
TGG GGC TAA GTG GAC TAT
TGC CTT CTG AGG GTC AA





KIAA0310
KIAA0310 gene product
9810
GCC CTT CTA CAA CCC TG
GCT CCA AGT GCA AGT TC





KIT
V-text missing or illegible when filed  Hardy-Zuckramon 4 (alline sarcoma viral oncogene homolog
3815
CAC GDA CCT GCT GAA AT
TCT ACC ACG GGC TTC TGT C





KRT17
Keratin 17
3872
GAG ATT GCC ACC TAC CG
GAG GAG ATG ACC TTG CC





KRT5
Keratin 5 (epidormolysis bullose simplex, Dowling-
3852
GGA GAA GGA GTT GGA CC
CCA CTG CTG CTG GAG TA



Monral/Kobner/Weber-Cockayne types)








NAT1
N-acetyltransferase 1 (arylamine N-acetyltransferase)
0
ACA GCA CTC CAG CCA AA
CTG GTA TGA GCG TCC AAA C





PGR
Progesterontext missing or illegible when filed  receptor
5241
AGC TCA CAG CGT TTC TAT C
TGT GCA GCA ATA ACT TCA GAC





PLGD1
procollagen-lysine 1,2-oxoglutartext missing or illegible when filed s-5-dioxygenase 1
5351
CGT GCC GAC TAT TGA CAT
GTA GCG GAC GAC AAA GG





PTPAA2
protein tyrosine phosphatase type IVA, member 2
6073
TCA AAG ATT CCA ACG GTC ATA G
TCT CAA GTT CCA CTT CCA GTA G





RABEP1-
Rabaptin-5
9135
ATG TCA GTG AGC AAG TCC
GCT GGT TAA TGT CTG TCA GT





RARRE53
retionic acid receptor-responder (text missing or illegible when filed rolene induced) 3
6920
GCT GAG ATA TGG CAA GTC C
CTC CTA ATC GCA AAA GAG C





S100A11
S100 calcium binding protein AB (calpranulin A)
5262
CAA AAA TCT CCA GCC CTA CA
TAA CCA TCC TTT CCA GCA TAC





SOC2
Syndecan 2 (heparan sulfate proteoglycan 1, cell surface-
6389
AAA CCA GCA CTC TGA AT
ATT TGT ATC CTC TTC GGC TG



associated fibroglycan)








SLC39AB
solute carrier family 39 (zinc transporter), member text missing or illegible when filed
25800
ACC ACC ATA GTC ATA GCC
CAT ACT TGG ACA ACT GCT TC





SLC7AB
Solute carrier family 7 (catienic amino acid transporter, y+
5057
AGC GTT TTA CAC CTA TCC C
CCA CGA AGA ACC AGT AGC



system), member B








SLPI
secretory leukocyte protease inhibitor (antileukoproteinase)
6590
GTG TGG GAA ATC CTG CG
GTG GRG GAG CCA AGT CT





SMA3
SMA3
10571
CCG TAC CTG ATG CAC GAA
GTG CCC GTA GTT GCG ATA





TAP1
transporter 1, ATP-binding cassette, sub-family 0 (MDR/TAP
8890
AAG ACA CTC AAC CAG AAG G
GGT AGA GAA CAA ATG TGA CAA GG





TRIM29
Tripartha metext missing or illegible when filed -combining 29
23650
AAC AAC TAC ACG AAC AGC
ATT CTT CTG GGT GGT CTC





XBP1
X-box binding protein 1
7494
CTG TTG GGC ATT CTG GAC
GGA GGC TGG TAA GGA ACT





Proliferation






genes






BIRC5
bactext missing or illegible when filed  IAP repeat-containing 5 (survivin)
332
CGA CCC CAT AGA GGA ACA TAA
TTC TTG ACA GAA AGG AAA GCG





BUB1
budding uninhibited by benzimidazoles 1 homolog (yeast)
889
CAC TTG GGA CTG TTG ATG
TGG ATA GGA ACT CAC TGG T





CENFF
Centromace protein F, 350/40 Dka (milosin)
1863
CCA CTG AGT CTC GGC AA
ATT TCG TGG TGG GTT CT





CKS2
CDC2B protein kinase regulatory subunit 2
1184
TGG AGG AGA CTT GGT GT
GAA TAT GTG GTT CTG GCT CA





FAM54A(=DUPD1)
family with sequence similarity 54, member A
118110
GTG GAA ATG CAG GAA CTG AA
GCT CGT CAC TCA AGC CAA





GTPBF4
GTP binding protein 4
23560
GGT GTT GAC ATG GAC GAT AA
CTT CCC GCT TTC TTT TCC TA





HSPA14
heat shock 70 kDa protein 14
51102
GTT TAG AAG CAA TCA GAG GAC T
CCT CCA CAA AGG ACA ACC





MKI87
Antigen identified by monoclonal antibody KI-87
4230
TCA GAC TCC ATG TGC CT
CTT CAC TGT CCC TAT GAC TTC





MYBL2
v-myb myelabiastosis viral oncogene homolog (avian)-like 2
4805
CAC ACT GCC CAA GTC TCT A
AAG CTG TTG TCT TCT TTG ATA CC





NEK2
NIMA (never in mitosis gene a)-related kinese 2
4751
AGC TTG GAG ACT TTG GG
GTA ATA AGG TGT GCC AAC AAA T





PCNA
Proliferating cell nuclear text missing or illegible when filed
5111
GTC ACA GAC AAG TAA TGT CG
TAC TGA GTG TCA CCG TT





STK0
serine/theronine kinase 0
8700
CTT ACT GTC ATT CGA AGA GAG TT
AGT CAT CCG AAC TTC AAT C





TDP2A
Tepoisemerase (DNA) text missing or illegible when filed  alpha 170 kDa
7153
AAG CAC ATC AGG TGA AAA AT
TAC CAC AGC CAA TGG CA





TTK
TTK protein kinase
7272
ACG GAA TCA AGT CTT CTA GC
TGC CAC TGT TTC TGG TTA C





Housekeeper






genes






MRFL1B
Mitochondrial ribosomol protein L1B
9601
GGG ATT TGC ATT CAG AGA TCA G
GGA AGG GCA TCT CGT AAG





PEMC4
Proteasome (prosome, macropein) 20S subunit, ATPase, 4
5704
GGC ATG GAC ATC CAG AAG
CCA CGA CCC GGA TGA AT





PUM1
Pumilla homolog 1 (Drosophila)
8500
TGAGGTGTGCACCATGAAC
CAGAATGTGCTTGCCATAGG






text missing or illegible when filed indicates data missing or illegible when filed














TABLE 9





45 Paired Samples for Intrinsic Analysis from Sorlie et al. 2003

















shaz111.BC.FUMI05.AF



shaz110.BC.FUMI05.BE



shaz105.BC.FUMI06.AF



shaz104.BC.FUMI06.BE



shaz117.BC.FUMI07.AF



shaz116.BC.FUMI07.BE



shby032.BC.FUMI20.AF



shby020.BC.FUMI20.BE



shaz123.BC.FUMI27.AF



shaz122.BC.FUMI27.BE



shaz115.BC.FUMI35.AF



shaz114.BC.FUMI35.BE



shaz127.BC.FUMI37.AF



shaz126.BC.FUMI37.BE



svl012..BC104A.BE



svl013..BC104B.AF



svl005..BC106A.AF



svl006..BC106B.BE



svcc63..BC107A.AF



svcc98..BC107B.BE



svl003..BC108A.BE



svl004..BC108B.AF



svcc77..BC110A.AF



svcc78..BC110B.BE



svcc97..BC112A.AF



svcc53..BC112B.BE



svcc81..BC114A.BE



svcc52..BC114B.AF



svcc64..BC115A.AF



svcc106.BC115B.BE



svcc112.BC118A.AF



svcc134.BC118B.BE



svl015..BC119A.BE



svl014..BC119B.AF



svl027..BC120A.BE



svl02B..BC120B.AF



svl017..BC121A.AF



svl016..BC121B.BE



svcc91..BC123A.AF



svcc89..BC123B.BE



svcc111.BC124A.BE



svcc109.BC124B.AF



svl018..BC125A.BE



svl019..BC125B.AF



svcc96..BC2



svcc113.BC2.LN2



svcc93..BC206A.BE



svcc135.BC206B.AF



svcc107.BC208A.BE



svcc125.BC208B.AF



svcc79..BC213A.AF



svcc76..BC213B.BE



svcc103.BC214A.AF



svcc92..BC214B.BE



svl021..BC303A.AF



svl020..BC303B.BE



svcc131.BC305A.BE



svcc58..BC305B.AF



svl032..BC307A.AF



svl103..BC307B.BE



svcc115.BC38



svcc116.BC38.LN38



svcc66..BC402B.AF



svcc83..BC402B.BE



svcc36..BC404A.AF



svl033..BC404B.BE



svl029..BC405A.BE



svl030..BC405B.AF



shby035.BC601A.BE



shby036.BC601B.AF



svl042..BC608A.AF



svl036..BC608B.BE



svl040..BC702A.AF



svl041..BC702B.BE



shby034.BC703A.AF



shby037.BC703B.BE



svl039..BC706A.BE



svl038..BC706B.AF



svcc86..BC708A.AF



svcc104.BC708B.BE



svcc85..BC709A.AF



svcc84..BC709B.BE



svcc101.BC710A.BE



svcc82..BC710B.AF



svcc65..BC711A.AF



svcc120.BC711B.BE



svcc105.BC805A.BE



svcc121.BC805B.AF



svcc126.BC808A.AF



svcc124.BC808A.BE






















TABLE 10





Gene
OS~Gene
OS~Gene + Grade
OS~Gene + Stage
OS~Gene + Grade + Stage
Prolif. Gene




















SMA3
0.0010086
0.00814571
0.000398174
0.00357674
NO


KIT
0.000332738
0.00154407
0.00272027
0.00672142
NO


GBTPBP4
0.00445804
0.0307721
0.00150072
0.0112402
YES


COX6C
0.00289023
0.00951953
0.0028745
0.0125619
NO


CX3CL1
0.00217324
0.00425494
0.0181299
0.0152864
NO


KRT17
0.0321012
0.0420179
0.0233713
0.015837
NO


B3GNT5
0.032762
0.117857
0.00427977
0.0406316
NO


PLOD
0.00730183
0.152132
0.025899
0.0608959
NO


SLPI
0.0533249
0.0795638
0.0372877
0.0720347
NO


DSC2
0.0432628
0.19777
0.0199733
0.076893
NO


GRB7
0.0023925
0.00997476
0.0212037
0.076893
NO


TRIM29
0.0758398
0.969003
0.10943
0.0808424
NO


STK6
0.0353601
0.192395
0.0169665
0.0990307
YES


BUB1
0.0572953
0.237575
0.0218123
0.123044
YES


NAT1
0.0127223
0.0791954
0.0189787
0.135405
NO


CYB5
0.0557461
0.287241
0.0273843
0.137872
NO


PTP4A2
0.160424
0.0858591
0.342854
0.138471
NO


TTK
0.110921
0.45438
0.0192107
0.143497
YES


HSPA14
0.391113
0.8142
0.0511814
0.144083
YES


GATA3
0.0324598
0.289619
0.0175668
0.157456
NO


ESR1
0.030409
0.145509
0.0405537
0.184542
NO


SLC39A6
0.0733459
0.430962
0.024724
0.207555
NO


ERBB2
0.0459011
0.0828308
0.169867
0.24427
NO


FOXA1
0.110671
0.4427
0.094167
0.330446
NO


EGFR
0.145898
0.183089
0.3197
0.57336
NO


DUFD1
0.378603
0.985614
0.0888335
0.59478
YES


MYBL2
0.0399249
0.176578
0.0716375
0.361422
YES


S100A11
0.34613
0.556875
0.230849
0.363064
NO


XBP1
0.045776
0.268606
0.0926021
0.400871
NO


TOP2A
0.240971
0.655786
0.0969129
0.404568
YES


KIAA0310
0.484382
0.772587
0.342042
0.406749
NO


KRT5
0.985088
0.984712
0.641471
0.409027
NO


BF
0.046196
0.204647
0.105472
0.463932
NO


GSTP1
0.687906
0.677131
0.557251
0.465849
NO


FZD7
0.594194
0.90597
0.384141
0.47759
NO


NEK2
0.46014
0.932809
0.172718
0.500592
YES


TAP1
0.663093
0.482788
0.541857
0.534398
NO


FLJ14525
0.17537
0.17907
0.613531
0.561022
NO


ACADSB
0.0698192
0.387308
0.118621
0.576123
NO


GARS
0.709987
0.923267
0.902252
0.630522
NO


BIRC5
0.397737
0.975853
0.170876
0.632892
YES


HSD17B4
0.206242
0.395994
0.305472
0.635554
NO


MKI67
0.311764
0.709371
0.195635
0.640833
YES


PCNA
0.868635
0.731512
0.557926
0.645851
YES


PGR
0.355079
0.965257
0.181127
0.681739
NO


RABEP1
0.543773
0.963589
0.377702
0.682359
NO


SLC7A6
0.432451
0.689547
0.419107
0.685462
NO


SDC2
0.47607
0.37331
0.914923
0.689713
NO


CKS2
0.936337
0.36756
0.180917
0.763492
YES


DP1
0.149164
0.576409
0.32648
0.839276
NO


CENPF
0.19591
0.730895
0.203913
0.8435
YES


CDK2AP1
0.711736
0.908545
0.835195
0.883836
NO


RARRES3
0.0189691
0.107372
0.698642
0.943889
NO



















TABLE 12





Rank
UGCluster
Symbol
Gene Name


















1
Hs.163484
FOXA1
Forkhead box A1 || NM_004496 || 14q12-q13


2
Hs.446352
ERBB2
V-erb-b2 erythroblastic leukemia viral oncogene homolog 2, neuro/glioblastoma derived oncogene





homolog (avian) || NM_001005862 || 17q11.2-q12


3
Hs.496240
AR
Androgen receptor (dihydrotestosterone receptor; testicular feminization; spinal and bulbar muscular





atrophy; Kennedy disease) || NM_000044 || Xq11.2-q12


4
Hs.387057
FLJ13710
Hypothetical protein FLJ13710 || BX641106 || 15q23


5
Hs.437638
XBP1
X-box binding protein 1 || AK093842 || 22q12.1


6
Hs.348883
FOXC1
Forkhead box C1 || NM_001453 || 6p25


7
Hs.82961
TFF3
Trefoil factor 3 (intestinal) || BU536516 || 21q22.3


8
Hs.155956
NAT1
N-acetyltransferase 1 (arylamine N-acetyltransferase) || BC013732 || 8p23.1-p21.3


9
Hs.100686
BCMP11
Breast cancer membrane protein 11 || BG540617 || 7p21.1


10
Hs.524134
GATA3
GATA binding protein 3 || NM_001002295 || 10p15


11
Hs.530009
AGR2
Anterior gradient 2 homolog (Xenopus laevis) || BM924878 || 7p21.3


12
Hs.208124
ESR1
Estrogen receptor 1 || NM_000125 || 6q25.1


13
Hs.523468
SCUBE2
Signal peptide, CUB domain, EGF-like 2 || NM_020974 || 11p15.3


14
Hs.469649
BUB1
BUB1 budding uninhibited by benzimidazoles 1 homolog (yeast) || AF053305 || 2q14


15
Hs.79136
SLC39A6
Solute carrier family 39 (zinc transporter), member 6 || NM_012319 || 18q12.2


16
Hs.144197
UGTB
UDP glycosyltransferase 8 (UDP-galactose ceramide galactosyltransferase) || NM_003360 || 4q26


17
Hs.27373
LOC40045
Hypothetical gene supported by AK075564; BC060873 || NM_207446 || 15q26.1


18
Hs.414407
KNTC2
Kinetochore associated 2 || NM_006101 || 18p11.32


19
Hs.115838
TMC5
Transmembrane channel-like 5 || AY358155 || 16p12.3


20
Hs.210995
CA12
Carbonic anhydrase XII || NM_001218 || 15q22


21
Hs.532968
DKFZp762
Hypothetical protein DKFZp762E1312 || AK074809 || 2q37.1


22
Hs.514527
BIRC5
Baculoviral IAP repeat-containing 5 (survivin) || NM_001012271 || 17q25


23
Hs.62180
ANLN
Anillin, actin binding protein (scraps homolog, Drosophila) || NM_018685 || 7p15-p14


24
Hs.14559
C10orf3
Chromosome 10 open reading frame 3 || NM_018131 || 10q23.33


25
Hs.76277
C19orf32
Chromosome 19 open reading frame 32 || BC008201 || 19p13.3


26
Hs.194698
CCNB2
Cyclin B2 || AK023404 || 15q22.2


27
Hs.520189
ELOVL5
ELOVL family member 5, elongation of long chain fatty acids





(FEN1/Elo2, SUR4/Elo3-like, yeast) || AL833001 || 6p21.1-p12.1


28
Hs.504301
LOC12022
Transmembrane protein 45B || NM_138788 || 11q24.3


29
Hs.169840
TTK
TTK protein kinase || NM_003318 || 6q13-q21


30
Hs.87417
CTSL2
Cathepsin L2 || BC067289 || 9q22.2


31
Hs.1594
CENPA
Centromere protein A, 17 kDa || BM911202 || 2p24-p21


32
Hs.127407
GALNT7
UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase 7





(GalNAc-T7) || BC047468 || 4q31.1


33
Hs.260720
DNAJC12
DnaJ (Hsp40) homolog, subfamily C, member 12 || NM_021800 || 10q22.1


34
Hs.102406
MLPH
Melanophilin || AK096789 || 2q37.3


35
Hs.692
TACSTD1
Tumor-associated calcium signal transducer 1 || AK026585 || 2p21


36
Hs.524947
CDC20
CDC20 cell division cycle 20 homolog (S. cerevisiae) || BG256659 || 1p34.1


37
Hs.99949
PIP
Prolactin-induced protein || BF965123 || 7q34


38
Hs.470654
CDCA7
Cell division cycle associated 7 || AL834186 || 2q31


39
Hs.279651
MIA
Melanoma inhibitory activity || BG765502 || 19q13.32-q13.33


40
Hs.205952
LOC20189
Hypothetical protein LOC201895 || BC047541 || 4p14


41
Hs.267659
VAV3
Vav 3 oncogene || NM_006113 || 1p13.3


42
Hs.86859
GRB7
Growth factor receptor-bound protein 7 || NM_005310 || 17q12


43
Hs.93002
UBE2C
Ubiquitin-conjugating enzyme E2C || BC032677 || 20q13.12


44
Hs.271224
PH-4
Hypoxia-inducible factor prolyl 4-hydroxylase || NM_017732 || 3p21.31


45
Hs.24976
ART3
ADP-ribosyltransferase 3 || AK129914 || 4p15.1-p14


46
Hs.184339
MELK
Maternal embryonic leucine zipper kinase || NM_014791 || 9p13.2


47
Hs.524571
CDCA8
Cell division cycle associated 8 || BC000703 || 1p34.3


48
Hs.406050
DNALI1
Dynein, axonemal, light intermediate polypeptide 1 || AK126963 || 1p35.1


49
Hs.152385
FLJ10980
Hypothetical protein FLJ10980 || BC040548 || 15q21.2-q21.3


50
Hs.523220
RAD54L
RA054-like (S. cerevisiae) || NM_003579 || 1p32


51
Hs.406013
KRT18
Keratin 18 || CR616919 || 12q13


52
Hs.487036
MYO5C
Myosin VC || NM_018728 || 15q21


53
Hs.494496
FBP1
Fructose-1,6-bisphosphatase 1 || AK223395 || 9q22.3


54
Hs.474217
CDC45L
CDC45 cell division cycle 45-like (S. cerevisiae) || BM478416 || 22q11.21


55
Hs.189119
CXXC5
CXXC finger 5 || NM_016463 || 5q31.2


56
Hs.284153
FANCA
Fanconi anemia, complementation group A || NM_000135 || 16q24.3


57
Hs.531941
MYB
V-myb myeloblastosis viral oncogene homolog (avian)|| AJ606319 || 6q22-q23


58
Hs.549195
OGFRL1
Opioid growth factor receptor-like 1 || NM_024576 || 6q13


59
Hs.69360
KIF2C
Kinesin family member 2C || NM_006845 || 1p34.1


60
Hs.226390
RRM2
Ribonucleotide reductase M2 polypeptide || AK123010 || 2p25-p24


61
Hs.250822
STK6
Serine/threonine kinase 6 || NM_198433 || 20q13.2-q13.3


62
Hs.490655
ARP3BETtext missing or illegible when filed
Actin-related protein 3-beta || AB209174 || 7q32-q36


63
Hs.516297
TCF7L1
Transcription factor 7-like 1 (T-cell specific, HMG-box) || AK128630 || 2p11.2


64
Hs.252387
CELSR1
Cadherin, EGF LAG seven-pass G-type receptor 1 (flamingo homolog,






Drosophila) || AF231024 || 22q13.3



65
Hs.179718
MYBL2
V-myb myeloblastosis viral oncogene homolog (avian)-like 2 || BX647151 || 20q13.1


66
Hs.201034
NTN4
Netrin 4 || AF278532 || 12q22-q23


67
Hs.42645
SLC16A6
Solute carrier family 16 (monocarboxylic add transporters), member 6 || NM_004694 || 17q24.2


68
Hs.66762
C10orf38
Chromosome 10 open reading frame 38 || AL050367 || 10p13


69
Hs.231320
GPR160
G protein-coupled receptor 160 || AJ249248 || 3q26.2-q27


70
Hs.517549
PIB5PA
Phosphatidylinositol (4,5) bisphosphate 5-phosphatase, A || AK092859 || 22q11.2-q13.2


71
Hs.370549
BCL11A
B-cell CLL/lymphoma 11A (zinc finger protein) || NM_022893 || 2p16.1


72
Hs.96055
E2F1
E2F transcription factor 1 || BC050369 || 20q11.2


73
Hs.505469
RACGAP1
Rac GTPase activating protein 1 || NM_013277 || 12q13.12


74
Hs.436187
TRIP13
Thyroid hormone receptor interactor 13 || NM_004237 || 5p15.33


75
Hs.5199
HSPC150
Ubiquitin-conjugating enzyme E2T (putative) || BF690859 || 1q32.1


76
Hs.529181
CAPN13
Calpain 13 || BX647678 ||


77
Hs.49433
PTE2B
Peroxisomal acyl-CoA thioesterase 2B || AK055797 || 14q24.3


78
Hs.459362
PRC1
Protein regulator of cytokinesis 1 || NM_003981 || 15q26.1


79
Hs.485158
SPDEF
SAM pointed domain containing ets transcription factor || BC021299 || 6p21.3


80
Hs.262811
KIAA1324
Maba1 || AB037745 || 1p13.3


81
Hs.213424
SFRP1
Secreted frizzled-related protein 1 || BC036503 || 8p12-p11.1


82
Hs.364544
TM4SF13
Tetraspanin 13 || AK128509 || 7p21.1


83
Hs.533185
MAD2L1
MAD2 mitotic arrest deficient-like 1 (yeast) || BQ215664 || 4q27


84
Hs.153704
NEK2
NIMA (never in mitosis gene a)-related kinase 2 || BC043502 || 1q32.2-q41


85
Hs.105547
NPDC1
Neural proliferation, differentiation and control, 1 || AK054950 || 9q34.3


86
Hs.489353
GPSM2
G-protein signalling modulator 2 (AGS3-like, C. elegans) || NM_013296 || 1p13.3


87
Hs.77695
DLG7
Discs, large homolog 7 (Drosophila) || NM_014750 || 14q22.3


88
Hs.529285
SLC40A1
Solute carrier family 40 (iron-regulated transporter), member 1 || NM_014585 || 2q32


89
Hs.49760
ORC6L
Origin recognition complex, subunit 6 homolog-like (yeast) || NM_014321 || 16q12


90
Hs.498248
EXO1
Exonuclease 1 || NM_130398 || 1q42-q43


91
Hs.73625
KIF20A
Kinesin family member 20A || AK025790 || 5q31


92
Hs.165904
EPN3
Epsin 3 || AK000785 || 17q21.33


93
Hs.350966
PTTG1
Pituitary tumor-transforming 1 || BQ278502 || 5q35.1


94
Hs.199487
RERG
RAS-like, estrogen-regulated, growth inhibitor || BC007997 || 12p12.3


95
Hs.351344
TMEM25
Transmembrane protein 25 || AK124814 || 11q23.3


96
Hs.487296
PHGDH
Phosphoglycerate dehydrogenase || AK093306 || 1p12


97
Hs.396783
SLC9A3Rtext missing or illegible when filed
Solute carrier family 9 (sodium/hydrogen exchanger), isoform 3 regulator 1 || BX648303 || 17q25.1


98
Hs.404323
FLJ10156
Family with sequence similarity 64, member A || CR590914 || 17p13.2


99
Hs.269109
SEMA3C
Sema domain, immunoglobulin domain (Ig), short basic domain, secreted, (semaphorin)





3C || NM_006379 || 7q21-q31.


100
Hs.234545
CDCA1
Cell division cycle associated 1 || NM_145697 || 1q23.3






text missing or illegible when filed indicates data missing or illegible when filed




















TABLE 13





7P-








SAM-








Order
CLID
Gene Name
Annotation
ProbeID
ProbeID
ProbeID





















1
Hs.163484
Forkhead box A1
Forkhead box A1 || NM_004496 || 14q12-q13
AGI_HUM1_OLIGO_A_23_P37127




2
Hs.446352
V-erb-b2 erythroblastic leukemia viral oncogene
V-erb-b2 erythroblastic leukemia viral oncogene
AGI_HUM1_OLIGO_A_23_P89249






homolog 2, neural/glioblastoma derived oncoge  text missing or illegible when filed
homolog 2, neural/glioblastoma derived oncoge  text missing or illegible when filed





3
Hs.496240
Androgen receptor (dihydrotestosterone receptor,
Androgen receptor (dihydrotestosterone receptor,
AGI_HUM1_OLIGO_A_23_P113111






testicular feminization; spinal and bulbar musc  text missing or illegible when filed
testicular feminization; spinal and bulbar musc  text missing or illegible when filed





4
Hs.387057
Hypothetical protein FLJ13710
Hypothetical protein FLJ13710 || BX641106 || 15q23
AGI_HUM1_OLIGO_A_23_P148249




5
Hs.437638
X-box binding protein 1
X-box binding protein 1 || AK093842 || 22q12.1
AGI_HUM1_OLIGO_A_23_P120845




6
Hs.348883
Forkhead box C1
Forkhead box C1 || NM_001453 || 6p25
AGI_HUM1_OLIGO_A_23_P390504




7
Hs.82961
Trefoil factor 3 (intestinal)
Trefoil factor 3 (intestinal) || BU536516 || 21q22.3
AGI_HUM1_OLIGO_A_23_P257296




8
Hs.155956
N-acetyltransferase 1 (arylamine N-
N-acetyltransferase 1 (arylamine N-acetyltransferase)
AGI_HUM1_OLIGO_A_23_P95596






acetyltransferase)
|| BC013732 || 8p23.1-p21.3





9
Hs.100686
Breast cancer membrane protein 11
Breast cancer membrane protein 11 || BG540617 || 7p21.1
AGI_HUM1_OLIGO_A_23_P42811




10
Hs.524134
GATA binding protein 3
GATA binding protein 3 || NM_001002295 || 10p15
AGI_HUM1_OLIGO_A_23_P75056




11
Hs.530009
Anterior gradient 2 homolog (Xenopuslaevis)
Anterior gradient 2 homolog (Xenopuslaevis)
AGI_HUM1_OLIGO_A_23_P31407







|| BM924878 || 7p21.3





12
Hs.208124
Estrogen receptor 1
Estrogen receptor 1 || NM_000125 || 6q25.1
AGI_HUM1_OLIGO_A_23_P309739
AGI_HUM1_OLIGO_








A_23_P59308



13
Hs.523468
Signal peptide, CUB domain, EGF-like 2
Signal peptide, CUB domain, EGF-like 2 || NM_020974 ||
AGI_HUM1_OLIGO_A_23_P105144







11p15.3





14
Hs.469649
BUB1 budding uninhibited by benzimidazoles 1
BUB1 budding uninhibited by benzimidazoles 1 homolog
AGI_HUM1_OLIGO_A_23_P124417






homolog (yeast)
(yeast) || AF053305 || 2q14





15
Hs.79136
Solute carrier family 39 (zinc transporter),
Solute carrier family 39 (zinc transporter), member 6
AGI_HUM1_OLIGO_A_23_P50167






member 6
|| NM_012319 || 18q12.2





16
Hs.144197
UDP glycosyltransferase 8 (UDP-galactose
UDP glycosyltransferase 8 (UDP-galactose ceramide
AGI_HUM1_OLIGO_A_23_P51348






ceramide galactosyltransferase)
galactosyltransferase) || NM_003360 || 4q26





17
Hs.27373
Hypothetical gene supported by AK075564;
Hypothetical gene supported by AK075564; BC060873
AGI_HUM1_OLIGO_A_23_P100001






BC060873
|| NM_207446 || 15q26.1





18
Hs.414407
Kinetochore associated 2
Kinetochore associated 2 || NM_006101 || 18p11.32
AGI_HUM1_OLIGO_A_23_P50108




19
Hs.115838
Transmembrane channel-like 5
Transmembrane channel-like 5 || AY358155 || 16p12.3
AGI_HUM1_OLIGO_A_23_P15101




20
Hs.210995
Carbonic anyhydrase XII
Carbonic anyhydrase XII || NM_001218 || 15q22
AGI_HUM1_OLIGO_A_23_P151956
AGI_HUM1_OLIGO_








A_23_P163336



21
Hs.532968
Hypothetical protein DKFZp762E1312
Hypothetical protein DKFZp762E1312 || AK074809 ||
AGI_HUM1_OLIGO_A_23_P79429







2q37.1





22
Hs.514527
Baculoviral IAP repeat-containing 5 (survivin)
Baculoviral IAP repeat-containing 5 (survivin)
AGI_HUM1_OLIGO_A_23_P118815







|| NM_001012271 || 17q25





23
Hs.62180
Anillin, actin binding protein (scraps homolog,
Anillin, actin binding protein (scraps homolog, Drosophila)
AGI_HUM1_OLIGO_A_23_P157099







Drosophila)

|| NM_018685 || 7p15-p14





24
Hs.14559
Chromosome 10 open reading frame 3
Chromosome 10 open reading frame 3 || NM_018131 ||
AGI_HUM1_OLIGO_A_23_P115872







10q23.33





25
Hs.76277
Chromosome 19 open reading frame 32
Chromosome 19 open reading frame 32 || BC008201 ||
AGI_HUM1_OLIGO_A_23_P90510







19p13.3





26
Hs.194698
Cyclin B2
Cyclin B2 || AK023404 || 15q22.2
AGI_HUM1_OLIGO_A_23_P65757




27
Hs.520189
ELOVL family member 5, elongation of
ELOVL family member 5, elongation of long chain
AGI_HUM1_OLIGO_A_23_P156498






long chain fatty acids (FEN1/Elo2,
fatty acids (FEN1/Elo2, SUR4/Elo3-like, yeast  text missing or illegible when filed







SUR4/Elo3-like, yeast






28
Hs.504301
Transmembrane protein 45B
Transmembrane protein 45B || NM_138788 || 11q24.3
AGI_HUM1_OLIGO_A_23_P1682




29
Hs.169840
TTK protein kinase
TTK protein kinase || NM_003318 || 6q13-q21
AGI_HUM1_OLIGO_A_23_P259586




30
Hs.87417
Cathepsin L2
Cathepsin L2 || BC067289 || 9q22.2
AGI_HUM1_OLIGO_A_23_P146456




31
Hs.1594
Centromere protein A, 17 kDa
Centromere protein A, 17 kDa || BM911202 || 2p24-p21





32
Hs.127407
UDP-N-acetyl-alpha-D-galactosamine:
UDP-N-acetyl-alpha-D-galactosamine:polypeptide
AGI_HUM1_OLIGO_A_23_P108910






polypeptide N-acetylgalactosaminyltransferase
N-acetylgalactosaminyltransferase 7 (GalNAc  text missing or illegible when filed
AGI_HUM1_OLIGO_A_23_P144384






7 (GalNAc






33
Hs.260720
DnaJ (Hsp40) homolog, subfamily C,
DnaJ (Hsp40) homolog, subfamily C, member 12
AGI_HUM1_OLIGO_A_23_P127220






member 12
|| NM_021800 || 10q22.1





34
Hs.102406
Melanophilin
Melanophilin || AK096789 || 2q37.3
AGI_HUM1_OLIGO_A_23_P154400




35
Hs.692
Tumor-associated calcium signal transducer 1
Tumor-associated calcium signal transducer 1
AGI_HUM1_OLIGO_A_23_P91081







|| AK026585 || 2p21





36
Hs.524947
CDC20 cell division cycle 20 homolog
CDC20 cell division cycle 20 homolog (S.cerevisiae)
AGI_HUM1_OLIGO_A_23_P149195






(S.cerevisiae)
|| BG256659 || 1p34.1





37
Hs.99949
Prolactin-induced protein
Prolactin-induced protein || BF965123 || 7q34
AGI_HUM1_OLIGO_A_23_P8702




38
Hs.470654
Cell division cycle associated 7
Cell division cycle associated 7 || AL834186 || 2q31
AGI_HUM1_OLIGO_A_23_P251421




39
Hs.279651
Melanoma inhibitory activity
Melanoma inhibitory activity || BG765502 || 19q13.32-q13.33
AGI_HUM1_OLIGO_A_23_P4714




40
Hs.205952
Hypothelical protein LOC201895
Hypothelical protein LOC201895 || BC047541 || 4p14
AGI_HUM1_OLIGO_A_23_P112634




41
Hs.267659
Vav 3 oncogene
Vav 3 oncogene || NM_006113 || 1p13.3
AGI_HUM1_OLIGO_A_23_P201551




42
Hs.86859
Growth factor receptor-bound protein 7
Growth factor receptor-bound protein 7 || NM_005310 ||
AGI_HUM1_OLIGO_A_23_P163983







17q12





43
Hs.93002
Ubiquitin-conjugating enzyme E2C
Ubiquitin-conjugating enzyme E2C || BC032677 || 20q13.12
AGI_HUM1_OLIGO_A_23_P143207




44
Hs.271224
Hypoxia-inducible factor prolyl-4-hydroxylase
Hypoxia-inducible factor prolyl-4-hydroxylase
AGI_HUM1_OLIGO_A_23_P113317







|| NM_017732 || 3p21.31





45
Hs.24976
ADP-ribosyltransferase 3
ADP-ribosyltransferase 3 || AK129914 || 4p15.1-p14
AGI_HUM1_OLIGO_A_23_P80918




46
Hs.184339
Maternal embryonic leucine zipper kinase
Maternal embryonic leucine zipper kinase || NM_014791 ||
AGI_HUM1_OLIGO_A_23_P94422







9p13.2





47
Hs.524571
Cell division cycle associated 8
Cell division cycle associated 8 || BC000703 || 1p34.3
AGI_HUM1_OLIGO_A_23_P375




48
Hs.406050
Dynein, axonemal, light intermediate
Dynein, axonemal, light intermediate polypeptide 1
AGI_HUM1_OLIGO_A_23_P160377






polypeptide 1
|| AK126963 || 1p35.1





49
Hs.152385
Hypothetical protein FLJ10980
Hypothetical protein FLJ10980 || BC040548 ||
AGI_HUM1_OLIGO_A_23_P99853







15q21.2-q21.3





50
Hs.523220
RADS4-like (S.cerevisiae)
RADS4-like (S.cerevisiae) || NM_003579 || 1p32
AGI_HUM1_OLIGO_A_23_P74115




51
Hs.406013
Keratin 18
Keratin 18 || CR616919 || 12q13
AGI_HUM1_OLIGO_A_23_P122650
AGI_HUM1_OLIGO_








A_23_P99324



52
Hs.487036
Myosin VC
Myosin VC || NM_018728 || 15q21
AGI_HUM1_OLIGO_A_23_P140434




53
Hs.494496
Fructose-1,6-bisphosphatase 1
Fructose-1,6-bisphosphatase 1 || AK223395 || 9q22.3
AGI_HUM1_OLIGO_A_23_P257111




54
Hs.474217
CDC45 cell division cycle 45-like
CDC45 cell division cycle 45-like (S.cerevisiae)
AGI_HUM1_OLIGO_A_23_P57379






(S.cerevisiae)
|| BM478416 || 22q11.21





55
Hs.189119
CXXV finger 5
CXXC finger 5 || NM_016463 || 5q31.2
AGI_HUM1_OLIGO_A_23_P213680




56
Hs.284153
Fanconi anemia, complementation group A
Fanconi anemia, complementation group A
AGI_HUM1_OLIGO_A_23_P206441







|| NM_000135 || 16q24.3





57
Hs.531941
V-myb myeloblastosis viral oncogene homolog
V-myb myeloblastosis viral oncogene homolog (avian)
AGI_HUM1_OLIGO_A_23_P31073






(avian)
|| AJ606319 || 6q22-q23





58
Hs.549195
Opioid growth factor receptor-like 1
Opioid growth factor receptor-like 1 || NM_024576 || 6q13
AGI_HUM1_OLIGO_A_23_P7791




59
Hs.69360
Kinesin family member 2C
Kinesin family member 2C || NM_006845 || 1p34.1
AGI_HUM1_OLIGO_A_23_P34788




60
Hs.226390
Ribonucleotide reductase M2 polypeptide
Ribonucleotide reductase M2 polypeptide || AK123010 ||
AGI_HUM1_OLIGO_A_23_P136222







2p25-p24





61
Hs.250822
Serine/threonine kinase 6
Serine/threonine kinase 6 || NM_198433 || 20q13.2-q13.3
AGI_HUM1_OLIGO_A_23_P131866




62
Hs.490655
Actin-related protein 3-beta
Actin-related protein 3-beta || AB209174 || 7q32-q36
AGI_HUM1_OLIGO_A_23_P123193




63
Hs.516297
Transcription factor 7-like 1 (T-cell specific,
Transcription factor 7-like 1 (T-cell specific, HMG-box)
AGI_HUM1_OLIGO_A_23_P142872






HMG-box)
|| AK128630 || 2p11.2





64
Hs.252387
Cadherin, EGF LAG seven-pass G-type
Cadherin, EGF LAG seven-pass G-type receptor 1
AGI_HUM1_OLIGO_A_23_P132378






receptor 1 (flamingo homolog, Drosophlia)
(flamingo homolog, Drosophlia) || AF231024





65
Hs.179718
V-myb myeloblastosis viral oncogene homolog
V-myb myeloblastosis viral oncogene homolog
AGI_HUM1_OLIGO_A_23_P143184






(avian)-like 2
(avian)-like 2 || BX647151 || 20q13.1





66
Hs.201034
Netrin 4
Netrin 4 || AF278532 || 12q22-q23
AGI_HUM1_OLIGO_A_23_P04630




67
Hs.42645
Solute carrier family 16 (monocarboxylic acid
Solute carrier family 16 (monocarboxylic acid transporters),
AGI_HUM1_OLIGO_A_23_P152791






transporters), member 6
member 6 || NM_004694 || 17q24.2





68
Hs.66762
Chromosome 10 open reading frame 38
Chromosome 10 open reading frame 38 || AL050367 ||
AGI_HUM1_OLIGO_A_23_P44964







10p13





69
Hs.231320
G protein-coupled receptor 160
G protein-coupled receptor 160 || AJ249248 || 3q26.2-q27
AGI_HUM1_OLIGO_A_23_P167005




70
Hs.517549
Phosphatidylinositol (4,5) bisphosphate
Phosphatidylinositol (4,5) bisphosphate 5-phosphatase, A
AGI_HUM1_OLIGO_A_23_P91669






5-phosphatase, A
|| AK092859 || 2q11.2-q13.2





71
Hs.370549
B-cell CLL/lymphoma 11A (zinc finger protein)
B-cell CLL/lymphoma 11A (zinc finger protein)
AGI_HUM1_OLIGO_A_23_P218584







|| NM_022893 || 2p16.1





72
Hs.96055
E2F transcription factor 1
E2F transcription factor 1 || BC050369 || 20q11.2
AGI_HUM1_OLIGO_A_23_P80032




73
Hs.505469
Rac GTPase activating protein 1
Rac GTPase activating protein 1 || NM_13277 || 12q13.12
AGI_HUM1_OLIGO_A_23_P65110




74
Hs.436187
Thyroid hormone receptor interactor 13
Thyroid hormone receptor interactor 13 || NM_004237 ||
AGI_HUM1_OLIGO_A_23_P167607







5p15.33





75
Hs.5199
Ubiquitin-conjugating enzyme E2T (putative)
Ubiquitin-conjugating enzyme E2T (putative)
AGI_HUM1_OLIGO_A_23_P115482







|| BF690859 || 1q32.1





76
Hs.529181
Calpain 13
Calpain 13 || BX647678 ||
AGI_HUM1_OLIGO_A_23_P101972




77
Hs.49433
Peroxisomal acyl-CoA thioesterase 28
Peroxisomal acyl-CoA thioesterase 28 || AK055797 ||
AGI_HUM1_OLIGO_A_23_P14515







14q24.3





78
Hs.459362
Protein regulator of cytokineses 1
Protein regulator of cytokineses 1 || NM003981 || 15q26.1
AGI_HUM1_OLIGO_A_23_P206059




79
Hs.485158
SAM pointed domain containing ets
SAM pointed domain containing ets transcription factor
AGI_HUM1_OLIGO_A_23_P111194






transcription factor
|| BC021299 || 6p21.3





80
Hs.262811
Maba1
Maba1 || AB037745 || 1p13.3
AGI_HUM1_OLIGO_A_23_P15392




81
Hs.213424
Secreted frizzled-related protein 1
Secreted frizzled-related protein 1 || BC036503 ||
AGI_HUM1_OLIGO_A_23_P10121







8p12-p11.1





82
Hs.364544
Tetraspanin 13
Tetraspanin 13 || AK128509 || 7p21.1
AGI_HUM1_OLIGO_A_23_P168610




83
Hs.533185
MAD2 mitotic arrest deficient-like 1 (yeast)
MAD2 mitotic arrest deficient-like 1 (yeast) || BQ215664 ||
AGI_HUM1_OLIGO_A_23_P92441







4q27





84
Hs.153704
NIMA (never in mitosis gene a)-related kinase 2
NIMA (never in mitosis gene a)-related kinase 2
AGI_HUM1_OLIGO_A_23_P35219







|| BC043502 || 1q32.2-q41





85
Hs.105547
Neural proliferation, differentiation and control, 1
Neural proliferation, differentiation and control, 1
AGI_HUM1_OLIGO_A_23_P146565







|| AK054950 || 9q34.3





86
Hs.489353
G-protein signalling modulator 2 (AGS3-like,
G-protein signalling modulator 2 (AGS3-like, C.elegans)
AGI_HUM1_OLIGO_A_23_P63402







C.
elegans)

|| NM_013296 || 1p13.3





87
Hs.77695
Discs, large homolog 7 (Drosophila)
Discs, large homolog 7 (Drosophila) || NM_014750 ||
AGI_HUM1_OLIGO_A_23_P88331







14q22.3





88
Hs.529285
Solute carrier family 40 (iron-regulated
Solute carrier family 40 (iron-regulated transporter),
AGI_HUM1_OLIGO_A_23_P102391






transporter), member 1
member 1 || NM_014585 || 2q32





89
Hs.49760
Origin recognition complex, subunit 6
Origin recognition complex, subunit 6 homolog-like
AGI_HUM1_OLIGO_A_23_P100344






homolog-like (yeast)
(yeast) || NM_014321 || 16q12





90
Hs.498248
Exonuclease 1
Exonuclease 1 || NM_130398 || 1q42-q43
AGI_HUM1_OLIGO_A_23_P23303




91
Hs.73625
Kinesin family member 20A
Kinesin family member 20A || AK025790 || 5q31
AGI_HUM1_OLIGO_A_23_P256956




92
Hs.165904
Epsin 3
Epsin 3 || AK000785 || 17q21.33
AGI_HUM1_OLIGO_A_23_P130027




93
Hs.350966
Pituitary tumor-transforming 1
Pituitary tumor-transforming 1 || BQ278502 || 5q35.1
AGI_HUM1_OLIGO_A_23_P60024




94
Hs.199487
RAS-like, estrogen-regulated, growth inhibitor
RAS-like, estrogen-regulated, growth inhibitor
AGI_HUM1_OLIGO_A_23_P204296
AGI_HUM1_OLIGO_






|| BC007997 || 12p12.3

A_23_P7636



95
Hs.351344
Transmembrane protein 25
Transmembrane protein 25 || AK124814 || 11q23.3
AGI_HUM1_OLIGO_A_23_P203115




96
Hs.487296
Phosphoglycerate dehydrogenase
Phosphoglycerate dehydrogenase || AK093306 || 1p12
AGI_HUM1_OLIGO_A_23_P85780




97
Hs.396783
Solute carrier family 9 (sodium/hydrogen
Solute carrier family 9 (sodium/hydrogen exchanger),
AGI_HUM1_OLIGO_A_23_P152593






exchanger), isoform 3 regulator 1
isoform 3 regulator 1 || BX648303 || 17q25





98
Hs.404323
Family with sequence similarity 64, member A
Family with sequence similarity 64, member A
AGI_HUM1_OLIGO_A_23_P49876







|| CR590914 || 17p13.2





99
Hs.269109
Sema domain, immunoglobulin domain (Ig),
Sema domain, immunoglobulin domain (Ig), short basic
AGI_HUM1_OLIGO_A_23_P258473






short basic domain, secreted, (semaphorin) 3C
domain, secreted, (semaphorin) 3C || N  text missing or illegible when filed





100
Hs.234545
Cell division cycle associated 1
Cell division cycle associated 1 || NM_145697 || 1q23.3
AGI_HUM1_OLIGO_A_23_P74349




101
Hs.400556
Breast carcinoma amplified sequence 1
Breast carcinoma amplified sequence 1 || CR749643 ||
AGI_HUM1_OLIGO_A_23_P17420







20q13.2-q13.3





102
Hs.446438
G protein-coupled receptor, family C, group 5,
G protein-coupled receptor, family C, group 5, member C
AGI_HUM1_OLIGO_A_23_P38167






member C
|| AK131210 || 17q25





103
Hs.516727
RNA-binding protein
RNA-binding protein || BC071585 || 4p13-p12
AGI_HUM1_OLIGO_A_23_P132910




104
Hs.501309
Cold inducible RNA binding protein
Cold inducible RNA binding protein || AK095781 ||
AGI_HUM1_OLIGO_A_23_P142322







19p13.3





105
Hs.21028
Asp (abnormal spindle)-like microcephaly
Asp (abnormal spindle)-like microcephaly associated
AGI_HUM1_OLIGO_A_23_P52017






associated (Drosphila)
(Drosphila) || AY367055 || 1q31





106
Hs.421956
Spindle pole body component 25 homolog
Spindle pole body component 25 homolog (S.cerevisiae)
AGI_HUM1_OLIGO_A_23_P51085






(S.cerevisiae)
|| BC022255 || 2q24.3





107
Hs.155017
Nuclear receptor interacting protein 1
Nuclear receptor interacting protein 1 || NM_003469 ||
AGI_HUM1_OLIGO_A_23_P211007







21q11.2





108
Hs.18268
Adenylate kinase 5
Adenylate kinase 5 || NM_012093 || 1p31
AGI_HUM1_OLIGO_A_23_P200015




109
Hs.436912
Kinesin family member C1
Kinesin family member C1 || XM_371813 || 6p21.3
AGI_HUM1_OLIGO_A_23_P133954




110
Hs.226307
Apolipoprotein B mRNA editing enzyme,
Apolipoprotein B mRNA editing enzyme, catalytic
AGI_HUM1_OLIGO_A_23_P109539






catalytic polypeptide-like 38
polypeptide-like 38 || AK024854 || 22q13.1-q1  text missing or illegible when filed





111
Hs.469198
Ring finger protein 103
Ring finger protein 103 || NM_005667 || 2p11.2
AGI_HUM1_OLIGO_A_23_P56709




112
Hs.13291
Cyclin G2
Cyclin G2 || AK092638 || 4q21.1
AGI_HUM1_OLIGO_A_23_P110122




113
Hs.444637
Low density lipoprotein receptor-related
Low density lipoprotein receptor-related protein 8,
AGI_HUM1_OLIGO_A_23_P200222






protein 8, apolipoprotein e receptor
apolipoprotein e receptor || NM_004631 || 1p3





114
Hs.1892
Phenylethanolamine N-methyltransferase
Phenylethanolamine N-methyltransferase || NM_002686 ||
AGI_HUM1_OLIGO_A_23_P100642







17q21-q22





115
Hs.534367
Frizzled homolog 9 (Drosphila)
Frizzled homolog 9 (Drosphila) || BC026333 || 7q11.23
AGI_HUM1_OLIGO_A_23_P68610




116
Hs.244580
TPX2, microtubule-associated protein homolog
TPX2, microtubule-associated protein homolog
AGI_HUM1_OLIGO_A_23_P






(Xenopuslaevis)
(Xenopuslaevis) || NM_012112 || 20q11.2





117
Hs.1058837
Similar to common salivary prolein 1
Similar to common salivary prolein 1 || BU558247 ||
AGI_HUM1_OLIGO_A_23_P118203







16p19.3





118
Hs.254414
Serine-arginine repressor protein (35 kDa)
Serine-arginine repressor protein (35 kDa)
AGI_HUM1_OLIGO_A_23_P110901







|| AK027365 || 6q15





119
Hs.479220
Prominin 1
Prominin 1 || AF117225 || 4p15.32
AGI_HUM1_OLIGO_A_23_P258462




120
Hs.518055
Leucin-rich repeals and Immunoglobulin-like
Leucin-rich repeals and Immunoglobulin-like domains 1
AGI_HUM1_OLIGO_A_23_P109636






domains 1
|| BC071561 || 3p14





121
Hs.129591
Zinc finger protein 552
Zinc finger protein 552 || AK023769 || 19q13.43
AGI_HUM1_OLIGO_A_23_P38824




122
Hs.492261
Tumor protein p53 Inducible nuclear protein 1
Tumor protein p53 Inducible nuclear protein 1
AGI_HUM1_OLIGO_A_23_P168882







|| AK125880 || 8q22





123
Hs.473595
Chloride intracellular channel 6
Chloride intracellular channel 6 || AF4483439 || 21q22.12
AGI_HUM1_OLIGO_A_23_P132088




124
Hs.336768
4-aminobutyrate aminotransferase
4-aminobutyrate aminotransferase aminotransferase
AGI_HUM1_OLIGO_A_23_P141114
AGI_HUM1_OLIGO_






|| NM_020696 || 16p13.2

A_23_P152505



125
Hs.351875
Cytochrome c oxidase subunit Vic
Cytochrome c oxidase subunit Vic || AK128382 || 8q2-q23
AGI_HUM1_OLIGO_A_23_P8900




126
Hs.335139
Potassium channel tetramerisation domain
Potassium channel tetramerisation domain containing 3
AGI_HUM1_OLIGO_A_23_P160406






containing 3
|| NM_016121 || 1q41





127
Hs.10082
Potassium intermediate/small conductance
Potassium intermediate/small conductance calcium-
AGI_HUM1_OLIGO_A_23_P67529






calcium-activated channel, subfamily N,
activated channel, subfamily N, member 4 ||







member 4






128
Hs.75438
Quinoid dihydropteridine reductase
Quinoid dihydropteridine reductase || AK124952 || 4p15.31
AGI_HUM1_OLIGO_A_23_P133049




129
Hs.432638
SRY (sex determining region Y)-box 11
SRY (sex determining region Y)-box 11 || AB028641 ||
AGI_HUM1_OLIGO_A_23_P22378







2p25





130
Hs.283749
Angiogenin, ribonuclease, RNase A family, 5
Angiogenin, ribonuclease, RNase A family, 5
AGI_HUM1_OLIGO_A_23_P205531







|| NM_194430 || 14q11.1





131
Hs.79741
Likely ortholog of mouse dilute suppressor
Likely ortholog of mouse dilute suppressor
AGI_HUM1_OLIGO_A_23_P108948







|| BC082990 || 2q35





132
Hs.473087
CTP synthase
CTP synthase || BC009408 || 1p34.1
AGI_HUM1_OLIGO_A_23_P21706
AGI_HUM1_OLIGO_








A_23_P33103



133
Hs.444082
Enhancer of zeste homoalog 2 (Drosophila)
Enhancer of zeste homoalog 2 (Drosophila) || AK023816 ||
AGI_HUM1_OLIGO_A_23_P259641







7q35-q36





134
Hs.11729
Solute carrier family27 (fatty acid transporter),
Solute carrier family27 (fatty acid transporter), member 2
AGI_HUM1_OLIGO_A_23_P140450






member 2
|| AK223145 || 15q21.2





135
Hs.546241
Complement component 4B
Complement component 4B || BC063289 || 6p21.3
AGI_HUM1_OLIGO_A_23_P42279




136
Hs.56650
Hedgehog acyltransferase
Hedgehog acyltransferase || AK18135 || 1q32
AGI_HUM1_OLIGO_A_23_P136355




137
Hs.95612
Desmocollin 2
Desmocollin 2 || NM_004949 || 18q12.1
AGI_HUM1_OLIGO_A_23_P4494




138
Hs.519057
Neuropeptide Y receptor Y1
Neuropeptide Y receptor Y1 || AB209237 || 4q31.3-q32
AGI_HUM1_OLIGO_A_23_P69699




139
Hs.517860
Chromosome 3 open reading frame 18
Chromosome 3 open reading frame 18 || AK127002 ||
AGI_HUM1_OLIGO_A_23_P155477







3p21.3





140
Hs.239
Forkhead box M1
Forkhead box M1 || NM_202002 || 12p13
AGI_HUM1_OLIGO_A_23_P151150




141
Hs.514033
Sperm associated antigen 5
Sperm associated antigen 5 || NM_006461 || 17q11.2
AGI_HUM1_OLIGO_A_23_P89509




142
Hs.129895
T-box 3 (ulnar mammary syndrome)
T-box 3 (ulnar mammary syndrome) || NM_016569 ||
AGI_HUM1_OLIGO_A_23_P204100







12q24.1





143
Hs.252712
Karyopherin alpha 2 (RAG cohort 1, importin
Karyopherin alpha 2 (RAG cohort 1, importin alpha 1)
AGI_HUM1_OLIGO_A_23_P125265






alpha 1)
|| BC067848 || 17q23.1-q23.3





144
Hs.458304
Ropporin, rhophilin associated protein 1
Ropporin, rhophilin associated protein 1 || AL133624 ||
AGI_HUM1_OLIGO_A_23_P166922







3q21.1





145
Hs.446554
RAD51 homolog (RecA homolog, E.coli)
RAD51 homolog (RecA homolog, E.coli) (S.cerevisiae)
AGI_HUM1_OLIGO_A_23_P88731






(S.cerevisiae)
|| AL833420 || 15q15.1





146
Hs.283532
Uncharacterized bone marrow protein BM039
Uncharacterized bone marrow protein BM039
AGI_HUM1_OLIGO_A_23_P88740







|| AK03669 || 16q23.2





147
Hs.48706
Superoxide dismutase 2, mitochondrial
Superoxide dismutase 2, mitochondrial || AK097395 ||
AGI_HUM1_OLIGO_A_23_P134176







6q25.3





148
Hs.522665
Melanoma antigen family O, 2
Melanoma antigen family O, 2 || AK092463 || Xp11.2
AGI_HUM1_OLIGO_A_23_P33898




149
Hs.514146
Titin-cap (telethonin)
Titin-cap (telethonin) || AK096328 || 17q12
AGI_HUM1_OLIGO_A_23_P107051




150
Hs.23960
Cyclin B1
Cyclin B1 || NM_031966 || 5q12
AGI_HUM1_OLIGO_A_23_P122197




151
Hs.434604
Similar to ovostalin-2
Similar to ovostalin-2 || XM_495907 || 12q13.31
AGI_HUM1_OLIGO_A_23_P25069




152
Hs.390788
Protein kinase, X-linked
Protein kinase, X-linked || NM_005044 || Xp22.3
AGI_HUM1_OLIGO_A_23_P217339




153
Hs.33102
Transcription factor AP-2 beta (activating
Transcription factor AP-2 beta (activating enhancer
AGI_HUM1_OLIGO_A_23_P145104






enhancer binding protein 2 beta)
binding protein 2 beta) || NM_003221 || 5p21





154
Hs.473583
Nuclease sensitive element binding protein 1
Nuclease sensitive element binding protein 1 || BF525416 ||
AGI_HUM1_OLIGO_A_23_P34766







1p34





155
Hs.444767
Kinesin family member 13B
Kinesin family member 13B || NM_015254 || 8p12
AGI_HUM1_OLIGO_A_23_P147388
AGI_HUM1_OLIGO_








A_23_P95441



156
Hs.514211
Hypothetical protein MGC4251
Hypothetical protein MGC4251 || BM_542308 || 17q21.31
AGI_HUM1_OLIGO_A_23_P15516




157
Hs.104741
PDZ binding kinase
PDZ binding kinase || NM_018492 || 8p21.2
AGI_HUM1_OLIGO_A_23_P82699




158
Hs.518997
Hypothetical protein FLJ10901
Hypothetical protein FLJ10901 || AK001763 || 1q32.1
AGI_HUM1_OLIGO_A_23_P1043




159
Hs.118552
Heat shock 70 kDa protein 5 (glucose-regulated
Heat shock 70 kDa protein 5 (glucose-regulated protein,
AGI_HUM1_OLIGO_A_23_P24716






protein, 78 kDa) binding protein 1
78 kDa) binding protein 1 || NM_017870 |





160
Hs.2025
Transforming growth factor, beta 3
Transforming growth factor, beta 3 || AK122902 || 14q24
AGI_HUM1_OLIGO_A_23_P88404




161
Hs.2006
Glutathione S-transferase M3 (brain)
Glutathione S-transferase M3 (brain) || NM_000849 ||
AGI_HUM1_OLIGO_A_23_P12343







1p13.3





162
Hs.368072
Progesterone receptor
Progesterone receptor || NM_000926 || 11q22-q23
AGI_HUM1_OLIGO_A_23_P138938




163
Hs.413111
Phospholipase C, gamma 2
Phospholipase C, gamma 2 (phosphatidylinositol-specific)
AGI_HUM1_OLIGO_A_23_P106675






(phosphatidylinositol-specific)
|| AB208914 || 16q24.1





164
Hs.480837
Inositol polyphosphate-4-phosphatase, type II,
Inositol polyphosphate-4-phosphatase, type II, 105 kDa
AGI_HUM1_OLIGO_A_23_P18559






105 kDa
|| BX649890 || 4q31.21





165
Hs.270845
Kinesin family member 23
Kinesin family member 23 || NM_138555 || 15q23
AGI_HUM1_OLIGO_A_23_P48835




166
Hs.215766
GTP binding protein 4
GTP binding protein 4 || NM_012341|| 10p15-p14
AGI_HUM1_OLIGO_A_23_P12874




167
Hs.209983
Stathmin 1/oncoprotein 18
Stathmin 1/oncoprotein 18 || BX647885 || 1p36.1-p35
AGI_HUM1_OLIGO_A_23_P200866




168
Hs.162807
Trefoil factor 1 (breast cancer, estrogen-
Trefoil factor 1 (breast cancer, estrogen-Inducible sequence
AGI_HUM1_OLIGO_A_23_P68759






Inducible sequence expressed in)
expressed in) || BM923753 || 21q22





169
Hs.532803
Hematological and neurological expressed 1
Hematological and neurological expressed 1 || BC039343 ||
AGI_HUM1_OLIGO_A_23_P100632







17q25.1





170
Hs.233160
Stanniocalcin 2
Stanniocalcin 2 || NM_003714 || 5q35.1
AGI_HUM1_OLIGO_A_23_P110685




171
Hs.415098
DEP domain containing 1
DEP domain containing 1 || BC065304 || 1p31.2
AGI_HUM1_OLIGO_A_23_P200310




172
Hs.169348
Bloom syndrome
Bloom syndrome || BC034480 || 15q26.1
AGI_HUM1_OLIGO_A_23_P88630




173
Hs.515122
Thymidine kinase 1, soluble
Thymidine kinase 1, soluble || BF683703 || 17q23.2-q25.3
AGI_HUM1_OLIGO_A_23_P107421




174
Hs.29724
Pleckstrin homology domain containing,
Pleckstrin homology domain containing, family F
AGI_HUM1_OLIGO_A_23_P20275






family F (with FYVE domain) member 2
(with FYVE domain) member 2 || NM_024613|





175
Hs.49143
MKL/myocardin-like2
MKL/myocardin-like2 || NM_014048 || 16p13.12
AGI_HUM1_OLIGO_A_23_P54556




176
Hs.334562
Cell division cycle 2, G1 to S and G2 to M
Cell division cycle 2, G1 to S and G2 to M
AGI_HUM1_OLIGO_A_23_P138507







|| CR933728 || 10q21.1





177
Hs.511755
Pituitary tumor-transforming 2
Pituitary tumor-transforming 2 || AF095288 || 4p12
AGI_HUM1_OLIGO_A_23_P18579




178
Hs.212088
Epoxide hydrolase 2, cytoplasmic
Epoxide hydrolase 2, cytoplasmic || NM_001979 || 8p21-p12
AGI_HUM1_OLIGO_A_23_P8834




179
Hs.25318
RAB27B, member RAS oncogene family
RAB27B, member RAS oncogene family || AF131784 ||
AGI_HUM1_OLIGO_A_23_P107611







18q21.2





180
Hs.462998
Insulin-like growth factor binding protein 4
Insulin-like growth factor binding protein 4 || NM_001552 ||
AGI_HUM1_OLIGO_A_23_P38574







17q12-q21.1





181
Hs.268728
Tweety homolog 1 (Drosophila)
Tweety homolog 1 (Drosophila) || AK126690 || 19q13.4
AGI_HUM1_OLIGO_A_23_P50815




182
Hs.258326
B/K protein
B/K protein || NM_016524 || 16p12.3
AGI_HUM1_OLIGO_A_23_P163697




183
Hs.525419
Epithelial protein lost in neoplasm beta
Epithelial protein lost in neoplasm beta || BX647194 || 12q13
AGI_HUM1_OLIGO_A_23_P151267




184
Hs.122908
DNA replication factor
DNA replication factor || A8053172 || 16q24.3
AGI_HUM1_OLIGO_A_23_P37704




185
Hs.12970
Proteasome (prosome, micropain) 26S
Proteasome (prosome, micropain) 26S subunit, non-ATPase,
AGI_HUM1_OLIGO_A_23_P26785






subunit, non-ATPase, 3
3 || D67025 || 17q21.1





186
Hs.417962
Dual specificity phosphatase 4
Dual specificity phosphatase 4 || NM_057158 || 8p12-p11
AGI_HUM1_OLIGO_A_23_P134935




187
Hs.211511
Hypothetical protein FLJ1127
Hypothetical protein FLJ1127 || AK128417 || 12q24.11
AGI_HUM1_OLIGO_A_23_P76402




188
Hs.444959
Acyl-Coenzyme A oxidase 2, branched chain
Acyl-Coenzyme A oxidase 2, branched chain || BC033517 ||
AGI_HUM1_OLIGO_A_23_P10182







3p14.3





189
Hs.323583
Hypothetical protein DKFZp434L142
Hypothetical protein DKFZp434L142 || NM_016613 ||
AGI_HUM1_OLIGO_A_23_P218928







4q32.1





190
Hs.119192
H2A histone family, member Z
H2A histone family, member Z || AK056803 || 4q24
AGI_HUM1_OLIGO_A_23_P133147




191
Hs.95243
Transcription elongation factor A (Sll)-like 1
Transcription elongation factor A (Sll)-like 1 || BM690957 ||
AGI_HUM1_OLIGO_A_23_P73901







Xq22.1





192
Hs.190440
ADP-ribosylation factor-like 6 interacting
ADP-ribosylation factor-like 6 interacting protein 2
AGI_HUM1_OLIGO_A_23_P209619






protein 2
|| AK026946 || 2p22.2-p22.1





193
Hs.444915
Solute carrier family 1 (neuronal/epithelial high
Solute carrier family 1 (neuronal/epithelial high affinity
AGI_HUM1_OLIGO_A_23_P216468






affinity glutamate transporter, system Xag), mem
glutamate transporter, system Xag), mem





194
Hs-326391
Phytanoyl-CoA dioxygenase domain
Phytanoyl-CoA dioxygenase domain containing 1
AGI_HUM1_OLIGO_A_23_P71997






containing 1
|| AK095000 || 9q34.11





195
Hs.443861
SFRS protein kinase 1
SFRS protein kinase 1 || AJ318054 || 6p21.3-p21.2
AGI_HUM1_OLIGO_A_23_P19543




196
Hs.159142
Lunatic fringe homolog (Drosophila)
Lunatic fringe homolog (Drosophila) || AK096284 || 7p22
AGI_HUM1_OLIGO_A_23_P8452




197
Hs.272416
SID1 transmembrane family, member 1
SID1 transmembrane family, member 1 || AK000181 ||
AGI_HUM1_OLIGO_A_23_P132515







3q13.2





198
Hs.374613
Inositol 1,4,5-triphosphate receptor, type 1
Inositol 1,4,5-triphosphate receptor, type 1 || D26070 ||
AGI_HUM1_OLIGO_A_23_P92042







3p26-p25





199
Hs.129452
Dachshund homolog 1 (Drosophila)
Dachshund homolog 1 (Drosophila) || NM_080759 || 13q22
AGI_HUM1_OLIGO_A_23_P205134




200
Hs.489207
Asparagine synthetase
Asparagine synthetase || NM_133436 || 7q21.3
AGI_HUM1_OLIGO_A_23_P145694




201
Hs.270833
Amphiregulin (schwannoma-derived growth
Amphiregulin (schwannoma-derived growth factor)
AGI_HUM1_OLIGO_A_23_P259071






factor)
|| AK023449 || 4q13-q21





202
Hs.446680
Retinoic acid induced 2
Retinoic acid induced 2 || BC07937 || Xp22
AGI_HUM1_OLIGO_A_23_P254165




203
Hs.190284
Smith-Magenis syndrome chromosome region,
Smith-Magenis syndrome chromosome region, candidate 6
AGI_HUM1_OLIGO_A_23_P129766






candidate 6
|| AB209609 || 17p11.2





204
Hs.404741
Nuclear factor (erythroid-derived 2)-like 3
Nuclear factor (erythroid-derived 2)-like 3 || NM_004289 ||
AGI_HUM1_OLIGO_A_23_P42718







7p15-p14





205
Hs.444118
MCM6 minichromosome maintenance deficient
MCM6 minichromosome maintenance deficient 6 (MIS5
AGI_HUM1_OLIGO_A_23_P90612






6 (MIS5 homolog, S.pombe) (S.cerevisiae)
homolog, S.pombe) (S.cerevisiae) || N





206
Hs.527295
Ectonucleotide pyrophosphatase/
Ectonucleotide pyrophosphatase/phosphodiesterase 1
AGI_HUM1_OLIGO_A_23_P156880






phosphodiesterase 1
|| NM_006208 || 9q22-q23





207
Hs.96843
Dynein, cytoplasmic, light polypeptide 2B
Dynein, cytoplasmic, light polypeptide 2B || BC035235 ||
AGI_HUM1_OLIGO_A_23_P94840







16q23.3





208
Hs.525198
TAL1 (SCL) interrupting locus
TAL1 (SCL) interrupting locus || NM_003035 || 1q32
AGI_HUM1_OLIGO_A_23_P51966




209
Hs.471508
Insulin receptor substrate 1
Insulin receptor substrate 1|| NM_005544 || 2q36
AGI_HUM1_OLIGO_A_23_P90649




210
Hs.435458
SET binding protein 1
SET binding protein 1 || BX640904 || 18q21.1
AGI_HUM1_OLIGO_A_23_P4551




211
Hs.15929
Chromosome 6 open reading frame 211
Chromosome 6 open reading frame 211 || AK022972 ||
AGI_HUM1_OLIGO_A_23_P254472







6q25.1





212
Hs.525796
Chromosome 15 open reading frame 23
Chromosome 15 open reading frame 23 || CR602848 ||
AGI_HUM1_OLIGO_A_23_P140705







15q15.1





213
Hs.514840
Chitinase 3-like 2
Chitinase 3-like 2 || U58515 || 1p13.3
AGI_HUM1_OLIGO_A_23_P12082




214
Hs.481307
MLF1 Interacting protein
MLF1 Interacting protein || AF516710 || 4q35.1
AGI_HUM1_OLIGO_A_23_P254733




215
Hs.408767
Crystallin, alpha B
Crystallin, alpha B || BU734674 || 11q22.3-q23.1
AGI_HUM1_OLIGO_A_23_P75589




216
Hs.8878
Kinesin family member 11
Kinesin family member 11 || NM_004523 || 10q24.1
AGI_HUM1_OLIGO_A_23_P52278




217
Hs.201083
Mal, T-cell differentiation protein 2
Mal, T-cell differentiation protein 2 || AY007723 ||
AGI_HUM1_OLIGO_A_23_P60130




218
Hs.507669
Hypothetical protein CG003
Hypothetical protein CG003 || U50534 || 13q13.1
AGI_HUM1_OLIGO_A_23_P105862




219
Hs.118722
Fucosyltransferase 8 (alpha (1.6)
Fucosyltransferase 8 (alpha (1.6) fucosyltransferase)
AGI_HUM1_OLIGO_A_23_P14432






fucosyltransferase)
|| AJ536056 || 14q24.3





220
Hs.388255
DC13 protein
DC13 protein || AK123993 || 16q23.2
AGI_HUM1_OLIGO_A_23_P106544




221
Hs.491148
Pericentriolar material 1
Pericentriolar material 1 || NM_006197 || 8p22-p21.3
AGI_HUM1_OLIGO_A_23_P82950




222
Hs.36972
Tetraspanin 1
Tetraspanin 1 || BQ216899 || 1p34.1
AGI_HUM1_OLIGO_A_23_P160167




223
Hs.494261
Phosphoserine aminotransferase 1
Phosphoserine aminotransferase 1 || NM_058179 || 9q21.2
AGI_HUM1_OLIGO_A_23_P259692




224
Hs.465413
Cytochrome b-5
Cytochrome b-5 || AB209617 || 18q23
AGI_HUM1_OLIGO_A_23_P101208




225
Hs.121536
Family with sequence similarity 54, member A
Family with sequence similarity 54, member A
AGI_HUM1_OLIGO_A_23_P253752







|| AK125758 || 6q23.3





226
Hs.369762
Thymidylate synthetase
Thymidylate synthetase || BQ058428 || 18p11.32
AGI_HUM1_OLIGO_A_23_P50096




227
Hs.368250
Hypothetical proein MGC11242
Hypothetical proein MGC11242 || BC002865 || 17q21.32
AGI_HUM1_OLIGO_A_23_P118894




228
Hs.19114
High-mobility group box 3
High-mobility group box 3 || BX537505 || Xq28
AGI_HUM1_OLIGO_A_23_P217236




229
Hs.3041
Uracil-DNA glycosylase 2
Uracil-DNA glycosylase 2 || BC004877 || 5p15.2-p13.1
AGI_HUM1_OLIGO_A_23_P92860




230
Hs.15250
Peroxisomal D3,D2-enoyl-CoA Isomerase
Peroxisomal D3,D2-enoyl-CoA Isomerase || AB209917 ||
AGI_HUM1_OLIGO_A_23_P156852







6p24.3





231
Hs.191842
Cadherin 3, type 1, P-cadherin (placental)
Cadherin 3, type 1, P-cadherin (placental) BC041846 ||
AGI_HUM1_OLIGO_A_23_P49155







16q22.1





232
Hs.491767
V-yes-1 Yamaguchi sarcoma viral related
V-yes-1 Yamaguchi sarcoma viral related oncogene
AGI_HUM1_OLIGO_A_23_P147431






oncogene homolog
homolog BC059394 || 8q13





233
Hs.461086
Cadherin 1, type 1, E-cadherin (epithelial)
Cadherin 1, type 1, E-cadherin (epithelial) || NM_004360 ||
AGI_HUM1_OLIGO_A_23_P206359







16q22.1





234
Hs.368884
Chromosome 2 open reading frame 23
Chromosome 2 open reading frame 23 || AK023172 ||
AGI_HUM1_OLIGO_A_23_P66285







2p11.2





235
Hs.89603
Mucin 1, transmembrane
Mucin 1, transmembrane || J05581 || 1q21
AGI_HUM1_OLIGO_A_23_P137856




236
Hs.473420
BTG family, member 3
BTG family, member 3 || BU730087 || 21q21.1-q21.2
AGI_HUM1_OLIGO_A_23_P80068




237
Hs.533782
Keratin 8
Keratin 8 || CR607281 || 12q13
AGI_HUM1_OLIGO_A_23_P14072




238
Hs.518448
Lysosomal-associated membrane protein 3
Lysosomal-associated membrane protein 3 || BC032940 ||
AGI_HUM1_OLIGO_A_23_P29773







3q26.3-q27





239
Hs.507230
Kelch/ankyrin repeal containing cyclin A1
Kelch/ankyrin repeal containing cyclin A1 interacting
AGI_HUM1_OLIGO_A_23_P66100






interacting protein
protein || BC032482 || 1q23.3





240
Hs.445898
V-myb myeloblastosis viral oncogene
V-myb myeloblastosis viral oncogene homolog
AGI_HUM1_OLIGO_A_23_P43157






homolog (avian)-like 1
(avian)-like 1 || XM_034274 || 8q22





241
Hs.486354
Protein kinase (cAMP-dependent, catalytic)
Protein kinase (cAMP-dependent, catalytic) Inhibitor beta
AGI_HUM1_OLIGO_A_23_P145529






Inhibitor beta
|| CR749456 || 6q22.31





242
Hs.511093
Nucleolar and spindle associated protein 1
Nucleolar and spindle associated protein 1 || AK222819 ||
AGI_HUM1_OLIGO_A_23_P206183







15q15.1





243
Hs.69089
Galactosidase, alpha
Galactosidase, alpha || NM_000169 || Xq22
AGI_HUM1_OLIGO_A_23_P45475




244
Hs.523968
Tumor protein p53 binding protein, 2
Tumor protein p53 binding protein, 2 || NM_005426 ||
AGI_HUM1_OLIGO_A_23_P12523







1q42.1





245
Hs.546382
Transcription factor CP2-like 2
Transcription factor CP2-like 2 || BC063299 || 2p25.1
AGI_HUM1_OLIGO_A_23_P5882




246
Hs.487200
SPARC related modular calcium binding 2
SPARC related modular calcium binding 2 || AL832303 ||
AGI_HUM1_OLIGO_A_23_P70307







6q27





247
Hs.519601
Inhibitor of DNA binding 4, dominant
Inhibitor of DNA binding 4, dominant negative helix-loop-
AGI_HUM1_OLIGO_A_23_P59375






negative helix-loop-helix protein
helix protein || BM701438 || 6p22-p21





248
Hs.435711
Tripartite motif-containing 2
Tripartite motif-containing 2 || AF220016 || 4q31.3
AGI_HUM1_OLIGO_A_23_P213141




249
Hs.79361
Kallikrein 6 (neurosin, zyme)
Kallikrein 6 (neurosin, zyme) || NM_002774 || 19q13.3
AGI_HUM1_OLIGO_A_23_P142090




250
Hs.534313
Early growth response 3
Early growth response 3 || NM_00430 || 8p23-p21
AGI_HUM1_OLIGO_A_23_P216225




251
Hs.370834
ATPase family, AAA domain containing 2
ATPase family, AAA domain containing 2 || CR749832 ||
AGI_HUM1_OLIGO_A_23_P215068







8q24.13





252
Hs.3346
Family with sequence similarity 63, member A
Family with sequence similarity 63, member A
AGI_HUM1_OLIGO_A_23_P160546







|| AB037811 || 1q21.2





253
Hs.432448
Keratin 16 (focal non-epidermolytic
Keratin 16 (focal non-epidermolytic palmoplantar
AGI_HUM1_OLIGO_A_23_P38537






palmoplantar keratoderma)
keratoderma) BC039169 || 17q12-q21





254
Hs.405619
Phosphoribosyl transferase domain, containing 1
Phosphoribosyl transferase domain, containing 1
AGI_HUM1_OLIGO_A_23_P202004







NM_020200 || 10p12.1





255
Hs.145932
Metallothionein-like 5, testis-specific (tesmin)
Metallothionein-like 5, testis-specific (tesmin)
AGI_HUM1_OLIGO_A_23_P161507







|| AK128303 || 11q-13.2-q13.3





256
Hs.299654
Dual specificity phosphatase 6
Dual specificity phosphatase 6 || BC037236 || 12q22-q23
AGI_HUM1_OLIGO_A_23_P139704




257
Hs.519162
BTG family, member 2
BTG family, member 2 || NM_006763 || q132
AGI_HUM1_OLIGO_A_23_P62901




258
Hs.81934
Acyl-Coenzyme A dehydrogenase,
Acyl-Coenzyme A dehydrogenase, short/branched chain
AGI_HUM1_OLIGO_A_23_P158570






short/branched chain
|| NM_001609 || 10q25-q25





259
Hs.27695
Midline 1 (Opitz/BBB syndrome)
Midline 1 (Opitz/BBB syndrome) || AF041210 || Xp22
AGI_HUM1_OLIGO_A_23_P10031
AGI_HUM1_OLIGO_
AGI_HUM1_OLIGO_







A_23_P170037
A_23_P3283


260
Hs.406515
NAD(P)H dehydrogenase, quinone 1
NAD(P)H dehydrogenase, quinone 1 || NM_000903 ||
AGI_HUM1_OLIGO_A_23_P206662







16q22.1





261
Hs.87247
Harkiri, BCL2 interacting protein
Harkiri, BCL2 interacting protein (contains only BH3
AGI_HUM1_OLIGO_A_23_P25194






(contains only BH3 domain)
domain) || D83699 || 12q24.22





262
Hs.546434
V-set domain containing T cell activation
V-set domain containing T cell activation inhibitor 1
AGI_HUM1_OLIGO_A_23_P518






inhibitor 1
|| BX648021 || 1p13.1





263
Hs.9029
Keratin 23 (histone deacetylase inducible)
Keratin 23 (histone deacetylase inducible) || NM_015515 ||
AGI_HUM1_OLIGO_A_23_P78248







17q21.2





264
Hs.71465
Squalene epoxidase
Squalene epoxidase || NM_003129 || 6q24.1
AGI_HUM1_OLIGO_A_23_P146284




265
Hs.253970
Aldehyde dehydrogenase 6 family, member A1
Aldehyde dehydrogenase 6 family, member A1
AGI_HUM1_OLIGO_A_23_P128967







|| NM_005589 || 14q24.3





266
Hs.523836
Glutathione S-transferase p1
Glutathione S-transferase p1 || BM926728 || 11q13
AGI_HUM1_OLIGO_A_23_P202653




267
Hs.306777
Gasdermin-like
Gasdermin-like || BX647700 || 17q21.1
AGI_HUM1_OLIGO_A_23_P66454




268
Hs.520942
Claudin 4
Claudin 4 || AK126462 || 7q11.23
AGI_HUM1_OLIGO_A_23_P19944




269
Hs.111732
Fc fragment of IgG binding protein
Fc fragment of IgG binding protein || NM_003890 ||
AGI_HUM1_OLIGO_A_23_P21495
AGI_HUM1_OLIGO_






19q13.1

A_23_P32895



270
Hs.149397
Myosin VI
Myosin VI || NM_004999 || 6q13
AGI_HUM1_OLIGO_A_23_P255952




271
Hs.87752
Moesin
Moesin || NM_002444 || Xq11.2-q12
AGI_HUM1_OLIGO_A_23_P73589




272
Hs.85137
Cyclin A2
Cyclin A2 || CR604810 || 4q25-q31
AGI_HUM1_OLIGO_A_23_P58321




273
Hs.511978
Huntington interacting protein K
Huntington interacting protein K || AF370428 || 15q15.3
AGI_HUM1_OLIGO_A_23_P117683




274
Hs.503709
Pro-oncosis receptor inducin membrane
Pro-oncosis receptor inducin membrane injury gene
AGI_HUM1_OLIGO_A_23_P202964






injury gene
|| AK075420 || 11q22.1





275
Hs.332847
Cysteine-rich motor neuron 1
Cysteine-rich motor neuron 1|| AF167706 || 2p21
AGI_HUM1_OLIGO_A_23_P51105




276
Hs.197922
Calcium/calmodulin-dependent protein
Calcium/calmodulin-dependent protein kinase II
AGI_HUM1_OLIGO_A_23_P11800






kinase II
|| CR604926 || 1p36.12





277
Hs.188606
START domain containing 10
START domain containing 10 || AB209473 || 11q13
AGI_HUM1_OLIGO_A_23_P36345




278
Hs.6147
Tensin like C1 domain containing phosphatase
Tensin like C1 domain containing phosphatase
AGI_HUM1_OLIGO_A_23_P151297







|| CR936725|| 12q-13.13





279
Hs.367992
Inositol(myo)-1(or 4)-monophosphatase 2
Inositol(myo)-1(or 4)-monophosphatase 2 || BM924855 ||
AGI_HUM1_OLIGO_A_23_P50081







18p11.2





280
Hs.503134
7-dehydrocholesterol reductase
7-dehydrocholesterol reductase || BC000054 ||
AGI_HUM1_OLIGO_A_23_P24444







11q13.2-q13.5





281
Hs.458360
Uridine-cytidine kinase 2
Uridine-cytidine kinase 2 || BX640859 || 1q23
AGI_HUM1_OLIGO_A_23_P487




282
Hs.491582
Plasminogen activator, tissue
Plasminogen activator, tissue || BX641021 || 8p12
AGI_HUM1_OLIGO_A_23_P82858




283
Hs.439760
Cytochrome P450, family 4, subfamily X,
Cytochrome P450, family 4, subfamily X, polypeptide 1
AGI_HUM1_OLIGO_A_23_P72111






polypeptide 1
|| NM_178033 || 1p33





284
Hs.74120
Chromosome 10 open reading frame 116
Chromosome 10 open reading frame 116 || AL17440 ||
AGI_HUM1_OLIGO_A_23_P161439







10q23.2





285
Hs.58756
Period homolog 2 (Drosophila)
Period homolog 2 (Drosophila) || NM_022817 || 2q37.3
AGI_HUM1_OLIGO_A_23_P209320




286
Hs.372924
CAMP responsive element binding protein
CAMP responsive element binding protein 3-like 4
AGI_HUM1_OLIGO_A_23_P63232






3-like 4
|| AY049977 || 1q21.3





287
Hs.508284
F-box and leucine-rich repeat protein 3
F-box and leucine-rich repeat protein 3 || AL833187 ||
AGI_HUM1_OLIGO_A_23_P140069







13q22





288
Hs.35086
Ubiquitin specific protease 1
Ubiquitin specific protease 1 || BC050525 || 1q32.1-p31.3
AGI_HUM1_OLIGO_A_23_P11652




289
Hs.121520
Amphoterin induced gene 2
Amphoterin induced gene 2 || NM_181847 || 12q13.11
AGI_HUM1_OLIGO_A_23_P14083




290
Hs.173859
Frizzled homolog 7 (Drosophila)
Frizzled homolog 7 (Drosophila) || AB017365 || 2q33
AGI_HUM1_OLIGO_A_23_P209449




291
Hs.440494
Chemokine-like factor super family 7
Chemokine-like factor super family 7 || AK055554 || 3p23
AGI_HUM1_OLIGO_A_23_P256413




292
Hs.6776
Macrophage receptor with collagenous structure
Macrophage receptor with collagenous structure
AGI_HUM1_OLIGO_A_23_P101992







|| BC016004 || 2q12-q13





293
Hs.200227
FYVE and coiled coil domain containing 1
FYVE and coiled coil domain containing 1 || AJ292348 ||
AGI_HUM1_OLIGO_A_23_P212339







3p21.31





294
Hs.28309
UDP-glucose dehydrogenase
UDP-glucose dehydrogenase || AF061016 || 4p15.1
AGI_HUM1_OLIGO_A_23_P167067




295
Hs.438864
FN5 protein
FN5 protein || AK098204 || 11q13.3-q23.3





296
Hs.486143
Biliverdin reductase A
Biliverdin reductase A || BX647539 || 7p14-cen
AGI_HUM1_OLIGO_A_23_P75430




297
Hs.370379
Zinc finger protein 462
Zinc finger protein 462 || NM_021224 || 9q31.2
AGI_HUM1_OLIGO_A_23_P71148




298
Hs.408061
Fatty acid binding protein 5 (psoriasis-associated)
Fatty acid binding protein 5 (psoriasis-associated)
AGI_HUM1_OLIGO_A_23_P60498







|| BG282526 || 8q21.13





299
Hs.444924
CDP- diacylglycerol synthase (phosphatidate
CDP- diacylglycerol synthase (phosphatidate
AGI_HUM1_OLIGO_A_23_P59876






cytidylyltransferase) 1
cytidylyltransferase) 1 || NM_001263 || 4q21.23





300
Hs.505575
UDP-N-acetyl-alpha-D-galactosamine:
UDP-N-acetyl-alpha-D-galactosamine: polypeptide N-
AGI_HUM1_OLIGO_A_23_P7245






polypeptide N-acetylgalactosaminyltransferase 6
acetylgalactosaminyltransferase 6 (GalNac







(GalNac






301
Hs.89625
Parathyroid hormone-like hormone
Parathyroid hormone-like hormone || J03580 || 12p12.1-p11.2
AGI_HUM1_OLIGO_A_23_P204133




302
Hs.518475
Replication factor C (activator 1) 4, 37 kDa
Replication factor C (activator 1) 4, 37 kDa || NM_002916 ||
AGI_HUM1_OLIGO_A_23_P2271







3q27





303
Hs.24529
CHK1 checkpoint homolog (S.pombe)
CHK1 checkpoint homolog (S.pombe) || NM_001274 ||
AGI_HUM1_OLIGO_A_23_P18196







11q24-q24





304
Hs.515258
Growth differentiation factor 15
Growth differentiation factor 15 || BQ883534 || 19p13.1-13.2
AGI_HUM1_OLIGO_A_23_P116123




305
Hs.103755
Receptor-interacting serine-threonine kinase 2
Receptor-interacting serine-threonine kinase 2 || AY358814 ||
AGI_HUM1_OLIGO_A_23_P16523







8q21





306
Hs.390736
Hypothetical protein FLJ20365
Hypothetical protein FLJ20365 || AB195679 || 8q23.2
AGI_HUM1_OLIGO_A_23_P252106




307
Hs.369520
Synaptotagmin-like 2
Synaptotagmin-like 2 || AY386362 || 11q14
AGI_HUM1_OLIGO_A_23_P134734




308
Hs.81131
Guanidinoacetate N-methyltransferase
Guanidinoacetate N-methyltransferase || BM541904 ||
AGI_HUM1_OLIGO_A_23_P531963







19p13.3





309
Hs.199338
GLI-Kruppel family member GLI3 (Greig
GLI-Kruppel family member GLI3 (Greig
AGI_HUM1_OLIGO_A_23_P108143






cephalopolysyndactyly syndrome)
cephalopolysyndactyly syndrome) || M57609 || 7p13
AGI_HUM1_OLIGO_A_23_P111532




310
Hs.56145
Thymosin-like 8
Thymosin-like 8 || BG471140 || Xq.21.33-q22.3
AGI_HUM1_OLIGO_A_23_P137178




311
Hs.374378
CDC28 protein kinase regulatory subunit 1B
CDC28 protein kinase regulatory subunit 1B || BQ278454 ||
AGI_HUM1_OLIGO_A_23_P45917







1q21.2





312
Hs.145209
Ubiquitin protein ligase E3 component
Ubiquitin protein ligase E3 component n-recognin 1
AGI_HUM1_OLIGO_A_23_P152066






n-recognin 1
|| NM_174916 || 15q13





313
Hs.513053
DnaJ (Hsp40) homolog, subfamily A, member 4
DnaJ (Hsp40) homolog, subfamily A, member 4
AGI_HUM1_OLIGO_A_23_P206140







|| NM_018602 || 15q25.1





314
Hs.6739
Signal transducer and activator of transcription 3
Signal transducer and activator of transcription 3 Interacting
AGI_HUM1_OLIGO_A_23_P78438






Interacting protein 1
protein 1 || AK095760 || 18q12.2





315
Hs.508141
Diaphanous homolog 3 (Drosophila)
Diaphanous homolog 3 (Drosophila) || BC034952 || 13q21.2
AGI_HUM1_OLIGO_A_23_P162719




316
Hs.511739
SUMO-1 activating enzyme subunit 2
SUMO-1 activating enzyme subunit 2 || AK124730 || 19q12
AGI_HUM1_OLIGO_A_23_P209020




317
Hs.400662
Collagen, type XIV, alpha 1 (undulin)
Collagen, type XIV, alpha 1 (undulin) || NM_021110 || 8q23
AGI_HUM1_OLIGO_A_23_P216361




318
Hs.487561
Islet cell autoantigen 1, 69 kDa
Islet cell autoantigen 1, 69 kDa || CR605198 || 7p22
AGI_HUM1_OLIGO_A_23_P215418




319
Hs.520973
Heat shock 37 kDa protein 1
Heat shock 37 kDa protein 1 || BM541936 || 7q11.23
AGI_HUM1_OLIGO_A_23_P257704




320
Hs.31034
Peroxisomal biogenesis factor 11A
Peroxisomal biogenesis factor 11A || AL360141 || 15q26.1
AGI_HUM1_OLIGO_A_23_P37560




321
Hs.311609
DEAD (Asp-Glu-Ala-Asp) box polypeptide 39
DEAD (Asp-Glu-Ala-Asp) box polypeptide 39
AGI_HUM1_OLIGO_A_23_P78664







|| CR592759 || 19p13.12





322
Hs.232543
Programmed cell death 4 (neoplastic transformation
Programmed cell death 4 (neoplastic transformation
AGI_HUM1_OLIGO_A_23_P258862






inhibitor)
inhibitor) || BX537500 || 10q24





323
Hs.512599
Cyclin-dependent kinase inhibitor 2A (melanoma,
Cyclin-dependent kinase inhibitor 2A (melanoma, p16,
AGI_HUM1_OLIGO_A_23_P43490






p16, inhibits CDK4)
inhibits CDK4) BM719878 || 9p21





324
Hs.272805
HRAS-like suppressor 2
HRAS-like suppressor 2 || AK025029 || 11q12.3
AGI_HUM1_OLIGO_A_23_P105012




325
Hs.75117
Interleukin enhancer binding factor 2, 45 kDa
Interleukin enhancer binding factor 2, 45 kDa || BG121872 ||
AGI_HUM1_OLIGO_A_23_P257956







1q21.3





326
Hs.55279
Serine (or Cysteine) proteinase inhibitor, clase B
Serine (or Cysteine) proteinase inhibitor, clase B (ovalbumin),
AGI_HUM1_OLIGO_A_23_P208126






(ovalbumin), member 5
member 5 || BX640597 || 16q21.3





327
Hs.377484
BCL2-associated athanogene
BCL2-associated athanogene || BM799512 || 9p12
AGI_HUM1_OLIGO_A_23_P146654




328
Hs.75360
Carboxypeptidase E
Carboxypeptidase E || NM_001873 || 4q32.3
AGI_HUM1_OLIGO_A_23_P259442




329
Hs.83756
CDC28 protein kinase regulatory subunit 2
CDC28 protein kinase regulatory subunit 2 || BQ898949 ||
AGI_HUM1_OLIGO_A_23_P71727







9q22





330
Hs.40403
Cbp/p300-interacting transactivator, with
Cbp/p300-interacting transactivator, with Glu/Asp-rich
AGI_HUM1_OLIGO_A_23_P73517






Glu/Asp-rich carboxy-terminal doman, 1
carboxy-terminal doman, 1|| BM664781 |





331
Hs.10649
Chromosome 1 open readng frame 38
Chromosome 1 open readng frame 38 || AK094833 || 1p35.3
AGI_HUM1_OLIGO_A_23_P873




332
Hs.446192
Contactin associated protein-like 2
Contactin associated protein-like 2 || NM_014141 || q35-q36
AGI_HUM1_OLIGO_A_23_P84399




333
Hs.382202
Chitinase 3-like 1 (cartilage glycoprotein-39)
Chitinase 3-like 1 (cartilage glycoprotein-39) || AB209459 ||
AGI_HUM1_OLIGO_A_23_P137672







q132.1





334
Hs.108029
SH3 doman binding glutamic acid-rich protein like
SH3 doman binding glutamic acid-rich protein like
AGI_HUM1_OLIGO_A_23_P148297







|| AK024892 || Xq13.3





335
Hs.477693
NCK adaptor protein 1
NCK adaptor protein 1 || NM_006153 || 3q21
AGI_HUM1_OLIGO_A_23_P255785




336
Hs.221941
Cytochrome b reductase 1
Cytochrome b reductase 1 || AL136693 || 2q31.1
AGI_HUM1_OLIGO_A_23_P209564




337
Hs.50411
Tripartite motif-containing 29
Tripartite motif-containing 29 || BX648072 || 11q22-q23
AGI_HUM1_OLIGO_A_23_P203267




338
Hs.514470
Solute carrier family 25 (mitochrondrial
Solute carrier family 25 (mitochrondrial deoxynucleotide
AGI_HUM1_OLIGO_A_23_P55036






deoxynucleotide carrier), member 19
carrier), member 19 || AK097882 || 17q21





339
Hs.75238
Chromatin assembly factor 1, subunit B (p60)
Chromatin assembly factor 1, subunit B (p60) || NM_005441 ||
AGI_HUM1_OLIGO_A_23_P57305







21q22.13





340
Hs.6980
Aldo-keto reductase family 7, member A3
Aldo-keto reductase family 7, member A3(aflatoxin aldehyde
AGI_HUM1_OLIGO_A_23_P103968






(aflatoxin aldehyde reductase)
reductase) || NM_012057 || 1p35.1





341
Hs.62771
Hypothetical protein FLJ20186
Hypothetical protein FLJ20186 || NM_207514 || 16q24.3
AGI_HUM1_OLIGO_A_23_P88893




342
Hs.433201
CDK2-associated protein 1
CDK2-associated protein 1 || NM_0046242 || 12q24.31
AGI_HUM1_OLIGO_A_23_P199486




343
Hs.368254
Homogentisate 1,2-dioxygenase (homogentisate
Homogentisate 1,2-dioxygenase (homogentisate oxidase)
AGI_HUM1_OLIGO_A_23_P250164






oxidase)
|| BC071757 || 3q21-q23





344
Hs.32973
Glycne receptor, beta
Glycne receptor, beta || NM_000824 || 4q31.3
AGI_HUM1_OLIGO_A_23_P250164




345
Hs.434255
Pieckstrin and Sec7 domain containing 3
Pieckstrin and Sec7 domain containing 3 || NM_015310 ||
AGI_HUM1_OLIGO_A_23_P213265







8pter-p23.3
AGI_HUM1_OLIGO_A_23_P216167




346
Hs.14623
Interferon, gamma-inducible protein 30
Interferon, gamma-inducible protein 30 || AK123477 ||
AGI_HUM1_OLIGO_A_23_P153745







19p13.1





347
Hs.430324
Annexin A9
Annexin A9 || AJ009985 || 1q21
AGI_HUM1_OLIGO_A_23_P103614




348
Hs.233952
Proteasome (prosome, macropain) subunit,
Proteasome (prosome, macropain) subunit, alpha type 7
AGI_HUM1_OLIGO_A_23_P91464






alpha type 7
|| AK127210 || 20q13.33





349
Hs.44278
RAB17, member RAS oncogene family
RAB17, member RAS oncogene family || BX647412 || 2q37.3
AGI_HUM1_OLIGO_A_23_P5778




350
Hs.522500
KIAA0310
KIAA0310 || XM_083459 || 9q34.3
AGI_HUM1_OLIGO_A_23_P251303




351
Hs.533573
CDC7 cell division cycle 7 (S.cerevisiae)
CDC7 cell division cycle 7 (S.cerevisiae) || AB209337 || 1p22
AGI_HUM1_OLIGO_A_23_P148807




352
Hs.530024
Chromosome 7 open reading frame 24
Chromosome 7 open reading frame 24 || BF570959 ||
AGI_HUM1_OLIGO_A_23_P426895







7p15-p14





353
Hs.520313
CD164 antigen, sialomucin
CD164 antigen, sialomucin || BC040317 || 6q21
AGI_HUM1_OLIGO_A_23_P254756




354
Hs.208912
Chromosome 22 open reading frame 18
Chromosome 22 open reading frame 18 || AK123479 ||
AGI_HUM1_OLIGO_A_23_P103159







22q13.2





355
Hs.444751
POZ domain containing 1
POZ domain containing 1 || NM_002614 || 1q21
AGI_HUM1_OLIGO_A_23_P52121




356
Hs.126248
Collagen, type IX, alpha 3
Collagen, type IX, alpha 3 || NM_001853 || 20q13.3
AGI_HUM1_OLIGO_A_23_P40108




357
Hs.81892
KIAA0101
KIAA0101|| AY358648 || 15q22.31
AGI_HUM1_OLIGO_A_23_P117852




358
Hs.416358
Sal-like 2 (Drosophila)
Sal-like 2 (Drosophila) || NM_005407 || 14q11.1-q12
AGI_HUM1_OLIGO_A_23_P48585




359
Hs.508461
Mitogen-activated protein kinase kinase kinase 1
Mitogen-activated protein kinase kinase kinase 1
AGI_HUM1_OLIGO_A_23_P41796







|| XM_042066 || 5q11.2





360
Hs.491172
Neurobeachin
Neurobeachin || NM_015678 || 13q13
AGI_HUM1_OLIGO_A_23_P21128
AGI_HUM1_OLIGO_








A_23_P65278



361
Hs.6434
Chromosome 14 open reading frame 132
Chromosome 14 open reading frame 132 || BC043593 ||
AGI_HUM1_OLIGO_A_23_P151525







14q32.2





362
Hs.376984
SRY (sex determining region Y)-box 10
SRY (sex determining region Y)-box 10 || BC018808 ||
AGI_HUM1_OLIGO_A_23_P143694







22q13.1





363
Hs.525205
NDRG family member 2
NDRG family member 2 || AK096999 || 14q11.2
AGI_HUM1_OLIGO_A_23_P37205




364
Hs.520463
PiggyBac transposable element derived 5
PiggyBac transposable element derived 5 || NM_024554 ||
AGI_HUM1_OLIGO_A_23_P126648







1q42.13





365
Hs.104650
Mago-nashi homolog
Mago-nashi homolog || NM_018048 || 12p13.2
AGI_HUM1_OLIGO_A_23_P2423




366
Hs.29802
Slit homolog 2 (Drosophila)
Slit homolog 2 (Drosophila) || AF055585 || 4p15.2
AGI_HUM1_OLIGO_A_23_P144348




367
Hs.84113
Cyclin-dependent kinase inhibitor 3
Cyclin-dependent kinase inhibitor 3 (CDK2-associated
AGI_HUM1_OLIGO_A_23_P48669






(CDK2-associated dual specificity phosphatase)
dual specificity phosphatase) || BQ056331 text missing or illegible when filed





368
Hs.42650
ZW10 interactor
ZW10 interactor || NM_001005414 || 10q21-q22
AGI_HUM1_OLIGO_A_23_P63789




369
Hs.512732
Nei endonuclease VIII-like 1 (E.coli)
Nei endonuclease VIII-like 1 (E.coli) || AK1283752 || 15q23
AGI_HUM1_OLIGO_A_23_P129157




370
Hs.525105
SLIT and NTRK-like family, member 6
SLIT and NTRK-like family, member 6 || NM_032229 ||
AGI_HUM1_OLIGO_A_23_P65307







13q31.1





371
Hs.497741
Centromere protein F, 350/400 ka (mitosin)
Centromere protein F, 350/400 ka (mitosin) || NM_016343 ||
AGI_HUM1_OLIGO_A_23_P401







1q32-q41





372
Hs.404321
Glycyl-tRNA synthetase
Glycyl-tRNA synthetase || NM_002047 || 7p15
AGI_HUM1_OLIGO_A_23_P82361




373
Hs.546280
Pentraxin-related gene, rapidity induced by
Pentraxin-related gene, rapidity induced by IL-1 beta
AGI_HUM1_OLIGO_A_23_P121064






IL-1 beta
|| NM_002852 || 3q25





374
Hs.118631
Timeless homolog (Drosophila)
Timeless homolog (Drosophila) || BC050557 || 12q12-113
AGI_HUM1_OLIGO_A_23_P53276




375
Hs.279766
Kinesin family members 4A
Kinesin family members 4A || AF071592 || Xq13.1
AGI_HUM1_OLIGO_A_23_P148475




376
Hs.244723
Cyclin E1
Cyclin E1 || BC035498 || 18q12
AGI_HUM1_OLIGO_A_23_P209200




377
Hs.505934
CGI-119 protein
CGI-119 protein || AK127285 || 12q14.1-q15
AGI_HUM1_OLIGO_A_23_P13694




378
Hs.409065
Flap structure-specific endonuclease 1
Flap structure-specific endonuclease 1 || NM_004111 || 11q12
AGI_HUM1_OLIGO_A_23_P87192




379
Hs.26770
Fatty acid binding protein 7, brain
Fatty acid binding protein 7, brain || AB208815 || 6q22-q23
AGI_HUM1_OLIGO_A_23_P134139




380
Hs.532265
Gene model 83
Gene model 83 || AK001693 || 8q22.3
AGI_HUM1_OLIGO_A_23_P215875




381
Hs.155597
D Component of complement (adipsin)
D Component of complement (adipsin) || BQ1712715 ||
AGI_HUM1_OLIGO_A_23_P119562







19p13.3





382
Hs.513141
Isocitrate dehydrogenase 2 (NADP+),
Isocitrate dehydrogenase 2 (NADP+), mitochondrial
AGI_HUM1_OLIGO_A_23_P129204






mitochondrial
|| AK127371 || 15q26.1





383
Hs.484813
DEK oncogene (DNA binding)
DEK oncogene (DNA binding) || BX6411063 || 6p23
AGI_HUM1_OLIGO_A_23_P254702




384
Hs.30824
Leucine zipper transcription factor-like 1
Leucine zipper transcription factor-like 1 || BC042483 ||
AGI_HUM1_OLIGO_A_23_P41049







3p21.3





385
Hs.472010
Prion protein (p27-30) (Creutzfield-Jakob disease,
Prion protein (p27-30) (Creutzfield-Jakob disease,
AGI_HUM1_OLIGO_A_23_P109143






Gerstmann-Strausler-Scheinker syndrome  text missing or illegible when filed
Gerstmann-Strausler-Scheinker syndrome,  text missing or illegible when filed





386
Hs.42151
Histamine N-methyltransferase
Histamine N-methyltransferase || NM_006895 || 2q22.1
AGI_HUM1_OLIGO_A_23_P56734




387
Hs.368433
Tumor protein D52
Tumor protein D52 || NM_005079 || 8q21
AGI_HUM1_OLIGO_A_23_P216259




388
Hs.16064
CNKSR family member 3
CNKSR family member 3 || AY328891 || 6q25.2
AGI_HUM1_OLIGO_A_23_P134085




389
Hs.7879
Interferon-related developmental regulator 1
Interferon-related developmental regulator 1
AGI_HUM1_OLIGO_A_23_P251825







|| NM_001007245 || 7q22-q31





390
Hs.519168
Fibromodulin
Fibromodulin || BC035281 || 1q32
AGI_HUM1_OLIGO_A_23_P114883




391
Hs.524216
Cell division cycle associated 3
Cell division cycle associated 3 || AK092246 || 12q13
AGI_HUM1_OLIGO_A_23_P162481




392
Hs.518602
Wolfram syndrome 1 (wolframin)
Wolfram syndrome 1 (wolframin) || BC069213 || 4p16
AGI_HUM1_OLIGO_A_23_P121499




393
Hs.460693
Glutamic pyruvate transaminase
Glutamic pyruvate transaminase (alanine aminotransferase)
AGI_HUM1_OLIGO_A_23_P37892






(alanine aminotransferase) 2
2 || NM_133443 || 16q12.1





394
Hs.527412
N-acylsphingosine amidohydrolase
N-acylsphingosine amidohydrolase (acid ceramidase) 1
AGI_HUM1_OLIGO_A_23_P216325






(acid ceramidase) 1
|| NM_004315 || 8p22-p21.3





395
Hs.131683
Cytoplasmic polyadenylation element binding
Cytoplasmic polyadenylation element binding protein 3
AGI_HUM1_OLIGO_A_23_P46812






protein 3
|| NM_014912 || 10q23.32





396
Hs.109425
GDNF family receptor alpha 1
GDNF family receptor alpha 1 || AF038421 || 10q26
AGI_HUM1_OLIGO_A_23_P1686




397
Hs.153479
Extra spindle poles like 1 (S.cerevisiae)
Extra spindle poles like 1 (S.cerevisiae) || D79987 || 12q
AGI_HUM1_OLIGO_A_23_P32707




398
Hs.443551
Hypothetical protein FLJ10706
Hypothetical protein FLJ10706 || AK127098 || 1q24.2
AGI_HUM1_OLIGO_A_23_P11862




399
Hs.458485
Interferon, alpha-inducible protein
Interferon, alpha-inducible protein (clone IFI-15K)
AGI_HUM1_OLIGO_A_23_P811






(clone IFI-15K)
|| BQ279256 || 1p36.33





400
Hs.370858
Fucosdase, alpha-L-1, tissue
Fucosdase, alpha-L-1, tissue || BC017336 || 1p34
AGI_HUM1_OLIGO_A_23_P11543




401
Hs.405958
CDC6 cell division cycle 6 homolog
CDC6 cell division cycle 6 homolog (S.cerevisiae)
AGI_HUM1_OLIGO_A_23_P49972






(S.cerevisiae)
|| NM_001254 || 17q21.3





402
Hs.78619
Gamma-glutamyl hydrolase (conjugase,
Gamma-glutamyl hydrolase (conjugase,
AGI_HUM1_OLIGO_A_23_P134910






folypolygammaglutamyl hydrolase)
folypolygammaglutamyl hydrolase) || CD359152 || 8q12  text missing or illegible when filed





403
Hs.446149
Lactate dehydrogenase B
Lactate dehydrogenase B || AB209231 || 12p12.2-p12.1
AGI_HUM1_OLIGO_A_23_P53476




404
Hs.8257
Cytokine inducible SH2-containing protein
Cytokine inducible SH2-containing protein || NM_013324 ||








3p21.3





405
Hs.528721
Sema domain, immunoglobulin domain (Ig),
Sema domain, immunoglobulin domain (Ig), short basic
AGI_HUM1_OLIGO_A_23_P144096






short basic domain, secreted (semaphorin) 3E
domain, secreted (semaphorin) 3E || N  text missing or illegible when filed
AGI_HUM1_OLIGO_A_23_P59578




406
Hs.160562
Insulin-like growth factor 1 (somatomedin C)
Insulin-like growth factor 1 (somatomedin C) || NM_000618 ||
AGI_HUM1_OLIGO_A_23_P13907







12q22-q23





407
Hs.552582
Leucine rich repeat containing 17
Leucine rich repeat containing 17 || NM_005824 || 7q22.1
AGI_HUM1_OLIGO_A_23_P253958




408
Hs.104019
Transforming, acidic coiled-coil containing
Transforming, acidic coiled-coil containing protein 3
AGI_HUM1_OLIGO_A_23_P212844






protein 3
|| AJ243997 || 4p16.3





409
Hs.288356
Prefoldin 5
Prefoldin 5 || AK024094 || 12q12
AGI_HUM1_OLIGO_A_23_P128183




410
Hs.512636
Proline-rich nuclear receptor coactivator 2
Proline-rich nuclear receptor coactivator 2 || BC085018 ||
AGI_HUM1_OLIGO_A_23_P103201







1p36.11





411
Hs.517830
Biotinidase
Biotinidase || NM_000060 || 3p25
AGI_HUM1_OLIGO_A_23_P155348




412
Hs.504550
RAD51 associatsd protein 1
RAD51 associatsd protein 1 || CR625391 || 12p13.2-p13.1
AGI_HUM1_OLIGO_A_23_P99292




413
Hs.82222
Sema domain, immunoglobulin domain (Ig),
Sema domain, immunoglobulin domain (Ig), short basic
AGI_HUM1_OLIGO_A_23_P132718






short basic domain, secreted, (semaphorin) 3B
domain, secreted, (semaphorin) 3B || A  text missing or illegible when filed





414
Hs.524195
Rho GTPase activating protein 21
Rho GTPase activating protein 21 || AF480466 || 10p12.1
AGI_HUM1_OLIGO_A_23_P115605




415
Hs.460789
Trinucleolide repeat containing 9
Trinucleolide repeat containing 9 || AK095095 || 16q12.1
AGI_HUM1_OLIGO_A_23_P54681




416
Hs.81848
RAD21 homolog (S.pombe)
RAD21 homolog (S.pombe) || NM_006265 || 8q24
AGI_HUM1_OLIGO_A_23_P20463




417
Hs.369042
Hypothetical protein FLJ20605
Hypothetical protein FLJ20605 || AK125512 || 1q41
AGI_HUM1_OLIGO_A_23_P200685




418
Hs.268698
Methylenetetrahydrofolate dehydrogenase
Methylenetetrahydrofolate dehydrogenase (NADP+
AGI_HUM1_OLIGO_A_23_P214907






(NADP+ dependent) 1-like
dependent) 1-like || AK127089 || 6q25.1





419
Hs.156519
MutS homolog 2, colon cancer, nonpolyposis
MutS homolog 2, colon cancer, nonpolyposis type 1 (E.coli)
AGI_HUM1_OLIGO_A_23_P102471






type 1 (E.coli)
|| AK223284 || 2p22-p21





420
Hs.279746
Transient receptor potential cation channel,
Transient receptor potential cation channel, subfamily V,
AGI_HUM1_OLIGO_A_23_P207911






subfamily V, member 2
member 2 || AK126996 || 17p11.2





421
Hs.184062
Chromosome 20 open reading frame 24
Chromosome 20 open reading frame 24 || BG462041 ||
AGI_HUM1_OLIGO_A_23_P102582







20q11.23





422
Hs.321541
RAB11A, member RAS oncogene family
RAB11A, member RAS oncogene family || BC013348 ||
AGI_HUM1_OLIGO_A_23_P77142







15q21.3-q22.31





423
Hs.3352
Histone deacetylase 2
Histone deacetylase 2 || AB209190 || 6q21
AGI_HUM1_OLIGO_A_23_P122304




424
Hs.330384
Coronin, actin binding protein, 1C
Coronin, actin binding protein, 1C || NM_014325 || 12q24.1
AGI_HUM1_OLIGO_A_23_P53456




425
Hs.435730
Iroquois hemeobox protein 5
Iroquois hemeobox protein 5 || AY335945 || 16q11.2-q13
AGI_HUM1_OLIGO_A_23_P9779




426
Hs.528763
Small nuclear ribonucleoprotein polypeptide A
Small nuclear ribonucleoprotein polypeptide A || AK090986 ||
AGI_HUM1_OLIGO_A_23_P14686







15q26.3





427
Hs.531561
Epithelial membrane protein 2
Epithelial membrane protein 2 || AK096403 || 16p13.2
AGI_HUM1_OLIGO_A_23_P106682




428
Hs.470477
Protein tyrosine phosphatase type IVA, member 2
Protein tyrosine phosphatase type IVA, member 2
AGI_HUM1_OLIGO_A_23_P200414







|| NM_003479 || 1p35





429
Hs.438720
MCM7 minichromosome maintenance deficient 7
MCM7 minichromosome maintenance deficient 7
AGI_HUM1_OLIGO_A_23_P93690






(S.cerevisiae)
(S.cerevisiae) || NM_182776 || 7q21.3-q22.1
_




430
Hs.191179
RAB11 family interacting protein 1 (class I)
RAB11 family interacting protein 1 (class I)
AGI_HUM1_OLIGO_A_23_P31873







|| NM_001002814 || 8p11.22





431
Hs.307905
V-rel reticuloendotheliosis viral oncogene homolog
V-rel reticuloendotheliosis viral oncogene homolog B,
AGI_HUM1_OLIGO_A_23_P55706






B, nuclear factor of kapa light polypeptide  text missing or illegible when filed
nuclear factor of kapa light polypeptide  text missing or illegible when filed





432
Hs.123253
SHC SH2-domain binding protein 1
SHC SH2-domain binding protein 1 || NM_024745 || 16q11.2
AGI_HUM1_OLIGO_A_23_P206544




433
Hs.508950
Transglutaminase 1 (K polypeptide epidermal type
Transglutaminase 1 (K polypeptide epidermal type I,
AGI_HUM1_OLIGO_A_23_P65617






I, protein-glutamine-gamma-glutamyltransfer  text missing or illegible when filed
protein-glutamine-gamma-glutamyltransfer  text missing or illegible when filed





434
Hs.19191l
Nuclear factor I/A
Nuclear factor I/A || BX648791 || 1p31.3-p31.2
AGI_HUM1_OLIGO_A_23_P85682




435
Hs.54470
ATP-binding cassette, sub-family C (CF7R/MRP),
ATP-binding cassette, sub-family C (CF7R/MRP), member 8
AGI_HUM1_OLIGO_A_23_P24774






member 8
|| NM_000352 || 11p15.1





436
Hs.19545
Frizzled homolog 4 (Drosophila)
Frizzled homolog 4 (Drosophila) || AB032417 || 11q14.2
AGI_HUM1_OLIGO_A_23_P54617




437
Hs.111554
ADP-ribosylation factor-like 7
ADP-ribosylation factor-like 7 || NM_005737 || 2q37.1
AGI_HUM1_OLIGO_A_23_P251551




438
Hs.534385
THO complex 4
THO complex 4 || BO279142 || 17q25.3
AGI_HUM1_OLIGO_A_23_P152984




439
Hs.375707
Coiled-coil-helix-coled-coil-helix domain
Coiled-coil-helix-coled-coil-helix domain containing 5
AGI_HUM1_OLIGO_A_23_P154279






containing 5
|| AK024631 || 2q13





440
Hs.446850
Chromosome 14 open reading frame 100
Chromosome 14 open reading frame 100 || AK128628 ||
AGI_HUM1_OLIGO_A_23_P205580







14q23.1





441
Hs.89497
Lamin B1
Lamin B1 || BC052951 || 5q23.3-q31.1
AGI_HUM1_OLIGO_A_23_P258493




442
Hs.444468
CTO (carboxy-terminal domain, RNA
CTO (carboxy-terminal domain, RNA polymerase II,
AGI_HUM1_OLIGO_A_23_P28263






polymerase II, polypeptide A) small phosphatase 1
polypeptide A) small phosphatase 1 AF22





443
Hs.18349
Mitochondrial ribosomal protein L15
Mitochondrial ribosomal protein L15 || BQ278804 ||
AGI_HUM1_OLIGO_A_23_P94174







8q11.2-q13





444
Hs.532491
Cryptochrome 2 (photolyase-like)
Cryptochrome 2 (photolyase-like) || BC035181 || 11p11.2
AGI_HUM1_OLIGO_A_23_P127394
AGI_HUM1_OLIGO_








A_23_P138787



445
Hs.510402
Membrane cofactor protein (CD46, trophoblast-
Membrane cofactor protein (CD46, trophoblast-lymphocyte
AGI_HUM1_OLIGO_A_23_P201758






lymphocyte cross-reactive agent)
cross-reactive agent) || BX537451





446
Hs.524399
Trophinin associated protein (tastin)
Trophinin associated protein (tastin) || AK128056 || 12q13.12
AGI_HUM1_OLIGO_A_23_P150935




447
Hs.522730
G protein-coupled receptor associated sorting
G protein-coupled receptor associated sorting protein 1
AGI_HUM1_OLIGO_A_23_P96590






protein 1
|| NM_014710 || Xq22.1





448
Hs.275464
Kallikrein 10
Kallikrein 10 || AK026045 || 19q13.3-q13.4
AGI_HUM1_OLIGO_A_23_P107911




449
Hs.123464
Purinergic receptor P2Y, G-proiein coupled, 5
Purinergic receptor P2Y, G-proiein coupled, 5 || BC045651 ||
AGI_HUM1_OLIGO_A_23_P2705







13q14





450
Hs.534293
Serine (or cysteine) proteinase inhibitor, clade A
Serine (or cysteine) proteinase inhibitor, clade A
AGI_HUM1_OLIGO_A_23_P162915






(alpha-1 antiproteinase, antitrypsin), member 3
(alpha-1 antiproteinase, antitrypsin), member 3





451
Hs.303476
Flavin containing monooxygenase 5
Flavin containing monooxygenase 5 || NM_001461 || 1q21.1
AGI_HUM1_OLIGO_A_23_P231




452
Hs.479208
F-box and leucine-rich repeat protein 5
F-box and leucine-rich repeat protein 5 || NM_033535 ||
AGI_HUM1_OLIGO_A_23_P213247







4p15.23





453
Hs.124165
Mitochondrial ribosomal protein S30
Mitochondrial ribosomal protein S30 || BX538300 || 5q11
AGI_HUM1_OLIGO_A_23_P252369




454
Hs.477481
MCM2 minichromosome maintenance deficient 2,
MCM2 minichromosome maintenance deficient 2, mitotin
AGI_HUM1_OLIGO_A_23_P250801






mitotin (S.cerevisiae)
(S.cerevisiae) || NM_004526 || 3q21





455
Hs.510334
Serine (or cysteine) proteinase inhibitor, clade A
Serine (or cysteine) proteinase inhibitor, clade A
AGI_HUM1_OLIGO_A_23_P205355






(alpha-1 antiproteinase, antitrypsin), member 5
(alpha-1 antiproteinase, antitrypsin), member 5





456
Hs.180535
UNC-112 related protein 2
UNC-112 related protein 2 || BC004347 || 11q13.1
AGI_HUM1_OLIGO_A_23_P64038




457
Hs.12068
Carnitine acetyltransferase
Carnitine acetyltransferase || NM_000755 || 9q34.1
AGI_HUM1_OLIGO_A_23_P3196




458
Hs.513726
Guanylate binding protein 5
Guanylate binding protein 5 || AK090479 || 1p22.2
AGI_HUM1_OLIGO_A_23_P74290




459
Hs.72620
Chromosome 20 open reading frame 28
Chromosome 20 open reading frame 28 || NM_015417 ||
AGI_HUM1_OLIGO_A_23_P40280







20pter-q11.23





460
Hs.36794
Cyclin D-type binding-protein 1
Cyclin D-type binding-protein 1 || CR614852 || 15q14-q15
AGI_HUM1_OLIGO_A_23_P26243




461
Hs.421907
Glioma tumor suppressor candidate region gene 2
Glioma tumor suppressor candidate region gene 2
AGI_HUM1_OLIGO_A_23_P3915







|| AK024486 || 19q13.3





462
Hs.479754
V-kit Hardy-Zuckerman 4 feline sarcoma viral
V-kit Hardy-Zuckerman 4 feline sarcoma viral oncogene
AGI_HUM1_OLIGO_A_23_P110253






oncogene homolog
homolog || BC0715963 || 4q11-q12





463
Hs.5719
Chromosome condensation-related SMC-associated
Chromosome condensation-related SMC-associated protein 1
AGI_HUM1_OLIGO_A_23_P252936






protein 1
|| D63880 || 12p13.3





464
Hs.278857
Heterogeneous nuclear ribonudeoprotein H2 (H′)
Heterogeneous nuclear ribonudeoprotein H2 (H′)
AGI_HUM1_OLIGO_A_23_P11283







|| CR624721 || Xq22





465
Hs.546324
Guanine monophosphate synthetase
Guanine monophosphate synthetase || NM _003875 || 3q24
AGI_HUM1_OLIGO_A_23_P21033




466
Hs.461847
KIAA0182 protein
KIAA0182 protein || NM_014615 || 16q24.1
AGI_HUM1_OLIGO_A_23_P152415




467
Hs.433160
DNA replication complex GINS protein PSF2
DNA replication complex GINS protein PSF2 || AK091519 ||
AGI_HUM1_OLIGO_A_23_P118245







16q24.1





468
Hs.483036
Praja 2, RING-H2 motif containing
Praja 2, RING-H2 motif containing || NM_014819 || 5q21.3
AGI_HUM1_OLIGO_A_23_P133470




469
Hs.449415
Eukaryotic translation initiation factor 2C, 2
Eukaryotic translation initiation factor 2C, 2 || 6C04491 ||
AGI_HUM1_OLIGO_A_23_P112159







8q24





470
Hs.272499
Dehydrogenase/reductase (SDR family) member 2
Dehydrogenase/reductase (SDR family) member 2
AGI_HUM1_OLIGO_A_23_P48570







|| AB09653 || 14q11.2





471
Hs.370927
Hypothetical protein PRO1855
Hypothetical protein PRO1855 || AK025328 || 17q21.33
AGI_HUM1_OLIGO_A_23_P207481




472
Hs.546239
Alpha-2-glycoprprotein 1, zinc
Alpha-2-glycoprprotein 1, zinc || BC014470 || 7q22.1
AGI_HUM1_OLIGO_A_23_P71270




473
Hs.101774
Chromosome 20 open reading frame 23
Chromosome 20 open reading frame 23 || AY166853 ||
AGI_HUM1_OLIGO_A_23_P17503







20p11.23





474
Hs.546418
Zinc finger protein 339
Zinc finger protein 339 || AK022284 || 20pter-q11.23
AGI_HUM1_OLIGO_A_23_P143348




475
Hs.49688
Actin binding LIM protein family, member 3
Actin binding LIM protein family, member 3 || AB020650 ||
AGI_HUM1_OLIGO_A_23_P256204







5q32





476
Hs.188881
Hypothetical protein FLJ34633
Hypothetical protein FLJ34633 || AK091952 || 1p36.11
AGI_HUM1_OLIGO_A_23_P800




477
Hs.72071
Potassium channel tetramerisation domain
Potassium channel tetramerisation domain containing 9
AGI_HUM1_OLIGO_A_23_P43226






containing 9
|| AL117436 || 8p21.1





478
Hs.75868
Hypothetical protein FLJ14490
Hypothetical protein FLJ14490 || AF370364 || 1p34.2
AGI_HUM1_OLIGO_A_23_P43817




479
Hs.433845
Keratin 5 (epidermolysis bullosa simplex,
Keratin 5 (epidermolysis bullosa simplex, Dowling-Meara/
AGI_HUM1_OLIGO_A_23_P218040






Dowling-Meara/Kobner/Weber-Cockayne types)
Kobner/Weber-Cockayne types) || M2  text missing or illegible when filed





480
Hs.514289
Homeo box B2
Homeo box B2 || NM_002145 || 17q21-q22
AGI_HUM1_OLIGO_A_23_P107283




481
Hs.438533
Polymerase (DNA directed) Iota
Polymerase (DNA directed) Iota || BC032617 || 18q21.1
AGI_HUM1_OLIGO_A_23_P4461




482
Hs.133539
Microtubule associated serine/threonine kinase
Microtubule associated serine/threonine kinase family
AGI_HUM1_OLIGO_A_23_P110571






family member 4
member 4 || XM_291141 || 5q12.3





483
Hs.413835
Sin3-associated polypeptide, 30 kDa
Sin3-associated polypeptide, 30 kDa || BC016757 || 4q34.1
AGI_HUM1_OLIGO_A_23_P121602




484
Hs.54483
N-myc (and STAT) interactor
N-myc (and STAT) interactor || AK124323 || 2p24-q21.3
AGI_HUM1_OLIGO_A_23_P154235




485
Hs.183617
Claudin 23
Claudin 23 || BC016047 || 8p23.1
AGI_HUM1_OLIGO_A_23_P134854




486
Hs.173092
Solute carrier family 24 (sodium/potassium/
Solute carrier family 24 (sodium/potassium/calcium
AGI_HUM1_OLIGO_A_23_P205910






calcium exchanger), member 1
exchanger), member 1 || AB014602 || 15q22





487
Hs.533977
Thioredoxin interacting protein
Thioredoxin interacting protein || NM_006472 || 1q21.1
AGI_HUM1_OLIGO_A_23_P97700




488
Hs.448520
Solute carrier family 7 (cationic amino acid
Solute carrier family 7 (cationic amino acid transporter, y+
AGI_HUM1_OLIGO_A_23_P255837






transporter, y+ system), member 2
system), member 2 || NM_003046 || 8





489
Hs.408658
Cyclin E2
Cyclin E2 || NM_057749 || 8q22.1
AGI_HUM1_OLIGO_A_23_P215976




490
Hs.374596
Tumor protein, translationally-controlled 1
Tumor protein, translationally-controlled 1 || BG033621 ||
AGI_HUM1_OLIGO_A_23_P53797







13q12-q14





491
Hs.210532
KIAA0141
KIAA0141 || NM_014773 || 5q31.3
AGI_HUM1_OLIGO_A_23_P213369




492
Hs.297304
Glycosyltransferase 8 domain containing 1
Glycosyltransferase 8 domain containing 1
AGI_HUM1_OLIGO_A_23_P132669







|| NM_001010983 || 3p21.1





493
Hs.513915
Claudin 7
Claudin 7 || CR594337 || 17p13
AGI_HUM1_OLIGO_A_23_P164283




494
Hs.334832
NAD(P)H:quinone oxidoreductase type 3,
NAD(P)H:quinone oxidoreductase type 3, polypeptide A2
AGI_HUM1_OLIGO_A_23_P52101






polypeptide A2
|| AK123705 || 1p36.13-q41





495
Hs.500761
Solute carrier family 16 (monocarboxylic acid
Solute carrier family 16 (monocarboxylic acid transporters),
AGI_HUM1_OLIGO_A_23_P147345
AGI_HUM1_OLIGO_





transporters), member 3
member 3 || AK127319 || 17q25

A_23_P158725



496
Hs.86368
Calmegin
Calmegin || AK093096 || 4q28.3-q31.1
AGI_HUM1_OLIGO_A_23_P18684




497
Hs.251673
DNA (cytosine-5)-methyltransferase 3 beta
DNA (cytosine-5)-methyltransferase 3 beta || NM_006892 ||
AGI_HUM1_OLIGO_A_23_P28953







20q11.2





498
Hs.4944
Chromosome 9 open reading frame 58
Chromosome 9 open reading frame 58 || AK128526 ||
AGI_HUM1_OLIGO_A_23_P94380







9q34.13-q34.3





499
Hs.134434
Ovo-like 1 (Drosophila)
Ovo-like 1 (Drosophila) || BC059408 || 11q13
AGI_HUM1_OLIGO_A_23_P202810




500
Hs.396393
Ubiquitin-conjugating enzyme E2S
Ubiquitin-conjugating enzyme E2S || BM479313 || 19q13.43
AGI_HUM1_OLIGO_A_23_P55769




501
Hs.227817
BCL2-related protein A1
BCL2-related protein A1 || BF677029 || 15q24.3
AGI_HUM1_OLIGO_A_23_P151995




502
Hs.83304
Phospholipase A2, group VII (platelet-activating
Phospholipase A2, group VII (platelet-activating factor
AGI_HUM1_OLIGO_A_23_P145096






factor acetylhydrolase, plasma)
acetylhydrolase, plasma) || BC025674 || 6





503
Hs.181326
Myotubularin related protein 2
Myotubularin related protein 2 || NM_201278 || 11q22
AGI_HUM1_OLIGO_A_23_P64018




504
Hs.02661
Guanylate binding protein 1, interferon-inducible,
Guanylate binding protein 1, interferon-inducible, 67 kDa
AGI_HUM1_OLIGO_A_23_P62890






67 kDa
|| AB208912 || 1q22.2





505
Hs.517582
MCM5 minichromosome maintenance deficient 5,
MCM5 minichromosome maintenance deficient 5, cell
AGI_HUM1_OLIGO_A_23_P132277






cell division cycle 46 (S.cerevisiae)
division cycle 46 (S.cerevisiae) || AB209  text missing or illegible when filed





506
Hs.7946
Mitochrondrial tumor suppressor 1
Mitochrondrial tumor suppressor 1 || NM_00100197 || 8p22
AGI_HUM1_OLIGO_A_23_P94358




507
Hs.159799
Thyroid hormone receptor associated protein 2
Thyroid hormone receptor associated protein 2
AGI_HUM1_OLIGO_A_23_P47991







|| NM_015335 || 12q24.21





508
Hs.511668
Vacuolar protein sorting 13C (yeast)
Vacuolar protein sorting 13C (yeast) || AJ608771 || 15q22.2
AGI_HUM1_OLIGO_A_23_P206228




509
Hs.444448
Potassium channel, subfamily K, member 5
Potassium channel, subfamily K, member 5 || BC060793 ||
AGI_HUM1_OLIGO_A_23_P30728







6p21





510
Hs.480938
LPS-responsive vesicle trafficking, beach and
LPS-responsive vesicle trafficking, beach and anchor
AGI_HUM1_OLIGO_A_23_P251992






anchor containing
containing || AF467287 || 4q31.23-q31.3





511
Hs.334370
Brain expressed, X-linked 1
Brain expressed, X-linked 1 || BM804232 || Xq21-q23
AGI_HUM1_OLIGO_A_23_P159952




512
Hs.253594
Trichorhinophalangeal syndrome I
Trichorhinophalangeal syndrome I || NM_014112 || 8q24.12
AGI_HUM1_OLIGO_A_23_P134755




513
Hs.533710
Fibronectin leucine rich transmembrane protein 2
Fibronectin leucine rich transmembrane protein 2
AGI_HUM1_OLIGO_A_23_P99802







|| NM_013231 || 14q24-q32





514
Hs.307529
Kinesin family member 15
Kinesin family member 15 || NM_020242 || 3p21.31
AGI_HUM1_OLIGO_A_23_P80902




515
Hs.116651
Epithelial V-like antigen 1
Epithelial V-like antigen 1 || NM_005797 || 11q24
AGI_HUM1_OLIGO_A_23_P150379




516
Hs.375957
Integrin beta 2 (antigen CD18 (p95), lymphocyte
Integrin beta 2 (antigen CD18 (p95), lymphocyte
AGI_HUM1_OLIGO_A_23_P211180






function-associated antigen 1; macrophage an  text missing or illegible when filed
function-associated antigen 1; macrophage an  text missing or illegible when filed





517
Hs.13155
Integrin, beta 5
Integrin, beta 5 || AK091595 || 3q21.2
AGI_HUM1_OLIGO_A_23_P166627




518
Hs.507798
Lipoma HMGIC fusion partner
Lipoma HMGIC fusion partner || CR749848 || 13q12
AGI_HUM1_OLIGO_A_23_P88069




519
Hs.109438
Potassium channel tetramerisation domain
Potassium channel tetramerisation domain containing 12
AGI_HUM1_OLIGO_A_23_P128674






containing 12
|| AF359381 || 13q22.3





520
Hs.477959
Seven in absentia homolog 2 (Drosophila)
Seven in absentia homolog 2 (Drosophila) || NM_005067 ||
AGI_HUM1_OLIGO_A_23_P69121







3q25





521
Hs.182014
Hematopoietic protein 1
Hematopoietic protein 1 || NM_005337 || 12q13.1
AGI_HUM1_OLIGO_A_23_P128195




522
Hs.25647
V-fos FBJ murine osteosarcoma viral oncogene
V-fos FBJ murine osteosarcoma viral oncogene homolog
AGI_HUM1_OLIGO_A_23_P106192






homolog
|| BX647104 || 14q24.3





523
Hs.148989
Cingulin-like
Cingulin-like || NM_032885 || 15q21.3
AGI_HUM1_OLIGO_A_23_P163305




524
Hs.112949
Chromosome 1 open reading frame 34
Chromosome 1 open reading frame 34 || AB007921 || 1p32.3
AGI_HUM1_OLIGO_A_23_P160214




525
Hs.437474
RID kinase 1 (yeast)
RID kinase 1 (yeast) || BC006104 || 6p24.3
AGI_HUM1_OLIGO_A_23_P42432




526
Hs.425427
Hypothetical protein FLJ20425
Hypothetical protein FLJ20425 || AK000432 || 4p16.3
AGI_HUM1_OLIGO_A_23_P41327




527
Hs.517227
Junctional adhesion molecule 2
Junctional adhesion molecule 2 || NM_01219 || 21q21.2
AGI_HUM1_OLIGO_A_23_P120667




528
Hs.106880
Bystin-like
Bystin-like || AK095253 || 6p21.1
AGI_HUM1_OLIGO_A_23_P145194




529
Hs.12813
TCDD-inducible poly(ADP-ribose) polymerase
TCDD-inducible poly(ADP-ribose) polymerase
AGI_HUM1_OLIGO_A_23_P143845







|| CR749647 || 3q25.31





530
Hs.274313
Insulin-like growth factor binding protein 6
Insulin-like growth factor binding protein 6 || BM913156 ||
AGI_HUM1_OLIGO_A_23_P128520







12q13





531
Hs.325667
Thrombospondin, type 1, domain containing 1
Thrombospondin, type 1, domain containing 1 || AK096289 ||
AGI_HUM1_OLIGO_A_23_P14184







13q14.3





532
Hs.481571
Ubiquinol-cytochrome c reductase hinge protein
Ubiquinol-cytochrome c reductase hinge protein
AGI_HUM1_OLIGO_A_23_P200118







|| BF127835 || 1p34.1





533
Hs.35096
Zinc finger and BTB domain containing 4
Zinc finger and BTB domain containing 4
AGI_HUM1_OLIGO_A_23_P100553







|| BC043352 || 17p13.1





534
Hs.167531
Methylcrotonyl-Coenzyme A carboxylase 2
Methylcrotonyl-Coenzyme A carboxylase 2 (beta)
AGI_HUM1_OLIGO_A_23_P18897






(beta)
|| AK094987 || 5q12-q13





535
Hs.330663
Hypothetical protein FLJ20641
Hypothetical protein FLJ20641 || AK000548 || 12q23.3
AGI_HUM1_OLIGO_A_23_P87769




536
Hs.434321
ATP/GTP binding protein 1
ATP/GTP binding protein 1 || AB028958 || 9q21.33
AGI_HUM1_OLIGO_A_23_P169277




537
Hs.440379
Rho GTPase-activating protein
Rho GTPase-activating protein || AL833052 || 11q4-q25
AGI_HUM1_OLIGO_A_23_P161686




538
Hs.104570
Kallikrein 8 (neuropsin/ovasin)
Kallikrein 8 (neuropsin/ovasin) || BC040887 || 19q13.3-q13.4
AGI_HUM1_OLIGO_A_23_P130694




539
Hs.495728
Pirin (iron-binding nuclear protein)
Pirin (iron-binding nuclear protein) || BX537579 || Xp22.2
AGI_HUM1_OLIGO_A_23_P137035




540
Hs.275775
Selenoprotein P, plasma, 1
Selenoprotein P, plasma, 1 || BC030009 || 5q31
AGI_HUM1_OLIGO_A_23_P121926




541
Hs.74034
Caveolin 1, caveolae protein, 22 kDa
Caveolin 1, caveolae protein, 22 kDa || NM_001753 || 7q31.1
AGI_HUM1_OLIGO_A_23_P134454




542
Hs.9728
Armadillo repeat containing, X-linked 1
Armadillo repeat containing, X-linked 1 || AB039670 ||
AGI_HUM1_OLIGO_A_23_P22682







Xp21.33-q22.2





543
Hs.221889
Cold shock domain protein A
Cold shock domain protein A || AB209896 || 12p13.1
AGI_HUM1_OLIGO_A_23_P25229




544
Hs.422889
Nudix (nucleoside diphosphate linked moiety X)-
Nudix (nucleoside diphosphate linked moiety X)-type motif 6
AGI_HUM1_OLIGO_A_23_P155857






type motif 6
|| AB209758 || 4q26





545
Hs.4314938
Forkhead box P1
Forkhead box P1 || BX538242 || 3p14.1
AGI_HUM1_OLIGO_A_23_P155257




546
Hs.171299
Zinc finger and BTB domain containing 16
Zinc finger and BTB domain containing 16 || AB208916 ||
AGI_HUM1_OLIGO_A_23_P104804







11q23.1





547
Hs.59554
Sestrin 1
Sestrin 1 || AK127043 || 6q21
AGI_HUM1_OLIGO_A_23_P93562




548
Hs.533738
Hypothetical protein FLJ21827
Hypothetical protein FLJ21827 || NM_020153 || 11q23.3
AGI_HUM1_OLIGO_A_23_P116202




549
Hs.501513
Comparative gene indentification transcript 37
Comparative gene indentification transcript 37
AGI_HUM1_OLIGO_A_23_P54834







|| NM_016101 || 16q22.1





550
Hs.188464
Mannosidase, alpha, class 2B, member 2
Mannosidase, alpha, class 2B, member 2 || AB023152 ||
AGI_HUM1_OLIGO_A_23_P250372







4p16.1





551
Hs.24719
Modulator of apaptosis 1
Modulator of apaptosis 1 || NM_022151 || 14q32
AGI_HUM1_OLIGO_A_23_P205389




552
Hs.33455
Peptidyl arginine deiminase, type II
Peptidyl arginine deiminase, type II || AB023211 ||
AGI_HUM1_OLIGO_A_23_P201747







1p35.2-p35.1





553
Hs.549109
Protein tyrosine phosphatase, receptor type, T
Protein tyrosine phosphatase, receptor type, T
AGI_HUM1_OLIGO_A_23_P135576
AGI_HUM1_OLIGO_






|| NM_133170 || 20q12-q13

A_23_P146970



554
Hs.171625
Basic helix-loop domain containing, class B, 2
Basic helix-loop domain containing, class B, 2
AGI_HUM1_OLIGO_A_23_P57836







|| NM_003670 || 3p26





555
Hs.108106
Ubiquitin-like containing PHD and RING
Ubiquitin-like containing PHD and RING finger domains, 1
AGI_HUM1_OLIGO_A_23_P208880






finger domains, 1
|| AK04377 || 19p13.3





556
Hs.47166
Chromosome 3 open reading frame 14
Chromosome 3 open reading frame 14 || BM699794 || 3p14.2
AGI_HUM1_OLIGO_A_23_P29695




557
Hs.293907
Family with sequence similarity 3B, member B
Family with sequence similarity 3B, member B || AK056572 ||
AGI_HUM1_OLIGO_A_23_P130376







18p11.22





558
Hs.83381
Guanine nucleotide binding protein
Guanine nucleotide binding protein (G protein), gamma 11
AGI_HUM1_OLIGO_A_23_P111701






(G protein), gamma 11
|| BF971151 || 7q31-q32





559
Hs.447530
Hyaluronan and proteoglycan link protein 3
Hyaluronan and proteoglycan link protein 3 || BC053689 ||
AGI_HUM1_OLIGO_A_23_P14754







15q26.1





560
Hs.6985
Matrilin 3
Matrilin 3 || NM_002381 || 2p24-p23
AGI_HUM1_OLIGO_A_23_P102058




561
Hs.75969
Protein-rich nuclear receptor coactivator 1
Protein-rich nuclear receptor coactivator 1 || NM_005813 ||
AGI_HUM1_OLIGO_A_23_P145074







6q16





562
Hs.477869
Phospholipid scramblase 4
Phospholipid scramblase 4 || BC028354 || 3q24
AGI_HUM1_OLIGO_A_23_P91912




563
Hs.128196
Hypothetical protein FLJ14 966
Hypothetical protein FLJ14 966 || AK027672 || 11p15.3
AGI_HUM1_OLIGO_A_23_P2041




564
Hs.156346
Topoisomerase (DNA) II alpha 170 kDa
Topoisomerase (DNA) II alpha 170 kDa || NM_001067 ||
AGI_HUM1_OLIGO_A_23_P116834







17q21-q22





565
Hs.109439
Osteoglycin (osteoinductive factor, mimecan)
Osteoglycin (osteoinductive factor, mimecan) || NM_033014 ||
AGI_HUM1_OLIGO_A_23_P82990







9q22





566
Hs.103147
Sperm protein SSP411
Sperm protein SSP411 || AK125807 || 17q21.33
AGI_HUM1_OLIGO_A_23_P18633




567
Hs.485640
Primase, polypeptide 2A, 56 kDa
Primase, polypeptide 2A, 56 kDa || NM_000947 || 6p12-p11.1
AGI_HUM1_OLIGO_A_23_P44139
AGI_HUM1_OLIGO_








A_23_P61009



568
Hs.477789
ATPase, Na+/K+ transporting, beta 3
ATPase, Na+/K+ transporting, beta 3 polypeptide
AGI_HUM1_OLIGO_A_23_P66007






polypeptide
|| AK094673 || 3q23





569
Hs.518060
ADP-ribosylation-like factor 6 interacting
ADP-ribosylation-like factor 6 interacting protein 5
AGI_HUM1_OLIGO_A_23_P166640






protein 5
|| NM_006407 || 3p14





570
Hs.550539
NudC domain containing 1
NudC domain containing 1 || BC043406 || 8q23
AGI_HUM1_OLIGO_A_23_P123343




571
Hs.471405
Tubulin tyrosine ligase-like family, member 4
Tubulin tyrosine ligase-like family, member 4 || D79995 ||
AGI_HUM1_OLIGO_A_23_P142598







2p24.3-p24.1





572
Hs.467733
GKEB1 protein
GKEB1 protein || NM_014668 || 2p25.1
AGI_HUM1_OLIGO_A_23_P108862




573
Hs.509264
Kelch domain containing 2
Kelch domain containing 2 || AK056298 || 14q21.3
AGI_HUM1_OLIGO_A_23_P54165




574
Hs.500812
Beta-transducin repeat containing
Beta-transducin repeat containing || NM_033637 || 10q24.32
AGI_HUM1_OLIGO_A_23_P35427
AGI_HUM1_OLIGO_








A_23_P46819



575
Hs.104320
Golgi autoantigen, golgin subfamily a, 5
Golgi autoantigen, golgin subfamily a, 5 || NM_005113 ||
AGI_HUM1_OLIGO_A_23_P3183







14q32.12-q32.13





576
Hs.87435
Rho guanine exchange factor [GEF] 16
Rho guanine exchange factor [GEF] 16 || CR609458 || 1q36.3
AGI_HUM1_OLIGO_A_23_P114670




577
Hs.549157
Coenzyme Q4 tamotog (yeast)
Coenzyme Q4 tamotog (yeast) || AK128853 || 9q34.11
AGI_HUM1_OLIGO_A_23_P112493




578
Hs.482976
Hypothetical gene supported by AF038182:
Hypothetical gene supported by AF038182: BC009203
AGI_HUM1_OLIGO_A_23_P122007






BC009203
|| BC009203 || 5q21.1





579
Hs.291
IQ motif containing GTPase activating protein 2
IQ motif containing GTPase activating protein 2
AGI_HUM1_OLIGO_A_23_P253002







|| NM_008633 || 5q13.3





580
Hs.496267
Immunoglobulin (CD79A] binding protein 1
Immunoglobulin (CD79A] binding protein 1 || AK054596 ||
AGI_HUM1_OLIGO_A_23_P171249







Xq13.1-q13.3





581
Hs.530934
Cysteine and glycine-rich protein 2
Cysteine and glycine-rich protein 2 || AB209321 || 12q21.1
AGI_HUM1_OLIGO_A_23_P44724




582
Hs.506603
DIP13 beta
DIP13 beta || BX649010 || 12q4.1
AGI_HUM1_OLIGO_A_23_P105747




583
Hs.30246
Solute carrier family 19 (thiamine transporter),
Solute carrier family 19 (thiamine transporter), member 2
AGI_HUM1_OLIGO_A_23_P160466






member 2
AJ237724 || 1q23.3





584
Hs.328865
Dynactin 4 (p62)
Dynactin 4 (p62) || AK125973 || 5q31-q32
AGI_HUM1_OLIGO_A_23_P251945




585
Hs.50915
Kallikrein 5
Kallikrein 5 || AY359010 || 19q13.3-q13.4
AGI_HUM1_OLIGO_A_23_P153475




586
Hs.483444
Chemokine (C-X-C motif) ligand 14
Chemokine (C-X-C motif) ligand 14 || NM_004887 || 5q31
AGI_HUM1_OLIGO_A_23_P213745




587
Hs.494337
Golgi phosphoprotein 2
Golgi phosphoprotein 2 || NM_016456 || 9q21.33
AGI_HUM1_OLIGO_A_23_P146506




588
Hs.62128
Trophoblast glycoprotein
Trophoblast glycoprotein || NM_006670 || 6q14-q15
AGI_HUM1_OLIGO_A_23_P59261




589
Hs.147433
Proliferatng cell nuclear antigen
Proliferatng cell nuclear antigen || BE96331 || 20pter-p12
AGI_HUM1_OLIGO_A_23_P28886




590
Hs.521459
ADAM-like, decysin 1
ADAM-like, decysin 1 || Y13323 || 8p21.2
AGI_HUM1_OLIGO_A_23_P256425




591
Hs.415762
Lymphocyte antigen 6 complex, locus D
Lymphocyte antigen 6 complex, locus D || BC034542 ||
AGI_HUM1_OLIGO_A_23_P134764







8q24-qter





592
Hs.524161
Ras suppressor protein 1
Ras suppressor protein 1 || NM_012425 || 10p13
AGI_HUM1_OLIGO_A_23_P138417




593
Hs.18376
Cingulin
Cingulin || AF263462 || 1q21
AGI_HUM1_OLIGO_A_23_P149388




594
Hs.523798
Basic transcription factor 3
Basic transcription factor 3 || BX537826 || 5q13.2
AGI_HUM1_OLIGO_A_23_P213458




595
Hs.510262
Membrane targeting (tandem) C2 domain
Membrane targeting (tandem) C2 domain containing 1
AGI_HUM1_OLIGO_A_23_P88439






containing 1
|| NM_152334 || 14q32.12





596
Hs.445052
MutS homolog 6 (E.coli)
MutS homolog 6 (E.coli) || BC071594 || 2p16
AGI_HUM1_OLIGO_A_23_P102202




597
Hs.276905
Microtubule associated serine/threonine
Microtubule associated serine/threonine kinase-like
AGI_HUM1_OLIGO_A_23_P201988






kinase-like
|| AK123004 || 10p12.1





598
Hs.105940
Jerky homolog-like (mouse)
Jerky homolog-like (mouse) || NM_003772 || 11q21
AGI_HUM1_OLIGO_A_23_P202737




599
Hs.236774
High mobility group nucleosomal binding
High mobility group nucleosomal binding domain 4
AGI_HUM1_OLIGO_A_23_P19389






domain 4
|| NM_006353 || 6p21.3





600
Hs.386733
Polyribonucleotide nucleotidyltransferase 1
Polyribonucleotide nucleotidyltransferase 1 || BC053660 ||
AGI_HUM1_OLIGO_A_23_P154488







2p15





601
Hs.170673
Epidermal retinal dehydrogenase 2
Epidermal retinal dehydrogenase 2 || NM_138969 || 8q12.1
AGI_HUM1_OLIGO_A_23_P257457
AGI_HUM1_OLIGO_








A_23_P96410



602
Hs.49421
Hypothetical protein FLJ23129
Hypothetical protein FLJ23129 || AK127011 || 1p31.2
AGI_HUM1_OLIGO_A_23_P200670




603
Hs.297413
Matrix metalloproteinase 9 (gelatinase B, 92 kDa
Matrix metalloproteinase 9 (gelatinase B, 92 kDa gelatinase,
AGI_HUM1_OLIGO_A_23_P40174






gelatinase, 92 kDa type IV collagenase)
92 kDa type IV collagenase) BC00  text missing or illegible when filed





604
Hs.163109
Monoamine oxidase A
Monoamine oxidase A || NM_000240 || Xp11.4-p11.3
AGI_HUM1_OLIGO_A_23_P83857




605
Hs.494870
B-box and SPRY domain containing
B-box and SPRY domain containing || AK092607 || 9q32
AGI_HUM1_OLIGO_A_23_P71946




606
Hs.406861
Hydroxysteroid (17-beta) dehydrogenase 4
Hydroxysteroid (17-beta) dehydrogenase 4 || AB208932 ||
AGI_HUM1_OLIGO_A_23_P82954







5q21





607
Hs.496645
Interleukin 13 receptor, alpha 1
Interleukin 13 receptor, alpha 1 || Y10659 || Xq24
AGI_HUM1_OLIGO_A_23_P137196




608
Hs.239154
Ankyrin repeat, family A (RFXANK-like), 2
Ankyrin repeat, family A (RFXANK-like), 2 || NM_023039 ||
AGI_HUM1_OLIGO_A_23_P159011
AGI_HUM1_OLIGO_






5q12-q13

A_23_P41634



609
Hs.2128
Dual specficity phosphatase 5
Dual specficity phosphatase 5 || NM_004419 || 10q25
AGI_HUM1_OLIGO_A_23_P150016




610
Hs.26010
Phosphofructokinase, platelet
Phosphofructokinase, platelet || AK126153 || 10p15.3-p15.2
AGI_HUM1_OLIGO_A_23_P46928




611
Hs.508597
Integrin, beta-like 1 (with EGF-like repeat
Integrin, beta-like 1 (with EGF-like repeat domains)
AGI_HUM1_OLIGO_A_23_P113777






domains)
|| AK095102 || 13q33





612
Hs.435051
Cyclin-dependent kinase inhibitor 2D
Cyclin-dependent kinase inhibitor 2D (p19, inhibits CDK4)
AGI_HUM1_OLIGO_A_23_P89941






(p19, inhibits CDK4)
|| NM_001800 || 19p13





613
Hs.288998
S100 calcium binding protein A14
S100 calcium binding protein A14 || BG674026 || 1q21.3
AGI_HUM1_OLIGO_A_23_P124619




614
Hs.80976
Antigen identified by monoclonal antibody KI-67
Antigen identified by monoclonal antibody KI-67
AGI_HUM1_OLIGO_A_23_P202232







|| NM_002417 || 10q25-qter





615
Hs.512144
Chromosome 6 open reading frame 66
Chromosome 6 open reading frame 66 || CD555939 ||
AGI_HUM1_OLIGO_A_23_P70617







6q16.1





616
Hs.82028
Transforming growth factor, beta receptor II
Transforming growth factor, beta receptor II (70/80 kDa)
AGI_HUM1_OLIGO_A_23_P211957






(70/80 kDa)
|| BX648313 || 3p2





617
Hs.385189
G-2 and S-phase expressed 1
G-2 and S-phase expressed 1 || NM_016426 || 22q13-q13.3
AGI_HUM1_OLIGO_A_23_P57588




618
Hs.368592
Sortilin-related receptor, L(OLR class) A
Sortilin-related receptor, L(OLR class) A repeats-containing
AGI_HUM1_OLIGO_A_23_P87049






repeats-containing
|| NM_003105 || 11q23.2-q24.2





619
Hs.495473
Notch homolog 1, translocation-associated
Notch homolog 1, translocation-associated (Drosophila)
AGI_HUM1_OLIGO_A_23_P60393






(Drosophila)
|| NM_017617 || 9q34.3





620
Hs.381099
Lymphocyte cytosolic protein 1 (L-plastin)
Lymphocyte cytosolic protein 1 (L-plastin) || NM_002298 ||
AGI_HUM1_OLIGO_A_23_P204840







13q14.3





621
Hs.304682
Cystatin C (amyloid angiopathy and cerebral
Cystatin C (amyloid angiopathy and cerebral hemorrhage)
AGI_HUM1_OLIGO_A_23_P154745






hemorrhage)
|| BX647523 || 20p11.21





622
Hs.840
Indoleamine-pyrrole 2,3 dioxygenase
Indoleamine-pyrrole 2,3 dioxygenase || M34455 || 8p12-p11
AGI_HUM1_OLIGO_A_23_P112026




623
Hs.66170
SET and MYND domain containing 2
SET and MYND domain containing 2 || BC049367 || 1q41
AGI_HUM1_OLIGO_A_23_P170587




624
Hs.525157
Tumor necrosis factor (ligand) superfamily,
Tumor necrosis factor (ligand) superfamily, member 13b
AGI_HUM1_OLIGO_A_23_P14174






member 13b
|| NM_006573 || 13q32-34





625
Hs.389374
Hypthetical protein LOC257106
Hypthetical protein LOC257106 || BX537846 || 1q23.3
AGI_HUM1_OLIGO_A_23_P35049




626
Hs.173536
Protein kinase D3
Protein kinase D3 || NM_005813 || 2p21
AGI_HUM1_OLIGO_A_23_P108574




627
Hs.656
Cell division cycle 25C
Cell division cycle 25C || BC039100 || 5q31
AGI_HUM1_OLIGO_A_23_P70249




628
Hs.486502
Neuroblastoma RAS viral (v-ras) oncogene
Neuroblastoma RAS viral (v-ras) oncogene homolog
AGI_HUM1_OLIGO_A_23_P63189






homolog
|| X02751 || 1p13.2





629
Hs.375108
CD24 antigen (small cell lung carcinoma cluster
CD24 antigen (small cell lung carcinoma cluster 4 antigen)
AGI_HUM1_OLIGO_A_23_P114457






4 antigen)
|| AK125531 || 6q21





630
Hs.473838
Down syndrome critical region gene 2
Down syndrome critical region gene 2 || CR624273 || 21q22.3
AGI_HUM1_OLIGO_A_23_P68717




631
Hs.153692
Monogenic, audiogenic seizure susceptibility
Monogenic, audiogenic seizure susceptibility 1 homolog
AGI_HUM1_OLIGO_A_23_P19134






1 homolog (mouse)
(mouse) || AF435925 || 5q13





632
Hs.62185
Solute carrier family 9 (sodium/hydrogen
Solute carrier family 9 (sodium/hydrogen exchanger),
AGI_HUM1_OLIGO_A_23_P22625






exchanger), isoform 8
isoform 8 || BC035029 || Xq26.3





633
Hs.553497
Phosphatidylinositol glycan, class H
Phosphatidylinositol glycan, class H || BC071849 || 14q11-q24
AGI_HUM1_OLIGO_A_23_P2884




634
Hs.535901
Block of proliferation 1
Block of proliferation 1 || NM_015201 || 8q24.3
AGI_HUM1_OLIGO_A_23_P43800




635
Hs.498317
CGI-146 protein
CGI-146 protein || NM_016076 || 1q44
AGI_HUM1_OLIGO_A_23_P201445




636
Hs.513439
Galactosylceramidase (Krabbe disease)
Galactosylceramidase (Krabbe disease) || NM_000153 ||
AGI_HUM1_OLIGO_A_23_P25964







14q31





637
Hs.17109
Integral membrane protein 2A
Integral membrane protein 2A || AB209310 || Xq13.3-Xq21.2
AGI_HUM1_OLIGO_A_23_P171074




638
Hs.348920
FSH primary response (LRPR1 homolog, rat) 1
FSH primary response (LRPR1 homolog, rat) 1
AGI_HUM1_OLIGO_A_23_P252292







|| NM_006733 || Xq22.1





639
Hs.25338
Protease, serine, 23
Protease, serine, 23 || AL832007 || 11q14.1
AGI_HUM1_OLIGO_A_23_P150789




640
Hs.531550
Transducer of ERBB2,1
Transducer of ERBB2,1 || BC031406 || 17q21
AGI_HUM1_OLIGO_A_23_P164179




641
Hs.482390
Transforming growth factor, beta receptor III
Transforming growth factor, beta receptor III (betaglycan,
AGI_HUM1_OLIGO_A_23_P200780






(betaglycan, 300 kDa)
300 kDa) || L07594 || 1p33-p32





642
Hs.72550
Hyaluronan-mediated motility receptor
Hyaluronan-mediated motility receptor (RHAMM)
AGI_HUM1_OLIGO_A_23_P70007






(RHAMM)
|| AF032862 || 5q33.3-qter





643
Hs.416073
S100 calcium binding protein AB (calgranulin A)
S100 calcium binding protein AB (calgranulin A)
AGI_HUM1_OLIGO_A_23_P200288







|| BG739729 || 1q21





644
Hs.424783
Tetratricopeptide repeat domain 13
Tetratricopeptide repeat domain 13 || NM_024525 || 1q42.2
AGI_HUM1_OLIGO_A_23_P103864




645
Hs.534450
ORM1-like 2 (S.cerevisiae)
ORM1-like 2 (S.cerevisiae) || CR621685 || 12q13.2
AGI_HUM1_OLIGO_A_23_P87500




646
Hs.363431
Runt-related transcription factor 1; translocated
Runt-related transcription factor 1; translocated to, 1
AGI_HUM1_OLIGO_A_23_P216307






to, 1 (cyclin D-related)
(cyclin D-related) || NM_004349 || 6q22 || NM_014815 ||








17q21.1





647
Hs.462983
Thyroid hormone receptor associated protein 4
Thyroid hormone receptor associated protein 4
AGI_HUM1_OLIGO_A_23_P124760
AGI_HUM1_OLIGO_






|| NM_014815 || 17q21.1

A_23_P136148



648
Hs.315137
Alanyl-tRNA synthetase
Alanyl-tRNA synthetase || AK222824 || 16q22
AGI_HUM1_OLIGO_A_23_P89020




649
Hs.446450
Integral membrane protein 2B
Integral membrane protein 2B || BX537657 || 13q14.3
AGI_HUM1_OLIGO_A_23_P139934




650
Hs.506748
Hepatoma-derived growth factor (high-mobility
Hepatoma-derived growth factor (high-mobility group
AGI_HUM1_OLIGO_A_23_P149239






group protein 1-like)
protein 1-like) || NM_004494 || 1q21-q23





651
Hs.2316
SRY (sex determining region Y)-box 9
SRY (sex determining region Y)-box 9 (campomelic
AGI_HUM1_OLIGO_A_23_P26843






(campomelic dysplasia, autosomal sex-reversal)
dysplasia, autosomal sex-reversal) || NM_0  text missing or illegible when filed





652
Hs.18676
Sprouty homolog 2 (Drosophila)
Sprouty homolog 2 (Drosophila) || BX648582 || 13q31.1
AGI_HUM1_OLIGO_A_23_P128698




653
Hs.172052
Polo-like kinase 4 (Drosophila)
Polo-like kinase 4 (Drosophila) || NM_014264 || 4q27-q28
AGI_HUM1_OLIGO_A_23_P155968




654
Hs.24950
Regulator of G-protein signalling 5
Regulator of G-protein signalling 5 || NM_003617 || 1q23.1
AGI_HUM1_OLIGO_A_23_P46045
AGI_HUM1_OLIGO_








A_23_P51518



655
Hs.372688
Rho-related BTB domain containing 2
Rho-related BTB domain containing 2 || AB018260 || 8p21.3
AGI_HUM1_OLIGO_A_23_P20423




656
Hs.504765
ETs variant gene 6 (TEL oncogene)
ETs variant gene 6 (TEL oncogene) || NM_001987 || 12p13
AGI_HUM1_OLIGO_A_23_P105264




657
Hs.204096
Secretogoblin, family 10, member 2
Secretogoblin, family 10, member 2 || 11q13
AGI_HUM1_OLIGO_A_23_P150555




658
Hs.512973
Butyrate-induced transcript 1
Butyrate-induced transcript 1 || BX648759 || 15q22.2
AGI_HUM1_OLIGO_A_23_P99924




659
Hs.494648
Testis expressed sequence 10
Testis expressed sequence 10 || AK000294 || 9q31.1
AGI_HUM1_OLIGO_A_23_P11241




660
Hs.499000
DnaJ (Hsp40) homolog, subfamily C, member 1
DnaJ (Hsp40) homolog, subfamily C, member 1
AGI_HUM1_OLIGO_A_23_P127128







|| CR613772 || 10p12.31





661
Hs.114062
Protein tyrosine phosphatase-like (proline instead
Protein tyrosine phosphatase-like (proline instead of catalytic
AGI_HUM1_OLIGO_A_23_P161352






of catalytic arginin), member a
arginin), member a || AY4556942 ||





662
Hs.522054
Synaptotagmin-like 4 (granuphilin-a)
Synaptotagmin-like 4 (granuphilin-a) || AL832596 ||
AGI_HUM1_OLIGO_A_23_P11136




663
Hs.460095
Chromosome 16 open reading frame 45
Chromosome 16 open reading frame 45 || AK092923 ||
AGI_HUM1_OLIGO_A_23_P326319







15p13.11





664
Hs.161640
Tyrosine aminotransferase
Tyrosine aminotransferase || NM_000353 || 16q22.1
AGI_HUM1_OLIGO_A_23_P206776




665
Hs.129758
Proline-serine-threonine phosphatase interacting
Proline-serine-threonine phosphatase interacting protein 1
AGI_HUM1_OLIGO_A_23_P48997






protein 1
|| CR593209 || 15q24-q25.1





666
Hs.370359
Nuclear factor I/B
Nuclear factor I/B || BX537698 || 9p24.1
AGI_HUM1_OLIGO_A_23_P216448




667
Hs .533444
3-hydroxymethyl-3-methylglutaryl-Coenzyme A
3-hydroxymethyl-3-methylglutaryl-Coenzyme A lyase
AGI_HUM1_OLIGO_A_23_P145






lyase (hydroxymethylglutaricaciduria)
(hydroxymethylglutaricaciduria) || BG0335  text missing or illegible when filed





668
Hs .476052
SNF related kinase
SNF related kinase || CR749621 || 3p22.1
AGI_HUM1_OLIGO_A_23_P211985




669
Hs.24763
RAN binding protein 1
RAN binding protein 1 || AK094410 || 22q11.21
AGI_HUM1_OLIGO_A_23_P91590




670
Hs.432548
Chromosome 10 open reading frame 18
Chromosome 10 open reading frame 18
AGI_HUM1_OLIGO_A_23_P24244







|| XM_374765 || 10p15.1





671
Hs.150693
Activated leukocyte cell adhesion molecule
Activated leukocyte cell adhesion molecule || AL833702 ||
AGI_HUM1_OLIGO_A_23_P41227







3q13.1





672
Hs.274329
TP53 activated protein 1
TP53 activated protein 1 || BC068535 || 7q21.1
AGI_HUM1_OLIGO_A_23_P145895




673
Hs.193491
Tubulin, beta 6
Tubulin, beta 6 || AK092677 || 18p11.21
AGI_HUM1_OLIGO_A_23_P254271




674
Hs.414469
Potassium voltage-gated channel, delayed-rectifier,
Potassium voltage-gated channel, delayed-rectifier, subfamily
AGI_HUM1_OLIGO_A_23_P120105






subfamily S, member 3
S, member 3 || BC015947 || 2p24





675
Hs.46720
Transmembrane protease, serine 5 (spinesin)
Transmembrane protease, serine 5 (spinesin) || AF495727 ||
AGI_HUM1_OLIGO_A_23_P52797







11q





676
Hs.549192
Zinc finger, FYVE domain containing 21
Zinc finger, FYVE domain containing 21 || AK055909 ||
AGI_HUM1_OLIGO_A_23_P14273







14q32.33





677
Hs.4 69030
Methylenetetrahydrofolate dehydrogenase (NADP+
Methylenetetrahydrofolate dehydrogenase (NADP+
AGI_HUM1_OLIGO_A_23_P120315






dependent) 2, methenyltetrahydrofolate cycl  text missing or illegible when filed
dependent) 2, methenyltetrahydrofolate cycl  text missing or illegible when filed





678
Hs.438824
CK2 interacting protein 1; HQ0024c protein
CK2 interacting protein 1; HQ0024c protein || AK125609 ||
AGI_HUM1_OLIGO_A_23_P35114







1q21.2





679
Hs.266728
Hypothetical protein FLJ13639
Hypothetical protein FLJ13639 || AK023701 || 13q14.3
AGI_HUM1_OLIGO_A_23_P205200




680
Hs.501574
A disintegrin and metalloproteinase domain B
A disintegrin and metalloproteinase domain B || NM_001109 ||
AGI_HUM1_OLIGO_A_23_P115759







10q26.3





681
Hs.529846
Calcium modulating ligand
Calcium modulating ligand || NM_001745 || 5q23
AGI_HUM1_OLIGO_A_23_P213728




682
Hs.497159
Chromosome 1 open reading frame 21
Chromosome 1 open reading frame 21 || NM_030806 || 1q25
AGI_HUM1_OLIGO_A_23_P113161




683
Hs.551530
Trans-prenyltransferase
Trans-prenyltransferase || AB209763 || 10p12.1
AGI_HUM1_OLIGO_A_23_P161152




684
Hs.492314
Lysosomal associated protein transmembrane 4 beta
Lysosomal associated protein transmembrane 4 beta
AGI_HUM1_OLIGO_A_23_P59926







|| BC038117 || 8q22.1





685
Hs.530003
Solute carrier family 2 (facilitated gtucose/fructose
Solute carrier family 2 (facilitated gtucose/fructose
AGI_HUM1_OLIGO_A_23_P160159






transporter), member 5
transporter), member 5 || BC035878 || 1p36.2





686
Hs.60339
N-myristoyltransferase 2
N-myristoyltransferase 2 || NM_004808 || 10p13
AGI_HUM1_OLIGO_A_23_P138686




687
Hs.36761
HRAS-like suppressor
HRAS-like suppressor || BC005856 || 3q29
AGI_HUM1_OLIGO_A_23_P57658




688
Hs.442658
Aurora kinase B
Aurora kinase B || CD049640 || 17p13.1
AGI_HUM1_OLIGO_A_23_P130182




689
Hs.122514
Mitochondrial solute carrier protein
Mitochondrial solute carrier protein || AK127666 || 5p21.2
AGI_HUM1_OLIGO_A_23_P216004




690
Hs.514151
ORM1-like 3 (S.cerevisae)
ORM1-like 3 (S.cerevisae) || AK093063 || 17q12-q21.1
AGI_HUM1_OLIGO_A_23_P129824
AGI_HUM1_OLIGO_








A_23_P38190



691
Hs.520506
F-box protein 5
F-box protein 5 || AK055221 || 6q25-q26
AGI_HUM1_OLIGO_A_23_P8241




692
Hs.175322
Ubiquitin specific protease 13 (isopeptidase T-3)
Ubiquitin specific protease 13 (isopeptidase T-3)
AGI_HUM1_OLIGO_A_23_P40969







|| BC049199 || 3q26.2-q28.3





693
Hs.943
Interleukin 32
Interleukin 32 || BF569086 || 16p13.3
AGI_HUM1_OLIGO_A_23_P15146




694
Hs.255973
CREBBP/EP300 inhibitor 1
CREBBP/EP300 inhibitor 1 || NM_014335 || 15q21.1-q21.2
AGI_HUM1_OLIGO_A_23_P54276




695
Hs.346950
Cellular retinoic acid binding protein 1
Cellular retinoic acid binding protein 1 || AK096006 || 15q24
AGI_HUM1_OLIGO_A_23_P117882




696
Hs.183861
Chromatin modifying protein 4C
Chromatin modifying protein 4C || NM_152284 || 8q21.13
AGI_HUM1_OLIGO_A_23_P43019




697
Hs.276878
Nucleoporin 93 kDa
Nucleoporin 93 kDa || AK056637 || 16q13
AGI_HUM1_OLIGO_A_23_P89056




698
Hs.18616
Leucine zipper protein 5
Leucine zipper protein 5 || BX537845 || 7q36.3
AGI_HUM1_OLIGO_A_23_P168747




699
Hs.153752
Cell division cycle 25B
Cell division cycle 25B || NM_021874 || 20p13
AGI_HUM1_OLIGO_A_23_P210726




700
Hs.378996
Hyperparathyroidism 2 (with jaw tumor)
Hyperparathyroidism 2 (with jaw tumor) || NM_024529 ||
AGI_HUM1_OLIGO_A_23_P137731







1q25





701
Hs.493309
KIAA0020
KIAA0020 || AL832245 || 9p24.2
AGI_HUM1_OLIGO_A_23_P20683




702
Hs.433839
Eukaryotic translation elongation factor 1 alpha 2
Eukaryotic translation elongation factor 1 alpha 2
AGI_HUM1_OLIGO_A_23_P256033







|| AB209064 || 20q13.3





703
Hs.16184
RAD17 homolog (S.pombe)
RAD17 homolog (S.pombe) || AF076838 || 5q113
AGI_HUM1_OLIGO_A_23_P159053




704
Hs.530331
Pyruvate dehydrogenase (lipoamide)alpha 1
Pyruvate dehydrogenase (lipoamide)alpha 1 || BG036317 ||
AGI_HUM1_OLIGO_A_23_P251095







Xp22.2-p22.1





705
Hs.440401
All-trans-13,14-dihydroretinol saturase
All-trans-13,14-dihydroretinol saturase || BC058517 || 2p11.2
AGI_HUM1_OLIGO_A_23_P209946




706
Hs.79018
Chromatin assembly factor 1, subunit A (p150)
Chromatin assembly factor 1, subunit A (p150)
AGI_HUM1_OLIGO_A_23_P208895







|| NM_005486 || 19p13.3





707
Hs.531642
RAB11 family interacting protein 3 (class II)
RAB11 family interacting protein 3 (class II) || BC051380 ||
AGI_HUM1_OLIGO_A_23_P106727







16p13.3





708
Hs.518608
Morf4 family associated protein 1-like 1
Morf4 family associated protein 1-like 1 || AF258591 || 4p16.1
AGI_HUM1_OLIGO_A_23_P133058




709
Hs.446685
Peroxisomal long-chain acyl-coA thioesterase
Peroxisomal long-chain acyl-coA thioesterase || NM_006821 ||
AGI_HUM1_OLIGO_A_23_P3111







14q24.3





710
Hs.523375
KIAA0514
KIAA0514 || NM014696 || 10q11.22
AGI_HUM1_OLIGO_A_23_P98115




711
Hs.301526
Tripartite motif-containing 45
Tripartite motif-containing 45 || AY669488 || 1p13.1
AGI_HUM1_OLIGO_A_23_P160518




712
Hs.413801
Proteasome (prosome, macropain] activator
Proteasome (prosome, macropain] activator subunit 4
AGI_HUM1_OLIGO_A_23_P79628






subunit 4
|| NM_014614 || 2p16.3





713
Hs.48172
Myosin X
Myosin X || AB018342 || 5p15.1-p14.3
AGI_HUM1_OLIGO_A_23_P7596




714
Hs.514167
Keratin 19
Keratin 19 || BG29068 || 17q21.2
AGI_HUM1_OLIGO_A_23_P66798




715
Hs.445890
HSPC163 protein
HSPC163 protein || BX649076 || 1q42.12
AGI_HUM1_OLIGO_A_23_P200507




716
Hs.34114
ATPase, No+/K+ transporting, alpha 2 (+)
ATPase, No+/K+ transporting, alpha 2 (+) polypeptide
AGI_HUM1_OLIGO_A_23_P148879






polypeptide
|| NM_000702 || 1q21-q23





717
Hs.371021
Lysosomal associated multispanning membrane
Lysosomal associated multispanning membrane protein
AGI_HUM1_OLIGO_A_23_P86283






protein
|| CR607037 || 1p34





718
Hs.188569
Zinc linger, DHHC domain containing 13
Zinc linger, DHHC domain containing 13 || BC036020 ||
AGI_HUM1_OLIGO_A_23_P13065







11p15.1





719
Hs.478150
Programmed cell death 10
Programmed cell death 10 || BC002506 || 3q26.1
AGI_HUM1_OLIGO_A_23_P18325




720
Hs.434953
High-mobility group box 2
High-mobility group box 2 || CR600021 || 4q31
AGI_HUM1_OLIGO_A_23_P155765




721
Hs.25640
Claudin 3
Claudin 3 || BM701226 || 7q11.23
AGI_HUM1_OLIGO_A_23_P71017




722
Hs.35198
Ectonucleotide pyrophosphatase/ phosphodiesterase
Ectonucleotide pyrophosphatase/phosphodiesterase 5
AGI_HUM1_OLIGO_A_23_P214244






5 (putative function)
(putative function) || BX647968 || 8p21.1-  text missing or illegible when filed





723
Hs.280932
Peroxisomal biogenesis factor 7
Peroxisomal biogenesis factor 7 || BC031606 || 6q21-q22.2
AGI_HUM1_OLIGO_A_23_P93543




724
Hs.292097
SEC15-like 1 (S.cerevisiae)
SEC15-like 1 (S.cerevisiae) AK128190 || 10q23.33
AGI_HUM1_OLIGO_A_23_P169576




725
Hs.515100
Peroxisomal biogenesis factor 11 gamma
Peroxisomal biogenesis factor 11 gamma || AK127684 ||
AGI_HUM1_OLIGO_A_23_P101410







19p13.2





726
Hs.5206
Hypothetical protein FLJ2048519
Hypothetical protein FLJ2048519 || NM_019042 || 7q22.3
AGI_HUM1_OLIGO_A_23_P82478




727
Hs.269944
Mitochondrial carrier homolog 2 (C.elegans)
Mitochondrial carrier homolog 2 (C.elegans) || AY380792 ||
AGI_HUM1_OLIGO_A_23_P84010







11p11.2





728
Hs.336219
Peroxisome biogenesis factor 13
Peroxisome biogenesis factor 13 || AK093866 || 2p14-p16
AGI_HUM1_OLIGO_A_23_P257131




729
Hs.127032
Relaxin 2
Relaxin 2 || NM005059 || 9p24.1
AGI_HUM1_OLIGO_A_23_P216454




730
Hs.515601
Leukocyte immunoglobulin-like receptor, subfamily
Leukocyte immunoglobulin-like receptor, subfamily B
AGI_HUM1_OLIGO_A_23_P208500






B (with TM and ITIM domains), member 6
(with TM and ITIM domains), member 6 ||  text missing or illegible when filed





731
Hs.352018
Transporter 1, ATP-binding cassette, sub-family
Transporter 1, ATP-binding cassette, sub-family B
AGI_HUM1_OLIGO_A_23_P59005






B (MDR/TAP)
(MDR/TAP) BX648013 || 6p21.3





732
Hs.150718
Junctional adhesion molecule 3
Junctional adhesion molecule 3 || NM_032801 || 11q25
AGI_HUM1_OLIGO_A_23_P217998




733
Hs.486835
Chromosome 6 open reading frame 96
Chromosome 6 open reading frame 96 || AK000634 || 6q25.1
AGI_HUM1_OLIGO_A_23_P70708




734
Hs.277035
Monoglyceride lipase
Monoglyceride lipase || NM_007283 || 3q21.3
AGI_HUM1_OLIGO_A_23_P80438




735
Hs.326035
Early growth response 1
Early growth response 1 || NM_001964 || 5q31.1
AGI_HUM1_OLIGO_A_23_P214080




736
Hs.130759
Phospholipid scramblase 1
Phospholipid scramblase 1 || AB006746 || 3q23
AGI_HUM1_OLIGO_A_23_P89109




737
Hs.365861
Kelch-like 7 (Drosophila)
Kelch-like 7 (Drosophila) || NM_018846 || 7p15.3
AGI_HUM1_OLIGO_A_23_P215517




738
Hs.498397
CGI-49 protein
CGI-49 protein || NM_016002 || 1q44
AGI_HUM1_OLIGO_A_23_P62807




739
Hs.436367
Laminin, alpha 3
Laminin, alpha 3 || NM_198129 || 18q11.2
AGI_HUM1_OLIGO_A_23_P89780




740
Hs.492555
Enhancer of yellow 2 homolog (Drosophila)
Enhancer of yellow 2 homolog (Drosophila) || AK095651 ||
AGI_HUM1_OLIGO_A_23_P82748







8q23.1





741
Hs.528334
Fatty acid amide hydrolase
Fatty acid amide hydrolase || NM_001441 || 1p35-p34
AGI_HUM1_OLIGO_A_23_P103223




742
Hs.546366
Carbohydrate (chondroitin 4) sulfotransferase 11
Carbohydrate (chondroitin 4) sulfotransferase 11
AGI_HUM1_OLIGO_A_23_P139919







|| AL833176 || 12q





743
Hs.31439
Serine protease inhibitor, Kunitz type, 2
Serine protease inhibitor, Kunitz type, 2 || AK127479 ||
AGI_HUM1_OLIGO_A_23_P27795







19q13.1





744
Hs.271135
ATP synthase, H+ transporting, mitochondrial
ATP synthase, H+ transporting, mitochondrial F1 complex,
AGI_HUM1_OLIGO_A_23_P63649






F1 complex, gamma polpeptide 1
gamma polpeptide 1 || BF13167 ||





745
Hs.335614
SEC14-like 2 (S.cerevisiae)
SEC14-like 2 (S.cerevisiae) || AB033012 || 22q12.2
AGI_HUM1_OLIGO_A_23_P17808




746
Hs.198363
MCM10 minichromosome maintenance deficient
MCM10 minichromosome maintenance deficient 10
AGI_HUM1_OLIGO_A_23_P161474






10 (S.cerevisiae)
(S.cerevisiae)|| AL136840 || 10p13





747
Hs.80342
Keratin 15
Keratin 15 || AK122864 || 17q21.2
AGI_HUM1_OLIGO_A_23_P27133




748
Hs.224607
Syndecan 1
Syndecan 1 || NM_001006946 || 2p24.1
AGI_HUM1_OLIGO_A_23_P16944




749
Hs.463421
ATP-binding cassette, sub-family C (CFTR/MRP),
ATP-binding cassette, sub-family C (CFTR/MRP), member 3
AGI_HUM1_OLIGO_A_23_P207507






member 3
|| NM_020038 || 17q22





750
Hs.110675
Apolipoprotein C-  text missing or illegible when filed
Apolipoprotein C-  text missing or illegible when filed  || AJ249921 || 19q13.2
AGI_HUM1_OLIGO_A_23_P4649




751
Hs.337295
Stress-induced-phosphoprotein 1 (Hsp70/Hsp90-
Stress-induced-phosphoprotein 1 (Hsp70/Hsp90-
AGI_HUM1_OLIGO_A_23_P113078
AGI_HUM1_OLIGO_





organizing protein)
organizing protein) || BC039299 || 11q13

A_23_P124470



752
Hs.130989
Sodium channel, nonvoltage-gated 1 alpha
Sodium channel, nonvoltage-gated 1 alpha || AK172792 ||
AGI_HUM1_OLIGO_A_23_P128323







12p13





753
Hs.372914
N-myc downstream regulated gene 1
N-myc downstream regulated gene 1 || AK124709 || 8q24.3
AGI_HUM1_OLIGO_A_23_P20494




754
Hs.59757
Zinc finger protein 281
Zinc finger protein 281 || BC060820 || 1q32.1
AGI_HUM1_OLIGO_A_23_P149615




755
Hs.54697
Cdc42 guanine nucleotide exchange factor (GEF) 9
Cdc42 guanine nucleotide exchange factor (GEF) 9
AGI_HUM1_OLIGO_A_23_P251701







|| NM_015185 || Xq11.2





756
Hs.482625
Cardiomyopathy associated 5
Cardiomyopathy associated 5 || NM_153610 || 5q14.1
AGI_HUM1_OLIGO_A_23_P124946




757
Hs.94865
TEA domain family member 4
TEA domain family member 4 || NM_003213 || 12p13.2-p13.3
AGI_HUM1_OLIGO_A_23_P32758




758
Hs.435063
Rho GTPase activating protein 22
Rho GTPase activating protein 22 || BC047096 || 10q11.22
AGI_HUM1_OLIGO_A_23_P75310




759
Hs.524138
Brain-specific angiogenesis inhibitor 2
Brain-specific angiogenesis inhibitor 2 || NM_001703 || 1p35
AGI_HUM1_OLIGO_A_23_P149019




760
Hs.8859
Calcium activated nucleotidase 1
Calcium activated nucleotidase 1 || NM_138793 || 17q25.3
AGI_HUM1_OLIGO_A_23_P267556




761
Hs.165195
VAMP (vesicle-associated membrane protein)-
VAMP (vesicle-associated membrane protein)-associated
AGI_HUM1_OLIGO_A_23_P207957






associated protein A, 33 kDa
protein A, 33 kDa || NM_003574 || 18p





762
Hs.3416
Adipose differentiation-related protein
Adipose differentiation-related protein || NM_001122 || 9p22.1
AGI_HUM1_OLIGO_A_23_P134953




763
Hs.497788
Glutamyl-prolyl-tRNA synthetase
Glutamyl-prolyl-tRNA synthetase || NM_004446 || 1q41-q42
AGI_HUM1_OLIGO_A_23_P97632
AGI_HUM1_OLIGO_








A_23_P94795



764
Hs.501140
KIAA1598
KIAA1598 || AK09178 || 10q25.3
AGI_HUM1_OLIGO_A_23_P202587




765
Hs.534395
Plakophilin 3
Plakophilin 3 || NM_007183 || 11p15
AGI_HUM1_OLIGO_A_23_P95810




766
Hs.29058
Hypothetical protein DKF2p751P0423
Hypothetical protein DKF2p751P0423 || XM_291277 ||
AGI_HUM1_OLIGO_A_23_P250212







8p23.1





767
Hs.187376
Tetratricopeptide repeat domain 10
Tetratricopeptide repeat domain 10 || AK126658 || 13q12.1
AGI_HUM1_OLIGO_A_23_P48339




768
Hs.530272
Similar to RIKEN cDNA 573052BL13 gene
Similar to RIKEN cDNA 573052BL13 gene || AK092292 ||
AGI_HUM1_OLIGO_A_23_P146584







9q31.1





769
Hs.11463
UMP-CMP kinase
UMP-CMP kinase || AK025258 ||
AGI_HUM1_OLIGO_A_23_P115366




770
Hs.476415
Adaptor protein containing pH domain, PT8
Adaptor protein containing pH domain, PT8 domain and
AGI_HUM1_OLIGO_A_23_P166663






domain and leucine zipper motif 1
leucine zipper motif 1 || NM_012096 || 3





771
Hs.522373
Gelsolin (amyloidosis, Finnish type)
Gelsolin (amyloidosis, Finnish type) || AK125810 || 9q33
AGI_HUM1_OLIGO_A_23_P255884




772
Hs.532359
Ribosomal protein L5
Ribosomal protein L5 || AK095815 || 1p22.1
AGI_HUM1_OLIGO_A_23_P12133




773
Hs.297638
WD repeat domain 5
WD repeat domain 5 || NM_017588 || 9q34
AGI_HUM1_OLIGO_A_23_P32558




774
Hs.466507
Liver-specific bHLH-Zip transcription factor
Liver-specific bHLH-Zip transcription factor || AK126834 ||
AGI_HUM1_OLIGO_A_23_P142389







19q13.12





775
Hs.533747
Hypothetical protein MGC13183
Hypothetical protein MGC13183 || AK027638 || 12p13.33
AGI_HUM1_OLIGO_A_23_P99172




776
Hs.7736
Mitochondrial ribosomal protein L27
Mitochondrial ribosomal protein L27 || NM_14871 ||
AGI_HUM1_OLIGO_A_23_P49768







17q21.3-q22





777
Hs.89404
Msh homeo box homolog 2 (Drosophila)
Msh homeo box homolog 2 (Drosophila) || D89377 ||
AGI_HUM1_OLIGO_A_23_P213910







5q34-q35





778
Hs.518750
OCIA domain containing 1
OCIA domain containing 1 || AK123529 || 4p11
AGI_HUM1_OLIGO_A_23_P213093




779
Hs.78482
Paralemmin
Paralemmin || NM_002579 || 19p13.3
AGI_HUM1_OLIGO_A_23_P208991




780
Hs.370457
LETM1 domain containing 1
LETM1 domain containing 1 || AK123080 || 12q13.12
AGI_HUM1_OLIGO_A_23_P117037




781
Hs.1600
Chaperonin containing TCP1, subunit 5 (epsilon)
Chaperonin containing TCP1, subunit 5 (epsilon)
AGI_HUM1_OLIGO_A_23_P257863







|| NM_012073 || 5p15.2





782
Hs.6551
ATPase, H+ transporting, lysosomal accessory
ATPase, H+ transporting, lysosomal accessory protein 1
AGI_HUM1_OLIGO_A_23_P250462






protein 1
|| AK090462 || Xq28





783
Hs.211600
Tumor necrosis factor, alpha-induced protein 3
Tumor necrosis factor, alpha-induced protein 3 || BC041790 ||
AGI_HUM1_OLIGO_A_23_P156898







6q23





784
Hs.339811
UDP glycosyltransferase 2 family, polypeptide
UDP glycosyltransferase 2 family, polypeptide B11
AGI_HUM1_OLIGO_A_23_P212968






B11
|| AK124272 || 4q13.2





785
Hs.1051
Granzyme B (granzyme 2, cytotoxic T-lymphocyte-
Granzyme B (granzyme 2, cytotoxic T-lymphocyte-associated
AGI_HUM1_OLIGO_A_23_P117602






associated serine esterase 1)
serine esterase 1) || BQ052893 ||





786
Hs.479491
Putative NFkB activating protein 373
Putative NFkB activating protein 373 || BX647545 || 1p31.2
AGI_HUM1_OLIGO_A_23_P200653




787
Hs.461113
Cirrhosis, autosomal recessive 1A (cirhin)
Cirrhosis, autosomal recessive 1A (cirhin) || AB075868 ||
AGI_HUM1_OLIGO_A_23_P54626







16q22.1





788
Hs.57652
Cardherin, EGF LAG seven-pass G-type receptor 2
Cardherin, EGF LAG seven-pass G-type receptor 2
AGI_HUM1_OLIGO_A_23_P201576






(flamingo homolog, Drosophila)
(flamingo homolog, Drosophila) || AF234887





789
Hs.472847
Chromosome 20 open reading frame 35
Chromosome 20 open reading frame 35 || AL390160 ||
AGI_HUM1_OLIGO_A_23_P213986
AGI_HUM1_OLIGO_






20q13.12

A_23_P28772



790
Hs.445758
E2F transcription factor 5, p130-binding
E2F transcription factor 5, p130-binding || AB209185 || 8q21.2
AGI_HUM1_OLIGO_A_23_P31713




791
Hs.926
Myxovirus (influenza virus) resistance 2 (mouse)
Myxovirus (influenza virus) resistance 2 (mouse)
AGI_HUM1_OLIGO_A_23_P6263







|| AK122952 || 21q22.3





792
Hs.549043
Insulin-like growth factor 2 (somatomedin A)
Insulin-like growth factor 2 (somatomedin A) || AK074614 ||
AGI_HUM1_OLIGO_A_23_P203458







11p15.5





793
Hs.492516
Prefoldin 2
Prefoldin 2 || BF203500 || 1q23.3
AGI_HUM1_OLIGO_A_23_P51906




794
Hs.31564
FRAS1 related extracellular matrix l
FRAS1 related extracellular matrix l || BX648240 || 9p22.3
AGI_HUM1_OLIGO_A_23_P43334




795
Hs.116935
Zinc finger protein 521
Zinc finger protein 521 || AK027354 || 18q11.2
AGI_HUM1_OLIGO_A_23_P159027




796
Hs.488173
Hypothetical protein MGC7036
Hypothetical protein MGC7036 || AK054942 || 12q24.31
AGI_HUM1_OLIGO_A_23_P76109




797
Hs.546467
Epithelial stromal interaction 1 (breast)
Epithelial stromal interaction 1 (breast) || AL831953 || 13q13.3
AGI_HUM1_OLIGO_A_23_P105794




798
Hs.503660
6-pyruvoyltetrahydropterin synthase
6-pyruvoyltetrahydropterin synthase || BG249563 ||
AGI_HUM1_OLIGO_A_23_P127579







11q22.3-q23.3





799
Hs.75573
Centromere protein E, 312 kDa
Centromere protein E, 312 kDa || NM_001813 || 4q24-q25
AGI_HUM1_OLIGO_A_23_P253524




800
Hs.550491
Histone 1, H2ak
Histone 1, H2ak || BC034487 || 8p22-p21.3
AGI_HUM1_OLIGO_A_23_P42220




801
Hs.47649
Methylcrotonoyl-Coenzyme A carboxylase 1
Methylcrotonoyl-Coenzyme A carboxylase 1 (alpha)
AGI_HUM1_OLIGO_A_23_P58036






(alpha)
|| BC042453 || 3q27





802
Hs.398157
Polo-like kinase 2 (Drosophila)
Polo-like kinase 2 (Drosophila) || AF059617 || 5q12.1-q13.2
AGI_HUM1_OLIGO_A_23_P30254




803
Hs.170019
Runt-related transcription factor 3
Runt-related transcription factor 3 || NM_004350 || 1p36
AGI_HUM1_OLIGO_A_23_P51231




804
Hs.371013
Jumonji domain containing 2B
Jumonji domain containing 2B || AB020683 || 19p13.3
AGI_HUM1_OLIGO_A_23_P165051




805
Hs.266273
Chromosome 20 open reading frame 172
Chromosome 20 open reading frame 172 || BC026011 ||
AGI_HUM1_OLIGO_A_23_P165937







20q11.23





806
Hs.202453
V-myc myelocytomatosis viral oncogene homolog
V-myc myelocytomatosis viral oncogene homolog (avian)
AGI_HUM1_OLIGO_A_23_P215956






(avian)
|| NM_002467 || 8q24.12-q24.13





807
Hs.546305
Transcription elongation factor B (SIII),
Transcription elongation factor B (SIII), polypeptide 1
AGI_HUM1_OLIGO_A_23_P60028






polypeptide 1 (15 kDa, elongin C)
(15 kDa, elongin C) || AK057889 || 8q21.11





808
Hs.87016
Hypothetical protein FLJ10647
Hypothetical protein FLJ10647 || BM911450 || 1p34.3
AGI_HUM1_OLIGO_A_23_P62830




809
Hs.404802
Histone deacetylase 11
Histone deacetylase 11 || AL834223 || 3p25.1
AGI_HUM1_OLIGO_A_23_P155358




810
Hs.531668
Chemokine (C-X3-C motif) ligand 1
Chemokine (C-X3-C motif) ligand 1 || AB209037 || 16q13
AGI_HUM1_OLIGO_A_23_P37727




811
Hs.492407
Tyrosine 3-monooxygenase/tryptophan5-
Tyrosine 3-monooxygenase/tryptophan5-monooxygenase
AGI_HUM1_OLIGO_A_23_P71290






monooxygenase activation protein,
activation protein, zeta polypeptide ||





812
Hs.410497
Brain protein 13
Brain protein 13 || BU589543 || 7q21.3
AGI_HUM1_OLIGO_A_23_P122915




813
Hs.369232
Erythrocyte membrane protein band 4.1 like 5
Erythrocyte membrane protein band 4.1 like 5 || BC054508 ||
AGI_HUM1_OLIGO_A_23_P209298







2q14.2





814
Hs.75367
Src-like-adaptor
Src-like-adaptor || BX647569 || 8q22.3-qter
AGI_HUM1_OLIGO_A_23_P216340




815
Hs.119581
V-erb-b2 erythroblastic leukemia viral oncogene
V-erb-b2 erythroblastic leukemia viral oncogene homolog 3
AGI_HUM1_OLIGO_A_23_P203856






homolog 3 (avian)
(avian) || NM_001985 || 12q13





815
Hs.458276
Nuclear factor of kappa light polypeptide gene
Nuclear factor of kappa light polypeptide gene enhancer in
AGI_HUM1_OLIGO_A_23_P30655






enhancer in B-cells inhibitor, epsilon
B-cells inhibitor, epsilon || BC063609





817
Hs.499115
TAR (HIV) RNA binding protein 1
TAR (HIV) RNA binding protein 1 || U38847 || 1q42.3
AGI_HUM1_OLIGO_A_23_P52058




818
Hs.87889
Dicer1, Dcr-1 homolog (Drosophila)
Dicer1, Dcr-1 homolog (Drosophila) || NM_177438 ||
AGI_HUM1_OLIGO_A_23_P37111







14q32.13





819
Hs.56729
Lymphocyte-specific protein 1
Lymphocyte-specific protein 1 || AK056576 || 11p15.5
AGI_HUM1_OLIGO_A_23_P13382




820
Hs.317192
DnaJ (Hsp40)homolog, subfamily B, member 11
DnaJ (Hsp40)homolog, subfamily B, member 11
AGI_HUM1_OLIGO_A_23_P166899







|| BC046500 || 3q27.3





821
Hs.4747
Dyskeratosis congenita 1, dyskerin
Dyskeratosis congenita 1, dyskerin || BC009928 || Xq28
AGI_HUM1_OLIGO_A_23_P137143




822
Hs.192854
Rhotekin
Rhotekin || NM_033046 || 2p13.1
AGI_HUM1_OLIGO_A_23_P120054




823
Hs.444247
Mst3 and SOK1 -related kinase
Mst3 and SOK1 -related kinase || BC070058 || Xq26.2
AGI_HUM1_OLIGO_A_23_P21017




824
HS.181042
Dmx-like 1
Dmx-like 1 || AJ005821 || 5q22
AGI_HUM1_OLIGO_A_23_P113582
AGI_HUM1_OLIGO_








A_23_P250571



825
Hs.171626
S-phase kinase-associated protein 1A (p19A)
S-phase kinase-associated protein 1A (p19A)
AGI_HUM1_OLIGO_A_23_P133424







|| NM_006930 || 5q31





826
Hs.517586
Myoglobin
Myoglobin || BF67063 || 22q13.1
AGI_HUM1_OLIGO_A_23_P6433




827
Hs.406551
Similar to R1KEN cDNA 4921524J17
Similar to R1KEN cDNA 4921524J17 || BX647945 || 16q11.2
AGI_HUM1_OLIGO_A_23_P49279




828
Hs.272848
Hypothetical protein FLJ21019
Hypothetical protein FLJ21019 || AB208939 || 17q21.2
AGI_HUM1_OLIGO_A_23_P152755




829
Hs.516633
NCK-associated protein 1
NCK-associated protein 1 || AB011159 || 2q32
AGI_HUM1_OLIGO_A_23_P73239




830
Hs.333823
Mitochondrial ribosomal protein L13
Mitochondrial ribosomal protein L13 || AK123239 ||
AGI_HUM1_OLIGO_A_23_P44974







8q22.1-q22.3





831
Hs.400095
Heat shock 22 kDa protein 8
Heat shock 22 kDa protein 8 || NM_014365 || 12q24.23
AGI_HUM1_OLIGO_A_23_P162679




832
Hs.386470
Neuromedin B
Neuromedin B || BE781314 || 15q22-qter
AGI_HUM1_OLIGO_A_23_P88522




833
Hs.380403
Polycomb group ring finger 4
Polycomb group ring finger 4 || NM_005180 || 10p11.23
AGI_HUM1_OLIGO_A_23_P115732




834
Hs.201671
SRY (sex determining region Y)-box 13
SRY (sex determining region Y)-box 13 || NM_005686 || 1q32
AGI_HUM1_OLIGO_A_23_P85703




835
Hs.333297
Hypothetical protein LOC339745
Hypothetical protein LOC339745 || BC071613 || 2q22.1
AGI_HUM1_OLIGO_A_23_P79681




836
Hs.241575
N-acetytglucosamine-l-phosphotransferase, gamma
N-acetytglucosamine-l-phosphotransferase, gamma subunit
AGI_HUM1_OLIGO_A_23_P14886






subunit
|| AK126110 || 16p13.3





837
Hs.526735
Zinc finger, MYND domain containing 10
Zinc finger, MYND domain containing 10 || AB209621 ||
AGI_HUM1_OLIGO_A_23_P29663







3p21.3





838
Hs.149443
Cytochrome b-561 domain containing 2
Cytochrome b-561 domain containing 2 || BX641103 || 3p21.3
AGI_HUM1_OLIGO_A_23_P121326




833
Hs.260041
O-acetytransferase
O-acetytransferase || BC06384 || 7q21.3
AGI_HUM1_OLIGO_A_23_P215607




840
Hs.276770
CD52 antigen (CAMPATH-1 antigen)
CD52 antigen (CAMPATH-1 antigen) || BU739882 || 1p36
AGI_HUM1_OLIGO_A_23_P85800




841
Hs.204749
Protein tyrosine phosphatase, non-receptor type 14
Protein tyrosine phosphatase, non-receptor type 14
AGI_HUM1_OLIGO_A_23_P149111







|| NM_005401 || 1q32.2





842
Hs.111903
Fc fragment of IgG, receptor, transporter, alpha
Fc fragment of IgG, receptor, transporter, alpha || AK074734 ||
AGI_HUM1_OLIGO_A_23_P55936







19q13.3





843
Hs.505077
Chromosome 12 open reading frame 11
Chromosome 12 open reading frame 11 || BC003081 ||
AGI_HUM1_OLIGO_A_23_P36464







12p11.23





844
Hs.282984
Dehydrogenase/reductase (SDR family) member 8
Dehydrogenase/reductase (SDR family) member 8
AGI_HUM1_OLIGO_A_23_P21644







|| AY358553 || 4q22.1





845
Hs.283683
Chromosome 8 open reading frame 4
Chromosome 8 open reading frame 4 || CR60070 || 8p11.2
AGI_HUM1_OLIGO_A_23_P253345




846
Hs.444028
Cytoskeleton associated protein 2
Cytoskeleton associated protein 2 || NM_018204 || 13q14
AGI_HUM1_OLIGO_A_23_P151405




847
Hs.18442
E-1 enzyme
E-1 enzyme || AF113125 || 4q21.3
AGI_HUM1_OLIGO_A_23_P121806




848
Hs.127788
Hypohetical protein FLJ12078
Hypohetical protein FLJ12078 || BX538123 || 5q15
AGI_HUM1_OLIGO_A_23_P156067




849
Hs.15590
Cathepsin F
Cathepsin F || BC013359 || 11q13
AGI_HUM1_OLIGO_A_23_P24433




850
Hs.26530
Serum deprivation response (phosphatidylserine
Serum deprivation response (phosphatidylserine binding
AGI_HUM1_OLIGO_A_23_P72668






binding protein)
protein) || NM_004657 || 2q32-q33





851
Hs.127799
Baculoviral IAP repeat-containing-3
Baculoviral IAP repeat-containing-3 || NM_001165 || 11q22
AGI_HUM1_OLIGO_A_23_P98350




852
Hs.525709
Hypothetical protein FLJ20607
Hypothetical protein FLJ20607 || BQ935360 || 12q24.22
AGI_HUM1_OLIGO_A_23_P76538




653
Hs.237856
Solute carrier family 15, member 3
Solute carrier family 15, member 3 || AK127216 || 11q12.2
AGI_HUM1_OLIGO_A_23_P75780




854
Hs.2785
Keratin 17
Keratin 17 || BX647923 || 17q12-q21
AGI_HUM1_OLIGO_A_23_P96149




855
Hs.145575
Ubiquitin-like 3
Ubiquitin-like 3 || BC044582 || 13q12-q13
AGI_HUM1_OLIGO_A_23_P140029




856
Hs.22543
Ubiquitin protein ligase E3A (human papilloma
Ubiquitin protein ligase E3A (human papilloma virus E6-
AGI_HUM1_OLIGO_A_23_P48790






virus E6-associated protein, Angelman syndrome
associated protein, Angelman syndrome





857
Hs.5210
Glia maturation factor, gamma
Glia maturation factor, gamma || BG259135 || 19q13.2
AGI_HUM1_OLIGO_A_23_P208866




858
Hs.408557
Elongation of very long chain fatty acids
Elongation of very long chain fatty acids (FEN1/Elo2, SUR4/
AGI_HUM1_OLIGO_A_23_P251606






(FEN1/Elo2, SUR4/Elo3, yeast)-like 2
Elo3, yeast)-like 2 || BC0502776 || 6p  text missing or illegible when filed





859
Hs.389438
KIAA0590 gene product
KIAA0590 gene product || AB209020 || 16p13.3
AGI_HUM1_OLIGO_A_23_P140725




860
Hs.412019
Chromosome 6 open reading frame 80
Chromosome 6 open reading frame 80 || AK092592 ||
AGI_HUM1_OLIGO_A_23_P31085







6q23.1-q24.1





861
Hs.412842
Chromosome 10 open reading frame 7
Chromosome 10 open reading frame 7 || AK023925 || 10p13
AGI_HUM1_OLIGO_A_23_P150035




862
Hs.308122
Inositol 1,3,4-triphosphate 5/6 kinase
Inositol 1,3,4-triphosphate 5/6 kinase || AK024887 || 14q31
AGI_HUM1_OLIGO_A_23_P37399




863
Hs.484738
Myosin regulatory light chain interacting protein
Myosin regulatory light chain interacting protein
AGI_HUM1_OLIGO_A_23_P31034







|| NM_013262 || 6p23-p22.3





864
Hs.182385
Hepsin (transmembrane protease, serine 1)
Hepsin (transmembrane protease, serine 1) || AK125670 ||
AGI_HUM1_OLIGO_A_23_P101801







19q11-q13.2





855
Hs.446528
Ribosomal protein S4, X-linked
Ribosomal protein S4, X-linked || BM994563 || Xq13.1
AGI_HUM1_OLIGO_A_23_P125519




865
Hs.376208
Lymphotoxin beta (TNF superfamily, member 3)
Lymphotoxin beta (TNF superfamily, member 3)
AGI_HUM1_OLIGO_A_23_P93348







|| AK095821 || 6p21.3





867
Hs.435535
Zinc finger protein 355
Zinc finger protein 355 || NM_018660 || 8p21.1
AGI_HUM1_OLIGO_A_23_P146077
AGI_HUM1_OLIGO_








A_23_P157460



868
Hs.438362
EPS8-like 1
EPS8-like 1 || AF370395 || 19q13.42
AGI_HUM1_OLIGO_A_23_P208779




869
Hs.103527
SH2 domain protein 2A
SH2 domain protein 2A || NM_003975 || 1q21
AGI_HUM1_OLIGO_A_23_P160618




870
Hs.122926
Nuclear receptor subfamily 3, group C. member 1
Nuclear receptor subfamily 3, group C. member 1
AGI_HUM1_OLIGO_A_23_P214059






(glucocorticoid receptor)
(glucocorticoid receptor) || NM_000175 || 5q3  text missing or illegible when filed





871
Hs.544577
Glyceraldehyde-3-phosphate dehydrogenase
Glyceraldehyde-3-phosphate dehydrogenase || BF983396 ||
AGI_HUM1_OLIGO_A_23_P13897







12p13





872
Hs.31130
Transmembrane 7 superfamily member 2
Transmembrane 7 superfamily member 2 || AF023676 ||
AGI_HUM1_OLIGO_A_23_P127426







11q13





873
Hs.1908
Proteoglycan 1, secretory granule
Proteoglycan 1, secretory granule || CD359027 || 10q22.1
AGI_HUM1_OLIGO_A_23_P86653




874
Hs.487471
Hypothetical protein FLJ20171
Hypothetical protein FLJ20171 || BX647570 || 8q22.1
AGI_HUM1_OLIGO_A_23_P259127




875
Hs.23198
Chromosome 1 open reading frame 31
Chromosome 1 open reading frame 31 || CR602593 || 1q42.2
AGI_HUM1_OLIGO_A_23_P63459




876
Hs.149305
Hypothetical protein MGC2603
Hypothetical protein MGC2603 || AK024326 || 1p36.11
AGI_HUM1_OLIGO_A_23_P160537




877
Hs.522891
Chemokine (C-X-C motif) ligand 12 (stromal cell-
Chemokine (C-X-C motif) ligand 12 (stromal cell-derived
AGI_HUM1_OLIGO_A_23_P202448






derived factor 1)
factor 1) || BX647204 || 10q11.1





878
Hs.58324
A disintegrin-like and metalloprotease (reprolysin
A disintegrin-like and metalloprotease (reprolysin type)
AGI_HUM1_OLIGO_A_23_P40415






type) with thrombospondin type 1 motif, 5 (agg
with thrombospondin type 1 motif, 5 (agg  text missing or illegible when filed





879
Hs.88297
Serine/threonine kinase 17b (apoptosis-inducing)
Serine/threonine kinase 17b (apoptosis-inducing)
AGI_HUM1_OLIGO_A_23_P154367







|| BC052561 || 2q32.3





880
Hs.516777
SH3-domain binding protein 4
SH3-domain binding protein 4 || BC057396 || 2q37.1-q37.2
AGI_HUM1_OLIGO_A_23_P79259




881
Hs.492869
Family with sequence similarity 49, member B
Family with sequence similarity 49, member B || CR749628 ||
AGI_HUM1_OLIGO_A_23_P43255







8q24.21





862
Hs.508148
Abl-interactor 1
Abl-interactor 1 || NM_005470 || 10p11.2
AGI_HUM1_OLIGO_A_23_P126992




883
Hs.44298
Mitochondrial ribosomal protein S17
Mitochondrial ribosomal protein S17 || AK026553 || 7p11
AGI_HUM1_OLIGO_A_23_P258321




884
Hs.368610
3′-phosphoadenosine 5′-phosphosulfate synthase 1
3′-phosphoadenosine 5′-phosphosulfate synthase 1
AGI_HUM1_OLIGO_A_23_P144465







|| NM_005443 || 4q24





885
Hs.442657
TBC1 domain family, member 8 (with GRAM
TBC1 domain family, member 8 (with GRAM domain)
AGI_HUM1_OLIGO_A_23_P253281






domain)
|| AB024057 || 2q11.2





885
Hs.473721
Solute carrier family 2 (facilitated glucose
Solute carrier family 2 (facilitated glucose transporter),
AGI_HUM1_OLIGO_A_23_P571






transporter), member 1
member 1 || NM_006516 || 1p35-p31.3





887
Hs.511425
Signal recognition particle 9 kDa
Signal recognition particle 9 kDa || BC064351 || 1q42.12
AGI_HUM1_OLIGO_A_23_P45928




888
Hs.515126
Intercellular adhesion molecule 1 (CD54), human
Intercellular adhesion molecule 1 (CD54), human rhinovirus
AGI_HUM1_OLIGO_A_23_P153320






rhinovirus receptor
receptor || BC015969 || 19p13.3-p  text missing or illegible when filed





889
Hs.9315
Olfactomedin-like 3
Olfactomedin-like 3 || AK075544 || 1p13.2
AGI_HUM1_OLIGO_A_23_P115172




890
Hs.533736
RNA binding motif protein 7
RNA binding motif protein 7 || AB209753 || 11q23.1-q23.2
AGI_HUM1_OLIGO_A_23_P138975




891
Hs.532253
F-box protein 16
F-box protein 16 || NM_172366 || 8p21.1
AGI_HUM1_OLIGO_A_23_P168846




592
Hs.298813
Scavenger receptor class B, member 1
Scavenger receptor class B, member 1 || AB209436 ||
AGI_HUM1_OLIGO_A_23_P203900







12q24.31





893
Hs.110757
DNA segment on chromosome 21 (unique) 2056
DNA segment on chromosome 21 (unique) 2056 expressed
AGI_HUM1_OLIGO_A_23_P80129






expressed sequence
sequence || NM_003683 || 21q22.3





894
Hs.48924
Armadillo repeat containing, X-linked 2
Armadillo repeat containing, X-linked 2 || BC052628 ||
AGI_HUM1_OLIGO_A_23_P73750







Xq21.33-q22.2





895
Hs.435560
SCY1-like 3 (S.cerevisiae)
SCY1-like 3 (S.cerevisiae) || BX647352 || 1q24.2
AGI_HUM1_OLIGO_A_23_P74320




895
Hs.19492
Protocadherin 8
Protocadherin 8 || AF061573 || 13q14.3-q21.1
AGI_HUM1_OLIGO_A_23_P36985




897
Hs.523852
Cyclin D1 (PRAD1: parathyroid adenomatosis 1)
Cyclin D1 (PRAD1: parathyroid adenomatosis 1)
AGI_HUM1_OLIGO_A_23_P202837







|| NM_053056 || 11q13





898
Hs.177576
Hypothetical protein MGC52110
Hypothetical protein MGC52110 || AK128366 ||
AGI_HUM1_OLIGO_A_23_P28507




893
Hs.549185
PEST-containing nuclear protein
PEST-containing nuclear protein || BX647886 || 3q12.3
AGI_HUM1_OLIGO_A_23_P155332




900
Hs.493919
Myelin protein zero-like 1
Myelin protein zero-like 1 || NM_003953 || 1q24.2
AGI_HUM1_OLIGO_A_23_P11874




901
Hs.514718
Chromosome 18 open reading frame 43
Chromosome 18 open reading frame 43 || CR627465 ||
AGI_HUM1_OLIGO_A_23_P38677







18p11.21





902
Hs.171695
Dual specificity phosphatase 1
Dual specificity phosphatase 1 || AK127679 || 5q24
AGI_HUM1_OLIGO_A_23_P110712




903
Hs.443057
CD53 antgen
CD53 antgen || BC035456 || 1p13
AGI_HUM1_OLIGO_A_23_P74538




904
Hs.369779
Sirtuin (silent mating type information regulation 2
Sirtuin (silent mating type information regulation 2 homolog)
AGI_HUM1_OLIGO_A_23_P98022






homolog) 1 (S.cerevisiae)
1 (S.cerevisiae) || NM_012238 || 10





905
Hs.458283
Glutaredoxin 2
Glutaredoxin 2 || BM908128 || 1q31.2-q31.3
AGI_HUM1_OLIGO_A_23_P160503




906
Hs.519346
Erbb2 interacting protein
Erbb2 interacting protein || NM_018695 || 5q12.3
AGI_HUM1_OLIGO_A_23_P30175




907
Hs.695
Cystatin B (stefin B)
Cystatin B (stefin B) || CR591371 || 21q22.3
AGI_HUM1_OLIGO_A_23_P154889




908
Hs.241517
Polymerase (DNA directed), theta
Polymerase (DNA directed), theta || CR936627 || 3q13.33
AGI_HUM1_OLIGO_A_23_P218827




909
Hs.514681
Mitogen-activated protein kinase kinase 4
Mitogen-activated protein kinase kinase 4 || AK131544 ||
AGI_HUM1_OLIGO_A_23_P152687







17p11.2





910
Hs.369089
Collagen, type IV, alpha 5 (Alport syndrome)
Collagen, type IV, alpha 5 (Alport syndrome) || NM_033380 ||
AGI_HUM1_OLIGO_A_23_P45365







Xq22





911
Hs.171425
Nuclear receptor coactivator 7
Nuclear receptor coactivator 7 || AL834442 || 6q2.32
AGI_HUM1_OLIGO_A_23_P156957




912
Hs.481704
Hypothetical protein FLJ20152
Hypothetical protein FLJ20152 || AL832438 || 5p15.1
AGI_HUM1_OLIGO_A_23_P167599




913
Hs.146246
Hypothetical protein MGC457580
Hypothetical protein MGC457580 || NM_173833 | 6p21.1
AGI_HUM1_OLIGO_A_23_P94103




914
Hs.532851
Ribonuclease H, large subunit
Ribonuclease H, large subunit || CR619517 || 9p13.13
AGI_HUM1_OLIGO_A_23_P164826




915
Hs.308613
CGI-12 protein
CGI-12 protein || NM_015942 || 8q22.1
AGI_HUM1_OLIGO_A_23_P43071




916
Hs.156316
Decorin
Decorin || NM_001920 || 12q13.2
AGI_HUM1_OLIGO_A_23_P54873




917
Hs.104476
Hypothetical protein MGC17299
Hypothetical protein MGC17299 || BC072393 || 1p34.2
AGI_HUM1_OLIGO_A_23_P115022




918
Hs.437388
Phosphatidylinositol glycan, class T
Phosphatidylinositol glycan, class T || AK123590 ||
AGI_HUM1_OLIGO_A_23_P79842







20q12-q13.12





919
Hs.246381
CD68 antigen
CD68 antigen || BC015557 || 17p13
AGI_HUM1_OLIGO_A_23_P15394




920
Hs.2962
S100 calsium binding protein P
S100 calsium binding protein P || CA313584 || 4p16
AGI_HUM1_OLIGO_A_23_P58266




921
Hs.518805
High mobility group AT-hook 1
High mobility group AT-hook 1 || BC078664 || 6p21
AGI_HUM1_OLIGO_A_23_P42331




922
Hs.438102
Insulin-like growth factor binding protein 2, 36 kDa
Insulin-like growth factor binding protein 2, 36 kDa
AGI_HUM1_OLIGO_A_23_P119943







|| AB209509 || 2q33-q34





923
Hs.418367
Neuromedin U
Neuromedin U || BG034907 || 4q12
AGI_HUM1_OLIGO_A_23_P69537




924
Hs.396644
Poly(A) binding protein interacting protein 2
Poly(A) binding protein interacting protein 2 || BC048106 ||
AGI_HUM1_OLIGO_A_23_P213754







5q31.2





925
Hs.530792
GTP cyclohydrolase I feedback regulator
GTP cyclohydrolase I feedback regulator || BC027487 || 15q15
AGI_HUM1_OLIGO_A_23_P77328




926
Hs.154073
Solute carrier family 35, member B1
Solute carrier family 35, member B1 || AK124975 || 17q21.33
AGI_HUM1_OLIGO_A_23_P89455




927
Hs.76057
UDP-galactose-4-epimerase
UDP-galactose-4-epimerase || AK057302 || 1p36-p35
AGI_HUM1_OLIGO_A_23_P160148




928
Hs.211800
Component of oligomeric golgi complex 2
Component of oligomeric golgi complex 2 || AL832190 ||
AGI_HUM1_OLIGO_A_23_P160807







1q42.2





929
Hs.470943
Signal transducer and activator of transcription 1,
Signal transducer and activator of transcription 1, 91 kDa
AGI_HUM1_OLIGO_A_23_P56630






91 kDa
|| NM_007315 || 2q32.2





930
Hs.54416
Sine oculis homeobox homolog 1 (Drosophila)
Sine oculis homeobox homolog 1 (Drosophila)
AGI_HUM1_OLIGO_A_23_P78914







|| AK093780 || 14q23.1





931
Hs.23077
Choline phosphotransferase 1
Choline phosphotransferase 1 || AK025141 || 12q
AGI_HUM1_OLIGO_A_23_P105571




932
Hs.435991
Chromosome 4 open reading frame 16
Chromosome 4 open reading frame 16 || BX847702 || 4q25
AGI_HUM1_OLIGO_A_23_P69788




933
Hs.429656
CCAAT/enhancer binding protein (C/EBP), gamma
CCAAT/enhancer binding protein (C/EBP), gamma
AGI_HUM1_OLIGO_A_23_P208801







|| NM_001806 || 19q13.11





934
Hs.513633
G protein-coupled rcceptor 56
G protein-coupled rcceptor 56 || NM_201524 || 16q13
AGI_HUM1_OLIGO_A_23_P206280




935
Hs.46679
CUE domain containing 1
CUE domain containing 1 || CR627470 || 17q23.2
AGI_HUM1_OLIGO_A_23_P118384




936
Hs.525232
Low density lipoprotein receptor-related protein 10
Low density lipoprotein receptor-related protein 10
AGI_HUM1_OLIGO_A_23_P205493







|| NM_014045 || 14q11.2





937
Hs.504641
CD163 antigen
CD163 antigen || X22970 || 12p13.2
AGI_HUM1_OLIGO_A_23_P33723




938
Hs.411641
Eukaryotic translation initiation factor 4E binding
Eukaryotic translation initiation factor 4E binding protein 1
AGI_HUM1_OLIGO_A_23_P22224






protein 1
|| BM564526 || 8p12





939
Hs.125474
Leupaxin
Leupaxin || BC034230 || 11q12.1
AGI_HUM1_OLIGO_A_23_P87150




940
Hs.517307
Myxovirus (influenza virus) resistance 1,
Myxovirus (influenza virus) resistance 1, interferon-inducible
AGI_HUM1_OLIGO_A_23_P17853






interferon-inducible protein p78 (mouse)
protein p78 (mouse) || AK095355 ||





941
Hs.146602
Low molecular mass ubiquinone-binding protein
Low molecular mass ubiquinone-binding protein (9.5 kDa)
AGI_HUM1_OLIGO_A_23_P213716






(9.5 kDa)
|| BM701597 || 5q31.1





942
Hs.473117
Chromosome 20 open reading frame 17
Chromosome 20 open reading frame 17 || NM_173485 ||
AGI_HUM1_OLIGO_A_23_P154627







20q13.2





943
Hs.124027
Selenophosphate synthetase 1
Selenophosphate synthetase 1 || AK125066 || 10p14
AGI_HUM1_OLIGO_A_23_P150092




944
Hs.517603
Manic fringe homolog (Drosophila)
Manic fringe homolog (Drosophila) || U94352 || 22q12
AGI_HUM1_OLIGO_A_23_P103100




945
Hs.15299
HMBA-inducible
HMBA-inducible || AB021179 || 17q21.31
AGI_HUM1_OLIGO_A_23_P118552




04S
Hs.503721
Dynein, cytoplasmic, heavy polypeptide 2
Dynein, cytoplasmic, heavy polypeptide 2 || XM_370652 ||
AGI_HUM1_OLIGO_A_23_P147397







11q21-q22.1





947
Hs.77828
START domain containing 3
START domain containing 3 || AL831952 || 17q11-q12
AGI_HUM1_OLIGO_A_23_P118451




948
Hs.158529
Calsyntenin 2
Calsyntenin 2 || AJ278018 || 3q23-q24
AGI_HUM1_OLIGO_A_23_P212808




949
Hs.443831
Programmed cell death 5
Programmed cell death 5 || AB209040 || 19q12-q13.1
AGI_HUM1_OLIGO_A_23_P50608




950
Hs.497183
Influenza virus NS1A binding protein
Influenza virus NS1A binding protein || NM_016389 ||
AGI_HUM1_OLIGO_A_23_P137514







1q25.1q31.1





551
Hs.480042
Annexin A3
Annexin A3 || AB209868 || 4q13-q22
AGI_HUM1_OLIGO_A_23_P121716




952
Hs.512660
C-type lectin domain family 11, member A
C-type lectin domain family 11, member A || BM719769 ||
AGI_HUM1_OLIGO_A_23_P153487







19q13.3





953
Hs.357567
Hypothetical protein LOC130576
Hypothetical protein LOC130576 || NM_177964 || 2q23.3
AGI_HUM1_OLIGO_A_23_P79302




954
Hs.212102
Protein disulfide isomerase-associated B
Protein disulfide isomerase-associated B || AK127433 ||
AGI_HUM1_OLIGO_A_23_P56956







2p25.1





955
Hs.512464
Surfeit 1
Surfeit 1 || BM923055 || 9q34.2
AGI_HUM1_OLIGO_A_23_P20648




958
Hs.459615
Septin 10
Septin 10 || AB208875 || 2q13
AGI_HUM1_OLIGO_A_23_P43175




957
Hs.520004
Discoidin domain receptor family, member 1
Discoidin domain receptor family, member 1 || NM_013994 ||
AGI_HUM1_OLIGO_A_23_P93311







6p21.3





958
Hs.126774
RA-regulated nuclear matrix-associated protein
RA-regulated nuclear matrix-associated protein
AGI_HUM1_OLIGO_A_23_P10385
AGI_HUM1_OLIGO_






|| NM_016448 ||

A_23_P84620



959
Hs.127445
Lipase A, lysosomal acid, cholesterol esterase
Lipase A, lysosomal acid, cholesterol esterase (Wolman
AGI_HUM1_OLIGO_A_23_P97865






(Wolman disease)
disease) || AK091558 || 1023.2-q23.3





950
Hs.414362
Cytochrome b5 reductase b5R.2
Cytochrome b5 reductase b5R.2 || AB209000 || 11p15.4
AGI_HUM1_OLIGO_A_23_P2181




951
Hs.483136
COMM domain containing 10
COMM domain containing 10 || BC036897 || 5q23.1
AGI_HUM1_OLIGO_A_23_P252403




962
Hs.243678
SRY (sex determining region Y)-box B
SRY (sex determining region Y)-box B || NM_014587 ||
AGI_HUM1_OLIGO_A_23_P66137







16p13.3





963
Hs.301350
FXYD domain containing ion transport regulator 3
FXYD domain containing ion transport regulator 3
AGI_HUM1_OLIGO_A_23_P209043







|| BF676327 || 19q13.11-q13.12





964
Hs.407135
Adenosine deaminase
Adenosine deaminase || 20q12-q13.11
AGI_HUM1_OLIGO_A_23_P210482




965
Hs.184523
Serine/threonine kinase 38 like
Serine/threonine kinase 38 like || AB023182 || 12p11.23
AGI_HUM1_OLIGO_A_23_P64743




955
Hs.134665
Schwannomin interacting protein 1
Schwannomin interacting protein 1 || BC848179 ||
AGI_HUM1_OLIGO_A_23_P257031







3q25.32-q25.33





967
Hs.412636
Factor for adipocyte differentiation 158
Factor for adipocyte differentiation 158 || BC036122 || 1p22.2
AGI_HUM1_OLIGO_A_23_P103773




968
Hs.502338
Solute carrier family 1 (glial high affinity glutamate
Solute carrier family 1 (glial high affinity glutamate
AGI_HUM1_OLIGO_A_23_P162068






transporter), member 2
transporter), member 2 || AY056021 || 11p13-  text missing or illegible when filed





959
Hs.497599
Tryptophanyl-tRNA synthetase
Tryptophanyl-tRNA synthetase || NM_004184 || 14q32.31
AGI_HUM1_OLIGO_A_23_P65651




970
Hs.377972
Chromosome 13 open reading frame 21
Chromosome 13 open reading frame 21 || AK123212 ||
AGI_HUM1_OLIGO_A_23_P139965







13q14.11





971
Hs.132340
Chromosome 6 open reading frame 85
Chromosome 6 open reading frame 85 || NM_021945 || 6p25.2
AGI_HUM1_OLIGO_A_23_P7882




972
Hs.181220
Hypothetical gene CG01B
Hypothetical gene CG01B || AL832677 || 13q12-q13
AGI_HUM1_OLIGO_A_23_P76659




973
Hs.471675
Diacylglycerol kinase, delta 130 kDa
Diacylglycerol kinase, delta 130 kDa || NM_15879 || 2q37.1
AGI_HUM1_OLIGO_A_23_P210253




974
Hs.369022
MOB1, Mps One Binder kinase activator-like 2B
MOB1, Mps One Binder kinase activator-like 2B
AGI_HUM1_OLIGO_A_23_P146548






(yeast)
(yeast) || NM_024761 || 9p21.2





975
Hs.34871
Zinc finger homeobox 1b
Zinc finger homeobox 1b || NM_014795 || 2q22
AGI_HUM1_OLIGO_A_23_P142560




976
Hs.166551
Chromosome 5 open reading frame 3
Chromosome 5 open reading frame 3 || CR749447 || 5q31-q33
AGI_HUM1_OLIGO_A_23_P41912




977
Hs.90073
CSE1 chromosome segregation 1-like (yeast)
CSE1 chromosome segregation 1-like (yeast) || NM_001316 ||
AGI_HUM1_OLIGO_A_23_P17392







20q13





978
Hs.546282
Retinoblastoma binding protein B
Retinoblastoma binding protein B || NM_002894 || 18q11.2
AGI_HUM1_OLIGO_A_23_P252371




979
Hs.508343
Alpha-methylacyl-CoA racemase
Alpha-methylacyl-CoA racemase || CR616479 || 5p13.2-p11.1
AGI_HUM1_OLIGO_A_23_P110676




980
Hs.248174
Histone 1, H2ab
Histone 1, H2ab || CK13054 || 6p21.3
AGI_HUM1_OLIGO_A_23_P251633




981
Hs.69771
B-factor, properdin
B-factor, properdin || NM_001710 || 6p21.3
AGI_HUM1_OLIGO_A_23_P156687




982
Hs.83383
Peroxiredoxin 4
Peroxiredoxin 4 || BM_674623 || Xp22.11
AGI_HUM1_OLIGO_A_23_P114232




983
Hs.233119
Malic enzyme 2, NAD(+)-dependent, mitochondrial
Malic enzyme 2, NAD(+)-dependent, mitochondrial
AGI_HUM1_OLIGO_A_23_P38748







|| NM_002396 || 6p25-p24





984
Hs.89545
Proteasome (prosome, macropain) subunit, beta
Proteasome (prosome, macropain) subunit, beta type, 4
AGI_HUM1_OLIGO_A_23_P769






type, 4
|| CR604108 || 1q21





985
Hs.502755
AHNAK nucleoprotein (desmoyokin)
AHNAK nucleoprotein (desmoyokin) || NM_001620 ||
AGI_HUM1_OLIGO_A_23_P127789
AGI_HUM1_OLIGO_






11q12.2

A_23_P21363



986
Hs.227067
ATPase family, AAA domain containing 3A
ATPase family, AAA domain containing 3A || AK092833 ||
AGI_HUM1_OLIGO_A_23_P201357







1p36.33





987
Hs.497684
Protein phosphatase 2, regulatory subunit B (B56),
Protein phosphatase 2, regulatory subunit B (B56), alpha
AGI_HUM1_OLIGO_A_23_P256432






alpha isoform
isoform || NM_006243 || 1q32.3-q32.3





933
Hs.47357
Cholesterol 25-hydroxylase
Cholesterol 25-hydroxylase || NM_003956 || 10q23
AGI_HUM1_OLIGO_A_23_P86470




989
Hs.412707
Hypoxanthine phosphoribosyltransferase 1
Hypoxanthine phosphoribosyltransferase 1 (Lesch-Nyhan
AGI_HUM1_OLIGO_A_23_P11372






(Lesch-Nyhan syndrome)
syndrome) || NM_000194 || Xq26.1





990
Hs.55220
BCL2-associated athanogene 2
BCL2-associated athanogene 2 || AK023735 || 6p12.3-p11.2
AGI_HUM1_OLIGO_A_23_P255363




991
Hs.22891
Solute carrier family 7 (cationic amino acid
Solute carrier family 7 (cationic amino acid transporter, y+
AGI_HUM1_OLIGO_A_23_P205460






transporter, y+ system), member 8
system), member 8 || Y18483 || 14q11





992
Hs.118183
Hypothetical protein FLJ22833
Hypothetical protein FLJ22833 || AL832659 || 2q32.3
AGI_HUM1_OLIGO_A_23_P72897




993
Hs.17558
Hypothetical protein FLJ90586
Hypothetical protein FLJ90586 || BC035517 || 7q34
AGI_HUM1_OLIGO_A_23_P42908




994
Hs.75093
Procollagen-lysine 1, 2-oxoglutarate
Procollagen-lysine 1, 2-oxoglutarate S-dioxygenase 1
AGI_HUM1_OLIGO_A_23_P137525






S-dioxygenase 1
|| NM_000302 || 1p36-3-p36.2





995
Hs.159118
Adenosylmethionine decarboxylase 1
Adenosylmethionine decarboxylase 1 || BC041345 || 6q21-q22
AGI_HUM1_OLIGO_A_23_P214121




995
Hs.64016
Protein S (alpha)
Protein S (alpha) || M14338 || 3q112.2
AGI_HUM1_OLIGO_A_23_P84510




997
Hs.3109
Rho GTPase activating protein 4
Rho GTPase activating protein 4 || BC052303 || Xq28
AGI_HUM1_OLIGO_A_23_P159927




998
Hs.471441
Proteasome (prosome, macropain) subunit, beta
Proteasome (prosome, macropain) subunit, beta type, 2
AGI_HUM1_OLIGO_A_23_P170058






type, 2
|| BM545813 || 1p34.2





999
Hs.104315
Soc-2 suppressor of clear homolog (C.elegans)
Soc-2 suppressor of clear homolog (C.elegans)
AGI_HUM1_OLIGO_A_23_P202565







|| BC044752 || 10q25





1000
Hs.77448
Aldehyde dehydrogenase 4 family, member A1
Aldehyde dehydrogenase 4 family, member A1
AGI_HUM1_OLIGO_A_23_P158945
AGI_HUM1_OLIGO_






|| NM_003748 || 1p36

A_23_P170337



1001
Hs.433203
HSPC171 protein
HSPC171 protein || BF204699 || 16q22.1
AGI_HUM1_OLIGO_A_23_P206369




1002
Hs.524763
Two pore segment channel 1
Two pore segment channel 1 || AB032995 ||
AGI_HUM1_OLIGO_A_23_P218086




1003
Hs.50130
Necdin homolog (mouse)
Necdin homolog (mouse) || NM_002487 || 15q11.2-q12
AGI_HUM1_OLIGO_A_23_P106405




1004
Hs.345694
Potassium channel modulatory factor 1
Potassium channel modulatory factor 1 || NM_020122 ||
AGI_HUM1_OLIGO_A_23_P131475







2p11.2





1005
Hs.7917
Likely ortholog of mouse hypoxia induced gene 1
Likely ortholog of mouse hypoxia induced gene 1
AGI_HUM1_OLIGO_A_23_P40885







|| AL833541 || 3p22.1





1005
Hs.482363
Solute carrier family 30 (zinc transporter),
Solute carrier family 30 (zinc transporter), member 5
AGI_HUM1_OLIGO_A_23_P218988






member 5
|| BX537394 || 5q12.1





1007
Hs.161301
Cathepsin S
Cathepsin S || NM_004079 || 1q21
AGI_HUM1_OLIGO_A_23_P46141




1008
Hs.126688
Choline dehydrogenase
Choline dehydrogenase || AK055402 || 3p21.1
AGI_HUM1_OLIGO_A_23_P69293




1009
Hs.381178
Breast carcinoma amplified sequence 4
Breast carcinoma amplified sequence 4 || BC056886 ||
AGI_HUM1_OLIGO_A_23_P40209







20q13.13





1010
Hs.426359
DKFZp564J157 protein
DKFZp564J157 protein || BE906094 || 12q12
AGI_HUM1_OLIGO_A_23_P139575




1011
Hs.408528
Retinoblastoma 1 (including osteosarcoma)
Retinoblastoma 1 (including osteosarcoma) || L41870 ||
AGI_HUM1_OLIGO_A_23_P204850







13q14.2





1012
Hs.435326
Actin-like 6A
Actin-like 6A || NM_178042 || 3q26.33
AGI_HUM1_OLIGO_A_23_P69249




1013
Hs.21611
Kinesin family member 3C
Kinesin family member 3C || BX571741 || 2p23
AGI_HUM1_OLIGO_A_23_P120325




1014
Hs.502004
Related RAS viral (r-ras) oncogene homolog 2
Related RAS viral (r-ras) oncogene homolog 2 || BQ228116 ||
AGI_HUM1_OLIGO_A_23_P202852







11p15.2





1015
Hs.281898
Absent in melanoma 2
Absent in melanoma 2 || BC010940 || 1q22
AGI_HUM1_OLIGO_A_23_P12100




1016
Hs.483473
Chromosome 5 open reading frame 5
Chromosome 5 open reading frame 5 || AF251038 || 5q31
AGI_HUM1_OLIGO_A_23_P136460




1017
Hs.16362
Pyrimidinergic receptor P2Y, G-protein coupled, 6
Pyrimidinergic receptor P2Y, G-protein coupled, 6
AGI_HUM1_OLIGO_A_23_P64611







|| NM_004154 || 11q13.5





1018
Hs.511067
Hypothetical protein FLJ10579
Hypothetical protein FLJ10579 || AK123282 || 15q15.1
AGI_HUM1_OLIGO_A_23_P152087




1019
Hs.125867
Enah/Vasp-like
Enah/Vasp-like || AL133642 || 14q32.2
AGI_HUM1_OLIGO_A_23_P129034




1020
Hs.404088
Sarcoma antigen NY-SAR-48
Sarcoma antigen NY-SAR-48 || AK130803 || 19p13.11
AGI_HUM1_OLIGO_A_23_P141965




1021
Hs.118554
Lactamase, beta 2
Lactamase, beta 2 || NM_016027 || 8p22-p22.3
AGI_HUM1_OLIGO_A_23_P252711




1022
Hs.432438
Echinoderm microtubule associated protein like 4
Echinoderm microtubule associated protein like 4
AGI_HUM1_OLIGO_A_23_P165896







|| NM_019063 || 2p22-p21





1023
Hs.517232
Peroxisomal biogenesis factor 19
Peroxisomal biogenesis factor 19 || NM_002857 || 1q22
AGI_HUM1_OLIGO_A_23_P160188




1024
Hs.128686
Nucleobindin 2
Nucleobindin 2 || AK128739 || 11p15.1-p14
AGI_HUM1_OLIGO_A_23_P13364




1025
Hs.93832
Putative membrane protein
Putative membrane protein || BF680501 || 1q22-q25
AGI_HUM1_OLIGO_A_23_P46084




1026
Hs.497200
Phospholipase A2, group IVA (cytosolic,
Phospholipase A2, group IVA (cytosolic, calcium-dependent)
AGI_HUM1_OLIGO_A_23_P11685






calcium-dependent)
|| M68874 || 1q25





1027
Hs.418123
Cathepsin L
Cathepsin L || AK055599 || 9q21-q22
AGI_HUM1_OLIGO_A_23_P94533




1028
Hs.485938
Ras-related GTP binding D
Ras-related GTP binding D || AL137502 || 6q15-q16
AGI_HUM1_OLIGO_A_23_P133684




1029
Hs.280342
Protein kinase, cAMP-dependent regulatory, type I,
Protein kinase, cAMP-dependent regulatory, type I, alpha
AGI_HUM1_OLIGO_A_23_P164170






alpha (tissue specific extinguisher 1)
(tissue specific extinguisher 1) || CR7  text missing or illegible when filed





1030
Hs.1987
CD28 antigen (Tp44)
CD28 antigen (Tp44) || NM_006139 || 2q33
AGI_HUM1_OLIGO_A_23_P91095




1031
Hs.533628
KIAA0133
KIAA0133 || NM_014777 || 1q42.13
AGI_HUM1_OLIGO_A_23_P74914




1032
Hs.337594
Serine dehydralase-like
Serine dehydralase-like || BC009849 || 12q24.13
AGI_HUM1_OLIGO_A_23_P53439




1033
Hs.83169
Matrix metalloproteinase 1 (interstitial collagenase)
Matrix metalloproteinase 1 (interstitial collagenase)
AGI_HUM1_OLIGO_A_23_P1691







|| BC031875 || 11q22.3





1034
Hs.470608
Solute carrier family 25 (mitochondrial carrier,
Solute carrier family 25 (mitochondrial carrier, Aralar),
AGI_HUM1_OLIGO_A_23_P142714






Aralar), member 12
member 12 || AJ496568 || 2q24





1035
Hs.282326
Down syndrome critical region gene 1
Down syndrome critical region gene 1 || AY325903 ||
AGI_HUM1_OLIGO_A_23_P166246







21q22.1-q22.2





1036
Hs.517581
Heme oxygenase (decycling) 1
Heme oxygenase (decycling) 1 || BG165629 || 22q12
AGI_HUM1_OLIGO_A_23_P120883




1037
Hs.95351
Lipase, hormone-sensitive
Lipase, hormone-sensitive || BC070041 || 19q13.2
AGI_HUM1_OLIGO_A_23_P38876




1038
Hs.433512
ARP3 actin-related protein 3 homolog (yeast)
ARP3 actin-related protein 3 homolog (yeast) || BC044590 ||
AGI_HUM1_OLIGO_A_23_P108785







2q14.1





1039
Hs.292156
Dickkopf homolog 3 (Xenopuslaevis)
Dickkopf homolog 3 (Xenopuslaevis) || NM_015881 ||
AGI_HUM1_OLIGO_A_23_P162047







11p15.2





1040
Hs.439726
Laminin, beta 2 (laminin S)
Laminin, beta 2 (laminin S) || NM_002292 || 3p21
AGI_HUM1_OLIGO_A_23_P21382




1041
Hs.506325
Nudix (nucleoside diphosphate linked moiety X)-
Nudix (nucleoside diphosphate linked moiety X)-type
AGI_HUM1_OLIGO_A_23_P2366






type motif 4
motif 4 || NM_199040 ||





1042
Hs.50984
Sarcoma amplified sequence
Sarcoma amplified sequence || BX647402 || 12q13.3
AGI_HUM1_OLIGO_A_23_P24984




1043
Hs.476319
Enoyl Coenzyme A hydrolase domain containing 2
Enoyl Coenzyme A hydrolase domain containing 2
AGI_HUM1_OLIGO_A_23_P200203







|| BX647186 || 1p32.3





1044
Hs.332706
Optineurin
Optineurin || NM_001008211 || 10p13
AGI_HUM1_OLIGO_A_23_P1461




1045
Hs.105700
Secreted frizzled-related protein 4
Secreted frizzled-related protein 4 || AF026692 || 7p14.1
AGI_HUM1_OLIGO_A_23_P215320




1046
Hs.26403
Glutathione transferase zeta 1 (maleylacetoacetate
Glutathione transferase zeta 1 (maleylacetoacetate
AGI_HUM1_OLIGO_A_23_P106204






isomerase)
isomerase) || AB209360 || 14q24.3





1047
Hs.476365
Sterol carrier protein 2
Sterol carrier protein 2 || AB208789 || 1p32
AGI_HUM1_OLIGO_A_23_P126057




1048
Hs.533260
KIAA0649
KIAA0649 || NM_014811 || 9q34.3
AGI_HUM1_OLIGO_A_23_P146497




1049
Hs.435661
Serine palmitoyltransferase, long chain base
Serine palmitoyltransferase, long chain base subunit 2
AGI_HUM1_OLIGO_A_23_P3146






subunit 2
|| NM_004863 || 14q24.3-q31





1050
Hs.459952
Stannin
Stannin || NM_003498 || 16p13
AGI_HUM1_OLIGO_A_23_P152160




1051
Hs.97220
Chondroadherin
Chondroadherin || NM_001267 || 17q21.33
AGI_HUM1_OLIGO_A_23_P26976




1052
Hs.20013
GCIP-interacting protein p29
GCIP-interacting protein p29 || BC015824 || 1p36.11
AGI_HUM1_OLIGO_A_23_P45756




1053
Hs.19439
Transcription elongation factor A (SII)-like 4
Transcription elongation factor A (SII)-like 4
AGI_HUM1_OLIGO_A_23_P259188







|| CR594284 || Xq22.2





1054
Hs.495710
Glycoprotein M6B
Glycoprotein M6B || NM_001001995 || Xp22.2
AGI_HUM1_OLIGO_A_23_P85067




1055
Hs.90753
HIV-1 Tat interactive protein 2, 30 kDa
HIV-1 Tat interactive protein 2, 30 kDa || NM_006410 ||
AGI_HUM1_OLIGO_A_23_P64129







11p15.1





1056
Hs.411847
Mitogen-activated protein kinase 5
Mitogen-activated protein kinase 5 || NM_002748 || 15q21
AGI_HUM1_OLIGO_A_23_P3204




1057
Hs.78944
Regulator of G-protein signalling 2, 24 kDa
Regulator of G-protein signalling 2, 24 kDa || BC042755 ||
AGI_HUM1_OLIGO_A_23_P114947







1q31





1058
Hs.467769
Family with sequence similarity 49, member A
Family with sequence similarity 49, member A || AK055334||
AGI_HUM1_OLIGO_A_23_P21560







2p24.3-p24.2





1059
Hs.188634
Sorting nexin 1
Sorting nexin 1 || AB209013 || 15q22.31
AGI_HUM1_OLIGO_A_23_P49033




1060
Hs.107740
Kruppel-like factor 2 (lung)
Kruppel-like factor 2 (lung) || BM549806 || 19p13.13-p13.11
AGI_HUM1_OLIGO_A_23_P119196




1061
Hs.534169
Heat shock 70 kDa protein 14
Heat shock 70 kDa protein 14 || BC026226 || 10p13
AGI_HUM1_OLIGO_A_23_P63829




1062
Hs.508234
Kruppel-like factor 5 (intestinal)
Kruppel-like factor 5 (intestinal) || AF132818 || 13q22.1
AGI_HUM1_OLIGO_A_23_P53891




1063
Hs.128065
Cathepsin C
Cathepsin C || BX537913 || 11q14.1-q14.3
AGI_HUM1_OLIGO_A_23_P1552




1064
Hs.105153
Shugoshin-like 1 (S.pombe)
Shugoshin-like 1 (S.pombe) || AB187578 || 3p24.3
AGI_HUM1_OLIGO_A_23_P29723




1065
Hs.9088
Ankyrin repeat domain 34
Ankyrin repeat domain 34 || AK04282 || 1q21.1
AGI_HUM1_OLIGO_A_23_P23855




1066
Hs.530157
FP15737
FP15737 || AF495725 ||
AGI_HUM1_OLIGO_A_23_P250833




1067
Hs.173288
SH2 domain binding protein 1 (tetratricopeptide
SH2 domain binding protein 1 (tetratricopeptide repeat
AGI_HUM1_OLIGO_A_23_P127676






repeat containing)
containing) || BC058914 || 11p15.3





1068
Hs.5175785
Biliverdin reductase B (Ravin reductase (NADPH))
Biliverdin reductase B (Ravin reductase (NADPH))
AGI_HUM1_OLIGO_A_23_P153351







|| BF341546 || 19q13.1-q13.2





1069
Hs.493096
Pre-B-cell leukemia transcription factor 1
Pre-B-cell leukemia transcription factor 1 || CR749446 || 1q23
AGI_HUM1_OLIGO_A_23_P62948




1070
Hs.8526
UDP-GlcNAcbetaGal beta-1,3-N-
UDP-GlcNAcbetaGal beta-1,3-N-







acetylglucosaminyltransferase 6
acetylglucosaminyltransferase 6 || NM_006876 || 11q13.2
AGI_HUM1_OLIGO_A_23_P86899




1071
Hs.497636
Laminin, beta 3
Laminin, beta 3 || NM_001017402 || 1q32
AGI_HUM1_OLIGO_A_23_P86012




1072
Hs.491351
Clathrin, heavy polypeptide (Hc)
Clathrin, heavy polypeptide (Hc) || NM_004859 || 17q11-qter
AGI_HUM1_OLIGO_A_23_P118543




1073
Hs.52332
Ornithine aminotransferase (gyrate atrophy)
Ornithine aminotransferase (gyrate atrophy) || AB208817 ||
AGI_HUM1_OLIGO_A_23_P98092







10q26





1074
Hs.477114
Pleckstrin homology-like domain, family 8,
Pleckstrin homology-like domain, family 8, member 2
AGI_HUM1_OLIGO_A_23_P250063






member 2
|| AL832205 || 3q13.2





1075
Hs.509966
Chromosome 14 open reading frame 5B
Chromosome 14 open reading frame 5B || AY260577 ||
AGI_HUM1_OLIGO_A_23_P140364







14q24.3





1076
Hs.75069
Serine hydroxymethyl transferase 2 (mitochondrial)
Serine hydroxymethyl transferase 2 (mitochondrial)
AGI_HUM1_OLIGO_A_23_P158239
AGI_HUM1_OLIGO_






|| AK056053 || 12q12-q14

A_23_P169629



1077
Hs.523009
Sparc/osteonectin, cwcv and kazal-like domains
Sparc/osteonectin, cwcv and kazal-like domains
AGI_HUM1_OLIGO_A_23_P161280






proteoglycan (testican) 2
proteoglycan (testican) 2 || NM_014767 || 10p  text missing or illegible when filed





1078
Hs.8036
MBC3205
MBC3205 || AK127147 || 19p13.2
AGI_HUM1_OLIGO_A_23_P90099




1079
Hs.549198
F-box protein 31
F-box protein 31 || AF318348 || 16q24.2
AGI_HUM1_OLIGO_A_23_P89030




1080
Hs.532626
MGC1602B similar to RIKEN cDNA 1700019E19
MGC1602B similar to RIKEN cDNA 1700019E19 gene
AGI_HUM1_OLIGO_A_23_P48728






gene
|| BU733407 || 14q24.3





1081
Hs.169075
PTK9 protein tyrosine kinase 9
PTK9 protein tyrosine kinase 9 || NM_198974 || 12q12
AGI_HUM1_OLIGO_A_23_P48166




1082
Hs.495912
Dystrophin (muscular dystrophy, Duchenne and
Dystrophin (muscular dystrophy, Duchenne and Becker types)
AGI_HUM1_OLIGO_A_23_P113453






Becker types)
|| NM_004010 || Xp21.2





1083
Hs.421724
Cathepsin G
Cathepsin G || BU621869 || 14q11.2
AGI_HUM1_OLIGO_A_23_P140384




1084
Hs.532815
Elastin microfibril interface 2
Elastin microfibril interface 2 || AF270513 || 18p11.3
AGI_HUM1_OLIGO_A_23_P27315




1085
Hs.477921
WW domain containing transcription regulator 1
WW domain containing transcription regulator 1
AGI_HUM1_OLIGO_A_23_P29763







|| AL833852 || 3q23-q24





1086
Hs.520494
Hypothetical protein FLJ14925
Hypothetical protein FLJ14925 || BC068649 || 1q42.13-q43
AGI_HUM1_OLIGO_A_23_P138034




1087
Hs.31442
RecQ protein-like 4
RecQ protein-like 4 || BC020496 || 8q24.3
AGI_HUM1_OLIGO_A_23_P71558




1088
Hs.2030
Thrombomodulin
Thrombomodulin || NM_000361 || 20p12-cen
AGI_HUM1_OLIGO_A_23_P91390




1089
Hs.508480
RAP2A, member of RAS oncogene family
RAP2A, member of RAS oncogene family || NM_021033 ||
AGI_HUM1_OLIGO_A_23_P151384







13q34





1090
Hs.153678
Reproduction 8
Reproduction 8 || NM_005671 || 8p12-p11.2
AGI_HUM1_OLIGO_A_23_P157465




1091
Hs.319334
Nuclear autoantigenic sperm protein
Nuclear autoantigenic sperm protein (histone-binding)
AGI_HUM1_OLIGO_A_23_P34794






(histone-binding)
|| AY700118 || 1p34.1





1092
Hs.396358
Hypothetical protein FLJ11273
Hypothetical protein FLJ11273 || NM_018374 || 7p21.3
AGI_HUM1_OLIGO_A_23_P8522




1093
Hs.514402
Hypothetical protein MGC10986
Hypothetical protein MGC10986 || NM_030576 || 17p23.3
AGI_HUM1_OLIGO_A_23_P26823




1094
Hs.166950
Ganglioside-induced differentiation-associated
Ganglioside-induced differentiation-associated protein 1
AGI_HUM1_OLIGO_A_23_P216071






protein 1
|| AL110252 || 8q21.11





1095
Hs.74576
GDP dissociation inhibitor 1
GDP dissociation inhibitor 1 || AL123405 || Xq28
AGI_HUM1_OLIGO_A_23_P45496




1096
Hs.310542
Translocase of outer mitochondrial membrane-40
Translocase of outer mitochondrial membrane-40 homolog
AGI_HUM1_OLIGO_A_23_P153266






homolog (yeast)
(yeast) || BC047528 || 19q13





1097
Hs.285976
LAG1 longevity assurance homolog 2
LAG1 longevity assurance homolog 2 (S.cerevisiae)
AGI_HUM1_OLIGO_A_23_P63009






(S.cerevisiae)
|| NM_181746 || 1q21.2





1098
Hs.519018
SH3 domain protein D19
SH3 domain protein D19 || BX647422 || 4q31.3
AGI_HUM1_OLIGO_A_23_P33364




1099
Hs.524464
ATP synthase, H+ transporting, mitochondrial F0
ATP synthase, H+ transporting, mitochondrial F0 complex,
AGI_HUM1_OLIGO_A_23_P87616






complex, subunit c (subunit 9), isoform 2
subunit c (subunit 9), isoform 2 || CR  text missing or illegible when filed





1100
Hs.369554
Solute carrier family 16 (monocarboxylic acid
Solute carrier family 16 (monocarboxylic acid transporters),
AGI_HUM1_OLIGO_A_23_P159129






transporters), members 5
members 5 || AK092512 || 17q25.1





1101
Hs.513315
Nudix (nucleoside diphosphate linked moiety X)-
Nudix (nucleoside diphosphate linked moiety X)- type motif
AGI_HUM1_OLIGO_A_23_P49429






type motif 16-like 1
16-like 1 || BQ679635 || 16p13.3





1102
Hs.10595
Cytochrome P450, family 26, subfamily A,
Cytochrome P450, family 26, subfamily A, polypeptide 1
AGI_HUM1_OLIGO_A_23_P138655






polypeptide 1
|| AK027560 || 10q23-q24






text missing or illegible when filed indicates data missing or illegible when filed






















TABLE 14








Diagonal bias
Diagonal corr.
Diagonal spread
Concordance corr.
Diagonal std. dev
Lowe
Upper
Range


Gene
Raw
Raw
Raw
Raw
Raw
Limits Raw
Limits Raw
Raw





ACTB
−1.6
−0.012
0.85
−0.004
1.1
0.026
1.8
6.5


ASF1A
−2.3
0.044
1.1
0.012
1.4
0.0067
1.5
5.9


B3GNT5
−0.51
0.49
0.89
0.45
1.1
0.064
5.7
8.6


BLVRA
−2.1
0.055
1.1
0.021
1.6
0.0054
3
7.8


BTG3
−1.1
0.39
0.84
0.25
1.1
0.038
3.3
7.6


BUB1
−1.5
0.51
0.68
0.23
0.94
0.038
1.5
5.3


C10ORF7
−2.1
0.18
0.86
0.045
1.1
0.015
1.1
5.7


C16ORF45
−2.8
0.31
1.1
0.093
1.5
0.0032
1.3
9.2


CaMKIIN/Alpha
−0.91
0.69
0.68
0.51
0.87
0.075
2.2
8.3


CDH3
−1.5
0.65
0.96
0.45
1.3
0.02
2.9
9.7


CHI3L2
−1.5
0.84
0.91
0.66
1.1
0.024
2
13


COX6C
−2.2
0.59
1
0.28
1.3
0.0081
1.6
10


CSDA
−1.1
0.35
0.92
0.24
1.4
0.022
4.7
6.1


CTPS
−1.8
0.27
0.83
0.096
1.1
0.018
1.6
5.4


ERBB2
−0.65
0.73
0.72
0.64
0.91
0.088
3.2
8.1


ESR1
−1.6
0.9
0.8
0.74
1
0.028
1.6
13


FABP7
−0.9
0.81
1.4
0.78
2
0.0086
20
23


FBP1
−1.8
0.67
0.9
0.37
1.2
0.016
1.6
10


FLJ10980
−2.8
0.42
0.83
0.086
1.1
0.0074
0.54
6.1


FOXC1
−0.034
0.72
0.83
0.72
1.1
0.11
8.9
8.2


FZD7
−0.93
0.3
0.7
0.17
0.92
0.067
2.4
5.8


GATA3
−1.5
0.77
0.88
0.53
1.1
0.026
1.9
11


GRB7
−1.4
0.73
0.7
0.46
0.94
0.042
1.7
8.5


GSTM3
−2.3
0.65
0.86
0.25
1.1
0.012
0.83
9


GSTP1
−0.033
0.51
0.76
0.51
0.97
0.14
6.5
5.1


HIS1
−1.3
0.3
0.75
0.13
0.94
0.046
1.8
6


ID4
−1.1
0.64
0.74
0.44
0.95
0.055
2.3
6.9


IGBP1
−2.5
−0.16
1.1
−0.035
1.5
0.0048
1.4
6.1


INPP4B
−1.5
0.67
0.81
0.4
1.1
0.027
1.8
10


KIT
−1.7
0.28
1.2
0.16
1.7
0.0071
5.5
9.4


KRT17
−0.7
0.66
0.92
0.59
1.2
0.049
5.1
9


MKI67
−1.3
0.48
0.7
0.24
0.92
0.047
1.7
5.7


MRPL19
−0.61
0.0035
0.88
0.0027
1.1
0.062
4.9
4.7


MYBL2
−0.18
0.8
0.57
0.79
0.76
0.19
3.7
5.6


PSMC4_R3
−2.6
−0.061
1
−0.012
1.3
0.0056
1
6.1


PUM1
−2.5
−0.15
0.92
−0.026
1.2
0.0084
0.97
6.2


S100A11
−0.89
0.34
0.86
0.23
1
0.054
3.3
5.8


SEMA3C
−1.1
0.46
0.77
0.29
1.1
0.042
2.7
7.3


SF3A1
−2.3
−0.24
0.93
−0.043
1.2
0.0095
1.1
6


SLC39A6
−1.5
0.76
0.78
0.49
0.99
0.033
1.6
7.8


SLC5A6
−1.3
0.59
0.61
0.27
0.79
0.058
1.3
5.1


STK6
−1.4
0.63
0.63
0.32
0.87
0.046
1.4
5.7


TCEAL1
−1.2
0.47
0.64
0.24
0.85
0.061
1.7
6.1


TFF3
−1.8
0.8
0.89
0.55
1.2
0.016
1.8
10


TMSB10
−2.3
0.21
0.97
0.063
1.3
0.0082
1.4
6


TOP2A
−2
0.53
0.9
0.23
1.2
0.014
1.5
7.3


TP53BP2
−0.93
0.47
0.74
0.3
0.89
0.07
2.3
5.3


VAV3
−3.1
0.48
0.97
0.12
1.3
0.0037
0.6
8.5


WWP1
−1.7
0.55
0.74
0.21
0.94
0.029
1.1
7.1


XBP1
−0.87
0.67
0.81
0.54
1
0.057
3.2
N/A


















Diagonal bias
Diagonal corr.
Diagonal spread
Concordance corr.
Diagonal std. dev
Lowe
Upper Limits
Range
Diagonal bias


Norm
Norm
Norm
Norm
Norm
Limits Norm
Norm
Norm
DWD





N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A


−0.41
0.43
0.38
0.31
0.51
0.24
1.8
2.1
0.061


1.5
0.63
0.64
0.26
0.76
0.97
19
4.8
0.24


−0.16
0.46
0.52
0.45
0.83
0.17
4.4
4.6
−0.3


0.91
0.39
0.64
0.24
0.9
0.42
14
6.6
0.47


0.43
0.59
0.56
0.51
0.7
0.39
6
4.4
0.42


−0.14
0.7
0.28
0.67
0.38
0.42
1.8
3.6
0.36


−0.74
0.74
0.59
0.61
0.82
0.097
2.4
6.4
0.079


1.1
0.67
0.6
0.44
0.91
0.51
18
6.5
0.001


0.48
0.82
0.56
0.76
0.72
0.39
6.6
8.4
0.64


0.51
0.8
0.88
0.78
1.2
0.16
17
12
0.95


−0.28
0.93
0.39
0.91
0.48
0.3
1.9
6.4
−0.23


0.79
0.51
0.69
0.39
0.96
0.33
14
7.6
0.12


0.12
0.56
0.47
0.54
0.59
0.35
3.6
3.3
0.3


1.3
0.75
0.71
0.48
0.9
0.64
22
7
0.24


0.52
0.88
0.69
0.86
1.2
0.17
16
9.2
−0.17


1.1
0.74
1.3
0.67
1.8
0.093
97
14
1.3


0.18
0.89
0.41
0.88
0.62
0.36
4
7.3
−0.13


−0.82
0.73
0.51
0.52
0.65
0.12
1.6
4.7
0.26


1.8
0.77
0.58
0.36
0.81
1.2
29
6.2
−0.055


1.1
0.47
0.58
0.21
0.72
0.72
12
4.2
0.13


0.58
0.72
0.84
0.68
1.3
0.14
22
10
0.25


0.65
0.78
0.66
0.69
0.85
0.35
10
6.8
0.49


−0.26
0.74
0.6
0.73
0.97
0.12
5.2
7.4
0.27


2
0.53
0.64
0.13
0.77
1.5
31
4.2
0.014


0.71
0.42
0.56
0.26
0.7
0.5
7.9
3.3
0.1


0.93
0.56
0.8
0.41
1
0.33
19
7.5
0.85


−0.59
0.37
0.44
0.23
0.61
0.17
1.8
3.8
−0.035


0.51
0.86
0.47
0.79
0.64
0.47
5.8
5.8
0.019


0.71
0.66
0.61
0.54
0.86
0.37
11
6
0.65


1.4
0.57
0.91
0.38
1.3
0.32
47
9.2
0.91


0.66
0.41
0.63
0.29
0.78
0.41
8.8
3.4
0.5


N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A


1.7
0.65
0.73
0.3
0.95
0.83
34
6.2
0.15


N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A


N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A


1.1
0.48
0.57
0.2
0.7
0.78
12
3.3
0.092


0.97
0.62
0.67
0.43
0.9
0.44
15
6
0.2


N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A


0.51
0.85
0.62
0.8
0.81
0.34
8.1
6
0.2


0.65
0.56
0.62
0.42
0.77
0.42
8.7
5.6
0.58


0.53
0.65
0.57
0.55
0.74
0.39
7.2
5.2
0.48


0.82
0.56
0.61
0.39
0.82
0.44
11
4.9
0.36


0.18
0.89
0.7
0.89
0.88
0.21
6.7
9.4
−0.025


−0.34
0.54
0.41
0.46
0.53
0.25
2
2.5
0.18


−0.5
0.55
0.94
0.52
1.3
0.048
7.7
6.6
0.41


1.1
0.54
0.55
0.26
0.68
0.75
11
4.6
0.3


−1.1
0.65
0.62
0.47
1
0.047
2.6
7.8
0.35


0.26
0.73
0.55
0.7
0.67
0.34
4.9
5.4
0.22


1.2
0.7
0.71
0.48
1
0.46
23
6.6
−0.1
















Diagonal corr.
Diagonal spread
Concordance corr.
Diagonal std. dev
Lowe Limits
Upper Limits
Range


DWD
DWD
DWD
DWD
DWD
DWD
DWD





N/A
N/A
N/A
N/A
N/A
N/A
N/A


0.43
0.38
0.42
0.51
0.39
2.9
2.1


0.63
0.64
0.61
0.76
0.28
5.6
4.8


0.46
0.52
0.43
0.83
0.15
3.8
4.6


0.39
0.64
0.34
0.9
0.27
9.3
6.6


0.59
0.56
0.51
0.7
0.38
5.9
4.4


0.7
0.28
0.55
0.38
0.68
3
3.6


0.74
0.59
0.74
0.82
0.22
5.4
6.4


0.67
0.6
0.67
0.91
0.17
5.9
6.5


0.82
0.56
0.73
0.72
0.45
7.7
8.4


0.8
0.88
0.71
1.2
0.25
26
12


0.93
0.39
0.91
0.48
0.31
2
6.4


0.51
0.69
0.51
0.96
0.17
7.4
7.6


0.56
0.47
0.5
0.59
0.42
4.3
3.3


0.75
0.71
0.73
0.9
0.22
7.4
7


0.88
0.69
0.88
1.2
0.088
8.1
9.2


0.74
1.3
0.65
1.8
0.11
120
14


0.89
0.41
0.88
0.62
0.26
2.9
7.3


0.73
0.51
0.7
0.65
0.36
4.6
4.7


0.77
0.58
0.77
0.81
0.2
4.6
6.2


0.47
0.58
0.47
0.72
0.28
4.7
4.2


0.72
0.84
0.71
1.3
0.1
16
10


0.78
0.66
0.72
0.85
0.3
8.6
6.8


0.74
0.6
0.73
0.97
0.2
8.6
7.4


0.53
0.64
0.53
0.77
0.23
4.5
4.2


0.42
0.56
0.41
0.7
0.28
4.4
3.3


0.56
0.8
0.43
1
0.3
17
7.5


0.37
0.44
0.37
0.61
0.29
3.2
3.8


0.86
0.47
0.86
0.64
0.29
3.6
5.8


0.66
0.61
0.56
0.86
0.35
10
6


0.57
0.91
0.47
1.3
0.2
29
9.2


0.41
0.63
0.33
0.78
0.35
7.5
3.4


N/A
N/A
N/A
N/A
N/A
N/A
N/A


0.65
0.73
0.64
0.95
0.18
7.4
6.2


N/A
N/A
N/A
N/A
N/A
N/A
N/A


N/A
N/A
N/A
N/A
N/A
N/A
N/A


0.48
0.57
0.47
0.7
0.28
4.3
3.3


0.62
0.67
0.61
0.9
0.21
7.1
6


N/A
N/A
N/A
N/A
N/A
N/A
N/A


0.85
0.62
0.84
0.81
0.25
6
6


0.56
0.62
0.44
0.77
0.39
8.1
5.6


0.65
0.57
0.56
0.74
0.37
6.9
5.2


0.56
0.61
0.52
0.82
0.28
7.1
4.9


0.89
0.7
0.89
0.88
0.17
5.5
9.4


0.54
0.41
0.51
0.53
0.42
3.4
2.5


0.55
0.94
0.53
1.3
0.12
19
6.6


0.54
0.55
0.5
0.68
0.35
5.1
4.6


0.65
0.62
0.62
1
0.19
10
7.8


0.73
0.55
0.71
0.67
0.33
4.6
5.4


0.7
0.71
0.7
1
0.13
6.5
6.6



















TABLE 15










Prediction



Single Sample Predictor
Analysis
















PCR-FFPE
for Microarrays


Sample
MA (1393)
MA (40)
PCR-FF (40)
(40)
MA (1393)





BR00-0284
HER2+/ER−
HER2+/ER−
HER2+/ER−
HER2+/ER−
HER2+/ER−


BR00-0365
LUMINAL
LUMINAL
LUMINAL
LUMINAL
LUMINAL


BR00-0572
BASAL
BASAL
BASAL
BASAL
BASAL


BR00-0587
LUMINAL
LUMINAL
LUMINAL
LUMINAL
LUMINAL


BR99-0207
LUMINAL
LUMINAL
LUMINAL
LUMINAL
LUMINAL


BR99-0348
LUMINAL
LUMINAL
LUMINAL
LUMINAL
LUMINAL


PB120-LN
BASAL
BASAL
BASAL
BASAL
BASAL


PB126
BASAL
BASAL
BASAL
BASAL
BASAL


PB149
LUMINAL
NORMAL-LIKE
NORMAL-LIKE
LUMINAL
LUMINAL


PB205
BASAL
BASAL
BASAL
BASAL
BASAL


PB255
LUMINAL
LUMINAL
LUMINAL
LUMINAL
LUMINAL


PB297
BASAL
BASAL
BASAL
BASAL
BASAL


PB311
HER2+/ER−
HER2+/ER−
HER2+/ER−
LUMINAL
LUMINAL


PB314
HER2+/ER−
HER2+/ER−
HER2+/ER−
HER2+/ER−
HER2+/ER−


PB334
BASAL
BASAL
BASAL
BASAL
BASAL


PB362
NORMAL-LIKE
NORMAL-LIKE
NORMAL-LIKE
LUMINAL
NORMAL-LIKE


PB370
LUMINAL
LUMINAL
LUMINAL
LUMINAL
LUMINAL


PB376
HER2+/ER−
BASAL
BASAL
BASAL
HER2+/ER−


PB413
LUMINAL
LUMINAL
LUMINAL
LUMINAL
LUMINAL


PB441
LUMINAL
LUMINAL
LUMINAL
LUMINAL
LUMINAL


PB455
HER2+/ER−
HER2+/ER−
HER2+/ER−
HER2+/ER−
HER2+/ER−


UB29
BASAL
BASAL
BASAL
BASAL
BASAL


UB37
HER2+/ER−
HER2+/ER−
HER2+/ER−
HER2+/ER−
HER2+/ER−


UB38
LUMINAL
LUMINAL
LUMINAL
LUMINAL
LUMINAL


UB39
LUMINAL
LUMINAL
LUMINAL
LUMINAL
LUMINAL


UB43
LUMINAL
LUMINAL
LUMINAL
LUMINAL
LUMINAL


UB45
LUMINAL
LUMINAL
LUMINAL
LUMINAL
LUMINAL


UB55
LUMINAL
LUMINAL
LUMINAL
LUMINAL
LUMINAL


UB57
LUMINAL
LUMINAL
LUMINAL
LUMINAL
LUMINAL


UB58
LUMINAL
LUMINAL
LUMINAL
LUMINAL
LUMINAL


UB60
LUMINAL
HER2+/ER−
HER2+/ER−
HER2+/ER−
HER2+/ER−


UB66
LUMINAL
LUMINAL
LUMINAL
LUMINAL
LUMINAL


UB67
BASAL
BASAL
BASAL
BASAL
BASAL


UB71
BASAL
BASAL
BASAL
BASAL
BASAL


UB79
LUMINAL
LUMINAL
LUMINAL
LUMINAL
LUMINAL













Prediction Analysis for Microarrays












PCR-FFPE
Immunohistochemistry
HER2 DNA














Sample
MA (40)
PCR-FF (40)
(40)
ER
PR
HER2
PCR





BR00-0284
HER2+/ER−
HER2+/ER−
HER2+/ER−


+


BR00-0365
LUMINAL
LUMINAL
LUMINAL
+
+
+


BR00-0572
BASAL
BASAL
BASAL


+



BR00-0587
LUMINAL
LUMINAL
LUMINAL
+
+
+


BR99-0207
LUMINAL
LUMINAL
LUMINAL
+




BR99-0348
LUMINAL
LUMINAL
LUMINAL
+
+
+


PB120-LN
BASAL
BASAL
BASAL





PB126
BASAL
BASAL
BASAL





PB149
LUMINAL
LUMINAL
LUMINAL
+
+
+


PB205
BASAL
BASAL
BASAL


+



PB255
LUMINAL
LUMINAL
LUMINAL
+
+
+


PB297
BASAL
BASAL
BASAL





PB311
LUMINAL
LUMINAL
LUMINAL
+
+



PB314
HER2+/ER−
HER2+/ER−
HER2+/ER−


+



PB334
BASAL
BASAL
BASAL





PB362
NORMAL-LIKE
NORMAL-LIKE
LUMINAL
+
+



PB370
LUMINAL
LUMINAL
LUMINAL
+
+



PB376
BASAL
BASAL
BASAL


+



PB413
LUMINAL
LUMINAL
LUMINAL
+
+



PB441
LUMINAL
LUMINAL
LUMINAL
+
+



PB455
HER2+/ER−
HER2+/ER−
HER2+/ER−


+



UB29
BASAL
BASAL
BASAL





UB37
HER2+/ER−
HER2+/ER−
HER2+/ER−

+
+


UB38
LUMINAL
LUMINAL
LUMINAL
+
+




UB39
LUMINAL
LUMINAL
LUMINAL
+




UB43
LUMINAL
LUMINAL
LUMINAL
+
+




UB45
LUMINAL
LUMINAL
LUMINAL
+
+



UB55
LUMINAL
LUMINAL
LUMINAL
+
+
+


UB57
LUMINAL
LUMINAL
LUMINAL
+
+



UB58
LUMINAL
LUMINAL
LUMINAL
+
+



UB60
HER2+/ER−
HER2+/ER−
HER2+/ER−


+



UB66
LUMINAL
LUMINAL
LUMINAL
+
+



UB67
BASAL
BASAL
BASAL





UB71
BASAL
BASAL
BASAL





UB79
LUMINAL
LUMINAL
LUMINAL
+


















TABLE 16







MA P3m test (40 g × 35 s) Gene Scores (threshold = 0)










Offset Quantile
50
Offset Value
0.650934012



both
RNG Seed
420473












Prior Distribution (Sample Prior)











Class
BASAL
HER2
LUMINAL
NORMAL-LIKE





Prob.
0.225806
0.185483871
0.516129032
0.072580645














gene
BASAL
HER2
LUMINAL
NORMAL-LIKE















1
ERBB2
−0.3895
1.2282
−0.1621
−0.4533


2
KRT17
0.4627
−0.2412
−0.2612
1.12


3
KIT
0.3877
−0.2516
−0.2197
1.0848


4
ESR1
−0.8232
−0.9961
0.8007
−0.3565


5
FOXC1
0.9915
−0.1912
−0.4871
0.9538


6
TFF3
−0.9369
0
0.441
−0.0307


7
B3GNT5
0.9196
0.045
−0.4806
0.2112


8
XBP1
−0.8375
0
0.44
−0.4901


9
GRB7
0
0.8365
−0.2925
0.0187


10
ID4
0.4412
−0.1331
−0.2462
0.8043


11
COX6C
−0.2273
−0.2661
0.3347
−0.762


12
TMSB10
0.2812
0.744
−0.3593
−0.2403


13
GATA3
−0.7118
−0.2662
0.4969
−0.4083


14
SLC39A6
−0.5871
−0.7076
0.6434
−0.7101


15
WWP1
−0.3005
0
0.2562
−0.689


16
FABP7
0.4915
0.1749
−0.4044
0.6688


17
CDH3
0.6578
0.0768
−0.3978
0.3551


18
BTG3
0.6466
0.0631
−0.3522
0.1006


19
CHI3L2
0.6145
−0.072
−0.2852
0.3859


20
GSTP1
0.6044
0
−0.2967
0.1985


21
FBP1
−0.6029
−0.2298
0.4607
−0.5828


22
TP53BP2
0.5756
0.0568
−0.2241
−0.3619


23
C10orf7
0.5399
0.1613
−0.2739
−0.1639


24
SLC5A6
0.3315
0.5335
−0.4014
0.2293


25
FZD7
0.4426
−0.1423
−0.2053
0.5327


26
ASF1A
0.5161
0.1316
−0.2183
−0.409


27
INPP4B
−0.5015
0
0.2227
0


28
FLJ10980
−0.4928
0.0031
0.2695
−0.477


29
CTPS
0.4773
0.2509
−0.3185
0


30
GSTM3
−0.4558
0.0512
0.1822
−0.0945


31
CSDA
0.4181
0.1737
−0.3408
0.4479


32
SEMA3C
−0.4457
−0.1442
0.2904
−0.0793


33
VAV3
−0.4123
−0.0092
0.278
−0.4404


34
S100A11
0.2956
0.3957
−0.2321
−0.2994


35
CaMKIINalpha
−0.3896
0.0698
0.1268
0


36
C16orf45
−0.3465
−0.0975
0.1899
0


37
TCEAL1
−0.3093
−0.2368
0.2565
−0.0257


38
BLVRA
−0.2509
0.1475
0.0522
−0.0533


39
HIS1
−0.1223
−0.1352
0.1643
−0.2116


40
IGBP1
−0.1384
0
0.0446
0.1256










FF PCR test (40 g × 35 s) Gene Scores (threshold = 0)










Offset Quantile
50
Offset Value
0.625782014



both
RNG Seed
420473












Prior Distribution (Sample Prior)











Class
BASAL
HER2
LUMINAL
NORMAL-LIKE





Prob.
0.257142857
0.2
0.514285714
0.028571429














gene
BASAL
HER2
LUMINAL
NORMAL-LIKE















1
KIT
0.3675
−0.1395
−0.2345
1.8906


2
ESR1
−1.1883
−1.3459
1.102
0.2805


3
ID4
0.3245
−0.2605
−0.1333
1.3023


4
FOXC1
0.9875
−0.0291
−0.5523
1.2582


5
COX6C
−0.4466
−0.3998
0.4409
−1.1188


6
SLC39A6
−0.6537
−0.959
0.761
−1.1022


7
ERBB2
−0.4563
1.0711
−0.1409
−0.8546


8
FABP7
1.0494
−0.2077
−0.4884
0.7995


9
FZD7
0.2722
−0.2176
−0.1094
1.042


10
CTPS
0.5663
0.2286
−0.3146
−1.0343


11
GRB7
−0.1824
1.0102
−0.2542
−0.8547


12
TFF3
−0.9615
−0.0486
0.5382
−0.6947


13
INPP4B
−0.617
−0.2413
0.3534
0.8813


14
FBP1
−0.7295
−0.1266
0.4627
−0.8762


15
XBP1
−0.7415
0.1957
0.3423
−0.8578


16
B3GNT5
0.8518
−0.0531
−0.4234
0.3263


17
KRT17
0.8451
0.5354
−0.5876
−0.7765


18
GSTM3
−0.4103
−0.109
0.2041
0.7821


19
GATA3
−0.7014
−0.7154
0.6473
−0.3318


20
HIS1
−0.0594
−0.0717
0.0968
−0.7067


21
GSTP1
0.7038
−0.023
−0.3381
−0.0862


22
TP53BP2
0.6448
0.2512
−0.4122
−0.1419


23
CSDA
0.6429
0.2468
−0.4262
0.1579


24
SLC5A6
0.6212
0.2941
−0.4275
0.0468


25
CDH3
0.5387
0.6139
−0.5217
0.2459


26
C16orf45
−0.6045
−0.3263
0.4143
0.2668


27
BTG3
0.5543
0.5727
−0.5099
0.1804


28
FLJ10980
−0.5708
−0.1958
0.3594
0.0384


29
SEMA3C
−0.5628
0.158
0.1883
0.5699


30
TMSB10
0.3611
0.5673
−0.3855
−0.2823


31
S100A11
0.5274
0.452
−0.4164
−0.415


32
C10orf7
0.523
0.1684
−0.3132
−0.2481


33
TCEAL1
−0.4665
−0.2879
0.3164
0.5185


34
CHI3L2
0.5001
−0.2528
−0.142
−0.1756


35
VAV3
−0.3937
−0.0416
0.2301
−0.3076


36
CaMKIINalpha
−0.3558
0.237
0.0826
0.0551


37
WWP1
−0.3116
−0.349
0.2975
−0.1076


38
BLVRA
−0.2769
0.2018
0.0711
−0.2014


39
ASF1A
0.2576
0.1658
−0.202
0.1569


40
IGBP1
−0.0697
−0.0909
0.0723
−0.0384
















TABLE 17





Supplemental Table 4. Raw qRT-PCR (replicates averaged)







FFPE-PCR data 40 classifiers + 5 proliferation genes + 5 Housekeepers

















Gene/PCR












Sample name


Matrix_5HK +
BR00-
BR00-
BR00-
BR00-
BR99-
BR99-


prolif_(45 g × 35 s)
0284_FFPE
0365_C_FFPE
0572_Int-VIM_FFPE
0587_GATA3_FFPE
0207_C_Int_FFPE
0348 GATA3
IPB120-MET-LN_FFPE
PB126_FFPE
PB149_FFPE
PB184_FFPE





Intrinsic gene list


ASF1A_R1
0.6625
0.3485
1.27
0.467
0.28500003
0.3445
1.1800001
0.2085
0.949
0.089


B3GNT5_DR3
0.6835
0.542
6.5550003
0.861
0.838
1.3
8.035
4.3100004
1.75
1.315


BLVRA_R2
2.1100001
2.96
0.6615
3.01
2.69
1.49
2.15
0.1985
2.2150002
0.1285


BTG3_R1
1.905
0.87450004
3.29
0.6385
0.8345
2.31
3.545
2.7350001
1.345
1.28


C10ORF7_R2
0.529
0.37449998
2.585
0.546
0.38050002
0.5325
1.655
0.7095
0.7575
0.195


C16ORF45_R1
1.075
0.574
0.246
3.455
0.23949999
7.4449997
0.69949996
0.18180001
3.5149999
0.376


CaMKIINalpha_R2
0.752
4.35
0.268
1.325
0.8355
4.14
1.0285
0.12450001
3.0700002
4.755


CDH3_R1
0.5675
0.9015
2.31
0.142
0.127
1.08
6.08
4.09
1.185
0.64900005


CHI3L2_R1
3.48
2.3175
268.5
2.575
29.15
14.1
108
643.5
15.1
13.65


COX6C_R2
0.024999999
0.305
0.069
0.29500002
0.4435
0.8325
0.0995
0.016800001
0.056649998
0.01145


CSDA_R1
6.185
2.315
33.75
2.8000002
3.33
4.23
10.95
10.8
10.8
1.725


CTPS_R1
0.2785
0.9785
0.81
0.43150002
0.6375
0.6625
2.96
1.56
0.87549996
0.22049999


ERBB2_R5
3.685
1.195
0.62600005
0.97
1.325
0.899
0.6925
0.5395
0.854
0.885


ESR1_R3
0.2565
5.615
0.053729996
7.625
17.05
6.785
0.149
0.162
3.245
3.15


FABP7_R2
0.002871
0.002871
0.119
0.002871
0.002871
N/A
0.007755
0.952
0.009945
0.00559


FBP1_R1
0.4415
1.0999999
0.014249999
0.498
0.96349996
1.23
0.23699999
0.02985
0.5495
0.303


FLJ10980_R2
1.9100001
0.3215
0.288
3.035
0.651
2.595
0.729
0.1345
1.5799999
0.2995


FOXC1_DR2
5.19
3.595
90.1
1.9200001
1.71
4.99
31.099998
43.2
7.765
3.245


FZD7_DR4
5.795
4.5
18.5
9.715
1.565
6.3500004
4.7
5.33
13.950001
11.75


GATA3_R4
0.2295
1.44
0.143
2.065
4.935
3.57
0.0501
0.2995
1.9649999
1.525


GRB7_DR4
2.66
0.7035
0.2125
0.64750004
0.336
0.906
0.32
1.2950001
0.29
0.6385


GSTM3_R3
0.21149999
0.5825
0.3195
1.2
0.1255
3.355
0.4765
0.08825
0.49
0.6595


GSTP1_R2
5.605
3.7749999
27.95
3.79
4.2200003
15.1
24.2
10.5
12.1
10.5


HIS1_DR1
1.88
3.845
0.988
2.4099998
3.375
6.44
3.335
1.49
5.2200003
1.665


ID4_DR4
9.255
4.7749996
42.9
3.93
4.94
16.35
27.5
9.885
34.9
34.699997


IGBP1_R1
0.656
0.374
0.681
0.8815
0.372
0.538
0.6575
0.1655
0.87049997
0.0765


INPP4B_R1
1.065
1.24
0.15900001
1.034
0.4105
3.315
1.735
0.0367
2.95
1.033


KIT_R4
0.32599998
0.3745
1.0095
0.26200002
0.29000002
1.9300001
1.0475
0.58000004
1.5150001
2.565


KRT17_R6
0.371
0.66550004
2.4850001
0.46100003
0.196
0.517
2.65
5.1099997
2.1399999
1.6


S100A11_R5
0.9425
1.425
1.1700001
0.741
0.3215
1.5899999
1.6500001
2.175
0.9505
1.315


SEMA3C_R1
1.05
1.0295
0.5225
0.427
0.7885
0.7585
0.3755
0.27899998
1.65
2.385


SLC39A6_DR3
0.04795
0.292
0.141
1.74
1.97
0.556
0.08335
0.0515
1.035
0.2585


SLC5A6_DR4
0.5075
1.8900001
6.27
0.4365
0.64199996
2.01
1.605
4.705
1.3
0.8835


TCEAL1_DR3
0.883
2.1750002
1.635
1.433675
2.895
2.75
1.565
0.832
1.38
4.825


TFF3_R2
1.175
35.35
0.5905
2.105
6.05
2.025
0.282
0.337
3.385
0.45200002


TMSB10_R1
1.22
0.467
2.13
0.65999997
0.2365
0.5115
1.265
1.2
0.8215
0.1155


TP53BP2_R1
1.46
2.31
16.1
0.833
0.9485
1.89
6.5600004
6.0649996
2.4099998
1.4200001


VAV3_R1
1.7
7.8900003
1.3599999
6.815
0.12830001
5.2200003
1.2049999
1.3
6.95
2.6999998


WWP1_R2
0.4345
5.695
0.371
0.8155
0.4075
4.515
0.85249996
0.17050001
0.94449997
1.275


XBP1_R2
2.72
6.08
0.3595
4.5950003
3.275
4.95
1.28
0.437
3.055
4.135


Proliferation genes


BUB1_R3
0.382
0.8895
2.02
0.1995
0.3455
0.403
3.025
1.09
0.1615
0.1083


MKI67_R4
0.2405
0.612
1.175
0.255
0.37449998
0.41000003
1.5999999
1.047
0.29500002
0.15


MYBL2_R7
0.85899997
2
4.79
0.23699999
0.3725
0.39499998
4.725
2.665
0.2715
0.0725


STK6_R5
0.13
0.4505
0.569
0.08515
0.14
0.1275
0.7625
0.4385
0.07080001
0.052500002


TOP2A_R7
0.21700001
0.249
0.2115
0.004185
0.0484
0.11
0.3735
0.2395
0.035949998
0.00465


Housekeeper genes


ACTB_R2
1.9649999
1.21
0.7185
1.1600001
1.04
1.7
2.415
0.90400004
1.355
0.6815


MRPL19_R2
2.225
3.335
5.575
2.6750002
2.865
2.63
5.02
1.995
2.99
0.4555


PSMC4_R3
0.16499999
0.4955
0.814
0.15
0.225
0.4115
0.5185
0.1325
0.19150001
0.048699997


PUM1_R6
1.24
0.649
1.28
1.17
0.6795
1.0450001
2.03
0.2685
1.74
0.188


SF3A1_R3
1.34
0.747
0.86300004
1.235
0.7705
0.96000004
1.615
0.4105
1.985
0.29549998










FF-PCR data 40 classifiers + 5 proliferation genes + 5 Housekeepers




















Gene/PCR Sample name




BR99-







Matrix_5HK +


prolif_(45 g × 35 s)
BR00-0284
BR00-03655_C
BR00-0572_Int-VIM
BR00-0587_GATA3
0207_C_Int
BR99-0348_GATA3
PB120-MET-LN
PB126
PB149
PB184





Intrinsic gene list


ASF1A_R1
11.95
3.725
1.24
3.255
6.27
4.58
4.0950003
11.049999
1.58
10.450001


B3GNT5_DR3
4.31
1.21
1.625
1.345
1.685
3.3899999
15.4
39.55
1.545
8.085


BLVRA_R2
31.85
18.4
0.39499998
11.1
26.45
16.900002
7.85
8.715
3.94
4.085


BTG3_R1
19.95
2.73
2.355
1.8399999
4.84
10.085
9.49
31.15
2.7849998
10.6


C10ORF7_R2
7.94
3.3600001
9.110001
3.5999998
5.495
4.5950003
5.635
22
2.05
5.385


C16ORF45_R1
10.15
10.165
0.771
13.15
4.37
152.5
2.71
3.6299999
14.75
9.735


CaMKIINalpha_R2
3.315
6.76
0.31800002
2.595
3.925
12.85
1.625
0.82
4.025
2.42


CDH3_R1
8.6449995
2.5900002
1.0785
0.6365
0.5325
7.52
21.65
154
8.825
14.85


CHI3L2_R1
153.5
6.265
2015
22.95
460.5
162
336.5
6120
71.05
127


COX6C_R2
0.622
1.26
0.0402
2.685
6.88
9.96
0.5365
0.6795
0.174
0.4015


CSDA_R1
16.5
4.79
65.2
1.5795
14.299999
10.950001
6.2200003
57.850002
5.625
35.75


CTPS_R1
3.04
2.255
1.155
1.064
4.9750004
5.3450003
9.434999
15.799999
0.9515
10.5


ERBB2_R5
28.05
2.58
0.187
2.16
4.26
4.33
2.1
3.075
1.51
1.26


ESR1_R3
2.345
20.599998
0.06465
88.6
179
44.2
0.3445
3.4
9.75
0.3955


FABP7_R2
0.01145
0.003852
0.9575
0.0145
0.003852
0.003852
0.01118
2.065
0.09295
1.73


FBP1_R1
13.1
6.59
0.0352
5.035
13.2
10.7
0.7365
0.22049999
1.345
0.8715


FLJ10980_R2
39.6
6.83
1.5450001
25.099998
13.8
50.15
4.06
3.66
12.95
6.295


FOXC1_DR2
17.6
3.085
23
2.24
2.0700002
6.415
36.85
93.6
5.475
55.599997


FZD7_DR4
27.75
14.3
25.75
35.6
6.8050003
25.25
10.049999
24.35
27.55
25.550002


GATA3_R4
3.77
4.345
0.035
15.700001
45.35
29.15
0.05835
2.21
7.025
0.0978


GRB7_DR4
18.45
1.455
0.2145
1.48
1.465
5.4049997
1.34
5.885
1.225
1.925


GSTM3_R3
5.46
6.51
2.295
21.400002
1.575
54.5
2.2
3.235
2.655
0.4105


GSTP1_R2
20.75
5.175
17.05
4.855
9.645
36.4
31.8
57.5
10.585
35


HIS1_DR1
8.115
5.0550003
1.355
4.2349997
15.5
23.349998
5.455
20
6.66
11.35


ID4_DR4
35.05
25.599998
415
21.099998
17.5
83.8
63
86.35
100.15
483


IGBP1_R1
13.6
5.2250004
0.9165
6.2799997
7.465
8.15
3.31
6.895
3.475
3.27


INPP4B_R1
13.700001
7.125
0.1825
7.315
3.6399999
19.05
5.395
0.9835
10.55
2.145


KIT_R4
2.77
1.26
4.585
1.5150001
1.28
11.45
0.8555
11.3
5.3
6.095


KRT17_R6
3.9899998
1.025
2.4650002
1.915
0.6885
1.175
2.05
21.75
6.275
42.25


S100A11_R5
6.285
1.575
0.873
1.9300001
1.33
4.495
5.5
14.475
1.23
3.3899999


SEMA3C_R1
9.975
4.4049997
0.412
1.815
6.5550003
3.33
0.7355
1.155
5.75
1.013


SLC39A6_DR3
0.93050003
0.94200003
0.14199999
9.88
16.45
4.7700005
0.31649998
0.82350004
2.555
0.74600005


SLC5A6_DR4
9.315001
6.525
19.05
1.45
4.105
12.6
5.96
49.449997
3.28
7.6499996


TCEAL1_DR3
11
8.639999
1.325
8.775
13.3
15.200001
6.5299997
5.26
7.635
5.33


TFF3_R2
41.35
171.5
0.38300002
17.05
40.050003
13.55
0.77750003
2.185
9.315001
2.04


TMSB10_R1
18.25
4.76
0.877
2.835
3.715
6.27
10.6
25.55
2.19
8.395


TP53BP2_R1
13.299999
6.4399996
7.745
2.335
5.2749996
9.315001
22.400002
54.2
3.2350001
9.055


VAV3_R1
35.45
87.850006
2.3600001
82.15
3.885
80.4
4.27
73.5
39.65
5.365


WWP1_R2
7.205
42.75
0.736
7.06
5.43
31.8
5.0599997
2.46
3.105
4.27


XBP1_R2
40.8
26.95
0.151
40.35
20.6
24.85
2.165
1.685
9.26
2.76


Proliferation genes


BUB1_R3
8.014999
3.395
5.755
1.49
2.025
1.695
13
23.1
0.5825
11.1


MKI67_R4
4.6
2.43
1.7850001
1.14
1.94
1.845
6.6
13.6
0.629
12.25


MYBL2_R7
4.6549997
2.645
3.065
0.2655
0.519
0.624
6.415
8.684999
0.2665
7.4849997


STK6_R5
2.4099998
1.895
1.385
0.406
0.8295
0.5585
2.57
6.955
0.1875
4.265


TOP2A_R7
19.2
1.745
1.2
0.62
0.845
0.92649996
2.975
18.4
0.125
5.1800003


Housekeeper genes


ACTB_R2
15.45
3.37
0.8175
3.9250002
8.645
8.695
7.1499996
12.8
2.355
8.53


MRPL19_R2
14.85
5.37
1.1949999
3.625
7.4849997
6.6400003
10.094999
18
3.0149999
4.635


PSMC4_R3
4.1549997
3.595
1.175
2.45
4.6549997
3.7800002
5.0550003
8.565
0.6475
9.49


PUM1_R6
21.5
10.7
9.18
9.184999
11.4
12.05
11.35
15.049999
7.99
11.799999


SF3A1_R3
17.15
6.55
1.5699999
7.42
12.1
9.525
8.465
11.4
4.915
17.95





















Gene/PCR Sample name













Matrix_5HK +


prolif_(45 g × 35 s)
PB205T_FFPE
PB255_FFPE
PB297_FFPE
PB311_FFPE
PB314_FFPE
PB334_FFPE
PB362_FFPE
PB370_FFPE
PB376_FFPE
PB379_FFPE
PB413_FFPE





Intrinsic gene list


ASF1A_R1
0.33850002
0.537
1.1500001
0.73800004
0.1163
0.519
0.14050001
0.24599999
1.505
0.06535
0.6705


B3GNT5_DR3
5.635
3.0700002
5.925
1.6500001
1.175
6.14
0.6345
1.28
4.83
6.08
1.985


BLVRA_R2
0.417
1.755
0.668
3.15
1.085
0.99549997
0.5935
0.978
3.045
0.314
2.31


BTG3_R1
4.99
2.51
6.995
1.62
1.15
4.255
1.155
1.0320001
3.26
2.15
1.375


C10ORF7_R2
0.72749996
0.764
1.585
1.36
0.3385
2.455
0.176
0.363
1.19
0.291
1.045


C16ORF45_R1
0.283
2.275
0.354
0.85150003
0.202
0.18180001
0.532
3.01
0.49
0.18180001
1.05


CaMKIINalpha_R2
2.3899999
10.85
2.5900002
6.6499996
3.48
0.544
3.055
2.865
1.04
0.8985
1.6700001


CDH3_R1
1.0955
1.565
10.549999
2.52
1.56
1.625
0.2105
0.24849999
3.2649999
1.22
0.81299996


CHI3L2_R1
2.745
8.785
7605
156.5
2.79
560
32.3
14.85
22.5
122.5
29


COX6C_R2
0.033150002
0.2805
0.107999995
0.65849996
0.01845
0.03415
0.0268
0.06425
0.0315
0.0836
1.165


CSDA_R1
6.27
3.645
7.365
8.065
1.87
16.65
1.17
2.6999998
4.65
1.78
3.48


CTPS_R1
2.21
0.6825
1.665
1.4100001
0.5185
2.3600001
0.2075
0.30900002
1.4300001
0.2015
0.56149995


ERBB2_R5
0.951
1.77
1.395
0.668
13.9
0.3315
0.614
1.45
0.49600002
0.39450002
1.135


ESR1_R3
0.053729996
3.585
0.11965
3.275
0.059699997
0.29500002
6.88
22.2
0.186
0.99399996
33.25


FABP7_R2
0.00589
0.002871
1.405
0.00842
0.002871
0.17449999
0.002871
0.00526
0.01255
0.002871
0.047399998


FBP1_R1
0.034649998
0.927
0.05485
0.579
0.1285
0.105900005
0.255
0.40850002
0.7705
0.10349999
2.5549998


FLJ10980_R2
0.35750002
1.23
0.7365
0.4885
0.31149998
0.37300003
0.228
1.31
0.5845
0.5395
1.78


FOXC1_DR2
3.755
3.81
63
6.9700003
1.16
49.5
0.31849998
3.2
4.335
3.53
5.625


FZD7_DR4
4.575
14.6
17.9
9.29
3.4250002
15.8
6.075
8.725
5.14
3.04
9.105


GATA3_R4
1.615
3.485
0.26749998
1.535
0.2235
0.100150004
1.905
1.745
0.1109
0.407
9.49


GRB7_DR4
0.782
0.56949997
0.947
0.51600003
9.26
0.1815
0.3505
0.503
0.289
0.2095
0.43449998


GSTM3_R3
0.2165
2
0.2805
1.53
0.07805
0.1745
1.5350001
1.375
0.192
0.4705
0.2195


GSTP1_R2
46.95
17.45
37.1
10.385
3.2649999
34.9
2.655
4.99
30.05
19.849998
27.3


HIS1_DR1
6.075
6.975
4.3599997
3.085
1.44
4.455
1.365
5.925
4.985
1.21
4.84


ID4_DR4
6.42
58.15
224.5
15.9
4.585
134
11.950001
14.700001
9.35
8.184999
73.55


IGBP1_R1
0.425
0.745
0.303
0.89100003
0.13550001
0.38599998
0.12449999
0.2935
0.85
0.0676
1.016


INPP4B_R1
0.199
3.455
0.363
4.1800003
0.15
0.31599998
2.135
2.81
0.63199997
0.261
1.735


KIT_R4
0.2065
2.08
10.030001
0.49150002
0.6355
8.475
0.471
1.385
0.72650003
2.27
3.7150002


KRT17_R6
1.575
1.38
3.2150002
8.245
1.2950001
52.699997
0.481
0.3565
2.65
8.535
2.12


S100A11_R5
2.5549998
3.225
4.55
3.1950002
1.1800001
2.605
1.275
1.205
1.1800001
1.425
1.5550001


SEMA3C_R1
0.396
1.3050001
0.51750004
1.8399999
0.204
0.224
0.71650004
0.7985
1.26
0.465
0.89049995


SLC39A6_DR3
0.0646
0.9415
0.23449999
0.2865
0.041950002
0.09715
0.242
1.075
0.105000004
0.261
0.722


SLC5A6_DR4
6.225
3.095
2.52
1.8499999
2.255
2.59
1.535
1.31
1.575
0.37150002
0.744


TCEAL1_DR3
1.3535
9.842501
1.655
3.1
0.71650004
0.7285
1.665
1.6269999
1.7745001
2.605
8.84


TFF3_R2
0.429
8.31
0.90900004
9.835
0.40350002
0.4305
1.305
7.61
1.485
0.5105
9.565001


TMSB10_R1
0.774
0.934
1.62
2.2800002
0.212
1.165
0.1115
0.321
1.4
0.1245
0.479


TP53BP2_R1
3.22
2.665
27.35
3.1149998
1.81
4.45
3.0149999
2
2.245
1.765
2.58


VAV3_R1
0.90749997
9.235001
1.99
2.8049998
1.0665
0.597
1.3199999
2.705
3.045
1.0545
3.97


WWP1_R2
0.64199996
2.295
1.4
1.245
0.605
0.89849997
1.0550001
1.6949999
0.704
0.6185
1.9


XBP1_R2
0.5995
12
1.135
12.05
3.495
1.645
2.7849998
9.309999
3.175
9.285
14.1


Proliferation genes


BUB1_R3
2.7350001
0.87399995
2.3200002
1.115
0.6
1.4449999
0.357
0.24649999
0.7985
0.6455
0.8225


MKI67_R4
4.17
0.8835
1.385
1.1
0.594
2.3899999
0.32099998
0.5615
1.65
0.324
0.748


MYBL2_R7
10.9
1.48
4.41
1.985
1.2750001
8.16
0.6465
0.7945
1.8
1.1099999
0.667


STK6_R5
3.8
1.105
0.7175
0.449
0.1745
0.822
0.1595
0.131
0.36900002
0.0878
0.26099998


TOP2A_R7
1.095
0.2705
0.9
0.382
1.56
0.7575
0.135
0.004185
0.282
0.0343
0.21000001


Housekeeper genes


ACTB_R2
1.555
2.12
2.29
2.665
0.62049997
1.14
0.5445
0.6555
1.16
0.949
1.62


MRPL19_R2
1.985
1.915
4.4849997
4.39
0.91550004
4.205
0.7355
1.1800001
3.0749998
0.56299996
3.21


PSMC4_R3
0.24149999
0.2415
0.41750002
0.51100004
0.08335
0.2625
0.055600002
0.1125
0.14
0.142
0.4325


PUM1_R6
0.6115
0.62450004
2.275
1.715
0.289
0.8995
0.44050002
0.272
0.85249996
0.244
1.1800001


SF3A1_R3
0.662
0.6185
2.0149999
1.675
0.3095
1.0799999
0.4765
0.379
0.9735
0.4145
0.96099997






















Gene/PCR Sample name














Matrix_5HK +


prolif_(45 g × 35 s)
PB441_FFPE
PB455_FFPE
UB29_1C_FFPE
PB205T
PB255
PB297
PB311
PB314
PB334
PB362
PB370
PB376





Intrinsic gene list


ASF1A_R1
1.78
0.741
1.91
3.72
6.285
10.355
5.465
2.085
6.86
2.79
26.5
9.105


B3GNT5_DR3
1.69
2.83
19.4
4.255
1.96
11.35
1.855
0.935
20.150002
2.495
17.5
10.530001


BLVRA_R2
6.69
2.85
2.4050002
3.4099998
14.450001
3.1750002
12.9
7.925
8.775
5.4399996
79.7
16.6


BTG3_R1
2.32
4.69
7.855
7.565
6.21
35.75
6.915
16.05
15.645
4.125
31.1
22


C10ORF7_R2
0.93850005
0.82
3.38
3.885
6.5550003
11.700001
8.915001
5.565
32.2
2.56
36.35
12.95


C16ORF45_R1
3.155
1.205
0.6935
2.145
34.05
4.335
5.965
6.435
3.2199998
10.635
455
5.3599997


CaMKIINalpha_R2
5.34
3.24
2.1
2.795
29.25
7.48
30.7
21.25
2.1000001
4.09
98
4.935


CDH3_R1
0.3085
5.31
11.05
3.021
4.5150003
35.75
6.915
16.05
15.645
4.125
38.95
58.1


CHI3L2_R1
250.5
13.9
943
3.845
123
35450
286
66.25
1815
59.050003
415.5
250.5


COX6C_R2
0.53999996
0.042400002
0.07665
0.175
5.485
1.035
5.745
0.5565
0.77699995
0.14649999
21.15
0.468


CSDA_R1
1.935
4.4350004
12.15
7.975
12.95
39.25
23.55
12.6
43.25
6.4300003
53
10.4


CTPS_R1
0.8255
1.84
1.4849999
8.360001
3.51
8.595
6.225
5.6800003
14.799999
0.8915
36.6
9.32


ERBB2_R5
1.4200001
35.85
1.21
0.7705
8.255
2.76
1.2065
35.15
1.0265
0.6115
16.65
3.1100001


ESR1_R3
47.7
0.264
0.373
0.193
47.75
0.5525
17.6
1.12
3.505
10.059999
653
0.9755


FABP7_R2
0.002871
0.00473
5.95
0.0303
0.007445
17.95
0.00484
0.004915
1.4
0.0436
0.0431
0.123500004


FBP1_R1
2.2
1.1800001
0.533
0.168
7.74
0.3475
2.2
1.78
1.49
0.796
48.6
5.19


FLJ10980_R2
3.9650002
1.185
0.7655
2.5149999
32.25
11.9
5.4449997
13.450001
11.25
9.395
103
13.55


FOXC1_DR2
5.9300003
9.47
98.9
2.345
2.13
98.5
4.685
1.355
57.8
10.6
19.95
3.755


FZD7_DR4
13.75
22.849998
18.05
8.014999
25.8
66.85
11.25
18.400002
94.1
34.25
220.5
17.45


GATA3_R4
4.4
0.5855
1.1
2.545
21.05
1.56
9.205
5.26
1.37
2.8400002
79
1.92


GRB7_DR4
0.469
26.45
1.12
1.255
4.77
3.13
1.97
77.25
4.325
0.729
25.95
2.965


GSTM3_R3
2.085
3.87
0.22749999
0.814
29.85
4.04
9.365
1.9649999
2.12
9.450001
244
4.285


GSTP1_R2
18.45
11.3
41.25
37.4
12.950001
64.5
5.17
2.995
26.9
6.76
70.1
60.3


HIS1_DR1
9.36
3.3200002
3.395
8.675
22.15
13.55
10.4
9.700001
12.8
3.7649999
88
27


ID4_DR4
12.2
27.4
21.4
6.5550003
220
399.5
19.25
14.950001
159.5
145.5
272.5
11.950001


IGBP1_R1
3.16
0.85800004
0.76699996
3.365
8.235001
2.185
5.4049997
4.7650003
7.29
3.585
39.6
5.83


INPP4B_R1
7.83
1.815
1.3299999
0.3255
12.45
1.2650001
14
1.4300001
1.85
10.025
204
2.12


KIT_R4
1.0125
7.17
5.14
0.7635
8.475
38.8
1.235
5.115
66.65
22.75
50.5
4.7200003


KRT17_R6
0.5475
1.425
7.2650003
2.64
1.4200001
5.1049995
7.415
5.5299997
49.75
7.8
20.099998
18.599998


S100A11_R5
3.395
4.4
11.24
4.715
2.6799998
16.55
10.55
7.425
18
1.725
31.5
8.73


SEMA3C_R1
1.815
1.4649999
1.525
0.509
4.99
1.3
4.825
2.4
0.6605
4.0699997
54.7
9.065001


SLC39A6_DR3
3.405
0.44099998
0.155
0.156
9.129999
1.29
0.7575
0.6045
1.225
0.36
29
0.4385


SLC5A6_DR4
1.092
2.0549998
6.8199997
7.665
8.02
12.45
6.5950003
12.200001
22.05
3.6
42.75
11.049999


TCEAL1_DR3
4.875
3.165
2.73
2.1799998
24.55
4.25
7.42
3.06
2.795
8.09
90.1
11.5


TFF3_R2
44.8
20
0.7475
2.6
107
2.24
38.6
25.25
1.405
5.42
153.5
5.97


TMSB10_R1
0.627
1.0550001
2.33
6.225
6.855
17.2
13.4
6.4300003
13.1
2.96
54.550003
21.5


TP53BP2_R1
3.605
7.755
7.135
3.355
5.51
77.4
5.5699997
7.455
20.2
3.7450001
29.5
12.55


VAV3_R1
16
2.345
2.325
13.1
260.5
25.45
66.2
115
22.7
28.3
845
147


WWP1_R2
2.885
1.095
0.90999997
1.22
10.35
9.105
6.535
4.67
6.6549997
3.8400002
58.95
4.7749996


XBP1_R2
26.900002
11.45
2.12
0.7985
34.1
2.6399999
25.349998
11.55
8.73
3.12
122.5
14.6


Proliferation genes


BUB1_R3
0.4895
0.845
2.155
5.125
2.84
15.85
5.74
3.43
7.3050003
0.62549996
20.5
7.17


MKI67_R4
0.4685
0.95449996
1.5550001
4.295
2.535
9.2
4.0150003
1.925
10.450001
0.5715
27.05
9.34


MYBL2_R7
0.2595
0.724
7.035
7.21
1.0139999
5.045
2.78
2.065
6.8050003
0.299
11.450001
4.6949997


STK6_R5
0.167
0.252
1.025
9.025
5.465
3.795
2.05
1.125
4.205
0.17750001
7.875
3.0749998


TOP2A_R7
0.174
1.15
0.5115
2.415
2.065
10.200001
2.58
20.05
6.955
0.149
11.9
4.005


Housekeeper genes


ACTB_R2
1.96
4.06
4.58
5.3900003
10.125
9.82
9.635
7.6
13.65
3.3249998
73.55
10.799999


MRPL19_R2
8.74
3.89
6.3999996
3.975
4.41
6.285
6.2650003
3.405
9.93
2.52
30.150002
8.115


PSMC4_R3
0.7155
0.6185
0.94299996
3.7450001
4.1549997
5.28
7.255
4.45
8.76
1.1099999
33.05
4.5550003


PUM1_R6
2.395
1.095
2.27
5.445
17.45
25.25
14.450001
12.2
21.55
8.13
113
16.150002


SF3A1_R3
1.55
1.15
2.0349998
6.425
13
14.65
10.275
4.5649996
14.950001
6.215
69.65
11.5




















Gene/PCR Sample name












Matrix_5HK +


prolif_(45 g × 35 s)
PB379
PB413
PB441
PB455
UB29
UB37_7V_FFPE
UB38_1D_FFPE
UB39_5I_FFPE
UB43_4B_FFPE
UB45_6D_FFPE





Intrinsic gene list


ASF1A_R1
2.58
1.74
3.75
0.4525
4.41
0.268
0.9395
1.28
0.1805
0.01845


B3GNT5_DR3
3.105
0.5425
1.6700001
0.1915
8.92
1.79
4.295
1.605
1.2375
1.0455


BLVRA_R2
9.425
5.41
13.15
2.22
3.21
1.345
4.8900003
4.9
0.66499996
0.00733


BTG3_R1
3.48
1.94
2.9450002
1.3900001
11.3
1.965
2.01
1.885
1.0345
3.06


C10ORF7_R2
3.4099998
2.725
2.8000002
0.723
7.6
0.4855
1.115
1.245
0.3395
0.07805


C16ORF45_R1
3.67
10.13
11.29
6.34
1.9100001
0.3085
3.725
1.165
0.18180001
0.18180001


CaMKIINalpha_R2
4.49
2.615
3.79
1.655
1.645
1.007
3.93
9.139999
1.905
1.39


CDH3_R1
1.6700001
1.5150001
0.513
1.8
9.87
3.7
1.1949999
3.13
1.425
1.095


CHI3L2_R1
560
93.45
382
4.3199997
1560
2.3175
79.75
166.5
42.8
264


COX6C_R2
3.28
1.9300001
1.27
0.019749999
0.19
0.0295
0.3115
0.925
0.225
0.0284


CSDA_R1
8.645
5.455
4.8500004
1.9849999
29
2.7849998
8.345
3.82
0.339
0.0174


CTPS_R1
1.81
2.3400002
2.325
1.59
2.8449998
0.6025
0.71000004
1.34
0.324
0.023850001


ERBB2_R5
1.765
1.0525
1.26
12.014999
0.376
17
1.385
5.105
0.3515
2.185


ESR1_R3
19
43.55
70.1
0.07635
0.216
0.0881
9.450001
18.7
2.23
17.95


FABP7_R2
0.003852
0.08135
0.003852
0.003852
8.945
0.002871
0.01315
0.007085
0.00319
0.002871


FBP1_R1
1.26
6.235
3.205
0.527
0.4035
0.0704
1.8199999
2.75
0.57449996
0.11570001


FLJ10980_R2
20.05
17.1
18.6
2.6599998
4.865
0.277
3.4850001
3.005
0.366
0.1621


FOXC1_DR2
3.4099998
2.83
2.205
1.2195001
45.5
2.53
14.450001
9.665
1.14
0.41750002


FZD7_DR4
4.42
6.455
15.5
3.915
21.2
6.245
14.3
14.6
5.965
14.15


GATA3_R4
3.94
12.35
7.575
0.52
1.62
0.08505
3.225
15.75
1.795
1.85


GRB7_DR4
1.96
0.8815
1.0285001
15.549999
0.979
9.344999
0.78
3.56
0.2165
2.5500002


GSTM3_R3
10.549999
0.4615
6.575
8.035
0.5255
0.4405
0.913
2.3000002
0.093150005
0.03655


GSTP1_R2
14.75
14.549999
4.36
2.08
13.6
8.965
30.150002
17.7
7.715
5.705


HIS1_DR1
12.1
10.9
17.5
2.375
3.085
2.395
8.17
9.264999
0.5765
0.5145


ID4_DR4
10.9
179.5
21.5
8.9
32.1
32.85
45.800003
20.2
15.25
11


IGBP1_R1
4.305
3.46
7.37
0.6545
1.715
0.27899998
1.89
1.225
0.2175
0.0288


INPP4B_R1
0.9165
2.67
11.549999
0.65250003
1.97
0.2945
2.5300002
7.245
2.13
1.54


KIT_R4
2.895
3.205
1.345
2.955
7.755
0.612
6.855
2.995
1.23
0.518


KRT17_R6
0.20899999
2.045
0.5865
0.333
6.245
1.11
2.8400002
8.51
0.7365
0.702


S100A11_R5
1.765
0.5955
3.685
1.0699999
5.205
2.145
2.265
1.49
1.175
2.385


SEMA3C_R1
0.9805
0.77849996
3.135
0.3435
0.57449996
0.7995
2.1399999
3.025
1.79
5.74


SLC39A6_DR3
4.215
1.99
4.3599997
0.13
0.1445
0.115
1.7950001
2.68
0.25550002
3.5149999


SLC5A6_DR4
1.7850001
1.635
2.4499998
1.69
11.25
1.23
1.705
0.90550005
1.11
3.14


TCEAL1_DR3
9.565001
15.6
5.84
1.56
3.385
1.1800001
5.105
7.74
1.6099999
3.03775


TFF3_R2
6.0299997
19.45
51.8
17.1
0.4125
0.4775
41.65
42.9
5.8599997
0.31849998


TMSB10_R1
2.145
1.395
3.35
1.9399999
5.83
1.115
1.7950001
1.27
0.48049998
0.0396


TP53BP2_R1
4.25
2.585
4.7200003
2.645
5.16
2.5149999
5.1850004
2.92
1.435
2.795


VAV3_R1
131.5
43
84.95
9.174999
20
2.3600001
12.05
4.45
0.235
0.4255


WWP1_R2
7.205
4.1549997
7.025
0.4305
1.29
0.4275
2.69
4.27
1.105
0.8405


XBP1_R2
25.1
7.8
36.6
2.13
1.0235
1.72
10.6
14.5
2.355
6.675


Proliferation genes


BUB1_R3
2.175
1.25
0.94200003
0.6385
3.23
0.4795
0.6305
0.92149997
0.662
0.46850002


MKI67_R4
0.96500003
1.095
0.7195
0.52
3.105
0.33
0.668
0.9235
0.68350005
0.3595


MYBL2_R7
1.505
0.3645
0.2335
0.3615
4.385
1.4549999
0.549
1.15
1.2850001
0.758


STK6_R5
0.3655
0.372
0.301
0.176
1.5350001
0.176
0.24450001
0.521
0.25849998
0.19999999


TOP2A_R7
0.22850001
0.3885
0.3375
0.867
0.913
0.004185
0.134
0.418
0.113000005
0.004185


Housekeeper genes


ACTB_R2
3.16
1.95
3.39
2.605
5.545
1.435
3.125
1.54
2.065
0.5985


MRPL19_R2
2.05
2.33
3.29
1.1555
2.79
1.855
5.615
3.125
2.205
0.305


PSMC4_R3
3.375
1.0799999
2.775
0.495
1.665
0.31
0.75549996
0.729
0.2685
0.04485


PUM1_R6
6.075
5.8450003
9.46
1.575
7.6000004
0.55550003
2.22
1.61
0.5185
0.2659


SF3A1_R3
5.52
2.565
7.7250004
1.125
3.57
0.4425
2.64
1.4749999
0.72099996
0.184




















Gene/PCR Sample name












Matrix_5HK +


prolif_(45 g × 35 s)
UB55_5D_FFPE
UB57_3D_FFPE
UB58_7E_FFPE
UB60_3D_FFPE
UB66_1D_FFPE
UB37
UB38
UB39
UB43
UB45





Intrinsic gene list


ASF1A_R1
0.0633
0.15349999
0.164
2.15
0.078
4.67
6.8149996
4.755
2.415
4.88


B3GNT5_DR3
1.605
0.9
0.865
2.865
0.9555
6.73
5.74
2.2649999
1.38
3.185


BLVRA_R2
0.276
0.488
0.58000004
7.01
0.479
16.65
14.2
12.8
21
14.3


BTG3_R1
0.83449996
1.345
0.7475
5.375
2.335
8.59
3.6399999
3.555
4.6400003
13.85


C10ORF7_R2
0.566
0.2845
0.52349997
1.775
0.26999998
5.08
2.84
4.18
6.99
5.02


C16ORF45_R1
3.04
0.62549996
0.7075
1.6949999
0.18180001
10.92
28.55
9.175
15.1
166.5


CaMKIINalpha_R2
6.495
1.58
2.1
5.5699997
1.175
5.75
6.805
10.715
3.44
9.42


CDH3_R1
0.2495
0.08095
0.5145
4.575
0.273
26.45
2.49
3.645
8.559999
5.7


CHI3L2_R1
42.5
12.275
8.5
43.4
87.35
6.54
574.5
415
73.45
500


COX6C_R2
0.02795
0.2615
0.089200005
0.1125
0.2105
0.6685
1.895
3.9899998
2.805
1.705


CSDA_R1
2.1
3.0149999
1.34
5.42
0.275
11.15
8.715
6.595
6.5950003
10.55


CTPS_R1
0.2805
0.5095
0.18149999
1.38
0.208
5.945
3.145
5.7799997
5.135
4.455


ERBB2_R5
1.435
0.704
0.7135
23
1.51
50.6
2.3400002
4.705
2.185
5.525


ESR1_R3
6.3500004
4.825
3.145
0.553
25.3
0.7985
55.1
48.5
14.15
65.95


FABP7_R2
0.002871
0.00746
0.002871
0.02895
0.002871
0.003852
0.005975
0.00428
0.007535
0.00972


FBP1_R1
0.5405
0.8655
0.22
1.033
0.2515
0.82299995
6.5249996
7.0550003
5.21
4.31


FLJ10980_R2
1.98
0.635
1.89
4.5150003
0.737
8.94
41
24.05
14.75
53.75


FOXC1_DR2
1.565
2.54
0.9785
9.13
0.875
3.61
4.205
3.135
1.7349999
1.63


FZD7_DR4
10.3
8.555
4.925
12.95
10.33
31.900002
16.85
15.799999
17.55
31.099998


GATA3_R4
3.5949998
5.7349997
1.0450001
0.5345
4.74
0.754
23.900002
33.4
9.94
11.305


GRB7_DR4
0.96599996
0.969
0.294
9.35
1.07
57.6
1.665
7.2349997
1.335
11.200001


GSTM3_R3
3.045
0.1895
1.0699999
0.505
0.988
4.365
5.1850004
14.35
3.08
3.5100002


GSTP1_R2
10.200001
5.09
4.315
50.7
14.549999
15.45
23.2
5.67
8.46
4.7650003


HIS1_DR1
4.185
2.505
1.625
8.095
1.7850001
10.6
22.650002
26.849998
9.235001
19.05


ID4_DR4
8.375
6.365
7.64
46.35
23.349998
224.5
43.45
36.55
37.9
30.25


IGBP1_R1
0.112
0.22299999
0.182
2.48
0.312
4.3500004
11.2
7.34
4.9700003
17.55


INPP4B_R1
3.6599998
0.5875
1.12
5.3149996
1.54
2.9
12.15
14.299999
6.585
28.8


KIT_R4
2.355
0.315
1.385
3.9299998
0.47
4.32
5.625
2.3200002
3.59
3.8650002


KRT17_R6
0.486
0.3085
0.94
10.45
0.17639999
2.3449998
0.799
10.25
4.88
2.7649999


S100A11_R5
2.85
0.621
1.1
6.075
1.565
6.9049997
4.0699997
1.5150001
3.9899998
12.7


SEMA3C_R1
2.58
0.8355
1.525
2.6399999
3.565
6.21
5.5299997
5.19
7.24
27.75


SLC39A6_DR3
1.7
0.512
0.6955
0.176
0.98599994
0.62049997
5.755
7.7349997
1.78
10.75


SLC5A6_DR4
1.29
0.64849997
0.75
5.15
0.8725
7.04
3.25
1.855
4.735
11.1


TCEAL1_DR3
5.79
6.115
1.66
4.285
4.8775
5.98
21.599998
16.45
6.6499996
14


TFF3_R2
1.006
62.35
15.55
9.82
12.5
4.575
390.5
187.5
140.5
38.75


TMSB10_R1
0.201
0.226
0.44
6.4449997
0.1415
15.8
8.555
4.065
14.5
8.035


TP53BP2_R1
4.475
1.1
1.4300001
8.26
1.455
12.8
18.4
4.2200003
5.025
12.65


VAV3_R1
9.110001
1.825
8.945
8.13
0.845
81.2
108
60.2
84.649994
224.5


WWP1_R2
1.9949999
0.398
0.7585
1.975
1.5450001
5.325
13.7
14.15
6.91
16.65


XBP1_R2
7.625
10.235001
1.4
8.799999
17.95
7.725
31.75
16.5
9.025
26.150002


Proliferation genes


BUB1_R3
0.503
0.271
0.252
0.854
0.542
3.225
1.02
1.6500001
0.978
8.175


MKI67_R4
0.5045
0.3265
0.2495
0.66349995
0.4395
2.225
1.25
1.315
0.7865
1.335


MYBL2_R7
1.1800001
0.34100002
0.342
1.0999999
0.2035
2.985
0.3015
0.50699997
0.86950004
0.784


STK6_R5
0.2855
0.117
0.1275
0.286
0.217
1.0799999
0.538
1.035
0.5625
0.6925


TOP2A_R7
0.26749998
0.0674
0.059699997
0.09445
0.004185
0.6055
0.4745
1.017
0.228
0.6175


Housekeeper genes


ACTB_R2
1.0215
0.559
0.677
5.24
0.68850005
12.1
5.6949997
3.5900002
15.21
10.95


MRPL19_R2
0.5935
1.0799999
0.8175
7.62
1.255
7.635
6.955
2.615
2.925
3.44


PSMC4_R3
0.15200001
0.1285
0.2335
1.145
0.121999994
7.415
4.03
2.99
3.565
5.665


PUM1_R6
0.347
0.32349998
0.7125
2.6399999
0.2135
10.25
7.8199997
11.45
7.495
16


SF3A1_R3
0.385
0.563
0.35549998
2.435
0.2995
8.355
10.264999
4.85
9.545
13.55

















Gene/PCR Sample name








Matrix_5HK + prolif_(45 g × 35 s)
UB55
UB57B
UB58
UB60A
UB66







Intrinsic gene list



ASF1A_R1
0.7085
4.63
4.9049997
7.135
8.73



B3GNT5_DR3
0.9235
5.5299997
4.9750004
6.18
4.84



BLVRA_R2
3.74
6.0699997
18.75
15.95
85.75



BTG3_R1
0.347
0.25335
9.76
14.86
19.8



C10ORF7_R2
1.47
5.1800003
5.8
6.25
20.099998



C16ORF45_R1
33.85
24.95
17.099998
13.434999
111



CaMKIINalpha_R2
15.5
5.8450003
20.7
14.299999
1.0799999



CDH3_R1
0.347
0.25335
9.76
14.86
3.09



CHI3L2_R1
183.5
101.7
66.4
106.6
49.65



COX6C_R2
0.1645
7.7200003
1.075
0.42650002
5.72



CSDA_R1
2.01
10.9
12.75
9.34
3.08



CTPS_R1
0.975
3.9499998
3.37
3.7
2.705



ERBB2_R5
0.97099996
0.985
4.02
36.050003
1.53



ESR1_R3
23.55
58
33.65
3.01
234.5



FABP7_R2
0.003852
0.00513
0.01087
0.07665
0.003852



FBP1_R1
1.89
8.059999
4.1800003
2.645
9.959999



FLJ10980_R2
10.65
14.200001
28.8
25
106.5



FOXC1_DR2
1.2195001
4.6400003
3.3049998
7.68
2.74



FZD7_DR4
9.97
22
57.15
19.1
73.8



GATA3_R4
7.775
38.35
20.45
1.175
18.7



GRB7_DR4
0.9325
2.33
4.38
20.900002
5.295



GSTM3_R3
11.35
3.2150002
18.900002
1.235
19.85



GSTP1_R2
2.195
4.215
16.3
18.1
25.75



HIS1_DR1
4.91
17.9
22.2
13.05
10.75



ID4_DR4
3.5749998
19.2
56.35
84.3
130.5



IGBP1_R1
3.295
9.139999
7.4049997
8.24
45.15



INPP4B_R1
6.475
3.435
13.85
9.48
40



KIT_R4
1.815
2.1799998
9.805
10.9
5.495



KRT17_R6
0.1795
0.376
12.049999
9.94
0.116



S100A11_R5
2.73
2.2350001
10.645
9.365
6.165



SEMA3C_R1
2.525
2.505
7.3999996
5.2200003
18.7



SLC39A6_DR3
3.4499998
4.355
12.9
0.5205
5.875



SLC5A6_DR4
1.51
1.5699999
7.1099997
8.485001
5.4049997



TCEAL1_DR3
5.3999996
21.650002
9.585
8.275
54.05



TFF3_R2
5.4449997
458.5
385.5
67.100006
220.5



TMSB10_R1
1.065
3.24
6.7
12.6
12.1



TP53BP2_R1
3.22
2.935
7.835
11.15
4.9700003



VAV3_R1
104
76.45
264.5
64.85
62.449997



WWP1_R2
4.025
3.745
10.2
6.5150003
27.85



XBP1_R2
5.285
39.75
8.51
13.35
N/A



Proliferation genes



BUB1_R3
1.235
1.19
3.3200002
3
4.495



MKI67_R4
0.84749997
0.963
3.26
2.82
3.5549998



MYBL2_R7
0.6625
0.269
0.998
1.245
0.3635



STK6_R5
0.54700005
0.4525
1.26
1.115
1.36



TOP2A_R7
0.5415
0.4925
2
0.57449996
2.6750002



Housekeeper genes



ACTB_R2
3.095
5.365
12.04999
9.275
18.35



MRPL19_R2
1.575
3.675
5.6
3.76
4.26



PSMC4_R3
1.905
5.23
5.555
5.505
11.1



PUM1_R6
5.45
11.25
17.25
11.35
33.75



SF3A1_R3
2.4
15.15
11.275
8.85
32.949997






























































































































































P3 minimized

P2 minimized



ntrinsic gene list

Intrinsic gene list







ASF1A

ACADSB



BLVRA
B3GNT5
BF



BTG3
COX6C
C5ORF18 (=DP1)



C10orf7
ERBB2
CDK2AP1



C16orf45
ESR1
CX3CL1



CaMKIINalpha
FOXC1
CYB5



CDH3
FZD7
DSC2 (ESTs)



CHI3L2
GATA3
EGFR



CSDA
GRB7
FLJ14525



CTPS
GSTP1
FOXA1



FABP7
KIT
GARS



FBP1
KRT17
HSD17B4



FLJ10980
S100A11
KIAA0310



GSTM3
SLC39A6
KRT5



HIS1
XBP1
NAT1



ID4

PGR



IGBP1

PLOD1



INPP4B

PTP4A2



SEMA3C

RABEP1



SLC5A6

RARRES3



TCEAL1

SDC2



TFF3

SLPI



TMSB10

SMA3



TP53BP2

TAP1



VAV3

TRIM29



WWP1







Proliferation genes

Proliferation genes







BUB1

BIRC5



MKI67

BUB1



MYBL2

CENPF



STK6

CKS2



TOP2A

FAM54A (=DUFD1)





GTPBP4





HSPA14





MKI67





MYBL2





NEK2





PCNA





STK6





TOP2A





TTK







Housekeeper genes

Housekeeper genes







ACTB

MRPL19



MRPL19

PSMC4



PSMC4

PUM1



PUM1



SF3A1



















TABLE 21







GENE
Other Symbol
Gene Name





ACTB
PS1TP5BP1
actin, beta


GAPDH
GAPDH, GAPD
glyceraldehyde-3-phosphate dehydrogenase


GUSB

glucuronidase, beta


RPLP0
36B4, P0, L10E, RPPO, PRLP0
ribosomal protein, large, P0; also known as 60S acidic 




ribosomal protein P0, nltext missing or illegible when filed


TFRC
CD71, TFR1
transferrin receptor (p90, CD71


MRPL19
MRP-L15, RPML15, KIAA0104,
mitochondrial ribosomal protein L19



RLX1



PSMC4
TBP7, S6, MIP224
proteasome (prosome, macropain) 26S subunit, ATPase, 4


PUM1
PUMH1, KIAA0099
pumilio homolog 1 (Drosophila)


SF3A1
SF3a120, SAPI14, PRPF21,
splicing factor 3a, subunit 1, 120 kDa



Prp21



SLC7A6
y + LAT-2, KIAA0245, LAT3,
solute carrier family 7 (cationic amino acid transporter, y+



LAT-2
system), member 6


S100A11
S100C
S100 calcium binding protein A11 (calgizzarin)


ASF1A
DKFZP547E2110, CIA
ASF1 anti-silencing function 1 homolog A (S. cerevisiae)


BLVRA
BLVR
biliverdin reductase A


BTG3
ANA, MGC8928, TOB5, TOB55,
BTG family, member 3



TOFA



C10orf7
D123
chromosome 10 open reading frame 7


C16orf45
FLJ32618
chromosome 16 open reading frame 45


CAMK2N1
CaMKIINalpha, ICAP-1alpha
calcium/calmodulin-dependent protein kinase II inhibitor 1


CHI3L2
YKL-39, YKL39
chitinase 3-like 2


CSDA
dbpA, ZONAB, CSDA1
cold shock domain protein A


FABP7
B-FABP, BLBP
fatty acid binding protein 7


HEXIM1
CLP-1, HIS1, MAQ1, EDG1
hexamethylene bis-acetamide inducible 1


ID4

inhibitor of DNA binding 4, dominant negative helix-loop-helix




protein


IGBP1
alpha 4
immunoglobulin (CD79A) binding protein 1


INPP4B
MGC132014
inositol polyphosphate-4-phosphatase, type II, 105 kDa


SLC5A6
SMVT
solute carrier family 5 (sodium-dependent vitamin transporter),




member 6


TMSB10

thymosin, beta 10


WWP1
AIP5, DKFZP434D2111
WW domain containing E3 ubiquitin protein ligase 1


BAG1

BCL2-associated athanogene


GSTM1
MU, H-B
glutathione S-transferase M1


MMP11

matrix metallopeptidase 11 (stromelysin 3)


CD68
SCARD1, macrosialin
CD68 antigen


C17orf37
MGC14832, ORB3, XTP4
chromosome 17 open reading frame 37


TCAP
LGMD2G, T-cap, TELE,
titin-cap (telethonin)



telethonin, CMD1N



EMSY
C11orf30
chromosome 11 open reading frame 30


IGFBP2
IBP2
insulin-like growth factor binding protein 2, 36 kDa


MDM2
HDM2
Mdm2, transformed 3T3 cell double minute 2, p53 binding protei


PTEN
MMAC1, TEP1, PTEN1
phosphatase and tensin homolog (mutated in multiple advanced




cancers 1)


TP53
P53
tumor protein p53 (Li-Fraumeni syndrome)


CDC6

CDC6 cell division cycle 6 homolog (S. cerevisiae)


KIF13B
GAKIN, KJAA0639
kinesin family member 13B


MUC1
CD227
mucin 1, cell surface associated


TK1

thymidine kinase 1, soluble


CLDN7
CEPTRL2, CPETRL2
claudin 7


FGFR4
JTK2, CD334
fibroblast growth factor receptor 4


PDSS1
TPT, COQ1, TPRT
prenyl (decaprenyl) diphosphate synthase, subunit 1


AKT3
PKBG, RAC-gamma, PRKBG
v-akt murine thymoma viral oncogene homolog 3 (protein kinase




B, gamma)


AVEN
PDCD12
apoptosis, caspase activation inhibitor


BCL2A1
GRS, BFL1, BCL2L5
BCL2-related protein A1


CA9
MN
carbonic anhydrase IX


CDKN1B
KIP1, P27K1P1
cyclin-dependent kinase inhibitor 1B (p27, Kip1)


CFLAR
CASH, Casper, CLARP, FLAME,
CASP8 and FADD-like apoptosis regulator



FLIP, T-FLICE, M



FIGF
VEGF-D, VEGFD
c-fos induced growth factor (vascular endothelial growth factor




D)


IGF1
JTK13
insulin-like growth factor 1 receptor


KPNA1
SRP1, RCH2, NPI-1, IPOA5
karyopherin alpha 1 (importin alpha 5)


KRAS
KRAS2
v-Ki-ras2 Kirsten rat sarcoma viral oncogene homolng


LRIG1
LIG-1, DKFZP586O1624, LIG1
leucine-rich repeats and immunoglobulin-like domains 1


MAP2
MAP2A, MAP2B, MAP2C
microtubule-associated protein 2


MAPT
MTBT1, tau, PPND, FTDP-17,
microtubule-associated protein tau



TAU, MSTD, MTBT



PPMID
Wip1
protein phospbatase ID magnesium-dependent, delta isoform


PTGS2

prostaglandin-endoperoxide synthase 2 (prostaglandin G/H




synthase and cycloo


RABEP1

rabaptin,RAB GTPase binding effector protein 1


RARA

retinoic acid receptor, alpha


RHOC

ras homolog gene family, member C


ROPN1
ODF6, ropporin, ROPN1A
ropponn, rhophilin associated protein 1


S100A7

S100 calcium binding protein A7 (psoriasin 1)


S100A8

S100 calcium binding protein A8 (calgranulin A)


S100A9
P14, MIF., NTF, LIAG, MRP14,
S100 calcium binding protein A9 (calgranulin B)



MAC387, 60B8AG



SHC1
SHC-P66
SHC (Src homology 2 domain containing) transforming protein 1


TAP1

transporter 1, ATP-binding cassette, sub-family B (MDR/TAP)


TP73L

tumor protein p73L


CKS2

CDC28 protein kinase regulatory subunit 2


FAM54A
DUFD1
homily with sequence similarity 54, member A


GTPBP4
CRFG, NGB, FLJ10690,
GTP binding protein 4



FLJ10686



HSPA14
HSP70-4, HSP70L1
heat shock 70 kDa protein 14


PCNA

proliferating cell nuclear antigen


FOXA1
HNF3A
forkhead box A1


GATA3

GATA binding protein 3


CDCA1
NUF2R
cell division cycle associated 1


AGR2
XAG-2, HAG-2, AG2
anterior gradient 2 homolog (Xenopus laevis)


ESR1
NR3A1, Era, ESR
estrogen receptor 1


SCUBE2
Cegf1, Cegb1, FLJ16792
signal peptide, CUB domain, EGF-like 2


BUB1
hBUB1, BUB1A, BUB1L
BUB1 budding uninhibited by benzimidazoles 1 homolog (yeast)


SLC39A6
LIV-1
solute earner family 39 (zinc transporter), member 6


UGT8
CGT
UDP glycosyltransferase 8 (UDP-galactose ceramide




galactosyltransferase)


LOC400451

Hypothetical gene supported by AK075564; BC060873


KNTC2
HEC, HEC1
kinetochore associated 2


TMC5
FLJ13593
transmembrane channel-like 5


ERBB2
NEU, HER-2
v-erb-b2 crythroblastic leukemia viral oncogene homolog 2,




neuro/glioblastoma


CA12
HsT18816
carbonic anhydrase XII


DKFZp762E1312

Hypothetical protein DKFZp762E1312


BIRC5
EPR-1, AP14
baculoviral IAP repeat-containing 5 (survivin)


ANLN
ANILLIN, Scraps
anillin, actin binding protein (scraps homolog, Drosophila)


CEP5S
FLJ10540, CEP55 C10orf3
centrosomal protein 55 kDa


REEP6
DPIL1, FLJ25383, C19orf32
receptor accessory protein 6


ELOVL5
HELO1, dJ483K16.1
ELOVL family member 5, elongation of long chain fatty acids




(FEN1/Elo2, SUtext missing or illegible when filed


TMEM45B
LOC120224
Transmembrane protein 45B


TTK
MPSIL1
TTK protein kinase


AR
AKR1B1, ALDR1
aldo-keto reductase family 1, member B1 (aldose reductase)


CTSL2
CTSU, CTSV
cathepsin L2


CENPA

centromere protein A, 17 kDa


GALNT7
GALNACT7
UDP-N-acetyl-alpha-D-galactosamine:polypeptide 




N-acetylgalactosarmnyltrans


DNAJC12
JDP1, “J domain protein 1”
DnaJ (Hsp40) homolog, subfamily C, member 12


MLPH
1Rk3, I(1)-3Rk, Slac-2a,
melanophilin



In, exophilin-3



TACSTD1
Ly74, TROP1, GA733-2, EGP,
tumor-associated calcium signal transducer 1



KSA M4S1, MIC18



CDC20
p55CDC
CDC20 cell division cycle 20 homolog (S. cerevisiae)


PIP

prolactin-induced protein


CDCA7
FLJ14736, JPO1
cell division cycle associated 7 - variant2


MIA
CD-RAP
melanoma inhibitory activity


XBP1
XRP1
X-box binding protein 1


C4orf34
FIJ13289, LOC201895
chromosome 4 open reading frame 34


VAV3

vav 3 oncogene


GRB7

growth factor receptor-bound protein 7


UBE2C
UBCH10
ubiquitin-conjugating enzyme E2C


PH-4

hypoxia-inducible factor prolyl 4-hydroxylase


ART3

ADP-ribosyltransferase 3


MELK
KIAA0175
maternal embryonic leucine zipper kinase


CDCA8
FLJ12042
cell division cycle associated 8


DNALI1
P28
dynein, axonemal, light intermediate polypeptide 1


KIAA1370
FLJ10980
KIAA1370


THSD4
FVSY9334, PRO34005,
thrombospondin, type 1, domain containing 4



FLJ13710



KRT18

keratin 18


MYO5C

myosin VC


FBP1
FBP
fructose-1,6-bisphosphatase 1


CDC45L
CDC45L2
CDC45 cell division cycle 45-like (S. cerevisiae)


CXXC5
HSPC195
CXXC finger 5


FANCA
FACA, FANCH, FAA, FA-H, FAH
Fanconi anemia, complementation group A


MYB
c-myb
v-myb myeloblastosis viral oncogene homolog (avian)


OGFRL1
dJ331H24.1
opioid growth factor receptor-like 1


KIF2C
MCAK
kinesin family member 2C


RRM2

ribonucleotide reductase M2 polypeptide


FOXC1
FREAC3 ARA, IGDA, IHG1
forkhead box C1


SFRP1
SARP2, FRP, FRP-1
secreted frizzled-related protein 1


AURKA
STK6
serine/threonine kinase 6


ACTR3B
ARP11, ARP3beta, ARP3BETA
ARP3 actin-related protein 3 homolog B (yeast)


TCF7L1
TCF3
transcription factor 7-like 1 (T-cell specific, HMG-box)


MYBL2
BMYB
v-myb myeloblastosis viral oncogene homolog (avian)-like 2


CELSR1
ME2, HFMI2, FM12, CDHF9
cadherin, EGF LAG seven-pass G-type receptor 1 (flamingo 




homolog, Drosophi


NTN4

netrin 4


SLC16A6
MCT6
solute carrier family 16 (monocarboxylic acid transporters), 




member 6


C10orf38
FLJ12884
chromosome 10 open reading frame 38


GPR160
GPCR150, GPCR1
G protein-coupled receptor 160


TFF3
HITF
trefoil factor 3 (intestinal)


PIB5PA

phosphatidylinositol(4,5)bisphosphate 5-phosphatase, A


BCL11A
Evi9, BCL11A-XL, BCLI1A-L,
B-cell CLL/lymphoma 11A (zinc finger protein)



BCL11A-S, EVI9



E2F1
RBP3
E2F transcription factor 1


RACGAP1
MgcRacGAP
Rac GTPase activating protein 1


TRIP13

thyroid hormone receptor interactor 13


UBE2T
HSPC150
ubiquitin-conjugating enzyme E2T (putative)


CAPN13
FLJ23523
calpain 13


ACOT4
ACOT4, PLJ31235, PTE-1b,
acyl-CoA thioesterase 4


PRC1

protein regulator of cytokinesis 1


SPDEF
PDEF, bA375E1.3
SAM pointed domain containing ets transcription factor


NAT1

N-acetyltransferase 1 (arylamine N-acetyltransferase)


KIAA1324
maba1
KIAA1324


TSPAN13
NET-6, TMISF13, TSPAN13
tetraspanin 13


MAD2L1
MAD2, HSMAD2
MAD2 mitotic arrest deficient-like 1 (yeast)


NEK2
NLK1, “HsPK 21”
NIMA (never in mitosis gene a)-related kinase 2


NPDC1

neural proliferation, differentiation and control, 1


GPSM2
LGN, Pins
G-protein signalling modulator 2 (AGS3-like, C. elegans)


DLG7
KIAA0008,DLG1, HURP
discs, large homolog 7 (Drosophila)


SLC40A1
MTP1, IREG1, FPN1, HFE4
solute carrier family 40 (iron-regulated transporter), member 1


ORC6L
ORC6
origin recognition complex, subunit 6 homolog-like (yeast)


BCMP11
HAG3, hAG-3
breast cancer membrane protein 11


EXO1
HEX1, hExo1
exonuclease 1


KIF20A
RAB6KIFL
kinesin family member 20A


EPN3
FLJ20778
epsin 3


PTTG1
PTTG, HPTTG, EAP1, securin
pituitary tumor-transforming 1


RERG
MGC15754
RAS-like, estrogen-regulated, growth inhibitor


TMEM25
FLJ14399, 0610039I01Rik
transmembrane protein 25


PHGDH
SERA, PGDH, PDG
phosphoglycerate dehydrogcnase


SLC9A3R1
NHE3
solute carrier family 9 (sodium/hydrogen exchanger), member 3


FAM64A
FLJ10156
family with sequence similarity 64, member A


SEMA3C
SemEA\, SEMAE
sema domain, immunoglobulin domain (Ig), short basic domain,




secreted, (sema


PGR
PR, NR3C3
progesterone receptor


BCL2
Bcl-2
B-cell CLL/lymphoma 2


ABCC3
MRP3, cMOAT2, EST90757,
ATP-binding cassette, sub-family C (CFTR/MRP), member 3



MLP2, MOAT-D



CCND1
BCL1, D11S287E, PRAD1
cyclin D1


CCNE1
CCNE
cyclin E1


CDH1
uvomorulin
cadherin 1, type 1, E-cadherin (epithelial)


EGFR
ERBB
epidermal growth factor receptor (erythroblastic leukemia 




viral (v-erb-b) oncoge


KRT6B
KRTL1
keratin 6B


MYC
c-Myc
v-myc myelocytomatosis vital oncogene homolog (avian)


KRT5

keratin 5 (epidermolysis bullosa simplex, Dowling-Meara/




Kobnerlweber-Cocke


GSTP1

glutathione S-trausferase pi


B3GNT5
B3GN-T5, beta3Gn-T5
UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 5


COX6C

cytochrome c oxidase subunit VIc


FZD7
PzE3
frizzled homolog 7 (Drosophila)


TCEAL1
p21, pp21, SIIR, P21
transcription elongation factor A (SII)-like 1


KIT
PBT, CD117, SCFR, C-Kit
v-kit Hardy-Zuckerman 4 feline sarcoma viral oncogene homolog


KRT17
PCHC1
keratin 17


CDH3
CDHP, PCAD
cadherin 3, type 1, P-cadherin (placental)


GSTM3
GST5
glutathione S-trausferase M3 (brain)


TP53BP2
2 choices: PPP1R13A, TP53BP
TP53BP1 tumor protein p53 binding protein, 1 (15q15-q21) or




TP53BP2 tumor protein p53 bindi


CENPF

centromere protein P, 350/400 ka (mitosin)


TOP2A

topoisomerase (DNA) II alpha 170 kDa


TYMS

thymidylate synthetase


CCNB1
CCNB
cyclin B1


MKI67

antigen identified by monoclonal antibody Ki-67


CLDN3
RVP1, C7orf1, CPETR2
claudin 3


CLDN4
CPE-R, WBSCR8, hCPE-R,
claudin 4



CPETR, CPETR1



CRYAB
HSPB5
crystallin, alpha B


CTPS

CTP synthase


GSDML
PRO2521
gasdermin-like


KRT14
EBS3, EBS4
keratin 14 (epidermolysis bullosa simplex, Dowling-Meara,




Koebner)


KRT19

keratin 19


KRT8

keratin 8


RARRES3

retinoic acid receptor responder (tazarotene induced) 3


TRIM29
ATDC
tripartite motif-containing 29 - Variant 1


VEGF

vascular endothelial growth factor


WIRE
WICH
WIRE protein


YBX1
NSEP1
Y box binding protein 1


















Forward Sequence
Reverse sequence




GENE
Location
(SEQ ID NO: 29-240)
(SEQ ID NO: 242-454)
Source






ACTB
7p15-p12
TTCCTGGGCATGGAGTC
CAGGTCTTTGCGGATGTC
Housekeeper






GAPDH
12p13
TGGAAGGACTCATGACCACA
GGCCATCCACAGTCTTCT
Housekeeper






GUSB
7q22
ACTATGCAGCAGACAAGG
CCGTAGTCGTGATACCAAGA
Housekeeper






RPLP0
12q24.2
TCTACAACCCTGAAGTGCT
GACAGACACTGGCAACA
Housekeeper






TFRC
3q26.2-qter
GACTATTGCTGTGATCGTCT
TTACAATAGCCCAAGTAGCCA
Housekeeper






MRPL19
2p11.2-q11.2
GGAAGAGGACTTGGAGCTACT
TCCTGGACCCGAGGATTAT
Housekeeper






PSMC4
19q13.11-q13.13
ACCTGGCTGTGGGAAGA
GCCCTCACCCAGATACT
Housekeeper






PUM1
1p35.2
CGGGAGATTGCTGGACATATAA
TGGCACGCTCCAGTTTC
Housekeeper






SF3A1
22q12.2
CGATGATGAGGTGTACGC
GCTCAGCCAACTGCTTC
Housekeeper






SLC7A6
16q22.1
TGGCACTCATCTACCTCATC
CCACGAAGAACCAGTAGC
MIP2






S100A11
1q21
GCGCACAGAGCTCTCAG
AGGGCTGGAGATTTTTGC
MIP2/MIP3






ASF1A
6q22.31
TCCACCAGTAAAACCAGACTT
TTGTGACCCTGGGATTAGATG
MIP3






BLVRA
7p14-cen
GCTGGCTGAGCAGAAAG
TTCCTCCATCAAGAGTTCAACA
MIP3






BTG3
21q21.1-q23.2
AGCCGCAAGTCCTGTGTA
GGGTGCCACATTGGAAGA
MIP3






C10orf7
10p13
CCCCAAGGGATGCGTAT
AGATGTCGCTGAGGGTT
MIP3






C16orf45
16p13.2
GCTCAGGTTCATGATGGAT
GATGGCGACCAAGTTCT
MIP3






CAMK2N1
1p36.12
GAGCAAGCGGGTTGTTAT
TCTTTGGGGGAGTTAGACAC
MIP3






CHI3L2
1p13.3
TCCCAAACTGAAAATTCTCTTGTC
GTGATGTAGAAGAATCCACCATAG
MIP3






CSDA
12p13.1
TCGCTCACGGGTCTTAC
TTCATCTCTCCAATCTCACCAG
MIP3






FABP7
6q22-q23
GCAAAATGGTTATGACCCTTAC
CAGGAACATTTTTATGCCTTCTC
MIP3






HEXIM1
17q21.31
TCCGAGGCCAGTAAGTTG
TGTCTCTGGTGCTGTCC
MIP3






ID4
6p22-p21
CCCAACAAGAAAGTCAGCAA
AGGTCCAGGATGTAGTCG
MIP3






IGBP1
Xq13.1-q13.3
TGCCGAAATGTTATCGCAG
TGGTGAGGGCTCCTTGA
MIP3






INPP4B
4q31.21
GGCAGCACTTTCTTCCTACA
GCGGTTCATTTGGAGTCT
MIP3






SLC5A6
2p23
TCTTCAGCGGCTCTCTCA
TCAGGGAACCAAGGTCG
MIP3






TMSB10
2p11.2
TGGCAGACAAACCAGACAT
TCCGCTTCTCCTGCTCA
MIP3






WWP1
8q21
CTTGCTCACTTCCGTTATTTG
CGGGACACATTGATCTTTACA
MIP3






BAG1
9p12
CTGGAAGAGTTGAATAAAGAGC
GCAAATCCTTGGGCAGA
Other






GSTM1
1p13.3
GGACGCTCCTGATTATGAC
AGGGCAGATTGGGAAAG
Other






MMP11
22q11.2
AGGGGTGCCCTCTGAGAT
TCACAGGGTCAAACTTCCAGT
Other






CD68
17p13
GGGCAGAGCTTCAGTTG
CTGGAGCCTCAGGGAGA
Other






C17orf37
17q12
TCTCCAGCCACCTCATAC
TATTACCGAGGCGAAGAGT
Other






TCAP
17q12
GTGGTGCCTGTCAGCAA
CCTCTCAGCCTCTCTGTG
Other






EMSY
11q13.5
GCTCCCAGCTTCTTCAGAGA
GAGGATCCTTGGGTTATAATTGG
Other






IGFBP2
2q33-q34
GAGTGCTGGTGTGTGAAC
TGTAGAAGAGATGACACTCGG
Other






MDM2
12q13-q14
GACTCCAAGCGCGAAAAC
CAGACATGTTGGTATTGCACATT
Other






PTEN
10q23
GGGAAGTAAGGACCAGAGACAA
TCCAGATGATTCTTTAACAGGTAGC
Other






TP53
17p13.1
AGGCCTTGGAACTCAAGGAT
CCCTTTTTGGACTTCAGGTG
Other






CDC6
17q21.3
GTAAATCACCTTCTGAGCCT
ACTTGGGATATGTGAATAAGACC
Other






KIF13B
8p21
GCCCTCTCTGTTTCTCCC
GGATTCAAGTAGGATGCTGC
Other






MUC1
1q21
GATCGTAGCCCCTATGAGAC
ACTGCTGGGTTTGTGTAAG
Other






TK1
17q23.2-q25.3
CAGCTTCTGCACACATGACC
CGTCGATGCCTATGACAGC
Other






CLDN7
17p13
GGGAGACGACAAAGTGAAG
ATACCAGGAGCAAGCTACC
Other






FGFR4
5q33-qter
GATCGTCCTGCAGAATCTC
GGGTCCTCATCATCGTTG
Other






PDSS1
10p12.2
ACTCGGTTGGAGAGACT
GGCTTTCCCTTTCCCAT
Other






AKT3
1q43-44
TGGATTTACCTTATCCCCTCAA
TGGCTTTGGTCGTTCTGTTT
Other






AVEN
15q13.1
GGACCTGAAATCCAAGGAAGAT
CAGTCACAGATGGTTTTGCAC
Other






BCL2A1
15q24.3
AACGTCCAGAGTGCTACA
CCAAGCATGACTTCAGATTC
Other






CA9
9p12
TCAGCCGCTACTTCCAATA
CTCAGCATCACTGTCTGGTTA
Other






CDKN1B
12p13.1-p12
CCCTAGAGGGCAAGTACGAGT
AGTAGAACTCGGGCAAGCTG
Other






CFLAR
2q33-q34
CTCACCGTCCCTGTACCTG
CAGGAGTGGGCGTTTTCTT
Other






FIGF
Xp22.31
ACTCTCATCTCCAGGAACC
CTCGCAACGATCTTCGTC
Other






IGF1
15q25-q26
GCAGTCTTCCAACCCAAT
GAGGACATGGTGTGCATC
Other






KPNA1
3q21
GCTTGGGCCATCACAAAT
CGGCTTGATACAACCCAGTT
Other






KRAS
12p12.1
TGGACGAATATGATCCAACAAT
TCCCTCATTGCACTGTACTCC
Other






LRIG1
3p14
CCAGAATCACTGAAGGGTC
AGGAAGTCATCGCACAC
Other






MAP2
2q34-q35
AACCCTTTGAGAACACGAC
TCTTTCCGTTCATCTGCCA
Other






MAPT
17q21
TGTGGCTCATTAGGCAAC
CTTCGACTGGACTCTGT
Other






PPMID
17q23.3
TTTCTGGCAGTAGCAAGAG
ACTTGTGTCTGGTTCAGG
Other






PTGS2
1q25.2-q25.3
GCTGAAGCCCTATGAATCATTT
TCCAACTCTGCAGACATTTCC
Other






RABEP1
17p13.2
CAGTGGAGAGAAGAAGTTGC
CTGGTGCTCATAGTCACG
Other






RARA
17q12
CAAAGCGCACCAGGAAAC
GTTGTTCTGAGCTGTTGTTCGT
Other






RHOC
1p13.1
GCAGCCTGGGAACTTCAG
CACCAGCTTCTTTCGGATTG
Other






ROPN1
3q21.1
GAGTCGCTTTGTGTAACCG
TGAGAATGCAGGATCTTTAACAG
Other






S100A7
1q21
TGCTGACGATGATGAAGGAG
CGAGGTAATTTGTGCCCTTT
Other






S100A8
1q12-q32
CTGGAGAAAGCCTTGAACT
CTGTAGACGGCATGGAAAT
Other






S100A9
1q21
GTGCGAAAAGATCTGCAAAA
TCAGCTGCTTGTCTGCATTT
Other






SHC1
1q21
GGGGTTTCCTACTTGGTTCG
CCGGGTGTTGAAGTCCAG
Other






TAP1
6p21.3
GCCAGGAGACGGAGTTT
CGTGTCCTCTGTTACCCGA
Other






TP73L
3q27-28)
CACTCTCCATGCCATCCAC
GCCCAACCTCGCTAAGAAA
Other






CKS2
9q22
TGGAGGAGACTTGGTGT
GAATATGTGGTTCTGGCTCA
Proliferation






FAM54A
6q23.2
GTGGAAATGCAGGAACTGAA
GCTCGTCACTCAAGCCAA
Proliferation






GTPBP4
10p15-p14
GGATCATTACAAGTTGGCTCT
CTTCATCAGTCGCACATAATCT
Proliferation






HSPA14
10p14
TGGAATTGGACAAGACTCCC
ACGCTGAGAGATAAGGATG
Proliferation






PCNA
20pter-p12
CCACTCTCTTCAACGGT
AGTGTCCCATATCCGCA
Proliferation






FOXA1
14q12-q13
GCTACTACGCAGACACG
CTGAGTTCATGTTGCTGACC
Top 100 (1)






GATA3
10p15
CATTAAGCCCAAGCGAAGG
TGACAGTTCGCACAGGAC
Top 100 (10)






CDCA1
1q23.1
GGAGGCGGAAGAAACCAG
GGGGAAAGACAAAGTTTCCA
Top 100 (100)






AGR2
7p21.3
TTTGTCCTCCTCAATCTGGTTT
CATAATCCTGGGGACATACTGG
Top 100 (11)






ESR1
6q24-q27
GCAGGGAGAGGAGTTTGT
GACTTCAGGGTGCTGGAC
Top 100 (12)






SCUBE2
11p15.3
GTTCCAGGTCCCATACG
TAGAGCCTGCCATCTCG
Top 100 (13)






BUB1
2p11-q21
GTTTGCGGTTCAGGTTTGG
CATGTGGGCTTCAAGCATC
Top 100 (14)






SLC39A6
18q12.2
TCGAACTGAAGGCTATTTACGAG
CTGCTGAGAATCAAAGTGGGA
Top 100 (15)






UGT8
4q26
AACTCCGAAGCCTCCCTTA
GTGTTTGTGCGCTGAATC
Top 100 (16)






LOC400451
15q26.1
CCAGGGTTTGTGTATTTGC
ACTGAAGAACCGAAGATGG
Top 100 (17)






KNTC2
18p11.31
TGGGTCGTGTCAGGAAAC
CACCGCTGGAAACTGAAC
Top 100 (18)






TMC5
16p13.11
GCCTGGGTTGTCTCTACAGG
CCCCAGGGTTACTGTGTGTC
Top 100 (19)






ERBB2
17q11.2-q12
GCTGGCTCTCACACTGATAG
GCCCTTACACATCGGAGAAC
Top 100 (2)






CA12
15q22
GCAGGTCCAGAAGTTCGATG
CCGCAGTACAGACTTGCACTT
Top 100 (20)






DKFZp762E1312
2q37.1
GCTCCAAGGAGAACTTCATAC
CTTGCAATCTCTTAATGCCC
Top 100 (21)






BIRC5
17q25
GCACAAAGCCATTCTAAGTC
GACGCTTCCTATCACTCTATTC
Top 100 (22)






ANLN
7p15-p14
ACAGCCACTTTCAGAAGCAAG
CGATGGTTTTGTACAAGATTTCTC
Top 100 (23)






CEP5S
10q24.1
CCTCACGAATTTGCTGAACTT
CCACAGTCTGTGATAAACGG
Top 100 (24)






REEP6
19p13.3
CGAGTTCTTCAGCGATCTAC
AGCCATGCAGAACAACAG
Top 100 (25)






ELOVL5
6p21.1-p12.1
CCCTTCCATGCGTCCATA
TGTCAGCACAAACTGAAGCA
Top 100 (27)






TMEM45B
11q24.3
GTCGAAGCCGCAATTAGG
GGAACAAACTGCTCTGCCA
Top 100 (28)






TTK
6q13-q21
GGAGTTTGGGTTCCATCTT
TTCTCTGCCACTTAAATCCTCG
Top 100 (29)






AR
7q35
TGTCCATCTTGTCGTCTTC
CTCCTTCCTCCTGTAGTTTC
Top 100 (3)






CTSL2
9q22.2
GTACCAGTGGAAGGCAAC
ACACTGCTCTCCTCCATC
Top 100 (30)






CENPA
2p24-p21
CTGCACCCAGTGTTTCTGTC
GAGAGTCCCCGGTATCATCC
Top 100 (31)






GALNT7
4q31.1
GCACTGTGCCGCTTATAG
TCGGGCATACCCATCTTC
Top 100 (32)






DNAJC12
10
GAGTCGAGCCCGCTATGA
CAACCCAGTGCATTGACG
Top 100 (33)






MLPH
2q37.2
GTGGAATGCCTGCTGACC
CGCACTCCAGCACCTAGAC
Top 100 (34)






TACSTD1
2p21
AGTTGGTGCACAAAATACTGTCAT
TCCCAAGTTTTGAGCCATTC
Top 100 (35)






CDC20
1p34.1
CTGTCTGAGTGCCGTGGAT
TCCTTGTAATGGGGAGACCA
Top 100 (36)






PIP
7q32-qter
TGCCTATGTGACGACAATCC
GGCTGCAATTTGCACAGTTC
Top 100 (37)






CDCA7
2q31
AAAGAGGAAGACCGTGGATGG
CACTGGGCGAATTATATGCG
Top 100 (38)






MIA
19q13.32-q13.3
CCAGTAGCATTGTCCGAG
CCCATTTGTCTGTCTTCAC
Top 100 (39)






XBP1
5q22.2
CTGGAACAGCAAGTGGTAG
GCCATGAGTTTTCTCTCGT
Top 100 (4)






C4orf34
4p14
TCAAGTAAAATCAAGCTGGGTAATC
TAGGACTGGGACTGCCGTAA
Top 100 (40)






VAV3
1p13.3
ACAAGGGACACTCAAACTAC
TGTTTAGGAGTTCTTCGCAG
Top 100 (41)






GRB7
17q11.2-17q21
CGTGGCAGATGTGAACGA
AGTGGGCATCCCGTAGA
Top 100 (42)






UBE2C
20
TGCCCTGTATGATGTCAGGA
GGGACTATCAATGTTGGGTCTC
Top 100 (43)






PH-4
3p21.31
ACCGACAGGGATCACTTCAT
AGCCGACACTCTTCATCAGTC
Top 100 (44)






ART3
4p14-p15.1
TTGAACCCACCCAAATACCT
GATGCAGAAGGATGGCTTTT
Top 100 (45)






MELK
9p13.1
CCAACAAAATATTCATGGTTCTTG
AGGCGATCCTGGGAAATTAT
Top 100 (46)






CDCA8
1p34.2
TCCTTTCTGAAAGACTTCGACC
CCTGTCTGACTCAATTTGCT
Top 100 (47)






DNALI1
1p35.1
CCGCAGGGAACTCTACTCAC
GGATCTCGTCCCGGACTC
Top 100 (48)






KIAA1370
15q21.2-q21.3
ATGGATCTTGGAGCCAGTTC
ACACAAATGAGCGGACAG
Top 100 (49)






THSD4
15q23
GTGGGAACCATTTGCAGAAG
ATTGCCTGGCAGTTCAACTC
Top 100 (5)






KRT18
12q13
TGATGACACCAATATCACACGA
GGCTTGTAGGCCTTTTACTTCC
Top 100 (51)






MYO5C
15q21
GGCCTACAGCCGAGGATT
GCCTTATGTTCCTCCAGCAT
Top 100 (52)






FBP1
9q22.3
GTGTCCGTTGGAACCAT
CTCAGAAGGCTCATCAGT
Top 100 (53)






CDC45L
22q11.2
GTTTGAGCTGGCTTGGATG
TCTTGTCTTGCACCCACTG
Top 100 (54)






CXXC5
5q31.3
CATGAAATAGTGCATAGTTTGCC
CCATCAACATTCTCTTTATGAACG
Top 100 (55)






FANCA
16q24.3
GCCATCATGGTGTTTGAG
GAAGTGGGACACGTAGTAAG
Top 100 (56)






MYB
6q22-q23
GCTCCTAATGTCAACCGAGAA
AGCTGCATGTGTGGTTCTGT
Top 100 (57)






OGFRL1
6q13
GAGCACAACCACACTTACATTC
GAAGTTCAAGCCTTGTTCTC
Top 100 (58)






KIF2C
1p34.1
GGAGATCCGTCAACTCCAAA
AGTGGACATGCGAGTGGAG
Top 100 (59)






RRM2
2p25-p24
CAGCAAGCGATGGCATAGT
AGCGGGCTTCTGTAATCTGA
Top 100 (59)






FOXC1
6p25
GATGTTCGAGTCACAGAGG
GACAGCTACTATTCCCGTT
Top 100 (6)






SFRP1
8p12-p11.2
AATGCCACCGAAGCCTC
GCCTCAGATTTCAACTCGT
Top 100 (60)






AURKA
20q13.2-20q13.3
TCCAGGCCACTGAATAACAC
TTTGATGCCAGTTCCTCCTC
Top 100 (61)






ACTR3B
7q34
AAAGATTCCTGGGACCTGA
TGGGGCAGTTCTGTATTACTTC
Top 100 (62)






TCF7L1
2p11.2
CCATGAACGCCTCGATGT
GAGCCACCATGTGAGGAGAG
Top 100 (63)






MYBL2
20q13.1
CGAGATCGCCAAGATGTT
GATGGTAGAGTTCCAGTGATT
Top 100 (64)






CELSR1
22q13.3
TGGTGACAGTGGATGATTGTG
CGGTCAGATCCAGGGACTT
Top 100 (64)






NTN4
12q22-12q23
CCAGGCTTCTATCGTGAC
AGTTGGCAGGAAGGACA
Top 100 (66)






SLC16A6
17q24.3
TGGATAATCTCAATCTGTGTGTTTG
CGAAACGATTGCTCAGGACT
Top 100 (67)






C10orf38
10p13
GTGGCGGTTTGACCAGAA
TGGTGCACAAGACCCAGAC
Top 100 (68)






GPR160
3q26.2-q27
TTCGGCTGGAAGGAACC
TATGTGAGTAAGCTCGGAGAC
Top 100 (69)






TFF3
21q22.3
TGCTGGGCTGGTCCTG
GGCACGGCACACTGGTT
Top 100 (7)






PIB5PA
22
AACTTCGCTCCCACCTTC
GCTGGCTTCCGTTTCTTG
Top 100 (70)






BCL11A
2p16.1
CCCAAACAGGAACACATAGCA
GAGCTCCATGTGCAGAACG
Top 100 (71)






E2F1
20q11-20q11
AGACCGTAGGTGGGATCAG
GGTGGTGGTGACACTATGG
Top 100 (72)






RACGAP1
12q13
GCCTTAACAGAGCCTTTATGGA
CAGCTATGCTGTTGTCTTCA
Top 100 (73)






TRIP13
5p15
CTCATGCGCTGTATGTCCA
GTCCACTGCCAGAGACAGG
Top 100 (74)






UBE2T
1q32.1
GTGAGGGGTGTCAGCTCAGT
CACACAGTTCACTGCTCCACA
Top 100 (75)






CAPN13
2p22-p21
TTCCACTCGATTTCCAAGTGA
GTGGAAATTTCTCCCGGAAC
Top 100 (76)






ACOT4
14q24.1
GTATGCTACATGCTTCAACATCC
AGGCCATTGAGAGACAAATATC
Top 100 (77)






PRC1
15q26
ACCATTATGTCTGGGTCAAAGG
TTCTTCCAACCGATCCACTTC
Top 100 (78)






SPDEF
6p21.3
CTGCAAGCTGCTCAACATC
CGGTATTGGTGCTCTGTC
Top 100 (79)






NAT1
8p23.1-p21.3
AGCCTCGAACAATTGAAGA
ACACAGATGATGGAGATGTC
Top 100 (8)






KIAA1324
1p13.3
TTCCTACTCCAATGGCTCAGA
AGCGTGTTCCACCATTTGTA
Top 100 (80)






TSPAN13
7p21.2
GCCATGTGCTCCAATCATAG
GCCAAACACCCAGGATCTC
Top 100 (82)






MAD2L1
4q27
GGTGACATTTCTGCCACTG
GTCCCGACTCTTCCCAT
Top 100 (83)






NEK2
1q32-q42
ACATTTGTTGGCACACCTTA
ATTGTAGGACATGCGATTCA
Top 100 (84)






NPDC1
9q34.3
GCTCTGTGTGCCCAGGAT
GGAAGTCAATCTCATCTTCCAGTC
Top 100 (85)






GPSM2
1p13.3
ATTGACCACCGAATTCCAAA
CAAAGAACCCTTCATCTCCAA
Top 100 (86)






DLG7
14q22.1-q22.3
AAATGCCGGTCCTCAGAATAC
TCCTGCTTTCAGGAATACTC
Top 100 (87)






SLC40A1
2q32
GATTGTTGTTGTTGCAGGAGA
CCTTCGTATTGTGGCATTC
Top 100 (88)






ORC6L
16q12
ATCGACTGTGTAAACAACTAGAGAAGA
AGTAGCTACATCTCCAGGTTCTCTG
Top 100 (89)






BCMP11
7p21.1
TGAAGAAGGTCTCTTTTATGCTCA
TGGGCAAATACTTTCTTTAGTGC
Top 100 (9)






EXO1
1q42-q43
CCCATCCATGTGAGGAAGTATAA
TGTGAAGCCAGCAATATGTATC
Top 100 (90)






KIF20A
5q31
AAGCCACACACAGGTTC
CATCTCCTTCACAGTTAGGTTG
Top 100 (91)






EPN3
17q21.33
CACCTTCGCTTCCAGATG
GCCTATTGTCTCTTGCTGTT
Top 100 (92)






PTTG1
5q35.1
CCTCAGATGATGCCTATCCA
GCAGGTCAAAACTCTCAAAG
Top 100 (93)






RERG
12p13.1
AACTCGCAAACGCAACCT
TCTTGGAAGAGTCCACAATCC
Top 100 (94)






TMEM25
11q23.3
CAAGGTTTCATCCGCCTC
TCATCACTGCTCACGCT
Top 100 (95)






PHGDH
1p12
TGCCGCAGAACTCACTTG
CATTTGCCGTCCTTCATCG
Top 100 (96)






SLC9A3R1
5p15.3
CCAATGGGGAGATACAGAAGG
CACTGGAGGCGGATCTCA
Top 100 (97)






FAM64A
17p13.2
CCATTACGGCGATCAAGG
CCCACAGGCTCTAGGTCACT
Top 100 (98)






SEMA3C
7q21-q31
GACAAAGACAGGAGGAAAGAG
TCCCTGTGAAGTGGCTATTA
Top 100 (99)






PGR
11q22-q23
TTTAAGAGGGCAATGGAAGG
CGGATTTTATCAACGATGCAG
Top 141






BCL2
18q21.3
TACCTGAACCGGCACCTG
GCCGTACAGTTCCACAAAGG
Top 141






ABCC3
17q21
TGCTCTCCTTCATCAATCCA
TGGGGTTGGAGATAAACCTG
Top 141






CCND1

GAAGATCGTCGCCACCTG
GACCTCCTCCTCGCACTTCT
Top 141






CCNE1
19q12
GGCCAAAATCGACAGGAC
GGGTCTGCACAGACTGCAT
Top 141






CDH1
16q22.1
CCACCAAAGTCACGCTGAA
TGCTTGGATTCCAGAAACG
Top 141






EGFR
7p12
ACACAGAATCTATACCCACCAGAGT
ATCAACTCCCAAACGGTCAC
Top 141






KRT6B
12q12-q13
TCGACCACGTCAAGAAGC
GTTCTTAGCATCCTTGAGGG
Top 141






MYC
8q24
AGGCGAACACACAACGTC
TCTGGTCACGCAGGGCAA
Top 141






KRT5
12q
GTTGGACCAGTCAACATCTCTG
GCCATAGCCACTGCCACT
Top 141






GSTP1
11q13-qter
ACCTCACCCTGTACCAGTC
CTGCTGGTCCTTCCCATAG
Top 141






B3GNT5
3q28
CCGGAGCTGCCTATGTAATC
CAGAGGCCCATGAACACATC
Top 141






COX6C
8q22-q23
CATTCGTGCTATCCCTGG
TGTAGAAATCTGCGTATGCC
Top 141






FZD7
2q33
CTGACCCTGTCTCTGTGT
GTTCAAACCTTCCTCTTCGT
Top 141






TCEAL1
Xq22.1
CAACATGGACAAACCACG
CCTCTCCTCATCGGTCT
Top 141






KIT
4q11-q12
ATTCCCAGAGCCCACAATA
ATCCACTGGCAGTACAGAAG
Top 141






KRT17
17q12-17q21
ACTCAGTACAAGAAAGAACCG
GAGGAGATGACCTTGCC
Top 141






CDH3
16q22.1
GACAAGGAGAATCAAAAGATCAGC
ACTGTCTGGGTCCATGGCTA
Top 141






GSTM3
1p13.3
CAAGCTAGACCTGGACT
GCATTGCTCTGGGTGAT
Top 141






TP53BP2

AGGCTCTGCTTCTGTACC
CGGACGCACTTTCTTCTC
Top 141






CENPF
1q32-q41
GTGGCAGCAGATCACAA
GGATTTCGTGGTGGGTTC
Top 141






TOP2A
17q21-q22
CAACATGCCAATTGAGTGAAA
ACTTGGGCTTTAAACTTCACC
Top 141






TYMS
18p11.31-p11.21
CAAACGTGTGTTCTGGAAGG
ACAGCTCTTTAGCATTTGTGGA
Top 141






CCNB1
5q12
CTTTCGCCTGAGCCTATTT
GGGCACATCCAGATGTTT
Top 141






MKI67
10q25-qter
GTCTCTGGTAATGCACACT
CTGATGGTTGAGGCTGTT
Top 141






CLDN3
7q11
CTACGACCGCAAGGACTACG
GTGGTGGTGTTGGTGGTG
Top 141






CLDN4
7q11.23
ATCGGCAGCAACATTGTCA
CACGCAGTTCATCCATAGG
Top 141






CRYAB
11q22.3-q23.1
CAAGGAAACAGGTCTCTGG
GCAGGCTTCTCTTCACG
Top 141






CTPS
1p34.1
TGCCATGTTGAGCCTGA
CAAGGGGACTCGGTAGA
Top 141






GSDML
17q21.2
TGGATTCTGGGCTCCAAG
CAACTCTCCCGTTGAGTC
Top 141






KRT14

CGCAGTCATCCAGAGATGTG
CGTGCACATCCATGACCTT
Top 141






KRT19
17q21-q23
GTCATGGCCGAGCAGAAC
CCGGTTCAATTCTTCAGTCC
Top 141






KRT8
12q13
GATGAACCGGAACATCAGC
CTCCAGGGAAGCCCTCTG
Top 141






RARRES3
11q23
AGCACTTTGTCACCCAG
GCCACACCAACTTCAACC
Top 141






TRIM29
11q22-q23
TGAGATTGAGGATGAAGCTGAG
CATTGGTGGTGAAGCTCTTG
Top 141






VEGF
6p21-p12
AGTGTGTGCCCACTGAGGA
GGTGAGGTTTGATCCGCATA
Top 141






WIRE
17q21.2
CAACATTAATGATCGGAGTGCT
CTCCTCCAGAGCCATAGCC
Top 141






YBX1
1p34
CAGTATTCCAACCCTCCTGTG
GTTCTCCTGCACCCTGGTT
Top 141






text missing or illegible when filed indicates data missing or illegible when filed







G. REFERENCES



  • Akilesh S, Shaffer D J, Roopenian D. “Customized molecular phenotyping by quantitative gene expression and pattern recognition analysis” Genome Res 13:1719-1727 (2003).

  • Bair, E., and Tibshirani, R. “Semi-supervised methods to predict patient survival from gene expression data” PLoS Biol 2:E108 (2004).

  • Bloom, H. J. G., and Richardson, W. W. “Histologic grading and prognosis in breast cancer” British Journal of Cancer 9:359-377 (1957).

  • Benito, M., Parker, J., Du, Q., Wu, J., Xiang, D., Perou, C. M., and Marron, J. S. “Adjustment of systematic microarray data biases” Bioinformatics 20:105-114 (2004).

  • Bhatia P, Taylor W R, Greenberg A H, Wright J A. “Comparison of glyceraldehyde-3-phosphate dehydrogenase and 28S-ribosomal RNA gene expression as RNA loading controls for northern blot analysis of cell lines of varying malignant potential” Anal Biochem 216:223-226 (1994).

  • Bullinger, L., Dohner, K., Bair, E., Frohling, S., Schlenk, R. F., Tibshirani, R., Dohner, H., and Pollack, J. R. “Use of gene-expression profiling to identify prognostic subclasses in adult acute myeloid leukemia” N Engl J Med 350:1605-1616 (2004).

  • Buzdar, A., O'Shaughnessy, Booser, D. J., Pippen, J. E., Jr., Jones, S. E., Munster, P. N., Peterson, P., Melemed, A. S., Winer, E., and Hudis, C. “Phase II, randomized, double-blind study of two dose levels of arzoxifene in patients with locally advanced or metastatic breast cancer” J Clin Oncol 21:1007-1014 (2003).

  • Caly, M., Genin, P., Ghuzlan, A. A., Elie, C., Freneaux, P., Klijanienko, J., Rosty, C., Sigal-Zafrani, B., Vincent-Salomon, A., Douggaz, A., et al. “Analysis of correlation between mitotic index, MIB1 score and S-phase fraction as proliferation markers in invasive breast carcinoma. Methodological aspects and prognostic value in a series of 257 cases” Anticancer Res 24:3283-3288 (2004).

  • Chia, S. K., Speers, C. H., Bryce, C. J., Hayes, M. M., and Olivotto, I. A. “Ten-year outcomes in a population-based cohort of node-negative, lymphatic, and vascular invasion-negative early breast cancers without adjuvant systemic therapies” J Clin Oncol 22:1630-1637 (2004).

  • Clark G. M., Allred, D. C., Hilsenbeck, S. G., Charmless, G. C., Osborne, C. K., Jones, D., and Lee, W. H. “Mitosin (a new proliferation marker) correlates with clinical outcome in node-negative breast cancer” Cancer Res 57:5505-5508 (1997).

  • Cronin, M., Pho, M., Dutta, D., Stephans, Shak, S., Kiefer, M. C., Esteban, S. M., and Baker, J. B. “Measurement of gene expression in archival paraffin-embedded tissues: development and performance of a 92-gene reverse transcriptase-polymerase chain reaction assay” Am J Pathol 164:35-42 (2004).

  • Dalton, L. W., Page, D. L., and Dupont, W. D. “Histologic grading of breast carcinoma. A reproducibility study” Cancer 73:2765-2770 (1994).

  • Dhanasekaran S M, Barrette T R, Ghosh D, Shah R, Varambally S, Kurachi K, Pienta K J, Rubin M A, Chinnaiyan A M. “Delineation of prognostic biomarkers in prostate cancer” Nature 412:822-826 (2001).

  • Diehn, M., Sherlock, G., Binkley, G., Jin, H., Matese, J. C., Hernandez-Boussard, T., Rees, C. A., Cherry, J. M., Botstein, D., Brown, P. O., et al. “SOURCE: a unified genomic resource of functional annotations, ontologies, and gene expression data” Nucleic Acids Res 31:219-223 (2003).

  • Dudoit, S., and Fridlyand, I. “A prediction-based resampling method for estimating the number of clusters in a dataset” Genome Biol 3:RESEARCH0036 (2002).

  • Efron, B., Tibshirani, R. J. “An Introduction to the Bootstrap” Boca Raton, Fla.: CRC Press LLC. p 247 pp (1998).

  • Eggert A, Brodeur G M, Ilcegaki N. “Relative quantitative R T-PCR protocol for TrkB expression in neuroblastoma using GAPD as an internal control” Biotechniques 28:681-682, 686, 688-691 (2000).

  • Eisen, M. B., Spellman, P. T., Brown, P. O., and Botstein, D. “Cluster analysis and display of genome-wide expression patterns” Proc Natl Acad Sci USA 95:14863-14868 (1998).

  • Elston, C. W., and Ellis, L O. “Pathological prognostic factors in breast cancer. L The value of histological grade in breast cancer: experience from a large study with long-term follow-up” Histopathology 19:403-410 (1991).

  • Fisher, E. R., Osborne, C. K., McGuire, W. L., Redmond, C., Knight, W. A., 3rd, Fisher, B., Bannayan, G., Walder, A., Gregory, E L, Jacobsen, A., et al. “Correlation of primary breast cancer histopathology and estrogen receptor content” Breast Cancer Res Treat 1:37-41 (1981).

  • Fisher, B., Costantino, I., Redmond, C., Poisson, R., Bowman, D., Couture, J., Dimitrov, N. V., Wolmark, N., Wickerham, D. L., Fisher, E. R., et al. “A randomized clinical trial evaluating tamoxifen in the treatment of patients with node-negative breast cancer who have estrogen-receptor-positive tumors” N Engl I Med 320:479-484 (1989).

  • Fitzgibbons, P. L., Page, D. L., Weaver, D., Thor, A D., Allred, D. C., Clark, G. M., Ruby, S. G., O'Malley, F., Simpson, J. F., Connolly, J. L., et al. “Prognostic factors in breast cancer. College of American Pathologists Consensus Statement 1999” Arch Pathol Lab Med 124:966-978 (2000).

  • Frank S G, Bernard, P. S. “Profiling Breast Cancer using Real-Time Quantitative PCR. In Rapid Cycle Real-Time PCR: Methods and Applications” Edited by S. Meuer W, C., Nakagawara, K. Heidelberg, Germany, Springer pp 95-106 (2003).

  • Frierson, H. F., Jr., Wolber, R. A., Berean, K. W., Franquemont, D. W., Gaffey, M. J., Boyd, I C., and Wilbur, D. C. “Interobserver reproducibility of the Nottingham modification of the Bloom and Richardson histologic grading scheme for infiltrating ductal carcinoma” Am J Clin Pathol 103:195-198 (1995).

  • Genestie, C., Zafrani, B., Asselain, B., Fourquet, A., Rozan, S., Validire, P., Vincent-Salomon, A., and Sastre-Garau, X. “Comparison of the prognostic value of Scarff-Bloom-Richardson and Nottingham histological grades in a series of 825 cases of breast cancer: major importance of the mitotic count as a component of both grading systems” Anticancer Res 18:571-576 (1998).

  • Greenough, R. B. “Varying degrees of malignancy in cancer of the breast” J Cancer Res 9:452-463 (1925).

  • Gruvberger, S., Ringner, M., Chen, Y., Panavally, S., Saal, L. H., Borg, A., Ferno, M., Peterson, C., and Meltzer, P. S. “Estrogen receptor status in breast cancer is associated with remarkably distinct gene expression patterns” Cancer Res 61:5979-5984 (2001).

  • Henson, D. E., Ries, L., Freedman, L. S., and Carriaga, M. “Relationship among outcome, stage of disease, and histologic grade for 22,616 cases of breast cancer. The basis for a prognostic index” Cancer 68:2142-2149 (1991),

  • Ishida, S., Huang E., Zuzan, H., Spang, R., Leone, G., West, M., and Nevins, J. R. “Role for E2F in control of both DNA replication and mitotic functions as revealed from DNA microarray analysis” Mol Cell Biol 21:4684-4699 (2001).

  • Iwahashi, H., Eguchi, Y., Yasuhara, N., Hanafusa, T., Matsuzawa, Y., and Tsujimoto, Y. “Synergistic anti-apoptotic activity between Bcl-2 and SMN implicated in spinal muscular atrophy” Nature 390:413-417 (1997).

  • Kollias, J., Murphy, C. A., Elston, C. W., Ellis, I. O., Robertson, J. F., and Blarney, R. W. “The prognosis of small primary breast cancers” Eur J Cancer 35:908-912 (1999).

  • Kristt D, Turner I, Koren R, Ramadan E, Gal R. “Overexpression of cyclin D1 mRNA in colorectal carcinomas and relationship to clinicopathological features: an in situ hybridization analysis” Pathol Oncol Res 6:65-70 (2000).

  • Laping, N. J., Olson, B. A., and Zhu, Y. “Identification of a novel nuclear guanosine triphosphate-binding protein differentially expressed in renal disease” J Am Soc Nephrol 12:883-890 (2001).

  • Manders, P., Bult, P., Sweep, C. G., Tjan-Heijnen, V. C., and Beex, L. V. “The prognostic value of the mitotic activity index in patients with primary breast cancer who were not treated with adjuvant systemic therapy” Breast Cancer Res Treat 77:77-84 (2003).

  • Makretsov, N. A., Huntsman, D. G., Nielsen, T. O., Yorida, E., Peacock, M., Cheang M. C., Dunn, S. E., Hayes, M., van de Rijn, M., Bajdik, C., et al. “Hierarchical clustering analysis of tissue microarray immunostaining data identifies prognostically significant groups of breast carcinoma” Clin Cancer Res 10:6143-6151 (2004).

  • Michels, J. J., Marnay, J., Delozier, T., Denoux, Y., and Chasle, J. “Proliferative activity in primary breast carcinomas is a salient prognostic factor” Cancer 100:455-464 (2004).

  • Miller C L, Yolken R H. “Methods to optimize the generation of cDNA from postmortem human brain tissue” Brain Res Brain Res Protoc 10:156-167 (2003).

  • Mischel P S, Nelson S F, Cloughesy T F. “Molecular analysis of glioblastoma: pathway profiling and its implications for patient therapy” Cancer Biol Ther 2:242-247 (2003).

  • Nielsen, T. O., Hsu, F. D., Jensen, K., Cheang, M., Karaca, G., Hu, Z., Hernandez-Boussard, T., Livasy, C., Cowan, D., Dressler, L., et al. “Immunohistochemical and clinical characterization of the basal-like subtype of invasive breast carcinoma” Clin Cancer Res 10:5367-5374 (2004).

  • Paik, S., Shak, S., Tang, G., Kim, C., Baker, J., Cronin, M., Baehner, F. L., Walker, M. G., Watson, D., Park, T., et al. “A multigene assay to predict recurrence of tamoxifen-treated, node-negative breast cancer” N Engl J Med 351:2817-2826 (2004).

  • Panaro N J, Yuen P K, Sakazume T, Fortina P, Kxicka L T, Wilding P. “Evaluation of DNA fragment sizing and quantification by the agilent 2100 bioanalyzer” Clin Chem 46:1851-1853 (2000).

  • Perou C M, Sorlie T, Eisen M B, van de Rijn M, Jeffrey S S, Rees C A, Pollack J R, Ross D T, Johnsen H, Akslen L A, Fluge O, Pergamenschikov A, Williams C, Zhu S X, Lonning P E, Borresen-Dale A L, Brown P O, Botstein D. “Molecular portraits of human breast tumours” Nature 406:747-752 (2000).

  • Perou C M, Brown P O, Botstein D. “Tumor classification using gene expression patterns from DNA microarrays” New Technologies for life sciences: A Trends Guide pp 67-76 (2000).

  • Perou, C. M., Jeffrey, S. S., van de Rijn, M., Rees, C. A., Eisen, M. B., Ross, D. T., Pergamenschikov, A., Williams, C. F., Zhu, S. X., Lee, I C., et al. “Distinctive gene expression patterns in human mammary epithelial cells and breast cancers” Proc Natl Acad Sci USA 96:9212-9217 (1999).

  • Pinheiro J C B D. “Mixed-effects models in S and S-PLUS” New York, Springer (2000).

  • Pollack, J. R., Sorlie, T., Perou, C. M., Rees, C. A., Jeffrey, S. S., Lonning, P. E., Tibshirani, R., Botstein, D., Borresen-Dale, A. L., and Brown, P. O. “Microarray analysis reveals a major direct role of DNA copy number alteration in the transcriptional program of human breast tumors” Proc Natl Acad Sci USA 99:12963-12968 (2002).

  • Pollack, J. R., Perou, C. M., Alizadeh, A. A., Eisen, M. B., Pergamenschikov, A., Williams, C. F., Jeffrey, S. S., Botstein, D., and Brown, P. O. “Genome-wide analysis of DNA copy-number changes using cDNA microarrays” Nature Genetics 23:41-46 (1999).

  • Rasmussen R P. “Quantification on the LightCycler. In Rapid Cycle Real-Time PCR: Methods and Applications” Edited by Wittwer C T, Meuer, S., Nakagawara, K. Heidelberg, Springer Verlag, pp 21-34 (2001).

  • Robbins, P., Pinder, S., de Klerk, N., Dawkins, H., Harvey, J., Sterrett, G., Ellis, I., and Elston, C. “Histological grading of breast carcinomas: a study of interobserver agreement” Hum Pathol 26:873-879 (1995).

  • Ross, D. T., Scherf, U., Eisen, M. B., Perou, C. M., Rees, C., Spellman, P., Iyer, V., Jeffrey, S. S., Van de Rijn, M., Waltham, M., et al. “Systematic variation in gene expression patterns in human cancer cell lines [see comments]” Nat Genet 24:227-235 (2000).

  • Roux S, Pichaud F, Quinn J, Lalande A, Morieux C, Jullienne A, de Vernejoul M C. “Effects of prostaglandins on human hematopoietic osteoclast precursors” Endocrinology 138:1476-1482 (1997).

  • SantaLucia J. “A unified view of polymer, dumbbell, and oligonucleotide DNA nearest-neighbor thermodynamics” Proc Natl Acad Sci USA 95:1460-1465 (1998).

  • Schena M, Shalon D, Davis R W, Brown P O. “Quantitative monitoring of gene expression patterns with a complementary DNA microarray” Science 270:467-470 (1995).

  • Schwarz G. “Estimating the dimension of a model” The Annals of Statistics 6:461-464 (1978).

  • Singletary, S. E., Allred, C., Ashley, P., Bassett, L. W., Berry, D., Bland, K. I., Borgen, P. I., Clark, G. M., Edge, S. B., Hayes, D. F., et al. “Staging system for breast cancer” revisions for the 6th edition of the AJCC Cancer Staging Manual. Surg Clin North Am 83:803-819 (2003).

  • Sorlie, T., Tibshirani, R., Parker, J., Hastie, T., Marron, J. S., Nobel, A., Deng, S., Johnsen, H., Pesich, R., Geisler, S., et al. “Repeated observation of breast tumor subtypes in independent gene expression data sets” Proc Natl Acad Sci USA 100:8418-8423 (2003).

  • Sørlie, T., Perou, C. M., Tibshirani, R., Aas, T., Geisler, S., Johnsen, H., Hastie, T., Eisen, M. B., van de Rijn, M., Jeffrey, S. S., et al. “Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications” Proc Natl Acad Sci USA 98:10869-10874 (2001).

  • Sotiriou, C., Neo, S. Y., McShane, L. M., Korn, E. L., Long, P. M., Jazaeri, A., Martiat, P., Fox, S. B., Harris, A. L., and Liu, E. T. “Breast cancer classification and prognosis based on gene expression profiles from a population-based study” Proc Natl Acad Sci USA 100:10393-10398 (2003).

  • Spanakis E. “Problems related to the interpretation of autoradiographic data on gene expression using common constitutive transcripts as controls” Nucleic Acids Res 21:3809-3819 (1993).

  • Suzuki T, Higgins P J, Crawford D R. “Control selection for RNA quantitation” Biotechniques 29:332-337 (2000).

  • Szabo, A., Perou, C. M., Karaca, M., Perreard, L., Quackenbush, U., and Bernard, P. S. “Statistical modeling for selecting housekeeper genes” Genome Biol 5:R59 (2004).

  • Taylor-Papadimitriou, J., Stampfer, M., Bartek, J., Lewis, A., Boshell, M., Lane, E. B., and Leigh, I. M. “Keratin expression in human mammary epithelial cells cultured from normal and malignant tissue: relation to in vivo phenotypes and influence of medium” J Cell Sci 94:403-113 (1989).

  • Troyanskaya, O., Cantor, M., Sherlock, G., Brown, P., Hastie, T., Tibshirani, R., Botstein, D., and Altman, R. B. “Missing value estimation methods for DNA microarrays” Bioinformatics 17:520-525 (2001).

  • Tubbs R R, Pettay J D, Roche P C, Stoler M H, Jenkins R B, Grogan T M. “Discrepancies in clinical laboratory testing of eligibility for trastuzumab therapy: apparent immunohistochemical false-positives do not get the message” J Clin Oncol 19:2714-2721 (2001).

  • van de Vijver M J, He Y D, van't Veer Li, Dai H, Hart A A, Voskuil D W, Schreiber G J, Peterse J L, Roberts C, Marton M I, Parrish M, Atsma D, Witteveen A, Glas A, Delahaye L, van der Velde T, Bartelink H, Rodenhuis S, Rutgers E T, Friend S H, Bernards R. “A gene-expression signature as a predictor of survival in breast cancer” N Engl I Med 347:1999-2009 (2002).

  • van't Veer, L. I., Dai, H., van de Vijver, M. J., He, Y. D., Hart, A. A., Mao, M., Peterse, H. L., van der Kooy, K., Marton, M. J., Witteveen, A T., et al. “Gene expression profiling predicts clinical outcome of breast cancer” Nature 415:530-536 (2002).

  • Vandesompele I, De Preter K, Pattyn F, Poppe B, Van Roy N, De Paepe A, Speleman F. “Accurate normalization of real-time quantitative R T-PCR data by geometric averaging of multiple internal control genes” Genome Biol 3:RESEARCH0034 (2002).

  • Welsh J B, Zarrinkar P P, Sapinoso L M, Kern S G, Behling C A, Monk B J, Lockhart D I, Burger R A, Hampton G M. “Analysis of gene expression profiles in normal and neoplastic ovarian tissue samples identifies candidate molecular markers of epithelial ovarian cancer” Proc Natl Acad Sci USA 98:1176-1181 (2001).

  • West, M., Blanchette, C., Dressman, H., Huang, E., Ishida, S., Spang, R., Zuzan, H., Olson, L A., Jr., Marks, J. R., and Nevins, J. R. “Predicting the clinical status of human breast cancer by using gene expression profiles” Proc Natl Acad Sci USA′98:11462-11467 (2001).

  • Whitfield, M. L., Sherlock, G., Saldanha, A. J., Murray, J. I., Ball, C. A., Alexander, K. E., Matese, J. C., Perou, C. M., Hurt, M. M., Brown, P. O., et al. “Identification of genes periodically expressed in the human cell cycle and their expression in tumors” Mol Biol Cell 13:1977-2000 (2002).

  • Wittwer C T, a.K., N. “Real-time PCR. In Molecular Microbiology” T. Persing D H, F C, Versalovic, I, Tang, Y W, Unger, E R, Reiman, D A, and White, T I, editor. Washington, D.C.: ASM Press (2004).

  • Yang, Y. H., Dudoit, S., Luu, P., Lin, D. M., Peng, V., Ngai, I., and Speed, T. P. “Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation” Nucleic Acids Res 30:e15 (2002).

  • Yu, K., Lee, C. H., Tan, P. H., and Tan, P. “Conservation of breast cancer molecular subtypes and transcriptional patterns of tumor progression across distinct ethnic populations” Clin Cancer Res 10:5508-5517 (2004).


Claims
  • 1. A method of diagnosing cancer, the method comprising comparing expression levels of a combination of genes from Table 21 to test nucleic acids, wherein specific expression patterns of the test nucleic acids indicates a cancerous state.
  • 2. The method of claim 1, wherein the combination of genes includes at least 10 genes from Table 21.
  • 3. The method of claim 1, wherein the combination of genes includes at least 25 genes from Table 21.
  • 4. The method of claim 1, wherein the combination of genes includes at least 50 genes from Table 21.
  • 5. The method of claim 1, wherein the combination of genes includes at least 75 genes from Table 21.
  • 6. A method of quantitating level of expression of a test nucleic acid comprising: a) comparing gene expression levels of a combination of genes from Table 21 to test nucleic acids corresponding to the same combination of genes; and b) quantitating level of expression of the test nucleic acid.
  • 7. A method determining prognosis based on expression patterns in a subject diagnosed with cancer comprising: a) comparing expression levels of a combination of genes from Table 21 to test nucleic acids corresponding to the same combination of genes, b) identifying a subtype of cancer of the subject, and c) determining prognosis based on expression patterns in the subject.
  • 8. A method of classifying cancer in a subject, comprising: a) identifying intrinsic genes of the subject to be used to classify the cancer; b) obtaining a sample from the subject; c) amplifying and detecting levels of intrinsic genes in the subject; and d) classifying cancer based upon results of step c.
  • 9. A method of diagnosing cancer in a subject the method comprising: a) amplifying and detecting intrinsic genes; and b) diagnosing cancer based on expression levels of the gene within the subject.
  • 10. A method of deriving a minimal intrinsic gene set for making biological classifications of cancer comprising: a) collecting data from multiple samples from the same individual to identify potential intrinsic classifier genes;b) weighting intrinsic classifier genes of multiple individuals identified using the method of step a relative to each other and forming classification dusters;c) estimating the number of clusters formed in step b) and assigning individual samples to classification clusters;d) identifying genes that optimally distinguish the samples in the assigned groups of step c);e) performing iterative cross-validation with a nearest centroid classifier and overlapping gene sets of various sizes using the genes identified in step d); andf) choosing a gene set which provides the highest class prediction accuracy when compared to the classifications made in step b).
  • 11. The method of claim 10, wherein the cancer is selected from the group consisting of breast cancer, colon cancer, or melanoma.
  • 12. The method of claim 1, wherein the genes are derived from fresh samples.
  • 13. The method of claim 1, wherein the genes are derived from formalin-fixed paraffin embedded (FFPE) samples.
  • 14. The method of claim 10, wherein sample comprises mRNA.
  • 15. The method of claim 10, wherein the sample is amplified by PCR.
  • 16. The method of claim 15, wherein the PCR is real time PCR.
  • 17. The method of claim 11, wherein the breast cancer is classified into luminal, normal-like, HER2+/ER−, and basal-like.
  • 18. The method of claim 10, wherein the intrinsic gene set is identified using a microarray.
  • 19. The method of claim 10, wherein the intrinsic gene set is modified from a microarray.
  • 20. The method of claim 19, wherein the intrinsic gene set includes at least one housekeeper gene.
  • 21. A method of assigning a sample to an intrinsic subtype, comprising: a) creating an intrinsic subtype average profile (centroid) for each subtype;b) individually comparing a new sample to each centroid; andc) assigning the new sample to the centroid that is most similar to an expression profile of the new sample.
RELATED APPLICATIONS

This application is a continuation of U.S. Ser. No. 12/094,898, filed Mar. 13, 2009, which is §371 NATL phase entry of PCT/US2006/044737, which claims priority to, and the benefit of, U.S. Ser. No. 60/739,155, filed Nov. 23, 2005. The contents of each of these applications are incorporated by reference in their entireties.

I. ACKNOWLEDGEMENTS

This work was supported in part by the National Cancer Institute (P50-CA58223-11 and R33 CA097769-01 and U01 CA114722. The United States Government may have certain rights in the inventions disclosed herein.

Provisional Applications (1)
Number Date Country
60739155 Nov 2005 US
Continuations (1)
Number Date Country
Parent 12094898 Mar 2009 US
Child 13959575 US