Gene expression profiling of primary breast carcinomas using arrays of candidate genes

Description

FIELD OF THE INVENTION

[0002] This invention relates to polynucleotide analysis and, in particular, to polynucleotide expression profiling of carcinomas using arrays of candidate polynucleotides.

BACKGROUND

[0003] Pathologists and clinicians in charge of the management of breast cancer patients are facing two major problems, namely the extensive heterogeneity of the disease and the lack of factors—among conventional histological and clinical features—predicting with reliability the evolution of the disease and its sensitivity to cancer therapies. Breast tumors of the same apparent prognostic type vary widely in their responsiveness to therapy and consequent survival of the patient. New prognostic and predictive factors are needed to allow an individualization of therapy for each patient.

[0004] Great hope is currently being placed on molecular studies, which address the problem in a global fashion. Methods such as cytogenetics, comparative genomic hybridization, and whole-genome allelotyping have addressed the issue at the genome level. Currently, the modifications that take place in human tumors at the level of transcription can also be studied in a large, unprecedented scale, using new methods such as cDNA arrays that allow quantitative measurement of the mRNA expression levels of many genes simultaneously. Thus, it would be advantageous to provide a means to assess the capacity of cDNA array testing-in clinical practice to better classify an heterogeneous cancer into tumor subtypes with more homogeneous clinical outcomes, and to identify new potential prognostic factors and therapeutics targets.

SUMMARY OF THE INVENTION

[0005] The invention relates to a polynucleotide library useful in the molecular characterization of a carcinoma, the library including a pool of polynucleotide sequences or subsequences thereof wherein the sequences or subsequences are either underexpressed or overexpressed in tumor cells, further wherein the sequences or subsequences correspond substantially to any of the polynucleotide sequences set forth in any of SEQ ID NOS: 1-468 or the complement thereof.

BRIEF DESCRIPTION OF THE DRAWINGS

[0006]
FIG. 1 shows an example of differential gene expression between normal breast tissue (NB) and breast tumor samples.

[0007]
FIG. 2 is a representation of expression levels of 176 genes in normal breast tissue (NB) and 34 samples of breast carcinoma.

[0008]
FIG. 3 is prognostic classification of breast cancer by gene expression profiling.

[0009]
FIG. 4 shows the correlation of GATA3 (SEQ ID NO: 78) expression with ER phenotype.

DETAILED DESCRIPTION OF THE INVENTION

[0010] In the context of this disclosure, a number of terms shall be utilized.

[0011] The term “polynucleotide” refers to a polymer of RNA or DNA that is single-stranded, optionally containing synthetic, non-natural or altered nucleotide bases. A polynucleotide in the form of a polymer of DNA may be comprised of one or more segments of cDNA, genomic DNA or synthetic DNA.

[0012] The term “subsequence” refers to a sequence of nucleic acids that comprises a part of a longer sequence of nucleic acids.

[0013] The term “immobilized on a support” means bound directly or indirectly thereto including attachment by covalent binding, hydrogen bonding, ionic interaction, hydrophobic interaction or otherwise.

[0014] Breast cancer is characterized by an important histoclinical heterogeneity that currently hampers the selection of the most appropriate treatment for each case. This problem could be solved by the identification of new parameters that better predict the natural history of the disease and its sensitivity to treatment. An important object of the present invention relates to a large-scale molecular characterization of breast cancer that could help in prediction, prognosis and cancer treatment.

[0015] An important aspect of the invention relates to the use of cDNA arrays, which allows quantitative study of mRNA expression levels of 188 candidate genes in 34 consecutive primary breast carcinomas in three areas: comparison of tumor samples, correlations of molecular data with conventional histoclinical prognostic features and gene correlations. The experimentation evidenced extensive heterogeneity of breast tumors at the transcriptional level. Hierarchical clustering algorithm identified two molecularly distinct subgroups of tumors characterized by a different clinical outcome after chemotherapy. This outcome could not have been predicted by the commonly used histoclinical parameters. No correlation was found with the age of patients, tumor size, histological type and grade. However, expression of genes was differential in tumors with lymph node metastasis and according to the estrogen receptor status; ERBB2 (SEQ ID No: 119) expression was strongly correlated with the lymph node status (p≦0.0001) and that of GATA3 (SEQ ID No: 78) with the presence of estrogen receptors (p≦0.001). Thus, experimental results identified new ways to group tumors according to outcome and new potential targets of carcinogenesis. They show that the systematic use of cDNA array testing holds great promise to improve the classification of breast cancer in terms of prognosis and chemosensitivity and to provide new potential therapeutic targets.

[0016] DNA arrays consist of large numbers of DNA molecules spotted in a systematic order on a solid support or substrate such as a nylon membrane, glass slide, glass beads, a membrane on a glass support, or a silicon chip. Depending on the size of each DNA spot on the array, DNA arrays can be categorized as microarrays (each DNA spot has a diameter less than 250 microns) and macroarrays (spot diameter is greater than 300 microns). When the solid substrate used is small in size, arrays are also referred to as DNA chips. Depending on the spotting technique used, the number of spots on a glass microarray can range from hundreds to thousands.

[0017] DNA microarrays serve a variety of purposes, including gene expression profiling, de novo gene sequencing, gene mutation analysis, gene mapping and genotyping. cDNA microarrays are printed with distinct cDNA clones isolated from cDNA libraries. Therefore, each spot represents an expressed gene, since it is derived from a distinct mRNA.

[0018] Typically, a method of monitoring gene expression involves (1) providing a pool of sample polynucleotides comprising RNA transcript(s) of one or more target gene(s) or nucleic acids derived from the RNA transcript(s); (2) reacting, such as hybridizing the sample polynucleotide to an array of probes (for example, polynucleotides obtained from a polynucleotide library) (including control probes) and (3) detecting the reacted/hybridized polynucleotides. Detection can also involve calculating/quantifying a relative expression (transcription) level.

[0019] The present invention concerns a polynucleotide library useful in the molecular characterization of a carcinoma, said library comprising a pool of polynucleotide sequences or subsequences thereof wherein said sequences or subsequences are either underexpressed or overexpressed in tumor cells, flrher wherein said sequences or subsequences correspond substantially to any of the polynucleotide sequences set forth in any of SEQ ID Nos: 1-468 in annex or the complement thereof.

[0020] Obviously, sequences having a great degree of homology with the above sequences could also be used to realize the molecular characterization of the invention, namely when those sequences present one or a few punctual mutations when compared with any one of the sequences represented by SEQ ID Nos: 1-468.

[0021] A particular embodiment of the invention relates to a polynucleotide library of sequences or subsequences corresponding substantially to any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets 1 to 188 as defined in table 4.

[0022] A polynucleotide sequence library useful for the realization of the invention can comprise also any sequence comprised between 3′ end and 5′ end of each polynucleotide sequence set as defined in table 4, allowing the complete detection of the implicated gene.

[0023] The invention relates also to a polynucleotide library useful to differentiate a normal cell from a cancer cell wherein the pool of polynucleotide sequences or subsequences correspond substantially to any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequences sets indicated in table 5, useful in differentiating a normal cell from a cancer cell.

[0024] Preferably the polynucleotide library useful to differentiate a normal cell from a cancer cell corresponds substantially to any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets indicated in table 5A, and of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets indicated in table 5B.

[0025] The detection of an overexpression of genes identified with sets of polynucleotide sequences defined in table 5A, together with detection of an underexpression of genes identified with sets of polynucleotide sequences defined in table 5B allows distinction between normal patients and patients suffering from tumor pathology.

[0026] The invention relates also to a polynucleotide library useful to detect a hormone-sensitive tumor cell wherein the pool of polynucleotide sequences or subsequences correspond substantially to any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets defined in table 6.

[0027] Preferably the polynucleotide library useful to detect a hormone-sensitive tumor cell correspond substantially to any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets defined in table 6A together with at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets defined in table 6B.

[0028] The detection of an overexpression of genes identified with sets of polynucleotides sequences defined in table 6A, together with detection of an underexpression of genes identified with sets of polynucleotides sequences defined in table 6B allows distinction between patients having a hormone-sensitive tumor and patients having a hormone-resistant tumor.

[0029] The invention also concerns a polynucleotide library useful to differentiate a tumor in which a lymph node has been invaded by a tumor cell from a tumor in which a lymph node has not been invaded by a tumor cell wherein the pool of polynucleotide sequences or subsequences correspond substantially to any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets defined in table 7.

[0030] Preferably, the polynucleotide library useful to differentiate a tumor in which a lymph node has been invaded by a tumor cell from a tumor in which a lymph node has not been invaded by a tumor cell correspond substantially to any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets defined in table 7A together with at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets defined in table 7B.

[0031] The detection of an overexpression of genes identified with sets of polynucleotide sequences defined in table 7A, together with detection of an underexpression of genes identified with sets of polynucleotide sequences defined in table 7B allows distinction between patients having a tumor in which a lymph node has been invaded by a tumor cell and patients having a tumor in which a lymph node has not been invaded by a tumor cell.

[0032] The invention concerns also a polynucleotide library useful to differentiate anthracycline-sensitive tumors from anthracycline-insensitive tumors wherein the pool of polynucleotide sequences or subsequences correspond substantially to any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets defined in table 8.

[0033] Preferably, the polynucleotide library useful to differentiate anthracycline-sensitive tumors from anthracycline-insensitive tumors correspond substantially to any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets defined in table 8A together with at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets defined in table 8B.

[0034] The detection of an overexpression of genes identified with sets of polynucleotide sequences defined in table 8A, together with detection of an underexpression of genes identified with sets of polynucleotide sequences defined in table 8B allows distinction between patients having an anthracycline-sensitive tumor from patients having an anthracycline-insensitive tumor.

[0035] The invention also concerns a polynucleotide library useful to classify good and poor prognosis primary breast tumors wherein the pool of polynucleotide sequences or subsequences correspond substantially to any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets defined in table 9.

[0036] Preferably, the polynucleotide library useful to classify good and poor prognosis primary breast tumors correspond substantially to any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets defined in table 9A together with at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets defined in table 9B.

[0037] The detection of an overexpression of genes identified with sets of polynucleotide sequences defined in table 9A, together with detection of an underexpression of genes identified with sets of polynucleotide sequences defined in table 9B allows to classify patients having good or poor prognosis primary breast tumors.

[0038] In a preferred embodiment, the tumor cell presenting underexpressed or overexpressed sequences from the polynucleotide library of the invention are breast tumor cells.

[0039] In a particular embodiment the polynucleotides of the polynucleotide library of the present invention are immobilized on a solid support in order to form a polynucleotide array, and said solid support is selected from the group consisting of a nylon membrane, nitrocellulose membrane, glass slide, glass beads, membranes on glass support or a silicon chip.

[0040] Another object of the present invention concerns a polynucleotide array useful for prognosis or diagnosis of a tumor bearing at least one immobilized polynucleotide library set as previously defined.

[0041] The invention also concerns a polynucleotide array useful to differentiate a normal cell from a cancer cell bearing any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets indicated in table 5, useful in differentiating a normal cell from a cancer cell.

[0042] Preferably the polynucleotide array useful to differentiate a normal cell from a cancer cell bears any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets indicated in table 5A, and of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets indicated in table 5B.

[0043] The invention relates also to a polynucleotide array useful to detect a hormone-sensitive tumor cell bearing any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets defined in table 6.

[0044] Preferably the polynucleotide array useful to detect a hormone-sensitive tumor cell bears any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets defined in table 6A together with at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets defined in table 6B.

[0045] The invention concerns also a polynucleotide array useful to differentiate a tumor in which a lymph node has been invaded by a tumor cell from a tumor in which a lymph node has not been invaded by a tumor cell bearing any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets defined in table 7.

[0046] Preferably, the polynucleotide array useful to differentiate a tumor in which a lymph node has been invaded by a tumor cell from a tumor in which a lymph node has been invaded by a tumor cell bears any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets defined in table 7A together with at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets defined in table 7B.

[0047] The invention also concerns a polynucleotide array useful to differentiate anthracycline-sensitive tumors from anthracycline-insensitive tumors bearing any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets defined in table 8.

[0048] Preferably, the polynucleotide array useful to differentiate anthracycline-sensitive tumors from anthracycline-insensitive tumors bears any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets defined in table 8A together with at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets defined in table 8B.

[0049] The invention concerns also a polynucleotide array useful to classify good and poor prognosis primary breast tumors bearing any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence set defined in table 9.

[0050] Preferably, the polynucleotide array useful to classify good and poor prognosis primary breast tumors bears any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets defined in table 9A together with at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets defined in table 9B.

[0051] The present invention also concerns a method for detecting differentially expressed polynucleotide sequences that are correlated with a cancer, said method comprising:

[0052] obtaining a polynucleotide sample from a patient;

[0053] reacting the polynucleotide sample obtained in step (a) with a probe immobilized on a solid support wherein said probe comprises any of the polynucleotide sequences of the libraries previously defined or an expression product encoded by any of the polynucleotide sequences of the libraries previously defined; and

[0054] detecting the reaction product of step (b).

[0055] Preferably, the polynucleotide sample obtained at step (a) is labeled before its reaction at step (b) with the probe immobilized on a solid support.

[0056] The label of the polynucleotide sample is selected from the group consisting of radioactive, colorimetric, enzymatic, molecular amplification, bioluminescent or fluorescent labels.

[0057] In a particular embodiment the reaction product of step (c) is quantified by further comparison of said reaction product to a control sample.

[0058] In a first embodiment, the polynucleotide sample isolated from the patient and obtained at step (a) is either RNA or mRNA.

[0059] In another embodiment the polynucleotide sample isolated from the patient is cDNA is obtained by reverse transcription of the mRNA.

[0060] Preferably the reaction step (b) of the method for detecting differentially expressed polynucleotide sequences comprises a hybridization of the sample RNA issued from patient with the probe.

[0061] Preferably the sample RNA is labeled before hybridization with the probe and the label is selected from the group consisting of radioactive, calorimetric, enzymatic, molecular amplification, bioluminescent or fluorescent labels.

[0062] This method for detecting differentially expressed polynucleotide sequences is particularly useful for detecting, diagnosing, staging, monitoring, predicting, preventing or treating conditions associated with cancer, and particularly breast cancer.

[0063] The method for detecting differentially expressed polynucleotide sequences is also particularly useful when the product encoded by any of the polynucleotide sequence or subsequence set is involved in a receptor-ligand reaction on which detection is based.

[0064] The present invention is also related to a method for screening an anti-tumor agent comprising the above-depicted method for detecting differentially expressed polynucleotide sequences wherein the sample has been treated with the anti-tumor agent to be screened.

[0065] In a particular embodiment the method for screening an anti-tumor agent comprises detecting polynucleotide sequences reacting with at least one library of polynucleotides or polynucleotide sequence set as previously defined or of products encoded by said library in a sample obtained from a patient.

[0066] Tumor Samples and RNA Extraction

[0067] To avoid any bias of selection as to the type and size of the tumors, the RNAs to be tested were prepared from unselected samples. Samples of primary invasive breast carcinomas were collected from 34 patients undergoing surgery at the Institute Paoli-Calmette. After surgical resection, the tumors were macrodissected: a section was taken for the pathologist′ s diagnosis and an adjacent piece was quickly frozen in liquid nitrogen for molecular analyses. The median age of patients at the time of diagnosis was 55 years (range 39, 83) and most of them were post-menopausal. Tumors were classified according to the WHO histological typing of breast tumors in: 29 ductal carcinomas, 2 lobular carcinomas, 1 mixed ductal and lobular carcinoma, and 2 medullar carcinomas. They had various sizes, inferior or equal to 20 mm (n 13), between 20 and 50 mm (n=18) or superior to 50 mm (n=3), axillary′ s lymph node status (negative: 19 tumors, positive: 15 tumors), SBR grading (I: 3 tumors, II: 20 tumors, III: 10 tumors, not evaluable: 1 tumor), and estrogen receptor status (ER) evaluated by immunohistochemical assay (23 ER-positive, 11 ER-negative). ER positivity cutoff value was 10%. Adjuvant treatment with radiotherapy and when necessary multi-agent anthracycline-based chemotherapy (n=16) was given to patients according to local practice.

[0068] Total RNA was extracted from tumor samples by standard methods (43). Total RNA from normal breast tissue was obtained from Clontech (Palo Alto, Calif.): RNA was isolated from 8 tissue specimens from Caucasian females, age range 23-47. RNA integrity was controlled by denaturing formaldehyde agarose gel electrophoresis and Northern blots using a 28S-specific oligonucleotide.

[0069] cDNA Arrays Preparation

[0070] Gene expression was analyzed by hybridization of arrays with radioactive probes. The arrays contained PCR products of 5 control clones, and 180 IMAGE human cDNA clones selected with practical criteria (3′ sequence of mRNA, same cloning vector, host bacteria and insert size). This represented 176 genes (4 genes were represented by 2 different clones): 121 with proven or putative implication in cancer and 55 implicated in immune reactions (the list is available on the website: http:/tagc.univ-mrs.fr/pub/Cancer/). Their identity was verified by 5′ tag-sequencing of plasmid DNA and comparison with sequences in the EST (dbEST) and nucleotide (GenBank) databases at the NCBI. Identity was confirmed for all but 14 clones without significant gene similarity, which were referenced by their GenBank accession number. The control clones were: Arabidopsis thaliana cytochrome c 554 gene (used for hybridization signal normalization), 3 poly(A) sequences of different sizes and the vector pT 7T 3D (negative controls).

[0071] PCR amplification, purification and robotical spotting of PCR products onto Hybond-N+ membranes (Amersham) were done according to described protocols (4). All PCR products were spotted in duplicate. For normalization purpose, the c 554 gene was spotted 96-fold scattered over the whole membrane.

[0072] cDNA Array Hybridizations

[0073] Hybridizations were done successively with a vector oligonucleotide (to precisely determine the amount of target DNA accessible to hybridization in each spot), then after stripping of vector probe, with complex probes made from the RNAs (4). Each complex probe was hybridized to a distinct filter. Probes were prepared from total RNA with an excess of oligo(dT 25) to saturate the poly(A) tails of the messengers, and to insure that the reverse transcribed product did not contain long poly(T) sequences. A precise amount of c 554 mRNA was added to the total RNA before labeling to allow normalization of the data.

[0074] Five ng of total RNA (˜100 ng of mRNA) from tissue samples were used for each labeling. Probe preparation and hybridization of the membranes were done according to known procedures (http:/tagc.univ-mrs.fr/pub/Cancer/). Hybridization was done in excess of target (15 ng of DNA in each spot) and binding of cDNAs to the targets was linear and proportional to the quantity of cDNA in the probe.

[0075] Detection and Quantification of cDNA Array Hybridization Signals

[0076] Quantitative data were obtained using an imaging plate device. Hybridization signal detection with a FUJI BAS 1500 machine and quantification with the HDG Analyzer software (Genomic Solutions, Ann Arbor, Mich.) were done as previously described (http:/tagc.univ-mrs.fr/pub/Cancer/). Quantification was done by integrating all spot pixel intensities and substracting a spot background value determined in the neighboring area. Spots were located with a LaPlacian transformation. Spot background level was the median intensity of all the pixels present in a small window centered on the spot and which were not part of any spot (44). Quantified data were normalized in three steps and expressed as absolute gene expression levels (i.e. in percentage of abundance of individual mRNA with respect to mRNA within the sample), as described (4).

[0077] Array Data Analysis

[0078] Before analysis of the results, the reproducibility of the experiments was verified by comparing duplicate spots, or one hybridization with the same probe on two independent arrays, or two independent hybridizations with probes prepared from the same RNA. In every case, the results showed good reproducibility with respective correlation coefficients of 0.95, 0.98 and 0.98 (data not shown). Moreover, genes represented by two different clones on the array, such as CDK 4 (SEQ ID No: 288) or ETV 5 (SEQ ID No: 300), displayed similar expression profiles for the two clones in all samples. This reproducibility was sufficient to consider a 2-fold expression difference as significantly differential.

[0079] For graphical representation, data were displayed as absolute expression levels (FIG. 2a). For better visualization of clustering, results were log-transformed and displayed as relative values median-centered in each row and in each column (FIG. 2b). Hierarchical clustering was applied to the tissue samples and the genes using the Cluster program developed by Eisen (45) (average linkage clustering using Pearson correlation as similarity metric). Results in FIGS. 2 and 3 were displayed with the TreeView program (45).

[0080] Subsequent analysis was done using Excel software (Microsoft) and statistical analyses with the SPSS software. Metastasis-free survival and overall survival were measured from diagnosis until the first metastatic relapse or death respectively. They were estimated with the Kaplan-Meier method and compared between groups with the Log-Rank test. Correlations of gene pairs based on expression profiles were measured with the correlation coefficient r. The search for genes with expression levels correlated with tumor parameters was done in several successive steps.

[0081] First, genes were detected by comparing their median expression level in the two subgroups of tumors discordant according to the parameter of interest. The median values rather than the mean values were used because of the high variability of the expression levels for many genes, resulting in a standard deviation of expression level similar or superior to the mean value and making comparisons with means impossible. Second, these detected genes were inspected visually on graphics, and finally, an appropriate statistical analysis was applied to those that were convincing to validate the correlation. Comparison of GATA3 (SEQ ID No: 78) expression between ER-positive tumors and ER-negative tumors was validated using a Mann-Witney test. Correlation coefficients were used to compare the gene expression levels to the number of axillary nodes involved.

[0082] Northern Blot Analysis

[0083] Seventy-nine breast tumors, including 22 of the 34 tested on the arrays, were analyzed for GATA3 (SEQ ID No: 78) expression by Northern blot hybridization. RNA extraction from tumor samples and Northern blots were done as previously described (43). The GATA3 probe was prepared from the IMAGE cDNA clone 129757 (SEQ ID No: 78), which corresponds to the 3′ region (from +843 to +1689) of the GATA3 cDNA sequence (GenBank accession no. X55122). The insert (846 bp) was obtained by digestion of the clone with EcoRI and Pael enzymes. Northern blots were stripped and re-hybridized using an â-actin probe (46).

[0084]
FIG. 1 shows an example of differential gene expression between normal breast tissue (NB) and breast tumor samples. Each cDNA array on Nylon filter was hybridized with a complex probe made from 5 fg of total RNA. The top image corresponds to the whole membrane. For the two bottom images, only the right portion of the membranes is shown. Numbers below the spots indicate housekeeping genes (1, GAPDH and 2, actin), negative control clones (3, 4 and 5) and examples of genes differentially expressed between NB and breast tumor (6, stromelysin 3 (SEQ ID No: 346); 7, ERBB2 (SEQ ID No: 119); 8, MYBL2 (SEQ ID No: 310); 9, FOS (SEQ ID No: 318); 10, TGFáR 3; 11, desmin (SEQ ID No: 170)), and between ER- breast tumor and ER+ breast tumor (12, GATA3).

[0085]
FIG. 2 is a representation of expression levels of 176 genes in normal breast tissue (NB) and 34 samples of breast carcinoma. Each column corresponds to a single tissue, and each row to a single gene. (a) The results are expressed as percentage abundance of individual mRNA within the sample, and are represented using a blue color scale. The color scale (log scale with a 3-fold interval) indicated at the bottom left ranges from light blue (expression level ≧0.001%) to dark blue (expression level >3%). White squares indicate clones with undetectable expression levels and gray squares indicate missing data. The tissue samples are arbitrarily ordered and the clones are ordered from top to bottom according to increasing median expression levels. Horizontal black arrows on the right of the figure mark three clones with highly variable expression levels between the tumors (stromelysin 3 (SEQ ID No: 346), IGF2 (SEQ ID No: 61), GATA3 (SEQ ID No: 78) from top to bottom). (b) The results are shown as relative expression levels (relative to the median value of each row and each column) and are represented with a color scale indicated at the bottom left ranging from {fraction (1/100)} to 100 fold changes (gray squares: missing data). Eighteen clones with median expression level equal to zero in the 34 tumors are omitted. The clustering program arranges samples (n=35) along the horizontal axis so that those with the most similar expression profiles are placed adjacent to each other. Similarly, clones (n=162) are near each other along the vertical axis if they show a strong expression profile correlation across all tissues. The length of the branches of the dendrograms capturing respectively the samples (top) and the clones (left) reflects the similarity of the related elements. Two groups of tumors are separated and color coded: group A (blue) and group B (orange). Horizontal black and horizontal red arrows on the right of the figure respectively mark three genes with highly variable expression levels between the tumors (IGF2 (SEQ ID No: 61), GATA3 (SEQ ID No: 78), stromelysin 3 (SEQ ID No: 346) from top to bottom) and four pairs of different clones representing four genes. (c) Zoom representation of group A from FIG. 2b, excluding the two outlyer tumors at the right. The clustering separates two subgroups of tumors, A1 and A2. The dotted branches correspond to tumors associated with metastatic relapse and death. Follow-up was longer in A2 than in A1 (median 81 months for A2 versus 47 months for A1).

[0086]
FIG. 3 is prognostic classification of breast cancer by gene expression profiling showing that gene expression-based tumor classification correlates with clinical outcome. The 12 samples of group A (see FIG. 2band 2c) were reclustered using the top 32 differentially expressed genes between A1 and A2 subgroups. Data were displayed as in FIG. 2band shown with the same color key. The hierarchical clustering was applied to expression data from the 23 clones, out of 32, of which expression levels presented an at least two-fold change in at least two samples (out of 12). Two subgroups of tumors A1 and A2 are shown as well as two groups of differentially expressed clones. The dotted branches of tumor cluster A1 correspond to samples associated with metastatic relapse and death. FIG. 3a shows two-dimensional representation of hierarchical clustering results shown in FIGS. 2a and 2b. The analysis delineates 4 groups of tumours A, B, C and D. Black squares indicate patients alive at last follow-up visit and red squares indicate patients who died. Three classes of patients with a statistically different clinical outcome were defined according to gene expression profiles: class A (n=16), class B+C (n=34), class D (n=5). FIG. 3b illustrates a Kaplan-Meier plot of overall survival of the 3 classes of patients (p<0.005, log-rank test). And FIG. 3c illustrates a Kaplan-Meier plot of metastasis-free survival of the 3 classes of patients (p<0.05, log-rank test).

[0087]
FIG. 4 shows the correlation of GATA3 (SEQ ID No: 78) expression with ER phenotype. (a) The expression levels of GATA3 in 34 breast cancer samples (y axis) monitored by cDNA array analysis are reported in percentage of abundance of individual mRNA with respect to mRNA within the sample (log scale). GATA3 is significantly overexpressed in the ER-positive tumors (n=23) versus the ER-negative tumors (n=11) using the Mann-Witney test (p=0.0004). The expression level of GATA3 in normal breast tissue is reported on the right (NB). (b) Northern blot analysis of GATA3 in normal breast sample (NB) and 9 breast cancer samples (AT: tumor analyzed with cDNA array and Northern blot; NT: tumor analyzed with Northern blot). Blots were probed successively with cDNA from GATA3 (top) and â-actin (bottom). ER status is indicated for each tumor sample.

[0088] Data Representation

[0089]
FIG. 1 shows examples of hybridizations of cDNA arrays with probes made from RNA extracted from normal breast tissue and breast tumors.

[0090] The crude results of all hybridizations were processed to be presented either as absolute or relative values in schematic figures. The normalization procedure allowed display of absolute values expressed in percent of abundance of mRNA in the probe as shown in FIG. 2a. Each level of the blue color ladder represents a 3-fold interval of absolute abundance of mRNA. Each column corresponds to a tissue sample and each row to a gene. For graphic purposes, genes were ordered from top to bottom according to increasing median expression levels. Tumor samples were not ordered. The values in each sample displayed a wide range of intensities (3 decades in log scale) corresponding to expression levels ranging from approximately 0.002% to 5% of mRNA abundance. Many genes (see for example stromelysin 3 (SEQ ID No: 346), IGF2 (SEQ ID No: 61) and GATA3 (SEQ ID No: 78), arrows) displayed highly variable expression levels across all tumor samples, scattered over the whole dynamic range of values. A representation of relative values is shown in FIG. 2b. Absolute values were log-transformed, omitting 18 clones whose median intensity was equal to zero across all tissues. Data for each of the 162 remaining clones were then median-centered, as well as data for each sample, so that the relative variation was shown, rather than the absolute intensity. A color scale was used to display data: red for expression level higher than the median and green for expression level lower than the median. The magnitude of the deviation from the median was represented by the color intensity. A hierarchical clustering program was then applied to group the 35 samples according to their overall gene expression profiles, and to group the 162 clones on the basis of similarity of their expression levels in all tissues. This resulted in a picture highlighting groups of correlated tissues and groups of correlated genes as depicted by dendrograms.

[0091] Breast Tumor Classification

[0092] As shown in FIG. 2b, the clustering algorithm identified two groups of samples, designated A (n=15, including normal breast, NB) and B (n=20). These groups were similar with respect to patient age, menopausal status at diagnosis, SBR grading and tumor pathological size. However, 72% of tumors in group A were node-positive and 75% in group B were node-negative. Moreover, 80% of the tumors in group B were estrogen receptor (ER) positive and 50% in group A were ER-negative. With a median follow-up of 44 months after diagnosis, overall survival was different between A and B groups: 5 women died in A (median follow-up 58 months) and 1 in B (median follow-up 40 months). But the frequency of metastatic relapse was relatively similar in the two groups, with 5 women who relapsed in A and 6 in B. Because the time between the diagnosis of metastasis and last follow-up is too short in B, a longer follow-up is needed to determine if these two different groups, defined with expression profiles, have really a different outcome with respect to overall survival.

[0093] In the group A of 15 samples, three samples (normal breast and two tumors) were different from each other and from the other 12 samples. The latter constituted two subgroups of tumors, A1 (n=6) and A2 (n=6), which could be further separated by clustering as shown in FIG. 2c. The 12 tumors had a uniformly high risk of metastatic relapse according to conventional prognostic features as shown in Table 1. Most of them had received comparable adjuvant anthracycline-based chemotherapy after surgery, with more women treated in the A1 subgroup. Interestingly, these two subgroups, which could not be distinguished with commonly used histoclinical features, had a very different clinical outcome: there were 4 metastatic relapses and 4 deaths in A1 (median follow-up: 44 months). In contrast and despite a longer median follow-up (90 months), no metastasis or death occurred in A2. This resulted in a significant better metastasis-free survival (p<0.01) and overall survival (p<0.005) for group A2 than for group A1 tumors. No such subgrouping could be done in B.

1TABLE 1SubgroupA1A2Tumor position in the cluster123456789101112Age, years465860635158464750474666Nodal status1001613371041200Histological size, mm602026352030272530252022SBR grade||||||||||||||||||||||||||||ER statusnegnegnegnegnegnegposnegposposposposAdjuvant chemotherapyyesyesnoyesyesyesyesyesnoyesnonoMetastasisyesnoyesyesnoyesnonononononoFollow-up, months58106354741318598954919141Patients statusDADDADAAAAAAPatient characteristics in subgroups A1 and A2. The 12 tumors are numbered from 1 to 12 according to their position from left to right in the clustering graphic displayed in FIG. 3. Adjuvant chemotherapy was anthracycline-based. In the line concerning the patient status, A means alive and D means death from cancer progression.

[0094] Genes responsible for group A substructure were searched. These are potentially relevant to the prognosis and the sensitivity to chemotherapy in these tumors. Thirty-two genes out of 188 were identified by comparing their median expression level in A1 vs A2. Then, the 12 tumors were reclustered using the expression profiles of these genes as shown in FIG. 3. The same subgroups A1 and A2 were evident and separated by 2 groups of genes: as expected, high expression of ERBB2 (SEQ ID No: 119), MYC (SEQ ID No: 75) and EGFR (SEQ ID No: 137) was associated with bad prognosis subgroup A1 (6-8), and that of E-cadherin (SEQ ID No: 328) and the proto-oncogene MYB (SEQ ID No: 355) with good prognosis subgroup A2 (9, 10). For most of the other genes, these results may stimulate new investigations. Differentiation state is a good prognostic factor in breast cancer and, accordingly, genes associated with cell differentiation, such as GATA3 (SEQ ID No: 78) (11) and CRABP2 (SEQ ID No: 158) (12), had a high level of expression in the better outcome group. The high expression of Ephrin-Al mRNA in the bad prognosis subgroup suggests a role of this growth factor in breast cancer and can be paralleled with its up-regulation during melanoma progression (13).

[0095] Differential Gene Expression Between Normal Breast and Breast Tumors

[0096] To identify genes differentially expressed between breast tumors (T) and normal breast (NB), the NB value for each gene was compared to its expression level in each tumor. When the expression level of a gene in NB was undetectable, only qualitative information could be deduced and the mRNA was considered as differentially expressed if the signal intensity in the tumor was superior to the reproducibility threshold (0.002% of mRNA abundance). In the other cases, differential expression was defined by an at least 2-fold expression difference. Also, the number of tumors where it was over- or underexpressed was measured. Table 2 shows a list of the top 20 over- and underexpressed genes. For these genes, the T/NB ratio is reported, where T represented their median expression value in the 34 tumors. This ratio ranged from 2.70 (ABCC 5; (SEQ ID No: 325) to 17.76 (GATA3; (SEQ ID No: 78) for the overexpressed genes, and from 0.00 (desmin, (SEQ ID No: 170) to 0.29 (APC; (SEQ ID No: 56) for the underexpressed genes.

2TABLE 2Clone IDGene/Protein identityGene symbolChrom. locationN T/NBOverexpressed genes154343Granzyme HGZMH14q11.232 9.51235947Stromelysin 3STMY322q11.231 15.92207378MYB Related Protein BMYBL220q13.131 (a)153275Cellular Retinoic Acid Binding Protein 2CRABP21q21.329 7.16129757GATA-binding protein 3GATA310p1528 17.76120649T-Lymphocyte surface CD2 antigenCD21p13.128 7.54109677CREB Binding ProteinCREBBP16p13.328 5.08172152EGFR-binding protein GRB2GRB217q24-q2528 5.00 66969Transcription factor RELBRELB1928 3.61182007ETS-Related Transcription Factor ELF1ELF113q1327 3.58153446LIM domain protein RILRIL5q31.126 4.03203394ETS Variant gene 5 (ETS-related molecule)ETV53q2825 3.67160963Thrombospondin 1THBS115q1525 3.39188393POU domain, class 2, transcription Factor 2POU2F21924 4.02187822Integrin, beta 2ITGB221q22.324 3.01243907Nuclear Factor of Activating T cell Subunit p45NF45124 2.84158347EST H27202EST23 2.91230933EST AW184517EST22 2.85212366ATP-Binding Cassette, sub-family C (CFTR/MRP), 5ABCC53q2722 2.70149401Cathepsin DCTSD11p15.521 2.97Underexpressed genes153854DesminDES2q3534 0.00208717P55-C-FOS proto-oncogene proteinFOS14q24.333 0.05159093Transcription Factor AP4TFAP416p1333 0.11124340Tenascin XATNXA6p21.333 0.14133738ProlactinPRL6p22.2-p21.332 0.00133891Chorionic Somatomammotropin Hormone 1CSH117q22-q2432 0.00151501Tyrosine Kinase Receptor TEKTEK9p2132 0.00183030Activating Transcription Factor 3ATF3132 0.07120916Phosphodiesterase IPDNP28q24.132 0.14155716EST R72075EST31 0.00208118Transforming Growth Factor Beta Receptor Type IIITGFBR31p33-p3231 0.14187547Diphtheria Toxin ReceptorDTR5q2331 0.17108490HIV-1 Rev Binding proteinHRB2q3631 0.20147002B-cell CLL/lymphoma 2BCL218q21.331 0.26182610Microsomal Glutathione S Transferase 1MGST112p12.3-p12.131 0.28152802Phospholipase A2 Membrane Associated, group IIAPLA2G2A1p3530 0.03183087Interleukin 3 Receptor Alpha chainIL3RAXp22.3; Yp13.330 0.24108571Retinoblastoma-Like 2 (p130)RBL216q12.229 0.28125294Adenomatous Polyposis Coli ProteinAPC5q21-q2229 0.29151767FASL ReceptorTNFRSF610q24.128 0.27List of the genes that show the most frequent differential expression between normal breast tissue and 34 breast carcinomas as measured by cDNA array analysis. N indicates the number of tumor samples where the gene is dysregulated (fold change □ 2) compared to normal breast tissue. T/NB represents the ratio: median expression level in 34 breast tumors/expression level in normal breast. (a) MYBL2 transcript displayed a median expression level of 0.025% in breast tumors # and was undetectable in NB.

[0097] High expression of mucin I (SEQ ID No: 58), NM 23, ERBB2 (SEQ ID No: 119), FGFRJ (SEQ ID No: 182) and FGFR 2 (SEQ ID No: 15), MYC (SEQ ID No: 75), stromelysin 3 (SEQ ID No: 346), cathepsin D (SEQ ID No: 128) and downregulation of FOS (SEQ ID No: 318), APC (SEQ ID No: 56), RBL2, FAS, BCL2 (SEQ ID No: 117) were found, reflecting what is known about their biology in cancer. GATA3 (SEQ ID No: 78), which codes for a member of the GATA family of zinc finger transcription factors, and CRABP2 (SEQ ID No: 158), encoding one of the two cellular retinoic acid-binding proteins, showed high expression of mRNA, extending previous results on cDNA arrays (4). Differential gene expression among various breast tumors and correlation with histoclinical prognostic parameters

[0098] To search for potential prognostic markers in breast cancer, genes with expression levels correlated with conventional histoclinical prognostic parameters were looked for: age of patients, axillary node status, tumor size, histological grade and ER status. No significant correlation was found with age, tumor size and histological grade. However, the expression profiles of some genes correlated with ER status and axillary node involvement.

[0099] To identify genes potentially relevant to the hormone-responsive phenotype, the gene expression profiles in ER-positive breast cancers (n=23) versus ER-negative breast cancers (n=11) were compared. Sixteen clones displayed a median intensity of 0 in both groups. Twenty-five presented a fold change superior to 2. Table 3a displays the top 10 over- and underexpressed genes. Among them, the most differentially expressed was GATA3 (SEQ ID No: 78) with a median intensity ratio ER+/ER− of 28.6 and a value for the first quartile of ER-positive tumors superior (5-fold) to the value of the third quartile of the ER-negative tumors as shown in FIG. 4a. The high expression of GATA3 in ER-positive tumors was statistically significant using a Mann-Witney test (p<0.001). All ER-positive tumors and only 18% of ER-negative tumors displayed a GATA3 expression level greatly superior (fold change >3) to the normal breast value. Furthermore GATA3 expression was analyzed by Northern blot hybridization (FIG. 4b) in a panel of 79 breast cancers (21 ER-negative tumors and 58 ER-positive tumors), including 22 of the tumors analyzed with cDNA arrays. It confirmed the array results for those 22 tumors as well as the strong correlation between ER status and GATA3 RNA expression (Mann-Witney test, p<0.0001).

3TABLE 3aCloneER+/IDGene/Protein identityGene symbolER−129757GATA-binding protein 3GATA328.6356763Granzyme AGZMA5.7248613MYB proto-oncogeneMYB3.4211999KIAA1075 proteinKIAA10753.3235947Stromelysin 3STMY33.1229839Macrophage Stimulating 1MST12.8153275Cellular Retinoic Acid Binding Protein 2CRABP22.7301950X-box Binding Protein 1XBP12.7205314Tumor Protein p53TP532.5126233Insulin-like Growth Factor 2IGF22.4 66322CD3G antigen, GammaCD3G0.0195022Interleukin 2 Receptor Gamma chainIL2RG0.0111461SOX4 ProteinSOX40.4151475Epidermal Growth Factor ReceptorEGFR0.5195022Interleukin 2 Receptor Beta chainIL2RB0.5130788Topoisomerase (DNA) II beta (180 kD)TOP2B0.6323948SOX9 ProteinSOX90.6183641S100 calcium-binding protein BetaS100B0.6246620EST N53133EST0.6231424Glutathione S Transferase PiGSTP10.6

[0100] To search for genes whose expression profile was correlated with axillary lymph node status, a strong prognostic factor in breast cancer, the group of node-negative tumors (n=19) was compared with the group of tumors with massive axillary extension (10 or more positive nodes). Furthermore, because survival decreases with the increase of the number of tumor-involved lymph nodes and because the expression measurements were quantitative, correlation between the expression levels of these genes and the number of tumor-involved nodes (quantitative variables) was determined. Table 3bshows a list of the top 10 over- and underexpressed genes between these 2 groups. Most of these genes have not been previously reported as associated with node status, but some of these results are in agreement with literature data. The gene encoding the tyrosine kinase receptor ERBB2 (SEQ ID No: 119) was the most significantly overexpressed gene in node-positive tumors and displayed the highest correlation coefficient (r=0.68; p<0.0001).

4TABLE 3bCloneN−/IDGene/Protein identityGene symbol10N+129757GATA-binding protein 3GATA311.0160963Thrombospondin 1THBS16.6151475Epidermal Growth Factor ReceptorEGFR5.4120916Phosphodiesterase IPDNP24.9183030Activating Transcription Factor 3ATF34.6211999KIAA1075 proteinKIAA10754.5110480Nuclear Factor 1 A-typeNF1A4.5182264P-SelectinSELP4.4356763Granzyme AGZMA4.3214008E-cadherinCDH14.0147016ERBB2 Receptor Protein-Tyrosine KinaseERBB20.2179197Protein Phosphatase PP2A, 55 kD SubunitPP2A BR0.2gamma231424Glutathione S Transferase PiGSTP10.4111461SOX4 ProteinSOX40.4195022Interleukin 2 Receptor Beta chainIL2RB0.4220451Zinc Finger protein 144ZNF1440.5125413Mucin 1MUC10.6290007CD44 antigen, epithelial formCD440.6108571Retinoblastoma-Like 2 (p130)RBL20.7130788Topoisomerase (DNA) II Beta (180 kD)TOP2B0.7List of genes differentially expressed between ER-positive and ER-negative breast tumors (a) and between axillary lymph node-negative tumors and tumors with 10 or more involved lymph nodes (b).

[0101] Gene clustering from FIG. 2b showed groups of genes with correlated expression across samples. When different clones represented the same gene, they were clustered next to each other (red arrows). Correlation coefficients between gene pairs in the 34 tumors were often high (1% of the 13,041 gene pairs showed a correlation coefficient superior to 0.95—not shown). An example of highly correlated gene expression is that of BCL2 (SEQ ID No: 117) and RBL2. Such correlated expression, although it has not been described in the literature, probably reflects a common mechanism of regulation for these two genes. Furthermore, these genes also exhibited significant correlated expression with other genes such as PPP2CA (SEQ ID No;184), AKT2 (SEQ ID No: 254), PRKCSH (SEQ ID No: 264) or TNFRSF6/FAS SEQ ID No.143). In particular, a striking correlated expression between BCL2 and FAS could be observed (r=0.91; data not shown). The exact meaning of this correlation is unknown, although it may reflect the necessary balance between apoptosis and anti-apoptosis for cell survival.

[0102] Although in human cancer the proportion of changes that is reflected at the RNA level is not known, monitoring gene expression patterns appears as a very promising way of increasing the knowledge of the disease. Several different types of cancer have been investigated using cDNA arrays: cervical (14), hepatocellular (15), ovarian (16), colon (17) and renal carcinomas (18), glioblastomas (19), melanomas (20) (21), rhabdomyosarcomas (22), acute leukemias (23) and lymphomas (24). In breast cancer, pioneering studies have yielded the first expression patterns (4, 25-31). They have in particular addressed the important issue of molecular differences in hormone-responsive and non-responsive breast tumors. Thus, Yang et al. (28) and Hoch et al. (25) compared expression profiles of breast carcinoma cell lines known to represent these two categories and identified a few genes with differential expression. One of these genes was GATA3. In these studies, cell lines were mostly used and tumor samples were rarely tested and generally in small numbers. The first study analyzing the expression profiles of a large series of breast cancers was published recently (32), but no correlation with clinical outcome was mentioned.

[0103] Several interesting points can be made based on the present experimentation. First, the differences in expression patterns among the tumors provided molecular transcriptional evidence of the histoclinical heterogeneity of breast cancer. This diversity was multifactorial, linked to many different genes, highlighting the interest of high throughput analysis in this context. It was possible, with a hierarchical clustering program integrating the expression profiles, to separate normal breast tissue from most tumors and, moreover, to identify two different groups of tumors. Most importantly, two different subgroups of tumors with a very distinct clinical outcome that could not be predicted with classical prognostic factors have been identified by clustering. Indeed, all these tumors had a theoretically bad prognosis as evaluated by current histoclinical tools. All these patients would be at the present time treated with adjuvant chemotherapy, but without the capacity for the physicians to identify patients who will benefit from this treatment and those who will not benefit.

[0104] Gene expression profiles were able to make this discrimination. Such predictive tools have important therapeutic implications. Patients with features of poor prognosis are candidates for other treatment than standard chemotherapy, avoiding loss of time and toxicities related to first-line chemotherapy. These results suggest that the histoclinical category of poor prognosis breast cancer, currently treated with adjuvant anthracycline-based chemotherapy, groups together at least two molecularly distinct subgroups of tumors with different outcome which would require distinct chemotherapy regimens. Expression profiles could thus provide a new and more accurate way of classifying breast tumors of poor prognosis and managing patients.

[0105] Similarly, despite molecular heterogeneity, significant correlations between the expression level of genes (GATA3 (SEQ ID No: 78), ERBB2 (SEQ ID No: 119)) and histological tumor parameters were identified. The ER-positivity in breast cancer has been correlated with tumor differentiation, low proliferating rate, favorable prognosis and response to hormonal therapy. The relation between hormone sensitivity of breast cancer and ER status is not perfect, and it is possible that some genes related to ER expression are more important than ER to characterize the hormone-sensitive phenotype. These genes could serve as predictive factors to guide the therapy.

[0106] GATA3 mRNA expression was highly correlated with ER status. GATA3, which is not estrogen-regulated (25), is a transcription factor that could regulate the expression of genes involved in the ER-positive phenotype. Among the other genes that were found associated with ER status during the experimental work leading to the present invention, some, such as MYB (SEQ ID No: 355) (10), stromelysin 3 (SEQ ID No: 346) (33), and CRABP2 (SEQ ID No: 158) (34), have been previously reported expressed at high levels in ER-positive breast tumors. The higher levels of TP53 MnRNA in ER-positive tumors studied were surprising, although in agreement with a recent study (27). Most studies concerning TP53 expression analyzed the protein level rather than the mRNA level, and TP53 protein levels are classically negatively correlated with the ER status (35). The high expression of CRABP2 could be related to the better differentiated status of the ER-positive tumors. The low expression of the three immunity-related genes IL2RB (SEQ ID No: 99), IL2RG (SEQ ID No: 281) and CD3G (SEQ ID No: 416) may be related to the low lymphoid infiltration in these well differentiated tumors. ERBB2 high expression in breast cancer has been associated with a poor prognosis and some resistance to hormonal therapy and chemotherapy (36). It is involved in the regulation of cellular differentiation, adhesion, and motility. The motility-enhancing activity of ERBB2 (37) could be responsible for the increased metastatic potential and the unfavorable prognosis of the breast tumors that overexpress ERBB2. The low expression of E-cadherin (SEQ ID No: 328) and thrombospondin 1 (SEQ ID No: 217) in node-positive tumors are consistent with their putative role in different steps of metastatic spread: E-cadherin is an epithelial cell adhesion molecule whose disturbance is a prerequisite for the release of invasive cells in carcinomas (38) and thrombospondin 1 inhibits angiogenesis (39). Similarly, the high expression of the molecule surface antigen Mucin 1 in node-positive tumors (40) can reduce cell-cell interactions facilitating cell detachment and metastasis. CD44 (SEQ ID No: 376), encoding a transmembrane glycoprotein involved in cell adhesion and lymph node homing (41) was expressed at high levels in node-positive tumors as well as GSTPI (SEQ ID No: 336) (Glutathione-S-Transferase Pi), recently reported associated with increased tumor size (27).

[0107] Second, there were a number of genes with highly correlated expression patterns. Gene correlations have already been reported with larger series of genes, essentially under dynamic experimental conditions (42) and recently in steady states (17). Here, correlations were based on expression profiles of a relatively small but selected series of genes and in steady states represented by different breast tumors. Gene correlations are potentially useful tools for cancer research in two ways: i) they can provide information about the general regulation circuitry of a cancerous cell, allowing the identification of regulatory elements controlling expression networks; ii) they offer the possibility of reducing the complexity of the system analyzed by replacing, for example, the intensities of a large number of genes present in a gene cluster by their respective mean intensities.

[0108] Finally, these results highlight the great potential of cDNA array in cancer research. The gene expression profiles confirmed the heterogeneity of breast cancer, and most importantly allowed us to identify, among a series of poor prognosis breast tumors, two subtypes of the disease not yet recognized with usual histoclinical parameters but with a different clinical outcome after adjuvant chemotherapy. Furthermore, the present invention allows detection of genes of which expression was correlated with classical prognostic factors.

[0109] Table 4 displays a library of polynucleotides SEQ ID NO: 1 to SEQ ID NO: 468 corresponding to a population of polynucleotide sequences underexpressed or overexpressed in cells derived from tumors, more particularly breast tumors, and their respective complements.

5TABLE 4CORRELATION BETWEEN SEQ ID NO AS FILED WITH US PROVISIONAL APPLICATIONNo 60/254,090 and SEQ ID NO FILLED WITH NEW APPLICATIONGeneProvisionalProvisionalCurrent,Current,Current,SymbolsNoNameImageSeq3′Seq5′Seq3′Seq5′(mRNA)GATA31GATA-binding pro-129757SEQ ID No:1SEQ ID No:76SEQ ID No:77SEQ ID No:78tein 3 (GATA3)MYB2v-myb avian myelo-248613SEQ ID No:20SEQ ID No:354SEQ ID No:355blastosis viral onco-gene homolog(MYB)KIAA 10753KIAA 1075 protein211999SEQ ID No:3SEQ ID No:4SEQ ID No:322SEQ ID No:3230STMY34matrix metallopro-235947SEQ ID No:5SEQ ID No:3450SEQ ID No:346teinase 11 (strom-elysin 3) (MMP11)(ex STMY3)HGFL5macrophage-stim-229839SEQ ID No:6SEQ ID No:7SEQ ID No:331SEQ ID No:332SEQ ID No:333ulating protein(MST1) (ex HGFL)CRABP6cellular retinoic153275SEQ ID No:8SEQ ID No:9SEQ ID No:156SEQ ID No:157SEQ ID No:158acid-binding protein2 (CRABP2)XBP17X-box binding pro-301950SEQ ID No:10SEQ ID No:11SEQ ID No:385SEQ ID No:386SEQ ID No:387tein 1 (XBP1)TP538tumor protein p53205314SEQ ID No:12SEQ ID No:44200(Li-Fraumeni syn-drome) (TP53)IGF29insulin-like growth126233SEQ ID No:13SEQ ID No:14SEQ ID No:59SEQ ID No:60SEQ ID No:61factor 2 (somato-medin A) (IGF2),CD3G10CD3G antigen,66322SEQ ID No:15SEQ ID No:16SEQ ID No:414SEQ ID No:415SEQ ID No:416gamma polypeptide(TiT3 complex)(CD3G)IL2RG11interleukin 2 recep-195022SEQ ID No:17SEQ ID No:18SEQ ID No:279SEQ ID No:280SEQ ID No:281tor, gamma (severecomnbined immuno-deficiency) (IL2RG)SOX412SRY (sex determin-111461SEQ ID No:19SEQ ID No:20SEQ ID No:22SEQ ID No:23SEQ ID No:24ing region Y)-box 4(SOX4)EGFR13epidermal growth151475SEQ ID No:21SEQ ID No:22SEQ ID No:135SEQ ID No:136SEQ ID No:137factor receptor(avian erythroblasticTOP2B14topIIb mRNA for130788SEQ ID No:230SEQ ID No:82SEQ ID No:83topoisomerase IIb.S100B15S100 calcium-bind-183641SEQ ID No:240SEQ ID No:255SEQ ID No:256ing protein, beta(neural) (S100B)EST N5313316EST N53133246620SEQ ID No:25SEQ ID No:3520SEQ ID No:353GSTP117glutathione S-trans-231424SEQ ID No:26SEQ ID No:27SEQ ID No:334SEQ ID No:335SEQ ID No:336ferase pi (GSTP1)THBS118thrombospondin 1160963SEQ ID No:28SEQ ID No:2160SEQ ID No:217(THBS1)PDNP219cctonucleotide120916SEQ ID No:29SEQ ID No:30SEQ ID No:39SEQ ID No:40SEQ ID No:41pyrophosphatase/phosphodiesterase2(autotaxin)(ENPP2) (exPDNP2)ATF320activating transcrip-183030SEQ ID No:31SEQ ID No:32SEQ ID No:250SEQ ID No:251SEQ ID No:252tion factor 3 (ATF3)NF1A21(ex NF1A)110480SEQ ID No:33SEQ ID No:1600SELP22selectinm P (granule182264SEQ ID No:34SEQ ID No:438SEQ ID No:4390membrane protein140kD, antigenCD62) (SELP)CDH123cadherin 1, E-214008SEQ ID No:35SEQ ID No:36SEQ ID No:326SEQ ID No:327SEQ ID No:328cadherin (epi-thelial) (CDH1)ERBB224v-erb-b2 avian147016SEQ ID No:370SEQ ID No:118SEQ ID No:119erythroblasticleukemia viraloncogene homolog2 (neuro/glioblastomaderived oncogenehomolog) (ERBB2)PP2A BR25(PP2A BR gamma)179197SEQ ID No:38SEQ ID No:39SEQ ID No:238SEQ ID No:2390gammaZNF14426zinc finger pro-220451SEQ ID No:40SEQ ID No:410SEQ ID No:329SEQ ID No:330tein 144 (Mel-18)(ZNF144)MUC127mucin 1, transmem-125413SEQ ID No:420SEQ ID No:57SEQ ID No:58brane (MUC1)CD4428CD44E (epithelial290007SEQ ID No:43SEQ ID No:44SEQ ID No:374SEQ ID No:375SEQ ID No:376form)PLA2G2A29phospholipase A2,152802SEQ ID No:45SEQ ID No:46SEQ ID No:147SEQ ID No:148SEQ ID No:149group IIA (plate-lets, synovialfluid) (PLA2G2A),nuclear geneencoding mito-chondrial proteinACVRL130activin A receptor153350SEQ ID No:47SEQ ID No:48SEQ ID No:159SEQ ID No:160SEQ ID No:161type II-like1 (ACVRL1)AXL31AXL receptor tyro-112500SEQ ID No:49SEQ ID No:50SEQ ID No:27SEQ ID No:28SEQ ID No:29sine kinase (AXL)PKU-ALPHA32KU-alpha, partial109569SEQ ID No:510SEQ ID No:5SEQ ID No:6cds (new genesymbol Tlk2)ABCC533ATP-binding212366SEQ ID No:520SEQ ID No:324SEQ ID No:325cassette, sub-family C (CFTR/MRP), member5 (ABCC5)EDNRB34endothelial recep-154244SEQ ID No:530SEQ ID No:176SEQ ID No:177tor type B(EDNRB), trans-cript variant1DTR35diphtheria toxin187547SEQ ID No:540SEQ ID No:265SEQ ID No:266receptor (hep-arin-bindingepidermalIGF1R36insulin-like150361SEQ ID No:550SEQ ID No:129SEQ ID No:130growth factor 1receptor (IGF1R)KIAA042737KIAA0427127507SEQ ID No:56SEQ ID No:57SEQ ID No:65SEQ ID No:66SEQ ID No:67CD6938CD69 antigen (p60,276727SEQ ID No:580SEQ ID No:370SEQ ID No:371early, T-cellactivation anti-gen)FGFR439fibroblast116781SEQ ID No:59SEQ ID No:60SEQ ID No:36SEQ ID No:37SEQ ID No:38growth factorreceptor 4(FGFR4)EST T8568340EST T85683 cathe-112622SEQ ID No:610SEQ ID No:30SEQ ID No:31spin B (CTSB)EST R0056941EST R00569 IL2-123871SEQ ID No:620SEQ ID No:44SEQ ID No:45inducible T-cell kinase (ITK)TGFBR342transforming growth208118SEQ ID No:63SEQ ID No:64SEQ ID No:311SEQ ID No:312SEQ ID No:313factor, betareceptor III(TGFBR3)INSR43insulin receptor151149SEQ ID No:650SEQ ID NO;131SEQ ID No:132(INSR)MARK344MAP/microtubule110599SEQ ID No:66SEQ ID No:67#N/A#N/A#N/Aaffinity-reg-ulating kinase 3(MARK3)TIMP245tissue inhibitor131504SEQ ID No:680SEQ ID No:86SEQ ID No:87of metallopro-teinase 2 (TIMP2)EST R8555746EST R85557 throm-180219SEQ ID No:69SEQ ID No:2400SEQ ID No:241bospondin 3(THBD3)GNRH147gonadotropin-releas-192688SEQ ID No:700SEQ ID No:277SEQ ID No:278ing hormone 1(GNHR1)FGFR248fibroblast growth110387SEQ ID No:71SEQ ID No:72SEQ ID No:13SEQ ID No:14SEQ ID No:15factor receptor2 (FGFR2)NFKB249NFKB2114879SEQ ID No:73SEQ ID No:3500VIL250villin 2 (ezrin)124701SEQ ID No:74SEQ ID No:75SEQ ID No:51SEQ ID No:52SEQ ID No:53(VIL2)ENG51endoglin (ENG)156979SEQ ID No:76SEQ ID No:77SEQ ID No:196SEQ ID No:197SEQ ID No:198EPHA252EphA2 (EPHA2)162004SEQ ID No:78SEQ ID No:2210SEQ ID No:222CREM53cAMP responsive258584SEQ ID No:79SEQ ID No:80SEQ ID No:358SEQ ID No:359SEQ ID No:360element modulator(CREM)ETV5-a54ets variant270549SEQ ID No:81SEQ ID No:82SEQ ID No:368SEQ ID No:369SEQ ID No:300gene 5 (ETV5)EST N6853655EST N68536 MAX-298242SEQ ID No:83SEQ ID No:840SEQ ID No:380SEQ ID No:381interacting pro-tein 1 (MX11)EST R8112656EST R81126 lym-146635SEQ ID No:85SEQ ID No:86SEQ ID No:11400photoxin beta re-ceptor (LTBR)POU2F257(POu2F2)188393SEQ ID No:87SEQ ID No:88SEQ ID No:2710SEQ ID No:272FLI158Friend leukemia vir-198144SEQ ID No:89SEQ ID No:90SEQ ID No:293SEQ ID No:294SEQ ID No:295us integration 1(FLI1)TIE59tyrosine kinase with144081SEQ ID No:910SEQ ID No:109SEQ ID No:110immunoglobulin andepidermal growthfactor homologydomains(TIE)PRLR60prolactin receptor138788SEQ ID No:92SEQ ID No:93SEQ ID No:94SEQ ID No:95SEQ ID No:96(PRLR)PPP3CA61protein phosphatase110481SEQ ID No:94SEQ ID No:95SEQ ID No:17SEQ ID No:18SEQ ID No:193 (formerly 2B),catalytic subunit,gamma isoform(calcineurin Agamma) (PPP3CC)(ex PPP3CA)PTPN262protein tyrosine161451SEQ ID No:96SEQ ID No:97SEQ ID No:218SEQ ID No:219SEQ ID No:220phosphatase, non-re-ceptor type 2(PTPN2)PGF63placental growth139326SEQ ID No:980SEQ ID No:102SEQ ID No:103factor, vascularendothelial growthfactor-relatedprotein (PGF)TNFAIP364tumor necrosis309943SEQ ID No:99SEQ ID No:388SEQ ID No:389SEQ ID No:390factor, alpha-in-duced protein 3(TNFAIP3)PHB65PHB (prohibitin)236008SEQ ID No:100SEQ ID No:347SEQ ID No:348SEQ ID No:349RIL66LIM domain pro-153446SEQ ID No:1010SEQ ID No:162SEQ ID No:163tein (RIL)MYBL267v-myb avian mye-207378SEQ ID No:102SEQ ID No:103SEQ ID No:308SEQ ID No:309SEQ ID No:310loblastosis viraloncogene homolog-like 2 (MYBL2)RELB68v-rel avian retic-66969SEQ ID No:104SEQ ID No:105SEQ ID No:417SEQ ID No:418SEQ ID No:419uloendotheliosisviral oncogenehomolog B (nuclearfactor of kappa lightpolypeptide geneenhancer in B-cells3) (RELB)EST R9721869Est R97218200394SEQ ID No:106SEQ ID No:296SEQ ID No:2970GZMH70granzyme B (gran-154343SEQ ID No:107SEQ ID No:1780SEQ ID No:179zyme 2, cytotoxicT-lymphocyte-ass-ociated serine es-terase 1) (GZMB)(ex GZMH)MYC71c-myc proto-onco-129438SEQ ID No:108SEQ ID No:109SEQ ID No:73SEQ ID No:74SEQ ID No:75geneCASP172caspase 4, apop-131502SEQ ID No:110SEQ ID No:840SEQ ID No:85tosis-related cy-steine protease(CASP4) (exCASP1)SYK73spleen tyrosine128142SEQ ID No:111SEQ ID No:112SEQ ID No:68SEQ ID No:69SEQ ID No:70kinase (SYK)EST H2720274EST H27202 trans-158347SEQ ID No:113SEQ ID No:114SEQ ID No:204SEQ ID No:2050cription factorE1AF geneHRB75syndecan 1)108490SEQ ID No:115SEQ ID No:116SEQ ID No:10SEQ ID No:2(SDC1) (ex HRB)SHC176p66shc (SHC)153548SEQ ID No:1170SEQ ID No:164SEQ ID No:165CSF177colony stimulating124554SEQ ID No:118SEQ ID No:119SEQ ID No:48SEQ ID No:49SEQ ID No:50factor 1 (CSF1)UBE3A78ubiquitin protein141924SEQ ID No:1200SEQ ID No:104SEQ ID No:105ligase E3A(UBE3A)FKHR79forkhead box151247SEQ ID No:1210SEQ ID No:133SEQ ID No:134O1A (rhabdomyo-sarcoma)(FOXO1A) (exFKHR)CSF1R80colony stimulating196282SEQ ID No:122SEQ ID No:2910SEQ ID No:292factor 1 re-ceptor (CSF1R)IFI7581interferon-induced205612SEQ ID No:123SEQ ID No:124SEQ ID No:305SEQ ID No:306SEQ ID No:307protein 75 (IFI75)GATA182GATA-binding pro-109093SEQ ID No:1250SEQ ID No:3SEQ ID No:4tein 1 (globintranscriptionfactor 1) (GATA1)STAT183signal transducer110101SEQ ID No:1260SEQ ID No:11SEQ ID No:12and activator oftranscription 1(STAT1)CREBBP84CREB binding pro-109677SEQ ID No:127SEQ ID No:128SEQ ID No:7SEQ ID No:80tein (Rubinstein-Taybi syndrome)(CREBBP)IL7R85interleukin 7129059SEQ ID No:1290SEQ ID No:71SEQ ID No:72receptor (IL7R)ANXA786annexin A7160580SEQ ID No:1300SEQ ID No:214SEQ ID No:215(AN-XA7)TNXA87tenascin XA124340SEQ ID No:1310SEQ ID No:46SEQ ID No:47(TN-XA)CNBP188zinc finger pro-251963SEQ ID No:132SEQ ID No:3560SEQ ID No:357tein 9 (a cellularretroviral nucleicacid binding pro-tein) (ZNF9) (exCNBP1)CDK4-a89cyclin-dependent204586SEQ ID No:133SEQ ID No:134SEQ ID No:301SEQ ID No:302SEQ ID No:288kinase 4 (CDK4)CSNK2B90gene for casein153879SEQ ID No:1350SEQ ID No:171SEQ ID No:172kinase II subunitbeta (EC 2.7.1.37).EFNA191ephrin-A1 (EFNA1)162997SEQ ID No:1360SEQ ID No:226SEQ ID No:227SELE92selectin E (endo-186132SEQ ID No:137SEQ ID No:138SEQ ID No:259SEQ ID No:260SEQ ID No:261thelial adhesionmolecule 1) (SELE)APC93adenomatosis poly-125294SEQ ID NO:139SEQ ID No:140SEQ ID No:54SEQ ID No:55SEQ ID No:56posis coli (APC)FAK94PTK2 protein tyro-195731SEQ ID No:1410SEQ ID No:284SEQ ID No:285sine kinase 2(PTK2) (ex FAK)FOS-a95v-fos FBJ murine208717SEQ ID No:1420SEQ ID No:317SEQ ID No:318osteosarcomaviral oncogenehomolog (FOS)FGFR196fibroblast growth154472SEQ ID No:143SEQ ID No:144SEQ ID No:180SEQ ID No:181SEQ ID No:182factor receptor(FGFr)MC1R97melanocortin 1 re-155691SEQ ID No:1450SEQ ID No:187SEQ ID No:188ceptor (alphamelanocyte stim-ulating hormonereceptor) (MC1R)PCNA98proliferating cell232941SEQ ID No:146SEQ ID No:147SEQ ID No:339SEQ ID No:340SEQ ID No:341nuclear antigen(PCNA)DDT99D-dopachrome tau-132109SEQ ID No:148SEQ ID No:149SEQ ID No:88SEQ ID No:89SEQ ID No:90tomerase (DDT)GRB2100growth factor re-172152SEQ ID No:150SEQ ID No:151SEQ ID No:230SEQ ID No:231SEQ ID No:232ceptor-boundprotein 2 (GRB2)AMFR101autocrine motility146280SEQ ID No:152SEQ ID No:153SEQ ID No:111SEQ ID No:112SEQ ID No:113factor receptor(AMFR)ITGB2102integrin, beta 2187822SEQ ID No:1540SEQ ID No:267SEQ ID No:2682 (antigen CD18(p95), lymphocytefunction-ass-ociated antigen 1;macrophage antigen1 (mac-1) betasubunit) (ITGB2)JUND103jun D proto-175421SEQ ID No:155SEQ ID No:2330SEQ ID No:234oncogene (JUND)NF45104interleukin en-243907SEQ ID No:1560SEQ ID No:350SEQ ID No:351hancer bindingfactor 2 (ILF2) (exNF45)PPP4C105protein phosphatase114097SEQ ID No:157SEQ ID No:158SEQ ID No:32SEQ ID No:33SEQ ID No:344 (formerly X)(PPP4C)EMS1106ATX1 (antioxidant149172SEQ ID No:159SEQ ID No:123SEQ ID No:124SEQ ID No:125protein 1, yeast)homolog 1(ATOX1) (exEMS1)BCL2107B-cell CLL/lymph-147002SEQ ID No:160SEQ ID No:161SEQ ID No:115SEQ ID No:116SEQ ID No:117oma 2 (BCL2), nu-clear gene encodingmitochondrial pro-tein, transcript var-iant alphaMGST1108protein phosphatase182610SEQ ID No:162SEQ ID No:163SEQ ID No:2480SEQ ID No:2491, catalytic sub-unit, alpha iso-form (PPP1CA) (exMGST1)PDGFRB109platelet-derived158976SEQ ID No:1640SEQ ID No:208SEQ ID No:209growth factor re-ceptor, beta poly-peptide (PDGFRB)ANXA11110annexin A11158892SEQ ID No:1650SEQ ID No:206SEQ ID No:207(ANXA11)GPX1111histocompatability159809SEQ ID No:1660SEQ ID No:212SEQ ID No:213class II antigengamma chain(CD74) (ex GPX1Glutation S trans-férase)CFR-1112Golgi apparatus pro-153974SEQ ID No:167SEQ ID No:168SEQ ID No:173SEQ ID No:174SEQ ID No:175tein 1 (GLG1) (exCFR-1)BTF3L3113basic transcription195889SEQ ID No:169SEQ ID No:2890SEQ ID No:290factor 3 (BTF3)EST R55460114EST R55460154997SEQ ID No:1700SEQ ID No:1850AKT2115v-akt murine thy-182552SEQ ID No:171SEQ ID No:2530SEQ ID No:254moma viral onco-gene homolog 2(ATK2)CDKN1A116cyclin-dependent152524SEQ ID No:172SEQ ID No:173SEQ ID No:144SEQ ID No:145SEQ ID No:146kinase inhibitor(CDKN1A)PPP2CA117protein phosphatase54685SEQ ID No:174SEQ ID No:1750SEQ ID No:183SEQ ID No:1842 (formerly 2A),catalytic subunit,alpha isoform(PPP2CA)MDM2118mouse double min-148052SEQ ID No:1760SEQ ID No:120SEQ ID No:121ute 2, human homo-logy of; p53-bindingprotein (MDM2),transcript variantMDM2TNFRSF6119tumor necrosis151767SEQ ID No:177SEQ ID No:178SEQ ID No:141SEQ ID No:142SEQ ID No:143factor receptorsuperfamily, mem-ber 6 (TNFRSF6)CNTFR120ciliary neurotrophic156431SEQ ID No:1790SEQ ID No:192SEQ ID No:193factor receptor(CNTFR)JUNB121jun B proto-onco-153213SEQ ID No:180SEQ ID No:181SEQ ID No:153SEQ ID No:154SEQ ID No:155gene (JUNB)CCND1122cyclin D1 (PRAD1:110022SEQ ID No:182SEQ ID No:90SEQ ID No:10parathyroidadenomatosis 1)(CCND1)TDPX1123peroxiredoxin 2208439SEQ ID No:183SEQ ID No:184SEQ ID No:314SEQ ID No:315SEQ ID No:316(PRDX2) (exTDPX1)GRB7124growth factor130323SEQ ID No:185SEQ ID No:186SEQ ID No:79SEQ ID No:80SEQ ID No:81receptor-bound pro-tein 7 (GRB7)RBBP7125retinoblastoma-bind-210874SEQ ID No:187SEQ ID No:188SEQ ID No:319SEQ ID No:320SEQ ID No:321ing protein 7(RBBP7)TIMP1126tissue inhibitor of162246SEQ ID No:190SEQ ID No:223SEQ ID No:224SEQ ID No:225SEQ ID NO:189metalloproteinase 1(erythyroid po-tentiating act-ivity, collagen-ase inhibitor)(TIMP1)YES1127v-yes-1 Yamaguchi204634SEQ ID No:191SEQ ID No:3030SEQ ID No:304sarcoma viral onco-gene homolog 1(YES1)RNF5128ring finger protein112098SEQ ID No:1920SEQ ID No:25SEQ ID No:265 (RNF5)PRKCSH129protein kinase C187232SEQ ID No:1930SEQ ID No:263SEQ ID No:264substrate 80K-H(PRKCSH)CTSD130cathepsin D (lyso-149401SEQ ID No:194SEQ ID No:195SEQ ID No:126SEQ ID No:127SEQ ID No:128somal aspartyl pro-tease) (CTSD)NEO1131neogenin (chicken)188380SEQ ID No:1960SEQ ID No:269SEQ ID No:270homolog 1 (NEO1)GAPD-a132glyceraldehyde-3-152847SEQ ID No:197SEQ ID No:150SEQ ID No:151SEQ ID No:152phosphatase dehy-drogenase (GAPD)ACTG1133actin, gamma 1182291SEQ ID No:198SEQ ID No:199SEQ ID No:242SEQ ID No:243SEQ ID No:244(ACTG1)ITGA6134integrin, alpha 6182431SEQ ID No:200SEQ ID No:201SEQ ID No:245SEQ ID No:246SEQ ID No:247(ITGA6)GAPD-b135glyceraldehyde-3-153607SEQ ID No:202SEQ ID No:203SEQ ID No:166SEQ ID No:167SEQ ID No:152phosphate dehydro-genase (GAPD)ETV5-b136ets variant gene 5203394SEQ ID No:204SEQ ID No:205SEQ ID No:298SEQ ID No:299SEQ ID No:300(ets-related mole-cule) (ETV5)CDK4-b137cyclin-dependent195800SEQ ID No:206SEQ ID No:207SEQ ID No:286SEQ ID No:287SEQ ID No:288kinase 4 (CDK4)FOS-b138v-fos FBJ murine363796SEQ ID No:208SEQ ID No:209SEQ ID No:404SEQ ID No:405SEQ ID No:318osteosarcoma viraloncogene homo-log (FOS)HOXA5139homebox protein300564SEQ ID No:210SEQ ID No:211SEQ ID No:382SEQ ID No:383SEQ ID No:384(HOX-1.3) (ex HoxA5)RELA140NF-kappa-B trans-122056SEQ ID No:212SEQ ID No:420SEQ ID No:43cription factor p65DNA binding sub-unit (ex RELa)SUI1141S100 calcium-bind-155345SEQ ID No:213SEQ ID No:214SEQ ID No:18600ing protein A11(calgizzarin)(S100A11)ANG142angiogenin, ribonu-156720SEQ ID No:2150SEQ ID N:194SEQ ID No:195clease, RNaseA family, 5(ANG)ITGA6143integrin, alpha 6182431SEQ ID No:216SEQ ID No:217SEQ ID No:245SEQ ID No:246SEQ ID No:247(ITGA6)PRMT2144HMT1 (hnRNP158038SEQ ID No:218SEQ ID No:219SEQ ID No:201SEQ ID No:202SEQ ID No:203methyltransfer-ase, S. cerevis-iae)-like 1(HRMTIL1) (exPRMT2)EST R55460145EST R55460154997SEQ ID No:2200SEQ ID No:1850GZMA146granzyme A (gran-356763SEQ ID No:221SEQ ID No:222SEQ ID No:4020SEQ ID No:403zyme 1, cytotoxicT-lymphocyte-ass-ociated serine es-terase 3) (GZMA)SOX9147SRY (sex-deter-323948SEQ ID No:223SEQ ID No:3940SEQ ID No:395mining region Y)-box 9 (campomel-ic dysplasia, auto-somal sex-reversal)(SOX9)SRF148serum response321329SEQ ID No:224SEQ ID No:391SEQ ID No:392SEQ ID No:393factor (c-fos serumresponse element-binding transcriptionfactor) (SRF)EDNI149endothelial 1153424SEQ ID No:225#N/A#N/A#N/A(EDN1)PTPN6150protein tyrosine66778SEQ ID No:226#N/A#N/A#N/Aphosphatase, non-receptor type 6(PTPN6)TFAP4151transcription factor159093SEQ ID No:2270SEQ ID No:210SEQ ID No:211AP-4 (activatingenhancer bind-ing protein 4)(TFAP4)ELF1152Human cis-acting-182007SEQ ID No:228SEQ ID No:41700sequence Elf-1CD2153CD2 antigen (p50),120649SEQ ID No:229SEQ ID No:43100sheep red bloodcell receptor(CD2)CCND2154cyclin D2 (CCND2)175256SEQ ID No:230#N/A#N/A#N/AIL3RA155interleukin 3 recep-183087SEQ ID No:231SEQ ID No:440SEQ ID No:4410tor (hIL-3Ra)JUP156junction plakoglobin157958SEQ ID No:232#N/A#N/A#N/A(JUP)RBL2157retinoblastoma-like108571SEQ ID No:233SEQ ID No:430002 (p130) (RBL2)HOXA4158homeo box A4110731SEQ ID No:234SEQ ID No:20SEQ ID No:210(HOXA4)ACY1159aminoacylase160764SEQ ID No:235SEQ ID No:435SEQ ID No:4360(ACY1)GADD45A160growth arrest and115176SEQ ID No:236#N/A#N/A#N/aDNA-damage-in-ducible, alpha(GADD45A)nm23161non-metastatic174388SEQ ID No:237#N/A#N/A#N/Acells 1, protein(NM23A) express-ed (NME1)BBC1162ribosomal protein178317SEQ ID No:238#N/A#N/A#N/AL13 (RPL13) (exBBC1)VEGFB163vascular endothe-162499SEQ ID No:239#N/A#N/A#N/Alial growth factor B(VEGFB)LAMR1164laminin receptor 1199837SEQ ID No:240#N/A#N/A#N/A(67kD, ribosomalprotein SA)(LAMR1)IL2RB165interleukin 2 re-139073SEQ ID No:241SEQ ID No:242SEQ ID No:97SEQ ID No:98SEQ ID No:99ceptor, beta(IL2RB)DES166desmin153854SEQ ID No:243SEQ ID No:168SEQ ID No:169SEQ ID No:170PRL167prolactin133738SEQ ID No:244SEQ ID No:91SEQ ID No:92SEQ ID No:93CSH1168Chorionic soma-133891SEQ ID No:245SEQ ID No:43200tomammotropin hor-mone 1 (placentallactogen) = LAC-TOGEN PrecursorTEK169tyrosine proteine151501SEQ ID No:246SEQ ID No:247SEQ ID No:138SEQ ID No:139SEQ ID No:140kinase receptorNrg1170neuregulin 1 (EST155716SEQ ID No:248SEQ ID No:249SEQ ID No:189SEQ ID No:190SEQ ID No:191R72075)PLATrienpas dEST ni160149SEQ ID No:433SEQ ID No:4340mRNAESTrienimage ?AW184517

[0110] Tables 5 hereunder displays subpopulations of polynucleotide sequences interesting to distinguish a person without cancer from a cancer patient.

6TABLE 5GenesymbolNoNameSeq3′Seq5′RefHRB1hiv-1 rev binding proteinSEQ IDSEQ IDNo:1No:2EST T819194ests, weakly similar to alu7_human alu subfamilySEQ IDSEQ IDsq sequence contamination warning entry [h. sapines]No:7No:8ENPP218ectonucleotide pyrophosphatase/phosphodiesterase 2SEQ IDSEQ IDSEQ ID(autotaxin)No:39No:40No:41TNXB21tenascin xbSEQ IDSEQ IDNo:46No:47APC24adenomatosis polyposis coliSEQ IDSEQ IDSEQ IDNo:54No:55NO:56GATA332gata-binding protein 3SEQ IDSEQ IDSEQ IDNo:76No:77No:78PRL38prolactinSEQ IDSEQ IDSEQ IDNo:91No:92No:93BCL248b-cell cll/lymphoma 2SEQ IDSEQ IDSEQ IDNo:115No:116No:117CTSD53cathepsin d (lysosomal aspartyl protease)SEQ IDSEQ IDSEQ IDNo:126No:127No:128TEK58tek tyrosine kinase, endothelial (venousSEQ IDSEQ IDSEQ IDmalformations, multiple cutaneous and mucosal)No:138No:139No:140TNFRSF659tumor necrosis factor receptor superfamily, memberSEQ IDSEQ IDSEQ ID6No:141No:142No:143PLA2G2A61phospholipase a2, group iia (platelets, synovialSEQ IDSEQ IDSEQ IDfluidNo:147No:148No:149CRABP264cellular retinoic acid-binding protein 2SEQ IDSEQ IDSEQ IDNo:156No:157No:158RIL66lim domain proteinSEQ IDSEQ IDSEQ IDNo:162No:163DES69desminSEQ IDSEQ IDSEQ IDNo:168No:169No:170GZMB73granzyme b (granzyme 2, cytotoxic t-lymphocyte-SEQ IDSEQ IDassociated serine esterase 1)No:178No:179ETV485ets variant gene 4 (e1a enhancer-binding protein,SEQ IDSEQ IDe1af)No:204No:205WBSCR1488williams-beuren syndrome chromosome region 14SEQ IDSEQ IDNo:210No:211THBS191thrombospondin 1SEQ IDSEQ IDNo:216No:217GRB297growth factor receptor-bound protein 2SEQ IDSEQ IDSEQ IDNo:230No:231No:232RAD9104rad9 (s. pombe) homologSEQ IDSEQ IDNo:248No:249ATF3105activating transcription factor 3SEQ IDSEQ IDSEQ IDNo:250No:251No:252DTR112diphtheria receptor (heparin-binding epidermalSEQ IDSEQ IDgrowth factor-like growth factor)No:265No:266ITGB2113integrin, beta 2 (antigen cd18 (p95), lymphocyteSEQ IDSEQ IDfunction-associated antigen 1, macrophage entigen 1No:267No:268(mac-1) beta subunit)POU2F2115pou domain, class 2, transcription factor 2SEQ IDSEQ IDNo:271No:272MYBL2131v-myb avian myeoblastosis viral oncogeneSEQ IDSEQ IDSEQ IDhomolog-like 2No:308No:309No:310TGFBR3132transforming growth factor, beta receptor iiiSEQ IDSEQ IDSEQ ID(betaglycan, 300kd)No:311No:312No:313FOS134v-fos fbj murine osteosarcoma viral oncogeneSEQ IDSEQ IDhomologNo:317No:318ABCC5137atp-binding cassette, sub-family c (cftr/mrp),SEQ IDSEQ IDmember 5No:324No:325MMP11145matrix metalloproteinase 11 (stromelysin 3)SEQ IDSEQ IDNo:345No:346ILF2147interleukin enhancer binding factor 2, 45kdSEQ IDSEQ IDNo:350No:351ETV5155ets variant gene 5 (ets-related molecule)SEQ IDSEQ IDSEQ IDNo:368No:369No:300RELB175v-rel avian reticuloendotheliosis viral oncogeneSEQ IDSEQ IDSEQ IDhomolog b (nuclear factor of kappa light polypeptideNo:417No:418No:419gene enhancer in b-cells 3)EST T80406180similar to SP:S36648 S36648 RB2/P130 PROTEINSEQ IDNo:430EST Y95640181similar to gb:M16336 T-CELL SURFACESEQ IDANTIGEN CD2No:431EST R28523182similar to placental lactogen (CSH1)SEQ IDNo:432EST H28056185Homo sapines E74-like factor 1 (ets domainSEQ IDtranscription factor) (ELF1)No:437ESTs H42957 &187Human interleukin 3 receptor (hIL-3Ra)SEQ IDSEQ IDH42888No:440No:441

[0111] Tables 5A and 5B hereunder displays two subpopulations corresponding to the 5 top overexpressed and to the 5 top underexpressed polynucleotide sequences particularly interesting to distinguish a person without cancer from a cancer patient.

7TABLE 5Aoverexpressed genes:top 5GenesymbolNoNameSeq3′Seq5′RefGATA332gata-binding protein 3SEQ IDSEQ IDSEQ IDNo:76No:77No:78GZMB73granzyme b (granzyme 2, cytotoxic t-SEQ IDSEQ IDlymphocyte-associated serine esterase 1)No:178No:179MYBL2131v-myb avian myeloblastosis viral oncogeneSEQ IDSEQ IDSEQ IDhomolog-like 2No:308No:309No:310MMP11145matrix metallopropteinase 11 (stromelysin 3)SEQ IDSEQ IDNo:345No:346EST181similar to gb:M16336 T-CELL SURFACESEQ IDT95640ANTIGEN CD2No:431

[0112]

8

TABLE 5B

underexpressed genes:top 5

Gene

symbol
No
Name
Seq3′
Seq5′
Ref

PRL
38
prolactin
SEQ ID
SEQ ID
SEQ ID

No:91
No:92
No:93

TEK
58
tek tyrosine kinase, endothelial (venous
SEQ ID
SEQ ID
SEQ ID

malformations, multiple cutaneous and mucosal)
No:138
No:139
No:140

PLA2GA
612
phospholipase a2, group iia (platelets, synovial fluid)
SEQ ID
SEQ ID
SEQ ID

No:147
No:148
No:149

DES
69
desmin
SEQ ID
SEQ ID
SEQ ID

No:168
No:169
No:170

EST R28523
182
similar to placental lactogen (CSH1)
SEQ ID

No:432

[0113] Table 6 hereunder relates to subpopulations of polynucleotide sequences interesting to detect hormone-sensitive tumors allowing distinction between ER+ and ER-samples.

9TABLE 6GenesymbolNoNameSeq3′Seq5′RefSOX411sry (sex determining region y)-box 4SEQ IDSEQ IDSEQ IDNo:22No:23No:24IGF226insulin-like growth factor 2 (somatomedian a)SEQ IDSEQ IDSEQ IDNo:59No:60No:61GATA332gata-binding protein 3SEQ IDSEQ IDSEQ IDNo:76No:77No:78TOP2B34topoisomerase (dna) ii beta (180kd)SEQ IDSEQ IDNo:82No:83IL2RB40interleukin 2 receptor, betaSEQ IDSEQ IDSEQ IDNo:97No:98No:99EGFR57epidermal growth factor receptor (avianSEQ IDSEQ IDSEQ IDerythroblastic leukemia viral (v-erb-b) oncogeneNo:135No:136No:137homolog)CRABP264cellular retinoic acid-binding protein 2SEQ IDSEQ IDSEQ IDNo:156No:157No:158S100B107s100 calcium-binding protein, beta (neural)SEQ IDSEQ IDNo:255No:256IL2RG119interleukin 2 receptor, gamma (severe combinedSEQ IDSEQ IDSEQ IDimmunodeficiency)No:279No:280No:281KIAA1075136kiaa 1075 proteinSEQ IDSEQ IDNo:322No:323MST1140macrophage stimulating 1 (hepatocyte growth factor-SEQ IDSEQ IDSEQ IDlike)No:331No:332No:333GSTP1141glutathione s-transferase piSEQ IDSEQ IDSEQ IDNo:334No:335No:336MMP11145matrix metalloproteinase 11 (stromelysin 3)SEQ IDSEQ IDNo:345No:346FLJ11307148hypothetical protein flj11307SEQ IDSEQ IDNo:352No:353MYB149v-myb avian myeloblastosis viral oncogene homologSEQ IDSEQ IDNo:354No:355XBP1162x-box binding protein 1SEQ IDSEQ IDSEQ IDNo:385No:386No:387SOX9165sry (sex dtermining region y)-boc 9 (campomelicSEQ IDSEQ IDdysplasia, autosomal sex-reversal)No:394No:395GZMA169granzyme a (granzyme 1, cytotoxic t-lymphocyte-SEQ IDSEQ IDassociated serine esterase 3)No:402No:403CD3G174cd3g antigen, gamma polypeptide (tit3 complex)SEQ IDSEQ IDSEQ IDNo:414No:415No:416EST188Human tumor protein p53 (Li-Fraumeni syndrome)SEQ IDH57912(TP53)No:442

[0114] Tables 6A and 6B hereunder relate to two subpopulations of polynucleotide sequences particularly interesting to detect hormone-sensitive tumors allowing distinction between ER+ and ER− samples

10TABLE 6Aoverexpressed genes:top 5ER +/ER −GeneCLsymbolNoNameSeq3′Seq5′RefGATA332gata-binding protein 3SEQ IDSEQ IDSEQ IDNo:76No:77No:78KIAA1075136kiaa 1075 proteinSEQ IDSEQ IDNo:322No:323MMP11145matrix metalloproteinase 11SEQ IDSEQ ID(stromelysin 3)No:345No:346MYB149v-myb avian myeloblastosis viralSEQ IDSEQ IDoncogene homologNo:354No:355GZMA169granzyme a (granzyme 1, yutotoxic t-SEQ IDSEQ IDlymphocyte-associated serine esterase 3)No:402No:403

[0115]

11

TABLE 6B

underexpressed genes:top 5

Gene

symbol
No
Name
Seq3′
Seq5′
Ref

SOX4
11
sry (sex determining region y)-box 4
SEQ ID
SEQ ID
SEQ ID

No:22
No:23
No:24

IL2RB
40
interleukin 2 receptor, beta
SEQ ID
SEQ ID
SEQ ID

No:97
No:98
No:99

EGFR
57
epidermal growth factor receptor (avian
SEQ ID
SEQ ID
SEQ ID

eryhtroblastic leukemia viral (v-erb-b)
No:135
No:136
No:137

oncogene homolog)

IL2RG
119
interleukin 2 receptor, gamma (severe
SEQ ID
SEQ ID
SEQ ID

combined immunodeficiency)
No:279
No:280
No:281

CD3G
174
cd3g antigen, gamma polypeptide (tit3
SEQ ID
SEQ ID
SEQ ID

complex)
No:414
No:415
No:416

[0116] Tables 7 hereunder relates to subpopulations of polynucleotide sequences interesting to distinguish tumors in which a lymph node has been invaded by a tumor cell from tumors in which a lymph node has not been so invaded.

12TABLE 7GeneCLsymbolNoNameSeq3′Seq5′RefEST T899808estsSEQ IDNo:16SOX411sry (sex determining region y)-box 4SEQ IDSEQ IDSEQ IDNo:22No:23No:24ENPP218ectonucleotideSEQ IDSEQ IDSEQ IDpyrophosphatase/phosphodiesterase 2No:39No:40No:41(autotoxin)MUC125mucin 1, transmembraneSEQ IDSEQ IDNo:57No:58GATA332gata-binding protein 3SEQ IDSEQ IDSEQ IDNo:76No:77No:78TOP2B34topoisomerase (dna) it beta (180kd)SEQ IDSEQ IDNo:82No:83IL2RB40interleukin 2 receptor, betaSEQ IDSEQ IDSEQ IDNo:97No:98No:99ERBB249v-erb-b2 avian erythroblasticSEQ IDSEQ IDleukemia viral oncogene homolog 2No:118No:119(neuro/glioblastoma derived oncogenehomolog)EGFR57epidermal growth factor receptor (avianSEQ IDSEQ IDSEQ IDerythroblastic leukemia viral (v-erb-b)No:135No:136No:137oncogene homolog)THBS191thrombospondin 1SEQ IDSEQ IDNo:216No:217PPP2R2C100protein phosphatase 2 (formerly 2a),SEQ IDSEQ IDregulatory subunit b (pr 52), gammaNo:238No:239isoformATF3105activating transcription factor 3SEQ IDSEQ IDSEQ IDNo:250No:251No:252KIAA1075136kiaa 1075 proteinSEQ IDSEQ IDNo:322No:323CDH1138cadherin 1, type 1, e-cadherin (epithelial)SEQ IDSEQ IDSEQ IDNo:326No:327No:328ZNF144139zinc finger protein 144 (mel-18)SEQ IDSEQ IDNo:329No:330GSTP1141glutathione s-transferase piSEQ IDSEQ IDSEQ IDNo:334No:335No:336CD44158cd44 antigen (homing function and indianSEQ IDSEQ IDSEQ IDblood group system)No:374No:375No:376GZMA169granzyme a (granzyme 1, cytotoxic t-lym-SEQ IDSEQ IDphocyte-associated serine esterase 3)No:402No:403EST T80406180similar to SP;S36648 S36648 RB2/P130SEQ IDPROTEINNo:430ESTs H30141 &186Homo sapiens selectin PSEQ IDSEQ IDH27466No:438No:439

[0117] Tables 7A and 7B hereunder relate to two subpopulations of polynucleotide sequences particularly interesting to distinguish tumors in which a lymph node has been invaded by a tumor cell from tumors in which a lymph node has not been so invaded.

13TABLE 7AOverexpressed genes:top 5GenesymbolNoNameSeq3′Seq5′RefENPP218ectonucleotideSEQ IDSEQ IDSEQ IDpyrophosphatase/phosphodiesterase 2No:39No:40No:41(autotaxin)GATA332gata-binding protein 3SEQ IDSEQ IDSEQ IDNo:76No:77No:78EGFR57epidermal growth factor receptor (avianSEQ IDSEQ IDSEQ IDerythroblastic leukemia viral (v-erb-b)No:135No:136No:137oncogene homolog)THBS191thrombospondin 1SEQ IDSEQ IDNo:216No:217ATF3105activating transcription factor 3SEQ IDSEQ IDSEQ IDNo:250No:251No:252

[0118]

14

TABLE 7B

Underexpressed genes: top 5

Gene

symbol
No
Name
Seq 3′
Seq 5′
Ref

SOX4
11
sry (sex determining region y)-box 4
SEQ ID
SEQ ID
SEQ ID

No: 22
No: 23
No: 24

IL2RB
40
interleukin 2 receptor, beta
SEQ ID
SEQ ID
SEQ ID

No: 97
No: 98
No: 99

ERBB2
49
v-erb-b2 avian erythroblastic leukemia

SEQ ID
SEQ ID

viral oncogene homolog 2

No: 118
No: 119

(neuro/glioblastoma derived oncogene

homolog)

PPP2R2C
100
protein phosphatase 2 (formerly 2a),
SEQ ID
SEQ ID

regulatory subunit b (pr 52), gamma
No: 238
No: 239

isoform

GSTP1
141
glutathione s-transferase pi
SEQ ID
SEQ ID
SEQ ID

No: 334
No: 335
No:336

[0119] Table 8 hereunder relates to subpopulations of polynucleotide sequences particularly interesting to distinguish tumors sensitive to anthracycline from tumors insensitive to anthracycline.

15TABLE 8A1/A2GenesymbolNoNameSeq 3′Seq 5′RefSOX411sry (sex determining region y)-boxSEQ IDSEQ IDSEQ IDNo: 22No: 23No: 24CSF122colony stimulating factor 1 (macrophage)SEQ IDSEQ IDSEQ IDNo: 48No: 49No: 50VIL223villin 2 (ezrin)SEQ IDSEQ IDSEQ IDNo: 51No: 52No: 53IGF226insulin-like growth factor 2 (somatomedin a)SEQ IDSEQ IDSEQ IDNo: 59No: 60No: 61KIAA042728kiaa0427 gene productSEQ IDSEQ IDSEQ IDNo: 65No: 66No: 67MYC31v-myc avian myelocytomatosis viral oncogeneSEQ IDSEQ IDSEQ IDhomologNo: 73No: 74No: 75GATA332gata-binding protein 3SEQ IDSEQ IDSEQ IDNo: 76No: 77No: 78TOP2B34topoisomerase (dna) ii beta (180 kd)SEQ IDSEQ IDNo: 82No: 83ERBB249v-erb-b2 avian erythroblastic leukemia viralSEQ IDSEQ IDoncogene homolog 2 (neuro/glioblastomaNo: 118No: 119derived oncogene homolog)EGFR57epidermal growth factor receptor (avianSEQ IDSEQ IDSEQ IDerythroblastic leukemia viral (v-erb-b)No: 135No: 136No: 137oncogene homolog)CRABP264cellular retinoic acid-binding protein 2SEQ IDSEQ IDSEQ IDNo: 156No: 157No: 158GZMB73granzyme b (granzyme 2, cytotoxic t-SEQ IDSEQ IDlymphocyte-associated serine esterase 1)No: 178No: 179IGKC77immunoglobulin kappa constantSEQ IDNo: 186ANG81angiogenin, ribonuclease, rnase a family, 5SEQ IDSEQ IDNo: 194No: 195EFNA195ephrin-alSEQ IDSEQ IDNo: 226No: 227MYBL2131v-myb avian myeloblastosis viral oncogeneSEQ IDSEQ IDSEQ IDhomolog-like 2No: 308No: 309No: 310CDH1138cadherin 1, type 1, e-cadherin (epithelial)SEQ IDSEQ IDSEQ IDNo: 326No: 327No: 328MST1140macrophage stimulating 1 (hepatocyte growthSEQ IDSEQ IDSEQ IDfactor-like)No: 331No: 332No: 333MYB149v-myb avian myeloblastosis viral oncogeneSEQ IDSEQ IDhomologNo: 354No: 355XBP1162x-box binding protein 1SEQ IDSEQ IDSEQ IDNo: 385No: 386No: 387SRF164serum response factor (c-fos serum responseSEQ IDSEQ IDSEQ IDelement-binding transcription factor)No: 391No: 392No: 393SOX9165sry (sex determining region y)-box 9SEQ IDSEQ ID(campomelic dysplasia, autosomal sex-reversal)No: 394No: 395ESTs H21879183Homo sapiens plasminogen activator (PLAT)SEQ IDSEQ ID& H21880No: 433No: 434

[0120] Tables 8A and 8B hereunder relate to two subpopulations of polynucleotide sequences particularly interesting to distinguish tumors sensitive to anthracycline from tumors insensitive to anthracycline.

16TABLE 8AOverexpressed genes: top 5GenesymbolNoNameSeq 3′Seq 5′RefGATA332gata-binding protein 3SEQ IDSEQ IDSEQ IDNo: 76No: 77No: 78KIAA1075136kiaa1075 proteinSEQ IDSEQ IDNo: 322No: 323MMP11145matrix metalloproteinase 11SEQ IDSEQ ID(stromelysin 3)No: 345No: 346MYB149v-myb avian myeloblastosis viralSEQ IDSEQ IDoncogene homologNo: 354No: 355GZMA169Granzyme a (granzyme 1, cytotoxic t-SEQ IDSEQ IDlymphocyte-associated serine esterase 3)No: 402No: 403

[0121]

17

TABLE 8B

underexpressed genes: top 5

Gene

symbol
No
Name
Seq 3′
Seq 5′
Ref

SOX4
11
sry (sex determining region y)-box 4
SEQ ID
SEQ ID
SEQ ID

No: 22
No: 23
No: 24

IL2RB
40
interleukin 2 receptor, beta
SEQ ID
SEQ ID
SEQ ID

No: 97
No: 98
No: 99

EGFR
57
epidermal growth factor receptor (avian
SEQ ID
SEQ ID
SEQ ID

erythroblastic leukemia viral (v-erb-b)
No: 135
No: 136
No: 137

oncogene homolog)

IL2RG
119
interleukin 2 receptor, gamma (severe
SEQ ID
SEQ ID
SEQ ID

combined immunodeficiency)
No: 279
No: 280
No: 281

CD3G
174
cd3g antigen, gamma polypeptide (tit3
SEQ ID
SEQ ID
SEQ ID

complex)
No: 414
No: 415
No:416

[0122] Tables 9, 9A and 9B hereunder relate to subpopulations of polynucleotide sequences particularly interesting in classifying good and poor prognosis primary breast tumors.

18TABLE 9GeneSETsymbolNoNameSeq 3′Seq 5′RefCTSB14cathepsin bSEQ IDSEQ IDNo: 30No: 31VIL223villin 2 (ezrin)SEQ IDSEQ IDSEQ IDNo: 51No: 52No: 53MUC125mucin 1, transmembraneSEQ IDSEQ IDNo: 57No: 58EMR127egf-like module containing, mucin-like,SEQ IDSEQ IDSEQ IDhormone receptor-like sequence 1No: 62No: 63No: 64KIAA042728kiaa0427 gene productSEQ IDSEQ IDSEQ IDNo: 65No: 66No: 67GATA332gata-binding protein 3SEQ IDSEQ IDSEQ IDNo: 76No: 77No: 78PRLR39prolactin receptorSEQ IDSEQ IDSEQ IDNo: 94No: 95No: 96GATA341gata-binding protein 3SEQ IDSEQ IDSEQ IDNo: 100No: 101No: 78TC2144oncogene tc21SEQ IDSEQ IDSEQ IDNo: 106No: 107No: 108BCL248b-cell cll/lymphoma 2SEQ IDSEQ IDSEQ IDNo: 115No: 116No: 117GATA351gata-binding protein 3SEQ IDSEQ IDNo: 122No: 78CRABP264cellular retinoic acid-binding protein 2SEQ IDSEQ IDSEQ IDNo: 156No: 157No: 158ANG81angiogenin, ribonuclease, mase aSEQ IDSEQ IDfamily, 5No: 194No: 195EGF83epidermal growth factor (beta-SEQ IDSEQ IDurogastrone)No: 199No: 200THBS191thrombospondin 1SEQ IDSEQ IDNo: 216No: 217EDNRA96endothelin receptor type aSEQ IDSEQ IDNo: 228No: 229SMARCA299swi/snf related, matrix associated, actinSEQ IDSEQ IDSEQ IDdependent regulator of chromatin,No: 235No: 236No: 237subfamily a, member 2ABCB1108atp-binding cassette, sub-family bSEQ IDSEQ ID(mdr/tap), member 1No: 257No: 258EGF110epidermal growth factor (beta-SEQ IDSEQ IDurogastrone)No: 262No: 200BIRC4116baculoviral iap repeat-containing 4SEQ IDSEQ IDNo: 273No: 274DAP3117death associated protein 3SEQ IDSEQ IDNo: 275No: 276GNRH1118gonadotropin-releasing hormone 1SEQ IDSEQ ID(leutinizing-releasing hormone)No: 277No: 278DAP3120death associated protein 3SEQ IDSEQ IDSEQ IDNo: 282No: 283No: 276EST R97218126ests, highly similar to tvhumeSEQ IDSEQ IDhepatocyte growth factor receptorNo: 296No: 297precursor [h. sapiens]BCL2142b-cell cll/lymphoma 2SEQ IDSEQ IDSEQ IDNo: 337No: 338No: 117BS69144adenovirus 5 ela binding proteinSEQ IDSEQ IDSEQ IDNo: 342No: 343No: 344MYB149v-myb avian myeloblastosis viralSEQ IDSEQ IDoncogene homologNo: 354No: 355CTSB152cathepsin bSEQ IDSEQ IDNo: 361No: 31MLANA153melan-aSEQ IDSEQ IDSEQ IDNo: 362No: 363No: 364APR-1154apr-1 proteinSEQ IDSEQ IDSEQ IDNo: 365No: 366No: 367TC21157oncogene tc21SEQ IDSEQ IDSEQ IDNo: 372No: 373No: 108CDKN3159cyclin-dependent kinase inhibitor 3SEQ IDSEQ IDSEQ ID(cdk2-associated dual specificityNo: 377No: 378No: 379phosphatase)XBP1162x-box binding protein 1SEQ IDSEQ IDSEQ IDNo: 385No: 386No: 387CDH15166cadherin 15, m-cadherin (myotubule)SEQ IDSEQ IDSEQ IDNo: 396No: 397No: 398BCL2167b-cell cll/lymphoma 2SEQ IDSEQ IDSEQ IDNo: 399No: 400No: 117EST W73386168estsSEQ IDNo: 401ILF1171interleukin enhancer binding factor 1SEQ IDSEQ IDSEQ IDNo: 406No: 407No: 408ARHGDIA172rho gdp dissociation inhibitor (gdi)SEQ IDSEQ IDSEQ IDalphaNo: 409No: 410No: 411C4A173complement component 4aSEQ IDSEQ IDNo: 412No: 413ESR1176estrogen receptor 1SEQ IDSEQ IDSEQ IDNo: 420No: 421No: 422PBX1177pre-b-cell leukemia transcription factorSEQ IDSEQ IDSEQ ID1No: 423No: 424No: 425GLI3178gli-kruppel family member gli3 (greigSEQ IDSEQ IDSEQ IDcephalopolysyndactyly syndrome)No: 426No: 427No: 428ILF1179interleukin enhancer binding factor 1SEQ IDSEQ IDNo: 429No: 408ESTs184Homo sapiens aminoacylase 1 (ACY1).SEQ IDSEQ IDH24628 &No: 435No: 436H24592EST H28056185Homo sapiens E74-like factor 1 (etsSEQ IDdomain transcription factor) (ELF1)No: 437

[0123]

19

TABLE 9A

Gene
SET

symbol
No
Name
Seq 3′
Seq 5′
Ref

VIL2
23
villin 2 (ezrin)
SEQ ID
SEQ ID
SEQ ID

No: 51
No: 52
No: 53

MUC1
25
mucin 1, transmembrane

SEQ ID
SEQ ID

No: 57
No: 58

GATA3
32
gata-binding protein 3
SEQ ID
SEQ ID
SEQ ID

No: 76
No: 77
No: 78

GATA3
41
gata-binding protein 3
SEQ ID
SEQ ID
SEQ ID

No: 100
No: 101
No: 78

BCL2
48
b-cell cll/lymphoma 2
SEQ ID
SEQ ID
SEQ ID

No: 115
No: 116
No: 117

GATA3
51
gata-binding protein 3
SEQ ID

SEQ ID

No: 122

No: 78

CRABP2
64
cellular retinoic acid-binding protein 2
SEQ ID
SEQ ID
SEQ ID

No: 156
No: 157
No: 158

ANG
81
angiogenin, ribonuclease, rnase a family, 5

SEQ ID
SEQ ID

No: 194
No: 195

EGF
83
epidermal growth factor (beta-urogastrone)
SEQ ID

SEQ ID

No: 199

No: 200

THBS1
91
thrombospondin 1
SEQ ID

SEQ ID

No: 216

No: 217

SMARCA2
99
swi/snf related, matrix associated, actin
SEQ ID
SEQ ID
SEQ ID

dependent regulator of chromatin, subfamily
No. 235
No.236
No: 237

a, member 2

EGF
110
epidermal growth factor (beta-urogastrone)
SEQ ID

SEQ ID

No: 262

No: 200

BIRC4
116
baculoviral iap repeat-containing 4
SEQ ID

SEQ ID

No: 273

No: 274

BCL2
142
b-cell cll/lymphoma 2
SEQ ID
SEQ ID
SEQ ID

No: 337
No: 338
No: 117

BS69
144
adenovirus 5 ela binding protein
SEQ ID
SEQ ID
SEQ ID

No: 342
No: 343
No: 344

MYB
149
v-myb avian myeloblastosis viral oncogene

SEQ ID
SEQ ID

homolog

No: 354
No: 355

XBP1
162
x-box binding protein 1
SEQ ID
SEQ ID
SEQ ID

No: 385
No: 386
No: 387

BCL2
167
b-cell cll/lymphoma 2
SEQ ID
SEQ ID
SEQ ID

No: 399
No: 400
No: 117

ILF1
171
interleukin enhancer binding factor 1
SEQ ID
SEQ ID
SEQ ID

No: 406
No: 407
No: 408

ARHGDIA
172
rho gdp dissociation inhibitor (gdi) alpha
SEQ ID
SEQ ID
SEQ ID

No: 409
No: 410
No: 411

C4A
173
complement component 4a
SEQ ID

SEQ ID

No: 412

No: 413

ESR1
176
estrogen receptor 1
SEQ ID
SEQ ID
SEQ ID

No: 420
No: 421
No: 422

PBX1
177
pre-b-cell leukemia transcription factor 1
SEQ ID
SEQ ID
SEQ ID

No: 423
No: 424
No: 425

GLI3
178
gli-kruppel family member gli3 (greig
SEQ ID
SEQ ID
SEQ ID

cephalopolysyndactyly syndrome)
No: 426
No: 427
No: 428

ILF1
179
interleukin enhancer binding factor 1
SEQ ID

SEQ ID

No: 429

No: 408

ESTs
184

Homo sapiens
aminoacylase 1 (ACY1).
SEQ ID
SEQ ID

H24628 &

No: 435
No: 436

H24592

EST
185

Homo sapiens
E74-like factor 1 (ets domain
SEQ ID

H28056

transcription factor) (ELF1)
No: 437

[0124]

20

TABLE 9B

Gene

symbol
SET No
Name
Seq 3′
Seq 5′
Ref

CTSB
14
cathepsin b

SEQ ID
SEQ ID

No: 30
No: 31

EMR1
27
egf-like module containing, mucin-like,
SEQ ID
SEQ ID
SEQ ID

hormone receptor-like sequence 1
No: 62
No: 63
No: 64

KIAA0427
28
kiaa0427 gene product
SEQ ID
SEQ ID
SEQ ID

No: 65
No: 66
No: 67

PRLR
39
prolactin receptor
SEQ ID
SEQ ID
SEQ ID

No: 94
No: 95
No: 96

TC21
44
oncogene tc21
SEQ ID
SEQ ID
SEQ ID

No: 106
No: 107
No: 108

EDNRA
96
endothelin receptor type a
SEQ ID

SEQ ID

No: 228

No: 229

ABCB1
108
atp-binding cassette, sub-family b
SEQ ID

SEQ ID

(mdr/tap), member 1
No: 257

No: 258

DAP3
117
death associated protein 3
SEQ ID

SEQ ID

No: 275

No: 276

GNRH1
118
gonadotropin-releasing hormone 1

SEQ ID
SEQ ID

(leutinizing-releasing hormone)

No: 277
No: 278

DAP3
120
death associated protein 3
SEQ ID
SEQ ID
SEQ ID

No: 282
No: 283
No: 276

EST R97218
126
ests, highly similar to tvhume hepatocyte
SEQ ID
SEQ ID

growth factor receptor precursor
No: 296
No: 297

[h. sapiens]

CTSB
152
cathepsin b
SEQ ID

SEQ ID

No: 361

No: 31

MLANA
153
melan-a
SEQ ID
SEQ ID
SEQ ID

No: 362
No: 363
No: 364

APR-1
154
apr-1 protein
SEQ ID
SEQ ID
SEQ ID

No: 365
No: 366
No: 367

TC21
157
oncogene tc21
SEQ ID
SEQ ID
SEQ ID

No: 372
No: 373
No: 108

CDKN3
159
cyclin-dependent kinase inhibitor 3 (cdk2-
SEQ ID
SEQ ID
SEQ ID

associated dual specificity phosphatase)
No: 377
No: 378
No: 379

CDH15
166
cadherin 15, m-cadherin (myotubule)
SEQ ID
SEQ ID
SEQ ID

No: 396
No: 397
No: 398

EST
168
ests
SEQ ID

W73386

No: 401

[0125] Overexpression of genes detected by using at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets indicated in table 9A combined with underexpression of genes detected with at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets indicated in table 9B present a Good outcome.

[0126] So, a preferred DNA array according to the invention comprises at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequences indicated in table 9A and at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequences indicated in table 9B.

[0127] Such DNA arrays are particularly useful to distinguish patients having a high risk (Bad Outcome) from those having a good prognosis (Good Outcome). References

[0128] 1. DeRisi, J., Penland, L., Brown, P. O., Bittner, M. L., Meltzer, P. S., Ray, M., Chen, Y., Su, Y. A., and Trent, J. M. (1996) Use of a cDNA microarray to analyze gene expression patterns in human cancer. Nat Genet, 14, 457-460.

[0129] 2. Jordan, B. R. (1998) Large-scale expression measurement by hybridization methods: from high- density membranes to “DNA chips”. J. Biochem (Tokyo), 124, 251-258.

[0130] 3. Nguyen, C., Rocha, D., Granjeaud, S., Baldit, M., Bernard, K., Naquet, P., and Jordan, B. R. (1995) Differential gene expression in the murine thymus assayed by quantitative hybridization of arrayed cDNA clones. Genomics, 29, 207-216.

[0131] 4. Bertucci, F., Van Hulst, S., Bernard, K., Loriod, B., Granjeaud, S., Tagett, R., Starkey, M., Nguyen, C., Jordan, B., and Birnbaum, D. (1999) Expression scanning of an array of growth control genes in human tumor cell lines. Oncogene, 18, 3905-3912.

[0132] 5. Bertucci, F., Bernard, K., Loriod, B., Chang, Y. C., Granjeaud, S., Birnbaum, D., Nguyen, C., Peck, K., and Jordan, B. R. (1999) Sensitivity issues in DNA array-based expression measurements and performance of nylon microarrays for small samples [In Process Citation]. Hum Mol Genet, 8, 1715-1722.

[0133] 6. Ross, J. S. and Fletcher, J. A. (1999) The HER-2/neu oncogene: prognostic factor, predictive factor and target for therapy. Semin Cancer Biol, 9, 125-138.

[0134] 7. Scorilas, A., Trangas, T., Yotis, J., Pateras, C., and Talieri, M. (1999) Determination of c-myc amplification and overexpression in breast cancer patients: evaluation of its prognostic value against c-erbB-2, cathepsin-D and clinicopathological characteristics using univariate and multivariate analysis. Br J Cancer, 81, 1385-1391.

[0135] 8. Fox, S. B., Smith, K., Hollyer, J., Greenall, M., Hastrich, D., and Harris, A. L. (1994) The epidermal growth factor receptor as a prognostic marker: results of 370 patients and review of 3009 patients. Breast Cancer Res Treat, 29, 41-49.

[0136] 9. Heimann, R., Lan, F., McBride, R., and Hellman, S. (2000) Separating favorable from unfavorable prognostic markers in breast cancer: the role of E-cadherin. Cancer Res, 60, 298-304.

[0137] 10. Guerin, M., Sheng, Z. M., Andrieu, N., and Riou, G. (1990) Strong association between c-myband oestrogen-receptor expression in human breast cancer. Oncogene , 5, 131-135.

[0138] 11. Lim, K. C., Lakshmanan, G., Crawford, S. E., Gu, Y., Grosveld, F., and Douglas Engel, J. (2000) Gata 3 loss leads to embryonic lethality due to noradrenaline deficiency of the sympathetic nervous system. Nat Genet, 25, 209-212.

[0139] 12. Mills, K. J., Vollberg, T. M., Nervi, C., Grippo, J. F., Dawson, M. I., and Jetten, A. M. (1996) Regulation of retinoid-induced differentiation in embryonal carcinoma PCC 4.azalR cells: effects of retinoid-receptor selective ligands. Cell Growth Differ, 7, 327-337.

[0140] 13. Easty, D. J., Hill, S. P., Hsu, M. Y., Fallowfield, M. E., Florenes, V. A., Herlyn, M., and Bennett, D. C. (1999) Up-regulation of ephrin0A1 during melanoma progression. Int J Cancer, 84, 494-501.

[0141] 14. Shim, C., Zhang, W., Rhee, C. H., and Lee, J. H. (1998) Profiling of differentially expressed genes in human primary cervical cancer by complementary DNA expression array. Clin Cancer Res, 4, 3045-3050.

[0142] 15. Tsou, A. P., Wu, K. M., Tsen, T. Y., Chi, C. W., Chiu, J. H., Lui, W. Y., Hu, C. P., Chang, C., Chou, C. K., and Tsai, S. F. (1998) Parallel hybridization analysis of multiple protein kinase genes: identification of gene expression patterns characteristic of human hepatocellular carcinoma. Genomics, 50, 331-340.

[0143] 16. Schummer, M., Ng, W. V., Bumgarner, R. E., Nelson, P. S., Schummer, B., Bednarski, D. W., Hassell, L., Baldwin, R. L., Karlan, B. Y., and Hood, L. (1999) Comparative hybridization of an array of 21,500 ovarian cDNAs for the discovery of genes overexpressed in ovarian carcinomas. Gene, 238, 375-385.

[0144] 17. Alon, U., Barkai, N., Notterman, D. A., Gish, K., Ybarra, S., Mack, D., and Levine, A. J. (1999) Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. Proc Natl Acad Sci U S A, 96, 6745-6750.

[0145] 18. Moch, H., Schraml, P., Bubendorf, L., Mirlacher, M., Kononen, J., Gasser, T., Mihatsch, M. J., Kallioniemi, O. P., and Sauter, G. (1999) High-throughput tissue microarray analysis to evaluate genes uncovered by cDNA microarray screening in renal cell carcinoma. Am J Pathol, 154, 981-986.

[0146] 19. Rhee, C. H., Hess, K., Jabbur, J., Ruiz, M., Yang, Y., Chen, S., Chenchik, A., Fuller, G. N., and Zhang, W. (1999) cDNA expression array reveals heterogeneous gene expression profiles in three glioblastoma cell lines. Oncogene, 18, 2711-2717.

[0147] 20. Huang, F., Adelman, J., Jiang, H., Goldstein, N. I., and Fisher, P. B. (1999) Identification and temporal expression pattern of genes modulated during irreversible growth arrest and terminal differentiation in human melanoma cells. Oncogene, 18, 3546-3552.

[0148] 21. Bittner, M., Meltzer, P., Chen, Y., Jiang, Y., Seftor, E., Hendrix, M., Radmacher, M., Simon, R., Yakhini, Z., Ben-Dor, A., Sampas, N., Dougherty, E., Wang, E., Marincola, F., Gooden, C., Lueders, J., Glatfelter, A., Pollock, P., Carpten, J., Gillanders, E., Leja, D., Dietrich, K., Beaudry, C., Berens, M., Alberts, D., and Sondak, V. (2000) Molecular classification of cutaneous malignant melanoma by gene expression profiling. Nature, 406, 536-540.

[0149] 22. Khan, J., Simon, R., Bittner, M., Chen, Y., Leighton, S. B., Pohida, T., Smith, P. D., Jiang, Y., Gooden, G. C., Trent, J. M., and Meltzer, P. S. (1998) Gene expression profiling of alveolar rhabdomyosarcoma with cDNA microarrays. Cancer Res, 58, 5009-5013.

[0150] 23. Golub, T. R., Slonim, D. K., Tamayo, P., Huard, C., Gaasenbeek, M., Mesirov, J. P., Coller, H., Loh, M. L., Downing, J. R., Caligiuri, M. A., Bloomfield, C. D., and Lander, E. S. (1999) Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science, 286, 531-537.

[0151] 24. Alizadeh, A. A., Eisen, M. B., Davis, R. E., Ma, C., Lossos, I. S., Rosenwald, A., Boldrick, J. C., Sabet, H., Tran, T., Yu, X., Powell, J. I., Yang, L., Marti, G. E., Moore, T., Hudson, J., Jr., Lu, L., Lewis, D. B., Tibshirani, R., Sherlock, G., Chan, W. C., Greiner, T. C., Weisenburger, D. D., Armitage, J. O., Warnke, R., and Staudt, L. M. (2000) Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling [In Process Citation]. Nature, 403, 503-511.

[0152] 25. Hoch, R. V., Thompson, D. A., Baker, R. J., and Weigel, R. J. (1999) GATA-3 is expressed in association with estrogen receptor in breast cancer. Int J Cancer, 84, 122-128.

[0153] 26. Hilsenbeck, S. G., Friedrichs, W. E., Schiff, R., O° Connell, P., Hansen, R. K., Osborne, C. K., and Fuqua, S. A. (1999) Statistical analysis of array expression data as applied to the problem of tamoxifen resistance. J Natl Cancer Inst, 91, 453-459.

[0154] 27. Martin, K. J., Kritzman, B. M., Price, L. M., Koh, B., Kwan, C. P., Zhang, X., Mackay, A., O'Hare, M. J., Kaelin, C. M., Mutter, G. L., Pardee, A. B., and Sager, R. (2000) Linking gene expression patterns to therapeutic groups in breast cancer. Cancer Res, 60, 2232-2238.

[0155] 28. Yang, G. P., Ross, D. T., Kuang, W. W., Brown, P. O., and Weigel, R. J. (1999) Combining SSH and cDNA microarrays for rapid identification of differentially expressed genes. Nucleic Acids Res, 27, 1517-1523.

[0156] 29. Perou, C. M., Jeffrey, S. S., van de Rijn, M., Rees, C. A., Eisen, M. B., Ross, D. T., Pergamenschikov, A., Williams, C. F., Zhu, S. X., Lee, J. C., Lashkari, D., Shalon, D., Brown, P. O., and Botstein, D. (1999) Distinctive gene expression patterns in human mammary epithelial cells and breast cancers. Proc Natl Acad Sci U S A, 96, 9212-9217.

[0157] 30. Nacht, M., Ferguson, A. T., Zhang, W., Petroziello, J. M., Cook, B. P., Gao, Y. H., Maguire, S., Riley, D., Coppola, G., Landes, G. M., Madden, S. L., and Sukumar, S. (1999) Combining serial analysis of gene expression and array technologies to identify genes differentially expressed in breast cancer. Cancer Res, 59, 5464-5470.

[0158] 31. Sgroi, D. C., Teng, S., Robinson, G., LeVangie, R., Hudson, J. R., Jr., and Elkahloun, A. G. (1999) In vivo gene expression profile analysis of human breast cancer progression. Cancer Res, 59, 5656-5661.

[0159] 32. Perou, C. M., Sorlie, T., Eisen, M. B., van de Rijn, M., Jeffrey, S. S., Rees, C. A., Pollack, J. R., Ross, D. T., Johnsen, H., Akslen, L. A., Fluge, O., Pergamenschikov, A., Williams, C., Zhu, S. X., Lonning, P. E., Borresen-Dale, A. L., Brown, P. O., and Botstein, D. (2000) Molecular portraits of human breast tumours. Nature, 406, 747-752.

[0160] 33. Hahnel, E., Harvey, J. M., Joyce, R., Robbins, P. D., Sterrett, G. F., and Hahnel, R. (1993) Stromelysin-3 expression in breast cancer biopsies: clinico- pathological correlations. Int J Cancer, 55, 771-774.

[0161] 34. Skoog, L., Humla, S., Klintenberg, C., Pasqual, M., and Wallgren, A. (1985) Receptors for retinoic acid and retinol in human mammary carcinomas. Eur J Cancer Clin Oncol, 21, 901-906.

[0162] 35. Thor, A. D., Moore, D. H., I I, Edgerton, S. M., Kawasaki, E. S., Reihsaus, E., Lynch, H. T., Marcus, J. N., Schwartz, L., Chen, L. C., Mayall, B. H., and et al. (1992) Accumulation of p 53 tumor suppressor gene protein: an independent marker of prognosis in breast cancers. J Natl Cancer Inst, 84, 845-855.

[0163] 36. Allred, D. C., Harvey, J. M., Berardo, M., and Clark, G. M. (1998) Prognostic and predictive factors in breast cancer by immunohistochemical analysis. Mod Pathol , 11, 155-168.

[0164] 37. Spencer, K. S., Graus-Porta, D., Leng, J., Hynes, N. E., and Klemke, R. L. (2000) ErbB 2 is necessary for induction of carcinoma cell invasion by ErbB family receptor tyrosine kinases. J Cell Biol, 148, 385-397.

[0165] 38. Behrens, J. (1993) The role of cell adhesion molecules in cancer invasion and metastasis. Breast Cancer Res Treat, 24, 175-184.

[0166] 39. Roberts, D. D. (1996) Regulation of tumor growth and metastasis by thrombospondin-1. Faseb J, 10, 1183-1191.

[0167] 40. Taylor-Papadimitriou, J., Burchell, J., Miles, D. W., and Dalziel, M. (1999) MUCI and cancer. Biochim Biophys Acta, 1455, 301-313.

[0168] 41. Sneath, R. J. and Mangham, D. C. (1998) The normal structure and function of CD44 and its role in neoplasia. Mol Pathol, 51, 191-200.

[0169] 42. Iyer, V. R., Eisen, M. B., Ross, D. T., Schuler, G., Moore, T., Lee, J. C. F., Trent, J. M., Staudt, L. M., Hudson, J., Jr., Boguski, M. S., Lashkari, D., Shalon, D., Botstein, D., and Brown, P. O. (1999) The transcriptional program in the response of human fibroblasts to serum. Science, 283, 83-87.

[0170] 43. Theillet, C., Adelaide, J., Louason, G., Bonnet-Dorion, F., Jacquemier, J., Adnane, J., Longy, M., Katsaros, D., Sismondi, P., Gaudray, P., and et al. (1993) FGFRI and PLAT genes and DNA amplification at 8p12 in breast and ovarian cancers. Genes Chromosomes Cancer, 7, 219-226.

[0171] 44. Granjeaud, S., Nguyen, C., Rocha, D., Luton, R., and Jordan, B. R. (1996) From hybridization image to numerical values: a practical, high throughput quantification system for high density filter hybridizations. Genet Anal, 12, 151-162.

[0172] 45. Eisen, M. B., Spellman, P. T., Brown, P. O., and Botstein, D. (1998) Cluster analysis and display of genome-wide expression patterns. Proc Natl Acad Sci U S A , 95, 14863-14868.

[0173] 46. Ferrari, S., Battini, R., and Cossu, G. (1990) Differentiation-dependent expression of apolipoprotein A-I in chicken myogenic cells in culture. Dev Biol, 140, 430-436.

Claims

1. A polynucleotide library useful in the molecular characterization of a carcinoma, said library comprising a pool of polynucleotide sequences or subsequences thereof wherein said sequences or subsequences are either underexpressed or overexpressed in tumor cells, further wherein said sequences or subsequences correspond substantially to any of the polynucleotide sequences set forth in any of SEQ ID Nos: 1-468 or the complement thereof.
2. A polynucleotide library useful in the molecular characterization of a carcinoma, said library comprising a pool of polynucleotide sequences or subsequences thereof wherein said sequences or subsequences are overexpressed or underexpressed in tumor cells, further wherein said sequences or subsequences correspond substantially to any of the polynucleotide sequences set forth in any of SEQ ID No: 1, SEQ ID No: 3, SEQ ID No: 5, SEQ ID No: 7, SEQ ID No: 8, SEQ ID No: 9, SEQ ID No: 11, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 16, SEQ ID No: 17, SEQ ID No: 18, SEQ ID No: 20, SEQ ID No: 22, SEQ ID No: 23, SEQ ID No: 25, SEQ ID No: 27, SEQ ID No: 28, SEQ ID No: 30, SEQ ID No: 32, SEQ ID No: 33, SEQ ID No: 35, SEQ ID No: 36, SEQ ID No: 37, SEQ ID No: 39, SEQ ID No: 40, SEQ ID No: 42, SEQ ID No: 44, SEQ ID No: 46, SEQ ID No: 48, SEQ ID No: 49, SEQ ID No: 51, SEQ ID No: 52, SEQ ID No: 54, SEQ ID No: 55, SEQ ID No: 57, SEQ ID No: 59, SEQ ID No: 60, SEQ ID No: 65, SEQ ID No: 66, SEQ ID No: 68, SEQ ID No: 69, SEQ ID No: 71, SEQ ID No: 73, SEQ ID No: 74, SEQ ID No: 76, SEQ ID No: 79, SEQ ID No: 80, SEQ ID No: 82, SEQ ID No: 86, SEQ ID No: 88, SEQ ID No: 89, SEQ ID No: 91, SEQ ID No: 94, SEQ ID No: 95, SEQ ID No: 97, SEQ ID No: 98, SEQ ID No: 102, SEQ ID No: 104, SEQ ID No: 109, SEQ ID No: 111, SEQ ID No: 112, SEQ ID No: 114, SEQ ID No: 115, SEQ ID No: 116, SEQ ID No: 120, SEQ ID No: 123, SEQ ID No: 126, SEQ ID No: 127, SEQ ID No: 129, SEQ ID No: 131, SEQ ID No: 133, SEQ ID No: 135, SEQ ID No: 136, SEQ ID No: 138, SEQ ID No: 139, SEQ ID No: 141, SEQ ID No: 142, SEQ ID No: 144, SEQ ID No: 145, SEQ ID No: 147, SEQ ID No: 148, SEQ ID No: 150, SEQ ID No: 153, SEQ ID No: 154, SEQ ID No: 156, SEQ ID No: 157, SEQ ID No: 159, SEQ ID No: 160, SEQ ID No: 162, SEQ ID No: 164, SEQ ID No: 166, SEQ ID No: 167, SEQ ID No: 168, SEQ ID No: 171, SEQ ID No: 173, SEQ ID No: 174, SEQ ID No: 176, SEQ ID No: 178, SEQ ID No: 180, SEQ ID No: 181, SEQ ID No: 183, SEQ ID No: 185, SEQ ID No: 186, SEQ ID No: 187, SEQ ID No: 189, SEQ ID No: 190, SEQ ID No: 192, SEQ ID No: 194, SEQ ID No: 196, SEQ ID No: 197, SEQ ID No: 201, SEQ ID No: 202, SEQ ID No: 204, SEQ ID No: 205, SEQ ID No: 206, SEQ ID No: 208, SEQ ID No: 212, SEQ ID No: 214, SEQ ID No: 216, SEQ ID No: 218, SEQ ID No: 219, SEQ ID No: 221, SEQ ID No: 223, SEQ ID No: 224, SEQ ID No: 226, SEQ ID No: 230, SEQ ID No: 231, SEQ ID No: 233, SEQ ID No: 238, SEQ ID No: 239, SEQ ID No: 240, SEQ ID No: 242, SEQ ID No: 243, SEQ ID No: 245, SEQ ID No: 246, SEQ ID No: 248, SEQ ID No: 250, SEQ ID No: 251, SEQ ID No: 253, SEQ ID No: 255, SEQ ID No: 259, SEQ ID No: 260, SEQ ID No: 263, SEQ ID No: 265, SEQ ID No: 269, SEQ ID No: 271, SEQ ID No: 277, SEQ ID No: 279, SEQ ID No: 280, SEQ ID No: 284, SEQ ID No: 286, SEQ ID No: 287, SEQ ID No: 289, SEQ ID No: 291, SEQ ID No: 293, SEQ ID No: 294, SEQ ID No: 296, SEQ ID No: 298, SEQ ID No: 299, SEQ ID No: 301, SEQ ID No: 302, SEQ ID No: 303, SEQ ID No: 305, SEQ ID No: 306, SEQ ID No: 308, SEQ ID No: 309, SEQ ID No: 311, SEQ ID No: 312, SEQ ID No: 314, SEQ ID No: 315, SEQ ID No: 317, SEQ ID No: 319, SEQ ID No: 320, SEQ ID No: 322, SEQ ID No: 323, SEQ ID No: 324, SEQ ID No: 326, SEQ ID No: 327, SEQ ID No: 329, SEQ ID No: 331, SEQ ID No: 332, SEQ ID No: 334, SEQ ID No: 335, SEQ ID No: 339, SEQ ID No: 340, SEQ ID No: 345, SEQ ID No: 347, SEQ ID No: 350, SEQ ID No: 352, SEQ ID No: 354, SEQ ID No: 356, SEQ ID No: 358, SEQ ID No: 359, SEQ ID No: 368, SEQ ID No: 369, SEQ ID No: 370, SEQ ID No: 374, SEQ ID No: 375, SEQ ID No: 380, SEQ ID No: 382, SEQ ID No: 383, SEQ ID No: 385, SEQ ID No: 386, SEQ ID No: 388, SEQ ID No: 392, SEQ ID No: 394, SEQ ID No: 402, SEQ ID No: 404, SEQ ID No: 405, SEQ ID No: 414, SEQ ID No: 415, SEQ ID No: 417, SEQ ID No: 418, SEQ ID No: 430, SEQ ID No: 431, SEQ ID No: 435, SEQ ID No: 437, SEQ ID No: 439, SEQ ID No: 440, SEQ ID No: 443, SEQ ID No: 444, SEQ ID No: 445, SEQ ID No: 446, SEQ ID No: 447, SEQ ID No: 448, SEQ ID No: 449, SEQ ID No: 450, SEQ ID No: 451, SEQ ID No: 452, SEQ ID No: 453, SEQ ID No: 454, SEQ ID No: 455, SEQ ID No: 456, SEQ ID No: 457, SEQ ID No: 458, SEQ ID No: 459, SEQ ID No: 460, SEQ ID No: 461, SEQ ID No: 462, SEQ ID No: 463, SEQ ID No: 464, SEQ ID No: 465, SEQ ID No: 466, SEQ ID No: 467, SEQ ID No: 468 or the complement thereof.
3. The polynucleotide library of claim 2 wherein said tumor cells are breast tumor cells.
4. The polynucleotide library of claim 2 wherein the pool of polynucleotide sequences or subsequences correspond substantially to the polynucleotide sequences set forth in any of SEQ ID No: 1, SEQ ID No: 3, SEQ ID No: 5, SEQ ID No: 7, SEQ ID No: 8, SEQ ID No: 9, SEQ ID No: 11, SEQ ID No: 13, SEQ ID No: 14 SEQ ID No: 16, SEQ ID No: 17, SEQ ID No: 18, SEQ ID No: 20, SEQ ID No: 22, SEQ ID No: 23, SEQ ID No: 25, SEQ ID No: 27, SEQ ID No: 28, SEQ ID No: 30, SEQ ID No: 32, SEQ ID No: 33, SEQ ID No: 35, SEQ ID No: 36, SEQ ID No: 37, SEQ ID No: 39, SEQ ID No: 40, SEQ ID No: 42, SEQ ID No: 44, SEQ ID No: 46, SEQ ID No: 48, SEQ ID No: 49, SEQ ID No: 51, SEQ ID No: 52, SEQ ID No: 54, SEQ ID No: 55, SEQ ID No: 57, SEQ ID No: 59, SEQ ID No: 60, SEQ ID No: 65, SEQ ID No: 66, SEQ ID No: 68, SEQ ID No: 69, SEQ ID No: 71, SEQ ID No: 73, SEQ ID No: 74, SEQ ID No: 76, SEQ ID No: 79, SEQ ID No: 80, SEQ ID No: 82, SEQ ID No: 86, SEQ ID No: 88, SEQ ID No: 89, SEQ ID No: 91, SEQ ID No: 94, SEQ ID No: 95, SEQ ID No: 97, SEQ ID No: 98, SEQ ID No: 102, SEQ ID No: 104, SEQ ID No: 109, SEQ ID No: 111, SEQ ID No: 112, SEQ ID No: 114, SEQ ID No: 115, SEQ ID No: 116, SEQ ID No: 120, SEQ ID No: 123, SEQ ID No: 126, SEQ ID No: 127, SEQ ID No: 129, SEQ ID No: 131, SEQ ID No: 133, SEQ ID No: 135, SEQ ID No: 136, SEQ ID No: 138, SEQ ID No: 139, SEQ ID No: 141, SEQ ID No: 142, SEQ ID No: 144, SEQ ID No: 145, SEQ ID No: 147, SEQ ID No: 148, SEQ ID No: 150, SEQ ID No: 153, SEQ ID No: 154, SEQ ID No: 156, SEQ ID No: 157, SEQ ID No: 159, SEQ ID No: 160, SEQ ID No: 162, SEQ ID No: 164, SEQ ID No: 166, SEQ ID No: 167, SEQ ID No: 168, SEQ ID No: 171, SEQ ID No: 173, SEQ ID No: 174, SEQ ID No: 176, SEQ ID No: 178, SEQ ID No: 180, SEQ ID No: 181, SEQ ID No: 183, SEQ ID No: 185, SEQ ID No: 186, SEQ ID No: 187, SEQ ID No: 192, SEQ ID No: 194, SEQ ID No: 196, SEQ ID No: 197, SEQ ID No: 201, SEQ ID No: 202, SEQ ID No: 204, SEQ ID No: 205, SEQ ID No: 206, SEQ ID No: 208, SEQ ID No: 212, SEQ ID No: 214, SEQ ID No: 216, SEQ ID No: 218, SEQ ID No: 219, SEQ ID No: 221, SEQ ID No: 223, SEQ ID No: 224, SEQ ID No: 226, SEQ ID No: 230, SEQ ID No: 231, SEQ ID No: 233, SEQ ID No: 238, SEQ ID No: 239, SEQ ID No: 240, SEQ ID No: 242, SEQ ID No: 243, SEQ ID No: 245, SEQ ID No: 246, SEQ ID No: 248, SEQ ID No: 250, SEQ ID No: 251, SEQ ID No: 253, SEQ ID No: 255, SEQ ID No: 259, SEQ ID No: 260, SEQ ID No: 263, SEQ ID No: 265, SEQ ID No: 269, SEQ ID No: 271, SEQ ID No: 277, SEQ ID No: 279, SEQ ID No: 280, SEQ ID No: 284, SEQ ID No: 286, SEQ ID No: 287, SEQ ID No: 289, SEQ ID No: 291, SEQ ID No: 293, SEQ ID No: 294, SEQ ID No: 296, SEQ ID No: 298, SEQ ID No: 299, SEQ ID No: 301, SEQ ID No: 302, SEQ ID No: 303, SEQ ID No: 305, SEQ ID No: 306, SEQ ID No: 308, SEQ ID No: 309, SEQ ID No: 311, SEQ ID No: 312, SEQ ID No: 314, SEQ ID No: 315, SEQ ID No: 317, SEQ ID No: 319, SEQ ID No: 320, SEQ ID No: 322, SEQ ID No: 323, SEQ ID No: 324, SEQ ID No: 326, SEQ ID No: 327, SEQ ID No: 329, SEQ ID No: 331, SEQ ID No: 332, SEQ ID No: 334, SEQ ID No: 335, SEQ ID No: 339, SEQ ID No: 340, SEQ ID No: 345, SEQ ID No: 347, SEQ ID No: 350, SEQ ID No: 352, SEQ ID No: 354, SEQ ID No: 356, SEQ ID No: 358, SEQ ID No: 359, SEQ ID No: 368, SEQ ID No: 369, SEQ ID No: 370, SEQ ID No: 374, SEQ ID No: 375, SEQ ID No: 380, SEQ ID No: 382, SEQ ID No: 383, SEQ ID No: 385, SEQ ID No: 386, SEQ ID No: 388, SEQ ID No: 392, SEQ ID No: 394, SEQ ID No: 402, SEQ ID No: 404, SEQ ID No: 405, SEQ ID No: 414, SEQ ID No: 415, SEQ ID No: 417, SEQ ID No: 418, SEQ ID No: 430, SEQ ID No: 431, SEQ ID No: 435, SEQ ID No: 437, SEQ ID No: 439, SEQ ID No: 440, SEQ ID No: 443, SEQ ID No: 444, SEQ ID No: 445, SEQ ID No: 446, SEQ ID No: 447, SEQ ID No: 448, SEQ ID No: 449, SEQ ID No: 450, SEQ ID No: 451, SEQ ID No: 452, SEQ ID No: 453, SEQ ID No: 454, SEQ ID No: 455, SEQ ID No: 456, SEQ ID No: 457, SEQ ID No: 458, SEQ ID No: 459, SEQ ID No: 460, SEQ ID No: 461, SEQ ID No: 462, SEQ ID No: 463, SEQ ID No: 464, SEQ ID No: 465, SEQ ID No: 466, SEQ ID No: 467, SEQ ID No: 468; and further wherein said polynucleotide sequences or subsequences of said pool are useful in differentiating a normal cell from a cancer cell.
5. The polynucleotide library of claim 2 wherein the pool of polynucleotide sequences or subsequences correspond substantially to the polynucleotide sequences set forth in any of SEQ ID No: 1, SEQ ID No: 3, SEQ ID No: 5, SEQ ID No: 7, SEQ ID No: 8, SEQ ID No: 9, SEQ ID No: 11, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 16, SEQ ID No: 17, SEQ ID No: 18, SEQ ID No: 20, SEQ ID No: 22, SEQ ID No: 23, SEQ ID No: 25, SEQ ID No: 27, SEQ ID No: 28, SEQ ID No: 30, SEQ ID No: 32, SEQ ID No: 33, SEQ ID No: 35, SEQ ID No: 36, SEQ ID No: 37, SEQ ID No: 39, SEQ ID No: 40, SEQ ID No: 42, SEQ ID No: 44, SEQ ID No: 46, SEQ ID No: 48, SEQ ID No: 49, SEQ ID No: 51, SEQ ID No: 52, SEQ ID No: 54, SEQ ID No: 55, SEQ ID No: 57, SEQ ID No: 59, SEQ ID No: 60 , SEQ ID No: 65, SEQ ID No: 66, SEQ ID No: 68, SEQ ID No: 69, SEQ ID No: 71, SEQ ID No: 73, SEQ ID No: 74, SEQ ID No: 76, SEQ ID No: 79, SEQ ID No: 80, SEQ ID No: 82, SEQ ID No: 86, SEQ ID No: 88, SEQ ID No: 89, SEQ ID No: 94, SEQ ID No: 95, SEQ ID No: 97, SEQ ID No: 98, SEQ ID No: 102, SEQ ID No: 104, SEQ ID No: 109, SEQ ID No: I-l, SEQ ID No: 112, SEQ ID No: 114, SEQ ID No: 115, SEQ ID No: 116, SEQ ID No: 120, SEQ ID No: 123, SEQ ID No: 126, SEQ ID No: 127, SEQ ID No: 129, SEQ ID No: 131, SEQ ID No: 133, SEQ ID No: 135, SEQ ID No: 136, SEQ ID No: 141, SEQ ID No: 142, SEQ ID No: 144, SEQ ID No: 145, SEQ ID No: 147, SEQ ID No: 148, SEQ ID No: 150, SEQ ID No: 153, SEQ ID No: 154, SEQ ID No: 156, SEQ ID No: 157, SEQ ID No: 159, SEQ ID No: 160, SEQ ID No: 162, SEQ ID No: 164, SEQ ID No: 166, SEQ ID No: 167, SEQ ID No: 171, SEQ ID No: 173, SEQ ID No: 174, SEQ ID No: 176, SEQ ID No: 178, SEQ ID No: 180, SEQ ID No: 181, SEQ ID No: 183, SEQ ID No: 185, SEQ ID No: 186, SEQ ID No: 187, SEQ ID No: 192, SEQ ID No: 194, SEQ ID No: 196, SEQ ID No: 197, SEQ ID No: 201, SEQ ID No: 202, SEQ ID No: 204, SEQ ID No: 205, SEQ ID No: 206, SEQ ID No: 208, SEQ ID No: 212, SEQ ID No: 214, SEQ ID No: 216, SEQ ID No: 218, SEQ ID No: 219, SEQ ID No: 221, SEQ ID No: 223, SEQ ID No: 224, SEQ ID No: 226, SEQ ID No: 230, SEQ ID No: 231, SEQ ID No: 233, SEQ ID No: 238, SEQ ID No: 239, SEQ ID No: 240, SEQ ID No: 242, SEQ ID No: 243, SEQ ID No: 245, SEQ ID No: 246, SEQ ID No: 248, SEQ ID No: 250, SEQ ID No: 251, SEQ ID No: 253, SEQ ID No: 255, SEQ ID No: 259, SEQ ID No: 260, SEQ ID No: 263, SEQ ID No: 265, SEQ ID No: 269, SEQ ID No: 271, SEQ ID No: 277, SEQ ID No: 279, SEQ ID No: 280, SEQ ID No: 284, SEQ ID No: 286, SEQ ID No: 287, SEQ ID No: 289, SEQ ID No: 291, SEQ ID No: 293, SEQ ID No: 294, SEQ ID No: 296, SEQ ID No: 298, SEQ ID No: 299, SEQ ID No: 301, SEQ ID No: 302, SEQ ID No: 303, SEQ ID No: 305, SEQ ID No: 306, SEQ ID No: 308, SEQ ID No: 309, SEQ ID No: 311, SEQ ID No: 312, SEQ ID No: 314, SEQ ID No: 315, SEQ ID No: 317, SEQ ID No: 319, SEQ ID No: 320, SEQ ID No: 322, SEQ ID No: 323, SEQ ID No: 324, SEQ ID No: 326, SEQ ID No: 327, SEQ ID No: 329, SEQ ID No: 331, SEQ ID No: 332, SEQ ID No: 334, SEQ ID No: 335, SEQ ID No: 339, SEQ ID No: 340, SEQ ID No: 345, SEQ ID No: 347, SEQ ID No: 350, SEQ ID No: 352, SEQ ID No: 354, SEQ ID No: 356, SEQ ID No: 358, SEQ ID No: 359, SEQ ID No: 368, SEQ ID No: 369, SEQ ID No: 370, SEQ ID No: 374, SEQ ID No: 375, SEQ ID No: 380, SEQ ID No: 382, SEQ ID No: 383, SEQ ID No: 385, SEQ ID No: 386, SEQ ID No: 388, SEQ ID No: 392, SEQ ID No: 394, SEQ ID No: 402, SEQ ID No: 404, SEQ ID No: 405, SEQ ID No: 414, SEQ ID No: 415, SEQ ID No: 417, SEQ ID No: 418, SEQ ID No: 430, SEQ ID No: 431, SEQ ID No: 435, SEQ ID No: 437, SEQ ID No: 439, SEQ ID No: 440, SEQ ID No: 443, SEQ ID No: 444, SEQ ID No: 445, SEQ ID No: 446, SEQ ID No: 447, SEQ ID No: 448, SEQ ID No: 449, SEQ ID No: 450, SEQ ID No: 451, SEQ ID No: 452, SEQ ID No: 453, SEQ ID No: 454, SEQ ID No: 455, SEQ ID No: 456, SEQ ID No: 457, SEQ ID No: 458, SEQ ID No: 459, SEQ ID No: 460, SEQ ID No: 461, SEQ ID No: 462, SEQ ID No: 463, SEQ ID No: 464, SEQ ID No: 465, SEQ ID No: 466, SEQ ID No: 467, and wherein said polynucleotide sequences or subsequences of said pool are useful in detecting a hornone-sensitive tumor cell.
6. The library polynucleotide of claim 2 wherein the pool of polynucleotide sequences or subsequences correspond substantially to the polynucleotide sequences set forth in any of SEQ ID No: 1, SEQ ID No: 3, SEQ ID No: 5, SEQ ID No: 7, SEQ ID No: 8, SEQ ID No: 9, SEQ ID No: 1, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 16, SEQ ID No: 17, SEQ ID No: 18, SEQ ID No: 20, SEQ ID No: 22, SEQ ID No: 23, SEQ ID No: 25, SEQ ID No: 27, SEQ ID No: 28, SEQ ID No: 30, SEQ ID No: 32, SEQ ID No: 33, SEQ ID No: 35, SEQ ID No: 36, SEQ ID No: 37, SEQ ID No: 39, SEQ ID No: 40, SEQ ID No: 42, SEQ ID No: 44, SEQ ID No: 46, SEQ ID No: 48, SEQ ID No: 49, SEQ ID No: 51, SEQ ID No: 52, SEQ ID No: 54, SEQ ID No: 55, SEQ ID No: 57, SEQ ID No: 59, SEQ ID No: 60, SEQ ID No: 65, SEQ ID No: 66, SEQ ID No: 68, SEQ ID No: 69, SEQ ID No: 71, SEQ ID No: 73, SEQ ID No: 74, SEQ ID No: 76, SEQ ID No: 79, SEQ ID No: 80, SEQ ID No: 82, SEQ ID No: 86, SEQ ID No: 88, SEQ ID No: 89, SEQ ID No: 94, SEQ ID No: 95, SEQ ID No: 97, SEQ ID No: 98, SEQ ID No: 102, SEQ ID No: 104, SEQ ID No: 109, SEQ ID No: 111, SEQ ID No: 112, SEQ ID No: 114, SEQ ID No: 115, SEQ ID No: 116, SEQ ID No: 120, SEQ ID No: 123, SEQ ID No: 126, SEQ ID No: 127, SEQ ID No: 129, SEQ ID No: 131, SEQ ID No: 133, SEQ ID No: 135, SEQ ID No: 136, SEQ ID No: 141, SEQ ID No: 142, SEQ ID No: 144, SEQ ID No: 145, SEQ ID No: 147, SEQ ID No: 148, SEQ ID No: 150, SEQ ID No: 153, SEQ ID No: 154, SEQ ID No: 156, SEQ ID No: 157, SEQ ID No: 159, SEQ ID No: 160, SEQ ID No: 162, SEQ ID No: 164, SEQ ID No: 166, SEQ ID No: 167, SEQ ID No: 171, SEQ ID No: 173, SEQ ID No: 174, SEQ ID No: 176, SEQ ID No: 178, SEQ ID No: 180, SEQ ID No: 181, SEQ ID No: 183, SEQ ID No: 185, SEQ ID No: 186, SEQ ID No: 187, SEQ ID No: 192, SEQ ID No: 194, SEQ ID No: 196, SEQ ID No: 197, SEQ ID No: 201, SEQ ID No: 202, SEQ ID No: 204, SEQ ID No: 205, SEQ ID No: 206, SEQ ID No: 208, SEQ ID No: 212, SEQ ID No: 214, SEQ ID No: 216, SEQ ID No: 218, SEQ ID No: 219, SEQ ID No: 221, SEQ ID No: 223, SEQ ID No: 224, SEQ ID No: 226, SEQ ID No: 230, SEQ ID No: 231, SEQ ID No: 233, SEQ ID No: 238, SEQ ID No: 239, SEQ ID No: 240, SEQ ID No: 242, SEQ ID No: 243, SEQ ID No: 245, SEQ ID No: 246, SEQ ID No: 248, SEQ ID No: 250, SEQ ID No: 251, SEQ ID No: 253, SEQ ID No: 255, SEQ ID No: 259, SEQ ID No: 260, SEQ ID No: 263, SEQ ID No: 265, SEQ ID No: 269, SEQ ID No: 271, SEQ ID No: 277, SEQ ID No: 279, SEQ ID No: 280, SEQ ID No: 284, SEQ ID No: 286, SEQ ID No: 287, SEQ ID No: 289, SEQ ID No: 291, SEQ ID No: 293, SEQ ID No: 294, SEQ ID No: 296, SEQ ID No: 298, SEQ ID No: 299, SEQ ID No: 301, SEQ ID No: 302, SEQ ID No: 303, SEQ ID No: 305, SEQ ID No: 306, SEQ ID No: 308, SEQ ID No: 309, SEQ ID No: 311, SEQ ID No: 312, SEQ ID No: 314, SEQ ID No: 315, SEQ ID No: 317, SEQ ID No: 319, SEQ ID No: 320, SEQ ID No: 322, SEQ ID No: 323, SEQ ID No: 324, SEQ ID No: 326, SEQ ID No: 327, SEQ ID No: 329, SEQ ID No: 331, SEQ ID No: 332, SEQ ID No: 334, SEQ ID No: 335, SEQ ID No: 339, SEQ ID No: 340, SEQ ID No: 345, SEQ ID No: 347, SEQ ID No: 350, SEQ ID No: 352, SEQ ID No: 354, SEQ ID No: 356, SEQ ID No: 358, SEQ ID No: 359, SEQ ID No: 368, SEQ ID No: 369, SEQ ID No: 370, SEQ ID No: 374, SEQ ID No: 375, SEQ ID No: 380, SEQ ID No: 382, SEQ ID No: 383, SEQ ID No: 385, SEQ ID No: 386, SEQ ID No: 388, SEQ ID No: 392, SEQ ID No: 394, SEQ ID No: 402, SEQ ID No: 404, SEQ ID No: 405, SEQ ID No: 414, SEQ ID No: 415, SEQ ID No: 417, SEQ ID No: 418, SEQ ID No: 430, SEQ ID No: 431, SEQ ID No: 435, SEQ ID No: 437, SEQ ID No: 439, SEQ ID No: 440, SEQ ID No: 443, SEQ ID No: 444, SEQ ID No: 445, SEQ ID No: 446, SEQ ID No: 447, SEQ ID No: 448, SEQ ID No: 449, SEQ ID No: 450, SEQ ID No: 451, SEQ ID No: 452, SEQ ID No: 453, SEQ ID No: 454, SEQ ID No: 455, SEQ ID No: 456, SEQ ID No: 457, SEQ ID No: 458, SEQ ID No: 459, SEQ ID No: 460, SEQ ID No: 461, SEQ ID No: 462, SEQ ID No: 463, SEQ ID No: 464, SEQ ID No: 465, SEQ ID No: 466, SEQ ID No: 467; and wherein said polynucleotide sequences or subsequences of said pool are useful in differentiating a tumor in which a lymph node has been invaded by a tumor cell from a tumor in which a lymph node has not been invaded by a tumor cell.
7. The polynucleotide library of claim 2 wherein the pool of polynucleotide sequences or subsequences correspond substantially to the polynucleotide sequences set forth in any of SEQ ID No: 1, SEQ ID No: 3, SEQ ID No: 5, SEQ ID No: 7, SEQ ID No: 8, SEQ ID No: 9, SEQ ID No: 11, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 16, SEQ ID No: 17, SEQ ID No: 18, SEQ ID No: 22, SEQ ID No: 23, SEQ ID No: 25, SEQ ID No: 27, SEQ ID No: 28, SEQ ID No: 30, SEQ ID No: 32, SEQ ID No: 33, SEQ ID No: 35, SEQ ID No: 36, SEQ ID No: 37, SEQ ID No: 39, SEQ ID No: 40, SEQ ID No: 42, SEQ ID No: 44, SEQ ID No: 46, SEQ ID No: 48, SEQ ID No: 49, SEQ ID No: 51, SEQ ID No: 52, SEQ ID No: 54, SEQ ID No: 55, SEQ ID No: 57, SEQ ID No: 59, SEQ ID No: 60, SEQ ID No: 65, SEQ ID No: 66, SEQ ID No: 68, SEQ ID No: 69, SEQ ID No: 71, SEQ ID No: 73, SEQ ID No: 74, SEQ ID No: 76, SEQ ID No: 79, SEQ ID No: 80, SEQ ID No: 82, SEQ ID No: 86, SEQ ID No: 88, SEQ ID No: 89, SEQ ID No: 94, SEQ ID No: 95, SEQ ID No: 102, SEQ ID No: 104, SEQ ID No: 109, SEQ ID No: 111, SEQ ID No: 112, SEQ ID No: 114, SEQ ID No: 115, SEQ ID No: 116, SEQ ID No: 120, SEQ ID No: 123, SEQ ID No: 126, SEQ ID No: 127, SEQ ID No: 129, SEQ ID No: 131, SEQ ID No: 133, SEQ ID No: 135, SEQ ID No: 136, SEQ ID No: 141, SEQ ID No: 142, SEQ ID No: 144, SEQ ID No: 145, SEQ ID No: 147, SEQ ID No: 148, SEQ ID No: 150, SEQ ID No: 153, SEQ ID No: 154, SEQ ID No: 156, SEQ ID No: 157, SEQ ID No: 159, SEQ ID No: 160, SEQ ID No: 162, SEQ ID No: 164, SEQ ID No: 166, SEQ ID No: 167, SEQ ID No: 171, SEQ ID No: 173, SEQ ID No: 174, SEQ ID No: 176, SEQ ID No: 178, SEQ ID No: 180, SEQ ID No: 181, SEQ ID No: 183, SEQ ID No: 185, SEQ ID No: 186, SEQ ID No: 187, SEQ ID No: 192, SEQ ID No: 194, SEQ ID No: 196, SEQ ID No: 197, SEQ ID No: 201, SEQ ID No: 202, SEQ ID No: 204, SEQ ID No: 205, SEQ ID No: 206, SEQ ID No: 208, SEQ ID No: 212, SEQ ID No: 214, SEQ ID No: 216, SEQ ID No: 218, SEQ ID No: 219, SEQ ID No: 221, SEQ ID No: 223, SEQ ID No: 224, SEQ ID No: 226, SEQ ID No: 230, SEQ ID No: 231, SEQ ID No: 233, SEQ ID No: 238, SEQ ID No: 239, SEQ ID No: 240, SEQ ID No: 242, SEQ ID No: 243, SEQ ID No: 245, SEQ ID No: 246, SEQ ID No: 248, SEQ ID No: 250, SEQ ID No: 251, SEQ ID No: 253, SEQ ID No: 255, SEQ ID No: 259, SEQ ID No: 260, SEQ ID No: 263, SEQ ID No: 265, SEQ ID No: 269, SEQ ID No: 271, SEQ ID No: 277, SEQ ID No: 279, SEQ ID No: 280, SEQ ID No: 284, SEQ ID No: 286, SEQ ID No: 287, SEQ ID No: 289, SEQ ID No: 291, SEQ ID No: 293, SEQ ID No: 294, SEQ ID No: 296, SEQ ID No: 298, SEQ ID No: 299, SEQ ID No: 301, SEQ ID No: 302, SEQ ID No: 303, SEQ ID No: 305, SEQ ID No: 306, SEQ ID No: 308, SEQ ID No: 309, SEQ ID No: 311, SEQ ID No: 312, SEQ ID No: 314, SEQ ID No: 315, SEQ ID No: 317, SEQ ID No: 319, SEQ ID No: 320, SEQ ID No: 322, SEQ ID No: 323, SEQ ID No: 324, SEQ ID No: 326, SEQ ID No: 327, SEQ ID No: 329, SEQ ID No: 331, SEQ ID No: 332, SEQ ID No: 334, SEQ ID No: 332, SEQ ID No: 339, SEQ ID No: 340, SEQ ID No: 345, SEQ ID No: 347, SEQ ID No: 350, SEQ ID No: 332, SEQ ID No: 354, SEQ ID No: 356, SEQ ID No: 358, SEQ ID No: 359, SEQ ID No: 368, SEQ ID No: 369, SEQ ID No: 370, SEQ ID No: 374, SEQ ID No: 375, SEQ ID No: 380, SEQ ID No: 382, SEQ ID No: 383, SEQ ID No: 385, SEQ ID No: 386, SEQ ID No: 388, SEQ ID No: 392, SEQ ID No: 394, SEQ ID No: 402, SEQ ID No: 404, SEQ ID No: 405, SEQ ID No: 414, SEQ ID No: 415, SEQ ID No: 417, SEQ ID No: 418, SEQ ID No: 439, SEQ ID No: 443, SEQ ID No: 444, SEQ ID No: 445, SEQ ID No: 446, SEQ ID No: 447, SEQ ID No: 448, SEQ ID No: 449, SEQ ID No: 450, SEQ ID No: 451, SEQ ID No: 452, SEQ ID No: 453, SEQ ID No: 454, SEQ ID No: 455, SEQ ID No: 456, SEQ ID No: 457, and wherein said polynucleotide sequences or subsequences of said pool are usefull in differentiating anthracycline-sensitive tumors from anthracycline-insensitive tumors.
8. The polynucleotide library of any of claims 2-7 wherein said polynucleotide sequences or subsequences of said pool are immobilized on a solid support in order to form a polynucleotide array.
9. The polynucleotide library of claim 8 wherein the solid support is selected from the group consisting of a nylon membrane, glass slide, glass beads, or a silicon chip.
10. The polynucleotide library of claim 8 wherein the solid support is a membrane on a glass support.
11. A method for detecting differentially expressed polynucleotide sequences which are correlated with a cancer, said method comprising: obtaining a polynucleotide sample from a patient; labeling said polynucleotide sample by reacting said polynucleotide sample with a labeled probe immobilized on a solid support wherein said probe comprises any of the polynucleotide sequences of the polynucleotide library of claim 2 or an expression product encoded by any of the polynucleotide sequences of the polynucleotide library of claim 2; and detecting a polynucleotide sample reaction product.
12. The method of claim 11 further comprising obtaining a control polynucleotide sample, labeling said control sample by reacting said control sample with said labeled probe, detecting a control sample reaction product, and comparing the amount of said polynucleotide sample reaction product to the amount of said control sample reaction product.
13. The method of claims 11 or 12 wherein RNA or mRNA is isolated from said polynucleotide sample.
14. The method of claim 13 wherein mRNA is isolated from said polynucleotide sample and cDNA is obtained by reverse transcription of said niRNA.
15. The method of claim 11 wherein said labeling is performed by hybridizing the polynucleotide sample with the labeled probe.
16. The method of claim 13 wherein said method is used for detecting, diagnosing, staging, monitoring, predicting, preventing or treating conditions associated with cancer.
17. The method of claim 11 wherein the cancer is breast cancer.
18. The method of claim 11 wherein the product encoded by any of the polynucleotide sequences or subsequences is involved in a receptor-ligand reaction on which detection is based.
19. A method for screening an anti-tumor agent comprising the method of claim 11 wherein the polynucleotide sample has been treated with an anti-tumor agent to be screened.
20. The method of claim 11 wherein the label is selected from the group consisting of radioactive, colorimetric, enzymatic, molecular amplification, bioluminescent or fluorescent labels.
21. A library of polynucleotides comprising a population of polynucleotide sequences overexpressed or underexpressed in cells derived from a tumor selected from SEQ ID No: 1, SEQ ID No: 3, SEQ ID No: 5, SEQ ID No: 7, SEQ ID No: 8, SEQ ID No: 9, SEQ ID No: 11, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 16, SEQ ID No: 17, SEQ ID No: 18, SEQ ID No: 20, SEQ ID No: 22, SEQ ID No: 23, SEQ ID No: 25, SEQ ID No: 27, SEQ ID No: 28, SEQ ID No: 30, SEQ ID No: 32, SEQ ID No: 33, SEQ ID No: 35, SEQ ID No: 36, SEQ ID No: 37, SEQ ID No: 39, SEQ ID No: 40, SEQ ID No: 42, SEQ ID No: 44, SEQ ID No: 46, SEQ ID No: 48, SEQ ID No: 49, SEQ ID No: 51, SEQ ID No: 52, SEQ ID No: 54, SEQ ID No: 55, SEQ ID No: 57, SEQ ID No: 59, SEQ ID No: 60, SEQ ID No: 65, SEQ ID No: 66, SEQ ID No: 68, SEQ ID No: 69, SEQ ID No: 71, SEQ ID No: 73, SEQ ID No: 74, SEQ ID No: 76, SEQ ID No: 79, SEQ ID No: 80, SEQ ID No: 82, SEQ ID No: 86, SEQ ID No: 88, SEQ ID No: 89, SEQ ID No: 91, SEQ ID No: 94, SEQ ID No: 95, SEQ ID No: 97, SEQ ID No: 98, SEQ ID No: 102, SEQ ID No: 104, SEQ ID No: 109, SEQ ID No: 111, SEQ ID No: 112, SEQ ID No: 114, SEQ ID No: 115, SEQ ID No: 116, SEQ ID No: 120, SEQ ID No: 123, SEQ ID No: 126, SEQ ID No: 127, SEQ ID No: 129, SEQ ID No: 131, SEQ ID No: 133, SEQ ID No: 135, SEQ ID No: 136, SEQ ID No: 138, SEQ ID No: 139, SEQ ID No: 141, SEQ ID No: 142, SEQ ID No: 144, SEQ ID No: 145, SEQ ID No: 147, SEQ ID No: 148, SEQ ID No: 150, SEQ ID No: 153, SEQ ID No: 154, SEQ ID No: 156, SEQ ID No: 157, SEQ ID No: 159, SEQ ID No: 160, SEQ ID No: 162, SEQ ID No: 164, SEQ ID No: 166, SEQ ID No: 167, SEQ ID No: 168, SEQ ID No: 171, SEQ ID No: 173, SEQ ID No: 174, SEQ ID No: 176, SEQ ID No: 178, SEQ ID No: 180, SEQ ID No: 181, SEQ ID No: 183, SEQ ID No: 185, SEQ ID No: 186, SEQ ID No: 187, SEQ ID No: 189, SEQ ID No: 190, SEQ ID No: 192, SEQ ID No: 194, SEQ ID No: 196, SEQ ID No: 197, SEQ ID No: 201, SEQ ID No: 202, SEQ ID No: 204, SEQ ID No: 205, SEQ ID No: 206, SEQ ID No: 208, SEQ ID No: 212, SEQ ID No: 214, SEQ ID No: 216, SEQ ID No: 218, SEQ ID No: 219, SEQ ID No: 221, SEQ ID No: 223, SEQ ID No: 224, SEQ ID No: 226, SEQ ID No: 230, SEQ ID No: 231, SEQ ID No: 233, SEQ ID No: 238, SEQ ID No: 239, SEQ ID No: 240, SEQ ID No: 242, SEQ ID No: 243, SEQ ID No: 245, SEQ ID No: 246, SEQ ID No: 248, SEQ ID No: 250, SEQ ID No: 251, SEQ ID No: 253, SEQ ID No: 255, SEQ ID No: 259, SEQ ID No: 260, SEQ ID No: 263, SEQ ID No: 265, SEQ ID No: 269, SEQ ID No: 271, SEQ ID No: 277, SEQ ID No: 279, SEQ ID No: 2 , SEQ ID No: 284, SEQ ID No: 286, SEQ ID No: 287, SEQ ID No: 289, SEQ ID No: 291, SEQ ID No: 293, SEQ ID No: 294, SEQ ID No: 296, SEQ ID No: 298, SEQ ID No: 299, SEQ ID No: 301, SEQ ID No: 302, SEQ ID No: 303, SEQ ID No: 305, SEQ ID No: 306, SEQ ID No: 308, SEQ ID No: 309, SEQ ID No: 311, SEQ ID No: 312, SEQ ID No: 314, SEQ ID No: 315, SEQ ID No: 317, SEQ ID No: 319, SEQ ID No: 320, SEQ ID No: 322, SEQ ID No: 323, SEQ ID No: 324, SEQ ID No: 326, SEQ ID No: 327, SEQ ID No: 329, SEQ ID No: 331, SEQ ID No: 332, SEQ ID No: 334, SEQ ID No: 335, SEQ ID No: 339, SEQ ID No: 340, SEQ ID No: 345, SEQ ID No: 347, SEQ ID No: 350, SEQ ID No: 352, SEQ ID No: 354, SEQ ID No: 356, SEQ ID No: 358, SEQ ID No: 359, SEQ ID No: 368, SEQ ID No: 369, SEQ ID No: 3 , SEQ ID No: 374, SEQ ID No: 375, SEQ ID No: 380, SEQ ID No: 382, SEQ ID No: 383, SEQ ID No: 385, SEQ ID No: 386, SEQ ID No: 388, SEQ ID No: 392, SEQ ID No: 394, SEQ ID No: 402, SEQ ID No: 401, SEQ ID No: 430, SEQ ID No: 414, SEQ ID No: 415, SEQ ID No: 417, SEQ ID No: 418, SEQ ID No: 430, SEQ ID No: 431, SEQ ID No: 435, SEQ ID No: 437, SEQ ID No: 439, SEQ ID No: 440, SEQ ID No: 443, SEQ ID No: 444, SEQ ID No: 45 , SEQ ID No: 4 , SEQ ID No: 447, SEQ ID No: 448, SEQ ID No: 449, SEQ ID No: 450, SEQ ID No: 451, SEQ ID No: 452, SEQ ID No: 453, SEQ ID No: 454, SEQ ID No: 455, SEQ ID No: 456, SEQ ID No: 457, SEQ ID No: 45, SEQ ID No: 459, SEQ ID No: 460, SEQ ID No: 461, SEQ ID No: 462, SEQ ID No: 463, SEQ ID No: 464, SEQ ID No: 465, SEQ ID No: 466, SEQ ID No: 467, SEQ ID No: 468 and their respective complements.
22. A library of polynucleotide sequences for distinguishing a person with cancer from a person without cancer comprising SEQ ID No: 1, SEQ ID No: 7, SEQ ID No: 8, SEQ ID No: 39, SEQ ID No: 40, SEQ ID No: 46, SEQ ID No: 54, SEQ ID No: 55, SEQ ID No: 76, SEQ ID No: 91, SEQ ID No: 115, SEQ ID No: 116, SEQ ID No: 126, SEQ ID No: 127, SEQ ID No: 138, SEQ ID No: 139, SEQ ID No: 141, SEQ ID No: 142, SEQ ID No: 147, SEQ ID No: 148, SEQ ID No: 156, SEQ ID No: 157, SEQ ID No: 162, SEQ ID No: 168, SEQ ID No: 178, SEQ ID No: 204, SEQ ID No: 205, SEQ ID No: 216, SEQ ID No: 230, SEQ ID No: 231, SEQ ID No: 248, SEQ ID No: 250, SEQ ID No: 251, SEQ ID No: 265, SEQ ID No: 271, SEQ ID No: 308, SEQ ID No: 309, SEQ ID No: 311, SEQ ID No: 312, SEQ ID No: 317, SEQ ID No: 324, SEQ ID No: 345, SEQ ID No: 350, SEQ ID No: 368, SEQ ID No: 369, SEQ ID No: 417, SEQ ID No: 418, SEQ ID No: 430, SEQ ID No: 431, SEQ ID No: 437, SEQ ID No: 440, SEQ ID No: 450, SEQ ID No: 452, SEQ ID No: 453, SEQ ID No: 454, SEQ ID No: 460andSEQ ID No: 468.
23. A library of polynucleotide sequences for detecting hormone-sensitive tumors comprising SEQ ID No: 22, SEQ ID No: 23, SEQ ID No: 59, SEQ ID No: 60, SEQ ID No: 76, SEQ ID No: 82, SEQ ID No: 97, SEQ ID No: 98, SEQ ID No: 135, SEQ ID No: 136, SEQ ID No: 156, SEQ ID No: 157, SEQ ID No: 255, SEQ ID No: 279, SEQ ID No: 280, SEQ ID No: 322, SEQ ID No: 323, SEQ ID No: 331, SEQ ID No: 332, SEQ ID No: 334, SEQ ID No: 335, SEQ ID No: 345, SEQ ID No: 352, SEQ ID No: 354, SEQ ID No: 385, SEQ ID No: 386, SEQ ID No: 394, SEQ ID No: 402, SEQ ID No: 414, SEQ ID No: 415, SEQ ID No: 443 and SEQ ID No: 457.
24. A library of polynucleotide sequences for distinguishing a tumor in which a lymph node has been invaded by a tumor cell from a tumor in which a lymph node has not been invaded by a tumor cell comprising SEQ ID No: 16, SEQ ID No: 22, SEQ ID No: 23, SEQ ID No: 39, SEQ ID No: 40, SEQ ID No: 57, SEQ ID No: 76, SEQ ID No: 82, SEQ ID No: 97, SEQ ID No: 98, SEQ ID No: 135, SEQ ID No: 136, SEQ ID No: 216, SEQ ID No: 238, SEQ ID No: 239, SEQ ID No: 250, SEQ ID No: 251, SEQ ID No: 322, SEQ ID No: 323, SEQ ID No: 326, SEQ ID No: 327, SEQ ID No: 329, SEQ ID No: 334, SEQ ID No: 335, SEQ ID No: 374, SEQ ID No: 375, SEQ ID No: 402, SEQ ID No: 430, SEQ ID No: 439, SEQ ID No: 444, SEQ ID No: 445 and SEQ ID No: 457.
25. A library of polynucleotide sequences for distinguishing anthracycline-sensitive tumors from anthracyline-insensitive tumors comprising SEQ ID No: 22, SEQ ID No: 23, SEQ ID No: 48, SEQ ID No: 49, SEQ ID No: 51, SEQ ID No: 52, SEQ ID No: 59, SEQ ID No: 60, SEQ ID No: 65, SEQ ID No: 66, SEQ ID No: 73, SEQ ID No: 74, SEQ ID No: 76, SEQ ID No: 82, SEQ ID No: 86, SEQ ID No: 135, SEQ ID No: 136, SEQ ID No: 156, SEQ ID No: 157, SEQ ID No: 178, SEQ ID No: 194, SEQ ID No: 226, SEQ ID No: 308, SEQ ID No: 309, SEQ ID No: 326, SEQ ID No: 327, SEQ ID No: 331, SEQ ID No: 332, SEQ ID No: 354, SEQ ID No: 385, SEQ ID No: 386, SEQ ID No: 392, SEQ ID No: 394, SEQ ID No: 433, SEQ ID No-. 434, SEQ ID No: 444 and SEQ ID No: 456.
26. A method of detecting differentially expressed genes correlated with a cancer comprising detecting at least one polynucleotide sequence or subsequence of a polynucleotide library according to claim 2 or detecting at least one product encoded by said polynucleotide library in a sample obtained from a patient.
27. A method according to claim 26 further comprising comparing an amount of said at least one polynucleotide sequence or subsequence or product encoded by said polynucleotide sequence with an amount of said polynucleotide sequence or subsequence or product encoded by said polynucleotide sequence or subsequence obtained from a control sample.
28. A method according to claim 26 comprising extracting mRNA from said polynucleotide sample.
29. A method according to claim 28 comprising reverse transcribing said mRNA to cDNA.
30. The method according to claim 26 comprising hybridizing said at least one polynucleotide sequence or subsequence with mRNA or cDNA from the polynucleotide sample.
31. The method of claim 26 comprising detecting, diagnosing, staging, monitoring, predicting, preventing or treating conditions associated with cancer.
32. The method according to claim 26 wherein the product encoded by any of the polynucleotide sequences or subsequences is involved in a receptor-ligand reaction on which detection is based.
33. A method for screening an anti-tumor agent comprising the method according to claim 26 wherein the polynucleotide sample has been treated with an anti-tumor agent to be screened.
34. A polynucleotide library according to claim 1 wherein said sequences or subsequences correspond substantially to any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets comprising: SET 1: (SEQ ID No: 1; SEQ ID No: 2); SET 2: (SEQ ID No: 3; SEQ ID No: 4); SET 3: (SEQ ID No: 5; SEQ ID No: 6); SET 4: (SEQ ID No: 7;SEQ ID No: 8); SET 5: (SEQ ID No: 9; SEQ ID No: 10); SET 6: (SEQ ID No: 1: SEQ ID No: 12); SET 7: (SEQ ID No: 13; SEQ ID No: 14;SEQ ID No: 15); SET 8: (SEQ ID No: 16); SET 9: (SEQ ID No: 17; SEQ ID No: 18; SEQ ID No: 19); SET 10: (SEQ ID No: 20; SEQ ID No: 21); SET 11: (SEQ ID No: 22; SEQ ID No: 23; SEQ ID No: 24); SET 12: (SEQ ID No: 25; SEQ ID No: 26); SET 13: (SEQ ID No: 27; SEQ ID No: 28; SEQ ID No: 29); SET 14: (SEQ ID No: 30; SEQ ID No: 31); SET 15: (SEQ ID No: 32; SEQ ID No: 33; SEQ ID No: 34); SET 16: (SEQ ID No: 35); SET 17: (SEQ ID No: 36; SEQ ID No: 37; SEQ ID No: 38); SET 18: (SEQ ID No: 39; SEQ ID No: 40; SEQ ID No: 41); SET 19: (SEQ ID No: 42; SEQ ID No: 43); SET 20: (SEQ ID No: 44; SEQ ID No: 45); SET 21: (SEQ ID No: 46; SEQ ID No: 47); SET 22: (SEQ ID No: 48; SEQ ID No: 49; SEQ ID No: 50); SET 23: (SEQ ID No: 51; SEQ ID No: 52; SEQ ID No: 53); SET 24: (SEQ ID No: 54; SEQ ID No: 55; SEQ ID No: 56); SET 25: (SEQ ID No: 57; SEQ ID No: 58); SET 26: (SEQ ID No: 59; SEQ ID No: 60; SEQ ID No: 61); SET 27: (SEQ ID No: 62; SEQ ID No: 63; SEQ ID No: 64); SET 28 : (SEQ ID No: 65; SEQ ID No: 66; SEQ ID No: 67); SET 29 : (SEQ ID No: 68; SEQ ID No: 69; SEQ ID No: 70); SET 30: (SEQ ID No: 71; SEQ ID No: 72); SET 31: (SEQ ID No: 73; SEQ ID No: 74; SEQ ID No: 75); SET 32: (SEQ ID No: 76; SEQ ID No: 77; SEQ ID No: 78); SET 33: (SEQ ID No: 79; SEQ ID No: 80; SEQ ID No: 81); SET 34: (SEQ ID No: 82; SEQ ID No: 83); SET 35: (SEQ ID No: 84; SEQ ID No: 85); SET 36: (SEQ ID No: 86; SEQ ID No: 87); SET 37: (SEQ ID No: 88; SEQ ID No: 89; SEQ ID No: 90); SET 38: (SEQ ID No: 91; SEQ ID No: 92; SEQ ID No: 93); SET 39: (SEQ ID No: 94; SEQ ID No: 95; SEQ ID No: 96); SET 40: (SEQ ID No: 97; SEQ ID No: 98; SEQ ID No: 99) ; SET 41: (SEQ ID No: 100; SEQ ID No: 101; SEQ ID No: 78); SET 42: (SEQ ID No: 102; SEQ ID No: 103); SET 43: (SEQ ID No: 104; SEQ ID No: 105); SET 44: (SEQ ID No: 106; SEQ ID No: 107; SEQ ID No: 108); SET 45: (SEQ ID No: 109; SEQ ID No: 110); SET 46: (SEQ ID No: 111; SEQ ID No: 112; SEQ ID No: 113); SET 47: (SEQ ID No: 114); SET 48: (SEQ ID No: 115; SEQ ID No: 116; SEQ ID No: 117); SET 49: (SEQ ID No: 118; SEQ ID No: 119); SET 50: (SEQ ID No: 120; SEQ ID No: 121); SET 51: (SEQ ID No: 122; SEQ ID No: 78); SET 52: (SEQ ID No: 123; SEQ ID No: 124; SEQ ID No: 125); SET 53: (SEQ ID No: 126; SEQ ID No: 127; SEQ ID No: 128); SET 54: (SEQ ID No: 129; SEQ ID No: 130); SET 55: (SEQ ID No: 131; SEQ ID No: 132); SET 56: (SEQ ID No: 133; SEQ ID No: 134); SET 57: (SEQ ID No: 135; SEQ ID No: 136; SEQ ID No: 137); SET 58: (SEQ ID No: 138; SEQ ID No: 139; SEQ ID No: 140); SET 59: (SEQ ID No: 141; SEQ ID No: 142; SEQ ID No: 143); SET 60: (SEQ ID No: 144; SEQ ID No: 145; SEQ ID No: 146) SET 61: (SEQ ID No: 147; SEQ ID No: 148; SEQ ID No: 149); SET 62: (SEQ ID No: 150; SEQ ID No: 151; SEQ ID No: 152); SET 63: (SEQ ID No: 153; SEQ ID No: 154; SEQ ID No: 155); SET 64: (SEQ ID No: 156; SEQ ID No: 157; SEQ ID No: 158); SET 65: (SEQ ID No: 159; SEQ ID No: 160; SEQ ID No: 161); SET 66 : (SEQ ID No: 162; SEQ ID No: 163); SET 67: (SEQ ID No: 164; SEQ ID No: 165); SET 68: (SEQ ID No: 166; SEQ ID No: 167; SEQ ID No: 152) ;SET 69: (SEQ ID No: 168; SEQ ID No: 169; SEQ ID No: 170); SET 70: (SEQ ID No: 171; SEQ ID No: 172); SET 71: (SEQ ID No: 173; SEQ ID No: 174; SEQ ID No: 175); SET 72: (SEQ ID No: 176; SEQ ID No: 177); SET 73 : (SEQ ID No: 178; SEQ ID No: 179); SET 74: (SEQ ID No: 180; SEQ ID No: 181; SEQ ID No: 182); SET 75 : (SEQ ID No: 183; SEQ ID No: 184); SET 76: (SEQ ID No: 185); SET 77: (SEQ ID No: 186); SET 78: (SEQ ID No: 187; SEQ ID No: 188); SET 79: (SEQ ID No: 189; SEQ ID No: 190; SEQ ID No: 191); SET 80: (SEQ ID No: 192; SEQ ID No: 193) ;SET 81: (SEQ ID No: 194; SEQ ID No: 195); SET 82: (SEQ ID No: 196; SEQ ID No: 197; SEQ ID No: 198); SET 83: (SEQ ID No: 199; SEQ ID No: 200); SET 84: (SEQ ID No: 201; SEQ ID No: 202; SEQ ID No: 203); SET 85: (SEQ ID No: 204; SEQ ID No: 205); SET 86: (SEQ ID No: 206; SEQ ID No: 207); SET 87: (SEQ ID No: 208; SEQ ID No: 209); SET 88: (SEQ ID No: 210; SEQ ID No: 211); SET 89: (SEQ ID No: 212; SEQ ID No: 213); SET 90: (SEQ ID No: 214; SEQ ID No: 215); SET 91: (SEQ ID No: 216; SEQ ID No: 217); SET 92: (SEQ ID No: 218; SEQ ID No: 219; SEQ ID No: 220); SET 93 : (SEQ ID No: 221; SEQ ID No: 222); SET 94: (SEQ ID No: 223; SEQ ID No: 224; SEQ ID No: 225); SET 95: (SEQ ID No: 226; SEQ ID No: 227); SET 96: (SEQ ID No: 228; SEQ ID No: 229); SET 97: (SEQ ID No: 230; SEQ ID No: 231; SEQ ID No: 232); SET 98: (SEQ ID No: 233; SEQ ID No: 234); SET 99: (SEQ ID No: 235; SEQ ID No: 236; SEQ ID No: 237) ;SET 100 : (SEQ ID No: 238; SEQ ID No: 239); SET 101: (SEQ ID No: 240; SEQ ID No: 241); SET 102: (SEQ ID No: 242; SEQ ID No: 243; SEQ ID No: 244); SET 103: (SEQ ID No: 245; SEQ ID No: 246; SEQ ID No: 247); SET 104: (SEQ ID No: 248; SEQ ID No: 249); SET 105 :(SEQ ID No: 250; SEQ ID No: 251; SEQ ID No: 252); SET 106: (SEQ ID No: 253; SEQ ID No: 254); SET 107: (SEQ ID No: 255; SEQ ID No: 256); SET 108: (SEQ ID No: 257; SEQ ID No: 258); SET 109 (SEQ ID No: 259; SEQ ID No: 260; SEQ ID No: 261); SET 110: (SEQ ID No: 262; SEQ ID No: 200); SET 11: (SEQ ID No: 263; SEQ ID No: 264); SET 112: (SEQ ID No: 265; SEQ ID No: 266) SET 113 :(SEQ ID No: 267; SEQ ID No: 268); SET 114: (SEQ ID No: 269; SEQ ID No: 270); SET 115: (SEQ ID No: 271; SEQ ID No: 272); SET 116: (SEQ ID No: 273; SEQ ID No: 274); SET 117: (SEQ ID No: 275; SEQ ID No: 276); SET 118: (SEQ ID No: 277; SEQ ID No: 278); SET 119 :(SEQ ID No: 279; SEQ ID No: 280; SEQ ID No: 281); SET 120: (SEQ ID No: 282; SEQ ID No: 283; SEQ ID No: 276); SET 121: (SEQ ID No: 284; SEQ ID No: 285); SET 122: (SEQ ID No: 286; SEQ ID No: 287; SEQ ID No: 288); SET 123: (SEQ ID No: 289; SEQ ID No: 290); SET 124: (SEQ ID No: 291; SEQ ID No: 292); SET 125 : (SEQ ID No: 293; SEQ ID No: 294; SEQ ID No: 295); SET 126: (SEQ ID No: 296; SEQ ID No: 297); SET 127: (SEQ ID No: 298; SEQ ID No: 299; SEQ ID No: 300); SET 128: (SEQ ID No: 301; SEQ ID No: 302; SEQ ID No: 288); SET 129: (SEQ ID No: 303; SEQ ID No: 304); SET 130: (SEQ ID No: 305; SEQ ID No: 306; SEQ ID No: 307); SET 131: (SEQ ID No: 308; SEQ ID No: 309; SEQ ID No: 310); SET 132: (SEQ ID No: 311; SEQ ID No: 312; SEQ ID No: 313); SET 133: (SEQ ID No: 314; SEQ ID No: 315; SEQ ID No: 316); SET 134: (SEQ ID No: 317; SEQ ID No: 318); SET 135 : (SEQ ID No: 319; SEQ ID No: 320; SEQ ID No: 321); SET 136: (SEQ ID No: 322; SEQ ID No: 323); SET 137: (SEQ ID No: 324; SEQ ID No: 325) ; SET 138: (SEQ ID No: 326; SEQ ID No: 327; SEQ ID No: 328); SET 139: (SEQ ID No: 329; SEQ ID No: 330); SET 140: (SEQ ID No: 331;SEQ ID No: 332;SEQ ID No: 333); SET 141: (SEQ ID No: 334; SEQ ID No: 335; SEQ ID No: 336); SET 142: (SEQ ID No: 337; SEQ ID No: 338; SEQ ID No: 117); SET 143: (SEQ ID No: 339; SEQ ID No: 340;SEQ ID No: 341); SET 144: (SEQ ID No: 342; SEQ ID No: 343; SEQ ID No: 344); SET 145 : (SEQ ID No: 345; SEQ ID No: 346); SET 146: (SEQ ID No: 347; SEQ ID No: 348; SEQ ID No: 349); SET 147: (SEQ ID No: 350; SEQ ID No: 351); SET 148: (SEQ ID No: 352; SEQ ID No: 353); SET 149: (SEQ ID No: 354; SEQ ID No: 355); SET 150: (SEQ ID No: 356; SEQ ID No: 357); SET 151: (SEQ ID No: 358; SEQ ID No: 359; SEQ ID No: 360); SET 152: (SEQ ID No: 361; SEQ ID No: 31); SET 153: (SEQ ID No: 362; SEQ ID No: 363; SEQ ID No: 364); SET 154: (SEQ ID No: 365; SEQ ID No: 366; SEQ ID No: 367); SET 155: (SEQ ID No: 368; SEQ ID No: 369; SEQ ID No: 300); SET 156: (SEQ ID No: 370; SEQ ID No: 371); SET 157: (SEQ ID No: 372; SEQ ID No: 373; SEQ ID No: 108); SET 158: (SEQ ID No: 374; SEQ ID No: 375; SEQ ID No: 376); SET 159: (SEQ ID No: 377; SEQ ID No: 378; SEQ ID No: 379); SET 160: (SEQ ID No: 380; SEQ ID No: 381); SET 161: (SEQ ID No: 382; SEQ ID No: 383; SEQ ID No: 384); SET 162: (SEQ ID No: 385; SEQ ID No: 386; SEQ ID No: 387); SET 163: (SEQ ID No: 388; SEQ ID No: 389; SEQ ID No: 390); SET 164: (SEQ ID No: 391; SEQ ID No: 392; SEQ ID No: 393); SET 165: (SEQ ID No: 394; SEQ ID No: 395); SET 166: (SEQ ID No: 396; SEQ ID No: 397; SEQ ID No: 398); SET 167: (SEQ ID No: 399; SEQ ID No: 400; SEQ ID No: 117) ;SET 168: (SEQ ID No: 401); SET 169: (SEQ ID No: 402; SEQ ID No: 403); SET 170: (SEQ ID No: 404; SEQ ID No: 405; SEQ ID No: 318); SET 171: (SEQ ID No: 406; SEQ ID No: 407; SEQ ID No: 408); SET 172: (SEQ ID No: 409; SEQ ID No: 410; SEQ ID No: 411); SET 173: (SEQ ID No: 412; SEQ ID No: 413); SET 174: (SEQ ID No: 414; SEQ ID No: 415; SEQ ID No: 416); SET 175: (SEQ ID No: 417; SEQ ID No: 418; SEQ ID No: 419); SET 176: (SEQ ID No: 420; SEQ ID No: 421; SEQ ID No: 422); SET 177: (SEQ ID No: 423; SEQ ID No: 424; SEQ ID No: 425); SET 178: (SEQ ID No: 426; SEQ ID No: 427; SEQ ID No: 428); SET 179: (SEQ ID No: 429; SEQ ID No: 408); SET 180: (SEQ ID No: 430); SET 181: (SEQ ID No: 431); SET 182: (SEQ ID No: 432); SET 183: (SEQ ID No: 433; SEQ ID No: 434); SET 184: (SEQ ID No: 435; SEQ ID No: 436); SET 185: (SEQ ID No: 437); SET 186: (SEQ ID No: 438; SEQ ID No: 439); SET 187: (SEQ ID No: 440; SEQ ID No: 441); SET 188: (SEQ ID No: 442); SET 189: (SEQ ID No: 443); SET 190: (SEQ ID No: 444); SET 191: (SEQ ID No: 329; SEQ ID No: 330; SEQ ID No: 345); SET 192: (SEQ ID No: 446; SEQ ID No: 447); SET 193 : (SEQ ID No: 380; SEQ ID No: 381; SEQ ID No: 448); SET 194: (SEQ ID No: 449); SET 195: (SEQ ID No: 271; SEQ ID No: 272; SEQ ID No: 450); SET 196: (SEQ ID No: 84; SEQ ID No: 85; SEQ ID No: 451); SET 197: (SEQ ID No: 452); SET 198 : (SEQ ID No: 453); SET 199 :(SEQ ID No: 454); SET 200: (SEQ ID No: 183; SEQ ID NO: 184; SEQ ID No: 455); SET 201: (SEQ ID No: 456); SET 202: (SEQ ID No: 402; SEQ ID No: 403; SEQ ID No: 457); SET 203: (SEQ ID No: 458); SET 204: (SEQ ID No: 459); SET 205: (SEQ ID No: 460); SET 206: (SEQ ID No: 461); SET 207: (SEQ ID No: 462); SET 208: (SEQ ID No: 463); SET 209: (SEQ ID No: 464); SET 210: (SEQ ID No: 465); SET 211: (SEQ ID No: 466); SET 212: (SEQ ID No: 467); SET 213: (SEQ ID No: 468).
35. The library of claim 1 wherein the pool of polynucleotide sequences or subsequences correspond substantially to any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets comprising: SET 1: (SEQ ID No: l; SEQ ID No: 2); SET 4: (SEQ ID No: 7 ; SEQ ID No: 8); SET 18: (SEQ ID No: 39 ; SEQ ID No: 40 ; SEQ ID No: 41); SET 21: (SEQ ID No: 46 ; SEQ ID No: 47); SET 24: (SEQ ID No: 54; SEQ ID No: 55; SEQ ID No: 56); SET 32: (SEQ ID No: 76; SEQ ID No: 77 ; SEQ ID No: 78); SET 38: (SEQ ID No: 91; SEQ ID No: 92 ; SEQ ID No: 93); SET 48: (SEQ ID No: 115; SEQ ID No: 116; SEQ ID No: 117); SET 53: (SEQ ID No: 126; SEQ ID No: 127; SEQ ID No: 128); SET 58: (SEQ ID No: 138 ; SEQ ID No: 139 ; SEQ ID No: 140); SET 59: (SEQ ID No: 141; SEQ ID No: 142 ; SEQ ID No: 143); SET 61: (SEQ ID No: 147 ; SEQ ID No: 148; SEQ ID No: 149); SET 64: (SEQ ID No: 156 ; SEQ ID No: 157 ; SEQ ID No: 158); SET 66: (SEQ ID No: 162 ; SEQ ID No: 163); SET 69: (SEQ ID No: 168 ; SEQ ID No: 169; SEQ ID No: 170); SET 73: (SEQ ID No: 178; SEQ ID No: 179); SET 85: (SEQ ID No: 204; SEQ ID No: 205); SET 88: (SEQ ID No: 210; SEQ ID No: 211); SET 91: (SEQ ID No: 216; SEQ ID No: 217); SET 97: (SEQ ID No: 230; SEQ ID No: 231; SEQ ID No: 232); SET 104: (SEQ ID No: 248; SEQ ID No: 249); SET 105: (SEQ ID No: 250; SEQ ID No: 251; SEQ ID No: 252); SET 112: (SEQ ID No: 265 ; SEQ ID No: 266); SET 113: (SEQ ID No: 267 ; SEQ ID No: 268); SET 115 ; (SEQ ID No: 271; SEQ ID No: 272); SET 131: (SEQ ID No: 308 ; SEQ ID No: 309; SEQ ID No: 310); SET 132: (SEQ ID No: 311; SEQ ID No: 312; SEQ ID No: 313); SET 134: (SEQ ID No: 317; SEQ ID No: 318); SET 137: (SEQ ID No: 324; SEQ ID No: 325); SET 145: (SEQ ID No: 345 ; SEQ ID No: 346); SET 147: (SEQ ID No: 350; SEQ ID No: 351); SET 155: (SEQ ID No: 368 ; SEQ ID No: 369 ; SEQ ID No: 300); SET 175: (SEQ ID No: 417; SEQ ID No: 418; SEQ ID No: 419); SET 180: (SEQ ID No: 430) SET 181: (SEQ ID No: 431); SET 182: (SEQ ID No: 432); SET 185: (SEQ ID No: 437); SET 187: (SEQ ID No: 440; SEQ ID No: 441), wherein said sequences are useful in differentiating a normal cell from a cancer cell.
36. A polynucleotide library according to claim 1 wherein the pool of polynucleotide sequences or subsequences correspond substantially to any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets comprising: SET 32: (SEQ ID No: 76; SEQ ID No: 77; SEQ ID No: 78); SET 73: (SEQ ID No: 178; SEQ ID No: 179); SET 131: (SEQ ID No: 308; SEQ ID No: 309 ; SEQ ID No: 310); SET 145: (SEQ ID No: 345; SEQ ID No: 346) and SET 181: (SEQ ID No: 431). and of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequences sets comprising: SET 38 : (SEQ ID No: 91; SEQ ID No: 92 ; SEQ ID No: 93); SET 58 : (SEQ ID No: 138 SEQ ID No: 139 ; SEQ ID No: 140); SET 61: (SEQ ID No: 147 ; SEQ ID No: 148; SEQ ID No: 149); SET 69: (SEQ ID No: 168 ; SEQ ID No: 169 ; SEQ ID No: 170) and SET 182: (SEQ ID No: 432).
37. The library of claim 1 wherein the pool of polynucleotide sequences or subsequences correspond substantially to any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets comprising: SET 11: (SEQ ID No: 22 ; SEQ ID No: 23 ; SEQ ID No: 24); SET 26: (SEQ ID No: 59; SEQ ID No: 60;SEQ ID No: 61);SET 32:(SEQ ID No: 76;SEQ ID No: 77;SEQ ID No: 78);SET 34: (SEQ ID No: 82 ; SEQ ID No: 83); SET 40: (SEQ ID No: 97 ; SEQ ID No: 98 ; SEQ ID No: 99); SET 57: (SEQ ID No: 135; SEQ ID No: 136; SEQ ID No: 137); SET 64: (SEQ ID No: 156; SEQ ID No: 157; SEQ ID No: 158); SET 107: (SEQ ID No: 255; SEQ ID No: 256); SET 119: (SEQ ID No: 279; SEQ ID No: 280; SEQ ID No: 281); SET 136: (SEQ ID No: 322; SEQ ID No: 323); SET 140: (SEQ ID No: 331; SEQ ID No: 332; SEQ ID No: 333); SET 141: (SEQ ID No: 334; SEQ ID No: 335; SEQ ID No: 336); SET 145: (SEQ ID No: 345; SEQ ID No: 346); SET 148: (SEQ ID No: 352; SEQ ID No: 353); SET 149: (SEQ ID No: 354; SEQ ID No: 355); SET 162: (SEQ ID No: 385; SEQ ID No: 386; SEQ ID No: 387); SET 165: (SEQ ID No: 394; SEQ ID No: 395); SET 169 (SEQ ID No: 402; SEQ ID No: 403); SET 174: (SEQ ID No: 414; SEQ ID No: 415 ; SEQ ID No: 416) and SET 188: (SEQ ID No: 442), wherein said sequences are useful in detecting a hormone-sensitive tumor cell.
38. The library of claim 37 wherein the pool of polynucleotide sequences or subsequences correspond substantially to any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets comprising: SET 32: (SEQ ID No: 76; SEQ ID No: 77; SEQ ID No: 78); SET 136: (SEQ ID No: 322; SEQ ID No: 323); SET 145 : (SEQ ID No: 345 ; SEQ ID No: 346); SET 149: (SEQ ID No: 354; SEQ ID No: 355) and SET 169: (SEQ ID No: 402; SEQ ID No: 403) and of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets comprising: SET 11: (SEQ ID No: 22 ; SEQ ID No: 23; SEQ ID No: 24); SET 40: (SEQ ID No: 97; SEQ ID No: 98; SEQ ID No: 99); SET 57: (SEQ ID No: 135 ; SEQ ID No: 136; SEQ ID No: 137); SET 119: (SEQ ID No: 279; SEQ ID No: 280; SEQ ID No: 281) and SET 174: (SEQ ID No: 414; SEQ ID No: 415 ; SEQ ID No: 416).
39. The library of claim 1 wherein the pool of polynucleotide sequences or subsequences correspond substantially to any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets comprising: SET 8: (SEQ ID No: 16); SET ll: (SEQ ID No: 22; SEQ ID No: 23; SEQ ID No: 24); SET 18: (SEQ ID No: 39; SEQ ID No: 40; SEQ ID No: 41); SET 25: (SEQ ID No: 57; SEQ ID No: 58); SET 32: (SEQ ID No: 76; SEQ ID No: 77; SEQ ID No: 78); SET 34: (SEQ ID No: 82; SEQ ID No: 83); SET 40: (SEQ ID No: 97; SEQ ID No: 98; SEQ ID No: 99); SET 49: (SEQ ID No: 118; SEQ ID No: 119);SET 57:(SEQ ID No: 135;SEQ ID No: 136;SEQ ID No: 137) ;SET 91: (SEQ ID No: 216; SEQ ID No: 217); SET 100 : (SEQ ID No: 238 ; SEQ ID No: 239); SET 105 :(SEQ ID No: 250; SEQ ID No: 251: SEQ ID No: 252); SET 136: (SEQ ID No: 322 ; SEQ ID No: 323); SET 138 : (SEQ ID No: 326 ; SEQ ID No: 327; SEQ ID No: 328); SET 139 : (SEQ ID No: 329 ; SEQ ID No: 330); SET 141: (SEQ ID No: 334 ; SEQ ID No: 335 ; SEQ ID No: 336); SET 158 :(SEQ ID No: 374; SEQ ID No: 375 ; SEQ ID No: 376); SET 169: (SEQ ID No: 402; SEQ ID No: 403); SET 180: (SEQ ID No: 430) and SET 186: (SEQ ID No: 438 ; SEQ ID No: 439), wherein said sequences are useful in differentiating a tumor in which a lymph node has been invaded by a tumor cell from a tumor in which a lymph node has not been invaded by a tumor cell.
40. The library of claim 39 wherein the pool of polynucleotide sequences or subsequences correspond substantially to any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets comprising: SET 18 : (SEQ ID No: 39 ; SEQ ID No: 40 ; SEQ ID No: 41); SET 32: (SEQ ID No: 76 SEQ ID No: 77; SEQ ID No: 78); SET 57: (SEQ ID No: 135 ; SEQ ID No: 136; SEQ ID No: 137); SET 91: (SEQ ID No: 216; SEQ ID No: 217) and SET 105 (SEQ ID No: 250; SEQ ID No: 251; SEQ ID No: 252) and of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets comprising: SET 11: (SEQ ID No: 22; SEQ ID No: 23; SEQ ID No: 24); SET 40: (SEQ ID No: 97; SEQ ID No: 98 SEQ ID No: 99);SET 49: (SEQ ID No: 118; SEQ ID No: 119);SET 100: (SEQ ID No: 238; SEQ ID No: 239) and SET 141: (SEQ ID No: 334; SEQ ID No: 335 ; SEQ ID No: 336).
41. The library of claims 1 or 2 wherein the pool of polynucleotide sequences or subsequences correspond substantially to any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets comprising: SET 1: (SEQ ID No: 22; SEQ ID No: 23; SEQ ID No: 24); SET 22: (SEQ ID No: 48; SEQ ID No: 49 ; SEQ ID No: 50); SET 23 :(SEQ ID No: 51; SEQ ID No: 52 ; SEQ ID No: 53); SET 26: (SEQ ID No: 59; SEQ ID No: 60; SEQ ID No: 61); SET 28: (SEQ ID No: 65 ; SEQ ID No: 66 ; SEQ ID No: 67); SET 31: (SEQ ID No: 73; SEQ ID No: 74; SEQ ID No: 75); SET 32: (SEQ ID No: 76; SEQ ID No: 77 ; SEQ ID No: 78); SET 34: (SEQ ID No: 82 ; SEQ ID No: 83); SET 49: (SEQ ID No: 118 ; SEQ ID No: 119); SET 57: (SEQ ID No: 135 ; SEQ ID No: 136 ; SEQ ID No: 137); SET 64: (SEQ ID No: 156; SEQ ID No: 157 ; SEQ ID No: 158); SET 73: (SEQ ID No: 178; SEQ ID No: 179); SET 77: (SEQ ID No: 186); SET 81 : (SEQ ID No: 194; SEQ ID No: 195); SET 95: (SEQ ID No: 226 ; SEQ ID No: 227); SET 131: (SEQ ID No: 308 ; SEQ ID No: 309 ;SEQ ID No: 310); SET 138: (SEQ ID No: 326; SEQ ID No: 327 ; SEQ ID No: 328); SET 140: (SEQ ID No: 331; SEQ ID No: 332; SEQ ID No: 333); SET 149: (SEQ ID No: 354; SEQ ID No: 355); SET 162: (SEQ ID No: 385; SEQ ID No: 386 ; SEQ ID No: 387); SET 164: (SEQ ID No: 391; SEQ ID No: 392; SEQ ID No: 393); SET 165: (SEQ ID No: 394; SEQ ID No: 395) and SET 183: (SEQ ID No: 433; SEQ ID No: 434). wherein said sequences are usefal in differentiating anthracycline-sensitive tumors from anthracycline-insensitive tumors.
42. A library according to claim 41 wherein the pool of polynucleotide sequences or subsequences correspond substantially to any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets consisting of SET 32: (SEQ ID No: 76; SEQ ID No: 77, SEQ ID No: 78); SET 36: (SEQ ID No: 322; SEQ ID No: 323); SET 145: (SEQ ID No: 345; SEQ ID No: 346); SET 149: (SEQ ID No: 354; SEQ ID No: 355); SET 169: (SEQ ID No: 402; SEQ ID No: 403) and of at least one polynucleotide sequence sets consisting of: SET 11: (SEQ ID No: 22; SEQ ID No: 23; SEQ ID No: 24); SET 40: (SEQ ID No: 97; SEQ ID No: 98; SEQ ID No: 99); SET 57: (SEQ ID No: 135; SEQ ID No: 136; SEQ ID No: 137); SET 119: (SEQ ID No: 279; SEQ ID No: 280; SEQ ID No: 281); SET 174: (SEQ ID No: 414; SEQ ID No: 415; SEQ ID No: 416).
43. The library of claim 1 wherein the pool of polynucleotide sequences or subsequences correspond substantially to any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequences sets comprising: SET 14 (SEQ ID No: 30; SEQ ID No: 31); SET 23 (SEQ ID No: 51; SEQ ID No: 52; SEQ ID No: 53); SET 25 (SEQ ID No: 57; SEQ ID No: 58); SET 27 (SEQ ID No: 62; SEQ ID No: 63; SEQ ID No: 64); SET 28 (SEQ ID No: 65; SEQ ID No: 66; SEQ ID No: 67); SET 32 (SEQ ID No: 76; SEQ ID No: 77; SEQ ID No: 78); SET 39 (SEQ ID No: 94; SEQ ID No: 95; SEQ ID No: 96); SET 41 (SEQ ID No: 100; SEQ ID No: 101; SEQ ID No: 78); SET 44 (SEQ ID No: 106; SEQ ID No: 107; SEQ ID No: 108); SET 48 (SEQ ID No: 115; SEQ ID No: 116; SEQ ID No: 117); SET 51 (SEQ ID No: 122; SEQ ID No: 78); SET 64 (SEQ ID No: 156; SEQ ID No: 157; SEQ ID No: 158); SET 81 (SEQ ID No: 194; SEQ ID No: 195); SET 83 (SEQ ID No: 199; SEQ ID No: 200); SET 91 (SEQ ID No: 216; SEQ ID No: 217); SET 96 (SEQ ID No: 228; SEQ ID No: 229); SET 99 (SEQ ID No: 235; SEQ ID No: 236; SEQ ID No: 237); SET 108 (SEQ ID No: 257; SEQ ID No: 258); SET 110 (SEQ ID No: 262; SEQ ID No: 200); SET 116 (SEQ ID No: 273; SEQ ID No: 274); SET 117 (SEQ ID No: 275; SEQ ID No: 276); SET 118 (SEQ ID No: 277; SEQ ID No: 278); SET 120 (SEQ ID No: 282; SEQ ID No: 283; SEQ ID No: 276); SET 126 (SEQ ID No: 296; SEQ ID No: 297;); SET 142 (SEQ ID No: 337; SEQ ID No: 338; SEQ ID No: 117); SET 144 (SEQ ID No: 342; SEQ ID No: 343; SEQ ID No: 344); SET 149 (SEQ ID No: 354; SEQ ID No: 355); SET 152 (SEQ ID No: 361; SEQ ID No: 31); SET 153 (SEQ ID No: 362; SEQ ID No: 363; SEQ ID No: 364) SET 154 (SEQ ID No: 365; SEQ ID No: 366; SEQ ID No: 367); SET 157 (SEQ ID No: 372; SEQ ID No: 373; SEQ ID No: 108); SET 159 (SEQ ID No: 377; SEQ ID No: 378; SEQ ID No: 379); SET 162 (SEQ ID No: 385; SEQ ID No: 386; SEQ ID No: 387); SET 166 (SEQ ID No: 396; SEQ ID No: 397; SEQ ID No: 398); SET 167 (SEQ ID No: 399; SEQ ID No: 400; SEQ ID No: 117); SET 168 (SEQ ID No: 401); SET 171 (SEQ ID No: 406; SEQ ID No: 407; SEQ ID No: 408); SET 172 (SEQ ID No: 409; SEQ ID No: 410; SEQ ID No: 411); SET 173 (SEQ ID No: 412; SEQ ID No: 413); SET 176 (SEQ ID No: 420; SEQ ID No: 421; SEQ ID No: 422); SET 177 (SEQ ID No: 423; SEQ ID No: 424; SEQ ID No: 425); SET 178 (SEQ ID No: 426; SEQ ID No: 427; SEQ ID No: 428); SET 179 (SEQ ID No: 429; SEQ ID No: 408); SET 184 (SEQ ID No: 435; SEQ ID No: 436); SET 185 (SEQ ID No: 437) wherein said sequences are useful in classifying good and poor prognosis primary breast tumors.
44. The library of claim 1 wherein the pool of polynucleotide sequences or subsequences correspond substantially to any combination of at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets comprising: SET 23 (SEQ ID No: 51; SEQ ID No: 52; SEQ ID No: 53); SET 25 (SEQ ID No: 57 ; SEQ ID No: 58); SET 32 (SEQ ID No: 76 ; SEQ ID No: 77 ; SEQ ID No: 78); SET 41 (SEQ ID No: 100; SEQ ID No: 101; SEQ ID No: 78); SET 48 (SEQ ID No: 115; SEQ ID No: 116; SEQ ID No: 117); SET 51 (SEQ ID No: 122; SEQ ID No: 78); SET 64 (SEQ ID No: 156; SEQ ID No: 157; SEQ ID No: 158); SET 81 (SEQ ID No: 194; SEQ ID No: 195); SET 83 (SEQ ID No: 199; SEQ ID No: 200); SET 91 (SEQ ID No: 216; SEQ ID No: 217); SET 99 (SEQ ID No: 235; SEQ ID No: 236 SEQ ID No: 237); SET 110 (SEQ ID No: 262; SEQ ID No: 200); SET 116 (SEQ ID No: 273; SEQ ID No: 274); SET 142 (SEQ ID No: 337; SEQ ID No: 338; SEQ ID No: 117); SET 144 (SEQ ID No: 342; SEQ ID No: 343; SEQ ID No: 344); SET 149 (SEQ ID No: 354; SEQ ID No: 355); SET 162 (SEQ ID No: 385; SEQ ID No: 386; SEQ ID No: 387); SET 167 (SEQ ID No: 399; SEQ ID No: 400 ; SEQ ID No: 117); SET 171 (SEQ ID No: 406 ; SEQ ID No: 407; SEQ ID No: 408); SET 172 (SEQ ID No: 409 ; SEQ ID No: 410; SEQ ID No: 411); SET 173 (SEQ ID No: 412 ; SEQ ID No: 413); SET 176 (SEQ ID No: 420 ; SEQ ID No: 421; SEQ ID No: 422); SET 177 (SEQ ID No: 423 ; SEQ ID No: 424 ; SEQ ID No: 425); SET 178 (SEQ ID No: 426; SEQ ID No: 427; SEQ ID No: 428); SET 179 (SEQ ID No: 429 ; SEQ ID No: 408); SET 184 (SEQ ID No: 435 ; SEQ ID No: 436); SET 185 (SEQ ID No: 437), and at least one polynucleotide sequence selected among those included in each one of predefined polynucleotide sequence sets comprising: SET 14 (SEQ ID No: 30 ; SEQ ID No: 31); SET 27 (SEQ ID No: 62 ; SEQ ID No: 63; SEQ ID No: 64); SET 28 (SEQ ID No: 65 ; SEQ ID No: 66 ; SEQ ID No: 67); SET 39 (SEQ ID No: 94; SEQ ID No: 95;SEQ ID No: 96); SET44(SEQ ID No: 106;SEQ ID No: 107;SEQ ID No: 108); SET 96 (SEQ ID No: 228 ; SEQ ID No: 229); SET 108 (SEQ ID No: 257 ; SEQ ID No: 258) ; SET 117 (SEQ ID No: 275 ; SEQ ID No: 276); SET 118 (SEQ ID No: 277; SEQ ID No: 278); SET 120 (SEQ ID No: 282 ; SEQ ID No: 283 ; SEQ ID No: 276); SET 126 (SEQ ID No: 296 ; SEQ ID No: 297); SET 152 (SEQ ID No: 361; SEQ ID No: 31); SET 153 (SEQ ID No: 362 ; SEQ ID No: 363; SEQ ID No: 364); SET 154 (SEQ ID No: 365 ; SEQ ID No: 366; SEQ ID No: 367); SET 157 (SEQ ID No: 372 ; SEQ ID No: 373; SEQ ID No: 108); SET 159 (SEQ ID No: 377 ; SEQ ID No: 378; SEQ ID No: 379); SET 166 (SEQ ID No: 396; SEQ ID No: 397; SEQ ID No: 398); SET 168 (SEQ ID No: 401), wherein the combination of overexpression of the genes identified by said first group of cluster sequences with the underexpression of the genes identified by said second group of cluster sequences are useful in classifying good and poor prognosis primary breast tumors.
45. The polynucleotide library of claim 1 wherein said tumor cells are breast tumor cells.
46. The polynucleotide library of claim 1 wherein said polynucleotides are immobilized on a solid support in order to form a polynucleotide array.
47. The polynucleotide library of claim 46 wherein the support is selected from the group consisting of a nylon membrane, nitrocellulose membrane, glass slide, glass beads, membranes on glass support or a silicon chip.
48. A polynucleotide array useful for prognosis or diagnosis of a tumor comprising an immobilized polynucleotide library according to claim 1 or 34.
49. A polynucleotide array useful to differentiate a normal cell from a cancer cell comprising any combination of immobilized polynucleotide sequence sets according to claim 35.
50. A polynucleotide array useful to differentiate a normal cell from a cancer cell comprising any combination of immobilized polynucletide sequence sets according to claim 36.
51. A polynucleotide array useful to detect a hormone-sensitive tumor cell comprising any combination of immobilized polynucleotide sequence sets according to claim 37.
52. A polynucleotide array useful to detect a hormone-sensitive tumor cell comprising any combination of immobilized polynucleotide sequence sets according to claim 38.
53. A polynucleotide array useful to differentiate a tumor in which a lymph node has been invaded by a tumor cell from a tumor in which a lymph node has not been invaded by a tumor cell comprising any combination of immobilized polynucleotide sequence sets according to claim 39.
54. A polynucleotide array useful to differentiate a tumor in which a lymph node has been invaded by a tumor cell from a tumor in which a lymph node has not been invaded by a tumor cell comprising any combination of immobilized polynucleotide sequences sets according to claim 40.
55. A polynucleotide array useful to differentiate anthracycline-sensitive tumors from anthracycline-insensitive tumors comprising any combination of immobilized polynucleotide sequence sets according to claim 41.
56. A polynucleotide array useful to classify good and poor prognosis primary breast tumors comprising any combination of immobilized polynucleotide sequence sets according to claim 42.
57. A polynucleotide array useful to classify good and poor prognosis primary breast tumors comprising any combination of immobilized polynucleotide sequence sets according to claim 43.
58. A polynucleotide array useful to classify good and poor prognosis primary breast tumors comprising any combination of polynucleotide sequence sets according to claim 44.
59. A method for detecting differentially expressed polynucleotide sequences which are correlated with a cancer, said method comprising: obtaining a polynucleotide sample from a patient; reacting said polynucleotide sample with a probe immobilized on a solid support wherein said probe comprises any of the polynucleotide sequences of the polynucleotide library of claim 1 or an expression product encoded by any of the polynucleotide sequences of the polynucleotide library of claim 1;detecting a polynucleotide sample reaction product.
60. The method of claim 58 wherein said polynucleotide sample is labeled before said reacting step.
61. The method of claim 60 wherein the label of the polynucleotide sample is selected from the group consisting of radioactive, colorimetric, enzymatic, molecular amplification, bioluminescent or fluorescent labels.
62. The method of claim 59 furither comprising obtaining a control polynucleotide sample, reacting said control sample with said probe detecting a control sample reaction product, and comparing the amount of said polynucleotide sample reaction product to the amount of said control sample reaction product.
63. The method of claim 59 wherein RNA or mRNA is isolated from said polynucleotide sample.
64. The method of claim 63 wherein mRNA is isolated from said polynucleotide sample and cDNA is obtained by reverse transcription of said mRNA.
65. The method of claim 59 wherein said reacting step is performed by hybridizing the polynucleotide sample RNA with the probe.
66. The method of claim 59 wherein said method is used for detecting, diagnosing, staging, monitoring, predicting, preventing or treating conditions associated with cancer.
67. The method of claim 59 wherein the cancer is breast cancer.
68. The method of claim 59 wherein the product encoded by any of the polynucleotide sequences or polynucleotide sequence sets is involved in a receptor-ligand reaction on which detection is based.
69. A method for screening an anti-tumor agent comprising the method of claim 59 wherein said polynucleotide sample has been treated with an anti-tumor agent to be screened.

CROSS-REFERENCE TO RELATED APPLICATION

[0001] This application claims the benefit of priority of provisional application Serial No. 60/254,090 filed Dec. 8, 2000.

Provisional Applications (1)

	Number	Date	Country
	60254090	Dec 2000	US

Gene expression profiling of primary breast carcinomas using arrays of candidate genes

Information

Publication Number

Date Filed

Date Published

Inventors

CPC

US Classifications

International Classifications

Abstract

Description

Claims

CROSS-REFERENCE TO RELATED APPLICATION

Provisional Applications (1)