Information
-
Patent Application
-
20040191783
-
Publication Number
20040191783
-
Date Filed
March 31, 200321 years ago
-
Date Published
September 30, 200420 years ago
-
CPC
-
US Classifications
-
International Classifications
Abstract
A method and kit comprising reagents for detection and/or quantification of polynucleotide or polypeptide sequences potentially present in a sample, said sequences representing the gene expression associated with different cell phenotypes and functions and differentiating the gene expressed in cancer tissue compared to normal or reference material. The method and kit are especially suited for the identification and/or characterization of cancer tissues and for follow up of patient treatments.
Description
BACKGROUND OF THE INVENTION
[0001] The present invention pertains to the field of diagnosis and prognosis of human breast cancer. In particular, the present invention pertains to a method of diagnosing the onset of breast cancer and allowing a reliable prognosis of its future development. In addition, the present invention relates to a micro-array, containing selected polynucleotides or polypeptides, which enable the quantification of particular differentially expressed genes in tumors for a precise diagnosis and prognosis and eventually curative follow up of breast tumor patients.
[0002] In Western Countries about 1 out of 11 women develop breast cancer, which is second only to lung cancer in tumor associated diseases. Breast cancer is a very heterogeneous disease with a number of so far recognized and still unknown factors being involved. Female hormones have been found to exhibit a significant impact on oncogenes, the transcription and overexpression, respectively, of which may result in the development of breast cancer, including e.g. the amplification of HER-2 and the epidermal growth factor receptor genes, and overexpression of cyclin D1. Likewise, genetic alterations or the loss of tumor suppressor genes, e.g. p53, have been found to also account for the occurrence of breast cancer. Also, in the recent past, two genes termed BRCA1 and BRCA2 have been characterized which are supposed to be implicated in pre-menopausal familial breast cancer.
[0003] A number of factors are deemed to increase a woman's risk of having the disease, including age, history of prior breast cancer, exposure to radiation, hereditary history, upper socioeconomic class, nulliparity, early menarche, late menopause, or age at first pregnancy greater than 30 years. Also, prolonged use of oral contraceptives and long-lasting postmenopausal estrogen replacement are considered to add to the risk.
[0004] Despite the considerable progress in the molecular understanding of the various causatives of breast cancer as well as progresses made in the treatment thereof, e.g. in radio-, chemo-, and hormone-therapy, more than one-third of female patients still succumb to the disease. In most cases death results from a dissemination of cancer cells and their proliferation at secondary sites.
[0005] It is well acknowledged in the art that a diagnosis of breast cancer as early as possible is vital to secure a most favorable outcome for treatment. For this reason, many countries with advanced healthcare systems have instituted screening programs for breast cancer, such as mammography. Abnormal tissues detected during these screening procedures are typically investigated in more detail by clinico-pathological analysis methods. However, due to the cumbersome and sometimes long lasting experimentation involved, new approaches are needed to ensure a better characterization and treatment of the extensively heterogeneous breast tumors.
[0006] One major problem of a generalized use of new technologies in oncological routine is their complexity, their low prognostic relevance due to different factors being involved and above all the costs involved.
[0007] In the past few decades, molecular biology techniques have raised some hope to replace conventional clinico-pathological techniques. To this end a tissue sample, suspected to have developed cancer, is investigated primarily on the basis of genes, in particular their expression, which genes are known and/or suspected to be involved in the onset and progression of cancer.
[0008] So far a variety of different genes have been characterized and found or suspected to be involved in different subtypes of breast cancer.
[0009] In WO/0210436 a particular set of genes is disclosed that are differentially expressed in tumors characterized as high or low mitotic index activity (MAI) tumors.
[0010] In WO/0175160 a method for the stratification of a cancer patient population into various cancer therapy groups is disclosed, based on an analysis by genomic DNA micro-array of multiple gene amplifications or deletions present or absent in the abnormal tissue of each patient. In particular, the teaching laid down herein involves patient stratification into one of at least four cancer therapy groups based on the micro-array analysis of gene amplification or gene deletion at multiple chromosome locations.
[0011] WO/055173 refers to a list of polynucleotides and polypeptides for the detection, prevention and treatment of disorders in the female reproductive system, particularly breast cancer. The document illustrates potential sequences of interest which are supposed to be related to breast or ovarian tumors but does not give any classification of cancers.
[0012] In WO/9906831 there are disclosed genes associated with the development of estrogen independent malignant cell growth, i.e. the BCAR1, BCAR2 and BCAR3 genes.
[0013] Both of WO/02059271 and WO/0194629 propose a list of genes found to be differentially expressed in normal tissue and breast carcinomas. The only gene classification presented in these documents is based on differences between infiltrating lobular and ductal carcinomas, which relates to the sublocalization of cancer in the mammary gland.
[0014] WO/0151628 refers to polynucleotide sequences which were identified through subtracted library to be differentially expressed in breast carcinoma. This publication proposes a classification of the genes according to the origin site of cancer: invasive lobular carcinomas (ILC), clinical invasive ductal carcinomas (IDC) and clinical ductal carcinomas in situ (DCIS) versus normal breast tissue samples. Genes are also classified according to the aggressiveness of the tumor.
[0015] All of the above mentioned documents based their findings on a different expression of particular genes in either normal tissue and tumor tissue, concluding that the differentially expressed genes should be involved in the onset and/or progress of cancer.
[0016] In principle, gene expression may be determined both at the transcriptional (mRNA) and at the translational (protein) level. Methods on the basis of proteins are presently still difficult to use and are limited in the number of proteins to be detected simultaneously. For this reason the main focus resides on assays for gene expression in a biological sample by qualitative and quantitative analysis of its mRNA population (transcriptome), which may be carried out through the use of the so-called “DNA micro-arrays” or “DNA biochips”.
[0017] These DNA micro-arrays comprise solid surfaces bearing multiple cDNA- or oligo- or polynucleotides spotted thereon, that play the role of so called capture probes. These capture probes, which represent genes or parts of genes of different length, e.g. between 10 and 1500 nucleotides, are either chemically synthesized in situ on the surface or laid down using a special device, the “arrayer” (cDNA-based arrays).
[0018] In most studies involving micro-arrays, labeled target cDNAs obtained by reverse transcription from the population of cellular mRNAs are incubated with the array, and the amount of material hybridized to the specific capture probes is determined by various techniques, such as e.g. radioactivity, colorimetry or fluorescence. Micro-arrays have the inherent advantage to detect the expression of genes in parallel with a direct read out of the hybridization results.
[0019] However, most of the DNA micro-arrays commercially available carry several thousands of capture probes and are, therefore, due to this vast amount of sequences to be synthesized, purified, quantified, and to be fixed on the solid support quite expensive and require a rather complicated data analysis. Moreover, they may carry many capture probes devoid of real interest in a perspective of routine breast cancer study, because these are specific to genes unexpressed, unvariable, or which expression level has never been explored in that kind of cancer. Thus, although these “high-density” DNA micro-arrays may give to the basic researcher a means to identify potential novel mRNAs regulated in different tissues at different stages of their normal or abnormal development, they do not provide a data/price ratio high enough to satisfy the clinician's desire for a tool applicable for routine analysis in their everyday clinical activities.
[0020] Accordingly, there remains a need in the art for means that permits a more accurate diagnostics, prognosis and therapy follow up of breast cancer.
SUMMARY OF THE INVENTION
[0021] In consequence, a problem of the present invention resides in providing a novel means for allowing a rapid and reliable diagnosis and prognosis of breast cancer, which may be produced at low costs and which may be performed rapidly and without difficulty.
[0022] This problem has been solved by providing a micro-array for the diagnosis and prognosis of human breast cancer that contains a solid support onto which a plurality of particular polynucleotides or polypeptides are present in the form of an array. These polynucleotides/polypeptides (or fragments thereof) are selected such that specific genes or their complementary, or their products (or fragments thereof) are represented thereon essentially corresponding to two major categories, namely (A) at least one gene or a fragment thereof representing at least 4 out of 6 phenotypes, preferably 5 out of 6 phenotypes, more preferably 6 out of 6 phenotypes, selected from luminal/epithelial, basal/myoepithelial, mesenchymal, the ErbB2, the hormonal phenotypes and the hereditary susceptibility to breast cancer, and (B) at least 10 genes associated with at least 3 cellular and/or house keeping functions.
[0023] The above categories A and B are illustrated in more detail in table I below:
1TABLE I
|
|
Examples of gene groups related to the categories A and B
|
A) Genes related to a breast cancer phenotype
1Luminal/Epithelial phenotype
2Basal/myoepithelial phenotype
3Mesenchymal phenotype
4ErbB2 phenotype
5Hormonal phenotype
6Hereditary phenotype
B1) Cellular Gene Functions
1Adhesion
2Cell cycle regulation and proliferation
3Chemoresistance
4Angiogenesis
5Protein processing and turnover including cleavage, synthesis,
stabilization and transport, proteolysis
6Oxidative metabolism
7Inflammatory response
8Cell structure
B2) House keeping genes
|
[0024] Table II illustrates some specific examples for all of the categories A and B genes:
2TABLE II
|
|
List of 210 genes whose mRNA level is indicative of breast cancer tissues or cells
A
BREAST CANCERB
GENE NAMEGENE PRODUCT NAME(S)PHENOTYPECELL FUNCTION
|
ABCB1ATP-binding cassette, sub-family B, member 1; P-Chemoresistance
glycoprotein; Multidrug resistance protein 1 (MDR1)
ABCC1ATP-binding cassette, sub-family C, member 1;Chemoresistance
Multidrug resistance-associated protein (MRP, MRP-
1)
ABCG2ATP-binding cassette, sub-family G, member 2;Chemoresistance
Breast cancer resistance protein (BCRP);
Placenta-specific ATP-binding cassette transporter
(ABCP)
ACTR1AARP1 actin-related protein 1 homolog AHereditary phenotypeCell structure
AIB3Thyroid hormone receptor binding protein (TRBP);Hormonal phenotype
Cancer-amplified transcriptional coactivator ASC-2;
Nuclear receptor coactivator RAP250; Peroxisome
proliferator-activated receptor interacting protein
(PRIP); KIAA0181 protein
APEXAPEX nuclease (multifunctional DNA repair enzyme) 1Hereditary phenotype
ARVCFArmadillo repeat gene deletes in velocardiofacialHereditary phenotypeAdhesion
syndrome
ATMAtaxia telangiectasia mutatedCell cycle regulation and
proliferation
BAG1BCL-2-associated athanogeneCell cycle regulation and
proliferation
BAK1Bcl-2-antagonist/killer 1Cell cycle regulation and
proliferation
BAXBCL2-associated X proteinCell cycle regulation and
proliferation
BCAR1Breast cancer anti-estrogen resistance-1;Hormonal phenotype
p130Cas adaptor protein
BCL2B-cell CLL/lymphoma 2Cell cycle regulation and
proliferation
BCL2L1Bcl-2-like 1; Bcl-XCell cycle regulation and
proliferation
BECN1Bcl-2-interacting protein beclin-1Cell cycle regulation and
proliferation
BRCA1Breast cancer 1, early onset; Breast cancer type ICell cycle regulation and
susceptibility proteinproliferation
BRCA2Breast cancer 2, early onset; Breast cancer type IICell cycle regulation and
susceptibility proteinproliferation
BRF1BRF1 homologHereditary phenotype
BSGBasigin; Extracellular matrix metalloproteinaseProtein processing and
inducer (EMMPRIN); Tumor collagenase stimulatoryturnover, incl. synthesis,
factor (TCSF); CD147cleavage, stabilization and
transport, proteolysis
CADCarbamoyl-phosphate synthetase 2, aspartate trans-Hereditary phenotype
carbamylase, and dihydroorotase
CAV1Caveolin-1; Caveolae protein, 22 kMesenchymal phenotype
CBX3Chromobox homolog 3Hereditary phenotypeCell structure
CCND1Cyclin D1; Parathyroid adenomatosis 1 (PRAD1);Cell cycle regulation and
Bcl-1proliferation
CCNE1Cyclin E1Cell cycle regulation and
proliferation
CD36CD36 antigen; Collagen type I receptor;Basal/myoepithelialAdhesion
Thrombospondin receptorphenotype
CD44CD44 antigen; Hyaluronate receptor;Mesenchymal phenotypeAdhesion
Hermes antigen gp90 homing receptor
CDH1Cadherin-1; E-cadherin (epithelial); UvomorulinLuminal epithelialAdhesion
phenotype
CDH11Cadherin-11; OB-cadherin (osteoblast)Mesenchymal phenotypeAdhesion
CDH13Cadherin-13; H-cadherin (heart)Mesenchymal phenotypeAdhesion
CDK4Cyclin dependent kinase 4Hereditary phenotypeCell cycle regulation and
proliferation
CDKN1ACyclin-dependent kinase inhibitor 1A; p21/waf1/cip1Cell cycle regulation and
proliferation
CDKN1BCyclin-dependent kinase inhibitor 1B; p27/kip1Cell cycle regulation and
proliferation
CDKN1CCyclin-dependent kinase inhibitor 1C; p57/waf2Cell cycle regulation and
proliferation
CDKN2ACyclin-dependent kinase inhibitor 2A; p16/ink4/mts1Cell cycle regulation and
proliferation
CEACAM5Carcinoembryonic antigen-related Adhesion moleculeAdhesion
5; CD66e
COX6CCytochrome c oxidase subunit VIcHereditary phenotype
CSDACold shock domain protein AHereditary phenotype
CSF1Colony stimulating factor-1; Macrophage-colonyInflammatory response
stimulating factor (M-CSF, M-CSF1)
CSF1RColony stimulating factor-1 receptor;Inflammatory response
c-fms-encoded protein
CST6Cystatin E/MProtein processing and
turnover, incl. synthesis,
cleavage, stabilization and
transport, proteolysis
CSTACystatin A; Stefin AProtein processing and
turnover, incl. synthesis,
cleavage, stabilization and
transport, proteolysis
CTNNB1Catenin (cadherin-associated protein), beta 1 (88 kD)Adhesion
CTPSCTP synthaseHereditary phenotype
CTSBCathepsin BMesenchymal phenotypeProtein processing and
turnover, incl. synthesis,
cleavage, stabilization and
transport, proteolysis
CTSDCathepsin DProtein processing and
turnover, incl. synthesis,
cleavage, stabilization and
transport, proteolysis
CTSLCathepsin LMesenchymal phenotypeProtein processing and
turnover, incl. synthesis,
cleavage, stabilization and
transport, proteolysis
CX3CL1Chemokine (C-X3-C motif) ligand 1; Small inducibleBasal/myoepithelial
cytokine subfamily D (Cys-X3-Cys), member 1;phenotype
Fractalkine; Neurotactin
CYP19Cytochrome P450, subfamily XIX; Aromatase;Oxidative metabolism
Estrogen synthetase
D123D123 gene productHereditary phenotypeCell cycle regulation and
proliferation
ECGF1Endothelial cell growth factor 1, platelet-derived (PD-Angiogenesis
ECGF); Thymidine phosphorylase (TP)
EGFREpidermal growth factor receptorMesenchymal phenotypeCell cycle regulation and
proliferation
EIF4EEukaryotic translation initiation factor 4E (EIF4-E);Protein processing and
Cap-binding proteinturnover, incl. synthesis,
cleavage, stabilization and
transport, proteolysis
EMS1Cortactin; AmplaxinAdhesion
ERBB2c-erbB-2; Herstatin (HER-2); NeuErbB2 phenotype
ESR1Estrogen receptor 1; Estrogen receptor-alphaLuminal epithelial phenotype;
Hormonal
phenotype
ESR2Estrogen receptor 2; Estrogen receptor-betaHormonal phenotype
FABP4Fatty acid binding protein 4, adipocyteBasal/myoepithelial
phenotype
FGF2Fibroblast growth factor 2;Angiogenesis
Fibroblast growth factor, basic (bFGF)
FGF8Fibroblast growth factor-8 (androgen-induced)Mesenchymal phenotypeAngiogenesis
FGFR1FGF receptor-1; Fms-related tyrosine kinase 2Angiogenesis
FHITFragile histidine triad; Bis(5′-adenosyl) triphosphatase;Cell cycle regulation and
Diadenosine triphosphate (Ap3A) hydrolaseproliferation
FIGFc-fos induced growth factor;Angiogenesis
Vascular endothelial growth factor D (VEGFD)
FLT1Fms-related tyrosine kinase 1; VEGF receptor 1Angiogenesis
FLT4Fms-related tyrosine kinase 4; VEGF receptor 3Angiogenesis
FOXA1Forkhead box A1; Hepatocyte nuclear factor 3-alphaLuminal epithelial
(HNF3A)phenotype
FOXM1Forkhead box M1Hereditary phenotypeCell cycle regulation and
proliferation
G22P1Thyroid autoantigen 70 kDaHereditary phenotype
GATA3GATA binding protein 3Luminal epithelial
phenotype
GD12GDP dissociation inhibitor 2Hereditary phenotype
GJA1Gap junction protein, alpha 1, 43 kD; Connexin 43Adhesion
(Cx43)
GJB2Gap junction protein, beta 2, 26 kD; Connexin 26Adhesion
(Cx26)
GNAI3Guanine nucleotide binding protein (G protein), alphaHereditary phenotype
inhibiting activity polypeptide 3
GPX4Glutathione peroxidase 4Hereditary phenotype
GRB7Growth factor receptor-bound protein 7ErbB2 phenotype
GSNGelsolinCell structure
GSTP1Glutathione-S-transferase P(i)1Mesenchymal phenotypeChemoresistance
HADHAHydroxyacyl-Coenzyme AHereditary phenotype
HGFHepatocyte growth factor; Scatter factor (SF);Angiogenesis
Hepapoietin A
HSPC195Hypothetical protein HSPC195Hereditary phenotype
IBSPIntegrin-binding sialoprotein; Bone sialoprotein (BSP)Adhesion
ICAM1Intercellular adhesion molecule-1; Rhinovirus receptor;Mesenchymal phenotypeAdhesion
CD54 antigen
IGF2Insulin-like growth factor 2; Somatomedin ALuminal epithelial
phenotype
IL11Interleukin 11; Adipogenesis inhibitory factor (ADIF)Mesenchymal phenotypeInflammatory response
IL1AInterleukin 1, alphaInflammatory response
IL1BInterleukin 1, betaMesenchymal phenotypeInflammatory response
IL6Interleukin 6; Interferon, beta 2Mesenchymal phenotypeInflammatory response
IL8Interleukin-8; Monocyte-derived neutrophil-activatingMesenchymal phenotypeAngiogenesis
protein (MONAP); Monocyte-derived neutrophil
chemotactic factor (MDNCF)
ILF2Interleukin enhancer binding factor 2Hereditary phenotype
ING1Inhibitor of growth 1 family, member 1;Cell cycle regulation and
p33ING1 proteinproliferation
ITGA6Integrin, alpha 6Adhesion
ITGAVIntegrin, alpha V; Vitronectin receptor alphaAdhesion
polypeptide; CD51 antigen
ITGB1Integrin, beta 1; Fibronectin receptor, beta polypeptide;Mesenchymal phenotypeAdhesion
CD29 antigen
ITGB3Integrin, β-3; Platelet glycoprotein IIIa; CD61 antigenAdhesion
ITGB8Integrin, β-8Hereditary phenotypeAdhesion
KAI1Kangai-1; Suppression of tumorigenicity 6 (ST6);Cell cycle regulation and
CD82 antigenproliferation
KDRKinase insert domain receptor; VEGF receptor 2;Angiogenesis
Flk-1 protein
KIAA0601KIAA0601 proteinHereditary phenotype
KISS1Kiss-1 metastasis suppressorCell cycle regulation and
proliferation
KLK3Kallikrein-3; Prostate-specific antigen (PSA)Protein processing and
turnover, incl. synthesis,
cleavage, stabilization and
transport, proteolysis
KRT17Keratin 17Basal/myoepithelialCell structure
phenotype
KRT18Keratin 18Luminal epithelialCell structure
phenotype
KRT19Keratin 19Luminal epithelialCell structure
phenotype
KRT5Keratin 5; Epidermolysis bullosa simplex, Dowling-Basal/myoepithelialCell structure
Meara/Kobner/Weber-Cockayne typesphenotype
KRT8Keratin 8Luminal epithelial phenotype;Cell structure
Hereditary
phenotype
LAMC2Laminin, gamma 2Basal/myoepithelialCell adhesion
phenotype
LAP18Leukemia-associated phosphoprotein p18; OncoproteinCell cycle regulation and
18 (OP 18); Stathminproliferation
LIV-1LIV-1 protein, estrogen regulatedHormonal phenotype
LRP1Low density lipoprotein-related protein 1Hereditary phenotypeCell cycle regulation and
proliferation
MCAMMelanoma Adhesion molecule (MCAM); MUC18Adhesion
glycoprotein; CD166 antigen
MCM7Minichromosome maintenance deficient 7Hereditary phenotype
MDM2Mouse double minute 2, human homolog of;Cell cycle regulation and
p53 binding proteinproliferation
METMet-protooncogene product; Hepatocyte growthMesenchymal phenotypeAngiogenesis
factor receptor
MGB1Mammaglobin-1Cell cycle regulation and
proliferation
MKI67Ki-67 antigen; Mib-1 antigenCell cycle regulation and
proliferation
MMP1Matrix metalloproteinase-1;Mesenchymal phenotypeProtein processing and
Interstitial collagenaseturnover, incl. synthesis,
cleavage, stabilization and
transport, proteolysis
MMP11Matrix metalloproteinase-11;Mesenchymal phenotypeProtein processing and
Stromelysin 3turnover, incl. synthesis,
cleavage, stabilization and
transport, proteolysis
MMP13Matrix metalloproteinase-13;Protein processing and
Collagenase 3turnover, incl. synthesis,
cleavage, stabilization and
transport, proteolysis
MMP14Matrix metalloproteinase-14 (membrane-inserted);Mesenchymal phenotypeProtein processing and
Membrane-type matrix metalloproteinase 1 (MT1-turnover, incl. synthesis,
MMP)cleavage, stabilization and
transport, proteolysis
MMP15Matrix metalloproteinase-15 (membrane-inserted);Protein processing and
Membrane-type matrix metalloproteinase 2 (MT2-turnover, incl. synthesis,
MMP)cleavage, stabilization and
transport, proteolysis
MMP2Matrix metalloproteinase-2; Gelatinase A;Mesenchymal phenotypeProtein processing and
72 kD-gelatinaseturnover, incl. synthesis,
cleavage, stabilization and
transport, proteolysis
MMP3Matrix metalloproteinase-3; Stromelysin 1Protein processing and
turnover, incl. synthesis,
cleavage, stabilization and
transport, proteolysis
MMP7Matrix metalloproteinase-7; MatrilysinMesenchymal phenotypeProtein processing and
turnover, incl. synthesis,
cleavage, stabilization and
transport, proteolysis
MMP9Matrix metalloproteinase-9; Gelatinase B; 92 kD-Protein processing and
gelatinaseturnover, incl. synthesis,
cleavage, stabilization and
transport, proteolysis
MTMR4Myotubularin related protein 4Hereditary phenotype
MUC1Mucin-1, transmembrane; CA15-3 antigen;Luminal epithelialAdhesion
Episialin; Polymorphic epithelial mucin (PEM);phenotype
Epithelial membrane antigen (EMA);
MVPMajor vault protein; Lung resistance protein (LRP)Chemoresistance
MX2Myxovirus (influenza virus) resistance 2Hereditary phenotype
MYCV-myc avian myelocytomatosis viral oncogeneCell cycle regulation and
homologproliferation
NCOA1Nuclear receptor coactivator 1;Hormonal phenotype
Steroid receptor coactivator 1 (SRC-1)
NCOA2Nuclear receptor coactivator 2; Steroid receptorHormonal phenotype
coactivator 2 (SRC-2); Transcriptional intermediary
factor 2 (TIF2); Glucocorticoid receptor interacting
protein 1 (GRIP1)
NCOA3Nuclear receptor coactivator 3; Steroid receptor coactivatorHormonal phenotype
3; Amplified in Breast Cancer (AIB1); Thyroid
hormone receptor activator molecule (TRAM-1);
Receptor-associated coactivator 3 (RAC3)
NCOR1Nuclear receptor co-repressor 1; KIAA1047 proteinHormonal phenotype
NCOR2Nuclear receptor co-repressor 2; Silencing mediator ofHormonal phenotype
retinoid and thyroid hormone action (SMRT)
NIFUNitrogen fixation cluster-likeHereditary phenotype
NME1Non-metastatic cells 1, protein expressed in; Nm23-h1,Cell cycle regulation and
nm23A; Nucleoside diphosphate kinase A (NDKA);proliferation
NME2Non-metastatic cells 2, protein expressed in; Nm23-h2,Cell cycle regulation and
nm23B; Nucleoside diphosphate kinase B (NDKB);proliferation
NSPEP1Nuclease sensitive element binding protein 1Hereditary phenotype
ODC1Ornithine decarboxylase 1Hereditary phenotypeCell cycle regulation and
proliferation
PAI-RBP1PAI-1 mRNA-binding proteinHereditary phenotype
PCNAProliferating cell nuclear antigenHereditary phenotypeCell cycle regulation and
proliferation
PDGFBPlatelet-derived growth factor beta polypeptideHereditary phenotype
PDZK1PDZ domain containing 1Hormonal phenotype
Luminal epithelial
phenotype
PFKPPhosphofructokinase, plateletHereditary phenotype
PGRProgesterone receptorHormonal phenotype
Luminal epithelial
phenotype
PHYHPhytanoyl-CoA hydroxylaseHereditary phenotype
PIPProlactin-induced protein;Luminal epithelial
Gross cystic disease fluid protein 15 (GCDFP-15)phenotype
PLATPlasminogen activator, tissue-type (tPA)Mesenchymal phenotypeProtein processing and
turnover, incl. synthesis,
cleavage, stabilization and
transport, proteolysis
PLAUPlasminogen activator, urokinase (uPA)Mesenchymal phenotypeProtein processing and
turnover, incl. synthesis,
cleavage, stabilization and
transport, proteolysis
PLAURPlasminogen activator, urokinase receptor (uPAR);Protein processing and
CD87 antigenturnover, incl. synthesis,
cleavage, stabilization and
transport, proteolysis
PPP1CBProtein phosphatase 1, catalytic subunit, beta isoformHereditary phenotype
PTGS2Prostaglandin-endoperoxide synthase 2; ProstaglandinMesenchymal phenotypeInflammatory response
G/H synthase; Cyclooxygenase-2 (COX-2)
PTHLHParathyroid hormone-like hormone;Cell cycle regulation and
Parathyroid hormone-related proteinproliferation
PTNPleiotrophin; Heparin binding growth factor 8;Basal/myoepithelialCell cycle regulation and
Neurite growth-promoting factor 1phenotypeproliferation
RB1Retinoblastoma-1Cell cycle regulation and
proliferation
RBL2Retinoblastoma-like 2Hereditary phenotype
S100A4S100 calcium-binding protein A4; Metastasin;Cell cycle regulation and
Placental calcium-binding protein (CAPL)proliferation
SELENBP1Selenium binding protein 1 (SBP1)Luminal epithelial
phenotype
SERPINB2Serine (or cysteine) proteinase inhibitor, clade B,Protein processing and
member 2; Plasminogen activator inhibitor, type IIturnover, incl. synthesis,
(PAI-2)cleavage, stabilization and
transport, proteolysis
SERPINB5Serine (or cysteine) proteinase inhibitor, clade B,Basal/myoepithelialProtein processing and
member 5; Protease inhibitor 5 (P15); Maspinphenotypeturnover, incl. synthesis,
cleavage, stabilization and
transport, proteolysis
SERPINE1Serine (or cysteine) proteinase inhibitor, clade E,Mesenchymal phenotypeProtein processing and
member 1; Plasminogen activator inhibitor, type Iturnover, incl. synthesis,
(PAI-1)cleavage, stabilization and
transport, proteolysis
SLPISecretory leukocyte protease inhibitor;Basal/myoepithelialProtein processing and
Antileukoproteinasephenotypeturnover, incl. synthesis,
cleavage, stabilization and
transport, proteolysis
SOD2Superoxide dismutase-2, mitochondrial; Manganese-Angiogenesis
containing superoxide dismutase (MnSOD)
SPHARS-phase responseHereditary phenotypeCell cycle regulation and
proliferation
SPRR1ASmall proline-rich protein 1ACell structure
SPSSelenophosphate synthetaseHereditary phenotype
SRA1Steroid receptor RNA activator (1)Hormonal phenotype
ST13Suppression of tumorigenicity 13Hereditary phenotype
STAB1Stabilin 1Hereditary phenotype
STARD3START domain containing 3; MLN 64 proteinErbB2 phenotype
(MLN64); Steroidogenic acute regulatory protein
related
STC2Stanniocalcin 2Hormonal phenotype
TFAP2CTranscription factor AP-2 gammaHereditary phenotype
TFF1Trefoil factor 1; pS2; Breast cancer, estrogen-inducibleHormonal phenotype
sequence expressed in (BCEI)Luminal epithelial
phenotype
TFF3Trefoil factor 3; Intestinal trefoil factorHormonal phenotype
Luminal epithelial
phenotype
THBS1Thrombospondin-1Mesenchymal phenotypeAngiogenesis
THBS2Thrombospondin-2Angiogenesis
TIMP1Tissue inhibitor of metalloproteinase-1;Mesenchymal phenotypeProtein processing and
Erythroid potentiating activity (EPA)turnover, incl. synthesis,
cleavage, stabilization and
transport, proteolysis
TIMP2Tissue inhibitor of metalloproteinase-2Protein processing and
turnover, incl. synthesis,
cleavage, stabilization and
transport, proteolysis
TJP1Tight junction protein-1;Luminal epithelialAdhesion
Zonula occludens 1 protein (ZO-1)phenotype
TMSB10Thymosin, beta 10Cell cycle regulation and
proliferation
TNFTumor necrosis factor alphaAngiogenesis
TNFRSF11ATumor necrosis factor receptor superfamily, memberCell cycle regulation and
11Aproliferation
TNFRSF11BTumor necrosis factor receptor superfamily, memberCell cycle regulation and
11Bproliferation
TNFSF11Tumor necrosis factor superfamily, member 11Cell cycle regulation and
proliferation
TOB1Transducer of ERBB2, 1Hereditary phenotypeCell cycle regulation and
proliferation
TOP2ATopoisomerase (DNA) II alpha (170 kD)Chemoresistance
TP53Tumor protein p53Cell cycle regulation and
proliferation
TP53BP2Tumor protein p53 binding protein, 2Hereditary phenotype
UGTREL1UDP-galactose transporter relatedHereditary phenotype
VEGFVascular endothelial growth factor;Angiogenesis
Vascular permeability factor (VPF)
VEGFBVascular endothelial growth factor BAngiogenesis
VEGFCVascular endothelial growth factor CMesenchymal phenotypeAngiogenesis
VIMVimentinMesenchymal phenotypeCell structure
VLDLRVery low density lipoprotein receptorHereditary phenotype
VWFVon Willebrand factorAngiogenesis
XBP1X-box binding protein 1Hormonal phenotype
Luminal epithelial
phenotype
ZNF22Zinc finger protein 22Hereditary phenotype
ZNF161Zinc finger protein 161Hereditary phenotype
RPL13A23 KDa Highly basic proteinHouse keeping gene
ALDOAAldolase A, fructose biphosphateHouse keeping gene
K-ALPHA-1Alpha-tubulinHouse keeping gene
ACTBBeta-ActinHouse keeping gene
PPIECyclophilin 33AHouse keeping gene
GAPDGlyceraldehyde-3-phosphate-dehydrogenaseHouse keeping gene
HK1Hexokinase 1House keeping gene
HPRT1Hypoxanthine phosphoribosyltransferase 1House Keeping gene
MDH1Malate dehydrogenase 1House keeping gene
YWHAZPhospholipase A2House keeping gene
RPS9Ribosomal Proteine S9House keeping gene
SDSSerine DehydrataseHouse keeping gene
TFRCTransferrin receptorHouse keeping gene
|
[0025] Due to the sequencing of the human genome and the publication of the gene map the above mentioned examples are known to the skilled person and appropriate capture probes may be designed on the basis of such information. Likewise, additional genes comprised by the above categories are known to the skilled person and may be derived e.g. from public databases like the National Center for Biotechnology Information (NCBI), the LocusLink web site (http://www.ncbi.nlm.nih.gov/LocusLink/) or the GeneCards Encyclopedia (http://bioinformatics.weizmann.ac.il/cards/). LocusLink provides a single query interface to curated sequence and descriptive information about genetic loci. It presents information on official nomenclature, aliases, sequence accessions, phenotypes, EC numbers, MIM numbers, UniGene clusters, homology, map locations, and related web sites. The GeneCards Encyclopedia integrates a subset of the information stored in major data sources dealing with human genes and their products (with a major focus on medical aspects).
[0026] Additional features and advantages of the present invention are described in, and will be apparent from, the following Detailed Description of the Invention and the figures.
BRIEF DESCRIPTION OF THE FIGURES
[0027]
FIG. 1 is a schematic presentation of a pattern of a microarray for the quantification of differentiated breast cancer gene expression with appropriated controls.
DETAILED DESCRIPTION OF THE INVENTION
[0028] The present inventors have found that in order to provide a proper and reliable diagnosis and prognosis of breast cancer it is not enough to just simply classify a tumor according to genes known to be involved, but that also additional information about the present cellular status of the cell is required. Only this entire information allows the attending physician to apply an appropriate regime for the particular type of cancer, to give a reliable prognosis and eventually to monitor the success of the treatment. Basically, the present inventors provide a characterization of the biological and the pathological aspect of a breast tumor based on the quantification of a minimal number of genes covering 6 phenotypes and other several cellular functions which allow to class the tumors and cell lines according to the expected cell tumor origin and their biological characteristic.
[0029] In the present invention the term “cellular function” means a function which is essential in order to obtain an overview of the modifications occurring in the “vital” cellular functions under specific biological conditions. “Vital” functions are functions which are essential for life, division and growth of the cells. Examplarily mentioned “cellular functions” are mentioned in table I, supra.
[0030] The term “expressed genes” are the parts of the genomic DNA which are transcribed into mRNA and then translated into a peptides or proteins. The measurement of the expressed genes is performed on either molecules within this process most currently the detection of the mRNA or of the peptide or protein. The detection can also be based on specific property of the protein being for example its enzymatic activity.
[0031] The terms “nucleic acid, array, probe, target nucleic acid, bind substantially, hybridizing specifically to, background, quantifying” are as described in the international patent application WO97/273 17, which is incorporated herein by reference.
[0032] The term “nucleotide triphosphate” refers to nucleotides present in either as DNA or RNA and thus includes nucleotides which incorporate adenine, cytosine, guanine, thymine and uracil as bases, the sugar moieties being deoxyribose or ribose. Other modified bases capable of base pairing with one of the conventional bases adenine, cytosine, guanine, thymine and uracil may be employed. Such modified bases include for example 8-azaguanine and hypoxanthine.
[0033] The term “nucleotide” as used herein refers to nucleosides present in nucleic acids (either DNA or RNA) compared with the bases of said nucleic acid, and includes nucleotides comprising usual or modified bases as above described.
[0034] References to nucleotide(s), polynucleotide(s) and the like include analogous species wherein the sugar-phosphate backbone is modified and/or replaced, provided that its hybridization properties are not destroyed. By way of example the backbone may be replaced by an equivalent synthetic peptide, called Peptide Nucleic Acid (PNA).
[0035] The terms “nucleotide species” is a composition of related nucleotides for the detection of a given sequence by base pairing hybridization; nucleotides are synthesized either chemically or enzymatically but the synthesis is not always perfect and the main sequence is contaminated by other related sequences like shorter one or sequences differing by a one or a few nucleotides. The essential characteristic of one nucleotides species for the invention being that the overall species can be used for capture of a given sequence belonging to a given gene.
[0036] “Polynucleotide” sequences that are complementary to one or more of the genes described herein, refers to polynucleotides that are capable of hybridizing under stringent conditions to at least part of the nucleotide sequence of said genes. Polynucleotides also include oligonucleotides which can be used under particular conditions; such hybridizable polynucleotides will typically exhibit at least about 75% sequence identity at the nucleotide level to said genes, preferably about 80% or 85% sequence identity or more preferably about 90% or 95% or more nucleotide sequence identity to said genes. They are composed of either small sequences typically 15-30 base long or longer ones being between 30 and 100 or even longer, between 100 and 300 bases long.
[0037] “Bind(s) substantially” refers to complementary hybridization between a probe nucleic acid and a target nucleic acid and embraces minor mismatches that can be accommodated by reducing the stringency of the hybridization media to achieve the desired detection of the target polynucleotide sequence.
[0038] The term “capture probe” designates a molecule which is able to specifically binds to a given polynucleotide or polypeptide. Polynucleotide binding is obtained through base pairing between two polynucleotides, one being the immobilized capture probe and the other one the target to be detected. Polypeptide binding is best performed using antibodies specific of the polypeptide for the capture of a given polypeptide or protein. Part of the antibodies, or recombinant proteins incorporating part of the antibodies, typically the variable domains, or even proteins being able to specifically recognized the peptide can also be used as capture probes.
[0039] The terms “background” or “background signal intensity” refers to hybridization signals resulting from non-specific binding, or other interactions, between the labeled target nucleic acids and components of the polynucleotide array (e. g., the polynucleotide probes, control probes, the array substrate, etc.). Background signals may also be produced by intrinsic fluorescence of the array components themselves. A single background signal can be calculated for the entire array, or a different background signal may be calculated for each target nucleic acid. In a preferred embodiment, background is calculated individually for each spot, being the level intensity of the signal around each of the spot.
[0040] The phrase “hybridizing specifically to” refers to the binding, duplexing or hybridizing of a molecule substantially to or only to a particular nucleotide sequence or sequences under stringent conditions when that sequence is present in a complex mixture (e. g., total cellular) DNA or RNA.
[0041] The “hybridized nucleic acids” are typically detected by detecting one or more “labels” attached to the sample nucleic acids. The labels may be incorporated by any of a number of means well known to those of skill in the art, such as detailed in WO 99/32660, which is incorporated herein by way of reference.
[0042] The term “capture probes” in the sense of the present invention shall designate genes or parts of genes of different length, e.g. between 10 and 1500 nucleotides, which are either synthesized chemically in situ on the surface of the support or laid down thereon. Moreover, this term shall also designate polypeptides or fragments thereof, or antibodies directed to particular polypeptides, which terms are used interchangeably, attached or adsorbed on the support.
[0043] During the extensive studies leading to the present invention it has been found that it is in effect possible to provide a tool for diagnostic, prognostic and the curative follow up of breast tumors by the quantification of the expression of specifically selected genes, or their products, according to 6 particular phenotypes, namely the luminal/epithelial phenotype, the basal/myoepithelial phenotype, the mesenchymal phenotype, the ErbB2 phenotype, the hormonal phenotype and the hereditary susceptibility representing a molecular signature characteristic of the breast tumor, and taken together with some other less specific functions related to cellular functions, e.g. to cell adhesion, cell cycle regulation, chemoresistance, angiogenesis, protein processing and turnover, oxidative metabolism, inflammatory response and cell structure and/or some house-keeping genes.
[0044] None of the prior art documents mentions the necessity to investigate the different cellular functions related to cancer cell phenotypes together with specific cellular gene functions and house keeping gene quantification to arrive at a reliable and simple diagnostic tool.
[0045] In general, the differentiated gene expression of genes associated with at least 4 or 5 of the 6 phenotypes gives a good representation of the tumor status and of the characteristics of the cells for a diagnostic, prognostic and therapeutic implication for the medical treatment of the tumors and of the patient. Specifically, the luminal/epithelial phenotypes characterize cells from the upper layer of mammary epithelium; the basal/myoepithelial phenotype are characteristic for cells present on the lower layer of the epithelium and give rise to other cell type including myoepithelial cells. The mesenchymal phenotype is characterized of cells of the mesenchyme and the mesenchymal phenotype is represented into carcinosarcoma. On the other side, ErbB2 phenotype resulting from ErbB2 amplification is characterized by aggressiveness and poor prognostic of the patient outcome. It can be present or not on the other cell phenotypes. The hormone phenotype is the presence or not of the steroid hormone receptors and the other proteins involved into the transfer of the information into biochemical and gene expression activation and repression due to the presence of hormone. The estrogen receptor-alpha (ER-alpha, gene ESR1), is one of the best representative of such receptor phenotype family with the other members described in table I. The hormone phenotype considered as ER+ is a prognostic indicator which expression is associated with a longer survival of patients and a predictor of patient responsiveness to anti-estrogens, particularly in node-negative patients (cf. Valavaara R. Oncology (Huntingt) 11 (1997), 14-8). The hereditary susceptibility is mostly due to mutations in some specific genes which increases the predispositon risk and necessitates adapted medical care. It constitutes in itself a particular phenotype since it gives information on the possible origin of the cancer or to the susceptibility of the patient to the cancer occurring due to mutations in the genomic DNA of the cells.
[0046] In one embodiment the hormone phenotype is also associated with the expression of mRNA encoded by TFF1, CCND1, PGR, MYC, which are relevant for the transcriptional functionality of the ER.
[0047] Exemplarily named other genes that may serve as clinical indicators in breast cancer cells are keratin 19 (gene KRT19), parathyroid hormone-related peptide (PTHLH), interleukin-6 (IL6), vascular endothelial growth factor (VEGF) and bcl-2 (BCL2).
[0048] E.g., the gene expression changes associated with the hereditary phenotype is linked to mutations occurring in specific genes including the BRCA1 and/or BRCA2 (Hedenfalk et al. N Engl J Med 344 (2001), 539-48). The gene expression pattern is linked to allele mutant conferring the patient a predisposition to ovarian cancer and may be linked to an increase risk for ovarian and pancreatic cancer. Risk of cancer for the phenotype positive patients is also influenced by other environment effect such as hormones which will modulate the hereditary susceptibility for cancer. In another embodiment the gene expression obtained in the hereditary phenotype allows the classification of the tumors according to the hereditary and the sporadic forms of cancers. In a more precise analysis, gene expression changes are associated specifically with the BRCA 1 mutations and some other with the BRCA 2 mutations. The gene expression changes associated with the BRAC1 mutations are part of the group of KRT8, VLDLR, MCM7, BRF1, SPS. And the gene expression changes associated with the BRCA 2 mutations are part of the group containing ACTRIA, PCNA, UGTREL1, ZNF161, ARVCF, PDGFB, PPPICB. Other genes like MCH2, CTGF and PDCD5 are also useful to discriminate between the BRCA1 and BRCA2 mutations.
[0049] As mentioned above, the inventors have found that apart from knowledge about any of the above phenotypes a minimum information about other vital cellular functions are required to obtain a general picture of the cell and to give a reliable and useful diagnostic and/or prognostic of cell tumor. The additional information obtained from the cellular functions allows a skilled person to better understand how the tumor cell has been altered. Cellular functions relate to the vitality, defense, metabolism, apoptose, cell division, cell response to cytokine or growth factors and other basic cellular properties of a given cell. On the other hand, the phenotype is characteristic of the tumor cell. The information obtained from the alteration of the cellular functions would indicate how the function has been altered in relation with the tumor phenotype. Hence, modifications of cellular functions are important to better characterize tumors (diagnostic markers), to foresee the evolution and the complications of tumors (prognostic markers) and provide an estimation of the patient responsiveness to specific therapy (predictive markers).
[0050] Some phenotypes or protein functions are important for monitoring patient treatment. E.g. in case the estrogen receptor is present treatment with anti-estrogen drugs will be applied. The level of proliferative activity is also useful to determine the sensitivity to anti-proliferative drugs. The chemoresistance activated will also influence the type of chemotherapeutic agents chosen to be active in the cancer cells. The level of angiogenesis is also a target for anti-angiogenesis drugs. The inhibition of protease of collagenase is also linked to the proteolysis activity of the cells. The adhesion is important to estimate the possible level of metastasis. High oxidative metabolism is a usual characteristic of aggressive tumors. Inflammatory cancers are usually aggressive tumors with fort evolution. Some of the genes related to protein processing and turnover are also linked with specific tumors characterization.
[0051] From a considerable number of studies eventually resulting in the present invention, it has been found that a clear link exists between the curative outcome of a breast cancer patient and the phenotype of the tumor. This latter relates mainly to the expression of three extended gene signatures (“luminal epithelial”, “basal/myo-epithelial”, “mesenchymal”), to which both, the transformed cells and their normal surrounding cells may contribute and the hereditary phenotype. The phenotype of a tumor is also crucially associated to its expression level of the (hormone) estrogen receptor-alpha (gene ESR1) and the tyrosine kinase ErbB-2 (gene ERBB2). These, and other accompanying molecules, define the “hormonal phenotype” and the “ERBB2-phenotype”, respectively.
[0052] On the other hand, specific cellular functions involved in tumor progression, such as proliferation, angiogenesis, protein degradation, have drawn a considerable attention in breast cancer research. From related studies, various function-targeted therapeutic strategies have been designed.
[0053] According to an embodiment of the invention, the variations of the gene expression fulfil the following criteria in order to be relevant for interpretation: 1) these variations are recognized as being, or at least suspected to be, of diagnostic, prognostic, or predictive value; 2) they are expressed in tumors at levels detectable by the DNA chip. The present invention gives also a means of providing micro-array with a direct detection of more than 75% of the genes with cDNA prepared from as low as 1 to 10 μg of total RNA; 3) variations of mRNA amount parallel the variations of the corresponding protein and gene expression. The amount of mRNA at a given moment in the cells reflects the equilibrium between transcription and degradation. This amount is also influenced by gene deletions and amplifications (as frequently observed i.e. for genes like ERBB2, MYC, CCND1, EMS1, FGFR1, MDM2). Protein structures and activities are the ultimate support of cell functions and tumor properties. A good correlation between mRNA and protein levels may be crucial when it concerns therapeutic target proteins, such as proteinases, vascular endothelial growth factor (VEGF), c-erbB-2 (ERBB2), or resistance markers such as breast cancer resistance protein (ABCG2), multidrug resistance protein-1 (ABCB1) or the lung resistance protein (MVP). Regarding ER-alpha (ESR1), a simple linear relationship exists between its 6.7-kb mRNA measured by northern blot and the receptor level evaluated by ligand-binding assay (LBA) (Lacroix M et al. Res. Treat. Jun 67(3) (2001), 263-71). The polynucleotide detection is favourably replaces by the polypeptide detection in order to directly obtained the quantification of the transcript of the genes.
[0054] In principle the micro-array may contain as few as 16 capture probes, i.e. one capture probe associated with each of the 6 cancer phenotypes and 10 capture probes associated with the cellular functions selected. Yet, the number of capture probes on the micro-array may be selected according to the need of the skilled person and may contain capture probes for the detection of up to about 3000 different genes, e.g. about 100, or 200 or 500 or 1000, or 2000 different genes. Since the capture probes are arranged on the solid support in the form of an array each gene is quantified by spots with a single nucleotide species, wherein one spot is sufficient for the identification and quantification of one gene.
[0055] According to a preferred embodiment of the invention capture probes are long polynucleotides and are uniques for each of the genes to be detected and quantified on the array. Long capture probes mean capture probes of 15 to 1000 nucleotides in length, e.g. of 15 to 200, or 15 to 150 nucleotides or 15 to 100, and are fixed on a support being any solid support as long as they are able to hybridized with their corresponding cDNA and be identified and quantified. The density of the capture nucleotide sequences bound to the surface of the solid support may be superior to 3 fmoles per cm of solid support surface.
[0056] Direct capture of the cDNA is the preferred embodiment making the data mining and the interpretation of the results easier. The method does not exclude the use of fragmented cDNA or RNA as a means of detection of gene by determining for each gene the pattern of hybridization on oligonucleotides present on the array.
[0057] In another embodiment, the capture probes bind the cDNA of the genes close to the 3 poly(A+) region of the corresponding mRNA. Retro-transcription of mRNAs begin at their 3′-poly(A+) region and is not always complete, which indeed gives rise to a population of more or less complete cDNAs. To improve the efficiency of hybridization, it may thus appear preferable to design capture sequences specific to mRNAs regions close to their 3′-end.
[0058] The capture probes sequences are preferably chosen so as to avoid cross-reaction with other gene(s). Therefore, regions of high (>50%) homologies between genes, able to yield cross-hybridizations between capture probes and target cDNAs will not be the preferred species.
[0059] Also variants of the genes may specifically be detected and quantified on the array by specific capture probes. In some tissues, multiple mRNAs are transcribed from the same gene. These variants often exhibit more or less overlapping sequences. Selection of the capture probes has to be specific for such marker, when all variants do not have the same potential importance as diagnostic, prognostic, or predictive indicator, or when it is recommended to detect only some specific ones. For instance, two integrin alpha 6 (ITGA6) mRNA variants have been identified in breast tumors. They encode proteins differing by their C-terminal cytoplasmic domain. Increased integrin alpha 6 expression could be associated with the metastatic phenotype of breast cancer cells, but this has not been specifically ascribed to any of the variants. Similarly, the potential prognostic value of bcl-2 (BCL2) in breast cancer has not been clearly associated so far to any of its two transcripts, differing by their C-terminus. For ITGA6 as well as for BCL2, it appears thus pertinent to design a capture probe recognizing both variant mRNAs.
[0060] At least four variants (A to D) have been found for the integrin beta 1 (ITGB 1), which mediates interactions between cells and the extracellular matrix. The relative amount of these forms is likely to vary in breast tumors. However, preferably the C variant will be detected, which has been shown to inhibit cell proliferation in vitro and is down-regulated in carcinomas. Two capture probes should be designed, one specific to the C form, the other detecting all ITGB 1 variants.
[0061] The CD44 glycoprotein is involved in cell-cell and cell-matrix interactions. The CD44 gene contains 20 exons. Exons 1-5 and 16-20 are spliced together to form a transcript that encodes the ubiquitously expressed standard isoform (known as CD44s). The 10 variable exons 6-15 (also named v1-v10) can be alternatively spliced and included within the standard exons at an insertion site between exons 5 and 16, giving rise to a pleiad of so-called CD44v variants. Studies show that not all variants do exhibit the same interest as indicators in breast cancer. Accordingly at least a capture probe is to be designed recognizing the CD44v6 variant, which is a marker to identify node-negative patients with a relatively favorable prognosis. Also the CD44v7-v8 variant, that seems to direct breast tumor cells to lymph nodes and lymphatic vessels, is considered to be a valuable probe.
[0062] Another example is MUC1. This gene encodes at least three proteins, Muc1/y, Muc1/sec, Muc1/rep. All Muc1 proteins appear to play a role in reducing cell-cell and integrin-mediated cell-matrix interactions, and probably in the metastatic spread of cancer cells from the initial tumor site. More generally, MUC1 expression seems to enhance tumor initiation and progression. This suggests that the impact of MUC1 on tumor properties depends on the relative levels of its three protein products. It is, therefore of considerably interest to specifically detect the messengers encoding the three proteins Muc1/y, Muc1/sec and Muc1/rep.
[0063] After hybridzation with the target nucleotides of the biological sample the spot intensity is read according to the label utilized, e.g. in fluorescence or colorimetry. The quantification of the genes may be performed by standard techniques, e.g. by comparison with internal standards introduced into the retro-transcription of mRNA.
[0064] The solid support as such may be made from any material conventionally used for this purpose and is preferably selected from glass, plastic, filters, metals and/or electronic chips.
[0065] According to another embodiment the present invention also provides a method for the diagnosis and/or prognosis of breast cancer, which comprises the steps of providing a micro-array as detailed above and quantifying differentially expressed genes, selected from at least 4 of the 6 cancer phenotypes provided on the support, and at least 10 other genes associated with at least 3 cellular functions and/or the house keeping genes.
[0066] According to an embodiment the method may be performed on cDNA obtained by retrotranscription from total RNA or mRNA. To this end, total RNA is extracted from tissue and an amount of about 0.1 to 100 μg, preferably 0.1. to 50 μg, more preferably 0.1. to 20 μg, even more preferably 0.1 to 10 μg, or even more preferred between 0.1 and 2 μg is used for direct labeling and hybridization on the array. mRNA may also be processed in the same way with a much lower amount to be used for the copying into cDNA. When RNA is amplified by T7 polymerase based method, PCR, rolling circle or other methods, detection is possible even at lower concentration than 0.1 μg of total RNA or mRNA depending on the amplification obtained being usually in the order of a few hundreds for the T7 polymerase and much higher for the PCR or rolling circle amplifications. In extreme cases, detection of a single cell or a group of a few cells like obtained by laser dissection methods is feasible. In amplification methods, however, different genes are amplified with different efficiencies and corrections have to be provided for the pitfall introduced in the differentially amplified genes.
[0067] According to another preferred embodiment the original nucleotide sequences to be detected and/or to be quantified are RNA sequences by the retro-transcription of the 3′ or 5′ end wherein consensus primer and possibly a stopper sequence are used. Preferably, the copied or amplified sequences are detected without previous cutting of original sequences into smaller portions.
[0068] The present invention also pertains to a diagnostic and/or prognostic kit, which comprises means and media for performing the above method.
[0069] Specifically, a gene expression analysis is performed on the present micro-array by preparing a gene expression profile from cells or from tissues incubated in the presence of drugs or from samples of a patient comprising tumor cells treated e.g. with a drug and comparing the expression profile to a gene expression profile from an untreated cell population comprising breast cancer cells or not. Based on the array identification of gene expression the effect of the drugs or chemical may then be evaluated for its potential activity. Therefore, the present invention also comprises the use of the present micro-array in the treatment of breast cancer.
[0070] E.g. a ductal carcinoma may be identified in a patient, comprising detecting the level of expression in a tissue sample of at least one gene associated with each of at least 4, preferably 5 of the 6 cell phenotypes and at least 10 other genes associated with at least 3 functions as listed in table I-II, wherein differential expression of the genes in table II is indicative of ductal carcinoma.
[0071] Also, the progression of carcinogenesis in a patient may be determined according to the present invention, comprising detecting the level of expression in a tissue sample of at least one gene associated with each of at least 4, preferably 5 of the 6 cell phenotypes and at least 10 other genes associated with at least 3 functions as listed in table I-II; wherein differential expression of the genes in table II is indicative of breast carcinogenesis.
[0072] The hereditary origin or the prognostic of susceptibility to breast tumor cancer is determined by determination of the differentiation of gene expression in breast cells on a microarray bearing the capture probes for at least 4 genes or their products typical of the hereditary phenotype together and at least 4 genes associated to other cellular functions and/or house keeping genes.
[0073] In one of the embodiment, the invention provides a method of screening for an agent capable of modulating the onset or progression of breast cancer, comprising the steps of exposing a cell to the agent; and detecting the expression level of at least one gene associated with each of at least 4, preferably 5 of the 6 cell phenotypes and at least 10 other genes associated with at least 3 functions as listed in table I-II.
[0074] In another embodiment, the invention further includes computer systems comprising a database containing information identifying the expression level in breast tissue of at least one gene associated with each of the 6 cell phenotypes and at least 10 other genes associated with at least 3 functions as listed in tables I-II and a user interface to view the information. The database may further include sequence information for the genes, information identifying the expression level for the set of genes in normal breast tissue and cancerous tissue and may contain links to external databases such as GenBank. The present invention includes relational databases containing sequence information, for instance for one or more of the genes of Tables I-II, as well as gene expression information in various breast tissue samples. Databases may also contain information associated with a given sequence or tissue sample such as descriptive information about the gene associated with the sequence information, descriptive information concerning the clinical status of the tissue sample, or information concerning the patient from which the sample was derived. The database may be designed to include different parts, for instance a sequence database and a gene expression database. Methods for the configuration and construction of such databases are widely available, for instance, U.S. Pat. No. 5,953,727, which is incorporated herein by reference in its entirety.
[0075] According to the present invention, potential drugs can be screened to determine if application of the drug alters the expression of the genes identified herein. This may be useful, for example, in determining whether a particular drug is effective in treating a particular patient with breast cancer. In the case where a gene expression is affected by the potential drug such that its level of expression returns to normal, the drug is indicated in the treatment of breast cancer. Similarly, a drug which causes expression of a gene which is not normally expressed by epithelial cells in the breast, may be contraindicated in the treatment of breast cancer.
[0076] Assays to monitor the expression of a marker or markers as e.g. defined in Tables I-II may utilize any available means of monitoring for changes in the expression level of the nucleic acids of the invention. As used herein, an agent is said to modulate the expression of a nucleic acid of the invention if it is capable of up- or down-regulating expression of the nucleic acid in a cell.
[0077] Agents that are assayed in the above methods can be randomly selected or rationally selected or designed. As used herein, an agent is said to be randomly selected when the agent is chosen randomly without considering the specific sequences involved in the association of the a protein of the invention alone or with its associated substrates, binding partners, etc. An example of randomly selected agents is the use a chemical library or a peptide combinatorial library, or a growth broth of an organism.
[0078] The genes identified as being differentially expressed in breast cancer may be used in a variety of nucleic acid detection assays to detect or quantify the expression level of a gene or multiple genes in a given sample. For example, traditional Northern blotting, nuclease protection, RT-PCR and differential display methods may be used for detecting gene expression levels.
[0079] The protein products of the genes identified herein can also be assayed to determine the amount of expression. Methods for assaying for a protein include Western blot, immunoprecipitation, radioimmunoassay and protein chips. Protein chips are supports bearing as capture probes antibodies or related proteins specific of the different proteins or peptides to be analyzed. Antibodies are either on the same support as a protein array or are on different supports as beads, each beads being specific for the detection of proteins. It is preferred, however, that the mRNA be assayed as an indication of expression. Methods for assaying for mRNA include Northern blots, slot blots, dot blots, and hybridization to an ordered array of polynucleotides. Any method for specifically and quantitatively measuring a specific protein or mRNA or DNA product can be used. However, methods and assays of the invention are most efficiently designed with PCR or array or chip hybridization-based methods for detecting the expression of a large number of genes.
[0080] Any hybridization assay format may be used, including solution-based and solid support-based assay formats. A preferred solid support is a low density array also known as a DNA chip or a gene chip. In one assay format, the array containing probes to at least one gene associated with each of the 6 cell phenotypes and at least 10 other genes associated with at least 3 functions as e.g. listed in tables I-II, may be used to directly monitor or detect changes in gene expression in the treated or exposed cell as described herein. Assays of the invention may measure the expression levels of about 14, 50, 100, 400, 1000 or 3000 genes with some or all from the table II. The number of genes to be detected is limited in the invention since it allows to better concentrate on gene useful for the characterization of the tumors and to give a more quick and precise response to the questions of the prognostic, diagnostic and therapeutic follow up of the patients. The larger the number of genes to be analyzed and treated for data mining, the less efficient the correlation and the outcome of the analysis.
[0081] In another assay format, cells or cell lines are first identified which express one or more of the gene products of the invention physiologically. Cells and/or cell lines so identified would preferably comprise the necessary cellular machinery to ensure that the transcriptional and/or translational apparatus of the cells would faithfully mimic the response of normal or cancerous breast tissue to an exogenous agent. Such machinery would likely include appropriate surface transduction mechanisms and/or cytosolic factors.
[0082] By way of example, and not limitation, examples of the present invention will now be given.
EXAMPLE 1
[0083] Gene expression in cell line of cancer origin.
[0084] Cell lines of different origin were analyzed according to the production of the capture nucleotide sequences and of the targets.
[0085] The cell lines were:
[0086] BT-474 (ESR1+, ERBB2+), HS578T (ESR1−, ERBB2−), MCF-7 (ESR1+, ERBB2−), MDA-MB-231 (ESR1−, ERBB2−), MDA-MB-453 (ESR1−, ERBB2+), T-47D (ESR1+, ERBB2−), as described at ATCC (www.atcc.org). Evsa-T (Borras M. et al., Cancer Lett. 1997 Nov. 25; 120(1):23-30), IBEP-1, IBEP-2, IBEP-3 (Siwek B. et al., Int J Cancer. 1998 May 29;76(5):677-83), KPL-1 (Kurebayashi J. et al., Br J Cancer. 1995 April;71(4):845-53).
[0087] 1. RNA Extraction:
[0088] Frozen tumors, or tumors maintained in RNAlater (Ambion) were crushed in liquid nitrogen, using mortar and pestle. Total RNA was extracted from powdered tumors and cultured breast cancer cells by TriPure (Roche), according to the manufacturer's instructions. Poly(A+) RNA (mRNA) was obtained from total RNA using FastTrack columns (InVitrogen). Poly(A+) RNA was resuspended in RNAse-free water.
[0089] The concentration and purity of RNA was determined by diluting an aliquot of the preparation in TE (10 mM Tris-HCl pH 8, 1 mM EDTA) and measuring (reading) its absorbance (in a spectrophotometer) at 260 nM and 280 nm.
[0090] While A260 allows to evaluate the RNA concentration, the A260/A280 ratio gives an indication of RNA purity. For a RNA to be used, its ratio must be comprised between 1.8 and 2.
[0091] The overall quality of the RNA preparation was determined by electrophoresis on a denaturing 1% agarose gel (Sambrook et al., eds. (1989) Molecular Cloning—A Laboratory Manual, 2nd ed. Cold Spring Harbor, N.Y.: Cold Spring Harbor Laboratory Press).
[0092] 2. cDNA Synthesis:
[0093] 1 μl of poly(A+) RNA sample (0.5 μg/μl) was mixed with 2 μl oligo(dT)12-18 (0.5 μg/μl, Roche), 3.5 μl H2O, and 3 μl of a solution of 3 different synthetic well-defined poly(A+) RNAs. These latter served as internal standards to assist in quantification and estimation of experimental variation introduced during the subsequent steps of analysis. After an incubation of 10 minutes at 70 C and 5 minutes on ice, 9 μl of reaction mix were added. Reaction mix consisted in 4 μl Reverse Transcription Buffer 5×(Gibco BRL), 1 μl RNAsin Ribonuclease Inhibitor (40 U/ml, Promega), and 2 μl of a 10×dNTP mix, made of dATP, dTTP, dGTP (5 mM each, Roche), dCTP (800 μM, Roche), and Biotin-11-dCTP (800 μM, NEN).
[0094] After 5 minutes at room temperature, 1.5 μl SuperScript II (200 U/ml, Gibco BRL) was added and incubation was performed at 42 C for 90 minutes. Addition of SuperScript and incubation were repeated once. The mixture was then placed at 70 C for 15 minutes and 1 μl Ribonuclease H (2 U/μl) was added for 20 minutes at 37 C. Finally, a 3-minutes denaturation step was performed at 95 C. The biotinylated cDNA, was kept at −20 C.
[0095] 3. Hybridization of (Using) Biotinylated cDNA:
[0096] The BreastChips used in this study is composed of 145 genes and several different controls including positive and negative detection control, positive and negative hybridization control, three different internal standards all dispersed at different locations among the genes to be analyzed on the micro-array (FIG. 1). In this example each spots was covered with a capture probe being a polynucleotide species which allow the specific binding of one target polynucleotide corresponding to a specific gene listed in table 2.
[0097] Hybridization chambers were from Biozym (Landgraaf, The Netherlands). Hybridization mixture consisted in biotinylated cDNA (the total amount of labeled cDNA), 6.5 μl HybriBuffer A (Eppendorf, Hambourg, Germany), 26 μl HybriBuffer B (Eppendorf, Hambourg, Germany), 8 μl H2O, and 2 μl of positive hybridization control.
[0098] Hybridization was carried out overnight at 60° C. The micro-arrays were then washed 4 times for 2 min with washing buffer (B 1 0.1×+Tween 0.1%) (Eppendorf, Hamburg, Germany).
[0099] The micro-arrays were than incubated for 45 minutes at room temperature with the Cy3-conjugated IgG Anti biotin (Jackson Immuno Research Laboratories, Inc #200-162-096) diluted 1/1000× Conjugate-Cy3 in the blocking reagent and protect from light.
[0100] The micro-arrays were washed again 4 times for 2 minutes with washing buffer (B1 0.1×+Tween 0.1%) and 2 times for 2 minutes with distilled water before being dried under a flux of N2.
[0101] 4. Scanning and Data Analysis:
[0102] The hybridized micro-arrays were scanned using a laser confocal scanner “ScanArray” (Packard, USA) at a resolution of 10 μm. To maximize the dynamic range of the assay the same arrays were scanned at different photomultiplier tube (PMT) settings. After image acquisition, the scanned 16-bit images were imported to the software, ‘ImaGene4.0’ (BioDiscovery, Los Angeles, Calif., USA), which was used to quantify the signal intensities. Data mining and determination of significantly expressed gene in the test compared to the reference arrays was performed according to the method described by Delongueville et al (Biochem Pharmacol. 2002 Jul. 1;64(1):137-49). Briefly, the spots intensities were first corrected for the local background and than the ration between the test and the reference arrays were calculated. To account variation in the different experimental steps, the data obtained from different hybridizations were normalized in two ways. First the values are corrected using a factor calculated from the intensity ratios of the internal standard reference and the test sample. The presence of 3 internal standard probes at different locations on the micro-array allows measurement of a local background and evaluation of the micro-array homogeneity, which is going to be considered in the normalization (Schuchhardt et al., Nucleic Acids Res. 28 (2000), E47). However, the internal standard control does not account for the quality of the mRNA samples, therefore a second step of normalization was performed based on the expression levels of housekeeping genes. This process involves calculating the average intensity for a set of housekeeping genes, the expression of which is not expected to vary significantly. The variance of the normalized set of housekeeping genes is used to generate an estimate of expected variance, leading to a predicted confidence interval for testing the significance of the ratios obtained (Chen et al, J. Biomed. Optics 1997, 2, 364-74). Ratios outside the 95% confidence interval were determined to be significantly changed by the treatment.
[0103] In this experiment, the ratios of gene expressed in each cell type was compared with the average gene expression obtained from a mixture of mRNA isolated from the 11 cell lines.
EXAMPLE 2
[0104] Gene expression in breast cancer.
[0105] The RNA extraction, cDNA preparation, hybridization, and quantification of the array were performed as described in the example 1 from tumors tissues obtained from chirurgical intervention.
[0106] The tumor cDNA was analyzed on an array present on a slide with the presence of another array. This second array serves as reference and was hybridized from cDNA obtained as a mixture of the RNA from the 12 cell lines used in the example 1. The identification and quantification of the genes differentially expressed in tumors were obtained from the comparison of the hybridization yield of the two arrays.
[0107] It should be understood that various changes and modifications to the presently preferred embodiments described herein will be apparent to those skilled in the art. Such changes and modifications can be made without departing from the spirit and scope of the present invention and without diminishing its intended advantages. It is therefore intended that such changes and modifications be covered by the appended claims.
Claims
- 1. A micro-array for the diagnosis and prognosis of human breast cancer comprising:
a solid support; and a plurality of capture probes, present on the solid support in the form of an array, which are selected from the group consisting of polynucleotides, polypeptides and fragments thereof, derived from (a) at least one gene representing at least 4 out of 6 phenotypes, selected from the group consisting of luminal/epithelial, basal/myoepithelial, mesenchymal, ErbB2, hormonal phenotypes and hereditary susceptibility to breast cancer; and (b) at least 10 genes associated with at least 3 cellular functions.
- 2. The micro-array according to claim 1, which comprises capture probes for the detection of not more than 3000 different genes.
- 3. The micro-array according to claim 1, wherein each gene is quantified on an array by spots with single nucleotide species.
- 4. The micro-array according to claim 1, wherein each gene is quantified on an array by spots with attached or adsorbed antibodies specific of the proteins derived from the expressed genes.
- 5. The micro-array according to claim 1, wherein one spot is sufficient for the identification and quantification of one gene or gene product.
- 6. The micro-array according to claim 1, wherein a spot intensity is read in fluorescence or colorimetry.
- 7. The micro-array according to claim 1, wherein the quantification of the genes is performed by comparison with internal standards introduced into the retro-transcription of mRNA.
- 8. The micro-array according to claim 1, wherein the solid support is selected from the group consisting of glass, plastic, filters, metals and electronic chips.
- 9. A method for the diagnosis and prognosis of breast cancer, comprising the steps of:
providing a micro-array comprising a solid support and a plurality of capture probes, present on the solid support in the form of an array, which are selected from the group consisting of polynucleotides, polypeptides and fragments thereof, and quantifying the different expression of at least 4 of the 6 cancer phenotypes and at least 10 other genes associated with at least 3 cellular functions and/or house keeping functions.
- 10. The method according to claim 9, comprising the step of performing the method on cDNA obtained by retro-transcription from less than 20 μg of total RNA.
- 11. The method according to claim 9, comprising the step of performing the method on cDNA obtained by retro-transcription from less than 10 μg of total RNA.
- 12. The method according to claim 9, including the step of obtaining by retro-transcription from less than 1 μg of mRNA.
- 13. The method according to claim 9, wherein the differentially expressed genes are quantified and/or identified by reference with a reference tissue or reference cell.
- 14. The method according to claim 9, wherein the results on the breast cancer and to a reference material is obtained on micro-arrays present on the same support.
- 15. The method according to claim 9, wherein the density of the capture nucleotide sequences bound to the surface of the solid support is superior to 3 fmoles per cm of solid support surface.
- 16. The method according to claim 9, wherein the insoluble solid support is selected from the group consisting of glasses, electronic devices, silicon supports, plastic supports, compact discs, filters, gel layers, metallic supports and a mixture thereof.
- 17. The method according to claim 9, wherein the nucleotide sequences to be detected and to be quantified are RNA sequences obtained by retro-transcription of the 3′ or 5′ end of the transcript by using consensus primer and possibly a stopper sequence
- 18. The method according to claim 9, wherein the copied or amplified sequences are detected without previous cutting of original sequences into smaller portions
- 19. A diagnostic kit, comprising:
an array for a solid support; a plurality of captures probes, present on the solid support in the form of an array, which are selected from the group consisting of polynucleotides, polypeptides and fragments thereof, derived from at least 4 of the 6 cancer phenotypes selected from the group consisting of luminal/epithelial, basal/myoepithelial, mesenchymal, ErbB2, hormonal phenotypes and hereditary susceptibility to breast cancer and at least 10 other genes associated with at least 3 cellular functions.
- 20. The kit according to claim 19, wherein the solid support is selected from the group consisting of glass, silicon, plastic, filters, gel layers, metal and a mixture thereof.
- 21. A micro-array for the detection of cancer comprising:
a solid support; and a plurality of captures probes, present on the solid support in the form of an array, which are selected from the group consisting of polynucleotides, polypeptides and fragments thereof, derived from (a) at least one gene representing at least 4 out of 6 phenotypes, selected from the group consisting of luminal/epithelial, basal/myoepithelial, mesenchymal, ErbB2, hormonal phenotypes and hereditary susceptibility to breast cancer; and (b) at least 10 genes associated with at least 3 cellular functions.
- 22. A micro-array for the determination of the progress of cancer comprising:
a solid support; and a plurality of captures probes, present on the solid support in the form of an array, which are selected from the group consisting of polynucleotides, polypeptides and fragments thereof, derived from (a) at least one gene representing at least 4 out of 6 phenotypes, selected from the group consisting of luminal/epithelial, basal/myoepithelial, mesenchymal, ErbB2, hormonal phenotypes and hereditary susceptibility to breast cancer; and (b) at least 10 genes associated with at least 3 cellular functions.
- 23. The micro-array of claim 21, wherein the cancer is ductal carcinogenesis.
- 24. The micro-array of claim 22, wherein the cancer is ductal carcinogenesis.
- 25. A micro-array for the detection of cancerogenous agents and/or for the detection of cytostatic or antiproliferative agents comprising:
a solid support; and a plurality of captures probes, present on the solid support in the form of an array, which are selected from the group consisting of polynucleotides, polypeptides and fragments thereof, derived from (a) at least one gene representing at least 4 out of 6 phenotypes, selected from the group consisting of luminal/epithelial, basal/myoepithelial, mesenchymal, ErbB2, hormonal phenotypes and hereditary susceptibility to breast cancer; and (b) at least 10 genes associated with at least 3 cellular functions.
- 26. A method for the diagnosis and prognosis of breast cancer, comprising the steps of:
providing a micro-array comprising a solid support containing a plurality of capture probes which are selected for the detection of a group comprising polynucleotides or polypeptides derived from at least 4 expressed genes or its complement representing the hereditary phenotype and at least 4 genes associated with other cellular functions and/or house keeping genes.