U.S. application Ser. No. 10/864,207 filed Jun. 9, 2004, and U.S. Appl. Nos. 60/477,228 and 60/477,235, both filed Jun. 9, 2003, are herein incorporated by reference in their entirety.
The present invention relates to compositions and methods for treating, characterizing, and diagnosing cancer. In particular, the present invention provides gene expression profiles associated with solid tumor stem cells, as well as novel stem cell cancer markers useful for the diagnosis, characterization, and treatment of solid tumor stem cells. The invention further provides distinct tumor stem cell gene signatures useful for categorizing tumors into subclasses and for diagnosis, prognosis, and treatment of cancer.
Breast cancer is the most common female malignancy in most industrialized countries, as it is estimated to affect about 10% of the female population during their lifespan. Although its mortality has not increased along with its incidence, due to earlier diagnosis and improved treatment, it is still one of the predominant causes of death in middle-aged women. Despite earlier diagnosis of breast cancer, about 1-5% of women with newly diagnosed breast cancer have a distant metastasis at the time of the diagnosis. In addition, approximately 50% of the patients with local disease who are primarily diagnosed eventually relapse with the metastasis. Eighty-five percent of these recurrences take place within the first five years after the primary manifestation of the disease.
On presentation, most patients with metastatic breast cancer have only one or two organ systems involved. As the disease progresses over time, multiple sites usually become involved. Indeed, metastases can be found in nearly every organ of the body at autopsy. The most common sites of metastatic involvement observed are locoregional recurrences in the skin and soft tissues of the chest wall, as well as in axilla, and supraclavicular area. The most common site for distant metastasis is the bone (30-40% of distant metastasis), followed by lung and liver. Metastatic breast cancer is generally considered to be an incurable disease. However, the currently available treatment options often prolong the disease-free state and overall survival rate, as well as increase the quality of the life. The median survival from the manifestation of distant metastases is about three years.
Current methods of diagnosing and staging breast cancer include the tumor-node-metastasis (TNM) system that relies on tumor size, tumor presence in lymph nodes, and the presence of distant metastases as described in the American Joint Committee on Cancer: AJCC Cancer Staging Manual. Philadelphia, Pa.: Lippincott-Raven Publishers, 5th ed., 1997, pp 171-180, and in Harris, J R: “Staging of breast carcinoma” in Harris, J. R., Hellman, S., Henderson, I. C., Kinne D. W. (eds.): Breast Diseases. Philadelphia, Lippincott, 1991. These parameters are used to provide a prognosis and select an appropriate therapy. The morphologic appearance of the tumor can also be assessed but because tumors with similar histopathologic appearance can exhibit significant clinical variability, this approach has serious limitations. Finally assays for cell surface marker can be used to divide certain tumors types into subclasses. For example, one factor considered in the prognosis and treatment of breast cancer is the presence of the estrogen receptor (ER) as ER-positive breast cancers typically respond more readily to hormonal therapies such as tamoxifen than ER-negative tumors. Yet these analyses, though useful, are only partially predictive of the clinical behavior of breast tumors, and there is much phenotypic diversity present in breast cancers that current diagnostic tools fail to detect.
Traditional modes of cancer therapy include radiation therapy, chemotherapy, and hormonal therapy. Yet because of the difficulty in predicting the clinical course of early stage breast cancer from standard clinical and pathologic features, current practice is to offer systemic chemotherapy to most women even though the majority of these women would have good outcome in the absence of chemotherapy. Chemotherapy has severe side effects and itself carries a 1% mortality rate, and thus unnecessary suffering and deaths could be avoided if patients could be divided into high and low risk subgroups. Thus, there exists a need for improved methods to classifying tumors for better prognosis and treatment selection.
Furthermore, although current therapies can often prolong the disease-free state and overall survival when used on high-risk patients, they are limited by their lack of specificity and the emergence of treatment-resistant cancer cells. Approximately two thirds of people diagnosed with cancer will die of their cancer within five years. Thus there is a great call for the identification of additional genes that can serve as selective therapies for the treatment of cancer.
Colorectal cancer is the third most common cancer and the fourth most frequent cause of cancer deaths worldwide. Approximately 5-10% of all colorectal cancers are hereditary with one of the main forms being familial adenomatous polyposis (FAP), an autosomal dominant disease in which about 80% of affected individuals contain a germline mutation in the adenomatous polyposis coli (APC) gene. Colorectal carcinoma has a tendency to invade locally by circumferential growth and for lymphatic, hematogenous, transperitoneal, and perineural spread. The most common site of extralymphatic involvement is the liver, with the lungs the most frequently affected extra-abdominal organ. Other sites of hematogenous spread include the bones, kidneys, adrenal glands, and brain.
The current staging system for colorectal cancer is based on the degree of tumor penetration through the bowel wall and the presence or absence of nodal involvement. This staging system is defined by three major Duke's classifications: Duke's A disease is confined to submucosa layers of colon or rectum; Duke's B disease has tumors that invade through muscularis propria and can penetrate the wall of the colon or rectum; and Duke's C disease includes any degree of bowel wall invasion with regional lymph node metastasis.
Surgical resection is highly effective for early stage colorectal cancers, providing cure rates of 95% in Duke's A and 75% in Duke's B patients. The presence of positive lymph node in Duke's C disease predicts a 60% likelihood of recurrence within five years. Treatment of Duke's C patients with a post surgical course of chemotherapy reduces the recurrence rate to 40%-50%, and is now the standard of care for these patients. Because of the relatively low rate of reoccurrence, the benefit of post surgical chemotherapy in Duke' B has been harder to detect and remains controversial. However, the Duke's B classification is imperfect as approximately 20-30% of these patients behave more like Duke's C and relapse within five years. Thus there is a clear need to identify better prognostic factors for selecting Duke's B patients that are likely to relapse and would benefit from therapy.
During normal animal development, cells of most or all tissues are derived from normal precursors, called stem cells (Morrison et al., Cell 88(3): 287-98 (1997); Morrison et al., Curr. Opin. Immunol. 9(2): 216-21 (1997); Morrison et al., Annu. Rev. Cell. Dev. Biol. 11: 35-71 (1995)). Stem cells are cells that: (1) have extensive proliferative capacity; 2) are capable of asymmetric cell division to generate one or more kinds of progeny with reduced proliferative or developmental potential; and (3) are capable of symmetric cell divisions for self-renewal or self-maintenance. In adult animals, some cells (including cells of the blood, gut, breast ductal system, and skin) are constantly replenished from a small population of stem cells in each tissue. The best-known example of adult cell renewal by the differentiation of stem cells is the hematopoietic system where developmentally immature precursors (hematopoietic stem and progenitor cells) respond to molecular signals to form the varied blood and lymphoid cell types.
Solid tumors are composed of heterogeneous cell populations. For example, breast cancers are a mixture of cancer cells and normal cells, including mesenchymal (stromal) cells, inflammatory cells, and endothelial cells. Classic models of cancer hold that phenotypically distinct cancer cell populations all have the capacity to proliferate and give rise to a new tumor. In the classical model, tumor cell heterogeneity results from environmental factors as well as ongoing mutations within cancer cells resulting in a diverse population of tumorigenic cells. This model rests on the idea that all populations of tumor cells would have some degree of tumorigenic potential. (Pandis et al., Genes, Chromosomes & Cancer 12:122-129 (1998); Kuukasjrvi et al., Cancer Res. 57: 1597-1604 (1997); Bonsing et al., Cancer 71: 382-391 (1993); Bonsing et al., Genes Chromosomes & Cancer 82: 173-183 (2000); Beerman H et al., Cytometry. 12(2): 147-54 (1991); Aubele M & Werner M, Analyt. Cell. Path. 19: 53 (1999); Shen L et al., Cancer Res. 60: 3884 (2000).).
An alternative model for the observed solid tumor cell heterogeneity is that solid tumors result from a “solid tumor stem cell” (or “cancer stem cell” from a solid tumor) that subsequently undergoes chaotic development through both symmetric and asymmetric rounds of cell divisions. In this stem cell model, solid tumors contain a distinct and limited (possibly even rare) subset of cells that share the properties of normal “stem cells”, in that they extensively proliferate and efficiently give rise both to additional solid tumor stem cells (self-renewal) and to the majority of tumor cells of a solid tumor that lack tumorigenic potential. Indeed, mutations within a long-lived stem cell population can initiate the formation of cancer stem cells that underlie the growth and maintenance of tumors and whose presence contributes to the failure of current therapeutic approaches.
Although great strides have been made understanding the genetic changes that lead to cancer (e.g. breast cancer and colorectal cancer), the lack of reliable tumor assay for de novo human cancer cells has hindered the ability to understand the effects of these mutations at the cellular level. Also, the lack of identified cancer markers for solid tumor stem cells has hindered the development of diagnostics and therapeutics for cancer patients (e.g. breast cancer patients). As such, what is needed is a reliable tumor assay as well as the identification of cancer markers for solid tumor stem cells.
The present invention relates to compositions and methods for treating, characterizing, and diagnosing cancer. In particular, the present invention provides gene expression profiles associated with solid tumor stem cells, as well as novel stem cell cancer markers useful for the diagnosis, characterization, and treatment of solid tumor stem cells. The present invention further provides gene signatures derived from solid tumor stem cell markers that when detected in a tumor sample act as significant predictors of poor clinical outcome, including high risk of metastasis and death.
In some embodiments, the present invention provides methods of detecting solid tumor stem cells, comprising; a) providing a tissue sample from a subject, and b) detecting at least one stem cell cancer marker (e.g., 1, 2, 3, 5, 10, . . . etc.) from Tables 4-9 in the tissue sample under conditions such that the presence or absence of solid tumor stem cells in the tissue sample is determined. In particular embodiments, detecting the solid tumor stem cells comprises determining the presence of (or absence of), or an expression level for at least one stem cell cancer marker. In other embodiments, the detecting comprises detecting mRNA expression of the at least one stem cell cancer marker. In particular embodiments, the detecting comprises exposing the stem cell cancer marker mRNA to a nucleic acid probe complementary to the stem cell cancer marker mRNA.
In certain embodiments, the detecting comprises detecting polypeptide expression of the at least one stem cell cancer marker. In other embodiments, the detecting comprises exposing the stem cell cancer marker polypeptide to an antibody specific to the stem cell cancer marker polypeptide and detecting the binding of the antibody to the stem cell cancer polypeptide. In further embodiments, the subject comprises a human subject. In additional embodiments, the tissue sample comprises tumor tissue. In some embodiments, the tumor tissue sample is a post-surgical tumor tissue sample (e.g. tumor biopsy).
In other embodiments, the methods further comprise c) providing a prognosis to the subject. In some embodiments, the at least one stem cell cancer marker is from Table 8. In some embodiments, the at least one stem cell cancer marker comprises: Bmi-1, eed, easyh1, easyh2, rnf2, yy1, smarcA3, smarcA5, smarcD3, smarcE1, mllt3, FZD1, FZD2, FZD3, FZD4, FZD6, FZD7, FZD8, FZD9, FZD10, WNT2, WNT2B, WNT3, WNT5A, WNT10B, WNT16, AXIN1, BCL9, MYC, and (TCF4).
In other embodiments reagents and methods for predicting the clinical outcome including metastasis and death are provided using the cancer stem cell markers of the present invention. Cancer stem cell gene signatures comprising identified cancer stem cell markers are provided that are predictive of metastasis and overall survival and can thus be used to classify tumors into low and high-risk subclasses and further provide a diagnosis, provide a prognosis, select a therapy, and monitor a therapy. In one embodiment a method of classifying a tumor comprising: a) providing a tumor sample for example by obtaining a tumor biopsy from a patient; b) determining expression or activity of one or more of genes encoding a protein or polypeptide selected from a solid tumor stem cell gene signature; and c) classifying the tumor as belonging to a high risk or low risk tumor class based on the results of b). In another embodiment of the invention, the method further comprises providing a diagnosis, prognosis, selecting a therapy, or monitoring a therapy. According to certain of the inventive methods the presence or amount of a gene product, e.g., a polypeptide or a nucleic acid, encoded by a solid tumor stem cell gene is detected in a sample derived from a subject (e.g. a sample of tissue or cells obtained from a tumor or a blood sample obtained from a subject). In general the subject is a human, however the subject can also be an animal of another kind. The subject can be an individual who has or can have a tumor. The sample can be subject to a number of processing steps prior to or in the course of detection. In certain embodiments of the invention the gene product is a polypeptide that is detected using an antibody capable of binding to the polypeptide. In certain embodiments of the invention the antibody is used to perform immunohistochemical staining on a sample obtained from a subject. In certain embodiments of the invention solid tumor stem cell gene mRNA expression is measured using a microarray. In other embodiments of the invention solid tumor stem cell gene mRNA is measured by quantitative PCR using a set of primers designed to amplify a portion of the gene. Other detection means are know to one of ordinary skill in the art e.g. see U.S. Pat. No. 6,057,105.
In particular embodiments, the present invention provides methods for reducing the size of a solid tumor (e.g. in research drug screening, or therapeutic applications) comprising contacting cells of a solid tumor with a biologically (e.g. therapeutically) effective amount of a composition comprising at least one agent directed against at least one stem cell cancer marker shown in Tables 4-9. In some embodiments, the biologically effective amount is an amount sufficient to cause cell death of or inhibit proliferation of solid tumor stem cells in the solid tumor. In other embodiments, the biologically effective amount is an amount that interference with the survival pathways (e.g. notch related genes) or self-renewal pathways (e.g. WNT pathways) of the solid tumor stem cell.
Examples of solid tumors from which solid tumor stem cells can be isolated or enriched for according to the invention include, but are not limited to, sarcomas and carcinomas such as, but not limited to: fibrosarcoma, myxosarcoma, liposarcoma, chondrosarcoma, osteogenic sarcoma, chordoma, angiosarcoma, endotheliosarcoma, lymphangiosarcoma, lymphangioendotheliosarcoma, synovioma, mesothelioma, Ewing's tumor, leiomyosarcoma, rhabdomyosarcoma, colon carcinoma, pancreatic cancer, breast cancer, ovarian cancer, prostate cancer, squamous cell carcinoma, basal cell carcinoma, adenocarcinoma, sweat gland carcinoma, sebaceous gland carcinoma, papillary carcinoma, papillary adenocarcinomas, cystadenocarcinoma, medullary carcinoma, bronchogenic carcinoma, renal cell carcinoma, hepatoma, bile duct carcinoma, choriocarcinoma, seminoma, embryonal carcinoma, Wilms' tumor, cervical cancer, testicular tumor, lung carcinoma, small cell lung carcinoma, bladder carcinoma, epithelial carcinoma, glioma, astrocytoma, medulloblastoma, craniopharyngioma, ependymoma, pinealoma, hemangioblastoma, acoustic neuroma, oligodendroglioma, meningioma, melanoma, neuroblastoma, and retinoblastoma. The invention is applicable to sarcomas and epithelial cancers, such as ovarian cancers and breast cancers.
In additional embodiments, the at least one agent is an antibody, peptide or small molecule. In further embodiments, the antibody, peptide, anti-sense, siRNA, or small molecule is directed against an extracellular domain of the at least one stem cell cancer marker. In some embodiments, the at least one stem cell cancer marker is selected from the group consisting of: Bmi-1, eed, easyh1, easyh2, rnf2, yy1, smarcA3, smarcA5, smarcD3, smarcE 1, mllt3, FZD1, FZD2, FZD3, FZD4, FZD6, FZD7, FZD8, FZD9, FZD10, WNT2, WNT2B, WNT3, WNT5A, WNT10B, WNT16, AXIN1, BCL9, MYC, and (TCF4).
In other embodiments, the present invention provides methods for reducing the size of a solid tumor, comprising contacting cells of a solid tumor with a biologically (e.g. therapeutically) effective amount of a composition comprising at least one agent that modulates the activity of at least one stem cell cancer marker shown in Tables 4-9. In some embodiments, the present invention provides methods for killing or inhibiting the proliferation of solid tumor stem cells comprising contacting the solid tumor stem cells with a biologically effective amount of a composition comprising at least one agent targeted to at least one stem cell cancer marker shown in Tables 4-9. In certain embodiments, the methods further comprise identifying the death of or the prevention of the growth of the solid tumor stem cells following the contacting. In additional embodiments, the cell death is caused by apoptosis. In other embodiments, the biologically effective amount is an amount that interferes with the survival pathways (e.g. notch related genes) or self-renewal pathways (e.g. WNT pathways) of the solid tumor stem cell. In other embodiments, the at least one stem cell cancer marker is selected from the group consisting of: Bmi-1, eed, easyh1, easyh2, rnf2, yy1, smarcA3, smarcA5, smarcD3, smarcE1, mllt3, FZD1, FZD2, FZD3, FZD4, FZD6, FZD7, FZD8, FZD9, FZD10, WNT2, WNT2B, WNT3, WNT5A, WNT10B, WNT16, AXIN1, BCL9, MYC, and (TCF4).
In particular embodiments, the solid tumor stem cells express cell surface marker CD44, ESA, or B38.1. In other embodiments, the solid tumor stem cells fail to express at least one LINEAGE marker selected from the group consisting of CD2, CD3, CD IO, CD 14, CD16, CD31, CD45, CD64, and CD140b (see, e.g., U.S. Appl. Pub. US20040037815A1, and US20020119565, both of which are herein incorporated by reference).
In other embodiments, the present invention provides methods for selectively targeting a solid tumor stem cell comprising, (a) identifying at least one stem cell cancer marker from Tables 4-9 present on a solid tumor stem cell; and (b) obtaining an agent or set of agents that selectively binds to or regulates the at least one stem cell cancer marker. In some embodiments, the agent genetically modifies the solid tumor stem cell. In particular embodiments, the agent comprises a bi-specific conjugate. In further embodiments, the agent comprises an adenoviral vector.
In some embodiments, the present invention provides methods for forming a tumor in an animal, comprising: introducing purified solid tumor stem cells (e.g. a cell dose of) into an animal, wherein: (a) the solid tumor stem cells are derived from a solid tumor; and (b) the solid tumor stem cells are enriched at least 2-fold relative to unfractionated tumor cells based on the presence of at least one stem cell cancer marker in Tables 4-9. In other embodiments, the animal is an immunocompromised animal. In certain embodiments, the animal is an immunocompromised mammal, such as a mouse (e.g., a nude mouse, SCID mouse, NOD/SCID mouse, Beige/SCID mouse; and microglobin deficient NOD/SCID mouse). In particular embodiments, the number of cells in the cell dose is between about 100 cells and about 5×105 cells.
In certain embodiments, the present invention provides kits for detecting solid tumor stem cells in a subject, comprising: a) a reagent capable of specifically detecting at least one stem cell cancer marker from Tables 4-9 in a tissue or cell sample from a subject, and, optionally, b) instructions for using the reagent for detecting the presence or absence of solid tumor stem cells in the tissue sample. In further embodiments, the reagent comprises a nucleic acid probe complementary to mRNA from the at least one stem cell cancer marker. In other embodiments, the reagent comprises an antibody or antibody fragment.
In some embodiments, the present invention provides methods of screening compounds, comprising: a) providing; i) a solid tumor stem cell; and ii) one or more test compounds; and b) contacting the solid tumor stem cell with the test compound; and c) detecting a change in expression of at least one stem cell cancer marker shown in Tables 4-9 in the presence of the test compound relative to the absence of the test compound. In particular embodiments, the detecting comprises determining an expression level for the at least one stem cell cancer marker. In particular embodiments, the detecting comprises detecting mRNA expression of the at least one stem cell cancer marker. In some embodiments, the detecting comprises detecting polypeptide expression of the at least one stem cell cancer marker. In additional embodiments, the solid tumor stem cell is in vitro. In other embodiments, the solid tumor stem cell is in vivo. In further embodiments, the test compound comprises a drug (e.g. small molecule, antibody, antibody-toxin conjugate, siRNA, etc.).
In some embodiments, the present invention provides compositions comprising at least two agents (e.g. small molecule, antibody, antibody-toxin conjugate, siRNA, etc.), wherein each of the agents modulates the activity of at least one stem cell cancer marker shown in Tables 4-9. In additional embodiments, the composition comprises at least three agents.
In particular embodiments, the present invention provides methods of distinguishing tumorigenic from non-tumorigenic cancer cells, comprising: detecting the presence of β-catenin in a cancer cell such that the localization of βcatenin in the cancer cell is determined to be primarily nuclear or primarily cytoplasmic. In some embodiments, the method further comprises identifying the cancer cell as tumorigenic if the βcatenin localization is primarily nuclear, or identifying the cancer cell as non-tumorigenic if the β-catenin localization is primarily cytoplasmic.
In certain embodiments, the present invention provides methods of distinguishing a tumorigenic from a non-tumorigenic cancer cell, comprising; a) providing; i) a cancer cell, and ii) a composition comprising an agent configured to bind β-catenin; and b) contacting the cancer cell with the composition under conditions such that the localization of β-catenin in the cancer cell is determined to be primarily nuclear and cytoplasmic or primarily cytoplasmic and membrane associated, and c) identifying the cancer cell as tumorigenic if the β-catenin localization is primarily nuclear and cytoplasmic, or identifying the cancer cell as non-tumorigenic if the β-catenin localization is primarily cytoplasmic and membrane associated.
This invention is based on the discovery of solid tumor stem cells (also referred to as cancer stem cells from a solid tumor) as a distinct and limited subset of cells within the heterogenous cell population of established solid tumors. These cancer stem cells share the properties of normal stem cells in that they extensively proliferate and efficiently give rise both to additional solid tumor stem cells (self-renewal) and to the majority of tumor cells of a solid tumor that lack tumorigenic potential. Identification of cancer stem cells from solid tumors relied on their expression of a unique pattern of cell-surface receptors that could be used to isolate them from the bulk of non-tumorigenic tumor cells and on the assessment of their properties of self-renewal and proliferation in culture and in xenograft animal models. An ESA+; CD44+; CD24−/low; Lineage− population greater than 50-fold enriched for the ability to form tumors relative to unfractionated tumor cells was discovered (Al-Hajj et al., 2003; U.S. Appl. Publ. Nos. 20020119565 and 20040037815, herein incorporated by reference in their entireties).
The present invention relates to compositions and methods for treating, characterizing and diagnosing cancer. In particular, the present invention provides gene expression profiles associated with solid tumor stem cells, as well as novel markers useful for the diagnosis, characterization, and treatment of solid tumor stem cells. Suitable markers that can be targeted (e.g. for diagnostic or therapeutic purposes) are the genes and peptides encoded by the genes that are differentially expressed in solid tumor stem cells as shown in Tables 4-9. The differentially expressed genes, and the peptides encoded thereby, can be detected (e.g. quantitatively) in order to identify the presence of solid tumor stem cells, and to determine and screen molecules suitable for reducing the proliferation (or killing), interfering with self-renewal pathways, or interfering with survival pathways of any solid tumor stem cells that are present. The differentially expressed genes, and peptides encoded thereby, shown in these tables are also useful for generating therapeutic agents targeted to one or more of these markers (e.g. to inhibit or promote the activity of the marker).
In order to identify solid tumor stem cell markers, cells from 5 patients, AML stem cells and non-tumorigenic cancer cells from 6 patients, normal hematopoietic stem cells (HSC5), normal hematopoietic cells, normal colon epithelial cells, and normal breast epithelial cells were analyzed for differential expression.
The present invention also provides solid tumor stem cells that differentially express from other cells one or more of the markers provided in Tables 4-9. The solid tumor stem cells can be human or other animal. The expression can be either to a greater extent or to a lesser extent. The other cells can be selected from normal cells, hematopoietic stem cells, acute myelogenous leukemia (AML) stem cells, or any other class of cells.
The invention provides a method of selecting cells of a population, which results in a purified population of solid tumor stem cells (e.g. from a patient to select or test therapeutic agents are some for the patient). The present invention also provides a method of selecting a purified population of tumor cells other than solid tumor stem cells, such as a population of non-tumorigenic (NTG) tumor cells. The present invention provides methods of raising antibodies to the selected cells. The invention provides diagnostic methods using the selected cells. The invention also provides therapeutic methods, where the therapeutic is directed to a solid tumor stem cell (e.g. directed to one of the stem cells cancer markers identified herein directly or indirectly).
Accordingly, the invention provides methods of selecting cells, diagnosing disease, conducting research studies, and treating solid tumors using selection methods, diagnostic methods and therapeutics directed to specific genes on a given pathway. Included are one or more of the following genes and gene products: Bmi-1, eed, easyhi, easyh2, mf2, yy1, smarcA3, smarcA5, smarcD3, smarcE1 and mllt3, as well as those shown in Tables 4-9. Many of these genes are differentially expressed in solid tumor stem cells as compared with normal cells and non-tumorigenic cancer cells, as shown herein.
The invention provides in vivo and in vitro assays of solid tumor stem cell function and cell function by the various populations of cells isolated from a solid tumor. The invention provides methods for using the various populations of cells isolated from a solid tumor (such as a population of cells enriched for solid tumor stem cells) to identify factors influencing solid tumor stem cell proliferation. By the methods of the present invention, one can characterize the phenotypically heterogeneous populations of cells within a solid tumor. In particular, one can identify, isolate, and characterize a phenotypically distinct cell population within a tumor having the stem cell properties of extensive proliferation and the ability to give rise to all other tumor cell types. Solid tumor stem cells are the tumorigenic cells that are capable of re-establishing a tumor following treatment.
In certain embodiments, the present invention provides methods for screening for anti-cancer agents; for the testing of anti-cancer therapies; for the development of drugs targeting novel pathways; for the identification of new anti-cancer therapeutic targets; the identification and diagnosis of malignant cells in pathology specimens; for the testing and assaying of solid tumor stem cell drug sensitivity; for the measurement of specific factors that predict drug sensitivity; and for the screening of patients (e.g., as an adjunct for mammography).
The present invention further provides for the first time cancer stem cell gene signatures comprising cancer stem cell markers that are predictive of clinical outcome including metastasis and overall survival. These cancer stem cell signatures shown in Tables 9-13 and 17-24 are established as predictive of a poor prognosis. In some embodiments of the present invention the cancer stem cell signatures are used clinically to classify tumors as low or high risk and to assign a tumor to a low or high risk category. The cancer stem cell signatures can further be used to provide a diagnosis, prognosis, and select a therapy based on the classification of a tumor as low or high risk as well as to monitor the diagnosis, prognosis, and/or therapy over time. In another embodiment, the cancer stem cell signatures can be used experimentally to test and assess lead compounds including, for example, small molecules, siRNAs, and antibodies for the treatment of cancer.
In certain embodiments a tumor cell profile is detected by quantifying expression levels of polynucleotides by, for example, RT-PCR. The polynucleotides selected for quantification by RT-PCR are those polynucleotides comprising the cancer stem cell signature. Alternatively the tumor cell profiles are detected by quantifying expression levels of proteins by, for example, quantitative immunofluorescence or ELISA. The proteins selected for quantification are those proteins encoded by genes comprising the cancer stem cell signature. In a some embodiment, a tumor cell profile is detected by microarray analysis using microarrays that comprise a cancer stem cell gene signature. These microarrays can detect the presence of a tumor cell profile by expression levels of polynucleotides, for example mRNA, in a patient sample or, alternatively, by expression levels of proteins in a patient sample using, for example, antibodies. In another some embodiment, a tumor cell profile is detected by real-time PCR using primer sets that specifically amplify the genes comprising the cancer stem cell signature. In other embodiments of the invention, microarrays are provided that contain polynucleotides or proteins (i.e. antibodies) that detect the expression of a cancer stem cell signature for use in prognosis.
Other features, objects, and advantages of the invention will be apparent from the detailed description below. Additional guidance is provided in Published PCT patent application WO 02/12447 by the Regents of the University of Michigan and PCT patent application PCT/USO2/39191 by the Regents of the University of Michigan, both of which are incorporated herein by reference.
Definitions
To facilitate an understanding of the present invention, a number of terms and phrases are defined below:
As used here, the term “antibody” includes monoclonal antibodies (including full length monoclonal antibodies), polyclonal antibodies, multispecific antibodies (e.g., bispecific antibodies), and antibody fragments so long as they exhibit the desired biological activity (e.g. able to bind a stem cell cancer marker as described herein). Antibodies can be conjugated to other molecules (e.g., toxins).
As used herein, the term “antibody fragments” refers to a portion of an intact antibody. Examples of antibody fragments include, but are not limited to, linear antibodies; single-chain antibody molecules; Fc or Fc′ peptides, Fab and Fab fragments, and multispecific antibodies formed from antibody fragments.
As used herein, “humanized” forms of non-human (e.g., murine) antibodies are chimeric antibodies that contain minimal sequence, or no sequence, derived from non-human immunoglobulin. For the most part, humanized antibodies are human immunoglobulins (recipient antibody) in which residues from a hypervariable region of the recipient are replaced by residues from a hypervariable region of a non-human species (donor antibody) such as mouse, rat, rabbit or nonhuman primate having the desired specificity, affinity, and capacity. In some instances, Fv framework region (FR) residues of the human immunoglobulin are replaced by corresponding non-human residues. Furthermore, humanized antibodies can comprise residues that are not found in the recipient antibody or in the donor antibody. These modifications are generally made to further refine antibody performance. In general, the humanized antibody will comprise substantially all of at least one, and typically two, variable domains, in which all or substantially all of the hypervariable loops correspond to those of a nonhuman immunoglobulin and all or substantially all of the FR residues are those of a human immunoglobulin sequence. The humanized antibody can also comprise at least a portion of an immunoglobulin constant region (Fc), typically that of a human immunoglobulin. Examples of methods used to generate humanized antibodies are described in U.S. Pat. No. 5,225,539 to Winter et al. (herein incorporated by reference).
“Enriched”, as in an enriched population of cells, can be defined phenotypically based upon the increased number of cells having a particular marker (e.g. as shown in Tables 4-9) in a fractionated set of cells as compared with the number of cells having the marker in the unfractionated set of cells. However, the term “enriched can be defined functionally by tumorigenic function as the minimum number of cells that form tumors at limit dilution frequency in test mice. For example, if 500 tumor stem cells form tumors in 63% of test animals, but 5000 unfractionated tumor cells are required to form tumors in 63% of test animals, then the solid tumor stem cell population is 10-fold enriched for tumorigenic activity. The stem cell cancer markers of the present invention can be used to generate enriched populations of cancer stem cells. In some embodiments, the stem cell population is enriched at least 1.4 fold relative to unfractioned tumor cells (e.g. 1.4 fold, 1.5 fold, 2 fold, 5 fold, 10 fold, . . . , 20 fold, . . . ).
“Isolated” in regard to cells, refers to a cell that is removed from its natural environment (such as in a solid tumor) and that is isolated or separated, and is at least about 30%, 50%, 75%, 90% free, from other cells with which it is naturally present, but which lack the marker based on which the cells were isolated. The stem cell cancer markers of the present invention can be used to generate isolated populations of cancer stem cells.
As used herein, the terms “low levels”, “decreased levels”, “low expression”, “reduced expression” or “decreased expression” in regards to gene expression are used herein interchangeably to refer to expression of a gene in a cell or population of cells, particularly a cancer stem cell or population of cancer stem cells, at levels less than the expression of that gene in a second cell or population of cells, for example normal breast epithelial cells. “Low levels” of gene expression refers to expression of a gene in a cancer stem cell or population of cancer stem cells at levels: 1) half that or below expression levels of the same gene in normal breast epithelial cells and 2) at the lower limit of detection using conventional techniques. “Low levels” of gene expression can be determined by detecting decreased to nearly undetectable amounts of a polynucleotide (mRNA, cDNA, etc.) in cancer stem cells compared to normal breast epithelium by, for example, quantitative RT-PCR or microarray analysis. Alternatively “low levels” of gene expression can be determined by detecting decreased to nearly undetectable amounts of a protein in cancer stem cells compared to normal breast epithelium by, for example, ELISA, Western blot, quantitative immunofluorescence, etc.
The terms “high levels”, “increased levels”, “high expression”, “increased expression” or “elevated levels” in regards to gene expression are used herein interchangeably to refer to expression of a gene in a cell or population of cells, particularly a cancer stem cell or population of cancer stem cells, at levels higher than the expression of that gene in a second cell or population of cells, for example normal breast epithelial cells. “Elevated levels” of gene expression refers to expression of a gene in a cancer stem cell or population of cancer stem cells at levels twice that or more of expression levels of the same gene in normal breast epithelial cells. “Elevated levels” of gene expression can be determined by detecting increased amounts of a polynucleotide (mRNA, cDNA, etc.) in cancer stem cells compared to normal breast epithelium by, for example, quantitative RT-PCR or microarray analysis. Alternatively “elevated levels” of gene expression can be determined by detecting increased amounts of a protein in cancer stem cells compared to normal breast epithelium by, for example, ELISA, Western blot, quantitative immunofluorescence, etc.
The term “undetectable levels” or “loss of expression” in regards to gene expression as used herein refers to expression of a gene in a cell or population of cells, particularly a cancer stem cell or population of cancer stem cells, at levels that cannot be distinguished from background using conventional techniques such that no expression is identified. “Undetectable levels” of gene expression can be determined by the inability to detect levels of a polynucleotide (mRNA, cDNA, etc.) in cancer stem cells above background by, for example, quantitative RT-PCR or microarray analysis. Alternatively “undetectable levels” of gene expression can be determined by the inability to detect levels of a protein in cancer stem cells above background by, for example, ELISA, Western blot, immunofluorescence, etc.
As used herein, the term “receptor binding domain” refers to any native ligand for a receptor, including cell adhesion molecules, or any region or derivative of such native ligand retaining at least a qualitative receptor binding ability of a corresponding native ligand.
As used herein, the term “antibody-immunoadhesin chimera” comprises a molecule that combines at least one binding domain of an antibody with at least one immunoadhesin. Examples include, but are not limited to, the bispecific CD4-IgG chimeras described in Berg et al., PNAS (USA) 88:4723-4727 (1991) and Charnow et al., J. Immunol., 153:4268 (1994), both of which are hereby incorporated by reference.
As used herein, the terms “cancer” and “cancerous” refer to or describe the physiological condition in mammals in which a population of cells are characterized by unregulated cell growth. Examples of cancer include, but are not limited to, carcinoma, lymphoma, blastoma, sarcoma, and leukemia. More particular examples of such cancers include squamous cell cancer, small-cell lung cancer, non-small cell lung cancer, adenocarcinoma of the lung, squamous carcinoma of the lung, cancer of the peritoneum, hepatocellular cancer, gastrointestinal cancer, pancreatic cancer, glioblastoma, cervical cancer, ovarian cancer, liver cancer, bladder cancer, hepatoma, breast cancer, colon cancer, colorectal cancer, endometrial or uterine carcinoma, salivary gland carcinoma, kidney cancer, liver cancer, prostate cancer, vulval cancer, thyroid cancer, hepatic carcinoma and various types of head and neck cancer.
“Metastasis” as used herein refers to the process by which a cancer spreads or transfers from the site of origin to other regions of the body with the development of a similar cancerous lesion at the new location. A “metastatic” or “metastasizing” cell is one that loses adhesive contacts with neighboring cells and migrates via the bloodstream or lymph from the primary site of disease to invade neighboring body structures.
The term “epitope” as used herein refers to that portion of an antigen that makes contact with a particular antibody.
When a protein or fragment of a protein is used to immunize a host animal, numerous regions of the protein can induce the production of antibodies which bind specifically to a given region or three-dimensional structure on the protein; these regions or structures are referred to as “antigenic determinants”. An antigenic determinant can compete with the intact antigen (i.e., the “immunogen” used to elicit the immune response) for binding to an antibody.
The terms “specific binding” or “specifically binding” when used in reference to the interaction of an antibody and a protein or peptide means that the interaction is dependent upon the presence of a particular structure (i.e., the antigenic determinant or epitope) on the protein; in other words the antibody is recognizing and binding to a specific protein structure rather than to proteins in general. For example, if an antibody is specific for epitope “A,” the presence of a protein containing epitope A (or free, unlabelled A) in a reaction containing labeled “A” and the antibody will reduce the amount of labeled A bound to the antibody.
As used herein, the terms “non-specific binding” and “background binding” when used in reference to the interaction of an antibody and a protein or peptide refer to an interaction that is not dependent on the presence of a particular structure (i.e., the antibody is binding to proteins in general rather that a particular structure such as an epitope).
As used herein, the term “subject” refers to any animal (e.g., a mammal), including, but not limited to, humans, non-human primates, rodents, and the like, which is to be the recipient of a particular treatment. Typically, the terms “subject” and “patient” are used interchangeably herein in reference to a human subject.
As used herein, the term “subject suspected of having cancer” refers to a subject that presents one or more symptoms indicative of a cancer (e.g., a noticeable lump or mass) or is being screened for a cancer (e.g., during a routine physical). A subject suspected of having cancer can also have one or more risk factors. A subject suspected of having cancer has generally not been tested for cancer. However, a “subject suspected of having cancer” encompasses an individual who has received an initial diagnosis but for whom the stage of cancer is not known. The term further includes people who once had cancer (e.g., an individual in remission).
As used herein, the term “subject at risk for cancer” refers to a subject with one or more risk factors for developing a specific cancer. Risk factors include, but are not limited to, gender, age, genetic predisposition, environmental exposure, previous incidents of cancer, preexisting non-cancer diseases, and lifestyle.
As used herein, the term “characterizing cancer in subject” refers to the identification of one or more properties of a cancer sample in a subject, including but not limited to, the presence of benign, pre-cancerous or cancerous tissue, the stage of the cancer, and the subject's prognosis. Cancers can be characterized by the identification of the expression of one or more cancer marker genes, including but not limited to, the cancer markers disclosed herein.
The terms “cancer stem cell”, “tumor stem cell”, or “solid tumor stem cell” are used interchangeably herein and refer to a population of cells from a solid tumor that: (1) have extensive proliferative capacity; (2) are capable of asymmetric cell division to generate one or more kinds of differentiated progeny with reduced proliferative or developmental potential; and (3) are capable of symmetric cell divisions for self-renewal or self-maintenance. These properties of “cancer stem cells”, “tumor stem cells” or “solid tumor stem cells” confer on those cancer stem cells the ability to form palpable tumors upon serial transplantation into an immunocompromised mouse compared to the majority of tumor cells that fail to form tumors. Cancer stem cells undergo self-renewal versus differentiation in a chaotic manner to form tumors with abnormal cell types that can change over time as mutations occur. The solid tumor stem cells of the present invention differ from the “cancer stem line” provided by U.S. Pat. No. 6,004,528. In that patent, the “cancer stem line” is defined as a slow growing progenitor cell type that itself has few mutations but which undergoes symmetric rather than asymmetric cell divisions as a result of tumorigenic changes that occur in the cell's environment. This “cancer stem line” hypothesis thus proposes that highly mutated, rapidly proliferating tumor cells arise largely as a result of an abnormal environment, which causes relatively normal stem cells to accumulate and then undergo mutations that cause them to become tumor cells. U.S. Pat. No. 6,004,528 proposes that such a model can be used to enhance the diagnosis of cancer. The solid tumor stem cell model is fundamentally different than the “cancer stem line” model and as a result exhibits utilities not offered by the “cancer stem line” model. First, solid tumor stem cells are not “mutationally spared”. The “mutationally spared cancer stem line” described by U.S. Pat. No. 6,004,528 can be considered a pre-cancerous lesion, while the solid tumor stem cells described by this invention are cancer cells that themselves contain the mutations that are responsible for tumorigenesis. That is, the solid tumor stem cells (“cancer stem cells”) of the invention would be included among the highly mutated cells that are distinguished from the “cancer stem line” in U.S. Pat. No. 6,004,528. Second, the genetic mutations that lead to cancer can be largely intrinsic within the solid tumor stem cells as well as being environmental. The solid tumor stem cell model predicts that isolated solid tumor stem cells can give rise to additional tumors upon transplantation (thus explaining metastasis) while the “cancer stem line” model would predict that transplanted “cancer stem line” cells would not be able to give rise to a new tumor, since it was their abnormal environment that was tumorigenic. Indeed, the ability to transplant dissociated, and phenotypically isolated human solid tumor stem cells to mice (into an environment that is very different from the normal tumor environment), where they still form new tumors, distinguishes the present invention from the “cancer stem line” model. Third, solid tumor stem cells likely divide both symmetrically and asymmetrically, such that symmetric cell division is not an obligate property. Fourth, solid tumor stem cells can divide rapidly or slowly, depending on many variables, such that a slow proliferation rate is not a defining characteristic.
As used herein “tumorigenic” refers to the functional features of a solid tumor stem cell including the properties of self-renewal (giving rise to additional tumorigenic cancer stem cells) and proliferation to generate all other tumor cells (giving rise to differentiated and thus non-tumorigenic tumor cells) that allow solid tumor stem cells to form a tumor.
As used herein, the terms “stem cell cancer marker(s)”, “cancer stem cell marker(s)”, “tumor stem cell marker(s)”, or “solid tumor stem cell marker(s)” refer to a gene or genes or a protein, polypeptide, or peptide expressed by the gene or genes whose expression level, alone or in combination with other genes, is correlated with the presence of tumorigenic cancer cells compared to non-tumorigenic cells. The correlation can relate to either an increased or decreased expression of the gene (e.g. increased or decreased levels of mRNA or the peptide encoded by the gene).
“Gene expression profile” refers to identified expression levels of at least one polynucleotide or protein expressed in a biological sample.
A “gene profile,” “gene pattern,” “expression pattern” or “expression profile” refers to a specific pattern of gene expression that provides a unique identifier of a biological sample, for example, a breast or colon cancer pattern of gene expression obtained by analyzing a breast or colon cancer sample in comparison to a reference sample will be referred to as a “breast cancer gene profile” or a “colon cancer expression pattern”. “Gene patterns” can be used to diagnose a disease, make a prognosis, select a therapy, and/or monitor a disease or therapy after comparing the gene pattern to a cancer stem cell gene signature.
The terms “cancer stem cell gene signature”, “tumor stem cell gene signature”, “cancer stem cell signature”, “tumor stem cell signature”, “tumorigenic gene signature”, and “TG gene signature” are used interchangeably herein to refer to gene signatures comprising genes differentially expressed in cancer stem cells compared to other cells or population of cells, for example normal breast epithelial tissue. In some embodiments the cancer stem cell gene signatures comprise genes differentially expressed in cancer stem cells versus normal breast epithelium by a fold change, for example by 2 fold reduced and/or elevated expression, and further limited by using a statistical analysis such as, for example, by the P value of a t-test across multiple samples. In another some embodiment, the genes differentially expressed in cancer stem cells are divided into cancer stem cell gene signatures based on the correlation of their expression with a chosen gene in combination with their fold or percentage expression change. Most some cancer stem cell signatures are predictive both retrospectively and prospectively of an aspect of clinical variability, including but not limited to metastasis and death.
As used herein, the term “a reagent that specifically detects expression levels” refers to reagents used to detect the expression of one or more genes (e.g., including but not limited to, the cancer markers of the present invention). Examples of suitable reagents include but are not limited to, nucleic acid probes capable of specifically hybridizing to the gene of interest, aptamers, PCR primers capable of specifically amplifying the gene of interest, and antibodies capable of specifically binding to proteins expressed by the gene of interest. Other non-limiting examples can be found in the description and examples below.
As used herein, the term “detecting a decreased or increased expression relative to non-cancerous control” refers to measuring the level of expression of a gene (e.g., the level of mRNA or protein) relative to the level in a non-cancerous control sample. Gene expression can be measured using any suitable method, including but not limited to, those described herein.
As used herein, the term “detecting a change in gene expression in a cell sample in the presence of said test compound relative to the absence of said test compound” refers to measuring an altered level of expression (e.g., increased or decreased) in the presence of a test compound relative to the absence of the test compound. Gene expression can be measured using any suitable method.
As used herein, the term “instructions for using said kit for detecting cancer in said subject” includes instructions for using the reagents contained in the kit for the detection and characterization of cancer in a sample from a subject.
As used herein, “providing a diagnosis” or “diagnostic information” refers to any information that is useful in determining whether a patient has a disease or condition and/or in classifying the disease or condition into a phenotypic category or any category having significance with regards to the prognosis of or likely response to treatment (either treatment in general or any particular treatment) of the disease or condition. Similarly, diagnosis refers to providing any type of diagnostic information, including, but not limited to, whether a subject is likely to have a condition (such as a tumor), information related to the nature or classification of a tumor, information related to prognosis and/or information useful in selecting an appropriate treatment. Selection of treatment can include the choice of a particular chemotherapeutic agent or other treatment modality such as surgery, radiation, etc., a choice about whether to withhold or deliver therapy, etc.
As used herein, the terms “providing a prognosis”, “prognostic information”, or “predictive information” refer to providing information regarding the impact of the presence of cancer (e.g., as determined by the diagnostic methods of the present invention) on a subject's future health (e.g., expected morbidity or mortality, the likelihood of getting cancer, and the risk of metastasis).
The term “low risk” in regards to tumors or to patients diagnosised with cancer refers to a tumor or patient with a lower probability of metastasis and/or lower probability of causing death or dying within about five years of first diagnosis than all the tumors or patients within a given population.
The term “high risk” in regards to tumors or to patients diagnosised with cancer refers to a tumor or patient with a higher probability of metastasis and/or higher probability of causing death or dying within about five years of first diagnosis than all the tumors or patients within a given population.
As used herein, the term “post surgical tumor tissue” refers to cancerous tissue (e.g., biopsy tissue) that has been removed from a subject (e.g., during surgery).
As used herein, the term “subject diagnosed with a cancer” refers to a subject who has been tested and found to have cancerous cells. The cancer can be diagnosed using any suitable method, including but not limited to, biopsy, x-ray, blood test, and the diagnostic methods of the present invention.
As used herein, the terms “biopsy tissue”, “patient sample”, “tumor sample”, and “cancer sample” refer to a sample of cells, tissue or fluid that is removed from a subject for the purpose of determining if the sample contains cancerous tissue, including cancer stem cells or for determining gene expression profile of that cancerous tissue. In some embodiment, biopsy tissue or fluid is obtained because a subject is suspected of having cancer. The biopsy tissue or fluid is then examined for the presence or absence of cancer, cancer stem cells, and/or cancer stem cell gene signature expression.
As used herein, the term “gene transfer system” refers to any means of delivering a composition comprising a nucleic acid sequence to a cell or tissue. For example, gene transfer systems include, but are not limited to, vectors (e.g., retroviral, adenoviral, adeno-associated viral, and other nucleic acid-based delivery systems), microinjection of naked nucleic acid, polymer-based delivery systems (e.g., liposome-based and metallic particle-based systems), biolistic injection, and the like. As used herein, the term “viral gene transfer system” refers to gene transfer systems comprising viral elements (e.g., intact viruses, modified viruses and viral components such as nucleic acids or proteins) to facilitate delivery of the sample to a desired cell or tissue. As used herein, the term “adenovirus gene transfer system” refers to gene transfer systems comprising intact or altered viruses belonging to the family Adenoviridae.
As used herein, the term “site-specific recombination target sequences” refers to nucleic acid sequences that provide recognition sequences for recombination factors and the location where recombination takes place.
As used herein, the term “nucleic acid molecule” refers to any nucleic acid containing molecule, including but not limited to, DNA or RNA. The term encompasses sequences that include any of the known base analogs of DNA and RNA including, but not limited to, 4-acetylcytosine, 8-hydroxy-N6-methyladenosine, aziridinylcytosine, pseudoisocytosine, 5-(carboxyhydroxylmethyl) uracil, 5-fluorouracil, 5-bromouracil, 5-carboxymethylaminomethyl-2-thiouracil, 5-carboxymethylaminomethyluracil, dihydrouracil, inosine, N6-isopentenyladenine, 1-methyladenine, 1-methylpseudouracil, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-methyladenine, 7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine, 5′-methoxycarbonylmethyluracil, 5-methoxyuracil, 2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid, oxybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, N-uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid, pseudouracil, queosine, 2-thiocytosine, and 2,6-diaminopurine.
The term “gene” refers to a nucleic acid (e.g., DNA) sequence that comprises coding sequences necessary for the production of a polypeptide, precursor, or RNA (e.g., rRNA, tRNA). The polypeptide can be encoded by a full length coding sequence or by any portion of the coding sequence so long as the desired activity or functional properties (e.g., enzymatic activity, ligand binding, signal transduction, immunogenicity, etc.) of the full-length or fragment are retained. The term also encompasses the coding region of a structural gene and the sequences located adjacent to the coding region on both the 5′ and 3′ ends for a distance of about 1 kb or more on either end such that the gene corresponds to the length of the full-length mRNA. Sequences located 5′ of the coding region and present on the mRNA are referred to as 5′ non-translated sequences. Sequences located 3′ or downstream of the coding region and present on the mRNA are referred to as 3′ non-translated sequences. The term “gene” encompasses both cDNA and genomic forms of a gene. A genomic form or clone of a gene contains the coding region interrupted with non-coding sequences termed “introns” or “intervening regions” or “intervening sequences.” Introns are segments of a gene that are transcribed into nuclear RNA (hnRNA); introns can contain regulatory elements such as enhancers. Introns are removed or “spliced out” from the nuclear or primary transcript; introns therefore are absent in the messenger RNA (mRNA) transcript. The mRNA functions during translation to specify the sequence or order of amino acids in a nascent polypeptide.
As used herein, the term “heterologous gene” refers to a gene that is not in its natural environment. For example, a heterologous gene includes a gene from one species introduced into another species. A heterologous gene also includes a gene native to an organism that has been altered in some way (e.g., mutated, added in multiple copies, linked to non-native regulatory sequences, etc). Heterologous genes are distinguished from endogenous genes in that the heterologous gene sequences are typically joined to DNA sequences that are not found naturally associated with the gene sequences in the chromosome or are associated with portions of the chromosome not found in nature (e.g., genes expressed in loci where the gene is not normally expressed).
As used herein, the term “gene expression” refers to the process of converting genetic information encoded in a gene into RNA (e.g., mRNA, rRNA, tRNA, or snRNA) through “transcription” of the gene (e.g., via the enzymatic action of an RNA polymerase), and for protein encoding genes, into protein through “translation” of mRNA. Gene expression can be regulated at many stages in the process. “Up-regulation” or “activation” refers to regulation that increases the production of gene expression products (e.g., RNA or protein), while “down-regulation” or “repression” refers to regulation that decrease production. Molecules (e.g., transcription factors) that are involved in up-regulation or down-regulation are often called “activators” and “repressors,” respectively.
In addition to containing introns, genomic forms of a gene can also include sequences located on both the 5′ and 3′ end of the sequences that are present on the RNA transcript. These sequences are referred to as “flanking” sequences or regions (these flanking sequences are located 5′ or 3′ to the non-translated sequences present on the mRNA transcript). The 5′ flanking region can contain regulatory sequences such as promoters and enhancers that control or influence the transcription of the gene. The 3′ flanking region can contain sequences that direct the termination of transcription, post-transcriptional cleavage and polyadenylation.
The term “siRNAs” refers to short interfering RNAs. In some embodiments, siRNAs comprise a duplex, or double-stranded region, of about 18-25 nucleotides long; often siRNAs contain from about two to four unpaired nucleotides at the 3′ end of each strand. At least one strand of the duplex or double-stranded region of a siRNA is substantially homologous to or substantially complementary to a target RNA molecule. The strand complementary to a target RNA molecule is the “antisense strand;” the strand homologous to the target RNA molecule is the “sense strand,” and is also complementary to the siRNA antisense strand. siRNAs can also contain additional sequences; non-limiting examples of such sequences include linking sequences, or loops, as well as stem and other folded structures. siRNAs appear to function as key intermediaries in triggering RNA interference in invertebrates and in vertebrates, and in triggering sequence-specific RNA degradation during posttranscriptional gene silencing in plants.
The term “RNA interference” or “RNAi” refers to the silencing or decreasing of gene expression by siRNAs. It is the process of sequence-specific, post-transcriptional gene silencing in animals and plants, initiated by siRNA that is homologous in its duplex region to the sequence of the silenced gene. The gene can be endogenous or exogenous to the organism, present integrated into a chromosome or present in a transfection vector that is not integrated into the genome. The expression of the gene is either completely or partially inhibited. RNAi can also be considered to inhibit the function of a target RNA; the function of the target RNA can be complete or partial.
As used herein, the terms “nucleic acid molecule encoding,” “DNA sequence encoding,” and “DNA encoding” refer to the order or sequence of deoxyribonucleotides along a strand of deoxyribonucleic acid. The order of these deoxyribonucleotides determines the order of amino acids along the polypeptide (protein) chain. The DNA sequence thus codes for the amino acid sequence.
As used herein, the terms “an oligonucleotide having a nucleotide sequence encoding a gene” and “polynucleotide having a nucleotide sequence encoding a gene,” means a nucleic acid sequence comprising the coding region of a gene or in other words the nucleic acid sequence that encodes a gene product. The coding region can be present in a cDNA, genomic DNA or RNA form. When present in a DNA form, the oligonucleotide or polynucleotide can be single-stranded (i.e., the sense strand) or double-stranded. Suitable control elements such as enhancers/promoters, splice junctions, polyadenylation signals, etc. can be placed in close proximity to the coding region of the gene if needed to permit proper initiation of transcription and/or correct processing of the primary RNA transcript. Alternatively, the coding region utilized in the expression vectors of the present invention can contain endogenous enhancers/promoters, splice junctions, intervening sequences, polyadenylation signals, etc. or a combination of both endogenous and exogenous control elements.
As used herein the term “portion” when in reference to a nucleotide sequence (as in “a portion of a given nucleotide sequence”) refers to fragments of that sequence. The fragments can range in size from four nucleotides to the entire nucleotide sequence minus one nucleotide (10 nucleotides, 20, 30, 40, 50, 100, 200, etc.).
The phrases “hybridizes”, “selectively hybridizes”, or “specifically hybridizes” refer to the binding or duplexing of a molecule only to a particular nucleotide sequence under stringent hybridization conditions when that sequence is present in a complex mixture (e.g., a library of DNAs or RNAs). See, e.g., Andersen (1998) Nucleic Acid Hybridization Springer-Verlag; Ross (ed. 1997) Nucleic Acid Hybridization Wiley.
The phrase “stringent hybridization conditions” refers to conditions under which a probe will hybridize to its target subsequence, typically in a complex mixture of nucleic acid, but to no other sequences. Stringent conditions are sequence-dependent and will be different in different circumstances. Longer sequences hybridize specifically at higher temperatures. An extensive guide to the hybridization of nucleic acids is found in Tijssen, Techniques in Biochemistry and Molecular Biology—Hybridization with Nucleic Probes, “Overview of principles of hybridization and the strategy of nucleic acid assays” (1993). Generally, stringent conditions are selected to be about 5-10° C. lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength. The Tm is the temperature (under defined ionic strength, pH, and nucleic concentration) at which 50% of the probes complementary to the target hybridize to the target sequence at equilibrium (as the target sequences are present in excess, at Tm, 50% of the probes are occupied at equilibrium). Stringent conditions will be those in which the salt concentration is less than about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30° C. for short probes (e.g., 10 to 50 nucleotides) and at least about 60° C. for long probes (e.g., greater than 50 nucleotides). Stringent conditions can also be achieved with the addition of destabilizing agents such as formamide. For high stringency hybridization, a positive signal is at least two times or 10 times background hybridization. Exemplary high stringency or stringent hybridization conditions include: 50% formamide, 5×SSC, and 1% SDS incubated at 42° C. or 5×SSC and 1% SDS incubated at 65° C., with a wash in 0.2×SSC and 0.1% SDS at 65° C. For PCR, a temperature of about 36° C. is typical for low stringency amplification, although annealing temperatures can vary between about 32° C. and 48° C. depending on primer length. For high stringency PCR amplification, a temperature of about 62° C. is typical, although high stringency annealing temperatures can range from about 50-65° C., depending on the primer length and specificity. Typical cycle conditions for both high and low stringency amplifications include a denaturation phase of 90-95° C. for 30-120 sec, an annealing phase lasting 30-120 sec., and an extension phase of about 72° C. for 1-2 min.
The terms “in operable combination,” “in operable order,” and “operably linked” as used herein refer to the linkage of nucleic acid sequences in such a manner that a nucleic acid molecule capable of directing the transcription of a given gene and/or the synthesis of a desired protein molecule is produced. The term also refers to the linkage of amino acid sequences in such a manner so that a functional protein is produced.
The term “isolated” when used in relation to a nucleic acid, as in “an isolated oligonucleotide” or “isolated polynucleotide” refers to a nucleic acid sequence that is identified and separated from at least one component or contaminant with which it is ordinarily associated in its natural source. Isolated nucleic acid is such present in a form or setting that is different from that in which it is found in nature. In contrast, non-isolated nucleic acids as nucleic acids such as DNA and RNA found in the state they exist in nature. For example, a given DNA sequence (e.g., a gene) is found on the host cell chromosome in proximity to neighboring genes; RNA sequences, such as a specific mRNA sequence encoding a specific protein, are found in the cell as a mixture with numerous other mRNAs that encode a multitude of proteins. However, isolated nucleic acid encoding a given protein includes, by way of example, such nucleic acid in cells ordinarily expressing the given protein where the nucleic acid is in a chromosomal location different from that of natural cells, or is otherwise flanked by a different nucleic acid sequence than that found in nature. The isolated nucleic acid, oligonucleotide, or polynucleotide can be present in single-stranded or double-stranded form. When an isolated nucleic acid, oligonucleotide or polynucleotide is to be utilized to express a protein, the oligonucleotide or polynucleotide will contain at a minimum the sense or coding strand (i.e., the oligonucleotide or polynucleotide can be single-stranded), but can contain both the sense and anti-sense strands (i.e., the oligonucleotide or polynucleotide can be double-stranded).
“Amino acid sequence” and terms such as “polypeptide”, “protein”, or “peptide” are not meant to limit the amino acid sequence to the complete, native amino acid sequence associated with the recited protein molecule.
The term “native protein” as used herein to indicate that a protein does not contain amino acid residues encoded by vector sequences; that is, the native protein contains only those amino acids found in the protein as it occurs in nature. A native protein can be produced by recombinant means or can be isolated from a naturally occurring source.
As used herein the term “portion” when in reference to a protein (as in “a portion of a given protein”) refers to fragments of that protein. The fragments can range in size from four amino acid residues to the entire amino acid sequence minus one amino acid.
The term “Southern blot,” refers to the analysis of DNA on agarose or acrylamide gels to fractionate the DNA according to size followed by transfer of the DNA from the gel to a solid support, such as nitrocellulose or a nylon membrane. The immobilized DNA is then probed with a labeled probe to detect DNA species complementary to the probe used. The DNA can be cleaved with restriction enzymes prior to electrophoresis. Following electrophoresis, the DNA can be partially depurinated and denatured prior to or during transfer to the solid support. Southern blots are a standard tool of molecular biologists (J. Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press, NY, pp 9.31-9.58 [1989]).
The term “Northern blot,” as used herein refers to the analysis of RNA by electrophoresis of RNA on agarose gels to fractionate the RNA according to size followed by transfer of the RNA from the gel to a solid support, such as nitrocellulose or a nylon membrane. The immobilized RNA is then probed with a labeled probe to detect RNA species complementary to the probe used. Northern blots are a standard tool of molecular biologists (J. Sambrook, et al., supra, pp 7.39-7.52 [1989]).
The term “Western blot” refers to the analysis of protein(s) (or polypeptides) immobilized onto a support such as nitrocellulose or a membrane. The proteins are run on acrylamide gels to separate the proteins, followed by transfer of the protein from the gel to a solid support, such as nitrocellulose or a nylon membrane. The immobilized proteins are then exposed to antibodies with reactivity against an antigen of interest. The binding of the antibodies can be detected by various methods, including the use of radiolabeled antibodies.
The term “transgene” as used herein refers to a foreign gene that is placed into an organism by, for example, introducing the foreign gene into newly fertilized eggs or early embryos. The term “foreign gene” refers to any nucleic acid (e.g., gene sequence) that is introduced into the genome of an animal by experimental manipulations and can include gene sequences found in that animal so long as the introduced gene does not reside in the same location as does the naturally occurring gene.
As used herein, the term “vector” is used in reference to nucleic acid molecules that transfer DNA segment(s) from one cell to another. The term “vehicle” is sometimes used interchangeably with “vector.” Vectors are often derived from plasmids, bacteriophages, or plant or animal viruses.
The term “expression vector” as used herein refers to a recombinant DNA molecule containing a desired coding sequence and appropriate nucleic acid sequences necessary for the expression of the operably linked coding sequence in a particular host organism. Nucleic acid sequences necessary for expression in prokaryotes usually include a promoter, an operator (optional), and a ribosome binding site, often along with other sequences. Eukaryotic cells are known to utilize promoters, enhancers, and termination and polyadenylation signals.
The terms “overexpression” and “overexpressing” and grammatical equivalents, are used in reference to levels of mRNA to indicate a level of expression approximately 1.5-fold higher (or greater) than that observed in a given tissue in a control or non-transgenic animal. Levels of mRNA are measured using any of a number of techniques known to those skilled in the art including, but not limited to Northern blot analysis. Appropriate controls are included on the Northern blot to control for differences in the amount of RNA loaded from each tissue analyzed (e.g., the amount of 28 S rRNA, an abundant RNA transcript present at essentially the same amount in all tissues, present in each sample can be used as a means of normalizing or standardizing the mRNA-specific signal observed on Northern blots). The amount of mRNA present in the band corresponding in size to the correctly spliced transgene RNA is quantified; other minor species of RNA which hybridize to the transgene probe are not considered in the quantification of the expression of the transgenic mRNA.
As used herein, the term “in vitro” refers to an artificial environment and to processes or reactions that occur within an artificial environment. In vitro environments can consist of, but are not limited to, test tubes and cell culture. The term “in vivo” refers to the natural environment (e.g., an animal or a cell) and to processes or reaction that occur within a natural environment.
The terms “test compound” and “candidate compound” refer to any chemical entity, pharmaceutical, drug, and the like that is a candidate for use to treat or prevent a disease, illness, sickness, or disorder of bodily function (e.g., cancer). Test compounds comprise both known and potential therapeutic compounds. A test compound can be determined to be therapeutic by screening using the screening methods of the present invention. In some embodiments of the present invention, test compounds include antisense compounds.
As used herein, the term “sample” includes a specimen or culture obtained from any source, as well as biological and environmental samples. Biological samples can be obtained from animals (including humans) and encompass fluids, solids, tissues, and gases. Biological samples include blood products, such as plasma, serum and the like. Environmental samples include environmental material such as surface matter, soil, water, crystals and industrial samples. Such examples are not however to be construed as limiting the sample types applicable to the present invention.
The present invention provides compositions and methods for treating, characterizing, and diagnosing cancer. In particular, the present invention provides gene expression profiles associated with solid tumor stem cells, as well as novel markers useful for the diagnosis, characterization, and treatment of solid tumor stem cells.
I. Stem Cells and Solid Tumor Stem Cells
Common cancers arise in tissues that contain a large sub-population of proliferating cells that are responsible for replenishing the short-lived mature cells. In such organs, cell maturation is arranged in a hierarchy in which a rare population of stem cells give rise to the mature cells and perpetuate themselves through a process called self renewal (Akashi & Weissman, Developmental Biology of Hematopoiesis, Oxford Univ. Press, NY, 2001; Spangrude et al., 1988, Science 241:58-61; Baum et al., 1992, PNAS 89:2804-8; Morrison et al., 1995, PNAS 92:10302-6; Morrison et al., 1996, Immunity 5:207-16; Morrison et al., 1995, Annu. Rev. Cell Dev. Biol. 11:35-71; Morrison et al., 1997, Dev. 124:1929-39; Morrison & Weissman, 1994, Immunity 1:661; Morrison et al., 1997, Cell 88:287-98; Uchida et al., 2000, PNAS 97:14720-5; Morrison et al., 2000, Cell 101:499-510). Due to their rarity, stem cells should be isolated in order to study their biological, molecular, and biochemical properties. Although it is likely that they give rise to most tissues, stem cells have been rigorously identified and purified in only a few tissues. The stem cells that give rise to the lympho-hematopoietic system, called hematopoietic stem cells (HSCs), have been isolated from mice and humans and are the best characterized stem cells. The utility of tissue containing HSCs has been demonstrated in cancer therapy with their extensive use for bone marrow transplantation to regenerate the hematolymphoid system following myeloablative protocols (Baum et al., Bone Marrow Transplantation, Blackwell Scientific Publications, Boston, 1994). The prospective isolation of HSCs from patients can result in a population that is cancer free for autologous transplantation (Tricot et al., 1998, Blood 91:4489-95; Negrin et al., 2000, Biol Blood Marrow Transplantation 6:262-5; Michallet et al., 2000, Exp. Hematol. 28:858-70; Veona et al., 2002, Br. J. Haematol. 117:642-5; Barbui et al., 2002, Br. J. Haemat. 116:202-10).
Understanding the cellular biology of the tissues in which cancers arise, and specifically of the stem cells residing in those tissues, provides new insights into cancer biology. Several aspects of stem cell biology are relevant to cancer. First, both normal stem cells and cancer stem cells undergo self-renewal, and emerging evidence suggests that similar molecular mechanisms regulate self-renewal in normal stem cells and their malignant counterparts. Next, it is quite likely that mutations that lead to cancer accumulate in normal stem cells. Finally, it is likely that tumors contain a “cancer stem cell” population with indefinite proliferative potential that drives the growth and metastasis of tumors (Southam & Brunschwig, 1961, Cancer 14:971-78; Bruce & Gaag, 1963, Nature 199:79-80; Wodinsky et al., 1967, Cancer Chemother. Rep. 51:415-21; Bergsagel & Valeriote, 1968, Cancer Res. 28:2187-96; Park et al., 1971, J. Natl. Cancer Inst. 46:411-22; Hamburger & Salmon, 1977, Science 197:461-3; Lagasse & Weissman, 1994, J. Exp. Med. 179:1047-52; Reya et al., 2001, Nature 414:105-11; Al-Hajj et al., 2002, PNAS 100:3983).
HSCs are the most studied and best understood somatic stem cell population (Akashi & Weissman, Developmental Biology of Hematopoiesis, Oxford Univ. Press, NY, 2001). Hematopoiesis is a tightly regulated process in which a pool of hematopoietic stem cells eventually gives rise to the lymphohematopoietic system consisting of the formed blood elements, e.g., red blood cells, platelets, granulocytes, macrophages, and B-and T-lymphocytes. These cells are important for oxygenation, prevention of bleeding, immunity, and infections, respectively. In the adult, HSCs have two fundamental properties. First, HSCs need to self-renew in order to maintain the stem cell pool; the total number of HSCs is under strict genetic regulation (Morrison et al., 2002, J. Immunol. 168:635-42). Second, they must undergo differentiation to maintain a constant pool of mature cells in normal conditions, and to produce increased numbers of a particular lineage in response to stresses such as bleeding or infection.
In the hematopoietic system, multipotent cells constitute 0.05% of mouse bone marrow cells and are heterogeneous with respect to their ability to self-renew. There are three different populations of multipotent cells: long-term self-renewing HSCs, short-term self-renewing HSCs, and multipotent progenitors without detectable self-renewal potential (Morrison & Weissman, 1994, Immunity 1:661; Christensen & Weissman, 2001, PNAS 98:14541-6). These populations form a hierarchy in which the long-term HSCs give rise to short-term HSCs, which in turn give rise to multipotent progenitors (FIG. 1 in Morrison & Weissman, 1994, Immunity 1:661). As HSCs mature from the long-term self-renewing pool to multipotent progenitors they become more mitotically active but lose the ability to self-renew. Only long-term HSCs can give rise to mature hematopoietic cells for the lifetime of the animal, while short-term HSCs and multipotent progenitors reconstitute lethally irradiated mice for less than eight weeks (Morrison & Weissman, 1994, Immunity 1:661).
Despite the fact that the phenotypic and functional properties of mouse and human HSCs have been extensively characterized (Baum et al., 1992, PNAS 89:2804-8), our understanding of the fundamental stem cell property, self-renewal, is minimal (Weissman, 2000, Science 287:1442; Osawa et al., 1996, Science 273:242-5; Reya et al., 2001, Nature 414:105-11). In most cases, HSCs differentiate when exposed to combinations of growth factors that can induce extensive proliferation in long-term cultures (Domen et al., 2000, J. Exp. Med. 192:1707-18). Although recent progress has been made in identifying culture conditions that maintain HSC activity in culture for a limited period of time (for example see Miller & Eaves, 1997, PNAS 94:13648-53), it has proven to be exceedingly difficult to identify tissue culture conditions that promote a significant and prolonged expansion of progenitors with transplantable HSC activity.
Maintenance of a tissue or a tumor is determined by a balance of proliferation and cell death (Hanahan & Weinberg, 2000, Cell 100:57-70). In a normal tissue, stem cell numbers are under tight genetic regulation resulting in maintenance a constant number of stem cells in the organ (Phillips et al., 1992, PNAS 89:11607-11; Muller-Sieburg et al., 2000, Blood 95:2446-8; Morrison et al., 2002, J. Immunol. 168:635-42). By contrast, cancer cells have escaped this homeostatic regulation and the number of cells within a tumor that have the ability to self renew is constantly expanding, resulting in the inevitable growth of the tumor. As would be expected, many of the mutations that drive tumor expansion regulate either cell proliferation or survival. For example, the prevention of apoptosis by enforced expression of the oncogene Bcl-2 promotes the development of lymphoma and also results in increased numbers of HSCs in vivo, suggesting that cell death plays a role in regulating the homeostasis of HSCs (Domen et al., 1998, Blood 91:2272-82; Domen et al., 2000, J. Exp. Med. 191:253-64). In fact, the progression to experimental acute myelogenous leukemia in mice requires at least 3, and likely 4 independent events to block the several intrinsically triggered and extrinsically induce programmed cell death pathways of myeloid cells (Traver et al., 1998, Immunity 9:47-57). Proto-oncogenes such as c-myb and c-myc that drive proliferation of tumor cells are also essential for HSCs development (Prochowinki & Kukowska, 1986, Nature 322:848-50; Clarke et al., 1988, Mol. Cellular. Biol. 8:884-92; Mucenski et al., 1991, Cell 65:677-89; Danish et al., 1992, Oncogene 7:901-7).
Since cancer cells and normal stem cells share the ability to self-renew, it is not surprising that a number of genes classically associated with cancer can also regulate normal stem cell development (reviewed in Reya et al., 2001, Nature 414:105-11 and Taipale & Beachy, 2001, Nature 411:349-54). In combination with other growth factors, Shh signaling has also been implicated in the regulation of self-renewal by the finding that cells highly enriched for human hematopoietic stem cells (CD34+Lin−CD38−) exhibited increased self-renewal in response to Shh stimulation in vitro (Bhardwaj et al., 2001, Nat. Immunol. 2:172-80). Several other genes related to oncogenesis have been shown to be important for stem cell function. For example, mice deficient for tal-1/SCL, which is involved in some cases of human acute leukemia, lack embryonic hematopoiesis (Shivdasani et al., 1995, Nature 373:432-4) suggesting that it is required for intrinsic or extrinsic events necessary to initiate hematopoiesis, for maintenance of the earliest definitive blood cells, or for the decision to form blood cells downstream of embryonic HSCs (Shivdasani et al., 1995, Nature 373:432-4; Porcher et al., 1996, Cell 86:47-57). Members of the Hox family have also been implicated in human leukemia. Enforced expression of HoxB4 can affect stem cell functions (Buske et al., 2002, Blood 100:862-681; Antonchuk & Humphries, 2002, Cell 109:39-45). One of the major targets of the p53 tumor suppressor gene is p21cip1. Bone marrow from p21cip1 deficient mice has a reduced ability to serially reconstitute lethally irradiated recipients. Failure at serial transfer could result from exhaustion of the stem cell pool, loss of telomeres, or loss of transplantability (Cheng et al., 2000, Science 287:1804-8). In mice, bmi-1, a gene that cooperates with c-myc to induce lymphoma (van Lohuizen et al., 1991, Nature 353:353-55; van der Lugt et al., 1994, Genes & Dev. 8:757-69), is required for the maintenance of adult HSCs and leukemia cells. Thus, many genes involved in stem cell fate decisions are also involved in malignant transformation.
Two other signaling pathways implicated in oncogenesis in both mice and humans, the Wnt/β-catenin and Notch pathways, can play central roles in the self-renewal of both normal and cancer stem cells. The Notch family of receptors was first identified in Drosophila and has been implicated in development and differentiation (Artavanis-Tsakonas et al., 1999, Science 284:770-6). In C elegans, Notch plays a role in germ cell self-renewal (Berry et al., 1997, Dev 124:925-36). In neural development transient Notch activation initiates an irreversible switch from neurogenesis to gliogenesis by embryonic neural crest stem cells (Morrison et al., 2000, Cell 101:499-510). Notch activation of HSCs in culture using either of the Notch ligands Jagged-1 or Delta transiently increased primitive progenitor activity that could be observed in vitro and in vivo, suggesting that Notch activation promotes either the maintenance of progenitor cell multipotentiality or HSC self-renewal (Shelly et al., 1999, J. Cell Biochem. 73:164-75; Varnum-Finney et al., Nat. Med. 6:1278-81). While the Notch pathway plays a central role in development and the mouse int-3 oncogene is a truncated Notch4 (Gallahan & Callahan, 1997, Oncogene 14:1883-90), the role for Notch in de novo human cancer is complex and less well understood. Various members of the Notch signaling pathway are expressed in cancers of epithelial origin and activation by Notch by chromosomal translocation is involved in some cases of leukemia (Ellisen et al., 1991, Cell 66:649-61; Zagouras et al., 1995, PNAS 92:6414; Liu et al., 1996, Genomics 31:58-64; Capobianco et al., 1997, Mol. Cell. Biol. 17:6265-73; Leethanakul et al., 2000, Oncogene 19:3220-4). Microarray analysis has shown that members of the Notch pathway are often over-expressed by tumor cells (Liu et al., 1996, Genomics 31:58-64; Leethanakul et al., 2000, Oncogene 19:3220-4). A truncated Notch4 mRNA is expressed by some breast cancer cell lines (Imatani & Callahan, 2000, Oncogene 19:223-31). Overexpression of Notch1 leads to growth arrest of a small cell lung cancer cell line, while inhibition of Notch1 signals can induce leukemia cell lines to undergo apoptosis (Shelly et al., 1999, J. Cell Biol. 73:164-75; Artavanis-Tsakonas, 1999, Science 284:770-6; Jehn et al., 1999, J. Immunol. 162:635-8). Work by Miele and colleagues showed that activation of Notch-1 signaling maintains the neoplastic phenotype in Ras-transformed human cells (Weizen et al., 2002, Nat. Med. 8:979-86). They also found that in de novo cancers, cells with an activating Ras mutation also demonstrated increased expression of Notch-1 and Notch-4.
Wnt/β-catenin signaling also plays a pivotal role in the self-renewal of normal stem cells and malignant transformation (Cadigan et al., 1997, Genes & Dev. 11:3286-305; Austin et al., 1997, Blood 89:3624-35; Spink et al., 2000, EMBO 19:2270-9). The Wnt pathway was first implicated in MMTV-induced breast cancer where in deregulated expression of Wnt-1 due to proviral insertion resulted in mammary tumors (Tsukamoto et al., 1988, Cell 55:619-25; Nusse et al., 1991, Cell 64:231). Subsequently, it has been shown that Wnt proteins play a central role in pattern formation. Wnt-1 belongs to large family of highly hydrophobic secreted proteins that function by binding to their cognate receptors, members of the Frizzled and low-density lipoprotein receptor-related protein families, resulting in activation of β-catenin (Cadigan & Nusse, 1997, Dev 11:3286-305; Leethanakul et al., 2000, Oncogene 19:3220-4; Reya et al., 2000, Immunity 13:15-24; Wu et al., 2000, Dev. 127:2773-84; Taiple & Beachy, 2001, Nature 411:349-54). In the absence of receptor activation, β-catenin is marked for degradation by a complex consisting of the Adenomatous Polyposis Coli (APC), Axin and glycogen synthase kinase-3β proteins (Austin et al., 1997, Blood 89:3624-35; van den Berg et al., 1998, Blood 92:3189-202; Gat et al., 1998, Cell 95:605-14; Chan et al., 1999, Nat. Genet. 21:410-3; Hedgepeth et al., 1999, Mol Cell Biol. 19:7147-57; Spink et al., 2000, EMBO 19:2270-9; Leethanakul et al., 2000, Oncogene 19:3220-4). Wnt proteins are expressed in the bone marrow, and activation of Wnt/β-catenin signaling by Wnt proteins in vitro or by expression of a constitutively active β-catenin expands the pool of early progenitor cells and enriched normal transplantable hematopoietic stem cells in tissue culture and in vivo (Austin et al., 1997, Blood 89:3624-35; van den Berg et al., 1998, Blood 92:3189-202; Reya et al., 2001, Nature 414:105-11). Inhibition of Wnt/β-catenin by ectopic expression of Axin, an inhibitor of β-catenin signaling, leads to inhibition of stem cell proliferation both in vitro and in vivo. Other studies suggest that the Wnt/β-catenin pathway mediates stem or progenitor cell self-renewal in other tissues (Gat et al., 1998, Cell 95:605-14; Korinek et al., 1998, Nat. Genet. 19:379-83; Zhu & Watt, 1999, Dev. 126:2285-98; Chan et al., 1999, Nat. Genet. 21:410-3). Higher levels of β-catenin are seen in keratinocytes with higher proliferative potential than those seen in keratinocytes with lower proliferative capacity (Gat et al., 1998, Cell 95:605-14; Chan et al., 1999, Nat. Genet. 21:410-3; Zhu & Watt, 1999, Dev. 126:2285-98). Like their normal hematopoietic stem cell counterparts, enforced expression of an activated β-catenin increased the ability of epidermal stem cells to self renew and decreased their ability to differentiate. Mice that fail to express TCF-4, one of the transcription factors that is activated when bound to β-catenin, soon exhaust their undifferentiated crypt epithelial progenitor cells, further suggesting that Wnt signaling is involved in the self renewal of epithelial stem cells (Korinek et al., 1998, Nat. Genet. 19:379-83; Taipale & Beachy, 2001, Nature 411:349-54).
Activation of β-catenin in colon cancer by inactivation of the protein degradation pathway, most frequently by mutation of APC, is common (Hedgepeth et al., 1999, Mol. Cell. Biol. 19:7147-57; Leethanakul et al., 2000, Oncogene 19:3220-4; Spink et al., 2000, EMBO 19:2270-9; Taipale & Beachy, 2001, Nature 411:349-54). Expression of certain Wnt genes is elevated in some other epithelial cancers suggesting that activation of β-catenin is secondary to ligand activation in such cancers (Nusse, 1992, J. Steroid Biochem. Mol. Biol. 43:9-12; Cadigan & Nusse, 1997, Genes & Dev. 11:3286-305; Kirkoshi et al., 2001, Int. J. Oncol. 19:997-1001; van de Wetering et al., 2002, Cell 111:241-50; Weeraratna et al., 2002, Cancer Cell 1:279-88; Saitoh et al., 2002, Int. J. Oncology 20:343-8; Saitoh et al., 2002, Int. J. Mol. Med. 9:515-9). There is evidence that constitutive activation of the Wnt/β-catenin pathway can confer a stem/progenitor cell phenotype to cancer cells. Inhibition of β-catenin/TCF-4 in a colon cancer cell line induced the expression of the cell cycle inhibitor p21cip-1 and induced the cells to stop proliferating and to acquire a more differentiated phenotype (van de Wetering et al., 2002, Cell 111:241-50). Enforced expression of the proto-oncogene c-myc, which is transcriptionally activated by β-catenin/TCF-4, inhibited the expression of p21cip-1 and allowed the colon cancer cells to proliferate when β-catenin/TCF-4 signaling was blocked, linking Wnt signaling to c-myc in the regulation of cell proliferation and differentiation. Although many studies have implicated the Wnt/β-catenin pathway in breast cancer, activating mutations of β-catenin are rare in this disease and no studies have definitively linked this pathway to human breast cancer (Candidus et al., 1996, Cancer Res. 56:49-52; Sorlie et al., 1998, Hum. Mutat. 12:215; Jonsson et al., 2000, Eur. J. Cancer 36:242-8; Schlosshauer et al., 2000, Cancinogenesis 21:1453-6; Lin et al., 2000, PNAS 97:4262-6; Wong et al., 2002, J. Pathol. 196:145-53).
The implication of roles for genes like Notch, Wnt, c-myc and Shh in the regulation of self-renewal of HSCs and perhaps of stem cells from multiple tissues suggests that there can be common self-renewal pathways in many types of normal somatic stem cells and cancer stem cells. It is important to identify the molecular mechanisms by which these pathways work and to determine whether the pathways interact to regulate the self-renewal of normal stem cells and cancer cells.
The Wnt pathway is involved in the self-renewal of normal stem cells and activating mutations of Wnt induce breast cancer in mice. This pathway plays a role in tumor formation by human breast cancer stem cells isolated from some patients. Furthermore, evidence suggests that the ability of different populations of breast cancer cells to form tumors differs. Interestingly, the expression of members of the Wnt/Frizzled/β-catenin pathway are heterogeneously expressed by different populations of cancer cells and expression of particular members of the pathway can correlate with the capacity to form tumors.
The different populations of cancer cells and tumor cells drive the proliferation of breast cancer cells. Activated β-catenin is seen in the cancer cells in a significant number of patients. The tumors that contain cancer cells with this pathway constitutively active behave differently than those without constitutively activated β-catenin.
II. Xenograft Model of Human Breast Cancer
Although cell lines have led to remarkable advances in our understanding of the molecular and biochemical changes in cancer cells, their use in the identification of effective cancer therapies is somewhat limited. Cell lines are imperfect predictors of drug efficacy in de novo tumors (Brown, 1997, Oncol. Res. 9:213-5; Hoffman, 1999, Invest. New Drugs 17:343-359). Several factors likely account for this deficiency. Cancer cell lines are selected from a sub-population of cancer cells that are specifically adapted to growth in tissue culture and the biological and functional properties of these cell lines can change dramatically (Leglise et al., 1988, Blood Cells 13:319-37; Ikeda et al., 1993, Exp. Hematol. 21:1686-94; Weidmann et al., 1997, Leukemia 11:709-13; Dorrell et al., 2000, Blood 95:102-10). Furthermore, cancer cells from only a minority of breast cancer tumors establish cell lines or xenograft tumors (Ethier et al., 1993, Cancer Res. 53:627-35; Krasna et al., 2002, Breast Cancer Res Treat. 71:219-35). The phenotypic and functional characteristics of these cell lines can change drastically relative to their properties in vivo (Leglise et al., 1988, Blood Cells 13:319-37). For example, the marker expression of both normal hematopoietic and leukemic tissue culture cells can change rapidly in tissue culture and often does not reflect that of the original stem cells from which they were derived (Furley et al., 1986, Blood 68:1101-7; Leglise et al., 1988, Blood Cells 13:319-37; Ikeda et al., 1993, Exp. Hematol. 21:1686-94; Dorrell et al., 2000, Blood 95:102-10). Even when conditions are devised to permit the proliferation of normal stem cells in culture, the conditions often promote self-renewal or differentiation in a way that prevents the stem cells in culture from recapitulating the hierarchy of cell populations that exist in vivo. Taken together, these observations suggest that the biological properties of cancer cell lines can differ markedly from the cancer cells from which they were derived. This likely explains at least in part why the cell lines often are poor predictors of a drug's efficacy in the clinic.
Thus, the lack of an effective method to consistently grow primary human breast cancer cells in vitro or in vivo for long periods of time has severely limited our ability to understand the biology of this disease. The most efficient xenograft models report the engraftment of pieces of breast cancer tumors in the ovarian, but not mammary, fat pad of SCID mice approximately 60-75% of the time (Sakakibara et al., 1996, Cancer J. Si Am. 2:291-300). Engraftment of dissociated cells is not possible in this model, and cancer cells isolated from pleural effusions only form tumors in immunodeficient mice approximately 10% of the time (Hoffman, 1999, Invest. New Drugs 17:343-59). The present invention (see Example 1 below) provides a xenograft model in which one is able to establish tumors from primary breast tumors via injection of tumors in the mammary gland of severely immunodeficient mice. The xenograft of the present invention allows one to do biological and molecular tests to characterize the clonogenic breast cancer cell as well as other cell types. Importantly, the xenograft tumors developed in accordance with the present invention contain the phenotypically diverse cancer cell types found in the human tumors from which they were derived and the different populations of cancer cells differ markedly in their ability to form tumors (Al-Hajj et al., 2003, PNAS 100:3983).
The development of an efficient xenograft model in accordance with the present invention (see e.g., Example 1), has for the first time reliably allows dissociated solid tumor cells obtained from a patient to form tumors. Importantly, this enables one to routinely analyze biochemical pathways in an individual patient's cancer cells and to do molecular manipulations that allow one to understand the cellular consequences of specific genetic pathways on tumor formation by de novo human solid tumor cancer cells.
III. Solid Tumor Stem Cells Cancer Markers
The present invention provides markers whose expression is specifically altered in solid tumor stem cells (e.g. up regulated or down regulated). Such markers find use in the diagnosis and characterization and alteration (e.g., therapeutic targeting) of various cancers (e.g. breast cancer).
Example 4, provided below, describes methods used to identify solid tumor cancer markers. some cancer markers are provided below in Tables 4-8, as well as Notch 4. Cancer stem cell markers predictive of clinical outcome are provided in Table 9. While these tables provide gene names, it is noted that the present invention contemplates the use of both the nucleic acid sequences as well as the peptides encoded thereby, as well as fragments of the nucleic acid and peptides, in the therapeutic and diagnostic methods and compositions of the present invention.
Additional solid tumor stem cells cancer markers can be identified, for example, using the methods described in Example 4 below.
Another aspect of the present invention is the means and methods for classifying tumors based upon the profiling of solid tumor samples by comparing a gene expression signature of a cancer sample to a cancer stem cell gene expression signature. This invention for the first time has discovered tumor stem cell gene expression signatures that are predictors of distant metastases and death. The microarray data of the present invention identifies cancer stem cell markers likely to play a role in breast cancer development, progression, and/or maintenance while also identifying cancer stem cell gene signatures useful in classifying breast tumors into low and high risk of, for example, metastasis and death. Classification based on the detection of differentially expressed polynucleotides and/or proteins that comprise a cancer gene profile when compared to a cancer stem cell gene signature can be used to predict clinical course, predict sensitivity to chemotherapeutic agents, guide selection of appropriate therapy, and monitor treatment response. Furthermore, following the development of therapeutics targeting such cancer stem cell markers, detection of cancer gene signatures described in detail below will allow the identification of patients likely to benefit from such therapeutics.
As described herein, the invention employs methods for clustering genes into gene expression profiles by determining their expression levels in two different cell or tissue samples. The invention further envisions using these gene profiles as compared to a cancer stem cell gene signature to predict clinical outcome including, for example, metastasis and death. The microarray data of the present invention identifies gene profiles comprising similarly and differentially expressed genes contained on the Affymetrix HG-U133 array between two tissue samples, one a test sample and one a reference sample, including between tumor stem cells and normal breast epithelium, non-tumorigenic tumor cells and normal breast epithelium, and tumor stem cells and non-tumorigenic tumor cells. These broad gene expression profiles can then be further refined, filtered, and subdivided into gene signatures based on various different criteria including, but not limited to, fold expression change, statistical analyses (e.g. t-test P value from multiple compared samples), biological function (e.g. cell cycle regulators, transcription factors, proteases, etc.), some therapeutic targets (e.g. genes encoding extracellular membrane associated proteins suitable for antibody based therapeutics), identified expression in additional patient samples, and ability to predict clinical outcome.
Thus in some embodiments of the present invention the genes differentially expressed in tumor stem cells versus normal breast epithelium are subdivided into different cancer stem cell gene signatures based on their fold expression change. For example genes with between 2 to 2.5 fold elevated (or reduced, or both elevated and reduced) expression in tumor stem cells can comprise one tumor stem cell gene signature, genes with between 2.5 to 3 fold elevated (or reduced, or both) expression can comprise another tumor stem cell gene signature. Alternatively, all genes above a certain fold expression change are included in a tumor stem cell gene signature. For example, all genes with a 1 fold or more reduced (or elevated, or both) expression in tumor stem cells can comprise one tumor stem cell gene signature, all genes with a 2 fold or more reduced (or elevated, or both) expression in tumor stem cells can comprise another tumor stem cell gene signature, and so on. In other embodiments, the genes differentially expressed in tumor stem cells versus normal breast epithelium are filtered by using statistical analysis. For example, all genes with elevated (or reduced, or both) expression with a t-test P value across samples between 0.01 and 0.005 can comprise one tumor stem cell gene signature, all genes with elevated (or reduced, or both) expression with a t-test P value across samples of 0.005 and 0.001 can comprise another tumor stem cell gene signature, and so on. Furthermore, gene expression analysis of independent patient samples or different cell lines can be compared to any cancer stem cell gene signature generated as described above. A tumor stem cell gene signature can be modified, for example, by calculating individual phenotype association indices as described (Glinsky et al., 2004, Clin. Cancer Res. 10:2272) to increase or maintain the predictive power of a given tumor stem cell gene signature. In addition a tumor stem cell gene signature can be further narrowed or expanded gene by gene by excluding or including genes subjectively (e.g. inclusion of a some therapeutic targetor exclusion of a gene included in another gene signatures).
In yet further embodiments, a broad gene expression profile such as those generated by the Affymetrix HG-U133 array analyses of the present invention can be further refined, filtered, or subdivided into gene signatures based on two or more different criteria. In some embodiments of the present invention the genes differentially expressed in tumor stem cells versus normal breast epithelium are subdivided into different tumor stem cell gene signatures based on their fold expression change as well as their biological function. For example, all genes involved in cell cycle regulation with between 3 to 3.5 fold elevated (or reduced, or both) expression in tumor stem cells versus normal breast epithelium can comprise one tumor stem cell gene signature, all genes involved in cell cycle regulation with between 3.5 to 4 fold elevated (or reduced, or both) expression can comprise another tumor stem cell gene signature, all genes encoding extracellular membrane associated proteins with 4 fold or more elevated (or reduced, or both) expression can comprise another tumor stem cell gene signature, all genes encoding extracellular membrane associated proteins with 5 fold or more elevated (or reduced, or both) expression can comprise yet another tumor stem cell gene signature.
In one some embodiment, the genes differentially expressed in tumor stem cells versus normal breast epithelium are divided into different tumor stem cell gene signatures based on their fold expression change and by statistical analysis. Specifically, in some embodiments the microarray analysis of the invention was used to identify genes with two-fold reduced and two-fold elevated expression in tumorigenic cells versus normal breast epithelium. This tumor stem cell gene signature was then further filtered by the P value of a t-test between the tumorigenic and normal breast epithelium samples to generate nine cancer stem cell gene signatures (gene signatures 1-9) comprising an increasingly restricted number of genes: 290, 242, 186, 114, 62, 51, 37, 30, and 16 genes, respectively, of which five (gene signatures 1-4, &7) are described in detail (Tables 9A-13A). These cancer stem cell gene signatures were then compared against gene expression analysis from two independent cancer patient populations: 295 consecutive early breast cancer patients from the Netherlands Cancer Institute (van de Vijver et al., 2002, N. Eng. J. Med. 347:1999) and 286 lymph node negative breast cancer patients from the Erasmus Medical Center (Wang et al., 2005, Lancet 365:671) as described in detailed in the Examples below (Tables 9B,C,D-13B,C,D).
In another some embodiment, the genes differentially expressed in tumor stem cells versus normal breast epithelium are divided into different tumor stem cell gene signatures based on the fold expression change. Specifically, in some embodiments the microarray analysis of the invention was used to generate eight cancer stem cell gene signatures (gene signature 10-17) with reduced and elevated expression at log 2 (4 fold), log 2.5 (5.7 fold), log 3 (8 fold), log 3.5 (11.3 fold), log 4 (16 fold), log 4.5 (22.6), log 5 (32 fold) and a log 5.5 (45.3 fold) comprising 585, 331, 205, 127, 89, 51, 35, and 20 genes, respectively (Tables 17A-24A). These cancer stem cell gene signatures were then compared to gene expression profiles generated from two independent cancer patient populations: 295 consecutive early breast cancer patients from the Netherlands Cancer Institute (van de Vijver et al., 2002, N. Eng. J. Med. 347:1999) and 286 lymph node negative breast cancer patients from the Erasmus Medical Center (Wang et al., 2005, Lancet 365:671) as described in detailed in the Examples below (Tables 17B,C,D-21B,C,D; 22A,B, C; 23A,B-24A,B).
* All genes overlap with the 295 Array
* All genes overlap with the 295 Array
The invention further embodies the use of these tumor stem cell gene signatures to predict clinical outcome including, but not limited to, metastasis and death. Any independent patient population that includes gene expression analysis (e.g microarray analysis, immunohistochemical analysis, etc) or tumor samples suitable for gene expression analysis (e.g. frozen tissue biopsies, paraffin embedded tumor tissue samples, etc) along with determined clinical parameters or ongoing monitoring of clinical parameters including, for example, lymph node status, metastasis, death, etc. can be used to assess the ability of a tumor stem cell gene signature to predict clinical outcomes. In some embodiments the invention tests the ability of cancer stem cell gene signatures to predict metastasis and death in two independent cancer patient populations: 295 consecutive early breast cancer patients from the Netherlands Cancer Institute (van de Vijver et al., 2002, N. Eng. J. Med. 347:1999) and 286 lymph node negative breast cancer patients from the Erasmus Medical Center (Wang et al., 2005, Lancet 365:671). Many statistical analyses can be used to determine predictive ability. These include, for example, Kaplan-Meier survival analysis, Cox proportional hazard survival analysis, chi-square analysis, or multivariate analysis. In some embodiments of the present invention, Cox proportional hazard survival analysis of the 295 consecutive early breast cancer patients from the Netherlands Cancer Institute and chi-square analysis of the 286 patients from the Erasmus Medical Center were used to identify cancer stem cell gene signatures significantly predictive of clinical outcome (
The invention therefore establishes multiple cancer stem cell gene signatures, as predictors of poor clinical outcome. In some embodiments of the present invention these cancer stem cell gene signatures are used clinically to classify tumors as low or high risk and to assign a tumor to a low or high-risk category. The cancer stem cell gene signatures can further be used to provide a diagnosis, prognosis, and/or select a therapy based on the classification of a tumor as low or high risk as well as to monitor a diagnosis, prognosis, and/or therapy over time. If it is known that a patient has a tumor that expresses the genes comprising a cancer stem cell gene signature and thus has a poor prognosis, a more aggressive approach to therapy can be warranted than in tumors not falling within these subcategories. For example, in patients where there is no evidence of disease in lymph nodes (node-negative patients), a decision must be made regarding whether to administer chemotherapy (adjuvant therapy) following surgical removal of the tumor. While some patients are likely to benefit from such treatment, it has significant side effects and is preferrably avoided by patients with low risk tumors. Presently it is difficult or impossible to predict which patients would benefit. Knowing that a patient falls into a poor prognosis category can help in this decision. Furthermore, detecting expression of a cancer gene profile that is highly correlated with a cancer stem cell gene signature of the present invention can provide information related to tumor progression. It is well known that as tumors progress, their phenotypic characteristics can change. The invention thus contemplates the possibility that breast tumors can evolve from expressing a cancer gene profile that is highly correlated with a cancer stem cell gene signature to not (or vice versa) either in response to therapy or in response to lack of therapy. Thus detection of a cancer gene profile that either correlates with or fails to correlate with a cancer stem cell gene signature can be used to detect such progression and alter therapy accordingly.
It is well known in the art that some tumors respond to certain therapies while others do not. At present there is very little information that can be used to determine, prior to treatment, the likelihood that a specific tumor will respond to a given therapeutic agent. Many compounds have been tested for anti-tumor activity and appear to be effective in only a small percentage of tumors. Due to the current inability to predict which tumors will respond to a given agent, these compounds have not been developed as therapeutics. This problem reflects the fact that current methods of classifying tumors are limited. However, the present invention offers the possibility of identifying tumor subgroups and characterizing tumors by a significant likelihood of response to a given agent. Tumor sample archives containing tissue samples obtained from patients that have undergone therapy with various agents are available along with information regarding the results of such therapy. In general such archives consist of tumor samples embedded in paraffin blocks. These tumor samples can be analyzed for their expression of polypeptides that are then compared to the polypeptides encoded by the genes comprising a cancer stem cell signature of the present invention. For example, immunohistochemistry can be performed using antibodies that bind to the polypeptides. Alternatively these tumor samples can be analyzed by their expression of polynucleotides that are then compared to the polynucleotides encoded by the genes comprising a cancer stem cell signature of the present invention. For example, RNA can be extracted from the tumor sample and RT-PCR used to quantitatively amplify mRNAs that would then be compared to the mRNAs comprising a cancer stem cell signature. Tumors belonging to one or more of thirteen cancer stem cell subclasses can be identified on the basis of this information. It is then possible to correlate the expression of the cancer gene profile with a cancer stem cell gene signature predicted response of the tumor to therapy, thereby identifying particular compounds that show a superior efficacy against tumors of a certain subclass as compared with their efficacy against tumors overall or against tumors not falling within the subclass. Once such compounds are identified it will be possible to select patients whose tumors fall into a particular subclass for additional clinical trials using these compounds. Such clinical trials, performed on a selected group of patients, are more likely to demonstrate efficacy. The reagents provided herein, therefore, are valuable both for retrospective and prospective trials.
In the case of prospective trials, detection of expression of one or more of the genes or encoded proteins in a cancer gene profile that correlates with a cancer stem cell signature can be used to stratify patients prior to their entry into the trial or while they are enrolled in the trial. In clinical research, stratification is the process or result of describing or separating a patient population into more homogeneous subpopulations according to specified criteria. Stratifying patients initially rather than after the trial is frequently some (including by regulatory agencies such as the U.S. Food and Drug Administration involved in the approval process for a medication), and stratification is frequently useful in performing statistical analysis of the results of a trial. In some cases stratification can be required by the study design. Various stratification criteria can be employed in conjunction with detection of expression of one or more cancer gene profiles that correlate with a cancer stem cell gene signature. Commonly used criteria include age, family history, lymph node status, tumor size, tumor grade, etc. Other criteria that can be used include, but are not limited to, tumor aggressiveness, prior therapy received by the patient, estrogen receptor (ER) and/or progesterone receptor (PR) positivity, Her2/neu status, p53 status, etc. Ultimately, once compounds that exhibit superior efficacy against cancer gene profile tumors that are highly correlated with cancer stem cell gene signature are identified, reagents for detecting expression of the gene profile can be used to guide the selection of appropriate therapy for additional patients. Thus, by providing reagents and methods for classifying tumors based on their expression of a cancer gene profile that is compared to a cancer stem cell gene signature, the present invention provides a means to identify a patient population that can benefit from potentially promising therapies that have been abandoned due to inability to benefit broader or more heterogenous patient populations and further offers a means to individualize cancer therapy.
Information regarding the expression of cancer stem cell signature genes is thus useful even in the absence of specific information regarding their biological function or role in tumor development, progression, and maintenance. Although the reagents disclosed herein find particular application with respect to breast cancer, the invention also contemplates their use to provide diagnostic and/or prognostic information for other cancer types including but not limited to: biliary tract cancer; bladder cancer; brain cancer including glioblastomas and medulloblastomas; cervical cancer; choriocarcinoma; colon cancer; ondometial cancer; esophageal cancer; gastric cancer; hematological neoplasms including acute lymphocytic and myelogenous leukemia; multiple myeloma; AIDS-associated leukemnius and adult T-cell leukemia lymphoma; intraepithelial neoplasms including Bowen's disease and Paget's disease; liver cancer; lung cancer; lymphomas including Hodgkin's disease and lymnphocytic lymphomas; neuroblastomas; oral cancer including squamous cell carcinoma; ovarian cancer including those arising from epithelial cells, stromal cells, germ cells and mesenchymal cells; pancreatic cancer; prostate cancer; rectal cancer; sarcomas including leiomyosarcoma, rhabdomyosarcoma, liposarcoma, fibrosarcoma, and osteosarcoma; skin cancer including melanoma, Kaposi's sarcoma, basocellular cancer, and squamous cell cancer; testicular cancer including germinal tumors such as seminoma, non-seminoma (teratomas, choriocarcinomas), stromal tumors, and germ cell tumors; thyroid cancer including thyroid adenocarcinoma and modullar carcinoma; and renal cancer including adenocarcinoma and Wilms tumor.
In other embodiments of the present invention, the cancer stem cell signatures are used experimentally to test and assess lead compounds including, for example, small molecules, siRNAs, and antibodies for the treatment of cancer. For example tumor cells from a patient can be screened for expression of a cancer stem cell gene signature and then transplanted into the xenograft model described herein and the effect of test compounds, such as for example antibodies against one or more cancer stem cell markers described herein, tested for effects on tumor growth and survival. Furthermore a cancer gene profile can be determined following treatment and the cancer gene profile compared to a cancer stem cell gene signature to assess the effectiveness of the therapy and in turn guide a future treatment regimen. In addition the efficacy of test compounds can be assessed against different tumor subclasses. For example test compounds can be used in xenografts of tumors that express a cancer gene profile that is highly correlated with a tumor stem cell gene signature versus tumors having a gene profile that does not correlate with the tumor stem cell gene signature or that express other gene signature such as, for example, a serum or wound response gene signature (Chang et al., 2005, PNAS 102:3738). Any differences in response of the different tumor subclasses to the test compound are determined and used to optimize treatment for particular classes of tumors.
The cancer stem cell gene signatures were identified from genes that are expressed at decreased or elevated levels in tumor stem cells compared to normal breast epithelium. Thus in certain embodiments of the invention expression levels of mRNA, or amplified or cloned version thereof, are determined from a tumor sample by hybridization to polynucleotides that represent each particular gene comprising a cancer stem cell gene signature. some polynucleotides of this type contain at least about 20 to at least about 32 consecutive basepairs of a gene sequence that is not found in other gene sequences. Even more some are polynucleotides of at least or about 50 to at least or about 400 basepairs of a gene sequence that is not found in other gene sequences. Such polynucleotides are also referred to as polynucleotide probes in that they are capable of hybridizing to sequences of the genes, or unique portions thereof, described herein. The sequences can be those of mRNA encoded by the genes, the corresponding cDNA to such mRNAs, and/or amplified versions of such sequences. In one some embodiment of the invention a cancer gene profile is detected by polynucleotide probes that comprise the polynucleotides comprising the stem cell gene signature immobilized on an array (such as a cDNA microarray).
In another some embodiment of the invention, all or part of the disclosed polynucleotides of a cancer stem cell gene signature are amplified and detected by methods such as the polymerase chain reaction (PCR) and variations thereof, such as, but not limited to, quantitative PCR (Q-PCR), reverse transcription PCR (RT-PCR), and real-time PCR (including means of measuring the initial amounts of mRNA copies for each sequence in a sample). Real-time RT-PCR or real-time Q-PCR can be used. Such methods utilize one or two primers that are complementary and hybridize to portions of a disclosed sequence, where the primers are used to prime nucleic acid synthesis. The newly synthesized nucleic acids are optionally labeled and can be detected directly or by hybridization to a polynucleotide of the invention. Additional methods to detect expressed nucleic acids include RNAse protection assays, including liquid phase hybridizations, and in situ hybridization of cells or tissue samples.
In yet other embodiments of the invention, gene expression can be determined by analysis of protein expression. Protein expression can be detected by use of one or more antibodies specific for one or more epitopes of individual gene products (proteins), or proteolytic fragments thereof, of a cancer stem cell signature in a tumor sample. Detection methodologies suitable for use in the practice of the invention include, but are not limited to, immunohistochemistry of cells in a tumor sample, enzyme linked immunosorbent assays (ELISAs) including antibody sandwich assays of cells in a tumor sample, mass spectroscopy, immuno-PCR, FACS, and protein microarrays.
It is envisioned that any patient sample can be used to detect a cancer stem cell signature. Importantly, though the cancer stem cell signatures were discovered from a comparison of cancer stem cells against a non-tumorigenic tissue, such as for example, normal breast tissue, its prognostic ability was identified from microarray analysis of unfractionated, and thus heterogenous, breast tumor samples normalized either against a reference set of tumor samples (van't Veer et al., 2002, Nature 415:530; van de Vijver et al., 2002, N. Eng. J. Med. 347:1999) or to a target intensity (Wang et al., 2005, Lancet 365:671). Thus unfractioned tumor samples, including but not limited to a solid tissue biopsy, fine needle aspiration, or pleural effusion, can be used for detecting a cancer stem cell signature in the tumor sample and generating a cancer gene profile. More selective samples that are isolated from a heterogenous patient sample such as, for example, by isolating tumorigenic cancer cells, laser capture microdissections, etc. can also be used. Alternatively the sample can permit the collection of cancer cells as well as normal cells for analysis so that the gene expression patterns for each sample can be determined and compared to a cancer stem cell gene signature to generate a cancer gene profile.
IV. Detection of Solid Tumor Stem Cell Cancer Markers
In some embodiments, the present invention provides methods for detection of expression of stem cell cancer markers (e.g., breast cancer stem cell cancer markers). In some embodiments, expression is measured directly (e.g., at the RNA or protein level). In some embodiments, expression is detected in tissue samples (e.g., biopsy tissue). In other embodiments, expression is detected in bodily fluids (e.g., including but not limited to, plasma, serum, whole blood, mucus, and urine). The present invention further provides panels and kits for the detection of markers. In some embodiments, the presence of a stem cell cancer marker is used to provide a prognosis to a subject. The information provided is also used to direct the course of treatment. For example, if a subject is found to have a marker indicative of a solid tumor stem cell (see, e.g. Tables 4-9), additional therapies (e.g., hormonal or radiation therapies) can be started at an earlier point when they are more likely to be effective (e.g., before metastasis). In addition, if a subject is found to have a tumor that is not responsive to hormonal therapy, the expense and inconvenience of such therapies can be avoided.
The present invention is not limited to the markers described above. Any suitable marker that correlates with cancer or the progression of cancer can be utilized. Additional markers are also contemplated to be within the scope of the present invention. Any suitable method can be utilized to identify and characterize cancer markers suitable for use in the methods of the present invention, including but not limited to, those described in illustrative Example 4 below. For example, in some embodiments, markers identified as being up or down-regulated in solid tumor stem cells using the gene expression microarray methods of the present invention are further characterized using tissue microarray, immunohistochemistry, Northern blot analysis, siRNA or antisense RNA inhibition, mutation analysis, investigation of expression with clinical outcome, as well as other methods disclosed herein.
In some embodiments, the present invention provides a panel for the analysis of a plurality of markers. The panel allows for the simultaneous analysis of multiple markers correlating with carcinogenesis and/or metastasis. Depending on the subject, panels can be analyzed alone or in combination in order to provide the best possible diagnosis and prognosis. Markers for inclusion on a panel are selected by screening for their predictive value using any suitable method, including but not limited to, those described in the illustrative examples below.
1. Detection of RNA
In some embodiments, detection of solid tumor stem cell cancer markers (e.g., including but not limited to, those disclosed in Tables 4-9) are detected by measuring the expression of corresponding mRNA in a tissue sample (e.g., breast cancer tissue). mRNA expression can be measured by any suitable method, including but not limited to, those disclosed below.
In some embodiments, RNA is detection by Northern blot analysis. Northern blot analysis involves the separation of RNA and hybridization of a complementary labeled probe.
In still further embodiments, RNA (or corresponding cDNA) is detected by hybridization to an oligonucleotide probe). A variety of hybridization assays using a variety of technologies for hybridization and detection are available. For example, in some embodiments, TaqMan assay (PE Biosystems, Foster City, Calif.; See e.g., U.S. Pat. Nos. 5,962,233 and 5,538,848, each of which is herein incorporated by reference) is utilized. The assay is performed during a PCR reaction. The TaqMan assay exploits the 5′-3′ exonuclease activity of the AMPLITAQ GOLD DNA polymerase. A probe consisting of an oligonucleotide with a 5′-reporter dye (e.g., a fluorescent dye) and a 3′-quencher dye is included in the PCR reaction. During PCR, if the probe is bound to its target, the 5′-3′ nucleolytic activity of the AMPLITAQ GOLD polymerase cleaves the probe between the reporter and the quencher dye. The separation of the reporter dye from the quencher dye results in an increase of fluorescence. The signal accumulates with each cycle of PCR and can be monitored with a fluorimeter.
In yet other embodiments, reverse-transcriptase PCR (RT-PCR) is used to detect the expression of RNA. In RT-PCR, RNA is enzymatically converted to complementary DNA or “cDNA” using a reverse transcriptase enzyme. The cDNA is then used as a template for a PCR reaction. PCR products can be detected by any suitable method, including but not limited to, gel electrophoresis and staining with a DNA specific stain or hybridization to a labeled probe. In some embodiments, the quantitative reverse transcriptase PCR with standardized mixtures of competitive templates method described in U.S. Pat. Nos. 5,639,606, 5,643,765, and 5,876,978 (each of which is herein incorporated by reference) is utilized.
2. Detection of Protein
In other embodiments, gene expression of stem cell cancer markers is detected by measuring the expression of the corresponding protein or polypeptide. Protein expression can be detected by any suitable method. In some embodiments, proteins are detected by immunohistochemistry. In other embodiments, proteins are detected by their binding to an antibody raised against the protein. The generation of antibodies is described below.
Antibody binding is detected by techniques known in the art (e.g., radioimmunoassay, ELISA (enzyme-linked immunosorbant assay), “sandwich” immunoassays, immunoradiometric assays, gel diffusion precipitation reactions, immunodiffusion assays, in situ immunoassays (e.g., using colloidal gold, enzyme or radioisotope labels, for example), Western blots, precipitation reactions, agglutination assays (e.g., gel agglutination assays, hemagglutination assays, etc.), complement fixation assays, immunofluorescence assays, protein A assays, and immunoelectrophoresis assays, etc.
In some embodiments, antibody binding is detected by detecting a label on the primary antibody. In another embodiment, the primary antibody is detected by detecting binding of a secondary antibody or reagent to the primary antibody. In a further embodiment, the secondary antibody is labeled. Many methods are known in the art for detecting binding in an immunoassay and are within the scope of the present invention.
In some embodiments, an automated detection assay is utilized. Methods for the automation of immunoassays include those described in U.S. Pat. Nos. 5,885,530, 4,981,785, 6,159,750, and 5,358,691, each of which is herein incorporated by reference. In some embodiments, the analysis and presentation of results is also automated. For example, in some embodiments, software that generates a prognosis based on the presence or absence of a series of proteins corresponding to cancer markers is utilized.
In other embodiments, the immunoassay described in U.S. Pat. Nos. 5,599,677 and 5,672,480; each of which is herein incorporated by reference.
3. cDNA Microarray Technology
cDNA microarrays consist of multiple (usually thousands) of different cDNAs spotted (usually using a robotic spotting device) onto known locations on a solid support, such as a glass microscope slide. The cDNAs are typically obtained by PCR amplification of plasmid library inserts using primers complementary to the vector backbone portion of the plasmid or to the gene itself for genes where sequence is known. PCR products suitable for production of microarrays are typically between 0.5 and 2.5 kB in length. Full length cDNAs, expressed sequence tags (ESTs), or randomly chosen cDNAs from any library of interest can be chosen. ESTs are partially sequenced cDNAs as described, for example, in Hillier, et al., 1996, 6:807-828. Although some ESTs correspond to known genes, frequently very little or no information regarding any particular EST is available except for a small amount of 3′ and/or 5′ sequence and, possibly, the tissue of origin of the mRNA from which the EST was derived. As will be appreciated by one of ordinary skill in the art, in general the cDNAs contain sufficient sequence information to uniquely identify a gene within the human genome. Furthermore, in general the cDNAs are of sufficient length to hybridize, specifically or uniquely, to cDNA obtained from mRNA derived from a single gene under the hybridization conditions of the experiment.
In a typical microarray experiment, a microarray is hybridized with differentially labeled RNA, DNA, or cDNA populations derived from two different samples. Most commonly RNA (either total RNA or poly A+ RNA) is isolated from cells or tissues of interest and is reverse transcribed to yield cDNA. Labeling is usually performed during reverse transcription by incorporating a labeled nucleotide in the reaction mixture. Although various labels can be used, most commonly the nucleotide is conjugated with the fluorescent dyes Cy3 or Cy5. For example, Cy5-dUTP and Cy3-dUTP can be used. cDNA derived from one sample (representing, for example, a particular cell type, tissue type or growth condition) is labeled with one fluorophore while cDNA derived from a second sample (representing, for example, a different cell type, tissue type, or growth condition) is labeled with the second fluorophore. Similar amounts of labeled material from the two samples are cohybridized to the microarray. In the case of a microarray experiment in which the samples are labeled with Cy5 (which fluoresces red) and Cy3 (which fluoresces green), the primary data (obtained by scanning the microarray using a detector capable of quantitatively detecting fluorescence intensity) are ratios of fluorescence intensity (red/green, RIG). These ratios represent the relative concentrations of cDNA molecules that hybridized to the cDNAs represented on the microarray and thus reflect the relative expression levels of the mRNA corresponding to each cDNA/gene represented on the microarray.
Each microarray experiment can provide tens of thousands of data points, each representing the relative expression of a particular gene in the two samples. Appropriate organization and analysis of the data is of key importance, and various computer programs that incorporate standard statistical tools have been developed to facilitate data analysis. One basis for organizing gene expression data is to group genes with similar expression patterns together into clusters. A method for performing hierarchical cluster analysis and display of data derived from microarray experiments is described in Eisen et al., 1998, PNAS 95:14863-14868. As described therein, clustering can be combined with a graphical representation of the primary data in which each data point is represented with a color that quantitatively and qualitatively represents that data point. By converting the data from a large table of numbers into a visual format, this process facilitates an intuitive analysis of the data. Additional information and details regarding the mathematical tools and/or the clustering approach itself can be found, for example, in Sokal & Sneath, Principles of numerical taxonomy, xvi, 359, W. H. Freeman, San Francisco, 1963; Hartigan, Clustering algorithms, xiii, 351, Wiley, New York, 1975; Paull et al., 1989, J. Natl. Cancer Inst. 81:1088-92; Weinstein et al. 1992, Science 258:447-51; van Osdol et al., 1994, J. Natl. Cancer Inst. 86:1853-9; and Weinstein et al., 1997, Science, 275:343-9.
Further details of the experimental methods used in the present invention are found in the Examples. Additional information describing methods for fabricating and using microarrays is found in U.S. Pat. No. 5,807,522, which is herein incorporated by reference. Instructions for constructing microarray hardware (e.g., arrayers and scanners) using commercially available parts can be found at http://cmgm.stanford.edu/pbr-own/ and in Cheung et al., 1999, Nat. Genet. Supplement 21:15-19, which are herein incorporated by reference. Additional discussions of microarray technology and protocols for preparing samples and performing microarray experiments are found in, for example, DNA arrays for analysis of gene expression, Methods Enzymol, 303:179-205, 1999; Fluorescence-based expression monitoring using microarrays, Methods Enzymol, 306: 3-18, 1999; and M. Schena (ed.), DNA Microarrays: A Practical Approach, Oxford University Press, Oxford, UK, 1999. Descriptions of how to use an arrayer and the associated software are found at http://cmgm.stanford.edu/pbrown/mguide/arrayerHTML/ArrayerDocs.html, which is herein incorporated by reference.
4. Data Analysis
In some embodiments, a computer-based analysis program is used to translate the raw data generated by the detection assay (e.g., the presence, absence, or amount of a given marker or markers) into data of predictive value for a clinician. The clinician can access the predictive data using any suitable means. Thus, in some some embodiments, the present invention provides the further benefit that the clinician, who is not likely to be trained in genetics or molecular biology, need not understand the raw data. The data is presented directly to the clinician in its most useful form. The clinician is then able to immediately utilize the information in order to optimize the care of the subject.
The present invention contemplates any method capable of receiving, processing, and transmitting the information to and from laboratories conducting the assays, information provides, medical personal, and subjects. For example, in some embodiments of the present invention, a sample (e.g., a biopsy or a serum or urine sample) is obtained from a subject and submitted to a profiling service (e.g., clinical lab at a medical facility, genomic profiling business, etc.), located in any part of the world (e.g., in a country different than the country where the subject resides or where the information is ultimately used) to generate raw data. Where the sample comprises a tissue or other biological sample, the subject can visit a medical center to have the sample obtained and sent to the profiling center, or subjects can collect the sample themselves and directly send it to a profiling center. Where the sample comprises previously determined biological information, the information can be directly sent to the profiling service by the subject (e.g., an information card containing the information can be scanned by a computer and the data transmitted to a computer of the profiling center using an electronic communication system). Once received by the profiling service, the sample is processed and a profile is produced (e.g., expression data), specific for the diagnostic or prognostic information desired for the subject.
The profile data is then prepared in a format suitable for interpretation by a treating clinician. For example, rather than providing raw expression data (e.g. examining a number of the markers described in Tables 4-9), the prepared format can represent a diagnosis or risk assessment for the subject, along with recommendations for particular treatment options. The data can be displayed to the clinician by any suitable method. For example, in some embodiments, the profiling service generates a report that can be printed for the clinician (e.g., at the point of care) or displayed to the clinician on a computer monitor.
In some embodiments, the information is first analyzed at the point of care or at a regional facility. The raw data is then sent to a central processing facility for further analysis and/or to convert the raw data to information useful for a clinician or patient. The central processing facility provides the advantage of privacy (all data is stored in a central facility with uniform security protocols), speed, and uniformity of data analysis. The central processing facility can then control the fate of the data following treatment of the subject. For example, using an electronic communication system, the central facility can provide data to the clinician, the subject, or researchers.
In some embodiments, the subject is able to directly access the data using the electronic communication system. The subject can chose further intervention or counseling based on the results. In some embodiments, the data is used for research use. For example, the data can be used to further optimize the inclusion or elimination of markers as useful indicators of a particular condition or stage of disease.
5. Kits
In yet other embodiments, the present invention provides kits for the detection and characterization of cancer (e.g. for detecting one or more of the markers shown in Tables 4-9, or for modulating the activity of a peptide expressed by one or more of markers shown in Tables 4-9). In some embodiments, the kits contain antibodies specific for a cancer marker, in addition to detection reagents and buffers. In other embodiments, the kits contain reagents specific for the detection of mRNA or cDNA (e.g., oligonucleotide probes or primers). In some embodiments, the kits contain all of the components necessary and/or sufficient to perform a detection assay, including all controls, directions for performing assays, and any necessary software for analysis and presentation of results.
Another aspect of the present invention comprises a kit to test for the presence of the polynucleotides or proteins, e.g. in a tissue sample or in a body fluid, of a cancer stem cell signature. The kit can comprise, for example, an antibody for detection of a polypeptide or a probe for detection of a polynucleotide. In addition, the kit can comprise a reference or control sample; instructions for processing samples, performing the test and interpreting the results; and buffers and other reagents necessary for performing the test. In certain embodiments the kit comprises a panel of antibodies for detecting expression of one or more of the proteins encoded by the genes of a cancer stem cell signature. In other embodiments the kit comprises pairs of primers for detecting expression of one or more of the genes of the cancer stem cell signature. In yet other embodiments the kit comprises a cDNA or oligonucleotide array for detecting expression of one or more of the genes of a cancer stern cell signature.
6. In Vivo Imaging
In some embodiments, in vivo imaging techniques are used to visualize the expression of cancer markers in an animal (e.g., a human or non-human mammal). For example, in some embodiments, cancer marker mRNA or protein is labeled using a labeled antibody specific for the cancer marker. A specifically bound and labeled antibody can be detected in an individual using an in vivo imaging method, including, but not limited to, radionuclide imaging, positron emission tomography, computerized axial tomography, X-ray or magnetic resonance imaging method, fluorescence detection, and chemiluminescent detection. Methods for generating antibodies to the cancer markers of the present invention are described below.
The in vivo imaging methods of the present invention are useful in the diagnosis of cancers that express the solid tumor stem cell cancer markers of the present invention (e.g., in breast cancer). In vivo imaging is used to visualize the presence of a marker indicative of the cancer. Such techniques allow for diagnosis without the use of an unpleasant biopsy. The in vivo imaging methods of the present invention are also useful for providing prognoses to cancer patients. For example, the presence of a marker indicative of cancer stem cells can be detected. The in vivo imaging methods of the present invention can further be used to detect metastatic cancers in other parts of the body.
In some embodiments, reagents (e.g., antibodies) specific for the cancer markers of the present invention are fluorescently labeled. The labeled antibodies are introduced into a subject (e.g., orally or parenterally). Fluorescently labeled antibodies are detected using any suitable method (e.g., using the apparatus described in U.S. Pat. No. 6,198,107, herein incorporated by reference).
In other embodiments, antibodies are radioactively labeled. The use of antibodies for in vivo diagnosis is well known in the art. Sumerdon et al., (Nucl. Med. Biol 17:247-254 [1990] have described an optimized antibody-chelator for the radioimmunoscintographic imaging of tumors using Indium-111 as the label. Griffin et al., (J Clin One 9:631-640 [1991]) have described the use of this agent in detecting tumors in patients suspected of having recurrent colorectal cancer. The use of similar agents with paramagnetic ions as labels for magnetic resonance imaging is known in the art (Lauffer, Magnetic Resonance in Medicine 22:339-342 [1991]). The label used will depend on the imaging modality chosen. Radioactive labels such as Indium-111, Technetium-99m, or Iodine-131 can be used for planar scans or single photon emission computed tomography (SPECT). Positron emitting labels such as Fluorine-19 can also be used for positron emission tomography (PET). For MRI, paramagnetic ions such as Gadolinium (III) or Manganese (II) can be used.
Radioactive metals with half-lives ranging from 1 hour to 3.5 days are available for conjugation to antibodies, such as scandium-47 (3.5 days) gallium-67 (2.8 days), gallium-68 (68 minutes), technetiium-99m (6 hours), and indium-11 (3.2 days), of which gallium-67, technetium-99m, and indium-111 are preferable for gamma camera imaging, gallium-68 is preferable for positron emission tomography.
A useful method of labeling antibodies with such radiometals is by means of a bifunctional chelating agent, such as diethylenetriaminepentaacetic acid (DTPA), as described, for example, by Khaw et al. (Science 209:295 [1980]) for In-111 and Tc-99m, and by Scheinberg et al. (Science 215:1511 [1982]). Other chelating agents can also be used, but the 1-(p-carboxymethoxybenzyl)EDTA and the carboxycarbonic anhydride of DTPA are advantageous because their use permits conjugation without affecting the antibody's immunoreactivity substantially.
Another method for coupling DPTA to proteins is by use of the cyclic anhydride of DTPA, as described by Hnatowich et al. (Int. J. Appl. Radiat. Isot. 33:327 [1982]) for labeling of albumin with In-111, but which can be adapted for labeling of antibodies. A suitable method of labeling antibodies with Tc-99m which does not use chelation with DPTA is the pretinning method of Crockford et al., (U.S. Pat. No. 4,323,546, herein incorporated by reference).
A some method of labeling immunoglobulins with Tc-99m is that described by Wong et al. (Int. J. Appl. Radiat. Isot., 29:251 [1978]) for plasma protein, and recently applied successfully by Wong et al. (J. Nucl. Med., 23:229 [1981]) for labeling antibodies.
In the case of the radiometals conjugated to the specific antibody, it is likewise desirable to introduce as high a proportion of the radiolabel as possible into the antibody molecule without destroying its immunospecificity. A further improvement can be achieved by effecting radiolabeling in the presence of the specific stem cell cancer marker of the present invention, to insure that the antigen binding site on the antibody will be protected.
In still further embodiments, in vivo biophotonic imaging (Xenogen, Almeda, Calif.) is utilized for in vivo imaging. This real-time in vivo imaging utilizes luciferase. The luciferase gene is incorporated into cells, microorganisms, and animals (e.g., as a fusion protein with a cancer marker of the present invention). When active, it leads to a reaction that emits light. A CCD camera and software is used to capture the image and analyze it.
V. Antibodies and Antibody Fragments
The present invention provides isolated antibodies and antibody fragments (e.g, Fabs). In some embodiments, the present invention provides monoclonal antibodies or antibody fragments that specifically bind to an isolated polypeptide comprised of at least five, or at least 15 amino acid residues of the stem cell cancer markers described herein (e.g., as shown in Tables 4-9). These antibodies or antibody fragments find use in the diagnostic, drug screening, and therapeutic methods described herein (e.g. to detect or modulate the activity of a stem cell cancer marker peptide).
An antibody, or antibody fragment, against a protein of the present invention can be any monoclonal or polyclonal antibody, as long as it can recognize the protein. Antibodies can be produced by using a protein of the present invention as the antigen according to a conventional antibody or antiserum preparation process.
The present invention contemplates the use of both monoclonal and polyclonal antibodies. Any suitable method can be used to generate the antibodies used in the methods and compositions of the present invention, including but not limited to, those disclosed herein. For example, for preparation of a monoclonal antibody, protein, as such, or together with a suitable carrier or diluent is administered to an animal (e.g., a mammal) under conditions that permit the production of antibodies. For enhancing the antibody production capability, complete or incomplete Freund's adjuvant can be administered. Normally, the protein is administered once every 2 weeks to 6 weeks, in total, about 2 times to about 10 times. Animals suitable for use in such methods include, but are not limited to, primates, rabbits, dogs, guinea pigs, mice, rats, sheep, goats, etc.
For preparing monoclonal antibody-producing cells, an individual animal whose antibody titer has been confirmed (e.g., a mouse) is selected, and 2 days to 5 days after the final immunization, its spleen or lymph node is harvested and antibody-producing cells contained therein are fused with myeloma cells to prepare the desired monoclonal antibody producer hybridoma. Measurement of the antibody titer in antiserum can be carried out, for example, by reacting the labeled protein, as described hereinafter and antiserum and then measuring the activity of the labeling agent bound to the antibody. The cell fusion can be carried out according to known methods, for example, the method described by Koehler and Milstein (Nature 256:495 [1975]). As a fusion promoter, for example, polyethylene glycol (PEG) or Sendai virus (HVJ) is used.
Examples of myeloma cells include NS-1, P3U1, SP2/0, AP-1 and the like. The proportion of the number of antibody producer cells (spleen cells) and the number of myeloma cells to be used is about 1:1 to about 20:1. PEG (e.g., PEG 1000-PEG 6000) can be added in concentration of about 10% to about 80%. Cell fusion can be carried out efficiently by incubating a mixture of both cells at about 20° C. to about 40° C. or about 30° C. to about 37° C. for about 1 minute to 10 minutes.
Various methods can be used for screening for a hybridoma producing the antibody (e.g., against a tumor antigen or autoantibody of the present invention). For example, where a supernatant of the hybridoma is added to a solid phase (e.g., microplate) to which antibody is adsorbed directly or together with a carrier and then an anti-immunoglobulin antibody (if mouse cells are used in cell fusion, anti-mouse immunoglobulin antibody is used) or Protein A labeled with a radioactive substance or an enzyme is added to detect the monoclonal antibody against the protein bound to the solid phase. Alternately, a supernatant of the hybridoma is added to a solid phase to which an anti-immunoglobulin antibody or Protein A is adsorbed and then the protein labeled with a radioactive substance or an enzyme is added to detect the monoclonal antibody against the protein bound to the solid phase.
Selection of the monoclonal antibody can be carried out according to any known method or its modification. Normally, a medium for animal cells to which HAT (hypoxanthine, aminopterin, thymidine) are added is employed. Any selection and growth medium can be employed as long as the hybridoma can grow. For example, RPMI 1640 medium containing 1% to 20%, or 10% to 20% fetal bovine serum, GIT medium containing 1% to 10% fetal bovine serum, a serum free medium for cultivation of a hybridoma (SFM-101, Nissui Seiyaku) and the like can be used. Normally, the cultivation is carried out at 20° C. to 40° C., or 37° C. for about 5 days to 3 weeks, or 1 week to 2 weeks under about 5% CO2 gas. The antibody titer of the supernatant of a hybridoma culture can be measured according to the same manner as described above with respect to the antibody titer of the anti-protein in the antiserum.
Separation and purification of a monoclonal antibody (e.g., against a cancer marker of the present invention) can be carried out according to the same manner as those of conventional polyclonal antibodies such as separation and purification of immunoglobulins, for example, salting-out, alcoholic precipitation, isoelectric point precipitation, electrophoresis, adsorption and desorption with ion exchangers (e.g., DEAE), ultracentrifugation, gel filtration, or a specific purification method wherein only an antibody is collected with an active adsorbent such as an antigen-binding solid phase, Protein A or Protein G and dissociating the binding to obtain the antibody.
Polyclonal antibodies can be prepared by any known method or modifications of these methods including obtaining antibodies from patients. For example, a complex of an immunogen (an antigen against the protein) and a carrier protein is prepared and an animal is immunized by the complex according to the same manner as that described with respect to the above monoclonal antibody preparation. A material containing the antibody against is recovered from the immunized animal and the antibody is separated and purified.
As to the complex of the immunogen and the carrier protein to be used for immunization of an animal, any carrier protein and any mixing proportion of the carrier and a hapten can be employed as long as an antibody against the hapten, which is crosslinked on the carrier and used for immunization, is produced efficiently. For example, bovine serum albumin, bovine cycloglobulin, keyhole limpet hemocyanin, etc. can be coupled to a hapten in a weight ratio of about 0.1 parts to about 20 parts, or about 1 part to about 5 parts per 1 part of the hapten.
In addition, various condensing agents can be used for coupling of a hapten and a carrier. For example, glutaraldehyde, carbodiimide, maleimide activated ester, activated ester reagents containing thiol group or dithiopyridyl group, and the like find use with the present invention. The condensation product as such or together with a suitable carrier or diluent is administered to a site of an animal that permits the antibody production. For enhancing the antibody production capability, complete or incomplete Freund's adjuvant can be administered. Normally, the protein is administered once every 2 weeks to 6 weeks, in total, about 3 times to about 10 times.
The polyclonal antibody is recovered from blood, ascites and the like, of an animal immunized by the above method. The antibody titer in the antiserum can be measured according to the same manner as that described above with respect to the supernatant of the hybridoma culture. Separation and purification of the antibody can be carried out according to the same separation and purification method of immunoglobulin as that described with respect to the above monoclonal antibody.
The protein used herein as the immunogen is not limited to any particular type of immunogen. For example, a stem cell cancer marker of the present invention (further including a gene having a nucleotide sequence partly altered) can be used as the immunogen. Further, fragments of the protein can be used. Fragments can be obtained by any methods including, but not limited to expressing a fragment of the gene, enzymatic processing of the protein, chemical synthesis, and the like. The antibodies and antibody fragments can also be conjugated to therapeutic (e.g. cancer cell killing compounds). In this regard, the antibody directed toward one of the stem cell cancer markers is used to specifically deliver a therapeutic agent to a solid tumor cancer cell (e.g. to inhibit the proliferation of such sell or kill such a cell).
VI. Drug Screening
In some embodiments, the present invention provides drug screening assays (e.g., to screen for anticancer drugs). The screening methods of the present invention utilize stem cell cancer markers identified using the methods of the present invention (e.g., including but not limited to, the stem cell cancer markers shown in Tables 4-9). For example, in some embodiments, the present invention provides methods of screening for compound that alter (e.g., increase or decrease) the expression of stem cell cancer marker genes. In some embodiments, candidate compounds are antisense agents or siRNA agents (e.g., oligonucleotides) directed against cancer markers. In other embodiments, candidate compounds are antibodies that specifically bind to a stem cell cancer marker of the present invention. In certain embodiments, libraries of compounds of small molecules are screened using the methods described herein.
In one screening method, candidate compounds are evaluated for their ability to alter stem cell cancer marker expression by contacting a compound with a cell expressing a stem cell cancer marker and then assaying for the effect of the candidate compounds on expression. In some embodiments, the effect of candidate compounds on expression of a cancer marker gene is assayed by detecting the level of cancer marker mRNA expressed by the cell. mRNA expression can be detected by any suitable method. In other embodiments, the effect of candidate compounds on expression of cancer marker genes is assayed by measuring the level of polypeptide encoded by the cancer markers. The level of polypeptide expressed can be measured using any suitable method, including but not limited to, those disclosed herein. In some embodiments, other changes in cell biology (e.g., apoptosis) are detected.
Specifically, the present invention provides screening methods for identifying modulators, i.e., candidate or test compounds or agents (e.g., proteins, peptides, peptidomimetics, peptoids, small molecules or other drugs) which bind to, or alter the signaling or function associated with the cancer markers of the present invention, have an inhibitory (or stimulatory) effect on, for example, stem cell cancer marker expression or cancer markers activity, or have a stimulatory or inhibitory effect on, for example, the expression or activity of a cancer marker substrate. Compounds thus identified can be used to modulate the activity of target gene products (e.g., stem cell cancer marker genes) either directly or indirectly in a therapeutic protocol, to elaborate the biological function of the target gene product, or to identify compounds that disrupt normal target gene interactions. Compounds that inhibit the activity or expression of cancer markers are useful in the treatment of proliferative disorders, e.g., cancer, particularly metastatic cancer or eliminating or controlling tumor stem cells to prevent or reduce the risk of cancer.
In some embodiments, the invention provides assays for screening candidate or test compounds that are substrates of a cancer markers protein or polypeptide or a biologically active portion thereof. In another embodiment, the invention provides assays for screening candidate or test compounds that bind to or modulate the activity of a cancer marker protein or polypeptide or a biologically active portion thereof.
The test compounds of the present invention can be obtained using any of the numerous approaches in combinatorial library methods known in the art, including biological libraries; peptoid libraries (libraries of molecules having the functionalities of peptides, but with a novel, non-peptide backbone, which are resistant to enzymatic degradation but which nevertheless remain bioactive; see, e.g., Zuckennann et al., J. Med. Chem. 37: 2678-85 [1994]); spatially addressable parallel solid phase or solution phase libraries; synthetic library methods requiring deconvolution; the ‘one-bead one-compound’ library method; and synthetic library methods using affinity chromatography selection. The biological library and peptoid library approaches are some for use with peptide libraries, while the other four approaches are applicable to peptide, non-peptide oligomer or small molecule libraries of compounds (Lam (1997) Anticancer Drug Des. 12:145).
Examples of methods for the synthesis of molecular libraries can be found in the art, for example in: DeWitt et al., Proc. Natl. Acad. Sci. U.S.A. 90:6909 [1993]; Erb et al., Proc. Nad. Acad. Sci. USA 91:11422 [1994]; Zuckermann et al., J. Med. Chem. 37:2678 [1994]; Cho et al., Science 261:1303 [1993]; Carrell et al., Angew. Chem. Int. Ed. Engl. 33.2059 [1994]; Carell et al., Angew. Chem. Int. Ed. Engl. 33:2061 [1994]; and Gallop et al., J. Med. Chem. 37:1233 [1994].
Libraries of compounds can be presented in solution (e.g., Houghten, Biotechniques 13:412-421 [1992]), or on beads (Lam, Nature 354:82-84 [1991]), chips (Fodor, Nature 364:555-556 [1993]), bacteria or spores (U.S. Pat. No. 5,223,409; herein incorporated by reference), plasmids (Cull et al., Proc. Nad. Acad. Sci. USA 89:18651869 [1992]) or on phage (Scott and Smith, Science 249:386-390 [1990]; Devlin Science 249:404-406 [1990]; Cwirla et al., Proc. Natl. Acad. Sci. 87:6378-6382 [1990]; Felici, J. Mol. Biol. 222:301 [1991]).
In some embodiments, an assay is a cell-based assay in which a cell that expresses a stem cell cancer marker protein or biologically active portion thereof is contacted with a test compound, and the ability of the test compound to the modulate cancer marker's activity is determined. Determining the ability of the test compound to modulate stem cell cancer marker activity can be accomplished by monitoring, for example, changes in enzymatic activity. The cell, for example, can be of mammalian origin.
The ability of the test compound to modulate cancer marker binding to a compound, e.g., a stem cell cancer marker substrate, can also be evaluated. This can be accomplished, for example, by coupling the compound, e.g., the substrate, with a radioisotope or enzymatic label such that binding of the compound, e.g., the substrate, to a cancer marker can be determined by detecting the labeled compound, e.g., substrate, in a complex.
Alternatively, the stem cell cancer marker is coupled with a radioisotope or enzymatic label to monitor the ability of a test compound to modulate cancer marker binding to a cancer markers substrate in a complex. For example, compounds (e.g., substrates) can be labeled with 125I, 35S 14C or 3H, either directly or indirectly, and the radioisotope detected by direct counting of radioemmission or by scintillation counting. Alternatively, compounds can be enzymatically labeled with, for example, horseradish peroxidase, alkaline phosphatase, or luciferase, and the enzymatic label detected by determination of conversion of an appropriate substrate to product.
The ability of a compound (e.g., a stem cell cancer marker substrate) to interact with a stem cell cancer marker with or without the labeling of any of the interactants can be evaluated. For example, a microphysiometer can be used to detect the interaction of a compound with a cancer marker without the labeling of either the compound or the cancer marker (McConnell et al. Science 257:1906-1912 [1992]). As used herein, a “microphysiometer” (e.g., Cytosensor) is an analytical instrument that measures the rate at which a cell acidifies its environment using a light-addressable potentiometric sensor (LAPS). Changes in this acidification rate can be used as an indicator of the interaction between a compound and cancer markers.
In yet another embodiment, a cell-free assay is provided in which a cancer marker protein or biologically active portion thereof is contacted with a test compound and the ability of the test compound to bind to the stem cell cancer marker protein or biologically active portion thereof is evaluated. some biologically active portions of the cancer markers proteins to be used in assays of the present invention include fragments that participate in interactions with substrates or other proteins, e.g., fragments with high surface probability scores.
Cell-free assays involve preparing a reaction mixture of the target gene protein and the test compound under conditions and for a time sufficient to allow the two components to interact and bind, thus forming a complex that can be removed and/or detected.
The interaction between two molecules can also be detected, e.g., using fluorescence energy transfer (FRET) (see, for example, Lakowicz et al., U.S. Pat. No. 5,631,169; Stavrianopoulos et al., U.S. Pat. No. 4,968,103; each of which is herein incorporated by reference). A fluorophore label is selected such that a first donor molecule's emitted fluorescent energy will be absorbed by a fluorescent label on a second, ‘acceptor’ molecule, which in turn is able to fluoresce due to the absorbed energy.
Alternately, the ‘donor’ protein molecule can simply utilize the natural fluorescent energy of tryptophan residues. Labels are chosen that emit different wavelengths of light, such that the ‘acceptor’ molecule label can be differentiated from that of the ‘donor’. Since the efficiency of energy transfer between the labels is related to the distance separating the molecules, the spatial relationship between the molecules can be assessed. In a situation in which binding occurs between the molecules, the fluorescent emission of the ‘acceptor’ molecule label in 1 5 the assay should be maximal. An FRET binding event can be conveniently measured through standard fluorometric detection means well known in the art (e.g., using a fluorimeter).
In another embodiment, determining the ability of the stem cell cancer markers protein to bind to a target molecule can be accomplished using real-time Biomolecular Interaction Analysis (BIA) (see, e.g., Sjolander and Urbaniczky, Anal. Chem. 63:2338-2345 [1991] and Szabo et al. Curr. Opin. Struct. Biol. 5:699-705 [1995]). “Surface plasmon resonance” or “BIA” detects biospecific interactions in real time, without labeling any of the interactants (e.g., BIAcore). Changes in the mass at the binding surface (indicative of a binding event) result in alterations of the refractive index of light near the surface (the optical phenomenon of surface plasmon resonance (SPR)), resulting in a detectable signal that can be used as an indication of real-time reactions between biological molecules.
In some embodiments, the target gene product or the test substance is anchored onto a solid phase. The target gene product/test compound complexes anchored on the solid phase can be detected at the end of the reaction. The target gene product can be anchored onto a solid surface, and the test compound, (which is not anchored), can be labeled, either directly or indirectly, with detectable labels discussed herein.
It can be desirable to immobilize stem cell cancer markers, an anti-cancer marker antibody or its target molecule to facilitate separation of complexed from non-complexed forms of one or both of the proteins, as well as to accommodate automation of the assay. Binding of a test compound to a stem cell cancer marker protein, or interaction of a cancer marker protein with a target molecule in the presence and absence of a candidate compound, can be accomplished in any vessel suitable for containing the reactants. Examples of such vessels include microtiter plates, test tubes, and micro-centrifuge tubes. In some embodiments, a fusion protein can be provided which adds a domain that allows one or both of the proteins to be bound to a matrix. For example, glutathione-S-transferase-cancer marker fusion proteins or glutathione-S-transferase/target fusion proteins can be adsorbed onto glutathione Sepharose beads (Sigma Chemical, St. Louis, Mo.) or glutathione-derivatized microtiter plates, which are then combined with the test compound or the test compound and either the non-adsorbed target protein or cancer marker protein, and the mixture incubated under conditions conducive for complex formation (e.g., at physiological conditions for salt and pH). Following incubation, the beads or microtiter plate wells are washed to remove any unbound components, the matrix immobilized in the case of beads, complex determined either directly or indirectly, for example, as described above.
Alternatively, the complexes can be dissociated from the matrix, and the level of cancer markers binding or activity determined using standard techniques. Other techniques for immobilizing either cancer markers protein or a target molecule on matrices include using conjugation of biotin and streptavidin. Biotinylated cancer marker protein or target molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) using techniques known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, EL), and immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical).
In order to conduct the assay, the non-immobilized component is added to the coated surface containing the anchored component. After the reaction is complete, unreacted components are removed (e.g., by washing) under conditions such that any complexes formed will remain immobilized on the solid surface. The detection of complexes anchored on the solid surface can be accomplished in a number of ways. Where the previously non-immobilized component is pre-labeled, the detection of label immobilized on the surface indicates that complexes were formed. Where the previously non-immobilized component is not pre-labeled, an indirect label can be used to detect complexes anchored on the surface; e.g., using a labeled antibody specific for the immobilized component (the antibody, in turn, can be directly labeled or indirectly labeled with, e.g., a labeled anti-IgG antibody).
This assay is performed utilizing antibodies reactive with stem cell cancer marker protein or target molecules but which do not interfere with binding of the stem cell cancer markers protein to its target molecule. Such antibodies can be derivatized to the wells of the plate, and unbound target or cancer markers protein trapped in the wells by antibody conjugation. Methods for detecting such complexes, in addition to those described above for the GST-immobilized complexes, include immunodetection of complexes using antibodies reactive with the cancer marker protein or target molecule, as well as enzyme-linked assays which rely on detecting an enzymatic activity associated with the cancer marker protein or target molecule.
Alternatively, cell free assays can be conducted in a liquid phase. In such an assay, the reaction products are separated from unreacted components, by any of a number of standard techniques, including, but not limited to: differential centrifugation (see, for example, Rivas and Minton, Trends Biochem Sci 18:284-7 [1993]); chromatography (gel filtration chromatography, ion-exchange chromatography); electrophoresis (see, e.g., Ausubel et al., eds. Current Protocols in Molecular Biology 1999, J. Wiley: New York.); and immunoprecipitation (see, for example, Ausubel et al., eds. Current Protocols in Molecular Biology 1999, J. Wiley: New York). Such resins and chromatographic techniques are known to one skilled in the art (See e.g., Heegaard J. Mol. Recognit. 11: 141-8 [1998]; Hageand Tweed J. Chromatogr. Biomed. Sci. Appl 699:499-525 [1997]). Further, fluorescence energy transfer can also be conveniently utilized, as described herein, to detect binding without further purification of the complex from solution.
The assay can include contacting the stem cell cancer markers protein or biologically active portion thereof with a known compound that binds the cancer marker to form an assay mixture, contacting the assay mixture with a test compound, and determining the ability of the test compound to interact with a cancer marker protein, wherein determining the ability of the test compound to interact with a cancer marker protein includes determining the ability of the test compound to preferentially bind to cancer markers or biologically active portion thereof, or to modulate the activity of a target molecule, as compared to the known compound.
To the extent that stem cell cancer markers can, in vivo, interact with one or more cellular or extracellular macromolecules, such as proteins, inhibitors of such an interaction are useful. A homogeneous assay can be used can be used to identify inhibitors.
For example, a preformed complex of the target gene product and the interactive cellular or extracellular binding partner product is prepared such that either the target gene products or their binding partners are labeled, but the signal generated by the label is quenched due to complex formation (see, e.g., U.S. Pat. No. 4,109,496, herein incorporated by reference, that utilizes this approach for immunoassays). The addition of a test substance that competes with and displaces one of the species from the preformed complex will result in the generation of a signal above background. In this way, test substances that disrupt target gene product-binding partner interaction can be identified. Alternatively, cancer markers protein can be used as a “bait protein” in a two-hybrid assay or three-hybrid assay (see, e.g., U.S. Pat. No. 5,283,317; Zervos et al., Cell 72:223-232 [1993]; Madura et al., J. Biol. Chem. 268.12046-12054 [1993]; Bartel et al., Biotechniques 14:920-924 [1993]; Iwabuchi et al., Oncogene 8:1693-1696 [1993]; and Brent WO 94/10300; each of which is herein incorporated by reference), to identify other proteins, that bind to or interact with cancer markers (“cancer marker-binding proteins” or “cancer marker-bp”) and are involved in cancer marker activity. Such cancer marker-bps can be activators or inhibitors of signals by the cancer marker proteins or targets as, for example, downstream elements of a cancer markers-mediated signaling pathway.
Modulators of cancer markers expression can also be identified. For example, a cell or cell free mixture is contacted with a candidate compound and the expression of cancer marker mRNA or protein evaluated relative to the level of expression of stem cell cancer marker mRNA or protein in the absence of the candidate compound. When expression of cancer marker mRNA or protein is greater in the presence of the candidate compound than in its absence, the candidate compound is identified as a stimulator of cancer marker mRNA or protein expression. Alternatively, when expression of cancer marker mRNA or protein is less (i.e., statistically significantly less) in the presence of the candidate compound than in its absence, the candidate compound is identified as an inhibitor of cancer marker mRNA or protein expression. The level of cancer markers mRNA or protein expression can be determined by methods described herein for detecting cancer markers mRNA or protein.
A modulating agent can be identified using a cell-based or a cell free assay, and the ability of the agent to modulate the activity of a cancer markers protein can be confirmed in vivo, e.g., in an animal such as an animal model for a disease (e.g., an animal with prostate cancer or metastatic prostate cancer; or an animal harboring a xenograft of a prostate cancer from an animal (e.g., human) or cells from a cancer resulting from metastasis of a prostate cancer (e.g., to a lymph node, bone, or liver), or cells from a prostate cancer cell line.
This invention further pertains to novel agents identified by the above-described screening assays (See e.g., below description of cancer therapies). Accordingly, it is within the scope of this invention to further use an agent identified as described herein (e.g., a cancer marker modulating agent, an antisense cancer marker nucleic acid molecule, a siRNA molecule, a cancer marker specific antibody, or a cancer marker-binding partner) in an appropriate animal model (such as those described herein) to determine the efficacy, toxicity, side effects, or mechanism of action, of treatment with such an agent. Furthermore, novel agents identified by the above-described screening assays can be, e.g., used for treatments as described herein (e.g. to treat a human patient who has cancer).
VII. Cancer Therapies
In some embodiments, the present invention provides therapies for cancer (e.g., breast cancer). In some embodiments, therapies target cancer markers (e.g., including but not limited to, those shown in Tables 4-9).
A. Antisense Therapies
Candidate therapeutic agents also find use in drug screening and research applications. In some embodiments, the present invention targets the expression of stem cell cancer markers. For example, in some embodiments, the present invention employs compositions comprising oligomeric antisense compounds, particularly oligonucleotides (e.g., those identified in the drug screening methods described above), for use in modulating the function of nucleic acid molecules encoding stem cell cancer markers of the present invention, ultimately modulating the amount of cancer marker expressed. This is accomplished by providing antisense compounds that specifically hybridize with one or more nucleic acids encoding cancer markers of the present invention. The specific hybridization of an oligomeric compound with its target nucleic acid interferes with the normal function of the nucleic acid. This modulation of function of a target nucleic acid by compounds that specifically hybridize to it is generally referred to as “antisense.” The functions of DNA to be interfered with include replication and transcription. The functions of RNA to be interfered with include all vital functions such as, for example, translocation of the RNA to the site of protein translation, translation of protein from the RNA, splicing of the RNA to yield one or more mRNA species, and catalytic activity that can be engaged in or facilitated by the RNA. The overall effect of such interference with target nucleic acid function is modulation of the expression of cancer markers of the present invention. In the context of the present invention, “modulation” means either an increase (stimulation) or a decrease (inhibition) in the expression of a gene. For example, expression can be inhibited to potentially prevent tumor proliferation.
It is some to target specific nucleic acids for antisense. “Targeting” an antisense compound to a particular nucleic acid, in the context of the present invention, is a multistep process. The process usually begins with the identification of a nucleic acid sequence whose function is to be modulated. This can be, for example, a cellular gene (or mRNA transcribed from the gene) whose expression is associated with a particular disorder or disease state, or a nucleic acid molecule from an infectious agent. In the present invention, the target is a nucleic acid molecule encoding a stem cell cancer marker of the present invention. The targeting process also includes determination of a site or sites within this gene for the antisense interaction to occur such that the desired effect, e.g., detection or modulation of expression of the protein, will result. Within the context of the present invention, a some intragenic site is the region encompassing the translation initiation or termination codon of the open reading frame (ORF) of the gene. Since the translation initiation codon is typically 5′-AUG (in transcribed mRNA molecules; 5′-ATG in the corresponding DNA molecule), the translation initiation codon is also referred to as the “AUG codon,” the “start codon” or the “AUG start codon”. A minority of genes have a translation initiation codon having the RNA sequence 5′-GUG, 5′-UUG or 5′-CUG, and 5′-AUA, 5′-ACG and 5′-CUG have been shown to function in vivo. Thus, the terms “translation initiation codon” and “start codon” can encompass many codon sequences, even though the initiator amino acid in each instance is typically methionine (in eukaryotes) or formylmethionine (in prokaryotes). Eukaryotic and prokaryotic genes can have two or more alternative start codons, any one of which can be preferentially utilized for translation initiation in a particular cell type or tissue, or under a particular set of conditions. In the context of the present invention, “start codon” and “translation initiation codon” refer to the codon or codons that are used in vivo to initiate translation of an mRNA molecule transcribed from a gene encoding a tumor antigen of the present invention, regardless of the sequence(s) of such codons.
Translation termination codon (or “stop codon”) of a gene can have one of three sequences (i.e., 5′-UAA, 5′-UAG and 5′-UGA; the corresponding DNA sequences are 5′-TAA, 5′-TAG and 5′-TGA, respectively). The terms “start codon region” and “translation initiation codon region” refer to a portion of such an mRNA or gene that encompasses from about 25 to about 50 contiguous nucleotides in either direction (i.e., 5′ or 3′) from a translation initiation codon. Similarly, the terms “stop codon region” and “translation termination codon region” refer to a portion of such an mRNA or gene that encompasses from about 25 to about 50 contiguous nucleotides in either direction (i.e., 5′ or 3′) from a translation termination codon.
The open reading frame (ORF) or “coding region,” which refers to the region between the translation initiation codon and the translation termination codon, is also a region that can be targeted effectively. Other target regions include the 5′ untranslated region (5′ UTR), referring to the portion of an mRNA in the 5′ direction from the translation initiation codon, and thus including nucleotides between the 5′ cap site and the translation initiation codon of an mRNA or corresponding nucleotides on the gene, and the 3′ untranslated region (3′ UTR), referring to the portion of an mRNA in the 3′ direction from the translation termination codon, and thus including nucleotides between the translation termination codon and 3′ end of an mRNA or corresponding nucleotides on the gene. The 5′ cap of an mRNA comprises an N7-methylated guanosine residue joined to the 5′-most residue of the mRNA via a 5′-5′ triphosphate linkage. The 5′ cap region of an mRNA is considered to include the 5′ cap structure itself as well as the first 50 nucleotides adjacent to the cap. The cap region can also be a some target region.
Although some eukaryotic mRNA transcripts are directly translated, many contain one or more regions, known as “introns,” that are excised from a transcript before it is translated. The remaining (and therefore translated) regions are known as “exons” and are spliced together to form a continuous mRNA sequence. mRNA splice sites (i.e., intron-exon junctions) can also be some target regions, and are particularly useful in situations where aberrant splicing is implicated in disease, or where an overproduction of a particular mRNA splice product is implicated in disease. Aberrant fusion junctions due to rearrangements or deletions are also some targets. It has also been found that introns can also be effective, and therefore preferred, target regions for antisense compounds targeted, for example, to DNA or pre-mRNA.
In some embodiments, target sites for antisense inhibition are identified using commercially available software programs (e.g., Biognostik, Gottingen, Germany; SysArris Software, Bangalore, India; Antisense Research Group, University of Liverpool, Liverpool, England; GeneTrove, Carlsbad, Calif.). In other embodiments, target sites for antisense inhibition are identified using the accessible site method described in U.S. Patent WO0198537A2, herein incorporated by reference.
Once one or more target sites have been identified, oligonucleotides are chosen that are sufficiently complementary to the target (i.e., hybridize sufficiently well and with sufficient specificity) to give the desired effect. For example, in some embodiments of the present invention, antisense oligonucleotides are targeted to or near the start codon.
In the context of this invention, “hybridization,” with respect to antisense compositions and methods, means hydrogen bonding, which can be Watson-Crick, Hoogsteen or reversed Hoogsteen hydrogen bonding, between complementary nucleoside or nucleotide bases. For example, adenine and thymine are complementary nucleobases that pair through the formation of hydrogen bonds. It is understood that the sequence of an antisense compound need not be 100% complementary to that of its target nucleic acid to be specifically hybridizable. An antisense compound is specifically hybridizable when binding of the compound to the target DNA or RNA molecule interferes with the normal function of the target DNA or RNA to cause a loss of utility, and there is a sufficient degree of complementarity to avoid non-specific binding of the antisense compound to non-target sequences under conditions in which specific binding is desired (i.e., under physiological conditions in the case of in vivo assays or therapeutic treatment, and in the case of in vitro assays, under conditions in which the assays are performed).
Antisense compounds are commonly used as research reagents and diagnostics. For example, antisense oligonucleotides, which are able to inhibit gene expression with specificity, can be used to elucidate the function of particular genes. Antisense compounds are also used, for example, to distinguish between functions of various members of a biological pathway.
The specificity and sensitivity of antisense is also applied for therapeutic uses. For example, antisense oligonucleotides have been employed as therapeutic moieties in the treatment of disease states in animals and man. Antisense oligonucleotides have been safely and effectively administered to humans and numerous clinical trials are presently underway. It is thus established that oligonucleotides are useful therapeutic modalities that can be configured to be useful in treatment regimes for treatment of cells, tissues, and animals, especially humans.
While antisense oligonucleotides are a some form of antisense compound, the present invention comprehends other oligomeric antisense compounds, including but not limited to oligonucleotide mimetics such as are described below. The antisense compounds in accordance with this invention can comprise from about 8 to about 30 nucleobases (i.e., from about 8 to about 30 linked bases), although both longer and shorter sequences can find use with the present invention. Particularly some antisense compounds are antisense oligonucleotides, e.g., those comprising from about 12 to about 25 nucleobases.
Specific examples of some antisense compounds useful with the present invention include oligonucleotides containing modified backbones or non-natural internucleoside linkages. As defined in this specification, oligonucleotides having modified backbones include those that retain a phosphorus atom in the backbone and those that do not have a phosphorus atom in the backbone. For the purposes of this specification, modified oligonucleotides that do not have a phosphorus atom in their internucleoside backbone can also be considered to be oligonucleosides.
Modified oligonucleotide backbones include, for example, phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters, aminoalkylphosphotriesters, methyl and other alkyl phosphonates including 3′-alkylene phosphonates and chiral phosphonates, phosphinates, phosphoramidates including 3′-amino phosphoramidate and aminoalkylphosphoramidates, thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, and boranophosphates having normal 3′-5′ linkages, 2′-5′ linked analogs of these, and those having inverted polarity wherein the adjacent pairs of nucleoside units are linked 3′-5′ to 5′-3′ or 2′-5′ to 5′-2′. Various salts, mixed salts and free acid forms are also included.
Modified oligonucleotide backbones that do not include a phosphorus atom therein have backbones that are formed by short chain alkyl or cycloalkyl internucleoside linkages, mixed heteroatom and alkyl or cycloalkyl internucleoside linkages, or one or more short chain heteroatomic or heterocyclic internucleoside linkages. These include those having morpholino linkages (formed in part from the sugar portion of a nucleoside); siloxane backbones; sulfide, sulfoxide and sulfone backbones; formacetyl and thioformacetyl backbones; methylene formacetyl and thioformacetyl backbones; alkene containing backbones; sulfamate backbones; methyleneimino and methylenehydrazino backbones; sulfonate and sulfonamide backbones; amide backbones; and others having mixed N, O, S and CH2 component parts.
In other some oligonucleotide mimetics, both the sugar and the internucleoside linkage (i.e., the backbone) of the nucleotide units are replaced with novel groups. The base units are maintained for hybridization with an appropriate nucleic acid target compound. One such oligomeric compound, an oligonucleotide mimetic that has been shown to have excellent hybridization properties, is referred to as a peptide nucleic acid (PNA). In PNA compounds, the sugar-backbone of an oligonucleotide is replaced with an amide containing backbone, in particular an aminoethylglycine backbone. The nucleobases are retained and are bound directly or indirectly to aza nitrogen atoms of the amide portion of the backbone. Representative United States patents that teach the preparation of PNA compounds include, but are not limited to, U.S. Pat. Nos. 5,539,082; 5,714,331; and 5,719,262, each of which is herein incorporated by reference. Further teaching of PNA compounds can be found in Nielsen et al., Science 254:1497 (1991).
Most some embodiments of the invention are oligonucleotides with phosphorothioate backbones and oligonucleosides with heteroatom backbones, and in particular —CH2, —NH—O—CH2—, —CH2—N(CH3)—O—CH2— [known as a methylene (methylimino) or MMI backbone], —CH2—O—N(CH3)—CH2—, —CH2—N(CH3)—N(CH3)—CH2—, and —O—N(CH3)—CH2—CH2— [wherein the native phosphodiester backbone is represented as —O—P—O—CH2—] of the above referenced U.S. Pat. No. 5,489,677, and the amide backbones of the above referenced U.S. Pat. No. 5,602,240. Also some are oligonucleotides having morpholino backbone structures of the above-referenced U.S. Pat. No. 5,034,506.
Modified oligonucleotides can also contain one or more substituted sugar moieties. some oligonucleotides comprise one of the following at the 2′ position: OH; F; O-, S-, or N-alkyl; O-, S-, or N-alkenyl; O-, S- or N-alkynyl; or O-alkyl-O-alkyl, wherein the alkyl, alkenyl and alkynyl can be substituted or unsubstituted C1 to C10 alkyl or C2 to C10 alkenyl and alkynyl. Particularly some are O[(CH2)nO]mCH3, O(CH2)nOCH3, O(CH2)nNH2, O(CH2)nCH3, O(CH2)nONH2, and O(CH2)nON[(CH2)nCH3)]2, where n and m are from 1 to about 10. Other some oligonucleotides comprise one of the following at the 2′ position: C1 to C10 lower alkyl, substituted lower alkyl, alkaryl, aralkyl, O-alkaryl or O-aralkyl, SH, SCH3, OCN, Cl, Br, CN, CF3, OCF3, SOCH3, SO2CH3, ONO2, NO2, N3, NH2, heterocycloalkyl, heterocycloalkaryl, aminoalkylamino, polyalkylamino, substituted silyl, an RNA cleaving group, a reporter group, an intercalator, a group for improving the pharmacokinetic properties of an oligonucleotide, or a group for improving the pharmacodynamic properties of an oligonucleotide, and other substituents having similar properties. A some modification includes 2′-methoxyethoxy (2′-O—CH2CH2OCH3, also known as 2′-O-(2-methoxyethyl) or 2′-MOE) (Martin et al., Helv. Chim. Acta 78:486 [1995]) i.e., an alkoxyalkoxy group. A further some modification includes 2′-dimethylaminooxyethoxy (i.e., a O(CH2)2ON(CH3)2 group), also known as 2′-DMAOE, and 2′-dimethylaminoethoxyethoxy (also known in the art as 2′-O-dimethylaminoethoxyethyl or 2′-DMAEOE), i.e., 2′-O—CH2—O—CH2—N(CH2)2.
Other some modifications include 2′-methoxy(2′-O—CH3), 2′-aminopropoxy(2′-OCH2CH2CH2NH2) and 2′-fluoro (2′-F). Similar modifications can also be made at other positions on the oligonucleotide, particularly the 3′ position of the sugar on the 3′ terminal nucleotide or in 2′-5′ linked oligonucleotides and the 5′ position of 5′ terminal nucleotide. Oligonucleotides can also have sugar mimetics such as cyclobutyl moieties in place of the pentofuranosyl sugar.
Oligonucleotides can also include nucleobase (often referred to in the art simply as “base”) modifications or substitutions. As used herein, “unmodified” or “natural” nucleobases include the purine bases adenine (A) and guanine (G), and the pyrimidine bases thymine (T), cytosine (C) and uracil (U). Modified nucleobases include other synthetic and natural nucleobases such as 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6-methyl and other alkyl derivatives of adenine and guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine, 5-halouracil and cytosine, 5-propynyl uracil and cytosine, 6-azo uracil, cytosine and thymine, 5-uracil (pseudouracil), 4-thiouracil, 8-halo, 8-amino, 8-thiol, 8-thioalkyl, 8-hydroxyl and other 8-substituted adenines and guanines, 5-halo particularly 5-bromo, 5-trifluoromethyl and other 5-substituted uracils and cytosines, 7-methylguanine and 7-methyladenine, 8-azaguanine and 8-azaadenine, 7-deazaguanine and 7-deazaadenine and 3-deazaguanine and 3-deazaadenine. Further nucleobases include those disclosed in U.S. Pat. No. 3,687,808. Certain of these nucleobases are particularly useful for increasing the binding affinity of the oligomeric compounds of the invention. These include 5-substituted pyrimidines, 6-azapyrimidines and N-2, N-6 and O-6 substituted purines, including 2-aminopropyladenine, 5-propynyluracil and 5-propynylcytosine. 5-methylcytosine substitutions have been shown to increase nucleic acid duplex stability by 0.6-1.2. degree ° C. and are presently some base substitutions, even more particularly when combined with 2′-O-methoxyethyl sugar modifications.
Another modification of the oligonucleotides of the present invention involves chemically linking to the oligonucleotide one or more moieties or conjugates that enhance the activity, cellular distribution or cellular uptake of the oligonucleotide. Such moieties include but are not limited to lipid moieties such as a cholesterol moiety, cholic acid, a thioether, (e.g., hexyl-5-tritylthiol), a thiocholesterol, an aliphatic chain, (e.g., dodecandiol or undecyl residues), a phospholipid, (e.g., di-hexadecyl-rac-glycerol or triethylammonium 1,2-di-O-hexadecyl-rac-glycero-3-H-phosphonate), a polyamine or a polyethylene glycol chain or adamantane acetic acid, a palmityl moiety, or an octadecylamine or hexylamino-carbonyl-oxycholesterol moiety.
One skilled in the relevant art knows well how to generate oligonucleotides containing the above-described modifications. The present invention is not limited to the antisense oligonucleotides described above. Any suitable modification or substitution can be utilized.
It is not necessary for all positions in a given compound to be uniformly modified, and in fact more than one of the aforementioned modifications can be incorporated in a single compound or even at a single nucleoside within an oligonucleotide. The present invention also includes antisense compounds that are chimeric compounds. “Chimeric” antisense compounds or “chimeras,” in the context of the present invention, are antisense compounds, particularly oligonucleotides, which contain two or more chemically distinct regions, each made up of at least one monomer unit, i.e., a nucleotide in the case of an oligonucleotide compound. These oligonucleotides typically contain at least one region wherein the oligonucleotide is modified so as to confer upon the oligonucleotide increased resistance to nuclease degradation, increased cellular uptake, and/or increased binding affinity for the target nucleic acid. An additional region of the oligonucleotide can serve as a substrate for enzymes capable of cleaving RNA:DNA or RNA:RNA hybrids. By way of example, RNaseH is a cellular endonuclease that cleaves the RNA strand of an RNA:DNA duplex. Activation of RNaseH, therefore, results in cleavage of the RNA target, thereby greatly enhancing the efficiency of oligonucleotide inhibition of gene expression. Consequently, comparable results can often be obtained with shorter oligonucleotides when chimeric oligonucleotides are used, compared to phosphorothioate deoxyoligonucleotides hybridizing to the same target region. Cleavage of the RNA target can be routinely detected by gel electrophoresis and, if necessary, associated nucleic acid hybridization techniques known in the art.
Chimeric antisense compounds of the present invention can be formed as composite structures of two or more oligonucleotides, modified oligonucleotides, oligonucleosides and/or oligonucleotide mimetics as described above.
The present invention also includes pharmaceutical compositions and formulations that include the antisense compounds of the present invention as described below.
B. Genetic Therapies
The present invention contemplates the use of any genetic manipulation for use in modulating the expression of stem cell cancer markers of the present invention. Examples of genetic manipulation include, but are not limited to, gene knockout (e.g., removing the cancer marker gene from the chromosome using, for example, recombination), expression of antisense constructs with or without inducible promoters, addition of a heterologous gene (e.g. controlled by an inducible promoter), and the like. Delivery of nucleic acid construct to cells in vitro or in vivo can be conducted using any suitable method. A suitable method is one that introduces the nucleic acid construct into the cell such that the desired event occurs (e.g., expression of an antisense construct).
Introduction of molecules carrying genetic information into cells is achieved by any of various methods including, but not limited to, directed injection of naked DNA constructs, bombardment with gold particles loaded with said constructs, and macromolecule mediated gene transfer using, for example, liposomes, biopolymers, and the like. some methods use gene delivery vehicles derived from viruses, including, but not limited to, adenoviruses, retroviruses, vaccinia viruses, and adeno-associated viruses. Because of the higher efficiency as compared to retroviruses, vectors derived from adenoviruses are the some gene delivery vehicles for transferring nucleic acid molecules into host cells in vivo. Adenoviral vectors have been shown to provide very efficient in vivo gene transfer into a variety of solid tumors in animal models and into human solid tumor xenografts in immune-deficient mice. Examples of adenoviral vectors and methods for gene transfer are described in PCT publications WO 00/12738 and WO 00/09675 and U.S. Pat. Nos. 6,033,908, 6,019,978, 6,001,557, 5,994,132, 5,994,128, 5,994,106, 5,981,225, 5,885,808, 5,872,154, 5,830,730, and 5,824,544, each of which is herein incorporated by reference in its entirety.
Vectors can be administered to a subject in a variety of ways. For example, in some embodiments of the present invention, vectors are administered into tumors or tissue associated with tumors using direct injection. In other embodiments, administration is via the blood or lymphatic circulation (See e.g., PCT publication 99/02685 herein incorporated by reference in its entirety). Exemplary dose levels of adenoviral vector can be about 108 to 1011 vector particles added to the perfusate.
C. Antibody Therapy
In some embodiments, the present invention provides antibodies that target tumors that express a stem cell cancer marker of the present invention (e.g., those shown in Tables 4-9). Any suitable antibody (e.g., monoclonal, polyclonal, or synthetic) can be utilized in the therapeutic methods disclosed herein. In some embodiments, the antibodies used for cancer therapy are humanized antibodies. Methods for humanizing antibodies are well known in the art (See e.g., U.S. Pat. Nos. 6,180,370, 5,585,089, 6,054,297, and 5,565,332; each of which is herein incorporated by reference).
In some embodiments, the therapeutic antibodies comprise an antibody generated against a stem cell cancer marker of the present invention, wherein the antibody is conjugated to a cytotoxic agent. In such embodiments, a tumor specific therapeutic agent is generated that does not target normal cells, thus reducing many of the detrimental side effects of traditional chemotherapy. For certain applications, it is envisioned that the therapeutic agents will be pharmacologic agents that will serve as useful agents for attachment to antibodies, particularly cytotoxic or otherwise anticellular agents having the ability to kill or suppress the growth or cell division of endothelial cells. The present invention contemplates the use of any pharmacologic agent that can be conjugated to an antibody, and delivered in active form. Exemplary anticellular agents include chemotherapeutic agents, radioisotopes, and cytotoxins. The therapeutic antibodies of the present invention can include a variety of cytotoxic moieties, including but not limited to, radioactive isotopes (e.g., iodine-131, iodine-123, technicium-99m, indium-111, rhenium-188, rhenium-186, gallium-67, copper-67, yttrium-90, iodine-125 or astatine-211), hormones such as a steroid, antimetabolites such as cytosines (e.g., arabinoside, fluorouracil, methotrexate or aminopterin; an anthracycline; mitomycin C), vinca alkaloids (e.g., demecolcine; etoposide; mithramycin), and antitumor alkylating agent such as chlorambucil or melphalan. Other embodiments can include agents such as a coagulant, a cytokine, growth factor, bacterial endotoxin or the lipid A moiety of bacterial endotoxin. For example, in some embodiments, therapeutic agents will include plant-, fungus- or bacteria-derived toxin, such as an A chain toxins, a ribosome inactivating protein, α-sarcin, aspergillin, restrictocin, a ribonuclease, diphtheria toxin or pseudomonas exotoxin, to mention just a few examples. In some embodiments, deglycosylated ricin A chain is utilized.
In any event, it is proposed that agents such as these may, if desired, be successfully conjugated to an antibody, in a manner that will allow their targeting, internalization, release or presentation to blood components at the site of the targeted tumor cells as required using known conjugation technology (See, e.g., Ghose et al., Methods Enzymol., 93:280 [1983]).
For example, in some embodiments the present invention provides immunotoxins targeted a stem cell cancer marker of the present invention. Immunotoxins are conjugates of a specific targeting agent typically a tumor-directed antibody or fragment, with a cytotoxic agent, such as a toxin moiety. The targeting agent directs the toxin to, and thereby selectively kills, cells carrying the targeted antigen. In some embodiments, therapeutic antibodies employ crosslinkers that provide high in vivo stability (Thorpe et al., Cancer Res., 48:6396 [1988]).
In other embodiments, particularly those involving treatment of solid tumors, antibodies are designed to have a cytotoxic or otherwise anticellular effect against the tumor vasculature, by suppressing the growth or cell division of the vascular endothelial cells. This attack is intended to lead to a tumor-localized vascular collapse, depriving the tumor cells, particularly those tumor cells distal of the vasculature, of oxygen and nutrients, ultimately leading to cell death and tumor necrosis.
In some embodiments, antibody based therapeutics are formulated as pharmaceutical compositions as described below. In some embodiments, administration of an antibody composition of the present invention results in a measurable decrease in cancer (e.g., decrease or elimination of tumor).
D. RNAi Therapies
In other embodiments, RNAi is used to regulate expression of the stem cell cancer markers of the present invention (e.g. those shown in Tables 4-9). RNAi represents an evolutionary conserved cellular defense for controlling the expression of foreign genes in most eukaryotes, including humans. RNAi is triggered by double-stranded RNA (dsRNA) and causes sequence-specific mRNA degradation of single-stranded target RNAs homologous in response to dsRNA. The mediators of mRNA degradation are small interfering RNA duplexes (siRNAs), which are normally produced from long dsRNA by enzymatic cleavage in the cell. siRNAs are generally approximately twenty-one nucleotides in length (e.g. 21-23 nucleotides in length), and have a base-paired structure characterized by two nucleotide 3′-overhangs. Following the introduction of a small RNA, or RNAi, into the cell, it is believed the sequence is delivered to an enzyme complex called RISC(RNA-induced silencing complex). RISC recognizes the target and cleaves it with an endonuclease. It is noted that if larger RNA sequences are delivered to a cell, RNase III enzyme (Dicer) converts longer dsRNA into 21-23 nt ds siRNA fragments.
Chemically synthesized siRNAs have become powerful reagents for genome-wide analysis of mammalian gene function in cultured somatic cells. Beyond their value for validation of gene function, siRNAs also hold great potential as gene-specific therapeutic agents (Tuschl and Borkhardt, Molecular Intervent. 2002; 2(3): 158-67, herein incorporated by reference).
The transfection of siRNAs into animal cells results in the potent, long-lasting post-transcriptional silencing of specific genes (Caplen et al, Proc Natl Acad Sci U.S.A. 2001; 98: 9742-7; Elbashir et al., Nature. 2001; 411:494-8; Elbashir et al., Genes Dev. 2001; 15: 188-200; and Elbashir et al., EMBO J. 2001; 20: 6877-88, all of which are herein incorporated by reference). Methods and compositions for performing RNAi with siRNAs are described, for example, in U.S. Pat. No. 6,506,559, herein incorporated by reference.
siRNAs are extraordinarily effective at lowering the amounts of targeted RNA, and by extension proteins, frequently to undetectable levels. The silencing effect can last several months, and is extraordinarily specific, because one nucleotide mismatch between the target RNA and the central region of the siRNA is frequently sufficient to prevent silencing Brummelkamp et al, Science 2002; 296:550-3; and Holen et al, Nucleic Acids Res. 2002; 30:1757-66, both of which are herein incorporated by reference.
E. Pharmaceutical Compositions
The present invention further provides pharmaceutical compositions (e.g., comprising a small molecule, antisense, antibody, or siRNA that targets the stem cell cancer markers of the present invention). The pharmaceutical compositions of the present invention can be administered in a number of ways depending upon whether local or systemic treatment is desired and upon the area to be treated. Administration can be topical (including ophthalmic and to mucous membranes including vaginal and rectal delivery), pulmonary (e.g., by inhalation or insufflation of powders or aerosols, including by nebulizer; intratracheal, intranasal, epidermal and transdermal), oral or parenteral. Parenteral administration includes intravenous, intraarterial, subcutaneous, intraperitoneal or intramuscular injection or infusion; or intracranial, e.g., intrathecal or intraventricular, administration.
Pharmaceutical compositions and formulations for topical administration can include transdermal patches, ointments, lotions, creams, gels, drops, suppositories, sprays, liquids and powders. Conventional pharmaceutical carriers, aqueous, powder or oily bases, thickeners and the like can be necessary or desirable.
Compositions and formulations for oral administration include powders or granules, suspensions or solutions in water or non-aqueous media, capsules, sachets or tablets. Thickeners, flavoring agents, diluents, emulsifiers, dispersing aids or binders can be desirable.
Compositions and formulations for parenteral, intrathecal or intraventricular administration can include sterile aqueous solutions that can also contain buffers, diluents and other suitable additives such as, but not limited to, penetration enhancers, carrier compounds and other pharmaceutically acceptable carriers or excipients.
Pharmaceutical compositions of the present invention include, but are not limited to, solutions, emulsions, and liposome-containing formulations. These compositions can be generated from a variety of components that include, but are not limited to, preformed liquids, self-emulsifying solids and self-emulsifying semisolids.
The pharmaceutical formulations of the present invention, which can conveniently be presented in unit dosage form, can be prepared according to conventional techniques well known in the pharmaceutical industry. Such techniques include bringing into association the active ingredients with the pharmaceutical carrier(s) or excipient(s). In general the formulations are prepared by uniformly and intimately bringing into association the active ingredients with liquid carriers or finely divided solid carriers or both, and then, if necessary, shaping the product.
The compositions of the present invention can be formulated into any of many possible dosage forms such as, but not limited to, tablets, capsules, liquid syrups, soft gels, suppositories, and enemas. The compositions of the present invention can also be formulated as suspensions in aqueous, non-aqueous or mixed media. Aqueous suspensions can further contain substances that increase the viscosity of the suspension including, for example, sodium carboxymethylcellulose, sorbitol and/or dextran. The suspension can also contain stabilizers.
In some embodiments of the present invention the pharmaceutical compositions can be formulated and used as foams. Pharmaceutical foams include formulations such as, but not limited to, emulsions, microemulsions, creams, jellies and liposomes. While basically similar in nature these formulations vary in the components and the consistency of the final product.
Agents that enhance uptake of oligonucleotides at the cellular level can also be added to the pharmaceutical and other compositions of the present invention. For example, cationic lipids, such as lipofectin (U.S. Pat. No. 5,705,188), cationic glycerol derivatives, and polycationic molecules, such as polylysine (WO 97/30731), also enhance the cellular uptake of oligonucleotides.
The compositions of the present invention can additionally contain other adjunct components conventionally found in pharmaceutical compositions. Thus, for example, the compositions can contain additional, compatible, pharmaceutically-active materials such as, for example, antipruritics, astringents, local anesthetics or anti-inflammatory agents, or can contain additional materials useful in physically formulating various dosage forms of the compositions of the present invention, such as dyes, flavoring agents, preservatives, antioxidants, opacifiers, thickening agents and stabilizers. However, such materials, when added, should not unduly interfere with the biological activities of the components of the compositions of the present invention. The formulations can be sterilized and, if desired, mixed with auxiliary agents, e.g., lubricants, preservatives, stabilizers, wetting agents, emulsifiers, salts for influencing osmotic pressure, buffers, colorings, flavorings and/or aromatic substances and the like which do not deleteriously interact with the nucleic acid(s) of the formulation.
Certain embodiments of the invention provide pharmaceutical compositions containing (a) one or more compounds that modulate the activity of a stem cell caner marker (e.g. antibody, small molecule, siRNA, anti-sense, etc.) and (b) one or more other chemotherapeutic agents. Examples of such chemotherapeutic agents include, but are not limited to, anticancer drugs such as daunorubicin, dactinomycin, doxorubicin, bleomycin, mitomycin, nitrogen mustard, chlorambucil, melphalan, cyclophosphamide, 6-mercaptopurine, 6-thioguanine, cytarabine (CA), 5-fluorouracil (5-FU), floxuridine (5-FUdR), methotrexate (MTX), colchicine, vincristine, vinblastine, etoposide, teniposide, cisplatin and diethylstilbestrol (DES). Anti-inflammatory drugs, including but not limited to nonsteroidal anti-inflammatory drugs and corticosteroids, and antiviral drugs, including but not limited to ribivirin, vidarabine, acyclovir and ganciclovir, can also be combined in compositions of the invention. Other chemotherapeutic agents are also within the scope of this invention. Two or more combined compounds can be used together or sequentially.
Dosing is dependent on severity and responsiveness of the disease state to be treated, with the course of treatment lasting from several days to several months, or until a cure is effected or a diminution of the disease state is achieved (e.g. reduction in tumor size). Optimal dosing schedules can be calculated from measurements of drug accumulation in the body of the patient. The administering physician can easily determine optimum dosages, dosing methodologies and repetition rates. Optimum dosages can vary depending on the relative potency of individual oligonucleotides, and can generally be estimated based on EC50s found to be effective in in vitro and in vivo animal models or based on the examples described herein. In general, dosage is from 0.01 μg to 100 g per kg of body weight, and can be given once or more daily, weekly, monthly or yearly. The treating physician can estimate repetition rates for dosing based on measured residence times and concentrations of the drug in bodily fluids or tissues. Following successful treatment, it can be desirable to have the subject undergo maintenance therapy to prevent the recurrence of the disease state, wherein the oligonucleotide is administered in maintenance doses, ranging from 0.01 μg to 100 g per kg of body weight, once or more daily, to once every 20 years.
VIII. Transgenic Animals Expressing Cancer Marker Genes
The present invention contemplates the generation of transgenic animals comprising an exogenous cancer marker gene of the present invention or mutants and variants thereof (e.g., truncations or single nucleotide polymorphisms) or knock-outs thereof. In some embodiments, the transgenic animal displays an altered phenotype (e.g., increased or decreased presence of markers) as compared to wild-type animals. Methods for analyzing the presence or absence of such phenotypes include but are not limited to, those disclosed herein. In some some embodiments, the transgenic animals further display an increased or decreased growth of tumors or evidence of cancer.
The transgenic animals of the present invention find use in drug (e.g., cancer therapy) screens. In some embodiments, test compounds (e.g., a drug that is suspected of being useful to treat cancer) and control compounds (e.g., a placebo) are administered to the transgenic animals and the control animals and the effects evaluated.
The transgenic animals can be generated via a variety of methods. In some embodiments, embryonal cells at various developmental stages are used to introduce transgenes for the production of transgenic animals. Different methods are used depending on the stage of development of the embryonal cell. The zygote is the best target for micro-injection. In the mouse, the male pronucleus reaches the size of approximately 20 micrometers in diameter that allows reproducible injection of 1-2 picoliters (pl) of DNA solution. The use of zygotes as a target for gene transfer has a major advantage in that in most cases the injected DNA will be incorporated into the host genome before the first cleavage (Brinster et al., 1985, PNAS 82:4438-4442). As a consequence, all cells of the transgenic non-human animal will carry the incorporated transgene. This will in general also be reflected in the efficient transmission of the transgene to offspring of the founder since 50% of the germ cells will harbor the transgene. U.S. Pat. No. 4,873,191 describes a method for the micro-injection of zygotes; the disclosure of this patent is incorporated herein in its entirety.
In other embodiments, retroviral infection is used to introduce transgenes into a non-human animal. In some embodiments, the retroviral vector is utilized to transfect oocytes by injecting the retroviral vector into the perivitelline space of the oocyte (U.S. Pat. No. 6,080,912, incorporated herein by reference). In other embodiments, the developing non-human embryo can be cultured in vitro to the blastocyst stage. During this time, the blastomeres can be targets for retroviral infection (Janenich, 1976, PNAS 73:1260). Efficient infection of the blastomeres is obtained by enzymatic treatment to remove the zona pellucida (Hogan et al., in Manipulating the Mouse Embryo, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. [1986]). The viral vector system used to introduce the transgene is typically a replication-defective retrovirus carrying the transgene (Jahner et al., 1985, PNAS 82:6927). Transfection is easily and efficiently obtained by culturing the blastomeres on a monolayer of virus-producing cells (Stewart, et al., 1987, EMBO J., 6:383).
Alternatively, infection can be performed at a later stage. Virus or virus-producing cells can be injected into the blastocoele (Jahner et al., 1982, Nature 298:623). Most of the founders will be mosaic for the transgene since incorporation occurs only in a subset of cells that form the transgenic animal. Further, the founder can contain various retroviral insertions of the transgene at different positions in the genome that generally will segregate in the offspring. In addition, it is also possible to introduce transgenes into the germline, albeit with low efficiency, by intrauterine retroviral infection of the midgestation embryo (Jahner et al., supra [1982]). Additional means of using retroviruses or retroviral vectors to create transgenic animals known to the art involve the micro-injection of retroviral particles or mitomycin C-treated cells producing retrovirus into the perivitelline space of fertilized eggs or early embryos (PCT International Application WO 90/08832 [1990], and Haskell and Bowen, 1995, Mol. Reprod. Dev., 40:386).
In other embodiments, the transgene is introduced into embryonic stem cells and the transfected stem cells are utilized to form an embryo. ES cells are obtained by culturing pre-implantation embryos in vitro under appropriate conditions (Evans et al., 1981, Nature 292:154; Bradley et al., 1984, Nature 309:255; Gossler et al., 1986, PNAS 83:9065; and Robertson et al., 1986, Nature 322:445). Transgenes can be efficiently introduced into the ES cells by DNA transfection by a variety of methods known to the art including calcium phosphate co-precipitation, protoplast or spheroplast fusion, lipofection and DEAE-dextran-mediated transfection. Transgenes can also be introduced into ES cells by retrovirus-mediated transduction or by micro-injection. Such transfected ES cells can thereafter colonize an embryo following their introduction into the blastocoel of a blastocyst-stage embryo and contribute to the germ line of the resulting chimeric animal (for review, See, Jaenisch, Science, 1988, 240:1468). Prior to the introduction of transfected ES cells into the blastocoel, the transfected ES cells can be subjected to various selection protocols to enrich for ES cells which have integrated the transgene assuming that the transgene provides a means for such selection. Alternatively, the polymerase chain reaction can be used to screen for ES cells that have integrated the transgene. This technique obviates the need for growth of the transfected ES cells under appropriate selective conditions prior to transfer into the blastocoel.
In still other embodiments, homologous recombination is utilized to knock-out gene function or create deletion mutants (e.g., truncation mutants). Methods for homologous recombination are described in U.S. Pat. No. 5,614,396, incorporated herein by reference.
The following examples are provided in order to demonstrate and further illustrate certain some embodiments and aspects of the present invention and are not to be construed as limiting the scope thereof.
In the experimental disclosure which follows, the following abbreviations apply: N (normal); M (molar); mM (millimolar); μM (micromolar); mol (moles); mmol (millimoles); μmol (micromoles); nmol (nanomoles); pmol (picomoles); g (grams); mg (milligrams); μg (micrograms); ng (nanograms); l or L (liters); ml (milliliters); μl (microliters); cm (centimeters); mm (millimeters); μm (micrometers); nm (nanometers); and ° C. (degrees Centigrade).
This example describes the generation of tumors in mice using human solid tumor cells from humans and the analysis of these tumors.
Materials and Methods
Mouse preparation. 8-week old female NOD-SCID mice were anesthetized by an intra-peritoneal injection of 0.2 ml Ketamine/Xylazine (300 mg Ketamine combined with 20 mg Xylazine in a 4 ml volume. 0.02 ml of the solution was used per 20 g mouse). Dilution to 200 μl was done using HBSS. Mice were then treated with VP-16 (etoposide) via an intra-peritoneal injection (30 mg etoposide dose per 1 kg mouse, diluted in serum-free HBSS for a final injection volume of 200 μl). At the same time, estrogen pellets were placed subcutaneously on the back of the mouse's neck using a trocar. All tumor injections/implants were done 5 days after this procedure. In the following procedures, mice were anesthetized as described above.
Primary tumor specimen implantations. For the implantation of fresh specimens, samples of human breast tumors were received within an hour after surgery. The tumors were cut up with scissors into small pieces, and the pieces were then minced with a blade to yield 2×2 mm-size pieces. Mincing was done in sterile RPMI 1640 medium supplemented with 20% Fetal Bovine Serum (FBS) under sterile conditions on ice. The tumor pieces were washed with serum-free HBSS before implantation. A 2-mm incision was then made in the mid abdomen area, and using a trocar, one to two small tumor pieces were implanted in the region of the upper right and upper left mammary fat pats (right below the second nipple on both sides). A 6-0 suture was wrapped twice around the MFP-Nipple allowing it to hold the implanted pieces in place. Sutures were removed after 5 days. Nexaban was used to seal the incision and mice were monitored weekly for tumor growth.
Pleural effusions injections. For the injection of the pleural effusions, cells were received shortly after thorocentesis and washed with serum-free HBSS. Cells were then suspended in serum free-RPMI/Matrigel mixture (1:1 volume) and then injected into the upper right and left mammary pads using an 18 G needed. 0.2 ml containing 1-2 million cells were typically injected. The site of the needle injection was sealed with Nexaban to prevent any cell leakage.
Preparation of single cell suspensions of tumor cells. Prior to digestion with collagenase, Xenograft tumors or primary human tumors were cut up into small pieces and then minced completely using sterile blades. To obtain single cell suspensions, either pleural effusion cells or the resulting tumor pieces were then mixed with ultra-pure Collagenase III in HBSS solution (200-250 U Collagenase per ml) and allowed to incubate at 37° C. for 3-4 hours. Pipetting with a 10 ml pipette was done every 15-20 minutes. At the end of the incubation, cells were filtered through a 45 μl nylon mesh and washed with RPMI-20% FBS, then washed twice with HBSS. Cells to be injected were then suspended in HBSS/Matrigel mix (1:1 volume) and injected into the area of the mammary fat pad as described above. Nexaban was used to seal the injection site.
Cell staining for flow-cytometry. Cells were counted and then transferred to a 5 ml tube, washed twice with HBSS with 2% Heat-inactivated calf serum (HICS) (5 min @ 1000 rpm), then re-suspended in 100 μl (per 106 cells) of HBSS with 2% HICS. 5 ml of Sandoglobin solution (1 mg/ml) was then added and incubated on ice for 10 minutes, after which the sample was washed twice with HBSS 2% HICS and re-suspended in 100 ml (per 106 cells) of HBSS 2% HICS. Antibodies (using appropriate dilution per antibody) were then added and incubated for 20 minutes on ice, and then washed twice with HBSS 2% HICS. When needed, a secondary antibody addition was conducted by re-suspending in 100 ul (per 106 cells) of HBSS 2% HICS, and then adding 1-4 ml of secondary antibody (depending on the secondary antibody and its concentration), followed by a 20 minute incubation. When a streptavidin step was used, cells were re-suspended in 100 ul (per 106 cells) of HBSS 2% HICS and then 1 ul of strepavidin conjugated with the indicated fluorescent dye was added, followed by a 20 minute incubation. The cells were washed twice with HBSS 2% heat-inactivated fetal calf serum (HICS) and re-suspended in 0.5 ml (per million cells) of HBSS 2% HICS that contained 7AAD (1 mg/ml final concentration).
Flow-cytometry. The antibodies used were anti-CD44 (APC, PE or Biotin), anti-CD24 (PE or FITC), anti-B38.1 (APC), anti-ESA-FITC (Biomeda, Calif.), anti-H2Kd, (Santa Cruz Products, Santa Cruz, Calif.). Lineage marker antibodies were anti-CD2, -CD3-CD10, -CD16, -CD18, -CD31, -CD64 and -CD140b. Unless noted, antibodies were purchased from Pharmingen (San Diego, Calif.). Antibodies were directly conjugated to various fluorochromes depending on the tests. In all tests, mouse cells and/or Lineage+ cells were eliminated by discarding H2Kd+ (class I MHC) cells or Lineage+ cells during flow-cytometry. Dead cells were eliminated using the viability dye 7-AAD. Flow-cytometry was performed on a FACSVantage (Becton Dickinson, San Jose, Calif.). Side scatter and forward scatter profiles were used to eliminate cell doublets. Cells were routinely sorted twice and the cells were re-analyzed for purity, which typically was greater than 95%.
In solid tumors, it has been demonstrated that only a small proportion of the tumor cells are able to form colonies in an in vitro clonogenic assay 21-24,101-103 (Southam & Brunschwig, 1961, Cancer 14:971-8; Wodinsky et al., 1967, Cancer Chemother. Rep. 51:415-21; Bergsagel & Valeriote, 1968, Cancer Res 28:2187-96; Fialkow, 1976, Birth Defects Orig. Artic Ser. 12:123-32; Hamburger & Salmon, 1977, Science 197:461-3; Heppner, 1984, Cancer Res. 44:2259-65; Weisenthal & Lippman, 1985, Cancer Treat. Report 69:615-48). Furthermore, large numbers of cells must typically be transplanted to form tumors in xenograft models. One possible explanation for these observations is that every cell within a tumor has the ability to proliferate and form new tumors but that the probability of an individual cell completing the necessary steps in these assays is small. An alternative explanation is that only a rare, phenotypically distinct subset of cells has the capacity to significantly proliferate and form new tumors, but that cells within this subset do so very efficiently (Reya et al., 2001, Nature 414:105-11). To distinguish between these possibilities it is necessary to identify the clonogenic cells in these tumors with markers that distinguish these cells from other non-tumorigenic cells. This has been accomplished in acute myelogenous leukemia (AML), where it was demonstrated that a specific subpopulation of leukemia cells (that expressed markers similar to normal hematopoietic stem cells) was consistently enriched for clonogenic activity in NOD/SCID immunocompromised mice while other cancer cells were depleted of clonogenic activity (Lapidot et al., 1994, Nature 17:645-8; Larochelle et al., 1996, Nat. Med. 2:1329-37; Bonnet & Dick, 1997, Nat. Med. 3:730-7). Such tests have not been reported in solid cancers.
To investigate the mechanisms of solid tumor heterogeneity, a mouse model was developed that was a modification of the NOD/SCID immunodeficient mouse model in which human breast cancers were efficiently propagated in the mouse mammary fat pad (Sakakibara et al., 1996, Cancer J. Si. Am. 2:291-300). In the present application, it was shown that solid tumors contain a distinct population of cells with the exclusive ability to form tumors in mice. These cells are referred to as tumorigenic cells or cancer initiating cells since they consistently formed tumors while other cancer cell populations were depleted of cells capable of tumor formation. Cell surface markers were identified which can distinguish between these cell populations. These findings provide a new model of breast tumor biology in which a defined subset of cells drives tumorigenesis, as well as generating tumor cell heterogeneity. The prospective identification of this tumorigenic population of cancer cells allows for the identification of molecules expressed in these cells that can then serve as targets to eliminate this critical population of cancer cells.
Tumor specimens and engraftment rate. Human breast cancer specimens obtained from primary or metastatic sites in 9 different patients (designated tumors 1-9; T1-T9) all engrafted in the NOD/SCID mice. (Table 1). In one case, the cancer cells were obtained from a primary breast tumor (T2) while in other cases the cells were obtained from metastatic pleural effusions (T1, T3-T9). Some tests were conducted on cells after they had been passaged once or twice in mice (designated Passage 1 & 2) while other tests were conducted on unpassaged fresh or frozen tumor samples obtained directly from patients. When using human cancer cells from tumors passaged in mice, contaminating mouse cells were removed by eliminating H2K+ cells [mouse histocompatability class I (MHC)].
Table 1 presented the results of engraftment of human breast cancers into NOD/SCID mice. Mice were injected with unsorted T1 and T3 cells, and a 2 mm piece of T2. Cells from T4-T9 were isolated by flow cytometry as described in
Identification of tumorigenicity markers. Breast cancer cells were heterogeneous with respect to expression of a variety of cell surface-markers including CD44, CD24, and B38.1. CD24 and CD44 are adhesion molecules, while B38.1 has been described as a breast/ovarian cancer-specific marker (Kufe et al., 1983, Cancer Res. 43:851-7; Uchida et al., 2000, PNAS 97:14720-5; Ahrens et al., 2001, Oncogene 20). To determine whether these markers could distinguish tumorigenic from non-tumorigenic cells, flow-cytometry was used to isolate cells that were positive or negative for each marker from first passage T1 or T2 cells. When 2×105-8×105 cells of each population were injected, all injections of CD44+ cells (8/8); B38.1+ cells (8/8), or CD24−/low cells (12/12) gave rise to visible tumors within 12 weeks of injection, but none of the CD44− cell (0/8), or B38.1− cell (0/8) injections formed detectable tumors (Table 2). Although no tumors could be detected by palpation in the locations injected with CD24+ cells, 2 of 12 mice injected with CD24+ cells did contain small growths at the injection site that were detected upon necropsy. These growths most likely arose from the 1-3% of CD24− cells that invariably contaminate the sorted CD24+ cells, or alternatively from CD24+ cells with reduced proliferative capacity (Table 2). Because the CD44+ cells were exclusively B38.1+, we focused on the CD44 and CD24 markers in subsequent tests.
Several antigens associated with normal cell types (Lineage markers; CD2, CD3, CD10, CD16, CD18, CD31, CD64, and CD140b) were found not to be expressed by the cancer cells based on analyses of tumors that had been passaged multiple times in mice. By eliminating Lineage+ cells from unpassaged or early passage tumor cells, normal human leukocytes, endothelial cells, mesothelial cells and fibroblasts were eliminated. By microscopic examination, the Lineage− tumor cells had the appearance of neoplastic cells (
Table 2 shows the results of cells isolated by flow cytometry as described in
Depending on the tumor, 11% to 35% of the Lineage− cancer cells in tumors or pleural effusions were CD44+CD24−/low (
As shown in Table 3, tumorigenic breast cancer cells were highly enriched in the ESA+CD44+CD24−/low population. Cells were isolated from first passage (designated Mouse Passage 1) Tumor 1, Tumor 2 and Tumor 3, second passage Tumor 3 (designated mouse Passage 2), unpassaged cells obtained from 6 different patients, T1, T4, T5, T6, T8 and T9, (designated Patients' tumor cells). CD44+CD24+Lineage− populations and CD44+CD24−/lowLineage− cells were isolated by flow-cytometry as described in
In three of the tumors, further enrichment of tumorigenic activity was possible by isolating the ESA+ subset of the CD44+CD24−/low population. ESA (Epithelial Specific Antigen, Ep-CAM) has been used in the past to distinguish epithelial cancer cells from benign reactive mesothelial cells (Packeisen et al., 1999, Hybridoma 18:37-40). When ESA+CD44+CD24−/lowLineage− cells were isolated from passaged T1, as few as 200 cells consistently formed tumors of approximately 1 cm between 5-6 months after injection whereas 2000 ESA−CD44+CD24−/lowLineage− cells or 20,000 CD44+CD24− cells always failed to form tumors (Table 3). Ten thousand unsorted cells formed tumors in only 3 of 12 mice. This suggests that the ESA+CD44+CD24−/lowLineage− population was more than 50 fold enriched for the ability to form tumors relative to unfractionated tumor cells (Table 3). The ESA+CD44+CD24−/lowLineage− population accounted for 2-4% of first passage T1 cells (2.5-5% of cancer cells). The ESA+CD44+CD24−/lowLineage− population (0.6% of cancer cells) from unpassaged T5 cells was also enriched for tumorigenic activity compared to ESA−CD44+CD24−/lowLineage− cells, but both the ESA+ and ESA− fractions had some tumorigenic activity (Table 3). Among unpassaged T5 cells, as few as 1000 ESA+CD44+CD24−/lowLineage− cells consistently formed tumors.
In order to determine whether the difference in tumorigenicity of the cell populations was due to differences in cell cycle, populations were analyzed by flow-cytometry. Comparison of the cell cycle status of tumorigenic and non-tumorigenic cancer cells from T1 revealed that both exhibited a similar cell cycle distribution (
Six months after inoculation, the injection sites of 20,000 tumorigenic CD44+CD24−/lowLineage− cells and 20,000 CD44+CD24+Lineage− cells were examined by histology. The CD44+CD24−/lowLineage− injection sites contained tumors approximately 1 cm in diameter while the CD44+CD24+Lineage− injection sites contained no detectable tumors (
The tumorigenic population is capable of generating the phenotypic heterogeneity found in the initial tumor. The ability of small numbers of CD44+CD24−/lowLineage− tumorigenic cells to give rise to new tumors was reminiscent of the organogenic capacity of normal stem cells. Normal stem cells self-renew and give rise to phenotypically diverse cells with reduced proliferative potential. To test whether tumorigenic breast cancer cells also exhibit these properties, tumors arising from 200 ESA+CD44+CD24−/lowLineage− T1 or 1,000 CD44+CD24−/lowLineage− T2 cells were dissociated and analyzed by flow-cytometry. The heterogeneous expression patterns of ESA, CD44 or CD24 in the secondary tumors resembled the phenotypic complexity of the tumors from which they were derived (
Expression of Wnt pathway genes in subpopulations of breast cancer tumor cells. The Frizzled proteins are receptors for the growth/survival factors of the Wnt family. In some normal stem cells, Wnt is known to play a role in proliferation, survival and differentiation. In certain situations, stimulation of Wnt can promote stem cell self-renewal. Upon activation, Wnt induces the stabilization of β-catenin. Flow cytometry using an antibody against β-catenin demonstrates that Tumor 1 cells express this protein (
To confirm the RT-PCR results for the expression of frizzled proteins, an Affymetrix microarray was probed with cDNA made from Tumor 1, Tumor 2 and Tumor 3 cancer stem cells. All three tumors expressed Frizzled 2 & 6. In addition, Tumors 2 & 3 appeared to express frizzled 4.
Isolation of normal cells from a tumor. Efforts were then made to determine whether sufficient normal cells could be isolated from a tumor to do molecular studies with these cells. Normal fibroblast and endothelial cells from a patient's tumor (approximately 3 cm in size) were isolated by flow cytometry. 2% of the tumor cells were CD31+endothelial cells and 8% were CD140b+ fibroblasts (
Infection of breast cancer stem cells with an adenovirus vector. Since the xenograft tumors can only be grown briefly in tissue culture, conventional transfection methods are generally not useful for gene expression studies and only viral vectors have the potential to efficiently transduce the breast cancer stem cells. Therefore, the ability of adenovirus vectors to infect T1 breast cancer stem cells was tested. To do this, groups of 10,000 breast cancer stem cells or control MCF-7 cells were infected with 0, 50, 500, or 5,000 LacZ adenovirus particles.
The following data is a description of work that has been done studying hematopoietic stem cells. It illustrates fundamental stem cell properties, and it also demonstrates how the isolation of stem cells enables one to first characterize these cells and then to do molecular and biochemical studies to functionally characterize them.
Adult stem cell numbers are strictly regulated. The regulation of hematopoietic stem cell (HSC) homeostasis is not well understood. We screened for genetic polymorphisms that were linked to differences between mouse strains in the numbers of long-term reconstituting HSCs or restricted progenitors in the bone marrow. AKR/J mice had significantly higher frequencies and numbers of both HSCs and restricted progenitors in their bone marrow than C57BL/Ka-Thy-1.1 mice. The C57BL/Ka-Thy-1.1 alleles were partially dominant. A locus on chromosome 17, including the H-2 complex, was significantly linked to the frequency of long-term self-renewing HSCs but showed no evidence of linkage to the frequency of restricted progenitors. Conversely, a chromosome 1 locus exhibited suggestive linkage to restricted progenitor frequencies but was not linked to HSC frequency. This demonstrates that there are distinct genetic determinants of the frequencies of HSCs and restricted progenitors in vivo. The AKR/J chromosome 17 locus was not sufficient to increase HSC frequencies when bred onto a C57BL background. This suggests that to affect HSC frequencies, the product(s) of this locus likely depend on interactions with unlinked modifying loci. The present invention demonstrates that stem cell expansion is under tight genetic regulation in an animal.
Genomic analysis of hematopoietic stem cells. Hematopoietic stem cells (HSCs) have self-renewal capacity and multilineage developmental potentials. The molecular mechanisms that control the self-renewal of HSCs are still largely unknown. A systematic approach using bioinformatics and array hybridization techniques to analyze gene expression profiles in HSCs was done. To enrich mRNAs predominantly expressed in uncommitted cell lineages, 54 000 cDNA clones generated from a highly enriched population of HSCs and a mixed population of stem and early multipotent progenitor (MPP) cells were arrayed on nylon membranes (macroarray or high-density array), and subtracted with cDNA probes derived from mature lineage cells including spleen, thymus, and bone marrow. Five thousand cDNA clones with very low hybridization signals were selected for sequencing and further analysis using microarrays on glass slides. Two populations of cells, HSCs and MPP cells, were compared for differential gene expression using microarray analysis. HSCs have the ability to self-renew, while MPP cells have lost the capacity for self-renewal. A large number of genes that were differentially expressed by enriched populations of HSCs and MPP cells were identified. These included transcription factors, signaling molecules, and previously unknown genes.
Bmi-1 is required for HSC self-renewal. The gene expression analysis of HSCs allowed us to identify genes potentially important for self-renewal. After analysis of the gene expression data, we began mechanistic studies to identify important stem cell regulatory genes. A central issue in stem cell biology is to understand the mechanisms that regulate self-renewal of HSCs, which is required for hematopoiesis to persist for the lifetime of the animal. We found that adult and E14.5 fetal mouse and adult human hematopoietic stem cells express the proto-oncogene bmi-1. The number of fetal liver HSCs, as measured by flow cytometry, was normal in loss of function bmi-1 mice, and the bmi-1−/−HSCs were able to migrate normally towards a chemokine gradient. In post-natal bmi-1−/− mice, the number of HSCs, but not early progenitor cells was markedly reduced. Both fetal liver and bone marrow cells obtained from bmi-1−/− mice were able to contribute only transiently to hematopoiesis when transplanted into lethally irradiated recipients. There was no detectable self-renewal of adult hematopoietic stem cells, indicating a cell autonomous defect in bmi-1−/− mice. This study indicates that expression of bmi-1 is essential for the generation of self-renewing adult hematopoietic stem cells. See the manuscript by Park et al., “Bmi-1 is required for maintenance of adult self-renewing hematopoietic stem cells” Nature (2003).
Summary: The xenograft model developed by this laboratory has made possible the analysis of human breast cancer cells at the cellular level. Although cancer cell lines have proven useful for many studies, the cell lines are adapted to the unique conditions imposed by tissue culture and many of their properties clearly differ from the cancer cells in patients' tumors (Porter & Berry, 1964, Br. J. Cancer 17; Brown, 1997, Oncol. Res. 9:213-5). Recently, the size of primary breast cancer tumors prior to resection has markedly decreased. This has made biological and biochemical studies using patient samples difficult. It is contemplated that the xenograft model described in the preliminary results ameliorates this problem. Preliminary results suggest that the xenograft tumors appear to recapitulate the phenotypic and biological diversity seen in the original patients' tumors. Although there can be some differences in the mouse and human tumors due to environmental factors, the NOD/SCID model described here is the best available model of human breast cancer. Results demonstrate that breast cancer cells reliably engraft in this xenograft model and in the early passages reflect the cellular and biological diversity found in the original human tumor. These tests also show that different populations of cancer cells can differ in their ability to form tumors.
This Example provides illustrative screening methods using the systems and methods of the present invention. This example describes, for example, how one could characterize the Wnt/β-catenin pathway in human breast cancer tumors using the xenograft model described above. The Wnt/β-catenin pathway plays a role in the proliferation and self-renewal of normal stem cells. Although a significant percentage of human breast cancers appear to have constitutive activation of this critical pathway, unlike colon cancer, it has not been definitively established what role this pathway plays in the pathology of this disease in humans (Candidus et al., 1996, Cancer Res. 56:49-52; Sorlie et al., 1998, Hum. Mutat. 12:215; Jonsson et al., 2000, Eur. J. Cancer 36:242-8; Schlosshauer et al., 2000, Carcinogenesis 21:1453-6; Lin et al., 2000, PNAS 97:4262-6; Wong et al., 2002, J. Pathol. 196:145-53). The xenograft model described above can be used to characterize the biological consequences of this pathway in human breast cancer tumors. These tests are done using cancer cells directly after removal from patients and early passage xenograft tumors.
The function of the Wnt/frizzled/β-catenin signaling pathway in multiple patients' tumors. Rationale: Almost 90% of colon cancers contain mutations that result in activation of β-catenin. The most common mutations are in the APC gene, which is involved in targeting β-catenin for degradation, or mutations in the β-catenin protein itself (Webster et al., 2000, Genes Chromosomes Cancer 28:443-53; Taiple & Beachy, 2001, Nature 411:349-54). These latter mutations prevent degradation. Although the cancer cells in many breast tumors appear to have constitutively active of β-catenin, in contrast to colon cancer, mutations in the APC gene or β-catenin itself account for only 6-10% of these cases (Candidus et al., 1996, Cancer Res. 56:49-52; Sorlie et al., 1998, Hum. Mutat. 12:215; Jonsson et al., 2000, Eur. J. Cancer 36:242-8; Schlosshauer et al., 2000, Carcinogenesis 21:1453-6; Lin et al., 2000, PNAS 97:4262-6; Wong et al., 2002, J. Pathol. 196:145-53). Examination of the Wnt/β-catenin signaling pathway in breast cancer cells should lead to new insights into the pathogenesis of this disease. There are a large number of Wnt proteins that are thought to differentially bind to different Frizzled receptors (Nusse et al., 1991, Cell 64:231; Nusse, 1992, J. Steroid Biochem. Mol. Biol. 43:9-12; Cadigan & Nusse, 1997, Genes & Dev. 11:3286-305; Nusse, 1999, Trends Genet. 15:1-3; Taiple & Beachy, 2001, Nature 411:349-54). Only a subset of Wnts, and by inference Frizzled receptors, can activate β-catenin. Normally, β-catenin is bound to E-cadherin at the cell membrane. Cytoplasmic β-catenin forms a complex with the APC and Axin proteins and facilitates β-catenin phosphorylation by GSK3β (Sorlie et al., 1998, Hum. Mutat 12:215; Jonsson et al., 2000, Eur. J. Cancer 36:242-8; Webster et al., 2000, Genes Chromosomes Cancer 28:443-53). The phosphorylated β-catenin is then degraded via the ubiquitin degradation pathway. However, upon activation of frizzled receptors by a Wnt, β-catenin is stabilized. The protein then translocates to the nucleus where it forms a complex with the LGLS/BCL9, PYGO and TCF proteins to activate transcription (Korinek et al., 1998, Mol. Cell. Biol. 18:1248-56; Kramps et al., 2002, Cell 109:47-60). We believe that our xenograft model and cellular assays are unique and powerful tools for understanding this critical pathway. We analyze 10 tumors that have constitutive β-catenin signaling and 10 that do not. These studies give new insights into the mechanisms by which the Wnt pathway is activated and the consequences of this activation in human breast cancer.
In mice, ectopic expression of various Wnt proteins results in breast tumor formation, while in humans activated β-catenin in breast cancer cells is associated with expression of cyclin D1 and poor prognosis (Nusse & Varmus, 1982, Cell 31:99-109; Nusse, 1991, Curr. Topics Microbiol. Immunol. 171:43-65; Nusse, 1992, J. Steroid Biochem. Mol. Biol. 43:9-12; Lin et al., 2000, PNAS 97:4262-6). However, it is not known whether continuous β-catenin signaling is necessary for tumorigenic breast cancer cells to form tumors. There are several possible roles that constitutive β-catenin signaling can play in human breast cancer. First, it can be necessary for continued proliferation and/or viability of the tumorigenic cancer cells. Next, it can be necessary for the initiation of the tumor, but subsequent mutations bypass the need for β-catenin signaling. Third, it can make the cancer cells more resistant to chemotherapy due to the activation of downstream targets such as cyclin D1. Fourth, constitutive β-catenin signaling accelerates cancer cell growth, but is not necessary for tumorigenicity. Finally, the role of β-catenin signaling in tumor formation might differ in tumors with and without constitutive activation of β-catenin. For example, the former tumors might require β-catenin signaling whereas the latter tumors might require Wnt signals from other tumor cells or they might be independent of β-catenin because they have constitutive activation of downstream targets such as c-myc and/or cyclin D1. The tests described here are designed to distinguish between these possibilities using a novel xenograft model of human cancer. The data shows that the xenograft model virtually recapitulates a human breast tumor. Thus, this model allows us to study the Wnt pathway in de novo human tumors in as physiological conditions as possible.
Is β-catenin signaling required for tumor formation by cancer cells isolated from multiple patients? The tests here determine whether the β-catenin pathway is obligate for breast cancer cell growth or whether activation is not required for tumor formation but does increase the rate of proliferation of the cancer cells. Although the xenograft tumors appear to closely resemble human tumors, over time selection pressure result in tumors that are adapted to the mouse environment. The cancer cells in such tumors differ in some ways with the cancer cells that made up the original human tumors. We identify cancer cells from five different xenograft tumors and five unpassaged tumors that have activated β-catenin (cytoplasmic and/or nuclear expression by immunohistochemistry) and cancer cells from five xenograft tumors and five unpassaged tumors that do not (membrane-associated expression by immunohistochemistry). We select tumors that are heterogeneous for important prognostic features that include estrogen receptor/progesterone receptor (ER/PgR), primary tumor vs. metastatic tumor, wild type vs. mutant p53, and amplification of Her2/neu.
To identify cells that have constitutive activation of β-catenin, we take advantage of the observation that this results in stabilization of β-catenin and accumulation of the protein in the cytoplasm and nucleus. When not activated, β-catenin is associated with the plasma membrane. We therefore analyze the breast cancer cell population from each of the tumors using immunohistochemistry to determine the sub-cellular localization of β-catenin and using flow cytometry to determine the amount of β-catenin expressed by each population of cells. To do this, we use flow cytometry to isolate the Lineage− cancer cells from multiple tumors. Viably frozen xenograft or patient tumor cells are used for this analysis. The cancer cells then are stained with an anti-β-catenin-FITC antibody for immunohistochemistry and flow cytometry analysis using the antibody manufacturer's protocol (Transduction Laboratories). Cells with activated β-catenin have cytoplasmic/nuclear localization and increased levels of the protein.
To determine the role of β-catenin signaling in tumorigenesis, Lineage− cancer cells isolated from each of the 20 tumors are infected with either an adenovirus vector or a lentivirus vector that contains a dominant-negative (dn) TCF4-IRES-GFP minigene or a control GFP virus (for details of virus construction and use, see Clarke et al., 1995, PNAS 92:11024-8). The adenovirus vector expresses the dnTCF4 transiently for 1-3 weeks, while the lentivirus vector expresses the dnTCF4 permanently. The dnTCF4 adenovirus has already been made using a dnTCF4 minigene (a gift from Eric Fearon). The dnTCF4 forms a complex with β-catenin thereby inhibiting transcriptional transactivation by the activated β-catenin. Note that the dnTCF4 blocks signaling from all members of the TCF family that mediate β-catenin signaling (Eric Fearon, personal communication). Limiting dilution tests are done to determine the ability of the transduced cells to form colonies in vitro and tumors in vivo. The tests here are done using cancer cells isolated from either patient or human tumors by flow-cytometry. By eliminating the lineage cocktail to eliminate the normal cells, colony formation in tissue culture and tumor formation in mice by cancer cells can be measured (The possible contributions of normal stromal cells to the growth of tumorigenic cells are analyzed as described below in aim 2B). To determine the role of β-catenin signaling on cancer cell growth and viability, five sets of 1,000, 5, 000, 20,000, 50,000 and 100,000 Lineage− cancer cells from each of the tumors infected with the dnTCF4 viruses (either the adenovirus or lentivirus vectors) and control viruses are cultured in vitro in medium containing the Notch ligand Delta and the number of colonies that form are determined. The colonies in a control tissue culture plate are stained with cytokeratin to confirm that they arose from neoplastic cells (Ethier et al., 1993, Cancer Res 53:627-35). Two days after infection the cells are examined with a fluorescent microscope to confirm that greater than 90% of the cells were transduced by the virus. Similarly, in vivo limiting dilution tests are done to determine whether the dnTCF4 viruses affect tumor formation by the cancer cells isolated from the different patients. After infection, ten sets of 5, 000, 20,000, 50,000 and 100,000 Lineage− cancer cells are isolated by flow-cytometry and then infected with the dnTCF4 adenovirus or control adenovirus. The infected cells are injected into the breast of NOD/SCID mice. We then determine the number of cancer cells needed to form tumors in each group, the time needed to form tumors in each group, the rate of growth of each group, and the size of the tumors that form in each group. This allows us to determine whether β-catenin is necessary for tumor formation by cancer cells that do or do not have constitutively activated β-catenin.
Subsequent tests depend on the results of the in vivo and in vitro limiting dilution tests. If inhibition of β-catenin transcriptional transactivation blocks tumor formation or slows tumor growth, then we begin to test whether downstream β-catenin targets such as cyclin D1 or c-myc are required for tumorigenicity (Lin et al., 2000, PNAS 97:4262-6; Yu et al., 2001, Nature 411:1017-21; Wong et al., 2002, J. Pathol. 196:145-53). To do this, we infect the Lineage− cancer cells isolated by flow-cytometry and infect the cells with either the control or dnTCF4 adenovirus as well as a control gfp vector, a c-myc-IRES-gfp retrovirus vector, a cyclin D1-IRES-rfp retrovirus vector, or both the myc-IRES-gfp and the cyclin D1-IRES-rfp retrovirus vectors. Infected cells are isolated by flow cytometry, and then ten sets of 5,000, 10,000, 20,000, 50,000 or 100,000 Lineage− cancer cells of each test group are injected into mice. The mice are analyzed weekly for the formation of tumors, and the rate of growth of each test group. This allows us to determine whether enforced expression of either c-myc and/or cyclin rescues the cells from inhibition of β-catenin signaling.
If inhibition of β-catenin does not have any discernable effects on tumor formation, we first confirm that both of the dominant-negative viruses are inhibiting expression of the dnTCF4 minigene. If not, we use another method to inhibit the β-catenin pathway. In addition to RNA-i and antisense approaches (Sazani et al., 2001, Nucl. Acid Res. 29:3965-74; Caplen et al., 2001, PNAS 98:9742-7; Martinez et al., 2002, Cell 110:563-74; Paul et al., 2002, Nat. Biotech. 29:505-8), overexpression of Axin (which targets β-catenin for degradation) can be used to inhibit (3-catenin (Hedgepeth et al., 1999, Mol. Cell. Biol. 19:7147-57; Spink et al., 2000, EMBO 19:2270-9). If β-catenin signaling was inhibited and there was minimal or no effect on tumor formation, then we determine whether there are more subtle changes on the cancer stem cells. Expression of cyclin D1, whose expression is induced by β-catenin, has been associated with resistance to chemotherapy. Therefore, we treat mice with Adriamycin (8 mg/kg) or Taxol (60 mg/kg) five days after the dnTCF4-transduced or control cancer stem cells were injected into mice to determine whether inhibition of β-catenin enhance the efficacy of chemotherapy. The effect on tumor formation and tumor growth rate is determined as described above.
Expected results. Although cancer cells in a significant number of breast tumors have a constitutively active β-catenin signaling pathway, it is not known whether this pathway is essential for malignant transformation. If the Wnt/β-catenin pathway is necessary for the cancer cells to form tumors, then dominant-negative inhibitors block the ability of cancer cells to form tumors. If constitutive β-catenin signaling enhances tumor cell growth after malignant transformation but is not necessary for tumor formation, then the dominant-negative inhibitor slow growth of the tumor cells but not block tumor formation. If oncogenic mutations subsequent to tumor initiation make the cells independent of Wnt signaling, then the dominant-negative inhibitor do not affect tumor formation or growth. Finally, it is possible that constitutive activation of the Wnt pathway contributes to resistance to apoptosis and therefore makes the cells resistant to chemotherapy.
In a model of mouse cancer, a brief inhibition of c-ras or c-myc activity in cancer cells transformed by these genes resulted in a permanent loss of tumorigenicity (Chin et al., 1999, Nature 400:468-72; Jain et al., 2002, Science 297:102-4). If this is also true for β-catenin signaling, then transient inhibition of signaling by the adenovirus inhibit tumor formation. If inhibition of β-catenin signaling inhibits tumorigenicity, but the cells remain viable and restoration of β-catenin signaling enables them to form tumors, then the adenovirus vector slow tumor formation whereas the lentivirus vector inhibit tumor formation. If β-catenin signaling increases the rate of proliferation but is not obligate for tumorigenicity, then both viral vectors delay tumor formation and slow the growth of the tumors. If some tumors rely on β-catenin signaling and others rely on other pathways or have constitutive activation of downstream effectors of β-catenin signaling, then some tumors are affected by the viral vectors while others do not. The tests described above allow us to answer these critical questions using a unique model recapitulates human tumors. These tests for the first time delineate the biological function(s) of β-catenin signaling in de novo human breast cancers.
The lentivirus can be made using other envelopes until one is found that infects the cells efficiently (Hughes et al., 2002, Mol. Ther. 5:16-24; Wang et al., 2002, PNAS 94:10705-10).
Note that with the lentivirus vector, infection efficiency can only be in the range of 30-70%. This would mean that a significant number of tumor cells would remain that could form tumors. However if inhibition of β-catenin signaling inhibits tumor formation, then the resultant tumors would not express gfp. Flow cytometry is used to measure gfp-expressing cells in the tumors infected with the dnTCF4 and control viruses. The tumors arising from the dnTCF4 group have a marked decrease in such cells if β-catenin signaling does play a role in tumor formation.
Does inhibition of β-catenin signaling alter the phenotype of tumorigenic breast cancer cells? One of the informative markers useful for the separation of tumorigenic and non-tumorigenic breast cancer cells is CD44. Interestingly, CD44 is one of the target genes that is transcriptionally upregulated by β-catenin and epithelial stem cells, but not their differentiated progeny, are felt to express this marker (Liu et al., 1997, PNAS 94:10705-10; van de Wetering et al., 2002, Cell 111:241-50). We contemplate that inhibition of β-catenin signaling result in the differentiation of the tumorigenic breast cancer cells and cause them to lose expression of CD44. We further contemplate that the CD44− non-tumorigenic cancer cells do not have active β-catenin. To test this, we use flow-cytometry to isolate ESA+CD44+CD24−/lowLineage− cancer cells from Tumor 1, Tumor 2 and Tumor 3 and infect them with the dnTCF4 adenovirus or a control adenovirus. The cells are cultured in tissue culture medium containing soluble Delta. We have found that this medium allows the tumorigenic cells to grow in tissue culture for 1-3 weeks. The cells are monitored for growth in vitro over a 3-week period. In addition, 1, 3 and 7 days after infection, the dnTCF4 adenovirus or a control adenovirus infected cells are analyzed by flow-cytometry for the expression of ESA, CD44 and CD24.
Next, we determine if there is a difference in β-catenin signaling in the tumorigenic cancer cells, the CD44+cancer cells, or the CD44− cancer cells. To do this, we use flow-cytometry to isolate ESA+CD44+CD24−/lowLineage− tumorigenic cancer cells, CD44+ cancer cells, and CD44− non-tumorigenic cancer cells from tumor 1, tumor 2 and tumor 3. Each population of cells are stained with an anti-β-catenin antibody that has been conjugated with APC. Each population of cells are analyzed by fluorescent microscopy to determine whether the β-catenin is membrane bound (not constitutively active), and by flow-cytometry to determine the amount of the protein in the cells. The level of β-catenin is associated with activity. In addition, we use commercially available antibodies that recognize phosphorylated and unphosphorylated β-catenin. The phosphorylated form is marked for degradation while the unphosphorylated form is active (van Noort et al., 2002, J. Biol. Chem. 277:17901-5; van Noort et al., 2002, Exp. Cell Res. 274-72). These tests allow us to determine whether CD44 expression and β-catenin signaling are linked in patients' cancer cells.
CD44 is one of the best markers that allows one to distinguish tumorigenic cancer cells from non-tumorigenic cancer cells. Since CD44 is transcriptionally activated by β-catenin, then inhibition of β-catenin signaling result in downregulation of CD44.
Does the differential expression of the frizzled proteins affect breast cancer stem cell fate in Tumor 1? Data suggest that in Tumor 1, the tumorigenic stem cells express frizzled 2 and 6, whereas the non-tumorigenic neoplastic cells express Wnt 3, 4, 7A, 7B, 10B, and 11. This suggests a paracrine system in this particular tumor where the non-tumorigenic cells might drive the proliferation of the cancer stem cells. Preliminary data also suggest that in Tumor 1, the tumorigenic stem cells express frizzled 2 and 6, whereas the non-tumorigenic neoplastic cells express frizzled 2 and 7. It is possible that differential expression of frizzled genes plays a role in cancer cell fate decisions. The other possibility is that differential expression of these genes is a function of differentiation or immortality but does not directly regulate cell fate decisions in this tumor. This Example illustrates systems of the present invention that find use in distinguishing between these possibilities.
Tumor 1 tumorigenic cells express frizzled 6 and non-tumorigenic cancer cells express frizzled 7. It is possible that frizzled 6 enhances and frizzled 7 inhibits the proliferation or self-renewal of the cancer cells. To test this possibility, in vitro and in vivo clonogenic assays are done. Tumor 1 tumorigenic and non-tumorigenic cancer cells are infected with a lentivirus vector that expresses either frizzled 6-IRES-GFP or frizzled 7-IRES-GFP. A lentivirus is used rather than an adenovirus since the former virus can infect and stably transduce a high proportion of primary cells, whereas adenovirus transduction is often transient. It is conceivable that expression of frizzled 6 confers the ability to self renew to the cancer cells. If so, infection of stem cells and/or non-tumorigenic cells with a lentivirus vector containing a frizzled 6-IRES-GFP minigene can enhance tumorigenicity of the stem cell or allow the previously non-tumorigenic cells to form tumors. Conversely, enforced expression of frizzled 7 can inhibit tumorigenicity. After infection with either the frizzled or control virus, limiting dilution tests are done to determine whether enforced expression of each gene alters the ability of each population of cancer cells to form tumors.
These tests allow us to determine whether enforced expression of frizzled 6 increases stem cell proliferation and/or self-renewal or expression of frizzled 7 inhibits tumorigenic cancer cell proliferation and/or self-renewal. To test this possibility, we isolate the tumorigenic ESA+CD44+CD24−/lowLineage− cancer cells and the other Lineage−, non-tumorigenic cancer cells are isolated by flow-cytometry from each of the tumors. First, immunohistochemistry are done using the anti-β-catenin antibody to determine whether there is a difference in the amount of active β-catenin in the tumorigenic and non-tumorigenic cells. Next, we determine the amount of phosphorylated (inactivated) and non-phosphorylated (active) β-catenin the tumorigenic and non-tumorigenic cells (van Noort et al., 2002, J. Biol. Chem. 277:17901-5).
Next, in vitro assays are designed to determine the affects of each gene on colony formation by tumorigenic and non-tumorigenic cancer cells in tissue culture. After isolation by flow cytometry, each population of cells are infected with an identical MOI of either the frizzled 6/GFP, frizzled 7/GRP or a control GFP virus. Triplicate cultures of 100, 500, 1,000 and 5,000 cells are placed in tissue culture medium. The total number of GFP+ colonies as well as the total number of colonies and the number of GFP+ colonies are counted on days 3, 7, 14, 21 and 28. At the end of 21 days, we attempt to pass the cells to determine whether expression of the particular frizzled gene affects self-renewal.
The influence of enforced expression of each frizzled gene on the ability of the neoplastic cells to form tumors in the NOD/SCID mice are determined. Normally, 200 Tumor 1 cells are required to form a tumor. Therefore, the frizzled 6, frizzled 7 or a control GFP lentivirus are used to infect 50, 100, 500, 1,000, 5,000, and 10,000 tumorigenic cancer cells or non-tumorigenic cancer cells. The cells are injected into the immunodeficient mice. The number of cells needed to form tumors and the rate of tumor growth are monitored. After the tumors have reached one centimeter in size, they are excised and analyzed by flow cytometry for expression of GFP. By comparing the percentage of cells infected by the GFP virus and frizzled/GFP virus, we are able to estimate the efficiency of infection and the affect of the latter virus on proliferation. These tests are replicated three times.
Predicted Results: In Tumor 1, different populations of cells express different frizzled proteins, and the non-tumorigenic cells appear to preferentially express Wnt proteins. This suggests that certain populations of non-tumorigenic cells promote tumor formation through Wnts. If β-catenin signaling is downregulated in the non-tumorigenic cells and active in the tumorigenic subset, this is detected by the immunohistochemistry analysis of the expression patterns of phosphorylated & unphosphorylated β-catenin in the non-tumorigenic and tumorigenic cancer cells respectively. If there is no affect of the particular frizzled/GFP virus, then a similar percentage of cells would express GFP in each group and there is no difference in the number of cells needed to form a tumor. If the particular frizzled virus decreases or increases tumorigenicity or proliferation, then tumors infected with frizzled/GFP virus would have fewer or more GFP+ cells and/or would require more or fewer cells to form tumors, respectively.
If necessary, a Feline Leukemia Virus lentivirus based vector system is used. This latter vector efficiently transduces non-replicating cells, and results in prolonged expression of transgenes. We can infect cells using a tet-inducible dnTCF-IRES-GFP lentivirus flanked by gene insulators or a control GFP lentivirus. 1-2 days prior to harvesting tumors, the transgene is activated. GFP+ breast cancer stem cells are harvested and transplanted into the breasts of NOD/SCID mice. We continue to induce the expression of the transgene in the mice, and we are able to monitor them for the ability of the cells to form tumors.
The β-catenin signaling pathway differs in the cancer cells isolated from cancers with and without constitutively activated β-catenin. Rationale: Unlike colon cancer, mutations in the β-catenin signaling pathway have been detected in only a minority of breast cancer cells. However, these studies have concentrated only on APC and β-catenin. Using the systems and methods of the present invention, we closely examine the β-catenin pathway in each of the tumors that were analyzed at the biological level in specific aim 1A.
Does the Wnt pathway differ in cancer cells isolated from different tumors? In these experiments, we characterize the Wnt/β-catenin pathway in each of the tumors. To do this, we use RT-PCR to amplify the coding sequence of β-catenin, each of the frizzled proteins, the low-density lipoprotein-related Wnt receptors, APC, TCF family members, Axin, and Bcl-9 expressed by the cancer cells from each of the 10 tumors with constitutive activation of β-catenin. RT-PCR products of the expressed genes are sequenced to determine whether there are mutations in any of the genes. Any possible mutant genes are confirmed by repeated sequencing of an independent RT-PCR sample. If mutations are found, we determine whether the mutations result in the constitutive activation of the Wnt/β-catenin pathway. To do this, the mutated gene-IRES-GFP are cloned into the pCDNA3 eukaryotic expression vector. For example, if we find a mutant frizzled 2, then HEK 293 cells (which do not have activated B-catenin; Gazit et al., 1999, Oncogene 18:5959-66) are transfected with the mutant frizzled 2-IRES-GFP expression vector or a control IRES-GFP vector. Cells are stained with an anti-β-catenin-PE antibody and fluorescent microscopy is done to determine whether the mutant frizzled 6 causes cytoplasmic/nuclear localization of β-catenin, indicating activation of signaling. This assay allows us to determine whether mutation of components of the β-catenin pathway result in aberrant signaling in human breast cancer stem cells.
Expected results. Although constitutively active β-catenin is seen in the cancer cells in a significant number of breast cancer tumors, the mechanism is not known. There are differences in the signaling pathway in different tumor cells that are detected by these studies. If a mutation in a Wnt receptor or β-catenin modifier is present, then the sequencing studies detect this difference. If autocrine stimulation is present, then we see expression of one of the Wnt ligands by the cancer cells.
Does Wnt expression by different populations of tumor cells in some tumors drive breast cancer cell growth? Perhaps more so than any other type of cancer, a breast cancer tumor contains a heterogeneous population of normal cells including mesenchymal (stromal) cells, inflammatory cells, and endothelial cells that interact with malignant cells to modulate tumor growth and invasion. The purpose is to begin to understand the role of the Wnt pathway in such interactions. We contemplate that normal stromal elements including mesenchymal and endothelial cells produce different Wnts that influence tumor cell proliferation and invasion. Just unpassaged tumors are analyzed since the xenograft tumors would be expected to have infiltrating normal mouse stromal cells and analysis of the mouse cells would be too complicated. Purification of these cells by flow-cytometry allow both molecular and biological analysis of these cells without first placing the cells in tissue culture. This is particularly important since the normal cells are known to change expression of genes when cultured in vitro.
The normal stromal cells are thought to play a role in the proliferation of breast cancer cells. It is also likely that the cell-cell interactions between cancer cells contribute to tumor growth. Wnt signaling is one of the major pathways that normal tissue cells use to talk to each other. Therefore, it is important to understand how this pathway is regulated in tumors. Specific Wnt proteins can activate specific frizzled receptors. Some frizzled receptors signal through β-catenin, while others signal through different pathways. To understand how the various populations of tumor cells within a tumor might talk to the tumorigenic breast cancer cells through this pathway, we must first determine which frizzled and Wnt genes are expressed by the normal cells and the cancer cells from multiple patients' tumors. Therefore, we identify the Wnt pathway genes that are expressed by each population of normal cells and the cancer cells isolated from the 5 patients' tumor samples that have constitutive β-catenin signaling and the cancer cells from 5 patients' tumors that do not have constitutive activation of this protein.
Since our evidence suggests that there are differences in the expression of Wnt and frizzled genes in the different populations of cancer cells, it is important to isolate the different phenotype subsets of cells in the cancer to do these tests. This is because the apparently tumorigenic population of cells is a minority population, and the genes that these cells express might otherwise be missed in the analyses. Therefore, flow-cytometry is used to isolate tumorigenic and non-tumorigenic breast cancer cells, as well as normal endothelial cells and fibroblasts from the patients' original tumor. This is done as described in preliminary results and aim 1. RNA is isolated from pools of 35,000 of each population of cells and then linear amplification is done to make sufficient probe for the microarray analysis (Ramalho-Santos et al., 2002, Science 298:597-600; Ivanova et al., 2002, Science 107:3823; Terskikh et al., 2002, Blood 99:488-98; Akashi et al., 2003, Blood 101:383-9). To determine which frizzled and Wnt genes are expressed by the each population of cells found in each tumor, we probe an affymetrix microarray chip (3 chips for each cell type) that includes the Wnt and frizzled genes (the newly released U133 chip has the majority of these genes).
Results are confirmed by quantitative RT-PCR of the different populations of cancer cells isolated from the primary tumors with and without activated β-catenin in the cancer cells. Real time RT-PCR is done to determine the level of expression of each of the frizzled and Wnt genes by different populations of normal and neoplastic tumor cells. To do this, we make PCR primers for detection each of these genes. Each set of primers span at least one exon so RT-PCR can be used to detect expression of the mRNA in different populations of tumor cells. Flow-cytometry is used to isolate the tumorigenic population of cells identified in each of the tumors. Real-time PCR then is used to measure the expression of each of the Wnt pathway-related RNAs by each respective cell population identified in the microarray analysis (reviewed in Bustin, 2000, J. Mol. Endocrinol. 25:169-93). To do the real-time PCR gene expression analysis, mRNA is purified from 3×104 cells (isolated by flow-cytometry). Part of the RNA is used to directly measure RNA amount by the Ribogreen RNA quantitative method (Molecular Probes, Eugene, Oreg.), and part used to measure rRNA and GAPDH expression (a control housekeeping gene) via the Taqman real-time RT-PCR assay. Taken together, these control measurements allow us to normalize expression of the genes of interest between the different populations of cells (Bustin, 2000, J. Mol. Endocrinol. 25:169-93). Although fewer cells can be used in this assay, analysis of RNA isolated from 3×104 cells should result in a more accurate measurement of gene expression.
Each frizzled receptor expressed by the different populations of cancer cells from each tumor is analyzed for the ability to activate β-catenin and transform cells when stimulated by each of the different Wnt genes that are expressed by different populations of cells within a tumor. Two biological systems are used for these studies.
First, we use HEK 293 cells transfected with each individual frizzled identified in this screen to test the ability of the identified Wnts to activate β-catenin through the frizzled proteins expressed by the tumorigenic cells. Next, we use a mammary epithelial cell line to determine whether a particular Wnt or frizzled gene is able to transform the cell line.
To measure the biochemical functions of the different Wnt and Frizzled proteins expressed by the breast cancer cells, we use a transient transfection assay as described by Gazit et al., 1999, Oncogene 18:5959-66. In this assay, HEK 293T cells are transiently transfected with a frizzled minigene or a control minigene and aTCF-luciferase or control reporter minigene. To test the ability of a particular Wnt protein to stimulate β-catenin signaling, a second group of HEK 293T cells are transfected with each of the Wnt genes expressed by the various populations of tumor cells. The frizzled-transfected cells are mixed with the Wnt-transfected cells to measure paracrine activation of a particular frizzled receptor expressed by the breast cancer stem cells activates β-catenin when stimulated by a particular Wnt protein expressed by one of the various populations of tumor cells.
The C57MG cell line is used to determine whether activation of particular frizzled receptors by particular Wnts causes morphological transformation (Wong et al., 1994, Mol. Cell. Biol. 14:6278-86). These cells undergo morphologic transformation when exposed to Wnt-1, Wnt-2, Wnt-3A, Wnt-6 and Wnt-7A, but not Wnt4, Wnt-5A, Wnt-5B and Wnt-7B. These data suggest that the non-transforming Wnts signal differently than the transforming Wnts, or that they signal through different receptors not expressed by the C57MG cells. Therefore, to fully characterize the functions of the different frizzled and Wnt proteins expressed by the cancer cells in the patients' tumors, we must first determine which frizzled genes are expressed by the C57MG cells. The cells are transfected with minigenes that express any frizzled genes expressed by tumorigenic breast cancer cells but not expressed by the C57MG cells. Next, cells are cultured in the presence of lethally irradiated fibroblasts or HEK 293T cells transfected with individual Wnt genes that were expressed by the different populations of tumor cells. The cells are analyzed for morphological transformation as described by Shimizu et al., 1997, Cell Growth Diff. 8:1349-58.
Next, we characterize the in vivo response of cancer cells from the different patients' tumors to different Wnts made by the tumor cells. The Wnt proteins are often found in the extracellular matrix and difficult to prepare in soluble forms. Therefore, we make control HEK 293 cell lines that express each of the Wnts made by the various types of tumor cells present in 2 patients' tumors. To do this, we first analyze HEK 293 cells to determine whether they constitutively make any of the Wnt proteins. Next, we stably transfect the HEK 293 cells with each of the Wnts made by the patients' tumor cells. To determine the affect of Wnt stimulation in the breast cancer cells by each of its ligands in vivo, 0, 10, 50, 100, 200, 500 and 1,000 Tumor 1 stem cells are mixed with 500,000 lethally irradiated control 293 cells or 293 cells transfected with one or more relevant Wnt minigenes and then injected into immunodeficient mice. Each injection is done in five mice. The mice then are monitored weekly for tumor formation. If a particular Wnt stimulates self renewing cell division, then either fewer cells are needed to initiate a tumor and/or tumors form more quickly. Conversely, if the ligand induces commitment to differentiation, then more cells are required to form a tumor and/or tumors take longer to form.
Expected results. The interaction of cancer cells with the normal stromal cells in tumors is thought to be critical for tumor formation and metastasis (Hanahan & Weinberg, 2000, Cell 100:57-70). The Wnt pathway is one of the central pathways by which cells in normal tissues communicate (Cadigan & Nusse, 1997, Genes & Dev. 11:3286-305). It is therefore likely that such communications are maintained to some extent in tumors. The models described in this proposal for the first time enable such studies to be conducted using patients' tumor cells. If the stromal cells indeed promote tumor growth through Wnt signaling, then the various populations of stromal cells make specific Wnts that provide a proliferative signal for the tumorigenic cancer cells.
To minimize these problems, all tests are done in triplicate with different numbers of cells. Expression of a control RNA of a known quantity is used to construct a standard curve to analyze the data(reviewed in Bustin, 2002, J. Mol. Endocrinol. 25:169-193). If necessary, new PCR primers are made, or RT is done with gene specific primers recognizing a different part of the mRNA (oligo dT primers are used for the RT reaction initially).
Summary: These tests using the systems and methods of the present invention provide means for comprehensive detail the molecular mechanisms by which the β-catenin pathway is activated in vivo in tumorigenic populations of breast cancer cells obtained directly from multiple patients' tumors, and the biological consequences of this activation in de novo breast cancer cells.
In normal hematopoietic cells, nuclear β-catenin is found only in the stem cell compartment. Reya et al. further demonstrate that β-catenin signaling is necessary for normal stem cells to self-renew. A recently completed analysis of the subcellular localization of β-catenin in tumorigenic and non-tumorigenic tumor 1 breast cancer cells further supports this notion. Normally, the subcellular distribution of β-catenin is heterogeneous in cancer cells. In some cells, the protein is located primarily in the outer membrane, while in others primarily in the nucleus. The subcellular distribution of the protein differs in the tumorigenic and non-tumorigenic cancer cells. The β-catenin is primarily located in the cytoplasm of the non-tumorigenic cancer cells, while it is primarily in the nucleus of the tumorigenic cells (
To begin to understand the biological consequences of β-catenin signaling in breast cancer, we have tested our dominant negative TCF-4 (dTCF4) adenovirus vector in several cell lines. This adenovirus acts to inhibit β-catenin signaling. Two different breast cancer cell lines, SKBR3 and MCF7, and a gastrointestinal tract cancer cell line, RKO, were infected with the dTCF4 adenovirus or a control adenovirus (empty vector). Four days after infection, the number of viable cells in each group was determined. As shown in
The observation that β-catenin is located primarily in the nucleus in the tumorigenic but not the non-tumorigenic cancer cells taken together with the observation that inhibition of β-catenin signaling affects the viability of some breast cancer cell lines shows that like normal stem cells, Wnt signals can play a role in the self-renewal of cancer stem cells.
This Example describes how various stem cell cancer markers were identified using microarray screens. The results of these screens were processed and the names of the differentially expressed genes are reported in Tables 4-9 (see above).
In order to generate gene expression profiles, human breast tumorigenic cells that were initially isolated. A series of samples were accumulated from human breast tumors or normal tissues. These were generated as follows. Three passaged breast tumors—breast tumor cells from patient 1, 2, 3 were engrafted on mice. Each tumor was engrafted on three mice to make the triplicate tumors. The breast tumorigenic cells were then isolated from these tumors. Two or three unpassaged breast tumors from three patients SUM, PE13, PE15 were labeled and sorted into tumorigenic cells (TG) or non-tumorigenic cells (NTG). Both PE15-TG and PE15-NTG were triplicate. Two or three normal breast samples were from breast reduction patients. Breast epithelial cells (Breast) were isolated with flow cytometry and used for microarray. Two or three normal colon samples were collected freshly from colon patients. Colon epithelial cells (Colon) were isolated with flow cytometry and used for microarray. Two or three normal stem cell samples (normal bone marrow) were collected from bone marrow donors. Hematopoietic stem cells (HSC) were isolated with flow cytometry. Probes were made from the following were made from the various cells types for use in the microarray analysis.
In order to perform the various microarray screens Affymetrix HG-U133 gene chips were used. The normalized gene expression intensity was used to generate the data that was collected in a number of large tables. The results in these tables was processed and used to generate Tables 4, 5, 6, 7a, 7b, 7c, 7d and 8, which present the names of the genes found to be differentially expressed. For tables 4-6, candidate cancer markers were sorted by identifying genes whose expression was greater or less than 1.5 fold in unpassaged breast tumorigenic cells comparing to non-tumorigenic cells or the normal stem cells (HSC). Table 6 shows only those genes found to be down regulated in UPTG vs. UPNTG. Table 5 shows only those genes found to be up regulated in UPTG vs. HSC. Table 4 shows only those genes found to be up regulated in UPTG vs UPNTG. For tables 7a, 7b, 7c and 7d, cancer markers were generated from the larger tables by standard T-test. These tables were sorted based on T-score is <0.01 and ratio is more than 2 fold. Table 7a shows only those genes found to be up regulated in UPTG vs. HSC. Table 7b shows only those genes found to be down regulated in UPTG vs. HSC. Table 7c shows only those genes found to be up regulated in PTG vs HSC. Table 7d shows only those genes found to be down regulated in PTG vs HSC. In the tables, the column headers refer to the gene's name or samples name and array numbers.
This example describes the identification of a cancer stem cell gene signature useful to diagnose, predict clinical outcome of a cancer including metastasis and overall survival, as well as for use in clinical trials for achieving stratification of patients and for testing of a potential therapeutic for treating cancer and for guiding cancer therapy. The cancers for which a cancer stem cell gene signature can be used to predict clinical outcome include for example, breast cancer, colon and rectum cancer, pancreas, lung and bronchus, urinary bladder, kidney, head and neck cancer and additional cancers as set forth in more detail above. Furthermore, use of these cancer stem cell gene signatures to classify tumor samples into low and high risk for the purpose of prognosis and therapy selection is provided.
Cancer Stem Cell Gene Signatures 1-9 Generated by Expression Level and P Value
The genome-wide analysis of the present invention identified cancer stem cell gene signatures by determining gene expression levels in tumor stem cells compared to a non-tumorigenic tissue, such as normal breast epithelium. These gene expression levels were then further sorted and the resulting tumor stem cell gene signatures tested for the ability to predict clinical outcome including metastasis and death. To generate a tumor stem cell gene signature from the microarray comparison of cancer stem cells gene expression versus normal breast tissue, genes with two-fold reduced or two-fold elevated expression in cancer stem cells were identified. These genes were then further filtered by the P value of a t-test between multiple samples. This resulted in nine cancer stem cell gene signatures (gene signatures 1-9) with t-test P values of 0.01, 0.0075, 0.005, 0.0025, 0.001, 0.00075, 0.0005, 0.00025, and 0.0001 comprising 291, 243, 186, 115, 62, 51, 38, 30, and 16 genes, respectively.
To assess the ability of cancer stem cell gene signatures to predict metastasis and death two independent cancer patient populations were used: 295 consecutive early breast cancer patients from the Netherlands Cancer Institute (van de Vijver et al., 2002, N. Eng. J. Med. 347:1999) and 286 lymph node negative breast cancer patients from the Erasmus Medical Center (Wang et al., 2005, Lancet 365:671). The gene expression from both patient populations had been analyzed by microarray analysis. Each microarray platform contained a subset of genes within each cancer stem cell gene signature (described in detail below) the expression of which was then used to generate a cancer gene profile that was then compared to a cancer stem cell gene signature. The predictive power of each of the nine cancer stem cell gene signatures was then assessed by Cox proportional hazard survival analysis of the 295 consecutive early breast cancer patients from the Netherlands Cancer Institute and chi-square analysis of the 286 patients from the Erasmus Medical Center. This analysis identified five of the nine cancer stem cell gene signatures including signatures 1, 2, 3, 4, and 7 (presented in Tables 9-13) as significantly predictive of metastasis and death for the 295 Netherlands patients and metastasis for the 286 Erasmus patients (
Cancer stem cell gene signature 1 comprises 290 genes selected from two-fold decreased and two-fold elevated expression in tumor stem cells versus normal breast tissue with a t-test P value of 0.01 across samples (Table 9A). Of the 290 genes that comprise signature one, 243 (Table 9B) are shared with the array used to analyze expression of the 295 patients from the Netherlands Cancer Institute (the 295 array). Correlation and Cox proportional hazard survival analysis of microarray data from these 295 patients revealed cancer stem cell gene signature 1 as significantly predictive of metastasis with a univariate hazard ratio of 1.27 per 0.1 correlation (P=1.7×10−4) and significantly predictive of death with a univariate hazard ratio of 1.32 per 0.1 correlation (P=1.6×10−4) (Table 14). Of the 290 genes that comprise signature one, 178 (Table 9C) are shared with the array used to analyze expression of the 286 patients from the Erasmus Medical Center (the 286 array). Correlation and chi-square analysis of microarray data from these 286 patients revealed that patient tumors with a gene expression profile that correlated with cancer stem cell signature 1 had a significantly increased risk of metastasis (P=0.0005) (Table 14). Of the 290 genes that comprise signature one, 162 (Table 9D) are shared with both the 295 array and the 286 array. Using these common 162 genes, correlation and Cox proportional hazard survival analysis of microarray data from the 295 Netherlands patients again revealed cancer stem cell gene signature 1 as significantly predictive of metastasis with a univariate hazard ratio of 1.24 per 0.1 correlation (P=3.1×10−4) and significantly predictive of death with a univariate hazard ratio of 1.27 per 0.1 correlation (P=4.2×10−4) (Table 15). Using these 162 genes, correlation and chi-square analysis of microarray data from the 286 Erasmus patients also revealed that patient tumors with a gene expression profile that correlated with cancer stem cell gene signature 1 had a significant increased risk of metastasis (P=0.015) (Table 15).
Cancer stem cell gene signature 2 comprises 242 genes selected from two-fold decreased and two-fold elevated expression with a t-test P value of 0.0075 across samples (Table 10A). Of the 242 genes that comprise cancer stem cell gene signature two, 202 (Table 10B) are shared with the 295 array. Correlation and Cox proportional hazard survival analysis of microarray data from the 295 patients from the Netherlands Cancer Institute revealed cancer stem cell gene signature 2 as significantly predictive of metastasis with a univariate hazard ratio of 1.28 per 0.1 correlation (P=7.5×10−5) and significantly predictive of death with a univariate hazard ratio of 1.33 per 0.1 correlation (P=6.2×10−5) (Table 14). Of the 242 genes that comprise cancer stem cell gene signature two, 146 (Table 10C) are shared with the 286 array. Correlation and chi-square analysis of microarray data from the 286 patients from the Erasmus Medical Center revealed that patient tumors with a gene expression profile that correlated with cancer stem cell gene signature 2 had a significantly increased risk of metastasis (P=0.0001) (Table 14). Of the 242 genes that comprise cancer stem cell gene signature two, 132 (Table 10D) are shared with both the 295 array and the 286 array. Using these common 132 genes, correlation and Cox proportional hazard survival analysis of microarray data from the 295 Netherlands patients again revealed cancer stem cell gene signature 2 as significantly predictive of metastasis with a univariate hazard ratio of 1.25 per 0.1 correlation (P=6.2×10−5) and significantly predictive of death with a univariate hazard ratio of 1.29 per 0.1 correlation (P=6.9×10−5) (Table 15). Using these 132 genes, correlation and chi-square analysis of microarray data from the 286 Erasmus patients also revealed that patient tumors with a gene expression profile that correlated with cancer stem cell gene signature 2 had a significant increased risk of metastasis (P=0.0002) (Table 15).
Cancer stem cell gene signature 3 comprises 186 genes selected from two-fold decreased and two-fold elevated expression with a t-test P value of 0.005 across samples (Table 11A). This cancer stem cell gene signature was determined to be the most statistically significant predictor of clinical outcome of all the cancer stem cell signatures (
Cancer stem cell gene signature 4 comprises 114 genes selected from two-fold decreased and two-fold elevated expression with a t-test P value of 0.0025 across samples (Table 12A). Of the 114 genes that comprise cancer stem cell gene signature four, 93 (Table 12B) are shared with the 295 array. Correlation and Cox proportional hazard survival analysis of microarray data from the 295 patients from the Netherlands Cancer Institute revealed cancer stem cell gene signature 4 as significantly predictive of metastasis with a univariate hazard ratio of 1.27 per 0.1 correlation (P=2.3×10−5) and significantly predictive of death with a univariate hazard ratio of 1.33 per 0.1 correlation (P=1.1×10−5) (Table 14). Of the 114 genes that comprise signature four, 66 (Table 12C) are shared with the 286 array. Correlation and chi-square analysis of microarray data from the 286 patients from the Erasmus Medical Center revealed that patient tumors with a gene expression profile that correlated with cancer stem cell gene signature 4 had a significantly increased risk of metastasis (P<0.0001) (Table 14). Of the 114 genes that comprise cancer stem cell gene signature four, 60 (Table 12D) are shared with both the 295 array and the 286 array. Using these common 60 genes, correlation and Cox proportional hazard survival analysis of microarray data from the 295 Netherlands patients again revealed cancer stem cell gene signature 4 as significantly predictive of metastasis with a univariate hazard ratio of 1.22 per 0.1 correlation (P=7.9×10−6) and significantly predictive of death with a univariate hazard ratio of 1.28 per 0.1 correlation (P=2.9×10−6) (Table 15). Using these 60 genes, correlation and chi-square analysis of microarray data from the 286 Erasmus patients also revealed that patient tumors with a gene expression profile that correlated with cancer stem cell gene signature 4 had a significant increased risk of metastasis (P=0.0001) (Table 15).
Cancer stem cell gene signature 7 comprises 37 genes selected from two-fold decreased and two-fold elevated expression with a t-test P value of 0.0005 across samples (Table 13A). Of the 37 genes that comprise signature seven, 28 (Table 13B) are shared with the 295 array. Correlation and Cox proportional hazard survival analysis of microarray data from the 295 patients from the Netherlands Cancer Institute revealed cancer stem cell gene signature 7 as significantly predictive of metastasis with a univariate hazard ratio of 1.21 per 0.1 correlation (P=3.2×10−3) and significantly predictive of death with a univariate hazard ratio of 1.23 per 0.1 correlation (P=4.1×10−3) (Table 14). Of the 37 genes that comprise signature seven, 17 (Table 13C) are shared with the 286 array. Correlation and chi-square analysis of microarray data from the 286 patients from the Erasmus Medical Center revealed that patient tumors with a gene expression profile that correlated with cancer stem cell gene signature 7 had a significantly increased risk of metastasis (P=0.0103) (Table 14). Of the 37 genes that comprise cancer stem cell gene signature seven, 15 (Table 13D) are shared with both the 295 array and the 286 array. Using these common 15 genes, correlation and Cox proportional hazard survival analysis of microarray data from the 295 Netherlands patients again revealed cancer stem cell gene signature 7 as significantly predictive of metastasis with a univariate hazard ratio of 1.13 per 0.1 correlation (P=1.8×10−4) and significantly predictive of death with a univariate hazard ratio of 1.17 per 0.1 correlation (P=3.6×10−5) (Table 15). However, using these 15 genes, correlation and chi-square analysis of microarray data from the 286 Erasmus patients revealed that patient tumors with a gene expression profile that correlated with cancer stem cell signature 7 had only a weakly significant increased risk of metastasis (P=0.088) (Table 15).
Because of its high statistical significance in the prediction of clinical outcome, the cancer stem cell gene signature 3 was further analyzed using the 295 patient from the Netherlands Cancer Institute. The correlation between the gene expression of each of the 295 patient samples and cancer stem cell gene signature 3 identified 230 patients with a correlation greater than 0 and 65 patients with a correlation less than 0. Kaplan-Meier survival analysis was used to compare the occurrence of metastasis and death between patients with correlations greater than 0 and those with correlations less than 0 (
The cancer stem cell gene signature 3 was further correlated against cancer gene profiles of different tumor subclasses as well as several clinical parameters. Microarray analysis has been used to segregate breast carcinomas based on gene expression into five distinct subclasses: Luminal A, Luminal B, ERBB2, Basal, and Normal-like (Sorlie et al., 2001, PNAS 98:10869). The correlation of cancer stem cell gene signature 3 with these different tumor subclasses is shown pictorially in
A similar analysis was undertaken with genes differentially expressed in non-tumorigenic tumor cells compared to normal breast epithelium. To generate a non-tumorigenic gene signature from the microarray comparison of non-tumorigenic cells versus normal breast tissue, genes with two-fold reduced or two-fold elevated expression in non-tumorigenic cells were identified. These genes were then further filtered by the P value of a t-test between the two samples. This resulted in nine non-tumorigenic gene signatures with t-test P values of 0.05, 0.025, 0.01, 0.0075, 0.005, 0.0025, 0.001, 0.00075, and 0.0005 comprising 572, 345, 168, 133, 98, 53, 29, 22, and 16 genes, respectively. Correlation plus survival analysis of microarray data from the 295 patients from the Netherlands Cancer Institute and correlation plus chi-square analysis of microarray data from 286 patients from the Erasmus Medical Center revealed non-tumorigenic gene signatures were statistically weak predictors of clinical outcome compared to cancer stem cell gene signatures.
The power of cancer stem cell gene signature 3 to predict tumor metastasis at five years was compared with other identified gene signatures. Analysis of receiver operating characteristic (ROC) curves revealed a previously described wound-response (WR) signature (Table 26; Chang et al., 2005, PNAS 102:3738) with a very similar AUC (area under the curve) value of 0.71 (P<0.001). The univariate Cox survival analysis using the cancer stem cell gene signature and the WR signature in 262 tumors selected from NKI 295 breast patients with metastasis at 5 years or metastasis-free after more than 5 years revealed similar p-values and hazard ratios (p-value=2.4e−7 & 2.9e−7; HR per 0.1=1.5 & 1.4). A multivariate Cox survival assay also showed that these two gene signatures performed independently with a combined predictive power with a p-value<2.4e−8 suggesting that analysis with both signatures may be even more powerful than either alone. To test this, the patients in the 295 patient NKI database were stratified based on both the cancer stem cell gene signature 3 and the WR signature. This stratification allowed the identification of patients with extremely low, intermediate, and high risk for developing metastasis (
The current use of adjuvant chemotherapy for the treatment of breast cancer is based on evidence from randomized clinical trials and from prospective studies, some of which are summarized in the NIH and St. Gallen (SG) consensus guidelines (“http://” followed by “consensus.nih.gov/”) Goldhirsch et al., 2003, J. Clin. Oncol. 21:3357-65). Patients in the NKI database were stratified into high and low risk groups based on criteria listed in the NIH and SG guidelines, 284 and 273 all of whom, respectively, would be uniformly treated with adjuvant chemotherapy to prevent recurrence of their cancers. However, only 35% of these patients (99/284 or 97/273) actually developed tumor metastasis. Moreover, 65% of these patients (185/284 or 176/273) had a greater than 10-year metastasis-free survival, suggesting they would have unlikely benefited from adjuvant chemotherapy.
Using Pearson correlation and Kaplan-Meier survival analysis, 284 high risk tumors by NIH consensus criteria were further stratified using cancer stem cell gene signature 3. This analysis revealed that 21% of patients (60/284) had tumors whose gene expression profiles negatively correlated with cancer stem cell gene signature 3 (cor<0; p<0.0001), of which 13% cases developed metastasis (8/60) with a 10 year metastasis-free survival of 81% (
Since cancer stem cells from different tumors share the properties of self-renewal and proliferation to generate tumor heterogeneity, the prognostic ability of cancer stem cell gene signature 3 was assessed in cancers from different organs. Specifically, the ability of cancer stem cell gene signature 3 to predict overall survival or metastasis-free survival for patients with lung cancer, medulloblastoma and prostate cancer was determined. The microarray gene expression data of each cancer contained a subset of genes that overlapped with the 186-gene signature. Again, Pearson correlation analysis was applied to determine how closely correlated the patient's tumor gene expression profile was with cancer stem cell gene signature 3. 62 lung cancer, 60 medulloblastoma, and 21 prostate cancer tumors were analyzed and subsequently categorized into good and poor prognosis groups based on the Peterson correlation assay (
A second set of cancer stem cell gene signatures was generated from microarray comparisons by segregating genes differentially expressed in tumor stem cells versus normal breast epithelium into different cancer stem cell gene signatures based on their fold expression change. Specifically, eight cancer stem cell gene signatures (gene signature 10-17) were generated by the reduced and elevated expression of genes at log 2 (4 fold), log 2.5 (5.7 fold), log 3 (8 fold), log 3.5 (11.3 fold); log 4 (16 fold), log 4.5 (22.6), log 5 (32 fold) and a log 5.5 (45.3 fold) in tumor stein cells versus normal breast tissue. These cancer stem cell gene signatures comprise 585, 331, 205, 127, 89, 51, 35, and 20 genes, respectively (Tables 17A-24A).
To assess the ability of cancer stem cell gene signatures 10-17 to predict metastasis and death two independent cancer patient populations were used: 295 consecutive early breast cancer patients from the Netherlands Cancer Institute (van de Vijver et al., 2002, N. Eng. J. Med. 347:1999) and 286 lymph node negative breast cancer patients from the Erasmus Medical Center (Wang et al., 2005, Lancet 365:671). The gene expression from both patient populations had been analyzed by microarray analysis. Each microarray platform contained a subset of genes within each cancer stem cell gene signature (described in detail below) the expression of which was then used to generate a cancer gene profile for each patient sample that was then compared to the cancer stem cell gene signature. The predictive power of each of the eight cancer stem cell gene signatures was then assessed by Cox proportional hazard survival analysis of the 295 consecutive early breast cancer patients from the Netherlands Cancer Institute and chi-square analysis of the 286 patients from the Erasmus Medical Center. This analysis identified subsets of many of the eight cancer stem cell gene signatures (presented in Tables 17B,C,D-21 B,C,D; 22B,C; 23A,B-24A,B), all of which were significantly predictive of metastasis and death for the 295 Netherlands patients and metastasis for the 286 Erasmus patients (Tables 25 & 26) as described in detail below.
Cancer stem cell gene signature 10 comprises 585 genes selected from a log 2 (4 fold) change in gene expression (Table 17A). Of the 585 genes that comprise signature ten, 494 (Table 17B) are shared with the array used to analyze expression of the 295 patients from the Netherlands Cancer Institute (the 295 array). Correlation and Cox proportional hazard survival analysis of microarray data from these 295 patients revealed cancer stem cell gene signature 10 as significantly predictive of metastasis with a univariate hazard ratio of 1.39 per 0.1 correlation (P=1.6×10−7) and significantly predictive of death with a univariate hazard ratio of 1.58 per 0.1 correlation (P=1.7×10−9) (Table 25). Of the 585 genes that comprise signature ten, 410 (Table 17C) are shared with the array used to analyze expression of the 286 patients from the Erasmus Medical Center (the 286 array). Correlation and chi-square analysis of microarray data from these 286 patients revealed that patient tumors with a gene expression profile that correlated with cancer stem cell signature 10 had a significantly increased risk of metastasis (P=0.0073) (Table 25). Of the 585 genes that comprise cancer stem cell signature ten, 367 (Table 17D) are shared with both the 295 array and the 286 array. Using these common 367 genes, correlation and Cox proportional hazard survival analysis of microarray data from the 295 Netherlands patients again revealed cancer stem cell gene signature 10 as significantly predictive of metastasis with a univariate hazard ratio of 1.37 per 0.1 correlation (P=4.2×10−7) and significantly predictive of death with a univariate hazard ratio of 1.55 per 0.1 correlation (P=1.1×10−8) (Table 26). Using these 367 genes, correlation and chi-square analysis of microarray data from the 286 Erasmus patients also revealed that patient tumors with a gene expression profile that correlated with cancer stem cell gene signature 10 had a significant increased risk of metastasis (P=0.0073) (Table 26).
Cancer stem cell gene signature 11 comprises 331 genes selected from a log 2.5 (5.7 fold) change in gene expression (Table 18A). Of the 331 genes that comprise signature eleven, 277 (Table 18B) are shared with the 295 array. Correlation and Cox proportional hazard survival analysis of microarray data from the 295 patients from the Netherlands Cancer Institute revealed cancer stem cell gene signature 11 as significantly predictive of metastasis with a univariate hazard ratio of 1.35 per 0.1 correlation (P=3.0×10−7) and significantly predictive of death with a univariate hazard ratio of 1.52 per 0.1 correlation (P=7.2×10−9) (Table 25). Of the 331 genes that comprise signature eleven, 237 (Table 18C) are shared with the 286 array. Correlation and chi-square analysis of microarray data from the 286 patients from the Erasmus Medical Center revealed that patient tumors with a gene expression profile that correlated with cancer stem cell gene signature 11 had a significantly increased risk of metastasis (P=0.0048) (Table 25). Of the 331 genes that comprise cancer stem cell gene signature eleven, 211 (Table 18D) are shared with both the 295 array and the 286 array. Using these common 211 genes, correlation and Cox proportional hazard survival analysis of microarray data from the 295 Netherlands patients again revealed cancer stem cell gene signature 11 as significantly predictive of metastasis with a univariate hazard ratio of 1.30 per 0.1 correlation (P=1.5×10−6) and significantly predictive of death with a univariate hazard ratio of 1.43 per 0.1 correlation (P=7.1×10−8) (Table 26). Using these 211 genes, correlation and chi-square analysis of microarray data from the 286 Erasmus patients also revealed that patient tumors with a gene expression profile that correlated with cancer stem cell gene signature 11 had a significant increased risk of metastasis (P=0.00167) (Table 26).
Cancer stem cell gene signature 12 comprises 205 genes selected from a log 3 (8 fold) change in gene expression (Table 19A). Of the 205 genes that comprise cancer stem cell gene signature twelve, 177 (Table 19B) are shared with the 295 array. Correlation and Cox proportional hazard survival analysis of microarray data from the 295 patients from the Netherlands Cancer Institute revealed cancer stem cell gene signature 12 as significantly predictive of metastasis with a univariate hazard ratio of 1.33 per 0.1 correlation (P=6.6×10−7) and significantly predictive of death with a univariate hazard ratio of 1.47 per 0.1 correlation (P=2.8×10−8) (Table 25). Of the 205 genes that comprise cancer stem cell gene signature twelve, 145 (Table 19C) are shared with the 286 array. Correlation and chi-square analysis of microarray data from the 286 patients from the Erasmus Medical Center revealed that patient tumors with a gene expression profile that correlated with cancer stem cell signature 12 had a significantly increased risk of metastasis (P=0.0086) (Table 25). Of the 205 genes that comprise signature twelve, 132 (Table 19D) are shared with both the 295 array and the 286 array. Using these common 132 genes, correlation and Cox proportional hazard survival analysis of microarray data from the 295 Netherlands patients again revealed cancer stem cell gene signature 12 as significantly predictive of metastasis with a univariate hazard ratio of 1.27 per 0.1 correlation (P=3.3×10−6) and significantly predictive of death with a univariate hazard ratio of 1.38 per 0.1 correlation (P=2.3×10−7) (Table 26). Using these 132 genes, correlation and chi-square analysis of microarray data from the 286 Erasmus patients also revealed that patient tumors with a gene expression profile that correlated with cancer stem cell gene signature 12 had a significant increased risk of metastasis (P=0.0086) (Table 26).
Cancer stem cell gene signature 13 comprises 127 genes selected from a log 3.5 (11.3 fold) change in gene expression (Table 20A). Of the 127 genes that comprise cancer stem cell gene signature thirteen, 115 (Table 20B) are shared with the 295 array. Correlation and Cox proportional hazard survival analysis of microarray data from the 295 patients from the Netherlands Cancer Institute revealed cancer stem cell gene signature 13 as significantly predictive of metastasis with a univariate hazard ratio of 1.30 per 0.1 correlation (P=5.5×10−7) and significantly predictive of death with a univariate hazard ratio of 1.43 per 0.1 correlation (P=2.0×10−8) (Table 25). Of the 127 genes that comprise cancer stem cell gene signature thirteen, 92 (Table 20C) are shared with the 286 array. Correlation and chi-square analysis of microarray data from the 286 patients from the Erasmus Medical Center revealed that patient tumors with a gene expression profile that correlated with cancer stem cell gene signature 13 had a significantly increased risk of metastasis (P=0.0021) (Table 25). Of the 127 genes that comprise cancer stem cell gene signature thirteen, 85 (Table 20D) are shared with both the 295 array and the 286 array. Using these common 85 genes, correlation and Cox proportional hazard survival analysis of microarray data from the 295 Netherlands patients again revealed cancer stem cell gene signature 13 as significantly predictive of metastasis with a univariate hazard ratio of 1.24 per 0.1 correlation (P=3.9×10−6) and significantly predictive of death with a univariate hazard ratio of 1.34 per 0.1 correlation (P=3.0×10−7) (Table 26). Using these 85 genes, correlation and chi-square analysis of microarray data from the 286 Erasmus patients also revealed that patient tumors with a gene expression profile that correlated with cancer stem cell gene signature 13 had a significant increased risk of metastasis (P=0.0014) (Table 26).
Cancer stem cell gene signature 14 comprises 89 genes selected from a log 4 (16 fold) change in gene expression (Table 21A). Of the 89 genes that comprise cancer stem cell gene signature fourteen, 83 (Table 21B) are shared with the 295 array. Correlation and Cox proportional hazard survival analysis of microarray data from the 295 patients from the Netherlands Cancer Institute revealed cancer stem cell gene signature 14 as significantly predictive of metastasis with a univariate hazard ratio of 1.29 per 0.1 correlation (P=7.0×10−7) and significantly predictive of death with a univariate hazard ratio of 1.44 per 0.1 correlation (P=1.2×10−8) (Table 25). Of the 89 genes that comprise signature fourteen, 63 (Table 21C) are shared with the 286 array. Correlation and chi-square analysis of microarray data from the 286 patients from the Erasmus Medical Center revealed that patient tumors with a gene expression profile that correlated with cancer stem cell gene signature 14 had a significantly increased risk of metastasis (P<0.0001) (Table 25). Of the 89 genes that comprise signature fourteen, 59 (Table 21D) are shared with both the 295 array and the 286 array. Using these common 59 genes, correlation and Cox proportional hazard survival analysis of microarray data from the 295 Netherlands patients again revealed cancer stem cell gene signature 14 as significantly predictive of metastasis with a univariate hazard ratio of 1.23 per 0.1 correlation (P=6.0×10−6) and significantly predictive of death with a univariate hazard ratio of 1.35 per 0.1 correlation (P=1.6×10−7) (Table 26). Using these 59 genes, correlation and chi-square analysis of microarray data from the 286 Erasmus patients also revealed that patient tumors with a gene expression profile that correlated with cancer stem cell gene signature 14 had a significant increased risk of metastasis (P<0.0001) (Table 26).
Cancer stem cell gene signature 15 comprises 51 genes selected from a log 4.5 (22.6 fold) change in gene expression (Table 22A). Of the 51 genes that comprise cancer stem cell gene signature fifteen, 50 (Table 22B) are shared with the 295 array. Correlation and Cox proportional hazard survival analysis of microarray data from the 295 patients from the Netherlands Cancer Institute revealed cancer stem cell gene signature 15 as significantly predictive of metastasis with a univariate hazard ratio of 1.24 per 0.1 correlation (P=3.3×10−6) and significantly predictive of death with a univariate hazard ratio of 1.39 per 0.1 correlation (P=2.1×10−8) (Table 25). Of the 51 genes, 34 (Table 22C) are shared with the 286 array. Correlation and chi-square analysis of microarray data from the 286 patients from the Erasmus Medical Center revealed that patient tumors with a gene expression profile that correlated with cancer stem cell gene signature 15 had a significantly increased risk of metastasis (P=0.0012) (Table 25). Of the 51 genes that comprise signature fifteen, 34 (Table 22C) are shared with both the 295 array and the 286 array. Using these common 34 genes, correlation and Cox proportional hazard survival analysis of microarray data from the 295 Netherlands patients again revealed cancer stem cell gene signature 15 as significantly predictive of metastasis with a univariate hazard ratio of 1.16 per 0.1 correlation (P=9.3×10−5) and significantly predictive of death with a univariate hazard ratio of 1.26 per 0.1 correlation (P=1.8×10−6) (Table 26).
Cancer stem cell gene signature 16 comprises 35 genes selected from a log 5 (32 fold) change in gene expression (Table 23A). Of the 35 genes that comprise cancer stem cell gene signature sixteen, all are present on the 295 array. Correlation and Cox proportional hazard survival analysis of microarray data from the 295 patients from the Netherlands Cancer Institute revealed cancer stem cell gene signature 16 as significantly predictive of metastasis with a univariate hazard ratio of 1.20 per 0.1 correlation (P=5.9×10−5) and significantly predictive of death with a univariate hazard ratio of 1.32 per 0.1 correlation (P=7.0×10−7) (Table 25). Of the 35 genes, 23 (Table 23B) are shared with the 286 array. Correlation and chi-square analysis of microarray data from the 286 patients from the Erasmus Medical Center revealed that patient tumors with a gene expression profile that correlated with cancer stem cell gene signature 16 had a significantly increased risk of metastasis (P<0.0001) (Table 25). Of the 35 genes that comprise cancer stem cell gene signature sixteen, 23 (Table 23B) are shared with both the 295 array and the 286 array. Using these common 23 genes, correlation and Cox proportional hazard survival analysis of microarray data from the 295 Netherlands patients again revealed cancer stem cell gene signature 16 as significantly predictive of metastasis with a univariate hazard ratio of 1.13 per 0.1 correlation (P=1.2×10−3) and significantly predictive of death with a univariate hazard ratio of 1.20 per 0.1 correlation (P=6.5×10−5) (Table 26).
Cancer stem cell gene signature 17 comprises 20 genes selected from a log 5.5 (45.3 fold) change in gene expression (Table 24A). Of the 20 genes that comprise cancer stem cell gene signature seventeen, all are present on the 295 array. Correlation and Cox proportional hazard survival analysis of microarray data from the 295 patients from the Netherlands Cancer Institute revealed cancer stem cell gene signature 17 as significantly predictive of metastasis with a univariate hazard ratio of 1.16 per 0.1 correlation (P=2.1×10−4) and significantly predictive of death with a univariate hazard ratio of 1.26 per 0.1 correlation (P=6.7×10−6) (Table 25). Of the 20 genes, 14 (Table 24B) are shared with the 286 array. Correlation and chi-square analysis of microarray data from the 286 patients from the Erasmus Medical Center revealed that patient tumors with a gene expression profile that correlated with cancer stem cell gene signature 17 had a significantly increased risk of metastasis (P=0.0005) (Table 25). Of the 20 genes that comprise signature seventeen, 14 (Table 24B) are shared with both the 295 array and the 286 array. Using these common 14 genes, correlation and Cox proportional hazard survival analysis of microarray data from the 295 Netherlands patients again revealed cancer stem cell gene signature 17 as significantly predictive of metastasis with a univariate hazard ratio of 1.12 per 0.1 correlation (P=2.0×10−3) and significantly predictive of death with a univariate hazard ratio of 1.20 per 0.1 correlation (P=6.8×10−5) (Table 26).
Use of Cancer Stem Cell Signatures to Classify Tumors and Provide a Prognosis
The identification of cancer stem cell gene signatures that predict poor prognosis in a dataset of 295 early breast cancer patients and 286 lymph-node negative breast cancer patients show its further use in predicting the clinical outcome of individual patients. Importantly, though the cancer stem cell gene signatures were identified from a comparison of cancer stem cells against normal breast tissue, their prognostic ability was identified from microarray analysis of unfractionated, and thus heterogenous, breast tumor samples normalized either against a reference set of tumor samples (van't Veer et al., 2002, Nature 415:530; van de Vijver et al., 2002, N. Eng. J. Med. 347:1999) or scaled to a target intensity (Wang et al., 2005, Lancet 365:671). Thus unfractioned tumor samples including but not limited to solid tissue biopsies, fine needle aspirations, pleural effusions, etc. can be used to generate cancer gene profiles and compared to cancer stem cell gene signatures for predicting clinical outcomes in cancer patients.
To test individual patients, a tumor sample is obtained and is fresh-frozen for storage in liquid nitrogen at −180° C. before analysis. RNA is extracted from the tumor sample and fluorescently labeled cDNA probes are generated for microarray analysis using standard techniques in the art. cDNA probes from the patient sample are hybridized on DNA microarrays containing genes, or fragments thereof, which comprise a cancer stem cell gene signature. The hybridization is performed in combination with a reference sample composed of a pool of cDNAs from a number of tumor samples that are labeled with a complementary fluorophore. The hybridized array is scanned and the fluorescence intensities quantified, normalized, and corrected to yield the transcript abundance of each gene as an intensity ratio with respect to that of the signal of the reference sample. Alternatively the isolated RNA is used for quantitative RT-PCR using primer sets that specifically amplify the gene(s) that comprise a cancer stem cell gene signature and expression is normalized to glyceraldehydes-3-phosphate dehydrogenase (GAPDH). Gene expression in the patient sample produces a cancer gene profile that is then correlated with the cancer stem cell gene signature. The detection of a positive correlation with a cancer stem cell gene signature is used to categorize a tumor as high risk and an appropriate therapy is thus administered. The signature can further be used over time to monitor the response to the chosen therapy. Alternatively, if no correlation or a negative correlation with a cancer stem cell gene signature is detected in the sample, a patient can be monitored over time for the appearance of the signature.
It should be understood the signatures described above are generally optimized. The present invention is not limited to the use of these particular signatures. Any combination of one or more markers that provides useful information can be used in the methods of the present invention. For example, it should be understood that one or more markers can be added or subtracted from the above signatures, while maintaining the ability of the signatures to yield useful information (e.g., more information than would be obtained in the absence of testing).
All publications and patents mentioned in the above specification are herein incorporated by reference. Various modifications and variations of the described method and system of the invention will be apparent to those skilled in the art without departing from the scope and spirit of the invention. Although the invention has been described in connection with specific some embodiments, it should be understood that the invention as claimed should not be unduly limited to such specific embodiments. Indeed, various modifications of the described modes for carrying out the invention which are obvious to those skilled in the relevant fields are intended to be within the scope of the following claims.
This application claims benefit of the filing date of U.S. Provisional Application No. 60/690,003, filed Jun. 13, 2005, which is incorporated by reference herein its entirety.
This invention was made with government support under Grant No. 5P01 CA07513606 awarded by the National Institutes of Health. The Government has certain rights in the invention.
Number | Date | Country | |
---|---|---|---|
60690003 | Jun 2005 | US |