Cancer is a leading cause of death worldwide and accounted for approximately 7.6 million deaths (around 13% of all deaths) in 2008 (Ferlay J, et al., GLOBOCAN 2008 v1.2, Cancer Incidence and Mortality Worldwide: IARC CancerBase No. 10 [Internet]. Lyon, France: International Agency for Research on Cancer; 2010). Although significant progress in the treatment of certain types of cancer such as childhood leukemia has been achieved over the past several decades, many of the most common types of cancer remain difficult to manage and are often incurable, particularly if discovered after the tumor has invaded locally or metastasized. Tumors can exhibit marked variability in terms of aggressiveness and response to treatment, despite displaying similar histopathologic features and stage. Such variability can complicate development of appropriate treatment plans for individual patients. There is a need in the art for identification and elucidation of pathways and cellular changes that contribute to malignancy. There is also a need in the art for innovative approaches for tumor prognosis and for selecting appropriate treatment regimens for individuals with cancer.
In some aspects, the invention provides a method of diagnosing cancer in a subject comprising the steps of: determining the level of Heat Shock Factor-1 (HSF1) expression or the level of HSF1 activation in a sample obtained from the subject, wherein increased HSF1 expression or increased HSF1 activation in the sample is indicative that the subject has cancer. In some embodiments, the method comprises comparing the level of HSF1 gene expression or HSF1 activation in the sample with a control level of HSF1 gene expression or HSF1 activation, wherein a greater level in the sample as compared with the control level is indicative that the subject has cancer. In some embodiments, the cancer is a cancer in situ (CIS). In some embodiments, the sample does not show evidence of invasive cancer. In some embodiments the sample comprises breast, lung, colon, prostate tissue, cervical, or nerve sheath tissue. In some embodiments the sample comprises breast tissue and the cancer is ductal carcinoma in situ (DCIS).
In some aspects, the invention provides a method of identifying cancer comprising the steps of: (a) providing a biological sample; and (b) determining the level of HSF1 expression or the level of HSF1 activation in the sample, wherein increased HSF1 expression or increased HSF1 activation in the sample is indicative of cancer. In some embodiments the method comprises comparing the level of HSF1 gene expression or HSF1 activation in the sample with a control level of HSF1 gene expression or HSF1 activation, wherein a greater level in the sample as compared with the control level is indicative of cancer. In some embodiments the sample does not show evidence of invasive cancer. In some embodiments the sample comprises breast, lung, colon, prostate, cervical, or nerve sheath tissue. In some embodiments the sample comprises breast tissue and the cancer is ductal carcinoma in situ (DCIS).
In some aspects, the invention provides a method of assessing a tumor with respect to aggressiveness, the method comprising: determining the level of HSF1 expression or HSF1 activation in a sample obtained from the tumor, wherein an increased level of HSF1 expression or activation is correlated with increased aggressiveness, thereby classifying the tumor with respect to aggressiveness. In some embodiments, the method comprises: (a) determining the level of HSF1 expression or the level of HSF1 activation in a sample obtained from the tumor; (b) comparing the level of HSF1 expression or HSF1 activation with a control level of HSF1 gene expression or HSF1 activation; and (c) assessing the aggressiveness of the tumor based at least in part on the result of step (b), wherein a greater level of HSF1 gene expression or HSF activation in the sample obtained from the tumor as compared with the control level of HSF1 gene expression or HSF activation, respectively, is indicative of increased aggressiveness.
In some aspects, the invention provides a method of classifying a tumor according to predicted outcome comprising steps of: determining the level of HSF1 expression or HSF1 activation in a sample obtained from the tumor, wherein an increased level of HSF1 expression or activation is correlated with poor outcome, thereby classifying the tumor with respect to predicted outcome. In some embodiments the method comprises (a) determining the level of HSF1 expression or the level of HSF1 activation in a tumor sample; and (b) comparing the level of HSF1 expression or HSF1 activation with a control level of HSF1 expression or HSF1 activation, wherein if the level determined in (a) is greater than the control level, the tumor is classified as having an increased likelihood of resulting in a poor outcome.
In some aspects, the invention provides a method of predicting cancer outcome in a subject, the method comprising: determining the level of HSF1 gene expression or the level of HSF1 activation in a tumor sample, wherein an increased level of HSF1 expression or activation is correlated with poor outcome, thereby providing a prediction of cancer outcome. In some embodiments the method comprises: (a) determining the level of HSF1 expression or the level of HSF1 activation in the tumor sample; and (b) comparing the level of HSF1 gene expression or HSF1 activation with a control level of HSF1 gene expression or HSF1 activation, wherein if the level determined in (a) is greater than the control level, the subject has increased likelihood of having a poor outcome.
In some aspects, the invention provides a method for providing prognostic information relating to a tumor, the method comprising: determining the level of HSF1 expression or HSF1 activation in a tumor sample from a subject in need of tumor prognosis, wherein if the level of HSF1 expression or HSF1 activation is increased, the subject is considered to have a poor prognosis. In some embodiments the method comprises: (a) determining the level of HSF1 expression or HSF1 activation in the sample; and (b) comparing the level with a control level, wherein if the level determined in (a) is greater than the control level, the subject is considered to have a poor prognosis.
In some aspects, the invention provides a method for providing treatment-specific predictive information relating to a tumor, the method comprising: determining the level of HSF1 expression or HSF1 activation in a tumor sample from a subject in need of treatment-specific predictive information, wherein the level of HSF1 expression or HSF1 activation correlates with tumor sensitivity or resistance to a treatment, thereby providing treatment-specific predictive information. In some embodiments the treatment comprises hormonal therapy, and the method comprises steps of: (a) determining the level of HSF1 expression or HSF1 activation in the sample; and (b) comparing the level with a control level, wherein if the level determined in (a) is greater than the control level, the tumor has an increased likelihood of being resistant to hormonal therapy. In some embodiments, the treatment comprises proteostasis modulator therapy, method comprising steps of: (a) determining the level of HSF1 expression or HSF1 activation in the sample; and (b) comparing the level with a control level, wherein if the level determined in (a) is greater than the control level, the tumor has an increased likelihood of being sensitive to proteostasis modulator therapy. In some embodiments proteostasis modulator therapy comprises a heat shock response (HSR) inhibitor. In some embodiments proteostasis modulator therapy comprises an HSF1 inhibitor. In some embodiments proteostasis modulator therapy comprises an HSP90 inhibitor. In some embodiments proteostasis modulator therapy comprises a proteasome inhibitor.
In some aspects, the invention provides a method of determining whether a subject with a tumor is a suitable candidate for treatment with a proteostasis modulator, the method comprising assessing the level of HSF1 expression or HSF1 activation in a tumor sample obtained from the subject, wherein an increased level of HSF1 expression or an increased level of HSF1 activation in the sample is indicative that the subject is a suitable candidate for treatment with a proteostasis modulator. In some embodiments the proteostasis modulator is an HSR inhibitor. In some embodiments the proteostasis modulator is an HSF1 inhibitor. In some embodiments, the proteostasis modulator is an HSP90 inhibitor. In some embodiments the proteostasis modulator is a proteasome inhibitor.
In some aspects, the invention provides a method of predicting the likelihood that a tumor will be sensitive to a protein homeostasis modulator, the method comprising: (a) determining the level of HSF1 gene expression or the level of HSF1 activation in a sample obtained from the tumor; and (b) comparing the level of HSF1 gene expression or HSF1 activation with a control level of HSF1 gene expression or HSF1 activation, wherein if the level determined in (a) is greater than the control level, the tumor has an increased likelihood of being sensitive to the protein homeostasis modulator. In some embodiments the proteostasis modulator is an HSR inhibitor. In some embodiments the proteostasis modulator is an HSF1 inhibitor. In some embodiments, the proteostasis modulator is an HSP90 inhibitor. In some embodiments the proteostasis modulator is a proteasome inhibitor. In some embodiments the tumor is a carcinoma, e.g., an adenocarcinoma. In some embodiments the tumor is a CIS. In some embodiments the tumor is a Stage I tumor. In some embodiments the tumor is a breast, lung, colon, prostate, cervical, or malignant nerve sheath tumor. In some embodiments the tumor is a stage I lung adenocarcinoma or stage I breast tumor. In certain embodiments the tumor is a breast tumor, e.g., a breast tumor that is positive for estrogen receptor (ER) positive breast tumor, human epidermal growth factor 2 (HER2), or both. In some embodiments the tumor is a lymph node negative tumor, e.g., a lymph node negative breast tumor. In certain embodiments the tumor is a ductal carcinoma in situ (DCIS). In certain embodiments in which the tumor is a breast tumor, the method further comprises assessing the sample for ER, progesterone receptor (PR), HER2 status, or lymph node status (or any combination thereof).
In some aspects, the invention provides a method for tumor diagnosis, prognosis, treatment-specific prediction, or treatment selection comprising: (a) providing a sample obtained from a subject in need of diagnosis, prognosis, treatment-specific prediction, or treatment selection for a tumor; (b) determining the level of HSF1 expression or HSF1 activation in the sample; (c) scoring the sample based on the level of HSF1 expression or HSF1 activation, wherein the score provides diagnostic, prognostic, treatment-specific predictive, or treatment selection information. In some embodiments, scoring comprises determining the level of an HSF1 gene product in the sample. In some embodiments, scoring comprises determining the level of HSF1 in nuclei of cells in the sample. In some embodiments, scoring comprises generating a composite score based on the percentage of cells that exhibit nuclear HSF1 and the level of nuclear HSF1. In some embodiments, scoring comprises comparing the level of HSF1 expression or HSF1 activation in the sample with the level of HSF1 expression or HSF1 activation in a control. In some embodiments the tumor is a carcinoma, e.g., an adenocarcinoma. In some embodiments the tumor is a sarcoma. In some embodiments the tumor is a CIS. In some embodiments the tumor is a stage I tumor. In some embodiments the tumor is a breast, lung, colon, prostate, cervical, or malignant nerve sheath tumor. In some embodiments the tumor is a stage I lung adenocarcinoma or stage breast tumor. In certain embodiments the tumor is a breast tumor, e.g., a breast tumor that is positive for estrogen receptor (ER) positive breast tumor, human epidermal growth factor 2 (HER2), or both. In some embodiments the tumor is a lymph node negative tumor, e.g., a lymph node negative breast tumor. In certain embodiments the tumor is a ductal carcinoma in situ (DCIS). In certain embodiments the tumor is an ER positive, lymph node negative breast tumor. In some embodiments wherein the tumor is a breast tumor and the method further comprises scoring the tumor for ER, PR, HER2, or lymph node status.
In some embodiments of any of the methods, determining the level of HSF1 expression comprises determining the level of an HSF1 gene product.
In some embodiments of any of the methods, determining the level of HSF1 expression comprises determining the level of HSF1 mRNA.
In some embodiments of any of the methods, determining the level of HSF1 expression comprises determining the level of HSF1 polypeptide.
In some embodiments of any of the methods, determining the level of HSF1 expression comprises detecting HSF1 polypeptide using an antibody that binds to HSF1 polypeptide.
In some embodiments of any of the methods, the sample comprises a tissue sample, and determining the level of expression or activation of HSF1 comprises performing immunohistochemistry (IHC) on the tissue sample.
In some embodiments of any of the methods, determining the level of HSF1 activation comprises measuring at least one bioactivity of HSF1 protein.
In some embodiments of any of the methods, determining the level of HSF1 activation comprises determining the localization of HSF1 polypeptide in cells, wherein nuclear localization is indicative of HSF1 activation. In some embodiments, nuclear localization is assessed using IHC.
In some embodiments of any of the methods, determining the level of HSF1 activation comprises detecting at least one post-translational modification of HSF1 polypeptide.
In some embodiments of any of the methods, determining the level of HSF1 activation comprises determining the level of phosphorylation of HSF1 polypeptide on serine 326, wherein phosphorylation of HSF1 polypeptide on serine 326 is indicative of HSF1 activation. In some embodiments the level of phosphorylated HSF1 (e.g., HSF1 phosphorylated on serine 326), is determined using an antibody that binds specifically to phosphorylated HSF1.
In some embodiments of any of the methods, determining the level of HSF1 activation comprises determining the level of chromatin occupancy by HSF1 polypeptide.
In some embodiments of any of the methods, determining the level of HSF1 activation comprises determining the level of a gene expression product of at least one HSF1-regulated gene other than a heat shock protein (HSP) gene.
In some aspects, the invention relates to identification of a transcriptional program regulated by HSF1 in cancer cells. In some aspects, the invention provides HSF1 cancer program (HSF1-CP) genes and subsets thereof. In some aspects, the invention provides HSF1 cancer signature set (CSS) genes and subsets thereof. In some aspects, the invention provides HSF1-CaSig, HSF1-CaSig2, HSF1-CaSig3, and refined HSF1-CSS cancer signature sets. In some aspects, the invention provides coordinately regulated sets of genes (Modules 1-5) comprising subsets of the HSF1-CP genes.
In some embodiments of any of the methods comprising determining the level of HSF1 activation, such determining comprises assessing expression of at least one HSF1 cancer program (HSF1-CP) gene. In some embodiments determining the level of HSF1 activation comprises determining the level of a gene product of at least one HSF1-CP gene. In some embodiments determining the level of HSF1 activation comprises assessing expression of an HSF1 cancer signature set (CSS) or subset thereof. In some embodiments determining the level of HSF1 activation comprises determining the level of a gene product of at least one HSF1-CSS gene.
In some embodiments of any of the methods, an HSF1 cancer signature set is HSF1-CaSig, HSF1-CaSig2, HSF1-CaSig3, or a refined HSF1-CSS. In some embodiments of any of the methods, an HSF1 cancer signature set gene is part of HSF1-CaSig, HSF1-CaSig2, HSF1-CaSig3, or a refined HSF1-CSS.
In some aspects, the invention provides a method of diagnosing cancer in a subject comprising: (a) determining a gene expression profile of an HSF1 cancer signature set (HSF1-CSS) or subset thereof in a sample obtained from a subject; and (b) determining whether the sample represents cancer based at least in part on the gene expression profile. In some aspects, the invention provides a method of identifying cancer comprising the steps of: (a) providing a biological sample; and (b) determining a gene expression profile of an HSF1 cancer signature set or subset thereof in the sample; and (c) determining whether the sample represents cancer based at least in part on the gene expression profile. In some embodiments, a method of diagnosing cancer or identifying cancer comprises determining whether the gene expression profile clusters with gene expression profiles representative of cancer or whether the gene expression profile clusters with gene expression profiles representative of non-cancer. In some embodiments the method comprises determining whether expression of the HSF1-CSS falls into a high or low expression subset, wherein high expression is indicative of cancer.
In some aspects, the invention provides a method of assessing a tumor with respect to aggressiveness, the method comprising: (a) determining a gene expression profile of an HSF1 cancer signature set or subset thereof in a sample obtained from a subject; and (b) determining whether the sample represents an aggressive cancer based at least in part on the gene expression profile, thereby classifying the tumor with respect to aggressiveness. In some embodiments the level of HSF1-CSS expression is compared with a control. In some embodiments an increased level of HSF1-CSS expression as compared with a control is indicative of increased aggressiveness. In some embodiments, the method comprises determining whether the gene expression profile clusters with gene expression profiles representative of aggressive cancer or whether the gene expression profile clusters with gene expression profiles representative of non-aggressive cancer or non-cancer. In some embodiments the method comprises determining whether expression of the HSF1-CSS falls into a high or low expression subset, wherein high expression is indicative of aggressive cancer.
In some aspects, the invention provides a method of classifying a tumor according to predicted outcome comprising steps of: (a) determining a gene expression profile of an HSF1 cancer signature set or subset thereof in a sample obtained from a subject; and (b) classifying the tumor with respect to predicted outcome based at least in part on the gene expression profile. In some embodiments the level of HSF1-CSS expression is compared with a control. In some embodiments an increased level of HSF1-CSS expression as compared with a control is indicative of increased likelihood of poor outcome. In some aspects, the invention provides a method for providing prognostic information relating to a tumor, the method comprising: (a) determining a gene expression profile of an HSF1 cancer signature set or subset thereof in a tumor sample obtained from a subject in need of tumor prognosis; and (b) determining a prognosis based at least in part on the gene expression profile. In some embodiments the level of HSF1-CSS expression is compared with a control. In some embodiments an increased level of HSF1-CSS expression as compared with a control is indicative of a poor prognosis. In some embodiments the level of HSF1-CSS expression is compared with a control. In some embodiments an increased level of HSF1-CSS expression as compared with a control is indicative of increased likelihood of poor outcome, or poor prognosis. In some embodiments, the method comprises determining whether the gene expression profile clusters with gene expression profiles representative of cancers with a poor outcome, or poor prognosis or whether the gene expression profile clusters with gene expression profiles representative of cancers with a good outcome, or good prognosis. In some embodiments the method comprises determining whether expression of the HSF1-CSS genes falls into a high or low expression subset, wherein high expression is indicative of cancer with an increased likelihood of poor outcome (poor prognosis).
In some aspects, the invention provides a method for providing treatment-specific predictive information relating to a tumor, comprising: (a) determining a gene expression profile of an HSF1 cancer signature set or subset thereof in a tumor sample from a subject in need of treatment-specific predictive information for a tumor, wherein the gene expression profile correlates with tumor sensitivity or resistance to a treatment, thereby providing treatment-specific predictive information. In some embodiments, the method comprises determining whether the gene expression profile clusters with gene expression profiles representative of cancers that are sensitive or resistant to a treatment.
In some aspects, the invention provides a method for tumor diagnosis, prognosis, treatment-specific prediction, or treatment selection comprising: (a) providing a sample obtained from a subject in need of diagnosis, prognosis, treatment-specific prediction, or treatment selection for a tumor; (b) determining a gene expression profile of an HSF1 cancer signature set or subset thereof in in the sample; (c) scoring the sample based on the gene expression profile, wherein the score provides diagnostic, prognostic, treatment-specific predictive, or treatment selection information. In some embodiments, the method comprises determining whether the gene expression profile clusters with gene expression profiles representative of cancers having a selected prognosis, outcome, or likelihood of treatment response. In some embodiments the method comprises determining whether expression of the HSF1-CSS falls into a high or low expression subset.
In some aspects, the invention provides a method of predicting the likelihood that a tumor will be sensitive to a protein homeostasis modulator, the method comprising: (a) determining a gene expression profile of an HSF1 cancer signature set or subset thereof in a tumor sample obtained from a subject in need of treatment for cancer; and (b) predicting the likelihood that a tumor will be sensitive to a protein homeostasis modulator based at least in part on the gene expression profile. In some embodiments the level of HSF1-CSS expression is compared with a control. In some embodiments an increased level of HSF1-CSS expression as compared with a control is indicative that the tumor has an increased likelihood of being sensitive to the protein homeostasis modulator. In some aspects, the invention provides a method of determining whether a subject with a tumor is a suitable candidate for treatment with a proteostasis modulator, comprising (a) determining a gene expression profile of an HSF1 cancer signature set or subset thereof in a tumor sample obtained from a subject in need of treatment for cancer; and (b) predicting the likelihood that a tumor will be sensitive to a proteostasis modulator based at least in part on the gene expression profile, wherein if the tumor is likely to be sensitive to the proteostasis modulator, the subject is a suitable candidate for treatment with the proteostasis modulator. In some embodiments the level of HSF1-CSS expression is compared with a control. In some embodiments an increased level of HSF1-CSS expression as compared with a control is indicative that the subject is a suitable candidate for treatment with a proteostasis modulator.
In some embodiments a gene expression profile comprises a measurement of expression of at least 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, 450 or at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 99%, or all HSF1-CP genes, Group A genes, Group B genes, HSF1-CSS genes, HSF1-CaSig2 genes, HSF1-CaSig3 genes, refined HSF1-CSS genes, Module 1 genes, Module 2 genes, Module 3 genes, Module 4 genes, or Module 5 genes. In some embodiments a gene expression profile comprises a measurement of expression of at least 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50, 60, 70, 80, 90, or 100 HSF1-CP gene whose expression is increased by at least 1.2-fold in cancer cells as compared with non-transformed control cells not subjected to heat shock. In some embodiments an HSF1 cancer signature set is HSF1-CaSig, HSF1-CaSig2, HSF1-CaSig3 gene, or a refined HSF1-CSS. In some embodiments an HSF1 cancer signature set comprises or is composed of genes listed in Table T4C, Table T4D, Table T4E, or Table T4F. In some embodiments at least 70%, 80%, 90%, 95%, or more (e.g., 100%) of the genes in an HSF1-CSS or subset thereof are positively regulated by HSF1 in cancer cells. In some embodiments expression of at least 70%, 80%, 90%, 95%, or more (e.g., 100%) of the genes in an HSF1-CSS are positively correlated with poor prognosis. In some embodiments, expression of a gene is positively weighted if its expression is positively correlated with an outcome or characteristic of interest (e.g., poor prognosis) and negatively weighted if its expression is negatively correlated with an outcome or characteristic of interest. In some embodiments, expression of a gene is positively weighted if its regulation by HSF1 is positively correlated with an outcome or characteristic of interest (e.g., poor prognosis) and negatively weighted if its regulation by HSF1 is negatively correlated with an outcome or characteristic of interest.
In some aspects, the invention provides a method of identifying a candidate modulator of HSF1 cancer-related activity, the method comprising: (a) providing a cell comprising a nucleic acid construct comprising (i) at least a portion of a regulatory region of an HSF1-CP gene operably linked to a nucleic acid sequence encoding a reporter molecule, wherein the HSF1-CP gene is an HSF1-CP Group A gene, Module 1 gene, Module 2 gene, Module 3 gene, Module 4 gene, Module 5 gene, HSF1-CaSig2 gene, HSF1-CaSig3 gene, refined HSF1-CSS gene, or HSF1-CSS gene that is more highly bound by HSF in cancer cells than in heat shocked non-transformed cells; (b) contacting the cell with a test agent; and (c) assessing expression of the nucleic acid sequence encoding the reporter molecule, wherein the test agent is identified as a candidate modulator of HSF1 cancer-related activity if expression of the nucleic acid sequence encoding the reporter molecule differs from a control level. In some embodiments the cell is a cancer cell. In some embodiments assessing expression of the nucleic acid sequence encoding comprises measuring the level or activity of the reporter molecule. In some embodiments the portion of a regulatory region comprises a HSE and a YY1 element. In some embodiments the portion of a regulatory region comprises a YY1 binding site and a HSE comprising exactly 3 inverted repeat units. In some embodiments the test agent is identified as a candidate inhibitor of HSF1 cancer-related activity if expression of the nucleic acid sequence encoding the reporter molecule is reduced as compared with the control level. In some embodiments the method further comprises assessing the effect of the test agent on expression of one or more HSF1-CP genes. In some embodiments the method further comprises assessing the effect of the test agent on a gene expression profile of an HSF1 cancer signature set or subset thereof. In some embodiments, if the test agent modulates expression of the one or more HSF1-CP genes or HSF1 cancer signature set, the test agent is confirmed as a candidate modulator of HSF1 cancer-related activity.
In some aspects, the invention provides a method of identifying a candidate modulator of HSF1 cancer-related activity comprising steps of: (a) contacting a cell that expresses HSF1 with a test agent; (b) measuring the level of an HSF1 cancer-related activity exhibited by the cell; and (c) determining whether the test agent modulates the HSF1 cancer-related activity, wherein a difference in the level of the HSF1 cancer-related activity in the presence of the test agent as compared to the level in the absence of the test agent identifies the agent as a candidate modulator of HSF1 cancer-related activity. In some embodiments measuring the level of an HSF cancer-related activity comprises measuring binding of HSF1 to a regulatory region of an HSF1-CP gene, Group A gene, HSF1-CSS gene, HSF1-CaSig2 gene, HSF1-CaSig3 gene, refined HSF1-CSS gene, Module 1 gene, Module 2 gene, Module 3 gene, Module 4 gene, or Module 5 gene or measuring expression of an HSF1-CP gene, Group A gene, Group B gene, HSF1-CSS gene, refined HSF1-CSS gene, Module 1 gene, Module 2 gene, Module 3 gene, Module 4 gene, or Module 5 gene, wherein the gene is more highly bound by HSF1 in cancer cells than in heat shocked non-transformed control cells. In some embodiments measuring the level of an HSF cancer-related activity comprises measuring binding of HSF1 to the regulatory regions of at least 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, 450 or at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 99%, or all HSF1-CP genes, Group A genes, HSF1-CSS genes, HSF1-CaSig2 genes, HSF1-CaSig3 genes, refined HSF1-CSS genes, Module 1 genes, Module 2 genes, Module 3 genes, Module 4 genes, or Module 5 genes or measuring expression of at least 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, 450 or at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 99%, or all HSF1-CP genes, Group A genes, Group B genes, HSF1-CSS genes, HSF1-CaSig2 genes, HSF1-CaSig3 genes, refined HSF1-CSS genes, Module 1 genes, Module 2 genes, Module 3 genes, Module 4 genes, or Module 5 genes, wherein at least one of the genes is more highly bound by HSF1 in cancer cells than in heat shocked non-transformed control cells.
In some aspects, the invention provides a method of identifying a candidate modulator of HSF1 cancer-related activity, the method comprising: (a) providing a cell comprising a nucleic acid construct comprising (i) at least a portion of a regulatory region of an HSF1-CP gene operably linked to a nucleic acid sequence encoding a reporter molecule, wherein the HSF1-CP gene is an HSF1-CP Group A gene, Module 1 gene, Module 2 gene, Module 3 gene, Module 4 gene, Module 5 gene, HSF1-CaSig2 gene, HSF1-CaSig3 gene, refined HSF1-CSS gene, or HSF1-CSS gene that is more highly bound by HSF1 in cancer cells than in heat shocked non-transformed cells; (b) contacting the cell with a test agent; and (c) assessing expression of the nucleic acid sequence encoding the reporter molecule, wherein the test agent is identified as a candidate modulator of HSF1 cancer-related activity if expression of the nucleic acid sequence encoding the reporter molecule differs from a control level.
In some aspects, the invention provides an isolated nucleic acid comprising at least one YY1 binding site and a heat shock element (HSE). In some embodiments the invention provides a nucleic acid construct comprising the isolated nucleic acid and a sequence encoding a reporter molecule. In some embodiments the sequence of an isolated nucleic acid comprises at least a portion of a regulatory region of a Group A gene, Module 1 gene, Module 2 gene, Module 3 gene, Module 4 gene, Module 5 gene, HSF1-CaSig2 gene, HSF1-CaSig3 gene, refined HSF1-CSS gene, or HSF1-CSS gene that is more highly bound by HSF1 in cancer cells than in heat shocked non-transformed control cells. Further provided are vectors and cells comprising the isolated nucleic acid or nucleic acid construct. Further provided are methods of using the isolated nucleic acid, nucleic acid construct, vector, or cell, e.g., in identification of candidate modulators of HSF1 cancer-related activity.
In some embodiments of any aspect herein, a tumor is a breast, lung, colon, prostate, pancreas, cervical, or nerve sheath tumor. In some embodiments a tumor is breast, lung, or colon tumor. In some embodiments a tumor is a breast tumor. In some embodiments a tumor is an estrogen receptor (ER) positive breast tumor. In some embodiments a tumor is a human epidermal growth factor 2 (HER2) positive breast tumor. In some embodiments a tumor is a lymph node negative breast tumor. In some embodiments a tumor is an estrogen receptor (ER) positive, lymph node negative breast tumor.
In various embodiments of the methods described herein, a control sample can comprise normal non-neoplastic cells or tissue, e.g., normal non-neoplastic cells or tissue of the same type or origin as that from which a tumor arose. In various embodiments of the methods described herein, a control level of HSF1 expression or HSF1 activation can be a level measured in normal non-neoplastic cells or tissue, e.g., normal non-neoplastic cells or tissue of the same type or origin as that from which a tumor arose, e.g., as measured under conditions that do not activate the heat shock response.
In some embodiments, any of the methods can comprise providing a sample, e.g., a tumor sample. In some embodiments, any of the method can comprise providing a subject, e.g., a subject in need of tumor diagnosis, prognosis, or treatment selection.
In some embodiments, any of the methods can further comprise assessing at least one additional cancer biomarker. The at least one additional cancer biomarker is typically a gene or gene product (e.g., mRNA or protein) whose expression, activation, localization, or activity, correlates with the presence or absence of cancer, with cancer aggressiveness, with cancer outcome, cancer prognosis, or treatment-specific cancer outcome. The cancer biomarker(s) can be selected, e.g., at least in part based on the tumor type.
In some embodiments, any of the methods can further comprise selecting or administering a therapeutic agent based at least in part on results of assessing the level of HSF1 expression or HSF1 activation. In some aspects, the invention provides a method comprising selecting or administering a treatment to a subject in need of treatment for a tumor, wherein the treatment is selected based at least in part on an assessment of the level of HSF1 expression or HSF1 activation in a sample obtained from the tumor. In some embodiments, a method comprises selecting or administering an appropriate therapy if CIS is detected. For example, the therapy can comprise surgical removal of the CIS. In some embodiments a method comprises selecting or administering a more aggressive therapy if a tumor (or sample obtained therefrom) is classified as having an increased likelihood of being aggressive, if a tumor or subject is classified as having an increased likelihood of having a poor outcome, or if a subject is classified as having a poor prognosis. For example, in some embodiments a method comprises selecting or administering adjuvant therapy (e.g., adjuvant chemotherapy) if a tumor (or sample obtained therefrom) is classified as having an increased likelihood of being aggressive, if a tumor or subject is classified as having an increased likelihood of having a poor outcome, or if a subject is classified as having a poor prognosis. In some embodiments a method comprises selecting or administering a proteostasis modulator if the level of HSF1 expression or the level of HSF1 activation is increased.
In some aspects, the invention provides a kit that comprises at least one agent of use to measure the level of HSF1 expression or HSF1 activation in a sample, e.g., an agent that specifically binds to an HSF1 gene product (e.g., HSF1 mRNA or HSF1 protein). The agent may be, e.g., an antibody, or a nucleic acid. In some embodiments the agent is validated for use in assessing HSF1 expression or HSF1 activation, in that results of an assay using the agent have been shown to correlate with cancer outcome, prognosis, or treatment efficacy of at least one specific treatment. In some embodiments the agent is an antibody useful for performing IHC. In some embodiments the kit comprises a reporter construct suitable for assessing HSF1 cancer-related transcription. In some embodiments the kit comprises a cell comprising a reporter construct suitable for assessing HSF1 cancer-related transcription. In some aspects, the invention provides a kit or collection comprising reagents suitable for assessing expression of at least 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, 450 or at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 99%, or all HSF1-CP genes, Group A genes, Group B genes, HSF1-CSS genes, HSF1-CaSig2 genes, HSF1-CaSig3 genes, refined HSF1-CSS genes, Module 1 genes, Module 2 genes, Module 3 genes, Module 4 genes, or Module 5 genes.
Certain conventional techniques and concepts of cell biology, cell culture, molecular biology, microbiology, recombinant nucleic acid (e.g., DNA) technology, immunology, etc., which are within the skill and knowledge of those of ordinary skill in the art, may be of use in aspects of the invention. Non-limiting descriptions of certain of these techniques are found in the following publications: Ausubel, F., et al., (eds.), Current Protocols in Molecular Biology, Current Protocols in Immunology, Current Protocols in Protein Science, and Current Protocols in Cell Biology, all John Wiley & Sons, N.Y., editions as of 2008; Sambrook, Russell, and Sambrook, Molecular Cloning: A Laboratory Manual, 3rd ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 2001; Harlow, E. and Lane, D., Antibodies—A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 1988; Burns, R., Immunochemical Protocols (Methods in Molecular Biology) Humana Press; 3rd ed., 2005; Buchwalow, I, and Böcker, W. (2010) Immunohistochemistry: Basics and Methods, Methods in Molecular Medicine, Springer) Lodish H, et al. (2007). Molecular cell biology (6th ed.). New York: W.H. Freeman and CO. Further information on cancer and treatment thereof may be found in Cancer: Principles and Practice of Oncology (V. T. De Vita et al., eds., J. B. Lippincott Company, 8th ed., 2008 or 9th ed., 2011) and Weinberg, R A, The Biology of Cancer, Garland Science, 2006. All patents, patent applications, books, journal articles, databases, websites, and other publications mentioned herein are incorporated herein by reference in their entirety. In the event of a conflict or inconsistency with the specification, the specification shall control. Applicants reserve the right to amend the specification based on any of the incorporated references and/or to correct obvious errors. None of the content of the incorporated references shall limit the invention.
For convenience, certain terms employed herein are collected below. It should be understood that any description of a term or concept below or elsewhere herein may be applied wherever such term or concept appears herein.
The term “antibody” refers to an immunoglobulin, whether natural or wholly or partially synthetically produced. An antibody may be a member of any immunoglobulin class, including any of the mammalian, e.g., human, classes: IgG, IgM, IgA, IgD, and IgE, or subclasses thereof, and may be an antibody fragment, in various embodiments of the invention. An antibody can originate from any of a variety of vertebrate (e.g., mammalian or avian) organisms, e.g., mouse, rat, rabbit, hamster, goat, chicken, human, etc. As used herein, the term “antibody fragment” refers to a derivative of an antibody which contains less than a complete antibody. In general, an antibody fragment retains at least a significant portion of the full-length antibody's specific binding ability. Examples of antibody fragments include, but are not limited to, Fab, Fab′, F(ab′)2, scFv, Fv, dsFv diabody, Fd fragments, and domain antibodies. Standard methods of antibody identification and production known in the art can be used to produce an antibody that binds to a polypeptide of interest. In some embodiments, an antibody is a monoclonal antibody. Monoclonal antibodies can be identified and produced, e.g., using hybridoma technology or recombinant nucleic acid technology (e.g., phage or yeast display). In some embodiments, an antibody is a chimeric or humanized or fully human antibody. In some embodiments, an antibody is a polyclonal antibody. In some embodiments an antibody is affinity purified. It will be appreciated that certain antibodies, e.g., recombinantly produced antibodies, can comprise a heterologous sequence not derived from naturally occurring antibodies, such as an epitope tags. In some embodiments an antibody further has a detectable label attached (e.g., covalently attached) thereto (e.g., the label can comprise a radioisotope, fluorescent compound, enzyme, hapten).
“Cancer” is generally used interchangeably with “tumor” herein and encompasses pre-invasive and invasive neoplastic growths comprising abnormally proliferating cells, including malignant solid tumors (carcinomas, sarcomas) and including hematologic malignancies such as leukemias in which there may be no detectable solid tumor mass. As used herein, the term cancer includes, but is not limited to, the following types of cancer: breast cancer; biliary tract cancer; bladder cancer; brain cancer (e.g., glioblastomas, medulloblastomas); cervical cancer; choriocarcinoma; colon cancer; endometrial cancer; esophageal cancer; gastric cancer; hematological neoplasms including acute lymphocytic leukemia and acute myelogenous leukemia; T-cell acute lymphoblastic leukemia/lymphoma; hairy cell leukemia; chronic lymphocytic leukemia, chronic myelogenous leukemia, multiple myeloma; adult T-cell leukemia/lymphoma; intraepithelial neoplasms including Bowen's disease and Paget's disease; liver cancer; lung cancer; lymphomas including Hodgkin's disease and lymphocytic lymphomas; neuroblastoma; melanoma, oral cancer such as oral squamous cell carcinoma; ovarian cancer including ovarian cancer arising from epithelial cells, stromal cells, germ cells and mesenchymal cells; pancreatic cancer; prostate cancer, rectal cancer; sarcomas including angiosarcoma, gastrointestinal stromal tumors, leiomyosarcoma, rhabdomyosarcoma, liposarcoma, fibrosarcoma, and osteosarcoma; renal cancer including renal cell carcinoma and Wilms tumor; skin cancer including basal cell carcinoma and squamous cell cancer; testicular cancer including germinal tumors such as seminoma, non-seminoma (teratomas, choriocarcinomas), stromal tumors, and germ cell tumors; thyroid cancer including thyroid adenocarcinoma and medullary carcinoma. “Carcinoma” as used herein, refers to a cancer arising or believed to have arisen from epithelial cells, e.g., cells of the cancer possess various molecular, cellular, and/or histological characteristics typical of epithelial cells. “Cancer in situ” (CIS) refers to cancers in which neoplastic cells are present at a location, e.g., as a tumor, but have not detectably invaded beyond the original site where they were discovered, e.g., cancer cells have not detectably passed through the basal lamina. It will be appreciated that a CIS may have undergone some local spread at the time of discovery. In many embodiments a CIS is a tumor that would be classified as Stage 0, e.g., TisN0M0 or TaN0M0 according to the TNM Classification of Malignant Tumours (TNM) (Sobin L H, et al., eds. TNM Classification of Malignant Tumors, 7th ed. Wiley-Blackwell, Oxford 2009). In some embodiments, a CIS is a bladder cancer, breast cancer (e.g., ductal carcinoma in situ of the breast (DCIS)), cervical cancer (in which case the term high grade squamous epithelial lesion (HSIL) may be used instead of CIS), colon cancer, lung cancer (e.g., bronchioloalveolar carcinoma (BAC)), high grade prostatic intraepithelial neoplasia, or skin cancer.
The term “diagnostic method” generally refers to a method that provides information regarding the identity of a disease or condition that affects a subject or whether a subject is suffering from a disease or disorder of interest, such as cancer. For example, a diagnostic method may determine that a subject is suffering from a disease or condition of interest or may identify a disease or condition that affects a subject or may identify a subject suffering from a disease or condition of interest.
“Modulator” refers to an agent or condition that alters, e.g., inhibits (reduces, decreases) or enhances (activates, stimulates, increases), a process, pathway, phenomenon, state, or activity. For example, a modulator of protein activity may increase or decrease the level of one or more activit(ies) of a protein.
The term “prognostic method”, generally refers to a method that provides information regarding the likely course or outcome of a disease regardless of treatment or across treatments (e.g., after adjusting for treatment variables or assuming that a subject receives standard of care treatment). For example, a prognostic method may comprise classifying a subject or sample obtained from a subject into one of multiple categories, wherein the categories correlate with different likelihoods that a subject will experience a particular outcome. For example, categories can be low risk and high risk, wherein subjects in the low risk category have a lower likelihood of experiencing a poor outcome (e.g., within a given time period such as 5 years or 10 years) than do subjects in the high risk category. A poor outcome could be, for example, disease progression, disease recurrence, or death attributable to the disease.
The term “treatment-specific predictive method” generally refers to a method that provides information regarding the likely effect of a specified treatment, e.g., that can be used to predict whether a subject is likely to benefit from the treatment or to predict which subjects in a group will be likely or most likely to benefit from the treatment. It will be understood that a treatment-specific predictive method may be specific to a single treatment or to a class of treatments (e.g., a class of treatments having the same or a similar mechanism of action or that act on the same biological process, pathway or molecular target, etc.). A treatment-specific predictive method may comprise classifying a subject or sample obtained from a subject into one of multiple categories, wherein the categories correlate with different likelihoods that a subject will benefit from a specified treatment. For example, categories can be low likelihood and high likelihood, wherein subjects in the low likelihood category have a lower likelihood of benefiting from the treatment than do subjects in the high likelihood category. In some embodiments, a benefit is increased survival, increased progression-free survival, or decreased likelihood of recurrence. In some embodiments, a “suitable candidate for treatment” with a specified agent refers to a subject for whom there is a reasonable likelihood that the subject would benefit from administration of the agent, e.g., the tumor has one or more characteristics that correlate with a beneficial effect resulting from administration of the agent as compared with, e.g., no treatment or as compared with a standard treatment. In some embodiments, a “suitable candidate for treatment” with an agent refers to a subject for whom there is a reasonable likelihood that the subject would benefit from administration of the agent in combination with (i.e., in addition to) one or more other therapeutic interventions, e.g., the tumor has one or more characteristics that correlate with a beneficial effect from treatment with the agent and the other therapeutic interventions as compared with treatment with the other therapeutic interventions only. In some embodiments, a suitable candidate for treatment with an agent is a subject for whom there is a reasonable likelihood that the subject would benefit from addition of the agent to a standard regimen for treatment of cancer. See, e.g., De Vita, et al., supra for non-limiting discussion of standard regimens for treatment of cancer.
“Expression” refers to the cellular processes involved in producing RNA and protein such as, but not limited to, transcription, RNA processing, and translation.
As used herein, the term “gene product” (also referred to as a “gene expression product”) encompasses products resulting from expression of a gene, such as RNA transcribed from a gene and polypeptides arising from translation of mRNA. RNA transcribed from a gene can be non-coding RNA or coding RNA (e.g., mRNA). It will be appreciated that gene products may undergo processing or modification by a cell. For example, RNA transcripts may be spliced, polyadenylated, etc., prior to mRNA translation, and/or polypeptides may undergo co-translational or post-translational processing such as removal of secretion signal sequences or modifications such as phosphorylation, fatty acylation, etc. The term “gene product” encompasses such processed or modified forms. Genomic, mRNA, polypeptide sequences from a variety of species, including human, are known in the art and are available in publicly accessible databases such as those available at the National Center for Biotechnology Information (www.ncbi.nih.gov) or Universal Protein Resource (www.uniprot.org). Exemplary databases include, e.g., GenBank, RefSeq, Gene, UniProtKB/SwissProt, UniProtKB/Trembl, and the like. In general, sequences, e.g., mRNA and polypeptide sequences, in the NCBI Reference Sequence database may be used as gene product sequences for a gene of interest. It will be appreciated that multiple alleles of a gene may exist among individuals of the same species due to natural allelic variation. For example, differences in one or more nucleotides (e.g., up to about 1%, 2%, 3-5% of the nucleotides) of the nucleic acids encoding a particular protein may exist among individuals of a given species. Due to the degeneracy of the genetic code, such variations frequently do not alter the encoded amino acid sequence, although DNA polymorphisms that lead to changes in the amino acid sequences of the encoded proteins can exist. It will also be understood that multiple isoforms of certain proteins encoded by the same gene may exist as a result of alternative RNA splicing or editing. Examples of polymorphic variants can be found in, e.g., the Single Nucleotide Polymorphism Database (dbSNP) (available at the NCBI website at www.ncbi.nlm.nih.gov/projects/SNP/), which contains single nucleotide polymorphisms (SNPs) as well as other types of variations (see, e.g., Sherry S T, et al. (2001). “dbSNP: the NCBI database of genetic variation”. Nucleic Acids Res. 29 (1): 308-311; Kitts A, and Sherry S, (2009). The single nucleotide polymorphism database (dbSNP) of nucleotide sequence variation in The NCBI Handbook [Internet]. McEntyre J, Ostell J, editors. Bethesda (Md.): National Center for Biotechnology Information (US); 2002 (www.ncbi.nlm.nih.gov/bookshelf/br.fcgi?book=handbook&part=ch5). In general, where aspects of the invention relate to a gene or gene product it should be understood that embodiments relating to such isoforms or allelic variants are encompassed unless indicated otherwise. For example, in general, allelic variants and most isoforms would be detectable using the same reagents (e.g., antibodies, probes, etc.) and methods. Certain embodiments may be directed to a particular sequence or sequences, e.g., a particular allele or isoform. One of ordinary skill in the art could readily develop reagents and methods that could distinguish between different isoforms or allelic variants or could verify that particular isoform(s) or allelic variant(s) are detected by a particular detection method or reagent.
“Isolated”, in general, means 1) separated from at least some of the components with which it is usually associated in nature; 2) prepared or purified by a process that involves the hand of man; and/or 3) not occurring in nature, e.g., present in an artificial environment.
“Nucleic acid” is used interchangeably with “polynucleotide” and encompasses in various embodiments naturally occurring polymers of nucleosides, such as DNA and RNA, and non-naturally occurring polymers of nucleosides or nucleoside analogs. In some embodiments a nucleic acid comprises standard nucleosides (abbreviated A, G, C, T, U). In other embodiments a nucleic acid comprises one or more non-standard nucleosides. In some embodiments, one or more nucleosides are non-naturally occurring nucleosides or nucleotide analogs. A nucleic acid can comprise modified bases (for example, methylated bases), modified sugars (2′-fluororibose, arabinose, or hexose), modified phosphate groups or other linkages between nucleosides or nucleoside analogs (for example, phosphorothioates or 5′-N-phosphoramidite linkages), locked nucleic acids, or morpholinos, in various embodiments. In some embodiments, a nucleic acid comprises nucleosides that are linked by phosphodiester bonds, as in DNA and RNA. In some embodiments, at least some nucleosides are linked by non-phosphodiester bond(s). A nucleic acid can be single-stranded, double-stranded, or partially double-stranded. An at least partially double-stranded nucleic acid can have one or more overhangs, e.g., 5′ and/or 3′ overhang(s). Nucleic acid modifications (e.g., nucleoside and/or backbone modifications, including use of non-standard nucleosides) known in the art as being useful in the context of RNA interference (RNAi), aptamer, antisense, primer, or probe molecules may be used in various embodiments of the invention. See, e.g., Crooke, S T (ed.) Antisense drug technology: principles, strategies, and applications, Boca Raton: CRC Press, 2008; Kurreck, J. (ed.) Therapeutic oligonucleotides, RSC biomolecular sciences. Cambridge: Royal Society of Chemistry, 2008. In some embodiments, a modification increases half-life and/or stability of a nucleic acid, e.g., relative to RNA or DNA of the same length and strandedness. A nucleic acid may comprise a detectable label, e.g., a fluorescent dye, radioactive atom, etc. “Oligonucleotide” refers to a relatively short nucleic acid, e.g., typically between about 4 and about 100 nucleotides long. Where reference is made herein to a polynucleotide, it is understood that both DNA, RNA, and in each case both single- and double-stranded forms (and complements of each single-stranded molecule) are provided. “Polynucleotide sequence” as used herein can refer to the polynucleotide material itself and/or to the sequence information (i.e. the succession of letters used as abbreviations for bases) that biochemically characterizes a specific nucleic acid. A polynucleotide sequence, if presented herein, is presented in a 5′ to 3′ direction unless otherwise indicated.
“Polypeptide” refers to a polymer of amino acids. The terms “protein” and “polypeptide” are used interchangeably herein. A peptide is a relatively short polypeptide, typically between about 2 and 100 amino acids in length. Polypeptides used herein typically contain the standard amino acids (i.e., the 20 L-amino acids that are most commonly found in proteins). However, a polypeptide can contain one or more non-standard amino acids (which may be naturally occurring or non-naturally occurring) and/or amino acid analogs known in the art in certain embodiments. One or more of the amino acids in a polypeptide may be modified, for example, by the addition of a chemical entity thereto. Exemplary modifications include phosphorylation, glycosylation, SUMOylation, acetylation, methylation, acylation, etc. In some embodiments, a polypeptide is modified by attachment of a linker useful for conjugating the polypeptide to or with another entity. Polypeptides may be present in or purified from natural sources, produced using recombinant DNA technology, synthesized through chemical means such as conventional solid phase peptide synthesis, etc. The term “polypeptide sequence” or “amino acid sequence” as used herein can refer to the polypeptide material itself and/or to the sequence information (i.e., the succession of letters or three letter codes used as abbreviations for amino acid names) that biochemically characterizes a polypeptide. A polypeptide sequence, if presented herein, is presented in an N-terminal to C-terminal direction unless otherwise indicated.
A “sample” as used herein can be any biological specimen that contains cells, tissue, or cellular material (e.g., cell lysate or fraction thereof). Typically, a sample is obtained from (i.e., originates from, was initially removed from) a subject. Methods of obtaining such samples are known in the art and include, e.g., tissue biopsy such as excisional biopsy, incisional biopsy, or core biopsy; fine needle aspiration biopsy; brushings; lavage; or collecting body fluids such as blood, sputum, lymph, mucus, saliva, urine, etc., etc. In many embodiments, a sample contains at least some intact cells at the time it is removed from a subject and, in many embodiments, the sample retains at least some of the tissue microarchitecture. In many embodiments a sample will have been obtained from a tumor either prior to or after removal of the tumor from a subject. A sample may be subjected to one or more processing steps after having been obtained from a subject and/or may be split into one or more portions, which may entail removing or discarding part of the original sample. It will be understood that the term “sample” encompasses such processed samples, portions of samples, etc., and such samples are still considered to have been obtained from the subject from whom the initial sample was removed. In many embodiments, a sample is obtained from an individual who has been diagnosed with cancer or is at increased risk of cancer, is suspected of having cancer, or is at risk of cancer recurrence. A sample used in a method of the present invention may have been procured directly from a subject, or indirectly by receiving the sample from one or more persons who procured the sample directly from the subject, e.g., by performing a biopsy or other procedure on the subject. A “tumor sample” is a sample that includes at least some cells, tissue, or cellular material obtained from a tumor. In general, a “sample” as used herein is typically a tumor sample or a sample obtained from tissue being evaluated for presence of a tumor.
The term “small molecule” refers to an organic molecule that is less than about 2 kilodaltons (kDa) in mass. In some embodiments, the small molecule is less than about 1.5 kDa, or less than about 1 kDa. In some embodiments, the small molecule is less than about 800 daltons (Da), 600 Da, 500 Da, 400 Da, 300 Da, 200 Da, or 100 Da. Often, a small molecule has a mass of at least 50 Da. In some embodiments, a small molecule contains multiple carbon-carbon bonds and can comprise one or more heteroatoms and/or one or more functional groups important for structural interaction with proteins (e.g., hydrogen bonding), e.g., an amine, carbonyl, hydroxyl, or carboxyl group, and in some embodiments at least two functional groups. Small molecules often comprise one or more cyclic carbon or heterocyclic structures and/or aromatic or polyaromatic structures, optionally substituted with one or more of the above functional groups. In some embodiments a small molecule is an artificial (non-naturally occurring) molecule. In some embodiments, a small molecule is non-polymeric. In some embodiments, a small molecule is not an amino acid. In some embodiments, a small molecule is not a nucleotide. In some embodiments, a small molecule is not a saccharide. In some embodiments, the term “small molecule” excludes molecules that are ingredients found in standard tissue culture medium.
“Specific binding” generally refers to a physical association between a target molecule or complex (e.g., a polypeptide) and a binding agent such as an antibody or ligand. The association is typically dependent upon the presence of a particular structural feature of the target such as an antigenic determinant, epitope, binding pocket or cleft, recognized by the binding agent. For example, if an antibody is specific for epitope A, the presence of a polypeptide containing epitope A or the presence of free unlabeled A in a reaction containing both free labeled A and the binding molecule that binds thereto, will typically reduce the amount of labeled A that binds to the binding molecule. It is to be understood that specificity need not be absolute but generally refers to the context in which the binding occurs. For example, it is well known in the art that antibodies may in some instances cross-react with other epitopes in addition to those present in the target. Such cross-reactivity may be acceptable depending upon the application for which the antibody is to be used. One of ordinary skill in the art will be able to select antibodies or ligands having a sufficient degree of specificity to perform appropriately in any given application (e.g., for detection of a target molecule such as HSF1). It is also to be understood that specificity may be evaluated in the context of additional factors such as the affinity of the binding agent for the target versus the affinity of the binding agent for other targets, e.g., competitors. If a binding agent exhibits a high affinity for a target molecule that it is desired to detect and low affinity for nontarget molecules, the antibody will likely be an acceptable reagent. Once the specificity of a binding molecule is established in one or more contexts, it may be employed in other contexts, e.g., similar contexts such as similar assays or assay conditions, without necessarily re-evaluating its specificity. In some embodiments specificity of an antibody can be tested by performing an appropriate assay on a sample expected to lack the target (e.g., a sample from cells in which the gene encoding the target has been disabled or effectively inhibited) and showing that the assay does not result in a signal significantly different to background.
“Subject” refers to any individual who has or may have cancer or is at risk of developing cancer or cancer recurrence. The subject is preferably a human or non-human animal, including but not limited to animals such as rodents (e.g., mice, rats, rabbits), cows, pigs, horses, chickens, cats, dogs, primates, etc., and is typically a mammal, and in many embodiments is a human. In some embodiments a subject is female. In some embodiments a subject is male. A subject may be referred to as a “patient”.
“Vector” is used herein to refer to a nucleic acid or a virus or portion thereof (e.g., a viral capsid or genome) capable of mediating entry of, e.g., transferring, transporting, etc., a nucleic acid molecule into a cell. Where the vector is a nucleic acid, the nucleic acid molecule to be transferred is generally linked to, e.g., inserted into, the vector nucleic acid molecule. A nucleic acid vector may include sequences that direct autonomous replication (e.g., an origin of replication), or may include sequences sufficient to allow integration of part or all of the nucleic acid into host cell DNA. Useful nucleic acid vectors include, for example, DNA or RNA plasmids, cosmids, and naturally occurring or modified viral genomes or portions thereof or nucleic acids (DNA or RNA) that can be packaged into viral capsids. Plasmid vectors typically include an origin of replication and one or more selectable markers. Plasmids may include part or all of a viral genome (e.g., a viral promoter, enhancer, processing or packaging signals, etc.). Viruses or portions thereof that can be used to introduce nucleic acid molecules into cells are referred to as viral vectors. Useful viral vectors include adenoviruses, adeno-associated viruses, retroviruses, lentiviruses, vaccinia virus and other poxviruses, herpesviruses (e.g., herpes simplex virus), and others. Viral vectors may or may not contain sufficient viral genetic information for production of infectious virus when introduced into host cells, i.e., viral vectors may be replication-defective, and such replication-defective viral vectors may be preferable for therapeutic use. Where sufficient information is lacking it may, but need not be, supplied by a host cell or by another vector introduced into the cell. The nucleic acid to be transferred may be incorporated into a naturally occurring or modified viral genome or a portion thereof or may be present within the virus or viral capsid as a separate nucleic acid molecule. It will be appreciated that certain plasmid vectors that include part or all of a viral genome, typically including viral genetic information sufficient to direct transcription of a nucleic acid that can be packaged into a viral capsid and/or sufficient to give rise to a nucleic acid that can be integrated into the host cell genome and/or to give rise to infectious virus, are also sometimes referred to in the art as viral vectors. Vectors may contain one or more nucleic acids encoding a marker suitable for use in the identifying and/or selecting cells that have or have not taken up (e.g., been transfected with) or maintain the vector. Markers include, for example, proteins that increase or decrease either resistance or sensitivity to antibiotics (e.g., an antibiotic-resistance gene encoding a protein that confers resistance to an antibiotic such as puromycin, G418, hygromycin or blasticidin) or other compounds, enzymes whose activities are detectable by assays known in the art (e.g., β-galactosidase or alkaline phosphatase), and proteins or RNAs that detectably affect the phenotype of transfected cells (e.g., fluorescent proteins). Expression vectors are vectors that include regulatory sequence(s), e.g., expression control sequences such as a promoter, sufficient to direct transcription of an operably linked nucleic acid. Regulatory sequences may also include enhancer sequences or upstream activator sequences. Vectors may optionally include 5′ leader or signal sequences. Vectors may optionally include cleavage and/or polyadenylation signals and/or a 3′ untranslated regions. Vectors often include one or more appropriately positioned sites for restriction enzymes, to facilitate introduction into the vector of the nucleic acid to be expressed. An expression vector typically comprises sufficient cis-acting elements for expression; other elements required or helpful for expression can be supplied by the cell or in vitro expression system into which the vector is introduced.
Various techniques known in the art may be employed for introducing nucleic acid molecules into cells. Such techniques include chemical-facilitated transfection using compounds such as calcium phosphate, cationic lipids, cationic polymers, liposome-mediated transfection, non-chemical methods such as electroporation, particle bombardment, or microinjection, and infection with a virus that contains the nucleic acid molecule of interest (sometimes termed “transduction”). For purposes of convenience the term “transfection” may be used to refer to any and all such techniques. Markers can be used for the identification and/or selection of cells that have taken up the vector and, typically, express the nucleic acid. Cells can be cultured in appropriate media to select such cells and, optionally, establish a stable cell line, e.g., polyclonal or monoclonal cell line. For example, a stable cell line can be composed of cells that have an exogenous nucleic acid encoding a gene product to be expressed integrated into the genome of the cells or, in some embodiments, present on an episome that is maintained and transmitted with high fidelity to daughter cells during cell division. Methods of generating stable cell lines are well known in the art and include, e.g., transfection, viral infection (e.g., using retroviruses (e.g., lentiviruses), adenoviruses, adeno-associated viruses, herpesviruses, etc.), typically followed by selection of cells that have taken up and stably maintain an introduced nucleic acid or portion thereof. A stable cell line may be polyclonal (descended from a pool of cells that have taken up a vector) or may be monoclonal (descended from a single cell that has taken up a vector).
Selection of appropriate expression control elements may be based at least in part on the cell type and species in which the nucleic acid is to be expressed and/or the purposes for which the vector is to be used. One of ordinary skill in the art can readily select appropriate expression control elements and/or expression vectors. In some embodiments, expression control element(s) are regulatable, e.g., inducible or repressible. Exemplary promoters suitable for use in bacterial cells include, e.g., Lac, Trp, Tac, araBAD (e.g., in a pBAD vectors), phage promoters such as T7 or T3. Exemplary expression control sequences useful for directing expression in mammalian cells include, e.g., the early and late promoters of SV40, adenovirus or cytomegalovirus immediate early promoter, or viral promoter/enhancer sequences, retroviral LTRs, promoters or promoter/enhancers from mammalian genes, e.g., actin, EF-1 alpha, phosphoglycerate kinase, etc. Regulatable (e.g., inducible or repressible) expression systems such as the Tet-On and Tet-Off systems (regulatable by tetracycline and analogs such as doxycycline) and others that can be regulated by small molecules such as hormone receptor ligands (e.g., steroid receptor ligands, which may or may not be steroids), metal-regulated systems (e.g., metallothionein promoter), etc.
HSF1 as a Marker for Cancer Classification
Heat shock factor 1 (HSF1), also known as heat shock transcription factor 1, is a multifaceted transcription factor that governs the cellular response to a variety of disruptions in protein homeostasis, serving as the master transcriptional regulator of the cellular response to heat and various other stressors in mammals. Under normal (non-stressed) conditions, HSF1 is predominantly located in the cytoplasm as a monomer, which is unable to bind DNA. Upon exposure to stressors, HSF1 is activated and translocates to the nucleus, where it regulates gene expression by binding to DNA sequence motifs known as heat-shock elements (HSE) located in the promoter regions of target genes. To protect the proteome under various physiologic or environmental stresses, HSF1 drives the production of classic heat-shock proteins (HSPs) such as HSP27, HSP70 and HSP90 that act as protein chaperones. Among other activities, HSPs facilitate proper protein folding and assembly and help prevent deleterious protein aggregation. This response, termed the heat shock response (HSR), is present in eukaryotes ranging from yeast to humans (1-3).
As described herein, Applicants have discovered that HSF1 expression and activation are increased across a broad range of human tumor types and that increased HSF1 expression and activation in tumors are an indicator of aggressive tumor phenotypes and poor clinical outcome. For example, Applicants observed a striking increase in the levels of HSF1, as well as a shift in its localization from the cytoplasm to the nucleus, in a panel of human breast cancer samples as compared with normal breast tissue. Applicants also found that HSF1 expression and nuclear localization were increased in lung, colon, prostate, cervical carcinomas as well in other tumors including malignant peripheral nerve sheath tumor. Nuclear HSF1 levels were elevated in ˜80% of in situ and invasive breast carcinomas analyzed. In invasive carcinomas, HSF1 expression was associated with high histologic grade, larger tumor size, and nodal involvement at diagnosis. Applicants hypothesized that this increase in nuclear HSF1 might be associated with poor prognosis. To investigate this possibility, Applicants examined the relationship between HSF1, clinicopathological characteristics, and survival outcomes among over 1,800 invasive breast cancer cases from the Nurses' Health Study. They found that increased levels of HSF1 expression and nuclear localization in tumor samples correlated with high histologic grade, larger tumor size, and nodal involvement at diagnosis in invasive breast carcinomas. Increased HSF1 levels and nuclear localization of HSF1 were associated with advanced clinical stage at the time of diagnosis and with increased mortality. The prognostic value of HSF1 protein was retained after adjusting for age, stage, grade, and adjuvant therapy. Thus, HSF1 is an independent prognostic indicator of outcome in breast cancer. Increased HSF1 expression and activation were shown to correlate with decreased overall survival and decreased disease free progression in a group of 70 stage 1 lung cancer patients and with decreased survival in colon cancer patients. Thus, increased HSF1 expression and activation in tumors correlates with aggressive tumor phenotype and worse clinical outcomes.
Without wishing to be bound by any theory, Applicants hypothesized that HSF1 may in part enable more aggressive cancer phenotypes and lead to worse clinical outcomes as a result of HSP elevation, driven by HSF1 responding to the protein folding conditions that are common in malignancies, such as increased protein load from dysregulation of the translation machinery, accumulation of mutated or fusion proteins, and imbalances in the stoichiometry of protein complexes due to aneuploidy. However, Applicants hypothesized that HSF1's role in cancer is much broader. Malignant transformation alters cellular physiology and imposes significant metabolic and genetic stresses in addition to proteomic stresses. HSF1's impact on cell cycle control, survival signaling, and energy metabolism during tumor initiation and progression may allow tumor cells to cope with these malignancy-associated stressors and/or may facilitate progression to invasive cancer and/or emergence of drug resistance by enabling the generation of greater phenotypic diversity. Furthermore, as described herein, Applicants found that HSF1 has a direct and pervasive role in cancer biology. Extending far beyond protein folding and stress, HSF1-bound genes are involved in many facets of tumorigenesis, tumor growth, persistence, progression, and/or response to therapy, including the cell cycle, apoptosis, energy metabolism, and other processes.
In some aspects, the invention provides methods of classifying a sample with respect to cancer diagnosis (e.g., the presence or absence of cancer), cancer aggressiveness, cancer outcome, or cancer treatment selection, based at least in part on assessing the level of HSF1 expression or HSF1 activation in the sample. In some aspects, the invention provides methods of cancer diagnosis, prognosis, or treatment-specific prediction, based at least in part on assessing the level of HSF1 expression or HSF1 activation in a sample, e.g., a tumor sample or suspected tumor sample. In some embodiments, the cancer is an adenocarcinoma. In some embodiments the cancer is a breast, lung, colon, prostate, or cervical cancer, e.g., a breast, lung, colon, prostate, or cervical adenocarcinoma. In some embodiments the tumor is a squamous cell carcinoma. In some embodiments the tumor is not a squamous cell carcinoma. In some embodiments the cancer is a sarcoma. In some embodiments the sarcoma is a nerve sheath tumor, e.g., a peripheral nerve sheath tumor. In some embodiments the nerve sheath tumor is a malignant nerve sheath tumor, e.g., a malignant peripheral nerve sheath tumor. In some embodiments a tumor is a Stage I tumor as defined in the TNM Classification of Malignant Tumours (2009). In some embodiments a tumor is a Stage II tumor as defined in the TNM Classification of Malignant Tumours (2009). It will be understood that results of an assay of HSF1 expression or HSF1 activation may be used in combination with results from other assays, or other information, to provide a sample classification, diagnosis, prognosis, or prediction relating to cancer, cancer outcome, or treatment response. Such combination methods are within the scope of the invention.
In some aspects, the invention relates to methods for classifying a sample according to the level of HSF1 expression (i.e., the level of expression of the HSF1 gene) or according to the level of HSF1 activation in the sample. For purposes hereof, a method that comprises assessing HSF1 expression or assessing HSF1 activation may be referred to as an “HSF1-based method”. A procedure that is used to assess (detect, measure, determine, quantify) HSF1 expression or HSF1 activation may be referred to as an “HSF1-based assay”. It will be understood that either HSF1 expression, HSF1 activation, or both, can be assessed in various embodiments of the invention. Certain assays such as IHC can be used to assess both expression and activation. In general, as described further in the Examples, the level of HSF1 activation detected in tumor samples correlated with the level of HSF1 expression, e.g., samples that exhibited increased nuclear HSF1 levels tended to have increased HSF1 protein expression.
In some embodiments, the level of HSF1 expression is assessed by determining the level of an HSF1 gene product in the sample. Thus in some embodiments, the invention relates to methods for classifying a sample according to the level of an HSF1 gene product in the sample. In some embodiments, the invention provides a method of classifying a sample, the method comprising steps of: (a) providing a sample obtained from a subject; and (b) assessing HSF1 expression in the sample, wherein the level of HSF1 expression is correlated with a phenotypic characteristic, thereby classifying the sample with respect to the phenotypic characteristic. In some embodiments, the invention provides a method of classifying a sample, the method comprising steps of: (a) providing a sample obtained from a subject; and (b) determining the level of an HSF1 gene product in the sample, wherein the level of an HSF1 gene product is correlated with a phenotypic characteristic, thereby classifying the sample with respect to the phenotypic characteristic. In some embodiments the phenotypic characteristic is presence or absence of cancer. In some embodiments, the cancer is invasive cancer. In some embodiments the sample does not show evidence of invasive cancer, and the phenotypic characteristic is presence or absence of pre-invasive cancer (cancer in situ). In some embodiments the phenotypic characteristic is cancer prognosis. In some embodiments the phenotypic characteristic is predicted treatment outcome. In some embodiments the HSF1 gene product is HSF1 mRNA. In some embodiments the HSF1 gene product is HSF1 polypeptide.
In some aspects, the invention provides a method of classifying a sample, the method comprising: (a) determining the level of HSF1 expression or the level of HSF1 activation in a sample; (b) comparing the level of HSF1 expression or HSF1 activation with a control level of HSF1 gene expression or HSF1 activation; and (c) classifying the sample with respect to cancer diagnosis, wherein a greater (increased) level of HSF1 gene expression or HSF1 activation in the sample as compared with the control level of HSF1 expression or HSF activation, respectively, is indicative of the presence of cancer. In some embodiments, a greater level of HSF1 expression or HSF1 activation in the sample is indicative of the presence of in situ cancer in a sample that does not show evidence of invasive cancer. If the level of HSF1 expression or HSF1 activation is not increased (e.g., HSF1 is not detectable or is not significantly greater than present in normal tissue), then cancer is not diagnosed based on HSF1.
In some aspects, the invention provides a method of classifying a sample, the method comprising: (a) determining the level of HSF1 expression or the level of HSF1 activation in a sample obtained from a tumor; (b) comparing the level of HSF1 expression or HSF1 activation with a control level of HSF1 gene expression or HSF1 activation; and (c) classifying the sample with respect to cancer prognosis, wherein a greater level of HSF1 gene expression or HSF activation in the sample obtained from the tumor as compared with the control level of HSF1 gene expression or HSF activation, respectively, is indicative that the sample originated from a tumor that belongs to a poor prognosis class. In some aspects, the invention provides a method of classifying a tumor, the method comprising: (a) determining the level of HSF1 expression or the level of HSF1 activation in a sample obtained from a tumor; (b) comparing the level of HSF1 expression or HSF1 activation with a control level of HSF1 gene expression or HSF1 activation; and (c) classifying the sample with respect to cancer prognosis, wherein a greater level of HSF1 gene expression or HSF activation in the sample obtained from the tumor as compared with the control level of HSF1 gene expression or HSF1 activation, respectively, is indicative that the tumor belongs to a poor prognosis class.
In some aspects, the invention relates to methods for classifying a sample according to the level of HSF1 activation in cells of the sample. As used herein, “HSF1 activation” refers the process in which HSF1 polypeptide is phosphorylated, trimerizes, and translocates to the nucleus, where it binds to DNA sequences and regulates expression of genes containing such sequences (e.g., in their promoter regions) (“HSF1-regulated genes”). In some embodiments, the invention is directed to a method of classifying a sample with respect to a phenotypic characteristic, the method comprising steps of: (a) providing a sample obtained from a subject; and (b) determining the level of activation of HSF1 polypeptide in the sample, wherein the level of activation of an HSF1 polypeptide is correlated with a phenotypic characteristic, thereby classifying the sample with respect to the phenotypic characteristic. In some embodiments the sample does not show evidence of invasive cancer, and the phenotypic characteristic is presence or absence of pre-invasive cancer. In some embodiments the phenotypic characteristic is cancer prognosis. In some embodiments the phenotypic characteristic is predicted treatment outcome. In some embodiments, the level of HSF1 activation is assessed by determining the level of nuclear HSF1 in the sample. Thus in some embodiments the invention relates to methods for classifying a sample according to the level of nuclear HSF1 in the sample. In some embodiments, assessing the level of HSF1 activation comprises assessing HSF1 activity. In some embodiments, assessing the level of HSF1 activity comprises measuring expression of one or more HSF1-regulated genes. In some embodiments assessing the level of HSF1 activity comprises measuring expression of one or more HSF1 cancer program (HSF1-CP) genes. In some embodiments assessing the level of HSF1 activity comprises measuring expression of one or more HSF1-cancer signature set (HSF1-CSS), Group A, Group B, HSF1-CaSig2, HSF1-CaSig3, refined HSF1-CSS, Module 1, Module 2, Module 3, Module 4, or Module 5 genes. HSF1-CP genes, HSF1-CSS genes, Group A, Group B, HSF1-CaSig2, HSF1-CaSig3, refined HSF1-CSS, Module 1, Module 2, Module 3, Module 4, and Module 5 genes are described in further detail elsewhere herein. In some embodiments, assessing the level of HSF1 activity comprises measuring binding of HSF1 to the promoter region of one or more HSF1-regulated genes. In some embodiments assessing the level of HSF1 activity comprises measuring binding of HSF1 to a regulatory region, e.g., a promoter region or a distal regulatory region of one or more HSF1-CP genes, e.g., one or more HSF1-CSS, Group A, Group B, HSF1-CaSig2, HSF1-CaSig3, refined HSF1-CSS, Module 1, Module 2, Module 3, Module 4, or Module 5 genes. In some embodiments “one or more” genes is at least 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, or 450, up to the total number of genes in a set or list of genes. In some embodiments “one or more” genes is at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, or more, up to 100% in a set or list of genes.
In some aspects of the invention, detection of increased HSF expression or activation in a sample is of use for diagnosis of cancer, e.g., for detection of cancer. According to certain of the methods of the invention, samples can be classified as belonging to (i.e., obtained from) an individual who has cancer or is likely to develop cancer. Among other things, the present invention provides the recognition that HSF1 expression in many instances initially becomes elevated during the in situ stage of malignant transformation, prior to invasion. In some aspects of the invention, detection of elevated (increased) HSF expression or activation in a sample is of use for early diagnosis of cancer, e.g., for detection of cancer in situ. According to certain of the methods of the invention, samples can be classified as belonging to (i.e., obtained from) an individual who has cancer in situ (CIS) or is likely to develop CIS or who has CIS and is likely to develop invasive cancer. In some embodiments the sample can be classified as belonging to (i.e., obtained from) an individual who has or is likely to develop ductal carcinoma in situ of the breast (DCIS).
In some embodiments, detection of increased HSF1 expression or activation in a sample indicates that a subject has an increased likelihood of having CIS or developing CIS than would be the case in the absence of increased HSF1 expression or activation. In some embodiments, detection of increased HSF1 expression or activation in a sample is of use to detect a CIS before it becomes detectable on physical examination or, in some embodiments, before it becomes detectable on imaging. In some embodiments, detection of increased HSF1 expression or activation in a sample may be used to help differentiate lesions that are malignant or that have significant potential to become invasive or metastasize from benign lesions. In accordance with certain embodiments of the invention, a lesion has an increased likelihood of being malignant or having significant potential to become invasive or metastasize if increased HSF1 expression or activation is detected in the sample than would be the case if increased HSF1 expression or activation is not detected. Detection of increased HSF1 expression or activation in a sample could, for example, indicate a need for additional or more frequent follow-up of the subject or for treatment of the subject from whom the sample was obtained. In some embodiments, detection of elevated HSF1 expression or activation in a sample is used together with one or more other indicators of dysplasia and/or neoplasia to detect the presence of CIS or to differentiate lesions that are malignant or that have significant potential to become invasive or metastasize from benign lesions. In some embodiments, detection of elevated HSF1 expression may enable classification of a sample that could not be reliably classified (e.g., as high risk or low risk) using standard histopathologic criteria. It will be understood that whether a sample (or tumor from which the sample originated) has an increased level of HSF1 expression or HSF1 activation can be determined by comparing the sample with a suitable control.
In some aspects, the invention provides method of identifying CIS, comprising assessing expression of HSF1 or activation of HSF1 in a tissue or cell sample, wherein the sample does not show evidence of invasive cancer, and wherein increased expression of HSF1 or increased activation of HSF1 in the sample is indicative of CIS. In some aspects, the invention provides a method of predicting the likelihood that a subject will develop invasive cancer, comprising assessing expression of the HSF1 gene or activation of HSF1 in a tissue or cell sample obtained from the subject, wherein increased expression of HSF1 or increased activation of HSF1 in the sample is indicative of an increased likelihood that the subject will develop invasive cancer. In some aspects, the invention provides a method of method of diagnosing CIS in a subject, comprising assessing expression of HSF1 or activation of HSF1 in a tissue or cell sample obtained from the subject, wherein the sample does not show evidence of invasive cancer, and wherein increased expression of HSF1 or increased activation of HSF1 in the sample indicates the presence of CIS in the subject.
In some embodiments, classification of DCIS lesions based on HSF1 expression or HSF1 activation may be used to differentiate DCIS lesions that are likely to progress to invasive cancer from those lesions that are likely to remain unchanged over extended periods of time or to disappear. DCIS lesions that exhibit elevated HSF1 expression or activation in a sample obtained from the lesion would be classified as having a greater likelihood of progression (e.g., within a time period such as 1 year) than lesions that do not exhibit elevated HSF1 expression or HSF1 activation in a sample obtained therefrom.
In some embodiments, a method of identifying, detecting, or diagnosing cancer, e.g., cancer in situ, is applied to a sample obtained from a subject who is at increased risk of cancer (e.g., increased risk of developing cancer or having cancer) or is suspected of having cancer or is at risk of cancer recurrence. A subject at increased risk of cancer may be, e.g., a subject who has not been diagnosed with cancer but has an increased risk of developing cancer as compared with a control, who may be matched with regard to one or more demographic characteristics such as age, gender, etc. For example, the subject may have a risk at least 1.2, 1.5, 2, 3, 5, 10 or more times that of an age-matched control (e.g., of the same gender), in various embodiments of the invention. It will be understood that “age-matched” can refer to the same number of years of age as the subject or within the same age range as the subject (e.g., a range of 5 or 10 years). For example, a control may be up to 5 years older or younger than the subject. Determining whether a subject is considered “at increased risk” of cancer is within the skill of the ordinarily skilled medical practitioner. Any suitable test(s) and/or criteria can be used. For example, a subject may be considered “at increased risk” of developing cancer if any one or more of the following apply: (i) the subject has a mutation or genetic polymorphism that is associated with increased risk of developing or having cancer relative to other members of the general population not having such mutation or genetic polymorphism (e.g., certain mutations in the BRCA1 or BRCA2 genes are well known to be associated with increased risk of a variety of cancers, including breast cancer and ovarian cancer, mutations in tumor suppressor genes such as Rb or p53 can be associated with a variety of different cancer types); (ii) the subject has a gene or protein expression profile, and/or presence of particular substance(s) in a sample obtained from the subject (e.g., blood), that is/are associated with increased risk of developing or having cancer relative to other members of the general population not having such gene or protein expression profile, and/or substance(s) in a sample obtained from the subject; (iii) the subject has one or more risk factors such as having a family history of cancer, having been exposed to a tumor-promoting agent or carcinogen (e.g., a physical carcinogen, such as ultraviolet or ionizing radiation; a chemical carcinogen such as asbestos, tobacco components or other sources of smoke, aflatoxin, or arsenic; a biological carcinogen such as certain viruses or parasites), or has certain conditions such as chronic infection/inflammation that are correlated with increased risk of cancer; (iv) the subject is over a specified age, e.g., over 60 years of age, etc. In the case of breast cancer, a subject diagnosed as having lobular carcinoma in situ (LCIS) is at increased risk of developing cancer. A subject suspected of having cancer may be a subject who has one or more symptoms of cancer or who has had a diagnostic procedure performed that suggested or was at least consistent with the possible existence of cancer but was not definitive. A subject at risk of cancer recurrence can be any subject who has been treated for cancer such that the cancer was rendered undetectable as assessed, for example, by appropriate methods for cancer detection.
According to certain methods of the invention, a sample, tumor, or subject can be classified as belonging to a particular class of outcome based at least in part on the level of HSF1 expression or HSF1 activation. For example, in some embodiments, a sample, tumor, or subject can be classified as belonging to a high risk class (e.g., a class with a prognosis for a high likelihood of recurrence after treatment or a class with a prognosis for a high likelihood of discovery of metastasis post-diagnosis or a class with a poor prognosis for survival after treatment) or a low risk class (e.g., a class with a prognosis for a low likelihood of recurrence after treatment or a class with a prognosis for a low likelihood of discovery of metastasis post-diagnosis or a class with a good prognosis for survival after treatment). In some embodiments, survival after treatment is assessed 5 or 10 years after diagnosis, wherein increased expression of HSF1 or increased activation of HSF1 is predictive of decreased likelihood of survival at 5 years or 10 years post-diagnosis. In some embodiments, increased expression of HSF1 or increased activation of HSF1 is predictive of decreased mean (average) or median survival. In some embodiments survival is overall survival, wherein increased expression of HSF1 or increased activation of HSF1 is predictive of decreased overall survival (increased overall mortality). In some embodiments survival is disease-specific survival, wherein increased expression of HSF1 or increased activation of HSF1 is predictive of decreased disease-specific survival (i.e., increased disease-specific mortality), wherein “disease-specific” in the context of outcome, refers to considering only deaths due to cancer, e.g., breast cancer.
According to certain methods of the invention, a sample, tumor, or subject can be classified as belonging to a particular class with regard to tumor aggressiveness. For example, a sample or tumor can be classified into a more aggressive class or a less aggressive class or a subject can be classified as having a tumor that is more aggressive or less aggressive. “More aggressive” in this context means that the sample or tumor has one or more features that correlate with a poor outcome. A poor outcome may be, e.g., progression (e.g., after treatment), recurrence after treatment, or cancer-related mortality (e.g., within 5, 10, or 20 years after treatment). For example, a tumor classified as more aggressive may have an increased likelihood of having metastasized locally or to remote site(s) at the time of diagnosis, an increased likelihood of metastasizing or progressing locally (e.g., within a specified time period after diagnosis such as 1 year, 2 years, etc.), an increased likelihood of treatment resistance (e.g., a decreased likelihood of being eradicated or rendered undetectable by treatment). In some aspects, the invention provides a method of assessing the aggressiveness of a tumor, the method comprising: determining the level of HSF1 expression or the level of HSF1 activation in a sample obtained from the tumor, wherein if the level of HSF1 gene expression or HSF activation in the sample obtained from the tumor is increased, the tumor is classified as belonging to a more aggressive class. In some aspects, the invention provides a method of assessing the aggressiveness of a tumor, the method comprising: (a) determining the level of HSF1 expression or the level of HSF1 activation in a sample obtained from the tumor; (b) comparing the level of HSF1 expression or HSF1 activation with a control level of HSF1 gene expression or HSF1 activation; and (c) assessing the aggressiveness of the tumor based at least in part on the result of step (b), wherein a greater level of HSF1 gene expression or HSF activation in the sample obtained from the tumor as compared with the control level of HSF1 gene expression or HSF activation, respectively, is indicative of increased aggressiveness.
In some aspects, the invention provides a method of assessing the likelihood that a tumor has metastasized, the method comprising: determining the level of Heat Shock Factor-1 (HSF1) expression or the level of HSF1 activation in a sample obtained from the tumor, wherein if the level of HSF1 gene expression or HSF activation in the sample obtained from the tumor is increased, the tumor has an increased likelihood of having metastasized. In some aspects, the invention provides a method of assessing the likelihood that a tumor will metastasize, the method comprising: determining the level of HSF1 expression or the level of HSF1 activation in a sample obtained from the tumor, wherein if the level of HSF1 gene expression or HSF activation in the sample obtained from the tumor is increased, the tumor has an increased likelihood of metastasizing. In some aspects, the invention provides a method of assessing the likelihood that a tumor has metastasized, the method comprising: (a) determining the level of HSF1 expression or the level of HSF1 activation in a sample obtained from the tumor; (b) comparing the level of HSF1 expression or HSF1 activation with a control level of HSF1 gene expression or HSF1 activation, wherein a greater level of HSF1 gene expression or HSF activation in the sample obtained from the tumor as compared with a control level is indicative of a greater likelihood that the tumor has metastasized. In some aspects, the invention provides a method of assessing likelihood that a tumor will metastasized, the method comprising: (a) determining the level of HSF1 expression or the level of HSF1 activation in a sample obtained from the tumor; (b) comparing the level of HSF1 expression or HSF1 activation with a control level of HSF1 gene expression or HSF1 activation, wherein a greater level of HSF1 gene expression or HSF activation in the sample obtained from the tumor as compared with a control level is indicative of a greater likelihood that the tumor will metastasize.
An HSF1-based method of the invention may be useful for selecting a treatment regimen for a subject. For example, such results may be useful in determining whether a subject should receive, e.g., would likely benefit from, administration of one or more chemotherapeutic agents (chemotherapy), hormonal therapy, an anti-HER2 agent, or other treatment such as radiation. In some embodiments, “chemotherapeutic agent” refers to an anti-tumor agent that has cytotoxic or cytostatic properties and does not act primarily by interacting with (e.g., interfering with) a hormonal pathway that is specific or relatively specific to particular cell type(s). Exemplary chemotherapeutic agents include anti-metabolites, alkylating agents, microtubule stabilizers or microtubule assembly inhibitors (e.g., taxanes or vinca alkaloids), topoisomerase inhibitors, and DNA intercalators (e.g., anthracycline antibiotics). Such agents are frequently administered systemically. Often, multiple agents are administered. Exemplary treatment regimens for breast cancer include CMF (cyclophosphamide, methotrexate, and 5-FU), AC (doxorubicin and cyclophosphamide), and anthracycline-based regimens. Capecitabine is is a prodrug, that is enzymatically converted to 5-fluorouracil following administration (e.g., in tumor tissue) and is a component of a number of breast cancer treatment regimens. Tegafur is another 5-FU prodrug, which may be administered together with uracil, a competitive inhibitor of dihydropyrimidine dehydrogenase. A “hormonal therapy” (also termed “endocrine therapy”) refers to an antitumor agent that acts primarily by interacting with the endocrine system, e.g., by interfering with a hormonal pathway that is active in a hormonally responsive tissue such as breast, prostate, or endometrium. Exemplary hormonal therapies include, e.g., drugs that inhibit the production or activity of hormones that would otherwise contribute to tumor cell survival, proliferation, etc. For example, in the case of breast cancer, hormonal therapy can comprise an agent that inhibits ER signaling. The agent may interact with and inhibit the ER or inhibit estrogen biosynthesis. In some embodiments hormonal therapy comprises a selective estrogen receptor modulator (SERM) such as tamoxifen, raloxifene, or toremifene. It will be appreciated that SERMs can act as ER inhibitors (antagonists) in breast tissue but, depending on the agent, may act as activators (e.g., partial agonists) of the ER in certain other tissues (e.g., bone). It will also be understood that tamoxifen itself is a prodrug that has relatively little affinity for the ER but is metabolized into active metabolites such as 4-hydroxytamoxifen (afimoxifene) and N-desmethyl-4-hydroxytamoxifen (endoxifen). Such active metabolites may be used as ER inhibitors. In some embodiments, hormonal therapy comprises a selective estrogen receptor down-regulators (SERD) such as fulvestrant or CH4986399. In some embodiments hormonal therapy comprises an agent that inhibits estrogen biosynthesis. For example, estrogen deprivation can be achieved using inhibitors that block the last stage in the estrogen biosynthetic sequence, i.e., the conversion of androgens to estrogens by the enzyme aromatase (“aromatase inhibitors”). Aromatase inhibitors include, e.g., letrozole, anastrazole, and exemestane. In the case of prostate cancer, “hormonal therapy” can comprise administering an agent that interferes with androgen receptor (AR) signaling. For example, antiandrogens are drugs that bind to and inhibit the AR, blocking the growth- and survival-promoting effects of testosterone on certain prostate cancers. Examples include flutamide and bicalutamide. Analogs of gonadotropin-releasing hormone (GnRH) can be used to suppress production of estrogen and progesterone from the ovaries, or to suppress testosterone production from the testes. Leuprolide and goserelin are GnRH analogs which are used primarily for the treatment of hormone-responsive prostate cancer.
“Adjuvant therapy” refers to administration of one or more antitumor agents in connection with, e.g., following, local therapy such as surgery and/or radiation. Adjuvant therapy may be used, e.g., when a cancer appears to be largely or completely eradicated, but there is risk of recurrence. Such therapy may help eliminate residual cells at the site of the primary tumor and/or cells that have disseminated.
“Neoadjuvant therapy” refers to adjuvant therapy administered prior to local therapy, e.g., to shrink a primary tumor.
“Anti-HER2” therapy refers to administration of an antitumor agent that acts primarily by interacting with (e.g., interfering with) HER2. Such agents may be referred to as “anti-HER2” agents. Anti-HER2 agents include, e.g., monoclonal antibodies that bind to HER2, such as trastuzumab and pertuzumab, and various small molecule kinase inhibitors that bind to HER2 and inhibits its kinase activity. Pertuzumab is a recombinant, humanized monoclonal antibody that binds to the extracellular domain II, sterically blocking homo- and heterodimerization with other ERBB receptors, thus preventing signal transduction. In some embodiments, an anti-HER2 agent inhibits HER2 and at least one other member of the human epidermal growth factor receptor family. Examples of such agents include, e.g., dual EGFR (Erb-B1) and HER2 kinase inhibitors such as lapatinib and pan-ERBB kinase inhibitors such as neratinib. In some embodiments, an anti-tumor agent is an antibody-drug conjugate (ADC). For example, an anti-HER2 antibody can be conjugated to a cytotoxic agent. Cytotoxic agents useful for such purposes include, e.g., calicheamicins, auristatins, maytansinoids, and derivatives of CC 1065. For example, trastuzumab emtansine (T-DM1) is an antibody-drug conjugate ADC that combines intracellular delivery of the cytotoxic agent, DM1 (a derivative of maytansine) with the antitumor activity of trastuzumab.
In some embodiments, results of an HSF1-based assay may be useful for selecting an appropriate treatment regimen and/or for selecting the type or frequency of procedures to be used to monitor the subject for local or metastatic recurrence after therapy and/or the frequency with which such procedures are performed. For example, subjects classified as having a poor prognosis (being at high risk of poor outcome) may be treated and/or monitored more intensively than those classified as having a good prognosis. Thus any of the diagnostic, prognostic, or treatment-specific predictive methods can further comprise using information obtained from the assay to help in selecting a treatment or monitoring regimen for a subject suffering from cancer or at increased risk of cancer or at risk of cancer recurrence or in providing an estimate of the risk of poor outcome such as cancer related mortality or recurrence. The information may be used, for example, by a subject's health care provider in selecting a treatment or in treating a subject. A health care provider could also or alternatively use the information to provide a cancer patient with an accurate assessment of his or her prognosis. In some embodiments, a method of the invention can comprise making a treatment selection or administering a treatment based at least in part on the result of an HSF1-based assay. In some embodiments, a method of the invention can comprise selecting or administering more aggressive treatment to a subject, if the subject is determined to have a poor prognosis. In some embodiments, a method of the invention can comprise selecting or administering more aggressive treatment, if the subject is determined to have CIS that is positive for HSF1 expression or HSF1 activation. Often a “treatment” or “treatment regimen” refers to a course of treatment involving administration of an agent or use of a non-pharmacological therapy multiple times over a period of time, e.g., over weeks or months. A treatment can include one or more pharmacological agents (often referred to as “drugs” or “compounds”) and/or one or more non-pharmacological therapies such as radiation, surgery, etc. A treatment regimen can include the identity of agents to be administered to a subject and may include details such as the dose(s), dosing interval(s), number of courses, route of administration, etc. “Monitoring regimen” refers to repeated evaluation of a subject over time by a health care provider, typically separated in time by weeks, months, or years. The repeated evaluations can be on a regular or predetermined approximate schedule and are often performed with a view to determining whether a cancer has recurred or tracking the effect of a treatment on a tumor or subject.
“More aggressive” treatment (also referred to as “intensive” or “more intensive” treatment herein) can comprise, for example, (i) administration of chemotherapy in addition to, or instead of, hormonal therapy; (ii) administration of a dose of one or more agents (e.g., chemotherapeutic agent) that is at the higher end of the acceptable dosage range (e.g., a high dose rather than a medium or low dose, or a medium dose rather than a low dose) and/or administration of a number of doses or a number of courses at the higher end of the acceptable range and/or use of non-hormonal cytotoxic/cytostatic chemotherapy; (iii) administration of multiple agents rather than a single agent; (iv) administration of more, or more intense, radiation treatments; (v) administration of a greater number of agents in a combination therapy; (vi) use of adjuvant therapy; (vii) more extensive surgery, such as mastectomy rather than breast-conserving surgery such as lumpectomy. For example, a method can comprise (i) selecting that the subject not receive chemotherapy (e.g., adjuvant chemotherapy) if the tumor is considered to have a good prognosis; or (ii) selecting that the subject receive chemotherapy (e.g., adjuvant chemotherapy), or administering such chemotherapy, if the tumor is considered to have a poor prognosis. In some embodiments, a method of the invention can comprise selecting that a subject receives less aggressive treatment or administering such treatment, if the subject is determined to have a good prognosis. “Less aggressive” (also referred to as “less intensive”) treatment could entail, for example, using dose level or dose number at the lower end of the acceptable range, not administering adjuvant therapy, selecting a breast-conserving therapy rather than mastectomy, selecting hormonal therapy rather than non-hormonal cytotoxic/cytostatic chemotherapy, or simply monitoring the patient carefully. “More intensive” or “intensive” monitoring could include, for example, more frequent clinical and/or imaging examination of the subject or use of a more sensitive imaging technique rather than a less sensitive technique. “Administering” a treatment could include direct administration to a subject, instructing another individual to administer a treatment to the subject (which individual may be the subject themselves in the case of certain treatments), arranging for administration to a subject, prescribing a treatment for administration to a subject, and other activities resulting in administration of a treatment to a subject. “Selecting” a treatment or treatment regimen could include determining which among various treatment options is appropriate or most appropriate for a subject, recommending a treatment to a subject, or making a recommendation of a treatment for a subject to the subject's health care provider.
In some aspects, the invention provides a method of selecting a regimen for monitoring or treating a subject in need of treatment for cancer comprising: (a) assessing the level of HSF1 expression or HSF1 activation in a sample obtained from the subject; and (b) selecting an intensive monitoring or treatment regimen if the level of HSF1 expression or HSF1 activation is increased in the sample. In some aspects, the invention provides a method of selecting a regimen for monitoring or treating a subject in need of treatment for cancer, wherein said regimen is selected from among multiple options including at least one more intensive regimen and at least one less intensive regimen, the method comprising: (a) obtaining a classification of the subject, wherein the subject is classified into a high risk or a low risk group based at least in part on an assessment of the level of HSF1 expression or HSF1 activation in a sample obtained from the subject; and (b) selecting a more intensive regimen if the subject is classified as being in a high risk group or selecting a less intensive regimen if the subject is classified as being in a low risk group. In some aspects, the invention provides a method of monitoring or treating a subject in need of treatment for cancer comprising: (a) obtaining a classification of the subject, wherein the classification is based at least in part on an assessment of the level of HSF1 expression or HSF1 activation in a sample obtained from the subject; and (b) monitoring or treating the subject according to an intensive regimen if the subject is classified as being in a high risk group or monitoring or treating the subject with a less intensive regimen if the subject is classified as being in a low risk group. “Obtaining a classification” could comprise any means of ascertaining a classification such as performing an HSF1-based assay (or directing that an HSF1-based assay be performed) and assigning a classification based on the results, receiving results of an HSF1-based assay and assigning a classification using the results, receiving or reviewing a classification that was previously performed, etc.
In some embodiments a subject has been previously treated for the cancer, while in other embodiments the subject has not previously received treatment for the cancer. In some embodiments the previous treatment for a breast tumor is hormonal therapy such as tamoxifen or another anti-estrogen agent, e.g., another SERM.
In some embodiments, a subject falls within a selected age group or range, e.g., 40 years old or less, 50 years old or less, 55 years old or less, 60 years old or less, between 40 and 60 years of age, 40 years old or more, 50 years old or more, 55 years old or more, 60 years old or more, etc. Any age group or range may be selected in various embodiments of the invention, whether or not specifically mentioned here. In some embodiments, a female subject is pre-menopausal. In some embodiments, a female subject is post-menopausal.
In some embodiments a subject, e.g., a subject having or at risk of lung cancer or lung cancer recurrence, is a current smoker or former smoker. In some embodiments a subject, e.g., a subject having or at risk of developing lung cancer or lung cancer recurrence, is a non-smoker who has no or essentially no history of smoking.
In some embodiments, an HSF1-based method may be used to identify cancer patients that do not require adjuvant therapy, e.g., adjuvant hormonal therapy and/or adjuvant chemotherapy. For example, a prognostic method may identify patients that have a good prognosis and would be unlikely to experience clinically evident recurrence and/or metastasis even without adjuvant therapy. Since adjuvant therapy can cause significant side effects, it would be beneficial to avoid administering it to individuals whom it would not benefit. In some embodiments, an HSF1-based prognostic method of the invention may be used to identify cancer patients that have a poor prognosis (e.g., they are at high risk of recurrence and/or metastasis) and may therefore benefit from adjuvant therapy. In some embodiments, an HSF1-based prognostic method may be used to identify cancer patients that might not be considered at high risk of poor outcome based on other prognostic indicators (and may therefore not receive adjuvant therapy) but that are in fact at high risk of poor outcome, e.g., recurrence and/or metastasis. Such patients may therefore benefit from adjuvant therapy. In some embodiments, HSF1-based method may be used in a subject with cancer in whom an assessment of the tumor based on standard prognostic factors, e.g., standard staging criteria (e.g., TMN staging), histopathological grade, does not clearly place the subject into a high or low risk category for recurrence after local therapy (e.g., surgery) and/or for whom the likelihood of benefit from adjuvant therapy is unclear, as may be the case in various early stage cancers where, e.g., the cancer is small and has not detectably spread to regional lymph nodes or metastasized more remotely.
In some embodiments, an HSF1-based method may be used to provide prognostic information for a subject with a breast tumor that has one or more recognized clinicopathologic features and/or that falls into a particular class or category based on gene expression profiling. For example, breast cancers can be classified into molecular subtypes based on gene expression profiles, e.g., luminal A, luminal B, ERBB2-associated, basal-like, and normal-like (see, e.g., Serlie, T., et al., Proc Natl Acad Sci USA. (2001) 98(19):10869-74). Breast cancers can be classified based on a number of different clinicopathologic features such as histologic subtype (e.g., ductal; lobular; mixed), histologic grade (grade 1, 2, 3); estrogen receptor (ER) and/or progesterone receptor (PR) status (positive (+) or negative (−)), HER2 (ERBB2) expression status, and lymph node involvement. For example, the following breast cancer subtypes can be defined based on expression of estrogen receptor (ER) and human epidermal growth factor receptor 2 (HER2), e.g., as assessed by immunohistochemistry (IHC): (1) ER+, HER2+; (2) ER+, HER2; (3) ER−, HER2+; and (4) ER−, HER2−. The level of expression can be used to further divide these subtypes. Amplification of the HER2 locus can be assessed, e.g., using in situ hybridization (ISH), e.g., fluorescent in situ hybridization (FISH). In some embodiments, an HSF1-based method is applied to a tumor that is ER+. In some embodiments an HSF1-based method is applied to a tumor that is ER−. In some embodiments an HSF1-based method is applied to a tumor that is HER2+. In some embodiments an HSF1-based method is applied to a tumor that is HER2−. In some embodiments an HSF1-based method is applied to a tumor that is PR+. In some embodiments an HSF1-based method is applied to a tumor that is PR−. In some embodiments an HSF1-based method is applied to a tumor that is EGFR+. In some embodiments an HSF-based method is applied to a tumor that is EGFR−. It will be understood that these markers may be present or absent in any combination in various embodiments. For example, in some embodiments an HSF1-based method is applied to a tumor that is ER+/HER2+ or ER+/HER2− (each of which categories can include tumors that are PR+ or PR− and are EGFR+ or EGFR−). In some embodiments, the sample or tumor is not “triple negative”, i.e., the sample or tumor is negative for expression of ER, PR, and HER2.
In some embodiments a subject has DCIS. In some embodiments a subject has Stage I or Stage II breast cancer. In some embodiments a subject has Stage III breast cancer. In some embodiments, cancer stage is assigned using pathologic criteria, clinical criteria, or a combination of pathologic and clinical criteria.
In some embodiments a subject does not have detectable lymph node involvement, i.e., the subject is “lymph node negative” (LNN). For example, the subject may have be ER+/lymph node negative. The clinical management of subjects in this early stage group (e.g., treatment selection) is challenging due to the lack of markers indicating which small portion of the population will have a recurrence (e.g., following surgery) and could therefore benefit from more intensive monitoring and/or more aggressive treatment. In accordance with certain embodiments of the invention, a subject with ER+, LNN cancer that has increased HSF1 expression or increased HSF1 activation is monitored and/or treated more intensively than if the cancer does not have increased HSF1 expression or increased HSF1 activation.
In some embodiments, increased HSF1 expression or increased HSF1 activation in a sample from an ER+ breast tumor identifies patients having ER+ tumors that may be resistant to hormonal therapy. Such patients may benefit from use of a more aggressive treatment regimen, e.g., chemotherapy in addition to, or instead of, hormonal therapy, or more extensive surgery.
It has been reported that while about half of all breast cancers are assigned histologic grade 1 or 3 status (with a low or high risk of recurrence, respectively), a substantial percentage of tumors (30%-60%) are classified as histologic grade 2, which is less informative for clinical decision making because of intermediate risk of recurrence (Sotiriou C, et al., J Natl Cancer Inst., 98(4):262-72, 2006). Improved prognostic methods could be of significant use in this setting, for example. In some embodiments, an HSF1-based method is applied to a tumor classified as histologic grade 2, e.g., to classify histologic grade 2 tumors into high and low risk groups. In some embodiments, an HSF1-based method is applied to a tumor classified as histologic grade 2, e.g., to classify histologic grade 2 tumors into higher and lower risk groups, wherein tumors that have increased HSF1 expression or HSF1 activation are classified into the higher risk group. Tumors that do not have increased HSF1 expression or HSF1 activation would be classified into the lower risk group.
In some embodiments, an HSF1-based assay is used to provide sample classification, diagnostic, prognostic, or treatment-predictive information pertaining to lung cancer, e.g., non-small cell lung cancer (NSCLS), such as a lung adenocarcinoma. In some embodiments, the lung cancer, e.g., lung adenocarcinoma, is a Stage I cancer (T1 N0 M0 or T2 N0 M0). In some embodiments the cancer is a Stage 1A lung cancer (T1 N0 M0). In some embodiments the cancer is a Stage IB lung cancer (T1N0M0). In some embodiments, the lung cancer, e.g., lung adenocarcinoma, is a Stage II cancer. Stage I and II lung cancers are typically treated by surgical resection of the tumor. Although surgery can be curative, a significant fraction of patients develop recurrence or metastases. Such patients might benefit from adjuvant therapy (radiation and/or chemotherapy). However, the current standard staging system (TMN) cannot predict which stage I or II lung cancers will recur. Although studies have shown adjuvant chemotherapy to be of benefit in groups of patients with stage II lung cancer, its role in treating stage I lung cancer is unclear. Without wishing to be bound by any theory, the number of patients diagnosed with stage I or II lung cancer may increase significantly at least in part due to the increased use of imaging modalities such as computed tomography (CT) scans for screening purposes, e.g., in individuals who have a significant smoking history. It would be useful to be able to identify those patients with stage I or stage II cancer who are at increased likelihood of recurrence and may therefore be more likely to benefit from adjuvant chemotherapy. In some embodiments, an HSF1-based method is applied to classify a stage I or stage II lung tumor into a higher or lower risk group, wherein tumors that have increased (e.g., high or intermediate) HSF1 expression or HSF1 activation are classified into the higher risk group. Tumors that have absent or low HSF1 expression or HSF1 activation are classified into the lower risk group. Subjects with tumors classified into the higher risk group have an increased likelihood of recurrence than subjects with tumors classified into the lower risk group and may benefit from adjuvant chemotherapy. Subjects with tumors classified into the lower risk group may be treated with surgery alone. Adjuvant chemotherapy for operable lung cancer frequently includes a platinum-based agent (e.g., cisplatin or carboplatin), optionally in combination with an anti-mitotic agent (e.g., an anti-microtubule agent) such as a taxane (e.g., paclitaxel (Taxol) or docetaxel (Taxotere)) or a vinca alkaloid such as vinblastine, vincristine, vindesine and vinorelbine. Other agents that may be administered as adjuvant chemotherapy in operable lung cancer, typically in combination with a platinum agent, include mitomycin, doxorubicin, or etoposide. Other adjuvant chemotherapy regiments include tegafur alone, uracil alone, a combination of tegafur and uracil, or a combination of tegafur and/or uracil with a platinum agent.
In some embodiments a subject has been previously treated for the cancer, while in other embodiments the subject has not previously received treatment for the cancer. In some embodiments the previous treatment for a breast tumor is hormonal therapy such as tamoxifen or another anti-estrogen agent, e.g., another SERM.
In some embodiments, a subject falls within a selected age group or range, e.g., 40 years old or less, 50 years old or less, 55 years old or less, 60 years old or less, between 40 and 60 years of age, 40 years old or more, 50 years old or more, 55 years old or more, 60 years old or more, etc. Any age group or range may be selected in various embodiments of the invention, whether or not specifically mentioned here. In some embodiments, a female subject is pre-menopausal. In some embodiments, a female subject is post-menopausal.
In some embodiments a subject, e.g., a subject having or at risk of lung cancer or lung cancer recurrence, is a current smoker or former smoker. In some embodiments a subject, e.g., a subject having or at risk of developing lung cancer or lung cancer recurrence, is a non-smoker who has no or essentially no history of smoking.
Any method of the invention that comprises assessing HSF1 expression or HSF1 activation or using the level of expression or activation of an HSF1 gene product may, in certain embodiments, further comprise assessing or using the level of expression, activation, or activity of one or more additional cancer biomarkers. Any method of the invention that comprises assessing HSF1-CP expression or using the level of expression of one or more HSF1-CP gene products may, in certain embodiments, further comprise assessing or using the level of expression, activation, or activity of one or more additional cancer biomarkers. In certain embodiments, the level of expression, activation, or activity of an HSF1 gene product and/or an HSF1-CP gene product is used in conjunction with the level of expression, activation, or activity of one or more additional cancer biomarkers in a method of providing diagnostic, prognostic, or treatment-specific predictive information. The additional cancer biomarker(s) may be selected based at least in part on the site in the body from which a sample was obtained or the suspected or known tissue of origin of a tumor. For example, in the case of suspected or known breast cancer, one or more breast cancer biomarkers may be assessed.
In some embodiments, an HSF1-based assay is used together with additional information, such as results of a second assay (or multiple assays) and/or clinicopathological information to provide diagnostic, prognostic, or treatment-predictive information pertaining to breast cancer. In some embodiments, such information comprises, e.g., subject age, tumor size, nodal involvement, tumor histologic grade, ER status, PR status, and/or HER2 status, menopausal status, etc.). In some embodiments, the additional information includes the PR status of the tumor. For example, a method can comprise determining the PR status of a tumor and, if the PR status is positive, classifying the tumor with respect to prognosis or treatment selection based on expression of HSF1 or activation of HSF1. In some embodiments, information from an HSF1-related assay is used together with a decision making or risk assessment tool such as the computer program Adjuvant! Online (https://www.adjuvantonline.com/index.jsp). The basic format of an early version of Adjuvant! was described in the article Ravdin, Siminoff, Davis, et al. JCO 19(4) 980-991, 2001. In some embodiments, the second assay is a gene expression profiling assay such as the MammaPrint® (Agendia BV, Amsterdam, the Netherlands), Oncotype DX™ (Genomic Health, Redwood City, Calif.), Celera Metastasis Score™ (Celera, Inc., Rockville, Md.), Breast BioClassifier (ARUP, Salt Lake City, Utah), Rotterdam signature 76-gene panel (Erasmus University Cancer Center, Rotterdam, The Netherlands), MapQuant Dx™ Genomic Grade test (Ipsogen, Stamford, Conn.), Invasiveness Gene Signature (OncoMed Pharmaceuticals, Redwood City, Calif.), NuvoSelect™ assay (Nuvera Biosciences, Woburn, Mass.), THEROS Breast Cancer IndexSM (BCI) (bioTheranostics, San Diego) that classifies tumors (e.g., into high or low risk groups) based on expression level of multiple genes using, e.g., a microarray or multiplex RT-polymerase chain reaction (PCR) assay. The phrase “used together” with in regard to two or more assays means that the two or more assays are applied to a particular tumor. In some embodiments, the two or more assays are applied to the same sample (or a portion thereof) obtained from the tumor.
In some embodiments, an HSF1-based assay may be used together with a gene expression profile in which expression level of at least 1, at least 5, or at least 10 different genes (“classifier genes”) is used to classify a tumor. It will be understood that such gene expression profile assays may measure expression of control genes as well as classifier genes. In some embodiments an HSF1-based assay is used together with an H:I™ test (bioTheranostics, Carlsbad, Calif.), in which the ratio of expression of HOXB 13 and IL-17B genes is used to classify a tumor. In some embodiments, an HSF1-based assay is used together with an antibody-based assay, e.g., the ProEx™ Br (TriPath Oncology, Durham, N.C.), Mammostrat® (Applied Genomics, Inc., Huntsville, Ala.), ADH-5 (Atypical Ductal Hyperplasia) Breast marker antibody cocktail (Biocare Medical, Concord, Calif.), measurement of urokinase-like plasminogen activator (uPA) and/or its inhibitor plasminogen activator inhibitor 1 (PAI1), or a FISH-based test such as the eXaagenBC™ (eXagen Diagnostics, Inc., Albuquerque, N. Mex.). In some embodiments, an HSF1-based assay is used together with an assay that measures proliferation. For example, expression of a proliferation marker such as Ki67 (Yerushalmi et al., Lancet Oncol. (2010), 11(2):174-83) can be used. In some embodiments, an HSF1-based assay is used together with a miRNA-based assay (e.g., an assay that measures expression of one or more miRNAs or miRNA precursors). For example, in some embodiments, an HSF1-based assay is used together with a miR31-based assay, e.g., as described in PCT/US2009/067015 (WO/2010/065961).
An HSF1-based assay (e.g., any of the HSF1-based assays described herein) may be used together with another assay in any of a number of ways in various embodiments of the invention. For example, in some embodiments, if results of two tests are discordant (e.g., one test predicts that the subject is at high risk while the other predicts that the subject is at low risk), the subject may receive more aggressive therapeutic management than if both tests predict low risk. In some embodiments, if a result of a non-HSF1-based assay is inconclusive or indeterminate, an HSF1-based assay can be used to provide a diagnosis, prognosis, or predictive information. In some embodiments, one can have increased confidence if results of an HSF1-based assay and a second assay are in agreement. For example, if both tests indicate that the subject is at low risk, there can be increased confidence in the appropriateness of providing less aggressive therapeutic management, e.g., to not administer adjuvant chemotherapy, while if both tests indicate that the subject is at high risk, there can be increased confidence in the appropriateness of providing more aggressive therapeutic management.
In some embodiments, a method of the invention comprises providing treatment-specific predictive information relating to use of a proteostasis modulator to treat a subject with cancer, based at least in part on assessing the level of expression of HSF1 or activation of HSF1 in a sample obtained from the subject. “Proteostasis” (which term is used interchangeably with “protein homeostasis”) refers to controlling the concentration, conformation (e.g., folding), binding interactions (quaternary structure), and subcellular location of the proteins within a cell, often through mechanisms such as transcriptional and/or translational changes, chaperone-assisted folding and disaggregation, or controlled protein degradation. Proteostasis can be thought of as a network comprising multiple distinguishable pathways (“proteostasis pathways”) that may interact with and influence each other. Proteostasis pathways include, e.g., the HSR (discussed above), the ubiquitination-proteasome degradation pathway, and the unfolded protein response (UPR). “Proteostasis modulator” refers to an agent that modulates one or more proteostasis pathways.
In some embodiments, a sample can be classified as belonging to (i.e., obtained from) a subject with cancer who is a suitable candidate for treatment with a proteostasis modulator. For example, the invention provides a method of determining whether a subject with cancer is a suitable candidate for treatment with a proteostasis modulator, comprising assessing the level of HSF1 expression or HSF1 activation in a sample obtained from the subject, wherein an increased level of HSF1 expression or an increased level of HSF1 activation in the sample is indicative that the subject is a suitable candidate for treatment with a proteostasis modulator. In some embodiments, the invention provides a method of determining whether a subject with cancer is likely to benefit from treatment with a proteostasis modulator, comprising: assessing the level of HSF1 expression or HSF1 activation in a sample obtained from the subject, wherein an increased level of HSF1 expression or an increased level of HSF1 activation in the sample is indicative that the subject is likely to benefit from treatment with a proteostasis modulator. In some embodiments, the invention provides a method of identifying a subject with cancer who is likely to benefit from treatment with a proteostasis modulator, comprising assessing the level of HSF1 expression or HSF1 activation in a sample obtained from the subject, wherein an increased level of HSF1 expression or an increased level of HSF1 activation in the sample identifies the subject as being likely to benefit from treatment with a proteostasis modulator. In some embodiments, the invention provides a method of predicting the likelihood that a tumor will be sensitive to a protein homeostasis modulator, the method comprising: assessing the level of HSF1 expression or the level of HSF1 activation in a sample obtained from the tumor; wherein if the level of HSF1 expression or activation is increased, the tumor has an increased likelihood of being sensitive to the protein homeostasis modulator. A tumor is “sensitive” to a treatment if the subject experiences a partial or complete response or stabilization of disease following treatment. Response can be assessed, for example, by objective criteria such as anatomical tumor burden, as known in the art. In some embodiments, a response correlates with increased progression-free survival or increased overall survival. Thus in some embodiments, a tumor is sensitive to a treatment if administration of the treatment correlates with increased progression-free survival or increased overall survival.
In some embodiments, treatment with a proteostasis modulator comprises administering a proteostasis modulator to the subject in addition to a standard treatment regimen for treating the subject's cancer. It will be understood that the proteostasis modulator is typically administered in an effective amount in a suitable pharmaceutical composition that may comprise one or more pharmaceutically acceptable carriers. “Pharmaceutically acceptable carrier” refers to a diluent, excipient, or vehicle with which the therapeutically active agent is administered. An effective amount may be administered in one dose or multiple doses.
Without wishing to be bound by any theory, increased HSF1 activity may help tumor cells cope with the stress of therapy (e.g., pharmacological agents, radiation, etc.) and/or may promote phenotypic diversity among tumor cells by helping tumor cells cope with the consequences of mutations. Such effects may contribute to poor outcomes in cancer by, for example, promoting emergence of malignant or more aggressive tumor subclones and/or promoting treatment resistance. Administration of a proteostasis modulator may counteract such effects. In some embodiments, a therapeutic benefit could result at least in part from a proteostasis modulator reducing the likelihood that a tumor will become resistant to such treatment or at least in part reversing resistance that may be present at the time of treatment. For example, addition of a proteostasis modulator to a standard chemotherapy or hormonal regimen for breast cancer may reduce the likelihood that a tumor will become resistant to such regimen, or at least in part reverse resistance that may be present at the time of treatment. Based at least in part on the discovery that HSF1 expression and HSF1 activation are increased in pre-invasive cancer, the invention encompasses the recognition that intervention at the pre-invasive stage of cancer with a proteostasis modulator (e.g., to counteract HSF1's activity) may delay or reduce the likelihood of progression to invasive cancer. In some aspects, the invention encompasses the recognition that treatment of subjects without evidence of cancer (e.g., subjects at increased risk of cancer) with a proteostasis modulator (e.g., to counteract HSF1's activity) may inhibit or reduce the likelihood that the subject will develop cancer. It should be noted that a subject may be a suitable candidate for treatment with a proteostasis modulator even if the tumor does not exhibit increased HSF1 expression or increased HSF1 activation. For example, subjects with early stage cancer that has not progressed to a state in which HSF1 is activated may benefit
In some aspects, the invention provides a method of treating a subject who has pre-invasive cancer, the method comprising administering a proteostasis modulator to a subject with pre-invasive cancer. Such treatment may, for example, inhibit progression of the pre-invasive cancer to invasive cancer. In some aspects, the invention provides a method of treating a subject at increased risk of cancer, the method comprising administering a proteostasis modulator to the subject. In some aspects, the invention provides a method of inhibiting development of cancer in a subject, the method comprising administering a proteostasis modulator to the subject.
In some aspects, the invention provides a method of inhibiting recurrence of cancer in a subject, the method comprising administering a proteostasis modulator to the subject. In some embodiments, the cancer is characterized by increased HSF1 expression or increased HSF1 activation.
In some aspects, the invention provides a method of inhibiting emergence of resistance to therapy in a subject with cancer, the method comprising administering a proteostasis modulator to the subject in combination with an additional therapy, thereby reducing the likelihood of resistance to the additional therapy. In some embodiments, the additional therapy is a chemotherapeutic agent. In some embodiments, the additional therapy is a hormonal agent. In some embodiments, the cancer is characterized by increased HSF1 expression or increased HSF1 activation.
In some embodiments, a proteostasis modulator is an HSR modulator, e.g., an HSR inhibitor. “HSR inhibitor” refers to an agent that inhibits expression or activity of at least one component of the HSR. HSR components include, e.g., HSF1 itself and heat shock proteins such as HSP 40, HSP70, and HSP90. In some embodiments, the component of the HSR is HSP90. For purposes of the present invention, HSP90 refers to HSP90A family HSP90, commonly referred to in the art as “cytoplasmic HSP90” (see Taipale, M, et al., Nat. Rev. Mol. Cell. Biol. (2010) 11(7):515-28 for review). Most vertebrates, including humans, have two genes encoding HSP90A proteins with very similar sequences and highly overlapping functions: HSP90AA1 (Gene ID for human gene: 3320; Gene ID for mouse ortholog: 15519) and HSP90AB1 (Gene ID for human gene: 3326; Gene ID for mouse gene: 15516). The proteins encoded by HSP90AA1 and HSP90AB1 are referred to as HSP90a and HSP90β, respectively. For purposes of the present invention, an “HSP90 inhibitor” refers to a compound that inhibits at least one HSP90A, e.g., HSP90β. In some embodiments, the compound inhibits both HSP90α and HSP90β. HSP90A is an ATPase and contains three main structural domains: a highly conserved N-terminal (NTD) domain of ˜25 kDa, which contains a binding pocket for ATP; a middle domain (MD) of ˜40 kDa, and a C-terminal domain (CTD) of ˜12 kDa. HSP90A forms homodimers and undergoes a dynamic cycle termed the “chaperone cycle” involving ATP binding and hydrolysis, during which it undergoes conformational shifts that are important in its recognition and release of client proteins.
Numerous HSP90 inhibitors are known in the art. In general, an HSP90 inhibitor can inhibit HSP90 activity in any of a variety of ways, such as by inhibiting the ATPase activity of HSP90. In some embodiments an HSP90 inhibitor specifically binds to the ATP binding pocket of HSP90. In some embodiments an HSP90 inhibitor binds outside the ATP binding pocket. A number of HSP90 inhibitors have shown promise in the treatment of cancer, and others are under investigation. Exemplary HSP90 inhibitors include, e.g., benzoquinone ansamycins such as geldanamycin and herbimycin, resorcylic acid lactones such as radicicol, purine scaffold compounds, and a variety of synthetic compounds based on other chemical scaffolds (see, e.g., Taldone, T., et al. Bioorg Med Chem., 17(6):2225-35, 2009 or Trepel, J., et al., Nat Rev Cancer. 10(8):537-49, 2010). Exemplary HSP90 inhibitors that have entered clinical development (i.e., they have been administered to at least one human subject in a clinical trials) include, e.g., geldanamycin analogs such as 17-allylamino-17-demethoxygeldanamycin (17-AAG, also called tanespimycin), 17-dimethylaminoethylamino-17-demethoxygeldanamycin (I 7-DMAG), retaspimycin (IPI-504), alvespimycin (IPI-493), SNX-5422, AUY922, STA-9090, HSP990, CNF2024 (BIIB021), XL888, AT13387, and MPC-3100.
An ongoing challenge in the development of HSP90 inhibitors has been the identification of which patients are likely to benefit from treatment with these drugs (36-39). The basal level of HSP90 in tumors per se has generally not proven to be predictive. Without wishing to be bound by any theory, the effectiveness of HSF1, even as a single marker, in predicting the outcome of cancers as described herein may reflect the fact that HSF1, as a dominant regulator of the entire heat shock network, serves as a better indicator of the overall stress levels within a tumor and consequently the “load” on the HSP-based chaperone machinery. In accordance with certain aspects of the invention, this load could determine which patients might benefit from a HSP90 inhibitor, either alone or in combination with other agents. In some embodiments, the HSP90 inhibitor has entered clinical development for, e.g., treatment of cancer. In some embodiments the HSP90 inhibitor is a small molecule.
In some embodiments, a proteostasis modulator is an HSF1 inhibitor. As used herein, an “HSF1 inhibitor” is an agent that inhibits expression or activity of HSF1. In some embodiments, an HSF1 inhibitor is an RNAi agent, e.g., a short interfering RNA (siRNA) or short hairpin RNA (shRNA) that, when present in a cell (e.g., as a result of exogenous introduction of an siRNA or intracellular expression of a shRNA) results in inhibition of HSF expression by RNA interference (e.g., by causing degradation or translational repression of mRNA encoding HSF1, mediated by the RNAi-induced silencing complex). Exemplary RNAi agents that inhibit HSF1 expression are disclosed, e.g., in PCT/EP2010/069917 (WO/2011/073326) or in reference 6. In some embodiments an HSF1 inhibitor may be an intrabody that binds to HSF1, or an agent such as a single chain antibody, aptamer, or dominant negative polypeptide that binds to HSF1, wherein the agent optionally comprises a moiety that allows it to gain entry into tumor cells. For example, the agent may comprise a protein transduction domain that allows the agent to cross the plasma membrane or a ligand that binds to a cell surface receptor such that the agent is internalized, e.g., by endocytosis. In some embodiments the HSF1 inhibitor comprises a small molecule. In some embodiments the HSF1 inhibitor comprises an agent that inhibits activation of HSF1. For example, the agent may at least in part block assembly of multimers, e.g., trimers, comprising HSF1. Suitable agents for inhibiting HSF1 may be identified using a variety of screening strategies.
In some embodiments, a proteostasis modulator is a proteasome inhibitor. The proteasome is a large, multi-protein complex that unfolds and proteolyses substrate polypeptides, reducing them to short fragments (Lodish, et al., supra). Most protein degradation by the proteasome occurs via the ubiquitination-proteasome degradation pathway (UPD pathway), a multistep enzymatic cascade in eukaryotes in which ubiquitin is conjugated via a lysine residue to target proteins for destruction. Proteins tagged with lysine-linked chains of ubiquitin are marked for degradation in the proteasome. Proteasome-mediated protein degradation, e.g., via the UPD pathway, allows cells to eliminate excess and misfolded proteins and regulates various biological processes, such as cell proliferation. “Proteasome inhibitor” refers to an agent that inhibits activity of the proteasome or inhibits synthesis of a proteasome component. Proteasome inhibitors include, e.g., a variety of peptidic and non-peptidic agents that bind reversibly to the proteasome, bind covalently to the active site of the proteasome, or bind to the proteasome outside the active site (sometimes termed “allosteric inhibitors”) (Ruschak A M, et al., J Natl Cancer Inst. (2011) 103(13):1007-17). A number of proteasome inhibitors have shown promise in the treatment of cancer, including bortezomib (Velcade®) (approved by the US FDA), and various others under investigation. Exemplary proteasome inhibitors that have been tested in clinical trials in cancer include bortezomib, CEP-18770, MLN-9708, carfilzomib, ONX 0912, and NPI-0052 (salinosporamide A). HIV protease inhibitors such as nelvinavir also inhibit the proteasome. Other agents that inhibit the proteasome include chloroquine, 5-amino-8-hydroxyquinoline (5AHQ), disulfiram, tea polyphenols such as epigallocatechin-3-gallate, MG-132, PR-39, PS-I, PS-IX, and lactacystin. In some embodiments, a method of the invention is applied with regard to proteasome inhibitor that has entered clinical development for, e.g., treatment of cancer.
In some aspects, the invention encompasses use of a method comprising assessing the level of HSF1 expression or HSF1 activation as a “companion diagnostic” test to determine whether a subject is a suitable candidate for treatment proteostasis modulator. In some embodiments a proteostasis modulator may be approved (allowed to be sold commercially for treatment of humans or for veterinary purposes) by a government regulatory agency (such as the US FDA, the European Medicines Agency (EMA), or government agencies having similar authority over the approval of therapeutic agents in other jurisdictions) with the recommendation or requirement that the subject is determined to be a suitable candidate for treatment with the proteostasis modulator based at least in part on assessing the level of HSF1 expression or HSF1 activation in a tumor sample obtained from the subject. For example, the approval may be for an “indication” that includes the requirement that a subject or tumor sample be classified as having high levels or increased levels of HSF1 expression or HSF1 activation. Such a requirement or recommendation may be included in the package insert provided with the agent. In some embodiments a particular method for detection or measurement of an HSF1 gene product or of HSF1 activation or a specific test reagent (e.g., an antibody that binds to HSF1 polypeptide or a probe that hybridizes to HSF1 mRNA) or kit may be specified. In some embodiments, the method, test reagent, or kit will have been used in a clinical trial whose results at least in part formed the basis for approval of the proteostasis modulator. In some embodiments, the method, test reagent, or kit will have been validated as providing results that correlate with outcome of treatment with the proteostasis modulator.
In some aspects, the invention provides a method of assessing efficacy of treatment of cancer comprising: (a) assessing the level of HSF1 expression or HSF1 activation in a sample obtained from a subject that has been treated for cancer, wherein absence of increased HSF1 expression or increased HSF1 activation in said sample indicates effective treatment. In some embodiments, step (a) is repeated at one or more time points following treatment of the subject for cancer, wherein continued absence of increased HSF1 expression or increased HSF1 activation of over time indicates effective treatment. The sample may be obtained, for example, from or close to the site of a cancer that was treated (e.g., from or near a site from which a tumor was removed).
In some aspects, the invention provides a method of assessing efficacy of treatment of cancer comprising: (a) assessing the level of HSF1 expression or HSF1 activation in a sample obtained from a subject having cancer, and (b) repeating step (a) at one or more time points during treatment of the subject for cancer, wherein decreased HSF1 expression or decreased HSF1 activation of over time indicates effective treatment. The sample may be obtained, for example, from or close to the site of a cancer being treated.
In some aspects, the invention provides a method of monitoring a subject for cancer recurrence comprising: (a) assessing the level of HSF1 expression or HSF1 activation in a sample obtained from a subject that has been treated for cancer, wherein presence of increased HSF1 expression or increased HSF1 activation in the sample indicates cancer recurrence. In some embodiments, step (a) is repeated at one or more time points following treatment of the subject for cancer. The sample may be obtained, for example, from or close to the site of a cancer that was treated (e.g., from or near a site from which a tumor was removed).
In certain embodiments of any aspect of the invention, a cancer is breast cancer. In certain aspects, the invention provides the recognition that assessment of HSF1 expression or activation for diagnostic, prognostic, or predictive purposes may be of particular use in estrogen receptor (ER) positive breast cancer. In certain embodiments of any of the inventive methods relating to breast cancer, the breast cancer is estrogen receptor (ER) positive breast cancer.
Certain aspects and embodiments of the invention are described herein mainly in regard to breast cancer (e.g., breast tumor cells, breast tumor samples, breast tumors, and/or subjects in need of prognosis, diagnosis, or treatment selection for breast cancer). It will be understood that the invention encompasses embodiments in which products and processes described herein are applied in the context of tumors arising from organs or tissues other than the breast. One of ordinary skill in the art will recognize that certain details of the invention may be modified according, e.g., to the particular tumor type or tumor cell type of interest. Such embodiments are within the scope of the invention.
It will be understood that many of the methods provided herein, e.g., methods of classification, may be described in terms of samples, tumors, or subjects and such descriptions maybe considered equivalent and freely interchangeable. For example, where reference is made herein to a method of classifying a sample, such method may be expressed as a method of classifying a tumor from which the sample was obtained or as a method of classifying a subject from which the sample originated in various embodiments. Similarly, where reference is made herein to assessing the level of HSF1 expression or HSF1 activation in a sample, such method may be expressed as a method of assessing the level of HSF1 expression or HSF1 activation in a tumor from which the sample was obtained in various embodiments. It will also be understood that a useful diagnostic, prognostic, or treatment-specific predictive method need not be completely accurate. For example, “predicting”, “predicting the likelihood”, and like terms, as used herein, do not imply or require the ability to predict with 100% accuracy and do not imply or require the ability to provide a numerical value for a likelihood (although such value may be provided). Instead, such terms typically refer to forecast of an increased or a decreased probability that a result, outcome, event, etc., of interest exists or will occur, e.g., when particular criteria or conditions exist, as compared with the probability that such result, outcome, or event, etc., exists or will occur when such criteria or conditions are not met.
Methods of Assessing HSF1 Expression or HSF1 Activation
HSF1 genomic, mRNA, polypeptide sequences from a variety of species, including human, are known in the art and are available in publicly accessible databases such as those available at the National Center for Biotechnology Information (www.ncbi.nih.gov) or Universal Protein Resource (www.uniprot.org). Exemplary databases include, e.g., GenBank, RefSeq, Gene, UniProtKB/SwissProt, UniProtKB/Trembl, and the like. The HSF1 gene has been assigned NCBI GeneID: 3297. The NCBI Reference Sequence accession numbers for human HSF1 mRNA and polypeptide are NM—005526 and NP—005517, respectively, and the human HSF1 polypeptide GenBank acc. no. is AAA52695.1. The human HSF1 gene is located on chromosome 8 (8q24.3), RefSeq accession number NC—000008.10. Sequences of other nucleic acids and polypeptides of interest herein could also be readily obtained from such databases. Sequence information may be of use, for example, to generate reagents for detection of HSF1 gene products.
In general, the level of HSF1 expression of HSF1 activation can be assessed using any of a variety of methods. In many embodiments, the level of HSF1 expression is assessed by determining the level of an HSF1 gene product in a sample obtained from a tumor. In some embodiments an HSF1 gene product comprises HSF1 mRNA. In general, any suitable method for measuring RNA can be used to measure the level of HSF1 mRNA in a sample. For example, methods based at least in part on hybridization and/or amplification can be used. Exemplary methods of use to detect mRNA include, e.g., in situ hybridization, Northern blots, microarray hybridization (e.g., using cDNA or oligonucleotide microarrays), reverse transcription PCR (e.g., real-time reverse transcription PCR), nanostring technology (see, e.g., Geiss, G., et al., Nature Biotechnology (2008), 26, 317-325; U.S. Ser. No. 09/898,743 (U.S. Pat. Pub. No. 20030013091) for exemplary discussion of nanostring technology and general description of probes of use in nanostring technology). A number of such methods include contacting a sample with one or more nucleic acid probe(s) or primer(s) comprising a sequence (e.g., at least 10 nucleotides in length, e.g., at least 12, 15, 20, or 25 nucleotides in length) substantially or perfectly complementary to a target RNA (e.g., HSF1 mRNA). The probe or primer is often detectably labeled using any of a variety of detectable labels. In many embodiments the sequence of the probe or primer is sufficiently complementary to HSF1 mRNA to allow the probe or primer to distinguish between HSF1 mRNA and most or essentially all (e.g., at least 99%/o, or more) transcripts from other genes in a mammalian cell, e.g., a human cell, under the conditions of an assay. In some embodiments, “substantially complementary” refers to at least 90% complementarity, e.g., at least 95%, 96%, 97%, 98%, or 99% complementarity. A probe or primer may also comprise sequences that are not complementary to HSF1 mRNA, so long as those sequences do not hybridize to other transcripts in a sample or interfere with hybridization to HSF1 mRNA under conditions of the assay. Such additional sequences may be used, for example, to immobilize the probe or primer to a support. A probe or primer may be labeled and/or attached to a support or may be in solution in various embodiments. A support may be a substantially planar support that may be made, for example, of glass or silicon, or a particulate support, e.g., an approximately spherical support such as a microparticle (also referred to as a “bead” or “microsphere”). In some embodiments, a sequencing-based approach such as serial analysis of gene expression (SAGE) (including variants thereof) or RNA-Sequencing (RNA-Seq) is used. RNA-Seq refers to the use of any of a variety of high throughput sequencing techniques to quantify RNA transcripts (see, e.g., Wang, Z., et al. Nature Reviews Genetics (2009), 10, 57-63). Other methods of use for detecting RNA include, e.g., electrochemical detection, bioluminescence-based methods, fluorescence-correlation spectroscopy, etc. It will be understood that certain methods that detect mRNA may, in some instances, also detect at least some pre-mRNA transcript(s), transcript processing intermediates, and degradation products of sufficient size.
In some embodiments an HSF1 gene product comprises HSF1 polypeptide. In general, any suitable method for measuring proteins can be used to measure the level of HSF1 polypeptide in a sample. In many embodiments, an immunological method or other affinity-based method is used. In general, immunological detection methods involve detecting specific antibody-antigen interactions in a sample such as a tissue section or cell sample. The sample is contacted with an antibody that binds to the target antigen of interest. The antibody is then detected using any of a variety of techniques. In some embodiments, the antibody that binds to the antigen (primary antibody) or a secondary antibody that binds to the primary antibody has been tagged or conjugated with a detectable label. In some embodiments a label-free detection method is used. A detectable label may be, for example, a fluorescent dye (e.g., a fluorescent small molecule) or quencher, colloidal metal, quantum dot, hapten, radioactive atom or isotope, or enzyme (e.g., peroxidase). It will be appreciated that a detectable label may be directly detectable or indirectly detectable. For example, a fluorescent dye would be directly detectable, whereas an enzyme may be indirectly detectable, e.g., the enzyme reacts with a substrate to generate a directly detectable signal. Numerous detectable labels and strategies that may be used for detection, e.g., immunological detection, are known in the art. Exemplary immunological detection methods include, e.g., immunohistochemistry (IHC); enzyme-linked immunosorbent assay (ELISA), bead-based assays such as the Luminex® assay platform (Invitrogen), flow cytometry, protein microarrays, surface plasmon resonance assays (e.g., using BiaCore technology), microcantilevers, immunoprecipitation, immunoblot (Western blot), etc. IHC generally refers to immunological detection of an antigen of interest (e.g., a cellular constituent) in a tissue sample such as a tissue section. As used herein, IHC is considered to encompass immunocytochemistry (ICC), which term generally refers to the immunological detection of a cellular constituent in isolated cells that essentially lack extracellular matrix components and tissue microarchitecture that would typically be present in a tissue sample. Traditional ELISA assays typically involve use of primary or secondary antibodies that are linked to an enzyme, which acts on a substrate to produce a detectable signal (e.g., production of a colored product) to indicate the presence of antigen or other analyte. IHC generally refers to the immunological detection of a tissue or cellular constituent in a tissue or cell sample comprising substantially intact (optionally permeabilized) cells. As used herein, the term “ELISA” also encompasses use of non-enzymatic reporters such as fluorogenic, electrochemiluminescent, or real-time PCR reporters that generate quantifiable signals. It will be appreciated that the term “ELISA” encompasses a number of variations such as “indirect”, “sandwich”, “competitive”, and “reverse” ELISA.
In some embodiments, e.g., wherein IHC is used for detecting HSF1, a sample is in the form of a tissue section, which may be a fixed or a fresh (e.g., fresh frozen) tissue section or cell smear in various embodiments. A sample, e.g., a tissue section, may be embedded, e.g., in paraffin or a synthetic resin or combination thereof. A sample, e.g., a tissue section, may be fixed using a suitable fixative such as a formalin-based fixative. The section may be a paraffin-embedded, formalin-fixed tissue section. A section may be deparaffinized (a process in which paraffin (or other substance in which the tissue section has been embedded) is removed (at least sufficiently to allow staining of a portion of the tissue section). To facilitate the immunological reaction of antibodies with antigens in fixed tissue or cells it may be helpful to unmask or “retrieve” the antigens through pretreatment of the sample. A variety of antigen retrieval procedures (sometimes called antigen recovery), can be used in IHC. Such methods can include, for example, applying heat (optionally with pressure) and/or treating with various proteolytic enzymes. Methods can include microwave oven irradiation, combined microwave oven irradiation and proteolytic enzyme digestion, pressure cooker heating, autoclave heating, water bath heating, steamer heating, high temperature incubator, etc. To reduce background staining in IHC, the sample may be incubated with a buffer that blocks the reactive sites to which the primary or secondary antibodies may otherwise bind. Common blocking buffers include, e.g., normal serum, non-fat dry milk, bovine serum albumin (BSA), or gelatin, and various commercial blocking buffers. The sample is then contacted with an antibody that specifically binds to the antigen whose detection is desired (e.g., HSF1 protein). After an appropriate period of time, unbound antibody is then removed (e.g., by washing) and antibody that remains bound to the sample is detected. After immunohistochemical staining, a second stain may be applied, e.g., to provide contrast that helps the primary stain stand out. Such a stain may be referred to as a “counterstain”. Such stains may show specificity for discrete cellular compartments or antigens or stain the whole cell. Examples of commonly used counterstains include, e.g., hematoxylin, Hoechst stain, or DAPI. The tissue section can be visualized using appropriate microscopy, e.g., light microscopy, fluorescence microscopy, etc. In some embodiments, automated imaging system with appropriate software to perform automated image analysis is used.
In some embodiments, flow cytometry (optionally including cell sorting) is used to detect HSF1 expression. The use of flow cytometry would typically require the use of isolated cells substantially removed from the surrounding tissue microarchitecture, e.g., as a single cell suspension. HSF1 mRNA or polypeptide level could be assessed by contacting cells with a labeled probe that binds to HSF1 mRNA or a labeled antibody that binds to HSF1 protein, respectively, wherein said probe or antibody is appropriately labeled (e.g., with a fluorophore, quantum dot, or isotope) so as to be detectable by flow cytometry. In some embodiments, cell imaging can be used to detect HSF1.
In some embodiments, an antibody for use in an immunological detection method, e.g., IHC, is monoclonal. In some embodiments an antibody is polyclonal. In some embodiments, an antibody is a preparation that comprises multiple monoclonal antibodies. In some embodiments, the monoclonal or polyclonal antibodies have been generated using the same portion of HSF1 (or full length HSF) as an immunogen or binding target. In some embodiments, an antibody is an anti-peptide antibody. In some embodiments, a monoclonal antibody preparation may comprise multiple distinct monoclonal antibodies generated using different portions of HSF1 as immunogens or binding targets. Many antibodies that specifically bind to HSF1 are commercially available and may be used in embodiments of the present invention. One of ordinary skill in the art would readily be able to generate additional antibodies suitable for use to detect HSF1 polypeptide using standard methods.
In some embodiments, a ligand that specifically binds to HSF1 but is not an antibody is used as an affinity reagent for detection of HSF1. For example, nucleic acid aptamers or certain non-naturally occurring polypeptides structurally unrelated to antibodies based on various protein scaffolds may be used as affinity reagents. Examples include, e.g., agents referred to in the art as affibodies, anticalins, adnectins, synbodies, etc. See, e.g., Gebauer, M. and Skerra, A., Current Opinion in Chemical Biology, (2009), 13(3): 245-255 or PCT/US2009/041570. In some embodiments an aptamer is used as an affinity reagent. The terms “affinity reagent” and “binding agent” are used interchangeably herein.
In some embodiments, a non-affinity based method is used to assess the level of HSF1 polypeptide or HSF1 activation. For example, mass spectrometry could be used to detect HSF1 or to specifically detect phosphorylated HSF1.
In some embodiments, an antibody (or other affinity reagent) or procedure for use to detect HSF1 (or HSF1 phosphorylated on serine 326) can be validated, if desired, by showing that the classification obtained using the antibody or procedure correlate with a phenotypic characteristic of interest such as presence or absence of CIS, cancer prognosis, or treatment outcome, in an appropriate set of samples. For example, as described in the Examples, a commercially available monoclonal antibody preparation RT-629-PABX (Thermo Scientific) comprising a combination of rat monoclonal antibodies (“antibody cocktail”) was validated for use in IHC for detection of HSF1 and classification of samples and subjects into different categories correlated with presence or absence of CIS, cancer prognosis, or treatment outcome. Other exemplary antibodies of use for detecting or isolating HSF1 are also disclosed in the Examples. In some embodiments, an antibody or antibody preparation or a protocol or procedure for performing IHC may be validated for use in an inventive method by establishing that its use provides similar results to those obtained using RT-629-PABX and the procedures described in the Examples on an appropriate set of test samples. For example, an antibody or antibody preparation or a procedure may be validated by establishing that its use results in the same classification (concordant classification) of at least 80%, 85%, 90%, 95% or more of samples in an appropriate set of test samples as is obtained using the antibody preparation of RT-629-PABX. A set of test samples may be selected to include, e.g., at least 10, 20, 30, or more samples in each category in a classification scheme (e.g., “positive” and “negative” categories; categories of“no”, “low”, or “high” expression, scores of 1, 2, 3; etc.). In some embodiments, a set of test samples comprises breast tissue samples, e.g., from the NHS. In some embodiments a set of samples is in the form of a tissue microarray. Once a particular antibody or procedure is validated, it can be used to validate additional antibodies or procedures. Likewise, a probe, primer, microarray, or other reagent(s) or procedure(s) to detect HSF1 RNA can be validated, if desired, by showing that the classification obtained using the reagent or procedure correlates with a phenotypic characteristic of interest such as presence or absence of CIS, cancer prognosis, or treatment outcome, in an appropriate set of samples.
It will be understood that suitable controls and normalization procedures can be used to accurately quantify HSF1 expression, where appropriate. For example, measured values can be normalized based on the expression of one or more RNAs or polypeptides whose expression is not correlated with a phenotypic characteristic of interest. In some embodiments, a measured value can be normalized to account for the fact that different samples may contain different proportions of a cell type of interest, e.g., cancer cells, versus non-cancer cells. For example, in some embodiments, the percentage of stromal cells, e.g., fibroblasts, may be assessed by measuring expression of a stromal cell-specific marker, and the overall results adjusted to accurately reflect HSF1 mRNA or polypeptide level specifically in the tumor cells. Similarly, appropriate controls and normalization procedures can be used to accurately quantify HSF1 activation, where appropriate. It would also be understood that if a sample such a tissue section contains distinguishable (e.g., based on standard histopathological criteria), areas of neoplastic and non-neoplastic tissue, such as at the margin of a tumor, the level of HSF1 expression or activation could be assessed specifically in the area of neoplastic tissue, e.g., for purposes of comparison with a control level, which may optionally be the level measured in the non-neoplastic tissue.
In certain embodiments of the invention the level of HSF1 mRNA or protein level is not measured or analyzed simply as a contributor to a cluster analysis, dendrogram, or heatmap based on gene expression profiling in which expression at least 20; 50; 100; 500; 1,000, or more genes is assessed. In certain embodiments of the invention, e.g., if HSF1 mRNA or protein level is measured as part of such a gene expression profile, the level of HSF1 mRNA or protein is used to classify samples or tumors (e.g., for diagnostic, prognostic or treatment-specific predictive purposes) in a manner that is distinct from the manner in which the expression of many or most other genes in the gene expression profile are used. For example, the level of HSF1 mRNA or polypeptide may be used independently of most or all of the other measured expression levels or may be weighted more strongly than many or most other mRNAs in analyzing or using the results.
In some embodiments, HSF1 mRNA or polypeptide level is used together with levels of a set of no more than 10 other mRNAs or proteins that are selected for their utility for classification for diagnostic, prognostic, or predictive purposes in one or more types of cancer, such as breast cancer. For example, in the case of breast cancer, HSF1 mRNA or polypeptide levels can be used together with a measurement of estrogen receptor (ER), progesterone receptor (PR), or human epidermal growth factor receptor 2 (HER2) mRNA or polypeptide levels. In some embodiments, measurement of ER, PR, HER2 mRNA and/or other mRNA is performed using ISH. In some embodiments, measurement of ER, PR, HER2 polypeptide and/or other polypeptides is performed using IHC. In some embodiments such testing is performed in accordance with recommendations of the American Society of Clinical Oncology/College of American Pathologists Guideline Recommendations for Immunohistochemical Testing of Estrogen and Progesterone Receptors in Breast Cancer or the American Society of Clinical Oncology/College of American Pathologists Guideline Recommendations for Human Epidermal Growth Factor Receptor 2 Testing in Breast Cancer. In some embodiments such testing is performed according to recommendations of a commercially available kit, e.g., a kit approved by a governmental regulatory agency (e.g., the U.S. Food and Drug Administration) for use in clinical diagnostic, prognostic, or predictive purposes.
In general, the level of HSF1 activation can be assessed using any of a variety of methods in various embodiments of the invention. In some embodiments, the level of HSF1 activation is determined by detecting HSF1 polypeptide in cell nuclei, wherein nuclear localization of HSF1 polypeptide is indicative of HSF1 activation. HSF1 localization can be assessed, for example, using IHC, flow cytometry, FACS, etc. Alternately, or additionally, cell nuclei could be isolated and HSF1 polypeptide detected by immunoblot. In some embodiments, HSF1 nuclear localization could be assessed by staining for HSF1 protein, counterstaining with a dye that binds to a nuclear component such as DNA, and assessing co-localization of HSF1 and such nuclear component. Cell imaging can be used in some embodiments. It will be understood that “detecting” as used herein, can encompass applying a suitable detection procedure and obtaining a negative result, i.e., detecting a lack of expression or activation.
In some embodiments, the level of HSF1 activation is determined by determining the level of HSF1 phosphorylation, wherein HSF1 phosphorylation is indicative of HSF1 activation. In some embodiments, phosphorylation of HSF1 on serine 326 is determined as an indicator of HSF1 activation. Phosphorylation of HSF1 on serine 326 can be assessed, for example, using antibodies that bind specifically to HSF1 phosphorylated on serine 326. In some embodiments, a ratio of phosphorylated HSF1 to unphosphorylated HSF1 (on serine 326) is used as an indicator of HSF1 activation, with a higher ratio indicating more activation. Measurement of other post-translational modifications indicative of HSF1 activation could be used in various embodiments.
In some embodiments, the level of HSF1 activation is determined by measuring a gene expression profile of one or more genes whose expression is regulated by HSF1, wherein increased expression of a gene that is positively regulated by HSF1 or decreased expression of a gene that is negatively regulated by HSF1 is indicative of HSF1 activation. In many embodiments, the HSF1-regulated gene is not an HSP (e.g., HSP90) or, if HSP expression is measured, at least one additional HSF1-regulated gene other than an HSP is also measured. In some embodiments a gene expression profile measures expression of at least 5 HSF1-regulated genes, e.g., between 5 and about 1,000 HSF1-regulated genes. In some embodiments at least some of the genes are HSF1-CP genes. In some embodiments at least some of the HSF1-CP genes are HSF1-CSS genes. In some embodiments at least some of the HSF1-CP genes are HSF1-CaSig2 genes. In some embodiments at least some of the HSF1-CP genes are HSF1-CaSig3 genes. In some embodiments at least some of the HSF1-CP genes are refined HSF1-CSS genes. In some embodiments at least some of the HSF1-CP genes are Module 1, Module 2, Module 3, Module 4, or Module 5 genes. Of course the gene expression profile may in some embodiments also measure expression of one or more genes that are not regulated by HSF1. In some embodiments measurement of expression of one or more genes that are not regulated by HSF1 is used as a control or for normalization purposes. In some embodiments measurement of expression of one or more genes that are not regulated by HSF1 may be disregarded. In some embodiments no more than 1%, 5%, 10%, 20%, 30%, 40%, or 50%, of measurements are of genes that are not bound and/or regulated by HSF1. In some embodiments, determining whether HSF1 is activated comprises comparing a gene expression profile obtained from a sample of interest with gene expression profile(s) obtained from one or more samples in which HSF1 is activated or is not activated. If the gene expression profile obtained from the sample clusters with or resembles the gene expression profile obtained from sample(s) in which HSF1 is activated, the sample of interest can be classified as exhibiting HSF1 activation. On the other hand, if the gene expression profile obtained from the sample of interest clusters with or resembles the gene expression profile obtained from sample(s) in which HSF1 is not activated, the sample of interest can be classified as not exhibiting HSF1 activation. Methods for clustering samples are well known in the art or assigning a sample to one of multiple clusters are well known in the art and include, e.g., hierarchical clustering, k-means clustering, and variants of these approaches.
In some embodiments, the level of HSF1 activation is determined by measuring binding of HSF1 to the promoter of one or more HSF1-regulated genes, wherein binding of HSF1 to the promoter of an HSF1-regulated gene is indicative of HSF1 activation. In some embodiments, an HSF1-regulated gene is a gene whose expression level (e.g., as assessed based on mRNA or protein levels) is increased or decreased by at least a factor of 1.2 as a result of HSF1 activation. In some embodiments, an HSF1-regulated gene is among the 1,000 genes in the human genome whose expression is most strongly affected (increased or inhibited) by HSF1. In some embodiments, an HSF1-regulated gene is among the 1,000 genes in the human genome whose promoter is most strongly bound by HSF1 under conditions in which HSF1 is activated. Methods for measuring binding of a protein (e.g., HSF1) to DNA (e.g., genomic DNA) include, e.g., chromatin immunoprecipitation using an antibody to the protein followed by microarray hybridization to identify bound sequences, commonly referred to as ChIP-on-chip (see, e.g., U.S. Pat. Nos. 6,410,243; 7,470,507; 7,575,869); ChIP-Sequencing, which uses chromatin immunoprecipitation followed by high throughput sequencing to identify the bound DNA; and DamID (DNA adenine methyltransferase identification; see, e.g., Vogel M J, et al (2007). “Detection of in vivo protein-DNA interactions using DamID in mammalian cells”. Nat Protoc 2 (6): 1467-78).
In some embodiments, an assay to detect HSF1 expression or activation makes use of fluorescence resonance energy transfer (FRET).
In some embodiments, the level of an HSF1 gene product or the level of HSF1 activation is determined to be “increased” or “not increased” by comparison with a suitable control level or reference level. The terms “reference level” and “control level” may be used interchangeably herein. A suitable control level can be a level that represents a normal level of HSF1 gene product or HSF1 activation, e.g., a level of HSF1 gene product or HSF1 activation existing in cells or tissue in a non-diseased condition and in the substantial absence of stresses that activate the heat shock response. Thus any method that includes a step of (a) assessing (determining) the level of HSF1 gene expression or the level of HSF1 activation in a sample can comprise a step of(b) comparing the level of HSF1 gene expression or HSF1 activation with a control level of HSF1 gene expression or HSF1 activation, wherein if the level determined in (a) is greater than the control level, then the level determined in (a) is considered to be “increased” (or, if the level determined in (a) is not greater than the control level, then the level determined in (a) is considered to be “not increased”. For example, if a tumor has an increased level of HSF1 expression or HSF1 activation as compared to a control level, the tumor is classified as having a high risk of poor outcome, while if the tumor does not have a significantly increased level of HSF1 relative to a control level, the tumor is classified as having a low risk of poor outcome. A control level may be determined in a variety of ways. In some embodiments a control level is an absolute level. In some embodiments a control level is a relative level, such as the percentage of tumor cells exhibiting nuclear HSF1 staining or the percentage of tumor cells or tumor cell nuclei exhibiting intense staining for HSF1. A comparison can be performed in various ways. For example, in some embodiments one or more samples are obtained from a tumor, and one or more samples are obtained from nearby normal (non-tumor) tissue composed of similar cell types from the same patient. The relative level of HSF1 gene product or HSF1 activation in the tumor sample(s) versus the non-tumor sample(s) is determined. In some embodiments, if the relative level (ratio) of HSF1 gene product in the tumor samples versus the non-tumor sample(s) is greater than a predetermined value (indicating that cells of the tumor have increased HSF1), the tumor is classified as high risk. In some embodiments the predetermined value is, e.g., at least 1.5, 2, 2.5, 3, 5, 10, 20, or more. In some embodiments the predetermined value is between about 1.5 and about 10. A control level can be a historical measurement. For example, the data provided herein provide examples of levels of HSF1 expression and HSF1 activation in normal breast, cervix, colon, lung, pancreas, prostate, and meningeal tissue and tissue from breast, cervix, colon, lung, pancreas, prostate, and meningeal tumors, thereby providing examples of suitable control levels. It will be understood that in at least some embodiments a value may be semi-quantitative, qualitative or approximate. For example, visual inspection (e.g., using light microscopy) of a stained IHC sample can provide an assessment of the level of HSF1 expression or HSF1 activation without necessarily counting cells or nuclei or precisely quantifying the intensity of staining.
Various risk categories may be defined. For example, tumors may be classified as at low, intermediate, or high risk of poor outcome. A variety of statistical methods may be used to correlate the risk of poor outcome with the relative or absolute level of HSF1 expression or HSF1 activation.
For purposes of description herein it is assumed that the control or reference level represents normal levels of HSF1 expression or HSF1 activation present in non-cancer cells and tissues. However, it will be understood that a level of HSF1 expression or HSF1 activation characteristic of cancer (e.g., breast cancer) could be used as a reference or control level. In that case, the presence of HSF1 expression or HSF1 activation at a level comparable to, e.g., approximately the same, as or greater than the control level would be indicative of the presence of cancer, poor cancer prognosis, aggressive cancer phenotype, or to identify a subject who is a suitable candidate for treatment with a proteostasis modulator, while a decreased level of HSF1 expression or HSF1 activation as compared with the control level would be predictive of good cancer prognosis, less aggressive cancer phenotype or to identify a subject who may not be a suitable candidate for treatment with a proteostasis modulator, etc.
Methods have generally been stated herein mainly in terms of conclusions or predictions that can be made if increased HSF1 expression or increased HSF1 activation is present. Methods could equally well have been stated in terms of conclusions or predictions that can be made if increased HSF1 expression or increased HSF1 activation is not present. For example, if HSF1 expression is absent in a sample being assessed for the presence or absence of cancer, the sample would not be classified as cancer based on HSF1 expression. If HSF1 expression or HSF activation is absent or low in a sample from an invasive tumor, the tumor would be classified as having a good prognosis. If HSF1 expression or HSF activation is absent or low in a sample from an invasive tumor, the subject may not benefit from treatment with a proteostasis modulator.
Any of the methods of the invention may, in certain embodiments, comprise assigning a score to a sample (or to a tumor from which a sample was obtained) based on the level of HSF1 expression or HSF1 activation measured in the sample, e.g., based on the level of an HSF1 gene product or the level of HSF1 activation or a combination thereof.
In some embodiments a score is assigned based on assessing both HSF1 polypeptide level and HSF1 activation level. For example, a score can be assigned based on the number (e.g., percentage) of nuclei that are positive for HSF1 and the intensity of the staining in the positive nuclei. For example, a first score (e.g., between 0 and 5) can be assigned based on the percentage positive nuclei, and a second score (e.g., between 0 and 5) assigned based on staining intensity in the nuclei. In some embodiments, the two scores are added to obtain a composite score (e.g., ranging between 0 and 10). In some embodiments the two scores are multiplied to obtain a composite score (e.g., ranging between 0 and 25). The range can be divided into multiple (e.g., 2 to 5) smaller ranges, e.g., 0-9, 10-18, 19-25, and samples or tumors are assigned an overall HSF1 expression/activation score based on which subrange the composite score falls into. For example, 0-9 is low, 10-18 is intermediate, and 19-25 is high in some embodiments. A higher score indicates, for example, increased aggressiveness, increased likelihood of poor outcome, poor prognosis. Thus in some aspects, the invention provides a method of assigning a score to a sample comprising cells, the method comprising steps of: (a) assigning a first score to the sample based on the number or percentage of cell nuclei that are positive for HSF1 protein; (b) assigning a second score to the sample based on the level of HSF1 protein in cell nuclei; and (c) obtaining a composite score by combining the scores obtained in step (a) and step (b). In some embodiments, combining the scores comprises adding the scores. In some embodiments combining the scores comprises multiplying the scores. In some embodiments the method further comprises assigning the sample to an HSF1 expression/activation category based on the composite score. It will be understood that if the sample is a tissue sample that comprises areas of neoplastic tissue and areas of non-neoplastic tissue (e.g., as identified using standard histopathological criteria), the score(s) can be assigned based on assessing neoplastic tissue. The non-neoplastic tissue may be used as a control.
In some embodiments, a score is assigned using a scale of 0 to X, where 0 indicates that the sample is “negative” for HSF1 (e.g., no detectable HSF1 polypeptide in cell nuclei), and X is a number that represents strong (high intensity) staining in the majority of cell nuclei. X can be, e.g., 2, 3, 4, or 5 in various embodiments. In some embodiments, a score is assigned using a scale of 0, 1, or 2, where 0 indicates that the sample is negative for HSF1 (no detectable HSF1 polypeptide in cell nuclei), 1 is low level nuclear staining and 2 is strong (high intensity) staining in the majority of cell nuclei. A higher score indicates a less favorable prognosis than a lower score, e.g., more likely occurrence of metastasis, shorter disease free survival, lower likelihood of 5 year survival, lower likelihood of 10 year survival, or shorter average survival. A score can be obtained by evaluating one field or multiple fields in a cell or tissue sample. Multiple samples from a tumor may be evaluated in some embodiments. It will be understood that “no detectable HSF1” could mean that the level detected, if any, is not noticeably or not significantly different to background levels. It will be appreciated that a score can be represented using numbers or using any suitable set of symbols or words instead of, or in combination with numbers. For example, scores can be represented as 0, 1, 2; negative, positive; negative, low, high; −, +, ++, +++; 1+, 2+, 3+, etc.
In some embodiments, at least 20, 50, 100, 200, 300, 400, 500, 1000 cells, or more (e.g., tumor cells) are assessed to evaluate HSF1 expression or HSF activation in a sample or tumor, e.g., to assign a score to a sample or tumor. In some embodiments, samples or tumors that do not exhibit HSF1 polypeptide in nuclei, e.g., as assessed using IHC, may be considered negative for HSF1.
The number of categories in a useful scoring or classification system can be at least 2, e.g., between 2 and 10, although the number of categories may be greater than 10 in some embodiments. The scoring or classification system often is effective to divide a population of tumors or subjects into groups that differ in terms of an outcome such as local progression, local recurrence, discovery or progression of regional or distant metastasis, death from any cause, or death directly attributable to cancer. An outcome may be assessed over a given time period, e.g., 2 years, 5 years, 10 years, 15 years, or 20 years from a relevant date. The relevant date may be, e.g., the date of diagnosis or approximate date of diagnosis (e.g., within about 1 month of diagnosis) or a date after diagnosis, e.g., a date of initiating treatment. Methods and criteria for evaluating progression, response to treatment, existence of metastases, and other outcomes are known in the art and may include objective measurements (e.g., anatomical tumor burden) and criteria, clinical evaluation of symptoms), or combinations thereof. For example, 1, 2, or 3-dimensional imaging (e.g., using X-ray, CT scan, or MRI scan, etc.) and/or functional imaging may be used to detect or assess lesions (local or metastatic), e.g., to measure anatomical tumor burden, detect new lesions, etc. In some embodiments, a difference between groups is statistically significant as determined using an appropriate statistical test or analysis method, which can be selected by one of ordinary skill in the art. In many embodiments, a difference between groups would be considered clinically meaningful or clinically significant by one of ordinary skill in the art.
HSF1 Mediates a Distinct Malignancy-Enabling Transcriptional Program in Cancer
Previous work in mice revealed that HSF1 is co-opted by tumor cells to promote their survival, to the detriment of their hosts. The importance of HSF1 in supporting carcinogenesis has been demonstrated in model systems by the dramatically reduced susceptibility of Hsf1-knockout mice to tumor formation. This has been established for cancers driven by oncogenic RAS, tumor suppressor p53 mutations, and chemical carcinogens. In addition to its role in tumor formation in mice, HSF1 fosters the growth of human tumor cells in culture. Depleting HSF1 from established human cancer lines markedly reduces their proliferation and survival (Dai et al., 2007; Meng et al., 2010; Min et al., 2007; Santagata et al., 2012; Zhao et al., 2011). In mouse models, HSF1 enables adaptive changes in a diverse array of cellular processes, including signal transduction, glucose metabolism and protein translation (Dai et al., 2007; Khaleque et al., 2008; Lee et al., 2008; Zhao et al., 2011; Zhao et al., 2009). The commonly held view is that HSF1 exerts this broad influence in cancer simply by allowing cells to manage the imbalances in protein homeostasis that arise in malignancy. According to this view, the main impact of HSF1 on tumor biology occurs indirectly, through the actions of molecular chaperones like Hsp90 and Hsp70 on their client proteins (Jin et al., 2011; Solimini et al., 2007).
Described herein is the discovery that HSF1 has a broad range of direct gene regulating effects (e.g., transactivating or repressing effects) in cancer cells. By comparing cells with high and low malignant potential alongside their non-transformed counterparts, Applicants identified an HSF1-regulated transcriptional program specific to malignant cells and distinct from heat shock. In a genome-wide survey of HSF1 DNA binding, numerous genes whose regulatory regions were bound by HSF1 in a highly malignant tumor cell line under normal temperature conditions were identified. Similar HSF1 binding patterns were observed in multiple human cancer cell lines of various cancer types and in human tumor samples, thus demonstrating the presence of a dramatic basal level of HSF1 activation in cancer even in the absence of thermal stress. The term “thermal stress” is used interchangeably herein with “heat shock” and refers to exposing cells to elevated temperature (i.e., temperature above physiologically normal for such cells) for a sufficient period of time to detectably, e.g., robustly, induce the heat shock response. One of ordinary skill in the art will know of suitable protocols to heat shock cells, e.g., mammalian cells, without causing substantial, e.g., irreversible, cell damage or death. In some embodiments heat shock comprises exposing cells to a temperature of 42±0.5 degrees C., e.g., 42 degrees C., for about 1 hour or similar exposures to elevated temperatures (e.g., at or above 40 or 41 degrees C.) resulting in similar or at least approximately equivalent induction of the heat shock response. In some embodiments heat shock comprises exposing cells to a temperature of 43±0.5 degrees C. or 44±0.5 degrees C. for, e.g., between 30 and 60 minutes. In some embodiments cells are not “pre-conditioned” by prior exposure to elevated temperature within a relevant time period, e.g., within 24 hours prior to heat shock. In some embodiments cells are pre-conditioned by prior exposure to elevated temperature within a relevant time period, e.g., within 24 hours prior to heat shock. In some embodiments cells are allowed to recover for up to about 60 minutes, e.g., about 30 minutes, at normal (sub-heat shock) temperature, e.g., 37 degrees C., prior to isolation of RNA or DNA. In some embodiments assessment of the effect of heat shock on expression may occur after allowing an appropriate amount of time for translation of a transcript whose expression is induced by HSF1. In some embodiments cells are returned to normal temperature conditions for no more than 2, 3, 4, 6, or 8 hours prior to assessment of the effect of heat shock (or harvesting of cells, RNA, or DNA for subsequent assessment). Unless otherwise indicated or evident from the context, the term “heat shocked cells” or “cells subjected to heat shock” refers to heat shocked non-transformed cells. The terms “non-transformed”, “non-cancer”, “non-tumorigenic”, and “non-tumor” are used interchangeably herein to refer to cells that are not cancer cells or tissue that is not tumor tissue. In some aspects, non-cancer cells lack morphological characteristics typical of cancer cells and lack the ability to form tumors when introduced into an immunologically compatible host. In some embodiments a non-cancer cell is a primary cell. In some embodiments a non-cancer cell is an immortal cell. In some embodiments an immortal non-cancer cell expresses human teloinerase catalytic subunit (hTERT) or a non-human ortholog thereof. In some embodiments a non-cancer cell is a cell that has been immortalized by introducing a nucleic acid encoding human telomerase catalytic subunit (hTERT) or a non-human ortholog thereof into the cell or an ancestor of the cell. In some embodiments non-transformed cells used as control cells for comparison with transformed cells are of the same type or tissue of origin as transformed cells with which they are compared. In some embodiments non-transformed cells are immortalized cells derived from normal (non-cancer) tissue. It is generally assumed herein that, unless otherwise indicated, heat shocked cells and cancer cells are not deliberately subjected to other stresses known to activate the heat shock response. However, the present disclosure encompasses embodiments in which HSF1 activity in response to alternate stresses rather than heat shock is compared with HSF1 cancer-related activity as described herein in detail with respect to heat shock.
HSF1 was found to regulate a transcriptional program in cancer cells that is distinct from the HSF1 transcriptional program elicited by heat shock. Some genes are bound by HSF1 in cancer cells, e.g., malignant cancer cells, but are not detectably bound by HSF1 in non-transformed control cells subjected to heat shock. Some genes are bound by HSF1 both in cancer cells, e.g., malignant cancer cells, and in heat shock conditions. In the case of many genes that are bound in both cancer cells and in non-transformed cells subjected to heat shock, HSF1 binding was found to differ quantitatively, resulting in different effects on transcription in cancer cells as compared with non-transformed cells subjected to heat shock. In some aspects, the present disclosure provides the insight that the broad influence exerted by HSF1 in cancer is not limited to indirect effects occurring through the actions of molecular chaperones like Hsp90 and Hsp70 (whose transcription is induced by HSF1) on their client proteins. Instead HSF1 plays a direct role in rewiring the transcriptome and, thereby, the physiology of cancer cells. As described herein, Applicants defined a genome-wide transcriptional program that HSF1 coordinates in malignancy. This program differs fundamentally from that induced by thermal stress (although some genes are shared between the two programs). Its activation is common in a wide variety of human cancers and is shown herein to be strongly associated with metastasis and death in at least the three cancers responsible for ˜30% of all cancer-related deaths worldwide: those of the breast, colon and lung. Furthermore, the very broad range of tumors in which immunohistochemical evidence of HSF1 activation is observed confirms that it plays a pervasive role throughout tumor biology.
Surprisingly, the types of cellular processes that HSF1 regulates in cancer constitute a diverse array that extends far beyond protein folding. Some of these processes were previously known to be affected by the loss of HSF1 (Dai et al., 2007; Jin et al., 2011; Zhao et al., 2009). To explain such results, a common assumption has been that the effects of HSF1 loss are ultimately due to reduced chaperone activity and altered protein homeostasis (Jin et al., 2011; Meng et al., 2010; Solimini et al., 2007). Applicants find that, in addition to regulating chaperone proteins, HSF1 binds to, and directly regulates, genes underlying diverse cancer-related biological processes. Without wishing to be bound by any theory, the remarkable breadth of the HSF1 cancer program in humans may explains why HSF1 is such a powerful modifier of tumorigenesis in multiple animal models (Dai et al., 2007; Jin et al., 2011; Zhao et al., 2009) and why HSF1 was identified as one of only six potent metastasis-promoting genes in a genome-wide screen for enhancers of invasion by malignant melanoma cells (Scott et al., 2011). Not only is the repertoire of HSF1-regulated genes in cancer much more extensive than just heat-shock genes, but even the manner in which some of the classical heat-shock genes are regulated diverges between cancers and heat shock. For example, while HSP90AA1 (HSP90), HSPD1 (HSP60) and HSPA8 (HSC70) are activated by HSF1 in both situations, regulation of other HSP genes such as HSPA6 (HSP70B′), a pillar of the heat-shock response, differs dramatically in these two states. Following thermal stress, HSPA6 is typically the most highly induced of all mRNAs, yet, surprisingly in cancer, HSPA6 is only bound very weakly by HSF1. Its expression is not significantly changed following HSF1 depletion and its transcript level does not correlate with that of HSF1 in a meta-analysis of 12,000 gene expression experiments (described below). This observation has implications for efforts to better understand the regulation of HSF1 in cancer, and to identify modulators of HSF1 activity in cancer. In some aspects, the present disclosure provides reporters that are more likely to capture elements of HSF1 biology distinct to the malignant state, as compared with the heat shock response, than reporters controlled by the HSPA6 promoter (Boellmann and Thomas, 2010; Stanhill et al., 2006) or reporters controlled by other promoters that are weakly bound or not bound by HSF1 in cancer cells.
Multiple mechanisms may regulate HSF1 activity during the classic heat shock response. These include the release of HSF1 from its normal sequestration by chaperones when unfolded substrates compete for chaperone binding. In addition, HSF1 is also subject to extensive post-translational modifications including acetylation, sumoylation and numerous phosphorylations (Anckar and Sistonen, 2011). Some of these heat-shock regulatory mechanisms are likely to be shared by cancer cells. For instance, impaired protein homeostasis driven by the accumulation of mutant, misfolding-prone oncoproteins (Shimizu et al., 2006) aneuploidy (Tang et al., 2011) and the increased rate of translation in cancer could chronically stimulate HSF1 activation by releasing it from sequestration from chaperones (Anckar and Sistonen, 2011). The present disclosure provides the insight that dysregulation of signaling pathways in cancer may drive post-translational modifications to HSF1 in cancer cells. Some of these signaling pathways (such as those responsible for phosphorylation at serine 326) may also function to post-translationally modify HSF1 in heat-shocked cells, but others will likely be unique to cancer, and in some embodiments, at least some such pathways may be distinct in different cancers. Among the prominent pathways most frequently activated in cancer are the EGFR/HER2 axis (Zhao et al., 2009), the RAS/MAPK pathway (Stanhill et al., 2006), and the insulin/IGFI-like growth factor system (Chiang et al., 2012) have been reported to alter HSF1 activity. Additional modes of cancer-specific regulation may include the binding of co-regulators. As known in the art, HSF1 binds to DNA sequences termed heat shock elements (HSEs). As described herein, many genes in the HSF1 cancer program differ from those of the classic heat shock response in having a different number of HSE repeats and different co-regulator binding sites.
For purposes hereof, a gene characterized in that its regulatory region is detectably bound by HSF1 in at least some cancers or cancer cell lines even in the absence of thermal stress (e.g., at 37 degrees C.) may be referred to as an “HSF1 cancer program” (HSF1-CP) gene. In some embodiments the regulatory region of an HSF1-CP gene is more highly bound by HSF1 in at least some cancers or cancer cell lines as compared with non-transformed control cells subjected to heat shock. In some embodiments, the regulatory region is at least 1.5, 2, 3, 4, 5, 10, 20, or 50-fold more highly bound in cancer cells than in non-transformed heat shocked control cells. In some embodiments, the regulatory region is detectably bound in cancer cells and not detectably bound (i.e., not bound above background levels) on non-transformed heat shocked control cells. In some embodiments the regulatory region of an HSF1-CP gene is more highly bound by HSF1 in a diverse set of cancers or cancer cell lines as compared with non-transformed control cells subjected to heat shock. Certain HSF1-CP genes whose regulatory regions were found to be more highly bound by HSF1 in a highly malignant cell line, as compared with non-transformed control cells subjected to heat shock, are listed in Table T4A and may be referred to herein Group A genes. Certain HSF1-CP genes whose regulatory regions were found to be bound by HSF1 both in a highly malignant cell line (BPLER) and in either of the non-transformed control cells (BPE or HME) subjected to heat shock (but not in non-transformed control cells not subjected to heat shock) are listed in Table T4B and may be referred to herein Group B genes. In some aspects, the terms “strongly bound”, “highly bound”, and similar terms refer to the amount of binding, which may be assessed, e.g., using an appropriate method such as ChIP-on-chip or ChIP-Seq). One of ordinary skill in the art will be aware of suitable computer programs and methods for, e.g., detecting binding peaks, quantifying binding strength, representing results, etc. Exemplary methods of performing ChIP-Seq and analyzing results thereof are provided in the Examples. Other examples may be found in, e.g., Kim H A, et al., A short survey of computational analysis methods in analysing ChIP-seq data. Hum Genomics. 2011 January; 5(2):117-23 or Giannopoulou, E G and Elemento, O., An integrated ChIP-seq analysis platform with customizable workflows, BMC Bioinformatics 2011, 12:277. Gene names as recognized in the art are used in the Tables. As noted above, sequences, e.g., mRNA and polypeptide sequences, in the NCBI Reference Sequence (RefSeq) database may be used as representative gene product sequences for a gene of interest, e.g., the HSF1-CP genes. Genomic sequences of such genes are readily available. Chromosomal locations can be readily retrieved and aligned to a genome build e.g., at the UCSC Genome Browser web site (http://genome.ucsc.edu/). As will be appreciated by those of ordinary skill in the art, in those gene names (e.g., in the Tables) that begin with a “C” followed by a number and include the term “ORF” followed by a number, such as C10ORF4, the number following the C indicates a chromosome, and the number following ORF indicates the number of the open reading frame (e.g., open reading frame 4) on the chromosome of that number (e.g., chromosome 10).
In some embodiments an HSF1-CP gene is characterized in that it is strongly bound by HSF1 in cancer cells. Representative examples of strong and weak binding and of genes that are strongly bound or weakly bound are provided in the Examples and Figures hereof. Representative examples of genes that are bound more strongly in cancer cells than heat shocked cells, bound less strongly in cancer cells than heat shocked cells, or bound to about the same extent in cancer cells and heat shocked cells are provided in the Examples and Figures hereof. Any such genes may be used in a method disclosed herein and/or as a comparator to classify binding as strong or weak and/or to classify binding as stronger in cancer cells than heat shocked cells, weaker in cancer cells than heat shocked cells, or shared (bound at reasonably similar levels in both cancer cells and heat shocked cells) in various embodiments. In some embodiments, “weak binding” is binding at about the same level as HSF1 binds to HSPA6 in metastatic cancer cells such as BPLER cells. In some embodiments, “strong binding” is binding at about the same level as HSF1 binds to HSPA6 in non-transformed heat shocked control cells such as heat shocked BPE cells or binding at about the same level as HSF1 binds to HSPA8 in metastatic cancer cells such as BPLER cells. In some embodiments strong binding is binding at about the same level as HSF1 binds to CKS2, LY6K, or RBM23 in metastatic cancer cells such as BPLER cells. In some embodiments an HSF1-CP gene is among the 5%, 10%, 20%, 30%, 40%, or 50% genes that are most highly bound by HSF1 in cancer cells, e.g., in metastatic cancer cells such as BPLER cells.
In some embodiments a characteristic, property, or result is considered to be present “in cancer” or “in cancer cells” if it is evident in a specific cancer, cancer type, or cancer cell line. In some embodiments a characteristic, property, or result is considered to be present in “cancer” if it is evident in at least some members of a diverse set of cancers or cancer cell lines, e.g., at least 30%, 40%, 50%, 60%, 70%, 80%, 85%, 90%, or more of the members in a diverse set of cancers or cancer cell lines. In some embodiments a measurement representative of “cancer” may be obtained by obtaining an average of values measured in a diverse set of cancers or cancer cell lines. In some embodiments members of a diverse set of cancers or cancer cell lines are randomly selected, or at least not selected with knowledge of whether or not a particular characteristic, property, or result of interest is evident in the cancer or cancer cell line. In some embodiments a diverse set of cancers or cancer cell lines comprises at least 5, 10, 20, 25, 30, 40, 50, 100, 200, 500, or 1,000, or more cancers and/or cancer cell lines. In some embodiments at least some of such cancers and/or cancer cell lines are of different types. For example, in some embodiments a diverse set of cancers or cancer cell lines comprises at least 3, 5, 10, 20, or more cancer types. In some embodiments a diverse set of cancer cell lines includes between 1 and 15 of the following cancer cell lines: BT474, H441, H838, H1703, HCC38, HCC1954, HCT15, HT29, SKBR3, SW620, ZR75-1, BT20, MDA-MB-231, MCF7, T47D cells. In some embodiments a diverse set of cancer cell lines comprises the NCI-60 cancer cell lines, or a randomly selected subset thereof. If desired, cells may be tested to confirm whether they are derived from a single individual or a particular cell line by any of a variety of methods known in the art such as DNA fingerprinting (e.g., short tandem repeat (STR) analysis) or single nucleotide polymorphism (SNP) analysis (which may be performed using, e.g., SNP arrays (e.g., SNP chips) or sequencing), etc. If desired, a cell or cell line, e.g., a cancer cell or cancer cell line, or a tissue sample may be classified as being of a particular type or having a particular tissue of origin based at least in part on expression of characteristic cellular markers, e.g., cell surface markers. Such markers are known to those of ordinary skill in the art. In some embodiments a diverse set of cancer cell lines or cancers comprises solid tumors, e.g., carcinomas and/or sarcomas. In some embodiments a diverse set of cancer cell lines or cancers comprises at least one cancer cell line or cancer that one of ordinary skill in the art would consider representative of adenocarcinomas. In some embodiments a diverse set of cancer cell lines or cancers includes at least one cancer cell line or cancer that one of ordinary skill in the art would consider representative of breast, lung, and colon cancer cell lines or breast, lung, and colon cancers. A cancer or cancer cell line may be represented by a sample, e.g., in a tissue microarray, tissue or cell bank or repository, etc. In some embodiments a cancer or cancer cell line is represented by a dataset, e.g., in a publicly available database such as Oncomine (https://www.oncomine.org/resource/login.html), ArrayExpress (www.ebi.ac.uk/arrayexpress/), NCBI's Gene Expression Omnibus (www.ncbi.nlm.nih.gov/geo/), Celsius (Day, A., et al., Genome Biology 2007, 8:R112; http://celsius.genome.ucla.edu/), or published in the scientific literature. A dataset may comprise, e.g., gene expression information, such as microarray data or RNA-Seq data, DNA binding information such as ChIP-chip or ChIP-Seq data, etc. Exemplary non-transformed cell lines, which may be used as control cells, include, e.g., HME, BPE, and MCF10A. In some embodiments a cell line that has comparable characteristics with respect to heat shock response as such cells may be used. In some embodiments historical control data are used.
Numerous tumor cell lines and non-transformed cell lines, in addition to those exemplified or mentioned herein, are known in the art. Cell lines may be obtained, e.g., from depositories or cell banks such as the American Type Culture Collection (ATCC), Coriell Cell Repositories, Deutsche Sammlung von Mikroorganismen und Zellkulturen (German Collection of Microorganisms and Cell Cultures; DSMZ), European Collection of Cell Cultures (ECACC), Japanese Collection of Research Bioresources (JCRB), RIKEN, Cell Bank Australia, etc. The paper and online catalogs of the afore-mentioned depositories and cell banks are incorporated herein by reference. In some embodiments non-cancer cells, e.g., a non-transformed cell line, originates from normal tissue not showing evidence of cancer. In some embodiments non-cancer cells have not had exogenous genetic material introduced therein. In some embodiments tumor cells, e.g., a tumor cell line, originate from a human tumor. In some embodiments tumor cells, e.g., a tumor cell line, originates from a tumor of a non-human animal, e.g., a tumor that was not produced by introduction of tumor cells into the non-human animal. In some embodiments tumor cells originate from a naturally arising tumor (i.e., a tumor that was not intentionally induced or generated for, e.g., experimental purposes). In some embodiments a cancer cell line or cancer is metastatic. A metastatic cancer cell line may be derived from a metastatic cancer and/or may have been shown to be capable of producing metastases in a non-human animal into which the cells have been introduced. In some embodiments a cancer cell line is highly tumorigenic. For example, the cancer cell line may be capable of giving rise to a tumor upon injection of, on average, between about 100-1,000 cells into an appropriate non-human animal host. In some embodiments experimentally produced tumor cells may be used. In some embodiments an experimentally produced tumor cell may be produced by genetically modifying a non-transformed cell. In some embodiments an engineered tumor cell may be produced from a non-tumor cell by a method that comprises expressing or activating an oncogene in the non-tumor cell and/or inactivating or inhibiting expression of one or more tumor suppressor genes or inhibiting activity of a gene product of a tumor suppressor gene. One of ordinary skill in the art will be aware of numerous oncogenes and tumor suppressor genes and methods of expressing or inhibiting expression thereof. Certain experimentally produced tumor cells and exemplary methods of producing tumor cells are described in PCT/US2000/015008 (WO/2000/073420) and/or in U.S. Ser. No. 10/767,018. In certain embodiments a non-tumor cell may be immortalized by a method comprising causing the cell to express telomerase catalytic subunit (e.g., human telomerase catalytic subunit; hTERT), to produce a non-transformed cell line. In some embodiments a tumor cell may be produced from a non-tumor cell by a method that comprises genetically modifying the non-tumor cell, e.g., by introducing one or more expression vector(s) comprising an oncogene into the cell or modifying an endogenous gene (proto-oncogene or tumor suppressor gene) by a targeted insertion into or near the gene or by deletion or replacement of a portion of the gene. In some embodiments the engineered tumor cell ectopically expresses hTERT, SV40-Large T Ag (LT) and H-Ras (RAS).
In some embodiments an HSF1-CP gene is characterized in that its expression in cancer cells increases or decreases by at least a factor of 1.2, 1.5, 2.0, 2.5, 3.0, 4.0, 5.0, or more following inhibition of HSF1 expression by, e.g., RNA interference. In some embodiments inhibition of HSF1 expression is by at least 25%, 50%, 60%, 70%, 80%, 90%, or more. In some embodiments expression of an HSF1-CP gene by cells in which HSF1 expression is inhibited is measured under conditions in which such inhibition does not result in substantial loss of cell viability (e.g., at a time point before maximum reduction in HSF1 level).
In some aspects, the invention relates to a set of 456 HSF1-CP genes characterized in that their promoter regions were found to be bound by HSF1 across a diverse set of malignant cell lines (see Examples). For purposes hereof such genes may be referred to as an “HSF1 cancer signature set” (sometimes abbreviated herein as HSF1-CSS or HSF1-CaSig) (Table T4C). As described further below, increased average expression of the HSF1-CSS genes was shown to correlate with decreased survival in a variety of representative human cancer types. In some aspects, the invention provides methods of assessing expression of one or more HSF-CSS genes, reagents useful for assessing expression of one or more HSF-CSS genes, and methods of using results of such assessment. In some aspects, subsets of the HSF1-CP genes or HSF1-CSS genes, reagents useful for modulating expression of such subsets, reagents useful for assessing or expression of such subsets, and methods of using results of such assessment, are provided. As used herein, a set C is considered a “subset” of a set D, if all elements (members) of C are also elements of D, but C is not equal to D (i.e. there exists at least one element of D not contained in C). Thus, a subset of the HSF1-CSS includes between 1 and 455 genes of the HSF1-CSS. Any and all such subsets are provided. In some embodiments a subset has between 300 and 400 genes. In some embodiments a subset has between 200 and 300 genes. In some embodiments a subset has between 100 and 200 genes. In some embodiments a subset has between 50 and 100 genes. In some embodiments a subset has between 25 and 50 genes. In some embodiments a subset has between 10 and 25 genes. In some embodiments a subset has between 5 and 10 genes. A subset of the HSF1-CSS genes may be referred to as a “refined HSF1-CSS”. In some aspects, a refined HSF1-CSS is useful for at least some of the same purposes as the full HSF1-CSS. For example, in some embodiments increased average expression of a refined HSF1-CSS correlates with decreased survival. In some embodiments, increased average expression of a refined HSF1-CSS correlates with decreased survival approximately equally well or at least as well as increased average expression of the HSF1-CSS. In some embodiments a refined HSF1-CSS has between 200 and 350 genes. In some embodiments a refined HSF1-CSS has between 100 and 200 genes, e.g., about 150 genes. An exemplary refined HSF1-CSS having 150 genes is presented in Table T4D. In some embodiments a refined HSF1-CSS has between 50 and 100 genes. In some embodiments a refined HSF1-CSS has between 25 and 50 genes. In some embodiments a refined HSF1-CSS has between 10 and 25 genes. In some embodiments a refined HSF1-CSS has between 5 and 10 genes. In some embodiments a subset of the HSF1-CP genes comprises the genes listed in Table T4G, T4H, or T4I.
In some aspects, the invention relates to additional HSF1 cancer signature sets composed of subsets of genes in the HSF1-CP. In some embodiments, a subset of the HSF1-CP genes is composed of HSF1-Module 1 and Module 2 genes. A representative subset of the HSF1-CP genes, which subset is composed of Module 1 and Module 2 genes is presented in Table T4E (this HSF1 cancer signature set is also referred to herein as “HSF1-CaSig2”). Genes in the HSF1-CaSig2 were positively regulated by HSF1 in malignant cells. In some embodiments, a subset of the HSF1-CP genes contains both positively and negatively regulated genes. An exemplary embodiment of such a subset is presented in Table T4F (this HSF1 cancer signature set is also referred to herein as “HSF1-CaSig3”). As described in further detail in the Examples, HSF1-CaSig, HSF1-CaSig2, and HSF1-CaSig3 signatures were strongly associated with patient outcome across multiple tumor types. In aspect herein in which the HSF-CSS genes are used, embodiments are provided in which the HSF-CaSig2 genes (listed in Table T4E) are used unless otherwise indicated or evident from the context. In aspect herein in which the HSF-CSS genes are used, embodiments are provided in which the HSF-CaSig3 genes (listed in Table T4F) are used unless otherwise indicated or evident from the context.
In some embodiments, an HSF1-CSS or refined HSF1-CSS disclosed herein may be further refined. In some embodiments, refinement may be performed by omitting one or more genes from the HSF1-CSS or refined HSF1-CSS to produce a reduced set of genes. The ability of the reduced set of genes to predict patient outcome across multiple datasets representing one or more tumor types can be determined. In some embodiments, a reduced set of genes is at least as effective as the HSF-CaSig, HSF1-CaSig2, or HSF1-CaSig3 genes in predicting patient outcome.
In some embodiments the invention relates to additional HSF1-CSS genes selected from among the HSF1-CP genes. In some embodiments an additional HSF1-CSS may be selected by identifying a subset of HSF1-CP genes composed of at least some HSF1-CP genes that are most positively correlated with poor outcome or composed of at least some HSF1-CP genes that most negatively correlated (anti-correlated) with poor outcome (based on a suitable statistic such as a t-test statistic) in one or more datasets containing tumor gene expression data. In some embodiments an additional HSF1-CSS may be selected by identifying a subset of HSF1-CP genes composed of (i) at least some HSF1-CP genes that are most positively correlated with poor outcome (ii) at least some HSF1-CP genes that most negatively correlated with poor outcome in one or more datasets containing tumor gene expression data. The number of positively and negatively correlated genes may be the same or different. In some embodiments, genes present in the relevant group (i.e., positively correlated with poor outcome or negatively correlated with poor outcome) in at least 50%, 60%, 70%, 80%, 90%, or more of the datasets are used in the additional HSF1-CSS. In some embodiments the ability of an additional HSF1-CSS to predict patient outcome may be validated using one or more tumor gene expression datasets not used for selection of such HSF1-CSS.
In some embodiments, tumor gene expression data that are used to select an additional HSF1-CSS is composed largely (e.g., at least 80%, 90%, 95%) or entirely of data obtained from tumors of a particular tumor type, subtype, or tissue of origin and/or excludes tumors of a particular tumor type, subtype or tissue of origin. Tumors of any tumor type, subtype or tissue of origin may be included or excluded. In some embodiments a tumor subtype is at least in part defined based on expression of one or more markers, molecular features, histopathological features, and/or clinical features, used in the art for tumor classification or staging. For example, in the case of breast cancer, a subtype may be defined based at least in part on expression of ER, PR, HER2/neu, and/or EGFR and/or on lymph node status. In some embodiments, an HSF1 cancer signature set selected using expression data from tumors of one or more selected tumor types, subtypes, or tissues of origin is of particular use for classifying or providing prognostic, diagnostic, predictive, or treatment selection information with regard to tumors of such selected tumor types, subtypes, or tissues of origin, e.g., the CSS may perform particularly well with regard to such tumors as compared with its performance among tumors of other types, subtypes, or tissues of origin. In some embodiments, the CSS is of use for classifying or providing prognostic, diagnostic, predictive, or treatment selection information with regard to tumors of other tumor types, subtypes, or tissues in addition to tumors of the selected type, subtype, or tissue of origin. For example, as described herein, HSF1 cancer signature sets derived from breast tumor expression data are useful in the context of lung and colon tumors, as well as breast tumors. In some embodiments, an HSF1 cancer signature set is selected using expression data from tumors of multiple different tumor types, subtypes, or tissues of origin. In some embodiments such an HSF1 cancer signature set of use in classifying or providing prognostic, diagnostic, predictive, or treatment selection information with regard to tumors of any of multiple selected tumor types, subtypes, or tissues of origin which may include, but not be limited to, tumors of the types, subtypes, or tissues of origin from which the expression data used to obtain the signature was obtained.
Further provided are sets of genes that comprise (a) (i) the HSF1-CSS or (ii) at least one subset of the HSF1-CSS (but not the full HSF1-CSS); and (b) at least one additional gene that is not within the HSF1-CSS. In some embodiments one or more additional gene(s) may be useful for any one or more purposes for which the HSF1-CSS is of use. In some embodiments one or more additional gene(s) may be useful as controls or for normalization.
In some embodiments, a subset of the HSF1-CP comprises or consists of genes that are coordinately regulated in cancer cells. In some embodiments a group of coordinately regulated genes may be referred to as a “module”. In some embodiments coordinately regulated genes are characterized in that their mRNA expression levels correlate across a set of diverse cancer cell lines or cancer samples. In some embodiments the Pearson correlation coefficient of the mRNA expression levels of coordinately regulated genes is at least 0.5, 0.6, or 0.7 across diverse cancer cell lines or cancer samples. In some embodiments coordinately regulated genes are characterized in that their expression level (e.g., as assessed by mRNA level) in cancer cells increases or decreases in the same direction following inhibition of HSF1 expression. In some embodiments, an HSF1-CP module comprises genes involved in protein folding, translation and/or mitosis (Module 1). In some embodiments, an HSF1-CP module comprises RNA binding genes and/or DNA damage binding genes (Module 2). In some aspects, transcription of genes in Module 1 or 2 is positively regulated (activated) by HSF1. In some embodiments, an HSF1-CP module comprises genes involved in immune functions or death receptor signaling (Module 3), insulin secretion (Module 4), or apoptosis, development, or insulin secretion (Module 5). In some aspects, transcription of genes in Module 3, 4, or 5 is negatively regulated (repressed) by HSF1. In some embodiments, modules are based at least in part on datasets that comprise data obtained using multiple probes for at least some genes. In some embodiments, a module is refined by excluding genes for which fewer than 50%, 60%, 70%, 80%, 90%, or more (e.g., 100%) of the probes fall within the module.
In some embodiments a subset of the HSF1-CP genes comprises or consists of genes that are involved in a process, pathway, or structure of interest or have a biological function or activity of interest. In some embodiments a gene may be classified as being involved in a process, pathway, or structure or as having a particular biological function or activity based on annotation in an art-recognized database such as the Gene Ontology database (http://www.geneontology.org/), KEGG database (http://www.genome.jp/kegg/), or Molecular Signatures database (http://www.broadinstitute.org/gsea/msigdb/index.jsp). In some embodiments a subset of the HSF1-CP comprises or consists of genes that are involved in protein folding, stress response, cell cycle, signaling, DNA repair, chromatin remodeling (e.g., chromatin modifying enzymes), apoptosis, transcription, mRNA processing, translation, energy metabolism, adhesion, development, and/or extracellular matrix. In some embodiments a subset of the HSF1-CP comprises or consists of genes that are involved in any of two or more processes, pathways, or structures of interest.
Wherever an aspect or embodiment disclosed herein refers to the HSF1-CP genes and/or HSF1-CSS genes, aspects or embodiments pertaining to each of(l) Group A, (2) Group B, (3) refined HSF1-CSS, (4) Module 1, (5) Module 2, (6) Module 3, (7) Module 4, (8) Module 5, (9) HSF1-CaSig2, (10) HSF1-CaSig3, and (12) subsets of any of the foregoing composed of genes that are more highly bound in cancer cells than in heat shocked, non-transformed control cells, are also disclosed herein, unless otherwise indicated or clearly evident from the context. For purposes of brevity, these individual aspects or embodiments may not always be expressly listed. It will be understood that certain details of such aspects or embodiments may differ depending, e.g., on whether the particular genes in the subset are positively or negatively regulated by HSF1 or positively or negatively correlated with poor (or good) outcome, treatment response, etc. In some aspects, measuring the expression of genes in the HSF1 cancer program is of use to classify cancers, to provide diagnostic or prognostic information. For example, high average expression of a set of genes whose promoter regions are bound by HSF1 in cancer cells (referred to herein as HSF1 cancer signature set (HSF1-CSS) genes) had a remarkable correlation with poor prognosis among multiple cohorts of breast cancer patients. The HSF1-CSS was more significantly associated with outcome than various well established prognostic indicators including the oncogene MYC, the proliferation marker Ki67 and MammaPrint, an expression-based diagnostic tool used in routine clinical practice (Kim and Paik, 2010). Expression of the HSF1-CSS was more strongly associated with poor outcome than any individual HSP transcript or even a panel of HSP genes. The HSF1-CSS was significantly associated with metastatic recurrence in women initially diagnosed with ER+/lymph node negative tumors. Increased expression of the HSF1-CSS in colon and lung cancers was strongly associated with reduced survival and more significantly associated with outcome than any individual HSP transcript or a panel of HSP genes.
In some embodiments, a method of diagnosing cancer in a subject comprises the steps of: determining the level of HSF1-CSS expression in a sample obtained from the subject, wherein increased HSF1-CSS expression in the sample is indicative that the subject has cancer. In some aspects, a method of identifying cancer comprises the steps of: (a) providing a biological sample; and (b) determining the level of HSF1-CSS expression in the sample, wherein increased HSF-CSS expression in the sample is indicative of cancer. In some embodiments a method of diagnosing or identifying cancer comprises comparing the level of HSF1-CSS expression with a control level of HSF1-CSS expression wherein a greater level in the sample as compared with the control level is indicative that the subject has cancer. In some embodiments, a method of assessing a tumor with respect to aggressiveness comprises: determining the level of HSF1-CSS expression in a sample obtained from the tumor, wherein an increased level of HSF1-CSS expression is correlated with increased aggressiveness, thereby classifying the tumor with respect to aggressiveness. In some embodiments the method comprises: (a) determining the level of HSF1-CSS expression in a sample obtained from the tumor; (b) comparing the level of HSF1-CSS expression with a control level of HSF1-CSS expression; and (c) assessing the aggressiveness of the tumor based at least in part on the result of step (b), wherein a greater level of HSF1-CSS expression in the sample obtained from the tumor as compared with the control level of is indicative of increased aggressiveness. In some embodiments, a method of classifying a tumor according to predicted outcome comprising steps of: determining the level of HSF1-CSS expression in a sample obtained from the tumor, wherein an increased level of HSF1-CSS expression is correlated with poor outcome, thereby classifying the tumor with respect to predicted outcome. In some embodiments the method comprises: (a) determining the level of HSF1-CSS expression in a tumor sample; and (b) comparing the level of HSF1-CSS expression with a control level of HSF1-CSS expression, wherein if the level determined in (a) is greater than the control level, the tumor is classified as having an increased likelihood of resulting in a poor outcome. In some embodiments a method of predicting cancer outcome in a subject comprises: determining the level of HSF1-CSS expression in a tumor sample from the subject, wherein an increased level of HSF1-CSS expression is correlated with poor outcome, thereby providing a prediction of cancer outcome. In some embodiments the method comprises (a) determining the level of HSF1-CSS expression in the tumor sample; and (b) comparing the level of HSF1-CSS expression with a control level of HSF1-CSS expression, wherein if the level determined in (a) is greater than the control level, the subject has increased likelihood of having a poor outcome. In some embodiments a method for providing prognostic information relating to a tumor comprises: determining the level of HSF1-CSS expression in a tumor sample from a subject in need of tumor prognosis, wherein if the level of HSF1-CSS expression is increased, the subject is considered to have a poor prognosis. In some embodiments the method comprises steps of: (a) determining the level of HSF1-CSS expression in the sample; and (b) comparing the level with a control level, wherein if the level determined in (a) is greater than the control level, the subject is considered to have a poor prognosis. In some embodiments a method for providing treatment-specific predictive information relating to a tumor comprises: determining the level of HSF1-CSS expression in a tumor sample from a subject in need of treatment-specific predictive information for a tumor, wherein the level of HSF1-CSS expression correlates with tumor sensitivity or resistance to a treatment, thereby providing treatment-specific predictive information. In some embodiments a method for tumor diagnosis, prognosis, treatment-specific prediction, or treatment selection comprises: (a) providing a sample obtained from a subject in need of diagnosis, prognosis, treatment-specific prediction, or treatment selection for a tumor; (b) determining the level of HSF1-CSS expression in the sample; (c) scoring the sample based on the level of HSF1-CSS expression, wherein the score provides diagnostic, prognostic, treatment-specific predictive, or treatment selection information. In some embodiments a control level of HSF1-CSS expression is a level representative of non-tumor tissue. In some embodiments, e.g., in a method for providing prognostic information, assessing tumor aggressiveness, or predicting cancer outcome, a control level of HSF1-CSS expression may be a level representative of tumors that have a good prognosis, low aggressiveness, or low propensity to metastasize or recur. In general, any method known in the art can be used to measure HSF1-CSS expression. For example, microarray analysis, nanostring technology, RNA-Seq, or RT-PCR may be used. In some embodiments a value representing an average expression level representative of the HSF1-CSS is obtained. Expression of an HSF1-CSS gene may be normalized, e.g., using a gene whose expression is not expected to change significantly in cancer versus non-transformed cells. In some embodiments actin is used for normalization. In some embodiments a method comprises classifying a tumor or tumor sample by comparing HSF1-CSS expression in the tumor or tumor sample with HSF1-CSS expression among a representative cohort of tumors that have known outcomes. In some embodiments clustering may be used to position a tumor sample of interest with respect to tumors having known outcomes. In some embodiments, tumors classified among the upper 25% of tumors by average expression level are determined to have a worse prognosis than tumors classified in the lower 75% (or any lower percentile, such as the lower 60%, 50%, 40%, 30%, etc.) In some embodiments a refined HSF1-CSS is used to classify tumors. In some embodiments expression of Module 1 or Module 2 genes is used to classify tumors. In some embodiments a refined HSF1-CSS is listed in Table T4D. In some embodiments HSF1-CaSig2 (Table T4E), or HSF1-CaSig3 (Table T4F) is used to classify tumors.
Without wishing to be bound by any theory, it is likely that the HSF1 cancer program supports the malignant state in a diverse spectrum of cancers because it regulates core processes rooted in fundamental tumor biology that ultimately affect outcome. The broad range of cancer types in which HSF1 is activated suggests that this program may have originated to support basic biological processes. Indeed, the sole heat-shock factor in yeast (yHSF), even at basal temperatures, binds many genes that are involved in a wide-range of core cellular functions (Hahn et al., 2004). These transcriptional targets allow yeast not only to adapt to environmental contingencies but also to modulate metabolism and maintain proliferation under normal growth conditions (Hahn et al., 2004; Hahn and Thiele, 2004). As a result, yHSF is essential for viability, paralleling the importance of HSF1 for the survival of cancer cells (Dai et al., 2007). Activation of HSF1 may also be advantageous in animals in states of high proliferation and altered metabolism such as immune activation and wound healing (Rokavec et al., 2012; Xiao et al., 1999; Zhou et al., 2008). Moreover, in diverse eukaryotes, HSF acts as a longevity factor. However, the evolutionarily ancient role played by HSF1 in helping cells to adapt, survive and proliferate is co-opted frequently to support highly malignant cancers. By enabling oncogenesis, the activation of this ancient pro-survival mechanism thereby actually impairs survival of the host. Without wishing to be bound by any theory, HSF1 activation in a particular tumor may reflect the degree to which accumulated oncogenic mutations have disrupted normal physiology even before overt invasion or metastasis occurs. This interpretation could explain the broad prognostic value of the HSF1-cancer signature across disparate cancers and even at early stages of disease. In some embodiments, the HSF1-CSS finds use as a sensitive measure of the malignant state and prognostic indicator. For example, in some embodiments the HSF1-CSS is of use in identifying tumors that are indolent and do not require intervention (e.g., wherein the tumor would not be expected to invade, metastasize, or progress to a state in which it impairs the functioning or physical condition of a subject or reduces the life expectancy of the subject), reducing the burdens of unnecessary treatment. In some embodiments the HSF1-CSS is of use in providing prognostic information or assessment of aggressiveness for a tumor of unknown tissue type or origin.
In some embodiments, an HSF1 cancer signature set or subset thereof is used to analyze one or more datasets (e.g., publicly available datasets) containing tumor gene expression data, wherein the dataset contains, in addition to gene expression data from tumors, information regarding an outcome or event of interest or one or more tumor characteristics associated with the corresponding tumor or subject having the tumor. In some embodiments, the HSF1 cancer signature set or subset thereof is used to classify tumors based on the expression data (e.g., into groups with high or low expression of the HSF1 cancer signature set or subset thereof). In some aspects, an HSF1 cancer signature set or subset thereof is used to identify or confirm a correlation between HSF1 activity and an outcome or event of interest in cancer (e.g., a poor outcome, good outcome, development of metastasis, survival, response (or lack of response) to a particular treatment, etc.) or one or more tumor characteristics. The predictive power of HSF1 activity with regard to an outcome of interest in cancer or one or more tumor characteristics may thus be identified or confirmed using an HSF1 cancer signature set or subset thereof as an indicator of HSF1 activity. In some aspects, the use of an HSF1 cancer signature set or subset thereof as a surrogate for HSF1 cancer-related activity leverages the availability of tumor gene expression datasets to identify or confirm a correlation between HSF1 activity and an outcome of interest in cancer or one or more tumor characteristics. In some embodiments, detection of HSF1 protein expression or activation (e.g., using IHC) is then used to apply such correlation to additional tumors, e.g., for purposes of providing prognostic, predictive, diagnostic, or treatment selection information.
As noted above, HSF1 binds to heat shock elements (HSEs). In some embodiments an HSE comprises two or more adjacent inverted repeats of the sequence 5′-n1GAAn5-3′, where n1 and n5 are independently A, G, C, or T, so that a single inverted repeat consists of 5′-n−5TTCn−1n1GAAn5-3′(SEQ ID NO.1), wherein n−1 is complementary to n1 and n−5 is complementary to n5. In some aspects, the disclosure relates to the discovery that regulatory regions of HSF1-CP genes that are strongly bound in cancer cells but not in heat shocked cells are enriched for HSEs that comprise exactly 3 inverted repeats, e.g., each having the sequence 5′-n-5TTCn−1n1GAAn5-3′(SEQ ID NO.1), wherein n−1 is complementary to n1 and n−5 is complementary to n5. In some embodiments at least one of the inverted repeats has the sequence 5′-AGAAn5-3′, so that a single inverted repeat consists of ‘5’-n−5TTCTAGAAn5-3′(SEQ ID NO.2). In some embodiments at least one of the inverted repeats has the sequence 5′-GGAA n5-3′, so that a single inverted repeat consists of 5′-n−5TTCCGGAAn5-3′(SEQ ID NO.3). In some embodiments 2 of the inverted repeats are directly adjacent to each other (i.e., there are no intervening nucleotides). In some embodiments each of the inverted repeats is directly adjacent to at least one other inverted repeat. In some aspects, the disclosure relates to the discovery that regulatory regions of HSF1-CP genes that are strongly bound in cancer cells but not in heat shocked cells are enriched for binding sites for the transcription factor YY1 (Gene ID: 7528 (human); Gene ID: 22632 (mouse)). YY1 is a widely or ubiquitously distributed transcription factor belonging to the GLI-Kruppel class of zinc finger proteins and is involved in repressing and activating a diverse number of promoters. YY1 may direct histone deacetylases and histone acetyltransferases to a promoter in order to activate or repress the promoter, thus histone modification may play a role in the function of YY1. In some embodiments a YY binding site comprises or consists of GCnGCCA, wherein n is A, G, C, or T. In some aspects, the disclosure relates to the discovery that regulatory regions strongly bound in heat-shocked cells but not cancer cells are enriched for expanded HSEs, containing a fourth inverted repeat of 5′-n1GAAn5-3′ and for binding sites for the transcription factor AP1/Fos (NFE2L2). In some embodiments an AP1/Fos (NFE2L2) binding element comprises or consists of TGACTnA, wherein n is A, G, C, or T. In some embodiments n is C or A. In some aspects, the disclosure provides methods based, in some embodiments, at least in part on the identification of distinct patterns of transcription factor binding sites in genes that are strongly bound by HSF1 in cancer cells versus in heat-shocked cells. In some embodiments, methods of monitoring HSF1 cancer-related activity and methods of identifying modulators of HSF1 cancer-related activity are provided. In some embodiments reporter constructs are provided. In some embodiments, such methods and reporter constructs allow monitoring of HSF1 activity and/or identification of HSF1 modulators that are at least somewhat specific for HSF1 activity in cancer cells relative to heat shocked cells. For example, such modulators may inhibit HSF1 activity in cancer cells to a significantly greater extent than in heat shocked control cells and/or may selectively inhibit HSF1 binding or regulation of genes that are more strongly bound in cancer cells than in heat shocked control cells as compared with genes that are less strongly bound in cancer cells than in heat shocked control cells.
In some aspects, the invention provides an isolated nucleic acid comprising at least one YY binding site and an HSE that comprises exactly 3 inverted repeats. In some embodiments the sequence of the isolated nucleic acid comprises the sequence of at least a portion of a regulatory region of a Group A gene, Group B gene, Module 1 gene, Module 2 gene, Module 3 gene, Module 4 gene, Module 5 gene, HSF1-CaSig2 gene, HSF1-CaSig3 gene, refined HSF1-CSS gene, or HSF1-CSS gene that is more highly bound by HSF1 in cancer cells than in heat shocked non-transformed control cells. In some embodiments, the sequence of the isolated nucleic acid comprises the sequence of at least a portion of a promoter region of a Group A gene, Group B gene, Module 1 gene, Module 2 gene, Module 3 gene, Module 4 gene, Module 5 gene, refined HSF1-CSS gene, or HSF1-CSS gene that is more highly bound by HSF1 in cancer cells than in heat shocked non-transformed control cells. In some embodiments the gene is positively regulated by HSF1 in cancer cells. In some embodiments the gene is strongly bound in cancer cells and weakly bound or not bound in non-transformed heat shocked control cells. In some embodiments, the sequence of the isolated nucleic acid comprises the sequence of at least a portion of a distal regulatory region of a Group A gene, Module 1 gene, Module 2 gene, Module 3 gene, Module 4 gene, Module 5 gene, HSF1-CaSig2 gene, HSF1-CaSig3 gene, refined HSF1-CSS gene, or HSF1-CSS gene that is more highly bound by HSF1 in cancer cells than in heat shocked non-transformed control cells. In some embodiments the gene is negatively regulated by HSF1 in cancer cells.
In some embodiments the invention provides an isolated nucleic acid comprising at least a portion of a regulatory region of a Group A gene, Module 1 gene, Module 2 gene, Module 3 gene, Module 4 gene, Module 5 gene, HSF1-CaSig2 gene, HSF1-CaSig3 gene, refined HSF1-CSS gene, or HSF1-CSS gene that is more highly bound by HSF1 in cancer cells than in heat shocked non-transformed cells, wherein the at least a portion of a regulatory region comprises an HSE. In some embodiments the isolated nucleic acid comprises at least a portion of a regulatory region of a Group A gene, Module 1 gene, Module 2 gene, Module 3 gene, Module 4 gene, Module 5 gene, HSF1-CaSig2 gene, HSF1-CaSig3 gene, refined HSF1-CSS gene, or HSF1-CSS gene that is more highly bound by HSF1 in cancer cells than in heat shocked non-transformed cells, wherein the at least a portion of a regulatory region comprises an HSE. In some embodiments the sequence of the nucleic acid comprises the sequence of at least a portion of a promoter region of a Group A gene, Module 1 gene, Module 2 gene, Module 3 gene, Module 4 gene, Module 5 gene, HSF1-CaSig2 gene, HSF1-CaSig3 gene, refined HSF1-CSS gene, or HSF1-CSS gene that is more highly bound by HSF1 in cancer cells than in heat shocked non-transformed control cells. In some embodiments the gene is positively regulated by HSF1 in cancer cells. In some embodiments the gene is strongly bound in cancer cells and weakly bound or not bound in non-transformed heat shocked control cells. In some embodiments the gene is HSPA8. In some embodiments the gene is CKS2, LY6K, or RBM23. In some embodiments an HSF1-CP gene is among the 5%, 10%, 20%, 30%, 40%, or 50% genes that are most highly bound by HSF1 in cancer cells, e.g., in metastatic cancer cells such as BPLER cells. In some embodiments, the sequence of the isolated nucleic acid comprises the sequence of at least a portion of a distal regulatory region of a Group A gene, Module 1 gene, Module 2 gene, Module 3 gene, Module 4 gene, Module 5 gene, HSF1-CaSig2 gene, HSF1-CaSig3 gene, refined HSF1-CSS gene, or HSF1-CSS gene that is more highly bound by HSF1 in cancer cells than in heat shocked non-transformed control cells. In some embodiments the gene is negatively regulated by HSF1 in cancer cells. In some embodiments the HSE comprises exactly 3 inverted repeats and, in some embodiments, further comprises a YY1 binding site. The HSE and YY binding site can be positioned in any order in various embodiments. In some embodiments the HSE and YY binding site are separated by up to 50 nt, 100 nt, 200 nt, 500 nt, 1 kB, 2 kB, 3 kB, 4 kB, 5 kB, 6 kB, 7 kB, 8 kB, 9 kB, or 10 kB.
In some embodiments of any of the afore-mentioned isolated nucleic acids, the isolated nucleic acid does not comprise an AP1/Fos (NFE2L2) binding site.
In some embodiments any of the afore-mentioned isolated nucleic acids comprise a binding site for RNA polymerase II and sufficient nucleic acid sequences for assembly of a transcription pre-initiation complex (Lee T I, Young R A (2000). “Transcription of eukaryotic protein-coding genes”. Annu. Rev. Genet. 34: 77-137; Kornberg R D (2007). “The molecular basis of eukaryotic transcription”. Proc. Natl. Acad. Sci. U.S.A. 104 (32): 12955-61).
In some embodiments an isolated nucleic acid is between 50 nucleotides (nt) and 20 kB long. In some embodiments an isolated nucleic acid is at least 100 nt, 200 nt, 500 nt, 1 kB, 2 kB, 3 kB, or 5 kB long and/or the isolated nucleic acid is up to 500 nt, 1 kB, 2 kB, 3 kB, 4 kB, 5 kB, 10 kB, or 20 kB long. All specific lengths and ranges are expressly contemplated. For example, in some embodiments the isolated nucleic acid is between 200 nt and 500 nt, between 500 nt and 1 kB, between 1 kB and 2 kB, between 2 kB and 3 kB, between 3 kB and 4 kB between 4 kB and 5 kB, between 5 kB and 10 kB etc. In some embodiments an isolated nucleic acid comprises at least a portion of a transcribed region of an HSF1-CP gene. In some embodiments an isolated nucleic acid comprises at least a portion of a coding region of an HSF1-CP gene. In some embodiments an isolated nucleic acid does not comprises a portion of a transcribed region of an HSF1-CP gene. For example, in some embodiments the sequence of an isolated nucleic acid comprises a sequence that lies upstream of (5′ with respect to) the transcription start site of an HSF1-CP gene. In some embodiments an isolated nucleic acid does not comprise a portion of a coding region of an HSF1-CP gene. In some embodiments the sequence of an isolated nucleic acid comprises a sequence that lies downstream of (3′ with respect to) the coding region, polyadenylation site, or transcribed portion of an HSF1-CP gene.
In some embodiments an isolated nucleic acid comprises at least a portion of a regulatory region of an HSF1-CP gene. In some aspects, a regulatory region comprises any nucleic acid sequence on the same piece of DNA as a transcription start site (TSS) of a gene that affects, e.g., direct, enhances, or represses transcription originating from such TSS. In some embodiments a regulatory region is located within 20 kB upstream or downstream of a TSS. In some embodiments a regulatory region is located within 20 kB upstream or downstream of a transcription termination site or DNA sequence corresponding to a polyadenylation site of a transcribed RNA. In some embodiments a regulatory region is located within 10 kB upstream or downstream of a TSS. In some embodiments a regulatory region is located within 10 kB upstream or downstream of a transcription termination site or DNA sequence corresponding to a polyadenylation site of a transcribed RNA. In some embodiments a regulatory region comprises a promoter region, comprising, e.g., a binding site for an RNA polymerase II and sufficient nucleic acid sequences for assembly of a transcription pre-initiation complex. In some embodiments a promoter region is located within −8 kB to +2 kB of the transcription start site (TSS) of a gene. In some embodiments a promoter region is located within −7 kB, −6 kB, −5 kB, −4 kB, −3 kB, or −2 kB, up to the TSS, +1 kB, or +2 kB of the TSS of a gene. In some embodiments a regulatory region is a distal regulatory region. In some embodiments a distal regulatory region is located beyond 2 kB and up to 8 kB downstream of the end of the coding region, end of the transcribed portion of a gene, or DNA sequence corresponding to a polyadenylation site of an RNA transcribed from such gene. In some embodiments the sequence of an isolated nucleic acid comprises or consists of a sequence that lies within −8, −6, −5, or −2 kb from the transcription start site (TSS) to either +5, +6, +8, or +10 kb from the TSS of an HSF1-CP gene. In some embodiments the sequence of an isolated nucleic acid comprises or consists of a sequence that lies within −8, −6, −5, or −2 kb from the transcription start site (TSS) to either +2, +5, +6, or +8 10 kb from the end of a coding region, end of the transcribed portion of an HSF1-CP gene, or DNA sequence corresponding to a polyadenylation site of an RNA transcribed from such gene. The sequence may be of any of the lengths mentioned in the preceding paragraph, in various embodiments.
In some aspects, the invention provides a nucleic acid construct comprising any of the afore-mentioned isolated nucleic acids and a nucleic acid sequence that encodes a reporter molecule. Such a nucleic acid construct may be referred to herein as an HSF1-CP reporter. A reporter molecule may comprise any genetically encodable detectable label (RNA or protein). In some embodiments, the reporter molecule is operably linked to the nucleic acid comprising an HSE. In some aspects, the invention provides vectors comprising any of the afore-mentioned isolated nucleic acids or nucleic acid constructs.
In some aspects, the invention provides cells comprising any of the afore-mentioned isolated nucleic acids, nucleic acid constructs, or vectors. A cell may be prokaryotic (e.g., bacterial) or eukaryotic (e.g., fungal, insect, vertebrate, avian, mammalian, human, etc.). In some embodiments a cell is of a species that is known to get cancer, e.g., an avian or mammalian cell. In some embodiments a prokaryotic, fungal, plant, or insect cell may be useful to, e.g., propagate a vector, produce a molecule, identify a protein-protein interaction, etc. In some embodiments a cell is a primary cell, non-immortal cell, immortal cell, non-cancer cell, or cancer cell. In some embodiments the nucleic acid construct or vector (or at least a portion thereof comprising the HSEs and the sequence encoding the reporter molecule) is integrated into the genome of the cell. In some embodiments cell lines derived from the cell or from a population of such cells are provided. In some embodiments any cell or cell line may be genetically modified by introducing a nucleic acid or vector encoding a polypeptide comprising HSF1 or a variant or fragment thereof. In some embodiments the nucleic acid encoding HSF1 is operably linked to expression control elements (e.g., a promoter) sufficient to direct expression in the cell. In some embodiments expression is regulatable, e.g., inducible. In some embodiments the polypeptide is a fusion protein comprising HSF1 or a variant or fragment thereof and a heterologous polypeptide. In some embodiments the heterologous polypeptide comprises a detectable protein or epitope tag. The heterologous polypeptide may be used, e.g., to assess HSF1 expression or localization, monitor alterations in HSF1 expression or localization over time, to isolate HSF1 from cells, etc. In some embodiments, the cell's endogenous HSF1 gene may be mutated or at least in part deleted. In some embodiments an HSF1 variant is a functional variant. In some embodiments an HSF1 variant is at least 90%, 95%, 96%, 97%, 98%, 99%, 99.5%, or more identical to HSF1 across at least 50%/., 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, 99.5%, or the full length of HSF1. In some embodiments computer programs such as BLAST2, BLASTN, BLASTP, Gapped BLAST, etc., may be used to generate alignments and/or to obtain a percent identity (See, e.g., Karlin and Altschul, Proc. Natl. Acad. Sci. USA 87:22264-2268, 1990; Karlin and Altschul, Proc. Natl. Acad Sci. USA 90:5873-5877,1993; Altschul, et al., J. Mol. Biol. 215:403-410, 1990; Altschul, et al. Nucleic Acids Res. 25: 3389-3402, 1997). When utilizing such programs, the default parameters of the respective programs may be used. See the Web site having URL www.ncbi.nlm.nih.gov and/or McGinnis, S. and Madden, T L, W20-W25 Nucleic Acids Research, 2004, Vol. 32, Web server issue. In some embodiments no more than 20%, 10%, 5%, or 1% of positions in either sequence or in both sequences over a window of evaluation are occupied by a gap.
In some aspects, a cell comprising an HSF1-CP reporter is useful to assess HSF1 cancer-related activity, to identify modulators of HSF1 cancer-related activity, or to assess or monitor the effect of any agent on HSF1 cancer-related activity. In some embodiments a cell contains at least two such isolated nucleic acids, nucleic acid constructs, or vectors, wherein the at least two isolated nucleic acids, nucleic acid constructs, or vectors each comprises at least a portion of a regulatory region of an HSF1-CP gene, and wherein the reporter molecules are distinguishable. In some embodiments, this allows, e.g., assessment of expression regulated by each of multiple different regulatory regions of HSF1-CP genes in a given cell. In some embodiments a test agent that affects expression regulated by each of such regulatory regions is identified. In some embodiments a cell is a member of a population of cells, e.g., a population of cells obtained from a sample, or members of a cell line. It will be understood that various compositions disclosed herein may comprise a population of cells, and various methods herein may be practiced using a population of cells. For example, a measurement of DNA binding or a measurement of expression or assessing a test agent may be performed on or using a population of cells. Wherever relevant, aspects and embodiments pertaining to individual cells and aspects and embodiments pertaining to populations of cells are encompassed within the scope of the present disclosure. In some embodiments a population of cells is about 10, 102, 103, 104, 105, 106, 107, 108, 109, cells, or more.
Certain aspects of the invention comprise or use a detectable label that comprises a detectable protein. For example, in some embodiments a reporter molecule comprises a detectable protein. In some embodiments a detectable protein comprises a fluorescent or luminescent protein. In some embodiments a detectable protein comprises an enzyme, e.g., an enzyme capable of catalyzing a reaction that converts a substrate to a detectable substance or otherwise produces a detectable event. Those of ordinary skill in the art will be aware of many such proteins and methods of detecting them and using them to, e.g., produce nucleic acid constructs useful for monitoring expression and/or monitoring activity of regulatory sequences contained in such constructs. Fluorescent proteins include, e.g., green fluorescent protein (GFP) from the jellyfish Aequorea victoria, related naturally occurring green fluorescent proteins, and related proteins such as red, yellow, and cyan fluorescent protein. Many of these proteins are found in diverse marine animals such as Hydrozoa and Anthozoa species, crustaceans, comb jellies, and lancelets. See, e.g., Chalfie, M. and Kain, S R (eds.) Green fluorescent protein: properties, applications, and protocols (Methods of biochemical analysis, v. 47). Wiley-Interscience, Hoboken, N.J., 2006, and/or Chudakov, D M, et al., Physiol Rev. 90(3):1103-63, 2010, for further information and references. In some embodiments, a detectable protein is monomeric. Examples of fluorescent proteins include Sirius, Azurite, EBFP2, TagBFP, mTurquoise, ECFP, Cerulean, TagCFP, mTFP1, mUkG1, mAG1, AcGFP1, TagGFP2, EGFP, mWasabi, EmGFP, TagYPF, EYFP, Topaz, SYFP2, Venus, Citrine, mKO, mKO2, mOrange, mOrange2, TagRFP, TagRFP-T, mStrawberry, mRuby, mCherry, mRaspberry, mKate2, mPlum, mNeptune, T-Sapphire, mAmetrine, mKeima, mTomato. See Chudakov D M (cited above). In some embodiments a detectable protein comprises a luciferase. “Luciferase” refers to members of a class of enzymes that catalyze reactions that result in production of light. Luciferases are found in a variety of organisms including a variety of marine copepods, beetles, and others. Examples of luciferases include, e.g., luciferase from species of the genus Renilla (e.g., Renilla reniformis (Rluc), or Renilla mulleri luciferase), luciferase from species of the genus Gaussia (e.g., Gaussia princeps luciferase, Metridia luciferase from species of the marine copepod Metridia, e.g., Metridia longa, luciferase from species of the genus Pleuromamma, beetle luciferases (e.g. luciferase of the firefly Photinus pyralis or of the Brazilian click beetle Pyrearinus termitilluminans), etc. In some embodiments, a fluorescent or luminescent protein or luciferase is an engineered variant of a naturally occurring protein. Such variants may, for example, have increased stability (e.g., increased photostability, increased pH stability), increased fluorescence or light output, reduced tendency to dimerize, oligomerize, or aggregate, an altered absorption/emission spectrum (in the case of a fluorescent protein) and/or an altered substrate utilization. See, e.g., Chalfie, M. and Kain, S R (cited above) for examples. For example, the A. Victoria GFP variant known as enhanced GFP (eGFP) may be used. See, e.g., Loening, A M, et al., Protein Engineering, Design and Selection (2006) 19 (9): 391-400, for examples. In some embodiments a sequence is codon optimized for expression in cells of interest, e.g., mammalian cells. In some embodiments a detectable protein comprises a signal sequence that directs secretion of the protein. In some embodiments the secreted protein is soluble. In some embodiments the secreted protein remains attached to the cell. In some embodiments a detectable protein lacks a functional signal sequence. In some embodiments a signal sequence is at least in part removed or modified to render it nonfunctional or is at least in part replaced by a signal sequence endogenous to or functional in cells of interest, e.g., mammalian cells.
In some aspects, the disclosure provides methods of identifying agents, genes, gene products, and/or pathways that modulate HSF1 activity in cancer cells. In some embodiments a regulator of HSF1 activity regulates HSF1 expression, activation, or otherwise alters at least one activity performed by HSF1 in cancer cells. An activity performed by HSF1 in cancer cells may be referred to herein as an “HSF1 cancer-related activity”. In some embodiments an HSF1 cancer-related activity comprises modulating (e.g., activating or repressing) transcription of an HSF1-CP gene. In some embodiments an HSF1 cancer-related activity comprises binding to a regulatory region of an HSF1-CP gene. In some embodiments an HSF1 cancer-related activity is specific to cancer cells. In some embodiments an HSF1 cancer-related activity is not specific to cancer cells. For example, the activity may occur both in cancer cells and in non-transformed cells subjected to stress, e.g., thermal stress. “Thermal stress” is used interchangeably herein with “heat shock” and refers to exposing cells to elevated temperature (i.e., temperature above physiologically normal) for a sufficient period of time to detectably, e.g., robustly, induce the heat shock response. In some embodiments heat shock comprises exposing cells to a temperature of 42±0.5 degrees C. for about 1 hour or similar exposures to elevated temperatures (above 40 or 41 degrees C.) resulting in similar or at least approximately equivalent induction of the heat shock response. In some embodiments cells are allowed to recover for up to about 60 minutes, e.g., about 30 minutes, at sub-heat shock temperature, e.g., 37 degrees C., prior to isolation of RNA or DNA. In some embodiments assessment of the effect of heat shock on expression may occur after allowing an appropriate amount of time for translation of a transcript whose expression is induced by HSF1.
In some embodiments the level of an HSF1 activity is expressed as an absolute level. In some embodiments the level of an HSF1 activity is expressed as a relative level. For example, activation or repression of an HSF1-CP gene by HSF1 in cancer cells may be expressed as a fold-increase or fold-decrease in expression relative to a reference value. In some embodiments a reference value for a level of an activity is the level of the relevant activity in non-cancer cells not subjected to heat shock. In some embodiments a reference value is the level of the relevant activity in cells in which expression or activity of functional HSF1 is inhibited.
In some embodiments an HSF1 cancer-related activity is detectable in cancer cells and is not detectable in heat shocked non-cancer cells. In some embodiments the level of an HSF1 cancer-related activity is detectably greater in cancer cells than in heat shocked non-cancer cells and is not detectably greater in heat-shocked non-cancer cells than in non-cancer cells maintained under normal conditions. In some embodiments an HSF1 cancer-related activity is detectable in cancer cells and in heat shocked non-cancer cells. In some embodiments the level of an HSF1 cancer-related activity is significantly greater in cancer cells and in heat shocked non-cancer cells than in non-cancer cells maintained under normal conditions. In some embodiments the level of an HSF1 cancer-related activity is greater in cancer cells than in non-cancer cells subjected to heat shock. In some embodiments a first level (e.g., a level of an HSF1 cancer-related activity in cancer cells) is greater than a second level (e.g., a level of an HSF1 cancer-related activity in non-cancer cells) by a statistically significantly amount. In some embodiments a first level is greater than a second level by a factor of at least 1.1., 1.2, 1.3, 1.4, 1.5, 1.75, 2.0, 2.5, 3.0, 4.0, 5.0, 7.5, 10, 15, 20, 25, 50, 100, or more.
Modulators of HSF1 Cancer-Related Activity
In addition to its value in classification and prognosis, HSF1 is a promising target for cancer therapeutics. The protein's widespread activation in many different tumor types augurs a broad range of clinical applications. In this regard, the homogeneity of HSF1 expression throughout entire sections of tumors is notable. Pre-existing heterogeneities for the expression of many recently identified therapeutic targets has emerged as a major factor contributing to the emergence of resistance (Gerlinger et al., 2012). Without wishing to be bound by any theory, the uniform reliance of cancer cells on HSF1 activity for proliferation and survival suggests that HSF1-targeted therapeutics may be less susceptible to this liability.
In some aspects, the invention provides methods of identifying candidate modulators (e.g., candidate inhibitors or enhancers) of HSF1 cancer-related activity. In some embodiments a method of identifying a candidate modulator of HSF1 cancer-related activity comprises: (a) providing a nucleic acid comprising at least a portion of a regulatory region a gene, wherein the regulatory region is bound by HSF1 in cancer cells; (b) contacting the nucleic acid with a test agent; and (c) assessing the level of expression of the gene or the level of activity of a gene product of the gene, wherein the test agent is identified as a candidate modulator of HSF1 activity if the level of expression of the gene or the level of activity of a gene product of the gene differs from a control level. In some embodiments the method comprises providing a cell that contains the nucleic acid construct and contacting the cell with the test agent. In some embodiments the cell is a tumor cell. In some embodiments the regulatory region is operably linked to a nucleic acid sequence that encodes a reporter molecule, and assessing the level of expression of the gene comprises assessing the level or activity of the reporter molecule.
In some embodiments a method of identifying a candidate modulator of HSF1 cancer-related activity comprises steps of: (a) contacting a cell that expresses HSF1 with a test agent; (b) measuring the level of an HSF1 cancer-related activity exhibited by the cell; and (c) determining whether the test agent modulates the HSF1 cancer-related activity, wherein a difference in the level of the HSF1 cancer-related activity in the presence of the test agent as compared to the level in the absence of the test agent identifies the agent as a candidate modulator of HSF1 cancer-related activity. In some embodiments the HSF1 cancer-related activity is binding to a regulatory region of a HSF1-CP gene. In some embodiments the HSF1 cancer-related activity is expression of a HSF1-CP gene. In some embodiments the HSF1-CP gene is a Group A gene, Group B gene, HSF1-CSS gene, HSF1-CaSig2 gene, HSF1-CaSig3 gene, refined HSF1-CSS gene, Module 1 gene, Module 2 gene, Module 3 gene, Module 4 gene, or Module 5 gene, wherein the gene is more highly bound by HSF1 in cancer cells than in heat shocked non-transformed control cells. In some embodiments the HSF1 cancer-related activity is measured by measuring expression of an HSF1-CP reporter. In some embodiments an HSF1 cancer-related activity exhibited by a cell may be assessed while the cell is alive (e.g., by detecting a fluorescent reporter molecule). In some embodiments an HSF1 cancer-related activity exhibited by a cell may be assessed in a sample obtained from the cell (e.g., DNA, RNA, cell lysate, etc.).
In some embodiments, a test agent is identified as an inhibitor of HSF1 cancer-related activity if it inhibits binding of HSF1 to a regulatory region of at least 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, 450 or at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 99%, or all HSF1-CP genes, Group A genes, Group B genes, HSF1-CSS genes, HSF1-CaSig2 genes, HSF1-CaSig3 genes, refined HSF1-CSS genes, Module 1 genes, Module 2 genes, Module 3 genes, Module 4 genes, or Module 5 genes or inhibits expression of one or more genes that are positively regulated by HSF1 in cancer cells or increases expression of one or more genes that are negatively regulated by HSF1 in cancer cells.
In some embodiments any of the methods comprises comparing the effect of a test agent on HSF1 binding to, or regulation of, an HSF1-CP gene in cancer cells and in heat shocked non-transformed control cells. In some embodiments the HSF1-CP gene is one that is bound in both cancer cells and in heat shocked non-transformed control cells. Such methods may be used, e.g., to identify agents that selectively affect, e.g., inhibit, HSF1 activity in cancer cells.
The term “agent” is used interchangeably with “compound” herein. Any of a wide variety of agents may be used as a test agent in various embodiments. For example, an agent, e.g., a test agent, may be a small molecule, polypeptide, peptide, nucleic acid, oligonucleotide, lipid, carbohydrate, or hybrid molecule. In some embodiments an oligonucleotide comprises an siRNA, shRNA, antisense oligonucleotide, aptamer, or random oligonucleotide. In some embodiments a cDNA comprises a full length cDNA. In some embodiments a cDNA comprises a portion of a full length cDNA, wherein the portion retains at least some of the functional activity of the full length cDNA.
Agents can be obtained from natural sources or produced synthetically. Agents may be at least partially pure or may be present in extracts or other types of mixtures. Extracts or fractions thereof can be produced from, e.g., plants, animals, microorganisms, marine organisms, fermentation broths (e.g., soil, bacterial or fungal fermentation broths), etc. In some embodiments, a compound collection (“library”) is tested. A compound library may comprise natural products and/or compounds generated using non-directed or directed synthetic organic chemistry. In some embodiments a library is a small molecule library, peptide library, peptoid library, cDNA library, oligonucleotide library, or display library (e.g., a phage display library). In some embodiments a library comprises agents of two or more of the foregoing types. In some embodiments oligonucleotides in an oligonucleotide library comprise siRNAs, shRNAs, antisense oligonucleotides, aptamers, or random oligonucleotides.
A library may comprise, e.g., between 100 and 500,000 compounds, or more. In some embodiments a library comprises at least 10,000, at least 50,000, at least 100,000, or at least 250,000 compounds. In some embodiments compounds of a compound library are arrayed in multiwell plates. They may be dissolved in a solvent (e.g., DMSO) or provided in dry form, e.g., as a powder or solid. Collections of synthetic, semi-synthetic, and/or naturally occurring compounds may be tested. Compound libraries can comprise structurally related, structurally diverse, or structurally unrelated compounds. Compounds may be artificial (having a structure invented by man and not found in nature) or naturally occurring. In some embodiments compounds that have been identified as “hits” or “leads” in a drug discovery program and/or analogs thereof. In some embodiments a library may be focused (e.g., composed primarily of compounds having the same core structure, derived from the same precursor, or having at least one biochemical activity in common). Compound libraries are available from a number of commercial vendors such as Tocris BioScience, Nanosyn, BioFocus, and from government entities such as the U.S. National Institutes of Health (NIH). In some embodiments, an “approved human drug” or compound collection comprising one or more approved human drugs is tested. An “approved human drug” is an agent that has been approved for use in treating humans by a government regulatory agency such as the US Food and Drug Administration, European Medicines Evaluation Agency, or a similar agency responsible for evaluating at least the safety of therapeutic agents prior to allowing them to be marketed. A test agent may be, e.g., an antineoplastic, antibacterial, antiviral, antifungal, antiprotozoal, antiparasitic, antidepressant, antipsychotic, anesthetic, antianginal, antihypertensive, antiarrhythmic, antiinflammatory, analgesic, antithrombotic, antiemetic, immunomodulator, antidiabetic, lipid- or cholesterol-lowering (e.g., statin), anticonvulsant, anticoagulant, antianxiety, hypnotic (sleep-inducing), hormonal, or anti-hormonal drug, etc. In some embodiments an agent has undergone at least some preclinical or clinical development or has been determined or predicted to have “drug-like” properties. For example, an agent may have completed a Phase I trial or at least a preclinical study in non-human animals and shown evidence of safety and tolerability. In some embodiments an agent is not an agent that is found in a cell culture medium known or used in the art, e.g., for culturing vertebrate, e.g., mammalian cells, e.g., an agent provided for purposes of culturing the cells, or, if the agent is found in a cell culture medium known or used in the art, the agent may be used at a different, e.g., higher, concentration when used in a method or composition described herein. In some embodiments a test agent is not an agent known in the art as being useful for treating tumors (e.g., for inhibiting tumor cell survival or proliferation or for inhibiting tumor maintenance, growth, or progression) or for treating side effects associated with chemotherapy. In some embodiments a test agent is not a compound that binds to and inhibits Hsp90. In some embodiments a test agent has at least one known target or biological activity or effect. For example, the test agent may be a receptor ligand (e.g., an agonist or antagonist), enzyme inhibitor (e.g., a kinase inhibitor). In some embodiments a test agent is capable of binding to HSF1 or is tested for ability to bind to HSF1. In some embodiments the HSF1 is purified from cancer cells.
In some embodiments the effect of overexpression or knockdown (reduced expression) of one or more genes on an HSF1 cancer-related activity is assessed. In some embodiments one or more cDNAs, RNAi agents (e.g., siRNAs, microRNAs, or shRNAs), or antisense agents whose sequence corresponds to a gene is used as a test agent. In some embodiments the cDNA, RNAi agent, or antisense agent is directly introduced into cells. In some embodiments the cDNA, RNAi agent, or antisense agent is introduced into cells by introducing a nucleic acid construct or vector comprising a sequence that encodes the cDNA, RNAi agent, or antisense agent, operably linked to appropriate expression control elements (e.g., a promoter) to direct expression in cells of interest. The cDNA, RNAi agent, or antisense agent is then expressed intracellularly. In some embodiments, if cells into which the cDNA, RNAi agent, or antisense agent is introduced exhibit an alteration in expression of an HSF1 reporter molecule or exhibit altered HSF1 activity, the agent is identified as a candidate modulator of HSF1 cancer-related activity. In some embodiments, if cells into which the cDNA, RNAi agent, or antisense agent is introduced exhibit an alteration exhibit an alteration in expression of an HSF1 reporter molecule or exhibit altered HSF1 activity, the gene to which the agent corresponds is identified as a candidate genetic modifier of HSF1 cancer-related activity. In some embodiments, if cells into which the cDNA, RNAi agent, or antisense agent is introduced exhibit an alteration in expression of an HSF1 reporter molecule or exhibit altered HSF1 activity, a gene product of the gene to which the agent corresponds is identified as a candidate modulator of HSF1 cancer-related activity. In some embodiments a library of such agents is tested. In some embodiments the library comprises test agents whose sequences correspond to at least 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, or more (e.g., all) of the genes in the genome of an organism or species of interest (e.g., human, mouse). In some embodiments the library comprises test agents whose sequences correspond to at least 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, or more (e.g., all) of the members of a focused subset of the genes in the genome of an organism or species of interest (e.g., human, mouse), wherein the focused subset consists of genes that can be classified into the same functional category, have the same or a similar biochemical activity (e.g., catalyze the same biochemical reaction), participate in the same pathway or process etc. Examples of focused subsets include kinases (e.g., protein kinases), phosphatases, chromatin modifying enzymes, transcription factors, transcriptional co-regulators, G protein coupled receptors, small GTPases, cell surface receptors, signal transduction proteins, and subsets of any of the foregoing. It will be understood that a gene may fall into multiple subsets.
In some embodiments, a method is of use to identify one or more genes and/or gene products that regulate HSF1. In some embodiments gene products that play a direct or indirect role in expression, post-translational modification, or nuclear localization, of HSF1 (and/or genes that encode such gene products) may be identified. For example, a kinase that phosphorylates HSF1 and thereby regulates (e.g., activates) HSF1 activity may be identified. In some embodiments gene products that physically interact with HSF1 (and/or genes that encode such gene products) may be identified. For example, a transcriptional co-activator that cooperates with HSF1 to activate or repress transcription of one or more HSF1-CP genes may be identified. In some embodiments, such proteins are targets for drug development.
In some aspects, disclosed herein are methods of identifying a post-translational modification of HSF1, wherein the post-translational modification potentially regulates HSF1 cancer-related activity. As used herein, the term “post-translational modification” (PTM) encompasses any alteration to a polypeptide that occurs in cells during or after translation of mRNA that encodes the polypeptide. Examples of PTMs include covalent addition of a moiety to a side chain or terminus (e.g., phosphorylation, glycosylation, SUMOylation, methylation, acetylation, acylation (e.g., fatty acid acylation), ubiquitination, Neddylation), altering the chemical identity of an amino acid, or site-specific cleavage. In some embodiments a PTM is catalyzed by a cellular enzyme. A PTM may be described by the name of the particular modification and the site (position) within the polypeptide at which the modification occurs. A “PTM pattern” refers to the presence of a PTM at each of two or more sites in a single protein molecule. PTMs in a PTM pattern may be the same (e.g., phosphorylation at each of multiple sites) or at least some of them may differ (e.g., a phosphorylation at a first site and a SUMOylation at a second site). A site of potential post-translational modification is any site that is compatible with being post-translationally modified. For example, serine, threonine, tyrosine, and histidine residues are potential phosphorylation sites in eukaryotic cells. In some embodiments a PTM site occurs within a consensus sequence for an enzyme that catalyzes the PTM.
In some embodiments a method of identifying a PTM of HSF1 comprises identifying PTMs or PTM patterns that differ in HSF1 in or isolated from cancer cells as compared to HSF1 in or isolated from non-cancer cells comprises: (a) comparing the extent to which a PTM or PTM pattern occurs in HSF1 of cancer cells with the extent to which it occurs in HSF1 of non-cancer cells, and (b) identifying the PTM or PTM pattern as a PTM or PTM pattern that differs in cancer if the extent to which the PTM or PTM pattern occurs in HSF1 of cancer cells differs from the extent to which it occurs in HSF1 of non-cancer cells. In some embodiments, step (b) comprises (i) obtaining HSF1 isolated from cancer cells and measuring the PTM or PTM pattern; and (ii) obtaining HSF1 isolated from non-cancer cells and measuring the s the PTM or PTM pattern. In some embodiments a historical value is used for either or both measurements of the PTM or PTM pattern. In some embodiments the method comprises isolating HSF1 from cancer cells and/or non-cancer cells. In some embodiments cancer cells and/or non-cancer cells are subjected to heat shock for at least a period of time within the 1, 2, 3, 4, 6, 8, 12, 16, 24, 36, or 48 hours prior to isolation of HSF1. In some embodiments cancer cells and non-cancer cells are not subjected to heat shock within the 1, 2, 3, 4, 6, 8, 12, 16, 24, 36, or 48 hours prior to isolation of HSF1 or, if subjected to heat shock within such time period, have returned to a state that does not differ significantly from that of non-heat shocked cells. Any suitable method can be used to identify or measure a PTM or PTM pattern. Useful methods include, e.g., amino acid sequencing, peptide mapping, use of modification state-specific antibodies or other binding agents, mass spectrometry (MS) analysis (e.g., MS/MS), etc. In some embodiments site-directed mutagenesis is used to identify a PTM that affects HSF1 cancer-related activity. For example, an amino acid that is a site of PTM in cancer cells may be altered to a different amino acid that is not post-translationally modified. The variant may be tested for at least one HSF1 cancer-related activity. If the alteration affects HSF1 cancer-related activity, then the PTM is of potential functional significance to HSF1 cancer-related activity. In some embodiments, a gene product that catalyzes a functionally significant HSF1 PTM is a target of interest for drug development. In some embodiments a PTM or PTM pattern comprises phosphorylation at S121, S230, S292, S303, S307, S314, S319, S326, S344, S363, S419, and/or S444.
In some aspects, disclosed herein are methods of identifying PTMs or PTM patterns that affect the localization or activity of HSF1 in cancer cells. In some embodiments a PTM or PTM pattern selectively affects localization or activity of HSF1 in cancer cells. The PTM or PTM pattern may occur differentially in cancer cells as compared to non-cancer cells and/or may have a different effect on HSF1 localization or activity in cancer cells as compared to its effect in non-cancer cells.
In some aspects, disclosed herein are methods of identifying intracellular molecules, e.g., RNAs or proteins, that interact with HSF1, e.g., in a cancer-specific manner. Any of a variety of methods for detecting protein-protein interactions or protein-RNA interactions may be used. In some embodiments such molecules may be identified by immunoprecipitating HSF1 in cancer cells and in non-transformed heat shocked cells, and identifying molecules that are enriched or specifically present in HSF1 immunoprecipitates from cancer cells as compared with HSF1 immunoprecipitates from non-transformed heat shocked cells. In some embodiments a method comprises performing a two-hybrid screen using HSF1 as a bait in cancer cells and in non-cancer heat shocked control cells, and identifying molecules that are enriched or specifically interact with HSF1 in cancer cells as compared with HSF1 in non-transformed heat shocked cells. In some embodiments a protein fragment complementation assay or a luminescence-based mammalian interactome mapping (LUMIER) assay may be used. In some embodiments a fusion protein comprising (a) HSF1 or a variant or fragment thereof; and (b) a detectable protein is used.
In some embodiments a high throughput screen (HTS) is performed. High throughput screens often involve testing large numbers of test agents with high efficiency, e.g., in parallel. For example, tens or hundreds of thousands of agents may be routinely screened in short periods of time, e.g., hours to days. Such screening is often performed in multiwell plates (sometimes referred to as microwell or microtiter plates or microplates) containing, e.g., 96, 384, 1536, 3456, or more wells or other vessels in which multiple physically separated depressions, wells, cavities, or areas (collectively “wells”) are present in or on a substrate. Different test agent(s) may be present in or added to the different wells. It will be understood that some wells may be empty, may comprise replicates, or may contain control agents or vehicle. High throughput screens may involve use of automation, e.g., for liquid handling, imaging, and/or data acquisition or processing, etc. In some embodiments an integrated robot system comprising one or more robots transports assay-microplates from station to station for, e.g., addition, mixing, and/or incubation of assay constituents (e.g., test agent, target, substrate) and, in some embodiments, readout or detection. A HTS system may prepare, incubate, and analyze many plates simultaneously. Certain general principles and techniques that may be applied in embodiments of a HTS are described in Macarrón R & Hertzberg R P. Design and implementation of high-throughput screening assays. Methods Mol Biol., 565:1-32, 2009 and/or An W F & Tolliday N J., Introduction: cell-based assays for high-throughput screening. Methods Mol Biol. 486:1-12, 2009, and/or references in either of these. Exemplary methods are also disclosed in High Throughput Screening: Methods and Protocols (Methods in Molecular Biology) by William P. Janzen (2002) and High-Throughput Screening in Drug Discovery (Methods and Principles in Medicinal Chemistry) (2006) by Jorg H{umlaut over (ν)}ser. Test agent(s) showing an activity of interest (sometimes termed “hits”) may be retested and/or, optionally (e.g., depending at least in part on results of retesting) selected for further testing, development, or use.
In some embodiments one or more “confirmatory” or “secondary” assays or screens may be performed to confirm that a test agent identified as a candidate modulator in an initial (“primary”) assay or screen modulates a target molecule of interest (e.g., HSF1) or modulates an activity of interest (e.g., HSF1 cancer-related activity) or to measure the extent of modulation or to assess specificity. Confirmatory testing may utilize the same assay or a different assay as that used to identify the test agent. The exact nature of the confirmatory testing may vary depending on a variety of factors such as the nature of the primary assay, the nature of the candidate modulator, etc. One of ordinary skill in the art will be able select one or more assays sufficient to reasonably confirm to the satisfaction of those of ordinary skill in the art that an agent indeed modulates a selected target molecule or activity of interest. In some embodiments a candidate modulator that has given satisfactory results upon confirmatory testing may be referred to as a “confirmed modulator”. In some embodiments a test agent that exhibits a reasonable degree of specificity for a selected target molecule (e.g., HSF1) or activity of interest (e.g., HSF1 cancer-related activity) may be identified or selected, e.g., for further testing or development or use.
In some embodiments one or more agents identified as a candidate modulator or confirmed modulator of HSF1 cancer-related activity may be selected for, e.g., further testing, development, or use. For example, an agent that is determined or predicted to have higher potency, greater selectivity for a target of interest (e.g., HSF1 or an endogenous regulator of HSF1), one or more drug-like properties, potential for useful modification, or any other propert(ies) of interest, e.g., as compared with one or more other hits, e.g., as compared with the majority of other hits, may be selected. A selected agent may be referred to as a “lead”. Further testing may comprise, e.g., resynthesis or re-ordering of a hit, retesting of the original hit preparation or resynthesized or newly ordered preparation in the same or a different assay, etc. Development of an agent may comprise producing an altered agent. In some embodiments a pharmacophore is identified based on structures of multiple hit compounds, which may be used to design additional compounds (e.g., structural analogs). In some embodiments any of the methods may comprise producing an altered agent, e.g., an altered lead agent. In some embodiments a method comprises modifying an agent to achieve or seek to achieve an alteration in one or more properties, e.g., (1) increased affinity for a target of interest; (2) decreased affinity for a non-target molecule, (3) increased solubility (e.g., increased aqueous solubility); (4) increased stability (e.g., in vivo); (5) increased potency; (6) increased selectivity, e.g., for a target molecule or for tumor cells, e.g., a higher selectivity for tumor versus non-tumor cells; (7) a decrease in one or more side effects (e.g., decreased adverse side effects, e.g., decreased toxicity); (8) increased therapeutic index; (9) one or modified pharmacokinetic properties (e.g., absorption, distribution, metabolism and/or excretion); (10) modified onset of therapeutic action or duration of effect; (11) modified, e.g., increased, oral bioavailability; (12) modified, e.g., increased, tissue or tumor penetration; (13) modified, e.g., increased, cell permeability; (14) modified, e.g., increased, delivery to a selected subcellular organelle; (15) modified, e.g., increased, increased ability to cross the blood-brain barrier (increased ability to cross the blood-brain barrier may be desirable in some embodiments if use of the agent to treat central nervous system (CNS) tumors, e.g., brain tumors, is contemplated; decreased ability to cross the blood-brain barrier may be desirable in some embodiments if the agent has adverse effects on the CNS); (16) altered plasma protein binding (e.g., to albumin, alpha-1 acid glycoprotein, α, β, γ globulins, etc.).
In some embodiments any of the methods may further comprise determining an in vitro activity or in vivo activity or toxicology profile of an agent or altered agent. One or more additional alterations may be performed, e.g., based at least in part on such analysis. Multiple cycles of alteration and testing may be performed, thereby generating additional altered agents. In some embodiments any of the methods may further comprise performing a quantitative structure activity relationship analysis of multiple hit, lead, or altered agents. In some embodiments alteration may be accomplished through at least partly random or non-predetermined modification, predetermined modification, and/or using computational approaches. An altered agent, e.g., an altered lead agent, may be produced using any suitable method. In some embodiments an agent or an intermediate obtained in the course of synthesis of the agent may be used as a starting material for alteration. In some embodiments an altered agent may be synthesized using any suitable materials and/or synthesis route. In some embodiments alteration may make use of established principles or techniques of medicinal chemistry, e.g., to predictably alter one or more properties. In some embodiments, a first library of test agents is screened using any of the methods described herein, one or more test agents that are “hits” or “leads” is identified, and at least one such hit or lead is subjected to systematic structural alteration to create a second library of compounds structurally related to the hit or lead. In some embodiments the second library is then screened using methods described herein or other methods.
In some embodiments any of the methods may comprise producing an altered agent, e.g., an altered lead agent, by modifying an agent to incorporate or be attached to a label, which may optionally be used to detect or measure the agent or a metabolite of the agent, e.g., in a pharmacokinetic study. In some embodiments any of the methods may comprise producing an altered agent, e.g., an altered lead agent, by modifying an agent to incorporate or be attached to a second moiety (or more than two moieties). In some embodiments a second (or additional) moiety comprises a linker, tag, or targeting moiety. In some embodiments a second (or additional) moiety may modify one or more properties (1)-(16) listed above. In some embodiments a modification may cause increased delivery of the agent to or increased accumulation of the agent at a site of desired activity in the body of a subject. A site may be, e.g., a tumor, organ, tissue, or cell type.
In some embodiments any of the methods may comprise producing a composition by formulating an agent (e.g., a test agent, candidate HSF1 modulator, altered agent, candidate anti-tumor agent, etc.) or two or more agents with a pharmaceutically acceptable carrier.
In some embodiments any of the methods may comprise testing the effect of an agent (e.g., a test agent, candidate HSF1 modulator, altered agent, etc.) on one or more tumor cell lines. In some embodiments an agent is tested in a diverse set of cancers or cancer cell lines. Any cancer or cancer cell line can be used. Exemplary cancers and cancer cell lines are discussed herein. Tumor cells may be maintained in a culture system comprising a culture medium to which an agent is added or has been added. The effect of the agent on tumor cell viability, proliferation, tumor-initiating capacity, or any other tumor cell property may be assessed. In general, any suitable method known in the art may be used for assessing tumor cell viability or proliferation or tumor-initiating capacity in various embodiments. In certain embodiments survival and/or proliferation of a cell or cell population, e.g., in cell culture, may be determined by: a cell counting assay (e.g., using visual inspection, automated image analysis, flow cytometer, etc.), a replication assay, a cell membrane integrity assay, a cellular ATP-based assay, a mitochondrial reductase activity assay, a BrdU, EdU, or H3-Thymidine incorporation assay, a DNA content assay using a nucleic acid dye, such as Hoechst Dye, DAPI, Actinomycin D, 7-aminoactinomycin D or propidium iodide, a cellular metabolism assay such as resazurin (sometimes known as AlamarBlue or by various other names), MTT, XTT, and CellTitre Glo, etc., a protein content assay such as SRB (sulforhodamine B) assay; nuclear fragmentation assays; cytoplasmic histone associated DNA fragmentation assay; PARP cleavage assay; TUNEL staining; or annexin staining.
It will be understood that inhibition of cell proliferation or survival by a useful agent may or may not be complete. For example, cell proliferation may, or may not, be decreased to a state of complete arrest for an effect to be considered one of inhibition or reduction of cell proliferation. In some embodiments, “inhibition” may comprise inhibiting proliferation of a cell that is in a non-proliferating state (e.g., a cell that is in the GO state, also referred to as “quiescent”) and/or inhibiting proliferation of a proliferating cell (e.g., a cell that is not quiescent). Similarly, inhibition of cell survival may refer to killing of a cell, or cells, such as by causing or contributing to necrosis or apoptosis, and/or the process of rendering a cell susceptible to death. The inhibition may be at least about 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 99%, or 100% of a reference level (e.g., a control level). In some embodiments an agent is contacted with tumor cells in an amount (e.g., at a concentration) that inhibits tumor cell proliferation or survival by a selected amount, e.g., by at least about 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 99%, or 100% of a reference level (e.g., a control level).
In some embodiments an anti-tumor effect is inhibition of the capacity of tumor cells to form colonies in suspension culture. In some embodiments an anti-tumor effect is inhibition of capacity of the one or more tumor cells to form colonies in a semi-solid medium such as soft agar or methylcellulose. In some embodiments an anti-tumor effect is inhibition of capacity of the one or more tumor cells to form tumor spheres in culture. In some embodiments an anti-tumor effect is inhibition of the capacity of the one or more tumor cells to form tumors in vivo.
In some embodiments any of the methods may comprise testing an agent in vivo, by administering one or more doses of the agent to a subject, e.g., a subject harboring a tumor cell or tumor, and evaluating one or more pharmacokinetic parameters, evaluating the effect of the agent on the subject (e.g., monitoring for adverse effects) and/or evaluating the effect of the agent on the growth and/or survival of the cancer cell in the subject. It will be understood that the agent may be administered in a suitable composition comprising the agent. In some embodiments any of the methods may comprise testing an agent in a tumor model in vivo, by administering one or more doses of the composition to a non-human animal (“test animal”) that serves as a tumor model and evaluating the effect of the agent on the tumor in the subject. In some embodiments a test animal is a non-human mammal, e.g., a rodent such as a mouse, rat, hamster, rabbit, or guinea pig; a dog, a cat, a bovine or ovine, a non-human primate (e.g., a monkey such as a cynomolgus or rhesus monkey). By way of example, certain in vivo tumor models are described in U.S. Pat. No. 4,736,866; U.S. Ser. No. 10/990,993; PCT/US2004/028098 (WO/2005/020683); and/or PCT/US2008/085040 (WO/2009/070767). Introduction of one or more cells into a subject (e.g., by injection or implantation) may be referred to as “grafting”, and the introduced cell(s) may be referred to as a “graft”. In general, any tumor cells may be used in an in vivo tumor model in various embodiments. Tumor cells may be from a tumor cell line or tumor sample. In some embodiments tumor cells originate from a naturally arising tumor (i.e., a tumor that was not intentionally induced or generated for, e.g., experimental purposes). In some embodiments experimentally produced tumor cells may be used. The number of tumor cells introduced may range, e.g., from 1 to about 10, 102, 103, 104, 105, 106, 107, 108, 109, or more. In some embodiments the tumor cells are of the same species or inbred strain as the test animal. In some embodiments the tumor cells may originate from the test animal itself. In some embodiments the tumor cells are of a different species than the test animal. For example, the tumor cells may be human cells. In some embodiments, a test animal is immunocompromised, e.g., in certain embodiments in which the tumor cells are from a different species to the test animal or originate from an immunologically incompatible strain of the same species as the test animal. For example, a test animal may be selected or genetically engineered to have a functionally deficient immune system or may be treated (e.g., with radiation or an immunosuppressive agent or surgery such as removal of the thymus) so as to reduce immune system function. In some embodiments, a test animal is a SCID mouse, NOD mouse, NOD/SCID mouse, nude mouse, and/or Rag1 and/or Rag2 knockout mouse, or a rat having similar immune system dysfunction. Tumor cells may be introduced at an orthotopic or non-orthotopic location. In some embodiments tumor cells are introduced subcutaneously, under the renal capsule, or into the bloodstream. Non-tumor cells (e.g., fibroblasts, bone marrow derived cells), an extracellular matrix component or hydrogel (e.g., collagen or Matrigel®), or an agent that promotes tumor development or growth may be administered to the test animal prior to, together with, or separately from the tumor cells. Tumor cells may be contacted with an agent prior to grafting and/or following grafting (by administering the agent to the test animal). The number, size, growth rate, metastasis, or other properties may be assessed at one or more time points following grafting. In some embodiments a tumor in an in vivo tumor model arises due to neoplastic transformation that occurs in vivo, e.g., at least in part as a result of one or more mutations existing or occurring in a cell in vivo. In some embodiments a test animal is a tumor-prone animal. The animal may, for example, be of a species or strain that naturally has a predisposition to develop tumors and/or may be a genetically engineered animal. For example, the animal may be a genetically engineered animal at least some of whose cells comprise, as a result of genetic modification, at least one activated oncogene and/or in which at least one tumor suppressor gene has been functionally inactivated. Standard methods of generating genetically modified animals, e.g., transgenic animals that comprises exogenous genes or animals that have an alteration to an endogenous gene, e.g., an insertion or an at least partial deletion or replacement (sometimes referred to as “knockout” or “knock-in” animal) may be used.
An agent may be administered by any route or regimen in various embodiments. For example, the agent can be administered prior to, concomitant with, and/or following the administration of tumor cells or development of a tumor. An agent can be administered regularly throughout the course of the testing period, for example, one, two, three, four, or more times a day, weekly, bi-weekly, or monthly, beginning before or after tumor cells have been administered, in other embodiments, the agent is administered continuously to the subject (e.g., intravenously or by release from an implant, pump, sustained release formulation, etc.). The dose of the agent to be administered can depend on multiple factors, including the type of agent, weight of the test animal, frequency of administration, etc. Determination of dosages is routine for one of ordinary skill in the art. In some embodiments doses are 0.01 mg/kg-200 mg/kg (e.g., 0.1-20 mg/kg or 1-10 mg/kg). The test animal may be used to assess effect of the agent or a combination of agents on tumor formation, tumor size, tumor number, tumor growth rate, progression (e.g., local invasion, regional or distant metastasis), etc. In some embodiments a non-human animal is used to assess efficacy, half-life, clearance, metabolism, and/or toxicity of an agent or combination of agents. Methods known in the art can be used for such assessment. For example, tumor number, size, growth rate, or metastasis may, for example, be assessed using various imaging modalities, e.g., X-ray, magnetic resonance imaging, functional imaging, e.g., of metabolism (e.g., using PET scan), etc. In some embodiments tumor(s) may be removed from the body (e.g., at necropsy) and assessed (e.g., tumors may be counted, weighed, and/or size (e.g., dimensions) measured). In some embodiments the size and/or number of tumors may be determined non-invasively. For example, in certain tumor models, tumor cells that are fluorescently labeled (e.g., by expressing a fluorescent protein such as GFP) can be monitored by various tumor-imaging techniques or instruments, e.g., non-invasive fluorescence methods such as two-photon microscopy. The size of a tumor implanted subcutaneously can be monitored and measured underneath the skin.
In some embodiments, an agent may be contacted with tumor cells ex vivo, and the tumor cells are then introduced into a test animal that serves as a tumor model. The ability of the agent to inhibit tumor development, tumor size, or tumor growth is assessed. The agent may or may not also be administered to the subject.
In some embodiments samples or data may be acquired at multiple time points, e.g., during or after a dose or series of doses. In some embodiments a suitable computer program may be used for data analysis, e.g., to calculate one or more pharmacokinetic parameters. In certain embodiments, the subject is a mouse, rat, rabbit, dog, cat, sheep, pig, non-human primate, or human.
In some aspects, a computer-readable medium is provided. In some embodiments a computer-readable medium stores at least some results of a screen to identify agents that modulate, e.g., inhibit, HSF1 cancer-related activity. The results may be stored in a database and may include one or more screening protocols, results obtained from a screen, predicted properties of hits, leads, or altered leads, or results of additional testing of hits, leads, or altered leads.
In some embodiments an agent capable of causing a decrease in level or activity of a target, e.g., HSF1, of at least 25%, 50%, 75%, 90%, 95%, 99%, or more when used in a suitable assay at a concentration equal to or less than approximately 1 mM, 500 μM, 100 μM, 50 μM, 10 μM, 5 μM, 1 μM, 500 nM, 100 nM, 50 nM, 10 nM, 5 nM, 1 nM, 0.5 nM, or 0.1 nM may be screened for, identified, produced, provided, or used.
In some embodiments an agent capable of causing a decrease of at least 25%, 50%, 75%, 90%, 95%, 99%, or more in tumor cell survival or proliferation (i.e., a decrease to 75%, 50%, 25%, 10%, 5%, 1% or less of the number of viable cells that would be expected in the absence of the agent) when used in a suitable cell culture system at a concentration equal to or less than approximately 1 mM, 500 μM, 100 μM, 50 μM, 10 μM, 5 μM, 1 μM, 500 nM, 100 nM, 50 nM, 10 nM, 5 nM, 1 nM, 0.5 nM, or 0.1 nM may be screened for, identified, produced, provided, or used. In some embodiments a decrease is between 50% and 75%, between 75% and 90%, between 90% and 95%, between 95% and 100%. A decrease of 100% may be a reduction to background levels or essentially no viable cells or no cell proliferation. In general, any suitable method for assessing tumor cell survival or proliferation may be used.
In some embodiments, genes and/or gene products that regulate HSF1 cancer-related activity are targets of interest for drug development. For example, in some embodiments an inhibitor or activator of a gene product that modulates HSF1 activity in cancer cells is of use to modulate HSF1 cancer-related activity. As but one example, a kinase that phosphorylates HSF1 in cancer cells and thereby increases activity or nuclear localization of HSF1 would be a target of interest for identification and/or development of an inhibitor of the kinase. Such an inhibitor may be useful to inhibit HSF1 in cancer cells, e.g., in cell culture and/or in subjects in need of treatment for cancer. In some embodiments, a screen is performed to identify an inhibitor or activator of a gene product identified as a modulator of HSF1 cancer-related activity. Such a screen may be performed using similar test agents and methods as described above. It will be understood that details of a screen may depend at least in part on the identity of the particular gene product. For example, if the gene product has an enzymatic activity, the screen may utilize a composition comprising the gene product and a substrate of the gene product and may seek to identify test agents that affect utilization or modification of the substrate when present in the composition. Test agents identified as inhibitors or activators of gene products that modulate HSF1 cancer-related activity may be confirmed as modulators of HSF1 cancer-related activity and/or may be tested in an in vitro or in vivo tumor model.
In some aspects, methods of identifying candidate therapeutic agents, e.g., candidate anti-tumor agents are provided. In some embodiments an inhibitor of HSF1 cancer-related activity is a candidate anti-tumor agent. For example, an agent that has been assessed, e.g., by a method described herein, and determined to modulate, e.g., inhibit, HSF1 cancer-related activity, may be considered a candidate therapeutic agent, e.g., a candidate anti-tumor agent. A candidate anti-tumor agent that has been assessed in an ex vivo or in vivo tumor model and has been determined to inhibit tumor cell survival or proliferation or to inhibit tumor development, maintenance, growth, invasion, metastasis, resistance to chemotherapy, recurrence, or otherwise shown a useful anti-tumor effect may be considered an anti-tumor agent. An anti-tumor agent may be tested in a clinical trial in a population of subjects in need of treatment for cancer to confirm its therapeutic utility or further define subject characteristics or tumor characteristics that correlate with (e.g., are predictive of) efficacy or to identify particularly effective agents, combinations, doses, etc. In some embodiments, methods disclosed herein may identify agents that increase HSF1 expression or activity. Agents that increase HSF1 activity may find use as, e.g., cell protective agents (e.g., for neuroprotection, cardioprotection, etc.), longevity-increasing agents, anti-aging agents, etc. For example, increasing HSF1 activity may be useful in protecting cells subjected to stress due to injury, disease, or exposure to cytotoxic or cell damaging agents or in individuals who have mutations or polymorphisms that result in abnormally low HSF1 functional activity, e.g., under stress conditions.
Wherever relevant herein, a difference between two or more values (e.g., measurements) or groups, or a relationship between two or more variables, may be statistically significant. For example, a difference in, or level of inhibition or reduction of, binding, expression, activity, cell proliferation, cell survival, tumor size, tumor number, tumor growth rate, tumor metastasis, e.g., as compared with a reference or control level, may be statistically significant. As used herein, “statistically significant” may refer to a p-value of less than 0.05 using an appropriate statistical test. One of ordinary skill in the art will be aware of appropriate statistical tests and models for assessing statistical significance, e.g., of differences in measurements, relationships between variables, etc., in a given context. Exemplary tests and models include, e.g., t-test, ANOVA, chi-square test, Wilcoxon rank sum test, log-rank test, Cox proportional hazards model, etc. In some embodiments multiple regression analysis may be used. In some embodiments, a p-value may be less than 0.025. In some embodiments, a p-value may be less than 0.01. In some embodiments a two-sided statistical test is used. In some embodiments, a result or outcome or difference between two or more values is “statistically significant” if it has less than a 5%, less than a 2.5%, or less than a 1% probability of occurring by chance. In some embodiments, a difference between two or more values or a relationship between two or more variables may be statistically significant with a p-value of less than 0.05, less than 0.025, or less than 0.01. In some embodiments, values may be average values obtained from a set of measurements obtained from different individuals, different samples, or different replicates of an experiment. Software packages such as SAS, GraphPad, etc., may be used for performing statistical analysis. It will be understood that any values may be appropriately normalized in some embodiments In some aspects, disclosed herein are a composition, nucleic acid construct, or cell comprising: (a) a first isolated nucleic acid comprising a sequence that encodes HSF1; and (b) a second isolated nucleic acid comprising a sequence that encodes YY1. In some aspects, disclosed herein are a composition, nucleic acid construct, or cell comprising: (a) a first agent that modulates expression or activity of HSF1; and (b) a second agent that modulates expression or activity of YY1. In some embodiments the first agent inhibits expression or activity of HSF1 and the second agent inhibits expression or activity of YY1. In some embodiments the first agent and the second agent comprise nucleic acids. In some embodiments the first agent and the second agent comprise RNAi agents.
In some aspects, disclosed herein is a method of modulating expression of an HSF1-CP gene, the method comprising contacting a cell with a first agent that modulates expression or activity of HSF1 and a second agent that modulates expression or activity of YY1. In some embodiments the first agent inhibits expression or activity of HSF1. In some embodiments the first and second agents inhibit expression or activity of HSF1 and YY1, respectively. In some embodiments the first and second agents are RNAi agents. In some embodiments, modulating expression or activity of HSF1 and YY1 may have additive or synergistic effects on, e.g., cancer cell viability or proliferation. In some embodiments, assessing YY1 expression or activity may be useful in conjunction with an HSF1-based assay or method, e.g., for diagnostic, prognostic, treatment selection or other purposes.
Kits and Systems
In some aspects, the invention provides kits comprising reagents suitable for performing an assay to assess HSF1 expression or HSF1 activation, e.g., for use in a method of the invention. Such kits may contain, e.g., (i) a probe or primer (optionally labeled and/or attached to a support) for detecting, reverse transcribing, and/or amplifying an HSF1 RNA, (e.g, HSF1 mRNA); (ii) a probe or primer for detecting, reverse transcribing, and/or amplifying an RNA (e.g., mRNA) transcribed from an HSF1-regulated gene; (iii) an antibody that binds to an HSF1 polypeptide (e.g., for use in IHC); (iv) one or more control reagents; (v) a detection reagent such as a detectably labeled secondary antibody or a substrate; (vi) one or more control or reference samples that can be used for comparison purposes or to verify that a procedure for detecting HSF1 expression or activation is performed appropriately or is giving accurate results. A control reagent can be used for negative or positive control purposes. A control reagent may be, for example, a probe or primer that does not detect or amplify HSF1 mRNA or an antibody that does not detect HSF1 polypeptide or a purified HSF1 polypeptide or portion thereof(e.g., an HSF1 peptide). A probe, primer, antibody, or other reagent may be attached to a support, e.g., a bead, slide, chip, etc.
In some embodiments, a kit comprises any one or more isolated nucleic acids, nucleic acid constructs, vectors, or cells disclosed herein. In some embodiments a kit comprises reagents suitable for assessing expression of one or more HSF1-CP genes. Such kits may contain, for each of one or more HSF1-CP genes, e.g., (i) a probe or primer (optionally labeled and/or attached to a support) for detecting, reverse transcribing, and/or amplifying an RNA (e.g., mRNA) transcribed from an HSF1-CP gene; (ii) a binding agent, e.g., an antibody, that binds to an HSF1-CP polypeptide (e.g., for use in IHC); (iii) one or more control reagents; (iv) a detection reagent such as a detectably labeled secondary antibody or a substrate; (v) one or more control or reference samples that can be used for comparison purposes or to verify that a procedure for detecting HSF1-CP expression or activity is performed appropriately or is giving accurate results.
In some embodiments a kit comprises probes, primers, binding agents, or other primary detection reagents suitable for detecting multiple HSF1-CP mRNA or polypeptides, wherein the probes, primers, binding agents, or other primary detection reagents are attached to a support, e.g., a bead, slide, chip, etc. In some embodiments the primary detection reagents are arranged in an array format, e.g., in mutually perpendicular rows and columns.
In some embodiments the kit comprises a microarray, e.g., an oligonucleotide microarray. In some embodiments, a kit comprises reagents useful to assess expression of one or more HSF1-CSS, HSF1-CaSig2 gene, HSF1-CaSig3 gene, refined HSF1-CSS, Group A, Group B, Module 1, Module 2, Module 3, Module 4, or Module 5 genes. In some embodiments a kit comprises a nucleic acid construct useful as a reporter of HSF1 activity, e.g., as described above. In some embodiments a kit comprises probes, primers, or binding agents, or other primary detection reagents suitable for measuring at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, or all of the HSF1-CSS, HSF1-CaSig2, HSF1-CaSig3, refined HSF1-CSS, Group A, Group B, Module 1, Module 2, Module 3, Module 4, or Module 5 genes. In some embodiments at least 50% of probes, primers, binding agents, or other primary detection reagents in a kit are specific for HSF1-CP genes.
Individual kit components may be packaged in separate containers (e.g., tubes, bottles, etc.) The individual component containers may be packaged together in a larger container such as a box for commercial supply. Optionally the kit comprises written material, e.g., instructions, e.g., in a paper or electronic format (e.g., on a computer-readable medium). Instructions may comprise directions for performing the assay and/or for interpreting results, e.g., in regard to tumor classification, diagnosis, prognosis, or treatment-specific prediction. Such material could be provided online.
In some embodiments, the invention provides a system which is adapted or programmed to assess HSF1 expression or HSF1 activation, e.g., for use in a method of the invention. In some embodiments the system may include one or more instruments (e.g., a PCR machine), an automated cell or tissue staining apparatus, an imaging device (i.e., a device that produces an image), and/or one or more computer processors. The system may be programmed with parameters that have been selected or optimized for detection and/or quantification of an HSF1 gene product, e.g., in tumor samples. The system may be adapted to perform the assay on multiple samples in parallel and/or may have appropriate software to analyze samples (e.g., using computer-based image analysis software) and/or provide an interpretation of the result. The system can comprise appropriate input and output devices, e.g., a keyboard, display, etc. In some embodiments, the invention provides a system which is adapted or programmed to assess expression of one or more HSF1-CP genes, e.g., one or more HSF1-CSS, HSF1-CaSig2, HSF1-CaSig3, refined HSF1-CSS, Group A, Module 1, Module 2, Module 3, Module 4, or Module 5 genes. In some embodiments a system classifies a sample based on assessing expression of one or more HSF1-CP genes in the sample. In some embodiments, the invention provides a system which is adapted or programmed to assess binding of HSF1 to regulatory regions of one or more HSF1-CP genes, e.g., one or more HSF1-CSS, HSF1-CaSig2, HSF1-CaSig3, refined HSF1-CSS, Group A, Module 1, Module 2, Module 3, Module 4, or Module 5 genes. In some embodiments a system classifies a sample based on assessing binding of HSF1 to regulatory regions of one one or more HSF1-CP genes in the sample.
In some embodiments, an assay is performed at one or more central testing facilities, which may be specially qualified or accredited (e.g., by a national or international organization which, in some embodiments, is a government agency or organization or a medical or laboratory professional organization) to perform the assay and, optionally, provide a result. A sample can be sent to the laboratory, and a result of the assay, optionally together with an interpretation, is provided to a requesting individual or entity. In some embodiments, determining the level of HSF1 expression or the level of HSF1 activation in a sample obtained from the tumor comprises providing a tumor sample to a testing facility. In some aspects, the invention provides a method comprising: providing to a testing facility (a) a sample obtained from a subject; and (b) instructions to perform an assay to assess the level of HSF1 expression or HSF1 activation (and, optionally, instructions to perform one or more additional assays, e.g., one or more additional assays described herein). In some aspects, the invention provides a method comprising: (a) providing to a testing facility a sample obtained from a subject; and (b) receiving results of an assay of HSF1 expression or HSF1 activation. In some aspects, the invention further provides a method comprising providing, e.g., electronically, a result of such an assay, to a requestor. In some aspects, the invention further provides a method comprising receiving, e.g., electronically, a sample and a request for an assay of HSF1 expression or HSF1 activation, performing such assay, and reporting the result of such assay to a requestor. A result can comprise one or more measurements, scores and/or a narrative description. In some embodiments, a result provided comprises a measurement, score, or image of the sample, with associated diagnostic, prognostic, or treatment-specific predictive information. In some embodiments, a result provided comprises a measurement, score, or image of the sample, without associated diagnostic, prognostic, or treatment-specific predictive information. The invention contemplates that an assay may be performed at a testing facility which is remote from the site where the sample is obtained from a subject (e.g., at least 1 kilometer away). It is contemplated that samples and/or results may be transmitted to one or more different entities, which may carry out one or more steps of an assay or a method of the invention or transmit or receive results thereof. All such activities are within the scope of various embodiments of the invention.
Study Design and Population
The Nurses' Health Study (NHS) is a prospective cohort study initiated in 1976 (40, 41). 121,700 female US-registered nurses between the ages of 30-55 completed a questionnaire on factors relevant to women's health with follow-up biennial questionnaires used to update exposure information and ascertain non-fatal incident diseases (40). The follow-up rate was greater than 90% through 1996. Participants who developed breast cancer were identified through the biennial questionnaires and permission was obtained for a review of the medical record. The diagnosis of cancer was confirmed by chart review in 99% participants who self-reported the development of breast cancer. Tumor size, existence of metastatic disease, histologic subtype and invasive or in situ status were recorded from the medical record. This information was used to assign a clinical stage to the patients using the parameters listed in the legend of Table 1. In cases of deceased participants, death certificates and medical records were obtained to ascertain information relevant to the study. Use of this information and associated pathology materials for the study reported here was approved by the Human Subjects Committee at Brigham and Women's Hospital in Boston, Mass.
Tissue Microarray Construction
The NHS breast cancer tissue block collection and tissue microarray (TMA) assembly have been described previously (40, 41). Formalin fixed paraffin-embedded tissue blocks were collected from breast cancers that developed within a follow-up period of 20 years spanning 1976 to 1996. Samples were successfully obtained from 3,752 of the 5,620 participants that were eligible for block collection. The diagnosis, tumor type, and histologic grade were confirmed by review of Hematoxylin and eosin (H&E) stained sections. A total of 23 TMA blocks were constructed at the Dana Farber/Harvard Cancer Center Tissue Microarray Core Facility in Boston from 3,093 primary tumors and lymph nodes with metastatic disease derived from 2,897 study participants. For this study, tissue was available from 21TMAs including samples from 2656 individuals.
Paraffin blocks were also obtained from the archives of Brigham and Women's Hospital (BWH) in accordance with the regulations for excess tissue use stipulated by the BWH institutional review board. Twenty-four blocks from individual patients were used to construct an additional tissue microarray from normal breast tissue derived from breast reduction mammoplasty procedures. Normal breast epithelial lobules were identified on H&E stained sections and three 0.6 mm cores were taken and transferred into a recipient paraffin block at the Dana Farber/Harvard Cancer Center Tissue Microarray Core Facility. Epithelium from 16 lobules could be identified in the sections used for this study. Additional whole tissue sections were made from paraffin blocks of invasive ductal carcinoma or ductal carcinoma in situ.
Lung, colon, and prostate tissue studied was also formalin-fixed paraffin-embedded human biopsy material.
Immunohistochemistry of Tissues
Paraffin sections of human and mouse tissues and tissue microarrays were stained with a rat monoclonal antibody cocktail to HSF1 (Thermo Scientific RT-629-PABX).
According to the manufacturer's data sheet, this antibody preparation contains a combination of monoclonal antibodies obtained from hybridoma clones 4B4, 10H4, and 10H8, generated using recombinant mouse HSF1 protein (amino acids 1-503) as an immunogen, and reported to recognize an epitope within amino acids 288-439. Deparaffinized sections were blocked with 3% H2O2, antigen retrieval was performed using a pressure cooker with Dako citrate buffer (pH 6.0) at 120° C.+/−2° C., 15+/−5 PSI, slides were blocked with 3% normal rabbit serum and primary HSF1 antibody (1:2000) was incubated at room temperature for 40 minutes. Application of the primary antibodies was followed by 30 minute incubation with Dako Labeled Polymer-HRP anti-rat IgG as a secondary antibody, and visualized with 3,3′-diaminobenzidine (DAB) as a chromogen (Dako Envision+ System). Mayer-hematoxylin was used for counterstaining.
Immunostained sections were reviewed by light microscopy and scored visually with a value assigned to each individual core. Scoring was based on a semi-quantitative review of staining intensity with 0 indicating no nuclear staining, 1 indicating low level nuclear staining and 2 indicating strong nuclear staining for HSF1. The immunostained sections were evaluated independently by two pathologists (SS and TAI) who were blinded to the survival outcomes of the participants and scores given by the other pathologist. Scoring averages were determined per case from values assigned to all evaluable cores from the two independent readings. If diagnostic tissue was absent or if the staining was uninterpretable for all three cores, the case status was recorded as missing. The kappa value was used to measure inter-observer variability among the two pathologist reviews. The kappa statistic was 0.92 for the scoring of HSF1-positive versus negative tumors and 0.84 for the scoring of HSF1-negative, HSF1-low, versus HSF1-high tumors. Cases with no detectable HSF1 or only cytoplasmic immunoreactivity are referred to as HSF1-negative tumors and cases with low or high nuclear HSF1 are referred to as HSF1-positive tumors unless indicated otherwise. The ER, PR and HER2 status of each case was determined as previously described (42). HSF1 wild-type and null mice as a source of tissue for immunostaining controls were a kind gift from Ivor Benjamin (3).
In the analysis depicted in
In the analysis described in Example 8 and depicted in
Immunoblotting
Tissue blot IMB-130a from Imgenex Corp (San Diego, Calif.) was blocked with 5% non-fat dry milk in IX PBS (pH 7.4) and washed with IX PBS (pH 7.4) containing 0.1% Tween 20. Primary antibodies were applied in IX PBS (pH 7.4)+0.5% non-fat dry milk for 1 hour at room temperature. Peroxidase-conjugated secondary antibodies were applied at room temperature for 1 hour and the signal was visualized by incubation with a chemiluminescent substrate (Pico-West, Thermo-Fisher). Tissues lysates from HSF1 wild-type and null mice were made from freshly harvested organs that were immediately frozen in liquid nitrogen, and subsequently extracted in cold lysis buffer (100 mM NaCl, 30 mM Tris-HCl (pH 7.6), 1% NP-40, 1 mM EDTA, 1 mM sodium orthovanadate, 30 mM sodium fluoride, and a complete protease inhibitor cocktail tablet (Roche Diagnostics)). Protein concentrations were determined using a BCA reagent (Pierce Biochemical) and proteins were separated on NuPAGE® Novex gels and transferred to Immun-Blot® PVDF membrane (Bio-Rad).
Selection Criteria for Outcome Analysis
This study included women with either ductal carcinoma in situ or invasive breast carcinoma that were diagnosed between 1976, after the completion of the baseline initial questionnaire, and 1996. Inclusion in the study (n=2656) required that tissue from the primary breast lesion was available for TMA construction and that outcome data was also available. Kaplan-Meier analysis and multivariate analysis were performed with data from participants with invasive breast cancer at diagnosis. Participants were excluded from outcome analysis if they had in situ carcinoma only (n=408), stage 1V breast cancer at the time of diagnosis (n=50) or HSF1-status could not be evaluated due to missing cores (n=357). Hence, outcome analysis was performed on 1,841 women. Expression of HSF1 was also analyzed in 200 cases of ductal carcinoma in situ which were not included in outcome analysis.
Covariates Evaluated in the Analysis
The medical record and supplemental questionnaires were used to garner information on the breast tumor and treatments including year of diagnosis, stage, radiation, chemotherapy and hormonal treatments. Histological grade was determined by centralized pathology review as described previously (41). Covariates considered in the multivariate model were based on both statistical significance and clinical significance. They included age at diagnosis, date of diagnosis, estrogen receptor status, disease stage, tumor grade, radiation treatment, chemotherapy and hormonal treatment.
Statistical Analysis
HSF1-positive (including HSF1-high and HSF-low) and HSF1-negative tumors were compared according to tumor characteristics and treatment variables by the chi-square test or Wilcoxon rank sum test, as appropriate. The survival endpoint was death from breast cancer. Deaths from any other causes were censored. Therefore, all mention of survival and mortality refer only to breast cancer-specific survival and mortality. Survival curves were estimated by the Kaplan-Meier method and statistical significance was assessed with the log-rank test. Cox proportional hazards regression models were used to evaluate the relationship between HSF1 status and breast cancer-specific mortality after adjusting for covariates. All analyses of the NHS data were run with SAS version 9.1 statistical software. Survival of patients from Van de Vivjer et al. (17) was analyzed by Kaplan-Meier methods and statistical significance was assessed with the log-rank test using GraphPad Prism 5. All statistical tests were two-sided and a P value of <0.05 was considered statistically significant.
Cell Culture Methods.
HME, HMLER and MCF10A cells were cultured in MEGM medium supplemented as specified by the manufacturer (Lonza). BPE and BPLER cells were cultured in WIT-I and WIT-T medium, respectively, in accordance with recommendations by the manufacturer (Stemgent). The HME, BPE, HMLER and BPLER cells are available from the Ince laboratory upon request. BT474, H441, H838, H1703, HCC38, HCC1954, HCT15, HT29, SKBR3, SW620 and ZR75-1 cells were cultured in RPMI-1640 medium supplemented with 10% fetal bovine serum. BT20, MDA-MB-231, MCF7 and T47D cells were cultured in Dulbecco's modified Eagle's medium supplemented with 10% fetal bovine serum. All established cell lines were from A.T.C.C.
ChIP-Seq and ChIP-PCR.
ChIP-qPCR and ChIP-Seq experiments were performed as described previously (Lee et al., 2006), with modifications and analysis methods detailed in Supplemental Experimental Procedures.
Gene Expression.
Lentiviral shRNA sequences, viral production and transduction of cells have been described previously (Dai et al., 2007). Gene expression analysis was performed as described in Supplemental Experimental Procedures using an Affymetrix Gene Chip HT Human Genome U133 96-Array Plate. Data were analyzed using previously described methods (Ince et al., 2007). All microarray raw data were deposited in a public database (NCBI Gene Expression Omnibus). For ChIP-PCR, HSF1 was depleted using siRNA as described in Supplemental Experimental Procedures.
Immunohistochemistry of Tissues.
Paraffin sections of tissue microarrays were stained using a rat HSF1 monoclonal antibody cocktail (Thermo Scientific, RT-629-PABX) as detailed in Supplemental Experimental Procedures.
The Nurses' Health Study Analysis Design and Population, Exclusion Criteria and Statistical Analysis.
The Nurses' Health Study (NHS) is a prospective cohort study initiated in 1976 (Hu et al., 2011; Tamimi et al., 2008). For design and study population, exclusion criteria and statistical analysis, see above.
Correlation of Gene Expression with Outcome.
The “HSF1-CaSig” was generated from the 456 genes that were bound in BPLER cells by HSF1 near their transcription start sites (bound from −8 kb to +2 kb of the TSS). Table T4C lists the HSF1-CaSig genes. The HSF1-CaSig2 was generated from the genes found in Modules 1 and 2 of our gene-gene correlation analysis (
We used all breast cancer datasets with reported clinical outcome available in the Oncomine database (Rhodes et al., 2007) containing at least 70 tumors, excluding several datasets based on older microarray platforms that were missing many currently annotated genes. This left 10 high-quality datasets, the majority of which contained more than 150 tumors (Table T5). We stratified each dataset into two groups of tumors based on high (highest 25%) and low (lowest 75%) average expression of the gene or gene signature being queried. For analysis of the MammaPrint and the HSF1-CaSig3 gene signature, the subset of genes positively correlating with poor outcome was positively weighted and the subset of genes negatively correlating with poor outcome was negatively weighted, as described previously (van 't Veer et al., 2002; van de Vijver et al., 2002). Data for the three versions of the HSF1-CaSig for KM analysis were retrieved from Oncomine (Rhodes et al., 2007).
All data for comparisons with random signatures were obtained from NCBI GEO and KM analysis was repeated. (The VandeVijver and TCGA datasets were not on an Affymetrix platform and were excluded from this analysis.) If CEL files were available, Affymetrix microarrays were processed with RMA using Bioconductor; otherwise, preprocessed expression matrices were obtained from NCBI GEO or author web sites. Monte Carlo cross validation was applied to contrast HSF1-CaSig signatures with random signatures of genes of the same number. Random sets of signatures containing the same number of probesets as each HSF1 signature were generated for each dataset with a particular emphasis on U133A probesets (present on both U133A and U1133 Plus 2.0 arrays). The 10,000 random signatures were processed in the same manner as the original signature, sorting samples by increasing mean expression of each mean-centered probeset. Cancer samples, partitioned into the high and low HSF1-CaSig as before, were then analyzed for survival with the log-rank test, producing 10,000 test statistics. Median p values were calculated across a tumor subtype and Monte Carlo cross validation was applied.
Statistical Analysis.
Correlation of gene expression with location of HSF1 occupancy was performed using a two-tailed Fisher's Exact Test. Statistical methods for ChIP-Seq analysis and the Nurses' Health Study outcome data analysis are detailed in Supplemental Experimental Procedures. Kaplan-Meier analysis was used to compare outcome events and p-values were generated using the logrank test. For all other data, mean+/−standard deviation is reported and statistical significance between means was determined using a two-tailed t test.
Gene-Gene Correlation Analysis.
Correlation values of HSF1-bound genes were determined by using the UCLA Gene Expression Tool (genome.ucla.edu/projects/UGET) to query gene expression profile data collected in Celsius, a data warehousing system that aggregates Affymetrix CEL files and associated metadata. Nearly 12,000 Affymetrix HG-U133 Plus 2.0 human gene expression profiles, predominantly representing neoplasms of highly diverse human origin, were interrogated.
Supplemental Experimental Procedures for Examples 9-14
ChIP Antibodies.
For ChIP-Seq, HSF1 antibody (Santa Cruz, sc-9144) and normal rabbit IgG (Santa Cruz, sc-2027) were used. For ChIP-qPCR, HSF1 antibody (Santa Cruz, sc-9144) and, as a control, a second HSF1 antibody (Thermo Scientific, RT-629-PABX), were used. Similar results were obtained and RT-629-PABX antibody data are reported. Additionally, (RNA polymerase II CTD repeat YSPTSPS antibody [4H8](Abcam, ab5408) and normal rabbit IgG (Santa Cruz, sc-2027) were used, as indicated.
ChIP-Seq and ChIP-PCR.
For ChIP-Seq, 5×107 cells were used for each immunoprecipitation. For heat-shock, cells were transferred to a 42′C (5% CO2) incubator for 1 hr. ChIP and ChIP-Seq experiments were performed as described previously (Lee et al., 2006) with several modifications (Novershtern et al., 2011). In place of RIPA buffer, immunoprecipitations were washed sequentially with buffer B (20 mM Tris-HCl, pH 8.0, 150 mM NaCl, 2 mM EDTA, pH 8.0, 0.1% SDS and 1.0% Triton X-100), buffer C (20 mM Tris-HCl, pH 8.0, 500 mM NaCl, 2 mM EDTA, pH 8.0, 0.1% SDS and 1.0% Triton X-100), buffer D (10 mM Tris-HCl, pH 8.0, 250 mM LiCl, 1 mM EDTA, pH 8.0, 1.0% Na-Deoxycholate and 1.0% IGEPAL CA-630), and buffer TE (10 mM Tris-HCl, pH 8.0, 1 mM EDTA, pH 8.0). Preparation of the ChIP-Seq DNA library and deep sequencing using an Illumina Solexa genome analyzer were performed as described previously (Yu et al., 2009).
Images acquired from the Illumina sequencer were processed through the bundled Illumina image extraction pipeline. ChIP-Seq reads were aligned to HG18 using ELAND software (Illumina). Identification of enriched genomic regions was performed as described previously (Guenther et al., 2008). Briefly, each ChIP-Seq read (a maximum of two repeat reads were allowed) was extended 100 bp to approximate the middle of the sequenced fragment. The extended fragments were subsequently allocated to 25 bp bins across the genome. Read density for each bin was calculated and enriched bins were identified by comparison to a Poisson background model using a p-value threshold of 10−12. The minimum ChIP-seq read density required to meet this threshold for each dataset is indicated in Table T1. Enriched bins within 200 bp were combined to form enriched regions. Enriched regions less than 100 bp were removed. Because of the non-random nature of background reads, enriched bins and regions were also required to have an eight-fold greater ChIP-seq density versus a nonspecific control IgG immunoprecipitation performed under identical conditions. All RefSeq genes that were within 8 kb of enriched regions were considered to be enriched genes. A summary of the experiments is provided in Table T1. The raw data will be or have been deposited in a public database (NCBI Gene Expression Omnibus).
The unions of all HSF1 enriched regions identified by ChIP-Seq in each sample were merged to identify a global set of regions. Short reads overlapping these regions were quantified using HTSeq-count (http://www-huber.embl.de/users/anders/HTSeq/doc/count.html). The counts matrix was median-normalized using the total number of mapped reads. After adding 1 pseudocount, counts were log 2-normalized and analyzed by principal components as implemented by the MADE4 program in Bioconductor (Culhane et al., 2005).
For ChIP-qPCR, 5×106 cells were used for each immunoprecipitation. The protocol was modified as described above. RT2 SYBR Green qPCR Mastermix (SABiosciences) was used with the indicated oligo pairs (Table T7) on a 7700 ABI Detection System.
Preparation of human breast and colon tumors for ChIP-seq was performed using 300 mg of cryopreserved material. Frozen tumor tissue was retrieved from the Brigham and Women's Hospital (BWH) Tissue Bank in accordance with the regulations for excess tissue use stipulated by the BWH institutional review board. Frozen sections for immunohistochemistry were prepared using a cryostat from adjacent tissue. Frozen samples were processed for ChIP-Seq using a tissue pulverizer, and this material was subsequently suspended in PBS and passed serially through needles of increasing gauge. This suspension was then fixed for 10 minutes and the pellet was processed as described above.
Gene Expression Analysis.
Lentiviral shRNA sequences, viral production and transduction of cells have been described previously (Dai et al., 2007). RNA was purified following extraction with TRIzol reagent (Invitrogen, #15596-026), 60 hours after viral infection. Protein lysates of concurrent infections were prepared in TNES buffer consisting of 50 mM Tris, pH 7.4; NP-40 1%; EDTA 2 mM; NaCl 200 mM plus protease inhibitor cocktail (Roche Diagnostics, Cat#11836153001). Protein concentration was measured by BCA assay (Thermo Fisher Scientific 23227) and 15 μg total protein/lane was analyzed by SDS-PAGE and immunoblotting using rat monoclonal anti-HSF1 antibody cocktail (Ab4, Thermo Scientific, 1:1000 dilution) and Actin Monoclonal Antibody (mAbGEa; clone DM1A, Thermo Scientific, 1:1,000). Because prolonged depletion of HSF1 is toxic to malignant cells (Dai et al., 2007), we analyzed mRNA expression early, before HSF1 knockdown was complete and cell viability was grossly impaired. Thus, results likely underestimate the effects of HSF1 on gene expression in malignant cells. For gene expression after heat-shock, cells were transferred to a 42° C. (5% CO2) incubator for 1 hr and allowed to recover for 30 minutes in a 37° C. (5% CO2) incubator before RNA extraction. Gene expression analysis was performed using an Affymetrix GeneChip HT Human Genome U133 96-Array Plate and data were analyzed using previously described methods (Ince et al., 2007). All microarray raw data were deposited in a public database (NCBI Gene Expression Omnibus).
For evaluating the effects of HSF1 knockdown on the expression of target genes, HSF1 was depleted using siRNA (Dharmacon, Lafayette, Colo.): M012109-01 siGenome SMART pool, Human HSF1 (target sequences:
siGLO RISC-Free siRNA (D-001600-01) and siGENOME Non-Targeting siRNA #5 (D-001210-05) were used as controls. Cells were transfected using Lipofectamine™ RNAiMAX Transfection Reagent (Invitrogen, #13778) and were harvested in Trizol (Invitrogen, #15596-026). RNA was purified using Direct-zol™ RNA MiniPrep (Zymo Research, Irving, Calif.). Quantitative PCR to evaluate mRNA levels was performed as described above using RT2 SYBR Green qPCR Mastermix (SABiosciences) and primer assay pairs (SABiosciences; Valencia, Calif.) on a 7700 ABI Detection System.
Gene-Gene Correlation Analysis.
Correlation values of HSF1-bound genes were determined using the UCLA Gene Expression Tool (genome.ucla.edu/projects/UGET) to query gene expression profile data collected in Celsius, a data warehousing system that aggregates Affymetrix CEL files and associated metadata. Nearly 12,000 Affymetrix HG-U133 Plus 2.0 human gene expression profiles, predominantly representing neoplasms of highly diverse human origin, were interrogated. A pair-wise correlation matrix was built by assessing genes bound in at least two of the three cell lines with most robust HSF1 activation (BT20, NCIH838, SKBR3). This generated 1042 genes. The final map as displayed contains 709 unique genes, with genes required to have an absolute value of the correlation coefficient >0.3 (|a|>0.3) with at least 100 other genes. Data was ordered using hierarchical clustering (correlation centered, average linkage).
Xenografts.
5×106 HMLER and BPLER cells in a 50/50 mix of PBS/Matrigel were inoculated subcutaneously in the right inguinal region of each mouse using a 27 g needle. Tumors were removed, and fixed in 10% formalin. Following standard tissue processing, 5 μM sections were cut and immunostained as described below.
Immunohistochemistry of Tissues and Scoring.
Paraffin blocks of human tumor and normal tissue were obtained from the archives of BWH in accordance with the regulations for excess tissue use stipulated by the BWH institutional review board. Tissue microarrays were purchased from Pantomics (Richmond, Calif.) for carcinoma of the breast (BRC501, BRC1502), cervix (CXC1501), colon (COC1503), lung (LUC1501), pancreas (PAC481) and prostate (PRC1961). Whole sections of 40 meningioma specimens were retrieved from the archives of BWH. A TMA of triple negative breast cancer cases was kindly provided by Dr. Andrea Richardson (BWH). Normal tissue cores on the TMAs and adjacent normal tissues in the whole sections were used to evaluate expression of HSF1 in non-neoplastic tissues.
Formalin-fixed, paraffin-embedded (FFPE) sections were first deparaffinized. Frozen sections were first post-fixed in 10% formalin. FFPE or fixed-frozen sections were blocked with 3% H2O2 and antigen retrieval was performed using a pressure cooker with Dako citrate buffer (pH 6.0) at 120° C.+/−2° C., 15+/−5 PSI. Slides were blocked using 3% normal rabbit serum, primary HSF1 antibody (1:2000) was applied at room temperature for 40 minutes, followed by a 30 minute incubation with Dako Labeled Polymer-HRP anti-rat IgG as a secondary antibody. Visualization was achieved with 3,3′-diaminobenzidine (DAB) as a chromogen (Dako Envision+ System). Counterstaining was performed with Mayer-hematoxylin. Immunostained sections were scored independently by two pathologists (SS and TAI) using light microscopy. HSF1 immunostains of FFPE tumor sections were scored using a 0 to 25 scale in
Immunofluorescence.
Immunofluorescence was performed using 1:250 dilution of rat monoclonal anti-HSF1-antibody cocktail (Ab4, Thermo Scientific, 1:1000 dilution), 1:100 dilution of rabbit polyclonal anti-p53 (Santa Cruz, #sc-6243) and with fluorescence labeled secondary antibodies. The slides were then reviewed by standard fluorescence microscope.
References cited in Supplemental Experimental Procedures for Examples 9-14
To facilitate our studies of HSF1, we verified the specificity of a commercially-available HSF1 antibody cocktail on samples from HSF1 wild-type and null mice. A strong immunoreactive band of the expected size for HSF1 was present in wild-type lysates but was absent in lysates null for HSF1 (
We examined the expression of HSF1 in invasive carcinoma and matched normal adjacent breast tissue from seven patients by immunoblot (
As a transcription factor HSF1 is active only in the nucleus. Hence, we examined the localization and expression levels of HSF1 in tumor cells versus normal cells by IHC in a small panel of breast carcinoma tissue sections. A striking difference between malignant cells and the adjacent normal breast epithelium was apparent (
In 20 HSF1-positive tumors, there was widespread uniform expression of HSF1 throughout the tumor cell nuclei. The uniform intensity of HSF1 expression is important to contrast with the variable patterns seen with most prognostic markers that are surveyed in human tumor sections with IHC. HSF1 staining was not stronger in tumor cells at the center of the tumor versus those at the stromal interface (
We also monitored HSF1 localization and levels of expression by immunohistochemistry (IHC) in a set of 301 clinical cases of invasive ductal carcinoma. The tumors were also characterized for expression of conventional breast cancer biomarkers, including estrogen receptor (ER), progesterone receptor (PR) and HER2. In total, 67 ER+ and/or PR+ tumors, 54 HER2+ tumors, and 180 triple negative (TN) tumors were evaluated along with 16 normal mammary tissue samples. In samples of normal breast tissue, HSF1 was rarely present in the nucleus (
The findings in ten in situ carcinomas were similar to those in invasive cancer. In the majority of ductal carcinoma in situ (DCIS) cases, there was increased nuclear HSF1 compared to neighboring normal breast epithelium (
We also examined HSF1 expression and localization in a range of other tumor types including lung, colon, and prostate adenocarcinomas using IHC. Increased HSF1 expression and increased nuclear HSF1 were seen in the neoplastic tissue in each of these tumor types (
We next performed an in-depth analysis of HSF1 protein expression in a large breast cancer cohort. 1,841 invasive breast cancer cases from the Nurses' Health Study (NHS) were evaluated for HSF1 localization and expression (
Nuclear HSF1 was detected in 84.5% of the DCIS cases. The frequency and levels of HSF1 expression were similar between DCIS and invasive cancer, confirming our earlier observations on a smaller number of tumor sections. No statistically significant association was found between HSF1 expression and DCIS nuclear grade, however (Table S1). Our limited sample size of DCIS cases (n=200) may have limited the power to detect such an association. Nonetheless, these observations highlight that HSF1 is activated before malignant cells gain the ability to invade across the basement membrane.
We next investigated the relationship between HSF1 expression and breast cancer survival. A total of 1841 women met inclusion criteria such as the absence of metastases at the time of diagnosis. Median follow-up time was 14.9 years. Kaplan-Meier curves show that women with HSF1-positive tumors had worse survival relative to women with HSF1-negative tumors (P<0.0001) (
We also examined survival considering HSF1-status in three categories: HSF1-negative, HSF1-low and HSF1-high groups. Survival decreased as HSF1 levels increased from none to low and still further to high (P<0.0001) suggesting a dose-dependent association between HSF1 and survival outcomes (
To account for the effects of all variables considered on the relationship between HSF1 levels and survival, we assessed this relationship using several multivariate models. Across all cases, adjusting for age (model 1, Table 2), HSF1 positive tumors were associated with a 74% increase in breast cancer mortality (Table 2; Hazards Ratio (HR) 1.74, 95% Confidence Interval (CI), 1.35-2.25; P value<0.0001) relative to HSF1-negative tumors. After adjusting for age, ER-status, date of diagnosis, stage, grade, and treatment variables (radiotherapy, chemotherapy, endocrine therapy) (model 2, Table 2), HSF1 positive tumors were associated with a 50% increase in breast cancer mortality (Table 2; HR 1.50, 95% CI, 1.15-1.95; P value=0.0026). HSF1-low and HSF1-high tumors were associated with 45% (P=0.008) and 62% (P=0.001) increases in mortality, respectively (Table 3). Similar results were seen in the ER-positive population with HSF1-positive tumors associated with 86% increased mortality (Table 2; HR, 1.86; 95% CI, 1.34-2.59; P value=0.0002). Among the HSF1-positive tumors, HSF1-low and HSF1-high tumors were associated with 75% and 110%/o increases in mortality, respectively (Table 3).
74% (n=700) of the ER-positive patients received hormonal therapy. In this group, there was a significant association between HSF1-positive tumors and increased mortality (Table 2; HR, 2.20; 95% CI, 1.19-4.05; P value=0.0115). In women with ER-positive tumors who did not receive hormonal therapy (26%, n=247), the magnitude of the association was similar (Table 2; HR, 2.01; 95% CI, 0.69-5.88; P value=0.2002) but the study may have been underpowered to detect a significant association in this group. The data may suggest that HSF1 can contribute to tamoxifen resistance, an effect that may be evaluated further in follow-up studies prospectively in a uniformly-treated population.
HSF1 was also associated with worse clinical outcomes in patients with HER2-positive breast cancer. We observed that 88.4% of HER2-positive invasive tumors were HSF1-positive and 40.7% had high levels of HSF1, the greatest percentage of any molecular subtype. In Kaplan-Meier analysis, a suggestive association between HSF1-status and survival in patients with HER2-positive tumors was observed (
We undertook an analysis of a subset of 947 women in the NHS cohort with ER+/lymph node negative tumors. This population is challenging to manage clinically since it is often unclear which small fraction of the population will experience a recurrence and could therefore benefit from early intervention and more aggressive treatment. Survival was examined by KM analysis considering HSF1-status in three categories: HSF1-negative, HSF1-low and HSF1-high groups. Survival decreased as HSF1 levels increased from none to low and further to high (P=0.0015) suggesting a dose-dependent association between HSF1 activation and survival (
We examined whether the associations between HSF1 protein level and outcome in breast cancer could also be detected using HSF1 mRNA levels. Since mRNA expression profiling data is not available from tumors in the NHS, we used data from the publicly available van de Vijver cohort (17) for this analysis. Consistent with our immunohistochemistry analysis in the NHS sample obtained from the tumors, HSF1 mRNA levels were higher in ER-negative than in ER-positive cancers (P<0.0001). We analyzed survival using two HSF1 categories: HSF1-high and HSF1-low. Kaplan-Meier curves show that women with HSF1-high tumors in the van de Vijver cohort had worse survival relative to women with HSF1-low tumors (
We performed IHC for HSF1 protein in tissue samples from a group of 70 stage I lung cancers (Stage I lung adenocarcinomas (T1 N0 M0 or T2 N0 M0)) and examined the relationship between HSF1 expression and overall survival and progression-free survival. Survival was examined by KM analysis considering HSF1-status in three categories: HSF1-low, HSF1-intermediate, and HSF1-high groups. Both overall survival and time to progression decreased as HSF1 levels increased from low to intermediate and further to high, suggesting a dose-dependent association between HSF1 activation and survival (
To investigate the HSF1-regulated transcriptional network in cancer and how it relates to the classical heat-shock response, we used a panel of human mammary epithelial cell lines with very different abilities to form tumors and metastasize (Ince et al., 2007). Two types of primary mammary epithelial cells (HMEC and BPEC) were isolated from normal breast tissue derived from the same donor during reductive mammoplasty. These pairs of isogenic cells were established using different culture conditions that are believed to have supported the outgrowth of distinct cell types. The cells were immortalized with hTERT (HME and BPE) and then transformed with an identical set of oncogenes (HMLER and BPLER). The resulting tumorigenic breast cell lines had very different malignant and metastatic potentials (low, HMLER and high, BPLER) supporting the concept that the cell type from which a cancer arises (“cell-of-origin”) can significantly influence its ultimate phenotype (Ince et al., 2007). Despite their initial isogenic nature and transformation by the same oncogenes, the tumor initiating cell frequency in BPLER cells is ˜104 times greater (more tumorigenic) than isogenic HMLER cells derived from the same donor (Ince et al., 2007). While HMLER cells are non-metastatic, the BPLER cells form metastases in lungs from orthotopic and subcutaneous tumors with very high frequency (>75-85%) (Ince et al., 2007). Hence, the panel of immortalized, non-tumorigenic cells (HME and BPE) and their transformed counterparts with low (HMLER) and high (BPLER) malignant potential provided a well-controlled system for simultaneously studying the changes that occur during transformation as well as the molecular differences that drive variation in malignant potential (Ince et al., 2007).
We asked if HSF1 expression differed in the highly malignant BPLER and the much less malignant HMLER breast cancer cells. We used two sets of such cells, each pair derived from a different donor. In both, HSF1 protein expression was higher in the more malignant member of the pair, BPLER cells (
To determine if these differences in HSF1 were simply an artifact of growth in cell culture, we implanted the cells into immunocompromised mice and allowed them to form tumors. HSF1 immunostaining was weak in the HMLER tumors. Moreover, it was largely restricted to nonmalignant, infiltrating stroma and to tumor areas bordering necrosis (
Given this evidence for the activation of HSF1 in BPLER cells, we asked if they were more dependent on HSF1 than HMLER for growth and survival. Neither cell type was affected by negative control shRNA. With two independent shRNA that knockdown HSF1 expression, however, cell growth and viability were far more strongly reduced in BPLER than HMLER cells (
To determine if the transcriptional program driven by HSF1 in highly malignant cells differs from that driven by a classical thermal stress, we used chromatin immunoprecipitation coupled with massively parallel DNA sequencing (ChIP-Seq) (Johnson et al., 2007), characterizing HSF1 binding sites genome-wide. We first assessed the immortalized non-transformed progenitor cells, HME and BPE, grown at 37° C. or following a 42° C. heat shock (
In the HME and BPE parental cell lines, a limited number of genes were bound by HSF1 in the absence of heat shock, and these were bound weakly (
A small number of genes were bound by HSF1 under basal conditions in the transformed cells with low malignant potential, HMLER (37° C.;
Surprisingly, a full 60% of the genes bound by HSF1 in BPLER cells were not bound in non-transformed parental lines, even after heat-shock (
We analyzed the 100 bp genomic regions surrounding the peaks of HSF1 binding unique to BPLER cells using the ab initio motif discovery algorithm MEME (Machanick and Bailey, 2011). The canonical heat-shock element (HSE) was highly enriched in the HSF1-bound regions (p-value=1.4×10−97;
The remaining 40% of genes bound by HSF1 in BPLER cells under basal conditions were also bound in the parental lines following heat-shock. As expected, these genes included many classical heat-shock genes, and were enriched for protein folding categories (
Notably, for many of the genes bound in both cancer and heat shock, HSF1 binding differed quantitatively. For example, the strongly heat-shock inducible HSPA6 gene (encoding HSP70B′) was highly bound in parental lines upon heat shock but only weakly bound in BPLER cells at 37° C. (
To investigate the consequences of HSF1 occupancy on gene expression, we compared RNA profiles in HMLER and BPLER cells transduced with control shRNA hairpins to those transduced with hairpins that knockdown HSF1. As we previously reported, the growth and survival of malignant cells is compromised by prolonged depletion of HSF1 (Dai et al., 2007). Therefore, we only analyzed mRNA expression in the early stages of shRNA inhibition, where HSF1 knockdown was still incomplete (
Control hairpins that did not reduce HSF1 levels (Scr and GFP;
Relating the effects of the hairpins on gene expression to our earlier ChIP-Seq analysis, ˜70% of genes positively regulated by HSF1 were bound at the promoter while only ˜30% of these genes were bound in distal regions (
We also examined the effects of HSF1 knockdown on gene expression in MCF7 cells. In contrast to genetically engineered HMLER and BPLER cells, the MCF7 line was established from a human breast cancer metastasis (Soule et al., 1973). Moreover, as an estrogen receptor positive (ER+) line, its biology is fundamentally distinct from the hormone-receptor negative HMLER and BPLER cell lines. Despite these differences, the pattern of changes in gene expression caused by HSF1 knockdown was very similar in BPLER cells and MCF7 cells for HSF1 targets (
Next we used ChIP-qPCR to monitor HSF1 binding to a representative set of the HSF1-target genes in cell lines derived from patients with breast cancer. We used nine well-studied cancer lines (including MCF7 cells) representing all three major categories of breast cancer: ER+, HER2+ and Triple Negative (TN). Under basal conditions (at 37° C.) we detected HSF1 binding in each of the major breast cancer subtypes (
These ChIP-PCR data spurred us to employ ChIP-Seq to generate high-resolution maps of HSF1 occupancy, and to do so in a panel of human tumor lines that extended to other types of malignancy (
After heat shock, MCF10A cells exhibited an HSF1-binding profile that was comparable to that of heat-shocked HME and BPE cells. In the absence of heat shock the overall magnitude of HSF1 binding in all of the non-tumorigenic cell lines (nt) was uniformly very weak and the total number of bound genes was small (
Irrespective of the level of binding, the distribution of HSF1 occupancy on a genome-wide scale was remarkably similar among the cancer cell lines and distinct from the pattern of binding in the heat-shocked cells (
Data from these multiple cell lines allowed us to confidently identify regions of HSF1 binding that were strong in cancer cells but not in heat-shocked cells, weak in cancer but strong in heat-shock or similarly strong in both (
We performed motif analysis to evaluate the 100 bp genomic regions surrounding the peaks of HSF1 binding in each of these groups. The HSE, comprised of adjacent inverted repeats of 5′-nGAAn-3′, was the most enriched motif in all three groups (
Integrating our diverse data sets (
The largest module was enriched for protein folding, translation and mitosis. Genes within this dominant module showed the strongest positive correlation with the expression of HSF mRNA itself. Many of these genes had indeed proven to be regulated by HSF1 in our HSF1 shRNA knockdown experiments (
As described above, we evaluated HSF1 expression and localization in a cohort of breast cancer patients culled from the Nurses' Health Study (NHS) (Santagata et al., 2011). In that work, HSF1 was cytoplasmic and expressed at low levels in normal breast epithelial cells but it accumulated in the nucleus of the majority of tumor specimens. Here, we have confirmed that finding (
Next, because our ChIP-Seq analysis showed that the HSF1 cancer program is engaged not just in breast cancer lines but also in colon and lung cancer cell lines, we examined more than 300 formalin-fixed surgical specimens taken directly from patients. We included not only colon and lung cancer but also a wide variety of other tumor types. Normal cells adjacent to the tumor demonstrated low HSF1 levels and cytoplasmic localization of the protein. In contrast, high-level nuclear expression of HSF1 was common across every cancer type we examined, including carcinomas of the cervix, colon, lung, pancreas and prostate as well as mesenchymal tumors such as meningioma (
To further confirm that the high-level nuclear localization of HSF1 detected by immunostaining was truly indicative of its activation, we obtained human tumor samples from breast and colon adenocarcinomas that had been cryopreserved and were of a quality suitable for ChIP-Seq analysis (
In our analysis of the Nurses' Health cohort, HSF1 overexpression and nuclear localization was associated with reduced survival (see Examples 2-7 above; see also Santagata et al, 2011a). To acquire more precise and molecularly defined information about the effects of HSF1 activation in cancer, we asked if malignant potential and long-term outcomes correlate with the HSF1 transcriptional program identified above. We distilled an “HSF1-cancer signature” of 456 genes that were bound by HSF1 near their transcription start sites (
High expression of our HSF1-cancer signature had a remarkable correlation with poor prognosis (HSF1-CaSig;
Next, we considered a recent finding that many published cancer signatures are not significantly better outcome predictors than random signatures of identical size (Venet et al., 2011). We performed Kaplan-Meier analysis on independent datasets to evaluate associations between 10,000 individual randomly generated gene signatures and patient outcome (example shown in
Our HSF1-cancer signature was more significantly associated with outcome than other well established prognostic indicators (
At the time of diagnosis, the majority of breast cancer patients have ER+ tumors and early-stage disease (ER+/lymph-node negative tumors). A small fraction of these patients will experience a recurrence and might benefit from more aggressive treatment, but it is currently very difficult to identify them in advance. We found that our HSF1-cancer signature was significantly associated with metastatic recurrence in women initially diagnosed with ER+/lymph node negative tumors (p-value=0.0149) (
To confirm the prognostic value of HSF1 in this particularly challenging population, we returned to the Nurses' Health Study cohort, because it provides one of the largest collections of patients with ER+/lymph node negative tumors for evaluation (n=947), and has the longest patient follow up. Because RNA samples are not available from this collection (initiated in 1976) we could assess only the levels and nuclear localization of HSF1. Survival decreased as HSF1 nuclear levels increased in a dose-dependent manner (p-value=0.0015;
Finally, we asked if the HSF1-cancer signature might have prognostic value beyond breast cancer. Analyzing multiple independent gene expression datasets that include outcomes data, increased expression of the HSF1 cancer program in colon and lung cancers was strongly associated with reduced survival (
Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments of the invention described herein. The scope of the present invention is not intended to be limited to the Description or the details set forth therein. Articles such as “a”, “an” and “the” may mean one or more than one unless indicated to the contrary or otherwise evident from the context. Claims or descriptions that include “or” between one or more members of a group are considered satisfied if one, more than one, or all of the group members are present in, employed in, or otherwise relevant to a given product or process unless indicated to the contrary or otherwise evident from the context. The invention includes embodiments in which exactly one member of the group is present in, employed in, or otherwise relevant to a given product or process. The invention includes embodiments in which more than one, or all of the group members are present in, employed in, or otherwise relevant to a given product or process. Furthermore, it is to be understood that the invention encompasses all variations, combinations, and permutations in which one or more limitations, elements, clauses, descriptive terms, etc., from one or more of the claims (whether original or subsequently added claims) is introduced into another claim (whether original or subsequently added). For example, any claim that is dependent on another claim can be modified to include one or more element(s), feature(s), or limitation(s) found in any other claim, e.g., any other claim that is dependent on the same base claim. Any one or more claims can be modified to explicitly exclude any one or more embodiment(s), element(s), feature(s), etc. For example, any particular type of tumor, tumor characteristic, test agent, candidate modulator, therapeutic agent, gene, set of genes, or combinations thereof can be excluded from any one or more claims.
It should be understood that (i) any method of classification, assessment, diagnosis, prognosis, treatment-specific prediction, treatment selection, treatment, etc., can include a step of providing a sample, e.g., a sample obtained from a subject in need of classification, assessment, diagnosis, prognosis, treatment-specific prediction, treatment selection, or treatment for cancer, e.g., a tumor sample obtained from the subject; (ii) any method of classification, assessment, diagnosis, prognosis, treatment-specific prediction, treatment selection, treatment, etc., can include a step of providing a subject in need of classification, assessment, diagnosis, prognosis, treatment-specific prediction, treatment selection, or treatment for cancer.
Where the claims recite a method, certain aspects of the invention provide a product, e.g., a kit or composition, suitable for performing the method.
Where elements are presented as lists, e.g., in Markush group format, each subgroup of the elements is also disclosed, and any element(s) can be removed from the group. For purposes of conciseness only some of these embodiments have been specifically recited herein, but the invention includes all such embodiments. It should also be understood that, in general, where the invention, or aspects of the invention, is/are referred to as comprising particular elements, features, etc., certain embodiments of the invention or aspects of the invention consist, or consist essentially of, such elements, features, etc.
Where numerical ranges are mentioned herein, the invention includes embodiments in which the endpoints are included, embodiments in which both endpoints are excluded, and embodiments in which one endpoint is included and the other is excluded. It should be assumed that both endpoints are included unless indicated otherwise. Furthermore, unless otherwise indicated or otherwise evident from the context and understanding of one of ordinary skill in the art, values that are expressed as ranges can assume any specific value or subrange within the stated ranges in different embodiments of the invention, to the tenth of the unit of the lower limit of the range, unless the context clearly dictates otherwise. Where phrases such as “less than X”, “greater than X”, or “at least X” is used (where X is a number or percentage), it should be understood that any reasonable value can be selected as the lower or upper limit of the range. It is also understood that where a list of numerical values is stated herein (whether or not prefaced by “at least”), the invention includes embodiments that relate to any intervening value or range defined by any two values in the list, and that the lowest value may be taken as a minimum and the greatest value may be taken as a maximum. Furthermore, where a list of numbers, e.g., percentages, is prefaced by “at least”, the term applies to each number in the list. For any embodiment of the invention in which a numerical value is prefaced by “about” or “approximately”, the invention includes an embodiment in which the exact value is recited. For any embodiment of the invention in which a numerical value is not prefaced by “about” or “approximately”, the invention includes an embodiment in which the value is prefaced by “about” or “approximately”. “Approximately” or “about” generally includes numbers that fall within a range of 1% or in some embodiments 5% or in some embodiments 10% of a number in either direction (greater than or less than the number) unless otherwise stated or otherwise evident from the context (e.g., where such number would impermissibly exceed 100% of a possible value).
Section headings used herein are not to be construed as limiting in any way. It is expressly contemplated that subject matter presented under any section heading may be applicable to any aspect or embodiment described herein.
Embodiments or aspects herein may be directed to any agent, composition, article, kit, and/or method described herein. It is contemplated that any one or more embodiments or aspects can be freely combined with any one or more other embodiments or aspects whenever appropriate. For example, any combination of two or more agents, compositions, articles, kits, and/or methods that are not mutually inconsistent, is provided. It will be understood that any description or exemplification of a term anywhere herein may be applied wherever such term appears herein (e.g., in any aspect or embodiment in which such term is relevant) unless indicated or clearly evident otherwise.
indicates data missing or illegible when filed
This application claims the benefit of U.S. Provisional Application No. 61/642,394, filed May 3, 2012, and U.S. Provisional Application No. 61/656,343, filed Jun. 6, 2012. The entire teachings of the above applications are incorporated herein by reference.
The invention was made with government support under R01-CA146445-01 awarded by the National Cancer Institute, W81XWH-08-1-0282 BC-07456 awarded by the Department of Defense, and K08NS064168 awarded by the National Institutes of Health. The government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2013/039527 | 5/3/2013 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
61642394 | May 2012 | US | |
61656343 | Jun 2012 | US |