The present invention relates to TNBC biomarker profile indicating TNBC subtype, predisposition, existence, recurrence and progression. It also provides the methods for determining sensitivity or resistance of tumor cells to particular chemotherapies.
Analysis of genomic sequences and gene expression patterns has provided a way to improve the diagnosis and risk stratification of many diseases. For example, analysis of genomic sequences and global gene expression patterns has identified molecularly distinct subtypes of cancer in diseases that were considered homogeneous based on classical diagnostic methods. Such molecular subtypes are often associated with different clinical outcomes. Genomic aberration and global gene expression patterns can also be examined for features that correlate with clinical behavior to create prognostic signatures. Identification of genetic mutations and polynucleotides that are differentially expressed in cancer subtypes, pre-cancerous lesions, or low metastatic potential cells relative to normal cells of the same tissue type can provide the basis for diagnostic tools, facilitates drug discovery by providing for targets for candidate agents, and further serves to identify therapeutic targets for cancer therapies that are more tailored for the type or subtype of cancer to be treated.
Breast cancer is the most frequently diagnosed cancer in women and the second leading cause of cancer deaths in women. According to the World Health Organization, more than 1.2 million people will be diagnosed with breast cancer yearly worldwide, with approximately 213,000 women in the United States are diagnosed with invasive breast cancer each year (Stages I-IV). The chance of developing invasive breast cancer during a woman's lifetime is approximately 1 in 8 (about 13%). Another 62,000 women will be diagnosed with in situ breast cancer, a very early form of the disease. It is estimated that approximately 40,970 women and 460 men die from breast cancer in the United States yearly.
An important problem associated with cancer chemotherapy is that the histology of the cancer does not predict an individual's response to a given agent or a given therapeutic protocol. Global gene expression profiling has uncovered previously unrecognized subsets of human breast cancer, including the “triple-negative” subtype. Breast tumors that lack expression of estrogen receptor, progesterone receptor and Her2-neu amplification (referred to as “triple negative subtype” or “triple negative breast cancer (TNBC)”) account for approximately 15% of all breast cancer diagnoses. The TNBC subtype is among the most refractory of human breast cancers, since it cannot be treated with effective hormonal and Trastuzumab-based therapies.
Cancer management relies on a combination of initial prevention, early diagnosis, appropriate treatment, and prevention of recurrence, and genetic testing can play a role in all of these management areas. All clinical oncologists desire a strategy to use customized information to make chemotherapy recommendations that are tailored specifically to a patient's tumor characteristics. The approach has enormous intuitive appeal and is more logical to both patients and physicians than the empiric approach, whereby all patients with similar tumor type are treated according to a standardized regimen. There remains, however, a need for identification of genetic marker profiles predictive or diagnostic of particular cancer subtypes. Identification of genetic markers associated with refractory cancers may also aid in identification of key molecular pathways that may be targeted for therapeutic purposes.
Provided herein, in part, are methods for predicting or diagnosing of a predisposition, existence, or recurrence of a triple negative cancer subtype, in particular, a triple negative subtype of breast cancer. Also provided are methods for assessing the progression of a triple negative breast cancer. Also provided herein are methods for determining sensitivity or resistance of tumor cells to particular chemotherapies, and methods for determining suitability of a therapeutic regimen for a breast cancer tumor.
Provided herein is a method for detecting existence of a triple negative breast cancer (TNBC) subtype in a tumor comprising analyzing the genome or transcriptome of a tumor tissue sample for the presence of a TNBC biomarker profile, wherein the presence of the TNBC biomarker profile indicates existence of the TNBC subtype in the tumor, wherein the TNBC biomarker profile comprises at least one alteration of the group consisting of an RB1 gene deletion, a PTEN gene deletion, an ERBB4 gene deletion, an ABCB1 gene mutation, a SLC9A11 gene translocation, a NCO6A gene translocation, a chromosome 11 translocation, a chromosome 16 translocation, a NF1 gene deletion, a FBXW7 gene deletion; INPP5F inversion; overexpression of AURKB, FOXM1, PLK1, AURKA genes, BRAF, ERAS, IQGAP3; and underexpression of PTEN, ERBB4, INPP4B, CTNNA1 and NOVA1 gene. In one example, the TNBC biomarker profile may comprise a deletion of exon in a RB1 gene. In some examples, the TNBC biomarker profile may comprise a deletion of exon 6 in a PTEN gene; the TNBC biomarker profile may comprise a deletion in an ERBB4 gene; the TNBC biomarker profile may comprise a mutation in an ABCB1 gene; the TNBC biomarker profile may comprise a chromosome 11 translocation and/or a chromosome 16 translocation; the TNBC biomarker profile may comprise a translocation of an SLC9A11 gene and a NCO6A gene; the TNBC biomarker profile may comprise overexpression of a gene selected from the group consisting of AURKB, FOXM1, PLK1, AURKA, BRAF, ERAS, and IQGAP3; the TNBC biomarker profile may comprise underexpression of a gene selected from the group consisting of PTEN, ERBB4, INPP4B and NOVA1; the TNBC biomarker profile may comprise copy number variation correlated with altered gene expression. In one example, the TNBC biomarker profile may comprise an overexpression of a FOXM1 gene.
Also provided is a method for treating TNBC by determining sensitivity or resistance of tumor cells to a particular chemotherapy comprising assessing a tumor tissue sample for a triple negative breast cancer (TNBC) biomarker profile, wherein the presence of TNBC subtype in the tumor sample indicates sensitivity or resistance of the tumor to a particular chemotherapy, wherein the TNBC biomarker profile comprises at least one alteration of the group consisting of an RB1 gene deletion, a PTEN gene deletion, an ERBB4 gene deletion, an ABCB1 gene mutation, a SLC9A11 gene translocation, a NCO6A gene translocation, a chromosome 11 translocation, a chromosome 16 translocation, a NF1 gene deletion, a FBXW7 gene deletion; INPP5F inversion; overexpression of AURKB, FOXM1, PLK1, AURKA genes, BRAF, ERAS, IQGAP3; and underexpression of PTEN, ERBB4, INPP4B, CTNNA1 and NOVA1 gene. In one example, the TNBC subtype is characterized by a biomarker profile comprising BRAF amplification and INPP4B under expression and is sensitive to treatment with combined MEK and AKT inhibitors. In another example, wherein the TNBC subtype is characterized by a biomarker profile comprising TOP2a and PBK overexpression and is sensitive to treatment with Eribulin. In yet another example, the TNBC subtype is characterized by a biomarker profile comprising IQGAP3 and AKT3 overexpression and INPP4B underexpression and is sensitive to treatment with BEZ235. In still another example, the TNBC subtype is characterized by a biomarker profile comprising ALK overexpression and is sensitive to treatment with ALK inhibitor. In one example, the TNBC subtype is characterized by a biomarker profile comprising FBXW7 and INPP4B underexpression and is sensitive to treatment with combined Taxol, Avastin and Everolimus. In a further example, wherein the TNBC subtype is characterized by a biomarker profile comprising DNA repair related gene mutations and is sensitive to treatment with combined BSI 201, Gemcitabine and Carboplatin. In another example, the TNBC subtype is characterized by a biomarker profile comprising IQGAP3 overexpression, INPP5F inversion and NEDD4 mutation and is resistant to treatment with BEZ 235. In still another example, wherein the TNBC subtype of the method is characterized by a biomarker profile comprising GART overexpression and is resistant to treatment with combined MEK and AKT inhibitors.
Further provided is a method for selecting a treatment for a subject with TNBC, and the method comprises: (a) obtaining a tumor tissue sample from the subject; (b) obtaining the TNBC biomarker profile of the sample and grouping the subject to a TNBC subtype; (c) determining sensitivity or resistance of the subject to a particular chemotherapy based on the relationship between the TNBC subtype and sensitivity or resistance to particular chemotherapies; and (d) selecting a treatment based on the TNBC subtype. The general method may further comprise administering an effective amount of both MEK and AKT inhibitors to the subject with a TNBC biomarker profile comprising BRAF amplification and INPP4B under expression. In another example, the method may further comprise administering an effective amount of Eribulin to the subject with a TNBC biomarker profile comprising IQGAP3 and AKT3 overexpression and INPP4B underexpression. The method may comprise administering an effective amount of ALK inhibitor to the subject with a TNBC biomarker profile comprising ALK overexpression. The method may also further comprise administering an effective amount of combined Taxol, Avastin and Everolimus to the subject with a TNBC biomarker profile comprising FBXW7 and INPP4B underexpression. In still another example, the general method may further comprise administering an effective amount of combined BSI 201, Gemcitabine and Carboplatin to the subject with a TNBC biomarker profile comprising DNA repair related gene mutations.
Somatic genomic alterations have been discovered in the triple negative subtype of breast cancer. Disclosed herein are particular genomic alterations found to be associated with TNBC and which can be used as a biomarker profile for TNBC. Such genomic alterations are useful for predictive purposes, diagnostic purposes, for methods for predicting treatment response of a tumor, methods for monitoring cancer progression, and methods for monitoring treatment progress, as described in further detail herein. Further applications of the TNBC biomarkers includes methods to determine sensitivity or resistance of tumor cells to particular chemotherapies and methods for selecting therapeutic options, as well as kits for use in the methods described herein.
The genomic alterations provided herein include deletions (for example, deletions comprising exons, introns, and/or splice sites) and translocations. Within the tumor, such genomic deletions or translocations can be homozygous or heterozygous. Genomic alterations also includes transcriptome perturbations that lead to altered gene or protein expression in the tumor cells compared to normal cells, including, for example, overexpression or underexpression.
As demonstrated herein, genomic alterations identified in TNBC include, but are not limited to, an RB1 gene deletion, a PTEN gene deletion, an ERBB4 gene deletion, an ABCB1 gene mutation, a SLC9A11 gene translocation, a NCO6A gene translocation, a chromosome 11 translocation, a chromosome 16 translocation, a NF1 gene deletion, and/or a FBXW7 gene deletion; overexpression of AURKB, FOXM1, PLK1, AURKA genes and/or, BRAF, ERAS, and/or IQGAP3; and/or underexpression of PTEN, ERBB4, INPP4B gene and/or NOVA1 gene. Alterations can serve as biomarkers for TNBC and can be used alone, or in combination with other markers, to generate a biomarker profile for TNBC. In one embodiment, the TNBC biomarker profile includes at least one of the genomic alterations described herein. In addition, these mutations include PTEN homozygous deletion or down-regulation, INPP4B down-regulation, FBXW7 homozygous deletion, and ERAS overexpression activate PI3K/AKT/mTOR pathway, which is an event signifying specific treatment options.
2. Genomic Alteration Enriched Pathways Associated with mTNBC
(i) PI3K/AKT/m TOR Pathway
The PI3K/AKT/mTOR pathway is an intracellular signaling pathway important in apoptosis and hence cancer e.g. breast cancer and non-small-cell lung cancer. The phosphatidylinositol-3-kinase (PI3K) signaling in this pathway is crucial to many aspects of cell growth and survival. It is targeted by genomic aberrations including mutation, amplification and rearrangement more frequently than any other pathway in human cancer. The PI3K/AKT/mTOR pathway cross-talks with other pathways including the RAS, p53 and retinoblastoma (Rb) at multiple levels and constitute a signaling network implicated in tumor initiation and progression. In addition, the PI3K pathway is stimulated as a physiological consequence of many growth factors and regulators. Without the full understanding of various mechanisms, it has been found that the activation of the PI3K pathway results in a disturbance of control of cell growth and survival, which contributes to a competitive growth advantage, metastatic competence and, frequently, therapy resistance. This pathway is therefore an attractive target for the development of novel anticancer agents.
However, as it is not presently clear whether patients with aberrations of particular isoforms, alleles, or molecules in the pathway will require treatment with drugs targeting particular points in the pathway, it is important to co-develop molecular markers and targeted therapeutics. Further, an ability to pre-select patients who are likely to respond, and to identify patients not responding at an early point during treatment and triage them to alternative therapies, will greatly increase the likelihood of demonstrating efficacy and decrease the size, cost and duration of clinical trials while concurrently protecting patient life, an approach favored by drug companies and recently the FDA.
In one embodiment, the TNBC biomarker profile includes at least one of the genomic alterations activating PI3K/AKT/mTOR pathway, and these mutations may comprise PTEN homozygous deletion or down-regulation, INPP4B down-regulation, FBXW7 homozygous deletion, and ERAS overexpression.
(ii) RAS/RAF/MEK/ERK Pathway
Growth factors and mitogens use the Ras/Raf/MEK/ERK signaling cascade to transmit signals from their receptors to regulate gene expression and prevent apoptosis. Some components of these pathways are mutated or aberrantly expressed in human cancer (e.g., Ras, B-Raf). Mutations also occur at genes encoding upstream receptors (e.g., EGFR and Flt-3) and chimeric chromosomal translocations which transmit their signals through these cascades. Even in the absence of obvious genetic mutations, this pathway has been reported to be activated in over 50% of acute myelogenous leukemia and acute lymphocytic leukemia and is also frequently activated in other cancer types (e.g., breast and prostate cancers). Importantly, this increased expression is associated with a poor prognosis.
The Ras/Raf/MEK/ERK and Ras/PI3K/PTEN/AKT pathways interact with each other to regulate growth and in some cases tumorigenesis. For example, in some cells, PTEN mutation may contribute to suppression of the Raf/MEK/ERK cascade due to the ability of activated AKT to phosphorylate and inactivate different Rafs. Although both of these pathways are commonly thought to have anti-apoptotic and drug resistance effects on cells, they display different cell lineage specific effects. For example, Raf/MEK/ERK is usually associated with proliferation and drug resistance of hematopoietic cells, while activation of the Raf/MEK/ERK cascade is suppressed in some prostate cancer cell lines which have mutations at PTEN and express high levels of activated AKT.
Furthermore the Ras/Raf/MEK/ERK and Ras/PI3K/PTEN/AKT pathways also interact with the p53 pathway. For example, Raf/MEK/ERK may promote cell cycle arrest in prostate cells and this may be regulated by p53 as restoration of wild-type p53 in p53 deficient prostate cancer cells results in their enhanced sensitivity to chemotherapeutic drugs and increased expression of Raf/MEK/ERK pathway. Thus in advanced prostate cancer, it may be advantageous to induce Raf/MEK/ERK expression to promote cell cycle arrest, while in hematopoietic cancers it may be beneficial to inhibit Raf/MEK/ERK induced proliferation and drug resistance. Thus the Raf/MEK/ERK pathway has different effects on growth, prevention of apoptosis, cell cycle arrest and induction of drug resistance in cells of various lineages which may be due to the presence of functional p53 and PTEN and the expression of lineage specific factors.
The present invention provides the selection of TNBC biomarker profile utilizing genomic alteration enriched pathways such as PI3K/AKT/mTOR and Raf/MEK/ERK found herein.
In one embodiment, the TNBC biomarker profile includes a RB1 gene deletion. RB1 (Retinoblastoma-associated protein, P06400) is a key regulator of entry into cell division that acts as a tumor suppressor. In some embodiments, the RB1 gene deletion results in a loss of exon 13 from the RB1 transcript. In certain embodiments, the RB1 gene deletion occurs between exons 12 and 14 and includes a splice donor site. In certain embodiments, the RB1 gene deletion results in a loss of amino acids 406N-444Q from the RB1 polypeptide.
In one embodiment, the TNBC biomarker profile includes a PTEN gene deletion. PTEN (Mutated in multiple advanced cancers 1, P60484) is a tumor suppressor. In some embodiments, the PTEN gene deletion results in a loss of exon 6 from the PTEN transcript. In certain embodiments, the PTEN gene deletion is a deletion of approximately 1 kb encompassing exon 6. In some embodiments, the PTEN deletion is homozygous in the TNBC. In some embodiments, the PTEN gene deletion results in a frameshift mutation and premature truncation of the PTEN polypeptide. In certain embodiments, the PTEN gene deletion results in a loss of exon 6 bases 493-634 from the PTEN transcript.
In one embodiment, the TNBC biomarker profile includes an ERBB4 gene deletion. ERBB4 (Receptor tyrosine-protein kinase erbB-4, Q15303) plays an essential role as cell surface receptor for neuregulins and EGF family members. In some embodiments, the ERBB4 gene deletion includes the deletion of approximately 4.3 kb of the ERBB4 gene and is intronic.
In one embodiment, the TNBC biomarker profile includes an ABCB1 gene mutation. ABCB1 (Multidrug resistance protein 1, P08183) are reported to be associated with susceptibility to inflammatory bowel disease
In one embodiment, the TNBC biomarker profile includes a chromosome 11 translocation and/or a chromosome 16 translocation.
In one embodiment, the TNBC biomarker profile includes an SLC9A11 gene translocation and/or a NCOA6 gene translocation. NCOA6 (Nuclear receptor coactivator 6, Q14686) directly binds nuclear receptors and stimulates the transcriptional activities in a hormone-dependent fashion. In some embodiments, the TNBC biomarker profile includes an SLC9A11-NCOA6 translocation. In certain embodiments, the SLC9A11-NCOA6 translocation results in a fusion transcript from the two genes. In some embodiments, the SLC9A11-NCOA6 translocation results in a fusion transcript comprising exons 2-9 of NCOA6 and exons 22-27 of SLC9A11. SLC9A11 (Sodium/hydrogen exchanger 11, Q5TAH2). SLC9A11 is reported to be involved in pH regulation.
In one embodiment, the TNBC biomarker profile includes a NF1 gene deletion. NF1 (Neurofibromin, P21359) Stimulates the GTPase activity of Ras. Defects in NF1 are the cause of many diseases including neurofibromatosis type 1 (NF1) and colorectal cancer. In some embodiment, the deletion is homozygous. In some embodiment, the NF1 gene deletion includes the homozygous deletion of a 19 bp coding region.
In one embodiment, the TNBC biomarker profile includes a FBXW7 gene deletion. FBXW7 (F-box/WD repeat-containing protein 7, Q969H0) is reported to recognize and bind to phosphorylated target proteins. In some embodiment, the FBXW7 gene deletion includes a homozygous deletion.
In one embodiment, the TNBC biomarker profile includes over expression of AURKB, FOXM1, PLK1, AURKA, BRAF, ERAS and/or IQGAP3 genes in the tumor tissue relative to normal cells of the same type of tissue. In another embodiment, the TNBC biomarker profile includes underexpression of PTEN, ERBB4, INPP4B and/or NOVA1 genes in the tumor tissue relative to normal cells of the same type of tissue.
In certain embodiments, the TNBC biomarker profile includes an over expression of a AURKB gene. AURKB is a serine/threonine-protein kinase component of the chromosomal passenger complex (CPC), a complex that acts as a key regulator of mitosis. The CPC complex has essential functions at the centromere in ensuring correct chromosome alignment and segregation and is required for chromatin-induced microtubule stabilization and spindle assembly. It is also involved in the bipolar attachment of spindle microtubules to kinetochores and is a key regulator for the onset of cytokinesis during mitosis.
In certain embodiments, the TNBC biomarker profile includes an over expression of a FOXM1 gene. FOXM1 is a forkhead transcription factor whose loss has been reported to lead to centrosome amplification and mitotic catastrophe in certain cancer cells, such as primary breast cancer (Wonsey et al., Cancer Research, 65:5181 (2005)). FOXM1 is a key regulator of cell proliferation and is overexpressed in a variety of primary cancer forms, such as breast cancer, cell carcinoma and hepatocellular carcinoma (Kwok, Molecular Cancer Research, 8:24 (2010)). The over expression of FOXM1 has been reported to mediate breast cancer resistance to treatment with Herceptin and Paclitaxel (Carr, Cancer Research, 70:5054 (2010)).
In certain embodiments, the TNBC biomarker profile includes an over expression of a PLK1 gene. PLK1 performs several important functions throughout M phase of the cell cycle, including the regulation of centrosome maturation and spindle assembly, the removal of cohesins from chromosome arms, the inactivation of anaphase-promoting complex/cyclosome (APC/C) inhibitors, and the regulation of mitotic exit and cytokinesis.
In certain embodiments, the TNBC biomarker profile includes an over expression of a AURKA gene. AURKA is a mitotic serine/threonine kinases that contributes to the regulation of cell cycle progression.
In certain embodiments, the TNBC biomarker profile includes an over expression of a BRAF gene. BRAF is involved in the transduction of mitogenic signals from the cell membrane to the nucleus. In certain embodiments, the TNBC biomarker profile includes an over expression of a ERAS gene. ERAS is a Ras protein that binds GDP/GTP and possess intrinsic GTPase activity. ERAS plays an important role in the tumor-like growth properties of embryonic stem cells.
In certain embodiments, the TNBC biomarker profile includes an over expression of a ERAS gene. ERAS is a Ras GTPase-activating-like protein, and is an oncogenic kinase.
In certain embodiments, the TNBC biomarker profile includes an under expression of a PTEN gene. PTEN is a tumor suppressor. It acts as a dual-specificity protein phosphatase, dephosphorylating tyrosine-, serine- and threonine-phosphorylated proteins. It also acts as a lipid phosphatase, removing the phosphate in the D3 position of the inositol ring from phosphatidylinositol 3,4,5-trisphosphate, phosphatidylinositol 3,4-diphosphate, phosphatidylinositol 3-phosphate and inositol 1,3,4,5-tetrakisphosphate.
In certain embodiments, the TNBC biomarker profile includes an under expression of a ERBB4 gene. ERBB4 is a tyrosine-protein kinase that plays an essential role as cell surface receptor for neuregulins and EGF family members and regulates development of the heart, the central nervous system and the mammary gland, gene transcription, cell proliferation, differentiation, migration and apoptosis.
In certain embodiments, the TNBC biomarker profile includes an under expression of a INPP4B gene. INPP4B catalyzes the hydrolysis of the 4-position phosphate of phosphatidylinositol 3,4-bisphosphate, inositol 1,3,4-trisphosphate and inositol 1,4-bisphosphate.
In certain embodiments, the TNBC biomarker profile includes an under expression of a NOVA1 gene. NOVA1 may regulate RNA splicing or metabolism in a specific subset of developing neurons.
In one embodiment, the TNBC biomarker profile includes any one or a combination of two or more of the genomic alterations described herein.
In one embodiment, the TNBC biomarker profile includes at least two of the following genomic alterations: an RB1 gene deletion, a PTEN gene deletion, an ERBB4 gene deletion, an ABCB1 gene mutation, a SLC9A11 gene translocation, a NCO6A gene translocation, a chromosome 11 translocation, a chromosome 16 translocation, a NF1 gene deletion, a FBXW7 gene deletion; INPP5F inversion; overexpression of AURKB, FOXM1, PLK1, AURKA genes, BRAF, ERAS, IQGAP3; underexpression of PTEN, ERBB4, INPP4B gene and/or NOVA1 gene.
In one embodiment, the TNBC biomarker profile includes at least three of the following genomic alterations: an RB1 gene deletion, a PTEN gene deletion, an ERBB4 gene deletion, an ABCB1 gene mutation, a SLC9A11 gene translocation, a NCO6A gene translocation, a chromosome 11 translocation, a chromosome 16 translocation, a NF1 gene deletion, a FBXW7 gene deletion; INPP5F inversion; overexpression of AURKB, FOXM1, PLK1, AURKA genes, BRAF, ERAS, IQGAP3; underexpression of PTEN, ERBB4, INPP4B gene and/or NOVA1 gene.
In one embodiment, the TNBC biomarker profile includes at least four of the following genomic alterations: an RB1 gene deletion, a PTEN gene deletion, an ERBB4 gene deletion, an ABCB1 gene mutation, a SLC9A11 gene translocation, a NCO6A gene translocation, a chromosome 11 translocation, a chromosome 16 translocation, a NF1 gene deletion, a FBXW7 gene deletion; INPP5F inversion; overexpression of AURKB, FOXM1, PLK1, AURKA genes, BRAF, ERAS, IQGAP3; underexpression of PTEN, ERBB4, INPP4B gene and/or NOVA1 gene.
In one embodiment, the TNBC biomarker profile includes at least two of: an RB1 gene deletion, a PTEN gene deletion, an ERBB4 gene deletion, an ABCB1 gene mutation, a SLC9A11 gene translocation, a NCO6A gene translocation, a chromosome 11 translocation, a chromosome 16 translocation, a NF1 gene deletion, a FBXW7 gene deletion. In another embodiment, the TNBC biomarker profile includes at least two of the following genomic alterations: overexpression of AURKB, FOXM1, PLK1, AURKA genes, BRAF, ERAS, IQGAP3; and/or underexpression of PTEN, ERBB4, INPP4B gene and/or NOVA1 gene.
In one aspect, provided herein are methods for predicting a predisposition, existence, or recurrence of a triple negative cancer phenotype, for example, a triple negative subtype of breast cancer. Such methods comprise assessing a sample of the suspect cancer tissue for the triple negative biomarker profile as described herein. With regard to a breast cancer tumor or suspect tissue sample, determination of the presence of the TNBC biomarker profile indicates a predisposition of the tumor to become a TNBC subtype or determination of the presence of the TNBC biomarker profile indicates that a TNBC subtype exists or has recurred in the tumor or suspect tissue
In one embodiment, a method is provided for predicting a predisposition of a tumor to be a TNBC subtype comprising analyzing the genome or transcriptome of a tumor tissue sample for the presence of a TNBC biomarker profile as described herein, wherein the presence of the TNBC biomarker profile indicates a predisposition of the tumor to be a TNBC subtype.
In one embodiment, a method is provided for predicting the existence of a TNBC subtype in a tumor comprising analyzing the genome or transcriptome of the tumor tissue sample for the presence of a TNBC biomarker profile as described herein, wherein the presence of the TNBC biomarker profile indicates existence of the TNBC subtype in the tumor.
In one embodiment, provided is a method for predicting of a predisposition of a tumor to be a TNBC subtype comprising analyzing the genome, transcriptome, or polypeptides of a tumor tissue sample for the presence of a TNBC biomarker profile comprising at least two of the following alterations: an RB1 gene deletion, a PTEN gene deletion, an ERBB4 gene deletion, an ABCB1 gene mutation, a SLC9A11 gene translocation, a NCO6A gene translocation, a chromosome 11 translocation, a chromosome 16 translocation, a NF1 gene deletion, a FBXW7 gene deletion; INPP5F inversion; overexpression of AURKB, FOXM1, PLK1, AURKA genes, BRAF, ERAS, IQGAP3; and underexpression of PTEN, ERBB4, INPP4B gene and/or NOVA1 gene.
In one embodiment, provided is a method for predicting existence of a TNBC subtype in a tumor comprising analyzing the genome, transcriptome or polypeptides of a tumor tissue sample for the presence of a TNBC biomarker profile comprising at least two of the following alterations: an RB1 gene deletion, a PTEN gene deletion, an ERBB4 gene deletion, an ABCB1 gene mutation, a SLC9A11 gene translocation, a NCO6A gene translocation, a chromosome 11 translocation, a chromosome 16 translocation, a NF1 gene deletion, a FBXW7 gene deletion; INPP5F inversion; overexpression of AURKB, FOXM1, PLK1, AURKA genes, BRAF, ERAS, IQGAP3; underexpression of PTEN, ERBB4, INPP4B gene and/or NOVA1 gene, wherein the presence of the TNBC biomarker profile indicates existence of the TNBC subtype in the tumor.
In another aspect, provided herein are methods for assessing the progression of a TNBC in a subject by detecting the level of a TNBC biomarker profile in a first sample from the subject at a first period of time, detecting the level of a TNBC biomarker profile in a second sample from the subject at a second period of time and comparing the level of the TNBC biomarker profiles. In some embodiments, the first sample is taken from the subject prior to being treated for the TNBC and the second sample is taken from the subject after being treated for the cancer.
Genomic alterations provided herein including deletions (for example, deletions comprising exons, introns, and/or splice sites) and translocations, whether homozygous or heterozygous. Genomic alterations also includes transcriptome perturbations that lead to altered gene or protein expression in the tumor cells compared to normal cells, including, for example, overexpression or underexpression. Methods, reagents, equipment, and software for genome and transcriptome sequencing and analysis are known in the art and commercially available. Methods, reagents, and equipment for detecting and/or measuring the amount of the TNBC biomarker nucleic acids and proteins are known in the art and commercially available.
Different expression may be because of various gene alleles in metastatic triple negative breast cancer (mTNBC) and non-metastatic TNBC cell. An allele includes any form of a particular nucleic acid that may be recognized as a form of the particular nucleic acid on account of its location, sequence, expression level, expression specificity or any other characteristic that may identify it as being a form of the particular gene. Alleles include but need not be limited to forms of a gene that include point mutations, silent mutations, deletions, frameshift mutations, single nucleotide polymorphisms (SNPs), inversions, translocations, heterochromatic insertions, and differentially methylated sequences relative to a reference gene, whether alone or in combination. An allele of a gene may or may not produce a functional protein; may produce a protein with altered function, localization, stability, dimerization, or protein-protein interaction; may have overexpression, underexpression or no expression; may have altered temporal or spacial expression specificity.
An allele may be compared to another allele that may be termed a wild type form of an allele. In comparison to the wild type allele, a different allele may be called a mutation or a mutant. Mutants may also be interchangeably called variants. In some cases, the wild type allele is more common than the mutant. A genetic mutation or variance may be any detectable change in genetic material such as DNA, or a corresponding change in the RNA or protein product of that genetic material. In the example of gene mutation, the DNA sequence of a gene or any controlling elements surrounding the gene is altered. Controlling elements include promoter, enhancer, suppressor or silencing elements capable of controlling a given gene. Other examples of mutations include alterations in the products of gene expression such as RNA or protein that result from corresponding mutations in the DNA.
Conserved variants encompass any mutation or other variant in which a given amino acid residue in a protein or enzyme has been changed without altering the overall conformation and function of the polypeptide, including, but not limited to, replacement of an amino acid with one having similar properties (such as, for example, polarity, hydrogen bonding potential, acidic, basic, hydrophobic, aromatic, and the like). Amino acids with similar properties are well known in the art. For example, arginine, histidine and lysine are hydrophilic-basic amino acids and may be interchangeable. Similarly, isoleucine, a hydrophobic amino acid, may be replaced with leucine, methionine or valine. Depending on the location of the variance in the overall context of the protein, some substitution may have little or no effect on the apparent molecular weight or isoelectric point of the protein or polypeptide.
Amino acids other than those indicated as conserved may differ in a protein or enzyme so that the percent protein or amino acid sequence similarity between any two proteins of similar function may vary and may be, for example, from about 70% to about 99% as determined according to an alignment scheme such as by the Cluster Method, wherein similarity is based on the MEGALIGN algorithm. The concept of a variant further encompasses a polypeptide or enzyme which has at least 60%, 75%, 85%, 90%, or 95% amino acid identity as determined by algorithms such as BLAST or FASTA and which has the same or substantially similar properties and/or activities as the native or parent protein or enzyme to which it is compared.
Another example of gene variant is a gain-of-function variant. Gain-of-function variants of polypeptides encompass any variant in which a change in one or more amino acid residues in a protein or enzyme improves the activity of the polypeptide. Examples of activities of a polypeptide that may be improved by a change resulting in a gain of function variant include but are not limited to enzymatic activity, binding affinity, phosphorylation or dephosphorylation efficiency, activation, deactivation, or any other activity or property of a protein that may be quantitatively measured by some method now known or yet to be disclosed.
The presence or absence of an allele may be detected through the use of any process known in the art, including using primers and probes designed according to a specific allele for PCR, sequencing, hybridization analyses.
One aspect of this invention provides a method of identifying subjects for treatment with specific chemo agents, and excluding subjects with chemo-resistance from chemotherapy using biomarkers. Specifically, the biomarkers disclosed in this invention include overexpression of AURKB, FOXM1, PLK1, AURKA genes, BRAF, ERAS, IQGAP3; underexpression of PTEN, ERBB4, INPP4B gene and/or NOVA1 gene, all of which are associated to mTNBC.
The expression of a biomarker in a sample may be more or less than that of a predetermined level to predict the presence or absence of a cellular or physiological characteristic. The expression of the biomarker or target in the test subject may be 1,000,000×, 100,000×, 10,000×, 1000×, 100×, 10×, 5×, 2×, 1×, 0.5×, 0.1×, 0.01×, 0.001×, 0.0001×, 0.00001×, 0.000001×, or 0.0000001× of the predetermined level indicting the presence or absence of a cellular or physiological characteristic. The predetermined level of expression may be derived from a single control sample or a set of control samples.
The expression level of a biomarker can be determined, for example, by comparing mRNA or protein level in a test subject or sample to a control. In one embodiment of this invention, the comparison is between the test subject or sample and the paired subject or sample that is without cancerous cells. In one embodiment, the expression of the biomarker in a sample may be compared to a control level of expression predetermined to predict the presence or absence of a particular physiological characteristic. The predetermined control level of biomarker expression may be derived from a single control or a set of controls. Alternatively, a control may be a sample having a previously determined control level of expression of a specific biomarker. Comparison of the expression of the biomarker in the sample to a control level of expression results in a prediction that the sample exhibits or does not exhibit the cellular or physiological characteristic.
Expression of a biomarker may be assessed by any number of methods used to detect material derived from a nucleic acid template used currently in the art and yet to be developed. Examples of such methods include any biomarker nucleic acid detection method such as the following nonlimiting examples, microarray analysis, RNA in situ hybridization, RNAse protection assay, Northern blot, reverse transcriptase PCR, quantitative PCR, quantitative reverse transcriptase PCR, quantitative real-time reverse transcriptase PCR, reverse transcriptase treatment followed by direct sequencing. Other examples include any method of assessing biomarker protein expression such as flow cytometry, immunohistochemistry, ELISA, Western blot, and immunoaffinity chromatography, HPLC, mass spectrometry, protein microarray analysis, PAGE analysis, isoelectric focusing, 2-D gel electrophoresis, or any enzymatic assay.
Other methods used to assess biomarker expression include the use of natural or artificial ligands capable of specifically binding a biomarker or a target. Such ligands include antibodies, antibody complexes, conjugates, natural ligands, small molecules, nanoparticles, or any other molecular entity capable of specific binding to a target. The term “antibody” is used herein in the broadest sense and refers generally to a molecule that contains at least one antigen binding site that immunospecifically binds to a particular antigen target of interest. Antibody thus includes but is not limited to native antibodies and variants thereof, fragments of native antibodies and variants thereof, peptibodies and variants thereof, and antibody mimetics that mimic the structure and/or function of an antibody or a specified fragment or portion thereof, including single chain antibodies and fragments thereof. The term thus includes full length antibodies and/or their variants as well as immunologically active fragments thereof, thus encompassing, antibody fragments capable of binding to a biological molecule (such as an antigen or receptor) or portions thereof, including but not limited to Fab, Fab′, F(ab′)2, facb, pFc′, Fd, Fv or scFv (See, e.g., CURRENT PROTOCOLS IN IMMUNOLOGY, (Colligan et al., eds., John Wiley & Sons, Inc., NY, 1994-2001).
Ligands may be associated with a label such as a radioactive isotope or chelate thereof, dye (fluorescent or nonfluorescent,) stain, enzyme, metal, or any other substance capable of aiding a machine or a human eye from differentiating a cell expressing a target from a cell not expressing a target. Additionally, expression may be assessed by monomeric or multimeric ligands associated with substances capable of killing the cell. Such substances include protein or small molecule toxins, cytokines, pro-apoptotic substances, pore forming substances, radioactive isotopes, or any other substance capable of killing a cell.
In addition, biomarker differential expression encompasses any detectable difference between the expression of a biomarker in one sample relative to the expression of the biomarker in another sample. Differential expression may be assessed by a detector, an instrument containing a detector, or by aided or unaided human eye. Examples include but are not limited to differential staining of cells in an IHC assay configured to detect a target, differential detection of bound RNA on a microarray to which a sequence capable of binding to the target is bound, differential results in measuring RT-PCR measured in ΔCt or alternatively in the number of PCR cycles necessary to reach a particular optical density at a wavelength at which a double stranded DNA binding dye (e.g. SYBR Green) incorporates, differential results in measuring label from a reporter probe used in a real-time RT-PCR reaction, differential detection of fluorescence on cells using a flow cytometer, differential intensities of bands in a Northern blot, differential intensities of bands in an RNAse protection assay, differential cell death measured by apoptotic markers, differential cell death measured by shrinkage of a tumor, or any method that allows a detection of a difference in signal between one sample or set of samples and another sample or set of samples.
Techniques using microarrays may also be advantageously implemented to detect genetic abnormalities or assess gene expression. Gene expression may be that of the one or more biomarkers chosen from AURKB, FOXM1, PLK1, AURKA, BRAF, ERAS, IQGAP3, PTEN, ERBB4, INPP4B, NOVA1 gene or the expression of another set of genes upstream or downstream in a pathway of which the one or more biomarkers is a component or a regulator. In one embodiment, microarrays may be designed so that the same set of identical oligonucleotides is attached to at least two selected discrete regions of the array, so that one can easily compare a normal sample, contacted with one of said selected regions of the array, against a test sample, contacted with another of said selected regions. Examples of microarray techniques include those developed by Nanogen, Inc. (San Diego, Calif.) and those developed by Affymetrix (Santa Clara, Calif.). However, all types of microarrays, also called “gene chips” or “DNA chips”, may be adapted for the identification of mutations. Such microarrays are well known in the art.
In one embodiment of detecting the presence of a biomarker associated with a characteristic of a disease, a threshold value may be obtained by performing the one or more above mentioned assays on samples obtained from a population of patients having a certain disease condition (chemo-resistant cancer, for example) and from a second population of subjects that do not have the disease condition. In assessing disease outcome or the effect of treatment, a population of patients with a disease condition may be followed for a period of time. After the period of time expires, the population may be divided into two or more groups based on one or more parameters. For example, the population may be divided into a first group of patients whose disease progresses to a particular endpoint and a second group of patients whose disease does not progress to the particular endpoint. Examples of endpoints include disease recurrence, death, metastasis, chemo response or resistance, or other clinically meaningful indexes. Based on the observation of the parameters, a predetermined level of expression of a biomarker for each group may be selected to signify a particular physiological or cellular characteristic including identifying or diagnosing a particular disease, assessing a risk of outcome or a prognostic risk, or assessing the risk that a particular treatment will or will not be effective. If expression of the biomarker in a test sample is more similar to the predetermined expression of the biomarker in one group relative to the other group, the sample may be assigned a risk of having the same outcome as the patient group to which it is more similar.
Additionally, a predetermined level of biomarker expression may be established by assessing the expression of a biomarker in a sample obtained first from one patient, assessing the expression of the biomarker in additional samples obtained later in time from the same patient, and comparing the expression of the biomarker from the samples later in time with the previous sample(s). This method may be used in the case of biomarker that indicates, for example, progression or worsening of disease or lack of efficacy of a treatment regimen or remission of a disease or efficacy of a treatment regimen.
In a preferred embodiment, predicting a test sample or subject's response to a therapy, such as a drug therapy, is based on the detection of an altered expression of a biomarker in the test subject in comparison to the expression level in a subject responsive to the therapy, and the one or more biomarkers is selected from AURKB, FOXM1, PLK1, AURKA, BRAF, ERAS, IQGAP3, PTEN, ERBB4, INPP4B, NOVA1.
3. TNBC Biomarker Profile Guided mTNBC Therapy
Identification of a tumor as a TNBC subtype would provide valuable information for determining treatment regimens and options for the patient. In another aspect, provided herein are methods for determining sensitivity or resistance of tumor cells to particular chemotherapies and methods for determining suitability of a therapeutic regimen for a breast cancer tumor. Such methods comprise assessing a tumor tissue sample for the TNBC biomarker profile described herein, wherein the presence of TNBC subtype in the tumor sample would indicate the sensitivity or resistance of the tumor to particular chemotherapies and/or that particular therapeutic regimens with little or no therapeutic value to the patient. Such a determination would save the patient valuable time and resources.
Studies have indicated that the triple negative subtype of breast cancer may disproportionally affect African American women and that women of black African ancestry may comprise a high-risk population for TNBC. Accordingly, in certain embodiments, the methods provided relate to subjects that include black African ancestry such as populations comprising persons of African descent or lineage. Black African ancestry may be determined by self reporting as African-Americans, Afro-Americans, Black Americans, or being a member of the black race. For example, African Americans or Black Americans are those persons living in North America and having origins in any of the black racial groups of Africa. In another example, self-reported persons of black African ancestry may have at least one parent of black African ancestry or at least one grandparent of black African ancestry.
Examples of chemotherapy medications for breast cancer include: Abraxane (chemical name: paclitaxel), Adriamycin (chemical name: doxorubicin), carboplatin (brand name: Paraplatin), Cytoxan (chemical name: cyclophosphamide), daunorubicin (brand names: Cerubidine, DaunoXome), Doxil (chemical name: doxorubicin), Ellence (chemical name: epirubicin), fluorouracil (also called 5-fluorouracil or 5-FU; brand name: Adrucil), Gemzar (chemical name: gemcitabine), Halaven (chemical name: eribulin), Ixempra (chemical name: ixabepilone), methotrexate (brand names: Amethopterin, Mexate, Folex), Mitomycin (chemical name: mutamycin), mitoxantrone (brand name: Novantrone), Navelbine (chemical name: vinorelbine), Taxol (chemical name: paclitaxel), Taxotere (chemical name: docetaxel), thiotepa (brand name: Thioplex), vincristine (brand names: Oncovin, Vincasar PES, Vincrex), Xeloda (chemical name: capecitabine). In many cases, chemotherapy medicines are given in combination, known as chemotherapy regimens. In early stage breast cancer, standard chemotherapy regimens lower the risk of the cancer coming back. In advanced breast cancer, chemotherapy regimens make the cancer shrink or disappear in about 30-60% of people treated. However, every cancer responds differently to chemotherapy, and many are resistant to one or more chemotherapy agents.
In one embodiment, the presence of TNBC subtype in the tumor sample characterized by a biomarker profile comprising BRAF amplification and INPP4B under expression indicates sensitivity to treatment with combined MEK and AKT inhibitors. In one embodiment, the MEK inhibitor is GSK1120212, and the AKT inhibitor is GSK2141795.
In one embodiment, the presence of TNBC subtype in the tumor sample characterized by a biomarker profile comprising TOP2a and PBK overexpression indicates sensitivity to treatment with Eribulin. TOP2a (DNA topoisomerase 2-alpha, P11388) controls topological states of DNA by transient breakage and subsequent rejoining of DNA strands. Topoisomerase II makes double-strand breaks. PBK (Lymphokine-activated killer T-cell-originated protein kinase, Q96 KB5), when phosphorylated, forms a complex with TP53, leading to TP53 destabilization and attenuation of G2/M checkpoint during doxorubicin-induced DNA damage.
In one embodiment, the presence of TNBC subtype in the tumor sample characterized by a biomarker profile comprising IQGAP3 and AKT3 overexpression and INPP4B underexpression indicates sensitivity to treatment with BEZ235. AKT3 (RAC-gamma serine/threonine-protein kinase, Q9Y243) regulates many processes including metabolism, proliferation, cell survival, growth and angiogenesis through serine and/or threonine phosphorylation of a range of downstream substrates.
In one embodiment, the presence of TNBC subtype in the tumor sample characterized by a biomarker profile comprising ALK overexpression indicates sensitivity to treatment with ALK inhibitor including LDK378. ALK (ALK tyrosine kinase receptor, Q9UM73) plays an important role in the genesis and differentiation of the nervous system.
In one embodiment, the presence of TNBC subtype in the tumor sample characterized by a biomarker profile comprising FBXW7 and INPP4B underexpression indicates sensitivity to treatment with combined Taxol, Avastin and Everolimus.
In one embodiment, the presence of TNBC subtype in the tumor sample characterized by a biomarker profile comprising DNA repair related gene mutations indicates sensitivity to treatment with combined BSI 201, Gemcitabine and Carboplatin.
In one embodiment, the presence of TNBC subtype in the tumor sample characterized by a biomarker profile comprising IQGAP3 overexpression, INPP5F inversion and NEDD4 mutation indicates resistance to treatment with BEZ 235.
In one embodiment, the presence of TNBC subtype in the tumor sample characterized by a biomarker profile comprising GART overexpression indicates resistance to treatment with combined MEK and AKT inhibitors.
Examples of current investigational or off-label cancer drugs under the TNBC biomarker guided treatment include, but not limited to, BEZ235, or NVP-BEZ235, an PI3K inhibitor being investigated as a possible cancer treatment; velcade, a drug approved for the treatment of multiple myeloma and mantle cell lymphoma (a cancer of lymph nodes); pemitrexed, approved to be used alone or with other drugs to treat malignant pleural mesothelioma in patients who cannot be treated with surgery and advanced or metastasized non-small cell lung cancer; ENZ2208, for advanced cancers; Curcumin, being investigated for treating multiple myeloma, pancreatic cancer, myelodysplastic syndromes, colon cancer, psoriasis, and Alzheimer's disease; Everolimus, currently used as an immunosuppressant to prevent rejection of organ transplants and treatment of renal cell cancer; BSI 201, an investigational anticancer agent being studied in multiple cancers, including Phase II and Phase III clinical trials in breast, lung and ovarian cancers.
The invention further provides kits that facilitate the detection of altered expression of one or more biomarkers associated to mTNBC. The kit may comprise one or more reagents to identify a patient for elimination from a chemotherapy treatment. The reagents in the kit may be primers, probes, and/or antibodies that are capable of identifying a biomarker or target associated to mTNBC.
The kit that facilitates nucleic acid based assays may further comprise one or more of the following: nucleic acid extraction reagents, controls, disposable cartridges, labeling reagents, enzymes including PCR amplification reagents such as the DNA polymerases Taq or Pfu, reverse transcriptase, or one or more other polymerases, and/or reagents that facilitate hybridization.
In another embodiment, the kit may further comprise a label that can be used to label the primer or probe oligonucleotide. A label may be any substance capable of aiding a machine, detector, sensor, device, or enhanced or unenhanced human eye from differentiating a sample that that displays positive expression from a sample that displays reduced expression. Examples of labels include but are not limited to: a radioactive isotope or chelate thereof, a dye (fluorescent or nonfluorescent,) stain, enzyme, or nonradioactive metal. Specific examples include but are not limited to: fluorescein, biotin, digoxigenin, alkaline phosphatase, biotin, streptavidin, 3H, 14C, 32P, 35S, or any other compound capable of emitting radiation, rhodamine, 4-(4′-dimethylaminophenylazo)benzoic acid (“Dabcyl”); 4-(4′-dimethylamino-phenylazo)sulfonic acid (sulfonyl chloride) (“Dabsyl”); 5-((2-aminoethyl)-amino)-naphtalene-1-sulfonic acid (“EDANS”); Psoralene derivatives, haptens, cyanines, acridines, fluorescent rhodol derivatives, cholesterol derivatives; ethylene diamine tetra-acetic acid (“EDTA”) and derivatives thereof or any other compound that signals the presence of the labeled nucleic acid. In one embodiment of the invention, the label includes one or more dyes optimized for use in genotyping. Examples of such dyes include but are not limited to: dR110, 5-FAM, 6FAM, dR6G, JOE, HEX, VIC, TET, dTAMRA, TAMRA, NED, dROX, PET, BHQ+, Gold540, and LIZ.
In yet another embodiment, the primers and probes in the kit may have been labeled, and can be applied without labeling process in PCR, sequencing reaction, or binding to a solid substrate such as oligonucleotide array.
The kit that facilitates the detection of altered expression of one or more biomarkers or targets associated to mTNBC may also comprise instructions for use. In one embodiment, the kit may further comprise an indication that links the output of the assays provided by the kit to a particular result. For example, an indication may provide guide to associate the presence or absence of one or more sequences to a specific treatment plan. The output of the assay may be in a form of a particular sequence, a particular genotype, a particular expression level in a real-time quantitative PCR reaction, a level of fluorescence or radioactive decay, a value derived from a standard curve, or from a positive or negative control, or any combination of these and other outputs. The indication may be printed on a writing that may be included in the kit or it may be posted on the Internet or embedded in a software package. The writing may include graphical depictions of results such as a photomicrograph or amplification plot.
The kit that facilitate the detection of altered expression of one or more biomarkers or targets associated to mTNBC may further comprise a device used to collect the sample. Such devices may include but need not be limited to: swabs, needles, blood collection tubes, wipes, or any other apparatus that may be used to collect a biological sample from a subject.
The following examples are provided by way of illustration and not by way of limitation.
Samples of TNBC tumor and non-tumor normal tissue were obtained from an African American female subject (age 53) for whole-genome and transcriptome sequencing and sequence analysis. The subject's primary treatment was adjuvant therapy (anthracyclines and taxanes followed by paclitaxel (AC/T). In response, the tumor “melted away” with cycle 1 of AC and at definitive surgery, the subject had a pathologic complete response with no cancer in her breast or axillary lymph nodes. Two years later the subject presented with a very rapidly growing ipsilateral internal mammary lympho node protruding from her chest and invading into her sternum as was as some other mediastinal lymph nodes. The initial disease responded to a secondary treatment of gemcitabine/carboplatin plus iniparib (PARP inhibitor) for 8 cycles (6 months). The subject has not had BRCA1/2 testing.
Tissue sample preparation, sequencing, and analysis was performed using the SOLiD™ 4.0 System (Applied Biosystems) and manufacture's protocols.
For the whole-genome sequencing study, four simultaneous SOLiD™ 4 System runs were performed with 50 bp×50 bp mate-pairs, 1.5 kb inserts, and two slides per run. A total of eight slides were run. The tumor and germline genomes were sequenced and analyzed using analysis tools (including BFAST and SOLiD™ BioScope™ Software (Applied Biosystems)) to uncover somatic alterations.
For the transcriptome sequencing, libraries were prepared from RNA of the tissue samples and two libraries per sample were processed using one 50 bp fragment library per quad and 50 bp×35 bp paired end runs later. Libraries were prepared from two ethnicity-matched population controls of hyperplastic breast tissue and two replicates per sample were processed. Tumor and control transcriptomes (RNA-seq) were sequenced and analyzed using tools including SOLiD™ BioScope™ Software, EdgeR, and DESeq to compare RNA expression.
Analysis of the whole-genome and transcriptome sequencing results revealed genomic alterations in the TNBC tumor tissue compared with the normal tissue, including, for example, point mutations, small indels, copy number alterations (e.g. gains and/or losses), and translocations (see Table 1). Some somatic alterations associated with the TNBC included exon deletions, intronic deletions and translocations. The transcriptome analysis revealed consequences of a number of the genomic alterations including exon skipping and the generation of an apparent fusion transcript. In addition, transcriptome analysis revealed differential gene expression between the TNBC and normal tissue, including transcriptome perturbations in cell cycle gene expression.
One genomic alteration identified in the TNBC is a deletion in the Retinoblastoma-associated protein 1 (RB1) gene. Specifically, the deletion in the RB1 gene occurs between exons 12 and 14 and includes a splice donor site. This deletion results in exon skipping and the loss of exon 13 from the RB1 transcript. With the deletion at RB1 1288-1332+2del47, exon 12 is joined in frame with exon 14 and amino acids 406N-444Q are deleted from the RB1 polypeptide.
Another genomic alteration identified in the TNBC is a homozygous deletion of approximately 1 kb encompassing exon 6 of the phosphatase and tensin homolog (PTEN) gene. The PTEN gene deletion results deletion of exon 6 bases 493-634 from the transcript and in a frameshift mutation and premature truncation of the polypeptide. The frameshift mutation occurs after G165 and the polypeptide truncates at amino acid 173. PTEN was significantly underexpressed in the TNBC tissue. Immunohistochemical analysis of the TNBC tissue also demonstrates that the truncating mutation results in loss of the PTEN protein.
Another genomic alteration identified in the TNBC is a deletion of approximately 4.3 kb in the ERBB4 gene. The ERBB4 deletion is intronic and results in a loss of expression of the ERBB4 gene.
Another genomic alteration identified in the TNBC is a single nucleotide mutation of the ABCB1 gene.
Another genomic alteration identified in the TNBC are translocations including chromosome 11 and chromosome 16.
Another genomic alteration identified in the TNBC are apparent translocations with somatic discordant mate-pairs mapping to chromosome 1 (SLC9A11) and chromosome 20 (NCOA6). The RNA-seq data from the transcriptome analysis provides strong evidence for a fusion transcript comprising exons 2-9 of NCOA6 and Exons 22-27 of SLC9A11.
In addition, differential gene expression between the TNBC and normal tissue was found for a number of genes, including up-regulation and down-regulation of gene expression. For example, AURKB, FOXM1, PLK1 and AURKA were found to be over-expressed in the tumor tissue (see further in Example 2).
From the patient described here in Example 1, 1.5 kb mate pair libraries from tumor and normal tumor tissue were sequenced on SOLiD™ version 4 (Applied Biosystems) to a depth of 30× mapped using 50mer reads. Tumor tissue purity was estimated at 95% tumor content. The tumor transcriptome was sequenced to 120 million reads comprising four replicates and compared against the transcriptome sequencing of an ethnicity matched population based control hyperplastic breast tissue.
Copy number variation (CNV) and gene expression were analyzed. Differentially expressed genes and 279 regions of CNV were found. 230 CNV regions overlapped the exons of annotated genes. While copy number change was significantly correlated to gene expression differences (2e-16), the effect was relatively small, explaining only 1.5% of the gene expression variation. Noting that while the correlation (0.3) and explained percent variance (0.093%) was higher in genes that were highly expressed (RPKM>0.1), there were still a large number of genes (2070) that were switched off or had significantly lower expression in tumor despite having high expression in normal tissue. Accordingly, expression of these genes appears to be disrupted.
Although significant progress has been made towards treatment of hormone receptor (estrogen, progesterone) positive and HER2-neu positive breast tumors, women diagnosed with triple negative breast cancer (TNBC), or tumors that show no expression of these receptors, have few options and outcomes remain relatively poor. Furthermore, there is evidence that TNBC is more prevalent among young pre-menopausal women of recent west African descent, namely African American women. In many cases, TNBC exhibits phenotypic and genotypic characteristics similar to the basal-like or BRCA-associated breast cancers. These similarities have led to the development of therapies aimed at BRCA molecular mechanisms such as the class of PARP inhibitors that are showing promise in clinical trials with significant increases in time to progression and survival. However some TNBC have very different genotypic profiles and represent a heterogeneous class of aggressive breast cancers. In many cases, patients with an initial diagnosis of TNBC are cured with adjuvant chemotherapy and surgery, but roughly 30% of patients have rapid relapse of chemoresistant disease. In patients with metastatic chemoresistant TNBC, molecularly guided therapies can improve outcomes in patients as compared to conventional therapeutic decision methods. In this invention, a disease specific approach was undertaken to identify molecularly targeted treatment options for patients with chemoresistant metastatic TNBC. To uncover targets within each patient's tumor, both whole genomes and transcriptomes were interrogated utilizing Next Generation Sequencing technologies (NGS). The results of deep whole genome and transcriptome sequencing of a metastatic tumor from a patient with chemoresistant TNBC may be used to discover therapeutically actionable targets for treatment to assist the patient treatment.
The integrated analysis of matched normal and tumor whole genome and tumor transcriptome sequencing data were from four African American patients with metastatic chemo-resistant TNBC for better understanding of the compendium of somatic events occurring in these tumors within this high-risk population.
Material and methods: Greater than 1 cm3 surgical fresh frozen tumor specimens were collected during surgery and quality assessed for tumor cellularity, necrosis, crush artifact, etc. A blood sample was also provided for the collection of constitutional genomic DNA. For differential expression analysis, a series of fresh frozen hyperplastic epithelial breast tissue specimens were obtained from, which were matched for age and ethnicity. RNA and DNA were extracted from biospecimens using the Qiagen All Prep kit (Germantown, Md.). Table 2 provides information regarding patients and samples.
Next Generation Sequencing was performed using the SOLiD version 4.0 system. Patient matched tumor and germline genomes were sequenced to >20× coverage and analyzed using custom paired analysis tools to uncover somatic alterations. Tumor transcriptomes were sequenced (RNA-seq) to 30 million uniquely aligned reads using Life Technologies Bioscope (Applied Biosystems, Carlsbad, Calif.). For expression analysis, patient RNA-seq data are compared to data generated from ethnicity-matched population-based control hyperplastic breast tissue using EdgeR (Robinson et al, 2010).
On average, over 3 million germline variants were detected in the four African American patients. Somatic paired analysis has revealed point mutations, small indels, copy number alterations, and translocations, in known cancer genes, and has revealed potential novel genes as well. The germline variation review result also provides information on inherited breast cancer mutations. Among four African American patients sequenced, both unique and common genomic and transcriptomic perturbations were uncovered. In all patients, unique somatic genomic alterations are associated with upregulation of specific signaling pathways.
RNA-seq-based expression analysis has revealed a profile dominated by perturbations in cell cycle mitotic checkpoint defects. Furthermore, integrated analysis also provided key insights into the transcriptional consequences of a number of genomic alterations including exon skipping events and the discovery of potential fusion transcripts. Importantly, DNA and RNA changes have been validated by independent methods and technologies, such as expression microarray analysis, and Sanger sequencing.
Deep genome and transcriptome profiling of chemo-resistant TNBC has provided insights into events occurring in this cancer subtype which is difficult to treat. This invention provides the leveraging of these data to provide an opportunity for informed therapeutic options for intervention of this cancer subtype.
Paired Genome Sequencing—
All NGS was carried out using the SOLiD™ version 4.0 mate-pair chemistry (Applied Biosystems by Life Technologies, Foster City, Calif.) following the manufacturer's instructions. Briefly, for each mate-pair sequencing library, 20 micrograms (mg) high molecular weight genomic DNA was randomly fragmented using the Hydroshear (Applied Biosystems of Life Technologies, Foster City, Calif.) to produce libraries with of approximately 1.5 kilobasepairs (kb) inserts on average. Two independent mate-pair libraries were generated from the tumor DNA and two libraries from the patient's constitutional DNA. Emulsion PCR was used to amplify libraries, which were subsequently enriched using the EZBead emulsion PCR system (Applied Biosystems of Life Technologies, Foster City, Calif.) using the manufacturer's recommendations. Approximately 500,000,000 enriched templated sequencing beads were deposited on each SOLiD™ version 4.0 flowcell slide and sequenced to generate 50×50 bp mate-pair reads per templated bead.
Transcriptome Sequencing.—
Whole transcriptome sequencing was carried out using the SOLiD™ Total RNA-Seq kit (Applied Biosystems of Life Technologies, Foster City, Calif.) using the manufacturer's recommendations. Briefly, 10 μg of total RNA was depleted of ribosomal RNA species using the Ribo-Minus kit (Invitrogen by Life Technologies, Carlsbad, Calif.) using the manufacturer's recommendations). To generate SOLiD RNA-Seq fragment libraries, 200-500 nanograms (ng) of ribosomal RNA depleted total RNA was fragmented by sonication. Size selected fragments were ligated to SOLiD™ RNA-Seq specific sequencing adaptors and a reverse transcription was carried out to generate cDNA. These cDNA molecules are further size selected and amplified. An emulsion PCR was carried out and ˜100,000,000 enriched templated sequencing beads were deposited for sequencing for each RNA-Seq library.
Paired Somatic Single Nucleotide Variant Detection—
For paired somatic single nucleotide detection, SolSNP was utilized to detect single nucleotide variants. However, variants are called conservatively in germlines in order to reduce false positives, and variants are called liberally in tumors in order to reduce false negatives. A custom script was then utilized to compare variants to search for tumor specific changes using an algorithm, which provides a Phred like statistical score for likelihood of a somatic variant being truly tumor specific.
Paired Somatic Copy Number Analysis—
For copy number analysis, a custom tool was developed based on a sliding window comparison of coverage for tumor/normal. These analyses utilized a 1 kb sliding window, with coverage differences output as log 2 ratios and plotted against the genome to identify regions of gain or loss in the tumor.
Paired Somatic Insertion/Deletion Detection—
For detecting somatic indels a two step strategy was employed. In the first step we removed from the tumor sample bam, reads whose insert size lay outside a 500 bp-5000 bp interval for SOLiD. Genome Analysis Toolkit (GATK) is then used to generate a list of potential small indels from this bam. A customized Perl script, which uses the Bio-SamTools library from BioPerl 9Stajich et al Genome Res. 2002 October; 12(10):1611-8), takes these indel positions and for each of the indels looks at the region in the normal sample consisting of 5 bp upstream from the start and 5 bp downstream from the end of the indel. An indel is determined to be somatic only if there was no indel detected in the region under consideration.
Paired Somatic Translocation Detection—
A series of customized Perl scripts were employed in the detection of translocation. These scripts used SAMtools (McKenna A. et al Genome Res. 2010 September; 20(9):1297-303) internally to access the bam files. The algorithm consists of two steps. The first is to detect potential translocation in both tumor and normal samples. The second is a comparison of potential translocations in tumor to those detected in the normal sample to weed out potential false positives. The detection of a potential translocation was an exercise in outlier detection. A sliding window of 2 kbp was focused and the discordant reads were counted, whose mates align on a different chromosome. A 2 kbp window size was used, as it is close to the mean of the estimated insert size distribution, and gave the best resolution for the detection of an interchromosomal translocation using 1.5 kb mate pair libraries. For each window we chose the highest hit to be the chromosome to which mates of most of the discordant reads mapped. For purposes of brevity, the subset of discordant reads whose mate maps to the highest hit in the window as the hit discordant reads was called. The ratios of the hit discordant reads were compared to the total aligned reads, across all the windows to detect potential outliers. Outlier detection was performed under the assumption that the distribution, of the proportion of hit discordant reads in a 2 kb window aggregates across the chromosome, and will follow a normal distribution. The mean of this distribution was then computed and a cutoff of 3 standard deviations was chosen. The window with a proportion of hit discordant reads, higher than this cutoff contains the region of potential translocation. The actual region of translocation was then determined by the span of the hit discordant reads in the window. For somatic translocations, the normal and the tumor sample were called separately and regions of overlap were eliminated. These regions were further inspected visually to reduce false positives and arrive at the most confident list.
Germline Variant Detection—
To identify SNPs and variation in patient germline DNA samples, SolSNP was utilized, which is an individual sample variant detector (classifier) implemented in Java. The variant calling was based on a modified Kolmogorov-Smirnov like statistic. The algorithm is non-parametric and makes no assumptions on the nature of the data. It compared the discrete sampled distribution, the pileup on each strand, to the expected distributions (according to ploidy). An important aspect of SolSNP that reduces overcalling inherent to the K-S statistic algorithm is that filters are included to reduce false positive rates, among of which is that both strands must provide evidence for the variation.
Transcriptome Analysis—
SOLiD™ BioScope™ Whole Transcriptome Analysis (WTA) pipeline for single reads was used to align the reads, count aligned reads per exon, and calculate per base coverage. Reads were aligned to three references: whole human genome build 36, splice junction reference derived from refSeq annotations and a filter reference which contains rRNA, tRNA, single-base-repeat (e.g. poly-A and T) and adapter sequences. Aligned reads from the alignments were merged into a final mapped reads BAM file. Any reads aligned to the filter genome were not included in the mapped reads BAM. Filtered reads were put in a filtered reads BAM and unmapped reads were put in a separate unmapped reads BAM file. WTA pipeline CountTags module provided normalize RPKM (Reads Per Kilobase of exon sequence, per Million reads) values along with read counts per exon. Bam2Wig module calculated coverage per base and produces WIG files containing the coverage information. BioScope 1.2.1 version was used with all default WTA settings.
Differential Expression Analysis—
EdgeR, a Bioconductor package specifically for the differential expression analysis on the counts per gene from the tumor and normal samples, was used. Gene counts were normalized using EdgeR's TMM (trimmed mean M(=log fold-change gene expression)) algorithm. TMM attempts to adjust for unequal transcriptome sizes between samples by calculating an “effective” library size that is used to normalize the counts. The “effective” library size was calculated by multiplying or dividing the original library size by an offset. EdgeR's differential expression analysis was based on the assumption that the biological replicates follow a negative binomial distribution and technical replicates followed a Poisson distribution. For this study, the normal samples' technical replicates were merged and the analysis was done on the biological replicates. Gene expression was corrected using a moderated binomial dispersion correction, and then an exact test (similar to Fisher's exact test) was used to assess differential expression (Table 7). EdgeR's exact test produced a table of log concentrations, log fold-changes, p-values and false discovery rate (FDR).
Five metastatic chemo-resistant TNBC normal/tumor pairs were completed in both whole genome and transcriptome sequencing. Quality control and assessment suggests high quality genome and transcriptome data. Initial analyses revealed several novel and known events involved in TNBC (Table 8). Somatic alterations at the ERBB4 locus may be common in metastatic chemo-resistant TNBC among African American women. ERBB4 intronic 4 kb homozygous deletion results in loss of expression in TNBC-001. In addition there is a significant intrachromosomal 21 Mb deletion on chromosome 2 in mate pairs, which breaks ERBB4 and ATG16L1 in TNBC-007. Further analysis uncovered new insights into the genes and molecular pathways commonly perturbed in metastatic chemo-resistant TNBC among African American women.
The DNA and RNA of TNBC samples were sequenced in order to identify relationships between somatic changes and expression. Integrated analysis of matched normal and tumor whole genome data, along with tumor transcriptome sequencing data were obtained from multiple samples with metastatic chemo-resistant TNBC. Two independent 1.5 kb Mate-pair libraries were generated for both tumor and germ-line derived genomic DNA and sequenced using SOLiD™ 4.0 paired 50mers to a target of 30× depth.
The tumor transcriptome was sequenced on four replicates and compared to transcriptome sequencing from ethnicity-matched population-based control hyperplastic breast tissue. Genome analysis was performed using multiple aligners and variant callers. Transcriptome alignment was performed using Bioscope™ pipeline, and differential expression analysis was performed using EdgeR (Robinson et al, 2010) and DESeq (Anders, 2010). Germline and somatic variants were annotated by integrative analysis with differential expression results. Several striking examples of intronic events correlating with either altered splicing or differential expression were observed in genes suggesting that transcriptomic data may have high value in interpreting somatic events that fall outside of coding regions. Final integration of data was validated through knowledge mining and convergence of somatic events and expression.
For RNA sequencing, normal breast tissue was collected from two human populations to match and/or compare to gene expression in samples with tumors. The specialized tumor-normal variant call pipeline included a somatic ‘copy number variant’ (CNV) caller, a germline SNP caller, two annotated somatic ‘single nucleotide variant’ (SNV, i.e., single nucleotides that differ from the standard human reference sequence) callers, a somatic translocation tool and an annotated somatic indel caller based on GATK.
Somatic mutation calls by the pipeline were manually evaluated using Integrative Genomics Viewer (IGV). By viewing the normal and tumor BAM files on the same slide it is possible to look at the distribution and enrichment of patterns to eliminate false positives that occur due to complexity of the human genome. Allelic ratios calculated from tumor RNA libraries showed increased variation compared to DNA allelic ratios suggesting the extent of functionally important regulatory variation.
Following two stringency criteria, it was determined that 105 and 2167 genes (enrichments summarized in Table 9) respectively, that are differentially expressed in 7 TNBC samples when compared to two populations of normal RNAs (from African American and Caucasian American samples). These enriched genes usually clustered together, and the enriched categories of gene ontology, pathways and protein domains of most differentially expressed genes are shown in Table 9:
It is important to find enriched pathways in order to understand the underlying changes in cellular biology for specific samples. Enriched pathways were detected by pathway network analysis.
In
Splicing and gene fusion events were also called for using LifeScope 2.0™ WT package (Life Technologies, Applied Biosystems, Carlsbad, Calif.). Several known and putative splicing junctions were found that were up-regulated or down-regulated in multiple samples in comparison to RNA normals. The MA plot highlights the 500 most highly differentially expressed junctions (
Among the alternative splicing regulations discovered above, transcription factors such as Nova1 silence or enhance exons on pre-mRNAs containing at least one type of YCAY cluster motif. An overall correlation between NOVA1 depletion (on 5 out of 7 TNBC samples) and its targets was observed. Two particular exons (CLATN1 and RAP1GAP) showing correlation of regulation with NOVA1 are shown in
The clinical application of next generation sequencing toT comprehensively characterize groups of driving mutations in individual metastatic triple negative breast cancer (mTNBC) genomes has the potential to reveal therapeutically relevant pathway dependencies (See Example 2 Table 6). Tissue from 14 patients with mTNBC were harvested and deep whole genome and transcriptome sequencing were conducted for each case to identify mutations that can guide therapeutic targeting within available phase I/II clinical trials.
As detailed in previous examples, the Life Technologies SOLiD system was utilized to sequence germline and tumor DNA to sufficient depth to identify somatic genome alterations including point mutations, indels, and structural events including translocations. Furthermore, RNA-seq was performed on these tumors, along with a series of age and ethnicity matched normal breast controls to perform deep differential expression analysis, isoform expression analysis, and fusion transcript detection. Investigational therapeutic options for each patient were examined by evaluating the sequencing findings.
The whole genome and transcriptome sequencing study revealed numerous known and novel mutations in mTNBC. All patients' cancers analyzed to date had alterations that would activate the MAPK pathway, however, through various mechanisms in different patients. These metastasized TNBC associated alterations include BRAF amplification and overexpression, NF1 homozygous deletion, and consistent IQGAP3 overexpression. Furthermore, all patients' cancers also harbor mutations that would activate the PI3K/AKT pathway. These mutations include PTEN homozygous deletion or down-regulation, consistent INPP4B down-regulation, FBXW7 homozygous deletion, and ERAS overexpression. Moreover, although it has been shown ERBB4 downregulation in breast tumors, an associated unique somatic genomic event that significantly alters the ERBB4 locus and leads to its loss on the RNA level in the majority (5/7) of our patients' tumors was provided herein (Table 10).
The isolation of these metastasized TNBC associated alterations make it possible to prioritize therapeutic targeting in chemotherapy-refractory mTNBC patients. Among the targeted therapies, it was observed that one patient with a high-level BRAF amplification/overexpression along with down-regulation of PTEN and INPP4B, had a major response to combined MEK plus AKT inhibitors on a phase I study (See Example 4).
The primary objective of the clinical trial is to provide information that might suggest a therapeutic option for patients with metastatic (or locally recurrent) triple negative breast cancer (mTNBC), and to provide information relating to Time to Progress (TTP) for patients with molecular profiled prescription versus TTP on their prior therapy. The secondary objective is to conduct a comprehensive evaluation of genetic mutations in TNBC that may accelerate the development of rational and precise therapeutics; and to evaluate patient's best response to molecularly-selected therapy.
A patient eligible for clinical trial inclusion in this study had to meet all of the following criteria: (1) has a metastatic or locally recurrent triple negative breast cancer and is scheduled for medically indicated surgical biopsy or resection of disease; (2) have measurable or evaluable (nonmeasurable) disease based on results generated by RECIST v 1.1 (Eisenhauera, 2009) presented after surgical biopsy/resection. Following surgical resection, if the tumor sample is found to be inadequate for comprehensive molecular analysis, the patient would be deemed ineligible and would be replaced; (3) has received at least 1 prior chemotherapeutic regimen for their metastatic or locally recurrent TNBC prior to initiating the molecularly-selected therapy. There is no limit on the prior therapy for TNBC. Intervening therapies were strongly recommended to be held off if possible from the time of biopsy to the completion of sequencing so as not to change the cancer under the selective pressure of treatment, so that the sequencing results would be reflective of the current cancer; (4) is ≧18 years of age; (5) has an expected survival of at least 6 months, as estimated by the treating oncologist; (6) has planned surgical resection (indicated for the medical care of the patient) that will yield a minimum fresh/frozen tumor sample of 1 cm×1 cm×1 cm that will be available for molecular profiling analysis; (6) is agreeable to having a blood sample (10-20 mL) drawn and analyzed to compare their normal genetic profile to that of their tumor sample; (7) has signed the most recent Patient Informed Consent Form and a Patient Authorization Form.
A patient would be excluded from this study if she meets any of the following criteria: (1) has breast cancer other than metastatic or locally recurrent TNBC; and surgical resection of the recurrent TNBC will render the patient as “no evidence of disease” (NED), and thus to be replaced; (2) has a history of heart disease, other conditions that would prevent treatment with a standard chemotherapeutic agent; (3) has evidence of CNS involvement that is progressing or that requires radiation, resection or steroid therapy; (4) has a serious uncontrolled intercurrent medical or psychiatric illness, including serious infection; (5) is pregnant or nursing; (6) is unable to comply with requirements of study.
Generally, 14 patients with previously treated triple negative breast cancer were included in the clinical study (Table 11). Samples were collected for sequencing with the SOLiD™ system (Applied Biosystems, Carlsbad, Calif.), and the patients were treated according to what is found after CLIA certification of the target.
Every tumor is genomically unique with SNVs, indels, translocations, inversions, amplifications, and deletions. For example, after all the analysis (detailed in previous examples), it was found that TNBC1 comprises 3.7 Million SNPs; 9,766 genomic SNVs; 41 SNVs that are coding missense or nonsense; Small insertions or deletions with 16 in the coding region and 2,727 in the genomic non-coding region; 20 translocations with 4 within genes; and 2,867 large deletions or insertions with 16 crossing gene(s). As a further example, CTNNA1 (catenin cadherin associated alpha 1) homozygous deletions were observed in two African American patients.
Further, TNBC001 whole transcriptome analysis identified significant up-regulation of AURKB, FOXM1, PLK1, AURKA (p<0.0001) (
Among the listed genes in Table 13, TNBC-001 contains a PTEN Exon 6 homozygous deletion of about 1 kb. PTEN has clear evidence of expression in tumor, but with exon 6 deleted. However, PTEN has higher expression in normal breast, and all reads show normal splicing of exons 5, 6, and 7. Exon 6 deletion leads to transcript mutation of bases 493-634, resulting in frameshift mutation in premature truncation. In turn, the protein mutation is G165Ifs173X, standing for PTEN normal reading to amino acid 165, then frameshifts and truncates 8 bases later at amino acid 173. PTEN G165Ifs173X protein truncating mutation leads to complete protein loss (
In TNBC-002, high level BRAF amplification and increased mRNA expression were observed (Table 13). In addition, concomitant PTEN and INPP4B underexpression took place. BRAF was amplified by CGH (comparative genomic hybridization) and showed increased mRNA expression. INPP4A and PTEN were down in tumor at expression level, although there are no significant variants or alterations at CGH level. ERBB4 is down by expression in tumor and presents with a possibly causal nonsense SNV. Two genes that can make part of PI3K reg class IA show discordant expression changes. PIK3CD expression is up in tumor. There is no significant alteration in PIK3CA. FGFR1 is highly upregulated at expression level. FGFR3 does not show much of change. It is unknown what ligand is used by tumor from only tumor data. It also appeared that all EGF/ERBB receptors are down regulated. NRG3 is up at expression and genomic level, but its ligand expression level is down.
Patients, with completed with sequencing and with lesions found in both PI3K/AKT/mTOR and Ras/MEK/ERK pathways, were treated with dual pathway inhibitor combination: Glaxo Smith Klein agents trametinib and GSK2141795. The baseline was measured at the beginning of the treatment, after 2 cycles (close to two months), 75% regression in primary lesion were observed (
In the case of TNBC-009 patient, multiple DNA damage and double-strand break repair defects were observed (Table 14). The patient was randomized on Iniparib (also called BSI 201) plus Gemcitabine and Carboplatin treatment and responded favorably.
A total of 10 mTNBC tumors were completed with whole genome and transcriptome sequencing, and with clinical annotation and outcome. The analysis uncovered previously known and unknown DNA and RNA alterations associated with TNBC. Complicated solid tumors pathways rather than individual genes were shown herein to affect therapeutic strategy and outcome. Provided in the present study, the dual RAS/RAF/MEK/ERK & PI3K/AKT/mTOR activation by multiple mechanisms in different tumors may be common in mTNBC. Thus these two pathways are likely to be highly therapeutically relevant. In addition, multiple mutations in double strand break repair pathway in metastatic tumor was shown to be responsive to Iniparib.
In comparison, the Ion AmpliSeq Cancer Panel, designed by NCI, OHSU and some other leading cancer research institutions, contains 46 genes with 190 amplicons and 739 mutations, and over 100 of the mutations are in BRAF, EGFR and KRAS (Table 15).
Using the Ion AmpliSeq™ Cancer Panel, the expected cancer associated mutations in patients are entirely different (see Table 16). Further, no therapeutic suggestions or decisions were made from what is likely to be found on the current AmpliSeq™ Cancer Panel using these mutations or targets.
In comparison, using the approach of pathway with enriched alterations, the treatments selected (Table 17) and the responses (Table 18) among the patients in the clinical trial are shown. Unique therapeutic treatments for an individual patient can be devised by detecting targetable pathways. As understood, using pathways enriched with gene alterations to make therapeutic decisions may mean frequent needs to get off-label use for drugs or obtain investigational agents.
Comprehensive genomic and transcriptomic interrogation of mTNBCs revealed events supporting co-activation of the MAPK and PI3K/AKT pathways in all the tumors, albeit by different mutational mechanisms and supports potential effectiveness of combination therapy in the treatment of mTNBC. These patients may be treated with combined mek plus akt inhibitors to determine the effectiveness of co-inhibition of these pathways based on this frequent genomic context.
Fresh frozen relapse refractory tumor specimens from clinical biopsy and peripheral blood from 14 women clinically diagnosed with mTNBC were collected. Seven of the patients are African American (AA), and seven are European American (EA). Demographics and clinical treatment history for these patients is provided in Table 19.
1AA = African American, EA = European American,
2Age at diagnosis; CR: Complete Response; PR: Partial Response; SD: Stable Disease; PD: Progressive Disease.
SOLiD version 4 (50 bp) mate pair reads with 1.5 kb inserts were utilized for sequencing of germline and tumor DNA. Genome sequence coverage typically was above 30× for both tumor and germline genomes. Whole transcriptome sequencing (RNA-seq) was performed on high quality total RNA extracted from each tumor specimen, and from fresh hyperplastic breast specimens. On average 43 million (83%) RNA reads uniquely mapping to the genome/transcriptome were generated.
Somatic variants were detected using a pipeline consisting of multiple tools and described in detail in Supplementary Methods. Somatic mutations analyzed included single nucleotide variants (SNVs), indels, translocations, intrachromosomal rearrangements (inversions, etc), and copy number alterations. Two independent differential gene expression analyses were performed. One analysis was performed to determine differential expression between mTNBC tumors and nonmalignant hyperplastic breast samples using EdgeR (Robinson et al, 2010). A second analysis was based on a ‘leave one out’ method to assess differential expression for each tumor when compared against other tumors within our cohort. Integrated genome/transcriptome analyses were performed to assess allele specific expression between tumor DNA and tumor RNA using Bayesian analysis for shifts in the proportion of mutant alleles and reference alleles within RNA given an event at the DNA level.
Finally, genomic results, knowledge mining, pathway tools, and drug space were leveraged to link therapeutically actionable events or concepts with known drugs available through clinical trials to inform therapy for chemoresistant tumors.
Integrated Analysis of mTNBC Genome and Transcriptome Data
To identify somatic events in each tumor, germline/tumor paired genome analyses were performed to uncover point mutations and structural events within tumor genomes. As shown in Table 20 by the number of events, coding mutations were identified within the sample set, which include point mutations and small indels.
1dbSNP129;
2Ti/Tv = Transition/Transversion rate;
3Focal events are amplifications and deletions that are less than 15 Mb and greater than 1.5 Kb, with a log2 fold-change greater than 0.5.
4Genic refers to when one of two breakpoints occurs between promotor and 3′ UTR.
On average 9.3 mutations per megabase in the cohort were detected. Importantly, there were no recurring point mutations that occurred at the same nucleotide residue in multiple tumors. Leveraging RNA-seq data, the expression of somatic SNVs at the transcript level using RNA-seq data were also assessed, by calculating the number of mutations that were present in both DNA and RNA versus those only present in DNA. In this analysis all data points were taken including situations where a gene harbored a somatic SNV, but the gene was essentially not expressed at all at the RNA level. The mean percentage of somatic SNVs that were expressed across our 10 samples was 34% (range 22%-60%).
As has been previously described in basal-like breast tumors, TP53 mutations were most frequent in this mTNBC dataset, observed in 9 of 14 mTNBC tumors. This included four tumors with nonsysnonymous substitutions, three with nonsense mutations, one with an indel, and one tumor with a mutation in a conserved splice consensus sequence. In all cases of TP53 mutation, loss of heterozygosity (LOH) was detected without hemizygous deletion of 17p or the TP53 locus, suggesting uniparental disomy as the likely mechanism for TP53 LOH in mTNBC. Extending to a genome-wide analysis of allele specific expression between the tumor RNA and tumor DNA, TP53 was the only gene with more than two somatic mutations exhibiting significant transcriptional allelic imbalance. Integrating RNA-seq data revealed that the frequency of the mutated allele exhibited transcriptional allelic inbalance, where the mutated allele was expressed above 90% in all but two tumors. Further, this was observed in the face of upwards of >30% normal contaminating stroma in samples used for analyte extraction. This observation is consistent with the absence of expression of wildtype TP53 within neighboring stromal contaminating tissue as has been previously reported, for which the actual mechanism remains unknown. Recent studies on brain tumors show similar monoallelic expression of TP53 due to LOH have prognostic value towards outcome.
Fifteen additional genes harbored somatic nonsynonymous or consensus splice site SNVs and/or small indels in more than one tumor. Among these genes included HERC1 (n=2), LRP1B (n=2), TOP2A (n=2), and CDH5 (n=3). HERC1 acts as an E3 ubiquitin ligase (Q15751), and is known to interact with and destabilize the tumor suppressor TSC2. Mutations within the LRP1B gene (Low-density lipoprotein receptor-related protein 1B, Q9NZR2) were also detected in multiple tumors. LRP1B mutations have been reported to be frequent in non-small cell lung carcinoma. TOP2A (DNA topoisomerase 2-alpha, P11388) controls the topological states of DNA and is altered in cancer. Importantly, mutations in TOP2A have been associated with drug resistance. CDH5 (Cadherin-5, P33151) is a member of the cadherin family of cell adhesion molecules, and this cadherin is believed to play a critical role in endothelial cell biology.
Aside from point mutations, additional small to moderately sized coding deletions (2 bp-5 kb) were also detected. Among genes containing somatic deletions in more than one tumor were RB1 and PTEN, both of which are known tumor suppressor genes (Table 18). Importantly, combined genome and transcriptome data allows for immediate interpretation of the consequence of deletions. One example was an RB1 49 bp deletion in mTNBC1 that includes 47 bases of exon 12, and extends to also delete the two nucleotides (GT) that define the conserved splice acceptor site (
To assess large copy number changes across the genomes of mTNBC tumors, an algorithm that calculates the log 2 ratio was utilized for the normalized read coverage from the germline genome data and normalized read coverage from the tumor genome data based on a 500 bp window. The most common region of broad copy number loss was at 5q, while copy number gains were observed most commonly at 1q, 6p, 8q, and 10p. Focal events were emphasized, described herein as log 2FC copy number gains of ≧1.5 from neutral and less than 15 Mb in length, or homozygous deletions less than 15 Mb in length. Among the focal copy number events that occurred in more than one tumor were unique homozygous deletions in two tumors (mTNBC1, mTNBC6) that involved the adjacent CTNNA1 and SIL1 loci at 5q31.2 (
Integration of RNA-seq differential expression analyses using both comparison against nonmalignant samples and using our outlier analysis showed significant down-regulation of CTNNA1 (Catenin alpha-1; P35221) but not SIL1 (Nucleotide exchange factor SILL Q9H173) in both tumors harboring homozygous deletions. Therefore, these data strongly suggest CTNNA1 as the target tumor suppressor in mTNBC. The two tumors in this study that exhibit CTNNA1 loss were both from African American patients. In a previous study describing somatic alterations by whole genome sequencing of both a primary and metastatic lesion from a single 44 year-old African American patient with refractory TNBC, a homozygous deletion was also detected at the CTNNA1/SIL1 locus. This raises the question as to whether or not homozygous deletions of CTNNA1 are enriched in TNBC tumors with metastatic potential, particularly among African American patients. CTNNA1 forms a complex to anchor E-cadherin to the cytoplasmic cell membrane to maintain normal cell adhesion properties. Aberrations deregulating this complex result in dissociation of cancer cells from tumor foci, and represent a key primer for invasion and metastasis. Furthermore, a neuronal knockout model of alpha E-catenin demonstrated abnormal CNS cell growth with spreading of ventricular zone cells throughout the brain, which formed invasive tumor-like masses similar to those seen in human CNS tumors such as medulloblastoma, neuroblastoma, and retinoblastoma. Therefore taken together, these data suggest an important role for CTNNA1 loss in TNBC with invasive metastatic behavior, and begs the question concerning enrichment of this aberration among African American patients.
Several regions of amplification were identified within this cohort of mTNBC that encompass important oncogenes including WT1/WIT1, IQGAP3, IRS2, MYC, WHSC1L1/FGFR1, MYB, ARAF, and BRAF. A XMb region detected as a copy number amplification at 7q34 in the tumor from mTNBC2 contained the BRAF locus. An increased BRAF expression was also detected from RNA-seq data when comparing against nonmalignant controls (log FC=1.7) or when comparing across tumors using the outlier analysis (log FC=2.2). Upon further inspection of mate-pair genome sequence reads spanning the chromosome 7 breakpoints, it was evident that this putative amplicon was actually part of a more complex rearrangement illustrated in
Our approach has also uncovered additional cis and trans intrachromosomal rearrangements (inversions) and translocations. Notable alterations included two tandem overlapping inversion events within the INPP5F locus in mTNBC3. As both inversions span multiple exons of the INPP5F locus, it is likely that this structural alteration would lead to significant gene disruption in this tumor. Furthermore, copy number analysis shows copy number neutrality, therefore CGH analysis would not detect this rearrangement. INPP5F (Q9Y2H2) encodes inositol polyphosphate 5-phosphatase F, which modulates AKT/GSK3B signaling by decreasing AKT and GSK3B phosphorylation. Therefore, loss of INPP5F through structural events may lead to aberrant AKT signaling. These types of events generally go undetected in genomic studies using array-based or microscopy-based technologies.
A series of distinct somatic alterations at the ERBB4 locus in 3/14 tumors are of importance. These include a 4 kb intronic homozygous deletion (mTNBC1), a somatic point mutation (mTNBC2), and a breakpoint defining a larger 21 Mb rearrangement at 2q34-q37.1 (mTNBC6). Furthermore, significant down-regulation of ERBB4 in all mTNBC in our study were observed, when comparing mTNBA tumors against nonmalignant controls. These data show that ERBB4 is frequently altered in mTNBC. The role of ERBB4 (Receptor tyrosine-protein kinase erbB-4, Q15303) in mammary physiology, including maturation of mammary glands during pregnancy and lactation through StatS activation is well known. ERBB4 mutations have been reported in multiple tumor types including lung carcinoma and melanoma, where mutations are believed to be oncogenic. However, ERBB4 also has been implicated as a tumor suppressor with growth inhibitory functions, and reactivation of epigenetically silenced ERBB4 using 5-aza-2′-deoxycytidine resulted in increased apoptosis in BT20 breast cancer cells. However, to our knowledge, this is the first report of somatic alterations at the ERBB4 locus in TNBC.
Therapeutically Actionable Molecular Concepts in mTNBC
Through the deep molecular profiling study described herein, a series of somatic events within individual tumors were detected that have informed or may have predicted the effectiveness of targeted therapies for some patients. Table 21 provides key information on therapeutically actionable concepts and clinical outcomes in the patient cohort used for this example.
1Focal Amplificiation,
2Focal homozygous Deletion;
mTNBC2—
This was a 59 year old Caucasian woman diagnosed in 2009 with a 16 cm left breast grade 3 triple negative invasive ductal cancer with skin erosion and left axillary adenopathy. Staging studies showed hilar, lung and liver metastases. Family history was unknown as she was adopted; BRCA 1/2 testing was not done. She had a partial response with gemcitabine, carboplatin and iniparib on a phase III trial. Her disease remained unresectable, however, and she developed progressive disease in her breast after 7 months on this therapy. She was then treated with docetaxel/capecitabine without response. Tissue was then harvested from her breast mass for whole genome sequencing. During this time she was treated with paclitaxel plus bevacizumab and had a transient 2-month response. Sequencing revealed a high level BRAF amplification in this tumor, which was validated as being part of complex extrachromosomal double minute by FISH, likely leading to increased signaling of the RAS/RAF/MEK/ERK pathway, which would suggest the use of a RAF or MEK inhibitor. Furthermore, RNA-seq based expression analysis showed overexpression of BRAF and significant underexpression of INPP4B. Thus, a therapeutic strategy targeting both signaling pathways was supported.
In March, 2011, the patient began treatment on a Phase I study of the oral MEK inhibitor GSK1120212 in combination with the oral AKT inhibitor GSK 2141795, based on the findings of a high level BRAF amplicon and decreased levels of INPP4B in her cancer on whole genome sequencing that were validated in a CLIA laboratory. By April, 2011, her breast mass had nearly completely regressed leaving an open wound on her chest wall. She developed toxicities including skin ulceration, diarrhea, anorexia and significant fatigue. After 3 months on therapy the patient had a seizure and was found to have a hemorrhagic brain metastasis and she discontinued the investigational therapies.
mTNBC6—
TNBC-006 is a 32 year old African American woman who presented in June 2009 with a locally advanced, high grade, right side TNBC, and received preoperative TAC (docetaxel, doxorubicin, cyclophosphamide) chemotherapy. At bilateral mastectomy she had 7 cm residual disease in her right breast with extensive lymphphovascular invasion and 2 positive axillary lymph nodes with extranodal extension. She was treated with chest wall and regional radiation therapy along with weekly carboplatin during the radiation. Two paternal aunts and her maternal grandmother and aunt had postmenopausal breast cancer. BRCA1/2 testing was negative for a germline mutation. In September, 2010 her disease recurred diffusely over chest wall including a 4×4 cm protruberant mass on her left chest wall that was excised for whole genome sequencing that was biopsy confirmed as highly proliferative triple negative carcinoma. PET CT scan showed a right pleural effusion, small lung metastases and left hilar adenopathy. She was randomized to receive paclitaxel, bevacizumab and everolimus on a phase II study, had a complete response of 12 months duration.
A focal (212 kb) homozygous deletion at 4q31.2 encompassing the FBXW7 locus was detected. Furthermore, both differential expression methods showed significant under expression of FBXW7 in mTNBC6. FBXW7 is a tumor suppressor that targets mTOR for degradation through direct interaction, and cell lines harboring FBXW7 mutations and deletions were highly sensitive to the mTOR inhibitor rapamycin. A second hit in the PI3K/AKT/mTOR pathway was also detected in this tumor in the form of two independent inversion events encompassing exons, and a nonsynonymous SNV within the INPP5A gene. INPP5A encodes inositol polyphosphate-5-phosphatase F, which decreases AKT and GSK3B phosphorylation through its 5-phosphatase activities on phosphatidylinositol 4,5-bisphosphate (PIP2) and phosphatidylinositol 3,4,5-triphosphate (PIP3). This patient has since progressed to her liver. She has had stable disease on treatment with gemcitabine and carboplatin. The initial significant response to combination paclitaxel, bevacizumab and everolimus was believed to be associated at least in part with targeting of mTOR by everolimus, where mTOR activity was driven by loss of FBXW7 and possibly additional PI3K/AKT/mTOR signaling associated with INPP5A mutation.
mTNBC9—
TNBC-009 was a 38 year old Caucasian woman who presented in May, 2008 with a T4 right breast mass and palpable right axillary adenopathy. Biopsy revealed high grade TNBC with a ki-67 of 78% and PET CT scan showed diffuse adenopathy in right axilla, bilateral supraclavicular fossa and hila, as well as mediastinum. She had a complete response on exam and PET CT with preoperative AC/T (doxorubicin, cyclophosphamide then paclitaxel) chemotherapy, and at bilateral mastectomy had minimal residual disease in her breast and axilla. She was treated with wide-field radiotherapy and 4 cycles of adjuvant cisplatin. Her maternal lineage included her grandfather with gastric cancer, aunt with colon cancer and uncle with melanoma. BRCA1/2 testing did not show a germline mutation. In April, 2010 she underwent resection of a solitary left occipital brain metastasis and declined radiotherapy. Seven months later she developed left posterior cervical lymphadenopathy, which was partially excised for whole genome sequencing, as well as recurrent brain metastases and underwent whole brain radiotherapy. She had a complete response documented on PET CT scan with 4 months of treatment with gemcitabine, carboplatin, iniparib on an expanded access protocol. She stopped treatment for family reasons and two months later developed leptomeningeal metastases.
Sequencing analysis revealed a somatic event in multiple genes encoding proteins involved in DNA repair, double strand break repair, and homologous recombination repair. A class of small molecule inhibitors are in mature clinical development that target DNA repair enzymes including Poly-ADP polymerases PARP-1 and PARP-2 are believed to be active in advanced basal-like triple negative breast tumors with defects in DNA repair genes including those tumor arising in patients with inherited BRCA1 or BRCA2 mutations. It is possible that the effectiveness of the therapeutic combination that included Iniparib may have been due to the convergence of mutations in double strand break repair and homologous recombination repair.
PI3K/AKT/mTOR and RAS/RAF/MEK/ERK Vulnerabilities in mTNBC
Five independent mTNBC (Table 18) described in this report demonstrated several biologically and potentially clinically relevant somatic alterations in genes involved in PI3K/AKT/mTOR and/or RAS/RAF/MEK/ERK signaling pathways (
This study demonstrated benefits of understanding the comprehensive genomic framework of a tumor through whole genome and transcriptome analysis. For example, the single observation of high-level BRAF amplification in mTNBC2 is therapeutically relevant, but as in the context of wildtype BRAF, it is important to note that RAF inhibitors can enhance tumor growth in a RAS dependent manner, in contrast to MEK inhibitors. Thus, as mTNBC2 had gross wildtype BRAF amplification, a PI3K/AKT inhibitor in combination with a MEK inhibitor might be more effective than a combination including a RAF inhibitor, especially in light of additional alterations in the PI3K/AKT/mTOR pathway that were detected in this tumor. This supports the contention that understanding full genomic context may be prudent as the field continues to explore the effectiveness of prospective selection of therapeutic strategies through genomic interrogation.
In summary, NGS technologies and analytical tools were applied to perform a comprehensive genomic survey to elucidate the lexicon of somatic events occurring in an ethnicity matched cohort of 14 mTNBC. This study is incredibly integrative, where base level whole genome and transcriptome analysis were combined, and extremely comprehensive molecular data with deep clinical history and outcomes information were integrated. Through this study the presence of alterations occurring within the genomes and transcriptomes of these tumors were validated and confirmed including common TP53 mutation, copy number changes (i.e. CTNNA1 homozygous deletion), and enrichment of gene expression ontologies associated with cell cycle control and mitosis. Additional genes that may play a role in mTNBC were also discovered. There was equal representation of both African American and Europea American patients, and at least one event (CTNNA1 homozygous deletion) may be enriched among women of recent west African descent. Furthermore the study revealed genomic concepts that may have significant downstream therapeutic implications such as dual activation of RAS/RAF/MEK/ERK and PI3K/AKT/mTOR signaling pathways in mTNBC, suggesting that combination trials using inhibitors of both signaling nodes should be initiated specifically for patients diagnosed with mTNBC. These results add tremendously to our nascent understanding of how best to manage and treat these generally irremedial tumors.
This application is related to and claims the priority benefit of U.S. provisional application 61/438,959, filed on Feb. 2, 2011,U.S. provisional application 61/444,644, filed on Feb. 18, 2011, U.S. provisional application 61/472,004, filed on Apr. 5, 2011, U.S. provisional application 61/490,457, filed on May 26, 2011, U.S. provisional application 61/499,123, filed on Jun. 20, 2011, and U.S. provisional application 61/551,201 filed on Oct. 25, 2011, the teachings and content of which are incorporated by reference herein.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US12/23695 | 2/2/2012 | WO | 00 | 10/10/2013 |
Number | Date | Country | |
---|---|---|---|
61438958 | Feb 2011 | US | |
61444644 | Feb 2011 | US | |
61472004 | Apr 2011 | US | |
61490457 | May 2011 | US | |
61499123 | Jun 2011 | US | |
61551201 | Oct 2011 | US |