The invention relates to methods and systems of treating and prognosising bladder cancer.
Bladder cancer treatments vary significantly depending upon whether the cancer is or will become malignant. Subjects who have non-invasive cancer are best treated by conservative methods, while the more aggressive approach of cystectomy (removal of the bladder) and/or systemic chemotherapy/radiation is necessary for subjects with invasive cancers. About 80% of the new diagnoses of bladder cancer involve subjects with non invasive cancer, many of which will eventually experience a transtition to invasive disease and potentially metastatic cancer. With current technologies available in the clinic, it is not possible to accurately predict which patients will develop invasive or metastatic cancer, and thus numerous subjects do not receive the optimal treatment for their particular condition.
As described herein, the inventors have identified gene signatures which permit the identification of patients who will benefit from (e.g. have optimal outcomes) cystectomy and/or chemotherapyas treatment for bladder cancer. Accordingly, provided herein are methods of treatment and assays relating to bladder cancer and the administration of cystectomies.
In one aspect, described herein is a method of treatment comprising, detecting, in a sample obtained from a subject in need of treatment for bladder cancer, the level of expression products of at least one marker gene selected from Table 1 or Table 2; administering cystectomy or chemotherapy to the subject if the level of expression products selected from Table 1 are increased relative to a reference level or the level of expression products selected from Table 2 are decreased relative to a reference level; and not administering a cystectomy or other invasive treatment to the subject if the level of expression products selected from Table 1 are not increased relative to a reference level or the level of expression products selected from Table 2 are not decreased relative to a reference level. In one aspect, described herein is a method of treatment comprising, administering cystectomy or chemotherapy to a subject determined to have a level of expression products selected from Table 1 increased relative to a reference level or a level of expression products selected from Table 2 decreased relative to a reference level; and not administering a cystectomy or other invasive treatment to a subject determined to have a level of expression products selected from Table 1 not increased relative to a reference level or a level of expression products selected from Table 2 not decreased relative to a reference level.
In one aspect, described herein is an assay comprising, detecting, in a sample obtained from a subject in need of treatment for bladder cancer, the level of expression products of at least one marker gene selected from Table 1 or Table 2; wherein the subject is likely to benefit from cystectomy or chemotherapy if the level of expression products selected from Table 1 is increased relative to a reference level or the level of expression products selected from Table 2 is decreased relative to a reference level; and wherein the subject is not likely to benefit from a cystectomy or other invasive treatment if the level of expression products selected from Table 1 is not increased relative to a reference level or the level of expression products selected from Table 2 is not decreased relative to a reference level.
In one aspect, described herein is a method of determing if a subject is likely to benefit from cystectomy, the method comprising, detecting, in a sample obtained from a subject in need of treatment for bladder cancer, the level of expression products of at least one marker gene selected from Table 1 or Table 2; wherein the subject is likely to benefit from cystectomy or chemotherapy if the level of expression products selected from Table 1 is increased relative to a reference level or the level of expression products selected from Table 2 is decreased relative to a reference level; and wherein the subject is not likely to benefit from a cystectomy or other invasive treatment if the level of expression products selected from Table 1 is not increased relative to a reference level or the level of expression products selected from Table 2 is not decreased relative to a reference level.
In some embodiments, the one or more marker genes is selected from the genes of Table 3. In some embodiments, the one or more marker genes is selected from the group consisting of IGFBP5; CD44; CCND1; VEGF; TRAF4; RAB31; MDK; SNX14; ANXA1; CSPG2; CASP8; BIRC2; PAK1; PLA2G2F; PICK1; GATA2; and ABCA5. In some embodiments, the level of the expression products of IGFBP5; CD44; CCND1; VEGF; TRAF4; RAB31; MDK; SNX14; ANXA1; CSPG2; CASP8; BIRC2; PAK1; PLA2G2F; PICK1; GATA2; and ABCA5 are determined In some embodiments, the expression products are mRNA expression products. In some embodiments, the expression products are polypeptide expression products. In some embodiments, the subject has TA or T1 bladder cancer. In some embodiments, the sample is a tumor cell sample. In some embodiments, the sample is a urine sample. In some embodiments, the subject is a human.
As described herein, the inventors have discovered that a number of genes are differentially regulated in tumors of patients with progressive bladder cancer who will benefit from cystectomy and/or chemotherapyas compared to those subjects with non-progressive bladder cancer who will not benefit from cytoreductive surgery. Accordingly, there are provided herein methods and assays relating to the prognosis, risk assessment, and treatment of subjects having bladder cancer, particularly as relates to subjects receiving cystectomy and/or chemotherapyas part of their treatment regimen for bladder cancer.
“Bladder cancer” refers to cancers arising in, or involving, the bladder, e.g. in the epithelium of the bladder. As used herein, the term “cancer”or “tumor” refers to an uncontrolled growth of cells which interferes with the normal functioning of the bodily organs and systems. A subject who has a cancer or a tumor is a subject having objectively measurable cancer cells present in the subject's body. Included in this definition are benign and malignant cancers, as well as dormant tumors or micrometastases. Cancers which migrate from their original location and seed vital organs can eventually lead to the death of the subject through the functional deterioration of the affected organs.
In some embodiments, the methods described herein relate to treating a subject having or diagnosed as having bladder cancer. Subjects having bladder cancer can be identified by a physician using current methods of diagnosing bladder cancer. Symptoms and/or complications of bladder cancer which characterize these conditions and aid in diagnosis are well known in the art and include but are not limited to hematuria, and painful and/or frequent urination. Tests that may aid in a diagnosis of, e.g. bladder cancer include, but are not limited to, cystoscopy, NMP22, human complement factor H-related protein, carcinoembryonic antigen, FGFR3, CERTNDX™, UROVYSION™, CXBLADDER™, HEXVIX™/CYSVIEW™, and transurethral resection tests. A family history of cancer or exposure to risk factors for bladder cancer (e.g. smoking) can also aid in determining if a subject is likely to have bladder cancer or in making a diagnosis of bladder cancer.
The standard treatment for bladder cancer varies significantly depending upon the stage and/or severity of the bladder cancer. Subjects with TA or T1 cancer are typically treated by local surgical removal of the diseased tissue and/or partial resection. Subjects with T2 (or greater, e.g. T3 or T4) cancer are typically treated by cystectomy, i.e. removal of the bladder, and optionally additional internal organs. Subjects with T2 (or greater) cancer can also be treated with chemotherapy and/or radiation, and option not typically persued for TA or T1 cancer patients. Of subjects diagnosed with TA and/or T1 cancer, some have cancers that will progress to T2 and other have cancers that will not progress to T2 (or greater, e.g. T3 or T4). Clinical outcomes can be greatly improved by identifying which subjects have progressive and/or potentially progressive cancers and treating them with a cystectomy or chemotherapy before the cancer actually progresses to the T2 stage (or greater, e.g. T3 or T4). In some embodiments, “benefiting from cystectomy or chemotherapy” can refer to a subject who will have an optimal outcome from cystectomy or chemotherapy, e.g. particularly an early treatment before the cancer reaches a T2 stage (or greater, e.g. T3 or T4).
Conversely, if a subject has a TA or T1 bladder cancer that will not progress to a T2 (or greater) bladder cancer, then invasive treatments are unnecessary for that patient. Accordingly, provided herein are methods relating to diagnosing, prognosing, and treating subjects with bladder cancer, comprising not administering an invasive treatment to a subject determined to have a non-progressive cancer according to the methods and assays described below herein. An invasive treatment for bladder cancer can include cystectomy, chemotherapy (including radiation therapy), radical TUR, or segmented partial resection. Removal of tumors, standard TUR, and/or partial resection are not considered invasive treatmens as defined herein.
As described herein, the inventors have identified certain genes which are differentially regulated, to a statistically significant degree, as compared to a reference level, in subjects who have progressive TA or T1 bladder cancer (i.e. bladder cancer that will, absent cystectomy, progress to a T2 cancer). These subjects will benefit from cystectomy or chemotherapy, particularly early treatment before the cancer progresses to T2. The identified genes are sometimes referred to herein as marker genes to indicate their relation to being a marker for whether cystectomy and/or chemotherapy will be efficacious. Accordingly, some embodiments of the invention are generally related to assays, methods and systems for assessing the likely response of a subject to cystectomy and/or chemotherapy and/or radiation and/or treating a subject for bladder cancer, e.g. by determining if a subject will benefit from cystectomy and/or chemotherapy and/or radiation and administering the treatment if it is likely to be efficacious. In certain embodiments, the assays and methods are directed to determination and/or measurement of the expression level of a gene product (e.g. protein and/or gene transcript such as mRNA) in a biological sample of a subject. In certain embodiments the assays and methods are directed to determination of the expression level of a gene product of at least two genes in a biological sample of a subject, i.e. at least two genes, at least three genes, at least four genes , at least five genes, at least six genes, at least seven genes, at least eight genes, at least nine genes, at least 10 genes . . . at least 15 genes, . . . at least 25 genes, . . . at least 30 genes, or more genes, or any number of genes selected from Table 1 and/or Table 2 as described herein. In some embodiments, the marker gene(s) is selected from the group listed in Table 3. In some embodiments, the marker gene(s) is selected from the group consisting of IGFBP5; CD44; CCND1; VEGF; TRAF4; RAB31; MDK; SNX14; ANXA1; CSPG2; CASP8; BIRC2; PAK1; PLA2G2F; PICK1; GATA2; and ABCA5. In some embodiments, the assays, methods, and systems described herein are directed to determination of the expression level of a gene product of at least two genes in a biological sample of a subject, e.g. at least two genes, or at least three genes, or at least four genes, or, e.g. all of the following genes: IGFBP5; CD44; CCND1; VEGF; TRAF4; RAB31; MDK; SNX14; ANXA1; CSPG2; CASP8; BIRC2; PAK1; PLA2G2F; PICK1; GATA2; and ABCA5.
The gene names listed in Tables 1, 2 and 3 are common names. NCBI Gene ID numbers for each of the genes listed in Tables 1, 2 and 3 can be obtained by searching the “Gene” Database of the NCBI (available on the World Wide Web at http://www.ncbi.nlm nih.gov/) using the common name as the query and selecting the first returned Homo sapiens gene.
In some embodiments, the methods and assays described herein include (a) transforming the gene expression product into a detectable gene target; (b) measuring the amount of the detectable gene target; and (c) comparing the amount of the detectable gene target to an amount of a reference, wherein if the amount of the detectable gene target is statistically significantly different than the amount of the reference level, the subject is identified as likely to benefit from and/or is administered cystectomy. In some embodiments, if the amount of the detectable gene target is not statistically significantly different than the amount of the reference level, the subject is identified as unlikely to benefit from and/or is not administered cystectomy.
In some embodiments, the reference can be a level of expression of the marker gene product in a population of subjects who have been demonstrated to not benefit from cystectomy. In some embodiments, the reference can be a level of expression of the marker gene product in a population of subjects who have been demonstrated to not be in need of cystectomy. In some embodiments, the reference can be a level of expression of the marker gene product in a population of subjects who have been demonstrated to have non-progressive bladder cancer, e.g. bladder cancer that does not progress from TA and/or T1 to T2. In some embodiments, the reference can also be a level of expression of the marker gene product in a control sample, a pooled sample of control individuals or a numeric value or range of values based on the same.
In certain embodiments, the marker gene(s) are selected from the genes listed in Table 1 and/or Table 2. In certain embodiments, one or more marker genes are selected from the group the genes listed in Table 3. In certain embodiments, one or more marker genes are selected from the group consisting of IGFBP5; CD44; CCND1; VEGF; TRAF4; RAB31; MDK; SNX14; ANXA1; CSPG2; CASP8; BIRC2; PAK1; PLA2G2F; PICK1; GATA2; and ABCA5.
In certain embodiments, the marker gene is one of IGFBP5; CD44; CCND1; VEGF; TRAF4; RAB31; MDK; SNX14; ANXA1; CSPG2; CASP8; BIRC2; PAK1; PLA2G2F; PICK1; GATA2; and ABCA5. In certain embodiments, the marker genes include at least two of IGFBP5; CD44; CCND1; VEGF; TRAF4; RAB31; MDK; SNX14; ANXA1; CSPG2; CASP8; BIRC2; PAK1; PLA2G2F; PICK1; GATA2; and ABCA5. In certain embodiments, the marker genes include at least three of IGFBP5; CD44; CCND1; VEGF; TRAF4; RAB31; MDK; SNX14; ANXA1; CSPG2; CASP8; BIRC2; PAK1; PLA2G2F; PICK1; GATA2; and ABCA5. In certain embodiments, the marker genes include at least four of IGFBP5; CD44; CCND1; VEGF; TRAF4; RAB31; MDK; SNX14; ANXA1; CSPG2; CASP8; BIRC2; PAK1; PLA2G2F; PICK1; GATA2; and ABCA5. In certain embodiments, the marker genes include at least five of IGFBP5; CD44; CCND1; VEGF; TRAF4; RAB31; MDK; SNX14; ANXA1; CSPG2; CASP8; BIRC2; PAK1; PLA2G2F; PICK1; GATA2; and ABCA5. In certain embodiments, the marker genes include at least six of IGFBP5; CD44; CCND1; VEGF; TRAF4; RAB31; MDK; SNX14; ANXA1; CSPG2; CASP8; BIRC2; PAK1; PLA2G2F; PICK1; GATA2; and ABCA5. In certain embodiments, the marker genes include at least seven of IGFBP5; CD44; CCND1; VEGF; TRAF4; RAB31; MDK; SNX14; ANXA1; CSPG2; CASP8; BIRC2; PAK1; PLA2G2F; PICK1; GATA2; and ABCA5. In certain embodiments, the marker genes include at least eight of IGFBP5; CD44; CCND1; VEGF; TRAF4; RAB31; MDK; SNX14; ANXA1; CSPG2; CASP8; BIRC2; PAK1; PLA2G2F; PICK1; GATA2; and ABCA5. In certain embodiments, the marker genes include at least nine of IGFBP5; CD44; CCND1; VEGF; TRAF4; RAB31; MDK; SNX14; ANXA1; CSPG2; CASP8; BIRC2; PAK1; PLA2G2F; PICK1; GATA2; and ABCA5. In certain embodiments, the marker genes include at least ten of IGFBP5; CD44; CCND1; VEGF; TRAF4; RAB31; MDK; SNX14; ANXA1; CSPG2; CASP8; BIRC2; PAK1; PLA2G2F; PICK1; GATA2; and ABCA5. In certain embodiments, the marker genes include at least eleven of IGFBP5; CD44; CCND1; VEGF; TRAF4; RAB31; MDK; SNX14; ANXA1; CSPG2; CASP8; BIRC2; PAK1; PLA2G2F; PICK1; GATA2; and ABCA5. In certain embodiments, the marker genes include at least twelve of IGFBP5; CD44; CCND1; VEGF; TRAF4; RAB31; MDK; SNX14; ANXA1; CSPG2; CASP8; BIRC2; PAK1; PLA2G2F; PICK1; GATA2; and ABCA5. In certain embodiments, the marker genes include at least thirteen of IGFBP5; CD44; CCND1; VEGF; TRAF4; RAB31; MDK; SNX14; ANXA1; CSPG2; CASP8; BIRC2; PAK1; PLA2G2F; PICK1; GATA2; and ABCA5. In certain embodiments, the marker genes include at least fourteen of IGFBP5; CD44; CCND1; VEGF; TRAF4; RAB31; MDK; SNX14; ANXA1; CSPG2; CASP8; BIRC2; PAK1; PLA2G2F; PICK1; GATA2; and ABCA5. In certain embodiments, the marker genes include at least fifteen of IGFBP5; CD44; CCND1; VEGF; TRAF4; RAB31; MDK; SNX14; ANXA1; CSPG2; CASP8; BIRC2; PAK1; PLA2G2F; PICK1; GATA2; and ABCA5. In certain embodiments, the marker genes include at least sixteen of IGFBP5; CD44; CCND1; VEGF; TRAF4; RAB31; MDK; SNX14; ANXA1; CSPG2; CASP8; BIRC2; PAK1; PLA2G2F; PICK1; GATA2; and ABCA5. In certain embodiments, the marker genes include IGFBP5; CD44; CCND1; VEGF; TRAF4; RAB31; MDK; SNX14; ANXA1; CSPG2; CASP8; BIRC2; PAK1; PLA2G2F; PICK1; GATA2; and ABCA5.
In certain embodiments, the marker gene(s) are selected from the group consisting of IFGBP5; LSP1; STIM1; APOL4; CCPG1; ANTXR2; C10orf76; ABCA5; TBC1D4; OPTN; CYP4Z2P; VEGF; MDK; CYP4B1; AGPS; NLRP1; NFIA; CDC42BPG; SH3D19; KALRN; ITGA2; PLA2G2F; ALDH16A1; LTBP1; DHRS7; 5S_rRNA; CASP8; and PDLIM5. In certain embodiments, the marker genes include at least two of IFGBP5; LSP1; STIM1; APOL4; CCPG1; ANTXR2; C10orf76; ABCA5; TBC1D4; OPTN; CYP4Z2P; VEGF; MDK; CYP4B1; AGPS; NLRP1; NFIA; CDC42BPG; SH3D19; KALRN; ITGA2; PLA2G2F; ALDH16A1; LTBP1; DHRS7; 5S_rRNA; CASP8; and PDLIM5. In certain embodiments, the marker genes include at least three of IFGBP5; LSP1; STIM1; APOL4; CCPG1; ANTXR2; C10orf76; ABCA5; TBC1D4; OPTN; CYP4Z2P; VEGF; MDK; CYP4B1; AGPS; NLRP1; NFIA; CDC42BPG; SH3D19; KALRN; ITGA2; PLA2G2F; ALDH16A1; LTBP1; DHRS7; 5S_rRNA; CASP8; and PDLIM5. In certain embodiments, the marker genes include at least four of IFGBP5; LSP1; STIM1; APOL4; CCPG1; ANTXR2; C10orf76; ABCA5; TBC1D4; OPTN; CYP4Z2P; VEGF; MDK; CYP4B1; AGPS; NLRP1; NFIA; CDC42BPG; SH3D19; KALRN; ITGA2; PLA2G2F; ALDH16A1; LTBP1; DHRS7; 5S_rRNA; CASP8; and PDLIM5. In certain embodiments, the marker genes include at least five of IFGBP5; LSP1; STIM1; APOL4; CCPG1; ANTXR2; C10orf76; ABCA5; TBC1D4; OPTN; CYP4Z2P; VEGF; MDK; CYP4B1; AGPS; NLRP1; NFIA; CDC42BPG; SH3D19; KALRN; ITGA2; PLA2G2F; ALDH16A1; LTBP1; DHRS7; 5S_rRNA; CASP8; and PDLIM5. In certain embodiments, the marker genes include at least six of IFGBP5; LSP1; STIM1; APOL4; CCPG1; ANTXR2; C10orf76; ABCA5; TBC1D4; OPTN; CYP4Z2P; VEGF; MDK; CYP4B1; AGPS; NLRP1; NFIA; CDC42BPG; SH3D19; KALRN; ITGA2; PLA2G2F; ALDH16A1; LTBP1; DHRS7; 5S_rRNA; CASP8; and PDLIM5. In certain embodiments, the marker genes include at least seven of IFGBP5; LSP1; STIM1; APOL4; CCPG1; ANTXR2; C10orf76; ABCA5; TBC1D4; OPTN; CYP4Z2P; VEGF; MDK; CYP4B1; AGPS; NLRP1; NFIA; CDC42BPG; SH3D19; KALRN; ITGA2; PLA2G2F; ALDH16A1; LTBP1; DHRS7; 5S_rRNA; CASP8; and PDLIM5. In certain embodiments, the marker genes include at least eight of IFGBP5; LSP1; STIM1; APOL4; CCPG1; ANTXR2; C10orf76; ABCA5; TBC1D4; OPTN; CYP4Z2P; VEGF; MDK; CYP4B1; AGPS; NLRP1; NFIA; CDC42BPG; SH3D19; KALRN; ITGA2; PLA2G2F; ALDH16A1; LTBP1; DHRS7; 5S_rRNA; CASP8; and PDLIM5. In certain embodiments, the marker genes include at least nine of IFGBP5; LSP1; STIM1; APOL4; CCPG1; ANTXR2; C10orf76; ABCA5; TBC1D4; OPTN; CYP4Z2P; VEGF; MDK; CYP4B1; AGPS; NLRP1; NFIA; CDC42BPG; SH3D19; KALRN; ITGA2; PLA2G2F; ALDH16A1; LTBP1; DHRS7; 5S_rRNA; CASP8; and PDLIM5. In certain embodiments, the marker genes include at least ten of IFGBP5; LSP1; STIM1; APOL4; CCPG1; ANTXR2; C10orf76; ABCA5; TBC1D4; OPTN; CYP4Z2P; VEGF; MDK; CYP4B1; AGPS; NLRP1; NFIA; CDC42BPG; SH3D19; KALRN; ITGA2; PLA2G2F; ALDH16A1; LTBP1; DHRS7; 5SrRNA; CASP8; and PDLIM5.
In subjects who are likely to benefit from cystectomy, the marker genes listed in Table 1 can be upregulated and those in Table 2 can be downregulated, e.g. for marker genes listed in Table 1, if the measured marker gene expression in a subject is higher as compared to a reference level of that marker gene's expression, then the subject is identified as likely to benefit from cystectomy. Likewise, for marker genes listed in Table 2, if the measured marker gene expression in a subject is lower as compared to a reference level of that marker gene's expression, then the subject is identified as likely to benefit from cystectomy. Preferably, once looks at a statistically significant change. However, even if a few genes in a group do not differ from normal, a subject can be identified as likely to benefit from cystectomy and/or chemotherapy if the overall change of the group shows a significant change, preferably a statistically significant change.
The level of a gene expression product of a marker gene in Table 1 which is higher than a reference level of that marker gene by at least about 10% than the reference amount, at least about 20%, at least about 30%, at least about 40%, at least about 50%, at least about 80%, at least about 100%, at least about 200%, at least about 300%, at least about 500% or at least about 1000% or more, is indicative that the subject is likely to benefit from cystectomy and/or chemotherapy and/or that the subject should be administered cystectomy and/or chemotherapy in accordance with the methods described herein.
The level of a gene expression product of a marker gene in Table 2 which is lower than a reference level of that marker gene by at least about 10% than the reference amount, at least about 20%, at least about 30%, at least about 40%, at least about 50%, at least about 80%, at least about 90% or more, is indicative that the subject is likely to benefit from cystectomy and/or chemotherapy and/or that the subject should be administered cystectomy and/or chemotherapy in accordance with the methods described herein.
By way of non-limiting example, Table 6 depicts non-limiting potential combinations of two marker genes that can be used in the methods and assays described herein. All possible combinations of 2 or more of the indicated markers are contemplated herein.
In certain embodiments, the marker gene(s) are selected from the genes listed in Table 10. In certain embodiments, one or more marker genes are selected from the group consisting of IGFBP5; CD44; VEGF; TRAF4; RAB31; SNX14; CASP8; PLA2G2F; PICK1; and ABCA5. In certain embodiments, the marker gene is one of IGFBP5; CD44; VEGF; TRAF4; RAB31; SNX14; CASP8; PLA2G2F; PICK1; and ABCA5. In certain embodiments,the marker genes comprise at least one of IGFBP5; CD44; VEGF; TRAF4; RAB31; SNX14; CASP8; PLA2G2F; PICK1; and ABCA5. In certain embodiments, the marker genes comprise at least two of IGFBP5; CD44; VEGF; TRAF4; RAB31; SNX14; CASP8; PLA2G2F; PICK1; and ABCA5. In certain embodiments, the marker genes comprise at least three of IGFBP5; CD44; VEGF; TRAF4; RAB31; SNX14; CASP8; PLA2G2F; PICK1; and ABCA5. In certain embodiments, the marker genes comprise at least four of IGFBP5; CD44; VEGF; TRAF4; RAB31; SNX14; CASP8; PLA2G2F; PICK1; and ABCA5. In certain embodiments, the marker genes comprise at least five of IGFBP5; CD44; VEGF; TRAF4; RAB31; SNX14; CASP8; PLA2G2F; PICK1; and ABCA5. In certain embodiments, the marker genes comprise at least six of IGFBP5; CD44; VEGF; TRAF4; RAB31; SNX14; CASP8; PLA2G2F; PICK1; and ABCA5. In certain embodiments, the marker genes comprise at least seven of IGFBP5; CD44; VEGF; TRAF4; RAB31; SNX14; CASP8; PLA2G2F; PICK1; and ABCA5. In certain embodiments, the marker genes comprise at least eight of IGFBP5; CD44; VEGF; TRAF4; RAB31; SNX14; CASP8; PLA2G2F; PICK1; and ABCA5. In certain embodiments, the marker genes comprise at least nine of IGFBP5; CD44; VEGF; TRAF4; RAB31; SNX14; CASP8; PLA2G2F; PICK1; and ABCA5. In certain embodiments, the marker genes comprise at least IGFBP5; CD44; VEGF; TRAF4; RAB31; SNX14; CASP8; PLA2G2F; PICK1; and ABCA5.
As used herein, the term “transforming” or “transformation” refers to changing an object or a substance, e.g., biological sample, nucleic acid or protein, into another substance. The transformation can be physical, biological or chemical. Exemplary physical transformation includes, but not limited to, pre-treatment of a biological sample, e.g., from whole blood to blood serum by differential centrifugation. A biological/chemical transformation can involve at least one enzyme and/or a chemical reagent in a reaction. For example, a DNA sample can be digested into fragments by one or more restriction enzyme, or an exogenous molecule can be attached to a fragmented DNA sample with a ligase. In some embodiments, a DNA sample can undergo enzymatic replication, e.g., by polymerase chain reaction (PCR).
Methods to measure gene expression products associated with the marker genes described herein are well known to a skilled artisan. Such methods to measure gene expression products, e.g., protein level, include ELISA (enzyme linked immunosorbent assay), western blot, and immunoprecipitation, immunofluorescence using detection reagents such as an antibody or protein binding agents. Alternatively, a peptide can be detected in a subject by introducing into a subject a labeled anti-peptide antibody and other types of detection agent. For example, the antibody can be labeled with a radioactive marker whose presence and location in the subject is detected by standard imaging techniques.
For example, antibodies for the polypeptide expression products of the marker genes described herein are commercially available and can be used for the purposes of the invention to measure protein expression levels, e.g. anti-IGFBP5 (Cat. No. 4255; Abeam; Cambridge, MA). Alternatively, since the amino acid sequences for the marker genes described herein are known and publically available at NCBI website, one of skill in the art can raise their own antibodies against these proteins of interest for the purpose of the invention. The amino acid sequences of the marker genes described herein have been assigned NCBI accession numbers for different species such as human, mouse and rat.
In some embodiments, immunohistochemistry (“IHC”) and immunocytochemistry (“ICC”) techniques can be used. IHC is the application of immunochemistry to tissue sections, whereas ICC is the application of immunochemistry to cells or tissue imprints after they have undergone specific cytological preparations such as, for example, liquid-based preparations Immunochemistry is a family of techniques based on the use of an antibody, wherein the antibodies are used to specifically target molecules inside or on the surface of cells. The antibody typically contains a marker that will undergo a biochemical reaction, and thereby experience a change color, upon encountering the targeted molecules. In some instances, signal amplification can be integrated into the particular protocol, wherein a secondary antibody, that includes the marker stain or marker signal, follows the application of a primary specific antibody.
In some embodiments, the assay can be a Western blot analysis. Alternatively, proteins can be separated by two-dimensional gel electrophoresis systems. Two-dimensional gel electrophoresis is well known in the art and typically involves iso-electric focusing along a first dimension followed by SDS-PAGE electrophoresis along a second dimension. These methods also require a considerable amount of cellular material. The analysis of 2D SDS-PAGE gels can be performed by determining the intensity of protein spots on the gel, or can be performed using immune detection. In other embodiments, protein samples are analyzed by mass spectroscopy.
Immunological tests can be used with the methods and assays described herein and include, for example, competitive and non-competitive assay systems using techniques such as Western blots, radioimmunoassay (RIA), ELISA (enzyme linked immunosorbent assay), “sandwich” immunoassays, immunoprecipitation assays, immunodiffusion assays, agglutination assays, e.g. latex agglutination, complement-fixation assays, immunoradiometric assays, fluorescent immunoassays, e.g. FIA (fluorescence-linked immunoassay), chemiluminescence immunoassays (CLIA), electrochemiluminescence immunoassay (ECLIA, counting immunoassay (CIA), lateral flow tests or immunoassay (LFIA), magnetic immunoassay (MIA), and protein A immunoassays. Methods for performing such assays are known in the art, provided an appropriate antibody reagent is available. In some embodiment, the immunoassay can be a quantitative or a semi-quantitative immunoassay.
An immunoassay is a biochemical test that measures the concentration of a substance in a biological sample, typically a fluid sample such as serum, using the interaction of an antibody or antibodies to its antigen. The assay takes advantage of the highly specific binding of an antibody with its antigen. For the methods and assays described herein, specific binding of the target polypeptides with respective proteins or protein fragments, or an isolated peptide, or a fusion protein described herein occurs in the immunoassay to form a target protein/peptide complex. The complex is then detected by a variety of methods known in the art. An immunoassay also often involves the use of a detection antibody.
Enzyme-linked immunosorbent assay, also called ELISA, enzyme immunoassay or EIA, is a biochemical technique used mainly in immunology to detect the presence of an antibody or an antigen in a sample. The ELISA has been used as a diagnostic tool in medicine and plant pathology, as well as a quality control check in various industries.
In one embodiment, an ELISA involving at least one antibody with specificity for the particular desired antigen (i.e. a marker gene polypeptide as described herein) can also be performed. A known amount of sample and/or antigen is immobilized on a solid support (usually a polystyrene micro titer plate). Immobilization can be either non-specific (e.g., by adsorption to the surface) or specific (e.g. where another antibody immobilized on the surface is used to capture antigen or a primary antibody). After the antigen is immobilized, the detection antibody is added, forming a complex with the antigen. The detection antibody can be covalently linked to an enzyme, or can itself be detected by a secondary antibody which is linked to an enzyme through bio-conjugation. Between each step the plate is typically washed with a mild detergent solution to remove any proteins or antibodies that are not specifically bound. After the final wash step the plate is developed by adding an enzymatic substrate to produce a visible signal, which indicates the quantity of antigen in the sample. Older ELISAs utilize chromogenic substrates, though newer assays employ fluorogenic substrates with much higher sensitivity.
In another embodiment, a competitive ELISA is used. Purified antibodies that are directed against a target polypeptide or fragment thereof are coated on the solid phase of multi-well plate, i.e., conjugated to a solid surface. A second batch of purified antibodies that are not conjugated on any solid support is also needed. These non-conjugated purified antibodies are labeled for detection purposes, for example, labeled with horseradish peroxidase to produce a detectable signal. A sample (e.g., tumor, blood, serum or urine) from a subject is mixed with a known amount of desired antigen (e.g., a known volume or concentration of a sample comprising a target polypeptide) together with the horseradish peroxidase labeled antibodies and the mixture is then are added to coated wells to form competitive combination. After incubation, if the polypeptide level is high in the sample, a complex of labeled antibody reagent-antigen will form. This complex is free in solution and can be washed away. Washing the wells will remove the complex. Then the wells are incubated with TMB (3,3′,5, 5′-tetramethylbenzidene) color development substrate for localization of horseradish peroxidase-conjugated antibodies in the wells. There will be no color change or little color change if the target polypeptide level is high in the sample. If there is little or no target polypeptide present in the sample, a different complex in formed, the complex of solid support bound antibody reagents-target polypeptide. This complex is immobilized on the plate and is not washed away in the wash step. Subsequent incubation with TMB will produce much color change. Such a competitive ELSA test is specific, sensitive, reproducible and easy to operate.
There are other different forms of ELISA, which are well known to those skilled in the art. The standard techniques known in the art for ELISA are described in “Methods in Immunodiagnosis”, 2nd Edition, Rose and Bigazzi, eds. John Wiley & Sons, 1980; and Oellerich, M. 1984, J. Clin. Chem. Clin. Biochem. 22:895-904. These references are hereby incorporated by reference in their entirety.
In one embodiment, the levels of a polypeptide in a sample can be detected by a lateral flow immunoassay test (LFIA), also known as the immunochromatographic assay, or strip test. LFIAs are a simple device intended to detect the presence (or absence) of antigen, e.g. a polypeptide, in a fluid sample. There are currently many LFIA tests are used for medical diagnostics either for home testing, point of care testing, or laboratory use. LFIA tests are a form of immunoassay in which the test sample flows along a solid substrate via capillary action. After the sample is applied to the test strip it encounters a colored reagent (generally comprising antibody specific for the test target antigen) bound to microparticles which mixes with the sample and transits the substrate encountering lines or zones which have been pretreated with another antibody or antigen. Depending upon the level of target polypeptides present in the sample the colored reagent can be captured and become bound at the test line or zone. LFIAs are essentially immunoassays adapted to operate along a single axis to suit the test strip format or a dipstick format. Strip tests are extremely versatile and can be easily modified by one skilled in the art for detecting an enormous range of antigens from fluid samples such as urine, blood, water, and/or homogenized tumor samples etc. Strip tests are also known as dip stick test, the name bearing from the literal action of “dipping” the test strip into a fluid sample to be tested. LFIA strip tests are easy to use, require minimum training and can easily be included as components of point-of-care test (POCT) diagnostics to be use on site in the field. LFIA tests can be operated as either competitive or sandwich assays. Sandwich LFIAs are similar to sandwich ELISA. The sample first encounters colored particles which are labeled with antibodies raised to the target antigen. The test line will also contain antibodies to the same target, although it may bind to a different epitope on the antigen. The test line will show as a colored band in positive samples. In some embodiments, the lateral flow immunoassay can be a double antibody sandwich assay, a competitive assay, a quantitative assay or variations thereof Competitive LFIAs are similar to competitive ELISA. The sample first encounters colored particles which are labeled with the target antigen or an analogue. The test line contains antibodies to the target/its analogue. Unlabelled antigen in the sample will block the binding sites on the antibodies preventing uptake of the colored particles. The test line will show as a colored band in negative samples. There are a number of variations on lateral flow technology. It is also possible to apply multiple capture zones to create a multiplex test.
The use of “dip sticks” or LFIA test strips and other solid supports have been described in the art in the context of an immunoassay for a number of antigen biomarkers. U.S. Pat. Nos. 4,943,522; 6,485,982; 6,187,598; 5,770,460; 5,622,871; 6,565,808, U.S. patent applications Ser. No. 10/278,676; U.S. Ser. No. 09/579,673 and U.S. Ser. No. 10/717,082, which are incorporated herein by reference in their entirety, are non-limiting examples of such lateral flow test devices. Examples of patents that describe the use of “dip stick” technology to detect soluble antigens via immunochemical assays include, but are not limited to U.S. Pat. Nos. 4,444,880; 4,305,924; and 4,135,884; which are incorporated by reference herein in their entireties. The apparatuses and methods of these three patents broadly describe a first component fixed to a solid surface on a “dip stick” which is exposed to a solution containing a soluble antigen that binds to the component fixed upon the “dip stick,” prior to detection of the component-antigen complex upon the stick. It is within the skill of one in the art to modify the teachings of this “dip stick” technology for the detection of polypeptides using antibody reagents as described herein.
Other techniques can be used to detect the level of a polypeptide in a sample. One such technique is the dot blot, and adaptation of Western blotting (Towbin et at., Proc. Nat. Acad. Sci. 76:4350 (1979)). In a Western blot, the polypeptide or fragment thereof can be dissociated with detergents and heat, and separated on an SDS-PAGE gel before being transferred to a solid support, such as a nitrocellulose or PVDF membrane. The membrane is incubated with an antibody reagent specific for the target polypeptide or a fragment thereof The membrane is then washed to remove unbound proteins and proteins with non-specific binding. Detectably labeled enzyme-linked secondary or detection antibodies can then be used to detect and assess the amount of polypeptide in the sample tested. The intensity of the signal from the detectable label corresponds to the amount of enzyme present, and therefore the amount of polypeptide. Levels can be quantified, for example by densitometry.
In certain embodiments, the gene expression products as described herein can be instead determined by determining the level of messenger RNA (mRNA) expression of genes associated with the marker genes described herein. Such molecules can be isolated, derived, or amplified from a biological sample, such as a tumor biopsy. Detection of mRNA expression is known by persons skilled in the art, and comprise, for example but not limited to, PCR procedures, RT-PCR, Northern blot analysis, differential gene expression, RNA protection assay, microarray analysis, hybridization methods, next-generation sequencing etc. Non-limiting examples of next-generation sequencing technologies can include Ion Torrent, Illumina, SOLiD, 454; Massively Parallel Signature Sequencing solid-phase, reversible dye-terminator sequencing; and DNA nanoball sequencing.
In general, the PCR procedure describes a method of gene amplification which is comprised of (i) sequence-specific hybridization of primers to specific genes or sequences within a nucleic acid sample or library, (ii) subsequent amplification involving multiple rounds of annealing, elongation, and denaturation using a thermostable DNA polymerase, and (iii) screening the PCR products for a band of the correct size. The primers used are oligonucleotides of sufficient length and appropriate sequence to provide initiation of polymerization, i.e. each primer is specifically designed to be complementary to a strand of the genomic locus to be amplified. In an alternative embodiment, mRNA level of gene expression products described herein can be determined by reverse-transcription (RT) PCR and by quantitative RT-PCR (QRT-PCR) or real-time PCR methods. Methods of RT-PCR and QRT-PCR are well known in the art. The nucleic acid sequences of the marker genes described herein have been assigned NCBI accession numbers for different species such as human, mouse and rat. Accordingly, a skilled artisan can design an appropriate primer based on the known sequence for determining the mRNA level of the respective gene.
Nucleic acid and ribonucleic acid (RNA) molecules can be isolated from a particular biological sample using any of a number of procedures, which are well-known in the art, the particular isolation procedure chosen being appropriate for the particular biological sample. For example, freeze-thaw and alkaline lysis procedures can be useful for obtaining nucleic acid molecules from solid materials; heat and alkaline lysis procedures can be useful for obtaining nucleic acid molecules from urine; and proteinase K extraction can be used to obtain nucleic acid from blood (Roiff, A et al. PCR: Clinical Diagnostics and Research, Springer (1994)).
In general, the PCR procedure describes a method of gene amplification which is comprised of (i) sequence-specific hybridization of primers to specific genes within a nucleic acid sample or library, (ii) subsequent amplification involving multiple rounds of annealing, elongation, and denaturation using a DNA polymerase, and (iii) screening the PCR products for a band of the correct size. The primers used are oligonucleotides of sufficient length and appropriate sequence to provide initiation of polymerization, i.e. each primer is specifically designed to be complementary to each strand of the nucleic acid molecule to be amplified.
In an alternative embodiment, mRNA level of gene expression products described herein can be determined by reverse-transcription (RT) PCR and by quantitative RT-PCR (QRT-PCR) or real-time PCR methods. Methods of RT-PCR and QRT-PCR are well known in the art.
In some embodiments, one or more of the reagents (e.g. an antibody reagent and/or nucleic acid probe) described herein can comprise a detectable label and/or comprise the ability to generate a detectable signal (e.g. by catalyzing reaction converting a compound to a detectable product). Detectable labels can comprise, for example, a light-absorbing dye, a fluorescent dye, or a radioactive label. Detectable labels, methods of detecting them, and methods of incorporating them into reagents (e.g. antibodies and nucleic acid probes) are well known in the art.
In some embodiments, detectable labels can include labels that can be detected by spectroscopic, photochemical, biochemical, immunochemical, electromagnetic, radiochemical, or chemical means, such as fluorescence, chemifluoresence, or chemiluminescence, or any other appropriate means. The detectable labels used in the methods described herein can be primary labels (where the label comprises a moiety that is directly detectable or that produces a directly detectable moiety) or secondary labels (where the detectable label binds to another moiety to produce a detectable signal, e.g., as is common in immunological labeling using secondary and tertiary antibodies). The detectable label can be linked by covalent or non-covalent means to the reagent. Alternatively, a detectable label can be linked such as by directly labeling a molecule that achieves binding to the reagent via a ligand-receptor binding pair arrangement or other such specific recognition molecules. Detectable labels can include, but are not limited to radioisotopes, bioluminescent compounds, chromophores, antibodies, chemiluminescent compounds, fluorescent compounds, metal chelates, and enzymes.
In other embodiments, the detection reagent is label with a fluorescent compound. When the fluorescently labeled antibody is exposed to light of the proper wavelength, its presence can then be detected due to fluorescence. In some embodiments, a detectable label can be a fluorescent dye molecule, or fluorophore including, but not limited to fluorescein, phycoerythrin, phycocyanin, o-phthaldehyde, fluorescamine, Cy3™, Cy5™, allophycocyanine, Texas Red, peridenin chlorophyll, cyanine, tandem conjugates such as phycoerythrin-Cy5™, green fluorescent protein, rhodamine, fluorescein isothiocyanate (FITC) and Oregon Green™, rhodamine and derivatives (e.g., Texas red and tetrarhodimine isothiocynate (TRITC)), biotin, phycoerythrin, AMCA, CyDyes™, 6-carboxyfhiorescein (commonly known by the abbreviations FAM and F), 6-carboxy-2′,4′,7′,4,7-hexachlorofiuorescein (HEX), 6-carboxy-4′,5′-dichloro-2′,7′-dimethoxyfiuorescein (JOE or J), N,N,N′,N′-tetramethyl-6carboxyrhodamine (TAMRA or T), 6-carboxy-X-rhodamine (ROX or R), 5-carboxyrhodamine-6G (R6G5 or G5), 6-carboxyrhodamine-6G (R6G6 or G6), and rhodamine 110; cyanine dyes, e.g. Cy3, Cy5 and Cy7 dyes; coumarins, e.g umbelliferone; benzimide dyes, e.g. Hoechst 33258; phenanthridine dyes, e.g. Texas Red; ethidium dyes; acridine dyes; carbazole dyes; phenoxazine dyes; porphyrin dyes; polymethine dyes, e.g. cyanine dyes such as Cy3, Cy5, etc; BODIPY dyes and quinoline dyes. In some embodiments, a detectable label can be a radiolabel including, but not limited to 3H, 125I, 35S, 14C, 32P and 33P. In some embodiments, a detectable label can be an enzyme including, but not limited to horseradish peroxidase and alkaline phosphatase. An enzymatic label can produce, for example, a chemiluminescent signal, a color signal, or a fluorescent signal. Enzymes contemplated for use to detectably label an antibody reagent include, but are not limited to, malate dehydrogenase, staphylococcal nuclease, delta-V-steroid isomerase, yeast alcohol dehydrogenase, alpha-glycerophosphate dehydrogenase, triose phosphate isomerase, horseradish peroxidase, alkaline phosphatase, asparaginase, glucose oxidase, beta-galactosidase, ribonuclease, urease, catalase, glucose-VI-phosphate dehydrogenase, glucoamylase and acetylcholinesterase. In some embodiments, a detectable label is a chemiluminescent label, including, but not limited to lucigenin, luminol, luciferin, isoluminol, theromatic acridinium ester, imidazole, acridinium salt and oxalate ester. In some embodiments, a detectable label can be a spectral colorimetric label including, but not limited to colloidal gold or colored glass or plastic (e.g., polystyrene, polypropylene, and latex) beads.
In some embodiments, detection reagents can also be labeled with a detectable tag, such as c-Myc, HA, VSV-G, HSV, FLAG, V5, HIS, or biotin. Other detection systems can also be used, for example, a biotin-streptavidin system. In this system, the antibodies immunoreactive (i. e. specific for) with the biomarker of interest is biotinylated. Quantity of biotinylated antibody bound to the biomarker is determined using a streptavidin-peroxidase conjugate and a chromagenic substrate. Such streptavidin peroxidase detection kits are commercially available, e. g. from DAKO; Carpinteria, Calif. A reagent can also be detectably labeled using fluorescence emitting metals such as 152Eu, or others of the lanthanide series. These metals can be attached to the reagent using such metal chelating groups as diethylenetriaminepentaacetic acid (DTPA) or ethylenediaminetetraacetic acid (EDTA).
In some embodiments of any of the aspects described herein, the level of expression products of more than one gene can be determined simultaneously (e.g. a multiplex assay) or in parallel. In some embodiments, the level of expression products of no more than 200 other genes is determined In some embodiments, the level of expression products of no more than 100 other genes is determined In some embodiments, the level of expression products of no more than 20 other genes is determined In some embodiments, the level of expression products of no more than 10 other genes is determined
The term “sample” or “test sample” as used herein denotes a sample taken or isolated from a biological organism, e.g., a tumor sample from a subject. Exemplary biological samples include, but are not limited to, a biofluid sample; serum; plasma; urine; saliva; a tumor sample; a tumor biopsy and/or tissue sample etc. The term also includes a mixture of the above-mentioned samples. The term “test sample” also includes untreated or pretreated (or pre-processed) biological samples. In some embodiments, a test sample can comprise cells from subject. In some embodiments, a test sample can be a tumor cell test sample, e.g. the sample can comprise cancerous cells, cells from a tumor, and/or a tumor biopsy. In some embodiments, the test sample can be a urine sample.
The test sample can be obtained by removing a sample of cells from a subject, but can also be accomplished by using previously isolated cells (e.g. isolated at a prior timepoint and isolated by the same or another person). In addition, the test sample can be freshly collected or a previously collected sample.
In some embodiments, the test sample can be an untreated test sample. As used herein, the phrase “untreated test sample” refers to a test sample that has not had any prior sample pre-treatment except for dilution and/or suspension in a solution. Exemplary methods for treating a test sample include, but are not limited to, centrifugation, filtration, sonication, homogenization, heating, freezing and thawing, and combinations thereof In some embodiments, the test sample can be a frozen test sample, e.g., a frozen tissue. The frozen sample can be thawed before employing methods, assays and systems described herein. After thawing, a frozen sample can be centrifuged before being subjected to methods, assays and systems described herein. In some embodiments, the test sample is a clarified test sample, for example, by centrifugation and collection of a supernatant comprising the clarified test sample. In some embodiments, a test sample can be a pre-processed test sample, for example, supernatant or filtrate resulting from a treatment selected from the group consisting of centrifugation, filtration, thawing, purification, and any combinations thereof In some embodiments, the test sample can be treated with a chemical and/or biological reagent. Chemical and/or biological reagents can be employed to protect and/or maintain the stability of the sample, including biomolecules (e.g., nucleic acid and protein) therein, during processing. One exemplary reagent is a protease inhibitor, which is generally used to protect or maintain the stability of protein during processing. The skilled artisan is well aware of methods and processes appropriate for pre-processing of biological samples required for determination of the level of an expression product as described herein.
In some embodiments, the methods, assays, and systems described herein can further comprise a step of obtaining a test sample from a subject. In some embodiments, the subject can be a human subject.
For convenience, the meaning of some terms and phrases used in the specification, examples, and appended claims, are provided below. Unless stated otherwise, or implicit from context, the following terms and phrases include the meanings provided below. The definitions are provided to aid in describing particular embodiments, and are not intended to limit the claimed invention, because the scope of the invention is limited only by the claims. Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. If there is an apparent discrepancy between the usage of a term in the art and its definition provided herein, the definition provided within the specification shall prevail.
For convenience, certain terms employed herein, in the specification, examples and appended claims are collected here.
As used herein, a “chemotherapy” refers to a substance that reduces or decreases the growth, survival, and/or metastasis of of cancer cells. Chemotherapies can include toxins, small molecules, polypeptides, and/or radiation therapies. In some embodiments, chemotherapy can include the use of radiation or radiation therapy. In some embodiments, chemotherapy can include, by way of non-limiting example, gemcitabine, cisplastin, paclitaxel, carboplatin, bortezomib, AMG479, vorinostat, rituximab, temozolomide, rapamycin, ABT-737, PI-103; alkylating agents such as thiotepa and CYTOXAN® cyclosphosphamide; alkyl sulfonates such as busulfan, improsulfan and piposulfan; aziridines such as benzodopa, carboquone, meturedopa, and uredopa; ethylenimines and methylamelamines including altretamine, triethylenemelamine, trietylenephosphoramide, triethiylenethiophosphoramide and trimethylolomelamine; acetogenins (especially bullatacin and bullatacinone); a camptothecin (including the synthetic analogue topotecan); bryostatin; callystatin; CC-1065 (including its adozelesin, carzelesin and bizelesin synthetic analogues); cryptophycins (particularly cryptophycin 1 and cryptophycin 8); dolastatin; duocarmycin (including the synthetic analogues, KW-2189 and CB1-TM1); eleutherobin; pancratistatin; a sarcodictyin; spongistatin; nitrogen mustards such as chlorambucil, chlornaphazine, cholophosphamide, estramustine, ifosfamide, mechlorethamine, mechlorethamine oxide hydrochloride, melphalan, novembichin, phenesterine, prednimustine, trofosfamide, uracil mustard; nitrosureas such as carmustine, chlorozotocin, fotemustine, lomustine, nimustine, and ranimnustine; antibiotics such as the enediyne antibiotics (e.g., calicheamicin, especially calicheamicin gammall and calicheamicin omegall (see, e.g., Agnew, Chem. Intl. Ed. Engl., 33: 183-186 (1994)); dynemicin, including dynemicin A; bisphosphonates, such as clodronate; an esperamicin; as well as neocarzinostatin chromophore and related chromoprotein enediyne antiobiotic chromophores), aclacinomysins, actinomycin, authramycin, azaserine, bleomycins, cactinomycin, carabicin, caminomycin, carzinophilin, chromomycinis, dactinomycin, daunorubicin, detorubicin, 6-diazo-5-oxo-L-norleucine, ADRIAMYCIN@ doxorubicin (including morpholino-doxorubicin, cyanomorpholino-doxorubicin, 2-pyrrolino-doxorubicin and deoxydoxorubicin), epirubicin, esorubicin, idarubicin, marcellomycin, mitomycins such as mitomycin C, mycophenolic acid, nogalamycin, olivomycins, peplomycin, potfiromycin, puromycin, quelamycin, rodorubicin, streptonigrin, streptozocin, tubercidin, ubenimex, zinostatin, zorubicin; anti-metabolites such as methotrexate and 5-fluorouracil (5-FU); folic acid analogues such as denopterin, methotrexate, pteropterin, trimetrexate; purine analogs such as fludarabine, 6-mercaptopurine, thiamiprine, thioguanine; pyrimidine analogs such as ancitabine, azacitidine, 6-azauridine, carmofur, cytarabine, dideoxyuridine, doxifluridine, enocitabine, floxuridine; androgens such as calusterone, dromostanolone propionate, epitiostanol, mepitiostane, testolactone; anti-adrenals such as aminoglutethimide, mitotane, trilostane; folic acid replenisher such as frolinic acid; aceglatone; aldophosphamide glycoside; aminolevulinic acid; eniluracil; amsacrine; bestrabucil; bisantrene; edatraxate; defofamine; demecolcine; diaziquone; elformithine; elliptinium acetate; an epothilone; etoglucid; gallium nitrate; hydroxyurea; lentinan; lonidainine; maytansinoids such as maytansine and ansamitocins; mitoguazone; mitoxantrone; mopidanmol; nitraerine; pentostatin; phenamet; pirarubicin; losoxantrone; podophyllinic acid; 2-ethylhydrazide; procarbazine; PSK® polysaccharide complex (JHS Natural Products, Eugene, Oreg.); razoxane; rhizoxin; sizofuran; spirogermanium; tenuazonic acid; triaziquone; 2,2′,2″-trichlorotriethylamine; trichothecenes (especially T-2 toxin, verracurin A, roridin A and anguidine); urethan; vindesine; dacarbazine; mannomustine; mitobronitol; mitolactol; pipobroman; gacytosine; arabinoside (“Ara-C”); cyclophosphamide; thiotepa; taxoids, e.g., TAXOL® paclitaxel (Bristol-Myers Squibb Oncology, Princeton, N.J.), ABRAXANE® Cremophor-free, albumin-engineered nanoparticle formulation of paclitaxel (American Pharmaceutical Partners, Schaumberg, Ill.), and TAXOTERE® doxetaxel (Rhone-Poulenc Rorer, Antony, France); chloranbucil; GEMZAR® gemcitabine; 6-thioguanine; mercaptopurine; methotrexate; platinum analogs such as cisplatin, oxaliplatin and carboplatin; vinblastine; platinum; etoposide (VP-16); ifosfamide; mitoxantrone; vincristine; NAVELBINE™; vinorelbine; novantrone; teniposide; edatrexate; daunomycin; aminopterin; xeloda; ibandronate; irinotecan (Camptosar, CPT-11) (including the treatment regimen of irinotecan with 5-FU and leucovorin); topoisomerase inhibitor RFS 2000; difluoromethylornithine (DMFO); retinoids such as retinoic acid; capecitabine; combretastatin; leucovorin (LV); oxaliplatin, including the oxaliplatin treatment regimen (FOLFOX); lapatinib (Tykerb™); inhibitors of PKC-alpha, Raf, H-Ras, EGFR (e.g., erlotinib (Tarceva®)) and VEGF-A that reduce cell proliferation and pharmaceutically acceptable salts, acids or derivatives of any of the above.
The terms “decrease”, “reduced”, or “reduction”, are all used herein to mean a decrease by a statistically significant amount. In some embodiments, “reduce,” “reduction” or “decrease” typically means a decrease by at least 10% as compared to a reference level (e.g. the absence of a given treatment) and can include, for example, a decrease by at least about 10%, at least about 20%, at least about 25%, at least about 30%, at least about 35%, at least about 40%, at least about 45%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 95%, at least about 98%, at least about 99% , or more. As used herein, “reduction” or does not encompass a complete inhibition or reduction as compared to a reference level.
The terms “increased”, “increase”, “enhance”, or “activate” are all used herein to mean an increase by a statically significant amount. In some embodiments, the terms “increased”, “increase”, “enhance”, or “activate” can mean an increase of at least 10% as compared to a reference level, for example an increase of at least about 20%, or at least about 30%, or at least about 40%, or at least about 50%, or at least about 60%, or at least about 70%, or at least about 80%, or at least about 90% or up to and including a 100% increase or any increase between 10-100% as compared to a reference level, or at least about a 2-fold, or at least about a 3-fold, or at least about a 4-fold, or at least about a 5-fold or at least about a 10-fold increase, or any increase between 2-fold and 10-fold or greater as compared to a reference level. In the context of a marker or symptom, a “increase” is a statistically significant increase in such level.
As used herein, a “subject” means a human or animal. Usually the animal is a vertebrate such as a primate, rodent, domestic animal or game animal. Primates include chimpanzees, cynomologous monkeys, spider monkeys, and macaques, e.g., Rhesus. Rodents include mice, rats, woodchucks, ferrets, rabbits and hamsters. Domestic and game animals include cows, horses, pigs, deer, bison, buffalo, feline species, e.g., domestic cat, canine species, e.g., dog, fox, wolf, avian species. In some embodiments, the subject is a mammal, e.g., a primate, e.g., a human. The terms, “individual,” “patient” and “subject” are used interchangeably herein.
Preferably, the subject is a mammal. The mammal can be a human, non-human primate, mouse, rat, dog, cat, horse, or cow, but is not limited to these examples. Mammals other than humans can be advantageously used as subjects that represent animal models of bladder cancer. A subject can be female or male.
A subject can be one who has been previously diagnosed with or identified as suffering from or having a condition in need of treatment (e.g. cancer) or one or more complications related to such a condition, and optionally, have already undergone treatment for cancer or the one or more complications related to cancer. Alternatively, a subject can also be one who has not been previously diagnosed as having cancer or one or more complications related to cancer. For example, a subject can be one who exhibits one or more risk factors for cancer or one or more complications related to cancer or a subject who does not exhibit risk factors.
A “subject in need” of treatment for a particular condition can be a subject having that condition, diagnosed as having that condition, or at risk of developing that condition.
As used herein, the terms “protein” and “polypeptide” are used interchangeably herein to designate a series of amino acid residues, connected to each other by peptide bonds between the alpha-amino and carboxy groups of adjacent residues. The terms “protein”, and “polypeptide” refer to a polymer of amino acids, including modified amino acids (e.g., phosphorylated, glycated, glycosylated, etc.) and amino acid analogs, regardless of its size or function. “Protein” and “polypeptide” are often used in reference to relatively large polypeptides, whereas the term “peptide” is often used in reference to small polypeptides, but usage of these terms in the art overlaps. The terms “protein” and “polypeptide” are used interchangeably herein when referring to a gene product and fragments thereof. Thus, exemplary polypeptides or proteins include gene products, naturally occurring proteins, homologs, orthologs, paralogs, fragments and other equivalents, variants, fragments, and analogs of the foregoing.
As used herein, the term “nucleic acid” or “nucleic acid sequence” refers to any molecule, preferably a polymeric molecule, incorporating units of ribonucleic acid, deoxyribonucleic acid or an analog thereof. The nucleic acid can be either single-stranded or double-stranded. A single-stranded nucleic acid can be one nucleic acid strand of a denatured double-stranded DNA. Alternatively, it can be a single-stranded nucleic acid not derived from any double-stranded DNA. In one aspect, the nucleic acid can be DNA. In another aspect, the nucleic acid can be RNA. Suitable nucleic acid molecules are DNA, including genomic DNA or cDNA. Other suitable nucleic acid molecules are RNA, including mRNA.
As used herein, the terms “treat,” “treatment,” “treating,” or “amelioration” refer to therapeutic treatments, wherein the object is to reverse, alleviate, ameliorate, inhibit, slow down or stop the progression or severity of a condition associated with a disease or disorder, e.g. ovarian cancer. The term “treating” includes reducing or alleviating at least one adverse effect or symptom of a condition, disease or disorder associated with a cancer. Treatment is generally “effective” if one or more symptoms or clinical markers are reduced. Alternatively, treatment is “effective” if the progression of a disease is reduced or halted. That is, “treatment” includes not just the improvement of symptoms or markers, but also a cessation of, or at least slowing of, progress or worsening of symptoms compared to what would be expected in the absence of treatment. Beneficial or desired clinical results include, but are not limited to, alleviation of one or more symptom(s), diminishment of extent of disease, stabilized (i.e., not worsening) state of disease, delay or slowing of disease progression, amelioration or palliation of the disease state, remission (whether partial or total), and/or decreased mortality, whether detectable or undetectable. The term “treatment” of a disease also includes providing relief from the symptoms or side-effects of the disease (including palliative treatment).
The term “statistically significant” or “significantly” refers to statistical significance and generally means a two standard deviation (2SD) or greater difference.
Other than in the operating examples, or where otherwise indicated, all numbers expressing quantities of ingredients or reaction conditions used herein should be understood as modified in all instances by the term “about.” The term “about” when used in connection with percentages can mean ±1%.
As used herein the term “comprising” or “comprises” is used in reference to compositions, methods, and respective component(s) thereof, that are essential to the method or composition, yet open to the inclusion of unspecified elements, whether essential or not.
The term “consisting of” refers to compositions, methods, and respective components thereof as described herein, which are exclusive of any element not recited in that description of the embodiment.
As used herein the term “consisting essentially of” refers to those elements required for a given embodiment. The term permits the presence of elements that do not materially affect the basic and novel or functional characteristic(s) of that embodiment.
The singular terms “a,” “an,” and “the” include plural referents unless context clearly indicates otherwise. Similarly, the word “or” is intended to include “and” unless the context clearly indicates otherwise. Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of this disclosure, suitable methods and materials are described below. The abbreviation, “e.g.” is derived from the Latin exempli gratia, and is used herein to indicate a non-limiting example. Thus, the abbreviation “e.g.” is synonymous with the term “for example.”
Definitions of common terms in cell biology and molecular biology can be found in “The Merck Manual of Diagnosis and Therapy”, 19th Edition, published by Merck Research Laboratories, 2006 (ISBN 0-911910-19-0); Robert S. Porter et al. (eds.), The Encyclopedia of Molecular Biology, published by Blackwell Science Ltd., 1994 (ISBN 0-632-02182-9); Benjamin Lewin, Genes X, published by Jones & Bartlett Publishing, 2009 (ISBN-10: 0763766321); Kendrew et al. (eds.),
Biology and Biotechnology: a Comprehensive Desk Reference, published by VCH Publishers, Inc., 1995 (ISBN 1-56081-569-8) and Current Protocols in Protein Sciences 2009, Wiley Intersciences, Coligan et al., eds.
Unless otherwise stated, the present invention was performed using standard procedures, as described, for example in Sambrook et al., Molecular Cloning: A Laboratory Manual (4 ed.), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., USA (2012); Davis et al., Basic Methods in Molecular Biology, Elsevier Science Publishing, Inc., New York, USA (1995); Current Protocols in Protein Science (CPPS) (John E. Coligan, et. al., ed., John Wiley and Sons, Inc.), which are all incorporated by reference herein in their entireties.
Other terms are defined herein within the description of the various aspects of the invention.
All patents and other publications; including literature references, issued patents, published patent applications, and co-pending patent applications; cited throughout this application are expressly incorporated herein by reference for the purpose of describing and disclosing, for example, the methodologies described in such publications that might be used in connection with the technology described herein. These publications are provided solely for their disclosure prior to the filing date of the present application. Nothing in this regard should be construed as an admission that the inventors are not entitled to antedate such disclosure by virtue of prior invention or for any other reason. All statements as to the date or representation as to the contents of these documents is based on the information available to the applicants and does not constitute any admission as to the correctness of the dates or contents of these documents.
The description of embodiments of the disclosure is not intended to be exhaustive or to limit the disclosure to the precise form disclosed. While specific embodiments of, and examples for, the disclosure are described herein for illustrative purposes, various equivalent modifications are possible within the scope of the disclosure, as those skilled in the relevant art will recognize For example, while method steps or functions are presented in a given order, alternative embodiments may perform functions in a different order, or functions may be performed substantially concurrently. The teachings of the disclosure provided herein can be applied to other procedures or methods as appropriate. The various embodiments described herein can be combined to provide further embodiments. Aspects of the disclosure can be modified, if necessary, to employ the compositions, functions and concepts of the above references and application to provide yet further embodiments of the disclosure. Moreover, due to biological functional equivalency considerations, some changes can be made in protein structure without affecting the biological or chemical action in kind or amount. These and other changes can be made to the disclosure in light of the detailed description. All such modifications are intended to be included within the scope of the appended claims.
Specific elements of any of the foregoing embodiments can be combined or substituted for elements in other embodiments. Furthermore, while advantages associated with certain embodiments of the disclosure have been described in the context of these embodiments, other embodiments may also exhibit such advantages, and not all embodiments need necessarily exhibit such advantages to fall within the scope of the disclosure.
The technology described herein is further illustrated by the following examples which in no way should be construed as being further limiting.
Some embodiments of the technology described herein can be defined according to any of the following numbered paragraphs:
Described herein is the identification of gene signatures in transitional cell carcinoma that can differentiate high-grade T1 non-progressive (T1NP) bladder cancer (BCa) from those T1 progressive (T1P) tumors that progress to muscularis propria invasive T2 Tumors.
A high-throughput RNA sequencing (RNA-Seq) was performed on formalin-fixed and paraffin-embedded (FFPE) BCa specimens with clinical pathological characteristics best representing the general clinical development of the disease. For the T1NP group, only patients with long term follow-up (6-17 years) and periodic examinations (average of 4 resections and 9 cytology tests) were selected. For the T1P group, only patients in whom a complete resection was performed after a minimum of at least 8 months after the initial T1 diagnosis were selected, therefore eliminating the possibility of under diagnosis. Only samples in which muscularis propria was present and uninvolved were included, further assuring a correct diagnosis. The RNA-Seq reads were mapped to the human genome build NCBI 36 (hg18) using Tophat with no mismatch. After alignment to the transcriptome and expression quantification, a linear statistical model was built using Limma between T1NP and T1P samples to identify differentially expressed genes.
A total of 5,561 genes were mapped to all samples and used for RNA-Seq analysis to identify a gene signature that was significantly and differentially expressed between T1NP and T1P BCa patients. Signature-based stratification indicated the gene signature correlated notably with the time of T1 development to T2 tumor, suggesting that the molecular signature might be used as an independent predictor for the pace of high-grade T1 BCa progression.
This is the first demonstration that RNA-Seq can be applied as a powerful tool to study BCa using FFPE specimens. A gene signature was identified that can distinguish patients diagnosed with high-grade T1 BCa which remain as non-muscle invasive tumors from those patients with cancers progressing to muscle invasive tumors. These findings will permit prognostic tools for accurate prediction of T1 BCa progression that can considerably influence the clinical decision—making process, treatment regimen and patient survival.
Introduction
Management of high-grade T1 (lamina propria invasive) bladder cancer (BCa) poses a challenging dilemma to clinicians and patients. Most BCa patients present initially as non-muscle invasive transitional cell carcinoma (TCC). There are a significant number of high-grade T1 lesions which have the potential to progress to muscle invasive BCa with increased risk for developing metastatic disease (1-2). For high-grade T1 TCC, approximately 80% of treated tumors recur and 5 to 15%% of recurrent tumors progress to invasive disease. Once BCa becomes metastatic, the cancer related survival is approximately 5.4% at 5 years (3). Currently, no definitive risk criteria exist to distinguish patients with T1 non-progressive (T1NP) BCa who may suffer multiple recurrences of the disease without developing muscle invasive tumors from those patients with the T1 progressive (T1P) BCa whose cancer first presented as T1 disease but eventually developed into muscle invasive and metastatic disease. Therefore, establishing new prognostic criteria to distinguish high-grade T1NP from T1P would meet a great clinical need.
Previously, BCa prognostic factors have been reported and specific markers and molecular pathways have been linked with BCa tumor stage and progression (4-6). Thus far, no standard guidelines have been adopted into clinical practice. Most of these studies used samples pooled from distinct groups of non-invasive BCa (Ta and T1) tumors and compared them with muscularis propria invasive BCa (T2, T3 and T4) tumors. Specimens with long-term clinical T1 BCa follow-up were lacking. This makes distinguishing BCa patients with T1NP from those of T1P particularly difficult, if not impossible from these studies.
Gene expression profiling has been used to develop prognostic signatures in a wide range of diseases (7-8), but its application has been limited, to an extent, by the fact that gene expression technologies work best with fresh frozen tissue (9-11). By far, the vast majority of human disease tissue samples and those with the best outcome data are archived formalin-fixed paraffin-embedded (FFPE) surgical specimens. Important clinical information and disease outcome are often collected years after the initial specimen collection. However, FFPE tissue processing is known to cause fragmentation and chemical modification of RNA, presenting challenges for gene expression profiling. The analysis of BCa specimens is potentially more difficult as they are typically obtained by transurethral resection (TUR) and are associated with cautery artifacts that may further degrade nucleic acids.
High-throughput RNA sequencing (RNA-Seq) is a recently developed gene expression profiling method that has several advantages over other expression profiling technologies, including higher sensitivities, low background noise and the ability to detect splicing variants and somatic mutations, resulting in precise measurements of levels of transcripts and their isoforms (12-13). To this date, the RNA-Seq technology using FFPE samples on BCa analysis has not been published.
The goal of the work described herein was to determine whether RNA-Seq analysis with archival FFPE BCa specimens could be used to identify a genomic signature capable of differentiating high-grade BCa of T1NP from those T1 diseases, which eventually progress to muscle invasive tumors. FFPE samples of BCa patients with long follow-up were used to determine the natural history of the disease. RNA-Seq analysis performed on these FFPE samples identified a gene expression signature associated with disease progression. The results described herein demonstrate the applicability of using RNA-Seq to study bladder tumors obtained by TUR and stored long-term as FFPE specimens. The gene signature identified by this study permit a diagnostic tool for patients at high risk for rapid progression to muscle invasive BCa.
Materials and Methods
Patients and tumor specimens. Research was performed with the approval by the hospital Institutional Review Board (IRB) at Massachusetts General Hospital (MGH). All the samples were obtained from patients treated at MGH between 1994 and 2011. BCa tumor FFPE specimens from 3 T1NP and 4 T1P patients were analyzed for each experiment condition. Specimens of papillary BCa were carefully evaluated by an expert urologic pathologist (CLW) and the tumor tissues were identified, marked and highlighted. Two cores with only papillary bladder tumors and without any stromal or muscular tissues were collected from highlighted areas using a biopsy punch with a plunger (1.5 mm in diameter) in a RNA-free environment. Cancer grade was assigned using the 1973 World Health Organization pathology (WHO) criteria for BCa and cancer stage assigned according to the 7th edition of AJCC TNM system (17-18).
RNA extraction, rRNA removal and sequencing library construction. Total RNA was extracted using hot phenol with additional purification using the RNEASY™ mini kit (Qiagen, Germantown, MD) following the manufacturer's instructions. RNA integrity was assessed using an Agilent bioanalyzer (Agilent, Santa Clara, Calif.) and the RNA integrity number (RIN) was calculated for each sample and the average RIN for all samples was 2.6, ranging from 2.3 to 3.3). A cDNA library was constructed for each sample using Illumina's mRNA-Seq Sample Prep Kit™ (Illumina San Diego, Calif.). Briefly, for each sample, 100 ng of total RNA was used to generate a sequencing library and the RNA was directly subjected to fragmentation without the mRNA purification step. The resulting sample libraries were subjected to DSN (double-specific nuclease) treatment using the Trimmer-Direct cDNA Normalization™ kit (Evrogen, Moscow, Russia).
RNA-Seq read mapping and annotation. The cDNA library of each sample was loaded to a single lane of Illumina flow cell and the libraries were sequenced on Illumina Genome Analyzer II™. Image deconvolution and calculation of quality value were performed using the Boat module (Firecrest v1.1.4.0 and Bustard v.1.4.0 programs) of Illumina pipeline V1.4™. Sequence base calls were assigned using Illumina CASAVA™ software. The reads were 36 bases long and each lane produced an average 30 million of 36-mer raw sequence reads. Reads were mapped to the human genome build NCBI 36 (hg18) using Tophat™ (19) with no mismatch. The mapped reads were assembled and annotated using Cufflinks™ software tools (20).
Transcript quantification and gene expression consolidation. Transcript abundances were quantified in FPKM (Fragments Per Kilobase of exon per Million fragments mapped) by Cufflinks™ which taking into account both the gene length and the mapped reads for each sample and normalized accordingly (21). Due to the fragmented nature of mRNA in FFPE samples, the abundance measurement at the gene level was focused upon. When multiple transcript abundance measurements were reported for a gene, the maximum value was chosen to represent the expression level of that gene.
Differentially expressed genes identification in RNA-Seq data. After alignment to the transcriptome and expression quantification, a linear statistical model was built using limma (21-22) between 3 T1NP samples and 4 T1P samples to identify differentially expressed genes with T1NP samples as the reference. The analyses were accomplished using R™ and Bioconductor™ packages (23). To validate our gene signature, Illumina DASL Cancer Panel™ (The cDNA-mediated Annealing, Selection, extension, and Ligation) Assay was performed independently with the same specimens and genes were identified to show significantly differential expressions between the T1NP and T1P patients using both the RNA-Seq™ and the DASL™ platforms.
Functional enrichment and network analysis. Network enrichment for the significantly differentially expressed genes was analyzed using Ingenuity Pathway Analysis (IPA)™ software. The network interaction of the focused genes in the network is based on their connectivity in Ingenuity Knowledge Base™.
Results
Patient cohort characteristics. To ensure the high specificity of the gene signature obtained, tissue samples were selected from patients with the clinicalpathologial characteristics that best represented the general clinical development of non-progressive from progressive disease. 7 patients in this study were selected based on several considerations. First, all patients had well-established lamina propria invasive TCC confirmed by an expert urologic pathologist. Second, every patient's disease history and their pathology reports were carefully evaluated to ascertain BCa progression. Third, only patients with extended follow-up and sufficient tumor cells in the paraffin blocks were included in this study. All specimens had muscularis propria present which was uninvolved. Patients had a short interval follow-up biopsy after initial diagnosis which demonstrated no tumor. As shown in Table 4, for the T1NP group, these 3 patients were confirmed as non-progressive disease by subsequent biopsies, and with significantly extended follow-up times, ranging from 6 to 17 years (average 9 years). For T1P patients, we excluded any cases with short interval between T1 and T2 diagnosis, eliminating the possibility of T1 BCa being under-staged. The 4 T1P patients had an average time to development of muscularis propria invasive cancer of 2.43 years, ranging from 0.8 to 4.5 years and with an average follow-up time of 4.75 years ranging from 1 to 8 years. All these data coincided with the general clinical observation of T1 BCa progression.
RNA-Seq analysis of T1NP and T1P tumors. RNA-Seg™ analysis was performed using the Illumina GAII™ platform. The study design and the workflow for the RNA-Seq are illustrated in
Identification of differentially expressed genes in T1NP compared to T1P samples. Limma™ (22) was used to construct a linear model and identified a total of 181 significantly differentially expressed genes (Tables 1 and 2, p-value <0.05) between the non-progressive and progressive patients whose original diagnosis was high-grade T1 BCa. The 181 most significantly differentially expressed genes (p-value <0.05) that can distinguish high-grade T1NP tumors with non-progressive recurrence from those T1P muscle invasion tumors are listed. The genes were sorted based on the p Value and their expression levels were indicated by the log FC (P/NP) values, genes that over-expressed were with positive logFC (P/NP) values and genes that under-expressed were with negative logFC (P/NP) values. Among them, 101 up-regulated and 80 down-regulated genes in T1P relative to T1NP were found. The results were validated with the DASL Cancer Panel™ analysis using the same samples. Average-linkage hierarchical clustering was then performed using a Pearson correlation-coefficient distance metric using the gene signature identified (data not shown) in 3 T1NP samples and 4 T1P samples and median-centered log2 (FPKM) for each gene. In examining the resulting hierarchical clustering dendrogram, one can see a correlation between gene expression levels and the time for disease progression for the 4 T1P samples. As shown in
Discussion
Genomics has created an unprecedented opportunity to survey expression patterns across the genome and to use the resulting data to develop diagnostic and prognostic biomarkers. However, doing this requires the availability of well-annotated clinical samples with extensive clinical data so that patterns of gene expression can be linked to outcome or other relevant endpoints.
While there are many archival pathological samples reserved as FFPE tissues, these have proven difficult to analyze using most of the available genomic technologies. This is largely due to the fact that the process of creating FFPE samples is known to introduce chemical modification and cross-linking between DNA, RNA and proteins in these samples. In BCa, the situation has proven particularly difficult as specimens are typically obtained through TUR and the cautery effect associated with the procedure may further degrade nucleic acids. To overcome these limitations, a variation on RNA-Seq technology that relies on short nucleic acid fragments coupled with DSN normalization was used. DSN removes ribosomal RNA and other abundant double-stranded DNA and DNA-RNA hybrid complexes, allowing creation of RNA-Seq libraries from most highly degraded samples (26-27). Using this approach, a gene signature was identified that significantly differentially expressed between the non-progressive and progressive patients T1 BCa (Tables 1 and 2). The data described herein demonstrate that FFPE specimens obtained by TUR can be used in RNA-Seq, genome-wide expression analysis.
In the work described herein samples were selected that had an appropriate diagnosis and were of high quality. It is known that up to 25% of T1 G3 tumors are incorrectly staged on initial surgical resection (TURB). Studies without systematically repeated TURB could have been hampered by the lack of patients with correctly documented T1 who subsequently progress at a latter date. It is therefore critical that a re-resection be performed, particularly if muscularis propria is not present in the specimen. In this study, a complete resection was performed for each and all patients with a minimum of at least 8 months after the initial T1 diagnosis. Only samples in which muscularis propria was present and uninvolved were included, further assuring a correct diagnosis. For each patient included in the present study, there were longitudinal data, including TURB and cytology tests (between 14 and 24, ranging from 2-40), followed the original diagnosis (data not shown). For the non-progressive group, one of the most critical issues is the length of the follow-up. In the present study, T1NP patients were followed up to 17 years with periodic examinations (average being more than 4 resections and 9 cytology tests), providing strong evidence to support the assignment of non-progressive status. For the progressive group, the important issue was that a second resection should be performed following an initial T1 diagnosis without evidence of T2 disease. In the present study, a sample was required to have a second resection to be accepted as T1 progression, therefore eliminating the possibility of under diagnosis. Only with careful exclusion of patients not fitting these criteria can one be certain of a true T1 diagnosis and not a misdiagnosed T2 BCa.
Previous studies of superficial papillary BCa have associated specific genomic alterations with the disease, including mutations in and dysregulation of FGFR3, PI3K, KRAS, HRAS, TP53, P16, TSC1, and PTEN, loss of chromosome 9, 9p or 9q as well as loss of RB1 (9-10, 28-29). Recently, mutations in genes responsible for chromatin remodeling were identified (30) and high-throughput assays including RNA-Seq performed on fresh tissue samples have shown promising results, demonstrating molecular assessment can be used for the development of mutation and pathway-based trials of targeted therapy for cancer patients (16). Ingenuity Pathway Analysis (IPA) software was used to search for over-representation of biological pathways and annotated gene functional classes among the genes found to be significant. Among the highest ranking were cancer-related pathways associated with cell death, cellular growth and proliferation, and cell cycle (
As described herein, a gene signature was identified that was able to distinguish T1NP from T1P patients. The gene signature can distinguish patients diagnosed with high-grade T1 BCa that remain as non-muscle invasive tumors from those patients with cancers progressing to muscle invasive tumors. This is the first demonstration that RNA-Seq can be applied as a tool to study BCa using FFPE specimens. These findings permit improvied clinical T1 BCa treatment regimens and thereby impact patient survival.
Management of high-grade T1 (lamina propria invasive) bladder cancer (BCa) poses a challenging dilemma to clinicians and patients. Currently, no definitive prognostic criteria exist to help in treatment decision making for high-grade T1 BCa patients. As described herein, gene expression biomarkers were identified in transitional cell carcinoma that can differentiate high-grade T1 non-progressive BCa from those T1 tumors that eventually progress to T2 (muscularis propria invasive) tumors. High-throughput RNA sequencing (RNA-Seq) was performed on BCa specimens obtained by transurethral resection (TUR), as formalin-fixed and paraffin-embedded (FFPE) with a confirmed history of disease development and extended follow-up. Surgical specimens of three T1 non-progressive and four T1 progressive tumors were used. A total of 5,561 genes were mapped to all samples and used to identify significantly and differentially expressed genes between the non-progressive and the progressive T1 BCa patients (p<0.05). Signature-based stratification indicates that the gene signature correlates notably with the time of T1 development to T2 tumor, suggesting that the molecular signature might be used as an independent predictor for the rate of high-grade T1 BCa progression. The data described herein demonstrate that RNA-Seq could be applied as a powerful tool to study BCa gene expression using FFPE samples and permit improved T1 BCa treatment regimens and thereby impact patient survival.
Introduction
Bladder cancer (BCa) is among the five most commonly diagnosed malignancies in the world and is one of the most prevalent types of cancer, responsible for annual deaths of approximately 150,000 worldwide (1-3). Most BCa patients present initially as non-muscle invasive transitional cell carcinoma (TCC), including non-invasive papillary TCC, TCC in situ and lamina propria invasive TCC (stage T1 cancer). Significant cases of T1 lesions are high-grade T1 lesions and have the potential to progress to muscularis propria or muscle invasive BCa with increased risk for developing metastatic cancer (4, 5). For high grade T1 TCC, approximately 80% of treated tumors recur and 20% of recurrent tumors progress to invasive disease. Once BCa becomes metastatic, despite aggressive treatments such as surgical resection, radiotherapy and chemotherapy, treatment fails in 95% patients with advanced disease and th cancer related survival is approximately 5.4% at 5 years (6). Currently, no definitive risk criteria exist to distinguish patients with T1 non-progressive (T1NP) BCa who may suffer multiple recurrences of the disease without developing muscle invasive tumors from those patients with the T1 progressive (T1P) BCa whose cancer first presented as T1 disease but eventually developed into muscle invasive and/or metastatic tumors. Most experts would not recommend immediate cystectomy to prevent progression, given the morbidity of the procedure and the fact that the majority of patient's with T1 disease do not progress and their tumors will not threaten their lives. Therefore, establishing new prognostic criteria to distinguish high-grade T1NP from T1P would meet a great clinical need.
In the past several years, studies have found distinct genotypic and phenotypic patterns to be associated with different stages and progression of BCa. Although prognostic factors including grade, size, recurrence and multiplicity have been reported and specific markers and molecular pathways have been linked and correlated with BCa tumor stage and progression (7-9), thus far, no standard guidelines have been adopted into clinical practice. Most of these studies used samples pooled from distinct groups of non-invasive BCa (Ta and T1) tumors and compared them with muscularis propria invasive BCa (T2, T3 and T4) tumors and specimens with longterm clinical T1 BCa follow-up were lacking. This makes distinguishing BCa patients with T1NP from those of T1P particularly difficult, if not impossible. The current limited prognostic indicators have failed to adequately discriminate T1 patients with the relatively indolent T1NP BCa from those patients with the aggressive invasive T1P BCa.
Gene expression profiling has been used to develop prognostic signatures in a wide range of diseases (10-11), but its application has been limited, to an extent, by the fact that gene expression technologies work best with fresh frozen tissue (12-14). By far, the vast majority of human disease tissue samples, and those with the best outcome data are archived formalin-fixed paraffin-embedded (FFPE) surgical specimens. Important clinical information and disease outcome are often collected years after the initial specimen collection. However, FFPE tissue processing is known to cause fragmentation and chemical modification of RNA, presenting challenges for gene expression profiling. The analysis of BCa specimens is potentially more difficult as they are typically obtained by transurethral resection (TUR) and are associated with cautery heat artifacts that may further degrade nucleic acid. It is unknown if the time FFPE kept in storage would also pose technical difficulties.
High-throughput RNA sequencing (RNA-Seq) is a recently developed gene expression profiling method that has several advantages over other expression profiling technologies, including higher sensitivities, low background noise and the ability to detect splicing variants and somatic mutations, resulting in precise measurements of levels of transcripts and their isoforms (15-16). To this date, the RNA-Seq technology using FFPE samples on bladder tumor analysis has not been published.
The objective of the work described herein was to determine whether RNA-Seq using archival FFPE BCa specimens could be used to identify a genomic signature capable of differentiating high-grade BCa of T1NP from those T1 diseases, which eventually progress to muscle invasive tumors (T2 or higher stages). FFPE samples of BCa patients with extended years of follow-up to determine the natural history of the disease were used. A total of 7 cases, including 3 high-grade T1NP with follow-up time ranging from 6 to 17 years and 4 high-grade T1P whose disease progressed from T1 to muscularis propria invasive tumor following the initial diagnosis in the period between 0.8 to 4.5 years. RNA-Seq analysis was performed on these FFPE samples and a gene expression signature associated with disease progression was identified. Among the top differentially expressed genes are those involved in cell growth and apoptosis and the gene signature described herein correlated considerably with T1 BCa progression. These results demonstrated the applicability of using RNA-Seq to study bladder tumors obtained by TUR and stored in long-term as FFPE specimens. The gene signature identified by this study represents a promising diagnostic tool for patients at high risk for rapid progression to muscle invasive BCa.
Materials and Methods
Patient and tumor samples. Research was performed with the approval by the hospital Institutional Review Board (IRB) at Massachusetts General Hospital. All the samples were obtained from patients treated at Massachusetts General Hospital, between 1994 and 2011. 7 patients in this study were selected based on several considerations (detailed in the Results section). Tumor specimens from 3 T1 non-progressive BCa and 4 T1 progressive BCa patients were analyzed. Tissues were collected by TUR and stored as FFPE specimens. Cancer specimen slides were reviewed and chosen by an expert urologic pathologist (CLW). Cancer grade was assigned using the 1973 World Health Organization pathology (WHO) criteria for bladder cancer and cancer stage assigned according to the 7th edition of AJCC TNM system (20, 21). Two 1 5 mm punched cores were obtained from the tumor area in the paraffin block and subjected to total RNA extraction.
Histological staining and Images collection. The paraffin-embedded bladder cancer tissues were cut at 5 μm and deparaffinized with xylene and rehydrated, following by hematoxylin and eosin (H&E) histological staining The slides were examined by an expert urologic pathologist and the images of representative regions of T1 bladder cancer were collected.
RNA extraction, rRNA removal and sequencing library construction. Total RNA was extracted using hot phenol with additional purification using the RNeasy mini kit™ (Qiagen, Germantown, MD) following the manufacturer's instruction. The quality and quantity of the extracted RNA were evaluated. RNA integrity was assessed using an Agilent bioanalyzer (Agilent, Santa Clara, Calif.) and the RNA integrity number (RIN) was calculated for each sample.
A cDNA library was constructed for each and all 7 samples using Illumina's mRNA-Seq Sample Prep Kit™ (Illumina San Diego Calif.). Briefly, for each sample, 100 ng of total RNA was used to generate a sequencing library without any additional other treatment. The RNA was directly subjected to fragmentation without the mRNA purification step. The first and the second-strand cDNA were synthesized from the fragmented RNA using random hexamer primers. End repair, A-tailing, adaptor ligation, cDNA template purification and enrichment of the purified cDNA template using PCR were then performed. The resulting sample libraries were subjected to DSN™ (double-specific nuclease) treatment using the Trimmer-Direct cDNA Normalization kit™ (Evrogen, Moscow, Russia).
RNA-Seq read mapping and annotation. The cDNA library of each sample was loaded to a single lane of Illumina™ flow cell and the libraries were sequenced on Illumina Genome Analyzer II™. Image deconvolution and calculation of quality value were performed using the Boat module (Firecrest v1.1.4.0 and Bustard v.1.4.0 programs) of Illumina pipeline Vi.4™. Sequence base calls were assigned using Illumina CASAVA™ software. The reads were 36 bases long and each lane produced an average 30 million of 36-mer raw sequence reads. Reads were mapped to the human genome build NCBI 36™ (hg18) using Tophat™ (22) with no mismatch. The mapped reads were assembled and annotated using Cufflinks™ software tools (23).
Transcript quantification and gene expression consolidation. Transcript abundances were quantified in FPKIVI™ (Fragments Per Kilobase of exon per Million fragments mapped) by Cufflinks™. Due to the fragmented nature of mRNA in FFPE samples, the abundance measurement at the gene level was focused upon. When multiple transcript abundance measurements were reported for a gene, the maximum value was chosen to represent the expression level of that gene.
Differentially expressed genes identification in RNA-Seq data. After alignment to the transcriptome and expression quantification, a linear statistical model was built using limma™ (24-25) between 3 T1NP samples and 4 T1P samples to identify differentially expressed genes with T1NP samples as the reference. The analyses were accomplished using R™ and Bioconductor™ packages (26).
DASL Microarray Assay. The Illumina DASL™ (cDNA-mediated Annealing, Selection, extension and Ligation) Human Cancer Panel™ gene set (Illumina Inc., San Diego, Calif., USA) were represented by a pool of selected probe groups that target 502 gene mRNAs collected from publicly available cancer gene lists, including oncogenes, tumor suppressor genes and genes in their associated pathways.
Each mRNA was targeted in three locations by three separate probes. In this study, following manufacture's instruction, briefly, A 100 ng portion of total RNA from BCa samples was converted into cDNA using biotinylated random nonamers, oligo-deoxythymidine 18 primers and Illumina-supplied reagents, according to manufacturer's instructions. The resulting biotinylated cDNA was annealed to assay oligonucleotides and bound to streptadivin-conjugated paramagnetic particles to select the cDNA/oligo complexes. After oligo hybridization, mis-hybridized and non-hybridized oligos were washed away, while bound oligos were extended and ligated to generate templates to be subsequently amplified with shared PCR primers. For each sample, at least three technical replicates were performed. After hybridization, the arrays were scanned by laser confocal microscopy using the Illumina BeadArrary Reader 500™ system. To identify significantly differentially expressed genes between T1NP and T1P samples in DASL Cancer Panel™ data, the same linear modeling as in RNA-Seq was applied.
Functional enrichment and network analysis. Network enrichment for the significantly differentially expressed genes was analyzed using Ingenuity Pathway Analysis (IPA)™ software. The network interaction of the focused genes in the network is based on their connectivity in Ingenuity Knowledge Base™.
Results
Patients’ FFPE sample selection. To ensure the high specificity of the gene signature obtained, tissue samples for which the clinicalpathologial characteristics that best represent the general clinical development of the disease were selected. 7 patients in this study were selected based on several considerations. First, all patients had well-established lamina propria invasive TCC confirmed by an expert urologic pathologist. Second, every patient's disease history and their pathology reports were carefully evaluated to ascertain BCa progression. Third, only patients with extended follow-up and sufficient tumor cells in the paraffin blocks were included in this study. As shown in Table 4, for the T1NP group, these 3 patients were confirmed as non-progressive disease by subsequent biopsies, and with significantly extended follow-up times, ranging from 6 to 17 years (average 9 years). For T1P patients, it was recognized that some T1 cancers could be under-staged. When selecting the cases for the present study, any cases with very short interval between T1 and T2 diagnosis were excluded. The 4 patients in the T1P group had an average time to development of muscularis propria invasive cancer of 2.43 years, ranging from 0.8 to 4.5 years and with an average follow-up time of 4.75 years ranging from 1-8 years (Table 4). All these data coincided with the general clinical observation of T1 BCa progression. The representative images collected from both T1NP (
RNA-Seq analysis. Specimens of papillary BCa were carefully evaluated by an expert urologic pathologist and the tumor tissues were identified, marked and highlighted. FFPE cores were collected from these areas using a biopsy punch with a plunger (1.5 mm in diameter) in a RNA-free environment and all the excess amount of paraffin was removed. Once the RNA was extracted, the quantity of extracted RNA was examined and sample integrity was further evaluated on an Agilent Bioanalyzer.
RNA-Seq analysis of T1NP and T1P tumors. RNA-Seq was performed using the Illumina GAII™ platform. The study design and the workflow for the RNA-Seq are illustrated in
As shown in
Identification of differential expressed genes in T1NP compared to T1P samples. Limma™ (25) was used to construct a linear model and a total of 181 significantly differentially expressed genes between the non-progressive and progressive groups of patients (p-value <0.05) whose original diagnosis were high-grade T1 BCa (Tables 1 and 2) was identified. Among these, 101 genes were up regulated and 80 were down regulated in T1P relative to T1NP. Table 7 lists the top 28 most significantly differentially expressed genes (p-value <0.01) that can distinguish high-grade T1NP tumors with non-progressive recurrence from those T1P muscle invasion tumors. Average-linkage hierarchical clustering was performed using a Pearson correlation-coefficient distance metric using the gene signature identified (
Application of DASL array assay to identify differential expressed genes between T1NP and T1P FFPE samples. The DASL Cancer Panel assay has showed limited but demonstrated ability to profile a significant number of FFPE cancer samples. It has generated gene expression profiles from FFPE samples and other samples containing partially degraded RNAs (29-30). To validate the RNA-Seq result described above, DASL caner panel analysis was performed with the same cohort on all the 7 patients' samples. This method enables the measurement RNA abundance of over 502 genes in parallel per sample. Analysis was performed on the DASL Cancel Panel and 218 genes (among the 502 gene) were expressed in all blocks and 150 genes were not. A Total of 13 genes on the DASL Cancer Panel were present in the presentlyd escribed 181-gene signature identified by RNA-Seq using the same samples to distinguish T1NP from T1P (As shown in
The top enriched network associated with BCa progression. Ingenuity Pathway Analysis (IPA) software was used to search for over-representation of biological pathways and annotated gene functional classes among the genes found to be significant. Among the significant genes, 193 significant annotated biological functions (Table 8) and 68 over-represented canonical pathways (Fisher's Exact Test P value <0.05) were found within the expression gene signature (Table 9) with the top enriched network associated with BCa progression shown in
Discussion
Genomics has created an unprecedented opportunity to survey expression patterns across the genome and to use the resulting data to develop diagnostic and prognostic biomarkers. However, doing this requires the availability of well-annotated clinical samples with extensive clinical data so that patterns of gene expression can be linked to outcome or other relevant endpoints.
While there are many of archival pathological samples reserved as formalin-fixed paraffin-embedded (FFPE) tissues, these have proven difficult to analyze using most of the available genomic technologies. This is largely due to the fact that the process of creating FFPE samples is known to introduce chemical modification and cross-linking of RNA molecules through the addition of mono-methylol to amino groups for RNA and cross-linking between DNA, RNA and proteins occur in these samples. In BCa, the situation has proven particularly difficult as specimens are typically obtained through TUR and the cautery effect associated with the procedure may further degrade nucleic acid.
To overcome these limitations, a variation on RNA-Seq technology that relies on short nucleic acid fragments coupled with double-stranded nuclease (DSN) normalization was used herein. DSN removes ribosomal RNA and other abundant double-stranded DNA and DNA-RNA hybrid complexes, allowing creation of RNA-Seq libraries from most highly degraded samples (31-32). Using this approach, it is demonstrated herein that FFPE specimens obtained by TUR could be used in RNA-Seq, genome-wide expression analysis. The present data suggest that the length of FFPE specimen storage is not associated with reduced RNA quantity or quality (Table 4).
In this study, DASL Caner Panel analysis was performed on all the 7 samples and 13 genes were present on the gene signature identified by RNA-Seq. Surprisingly, out of the 13 genes on the list, an independent analysis demonstrated that 5 of the 13 genes showed significantly differential expressions between the T1NP and T1P patients when comparing between RNA-Seq and DASL data. Moreover, a high degree of correlation was found (Spearman correlation coefficient range from 0.82 to 0.93). This is extremely encouraging, given the challenge of the FFPE samples and the fundamental differences between the two approaches.
Previous studies of superficial papillary bladder tumors have associated with etiologic factors such and smoking and specific genomic alterations with the disease, including mutations in and dysregulation of FGFR3, PI3K KRAS, HRAS, TPp53, P16, TSC1, and PTEN, loss of chromosome 9, 9p or 9q as well as frequent amplification of 6p22 and loss of RB1 (12, 13, 33-46). To date, studies have not produced clinical useful prognostic predictors for T1 BCa progression. Described herein is the identification of a gene signature that was able to distinguish T1NP from T1P patients.
1E00
indicates data missing or illegible when filed
This application is a continuation-in-part application of International Patent Application No. PCT/US13/067792 filed Oct. 31, 2013, which designates the United States and claims benefit under 35 U.S.C. §119(e) of U.S. Provisional Applications No. 61/795,990 filed Oct. 31, 2012 and No. 61/721,300 filed Nov. 1, 2012, the contents of which are incorporated herein by reference in their entirety.
Number | Date | Country | |
---|---|---|---|
61795990 | Oct 2012 | US | |
61721300 | Nov 2012 | US |
Number | Date | Country | |
---|---|---|---|
Parent | PCT/US2013/067792 | Oct 2013 | US |
Child | 14698254 | US |