The present disclosure relates generally to a prostate cancer biomarker signature. More particularly, the present disclosure relates to a multi-model signature for the prognosis of prostate cancer outcomes, which can inform treatment decisions and guide therapy.
Prostate cancer is the most commonly diagnosed non-skin malignancy in men, resulting in an estimated 256,000 deaths worldwide in 20101. While the vast majority of men present with localized, and thus potentially curable disease, current clinical prognostic factors explain only a fraction of the heterogeneity of treatment response. These factors thus do not optimally triage individual patients into appropriate risk groupings that determine the appropriate treatment aggression2,3.
Localized prostate cancers exhibit striking inter-tumoural heterogeneity, at both the genomic4,5 and microenvironmental levels6. In particular, intermediate risk prostate cancers are localized, non-indolent and clinically heterogeneous. Despite current management with either surgery or radiotherapy, more than 30% of men suffer relapse of their disease; in 10% of these men (approximately 10,000 a year), rapid biochemical recurrence can portend prostate cancer-specific death7. Having a rigorous understanding of the genetic factors driving progression and aggression in the initial pre- and post-treatment setting is a critical need for both clinicians and genetic researchers, as distinct genomic pathways of progression could define prostate cancer sub-types leading to novel curative therapeutics. It is of utmost importance to be able to identify the genetic drivers of localized, non-indolent prostate cancer, as they cannot be inferred from genomic studies of metastatic castrate-resistant prostate cancer (mCRPC), due to tumour cell selection and adaption to secondary androgen deprivation therapy8.
In a general aspect, there is provided a method of prognosing and/or predicting disease progression in subject with prostate cancer. The method comprises use of at least 2 patient biomarkers determined or measured from genetic material of cancer cells and comparing them to corresponding reference or control measures of the same biomarkers. The biomarkers are selected from T category, and aberrations in ACTL6B, TCERGL1, chr7:61 Mbp, ATM and MYC. Statistically significant aberrations of the subject biomarkers when compared to the reference biomarkers would be indicative of a worse outcome.
In an aspect, there is provided a method of prognosing and/or predicting disease progression in subject with prostate cancer, the method comprising: a) providing a sample containing genetic material from cancer cells; b)determining or measuring at least 2 patient biomarkers regarding the prostate cancer tumor selected from the group consisting of: T category, ACTL6B methylation, TCERGL1 methylation, chr7:61 Mbp inter-chromosomal translocation, ATM single nucleotide variants and MYC copy number aberrations; c) comparing said patient biomarkers to corresponding reference or control biomarkers; and d) determining the likelihood of disease progression; wherein a likelihood of disease progression is higher with each of ACTL6B hyper-methylation, TCERGLI hypo-methylation, higher T category, and higher incidences of chr7:61 Mbp inter-chromosomal translocation, ATM single nucleotide variants and MYC copy number aberrations, when the difference is statistically significant on comparison with the reference or control biomarkers.
In an aspect, there is provided a computer-implemented method of predicting disease progression in patient with prostate cancer, the method comprising: a) receiving, at at least one processor, data reflecting at least 2 patient biomarkers regarding the prostate cancer tumor selected from the group consisting of: T category, ACTL6B methylation, TCERGL1 methylation, chr7:61 Mbp inter-chromosomal translocation, ATM single nucleotide variants and MYC copy number aberrations; b) constructing, at the at least one processor, an expression profile corresponding to the expression levels; c) comparing, at the at least one processor, said patient biomarkers to corresponding reference or control biomarkers; d) determining, at the at least one processor, the likelihood of disease progression; wherein a likelihood of disease progression is higher with each of ACTL6B hyper-methylation, TCERGLI hypo-methylation, higher T category, and higher incidences of chr7:61 Mbp inter-chromosomal translocation, ATM single nucleotide variants and MYC copy number aberrations, when the difference is statistically significant on comparison with the reference or control biomarkers.
In an aspect, there is provided a computer program product for use in conjunction with a general-purpose computer having a processor and a memory connected to the processor, the computer program product comprising a computer readable storage medium having a computer mechanism encoded thereon, wherein the computer program mechanism may be loaded into the memory of the computer and cause the computer to carry out the method described herein.
In an aspect, there is provided a computer readable medium having stored thereon a data structure for storing the computer program product described herein.
In an aspect, there is provided a device for predicting disease progression in patient with prostate cancer, the device comprising: at least one processor; and electronic memory in communication with the at one processor, the electronic memory storing processor-executable code that, when executed at the at least one processor, causes the at least one processor to: a) receive data reflecting at least 2 patient biomarkers regarding the prostate cancer tumor selected from the group consisting of: T category, ACTL6B methylation, TCERGL1 methylation, chr7:61 Mbp inter-chromosomal translocation, ATM single nucleotide variants and MYC copy number aberrations; b) compare said patient biomarkers to corresponding reference or control biomarkers; and c) determining, at the at least one processor, the likelihood of disease progression; wherein a likelihood of disease progression is higher with each of ACTL6B hyper-methylation, TCERGL1 hypo-methylation, higher T category, and higher incidences of chr7:61 Mbp inter-chromosomal translocation, ATM single nucleotide variants and MYC copy number aberrations, when the difference is statistically significant on comparison with the reference or control biomarkers.
Other aspects and features of the present disclosure will become apparent to those ordinarily skilled in the art upon review of the following description of specific embodiments in conjunction with the accompanying figures.
Embodiments of the present disclosure will now be described, by way of example only, with reference to the attached Figures.
Herein we report the largest prostate cancer whole-genome sequencing cohort to date: 200 non-indolent localized specimens. We provide saturating discovery of recurrent driver single nucleotide variants (SNVs), copy number aberrations (CNAs) and genomic rearrangements (GRs) in this ethnic and clinical group, and associate these with epigenomic profiles. Many well-characterized recurrent molecular aberrations are confirmed, and novel prognostic translocations, inversions and epigenetic events are identified.
Specifically, we analyzed 200 whole-genome sequences and 477 whole-exome sequences from localized, non-indolent prostate tumours. These were supplemented with RNA and methylation analyses in a subset of cases. All tumours had similar pre-clinical risk profiles, reflecting the most common disease state on initial clinical presentation. These tumours have a paucity of clinically-actionable SNVs, unlike those seen in metastatic disease. Rather, a significant proportion of tumours harbour recurrent non-coding aberrations, large-scale genomic rearrangements, and a novel mode whereby an inversion represses transcription within its boundaries. Local hypermutation events (kataegis and chromothripsis) were frequent, and correlated with specific genomic profiles. Numerous molecular aberrations were prognostic for disease recurrence, including several DNA methylation events. These outperformed well-described prognostic biomarkers like MYC amplification, NKX3-1 and PTEN deletion, and percentage genome alteration. Our data suggest that novel therapeutic approaches should focus on recurrent targets in localized prostate cancer to improve cures in aggressive localized disease.
From these analyses a multimodal signature for prostate cancer was developed.
In a general aspect, there is provided a method of prognosing and/or predicting disease progression in subject with prostate cancer. The method comprises use of at least 2 patient biomarkers determined or measured from genetic material of cancer cells and comparing them to corresponding reference or control measures of the same biomarkers. The biomarkers are selected from T category, and aberrations in ACTL6B, TCERGL1, chr7:61 Mbp, ATM and MYC. Statistically significant aberrations of the subject biomarkers when compared to the reference biomarkers would be indicative of a worse outcome.
The methods described herein are useful for prognosing the outcome of a subject that has, or has had, a cancer associated with the prostate. The cancer may be prostate cancer or a cancer that has metastasized from a cancer of the prostate.
In an aspect, there is provided a method of prognosing and/or predicting disease progression in subject with prostate cancer, the method comprising: a) providing a sample containing genetic material from cancer cells; b)determining or measuring at least 2 patient biomarkers regarding the prostate cancer tumor selected from the group consisting of: T category, ACTL6B methylation, TCERGL1 methylation, chr7:61 Mbp inter-chromosomal translocation, ATM single nucleotide variants and MYC copy number aberrations; c) comparing said patient biomarkers to corresponding reference or control biomarkers; and d) determining the likelihood of disease progression; wherein a likelihood of disease progression is higher with each of ACTL6B hyper-methylation, TCERGL1 hypo-methylation, higher T category, and higher incidences of chr7:61 Mbp inter-chromosomal translocation, ATM single nucleotide variants and MYC copy number aberrations, when the difference is statistically significant on comparison with the reference or control biomarkers.
The term “subject” as used herein refers to any member of the animal kingdom, preferably a human being and most preferably a human being that has, has had, or is suspected of having prostate cancer.
The term “sample” as used herein refers to any fluid (e.g. blood, urine, semen), cell, tumor or tissue sample from a subject which can be assayed for the biomarkers described herein.
The term “genetic material” used herein refers to materials found/originate in the nucleus, mitochondria and cytoplasm, which play a fundamental role in determining the structure and nature of cell substances, and capable of self-propagating and variation. In the context of the present methods, the genetic material is any material from which one can measure the biomakers described herein. The genetic material is preferably DNA.
A “genetic aberration” is any change in genetic material that is unusual or uncommon when compared to wild-type or control genetic material. Genetic aberrations include deletions, substitutions, insertions, SNVs, translocations, hyper or hypo-methylation, copy number abberations and any other genetic mutations.
The term “prognosis” as used herein refers to the prediction of a clinical outcome associated with a disease subtype which is reflected by a reference profile such as a biomarker reference profile. The prognosis provides an indication of disease progression and includes an indication of likelihood of death due to cancer. The prognosis may be a prediction of metastasis, or alternatively disease recurrence. In one embodiment the clinical outcome class includes a better survival group and a worse survival group. The term “prognosing or classifying” as used herein means predicting or identifying the clinical outcome of a subject according to the subject's similarity to a reference profile or biomarker associated with the prognosis. For example, prognosing or classifying comprises a method or process of determining whether an individual has a better or worse survival outcome, or grouping individuals into a better survival group or a worse survival group, or predicting whether or not an individual will respond to therapy.
In various embodiments, the at least 2 patient biomarkers, are at least 3, 4, 5 or 6 patient biomarkers.
In some embodiments, the prostate cancer is localized prostate cancer, preferably non-indolent localized prostate cancer.
In some embodiments, the method further comprises building a patient biomarker profile from the determined or measured patient biomarkers.
The term “biomarker profile” as used herein refers to a dataset representing the state or expression level(s) of one or more biomarkers. A biomarker profile may represent one subject, or alternatively a consolidated dataset of a cohort of subjects, for example to establish a reference biomarker profile as a control.
As used herein, the term “control” refers to a specific value or dataset that can be used to prognose or classify the value e.g the measured biomarker or reference biomarker profile obtained from the test sample associated with an outcome. In one embodiment, a dataset may be obtained from samples from a group of subjects known to have cancer having different tumor states and/or healthy individuals. The state or expression data of the biomarkers in the dataset can be used to create a control value that is used in testing samples from new patients. In some embodiments, a cohort of subjects is used to obtain a control dataset. A control cohort patients may be a group of individuals with or without cancer.
In some embodiments, the prediction of disease progression is following at least one of surgery, endocrine therapy, chemotherapy, radiotherapy, hormone therapy, gene therapy, thermal therapy, and ultrasound therapy.
In some embodiments, the method further comprises classifying the patient into a high risk group if the likelihood of disease progression is relatively high or a low risk group if the likelihood of disease progression is relatively low.
As used herein, “overall survival” refers to the percentage of or length of time that people in a study or treatment group are still alive following from either the date of diagnosis or the start of treatment for a disease, such as cancer. In a clinical trial, measuring the overall survival is one way to see how well a new treatment works.
As used herein, “relapse-free survival” refers to, in the case of caner, the percentage of or length of time that people in a study or treatment group survive without any signs or symptoms of that cancer after primary treatment for that cancer. In a clinical trial, measuring the relapse-free survival is one way to see how well a new treatment works. It is defined as any disease recurrence or relapse (local, regional, or distant).
The term “good survival” or “better survival” as used herein refers to an increased chance of survival as compared to patients in the “poor survival” group. For example, the biomarkers of the application can prognose or classify patients into a “good survival group”. These patients are at a lower risk of death after surgery and can also be categorized into a “low-risk group”.
The term “poor survival” or “worse survival” as used herein refers to an increased risk of disease progression or death as compared to patients in the “good survival” group. For example, biomarkers or genes of the application can prognose or classify patients into a “poor survival group”. These patients are at greater risk of death or adverse reaction from disease or surgery, treatment for the disease or other causes, and can also be categorized into a “high-risk group”.
A person skilled in the art would understand how to implement differing cut-offs for good survival vs. worse survival, depending on the clinical outcome one is predicting and the biomarkers being assayed.
In some embodiments, the method further comprises treating the patient with more aggressive therapy if the patient is in the high risk group.
In some embodiments, the more aggressive therapy comprises adjuvant therapy, preferably hormone therapy, chemotherapy or radiotherapy.
The present system and method may be practiced in various embodiments. A suitably configured computer device, and associated communications networks, devices, software and firmware may provide a platform for enabling one or more embodiments as described above. By way of example,
The present system and method may be practiced on virtually any manner of computer device including a desktop computer, laptop computer, tablet computer or wireless handheld. The present system and method may also be implemented as a computer-readable/useable medium that includes computer program code to enable one or more computer devices to implement each of the various process steps in a method in accordance with the present invention. In case of more than computer devices performing the entire operation, the computer devices are networked to distribute the various steps of the operation. It is understood that the terms computer-readable medium or computer useable medium comprises one or more of any type of physical embodiment of the program code. In particular, the computer-readable/useable medium can comprise program code embodied on one or more portable storage articles of manufacture (e.g. an optical disc, a magnetic disk, a tape, etc.), on one or more data storage portioned of a computing device, such as memory associated with a computer and/or a storage system.
In an aspect, there is provided a computer-implemented method of predicting disease progression in patient with prostate cancer, the method comprising: a) receiving, at at least one processor, data reflecting at least 2 patient biomarkers regarding the prostate cancer tumor selected from the group consisting of: T category, ACTL6B methylation, TCERGL1 methylation, chr7:61 Mbp inter-chromosomal translocation, ATM single nucleotide variants and MYC copy number aberrations; b) constructing, at the at least one processor, an expression profile corresponding to the expression levels; c) comparing, at the at least one processor, said patient biomarkers to corresponding reference or control biomarkers; d) determining, at the at least one processor, the likelihood of disease progression; wherein a likelihood of disease progression is higher with each of ACTL6B hyper-methylation, TCERGL1 hypo-methylation, higher T category, and higher incidences of chr7:61 Mbp inter-chromosomal translocation, ATM single nucleotide variants and MYC copy number aberrations, when the difference is statistically significant on comparison with the reference or control biomarkers.
In an aspect, there is provided a computer program product for use in conjunction with a general-purpose computer having a processor and a memory connected to the processor, the computer program product comprising a computer readable storage medium having a computer mechanism encoded thereon, wherein the computer program mechanism may be loaded into the memory of the computer and cause the computer to carry out the method described herein.
In an aspect, there is provided a computer readable medium having stored thereon a data structure for storing the computer program product described herein.
In an aspect, there is provided a device for predicting disease progression in patient with prostate cancer, the device comprising: at least one processor; and electronic memory in communication with the at one processor, the electronic memory storing processor-executable code that, when executed at the at least one processor, causes the at least one processor to: a) receive data reflecting at least 2 patient biomarkers regarding the prostate cancer tumor selected from the group consisting of: T category, ACTL6B methylation, TCERGL1 methylation, chr7:61 Mbp inter-chromosomal translocation, ATM single nucleotide variants and MYC copy number aberrations; b) compare said patient biomarkers to corresponding reference or control biomarkers; and c) determining, at the at least one processor, the likelihood of disease progression; wherein a likelihood of disease progression is higher with each of ACTL6B hyper-methylation, TCERGL1 hypo-methylation, higher T category, and higher incidences of chr7:61 Mbp inter-chromosomal translocation, ATM single nucleotide variants and MYC copy number aberrations, when the difference is statistically significant on comparison with the reference or control biomarkers.
As used herein, “processor” may be any type of processor, such as, for example, any type of general-purpose microprocessor or microcontroller (e.g., an Intel™ x86, PowerPC™, ARM™ processor, or the like), a digital signal processing (DSP) processor, an integrated circuit, a field programmable gate array (FPGA), or any combination thereof.
As used herein “memory” may include a suitable combination of any type of computer memory that is located either internally or externally such as, for example, random-access memory (RAM), read-only memory (ROM), compact disc read-only memory (CDROM), electro-optical memory, magneto-optical memory, erasable programmable read-only memory (EPROM), and electrically-erasable programmable read-only memory (EEPROM), or the like. Portions of memory 102 may be organized using a conventional filesystem, controlled and administered by an operating system governing overall operation of a device.
As used herein, “computer readable storage medium” (also referred to as a machine-readable medium, a processor-readable medium, or a computer usable medium having a computer-readable program code embodied therein) is a medium capable of storing data in a format readable by a computer or machine. The machine-readable medium can be any suitable tangible, non-transitory medium, including magnetic, optical, or electrical storage medium including a diskette, compact disk read only memory (CD-ROM), memory device (volatile or non-volatile), or similar storage mechanism. The computer readable storage medium can contain various sets of instructions, code sequences, configuration information, or other data, which, when executed, cause a processor to perform steps in a method according to an embodiment of the disclosure. Those of ordinary skill in the art will appreciate that other instructions and operations necessary to implement the described implementations can also be stored on the computer readable storage medium. The instructions stored on the computer readable storage medium can be executed by a processor or other suitable processing device, and can interface with circuitry to perform the described tasks.
As used herein, “data structure” a particular way of organizing data in a computer so that it can be used efficiently. Data structures can implement one or more particular abstract data types (ADT), which specify the operations that can be performed on a data structure and the computational complexity of those operations. In comparison, a data structure is a concrete implementation of the specification provided by an ADT.
The above listed aspects and/or embodiments may be combined in various combinations as appreciated by a person of skill in the art. The advantages of the present disclosure are further illustrated by the following examples. The examples and their particular details set forth herein are presented for illustration only and should not be construed as a limitation on the claims of the present invention.
To address the genetic heterogeneity of non-indolent localized disease, we first comprehensively profiled CNAs in 284 localized prostate adenocarcinomas (
We next performed high-depth whole-genome sequencing (WGS) of 130 of these tumours (and matched bloods), focusing on localized tumours amenable to surgery (i.e. Gleason Score (GS) of 3+3, 3+4 or 4+3). These were supplemented by 70 tumour/normal pairs with publicly-available read-level WGS data9-12 and 277 read-level exome sequences9,10,12,13, all with similar GSs. WGS data covered 84.2±2.5% (mean±standard-deviation) of the non-repetitive genome to at least 17x tumour and 10x normal (67.1-85.7%), allowing robust analysis of the entire genome. All samples were aligned and profiled for SNVs and GRs, using well characterized and validated pipelines (14 and Lee et al. in review;
We observed a low overall SNV burden, with a median 0.53 (0.05-6.92) somatic SNVs/Mbp across all tumours (
Individual tumours harboured 0-98 exomic SNVs (
We next explored the non-coding regions of the genome in the 200 tumours with WGS. Multiple recurrent ncSNVs (i.e. identical genomic position) were detected: 7 ncSNVs were observed in at least 7/200 patients, while another 63 were mutated in 4-6 patients. These SNVs are thus altered at a similar ˜2-4% mutation rate as TP53, MED12 and FOXA1 (
Next we sought to assess mutational signatures by considering the trinucleotide profiles of somatic SNVs using non-negative matrix factorization18. Three distinct trinucleotide signatures were identified from WGS data (
GRs have been poorly-studied in localized prostate cancer; however they may provide evidence for DNA double-stranded break events during progression. As expected, the TMPRSS2:ERG (T2E) fusion on chromosome 21 was the most recurrent GR, observed in 38% of tumours (76/200;
To further understand regional GR effects, we then divided the genome into 1 Mbp bins and considered the frequency of GRs in each (
While some tumours are initiated or driven by recurrent point mutations in specific genes, others could be driven by focal genomic instability either at the level of DNA double-stranded breaks (i.e. chromothripsis22) or DNA single-stranded breaks (i.e. kataegis18). Using ShatterProof23 we detected chromothriptic characteristics in 20% (38/186) of tumours with CNA data (
To quantify the presence of kataegis, we developed a sliding-window approach using the binomial test, a test for base change enrichment and an assessment of the expected proportion of variants within a given window. We detected kataegis in 46/200 samples (23%;
To better characterize the suite of recurrent events in localized prostate cancer, we next evaluated the association of each of these with patient survival. Of our patients with whole-genome sequencing, 130/200 had available data on disease relapse, as measured by biochemical recurrence (BCR, rise of PSA levels following primary therapy, see methods), with a median 7.96-year follow-up. We systematically evaluated the clinical relevance of 40 recurrent genomic alterations in localized prostate cancer: 3 measures of mutation density, kataegis, chromothripsis, 5 recurrent coding SNVs, 6 recurrent non-coding SNVs, 6 methylation events, 6 recurrent translocations, 4 recurrent inversions, and 8 CNAs. For each of these we employed univariate CoxPH modeling (
Remarkably, methylation status was tightly associated with patient outcome, much more so than any other genomic characteristic: of the 9 events significantly (p<0.05; Wald test) associated with disease recurrence, 6 involved DNA methylation. For example, hyper-methylation of a probe 5′ of a transcriptional elongation regulator (TCERG1L) shows a strong association with poor outcome (HR=2.90; 95% CI: 1.30-6.30; p=0.007). Fascinatingly, another probe on the 3′ end of TCERG1L showed the inverse association, with hypo-methylation associated with good outcome (HR=0.17; 95% CI: 0.06-0.49; p=9.45×10−4;
Finally, we evaluated whether these diverse events could be integrated into a multi-modal, DNA-based biomarker to predict the disease relapse. Such a biomarker would be of significant clinical value, as many of these patients are over-treated in current management. We applied multivariate CoxPH modeling using cross-validation to test the outcome of a multi-modal biomarker: T category, ACTL6B hyper-methylation, TCERGLI hypo-methylation, the chr7:61 Mbp CTX, ATM SNVs, and MYC CNA. This signature was highly discriminative of patients who would experience disease relapse, with an Area Under the ROC Curve of 0.83 (95% CI: 0.80-0.86, as compared to that of 0.61 for the validated PGA biomarker (
Localized, non-indolent prostate cancer is the most common state at initial clinical presentation. We used whole-genome sequencing to identify a series of recurrent mutational events outside of the exome. Because of the paucity of driver and prognostic coding aberrations, consideration of the entire prostate cancer genome may be critical in biomarker studies to find driver aberrations missed in smaller studies4,10. For example, we identify several inversions associated with mRNA abundance decreases, potentially representing a novel mode of tumour-suppressor inactivation.
Our data highlights the differences in mutational profile between localized intermediate risk cancers versus metastatic, castration-resistant prostate cancer (mCRPC). Nearly 50% of mCRPC tumours harbour mutations in AR, ETS genes, TP53 and PTEN and ˜20% have aberrations in DNA damage response genes (e.g. BRCA1, BRCA2, and ATM; which may portend sensitivity to PARP inhibitors26-28). Furthermore, more than 60% of mCRPC tumours contain clinically-actionable mutations that are non-AR related8. In contrast, non-SNV mutations dominate the genomics in localized non-indolent prostate cancer. No single gene was mutated at >10% frequency and the only prognostic SNV was ATM.
In the modern era of PSA screening, localized, non-indolent prostate cancer represents the vast majority of cases on initial clinical presentation. We show that localized disease represents a different biology from advanced mCRPC, which have undergone significant selective pressure, often through multiple courses of treatment29. As recurrent SNV driver aberrations are rare in localized disease, tumours requiring intensified therapy may benefit from widespread genotoxic chemotherapy as supported by clinical trials in metastatic non-castrate disease30. Similarly, the development of novel therapeutics will be improved by a robust understanding of the non-exomic drivers of aggression in localized prostate cancer.
Data Availability mRNA and methylation data is available in GEO under accession GSE84043. Raw sequencing data is available in EGA at: https://www.ebi.ac.uk/eqa/studies/EGAS00001000900. Processed variant calls have been uploaded to the ICGC Data Coordinating Centre. Baca and Barbieri WGS/WXS data is available on dbGaP under accession phs000447.v1.p1. Berger WGS data is available on dbGaP under accession phs000330.v1.p1. Weischenfeldt WGS data is available on EGA under accession EGAS00001000258. TCGA WGS/WXS data is available at Genomic Data Commons Data Portal under Project ID TCGA-PRAD.
Prostate cancer may be a C-class tumour31. To investigate this postulate, we began by generating copy-number aberration (CNA) profiles of 284 localized prostate adenocarcinomas using OncoScan SNP arrays. For patients treated with image-guided radiotherapy (IGRT; n=147), a pre-treatment ultrasound-guided biopsy of the dominant lesion was taken and flash frozen. For radical prostatectomy (surgery) patients (n=137), the dominant lesion was excised from the gross specimen and flash frozen. All frozen tissue samples used for genomic studies contained at least 70% malignant epithelia prior to manual macro-dissection. Pathological Gleason scores for each sample analyzed were based on the consensus of two urological pathologists (data not shown).
Our CNA analyses revealed a median of 34 separate CNAs encompassing 5.4% of the genome. We observed, however, dramatic inter-tumoural heterogeneity with tumours having between 0 and 267 CNAs, covering 0-39.2% of the genome (data not shown). Unsupervised analysis identified 6 CNA subtypes (
Despite this global difference in CNA-density, no single gene was mutated at statistically different frequencies amongst GS 3+3, 3+4 and 4+3 tumours (
TFBS Analyses of ncSNVs and Other Aberrations
To determine if these ncSNVs were biased to co-localizing in regulatory regions, we identified transcription-factor binding sites across the genome from the ENCODE project35 and used binomial tests to assess enrichment of mutations in transcription factor binding sites (TFBS). We only considered genomic positions that were well-covered in our sequencing study (i.e. those with ≥10x coverage in normal and ≥17x in tumours) and assessed enrichment of 58 separate ChIP-Seq datasets. TFBSs enriched for mutations at a 5% FDR level included elevated mutation rate in H3K27 trimethylation sites, H3K9 trimethylation sites and H3K4 mono- and tri-methylation sites, all in multiple cell-lines and with multiple types of aberrations (
The International Cancer Genome Consortium (ICGC) is a multi-national project aimed at comprehensively cataloging somatic mutations of at least 50 individual tumour-types by profiling the genomes of at least 500 tumours of each36. This sample-size was selected to ensure sufficient statistical power to identify mutations present in 1% or more of individual patients. The present study from the Canadian Prostate Cancer Genome Network (CPC-GENE) provides a foundational piece towards achieving this goal, focusing on the genomes of localized GS6 and GS7 prostate cancer. This analysis of almost 500 exome-sequences provides near-saturation identification of genes altered by coding point-mutations. Moreover, the assessment of almost 200 whole-genomes provides an initial interrogation of recurrent genomic rearrangements and non-coding SNVs, representing—to our knowledge—the largest such study to date.
All documents disclosed herein, including those in the following reference list, are incorporated by reference. Although preferred embodiments of the invention have been described herein, it will be understood by those skilled in the art that that the detailed description and the specific examples while indicating preferred embodiments of the invention are given by way of illustration only, since various changes and modifications within the spirit and scope of the invention will become apparent to those skilled in the art from this detailed description.
1. Lozano, R. et al. Global and regional mortality from 235 causes of death for 20 age groups in 1990 and 2010: a systematic analysis for the Global Burden of Disease Study 2010. Lancet 380, 2095-2128, doi:10.1016/S0140-6736(12)61728-0 (2012).
2. Klotz, L. et al. Long-term follow-up of a large active surveillance cohort of patients with prostate cancer. J Clin Oncol 33, 272-277, doi:10.1200/JCO.2014.55.1192 (2015).
3 D'Amico, A. V. et al. Cancer-specific mortality after surgery or radiation for patients with clinically localized prostate cancer managed during the prostate-specific antigen era. J Clin Oncol 21, 2163-2172 (2003).
4. Boutros, P. C. et al. Spatial genomic heterogeneity within localized, multifocal prostate cancer. Nature Genetics 47, 736-745, doi:10.1038/ng.3315 (2015).
5. Cooper, C. S. et al. Analysis of the genetic phylogeny of multifocal prostate cancer identifies multiple independent clonal expansions in neoplastic and morphologically normal prostate tissue. Nat. Genet. 47, 367-372, doi:10.1038/ng.3221 (2015).
6. Lalonde, E. et al. Tumour genomic and microenvironmental heterogeneity for integrated prediction of 5-year biochemical recurrence of prostate cancer: a retrospective cohort study. Lancet Oncol 15, 1521-1532, doi:10.1016/S1470-2045(14)71021-6 (2014).
7. Buyyounouski, M. K., Pickles, T., Kestin, L. L., Allison, R. & Williams, S. G. Validating the interval to biochemical failure for the identification of potentially lethal prostate cancer. J Clin Oncol 30, 1857-1863, doi:10.1200/JCO.2011.35.1924 (2012).
8. Robinson, D. et al. Integrative clinical genomics of advanced prostate cancer. Cell 161, 1215-1228, doi:10.1016/j.ce11.2015.05.001 (2015).
9. Berger, M. F. et al. The genomic complexity of primary human prostate cancer. Nature 470, 214-220, doi:10.1038/nature09744 (2011).
10. Baca, S. C. et al. Punctuated evolution of prostate cancer genomes. Cell 153, 666-677, doi:10.1016/j.ce11.2013.03.021 (2013).
11. Weischenfeldt, J. et al. Integrative genomic analyses reveal an androgen-driven somatic alteration landscape in early-onset prostate cancer. Cancer Cell 23, 159-170, doi:10.1016/j.ccr.2013.01.002 (2013).
12. Cancer Genome Atlas Research Network. The Molecular Taxonomy of Primary Prostate Cancer. Cell 163, 1011-1025, doi:10.1016/j.cell.2015.10.025 (2015).
13. Barbieri, C. E. et al. Exome sequencing identifies recurrent SPOP, FOXA1 and MED12 mutations in prostate cancer. Nat. Genet. 44, 685-689, doi:10.1038/ng.2279 (2012).
14. Ewing, A. D. et al. Combining tumor genome simulation with crowdsourcing to benchmark somatic single-nucleotide-variant detection. Nat Methods 12, 623-630, doi:10.1038/nmeth.3407 (2015).
15. Lawrence, M. S. et al. Discovery and saturation analysis of cancer genes across 21 tumour types. Nature 505, 495-501, doi:10.1038/nature12912 (2014).
16. Ciriello, G. et al. Emerging landscape of oncogenic signatures across human cancers. Nat. Genet. 45, 1127-1133, doi:10.1038/ng.2762 (2013).
17. Zhou, J. & Troyanskaya, O. G. Predicting effects of noncoding variants with deep learning-based sequence model. Nat Methods 12, 931-934, doi:10.1038/nmeth.3547 (2015).
18. Alexandrov, L. B. et al. Signatures of mutational processes in human cancer. Nature 500, 415-421, doi:10.1038/nature12477 (2013).
19. Gandhi, M., Evdokimova, V. & Nikiforov, Y. E. Frequency of close positioning of chromosomal loci detected by FRET correlates with their participation in carcinogenic rearrangements in human cells. Genes, chromosomes & cancer 51, 1037-1044, doi:10.1002/gcc.21988 (2012).
20. Nikiforova, M. N. et al. Proximity of chromosomal loci that participate in radiation-induced rearrangements in human cells. Science 290, 138-141 (2000).
21. Rickman, D. S. et al. Oncogene-mediated alterations in chromatin conformation. Proc. Natl. Acad. Sci. U.S.A. 109, 9083-9088, doi:10.1073/pnas.1112570109 (2012).
22. Korbel, J. O. & Campbell, P. J. Criteria for inference of chromothripsis in cancer genomes. Cell 152, 1226-1236, doi:10.1016/j.ce11.2013.02.023 (2013).
23. Govind, S. K. et al. ShatterProof: operational detection and quantification of chromothripsis. BMC Bioinfo. 15, 78, doi:10.1186/1471-2105-15-78 (2014).
24. Zafarana, G. et al. Copy number alterations of c-MYC and PTEN are prognostic factors for relapse after prostate cancer radiotherapy. Cancer 118, 4053-4062, doi:10.1002/cncr.26729 (2012).
25. Hieronymus, H. et al. Copy number alteration burden predicts prostate cancer relapse. Proceedings of the National Academy of Sciences of the United States of America 111, 11139-11144, doi:10.1073/pnas.1411446111 (2014).
26. Mateo, J. et al. DNA-Repair Defects and Olaparib in Metastatic Prostate Cancer. The New England journal of medicine 373, 1697-1708, doi:10.1056/NEJMoa1506859 (2015).
27. Schiewer, M. J. et al. Dual roles of PARP-1 promote cancer growth and progression. Cancer discovery 2, 1134-1149, doi:10.1158/2159-8290.CD-12-0120 (2012).
28. Feng, F. Y., de Bono, J. S., Rubin, M. A. & Knudsen, K. E. Chromatin to Clinic: The Molecular Rationale for PARP1 Inhibitor Function. Molecular cell 58, 925-934, doi:10.1016/j.molcel.2015.04.016 (2015).
29. Gundem, G. et al. The evolutionary history of lethal metastatic prostate cancer. Nature 520, 353-357, doi:10.1038/nature14347 (2015).
30. Graff, J. N. & Beer, T. M. Should docetaxel be administered earlier in prostate cancer therapy? Expert Rev Anticancer Ther 15, 977-979, doi:10.1586/14737140.2015.1074042 (2015).
31. Ciriello, G. et al. Emerging landscape of oncogenic signatures across human cancers. Nat. Genet. 45, 1127-1133, doi:10.1038/ng.2762 (2013).
32. Taylor, B. S. et al. Integrative genomic profiling of human prostate cancer. Cancer Cell 18, 11-22, doi:10.1016/j.ccr.2010.05.026 (2010).
33. Lalonde, E. et al. Tumour genomic and microenvironmental heterogeneity for integrated prediction of 5-year biochemical recurrence of prostate cancer: a retrospective cohort study. Lancet Oncol 15, 1521-1532, doi:10.1016/S1470-2045(14)71021-6 (2014).
34. Hieronymus, H. et al. Copy number alteration burden predicts prostate cancer relapse. Proceedings of the National Academy of Sciences of the United States of America 111, 11139-11144, doi:10.1073/pnas.1411446111 (2014).
35. Encode Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57-74, doi:10.1038/nature11247 (2012).
36. Hudson, T. J. et al. International network of cancer genome projects. Nature 464, 993-998, doi:nature08987 [pii] 10.1038/nature08987 (2010).
37. Van Loo, P. et al. Analyzing cancer samples with SNP arrays. Methods in molecular biology 802, 57-72, doi:10.1007/978-1-61779-400-1_4 (2012).
38. Song, S. et al. qpure: A tool to estimate tumor cellularity from genome-wide single-nucleotide polymorphism profiles. PLoS One 7, e45835, doi:10.1371/journal.pone.0045835 (2012).
39. Alexandrov, L. B. et al. Signatures of mutational processes in human cancer. Nature 500, 415-421, doi:10.1038/nature12477 (2013).
This application claims the benefit of priority of U.S. Provisional Patent Application No. 62/290,246 filed Feb. 2, 2016 and incorporated herein by reference in its entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CA2017/000023 | 2/2/2017 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62290246 | Feb 2016 | US |