The present invention provides methods and compositions directed to identification of genetic markers associated with prostate cancer.
Prostate cancer (PCa) is the most common cancer among men in the United States. Approximately 242,000 men are expected to be diagnosed with PCa in 2012. The lifetime probability of developing prostate cancer for men is 1 in 6 in the United States (US), the highest in comparison to other cancers. Although many PCa patients have an indolent (non-aggressive) form of the disease that may not even require treatment, a large number of men (˜28,000) die from this disease annually in the US alone. The inability to reliably distinguish between these two forms of the disease, especially at early stages, has resulted in over-treatment of many and under treatment of some. Therefore, it is critical to identify markers that can distinguish these two types of PCa patients at the time of diagnosis, as well as genes that drive cancer progression. Early identification of patients at either high- or low-risk for lethal disease has important clinical and public health implications. Such identification would enable clinically meaningful treatments at potentially curable stages for high-risk patients, while reducing unnecessary treatments for low-risk patients, thus reducing mortality and improving quality of life.
Clinicopathologic parameters can predict PCa biochemical recurrence, which while prognostic, is an imperfect surrogate of PCa-mortality. In addition, a high Gleason score, advanced tumor stage, and short PSA doubling time have been shown to be predictive of death from PCa. While these parameters are useful for the identification of patients at high risk of dying from PCa, they have limited utility in predicting mortality in patients with early stage disease when therapy is likely to be more effective. During the evolution of human cancers, genetic or molecular alterations that promote tumorigenesis precede traditional clinicopathologic changes that are associated with more aggressive disease. Thus, the elucidation of molecular makers that correlate with PCa-specific death may help identify a subset of patients with early stage but particularly aggressive cancers. Such patients would be candidates for early and perhaps more intense therapy. Moreover, the identification of biomarkers linked to PCa prognosis will also shed light on the mechanisms that drive the malignant phenotype that underlies PCa-mortality.
Although clinicopathologic parameters are commonly used predictors of outcome, they are insufficient for identification of potentially life-threatening forms of PCa prior to the development of advanced pathological phenotypes. Results from many studies suggest that most men with PSA-detected PCa have disease that will not progress to threaten their life. Extensive efforts have been made by many groups to improve upon the ability of current clinicopathologic parameters to predict aggressive forms of PCa using markers in somatic tissues. Although many have shown promise, none have been sufficiently validated and/or robust to justify their clinical application.
The present invention overcomes previous shortcomings in the art by identifying significant statistical associations between genetic markers and prostate cancer. Thus, the present invention provides methods and compositions for identifying a subject at increased risk of developing aggressive prostate cancer by detecting the genetic markers of this invention in the subject.
In one aspect, the present invention provides a method of identifying a human subject as having an increased risk of having or developing aggressive prostate cancer, comprising detecting in a nucleic acid sample from the subject 1) a deletion at 1q42.2 from chr1:229894700-230947362 bp, 2) a deletion at 242.1 from chr2:139707778-140858852 bp, 3) a deletion at 1143 from chr11:113321588-113946501 bp, 4) an amplification at 141.3 from chr1:152725557-153275233 bp, or 5) any combination of 1-4 above, wherein the detection of same identifies the subject as having an increased risk of having or developing aggressive prostate cancer.
In an additional aspect, the present invention provides a method of identifying a human subject as having an increased likelihood of having or developing prostate cancer, comprising detecting in a nucleic acid sample from the subject 1) a deletion at 1q42.2 from chr1:229894700-230947362 bp, 2) a deletion at 242.1 from chr2:139707778-140858852 bp, 3) a deletion at 11q23 from chr11:113321588-113946501 bp, 4) an amplification at 1q21.3 from chr1:152725557-153275233 bp, or 5) any combination of 1-4 above, wherein the detection of same identifies the subject as having an increased likelihood of having or developing prostate cancer.
A further aspect of the present invention is a method of identifying a human subject as having an increased risk of having or developing aggressive prostate cancer, comprising detecting in a nucleic acid sample from the subject 1) a copy number alteration at 8q24.21 from chr8:128095593-129190507 bp (based on the physical position of the updated UCSC Genome Browser on Human Mar. 2006 (NCBI36/hg18) or from the corresponding physical positions defined in other versions or forms of the human genome browsers), 2) a copy number alteration at 1q21.3 from chr1:152725557-153275233 bp, 3) a copy number alteration at 1q21.33-22.1 from chr18:58288577-60834535 bp, 4) a copy number alteration at 8q21.13 from chr8:81128386-81867950 bp, 5) a copy number alteration at16q24.1 from chr16:82877051-83540927 bp, 6) a copy number alteration at10q23.31 from chr10:89613175-89888562 bp, 7) a copy number alteration at 17p13.1 from chr17:7501561-7781403 bp, and 8) any combination thereof, wherein the detection of same identifies the subject as having an increased risk of having) or developing aggressive prostate cancer.
An additional aspect of this invention is a method of identifying a human subject as having an increased likelihood of prostate cancer-specific death, comprising detecting in a nucleic acid sample from the subject 1) a copy number alteration at 8q24.21 from chr8:128095593-129190507 bp, 2) a copy number alteration at 1q21.3 from chr1:152725557-153275233 bp, 3) a copy number alteration at 18q21.33-22.1 from chr18:58288577-60834535 bp, 4) a copy number alteration at 8q21.13 from chr8:81128386-81867950 bp, 5) a copy number alteration at16q24.1 from chr16:82877051-83540927 bp, 6) a copy number alteration at10q23.31 from chr10:89613175-89888562 bp, 7) a copy number alteration at 17p13.1 from chr17:7501561-7781403 bp, and 8) any combination thereof, wherein the detection of same identifies the subject as having an increased likelihood of prostate cancer-specific death.
In yet further aspects, the present invention provides a method of identifying a human subject as having an increased risk of developing aggressive prostate cancer, comprising detecting in a nucleic acid sample from the subject 1) a copy number alteration in the MYC gene, 2) a copy number alteration in the ADAR gene, 3) a copy number alteration in the SERPIN5 gene, 4) a copy number alteration in the TPD52 gene, 5) a copy number alteration in the USP10 gene, 6) a copy number alteration in the PTEN gene, 7) a copy number alteration the TP53 gene, and 8) any combination thereof, wherein the detection of same identifies the subject as having an increased risk of developing aggressive prostate cancer.
As an additional aspect, the present invention provides a method of identifying a human subject as having an increased likelihood of prostate cancer-specific death, comprising detecting in a nucleic acid sample from the subject 1) a copy number alteration in the MYC gene, 2) a copy number alteration in the ADAR gene, 3) a copy number alteration in the SERPIN5 gene, 4) a copy number alteration in the TPD52 gene, 5) a copy number alteration in the USP10 gene, 6) a copy number alteration in the PTEN gene, 7) a copy number alteration the TP53 gene, and 8) any combination thereof, wherein the detection of same identifies the subject as having an increased likelihood of prostate cancer-specific death.
In addition, the present invention provides a method of identifying a human subject as having an increased risk of developing aggressive prostate cancer, comprising detecting in a nucleic acid sample from the subject a deletion in the gene PTEN and amplification of the gene MYC.
The present invention also provides a method of identifying a human subject as having an increased likelihood of prostate cancer-specific death, comprising detecting in a nucleic acid sample from the subject a deletion in the gene PTEN and amplification of the gene MYC.
Furthermore, the present invention provides a kit containing probes and other reagents for detecting a genetic marker (e.g., copy number alteration) of this invention.
Additionally provide herein is a computer-assisted method of identifying a proposed treatment and/or management for aggressive prostate cancer as an effective and/or appropriate treatment and/or management for a subject carrying a genetic marker correlated with aggressive prostate cancer, comprising the steps of (a) storing a database of biological data for a plurality of subjects, the biological data that is being stored including for each of said plurality of subjects: (i) a treatment type, (ii) at least one genetic marker associated with aggressive prostate cancer, and (iii) at least one disease progression measure for prostate cancer from which treatment efficacy can be determined; and then (b) querying the database to determine the dependence on said genetic marker of the effectiveness of a treatment type in treating prostate cancer, thereby identifying a proposed treatment as an effective and/or appropriate treatment for a subject carrying a genetic marker correlated with prostate cancer.
The present invention is explained in greater detail below. This description is not intended to be a detailed catalog of all the different ways in which the invention may be implemented, or all the features that may be added to the instant invention. For example, features illustrated with respect to one embodiment may be incorporated into other embodiments, and features illustrated with respect to a particular embodiment may be deleted from that embodiment. In addition, numerous variations and additions to the various embodiments suggested herein will be apparent to those skilled in the art in light of the instant disclosure, which do not depart from the instant invention. Hence, the following specification is intended to illustrate some particular embodiments of the invention, and not to exhaustively specify all permutations, combinations and variations thereof.
The present invention is based on the unexpected discovery of genetic markers that are statistically associated with an increased risk of developing aggressive prostate cancer and an increased likelihood of prostate cancer-specific death. There are numerous benefits to carrying out the methods of this invention to identify a subject having an increased risk of developing aggressive prostate cancer, including but not limited to, identifying subjects who are good candidates for prophylactic and/or therapeutic treatment, and screening for cancer at an earlier time or more frequently than might otherwise be indicated, to increase the chances of early detection of an aggressive prostate cancer and reduce the incidence of prostate cancer-specific death.
Thus, in one aspect, the present invention provides a method of identifying a human subject as having an increased risk of having or developing aggressive prostate cancer, comprising detecting in a nucleic acid sample from the subject 1) a deletion at 1q42.2 from chr1:229894700-230947362 bp, 2) a deletion at 2q22.1 from chr2:139707778-140858852 bp, 3) a deletion at 11q23 from chr11:113321588-113946501 bp, 4) an amplification at 1q21.3 from chr1:152725557-153275233 bp, or 5) any combination of 1-4 above, wherein the detection of same identifies the subject as having an increased risk of having or developing aggressive prostate cancer.
In an additional aspect, the present invention provides a method of identifying a human subject as having an increased likelihood of prostate cancer-specific death, comprising detecting in a nucleic acid sample from the subject 1) a deletion at 1q42.2 from chr1:229894700-230947362 bp, 2) a deletion at 2q22.1 from chr2:139707778-140858852 bp, 3) a deletion at 11q23 from chr1:113321588-113946501 bp, 4) an amplification at 1q21.3 from chr1:152725557-153275233 bp, or 5) any combination of 1-4 above, wherein the detection of same identifies the subject as having an increased likelihood of prostate cancer-specific death.
A further aspect of the present invention is a method of identifying a human subject as having an increased risk of developing aggressive prostate cancer, comprising detecting in a nucleic acid sample from the subject 1) a copy number alteration at 8q24.21 from chr8:128095593-129190507 bp, 2) a copy number alteration at 1q21.3 from chr1:152725557-153275233 bp, 3) a copy number alteration at 18q21.33-22.1 from chr18:58288577-60834535 bp, 4) a copy number alteration at 8q21.13 from chr8:81128386-81867950 bp, 5) a copy number alteration at16q24.1 from chr16:82877051-83540927 bp, 6) a copy number alteration at10q23.31 from chr10:89613175-89888562 bp, 7) a copy number alteration at 17p13.1 from chr17:7501561-7781403 bp, and 8) any combination thereof, wherein the detection of same identifies the subject as having an increased risk of developing aggressive prostate cancer. An additional aspect of this invention is a method of identifying a human subject as having an increased likelihood of prostate cancer-specific death, comprising detecting in a nucleic acid sample from the subject 1) a copy number alteration at 8q24.21 from chr8:128095593-129190507 bp, 2) a copy number alteration at 1q21.3 from chr1:152725557-153275233 bp, 3) a copy number alteration at 18q21.33-22.1 from chr18:58288577-60834535 bp, 4) a copy number alteration at 8q21.13 from chr8:81128386-81867950 bp, 5) a copy number alteration at16q24.1 from chr16:82877051-83540927 bp, 6) a copy number alteration at10q23.31 from chr10:89613175-89888562 bp, 7) a copy number alteration at 17p13.1 from chr17:7501561-7781403 bp, and 8) any combination thereof, wherein the detection of same identifies the subject as having an increased likelihood of prostate cancer-specific death.
In yet further aspects, the present invention provides a method of identifying a human subject as having an increased risk of developing aggressive prostate cancer, comprising detecting in a nucleic acid sample from the subject 1) a copy number alteration in the MYC gene, 2) a copy number alteration in the ADAR gene, 3) a copy number alteration in the SERPIN5 gene, 4) a copy number alteration in the TPD52 gene, 5) a copy number alteration in the USP10 gene, 6) a copy number alteration in the PTEN gene, 7) a copy number alteration the TP53 gene, and 8) any combination thereof, wherein the detection of same identifies the subject as having an increased risk of developing aggressive prostate cancer.
As an additional aspect, the present invention provides a method of identifying a human) subject as having an increased likelihood of prostate cancer-specific death, comprising detecting in a nucleic acid sample from the subject 1) a copy number alteration in the MYC gene, 2) a copy number alteration in the ADAR gene, 3) a copy number alteration in the SERPIN5 gene, 4) a copy number alteration in the TPD52 gene, 5) a copy number alteration in the USP10 gene, 6) a copy number alteration in the PTEN gene, 7) a copy number alteration the TP53 gene, and 8) any combination thereof, wherein the detection of same identifies the subject as having an increased likelihood of prostate cancer-specific death.
In addition, the present invention provides a method of identifying a human subject as having an increased risk of developing aggressive prostate cancer, comprising detecting in a nucleic acid sample from the subject a deletion in the gene PTEN and amplification of the gene MYC.
Also provided in the present invention is a method of identifying a human subject as having an increased likelihood of prostate cancer-specific death, comprising detecting in a nucleic acid sample from the subject a deletion in the gene PTEN and amplification of the gene MYC.
Regarding a deletion in the gene PTEN, both hemizygous deletion (one of the two alleles or copies was deleted, about 27% of PCa patients) and homozygous deletion (both alleles/copies were deleted, about 13% of PCa patients) were observed in the tumors of the patients with prostate cancer. About 90% of hemizygous deletions affected the whole gene of PTEN, with about 10% affecting only a part of PTEN. About 60% of the homozygous deletions affected the whole gene of PTEN, while 40% of homozygous deletions affected only a part the gene.
When a subject is identified as having an increased risk of developing aggressive prostate cancer, various steps can be taken. For example, the methods of this invention could be used for each PCa-positive biopsy core to determine whether it contains the seven CNAs described herein that are associated with lethal PCa. If any core from a subject has the CNA signature at any or combination of these seven genes, the subject will be more likely to have a poor outcome. Therefore, a physician may choose to treat the subject aggressively at critical times using surgery, radiation, hormonal therapy and/or chemotherapy. If a subject does have CNAs of these seven genes, a physician may manage the disease through active surveillance. If a PTEN deletion is detected in the tumor of a subject, a physician may add PI3K pathway inhibitors as part of the treatment strategy (e.g., PI3K-Akt-mTOR pathway treatment to target PTEN deletion). If the) subject harbors a TP53 deletion in the tumor, a physician may choose gene therapy to restore p53, and/or another drug or drugs to activate the p53 pathway.
Thus, as a nonlimiting example, the methods of this invention can be used to guide a subject's prostate cancer treatment regimen, comprising carrying out any of the methods of this invention and guiding the subject's treatment regimen such that detection of the CNA signature at any or a combination of the genes in the seven genomic regions associated with lethal prostate cancer described herein in a subject with prostate cancer leads to more active surveillance and/or more aggressive treatment and/or management of the subject than would be implemented for a subject with prostate cancer in whom none of these markers were detected, including surgery, radiation therapy, hormone therapy and/or chemotherapy as well as more frequent timing and duration of such therapies, and no detection of these genetic markers in a subject with prostate cancer leads to standard treatment and/or routine monitoring as are well known in the art.
As another nonlimiting example, the methods of this invention can be used to guide a physician's actions with regard to a subject in whom prostate cancer has not been diagnosed or detected, comprising carrying out any of the methods of this invention and guiding the physician's actions such that detection of the CNA signature at any or a combination of the genes in the seven genomic regions described herein in a subject without prostate cancer leads to more active surveillance and/or more aggressive prophylactic treatment of the subject than would be implemented for a subject without prostate cancer in whom none of these markers were detected, including surgery, radiation therapy, hormone therapy and/or chemotherapy as well as more frequent timing and duration of such therapies, and no detection of these genetic markers in a subject without prostate cancer leads to standard prophylactic treatment and/or routine monitoring as are well known in the art.
The present invention relates to a set of genomic regions with DNA copy number alterations or abnormalities (CNAs) for identifying aggressive prostate cancers (PCa) leading to cancer specific mortality, a set of genomic regions in which CNAs are not or rarely observed in cancer cells for using as internal references for calculating and defining CNAs and methods of using these described genomic regions for identifying aggressive PCa imposing higher risk to the patients dying from this disease at early stage.
In the present invention, the identification of somatic DNA CNAs in the tumor genome that predict for PCa-specific death after prostatectomy for clinically localized disease is described. Using a retrospective study consisting of four cohorts of PCa patients with distinct clinicopathologic profiles from different geographical locations, the identification and validation of CNAs that are significantly associated with PCa-mortality is demonstrated, with some of them being independent of Gleason grade, pathological stage, and pre-operative PSA levels. Furthermore, 69 genomic regions in which CNAs are not or rarely observed in the tumor cells are defined for use as references in testing CNAs in PCa via various methods. Methods of using these described genomic regions for identifying aggressive PCa imposing higher risk to the patients dying from this disease are also included herein.
A set of genomic regions with DNA copy number alterations (CNAs) for identifying significant targets in prostate cancer according to embodiments of present invention may include deletions (Table 7 and Table 8) or amplifications (Table 9 and Table 10) of one or more genes, with the number of regions and genes dependent upon the criteria q-value and join segment size.
Identification of the chromosome regions described herein is based on the UCSC Genome Browser on Human Mar. 2006 (NCBI36/hg18) Assembly.
In some embodiments, a q-value of 0.25 and a joint segment size of 60 probes can be used for selection of significant cancer targets among the CNAs. In some embodiments, a q-value of 0.01 and a joint segment size of 80 probes can be used for selection of significant cancer targets among the CNAs. Using the SNP arrays and GISTIC algorithm with a q-value of 0.01 to analyze 125 primary tumors from the JHH discovery cohort, the 20 most significant CNAs along with the most commonly gained or deleted gene(s) within each region are identified (
The locations of the significant targets are defined by the cytobands, while the size of each region may be defined by the wide peak boundaries. As the number of genes in each of the significant regions varies, the CNAs may be named by the known or suspected tumor suppressor or oncogene within the altered sequences or by the first gene listed by GISTIC in the region.
A set of genomic regions with DNA CNAs for identifying the aggressive PCa leading to higher risk of cancer mortality according to some embodiments of the present invention may includes one or more genes; with a P value<0.05 resulted from a univariate analysis (Table 2). These include regions at 8q24.21 from chr8:128095593-129190507 base pair (bp), 1q21.3 from chr1:152725557-153275233 bp, 18q21.33-22.1 from chr18:58288577-60834535 bp, 8q21.13 D from chr8:81128386-81867950 bp, 16q24.1 from chr16:82877051-83540927 bp, 10q23.31 from chr10:89613175-89888562 bp, 17p13.1 from chr17:7501561-7781403 bp, including but not limited to the genes MYC, ADAR, SERPINB5, TPD52, USP10, PTEN, and TP53.
A set of genomic regions with DNA CNAs contributing additional prognostic mortality-information independent of that provided by pathologic stage, Gleason score, and initial PSA level, according to some embodiments of the present invention may be determined by multivariate analysis and therefore, includes deletion of the sequences at 10q23.31 and/or amplification of the sequences at 8q24.21, represented by the genes PTEN and MYC, respectively (Table 3).
A joint effect of alterations at PTEN and MYC on PCa-specific mortality may be explored to identify patients who may have even higher risk of having aggressive PCa leading to cancer specific death (Table 4).
A set of genomic regions in which CNAs are not or rarely observed in the tumor cells for use as references for calculating and defining CNAs according to embodiments of the present invention may include one or more or any combination of the sequences described in Table 11.
The methods for detecting DNA CNAs in identification of patients with aggressive PCa leading to high risk of cancer specific death may include comparative genomic hybridization or the same, such as metaphase (or conventional) and BAC/oligo/cDNA/single nucleotide polymorphic (SNP; or array-based) hybridization with various resolutions. It is preferable to use fluorescent in situ hybridization for detection of CNAs in clinical settings for identification of patients with aggressive PCa leading to high risk of dying. It is even more preferable to use a PCR based method, including but not limited to quantitative PCR and multiplex ligation-dependent probe amplification, for analyzing CNAs at the specific regions depicted in the present invention.
The sources of DNA for detecting CNAs in identification of patients with aggressive PCa leading to high risk of cancer specific death may include biological fluids including but not limited to blood, serum/plasma and urine, circulating tumor cells (CTCs), and tumor as well as matched normal tissues from the prostate and other anatomical sites of metastases. In some embodiments, DNA derived from blood and serum/plasma can be used. In some embodiments, DNA isolated from formalin-fixed, paraffin-embedded tissues can be used. In some) embodiments, DNA from fresh frozen, including but not limited to CTCs, biopsy and prostatectomy, tissues can be used.
In some embodiments, methods other than CGH based, such as PCR based methods can be used to analyze DNA derived from tissues or sources obtained via less invasive approaches than prostatectomy.
Methods and algorithms will be used for identification of patients with aggressive PCa that may lead to early cancer specific death if not treated aggressively or appropriately. The current invention may also be used in an active surveillance program to monitor the progression of PCa. In addition it may be used to monitor the response to treatments of PCa.
Various detection protocols can be used in the methods of this invention to detect the CNAs described herein. Nonlimiting examples include comparative genomic hybridization such as metaphase (or conventional) and BAC/oligo/cDNA/single nucleotide polymorphic (SNP; or array-based) hybridization with various resolutions; Affymetrix SNP array, NanoString technology (such as nCounter), multiplex ligation-dependent probe amplification (MLPA); fluorescence in situ hybridization (FISH); an amplification reaction (e.g., quantitative polymerase chain reaction (PCR)); an amplification reaction and single base extension (e.g., wherein the single base extension is spotted on a silicone chip); matrix-assisted laser desorption/ionization-time of flight mass spectrometry (MALDI-TOF-MS); sequencing, hybridization; restriction endonuclease digestion analysis; electrophoresis; or any combination thereof.
In particular embodiments, detection can be carried out by multiplex ligation-dependent probe amplification (MLPA). The use of MLPA has the advantage that all seven regions can be measured simultaneously in an MLPA kit and fresh frozen and formalin fixed paraffin-embedded samples can be used as a source of DNA for analysis. Such a MLPA-based method can be used to cost effectively measure all seven DNA copy number alterations in prostate biopsy tissues. Some strengths of this method are: 1) all seven regions associated with lethal prostate cancer are included, 2) at least 3 probes for each of seven targeted regions, 3) no known SNPs within the probes, and 4) all internal reference regions do not or rarely have known copy number alterations in the prostate tumors tested.
Thus in some aspects, the present invention provides a kit for carrying out the methods of this invention (e.g., a kit comprising reagents, as well as a probe mix, to detect the CNAs of this invention in a nucleic acid sample). Such a kit can comprise oligonucleotides (e.g., primers, probes, primer/probe sets, etc.), reagents, buffers, etc., as would be known in the art, for the detection of the genetic markers of this invention in a nucleic acid sample. Such oligonucleotides can be identified and prepared and employed in methods according to the teachings and protocols described herein and as are well known in the art. A kit of this invention can further comprise blocking probes, labeling reagents, blocking agents, restriction enzymes, antibodies, sampling devices, positive and negative controls, etc., as would be well known to those of ordinary skill in the art.
As used herein, “a,” “an” or “the” can mean one or more than one. For example, “a” cell can mean a single cell or a multiplicity of cells.
Also as used herein, “and/or” refers to and encompasses any and all possible combinations of one or more of the associated listed items, as well as the lack of combinations when interpreted in the alternative (“or”).
Furthermore, the term “about,” as used herein when referring to a measurable value such as an amount of a compound or agent of this invention, dose, time, temperature, and the like, is meant to encompass variations of ±20%, ±10%, ±5%, ±1%, ±0.5%, or even ±0.1% of the specified amount.
As used herein, the term “prostate cancer” or “PCa” describes an uncontrolled (malignant) growth of cells in the prostate gland, which is located at the base of the urinary bladder and is responsible for helping control urination as well as forming part of the semen. Symptoms of prostate cancer can include, but are not limited to, urinary problems (e.g., not being able to urinate; having a hard time starting or stopping the urine flow; needing to urinate often, especially at night; weak flow of urine; urine flow that starts and stops; pain or burning during urination), difficulty having an erection, blood in the urine and/or semen, and/or frequent pain in the lower back, hips, and/or upper thighs.
As used herein, the term “aggressive prostate cancer” means prostate cancer that is poorly differentiated, having a Gleason grade of 7 or above. An “indolent prostate cancer” means prostate cancer having a Gleason grade below 7 (e.g., 6 or less). The Gleason grading system is the most commonly used method for grading PCa and is well known in the art.
The term “chromosome region” as used herein refers to a part of a chromosome defined) either by anatomical details, especially by banding, or by its linkage groups.
Also as used herein, “linked” describes a region of a chromosome that is shared more frequently in family members or members of a population manifesting a particular phenotype and/or affected by a particular disease or disorder, than would be expected or observed by chance, thereby indicating that the gene or genes or other identified marker(s) within the linked chromosome region contain or are associated with an allele that is correlated with the phenotype and/or presence of a disease or disorder (e.g., aggressive PCa), or with an increased or decreased likelihood of the phenotype and/or of the disease or disorder. Once linkage is established, association studies (linkage disequilibrium) can be used to narrow the region of interest or to identify the marker (e.g., allele or haplotype) correlated with the phenotype and/or disease or disorder.
Furthermore, as used herein, the term “linkage disequilibrium” or “LD” refers to the occurrence in a population of two or more (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, etc.) linked alleles at a frequency higher or lower than expected on the basis of the gene frequencies of the individual genes. Thus, linkage disequilibrium describes a situation where alleles occur together more often than can be accounted for by chance, which indicates that the two or more alleles are physically close on a DNA strand.
The term “genetic marker” or “polymorphism” as used herein refers to a characteristic of a nucleotide sequence (e.g., in a chromosome) that is identifiable due to its variability among different subjects (i.e., the genetic marker or polymorphism can be a single nucleotide polymorphism, an allele of a single nucleotide polymorphism, a restriction fragment length polymorphism, a microsatellite, a deletion of nucleotides, an addition of nucleotides, a substitution of nucleotides, a repeat or duplication of nucleotides, a translocation of nucleotides, a copy number alteration, and/or an aberrant or alternate splice site resulting in production of a truncated or extended form of a protein, etc., as would be well known to one of ordinary skill in the art).
A “single nucleotide polymorphism” (SNP) in a nucleotide sequence is a genetic marker that is polymorphic for two (or in some case three or four) alleles. SNPs can be present within a coding sequence of a gene, within noncoding regions of a gene and/or in an intergenic (e.g., intron) region of a gene. A SNP in a coding region in which both forms lead to the same polypeptide sequence is termed synonymous (i.e., a silent mutation) and if a different polypeptide sequence is produced, the alleles of that SNP are non-synonymous. SNPs that are not in protein coding regions can still have effects on gene splicing, transcription factor binding and/or the sequence of non-coding RNA.
The SNP nomenclature provided herein refers to the official Reference SNP (rs) identification number as assigned to each unique SNP by the National Center for Biotechnological Information (NCBI), which is available in the GenBank® database.
In some embodiments, the term genetic marker is also intended to describe a phenotypic effect of an allele or haplotype, including for example, an increased or decreased amount of a messenger RNA, an increased or decreased amount of protein, an increase or decrease in the copy number of a gene, production of a defective protein, tissue or organ, etc., as would be well known to one of ordinary skill in the art.
An “allele” as used herein refers to one of two or more alternative forms of a nucleotide sequence at a given position (locus) on a chromosome (e.g., at a single nucleotide polymorphism). An allele can be a nucleotide present in a nucleotide sequence that makes up the coding sequence of a gene and/or an allele can be a nucleotide in a non-coding region of a gene (e.g., in a genomic sequence). A subject's genotype for a given gene is the set of alleles the subject happens to possess. As noted herein, an individual can be heterozygous or homozygous for any allele of this invention.
Also as used herein, a “haplotype” is a set of alleles on a single chromatid that are statistically associated. It is thought that these associations, and the identification of a few alleles of a haplotype block, can unambiguously identify all other alleles in its region. The term “haplotype” is also commonly used to describe the genetic constitution of individuals with respect to one member of a pair of allelic genes; sets of single alleles or closely linked genes that tend to be inherited together.
The terms “increased risk” and “decreased risk” as used herein define the level of risk that a subject has of developing aggressive prostate cancer, as compared to a control subject that does not have the alleles of this invention in the control subject's nucleic acid.
A sample of this invention can be any sample containing nucleic acid from a subject, as would be well known to one of ordinary skill in the art. Nonlimiting examples of a sample of this invention include a cell, a body fluid, a tissue, biopsy or surgery material, a washing, a swabbing, etc., as would be well known in the art.
A subject of this invention is any animal that is susceptible to prostate cancer as defined herein and can include, for example, humans, as well as animal models of prostate cancer (e.g., rats, mice, dogs, nonhuman primates, etc.). In some aspects of this invention, the subject can be Caucasian (e.g., white; European-American; Hispanic), as well as of black African ancestry (e.g., black; African, Sub-Saharan African, African American; African-European; African-Caribbean, etc.) or Asian. In further aspects of this invention, the subject can have a family history of prostate cancer or aggressive prostate cancer (e.g., having at least one first degree relative having or diagnosed with prostate cancer or aggressive prostate cancer) and in some embodiments, the subject does not have a family history of prostate cancer or aggressive prostate cancer. Additionally a subject of this invention can have a diagnosis of prostate cancer or aggressive prostate cancer in certain embodiments and in other embodiments, a subject of this invention does not have a diagnosis of prostate cancer or aggressive prostate cancer. In yet further embodiments, the subject of this invention can have an elevated prostate-specific antigen (PSA) level and in other embodiments, the subject of this invention can have a normal or non-elevated PSA level. In some embodiments, the PSA level of the subject may not be known and/or has not been measured.
As used herein, “nucleic acid” encompasses both RNA and DNA, including cDNA, genomic DNA, mRNA, synthetic (e.g., chemically synthesized) DNA and chimeras, fusions and/or hybrids of RNA and DNA. The nucleic acid can be double-stranded or single-stranded. Where single-stranded, the nucleic acid can be a sense strand or an antisense strand. In some embodiments, the nucleic acid can be synthesized using oligonucleotide analogs or derivatives (e.g., inosine or phosphorothioate nucleotides, etc.). Such oligonucleotides can be used, for example, to prepare nucleic acids that have altered base-pairing abilities or increased resistance to nucleases.
An “isolated nucleic acid” is a nucleotide sequence that is not immediately contiguous with nucleotide sequences with which it is immediately contiguous (one on the 5′ end and one on the 3′ end) in the naturally occurring genome of the organism from which it is derived or in which it is detected or identified. Thus, in one embodiment, an isolated nucleic acid includes some or all of the 5′ non-coding (e.g., promoter) sequences that are immediately contiguous to a coding sequence. The term therefore includes, for example, a recombinant DNA that is incorporated into a vector, into an autonomously replicating plasmid or virus, or into the genomic DNA of a prokaryote or eukaryote, or which exists as a separate molecule (e.g., a cDNA or a genomic DNA fragment produced by PCR or restriction endonuclease treatment), independent of other sequences. It also includes a recombinant DNA that is part of a hybrid nucleic acid encoding an additional polypeptide or peptide sequence.
The term “isolated” can refer to a nucleic acid or polypeptide that is substantially free of cellular material, viral material, and/or culture medium (e.g., when produced by recombinant DNA techniques), or chemical precursors or other chemicals (when chemically synthesized). Moreover, an “isolated fragment” is a fragment of a nucleic acid or polypeptide that is not naturally occurring as a fragment and would not be found in the natural state.
The term “oligonucleotide” refers to a nucleic acid sequence of at least about five nucleotides to about 500 nucleotides (e.g. 5, 6, 7, 8, 9, 10, 12, 15, 18, 20, 21, 22, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 100, 125, 150, 175, 200, 250, 300, 350, 400, 450, 500, 550 or 600 nucleotides). In some embodiments, for example, an oligonucleotide can be from about 15 nucleotides to about 30 nucleotides, or about 20 nucleotides to about 25 nucleotides, which can be used, for example, as a primer in a polymerase chain reaction (PCR) amplification assay and/or as a probe in a hybridization assay or in a microarray. Oligonucleotides of this invention can be natural or synthetic, e.g., DNA, RNA, PNA, LNA, modified backbones, etc., as are well known in the art.
The present invention further provides fragments of the nucleic acids of this invention, which can be used, for example, as oligonucleotides, primers and/or probes. Such fragments or oligonucleotides can be detectably labeled, ligated or modified, for example, to include and/or incorporate a restriction enzyme cleavage site when employed as a primer in an amplification (e.g., PCR) assay.
The detection of a polymorphism, genetic marker or allele of this invention can be carried out according to various protocols standard in the art and as described herein for analyzing nucleic acid samples and nucleotide sequences, as well as identifying specific nucleotides in a nucleotide sequence.
For example, nucleic acid can be obtained from any suitable sample from the subject that will contain nucleic acid and the nucleic acid can then be prepared and analyzed according to) well-established protocols for the presence of genetic markers according to the methods of this invention. In some embodiments, analysis of the nucleic acid can be carried by amplification of the region of interest according to amplification protocols well known in the art (e.g., polymerase chain reaction, ligase chain reaction, strand displacement amplification, transcription-based amplification, self-sustained sequence replication (3SR), Qβ replicase protocols, nucleic acid sequence-based amplification (NASBA), repair chain reaction (RCR) and boomerang DNA amplification (BDA), etc.). The amplification product can then be visualized directly in a gel by staining or the product can be detected by hybridization with a detectable probe. When amplification conditions allow for amplification of all allelic types of a genetic marker, the types can be distinguished by a variety of well-known methods, such as hybridization with an allele-specific probe, secondary amplification with allele-specific primers, by restriction endonuclease digestion, and/or by electrophoresis. Thus, the present invention further provides oligonucleotides for use as primers and/or probes for detecting and/or identifying genetic markers according to the methods of this invention.
In some embodiments of this invention, detection of an allele or combination of alleles of this invention can be carried out by an amplification reaction and single base extension. In particular embodiments, the product of the amplification reaction and single base extension is spotted on a silicone chip.
In yet additional embodiments, detection of an allele or combination of alleles of this invention can be carried out by matrix-assisted laser desorption/ionization-time of flight mass spectrometry (MALDI-TOF-MS).
It is further contemplated that the detection of an allele or combination of alleles of this invention can be carried out by various methods that are well known in the art, including, but not limited to nucleic acid sequencing, hybridization assay, restriction endonuclease digestion analysis, ligation, electrophoresis, and any combination thereof.
The genetic markers (e.g., CNAs) of this invention are correlated with (i.e., identified to be statistically associated with) aggressive prostate cancer as described herein according to methods well known in the art and as disclosed in the Examples provided herein for statistically correlating genetic markers with various phenotypic traits, including disease states and pathological conditions as well as determining levels of risk associated with developing a particular phenotype, such as a disease or pathological condition. In general, identifying such correlation involves conducting analyses that establish a statistically significant association and/or a statistically significant correlation between the presence of a genetic marker or a combination of markers and the phenotypic trait in a population of subjects and controls (e.g., a population of subjects in whom the phenotype is not present or has not been detected). The correlation can involve one or more than one genetic marker of this invention (e.g., two, three, four, five, or more) in any combination. An analysis that identifies a statistical association (e.g., a significant association) between the marker or combination of markers and the phenotype establishes a correlation between the presence of the marker or combination of markers in a population of subjects and the particular phenotype being analyzed. A level of risk (e.g., increased or decreased) can then be determined for an individual on the basis of such population-based analyses.
Thus, in certain embodiments, the present invention provides a method of screening a subject for a genetic marker (e.g., a copy number alteration) that is associated with aggressive prostate cancer, comprising: a) performing a population based study to detect genetic markers (e.g., copy number alterations) in a group of subjects with aggressive prostate cancer and a group of control subjects; b) identifying copy number alterations in the aggressive prostate cancer group of subjects that are statistically associated with the presence of aggressive prostate cancer; and c) screening a subject for the presence of the copy number alterations identified in step (b).
The present invention further provides a method of identifying an effective and/or appropriate (i.e., for a given subject's particular condition or status) treatment regimen for a subject with aggressive prostate cancer, comprising detecting one or more of the genetic markers associated with aggressive prostate cancer of this invention in the subject, wherein the one or more genetic markers are further statistically correlated with an effective and/or appropriate treatment regimen for aggressive prostate cancer according to protocols as described herein and as are well known in the art.
Also provided is a method of identifying an effective and/or appropriate treatment regimen for a subject with aggressive prostate cancer, comprising: a) correlating the presence of one or more genetic markers of this invention in a test subject or population of test subjects with aggressive prostate cancer for whom an effective and/or appropriate treatment regimen has been identified; and b) detecting the one or more markers of step (a) in the subject, thereby identifying an effective and/or appropriate treatment regimen for the subject.
Further provided is a method of correlating a genetic marker of this invention with an effective and/or appropriate treatment regimen for aggressive prostate cancer, comprising: a) detecting in a subject or a population of subjects with aggressive prostate cancer and for whom an effective and/or appropriate treatment regimen has been identified, the presence of one or more genetic markers or polymorphisms of this invention; and b) correlating the presence of the one or more genetic markers of step (a) with an effective treatment regimen for aggressive prostate cancer.
Examples of treatment regimens for prostate cancer are well known in the art. Subjects who respond well to particular treatment protocols can be analyzed for specific genetic markers and a correlation can be established according to the methods provided herein. Alternatively, subjects who respond poorly to a particular treatment regimen can also be analyzed for particular genetic markers correlated with the poor response. Then, a subject who is a candidate for treatment for aggressive prostate cancer can be assessed for the presence of the appropriate genetic markers and the most effective and/or appropriate treatment regimen can be provided as early as possible.
In some embodiments, the methods of correlating genetic markers with treatment regimens of this invention can be carried out using a computer database. Thus the present invention provides a computer-assisted method of identifying a proposed treatment for aggressive prostate cancer and/or appropriate treatment for a subject carrying a genetic marker correlated with aggressive prostate cancer.
The method can include the steps of (a) storing a database of biological data for a plurality of subjects, the biological data that is being stored including for each of said plurality of subjects, for example, (i) a treatment type, (ii) at least one genetic marker associated with aggressive prostate cancer and (iii) at least one disease progression measure for aggressive prostate cancer from which treatment efficacy can be determined; and then (b) querying the database to determine the correlation between the presence of said genetic marker and the effectiveness of a treatment type in treating aggressive prostate cancer, to thereby identify a proposed treatment as an effective for aggressive prostate cancer and/or an appropriate treatment for a subject carrying a genetic marker correlated with aggressive prostate cancer.
In some embodiments, treatment information for a subject is entered into the database (through any suitable means such as a window or text interface), genetic marker information for that subject is entered into the database, and disease progression information is entered into the database. These steps are then repeated until the desired number of subjects has been entered into the database. The database can then be queried to determine whether a particular treatment is effective for subjects carrying a particular marker or combination of markers, not effective for subjects carrying a particular marker or combination of markers, etc. Such querying can be carried out prospectively or retrospectively on the database by any suitable means, but is generally done by statistical analysis in accordance with known techniques, as described herein.
The following examples are not intended to limit the scope of the claims to the invention, but are rather intended to be exemplary of certain embodiments. Any variations in the exemplified methods that occur to the skilled artisan are intended to fall within the scope of the present invention. As will be understood by one skilled in the art, there are several embodiments and elements for each aspect of the claimed invention, and all combinations of different elements are hereby anticipated, so the specific combinations exemplified herein are not to be construed as limitations in the scope of the invention as claimed. If specific elements are removed or added to the group of elements available in a combination, then the group of elements is to be construed as having incorporated such a change.
Using DNA samples from frozen tumors of 125 patients treated by radical prostatectomy with a median follow-up of ˜seven years from Johns Hopkins Hospital (JHH) in the US and the algorithm of Genomic Identification of Significant Targets in Cancer (GISTIC), seven copy number alterations (CNAs) were identified that were significantly associated with early PCa-specific mortality. These include gains of chromosomal regions that contain the genes MYC, ADAR, or TPD52 and losses of sequences that incorporate SERPINB5, USP10, PTEN, or TP53. Furthermore, multivariate analysis revealed that deletion of the gene PTEN and amplification of the gene MYC contributed additional prognostic information independent of that provided by traditional clinicopathologic measurements, such as pathologic stage, Gleason score, and initial PSA level. Finally, 69 genomic regions in which CNAs were not or rarely observed in the tumor cells were defined using DNA copy number data from 5 different PCa cohorts for use as references in testing CNAs in PCa. The present invention describes the use of these CNAs for identification of patients with aggressive PCa that may lead to high risk of early cancer specific death if patients are not treated aggressively or appropriately. These CNAs allow for more accurate patient prognosis, at the time of surgery or biopsy, and help guide the selection of appropriate therapy and disease management strategy.
Genetic Markers Associated with Early Cancer-Specific Mortality Following Prostatectomy.
PCa is the most common cancer among men in the United States with ˜242,000 expected to be diagnosed in 2012.1 Although many of these tumors may be indolent and not require treatment, ˜28,000 men die from this disease annually.1 Treatment options for patients who present with localized disease include surgery, radiotherapy, medical therapies, or surveillance. The outcome following prostatectomy is generally excellent, with large series reporting PCa-specific mortality rates as ˜15% or less after ten years or more.2,3 However, the Swedish randomized trial showed that the benefit of surgery over conservative management was modest, with a projected PCa-specific mortality rate after fifteen years of 14.6% versus 20.7%, respectively.4 Although this study showed that prostatectomy can prevent cancer deaths, a sizable proportion of patients had tumors that were not life-threatening at least within the fifteen-year time frame. Conversely, one in seven of the patients who have a prostatectomy nonetheless relapses and dies of progressive cancer.
From this and other studies, it is apparent that the care of patients with early PCa could be improved by the development of novel therapies and tumor prognostic markers. These can help distinguish between those patients with life-threatening disease that require more aggressive therapy from those with indolent disease which could be treated conservatively. One strategy to achieve these goals is to exploit knowledge of the molecular pathogenesis of the disease. As in all cancers, the malignant phenotype of PCa is driven by the acquisition and collaboration of somatic alterations.5-8 Thus, analysis of copy number abnormalities in tumor genomes provides insights into the specific genes and molecular pathways that promote tumorigenesis and determine the clinical course. As described here, high resolution SNP arrays and GISTIC, respectively, were used to identify and determine the significant CNAs in tumors obtained at prostatectomy from 125 patients. These CNAs were then correlated with clinicopathologic features and clinical outcome. The results revealed new CNAs and genes that likely contribute to the pathogenesis of PCa and associations between specific CNAs and early PCa-specific mortality.
The initial discovery cohort consisted of 141 patients who underwent radical prostatectomy at Johns Hopkins Hospital (JHH) in the U.S. between 1988 and 2004. To minimize the effects from confounding factors and to include as many eligible patients as possible for better statistical power, the effects of clinical variables were evaluated, including Gleason score, metastasis, race and adjuvant therapy on PCa-specific mortality. The following patients were eliminated: 5 patients without Gleason score, 5 patients with metastases and 6 patients without death information from the original cohort of 141 patients. The resulting 125 patients were then used to analyze the effects of race and no association between race and PCa-specific death was found. Therefore 115 Caucasian, eight African Americans and two others were included in the study and race effects were not stratified in the analysis. In addition a significant correlation (P=0.04) was found between patients with higher Gleason scores and patients with adjuvant therapy, however this did not lead to a demonstrable reduction in PCa-mortality. Therefore, 36 patients who received hormone- and/or chemo-therapies were included in the final study population of 125 eligible patients with a median follow-up of seven years from JHH. As shown in Table 1, many of these patients had a more aggressive form of PCa; 34%, 33%, and 44% of patients, respectively had a pathologic Gleason score≧8, a pathologic stage≧T3b, and pretreatment serum PSA>10 ng/mL. Consistent with this frequency of high risk disease, 22 of these patients (˜18%) died of PCa, while the remaining 103 were still alive or died from other causes as of June, 2009. Eight out of 22 PCa-specific deaths happened within five years after surgery in the JHH cohort; the five-year survival rate was 64%.
The second cohort included 103 prostatectomy patients who were treated between 2002 and 2008 with a median follow-up of about five years from the Karolinska University Hospital (KUH) in Sweden (Table 1). In contrast to the JHH cohort described above, most of the Swedish patients had a less aggressive form of PCa. About 85%, 82% and 64% of patients, respectively had a pathologic Gleason score≦7, a pathologic stage<T3, and pretreatment serum PSA≦10 ng/mL. Consistent with this frequency of low risk disease and shorter follow-up time, only four subjects died from PCa, while the remaining 99 were still alive or died from other causes at the time of data analysis as of December, 2010. Three out of four PCa-specific deaths happened within five years after surgery in the KUH cohort; the five-year survival rate was 25%. A third cohort was composed of 216 patients from Memorial Sloan Kettering Cancer Center (MSKCC) with clinicopathologic and survival information publically available.7 An additional group of 14 JHH patients who died of progressive PCa and underwent autopsy provided tumor samples for the study of lethal PCa.6 Informed consent was obtained and the Institutional Review Board/Ethics Committee in participating institutions approved the study.
Somatic tumor and matched normal DNA from patients of the JHH and KUH cohorts was prepared and used for SNP array analysis of genome-wide CNAs as described previously.6 GISTIC method9 was used to identify significant CNAs. As the number of genes in each of the significant regions varied, the CNAs were named by the known or suspected tumor suppressor or oncogene within the altered sequences or by the first gene listed GISTIC9 in the region.
Association of clinicopathologic variables and DNA CNAs with PCa-specific mortality was explored using logistic regression. CNAs were coded as either deletions or gains. The primary outcome was mortality due to PCa. Both univariate and multivariate analyses were incorporated in the logistic regression model. First, univariate analysis was performed in order to check the relationship between the end point and the explanatory variables. Second, for associations with PCa-specific mortality, multiple variables were included in the logistic regression model based on stepwise model selection and all models retained clinicopathologic variables, including age, Gleason score, and preoperative PSA, whether they were significant. Tumor-stage was not included the because of multicollinearity with Gleason score. Studies were also done to test for any pair of alterations that was significantly associated with PCa-specific mortality in a multivariate analysis. The analysis was restricted to significant alterations because of the small sample size. The P value for the difference between the two areas under the receiver operating characteristic curve (AUC) was calculated using a nonparametric approach.10
To explore the joint effect of alterations at PTEN and MYC on PCa-specific mortality, the raw data were displayed using a four by two table. Odds ratio (OR) and the 95% confidence intervals were calculated based on the contingency table. P-values were calculated using Fisher's exact test, the Cochran-Armitage trend test, and Cochran-Mantel-Haenszel (CMH) test. All of the analyses were performed using SAS 9.2 Software.
Genome-Wide Analysis of Tumor DNA Reveals the 20 Most Significant CNAs in Clinically Localized PCa.
Using the SNP arrays and GISTIC algorithm with a q-value of 0.01 to analyze 125 primary tumors from the JHH discovery cohort, the 20 most significant CNAs were identified, along with the most commonly gained or deleted gene(s) within each region (
Four of the CNAs, consisting of deletions that involved DISC1 (102.2), LRP1B (2q22.1) and HRT3A (11q23) or gains of ADAR1 (1q21.3) had not been described previously. Nine of the other CNAs affected known PCa tumor suppressor genes [CHD1 (5q21.1), MAPK3K7 (6q15), PTEN (10q23.31), CDKN1B (12p13.1), RB1 (13q14.2), and TP53 (17p13.1)] or oncogenes [MYC (8q24.2), TPD52 (8q21.3) and TMPRSS2-ERG fusions (21q22)]. For the other CNAs, the analyses identified or reduced the number of candidate targets. For example, LRP1B and RYBP1 appeared to be targets of the novel CNA at 2q22.1 and deletions at 3p13, respectively. It was also confirmed that deletions on 5q21.1 targeted CHD1 whereas those at 5q11.2 involved PDE4D and RAB3C.7
Association of CNAs with Clinicopathologic Features and Clinical Outcome.
As expected, univariate analysis showed that high tumor Gleason score (P=0.01) and stage (P=0.05) correlated with PCa-specific mortality in the JHH cohort, although PSA level and age at surgery did not (Table 2). It was also found that adjuvant treatment of a subset of patients did not reduce PCa-specific mortality. On univariate analysis, six of the twenty CNAs (TP53, MAP3K7, CHD1, PDE4D, COL1A2, and PTEN) were associated with high Gleason scores, tumor stage, or both. Importantly, CNAs of MYC (8q24.21), SERPINB5 (18q21.33), TPD52 (8q21.13), USP10 (16q24.1), PTEN (10q23.31), TP53 (17p13.1), and a novel one described here, ADAR (1q21.3), significantly correlated with PCa-specific mortality (Table 2). For five, the strength of association was similar to or greater than that observed with Gleason score or tumor stage. Gains of MYC conferred the greatest risk of dying from PCa with an OR of 4.75 (P=0.002).
Multivariate logistic regression analysis using a model that incorporated the genetic markers and clinicopathologic variables (forced-in) showed that the CNAs of PTEN and MYC conferred additional independent prognostic information. The P-value of the Hosmer-Lemeshow test for the calibration of this model was 0.703. For PTEN loss, the estimated OR for PCa-specific mortality was 7.31 [95% confidence interval (CI): 1.98-27.0; P=0.003] and for MYC gain was 7.82 (95% CI: 2.30-26.6; P=0.001) (Table 3). To explore a joint effect, distributions of CNAs at PTEN and MYC within this patient population were analyzed (Table 4). Although the presence of either conferred a borderline increase in risk, those patients whose tumors harbored both alterations had a significant and markedly increased OR for PCa-specific mortality of 53 (95% C.I.=6.92-405, P=1×10−4). Consistent with a joint effect, patients whose tumors had any combination of the two CNAs had an OR of 7.36 (95% CI: 1.57-34.5; P=0.0112). The independent prognostic information provided by the CNA data appeared to be most relevant to tumors with Gleason scores of 7 or less. Tests were conducted to determine if the AUC was significantly improved by adding genetic variables using the nonparametric method of De Long et al.10 In this group, inclusion of the genetic data in prognostic models generated by logistic regression improved the AUC by ˜12% to 0.89, which achieved borderline significance, whereas the improvement for tumors with Gleason scores of 8 or more was not significant.
Association of CNAs at PTEN/MYC with PCa-Specific Mortality in Independent Patient Cohorts and with Metastatic Disease.
To confirm the associations of PTEN and MYC alterations in tumor DNA with PCa-specific mortality, two other independent patient cohorts were studied. These included 103 patients who underwent prostatectomy at KUH (Table 1) and 216 patients treated at MSKCC. In the latter group, the clinical and comparative genomic hybridization data were retrieved from a publically accessible database.7 The analysis of both groups was confounded by the low number of PCa-specific deaths (four in each set) that was likely related to the inclusion of patients with favorable clinicopathologic prognostic features in the study populations. Nonetheless, in the MSKCC group, a significant association of MYC gain (P=0.0216) or CNAs at PTEN and/or MYC (P=0.0092) with PCa-specific mortality was found. The Cochran-Armitage trend test revealed a highly significant joint effect (Ptrend=0.003). For tumors from the KUH cohort, the trend test revealed equivocal results (P=0.1071); however, when the KUH and MSKCC cohorts were analyzed together by the Cochran-Mantel-Haenszel (CMH) test, the relationship between the CNAs of PTEN/MYC and PCa-specific mortality proved to be highly significant (P=0.004). In the combined group, only a single patient out of 201 (0.5%) whose tumors lacked these markers died of PCa, in contrast to 6% for those with tumors harboring CNAs of MYC, PTEN, or both.
One inference from the above results is that PCa harboring CNAs of PTEN and MYC are more likely to have the acquired lethal phenotype and thus alterations in these genes are expected to be over-represented in lethal PCa. As a first step to address this possibility, the CNAs in tumor tissues obtained at autopsy from 14 men who died from metastatic PCa6 were analyzed. The results showed that tumors from all of these patients (100%) had alterations in at least one of these two genes and eight (57%) had alterations in both genes, compared to 58.4% and 9.6%, respectively, in localized PCa from the JHH prostatectomy cohort.
Among the twenty regions of significant CNAs (
Although seven CNAs [MYC (8q24.21), ADAR (1q21.3), SERPINB5 (18q21.33-22.1), TPD52 (8q21.13), USP10 (16q24.1), PTEN (10q23.31), and TP53 (17p13.1)] significantly correlated with early PCa-specific mortality in the JHH cohort, only those of PTEN and MYC contributed prognostic information beyond that provided by standard clinicopathologic features. The relationship of these two CNAs with clinical outcome was also evident in the MSKCC and KUH patients despite the low number of PCa-specific deaths in each group. This study is the first to demonstrate a joint effect in clinical cohorts where CNAs of both PTEN and MYC in the tumor genome imposed the most significant risk for PCa-specific mortality following prostatectomy.
An important question is whether the independent prognostic information provided by CNAs of PTEN and MYC is sufficient to impact clinical management or the stratification of patients in clinical trials. A variety of other molecular markers, including genetic alterations and gene expression profiles, have been developed and tested in hopes of improving the accuracy of prognostic models.11, 15, 16 Although many have shown promise, none have been sufficiently validated and/or robust to justify their clinical application. CNAs detected by SNP microarrays have also been reported to correlate with biochemical relapse following prostatectomy or radiation therapy.7, 17 However, biochemical relapse per se often shows little or no correlation with PCa-specific mortality.18, 19
Our analyses of tumor DNAs from three additional cohorts with a total of 333 patients support an association between CNAs of PTEN and MYC with PCa-specific mortality. These genetic data could be incorporated into predictive models to help select prostatectomy patients who are more or less likely to benefit from adjuvant therapies and stratify patients for clinical studies.23, 24 The revised model may be particularly helpful for segregating patients with Gleason 7 tumors, a troublesome category in terms of predicting outcome. As shown here, the main impact of the CNA data on prognostic accuracy in the JHH patients was for those patients whose tumors had Gleason scores of ≦7. In addition, we noted that none of the 37 (0%) JHH patients with Gleason≦7 tumors lacking CNAs at PTEN and MYC died of PCa. Similarly only one out of 201 (0.5%) in the MSKCC and KUH cohorts, the majority of whom had Gleason≦7 tumors, died of PCa. This compares to nine of 45 (20%) and seven of 117 (6%), respectively, for those patients whose tumors contained one or both markers, that died of PCa. Thus, patients whose tumors have lower Gleason scores and lack of CNAs of PTEN and MYC are unlikely to benefit from adjuvant therapies. The prognostic significance of the CNAs detected in resected PCa may also apply to CNAs determined by FISH or SNP microarray analyses of genomic DNAs in tumor cells obtained by needle biopsy. CNA profiles in biopsy samples could help select those patients best treated with prostatectomy or radiation and those who could be managed conservatively.
Abstract.
Most prostate cancers are considered to be indolent (non-aggressive) and may not even require treatment. However, some of them are aggressive tumors that are characterized by uncontrolled cell proliferation resulting in cancer progression, recurrence and metastases, leading to ˜28,000 estimated deaths in 2012. Clinicopathologic parameters are strong predictors of disease recurrence, but there is no reliable marker to distinguish between those patients within each subgroup who are at high or low risk for prostate cancer specific mortality. To identify novel effectors and markers of localized but potentially life-threatening prostate cancer, DNA copy number alterations (CNAs) and nucleotide mutations were evaluated in the tumor genomes from patients who underwent prostatectomy using high resolution SNP arrays and exome sequencing, respectively. Studies were carried out to determine whether these somatic alterations can augment clinicopathologic parameters in predicting early prostate cancer specific death. Using the algorithm of Genomic Identification of Significant Targets in Cancer with a q-value of 0.01 to analyze the data from the tumor genomes of 125 patients, twenty significant regions of CNAs were identified, four of them novel, and the unique target genes of four of the altered regions were identified. By univariate analysis, seven CNAs were significantly associated with early prostate cancer specific mortality. These included gains of chromosomal regions that contain the genes MYC, ADAR, or TPD52 and losses of sequences that incorporate SERPINB5, USP10, PTEN, or TP53. On multivariate analysis, only the CNAs of PTEN and MYC contributed additional prognostic information independent of that provided by pathologic stage, Gleason score, and initial PSA level. Patients whose tumors had alterations of both genes had a markedly elevated risk of prostate cancer specific mortality (OR=53; C.I.=6.92-405, P=1×10−4). Analyses of the tumor genomes of 333 patients from three additional distinct patient cohorts confirmed the relationship between CNAs of PTEN and MYC and lethal prostate cancer. This study identifies new CNAs and genes that likely contribute to the pathogenesis of localized PCa and indicates that patients whose tumors have acquired CNAs of PTEN, MYC, or both have an increased risk of early PCa-specific mortality.
Patients and Methods.
The initial discovery cohort consisted of 125 eligible patients who underwent radical prostatectomy at Johns Hopkins Hospital (JHH) in the U.S. between 1988 and 2004 (Table 1). Many of these patients had a more aggressive form of prostate cancer (PCa). The second cohort included 103 prostatectomy patients who were treated between 2002 and 2008 with a median follow-up of about five years from the Karolinska University Hospital (KUH) in Sweden (Table 1). In contrast to the JHH cohort described above, most of the Swedish patients had a less aggressive form of PCa. A third cohort was composed of 216 patients from Memorial Sloan Kettering Cancer Center (MSKCC) with clinicopathologic and survival information publically available.
Somatic tumor and matched normal DNA from patients of the JHH and KUH cohorts was prepared and used for SNP array analysis of genome-wide CNAs. GISTIC method was used to identify significant CNAs. As the number of genes in each of the significant regions varied, the CNAs were named by the known or suspected tumor suppressor or oncogene within the altered sequences or by the first gene listed GISTIC in the region. Association of clinicopathologic variables and DNA CNAs with PCa-specific mortality was explored using logistic regression. Both univariate and multivariate analyses were incorporated in the logistic regression model. P-values were calculated using Fisher's exact test, Cochran-Armitage trend test, and Cochran-Mantel-Haenszel (CMH) test.
Genome-Wide Analysis of Tumor DNA Reveals the 20 Most Significant CNAs in Clinically Localized PCa.
Using the SNP arrays and GISTIC algorithm with a q-value of 0.01 to analyze 125 primary tumors from the JHH discovery cohort, the 20 most significant CNAs were identified, along with the most commonly gained or deleted gene(s) within each region (
Four of the CNAs, consisting of deletions that involved DISC1 (1q42.2), LRP1B (2q22.1) and HRT3A (11q23) or gains of ADAR1 (1q21.3) had not been described previously. Nine of the other CNAs affected known PCa tumor suppressor genes [CHD1 (5q21.1), MAPK3K7 (6q15), PTEN (10q23.31), CDKN1B (12p13.1), RB1 (13q14.2), and TP53 (17p13.1)] or oncogenes [MYC (8q24.2), TPD52 (8q21.3) and TMPRSS2-ERG fusions (21q22)]. For the other CNAs, the analyses identified or reduced the number of candidate targets. For example, LRP1B and RYBP1 appeared to be targets of the novel CNA at 2q22.1 and deletions at 3p13, respectively. It was also confirmed that deletions on 5q21.1 targeted CHD1 whereas those at 5q11.2 involved PDE4D and RAB3C.
Association of CNAs with Clinicopathologic Features and Clinical Outcome.
As expected, univariate analysis showed that high tumor Gleason score (P=0.01) and stage (P=0.05) correlated with PCa-specific mortality in the JHH cohort, although PSA level and age at surgery did not (Table 2). It was also found that adjuvant treatment of a subset of patients did not reduce PCa-specific mortality. On univariate analysis, six of the twenty CNAs (TP53, MAP3K7, CHD1, PDE4D, COL1A2, and PTEN) were associated with high Gleason scores, tumor stage, or both. Importantly, CNAs of MYC (8q24.21), SERPINB5 (18q21.33), TPD52 (8q21.13), USP10 (16q24.1), PTEN (10q23.31), TP53 (17p13.1), and a novel one described here, ADAR (1q21.3), significantly correlated with PCa-specific mortality (Table 2). Gains of MYC) conferred the greatest risk of dying from PCa with an OR of 4.75 (P=0.002).
Multivariate logistic regression analysis using a model that incorporated the genetic markers and clinicopathologic variables (forced-in) showed that the CNAs of PTEN and MYC conferred additional independent prognostic information. For PTEN loss, the estimated OR for PCa-specific mortality was 7.31 [95% confidence interval (CI): 1.98-27.0; P=0.003] and for MYC gain was 7.82 (95% CI: 2.30-26.6; P=0.001) (Table 3). To explore a joint effect, the distributions of CNAs at PTEN and MYC within this patient population were analyzed (Table 4). Although the presence of either conferred a borderline increase in risk, those patients whose tumors harbored both alterations had a significant and markedly increased OR for PCa-specific mortality of 53 (95% C.I.=6.92-405, P=1×10−4). Consistent with a joint effect, patients whose tumors had any combination of the two CNAs had an OR of 7.36 (95% CI: 1.57-34.5; P=0.0112).
Independent Prognostic Information Provided by the CNA Data Appears to be Most Relevant to Tumors with Gleason Scores of 7 or Less.
Studies were conducted to determine whether the receiver operating characteristic curve (AUC) was significantly improved by adding genetic variables using the nonparametric method of De Long et al. In this group, inclusion of the genetic data in prognostic models generated by logistic regression improved the AUC by ˜27% to 0.88 which achieved statistical significance, whereas the improvement for tumors with Gleason scores of 8 or more was not significant (Table 5).
The model may be particularly helpful for segregating patients with Gleason 7 tumors, a troublesome category in terms of predicting outcome. As shown here, the main impact of the CNA data on prognostic accuracy in the JHH patients was for those patients whose tumors had Gleason scores of ≦7. In addition, it was noted that none of the 37 (0%) JHH patients with Gleason≦7 tumors lacking CNAs at PTEN and MYC died of PCa.
Association of CNAs at PTEN/MYC with PCa-Specific Mortality in Independent Patient Cohorts and with Metastatic Disease.
To confirm the associations of PTEN and MYC alterations in tumor DNA with PCa-specific mortality, two other independent patient cohorts were studied. These included 103 patients who underwent prostatectomy at KUH (Table 1) and 216 patients treated at MSKCC. In the later group, the clinical and CGH data were retrieved from a publically accessible database. The analysis of both groups was confounded by the low number of PCa-specific deaths (four in each set) that was likely related to the inclusion of patients with favorable clinicopathologic prognostic features in the study populations (Table 6). Nonetheless, in the MSKCC group, a significant association of MYC gain (P=0.0216) or CNAs at PTEN and/or MYC (P=0.0092) with PCa-specific mortality was found. The Cochran-Armitage trend test revealed a highly significant joint effect (Ptrend=0.003). For tumors from the KUH cohort, the trend test revealed equivocal results (P=0.1071); however, when the KUH and MSKCC cohorts were analyzed together by the Cochran-Mantel-Haenszel (CMH) test, the relationship between the CNAs of PTEN/MYC and PCa-specific mortality proved to be highly significant (P=0.004). In the combined group, only a single patient out of 201 (0.5%) whose tumors lacked these markers died of PCa, in contrast to 6% for those with tumors harboring CNAs of MYC, PTEN, or both.
One inference from the above results is that PCa harboring CNAs of PTEN and MYC are more likely to have the acquired lethal phenotype and thus alterations in these genes are expected to be over-represented in lethal PCa. As a first step to address this possibility, the CNAs in tumor tissues obtained at autopsy from 14 men who died from metastatic PCa at JHH were analyzed. The results showed that tumors from all of these patients (100%) had alterations in at least one of these two genes and eight (57%) had alterations in both genes, compared to 58.4% and 9.6%, respectively, in localized PCa from the JHH prostatectomy cohort.
All publications and patent applications, nucleotide sequences and/or amino acid sequences identified by GenBank® Database Accession numbers are herein incorporated by reference to the same extent as if each individual publication or patent application or sequences was specifically and individually indicated to be incorporated by reference.
Although the foregoing invention has been described in some detail by way of illustration and example for purposes of clarity of understanding, it will be apparent that certain changes and modifications may be practiced within the scope of the list of the foregoing embodiments and the appended claims.
†Clinicopathological variables are dichotomized at the cut-off point of presented values. ‘d’ and ‘dd’ denote hemizygous and homozygous deletion, respectively. ‘g’ and ‘a’ denote one and > one additional copy gain of DNA, respectively.
This application claims the benefit, under 35 U.S.C. §119(e), of U.S. Provisional Application Ser. No. 61/785,636, filed Mar. 14, 2013, the entire contents of which are incorporated by reference herein.
This invention was made with government support under Grant Nos. CA106523, CA95052, CA105055, CA133066 and CA135008 awarded by the National Institutes of Health and under Grant No. PC051264 awarded by the Department of Defense. The government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2014/028371 | 3/14/2014 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
61785636 | Mar 2013 | US |