The present invention relates to methods of selecting treatment for, ascertaining whether treatment is working in, and/or prognostication of cancer in a subject. Also provided herein are a set of probes, kits, and an in vitro assay for use in methods of the present invention.
Clinical implementable molecular stratification underlies the drive for precision medicine in cancer. Prostate cancer is a leading cause of cancer death among men and in the past few years, studies investigating the genomic landscape of metastatic prostate cancer have led to the identification of targetable molecular alterations, emerging resistance mechanisms, and new therapeutic options.
Following the approval of several PARP (poly-ADP ribose polymerase) inhibitors as a therapeutic option for various cancers, such as metastatic castration-resistant prostate cancer (mCRPC) patients with an alteration in BRCA1/2 and in some situations other DNA repair genes (for example ATM, FANCA, CHEK2, PALB2 and HDAC2), biomarker-driven treatment selection is becoming clinically possible (de Bono et al., N Engl J Med. 2020; 382(22):2091-2102). However, archival tissue biopsies, used for biomarker assessment for trial enrollment, have a number of limitations including not being representativeness of current disease, bias from potential intra-patient heterogeneity and a high failure rate (Colomer et al., E Clinical Medicine 2020; 25:100487).
Recently, liquid biopsies have proved to be a promising alternative to tissue biopsy for detecting genomic aberrations and molecular subtype characterization (Maia et al., Nat Rev Urol 2020, Nat Rev Urol. 2020; 17(5):271-291), also allowing for serial testing, through non-invasive blood draws. Liquid biopsies could also help to readily identify possible resistance mechanisms and to detect minimal residual disease (Heitzer et al., Nature Reviews Genetics 2019, Nat Rev Genet. 2019; 20(2):71-88). Further, liquid biopsies provide a more comprehensive characterization of a patient's tumor that is neither temporally nor spatially restricted, as is the case of tissue biopsies (Siravegna et al., Nature Reviews Clinical Oncology 2017, Nat Rev Clin Oncol. 2017; 14(9):531-548), and can serve for the enrollment of patients in umbrella trials. For example, liquid biopsies that involve the analysis of circulating free (cfDNA), and specifically circulating tumor DNA (ctDNA), have shown promise in the stratification and treatment selection of mCRPC patients (Chi et al., Journal of Clinical Oncology 2020 38:15_suppl, 5551-5551).
However, biological and technical issues influence the ability to accurately stratify patients using current methods, especially for copy number change rich cancers, such as prostate cancer (Hieronymus et al., Elife. 2018 Sep. 4; 7:e37294 and U et al., Nature. 2020 February; 578(7793):112-121). Biological and technical challenges for accurate detection of genomic lesions in ctDNA, such as metastatic prostate cancer (mPC) plasma ctDNA, include broad ranges of tumor fractions, intra-patient genomic heterogeneity, frequent aneuploidy, and common imbalanced copy number changes. For example, low ctDNA fractions limit the accurate detection of copy number changes, in particular copy number losses, and further make it hard to discriminate between mono- and biallelic gene loss (Warner et al., Clin. Canc. Res. 2021, Clin Cancer Res. 2021; 27(6):1650-1662).
ctDNA fractions depend on the cancer type and stage of the disease, and can significantly vary between patients. For example, in one study it was found that in a CRPC cohort the median ctDNA fraction in the plasma sample was 18.1% (Romanel et al., Sci Transl Med. 2015; 7(312):312re10). In another study, in the earlier setting of patients with CRPC, a median ctDNA of 11% was found (Vandekerkhove et al., Eur Urol. 2019; 75(4):667-675).
In samples containing low levels of tumor DNA (for example a cfDNA sample comprising <20% ctDNA), current methods display low accuracy in the identification of copy number losses. This greatly limits the amount of information that can be inferred from a sample about the patient's disease. For example, without an accurate measure of allelic imbalance and copy number status, it is not possible to detect genome aberrations such as copy-number neutral losses of heterozygosity. This is important because without an accurate measure of allelic imbalance and copy number status, it is not possible to accurately identify biallelic alterations such as a monoallelic mutation coupled with loss of the other allele.
There therefore remains a significant need for improved diagnostic and analytical methods for the stratification and monitoring of cancers, particularly methods that display greater sensitivity and that provide more detailed information on the genetic aberrations present in a cancer.
The present invention provides an in vitro method for staging, classification, screening, monitoring, stratification, selecting treatment for, ascertaining whether treatment is working in, and/or prognostication of cancer in a subject, wherein said method comprises the steps defined in claim 1.
The present invention provides an in vitro assay for staging, classification, screening, monitoring, stratification, selecting treatment for, ascertaining whether treatment is working in, and/or prognostication of cancer in a subject, wherein said method comprises the steps defined in claim 24.
The present inventors have found that the method and assay of the present invention are surprisingly effective and sensitive at detecting and monitoring allele-specific copy number aberrations (asCNAs) in subjects known or suspected of having a cancer. The method and assay of the present invention utilizes a selected panel of high MAF SNP loci in bespoke gene-regions including the exonic, intronic and flanking regions of a number of selected genes. The combination of the genes, the gene-region designs, and the SNPs selected within each region, have been found by the inventors to lead to the method and assay of the invention being both effective and sensitive for detecting asCNAs. Example 1 below describes a method according to the present invention and an assay according to the present invention, and shows results of the use of the method/assay (“Example 1 SNP panel”). As can be seen from the results section and
The Example 1 SNP panel method was also applied on serial samples from 3 patients treated with a PARP inhibitor (1 BRCA mutant and 2 ATM mutant patients). For the BRCA mutant patient, the Example 1 SNP panel consistently identified a hemizygous deletion (cnA=1, cnB=0) of BRCA2 gene-region, including detecting this mono-allelic deletion at samples with a low tumor fraction (around 15%). The inventors also found that the deletion was accompanied by a pathogenic missense BRCA2 germline mutation, showing that this patient had biallelic gene loss of function (monoallelic mutation coupled with loss of the other allele). Identification of this biallelic gene loss of function would not have been possible without detailed information about the asCNAs present in the sample. For both ATM mutants, complex asCNAs were detected in plasma samples for both patients. Specifically, both had a complex ATM copy number status (2 copies of one allele and 1 of the other) within aneuploid genome accompanied by a nonsense mutation harboured on the non-gained allele with a CN-corrected VAF=33% (see
Some drugs, for example PARP inhibitors, require full-impairment of genes to be effective cancer treatments. In view of this, the ability to accurately identify complex aberrations leading to gene impairment has important clinical implications. The method and assay of the invention is able to achieve this type of accurate identification. The results in the serial samples from the 1 BRCA mutant patient in
As such, the present inventors have demonstrated that the method of the invention is especially useful for the real-time strategic assessment of drug target biomarkers in cancer patients, including PARP-inhibitors, and thus allows for the optimization of treatments and the monitoring of disease progression/regression in patients undergoing treatments. The present methods find particular utility for the detection and monitoring of asCNAs in subjects known or suspected of having prostate cancer.
The present invention also provides a set of oligonucleotide probes and kits suitable for use in the method of the present invention.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly recognized by one of ordinary skill in the art to which this invention belongs.
As used herein, the term “biological sample” refers to a biological sample derived from a subject to be screened. The biological sample may be any suitable sample known in the art in which the expression of the selected markers can be detected. Included are individual cells and cell populations obtained from bodily tissues or fluids. Examples of suitable body fluids to be tested are plasma, blood, lymph, cerebral fluid, urine and saliva. Examples of suitable body tissues to be tested include prostate tissue, bladder tissue, breast tissue, ovarian tissue and pancreatic tissue.
As used herein, the term “non-tumor DNA” refers to chromosomal genomic DNA present in, or derived from, a non-cancerous cell. For example, non-tumor DNA may be derived from a white blood cell (WBC).
As used herein, the term “target gene” refers to a gene that is known or suspected of being commonly mutated in certain types of cancer and/or frequently altered in signalling pathways known or suspected of being druggable pathways for treating a cancer. Examples of such genes include: AR, AKT1, AKT2, AKT3, APC, ARID1A, ASXL1, ATM, ATR, AURKA, BRAF, BRCA1, BRCA2, CCND1, CDK12, CDK4, CDK6, CDKN1B, CDKN2A, CHD1, CLU, CTNNB1, CUL1, CYLD, ERCC1, KDM6A, ERCC2, ERCC3, MED12, ERCC4, ERCC5, ERG_TMPRSS2, SMARCA1, FAM183B, FAM60A, FANCA, FANCC, FANCD2, FANCE, FANCF, FANCG, FBXW7, FOXA1, FOXP1, GNAS, HSD3B1, IDH1, IDH2, KMT2C, KMT2D, KRAS, MDM2, MDM4, MET, MLH1, MSH2, MSH6, MYC, MYCN, NCOA2, NFE2L2, NKX3-1, PALB2, PIK3CA, PIK3CB, PIK3R1, PTEN, RAD51B, RAD51C, RB1, RNF43, RUNX1, RYBP, SPOP, TP53, ZBTB16 and ZFHX3. Preferred examples of target genes include: AKT1, AKT2, AKT3, APC, ARID1A, ASXL1, ATM, ATR, AURKA, BRAF, BRCA1, BRCA2, CCND1, CDK12, CDK4, CDK6, CDKN1B, CDKN2A, CHD1, CLU, CTNNB1, CUL1, CYLD, ERCC1, ERCC2, ERCC3, ERCC4, ERCC5, ERG_TMPRSS2, FAM183B, FAM60A, FANCA, FANCC, FANCD2, FANCE, FANCF, FANCG, FBXW7, FOXA1, FOXP1, GNAS, HSD3B1, IDH1, KMT2C, KMT2D, KRAS, MDM2, MDM4, MET, MLH1, MSH2, MSH6, MYC, MYCN, NCOA2, NFE2L2, NKX3-1, PALB2, PIK3CA, PIK3CB, PIK3R1, PTEN, RAD51B, RAD51C, RB1, RNF43, RUNX1, RYBP, SPOP, TP53, ZBTB16 and ZFHX3.
As used herein, the term “control gene” is a gene that is known to not be commonly mutated in a cancer and/or to not acquire mutations during the progression of a cancer. Examples of such genes include: ADAM2, APBB11P, AQP8, ATG14, ATXN7, C9orf47, CACNG5, CST7, CSTF2T, DRD1, ENPEP, EPN1, FAM183B, FAM60A, FCER1A, HDAC8, GRHPR, IL1RL2, KRT85, L3MBTL1, LRRC17, MESDC2, M1S18A, MTIF3, NAPG, NFXL1, N1T2, NOTCH3, OR3A3, PTAFR, PTGER4, PXDNL, RNF125, RSF1, SPDYA, TEX11, TK2, TOP3B, UGT2B17, VNN3 and ZBTB9. Preferred examples of control genes include: ADAM2, APBB11P, AQP8, ATG14, ATXN7, C9orf47, CACNG5, CST7, CSTF2T, DRD1, ENPEP, EPN1, FAM183B, FAM60A, FCER1A, GRHPR, IL1RL2, KRT85, L3MBTL1, LRRC17, MESDC2, MIS18A, MTIF3, NAPG, NFXL1, NIT2, NOTCH3, OR3A3, PTAFR, PTGER4, PXDNL, RNF125, RSF1, SPDYA, TK2, TOP3B, UGT2B17, VNN3 and ZBTB9. The control genes for one type of cancer may be different to the control genes for a different cancer, or they may be the same. For example, control genes for prostate cancer may be one or more control genes selected from the lists of control genes above.
As used herein, the term “gene-region” refers to a region of the genome that covers a gene in the genome, e.g. the genome of a subject, for example a human, and includes a gene and a maximum extension of 200 Kbp per side of the gene. A gene-region may cover the intronic (i.e. non-coding), exonic (i.e. coding) and flanking regions of a gene. Preferably, a gene-region of the invention covers intronic, exonic and flanking regions of a gene. A gene-region may be referred to using the genomic location of the region, for example using the coordinates of the start position and end position of the location in a specific chromosome. For a human subject a genomic region is suitably described by a genomic location, and in particular a genomic location with reference to a reference genome (for example, a digital nucleic acid sequence database, assembled a representative example of a species' set of genes). For example, for a human subject, with reference to the human reference genome GRCh37 (also referred to as Human Genome 19 (hg19)) or human reference genome GRCh38 (also referred to as Human Genome 38 (hg38)). For the present inventions, preferably the reference genome is human reference genome GRCh37 (also known as hg19).
As used herein, the term “target gene-region” refers to a gene-region that includes a target gene (supra), and the term “control gene-region” refers to a gene-region that includes a control target (supra).
As used herein, the term “allelic fraction” or “AF” refers to the number of times a mutated base/variant is observed at a genomic locus, divided by the total number of times any base is observed at the genomic locus. For example, in a sample comprising DNA derived from a diploid genome, an allelic fraction of 0.5 implies that there is one copy of an allele with base X at position Y and one copy of an allele with base Z at position Y. That is to say, that 50% of the DNA in the sample contains the allele with base X at position Y and the other 50% of the DNA in the sample contains the allele having base Z at position Y. In a DNA sample derived from an aneuploid cell (for example a cell that contains three copies of a chromosome), an AF of 0.33 for a first allele and an AF of 0.67 for a second implies that there is one copy of an allele with base X at position Y and two copy of an allele with base Z at position Y. That is to say, that 33% of the DNA in the sample contains the allele with base X at position Y and the other 67% of the DNA in the sample contains the allele having base Z at position Y.
As used herein, the term “single nucleotide polymorphism” or “SNP” refers to a polymorphism that occurs at a polymorphic site occupied by a single nucleotide. The site of the SNP is usually preceded by and followed by highly conserved sequences (e.g., sequences that vary in less than 1/100 or 1/1000 members of a population). As used herein, “SNPs” is the plural of SNP. SNPs are most frequently diallelic. The most common allele of a SNP is called a “major” or “wild-type” allele and an alternative allele of said SNP is called a “minor” or “mutant” allele. A SNP usually arises due to substitution of one nucleotide for another at the polymorphic site. SNPs can also arise from a deletion of a nucleotide or an insertion of a nucleotide relative to a reference allele.
As used herein, the term “SNP location” and “SNP locus” refer to a polymorphic site at which a polymorphism occurs. The term “SNP loci” is the plural form of the term “SNP locus”. A “SNP location” or “SNP locus” can be referred to using its unique dbSNP Reference SNP ID. (also referred to as the rsid) in the National Center for Biotechnology Information's dbSNP database. A dbSNP Reference SNP (rsid or RefSNP) number is a locus accession for a variant type assigned by dbSNP. The RefSNP catalog is a non-redundant collection of submitted variants which were clustered, integrated and annotated. RefSNP number is the stable accession regardless of the differences in genomic assemblies. They provide a stable variant notation for mutation and polymorphism analysis, annotation, reporting, data mining, and data integration. A RefSNP is represented by a number preceded by the letters “rs”. A “SNP location” or “SNP locus” can alternatively be referred to using the coordinates of the position of the polymorphic site in a specific chromosome. For a human subject a genomic location is suitably described by reference to a reference genome (for example, a digital nucleic acid sequence database, assembled from a representative example of a species' set of genes). For example, for a human subject, with reference to the human reference genome GRCh37 (also referred to as Human Genome 19 (hg19)) or human reference genome GRCh38 (also referred to as Human Genome 38 (hg38)). For the present inventions preferably the reference genome is human reference genome GRCh37 (also known as hg19).
As used herein, the term “polymorphism” refers to a genetic variation, or the occurrence of two or more genetically determined alternative sequences at a single genetic locus in a population. Each version of the sequence with respect to the polymorphic site is referred to as an “allele” of the polymorphic site. Typically, polymorphisms have two alleles, with the minor allele occurring at a frequency of greater than 1%, and more preferably greater than 5% or 10% of a selected population. The allelic form occurring most frequently in a selected population is sometimes referenced as the “wild-type” form. Diploid organisms may be homozygous or heterozygous for allelic forms. A biallelic polymorphism has two forms. A triallelic polymorphism has three forms. Examples of polymorphisms include restriction fragment length polymorphisms (RFLPs), variable number of tandem repeats (VNTRs), single nucleotide polymorphisms (SNPs), single nucleotide variants (SNVs), dinucleotide repeats, trinucleotide repeats, tetranucleotide repeats, simple sequence repeats, indels, and insertion elements such as Alu.
As used herein, the terms “single nucleotide variant” or “SNV” refer to a single nucleotide variation in a genome sequence. SNVs may be rare or common in a population. If an SNV is present in at least 1% of the population, the SNV may be referred to as a “single nucleotide polymorphism” or “SNP”.
SNPs tend to be evolutionarily stable from generation to generation and, as such, can be used to study specific genetic abnormalities throughout a population. If SNPs occur in the protein coding region (i.e. exonic region) it can lead to the expression of a variant, sometimes defective, form of the protein that may lead to development of a genetic disease. Such SNPs can therefore serve as effective indicators of the genetic disease. Some SNPs may occur in non-coding regions, but nevertheless, may result in differential or defective splicing, or altered protein expression levels. SNPs can therefore be used as diagnostic tools for identifying individuals with a predisposition for certain diseases, genotyping the individual suffering from the disease in terms of the genetic causes underlying the condition, and facilitating drug development based on the insight revealed regarding the role of target proteins in the pathogenesis process.
As used herein, the terms “heterozygous single nucleotide polymorphism” and “heterozygous SNP” refer to a SNP that is present in the DNA of a sample at an AF of between 0.05 to 0.95. In certain embodiments, the terms a heterozygous single nucleotide polymorphism” and “heterozygous SNP” refers a SNP that is present in the DNA of a sample at an AF of between 0.2 to 0.8.
As used herein, the term “minor allele frequency” or “MAF” refers to the frequency at which the second most common allele occurs in a given population. The frequency may be considered high or low. As used here, a “high MAF” is typically an allele that occurs at a frequency of 0.2 to 0.5 in a population, for example at a frequency of 0.2 to 0.5 in the 1000 Genomes Project Genotype Data (release 20130502; PMID: 26432245). As used herein, a “low MAF” is typically an allele that occurs at frequency of less than 0.2, for example less than 0.2 to around 0.01 in a population, for example at a frequency of 0.2 to 0.5 in the 1000 Genomes Project Genotype Data (release 20130502).
As used herein, the term “informative single nucleotide polymorphism”, “informative SNP” or “iSNP” refers to a heterozygous SNP that is present in a gene-region in the genome of a subject. The term “iSNPs” is the plural form of the term “iSNP”.
As used herein, the terms “iSNP location” or “iSNP locus” refer to a polymorphic site at which an iSNP occurs. The term “iSNP loci” is the plural form of the term “iSNP locus”. A “iSNP location” or “iSNP locus” can be referred to using its unique dbSNP Reference SNP ID. (also referred to as the rsid) in the National Center for Biotechnology Information's dbSNP database. A “iSNP location” or “iSNP locus” can alternatively be referred to using the coordinates of the position of the polymorphic site in a specific chromosome as described supra. For the present inventions, preferably the reference genome is human reference genome GRCh37 (also known as hg19).
As used herein, the terms “allele imbalance”, “allelic imbalance” and “AI” refer to an imbalance in the identity of the allele present in the genome of a subject. For AI to be present in a genome there must be at least two different alleles for a gene-region. A genome having identical alleles for a gene-region does not exhibit AI. A loss of heterozygosity (LOH) is a common form of allelic imbalance.
As used herein, the term “copy number” refers to the number of copies of a gene-region, gene, or part thereof, present in the chromosomal DNA of an individual or in chromosomal DNA derived from a cell of an individual. A “normal copy number” when used herein refers to the copy number of a normal or wild-type allele present in a normal cell of a subject. The copy number for any given gene-region, gene or part thereof, may range from 0 to 3 (for example 0, 1, 2 or 3), 0 to 4 (for example 0, 1, 2, 3 or 4), 0 to 5, 0 to 6, or 0 to more than 6. A change in copy number may arise from copy number alteration such as gains or losses of large segments of the genome.
As used herein, the terms “allele-specific copy number aberration”, “allele-specific informed copy number aberration” and “asCNA” refer to alteration in the total copy number of a gene-region, or part thereof, and the specific number of copies of each chromosome. Types of asCNAs include: copy number gain, copy number loss, and copy number-neutral loss of heterozygosity. A gain in copy number may be a gain of either one or both of the two inherited copies. An gain may be referred to as “unbalanced” when the gain differs in magnitude between the alleles (for example a 2,1 gain, 3,2 gain, a 4,2 gain, a 4,3 gain etc.). An gain may be referred to as “balanced” when both alleles gained equally much (for example a 2,2 gain, 3,3 gain, a 4,4, gain, a 5,5, gain, etc.). A copy number loss may be the loss of one of the parental copies of a gene-region, or part thereof (this may be referred to as a mono-allelic copy number loss). Such a loss may result in loss of heterozygosity (LOH) if the parental copies were heterozygous for that gene. A copy number loss may be the loss of both of the parental copies of a gene-region, or part thereof (this may be referred to as a bi-allelic copy number loss). The loss of one parental copy of a gene-region, or part thereof, accompanied by a simultaneous gain of the other parental copy of the same gene-region, or part thereof can also occur. If the parental copies were heterozygous for that gene, such an aberration may result in a copy number-neutral loss of heterozygosity (i.e. a loss of heterozygosity without a change in total copy number). This may also be referred to as a loss of heterozygosity 0,2. The loss of one parental copy of a gene-region, or part thereof, accompanied by a simultaneous gain of multiple copies, for example 2, 3, 4, 5, or more than 5 copies, of the other parental copy of the same gene-region, or part thereof can also occur. Such an aberration can be referred to as a loss of heterozygosity 0,3 when two copies of the other parental copy of the same gene-region are gained; a loss of heterozygosity 0,4 when three copies of the other parental copy of the same gene-region are gained; and a loss of heterozygosity 0, X when X-1 copies of the other parental copy of the same gene-region are gained wherein X is 4, 5, 6, 7, or a higher integer.
As used herein, the term “androgen receptor (AR) associated” means a gene-region that includes a gene associated with the function in the androgen (receptor) signalling pathway. Examples of such genes include: AR, FOXA1, FOXP1, HSD3B1, NCOA2 and ZBTB16. Aberrant activity of the androgen receptor is associated with certain types of prostate cancer. Aberrant activity of the androgen receptor has also been associated with breast cancer, ovarian cancer, pancreatic cancer and bladder cancer. If a genetic aberration, such as an asCNA, or a somatic or germline mutation, is detected in one or more gene-region of this type in a subject, it indicates that a subject would benefit from ceasing or altering treatment with one or more class of drug that is known or suspected of targeting AR function. An example of this class of drug include hormonal agents, such as LHRH agonists (for example leuprolide, goserelin, triptorelin, or histrelin), LHRH antagonists (for example degarelix), androgen blockers (for example abiraterone or ketoconazole), anti-androgens (for example flutamide, bicalutamide, nilutamide, enzalutamide, apalutamide or darolutamide), androgen synthesis inhibitors (for example abiraterone), estrogens and steroids (for example prednisone or dexamethasone); and in particular androgen blockers (for example abiraterone or ketoconazole), anti-androgens (for example flutamide, bicalutamide, nilutamide, enzalutamide, apalutamide or darolutamide) and androgen synthesis inhibitors (for example abiraterone). It also indicates that a subject would benefit from ceasing or altering treatment with one or more class of drug that is known or suspected of targeting AR function from treatment with one or more alternative cancer treatment, for example a chemotherapy such as a taxane (for example docetaxel and cabazitaxel) or a platinum-based antineoplastic drugs (for example carboplatin).
As used herein, the term “cell cycle associated” means a gene-region that includes a gene associated with the process of growth and proliferation of a cell. Examples of such genes include: AURKA, BRAF, CCND1, CDK12, CDK4, CDK6, CDKN1B, CDKN2A, CUL1, FBXW7, KRAS, MDM2, MDM4, MYC, MYCN, RB1 and TP53. Genetic aberrations in genes associated with growth and proliferation of a cell are known to occur in cancer such as prostate cancer, breast cancer, ovarian cancer, pancreatic cancer and bladder cancer, and in metastatic cancer. If a genetic aberration, such as an asCNA, or a somatic or germline mutation, is detected in one or more gene-region of this type in a subject, it indicates that a subject might benefit from treatment with one or more class of drug that is known or suspected of targeting cell cycle perturbations. Examples of such class of drug include: ATR inhibitors (for example Berzosertib), CDK inhibitors (for example Flavopiridol (alvocidib), abemaciclib, ribociclib, Olomoucine, Roscovitine (Seliciclib), Purvalanol, Paullones, Butryolactone, Thio/oxoflavopiridols, Oxindoles, Aminothiazoles, Benzocarbazoles, and Pyrimidines; and in particular Flavopiridol, Palbociclib, ribociclib and abemaciclib) chemotherapies (for example a taxane (for example docetaxel or cabazitaxel), and c-Met inhibitors (for example cabozantinib)), WEE1 inhibitors (for example adavosertib), Aurora kinase inhibitors (for example Alisertib, ZM447439, hesperidin, and VX-680) and alkylating agents (for example nitrogen mustards (such as cyclophosphamide, chlormethine, uramustine, melphalan, chlorambucil, ifosfamide, and bendamustine), nitrosoureas (such as carmustine, lomustine, and streptozocin) and alkyl sulfonates (such as busulfan)).
As used herein, the term “chromatin remodelling associated” means a gene-region that includes a gene that is associated with cell growth and cell division steps, such as cell-cycle progression and chromosome segregation. Genes associated with chromatin remodelling typically exert a suppressive effect on tumor growth. Examples of such genes include: ARID1A, CHD1, KDM6A, MED12, SMARCA1, KMT2C, KMT2D and RYBP. Genetic aberrations in genes associated with chromatin remodelling functions are known to occur in cancer such as prostate cancer, breast cancer, ovarian cancer, pancreatic cancer and bladder cancer, and in metastatic cancer. If a genetic aberration, such as an asCNA, or a somatic or germline mutation, is detected in one or more gene-region of this type in a subject, it indicates that a subject might benefit from treatment with one or more class of drug that is known or suspected to target chromatin remodelling functions. Examples of such class of drug include DNMT1 inhibitors (for example 5-azacitidine), HDAC inhibitors (for example vorinostat and romidepsin) and BET inhibitors (for example I-BET 151, I-BET 762, OTX-015, TEN-010, CPI-203, CPI-0610, olinone, RVX-208, ABBV-744, AZD5153, MT-1, and MS645).
As used herein, the term “DNA repair associated” means a gene-region that includes a gene associated with the identification and correction of damage to DNA. Examples of such genes include: ATM, ATR, BRCA1, BRCA2, CHD1, ERCC1, ERCC2, ERCC3, ERCC4, ERCC5, FANCA, FANCC, FANCD2, FANCE, FANCF, FANCG, MLH1, MSH2, MSH6, PALB2, RAD51B and RAD51C. Genetic aberrations in genes associated with DNA repair functions are known to occur in cancer such as prostate cancer, breast cancer, ovarian cancer, pancreatic cancer and bladder cancer, and in metastatic cancer. If a genetic aberration, such as an asCNA, or a somatic or germline mutation, is detected in one or more gene-region of this type in a subject, it indicates that a subject might benefit from treatment with one or more class of drug that is known or suspected to target a DNA repair function. Examples of such class of drug include PARP inhibitors (for example example olaparib, rucaparib, niraparib or talazoparib, Veliparib, Pamiparib, Rucaparib, and Veliparib; and in particular example olaparib, rucaparib, niraparib and talazoparib), ATR inhibitors (for example Berzosertib), CDK inhibitors (for example Flavopiridol (alvocidib), abemaciclib, ribociclib, Olomoucine, Roscovitine (Seliciclib), Purvalanol, Paullones, Butryolactone, Thio/oxoflavopiridols, Oxindoles, Aminothiazoles, Benzocarbazoles, and Pyrimidines; and in particular Flavopiridol, Palbociclib, ribociclib and abemaciclib), DNA-PK inhibitors (for example AZD7648, M3814, CC-122 and CC-115), immune checkpoint therapies (for example PD-1 inhibitors (e.g. pembrolizumab, nivolumab, cemiplimab, and spartalizumab), PD-L1 inhibitors (e.g. atezolizumab, avelumab and durvalumab), or CTLA-4 inhibitord (e.g. ipilimumab)), CHK1 inhibitors (for example V158411, PF-477736 and AZD7762), CHK2 inhibitors (for example CCT241533 and Aminopyridine 7), WEE1 inhibitors (for example adavosertib), platinum-based antineoplastic drugs (for example cisplatin, carboplatin, oxaliplatin, nedaplatin, triplatin tetranitrate, picoplatin, satraplatin and phenanthriplatin, and in particular cisplatin, carboplatin, oxaliplatin, nedaplatin), radionuclide and radiation therapies (for example radium-223 and PSMA-targeting radionuclide therapies (for example 225Ac-Labeled PSMA-617 and 177Lu-Labeled PSMA-617).
As used herein, the term “PI3K Associated” means a gene-region that includes a gene associated with the phosphoinositide 3-kinase (PI3K) signalling pathway. Typically, genes associated with the PI3K signalling pathway are involved in the stimulation of cell proliferation and growth, and the inhibition of cell apoptosis. Examples of such genes include: AKT1, AKT2, AKT3, MET, PIK3C, PIK3CB, PIK3R1 and PTEN. Genetic aberrations in genes associated with the PI3K signalling pathway are known to occur in cancer such as prostate cancer, breast cancer, ovarian cancer, pancreatic cancer and bladder cancer, and in metastatic cancer. If a genetic aberration, such as an asCNA, or a somatic or germline mutation, is detected in one or more gene-region of this type in a subject, it indicates that a subject might benefit from treatment with one or more class of drug that is known or suspected to target the PI3K signalling pathway. Examples of such class of drug include PI3K inhibitors (for example idelalisib, copanlisib, duvelisib, alpelisib, umbralisib, dactolisib, voxtalisib, Taselisib, Idelalisib, Buparlisib, Duvelisib, and Copanlisib and in particular idelalisib, copanlisib, duvelisib, alpelisib, and umbralisib) and mTOR inhibitors (for example rapamycin, deforolimus, dactolisib, voxtalisib, temsirolimus, everolimus, sapanisertib, AZD8055, and AZD2014).
As used herein, the term “Wnt signalling associated” means a gene-region that includes a gene associated with the Wnt signalling pathway. Typically, genes associated with the Wnt signalling pathway are involved in cell fate, cell migration, cell polarity and neural development. Examples of such genes include: APC, CTNNB1 and RNF43. Genetic aberrations in genes associated with the Wnt signalling pathway are known to occur in cancer such as prostate cancer, breast cancer, ovarian cancer, pancreatic cancer and bladder cancer, and in metastatic cancer. If a genetic aberration, such as an asCNA, or a somatic or germline mutation, is detected in one or more gene-region of this type in a subject, it indicates that a subject might benefit from treatment with one or more class of drug that is known or suspected to target the Wnt signalling pathway. Examples of such class of drug include PORCN inhibitors (for example WNT974, ETC-1922159 and CGX1321), FZD antagonists/monoclonal antibodies (for example Vantictumab, Ipafricept, and OTSA101-DTPA-90Y) and Inhibitor of Wnt target genes (for example SM08502).
As used herein, the term “ploidy” refers to the number of copies of each chromosome present in the genome of a cell. Haploid, diploid, triploid, tetraploid, pentaploid and hexaploid are kinds of ploidy. Specifically, the term “haploid” refers to a cell that has one copy of each chromosome. The term “diploid” refers to a cell that has two copies of each chromosome. The term “triploid” refers to a cell that has three copies of each chromosome. The term “tetraploid” refers to a cell that has four copies of each chromosome. The term “pentaploid” refers to a cell that has five copies of each chromosome. The term “hexaploid” refers to a cell that has six copies of each chromosome. A human cell is typically diploid. That is to say that a typical human cell comprises two sets of chromosomes (i.e. 46 chromosomes). However, may types of cancer cells harbour genetic aberrations that result in gain or loss of whole, or parts of, one or more chromosomes. Thus, a human cancer cell cannot be assumed to be diploid. A cell having an abnormal number of chromosomes due to a loss or gain of specific chromosome(s) may be described as aneuploid. A cell having an abnormal number of chromosomes due to a loss or gain of a whole set of chromosome may be described as euploid.
As used herein, the term “Log 2R” refers a function of the percentage of aberrant tumor cells (% AC) that contribute to the copy number alteration and of the copy number in a sample of tumor DNA.
The term “whole exome sequencing” or “WES” refers the nucleotide sequencing of the protein coding regions (i.e. exons) of the genes in the genome.
As used herein, the term “circulating free DNA” (cfDNA) means the DNA fragments that have been released into the blood plasma and are found freely circulating the blood stream, as well as in the urine, released from any cell type in the body. cfDNA is generally double-stranded DNA consisting of small fragments (70 to 200 bp).
As used herein, the term “circulating tumor DNA” (ctDNA) means the DNA fragments that have been released from tumor cells into the blood plasma and are found freely circulating the blood stream, as well as in the urine. ctDNA is generally double-stranded DNA consisting of small fragments (70 bp to 200 bp).
As used herein, the terms “tumor fraction” and “tumor content” or “TC” of a DNA sample refers to the fraction or percentage of DNA molecules derived from tumor cells in the DNA sample compared to the DNA molecules that are not derived from a cancer cell. For example, for a sample comprising cfDNA, the terms “tumor fraction” and “tumor content” of a cfDNA refer to the fraction or percentage of cfDNA molecules derived from tumor cells (i.e. the ctDNA) in a cfDNA sample compared to the cfDNA molecules that are not derived from a cancer cell. cfDNA that is not derived from a cancer cells in a cfDNA sample may be derived from blood cells, for example white blood cells (leukocyte), and other non-cancerous tissues.
As used herein, the term “genomic location” refers to the location of a region of a genome, e.g. the genome of a subject, for example a human. It may be referred to using the coordinates of the start position and end position of the location in a specific chromosome. For a human subject a genomic location is suitably described by reference to a reference genome (for example, a digital nucleic acid sequence database, assembled from a representative example of a species' set of genes). For example, for a human subject, with reference to the human reference genome GRCh37 (also referred to as Human Genome 19 (hg19)) or human reference genome GRCh38 (also referred to as Human Genome 38 (hg38)). For the present inventions, preferably the reference genome is human reference genome GRCh37 (also known as hg19). As such, a genomic location for a human may be described using the coordinates of the start position and end position of the location in a specific chromosome, with reference to the Genome Reference Consortium Human Build 37 (GRCh37) (also referred to as Human Genome 19 (hg19)).
The term “staging” as used herein refers to the process of determining the extent a cancer has developed by growing and/or spreading in a subject. Typically, classification of a cancer involves assigning the cancer a number from I to IV, wherein I is an isolated cancer and IV is a metastatic cancer that has spread to other parts of the body, distal to the primary cancer site.
The term “classification” as used herein refers to the process of classifying the type of cancer in a subject by determining the type of tissue in a subject from which a cancer originates and/or by determining the primary site in the body wherein the cancer first developed. For example, a cancer classified based on the type of tissue in the subject from which a cancer originates may be classified as a carcinoma (for example a colon, prostate or bladder carcinoma), sarcoma, myeloma, leukemia, lymphoma or mixed type (for example, adenosquamous carcinoma, mixed mesodermal tumor, carcinosarcoma, teratocarcinoma).
The term “screening” as used herein refers to the process of checking for a cancer in a subject not known to have cancer. For example, checking for a copy number change or aberration within gene-regions commonly associated with a particular cancer type.
The term “prognostication” as used herein refers to the process of estimating/predicting the likely course and outcome of a cancer, and/or chance that a subject has of recovering from a cancer. For example, a subject whose cancer is not regressing in response to certain cancer treatment(s), as determined, for example by using a method of the present invention, may be considered to have a poor prognosis.
The term “stratification” as used herein refers to the process of stratifying a subject into a molecular group based on the molecular profile the subject's cancer and the predicted response of the subject to a certain treatment. For example stratifying by DNA repair gene associated alteration, such as BRCA1/2 alteration, for predicted response to treatment with a PARP inhibitor or other drugs targeting DNA repair.
As used herein, a “subject” refers to an animal, including mammals such as humans. Preferably, the subject is a human subject. As used herein, an “individual” can be a subject. As used herein, a “patient” refers to a human subject. In one embodiment, the subject is known or suspected to have a cancer (for example prostate cancer), and/or is known or suspected to have a risk of developing cancer (for example prostate cancer), or is known to have cancer and is known or suspected to have metastatic cancer (for example metastatic prostate cancer) or to have a risk of developing metastatic cancer (for example metastatic prostate cancer). In some embodiments, the subject is a subject who has been identified as being at risk of developing a cancer, in particular at risk of developing a prostate cancer. A “subject” may be male or female. In certain embodiments, the subject is male (for example wherein the subject is known or suspected to have/have a risk of developing prostate cancer).
As used herein, a “healthy subject” or “healthy volunteer” refers to a subject that has not been diagnosed with a type of cancer (for example prostate cancer), and preferably has not been diagnosed with any type of cancer. Thus, for example, a method of relating to prostate cancer, a “healthy subject” or “healthy volunteer” has no prostate cancer, and preferably no other type of cancer. Preferably, a healthy subject has not been diagnosed with a type of cancer (for example prostate cancer), and is not suspected of having a type of cancer, and suitably has not been diagnosed with any type of cancer (for example prostate cancer), and is not suspected of having any type of cancer.
As used herein, the term “nucleic acid” means a single or double-stranded deoxyribonucleotide or ribonucleotide polymer of any length, and include as non-limiting examples, coding and non-coding sequences of a gene, sense and antisense sequences, exons, introns, genomic DNA, cDNA, pre-mRNA, mRNA, rRNA, siRNA, miRNA, tRNA, ribozymes, recombinant polynucleotides, isolated and purified naturally occurring DNA or RNA sequences, synthetic RNA and DNA sequences, nucleic acid probes, primers, and fragments thereof. Reference to a polynucleotide(s) is to be similarly understood.
As used herein, the term “oligonucleotide(s)” refers to nucleic acid molecules that usually comprise between 5 and 100 contiguous bases, for example between 5-10, 5-20, 10-20, 10-50, 15-50,15-100, 20-50, or 20-100 contiguous bases. An oligonucleotide may be capable of hybridising to a target of interest, e.g., a sequence that include an iSNP of the present invention (e.g. an iSNP as defined in Table 1 or Table 2). An oligonucleotide for hybridising to a target may comprise at least 5, least 10, at least 15, at least 20, at least 30, at least 40, at least 50 or at least 60 nucleotides. An oligonucleotide can be used as a primer, a probe, included in a microarray, or used in polynucleotide-based identification methods. For example, the oligonucleotide may be a probe that is complementary to, and capable of hybridizes to, a nucleotide sequence of interest. For example, the probe may be capable of hybridizing to a SNP site present in tumor DNA and/or non-tumor DNA obtained from a subject (e.g. capable of hybridizing to a SNP site defined in Table 1 or Table 2).
As used herein, a “subtype of a cancer” (for example a “subtype of prostate cancer”) is a subset of a type of cancer based on characteristics of the cancer cells, and in particular molecular and genetic characteristics of the cells. Different cancer subtypes can have different disease progression and can respond or not respond to different treatments. The subtype of a cancer is, for example, used to assist in planning treatment and determine prognosis of the patient having that cancer subtype. Examples of a subtypes of prostate cancer include hormone sensitive prostate cancer (HSPC) and castration resistant prostate cancer (CRPC).
The present invention provides an in vitro method for staging, classification, screening monitoring, stratification, selecting treatment for, ascertaining whether treatment is working in, and/or prognostication of cancer in a subject.
The in vitro method of the present invention comprises steps i) to v), as described in further detail below:
The in vitro method of the invention comprises a step of providing a biological sample obtained from the subject comprising tumor DNA. The biological sample comprising tumor DNA may further comprises non-tumor DNA (for example, non-tumor DNA in a plasma sample or a tissue sample). That is to say, that the biological sample may contain DNA derived from a cancer cell in a subject and DNA derived from a non-cancerous (i.e. healthy) cell in the subject. If the biological sample comprises both tumor DNA and non-tumor DNA, the same sample can be provided as the sample comprising tumor DNA and the biological sample comprising non-tumor DNA.
Typically, the biological sample is a blood sample (for example a blood sample, plasma sample or a white blood cell sample), urine sample, saliva sample, tissue sample, or cerebral spinal fluid obtained from the subject.
In certain embodiments, the biological sample comprising tumor DNA and the biological sample comprising non-tumor DNA are separate samples. In certain embodiments, the biological sample comprising tumor DNA and the biological sample comprising non-tumor DNA are the same sample, i.e. a sample comprising a mixture of tumor DNA and non-tumor DNA. In certain embodiments, the biological sample comprising non-tumor DNA may be a tissue sample from a healthy, non-cancerous organ or tissue within the subject, a saliva sample, a blood sample, or white blood cell sample from a subject. Typically, the biological sample comprising non-tumor DNA is a saliva sample, white blood cell sample, or blood sample from a subject. A white blood cell sample may be obtained from a blood sample from a subject, and in particular obtained from the buffy coat separated from a blood sample from a subject. The buffy coat may be separated by centrifugation of the blood sample.
In certain embodiments the biological sample comprising tumor DNA may be a tissue sample from a tumor within the subject, or it may be a plasma sample comprising ctDNA, or it may be a blood sample from a subject. Typically, the biological sample comprising non-tumor DNA is a plasma sample comprising ctDNA, or a blood sample from a subject. A plasma sample comprising ctDNA may be obtained from a blood sample from a subject. The plasma sample comprising ctDNA may be obtained by separating the plasma from the buffy coat in the blood sample, for example by centrifugation of the blood sample.
In certain other embodiments, the biological sample comprising tumor DNA and the biological sample comprising non-tumor DNA are the same sample. For example, the biological sample provided in step i) is a blood sample (for example a plasma sample), urine sample, tissue sample, or cerebral spinal fluid sample comprising both tumor DNA derived from a cancer cell in the subject and non-tumor DNA derived from a non-cancerous cell in the subject. Typically, the tumor DNA present in a blood sample (for example, a plasma sample) or urine sample is circulating tumor DNA (ctDNA).
In embodiments wherein the biological sample comprising tumor DNA and the biological sample comprising non-tumor DNA are the same sample, and the sample is a blood sample, the sample may be a single blood sample that can be separated into a white blood cell sample (for example obtained from the buffy coat separated from the blood sample) to provide the sample comprising non-tumor DNA, and a plasma sample (for example obtained by separating the plasma from a blood sample) to provide the sample comprising non-tumor DNA. Alternatively, or additionally, the blood sample may be a single plasma sample comprising both tumor and non-tumor DNA. For example, the blood sample may be a single plasma sample comprising ctDNA and non-tumor cfDNA.
In certain preferred embodiments, the biological sample comprising tumor DNA is a blood sample comprising circulating tumor DNA (ctDNA). The present inventors have found that the method of the invention is especially effective when using a plasma sample comprising ctDNA obtained from a subject. Thus, preferably, the biological sample comprising tumor DNA is a plasma sample comprising ctDNA. More preferably, the blood sample is a plasma sample that comprises, in addition to the ctDNA, non-tumor cell-free DNA (cfDNA). In certain embodiments, the biological sample comprising tumor DNA is a blood sample, and the biological sample comprising non-tumor DNA is the same blood sample, and the tumor DNA in the sample is ctDNA present in the plasma of the blood sample, and the non-tumor DNA in the sample is genomic DNA from white blood cells present in the blood sample.
Typically, the biological sample comprises a plurality of DNA molecules. For example, at least 10,000, at least 50,000, at least 100,000, at least 500,000, at least 1,000,000 (106), at least 5,000,000 (5×106), at least 10,000,000 (107), at least 100,000,000 (108), at least 1,000,000,000 (109), 5,000,000,000 (5×109), at least 10,000,000,000 (1010) or at least 15,000,000,000 (1.5×1010) DNA molecules. Preferably, the biological sample comprises, at least 100,000, at least 500,000, at least 1,000,000 (106), at least 5,000,000 (5×106), at least 10,000,000 (107), at least 100,000,000 (108), at least 1,000,000,000 (109), 5,000,000,000 (5×109), at least 10,000,000,000 (1010) or at least 15,000,000,000 (1.5×1010) DNA molecules. More preferably, at least 10,000,000 (107), at least 100,000,000 (108), at least 1,000,000,000 (109) DNA molecules, 5,000,000,000 (5×109), at least 10,000,000,000 (1×1010) or at least 15,000,000,000 (1.5×1010).
The quantity of DNA molecules in a sample that are tumor DNA molecules is referred to herein as the percentage tumor content (TC) of a sample. That is to say, that TC refers to the percentage of DNA molecules in a sample that are tumor DNA molecules.
Suitably, the biological sample comprising tumor DNA provided in step i) has a tumor content of at least about 1%, at least about 2%, at least about 3%, at least about 4%, or at least about 5%. For example, the biological sample comprising tumor DNA may have a tumor content of at least about 2%, at least about 3%, at least about 4%, or at least about 5%, at least 7%, at least 10%, at least 15%, at least 20%, at least 30%, at least 40%, at least 50%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, or 100%. Preferably, the biological sample comprising tumor DNA provided in step i) has a tumor content of at least about 2%, at least 5%, at least 10%, or at least 15%.
The present inventors have found that the present method is surprising sensitive even when the tumor content of the biological sample is low (for example, about 1% to about 5% tumor content). Thus, in certain embodiments, the biological sample provided in step i) has a tumor content of about 1% to about 5%, for example about 2% to about 5%. In certain embodiments, the biological sample provided in step i) has a tumor content of about 1% to about 15%, for example about 2% to about 15%, or about 5% to about 15%.
In embodiments wherein the biological sample comprising tumor DNA is a tissue sample, the tumor DNA must be extracted from the cells present in the tissue sample. The tumor DNA can be isolated from the biological sample using a variety of techniques known in the art.
In embodiments wherein the biological sample comprising non-tumor DNA is a tissue sample, saliva sample, or white blood cell sample, the non-tumor DNA must be extracted from the cells present in the sample. The non-tumor DNA can be isolated from the biological sample using a variety of techniques known in the art.
In certain embodiments, the tumor DNA and/or non-tumor DNA, is amplified before analysis. Amplification techniques are known to those of ordinary skill in the art and include, but are not limited to, cloning, polymerase chain reaction (PCR), polymerase chain reaction of specific alleles (PASA), polymerase chain ligation, nested polymerase chain reaction, and so forth. The preferred amplification technical for use in the present invention is PCR.
The method of the present invention comprises a step of detecting the presence of single nucleotide polymorphisms (SNPs) in the non-tumor DNA at:
Tables 1 and 2 are provided below. The genomic locations recited in Tables 1 and 2 are locations with reference to the human reference genome GRCh37 (also known as hg19). The SNP loci defined in Tables 1 and 2 are collectively referred to herein as the “Example 1 SNP panel”.
†gene-region loci are provided with reference to the Genome Reference Consortium Human Build 37 (GRCh37), which is herein also referred to as “hg19”. The loci are provide in the following form: X:Y-Z, wherein X is the chromosome name, Y is the start position of the gene-region, and Z is the end position of the gene-region.
The gene-regions recited above in Table 1 are gene-regions that contain genes mutated in prostate cancer and/or altered in signalling pathways known or suspected of being druggable pathways for treating a prostate cancer. The gene-regions defined in Table 1 are hereinafter referred as “target gene-regions”.
†gene-region loci are provided with reference to the Genome Reference Consortium Human Build 37 (GRCh37), which is herein also referred to as “hg19”. The loci are provide in the following form: X:Y-Z, wherein X is the chromosome name, Y is the start position of the gene-region, and Z is the end position of the gene-region.
The gene-regions recited above in Table 2 are gene-regions that contain genes that are known to not be commonly mutated in a cancer. Table 2 also includes gene-regions that contain the genes, UGT2B17 and ZBTB9. UGT2B17 and ZBTB9 are known to exhibit high frequency germline copy number losses. Gene-regions containing UGT2B17 and ZBTB9 are included as positive controls for the method of the invention because they are known to commonly contain asCNAs. If a user of the present method determined that the gene-regions containing UGT2B17 and ZBTB9 did not contain any asCNAs, it would suggest that the results for that biological sample are inconclusive and/or unreliable. The gene-regions defined in Table 2 are hereinafter referred to as the “control gene-regions”.
Tables 1 and 2 recite the SNPs within each gene-region defined in Tables 1 and 2. The present inventors selected the SNPs recited in Tables 1 and 2 after discovering that selecting high MAF SNPs in the bespoke gene-regions of the selected genes lead to accurate and sensitive results for establishing allelic imbalance and copy number loss in samples from subjects having cancer. The inventors determined that the SNPs of Tables 1 and 2 have a high MAF, and a high MAF across subjects of different ethnicities, using the 1000 Genomes Project Genotype Data (release 20130502; PMID: 26432245). The SNPs recited in Tables 1 and 2 are located within the exonic regions, intronic regions or intergenic flanking regions of the gene-regions defined in Table 1 or 2.
In certain embodiments, step ii) of the present method comprises detecting at least one (for example 1, 2, 3, 4 or 5; typically at least 2, at least 3 or at least 4) target gene-region selected from the group consisting of FOXA1, FOXP1, HSD3B1, NCOA2 and ZBTB16 as defined in Table 1. Genetic aberrations in such gene-regions are known or suspected of being associated with the function in the androgen (receptor) signalling pathway.
In certain embodiments, step ii) of the present method comprises detecting at least one (for example 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 15 or 17; typically at least 2, at least 3 or at least 4) target gene-region selected from the group consisting of AURKA, BRAF, CCND1, CDK12, CDK4, CDK6, CDKN1B, CDKN2A, CUL1, FBXW7, KRAS, MDM2, MDM4, MYC, MYCN, RB1 and TP53 as defined in Table 1. Genetic aberrations in such gene-regions are known or suspected of being associated with cell cycle dysfunction.
In certain embodiments, step ii) of the present method comprises detecting at least one (for example 1, 2, 3, 4, 5, 6, 7 or 8; typically at least 2, at least 3 or at least 4) target gene-region selected from the group consisting of AKT1, AKT2, AKT3, MET, PIK3C, PIK3CB, PIK3R1 and PTEN as defined in Table 1. Genetic aberrations in such gene-regions are known or suspected of being associated with the phosphoinositide 3-kinase (PI3K) signalling pathway.
In certain embodiments, step ii) of the present method comprises detecting at least one (for example 1, 2, 3, 4, 5, 6, 7 or 8; typically at least 2, at least 3 or at least 4) target gene-region selected from the group consisting of ARID1A, CHD1, KDM6A, MED12, SMARCA1, KMT2C, KMT2D and RYBP as defined in Table 1. Genetic aberrations in such gene-regions are known or suspected of being associated with chromatic remodeling dysfunction.
In certain embodiments, step ii) of the present method comprises detecting at least one (for example 1, 2, 3, 4, 5, 6, 8, 10, 12, 15, 20 or 22; typically at least 2, at least 3 or at least 4) target gene-region selected from the group consisting of ATM, ATR, BRCA1, BRCA2, CHD1, ERCC1, ERCC2, ERCC3, ERCC4, ERCC5, FANCA, FANCC, FANCD2, FANCE, FANCF, FANCG, MLH1, MSH2, MSH6, PALB2, RAD51B and RAD51C as defined in Table 1. Genetic aberrations in such gene-regions are known or suspected of being associated with DNA repair dysfunction.
In certain embodiments, step ii) of the present method comprises detecting at least one (for example 1, 2, 3, 4, 5, 6, 8, 10, or 12; typically at least 2, at least 3 or at least 4) target gene-region selected from the group consisting of ASXL1, CLU, CYLD, ERG_TMPRSS2, GNAS, IDH1, IDH2, NFE2L2, NKX3-1, RUNX1, SPOP and ZFHX3 as defined in Table 1. Genetic aberrations in such gene-regions are known to be associated with prostate cancer, although their precise role in the development and/or progression of prostate cancer of a subject are currently unknown.
In certain embodiments, step ii) of the present method comprises detecting at least one (for example 1, 2, or 3; typically at least 2) target gene-region selected from the group consisting of APC, CTNNB1 and RNF43 as defined in Table 1. Genetic aberrations in such gene-regions are known or suspected to dysregulate the Wnt signalling pathway.
In certain preferred embodiments, step ii) of the present method comprises detecting at least one (for example 1, 2, 3, 4, 5, or 6; typically at least 2, at least 3, at least 4, at least 5 or 6) target gene-region selected from the group consisting of BRCA2, ATM, RB1, NKX3-1, TP53, and PTEN as defined in Table 1. In certain embodiments, preferably step ii) of comprises detecting at least 3 of the target gene-regions selected from the group consisting of BRCA2, ATM, RB1, NKX3-1, TP53, and PTEN as defined in Table 1. More preferably, step ii) of the present method comprises detecting the gene-regions of at least BRCA2, ATM, RB1, NKX3-1, TP53, and PTEN as defined in Table 1.
In certain embodiments, step ii) of the present method comprises detecting at least one target gene-region selected from the group consisting of FOXA1, FOXP1, HSD3B1, NCOA2 and ZBTB16; and/or at least one target gene-region selected from the group consisting of AURKA, BRAF, CCND1, CDK12, CDK4, CDK6, CDKN1B, CDKN2A, CUL1, FBXW7, KRAS, MDM2, MDM4, MYC, MYCN, RB1 and TP53; and/or at least one target gene-region selected from the group consisting of ARID1A, CHD1, KMT2C, KMT2D and RYB; and/or at least one target gene-region selected from the group consisting of ATM, ATR, BRCA1, BRCA2, CHD1, ERCC1, ERCC2, ERCC3, ERCC4, ERCC5, FANCA, FANCC, FANCD2, FANCE, FANCF, FANCG, MLH1, MSH2, MSH6, PALB2, RAD51B and RAD51C; and/or at least one target gene-region selected from the group consisting of ASXL1, CLU, CYLD, ERG_TMPRSS2, GNAS, IDH2, NFE2L2, NKX3-1, RUNX1, SPOP and ZFHX3; and/or at least one target gene-region selected from the group consisting of AKT1, AKT2, AKT3, MET, PIK3C, PIK3CB, PIK3R1 and PTEN; and/or at least one target gene-region selected from the group consisting of APC, CTNNB1 and RNF43.
In certain embodiments, step ii) of the present method comprises detecting at least two, at least three or at least four target gene-region selected from the group consisting of FOXA1, FOXP1, HSD3B1, NCOA2 and ZBTB16; and/or at least two, at least three or at least four target gene-region selected from the group consisting of AURKA, BRAF, CCND1, CDK12, CDK4, CDK6, CDKN1B, CDKN2A, CUL1, FBXW7, KRAS, MDM2, MDM4, MYC, MYCN, RB1 and TP53; and/or at least two, at least three or at least four target gene-region selected from the group consisting of ARID1A, CHD1, KMT2C, KMT2D and RYB; and/or at least two, at least three or at least four target gene-region selected from the group consisting of ATM, ATR, BRCA1, BRCA2, CHD1, ERCC1, ERCC2, ERCC3, ERCC4, ERCC5, FANCA, FANCC, FANCD2, FANCE, FANCF, FANCG, MLH1, MSH2, MSH6, PALB2, RAD51B and RAD51C; and/or at least two, at least three or at least four target gene-region selected from the group consisting of ASXL1, CLU, CYLD, ERG_TMPRSS2, GNAS, IDH2, NFE2L2, NKX3-1, RUNX1, SPOP and ZFHX3; and/or at least two, at least three or at least four target gene-region selected from the group consisting of AKT1, AKT2, AKT3, MET, PIK3C, PIK3CB, PIK3R1 and PTEN; and/or at least one, at least two or each target gene-region selected from the group consisting of APC, CTNNB1 and RNF43.
In certain embodiments, step ii) comprises detecting the presence of SNPs in the non-tumor DNA present in a biological sample at:
In certain embodiments, step ii) comprises detecting the presence of SNPs in the non-tumor DNA present in a biological sample at:
In one preferred embodiments, step ii) comprises detecting the presence of SNPs in the non-tumor DNA present in a biological sample at:
In another preferred embodiments, step ii) comprises detecting the presence of SNPs in the non-tumor DNA present in a biological sample at:
In certain preferred embodiments, step ii) comprises detecting the presence of SNPs in the non-tumor DNA at;
In certain preferred embodiments, step ii) comprises detecting the presence of SNPs in the non-tumor DNA at:
In certain preferred embodiments, step ii) comprises detecting the presence of SNPs in the non-tumor DNA at:
In certain preferred embodiments, step ii) comprises detecting the presence of SNPs in the non-tumor DNA at:
In one preferred embodiment, step ii) comprises detecting the presence of SNPs in the non-tumor DNA at:
Typically, step ii) comprises detecting the presence of SNPs in the non-tumor DNA at about 10 to all of the SNP loci defined in Table 1 a gene-region. In one embodiment, step ii) comprises detecting the presence of SNPs in the non-tumor DNA at least 30 of the SNP loci defined in Table 1 a gene-region. In one preferred embodiment, step ii) comprises detecting the presence of SNPs in the non-tumor DNA at least 60 of the SNP loci defined in Table 1 a gene-region. In one preferred embodiment, step ii) comprises detecting the presence of SNPs in the non-tumor DNA at least 90 of the SNP loci defined in Table 1 a gene-region.
In one embodiment, step ii) comprises detecting the presence of SNPs in the non-tumor DNA at:
Typically, step ii) comprises detecting the presence of SNPs in the non-tumor DNA at about 10 to all of the SNP loci defined in Table 1 for each of at least 10, at least 25, at least 50, or all of the target gene-regions defined in Table 1. For example, depending on the number of SNPs in a gene region, from 10 up to 500, 10 up to 400, 30 up to 400, 30 up to 300, 30 up to 200, or 30 up to 100 of the SNP loci for each of at least 30, at least 60, at least 90, or all of the target gene-regions defined in Table 1 may be detected in the non-tumor DNA. The present inventors have found that the method of the present invention is especially effective and informative about the asCNAs present in the tumor DNA of a sample when at least 30, at least 60, at least 90 or at least 100 of the SNP loci defined in Table 1 are detected for each of at least 10, at least 25, at least 50, or all of the target gene-regions defined in Table 1 (and in particular when at least 90 or at least 100 of the SNP loci defined in Table 1 are detected for each of at least 10, at least 25, at least 30, or all of the target gene-regions defined in Table 1).
In one embodiment, step ii) comprises detecting the presence of SNPs in the non-tumor DNA at:
In certain preferred embodiments, step ii) comprises detecting the presence of SNPs in the non-tumor DNA at target gene-regions and at control gene-regions. For example, step ii) may further comprise detecting the presence of SNPs in the genomic non-tumor DNA at:
In embodiments wherein step ii) comprises detecting the presence of SNPs in the non-tumor DNA at control gene-regions, typically the step also comprises detecting the presence of SNPs in the non-tumor DNA at about 10 to all of the SNP loci defined in Table 2 for each of at least 10, at least 25, at least 50, or all of the target gene-regions defined in Table 2. For example, depending on the number of SNPs in a gene region, from 10 up to 500, 10 up to 400, 30 up to 400, 30 up to 300, 30 up to 200, or 30 up to 100 of the SNP loci for each of at least 10, at least 25, at least 30, or all of the control gene-regions defined in Table 2 may be detected in the non-tumor DNA. The present inventors have found that the method of the present invention is especially effective and informative about the asCNAs present in the tumor DNA of a sample when at least 30, at least 60, at least 90 or at least 100 of the SNP loci defined in Table 2 are detected for each of at least 10, at least 25, at least 30, or all of the target gene-regions defined in Table 2 (and in particular when at least 90 or at least 100 of the SNP loci defined in Table 1 are detected for each of at least 10, at least 25, at least 30, or all of the target gene-regions defined in Table 2).
In one embodiment, step ii) comprises detecting the presence of SNPs in the non-tumor DNA at:
In certain embodiments, step ii) comprises a step of providing a set of probes, wherein said set of probes is capable of specifically hybridizing to:
Preferably, each probe in the set of probes is an oligonucleotide probe that is complementary to, and capable of hybridizing to, one or more SNP loci defined in Table 1 or 2. More preferably, each probe is complementary to only one SNP loci defined in Table 1 or 2.
In certain embodiments, step ii) comprises a step of providing a set of probes, wherein said set of probes is capable of specifically hybridizing to any embodiment or combination of embodiments described above, i.e. capable of specifically hybridizing to
For example, step ii) comprises a step of providing a set of probes, wherein said set of probes is capable of specifically hybridizing to
In certain preferred embodiments, step ii) comprises detecting the presence of at least one SNP in the intronic region of a gene-region and/or at least one SNP in the flanking region of a gene-region.
In certain preferred embodiments, step ii) comprises detecting the presence of at least one SNP in the intronic region and at least one SNP in the flanking regions of a gene-region.
For example, step ii) comprises detecting the presence of single nucleotide polymorphisms (SNPs) in the non-tumor DNA at:
In certain preferred embodiments, step ii) comprises detecting the presence of at least 10%, at least 20%, at least 30%, at least 50%, at least 75% or at least 90% of SNP in the intronic region of a gene-region and/or at least 10%, at least 20%, at least 30%, at least 50%, at least 75% or at least 90% of SNP in the flanking region of a gene-region.
In certain preferred embodiments, step ii) comprises detecting the presence of at least 10%, at least 20%, at least 30%, at least 50%, at least 75% or at least 90% of SNP in the intronic region of a gene-region and at least 10%, at least 20%, at least 30%, at least 50%, at least 75% or at least 90% of SNP in the flanking region of a gene-region.
For example, step ii) comprises detecting the presence of single nucleotide polymorphisms (SNPs) in the non-tumor DNA at:
In one embodiment, the subject is female and step ii) comprises,
In such embodiments, the presence and/or alteration of one or more asCNA in a target gene-region selected from the group consisting of MED12, SMARCA1, and KDM6A as defined above indicates that the subject would benefit from treatment with one or more of a DNA methyltransferase 1 (DNMT1) inhibitor (for example 5-azacitidine) and the histone deacetylase (HDAC) inhibitor (for example vorinostat and romidepsin).
In such embodiments, the presence and/or alteration of one or more asCNA in the target gene-region of AR as defined above, indicates that the subject would benefit from ceasing or altering treatment with an hormonal agent, such as a LHRH agonist (for example leuprolide, goserelin, triptorelin, or histrelin), LHRH antagonist (for example degarelix), androgen blockers (for example abiraterone or ketoconazole), anti-androgen (for example flutamide, bicalutamide, nilutamide, enzalutamide, apalutamide or darolutamide), androgen synthesis inhibitor (for example abiraterone), estrogen or steroid (for example prednisone or dexamethasone). It also indicates that a subject would benefit from ceasing or altering treatment with one or more class of drug that is known or suspected of targeting AR function from treatment with one or more alternative cancer treatment, for example a chemotherapy such as a taxane (for example docetaxel and cabazitaxel) or a platinum-based antineoplastic drugs (for example carboplatin).
Step iii):
The method of the present invention comprises a step of identifying which of the SNPs present in the non-tumor DNA are informative SNPs (iSNPs) for the subject, wherein an iSNP is a SNP that is heterozygous in the non-tumor DNA for the subject. For the avoidance of doubt, when used herein, the term iSNP refers to a SNP that is heterozygous in the genomic non-tumor DNA or tumor DNA for the subject, and in particular has an allelic fraction (AF) of between about 0.05 to about 0.95. In one preferred embodiment, the term iSNP refers to a SNP that is heterozygous in the non-tumor DNA or tumor DNA for the subject and has an allelic fraction (AF) of between about 0.2 to about 0.8. The present inventors have found that by selecting iSNP in step iii) of the present method, it is possible to more accurately determine the type of asCNA present in the tumor DNA of the subject.
In certain embodiments, step iii) comprises the following steps:
Step iii) may further comprise a step of determining the nucleotide sequences of the captured DNA molecules and analysing the nucleotide sequences to identify which of the SNPs present in the non-tumor DNA are informative SNPs (iSNPs) for the subject.
The method of the present invention comprises a step of determining the allelic imbalance for each target gene-region, and optionally for each control gene-region, of the tumor DNA by reference to the iSNPs for the subject in each gene-region (i.e. step iv-a).
The method of the present invention also comprises a step of determining the copy number for each target gene-region, and optionally for each control gene-region, in the tumor DNA (i.e. step iv-b).
In certain embodiments, the step iv-b) of the method of the invention further comprises a step of estimating the tumor content (TC) of the sample comprising tumor DNA and/or estimating the ploidy of the sample comprising tumor DNA.
In certain embodiments, the method further comprises a step of detecting in the tumor DNA the presence of somatic and/or germline mutations in the exonic region of one or more target gene-region defined in Table 1. In certain embodiments, the method further comprises a step of detecting in the tumor DNA the presence of somatic and/or germline mutations, for example insertions, deletions and single nucleotide variant mutations, in the exonic region of one or more target gene-region defined in Table 1. Typically, the method further comprises a step of detecting in the tumor DNA the presence of somatic and/or germline single nucleotide variant mutations in the exonic region of one or more target gene-region defined in Table 1
The method may further comprise optionally detecting in the tumor DNA the presence of somatic and/or germline mutations in the exonic region of one or more of gene selected from the group consisting of AR, MED12, SMARCA1, IDH1 and KDM6A. For example, the method may further comprise optionally detecting in the tumor DNA the presence of somatic and/or germline gains, losses and/or single nucleotide variant mutations in the exonic region of one or more of gene selected from the group consisting of AR, MED12, SMARCA1, IDH1 and KDM6A. In particular, the method may further comprise optionally detecting in the tumor DNA the presence of somatic and/or germline gains, losses and/or single nucleotide variant mutations in the exonic region of one or more of gene selected from the group consisting of AR, MED12, SMARCA1, and KDM6A; and/or optionally detecting in the tumor DNA the presence of somatic and/or germline single nucleotide variant mutations in the exonic region of/DH1. In one embodiment, the method may further comprise optionally detecting in the tumor DNA the presence of somatic and/or germline gains, losses and/or single nucleotide variant mutations in the exonic region of one or more of gene selected from the group consisting of MED12, SMARCA1, and KDM6A; and/or optionally detecting in the tumor DNA the presence of somatic and/or germline gains and/or single nucleotide variant mutations in the exonic region of AR; and/or optionally detecting in the tumor DNA the presence of somatic and/or germline single nucleotide variant mutations in the exonic region of/DH1.
In embodiments wherein the method comprises a step of detecting in the tumor DNA the presence of somatic and/or germline mutations in the exonic region of one or more target gene-region, the presence of one or more somatic and/or germline mutations indicates that the subject would benefit from treatment with one or more cancer treatments, has benefited, or is benefiting, from one or more cancer treatments and/or would benefit from ceasing or altering one or more cancer treatments.
The method of the present invention comprises a step of analyzing the allelic imbalance and copy number for each target gene-region, and optionally for each control gene-region, to determine the presence, absence, and/or alteration of one or more allele-specific copy number aberration (asCNA) at each target gene-region in the tumor DNA.
The presence, absence, and/or alteration of one or more asCNA in the tumor DNA indicates that the subject would benefit from treatment with one or more cancer treatments, has benefited, or is benefiting, from one or more cancer treatments and/or would benefit from ceasing or altering one or more cancer treatments.
In certain embodiments, the method of the invention further comprises step vi):
In certain embodiments, the method of the invention further comprises step vii):
In certain embodiments, the method of the invention further comprises the following steps:
In such embodiments, preferably the further biological sample obtained from the subject during or after the subject has undergone a treatment for cancer, wherein said sample comprises tumor DNA, is a sample of the same type as the biological sample comprising tumor DNA provided in step i). For example, if the biological sample comprising tumor DNA provided in step i) is a blood sample, preferably the further biological sample comprising tumor DNA is a blood sample; or if the biological sample comprising tumor DNA provided in step i) is a plasma sample, preferably the further biological sample comprising tumor DNA is a plasma sample.
In embodiments where the method of the invention further comprises:
In embodiments where the method of the invention further comprises steps I) and II), steps I) and II) may be repeated on one or more additional further biological samples (for example 1, 2, 3, 4, 5, 10, 15 or more additional further biological samples). For example, the steps may be repeated on one or more additional further biological samples each obtained at different time points during or after the subject has undergone a treatment for cancer compared to the time point the further biological sample was obtained. For example, an additional further biological may be obtained around 1 week, around 2 weeks, around 3, weeks, around 4, weeks, around 1 month, around 6 weeks, around 2 months, around 3 months, around 4 months, around 6 months, around 9 months, around 1 year, or more than 1 year after the further biological sample comprising tumor DNA of step i) was obtained.
In certain embodiments, the method of the invention further comprises a step of treating the subject for a cancer using a therapeutic agent for the treatment of cancer; or ceasing or altering treatment with a therapeutic agent for the treatment of a cancer; or initiating a non-therapeutic agent treatment for cancer (for example initiation of treatment by surgery or radiation).
Typically, steps ii) and iii) of the method of the present invention are performed by analysing the nucleotide sequence of the DNA in the biological sample.
A variety of procedures suitable for determining the nucleotide sequence of a DNA molecule are known in the art and may be used to practice the methods disclosed herein. Sequencing methods suitable for use in the present invention include, for example, Sanger sequencing, Polony sequencing, 454 pyrosequencing, Combinatorial probe anchor synthesis, SOLiD sequencing, Ion Torrent semiconductor sequencing, DNA nanoball sequencing, Heliscope single molecule sequencing, Single molecule real time (SMRT) sequencing, Nanopore DNA sequencing, Microfluidic Sanger sequencing and Illumina dye sequencing.
The method of the present invention may further comprise aligning the nucleotide sequences with a reference genome for the subject, for example by aligning the nucleotide sequences with hg38, hg19, hg18, hg17 or hg16. The alignment can, for example, be carried out using a variety of techniques known in the art. For example, a DNA sequence alignment tool, (e.g., BWA-MEM (for example version BWA-MEM 0.7.17-r1188) (Li, H. and Durbin, R. (2009) Bioinformatics, 25, 1754-1760); BBMap (Bushnell, B. (2014). BBMap: A Fast, Accurate, Splice-Aware Aligner. Lawrence Berkeley National Laboratory); HISAT (PMID: 258751142); Bowtie 2 (PMID: 22388286); and FSVA (PMID: 28155631)) can be used to align the reads to the reference genome (for example hg38, hg19, hg18, hg17 or hg16). In exemplary embodiments, the method described by Li, H. and Durbin, R., 2009 (Bioinformatics, 25, 1754-1760) is used (i.e. by using BWA-MEM 0.7.17-r1188).
The genomic location assigned to each nucleotide sequence in the alignment is based on the reference genome adopted. The genomic locations listed in Tables 1 and 2 disclosed herein correspond to reference genome hg19. The corresponding locations in a different reference genome can be found using public available tools known in the art. An example of such a tool is LiftOver (http://genome.ucsc.edu/).
In certain embodiments, the method comprises removing duplications of reads of the same DNA molecule (e.g. duplications of reads of the same cfDNA molecule). Sequence reads having exactly the same sequence and start and end base pairs (i.e. the same unclipped alignment start and unclipped alignment end of the sequence) are typically removed, as they are likely to be duplicate sequence reads of the same sequence (i.e. duplicate of reads of the same cfDNA molecule). For example, PCR duplications can be removed as part of the aligning step, such as using Picard tools v2.1.0 (http://broadinstitute.github.io/picard).
In certain embodiments, each SNP loci detected in step ii) is covered by at least 2 sequence reads in step iii), for example at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 15, 20, 25, 50, 100, 200, 300, 400, 500, 1000, or 10,000 sequence reads in step iii). Preferably, each SNP loci detected in step ii) is covered by at least 5 sequence reads in step iii), for example at least 6, 7, 8, 9, 10, 12, 15, 20, 25, 50, 100, 200, 300, 400, 500, 1000, or 10,000 sequence reads. More preferably, each SNP loci detected in step ii) is covered by at least 10 sequence reads in step iii), for example at least 12, 15, 20, 25, 50, 100, 200, 300, 400, 500, 1000, or 10,000 sequence reads. Estimation of the read-depth of each gene-region, and therefore the SNPs within each region, can be performed using methods known in the art, for example by using the method described in Carreira et al. (2014) (Science Translational Medicine, 6, 254ra125), which is incorporated herein by reference.
Determining the Presence, Absence, and/or Alteration Allele-Specific Copy Number Aberrations (asCNAs):
To determine the presence, absence, and/or alteration allele-specific copy number aberrations (asCNAs), it is first necessary to determine the allelic imbalance for the target gene-regions, and optionally control gene-regions, in the tumor DNA, and to determine the copy number for each gene-region in the tumor DNA.
Typically, to determine allelic imbalance, it is necessary to first determine a reference model using a set of control samples. Control samples may comprise genomic DNA from healthy subjects and/or healthy cells, for example white blood cells. Further examples of suitable control samples include:
An exemplary method for generating a reference model using a control sample is described in the Examples section.
Allelic imbalance may be determined for each gene-region in the tumor DNA and non-tumor DNA for a subject. For example, this may be achieved by reference to a reference model. Using a reference model can further improve the sensitivity of the methods of the invention.
A reference model may be a collection of statistics obtained from a two or more control samples (for example 2 or more, 3 or more, 4 or more, 5 or more, 8 or more, 10 or more, 15 or more, 20 or more, 30 or more, 40 or more, or more than 50 samples), in particular two or more (for example 2 or more, 3 or more, 4 or more, 5 or more, 8 or more, 10 or more, 15 or more, 20 or more, 30 or more, 40 or more, or more than 50 samples) control samples that do not comprise tumor DNA. For example, to build a reference model two or more control samples comprising non-tumor DNA and not comprising tumor DNA may be used. Biological samples comprising non-tumor DNA provided for step i) of the method of the invention for different subjects may be used as a sample for the reference model if each biological sample does not comprise tumor DNA (for example if each biological sample comprising non-tumor DNA is a WBC sample). Additionally, or alternatively, one or more control sample as described above may be used to build the reference model. The control samples are sequenced. Preferably, control samples for use in building a reference model are sequenced at the same institution and/or sequenced using the same sequencer. The sequences of the samples are analysed to provide a collection of statistics including local (per each single SNP) and global (aggregating SNPs by similar local coverages observed in control samples used for model building) metrics of SNPs. For each control sample that is used to build the model, only SNPs that are informative (i.e. heterozygous SNPs that are present in a gene-region in the genome of the subject that provided the control sample) are used to compute the statistics. When testing for the presence of imbalance in a biological sample comprising tumor DNA from a subject, the statistics for each SNP that is informative for the subject are extracted and used to generate allelic fraction (AF) distributions mimicking distributions that could be extracted from a pure non-tumor DNA sample and from a mix of non-tumor and tumor DNA at different proportions (for example 0% and 100%, 1% and 99%, 2% and 98%, 3% and 97%, N % and (100-N) % (wherein N is each integer between 3 and 97), 97% and 3%, 98% and 2%, 99% and 1% and 100 and 0%, of non-tumor and tumor DNA respectively). When comparing the observed iSNPs AF distribution from a biological sample comprising tumor DNA against reference model derived distributions, differences between the AF distribution in the biological sample and the model derived distributions indicate the presence of allele imbalance tumor and, if allele imbalance is present, the quantity of the allele imbalance.
An exemplary method for determining allelic imbalance in the tumor DNA with reference to a reference model is described in the Examples section. An exemplary method for determining allelic imbalance in the tumor DNA without reference to a reference model is also described in the Examples section.
The copy number of a gene-region in the tumor DNA or non-tumor DNA may be determined, for example, by integrating the read-depth estimations and allelic imbalance calls. Methods for determining the focal copy number of parts of a gene-region are described herein. Methods for determining the focal copy number of parts of a gene-region are also known in the art, for example DNAcopy (10.18129/B9.bioc.DNAcopy) and the methods reported in Zare, F., et al. BMC Bioinformatics 18, 286 (2017).
Allele-specific copy number aberrations (asCNAs) are alterations in the total copy number of a gene-region, or part thereof, and the specific number of copies of each chromosome. Types of asCNA include: copy number gain (for example balanced or unbalanced copy number gain), copy number loss (for example homodeletions and hemideletions), and loss of heterozygosity (LOH) events (for example copy number-neutral loss of heterozygosity). The asCNA status for each gene-region may be determined by integrating read-depth estimation for each gene-region and allelic imbalance status for each gene-region. The asCNA status may then be corrected for the ploidy and purity of the tumor DNA in the biological sample. Method for correcting the asCNA for ploidy and purity are known in the art. For example, CLONETv2 (Prandi et al., 2019, Curr Protoc Bioinformatics. September; 67(1):e81)), FACETS (PMID: 27270079), ASCAT (PMID: 20837533), Sequenza (PMID: 25319062), or CNVkit (PMID: 27100738). In exemplary embodiments, the method described by Prandi et al., 2019 (Curr Protoc Bioinformatics. 2019 September; 67(1):e81) is used (i.e. by using the CLONETv2 algorithm).
As described in the Examples section, the presence, absence, and/or alteration of one or more asCNA in the tumor DNA indicates that the subject would benefit from treatment with one or more cancer treatments, has benefited, or is benefiting, from one or more cancer treatments and/or would benefit from ceasing or altering one or more cancer treatments.
In certain embodiments, the method of the present invention are performed using an in vitro assay of the present invention. The in vitro assay of the present invention comprises the method steps of:
Steps e) and f) of the in vitro assay are performed using the same processes as described herein for steps iv) and v) of the in vitro method of the present invention.
The present invention also provides a set of probes suitable for use in the method of the present invention, wherein the set of probes are capable of hybridizing to:
In certain embodiments, a set of probes suitable for use in the method of the present invention are capable of hybridizing to at least one (for example 1, 2, 3, 4 or 5; typically at least 2, at least 3 or at least 4) target gene-region selected from the group consisting of FOXA1, FOXP1, HSD3B1, NCOA2 and ZBTB16 as defined in Table 1. Genetic aberrations in such gene-regions are known or suspected of being associated with the function in the androgen (receptor) signalling pathway.
In certain embodiments, a set of probes suitable for use in the method of the present invention are capable of hybridizing to at least one (for example 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16 or 17; typically at least 2, at least 3 or at least 4) target gene-region selected from the group consisting of AURKA, BRAF, CCND1, CDK12, CDK4, CDK6, CDKN1B, CDKN2A, CUL1, FBXW7, KRAS, MDM2, MDM4, MYC, MYCN, RB1 and TP53 as defined in Table 1. Genetic aberrations in such gene-regions are known or suspected of being associated with cell cycle dysfunction.
In certain embodiments, a set of probes suitable for use in the method of the present invention are capable of hybridizing to at least one (for example 1, 2, 3, 4, 5, 6, 7 or 8; typically at least 2, at least 3 or at least 4) target gene-region selected from the group consisting of AKT1, AKT2, AKT3, MET, PIK3C, PIK3CB, PIK3R1 and PTEN as defined in Table 1. Genetic aberrations in such gene-regions are known or suspected of being associated with with the phosphoinositide 3-kinase (PI3K) signalling pathway.
In certain embodiments, a set of probes suitable for use in the method of the present invention are capable of hybridizing to at least one (for example 1, 2, 3, 4, 5, 6, 7 or 8; typically at least 2, at least 3 or at least 4) target gene-region selected from the group consisting of ARID1A, CHD1, KDM6A, MED12, SMARCA1, KMT2C, KMT2D and RYBP as defined in Table 1. Genetic aberrations in such gene-regions are known or suspected of being associated with chromatic remodeling dysfunction.
In certain embodiments, a set of probes suitable for use in the method of the present invention are capable of hybridizing to at least one (for example 1, 2, 3, 4, 5, 6, 8, 10, 12, 15, 20 or 22; typically at least 2, at least 3 or at least 4) target gene-region selected from the group consisting of ATM, ATR, BRCA1, BRCA2, CHD1, ERCC1, ERCC2, ERCC3, ERCC4, ERCC5, FANCA, FANCC, FANCD2, FANCE, FANCF, FANCG, MLH1, MSH2, MSH6, PALB2, RAD51B and RAD51C as defined in Table 1. Genetic aberrations in such gene-regions are known or suspected of being associated with DNA repair dysfunction.
In certain embodiments, a set of probes suitable for use in the method of the present invention are capable of hybridizing to at least one (for example 1, 2, 3, 4, 5, 6, 8, 10, or 12; typically at least 2, at least 3 or at least 4) target gene-region selected from the group consisting of ASXL1, CLU, CYLD, ERG_TMPRSS2, GNAS, IDH1, IDH2, NFE2L2, NKX3-1, RUNX1, SPOP and ZFHX3 as defined in Table 1. Genetic aberrations in such gene-regions are known to be associated with prostate cancer, although their precise role in the development and/or progression of prostate cancer of a subject are currently unknown.
In certain embodiments, a set of probes suitable for use in the method of the present invention are capable of hybridizing to at least one (for example 1, 2, or 3; typically at least 2) target gene-region selected from the group consisting of APC, CTNNB1 and RNF43 as defined in Table 1. Genetic aberrations in such gene-regions are known or suspected to dysregulate the Wnt signalling pathway.
In certain preferred embodiments, a set of probes suitable for use in the method of the present invention are capable of hybridizing to at least one (for example 1, 2, 3, 4, 5, or 6; typically at least 2, at least 3, at least 4, at least 5 or 6) target gene-region selected from the group consisting of BRCA2, ATM, RB1, NKX3-1, TP53, and PTEN as defined in Table 1. In certain embodiments, a set of probes suitable for use in the method of the present invention are capable of hybridizing to at least 3 of the target gene-regions selected from the group consisting of BRCA2, ATM, RB1, NKX3-1, TP53, and PTEN as defined in Table 1. More preferably, a set of probes suitable for use in the method of the present invention are capable of hybridizing to the gene-regions of at least BRCA2, ATM, RB1, NKX3-1, TP53, and PTEN as defined in Table 1.
In certain embodiments, a set of probes suitable for use in the method of the present invention are capable of hybridizing to at least one target gene-region selected from the group consisting of FOXA1, FOXP1, HSD3B1, NCOA2 and ZBTB16; and/or at least one target gene-region selected from the group consisting of AURKA, BRAF, CCND1, CDK12, CDK4, CDK6, CDKN1B, CDKN2A, CUL1, FBXW7, KRAS, MDM2, MDM4, MYC, MYCN, RB1 and TP53; and/or at least one target gene-region selected from the group consisting of ARID1A, CHD1, KMT2C, KMT2D and RYB; and/or at least one target gene-region selected from the group consisting of ATM, ATR, BRCA1, BRCA2, CHD1, ERCC1, ERCC2, ERCC3, ERCC4, ERCC5, FANCA, FANCC, FANCD2, FANCE, FANCF, FANCG, MLH1, MSH2, MSH6, PALB2, RAD51B and RAD51C; and/or at least one target gene-region selected from the group consisting of ASXL1, CLU, CYLD, ERG_TMPRSS2, GNAS, IDH2, NFE2L2, NKX3-1, RUNX1, SPOP and ZFHX3; and/or at least one target gene-region selected from the group consisting of AKT1, AKT2, AKT3, MET, PIK3C, PIK3CB, PIK3R1 and PTEN; and/or at least one target gene-region selected from the group consisting of APC, CTNNB1 and RNF43.
In certain embodiments, a set of probes suitable for use in the method of the present invention are capable of hybridizing to at least two, at least three or at least four target gene-region selected from the group consisting of FOXA1, FOXP1, HSD3B1, NCOA2 and ZBTB16; and/or at least two, at least three or at least four target gene-region selected from the group consisting of AURKA, BRAF, CCND1, CDK12, CDK4, CDK6, CDKN1B, CDKN2A, CUL1, FBXW7, KRAS, MDM2, MDM4, MYC, MYCN, RB1 and TP53; and/or at least two, at least three or at least four target gene-region selected from the group consisting of ARID1A, CHD1, KMT2C, KMT2D and RYB; and/or at least two, at least three or at least four target gene-region selected from the group consisting of ATM, ATR, BRCA1, BRCA2, CHD1, ERCC1, ERCC2, ERCC3, ERCC4, ERCC5, FANCA, FANCC, FANCD2, FANCE, FANCF, FANCG, MLH1, MSH2, MSH6, PALB2, RAD51B and RAD51C; and/or at least two, at least three or at least four target gene-region selected from the group consisting of ASXL1, CLU, CYLD, ERG_TMPRSS2, GNAS, IDH2, NFE2L2, NKX3-1, RUNX1, SPOP and ZFHX3; and/or at least two, at least three or at least four target gene-region selected from the group consisting of AKT1, AKT2, AKT3, MET, PIK3C, PIK3CB, PIK3R1 and PTEN; and/or at least one, at least two or each target gene-region selected from the group consisting of APC, CTNNB1 and RNF43.
Preferably, said set of probes is capable of specifically hybridizing to any embodiment or combination of embodiments described herein for step ii) of the in vitro method of the present invention, i.e. capable of specifically hybridizing to
For example, said set of probes are capable of hybridizing to:
Or, said set of probes are capable of hybridizing to:
Or, said set of probes are capable of hybridizing to:
In one embodiment, the set of probes comprises at least 10, at least 20, or at least 30 (for example, at least 100, at least 300, at least 400, at least 500) probes that are capable of specifically hybridizing to at least 10, at least 20, or at least 30 (for example, at least 100, at least 300, at least 400, at least 500) of the SNP loci defined in Tables 1 for each of at least 5 target gene-regions in Table 1. In certain embodiments, the set of probes further comprises at least 10, at least 20, or at least 30 (for example, at least 100, at least 300, at least 400, at least 500) probes that are capable of specifically hybridizing to at least 10, at least 20, or at least 30 (for example, at least 100, at least 300, at least 400, at least 500) of the SNP loci defined in Tables 2 for each of at least 5 control gene-regions in Table 2. Preferably, each probe in the set of probes provided in step ii) is capable of hybridizing to only one of the SNP loci defined in Table 1 or 2. Typically, the total number of different probes in the set is at least 50, at least 500, at least 1000, at least 3000, at least 5000, at least 10,000, at least 15,000, or at least 17,500.
In certain preferred embodiments, the set of probes is capable of hybridizing to:
In certain preferred embodiments, the set of probes is capable of hybridizing to:
In another embodiment, the set of probes is capable of hybridizing to:
Probes suitable for use in the in vitro method of the present invention include, but are not limited to, small molecules, peptides (including cyclic peptides), proteins, nucleic acids (e.g. DNA and RNA nucleotides including, but not limited to, antisense nucleotide sequences, triple helices, siRNA or miRNA, and nucleotide sequences encoding biologically active proteins, polypeptides or peptides), synthetic or natural inorganic molecules and synthetic or natural organic molecules that specifically bind to one or more SNP loci defined in Table 1 or 2.
In preferred embodiments, the set of probes provided by the present invention is a set of oligonucleotide probes. In such embodiments, the set of oligonucleotide probes is complementary to, and capable of hybridizing to, one or more SNP loci defined in Table 1 or 2, and preferably, each probe is complementary to only one SNP loci defined in Table 1 or 2.
Probes suitable for use in the present invention may comprise a “label” which is suitable for capturing DNA comprising one or more of the SNPs defined in Table 1 or 2.
Suitable labels for capturing DNA molecules in a sample that comprise one or more of the SNPs include, for example, biotin. The type of label chosen will depend on the desired capture method used. For convenience, the probe may be immobilised on a solid phase support including resins (such as polyacrylamides), carbohydrates (such as sepharose), plastics (such as polycarbonate), and latex beads.
The probes may be bound to a solid matrix as discussed above or packaged with reagents for binding them to the matrix. The solid matrix or substrate may be in the form of beads, plates, tubes, dip sticks, strips or biochips. Biochips or plates with addressable locating and discreet microtitre plates are particularly useful.
The term “kit” refers to any item of manufacture (e.g. a package or container) comprising at least one reagent, e.g. a probe or small molecule, for specifically detecting one or more SNPs of Table 1 and/or Table 2. The kit may be promoted, distributed, or sold as a unit for performing the methods of the present invention.
The kit of the invention comprises means for probing the biological sample to determine the presence of SNPs in a sample obtained from the subject. Preferably, the kit of the invention comprises a set of probes of the present invention.
The kit may also include additional components to facilitate the particular application for which the kit is designed. For example, the kit may additionally contain means of detecting a label (e.g., enzyme substrates for enzymatic labels, filter sets to detect fluorescent labels, appropriate secondary labels such as a sheep anti-mouse-HRP, etc.) and reagents necessary for controls (e.g., control biological samples or standards). A kit may additionally include buffers and other reagents of the necessary grade for use in a method of the disclosed invention in a health care setting. Non-limiting examples include agents to reduce non-specific binding, such as a carrier protein or a detergent.
In certain embodiments the kit comprises one or more containers and may also include sampling equipment, for example, bottles, bags (such as intravenous fluid bags), vials, syringes, and test tubes. Other components may include needles, diluents, wash reagents and buffers. Usefully, the kit may include at least one container comprising a pharmaceutically-acceptable buffer, such as phosphate-buffered saline, Ringer's solution and dextrose solution.
In one preferred embodiment, the kit comprises instructions for use. In certain embodiments, the kit comprises instructions for staging, classification, screening, monitoring, stratification, selecting treatment for, ascertaining whether treatment is working in, and/or prognostication of cancer in a subject using the kit. For example, the kit comprises instructions for use which define how to determine the present of SNPs and/or identify the presence of iSNPs in the non-tumor DNA in a sample, for example by following a method of the invention defined herein.
In one preferred embodiment, the kit comprises a computer product or a computer-executable software for staging, classification, screening, monitoring, stratification, selecting treatment for, ascertaining whether treatment is working in, and/or prognostication of cancer in a subject using the kit. In certain embodiments, the computer product comprises a non-transitory computer readable medium storing a plurality of instructions that when executed control a computer system to perform a method of the invention. In certain embodiments, the computer-executable software comprises software for performing a method of the invention.
The method of the present invention is for detecting, screening, monitoring, staging, classification, selecting treatment for, ascertaining whether treatment is working in, and/or prognostication of a cancer. The present invention also provides methods for treating cancer. The cancer may be selected from the group consisting of prostate cancer, breast cancer, ovarian cancer, pancreatic cancer, bladder cancer, and metastatic cancer.
In certain embodiments, the cancer is prostate cancer. The prostate cancer may be any type of prostate cancer. Typically, it may be acinar adenocarcinoma prostate cancer, ductal adenocarcinoma prostate cancer, transitional cell cancer of the prostate, squamous cell cancer of the prostate or small cell prostate cancer. For example, it may be acinar adenocarcinoma prostate cancer or ductal adenocarcinoma prostate cancer. Alternatively, or additionally, the prostate cancer may be hormone sensitive prostate cancer (HSPC) or castration resistant prostate cancer (CRPC). Alternatively, or additionally, the prostate cancer may be metastatic prostate cancer, or it may be non-metastatic prostate cancer. In certain embodiments, it may be metastatic prostate cancer. In certain embodiments, the prostate cancer may be metastatic castration resistant prostate cancer or non-metastatic castration resistant prostate cancer. In certain embodiments, the prostate cancer may be metastatic hormone sensitive prostate cancer (mHSPC) or non-metastatic hormone sensitive prostate cancer.
In certain embodiments, the cancer is metastatic cancer (i.e. a cancer that has metastasised). Metastatic cancer is cancer that has spread from the primary site of origin into one or more different areas of the body. For example the cancer may be metastatic prostate cancer (i.e. cancer which has spread from the primary prostate site of origin into one or more different areas of the body), or another form of metastatic cancer, such as metastatic breast cancer, metastatic ovarian cancer, metastatic pancreatic cancer, or metastatic bladder cancer.
The method is especially suitable for the detecting, screening, monitoring, staging, classification, selecting treatment for, ascertaining whether treatment is working in, and/or prognostication of prostate cancer, and more especially metastatic prostate cancer.
Methods of treatment of the present invention are especially suitable for the treating prostate cancer, and more especially metastatic prostate cancer.
The methods of the invention can be used for determining a suitable treatment regimen (i.e. one or more drug for the treatment of cancer) for a subject in need of cancer treatment.
The screening of subjects using the methods, kits and assays of the present invention, allow the full potential benefits of cancer treatments to be obtained by selecting one or more treatment that have a high likelihood of benefitting the patient in view of asCNA harboured in the subject's tumor. Alternatively, or additionally, screening of subjects using the methods, kits and assays of the present invention allow the full potential benefits of cancer treatments to be obtained by ceasing or altering one or more cancer treatments, thus minimizing side effects and exposure of a subject to unnecessary and/or potentially harmful treatments.
The methods of the invention can be used for determining that a subject would benefit from treatment with one or more cancer treatments selected from the group consisting of a ATR inhibitor, CDK inhibitor, chemotherapy (such as taxanes, platinum-based antineoplastic drugs), WEE1 inhibitor, Aurora kinase inhibitor, alkylating agent, PARP inhibitor, DNA-PK inhibitor, immune checkpoint therapies (for example a PD-1 inhibitor, PD-L1 inhibitor, or a CTLA-4 inhibitor), CHK2 inhibitor, WEE1 inhibitor, platinum-based antineoplastic drug, taxane, c-Met inhibitor, radionuclide and radiation therapy, DNMT1 inhibitor, HDAC inhibitor, BET inhibitor, PI3K inhibitor, PORCN inhibitor, FZD antagonists/monoclonal antibody, inhibitor of Wnt target genes, or hormonal agent, such as a LHRH agonist, LHRH antagonist, anti-androgen, androgen synthesis inhibitor, estrogen or steroid. The methods of the invention can be used for determining that a subject would benefit from treatment with one or more cancer treatments selected from the group consisting of a chemotherapy (such as a taxane (in particular docetaxel and cabazitaxel) and platinum-based antineoplastic drug (in particular carboplatin)), PARP inhibitor, or hormonal agent, such as a LHRH agonist, LHRH antagonist, anti-androgen, androgen synthesis inhibitor, estrogen or steroid.
Alternatively, or additionally, methods of the invention can be used for determining that a subject has benefited, or is benefiting, from one or more cancer treatments selected from the group consisting of ATR inhibitor, CDK inhibitor, Chemotherapy (such as taxanes and platinum-based antineoplastic drugs), WEE1 inhibitor, Aurora kinase inhibitor, alkylating agent, PARP inhibitor, DNA-PK inhibitor, immune checkpoint therapies (for example a PD-1 inhibitor, PD-L1 inhibitor, or a CTLA-4 inhibitor), CHK2 inhibitor, WEE1 inhibitor, platinum-based antineoplastic drug, taxane, c-Met inhibitor, radionuclide and radiation therapy, DNMT1 inhibitor, HDAC inhibitor, BET inhibitor, PI3K inhibitor, PORCN inhibitor, FZD antagonists/monoclonal antibody, inhibitor of Wnt target genes, or hormonal agent, such as a LHRH agonist, LHRH antagonist, anti-androgen, androgen synthesis inhibitor, estrogen or steroid. Alternatively, or additionally, methods of the invention can be used for determining that a subject has benefited, or is benefiting, from one or more cancer treatments selected from the group consisting of a chemotherapy (such as a taxane (in particular docetaxel and cabazitaxel) and platinum-based antineoplastic drug (in particular carboplatin)), PARP inhibitor, or hormonal agent, such as a LHRH agonist, LHRH antagonist, anti-androgen, androgen synthesis inhibitor, estrogen or steroid.
Alternatively, or additionally methods of the invention can be used for determining that a subject would benefit from ceasing or altering one or more cancer treatments selected from the group consisting of a ATR inhibitor, CDK inhibitor, Chemotherapy (such as taxanes, platinum-based antineoplastic drugs and c-Met inhibitors), WEE1 inhibitor, Aurora kinase inhibitor, alkylating agent, PARP inhibitor, DNA-PK inhibitor, immune checkpoint therapies (for example a PD-1 inhibitor, PD-L1 inhibitor, or a CTLA-4 inhibitor), CHK2 inhibitor, WEE1 inhibitor, platinum-based antineoplastic drug, taxane, c-Met inhibitor, radionuclide and radiation therapy, DNMT1 inhibitor, HDAC inhibitor, BET inhibitor, PI3K inhibitor, PORCN inhibitor, FZD antagonists/monoclonal antibody, inhibitor of Wnt target genes, or hormonal agent, such as a LHRH agonist, LHRH antagonist, anti-androgen, androgen synthesis inhibitor, estrogen or steroid. Alternatively, or additionally methods of the invention can be used for determining that a subject would benefit from ceasing or altering one or more cancer treatments selected from the group consisting of a chemotherapy (such as a taxane (in particular docetaxel and cabazitaxel) and platinum-based antineoplastic drug (in particular carboplatin)), PARP inhibitor, or hormonal agent, such as a LHRH agonist, LHRH antagonist, anti-androgen, androgen synthesis inhibitor, estrogen or steroid.
Examples of PARP inhibitors include olaparib, rucaparib, niraparib or talazoparib, Veliparib, Pamiparib, Rucaparib, and Veliparib; and in particular example olaparib, rucaparib, niraparib or talazoparib. Examples of ATR inhibitors include Berzosertib, ceralasertib, M4344, and BAY1895344. Examples of CDK inhibitors include Flavopiridol (alvocidib), abemaciclib, ribociclib, Olomoucine, Roscovitine (Seliciclib), Purvalanol, Paullones, Butryolactone, Thio/oxoflavopiridols, Oxindoles, Aminothiazoles, Benzocarbazoles, and Pyrimidines; and in particular Flavopiridol, Palbociclib, ribociclib and abemaciclib. Examples of DNA-PK inhibitors include AZD7648, M3814, CC-122 and CC-115. Examples of immune checkpoint therapies include PD-1 inhibitors such as pembrolizumab, nivolumab, cemiplimab, and spartalizumab; PD-L1 inhibitors such asatezolizumab, avelumab and durvalumab; and CTLA-4 inhibitors such as ipilimumab. Examples of CHK1 inhibitors include V158411, PF-477736 and AZD7762.
Examples of CHK2 inhibitors include CCT241533 and Aminopyridine 7. Examples of WEE1 inhibitors include adavosertib. Examples of platinum-based antineoplastic drugs include cisplatin, carboplatin, oxaliplatin, nedaplatin, triplatin tetranitrate, picoplatin, satraplatin and phenanthriplatin, and in particular cisplatin, carboplatin, oxaliplatin, nedaplatin. Examples of taxanes include docetaxel, cabazitaxel, and paclitaxel. Examples of radionuclide or radiation therapies include radium-223, 225Ac-Labeled PSMA-617 and 177Lu-Labeled PSMA-617. Examples of CDK inhibitors include Flavopiridol (alvocidib), abemaciclib, ribociclib, Olomoucine, Roscovitine (Seliciclib), Purvalanol, Paullones, Butryolactone, Thio/oxoflavopiridols, Oxindoles, Aminothiazoles, Benzocarbazoles, and Pyrimidines; and in particular Flavopiridol, Palbociclib, ribociclib and abemaciclib. Examples of chemotherapies include taxanes (for example docetaxel and cabazitaxel), platinum-based antineoplastic drugs (such as cisplatin and carboplatin) and c-Met inhibitors (for example cabozantinib). Examples of Aurora kinase inhibitors include Alisertib, ZM447439, hesperidin, and VX-680. Examples of PI3K inhibitors includeidelalisib, copanlisib, duvelisib, alpelisib, umbralisib, dactolisib, voxtalisib, Taselisib, Idelalisib, Buparlisib, Duvelisib, and Copanlisib and in particular idelalisib, copanlisib, duvelisib, alpelisib, and umbralisib. Examples of mTOR inhibitors include rapamycin, deforolimus, dactolisib, voxtalisib, temsirolimus, everolimus, sapanisertib, AZD8055, and AZD2014. Examples of PORCN inhibitors include WNT974, ETC-1922159 and CGX1321. Examples of FZD antagonists/monoclonal antibodies include Vantictumab, Ipafricept and OTSA101-DTPA-90Y. Examples of inhibitors of Wnt target genes include SM08502. Examples of LHRH agonists include leuprolide, goserelin, triptorelin, and histrelin. Examples of LHRH antagonists include degarelix. Examples of anti-androgens include flutamide, bicalutamide, nilutamide, enzalutamide, apalutamide and darolutamide. Examples of steroids include prednisone and dexamethasone. Example of DNMT1 inhibitors include 5-azacitidine. Examples of HDAC inhibitors include vorinostat and romidepsin. Examples of BET inhibitors include I-BET 151, I-BET 762, OTX-015, TEN-010, CPI-203, CPI-0610, olinone, RVX-208, ABBV-744, AZD5153, MT-1, and MS645. Examples of alkylating agents include nitrogen mustards (such as cyclophosphamide, chlormethine, uramustine, melphalan, chlorambucil, ifosfamide, and bendamustine), nitrosoureas (such as carmustine, lomustine, and streptozocin) and alkyl sulfonates (such as busulfan)).
In embodiments of the invention wherein at least one target gene-region is selected from the group consisting of AR, FOXA1, FOXP1, HSD3B1, NCOA2 and ZBTB16 (in particular FOXA1, FOXP1, HSD3B1, NCOA2 and ZBTB16) as defined in Table 1, the presence and/or alteration of one or more asCNA (and/or the presence of one or more somatic and/or germline mutations) in a target gene-region selected from the group consisting of AR, FOXA1, FOXP1, HSD3B1, NCOA2 and ZBTB16 as defined in Table 1, indicates that the subject would benefit from ceasing or altering treatment with one or more class of drug that is known or suspected of targeting AR function. For example ceasing or altering treatment with a hormonal agent, such as LHRH agonists, LHRH antagonists, androgen blockers, anti-androgens, androgen synthesis inhibitors, estrogens and steroids (in particular androgen blockers, anti-androgens and androgen synthesis inhibitors); and in particular ceasing or altering treatment with one or more of the following cancer treatments: leuprolide, goserelin, triptorelin, histrelin, degarelix, abiraterone, ketoconazole, flutamide, bicalutamide, nilutamide, enzalutamide, apalutamide or darolutamide, estrogens prednisone or dexamethasone (in particular abiraterone, ketoconazole, flutamide, bicalutamide, nilutamide, enzalutamide, apalutamide or darolutamide).
In embodiments of the invention wherein at least one target gene-region is selected from the group consisting of AR, FOXA1, FOXP1, HSD3B1, NCOA2 and ZBTB16 (in particular FOXA1, FOXP1, HSD3B1, NCOA2 and ZBTB16) as defined in Table 1, the presence and/or alteration of one or more asCNA (and/or the presence of one or more somatic and/or germline mutations) in a target gene-region selected from the group consisting of AR, FOXA1, FOXP1, HSD3B1, NCOA2 and ZBTB16 as defined in Table 1, indicates that the subject would benefit from treatment with a chemotherapeutic agent (for example a taxane (such as docetaxel or cabazitaxel) or a platinum-based antineoplastic drug (such as carboplatin)).
In embodiments of the invention wherein at least one target gene-region is selected from the group consisting of AURKA, BRAF, CCND1, CDK12, CDK4, CDK6, CDKN1B, CDKN2A, CUL1, FBXW7, KRAS, MDM2, MDM4, MYC, MYCN, RB1 and TP53 as defined in Table 1, the presence and/or alteration of one or more asCNA (and/or the presence of one or more somatic and/or germline mutations) in a target gene-region selected from the group consisting of AURKA, BRAF, CCND1, CDK12, CDK4, CDK6, CDKN1B, CDKN2A, CUL1, FBXW7, KRAS, MDM2, MDM4, MYC, MYCN, RB1 and TP53 as defined in Table 1, indicates that the subject would benefit from treatment with one or more class of drug that is known or suspected of targeting cell cycle perturbations. For example, one or more of an ATR inhibitor, CDK inhibitor, chemotherapy, WEE1 inhibitors, Aurora kinase inhibitors or alkylating agent; in particular one or more of Berzosertib, Berzosertib, ceralasertib, M4344, BAY1895344, Flavopiridol (alvocidib), abemaciclib, ribociclib, Olomoucine, Roscovitine (Seliciclib), Purvalanol, Paullones, Butryolactone, Thio/oxoflavopiridols, Oxindoles, Aminothiazoles, Benzocarbazoles, Pyrimidines, taxanes (for example docetaxel and cabazitaxel), c-Met inhibitors (for example cabozantinib), adavosertib Alisertib, ZM447439, hesperidin, VX-680, nitrogen mustards (for example cyclophosphamide, chlormethine, uramustine, melphalan, chlorambucil, ifosfamide, and bendamustine), nitrosoureas (for example carmustine, lomustine, and streptozocin) and alkyl sulfonates (for example busulfan).
In embodiments of the invention wherein at least one target gene-region is selected from the group consisting of AURKA, BRAF, CCND1, CDK12, CDK4, CDK6, CDKN1B, CDKN2A, CUL1, FBXW7, KRAS, MDM2, MDM4, MYC, MYCN, RB1 and TP53 (and in particular TP53) as defined in Table 1, the presence and/or alteration of one or more asCNA (and/or the presence of one or more somatic and/or germline mutations) in a target gene-region selected from the group consisting of AURKA, BRAF, CCND1, CDK12, CDK4, CDK6, CDKN1B, CDKN2A, CUL1, FBXW7, KRAS, MDM2, MDM4, MYC, MYCN, RB1 and TP53 (and in particular TP53) as defined in Table 1, indicates that the subject would benefit from ceasing or altering treatment with one or more hormonal agent, such as LHRH agonists, LHRH antagonists, anti-androgens, androgen synthesis inhibitors, estrogens and steroids; and in particular ceasing or altering treatment with one or more of the following cancer treatments: leuprolide, goserelin, triptorelin, histrelin, degarelix, abiraterone, ketoconazole, flutamide, bicalutamide, nilutamide, enzalutamide, apalutamide or darolutamide, estrogens prednisone or dexamethasone.
In embodiments of the invention wherein at least one target gene-region is selected from the group consisting of ARID1A, CHD1, KDM6A, MED12, SMARCA1, KMT2C, KMT2D and RYB as defined in Table 1, and wherein the presence and/or alteration of one or more asCNA (and/or the presence of one or more somatic and/or germline mutations) in a target gene-region selected from the group consisting of ARID1A, CHD1, KDM6A, MED12, SMARCA1, KMT2C, KMT2D and RYBP as defined in Table 1, indicates that the subject would benefit from treatment with one or more class of drug that is known or suspected to target chromatin remodelling functions. For example one or more DNMT1 inhibitors, HDAC inhibitors and BET inhibitors; in particular 5-azacitidine, vorinostat, romidepsin, I-BET 151, I-BET 762, OTX-015, TEN-010, CPI-203, CPI-0610, olinone, RVX-208, ABBV-744, AZD5153, MT-1, and MS645.
In embodiments of the invention wherein at least one target gene-region is selected from the group consisting of ATM, ATR, BRCA1, BRCA2, ERCC1, ERCC2, ERCC3, ERCC4, ERCC5, FANCA, FANCC, FANCD2, FANCE, FANCF, FANCG, MLH1, MSH2, MSH6, PALB2, RAD51B and RAD51C as defined in Table 1, the presence or alteration of one or more asCNA (and/or the presence of one or more somatic and/or germline mutations) in a target gene-region selected from the group consisting of ATM, ATR, BRCA1, BRCA2, ERCC1, ERCC2, ERCC3, ERCC4, ERCC5, FANCA, FANCC, FANCD2, FANCE, FANCF, FANCG, MLH1, MSH2, MSH6, PALB2, RAD51B and RAD51C as defined in Table 1, indicates that the subject would benefit from treatment with one or more class of drug that is known or suspected to target a DNA repair function. For example a PARP inhibitor, ATR inhibitor, DNA-PK inhibitor, immune checkpoint therapy (for example a PD-1 inhibitor, PD-L1 inhibitors, or a CTLA-4 inhibitor), CHK1 inhibitor, WEE1 inhibitor, platinum-based antineoplastic drug, or radionuclide or radiation therapy; in particular olaparib, rucaparib, niraparib or talazoparib, Veliparib, Pamiparib, Rucaparib, Veliparib, Berzosertib, ceralasertib, M4344, and BAY1895344 Flavopiridol (alvocidib), abemaciclib, ribociclib, Olomoucine, Roscovitine (Seliciclib), Purvalanol, Paullones, Butryolactone, Thio/oxoflavopiridols, Oxindoles, Aminothiazoles, Benzocarbazoles, Pyrimidines, AZD7648, M3814, CC-122, CC-115, pembrolizumab, nivolumab, cemiplimab, spartalizumab, asatezolizumab, avelumab, durvalumab, ipilimumab, V158411, PF-477736, AZD7762, CCT241533, Aminopyridine 7, adavosertib, cisplatin, carboplatin, oxaliplatin, nedaplatin, triplatin tetranitrate, picoplatin, satraplatin phenanthriplatin, radium-223, 225Ac-Labeled PSMA-617 and 177Lu-Labeled PSMA-617.
In one embodiment of the invention wherein at least one target gene-region is selected from the group consisting of ATM, ATR, BRCA1, BRCA2, ERCC1, ERCC2, ERCC3, ERCC4, ERCC5, FANCA, FANCC, FANCD2, FANCE, FANCF, FANCG, MLH1, MSH2, MSH6, PALB2, RAD51B and RAD51C as defined in Table 1, the presence or alteration of one or more asCNA (and/or the presence of one or more somatic and/or germline mutations) in a target gene-region selected from the group consisting of ATM, ATR, BRCA1, BRCA2, ERCC1, ERCC2, ERCC3, ERCC4, ERCC5, FANCA, FANCC, FANCD2, FANCE, FANCF, FANCG, MLH1, MSH2, MSH6, PALB2, RAD51B and RAD51C as defined in Table 1, indicates that the subject would benefit from treatment with a PARP inhibitor, for example olaparib, rucaparib, niraparib or talazoparib, Veliparib, Pamiparib, Rucaparib, and Veliparib; and in particular olaparib, rucaparib, niraparib or talazoparib
In embodiments of the invention wherein at wherein at least one target gene-region is selected from the group consisting of ASXL1, CLU, CYLD, ERG_TMPRSS2, GNAS, IDH1, IDH2, NFE2L2, NKX3-1, RUNX1, SPOP and ZFHX3 as defined in Table 1, the presence or alteration of one or more asCNA (and/or the presence of one or more somatic and/or germline mutations) in a target gene-region selected from the group consisting of ASXL1, CLU, CYLD, ERG_TMPRSS2, GNAS, IDH1, IDH2, NFE2L2, NKX3-1, RUNX1, SPOP and ZFHX3 as defined in Table 1, indicates that the subject would benefit from treatment with one or more class of drug as follows: ATR inhibitor, CDK inhibitor, Chemotherapy (such as taxanes and platinum-based antineoplastic drugs), WEE1 inhibitor, Aurora kinase inhibitor, PARP inhibitor, DNA-PK inhibitor, immune checkpoint therapies (for example a PD-1 inhibitor, PD-L1 inhibitors, or a CTLA-4 inhibitor), CHK2 inhibitor, WEE1 inhibitors, platinum-based antineoplastic drug, taxane, radionuclide and radiation therapy, PI3K inhibitor, PORCN inhibitor, FZD antagonists/monoclonal antibody, inhibitor of Wnt target genes, or hormonal agent, such as a LHRH agonist, LHRH antagonist, anti-androgen, androgen synthesis inhibitor, estrogen or steroid.
In embodiments of the invention wherein at least one target gene-region is selected from the group consisting of AKT1, AKT2, AKT3, MET, PIK3C, PIK3CB, PIK3R1 and PTEN as defined in Table 1, the presence or alteration of one or more asCNA (and/or the presence of one or more somatic and/or germline mutations) in a target gene-region selected from the group consisting of AKT1, AKT2, AKT3, MET, PIK3C, PIK3CB, PIK3R1 and PTEN as defined in Table 1, indicates that the subject would benefit from treatment with one or more class of drug that is known or suspected to target the PI3K signalling pathway. For example a PI3K inhibitor or mTOR inhibitor; in particular idelalisib, copanlisib, duvelisib, alpelisib, umbralisib, dactolisib, voxtalisib, Taselisib, Idelalisib, Buparlisib, Duvelisib, Copanlisib, rapamycin, deforolimus, dactolisib, voxtalisib, temsirolimus, everolimus, sapanisertib, AZD8055, and AZD2014.
In embodiments of the invention wherein at least one target gene-region is selected from the group consisting of AKT1, AKT2, AKT3, MET, PIK3C, PIK3CB, PIK3R1 and PTEN (and in particular PTEN) as defined in Table 1, the presence or alteration of one or more asCNA (and/or the presence of one or more somatic and/or germline mutations) in a target gene-region selected from the group consisting of AKT1, AKT2, AKT3, MET, PIK3C, PIK3CB, PIK3R1 and PTEN (and in particular PTEN) as defined in Table 1, indicates that the subject would benefit from ceasing or altering treatment with one or more hormonal agent, such as LHRH agonists, LHRH antagonists, anti-androgens, androgen synthesis inhibitors, estrogens and steroids; and in particular ceasing or altering treatment with one or more of the following cancer treatments: leuprolide, goserelin, triptorelin, histrelin, degarelix, abiraterone, ketoconazole, flutamide, bicalutamide, nilutamide, enzalutamide, apalutamide or darolutamide, estrogens prednisone or dexamethasone.
In embodiments of the invention wherein at least one target gene-region is selected from the group consisting of APC, CTNNB1 and RNF43 as defined in Table 1, the presence or alteration of one or more asCNA (and/or the presence of one or more somatic and/or germline mutations) in a target gene-region selected from the group consisting of APC, CTNNB1 and RNF43 as defined in Table 1, indicates that the subject would benefit from treatment with one or more class of drug that is known or suspected to target the Wnt signalling pathway. For example a PORCN inhibitor, FZD antagonists/monoclonal antibody, or inhibitor of Wnt target genes; in particular WNT974, ETC-1922159, CGX1321, Vantictumab, Ipafricept, or OTSA101-DTPA-90Y.
The cancer treatments mentioned above, when used or administered in a method of the invention, may be used, for example, in those amounts indicated in the Physicians' Desk Reference (PDR) or as otherwise determined by one of ordinary skill in the art. The amount of the active ingredient of the cancer treatment which is required to achieve a therapeutic effect will, of course, vary with the particular compound, the route of administration, the subject under treatment, including the type, species, age, weight, sex, and medical condition of the subject and the renal and hepatic function of the subject, and the particular cancer being treated, as well as its severity. An ordinarily skilled physician, veterinarian or clinician can readily determine and prescribe the effective amount of the drug required to benefit the subject.
The present invention also provides a method for treating cancer in a subject comprising performing a method of the invention or performing an assay of the invention, and further comprising administering to the subject a therapeutically effective dose of a cancer treatment. A method of treatment of the present invention may be performed before and/or after a method of the invention defined herein has been performed, or an assay has been performed.
In certain embodiments a method for treating cancer of the present invention comprises administering to the subject a therapeutically effective dose of a cancer treatment after a method of the invention has been performed, or after an assay of the invention has been performed, for example after the subject has been determined to benefit from treatment with one or more cancer treatments, or has benefited, or is benefiting, from one or more cancer treatments and/or benefit from ceasing or altering one or more cancer treatments.
In one embodiment, a method for treating cancer of the present invention comprises administering a therapeutically effective dose of a cancer treatment to the subject for at least 1 week, 2 weeks, 3 weeks, 4 weeks, 1 month, 2 months, 3 months, 6 months, 9 months, 12 months, 24 months or 36 months. A therapeutic agent for the treatment of prostate cancer may be administered, for example, daily, every second day, twice per week, weekly or monthly.
A therapeutically effective dose of a cancer treatment may be administered in amounts and at frequencies indicated in the Physicians' Desk Reference (PDR) or as otherwise determined by one of ordinary skill in the art.
The present invention also provides a method for treating a subject having a cancer (for example prostate cancer or metastatic cancer) with cancer therapy, the method comprising:
A cancer treatment may be selected from the group consisting of a ATR inhibitor, CDK inhibitor, Chemotherapy (such as taxanes and platinum-based antineoplastic drugs), WEE1 inhibitor, Aurora kinase inhibitor, alkylating agent, PARP inhibitor, DNA-PK inhibitor, immune checkpoint therapies (for example a PD-1 inhibitor, PD-L1 inhibitor, or a CTLA-4 inhibitor), CHK2 inhibitor, WEE1 inhibitor, platinum-based antineoplastic drug, taxane, c-Met inhibitor, radionuclide and radiation therapy, DNMT1 inhibitor, HDAC inhibitor, BET inhibitor, PI3K inhibitor, PORCN inhibitor, FZD antagonists/monoclonal antibody, inhibitor of Wnt target genes, or hormonal agent, such as a LHRH agonist, LHRH antagonist, anti-androgen, androgen synthesis inhibitor, estrogen or steroid. In certain embodiments a cancer treatment may be selected from the group consisting of a chemotherapy (such as a taxane (in particular docetaxel and cabazitaxel) and platinum-based antineoplastic drug (in particular carboplatin)), PARP inhibitor, or hormonal agent, such as a LHRH agonist, LHRH antagonist, anti-androgen, androgen synthesis inhibitor, estrogen or steroid.
Alternatively, or additionally, the present invention also provides a method for treating a subject having a cancer (for example prostate cancer or metastatic cancer) with cancer therapy, the method comprising:
An alternative cancer treatment may be selected from the group consisting of a ATR inhibitor, CDK inhibitor, Chemotherapy (such as taxanes and platinum-based antineoplastic drugs), WEE1 inhibitor, Aurora kinase inhibitor, alkylating agent, PARP inhibitor, DNA-PK inhibitor, immune checkpoint therapies (for example a PD-1 inhibitor, PD-L1 inhibitor, or a CTLA-4 inhibitor), CHK2 inhibitor, WEE1 inhibitor, platinum-based antineoplastic drug, taxane, radionuclide and radiation therapy, DNMT1 inhibitor, HDAC inhibitor, BET inhibitor, PI3K inhibitor, PORCN inhibitor, FZD antagonists/monoclonal antibody, inhibitor of Wnt target genes, or hormonal agent, such as a LHRH agonist, LHRH antagonist, anti-androgen, androgen synthesis inhibitor, estrogen or steroid. In certain embodiments an alternative cancer treatment may be selected from the group consisting of a chemotherapy (such as a taxane (in particular docetaxel and cabazitaxel) and platinum-based antineoplastic drug (in particular carboplatin)), PARP inhibitor, or hormonal agent, such as a LHRH agonist, LHRH antagonist, anti-androgen, androgen synthesis inhibitor, estrogen or steroid.
The present invention also provides a method of treating a subject in need of treatment with a PARP inhibitor (for example a subject having cancer, prostate cancer or metastatic cancer), comprising
In such embodiments, preferably at least one, at least 2, at least 3, at least 5, at least 6, at least 10, at least 15 or at least 20 target gene-regions is/are selected from the group consisting of ATM, ATR, BRCA1, BRCA2, ERCC1, ERCC2, ERCC3, ERCC4, ERCC5, FANCA, FANCC, FANCD2, FANCE, FANCF, FANCG, MLH1, MSH2, MSH6, PALB2, RAD51B and RAD51C. In such embodiments, the presence or alteration of one or more asCNA (and/or the presence of one or more somatic and/or germline mutations) in a target gene-region selected from the group consisting of ATM, ATR, BRCA1, BRCA2, ERCC1, ERCC2, ERCC3, ERCC4, ERCC5, FANCA, FANCC, FANCD2, FANCE, FANCF, FANCG, MLH1, MSH2, MSH6, PALB2, RAD51B and RAD51C as defined in Table 1, indicates that the subject would benefit from treatment with a PARP inhibitor, for example olaparib, rucaparib, niraparib or talazoparib, Veliparib, Pamiparib, Rucaparib, and Veliparib; and in particular olaparib, rucaparib, niraparib or talazoparib.
The present invention also provides a method of treating a subject in need of treatment with an ATR inhibitor (for example a subject having cancer, prostate cancer or metastatic cancer), comprising
In such embodiments, preferably at least one, at least 2, at least 3, at least 5, at least 6, at least 10, at least 15, at least 20, at least 30 or at least 35 of target gene-regions is/are selected from the group consisting of ATM, ATR, BRCA1, BRCA2, ERCC1, ERCC2, ERCC3, ERCC4, ERCC5, FANCA, FANCC, FANCD2, FANCE, FANCF, FANCG, MLH1, MSH2, MSH6, PALB2, RAD51B, RAD51C, ASXL1, CLU, CYLD, ERG_TMPRSS2, GNAS, IDH1, IDH2, NFE2L2, NKX3-1, RUNX1, SPOP and ZFHX3. In such embodiments, the presence or alteration of one or more asCNA (and/or the presence of one or more somatic and/or germline mutations) in a target gene-region selected from the group consisting of ATM, ATR, BRCA1, BRCA2, ERCC1, ERCC2, ERCC3, ERCC4, ERCC5, FANCA, FANCC, FANCD2, FANCE, FANCF, FANCG, MLH1, MSH2, MSH6, PALB2, RAD51B, RAD51C, ASXL1, CLU, CYLD, ERG_TMPRSS2, GNAS, IDH1, IDH2, NFE2L2, NKX3-1, RUNX1, SPOP and ZFHX3 as defined in Table 1, indicates that the subject would benefit from treatment with a ATR inhibitor, for example Berzosertib, ceralasertib, M4344, and BAY1895344, and in particular Berzosertib and ceralasertib
The present invention also provides a method of treating a subject in need of treatment with an immune checkpoint therapy (for example a subject having cancer, prostate cancer or metastatic cancer), comprising
In such embodiments, preferably at least one, at least 2, at least 3, at least 5, at least 6, at least 10, at least 15, at least 20, at least 30 or at least 35 of target gene-regions is/are selected from the group consisting of ATM, ATR, BRCA1, BRCA2, ERCC1, ERCC2, ERCC3, ERCC4, ERCC5, FANCA, FANCC, FANCD2, FANCE, FANCF, FANCG, MLH1, MSH2, MSH6, PALB2, RAD51B, RAD51C, ASXL1, CLU, CYLD, ERG_TMPRSS2, GNAS, IDH1, IDH2, NFE2L2, NKX3-1, RUNX1, SPOP and ZFHX3. In such embodiments, the presence or alteration of one or more asCNA (and/or the presence of one or more somatic and/or germline mutations) in a target gene-region selected from the group consisting of ATM, ATR, BRCA1, BRCA2, ERCC1, ERCC2, ERCC3, ERCC4, ERCC5, FANCA, FANCC, FANCD2, FANCE, FANCF, FANCG, MLH1, MSH2, MSH6, PALB2, RAD51B, RAD51C, ASXL1, CLU, CYLD, ERG_TMPRSS2, GNAS, IDH1, IDH2, NFE2L2, NKX3-1, RUNX1, SPOP and ZFHX3 as defined in Table 1, indicates that the subject would benefit from treatment with an immune checkpoint therapy, for example a PD-1 inhibitor such as pembrolizumab, nivolumab, cemiplimab, and spartalizumab; a PD-L1 inhibitor such asatezolizumab, avelumab and durvalumab; or CTLA-4 inhibitors such as ipilimumab.
The invention has been described broadly and generically herein. Those of ordinary skill in the art will readily appreciate that all parameters, dimensions, materials, and configurations described herein are meant to be exemplary and that the actual parameters, dimensions, materials, and/or configurations will depend upon the specific application or applications for which the teachings of the present invention is/are used. Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments of the invention described herein. It is, therefore, to be understood that the foregoing embodiments are presented by way of example only and that, within the scope of the appended claims and equivalents thereto, the invention may be practiced otherwise than as specifically described and claimed. The present invention is directed to each individual feature, system, article, material, kit, and/or method described herein. In addition, any combination of two or more such features, systems, articles, materials, kits, and/or methods, if such features, systems, articles, materials, kits, and/or methods are not mutually inconsistent, is included within the scope of the present invention. Further, each of the narrower species and subgeneric groupings falling within the generic disclosure also form part of the invention. This includes the generic description of the invention with a proviso or negative limitation removing any subject matter from the genus, regardless of whether or not the excised material is specifically recited herein.
The contents of the articles, patents, and patent applications, and all other documents and electronically available information mentioned or cited herein, are hereby incorporated by reference in their entirety to the same extent as if each individual publication was specifically and individually indicated to be incorporated by reference. The applicant reserves the right physically to incorporate into this application any and all materials and information from any such articles, patents, patent applications, or other physical and electronic documents.
The following Examples illustrate the invention.
Target genes were selected for inclusion in the Example 1 SNP panel by the inventors based on at least one of the following criteria: i) recurrent copy number changes or point mutation in localized and/or advanced prostate cancer (PCa) based on N=278 tumors (1-3), ii) potential therapeutic relevance. This lead to the selection of 70 autosomal genes. Further, the genomic region on 21q between TMPRSS2 and ERG that undergoes interstitial deletion as mechanism of gene rearrangement was also selected. The genomic region on 21q between TMPRSS2 and ERG may be referred to as a target “gene” of the Example 1 SNP panel. These target genes are listed in column 1 of Table 1.
To optimize data quality, processing and downstream analysis, the inventors further selected 37 control genes across all chromosomes and having minimal aberration frequency across the 278 PCa reference dataset. Specifically, for each of the 39 chromosomal arms with available data, cytobands adjacent to telomeres or centromeres were first excluded, the remaining genes were then ranked for minimal aberration frequency (abs(log 2(T/N))<0.5), the top three were then considered, and then of those three genes for each arm, one gene was selected based on maximization of high MAF SNPs availability (see below). These genes are listed in column 1 of Table 2.
In addition, UGT2B17 and ZBTB9 genes, located in complex genomic regions and encompassed by high frequency germline copy number losses were also included as control genes (4). These genes are also listed in column 1 of Table 2.
It is noted that in the subsequent parts of this example, “gene-regions” are referred to rather than “genes”, due to the inclusion of flanking regions on a gene basis for enrichment of SNPs, as described in detail below.
To allow for tumor purity and ploidy estimations and to improve somatic copy number computation (5-7), high MAF SNPs for each target and control gene-region were included in the Example 1 SNP panel. Intronic and intergenic SNPs from dbSNP v144 with single reference/alternative bases and MAF>=20% were considered. For each gene an iterative selection strategy was implemented. Specifically, starting from gene coordinates, the number of high MAF SNPs in the selected genomic area was counted and, if lower than a threshold N, the genomic area was iteratively extended of 10 Kbp at both ends to either converge to N SNPs inclusion or to a maximum extension of 200 Kbp per side (the combination of the gene and flanking regions is referred to as the “gene-region” of the gene of interest). The value of N was set to 1,000 for a subset of target genes of special interest: BRCA2, ATM, RB1, NKX3-1, TP53, and PTEN; to 1,000 for the 21q area; and to 400 for the 64 other target genes; and to 300 for control genes. The gene-region of each target gene and control gene are reported in column 1 of Table 1 and Table 2, respectively. To optimize the selection of SNPs presenting high MAF across different ethnic groups, genotype data of ˜2,000 samples from the 1,000 Genome Project data (Genotype Data release 20130502; PMID: 26432245) were considered. For each SNP, the fraction of individuals with heterozygous genotype status (HetFrac) in all 1,000 Genome project dataset and across four different major ethnic groups (Africans, Europeans, South Asian and East Asian) was computed. For each gene, a subset of M<N SNPs was selected; K<M SNPs were selected among the ones with the highest HetFrac across the different ethnic groups, while M-K SNPs were selected among the ones with the highest HetFrac in the overall dataset. M and K were respectively equal to 500 and 200 for BRCA2, ATM, RB1, NKX3-1, TP53 and PTEN and for the 21q area; 200 and 100 for 64 other target genes; and to 150 and 100 for control genes. The numbers of SNPs included by design per gene-region are reported in column 1 of Table 1 for the target genes; and column 1 of Table 2 for the control genes. Each SNP selected for each target gene-region and control gene-region are also listed.
As can be seen from column 1 of Tables 1 and 2, a total of 18,723 SNPs for target genes, of 8,392 SNPs for control genes. The number of SNPs included in the Example 1 SNP panel also allows for ethnicity inference and annotation (8) and for sample identity check (9). The final Example 1 SNP panel covers a total of 109 gene-regions. All coordinates in Tables 1 and 2 refer to the reference genome hg19.
Additionally, exonic (i.e. coding) regions of the following additional target genes on the X-chromosome were included in the panel: AR, MED12, SMARCA1, and KDM6A; exonic (i.e. coding) regions of the following additional target gene was included in the panel: IDH1; and exonic (i.e. coding) regions of the following additional control genes on the X-chromosome were included in the panel: TEX11 and HDAC8.
Based on ENSEMBL annotations (reference genome hg19), exonic regions for 116 genes were retrieved (70 target genes+21q area+39 control genes+5 additional genes on the X-chromosome and the gene IDH1) and a BED file was compiled collecting exonic regions coordinates and corresponding gene symbols. The total genomic size of all selected exonic regions is 824,144 bp with 645,667 bp for target genes and 178,477 bp for control genes.
To test the performance and utility of the Example 1 SNP panel of target and control gene-regions and selected SNPs, the panel was applied to samples from subjects having prostate cancer and samples from healthy volunteers.
Patients study were enrolled at 3 institutions on Institutional Review Board (IRB)-approved protocols with written informed consent. All subjects signed IRB informed consent protocols at the respective institution. Patients were eligible for this study if they were diagnosed with metastatic prostate cancer. Overall, the study included: 66 cfDNA samples from 44 patients at WCM, 16 cfDNA samples from 7 patients at Vancouver, 18 cfDNA samples from 15 patients at UCL, 26 cfDNA and 6 tissue samples from 3 patients collected under the UCL Biobank protocol Biobank Ethical Review Committee (B-ERC) project reference number and title: NC24.20 Treatment resistance aberrations in prostate cancer. 4 Healthy donor plasmas were purchased from Cambridge Bioscence. Information on the patients in the cohort is provided in
Blood collection was performed with ethylenediaminetetraacetic acid (EDTA) tubes, kept at 4° C. and processed within 2 hours from collection. Plasma separation was performed with a double spin protocol. First, the whole blood was centrifuged at 1600 rcf for 15 minutes at 4° C., then the separated plasma was collected and centrifuged again at 3000 rcf for 10 minutes at 4° C. The plasma was then divided into 1.8 mL aliquots and stored at −80° C. The buffy coat was collected after the first centrifuge, divided into 250 μL aliquots and stored at −80° C.
cfDNA and gDNA Extraction
Cell free DNA (cfDNA) is extracted starting from 1.8 mL plasma with QIAGEN QIAamp Circulating Nucleic Acid Kit according to the manufacturer's protocol and eluted in 30 μL Tris HCl 10 mM pH 8. The obtained cfDNA is then quantified using Qubit dsDNA High Sensitivity Assay and the quality is assessed with Agilent Bioanalyzer High Sensitivity DNA Kit.
Non-tumor genomic DNA (gDNA) is obtained from white blood cells, and is used as a matched control. The gDNA is extracted from 200 μL buffy coat with QIAGEN QIAamp DNA Mini Blood Kit and eluted in 200 μL Tris HCl 10 mM pH 8. The extracted gDNA is quantified using NanoDrop.
gDNA for library preparation was fragmented with Covaris M220. Libraries for target sequencing were prepared starting from 25 and 100 ng cfDNA and gDNA respectively with KAPA HyperPrep Kit (Roche) following the SeqCap EZ HyperCap v2.3 protocol with the following modifications. For hybridization of the probes, up to 8 cfDNA or gDNA samples were pooled together to obtain a combined mass of 2 μg and incubated for capture at 47° C. for 72 hours. The captured DNA was then amplified for 13 cycles. Pre- and post-captured libraries were quantified using Qubit dsDNA High Sensitivity Assay and the quality was assessed with Agilent Bioanalyzer High Sensitivity DNA Kit.
Sequencing of study samples (i.e. all samples collecting including patient samples and healthy volunteer samples) was performed at institution facilities at Weill Cornell Medicine (WCM), Vancouver, University College of London (UCL), or University of Trento with the following platforms: Novaseq 6000 (Illumina) for UCL, and HiSeq (Illumina) for all other institutions. Detailed information and sequencing statistics are reported in
Gene-region based read-depth estimation was performed similarly to as previously described (5). Briefly, mean amplicon depths of coverage are normalized for both GC content and sample mean coverage. Then, let {right arrow over (covT)} and {right arrow over (covC)} the vectors of normalized amplicon depth of coverage spanning a gene-region respectively in the tumor and matched control sample, the CN state (in log 2 scale) of the gene-region was computed using the following formula:
To improve confidence in the assessment of copy-number (CN) states, we adapted a previously developed procedure used to assess AR CN state (PMID: 26537258). That procedure measures the probability that an observed CN is compatible with the presence of aberrations accounting for stochastic noise in CN estimations. By computing control vs. control segmentation on Cornell cohort, the inventors observed that gene-region were associated with specific noise (see
Detection of focal CNA. In order to detect CNAs that span areas smaller than entire gene-regions, a simple iterative process was implemented. Given the set of gene-regions G within the Example 1 SNP panel (including control and target gene-regions, Gcontrol and Gtarget respectively), the set of amplicons within each gene-region (ampg) was subdivided into those spanning exonic/intronic regions (ampg
All control gene-regions (excluding gene-region containing UGT2B17 or ZBTB9) in Gcontrol were then used to compute per-gene-regions differences between the Log 2Ra and the respective Log 2R1 (gene only). The parameters of the reference normal distribution ref.distr with mean equal to meancontrol and standard deviation (s.d.) equal to sdcontrol were defined by taking the mean and the s.d. of the Log 2R differences, respectively. Last, for each target gene-region in Gtarget, the difftarget=|Log 2R1−Log 2Rn| was compared k times (by default 10,000) against simulated distributions (sim.distri, with i∈[1, k]), each built by sampling 10,000 times from ref.distr. The probability of focal lesion for each target gene was computed as:
With successi=1 if difftarget>max(sim.distri) or difftarget<min(sim.distri), 0 otherwise. To minimize noise-induced false positives, a lesion was only considered to be focal if P(focal)=1.
For each detected focal lesion, in order to define the exact boundaries of the lesion, the inventors then proceed as follows: i) identify the lowest (for losses) and highest (for gains) Log 2Ri within the corresponding gene-region and expand boundaries if two adjacent amplicons present monotonic values on both sides.
The following is a standard procedure to build a reference model using a set of N control samples. For a set of N control samples (genomic DNA from healthy cells, for example white blood cells), SNPs that are informative (a heterozygous call cut off of 0.2<AFSNP<0.8 was applied) in at least two of the N samples are selected. Summary statistics for each informative SNP across control samples are computed. Namely, for the AF distribution, mean, coefficient of variation, and proportion of samples out of N harboring the heterozygous genotype (
The collection of the summary statistics for each SNP was referred to as reference model (Refx; where x is one of the statistics). Of note, the reference model needs only to be computed once and can be applied cross-platform (i.e. can be used with different combinations of sequencing machines, reagents and sites where the samples have been processed). The use of control samples with intended sequencing coverage compared to that of plasma samples is suggested. For example, control samples can be obtained from white blood cells of one or more patients having cancer (e.g. matched control samples as described elsewhere) and/or samples from healthy volunteers. The requirement of the control samples in this step is the absence of tumor material in the sample.
Allelic imbalance was computed independently for each gene-region and cfDNA sample. First, the set of informative SNPs spanning a gene-region (SNPGR) is defined retaining only SNP positions in the Example 1 SNP panel with heterozygosity of 0.2<AF<0.8 in the matched control (i.e. the genomic DNA from the same patient as the cfDNA sample) and present in the reference model. For each SNP i∈SNPGR, observed local coverages, corresponding local coverage quantile and mirrored allele fractions in cfDNA sample were defined as COVi, qi and AFi, respectively.
The evidence of allelic imbalance for the gene-region was computed as:
where β is the proportion of neutral reads(16), dT is the observed mirrored AF distribution in cfDNA sample, Dβ is a simulated AF distribution generated sampling one time for each i∈SNPGR from a Normal distribution with mean Ref
and wilcox is a function returning 1 if the difference between dT and Dβ applying a Wilcoxon signed-rank test with significance cutoff of 1% is statistically significant, 0 otherwise.
Finally, the beta estimate for the gene-region in cfDNA sample (βT) was computed by comparing dT with simulated distributions mimicking levels of local admixture searching for the most similar one. Formally:
and where W(dT>Dβ) is the Wilcoxon signed-rank statistic (significance cutoff of 1%) comparing dT and Dβ.
Similarly, evidence of allelic imbalance E(AI)G and beta (βG) for each gene-region in the matched control sample are computed by substituting dT with dG in the equations above with dG defined as the observed mirrored AF distribution in matched control sample.
To correct for the Reference Mapping Bias (15) (RMB) and to improve the quality of downstream analysis of allelic fraction (AF) data of informative SNPs (iSNPs), a peak correction was applied separately to matched control and cfDNA samples. Specifically, a Kernel Density Estimation (KDE, performed on R using the function “density” from “stats” package with bw=“Si” (RDocumentation version 3.6.2 https://www.rdocumentation.org/packages/stats/versions/3.6.2)) was applied on the iSNPs AF distribution and peaks extracted by computing the local maxima of the smoothed distribution; the closest peak to RMB (by default 0.47) was extracted and data centered to the 0.5 theoretical value. RMB correction was applied both for the generation of reference model and the computation of allelic imbalance. Distribution of iSNPs allelic fractions in control (green) and cfDNA (red) samples before and after RMB correction is shown in
Tumor content (TC) and ploidy estimations for each sample are performed integrating the outputs of the ad-hoc procedures presented above within the CLONETv2 framework(16). In case of missing estimation by CLONETv2, for example at very low tumor content, the following procedure is applied for tumor content:
All CLONETv2 ploidy estimates are verified through visual inspection using the Log 2R-beta space (see
In order to define the allele-specific CN status of each gene-region, the decision tree depicted in
To obtain the copy number values of the two alleles, cnA and cnB (by design cnA>=cnB) for each gene-region, the following original equations are applied(16):
where Log 2Rp is the ploidy-corrected Log 2R of the gene-region and G is the admixture of the sample (i.e. 1−TC).
Computation of Allele-Specific Ploidy (asP)
Since CLONETv2 ploidy estimate does not recapitulate the actual amount of DNA per cell and in in low TC samples actual polyploid samples may be classified as diploid, the inventors adapted an allele-specific informed ploidy (asP) measure based on the allele-specific CN profile of each sample (Ciani, Y. et al., Allele-Specific Genomics is an Orthogonal Feature in the Landscape of Primary Tumors Phenotypes 4 Feb. 2021 SSRN: https://ssrn.com/abstract=3779554 or http://dx.doi.org/10.2139/ssrn.3779554), computed as the weighted mean of the allele-specific CN of each gene-region gr∈GR in the Example 1 SNP panel, that is:
where GR is the set of gene-regions covered by the Example 1 SNP panel and ws is the genomic size of the gene-region.
To detect somatic single nucleotide variants (SNVs) we applied ABEMUS(17)), a recently developed method specifically designed for SNVs detection in plasma samples. We ran ABEMUS with parameters reported in
SNVs were further annotated with Oncotator(18) (version 1.9.6.1) and only non-synonymous SNVs were retained. Germline variants were identified in control samples by looking for positions with AF≥0.15. Only positions annotated as “pathogenic” in ClinVar were retained (19).
Allele-Specific Informed Copy Number Calls without Matched Control Sample
The Example 1 SNP panel may also be used to detect allele-specific copy number also when a matched control sample for the patient is not available. In order to determine allele-specific copy number alterations in absence of a matched control sample the following are needed: a panel of normal (PON) to be used as control in read-depth estimation, a pre-computed reference model for allelic imbalance computation and a procedure for inference of informative SNPs directly from cfDNA sample. The PON can be routinely computed pooling together a set of non-tumor samples with comparable coverages and by computing the mean coverage of each amplicon across the samples selected. iSNPs can be inferred directly from the cfDNA sample, for example by applying thresholds on SNP AFs (e.g. 0.05<AF<0.95). Given the expected low ctDNA level of cfDNA samples, these thresholds guarantee that a discrete proportion of real iSNPs for the patient are recovered (see
To enhance the quantification of tumor signal and to enable accurate estimation of copy number changes of prostate cancer relevant genes, the inventors designed a custom targeted sequencing panel (the Example 1 SNP panel) that leverages individuals' genetics across gene-regions of interest for allelic imbalance estimations. The panel design couples information from large scale prostate cancer genomic studies (PMID: 26544944, PMID: 26000489, PMID: 26855148) and human genome polymorphisms features from the 1,000 Genome Project. Specifically, the inventors first identified a set of target genes comprising recurrently aberrant genes in localized and advanced prostate cancer studies and/or involved in frequently altered or targetable pathways, and a set of control genes known to be minimally aberrant in prostate cancer that would provide the backbone structure (e.g. wild-type status) for data analysis, including tumor ploidy and purity estimations (PMID: 31524989) (see
Next, as allelic imbalance can only be measured through heterozygous loci (here the SNPs in the Example 1 SNP panel where a subject has SNP heterozygosity in their genomic non-tumor DNA are referred to as informative SNPs (iSNPs); PMID: 27270079, PMID: 31524989, PMID: 20837533), the inventors designed an optimised panel that is enriched for high minor allele frequency (MAF) SNPs to maximize the detection sensitivity. To cover a high number of potential iSNPs per each targeted genomic region and for each patient, the Example 1 SNP panel was designed to include both exonic and intronic areas of the selected target and control genes and upstream and downstream regions were iteratively added to increase the number of SNPs covered for genes (see Method). The combination of exonic/intronic and flanking regions are referred to as the gene-region of the gene of interest. Overall, the Example 1 SNP panel includes a total of 109 gene-regions and covering 27,115 high MAF SNPs (see
Finally, to improve accuracy for assessing copy number state of targeted genes and to assist in increasing the sensitivity in detecting imbalances also in low and moderate ctDNA level samples (<15%), the inventors tailored an ad hoc method for asCNA assessment taking full advantage of the Example 1 SNP panel design. The method integrates i) a read-depth estimation approach modeling gene-specific sequencing coverage noise (see
False positive rate (FPR). To assess the performance of the Example 1 SNP panel, serial plasma samples (N=66) and white blood cells (WBCs) were sequenced from 44 individuals with mCRPC (“Cornell dataset”) and plasma samples from 3 healthy volunteers were sequenced independently at four institutions. First the false positive rate (FPR) of the gene-region based method for allelic imbalance detection was evaluated by defining false positives allelic imbalance calls in control samples (for lack of a gold standard). The Example 1 SNP panel FPR in the set of 44 individuals was 0.12%, significantly lower than the rate obtained through a coverage-based only approach (8.5%). Next the impact of the reference model cardinality (i.e. number of control samples used for reference model generation) on the FPR was investigated by iteratively building a total of 80 reference models by randomly selecting 10, 20, 30, and 40 control samples from the Cornell dataset at each iteration (20 reference models per each cardinality). An overall low number of false positives (FPR<0.3%) independent of the reference model cardinality was observed, supporting the benefit of the gene-region based approach (see
Impact of informative SNPs (iSNP) numerosity on imbalance detection. To measure the impact of the high MAF SNPs-enriched design of the Example 1 SNP panel on allelic imbalance detection, the inventors applied the panel on the Cornell dataset by using for each target gene-region all the available iSNPs per individual (i.e. all heterozygous SNPs that an individual has in the Example 1 SNP panel) and by considering randomly selected subsets (i.e. 20, 40, 60 and 80% of all the available iSNPs per individual). Results confirmed that a higher number of iSNPs is associated with enhanced detection of imbalance events in tumor samples (see
Allelic imbalance detection as function of tumor content via synthetic dilutions. To assess the performances of the method for allelic imbalance detection in the context of varying ctDNA fraction, tumor reads from cfDNA samples from 5 patients from the Cornell cohort were synthetically admixed with reads from each patients' matched control sample. A total of 100 synthetically diluted samples spanning ctDNA levels from 20% to 1% were generated. The ability of the method to consistently detect allelic imbalance at diminishing the ctDNA level was monitored. Of note, the method was able to detect signal of imbalance down to 5% ctDNA level with more than 50% of imbalance calls recovered at 15% ctDNA level (see
The ability of the method to detect imbalance was further characterized by focusing on 4 gene-regions of the Example 1 SNP Panel (i.e. NKX3-1, TP53, RB1 and ERG-TMPRSS2 regions) and a diverse ability to detect imbalance both across patients and across gene-regions was observed (see
Comparison of Example 1 SNP Panel with an Independent Assay
In order to investigate the ability of the Example 1 SNP panel to monitor patients over time, serial samples (N=9) from 3 mCRPC patients originally reported in Annala et al. (PMID: 29367197) were studied and copy number calls reported were compared. Overall, the inventors observed concordant results between the Example 1 SNP panel and the assay described in Annala et al., with a remarkable ability of the Example 1 SNP panel to accurately recapitulate patients' CN aberrations as reported by Annala et al. (105/109 copy number aberrations, 96.33%), as well as to detect complex copy number aberrations and detect aberrations at low ctDNA levels.
In patient #110, the Example 1 SNP panel was able to detect at the earliest time point (TP-1) the hemizygous deletions of TP53 and CHD1, confirmed at the third time point (TP-3) at higher estimated ctDNA level (51% vs. 34%, for TP-3 and TP-1, respectively) by both the Example 1 SNP panel and the Annala et al. assay (see
When the Example 1 SNP panel was applied on samples from patient #55, polyploidy signal across all time points was consistently detected (ploidy of 2.78, 2.19, and 2.51 for TP-1, -2, and -3, respectively) (see
The Genomics of Patients Treated with PARP-Inhibitors
The Example 1 SNP panel was applied on serial samples from 3 patients (1 BRCA mutant+2 ATM mutant patients) treated with a PARP inhibitor. In the BRCA2 mutant patient, the Example 1 SNP panel was applied on 8 of the 9 samples collected (see
The results also show that the Example 1 SNP panel can be used track treatment effectiveness both through the lack of allelic imbalance signal for BRCA2 with treatment, indicating depletion of the clone harbouring the BRCA2 loss, and/or the reduction of ctDNA fraction. By using a gene-region design the panel is also able to increase the chance of picking up resistant clones emerging on treatment.
The Example 1 SNP panel was applied on 9/19 plasma samples collected from the first ATM mutant patient. At death, tissue samples from prostate and metastases were collected. A total of 6 tissue samples were sequenced (5 samples collected from metastases and one from the prostate; see
The Example 1 SNP panel was applied on 9 samples collected from the second ATM mutant patient during diverse treatments (see
Altogether the findings on both tested ATM mutant patient support a complex ATM copy number state within an aneuploid genome likely incongruent with loss of ATM protein, possibly explaining the resistance to treatment. In this context, the Example 1 SNP Panel could be used to generate a genomic predictor of whether a patient will respond to treatment prior to treatment initiation, thus minimizing side effects and exposure to unnecessary treatment.
Genome biology, 15, 439.
Number | Date | Country | Kind |
---|---|---|---|
2108237.5 | Jun 2021 | GB | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/GB2022/051447 | 6/9/2022 | WO |