The present disclosure relates to the use of biomarkers to predict mpMRI visibility of prostate cancer.
Multiparametric magnetic resonance imaging (mpMRI) has improved the management of localized prostate cancer, but 20% of clinically significant tumours are invisible to it 1. mpMRI could be used to reduce unnecessary needle biopsies2,3, but given this false-negative rate there is limited consensus on which men suspected of prostate cancer with negative mpMRI can safely avoid them4.
The biological processes that drive mpMRI invisibility are largely unknown, despite clinical differences in their presentation. Understanding the molecular differences in visible and invisible tumours could help to identify invisible tumours that are likely clinically aggressive. mpMRI visibility is associated with nimbosus5, a constellation of genomic, transcriptomic and histopathological features that signal aggressive prostate cancers. These include increased genomic instability, presence of intraductal carcinoma and/or cribriform architecture histology (IDC/CA), expression of long non-coding RNA SChLAP1, and hypoxias. Elevated hypoxia in visible tumours suggests that the tumour microenvironment might play a role in tumour visibility on mpMRI7, possibly due to differences in stromal organization8 that could lead to restricted water diffusion. mpMRl visibility is also associated with a signature of 7 RNAs: SNORA12, SNORA54, SNORD68, SNORD3A, SNORD33, SNORA37 and SCARNA5.5
mpMRl is routinely performed on prostate cancer patients whose tumour is categorized as International Society of Urological Pathology (ISUP) Grade Group 2 or above. For prostate cancer patients whose tumour is categorized as ISUP Grade Group 1, the standard of care is active surveillance. For prostate cancer patients whose tumour is categorized as ISUP Grade Group 2, however, there is little consensus on whether mpMRl should be performed because of the 20% false-negative rate and the delays, costs, and inter-observer variability associated with mpMRIs. Accordingly, mechanisms to more accurately and/or easily identify tumours that will be visible to mpMRl are desirable.
Multiparametric magnetic resonance imaging (mpMRl) has improved the diagnosis and risk-stratification of localized prostate cancer. About 20% of clinically significant tumours are invisible to mpMRl, defined as a PI-RADSv2 score of one or two. To understand the determinants of mpMRl visibility, the proteomes of twenty mpMRl-visible and twenty mpMRl-invisible ISUP Grade Group 2 tumours, along with histologically normal prostate adjacent to the tumour, were examined. Differences in the proteome of tumours, but not stroma, were associated with mpMRl-visibility. The proteomes of mpMRl-invisible tumours were more similar to that of histologically normal prostate mpMRl. It is demonstrated herein that mpMRl visibility can be predicted by a three-protein biomarker (AUC=0.88, 95% CI=0.77-0.98), and this signature is associated with the poor outcome of biochemical relapse after definitive local therapy.
An aspect of the present disclosure provides a method of identifying a prostate cancer tumour that is likely mpMRl visible in a subject, the method comprising obtaining a sample collected from the subject; measuring a polypeptide level of one or more mpMRl visibility biomarkers selected from SRD5A2, GNA11, CAPNS1, NCDN, WDR5 and/or LDHB, in the sample collected from the subject; wherein the level of the one or more mpMRl visibility biomarkers is indicative of mpMRl visibility of the tumour.
Another aspect of the present disclosure provides a method of selecting a subject with or suspected of having prostate cancer for mpMRl, the method comprising determining a polypeptide level of one or more mpMRl visibility biomarkers and selecting the subject for mpMRl when the level of the one or more mpMRl visibility biomarkers indicates that the tumour is mpMRl visible.
In some embodiments, the method further comprises performing mpMRl.
In some embodiments, the method further comprises repeating the method after an interval.
Another aspect of the present disclosure provides a method for prognosing or monitoring aggressiveness of a prostate cancer tumour in a subject, comprising determining a polypeptide level of one or more mpMRl visibility biomarkers, wherein the level of the one or more mpMRl visibility biomarkers is indicative of the prognosis of the subject.
In an embodiment, the prognosing comprises predicting an increased risk of biochemical relapse, wherein the subject is a subject that has undergone treatment.
In an embodiment, the method further comprises treating a patient determined to be at increased risk of biochemical relapse.
Another aspect of the present disclosure provides a use of the measure polypeptide level of the one or more mpMRl visibility biomarkers in a prostate sample collected from a subject with or suspected of having a prostate cancer tumour for selecting a suitable treatment plan and risk stratification.
In an embodiment, the prostate sample is a treatment-naïve tumour sample.
In an embodiment, the prostate sample is a tumour sample taken from a treatment-naïve subject.
Yet another aspect of the present disclosure provides a kit comprising at least two binding agents, each specific for a polypeptide selected from the mpMRl visibility biomarkers disclosed herein.
Other features and advantages of the present disclosure will become apparent from the following detailed description. It should be understood, however, that the detailed description and the specific examples while indicating preferred embodiments of the disclosure are given by way of illustration only, since various changes and modifications within the spirit and scope of the disclosure will become apparent to those skilled in the art from this detailed description.
An embodiment of the present disclosure will now be described in relation to the drawings in which:
The following is a detailed description provided to aid those skilled in the art in practicing the present disclosure. Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs. The terminology used in the description herein is for describing particular embodiments only and is not intended to be limiting of the disclosure. All publications, patent applications, patents, figures and other references mentioned herein are expressly incorporated by reference in their entirety.
As used herein, the term “mpMRI visibility” refers to a characterization of localized prostate cancer based on mpMRI evaluation of the tumour. Prostate tumours that are identified as invisible, are typically less aggressive and/or clinically significant and tumours identified as visible are typically more aggressive and/or clinically significant. For example, the tumour may be evaluated following the categories defined by the Prostate Imaging—Reporting and Data System (PI-RADS) version 2. PI-RAD v2 uses a 5-point scale in assessing the presence of a clinically significant tumour in the prostate gland based on mpMRl findings. The lower the category, the lower the likelihood that clinically significant cancer is present. The terms “mpMRl-invisible”, “invisible to mpMRl” and the like refer to a tumour that can be categorized as or that corresponds to PI-RADS v2 category 1 or category 2 using the PI-RADS scale. The terms “mpMRl-visible”, “visible to mpMRl” and the like refer to a tumour that is categorized as or corresponds to PI-RADS v2 category 3, 4 or 5.
“Clinically significant tumour” as used herein refers to a tumor that can surgically defined as Gleason score 7 or greater, tumor volume of 0.5 cm3 or greater, or tumour category T3 or greater (Seo J W et al. AJR Am J Roentgenol. 2017;209:W1-W9).
As used herein, the term “grade” refers to categorization of a tumour based on the aggressiveness of the tumour or the likelihood that the tumour will grow and spread. For example, the classical grading system for prostate cancer is the Gleason score, where a Gleason score of 6 or less indicates a low likelihood of the tumour metastasizing. Another commonly used grading system is the ISUP (International Society of Urological Pathology) system, where Grade Group 1 is the least aggressive and Grade Group 5 is the most aggressive. Patients with Grade Group 1 tumours are typically not treated with surgery, radiation or hormone therapy, and are monitored by active surveillance. Patients with Grade Group 2 are treated with surgery or radiation, although there is an increasing body of evidence suggesting that some patients with Grade Group 2 tumors may not need surgical or radiological intervention and would benefit from active surveillance as their tumors are less aggressive (Carlsson S et al. J Urol. 2020; 203:1117-1121.).A subset of these patients with Grade Group 2 tumors however have tumours that will metastasize and therefore would have benefited from treatment.
As used herein, the term “prognosing” refers to predicting the development of a disease, outcome of a treatment and/or a procedure in relation to the disease. Prognosis may be assessed on a variety of bases, including but not limited to biomarkers, clinical data, genetics etc.
As used herein, the term “biomarker” refers to any molecules, including but not limited to proteins, polypeptides, nucleic acids such as mRNA, lipids, metabolites, modifications thereof, that can be used as an indicator of a biological state, in the diagnosis/prognosis of a disease or disorder, and/or in the prediction of the outcome of a treatment or procedure. A biomarker may be used on its own, or in combination with other biomarkers and/or methods.
As used herein, the term “mpMRl visibility biomarker” refers to any biomarker described herein that can be used to predict whether a prostate cancer is mpMRl visible. As used herein, the term refers to any of the following: SRD5A2, GNA11, CAPNS1, NCDN, WDR5 and/or LDHB. A mpMRl visibility biomarker may be associated with mpMRl-visible tumour or with mpMRl-invisible tumour. For example, a mpMRl visibility biomarker can be detected in both mpMRl-visible tumour and mpMRl-invisible tumour but differentially expressed between them. As another example, a mpMRl visibility biomarker can only be detected in mpMRl-visible tumour. As another example, a mpMRl visibility biomarker can only be detected in mpMRl-invisible tumour.
The term “SRD5A2” refers to the protein 3-oxo-5-alpha-steroid 4-dehydrogenase 2 and encompasses variants, isoforms, mutant forms etc. The term also encompasses homologues in different species.
The term “GNA11” refers to the protein guanine nucleotide-binding protein subunit alpha-11 and encompasses variants, isoforms, mutant forms etc. The term also encompasses homologues in different species.
The term “CAPNS1” refers to the protein calpain small subunit 1 and encompasses variants, isoforms, mutant forms etc. The term also encompasses homologues in different species.
The term “NCDN” refers to the protein Neurochondrin and encompasses variants, isoforms, mutant forms etc. The term also encompasses homologues in different species.
The term “WDR5” refers to the protein WD repeat-containing protein 5 and encompasses variants, isoforms, mutant forms etc. The term also encompasses homologues in different species.
The term “LDHB” refers to the protein lactate dehydrogenase B and encompasses variants, isoforms, mutant forms etc. The term also encompasses homologues in different species.
As used herein, the term “PGA” refers to the proportion of genome altered by copy number aberrations (changes in copy number in reference to the diploid human genome) and is used as a measure of genomic instability. Proportion of genome altered is calculated by summing the total number of bases covered by the copy number aberrations divided by the total size of the whole genome. It is one of the hallmarks of mpMRl-visible tumours.
As used herein, the term “IDC/CA” means intraductal carcinoma or cribriform architecture, which represent unfavorable sub-histopathologies in localized prostate cancer as described in Chua et al, Eur Urol. 2017; 72:665-674.
As used herein, the terms “biochemical relapse”, “biochemical failure” and “biochemical recurrence” refer to a rise in PSA level in a prostate cancer subject after treatment and may be defined as a PSA level >0.2 ng/ml following radical prostatectomy and >2 ng/ml above the nadir after radiation therapy (Cornford et al. European Urology. 2017; 71:630-42).
As used herein, the terms “level”, “expression level” and the like when used in the context of a biomarker refers to the amount of the biomarker measured/detected in a sample. For example, the level of a protein biomarker refers to the measured/detected amount of the protein.
As used herein, the term “expression data” refers to data comprising information for determining the level of the one or more mpMRl visibility biomarkers. Expression data may be in any form. For example, it may be raw data or processed data; it may comprise different formats such as image files, spreadsheets etc.
As used herein, the terms “polypeptide” and “protein” refer to any chain of two or more natural or unnatural amino acid residues, regardless of post-translational modifications (e.g., glycosylation or phosphorylation).
The term “subject” also interchangeably referred to as patient, as used herein includes all members of the animal kingdom including mammals, and suitably refers to a human.
In understanding the scope of the present disclosure, the term “comprising” and its derivatives, as used herein, are intended to be open ended terms that specify the presence of the stated features, elements, components, groups, integers, and/or steps, but do not exclude the presence of other unstated features, elements, components, groups, integers and/or steps. The foregoing also applies to words having similar meanings such as the terms, “including”, “having” and their derivatives.
All definitions, as defined and used herein, should be understood to control over dictionary definitions, definitions in documents incorporated by reference, and/or ordinary meanings of the defined terms.
All references, patents and patent applications disclosed herein are incorporated by reference with respect to the subject matter for which each is cited, which in some cases may encompass the entirety of the document.
All of the features disclosed in this specification may be combined in any combination. Each feature disclosed in this specification may be replaced by an alternative feature serving the same, equivalent, or similar purpose. Thus, unless expressly stated otherwise, each feature disclosed is only an example of a generic series of equivalent or similar features.
The term “consisting” and its derivatives, as used herein, are intended to be closed ended terms that specify the presence of stated features, elements, components, groups, integers, and/or steps, and also exclude the presence of other unstated features, elements, components, groups, integers and/or steps.
Further, terms of degree such as “substantially”, “about” and “approximately” as used herein mean a reasonable amount of deviation of the modified term such that the end result is not significantly changed. These terms of degree should be construed as including a deviation of at least ±5% of the modified term if this deviation would not negate the meaning of the word it modifies.
More specifically, the term “about” means plus or minus 0.1 to 20%, 5-20%, or 10-20%, 10%-15%, preferably 5-10%, most preferably about 5% of the number to which reference is being made.
As used in this specification and the appended claims, the singular forms “a”, “an” and “the” include plural references unless the content clearly dictates otherwise. Thus, for example, a composition containing “a compound” includes a mixture of two or more compounds. It should also be noted that the term “or” is generally employed in its sense including “and/or” unless the content clearly dictates otherwise.
The definitions and embodiments described in particular sections are intended to be applicable to other embodiments herein described for which they are suitable as would be understood by a person skilled in the art.
The recitation of numerical ranges by endpoints herein includes all numbers and fractions subsumed within that range (e.g. 1 to 5 includes 1, 1.5, 2, 2.75, 3, 3.90, 4, and 5). It is also to be understood that all numbers and fractions thereof are presumed to be modified by the term “about.”
Further, the definitions and embodiments described in particular sections are intended to be applicable to other embodiments herein described for which they are suitable as would be under-stood by a person skilled in the art. For example, in the following passages, different aspects of the disclosure are defined in more detail. Each aspect so defined may be combined with any other aspect or aspects unless clearly indicated to the contrary. In particular, any feature indicated as being preferred or advantageous may be combined with any other feature or features indicated as being preferred or advantageous.
Although any methods and materials similar or equivalent to those described herein can also be used in the practice or testing of the present disclosure, examples of methods and materials are now described.
Multiparametric magnetic resonance imaging (mpMRI) is a valuable tool for the assessment of prostate cancer. Clinical applications of mpMRI include tumour detection, characterization, risk stratification, surveillance, and others. However, about 20% of clinically significant tumours are invisible to mpMRI.
Using a proteomics approach, it is demonstrated herein that unexpectedly, there is no difference between the histologically normal tissue adjacent to the tumour (NAT) of mpMRI-visible and mpMRI-invisible tumours. However, five proteins are differentially abundant between mpMRI-invisible and mpMRI-invisible tumours: SRD5A2, GNA11, CAPNS1, NCDN, WDR5 and/or LDHB. Three of the proteins, SRD5A2, GNA11 and WDR5 are associated with mpMRI visibility, PGA and IDC/CA. It is further demonstrated herein by machine-learning that a panel of three proteins, LDHB, GNA11 and SRD5A2, can predict mpMRI visibility.
Performing mpMRI on a subject whose tumour is mpMRI-invisible is undesirable because resources are wasted. Accordingly, provided herein are methods to predict mpMRI visibility of a tumour, comprising measuring a level of one or more mpMRI visibility biomarkers in sample obtained from the subject, wherein the one or more mpMRI visibility biomarkers is indicative of mpMRI visibility.
Prostate cancer is stratified into different grades that indicate the likelihood that the tumour will grow and/or spread. Where the likelihood is low, standard of care generally involves surveillance. The method disclosed herein can be used to aid in the determination of whether a subject diagnosed with low grade prostate cancer should be assessed by mpMRI. In one embodiment, the subject is a prostate cancer subject whose tumour is categorized as ISUP Grade 1. In another embodiment, the subject is a prostate cancer subject whose tumour is categorized as ISUP Grade 2. Other grading systems and categorization of prostate cancer may be used. For example, the T category of the tumour, serum PSA levels, Gleason score, and the combination thereof. Where a prostate cancer subject whose tumour is categorized as low grade (for example, ISUP Grade 1 or Grade 2) but predicted to be mpMRl-visible, assessment by mpMRl can be beneficial.
mpMRl-visible tumours are associated with genome instability (as indicated by PGA) and unfavourable histology (IDC/CA), and an increased risk of biochemical relapse and metastasis (Houlahan et al, Eur Urol. 2019; 76:18-23).
The methods disclosed herein can have a variety of uses in prostate cancer management including but not limited to selecting a suitable treatment plan and risk stratification.
In one aspect, the present disclosure provides methods for identifying tumour that is likely mpMRl visible in a subject. In some embodiments, the methods may comprise measuring a polypeptide level of one or more mpMRl visibility biomarkers in sample obtained from the subject, wherein the one or more mpMRl visibility biomarkers is indicative of mpMRl visibility. In some embodiments, the methods may comprise obtaining expression data for determining a polypeptide level of one or more mpMRl visibility biomarkers in the sample collected from the subject.
In some embodiments, the methods further comprise selecting the subject for mpMRl if the polypeptide level of the one or more mpMRl visibility biomarkers is indicative that the tumour is visible.
In some embodiments, the methods further comprise performing mpMRl.
mpMRl may be performed, for example, in a 3-T scanner with anti-peristaltic agent, with or without endorectal coil. Parameters can include for example T2 weighted image, dynamic contrast-enhanced image, and diffusion-weighted image. Other parameters can also be used.
In performing mpMRl, clinical guidelines for multi-parametric MRI of the prostate, such as ESUR prostate MR guidelines 2012 (Barentsz, J. O. et al. (2012). ESUR prostate MR guidelines 2012. European radiology, 22(4), 746-757, the content of which is incorporated herein in its entirety) may be followed.
In one aspect, the present disclosure provides methods for selecting a subject with or suspected of having prostate cancer for mpMRl. In some embodiments, the methods may comprise obtaining expression data for determining a polypeptide level of one or more mpMRl visibility biomarkers in the sample collected from the subject. In some embodiments, the methods further comprise performing mpMRl.
In another aspect, the present disclosure provides methods for prognosing or monitoring aggressiveness of a prostate cancer tumour in a subject. In some embodiments, the methods may comprise measuring a polypeptide level of one or more mpMRl visibility biomarkers in sample obtained from the subject, wherein the one or more mpMRl visibility biomarkers is indicative of mpMRl visibility. In some embodiments, the methods may comprise obtaining expression data for determining a polypeptide level of one or more mpMRl visibility biomarkers in the sample collected from the subject.
Prognosing is used as a broad sense and encompasses predicting disease progression such as risk of metastasis, risk of relapse, among others. In an embodiment, prognosing comprises predicting an increased risk of biochemical relapse in a subject that has undergone treatment.
In some embodiments, the methods further comprise treating a subject determined to be at an increased risk of biochemical relapse.
Using the methods disclosed herein, it may be determined that a subject is not to be selected for mpMRl. The subject may then be put under surveillance. It can be appreciated that it is possible the tumour of the subject may become mpMRl visible and/or aggressive at a later time. Accordingly, in some embodiments, the methods may be repeated for the subject after an interval.
It can be appreciated that the determining the level of the one or more mpMRl visibility biomarkers, the identifying tumours that are likely mpMRl visible, the selecting subjects for mpMRl, the prognosing and monitoring aggressiveness of the tumour may be performed by the same party or by different parties. For example, a physician at a hospital may send patient samples to a third party laboratory, who then generates expression data of the one or more mpMRl visibility biomarkers and provides the expression data to the physician.
Therefore, in some embodiments, the methods comprise obtaining a sample collected from the subject with prostate cancer or suspected of having prostate cancer. In some embodiments, the methods comprise obtaining expression data generated from a sample collected from the subject.
Predicting mpMRl visibility of a tumour from a polypeptide level of the one or more mpMRl visibility biomarkers may be done by any suitable method. In some embodiments, a statistical model is used. Suitable statistical models include but are not limited to logistic regression, linear discriminant analysis, multivariate adaptive regression splines, naïve Bayes, neural network, support vector machine, functional tree, LAD tree, Bayesian network, elastic net regression, and random forest. In some embodiments, the statistical model comprises a logistic regression model. In some embodiments, the statistical model is trained on the polypeptide level of the one or more mpMRl visibility biomarkers in a plurality of tumour samples with known mpMRl visibility. In an embodiment, the polypeptide level is log 2-transformed. In an embodiment, a cutoff value of 0.5 is used.
In some embodiments, predicting mpMRl visibility of a tumour comprises comparing a polypeptide level of the one or more mpMRl visibility biomarkers to a threshold level. The threshold level may be in any form, for example, a cutoff value or a range of values. In some embodiments, a polypeptide level above the threshold is indicative of the tumour visible to mpMRl. In some embodiments, a polypeptide level below the threshold is indicative of the tumour visible to mpMRl.
A threshold level can be determined from a plurality of control samples. In some embodiments, control samples are samples from healthy subjects not diagnosed with prostate cancer. In some embodiments, control samples are NATs. In some embodiments, the threshold level is pre-determined. In some embodiments, control samples and test tumour samples are analyzed concurrently.
In some embodiments, the methods comprise determining the difference in the expression level of the one or more mpMRl visibility biomarkers. In some embodiments, determining the difference in expression level comprises comparing the expression level or transformed expression level of two samples. In an embodiment, the transformed expression level can be a log 2 transformed expression level. In some embodiments, the difference is between a tumour sample and a NAT sample. In some embodiments, the difference is between a tumour sample and a sample from a healthy subject not diagnosed with prostate cancer. In some embodiments, the difference is between a mpMRl visible tumour sample and a mpMRl invisible tumour sample. In some embodiments, the difference is between a NAT sample of a mpMRl visible tumor and a NAT sample of a mpMRl invisible tumour. In one embodiment, the one or more mpMRl visibility biomarkers are selected from SRD5A2, GNA11, CAPNS1, NCDN, WDR5 and/or LDHB.
In one embodiment, the one or more mpMRl visibility biomarkers are at least 2, at least 3, at least 4, or at least 5 mpMRl visibility biomarkers. In one embodiment, the one or more mpMRl visibility biomarkers are the mpMRl visibility biomarkers.
For example, the one of more biomarkers can be 1, 2, 3, 4, 5 or 6 of the mpMRl visibility biomarkers.
In one embodiment, the mpMRl visibility biomarker is or comprises SRD5A2. In another embodiment, the mpMRl visibility biomarker is or comprises GNA11. In another embodiment, the mpMRl visibility biomarker is or comprises CAPNS1. In another embodiment, the mpMRl visibility biomarker is or comprises NCDN. In another embodiment, the mpMRl visibility biomarker is or comprises WDR5. In another embodiment, the mpMRl visibility biomarker is or comprises LDHB. In another embodiment, any one of the mpMRl visibility biomarkers can be combined with any one or more of the other mpMRl visibility biomarker.
The ability of individual mpMRl visibility biomarkers to predict mpMRl visibility is for example shown in Table 3.
In one embodiment, the one or more mpMRl visibility biomarkers comprise LDHB, SRD5A2 and GNA11. In one embodiment, the one or more mpMRl biomarkers comprise LDHB and SRD5A2. In one embodiment, the one or more mpMRl visibility biomarkers comprise SRD5A2 and GNA11. In one embodiment, the one or more mpMRl visibility biomarkers comprise LDHB and GNA11.
In one embodiment, the one or more mpMRl visibility biomarkers comprise CAPNS1 and GNA11. In one embodiment, the one or more mpMRl visibility biomarkers comprise CAPNS1 and SRD5A2. In one embodiment, the one or more mpMRl visibility biomarkers comprise CAPNS1 and LDHB, In one embodiment, the one or more mpMRl visibility biomarkers comprise CAPNS1 and WDR5. In one embodiment, the one or more mpMRl visibility biomarkers comprise CAPNS1 and NCDN. In one embodiment, the one or more mpMRl visibility biomarkers comprise GNA11 and WDR5. In one embodiment, the one or more mpMRl visibility biomarkers comprise GNA11 and NCDN. In one embodiment, the one or more mpMRl visibility biomarkers comprise SRD5A2 and WDR5. In one embodiment, the one or more mpMRl visibility biomarkers comprise SRD5A2 and NCDN. In one embodiment, the one or more mpMRl visibility biomarkers comprise LDNB and WDR5. In one embodiment, the one or more mpMRl visibility biomarkers comprise LDNB and NCDN, In one embodiment, the one or more mpMRl visibility biomarkers comprise WDR5 and NCDN.
The ability for pairs of mpMRl visibility biomarkers to predict mpMRl visibility is for example shown in Table 4.
In some embodiments, the sample is a prostate sample. In an embodiment, the prostate sample is a treatment-naïve tumour sample. In an embodiment, the prostate sample is taken from a treatment-naïve subject with prostate cancer.
In some embodiments, the sample is a prostate cancer biopsy. In an embodiment, the biopsy is a transrectal biopsy. In another embodiment, the biopsy is a transperineal biopsy. In some embodiments, the sample is a tumour tissue core sample.
In some embodiments, the sample is cryopulverized. In some embodiments, the sample is processed for protein extraction. In some embodiments, the proteins are digested. In some embodiments, digested proteins are analyzed by mass spectrometry.
In some embodiments, the level of the one or more biomarkers is measured by measuring protein levels. Any suitable methods for protein level measurement known in the art can be used, including but not limited to affinity-based assays, spectroscopy methods, and blotting methods. Affinity-based assays typically comprise the use of one or more binding agents that bind specifically the protein of interest. A variety of binding agents can be used, including but not limited to antibodies and fragments thereof, ligands, receptors, aptamers, oligonucleotides, and molecularly imprinted polymers. The binding agent may bind the full-length protein or a fragment thereof, an isoform, a pro-protein, a post-translationally modified protein etc.
In general, detecting or measuring the level of a biomarker through an affinity-based method comprises contacting a sample with one or more binding agents that specifically bind the biomarker. Each of the binding agents may comprise a different detectable label to allow detection of different mpMRl visibility biomarkers. Detectable labels and moieties include but not limited to florescent dyes such as FITC, Cy3, Cy5, radioisotopes such as iodine-125, enzymes such as horseradish peroxidase, alkaline phosphatase, β-galactosidase, acetylcholinesterase, and catalase, nanoparticles such as gold nanoparticles. Another example of a detectable label that can be used with a binding agent is an aptamer, as used for example in the SOMAscan proteomics technology. The SOMAscan proteomics technology has been used to bind and quantify various protein targets including for example LDHB 24.
In some embodiments, the binding agent can be directly conjugated to the detectable label or moiety. In other embodiments, the binding agent is not labelled and the biomarker-binding agent complex is detected with a secondary reagent, which is conjugated to a detectable label or moiety. The secondary reagent may bind the mpMRl visibility biomarker or the binding agent. Any suitable methods known in the art for conjugating binding agents to labels and moieties can be used.
The use of antibodies and fragments thereof in assays to measure protein levels is well known in the art. Such assays include, but are not limited to, immunochromatographic assay, enzyme-linked immunosorbent assay (ELISA), immunohistochemistry (IHC), western blotting, radioimmunoassays (RIA), fluorescent immunoassays, the practices of which are well known in the art (see, e.g., Ausubel, Frederick M. Current Protocols in Molecular Biology. New York: John Wiley & Sons, 1994, the content of which is incorporated by reference in its entirety). Another example of an antibody-based assay to measure proteins is the Proximity Extension Assay (PEA).
A person skilled in the art would know that each type of assay can come in different formats and any suitable format can be used with the methods disclosed herein. For example, suitable ELISA formats include but not limited to sandwich ELISA.
Examples of commercially available binding agents that can be used to specifically recognize the mpMRl visibility biomarkers disclosed herein include but are not limited to:
The present disclosure also provides kits comprising at least two binding agents, each specific for a polypeptide selected from SRD5A2, GNA11, CAPNS1, NCDN, WDR5 and/or LDHB.
The above disclosure generally describes the present application. A more complete understanding can be obtained by reference to the following specific examples. These examples are described solely for the purpose of illustration and are not intended to limit the scope of the application. Changes in form and substitution of equivalents are contemplated as circumstances might suggest or render expedient. Although specific terms have been employed herein, such terms are intended in a descriptive sense and not for purposes of limitation.
The following non-limiting examples are illustrative of the present disclosure.
Patient selection, mpMRI imaging acquisition and interpretation, tissue collection, and sample processing has been previously described5. In brief, patients with localized prostate cancer and solitary Gleason score 3+4 lesions >1.5 cm on final surgical pathology and with PI-RADSv2 1-2 (MRI-invisible) or PI-RADSv2 5 (MRI-visible) lesions were selected for molecular profiling by copy number profiling5, RNA-seq5, and proteomics. Patients underwent mpMRI in a 3-T scanner with anti-peristaltic agent, with or without endorectal coil. mpMRI images were reported by 1 of 3 experienced uroradiologists with 10-18 years of prostate mpMRI experience and a retrospective blinded review of the mpMRI invisible tumours were performed by a single uroradiologist. Tumour and normal tissue adjacent to the tumour (NAT) regions were annotated by a genitourinary pathologist. The relevant areas of tissue were macro-dissected from adjacent 10 μm sections for proteomics analysis.
Each scraped FFPE tissue region was placed in a 1.5 mL conical tube and deparaffinized with xylene as follows. 500 μL of buffer was added to each tube, then samples were vortexed at high speed for 30s, incubated for 5 minutes on an end-over-end nutator at room temperature, centrifuged at 18,000 rcf for 3 min, and the supernatant was discarded. The deparaffinization step was repeated twice. Tissues were then rehydrated with a graded ethanol series (95% ethanol, 90% ethanol, 75% ethanol, 50% ethanol, 25% ethanol, and water). Water was evaporated from each tube using a SpeedVac vacuum concentrator (Thermo) until 100 uL of water remained in each tube. 10 uL of 1M Tris pH 8.0 buffer was added to each sample to a final concentration of 100 mM Tris-HCl, pH 8. Samples were heated at 95° C. for 1 hour to reverse formalin-induced crosslinking, then sonicated on a probe-less ultrasonic sonicator for ten 10 second cycles at 10 Watts per tube (Hielscher VialTweeter).
100 μL 2,2,2-Trifluoroethanol was added to each tube to a final concentration of 50%. 2 pmol of SUC2 protein (Yeast invertase) was added as a digestion control. Disulphide bonds were reduced with 5 mM dithiothreitol, followed by 1 hour incubation at 60° C. Free sulfhydryl groups were alkylated by incubating samples in 25 mM iodoacetamide in the dark for 30 min at room temperature. Samples were diluted 1:5 with 100 mM ammonium bicarbonate with 2 mM CaCl2) (pH 8). Proteins were digested with 2 μg of trypsin/Lys-C enzyme mix (Promega) overnight at 37° C., then an additional 1 μg of trypsin/Lys-C enzyme mix was added in the morning and digestion continued at 37° C. for 1 hour. 1% FA was used to bring the sample pH<2. Peptides were desalted by C18-based solid phase extraction, then lyophilized in a SpeedVac vacuum concentrator. Peptides were solubilized in mass spectrometry-grade water with 0.1% formic acid. Peptide concentration was quantified using a NanoDrop Lite (at 280 nm).
2 μg of peptides was used for LC-MS/MS analysis. Synthetic iRT peptides (Biognosis) were spiked into each sample at a 1:10 ratio prior to data acquisition. LC-MS/MS data was acquired using an Easy nLC 1000 (Thermo) nano-flow liquid chromatography system with a 50 cm EasySpray ES803 column (Thermo) coupled to a Q Exactive HF (Thermo) tandem mass spectrometer. Peptides were separated by reverse phase chromatography using a 4-hour nonlinear chromatographic gradient of 4-48% buffer B (0.1% FA in ACN) at a flow rate of 250 nl/min. Column temperature was kept at 45° C. Mass spectrometry data was acquired in data dependent mode with a top 15 method. MS1 data was acquired at a resolution of 120,000, AGC target of 1e6, and maximum injection time (maxIT) of 30 ms, while MS2 data was acquired at a resolution of 30,000, AGC target of 1e5, and maxIT of 110 ms. Data was searched in MaxQuant (version 1.6.1.0) using a merged UniProt protein sequence database containing human protein sequences from Uniprot (complete human proteome; 2015-01-27, number of sequences 42,842), yeast invertase (Suc2) protein sequences from Uniprot, and iRT synthetic peptide sequences (Biognosis). Searches were performed with a maximum of two missed cleavages, and carbamidomethylation of cysteine as a fixed modification. Variable modifications were set as oxidation at methionine and methylation at lysine. The false discovery rate for the target-decoy search was set to 1% for protein, and peptide levels. Intensity-based absolute quantification (iBAQ), label-free quantitation (LFQ), and match between runs (matching and alignment time windows set as 0.7 and 20 min respectively) were enabled. The proteinGroups.txt file was used for subsequent analysis. Protein groups will be referred to as proteins in the text. Reverse hits were removed, and proteins identified with two or more peptides were carried forward. LFQ intensities were used for protein quantitation. For proteins with missing LFQ values, median-adjusted iBAQ values were used as replacement18.
Consensus clustering (maxK=6; reps=50; pltem=0.8; pFeature=1; distance=Pearson; innerLinkage=average; finalLinkage=average; ConsensusClusterPlus v1.52.0) was performed using divisive hierarchical clustering on the 25% most variable proteins in the cohort to cluster samples and proteins. Missing values were imputed with random values drawn from a normal distribution of protein abundances (width=0.2; down-shift=1.8)19. Adjusted Rand Index (ARI) (CrossClustering v.4.0.3) was calculated between sample subtypes generated from consensus clustering and two other subtypes: Tumour and NAT, or samples from patients with mpMRl-visible versus mpMRl-invisible tumours.
Differences in Proteins Detected
Mann-Whitney U-test for all comparisons—Tumour vs NAT using all samples, visible tumours only, or invisible tumours only. A linear mixed model with random effects: Imer(count ˜tumour*visible+(1 (subject)); REML=FALSE; ImerTest v.3.1-3) was used to test for independence between tumour vs. NAT (tumour) and invisible vs. visible (visibility) groups (estimatetumour=−113.26, Ptumour=1.33×10−9; estimatevisibility=−17.91, Pvisibility=0.268; estimatetumourvisibility=−8.16, Ptumourvisibility=0.570).
Differential Abundance Analysis
For each comparison, proteins present in >50% of the samples were kept for further analysis—visible versus invisible NAT (nsamples=40, nproteins=4,165), tumour versus NAT (nsamples=80, nproteins=4,314), visible versus invisible tumour (nsamples=40, nproteins=4,426). Proteins in the intersection of these 3 sets (nproteins=4,067) were used for differential expression analysis using the Mann-Whitney U-test, with multiple testing correction using the Benjamini-Hochberg method. Missing values were imputed with random values drawn from a normal distribution of protein abundances (width=0.2; down-shift=1.8)19. For sample D01 that had two tumour regions that were prepared separately (low-grade and high-grade tumour regions), maximum protein abundance for each protein was used for differential abundance analyses.
For Tumour vs. NAT comparison of protein-coding RNAs, the data used for the analyses described in this manuscript were obtained from the cBioPortal on July 2020(ntumour=499, nNAT=53). Mann-Whitney U-test was used for all comparisons, with multiple testing correction using the Benjamini-Hochberg method. Missing values were imputed with random values drawn from a random uniform distribution of RNA transcript per million counts between 0 and 1. All data was log 2-transformed.
Euclidean distance was calculated for each tumour and median NAT pair, using protein abundance. Only proteins detected in all samples (n=2,309) were used. IDC/CA groups were determined based on the presence of IDC or CA pathology (IDC/CA+, n=11) or not (IDC/CA−, n=29). Hypoxia groups were determined by median dichotomization (median score20=−1).
Pre-ranked gene set enrichment analysis (GSEA)21,22 (v.4.0.3, npermutations=1000; max size=500; min size=15; enrichment statistic=weighted; normalization mode: meandiv) was performed to identify hallmark gene sets23 that were enriched in each group. For each comparison (Tumour versus NAT, visible versus invisible tumour, and visible versus invisible NAT), proteins were ranked by log 2 fold change. GSEA was run on each group and adjusted for significance separately but visualized together to better show potential overlaps in hallmark gene sets.
Association Analysis of mpMRl Visibility Hallmarks on RNA and Protein Abundances
To identify protein-coding RNAs associated with mpMRl visibility hallmarks, univariate association tests—Spearman's p for continuous values, Mann-Whitney U test for binary values—were performed with each RNA (log 2-transformed transcripts per million (TPM)) in the discovery cohort9 (n=144) against the following mpMRl visibility hallmarks5: percent genome altered (PGA), hypoxia (Ragnum score), presence of intraductal carcinoma or cribriform architecture (IDC/CA), and expression of 8 RNAs (SChLAP1, SNORA12, SNORA54, SNORD68, SNORD3A, SNORD33, SNORA37, SCARNA5). RNAs that had <5 TPM in less than 2 samples were excluded from further analysis. Where the calculated p-value was <2×10—16, p-values were imputed based on the effect size. RNAs that were associated with at least one hallmark in the discovery cohort (FDR<0.2) were evaluated in this cohorts (n=40). Hallmark-associated mRNAs that had a corresponding protein detected (n=3,791) in our cohort were carried forward for validation of protein associations with visibility hallmarks (n=40). Proteins were considered validated if they were also significantly associated with the same hallmark at the protein level (FDR<0.2) and had the same directionality as the corresponding RNA association.
Protein Signature to Predict mpMRl-Visible Tumours
We considered proteins detected in all tumour samples (n=2,710) for mpMRl-visibility protein signature development. For sample D01 that had two tumour regions that were prepared separately, the maximum protein abundance for each protein was used. Protein abundances were log 2-transformed and selected for associations with mpMRl-visibility using leave-one-out (n=40) cross validation of Least Absolute Shrinkage and Selection Operator (LASSO) logistic regression(glmnet v4.1), with inner leave-one-out cross validation for lambda selection using the “one-standard-error” rule. In each fold, all 2,710 proteins were initially included as predictors to predict tumour visibility, and the number of times each predictor was selected by the LASSO model were tallied across all folds. Three proteins (LDHB, SRD5A2, GNA11) were consistently chosen as predictors (chosen in at least 15 out of 40 folds) and were used to build a logistic regression model to predict tumour visibility. The performance of the three-protein logistic regression model was assessed using leave-one-out (n=40) cross validation, with AUC and ROC confidence intervals calculated using the pROC package (v1.17.0.1). The three protein signature was tested for synergy in predicting tumour visibility with nimbosus hallmarks and our previously discovered snoRNA signatures using leave-one-out (n=40) cross validation of logistic regression models that included the additional predictors. The nimbosus hallmarks considered were PGA, SCHLAP1, hypoxia and IDC/CA. As described previously, significantly differentially abundant snoRNAs (FDR<0.05) were determined per folds. The three protein logistic regression model (LDHB, SRD5A2, GNA11) was also trained on our full cohort (n=40) and applied to an independent cohort of 76 intermediate-risk prostate cancer samples with log 2-transformed protein abundance of the three proteins9. The final logistic regression model had intercept of 274.388, and coefficients of (−)6.439, (−) 1.824 and (−)1.807 for LDHB, SRD5A2, GNA11, respectively. In this independent cohort, samples were dichotomized by logistic regression model predicted mpMRl-visibility probability at the cutoff of 0.5 and the sample groups were tested for differences in biochemical-relapse-free survival using Cox proportional-hazards modeling.
Global proteomic analysis on twenty mpMRl-invisible (PI-RADSv2 1-2) and twenty mpMRI-visible (PI-RADSv2 5) tumours, along with histologically normal tissue adjacent to the tumour (NATs) from all samples was performed using methods as described in Example 1 (also see
4,772 proteins were quantified of which 2,309 were detected in all 81 samples (
It was hypothesized that the tumour microenvironment might influence tumour visibility on mpMRI6,7. The abundance of each protein in NAT from patients with mpMRl-visible tumour was compared to that of mpMRl-invisible tumour using the analysis method as descried in Example 1 tumour. Surprisingly, not a single protein differed (
Given the modest differences between mpMRl-visible and -invisible tumor proteomes, it was hypothesized that mpMRl-invisible tumors might reflect an intermediate state between NATs and mpMRl-visibility. Consistent with this hypothesis, protein abundance differences associated with mpMRl-visibility were correlated with NAT-tumor differences (Spearman's p=0.37; p<1×10−16,
To identify protein-coding RNAs and proteins associated with mpMRl-visibility and disease aggression, the nimbosus hallmarks were focused on6,6. An independent discovery cohort of 144 NCCN intermediate-risk tumours was used to discover associations between RNA abundance and each hallmark as described in Example 1 9,17. Significant transcriptome associations were then validated in this cohort at the RNA levels, and confirmed in the proteome. 14,044 protein-coding RNAs and 1,622 proteins associated with at least one nimbosus hallmark were identified (
To identify a molecular biomarker that predicts mpMRl-visibility, statistical machine-learning was applied to the dataset as described in Example 1. This created a three-protein biomarker (LDHB, GNA11, SRD5A2) that could predict mpMRl-visibility with an AUC of 0.88 (0195%=0.77-0.98,
The ability for CAPNS1, GNA11, SRD5A2, LDHB, WDR5, NCDN each individually and in pairs to predict mpMRl visibility was assessed in the dataset as described in Example 1 (see Table 3 and Table 4).
The methods described herein can be used to predict mpMRl visibility in treatment-naïve prostate cancer patients, Biopsy tissues can be obtained by transrectal biopsy or transperineal biopsy, and tissue cores containing the tumour would be used for mass spectrometry. The entire tissue core can be cryopulverized, and proteins can be extracted for protein digestion and mass spectrometry analysis as described in Example 1.
While the present application has been described with reference to what are presently considered to be the preferred examples, it is to be understood that the application is not limited to the disclosed examples. To the contrary, the application is intended to cover various modifications and equivalent arrangements included within the spirit and scope of the appended claims.
All publications, patents and patent applications are herein incorporated by reference in their entirety to the same extent as if each individual publication, patent or patent application was specifically and individually indicated to be incorporated by reference in its entirety. Specifically, the sequences associated with each accession numbers provided herein including for example accession numbers and/or biomarker sequences (e.g. protein and/or nucleic acid) provided in the Tables or elsewhere, are incorporated by reference in its entirely.
The scope of the claims should not be limited by the preferred embodiments and examples, but should be given the broadest interpretation consistent with the description as a whole.
This application claims the benefit of U.S. provisional application Ser. No. 63/316,093 filed Mar. 3, 2022, the entire contents of which are hereby incorporated by reference.
This invention was made with government support under Grant Number CA214194 and CA016042, awarded by the National Institutes of Health. The US government has certain rights in the invention.
Number | Date | Country | |
---|---|---|---|
63316093 | Mar 2022 | US |