The invention generally relates to biomarkers for kidney cancer and methods based on the same biomarkers.
In the US, 275,000 patients each year are screened for kidney cancer, and 55,000 are diagnosed with renal cell carcinoma (RCC) (American Cancer Society Facts and Figures 2010). RCC is the most common form of kidney cancer, accounting for approximately 80% of the total. The incidence of RCC is steadily increasing, and in the US increased by approximately 2% per year in the past two decades (Ries L A G, et al., eds. SEER Cancer Statistics Review, 1975-2003. Bethesda, Md.: National Cancer Institute; 2006). Because RCC is one of the deadliest cancers and does not respond to traditional chemotherapy drugs, many new targeted agents are being developed specifically to treat RCC.
70% of newly diagnosed patients are diagnosed in the early stages (T1 and T2). Early stage RCC is treated by partial or total nephrectomy; this is surgery with curative intent. When RCC tumors are surgically removed at an early stage, the 5 year survival rate is 90% for stage 1 and 51% for stage 2, yet 70% of RCC patients develop metastasis during the course of their disease.
Often, kidney lesions or small renal masses (SRM) are discovered incidentally during examinations unrelated to suspected malignancy. While approximately 20% of SRM are benign, the remainder are cancerous. The traditional treatment for small renal masses is radical nephrectomy. Typically cancer-positive SRMs are relatively small and have a relatively slow growth rate. As such, cancer-positive SRMs are generally considered to have less aggressive potential, and thus a watchful waiting approach may be more appropriate than surgery (Bosniak M A, et al. J. Small renal parenchymal neoplasms: further observations on growth. Radiology 1995; 197: 589-597.). However, there are also incidentally detected small renal masses that can grow rapidly and have aggressive potential (Remzi M, et al. “Are small renal tumors harmless? Analysis of histopathological features according to tumors 4 cm or less in diameter”. J. Urol. 2006; 176 (3): 896-9.). Biomarkers for distinguishing which cancer-positive SRMs will be more aggressive, requiring surgery, and which will be slower growing and warrant a watchful waiting approach would be valuable.
Pharmaceutical companies have been developing targeted therapies for RCC, such as Sutent (sunitinib), Nexavar (sorafenib), Avastin (bevacizumab) and Torisel (temsirolimus). As of March 2011, there were 6 targeted agents in Phase I, 13 in Phase 2, 5 in Phase 3, and 8 with FDA approval for treatment of RCC. Currently, approximately 18% of the RCC patient population receives drug therapy. In the future, more patients are expected to receive treatment, driven by an increase in the number of treatment options, improvements in drug efficacy and the trend to use drug therapy earlier in the course of the disease (adjuvant or neo-adjuvant setting) (Espicom Business Intelligence, Market Report: Renal Cell Carcinoma Drug Futures, ISBN: 978-1-85822-396-4, March 2011).
In one aspect, the present invention provides a method of diagnosing whether a subject has kidney cancer, including subjects having an SRM, comprising analyzing a biological sample from a subject to determine the level(s) of one or more biomarkers for kidney cancer in the sample, where the one or more biomarkers are selected from Tables 1, 2, 4 and/or 11 and comparing the level(s) of the one or more biomarkers in the sample to kidney cancer-positive and/or kidney cancer-negative reference levels of the one or more biomarkers in order to diagnose whether the subject has kidney cancer.
In a further aspect, the invention provides a method of distinguishing kidney cancer from other urological cancers (e.g., bladder cancer, prostate cancer), comprising analyzing a biological sample from a subject to determine the level(s) of one or more biomarkers for kidney cancer in the sample where the one or more biomarkers are selected from Table 11 and comparing the level(s) of the one or more biomarkers in the sample to kidney cancer-positive and/or kidney cancer-negative reference levels of the one or more biomarkers in order to distinguish kidney cancer from other urological cancers.
In another aspect, the invention provides a method of monitoring progression/regression of kidney cancer in a subject comprising analyzing a first biological sample from a subject to determine the level(s) of one or more biomarkers for kidney cancer in the sample, where the one or more biomarkers are selected from Tables 1, 2, 4, 8, 10 and/or 11 and the first sample is obtained from the subject at a first time point; analyzing a second biological sample from a subject to determine the level(s) of the one or more biomarkers, where the second sample is obtained from the subject at a second time point; and comparing the level(s) of one or more biomarkers in the second sample to the level(s) of the one or more biomarkers in (a) the first sample (b) kidney cancer-positive reference levels of the one or more biomarkers, and/or (c) kidney cancer-negative reference levels of the one or more biomarkers in order to monitor the progression/regression of kidney cancer in the subject.
In another aspect, the present invention provides a method of determining the stage of kidney cancer, comprising analyzing a biological sample from a subject to determine the level(s) of one or more biomarkers for kidney cancer stage in the sample, where the one or more biomarkers are selected from Table 8; and comparing the level(s) of the one or more biomarkers in the sample to high stage kidney cancer and/or low stage kidney cancer reference levels of the one or more biomarkers in order to determine the stage of the subject's kidney cancer.
In a further aspect, the present invention provides a method of determining the aggressiveness of kidney cancer, comprising analyzing a biological sample from a subject to determine the level(s) of one or more biomarkers for kidney cancer aggressiveness in the sample, where the one or more biomarkers are selected from Table 10; and comparing the level(s) of the one or more biomarkers in the sample to more aggressive kidney cancer and/or less aggressive kidney cancer reference levels of the one or more biomarkers in order to determine the aggressiveness of the subject's kidney cancer.
In another aspect, the present invention provides a method of assessing the efficacy of a composition for treating kidney cancer comprising analyzing a biological sample from a subject having kidney cancer and currently or previously being treated with the composition, to determine the level(s) of one or more biomarkers for kidney cancer selected from Tables 1, 2, 4, 8, 10 and/or 11; and comparing the level(s) of the one or more biomarkers in the sample to (a) levels of the one or more biomarkers in a previously-taken biological sample from the subject, where the previously-taken biological sample was obtained from the subject before being treated with the composition, (b) kidney cancer-positive reference levels of the one or more biomarkers, and/or (c) kidney cancer-negative reference levels of the one or more biomarkers.
In another aspect, the present invention provides a method for assessing the efficacy of a composition in treating kidney cancer, comprising analyzing a first biological sample from a subject to determine the level(s) of one or more biomarkers for kidney cancer selected from Tables 1, 2, 4, 8, 10 and/or 11, the first sample obtained from the subject at a first time point; administering the composition to the subject; analyzing a second biological sample from the subject to determine the level(s) of the one or more biomarkers, the second sample obtained from the subject at a second time point after administration of the composition; comparing the level(s) of one or more biomarkers in the first sample to the level(s) of the one or more biomarkers in the second sample in order to assess the efficacy of the composition for treating kidney cancer.
In yet another aspect, the invention provides a method of assessing the relative efficacy of two or more compositions for treating kidney cancer comprising analyzing, from a first subject having kidney cancer and currently or previously being treated with a first composition, a first biological sample to determine the level(s) of one or more biomarkers selected from Tables 1, 2, 4, 8, 10 and/or 11; analyzing, from a second subject having kidney cancer and currently or previously being treated with a second composition, a second biological sample to determine the level(s) of the one or more biomarkers; and comparing the level(s) of one or more biomarkers in the first sample to the level(s) of the one or more biomarkers in the second sample in order to assess the relative efficacy of the first and second compositions for treating kidney cancer.
In another aspect, the present invention provides a method for screening a composition for activity in modulating one or more biomarkers of kidney cancer, comprising contacting one or more cells with a composition; analyzing at least a portion of the one or more cells or a biological sample associated with the cells to determine the level(s) of one or more biomarkers of kidney cancer selected from Tables 1, 2, 4, 8, 10 and/or 11; and comparing the level(s) of the one or more biomarkers with predetermined standard levels for the biomarkers to determine whether the composition modulated the level(s) of the one or more biomarkers.
In yet another aspect, the invention provides a method for treating a subject having kidney cancer comprising administering to the subject an effective amount of one or more biomarkers selected from Tables 1, 2, 4, 8, 10 and/or 11 that are decreased in kidney cancer.
The present invention relates to biomarkers of kidney cancer, methods for diagnosis or aiding in diagnosis of kidney cancer, methods of determining or aiding in determining the cancer status of a small renal mass (SRM) kidney cancer, methods of staging kidney cancer, methods of determining kidney cancer aggressiveness, methods of monitoring progression/regression of kidney cancer, methods of assessing efficacy of compositions for treating kidney cancer, methods of screening compositions for activity in modulating biomarkers of kidney cancer, methods of treating kidney cancer, as well as other methods based on biomarkers of kidney cancer. Prior to describing this invention in further detail, however, the following terms will first be defined.
“Biomarker” means a compound, preferably a metabolite, that is differentially present (i.e., increased or decreased) in a biological sample from a subject or a group of subjects having a first phenotype (e.g., having a disease) as compared to a biological sample from a subject or group of subjects having a second phenotype (e.g., not having the disease). A biomarker may be differentially present at any level, but is generally present at a level that is increased by at least 5%, by at least 10%, by at least 15%, by at least 20%, by at least 25%, by at least 30%, by at least 35%, by at least 40%, by at least 45%, by at least 50%, by at least 55%, by at least 60%, by at least 65%, by at least 70%, by at least 75%, by at least 80%, by at least 85%, by at least 90%, by at least 95%, by at least 100%, by at least 110%, by at least 120%, by at least 130%, by at least 140%, by at least 150%, or more; or is generally present at a level that is decreased by at least 5%, by at least 10%, by at least 15%, by at least 20%, by at least 25%, by at least 30%, by at least 35%, by at least 40%, by at least 45%, by at least 50%, by at least 55%, by at least 60%, by at least 65%, by at least 70%, by at least 75%, by at least 80%, by at least 85%, by at least 90%, by at least 95%, or by 100% (i.e., absent). A biomarker is preferably differentially present at a level that is statistically significant (i.e., a p-value less than 0.05 and/or a q-value of less than 0.10 as determined using either Welch's T-test or Wilcoxon's rank-sum Test).
The “level” of one or more biomarkers means the absolute or relative amount or concentration of the biomarker in the sample.
“Sample” or “biological sample” means biological material isolated from a subject. The biological sample may contain any biological material suitable for detecting the desired biomarkers, and may comprise cellular and/or non-cellular material from the subject. The sample can be isolated from any suitable biological tissue or fluid such as, for example, kidney tissue, blood, blood plasma, urine, or cerebral spinal fluid (CSF).
“Subject” means any animal, but is preferably a mammal, such as, for example, a human, monkey, mouse, rabbit or rat.
A “reference level” of a biomarker means a level of the biomarker that is indicative of a particular disease state, phenotype, or lack thereof, as well as combinations of disease states, phenotypes, or lack thereof. A “positive” reference level of a biomarker means a level that is indicative of a particular disease state or phenotype. A “negative” reference level of a biomarker means a level that is indicative of a lack of a particular disease state or phenotype. For example, a “kidney cancer-positive reference level” of a biomarker means a level of a biomarker that is indicative of a positive diagnosis of kidney cancer in a subject, and a “kidney cancer-negative reference level” of a biomarker means a level of a biomarker that is indicative of a negative diagnosis of kidney cancer in a subject. A “reference level” of a biomarker may be an absolute or relative amount or concentration of the biomarker, a presence or absence of the biomarker, a range of amount or concentration of the biomarker, a minimum and/or maximum amount or concentration of the biomarker, a mean amount or concentration of the biomarker, and/or a median amount or concentration of the biomarker; and, in addition, “reference levels” of combinations of biomarkers may also be ratios of absolute or relative amounts or concentrations of two or more biomarkers with respect to each other. Appropriate positive and negative reference levels of biomarkers for a particular disease state, phenotype, or lack thereof may be determined by measuring levels of desired biomarkers in one or more appropriate subjects, and such reference levels may be tailored to specific populations of subjects (e.g., a reference level may be age-matched so that comparisons may be made between biomarker levels in samples from subjects of a certain age and reference levels for a particular disease state, phenotype, or lack thereof in a certain age group). Such reference levels may also be tailored to specific techniques that are used to measure levels of biomarkers in biological samples (e.g., LC-MS, GC-MS, etc.), where the levels of biomarkers may differ based on the specific technique that is used.
“Non-biomarker compound” means a compound that is not differentially present in a biological sample from a subject or a group of subjects having a first phenotype (e.g., having a first disease) as compared to a biological sample from a subject or group of subjects having a second phenotype (e.g., not having the first disease). Such non-biomarker compounds may, however, be biomarkers in a biological sample from a subject or a group of subjects having a third phenotype (e.g., having a second disease) as compared to the first phenotype (e.g., having the first disease) or the second phenotype (e.g., not having the first disease).
“Metabolite”, or “small molecule”, means organic and inorganic molecules which are present in a cell. The term does not include large macromolecules, such as large proteins (e.g., proteins with molecular weights over 2,000, 3,000, 4,000, 5,000, 6,000, 7,000, 8,000, 9,000, or 10,000), large nucleic acids (e.g., nucleic acids with molecular weights of over 2,000, 3,000, 4,000, 5,000, 6,000, 7,000, 8,000, 9,000, or 10,000), or large polysaccharides (e.g., polysaccharides with a molecular weights of over 2,000, 3,000, 4,000, 5,000, 6,000, 7,000, 8,000, 9,000, or 10,000). The small molecules of the cell are generally found free in solution in the cytoplasm or in other organelles, such as the mitochondria, where they form a pool of intermediates which can be metabolized further or used to generate large molecules, called macromolecules. The term “small molecules” includes signaling molecules and intermediates in the chemical reactions that transform energy derived from food into usable forms. Examples of small molecules include sugars, fatty acids, amino acids, nucleotides, intermediates formed during cellular processes, and other small molecules found within the cell.
“Metabolic profile”, or “small molecule profile”, means a complete or partial inventory of small molecules within a targeted cell, tissue, organ, organism, or fraction thereof (e.g., cellular compartment). The inventory may include the quantity and/or type of small molecules present. The “small molecule profile” may be determined using a single technique or multiple different techniques.
“Metabolome” means all of the small molecules present in a given organism.
“Kidney cancer” refers to a disease in which cancer develops in the kidney.
“Urological Cancer” refers to a disease in which cancer develops in the bladder, kidney and/or prostate.
“Staging” of kidney cancer refers to an indication of the severity of kidney cancer including tumor size and whether and/or how far the kidney tumor has spread. The tumor stage is a criteria used to select treatment options and to estimate a patient's prognosis. Kidney tumor stages range from T1 (tumor 7 cm or less in size and limited to kidney, least advanced) to T4 (tumor invades beyond Gerota's fascia, most advanced). “Low stage” or “lower stage” kidney cancer refers to kidney cancer tumors, including malignant tumors with a lower potential for recurrence, progression, invasion and/or metastasis (less advanced). Kidney tumors of stage T1 or T2 are considered “low stage”. “High stage” or “higher stage” kidney cancer refers to a kidney cancer tumor in a subject that is more likely to recur and/or progress and/or invade beyond the kidney, including malignant tumors with higher potential for metastasis (more advanced). Kidney tumors of stage T3 or T4 are considered “high stage”.
“Grade” of kidney cancer refers to the appearance and/or structure of kidney cancer cellular nuclei. “Low grade” kidney cancer refers to a cancer with cellular nuclear characteristics more closely resembling normal cellular nuclei. “High grade” kidney cancer refers to a cancer with cellular nuclear characteristics less closely resembling normal cellular nuclei.
“Aggressiveness” of kidney cancer or a cancer-positive small renal mass refers to a combination of the stage, grade, and metastatic potential of a kidney tumor. “More aggressive” kidney cancer refers to tumors of higher stage, grade, and/or metastatic potential. Cancer tumors that are not confined to the kidney are considered to be more aggressive kidney cancer. “Less aggressive” kidney cancer refers to tumors of lower stage, grade, and/or metastatic potential. Cancer tumors that are confined to the kidney are considered to be less aggressive kidney cancer.
“Small renal mass (SRM)” refers to a kidney lesion that may be detected incidentally during an examination but is usually not yet associated with symptoms of kidney cancer. The SRM may be benign (cancer-negative) or may be a cancer tumor (cancer-positive). A cancer-positive SRM may be an indolent tumor (low stage/less aggressive) or may be a high stage, aggressive tumor.
“RCC Score” is a measure or indicator of kidney cancer severity, which is based on the kidney cancer biomarkers and algorithms described herein. An RCC Score will enable a physician to place a patient on a spectrum of kidney cancer severity from normal (i.e., no kidney cancer) to high (e.g., high stage or more aggressive kidney cancer). One of ordinary skill in the art will understand that the RCC Score can have multiple uses in the diagnosis and treatment of kidney cancer. For example, an RCC Score may also be used to distinguish less aggressive kidney cancer from more aggressive kidney cancer, to distinguish low grade kidney cancer from high grade kidney cancer, and to monitor the progression and/or regression of kidney cancer.
The kidney cancer biomarkers described herein were discovered using metabolomic profiling techniques. Such metabolomic profiling techniques are described in more detail in the Examples set forth below as well as in U.S. Pat. Nos. 7,005,255, 7,329,489; 7,550,258; 7,550,260; 7,553,616; 7,635,556; 7,682,783; 7,682,784; 7,910,301; 6,947,453; 7,433,787; 7,561,975; 7,884,318, the entire contents of which are hereby incorporated herein by reference.
Generally, metabolic profiles were determined for biological samples from human subjects that were positive for kidney cancer (RCC) or samples from human subjects that were cancer negative (non-cancer). The metabolic profile for biological samples positive for kidney cancer was compared to the metabolic profile for biological samples negative for kidney cancer. Those small molecules differentially present, including those small molecules differentially present at a level that is statistically significant, in the metabolic profile of samples positive for kidney cancer as compared to another group (e.g., non-cancer samples) were identified as biomarkers to distinguish those groups.
The biomarkers are discussed in more detail herein. The biomarkers that were discovered correspond with biomarkers for distinguishing samples positive for kidney cancer (RCC) vs. cancer-negative samples (see Tables 1, 2, 4 and/or 11).
Metabolic profiles were also determined for biological samples from human subjects diagnosed with high stage kidney cancer or human subjects diagnosed with low stage kidney cancer. The metabolic profile for biological samples from a subject having high stage kidney cancer was compared to the metabolic profile for biological samples from subjects with low stage kidney cancer. Those small molecules differentially present, including those small molecules differentially present at a level that is statistically significant, in the metabolic profile of samples from subjects with high stage kidney cancer as compared to another group (e.g., subjects not diagnosed with high stage kidney cancer) were identified as biomarkers to distinguish those groups.
The biomarkers are discussed in more detail herein. The biomarkers that were discovered correspond with biomarkers for distinguishing subjects having high stage kidney cancer vs. subjects having low stage kidney cancer (see Table 8).
Metabolic profiles were also determined for biological samples from human subjects diagnosed with more aggressive kidney cancer or human subjects diagnosed with less aggressive kidney cancer. The metabolic profile for biological samples from subjects having more aggressive kidney cancer were compared to the metabolic profile for biological samples from subjects having less aggressive kidney cancer. Those small molecules differentially present, including those small molecules differentially present at a level that is statistically significant, in the metabolic profile of samples from subjects with more aggressive kidney cancer as compared to another group (e.g., subjects not diagnosed with more aggressive kidney cancer) were identified as biomarkers to distinguish those groups.
The biomarkers are discussed in more detail herein. The biomarkers that were discovered correspond with biomarkers for distinguishing subjects having more aggressive kidney cancer vs. subjects having less aggressive kidney cancer (see Table 10).
A. Diagnosis of kidney cancer
The identification of biomarkers for kidney cancer allows for the diagnosis of (or for aiding in the diagnosis of) kidney cancer in subjects presenting with one or more symptoms consistent with the presence of kidney cancer and includes the initial diagnosis of kidney cancer in a subject not previously identified as having kidney cancer and diagnosis of recurrence of kidney cancer in a subject previously treated for kidney cancer. For example, an SRM may be detected in a subject during a medical examination making it necessary to determine if the SRM is cancer-positive or cancer-negative. A method of diagnosing (or aiding in diagnosing) whether a subject has kidney cancer comprises (1) analyzing a biological sample from a subject to determine the level(s) of one or more biomarkers of kidney cancer in the sample and (2) comparing the level(s) of the one or more biomarkers in the sample to kidney cancer-positive and/or kidney cancer-negative reference levels of the one or more biomarkers in order to diagnose (or aid in the diagnosis of) whether the subject has kidney cancer. The one or more biomarkers that are used are selected from Tables 1, 2, 4, and/or 11 and combinations thereof. When such a method is used to aid in the diagnosis of kidney cancer, the results of the method may be used along with other methods (or the results thereof) useful in the clinical determination of whether a subject has kidney cancer.
Any suitable method may be used to analyze the biological sample in order to determine the level(s) of the one or more biomarkers in the sample. Suitable methods include chromatography (e.g., HPLC, gas chromatography, liquid chromatography), mass spectrometry (e.g., MS, MS-MS), enzyme-linked immunosorbent assay (ELISA), antibody linkage, other immunochemical techniques, and combinations thereof. Further, the level(s) of the one or more biomarkers may be measured indirectly, for example, by using an assay that measures the level of a compound (or compounds) that correlates with the level of the biomarker(s) that are desired to be measured.
The levels of one or more of the biomarkers of Tables 1, 2, 4, and/or 11 may be determined in the methods of diagnosing and methods of aiding in diagnosing whether a subject has kidney cancer. For example, one or more of the following biomarkers may be used alone or in combination to diagnose or aid in diagnosing kidney cancer: oxidized glutathione (GSSG), proline, 2-oleoylglycerophosphoethanolamine, 2-aminobutyrate, sphingosine, 3-dehydrocarnitine, 2-docosahexaenoylglycerophosphocholine, 2-linoleoylglycerophosphocholine, phosphoethanolamine, glutamate, pyrophosphate (PPi), nicotinamide-adenine-dinucleotide (NAD+), 3-aminoisobutyrate, 2-arachidonoylglycerophosphoethanolamine, 2-arachidonoylglycerophosphocholine, 2-oleoylglycerophosphocholine, glycerate, choline-phosphate, pyruvate, 1-arachidonoylglycerophosphoethanolamine, adenine, 1-2-propanediol, 2-docosahexaenoylglycerophosphoethanolamine, 2-hydroxybutyrate (AHB), creatine, glycolate (hydroxyacetate), malate, 5-methylthioadenosine (MTA), stearolycarnitine, 1-arachidonoylglycerophosphoinositol, arachidonate, mannose-6-phosphate, alpha-tocopherol, flavin adenine dinucleotide (FAD), fructose-6-phosphate, maltose, maltotriose, fructose 1-phosphate, maltotetraose, 1-stearoylglycerophosphoinositol, methyl-alpha-glucopyranoside, glucose-6-phosphate (G6P), eicosenoate, 1-stearoylglycerophosphoethanolamine, 1-palmitoylglycerophosphoinositol, 1-oleoylglycerophosphoethanolamine, 1-palmitoylglycerophosphoethanolamine, 2-palmitoylglycerophosphoethanolamine, 1-oleoylglycerophosphoinositol, gamma-glutamylglutamate, ergothioneine, arabitol, 1-palmitoylplasmenylethanolamine, N-acetylneuraminate, malonylcarnitine, 2-hydroxyglutarate, beta-alanine, pantothenate, citrate, kynurenine, N1-methyladenosine, hippurate, glucose, N-acetylaspartate (NAA), N1-methylguanosine, pseudouridine, phenylacetylglutamine, N2-methylguanosine, 2-methylbutyrylcarnitine (C5), N-acetyl-aspartyl-glutamate (NAAG), N6-acetyllysine, dimethylarginine (SDMA+ADMA), methyl-4-hydroxybenzoate, catechol-sulfate, glycerol, 2-hydroxyhippurate (salicylurate), N(2)-furoyl-glycine, 3-hydroxyphenylacetate, gulono 1,4-lactone, 2-isopropylmalate, 2-3-dihydroxyisovalerate, 1-2-propanediol, gluconate, cinnamoylglycine, phenylacetylglycine, sorbose, sucrose, adenosine 5′-monophosphate (AMP), hexanoylglycine, methyl-indole-3-acetate, 3-hydroxyhippurate, N6-methyladenosine, 4-hydroxy-2-oxoglutaric acid, alpha-CEHC-glucuronide, phenylpropinylglycine, vanillate, ethanolamine, galactose, adipate, 2-oxindole-3-acetate, 1, 3-7-trimethylurate, and 3-4-dihydroxyphenylacetate. Additionally, for example, the level(s) of one biomarker, two or more biomarkers, three or more biomarkers, four or more biomarkers, five or more biomarkers, six or more biomarkers, seven or more biomarkers, eight or more biomarkers, nine or more biomarkers, ten or more biomarkers, etc., including a combination of all of the biomarkers in Tables 1, 2, 4, and/or 11 and combinations thereof or any fraction thereof, may be determined and used in such methods. Determining levels of combinations of the biomarkers may allow greater sensitivity and specificity in diagnosing kidney cancer and aiding in the diagnosis of kidney cancer. For example, ratios of the levels of certain biomarkers (and non-biomarker compounds) in biological samples may allow greater sensitivity and specificity in diagnosing kidney cancer and aiding in the diagnosis of kidney cancer.
After the level(s) of the one or more biomarkers in the sample are determined, the level(s) are compared to kidney cancer-positive and/or kidney cancer-negative reference levels to aid in diagnosing or to diagnose whether the subject has kidney cancer. Levels of the one or more biomarkers in a sample matching the kidney cancer-positive reference levels (e.g., levels that are the same as the reference levels, substantially the same as the reference levels, above and/or below the minimum and/or maximum of the reference levels, and/or within the range of the reference levels) are indicative of a diagnosis of kidney cancer in the subject. Levels of the one or more biomarkers in a sample matching the kidney cancer-negative reference levels (e.g., levels that are the same as the reference levels, substantially the same as the reference levels, above and/or below the minimum and/or maximum of the reference levels, and/or within the range of the reference levels) are indicative of a diagnosis of no kidney cancer in the subject. In addition, levels of the one or more biomarkers that are differentially present (especially at a level that is statistically significant) in the sample as compared to kidney cancer-negative reference levels are indicative of a diagnosis of kidney cancer in the subject. Levels of the one or more biomarkers that are differentially present (especially at a level that is statistically significant) in the sample as compared to kidney cancer-positive reference levels are indicative of a diagnosis of no kidney cancer in the subject.
The level(s) of the one or more biomarkers may be compared to kidney cancer-positive and/or kidney cancer-negative reference levels using various techniques, including a simple comparison (e.g., a manual comparison) of the level(s) of the one or more biomarkers in the biological sample to kidney cancer-positive and/or kidney cancer-negative reference levels. The level(s) of the one or more biomarkers in the biological sample may also be compared to kidney cancer-positive and/or kidney cancer-negative reference levels using one or more statistical analyses (e.g., t-test, Welch's T-test, Wilcoxon's rank sum test, Random Forest, T-score, Z-score) or using a mathematical model (e.g., algorithm, statistical model).
For example, a mathematical model comprising a single algorithm or multiple algorithms may be used to determine whether a subject has kidney cancer. A mathematical model may also be used to distinguish between kidney cancer stages. An exemplary mathematical model may use the measured levels of any number of biomarkers (for example, 2, 3, 5, 7, 9, etc.) from a subject to determine, using an algorithm or a series of algorithms based on mathematical relationships between the levels of the measured biomarkers, whether a subject has kidney cancer, whether kidney cancer is progressing or regressing in a subject, whether a subject has high stage or low stage kidney cancer, whether a subject has more aggressive or less aggressive kidney cancer, etc.
The results of the method may be used along with other methods (or the results thereof) useful in the diagnosis of kidney cancer in a subject.
In one aspect, the biomarkers provided herein can be used to provide a physician with an RCC Score indicating the existence and/or severity of kidney cancer in a subject. The score is based upon clinically significantly changed reference level(s) for a biomarker and/or combination of biomarkers. The reference level can be derived from an algorithm. The RCC Score can be used to place the subject in a severity range of kidney cancer from normal (i.e. no kidney cancer) to high. The RCC Score can be used in multiple ways: for example, disease progression, regression, or remission can be monitored by periodic determination and monitoring of the RCC Score; response to therapeutic intervention can be determined by monitoring the RCC Score; and drug efficacy can be evaluated using the RCC Score.
Methods for determining a subject's RCC Score may be performed using one or more of the kidney cancer biomarkers identified in Tables 1, 2, 4 and/or 11 in a biological sample. The method may comprise comparing the level(s) of the one or more kidney cancer biomarkers in the sample to kidney cancer reference levels of the one or more biomarkers in order to determine the subject's RCC score. The method may employ any number of markers selected from those listed in Table 1, 2, 4 and/or 11, including 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more markers. Multiple biomarkers may be correlated with kidney cancer, by any method, including statistical methods such as regression analysis.
After the level(s) of the one or more biomarker(s) is determined, the level(s) may be compared to kidney cancer reference level(s) or reference curves of the one or more biomarker(s) to determine a rating for each of the one or more biomarker(s) in the sample. The rating(s) may be aggregated using any algorithm to create a score, for example, an RCC score, for the subject. The algorithm may take into account any factors relating to kidney cancer including the number of biomarkers, the correlation of the biomarkers to kidney cancer, etc.
In an embodiment, a mathematical model or formula containing one or more biomarkers as variables is established using regression analysis, e.g., multiple linear regressions. By way of non-limiting example, the developed formulas may include the following:
A+B(Biomarker1)+C(Biomarker2)+D(Biomarker3)+E(Biomarker4)=RScore
A+B*ln(Biomarker1)+C*ln(Biomarker2)+D*ln(Biomarker3)+E*ln(Biomarker4)=ln RScore
wherein A, B, C, D, E are constant numbers; Biomarker1, Biomarker2, Biomarker3, Biomarker4 are the measured values of the analyte (Biomarker) and RScore is the measure of cancer presence or absence or cancer aggressivity.
The formulas may include one or more biomarkers as variables, such as 1, 2, 3, 4, 5, 10, 15, 20 or more biomarkers.
Additionally, in one embodiment, the biomarkers provided herein to diagnose or aid in the diagnosis of kidney cancer may be used to distinguish kidney cancer from other urological cancers. A method of distinguishing kidney cancer from other urological cancers in a subject comprises (1) analyzing a biological sample from a subject to determine the level(s) of one or more biomarkers of kidney cancer in the sample and (2) comparing the level(s) of the one or more biomarkers in the sample to kidney cancer-positive and/or kidney cancer-negative reference levels of the one or more biomarkers in order to distinguish kidney cancer from other urological cancers. The one or more biomarkers that are used are selected from Table 11. For example, one or more of the following biomarkers may be used alone or in any combination to distinguish kidney cancer from other urological cancers: gluconate, 1,2-propanediol, galactose, gulono 1,4-lactone, orotidine, quinate, 1, 3-7-trimethylurate, guanine, phenylacetylglutamine, mannitol, 2-oxindole-3-acetate, 1,3-aminopropyl-2-pyrrolidone, 1,3-dimethylurate, glucuronate-galacturonate-5-keto-gluconate, glycocholate, azelate (nonanedioate), N-acetylthreonine, 7-ketodeoxycholate, 3-sialyllactose, isovalerylcarnitine, cholate, adenosine 5′ monophosphate (AMP), 2-hydroxyisobutyrate, 4-hydroxyhippurate, pipecolate, N-acetylphenylalanine, 12-dehydrocholate, alpha-ketoglutarate, sulforaphane, 3-indoxyl-sulfate, methyl-indole-3-acetate, methyl-4-hydroxybenzoate, lactate, N(2)-furoyl-glycine, N6-methyladenosine, gamma-CEHC, glycerol, 2-3-butanediol, palmitoyl-sphingomyelin, succinate, 4-hydroxyphenylacetate, caffeate, imidazole-pripionate, beta-alanine, 4-androsten-3beta-17beta-diol-disulfate-2,5-methylthioadenosine (MTA), N2-acetyllysine, sucrose, phenylacetylglycine, 4-androsten-3beta-17beta-diol-disulfate-1, cyclo-gly-pro, N-methyl-proline, catechol-sulfate, serine, vanillate, threonine, and 21-hydroxypregnenolone-disulfate. When such a method is used to distinguish kidney cancer from other urological cancers, the results of the method may be used along with other methods (or the results thereof) useful in the clinical determination of distinguishing kidney cancer from other urological cancers.
B. Methods of Monitoring Progression/Regression of Kidney Cancer
The identification of biomarkers for kidney cancer also allows for monitoring progression/regression of kidney cancer in a subject. A method of monitoring the progression/regression of kidney cancer in a subject comprises (1) analyzing a first biological sample from a subject to determine the level(s) of one or more biomarkers for kidney cancer selected from Tables 1, 2, 4, 8, 10 and/or 11, the first sample obtained from the subject at a first time point, (2) analyzing a second biological sample from a subject to determine the level(s) of the one or more biomarkers, the second sample obtained from the subject at a second time point, and (3) comparing the level(s) of one or more biomarkers in the first sample to the level(s) of the one or more biomarkers in the second sample in order to monitor the progression/regression of kidney cancer in the subject. The results of the method are indicative of the course of kidney cancer (i.e., progression or regression, if any change) in the subject.
The levels of one or more of the biomarkers of Tables 1, 2, 4, 8, 10 and/or 11 may be determined in the methods of monitoring progression/regression of kidney cancer. For example, one or more of the following biomarkers may be used alone or in combination to monitor the progression/regression of kidney cancer: oxidized glutathione (GSSG), proline, 2-oleoylglycerophosphoethanolamine, 2-aminobutyrate, sphingosine, 3-dehydrocamitine, 2-docosahexaenoylglycerophosphocholine, 2-linoleoylglycerophosphocholine, phosphoethanolamine, glutamate, pyrophosphate (PPi), nicotinamide-adenine-dinucleotide (NAD+), 3-aminoisobutyrate, 2-arachidonoylglycerophosphoethanolamine, 2-arachidonoylglycerophosphocholine, 2-oleoylglycerophosphocholine, glycerate, choline-phosphate, pyruvate, 1-arachidonoylglycerophosphoethanolamine, adenine, 1-2-propanediol, 2-docosahexaenoylglycerophosphoethanolamine, 2-hydroxybutyrate (AHB), creatine, glycolate (hydroxyacetate), malate, 5-methylthioadenosine (MTA), stearolycamitine, 1-arachidonoylglycerophosphoinositol, arachidonate, mannose-6-phosphate, alpha-tocopherol, flavin adenine dinucleotide (FAD), fructose-6-phosphate, maltose, maltotriose, fructose 1-phosphate, maltotetraose, 1-stearoylglycerophosphoinositol, methyl-alpha-glucopyranoside, glucose-6-phosphate (G6P), eicosenoate, 1-stearoylglycerophosphoethanolamine, 1-palmitoylglycerophosphoinositol, 1-oleoylglycerophosphoethanolamine, 1-palmitoylglycerophosphoethanolamine, 2-palmitoylglycerophosphoethanolamine, 1-oleoylglycerophosphoinositol, gamma-glutamylglutamate, ergothioneine, arabitol, 1-palmitoylplasmenylethanolamine, N-acetylneuraminate, malonylcarnitine, 2-hydroxyglutarate, beta-alanine, pantothenate, citrate, kynurenine, N1-methyladenosine, hippurate, glucose, N-acetylaspartate (NAA), N1-methylguanosine, pseudouridine, phenylacetylglutamine, N2-methylguanosine, 2-methylbutyrylcarnitine (C5), N-acetyl-aspartyl-glutamate (NAAG), N6-acetyllysine, dimethylarginine (SDMA+ADMA), methyl-4-hydroxybenzoate, catechol-sulfate, glycerol, 2-hydroxyhippurate (salicylurate), N(2)-furoyl-glycine, 3-hydroxyphenylacetate, gulono 1,4-lactone, 2-isopropylmalate, 2-3-dihydroxyisovalerate, 1-2-propanediol, gluconate, cinnamoylglycine, phenylacetylglycine, sorbose, sucrose, adenosine 5′-monophosphate (AMP), hexanoylglycine, methyl-indole-3-acetate, 3-hydroxyhippurate, N6-methyladenosine, 4-hydroxy-2-oxoglutaric acid, alpha-CEHC-glucuronide, phenylpropinylglycine, vanillate, ethanolamine, galactose, adipate, 2-oxindole-3-acetate, 1, 3-7-trimethylurate, 3-4-dihydroxyphenylacetate, choline, pelargonate (9:0), arginine, gamma-glutamylleucine, xanthine, tyrosine, 5-oxoproline, inositiol-1-phosphate (I1P), isoleucine, 2-ethylhexanoate, leucine, laurate (12:0), phenylalanine, mannose, uracil, xanthosine, erythritol, guanosine-5-monophosphate-5 (GMP), homocysteine, lactate, 4-hydroxybutyrate (GHB), ribose, fucose, S-adenosylhomocysteine (SAH), mannitol, hypoxanthine, and threonine. Additionally, for example, the level(s) of one biomarker, two or more biomarkers, three or more biomarkers, four or more biomarkers, five or more biomarkers, six or more biomarkers, seven or more biomarkers, eight or more biomarkers, nine or more biomarkers, ten or more biomarkers, etc., including a combination of all of the biomarkers in Tables 1, 2, 4, 8, 10 and 11 or any fraction thereof, may be determined and used in methods of monitoring the progression/regression of kidney cancer in a subject.
The change (if any) in the level(s) of the one or more biomarkers over time may be indicative of progression or regression of kidney cancer in the subject. In order to characterize the course of kidney cancer in the subject, the level(s) of the one or more biomarkers in the first sample, the level(s) of the one or more biomarkers in the second sample, and/or the results of the comparison of the levels of the biomarkers in the first and second samples may be compared to kidney cancer-positive and kidney cancer-negative reference levels. If the comparisons indicate that the level(s) of the one or more biomarkers are increasing or decreasing over time (e.g., in the second sample as compared to the first sample) to become more similar to the kidney cancer-positive reference levels (or less similar to the kidney cancer-negative reference levels), then the results are indicative of kidney cancer progression. If the comparisons indicate that the level(s) of the one or more biomarkers are increasing or decreasing over time to become more similar to the kidney cancer-negative reference levels (or less similar to the kidney cancer-positive reference levels), then the results are indicative of kidney cancer regression.
In one embodiment, the assessment may be based on an RCC Score which is indicative of kidney cancer in the subject and which can be monitored over time. By comparing the RCC Score from a first time point sample to the RCC Score from at least a second time point sample the progression or regression of kidney cancer can be determined. Such a method of monitoring the progression/regression of kidney cancer in a subject comprises (1) analyzing a first biological sample from a subject to determine an RCC score for the first sample obtained from the subject at a first time point, (2) analyzing a second biological sample from a subject to determine a second RCC score, the second sample obtained from the subject at a second time point, and (3) comparing the RCC score in the first sample to the RCC score in the second sample in order to monitor the progression/regression of kidney cancer in the subject.
The biomarkers and algorithms described herein may guide or assist a physician in deciding a treatment path, for example, whether to implement procedures such as surgical procedures (e.g., full or partial nephrectomy), treat with drug therapy, or employ a watchful waiting approach.
As with the other methods described herein, the comparisons made in the methods of monitoring progression/regression of kidney cancer in a subject may be carried out using various techniques, including simple comparisons, one or more statistical analyses, mathematical models (algorithms) and combinations thereof.
The results of the method may be used along with other methods (or the results thereof) useful in the clinical monitoring of progression/regression of kidney cancer in a subject.
As described above in connection with methods of diagnosing (or aiding in the diagnosis of) kidney cancer, any suitable method may be used to analyze the biological samples in order to determine the level(s) of the one or more biomarkers in the samples. In addition, the level(s) one or more biomarkers, including a combination of all of the biomarkers in Tables 1, 2, 4, 8, 10 and/or 11 or any fraction thereof, may be determined and used in methods of monitoring progression/regression of kidney cancer in a subject.
Such methods could be conducted to monitor the course of kidney cancer in subjects having kidney cancer or could be used in subjects not having kidney cancer (e.g., subjects suspected of being predisposed to developing kidney cancer) in order to monitor levels of predisposition to kidney cancer.
C. Methods of Staging Kidney Cancer
The identification of biomarkers for kidney cancer also allows for the determination of kidney cancer stage of a subject, including the cancer stage of a subject having a cancer-positive SRM. A method of determining the stage of kidney cancer comprises (1) analyzing a biological sample from a subject to determine the level(s) of one or more biomarkers listed in Table 8 in the sample and (2) comparing the level(s) of the one or more biomarkers in the sample to high stage kidney cancer and/or low stage kidney cancer reference levels of the one or more biomarkers in order to determine the stage of the subject's kidney cancer. The results of the method may be used along with other methods (or the results thereof) useful in the clinical determination of the stage of a subject's kidney cancer.
As described above in connection with methods of diagnosing (or aiding in the diagnosis of) kidney cancer, any suitable method may be used to analyze the biological sample in order to determine the level(s) of the one or more biomarkers in the sample.
The levels of one or more biomarkers listed in Table 8 and combinations thereof may be determined in the methods of determining the stage of a subject's kidney cancer. For example, one or more of the following biomarkers may be used alone or in combination to determine the stage of kidney cancer: choline, pelargonate (9:0), arginine, gamma-glutamylleucine, xanthine, tyrosine, 5-oxoproline, inositiol-1-phosphate (HP), N2-methylguanosine, isoleucine, 2-ethylhexanoate, leucine, adenine, 5-methylthioadenosine (MTA), laurate (12:0), phenylalanine, mannose, uracil, xanthosine, erythritol, guanosine-5-monophosphate-5 (GMP), homocysteine, lactate, 4-hydroxybutyrate (GHB), ribose, fucose, S-adenosylhomocysteine (SAH), mannitol, hypoxanthine, and threonine. Additionally, for example, the level(s) of one biomarker, two or more biomarkers, three or more biomarkers, four or more biomarkers, five or more biomarkers, six or more biomarkers, seven or more biomarkers, eight or more biomarkers, nine or more biomarkers, ten or more biomarkers, etc., including a combination of all of the biomarkers in Table 8 or any fraction thereof, may be determined and used in methods of determining the stage of kidney cancer of a subject.
After the level(s) of the one or more biomarkers in a sample are determined, the level(s) are compared to low stage kidney cancer and/or high stage kidney cancer reference levels in order to predict the stage of kidney cancer of a subject. Levels of the one or more biomarkers in a sample matching the high stage kidney cancer reference levels (e.g., levels that are the same as the reference levels, substantially the same as the reference levels, above and/or below the minimum and/or maximum of the reference levels, and/or within the range of the reference levels) are indicative of the subject having high stage kidney cancer. Levels of the one or more biomarkers in a sample matching the low stage kidney cancer reference levels (e.g., levels that are the same as the reference levels, substantially the same as the reference levels, above and/or below the minimum and/or maximum of the reference levels, and/or within the range of the reference levels) are indicative of the subject having low stage kidney cancer. In addition, levels of the one or more biomarkers that are differentially present (especially at a level that is statistically significant) in the sample as compared to low stage kidney cancer reference levels are indicative of the subject not having low stage kidney cancer. Levels of the one or more biomarkers that are differentially present (especially at a level that is statistically significant) in the sample as compared to high stage kidney cancer reference levels are indicative of the subject not having high stage kidney cancer.
Studies were carried out to identify a set of biomarkers that can be used to determine the kidney cancer stage of a subject. In another embodiment, the biomarkers provided herein can be used to provide a physician with an RCC Score indicating the stage of kidney cancer in a subject. The score is based upon clinically significantly changed reference level(s) for a biomarker and/or combination of biomarkers. The reference level can be derived from an algorithm. The RCC Score can be used to determine the stage of kidney cancer in a subject from normal (i.e. no kidney cancer) to high stage kidney cancer.
The biomarkers and algorithms described herein may guide or assist a physician in deciding a treatment path, for example, whether to implement procedures such as surgical procedures (e.g., full or partial nephrectomy), treat with drug therapy, or employ a watchful waiting approach.
As with the methods described above, the level(s) of the one or more biomarkers may be compared to high stage kidney cancer and/or low stage kidney cancer reference levels using various techniques, including a simple comparison, one or more statistical analyses, and combinations thereof.
As with the methods of diagnosing (or aiding in diagnosing) whether a subject has kidney cancer, the methods of determining the stage of kidney cancer of a subject may further comprise analyzing the biological sample to determine the level(s) of one or more non-biomarker compounds.
D. Methods of Distinguishing Less Aggressive Kidney Cancer from More aggressive Kidney Cancer
The identification of biomarkers for kidney cancer also allows for the identification of biomarkers for distinguishing less aggressive kidney cancer from more aggressive kidney cancer, including distinguishing less aggressive cancer-positive SRMs from more aggressive cancer-positive SRMs. A method of distinguishing less aggressive kidney cancer from more aggressive kidney cancer in a subject having kidney cancer comprises (1) analyzing a biological sample from a subject to determine the level(s) of one or more biomarkers listed in Table 10 in the sample and (2) comparing the level(s) of the one or more biomarkers in the sample to less aggressive kidney cancer and/or more aggressive kidney cancer reference levels of the one or more biomarkers in order to determine the aggressiveness of the subject's kidney cancer. The results of the method may be used along with other methods (or the results thereof) useful in the clinical determination of the aggressiveness of a subject's kidney cancer.
As described above in connection with methods of diagnosing (or aiding in the diagnosis of) kidney cancer, any suitable method may be used to analyze the biological sample in order to determine the level(s) of the one or more biomarkers in the sample.
The levels of one or more biomarkers listed in Tables 4 and/or 10 may be determined in the methods of determining the aggressiveness of a subject's kidney cancer. For example, one or more of the following biomarkers may be used alone or in combination to determine the aggressiveness of a subject's kidney cancer:pelargonate (9:0), laurate (12:0), homocysteine, 2′-deoxyinosine, S-adenosylmethionine (SAM), glycylthreonine, aspartylphenylalanine, phenylalanylglycine, cytidine 5′-diphosphocholine, alanylglycine, lysylmethionine, glycylisoleucine, ribose, aspartylleucine, 2-ethylhexanoate, asparagine, homoserine, 2′-deoxyguanosine, valerylcarnitine, 4-hydroxybutyrate (GHB), caprate (10:0), galactose, heme, butyrylcarnitine, choline, isoleucine, mannitol, fucose, tyrosine, xanthine, 5-oxoproline, 5-methylthioadenosine (MTA), phenylalanine, leucine, threonate, gamma-glutamylleucine, benzoate, proline, methionine, glycylproline, N2-methylguanosine, adenine, 2-methylbutyroylcarnitine, S-adenosylhomocysteine (SAH), citrate, xanthosine, 5,6-dihydrouracil, threonine, valine, and pantothenate. Additionally, for example, as with the methods of diagnosing (or aiding in the diagnosis of) kidney cancer described above, the level(s) of one biomarker, two or more biomarkers, three or more biomarkers, four or more biomarkers, five or more biomarkers, six or more biomarkers, seven or more biomarkers, eight or more biomarkers, nine or more biomarkers, ten or more biomarkers, etc., including a combination of all of the biomarkers in Tables 4 and 10 or any fraction thereof, may be determined and used in methods of determining the aggressiveness of kidney cancer of a subject.
After the level(s) of the one or more biomarkers in the sample are determined, the level(s) are compared to less aggressive kidney cancer and/or more aggressive kidney cancer reference levels in order to determine the aggressiveness of kidney cancer of a subject. Levels of the one or more biomarkers in a sample matching the more aggressive kidney cancer reference levels (e.g., levels that are the same as the reference levels, substantially the same as the reference levels, above and/or below the minimum and/or maximum of the reference levels, and/or within the range of the reference levels) are indicative of the subject having more aggressive kidney cancer. Levels of the one or more biomarkers in a sample matching the less aggressive kidney cancer reference levels (e.g., levels that are the same as the reference levels, substantially the same as the reference levels, above and/or below the minimum and/or maximum of the reference levels, and/or within the range of the reference levels) are indicative of the subject having less aggressive kidney cancer. In addition, levels of the one or more biomarkers that are differentially present (especially at a level that is statistically significant) in the sample as compared to less aggressive kidney cancer reference levels are indicative of the subject not having less aggressive kidney cancer. Levels of the one or more biomarkers that are differentially present (especially at a level that is statistically significant) in the sample as compared to more aggressive kidney cancer reference levels are indicative of the subject not having more aggressive kidney cancer.
Studies were carried out to identify a set of biomarkers that can be used to distinguish less aggressive kidney cancer from more aggressive kidney cancer. In another embodiment, the biomarkers provided herein can be used to provide a physician with an RCC Score indicating the aggressiveness of kidney cancer in a subject. The score is based upon clinically significantly changed reference level(s) for a biomarker and/or combination of biomarkers. The reference level can be derived from an algorithm. The RCC Score can be used to determine the aggressiveness of kidney cancer in a subject from normal (i.e. no kidney cancer) to more aggressive kidney cancer.
The biomarkers and algorithms described herein may guide or assist a physician in deciding a treatment path, for example, whether to implement procedures such as surgical procedures (e.g., full or partial nephrectomy), treat with drug therapy, or employ a watchful waiting approach.
As with the methods described above, the level(s) of the one or more biomarkers may be compared to more aggressive kidney cancer and/or less aggressive kidney cancer reference levels using various techniques, including a simple comparison, one or more statistical analyses, and combinations thereof.
As with the methods of diagnosing (or aiding in diagnosing) whether a subject has kidney cancer, the methods of determining the aggressiveness of kidney cancer of a subject may further comprise analyzing the biological sample to determine the level(s) of one or more non-biomarker compounds.
E. Methods of Determining Whether a Small Renal Mass (SRM) is Cancerous
The identification of biomarkers for kidney cancer also allows for the determination of whether a subject discovered as having an SRM has a benign SRM or an SRM that is cancerous. A method of determining the cancer status of an SRM comprises (1) analyzing a biological sample from a subject to determine the level(s) of one or more biomarkers listed in Tables 1, 2, 4, 8, 10, and/or 11 in the sample and (2) comparing the level(s) of the one or more biomarkers in the sample to kidney cancer-positive and/or kidney cancer-negative reference levels of the one or more biomarkers in order to determine the cancer status of the subject's SRM. The results of the method may be used along with other methods (or the results thereof) useful in the clinical determination of the cancer status of a subject's SRM.
As described above in connection with methods of diagnosing (or aiding in the diagnosis of) kidney cancer, any suitable method may be used to analyze the biological sample in order to determine the level(s) of the one or more biomarkers in the sample.
As with the methods of diagnosing (or aiding in the diagnosis of) kidney cancer described above, the level(s) of one or more of the biomarkers in Tables 1, 2, 4, 8, 10 and/or 11 may be determined in the methods of determining the cancer status of an SRM. For example, one or more of the following biomarkers may be used alone or in combination to determine the cancer status of a subject's SRM: oxidized glutathione (GSSG), proline, 2-oleoylglycerophosphoethanolamine, 2-aminobutyrate, sphingosine, 3-dehydrocarnitine, 2-docosahexaenoylglycerophosphocholine, 2-linoleoylglycerophosphocholine, phosphoethanolamine, glutamate, pyrophosphate (PPi), nicotinamide-adenine-dinucleotide (NAD+), 3-aminoisobutyrate, 2-arachidonoylglycerophosphoethanolamine, 2-arachidonoylglycerophosphocholine, 2-oleoylglycerophosphocholine, glycerate, choline-phosphate, pyruvate, 1-arachidonoylglycerophosphoethanolamine, adenine, 1-2-propanediol, 2-docosahexaenoylglycerophosphoethanolamine, 2-hydroxybutyrate (AHB), creatine, glycolate (hydroxyacetate), malate, 5-methylthioadenosine (MTA), stearolycamitine, 1-arachidonoylglycerophosphoinositol, arachidonate, mannose-6-phosphate, alpha-tocopherol, flavin adenine dinucleotide (FAD), fructose-6-phosphate, maltose, maltotriose, fructose 1-phosphate, maltotetraose, 1-stearoylglycerophosphoinositol, methyl-alpha-glucopyranoside, glucose-6-phosphate (G6P), eicosenoate, 1-stearoylglycerophosphoethanolamine, 1-palmitoylglycerophosphoinositol, 1-ol eoylglycerophosphoethanolamine, 1-palmitoylglycerophosphoethanolamine, 2-palmitoylglycerophosphoethanolamine, 1-oleoylglycerophosphoinositol, gamma-glutamylglutamate, ergothioneine, arabitol, 1-palmitoylplasmenylethanolamine, N-acetylneuraminate, malonylcarnitine, 2-hydroxyglutarate, beta-alanine, pantothenate, citrate, kynurenine, N1-methyladenosine, hippurate, glucose, N-acetylaspartate (NAA), N1-methylguanosine, pseudouridine, phenylacetylglutamine, N2-methylguanosine, 2-methylbutyrylcarnitine (C5), N-acetyl-aspartyl-glutamate (NAAG), N6-acetyllysine, dimethylarginine (SDMA+ADMA), methyl-4-hydroxybenzoate, catechol-sulfate, glycerol, 2-hydroxyhippurate (salicylurate), N(2)-furoyl-glycine, 3-hydroxyphenylacetate, gulono 1,4-lactone, 2-isopropylmalate, 2-3-dihydroxyisovalerate, 1-2-propanediol, gluconate, cinnamoylglycine, phenylacetylglycine, sorbose, sucrose, adenosine 5′-monophosphate (AMP), hexanoylglycine, methyl-indole-3-acetate, 3-hydroxyhippurate, N6-methyladenosine, 4-hydroxy-2-oxoglutaric acid, alpha-CEHC-glucuronide, phenylpropinylglycine, vanillate, ethanolamine, galactose, adipate, 2-oxindole-3-acetate, 1, 3-7-trimethylurate, and 3-4-dihydroxyphenylacetate. Additionally, for example, the level(s) of one biomarker, two or more biomarkers, three or more biomarkers, four or more biomarkers, five or more biomarkers, six or more biomarkers, seven or more biomarkers, eight or more biomarkers, nine or more biomarkers, ten or more biomarkers, etc., including a combination of all of the biomarkers in Tables 1, 2, 4, 8, 10, and/or 11 or any fraction thereof, may be determined and used in methods of determining the cancer status of a subject's SRM.
After the level(s) of the one or more biomarkers in a sample are determined, the level(s) are compared to kidney cancer-positive and/or kidney cancer-negative reference levels in order to determine the cancer status of a subject's SRM. Levels of the one or more biomarkers in a sample matching the kidney cancer-positive reference levels (e.g., levels that are the same as the reference levels, substantially the same as the reference levels, above and/or below the minimum and/or maximum of the reference levels, and/or within the range of the reference levels) are indicative of the subject having a cancer-positive SRM. Levels of the one or more biomarkers in a sample matching the kidney cancer-negative reference levels (e.g., levels that are the same as the reference levels, substantially the same as the reference levels, above and/or below the minimum and/or maximum of the reference levels, and/or within the range of the reference levels) are indicative of the subject having a cancer-negative SRM. In addition, levels of the one or more biomarkers that are differentially present (especially at a level that is statistically significant) in the sample as compared to kidney cancer-negative reference levels are indicative of a diagnosis of a cancer-positive SRM. Levels of the one or more biomarkers that are differentially present (especially at a level that is statistically significant) in the sample as compared to kidney cancer-positive reference levels are indicative of the subject not having a cancer-positive SRM.
As with the methods described above, the level(s) of the one or more biomarkers may be compared to kidney cancer-positive and/or kidney cancer-negative reference levels using various techniques, including a simple comparison, one or more statistical analyses, and combinations thereof. An RCC Score may also be used in indicating the existence and/or severity of cancer in a SRM.
As with the methods of diagnosing (or aiding in diagnosing) whether a subject has kidney cancer, the methods of assessing the cancer status of a SRM of a subject may further comprise analyzing the biological sample to determine the level(s) of one or more non-biomarker compounds.
F. Methods of Assessing Efficacy of Compositions for Treating Kidney Cancer
The identification of biomarkers for kidney cancer also allows for assessment of the efficacy of a composition for treating kidney cancer as well as the assessment of the relative efficacy of two or more compositions for treating kidney cancer. Such assessments may be used, for example, in efficacy studies as well as in lead selection of compositions for treating kidney cancer.
A method of assessing the efficacy of a composition for treating kidney cancer comprises (1) analyzing, from a subject having kidney cancer and currently or previously being treated with a composition, a biological sample to determine the level(s) of one or more biomarkers selected from Tables 1, 2, 4, 8, 10 and/or 11, and (2) comparing the level(s) of the one or more biomarkers in the sample to (a) level(s) of the one or more biomarkers in a previously-taken biological sample from the subject, wherein the previously-taken biological sample was obtained from the subject before being treated with the composition, (b) kidney cancer-positive reference levels of the one or more biomarkers, and (c) kidney cancer-negative reference levels of the one or more biomarkers. The results of the comparison are indicative of the efficacy of the composition for treating kidney cancer.
The levels of one or more of the biomarkers of Tables 1, 2, 4, 8, 10 and/or 11 may be determined in the methods of assessing the efficacy of a composition for treating kidney cancer. For example, one or more of the following biomarkers may be used alone or in combination to assess the efficacy of a composition for treating kidney cancer: oxidized glutathione (GSSG), proline, 2-oleoylglycerophosphoethanolamine, 2-aminobutyrate, sphingosine, 3-dehydrocarnitine, 2-docosahexaenoylglycerophosphocholine, 2-linoleoylglycerophosphocholine, phosphoethanolamine, glutamate, pyrophosphate (PPi), nicotinamide-adenine-dinucleotide (NAD+), 3-aminoisobutyrate, 2-arachidonoylglycerophosphoethanolamine, 2-arachidonoylglycerophosphocholine, 2-oleoylglycerophosphocholine, glycerate, choline-phosphate, pyruvate, 1-arachidonoylglycerophosphoethanolamine, adenine, 1-2-propanediol, 2-docosahexaenoylglycerophosphoethanolamine, 2-hydroxybutyrate (AHB), creatine, glycolate (hydroxyacetate), malate, 5-methylthioadenosine (MTA), stearolycarnitine, 1-arachidonoylglycerophosphoinositol, arachidonate, mannose-6-phosphate, alpha-tocopherol, flavin adenine dinucleotide (FAD), fructose-6-phosphate, maltose, maltotriose, fructose 1-phosphate, maltotetraose, 1-stearoylglycerophosphoinositol, methyl-alpha-glucopyranoside, glucose-6-phosphate (G6P), eicosenoate, 1-stearoylglycerophosphoethanolamine, 1-palmitoylglycerophosphoinositol, 1-oleoylglycerophosphoethanolamine, 1-palmitoylglycerophosphoethanolamine, 2-palmitoylglycerophosphoethanolamine, 1-oleoylglycerophosphoinositol, gamma-glutamylglutamate, ergothioneine, arabitol, 1-palmitoylplasmenylethanolamine, N-acetylneuraminate, malonylcarnitine, 2-hydroxyglutarate, beta-alanine, pantothenate, citrate, kynurenine, N1-methyladenosine, hippurate, glucose, N-acetylaspartate (NAA), N1-methylguanosine, pseudouridine, phenylacetylglutamine, N2-methylguanosine, 2-methylbutyrylcarnitine (C5), N-acetyl-aspartyl-glutamate (NAAG), N6-acetyllysine, dimethylarginine (SDMA+ADMA), methyl-4-hydroxybenzoate, catechol-sulfate, glycerol, 2-hydroxyhippurate (salicylurate), N(2)-furoyl-glycine, 3-hydroxyphenylacetate, gulono 1,4-lactone, 2-isopropylmalate, 2-3-dihydroxyisovalerate, 1-2-propanediol, gluconate, cinnamoylglycine, phenylacetylglycine, sorbose, sucrose, adenosine 5′-monophosphate (AMP), hexanoylglycine, methyl-indole-3-acetate, 3-hydroxyhippurate, N6-methyladenosine, 4-hydroxy-2-oxoglutaric acid, alpha-CEHC-glucuronide, phenylpropinylglycine, vanillate, ethanolamine, galactose, adipate, 2-oxindole-3-acetate, 1, 3-7-trimethylurate, 3-4-dihydroxyphenylacetate, choline, pelargonate (9:0), arginine, gamma-glutamylleucine, xanthine, tyrosine, 5-oxoproline, inositiol-1-phosphate (11P), isoleucine, 2-ethylhexanoate, leucine, laurate (12:0), phenylalanine, mannose, uracil, xanthosine, erythritol, guanosine-5-monophosphate-5 (GMP), homocysteine, lactate, 4-hydroxybutyrate (GHB), ribose, fucose, S-adenosylhomocysteine (SAH), mannitol, hypoxanthine, and threonine. Additionally, for example, the level(s) of one biomarker, two or more biomarkers, three or more biomarkers, four or more biomarkers, five or more biomarkers, six or more biomarkers, seven or more biomarkers, eight or more biomarkers, nine or more biomarkers, ten or more biomarkers, etc., including a combination of all of the biomarkers in Tables 1, 2, 4, 8, 10 and 11 or any fraction thereof, may be determined and used in methods of assessing the efficacy of a composition for treating kidney cancer.
Thus, in order to characterize the efficacy of the composition for treating kidney cancer, the level(s) of the one or more biomarkers in the biological sample are compared to (1) kidney cancer-positive reference levels, (2) kidney cancer-negative reference levels, and (3) previous levels of the one or more biomarkers in the subject before treatment with the composition.
When comparing the level(s) of the one or more biomarkers in the biological sample (from a subject having kidney cancer and currently or previously being treated with a composition) to kidney cancer-positive reference levels and/or kidney cancer-negative reference levels, level(s) in the sample matching the kidney cancer-negative reference levels (e.g., levels that are the same as the reference levels, substantially the same as the reference levels, above and/or below the minimum and/or maximum of the reference levels, and/or within the range of the reference levels) are indicative of the composition having efficacy for treating kidney cancer. Levels of the one or more biomarkers in the sample matching the kidney cancer-positive reference levels (e.g., levels that are the same as the reference levels, substantially the same as the reference levels, above and/or below the minimum and/or maximum of the reference levels, and/or within the range of the reference levels) are indicative of the composition not having efficacy for treating kidney cancer. The comparisons may also indicate degrees of efficacy for treating kidney cancer based on the level(s) of the one or more biomarkers.
When the level(s) of the one or more biomarkers in the biological sample (from a subject having kidney cancer and currently or previously being treated with a composition) are compared to level(s) of the one or more biomarkers in a previously-taken biological sample from the subject before treatment with the composition, any changes in the level(s) of the one or more biomarkers are indicative of the efficacy of the composition for treating kidney cancer. That is, if the comparisons indicate that the level(s) of the one or more biomarkers have increased or decreased after treatment with the composition to become more similar to the kidney cancer-negative reference levels (or less similar to the kidney cancer-positive reference levels), then the results are indicative of the composition having efficacy for treating kidney cancer. If the comparisons indicate that the level(s) of the one or more biomarkers have not increased or decreased after treatment with the composition to become more similar to the kidney cancer-negative reference levels (or less similar to the kidney cancer-positive reference levels), then the results are indicative of the composition not having efficacy for treating kidney cancer. The comparisons may also indicate degrees of efficacy for treating kidney cancer based on the amount of changes observed in the level(s) of the one or more biomarkers after treatment. In order to help characterize such a comparison, the changes in the level(s) of the one or more biomarkers, the level(s) of the one or more biomarkers before treatment, and/or the level(s) of the one or more biomarkers in the subject currently or previously being treated with the composition may be compared to kidney cancer-positive reference levels, and/or to kidney cancer-negative reference levels.
Another method for assessing the efficacy of a composition in treating kidney cancer comprises (1) analyzing a first biological sample from a subject to determine the level(s) of one or more biomarkers selected from Tables 1, 2, 4, 8, 10 and/or 11, the first sample obtained from the subject at a first time point, (2) administering the composition to the subject, (3) analyzing a second biological sample from a subject to determine the level(s) of the one or more biomarkers, the second sample obtained from the subject at a second time point after administration of the composition, and (4) comparing the level(s) of one or more biomarkers in the first sample to the level(s) of the one or more biomarkers in the second sample in order to assess the efficacy of the composition for treating kidney cancer. As indicated above, if the comparison of the samples indicates that the level(s) of the one or more biomarkers have increased or decreased after administration of the composition to become more similar to the kidney cancer-negative reference levels, then the results are indicative of the composition having efficacy for treating kidney cancer. If the comparisons indicate that the level(s) of the one or more biomarkers have not increased or decreased after treatment with the composition to become more similar to the kidney cancer-negative reference levels (or less similar to the kidney cancer-positive reference levels) then the results are indicative of the composition not having efficacy for treating kidney cancer. The comparison may also indicate a degree of efficacy for treating kidney cancer based on the amount of changes observed in the level(s) of the one or more biomarkers after administration of the composition as discussed above.
A method of assessing the relative efficacy of two or more compositions for treating kidney cancer comprises (1) analyzing, from a first subject having kidney cancer and currently or previously being treated with a first composition, a first biological sample to determine the level(s) of one or more biomarkers selected from Tables 1, 2, 4, 8, 10 and/or 11 (2) analyzing, from a second subject having kidney cancer and currently or previously being treated with a second composition, a second biological sample to determine the level(s) of the one or more biomarkers, and (3) comparing the level(s) of one or more biomarkers in the first sample to the level(s) of the one or more biomarkers in the second sample in order to assess the relative efficacy of the first and second compositions for treating kidney cancer. The results are indicative of the relative efficacy of the two compositions, and the results (or the levels of the one or more biomarkers in the first sample and/or the level(s) of the one or more biomarkers in the second sample) may be compared to kidney cancer-positive reference levels, kidney cancer-negative reference levels to aid in characterizing the relative efficacy.
Each of the methods of assessing efficacy may be conducted on one or more subjects or one or more groups of subjects (e.g., a first group being treated with a first composition and a second group being treated with a second composition).
As with the other methods described herein, the comparisons made in the methods of assessing efficacy (or relative efficacy) of compositions for treating kidney cancer may be carried out using various techniques, including simple comparisons, one or more statistical analyses, mathematical models, algorithms and combinations thereof. An example of a technique that may be used is determining the RCC score for a subject. Any suitable method may be used to analyze the biological samples in order to determine the level(s) of the one or more biomarkers in the samples. In addition, the level(s) of one or more biomarkers, including a combination of all of the biomarkers in Tables 1, 2, 4, 8, 10 and/or 11 or any fraction thereof, may be determined and used in methods of assessing efficacy (or relative efficacy) of compositions for treating kidney cancer.
Finally, the methods of assessing efficacy (or relative efficacy) of one or more compositions for treating kidney cancer may further comprise analyzing the biological sample to determine the level(s) of one or more non-biomarker compounds. The non-biomarker compounds may then be compared to reference levels of non-biomarker compounds for subjects having (or not having) kidney cancer.
G. Methods of Screening a Composition for Activity in Modulating Biomarkers Associated with Kidney Cancer
The identification of biomarkers for kidney cancer also allows for the screening of compositions for activity in modulating biomarkers associated with kidney cancer, which may be useful in treating kidney cancer. Methods of screening compositions useful for treatment of kidney cancer comprise assaying test compositions for activity in modulating the levels of one or more biomarkers in Tables 1, 2, 4, 8, 10 and/or 11. Such screening assays may be conducted in vitro and/or in vivo, and may be in any form known in the art useful for assaying modulation of such biomarkers in the presence of a test composition such as, for example, cell culture assays, organ culture assays, and in vivo assays (e.g., assays involving animal models).
In one embodiment, a method for screening a composition for activity in modulating one or more biomarkers of kidney cancer comprises (1) contacting one or more cells with a composition, (2) analyzing at least a portion of the one or more cells or a biological sample associated with the cells to determine the level(s) of one or more biomarkers of kidney cancer selected from Tables 1, 2, 4, 8, 10 and/or 11; and (3) comparing the level(s) of the one or more biomarkers with predetermined standard levels for the one or more biomarkers to determine whether the composition modulated the level(s) of the one or more biomarkers. As discussed above, the cells may be contacted with the composition in vitro and/or in vivo. The predetermined standard levels for the one or more biomarkers may be the levels of the one or more biomarkers in the one or more cells in the absence of the composition. The predetermined standard levels for the one or more biomarkers may also be the level(s) of the one or more biomarkers in control cells not contacted with the composition.
In addition, the methods may further comprise analyzing at least a portion of the one or more cells or a biological sample associated with the cells to determine the level(s) of one or more non-biomarker compounds of kidney cancer. The levels of the non-biomarker compounds may then be compared to predetermined standard levels of the one or more non-biomarker compounds.
Any suitable method may be used to analyze at least a portion of the one or more cells or a biological sample associated with the cells in order to determine the level(s) of the one or more biomarkers (or levels of non-biomarker compounds).
Suitable methods include chromatography (e.g., HPLC, gas chromatograph, liquid chromatography), mass spectrometry (e.g., MS, MS-MS), ELISA, antibody linkage, other immunochemical techniques, and combinations thereof. Further, the level(s) of the one or more biomarkers (or levels of non-biomarker compounds) may be measured indirectly, for example, by using an assay that measures the level of a compound (or compounds) that correlates with the level of the biomarker(s) (or non-biomarker compounds) that are desired to be measured.
H. Methods of Treating Kidney Cancer
The identification of biomarkers for kidney cancer also allows for the treatment of kidney cancer. For example, in order to treat a subject having kidney cancer, an effective amount of one or more kidney cancer biomarkers that are lowered in kidney cancer as compared to a healthy subject not having kidney cancer may be administered to the subject. The biomarkers that may be administered may comprise one or more of the biomarkers in Tables 1, 2, 4, 8, 10 and/or 11 that are decreased in kidney cancer. In some embodiments, the biomarkers that are administered are one or more biomarkers listed in Tables 1, 2, 4, 8, 10 and/or 11 that are decreased in kidney cancer and that have a p-value less than 0.10. In other embodiments, the biomarkers that are administered are one or biomarkers listed in Tables 1, 2, 4, 8, 10 and/or 11 that are decreased in kidney cancer by at least 5%, by at least 10%, by at least 15%, by at least 20%, by at least 25%, by at least 30%, by at least 35%, by at least 40%, by at least 45%, by at least 50%, by at least 55%, by at least 60%, by at least 65%, by at least 70%, by at least 75%, by at least 80%, by at least 85%, by at least 90%, by at least 95%, or by 100% (i.e., absent).
Other methods of using the biomarkers discussed herein are also contemplated. For example, the methods described in U.S. Pat. No. 7,005,255,
U.S. Pat. No. 7,329,489, U.S. Pat. No. 7,553,616, U.S. Pat. No. 7,550,260, U.S. Pat. No. 7,550,258, U.S. Pat. No. 7,635,556, U.S. patent application Ser. No. 11/728,826, U.S. patent application Ser. No. 12/463,690 and U.S. patent application Ser. No. 12/182,828 may be conducted using a small molecule profile comprising one or more of the biomarkers disclosed herein.
In any of the methods listed herein, the biomarkers that are used may be selected from those biomarkers in Tables 1, 2, 4, 8, 10 and/or 11 having p-values of less than 0.05. The biomarkers that are used in any of the methods described herein may also be selected from those biomarkers in Tables 1, 2, 4, 8, 10 and/or 11 that are decreased in kidney cancer (as compared to the control) or that are decreased in high stage (as compared to control or low stage) or that are decreased in more aggressive (as compared to control or less aggressive) by at least 5%, by at least 10%, by at least 15%, by at least 20%, by at least 25%, by at least 30%, by at least 35%, by at least 40%, by at least 45%, by at least 50%, by at least 55%, by at least 60%, by at least 65%, by at least 70%, by at least 75%, by at least 80%, by at least 85%, by at least 90%, by at least 95%, or by 100% (i.e., absent); and/or those biomarkers in Tables 1, 2, 4, 8, 10 and/or 11 that are increased in kidney cancer (as compared to the control or remission) or that are increased high stage (as compared to control or low stage) or that are increased in more aggressive (as compared to control or less aggressive) by at least 5%, by at least 10%, by at least 15%, by at least 20%, by at least 25%, by at least 30%, by at least 35%, by at least 40%, by at least 45%, by at least 50%, by at least 55%, by at least 60%, by at least 65%, by at least 70%, by at least 75%, by at least 80%, by at least 85%, by at least 90%, by at least 95%, by at least 100%, by at least 110%, by at least 120%, by at least 130%, by at least 140%, by at least 150%, or more.
The invention will be further explained by the following illustrative examples that are intended to be non-limiting.
I. General Methods
A. Identification of Metabolic profiles for kidney cancer
Each sample was analyzed to determine the concentration of several hundred metabolites. Analytical techniques such as GC-MS (gas chromatography-mass spectrometry) and LC-MS (liquid chromatography-mass spectrometry) were used to analyze the metabolites. Multiple aliquots were simultaneously, and in parallel, analyzed, and, after appropriate quality control (QC), the information derived from each analysis was recombined. Every sample was characterized according to several thousand characteristics, which ultimately amount to several hundred chemical species. The techniques used were able to identify novel and chemically unnamed compounds.
B. Statistical Analysis
The data was analyzed using T-tests to identify molecules present at differential levels in a definable population or subpopulation (e.g., biomarkers for kidney cancer biological samples compared to control biological samples or compared to patients in remission from kidney cancer) useful for distinguishing between the definable populations (e.g., kidney cancer and control). Other molecules in the definable population or subpopulation were also identified.
Data was also analyzed using Random Forest Analysis. Random Forests give an estimate of how well individuals in a new data set can be classified into existing groups. Random Forest Analysis creates a set of classification trees based on continual sampling of the experimental units and compounds. Then each observation is classified based on the majority votes from all the classification trees. In statistics, a classification tree classifies the observations into groups based on combinations of the variables (in this instance variables are metabolites or compounds). There are many variations on the algorithms used to create trees. A tree algorithm searches for the metabolite (compound) that provides the largest split between the two groups. This produces nodes. Then at each node, the metabolite that provides the best split is used and so on. If the node cannot be improved on, then it stops at that node and any observation in that node is classified as the majority group.
Random Forests classify based on a large number (e.g. thousands) of trees. A subset of compounds and a subset of observations are used to create each tree. The observations used to create the tree are called the in-bag samples, and the remaining samples are called the out-of-bag samples. The classification tree is created from the in-bag samples, and the out-of-bag samples are predicted from this tree. To get the final classification for an observation, the “votes” for each group are counted based on the times it was an out-of-bag sample. For example, suppose observation 1 was classified as a “Control” by 2,000 trees, but classified as “Disease” by 3,000 trees. Using “majority wins” as the criterion, this sample is classified as “Disease.”
The results of the Random Forest are summarized in a Confusion Matrix. The rows correspond to the true grouping, and the columns correspond to the classification from the random forest. Thus, the diagonal elements indicate the correct classifications. A 50% error would occur by random chance for 2 groups, 66.67% error for three groups by random chance, etc. The “Out-of-Bag” (OOB) Error rate gives an estimate of how accurately new observations can be predicted using the random forest model (e.g., whether a sample is from a diseased subject or a control subject).
It is also of interest to see which variables are more “important” in the final classifications. The “Importance Plot” shows the top compounds ranked in terms of their importance. There are different criteria for ranking the importance, but the general idea is that removing an important variable will cause a greater decrease in accuracy than a variable that is less important.
The data were also analyzed using a mixed model which consists of both fixed effect and random effect and is widely used for clustered data to build models that are useful to identify the biomarker compounds that are associated with kidney cancer. This method allows for the ability to control the known confounding factors (e.g., age, gender, BMI) to reduce the likelihood of a spurious relationship and thus reduce the probability of false positives. To assess biomarkers for tumor aggressiveness, Fisher's method was used following the mixed model analysis to combine the results of stage, grade and metastatic potential. Biomarker compounds that are useful to predict kidney cancer and that are positively or negatively correlated with kidney cancer were identified in these analyses.
C. Biomarker Identification
Various peaks identified in the analyses (e.g. GC-MS, LC-MS, LC-MS-MS), including those identified as statistically significant, were subjected to a mass spectrometry based chemical identification process.
Biomarkers were discovered by (1) analyzing tissue samples from human subjects to determine the levels of metabolites in the samples and then (2) statistically analyzing the results to determine those metabolites that were differentially present in the kidney cancer tissue samples compared to the benign tissue samples.
Six kidney cancer positive and 6 patient-matched non-cancer human kidney core biopsies were obtained post-nephrectomy using an 18 gauge biopsy gun and placed into cryovials (Nalgene) containing 2 ml of 80% methanol. A single biopsy was placed in each vial and incubated for 24-72 hours at room temperature (22-24° C.). Following incubation, the tissues were removed from the solvent for histological analysis, and the solvent was prepared for metabolomics analysis. The cancer status of the sample was verified by histopathology analysis. Histological analysis was performed by a board-certified pathologist.
For metabolomics analysis, the solvent extracts were evaporated to dryness under a stream of nitrogen gas at 40° C. in a Turbovap LV evaporator (Zymark). The dried extracts were reconstituted in 550 μl methanol:water (80:20) containing recovery standards (D,L-2-fluorophenylglycine, D,L-4-chlorophenylalanine, tridecanoic acid, D6 cholesterol). The reconstituted solution was analyzed by metabolomics.
After the levels of metabolites were determined, statistical analysis was performed to identify metabolites that were significantly altered in the kidney cancer samples compared to the patient-matched non-cancer samples. The results of the matched pairs t-test analysis showed that 91 metabolites were significantly (p<0.1) altered in kidney cancer samples compared to the non-cancer samples. Table 1 lists the identified biomarkers having a p-value of less than 0.1. Table 1 includes, for each listed biomarker, the biochemical name of the biomarker, an indication of the percentage difference in the cancer sample mean as compared to the non-cancer sample mean (positive values represent an increase in kidney cancer, and negative values represent a decrease in kidney cancer), the p-value, and the q-value determined in the statistical analysis of the data concerning the biomarkers. Also included in Table 1 are: the identifier for that biomarker compound in the Kyoto Encyclopedia of Genes and Genomes (KEGG), if available; and the identifier for that biomarker compound in the Human Metabolome Database (HMDB), if available.
Listed in Table 2 are biomarkers that were identified as differentially present between kidney cancer samples compared to the patient-matched non-cancer samples where p>0.1. All of the biomarkers in Table 2 differentially increase or decrease at least 5% in the kidney cancer samples. Table 2 includes, for each listed biomarker, the biochemical name of the biomarker, an indication of the percentage difference in the cancer sample mean as compared to the benign sample mean (positive values represent an increase in cancer, and negative values represent a decrease in cancer), the p-value and the q-value. Also included in Table 2 are: the identifier for that biomarker compound in the Kyoto Encyclopedia of Genes and Genomes (KEGG), if available; and the identifier for that biomarker compound in the Human Metabolome Database (HMDB), if available.
The data obtained in Example 1 concerning biopsy samples was used to create a statistical (mathematical) model to classify the samples into kidney cancer or non-cancer groups.
Random Forest Analysis was used to classify kidney samples into kidney cancer positive (kidney cancer) or cancer negative groups. Random Forests give an estimate of how well individuals in a new data set can be classified into each group. This is in contrast to a t-test, which tests whether or not the unknown means for two populations are different. Random forests create a set of classification trees based on continual sampling of the experimental units and compounds. Then each observation is classified based on the majority votes from all the classification trees.
Random forest results show that the samples can be classified correctly with 83% prediction accuracy. The Confusion Matrix presented in Table 3 shows the number of samples predicted for each classification and the actual in each group (Kidney Cancer or Non-Cancer). The “Out-of-Bag” (OOB) Error rate gives an estimate of how accurately new observations can be predicted using the Random Forest Model (e.g., whether a sample contains tumor (cancer-positive) or is cancer-negative). The OOB error from this Random Forest was approximately 17%, and the model estimated that, when used on a new set of samples, the identity of kidney cancer positive samples could be predicted correctly 67% of the time and non-cancer samples could be predicted correctly 100% of the time.
Based on the OOB Error rate of 17%, the Random Forest model that was created predicted whether a sample was kidney cancer positive with about 83% accuracy based on the levels of the biomarkers measured in samples from the subjects. Exemplary biomarkers for distinguishing the groups are oxidized glutathione (GSSG), proline, 2-oleoylglycerophosphoethanolamine, 2-aminobutyrate, sphingosine, 3-dehydrocamitine, 2-docosahexaenoylglycerophosphocholine, 2-linoleoylglycerophosphocholine, phosphoethanolamine, glutamate, pyrophosphate (PPi), nicotinamide-adenine-dinucleotide (NAD+), 3-aminoisobutyrate, 2-arachidonoylglycerophosphoethanolamine, 2-arachidonoylglycerophosphocholine, 2-oleoylglycerophosphocholine, glycerate, choline-phosphate, pyruvate, 1-arachidonoylglycerophosphoethanolamine, adenine, 1-2-propanediol, 2-docosahexaenoylglycerophosphoethanolamine, 2-hydroxybutyrate (AHB), creatine, glycolate (hydroxyacetate), malate, 5-methylthioadenosine (MTA), stearolycarnitine, and 1-arachidonoylglycerophosphoinositol.
The Random Forest analysis demonstrated that by using the biomarkers, kidney cancer positive samples were distinguished from non-cancer samples with 67% sensitivity, 100% specificity, 100% Positive Predictive Value (PPV), and 75% Negative Predictive Value (NPV).
In addition, Principal Component Analysis (PCA) was carried out using the biomarkers where p<0.05 obtained from biopsy samples in Example 1 to classify the samples as non-cancer or Kidney Cancer (RCC).
Using the mathematical model created using PCA, it was found that 6 of 6 cancer-negative samples were correctly classified as cancer negative while 5 of 6 kidney cancer-positive samples were correctly classified as kidney cancer based on the biomarker abundance. A graphical depiction of the PCA results is presented in
Hierarchical clustering (Euclidean distance) using the biomarkers where p<0.05 identified from biopsy samples in Example 1 was also used to classify the subjects. This analysis resulted in the subjects being divided into two distinct groups. One group consisted of four cancer biopsies and one non-cancer biopsy, and the other group consisted of two cancer biopsies and five non-cancer biopsies. These data suggest that there are multiple metabolic types of kidney disease and/or kidney cancer that can be distinguished using tissue biopsy biomarker metabolite levels. For example, the cancer-containing samples identified in the second group may have a less aggressive faun of kidney cancer or may be at an earlier stage of cancer. Distinguishing between types of cancer (e.g., less vs. more aggressive) and stage of cancer may be valuable information to a doctor determining a course of treatment.
Biomarkers were discovered by (1) analyzing different groups of tissue samples from human subjects to determine the levels of metabolites in the samples and then (2) statistically analyzing the results to determine those metabolites that are differentially present in the following groups: normal tissue compared to tumor tissue; early stage (T1) cancer tissue compared to normal tissue; and later stage (T3) cancer tissue compared to normal tissue.
The samples used for the analysis were matched pairs of RCC tumor and adjacent normal kidney tissue collected from 140 subjects with RCC. Subjects were further divided based on tumor stage with 43 subjects having Stage 1 (T1), 13 subjects with Stage 2 (T2), 80 subjects with Stage 3 (T3) and 4 subjects with Stage 4 (T4) kidney cancer.
After the levels of metabolites were determined, the data were analyzed using Welch's two-sample t-tests. Three comparisons were used to identify biomarkers for kidney cancer: Kidney cancer vs. Normal; T1 Kidney cancer vs. Normal; T3 Kidney cancer vs. Normal. As listed in Table 4 below, the analysis of named compounds resulted in the identification of biomarkers that are differentially present between a) kidney cancer and Normal tissue b) early stage (T1) kidney cancer and Normal tissue and/or c) later stage (T3) kidney cancer and Normal tissue.
Table 4 includes, for each biomarker, the biochemical name of the biomarker, the fold change (FC) of the biomarker in kidney cancer compared to non-kidney cancer samples (Tumor/Normal, T1 Tumor/T1 Normal and T3 Tumor/T3 Normal) which is the ratio of the mean level of the biomarker in kidney cancer samples as compared to the non-kidney cancer mean level and the p-value determined in the statistical analysis of the data concerning the biomarkers. Bold values indicate a fold of change with a p-value of ≦0.1.
4.91
5.42
4.66
0.3
0.29
0.31
8.39
5.38
9.28
8.76
8.84
9.21
0.24
0.23
0.25
6.92
6.1
7.02
17.03
13.98
17.5
21.95
14.41
26.14
9.62
10.09
9.48
13.04
8.7
14.42
0.29
0.22
0.33
4.65
3.85
5.32
9.38
6.63
10.24
0.1
0.07
0.11
0.21
0.19
0.23
0.05
0.04
0.06
0.03
0.02
0.03
0.09
0.08
0.1
0.03
0.02
0.03
0.34
0.33
0.35
4.6
7.25
3.7
4.22
3.8
4.61
0.38
0.45
0.37
0.12
0.1
0.14
0.37
0.36
0.37
4.65
5.7
4.94
0.4
0.45
0.36
0.4
0.39
0.42
4.63
5.69
4.17
0.27
0.25
0.28
0.44
0.42
0.45
2.46
2.02
2.7
0.36
0.32
0.41
0.52
0.6
0.53
37.54
9.03
43.43
0.38
0.34
0.41
9.38
9.92
8.26
2.57
2.57
2.66
0.22
0.2
0.24
0.4
0.38
0.44
0.45
0.43
0.48
0.45
0.38
0.52
0.4
0.37
0.42
0.37
0.36
0.4
3.52
3.9
3.49
2.63
2.3
2.93
0.7
0.77
0.69
0.12
0.11
0.14
5.03
5.62
4.85
0.43
0.36
0.5
2.23
2.26
2.2
0.54
0.53
0.57
0.55
0.37
0.68
2.73
2.6
2.87
0.08
0.06
0.09
0.48
0.42
0.5
6.25
3.14
7.96
4.44
4.9
3.84
0.42
0.4
0.45
4.98
6.75
4.5
0.15
0.11
0.18
0.36
0.33
0.42
4.08
3.85
4.35
0.22
0.25
0.2
0.28
0.15
0.38
0.04
0.03
0.06
0.41
0.34
0.46
0.54
0.45
0.59
0.25
0.27
0.24
0.11
0.08
0.13
4.43
4.2
4.36
3.07
2.87
3.23
0.45
0.44
0.47
0.03
0.02
0.04
2.86
1.92
3.33
7.89
7.84
7.85
3.01
3.22
2.9
3.84
5.53
3.6
0.3
0.26
0.35
4.76
2.47
6.07
1.84
1.75
1.99
3.14
3.29
3.38
0.26
0.14
0.36
4.38
5.75
4.16
2.16
1.62
2.56
1.53
1.83
1.47
1.62
1.71
1.61
2.74
2.35
2.94
4.27
3.42
4.76
3.73
4.38
3.58
0.3
0.23
0.33
0.54
0.45
0.6
0.3
0.14
0.44
0.35
0.24
0.41
2.39
2.6
2.32
1.36
1.45
1.33
0.57
0.51
0.64
0.55
0.35
0.78
0.66
0.77
0.62
2.81
1.98
3.47
6.7
5.59
6.38
5.4
3.91
5.69
2.9
2.97
2.94
2.21
2.44
2.28
2.36
2.09
2.5
2.4
2.45
2.37
0.61
0.58
0.64
2.02
1.54
2.61
2.73
2.41
2.98
2.16
1.96
2.35
1.94
1.63
2.21
3.98
2.71
4.55
0.55
0.43
0.66
2.03
2.06
1.77
2.93
3.53
2.62
0.59
0.56
0.61
4.3
3.51
4.67
2.72
3
2.71
0.38
0.2
0.51
1.7
1.84
1.66
2.03
1.91
2.08
0.49
0.34
0.57
0.29
0.26
0.33
2.99
3.14
2.9
1.64
1.91
1.42
2.58
1.83
3.11
2.76
2.73
2.86
4.55
3.15
5.23
4.2
3.54
4.52
2.74
2.48
2.98
1.4
1.57
1.29
6.38
5.2
6.48
0.6
0.62
0.64
1.97
1.47
2.25
0.7
0.69
0.73
1.4
1.49
1.41
1.83
1.34
2.13
1.96
1.93
2.07
5.43
4.81
5.78
31.39
21.01
32.2
4.22
3.06
4.6
0.79
0.67
2.62
3.15
2.38
3.88
3.2
4.12
4.01
2.49
4.77
4.1
3.5
4.41
2.49
2.14
2.59
3.4
2.57
3.93
0.82
0.91
0.77
0.7
0.68
0.68
1.29
1.33
1.27
1.85
1.64
2.08
0.42
0.22
0.58
1.57
1.6
1.6
1.41
1.54
1.39
1.64
1.76
1.67
0.65
0.56
0.73
2.21
1.63
2.6
1.54
1.19
1.89
0.54
0.32
0.65
0.3
0.15
0.46
3.86
4.39
0.38
0.21
0.5
1.74
2.07
3.93
3.54
4.15
0.75
0.75
0.75
0.8
0.77
0.84
2.57
2.99
2.32
0.82
0.58
0.92
0.53
0.46
0.63
0.82
0.75
6.85
1.95
9.79
0.75
0.6
0.88
0.84
0.57
1
0.84
0.66
0.98
0.52
0.39
0.68
2.72
2.54
2.9
0.88
0.79
3.45
2.77
4.07
1.27
1.41
1.72
1.83
2.49
2.85
3.1
2.21
3.53
1.34
1.37
1.34
2.41
2.49
2.47
1.37
1.45
1.37
2.19
2.4
0.8
0.74
2.35
2.24
2.49
0.87
0.87
0.86
3.35
3.55
1.62
1.88
1.81
2.03
1.81
0.59
0.52
2.41
2.21
2.49
2.61
2.4
3.21
3.55
2.76
3.99
2.54
2.86
1.48
1.47
1.55
2.6
11.64
1.49
1.27
1.37
4.02
2.16
5.08
1.47
1.66
2.11
2.2
2.07
2.74
3.12
3.62
4.31
1.52
1.94
1.94
2.26
1.87
1.17
4.94
0.97
1.69
1.59
1.77
2.48
2.91
1.3
1.34
1.35
1.57
1.51
1.66
0.86
0.76
0.81
0.85
0.78
2.03
3.37
1.7
2.05
2.27
2.25
2.61
2.18
2.23
2.68
2.23
1.5
1.4
1.17
1.99
1.88
2.14
0.78
0.65
0.8
0.78
0.79
0.83
0.67
3.9
4.65
0.81
0.8
0.87
0.87
0.73
2.04
2.16
1.72
2.45
1.46
1.49
1.47
1.55
0.93
0.71
1.71
1.82
0.75
0.64
0.79
1.65
1.79
1.9
2.28
1.82
1.31
1.47
1.25
3.12
3.69
0.96
0.93
2.53
2.95
3.12
3.51
1.96
2.5
1.72
1.96
1.65
0.86
0.88
0.9
1.09
0.75
3.04
3.93
0.82
0.54
1.8
2.24
1.62
0.88
0.81
2.32
2.51
1.79
1.71
0.65
0.53
0.72
1.97
1.92
1.71
2.01
1.7
1.86
0.72
0.64
0.7
1.82
2.03
0.74
0.83
0.66
0.64
0.37
1.84
2.1
1.58
1.89
1.46
0.92
0.92
0.95
0.1
0.1
0.76
0.77
0.77
0.79
0.85
2.07
2.27
1.85
1.98
2.15
2.37
1.56
1.58
0.61
0.54
2.62
2.85
1.15
1.16
1.14
0.85
0.86
0.86
0.7
2.04
2.28
2.15
2.34
1.3
1.23
0.81
0.6
0.94
0.88
1.28
1.22
1.67
1.67
0.74
0.72
0.89
1.15
1.59
1.79
1.32
1.42
1.13
1.18
0.65
1.02
1.05
1.52
1.47
1
0.42
1.37
1.48
0.8
0.49
1.4
0.72
0.85
0.48
1.81
2.07
1.46
1.9
1.63
2.13
0.89
0.8
0.66
0.37
2.52
2.73
1.16
1.26
2.26
2.43
1.8
2.05
1.94
2.28
1.23
1.17
1.2
1.48
0.87
0.85
0.87
0.58
0.53
1.7
1.88
2.01
2.32
0.66
0.57
0.9
0.82
1.11
0.88
1.99
1.87
1.33
0.47
1.21
1.05
0.83
1.48
1.72
2.33
2.01
0.74
1.74
1.59
0.73
1.79
1.5
2.3
2.31
1.24
0.72
1.53
1.8
2.09
0.87
0.82
2.43
1.35
0.63
1.44
1.12
1.21
2.13
1.21
1.58
1.6
1.27
1.23
1.45
1.05
0.66
1.58
1.14
1.8
0.86
1.25
0.55
1.87
1.65
0.94
1.53
0.33
0.38
0.32
2.45
3.09
2.34
0.09
0.16
0.08
0.36
0.27
0.4
0.51
0.72
0.25
2.84
2.47
3.21
6.14
4.68
7.38
0.36
0.42
0.37
0.69
0.59
0.65
0.54
0.8
0.74
0.88
0.88
0.86
0.91
0.91
0.93
0.82
0.7
1
0.94
1.17
0.31
0.29
0.33
4.27
5.68
4.09
1.5
1.45
1.57
0.49
0.51
0.5
0.59
0.55
0.62
0.59
0.55
0.63
0.31
0.32
0.31
4.9
3.72
5.32
0.3
0.33
0.3
0.5
0.59
0.45
0.55
0.5
0.59
0.39
0.36
0.42
0.51
0.47
0.54
0.49
0.44
0.52
0.48
0.46
0.52
0.26
0.27
0.26
0.21
0.21
0.23
2.78
2.23
2.98
2.51
2.11
2.85
3.32
14.84
1.83
0.09
0.12
0.09
0.29
0.24
0.32
0.34
0.31
0.36
0.54
0.52
0.57
0.27
0.23
0.28
0.42
0.4
0.45
0.5
0.46
0.54
0.59
0.54
0.63
0.66
0.54
0.74
2.18
4.78
1.48
0.53
0.42
0.58
7.89
8.74
7.74
0.26
0.29
0.22
0.3
0.25
0.34
0.66
0.79
0.6
0.28
0.35
0.26
0.55
0.63
0.51
0.71
0.72
0.71
0.27
0.22
0.34
0.1
0.11
0.09
0.46
0.54
0.45
0.32
0.28
0.4
4.18
3.19
4.48
0.28
0.25
0.34
0.41
0.46
0.41
0.19
0.15
0.22
0.09
0.07
0.11
0.2
0.2
0.2
0.29
0.31
0.27
0.59
0.76
0.52
0.19
0.18
0.2
2.77
2.62
2.92
0.58
0.77
0.49
0.23
0.16
0.33
0.39
0.33
0.41
0.51
0.78
0.44
3.61
3.18
3.98
0.28
0.26
0.3
6.41
7.39
6.11
0.28
0.22
0.33
0.1
0.12
0.1
0.43
0.37
0.46
0.26
0.19
0.28
0.19
0.22
0.17
0.37
0.36
0.37
0.35
0.28
0.41
0.18
0.19
0.19
0.57
0.3
0.69
0.55
0.73
0.49
0.22
0.22
0.22
0.28
0.31
0.26
1.7
1.57
1.84
0.46
0.44
0.46
0.57
0.57
0.56
1.37
1.44
1.35
0.66
0.67
0.71
0.52
0.59
0.47
2.14
1.91
2.24
0.48
0.56
0.48
0.74
0.89
0.64
4.63
5.01
4.56
0.46
0.41
0.5
0.43
0.54
0.42
0.64
0.63
0.66
1.82
1.42
2.22
3.86
4.95
3.91
6.4
7.27
6.74
2.09
1.44
2.58
0.08
0.3
0.07
2.09
1.67
2.57
0.58
0.61
0.58
0.73
0.67
4.49
4.01
4.89
0.66
0.63
0.7
0.63
0.67
0.62
1.57
1.48
1.6
1.69
1.41
1.81
0.21
0.23
0.2
0.47
0.69
0.37
0.65
0.62
0.69
1.51
1.75
0.84
0.62
1.3
1.28
1.32
0.74
0.68
0.79
1.48
1.41
1.58
2.03
2.44
1.57
1.84
0.66
0.57
0.78
0.57
1.97
2.31
2.1
2.56
0.87
0.7
2.25
2.11
2.38
0.88
0.74
0.97
0.58
0.47
1.34
1.46
1.33
0.84
0.86
1.3
1.41
1.24
1.29
1.35
0.84
0.68
0.57
0.48
1.27
1.39
0.79
0.78
0.81
0.87
2.04
2.58
1.37
1.16
1.17
1.78
2.92
1.3
1.62
1.67
0.86
2.21
1.34
1.31
1.1
1.14
0.98
0.79
0.83
0.89
1.33
0.97
0.78
0.94
0.94
1.33
0.75
0.72
0.86
0.82
0.67
2.34
The biomarkers were used to create a statistical model to classify the samples. Using Random Forest analysis, the biomarkers were used in a mathematical model to classify samples as Normal tissue or as Tumor (cancer). Samples from patient-matched kidney tumor and normal tissue from 140 subjects were used in this analysis.
Random Forest results show that the samples were classified with 99% prediction accuracy. The Confusion Matrix presented in Table 5 shows the number of samples predicted for each classification and the actual in each group (Tumor or Normal). The “Out-of-Bag” (OOB) Error rate gives an estimate of how accurately new observations can be predicted using the Random Forest model (e.g., whether a sample is from tumor tissue or normal tissue). The OOB error from this Random Forest was approximately 1%, and the model estimated that, when used on a new set of subjects, the identity of normal subjects could be predicted correctly 98% of the time and kidney cancer subjects could be predicted 100% of the time.
Based on the OOB Error rate of 1%, the Random Forest model that was created predicted the tumor status of a sample with about 99% accuracy based on the levels of the biomarkers measured in samples from the subject. Exemplary biomarkers for distinguishing the groups are N-acetylaspartate (NAA), maltose, N-acetyl-aspartyl-glutamate (NAAG), 1-palmitoylglycerophosphoethanolamine, phenylacetylglutamine, glucose 6-phosphate (G6P), 1-oleoylglycerophosphoethanolamine, pseudouridine, maltotriose, N6-acetyllysine, 2-oleoylglycerophosphoethanolamine, glucose, eicosenoate (20:1n9 or 1n11), fructose-6-phosphate, 1-palmitoylglycerophosphoinositol, maltotetraose, N1-methylguanosine, 2-palmitoylglycerophosphoethanolamine, dimethylarginine (ADMA+SDMA), N1-methyladenosine, pantothenate, malonylcarnitine, arachidonate (20:4n6), 1-palmitoylplasmenylethanolamine, hippurate, 1-stearoylglycerophosphoethanolamine, kynurenine, alpha-tocopherol, fructose 1-phosphate, and 1-stearoylglycerophosphoinositol.
The Random Forest analysis demonstrated that by using the biomarkers, tumor samples were distinguished from Normal samples with 99% sensitivity, 98% specificity, 98% PPV and 99% NPV.
The biomarkers were used to create a statistical model to classify the early stage (T1) samples. Using Random Forest analysis, the biomarkers were used in a mathematical model to classify samples as Normal or as tumor. Samples from patient-matched kidney tumor and normal tissue from 43 subjects with Stage 1 (T1) kidney cancer were used in this analysis.
Random Forest results show that the samples were classified with 99% prediction accuracy. The Confusion Matrix presented in Table 6 shows the number of samples predicted for each classification and the actual in each group (T1 Tumor or T1 Normal). The “Out-of-Bag” (OOB) Error rate gives an estimate of how accurately new observations can be predicted using the Random Forest model (e.g., whether a sample is from tumor tissue or normal tissue). The OOB error from this Random Forest was approximately 1%, and the model estimated that, when used on a new set of subjects, the identity of normal subjects could be predicted correctly 98% of the time and kidney cancer subjects could be predicted 100% of the time.
Based on the OOB Error rate of 1%, the Random Forest model that was created predicted the tumor status of a sample with about 99% accuracy based on the levels of the biomarkers measured in samples from the subjects. Exemplary biomarkers for distinguishing the groups are N-acetylaspartate (NAA), 1-oleoyl-GPE (18:1), N-acetyl-aspartyl-glutamate (NAAG), 1-palmitoyl-GPE (16:0), maltose, 2-oleoyl-GPE (18:1), eicosenoate (20:1n9 or 1n11), 1-palmitoyl-GPI (16:0), 2-palmitoyl-GPE (16:0), 1-stearoyl-GPI (18:0), N2-methylguanosine, phenylacetylglutamine, N-acetylneuraminate, beta-alanine, malonylcarnitine, fructose 6-phosphate, gamma-glutamylglutamate, FAD, pseudouridine, 1-methylguanisine, 1-stearoyl-GPE (18:0), citrate, pantothenate (Vitamin B5), 1-palmitoylplasmenylethanolamine, arachidonate (20:4n6), N6-acetyllysine, 1-oleoyl-GPI (18:1), 2-methylbutyroylcarnitine (C5), fructose 1-phosphate, alpha-tocopherol.
The Random Forest analysis demonstrated that by using the biomarkers, tumor samples were distinguished from Normal samples with 100% sensitivity, 98% specificity, 98% PPV and 100% NPV.
The biomarkers were used to create a statistical model to classify the samples. Using Random Forest analysis, the biomarkers were used in a mathematical model to classify samples as Normal or as tumor. Samples from patient-matched kidney tumor and normal tissue from 80 subjects with Stage 3 (T3) kidney cancer were used in this analysis.
Random Forest results show that the samples were classified with 98% prediction accuracy. The Confusion Matrix presented in Table 7 shows the number of samples predicted for each classification and the actual in each group (T3 Tumor or T3 Normal). The “Out-of-Bag” (OOB) Error rate gives an estimate of how accurately new observations can be predicted using the Random Forest model (e.g., whether a sample is from tumor tissue or normal tissue). The OOB error from this Random Forest was approximately 2%, and the model estimated that, when used on a new set of subjects, the identity of normal subjects could be predicted correctly 96% of the time and kidney cancer subjects could be predicted 99% of the time.
Based on the OOB Error rate of 2%, the Random Forest model that was created predicted the tumor status of a sample with about 98% accuracy based on the levels of the biomarkers measured in samples from the subject. Exemplary biomarkers for distinguishing the groups are maltose, N-acetylaspartate (NAA), N-acetyl-aspartyl-glutamate (NAAG), glucose 6-phosphate (G6P), maltotetraose, phenylacetylglutamine, maltotriose, pseudouridine, 1-palmitoylglycerophosphoethanolamine, N1-methylguanosine, methyl-alpha-glucopyranoside, fructose-6-phosphate, 1-oleoylglycerophosphoethanolamine, N6-acetyllysine, dimethylarginine (ADMA+SDMA), 1-palmitoylglycerophosphoinositol, hippurate, N1-methyladenosine, mannose-6-phosphate, eicosenoate (20:1n9 or 11), glucose, pantothenate, 2-oleoylglycerophosphoethanolamine, alpha-tocopherol, 2-hydroxyglutarate, 2-palmitoylglycerophosphoethanolamine, arabitol, malonylcarnitine, arachidonate (20:4n6), and ergothioneine.
The Random Forest analysis demonstrated that by using the biomarkers, tumor samples were distinguished from Normal samples with 99% sensitivity, 96% specificity, 96% PPV and 99% NPV.
Kidney cancer staging provides an indication of how far the kidney tumor has spread beyond the kidney. The tumor stage is used to select treatment options and to estimate a patient's prognosis. Kidney tumor stages range from T1 (tumor 7 cm or less in size and limited to kidney, least advanced) to T4 (tumor invades beyond Gerota's fascia, most advanced).
To identify biomarkers of kidney cancer stage, metabolomic analysis was carried out on tissue samples from 56 subjects with Low stage RCC (T1, T2) and 84 subjects with High stage RCC (T3,T4). After the levels of metabolites were determined, the data were analyzed using Welch's two-sample t-test to identify biomarkers that differed between low stage kidney cancer compared to high stage kidney cancer. The biomarkers are listed in Table 8.
Table 8 includes, for each biomarker, the biochemical name of the biomarker, the fold change (FC) of the biomarker in high stage kidney cancer compared to low stage kidney cancer (T3,T4 Tumor/T1,T2 Tumor) and the p-value determined in the statistical analysis of the data concerning the biomarkers. Columns 4 and 5 of Table 8 include the identifier for that biomarker compound in the Kyoto Encyclopedia of Genes and Genomes (KEGG), if available; and the identifier for that biomarker compound in the Human Metabolome Database (HMDB), if available. Bold values indicate a fold of change with a p-value of <0.1.
The biomarkers were used to create a statistical model to classify the subjects. The biomarkers were evaluated using Random Forest analysis to classify subjects as having low stage or high stage kidney cancer. Samples from 56 subjects with Low stage RCC (T1, T2) and 84 subjects with High stage RCC (T3,T4) were used in this analysis.
Random Forest results show that the samples were classified with 72% prediction accuracy. The Confusion Matrix presented in Table 9 shows the number of samples predicted for each classification and the actual in each group (Low Stage or High Stage). The “Out-of-Bag” (OOB) Error rate gives an estimate of how accurately new observations can be predicted using the Random Forest model (e.g., whether a sample is from a subject with low stage RCC or high stage RCC). The OOB error from this Random Forest was approximately 28%, and the model estimated that, when used on a new set of subjects, the identity of low stage RCC subjects could be predicted correctly 68% of the time and high stage RCC subjects could be predicted 75% of the time.
Based on the OOB Error rate of 28%, the Random Forest model that was created predicted whether a sample was from an individual with low stage or high stage kidney cancer with about 72% accuracy based on the levels of the biomarkers measured in samples from the subject. Exemplary biomarkers for distinguishing the groups are choline, pelargonate (9:0), arginine, gamma-glutamylleucine, xanthine, tyrosine, 5-oxoproline, inositol-1-phosphate (11P), N2-methylguanosine, isoleucine, 2-ethylhexanoate, leucine, adenine, 5-methylthioadenosine (MTA), laurate (12:0), phenylalanine, mannose, uracil, xanthosine, erythritol, guanosine-5-monophosphate-5 (GMP), homocysteine, lactate, 4-hydroxybutyrate (GHB), ribose, fucose, S-adenosylhomocysteine (SAH), mannitol, hypoxanthine, and threonine.
The Random Forest analysis demonstrated that by using the biomarkers, low stage kidney cancer subjects were distinguished from high stage kidney cancer subjects with 75% sensitivity, 68% specificity, 78% PPV and 64% NPV.
Tumors from subjects with kidney cancer were assessed for aggressiveness based on three criteria: tumor stage, tumor grade, and tumor metastatic potential. To identify biomarkers of kidney cancer aggressiveness, metabolomic analysis was carried out on tissue samples from 140 subjects with kidney cancer. Tumor stage, grade and metastatic potential were reported for each subject. After the levels of metabolites were determined, the data were analyzed using a mixed model that consists of fixed effects and a random effect. Fisher's method was then used combine the aggressiveness criteria of tumor stage, tumor grade and tumor metastatic potential to identify biomarkers that are associated with kidney cancer aggressiveness. The 50 biomarkers most highly associated with kidney cancer aggressiveness are listed in Table 10.
Table 10 includes, for each biomarker, the biochemical name of the biomarker, the internal identifier for that biomarker compound in the in-house chemical library of authentic standards (CompID), the p-value determined in the statistical analysis of the data concerning the biomarkers, and whether the biomarker is positively or negatively associated with aggressiveness. A positive association means that as kidney cancer aggressiveness increases, the level of the biomarker increases (i.e., the biomarker is higher in more aggressive cancer); a negative association means that as kidney cancer aggressiveness increases, the level of the biomarker decreases (i.e., the biomarker is lower in more aggressive cancer).
To identify biomarkers of renal cell carcinoma, urine samples collected
from subjects with: 1) RCC, 2) prostate cancer (PCA), 3) bladder cancer (BCA) and 4) normal subjects were analyzed metabolomically. After the levels of metabolites were determined, biomarkers of RCC were identified using one-way ANOVA contrasts. Biomarkers of RCC were identified as metabolites that differed between 1) RCC and normal subjects, 2) RCC and PCA subjects, and/or 3) RCC and BCA subjects. The biomarkers are listed in Table 11.
Table 11 includes, for each biomarker, the biochemical name of the biomarker, the fold change (FC) of the biomarker in 1) RCC compared to Normal, 2) RCC compared to BCA, 3) RCC compared to PCA, and the p-value determined in the statistical analysis of the data concerning the biomarkers. In column 8 of Table 11, the identifier for that biomarker compound in the Human Metabolome Database (HMDB), if available, is listed. Bold values indicate a fold of change with a p-value of <0.1.
0.32
5.91
4.36
0.14
0.39
0.45
0.67
0.89
4.71
1.76
12.15
0.49
3.15
0.14
0.42
2.16
0.37
1.41
0.57
0.87
0.42
0.26
0.6
0.63
3.23
0.15
0.59
2.49
1.48
0.39
0.47
1.6
1.78
1.67
2.68
1.64
1.35
1.7
0.68
1.28
1.52
0.85
2.67
4.26
1.5
1.79
1.46
0.48
1.73
0.26
29.08
3.87
1.19
0.39
2.16
0.23
0.6
0.47
0.51
0.43
1.74
0.52
0.53
0.24
0.74
0.92
0.4
0.33
2.21
0.45
0.23
0.28
0.72
1.67
0.28
0.62
0.79
0.34
1.43
0.37
0.68
1.69
1.17
1.38
0.69
0.59
0.35
0.48
0.73
0.53
0.6
0.32
0.68
0.7
0.74
0.64
0.68
1
0.38
0.76
0.8
0.82
1.43
1.32
0.54
0.68
0.77
0.62
0.56
1.04
2.24
2.55
4.01
0.65
3.03
0.38
0.79
0.7
0.4
1.33
1.4
1.55
0.69
0.56
0.5
1.71
0.43
0.64
1.8
2
0.74
1.32
1.57
0.78
0.75
1.49
1.69
0.5
1
1.68
0.67
0.64
0.2
4.62
0.54
0.41
0.61
0.82
0.81
0.95
2.77
1.73
1.83
2.1
0.77
1.17
1.34
0.58
0.59
0.13
0.71
0.83
12.19
3.19
1.74
1.7
1.63
1.69
0.64
0.62
0.84
1.86
0.28
1.47
1.81
1.59
0.79
1.96
2.09
1.29
1.27
1.33
1.4
1.37
0.46
1.52
1.74
1.66
1.72
2.03
0.75
2.56
0.3
0.46
0.15
0.29
0.04
0.21
0.19
0.39
0.68
2.06
0.22
0.37
0.4
0.88
0.36
0.5
0.33
0.49
0.94
1.94
0.71
0.31
0.32
0.45
0.89
0.08
0.4
0.63
0.78
5.03
0.26
0.63
0.93
0.23
0.34
0.62
0.96
0.48
2.18
2.04
0.56
0.68
0.47
0.59
0.4
0.56
0.82
0.6
0.49
0.58
0.62
0.61
1.24
0.25
0.46
0.42
0.63
0.67
0.68
0.47
0.45
0.48
0.65
2.04
0.66
0.66
0.81
0.94
0.62
2.53
0.65
1.93
0.32
0.5
5.92
0.09
0.72
0.78
1.32
0.76
0.96
1.71
0.55
0.62
0.85
0.77
1.33
0.71
0.83
0.52
0.34
0.72
0.74
0.91
1.53
1.43
0.42
0.72
0.49
0.38
0.59
0.36
0.72
0.84
0.91
0.53
1.29
1.65
1.58
0.13
0.86
1.95
0.68
0.66
0.73
0.81
1.14
1.46
0.81
0.66
0.68
1.37
1.27
0.77
1.78
0.78
0.61
0.89
0.79
0.55
0.82
0.88
0.6
1.41
1.53
1.28
0.73
1.67
1.41
0.58
0.86
0.87
1.6
0.75
1.75
0.7
0.81
0.64
1.47
1.41
1.38
0.74
0.77
0.75
0.85
0.82
0.82
0.36
1.29
1.63
2.63
0.8
0.82
0.66
0.81
0.91
0.86
1.25
1.99
0.43
0.69
1.46
1.34
0.9
0.8
0.83
0.64
1.29
1.24
1.41
1.58
1.25
1.51
1.23
1.19
0.8
1.24
1.24
0.39
0.84
0.7
1.42
1.11
1.18
1.26
0.77
0.54
0.36
0.84
1.14
0.81
0.68
0.64
2.14
2
1.78
1.52
1.39
3
1.72
1.58
1.16
1.23
0.72
1.32
1.47
1.23
9.64
The biomarkers were then used to create a statistical model to identify subjects having kidney cancer. Using Random Forest analysis, the biomarkers were used in a mathematical model to classify subjects as having kidney cancer or normal. The results of the Random Forest analysis show that the samples were classified with 93% prediction accuracy. The Confusion Matrix presented in Table 12 shows the number of samples predicted for each classification and the actual in each group (RCC or Normal). The “Out-of-Bag” (OOB) Error rate gives an estimate of how accurately new observations can be predicted using the Random Forest model (e.g., whether a sample is from a RCC subject or a normal subject). The OOB error was approximately 7%, and the model estimated that, when used on a new set of subjects, the identity of RCC subjects could be predicted 93% of the time and normal subjects could be predicted correctly 94% of the time.
Based on the OOB Error rate of 7%, the Random Forest model that was created predicted whether a sample was from an individual with RCC with about 93% accuracy based on the levels of the biomarkers measured in samples from the subject. Exemplary biomarkers for distinguishing the groups are methyl-4-hydroxybenzoate, catechol-sulfate, glycerol, 2-hydroxyhippurate (salicylurate), N(2)-furoyl-glycine, 3-hydroxyphenylacetate, gulono 1,4-lactone, 2-isopropylmalate, 2-3-dihydroxyisovalerate, 1-2-propanediol, gluconate, cinnamoylglycine, phenylacetylglycine, sorbose, sucrose, adenosine 5′-monophosphate (AMP), hexanoylglycine, methyl-indole-3-acetate, 3-hydroxyhippurate, N6-methyladenosine, 4-hydroxy-2-oxoglutaric acid, alpha-CEHC-glucuronide, phenylpropinylglycine, vanillate, ethanolamine, galactose, adipate, 2-oxindole-3-acetate, 1, 3-7-trimethylurate, and 3-4-dihydroxyphenylacetate.
The Random Forest results demonstrated that by using the biomarkers, RCC subjects were distinguished from normal subjects with 94% sensitivity, 93% specificity, 88% PPV, and 97% NPV.
The biomarkers were used to create a statistical model to distinguish subjects having kidney cancer from those having prostate cancer. The biomarkers were evaluated using Random Forest analysis to classify subjects as having RCC or PCA. The Random Forest results show that the samples were classified with 80% prediction accuracy. The Confusion Matrix presented in Table 15 shows the number of samples predicted for each classification and the actual in each group (RCC or PCA). The “Out-of-Bag” (OOB) Error rate gives an estimate of how accurately new observations can be predicted using the Random Forest model (e.g., whether a sample is from a RCC subject or a PCA subject). The OOB error was approximately 20%, and the model estimated that, when used on a new set of subjects, the identity of RCC subjects could be predicted 77% of the time and PCA subjects could be predicted correctly 83% of the time and as presented in Table 13.
Based on the OOB Error rate of 20%, the Random Forest model that was created predicted whether a sample was from an individual with RCC with about 80% accuracy based on the levels of the biomarkers measured in samples from the subject. The biomarkers that are the most important biomarkers for distinguishing the groups are gluconate, 1-2-propanediol, galactose, gulono 1,4-lactone, orotidine, quinate, 1, 3-7-trimethylurate, guanine, phenylacetylglutamine, mannitol, 2-oxindole-3-acetate, 1,3-aminopropyl-2-pyrrolidone, 1,3-dimethylurate, Isobar-glucuronate-galacturonate-5-keto-gluconate, glycocholate, azelate (nonanedioate), N-acetylthreonine, 7-ketodeoxycholate, 3-sialyllactose, isovalerylcarnitine, cholate, adenosine 5′-monophosphate (AMP), 2-3-butanediol, 2-hydroxyhippurate, pipecolate, N-acetylphenylalanine, 12-dehydrocholate, alpha-ketoglutarate, sulforaphane.
The Random Forest results demonstrated that by using the biomarkers, RCC subjects were distinguished from PCA subjects with 77% sensitivity, 83% specificity, 79% PPV, 81% NPV.
The biomarkers were used to create a statistical model to classify subjects as having kidney cancer from those having bladder cancer. The biomarkers were evaluated using Random Forest analysis to classify subjects as having RCC or BCA. The Random Forest results show that the samples were classified with 75% prediction accuracy. The Confusion Matrix presented in Table 14 shows the number of samples predicted for each classification and the actual in each group (RCC or BCA). The “Out-of-Bag” (OOB) Error rate gives an estimate of how accurately new observations can be predicted using the Random Forest model (e.g., whether a sample is from a RCC subject or a BCA subject). The OOB error was approximately 25%, and the model estimated that, when used on a new set of subjects, the identity of RCC subjects could be predicted 76% of the time and BCA subjects could be predicted correctly 73% of the time and as presented in Table 14.
Based on the OOB Error rate of 25%, the Random Forest model that was created predicted whether a sample was from an individual with RCC with about 75% accuracy based on the levels of the biomarkers measured in samples from the subject. The biomarkers that are the most important biomarkers for distinguishing the groups are 3-indoxyl-sulfate, methyl-indole-3-acetate, methyl-4-hydroxybenzoate, lactate, N(2)-furoyl-glycine, N6-methyladenosine, gamma-CEHC, glycerol, 2-3-butanediol, palmitoyl-sphingomyelin, succinate, 4-hydroxyphenylacetate, caffeate, imidazole-prpionate, beta-alanine, 4-androsten-3beta-17beta-diol-disulfate-2,5-methylthioadenosine, (MTA), N2-acetyllysine, sucrose, phenylacetylglycine, 4-androsten-3beta-17beta-diol-disulfate-1, cyclo-gly-pro, N-methyl-proline, catechol-sulfate, serine, vanillate, threonine, 21-hydroxypregnenolone-disulfate, adenosine 5′-monophosphate (AMP), phenylacetylglutamine.
The Random Forest results demonstrated that by using the biomarkers, RCC subjects were distinguished from BCA subjects with 73% sensitivity, 78% specificity, 69% PPV, and 79% NPV.
Using the biomarkers for kidney cancer, an algorithm could be developed to monitor kidney cancer progression/regression in subjects. The algorithm, based on a panel of metabolite biomarkers from Tables 1, 2, 4, 8, 10 and/or 11, when used on a new set of patients, would assess and monitor a patient's progression/regression of kidney cancer. Using the results of this biomarker algorithm, a medical oncologist could assess the risk-benefit of surgery (i.e., full or partial nephrectomy), drug treatment or a watchful waiting approach.
The biomarker algorithm would monitor the levels of a panel of biomarkers for kidney cancer identified in Tables 1, 2, 4, 8, 10 and/or 11.
This application claims the benefit of U.S. Provisional Patent Application No. 61/568,690, filed Dec. 9, 2011, and U.S. Provisional Patent Application No. 61/677,771, filed Jul. 31, 2012, the entire contents of which are hereby incorporated herein by reference.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US12/68506 | 12/7/2012 | WO | 00 | 6/5/2014 |
Number | Date | Country | |
---|---|---|---|
61568690 | Dec 2011 | US | |
61677771 | Jul 2012 | US |