1. Technical Field
This document relates to methods and materials involved in assessing lung cancer (e.g., small cell lung cancer). For example, this document provides methods and materials for determining whether or not a mammal having lung cancer (e.g., small cell lung cancer) is susceptible to a poor outcome.
2. Background Information
Small cell lung cancer (SCLC) is the most aggressive cell type among lung cancer subtypes with a median survival following diagnosis being estimated to be 8-20 months. Virtually all patients with SCLC are treated with chemotherapy and/or radiation therapy. Platinum-containing compounds (e.g., cisplatin and carboplatin) are most commonly used, and patients' survivals vary substantially.
This document provides methods and materials involved in assessing lung cancer (e.g., SCLC). For example, this document provides methods and materials for identifying a mammal having lung cancer (e.g., SCLC) as being susceptible to a poor outcome. As described herein, the presence of one or more genetic variations in the glutathione synthetase (GSS) gene, one or more genetic variations in the ATP-binding cassette, sub-family C, member 2 (ABCC2) gene, or one or more genetic variations in the X-ray repair cross-complementing protein 1 (XRCC1) gene can indicate that a person with lung cancer (e.g., SCLC) has an increased susceptible to a poor outcome (e.g., death within one, two, three, or four years). In some cases, an allele having a genetic variation in the GSS, ABCC2, or XRCC1 gene that is associated to an increased risk of death from lung cancer can be referred to as a risk allele, and the presence of multiple risk alleles (e.g., 2, 3, 4, or 5 risk alleles) can indicate that the person with lung cancer (e.g., SCLC) has an increased susceptible to a poor outcome (e.g., death within one, two, three, or four years) as compared to a person with lung cancer (e.g., SCLC) having only one risk allele. Identifying lung cancer patients who have a poor prognosis can allow such patients to be offered more aggressive therapy earlier.
In general, one aspect of this document features a method for assessing lung cancer. The method comprises, or consists essentially of, (a) detecting the presence of a genetic variation in ABCC2 nucleic acid, (b) detecting the presence of a genetic variation in GSS or XRCC1 nucleic acid, and (c) classifying the mammal as being susceptible to a poor lung cancer outcome based at least in part on the presence of the genetic variation in ABCC2 nucleic acid and the presence of the genetic variation in GSS or XRCC1 nucleic acid. The mammal can be a human. The genetic variation in ABCC2 nucleic acid can be rs11597282. The method can comprise detecting the presence of a genetic variation in GSS nucleic acid. The genetic variation in GSS nucleic acid can be rs2025096, rs7265992, or rs6060127. The method can comprise detecting the presence of a genetic variation in XRCC1 nucleic acid. The genetic variation in XRCC1 nucleic acid can be rs2854510 or rs1001581. The poor lung cancer outcome can comprise death within two years of diagnosis of lung cancer. The poor lung cancer outcome can comprise death within four years of diagnosis of lung cancer. The lung cancer can be small cell lung cancer.
Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention pertains. Although methods and materials similar or equivalent to those described herein can be used to practice the invention, suitable methods and materials are described below. All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety. In case of conflict, the present specification, including definitions, will control. In addition, the materials, methods, and examples are illustrative only and not intended to be limiting.
The details of one or more embodiments of the invention are set forth in the accompanying drawings and the description below. Other features, objects, and advantages of the invention will be apparent from the description and drawings, and from the claims.
This document provides methods and materials related to assessing lung cancer in mammals. For example, this document provides methods and materials for identifying lung cancer patients as having a high level of susceptibility to a poor lung cancer outcome by determining whether or not the patient contains one or more genetic variations in the GSS gene, one or more genetic variations in the ABCC2 gene, and/or one or more genetic variations in the XRCC1 gene. As described herein, the presence of one or more genetic variations in the GSS gene, one or more genetic variations in the ABCC2 gene, and/or one or more genetic variations in the XRCC1 gene can indicate that the lung cancer (e.g., SCLC) patient has an increased susceptible to a poor outcome (e.g., death within one, two, three, or four years). In some cases, the presence of multiple risk alleles (e.g., 2, 3, 4, or 5 risk alleles) can indicate that the lung cancer (e.g., SCLC) patient has an increased susceptible to a poor outcome (e.g., death within one, two, three, or four years) as compared to a lung cancer (e.g., SCLC) patient having only one GSS, ABCC2, or XRCC1 risk allele.
An example of a human GSS nucleic acid can have the sequence set forth in GenBank® GI No. 30581166. Human ABCC2 nucleic acid can have the sequence set forth in GenBank® GI No. 188595701. Human XRCC1 nucleic acid can have the sequence set forth in GenBank® GI No. 190684674. Examples of genetic variations of a GSS gene that are associated with an increased risk of death from lung cancer include, without limitation, rs7265992 and rs6060127. The presence of any one or more of these genetic variations (minor alleles) for a GSS allele can indicate that that GSS allele is a risk allele. An example of a genetic variation of an ABCC2 gene that is associated with an increased risk of death from lung cancer includes, without limitation, rs11597282. The presence of this genetic variation (minor allele) for an ABCC2 allele can indicate that that ABCC2 allele is a risk allele. Examples of genetic variations of an XRCC1 gene that are associated with an increased risk of death from lung cancer include, without limitation, rs2854510 and rs1001581. The presence of any one or more of these genetic variations (minor alleles) for an XRCC1 allele can indicate that that XRCC1 allele is a risk allele.
In some case, the presence of minor alleles in rs2025096, rs2236270, and/or rs2273684 (GSS; A, A, and C, respectively) can indicate that the patient has a reduced risk of death from lung cancer.
The presence or absence of a genetic variation in GSS, ABCC2, or XRCC1 nucleic acid can be determined using any appropriate technique. For example, nucleic acid sequencing techniques, PCR-based techniques, and nucleic acid-based mutation detection techniques can be performed to detect the presence or absence of a genetic variation in GSS, ABCC2, or XRCC1 nucleic acid.
This document also provides methods and materials to assist medical or research professionals in identifying a mammal as being susceptible to a favorable or poor lung cancer outcome. Medical professionals can be, for example, doctors, nurses, medical laboratory technologists, and pharmacists. Research professionals can be, for example, principle investigators, research technicians, postdoctoral trainees, and graduate students. A professional can be assisted by (1) determining whether or not a patient contains one or more genetic variations in the GSS gene, one or more genetic variations in the ABCC2 gene, and/or one or more genetic variations in the XRCC1 gene and (3) communicating information about that patient's GSS, ABCC2, and/or XRCC1 genes to that professional. In some cases, a professional can be assisted by (1) determining the number of GSS, ABCC2, and/or XRCC1 risk alleles of a patient and (3) communicating information about that patient's number of GSS, ABCC2, and/or XRCC1 risk alleles to that professional.
Any method can be used to communicate information to another person (e.g., a professional). For example, information can be given directly or indirectly to a professional. In addition, any type of communication can be used to communicate the information. For example, mail, e-mail, telephone, and face-to-face interactions can be used. The information also can be communicated to a professional by making that information electronically available to the professional. For example, the information can be communicated to a professional by placing the information on a computer database such that the professional can access the information. In addition, the information can be communicated to a hospital, clinic, or research facility serving as an agent for the professional.
The invention will be further described in the following examples, which do not limit the scope of the invention described in the claims.
The participants in this example were diagnosed SCLC lung cancer patients who consented to the research protocol as approved by an institutional review board. Detailed descriptions of identification, enrollment, blood collection, and follow-up were described elsewhere (Sun et al., J. Thoracic Cardiovascular Surgery, 131:1014-1020 (2006) and Yang et al., Chest, 128:452-462 (2005)). Briefly, each case was identified through the Mayo Clinic pathologic diagnostic system, their medical records were abstracted, and blood samples were collected. Tumor stage was defined as limited stage (involving one lung or with lymph node involvement on the same side of the chest) and extensive stage where cancer has spread to the other lung, to lymph nodes on the other side of the chest, or to distant organs. Smoking information was obtained from medical record abstraction, questionnaires, and/or patient interviews. Pack-years were calculated by multiplying the number of packs per day by the total years of smoking Never smokers were defined as having had smoked fewer than 100 cigarettes during their lifetime, and former smokers were defined as having had at least six months of smoking abstinence at the time of diagnosis. Vital status and cause of death were determined by reviewing the Mayo Clinic registration database and medical records, correspondence from patients' next-of-kin, death certificates, obituary documents, the Mayo Clinic Tumor Registry, and the Social Security Death Index website. Additional patient information was collected with a mail-in questionnaire sent to participants or to their next-of-kin annually until patient's death.
Genes in the glutathione pathway were included as described elsewhere (Yang et al., J. Clin. Oncol., 24:1761-1769 (2006)). Twenty-nine genes, including isozymes and membrane bound transporter proteins, were selected. An additional 20 genes were selected from the DNA repair pathway following a review of the literature that reported an association with treatment response or survival in lung or other cancers (Table 1). Tag-SNPs on these genes were selected based on HapMap data (Release 22/Phase II on NCBI B36) by Haploview, Version 3 (http://www.broad.mit.edu/mpg/haploview/), using the Caucasian (CEU) data available from HapMap (http://www.hapmap.org/). Tag-SNP selection parameters ignored pairwise comparisons of markers greater than 500 kb apart; excluded individuals with greater than 50% missing genotypes; excluded SNPs with Hardy-Weinberg p-values of less than 0.001, SNPs with fewer than 75% genotype calls, SNPs with more than one Mendelian error, and SNPs with a minor allele frequency less than 0.001; performed aggressive tagging using a r2 threshold of 0.8, and included a LOD threshold for multi-marker tests of three. The genes, genotyped SNPs, and the SNPs in final analysis after quality assessment are presented in Table 1.
Four hundred and nineteen tag-SNPs (267 from the glutathione and 152 from DNA repair pathways) were genotyped in the Mayo Clinic Genomic Shared Resources using a custom-designed Illumina GoldenGate panel. Intensity data were imported into BeadStudio software for clustering and review. All samples were successfully genotyped, with an average call rate of 99.1 percent. For the SNPs, 95.2% (399/419) were successfully genotyped (call rate >95 percent), with an average call rate of 99.5 percent. Concordance between the genomic control DNA samples was 100 percent. SNPs with a minor allele frequency of less than 0.01 (n=11) or were not in the Hardy-Weinberg equilibrium (n=5) or were monomorphic (n=8) in this study population were excluded, resulting in 375 SNPs in the analyses.
Descriptive analysis for study patients: Clinical characteristics of the 248 SCLC patients were first summarized by vital status, and then assessed on their association with survival. Kaplan-Meier curves were obtained for each covariate, and a stepwise selection process using Cox proportional hazards regression was used to identify adjustment variables for the genetic analyses.
Single-SNP Assessments: For each SNP, a Cox regression model was used to assess the associations with survival following lung cancer diagnosis, both before and after adjusting for the five covariates identified above. The primary analysis tested the significance of the association between survival and the ordinal count of the number of minor alleles (0, 1, or 2) carried by each individual. Secondary assessments compared heterozygotes and rare allele homozygotes to the common allele homozygotes to further assess the potential genetic mode of action. From both analyses, hazard ratios (HR), 95% confidence intervals (CI), p-values, and q-values that assessed the probability that a p-value might be false positive were extracted (Storey, J. Royal Statisical Society (B), 64(Part 3):479-498 (2002) and Storey and Tibshirani, PNAS, 100:9440-9445 (2003)).
Whole-Gene Principal Components Analysis: In order to assess whether different analytical approaches resulted in consistent findings, a principal components analysis (PCA) was performed on the candidate genes. Minor allele count variables were used to identify the principal components that captured 95% of the variability in the SNPs for each gene. For the few instances where genotyping data had missing values, the mean genotype value was imputed. Principal components to perform an omnibus test of significance for the association between each of the genes in the glutathione and DNA repair pathways and survival in multivariable Cox proportional hazards regression models were identified. P-values for the global tests were obtained, along with simple summaries of the outcome of the PCA.
Haplotype Analysis: In order to further evaluate genes whose SNPs displayed evidence for association with survival, the associations between haplotypes in selected genes and survival were tested using tools implemented (http://cran.rproject.org/web/packages/haplo.stats/index.html) in the Haplo.Stats package; Schaid et al., Am. J. Human Genetics, 70(2):425-434 (2002)). A Cox proportional hazards regression model was used to test simultaneously the significance of the covariates representing the expected number of each of the candidate haplotypes. Global tests of significance were obtained while adjusting for the same covariates as in the single-SNP analyses. Following the omnibus tests of significance, analyses were performed for each of the haplotypes, estimated HRs, and 95% CIs, as with the single-SNP analyses. All analyses were carried out using SAS (SAS Institute, Inc., Cary, N.C.) and S-Plus (Insightful Corp., Seattle, Wash.) software systems.
Patient characteristics and clinical prognostic factors for survival are provided in Table 2. Among the 248 genotyped patients, the median follow-up time was 17 months. 64 (26%) were still alive at the closure of this study. A stepwise model selection identified five variables that were potential confounders, i.e., age, sex, pack-years, treatment modality, and stage. They were adjusted in the final model. Table 3 presents summaries of the hazard ratios, 95% confidence intervals, and p-values for these five covariates.
1ETS is defined as environmental tobacco exposure.
2Chi-square tests were used for categorical variables, and t-tests were used for continuous variables.
Single SNP Analysis Results: Of the 375 SNPs that were assessed for association with survival, 21 (11 genes) had p-values for trend test of less than 0.05, after adjusting for five covariates. Fifteen of the SNPs were from the glutathione pathway and six were from the DNA repair pathway (Table 4). The top three SNPs (rs11597282, rs2025096, and rs7265992) had q-values less than 0.25, suggesting a 1:3 odds of being false positive results. Two of the SNPs were in the GSS gene (rs2025096 and rs7265992), and one was in ABCC2 (rs11597282).
1The Q value estimated the false discovery rate for the companion p-value.
2The hazard ratio (HR) was obtained by trend test for each SNP where the ordinal count of the number of minor alleles was used in the Cox model after adjusting for age, gender, tumor stage, treatment, and pack-years of smoking.
Gene-based Analysis Results: Using whole-gene principal components analyses, 3 of the 49 genes were significantly associated with survival and had adjusted p-values of less than 0.05. These genes were the same ones that harbored SNPs with small p-values: GSS, ABCC2, and XRCC1, with adjusted p-values of 0.002, 0.04, and 0.03, respectively.
Haplotype Analyses: Of the three genes significant via principal components analyses, two were significantly associated with survival in the haplotype analyses: ABCC2 (p=0.002) and XRCC1 (0.015). The third, GSS, had a global p-value of 0.095. Four haplotypes in ABCC2 were associated with a lower risk of death (Table 5). For example, the GGGGACGCGGA (SEQ ID NO:1) haplotype was associated with a nearly five-fold lower risk (HR: 0.21, 95% CI: 0.07-0.58), and the AGGGCAAAGGA (SEQ ID NO:2) haplotype was associated with a nearly three-fold lower risk (HR: 0.39, 95% CI: 0.18-0.83). One haplotype in XRCC1, GAACG, was associated with a greater than five-fold risk of death (HR: 5.65, 95% CI: 2.52-12.69).
The results provided herein did not demonstrate any significant association for genes such as GSTP1, ERCC1, and ERCC2. The results provided herein demonstrate a significant effect of several tested genes, particularly GSS, ABCC2, and XRCC1. Glutathione synthetase (GSS) participates in the second step glutathione biosynthesis. Among the seven SNPs tested herein, five exhibited significant association with patient survival in single SNP analysis. For example, rs7265992 and rs6060127 were associated with increased risk of death, while rs2025096, rs2236270, and rs2273684 were associated with reduced risk (
ABCC2 (or MRP2) is a member of the multidrug resistance protein (MRP) family that performs a similar function of transporting glutathione conjugates across the cell membrane. Among the 11 tag-SNPs studied from the ABCC2 gene, rs11597282 (
XRCC1 is one of the most common genes studied in the DNA base excision repair pathway. Among the five SNPs in the SCLC data, rs100158 was correlated with improved survival, and rs2854510 was associated with decreased survival. Both SNPs are intronic and are more likely to be the surrogates of other functional variations in the same region with high linkage disequilibrium (
In summary, the results provided herein demonstrate that genetic variations in glutathione metabolic and DNA repair pathways are associated with survival among SCLC patients. Genetic variation of GSS, ABCC2, and XRCC1 were associated with overall survival of SCLC. The associations were significant not only in a single SNP test, but also at the whole gene level and haplotype analysis. The three appear to have an addictive or synergetic effect on treatment response and resistance through modulating the concentration of chemotherapy agents in the cells or their survivability after DNA damage (
It is to be understood that while the invention has been described in conjunction with the detailed description thereof, the foregoing description is intended to illustrate and not limit the scope of the invention, which is defined by the scope of the appended claims. Other aspects, advantages, and modifications are within the scope of the following claims.
This application claims the benefit of U.S. Provisional Application Ser. No. 61/482,020, filed May 3, 2011. The disclosure of the prior application is considered part of (and is incorporated by reference in) the disclosure of this application.
This invention was made with government support under grant numbers CA077118, CA080127, and CA084354 awarded by the National Institutes of Health. The government has certain rights in the invention.
| Number | Date | Country | |
|---|---|---|---|
| 61482020 | May 2011 | US |