A computer readable form of the Sequence Listing “10723-380.txt” (3283 bytes), submitted via EFS-WEB and created on Sep. 1, 2011 is herein incorporated by reference.
The application relates to lung cancer and particularly to methods, compositions and kits for classifying subjects with the adenocarcinoma (ADC) subtype or squamous cell carcinoma (SCC) subtype of non-small cell lung carcinoma (NSCLC) according to protein signatures.
Lung cancer is the most common cause of death from cancer for both men and women, with a current worldwide mortality rate in excess of one million per year. Non-small cell lung carcinoma (NSCLC) is histologically heterogeneous, with adenocarcinoma (ADC), squamous cell carcinoma (SCC), and large cell carcinoma (LC) being the major subtypes 1. Combined, these subtypes account for approximately 85% of lung cancers. In clinical practice, these subtypes have been treated similarly until recently, when new therapies (e.g. premetrexed) have shown differential responses in NSCLC subtypes2. However, despite improvements in surgical and chemotherapeutic treatments, and the development of drugs targeting the epidermal growth factor receptor (EGFR), which is a target in a subset of NSCLC, the 5-year survival rate associated with these cancers is poor, at approximately 15%. There is considerable variability in the molecular features between and within each of these NSCLC subtypes (e.g. EGFR expression level and mutational status), suggesting that additional stratification of tumors may facilitate more effective, tumor-specific treatments 3.
The analysis of EGFR and various keratins by methods with limited dynamic range such as immunohistochemistry (IHC) are common practices in oncologic pathology. EGFR levels by IHC have not proven to be predictive of response to EGFR-directed drugs, despite initial studies suggesting that patients whose tumors demonstrate low expression have low response rates 4.
The keratins are relatively abundant proteins (i.e. expressed at high level), and are the major structural component of the intermediate filament-based epithelial barrier in tissue 5. Keratin expression is stable during tumorigenesis, and the keratin expression pattern may signify tumor origins and types5. Indeed, since keratins exhibit characteristic expression patterns in human tumors, several of them (notably K5, K7, K8/K18, K19 and K20) have great importance in immunohistochemical tumor diagnosis of carcinomas, in particular of unclear metastases and in precise classification and subtyping5. However, it has been found that there is a limited differential expression of distinctive keratin filaments between squamous cell carcinomas and adenocarcinomas27. Apparently, squamous cell carcinomas that originate from columnar epithelium by squamous metaplasia gain the keratins of squamous cells but retain the keratins of columnar epithelial cells27.
While some keratins have been detected in blood and monitored as biomarkers (e.g. CYFRA 21-1 fragment of KRT19 6), only a subset of the 54 human keratin proteins have been developed into clinically useful diagnostic biomarkers to-date. There remains an unmet need to develop sensitive and more quantitative methods to identify and quantify comprehensive sets of diagnostic biomarkers including drug targets such as the EGFR and their associated signaling network components, and protein classes such as the keratins whose function is involved in the epithelial tissue and tumor phenotypes, and which may inform of tumor subtypes.
Mass spectrometry (MS) has emerged as a powerful technology for proteomic analysis of tumors, and represents a promising approach to stratify tumors according to their protein profiles, and for drug target and biomarker discovery 7. These methods have been extensively reviewed, and applied largely to study tumor-derived cell lines grown either in two-dimensional cultures or as xenograft tumors in immuno deficient mice. However, in either growth context, such established cell lines are mostly not representative of the more diversified or heterogeneous tumors in human cancers 8. Another issue associated with MS analysis of human-murine xenograft systems is the recognition and assignment of human versus murine proteins, which share a large degree of sequence homology. Methods to recognize and quantify human tumor proteomes, and to generate tissue models that faithfully retain or recapitulate their protein profiles are required.
These findings illustrate the potential to develop a comprehensive MS-based platform in oncologic pathology for better classification and potentially treatment of NSCLC patients.
Non-small cell lung carcinoma (NSCLC) accounts for approximately 80% of lung cancer. The most prevalent subtypes of NSCLC are adenocarcinoma (ADC) and squamous cell carcinoma (SCC), which combined account for approximately 90% of NSCLCs. Ten resected NSCLC patient tumors (5 ADC and 5 SCC) were directly introduced into severely immune deficient (NOD-SCID) mice, and the resulting xenograft tumors analyzed by standard histology and immunohistochemistry (IHC), and by proteomics profiling. Mass spectrometry (MS) methods involving 1- and 2-dimensional LC-MS/MS, and multiplexed selective reaction monitoring (SRM, or MRM) were applied to identify and quantify the xenograft proteomes. Hierarchical clustering of protein profiles distinguished between the ADC and SCC subtypes. As an example, the differential expression of 178 proteins, including a comprehensive panel of intermediate filament keratin proteins was found to constitute a distinctive proteomic signature associated with the NSCLC subtypes and subsets of proteins were found to be highly expressed in ADC or SCC. Epidermal growth factor receptor (EGFR) was expressed in ADC and SCC xenografts, and EGFR network activation was assessed by phosphotyrosine profiling by western blot analysis and SRM measurement of EGFR levels, and mutation analysis. A multiplexed SRM/MRM method provided relative quantification of several keratin proteins, EGFR and plakophilin-1 in single LC-MS/MS runs. Protein quantifications by SRM and MS/MS spectral counting were consistent with, and validated by orthogonal methods including IHC and Western immunoblotting.
Accordingly, an aspect includes a method of screening for, diagnosing or detecting non-small cell lung carcinoma or an increased likelihood of developing non-small cell lung carcinoma in a subject. The method comprises:
Another aspect includes a method of screening for, diagnosing or detecting non-small cell lung carcinoma or an increased likelihood of developing non-small cell lung carcinoma in a subject comprising:
A further aspect includes a method of differentiating between non-small cell lung carcinoma of the adenocarcinoma subtype and non-small cell lung carcinoma of the squamous cell carcinoma subtype in a subject, or detecting an increased likelihood of developing non-small cell lung carcinoma of the adenocarcinoma subtype or non-small cell lung carcinoma of the squamous cell carcinoma subtype in a subject comprising:
Another aspect includes A method of screening for, diagnosing or detecting non-small cell lung carcinoma of adenocarcinoma subtype or an increased likelihood of developing non-small cell lung carcinoma of the adenocarcinoma subtype in a subject comprising:
Furthermore, an aspect includes a method of screening for, diagnosing or detecting non-small cell lung carcinoma of squamous cell carcinoma subtype or an increased likelihood of developing non-small cell lung carcinoma of squamous cell carcinoma subtype in a subject comprising:
Additionally, another aspect includes a SRM/MRM method for quantifying a level of at least one biomarker associated with non-small cell lung carcinoma in a sample, the method comprising the steps of:
Moreover, a further aspect includes A kit for measuring a level of at least one biomarker associated with non-small cell lung cancer or a subtype thereof in a sample, the at least one biomarker selected from the biomarkers set out in Table 2, 4A, 4B, 6 and/or 7, comprising:
Other features and advantages of the present disclosure will become apparent from the following detailed description. It should be understood, however, that the detailed description and the specific examples while indicating preferred embodiments of the disclosure are given by way of illustration only, since various changes and modifications within the spirit and scope of the disclosure will become apparent to those skilled in the art from this detailed description.
An embodiment of the disclosure will now be described in relation to the drawings in which:
Table 1 displays NSCLC tumor xenograft histopathology and molecular features.
Table 2 lists highly differentially expressed proteins in ADC and SCC xenografts
Table 3 displays LC-MS/MS protein profiling.
Table 4 lists proteins highly differentially expressed in NSCLC.
Table 5 lists keratin signatures in NSCLC.
Table 6 lists transitions measured by multiplexed SRM/MDM.
Table 7 lists a set of biomarkers of the disclosure.
The term “difference in the level” as used herein refers to an increase or decrease in the level, or quantity, of a biomarker associated with non-small cell lung carcinoma or a subtype thereof, in a test sample that is measurable, compared to a suitable control and/or reference. For example the difference can be a difference in the steady-state level of a gene transcript, including for example a difference resulting from a difference in the level of transcription and/or translation and/or degradation. The difference in the level is optionally a level statistically associated with a particular group or outcome, for example, a group having non-small cell lung carcinoma or not having non-small cell lung carcinoma. The difference in the level can refer to an increase or decrease in a measurable polypeptide, or fragment thereof, level of a given biomarker as measured by the amount of steady state level of and/or expressed polypeptide or fragment thereof in a test sample as compared with the measurable expression level of a given biomarker or fragment thereof in a control, population of control samples and/or previously taken or reference sample. In another example, the difference in the level can refer to an increase or decrease in the measurable polynucleotide (e.g. nucleic acid transcript) level of a given biomarker as measured by the amount of transcript e.g. biomarker mRNA or cDNA. For example, in methods relating to screening for, diagnosing or detecting non-small cell lung carcinoma, a difference in the level can refer to an increase in the level of a biomarker compared to a suitable control, wherein the control for example corresponds to a biomarker level in a subject without non-small cell lung carcinoma. In methods relating to monitoring therapeutic response, a difference in the level can refer to a decrease or increase in the level of the biomarker in the subsequent sample compared to a reference sample, wherein depending on the particular biomarker an increase is indicative of negative therapeutic response and/or a decrease is indicative of a positive therapeutic response. For example, a difference in a level of biomarker level is detected if a ratio of the level in a test sample as compared with a control is greater than or less than 1.0 and/or if the ratio of the level in a reference sample as compared with a subsequent sample is greater than or less than 1.0. For example, the ratio can be greater than 1.0, 1.2, 1.5, 1.7, 2, 3, 3, 5, 10, 12, 15, 20 or more, or less than 1, 0.8, 0.6, 0.4, 0.2, 0.1, 0.05, 0.001 or less. The difference in the level when compared to a population average can for example be expressed using p-value. For instance, when using p-value, a biomarker is identified as having a difference in level between a first and second population when the p-value is less than 0.1, such as less than 0.05, 0.01, 0.005, and/or less than 0.001.
The term “biomarker associated with non-small cell lung cancer” as used herein refers to a gene listed in Tables 2, 4A, 4B, 6 and/or 7 or an expression product (e.g. polypeptide or nucleic acid transcript) of such a gene or a fragment thereof such as a peptide, e.g. generated by tryptic digest that is associated with, and an indicator of, pathogenic processes relating to non-small cell lung carcinoma or a subtype thereof. For example, the biomarker can refer to a gene product, such as a polypeptide or fragment thereof, that is differentially detectable for example differentially expressed, in subjects with non-small cell lung carcinoma or a subtype thereof as compared to subjects without non-small cell lung carcinoma or the particular subtype. Similarly, the term “biomarker associated adenocarcinoma” as used herein refers to a gene set out in Tables 2, 4a, and 7 or expression product (e.g. polypeptide or nucleic acid transcript) of such a gene or a fragment that is associated with non-small cell lung cancer of the adenocarcinoma subtype; and, the term “biomarker associated squamous cell carcinoma” as used herein refers to a gene set out in Tables 2, 4B, and 7 or expression product (e.g. polypeptide or nucleic acid transcript) of such a gene or a fragment that is associated with non-small cell lung cancer of the squamous cell carcinoma subtype. The “biomarkers of the disclosure” refer to the biomarkers as set out in Tables 2, 4A, 4B, 6 and/or 7.
The phrase “biomarker polypeptide”, “polypeptide biomarker” or “polypeptide product of a biomarker” refers to a proteinaceous biomarker gene product or fragment thereof. For example, a biomarker polypeptide refers to a Table 2, 4A, 4B, 6 and/or 7 polypeptide biomarker or fragment thereof that is for example, increased in samples from subjects with non-small cell lung carcinoma or a subtype thereof.
The term “biomarker fragment” refers to a polypeptide or polynucleotide that is, in terms of amino acids or nucleotides, less in number than the full length biomarker. For example, a fragment can be at least 7, 10, 20, 30 of any number in between or a corresponding number of nucleotides.
The term “control” as used herein refers to a sample, and/or a biomarker level, numerical value and/or range (e.g. control range) corresponding to the biomarker level in such a sample, taken from or associated with a subject or a population of subjects (e.g. control subjects) who are known as not having non-small cell lung carcinoma or who are known as not having a particular subtype of non-small cell lung carcinoma. For example, in methods for determining if a subject has non-small cell lung carcinoma of the adenocarcinoma subtype, the control can be a sample, and/or a biomarker level, numerical value and/or range (e.g. control range) corresponding to the biomarker level in such a sample, taken from or associated with a subject or a population of subjects (e.g. control subjects) who are known as not having non-small cell lung carcinoma or as who are known as having non-small cell lung carcinoma of the squamous cell carcinoma subtype. Also, for example in methods for determining if a subject has non-small cell lung carcinoma of the squamous cell carcinoma subtype, the control as used herein can be a sample, and/or a biomarker level, numerical value and/or range (e.g. control range) corresponding to the biomarker level in such a sample, taken from or associated with a subject or a population of subjects (e.g. control subjects) who are known as not having non-small cell lung carcinoma or as having non-small cell lung carcinoma of the adenocarcinoma subtype.
Where the control is a numerical value or range, the numerical value or range is a predetermined value or range that corresponds to a level of the biomarker or range of levels of the biomarker in a group of subjects known as not having non-small cell lung carcinoma or subtype thereof (e.g. threshold or cutoff level; or control range). For example, the control can be a cut-off or threshold level, above or below which (depending on the biomarker and subtype) which a subject is identified as having non-small cell lung cancer or a particular subtype thereof. For example, a test subject that has an increased level of a biomarker above a cut-off or threshold level is indicated to have or is more likely to have non-small cell lung carcinoma of a particular subtype.
The term “positive control” as used herein refers to a sample and/or biomarker level or numerical value corresponding to the biomarker level in a sample from a subject or a population of subjects (e.g. positive control subjects) who are known as having non-small cell lung carcinoma, for example a particular subtype of non-small cell carcinoma. For example, in methods for determining if a subject has non-small cell lung carcinoma of the adenocarcinoma subtype, the “positive control” can be a sample and/or biomarker level or numerical value corresponding to the biomarker level in a sample from a subject or a population of subjects (e.g. positive control subjects) who are known as having non-small cell lung carcinoma of the adenocarcinoma subtype. Similarly, in methods for determining if a subject has non-small cell lung carcinoma of the squamous cell carcinoma subtype, the “positive control” can be a sample and/or biomarker level or numerical value corresponding to the biomarker level in a sample from a subject or a population of subjects (e.g. positive control subjects) who are known as having non-small cell lung carcinoma of the squamous cell carcinoma subtype.
The term “similar” in the context of a biomarker level as used herein refers to a subject biomarker level that falls within the range of levels associated with a particular class for example associated with non-small cell lung cancer of adenocarcinoma subtype or associated with non-small cell lung cancer of squamous cell carcinoma subtype. Accordingly, “detecting a similarity” refers to detecting a biomarker level that fall within the range of levels associated with a particular class. In the context of a reference profile, “similar” refers to a reference profile associated with a non-small cell lung cancer subtype such as adenocarcinoma subtype or squamous cell carcinoma subtype that shows a number of identities and/or degree of changes with the subject expression profile.
The term “most similar” in the context of a reference profile refers to a reference profile that is associated with a non-small cell lung cancer subtype such as adenocarcinoma subtype or squamous cell carcinoma subtype that shows the greatest number of identities and/or degree of changes with the subject expression profile.
The term “expression profile” as used herein refers to, for a plurality of biomarkers that are associated with non-small cell lung carcinoma or a subtype thereof, biomarker steady state and/or transcript expression levels in a sample from a subject that is for example, useful for diagnosing non-small cell lung cancer, for example of the adenocarcinoma or squamous cell carcinoma cell type. For example, an expression profile can comprise the quantitated relative levels of at least 2 or more biomarkers listed in Table 2, 4A, 4B, 6 and/or 7, and the levels or pattern of biomarker expression can be compared to one or more reference profiles, for example a reference profile associated with non-small cell lung carcinoma such as non-small cell lung carcinoma of the adenocarcinoma subtype or non-small cell lung carcinoma of the squamous cell carcinoma subtype. An expression profile can for example be detected by microarray analysis, RT-PCR and/or methods that measure a biomarker expression product such as flow cytometry and Western blot.
The term “sequence identity” as used herein refers to the percentage of sequence identity between two or more polypeptide sequences or two or more nucleic acid sequences that have identity or a percent identity for example about 70% identity, 80% identity, 90% identity, 95% identity, 98% identity, 99% identity or higher identity or a specified region. To determine the percent identity of two or more amino acid sequences or of two or more nucleic acid sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in the sequence of a first amino acid or nucleic acid sequence for optimal alignment with a second amino acid or nucleic acid sequence). The amino acid residues or nucleotides at corresponding amino acid positions or nucleotide positions are then compared. When a position in the first sequence is occupied by the same amino acid residue or nucleotide as the corresponding position in the second sequence, then the molecules are identical at that position. The percent identity between the two sequences is a function of the number of identical positions shared by the sequences (i.e., % identity=number of identical overlapping positions/total number of positions.times.100%). In one embodiment, the two sequences are the same length. The determination of percent identity between two sequences can also be accomplished using a mathematical algorithm. A preferred, non-limiting example of a mathematical algorithm utilized for the comparison of two sequences is the algorithm of Karlin and Altschul, 1990, Proc. Natl. Acad. Sci. U.S.A. 87:2264-2268, modified as in Karlin and Altschul, 1993, Proc. Natl. Acad. Sci. U.S.A. 90:5873-5877. Such an algorithm is incorporated into the NBLAST and XBLAST programs of Altschul et al., 1990, J. Mol. Biol. 215:403. BLAST nucleotide searches can be performed with the NBLAST nucleotide program parameters set, e.g., for score=100, wordlength=12 to obtain nucleotide sequences homologous to a nucleic acid molecules of the present application. BLAST protein searches can be performed with the XBLAST program parameters set, e.g., to score-50, wordlength=3 to obtain amino acid sequences homologous to a protein molecule of the present invention. To obtain gapped alignments for comparison purposes, Gapped BLAST can be utilized as described in Altschul et al., 1997, Nucleic Acids Res. 25:3389-3402. Alternatively, PSI-BLAST can be used to perform an iterated search which detects distant relationships between molecules (Id.). When utilizing BLAST, Gapped BLAST, and PSI-Blast programs, the default parameters of the respective programs (e.g., of XBLAST and NBLAST) can be used (see, e.g., the NCBI website). The percent identity between two sequences can be determined using techniques similar to those described above, with or without allowing gaps. In calculating percent identity, typically only exact matches are counted.
The term “specifically binds” as used herein refers to a binding reaction that is determinative of the presence of the biomarker (e.g. polypeptide or nucleic acid) often in a heterogeneous population of macromolecules. For example, when the biomarker specific reagent is an antibody, specifically binds refers to the specified antibody binding with greater affinity to the cognate antigenic determinant than to another antigenic determinant, for example binds with at least 2, at least 3, at least 5, or at least 10 times greater specificity; and when a probe, specifically binds refers to the specified probe under hybridization conditions binds to a particular gene sequence at least 1.5, at least 2 at least 3, or at least 5 times background.
The term “hybridize” or “hybridizable” refers to the sequence specific non-covalent binding interaction with a complementary nucleic acid. In a preferred embodiment, the hybridization is under high stringency conditions. Appropriate stringency conditions which promote hybridization are known to those skilled in the art, or can be found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1 6.3.6. For example, 6.0× sodium chloride/sodium citrate (SSC) at about 45° C., followed by a wash of 2.0×SSC at 50° C. may be employed.
The term “polynucleotide”, “nucleic acid” and/or “oligonucleotide” as used herein refers to a sequence of nucleotide or nucleoside monomers consisting of naturally occurring bases, sugars, and intersugar (backbone) linkages, and is intended to include DNA and RNA which can be either double stranded or single stranded, represent the sense or antisense strand.
The term “primer” as used herein refers to a polynucleotide, whether occurring naturally as in a purified restriction digest or produced synthetically, which is capable of acting as a point of synthesis when placed under conditions in which synthesis of a primer extension product, which is complementary to a nucleic acid strand is induced (e.g. in the presence of nucleotides and an inducing agent such as DNA polymerase and at a suitable temperature and pH). The primer must be sufficiently long to prime the synthesis of the desired extension product in the presence of the inducing agent. The exact length of the primer will depend upon factors, including temperature, sequences of the primer and the methods used. A primer typically contains 15-25 or more nucleotides, although it can contain less. The factors involved in determining the appropriate length of primer are readily known to one of ordinary skill in the art.
The term “probe” as used herein refers to a nucleic acid sequence that will hybridize to a nucleic acid target sequence. In one example, the probe hybridizes to a biomarker RNA or a nucleic acid sequence complementary to the biomarker RNA. The length of probe depends for example, on the hybridization conditions and the sequences of the probe and nucleic acid target sequence. The probe can be for example, at least 15, 20, 25, 50, 75, 100, 150, 200, 250, 400, 500 or more nucleotides in length.
A person skilled in the art would recognize that “all or part of” of a particular probe or primer can be used as long as the portion is sufficient for example in the case a probe, to specifically hybridize to the intended target and in the case of a primer, sufficient to prime amplification of the intended template.
The term “EGFR directed drug” as used herein refers to drugs that specifically bind with high affinity to the epidermal growth factor receptor (EGFR) on the cell surface or to the intracellular catalytic region in order to regulate the intrinsic protein-tyrosine kinase activity of the receptor. The tyrosine kinase activity initiates signal transduction cascades that results in a variety of biochemical changes in the cell such as increased aerobic glycolysis, changes in cell-cell and cell-matrix interactions and motility, changes in the expression of certain genes that ultimately lead to DNA synthesis and cell proliferation. EGFR genetic mutations that lead to increased expression or activity of the EGFR protein have been associated with a number of cancers, including lung cancer.
The term “kit control” as used herein means a suitable assay control useful when determining a level of a biomarker associated with non-small cell lung cancer. For example, when the kit is for a MRM/SRM method, the kit control is optionally a peptide fragment of a biomarker polypeptide that can for example be used to prepare a standard curve As an alternative example, where the kit is for detecting polypeptide levels by immunohistochemical methods, the kit control can comprise an antibody control, useful for example for detecting non-specific binding and/or for standardizing the amount of protein in the sample.
The term “biomarker specific reagent” as used herein refers to a reagent that is a highly sensitive and specific biomarker reagent used with standard immunohistochemistry (ICC) and immunohistochemistry (IHC) techniques to detect the level of a biomarker associated with non-small cell lung cancer.
The term “antibody” as used herein is intended to include monoclonal antibodies, polyclonal antibodies, and chimeric antibodies. The antibody may be from recombinant sources and/or produced in transgenic animals. The term “antibody fragment” as used herein is intended to include Fab, Fab′, F(ab′)2, scFv, dsFv, ds-scFv, dimers, minibodies, diabodies, and multimers thereof and bispecific antibody fragments. Antibodies can be fragmented using conventional techniques. For example, F(ab′)2 fragments can be generated by treating the antibody with pepsin. The resulting F(ab′)2 fragment can be treated to reduce disulfide bridges to produce Fab′ fragments. Papain digestion can lead to the formation of Fab fragments. Fab, Fab′ and F(ab′)2, scFv, dsFv, ds-scFv, dimers, minibodies, diabodies, bispecific antibody fragments and other fragments can also be synthesized by recombinant techniques.
Antibodies may be monospecific, bispecific, trispecific or of greater multispecificity. Multispecific antibodies may immunospecifically bind to different epitopes of a NADPH oxidase polypeptide and/or or a solid support material. Antibodies may be from any animal origin including birds and mammals (e.g., human, murine, donkey, sheep, rabbit, goat, guinea pig, camel, horse, or chicken).
Antibodies may be prepared using methods known to those skilled in the art. Isolated native or recombinant polypeptides may be utilized to prepare antibodies. See, for example, Kohler et al. (1975) Nature 256:495-497; Kozbor et al. (1985) J. Immunol Methods 81:31-42; Cote et al. (1983) Proc Natl Acad Sci 80:2026-2030; and Cole et al. (1984) Mol Cell Biol 62:109-120 for the preparation of monoclonal antibodies; Huse et al. (1989) Science 246:1275-1281 for the preparation of monoclonal Fab fragments; and, Pound (1998) Immunochemical Protocols, Humana Press, Totowa, N.J. for the preparation of phagemid or B-lymphocyte immunoglobulin libraries to identify antibodies.
In aspects, the antibody is a purified or isolated antibody. By “purified” or “isolated” is meant that a given antibody or fragment thereof, whether one that has been removed from nature (isolated from blood serum) or synthesized (produced by recombinant means), has been increased in purity, wherein “purity” is a relative term, not “absolute purity.” In particular aspects, a purified antibody is 60% free, preferably at least 75% free, and more preferably at least 90% free from other components with which it is naturally associated or associated following synthesis.
The term “control level” refers to a biomarker level in a control sample or a numerical value corresponding to such a sample. Control level can also refer to for example a threshold, cut-off or baseline level of a biomarker in subjects without non-small cell lung carcinoma or without a particular sub-type, where levels above/below which depending on the particular marker are associated with the presence of non-small cell lung carcinoma or a particular sub-type.
Similarly the term “positive control level” refers to a biomarker level in or corresponding to a positive control sample, for example associated with a subtype of non-small cell lung carcinoma. Positive control level can refer to a threshold, cut-off or baseline level of a biomarker in subjects with non-small cell lung carcinoma or a subtype thereof that is useful for comparing to a subject biomarker level. The positive control can for example be a level of at least one biomarker associated with non-small cell lung cancer or a subtype thereof, or a reference profile comprising levels of a plurality of markers.
The term “sample” as used herein refers to any biological fluid, cell or tissue sample from a subject including a test sample from a test subject e.g. a subject whose lung cancer status is being tested, and a control sample from a control subject e.g. a subject with lung cancer status is known. For example, the sample can comprise lung tissue, tumour biopsy, ascitic fluid, sputum, and/or bodily secretions. The sample for example can comprise formalin fixed and/or paraffin embedded tissue, a frozen tissue or fresh tissue. The sample can be used directly as obtained from the source or following a pretreatment to modify the character of the sample.
The term an “increased likelihood of developing”, as used herein is used to mean that a test subject with increased levels of a biomarker in Table 2, 4A, 4B, 6 and/or 7 has an increased chance of developing non-small cell lung carcinoma, or a subtype thereof, having recurrence or relapse or poorer survival relative to a control subject (e.g. a subject with control levels of a Table 2, 4A, 4B, 6 and/or 7 biomarker). The increased risk for example may be relative or absolute and may be expressed qualitatively or quantitatively. For example, an increased risk may be expressed as simply determining the test subject's expression level for a given biomarker and placing the test subject in an “increased risk” category, based upon previous population studies. Alternatively, a numerical expression of the test subject's increased risk may be determined based upon biomarker level analysis. As used herein, examples of expressions of an increased risk include but are not limited to, odds, probability, odds ratio, p-values, attributable risk, relative frequency, positive predictive value, negative predictive value, and relative risk.
The term “level” as used herein refers to a quantity of biomarker that is detectable or measurable in a sample and/or control. The quantity is for example a quantity of polypeptide, the quantity of nucleic acid e.g. biomarker transcript, or the quantity of a fragment. The level can alternatively include combinations thereof.
The term “determining a level” as used in reference to a biomarker means the application of a method to a sample, for example a sample of the subject and/or a control sample, for ascertaining quantitatively, semi-quantitatively or qualitatively the amount of a biomarker, for example the amount of biomarker polypeptide or mRNA. For example, a level of a biomarker can be determined by a number of methods including for example mass spectrometric methods, including for example MS, MS/MS, LC-MS/MS, SRM etc where a peptide of a biomarker is labeled and the amount of labeled biomarker peptide is ascertained, immunoassays including for example immunohistochemistry, ELISA, immunoprecipation and the like, where a biomarker detection agent such as an antibody for example, a labeled antibody specifically binds the biomarker and permits for example relative or absolute ascertaining of the amount of polypeptide biomarker, hybridazation and PCR protocols where a probe or primer or primer set are used to ascertain the amount of nucleic acid biomarker.
The term “MS” refers to mass spectrometry.
The term “MS/MS” refers to tandem mass spectrometry.
The term “1D LC-MS/MS” refers to 1-dimensional liquid chromatography tandem mass spectrometry using for example a LTQ-Orbitrap XL apparatus.
The term “2D LC-MS/MS” refers to 2-dimensional liquid chromatography tandem mass spectrometry.
The term “SRM” refers to selective reaction monitoring which is a mass spectrometry approach to the quantitative detection of selected proteins. The assay can for example be multiplexed (e.g. MRM).
The term “non-small cell lung carcinoma” or NSCLC as used herein refers to all lung cancers that are not small cell lung cancer and includes several sub-types including but not limited to large cell carcinoma, squamous cell carcinoma and adenocarcinoma. All stages and metastasis are included. Accounting for 25% of lung cancers, squamous cell carcinoma usually starts near a central bronchus. A hollow cavity and associated necrosis are commonly found at the center of the tumor. Well-differentiated squamous cell cancers often grow more slowly than other cancer types. Adenocarcinoma accounts for 40% of non-small cell lung cancers. It usually originates in peripheral lung tissue. Most cases of adenocarcinoma are associated with smoking; however, among people who have never smoked, adenocarcinoma is the most common form of lung cancer.
The term “proteome” as used herein refers to a set of polypeptides, detectable in a sample type, such as a biopsy comprising a lung cell, and/or refers to a set of polypeptides detectable and/or quantified in a cell and/or tumour, for example non-small cell lung carcinoma or a subtype thereof, optionally expressed at a given time and/or under defined conditions.
The term “reference profile” as used herein refers to a suitable comparison profile, for example a polypeptide or nucleic acid reference profile that comprises the level of two or more biomarkers of the disclosure in a sample corresponding to a subject that has or does not have a non-small cell lung carcinoma, or particular subtype thereof. For example, in methods involving determining for example if a subject has non-small cell lung carcinoma of the adenocarcinoma subtype, the “reference profile” can be a polypeptide profile corresponding to a subject that does not have non-small cell lung cancer or who has non-small cell lung carcinoma of the adenocarcinoma subtype. Similarly, in methods involving determining for example if a subject has non-small cell lung carcinoma of the squamous cell carcinoma subtype, the “reference profile” can be a polypeptide reference profile corresponding to a subject that does not have non-small cell lung carcinoma or has non-small cell lung carcinoma of the squamous cell carcinoma subtype. The reference profile is an expression signature (e.g. polypeptide or nucleic acid gene expression levels and/or pattern) of a plurality of genes (e.g. at least 2 genes, for example 5 genes), associated with non-small cell lung cancer or a subtype thereof. The reference profile is identified using one or more samples comprising non-small cell lung cancer cells wherein the expression is similar between related samples defining a subtype class and is different to unrelated samples defining a different subtype class such that the reference expression profile is associated with a particular cancer subtype. The reference expression profile is accordingly a reference profile or reference signature of the expression of five or more genes listed in Table 2, 4A, 4B, 6 and/or 7, to which the expression levels of the corresponding genes in a test sample are compared in methods for example for determining non-small cell lung cancer subtype.
The phrase “screening for, diagnosing or detecting non-small cell lung carcinoma or an increased likelihood of developing non-small cell lung” refers to a method or process of determining if a subject has or does not have non-small cell lung carcinoma, or has or does not have an increased risk of developing non-small cell lung carcinoma. For example, detection of altered levels of a Table 2, 4A, 4B, 6 and/or 7 biomarker compared to control is indicative that the subject has non-small cell lung carcinoma or an increased risk of developing non-small cell lung carcinoma.
The phrase “screening for, diagnosing or detecting non-small cell lung carcinoma of the adenocarcinoma subtype or an increased likelihood of developing non-small cell lung of the adenocarcinoma subtype” refers to a method or process of determining if a subject has or does not have non-small cell lung carcinoma of the adenocarcinoma subtype, or has or does not have an increased risk of developing non-small cell lung carcinoma of the adenocarcinoma subtype. For example, detection of altered levels of a Table 2 or 4A biomarker compared to control is indicative that the subject has non-small cell lung carcinoma of the adenocarcinoma subtype or an increased risk of developing non-small cell lung carcinoma of the adenocarcinoma subtype.
The phrase “screening for, diagnosing or detecting non-small cell lung carcinoma of the squamous cell carcinoma subtype or an increased likelihood of developing non-small cell lung of the squamous cell carcinoma subtype” refers to a method or process of determining if a subject has or does not have non-small cell lung carcinoma of the squamous cell carcinoma subtype, or has or does not have an increased risk of developing non-small cell lung carcinoma of the squamous cell carcinoma subtype. For example, detection of altered levels of a Table 2 or 4B biomarker compared to control is indicative that the subject has non-small cell lung carcinoma of the squamous cell carcinoma subtype or an increased risk of developing non-small cell lung carcinoma of the squamous cell carcinoma subtype.
The term “subject” as used herein refers to any member of the animal kingdom, preferably a human being.
The phrase “therapy or treatment” as used herein, refers to an approach aimed at obtaining beneficial or desired results, including clinical results and includes medical procedures and applications including for example chemotherapy, pharmaceutical interventions, surgery, radiotherapy and naturopathic interventions as well as test treatments for treating non-small cell lung cancer. Beneficial or desired clinical results can include, but are not limited to, alleviation or amelioration of one or more symptoms or conditions, diminishment of extent of disease, stabilized (i.e. not worsening) state of disease, preventing spread of disease, delay or slowing of disease progression, amelioration or palliation of the disease state, and remission (whether partial or total), whether detectable or undetectable. “Treatment” can also mean prolonging survival as compared to expected survival if not receiving treatment.
Moreover, a “treatment” or “prevention” regime of a subject with a therapeutically effective amount of the compound of the present disclosure may consist of a single administration, or alternatively comprise a series of applications.
The term “xenograft” as used herein, refers to cells, tissues, or organs that are the result of a transplantation of cells, tissues or organs from one species to another.
In understanding the scope of the present disclosure, the term “comprising” and its derivatives, as used herein, are intended to be open ended terms that specify the presence of the stated features, elements, components, groups, integers, and/or steps, but do not exclude the presence of other unstated features, elements, components, groups, integers and/or steps. The foregoing also applies to words having similar meanings such as the terms, “including”, “having” and their derivatives. Finally, terms of degree such as “substantially”, “about” and “approximately” as used herein mean a reasonable amount of deviation of the modified term such that the end result is not significantly changed. These terms of degree should be construed as including a deviation of at least ±5% of the modified term if this deviation would not negate the meaning of the word it modifies.
The recitation of numerical ranges by endpoints herein includes all numbers and fractions subsumed within that range (e.g. 1 to 5 includes 1, 1.5, 2, 2.75, 3, 3.90, 4, and 5). It is also to be understood that all numbers and fractions thereof are presumed to be modified by the term “about.” Further, it is to be understood that “a,” “an,” and “the” include plural referents unless the content clearly dictates otherwise. The term “about” means plus or minus 0.1 to 50%, 5-50%, or 10-40%, preferably 10-20%, more preferably 10% or 15%, of the number to which reference is being made.
It is demonstrated herein that different subtypes of non-small cell lung carcinoma have distinct biomarker signatures. For example, a number biomarkers show increased levels in non-small cell lung carcinoma of the adenocarcinoma (ADC) subtype and a number of biomarkers show increased expression in non-small cell lung carcinoma of the squamous cell carcinoma (SCC) subtype.
According to one aspect, the disclosure includes a method of screening for, diagnosing or detecting non-small cell lung carcinoma or an increased likelihood of developing non-small cell lung carcinoma in a subject comprising:
In an embodiment, the biomarker is selected from the biomarkers set out in Table 2, Table 4A and Table 4B.
In an embodiment, the level of the biomarker determined is a polypeptide level or a nucleic acid level.
In an embodiment, the difference in the level of the biomarker is an increase in the level of the at least one biomarker in the test sample compared to a control. The control may be a sample from, or a numerical value that corresponds to, a control subject that does not have non-small cell lung carcinoma. In an embodiment, for biomarkers which are increased in non-small cell lung cancer, detecting an increased level is indicative the subject has non-small cell lung carcinoma or an increased risk of developing non-small cell lung carcinoma.
In yet a further embodiment, the level of at least 2, at least 3, at least 4, at least 5, at least 10, at least 14, at least 20, or at least 25 biomarkers is determined.
In an embodiment, the ratio of the level of the biomarker in the test sample compared to the control is greater than 2, 3, 5, 10, 12, 15, 20, 25, 30, 35, 40, 45, 50 or more. In an embodiment, an increased level of at least 2, at least 3, at least 4, at least 5, at least 10, at least 14, at least 20, or at least 25 biomarkers compared to the control is detected and/or is indicative of non-small cell lung carcinoma or an increased likelihood of developing non-small cell lung carcinoma in the subject.
In another embodiment, the level of at least one biomarker in the test sample is compared to a positive control in addition to or instead of a control. The positive control may be a sample from, or a numerical value that corresponds to, a subject or population of subjects known to have non-small cell lung carcinoma.
In an embodiment, a similar level or an increased level compared to the positive control is indicative of non-small cell lung carcinoma or an increased likelihood of developing non-small cell lung carcinoma in the subject.
In an embodiment, a decrease in the level of the biomarker in the test sample compared to the positive control is indicative the subject does not have non-small cell lung carcinoma or an increased risk of developing non-small cell lung carcinoma.
In an embodiment, the non-small cell lung carcinoma is adenocarcinoma or squamous cell carcinoma.
In another embodiment, an expression profile of the test sample obtained from the subject is determined. The expression profile comprises a level for each of at least two biomarkers associated with non-small cell lung carcinoma, these biomarkers being selected from the biomarkers set out in Table 2, 4A, 4B, 6 and/or 7. In this embodiment, the control is a reference profile associated with a non-small cell lung carcinoma subtype selected from adenocarcinoma and squamous cell carcinoma, and an expression profile most similar to the reference profile associated with adenocarcinoma is indicative that the subject has adenocarcinoma and an expression profile most similar to the reference profile associated with squamous cell carcinoma is indicative that the subject has squamous cell carcinoma.
The biomarker may be a keratin. In one embodiment, the keratin is selected from type KRT8, KRT18, KRT20, KRT7, KRT19, KRT5, KRT14, KRT15, KRT6A, KRT6B, KRT6C, KRT16, KRT17, KRT4, KRT13, KRT1, KRT10, KRT2, KRT3, KRT76, KRT78 and/or KRT80. In one embodiment, an increased level of KRT5, KRT6 and/or KRT15 is indicative that the subject has non-small cell lung cancer of the squamous cell carcinoma subtype. In one embodiment, an increased level of KRT7 is indicative that the subject has non-small cell lung cancer of the adenocarcinoma subtype.
In one embodiment, the biomarker is Carbamoyl-Phosphate Synthase (CPS-1) and an increased level is indicative that the subject has non-small cell lung cancer of the adenocarcinoma subtype.
In another embodiment, the biomarker is Anterior Gradient homolog 2 (AGR2) and an increased level is indicative that the subject has non-small cell lung cancer of the adenocarcinoma subtype.
In one embodiment, the biomarker is plakophilin-1 (PLP1) and an increased level is indicative that the subject has non-small cell lung cancer of the squamous cell carcinoma subtype.
In one embodiment, the expression profile comprises the expression level of at least two keratins.
According to another aspect, the disclosure also includes a method of differentiating between non-small cell lung carcinoma of the adenocarcinoma subtype and non-small cell lung carcinoma of the squamous cell carcinoma subtype in a subject, or detecting an increased likelihood of developing non-small cell lung carcinoma of the adenocarcinoma subtype or non-small cell lung carcinoma of the squamous cell carcinoma subtype in a subject comprising:
Determination of non-small cell lung cancer subtype can involve classifying a subject with, or suspected of having non-small cell lung cancer based on the similarity of a subject's expression profile to one or more reference profiles associated with a particular lung cancer subtype. For example, the subject's expression profile can be compared to a reference profile associated with non-small cell lung cancer of the adenocarcinoma subtype and/or a reference profile associated with non-small cell lung cancer of the of the squamous cell carcinoma cell type.
Accordingly, in another aspect the disclosure includes a method of classifying a subject having or suspected of having non-small cell lung carcinoma as having adenomacarcinoma subtype or squamous cell carcinoma subtype, comprising:
Wherein a plurality of biomarkers are assessed, the method can comprise calculating a measure of similarity. Accordingly, in an embodiment, the disclosure provides a method for classifying a subject having or suspected of having non-small cell lung cancer as having adenomacarcinoma subtype or squamous cell carcinoma subtype, comprising:
A number of algorithms can be used to assess similarity of samples. For example, similarity can be assessed by determining the Euclidean distance of a sample expression profile to a class centroid.
Wards algorithm can be used for forming hierarchical groups of mutually exclusive subsets of samples.
In an embodiment, the biomarker associated with non-small cell lung carcinoma of the adenocarcinoma subtype or non-small cell lung carcinoma of the squamous cell subtype is selected from the biomarkers set out in Table 2, Table 4A and Table 4B.
In an embodiment, the level of the at least one biomarker determined is a polypeptide level or a nucleic acid level.
In an embodiment, the altered level is an increase in the level of the biomarker in the test sample compared to a control. This control may be a sample from, or a numerical value that corresponds to, a control subject that does not have non-small cell lung carcinoma. The increase of the level of the biomarker is indicative of whether the subject has non-small cell lung carcinoma of the adenocarcinoma subtype or non-small cell lung carcinoma of the squamous cell carcinoma subtype, or an increased likelihood of developing non-small cell lung carcinoma of the adenocarcinoma subtype or non-small cell lung carcinoma of the squamous cell carcinoma subtype.
In yet a further embodiment, the level of at least 2 at least 3, at least 4, at least 5, at least 10, at least 15, at least 20, or at least 25 biomarkers is determined. In an embodiment, the ratio of the level of the biomarker in the test sample compared to the control is greater than 2, 3, 5, 10, 12, 15, 20, 25, 30, 35, 40, 45, 50 or more. In another embodiment, an increased level of at least 2, at least 3, at least 4, at least 5, at least 10, at least 15, at least 20, or at least 25 biomarkers compared to the control is detected and/or is indicative of non-small cell lung carcinoma of the adenocarcinoma subtype or non-small cell lung carcinoma of the squamous cell carcinoma subtype, or an increased likelihood of developing non-small cell lung carcinoma of the adenocarcinoma subtype or non-small cell lung carcinoma of the squamous cell carcinoma subtype.
In another embodiment, the level of the at least one biomarker in the test sample is compared to a positive control in addition to or instead of a control. This positive control may be a sample from, or a numerical value that corresponds to, a control subject with non-small cell lung carcinoma of the adenocarcinoma subtype or a control subject with non-small cell lung carcinoma of the squamous cell carcinoma subtype. A similar level or increased level compared to the positive control is indicative of non-small cell lung carcinoma of the adenocarcinoma subtype or non-small cell lung carcinoma of the squamous cell carcinoma subtype, or an increased likelihood of developing non-small cell lung carcinoma of the adenocarcinoma subtype or non-small cell lung carcinoma of the squamous cell carcinoma subtype. Alternatively, a decrease in the level of the biomarker in the test sample compared to the positive control is indicative the subject does not have non-small cell lung carcinoma of the adenocarcinoma subtype or non-small cell lung carcinoma of the squamous cell carcinoma subtype, or an increased likelihood of developing non-small cell lung carcinoma of the adenocarcinoma subtype or non-small cell lung carcinoma of the squamous cell carcinoma subtype.
In another embodiment, an expression profile in the test sample from the subject is determined. The expression profile comprises a level for each of at least two biomarkers associated with non-small cell lung carcinoma of the adenocarcinoma subtype or of the squamous cell carcinoma subtype, wherein the at least two biomarkers are selected from the biomarkers set out in Table 2, 4A, 4B, 6 and/or 7. In this embodiment, the control is a reference profile associated with a non-small cell lung carcinoma subtype selected from adenocarcinoma and squamous cell carcinoma, and an expression profile most similar to the reference profile associated with adenocarcinoma is indicative that the subject has adenocarcinoma and an expression profile most similar to the reference profile associated with squamous cell carcinoma is indicative that the subject has squamous cell carcinoma.
In one embodiment, an increased level of KRT5, KRT6 or KRT15 is indicative that the subject has non-small cell lung cancer of the squamous cell carcinoma subtype.
In one embodiment, an increased level of KRT7 is indicative that the subject has non-small cell lung cancer of the adenocarcinoma subtype.
In one embodiment, the biomarker is CPS-1 and an increased level is indicative that the subject has non-small cell lung cancer of the adenocarcinoma subtype.
In one embodiment, the biomarker is plakophilin-1 and an increased level is indicative that the subject has non-small cell lung cancer of the squamous cell carcinoma subtype.
In one embodiment, the expression profile comprises the expression level of at least two keratins.
According to another aspect, the disclosure also includes a method of screening for, diagnosing or detecting non-small cell lung carcinoma of the adenocarcinoma subtype or an increased likelihood of developing non-small cell lung carcinoma of the adenocarcinoma subtype in a subject comprising:
(a) determining a level of at least one biomarker associated with non-small cell lung carcinoma of the adenocarcinoma subtype in a test sample from the subject wherein the at least one biomarker is selected from the biomarkers set out in Table 2 or Table 4A; and
(b) comparing the level of the at least one biomarker in the test sample with a control;
wherein detecting an altered level of the at least one biomarker in the test sample compared to the control is indicative of whether the subject has or does not have non-small cell lung carcinoma of the adenocarcinoma subtype or an increased likelihood of developing non-small cell lung carcinoma of the adenocarcinoma subtype.
In an embodiment, the biomarker associated with non-small cell lung carcinoma of the adenocarcinoma subtype may be selected from the biomarkers set out in Table 2.
In an embodiment, the level of the at least one biomarker determined is a polypeptide level or a nucleic acid level.
In an embodiment, the altered level of the biomarker is an increase in the level of the at least one biomarker in the test sample compared to a control. This control may be a sample from, or a numerical value that corresponds to, a control subject that does not have non-small cell lung carcinoma or a control subject that does not have non-small cell lung carcinoma of the adenocarcinoma subtype. The increased level is indicative the subject has non-small cell lung carcinoma of the adenocarcinoma subtype or an increased risk of developing non-small cell lung carcinoma of the adenocarcinoma subtype.
In an embodiment, the level of at least 2, at least 3, at least 4, at least 5, at least 10, at least 15, at least 20, or at least 25 biomarkers is determined. In an embodiment, a ratio of the level of the biomarker in the test sample compared to the control is greater than 2, 3, 5, 10, 12, 15, 20, 25, 30, 35, 40, 45, 50 or more. In another embodiment, an increased level of at least 2, at least 3, at least 4, at least 5, at least 10, at least 15, at least 20, or at least 25 biomarkers compared to the control is detected and/or is indicative of non-small cell lung carcinoma of the adenocarcinoma subtype or an increased likelihood of developing non-small cell lung carcinoma of the adenocarcinoma subtype in the subject.
In another embodiment, the level of the biomarker in the test sample is compared to a positive control in addition to or instead of a control. This positive control may be a sample from, or a numerical value that corresponds to, a control subject with non-small cell lung carcinoma of the adenocarcinoma subtype. A similar or increased level compared to the positive control is indicative of non-small cell lung carcinoma of the adenocarcinoma subtype or an increased likelihood of developing non-small cell lung carcinoma of the adenocarcinoma subtype in the subject. Alternatively, a decrease in the level of the biomarker in the test sample compared to the positive control, wherein the decreased level is indicative the subject does not have non-small cell lung carcinoma of the adenocarcinoma subtype or an increased risk of developing non-small cell lung carcinoma of the adenocarcinoma subtype.
In an embodiment, an expression profile in the test sample from the subject is determined. The expression profile comprises a level for each of at least two biomarkers associated with non-small cell lung carcinoma of the adenocarcinoma subtype, wherein the at least two biomarkers are selected from the biomarkers set out in Table 2 or Table 4A. In this embodiment, the control is a reference profile associated with a non-small cell lung carcinoma of the adenocarcinoma subtype, and an expression profile most similar to the reference profile associated with non-small cell lung carcinoma of the adenocarcinoma subtype is indicative that the subject has non-small cell lung carcinoma of the adenocarcinoma subtype.
In an embodiment, the at least one biomarker is a keratin. The keratin may be selected from KRT18, KRT7, KRT14 or KRT17. In an embodiment, the keratin is KRT7.
In a further embodiment, an increased level of KRT7 is indicative that the subject has non-small cell lung cancer of the adenocarcinoma subtype.
In a further embodiment, the biomarker is CPS-1 and an increased level is indicative that the subject has non-small cell lung cancer of the adenocarcinoma subtype.
In an embodiment, the expression profile comprises the expression level of at least two keratins.
According to another aspect, the disclosure also includes a method of screening for, diagnosing or detecting non-small cell lung carcinoma of the squamous cell carcinoma subtype or an increased likelihood of developing non-small cell lung carcinoma of the squamous cell carcinoma subtype in a subject comprising:
wherein detecting a difference or similarity in the level of the at least one biomarker in the test sample compared to the control is indicative of whether the subject has or does not have non-small cell lung carcinoma of the squamous cell carcinoma subtype or an increased likelihood of developing non-small cell lung carcinoma of the squamous cell carcinoma subtype. In an embodiment, the biomarker associated with non-small cell lung carcinoma of the squamous cell carcinoma subtype is selected from the biomarkers set out in Table 2.
In an embodiment, the level of the biomarker determined is a polypeptide level or a nucleic acid level.
In an embodiment, the difference in the level is an increase in the level of the biomarker in the test sample compared to a control. This control may be a sample from, or a numerical value that corresponds to, a control subject that does not have non-small cell lung carcinoma or a control subject that does not have non-small cell lung carcinoma of the squamous cell carcinoma subtype. An increased level is indicative the subject has non-small cell lung carcinoma of the squamous cell carcinoma subtype or an increased risk of developing non-small cell lung carcinoma of the squamous cell carcinoma subtype.
In an embodiment, the level of at least 2, at least 3, at least 4, at least 5, at least 10, at least 15, at least 20, or at least 25 biomarkers is determined.
In an embodiment, a ratio of the level of the biomarker in the test sample compared to the control is greater than 2, 3, 5, 10, 12, 15, 20, 25, 30, 35, 40, 45, 50 or more.
In an embodiment, an increased level of at least 2, at least 3, at least 4, at least 5, at least 10, at least 15, at least 20, or at least 25 biomarkers compared to the control is detected and/or indicative of non-small cell lung carcinoma of the squamous cell carcinoma subtype or an increased likelihood of developing non-small cell lung carcinoma of the squamous cell carcinoma subtype in the subject.
In another embodiment, the level of the biomarker in the test sample is compared to a positive control in addition to or instead of a control. This positive control may be a sample from, or a numerical value that corresponds to, a control subject with non-small cell lung carcinoma of the squamous cell carcinoma subtype. A similar or increased level compared to the positive control is indicative of non-small cell lung carcinoma of the squamous cell carcinoma subtype or an increased likelihood of developing non-small cell lung carcinoma of the squamous cell carcinoma subtype in the subject. Alternatively, a decrease in the level of the biomarker in the test sample compared to the positive control is indicative the subject does not have non-small cell lung carcinoma of the squamous cell carcinoma subtype or does not have an increased risk of developing non-small cell lung carcinoma of the squamous cell carcinoma subtype.
In an embodiment, an expression profile of the test sample from the subject is determined. The expression profile comprises a level for each of at least two biomarkers associated with non-small cell lung carcinoma squamous cell carcinoma, the at least two biomarkers selected from the biomarkers set out in Table 2 or Table 4B. In this embodiment, the control is a reference profile associated with a non-small cell lung carcinoma of the squamous cell carcinoma subtype, and an expression profile most similar to the reference profile associated with non-small cell lung carcinoma of the squamous cell carcinoma subtype is indicative that the subject has non-small cell lung carcinoma of the squamous cell carcinoma subtype.
In an embodiment, the at least one biomarker is a keratin.
In an embodiment, the biomarker is plakophilin-1 and an increased level is indicative that the subject has non-small cell lung cancer of the squamous cell carcinoma subtype.
In an embodiment, the level of at least one biomarker or the expression product is determined using mass spectrometry.
In an embodiment, the mass spectrometry comprises tandem mass spectrometry, 1D LC MS/MS, 2D LC MS/MS, SRM or MRM.
In an embodiment, the biomarker level is detected by summing the spectral counts for biomarker peptides. Spectral counts for peptides corresponding to a single protein can be summed and protein level counts can be normalized by obtaining the relative abundance ratio (dividing by the total sample spectral counts) than multiplying by the overall experimental spectral count.
In an embodiment, the biomarker is a biomarker listed in Table 6 and mass spectrometry is used to detect a peptide listed in Table 6.
The methods described herein can be computer implemented. In an embodiment, the method further comprises: (c) displaying or outputting to a user interface device, a computer readable storage medium, or a local or remote computer system; the classification produced by the classifying step (b). In another embodiment, the method comprises displaying or outputting a result of one of the steps to a user interface device, a computer readable storage medium, a monitor, or a computer that is part of a network.
According to another aspect, the disclosure also includes a method of treating non-small cell lung carcinoma in a subject comprising:
According to another aspect, the disclosure also includes a method of treating non-small cell lung carcinoma of the subtype adenocarcinoma or non-small cell lung carcinoma of the subtype squamous cell carcinoma in a subject comprising:
According to another aspect, the disclosure also includes a method of treating non-small cell lung carcinoma of adenocarcinoma subtype in a subject comprising:
A method of treating non-small cell lung carcinoma of squamous cell carcinoma subtype in a subject comprising:
According to another aspect, the disclosure also includes a SRM/MRM method for quantifying a level of at least one biomarker associated with non-small cell lung carcinoma in a sample, the method comprising the steps of:
b) evaluating the biomarker level using SRM/MRM mass spectrometry. In an embodiment, the amount of biomarker peptide fragment present in a sample can be determined by summing the area from all transitions and normalizing the totals to an area obtained for a control peptide for example, the peptide LISWYDNEFGYSNR (SEQ ID NO:1) that is found in GAPDH.
In an embodiment, the level of a biomarker listed in Table 6 is quantified using SRM/MRM mass spectrometry. In an embodiment, the peptide fragment that is isotope labeled, is a peptide fragment listed in Table 6. In an embodiment, the level of epidermal growth factor receptor, optionally phosphorylated epidermal growth factor receptor is quantified using SRM/MRM mass spectrometry.
In an embodiment, the EGFR peptide fragment detected is NLQEILHGAVR (SEQ ID NO:12).
In an embodiment, the method comprises a dynamic detection range corresponding from about 10 000 copies to about 1 million copies of per cell.
The quantifying method may also further comprise the step of determining activation of the epidermal growth factor receptor (EGFR) network.
It is demonstrated herein that SRM/MRM is able to more accurately assess the level of for example EGFR and the accurate quantification of EGFR protein levels by SRM may enable the further stratification of NSCLC in terms of EGFR levels, beyond what has been achieved by IHC, measures of gene copy number, and mutations.
According to another aspect, the disclosure also includes a method for treating non-small cell lung carcinoma in a subject comprising:
a) quantifying a level of EGFR in a test sample from the subject; and
b) administering an EGFR directed drug to the subject when the level of EGFR level quantified is above a threshold indicative that the subject would benefit from the EGFR directed drug.
In an embodiment, one or more parameters provided in Example 1 are used.
According to another aspect, the disclosure also includes a kit for measuring the level of at least one biomarker associated with non-small cell lung cancer or a subtype thereof in a sample, wherein the at least one biomarker is selected from the biomarkers set out in Table 2, 4A, 4B, 6 and/or 7, comprising:
In an embodiment, the at least one biomarker is selected from the biomarkers set out in Table 2, Table 4A or 4B.
In an embodiment, the at least one biomarker is a keratin. In an embodiment, the kit comprises a set of biomarker detection agents for detecting a set of keratin biomarkers. In embodiment, the array slide comprises a set of biomarker detection agents for detecting a set of keratin biomarkers.
In an embodiment, the keratin is selected from KRT8, KRT18, KRT20, KRT7, KRT19, KRT5, KRT14, KRT15, KRT6A, KRT6B, KRT6C, KRT16, KRT17, KRT4, KRT13, KRT1, KRT10, KRT2, KRT3, KRT76, KRT78 and/or KRT80.
In an embodiment, the keratin is selected from KRT18, KRT7, KRT5, KRT14, KRT15, KRT6A, KRT16, KRT17, KRT4 and/or KRT13.
In an embodiment, the at least one biomarker is CPS-1.
In an embodiment, the at least one biomaker is AGR2.
In an embodiment, the at least one biomarker is plakophilin-1.
In an embodiment, the biomarker specific reagent is an antibody or antibody fragment.
In an embodiment, the biomarker specific reagent is a probe, or primer set that amplifies a nucleic acid transcript of the biomarker.
In an embodiment, the control is a peptide fragment of the at least one biomarker.
The kit can comprise for example, specimen collection tubes for example for collecting a biopsy, extraction buffer, positive controls, and the like.
The following non-limiting examples are illustrative of the present disclosure:
Routinely harvested fresh human NSCLC were resected surgically at The University Health Network (Toronto) and directly implanted into non-obese diabetic and severe combined immune-deficient (NOD-SCID) mice to establish primary tumor xenograft models. Each tumor model was verified by at least three serial in vivo passages to demonstrate engraftment stability. OncoCarta MassARRAY Chip (Sequenom) mutation screening was conducted to detect mutations in the EGFR, KRAS, and PIK3CA genes. These results were validated by direct DNA sequencing of tumor xenograft specimens.
Formalin-fixed paraffin-embedded (FFPE) tissue blocks were cut at 4 um thickness onto slides and dried in 60° C. oven overnight. Slides were further processed and stained in a fully automated process using the BenchMark XT (Ventana Medical Systems Inc.). Slides were scored by a pathologist (NY). Staining intensity was scored positive or negative. When specimens showed partial positive labeling, the percentage of tumor cells labeled is estimated.
Xenograft tissues were harvested from mice and immediately stored in liquid nitrogen. Aliquots of tissue (approximately 50 mg) were mixed with lysis buffer (1 ml buffer per 10 mg tissue; 20 mM HEPES pH 8.0, 9 M Urea, 1 mM sodium orthovanadate, 2.5 mM sodium pyrophosphate, 1 mM β-glycerophosphate) and subjected to ultra-sonication for 1 min, followed by centrifugation (20,000×g) for 20 min. Aliquots of supernatants (clarified lysates; 50 μl) were set aside for Western blotting analysis. Routinely, the concentrations of clarified lysates were approximately 2 mg/ml (protein). An equal volume of 2×SDS-PAGE sample loading buffer was added, and the samples were resolved by standard SDS-PAGE methods, and then electrophoretically transferred to Immobilon-P membranes (Millipore) for Western blotting, essentially as described previously 10.
For MS analysis, clarified lysates were reduced with 4.5 mM DTT, carboxamidomethylated by using 10 mM iodoacetamide, diluted 4-fold, digested by incubation with Trypsin-TPCK for 12 h. Peptides were then desalted by using C18 resin as described previously 11. The eluted peptides were aliquoted and lyophilized. 2 μg or 15 μg dried, desalted peptides were dissolved in 0.1% formic acid and analyzed by 1D or 2D LC/MS/MS, respectively.
All samples were analyzed on a LTQ-Orbitrap XL. The instrument method consisted of one MS full scan (400-1800 m/z) in the Orbitrap mass analyzer, an AGC target of 500,000 with a maximum ion injection of 500 ms, 1 microscan and a resolution of 60,000 and using the preview scan option. Six data-dependent MS/MS scans were performed in the linear ion trap using the three most intense ions at 35% normalized collision energy. The MS and MS/MS scans were obtained in parallel. AGC targets were 10,000 with a maximum ion injection time of 100 ms. A minimum ion intensity of 1,000 was required to trigger a MS/MS spectrum. The dynamic exclusion was applied using a maximum exclusion list of 500 with one repeat count with a repeat duration of 30 seconds and exclusion duration of 45 seconds.
2D LC-MS/MS analyses
A fully automated 4-step two-dimensional chromatography sequence was set up as previously described 12. Peptides were loaded on a 7 cm pre-column (150 μm i.d.) containing a Kasil frit packed with 3.5 cm 5μ Magic C18 100 Å reversed phase material (Michrom Bioresources) followed by 3.5 cm Luna® 5μ SCX 100 Å strong cation exchange resin (Phenomenex, Torrance, Calif.). Samples were automatically loaded from a 96-well microplate autosampler using an EASY-nLC system (Proxeon Biosystems, Odense, Denmark) at 3 μl/minute. The pre-column was connected to an 8 cm fused silica analytical column (75 μm i.d.) via a micro splitter tee (Proxeon) to which a distal 2.3 kV spray voltage was applied. The analytical column was pulled to a fine electrospray emitter using a laser puller. For the peptide separation on the analytical column a water/acetonitrile gradient was applied at an effective flow rate of 400 nl/minute, controlled by the EASY-nLC. Ammonium acetate salt bumps (8 μl) were applied at the following concentrations (0 mM, 100 mM, 300 mM and 500 mM), using the 96-well micro plate autosampler at a flow-rate of 3 ml/minute in a vented-column set-up.
SRM was carried out on duplicate 5 μg aliquots of each xenograft lysate. The peptides were captured on a 150 μm ID C18 pre-column and separated over a 75 μm ID analytical column constructed with an emitter tip. The separation was carried out with a gradient of 0 to 65% acetonitrile in 0.1% formic acid over 40 min using the EASY-nLC split-free HPLC system. The eluted peptides were monitored by using a TSQ Quantum Vantage triple quadrupole mass spectrometer (ThermoFisher, San Jose, Calif.). The dwell time was 20 ms and the scan width was 0.01 amu. The S-lens was varied with precursor m/z values and a 10 V declustering potential was used. Q1 and Q3 resolution were set to 0.2 and 0.7 amu, respectively. The transitions used are shown in Table 6. To normalize for the amount of human peptides present in each sample, the area from all transitions were summed and the totals were normalized to the areas obtained for the peptide LISWYDNEFGYSNR (SEQ ID NO:1) that is found only in human GAPDH. Note that the 2D LC-MS/MS experiments verified that human GAPDH was not statistically different in abundance between ADC and SCC samples.
Therefore, the SRM value for human GAPDH peptide LISWYDNEFGYSNR (SEQ ID NO:1), based on 3 transitions, which averaged 5.7±0.8 (SE)×105 units across the ten samples, was used to normalize the summed SRM transition measurements associated with individual peptides from corresponding xenograft samples. Collision energy was calculated by using the formula 3.41+0.034×(m/z of parent peptide), with collision gas pressure at 1.5 mTorr as described by Prakash and colleagues. 34
Spectral counts for peptides corresponding to a single protein were summed and protein level counts were normalized by obtaining the relative abundance ratio (dividing by the total sample spectral counts) than multiplying by the overall experimental spectral count. Spectra count values of zero were changed to 0.2 as part of the normalization routine similar, as recently reported 13-16. For clustering analysis, the 3 replicates for each sample were averaged, and the normalized data was then filtered to include proteins which were detected in at least 2 samples in the entire data set of normalized protein data (i.e. sample incidence ≧2). Hierarchical clustering was applied to the normalized protein data in R (v2.10.0) via the ‘hclust’ function using Spearman's rank correlation as distance metrics and the ‘average’ agglomeration method, and was used to generate dendrograms. Heatmap plots were generated using the heatmap.2 function in the R package ‘gplots’ (v2.7.4), utilizing the log 2 transformed normalized protein data.
The Wilcoxon Rank Sum Test (‘wilcox.test’ in R (v2.10.0)) was used to identify differentially expressed proteins between ADC and SCC samples in the normalized protein data. Significance was assumed as p-value <0.05.
Raw data was converted to m/z XML using ReAdW and searched by X!Tandem against a locally installed version of a merged human and mouse IPI (http://www.ebi.ac.uk/IPI) protein sequence database (version 3.54; 75,427 human sequences and 55,985 mouse sequences). The searches were performed with a fragment ion mass tolerance of 0.4 Da, a parent ion mass tolerance of ±10 ppm. Complete tryptic digest was assumed. Carbamidomethylation of cysteine was specified as fixed, and oxidation of methionine as variable modification.
To estimate and minimize the false positive rate the merged human and mouse protein sequence database also contained every IPI protein sequence in its reversed amino acid orientation (target-decoy strategy; total database size 262,824 sequences) as recently described 17′ 18. For the presented study, the value of total reverse spectra to total forward spectra was set to 0.5%, resulting in zero decoy sequences in the final output (0 reverse proteins for both the human and mouse assignments). Only fully tryptic peptides ≧7 amino acids, matching these criteria were accepted to generate the final list of identified proteins. Only proteins identified with two unique peptides per analyzed sample were accepted (i.e. 3 MudPITs per xenograft). To minimize protein inference, a database grouping scheme was developed, and only proteins with substantial peptide information were reported, as recently reported 14, 17, 19. For a protein to be assigned “human or mouse” it required at least one unique peptide mapping uniquely to either a human or mouse entry in the mixed-species database.
As part of a larger effort to establish and characterize >100 primary NSCLC xenografts, a pilot study was conducted with 10 NSCLC xenografts to test if a proteomics platform could be effectively applied to characterize these tumors (
The establishment of the xenograft tumors and sample preparation are described above. Table 1 presents a summary of tumor information including limited histological and molecular features.
1Estimated tumor cell abundance among cells present
2Percentage positively stained cells
The ten xenografts were classified as ADC or SCC based on the histologies of the primary tumors and corresponding xenografts; there was good concordance in differentiation grades (
A relatively rapid (i.e. approximately 2 h per sample analysis), 1D LC-MS/MS approach was used to complete an initial analysis of proteins expressed in the 10 xenograft samples. This was followed by a more comprehensive, but time consuming (i.e. 8 h per sample analysis) 2D analysis (described below) 12. Results were tabulated as normalized MS/MS spectral counts per assigned gene product (see Experimental Procedures) Variability in protein detection and expression were assessed by comparing results of technical and biological replicates at the protein and peptide levels. When a sample (ADC1) was analyzed in triplicate 493 proteins out of a total of 564, were identified in each replicate, indicating an overlap of 87%. To assess biological variation, equivalent portions of ADC1 were expanded in two different recipient mice, and then analyzed. The overlap between the two samples was 88% at the protein level: 491 proteins out of a total of 559, were found in both samples. These numbers suggested a reasonable degree of reproducibility at the protein level between samples analyzed by 1D LC-MS/MS.
The 1D LC-MS/MS analysis of the 10 samples was considered a preliminary scan of the NSCLC xenograft proteomes, and allowed identification of 635 proteins. See Experimental Procedures for protein identification criteria. As shown in the dendogram in
Prominent among the highly differentially expressed proteins were 8 keratin (KRT) gene products, with KRT7 highly expressed in ADC, and seven others that were more highly expressed in SCC. The other two proteins found more highly expressed in ADC compared to SCC were the urea cycle component Carbamoyl-Phosphate Synthase (CSP1), and Anterior Gradient homolog 2 (AGR2). These data indicate that the 1D platform was sufficient to resolve ADC and SCC subtypes based on their significantly different proteomes.
In order to increase statistical significance and proteome coverage, a more rigorous protocol involving triplicate analysis by 2D LC-MS/MS, MudPIT (Multidimensional Protein Identification Technology) was applied 12, 20, 21. Each sample was analyzed in triplicate and MS/MS spectral counts tabulated essentially as described in Experimental Procedures 13-16. As a product of the 30 individual 2D analyses, 2015 proteins were identified (Table 3).
Expressed human proteins were subjected to hierarchical clustering analysis to determine if the ADC and SCC proteomes were distinct.
By application of the Wilcoxon test, 178 human proteins were significantly different in their average expression between the ADC and SCC subtypes (Table 7). Within this set, 50 proteins were increased or decreased >10-fold in ADC compared with SCC xenografts (Table 4 A and 4B).
Table 4A and 4B. Proteins highly differentially expressed in NSCLC
This highly differential subset included 8 keratins, including 6 that were identified as 10-fold differentially expressed in the 1D data set. The proteins AGR2 and CPS1 were again identified as more abundant in ADC: 17.4-fold (p=0.008) for AGR2 (Table 4A), and 5.0-fold (p=0.03) for CPS1.
Of the 28 known human epithelial keratins (Moll et al. 2008), 22 were detected in the panel of xenografts, as summarized in Table 5.
KRT18
I
83.5
11%
20.4
20%
69.4
38%
108.9
3%
113.5
18%
22.5
6%
KRT7
II
87.9
12%
48.3
24%
29.9
46%
3%
80.6
12%
KRT5
II
155.9
2%
KRT14
I
73.5
48%
148.4
4%
KRT15
I
183.1
3%
KRT6A
II
160.1
4%
KRT16
I
118.0
3%
KRT17
I
53.1
10%
26.9
51%
42%
0%
23.5
13%
55.6
2%
KRT4
II
184.1
5%
KRT13
I
505.4
5%
KRT18
I
5.9
16%
33.7
6%
10.3
26%
18.6
13%
4.3
0.2
0.032
KRT7
II
9.8
19%
28.4
0.04
0.008
KRT5
II
179.0
7%
101.1
6%
190.7
23%
98.9
667.3
0.008
KRT14
I
457.4
7%
61.2
39%
110.0
243.0
29%
0.1
13.7
0.016
KRT15
I
114.9
3%
41.1
6%
96.1
21%
90.4
34%
483.3
0.008
KRT6A
II
238.9
2%
88.6
4%
179.6
27%
154.7
0.3%
755.8
0.008
KRT16
I
146.3
2%
13.5
12%
84.2
30%
139.6
46%
461.2
0.008
KRT17
I
167.0
6%
121.7
7%
96.0
22%
134.3
57%
0.2
5.3
0.008
KRT4
II
18.3
27%
0.2
0%
20.9
18%
8.2
64%
213.0
0.032
KRT13
I
507.7
3%
139.8
1%
294.1
18%
352.5
37%
1654.8
0.008
Ten KRT proteins were significantly differentially expressed between ADC and SCC subtypes (Table 5, boldface, KRTs 18, 7, 5, 14, 15, 6A, 16, 17, 4, 13). Table 5 is organized by grouping the KRT proteins according to their known expression in simple and stratified epithelia, and, where known, in type I/II pairs, which assemble as obligate heterodimers for intermediate filament assembly 5, 22.
In order to validate and extend the information on these proteins that was generated by 1D and 2D tandem MS, additional analyses were performed. This included IHC on FFPE tissue sections, Western immuno blotting, and SRM-MS. Table 1 summarizes the IHC information related to the EGFR and certain keratins. Table 6 lists the peptides and corresponding transitions that were measured as part of a single, mulitplexed SRM (or MRM) method that was used to scan the xenografts.
In
The IHC staining for keratins 5 and 6 (CK5/6) was negative for the ADC samples, and 100% positive for the SCC xenografts (Table 1,
Analysis of KRT19 by IHC and SRM was similar: Replicate measurement of two KRT19 peptides (Table 6) gave very similar results (
By 2D LC-MS/MS analysis the EGFR was identified in 6 xenografts, 3-each ADC and SCC, but was not identified as differentially expressed between the two subtypes. To further examine EGFR expression and activation, additional data were generated by application of IHC (Table 1), Western immuno blotting, and SRM-MS. As shown in
Phosphotyrosine (pTyr or pY)-directed Western blotting was used to assess the activation of EGFR in the xenografts. Tyr1068 becomes phosphorylated upon EGFR activation, and through direct binding of the adaptor protein GRB2 is coupled to stimulation of the RAS-ERK signaling axis 23. Probing with antibodies to pY1068 provided qualitative results indicating activation of EGFR to some extent in ADCs 1-3 and SCC3. Activation in ADC3 was expected since this xenograft harbors the EGFR kinase-domain-activating exon 19 deletion (Table 1). Staining of whole-tissue extracts with antibodies to pTyr revealed a prominent band at the expected size of the EGFR in ADC3 and SCC3, and to a lesser extent in ADCs 1 and 2. This is consistent the pY1068 staining, and suggests EGFR was activated in SCC3 in addition to its relatively high level of expression. The anti-pTyr blot also indicated distinct patterns of pY-proteins in the tumors. A strong signal migrating at Mr 55K-65K was evident in ADC1. Discernible bands at Mr 62K were present in ADCs 2 and 4, and SCCs 1-3, and bands at approximately Mr 38K in ADC3, ADC5, SCC4. Tissue micro array and subjective IHC scoring (Table 1) were in general agreement with the EGFR expression data presented in
The purpose of this pilot study was twofold. First, to determine the feasibility of using a proteomics platform comprised of a high resolution LC-MS/MS instrument for 1D and 2D comprehensive protein signature discovery, and combined with an LC-triple quadrupole instrument for multiplexed SRM-based relative quantification of signature proteins of interest. A similar approach of protein profiling leading to SRM/MRM-based quantification was effectively applied as part of a comprehensive platform to characterize a mouse model of breast cancer 24. The second goal was to glean insights into the protein profiles expressed in a perceived information-rich resource represented by xenografts established from primary resected tumors. Proteomic profiles lacking detailed protein identifications have been shown to effectively stratify NSCLC tumors 25. This report provides the most detailed analysis of protein expression in NSCLC to-date, and demonstrated effective recognition of ADC and SCC subtypes based on their unique proteomics signatures. The set of 10 tumors used in this pilot study did not include examples of the more rare, large cell carcinoma. It is expected that as data accumulate by analysis of a greater diversity of xenografts, it is likely that the proteomics profiles will stratify into more groups than the traditionally recognized ADC, SCC and large cell subtypes. The effective translation of proteomic signatures into multiplexed SRM (or MRM) assays, which may be applied to quantify proteins in minute surgical samples, represents a new strategy to stratify tumors. This will facilitate, in the first instance, case controlled studies of outcome that may correlate with an expanded set of proteome-defined tumor subtypes.
The quantitative results from 2D LC-MS/MS spectral counting and SRM were remarkably consistent with each other, and complemented the IHC observations. Compared to the 2D method, the 1D LC-MS/MS approach was simpler and faster, both technically and with respect to data analysis. It revealed the highly differential expression of both several keratins and the two urea cycle enzymes. The 2D findings verified the results of the 1D analysis and provided a more statistically robust data set, and with greater proteome coverage. Both approaches resulted in the identification of a set of highly differentially expressed proteins that were detected at levels differing by at least 10-fold between the 5 ADC and 5 SCC tumor-derived xenografts. More than 200 proteins (216) were identified as statistically different between ADC and SCC tumors, which includes 178 identified the 2D data set and another non-overlapping 38 from the 1D analysis.
A key element of the experimental plan was the comparison of traditional pathology laboratory methods such as IHC with quantitative proteomics. The MS-based proteomics results were verified by the IHC and Western data. In terms of tumor characterization, the SRM results complemented IHC which retains the advantage of revealing protein subcellular localization and cellular organization and heterogeneity.
Among the most abundant proteins detected by MS analysis were epithelial keratins, and they were among the most strikingly highly differentially expressed. For example, six KRTs (5, 15, 6A, 16, 4, 13) were detected exclusively in SCC (Table 5). As expected, the ADC xenografts were predominantly characterized by keratins typically associated with simple epithelia; SCC also expressed these to some extent, but were notable for their expression of KRTs associated with stratified epithelium. Some but not all ADC also expressed very low levels of the squamous-type (i.e. stratified epithelial) keratins
KRT14 and KRT17. The less characterized KRT80, which was detected previously in lung 26 was identified in one SCC tumor (SCC1), and three SCC xenografts expressed the rare KRT78 (refer to Table 5). SCC3 was unique both in its high level expression of EGFR, and as the only SCC found to express KRT7, which was otherwise only seen in ADC. The KRT7 expression in this instance may be an indication SCC3 arose through squamous metaplasia. The keratins are a key structural component of the 3-dimensional epithelial barrier 5. In reference to the role of keratins in epithelia, Moll et al. 5 stated “this main cytoskeletal function transcends the single cell level.” Hence, the measurement of KRT proteins in the primary xenograft model provides insight into a key structural component of three dimensional lung tumors. van Dorst et al. 27 examined by IHC a limited set of keratins in adenocarcinomas and squamous cell carcinomas, including 16 from lung, and noted the difficulty in classifying the tumors by this method. The ability to comprehensively measure KRTs as demonstrated in this study suggests that efficient classification of ADC and SCC subtypes may be achieved, if not assisted by multiplexed-SRM-mediated, comprehensive KRT profiles.
Plakophilin-1 (PKP1) was also found highly differentially expressed in SCC, and functions in the linkage of intermediate filaments to desmosomes28. Interestingly, PKP1 over expression correlates with increased cell proliferation and size, and regulates translation through interaction with eIF4A129. The related protein PKP3 is up-regulated and oncogenic in NSCLC 30. While the present study was not aimed at identifying target proteins differentially expressed between tumor and normal tissue, this example illustrates the ability to discover and link tumor molecular markers with the cancer phenotype. It is predicted that the biomarkers will be differentially expressed between tumor and normal tissue. [The discovery of differential expression of the urea cycle enzyme CPS1 illustrates the potential monitoring of metabolic profiles by proteomics. Elevated ARG2 in NSCLC, and in large cell carcinoma in particular, was observed previously 31. Mutationally-activated forms of EGFR are recognized drug targets in NSCLC, but how EGFR markers, such as EGFR protein expression, gene copy number, and mutation status, should be incorporated into clinical decision making remains an evolving and contentious issue 32. EGFR expression was expected in both ADC and SCC, and was reproducibly relatively quantified by spectral counting and SRM. These results complement well the measurement of EGFR by IHC, which informs of positive cells, and subcellular localization (e.g. peripheral staining corresponding to plasma membrane localization). However, it has been recognized that various EGFR mutations, ligands, and therapeutic treatments can differentially affect EGFR protein stability and subcellular localization. Therefore, the accurate quantification of EGFR protein levels by SRM may enable the further stratification of NSCLC in terms of EGFR levels, beyond what has been achieved by IHC, measures of gene copy number, and mutations. In the 10 xenografts analyzed in this study, the range in EGFR expression was approximately 50-fold. Also, SRM measurement of a spiked-in, stable-isotope-labeled EGFR peptide (identical in sequence to the m/z 604.87 peptide, Table 6) allowed estimation of the level of EGFR at approximately 6×105 copies per cell in SCC3, which expressed the highest amount of EGFR. These examples illustrate the potential application of SRM to quantify drug target protein levels with greater precision than is achieved by current methods such as IHC. This may facilitate a better assessment of correlations in EGFR protein levels and responsiveness to EGFR-directed drugs.
The sensitivity and versatility (i.e. multiplexing) of SRM enabled the assembly of a single assay to measure keratins, the target EGFR, and examples of metabolic enzymes. A more detailed examination of human/tumor and murine/stroma material is under investigation. In conclusion, the methods and compositions are useful for the development of SRM-based assays to measure NSCLC subtypes, and the levels of expression and activation of validated drug targets such as the EGFR (and phosphorylated EGFR), and metabolic enzyme. Additional clinical utility will be realized as these methods are further adapted for the analysis of FFPE patient tissue specimens, as recently demonstrated 33. This information and strategic approach has the potential improve the recognition and treatment of NSCLC, and other cancers.
While the present disclosure has been described with reference to what are presently considered to be the preferred examples, it is to be understood that the invention is not limited to the disclosed examples. To the contrary, the invention is intended to cover various modifications and equivalent arrangements included within the spirit and scope of the appended claims.
All publications, patents and patent applications are herein incorporated by reference in their entirety to the same extent as if each individual publication, patent or patent application was specifically and individually indicated to be incorporated by reference in its entirety. All sequences (e.g. nucleotide, including RNA and cDNA, and polypeptide sequences) of genes listed in Table 2, 4A, 4B, 6 and/or 7, for example referred to by accession number are herein incorporated specifically by reference.
This application claims the benefit of 35 USC 119 based on the priority of copending U.S. Provisional Application No. 61/380,250 filed Sep. 5, 2010, which is herein incorporated by reference.
Number | Date | Country | |
---|---|---|---|
61380250 | Sep 2010 | US |