GENE EXPRESSION MARKERS FOR RESPONSE TO EGFR INHIBITOR DRUGS

Information

  • Patent Application
  • 20080318230
  • Publication Number
    20080318230
  • Date Filed
    December 03, 2007
    17 years ago
  • Date Published
    December 25, 2008
    16 years ago
Abstract
The present invention concerns prognostic markers associated with cancer. In particular, the invention concerns prognostic methods based on the molecular characterization of gene expression in paraffin-embedded, fixed samples of cancer tissue, which allow a physician to predict whether a patient is likely to respond well to treatment with an EGFR inhibitor.
Description
FIELD OF THE INVENTION

The present invention concerns gene expression profiling of tissue samples obtained from patients who are candidates for treatment with a therapeutic EGFR inhibitor. More specifically, the invention provides methods based on the molecular characterization of gene expression in paraffin-embedded, fixed cancer tissue samples, which allow a physician to predict whether a patient is likely to respond well to treatment with an EGFR inhibitor.


DESCRIPTION OF THE RELATED ART

Oncologists have a number of treatment options available to them, including different combinations of chemotherapeutic drugs that are characterized as “standard of care,” and a number of drugs that do not carry a label claim for particular cancer, but for which there is evidence of efficacy in that cancer. Best likelihood of good treatment outcome requires that patients be assigned to optimal available cancer treatment, and that this assignment be made as quickly as possible following diagnosis.


Currently, diagnostic tests used in clinical practice are single analyte, and therefore do not capture the potential value of knowing relationships between dozens of different markers. Moreover, diagnostic tests are frequently not quantitative, relying on immunohistochemistry. This method often yields different results in different laboratories, in part because the reagents are not standardized, and in part because the interpretations are subjective and cannot be easily quantified. RNA-based tests have not often been used because of the problem of RNA degradation over time and the fact that it is difficult to obtain fresh tissue samples from patients for analysis. Fixed paraffin-embedded tissue is more readily available. Fixed tissue has been routinely used for non-quantitative detection of RNA, by in situ hybridization. However, recently methods have been established to quantify RNA in fixed tissue, using RT-PCR. This technology platform can also form the basis for multi-analyte assays.


Recently, several groups have published studies concerning the classification of various cancer types by microarray gene expression analysis (see, e.g. Golub et al., Science 286:531-537 (1999); Bhattacharjae et al., Proc. Natl. Acad. Sci. USA 98:13790-13795 (2001); Chen-Hsiang et al., Bioinformatics 17 (Suppl. 1):S316-S322 (2001); Ramaswamy et al., Proc. Natl. Acad. Sci. USA 98:15149-15154 (2001)). Certain classifications of human breast cancers based on gene expression patterns have also been reported (Martin et al., Cancer Res. 60:2232-2238 (2000); West et al., Proc. Natl. Acad. Sci. USA 98:11462-11467 (2001); Sorlie et al., Proc. Natl. Acad. Sci. USA 98:10869-10874 (2001); Yan et al., Cancer Res. 61:8375-8380 (2001)). However, these studies mostly focus on improving and refining the already established classification of various types of cancer, including breast cancer, and generally do not link the findings to treatment strategies in order to improve the clinical outcome of cancer therapy.


Although modern molecular biology and biochemistry have revealed hundreds of genes whose activities influence the behavior of tumor cells, the state of their differentiation, and their sensitivity or resistance to certain therapeutic drugs, with a few exceptions, the status of these genes has not been exploited for the purpose of routinely making clinical decisions about drug treatments. One notable exception is the use of estrogen receptor (ER) protein expression in breast carcinomas to select patients to treatment with anti-estrogen drugs, such as tamoxifen. Another exceptional example is the use of ErbB2 (Her2) protein expression in breast carcinomas to select patients with the Her2 antagonist drug Herceptin® (Genentech, Inc., South San Francisco, Calif.).


Despite recent advances, a major challenge in cancer treatment remains to target specific treatment regimens to pathogenically distinct tumor types, and ultimately personalize tumor treatment in order to optimize outcome. Hence, a need exists for tests that simultaneously provide predictive information about patient responses to the variety of treatment options.


SUMMARY OF THE INVENTION

The present invention is based on findings of a Phase II clinical study of gene expression in tissue samples obtained from human patients with non-small cell lung cancer (NSCLC) who responded or did not respond to treatment with EGFR inhibitors.


In one aspect, the invention concerns a method for predicting the likelihood that a cancer patient who is a candidate for treatment with a therapeutic EGFR inhibitor will respond to treatment with an EGFR inhibitor, comprising determining the expression level of one or more prognostic RNA transcripts or their expression products in a biological sample comprising tumor cells, such as a tumor tissue specimen, obtained from the patient, wherein the prognostic transcript is the transcript of one or more genes selected from the group consisting of:


hCRAa; LAMC2; B2M; STAT5B; LMYC; CKAP4; TAGLN; Furin; DHFR; CCND3; TITF1; FUS; FLT1; TIMP2; RASSF1; WISP1; VEGFC; GPX2; CTSH; AKAP12; APC; RPL19; IGFBP6; Bak; CyclinG1; Hepsin1; MMP2; XIAP; MUC1; STMY3; PDGFRb; GSTp; p53R2; DPYD; IGFBP3; MMP9; RRM; KRT17; PDGFRa; EPHX1; E2F1; HNF3A; mGST1; STAT3; IGF1R; EGFR; cdc25A; RPLPO; YB-1; CKAP4; Kitlng; HER2; Surfact A; BTC; PGK1; MTA1; FOLR1; Claudin 4, EMP1, wherein


(a) increased expression of one or more of hCRAa; LAMC2; STAT5B; CKAP4; TAGLN; Furin; FUS; FLT1; TIMP2; RASSF1; WISP1; VEGFC; GPX2; AKAP12; RPL19; IGFBP6; MMP2; STMY3; PDGFRb; GSTp; IGFBP3; MMP9; KRT17; PDGFRa; IGF1R; cdc25A; RPLPO; YB-1; CKAP4, EMP1 or the corresponding expression product, indicates that the patient is not likely to respond well to treatment with an EGFR inhibitor, and


(b) increased expression of one or more of B2M; LMYC; DHFR; CCND3; TITF1; CTSH; APC; Bak; CyclinG1; Hepsin1; XIAP; MUC1; p53R2; DPYD; RRM; EPHX1; E2F1; HNF3A; mGST1; STAT3; EGFR; Kitlng; HER2; Surfact A; BTC; PGK1; MTA1; FOLR1; Claudin 4, or the corresponding gene product, indicates that the patient is likely to respond well to treatment with an EGFR inhibitor.


The tissue sample preferably is a fixed, paraffin-embedded tissue. Tissue can be obtained by a variety of methods, including fine needle, aspiration, bronchial lavage, or transbronchial biopsy.


In a specific embodiment, the expression level of the prognostic RNA transcript or transcripts is determined by RT-PCR. In this case, and when the tissue sample is fixed, and paraffin-embedded, the RT-PCR amplicons (defined as the polynucleotide sequence spanned by the PCR primers) should preferably be less than 100 bases in length. In other embodiments, the levels of the expression product of the prognostic RNA transcripts are determined by other methods known in the art, such as immunohistochemistry, or proteomics technology. The assays for measuring the prognostic RNA transcripts or their expression products may be available in a kit format.


In another aspect, the invention concerns an array comprising polynucleotides hybridizing to one or more of the following genes: hCRA a; LAMC2; B2M; STAT5B; LMYC; CKAP4; TAGLN; Furin; DHFR; CCND3; TITF1; FUS; FLT1; TIMP2; RASSF1; WISP1; VEGFC; GPX2; CTSH; AKAP12; APC; RPL19; IGFBP6; Bak; CyclinG1; Hepsin1; MMP2; XIAP; MUC1; STMY3; PDGFRb; GSTp; p53R2; DPYD; IGFBP3; MMP9; RRM; KRT17; PDGFRa; EPHX1; E2F1; HNF3A; mGST1; STAT3; IGF1R; EGFR; cdc25A; RPLPO; YB-1; CKAP4; Kitlng; HER2; Surfact A; BTC; PGK1; MTA1; FOLR1; Claudin 4; EMP1, immobilized on a solid surface. The polynucleotides can be cDNA or oligonucleotides. The cDNAs are typically about 500 to 5000 bases long, while the oligonucleotides are typically about 20 to 80 bases long. An array can contain a very large number of cDNAs, or oligonucleotides, e.g. up to about 330,000 oligonucleotides. The solid surface presenting the array can, for example, be glass. The levels of the product of the gene transcripts can be measured by any technique known in the art, including, for example, immunohistochemistry or proteomics.


In various embodiments, the array comprises polynucleotides hybridizing to two at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least nine, at least ten, at least eleven, at least twelve, at least thirteen, at least fourteen, at least fifteen, at least seventeen, at least eighteen, at least nineteen, at least twenty, at least twenty-one, at least twenty-two, at least twenty-three, at least twenty-four, at least twenty-five, at least twenty-six, or all twenty-seven of the genes listed above. In a particular embodiment, hybridization is performed under stringent conditions.


In other embodiments, the array may comprise more than one polynucleotide hybridizing to the same gene.


In yet another embodiment, the array may comprise intron-based sequences, the expression of which correlated with the expression of a corresponding exon. Arrays comprising such intron-based sequences are disclosed, for example, in copending application Ser. No. 10/783,884 filed on Feb. 19, 2004, and in its PCT counterpart PCT/US04/05287 filed on Feb. 19, 2004.


The invention further concerns a method of preparing a personalized genomics profile for a patient, comprising the steps of:


(a) subjecting RNA extracted from cancer tissue obtained from the patient to gene expression analysis;


(b) determining the expression level in the tissue of one or more genes selected from the group consisting of hCRA a; LAMC2; B2M; STAT5B; LMYC; CKAP4; TAGLN; Furin; DHFR; CCND3; TITF1; FUS; FLT1; TIMP2; RASSF1; WISP1; VEGFC; GPX2; CTSH; AKAP12; APC; RPL19; IGFBP6; Bak; CyclinG1; Hepsin1; MMP2; XIAP; MUC1; STMY3; PDGFRb; GSTp; p53R2; DPYD; IGFBP3; MMP9; RRM; KRT17; PDGFRa; EPHX1; E2F1; HNF3A; mGST1; STAT3; IGF1R; EGFR; cdc25A; RPLPO; YB-1; CKAP4; Kitlng; HER2; Surfact A; BTC; PGK1; MTA1; FOLR1; Claudin 4; EMP1, wherein the expression level is normalized against a control gene or genes and optionally is compared to the amount found in a corresponding cancer reference tissue set; and


(c) creating a report summarizing the data obtained by said gene expression analysis.


The report may include treatment recommendations, and the method may comprise a step of treating the patient following such treatment recommendations.


The invention additionally concerns a method for amplification of a gene selected from the group consisting of hCRA a; LAMC2; B2M; STAT5B; LMYC; CKAP4; TAGLN; Furin; DHFR; CCND3; TITF1; FUS; FLT1; TIMP2; RASSF1; WISP1; VEGFC; GPX2; CTSH; AKAP12; APC; RPL19; IGFBP6; Bak; CyclinG1; Hepsin1; MMP2; XIAP; MUC1; STMY3; PDGFRb; GSTp; p53R2; DPYD; IGFBP3; MMP9; RRM; KRT17; PDGFRa; EPHX1; E2F1; HNF3A; mGST1; STAT3; IGF1R; EGFR; cdc25A; RPLPO; YB-1; CKAP4; Kitlng; HER2; Surfact A; BTC; PGK1; MTA1; FOLR1; Claudin 4; EMP1 by polymerase chain reaction (PCR), comprising performing said PCR by using a corresponding amplicon listed in Table 3, and a corresponding primer-probe set listed in Table 4.


The invention further encompasses any PCR primer-probe set listed in Table 4 and any PCR amplicon listed in Table 3.


BRIEF DESCRIPTION OF THE DRAWINGS

Table 1 is a list of genes, expression of which correlates, positively or negatively, with patient response to treatment with an EGFR inhibitor.


Table 2 shows the results of binary statistical analysis of a list of genes, expression of which correlates with patient response to treatment with an EGFR inhibitor.


Table 3 is a list of genes, expression of which predict patient response to treatment with an EGFR inhibitor. The table includes accession numbers for the genes, and sequences for the forward and reverse primers (designated by “f” and “r”, respectively) and probes (designated by “p”) used for PCR amplification.


Table 4 shows the amplicon sequences used in PCR amplification of the indicated genes.







DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT
A Definitions

Unless defined otherwise, technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Singleton et al., Dictionary of Microbiology and Molecular Biology 2nd ed., J. Wiley & Sons (New York, N.Y. 1994), and March, Advanced Organic Chemistry Reactions, Mechanisms and Structure 4th ed., John Wiley & Sons (New York, N.Y. 1992), provide one skilled in the art with a general guide to many of the terms used in the present application.


One skilled in the art will recognize many methods and materials similar or equivalent to those described herein, which could be used in the practice of the present invention. Indeed, the present invention is in no way limited to the methods and materials described. For purposes of the present invention, the following terms are defined below.


The term “microarray” refers to an ordered arrangement of hybridizable array elements, preferably polynucleotide probes, on a substrate.


The term “polynucleotide,” when used in singular or plural, generally refers to any polyribonucleotide or polydeoxribonucleotide, which may be unmodified RNA or DNA or modified RNA or DNA. Thus, for instance, polynucleotides as defined herein include, without limitation, single- and double-stranded DNA, DNA including single- and double-stranded regions, single- and double-stranded RNA, and RNA including single- and double-stranded regions, hybrid molecules comprising DNA and RNA that may be single-stranded or, more typically, double-stranded or include single- and double-stranded regions. In addition, the term “polynucleotide” as used herein refers to triple-stranded regions comprising RNA or DNA or both RNA and DNA. The strands in such regions may be from the same molecule or from different molecules. The regions may include all of one or more of the molecules, but more typically involve only a region of some of the molecules. One of the molecules of a triple-helical region often is an oligonucleotide. The term “polynucleotide” specifically includes cDNAs. The term includes DNAs (including cDNAs) and RNAs that contain one or more modified bases. Thus, DNAs or RNAs with backbones modified for stability or for other reasons are “polynucleotides” as that term is intended herein. Moreover, DNAs or RNAs comprising unusual bases, such as inosine, or modified bases, such as tritiated bases, are included within the term “polynucleotides” as defined herein. In general, the term “polynucleotide” embraces all chemically, enzymatically and/or metabolically modified forms of unmodified polynucleotides, as well as the chemical forms of DNA and RNA characteristic of viruses and cells, including simple and complex cells.


The term “oligonucleotide” refers to a relatively short polynucleotide, including, without limitation, single-stranded deoxyribonucleotides, single- or double-stranded ribonucleotides, RNA:DNA hybrids and double-stranded DNAs. Oligonucleotides, such as single-stranded DNA probe oligonucleotides, are often synthesized by chemical methods, for example using automated oligonucleotide synthesizers that are commercially available. However, oligonucleotides can be made by a variety of other methods, including in vitro recombinant DNA-mediated techniques and by expression of DNAs in cells and organisms.


The terms “differentially expressed gene,” “differential gene expression” and their synonyms, which are used interchangeably, refer to a gene whose expression is activated to a higher or lower level in a subject suffering from a disease, specifically cancer, such as breast cancer, relative to its expression in a normal or control subject. The terms also include genes whose expression is activated to a higher or lower level at different stages of the same disease. It is also understood that a differentially expressed gene may be either activated or inhibited at the nucleic acid level or protein level, or may be subject to alternative splicing to result in a different polypeptide product. Such differences may be evidenced by a change in mRNA levels, surface expression, secretion or other partitioning of a polypeptide, for example. Differential gene expression may include a comparison of expression between two or more genes or their gene products, or a comparison of the ratios of the expression between two or more genes or their gene products, or even a comparison of two differently processed products of the same gene, which differ between normal subjects and subjects suffering from a disease, specifically cancer, or between various stages of the same disease. Differential expression includes both quantitative, as well as qualitative, differences in the temporal or cellular expression pattern in a gene or its expression products among, for example, normal and diseased cells, or among cells which have undergone different disease events or disease stages. For the purpose of this invention, “differential gene expression” is considered to be present when there is at least an about two-fold, preferably at least about four-fold, more preferably at least about six-fold, most preferably at least about ten-fold difference between the expression of a given gene in normal and diseased subjects, or in various stages of disease development in a diseased subject.


The term “over-expression” with regard to an RNA transcript is used to refer the level of the transcript determined by normalization to the level of reference mRNAs, which might be all measured transcripts in the specimen or a particular reference set of mRNAs.


The phrase “gene amplification” refers to a process by which multiple copies of a gene or gene fragment are formed in a particular cell or cell line. The duplicated region (a stretch of amplified DNA) is often referred to as “amplicon.” Usually, the amount of the messenger RNA (mRNA) produced, i.e., the level of gene expression, also increases in the proportion of the number of copies made of the particular gene expressed.


The term “prognosis” is used herein to refer to the prediction of the likelihood of cancer-attributable death or progression, including recurrence, metastatic spread, and drug resistance, of a neoplastic disease, such as non-small cell lung cancer, or head and neck cancer. The term “prediction” is used herein to refer to the likelihood that a patient will respond either favorably or unfavorably to a drug or set of drugs, and also the extent of those responses, or that a patient will survive, following surgical removal or the primary tumor and/or chemotherapy for a certain period of time without cancer recurrence. The predictive methods of the present invention can be used clinically to make treatment decisions by choosing the most appropriate treatment modalities for any particular patient. The predictive methods of the present invention are valuable tools in predicting if a patient is likely to respond favorably to a treatment regimen, such as surgical intervention, chemotherapy with a given drug or drug combination, and/or radiation therapy, or whether long-term survival of the patient, following surgery and/or termination of chemotherapy or other treatment modalities is likely.


The term “long-term” survival is used herein to refer to survival for at least 1 year, more preferably for at least 2 years, most preferably for at least 5 years following surgery or other treatment.


The term “increased resistance” to a particular drug or treatment option, when used in accordance with the present invention, means decreased response to a standard dose of the drug or to a standard treatment protocol.


The term “decreased sensitivity” to a particular drug or treatment option, when used in accordance with the present invention, means decreased response to a standard dose of the drug or to a standard treatment protocol, where decreased response can be compensated for (at least partially) by increasing the dose of drug, or the intensity of treatment.


“Patient response” can be assessed using any endpoint indicating a benefit to the patient, including, without limitation, (1) inhibition, to some extent, of tumor growth, including slowing down and complete growth arrest; (2) reduction in the number of tumor cells; (3) reduction in tumor size; (4) inhibition (i.e., reduction, slowing down or complete stopping) of tumor cell infiltration into adjacent peripheral organs and/or tissues; (5) inhibition (i.e. reduction, slowing down or complete stopping) of metastasis; (6) enhancement of anti-tumor immune response, which may, but does not have to, result in the regression or rejection of the tumor; (7) relief, to some extent, of one or more symptoms associated with the tumor; (8) increase in the length of survival following treatment; and/or (9) decreased mortality at a given point of time following treatment.


The term “treatment” refers to both therapeutic treatment and prophylactic or preventative measures, wherein the object is to prevent or slow down (lessen) the targeted pathologic condition or disorder. Those in need of treatment include those already with the disorder as well as those prone to have the disorder or those in whom the disorder is to be prevented. In tumor (e.g., cancer) treatment, a therapeutic agent may directly decrease the pathology of tumor cells, or render the tumor cells more susceptible to treatment by other therapeutic agents, e.g., radiation and/or chemotherapy.


The term “tumor,” as used herein, refers to all neoplastic cell growth and proliferation, whether malignant or benign, and all pre-cancerous and cancerous cells and tissues.


The terms “cancer” and “cancerous” refer to or describe the physiological condition in mammals that is typically characterized by unregulated cell growth. Examples of cancer include but are not limited to, breast cancer, colon cancer, lung cancer, prostate cancer, hepatocellular cancer, gastric cancer, pancreatic cancer, cervical cancer, ovarian cancer, liver cancer, bladder cancer, cancer of the urinary tract, thyroid cancer, renal cancer, carcinoma, melanoma, head and neck cancer, and brain cancer.


The “pathology” of cancer includes all phenomena that compromise the well-being of the patient. This includes, without limitation, abnormal or uncontrollable cell growth, metastasis, interference with the normal functioning of neighboring cells, release of cytokines or other secretory products at abnormal levels, suppression or aggravation of inflammatory or immunological response, neoplasia, premalignancy, malignancy, invasion of surrounding or distant tissues or organs, such as lymph nodes, etc.


The term “EGFR inhibitor” as used herein refers to a molecule having the ability to inhibit a biological function of a native epidermal growth factor receptor (EGFR). Accordingly, the term “inhibitor” is defined in the context of the biological role of EGFR. While preferred inhibitors herein specifically interact with (e.g. bind to) an EGFR, molecules that inhibit an EGFR biological activity by interacting with other members of the EGFR signal transduction pathway are also specifically included within this definition. A preferred EGFR biological activity inhibited by an EGFR inhibitor is associated with the development, growth, or spread of a tumor. EGFR inhibitors, without limitation, include peptides, non-peptide small molecules, antibodies, antibody fragments, antisense molecules, and oligonucleotide decoys.


“Stringency” of hybridization reactions is readily determinable by one of ordinary skill in the art, and generally is an empirical calculation dependent upon probe length, washing temperature, and salt concentration. In general, longer probes require higher temperatures for proper annealing, while shorter probes need lower temperatures. Hybridization generally depends on the ability of denatured DNA to reanneal when complementary strands are present in an environment below their melting temperature. The higher the degree of desired homology between the probe and hybridizable sequence, the higher the relative temperature which can be used. As a result, it follows that higher relative temperatures would tend to make the reaction conditions more stringent, while lower temperatures less so. For additional details and explanation of stringency of hybridization reactions, see Ausubel et al., Current Protocols in Molecular Biology, Wiley Interscience Publishers, (1995).


“Stringent conditions” or “high stringency conditions”, as defined herein, typically: (1) employ low ionic strength and high temperature for washing, for example 0.015 M sodium chloride/0.0015 M sodium citrate/0.1% sodium dodecyl sulfate at 50° C.; (2) employ during hybridization a denaturing agent, such as formamide, for example, 50% (v/v) formamide with 0.1% bovine serum albumin/0.1% Ficoll/0.1% polyvinylpyrrolidone/50 mM sodium phosphate buffer at pH 6.5 with 750 mM sodium chloride, 75 mM sodium citrate at 42° C.; or (3) employ 50% formamide, 5×SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5× Denhardt's solution, sonicated salmon sperm DNA (50 μg/ml), 0.1% SDS, and 10% dextran sulfate at 42° C., with washes at 42° C. in 0.2×SSC (sodium chloride/sodium citrate) and 50% formamide at 55° C., followed by a high-stringency wash consisting of 0.1×SSC containing EDTA at 55° C.


“Moderately stringent conditions” may be identified as described by Sambrook et al., Molecular Cloning: A Laboratory Manual, New York: Cold Spring Harbor Press, 1989, and include the use of washing solution and hybridization conditions (e.g., temperature, ionic strength and % SDS) less stringent that those described above. An example of moderately stringent conditions is overnight incubation at 37° C. in a solution comprising: 20% formamide, 5×SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5× Denhardt's solution, 10% dextran sulfate, and 20 mg/ml denatured sheared salmon sperm DNA, followed by washing the filters in 1×SSC at about 37-50° C. The skilled artisan will recognize how to adjust the temperature, ionic strength, etc. as necessary to accommodate factors such as probe length and the like.


In the context of the present invention, reference to “at least one,” “at least two,” “at least five,” etc. of the genes listed in any particular gene set means any one or any and all combinations of the genes listed.


In the context of the present invention, reference to “at least one,” “at least two,” “at least five,” etc. of the genes listed in any particular gene set means any one or any and all combinations of the genes listed.


The term “normalized” with regard to a gene transcript or a gene expression product refers to the level of the transcript or gene expression product relative to the mean levels of transcripts/products of a set of reference genes, wherein the reference genes are either selected based on their minimal variation across, patients, tissues or treatments (“housekeeping genes”), or the reference genes are the totality of tested genes. In the latter case, which is commonly referred to as “global normalization”, it is important that the total number of tested genes be relatively large, preferably greater than 50. Specifically, the term ‘normalized’ with respect to an RNA transcript refers to the transcript level relative to the mean of transcript levels of a set of reference genes. More specifically, the mean level of an RNA transcript as measured by TaqMan® RT-PCR refers to the Ct value minus the mean Ct values of a set of reference gene transcripts.


The terms “expression threshold,” and “defined expression threshold” are used interchangeably and refer to the level of a gene or gene product in question above which the gene or gene product serves as a predictive marker for patient response or resistance to a drug. The threshold typically is defined experimentally from clinical studies. The expression threshold can be selected either for maximum sensitivity (for example, to detect all responders to a drug), or for maximum selectivity (for example to detect only responders to a drug), or for minimum error.


B. DETAILED DESCRIPTION

The practice of the present invention will employ, unless otherwise indicated, conventional techniques of molecular biology (including recombinant techniques), microbiology, cell biology, and biochemistry, which are within the skill of the art. Such techniques are explained fully in the literature, such as, “Molecular Cloning: A Laboratory Manual”, 2nd edition (Sambrook et al., 1989); “Oligonucleotide Synthesis” (M. J. Gait, ed., 1984); “Animal Cell Culture” (R.I. Freshney, ed., 1987); “Methods in Enzymology” (Academic Press, Inc.); “Handbook of Experimental Immunology”, 4th edition (D. M. Weir & C. C. Blackwell, eds., Blackwell Science Inc., 1987); “Gene Transfer Vectors for Mammalian Cells” (J. M. Miller & M. P. Calos, eds., 1987); “Current Protocols in Molecular Biology” (F. M. Ausubel et al., eds., 1987); and “PCR: The Polymerase Chain Reaction”, (Mullis et al., eds., 1994).


1. Gene Expression Profiling

Methods of gene expression profiling include methods based on hybridization analysis of polynucleotides, methods based on sequencing of polynucleotides, and proteomics-based methods. The most commonly used methods known in the art for the quantification of mRNA expression in a sample include northern blotting and in situ hybridization (Parker & Barnes, Methods in Molecular Biology 106:247-283 (1999)); RNAse protection assays (Hod, Biotechniques 13:852-854 (1992)); and PCR-based methods, such as reverse transcription polymerase chain reaction (RT-PCR) (Weis et al., Trends in Genetics 8:263-264 (1992)). Alternatively, antibodies may be employed that can recognize specific duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA hybrid duplexes or DNA-protein duplexes. Representative methods for sequencing-based gene expression analysis include Serial Analysis of Gene Expression (SAGE), and gene expression analysis by massively parallel signature sequencing (MPSS).


2. PCR-based Gene Expression Profiling Methods

a Reverse Transcriptase PCR (RT-PCR)


One of the most sensitive and most flexible quantitative PCR-based gene expression profiling methods is RT-PCR, which can be used to compare mRNA levels in different sample populations, in normal and tumor tissues, with or without drug treatment, to characterize patterns of gene expression, to discriminate between closely related mRNAs, and to analyze RNA structure.


The first step is the isolation of mRNA from a target sample. The starting material is typically total RNA isolated from human tumors or tumor cell lines, and corresponding normal tissues or cell lines, respectively. Thus RNA can be isolated from a variety of primary tumors, including breast, lung, colon, prostate, brain, liver, kidney, pancreas, spleen, thymus, testis, ovary, uterus, head and neck, etc., tumor, or tumor cell lines, with pooled DNA from healthy donors. If the source of MRNA is a primary tumor, MRNA can be extracted, for example, from frozen or archived paraffin-embedded and fixed (e.g. formalin-fixed) tissue samples.


General methods for mRNA extraction are well known in the art and are disclosed in standard textbooks of molecular biology, including Ausubel et al., Current Protocols of Molecular Biology, John Wiley and Sons (1997). Methods for RNA extraction from paraffin embedded tissues are disclosed, for example, in Rupp and Locker, Lab Invest. 56:A67 (1987), and De Andres et al., BioTechniques 18:42044 (1995). In particular, RNA isolation can be performed using purification kit, buffer set and protease from commercial manufacturers, such as Qiagen, according to the manufacturer's instructions. For example, total RNA from cells in culture can be isolated using Qiagen RNeasy mini-columns. Other commercially available RNA isolation kits include MasterPureT™ Complete DNA and RNA Purification Kit (EPICENTRE®, Madison, Wis.), and Paraffin Block RNA Isolation Kit (Ambion, Inc.). Total RNA from tissue samples can be isolated using RNA Stat-60 (Tel-Test). RNA prepared from tumor can be isolated, for example, by cesium chloride density gradient centrifiugation.


As RNA cannot serve as a template for PCR, the first step in gene expression profiling by RT-PCR is the reverse transcription of the RNA template into cDNA, followed by its exponential amplification in a PCR reaction. The two most commonly used reverse transcriptases are avilo myeloblastosis virus reverse transcriptase (AMV-RT) and Moloney murine leukemia virus reverse transcriptase (MMLV-RT). The reverse transcription step is typically primed using specific primers, random hexamers, or oligo-dT primers, depending on the circumstances and the goal of expression profiling. For example, extracted RNA can be reverse-transcribed using a GeneAmp RNA PCR kit (Perkin Elmer, Calif., USA), following the manufacturer's instructions. The derived cDNA can then be used as a template in the subsequent PCR reaction.


Although the PCR step can use a variety of thermostable DNA-dependent DNA polymerases, it typically employs the Taq DNA polymerase, which has a 5′-3′ nuclease activity but lacks a 3′-5′ proofreading endonuclease activity. Thus, TaqMan® PCR typically utilizes the 5′-nuclease activity of Taq or Tth polymerase to hydrolyze a hybridization probe bound to its target amplicon, but any enzyme with equivalent 5′ nuclease activity can be used. Two oligonucleotide primers are used to generate an amplicon typical of a PCR reaction. A third oligonucleotide, or probe, is designed to detect nucleotide sequence located between the two PCR primers. The probe is non-extendible by Taq DNA polymerase enzyme, and is labeled with a reporter fluorescent dye and a quencher fluorescent dye. Any laser-induced emission from the reporter dye is quenched by the quenching dye when the two dyes are located close together as they are on the probe. During the amplification reaction, the Taq DNA polymerase enzyme cleaves the probe in a template-dependent manner. The resultant probe fragments disassociate in solution, and signal from the released reporter dye is free from the quenching effect of the second fluorophore. One molecule of reporter dye is liberated for each new molecule synthesized, and detection of the unquenched reporter dye provides the basis for quantitative interpretation of the data.


TaqMan® RT-PCR can be performed using commercially available equipment, such as, for example, ABI PRISM 7700™ Sequence Detection System™ (Perkin-Elmer-Applied Biosystems, Foster City, Calif., USA), or Lightcycler (Roche Molecular Biochemicals, Mannheim, Germany). In a preferred embodiment, the 5′ nuclease procedure is run on a real-time quantitative PCR device such as the ABI PRISM 7700™ Sequence Detection System™. The system consists of a thermocycler, laser, charge-coupled device (CCD), camera and computer. The system amplifies samples in a 96-well format on a thermocycler. During amplification, laser-induced fluorescent signal is collected in real-time through fiber optics cables for all 96 wells, and detected at the CCD. The system includes software for running the instrument and for analyzing the data.


5′-Nuclease assay data are initially expressed as Ct, or the threshold cycle. As discussed above, fluorescence values are recorded during every cycle and represent the amount of product amplified to that point in the amplification reaction. The point when the fluorescent signal is first recorded as statistically significant is the threshold cycle (Ct).


To minimize errors and the effect of sample-to-sample variation, RT-PCR is usually performed using an internal standard. The ideal internal standard is expressed at a relatively constant level among different tissues, and is unaffected by the experimental treatment. RNAs frequently used to normalize patterns of gene expression are mRNAs for the housekeeping genes glyceraldehyde-3-phosphate-dehydrogenase (GAPDH) and β-actin.


A more recent variation of the RT-PCR technique is the real time quantitative PCR, which measures PCR product accumulation through a dual-labeled fluorigenic probe (i.e., TaqMan® probe). Real time PCR is compatible both with quantitative competitive PCR, where internal competitor for each target sequence is used for normalization, and with quantitative comparative PCR using a normalization gene contained within the sample, or a housekeeping gene for RT-PCR. For further details see, e.g. Held et al., Genome Research 6:986-994 (1996).


The steps of a representative protocol for profiling gene expression using fixed, paraffin-embedded tissues as the RNA source, including mRNA isolation, purification, primer extension and amplification are given in various published journal articles {for example: T. E. Godfrey et al. J. Molec. Diagnostics 2: 84-91 [2000]; K. Specht et al., Am. J. Pathol. 158: 419-29 [2001]}. Briefly, a representative process starts with cutting about 10 μm thick sections of paraffin-embedded tumor tissue samples. The RNA is then extracted, and protein and DNA are removed. After analysis of the RNA concentration, RNA repair and/or amplification steps may be included, if necessary, and RNA is reverse transcribed using gene specific promoters followed by RT-PCR.


b. MassARRAY System


In the MassARRAY-based gene expression profiling method, developed by Sequenom, Inc. (San Diego, Calif.) following the isolation of RNA and reverse transcription, the obtained cDNA is spiked with a synthetic DNA molecule (competitor), which matches the targeted cDNA region in all positions, except a single base, and serves as an internal standard. The cDNA/competitor mixture is PCR amplified and is subjected to a post-PCR shrimp alkaline phosphatase (SAP) enzyme treatment, which results in the dephosphorylation of the remaining nucleotides. After inactivation of the alkaline phosphatase, the PCR products from the competitor and cDNA are subjected to primer extension, which generates distinct mass signals for the competitor- and cDNA-derives PCR products. After purification, these products are dispensed on a chip array, which is pre-loaded with components needed for analysis with matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOF MS) analysis. The cDNA present in the reaction is then quantified by analyzing the ratios of the peak areas in the mass spectrum generated. For further details see, e.g. Ding and Cantor, Proc. Natl. Acad. Sci. USA 100:3059-3064 (2003).


c. Other PCR-based Methods


Further PCR-based techniques include, for example, differential display (Liang and Pardee, Science 257:967-971 (1992)); amplified fragment length polymorphism (iAFLP) (Kawamoto et al., Genome Res. 12:1305-1312 (1999)); BeadArray™ technology (Illumina, San Diego, Calif.; Oliphant et al., Discovery of Markers for Disease (Supplement to Biotechniques), June 2002; Ferguson et al., Analytical Chemistry 72:5618 (2000)); BeadsArray for Detection of Gene Expression (BADGE), using the commercially available Luminex100 LabMAP system and multiple color-coded microspheres (Luminex Corp., Austin, Tex.) in a rapid assay for gene expression (Yang et al., Genome Res. 11:1888-1898 (2001)); and high coverage expression profiling (HiCEP) analysis (Fukumura et al., Nucl. Acids. Res. 31(16) e94 (2003)).


3. Microarrays


Differential gene expression can also be identified, or confirmed using the microarray technique. Thus, the expression profile of breast cancer-associated genes can be measured in either fresh or paraffin-embedded tumor tissue, using microarray technology. In this method, polynucleotide sequences of interest (including cDNAs and oligonucleotides) are plated, or arrayed, on a microchip substrate. The arrayed sequences are then hybridized with specific DNA probes from cells or tissues of interest. Just as in the RT-PCR method, the source of mRNA typically is total RNA isolated from human tumors or tumor cell lines, and corresponding normal tissues or cell lines. Thus RNA can be isolated from a variety of primary tumors or tumor cell lines. If the source of mRNA is a primary tumor, mRNA can be extracted, for example, from frozen or archived paraffin-embedded and fixed (e.g. formalin-fixed) tissue samples, which are routinely prepared and preserved in everyday clinical practice.


In a specific embodiment of the microarray technique, PCR amplified inserts of cDNA clones are applied to a substrate in a dense array. Preferably at least 10,000 nucleotide sequences are applied to the substrate. The microarrayed genes, immobilized on the microchip at 10,000 elements each, are suitable for hybridization under stringent conditions. Fluorescently labeled cDNA probes may be generated through incorporation of fluorescent nucleotides by reverse transcription of RNA extracted from tissues of interest. Labeled cDNA probes applied to the chip hybridize with specificity to each spot of DNA on the array. After stringent washing to remove non-specifically bound probes, the chip is scanned by confocal laser microscopy or by another detection method, such as a CCD camera. Quantitation of hybridization of each arrayed element allows for assessment of corresponding mRNA abundance. With dual color fluorescence, separately labeled cDNA probes generated from two sources of RNA are hybridized pairwise to the array. The relative abundance of the transcripts from the two sources corresponding to each specified gene is thus determined simultaneously. The miniaturized scale of the hybridization affords a convenient and rapid evaluation of the expression pattern for large numbers of genes. Such methods have been shown to have the sensitivity required to detect rare transcripts, which are expressed at a few copies per cell, and to reproducibly detect at least approximately two-fold differences in the expression levels (Schena et al., Proc. Natl. Acad. Sci. USA 93(2):106-149 (1996)). Microarray analysis can be performed by commercially available equipment, following manufacturer's protocols, such as by using the Affymetrix GenChip technology, or Agilent's microarray technology.


The development of microarray methods for large-scale analysis of gene expression makes it possible to search systematically for molecular markers of cancer classification and outcome prediction in a variety of tumor types.


4. Serial Analysis of Gene Expression (SAGE)


Serial analysis of gene expression (SAGE) is a method that allows the simultaneous and quantitative analysis of a large number of gene transcripts, without the need of providing an individual hybridization probe for each transcript. First, a short sequence tag (about 10-14 bp) is generated that contains sufficient information to uniquely identify a transcript, provided that the tag is obtained from a unique position within each transcript. Then, many transcripts are linked together to form long serial molecules, that can be sequenced, revealing the identity of the multiple tags simultaneously. The expression pattern of any population of transcripts can be quantitatively evaluated by determining the abundance of individual tags, and identifying the gene corresponding to each tag. For more details see, e.g. Velculescu et al., Science 270:484-487 (1995); and Velculescu et al., Cell 88:243-51 (1997).


5. Gene Expression Analysis bv Massively Parallel Signature Sequencing (MPSS)


This method, described by Brenner et al., Nature Biotechnology 18:630-634 (2000), is a sequencing approach that combines non-gel-based signature sequencing with in vitro cloning of millions of templates on separate 5 μm diameter microbeads. First, a microbead library of DNA templates is constructed by in vitro cloning. This is followed by the assembly of a planar array of the template-containing microbeads in a flow cell at a high density (typically greater than 3×106 microbeads/cm2). The free ends of the cloned templates on each microbead are analyzed simultaneously, using a fluorescence-based signature sequencing method that does not require DNA fragment separation. This method has been shown to simultaneously and accurately provide, in a single operation, hundreds of thousands of gene signature sequences from a yeast cDNA library.


6. Immunohistochemistry


Imnmunohistochemistry methods are also suitable for detecting the expression levels of the prognostic markers of the present invention. Thus, antibodies or antisera, preferably polyclonal antisera, and most preferably monoclonal antibodies specific for each marker are used to detect expression. The antibodies can be detected by direct labeling of the antibodies themselves, for example, with radioactive labels, fluorescent labels, hapten labels such as, biotin, or an enzyme such as horse radish peroxidase or alkaline phosphatase. Alternatively, unlabeled primary antibody is used in conjunction with a labeled secondary antibody, comprising antisera, polyclonal antisera or a monoclonal antibody specific for the primary antibody. Immunohistochemistry protocols and kits are well known in the art and are commercially available.


7. Proteomics


The term “proteome” is defined as the totality of the proteins present in a sample (e.g. tissue, organism, or cell culture) at a certain point of time. Proteomics includes, among other things, study of the global changes of protein expression in a sample (also referred to as “expression proteomics”). Proteomics typically includes the following steps: (1) separation of individual proteins in a sample by 2-D gel electrophoresis (2-D PAGE); (2) identification of the individual proteins recovered from the gel, e.g. my mass spectrometry or N-terminal sequencing, and (3) analysis of the data using bioinformatics. Proteomics methods are valuable supplements to other methods of gene expression profiling, and can be used, alone or in combination with other methods, to detect the products of the prognostic markers of the present invention.


8. General Description of mRNA Isolation, Purification and Amplification


The steps of a representative protocol for profiling gene expression using fixed, paraffin-embedded tissues as the RNA source, including mRNA isolation, purification, primer extension and amplification are given in various published journal articles (for example: T. E. Godfrey et al. J. Molec. Diagnostics 2: 84-91 [2000]; K. Specht et al., Am. J. Pathol. 158: 419-29 [2001]). Briefly, a representative process starts with cutting about 10 μm thick sections of paraffin-embedded tumor tissue samples. The RNA is then extracted, and protein and DNA are removed. After analysis of the RNA concentration, RNA repair and/or amplification steps may be included, if necessary, and RNA is reverse transcribed using gene specific promoters followed by RT-PCR. Finally, the data are analyzed to identify the best treatment option(s) available to the patient on the basis of the characteristic gene expression pattern identified in the tumor sample examined.


9. EGFR Inhibitors


The epidermal growth factor receptor (EGFR) family (which includes EGFR, erb-B2, erb-B3, and erb-B4) is a family of growth factor receptors that are frequently activated in epithelial malignancies. Thus, the epidermal growth factor receptor (EGFR) is known to be active in several tumor types, including, for example, ovarian cancer, pancreatic cancer, non-small cell lung cancer {NSCLC}, breast cancer, and head and neck cancer. Several EGFR inhibitors, such as ZD1839 (also known as gefitinib or Iressa); and OS1774 (Erlotinib, Tarceva™), are promising drug candidates for the treatment of cancer.


Iressa, a small synthetic quinazoline, competitively inhibits the ATP binding site of EGFR, a growth-promoting receptor tyrosine kinase, and has been in Phase III clinical trials for the treatment of non-small-cell lung carcinoma. Another EGFR inhibitor, [agr]cyano-[bgr]methyl-N-[(trifluoromethoxy)phenyl]-propenamide (LFM-A12), has been shown to inhibit the proliferation and invasiveness of human breast cancer cells.


Cetuximab is a monoclonal antibody that blocks the EGFR and EGFR- dependent cell growth. It is currently being tested in phase III clinical trials.


Tarceva™ has shown promising indications of anti-cancer activity in patients with advanced ovarian cancer, and non-small cell lung and head and neck carcinomas.


The present invention provides valuable molecular markers that predict whether a patient who is a candidate for treatment with an EGFR inhibitor drug is likely to respond to treatment with an EGFR inhibitor.


The listed examples of EGFR inhibitors represent both small organic molecule and anti-EGFR antibody classes of drugs. The findings of the present invention are equally applicable to other EGFR inhibitors, including, without limitation, antisense molecules, small peptides, etc.


Further details of the invention will be apparent from the following non-limiting Example.


EXAMPLE

A Phase II Study of Gene Expression in non-small cell lung cancer (NSCL)


A gene expression study was designed and conducted with the primary goal to molecularly characterize gene expression in paraffin-embedded, fixed tissue samples of NSCLC patients who did or did not respond to treatment with an EGFR inhibitor. The results are based on the use of one EGFR inhibitor.


Study Design

Molecular assays were performed on paraffin-embedded, formalin-fixed tumor tissues obtained from 39 individual patients diagnosed with NSCLC. Patients were included in the study only if histopathologic assessment, performed as described in the Materials and Methods section, indicated adequate amounts of tumor tissue. All patients had a history of prior treatment for NSCLC, and the nature of pretreatment varied.


Materials and Methods

Each representative tumor block was characterized by standard histopathology for diagnosis, semi-quantitative assessment of amount of tumor, and tumor grade. A total of 6 sections (10 microns in thickness each) were prepared and placed in two Costar Brand Microcentrifuge Tubes (Polypropylene, 1.7 mL tubes, clear; 3 sections in each tube). If the tumor constituted less than 30% of the total specimen area, the sample may have been dissected by the pathologist, putting the tumor tissue directly into the Costar tube.


If more than one tumor block was obtained as part of the surgical procedure, the block most representative of the pathology was used for analysis.


Gene Expression Analysis

mRNA was extracted and purified from fixed, paraffin-embedded tissue samples, and prepared for gene expression analysis as described above.


Molecular assays of quantitative gene expression were performed by RT-PCR, using the ABI PRISM 7900™ Sequence Detection System™ (Perkin-Elmer-Applied Biosystems, Foster City, Calif., USA). ABI PRISM 7900™ consists of a thermocycler, laser, charge-coupled device (CCD), camera and computer. The system amplifies samples in a 384-well format on a thermocycler. During amplification, laser-induced fluorescent signal is collected in real-time through fiber optics cables for all 384 wells, and detected at the CCD. The system includes software for running the instrument and for analyzing the data.


Analysis and Results

Tumor tissue was analyzed for 187 cancer-related genes and 5 reference genes. The threshold cycle (CT) values for each patient were normalized based on the mean of all genes for that particular patient. Clinical outcome data were available for all patients.


Outcomes were evaluated in two ways, each breaking patients into two groups with respect to response.


One analysis categorized complete or partial response [RES] as one group, and stable disease (min of 3 months) or progressive disease as the other group [NR]. The second analysis grouped patients with respect to clinical benefit, where clinical benefit was defined as partial response, complete response, or stable disease at 3 months.


Response (partial response and complete response) was determined by the Response Evaluation Criteria In Solid Tumors (RECIST criteria). Stable disease was designated as the absence of aggressive disease for 3 or more months.


Analysis of Patients by t-test


Analysis was performed on all 39 treated patients to determine the relationship between normalized gene expression and the binary outcomes of RES (response) or NR (non-response). A t-test was performed on the group of patients classified as RES or NR and the p-values for the differences between the groups for each gene were calculated. The following table lists the 39 genes for which the p-value for the differences between the groups was <0.15. In this case response was defined as a partial or complete response, the former being >50% shrink of the tumor and the latter being disappearance of the tumor. As shown, response was identified in 7 patients.















TABLE 1







Mean
Mean

Valid N
Valid N



No Response
Response
p
No Response
Response





















DHFR
−2.35
−1.55
0.0095
32
7


TITF1
−4.64
−2.53
0.0108
32
7


B2M
−0.19
0.81
0.0126
32
7


MUC1
−1.13
0.49
0.0201
32
7


XIAP
−3.63
−2.98
0.0212
32
7


Furin
−3.64
−4.70
0.0333
32
7


STAT5B
−2.21
−2.71
0.0482
32
7


RRM1
−4.09
−3.52
0.0484
32
7


DPYD
−0.67
−0.17
0.0509
32
7


KRT17
−4.02
−5.90
0.0513
32
7


PDGFRa
−1.92
−3.16
0.0521
32
7


TIMP2
1.51
0.89
0.0522
32
7


EPHX1
−1.23
−0.31
0.0551
32
7


Hepsin
−7.02
−6.48
0.0617
32
7


E2F1
−5.09
−4.28
0.0620
32
7


HNF3A
−4.27
−3.03
0.0688
32
7


GPX2
−4.65
−6.30
0.0784
32
7


mGST1
−1.05
−0.08
0.0872
32
7


LAMC2
−3.67
−4.69
0.0874
32
7


STAT3
−0.01
0.42
0.1045
32
7


IGF1R
−3.99
−4.62
0.1051
32
7


WISP1
−5.23
−5.97
0.1065
32
7


p53R2
−2.79
−2.22
0.1125
32
7


EGFR
−2.25
−1.43
0.1151
32
7


cdc25A
−5.40
−5.92
0.1205
32
7


RPLPO
1.39
1.09
0.1217
32
7


TAGLN
0.58
−0.51
0.1255
32
7


YB-1
0.14
−0.11
0.1257
32
7


CKAP4
−1.37
−1.89
0.1262
32
7


KitIng
−3.62
−2.86
0.1291
32
7


HER2
−2.22
−1.33
0.1313
32
7


hCRA a
−5.86
−6.48
0.1332
32
7


Surfact A1
−1.00
2.24
0.1341
32
7


LMYC
−4.62
−4.20
0.1354
32
7


BTC
−6.16
−5.50
0.1390
32
7


PGK1
−1.18
−0.75
0.1400
32
7


MTA1
−3.48
−3.05
0.1451
32
7


FOLR1
−3.40
−1.81
0.1455
32
7


Claudin 4
−1.66
−0.94
0.1494
32
7









In the foregoing Table 1, lower mean expression Ct values indicate lower expression and, conversely, higher mean expression values indicate higher expression of a particular gene. Thus, for example, expression of the STAT5B gene was higher in patients who did not respond to EGFR inhibitor treatment than in patients that did respond to the treatment. Accordingly, elevated expression of STAT5B is an indication of poor outcome of treatment with an EGFR inhibitor. Phrasing it differently, if the STAT5B gene is over-expressed in a tissue simple obtained from the cancer of a NSCLC patient, treatment with an EGFR inhibitor is not likely to work, therefore, the physician is well advised to look for alternative treatment options.


Accordingly, the elevated expression of Furin; STAT5B; KRT17; PDGFRa; TIMP2; GPX2; LAMC2; IGF1R; WISP1; cdc25A; RPLPO; TAGLN; YB-1; CKAP4; or hCRA in a tumor is an indication that the patient is not likely to respond well to treatment with an EGFR inhibitor. On the other hand, elevated expression of DHFR; TITF1; B2M; MUC1; XIAP; RRM; DPYD; EPHX1; Hepsin; E2F1; HNF3A; mGST1; STAT3; p53R2; EGFR; Kitlng; HER2; Surfact A; LMYC; BTC; PGK1; MTA1; FOLR1, or Claudin 4 is an indication that the patient is likely to respond to EGFR inhibitor treatment.


In Table 2 below the binary analysis was carried with respect to clinical benefit, defined as either partial response, complete response, or stable disease. As shown, 12 patients met these criteria for clinical benefit.















TABLE 2







Mean
Mean

Valid N
Valid N



No Benefit
Benefit
p
No Benefit
Benefit





















hCRA a
−5.63
−6.75
0.0005
27
12


LAMC2
−3.40
−4.88
0.0017
27
12


B2M
−0.32
0.68
0.0022
27
12


STAT5B
−2.15
−2.65
0.0133
27
12


LMYC
−4.72
−4.16
0.0156
27
12


CKAP4
−1.27
−1.89
0.0271
27
12


TAGLN
0.77
−0.48
0.0305
27
12


Furin
−3.56
−4.44
0.0341
27
12


DHFR
−2.37
−1.84
0.0426
27
12


CCND3
−3.76
−3.06
0.0458
27
12


TITF1
−4.69
−3.30
0.0462
27
12


FUS
−2.15
−2.56
0.0496
27
12


FLT1
−6.01
−6.58
0.0501
27
12


TIMP2
1.55
1.05
0.0583
27
12


RASSF1
−3.23
−3.64
0.0619
27
12


WISP1
−5.15
−5.85
0.0657
27
12


VEGFC
−7.09
−7.35
0.0738
27
12


GPX2
−4.52
−5.91
0.0743
27
12


CTSH
−0.71
0.20
0.0743
27
12


AKAP12
−2.32
−3.26
0.0765
27
12


APC
−3.19
−2.77
0.0792
27
12


RPL19
2.06
1.75
0.0821
27
12


IGFBP6
−3.86
−4.79
0.0920
27
12


Bak
−4.01
−3.65
0.0985
27
12


Cyclin G1
−7.18
−7.01
0.0997
27
12


Hepsin
−7.04
−6.65
0.1067
27
12


MMP2
0.28
−0.77
0.1080
27
12


XIAP
−3.63
−3.25
0.1161
27
12


MUC1
−1.12
−0.20
0.1198
27
12


STMY3
−2.67
−3.67
0.1246
27
12


PDGFRb
−2.26
−3.01
0.1300
27
12


GSTp
0.48
0.05
0.1335
27
12


p53R2
−2.82
−2.38
0.1337
27
12


DPYD
−0.67
−0.36
0.1385
27
12


IGFBP3
−1.61
−2.31
0.1399
27
12


MMP9
−3.29
−4.07
0.1497
27
12









As shown in the above Table 2, 6 genes correlated with clinical benefit with p<0.1. Expression of hCRA a; LAMC2; STAT5B; CKAP4; TAGLN; Furin; FUS; FLT1; TIMP2; RASSF1; WISP1; VEGFC; GPX2; AKAP12; RPL19; IGFBP6; MMP2; STMY3; PDGFRb; GSTp; IGFBP3; or MMP9 was higher in patients who did not respond to anti-EGFR treatment. Thus, greater expression of these genes is an indication that patients are unlikely to benefit from anti-EGFR treatment. Conversely, expression of B2M; LMYC; DHFR; CCND3; TITF1; CTSH; APC; Bak; CyclinG1; Hepsin1; XIAP; MUC1; p53R2, or DPYD was higher in patients who did respond to anti-EGFR treatment. Greater expression of these genes indicates that patients are likely to benefit from anti-EGFR treatment.


In addition to the above analysis, robust logistic regression (David W. Hosmer, Jr. and Stanley Lameshow [2000] Applied Logistic Regression, Wiley, N.Y; Peter J. Huber [1981] Robust Statistics, John Wiley & Sons, N.Y.).was performed to assess the relationship between response and EMP1 reference normalized gene expression level. A robust logistic estimation procedure based on Hubers M-estimate2 was used to obtain an estimate of the probability of response as a function of EMP1 were obtained. Based on this analysis, it is estimated that a patient has less than a 10% probability of response for reference normalized EMP1 gene expression levels greater than −1.43. Therefore increased expression of the gene EMP1 decreases the likelihood of response to chemotherapy.


It is emphasized that while the data presented herein were obtained using tissue samples from NSCLC, the conclusions drawn from the tissue expression profiles are equally applicable to other cancers, such as, for example, colon cancer, ovarian cancer, pancreatic cancer, breast cancer, and head and neck cancer.


All references cited throughout the specification are hereby expressly incorporated by reference.


While the invention has been described with emphasis upon certain specific embodiments, it is be apparent to those skilled in the art that variations and modification in the specific methods and techniques are possible. Accordingly, this invention includes all modifications encompassed within the spirit and scope of the invention as defined by the following claims.












TABLE 3





Gene
Accession
Seq ID No.
Sequence



















AKAP12
NM_005100
SEQ ID NO:1
TAGAGAGCCCCTGACAATCCTGAGGCTTCATCAGGAGCTAGAGCCATTTTAACATTTCCT






CTTTCCAAGACCAACC





APC
NM_000038
SEQ ID NO:2
GGACAGCAGGAATGTGTTTCTCCATACAGGTCACGGGGAGCCAATGGTTCAGAAACAAAT





CGAGTGGGT





B2M
NM_004048
SEQ ID NO:3
GGGATCGAGACATGTAAGCAGCATCATGGAGGTTTGAAGATGCCGCATTTGGATTGGATG





AATTCCA





Bak
NM_001188
SEQ ID NO:4
CCATTCCCACCATTCTACCTGAGGCCAGGACGTCTGGGGTGTGGGGATTGGTGGGTCTAT





GTTCCC





BTC
NM_001729
SEQ ID NO:5
AGGGAGATGCCGCTTCGTGGTGGCCGAGCAGACGCCCTCCTGTGTCTGTGATGAAGGCTA





CATTGGAGCAAGGTGTGAGAG





CCND3
NM_001760
SEQ ID NO:6
CCTCTGTGCTACAGATTATACCTTTTGCCATGTACCCGCCATCCATGATCGCCACGGGCA





GCATTGGGGCTGCAGTG





cdc25A
NM_001789
SEQ ID NO:7
TCTTGCTGGCTACGCCTCTTCTGTCCCTGTTAGACGTCCTCCGTCCATATCAGAACTGTG





CCACAATGCAG





CKAP4
NM_006825
SEQ ID NO:8
AAAGCCTCAGTCAGCCAAGTGGAGGCGGACTTGAAAATGCTCAGGACTGCTGTGGACAGT





TTGGTT





Claudin 4
NM_001305
SEQ ID NO:9
GGCTGCTTTGCTGCAACTGTCCACCCCGCACAGACAAGCCTTACTCCGCCAAGTATTCTG





CTGCCCGCTCTG





CTSH
NM_004390
SEQ ID NO:10
GCAAGTTCCAACCTGGAAAGGCCATCGGCTTTTGTCAAGGATGTAGCCAACATCACAATC





TATGACGAGGAAGCGATG





Cyclin G1
NM_004060
SEQ ID NO:11
CTCCTCTTGCCTACGAGTCCCCTCTCCTCGTAGGCCTCTCGGATCTGATATCGTGGGGTG





AGGTGAG





DHFR
NM_000791
SEQ ID NO:12
TTGCTATAACTAAGTGCTTCTCCAAGACCCCAACTGAGTCCCCAGCACCTGCTACAGTGA





GCTGCCATTCCAC





DPYD
NM_000110
SEQ ID NO:13
AGGACGCAAGGAGGGTTTGTCACTGGCAGACTCGAGACTGTAGGCACTGCCATGGCCCCT





GTGCTCAGTAAGGACTCGGCGGACATC





E2F1
NM_005225
SEQ ID NO:14
ACTCCCTCTACCCTTGAGCAAGGGCAGGGGTCCCTGAGCTGTTCTTCTGCCCCATACTGA





AGGAACTGAGGCCTG





EGFR
NM_005228
SEQ ID NO:15
TGTCGATGGACTTCCAGAACCACCTGGGCAGCTGCCAAAAGTGTGATCCAAGCTGTCCCA





AT





EMP1
NM_001423
SEQ ID NO:16
GCTAGTACTTTGATGCTCCCTTGATGGGGTCCAGAGAGCCTCCCTGCAGCCACCAGACTT





GGCCTCCAGCTGTTC





EPHX1
NM_000120
SEQ ID NO:17
ACCGTAGGCTCTGCTCTGAATGACTCTCCTGTGGGTCTGGCTGCCTATATTCTAGAGAAG





TTTTCCACCTGGACCA





FLT1
NM_002019
SEQ ID NO:18
GGCTCCCGAATCTATCTTTGACAAAATCTACAGCACCAAGAGCGACGTGTGGTCTTACGG





AGTATTGCTGTGGGA





FOLR1
NM_016730
SEQ ID NO:19
GAACGCCAAGCACCACAAGGAAAAGCCAGGCCCCGAGGACAAGTTGCATGAGCAGTGTCG





ACCCTGG





Furin
NM_002569
SEQ ID NO:20
AAGTCCTCGATACGCACTATAGCACCGAGAATGACGTGGAGACCATCCGGGCCAGCGTCT





GCGCCCCCTGCCACGCCTCATGTGCCACATGCCAG





FUS
NM_004960
SEQ ID NO:21
GGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGA





GTCTGTGGCTGATTACTTCA





GPX2
NM_002083
SEQ ID NO:22
CACACAGATCTCCTACTCCATCCAGTCCTGAGGAGCCTTAGGATGCAGCATGCCTTCAGG





AGACACTGCTGGACC





GSTp
NM_000852
SEQ ID NO:23
GAGACCCTGCTGTCCCAGAACCAGGGAGGCAAGACCTTCATTGTGGGAGACCAGATCTCC





TTCGCTGACTACAACC





hCRA a
U78556
SEQ ID NO:24
TGACACCCTTACCTTCCTGAGAAATACCCCCTGGGAGCGCGGAAAGCAGAGCGGACAGGT





CAGTGACTTCTATTTTTGACTCGTGTTTTT





Hepsin
NM_002151
SEQ ID NO:25
AGGCTGCTGGAGGTCATCTCCGTGTGTGATTGCCCCAGAGGCCGTTTCTTGGCCGCCATC





TGCCAAGACTGTGGCCGCAGGAAG





HER2
NM_004448
SEQ ID NO:26
CGGTGTGAGAAGTGCAGCAAGCCCTGTGCCCGAGTGTGCTATGGTCTGGGCATGGAGCAC





TTGCGAGAGG





HNF3A
NM_004496
SEQ ID NO:27
TCCAGGATGTTAGGAACTGTGAAGATGGAAGGGCATGAAACCAGCGACTGGAACAGCTAC





TACGCAGACACGC





IGF1R
NM_000875
SEQ ID NO:28
GCATGGTAGCCGAAGATTTTCACAGTCAAAATCGGAGATTTTGGTATGACGCGAGATATC





TATGAGACAGACTATTACCGGAAA





IGFBP3
NM_000598
SEQ ID NO:29
ACGCACCGGGTGTCTGATCCCAAGTTCCACCCCCTCCATTCAAAGATAATCATCATCAAG





AAAGGGCA





IGFBP6
NM_002178
SEQ ID NO:30
TGAACCGCAGAGACCAACAGAGGAATCCAGGCACCTCTACCACGCCCTCCCAGCCCAATT





CTGCGGGTGTCCAAGAC





Kiting
NM_000899
SEQ ID NO:31
GTCCCCGGGATGGATGTTTTGCCAAGTCATTGTTGGATAAGCGAGATGGTAGtACAATTG





TCAGACAGCTTGACTGATC





KRT17
NM_000422
SEQ ID NO:32
CGAGGATTGGTTCTTCAGCAAGACAGAGGAACTGAACCGCGAGGTGGCCACCAAGAGTGA





GCTGGTGCAGAGT





LAMC2
NM_005562
SEQ ID NO:33
ACTCAAGCGGAAATTGAAGCAGATAGGTCTTATCAGCACAGTCTCCGCCTCCTGGATTCA





GTGTCTCGGCTTCAGGGAGT





LMYC
NM_012421
SEQ ID NO:34
CCCATCCAGAACACTGATTGCTGTCATTCAAGTGAAAGGGATGGAGGTCAGAAAGGGTGC





ATAGAAAGCAG





mGST1
NM_020300
SEQ ID NO:35
ACGGATCTACCACACCATTGCATATTTGACACCCCTTCCCCAGCCAAATAGAGCTTTGAG





TTTTTTTGTTGGATATGGA





MMP2
NM_004530
SEQ ID NO:36
CCATGATGGAGAGGCAGACATCATGATCAACTTTGGCCGCTGGGAGCATGGCGATGGATA





CCCCTTTGACGGTAAGGACGGACTCC





MMP9
NM_004994
SEQ ID NO:37
GAGAACCAATCTCACCGACAGGCAGCTGGCAGAGGAATACCTGTACCGCTATGGTTACAC





TCGGGTG





MTA1
NM_004689
SEQ ID NO:38
CCGCCCTCACCTGAAGAGAAACGCGCTCCTTGGCGGACACTGGGGGAGGAGAGGAAGAAG





CGCGGCTAACTTATTCC





MUC1
NM_002456
SEQ ID NO:39
GGCCAGGATCTGTGGTGGTACAATTGACTCTGGCCTTCCGAGAAGGTACCATCAATGTCC





ACGACGTGGAG





p53R2
AB036063
SEQ ID NO:40
CCCAGCTAGTGTTCCTCAGAACAAAGATTGGAAAAAGCTGGCCGAGAACCATTTATACAT





AGAGGAAGGGCTTACGG





PDGFRa
NM_006206
SEQ ID NO:41
GGGAGTTTCCAAGAGATGGACTAGTGCTTGGTCGGGTCTTGGGGTCTGGAGCGTTTGGGA





AGGTGGTTGAAG





PDGFRb
NM_002609
SEQ ID NO:42
CCAGCTCTCCTTCCAGCTACAGATCAATGTCCCTGTCCGAGTGCTGGAGCTAAGTGAGAG





CCACCC





PGKI
NM_000291
SEQ ID NO:43
AGAGCCAGTTGCTGTAGAACTCAAATCTCTGCTGGGCAAGGATGTTCTGTTCTTGAAGGA





CTGTGTAGGCCCAG





RASSF1
NM_007182
SEQ ID NO:44
AGTGGGAGACACCTGACCTTTCTCAAGCTGAGATTGAGCAGAAGATCAAGGAGTACAATG





CCCAGATCA





RPL19
NM_000981
SEQ ID NO:45
CCACAAGCTGAAGGCAGACAAGGCCCGCAAGAAGCTCCTGGCTGACCAGGCTGAGGCCCG





CAGGTCTAAGACCAAGGAAGCACGC





RPLPO
NM_001002
SEQ ID NO:46
CCATTCTATCATCAACGGGTACAAACGAGTCCTGGCCTTGTCTGTGGAGACGGATTACAC





CTTCCCACTTGCTGA





RRM1
NM_001033
SEQ ID NO:47
GGGCTACTGGCAGCTACATTGCTGGGACTAATGGCAATTCCAATGGCCTTGTACCGATGC





TGAGAG





STAT3
NM_003150
SEQ ID NO:48
TCACATGCCACTTTGGTGTTTCATAATdTCCTGGGAGAGATTGACCAGCAGTATAGCCGC





TTCCTGCAAG





STAT5B
NM_012448
SEQ ID NO:49
CCAGTGGTGGTGATCGTTCATGGCAGCCAGGACAACAATGCGACGGCCACTGTTCTCTGG





GACAATGCTTTTGC





STMY3
NM_005940
SEQ ID NO:50
CCTGGAGGCTGCAACATACCTCAATCCTGTCCCAGGCCGGATCCTCCTGAAGCCCTTTTC





GCAGCACTGCTATCCTCCAAAGCCATTTGTA





Surfact AI
NM_005411
SEQ ID NO:51
TGGCCCTCAACCTCATCTTGATGGCAGCCTCTGGTGCTGTGTGCGAAGTGAAGGACGTTT





GTGTTGGAAG





TAGLN
NM_003186
SEQ ID NO:52
GATGGAGCAGGTGGCTCAGTTCCTGAAGGCGGCTGAGGACTCTGGGGTCATCAAGACTGA





CATGTTCCAGACT





TIMP2
NM_003255
SEQ ID NO:53
TCACCCTCTGTGACTTCATCGTGCCCTGGGACACCCTGAGCACCACCCAGAAGAAGAGCC





TGAACCACA





TITF1
NM_003317
SEQ ID NO:54
CGACTCCGTTCTCAGTGTCTGACATCTTGAGTCCCCTGGAGGAAAGCTACAAGAAAGTGG





GCATGGAGGG





VEGFC
NM_005429
SEQ ID NO:55
CCTCAGCAAGACGTTATTTGAAATTACAGTGCCTCTCTCTCAAGGCCCCAAACCAGTAAC





AATCAGTTTTGCCAATCACACTT





WISP1
NM_003882
SEQ ID NO:56
AGAGGCATCCATGAACTTCACACTTGCGGGCTGCATCAGCACACGCTCCTATCAACCCAA





GTACTGTGGAGTTTG





XIAP
NM_001167
SEQ ID NO:57
GCAGTTGGAAGACACAGGAAAGTATCCCCAAATTGCAGATTTATCAACGGCTTTTATCTT





GAAAATAGTGCCACGCA





YB-1
NM_004559
SEQ ID NO:58
AGACTGTGGAGTTTGATGTTGTTGAAGGAGAAAAGGGTGCGGAGGCAGCAAATGTTACAG





GTCCTGGTGGTGTTCC





















TABLE 4





Gene
Accession
Name
SEQ ID NO
Sequence
Length





















AKAP12
NM_005100
S3499/AKAP12.f2
SEQ ID NO:59
TAGAGAGCCCCTGACAATCC
20






AKAP12
NM_005100
S3500/AKAP12.r2
SEQ ID NO:60
GGTTGGTCTTGGAAAGAGGA
20





AKAP12
NM_005100
S3502IAKAP12.p2
SEQ ID NO:61
TGGCTCTAGCTCCTGATGAAGCCTC
25





APC
NM_000038
S0022IAPC.f4
SEQ ID NO:82
GGACAGCAGGAATGTGTTTC
20





APC
NM_000038
S0024IAPC.r4
SEQ ID NO:83
ACCCACTCGATTTGTTTCTG
20





APC
NM_000038
S4888/APC.p4
SEQ ID NO:64
CATTGGCTCCCCGTGACCTGTA
22





B2M
NM_004048
S1355/B2M.f4
SEQ ID NO:65
GGGATCGAGACATGTAAGCA
20





B2M
NM_004048
S1356/B2M.r4
SEQ ID NO:66
TGGAATTCATCCAATCCAAAT
21





B2M
NM_004048
S4932/B2M.p4
SEQ ID NO:67
CGGCATCTTCAAACCTCCATGATG
24





Bak
NM_001188
S0037/Bak.f2
SEQ ID NO:68
CCATTCCCACCATTCTACCT
20





Bak
NM_001188
S0039/Bak.r2
SEQ ID NO:69
GGGAACATAGACCCACCAAT
20





Bak
NM_001188
S4724/Bak.p2
SEQ ID NO:70
ACACCCCAGACGTCCTGGCCT
21





BTC
NM_001729
S1216/BTC.f3
SEQ ID NO:71
AGGGAGATGCCGCTTCGT
18





BTC
NM_001729
S1217/BTC.r3
SEQ ID NO:72
CTCTCACACCTTGCTCCAATGTA
23





BTC
NM_001729
S4844/BTC.p3
SEQ ID NO:73
CCTTCATCACAGACACAGGAGGGCG
25





CCND3
NM_001760
S2799/CCND3.f1
SEQ ID NO:74
CCTCTGTGCTACAGATTATACCTTTGC
27





CCND3
NM_001760
S2800/CCND3.r1
SEQ ID NO:75
CACTGCAGCCCCAATGCT
18





CCND3
NM_001760
S4966/CCND3.p1
SEQ ID NO:76
TACCCGCCATCCATGATCGCCA
22





cdc25A
NM_001789
S0070/cdc25A.f4
SEQ ID NO:77
TCTTGCTGGCTACGCCTCTT
20





cdc25A
NM_001789
S0072/cdc25A.r4
SEQ ID NO:78
CTGCATTGTGGCACAGTTCTG
21





cdc25A
NM_001789
S4989/cdc25A.p4
SEQ ID NO:79
TGTCCCTGTTAGACGTCCTCCGTCCATA
28





CKAP4
NM_006825
S2381/CKAP4.f2
SEQ ID NO:80
AAAGCCTCAGTCAGCCAAGT
20





CKAP4
NM_006825
S2382/CKAP4.r2
SEQ ID NO:81
AACCAAACTGTCCACAGCAG
20





CKAP4
NM_006825
S4892/CKAP4.p2
SEQ ID NO:82
TCCTGAGCATTTTCAAGTCCGCCT
24





Claudin 4
NM_001305
S2209/Claudi.f2
SEQ ID NO:83
GGCTGCTTTGCTGCAACTG
19





Claudin 4
NM_001305
S2210/Claudi.r2
SEQ ID NO:84
CAGAGCGGGCAGCAGAATA
19





Claudin 4
NM_001305
S4781/Claudi.p2
SEQ ID NO:85
CGCACAGACAAGCCTTACTCCGCC
24





CTSH
NM_004390
S2363/CTSH.f2
SEQ ID NO:86
GCAAGTTCCAACCTGGAAAG
20





CTSH
NM_004390
S2364/CTSH.r2
SEQ ID NO:87
CATCGCTTCCTCGTCATAGA
20





CTSH
NM_004390
S4854/CTSH.p2
SEQ ID NO:88
TGGCTACATCCTTGACAAAGCCGA
24





Cyclin G1
NM_004060
S1946/Cyclin.f1
SEQ ID NO:89
CTCCTCTTGCCTACGAGTCC
20





Cyclin G1
NM_004060
S1947/Cyclin.r1
SEQ ID NO:90
CTCACCTCACCCCACGATA
19





Cyclin G1
NM_004060
S4755/Cyclin.p1
SEQ ID NO:91
CCTCTCCTCGTAGGCCTCTCGGAT
24





DHFR
NM_000791
S0097/DHFR.f2
SEQ ID NO:92
TTGCTATAACTAAGTGCTTCTCCAAGA
27





DHFR
NM_000791
S0099/DHFR.r2
SEQ ID NO:93
GTGGAATGGCAGCTCACTGTAG
22





DHFR
NM_000791
S4997/DHFR.p2
SEQ ID NO:94
CCCAACTGAGTCCCCAGCACCT
22





DPYD
NM_000110
SOIOO/DPYD.f2
SEQ ID NO:95
AGGACGCAAGGAGGGTTTG
19





DPYD
NM_000110
S0102/DPYD.r2
SEQ ID NO:96
GATGTCCGCCGAGTCCTTACT
21





DPYD
NM_000110
S4998/DPYD.p2
SEQ ID NO:97
CAGTGCCTACAGTCTCGAGTCTGCCAGTG
29





E2F1
NM_005225
S3063/E2F1.f3
SEQ ID NO:98
ACTCCCTCTACCCTTGAGCA
20





E2F1
NM_005225
S3064/E2F1.r3
SEQ ID NO:99
CAGGCCTCAGTTCCTTCAGT
20





E2F1
NM_005225
S4821/E2F1.p3
SEQ ID NO:100
CAGAAGAACAGCTCAGGGACCCCT
24





EGFR
NM_005228
S0103/EGFR.f2
SEQ ID NO:101
TGTCGATGGACTTCCAGAAC
20





EGFR
NM_005228
S0105/EGFR.r2
SEQ ID NO:102
ATTGGGACAGCTTGGATCA
19





EGFR
NM_005228
S4999/EGFR.p2
SEQ ID NO:103
CACCTGGGCAGCTGCCAA
18





EMP1
NM_001423
S2796/EMPI.f1
SEQ ID NO:104
GCTAGTACTTTGATGCTCCCTTGAT
25





EMP1
NM_001423
S2797/EMP1.r1
SEQ ID NO:105
GAACAGCTGGAGGCCAAGTC
20





EMP1
NM_001423
S4964/EMP1.p1
SEQ ID NO:106
CCAGAGAGCCTCCCTGCAGCCA
22





EPHX1
NM_000120
S1865/EPHX1.f2
SEQ ID NO:107
ACCGTAGGCTCTGCTCTGAA
20





EPHX1
NM_000120
S1866/EPHX1.r2
SEQ ID NO:108
TGGTCCAGGTGGAAAACTTC
20





EPHX1
NM_000120
S4754/EPHX1.p2
SEQ ID NO:109
AGGCAGCCAGACCCACAGGA
20





FLT1
NM_002019
S1732/FLT1.f3
SEQ ID NO:110
GGCTCCCGAATCTATCTTTG
20





FLT1
NM_002019
S1733/FLT1.r3
SEQ ID NO:111
TCCCACAGCAATACTCCGTA
20





FLT1
NM_002019
S4922/FLT1.p3
SEQ ID NO:112
CTACAGCACCAAGAGCGACGTGTG
24





FOLR1
NM_016730
S2406/FOLR1.f1
SEQ ID NO:113
GAACGCCAAGCACCACAAG
19





FOLR1
NM_016730
S2407/FOLR1.r1
SEQ ID NO:114
CCAGGGTCGACACTGCTCAT
20





FOLR1
NM_016730
S4912/FOLR1.p1
SEQ ID NO:115
AAGCCAGGCCCCGAGGACAAGTT
23





Furin
NM_002589
S2233/Furin.f1
SEQ ID NO:116
AAGTCCTCGATACGCACTATAGCA
24





Furin
NM_002569
S2234/Furin.r1
SEQ ID NO:117
CTGGCATGTGGCACATGAG
19





Furin
NM_002569
S4933/Furin.p1
SEQ ID NO:118
CCCGGATGGTCTCCACGTCAT
21





FUS
NM_004960
S2936/FUS.f1
SEQ ID NO:119
GGATAATTCAGACAACAACACCATCT
26





FUS
NM_004960
S2937/FUS.r1
SEQ ID NO:120
TGAAGTAATCAGCCACAGACTCAAT
25





FUS
NM_004960
S4801/FUS.p1
SEQ ID NO:121
TCAATTGTAACATTCTCACCCAGGCCTTG
29





GPX2
NM_002083
S2514/GPX2.f2
SEQ ID NO:122
CACACAGATCTCCTACTCCATCCA
24





GPX2
NM_002083
S2515/GPX2.r2
SEQ ID NO:123
GGTCCAGCAGTGTCTCCTGAA
21





GPX2
NM_002083
S4936/GPX2.p2
SEQ ID NO:124
CATGCTGCATCCTAAGGCTCCTCAGG
26





GSTp
NM_000852
S0136/GSTp.f3
SEQ ID NO:125
GAGACCCTGCTGTCCCAGAA
20





GSTp
NM_000852
S0138/GSTp.r3
SEQ ID NO:126
GGTTGTAGTCAGCGAAGGAGATC
23





GSTp
NM_000852
S5007/GSTp.p3
SEQ ID NO:127
TCCCACAATGAAGGTCTTGCCTCCCT
26





hCRA a
U78556
S2198/hCRA a.f2
SEQ ID NO:128
TGACACCCTTACCTTCCTGAGAA
23





hCRA a
U78556
S2199/hCRA a.r2
SEQ ID NO:129
AAAAACACGAGTCAAAAATAGAAGTCACT
29





hCRAa
U78556
S4928/hCRA a.p2
SEQ ID NO:130
TCTGCTTTCCGCGCTCCCAGG
21





Hepsin
NM_002151
S2269/Hepsin.f1
SEQ ID NO:131
AGGCTGCTGGAGGTCATCTC
20





Hepsin
NM_002151
S2270/Hepsin.r1
SEQ ID NO:132
CTTCCTGCGGCCACAGTCT
19





Hepsin
NM_002151
S4831/Hepsin.p1
SEQ ID NO:133
CCAGAGGCCGTTTCTTGGCCG
21





HER2
NM_004448
S0142/HER2.f3
SEQ ID NO:134
CGGTGTGAGAAGTGCAGCAA
20





HER2
NM_004448
S0144/HER2.r3
SEQ ID NO:135
CCTCTCGCAAGTGCTCCAT
19





HER2
NM_004448
S4729/HER2.p3
SEQ ID NO:136
CCAGACCATAGCACACTCGGGCAC
24





HNF3A
NM_004496
S0148/HNF3A.f1
SEQ ID NO:137
TCCAGGATGTTAGGAACTGTGAAG
24





HNF3A
NM_004498
S0150/HNF3A.r1
SEQ ID NO:138
GCGTGTCTGCGTAGTAGCTGTT
22





HNF3A
NM_004496
S5008/HNF3A.p1
SEQ ID NO:139
AGTCGCTGGTTTCATGCCCTTCCA
24





IGFIR
NM_000875
S1249/IGFIR.f3
SEQ ID NO:140
GCATGGTAGCCGAAGATTTCA
21





IGFIR
NM_000875
S1250/IGFIR.r3
SEQ ID NO:141
TTTCCGGTAATAGTCTGTCTCATAGATATC
30





IGFIR
NM_000875
S4895/IGFIR.p3
SEQ ID NO:142
CGCGTCATACCAAAATCTCCGATTTTGA
28





IGFBP3
NM_000598
S0157/IGFBP3.f3
SEQ ID NO:143
ACGCACCGGGTGTCTGA
17





IGFBP3
NM_000598
S0159/IGFBP3.r3
SEQ ID NO:144
TGCCCTTTCTTGATGATGATTATC
24





IGFBP3
NM_000598
S5011/IGFBP3.p3
SEQ ID NO:145
CCCAAGTTCCACCCCCTCCATTCA
24





IGFBP6
NM_002178
S2335/IGFBP6.f1
SEQ ID NO:146
TGAACCGCAGAGACCAACAG
20





IGFBP6
NM_002178
S2336/IGFBP6.r1
SEQ ID NO:147
GTCTTGGACACCCGCAGAAT
20





IGFBP6
NM_002178
S4851/IGFBP6.p1
SEQ ID NO:148
ATCCAGGCACCTCTACCACGCCCTC
25





Kiting
NM_000899
S0169/Kiting.f4
SEQ ID NO:149
GTCCCCGGGATGGATGTT
18





Kiting
NM_000899
S0171/Kiting.r4
SEQ ID NO:150
GATCAGTCAAGCTGTCTGACAATTG
25





Kiting
NM_000899
S5012/Kiting.p4
SEQ ID NO:151
CATCTCGCTTATCCAACAATGACTTGGCA
29





KRT17
NM_000422
S0172/KRT17.f2
SEQ ID NO:152
CGAGGATTGGTTCTTCAGCAA
21





KRT17
NM_000422
S0174/KRT17.r2
SEQ ID NO:153
ACTCTGCACCAGCTCACTGTTG
22





KRT17
NM_000422
S5013/KRT17.p2
SEQ ID NO:154
CACCTCGCGGTTCAGTTCCTCTGT
24





LAMC2
NM_005582
S2826/LAMC2.f2
SEQ ID NO:155
ACTCAAGCGGAAATTGAAGCA
21





LAMC2
NM_005562
S2827/LAMC2.r2
SEQ ID NO:156
ACTCCCTGAAGCCGAGACACT
21





LAMC2
NM_005562
S4969/LAMC2.p2
SEQ ID NO:157
AGGTCTTATCAGCACAGTCTCCGCCTCC
28





LMYC
NM_012421
S2863/LMYC.f2
SEQ ID NO:158
CCCATCCAGAACACTGATTG
20





LMYC
NM_012421
S2864/LMYC.r2
SEQ ID NO:159
CTGCTTTCTATGCACCCTTTC
21





LMYC
NM_012421
S4973/LMYC.p2
SEQ ID NO:160
TGACCTCCATCCCTTTCACTTGAATG
26





mGST1
NM_020300
S2245/mGST1.f2
SEQ ID NO:161
ACGGATCTACCACACCATTGC
21





mGST1
NM_020300
S2246/mGST1.r2
SEQ ID NO:162
TCCATATCCAACAAAAAAACTCAAAG
26





mGST1
NM_020300
S4830/mGST1.p2
SEQ ID NO:163
TTTGACACCCCTTCCCCAGCCA
22





MMP2
NM_004530
S1874/MMP2.f2
SEQ ID NO:164
CCATGATGGAGAGGCAGACA
20





MMP2
NM_004530
S1875/MMP2.r2
SEQ ID NO:165
GGAGTCCGTCCTTACCGTCAA
21





MMP2
NM_004530
S5039/MMP2.p2
SEQ ID NO:166
CTGGGAGCATGGCGATGGATACCC
24





MMP9
NM_004994
S0658/MMP9.f1
SEQ ID NO:167
GAGAACCAATCTCACCGACA
20





MMP9
NM_004994
S0657/MMP9.r1
SEQ ID NO:168
CACCCGAGTGTAACCATAGC
20





MMP9
NM_004994
S4760/MMP9.p1
SEQ ID NO:169
ACAGGTATTCCTCTGCCAGCTGCC
24





MTA1
NM_004689
S2369/MTA1.f1
SEQ ID NO:170
CCGCCCTCACCTGAAGAGA
19





MTA1
NM_004689
S2370/MTA1.r1
SEQ ID NO:171
GGAATAAGTTAGCCGCGCTTCT
22





MTA1
NM_004689
S4855/MTA1.p1
SEQ ID NO:172
CCCAGTGTCCGCCAAGGAGCG
21





MUC1
NM_002456
S0782/MUC1.f2
SEQ ID NO:173
GGCCAGGATCTGTGGTGGTA
20





MUC1
NM_002456
S0783/MUC1.r2
SEQ ID NO:174
CTCCACGTCGTGGACATTGA
20





MUC1
NM_002456
S4807/MUC1.p2
SEQ ID NO:175
CTCTGGCCTTCCGAGAAGGTACC
23





p53R2
AB036063
S2305/p53R2.f3
SEQ ID NO:176
CCCAGCTAGTGTTCCTCAGA
20





p53R2
AB036063
S2306/p53R2.r3
SEQ ID NO:177
CCGTAAGCCCTTCCTCTATG
20





p53R2
AB036063
S4847/p53R2.p3
SEQ ID NO:178
TCGGCCAGCTTT1TCCAATCTTTG
24





PDGFRa
NM_006206
S0226/PDGFRa.f2
SEQ ID NO:179
GGGAGTTTCCAAGAGATGGA
20





PDGFRa
NM_006206
S0228/PDGFRa.r2
SEQ ID NO:180
CTTCAACCACCTTCCCAAAC
20





PDGFRa
NM_006206
S5020/PDGFRa.p2
SEQ ID NO:181
CCCAAGACCCGACCAAGCACTAG
23





PDGFRb
NM_002609
S1346/PDGFRb.f3
SEQ ID NO:182
CCAGCTCTCCTTCCAGCTAC
20





PDGFRb
NM_002609
S1347/PDGFRb.r3
SEQ ID NO:183
GGGTGGCTCTCACTTAGCTC
20





PDGFRb
NM_002609
S4931/PDGFRb.p3
SEQ ID NO:184
ATCAATGTCCCTGTCCGAGTGCTG
24





PGK1
NM_000291
S0232/PGK1.f1
SEQ ID NO:185
AGAGCCAGTTGCTGTAGAACTCAA
24





PGK1
NM_000291
S0234/PGK1.r1
SEQ ID NO:186
CTGGGCCTACACAGTCCTTCA
21





PGK1
NM_000291
S5022/PGK1.p1
SEQ ID NO:187
TCTCTGCTGGGCAAGGATGTTCTGTTC
27





RASSF1
NM_007182
S2393/RASSF1.f3
SEQ ID NO:188
AGTGGGAGACACCTGACCTT
20





RASSF1
NM_007182
S2394/RASSF1.r3
SEQ ID NO:189
TGATCTGGGCATTGTACTCC
20





RASSR
NM_007182
S4909/RASSF1.p3
SEQ ID NO:190
TTGATCTTCTGCTCAATCTCAGCTTGAGA
29





RPL1 9
NM_000981
S0253/RPL19.f3
SEQ ID NO:191
CCACAAGCTGAAGGCAGACA
20





RPL19
NM_000981
S0255/RPL19.r3
SEQ ID NO:192
GCGTGCTTCCTTGGTCTTAGA
21





RPL1 9
NM_000981
S4728/RPL19.p3
SEQ ID NO:193
CGCAAGAAGCTCCTGGCTGACC
22





RPLPO
NM_001002
S0256/RPLPO.f2
SEQ ID NO:194
CCATTCTATCATCAACGGGTACAA
24





RPLPO
NM_001002
S0258/RPLPO.r2
SEQ ID NO:195
TCAGCAAGTGGGAAGGTGTAATC
23





RPLPO
NM_001002
S4744/RPLPO.p2
SEQ ID NO:196
TCTCCACAGACAAGGCCAGGACTCG
25





RRM1
NM_001033
S2835/RRM1.f2
SEQ ID NO:197
GGGCTACTGGCAGCTACATT
20





RRM1
NM_001033
S2836/RRM1.r2
SEQ ID NO:198
CTCTCAGCATCGGTACAAGG
20





RRM1
NM_001033
S4970/RRM1.p2
SEQ ID NO:199
CATTGGAATTGCCATTAGTCCCAGC
25





STAT3
NM_003150
S1545/STAT3.f1
SEQ ID NO:200
TCACATGCCACTTTGGTGTT
20





STAT3
NM_003150
S1546/STAT3.r1
SEQ ID NO:201
CTTGCAGGAAGCGGCTATAC
20





STAT3
NM_003150
S4881/STAT3.p1
SEQ ID NO:202
TCCTGGGAGAGATTGACCAGCA
22





STAT5B
NM_012448
S2399/STAT5B.f2
SEQ ID NO:203
CCAGTGGTGGTGATCGTTCA
20





STAT5B
NM_012448
S2400/STAT5B.r2
SEQ ID NO:204
GCAAAAGCATTGTCCCAGAGA
21





STAT5B
NM_012448
S4910/STAT5B.p2
SEQ ID NO:205
CAGCCAGGACAACAATGCGACGG
23





STMY3
NM_005940
S2067/STMY3.f3
SEQ ID NO:206
CCTGGAGGCTGCAACATACC
20





STMY3
NM_005940
S2068/STMY3.r3
SEQ ID NO:207
TAGAATGGCTTTGGAGGATAGCA
23





STMY3
NM_005940
S4746/STMY3.p3
SEQ ID NO:208
ATCCTCCTGAAGCCCTTTTCGCAGC
25





Surfact A1
NM_005411
S2215/Surfac.f1
SEQ ID NO:209
TGGCCCTCAACCTCATCTTG
20





Surfact A1
NM_005411
S2216/Surfac.r1
SEQ ID NO:210
CTTCCAACACAAACGTCCTTCA
22





Surfact A1
NM_005411
S4930/Surfac.p1
SEQ ID NO:211
TTCGCACACAGCACCAGAGGCTG
23





TAGLN
NM_003186
S3185/TAGLN.f3
SEQ ID NO:212
GATGGAGCAGGTGGCTCAGT
20





TAGLN
NM_003186
S3186/TAGLN.r3
SEQ ID NO:213
AGTCTGGAACATGTCAGTCTTGATG
25





TAGLN
NM_003186
S3266/TAGLN.p3
SEQ ID NO:214
CCCAGAGTCCTCAGCCGCCTTCAG
24





TIMP2
NM_003255
S1680/TIMP2.f1
SEQ ID NO:215
TCACCCTCTGTGACTTCATCGT
22





TIMP2
NM_003255
S1681/TIMP2.r1
SEQ ID NO:216
TGTGGTTCAGGCTCTTCTTCTG
22





TIMP2
NM_003255
S4916/TIMP2.p1
SEQ ID NO:217
CCCTGGGACACCCTGAGCACCA
22





TITF1
NM_003317
S2224/TITF1.f1
SEQ ID NO:218
CGACTCCGTTCTCAGTGTCTGA
22





TITF1
NM_003317
S2225/TITF1.r1
SEQ ID NO:219
CCCTCCATGCCCACTTTCT
19





TITF1
NM_003317
S4829/TITF1.p1
SEQ ID NO:220
ATCTTGAGTCCCCTGGAGGAAAGC
24





VEGFC
NM_005429
S2251/VEGFC.f1
SEQ ID NO:221
CCTCAGCAAGACGTTATTTGAAATT
25





VEGFC
NM_005429
S2252NEGFC.r1
SEQ ID NO:222
AAGTGTGATTGGCAAAACTGATTG
24





VEGFC
NM_005429
S4758NEGFC.p1
SEQ ID NO:223
CCTCTCTCTCAAGGCCCCAAACCAGT
26





WISP1
NM_003882
S1671/WISP1.f1
SEQ ID NO:224
AGAGGCATCCATGAACTTCACA
22





WISP1
NM_003882
S1672/WISP1.r1
SEQ ID NO:225
CAAACTCCACAGTACTTGGGTTGA
24





WISP1
NM_003882
S4915/WISP1.p1
SEQ ID NO:226
CGGGCTGCATCAGCACACGC
20





XIAP
NM_001167
S0289/XIAP.f1
SEQ ID NO:227
GCAGTTGGAAGACACAGGAAAGT
23





XIAP
NM_001167
SO291/XIAP.r1
SEQ ID NO:228
TGCGTGGCACTATTTTCAAGA
21





XIAP
NM_001167
S4752/X1AP.p1
SEQ ID NO:229
TCCCCAAATTGCAGATTTATCAACGGC
27





YB-1
NM_004559
S1194/YB-1.f2
SEQ ID NO:230
AGACTGTGGAGTTTGATGTTGTTGA
25





YB-1
NM_004559
S1195/YB-1.r2
SEQ ID NO:231
GGAACACCACCAGGACCTGTAA
22





YB-1
NM_004559
S4843/YB-1.p2
SEQ ID NO:232
TTGCTGCCTCCGCACCCTTTTCT
23








Claims
  • 1. A method for predicting the likelihood that a patient having a cancer will exhibit a clinical benefit in response to treatment with an epidermal growth factor receptor (EGFR) inhibitor, the method comprising a) assaying an expression level of a RNA transcript or its expression product in a biological sample comprising cancer cells obtained from the patient, wherein the RNA transcript is a KRT17 transcript; andb) determining a normalized expression level of the KRT17 transcript or its expression productwherein the normalized expression level of the KRT17 transcript or its expression product, negatively correlates with the clinical benefit in response to treatment with the EGFR inhibitor, and wherein the EGFR inhibitor interacts with an EGFR.
  • 2. The method of claim 1 wherein the patient is a human.
  • 3. The method of claim 1, further comprising assaying an expression level of at least one additional RNA transcript or its expression product; and
  • 4-5. (canceled)
  • 6. The method of claim 1, wherein the cancer is selected from the group consisting of ovarian cancer, colon cancer, pancreatic cancer, non-small cell lung cancer, breast cancer, renal cancer and head and neck cancer.
  • 7. The method of claim 7, wherein the said biological sample is a tissue sample.
  • 8. The method of claim 7, wherein the tissue from the tissue sample is one selected from the group consisting of fixed, paraffin-embedded, fresh, and frozen.
  • 9. The method of claim 7, wherein the tissue sample is from a biopsy.
  • 10. The method of claim 7, wherein the tissue sample is a bronchial lavage sample.
  • 11. The method of claim 1 wherein the expression level of the RNA transcript is assayed using a reverse transcription-polymerase chain reaction.
  • 12. The method of claim 1 wherein the expression level of the expression product is determined by immunohistochemistry.
  • 13. The method of claim 1 wherein the expression level of the expression product is determined by proteomics technology.
  • 14. The method of claim 1 wherein the assay for measurement of the RNA transcript or its expression product is provided in the form of a kit.
  • 15. The method of claim 1 wherein the EGFR inhibitor is an antibody or an antibody fragment.
  • 16. The method of claim 1 wherein the EGFR inhibitor is a small molecule.
  • 17-30. (canceled)
  • 31. A method of preparing a personalized genomics profile for a patient, comprising the steps of: (a) assaying an expression level of a KRT17 RNA transcript or its expression product in at least one cancer cell obtained from the patient;(b) determining a normalized expression level in the at least one cancer cell of the KRT 17 transcript or its expression product; and(c) creating a report summarizing the data obtained by the normalized expression level.
  • 32. The method of claim 31 wherein the at least one cancer cell is from a solid tumor.
  • 33. The method of claim 32 wherein the solid tumor is selected from the group consisting of breast cancer, ovarian cancer, gastric cancer, colorectal cancer, pancreatic cancer, renal cancer and lung cancer.
  • 34. The method of claim 31 wherein the at least one cancer cell is obtained from a fixed, paraffin-embedded biopsy sample.
  • 35. The method of claim 34 wherein the KRT17 RNA transcript is fragmented.
  • 36. The method of claim 31 wherein the report includes prediction of the likelihood that the patient will exhibit a clinical benefit in response to treatment with an epidermal growth factor receptor inhibitor.
  • 37. The method of claim 36 wherein the report includes recommendation for a treatment modality of the patient.
  • 38. The method of claim 31 wherein if increased expression of the KRT17 RNA transcript or its expression product, said report includes a prediction that said subject has a decreased likelihood of response to treatment with an EGFR inhibitor.
  • 39-42. (canceled)
BACKGROUND OF THE INVENTION

The present application claims the benefit under 35 U.S.C. 119(e) of the filing date of U.S. Application Ser. No. 60/474,908 filed on May 30, 2003.

Provisional Applications (1)
Number Date Country
60474908 May 2003 US
Divisions (1)
Number Date Country
Parent 10857715 May 2004 US
Child 11949535 US