Methods and compositions for evaluating tissues, e.g., tumors, are provided herein.
In the United States, the overall incidence of melanoma is increasing at a rate faster than any other cancer, with recent estimates for lifetime risk of developing invasive melanoma at 1/49 (Jemal et al., C. A. Cancer J. Clin. 57:43-66, 2007). The development of melanoma begins with the malignant transformation of normal human epithelial melanocytes (NHEM) located within the basement membrane of the skin, but the genetic changes associated with the progression of NHEM to melanoma are not well characterized (Bittner et al., Nature. 406:536-540, 2000; DeRisi et al., Nat. Genet. 14:457-460, 1996; Golub et al., Science. 286:531-37, 1999; Hanahan et al., Cell. 100-57-70, 2000; Seykora et al., Am. J. Dermatopathol. 25:6-11, 2003; Su et al., Nature. 406:536-540, 2000; Trent et al., Science. 247:568-571, 1990; Weyers et al., Cancer. 86:288-299, 1999). Similarly, the molecular mechanisms underlying further progression from a primary tumor to a metastatic melanoma are also inadequately defined.
There is a correlation between the thickness of the primary melanoma and its capacity to metastasize to the draining lymph node basin(s) and hematogenously (Haddad et al., Ann. Surg. Oncol. 6:144-149, 1999; Cascinelli et al., Ann. Surg. Oncol. 7:469-474, 2000). Once melanoma has metastasized by either route, the overall survival for patients greatly diminishes (Balch et al., Cancer. 88:3635-3648; 2001; Balch et al., J. Clin. Oncol. 19:3622-3634, 2001). Whereas patients with thin primary tumors are cured by surgery, patients diagnosed with metastatic melanoma (AJCC stage IV) have an overall poor prognosis, with 6 out of every 7 skin cancer deaths due to metastatic melanoma (Balch et al., Cancer. 88:1484-1491, 2000; Eton et al., J. Clin. Oncol. 16:1103-1111, 1998; Jemal et al., C. A. Cancer J. Clin. 57:43-66, 2007).
The compositions and related methods provided herein are based, in part, on the discovery of unique gene expression profiles characteristic of primary basal cell, squamous cell, non-metastatic, and metastatic melanoma skin cancer samples. A consistent “transition zone” of gene expression change within primary melanoma samples was observed and has allowed identification of gene expression profiles capable of distinguishing a primary tumor from metastatic melanoma. This transition in gene expression involves both increased expression levels of genes such as MAGE genes, GPR19, BCL2A1, SOX5, BUB1, and RGS20, and an even greater reduction in the expression of genes such as SPRR1A/B, KRT16/17, CD24, LOR, GATA3, MUC15, and TMPRSS4. The transition in gene expression also involves other genes described herein. For example, the transition involves a reduction in expression of a plurality of (including all of) the following genes: GJB6, SPRR1A, SERPINB5, CALML5 (CLSP), DSC1, PKP1, CLCA2, DSG1, CDSN, LY6D, LCE2B, FLG; RP1-14N1.3, KRT16, SBSN, SERPINB3, SERPINB7, KRT17, KLK7, LOR, SLURP1, LOC63928, KRT15, LGALS7, CST6, SPRR1B, CNFN, TRIM29, EPPK1, SFN, KRT6B, DSG3, SPRR2B, DMKN, ASAH3, SERPINB13, KLK11, AADACL2, DAPL1, ABCA12, DSC3, POF1B, GATA3, LYPD3, KRT6A, EHF, PCDH21, CBLC, FGFR2, SCEL, and FGFR3. For example, the transition involves an increase in expression of a plurality of (including all of) the following genes: MAGEA3, MAGEA6, CSAG2 (TRAG3), MAGEA12, MAGEA2, TRIM51, NRP2, MAGEA1, MSI2, GYPC, SPP1, SOX5, KIFC1, HILS1, RGS20, BUB1, IGF2BP3, FRMD5, C1orf90, EYA4, BCL2A1, SLC16A4, AKT3, CDC45L, SEC22L3, PEG10, POPDC3, MAGEA5, GLUD2, ST6GALNAC3, SEZ6L2, DUSP4, ABCB5, RASGRF1, DUSP4, FLJ40142, BRRN1, PHLDA1, MMP14, DUSP6, DPY19L1, GLUD1, LOC346615, CALU, RNF157, PRDM13, PBK, KIAA1618, NEDD4L, BICD1, and RRM2. The transition may further involve an increase in expression of one or more of the foregoing genes in conjunction with a decrease in expression of one or more of the previous set of genes. Additionally, a correlation between primary melanoma tumor thickness, as measured by Breslow's depth, and the accumulation of individual gene expression changes has also been discovered. The genes identified as changing expression in primary cutaneous melanoma along the spectrum of increasing Breslow's thickness, are useful markers for the existence of cells characteristic of metastatic melanoma. As further described herein, expression of the genes (e.g., five or more of the genes listed above, and/or five or more of the genes described in Tables. A-D, herein) can be examined in various combinations.
Accordingly, in one aspect, the technology herein features a method of evaluating a melanoma from a patient. The method includes determining expression of five or more genes in a test sample from a melanoma, relative to a control, wherein the five or more genes are selected from the genes listed in Table A and Table B, thereby evaluating the melanoma.
In various embodiments, expression of at least 10 genes from Tables A and B is determined, e.g., expression of at least 25, 50, 100, 250, 500, 750, 1000, 1250, or 1500 genes is determined. In various embodiments, expression of no more than 1500, 1250, 1000, 750, 500, 250, 100, 50, or 25 genes is determined. The at least 10 genes may be chosen in any combination from Tables A and B. Thus, in some embodiments, the at least 10 genes includes five genes from Table A and five genes from Table B. Other combinations may be examined, such as one gene from Table A and nine genes from Table B, and so forth. In some embodiments, expression of genes from Table A or Table B is determined (e.g., expression of at least 10 genes from Table A is determined to the exclusion of genes from Table B, or, alternatively, expression of at least 10 genes from Table B is determined, to the exclusion of genes from Table A).
In various embodiments, expression of the five or more genes is determined relative to expression of the five or more genes in a reference set of non-metastatic cutaneous tissue samples, wherein a decrease in expression of one or more of a gene of Table A, and an increase in expression of one or more of a gene of Table B, relative to expression of the five or more genes in the reference set, indicates an increased likelihood that the test sample is from a metastatic melanoma and/or indicates a poor prognosis. The method can further include determining that the patient should undergo a treatment protocol. For example, patients for which the melanoma sample expression is indicative of a metastatic melanoma may elect to undergo a more aggressive treatment, e.g., with interferon alpha 2b, or interleukin2, surgery to remove additional tissue (e.g., addition melanoma tissue at the site from which the original sample was obtained, or at another site, e.g., in a lymph node), or an experimental treatment. Patients in which expression is not indicative of a metastatic melanoma may elect to forgo a treatment.
The non-metastatic cutaneous tissue samples (e.g., the reference samples to which expression in the test sample is compared) can include one or more of the following: normal human epithelial melanocytes, primary cutaneous melanoma, basal cell carcinoma, squamous cell carcinoma, melanoma in situ, and/or thin melanoma (<1.5 mm Breslow's thickness).
In various embodiments, expression of the five or more genes is compared to: (a) expression in a first reference set of non-metastatic cutaneous tissue samples, and (b) expression in a second reference set of metastatic melanoma tissue samples; wherein a greater similarity in expression of the five or more genes in the test sample to the second reference set than to the first reference set indicates an increased likelihood that the test sample is a metastatic melanoma.
The determining expression of five or more genes in the test sample can include isolating RNA from the test sample, and detecting expression of the RNA. RNA expression can be detected directly or indirectly, e.g., using microarray or PCR analysis. The sample can be a fixed, paraffin-embedded biopsy sample, or a frozen sample.
In various embodiments, the determining expression of five or more genes in the test sample includes reverse transcriptase polymerase chain reaction (RT-PCR), e.g., quantitative PCR, e.g., real time quantitative PCR.
In various embodiments, the determining expression of five or more genes in the test sample includes microarray analysis.
In other embodiments, the determining expression of five or more genes in the test sample includes analysis of protein expression, e.g., immunohistochemical analysis of proteins encoded by on or more of the genes, or proteomic analysis.
The test sample can be a test sample from a melanoma having an intermediate thickness (e.g., a Breslow's thickness of 1-4 mm). The test sample can be from a thin or thick melanoma.
In another aspect, the technology features a method of evaluating a melanoma from a patient, which method includes, for example, determining expression of five or more genes in a test sample from the melanoma, relative to a control, wherein the five or more genes are selected from the genes listed in Table C and Table D, thereby evaluating the melanoma. The method can include other features described herein. For example, expression of at least 10, 25, 50, 75, or 100 genes from Tables C and D can determined. Expression of no more than 100, 75, 50, 25, or 10 genes can be determined. The at least 5 genes from Table C and Table D may be examined in any combination, such as one gene from Table C and four genes from Table D; or four genes from Table C and one gene from Table D. In some embodiments, expression of genes solely from just one of Table C or Table D is determined (e.g., expression of at least five genes from Table C is determined, or expression of at least five genes from Table D is determined).
Expression of the five or more genes can be determined relative to expression of the five or more genes in a reference set of non-metastatic cutaneous tissue samples, wherein a decrease in expression of one or more of a gene of Table C, and an increase in expression of one or more of a gene of Table D, relative to expression of the five or more genes in the reference set, indicates an increased likelihood that the test sample is a metastatic melanoma and/or indicates a poor prognosis. The method can further include determining that the patient should undergo a treatment protocol, based on the determination of gene expression.
In some embodiments expression of the five or more genes is compared to: (a) expression in a first reference set of non-metastatic cutaneous tissue samples, and (b) expression in a second reference set of metastatic melanoma tissue samples; wherein a greater similarity in expression of the five or more genes in the test sample to the second reference set than to the first reference set indicates an increased likelihood that the test sample is a metastatic melanoma.
The determining expression of five or more genes in the test sample can include isolating RNA from the test sample, and detecting expression of the RNA, or detecting protein expression.
In another aspect, the technology also features kits for evaluating a melanoma sample. The kits include polynucleotides (e.g., primers or probes) for analysis of at least 5, 10, 25, 50, 75, or 100 genes from Tables C and D, wherein each oligonucleotide specifically hybridizes to one of the genes from Tables C and D. The kits can include polynucleotides for analysis of up to 25, 50, 75, or 100 genes from Tables C and D.
For example, a kit includes pairs of polynucleotides for amplification of the genes from Tables C and D by PCR.
The technology also features kits for evaluating a melanoma sample that include polynucleotides (e.g., primers or probes) for analysis of at least 250, 500, 750, 1000, 1250, or 1500 genes from Tables A and B, wherein each oligonucleotide specifically hybridizes to one of the genes from Tables A and B. The kits can include polynucleotides for analysis of up to 250, 500, 270, 1000, 1250, or 1500 genes from Tables A and B. In some embodiments, the polynucleotides are immobilized on a solid support, e.g., as in a microarray.
The technology also features kits for evaluating protein expression in a melanoma sample. The kit includes, for example, reagents (e.g., antibodies) for detection of proteins encoded by at least 5, 10, 25, or 50 genes from Tables C and D.
A “sample” is any biological material obtained from an individual. A “melanoma sample” or “melanoma tissue sample” is a sample that includes primarily melanoma cells.
“Gene” refers to a polynucleotide sequence that comprises sequences that are expressed in a cell as RNA and control sequences necessary for the production of a transcript or precursor. A gene expression product analyzed according to a method described herein can be encoded by a full length coding sequence or by any portion of the coding sequence.
“Polynucleotide” refers to a polymeric form of nucleotides of any length, either ribonucleotides or deoxyribonucleotides. Thus, the term includes, but is not limited to, single-, double-, or multi-stranded DNA or RNA, genomic DNA, cDNA, DNA-RNA hybrids, or a polymer comprising purine and pyrimidine bases or other natural, chemically or biochemically modified, non-natural, or derivatized nucleotide bases, as well as polynucleotides that have been modified in order to introduce a means for attachment (e.g., to a support for use as a microarray).
The descriptions herein are phrased in terms of “five or more” or “ten or more” genes, but the choices of five and ten would be understood to be for the purposes of illustration and are non-limiting. One may also examine expression of other numbers such as 3, 4, 6, 7, 8, 9, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, and 30 genes, and so forth. In some embodiments, the combination of genes from Tables A, B, C, and/or D which is examined includes one or more genes implicated in one or more (e.g., 2, 3, 4, 5, 6, 7, 8, or 9) of the following biological processes, as understood by one skilled in the art: keratinocyte differentiation, epidermis development, melanocyte differentiation, cell differentiation, morphogenesis, muscle development, nervous system development, cell adhesion, the Wnt receptor signaling pathway, cell-cell signaling, cytoskeleton organization and biogenesis, inflammatory or immune response, cell motility and chemotaxis, electron transport, carbohydrate metabolism, lipid metabolism, proteolysis, signal transduction, protein transport, protein biosynthesis, transcription, DNA repair, cell cycle regulation or proliferation, or apoptosis. In some embodiments, at least one gene implicated in each of the above processes is examined.
The details of one or more embodiments of the technology are set forth in the accompanying drawings and the description below. Other features, objects, and advantages of the technology will be apparent from the description and drawings, and from the claims. All cited patents, and patent applications and references (including references to public sequence database entries) are incorporated by reference in their entireties for all purposes.
Methods and compositions for evaluating tissue samples (e.g., cutaneous tissue samples, e.g., samples from primary melanomas) are provided herein to determine whether the samples exhibit a gene expression profile characteristic of aggressive, metastatic melanomas, or a profile characteristic of non-metastatic melanomas. The ability to classify samples with a high degree of accuracy and sensitivity facilitates prognosis and subsequent clinical decisions (e.g., whether or not to undergo further surgery or other treatment modalities). Accordingly, the technology provides, inter alia, sets of genes whose expression can be examined to determine whether a cutaneous tissue sample is non-metastatic, or metastatic, as well as methods of analyzing expression of the gene sets, and compositions for performing the analysis.
The technology herein features combinations of genes, e.g., combinations of genes listed in Tables A and B below, combinations of genes listed in Tables C and D below. Analysis of the expression of these genes can be performed to identify metastatic melanoma tumors (e.g., in primary cutaneous melanoma samples, or in samples from compound nevi). The technology also features methods of analyzing the expression of combinations of genes. Various techniques are suitable for analyzing gene expression, including those that measure RNA or protein expression. For example, a sample from a melanoma (e.g., a primary cutaneous melanoma) is collected and processed to obtain RNA, protein, or tissue sections, to produce a test sample for analysis. The relative expression of several, dozens, 50, 100, or hundreds of genes in the test sample is determined. The gene expression values are compared to a reference set of values derived from selected non-metastatic melanoma samples and metastatic melanoma samples measured by the same assay as used to determine expression in the test sample. The values obtained for the test sample characterize the sample as metastatic or non-metastatic.
In some methods, a small number of genes (e.g., a subset of genes from Tables C and D, e.g., 5, 10, 15, 25 genes) is selected for analysis, and expression levels are determined by a quantitative or semi-quantitative PCR method or by immunohistochemistry. The combination of measured values for the respective genes are compared to control values to determine the degree to which the test sample contains gene expression values indicative of a metastatic tumor sample. The test samples are, for example, surgically collected tumors collected in a manner that preserves RNA, or alternatively fixed (e.g., formalin fixed) and embedded in paraffin prior to analysis, preserved by flash freezing or fixation, and/or treated with an RNA Stabilization Reagent.
In some methods, expression of a larger number of genes is analyzed, e.g., using microarrays. Nucleic acids from the test sample are hybridized to arrays under appropriate conditions, arrays are scanned, and the data processed by standard methods for feature extraction and normalization in order to obtain individual gene expression values. In these methods, a few hundred to more than a thousand genes can be used to determine the character of the test sample. One of several methods might be employed to identify the metastatic potential of the sample under investigation, based on the microarray-determined gene expression values. Typically, reference samples for metastatic melanoma and non-metastatic melanoma are analyzed in advance. The test sample is compared to the reference samples by classification schemes such as clustering, weighted voting, principle components analysis, self organizing maps, and/or neural networks. Each of these schemes is essentially a mathematical system for maximizing the geometric separation of classes (metastatic and non-metastatic) in multidimensional space using the individual gene expression values as coordinates to plot an individual sample relative to reference samples in multidimensional space.
Whether a method suitable for analysis of smaller or larger numbers of genes is employed, the reference samples define the combination of measures that identify a metastatic sample and a non-metastatic sample. The specific mathematical process depends on the method used for measuring gene expression, the number of genes, and the nature of the genes chosen to participate in the assay. Based on this unique combination and the reference sample values a threshold value will be determined (or mathematical formula) that will identify the unknown sample as more like the metastatic samples or more like the non-metastatic samples. One of skill would understand that genes that are not differentially expressed can be examined in methods described herein, e.g., as a control.
Tables A-D set forth sets of genes, the expression of which has been shown to correlate with metastatic melanoma. Tables A and C list genes whose expression is decreased in metastatic melanoma samples, relative to non-metastatic samples. Tables B and D list genes whose expression is increased in metastatic melanoma samples, relative to non-metastatic samples. Thus, the genes provided in these tables, and subsets thereof, are useful markers for metastatic melanoma.
In various embodiments, the technology provides a subset of genes from Tables A and B for evaluating a cutaneous tissue sample, sets of oligonucleotides (e.g., for use as probes and primers) for analyzing expression of the subsets, and methods for analyzing their expression, as described in more detail below. The set includes, for example, at least 5, 10, 50, 100, 250, 500, 750, or 1000 genes from Tables A and B, in any proportion (e.g., 800 genes from Table A and 200 genes from Table B).
For example, the subset includes the genes listed in Tables C and D. The genes in Tables C and D, and subsets thereof, are useful for evaluating a cutaneous-tissue sample, and in methods for analyzing expression of the subsets, as described in more detail below. An exemplary set includes, for example, at least 5, 10, 25, 35, or 51 genes from each of Tables C and D.
The lists shown in Tables C and D were generated from the lists shown in Tables A and B by selecting the genes that exhibit the greatest difference in gene expression between the metastatic samples and the non-metastatic samples based on microarray analysis. The genes on these lists also represent genes, the expression or expression products of which have been the subject of biological investigations.
Tables A-D include full length gene names (Gene description), gene symbols, GenBank accession numbers (GenBank ID), Entrez gene accession numbers (Entrez Gene ID), and UniGene accession numbers (UniGene ID). GenBank, Entrez, and UniGene records can be accessed on the World Wide Web at the following address: ncbi.nlm.nih.gov. These records provide sequences and additional information for each gene.
The genes listed in Tables A-D are generally referred to elsewhere herein by gene symbol. Gene symbols shown in parentheses are aliases or former designations.
Drosophila)
Gene Expression Analysis
As discussed above, combinations of genes are provided herein, for analysis of gene expression in cutaneous tumors (e.g., primary melanoma samples) to determine whether the tumors exhibit a metastatic expression pattern. Methods for analyzing gene expression include methods based on hybridization analysis of polynucleotides, sequencing of polynucleotides, and analysis of protein expression (e.g., proteomics-based methods). Commonly used methods are for the quantification of mRNA expression in a sample include northern blotting and in situ hybridization (Parker & Barnes, Methods in Molecular Biology 106:247-283, 1999); RNAse protection assays (Hod, Biotechniques 13:852 854, 1992); and PCR-based methods, such as reverse transcription polymerase chain reaction (RT-PCR) (Weis et al., Trends in Genetics 8:263 264, 1992). Alternatively, antibodies may be employed that can recognize specific duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA hybrid duplexes or DNA-protein duplexes. Representative methods for sequencing-based gene expression analysis include Serial Analysis of Gene Expression (SAGE), and gene expression analysis by massively parallel signature sequencing (MPSS).
PCR-Based Methods
Combinations of genes indicative of metastatic or non-metastatic melanoma can be analyzed by PCR. PCR is useful to amplify and detect transcripts from a melanoma sample. Various PCR methodologies are useful for gene expression analyses.
Reverse Transcriptase PCR (RT-PCR). RT-PCR is a sensitive quantitative method that can be used to compare mRNA levels in different samples (e.g., non-metastatic and metastatic melanoma samples, or benign cutaneous and melanoma samples) to examine gene expression signatures.
To perform RT-PCR, mRNA is isolated from a sample (e.g., total RNA isolated from a human melanoma sample). mRNA can be extracted, for example, from frozen or archived paraffin-embedded and fixed (e.g. formalin-fixed) tissue samples. Methods for mRNA extraction are known in the art. See, e.g., Ausubel et al., Current Protocols in Molecular Biology, John Wiley and Sons, 1997. Methods for RNA extraction from paraffin embedded tissues are disclosed, for example, in Rupp and Locker, Lab Invest. 56:A67, 1987, and De Andres et al., BioTechniques 18:42044, 1995. Purification kits for RNA isolation from commercial manufacturers, such as Qiagen, can be used. For example, total RNA from a sample can be isolated using Qiagen RNeasy mini-columns. Other commercially available RNA isolation kits include MasterPure™ Complete DNA and RNA Purification Kit (EPICENTRE™, Madison, Wis.), and, Paraffin Block RNA Isolation Kit (Ambion, Inc.). Total RNA from tissue samples can be also isolated using RNA Stat-60 (Tel-Test) or by cesium chloride density gradient centrifugation.
Next, RNA is reverse transcribed into cDNA. The cDNA is amplified in a PCR reaction. Two commonly used reverse transcriptases are avian myeloblastosis virus reverse transcriptase (AMV-RT) and Moloney murine leukemia virus reverse transcriptase (MMLV-RT). The reverse transcription step is typically primed using specific primers, random hexamers, or oligo-dT primers, depending on the conditions and desired readout. For example, extracted RNA can be reverse-transcribed using a GeneAmp RNA PCR kit (Perkin Elmer, Calif., USA), following the manufacturer's instructions. The derived cDNA can then be used as a template in the subsequent PCR reaction. The PCR reaction typically employs the Taq DNA polymerase, which has a 5′-3′ nuclease activity but lacks a 3′-5′ proofreading endonuclease-activity. Two oligonucleotide primers are used to generate an amplicon in the PCR reaction.
Guidelines for PCR primer and probe design are described, e.g., in Dieffenbach et al., “General Concepts for PCR Primer Design” in: PCR Primer, A Laboratory Manual, Cold Spring Harbor Laboratory Press, New York, 133-155, 1995; Innis and Gelfand, “Optimization of PCRs” in: PCR Protocols, A Guide to Methods and Applications, CRC Press, London, 5-11, 1994; and Plasterer, T. N. Primerselect: Primer and probe design. Methods Mol. Biol. 70:520-527, 1997. Factors considered in PCR primer design include primer length, melting temperature (Tm), and G/C content, specificity, complementary primer sequences, and 3′-end sequence. PCR primers are generally 17-30 bases in length, and Tm's between 50-80° C., e.g. about 50 to 7° C. are typically preferred.
For quantitative PCR, a third oligonucleotide, or probe, is used to detect nucleotide sequence located between the two PCR primers. The probe is non-extendible by Taq DNA polymerase enzyme, and typically is labeled with a reporter fluorescent dye and a quencher fluorescent dye. Any laser-induced emission from the reporter dye is quenched by the quenching dye when the two dyes are located close together as they are on the probe. During the amplification reaction, the Taq DNA polymerase enzyme cleaves the probe in a template-dependent manner. The resultant probe fragments disassociate in solution, and signal from the released reporter dye is free from the quenching effect of the second fluorophore. One molecule of reporter dye is liberated for each new molecule synthesized, and detection of the unquenched reporter dye provides the basis for quantitative analysis.
RT-PCR can be performed using commercially available equipment, such as an ABI PRISM 7700™ Sequence Detection System (Perkin-Elmer-Applied Biosystems, Foster City, Calif., USA), or Lightcycler® (Roche Molecular Biochemicals, Mannheim, Germany). Samples can be analyzed using a real-time quantitative PCR device such as the ABI PRISM 7700™ Sequence Detection System™.
To minimize errors and the effect of sample-to-sample variation, RT-PCR is usually performed using an internal standard. A suitable internal standard is expressed at a constant level among different tissues, and is unaffected by the experimental variable. RNAs frequently used to normalize patterns of gene expression are mRNAs for the housekeeping genes glyceraldehyde-3-phosphate-dehydrogenase (GAPDH) and β-actin.
A variation of the RT-PCR technique is real time quantitative PCR, which measures PCR product accumulation through a dual-labeled fluorigenic probe (i.e., TaqMan™ probe). Real time PCR is compatible both with quantitative competitive PCR, where internal competitor for each target sequence is used for normalization, and with quantitative comparative PCR using a normalization gene contained within the sample, or a housekeeping gene for RT-PCR. For further details see, e.g. Held et al., Genome Res. 6:986-994, 1996.
Gene expression can be examined using fixed, paraffin-embedded tissues as the RNA source. Briefly, in one exemplary method, sections of paraffin-embedded tumor tissue samples are cut (˜10 μm thick). RNA is extracted, and protein and DNA are removed. After analysis of the RNA concentration, RNA repair and/or amplification steps may be performed, if necessary, and RNA is reverse transcribed using gene specific promoters followed by RT-PCR. Methods of examining expression in fixed, paraffin-embedded tissues, are described, for example, in Godfrey et al., J; Molec. Diagn. 2: 84-91, 2000; and Specht et. al., Am. J. Pathol. 158: 419-29, 2001.
Another approach for gene expression analysis employs competitive PCR design and automated, high-throughput matrix-assisted laser desorption ionization time-of-flight (MALDI-TOF) MS detection and quantification of oligonucleotides. This method is described by Ding and Cantor, Proc. Natl. Acad. Sci. USA 100:3059-3064, 2003.
See also the MassARRAY-based gene expression profiling method, developed by Sequenom, Inc. (San Diego, Calif.).
Additional PCR-based techniques for gene expression analysis include, e.g., differential display (Liang and Pardee, Science 257:967-971, 1992); amplified fragment length polymorphism (iAFLP) (Kawamoto et al., Genome Res. 12:1305-1312, 1999); BeadArray™ technology (Illumina, San Diego, Calif.; Oliphant et al., Discovery of Markers for Disease (Supplement to Biotechniques), June 2002; Ferguson et al., Analytical Chemistry 72:5618, 2000); BeadsArray for Detection of Gene Expression (BADGE), using the commercially available Luminex100 LabMAP system and multiple color-coded microspheres (Luminex Corp., Austin, Tex.) in a rapid assay for gene expression (Yang et al., Genome Res. 11:1888-1898, 2001); and high coverage expression profiling (HiCEP) analysis (Fukumura et al., Nucl. Acids. Res. 31(16) e94, 2003).
Microarrays
Evaluating gene expression of a melanoma sample can also be performed with microarrays. Microarrays permit simultaneous analysis of a large number of gene expression products. Typically, polynucleotides of interest are plated, or arrayed, on a microchip substrate. The arrayed sequences are then hybridized with nucleic acids (e.g., DNA or RNA) from cells or tissues of interest (e.g., cutaneous tissue samples). The source of mRNA typically is total RNA (e.g., total RNA isolated from human melanoma samples, and normal skin samples). If the source of mRNA is a primary tumor, mRNA can be extracted, for example, from frozen or archived paraffin-embedded and fixed (e.g. formalin-fixed) tissue samples.
In various embodiments of the microarray technique, probes to at least 10, 25, 50, 100, 200, 500, 1000, 1250, 1500, or 1600 genes (e.g., genes listed in a Table herein, which distinguish metastatic melanoma from other types of cutaneous tissues) are immobilized on an array substrate (e.g., a porous or nonporous solid support, such as a glass, plastic, or gel surface). The probes can include DNA, RNA, copolymer sequences of DNA and RNA, DNA and/or RNA analogues, or combinations thereof.
In some embodiments, a microarray includes a support with an ordered array of binding (e.g., hybridization) sites for each individual gene. The microarrays can be addressable arrays, and more preferably positionally addressable arrays, i.e., each probe of the array is located at a known, predetermined position on the solid support such that the identity (i.e., the sequence) of each probe can be determined from its position in the array.
Each probe on the microarray can be between 10-50,000 nucleotides, e.g., between 300-1,000 nucleotides in length. The probes of the microarray can consist of nucleotide sequences with lengths: less than 1,000 nucleotides, e.g., sequences 10-1,000, or 10-500, or 10-200 nucleotides in length. An array can include positive control probes, e.g., probes known to be complementary and hybridizable to sequences in the test sample, and negative control probes, e.g., probes known to not be complementary and hybridizable to sequences in the test sample.
Methods for attaching nucleic acids to a surface are known. Methods for immobilizing nucleic acids on glass are described, e.g., Schena et al, Science 270:467-470, 1995; DeRisi et al, Nature Genetics 14:457-460, 1996; Shalon et al., Genome Res. 6:639-645, 1996; and Schena et al., Proc. Natl. Acad. Sci. U.S.A. 93:10539-11286, 1995). Techniques are known for producing arrays with thousands of oligonucleotides at defined locations using photolithographic techniques are described by Fodor et al., 1991, Science 251:767-773, 1991; Pease et al., Proc. Natl. Acad. Sci. U.S.A. 91:5022-5026, 1994; Lockhart et al., Nature Biotechnology 14:1675, 1996; U.S. Pat. Nos. 5,578,832; 5,556,752; and 5,510,270). Other methods for making microarrays have been described. See, e.g., Maskos and Southern, Nuc. Acids. Res. 20:1679-1,684, 1992. In principle, and as noted supra, any type of array, for example, dot blots on a nylon hybridization membrane (see Sambrook et al., Molecular Cloning, A Laboratory Manual, 2nd Ed., Vols. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. (1989)) could be used.
The polynucleotide molecules to be analyzed may be from any clinically relevant source, and are expressed RNA or a nucleic acid derived therefrom (e.g., cDNA or amplified RNA derived from cDNA that incorporates an RNA polymerase promoter), including naturally occurring nucleic acid molecules, as well as synthetic nucleic acid molecules. For example, the test polynucleotide molecules include total cellular RNA, poly(A)+ messenger RNA (mRNA), or fraction thereof, cytoplasmic mRNA, or RNA transcribed from cDNA (i.e., cRNA; see, e.g., Linsley & Schelter, U.S. patent application Ser. No. 09/411,074, filed Oct. 4, 1999, or U.S. Pat. No. 5,545,522, 5,891,636, or 5,716,785). Methods for preparing RNA are known and are described, e.g., in Sambrook et al., Molecular Cloning, A Laboratory Manual (2nd Ed.), Vols. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., 1989. RNA can be fragmented by methods known in the art, e.g., by incubation with ZnCl2, to generate fragments of RNA.
Test polynucleotide molecules that are poorly expressed in particular cells can be enriched using normalization techniques (Bonaldo et al., Genome Res. 6:791-806, 1996).
The test polynucleotides are detectably labeled at one or more nucleotides. Any method known in the art may be used to detectably label the polynucleotides.
Nucleic acid hybridization and wash conditions are chosen so that the test polynucleotide molecules specifically bind or specifically hybridize to the complementary polynucleotide sequences of the array, preferably to a specific array site, wherein its complementary nucleic acid is located. General parameters for specific (i.e., stringent) hybridization conditions for nucleic acids are described in Sambrook et al., supra, and in Ausubel et al., Current Protocols in Molecular Biology, vol. 2, Current Protocols Publishing, New York, 1994. Typically, stringent conditions for short probes (e.g., 10 to 50 nucleotide bases) will be those in which the salt concentration is at least about 0.01 to 1.0 M at pH 7.0 to 8.3 and the temperature is at least about 30° C. Stringent conditions can also be achieved with the addition of destabilizing agents such as formamide. When fluorescently labeled probes are used, the fluorescence emissions at each site of a microarray can be detected by scanning confocal laser microscopy or other methods (see Shalon et al., Genome Research 6:639-645, 1996; Schena et al., Genome Res. 6:639-645, 1996; and Ferguson et al., Nature Biotech. 14:1681-1684, 1996). Signals are recorded and typically analyzed by computer. Methods for evaluating microarray data and classifying samples are described in U.S. Pat. No. 7,171,311.
Serial Analysis of Gene Expression (SAGE)
Gene expression in melanoma samples can also be determined by serial analysis of gene expression (SAGE), which is a method that allows the simultaneous and quantitative analysis of a large number: of gene transcripts, without the need of providing an individual hybridization probe for each transcript (see, e.g. Velculescu et al., Science. 270:484-487, 1995; and Velculescu et al., Cell 88:243-51, 1997). Briefly, a short sequence tag (about 10-14 nucleotides) is generated that contains sufficient information to uniquely identify a transcript, provided that the tag is obtained from a unique position within each transcript. Then, many transcripts are linked together to form long serial molecules, that can be sequenced, revealing the identity of the multiple tags simultaneously. The expression pattern of a population of transcripts can be quantitatively evaluated by determining the abundance of individual tags, and identifying the gene corresponding to each tag.
Protein Detection Methodologies
Immunohistochemical methods are also suitable for detecting the expression of the melanoma signature genes described herein. Antibodies, most preferably monoclonal antibodies, specific for a gene product are used to detect expression. The antibodies can be detected by direct labeling of the antibodies themselves, for example, with radioactive labels, fluorescent labels, hapten labels such as, biotin, or an enzyme such as horse radish peroxidase or alkaline phosphatase. Alternatively, unlabeled primary antibody is used in conjunction with a labeled secondary antibody, comprising antisera, polyclonal antisera or a monoclonal antibody specific for the primary antibody. Immunohistochemistry protocols and kits are well known in the art and are commercially available.
Proteomic methods can allow examination of global changes in protein expression in a sample. Proteomic analysis typically involves separation of individual proteins in a sample by 2-D gel electrophoresis (2-D PAGE), and identification of individual proteins recovered from the gel, e.g. my mass spectrometry or N-terminal sequencing, and analysis of the data using bioinformatics. Proteomics methods can be used alone or in combination with other methods for evaluating gene expression.
In various aspects, the expression of certain genes in a cutaneous sample is detected to provide clinical information (e.g., prognostic information, classification of the tumor from which the sample is derived as a metastatic melanoma or non-metastatic melanoma). Thus, gene expression assays include measures to correct for differences in RNA variability and quality. For example, an assay typically measures and incorporates the expression of certain normalizing genes, such known housekeeping genes, e.g., GAPDH, β-actin, and Cyp1. Alternatively, normalization can be based on the mean or median signal (Ct) of all of the assayed genes or a large subset thereof (global normalization approach). In some embodiments, a normalized test RNA (e.g., from a patient sample) is compared to the amount found in a metastatic melanoma, non-metastatic melanoma, and/or normal skin sample reference set. The level of expression measured in a particular test sample can be determined to fall at some percentile within a range observed in reference sets.
Kits
The technology herein includes kits for evaluating gene expression (e.g., RNA or protein) in melanoma samples. A “kit” refers to a combination of physical elements, e.g., probes, including without limitation specific primers, labeled nucleotic acid probes, antibodies, protein-capture agent(s), reagent(s), instruction sheet(s) and other elements useful to practice the technology described herein. These physical elements can be arranged in any way suitable for carrying out the invention.
A kit for analyzing protein expression can include specific binding agents, such as immunological reagents (e.g., an antibody, e.g., a labeled antibody) for detecting proteins expressed of one or more genes described herein (e.g., one or more genes from Table A, Table B, Table C, or Table D). For example, the kit can include an antibody that detects expression of GJB6, an antibody that detects expression of SPPRR1A, and an antibody that detects expression of SERPINB5, in a tissue section.
Kits for analyzing RNA expression include, for example, a set of oligonucleotide probes for detecting expression of a set of genes described herein (e.g., five or more genes from Table A, Table B, Table C, or Table D). The probes can be provided on a solid support, as in an array (e.g., a microarray), or in separate containers. The kits can include a set of oligonucleotide primers useful for amplifying a set of genes described herein, e.g., to perform PCR analysis. Kits can include further buffers, enzymes, labeling compounds, and the like.
To identify the genes involved in the metastatic process of melanoma, various non-metastatic primary skin cancers were compared to metastatic melanoma utilizing a gene microarray approach followed by functional validation of select genes. Distinct gene expression changes occurring along the spectrum of primary melanoma tumor thickness and metastatic melanoma were discovered.
Tumor samples were obtained from patients with primary cutaneous melanoma (PCM), squamous cell carcinoma (SCC), basal cell carcinoma (BCC) and metastatic melanoma (MM). Gene expression in the samples was examined by microarray analysis as described in the Materials and Methods, below. An initial training set of 23 tumors revealed 2,014 Affymetrix probe sets with a greater than 2-fold difference in the average gene expression level between the metastatic melanoma (MM) and primary cutaneous cancers. This preliminary list, consisting of 1,141 well characterized and 471 poorly characterized human genes, indicates that a substantial difference exists between the metastatic tumors and the non-metastatic tissue types. The expression differences allow for a relatively robust gene classification of tissue samples into groups of metastatic samples and non-metastatic primary tumors. All tumor samples were clustered utilizing the 2,014 probe sets and individually identified as metastatic or non-metastatic based upon the characteristics of tumor samples in the same cluster. The initial set of samples comprised a training set for which 22 of 23 samples were correctly partitioned into the cluster containing primary melanoma or the cluster containing MM samples. A single primary melanoma with a Breslow's tumor thickness of 90 mm was misclassified as a MM sample. Two independent test sets comprised of primary and MM samples were similarly classified, utilizing the 2,014 probe sets and hierarchical clustering. Co-clustering led to the correct identification of 56 of 60 melanoma samples. In general, the misidentified samples were thick primary melanomas classified as MM. Of note, several normal human skin samples were analyzed and found to classify as non-metastatic by their gene expression profiles.
A subset of melanoma samples were examined in order to generate a more comprehensive list of genes that were differentially expressed between MM and PCM using serial analysis of microarrays (SAM). This analysis identified 1,352 probe sets with higher expression in the metastatic samples and 2,991 probe sets with higher expression in non-metastatic samples. This list was further reduced by removing probe sets that did not appear to have an average difference greater than 2-fold between groups. The resultant complete gene list is shown above in Tables A and B, above. This final list consists of 1,667 Affymetrix probe sets that detect 247 poorly defined transcripts, 84 minimally defined genes, and 1007 well characterized human genes. From this list, 316 genes were highly expressed in MM compared to 1022 genes that were more highly expressed in the non-metastatic: cancers and normal skin.
A subset of the full gene list is shown below in Table 1 below. This table illustrates two main trends. There is a shift in the kind of genes expressed, perhaps related to the fundamental characteristics of the cells comprising the tumors. For example, there is higher expression levels in MM for several melanoma-associated tumor antigens. (MAGE, CSAG2), genes implicated in melanoma progression (GDF15, MMP14, SPP-1), cell cycle progression (CDK2, TYMS, BUB1), and the prevention of apoptosis (BIRC5, BCL2A1). These changes may reflect the higher growth capacity of the metastatic tumors. Conversely, among the 997 genes with reduced expression in MM samples, many are implicated in keratinocyte differentiation and epidermal development, such as loricrin (LOR), involucrin (IVL), and keratin-5 (KRT5), suggesting a loss of epidermal characteristics. These expression changes suggest important comparative differences between non-metastatic and metastatic tumors.
Analysis of the functional classes of genes changed using gene ontology revealed that 15 genes associated with keratinocyte differentiation and 32 genes involved in epidermis development were down-regulated in the metastatic samples (
Another observation is that there are a larger number of genes with reduced expression in the metastatic tumors and the degree of decrease is much greater. In other words, the loss of gene expression is greater than the gain of new gene expression. This is consistent with the observation of dedifferentiation which is believed to occur with the development of cancer.
The initial statistical analysis of microarray samples of metastatic melanoma and non-metastatic cutaneous tumors leads to the conclusion that a fundamental difference exists between tumors containing metastatic potential and tumors without demonstrated metastatic potential. In the case of melanoma, it would appear the metastatic potential is associated with a large number of changes in gene expression and fundamental changes in the spectrum of genes expressed. Any measurement of this programmatic shift in gene expression would be useful for the identification of metastatic melanoma cells within a primary melanoma tumor. The data presented in Table 1 addressed the question of whether there were genes differentially expressed (increased or decreased) between primary (BCC/SCC/PCM) and metastatic cancers (metastatic melanoma). The full names of each gene (for named genes), gene symbol, accession number and gene identification for all genes>2-fold up- or down-regulated in metastatic melanoma are provided in Tables A and B above.
The relative gene expression levels of 177 genes across the spectrum of tissue samples examined is shown graphically in
The apparent transition zone of gene expression change could represent a critical time period where many tumorigenic events occur or may simply reflect the outgrowth of an aggressive and/or metastatic cell phenotype. To address this issue, a comparative analysis of gene expression patterns in primary melanomas of different Breslow's thickness was performed. PCM and MM samples were compared to elucidate a possible relationship between relative gene expression patterns associated with PCM of increasing Breslow's thickness and that of MM samples. Table 2 (left columns) reveals the relative change in gene expression for a subset of genes throughout the spectrum of primary melanoma tumors to MM samples.
Several genes, such as the MAGE genes, exhibited a steady and consistent increase in gene expression over the entire range of tumor thicknesses. However, a single major shift in expression was observed for most genes when thinner primary tumors were directly compared to thicker ones. This was most apparent when comparing I.M. thickness to thick PCM, with the majority of genes showing the greatest increase in gene expression. Notable exceptions were genes such as SPP1, HOXA10 and MMP14, for which the greatest differential increase in expression was at the comparative interface between thin and I.M. thickness tumor samples. Other genes, such as MMP19, CTH, PDGFRL, C16 orf34 and GPR19, showed the greatest comparative increase in expression when comparing MIS to thin PCM lesions.
A similar phenomenon was observed for genes with decreased expression in primary tumors relative to more advanced lesions (Table 2, right columns). Here, however, the largest proportion of the gene expression change occurred between thick PCM and MM samples. Very little expression of keratins (6B, 16, 17) and SPRR1 (A, B) was observed in MM compared to all primary melanomas, including thick lesions. Several genes, such as TMPRSS4, STAR, ST7L, HAS3, FGFR3, CASZ1 and HR, were found to have gene expression changes at the very earliest stages of tumor thickening. Together, the gene expression patterns do not shift in a coordinated fashion as would be expected as the result of the outgrowth of a clonal aggressive or metastatic cell type. Rather a series of events may occur as PCM tumors thicken that may influence the expression of different groups of genes ultimately leading to the fully metastatic cell type. This data indicates that some gene expression changes may be indicative of earlier events in the progression to full metastasis. The indicated changes in gene expression may signal that cells in primary melanoma tumors are progressing to a fully metastatic-state or that have already acquired the metastatic state. In either case, the genes are useful markers for identifying aggressive tumors which warrant more aggressive treatment.
All annotated genes listed in Table 2 with a “<2” indicates that any difference between tumors for each comparative analysis was less than 2-fold. Underlined numbers indicate the greatest change in gene expression across varying PCM tumor thickness for each gene.
9.4
129.4
10.7
52.8
9.8
68.3
8.9
27.4
11.1
39.7
3.9
6.1
4.4
24.1
13.9
8.9
17.3
22.6
10.3
10.6
7.5
13.3
4.8
25.9
3.7
27.1
9.4
14.7
13
14.3
4.8
11.9
19.6
28.8
5.7
5.5
3.7
6.2
12.2
13.9
5.7
6.7
10.4
4.1
5.1
9.2
4.4
12.3
15.9
3.6
Gene expression profiles of cultured NHEM were compared to PCM and MM samples (Table 3), acknowledging the inherent limitations associated with the comparisons of cultured cells and freshly procured tumor samples. Large differences in gene expression were observed between NHEM and early, non-metastatic PCM (MIS/thin lesions only) and MM samples. Concordant over-expression of genes were found for both comparisons, in particular for such genes as KRT14, GJA1, S100A7(A9) and EHF. Other genes, like the melanoma associated antigens, MAGE A2 and TRAG and PRAME were also found to be highly over-expressed in NHEM to early primary or MM samples. Similarly, a marked decrease in gene expression was observed for several genes, although of a lesser magnitude than seen for the over-expressed genes. Several unique genes including PAEP, HES6, ESDN, NR4A3, c6orf168 and BCL2A1, were under-expressed in NHEM compared to thin PCM. Other genes were also identified as under-expressed in both groups, such as CITED-1, GDF15, QPRT, OCA2, c-MET and MME.
A perusal of the gene expression differences between PCM and MM samples identifies numerous putative oncogenes and tumor suppressor genes (TSG). Table 4 lists several oncogenes and TSG previously implicated in tumor types. The gene with the largest increase in expression (13.2 fold) was SPP-1 or osteopontin. Although not previously identified as an oncogene, osteopontin expression has been shown to correlate with melanoma invasion and tumor progression (Zhou et al., J. Invest. Dermatol. 124:1044-1052, 2005). The lineage-specific oncogene, MITF, previously shown to act as a master regulator of melanocyte development and a critical survival oncogene amplified in melanoma showed a 3.7 fold increase (Garraway et al., Nature. 436:33-35, 2005; Levy et al., Trends Mol. Med. 12:406-14, 2006; McGill et al., J. Biol. Chem. 281:10365-10373, 2006). Of the other genes, GDF15, c-Met and the HOX loci have been shown to act as possible oncogenes in breast cancer, squamous cell lung cancer, prostate and pancreatic cancer. Several of the putative melanoma TSGs have also been previously shown to contribute to the development and progression of cancer in other tumor histologies.
The shifts in gene expression occur at different stages of the thickening process for each of the oncogenes and TSGs listed in Table 4. Some of the genes show a progressive and steady increase or decrease in gene expression as tumors of greater thickness are compared. But for others, such as the oncogenes SPP-1 and GDF15, and the TSGs PITX-1 and CST6, the major shifts in gene expression appear to occur at distinct but different times during the thickening of the primary melanoma tumors. This observation strongly suggests that these changes may occur spontaneously but eventually accumulate to contribute to the final metastatic phenotype.
Table 4 contains a partial list of identified tumor oncogenes and tumor suppressor genes (TSG's) in PCM and MM samples. The fold increase represents the greatest fold change noted throughout all comparisons of each PCM tumor thickness to MM. The activating/suppressive mechanism and affected tumor type are also identified.
To further validate the expression of putative TSG and oncogenes in our melanoma panel, real-time quantitative polymerase chain reaction (RT-qPCR) assays were performed on 20 previously arrayed samples, comprised of 7 PCM and 13 mM samples. The results are depicted in
To independently verify and validate the gene expression changes at the protein level, protein expression of several suspected oncogenes and TSG was examined using Western Blot analysis. Osteopontin (SPP-1) protein expression was examined, both from melanoma cell lysates and conditioned cell free media derived from 2 primary and 6 MM daughter cell lines (
Analysis of suspected TSG in 3 primary and 3 metastatic melanoma cell lines revealed a very low level of protein expression of DSC3 in 6/6 cell lines (
The molecular analyses described herein clearly identifies distinct molecular profiles associated with MM which are different from PCM, SCC, and BCC as well as normal melanocytes and skin. Using the refined gene list (Tables A and B), the metastatic character of tumors (SCC/BCC/PCM/MM) can be classified correctly greater than 90% of the time. One major difficulty is classification of thick primary melanoma tumors, where occasionally, these tumors appeared to have the gene expression signature of MM. It was discovered that these tumors represent primary tumors which have already acquired the metastatic gene expression pattern. Detection methods that examine expression of the gene combinations described herein allow identification of such tumors and inform subsequent clinical decisions.
In conclusion, a clear pattern of gene expression change was observed in the non-metastatic and metastatic samples examined. There is a clear point of transition in gene expression when comparing I.M. to: thick PCM, revealing specific groupings of genes involved in this process. Several of these genes and combinations thereof have never before revealed as functional or relevant in melanoma. The specific genes involved in this dynamic and fluid change in gene expression provides the basis for the determination of whether a thin, I.M., or thick PCM has the genetic capability to metastasize and facilitate the development of an appropriate treatment strategy.
Tumor specimens. Tumor samples were surgically procured from patients with primary cutaneous melanoma (PCM), squamous cell carcinoma (SCC), basal cell carcinoma (BCC) and metastatic melanoma (MM) over a 3 year period. All samples were obtained under an Investigational Review Board (IRB) approved tissue procurement protocol (MCC#13448, IRB#101751; PSM#990914-JM, 020318-JM). Upon surgical removal of the primary melanoma, a single surgical oncologist (A.I.R.) utilized a scalpel to macrodissect and procure a portion of the remaining primary tumor, with a similar technique utilized for grossly involved lymph nodes where the melanoma had completely replaced the lymph node. Samples were taken from non-necrotic areas of the tumor. The same process was performed for all distant metastases, with care taken to avoid surrounding tissues and stroma.
All samples were cryopreserved in liquid nitrogen and stored within the Tissue Procurement Laboratory of the Moffitt Cancer Center, securely de-identified through a centralized database. Forty MM samples were analyzed, composed of 22 bulky, macroscopic (replaced) lymph node metastases, 16 subcutaneous and 2 solid organ metastases (adrenal and brain). These MM samples were compared with 42 primary cutaneous cancers (16 PCM, 11 SCC, 15 BCC). PCM consisted of 2 melanoma in situ (MIS), 2 thin melanomas (<1 mm), 3 intermediate-thickness melanomas (1-4 mm), and 9 thick melanomas (>4 mm). Additionally, 4 samples of normal human skin and 1 sample of cultured NHEM were included. All MM samples were procured from patients that had failed multiple previous therapies, ranging from single agent Interferon, single or multi-agent chemotherapy, immunotherapy or other experimental treatment options. All primary cutaneous cancers were procured from previously untreated patients.
RNA isolation, purification and hybridization. A portion of each cryopreserved tissue sample was dissolved in TRIzol® (Invitrogen, Carlsbad, Calif.), purified according to manufacturer's recommendations, and further purified on RNeasy columns (Qiagen Inc., Valencia, Calif.). RNA integrity was verified by both gel electrophoresis and the Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, Calif.). A total of 5 μg of RNA was processed using established Affymetrix protocols for the generation of biotin-labeled cRNA and the hybridization, staining, and scanning of arrays as outlined in the Affymetrix technical manuals (Van Gelder et al., Proc. Natl. Acad. Sci. U.S.A. 87:1663-1667, 1990; Warrington et al., Physiol. Genomics. 2:143-147, 2000). The processed RNA was hybridized to Human Genome U133 Plus 2.0 arrays from Affymetrix, Inc. (Santa Clara, Calif.), and scanned on an Affymetrix GeneChip® scanner 3000 at 2.5 μm resolution. A more complete description of this process is available in Dobbin et al., Clin. Can. Res. 11:565-572, 2005. The tissue samples were processed in three independent groups.
Cell lines and tissue culture. Freshly excised melanoma samples were placed into culture media (RPMI 1640+5% FCS) and tissue procurement and expansion of daughter cell lines was established utilizing previously published techniques (Riker et al., Can Detect and Prev, 23(5):387-96, 1999; Riker A I. The isolation and culture of melanoma cell lines. In: Langdon S, editor. Cancer cell culture: Methods and protocols. Totowa: Humana Press; pp. 93-100, 2004). All cell lines were split and passaged<10 times and, characterized by flow cytometry and/or cytospin preparation for cellular confirmation of melanoma cell purity (data not shown). The cell lines, TC077 and TC80a were derived from primary melanoma samples with TC80b derived from a metastatic lymph node (from the same patient). The cell lines, TC12A and TC12F, were derived from 2 different subcutaneous melanoma nodules from: the same patient. There were 3 cell lines examined from metastatic samples, TC66C, TC72 and TC89. The NHEM were cultured according to the manufacturer directions. (Cambrex BioScience, Walkersville, Md.).
Semi and real-time quantitative RT-PCR. First-strand cDNA synthesis was performed using Superscript III RT (Invitrogen). Subsequently, the cDNA was used in semi-quantitative PCR. Each sample was normalized with β-Actin as an internal control, comparing each sample with AlphaEase®FC image analysis software (Alpha Innotech, San Leandro, Calif.), followed by densitometric analysis of the integrated values for each sample. The expression levels of putative oncogenes and tumor suppressor genes were analyzed by real-time quantitative RT-PCR (qPCR) using Assays-on-Demand. Gene Expression Assays (Applied Biosystems, Foster City, Calif.): SPP1 (osteoponin, assay ID Hs00167093_m1), GDF15 (growth differentiation factor 15, assay ID Hs00171132_ml), PITX1 paired-like homeodomain transcription factor1, assay ID Hs00267528 ml), DSC3 (desmocollin 3, assay ID Hs00170032_m1), CST6 (cystatin E/M, assay ID Hs00154599), POU2F3 (POU domain, class 2, transcription factor 3, assay ID Hs00205009) and GAPDH (assay IDHs99999905_m1) as the internal standard. Utilizing normal skin as the calibrator, the relative quantitation values of a target template for each sample were expressed as 2−ΔΔCt. Briefly, qPCR analysis was performed utilizing 40 ng of total cDNA in a 25 μl reaction volume (Applied Biosystems). QPCR was performed utilizing established techniques, with all samples performed in triplicate and run on an ABI/PRISM 7500 Sequence Detector System (Applied Biosystems).
Gene microarray analysis and bioinformatics. Affymetrix MAS 5.0 analysis software was used to generate signal values for all probe sets based upon a mean intensity of 500, subsequently exported and iteratively normalized as a whole group to create the final normalization based upon the most stable gene expression measurements across all samples (Li et al., Proc. Natl. Acad. Sci. U.S.A. 98:31-36, 2001). This process was performed for the initial group of tumor samples to generate the list of normalization probesets that were subsequently used to scale all samples processed for this study to an average intensity of 4000 for the normalization probesets. Following scaling, the calculated signal values were then used to calculate the average expression level for each gene in each tissue type using an initial group of 23 tumor samples. The average expression values derived from this initial set were directly compared to identify genes expressed at high levels in one tumor type but not in the other samples using a t-test and visual inspection to find highly differential expression patterns. Genes highly expressed in metastatic melanomas but not primary melanomas, basal cell carcinomas, or squamous cell carcinomas, were sought. Several genes were initially selected that exhibited the idealized gene expression profiles. Additional candidate genes were then identified by using Pearson's correlation between the idealized gene expression patterns and all other probe sets on the arrays. Positively correlated (r>0.7) and negatively correlated (r<0.7) genes were identified and trimmed to include only those with a 2-fold or greater difference in the average gene expression level between metastatic samples and non-metastatic tumors. This initial gene expression survey identified 2014 Affymetrix probe sets from the U133 Plus 2.0 arrays that showed differential expression between metastatic tumor samples and non-metastatic tumor samples.
The 2014 probe sets identified as correlating with the metastatic phenotype were used to cluster the samples. Following normalization, as described above, the signal values were log 2 transformed. Each probe set was then mean centered across all samples and the resulting values were input into Eisen's cluster. Hierarchical clustering was performed using absolute correlation and a complete linkage. Clustering was performed with various subgroups of the data or with all samples together and resulted in similar sample groupings. Individual samples were classified based on the class of the other samples in the closest cluster. The complete microarray data is available from the Gene Expression Omnibus (world wide web address: ncbi.nlm.nih.gov/geo/) under Accession number GSE7553.
Serial Analysis of Microarrays (SAM) was performed in order to identify a more extensive list of differentially expressed genes expressed between PCM and MM. Two comparisons were made to generate a comprehensive and yet confident list of genes that are differentially expressed between metastatic melanoma and non-metastatic melanomas. In the first comparison, the metastatic melanoma samples were opposed by all the non-metastatic samples including basal and squamous cell carcinoma and normal skin. The false discovery rate threshold used to limit the gene list was 0% for this comparison. Because of the number of samples, this provides good statistical confidence but does not focus on the differences between primary melanoma and metastatic melanoma. A second comparison was performed utilizing 6 thin primary melanoma samples in opposition to 6 randomly selected metastatic melanomas. The only non-random aspects of this sample selection were to avoid selecting samples in which the classifier disagreed with the pathologist's diagnosis and to avoid utilizing more than one sample from the same individual. For this comparison the median false discovery rate threshold was set at 5%. This latter analysis is the preferred grouping of samples, but because of the small sample size it is also more likely to generate false discoveries due to noise and outlier samples. Therefore the more confident gene list generated by combining the two analyses. The intersection of the two approaches yielded 1,352 probe sets with higher expression in the metastatic samples and 2,991 probe sets with higher expression in non-metastatic samples. This list was further reduced by removing probe sets that did not appear to have a difference greater than 2-fold on average between the two groups.
Following all microarray analyses, the identified probe sets were annotated based on the sequence of the probes used on the arrays (Harbig et al., Nucleic Acids Res. 33:e31, 2005).
Western blot analysis. Whole cell extracts from PCM and MM cell lines were prepared by directly lysing cells in SDS sample buffer. Expression of SPP-1 protein was assessed in cell lysate and serum-free conditioned medium. Briefly, 4×106 cells were plated in 5% FBS containing medium; 24 hours later, the growth medium was replaced with serum-free medium. The conditioned media and cell lysates were harvested 24 hours later and resolved using a 12.5% SDS-PAGE. Proteins were transferred to a PVDF membrane and probed with the anti-human SPP-1 mouse monoclonal antibody (Sigma, St. Louis, Mo.) (1:1000) followed by a secondary antibody conjugated to horseradish peroxidase (Amersham Biosciences, Piscataway, N.J.) and detected using chemiluminescence (Santacruz Biotechnology, Santa Cruz, Calif.). The osteopontin band (SPP-1) was visualized at ˜55-65 kDa. Daughter melanoma cell lines derived from the freshly procured melanoma samples (with the exception of A375) were lysed by M-PER™ Mammalian Protein Extraction Reagent (Pierce, Rockford, Ill.) and processed according to manufacturer instructions. A total of 15 μg of protein from each experimental condition were electrophoresed on 10% SDS-PAGE and transferred to nitrocellulose membranes (Bio-Rad, Hercules, Calif.). Immunostaining was performed with the following primary antibodies: DSC3 (Santa Cruz) 1:200; CLCA2 (Novus Biologicals, Littleton, Colo.) 1:500; PDGFRL (Novus Biologicals) 1:500; α-tubulin (Cell signaling, Danvers, Mass.), 1:1000. Immunocomplexes were visualized using an enhanced chemiluminescence (ECL) Western Blotting Substrate (Pierce). The intensity of the bands were scanned with a Fujifilm intelligent dark box II and analyzed with Fujifilm Las-1000 Lite V1.3 software.
A number of embodiments of the technology have been described. Nevertheless, it will be understood that various modifications may be made without departing from the spirit and scope of the technology. Accordingly, other embodiments are within the scope of the following claims.
This application claims the benefit of U.S. Provisional Application No. 60/824,849, filed Sep. 7, 2006, which is incorporated herein by reference in its entirety.
The methods and compositions described herein were made with government support awarded by the ARMY Medical Research and Material Command (MRMC) under Grant No. DAMD17-02-2-0051. The government has certain rights in the invention.
Number | Name | Date | Kind |
---|---|---|---|
7056674 | Baker et al. | Jun 2006 | B2 |
7081340 | Baker et al. | Jul 2006 | B2 |
7171311 | Dai et al. | Jan 2007 | B2 |
7247426 | Yakhini et al. | Jul 2007 | B2 |
20060183141 | Change et al. | Aug 2006 | A1 |
20070154889 | Wang et al. | Jul 2007 | A1 |
Number | Date | Country | |
---|---|---|---|
20080113360 A1 | May 2008 | US |
Number | Date | Country | |
---|---|---|---|
60824849 | Sep 2006 | US |