MOLECULAR METHOD FOR DIAGNOSIS OF COLON CANCER

Information

  • Patent Application
  • 20110165582
  • Publication Number
    20110165582
  • Date Filed
    November 22, 2010
    14 years ago
  • Date Published
    July 07, 2011
    14 years ago
Abstract
Methods for diagnosing or detecting cancerous colon tissue. A panel of 17 specific marker genes are provided. The overexpression of some of these marker genes compared to their expression in normal human colon tissue and the underexpression of the rest of these marker genes are indicative of cancerous colon tissue. By using these 17 marker genes as a diagnostic tool, smaller tissue samples, such as those obtained by core needle biopsies, from patient stool samples, or from blood samples can be used.
Description
TECHNICAL FIELD

The present invention relates to diagnosis methods and, more particularly, to diagnosis methods for detecting colon cancer.


BACKGROUND OF THE INVENTION

With 19,200 new cases in Canada in 2004, colon cancer is one of the three most prevalent cancers in Canada for both men and women (Canadian Cancer Statistics, 2004). Invasive biopsy procedures require long hospitalizations and may have numerous possible side effects. Other alternative diagnostic procedures, such as digital rectal examination, fecal occult blood procedure, double-contrast barium enema, flexible sigmoidoscopy, and total colonoscopy are mostly invasive. The fecal occult blood test, while non-invasive, requires confirmation by way of additional invasive procedures. Unfortunately, such invasive procedures can possibly lead to side effects and/or long hospitalizations.


There is therefore a need for a non-invasive and accurate testing procedure for detecting colon cancer in humans. Ideally, such a test should be able to detect cancerous colon cells even from small sample sizes.


There is therefore a need for a more accurate diagnostic method that does not require an invasive biopsy to detect or diagnose colon cancer. Ideally, such a method should be usable even with very small sample sizes and may be combined with other, pathologist-based diagnosis methods.


SUMMARY OF INVENTION

The present invention provides methods for diagnosing or detecting cancerous colon tissue in humans. Colon tissue samples are acquired from patients and are tested for the expression of specific marker genes. A panel of 17 specific marker human genes are provided. The overexpression of some of these marker genes compared to their expression in normal colon tissue and the underexpression of the rest of these marker genes compared to normal colon tissue are indicative of cancerous colon tissue. By using these 17 marker genes as a diagnostic tool, small tissue samples, such as those obtained by core needle biopsies and from stool samples can be used.


In a first aspect, the present invention provides a method for diagnosing whether a human patient has colon cancer, the method comprising:


a) obtaining subject colon cells from said human patient


b) assaying the level of the RNAs encoded by SEQ ID NOs. 1-17 in said subject colon cells obtained in step a)


c) diagnosing said human patient with colon cancer when the RNAs encoded by SEQ ID NOs. 1-8 in said subject colon cells are overexpressed in comparison to the level of RNAs encoded by SEQ ID Nos. 1-8 in non-cancerous human colon cells and when the level of the RNAs encoded by SEQ ID Nos. 9-17 in said subject colon cells are underexpressed in comparison to the level of the RNAs encoded by SEQ ID Nos. 9-17 in non-cancerous human colon cells.


In a second aspect, the present invention provides a method for determining if human colon cells are cancerous, the method comprising:


a) assaying the level of the RNAs encoded by SEQ ID NOs. 1-17 in said human colon cells


b) determining that said human colon cells are cancerous when the RNAs encoded by SEQ ID NOs. 1-8 in said human colon cells are overexpressed in comparison to the level of RNAs encoded by SEQ ID Nos. 1-8 in non-cancerous human colon cells and when the level of the RNAs encoded by SEQ ID Nos. 9-17 in said human colon cells are underexpressed in comparison to the level of the RNAs encoded by SEQ ID Nos. 9-17 in non-cancerous human colon cells.





BRIEF DESCRIPTION OF THE DRAWINGS

A better understanding of the invention will be obtained by considering the detailed description below, with reference to the following drawings in which:



FIG. 1 is a table listing the 17 genes which is the subject of the present invention;



FIGS. 2-17 illustrate box plots of the expression of the above-noted genes in both cancerous and non-cancerous tissue; and



FIG. 18 is a table which, taken in conjunction with a table in the description, denotes which sample sets were used in which experiments for the box plotted results in FIGS. 2-17.





DETAILED DESCRIPTION OF THE INVENTION

The present invention relates to the use of a panel of 17 specific human marker genes to diagnose or detect cancerous colon tissue. The panel of 17 marker genes is listed in Table 1 below. Experiments have shown that this panel of human marker genes give high accuracy in colon cancer diagnosis due to the expression levels of the marker genes in cancer tissue relative to their expression levels in normal tissue in humans.


The panel of 17 marker genes is given in Table 1. The marker genes were determined from two different microarray data sets. A portion of the genes were found to give correct classification for the data set described by Notterman D A, et al. ((2001) Transcriptional Gene Expression Profiles of Colorectal Adenoma, Adenocarcinoma and Normal Tissue Examined by Oligonucleotide Arrays. Cancer Res. 61:3124-3130). The rest of the genes in the panel were selected from the data set published by Alon, U. et al. ((1999) Broad Patterns of Gene Expression Revealed by Clustering Analysis of Tumour and Normal Colon Tissue Probed by Oligonucleotide Arrays. Proc. Natl. Acad. Sci. 96:6745-6750).


The data set from Alon, et al. consisted of 40 tumour and 22 normal samples for a total of 66 samples. Samples were obtained from colon adenocarcinoma specimens snap-frozen in liquid nitrogen within 20 min of removal/collection from patients. From some of these patients paired normal colon tissue also was obtained. The microarrays were hybridized using Affymetrix Hum600 array using standard protocol. The 2,000 highest intensity genes were selected and published on the web at http://microarray.princeton.edu/oncology/. From this subset were selected seven diagnostic genes that give 100% of correct classification (the last 6 genes in Table 1). The dataset from Alon et al. is limited in size and therefore biomarker selection was performed on another data set also found in the Notterman et al. paper. In this data set, samples of colon adenocarcinoma and paired normal tissue from the same patient were obtained from the Cooperative Human Tissue Network. The tissue was snap-frozen in liquid nitrogen within 20-30 min of harvesting and stored thereafter at −80<0>C. mRNA was extracted from the bulk tissue samples and hybridized to the array using standard procedure (see Notterman et al., 2001). This data set was also cited by Rhodes et al. in 2004 (see Rhodes, D. R. et al. (2004) Large-scale Meta-Analysis of Cancer Microarray Data Identifies Common Transcriptional Profiles of Neoplastic Transformation and Progression. Proc. Natl. Acad. Sci. 101:9309). The adenocarcinoma samples were specifically re-reviewed by a pathologist at the institution where the samples were obtained using paraffin-embedded tissue that was adjacent or in close proximity to the frozen sample from which the RNA was extracted. The publicly available data set consists of 18 adenocarcinoma and 18 normal samples. The set consists of ˜6600 genes.









TABLE 1







Panel of 17 genes found to give high accuracy in colon cancer diagnosis


and their expression level in cancer relative to normal tissue.













Over or Under-





expressed in cancer





tissue relative to


SEQ ID NO.
Gene Name
Symbol
normal tissue













1
Pyrroline-5-
PYCR1
Overexpressed



carboxylate



reductase 1


2
General
GTF2E1
Overexpressed



transcription



factor IIE,



polypeptide 1,



alpha 56 kDa


3
Transcribed
NME1
Overexpressed



locus,



strongly



similar to NP



937818.1



nucleoside-



diphosphate



kinase 1



isoform a [Homo




sapiens]



4
Eukaryotic
EIF1AX
Overexpressed



translation



initiation



factor 1A, X-



linked


5
Centomere
CENPF
Overexpressed



protein F,



350/400ka



(mitosin)


6
RAN binding
RANBP1
Overexpressed



protein 1


7
KIAA0020
KIAA0020
Overexpressed


8
Membrane
MCP
Overexpressed



cofactor



protein (CD46,



trophoblast-



lymphocyte



cross-reactive



antigen)


9
Solute carrier
SLC20A2
Underexpressed



family



20 (phosphate



transporter),



member 2


10
TU3A protein
TU3A
Underexpressed


11
Adenylate
AK1
Underexpressed



kinase 1


12
Zinc finger
ZNF297
Underexpressed



protein 297


13
ER Lumen
KDELR1
Underexpressed



Protein



Retaining



Receptor 1


14
Human mRNA for
COL4A2
Underexpressed



type IV



collagen alpha



(2) chain


15
Src homology 2
SHC
Underexpressed



domain



containing



transforming



protein 1


16
Peripheral
PMP22
Underexpressed



myelin protein



22


17
Collagen type
COL13A1
Underexpressed



XIII, alpha1









The genes listed above and identified by their SEQ ID referencing the attached sequence listings were derived using a microarray gene expression experiment.


By following the procedure noted above, the expression of the above genes can be determined from sample tissue obtained from a patient. By determining the expression of the above noted genes in the sample tissue, the presence or absence of cancerous colon tissue may be determined.


It should be noted that the procedure for determining the expression of genes in tissue is well-known in the art. Furthermore, procedures for the extraction and collection of tissue, in this case colon tissue, are also well-known. As noted above, colon tissue samples may be obtained from patient stool samples or core needle biopsies or, alternatively, from blood samples. These tissue samples may then be tested for the expression of the above genes and then compared to the expression of the above genes in tissue samples known to be non-cancerous. If the first 8 genes listed above are overexpressed in the patient sample tissue relative to their expression levels in normal tissue, and if the next 9 genes listed above are underexpressed in the patient sample tissue relative to their expression levels in normal tissue, then this would indicate the presence of cancerous colon tissue in the patient sample tissue.


It should be noted that expression analysis can be carried out using any method for measuring gene expression. Such methods as microarrays, diagnostic panel mini-chip, PCR, real-time PCR, and other similar methods may be used. Similarly, methods for measuring protein expression (protein seen as products of translation of the said genes) may also be used.


As noted above, the cancerous colon cells can be obtained from a patient using minimally invasive core needle biopsy or from techniques such as from a patient's stool samples. Normal or non-cancerous colon cells against which the cancerous cells can be compared can also be obtained from the patient or from other patients. Experiments have shown that the diagnosis can be possible from just a small number of cancer cells.


Referring to FIGS. 2-17, boxplots of test results for the above noted genes are illustrated. The boxplots illustrate that, for each particular gene, that gene is either underexpressed or overexpressed in cancerous tissue relative to normal tissue. The tissue samples which were used for the experiments were those used and referred to in the following publications as set out in the table below:















Sam-





ple

Sample



set
Publication
subset
Sample type







A
Notterman DA, Alon U, Sierk AJ,
1
Normal tissue



Levine AJ. Transcriptional gene
2
Adenocar-



expression profiles of colorectal

cinoma tissue



adenoma, adenocarcinoma, and normal



tissue examined by oligonucleotide



arrays. Cancer Res. 2001 Apr 1;



61(7): 3124-30


B
Zou TT, Selaru FM, Xu Y, Shusstova
1
normal colonic



V, Yin J, Mori Y, Shibata D, Sato F,

epithelium



Wang S, Olaru A, Deacu E, Liu T,
2
colorectal



Abraham JM, Meltzer SJ. Application

adenocar-



of cDNA mircoarrays to generate a

cinoma



molecular taxonomy capable of



distinguishing between colon cancer



and normal colon. Oncogene. 2002



Jul 18; 21(31): 4855-62.


C
Notterman DA, Alon U, Sierk AJ,
1
Duke Stage A



Levine AJ. Transcriptional gene
2
Duke Stage B



expression profiles of colorectal
3
Duke Stage C



adenoma, adenocarcinoma, and normal
4
Duke Stage D



tissue examined by oligonucleotide



arrays. Cancer Res. 2001 April 1;



61(7): 3124-30


D
Notterman DA, Alon U, Sierk AJ,
1
Stage A(1)



Levine AJ. Transcriptional gene
2
Stage B(7)



expression profiles of colorectal
3
Stage C(5)



adenoma, adenocarcinoma, and normal
4
Stage D(5)



tissue examined by oligonucleotide



arrays. Cancer Res. 2001 April 1;



61(7): 3124-30


E
Notterman DA, Alon U, Sierk AJ,
1
p53 mutation



Levine AJ. Transcriptional gene

negative



expression profiles of colorectal
2
p53 mutation



adenoma, adenocarcinoma, and normal

positive



tissue examined by oligonucleotide



arrays. Cancer Res. 2001 April 1;



61(7): 3124-30


F
Shyamsundar R, Kim YH, Higgins JP,
1
Multitissue



Montgomery K, Jorden M, Sethuraman
2
Colon Normal



A, van de Rijn M, Botstein D, Brown



PO, Pollack JR. A DNS microarray



survey of gene expression in normal



human tissues. Genome Biol. 2005;



6(3): R22, Epub 2005 Feb 14


G
Notterman DA, Alon U, Sierk AJ,
1
Female



Levine AJ. Transcriptional gene
2
Male



expression profiles of colorectal



adenoma, adenocarcinoma, and normal



tissue examined by oligonucleotide



arrays. Cancer Res. 2001 April 1;



61(7): 3124-30


H
Ramaswamy S, Tamayo P, Rifkin R,
1
Cancer



Mukherjee S, Yeang CH, Angelo M,

progression



Ladd C, Reich M, Latulippe E,

normal



Mesirov JP, Poggio T, Gerald W,
2
Cancer



Loda M, Lander ES, Golub TR.

progression



Multiclass cancer diagnosis using

primary



tumor gene expression signatures.




Proc Natl Acad Sci USA. 2001 Dec




18; 98


I
Su AI, Welsh JB, Sapinoso LM, Kern
1
Multitissue



SG, Dimitrov P, Lapp H, Schultz PG,

cancer



Powell SM, Moskaluk CA, Frierson HF
2
Colorectal



Jr, Hampton GM. Molecular

adenocar-



classification of human carcinomas

cinoma



by use of gene expression



signatures. Cancer Res. 2001 Oct



15; 61(20): 7388-93.


J
Ramaswamy S, Tamayo P, Rifkin R,
1
Multitissue



Mukherjee S, Yeang CH, Angelo M,

cancer



Ladd C, Reich M, Latulippe E,
2
Colorectal



Mesirov JP, Poggio T, Gerald W,

adenocar-



Loda M, Lander ES, Golub TR.

cinoma



Multiclass cancer diagnosis using



tumor gene expression signatures.




Proc Natl Acad Sci USA. 2001 Dec




18; 98


K
Ramaswamy S, Tamayo P, Rifkin R,
1
primary



Mukherjee S, Yeang CH, Angelo M,
2
metastatic



Ladd C, Reich M, Latulippe E,



Mesirov JP, Poggio T, Gerald W,



Loda M, Lander ES, Golub TR.



Multiclass cancer diagnosis using



tumor gene expression signatures.




Proc Natl Acad Sci USA. 2001 Dec




18; 98


L
Ramaswamy S, Tamayo P, Rifkin R,
1
Primary



Mukherjee S, Yeang CH, Angelo M,
2
Metastatic



Ladd C, Reich M, Latulippe E,



Mesirov JP, Poggio T, Gerald W,



Loda M, Lander ES, Golub TR.



Multiclass cancer diagnosis using



tumor gene expression signatures.




Proc Natl Acad Sci USA. 2001 Dec




18; 98


M
Alon U, Barkai N, Notterman DA,
1
normal colon



Gish K, Ybarra S, Mack D, Levine AJ.
2
colon



Broad patterns of gene

adenocar-



expression revealed by clustering

cinoma



analysis of tumor and normal colon



tissues probed by oligonucleotide



arrays. Proc Natl Acad Sci USA.



1999 Jun 8; 96


N
Ramaswamy S, Tamayo P, Rifkin R,
1
Multitissue



Mukherjee S, Yeang CH, Angelo M,

normal



Ladd C, Reich M, Latulippe E,
2
Colon normal



Mesirov JP, Poggio T, Gerald W,



Loda M, Lander ES, Golub TR.



Multiclass cancer diagnosis using



tumor gene expression signatures.




Proc Natl Acad Sci USA. 2001 Dec




18; 98









For the experiments for which the results are in the boxplots of FIGS. 2-17, the genes tested and the sample sets used are as noted in FIG. 18. The second row in the table of FIG. 18 notes the symbol of the gene being tested while the first column denotes the experiment number. The intersection between the gene symbol and the experiment number shows the sample set used for that experiment. The experiment number corresponds to the bottom row of the box plot for that gene. As an example, for the gene denoted by symbol AK1, the boxplot of which is in FIG. 12, experiment 1 used sample set A noted above. Since sample set A has two sample subsets, then there are two sub-columns for the first column in the box plot of FIG. 12. The first sub-column shows the expression level for the gene AK1 in normal tissue (as noted in the table above) while the second sub-column for this experiment is the expression level for the gene AK1 in adenocarcionoma tissue (again as noted above for sample set A).


As another example, experiment 7 for the gene PYCR1 used the sample set C with four subsample sets (see FIG. 2) which tested the expression level of PYCR1 in tissues at various Duke stages.


The correspondence between the test results in the figures and the genes being tested are as follows:

















FIGURE containing



Gene Symbol
box plot results









PYCR1
FIG. 2



GTF2E1
FIG. 3



NME1
FIG. 4



EIF1AX
FIG. 5



CENPF
FIG. 6



RANBP1
FIG. 7



KIAA0020
FIG. 8



MCP
FIG. 9



SLC20A2
FIG. 10



TU3A
FIG. 11



AK1
FIG. 12



ZNF297
FIG. 13



COL4A2
FIG. 14



SHC1
FIG. 15



PMP22
FIG. 16



COL13A1
FIG. 17










It should be noted that the underexpression or the overexpression of the above noted genes in cancerous tissue relative to their expression in normal tissue is readily evident in the box plots. Specifically, the experiments which used the samples sets A, B, M, and N compare the expression levels of specific genes in both cancerous and non-cancerous tissue in a side-by-side manner. For the genes which were not tested for sample sets A, B, M, and N, their expression levels for sample set F (normal tissue) may be compared with their expression levels for sample sets H and I (cancerous tissue). For the genes for which sample set E was used, the presence of p53 mutation indicates cancerous tissue, sample subset 2 for this sample set being cancerous tissue.


While it is preferable that the complete panel of 17 marker genes be used in the diagnosis of possible colon cancer, using a subset of the 17 marker genes will also yield useful results. Using a panel of anywhere from 1 to 17 marker genes out of the 17 marker genes on suspect colon tissue will still provide a useful indication as to whether cancerous colon tissue may be present or whether further and more involved tests are required.


The diagnostic panel of 17 genes listed above was validated using human tissue samples. Total RNA was obtained from 17 sets of donor-matched colon adenocarcinoma and normal adjacent-to-tumor (NAT) tissue samples. Fourteen of these sample sets were obtained from colorectal cancer (CRC) patients in early stages of disease (Stage I or II). The RNA was extracted from snap-frozen tissue samples excised during surgical resections. Additionally, 12 RNA samples were obtained from persons with no history of colon cancer (normal) with ages and genders of the donors being comparable to that of the tumor group. Real-time quantitative PCR was used to measure the expression of each of the genes for each patient sample.


Using a panel approach, with the rationale that applying a number of markers as a panel can provide more information and/or more accuracy than any single marker as a diagnostic, prognostic or therapeutic aid, the gene expressions were tested as noted above. Analyses of the gene expression data as a panel led to the derivation of a ratio approach for sample classification. The ratio was obtained by dividing the geometric mean of the normalized expression data for each of the eight genes predicted to be over-expressed by the geometric mean of the normalized expression data for each of the nine genes predicted to be under-expressed. The ability of the ratio to distinguish tumor (N=17) from NAT (N=17) samples was assessed by Receiver Operator Characteristic (ROC) curve analyses. With an optimal cut-off ratio value of 1.54, the test was found to have 88.2% sensitivity and a specificity of 100%. The corresponding area under the curve (AUC) for this analysis was 0.912. As is known, the sensitivity of a test is a measure of the probability that a test will produce a true positive result. The specificity of a test is the probability that the test will produce a true negative result. The present invention therefore has an 88.2% chance that the test will produce a true positive result and it has a 100% probability that it will produce a true negative result.


A person understanding this invention may now conceive of alternative structures and embodiments or variations of the above all of which are intended to fall within the scope of the invention as defined in the claims that follow.

Claims
  • 1. A method for diagnosing whether a human patient has colon cancer, the method comprising: a) obtaining subject colon cells from said human patientb) assaying the level of the RNAs encoded by SEQ ID NOs. 1-17 in said subject colon cells obtained in step a)c) diagnosing said human patient with colon cancer when theRNAs encoded by SEQ ID NOs. 1-8 in said subject colon cells are overexpressed in comparison to the level of RNAs encoded by SEQ ID Nos. 1-8 in non-cancerous human colon cells and when the level of the RNAs encoded by SEQ ID Nos. 9-17 in said subject colon cells are underexpressed in comparison to the level of the RNAs encoded by SEQ ID Nos. 9-17 in non-cancerous human colon cells.
  • 2. A method according to claim 1 wherein said colon cells are obtained by a core needle biopsy.
  • 3. A method according to claim 1 wherein said colon cells are obtained from stool samples.
  • 4. A method according to claim 1 wherein said colon cells are obtained from blood samples.
  • 5. A method for determining if human colon cells are cancerous, the method comprising: a) assaying the level of the proteins obtained from RNAs encoded by SEQ ID NOs. 1-17 in said human colon cellsb) determining that said human colon cells are cancerous when the proteins obtained from RNAs encoded by SEQ ID NOs. 1-8 in said human colon cells are overexpressed in comparison to the level of proteins obtained from RNAs encoded by SEQ ID Nos. 1-8 in non-cancerous human colon cells and when the level of the proteins obtained from RNAs encoded by SEQ ID Nos. 9-17 in said human colon cells are underexpressed in comparison to the level of proteins obtained from RNAs encoded by SEQ ID Nos. 9-17 in non-cancerous human colon cells.
  • 6. A method according to claim 5 wherein said colon cells are obtained by a core needle biopsy.
  • 7. A method according to claim 5 wherein said colon cells are obtained from stool samples.
  • 8. A method according to claim 5 wherein said colon cells are obtained from blood samples.
  • 9. A method for diagnosing whether a human patient has colon cancer, the method comprising: d) obtaining subject colon cells from said human patiente) assaying the level of proteins obtained from RNAs encoded by SEQ ID NOs. 1-17 in said subject colon cells obtained in step a)f) diagnosing said human patient with colon cancer when the level of proteins obtained from RNAs encoded by SEQ ID NOs. 1-8 in said subject colon cells are overexpressed in comparison to the level of proteins obtained from RNAs encoded by SEQ ID Nos. 1-8 in non-cancerous human colon cells and when the level of the proteins obtained from RNAs encoded by SEQ ID Nos. 9-17 in said subject colon cells are underexpressed in comparison to the level of proteins obtained from RNAs encoded by SEQ ID Nos. 9-17 in non-cancerous human colon cells.
  • 10. A method according to claim 9 wherein said colon cells are obtained by a core needle biopsy.
  • 11. A method according to claim 9 wherein said colon cells are obtained from stool samples.
  • 12. A method according to claim 9 wherein said colon cells are obtained from blood samples.
RELATED APPLICATIONS

The present application is a continuation-in-part of U.S. patent application Ser. No. 11/508,244 filed Aug. 23, 2006 which is hereby incorporated by reference.

Continuation in Parts (1)
Number Date Country
Parent 11508244 Aug 2006 US
Child 12951331 US