The present invention relates to a gene associated with hepatocellular carcinoma, and particularly to a gene associated with the recurrence of hepatocellular carcinoma.
Almost all types of hepatocellular carcinomas are developed from chronic hepatitis caused by viral hepatitis. The causal viruses thereof are hepatitis C virus and hepatitis B virus. If a patient is persistently infected with either hepatitis C virus or hepatitis B virus, there are no therapeutic methods therefor. The patient does nothing but only facing a fear of developing liver cirrhosis or hepatocellular carcinoma. Interferon has been used as an agent for treating hepatitis. However, effective examples are only 30%, and thus this is not necessarily a sufficient therapeutic agent. Under the present circumstances, there are almost no effective examples, in particular, for chronic hepatitis. Nevertheless, even if such viruses cannot be eliminated, if progression of pathologic conditions can be suppressed, it leads to prevention of liver cirrhosis or hepatocellular carcinoma. Thus, it is considered important to clarify the factor of developing pathologic conditions at a molecular level.
If once hepatocellular carcinoma has been developed, even if a surgical radical operation is made, the recurrence of cancer in the remaining liver appears at a high frequency. The survival rate obtained 5 years after the operation of liver cancer is 51% on a national accumulation base. It has been reported that such recurrence appears at approximately 25% of cases 1 year after hepatectomy, at 50% thereof 2 years after hepatectomy, and at 80% thereof 5 years after hepatectomy. Hence, it cannot be said that remaining liver tissues are normal liver tissues, but it is considered that a bud of the recurrence of hepatocellular carcinoma has already existed. At present, it has been reported that recurrence risk factors include the maximum diameter of a tumor, the number of tumors, tumor embolus of portal vein, a preoperative AFP value, intrahepatic metastasis, the presence or absence of liver cirrhosis, etc. However, in order to develop a method for predicting and preventing the recurrence of hepatocellular carcinoma, it is necessary to find at a molecular level a factor of determining the presence or absence of recurrence, which is associated with such risk factors. Such a factor obtained at a molecular level is considered to be a factor, which is associated not only with recurrence but also with the development of hepatocellular carcinoma or progression of pathologic conditions. In recent years, as a result of gene expression analysis using a DNA microarray, it has become possible to classify more in detail such pathologic conditions based on the difference in the expression patterns of genes as a whole. To date, histological or immunological means have been mainly used for classification of cancers. However, cancers classified into the same type have different clinical courses and therapeutic effects depending on individual cases. If there were a means for classifying such cancers more in detail, it would become possible to offer treatment depending on individual cases. It is considered that the gene expression analysis using a DNA microarray constitutes a powerful method for knowing the prognosis of such cancers.
To date, the DNA microarray analysis has clarified the following points associated with hepatocellular carcinoma:
(i) the types of genes, the expressions of which are different between a tumor tissue and a nontumor tissue (Shirota Y, Kaneko S, Honda M, et al. Identification of differentially expressed gene in hepatocellular carcinoma with cDNA microarrays. Hepatology 2001; 33: 832-840, Xu X, Huang J, Xu Z, et al. Insight into hepatocellular carcinogenesis at transcriptome level by comparing gene expression profiles of hepatocellular carcinoma with those of corresponding noncancerous liver. Proc. Nat. Acad. Sci. USA. 2001; 98: 15089-15094);
(ii) in terms of the differentiation degree of cancer tissues, the types of genes, the expressions of which are different (Shirota Y, Kaneko S, Honda M, et al. Identification of differentially expressed gene in hepatocellular carcinoma with cDNA microarrays. Hepatology 2001; 33: 832-840, Okabe H, Satoh S, Kato T, et al. Genome-wide analysis of gene expression in human hepatocellular carcinomas using cDNA microarray: Identification of genes involved in viral carcinogenesis and tumor progression. Cancer res. 2001; 61: 2129-2137);
(iii) the types of genes, the expressions of which are different between hepatocellular carcinoma derived from hepatitis B and hepatocellular carcinoma derived from hepatitis C (Okabe H, Satoh S, Kato T, et al. Genome-wide analysis of gene expression in human hepatocellular carcinomas using cDNA microarray: Identification of genes involved in viral carcinogenesis and tumor progression. Cancer res. 2001; 61: 2129-2137);
(iv) the types of genes, the expressions of which are different depending on the presence or absence of vascular invasion of hepatocellular carcinoma (Okabe H, Satoh S, Kato T, et al. Genome-wide analysis of gene expression in human hepatocellular carcinomas using cDNA microarray: Identification of genes involved in viral carcinogenesis and tumor progression. Cancer res. 2001; 61: 2129-2137); and
(v) the type of a change in gene expression observed among intrahepatic metastatic cancers, as a result of the clonal analysis of multinodular hepatocellular carcinoma (Cheung S, Chen X, Guan X, et al. Identify metastasis-associated gene in hepatocellular carcinoma through clonality delineation for multinodular tumor. Cancer res. 2002; 62: 4711-4721).
However, with regard to genes associated with recurrence, only the analysis of Iizuka et al. on cancer tissues has existed (Iizuka N, Oka M, Yamada-Okabe H, et al. Oligonucleotide microarray for prediction of early intrahepatic recurrence of hepatocellular carcinoma after curative resection. Lancet 2003; 361: 923-929). The analysis of nontumor liver tissues, which reflects the remaining liver tissues, has not yet been achieved.
It is an object of the present invention to provide a gene associated with hepatocellular carcinoma, and particularly, a gene, which predicts the recurrence of the cancer.
As a result of intensive studies directed towards achieving the aforementioned object, the present inventor has studied the profile of gene expression based on a case where hepatocellular carcinoma has recurred and a case where hepatocellular carcinoma has not recurred, and has succeeded in identification of a gene associated with hepatocellular carcinoma, thereby completing the present invention.
That is to say, the present invention has the following features:
(1) A method for evaluating cancer, which comprises the following steps of:
(a) collecting total RNA from an analyte;
(b) measuring the expression level of at least one gene selected from among the genes shown in Tables 1 to 8; and
(c) evaluating cancer using the measurement result as an indicator.
In the present invention, from among the genes shown in Tables 1 to 8, at least one gene selected from the group consisting of the PSMB8 gene, the RALGDS gene, the GBP1 gene, the RPS14 gene, the CXCL9 gene, the DKFZp564F212 gene, the CYP1B1 gene, the TNFSF10 gene, the NROB2 gene, the MAFB gene, the BF530535 gene, the MRPL24 gene, the QPRT gene, the VNN1 gene, and the IRS2 gene, can be used, for example. Otherwise, from among the genes shown in Tables 1 to 8, at least one gene selected from the group consisting of the PZP gene, the MAP3K5 gene, the TNFSF14 gene, the LMNA gene, the CYP1A1 gene, and the IGFBP3 gene, can be used, for example.
In addition, when such measurement is carried out using GAPDH as an internal standard gene, from among the genes shown in Tables 1 to 8, each gene contained in a gene set consisting of the VNN1 gene and the MRPL24 gene, or a gene set consisting of the PRODH gene, the LMNA gene, and the MAP3K12 gene, can be used.
Moreover, when such measurement is carried out using 18S rRNA as an internal standard gene, from among the genes shown in Tables 1 to 8, each gene contained in a gene set consisting of the VNN1 gene, the CXCL9 gene, the GBP1 gene, and the RALGDS gene, or a gene set consisting of the LMNA gene, the LTBP2 gene, the COL1A2 gene, and the PZP gene, can be used.
The above evaluation of cancer involves prediction of the presence or absence of metastasis or recurrence. Further, an example of such cancer is hepatocellular carcinoma.
The expression level of a gene can be measured by amplifying the gene, using at least one set of primers consisting of the nucleotide sequences shown in SEQ ID NOS: 2n−1 and 2n (wherein n represents an integer between 1 and 114). Otherwise, the expression level of a gene can be measured by amplifying the gene, using a set of primers for amplifying each gene contained in at least one gene set selected from the group consisting of a gene set consisting of the VNN1 gene and the MRPL24 gene, a gene set consisting of the PRODH gene, the LMNA gene, and the MAP3K12 gene, a gene set consisting of the VNN1 gene, the CXCL9 gene, the GBP1 gene, and the RALGDS gene, and a gene set consisting of the LMNA gene, the LTBP2 gene, the COL1A2 gene, and the PZP gene.
(2) A primer set, which comprises at least one set of primers consisting of the nucleotide sequences shown in SEQ ID NOS: 2n−1 and 2n (wherein n represents an integer between 1 and 114).
(3) A primer set, which comprises a set of primers for amplifying each gene contained in at least one gene set selected from the group consisting of a gene set consisting of the VNN1 gene and the MRPL24 gene, a gene set consisting of the PRODH gene, the LMNA gene, and the MAP3K12 gene, a gene set consisting of the VNN1 gene, the CXCL9 gene, the GBP1 gene, and the RALGDS gene, and a gene set consisting of the LMNA gene, the LTBP2 gene, the COL1A2 gene, and the PZP gene.
(4) A kit for evaluating cancer, which comprises any gene shown in Tables 1 to 8.
An example of the aforementioned gene is at least one gene selected from the group consisting of the RALGDS gene, the GBP1 gene, the DKFZp564F212 gene, the TNFSF10 gene, and the QPRT gene.
Moreover, another example of the aforementioned gene is each gene contained in at least one gene set selected from the group consisting of a gene set consisting of the VNN1 gene and the MRPL24 gene, a gene set consisting of the PRODH gene, the LMNA gene, and the MAP3K12 gene, a gene set consisting of the VNN1 gene, the CXCL9 gene, the GBP1 gene, and the RALGDS gene, and a gene set consisting of the LMNA gene, the LTBP2 gene, the COL1A2 gene, and the PZP gene.
Furthermore, the kit of the present invention may comprise the aforementioned primer set.
The present invention provides a gene useful for predicting the recurrence of hepatocellular carcinoma. Cancer can be evaluated by analyzing the increased expression state of such a gene. In particular, using the gene of the present invention, the recurrence of hepatocellular carcinoma can be predicted, and the obtained prediction information is useful for the subsequent therapeutic strategy. Moreover, the use of such a gene and a gene product enables the development of a treatment method for preventing recurrence.
The present invention will be described in detail below.
The present invention is characterized in that the follow-up clinical data collected for a long period of time after the resection of hepatocellular carcinoma are divided into a poor prognosis case group (for example, a case group wherein the cancer recurs within 1 year, leading to death within 2 years) and into a good prognosis case group (for example, a case group wherein the cancer does not recur for 4 or more years), and is characterized in that a gene causing poor prognosis or a gene causing good prognosis (for example, a gene associated with promotion of the recurrence and a gene associated with suppression of the recurrence) is identified based on the characteristics of a gene group, which is expressed in the excised liver tissues. The present invention relates to classification of causal viruses into type B hepatocellular carcinoma cases and into type C hepatocellular carcinoma cases based on clinical data, and identification of a gene having a prognostic correlation from each of the tissues of a nontumor tissue and the tissues of a tumor tissue.
The gene of the present invention is obtained by analyzing the correlation between tissues actually collected from a patient and a pathologic condition thereof, and thereby clarifying the type of a case, a pathologic condition, and a gene, which are used to clarify the correlation between a gene and a pathologic condition.
The postoperative course is observed after an operation to resect liver cancer, and test samples are classified into an early recurrence group and into a late recurrence group.
The term “early recurrence group” is used to mean a case group wherein the cancer recurs within a certain period of time after resection, thereafter leading to death. A recurrence period is not particularly limited. For example, it is 1 year or shorter, or 2 years or shorter. A survival time is not particularly limited either. For example, it is 1 year or shorter, 2 years or shorter, or 3 years or shorter, after recurrence. The term “late recurrence group” is used to mean a case group wherein the cancer does not recur for a certain period of time after resection (for example, 3 years or longer, and preferably 4 years or longer).
In reality, 51 cases, which were subjected to an operation to resect hepatocellular carcinoma at stages I and II, were used as targets. The 51 cases contain 16 cases of type B hepatocellular carcinoma and 35 cases of type C hepatocellular carcinoma. Based on the follow-up clinical data of such cases, 2 cases were selected from the type B hepatocellular carcinoma and 3 cases were selected from the type C hepatocellular carcinoma, and these cases were classified into an early recurrence group. On the other hand, 2 cases selected from the type B hepatocellular carcinoma and 3 cases were selected from the type C hepatocellular carcinoma, and these cases were classified into a late recurrence group. With regard to the RNA portions of the nontumor tissues and tumor tissues of such 10 cases, the following expression profile analysis was carried out.
Total RNA is extracted from each type of the liver tissues of the classified groups, and gene expression profiles are then compared between the groups using a microarray. Such total RNA can be extracted using a commercially available reagent (for example, TRIzol). For detection of an expression profile, Microarray (Affymetrix) is used, for example.
Moreover, the present invention enables the analysis of a gene, which changes expression in the tissues of a nontumor tissue as well as in the tissue of a tumor tissue. The term “nontumor tissue” is used herein to mean liver tissues involved in a resection of hepatocellular carcinoma, which do not contain cancer cells. However, such a “nontumor tissue” does not necessarily mean normal liver tissues, but it also includes tissues affected by chronic hepatitis (hepatitis B or hepatitis C) or liver cirrhosis. For example, a gene up-regulated in a nontumor tissue in a late recurrence group including type B hepatocellular carcinoma cases or type C hepatocellular carcinoma cases, wherein almost all tissues are such affected tissues, can be used as an analysis target. In the case of such tissues affected by chronic hepatitis or liver cirrhosis, a necrotic inflammatory reaction, regenerating nodules, fibrosis attended with decidual liver cells, or the like are observed. Among such cells, there are cells, which can be potential cells causing the development of hepatocellular carcinoma. Accordingly, it is considered that gene expression relevant to prognosis exists in the nontumor tissue. Thus, prognosis (for example, recurrence) can be predicted using such gene expression as an indicator (for example, by analyzing changes in such gene expression).
A gene used for evaluation of cancer is identified based on the correlation of changes in gene expression with phenotype (recurrence, early progression, etc.). The term “evaluation of cancer” is used to mean evaluation regarding the pathologic conditions of cancer or the stage of cancer progression. Such evaluation of cancer includes prediction of the presence or absence of metastasis or recurrence.
The present invention provides an up-regulated gene or a down-regulated gene in terms of recurrence. The term “recurrence” is used to mean that a lesion, which is considered to be a new carcinoma, appears in the liver, after a treatment for a primary lesion has been determined to complete.
Using disease model cells or animals, the identified gene is evaluated in terms of availability as a factor of suppressing the development of pathologic conditions. Namely, (1) the remaining cases of hepatocellular carcinoma, the prognosis of which has been known, are subjected to quantitative analysis of gene expression, and the correlation with the prognosis is studied. (2) The gene is transferred into a hepatocellular carcinoma-cultured cell line, and it is allowed to express therein. Thereafter, the cell growth and a change in malignancy are evaluated based on ability to form colonies in a soft agar plate or ability to form tumors in nude mice. (3) Using a cultured hepatic cell line established from a patient with chronic hepatitis, the gene is transferred into the cells, and it is allowed to express therein. Thereafter, the cell growth and malignant transformation are evaluated by the same method as that described in (2) above. (4) The gene is transferred into the liver of a hepatocellular carcinoma development-model animal, and it is allowed to express therein. Thereafter, the course up to the development of liver cancer is evaluated.
In (1) above, the quantitative analysis of gene expression is carried out by real-time PCR, for example. That is to say, a commercially available reverse transcriptase is used for the total RNA as produced above, so as to synthesize cDNA. As a PCR reagent, a commercially available reagent can be used. Moreover, PCR may be carried out in accordance with commercially available protocols. For example, preliminary heating is carried out at 95° C. for 10 minutes, and thereafter, a cycle consisting of 95° C. for 15 seconds and 60° C. (or 65° C.) for 60 seconds, is repeated 40 times. Examples of an internal standard gene used herein as a target may include housekeeping genes such as glyceraldehyde 3-phosphatase dehydrogenase (GAPDH), 18S ribosomal RNA (18S rRNA), β-Actin, cyclophilin A, HPRT1 (hypoxanthine phosphoribosyltransferase 1), B2M (beta-2 microglobulin), ribosomal protein L13a, or ribosomal protein L4. Persons skilled in the art can appropriately select such an internal standard gene. As an analysis method, absolute quantitative analysis or relative quantitative analysis of an expression level is adopted. The absolute quantitative analysis is preferable. Herein, absolute quantification of an expression level is obtained by determining a threshold line on which a calibration curve becomes optimum and then obtaining the number of threshold PCR cycles and a threshold cycle value (Ct) of each sample. On the other hand, a relative expression level is expressed with a Δ Ct value obtained by subtracting the Ct value of an internal standard gene (for example, GAPDH) from the Ct value of a target gene. Values obtained using the formula (2(−ΔCt)) can be used for evaluation of a linear expression level.
When a calibration curve is produced, values obtained by subjecting standard samples to serial dilution and simultaneous measurement (the samples are placed in a single plate and simultaneously measured, using a single reaction solution) may be used.
When an absolute expression level can be obtained relative to a calibration curve, the absolute expression level of a target gene and that of an internal standard gene are obtained, and the ratio of the target gene expression level/the internal standard gene expression level is calculated for each sample, so as to use it for evaluation.
Genes are selected from the results of the microarray of a late recurrence group and that of an early recurrence group. Thereafter, among genes, regarding which the results of real-time PCR obtained by the aforementioned method correspond with the results of the microarray, those exhibiting a correlation with a recurrence period can be identified as up-regulated genes of nontumor tissue, for example.
As described above, as genes identified as an up-regulated gene, various genes can be selected depending on experimental conditions applied during the identification, such as an internal standard gene, a primer sequence, or an annealing temperature which are used. Also, using various types of statistical methods (for example, Mann-Whitney U test), a gene correlating to a recurrence period can be selected.
The full-length sequence of the gene of the present invention can be obtained as follows. That is to say, it is searched through DNA database, and it can be obtained as known sequence information. Otherwise, the above full-length sequence is isolated from human liver cDNA library by hybridization screening.
In the present invention, genes up-regulated in cases where the cancer has not recurred at an early date (late recurrence) include those shown in Tables 1 to 4. On the other hand, genes up-regulated in cases where the cancer has recurred at an early date include those shown in Tables 5 to 8.
Table 1: Genes (24) up-regulated in a nontumor tissue in a late recurrence group of type B hepatocellular carcinoma cases
Table 2: Genes (10) up-regulated in a nontumor tissue in a late recurrence group of type C hepatocellular carcinoma cases
Table 3: Genes (137) up-regulated in a tumor tissue in a late recurrence group of type B hepatocellular carcinoma cases
Table 4: Genes (104) up-regulated in a tumor tissue in a late recurrence group of type C hepatocellular carcinoma cases
Table 5: Genes (48) up-regulated in a nontumor tissue in an early recurrence group of type B hepatocellular carcinoma cases
Table 6: Genes (12) up-regulated in a nontumor tissue in an early recurrence group of type C hepatocellular carcinoma cases
Table 7: Genes (75) up-regulated in a tumor tissue in an early recurrence group of type B hepatocellular carcinoma cases
Table 8: Genes (38) up-regulated in a tumor tissue in an early recurrence group of type C hepatocellular carcinoma cases
In Table 5, “CTH” and “AL354872” are genes, which encode the same protein.
The above-described genes can be included in a kit for evaluating cancer, singly or in combination, as appropriate. Examples of a gene set consisting of several genes may include those shown in Table 16 (described later). The above genes may have the partial sequence thereof. Such genes can be used as probes for detecting the expression of the genes shown in the table.
Moreover, the kit of the present invention may comprise primers used for gene amplification, a buffer solution, polymerase, etc.
With regard to such primers used for gene amplification, the DNA sequence and mRNA sequence of each gene sequence are obtained from database, and in particular, information including the presence or absence of a variant and exon-intron structure is obtained. The same sequences as sequences of portions corresponding to coding regions are used as target. One primer is intended to bridge over an adjacent exon, and it is designed such that only mRNA is detected. Otherwise, primer candidates are obtained using the web software “Primer3” (provided by Steve Rozen and Whitehead Institute for Biomedical Research), and thereafter, homology search is carried out using BLAST (NCBI) search, so as to select primers, which are able to avoid miss-annealing to similar sequences.
The sequence numbers of preferred primers are represented by the general formulas 2n−1 and 2n (wherein n represents an integer between 1 and 114). In the present invention, a primer represented by 2n−1 and a primer represented by 2n can be used as a set of primers. For example, when n is 1, a primer set consisting of the primers shown in SEQ ID NOS: 1 and 2 can be used, and when n is 2, a primer set consisting of the primers shown in SEQ ID NOS: 3 and 4 can be used. Particularly preferred primers can be obtained, when n is 2, 4, 7, 9, or 17.
Moreover, in (1) above, it is also possible to carry out the quantitative analysis of gene expression via immuno-dot blot assay or immunostaining. Such immuno-dot blot assay or immunostaining can be carried out according to common methods using an antibody reacting with the expression products of the genes shown in Tables 1 to 8. As such an antibody, a commercially available antibody may be used, or an antibody obtained by immunization of animals such as a mouse, a rat, or a rabbit, may also be used.
The present invention will be more specifically described in the following examples. However, these examples are not intended to limit the technical scope of the present invention.
As described below, using human hepatic tissues obtained from type B and type C hepatocellular carcinoma cases, molecules for suppressing the recurrence of hepatocellular carcinoma were identified at a gene level.
In order to understand a recurrence mechanism occurring after an operation to resect hepatocellular carcinoma and determine a gene capable of predicting the presence or absence of recurrence, gene expression profile analysis was carried out, using several cases, the recurrence periods of which were different. 51 cases, which were at stages I and II based on TNM classification, were used as targets. 5 cases wherein the cancer had not recurred for 4 or more years after the operation, and 5 cases wherein the cancer had recurred within 1 year after the operation, were selected. Thereafter, expression analysis was carried out using an HG-U133A array manufactured by Affymetrix.
The TRIzol reagent (Life Technologies, Gaithersburg, Md.) was added to frozen tissues, and the obtained mixture was then homogenated with Polytron. Thereafter, chloroform was added to the homogenate, and they were then fully mixed, followed by centrifugation. After completion of the centrifugation, the supernatant was recovered, and an equivalent amount of isopropanol was added thereto. Thereafter, the precipitate of total RNA was recovered by centrifugation.
Type B hepatocellular carcinoma cases (wherein the causal virus is a hepatitis B virus) were divided into the following groups: the nontumor tissues and tumor tissues of 2 early recurrence cases; and the nontumor tissues and tumor tissues of 2 late recurrence cases. Also, type C hepatocellular carcinoma cases (wherein the causal virus is a hepatitis C virus) were divided into the following groups: the nontumor tissues and tumor tissues of 3 early recurrence cases; and the nontumor tissues and tumor tissues of 3 late recurrence cases. Thus, the total 8 groups were subjected to expression analysis.
For each sample group, 15 μg of total RNA was prepared. Thereafter, biotin-labeled cRNA was synthesized based on GeneChip Expression Analysis Technical Manual by Affymetrix. Using T7-(dt)24 primer and Superscript II reverse transcriptase (Invitrogen Life Technology), the reaction was carried out for 1 hour, so as to synthesize first strand cDNA. Thereafter, E. coli DNA ligase, E. coli DNA polymerase, and E. coli RNase H were added thereto, and the obtained mixture was then allowed to react at 16° C. for 2 hours. Finally, T4 DNA polymerase was added to the reaction product, so as to synthesize double strand cDNA. After cleanup of the cDNA, the BioArray high yield RNA transcript labeling kit (Affymetrix, Inc, CA) was used for in vitro transcription at 37° C. for 4 hours, so as to synthesize biotin-labeled cRNA. A hybridization probe solution was prepared based on the Technical Manual, and the above solution was then added to GeneChip HG-U133A (Affymetrix, Inc, CA; containing 22,283 human genes), obtained by pre-hybridization at 45° C. for 45 minutes. Thereafter, hybridization was carried out at 45° C. for 16 hours. Thereafter, the reaction product was washed with GeneChip Fluidics Station 400 (Affymetrix, Inc, CA), and was then stained with streptavidin phycoerythrin and biotinylated antistreptavidin. Thereafter, the resultant was subjected to scanning using an HP GeneArray scanner (Affymetrix, Inc, CA).
The obtained data was analyzed using GeneSpring ver. 5.0 (SiliconGenetics, Redwood, Calif.). After completion of normalization, using the signal of the control gene BioB used for intrinsic quantification as a detection limit (corresponding to several copies per cell). A gene, which has a signal intensity of 100 or greater and also has a present flag in at least one chip, was defined as a target of the analysis. As a result, 7,444 genes were determined to be such analysis targets. In nontumor tissues, genes having 2.5 times or more difference between the early recurrence group and the late recurrence group have been identified. In tumor tissues, genes having 3 times or more difference between such two groups have been identified.
As a result, among the selected 7,444 genes, genes having 2.5 times or more difference between the absence and the presence of recurrence in nontumor tissues consisted of 34 up-regulated genes and 58 down-regulated genes. On the other hand, genes having 3 time or more difference between such two groups in tumor tissues consisted of 215 up-regulated genes and 110 down-regulated genes. Among these genes, as a gene up-regulated in the recurrence-absent group in both cases of type B and type C, no such genes were found in nontumor tissues, whereas 26 genes were found in tumor tissues. On the other hand, among these genes, as a gene up-regulated in the recurrence-present group in both cases of type B and type C, 2 genes were found in nontumor tissues, whereas 3 genes were found in tumor tissues. Moreover, there were genes up-regulated in both tumor and nontumor tissue. There were found 5 genes up-regulated in the recurrence-absent group, and 10 genes up-regulated in the recurrence-present group (Table 9).
It is to be noted that the total is not 402 but 401 in Table 9. This is because the overlapping of GLUL is a particular case.
From the results shown in Table 9, it can be said that with regard to a difference in recurrence prognosis, a change in gene expression is greater in a tumor-tissue than in a nontumor tissue, and that such a change in gene expression is greater in type B hepatocellular carcinoma cases than in type C hepatocellular carcinoma cases. In addition, there are genes associated with recurrence prognosis, which are found independently of a causal virus, but unexpectedly, such genes are rare. As in the case of the development of cancer, it is considered that different mechanisms are involved in the recurrence of cancer, depending on the type of a causal virus.
In the analysis of a sample phylogenetic tree, the expression profiles of all genes are first divided into nontumor tissues and tumor tissues. In each of such nontumor tissues and tumor tissues, a genetic affiliation, which is not caused by recurrence prognosis but caused by a causal virus, was observed (
It is considered that gene expression affecting recurrence prognosis is caused by a change in the gene expression of limited genes.
As stated above, candidate genes capable of clarifying a recurrence mechanism or predicting the presence or absence of recurrence were found (Tables 1 to 8).
As mentioned below, with regard to genes up-regulated in the nontumor tissues of a late recurrence group and an early recurrence group in type C hepatocellular carcinoma cases, the correlation between the recurrence period and an expression level was studied.
The total 22 nontumor tissue samples, including 6 cases of type C hepatocellular carcinoma used in the gene expression profile analysis, were used as targets. The clinicopathological findings of each case and the recurrence period (that is, the period of time in which the cancer has not yet recurred) are shown in Table 10A.
In addition, the cases shown in Table 10A were changed or revised as a result of follow-up study. Moreover, with regard to the total 35 cases, including cases added as the targets of the present example, the clinicopathological findings of each case and the recurrence period (that is, the period of time in which the cancer has not yet recurred) are shown in Table 10B.
With regard to the total 21 genes consisting of 9 genes (CNgood) up-regulated in the nontumor tissues of the late recurrence group shown in Table 2 and 12 genes (CNbad) up-regulated in the nontumor tissues of the early recurrence group shown in Table 6, the relationship between the recurrence period and an expression level was analyzed.
First, total RNA was extracted from the nontumor liver tissue of each case by the same method as that described in Example 1 above.
In order to eliminate the influence of DNA mixed therein, the total RNA was treated with DNase I (DNase I, TAKARA SHUZO, Kyoto, Japan) at 37° C. for 20 minutes, and it was then purified again with a TRIzol reagent. Using 10 μg of the total RNA, a reverse transcription reaction was carried out with 100 μl of a reaction solution comprising 25 units of AMV reverse transcriptase XL (TAKARA) and 250 μmol of a 9-mer random primer.
Real-time PCR was carried out using 0.25 to 50 ng each of synthetic cDNA. 25 μl of a reaction solution, SYBR Green PCR Master mix (Applied Biosystems, Foster City, Calif.) was used, and ABI PRISM 7000 (Applied Biosystems) was employed. PCR was carried out under conditions wherein preliminary heating was carried out at 95° C. for 10 minutes, and thereafter, a cycle consisting of 95° C. for 15 seconds and 60° C. (or 65° C.) for 60 seconds, was repeated 40 to 45 times.
Using glyceraldehyde 3-phosphatase dehydrogenase (GAPDH) or 18S rRNA as an internal standard gene of each sample, relative quantitative analysis, and partially, absolute quantitative analysis, were carried out. Values obtained by subjecting standard samples to serial dilution and simultaneous measurement, were used to produce a calibration curve. A threshold line for optimization of such a calibration curve was determined, and the number of threshold PCR cycles, a threshold cycle value (Ct) was then obtained for each sample. A Δ Ct value was obtained by subtracting the Ct value of GAPDH or 18S rRNA from the Ct value of a target gene, and the obtained value was defined as the relative expression level of the target gene. Moreover, values obtained using the formula (2(−ΔCt)) were used for evaluation of a linear expression level.
On the other hand, with regard to genes whose absolute expression level can be calculated relative to a calibration curve, the absolute expression level of a target gene and that of an internal standard gene were obtained. Thereafter, the ratio of the target gene expression level/the internal standard gene expression level was calculated for each sample, and it was used for evaluation. All such measurements were carried out in a duplicate manner.
In Tables 11A, 11B, 12A, and 12B, the term “correspondence with microarray” is used to mean that when the ratio between the late recurrence group (case Nos. 59, 18, and 6) and the early recurrence group (case Nos. 14, 15, and 44) was obtained from the results of quantitative PCR performed on 6 cases (case Nos. 59, 18, 6, 14, 15, and 44 in Table 10A or 10B) used in the microarray analysis, genes, the above ratio of which was 1.5 or greater, corresponded with the results of the microarray in Example 1. Genes corresponding with the microarray results were indicated with the mark O. The above ratio is 1.5 or greater, and preferably 2 or greater. The number in the parenthesis adjacent to the mark O indicates such a ratio (the average ratio of 3 cases). The mark X in the “correspondence with microarray” column indicates a gene that does not correspond with the microarray results. The mark XX indicates a gene, which exhibits an opposite correlation with the microarray results.
In Tables 11A, 11B, 12A, and 12B, the term “correlation” is used to mean a correlation between the gene expression level and the recurrence period in 22 cases, or in 31 cases wherein the number of months in which the recurrence of the cancer had occurred was determined. In the case of a significant correlation, O or the r value was indicated, and further, the p value was also indicated.
In Tables 11B and 12B, with regard to genes exhibiting a significant difference in expression levels between 19 cases of the recurrence within 24 months, and 6 cases of no recurrence for 40 months or more (the upper case of the “significant difference between two groups” column in Tables 11B and 12B) or 4 cases of no recurrence for 58 months or more (the lower case of the “significant difference between two groups” column in Tables 11B and 12B), p values (Mann-Whitney U test) were shown in the “significant difference between two groups” column.
Primer sequences (sense strand (forward), antisense strand (reverse)) used for the test are shown in Tables 11A, 11B, 12A, and 12B (SEQ ID NOS: 1 to 88).
The results obtained by analyzing the 9 gene candidates (CNgood) up-regulated in nontumor tissues in the late recurrence group of type C hepatocellular carcinoma cases are shown in Tables 11A and 11B. Table 11A shows the analysis results obtained by quantitative PCR, which was performed on the cases shown in Table 10A as targets, under the conditions shown in Table 11A using GAPDH as an internal standard gene.
As a result, it was found that 8 genes corresponded with the microarray results, and that among such genes, 4 genes (RALGDS, GBP1, DKFZp564F212, and TNFSF10) exhibited a correlation with the recurrence period.
Likewise, Table 11B shows the analysis results obtained by quantitative PCR, which was performed on the 10 genes shown in Table 11B and the cases shown in Table 10B as targets, under the conditions shown in the table using GAPDH or 18S rRNA as an internal standard gene.
As a result, it was found that when GAPDH was used as an internal standard gene, all the 9 gene candidates exhibiting up-regulation in the late recurrence group corresponded with the microarray results, and that among such genes, 5 genes exhibited a correlation with the recurrence period. In addition, when 18S rRNA was used as an internal standard gene also, all the above 9 gene candidates corresponded with the microarray results, and among them, 8 genes exhibited a correlation with the recurrence period.
A significant difference test was carried out on two groups, the late recurrence group and the early recurrence group. As a result, it was found that when GAPDH was used as a standard gene, 3 genes exhibited a significant difference, and that when 18S rRNA was used as a standard gene, 5 genes exhibited a significant difference.
Subsequently, the results obtained by analyzing the 12 gene candidates (CNbad) up-regulated in nontumor tissues in the early recurrence group of type C hepatocellular carcinoma cases are shown in Tables 12A and 12B. Table 12A shows the analysis results obtained by quantitative PCR, which was performed on the cases shown in Table 10A as targets, under the conditions shown in Table 12A using GAPDH as an internal standard gene.
As a result, 7 genes corresponded with the microarray results. No genes significantly exhibited a correlation with the recurrence period. However, the QPRT gene significantly exhibited an opposite correlation. Accordingly, this gene was identified as a gene up-regulated in nontumor tissues in the late recurrence group.
Likewise, Table 12B shows the analysis results obtained by quantitative PCR, which was performed on the cases shown in Table 10B as targets, under the conditions shown in Table 12B using GAPDH or 18S rRNA as an internal standard gene.
As a result, it was found that when GAPDH or 18S rRNA was used as an internal standard gene, among 12 gene candidates exhibiting up-regulation in the early recurrence group, 1 gene corresponded with the microarray results. However, when GAPDH was used as an internal standard gene, the MAFB gene, the MRPL24 gene, the VNN1 gene, and IRS2 gene significantly exhibited an opposite correlation. In addition, when 18S rRNA was used as an internal standard gene, the NROB2 gene, the MAFB gene, the BF530535 gene, the MRPL24 gene, the QPRT gene, the VNN1 gene, and the IRS2 gene significantly exhibited an opposite correlation. Accordingly, these genes were identified as genes up-regulated in nontumor tissues in the late recurrence group.
As stated above, as a result of the studies carried out under various conditions, the following 15 genes were identified as genes expressed in nontumor tissues, which can be used for prediction of the recurrence of cancer in type C hepatocellular carcinoma cases: the PSMB8 gene, the RALGDS gene, the GBP1 gene, the RPS14 gene, the CXCL9 gene, the DKFZp564F212 gene, the CYP1B1 gene, the TNFSF10 gene, the NROB2 gene, the MAFB gene, the BF530535 gene, the MRPL24 gene, the QPRT gene, the VNN1 gene, and the IRS2 gene. The meanings of the aforementioned genes are as follows:
PSMB8 gene (which is also referred to as LMP7 gene): A proteasome subunit, beta type, 8 gene
RALGDS gene: A ral guanine nucleotide dissociation stimulator gene
GBP1 gene: A guanylate-binding protein 1 gene
RPS14 gene: A ribosomal protein S14 gene
CXCL9 gene: A chemokine (C-X-C motif) ligand 9 gene
DKFZp564F212 gene: An expression gene discovered by German Human Genome Project, whose gene product has not been identified and whose functions have not yet been predicted.
CYP1B1 gene: A cytochrome P450, family 1, subfamily B, polypeptide 1 gene
TNFSF10: An abbreviation of TNF (ligand) super family, member 10, and a TNF-related apoptosis inducing ligand (TRAIL) gene
NR0B2 gene: A nuclear receptor subfamily 0, group B, member 2 gene
MAFB gene: A v-maf musculoaponeurotic fibrosarcoma oncogene homolog B gene
BF530535 gene: A gene whose gene product has not been identified and whose functions have not yet been predicted.
MRPL24 gene: A mitochondrial ribosomal protein L24 gene
QPRT gene: A quinolinate phosphoribosyltransferase gene
VNN1 gene: A vanin 1 gene
IRS2 gene: An insulin receptor substrate 2 gene
As mentioned below, with regard to genes up-regulated in the nontumor tissues of a late recurrence group and an early recurrence group in type B hepatocellular carcinoma cases, the correlation between the recurrence period and an expression level was studied.
The total 16 nontumor tissue samples, including 4 cases of type B hepatocellular carcinoma used in the gene expression profile analysis, were used as targets. The clinicopathological findings of each case and the recurrence period (that is, the period of time in which the cancer has not yet recurred) are shown in Table 13.
With regard to the total 71 genes consisting of 24 genes (BNgood) up-regulated in the nontumor tissues of the late recurrence group shown in Table 1 and 47 genes (BNbad) up-regulated in the nontumor tissues of the early recurrence group shown in Table 5, the relationship between the recurrence period and an expression level was analyzed.
First, total RNA was extracted from the nontumor hepatic tissue of each case by the same method as that described in Example 1 above.
In order to eliminate the influence of DNA mixed therein, the total RNA was treated with DNase I (DNase I, TAKARA SHUZO, Kyoto, Japan) at 37° C. for 20 minutes, and it was then purified again with a TRIzol reagent. Using 10 μg of the total RNA, a reverse transcription reaction was carried out with 100 μl of a reaction solution comprising 25 units of AMV reverse transcriptase XL (TAKARA) and 250 pmol of a 9-mer random primer.
Real-time PCR was carried out using 0.25 to 50 ng each of synthetic cDNA. 25 μl of a reaction solution, SYBR Green PCR Master mix (Applied Biosystems, Foster City, Calif.) was used, and ABI PRISM 7000 (Applied Biosystems) was employed. PCR was carried out under conditions wherein preliminary heating was carried out at 95° C. for 10 minutes, and thereafter, a cycle consisting of 95° C. for 15 seconds and 60° C. (or 65° C.) for 60 seconds, was repeated 40 to 45 times.
Using GAPDH or 18S rRNA as an internal standard gene of each sample, absolute quantitative analysis was carried out. Values obtained by subjecting standard samples to serial dilution and simultaneous measurement, were used to produce a calibration curve.
The absolute expression level of a target gene and that of an internal standard gene were obtained. Thereafter, the ratio of the target gene expression level/the internal standard gene expression level was calculated for each sample, and it was used for evaluation. All such measurements were carried out in a duplicate manner.
As with the descriptions in Example 2, the term “correspondence with microarray” shown in Tables 14 and 15 is used to mean that when the ratio of the late recurrence group (case Nos. 67 and 60) and the early recurrence group (case Nos. 13 and 9) was obtained from the results of quantitative PCR performed on 4 cases (case Nos. 67, 60, 13, and 9 in Table 13) used in the microarray analysis, genes, the above ratio of which was 1.5 or greater, corresponded with the results of the microarray in Example 1. The mark O is given to genes, when the above ratio of is 1.5 or greater, and preferably 2 or greater. The number in the parenthesis adjacent to the mark O indicates the value of such a ratio. The mark X in the “correspondence with microarray” column indicates a gene that does not correspond with the microarray results. The mark XX indicates a gene that exhibits an opposite correlation to the microarray results.
In the “correlation” columns in Tables 14 and 15, with regard to genes, which exhibited a correlation between the gene expression level and the recurrence period in 10 cases wherein the number of months in which the recurrence of the cancer had occurred was determined, the r value and the p value were described.
In the “significant difference between two groups” column in Tables 14 and 15, with regard to genes exhibiting a significant difference in expression levels between 6 cases of the recurrence within 24 months, and 8 cases of no recurrence for 48 months or more (the upper case of the “significant difference between two groups” in Tables 14 and 15) or 6 cases of no recurrence for 60 months or more (the lower case of the “significant difference between two groups” in Tables 14 and 15), p values (Mann-Whitney U test) were indicated.
Primer sequences (sense strand (forward), antisense strand (reverse)) used for the test are shown in Tables 14 and 15 (SEQ ID NOS: 89 to 228).
The results obtained by analyzing the 24 gene candidates (BNgood) up-regulated in nontumor tissues in the late recurrence group of type B hepatocellular carcinoma cases are shown in Tables 14. Table 14 shows the analysis results obtained by quantitative PCR, which was performed on the cases shown in Table 13 as targets, under the conditions shown in Table 14 using GAPDH or 18S rRNA as an internal standard gene.
As a result, it was found that when GAPDH was used as an internal standard gene, 19 out of the 24 gene candidates exhibiting up-regulation in the late recurrence group corresponded with the microarray results, and that among such genes, no genes exhibited a correlation with the recurrence period. In addition, when 18S rRNA was used as an internal standard gene, 9 out of the above 24 gene candidates corresponded with the microarray results, and among them, only 1 gene (PZP gene) exhibited a correlation with the recurrence period
A significant difference test was carried out on two groups, the late recurrence group and the early recurrence group. As a result, it was found that when GAPDH was used as a standard gene, only one gene (MAP3K5 gene) exhibited a significant difference, and that when 18S rRNA was used as a standard gene, only one gene (TNFSF14 gene) exhibited a significant difference. On the contrary, there was one gene (LMNA gene), which had a significant difference, oppositely correlating to the recurrence period. Accordingly, this gene was identified as a gene up-regulated in nontumor tissues in the early recurrence group.
Subsequently, the results obtained by analyzing the 47 gene candidates (BNbad) up-regulated in nontumor tissues in the early recurrence group of type B hepatocellular carcinoma cases are shown in Table 15. Table 15 shows the analysis results obtained by quantitative PCR, which was performed on the cases shown in Table 13 as targets, under the conditions shown in Table 15 using GAPDH or 18S rRNA as an internal standard gene.
As a result, it was found that when GAPDH was used as an internal standard gene, 16 gene corresponded with the microarray results, but that no genes significantly exhibited a correlation with the recurrence period. However, the IGFBP3 gene significantly exhibited an opposite correlation in the significant difference test between two groups. Accordingly, this gene was identified as a gene up-regulated in nontumor tissues in the late recurrence group.
In addition, when 18S rRNA was used as an internal standard gene, 45 genes corresponded with the microarray results, but that no genes significantly exhibited a correlation with the recurrence period. However, the CYP1A1 gene significantly exhibited a correlation in a significant difference test between two groups. Accordingly, this gene was identified as a gene up-regulated in nontumor tissues in the early recurrence group.
As stated above, the following 6 genes were identified as genes expressed in nontumor tissues, which can be used for prediction of the recurrence of cancer in type B hepatocellular carcinoma cases: the PZP gene, the MAP3K5 gene, the TNFSF14 gene, the LMNA gene, the CYP1A1 gene, and the IGFBP3 gene. The meanings of the aforementioned genes are as follows:
PZP gene: A pregnancy-zone protein gene
MAP3K5 gene: A mitogen-activated protein kinase 5 gene
TNFSF14 gene: A tumor necrosis factor (ligand) superfamily, member 14 gene
LMNA gene: A lamin A/C gene
CYP1A1 gene: A cytochrome P450, family 1, subfamily A, polypeptide 1 gene
IGFBP3 gene: An insulin-like growth factor binding protein 3 gene
By combining several genes expressed in nontumor tissues used for prediction of the recurrence of type C or B hepatocellular carcinoma, which were obtained from the results of Examples 2 and 3, it becomes possible to carry out recurrence prediction more precisely. As such gene sets, many types of sets are conceived. Examples of the aforementioned combination are shown in Table 16.
When GAPDH is used as an internal standard gene for normalization of gene expression in the distinction of an early recurrence group wherein the cancer has recurred within 24 months from a late recurrence group wherein the cancer has not recurred for 40 months or more, the gene expression level of VNN1 and that of MRPL24 may be examined. Otherwise, when 18S rRNA is used as an internal standard gene for normalization in the above distinction, the expression level of each gene of a gene set consisting of VNN1, CXCL9, GBP1, and RALGDS may be examined. The expression level of each of the aforementioned genes is assigned to a discriminant using a discriminant function coefficient obtained regarding each gene, and the obtained value is used for distinction. The expression level of the above gene group is analyzed. In the case of GAPDH normalization, the classification rate between the early recurrence group and the late recurrence group is found to be 88%, and in the case of 18S rRNA, the classification rate is found to be 100%.
When GAPDH is used as an internal standard gene for normalization in the distinction of an early recurrence group wherein the cancer has recurred within 24 months from a late recurrence group wherein the cancer has not recurred for 48 months or more, the expression level of each gene of a gene set consisting of PRODH, LMNA, and MAP3K12 may be examined. Otherwise, when 18S rRNA is used as an internal standard gene for normalization in the above distinction, the expression level of each gene of a gene set consisting of LMNA, LTBP2, COL1A2, and PZP may be examined. As described above, such expression levels are assigned to a discriminant, and the obtained values are used for distinction. The expression level of the above gene group is analyzed. In both cases of correlation with GAPDH and 18S rRNA, the classification rate between the early recurrence group and the late recurrence group is found to be 100%.
The meanings of the aforementioned genes are as follows:
PRODH gene: A proline dehydrogenase (oxidase) 1 gene
LTBP2 gene: A latent transforming growth factor beta binding protein 2 gene
COL1A2 gene: A collagen, type I, alpha 1 gene
MAP3K12 gene: A mitogen-activated protein kinase 12 gene
By identifying common genes derived from a patient and a healthy subject and cause-specific genes, it becomes possible to predict prognosis and recurrence. Accordingly, the thus identified genes can be used for diagnosis, the development of treatment methods, and a strategy of selecting a therapeutic agent (Taylor-made medicine).
SEQ ID NOS: 1 to 228: synthetic DNA
Number | Date | Country | Kind |
---|---|---|---|
2003-299363 | Aug 2003 | JP | national |
2003-334444 | Sep 2003 | JP | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/JP04/12425 | 8/23/2004 | WO | 00 | 9/5/2006 |