The present invention relates to a method for assessing the risk of developing hepatocellular carcinoma from non-alcoholic steatohepatitis.
Worldwide, with the rising prevalence of obesity, diabetes and the like caused by high-fat diets, the number of hepatocellular carcinoma cases with non-alcoholic steatohepatitis (NASH) as a precancerous condition is rapidly increasing and receiving much attention.
In Japan, the incidence of hepatocellular carcinoma has typically been high, with about 90% of the cases occurring in the context of chronic hepatitis/cirrhosis caused by hepatitis B or C virus infection. In recent years, however, the number of hepatocellular carcinoma cases with no apparent history of hepatitis virus infection has rapidly increased (Non Patent Literature 1), and NASH-derived hepatocellular carcinoma may account for a large proportion of these cases. Thus, the number of hepatocellular carcinoma cases with NASH as a precancerous condition is increasing worldwide. Despite this, the concept and definition of NASH is currently still in the process of being established in terms of molecular biology and histopathology.
Hepatocellular carcinoma cases occurring in the context of NASH may not be able to undergo extensive surgery or the like because their functional hepatic reserve is already reduced due to hepatitis, cirrhosis or the like at the time of diagnosis. To improve treatment outcomes, early diagnosis is crucial. However, the follow-up of the NASH-derived hepatocarcinogenic process can last for decades, and therefore strong incentives are required for patients to continue seeing their doctors. Furthermore, frequent diagnostic imaging is a significant burden to patients. Therefore, the ideal is to diagnose the carcinogenic risk of individual NASH patients and to follow up patients of the high-risk group particularly frequently. Prediction of the risk of developing hepatocellular carcinoma from NASH is considered as an unmet medical need.
By methylome analysis using a large number of tissue specimens and performed over many years, the present inventors have shown that carcinogenic factor-specific aberrant DNA methylation profiles are established in various organs from the precancerous stage (Non Patent Literature 2, etc.). In particular, they have made achievements regarding hepatocarcinogenesis since the early days of research on the epigenetics of carcinogenesis, and were the first in the world to report that DNA methylation abnormalities are already occurring in the stages of chronic hepatitis/cirrhosis caused by hepatitis virus infection (Non Patent Literature 3). Since then, they have accumulated methylome analysis of tissue specimens and have reported that the DNA methylation profile (at which CpG sites on the whole genome the cytosine base will undergo methylation modification) at the precancerous stage caused by hepatitis virus infection is inherited by hepatocellular carcinoma and determines the cancer malignancy and case prognosis (Non Patent Literatures 4, 5, etc.).
The present inventors were also among the first to analyze DNA methylation in the NASH-derived hepatocarcinogenic process. The present inventors' genome-wide DNA methylation analysis of normal liver tissue, NASH tissue, hepatocellular carcinoma tissue in the context of NASH, non-cancerous liver tissue of hepatocellular carcinoma cases in the context of hepatitis virus infection, and hepatocellular carcinoma tissue confirmed that NASH has a specific DNA methylation profile that is different from both normal liver tissue and chronic hepatitis/cirrhosis caused by hepatitis virus infection, and furthermore, revealed that the NASH-specific DNA methylation profile becomes more abnormal with the progression of NASH and is inherited by hepatocellular carcinoma occurring in the context of NASH (Non Patent Literature 6). Furthermore, the present inventors have shown that NASH-specific DNA methylation abnormalities contribute to multistage carcinogenesis through aberrant expression of several cancer-related genes (Non Patent Literature 7). Thus, the present inventors have advanced our understanding of the pathology of NASH by identifying the epigenetic changes in hepatocarcinogenesis specific to NASH.
The above studies by the present inventors show that DNA methylation abnormalities, which are stably maintained by a mechanism of maintenance methylation, are excellent biomarkers, superior to gene expression abnormalities and the like. Once they occur, DNA methylation abnormalities are inherited by the action of maintenance methyltransferase DNMT1, and thus the DNA methylation profile reflects the accumulated effects of environmental factors, including carcinogens, to which a person has been exposed over his/her lifetime.
Therefore, it is possible to diagnose the carcinogenic risk by quantifying DNA methylation levels of appropriate marker CpG sites in tissues and the like that are the origin of the carcinoma. Furthermore, since DNA methylation is maintained by covalent bond on the DNA duplex, it is more stable than diagnostic indicators such as mRNA or protein expression levels, and can be detected reproducibly with a highly sensitive detection method.
Patent Literatures 1, 2 and 3 disclose methods for assessing the prognosis of renal cell carcinoma by detecting the methylation level of the CpG sites of genes such as FAM150A. Patent Literature 4 discloses a method for assessing the prognosis of endometrial carcinoma by detecting the methylation level of specific CpG sites. Patent Literature 5 discloses a method for assessing the risk of canceration of upper urinary tract urothelial tissue by detecting the methylation level of the CpG sites of genes such as TENM3. Patent Literature 6 discloses a method for evaluating the risk of hepatocellular carcinoma by detecting the methylation level of the CpG sites in the exon regions of the MGRN1 gene.
The present invention provides a method for assessing the risk of developing hepatocellular carcinoma from NASH based on the methylation level of DNA.
The present invention provides the following.
[1] A method for detecting hepatocytes or tissue comprising hepatocytes having a risk of developing hepatocellular carcinoma, the method comprising:
detecting DNA methylation level of a target CpG site in genomic DNA derived from hepatocytes or tissue comprising hepatocytes with non-alcoholic steatohepatitis and
detecting hepatocytes or tissue comprising hepatocytes having a risk of developing hepatocellular carcinoma, from the detected DNA methylation level,
wherein the target CpG site is at least one CpG site selected from the group consisting of the CpG sites located at or in the vicinity of the positions on the chromosome described in Table 1 below.
[2] A method for detecting a subject having a risk of developing hepatocellular carcinoma, the method comprising the following:
detecting DNA methylation level of a target CpG site in genomic DNA derived from hepatocytes or tissue comprising hepatocytes of a subject with non-alcoholic steatohepatitis; and
detecting a subject having a risk of developing hepatocellular carcinoma from the detected DNA methylation level,
wherein the target CpG site is at least one CpG site selected from the group consisting of the CpG sites located at or in the vicinity of the positions on the chromosome described in Table 1 below.
[3] A method for acquiring data to detect hepatocytes or tissue comprising hepatocytes having a risk of developing hepatocellular carcinoma or a subject having a risk of developing hepatocellular carcinoma, the method comprising the following:
detecting DNA methylation level of a target CpG site in genomic DNA derived from hepatocytes or tissue comprising hepatocytes of a test subject with non-alcoholic steatohepatitis; and
acquiring data on whether the hepatocytes, the tissue comprising hepatocytes, or the subject have a risk of developing hepatocellular carcinoma, from the detected DNA methylation level,
wherein the target CpG site is at least one CpG site selected from the group consisting of the CpG sites located at or in the vicinity of the positions on the chromosome described in Table 1 below.
[4] The method according to any one of [1] to [3], wherein the target CpG site is at least one CpG site selected from the group consisting of the CpG sites at position 130,834,003 on chromosome 10, position 114,256,392 on chromosome 2, position 28,829,182 on chromosome 6, position 144,601,781 on chromosome 8, position 144,601,800 on chromosome 8, and position 35,700,382 on chromosome 6, and the CpG sites located in the vicinity thereof.
[5] The method according to any one of [1] to [4], wherein the detection of the DNA methylation level comprises detecting the DNA methylation level of the target CpG site using the genomic DNA treated with bisulfite.
According to the present invention, the risk of developing hepatocellular carcinoma from NASH can be assessed. The present invention enables early detection of NASH patients with high risk of developing hepatocellular carcinoma, for whom follow-up is particularly required. The present invention provides incentives to seek follow-up consultations to NASH patients of the high-risk group of carcinogenesis, and contributes to the prevention of the development of hepatocellular carcinoma from NASH, or the improvement of early diagnosis and treatment outcomes of hepatocellular carcinoma.
The present inventors have further developed their own epigenetic studies of NASH conducted so far to enable the prediction of the risk of developing hepatocellular carcinoma from NASH, an unmet medical need, based on NASH-specific DNA methylation profiles.
The first step in predicting future development of hepatocellular carcinoma in NASH is to distinguish liver tissue of NASH associated with hepatocellular carcinoma [NASH-W] from normal liver tissue [NLT]. As shown in the Examples described below, the present inventors first identified 437 CpG sites which discriminate NASH-W from NLT by genome-wide DNA methylation analysis (
Therefore, in one embodiment, the present invention provides a method for detecting hepatocytes or tissue comprising hepatocytes having a risk of developing hepatocellular carcinoma, the method comprising the following:
detecting DNA methylation level of a target CpG site in genomic DNA derived from hepatocytes or tissue comprising hepatocytes with NASH; and
detecting hepatocytes or tissue comprising hepatocytes having a risk of developing hepatocellular carcinoma, from the detected DNA methylation level.
In another embodiment, the present invention provides a method for detecting a subject having a risk of developing hepatocellular carcinoma, the method comprising the following:
detecting DNA methylation level of a target CpG site in genomic DNA derived from hepatocytes or tissue comprising hepatocytes of a subject with NASH; and
detecting a subject having a risk of developing hepatocellular carcinoma from the detected DNA methylation level.
In another embodiment, the present invention provides a method for acquiring data to detect hepatocytes or tissue comprising hepatocytes having a risk of developing hepatocellular carcinoma or a subject having a risk of developing hepatocellular carcinoma, the method comprising the following:
detecting DNA methylation level of a target CpG site in genomic DNA derived from hepatocytes or tissue comprising hepatocytes of a subject with NASH; and
acquiring data on whether the hepatocytes, the tissue comprising hepatocytes, or the subject have a risk of developing hepatocellular carcinoma, from the detected DNA methylation level.
In the following present description, the above method for detecting hepatocytes or tissue comprising hepatocytes having a risk of developing hepatocellular carcinoma, method for detecting a subject having a risk of developing hepatocellular carcinoma, and method for acquiring data to detect hepatocytes or tissue comprising hepatocytes having a risk of developing hepatocellular carcinoma or a subject having a risk of developing hepatocellular carcinoma are also collectively referred to as “the method of the present invention”.
In the present description, the “CpG site” means a site where cytosine (C) and guanine (G) are bonded by a phosphodiester bond (p) in DNA. A region where CpG sites appear with high frequency is called a CpG island.
In the present description, “DNA methylation” means a state in which the carbon at position 5 of cytosine is methylated in DNA. Moreover, in the present description, the “DNA methylation level” of a CpG site means the proportion of methylated DNA at the CpG site. In the present description, a high or low DNA methylation level means that the proportion of methylated DNA is high or low, respectively.
The “subject” to whom the method of the present invention is applied can be any subject with NASH. Preferably, the “subject” is a NASH patient in need of assessing the risk of developing hepatocellular carcinoma, and examples thereof include a NASH patient who has not yet developed hepatocellular carcinoma.
The definitive diagnosis of NASH is established by liver biopsy from patients with findings of fatty liver. The diagnosis of fatty liver can be made by abdominal ultrasonography or abdominal CT scan. More specifically, a patient is diagnosed with NASH when the patient meets the following three criteria: i) non-alcoholic (not drink excessively, e.g., an alcohol consumption of 20 g or less per day), ii) confirmation of steatohepatitis by liver biopsy findings, and iii) no confirmation of liver damage from other causes (such as hepatitis virus infection).
Examples of the target CpG site used in the method of the present invention include the CpG sites shown in Tables 2A and B below, and the CpG sites located in the vicinity thereof. In the present description, the position of the CpG site on the chromosome is expressed based on the position on the NCBI database Genome Build 37, which is a human reference genome sequence. In the present description, the “vicinity” of a position on a chromosome refers to a region from 200 bases upstream (5′ side of DNA) to 200 bases downstream (3′ side of DNA), preferably from 100 bases upstream to 100 bases downstream, more preferably from 50 bases upstream to 50 bases downstream, and further more preferably from 20 bases upstream to 20 bases downstream from that position on the chromosome. Alternatively, the CpG sites contained in the same CpG island as a certain CpG site are considered to be located in the vicinity of each other.
The target CpG site used in the method of the present invention may be at least one selected from the group consisting of the CpG sites shown in Tables 2A and B, and the CpG sites located in the vicinity thereof. For example, any one, any two or more, or any five or more selected from the group consisting of the CpG sites shown in Table 2A and the CpG sites located in the vicinity thereof, or all of them may be used as the target CpG site; any one, any two or more, or any five or more selected from the group consisting of the CpG sites shown in Table 2B and the CpG sites located in the vicinity thereof, or all of them may be used as the target CpG site; and any one, any two or more, or any five or more selected from the group consisting of the CpG sites shown in Tables 2A and B and the CpG sites located in the vicinity thereof, or all of them may be used as the target CpG site.
Preferably, any one, any two or more, or any five or more selected from the group consisting of the CpG sites shown in Tables 2A and B, or all of them may be used as the target CpG site in the method of the present invention.
More preferably, any one, any two or more, or any five or more selected from the group consisting of the CpG sites at position 130,834,003 on chromosome 10, position 114,256,392 on chromosome 2, position 28,829,182 on chromosome 6, position 144,601,781 on chromosome 8, position 144,601,800 on chromosome 8, and position 35,700,382 on chromosome 6, and the CpG sites located in the vicinity thereof, or all of them are used as the target CpG site in the method of the present invention.
Furthermore preferably, in the method of the present invention, any one, two, three, four, or five selected from the group consisting of the CpG sites at position 130,834,003 on chromosome 10, position 114,256,392 on chromosome 2, position 28,829,182 on chromosome 6, position 144,601,781 on chromosome 8, position 144,601,800 on chromosome 8, and position 35,700,382 on chromosome 6, or all six of them are used as the target CpG site.
In the method of the present invention, the DNA methylation levels of the above target CpG sites in genomic DNA derived from hepatocytes or tissue comprising hepatocytes with NASH are used as an index to assess the risk of developing hepatocellular carcinoma in the hepatocytes, the tissue comprising hepatocytes, or the subject from whom those cells or tissue are derived. Therefore, the target CpG site can be used as a marker for assessing the risk of developing hepatocellular carcinoma from NASH based on its DNA methylation level.
The genomic DNA used in the method of the present invention to detect the methylation levels can be prepared from hepatocytes or tissue comprising hepatocytes with NASH, for example, liver tissue or hepatocytes collected by surgery or biopsy on a NASH patient or biopsy for definitive NASH diagnosis. The liver tissue or hepatocytes can be fresh liver tissue collected from a living body, frozen liver tissue frozen after collection, liver tissue fixed in formalin and embedded in paraffin after collection, the hepatocytes contained in theses tissues, and the like. Among these, frozen liver tissue or hepatocytes are preferable from the viewpoint of suppressing the degradation of genomic DNA and the like and more efficiently detecting the DNA methylation level.
The method for preparing the genomic DNA from hepatocytes or tissue comprising hepatocytes is not particularly limited, and a known method can be appropriately selected and used. Examples of known methods for preparing DNA include the phenol-chloroform method, or a DNA extraction method using a commercially available DNA extraction kit, for example, QIAamp DNA Mini kit (manufactured by Qiagen), Clean Columns (manufactured by NexTec), AquaPure (manufactured by Bio-Rad), ZR Plant/Seed DNA Kit (manufactured by Zymo Research), prepGEM (manufactured by ZyGEM), and BuccalQuick (manufactured by TrimGen) and the like.
Preferably, the prepared genomic DNA is treated with bisulfite. The method for the bisulfite treatment of DNA is not particularly limited, and a known method can be appropriately selected and used. Examples of known methods for the bisulfite treatment include methods using a commercially available kit such as an EZ DNA
Methylation-Gold™ Kit (manufactured by Zymo Research), EpiTect Bisulfite Kit (manufactured by Qiagen), MethylEasy (manufactured by Human Genetics Signatures Pty), Cells-to-CpG Bisulfite Conversion Kit (manufactured by Applied Biosystems), and CpGenome Turbo Bisulfite Modification Kit (manufactured by MERCK MILLIPORE). As a result of bisulfite treatment, unmethylated cytosine residues of genomic DNA are converted to uracil, but methylated cytosine residues are not converted, and remain as cytosine (see Nucleic Acids Res, 1994, 22:2990-7).
Furthermore, the bisulfite-treated DNA may be amplified. The method of amplification is not particularly limited, but PCR is preferably used. As for the method and conditions of amplification, known methods and conditions can be appropriately selected and used according to the sequence, length, amount, and the like of the DNA to be amplified.
When amplifying the bisulfite-treated DNA by PCR or the like, DNA containing at least one of the target CpG sites described above may be amplified. The chain length of the PCR amplification products can be appropriately selected while considering factors such as the shortening of the PCR amplification time, as well as the shortening of methylation level detection time and the accuracy of methylation level detection. For example, the chain length of the PCR amplification products is preferably 500 bp or less, more preferably 300 bp or less, and further more preferably 100 bp or less, while the lower limit is 30 to 40 bp, which is the chain length of the PCR amplification products when using a primer of around 15 mer which can avoid non-specific hybridization in PCR. Alternatively, it is preferable to design the primer to have a rich content of the target CpG site in the PCR amplification products.
In the method of the present invention, the method for detecting the DNA methylation level of a CpG site may be a method that can quantify the DNA methylation level of a given CpG site, and a known method can be appropriately selected. Examples of such known method include single-base extension reaction using a methylation/unmethylation-specific probe, mass spectrometry, pyrosequencing (registered trademark) (Anal Biochem, 2000, 10:103-110), methylation-sensitive high-resolution melting [MS-HRM] curve analysis (Nat Protoc, 2008, 3:1903-8), methylation-specific polymerase chain reaction [MS-PCR] using real-time quantitative PCR, bisulfite direct sequencing, bisulfite cloning sequencing, COBRA (analysis by the combined use of bisulfite and a restriction enzyme), and a method using ion exchange chromatography (see WO 2014/136930).
The single-base extension reaction using a methylation/unmethylation-specific probe detects the methylation of DNA at a CpG site by utilizing a single-base extension reaction using a probe constructed so as to have a base complementary to methylated cytosine or unmethylated cytosine at the 3′-terminus. Preferable examples of this method include a bead array method (for example, Infinium″ assay provided by Illumina, Inc.). The specific procedure for this technique is exemplified below.
First, the whole genome is amplified using the genomic DNA thus treated with bisulfite as a template, and enzymatic fragmentation (usually, fragmentation into about 300 to 600 bp) is performed to dissociate it into a single strand. On the other hand, a probe, which hybridizes to genomic DNA converted by bisulfite treatment, and in which the base at the 3′-terminus of the probe is a base complementary to cytosine at the target CpG site, is prepared. That is, when the CpG site is methylated, the base at the 3′-terminus of the probe is guanine, whereas when the CpG site is not methylated, the base at the 3′-terminus of the probe is adenine. These two types of probe different only in the base at the 3′-terminus complementary to the target CpG site are hybridized to the single-stranded DNA fragment, and a single-base extension reaction is performed in the presence of a fluorescently labeled base. As a result, when the CpG site of the single-stranded fragment is methylated, a fluorescently labeled base is incorporated into the probe in which the 3′-terminal base is guanine (probe for methylation detection) by a single-base extension reaction, but no fluorescently labeled base is incorporated into the probe in which the 3′-terminal base is adenine (probe for unmethylation detection) as no single-base extension reaction occurs due to the mismatch of the 3′-terminal base. On the other hand, when the CpG site of the single-stranded fragment is not methylated, a fluorescently labeled base is incorporated into the probe for unmethylation detection, but no fluorescently labeled base is incorporated into the probe for methylation detection. Therefore, the DNA methylation level of the target CpG site can be calculated from the intensity of fluorescence emitted by the probe for methylation detection and/or the probe for unmethylation detection.
Alternatively, a probe which hybridizes to genomic DNA converted by bisulfite treatment, and in which the 3′-terminal base of the probe is a base complementary to guanine at the target CpG site, may be used instead of the probe for methylation detection and the probe for unmethylation detection. Then, this probe is hybridized to the single-stranded DNA fragment, and a single-base extension reaction is performed in the presence of guanine labeled with a fluorescent substance and/or adenine labeled with a fluorescent dye different from the fluorescent substance. As a result, when the CpG site is methylated, fluorescently labeled guanine is incorporated into the probe, whereas when the CpG site is not methylated, fluorescently labeled adenine is incorporated into the probe. Therefore, the DNA methylation level of the target CpG site can be calculated from the intensity of the fluorescence emitted by each fluorescent substance incorporated into the probe.
In the detection of methylation levels by mass spectrometry, for example, bisulfite-treated genomic DNA containing a target CpG site is amplified, followed by transcription into RNA and uracil-specific cleavage by RNase to produce RNA fragments of different lengths according to DNA methylation. The obtained RNA fragments are subjected to mass spectrometry, which allows to separate and detect the methylated fragments and the unmethylated fragments according to the difference in molecular weight. The methylation level of DNA is calculated from the mass ratio between the methylated fragments and the unmethylated fragments. Examples of such mass spectrometry method include MassARRAY (registered trademark) (see Mutat Res, 2005, 573:83-95). The primer for amplifying the DNA containing the target CpG site can be designed using EpiDesigner (manufactured by SEQUENOM, primer design software for MassARRAY) and the like. For the mass spectrometry of the RNA fragments, MALDI-TOF MAS (for example, MassARRAY Analyzer 4 manufactured by SEQUENOM), which can detect the difference in mass of a single base, can be used.
In pyrosequencing, bisulfite-treated genomic DNA containing the target CpG site is amplified. By amplification, uracil in the template DNA is converted to thymine. The amplified DNA is dissociated into single strands, and extension reactions are performed on a region containing the CpG site while adding bases in order of type, and the type of base incorporated is measured based on luminescence intensity. The DNA methylation level is calculated by comparing the intensity of luminescence derived from the methylated cytosine residues (luminescence intensity of cytosine) with the intensity of luminescence derived from the unmethylated cytosine residues (luminescence intensity of thymine).
In methylation-sensitive high resolution melting curve analysis (MS-HRM), the bisulfite-treated genomic DNA containing the target CpG site is amplified in a reaction system containing an intercalator which emits fluorescence when inserted between DNA duplexes. Then, the temperature of the reaction system is changed to detect the change in the intensity of fluorescence from the intercalator. The melting curve of the DNA containing the target CpG site is compared with the melting curve of the methylated/unmethylated controls to calculate the DNA methylation level of the target CpG site.
In the methylation-specific quantitative PCR method, a primer set capable of amplifying when the target CpG site is methylated and a primer set capable of amplifying when the target CpG site is not methylated are used to amplify the bisulfite-treated genomic DNA containing the target CpG site. The DNA methylation level of the target CpG site is calculated by comparing the amounts of the amplification products obtained in each reaction, that is, the amount of methylated CpG site-specific amplification product with the amount of unmethylated CpG site-specific amplification product.
Alternatively, in the methylation-specific quantitative PCR method, oligonucleotide probes capable of hybridizing when the target CpG site is and is not methylated are each prepared. Each probe is labeled with a fluorescent reporter dye and a fluorescent quencher dye different from each other. The probe is hybridized to the bisulfite-treated genomic DNA containing the target CpG site, and then amplified, and the fluorescence emitted by the fluorescent reporter dye due to the degradation of the probe associated with amplification is detected. The DNA methylation level of the target CpG site is calculated by comparing the intensity of the fluorescence emitted by the fluorescent reporter dye specific to the methylated CpG site with the intensity of the fluorescence emitted by the fluorescent reporter dye specific to the unmethylated CpG site. Examples of such a method include a quantitative PCR method using a TaqMan (registered trademark) probe (for example, MethyLight assay).
In bisulfite direct sequencing, a direct sequencing reaction is performed using the bisulfite-treated genomic DNA containing the target CpG site as a template. Then, the DNA methylation level of the target CpG site is calculated by comparing the fluorescence intensity based on the determined base sequence, that is, the luminescence intensity derived from the methylated cytosine residues (luminescence intensity of cytosine) with the luminescence intensity derived from the unmethylated cytosine residues (luminescence intensity of thymine).
In bisulfite cloning sequencing, the bisulfite-treated genomic DNA containing the target CpG site is cloned by PCR reaction or the like, and the base sequences of the plurality of cloning products obtained are each determined. The DNA methylation level of the target CpG site is calculated by comparing the number of cloning products having a methylated CpG site-specific base sequence, with the number of cloning products having a unmethylated CpG site-specific base sequence.
In COBRA, the genomic DNA treated with bisulfite and containing the target CpG site is amplified, and next, the amplified product is treated with a restriction enzyme that recognizes the sites having a different sequence depending on whether the target CpG site is methylated or not, followed by electrophoresis. The DNA methylation level at the target CpG site is calculated by quantifying the band of the restriction enzyme fragment derived from the methylated CpG site and the restriction enzyme fragment derived from the unmethylated CpG site, which have been fractionated by electrophoresis.
In a method using ion exchange chromatography (see WO 2014/136930), first, the genomic DNA treated with bisulfite is fragmented, and fragments containing a target CpG site are amplified by PCR or the like. The chain length of the PCR amplification products can be appropriately selected while considering factors such as shortening the PCR amplification time, and shortening the analysis time and preserving the separation performance in the ion exchange chromatography. For example, the chain length of the PCR amplification products is preferably 500 bp or less, more preferably 300 bp or less, and further preferably 100 bp or less, on the other hand, the lower limit of the chain length of the PCR amplification products is 30 to 40 bp, which is the chain length of the PCR amplification products when using a primer of around 15 mer which can avoid non-specific hybridization in PCR. On the other hand, it is preferable to design the primer to have a rich content of CpG site in the PCR amplification products.
Next, the amplified DNA fragments are subjected to ion exchange chromatography to separate the DNA in which the CpG sites are methylated from the DNA in which the CpG sites are not methylated. The unmethylated cytosine residues of genomic DNA are converted to uracil by bisulfite treatment and then further converted to thymine by PCR. On the other hand, the methylated cytosine residues remain as cytosine even after a bisulfite treatment and PCR. Due to this difference in base, the fragments containing methylated cytosine (methylated fragments) and the unmethylated fragments are detected as separate peaks with different retention times in ion exchange chromatography. That is, the methylated fragments are detected as peaks having a shorter retention time than the unmethylated fragments. Therefore, it is possible to determine whether the DNA at the CpG site is methylated or not based on the retention time of the peak of a detection signal in ion exchange chromatography. Furthermore, when the amplified fragment contains a plurality of CpG sites, the more CpG sites are methylated, the shorter the retention time of the peak is. Therefore, the DNA methylation level of the CpG site can be calculated based on the retention time of the peak. Alternatively, it is also possible to calculate the abundance and abundance ratio of each methylated fragment and unmethylated fragment based on the area or height of the peak.
Preferably, the DNA methylation level of the target CpG site is assessed from the peak of the detection signal of ion exchange chromatography for the amplified fragment, by comparison with a sample (control) having a known DNA methylation level of the target CpG site or by using a calibration curve prepared in advance using a sample having a known DNA methylation level.
Alternatively, a retention time serving as a reference (also referred to as the reference retention time in this description) for separating the retention time of the peak of a methylated fragment having a highly methylated CpG site from the retention time of the peak of a fragment having a low methylation level using a sample having a known DNA methylation level, is determined in advance. For example, a fragment detected at a retention time earlier than the reference retention time is determined to be highly methylated DNA.
The ion exchange chromatography performed in the above method is preferably an anion exchange chromatography. The packing material of the column is not particularly limited as long as it is made of base material particles having a strong cationic group on the surface, but base material particles having both a strong cationic group and a weak cationic group on the packing material surface, as shown in WO 2012/108516, are preferable. More preferably, the base material particles are base material particles containing coated polymer particles in which a layer of a hydrophilic polymer having a strong cationic group (preferably a quaternary ammonium salt) is copolymerized on the surface of hydrophobic crosslinked polymer particles, and a weak cationic group (preferably a tertiary amino group) introduced on the surface of the coated polymer particle. The column temperature in the chromatographic analysis is preferably 30° C. or more and less than 90° C.
The methods that can be suitably used as a “method for detecting the DNA methylation level” in the present invention have been exemplified above, but they are not limited thereto. In the method illustrated above, genomic DNA prepared from hepatocytes or tissue comprising hepatocytes with NASH (hereinafter also referred to as “test cells or tissue”) is subjected to a bisulfite treatment. Therefore, the genomic DNA used for detecting the DNA methylation level of a CpG site in the method of the present invention is preferably bisulfite-treated genomic DNA derived from a test cell or a tissue containing the same.
In the method of the present invention, the risk of developing hepatocellular carcinoma in the test cells or tissue, or a subject from whom these are derived is assessed from the detected DNA methylation level of the target CpG site. A specific index for assessment can be appropriately set by a person skilled in the art according to the method for detecting the DNA methylation level.
An embodiment of the procedure for assessing the risk of developing hepatocellular carcinoma will be described below. In a first embodiment, first, for each DNA methylation level detection method, a receiver operating characteristic (ROC) analysis is performed for each target CpG site to obtain the sensitivity (positive rate) and specificity (negative rate), then the DNA methylation level at which the sum of the sensitivity and the specificity is maximum is set as an index (cutoff value).
In the first embodiment, for the CpG sites shown in Table 2A and the CpG sites located in the vicinity thereof, when the DNA methylation level detected in the method of the present invention is higher than the cutoff value, the DNA methylation level is considered to have exceeded the diagnostic threshold, and the test cells or tissue, or the subject can be assessed as having a risk of developing hepatocellular carcinoma. On the other hand, for the CpG sites shown in Table 2B and the CpG sites located in the vicinity thereof, when the DNA methylation level detected in the method of the present invention is lower than the cutoff value, the DNA methylation level is considered to have exceeded the diagnostic threshold, and the test cells or tissue, or the subject can be assessed as having a risk of developing hepatocellular carcinoma.
In the first embodiment, when the methylation levels of a plurality of CpG sites are detected, the number or ratio of CpG sites of which the DNA methylation level exceeds the diagnostic threshold can be used as an index for assessing the risk of developing hepatocellular carcinoma. For example, test cells or tissue, or a subject can be assessed as having a risk of developing hepatocellular carcinoma when the methylation levels of all the investigated CpG sites exceed the diagnostic threshold. Alternatively, test cells or tissue, or a subject can be assessed as having a risk of developing hepatocellular carcinoma when the methylation levels of a certain percentage or more of the investigated CpG sites exceed the diagnostic threshold. Alternatively, test cells or tissue, or a subject can be assessed as having a risk of developing hepatocellular carcinoma when the methylation levels of a certain number or more of the CpG sites exceed the diagnostic threshold.
In a second embodiment of the procedure for risk assessment, the methylation level of a CpG site is determined, or the risk of developing hepatocellular carcinoma is assessed by performing the method using ion-exchange chromatography described above (see WO 2014/136930, hereafter also referred to simply as chromatography) on the DNA (sample) containing the target CpG site and comparing the retention time of the peak of the obtained detection signal with the retention time for the DNA containing the unmethylated target CpG site (negative control) or the DNA containing the methylated target CpG site (positive control).
In the second embodiment, for the CpG sites shown in Table 2A and the CpG sites located in the vicinity thereof, when a peak with a shorter retention time than that of the negative control is detected from the sample, the sample is assessed as being methylated, and the test cells or tissue, or the subject can be assessed as having a risk of developing hepatocellular carcinoma.
Alternatively, when a peak with a retention time similar to that of the positive control is detected from the sample, the sample is assessed as being methylated, and the test cells or tissue, or the subject can be assessed as having a risk of developing hepatocellular carcinoma. On the other hand, for the CpG sites shown in Table 2B and the CpG sites located in the vicinity thereof, when a peak with a longer retention time than that of the positive control is detected from the sample, the sample is assessed as not being methylated, and the test cells or tissue, or the subject can be assessed as having a risk of developing hepatocellular carcinoma.
Alternatively, when a peak with a retention time similar to that of the negative control is detected from the sample, the sample is assessed as not being methylated, and the test cells or tissue, or the subject can be assessed as having a risk of developing hepatocellular carcinoma.
In the first and second embodiments of the procedure for risk assessment described above, any of the CpG sites shown in Tables 2A and B above and the CpG sites located in the vicinity thereof may be used as the target CpG site used for detecting DNA methylation level. Of these, when used in the second embodiment or the chromatography described above, the target CpG site preferably has the following characteristics: 1) presence of a relatively high number of CpG sites in the vicinity; 2) high degree of separation between the peaks of the methylated and unmethylated controls in ion exchange chromatography analysis; or 3) strong association between methylation level and development of hepatocellular carcinoma. Based on these, the preferable target CpG sites for the second embodiment or the chromatography described above include the six CpG sites at position 130,834,003 on chromosome 10, position 114,256,392 on chromosome 2, position 28,829,182 on chromosome 6, position 144,601,781 on chromosome 8, position 144,601,800 on chromosome 8, and position 35,700,382 on chromosome 6, and the CpG sites located in the vicinity thereof.
Thus, the present invention enables early detection of NASH patients with high risk of developing hepatocellular carcinoma. The present invention provides incentives to seek follow-up consultations to NASH patients of the high-risk group of hepatocarcinogenesis. In addition, if the NASH patients with high risk of developing hepatocellular carcinoma can be discovered early by the present invention, preventive intervention can be performed to prevent the development of carcinoma, or early diagnosis or treatment of the carcinoma can improve their life prognosis.
Therefore, the present invention also relates to the preventive intervention in subjects having a risk of developing hepatocellular carcinoma detected by the method of the present invention. For example, the subjects having a risk of developing hepatocellular carcinoma can be given regular checkups (diagnostic imaging, liver biopsy, etc.), lifestyle guidance, diet therapy, exercise therapy, and the like. This allows the prevention, early diagnosis, or early treatment of hepatocellular carcinoma in NASH patient with the high-risk group.
Hereafter, the present invention is described in detail with examples, but the present invention is not limited to the following examples.
The first step in predicting future development of hepatocellular carcinoma from NASH is to distinguish liver tissue presenting with histological findings of NASH, which is the origin of hepatocellular carcinoma development (NASH associated with hepatocellular carcinoma [NASH-W]) from normal liver tissue [NLT]. Therefore, a genome-wide DNA methylation analysis was first performed using the high-density bead array Infinium™ Human Methylation 450 BeadChip. The Infinium™ Human Methylation 450 BeadChip contains probes for methylation detection, each targeting a different CpG site. The DNAs prepared from 22 NASH-W specimens and 36 NLT specimens were treated with bisulfite and hybridized to the probes on the chip, then fluorescently-labeled bases were incorporated by single-base extension reaction, and the fluorescence signal from the incorporated labels was measured to detect methylation of the CpG sites targeted by the probes. 4,050 probes were identified in which the DNA methylation rate was significantly different between the 22 NASH-W specimens and the 36 NLT specimens, and the difference between the average values was also sufficiently great (Welch-T Test, Bonferroni correction, Δβ>0.1). When a receiver operating characteristic [ROC] curve analysis to distinguish NASH-W from NLT was performed using these 4,050 probes, there were 437 probes with an area under the curve [AUC] greater than 0.95, i.e., with high discriminative power (
However, considering the clinical practice, it is not realistic to compare NASH-W with NLT. In reality, the risk of hepatocarcinogenesis is presumably predicted after a diagnosis of NASH is confirmed by liver biopsy, in order to determine the course of treatment and follow-up. That is, it is important to compare liver tissue presenting with histological findings of NASH but without hepatocellular carcinoma [NASH-O] and NASH-W associated with hepatocellular carcinoma. On the other hand, cases currently analyzed as NASH-O may develop hepatocellular carcinoma in the future if preventive interventions or the like are not implemented. That is, a diagnostic indicator distinguishing NASH-W from NASH-O with 100% sensitivity and specificity is inappropriate.
Therefore, thresholds were set using the Yoden method for all 437 probes with AUCs greater than 0.95 which distinguish NASH-W from NLT, to create tentative predictive criteria. Using these predictive criteria, and considering the probability of developing hepatocellular carcinoma (carcinogenic rate) from NASH, which is said to be 11% from the cirrhosis stage, as well as to be on the safe side since this is a risk prediction for follow-up, the “probes with which less than 15% of the 91 NASH-O specimens having not yet associated with hepatocellular carcinoma are assessed as having a carcinogenic risk” were extracted. Of the 437 probes, there were 24 probes which detected less than 15% of the 91 NASH-O specimens based on the predictive criteria (Table 3,
A multivariate analysis was performed to identify the 22 NASH-W specimens associated with hepatocellular carcinoma from a total of 113 specimens, 91 NASH-O specimens and 22 NASH-W specimens. All 24 marker CpG sites listed in Table 3 were significant in the multivariate analysis. In the same multivariate analysis, no significance was observed for the histopathological findings. This indicates that the DNA methylation levels of each of the marker CpG sites listed in Table 3 can be used to predict the carcinogenic risk from NASH independently of histopathological findings and the like (
To get an overview of the DNA methylation profile of NASH specimens, diagnostic criteria using the DNA methylation levels of the marker CpG sites listed in Table 3 were applied to the 91 NASH-O specimens and 22 NASH-W specimens. The results showed that NASH-W tended to meet the diagnostic criteria for methylation level of more CpG sites compared to NASH-O, while a group of cases showed a DNA methylation status similar to NASH-W which has already been associated with hepatocellular carcinoma at the NASH-O stage which has not yet been associated with hepatocellular carcinoma (
Using a validation cohort (22 NLT specimens and 11 NASH-W specimens), a ROC analysis was performed to distinguish NLT and NASH-W specimens for five of the marker CpG sites listed in Table 3. For all the CpG sites investigated, it was possible to distinguish NLT and NASH-W specimens with an AUC>0.92, a sensitivity >90.9%, and a specificity >86.4% (
NASH-W and NLT specimens were discriminated using methylation analysis by high-performance liquid chromatography (HPLC) (see WO 2014/136930). The marker CpG sites of Table 3 were used as the target CpG sites. The DNA prepared from the specimens was treated with bisulfite and the regions containing the target CpG sites were amplified by PCR. The amplification products were purified and subjected to HPLC. As references, a methylated control (target region DNA with 100% methylated CpG sites) and an unmethylated control (target region DNA with 0% methylated CpG sites) were similarly treated and subjected to HPLC. The methylation rate of specimen-derived DNA was determined from the calibration curve prepared based on the methylated and unmethylated controls. The column used for HPLC and the HPLC conditions are described below.
To 2000 mL of 3 wt % polyvinyl alcohol (manufactured by Nippon Synthetic Chemical Industry Co., Ltd.) aqueous solution in a reactor with a stirrer was added a mixture of 200 g of tetraethylene glycol dimethacrylate (manufactured by Shin-Nakamura Chemical Co., Ltd.), 100 g of triethylene glycol dimethacrylate (manufactured by Shin-Nakamura Chemical Co., Ltd.), 100 g of glycidyl methacrylate (manufactured by Wako Pure Chemical Industries, Ltd.) and 1.0 g of benzoyl peroxide (manufactured by Kishida Chemical Co., Ltd.). The mixture was heated while stirring, and polymerized at 80° C. for 1 hour under a nitrogen atmosphere. Next, as a hydrophilic monomer having a strong cationic group, 100 g of trimethylammonium ethyl methacrylate chloride (manufactured by Wako Pure Chemical Industries, Ltd.) was dissolved in ion-exchanged water. This was added to the same reactor and was similarly polymerized at 80° C. for 2 hours under a nitrogen atmosphere while stirring. The obtained polymerized composition was washed with water and acetone to obtain coated polymer particles having, on their surface, a layer of hydrophilic polymer having quaternary ammonium groups. When the obtained coated polymer particles were measured using a particle size distribution analyzer (AccuSizer 780 manufactured by Particle Sizing Systems), the average particle diameter was 10 μm. 10 g of the obtained coated polymer particles were dispersed in 100 mL of ion-exchanged water to prepare a pre-reaction slurry. Next, 10 mL of N,N-dimethylaminopropylamine (manufactured by Wako Pure Chemical Industries, Ltd.), a reagent having a weak cationic group, was added to this slurry while stirring, and the mixture was reacted at 70° C. for 4 hours. After the reaction, the supernatant was removed using a centrifuge (“Himac CR20G” manufactured by Hitachi, Ltd.) and washed with ion-exchanged water. After washing, the supernatant was removed using a centrifuge. This washing with ion-exchanged water was repeated four more times to obtain a packing material for ion exchange chromatography having quaternary ammonium groups and tertiary amino groups on the surface of the base material particles. The obtained packing material for ion exchange chromatography was packed into a stainless steel column (column size: inner diameter 4.6 mm×length 150 mm) of a liquid chromatography system.
System: LC-20A series (manufactured by Shimadzu Corporation)
Column: anion exchange column (as prepared above)
Eluent: Eluent A 25 mM MES-NaOH (pH6.0)
Analysis time: 15 min
Elution method: the mixing ratio of eluent B was linearly increased according to the following gradient conditions:
Flow rate: 1.0 mL/min
Detection wavelength: 260 nm
Sample injection amount: 5 μL
Column temperature: 70° C.
Examples of peak patterns detected from 3 NASH-W specimens and 3 NLT specimens for one CpG site are shown in
For the marker CpG sites listed in Table 3, the following items 1) to 6) were evaluated and the CpG sites which exceeded the standard values were extracted. The CpG sites which met four or more of the six standards were extracted.
The content of CpG sequences in the sequences to be amplified in the PCR performed as part of the HPLC method in Example 5 was calculated. For example, if the sequence to be amplified by PCR is 276 bp and the number of CpG sequences contained in the sequence is 10, 100×(10÷276)=3.62%. The standard value was set at 5%.
For each marker CpG site in the clinical specimens (30 NLT specimens and 20 NASH-W specimens, for a total of 50 specimens), the elution times and peak widths at half height for the methylated control peak and unmethylated control peak obtained by HPLC method according to the procedure in Example 5 were detected. These were applied to the following equation to calculate the degree of separation. The standard value was set at 0.8.
Degree of Separation=1.18×(elution time of unmethylated control peak−elution time of methylated control peak)/(peak width at half height of unmethylated control+peak width at half height of methylated control)
For each marker CpG site in the same clinical specimen, the correlation between the methylation rate determined by HPLC method according to the procedure in Example 5 and the methylation rate analyzed by the Infinium™ method was confirmed, and the correlation coefficient (R2) was calculated. The standard value was set at R2>0.45.
In a same clinical specimen, the methylation rate of each marker CpG site determined by HPLC method according to the procedure in Example 5 was used to assess NLT/NASH-W based on the cutoff values, and the concordance rate of the assessment with the clinical information was calculated. The standard value was set at 80%.
Concordance rate of assessment=100×(number of NLT assessed as NLT+number of NASH-W assessed as NASH-W)/total number of specimens evaluated
In a same clinical specimen, the methylation rate of each marker CpG site determined by HPLC method according to the procedure in Example 5 was used to assess NLT/NASH—W based on the cutoff values, and the percentage of NASH-W specimens assessed as NASH-W (sensitivity) was calculated. The standard value was set at 80%.
Sensitivity=100×number of NASH-W specimens assessed as NASH-W/NASH-W specimens
In a same clinical specimen, the methylation rate of each marker CpG site determined by HPLC method according to the procedure in Example 5 was used to assess NLT/NASH-W with cutoff values, and the percentage of NLT specimens assessed as NLT (specificity) was calculated. The standard value was set at 80%.
Specificity=100×number of NLT specimens assessed as NLT/NLT specimens
Six of the 24 CpG sites listed in Table 3 (probe IDs: cg09580822, cg14950303, cg15050398, cg18210511, cg09580859 and cg13719443, see Table 3) met four or more of the above criteria 1) to 6). These six CpG sites were found to be preferable markers for predicting the carcinogenic risk from NASH. These six CpG sites are also useful as markers for predicting the carcinogenic risk using methylation analysis by HPLC method for the following reasons: presence of a relatively high number of CpG sites in the vicinity; high degree of separation between the peaks of the methylated and unmethylated controls in ion exchange chromatography analysis; or strong association between methylation level and development of hepatocellular carcinoma.
The results of the above examples show that the CpG sites listed in Table 3 are very useful as indicators for predicting the risk of developing hepatocellular carcinoma from NASH. The CpG sites listed in Table 3 are believed to be sufficiently applicable in clinical testing as markers for assessing the risk of developing hepatocellular carcinoma from NASH, for the realization of preemptive and personalized medicine from an epigenomic perspective.
Number | Date | Country | Kind |
---|---|---|---|
2019-222285 | Dec 2019 | JP | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2020/045879 | 12/9/2020 | WO |