The invention relates to specific marker proteins (biomarkers) for Hepatocellular carcinoma (HCC). The invention relates to a method for the diagnostic study of biological samples of a human for Hepatocellular carcinoma, the sample being studied for one or more proteins as a marker for Hepatocellular carcinoma, a concentration of the proteins which is elevated or decreased in relation to the healthy state indicating the presence of Hepatocellular carcinoma, a diagnostic test kit and a method of screening compounds effective in HCC.
Hepatocellular carcinoma (HCC) currently is the fifth most common malignancy worldwide with an annual incidence up to 500 per 100000 individuals depending on the geographic region investigated. Whereas 80% of new cases occur in developing countries, the incidence increases in industrialized nations including Western Europe, Japan and the United States (El-Serag H B, N. Engl. J. Med. 1999; 340:745-750). To manage patients with HCC, tumor markers are very important tools for diagnosis, evaluation of disease progression, outcome prediction and evaluation of treatment efficacy. Several tumor markers have been reported for HCC, which include α-fetoprotein (AFP) (Di Bisceglie A M J Hepatol 2005), Lens culinaris agglutinin-reactive fraction of AFP (AFP-L3) (OKA H, J Gastroenteroll Hepatol 2001), and des-γ-carboxy prothrombin (DCP) (Liebman H A N Engl J Med 1984). However, none of these tumor markers show 100% sensitivity or specificity, which calls for new and better biomarkers.
In order to identify novel biomarkers of HCC, many clinical studies utilizing omics-based methods have been reported over the past decade. In particular, the proteomics-based approach has turned out to be a promising one, offering several quantification techniques to reveal differences in protein expression that are caused by a particular disease. In the most studies reported in literature, the well-established 2D-DIGE (two-dimensional difference in gel electrophoresis) technique has been applied for protein quantification followed by identification via mass spectrometry. Even if the quantification is very accurate and sensitive in this gel-based approach, the relatively high amount of protein sample necessary for protein identification is the major disadvantage of this technique. Several mass-spectrometry-based quantitative studies using labelling-techniques like SILAC (stable isotope labelling by amino acids in cell culture) or iTRAQ (isobaric tag for relative and absolute quantification) have been carried out as well for biomarker discovery of HCC. Here, the concomitant protein quantification and identification in a mass spectrometer allows high-throughput analyses. However, such experiments imply additional labelling reactions (in case of iTRAQ) or are limited to tissue culture systems (in case of SILAC). In the latter case, one can overcome the limitation by using the isotope-labelled proteins obtained from tissue culture as an internal standard added to a corresponding tissue sample. This approach is known as CDIT (culture-derived isotope tags) and was applied in a HCC study, very recently. Label-free proteomics based on quantification by ion-intensities or spectral counting offer another possibility for biomarker discovery. These approaches are cheap due to the lacking need of any labelling reagents and furthermore allow high-throughput and sensitive analyses in a mass spectrometer. A quantitative study of HCC using spectral counting has been reported, whereas an ion-intensity-based study has not been performed yet. Apart from these quantification strategies, protein alterations in HCC have been studied by MALDI imaging.
Proceeding from the described prior art, the object therefore presents itself of providing an improved method for studying biological samples for HCC, in which novel markers are used.
The object is achieved according to the invention by a method for studying biological samples of a human for HCC the sample being studied for one or more proteins as a marker for HCC, and an elevated level of the proteins indicating the presence of HCC, the proteins being selected from a group comprising proteins defined by SEQ ID No. 1 to 983 according to the enclosed sequence listening, isoforms of the proteins defined by SEQ ID No. 1 to 983, homologous of the proteins defined by SEQ ID NO. 1 to 983 and partial sequences of SEQ ID No. 1 to 983.
The present invention relates to a quantitative proteomic study characterized in a combination of two different techniques, namely the well-established 2D-DIGE (two-dimensional difference in gel electrophoresis) and a label-free ion-intensity-based quantification via mass spectrometry and liquid chromatography to identify HCC specific biomarkers. This is the first time such a combined study was performed with regard to hepatocellular carcinoma. By comparing the results of both studies high-confident biomarker candidates of HCC could be identified and 983 proteins were confirmed as specific biomarkers for HCC. Furthermore, the comparison demonstrates the complementarity of the gel- and LC-MS-based techniques. To verify the differential protein expressions detected in the proteomic studies underlying the present invention additional immunological validations of the identified specific biomarkers for HCC were performed.
The invention relates to a method for identifying biomarkers specific for a particular disease comprising the steps
In one embodiment of the method the gel-based approach is SDS-Polyacrylamide gel electrophoresis, preferably 2D-DIGE.
In one embodiment of the method the LC-MS-based approach is a LC-MS-based label-free ion-intensity-based quantification, preferably MALDI, for example MALDI-TOF-MS or nan-HPLC-ESI-MS/MS.
In a preferred embodiment the invention relates to a method, wherein the gel-based approach is 2D-DIGE and wherein the LC-MS-based approach is MALDI, preferably MALDI-TOF-MS or nan-HPLC-ESI-MS/MS.
The present invention further relates to the use of the method for identifying biomarkers specific for a particular disease, to determine if a person has this particular disease, preferably to determine, if the person has HCC. In another preferred embodiment the present invention relates to a method, wherein the particular disease is hepatocellular carcinoma (HCC).
In one embodiment of the method the differential expression of the particular protein, the specific biomarker for HCC, is determined by comparing the amount of this protein in a biological sample of a person without the disease with the amount of this protein in a person with the disease.
In another preferred aspect the present invention relates to a biomarker for HCC identified by the method and selected from the proteins defined by SEQ ID No. 1 to 983, the respective homologes of SEQ ID No. 1 to 983 with at least 95% identity in amino acid sequence, the respective isoforms of proteins defined by SEQ ID No. 1 to 983, the respective partial sequences of SEQ ID No. 1 to 983.
In one embodiment the invention relates to a biomarker for HCC, characterized in that the biomarker is selected from PPA1, IGHG1, IGHV4-31, SERPINA1, VIM, LMNA, KRT18, GAPDH, PKM2, HSPA9, HSPA5, TRAP1, ACO2, HSPA8, CCT5, ECH1, SOD1, CA2, QDPR, AGXT, SORD, GLUD1, CPS1, ALDH6A1, GRHPR, UGP2, ALDH2, ECHS1, AKR1C4, ALDH1A1, MPST, ASS1, ACADS, ALDOB, ACAADSB, KHK, SARDH, FTCD, CES1, BDH1, PBLD, FBP1, BHMT, GNMT, ALB, PPIA, MTHFD1, ACAT1, PCK2, GATM, ADH1B, ADH4, Elongation factor 2 (eEF2), Elongation factor 2 kinase, Isoform of 14-3-3 Protein Sigma, Serine/Threonine Kinase 3, Serine/Threonine Kinase 4, Serine/Threonine Kinase 31.
The invention relates to the use of one or more proteins selected from the proteins defined by SEQ ID No. 1 to 983, the respective homologes of SEQ ID No. 1 to 983 with at least 95% identity in amino acid sequence, the respective isoforms of proteins defined by SEQ ID No. 1 to 983, the respective partial sequences of SEQ ID No. 1 to 983 as biomarker(s) for hepatocellular carcinoma (HCC).
In one embodiment the invention relates to the use of one or more proteins, the specific biomarkers for HCC, wherein the protein(s) is/are selected from PPA1, IGHG1, IGHV4-31, SERPINA1, VIM, LMNA, KRT18, GAPDH, PKM2, HSPA9, HSPA5, TRAP1, ACO2, HSPA8, CCT5, ECH1, SOD1, CA2, QDPR, AGXT, SORD, GLUD1, CPS1, ALDH6A1, GRHPR, UGP2, ALDH2, ECHS1, AKR1C4, ALDH1A1, MPST, ASS1, ACADS, ALDOB, ACAADSB, KHK, SARDH, FTCD, CES1, BDH1, PBLD, FBP1, BHMT, GNMT, ALB, PPIA, MTHFD1, ACAT1, PCK2, GATM, ADH1B, ADH4, Elongation factor 2 (eEF2), Elongation factor 2 kinase, Isoform of 14-3-3 Protein Sigma, Serine/Threonine Kinase 3, Serine/Threonine Kinase 4, Serine/Threonine Kinase 31 and the respective isoforms, homologous and partial sequences of these proteins as biomarker(s) for hepatocellular carcinoma (HCC).
In another embodiment the invention relates to the use of one or more proteins, the specific biomarkers for HCC, for differential diagnosis, in particular for early recognition, diagnosis, evaluation of disease progression, prediction of outcome, evaluation of treatment, surveillance of treatment of HCC.
The present invention further relates to a method for studying a biological sample for HCC, wherein the samples is studied for one or more biomarker(s) for HCC wherein the biomarker(s) is/are differentially expressed in relation to the healthy state indicating the presence of HCC, characterized in that the biomarker(s) is/are selected from the group comprising proteins defined by SEQ ID No. 1 to 983, the respective isoforms of the proteins defined by SEQ ID. No. 1 to 983, the respective homologues of SEQ ID No. 1 to 983 with at least 95% identity in amino acid sequence, the respective partial sequences of SEQ ID No. 1 to 983.
In one embodiment of the invention the method for studying a biological sample for HCC is characterized in that the biomarker(s) is/are selected from the group comprising proteins PPA1, IGHG1, IGHV4-31, SERPINA1, VIM, LMNA, KRT18, GAPDH, PKM2, HSPA9, HSPA5, TRAP1, ACO2, HSPA8, CCT5, ECH1, SOD1, CA2, QDPR, AGXT, SORD, GLUD1, CPS1, ALDH6A1, GRHPR, UGP2, ALDH2, ECHS1, AKR1C4, ALDH1A1, MPST, ASS1, ACADS, ALDOB, ACAADSB, KHK, SARDH, FTCD, CES1, BDH1, PBLD, FBP1, BHMT, GNMT, ALB, PPIA, MTHFD1, ACAT1, PCK2, GATM, ADH1B, ADH4, Elongation factor 2 (eEF2), Elongation factor 2 kinase, Isoform of 14-3-3 Protein Sigma, Serine/Threonine Kinase 3, Serine/Threonine Kinase 4, Serine/Threonine Kinase 31 and the respective isoforms, homologous and partial sequences of these proteins.
In one embodiment of the invention the method for studying a biological sample for HCC is characterized in that the sample is a human sample.
In one embodiment of the invention the method for studying a biological sample for HCC is characterized in that the sample is blood serum, blood plasma, whole blood, a biopsy sample, in particular a liver biopsy sample.
The present invention further relates to a diagnostic device or test kit for analysing the amount of at least one biomarker selected from the group comprising proteins defined by SEQ ID No. 1 to 983, preferably proteins PPA1, IGHG1, IGHV4-31, SERPINA1, VIM, LMNA, KRT18, GAPDH, PKM2, HSPA9, HSPA5, TRAP1, ACO2, HSPA8, CCT5, ECH1, SOD1, CA2, QDPR, AGXT, SORD, GLUD1, CPS1, ALDH6A1, GRHPR, UGP2, ALDH2, ECHS1, AKR1C4, ALDH1A1, MPST, ASS1, ACADS, ALDOB, ACAADSB, KHK, SARDH, FTCD, CES1, BDH1, PBLD, FBP1, BHMT, GNMT, ALB, PPIA, MTHFD1, ACAT1, PCK2, GATM, ADH1B, ADH4, Elongation factor 2 (eEF2), Elongation factor 2 kinase, Isoform of 14-3-3 Protein Sigma, Serine/Threonine Kinase 3, Serine/Threonine Kinase 4, Serine/Threonine Kinase 31 and the respective isoforms, the respective homologues with at least 95% identity in amino acid sequence, the respective partial sequences, and wherein the diagnostic device or test kit comprises detection reagents and further aids.
In one embodiment of the invention the diagnostic device or the test kit comprises a detection reagent that comprises an antibody specific for the respective biomarker.
The invention also relates to the above described uses, characterized in that at least two of the named biomarkers are used together, either simultaneously or sequentially.
The present invention further relates to the use of a method for identifying HCC specific biomarkers in a sample and wherein the HCC specific biomarkers are defined by SEQ ID No. 1 to 983, preferably proteins PPA1, IGHG1, IGHV4-31, SERPINA1, VIM, LMNA, KRT18, GAPDH, PKM2, HSPA9, HSPA5, TRAP1, ACO2, HSPA8, CCT5, ECH1, SOD1, CA2, QDPR, AGXT, SORD, GLUD1, CPS1, ALDH6A1, GRHPR, UGP2, ALDH2, ECHS1, AKR1C4, ALDH1A1, MPST, ASS1, ACADS, ALDOB, ACAADSB, KHK, SARDH, FTCD, CES1, BDH1, PBLD, FBP1, BHMT, GNMT, ALB, PPIA, MTHFD1, ACAT1, PCK2, GATM, ADH1B, ADH4, Elongation factor 2 (eEF2), Elongation factor 2 kinase, Isoform of 14-3-3 Protein Sigma, Serine/Threonine Kinase 3, Serine/Threonine Kinase 4, Serine/Threonine Kinase 31 and the respective isoforms, the respective homologues with at least 95% identity in amino acid sequence, the respective partial sequences.
The present invention further relates to the use of specific biomarkers for HCC selected from the group of specific biomarkers comprising the proteins defined by SEQ ID No. 1 to 983, preferably PPA1, IGHG1, IGHV4-31, SERPINA1, VIM, LMNA, KRT18, GAPDH, PKM2, HSPA9, HSPA5, TRAP1, ACO2, HSPA8, CCT5, ECH1, SOD1, CA2, QDPR, AGXT, SORD, GLUD1, CPS1, ALDH6A1, GRHPR, UGP2, ALDH2, ECHS1, AKR1C4, ALDH1A1, MPST, ASS1, ACADS, ALDOB, ACAADSB, KHK, SARDH, FTCD, CES1, BDH1, PBLD, FBP1, BHMT, GNMT, ALB, PPIA, MTHFD1, ACAT1, PCK2, GATM, ADH1B, ADH4, Elongation factor 2 (eEF2), Elongation factor 2 kinase, Isoform of 14-3-3 Protein Sigma, Serine/Threonine Kinase 3, Serine/Threonine Kinase 4, Serine/Threonine Kinase 31 the respective homologues with at least 95% identity in amino acid sequence, the respective isoforms, the respective partial sequences for screening pharmaceutical compounds for HCC.
The present invention further relates to a screening assay for the identification and validation of pharmaceutical compounds comprising one or more of the proteins selected from the group comprising the proteins defined by SEQ ID No. 1 to 983, preferably proteins PPA1, IGHG1, IGHV4-31, SERPINA1, VIM, LMNA, KRT18, GAPDH, PKM2, HSPA9, HSPA5, TRAP1, ACO2, HSPA8, CCT5, ECH1, SOD1, CA2, QDPR, AGXT, SORD, GLUD1, CPS1, ALDH6A1, GRHPR, UGP2, ALDH2, ECHS1, AKR1C4, ALDH1A1, MPST, ASS1, ACADS, ALDOB, ACAADSB, KHK, SARDH, FTCD, CES1, BDH1, PBLD, FBP1, BHMT, GNMT, ALB, PPIA, MTHFD1, ACAT1, PCK2, GATM, ADH1B, ADH4, Elongation factor 2 (eEF2), Elongation factor 2 kinase, Isoform of 14-3-3 Protein Sigma, Serine/Threonine Kinase 3, Serine/Threonine Kinase 4, Serine/Threonine Kinase 31 and the respective isoforms, the respective homologues with at least 95% identity in amino acid sequence, the respective partial sequences, and wherein the screening assay comprises detection reagents and further aids.
In the context of this invention, the term HCC comprises any form of Hepatocellular carcinoma (HCC). The terms are for example defined in Pschyrembel, Klinisches Wörterbuch [Clinical Dictionary], 263th edition, 2012, Berlin).
“Specific biomarkers for HCC”, “specific biomarkers” in the context of the invention are the proteins defined by SEQ ID No. 1 to 983 according to the sequence listening. Preferred biomarkers are the proteins listed in table 3. Specific biomarkers are also the respective isoforms, humongous and partial sequences of theses proteins. According to the invention also the nucleic acids e.g. RNA, DNA, cDNA encoding for the specific biomarkers are enclosed. Instead of the respective proteins or amino acids the respective nucleic acids encoding for these biomarkers could be used for early recognition, diagnosis, evaluation of disease progression, surveillance of treatment, or after treatment. In preferred embodiments of the invention the specific biomarker for HCC is a protein or peptide, e.g. one of the proteins SEQ ID No. 1-983, one of the proteins listed in Table 3, one of the proteins listed in Table 4 or a nucleic acid that encodes for one of those proteins.
An “Isoform” of the respective protein, the specific biomarker, is any of several different forms of the same protein. Different forms of a protein may be produced from related genes, or may arise from the same gene by alternative splicing. A large number of isoforms are caused by single-nucleotide-polymorphisms or SNPs, small genetic differences between alleles of the same gene. These occur at specific individual nucleotide positions within a gene. Isoforms comprise also proteins with the same or similar amino acid sequence but different post-translational modification, like glycosylation. A glycoform is an isoform of a protein that differs only with respect to the number or type of attached glycan. Glycoproteins often consist of a number of different glycoforms, with alterations in the attached saccharide or oligosaccharide.
A “Homologue” of the respective protein, the specific biomarker, is defined in terms of shared ancestry. Two segments of DNA can have shared ancestry because of either a speciation event (orthologs) or a duplication event (paralogs). The term “percent homology” and “sequence similarity” are used interchangeably. High sequence similarity might occur because of convergent evolution or because of chance. Such sequences are similar and are also included in the term according to the invention. Sequence regions that are homologous are also called conserved. Enclosed are also partial homology where a fraction of the sequences compared (are presumed to) share descent, while the rest does not. Many algorithms exist to cluster protein sequences into sequence families, which are sets of mutually homologous sequences, see for example databses HOVERGEN, HOMOLENS, HOGENOM. According to the invention homologues should display at least 80% or 90% or 95% identify in amino acid sequence, preferably 96% or 97%, most preferably 98% or 99% with one of the sequences SEQ ID NO. 1 to 983.
“Partial Sequences” according to the invention have for example at least 50% or 60%, preferably at least 70% or 80%, most preferred at least 90% or 95% of the amino acid sequence of SEQ ID No. 1 to 983.
The specific biomarkers for HCC may be identified as potential biomarkers during a proteome analysis of HCC in comparison to non-HCC tissue. For this purpose, liver biopsy samples were taken from patients having HCC.
The proteins were labelled using a pigment and subjected to a 2-D polyacrylamide gel electrophoresis using isoelectric focusing in the first dimension and SDS gel electrophoresis in the second dimension. The results were compared for HCC and non-HCC cells with the aid of software suitable for this purpose, to detect and quantify the spots which were amplified or decreased in the HCC sample in comparison to the non-HCC sample. The emission of the pigments, with which the proteins were labelled, was measured and analyzed.
“Difference gel electrophoresis” (DIGE) is a form of gel elektrophoresis where different protein samples can be labelled with fluorescent dyes (for example Cy3, Cy5, Cy2) prior to two-dimensional electrophoresis. Then, the labelled protein samples are mixed and put in the same gel. After the gel electrophoresis, the gel is scanned with the excitation wavelength of each dye one after the other, so each sample is analyzed separately. This technique is used to see changes in protein abundance like for example, between a sample of a healthy person and a sample of a person with HCC.
It overcomes limitations in traditional 2D electrophoresis that are due to inter-gel variation. This can be considerable even with identical samples. Since the proteins from the different sample types, e.g. healthy/diseased, virulent/non-virulent, are run on the same gel they can be directly compared. To do this with traditional 2D electrophoresis requires large numbers of time consuming repeats.
To identify novel biomarker candidates of hepatocellular carcinoma a study was performed that combines two complementary techniques of quantitative proteomics, namely the gel-based 2D-DIGE and the label-free LC-MS-based approaches. Following a straightforward workflow (
In the gel-based approach, a total of 1366 protein spots, represented in at least 70% of all investigated spot maps, were detected. Of these, only protein spots showing significant expression changes between healthy and malignant tissue specimens (p≦0.05 and 1.5-fold change of expression) have been isolated and analyzed. By the means of MALDI-MS and nano-LC-ESI-MS/MS analyses 240 proteins (148 non-redundant proteins) have been successfully identified. Among these, 55 proteins were found to be up- and 83 proteins down-regulated in HCC tumour tissue. Ten proteins showed variable regulation directions within several detected isoforms.
In the label-free approach, 31673 features comprising charges of 2+ or 3+ were detected. Significant differences in abundance between the two experimental groups were observed for 3507 of these features. Of these, 1038 regulated features have been assigned to peptide matches by the acquired tandem mass spectra. These identifications resulted in 476 significantly regulated proteins of which 284 were found to be up-regulated in tumour tissue and 194 down-regulated, respectively.
In summary, a total of 573 differentially expressed proteins were found, whereas 97 proteins were exclusively identified in the 2D-DIGE study and 425 proteins in the LC-MS study, respectively. Hence, only 57 differential proteins were identified irrespective of the applied quantification technique, which clearly shows that both approaches are complementary (Table 3). Except of eight proteins, the regulation directions of the proteins identified in both studies were equal. In four of the eight cases of inconsistent regulations, the protein expression already varies between several isoforms detected in the gel-based approach.
An analysis of the protein localizations revealed, that by using a gel-based approach mainly cytoplasmic proteins were detected, whereas the proteins detected in label-free approach widespread over a broader range of cellular localizations, in particular the plasma membrane (
The said “IPI accession” or “Uniprot Accession” of HCC specific biomarkers refers to Table 4 and correlated SEQ ID No.
In order to verify the observed complementarities of the applied techniques and to identify biomarker candidates of HCC, for further validations several regulated proteins that were identified either in the 2D-DIGE study, the label-free study or the overlap of both were chosen. From the proteins exclusively identified in the gel-based 2D-DIGE approach the chloride intracellular channel protein 1 (CLIC1) was chosen, comprising a 2.5-fold over-expression in tumour tissue. From the complement of the label-free LC-MS based approach the major vault protein (MVP), which showed a 5.4-fold over-expression based on quantification with six unique peptides, as well as gelsolin (GSN) with a 2.8-fold higher expression (quantified with three unique peptides) was selected. The first regulated protein was chosen from the overlap of both studies is the tumour necrosis factor receptor-associated protein 1 (TRAP1), also known as heat shock protein 75 (HSP75). For this protein, fold changes of 3.0 and 2.2 were observed in the gel- and LC-MS-based approaches, respectively. As a second candidate from this group we selected inorganic pyrophosphatase 1 (PPA1), which was detected with fold changes of 2.0 in the 2D-DIGE experiment and 5.9 in the label-free approach. As an example for a biomarker candidate down-regulated in healthy tissue in comparison to HCC-tumour tissue betaine-homocysteine S-methyltransferase 1 (BHMT) was chosen for further validation. BHMT was found to be down-regulated in both studies with fold changes ranging from −3.0 to −3.7 in the gel-based approach and −5.6 in the label-free study (
Biomarker candidates were investigated by western blot analysis of HCC-tissue (n=8) and healthy tissue (n=8), respectively. Analysis showed differential expression of all candidates in tumorous tissue in comparison to healthy tissue. MVP showed strong expression in six of eight tumour-samples whereas weak or no expression was observed in healthy tissue. Gelsolin was found with general high expression levels in HCC-tissue and only weak expression in healthy tissue. For CLIC1 enhanced expression levels were observed in all tumour samples. TRAP1 and PPA1 also showed higher expression levels in four of eight and five of eight HCC-tissue samples, respectively. For BHMT only little expression was detected in HCC-tissue in comparison to strong expression in all samples of healthy tissue (
In addition to the western blot analysis immunohistochemical stainings of CLIC1, MVP, TRAP1 and PPA1 were done to validate these potential markers using an additional method. The normal liver showed CLIC1 positive non-hepatocytes but the hepatocytes were completely negative. In HCC the tumour cells displayed a strong positive signal in the cytoplasm and in the nuclei. In addition, the stroma cells were also positive for CLIC1. The antibody against MVP showed a immunoreactive signal in the cytoplasm of HCC cells but was negative in normal hepatocytes. TRAP1 was located in the cytoplasm of HCC cells but was negative in the non-tumour liver tissue. Using the antibody against pyrophosphatase 1 the tumour cells were slightly positive in the cytoplasm while the non-tumour liver cells were negative (data not shown).
In order to identify confident biomarker candidates of HCC and to elucidate the complementarities of the gel-based and LC-MS based quantification methods, the protein lists obtained in both studies were compared. Here, we observed a small overlap of only 57 proteins identified in both studies. This clearly shows the benefit of using different techniques in combination, which leads not only to an increased number of regulated proteins, that might act as disease markers or drug targets, but moreover makes candidates identified in both studies more confident. The latter assumption is clearly corroborated by the fact that the overlap includes several proteins that have already been associated to hepatocellular carcinoma and whose disease-related dysregulation has already been reported in numerous independent studies. However, the overlap also includes several proteins that were not associated to HCC earlier (e.g. TRAP1) and that are therefore new biomarkers of HCC.
In some cases, the comparison of protein regulations showed different results in the label-free and gel-based approach, respectively. However, in at least four of eight cases, this result is definitely caused by the detection of several up- or down-regulated isoforms of the same protein in the 2D-DIGE experiment. In such cases the regulations determined by the label-free bottom-up approach seem to be more reliable regarding the overall expression change of a protein. For example, the over-expressions of alcohol dehydrogenase 4 (ADH4) or peptidylprolyl isomerase A (PPIA) in HCC tissue specimens, as observed in the label-free approach, are in line with previously published data, whereas inconclusive results were obtained in the 2D-DIGE study.
In the current study an up-regulation of TRAP1 in hepatocellular carcinoma was found. TRAP1 is a member of the HSP90 family of molecular chaperones, which consists of three other major homologues, namely HSP90α, HSP90β and 94 kDa glucose-regulated protein (GRP94). In the present study, each of the four HSP90 homologues was found to be significantly over-expressed in cause of hepatocarcinogenesis, whereas only TRAP1 was identified irrespective of the applied quantification technique. For the homologues HSP90α, HSP90β and GRP94 the observed up-regulation has already been reported regarding several carcinoma types including HCC. However, the mitochondrial TRAP1 has not yet been investigated to such an extent. TRAP1 is involved in processes like drug resistance, cell survival, stress response, mitochondrial homeostatis and protein folding. Earlier, it was found to be over-expressed in colorectal (Landriscina, Cancer Lett., 2009) and nasopharyngeal carcinoma (Wang, Transl Med, 2008) as well as cisplatin-resistant ovarian cancer cells (Alvero, Cell Cycle, 2009; Esposito, Gynecologic oncology, 2010). In the prior case, the involvement of TRAP1 in drug-resistance was additionally studied by inhibiting TRAP1 activity with shepherdin (Landriscina, Cancer Lett., 2009) resulting in higher drug sensitivity. Hence, TRAP1 is not only a promising tumour marker candidate, for e.g. HCC, but moreover a potential drug target for improved cancer therapies, for e.g. HCC.
It was found that MVP is strongly up-regulated in hepatocellular carcinoma. The relatively large variance of expression levels observed in the label-free study and by western blotting is in line with previous observations and is most likely caused by an interindividual heterogeneity of MVP expression in liver tissue. Earlier, MVP has been found to be over-expressed in several human cancers such as pancreatic, breast, ovarian, urinary bladder carcinomas, melanomas, sarcomas and leukemias. However, in case of liver carcinomas a variable expression has been reported. MVP is the main constituent of the so called vaults, which are ribonucleoprotein particles with masses of approximately 13 MDa (Reference). Initially, vaults were supposed to be directly involved in the multidrug resistance of malignant tumours due to regulation of nuclear drug transport mechanisms. However, experiments with murine MVP knockout models showed no altered nuclear transport and chemoresistance. Recent observations suggest that vaults may be indirectly involved in drug resistance by modulation of cellular growth and survival signals. Here, interaction partners of MVP in the PI3K and MAPK pathway have been identified, suggesting that MVP might act as a regulatory protein in these signalling processes. More recently, MVP has been found to be involved in resistance to epidermal growth factor inhibition of several HCC-derived cell lines.
In the gel-based approach, chloride intracellular channel protein 1 (CLIC1) was found to be up-regulated in HCC tumour tissue. Members of CLIC protein family are widely expressed and involved in a variety of cellular processes like apoptosis, cell division or secretion. An HCC-related up-regulation of CLIC1 has already been reported in a proteomic study of hepatocellular carcinoma developed in patients with chronic hepatitis C infection as an underlying disease. Earlier, transcriptomics data were published that also revealed an over-expression of CLIC1 related to HCC which is in agreement with the present data. Within the patient cohort investigated in the study according to the invention, none of the patients had hepatitis B or C infections. Hence, the over-expression of CLIC1 in HCC seems to be irrespective of the underlying disease.
The ubiquitous, Ca2+-regulated actin-binding protein gelsolin (GSN) was also found to be over-expressed in tumorous tissue compared to adjacent healthy tissue. The protein exists in two major isoforms, namely the intracellular cytoplasmic one (cGSN) and a secreted form, also known as plasma gelsolin (pGSN). The three regulated peptides detected in the label-free approach are shared between those forms which makes a clear decision between both forms impossible at this point. Dysregulation of gelsolin in cause of several malignancies has been reported in numerous studies. In a high number of cancer types, including human breast, colorectal, gastric, bladder, lung, prostata, kidney, ovarian, pancreatic or oral cancers, gelsolin was down-regulated leading to the assumption that gelsolin might act as a tumour suppressor. However, in a subset of non-small cell lung cancers gelsolin was over-expressed. Furthermore, increased gelsolin levels have been associated to tumour recurrence and progression in urothelial tumours. The results from the label-free study and western blots according to the present invention show that GSN is also strongly up-regulated in HCC as well.
Inorganic pyrophosphatase (PPA1) was identified as a regulated protein in the label-free and 2D-DIGE approach. It catalyzes the hydrolysis of pyrophosphate to orthophosphate and is ubiquitously expressed. It has been shown to be differentially expressed in various types of cancer including enhanced expression in primary colorectal cancer (Tomonaga et al., 2004, Clin. Canc. Res.), lung adenocarcinoma (Chen et al., 2002, Clin. Canc. Res) and prostate cancer (Lexander H, 2005, Anal. Quant. Cytol. Histol.) and has also been shown to be expressed in a hepatocellular carcinoma cell line (Liang et al., 2002, J. of Chromatography B). However, in a proteomic pilot study of HCC in which tissue samples of only three patients have been analyzed using 2D gel electrophoresis, PPA1 has been found to be down-regulated (Matos et al., 2009, Journal of Surgical Research). In the present study, it was demonstrated with a larger cohort and two different quantification methods that PPA1 is significantly up-regulated in HCC. Furthermore, this result was validated using immunological methods. Thus PPA1 is also a diagnostic marker for HCC.
A strong decrease of BHMT expression in HCC tumour tissue has already been shown in gel-based proteomic studies (Liang et al., 2005, Proteomics; Sun et al., 2007, MCP) as well as on transcript level (Avila et al., 2000, J. of Hepatology). Very recently, the transcription of an aberrant splicing variant has been described as mechanism leading to decreased BHMT levels in HCC (Pellanda et al., 2012, Int. J. of Biochem. & Cell Biol.). BHMT is involved in homocysteine metabolism where it catalyzes the synthesis of methionine from betaine and homocysteine. Loss of BHMT function therefore leads to impaired hemostasis of 1-carbon metabolism and is directly associated with various diseases including hepatocellular carcinogenesis (Teng et al., 2011, JBC). In the present study the decreased expression of BHMT in HCC was confirmed for the first time using a label-free quantification method. BHMT expression was furthermore validated using western blot analysis as an example for a biomarker candidate down-regulated in HCC tumour tissue.
The following examples and figures are used to explain the invention without restricting the invention to the examples.
Tissue from hepatocellular carcinoma and non-tumour liver was collected from eight patients (four males and four females). The age of the patients ranged from 21 years to 76 years (mean 56.5). The tumours were classified according to the pathologic TNM (pTNM) system (seventh edition) (Sobin L H, Gospodarowicz M K, Wittekind C (2009) International union against cancer. TNM classification of malignant tumours, 7th edn. Wiley, New-York). All tumours except of one were classified as pT1, the tumor grading ranged from G1 to G3 and all tumours showed clear surgical margins. None of the patients had liver cirrhosis or hepatitis B or C infection. The patients and tumour characteristics are shown in table 1. Informed consent was obtained from every patient and the study protocol conforms to the ethical guidelines of the 1975 Declaration of Helsinki.
aFrom this patient, only tumour tissue was used in the proteomic study.
bFrom this patient, only non-tumour tissue was used in the proteomic study.
Liver tumour and non-tumour tissue was collected and fixed in 4% buffered formalin, paraffin embedded and prepared for pathological examination and immunohistochemical evaluation. For the proteomics study the samples were immediately placed on ice, snap-frozen and stored at −80° C. The tissue samples were lysed by sonication (6×10 s pulses on ice) in sample buffer (30 mM TrisHCl; 2 M thiourea; 7 M urea; 4% CHAPS, pH 8.5). After centrifugation at 15.000 g for 5 min, the supernatant was collected and protein concentration was determined using the Bio-Rad Protein Assay (Bio-Rad, Hercules, Calif.).
Proteins were labelled using cyanine dyes in the ratio 50 μg protein to 400 pmol dyes (minimal labelling dyes, GE Healthcare). The labelling reaction was performed according to the manufacturer's instructions. Samples of HCC-tissue and healthy tissue were randomized by labelling with Cy3 dye or Cy5 dye to avoid any dye biases. The internal standard, which is a mixture of same amounts of all analyzed samples, was labelled with Cy2 dye.
The seven sample mixtures, including appropriate Cy3- and Cy5-labeled pairs and a Cy2-labeled internal standard, were generated and per 100 μl cell lysate, 10 μl DTT (1.08 g/ml; BioRad) and 10 μl Ampholine 2-4 (Amersham Biosciences) were added. IEF was performed using tube gels (20 cm×0.9 mm) containing carrier ampholytes (CA-IEF) and applying a voltage gradient in an IEF-chamber produced in house. After IEF, the ejected tube gels were incubated in equilibration buffer (125 mM Tris, 40% (w/v) glycerol, 3% (w/v) SDS, 65 mM DTT, pH 6.8) for 10 min. The second dimension (SDS-PAGE) was performed on (15.2% total acrylamide, 1.3% bisacrylamide) polyacrylamide gels using a Desaphor VA 300 system. IEF tube gels were placed onto the polyacrylamide gels (20 cm×30 cm×0.7 mm) and fixed using 1.0% (w/v) agarose containing 0.01% (w/v) bromphenol blue dye (Riedel de-Haen, Seelze, Germany). For identification of proteins by MS, 250 μg total protein was applied to IEF tube gels (20 cm×1.5 mm) and subsequently to preparative SDS-PAGE gels (20 cm×30 cm×1.5 mm). Silver post-staining was performed after gel scanning using a MS-compatible protocol as described elsewhere.
SDS-PAGE gels were scanned using a Typhoon 9400 scanner (Amersham Biosciences). Excitation and emission wavelengths were chosen specifically for each of the dyes according to recommendations of the manufacturer. Images were pre-processed using the ImageQuant™ software (GE Healthcare). Intra-gel spot detection, inter-gel matching and normalization of spot intensities were performed using the Differential In-gel Analysis (DIA) mode and Biological Variation Analysis (BVA) mode of DeCyder 2D™ software (GE Healthcare), respectively. Spot intensities were normalized to the internal standard. The Extended Data Analysis tool (EDA), implemented in the DeCyder 2D™ software package, was used for the statistical analysis of the 2D-DIGE experiments. Here, only spots appearing in at least 70% of all analyzed and matched spot maps were chosen for further analysis. Significantly regulated proteins were identified by Student's t-test including a false-discovery-rate correction. Protein spots differentially expressed (p≦0.05, Av. Ratio 1.5) between HCC and healthy samples were identified using MALDI-TOF-MS or nano-HPLC-ESI-MS/MS.
In-gel digestion of proteins was performed with trypsin following standard protocols and the obtained peptides were extracted from the gel matrix. MALDI-TOF-MS analyses were performed on an UltraFlex™ II instrument (Bruker Daltonics). For nano-HPLC-ESI-MS/MS experiments an Ultimate 3000 RSLCnano system online coupled to a Bruker Daltonics HCT plus ion trap instrument equipped with a nanoelectrospray ion source (Bruker Daltonics) was used. For protein identification database searches against the IPI human database were performed using Mascot. Further details regarding the experimental setup, search parameters or identification threshold were described earlier.
Prior to LC-MS analysis, 5 μg of each protein sample were loaded on a 4-20% SDS-PAGE gel (Anamed) and allowed to run into the gel for about 1 cm (15 min at 50 V). After Coomassie-staining, in-gel trypsin digestion was performed following standard procedures. The generated peptides were extracted by sonication (15 min, ice cooling) of the gel pieces in approximately 20 μl of 50% acetonitrile in 0.1% TFA, twice. Afterwards, acetonitrile was removed by vacuum centrifugation and peptide concentration of the resulting solution was determined by amino acid analysis performed on an ACQUITY-UPLC with an AccQ Tag Ultra-UPLC column (Waters, Eschborn, Germany) calibrated with Pierce Amino Acid Standard (Thermo Scientific, Bremen, Germany). Prior to LC-MS analysis, samples were diluted with 0.1% TFA to adjust a peptide concentration of 23.3 ng/μl.
Quantitative label-free analyses were performed on an Ultimate 3000 RSLCnano system (Dionex) online coupled to a LTQ Orbitrap Velos instrument (Thermo Scientific, Bremen, Germany). For each analysis 15 μl of sample were injected, corresponding to an amount of 350 ng tryptic digested proteins. The peptides were preconcentrated with 0.1% TFA on a trap column at a flow rate of 7 μl/min for 10 min. Subsequently, the peptides were transferred to the analytical column and separated using a xxx_min gradient from 5-40% solvent B at a flow rate of 300 nl/min (solvent A: 0.1% formic acid, solvent B: 0.1% FA 84% acetonitrile). The column oven temperature was set to 60° C. The mass spectrometer was operated in a data-dependent mode. Full scan MS spectra were acquired at a mass resolution of 30000 in the Orbitrap analyzer. Tandem mass spectra of the twenty most abundant peaks were acquired in the linear ion trap by peptide fragmentation using collision-induced dissociation.
Progenesis LC-MS™ software (version, Nonlinear) was used for the ion-intensity-based label-free quantification. After importing the .raw files, one sample was selected as a reference run to which the retention times of the precursor masses in all other samples were aligned to. In the following, a list of features was generated including the m/z values of all eluted peptides at given retention times. For further analysis, only features comprising charges of 2+ and 3+ were selected. Subsequently, the raw abundances of each feature were automatically normalized for correcting experimental variations. The detailed procedure of normalization is described elsewhere. In a following step, the samples were grouped corresponding to the selected experimental design, in this case a two-group comparison between “healthy” and “HCC”. Differences of peptide abundances between both groups were assigned to be significant if the following filter criteria were satisfied (ANOVA p-value ≦0.05 and q-value ≦0.05) in the following statistical analysis. Due to the fact, that multiple MS/MS spectra were acquired for the same features, only the fragment-ion spectra of the ten most intense precursors of a feature were selected for generation of peak list exported to a Mascot generic file.
The generated .mgf file was searched against the IPI human database using Mascot. The following search parameters were applied: variable modifications propionamide (C) and oxidation (M), tryptic digestion with up to one missed cleavage, #13C=1, precursor ion mass tolerance of 5 ppm and fragment ion mass tolerance of 0.4 Da. For further analysis, only peptides with mascot ion scores >37 (p≦0.01 identity threshold) were chosen. By importing the list of identified peptides in Progenesis LC-MS, the previously quantified features were matched to the corresponding peptides.
For the protein quantification, only non-conflicting peptides were chosen and the protein-grouping function implemented in Progenesis LC-MS was disabled. However, conflicting peptides matching to more than one protein hit were used for protein identification in order to make them more confident. At the protein level, the significance of expression changes was again tested by calculating an ANOVA p-value and a q-value. Proteins not satisfying the significance criteria (ANOVA p-value ≦0.05 and q-value ≦0.05) were filtered out. Finally, proteins showing less than 1.5-fold change of expression were discarded as well.
The Ingenuity Pathway Analysis software (Version 12402621, Ingenuity Systems, www.ingenuity.com) was used to assign the localizations of the regulated proteins detected in the label-free and 2D-DIGE experiment.
Protein concentration was determined by amino acid analysis. Equal amounts of 15 μg protein per sample were separated by SDS-PAGE on a 4%-20% polyacrylamide gel (Criterion TGX, Bio-Rad, Hercules, USA). Proteins were subsequently transferred onto nitrocellulose membrane (Trans-Blot Turbo, Bio-Rad, Hercules, USA) and membranes were blocked with StartingBlock blocking buffer (Thermo Scientific, Bremen, Germany) for one hour at room temperature. First antibodies anti-CLIC1 (Clone 2D4, Abnova, Heidelberg, Germany, dilution 1:1000), anti-MVP (Clone 1032, Acris, Herford, Germany, dilution 1:1000), anti-PPA1 (ab96099, abcam, Cambridge, UK, dilution 1:5000), anti-TRAP1 (clone EPR5381, abcam, Cambridge, UK, dilution 1:15000), anti-GSN (clone GS-2C4, Sigma-Aldrich, Munich, Germany, dilution 1:1000) and anti-BHMT (clone EPR6782, Epitomics, Burlingame, USA, dilution 1:20000) were diluted in StartingBlock and incubated with membranes over night at 4° C. Horseradish peroxidase-labeled secondary antibodies (Jackson ImmunoResearch, Newmarket, UK) were used for detection for one hour at room temperature. Bound antibodies were visualized by enhanced chemoluminescence and exposure to hyperfilm (GE Healthcare, Munich, Germany).
Paraffin embedded 4 um slides were dewaxed and pre-treated in EDTA buffer (pH 9) at 95° C. for 30 min. All Immunohistochemical stains of were performed with an automated staining device (Dako Autostainer, Glostrup, Denmark). Both, the source of the primary antibodies and the technical staining details of the automatically performed stainings are listed in table 2. All stains were developed using a Polymer Kit (ZytoChemPlus (HRP), POLHRS-100, Zytomed Systems). Replacement of the various primary antibodies by mouse or rabbit immunoglobulin served as negative controls.
Immunohistochemical staining was made of HCC and the corresponding non-tumour liver from the same patient. CLIC1: Immunohistochemistry against CLIC1 shows reactivity in sinusoidal lining cells but shows no signal in hepatocytes. In HCC strong reactivity is present in the cytoplasm and nuclei of tumour cells and also in non-tumour stroma cells. MVP: In the normal liver MVP is located in some nucleated blood cells but hepatocytes are negative in contrast to HCC with positive signals in the cytoplasm of tumour cells. TRAP1: Immunohistochemistry against TRAP1 shows strong reactivity in HCC cells, but is negative in the normal liver. PPA1: The antibody against PPA1 shows a faint reactivity in HCC cells, but is completely negative in the normal liver (results not shown).
In validation experiments using immunhistochemistry the protein eEF2 was able to discriminate non-HCC-tissue from HCC-tissue obtained from 78 patients. The difference was significant. In addition, eEF2 distinguishes between patients with favourable prognosis and patients with unfavourable prognosis. These results were significant (n=75).
In addition, the kinase of eEF2 was investigated using 7 tissues from HCC patients and 7 control tissues.
Lysates from liver tissue were prepared using lysis buffer (0.5% (v/v) NP-40, 150 mM NaCl, 1 mM CaCl2, 25 mM Na4P2O7, 50 mM β-glycerol phosphate disodium salt, 2 mM EDTA, 2 mM EGTA, 25 mM Tris, pH 8.0, 10% (v/v) glycerol, 10 μg ml−1 soybean trypsin inhibitor, 1 mM benzamidine, 1 mM PMSF, 50 mM NaF, 0.1 mM Na3VO4, 0.002% (w/v) NaN3). eEF2-Kinase was immunoprecipitated using eEF2K antibodies (#3692, Cell Signaling; 5 ml/1 mg lysate) bound to Protein A sepharose beads and with gentle rotation for 2 h at 4° C. Beads were washed three to four times in phosphorylation buffer containing 50 mM Hepes (pH7.4), 10 mM MgCl2 and 1 mM CaCl2. For the kinase assay His-tagged eEF2 protein (eEF2 fragment corresponding to amino acids 9-165; Abcam 91684; 0.5 μg/sample), Calmodulin (2.5 mg/sample; Sigma, C4874), 10 μM ATP and [γ-32P]ATP (0.5-0.75 μCi/sample; Fa. Hartmann) were added to immunoprecipiptated eEF2 kinase. Unspecific kinase activity was determined by addition of the eEF2 kinase inhibitor NH125 (3 μM, Calbiochem) to indicated samples. After 20 min at 30° C., the reaction was stopped by the addition of Laemmli buffer. Proteins were separated by SDS-PAGE and phosphorylation of His-eEF2 was detected and quantified by PhosphorImager analysis. Protein levels/amounts of immunoprecipitated eEF2K were controlled by Western blot analysis.
A significant difference of eEF2-kinase activity was determined between HCC and non-HCC tissue (
Serine/threonine kinase 3 and 4 were also identified as a marker for HCC by using tumour tissue from 11 patients with HCC and 11 tissues from controls. These proteins were validated by immunhistochemical approach using tumour tissue from 290 patients. Serine/threonine kinase 3 and 4 are suitable as a diagnostic and prognostic marker for HCC.
Number | Date | Country | Kind |
---|---|---|---|
12172829.9 | Jun 2012 | EP | regional |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2013/062955 | 6/20/2013 | WO | 00 |